BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy10434
(593 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
Length = 648
Score = 648 bits (1672), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 326/587 (55%), Positives = 409/587 (69%), Gaps = 21/587 (3%)
Query: 13 CTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQH 71
C ++F + + + A KKP+II+I+ADD+G+NDV FHGS++IPTPNIDALAYNG+ILN H
Sbjct: 5 CKIIFFGSVVASFAQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSH 64
Query: 72 YVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
Y QALCTPSRSA +TGKYPIH+GMQH VILE EPWGLPL E +LPQYLK GYATHAIGK
Sbjct: 65 YTQALCTPSRSAFLTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGK 124
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
WHLGFFR+ YTPT+RGFDSH+GYWQGLQDYY H+ ATF P G DMR NM VD G
Sbjct: 125 WHLGFFRKEYTPTYRGFDSHFGYWQGLQDYYKHTVHATFTPEHGYDMRRNMTVDWSAQGK 184
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
YST L+T+ A+ +I EHN PMF+YLAHLA H+GN +P QAPDEE+AKF I+DPERR
Sbjct: 185 YSTTLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERR 244
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
YA MVS LD+SVG+VIAALR MLENSI++FM+DNGA GIH+N GSN+PLRG K++
Sbjct: 245 IYAAMVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNS 304
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
W+G MR VAAIWSP +K+ Q+VS+ L HISDWLPT AAG+ + +DGV+ W +
Sbjct: 305 AWEGAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELPKMDGVDMWASI 364
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE-IDKY 429
++G + R+E+LHNID + N Y ALRV + KY+ G+ NG+SD WYG + + + Y
Sbjct: 365 SEGKDSPRTELLHNIDEIYN----YGALRVGNWKYLYGSTTNGKSDGWYGSSGRDPLYTY 420
Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC- 488
VL S+ G T L T QIK+KH D ++ +L E I LR A V+C
Sbjct: 421 DDSAVLASQTGSTLAGLTTYQQIKEKHQGD-----TNFTHKLLDSETIKTLRGAAEVKCP 475
Query: 489 --NYDN--KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPG 544
N++ + CN+ PCLFNI +DPCEQ NLA + ++ +E LA +K T +P
Sbjct: 476 RVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVLNMEMALARFKQTALPIR 535
Query: 545 NKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLSPTTITL 591
N P D ADPA+WNN WV W D D +K +++ LSP I L
Sbjct: 536 NVPRDPNADPAKWNNTWVNWQDYEDVKK----QKIYFNTLSPLAIGL 578
>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 646
Score = 639 bits (1649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 324/587 (55%), Positives = 407/587 (69%), Gaps = 23/587 (3%)
Query: 13 CTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQH 71
C ++F + + + A KKP+II+I+ADD+G+NDV FHGS++IPTPNIDALAYNG+ILN H
Sbjct: 5 CKIIFFGSVVASFAQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSH 64
Query: 72 YVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
Y QALCTPSRSA +TGKYPIH+GMQH VILE EPWGLPL E +LPQYLK GYATHAIGK
Sbjct: 65 YTQALCTPSRSAFLTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGK 124
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
WHLGFFR+ YTPT+RGFDSH+GYWQGLQDYY H+ F P G DMR NM VD G
Sbjct: 125 WHLGFFRKEYTPTYRGFDSHFGYWQGLQDYYKHTVH--FTPEHGYDMRRNMTVDWSAQGK 182
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
YST L+T+ A+ +I EHN PMF+YLAHLA H+GN +P QAPDEE+AKF I+DPERR
Sbjct: 183 YSTTLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERR 242
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
YA MVS LD+SVG+VIAALR MLENSI++FM+DNGA GIH+N GSN+PLRG K++
Sbjct: 243 IYAAMVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNS 302
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
W+G MR VAAIWSP +K+ Q+VS+ L HISDWLPT AAG+ + +DGV+ W +
Sbjct: 303 AWEGAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELPKMDGVDMWASI 362
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE-IDKY 429
++G + R+E+LHNID + N Y ALRV + KY+ G+ NG+SD WYG + + + Y
Sbjct: 363 SEGKDSPRTELLHNIDEIYN----YGALRVGNWKYLYGSTTNGKSDGWYGSSGRDPLYTY 418
Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC- 488
VL S+ G T L T QIK+KH D ++ +L E I LR A V+C
Sbjct: 419 DDSAVLASQTGSTLAGLTTYQQIKEKHQGD-----TNFTHKLLDSETIKTLRGAAEVKCP 473
Query: 489 --NYDN--KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPG 544
N++ + CN+ PCLFNI +DPCEQ NLA + ++ +E LA +K T +P
Sbjct: 474 RVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVLNMEMALARFKQTALPIR 533
Query: 545 NKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLSPTTITL 591
N P D ADPA+WNN WV W D D +K +++ LSP I L
Sbjct: 534 NVPRDPNADPAKWNNTWVNWQDYEDVKK----QKIYFNTLSPLAIGL 576
>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 710
Score = 616 bits (1589), Expect = e-173, Method: Compositional matrix adjust.
Identities = 308/559 (55%), Positives = 395/559 (70%), Gaps = 27/559 (4%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+I++I+ADD+GWNDVSFHGS QIPTPNIDALAYNG+ILN HYV ALCTPSRSAL+TGKY
Sbjct: 44 PNIVMIIADDMGWNDVSFHGSDQIPTPNIDALAYNGVILNSHYVSALCTPSRSALLTGKY 103
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PIH GMQH VILE EP GLPL EK+LPQYLKEAGYATHAIGKWH GF R YTPT+RGFD
Sbjct: 104 PIHTGMQHLVILEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGFD 163
Query: 150 SHYGYWQGLQDYYDHSCKAT--FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
SH+GYWQGLQDYY H ++ E + G DMR NM + T G YSTDL+T+ A+ +I E
Sbjct: 164 SHFGYWQGLQDYYTHEVGSSNPKEGFLGFDMRRNMSLARDTYGKYSTDLFTDEAVRLIEE 223
Query: 208 HN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H ++ PMFLYLAHLA H+GN EP QAPDEEVAKF + DPERR YA M+S+LD+SVG
Sbjct: 224 HRPEAGPMFLYLAHLAPHSGNDNEPLQAPDEEVAKFSYVEDPERRIYAAMMSKLDQSVGE 283
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V++ALR+ ML+NSIV+FMADNGA + GIH N+GSN+PLRG+K++ W+G +RG AA+WSP
Sbjct: 284 VVSALRRKNMLQNSIVVFMADNGAATQGIHYNRGSNYPLRGIKASAWEGAVRGAAAVWSP 343
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIE---INDTSLDGVNQWDVLTKGAKT-KRSEIL 382
+++ +++ +EL I+DWLPTL +A+G+ ++DGV+QW ++ A + R+EIL
Sbjct: 344 LIQRPKRIYNELMSIADWLPTLLSASGLRDVVRVSANIDGVDQWPAISGVAPSPPRNEIL 403
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK--------YSPKEV 434
NID + N Y+ALR + KYV GT NG+ EWYG+T ++ Y P+ V
Sbjct: 404 VNIDPIFN----YSALRRGEFKYVLGTVGNGE--EWYGETGRPENQGLEGASPTYDPETV 457
Query: 435 LYSKAGITFNALKTKLQIKQKHAA--DPKANSSD---ALRTILTDEKILELREFARVRCN 489
L SKAG L T Q+ + A D D A +LT +++L+LR A +RC
Sbjct: 458 LMSKAGTAIAGLLTAKQVSEIRAVRKDTYGRVKDDEMATSKLLTVDELLKLRSSASLRCT 517
Query: 490 Y-DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF 548
+++ C+ PCLFNI +DPCEQ NLA S+ +L LE+ L Y+ T +PP N P
Sbjct: 518 VPESERVACHPLQSPCLFNIKEDPCEQRNLAASRAMILATLEEALLKYRVTALPPSNVPN 577
Query: 549 DKRADPARWNNIWVPWYDE 567
D +A+PA WN+ WV W D+
Sbjct: 578 DPKANPAFWNHTWVNWQDD 596
>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 671
Score = 616 bits (1588), Expect = e-173, Method: Compositional matrix adjust.
Identities = 301/561 (53%), Positives = 394/561 (70%), Gaps = 23/561 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+TA +KPHI+IILADDLGWNDVSFHGS QIPTPNIDALAYNG+ILN HYV ALCTPSRSA
Sbjct: 22 STAKQKPHIVIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSA 81
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
LMTGK PIH+GMQH V+ EP GLPL+EKLLP+YL+E GY THA+GKWHLG+F++ YTP
Sbjct: 82 LMTGKNPIHLGMQHSVLFPAEPRGLPLSEKLLPEYLREVGYKTHAVGKWHLGYFKKEYTP 141
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKA---TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
T+RGFDSH+GYW GLQDYY H + + Y+G DMR N+ V T+G YSTDL+T
Sbjct: 142 TYRGFDSHFGYWNGLQDYYTHITQEPDPVYSEYKGFDMRRNLTVAWDTVGKYSTDLFTNE 201
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A+ +I EHN +PMFLYLAHLA H GN + F+APDEE+AKF I DPERR A +VS+L
Sbjct: 202 AVRLINEHNIDQPMFLYLAHLAPHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKL 261
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D+SVG+V+ ALR GMLENSIV+FM+DNGAPS G+ SN+GSN+PLRG+K++PW+GG RGV
Sbjct: 262 DQSVGDVMDALRNRGMLENSIVVFMSDNGAPSQGLLSNEGSNYPLRGIKNSPWEGGTRGV 321
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRS 379
AAIWSP +K++++VS+++ +SDWLPTL +AAG++ ++DG + W L + RS
Sbjct: 322 AAIWSPLIKKSKRVSNQMMFMSDWLPTLLSAAGVDRRQLGNIDGYDLWSALVLNKISSRS 381
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID----KYSPKEVL 435
E++ NID++ + YAA R D KY+ G G + W G + + + +Y P ++L
Sbjct: 382 EVMLNIDDLSD----YAAFRRGDFKYIIGRTETGSA--WLGASGDPSEGISPQYDPHKIL 435
Query: 436 YSKAGITFNAL---KTKLQIKQKHAADPKANSSDALRT-----ILTDEKILELREFARVR 487
YSK G+ + + K +++K++ AL+T IL+ E I ELR A+++
Sbjct: 436 YSKTGVAISGIITAKQAMELKEERRKRSVKIYDTALKTNFEEKILSVENISELRRKAQIK 495
Query: 488 CNYDNKG-AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK 546
CN K C PCLFNI DPCE NLAE + ++ LE L Y+ T++PP N
Sbjct: 496 CNVKEKDRIPCEPMTAPCLFNIEKDPCEMVNLAEKRPMIMAILERILMKYRLTVIPPSNL 555
Query: 547 PFDKRADPARWNNIWVPWYDE 567
D RA+P+ WNN W WY++
Sbjct: 556 DGDPRANPSLWNNTWTSWYED 576
>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 713
Score = 612 bits (1579), Expect = e-172, Method: Compositional matrix adjust.
Identities = 304/559 (54%), Positives = 389/559 (69%), Gaps = 24/559 (4%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT +KPHIIIILADDLGWNDVSFHGS QIPTPNIDALAYNG+ILN HYV ALCTPSRS
Sbjct: 65 STTTKQKPHIIIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRS 124
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
ALMTGK PIH+GMQH V+ EP GLPL+EKLLP+YL+E GY THA+GKWHLG+F++ YT
Sbjct: 125 ALMTGKNPIHLGMQHSVLFPTEPRGLPLSEKLLPEYLREIGYKTHAVGKWHLGYFKKEYT 184
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKA---TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
PT+RGFDSH+GYW GLQDYY H + F ++G DMR N+ V T+G YSTDL+T
Sbjct: 185 PTYRGFDSHFGYWNGLQDYYTHITQEPDPAFSEFKGFDMRRNLTVAWDTVGKYSTDLFTN 244
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
AI +I EH+ +PMFLYLAHLAVH GN + F+APDEE+AKF I DPERR A +VS+
Sbjct: 245 EAIRLINEHDTDRPMFLYLAHLAVHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSK 304
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
LD+SVG+V+ ALR GMLENSIVLFM+DNGAPS GI +NKGSN+PLRG+K++PW+GG RG
Sbjct: 305 LDQSVGDVMDALRNRGMLENSIVLFMSDNGAPSQGILNNKGSNYPLRGIKNSPWEGGTRG 364
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR 378
VAAIWSP +K++++VS+++ ISDWLPTL +AAG+ ++DG + W L + R
Sbjct: 365 VAAIWSPLIKKSKRVSNQMMFISDWLPTLLSAAGVNRKQLGNIDGFDLWSALVLNKVSSR 424
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID----KYSPKEV 434
SE++ NID++ + YA+ R + KY+ G G +W G + + + +Y P ++
Sbjct: 425 SEVVLNIDDLSD----YASFRRGNFKYIIGRTETGS--DWLGASGDPSEGISPQYDPYKI 478
Query: 435 LYSKAGITFNAL---KTKLQIKQKHAADPKANSSDALRT-----ILTDEKILELREFARV 486
LYSK G+ + + K +++K++ K + AL T IL+ E I ELR A++
Sbjct: 479 LYSKTGVAISGIITAKQAMELKERRKRSIKIYDT-ALETKFEEKILSVENISELRRKAQI 537
Query: 487 RCNYDNKG-AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
+CN K C PCLFNI DPCE NLAE + ++ LE L Y+ T +P N
Sbjct: 538 KCNVQEKDRIPCEPMTAPCLFNIEKDPCEMVNLAERRPVIMAILERILMKYRITAIPASN 597
Query: 546 KPFDKRADPARWNNIWVPW 564
D RADP WNN W W
Sbjct: 598 LDGDPRADPTLWNNTWTSW 616
>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 689
Score = 607 bits (1566), Expect = e-171, Method: Compositional matrix adjust.
Identities = 303/548 (55%), Positives = 382/548 (69%), Gaps = 15/548 (2%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KPHIIIILADDLGWNDVSFHGS QIPTPNIDA+AYNG+ILN HYV ALCTPSR+ALMTG
Sbjct: 57 QKPHIIIILADDLGWNDVSFHGSDQIPTPNIDAIAYNGIILNSHYVAALCTPSRTALMTG 116
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
K PIH+GMQH V+L EP GLPL+EKLLP+YL+E GY THA+GKWHLG+FR YTPTFRG
Sbjct: 117 KNPIHLGMQHSVLLPSEPRGLPLSEKLLPEYLREVGYRTHAVGKWHLGYFRREYTPTFRG 176
Query: 148 FDSHYGYWQGLQDYYDHSCKA---TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
FD+H+GYW GLQDYY H + F + GLDMR N+ T G YSTDL+TE A+ +
Sbjct: 177 FDTHFGYWNGLQDYYTHITQEPDPQFGEFMGLDMRRNLTAAWDTQGKYSTDLFTEEAVRL 236
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I EH+K PMFLYLAHLA H GN +A DE++A+F I DPERR A +VS++D+SV
Sbjct: 237 INEHDKDDPMFLYLAHLAPHKGNPNRLLRASDEDIARFSYILDPERRIQAAVVSKVDQSV 296
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G V+ ALRK GMLENSIVLFM+DNGAP+ G SN+GSN+PLRG+K +PW+GG RGVAAIW
Sbjct: 297 GEVMDALRKRGMLENSIVLFMSDNGAPTQGSLSNQGSNYPLRGIKDSPWEGGTRGVAAIW 356
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILH 383
SP +K++Q+VS+++ I+DWLPTL +AAG++ ++DG + W L + RS+++
Sbjct: 357 SPLIKESQRVSNQMMFIADWLPTLLSAAGVDSKQLGNIDGFDLWPALRSDKISPRSDVVL 416
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID----KYSPKEVLYSKA 439
NID++ YA++R D KYV G G++ W G T + KY P +VLYSKA
Sbjct: 417 NIDDISK----YASIRRGDFKYVIGETETGRA--WVGATGAPSEGIPPKYDPDKVLYSKA 470
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCN 498
G+ + + T Q K+ ++ D IL+ EKI +LR ARV+CN + C+
Sbjct: 471 GVAISGVLTAKQAKELKKKKNTSSKIDFEEKILSAEKIFQLRNDARVQCNVKEENRIPCD 530
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
T PCLFNI DPCE NLA+ + +L +E L Y+ T VPP N D RA+PA WN
Sbjct: 531 PTAAPCLFNIEKDPCEMVNLADKRPLILAIMERVLMNYRLTTVPPSNVNGDPRANPALWN 590
Query: 559 NIWVPWYD 566
N W W D
Sbjct: 591 NTWTTWDD 598
>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 570
Score = 553 bits (1425), Expect = e-155, Method: Compositional matrix adjust.
Identities = 272/570 (47%), Positives = 387/570 (67%), Gaps = 47/570 (8%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
+T ++ F + ++P+IIIILADDLGWNDVSFHGS+QI TPNIDALAYNG+ILN
Sbjct: 31 ITLSIFFQNV---VDGIERPNIIIILADDLGWNDVSFHGSNQIQTPNIDALAYNGIILNS 87
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
HYV ALCTPSR++LMTGKYP +GMQH VIL EPWGLPL E L+P+Y + GYATHA+G
Sbjct: 88 HYVPALCTPSRASLMTGKYPTSLGMQHLVILSPEPWGLPLNETLMPEYFNKNGYATHAVG 147
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
KWHLGFF++ YTP +RGFDSH+G+W G QDYYDH+ + + +G DMR N +VD G
Sbjct: 148 KWHLGFFKKEYTPIYRGFDSHFGHWNGFQDYYDHTTMS--DSLKGYDMRRNFEVDYSYQG 205
Query: 191 IYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
+Y+TD++T+ AI +I HN K P+FLYL+HLA H+GN PFQAP++E++K I+DP
Sbjct: 206 MYTTDVFTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPEDEISKHECINDPG 265
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
R+ YA MV++LDESVG V++AL K+ ML NSI++FM+DNGA ++G+HSN+GSN+PLRG+K
Sbjct: 266 RKIYAAMVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYGLHSNRGSNYPLRGLK 325
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL----DGVN 365
+PW+GG+RG AAIWSP+L +T++VS +L H+SDWLPTL AAG+ + T L DG++
Sbjct: 326 ESPWEGGVRGTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAAGLNYSSTQLINKIDGID 385
Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDN 424
W+VL+ + R E+ +N D ++N Y++L +D KYV GT G++D W+ + + N
Sbjct: 386 MWNVLSNDLPSPRKEVFNNYDEIEN----YSSLMIDSWKYVEGTAQEGKADYWFEEPSRN 441
Query: 425 EIDKY--SPKEVLYSKAGITF----NALKTKLQIKQKHAADPKANSSDALRTILTDEKIL 478
+Y S +++ + T + L I + + D K + + +LT + +L
Sbjct: 442 NCSEYRVSNEDIFRLRRDSTIICDNPTFSSSLSITRNNHTDVK----NKTKYVLTCDPLL 497
Query: 479 ELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKS 538
+ + CLFN+ DDPCE+ NLA+ D++K+++++L K
Sbjct: 498 K----------------------RFCLFNLNDDPCERLNLADVFPDVVKRIKNRLLELKK 535
Query: 539 TMVPPGNKPFDKRADPARWNNIWVPWYDEL 568
++V P NKP D ++P +N WV W D++
Sbjct: 536 SVVKPLNKPEDPYSNPMFYNGTWVNWKDDI 565
>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
Length = 583
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 279/549 (50%), Positives = 367/549 (66%), Gaps = 36/549 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
KKPHIIIILADDLGWNDVSFHGS+QIPTPNIDALAY G++L HYV ALCTPSR+AL+TG
Sbjct: 39 KKPHIIIILADDLGWNDVSFHGSNQIPTPNIDALAYYGVLLKNHYVAALCTPSRAALLTG 98
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
KYPIH+GMQH I EP GLPL EKLLPQYLK+ Y TH +GKWHLG+++ YTP +RG
Sbjct: 99 KYPIHLGMQHEAIFPSEPRGLPLEEKLLPQYLKDMNYVTHIVGKWHLGYYKMEYTPLYRG 158
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ---GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
FD+H+GYW GLQDYY H T EPY G+DMR N V T+G YS DLYT+ A+ +
Sbjct: 159 FDTHFGYWNGLQDYYSHK---TAEPYTLNIGMDMRRNFTVAWDTMGKYSVDLYTDEAVRL 215
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I HN PMFLYLA +A HAGN + QA EE+ KF I DP+R+ YA +VS+LDESV
Sbjct: 216 INTHNTDNPMFLYLAQIAPHAGNANQLPQALPEEIEKFSYIIDPKRKRYAAVVSKLDESV 275
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G V+ AL+ ML NSI+LFM+DNGAP+ G SN GSN+P RG+K T W+GG+RG AAIW
Sbjct: 276 GKVVEALKNRNMLSNSIILFMSDNGAPTSGFLSNGGSNYPFRGIKKTLWEGGVRGAAAIW 335
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILH 383
SP +K+ ++VS L H++DWLPTL +AAG + ++DG++ W+ + G + R E+L
Sbjct: 336 SPLIKKRERVSYHLMHVTDWLPTLLSAAGGNKEEIGTIDGLDMWETIVYGDCSPRKEVLI 395
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK----YSPKEVLYSKA 439
NID++ + + A+R+ KYV G + ++++W GD+ + YSP+ VL SK
Sbjct: 396 NIDDICDSE----AIRIGKYKYVRG--ESKKNNDWVGDSGRNSSEQRPPYSPETVLNSKV 449
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY-DNKGAHCN 498
GI +L P N+ LT K+LELR+ A V CN + + C+
Sbjct: 450 GIAIASL-------------PGNNN-----VALTSAKMLELRQQAEVHCNVTEEEKVQCD 491
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
+ + PCLF++ DPCE N+ + + + K+L++ L ++STM+ P N+ D RA P WN
Sbjct: 492 TQLAPCLFDLERDPCEMINIIKQEPLVAKELQEVLLQHRSTMLLPNNRIIDVRASPILWN 551
Query: 559 NIWVPWYDE 567
WV W D+
Sbjct: 552 CTWVCWMDD 560
>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
Length = 544
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 271/561 (48%), Positives = 363/561 (64%), Gaps = 30/561 (5%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
A+ L + TA ++PHI+ ILADDLGWNDV FHGS QIPTPNIDALAY+GL+L+
Sbjct: 6 AVLWAFLIVPFLRHVTAAERPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLD 65
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
++YV +CTPSRSALMTGKYPIH GMQHGV+ EP GLPL EKLLP+YLKE GY TH +
Sbjct: 66 RYYVSPICTPSRSALMTGKYPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLKELGYRTHIV 125
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
GKWHLGF+ + YTPT+RGFDSH G+W G QDY+DH+ A PY GLDMR M+
Sbjct: 126 GKWHLGFYTKQYTPTYRGFDSHIGFWSGHQDYFDHT--AVESPYWGLDMRRGMEAAWDLH 183
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
G YSTD++T A+ +I HN SKP+FLYLAH AVH+GN Y+P APD +VAKF +I D
Sbjct: 184 GQYSTDVFTSEAVKLINNHNDSKPLFLYLAHAAVHSGNPYDPLPAPDVDVAKFTNIFDYN 243
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
RR +A M+S+LDESVG V+ AL++ GML+NSI++F DNG P G + N SN PLRG K
Sbjct: 244 RRRFAAMLSKLDESVGEVVEALQRKGMLKNSIIVFSTDNGGPPAGFNLNAASNWPLRGTK 303
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDV 369
+T W+GG+RG IWSP L + +VS +LFHISDWLPTL AAG + ++ S+DG++ W
Sbjct: 304 NTLWEGGVRGAGLIWSPRLIRPGRVSRQLFHISDWLPTLLTAAGGDPSNLSIDGLDLWHA 363
Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKY 429
L++ ++ R +LHNID++ + + + + KY+ G+ GQ D WYG
Sbjct: 364 LSEDTESPRQSVLHNIDDIFG----VSGITIGEWKYIQGSTYEGQWDGWYGP-------- 411
Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN 489
S +E +Y G+ P + ++ L+ E + LRE A ++C
Sbjct: 412 SGREWVYDIGGVI---------------GSPTGRAVASVGYSLSSENVRRLREQAMIKCR 456
Query: 490 YDNKG-AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF 548
N C + PCLF++ DPCE NNL ++ + + + L + +T + PGN P+
Sbjct: 457 PRNDTLPLCKPLIAPCLFHVLKDPCEDNNLINEYPEIARAMREDLKKWNATALLPGNLPW 516
Query: 549 DKRADPARWNNIWVPWYDELD 569
DKRA+P W++ W+ + D LD
Sbjct: 517 DKRANPELWDHTWINFGDHLD 537
>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 545
Score = 541 bits (1395), Expect = e-151, Method: Compositional matrix adjust.
Identities = 274/548 (50%), Positives = 360/548 (65%), Gaps = 30/548 (5%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+ A +PHI+ ILADDLGWNDV FHGS QIPTPNIDALAY+GL+L+++YV +CTPSRS
Sbjct: 20 HVAAAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRS 79
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
ALMTGK+PIH GMQHGV+ EP GLPL EKLLPQYL+E GY+TH +GKWHLGF+ + YT
Sbjct: 80 ALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYT 139
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P +RGFDSH G+W G DY+DHS A PY GLDMR + G YSTD++T+ A+
Sbjct: 140 PMYRGFDSHIGFWSGHHDYFDHS--AVESPYWGLDMRRGLNSAWDLHGQYSTDIFTKEAV 197
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+I +HN S+PMFLYL+H AVH+GN+Y P APD++VAKF +I + ERR +AGM+S+LDE
Sbjct: 198 KLINDHNASRPMFLYLSHAAVHSGNSYNPLPAPDQDVAKFTNIFNYERRRFAGMLSKLDE 257
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SVG V+ ALRK GML+NS+++F DNG P G + N SN PLRG K+T W+GG+RG
Sbjct: 258 SVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFNLNAASNWPLRGTKNTLWEGGVRGAGL 317
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
+WSP L + +VS +LFHISDWLPTL AAG ++ S+DG++ W L++ ++ R +L
Sbjct: 318 LWSPRLVRPGRVSRQLFHISDWLPTLITAAGGNPSNLSIDGMDLWYALSEDTESPRKMVL 377
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
HNID++ A + D KY+ G+ NGQ D WYG + E Y V+ S AG
Sbjct: 378 HNIDDIFG----VAGITYGDWKYIQGSTYNGQWDGWYGPSGREW-VYDIGGVIGSTAGRA 432
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCNSTV 501
++ L S+DA+R LRE A ++C N C
Sbjct: 433 VASVGLAL-------------SADAIR---------RLREDAMIKCPPKNDSLPLCKPLE 470
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
PCLFN+ DPCE NNL L+ +L+DKL +T + PGN P+D +A+P W++ W
Sbjct: 471 APCLFNVQQDPCEDNNLINKYPSLVTELQDKLRKLNATAILPGNLPWDNKANPDYWDHTW 530
Query: 562 VPWYDELD 569
+ D L+
Sbjct: 531 TNFGDYLN 538
>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
Length = 545
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 273/548 (49%), Positives = 357/548 (65%), Gaps = 30/548 (5%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+ A +PHI+ ILADDLGWNDV FHGS QIPTPNIDALAY+GL+L+++YV +CTPSRS
Sbjct: 20 HVAAAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRS 79
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
ALMTGK+PIH GMQHGV+ EP GLPL EKLLPQYL+E GY+TH +GKWHLGF+ + YT
Sbjct: 80 ALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYT 139
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P +RGFDSH G+W G DY+DHS A PY GLDMR + G YSTD++T+ A+
Sbjct: 140 PMYRGFDSHIGFWSGHHDYFDHS--AVESPYWGLDMRRGLNSAWDLHGQYSTDIFTKEAV 197
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+I +HN S+PMFLYL H AVH+GN+Y P PD++VAKF +I + ERR +AGM+S+LDE
Sbjct: 198 KLINDHNASRPMFLYLPHAAVHSGNSYNPLPVPDQDVAKFTNIFNYERRRFAGMLSKLDE 257
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SVG V+ ALRK GML+NS+++F DNG P G + N SN PLRG K+T W+GG+RG
Sbjct: 258 SVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFNLNAASNWPLRGTKNTLWEGGVRGAGL 317
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
+WSP L + +VS +LFHISDWLPTL AAG ++ S+DG++ W L++ ++ R +L
Sbjct: 318 LWSPRLVRPGRVSRQLFHISDWLPTLITAAGGNPSNLSIDGMDLWYALSEDTESPRKMVL 377
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
HNID++ A + D KY+ G+ NGQ D WYG + E Y V+ S AG
Sbjct: 378 HNIDDIFG----VAGITYGDWKYIQGSTYNGQWDGWYGPSGREW-VYDIGGVIGSTAGRA 432
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCNSTV 501
++ L S+DA+R LRE A ++C N C
Sbjct: 433 VASVGLTL-------------SADAIR---------RLREDAMIKCPPKNDSLPLCKPLE 470
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
PCLFN+ DPCE NNL L+ +L D+L +T + PGN P+D RA+P W++ W
Sbjct: 471 APCLFNVQQDPCEDNNLINKYPSLVTELRDRLRKLNATAILPGNLPWDNRANPDYWDHTW 530
Query: 562 VPWYDELD 569
+ D L+
Sbjct: 531 TNFGDYLN 538
>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
Length = 573
Score = 538 bits (1385), Expect = e-150, Method: Compositional matrix adjust.
Identities = 272/571 (47%), Positives = 365/571 (63%), Gaps = 34/571 (5%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
+ +T LL+ +T ++P+I+II+ADDLGWNDVSFHGS+QIPTPNIDALAY+G+IL
Sbjct: 7 YLVTLWLLWLGGGASTAHQERPNIVIIVADDLGWNDVSFHGSNQIPTPNIDALAYDGIIL 66
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
N+HYV LCTPSR++LMTGK+P++IGMQ VI+ EPWGL L +KL+PQY +EAGY TH
Sbjct: 67 NRHYVPPLCTPSRASLMTGKHPMNIGMQDHVIISDEPWGLGLDQKLMPQYFREAGYRTHL 126
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
+GKWHLGFFR YTPT+RGFDSH+GY DY+DHS + +GLDMR N V+
Sbjct: 127 VGKWHLGFFRRAYTPTYRGFDSHFGYLGPYIDYWDHSLQMNETSARGLDMRRNTAVNYDA 186
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
G Y+TDL+ + A+ +I HN+SKP+FL L HLA H GN +P QAP +E+AKF I DP
Sbjct: 187 NGTYATDLFNDEAVRLIDSHNRSKPLFLVLTHLAPHTGNEDDPLQAPADEIAKFDYIQDP 246
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
+RRT A MVSR+D VG + +L + ML N+I+LF ADNGAP+ GIH+N GSN+PLRG
Sbjct: 247 KRRTLAAMVSRIDTGVGRIYRSLERRRMLNNTIILFYADNGAPTLGIHANSGSNYPLRGQ 306
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI--NDTSLDGVNQ 366
K +PW+G +RG A IWS L + VS++ H+SDWLPTL AAGI I N++ +DG NQ
Sbjct: 307 KESPWEGAVRGAALIWSRLLPRKGIVSNQWLHVSDWLPTLGHAAGIRISPNNSPIDGQNQ 366
Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
W LT + + R+ +++N N + Y++ KYV GT G D W G +
Sbjct: 367 WSTLTSSSNSGRTVVMNNAHN----EFTYSSYIKRGWKYVNGTSFKGAYDRWLGQLKDG- 421
Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
++ S +E Y K I + + +Q LT + + LR A V
Sbjct: 422 EQISHEE--YYKRLIASQTIGSSMQ--------------------LTRDDVKRLRSKAMV 459
Query: 487 RCNYDNKGAH-----CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
C+ + C VKPCLFN+ +DPCE+ NLA+ +LLK L+ + +K +
Sbjct: 460 ECSSAQGTVYVQAIPCEPLVKPCLFNVVEDPCERVNLADQYPELLKDLQSDVERFKRDAL 519
Query: 542 PPGNKPFDKRADPARWNNIWVPWYDELDKQK 572
PP N+P D R+DPA N W W DE+D+Q+
Sbjct: 520 PPRNQPSDSRSDPALHNYTWTWWQDEMDEQQ 550
>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
Length = 546
Score = 533 bits (1372), Expect = e-148, Method: Compositional matrix adjust.
Identities = 268/558 (48%), Positives = 360/558 (64%), Gaps = 30/558 (5%)
Query: 13 CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
C L + A +PHI+ ILADDLGWNDV FHG SQIPTPNIDALAY GL+L+++Y
Sbjct: 11 CICLTVFVVDHVVASARPHIVFILADDLGWNDVGFHGLSQIPTPNIDALAYTGLLLDRYY 70
Query: 73 VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
V +CTPSRSALMTGK+PIH GMQHGV+ EP GLPL EKLLP+YL+ GY+TH +GKW
Sbjct: 71 VSPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLRNLGYSTHMVGKW 130
Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
HLGF+ + YTPT+RGFDSH G+W G DY+DH+ A EPY GLDMR ++ G Y
Sbjct: 131 HLGFYTKEYTPTYRGFDSHLGFWSGHHDYFDHT--AVEEPYWGLDMRRGLEPAWDLHGQY 188
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
STD++T+ A+ +I HN S+PMFLYL+H AVH+GN Y P A D +VAKF I D RR
Sbjct: 189 STDVFTKEAVRLIDNHNTSRPMFLYLSHAAVHSGNPYNPLPAHDHDVAKFPKILDYNRRR 248
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
+AGM+S+LDESVG V+ ALRK MLENS+++F DNG P G + N SN PLRG K+T
Sbjct: 249 FAGMLSKLDESVGLVVEALRKRKMLENSVIVFSTDNGGPPAGFNLNAASNFPLRGTKNTL 308
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
W+GG+RG +WSP L ++S +L HISDWLPTL A G + ++ ++DGV+ W L +
Sbjct: 309 WEGGVRGTGLVWSPKLINPGRISRQLIHISDWLPTLLTAVGGDPSNLAIDGVDMWRALNE 368
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPK 432
+ R +LHNID++ A + D K++ G+ NGQ D+WYG + + Y P
Sbjct: 369 DTISPRKMVLHNIDDIYG----IAGITYGDWKFIQGSTYNGQWDDWYGPSGRDW-AYDPY 423
Query: 433 EVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDN 492
V+ S AG +A +S L LT I +L++ A ++C+ N
Sbjct: 424 AVMESAAG--------------------RAAASVGLS--LTAGNIEKLQQGAAIKCSTRN 461
Query: 493 KG-AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
+ C PCLFN+ +DPCE+NNL E ++ L++KL++ ++ + PGN P+DKR
Sbjct: 462 ENLPLCKPLEAPCLFNVREDPCEKNNLIEKYPSIVSDLKEKLSLLNASAILPGNLPWDKR 521
Query: 552 ADPARWNNIWVPWYDELD 569
+P W++ W + D L+
Sbjct: 522 GNPDYWDHTWTNFGDYLN 539
>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
Length = 543
Score = 526 bits (1355), Expect = e-146, Method: Compositional matrix adjust.
Identities = 261/565 (46%), Positives = 353/565 (62%), Gaps = 30/565 (5%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
L+ F+ A ++P++I ILADDLGWNDV FHGS QIPTPNIDALAY+GLIL +YV
Sbjct: 8 LILFPLFVTIRASERPNVIFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLILQNYYVT 67
Query: 75 ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
+CTPSRSALMTGKYPIH GMQH V+ EP GLPLTEK+LP+YL+E GY +GKWHL
Sbjct: 68 PICTPSRSALMTGKYPIHTGMQHTVLFGAEPRGLPLTEKILPEYLRELGYTNRLVGKWHL 127
Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
G + + YTP +RGFDSH GYW G QDYYDH+ A P G DMR NM + G YST
Sbjct: 128 GSYTKEYTPLYRGFDSHLGYWTGHQDYYDHT--AVENPGWGFDMRRNMDLAYDLHGQYST 185
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
D++T+ A+ +I HN + P+FLYLAH+AVH+ N Y P APDE V KF +I +R+ +A
Sbjct: 186 DVFTQEAVKIIENHNTTNPLFLYLAHVAVHSANPYNPLPAPDETVEKFSNIPSYKRQRFA 245
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
M+S+LD+SVG V+ +L K ML+NSI++F DNG + G + N SN PLRG+K+T W+
Sbjct: 246 AMLSKLDDSVGAVVESLSKRKMLKNSIIIFSTDNGGAAAGFNLNAASNFPLRGVKNTVWE 305
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA 374
GG+RG +WSP +KQ Q+V+ + HISDWLPTL +A G + ++DG++ W+ L++
Sbjct: 306 GGVRGAGLLWSPLIKQPQRVAQQFMHISDWLPTLLSAVGANSSLKNIDGLDLWESLSQNK 365
Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
+ RSEILHNID++ AAL V K V GT NG D WYG + + Y V
Sbjct: 366 NSPRSEILHNIDDIYG----NAALTVGAWKLVKGTTYNGAWDGWYGPSGRDY-TYDLSLV 420
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
+ S G+ ++ L ++I +LR +R+ C D +
Sbjct: 421 VNSPTGVALKSINMS----------------------LVPDEITKLRNASRIDCATD-QP 457
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
C + CLFN+ DPCE NLA+ ++ LE+ L Y +T +PPGN P D R +P
Sbjct: 458 IPCKPLEETCLFNVIQDPCELRNLAKENPKIVMNLEETLNKYNNTALPPGNLPLDPRGNP 517
Query: 555 ARWNNIWVPWYDELDKQKAIETMRL 579
WNN W + D + ++ ++ + +
Sbjct: 518 KYWNNTWTNFGDYIKEKSSLYRVEI 542
>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
Length = 585
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 270/580 (46%), Positives = 359/580 (61%), Gaps = 48/580 (8%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+I++I+ADDLGWNDVSFHGS QIPTPNIDALAY+G+ILN+HY LCTPSR++LMTGK
Sbjct: 30 RPNIVVIIADDLGWNDVSFHGSLQIPTPNIDALAYSGIILNRHYAPPLCTPSRASLMTGK 89
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+PI+IGMQH VI EPWGL L +KL+P+Y +EAGY T +GKWHLGFFR+ YTPT RGF
Sbjct: 90 HPINIGMQHHVIEVDEPWGLGLDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTMRGF 149
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
DSHYGY DY+DHS + + +GLDMR N+QVD G Y+TDL+T A+ +I +H
Sbjct: 150 DSHYGYIGPYIDYWDHSLQMSNTSTRGLDMRRNLQVDYSARGTYATDLFTREAVRLIHDH 209
Query: 209 NKSK--PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
N++ P+FL + HLA H GN +P QAP+E+V F I DP+RR A M+S+LD+ VG
Sbjct: 210 NQTSANPLFLVVTHLAPHTGNEDDPMQAPEEDVELFSFIKDPKRRVLAAMISKLDDGVGQ 269
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
++ AL+K GML N++VLF ADNGAP+ G HSN GSN PLRG K +PW+G +R AAIWSP
Sbjct: 270 IVQALKKSGMLNNTVVLFYADNGAPTIGKHSNGGSNFPLRGQKYSPWEGAVRTAAAIWSP 329
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLDGVNQWDVLTKGAKTKRSEILHN 384
L T +VS++ H+SDWLPTL AAGIE N + +DG +QW+ L T R+ ++HN
Sbjct: 330 LLNNTGRVSNQWIHVSDWLPTLARAAGIEANFSGSEIDGQDQWEALNNPNLTGRTTVMHN 389
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
ID ++Y++ + K V GT G D+W G D E + + + + +
Sbjct: 390 IDQF----QHYSSYTKNGWKLVNGTSWEGAFDDWMGSLDEEDELSEEEYSVRLISSVVGR 445
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGA-HCNSTVKP 503
K L + + +R A V C Y G C P
Sbjct: 446 LTKVDL------------------------DDVARIRRDATVHCPYIPGGTIDCEPQKSP 481
Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
CLFNI +DPCE NN+A + DLLKQL + Y+ + P N+P D +DP +N W
Sbjct: 482 CLFNILEDPCETNNIANQRPDLLKQLGQDVERYRLSATEPRNQPADPYSDPRYYNRTWTW 541
Query: 564 WYDELDKQKAIE---------------TMRLHKPLLSPTT 588
W DELD+ + I+ + + KP+L P+T
Sbjct: 542 WQDELDEARDIKFPLAIFIVSITFIAIMLIVVKPILCPST 581
>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 543
Score = 523 bits (1348), Expect = e-146, Method: Compositional matrix adjust.
Identities = 264/551 (47%), Positives = 360/551 (65%), Gaps = 30/551 (5%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
F+ +PHI+ ILADDLGWNDV FHGS QIPTPNIDALAY+GL+L+++YV +CTP
Sbjct: 15 VFVVGHVAARPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTP 74
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
SRSALMTGK+PIH GMQHGV+ EP GLPL EKLLP+Y ++ GY+TH +GKWHLGF+ +
Sbjct: 75 SRSALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPEYFRDLGYSTHIVGKWHLGFYTK 134
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
YTP +RGFDSH G+W G DY+DHS A EPY GLDMR ++ G YSTD++T+
Sbjct: 135 EYTPMYRGFDSHIGFWSGHHDYFDHS--AVEEPYWGLDMRRGLEPAWDLHGQYSTDVFTK 192
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A+ +I HN S+PMFLYLAH AVH+GN Y P A D++VAKF +I + RR +AG++S+
Sbjct: 193 EAVKLIDNHNTSRPMFLYLAHAAVHSGNPYNPLPAHDQDVAKFTNIFNYNRRRFAGILSK 252
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
LDESVG V+ ALR+ MLENS+++F DNG G + N SN PLRG K+T W+GG+RG
Sbjct: 253 LDESVGLVVEALRRRKMLENSVIVFSTDNGGAPAGFNLNAASNWPLRGTKNTLWEGGVRG 312
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
V +WSP L + +V +L HISDWLPTL AAG + ++ ++DG++ W L++ ++ R
Sbjct: 313 VGLLWSPKLIKPGRVGRQLIHISDWLPTLLTAAGGDPSNLTIDGMDIWHALSEDTESPRK 372
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
+LHNID++ A + D K++ G+ NGQ D WYG S +E +Y
Sbjct: 373 MVLHNIDDIYG----VAGITYGDWKFIQGSTYNGQWDGWYGP--------SGREWVYDVG 420
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCN 498
G+ +A +A +S L L+ + I +LRE A ++C N C
Sbjct: 421 GVIGSAAG-------------RAVASVGLS--LSADNIRKLREDAMIKCPPRNDNLPLCK 465
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
PCLFN+ DPCE NNL E ++ L+++L + ++ V PGN P+DKRA+P W+
Sbjct: 466 PLEAPCLFNVRQDPCEDNNLIEKYPSVVSDLKERLRVLNASAVLPGNLPWDKRANPDYWD 525
Query: 559 NIWVPWYDELD 569
+ W+ + D L+
Sbjct: 526 HTWINFGDYLN 536
>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
Length = 881
Score = 519 bits (1336), Expect = e-144, Method: Compositional matrix adjust.
Identities = 262/570 (45%), Positives = 358/570 (62%), Gaps = 34/570 (5%)
Query: 1 MTWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDA 60
M R + L+ F + PHI+ I+ADD+G NDV FHGS+QIPTPNIDA
Sbjct: 5 MMGKRITIVTMVALLISGPTFAEPQTTESPHIVFIMADDMGLNDVGFHGSNQIPTPNIDA 64
Query: 61 LAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK 120
LAY+G+ILN+HY +CTPSR+ALMTG++P+++GMQH VI EPWGL L EKLLPQY +
Sbjct: 65 LAYDGIILNRHYSAPMCTPSRAALMTGRHPMNVGMQHYVIDSDEPWGLGLQEKLLPQYFR 124
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
EAGY TH IGKWHLGF+ E Y PT RGFD+H GY DY+ + K ++G D+R
Sbjct: 125 EAGYRTHLIGKWHLGFYAEPYLPTNRGFDTHIGYLGPYIDYWSYISKMDSATFEGYDLRQ 184
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
N+ V+ K G Y+TD +TEAA+ +I HN++ + M L L HLA H GN P QAP+E +
Sbjct: 185 NLAVNYKPNGTYATDYFTEAAVEIIRSHNRTGERMLLVLNHLAPHTGNDDAPLQAPEETI 244
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
KF I D +RRTYA M+S+LD+SVG V AL++HGMLENSI++F++DNG + G+HSN
Sbjct: 245 EKFAYIRDTDRRTYAAMMSKLDDSVGQVYRALQQHGMLENSIIVFISDNGGVTRGMHSNS 304
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI--- 356
GSN+PLRG K +PW+G +R A IWSP LK Q+VS++ FHISDWLPTL +AAGI
Sbjct: 305 GSNYPLRGQKHSPWEGAVRTAALIWSPLLKDRQRVSNQWFHISDWLPTLASAAGIPAYRG 364
Query: 357 -NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQS 415
N + +DG++QW+ L G R +++N+D + Y + + LKYV GT +G +
Sbjct: 365 DNLSDIDGIDQWEALAYGTGNPRQRVMNNLDEIFG----YTSYVDNGLKYVNGTTLDGVN 420
Query: 416 DEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDE 475
D WYG D+ + L S A L+T + +S R DE
Sbjct: 421 DGWYGAIDD-------TDTLPSNA----EYLRTVM-------------ASHVGRLTKPDE 456
Query: 476 KILE-LREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLA 534
K++ LRE AR+ C D CN +PCLF+I +DPCE ++L E D ++Q+++++
Sbjct: 457 KVVSYLREHARIDCGNDIGNRDCNPLRRPCLFDIINDPCEMHDLFERYPDKVRQIQERVE 516
Query: 535 IYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
Y+ P N+P D ADP+++ +W W
Sbjct: 517 KYRLRAAKPRNRPHDPAADPSKFGGVWNWW 546
>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
Length = 519
Score = 519 bits (1336), Expect = e-144, Method: Compositional matrix adjust.
Identities = 262/523 (50%), Positives = 349/523 (66%), Gaps = 31/523 (5%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GWNDV FHGS QIPTPNIDALAY+GL+L+++YV ++CTPSRSALMTGKYPIH GMQH ++
Sbjct: 10 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTSICTPSRSALMTGKYPIHTGMQHSIL 69
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
EP GLPL EK+LP+YL+E GY+TH +GKWHLGF++ YTPT+RGFD+H GYW G D
Sbjct: 70 KGAEPRGLPLHEKILPEYLRELGYSTHIVGKWHLGFYKREYTPTYRGFDTHIGYWTGHHD 129
Query: 161 YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
YYDH+ A PY GLDMR M+ G YSTD++T+ A+ +I HN S+PMFLYLAH
Sbjct: 130 YYDHT--AVENPYWGLDMRRGMKPAWDLHGEYSTDVFTKEAVKLINNHNSSRPMFLYLAH 187
Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
AVH+GN Y P APDEEVAKF +I D RR +AG++S+LD+SVG V+ AL K+ ML++S
Sbjct: 188 AAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGILSKLDQSVGQVVEALYKNNMLQDS 247
Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
+++F DNG P+ G + N SN PLRG+K+T W+GG+RG IWSP L + +VS ++ H
Sbjct: 248 VIVFSTDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPRLVRAGRVSRQMIH 307
Query: 341 ISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
I+DWLPTL AAG N TS+DGVN WD L++ ++ R+ ILHNID++ +A+
Sbjct: 308 ITDWLPTLITAAGGNASNLTSIDGVNLWDALSEDTESPRTNILHNIDDIYG----VSAIT 363
Query: 400 VDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAAD 459
V D K++ G+ NG D WYG + E Y V+ S AG
Sbjct: 364 VGDWKFIQGSTYNGVWDGWYGPSGREW-IYDADGVISSTAG------------------- 403
Query: 460 PKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVKPCLFNITDDPCEQNNL 518
+A +S L +T E I LRE A ++C N C +PCLFNI DPCE NNL
Sbjct: 404 -RAIASVGLS--VTTEIISSLRENAMIKCPPRNDSLPICRPLQEPCLFNIYQDPCEDNNL 460
Query: 519 AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
+ ++++L+++L + +T + PGN P+D +ADP W++ W
Sbjct: 461 VKQFPTIVRKLQEELKKFNNTAILPGNLPWDSKADPNLWDHTW 503
>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
Length = 532
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 255/522 (48%), Positives = 349/522 (66%), Gaps = 30/522 (5%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GWNDV FHGS QIPTPNIDALAY+GL+L+++YV +CTPSRSALMTGKYPIHIGMQHGV+
Sbjct: 26 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIHIGMQHGVL 85
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
EP GLPL EK+LP+YL++ GY+TH +GKWHLGF+ + YTPT+RGF SH G+W G QD
Sbjct: 86 KGAEPRGLPLHEKILPEYLRDLGYSTHIVGKWHLGFYTKEYTPTYRGFASHTGFWTGHQD 145
Query: 161 YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
Y+DH+ A PY GLDMR +M+ G YSTD++T+ A+ +I HN S+P+FLYLAH
Sbjct: 146 YFDHT--AVESPYWGLDMRRDMEPAWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLYLAH 203
Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
AVH+GN Y P APDEEVAKF +I D RR +AG++S+LD+SVG V+ LRK GML +S
Sbjct: 204 AAVHSGNPYNPLPAPDEEVAKFDNIFDYNRRRFAGILSKLDKSVGQVVEGLRKKGMLRDS 263
Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
+++F DNG P+ G + N SN PLRG+K+T W+GG+RG IWSP L ++ +VS +L H
Sbjct: 264 VIIFSTDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPKLTRSGRVSRQLLH 323
Query: 341 ISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRV 400
I+DWLPTL AAG + ++ ++DG++ W L+ ++ R+ +LHNID++ +A+ V
Sbjct: 324 ITDWLPTLITAAGGDPSNLNIDGMDLWSALSDDTESPRTSVLHNIDDIYG----VSAVTV 379
Query: 401 DDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADP 460
D K V G+ NG D WYG + E Y V+ S+AG
Sbjct: 380 GDWKLVQGSTYNGAWDGWYGPSGREW-VYDVGGVIGSEAG-------------------- 418
Query: 461 KANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCNSTVKPCLFNITDDPCEQNNLA 519
+A +S L +T E + LRE A ++C N C +PCLFN+ DPCE NNL
Sbjct: 419 RAVASVGLG--ITAETVRMLREKAMIKCPPKNDSLPLCKPLQEPCLFNVYQDPCEDNNLV 476
Query: 520 ESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
++++L+++L + S+ + PGN P+D + DP+ W++ W
Sbjct: 477 RQSPTIVRKLQEELKKFNSSAIYPGNLPWDNKGDPSLWDHTW 518
>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
Length = 675
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 256/557 (45%), Positives = 357/557 (64%), Gaps = 38/557 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++PHI+II+ADD+GWNDV FHGS+QIPTPNIDALAY+G+ILN+HY +CTPSR++LMTG
Sbjct: 30 QRPHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTG 89
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
K PI+IGMQH VI+ EPWGL L +K++P+Y KEAGY TH +GKWHLGF + YTPT RG
Sbjct: 90 KNPINIGMQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRG 149
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
FD+H GY DY+D++ K F P +QG DMR+N+ VD + G Y+TD +T+AA ++
Sbjct: 150 FDTHVGYLGPYVDYWDYTLK--FSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSI 207
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I H+ P+FL + HLA HA N +P QAP+E++ KF ISD RR YA MVS+LD+SV
Sbjct: 208 IERHDTKDPLFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSV 267
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G + +LR ML+NSI+LFM+DNGAP+ +H+N GSN+PLRG+KS PW+ R VAAIW
Sbjct: 268 GQIFNSLRSKNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIW 327
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI----NDTSLDGVNQWDVLTKGAKTKRSE 380
SP L++ Q+VS++ HISDWLPTL +AAGI+I + + +DG +QW+ L+ R
Sbjct: 328 SPLLQERQRVSNQFIHISDWLPTLASAAGIDIPFSKDHSEIDGQDQWEALSYDTGNPRRV 387
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV--LYSK 438
+L+ ID + Y++ + K+V GT +NG D WYG + S + L +
Sbjct: 388 VLNMIDEIYG----YSSYMENGFKFVNGTYSNGSYDGWYGQPNTSDQTLSDDQYIDLVLQ 443
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
IT A +T ++ + I LR+ ARV CN+ + CN
Sbjct: 444 TEITRWAGET-----------------------ISRDTIKYLRKHARVNCNHQPEANKCN 480
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
+PCLF+I +DPCE N+L+ ++L + Y+ P NKP D A+PA +
Sbjct: 481 PLKRPCLFDIINDPCELNDLSHKFPMKFRELRSTVQTYRRLATKPRNKPADPAANPANFG 540
Query: 559 NIWVPWYDELDKQKAIE 575
+W W + +++ I+
Sbjct: 541 GVWTWWRPDQTREELIK 557
>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
Length = 607
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 256/557 (45%), Positives = 357/557 (64%), Gaps = 38/557 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++PHI+II+ADD+GWNDV FHGS+QIPTPNIDALAY+G+ILN+HY +CTPSR++LMTG
Sbjct: 30 QRPHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTG 89
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
K PI+IGMQH VI+ EPWGL L +K++P+Y KEAGY TH +GKWHLGF + YTPT RG
Sbjct: 90 KNPINIGMQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRG 149
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
FD+H GY DY+D++ K F P +QG DMR+N+ VD + G Y+TD +T+AA ++
Sbjct: 150 FDTHVGYLGPYVDYWDYTLK--FSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSI 207
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I H+ P+FL + HLA HA N +P QAP+E++ KF ISD RR YA MVS+LD+SV
Sbjct: 208 IERHDTKDPLFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSV 267
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G + +LR ML+NSI+LFM+DNGAP+ +H+N GSN+PLRG+KS PW+ R VAAIW
Sbjct: 268 GQIFNSLRSKNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIW 327
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI----NDTSLDGVNQWDVLTKGAKTKRSE 380
SP L++ Q+VS++ HISDWLPTL +AAGI+I + + +DG +QW+ L+ R
Sbjct: 328 SPLLQERQRVSNQFIHISDWLPTLASAAGIDIPFSKDHSEIDGQDQWEALSYDTGNPRRV 387
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV--LYSK 438
+L+ ID + Y++ + K+V GT +NG D WYG + S + L +
Sbjct: 388 VLNMIDEIYG----YSSYMENGFKFVNGTYSNGSYDGWYGQPNTSDQTLSDDQYIDLVLQ 443
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
IT A +T ++ + I LR+ ARV CN+ + CN
Sbjct: 444 TEITRWAGET-----------------------ISRDTIKYLRKHARVNCNHQPEANKCN 480
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
+PCLF+I +DPCE N+L+ ++L + Y+ P NKP D A+PA +
Sbjct: 481 PLKRPCLFDIINDPCELNDLSHKFPMKFRELRSTVQTYRRLATKPRNKPADPAANPANFG 540
Query: 559 NIWVPWYDELDKQKAIE 575
+W W + +++ I+
Sbjct: 541 GVWTWWRPDQTREELIK 557
>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
Length = 528
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 254/530 (47%), Positives = 351/530 (66%), Gaps = 30/530 (5%)
Query: 33 IIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIH 92
+I+L GWNDV FHGS QIPTPNIDALAY+GL+L+++YV +CTPSRSALMTGKYPIH
Sbjct: 13 LILLLAVHGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIH 72
Query: 93 IGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHY 152
GMQHGV+ EP GLPL EKLLP+YL+E GY TH +GKWHLGF+++ YTPT+RGFD+H
Sbjct: 73 TGMQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVGKWHLGFYKKEYTPTYRGFDTHI 132
Query: 153 GYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK 212
G+W G DY+DH+ A PY GLD+R MQ G YSTD++T+ A+ +I HN S+
Sbjct: 133 GFWTGHHDYFDHT--AVENPYWGLDIRRGMQPAWDLHGQYSTDIFTKEAVRLIDNHNSSR 190
Query: 213 PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALR 272
PMFLYLAH AVH+GN Y P APDEEVAKF +I D RR +AGM+S+LD+SVG V+ ALR
Sbjct: 191 PMFLYLAHAAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGMLSKLDQSVGQVVDALR 250
Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
K ML +SI++F +DNG P+ G + N SN PLRG+K+T W+GG+RG IWSP L +
Sbjct: 251 KRDMLRDSIIIFSSDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGTGLIWSPRLVRPG 310
Query: 333 KVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ 392
+VS ++ HI+DWLPTL AAG + ++ ++DG++ W+ L++ ++ R+ +LHNID++
Sbjct: 311 RVSRQMLHITDWLPTLITAAGGDSSNLTVDGMDLWNALSEDTESPRANVLHNIDDIYG-- 368
Query: 393 KYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQI 452
+A+ V D K + G+ G D WYG S +E +Y G+
Sbjct: 369 --VSAITVGDWKLIRGSTYKGVWDGWYGP--------SGREWVYDVDGVI---------- 408
Query: 453 KQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVKPCLFNITDD 511
+ A++ ++ +T + +L LRE A ++C N C + CLFNI D
Sbjct: 409 -----SSTTAHAIASVGLGITTDVVLMLRENALIKCPLRNDSLPICKPLEEACLFNIYQD 463
Query: 512 PCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
PCE NNL + ++++L+D+L + S+ + PGN P+D + DP W++ W
Sbjct: 464 PCEDNNLVKQFPTIVRKLQDELKKFNSSALLPGNLPWDSKGDPNLWDHTW 513
>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
Length = 634
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 255/566 (45%), Positives = 357/566 (63%), Gaps = 33/566 (5%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
PK+PHI++I ADD+GWNDV FHGS+QI TP+IDALAY+G+ILN+HY +CTPSR+ALMT
Sbjct: 31 PKRPHIVLITADDMGWNDVGFHGSNQIATPHIDALAYDGVILNRHYSAPMCTPSRAALMT 90
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G++PI++GMQH VI EPWGL L ++++PQY + AGY TH IGKWHLGFF E Y PT R
Sbjct: 91 GRHPINVGMQHYVIDSDEPWGLGLDQRIMPQYFRAAGYRTHMIGKWHLGFFTEHYIPTNR 150
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
GFD+H GY DY+ + K ++G DMR N V+ G Y+TD +T AA ++IA
Sbjct: 151 GFDTHIGYLGPYVDYWSYVSKMNSGTFEGYDMRQNQFVNYAANGTYATDYFTSAARDIIA 210
Query: 207 EHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
+H KS +PM L + HLA HAGN +P QAP E + +F I + +RRTYA M+S+LD+SVG
Sbjct: 211 QHGKSGQPMLLVMNHLAPHAGNDDDPLQAPQETIDRFAYIGNRDRRTYAAMMSKLDDSVG 270
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
V +LR +GML+NSI++F++DNG + G+HSN GSN+P RG K +PW+G +R A IWS
Sbjct: 271 AVYESLRANGMLDNSIIVFLSDNGGVTRGMHSNTGSNYPFRGQKHSPWEGAVRTAALIWS 330
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS---LDGVNQWDVLTKGAKTKRSEIL 382
P LK TQ+VS++ FHISDWLPTL +AAGI + ++S +DG++QW+ L G R ++
Sbjct: 331 PLLKDTQRVSNQWFHISDWLPTLASAAGIALGNSSFSDIDGIDQWEALAYGTGNPRQRLM 390
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
+N+D + Y + + KY+ GT NG +D WYG+TD +D+ E S T
Sbjct: 391 NNLDEIFG----YTSYMENGFKYINGTTLNGVNDAWYGETD-PLDQLPDDEQYLSSVMAT 445
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE-LREFARVRCNYDNKGAHCNSTV 501
A +L DEK++ LR A V C CN V
Sbjct: 446 RIARAGRL-----------------------DEKLISYLRRHATVNCGNSIATKPCNPLV 482
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
KPCLF+I +DPCE +++++ L+ + ++L +Y+ P N+P D ADP+++ +W
Sbjct: 483 KPCLFDIINDPCEMHDISDEYPKRLRGIRNRLELYRLRAAKPRNRPHDPEADPSKFGGVW 542
Query: 562 VPWYDELDKQKAIETMRLHKPLLSPT 587
W + K +++ L PT
Sbjct: 543 NWWRTDPTKAASLQDEGDDSALPPPT 568
>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 545
Score = 508 bits (1308), Expect = e-141, Method: Compositional matrix adjust.
Identities = 266/556 (47%), Positives = 353/556 (63%), Gaps = 34/556 (6%)
Query: 11 LTCTLLFNDAFLNTTAPKK--PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
L LL ++ AP+ PHII I+ADDLGWNDV FHGS QIPTPNIDALAY+GLIL
Sbjct: 7 LFSYLLAARNWIVAAAPETTPPHIIFIVADDLGWNDVGFHGSGQIPTPNIDALAYSGLIL 66
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
N++YV +CTPSRSALMTGKYPIH GMQ GV+ EP GLPL EKLLP+YL+E GY TH
Sbjct: 67 NRYYVSPICTPSRSALMTGKYPIHTGMQRGVLKGAEPRGLPLKEKLLPEYLRELGYRTHI 126
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
+GKWHLGF+ + YTPT+RGF+SH GYW G QDYYDHS A EPY G+DMR NM+
Sbjct: 127 VGKWHLGFYTKEYTPTYRGFESHLGYWTGHQDYYDHS--AVEEPYWGMDMRRNMEPAWDL 184
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
G YSTD++T+ A+ +I HN S+PMFLYLAH AVH+ N Y P DE V K + I +
Sbjct: 185 HGQYSTDVFTKEAVKLIKSHNASQPMFLYLAHAAVHSANPYNPLPVSDEIVDK-IKIPEY 243
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
+RR +AGM S +D SVG V+ AL K ML + +++F DNG P+ G + N SN PLRG+
Sbjct: 244 KRRRFAGMASAMDASVGAVVEALEKRNMLRDCLIVFSTDNGGPAEGFNLNAASNWPLRGV 303
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQW 367
K T ++GG+RGV +WS L + ++S FHI+DWLPTL AAG +N T +DG++ W
Sbjct: 304 KDTLFEGGVRGVGLVWSERLARPGRLSEATFHIADWLPTLLRAAGGYVNATGKIDGLDAW 363
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID 427
+ L ++ R +LHNID+V AA+ VD K + G+ G+ ++WYG + +
Sbjct: 364 EALADDKESPRRVVLHNIDDVLGT----AAISVDQWKLIKGSTYRGRWNQWYGPSGRDY- 418
Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
Y + V ++ + ALK L L K++ELR A +
Sbjct: 419 PYDVEAV--AEGCLVAKALKR-------------------LGRGLDRGKLVELRNQATIH 457
Query: 488 CNYDNKGAH--CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
C + CN CL ++ DPCE+ NLA+ Q D++++LE++LA ++ VPPGN
Sbjct: 458 CGEQPQQPQKACNPLEAACLIDVRADPCERRNLADEQPDVVRRLEEELARLNASAVPPGN 517
Query: 546 KPFDKRADPARWNNIW 561
P+D +ADPA W++ W
Sbjct: 518 LPWDSQADPALWDHTW 533
>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
Length = 657
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 251/549 (45%), Positives = 339/549 (61%), Gaps = 48/549 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KPHI+II+ADD+GWNDV FHGS+QIPTPNIDALAY G+ILN+HY +CTPSR+A+MTG
Sbjct: 29 QKPHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYGGIILNRHYTAPMCTPSRAAIMTG 88
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+ PI +GMQH VI EPWGL L +K++P+Y +EAGY TH +GKWHLGFF + YTPT RG
Sbjct: 89 RNPISVGMQHYVIDSDEPWGLGLDQKIMPEYFREAGYRTHLVGKWHLGFFAQQYTPTMRG 148
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDSH Y G DMRHN+ VD G Y+TD +T+AA +I +
Sbjct: 149 FDSHTNY-------------------TGYDMRHNLAVDYDANGTYATDHFTDAASRIIDK 189
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
HN S+P+FL + HLA H GN +P QAP+E + KF ISD RR YA MVS+LD+SVG +
Sbjct: 190 HNPSEPLFLMVNHLAPHTGNDNDPLQAPEERIKKFEHISDENRRIYAAMVSKLDDSVGAI 249
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+LR M++NSI+LF++DNGAPSFG+HSN GSN PLRG K++PW+G R VAAIWSP
Sbjct: 250 FKSLRAKNMIQNSIILFISDNGAPSFGLHSNSGSNFPLRGQKNSPWEGATRNVAAIWSPL 309
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEI----NDTSLDGVNQWDVLTKGAKTKRSEILH 383
L++ Q+VS++ HISDWLPT + AGI + N + +DGV+QW+ L+ R IL+
Sbjct: 310 LEERQRVSNQYIHISDWLPTFASIAGIRLPQARNSSEVDGVDQWEALSYDTGYPRRVILN 369
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
ID + Y++ + KYV GT G D WYG +NE S +E
Sbjct: 370 MIDEIFG----YSSYMENGFKYVNGTTGGGNFDGWYGQLNNEDPHPSDEEY--------- 416
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
+KT L+ + + +L+ E I LR+ +RV C+ CN +
Sbjct: 417 --IKTVLETEIAQLSG----------DVLSSELIKHLRKHSRVVCHKPLHPTTCNPLKRA 464
Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
CLF+I +DPCE N+++ + ++L K+ + P NKP D +PA +W
Sbjct: 465 CLFDIVNDPCELNDISAREPFKFRELRTKVETLRRLAKKPRNKPHDPAGNPANHGGVWTW 524
Query: 564 WYDELDKQK 572
W ++ Q+
Sbjct: 525 WRKDISAQE 533
>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 567
Score = 501 bits (1291), Expect = e-139, Method: Compositional matrix adjust.
Identities = 255/562 (45%), Positives = 351/562 (62%), Gaps = 35/562 (6%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
L T KP+I+ ILADDLGWND+SFHGS +IPTPNIDALA+NG++LN Y Q +CTPSR
Sbjct: 20 LARTQSSKPNIVFILADDLGWNDLSFHGSDEIPTPNIDALAFNGIVLNNLYTQPVCTPSR 79
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
ALMTGKYPI +GMQ EP GLPL+EKLLP+YL+E GY T AIGKWHLGF+++ Y
Sbjct: 80 VALMTGKYPIKLGMQGPPTYGAEPNGLPLSEKLLPEYLRELGYTTRAIGKWHLGFYKQAY 139
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCK---ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
TPT RGFDSH+GY+ G YYD+ + F +QG DMR N + +G Y+TD++T
Sbjct: 140 TPTRRGFDSHFGYYTGYVSYYDYLLQDVYQNFGEFQGFDMRRNDTIAWDVVGKYATDVFT 199
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ A+ +I E ++P+F+YLAH+AVH GN + +AP EV KF I DP RR YA MVS
Sbjct: 200 DEAVRLIKEQPANQPLFMYLAHVAVHTGNRGKYLEAPQSEVNKFNHILDPNRRIYAAMVS 259
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-----------GIHSNKGSNHPLRG 307
+LDESVG V+ AL + ML+N+I++FM+DNG+PSF G+ +N GSN P RG
Sbjct: 260 KLDESVGRVVEALTEKKMLQNTIIVFMSDNGSPSFDGSGRNFRPEAGVTANWGSNFPYRG 319
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLDGVN 365
+K+T W+GG++ + IW+P+ ++ +VS +L HI+DWLPTL +AAG +LDGV+
Sbjct: 320 IKNTLWEGGVKSASFIWAPYFQENPRVSKQLMHITDWLPTLYSAAGGNPGFLPKNLDGVD 379
Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
QW LT ++R+ +L NID ++ YA + D+ K + G+ NG D ++G T
Sbjct: 380 QWTSLTLNLPSQRNFVLLNIDE----KQRYAGILKDNWKLIVGSTANGTLDGFFGATRPR 435
Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
Y+ VLYS AG L L ++ + + ++L +R +
Sbjct: 436 T-PYNATAVLYSPAGRALTKLSDSLPF-----------ATAGFGPMSSVRELLAMRYEST 483
Query: 486 VRCNYDNKGAHCNSTV---KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
+RCN G S + CLF++ DPCE NN+A+ + QL + L Y+ +VP
Sbjct: 484 IRCNPSAAGPRTQSPCPAGEACLFDLVADPCETNNVAKKYVAVSGQLYEALKYYRRLLVP 543
Query: 543 PGNKPFDKRADPARWNNIWVPW 564
N+PFD A+PAR+NN W W
Sbjct: 544 QTNRPFDPAANPARFNNTWSTW 565
>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
Length = 579
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 266/593 (44%), Positives = 362/593 (61%), Gaps = 37/593 (6%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L + +F+ AF + KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG++LN+
Sbjct: 10 LVVSTIFSLAF-GSGYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNK 68
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
HYV LCTPSR+ L+TGKYPIH GMQH VI+ EPWGLP E+L+P+ ++AGY+TH +G
Sbjct: 69 HYVPNLCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVG 128
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTI 189
KWHLGF+R+ TPT RGFD H+GY+ G DYYDH + Y GLD R +++ +
Sbjct: 129 KWHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRLLDRNYSAGLDFRRDLEPCPEAN 188
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
G Y+T+ +T A +I +H+KSKP+F+ L+HLAVH GN P QAP+EEVAKF I DP+
Sbjct: 189 GTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPK 248
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
RRTYAGMVS LD+SV I AL+ +GML NSI+L +DNGAP+ GIHSN GSN+P RG K
Sbjct: 249 RRTYAGMVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQK 308
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWD 368
+PW+GG+R A+WSP LK+ VS++ H DWLPTL AAG+ + D LDG+N W
Sbjct: 309 ESPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLPLDGINLWP 368
Query: 369 VLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEID 427
+L+ + K ++H +D V Y++ D LKYV G+ G+ D+W G+ + NE D
Sbjct: 369 MLSGNEEPKPRTMIHVLDEVFG----YSSYMRDTLKYVNGSSFEGRYDQWLGELETNEDD 424
Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
S + ++H S R LT+++I ++R A
Sbjct: 425 PLS--------------------ESYEQHVLASDVQSLLGNRG-LTEDRIRQMRSEATET 463
Query: 488 C------NYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
C N C PC F++ DPCE+ NLA+ L+QL D++ + T +
Sbjct: 464 CPPVEGQNPLEPHFKCEPLKAPCFFDLAKDPCERYNLAQIYPLQLQQLADEVEQIRKTAI 523
Query: 542 PPGNKPF-DKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLSPTTITLIY 593
P P D RA+P N W W++ D Q + T ++ L + I L+Y
Sbjct: 524 PSARVPHSDSRANPTFHNGNW-EWWNNTDTQSSSGTFSINLWLAMGSIIGLVY 575
>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
Length = 550
Score = 500 bits (1288), Expect = e-139, Method: Compositional matrix adjust.
Identities = 260/559 (46%), Positives = 351/559 (62%), Gaps = 43/559 (7%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
T KP+IIIIL DD+G+NDVSFHGS+QI TPNIDALAYNG++LN+ YV LCTPSR+ L
Sbjct: 22 TPSTKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGVLLNKLYVPNLCTPSRATL 81
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+TGKYPIH GMQH VI+ EPWGLP E+L+P++ ++AGY+T IGKWHLGF+ + TPT
Sbjct: 82 LTGKYPIHTGMQHYVIITDEPWGLPKQERLMPEFFRDAGYSTQLIGKWHLGFWEKDLTPT 141
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD HYGY+ G DYYDH+ + Y +G+D R ++ + G Y+TD +T A
Sbjct: 142 MRGFDHHYGYYNGYIDYYDHTLHMLTKNYTKGVDFRRDLDPCPQDNGTYATDAFTAEAKR 201
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
VI +H+KS+P+F+ L+HLAVH GN P QAP+EEVAKF I+DP+RRTYAGM+S LD+S
Sbjct: 202 VIEQHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHITDPKRRTYAGMISSLDKS 261
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VG + AL +GML NSI+L +DNGAP+ GIHSN+GSN+P RG K +PW+GG+R AI
Sbjct: 262 VGQTVKALSDNGMLNNSIILLYSDNGAPTVGIHSNQGSNYPFRGQKESPWEGGIRSAGAI 321
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEIL 382
WSP L+Q VS+++ H DWLPTL AAA +E+ + LDGVN W L+ + + +L
Sbjct: 322 WSPLLQQRGYVSNQIIHAIDWLPTLAAAANVELPENLKLDGVNMWPSLSGNIEPQPRRLL 381
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD----TDNEIDKYSPKEVLYSK 438
H +D+V Y++ D+LKYV G+ GQ D+W GD D+ + Y ++VL S+
Sbjct: 382 HVMDDVFG----YSSYMRDNLKYVNGSSFEGQYDQWLGDLESGQDDPLSPYYAQQVLSSE 437
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-- 496
+ +T+E+I ++RE A RC N+ H
Sbjct: 438 VHAVIGSQS------------------------VTNERIEQMREEATHRCPL-NEQDHLQ 472
Query: 497 ----CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKR 551
C PC FN+ DPCE+ NLA L+ L ++ ++ VP P D+R
Sbjct: 473 EIYLCEPLKAPCFFNLAKDPCERFNLANLYPLQLQFLSQEVEQFRQNAVPSARVPISDER 532
Query: 552 ADPARWNNIWVPWYDELDK 570
A+P W W++E K
Sbjct: 533 ANPKLHGGYW-EWWNETAK 550
>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
Length = 579
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 265/593 (44%), Positives = 362/593 (61%), Gaps = 37/593 (6%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L + +F+ AF + KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG++LN+
Sbjct: 10 LVVSTIFSLAF-GSGYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNK 68
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
HYV LCTPSR+ L+TGKYPI+ GMQH VI+ EPWGLP E+L+P+ ++AGY+TH +G
Sbjct: 69 HYVPNLCTPSRATLLTGKYPIYTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVG 128
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTI 189
KWHLGF+R+ TPT RGFD H+GY+ G DYYDH + Y GLD R +++ +
Sbjct: 129 KWHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRLLDRNYSAGLDFRRDLEPCPEAN 188
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
G Y+T+ +T A +I +H+KSKP+F+ L+HLAVH GN P QAP+EEVAKF I DP+
Sbjct: 189 GTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPK 248
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
RRTYAGMVS LD+SV I AL+ +GML NSI+L +DNGAP+ GIHSN GSN+P RG K
Sbjct: 249 RRTYAGMVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQK 308
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWD 368
+PW+GG+R A+WSP LK+ VS++ H DWLPTL AAG+ + D LDG+N W
Sbjct: 309 ESPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLPLDGINLWP 368
Query: 369 VLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEID 427
+L+ + K ++H +D V Y++ D LKYV G+ G+ D+W G+ + NE D
Sbjct: 369 MLSGNEEPKPRTMIHVLDEVFG----YSSYMKDTLKYVNGSSFEGRYDQWLGELETNEDD 424
Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
S + +KH S R LT+++I ++R A
Sbjct: 425 PLS--------------------ESYEKHVLASDVQSLLGNRG-LTEDRIRQMRSEATET 463
Query: 488 C------NYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
C N C PC F++ DPCE+ NLA+ L++L D++ + T +
Sbjct: 464 CPPVEGQNPLEPHFKCEPLKAPCFFDLAKDPCERYNLAQIYPLQLQELADEVEQIRKTAI 523
Query: 542 PPGNKPF-DKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLSPTTITLIY 593
P P D RA+P N W W++ D Q + T ++ L + I L+Y
Sbjct: 524 PSARVPHSDSRANPTFHNANW-EWWNNTDTQSSSGTFSINLWLAMVSIIGLVY 575
>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
Length = 579
Score = 498 bits (1283), Expect = e-138, Method: Compositional matrix adjust.
Identities = 261/575 (45%), Positives = 351/575 (61%), Gaps = 36/575 (6%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG++LN+HYV LCTPSR+ L+TGK
Sbjct: 27 KPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGK 86
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPIH GMQH VI+ EPWGLP E+L+P+ ++AGY+TH +GKWHLGF+R+ TPT RGF
Sbjct: 87 YPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGF 146
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
D H+GY+ G DYYDH + Y GLD R +++ + G Y+T+ +T A +I +
Sbjct: 147 DHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTSEAKRIIEQ 206
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H+KSKP+F+ L+HLAVH GN P QAP+EEVAKF I DP+RRTYAGM+S LD+SV
Sbjct: 207 HDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQT 266
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
I AL+ +GML NSI+L +DNGAP+ GIHSN GSN+P RG K +PW+GG+R A+WSP
Sbjct: 267 IGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWSPL 326
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
LK+ VS++ H DWLPTL AAG+ + D LDG+N W +L+ + K ++H +D
Sbjct: 327 LKERGYVSNQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKPRTMIHVLD 386
Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEIDKYSPKEVLYSKAGITFNA 445
V Y++ D LKYV G+ G+ D+W G+ + NE D
Sbjct: 387 EVFG----YSSYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLG--------------- 427
Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC------NYDNKGAHCNS 499
+ ++H S R LT ++I ++R A C N C
Sbjct: 428 -----ESYEQHVLASDVQSLLGNRG-LTKDRIRQMRSEATETCPPIEGQNPLESHFKCEP 481
Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKRADPARWN 558
PC F++ DPCE+ NLA+ L+QL D+L + T +P P D RA+P N
Sbjct: 482 LKAPCFFDLAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSARVPHSDSRANPTFHN 541
Query: 559 NIWVPWYDELDKQKAIETMRLHKPLLSPTTITLIY 593
W W++ D Q T ++ L + I L+Y
Sbjct: 542 GNW-EWWNNTDTQSGSGTFSINLWLAMGSIIGLVY 575
>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
Length = 542
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 258/564 (45%), Positives = 353/564 (62%), Gaps = 36/564 (6%)
Query: 40 LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGV 99
+G NDVSFHGS+QI TPNIDALAYNG++LN+HYV LCTPSR+ L+TGKYPIH GMQH V
Sbjct: 1 MGMNDVSFHGSNQILTPNIDALAYNGVLLNKHYVPNLCTPSRATLLTGKYPIHTGMQHWV 60
Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ 159
I+ EPWGLP E+L+P+ +EAGY+TH +GKWHLGF+R+ TPT RGFD HYGY+ G
Sbjct: 61 IITDEPWGLPKKERLMPELFREAGYSTHLVGKWHLGFWRQDLTPTMRGFDHHYGYYNGYI 120
Query: 160 DYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
DYYDH + Y GLD R + + + K G Y+T+ +T A +I EH+KSKP+F+ L
Sbjct: 121 DYYDHQVRLLGTNYSAGLDFRRDFEPNPKANGTYATEAFTSEAKRIIEEHDKSKPLFMVL 180
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
+HLAVH GN P QAP+EEVAKF I DP+RRTYAGM+S LD+SVG + AL+ + ML
Sbjct: 181 SHLAVHTGNEDNPMQAPEEEVAKFSHIKDPKRRTYAGMISSLDKSVGQTMRALKDNDMLN 240
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
N+IVL +DNGAP+ GIHSN GSN+P RG K +PW+GG+R V A+WSP L++ VS++
Sbjct: 241 NTIVLLYSDNGAPTVGIHSNAGSNYPYRGQKESPWEGGIRSVGALWSPLLEKQGYVSNQA 300
Query: 339 FHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAA 397
H DWLPTL AAAG+ + D LDG N W L++ A+ K + +H +D V Y++
Sbjct: 301 IHAIDWLPTLAAAAGVSLPQDLELDGENLWPSLSEKAEPKPRKFIHVLDEVFG----YSS 356
Query: 398 LRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQ-IKQKH 456
D LKYV G+ +GQ D W GD E ++ P +Y + L +++Q + H
Sbjct: 357 YMSDTLKYVNGSSFDGQYDYWLGDL--EANEDDPLSGIYGE-----QVLASEVQSVLGSH 409
Query: 457 AADPKANSSDALRTILTDEKILELREFARVRC----NYDNKGAH--CNSTVKPCLFNITD 510
LT+++IL++R A +C D H C V PC F++
Sbjct: 410 G--------------LTEDRILQMRSEATQKCPAIEGQDPLQPHYRCEPLVAPCFFDLAQ 455
Query: 511 DPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKRADPARWNNIWVPWYDELD 569
DPCE+ NLA L++L ++ ++ T +P PF D+RADP N W W++ +
Sbjct: 456 DPCERYNLARLYPLQLEKLRQEVEEFRQTAIPSARVPFSDERADPIFHNGNW-EWWNNTE 514
Query: 570 KQKAIETMRLHKPLLSPTTITLIY 593
+ T + ++ + + L+Y
Sbjct: 515 SDSSAGTCSRNLWIILGSIVALVY 538
>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 610
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 263/554 (47%), Positives = 352/554 (63%), Gaps = 32/554 (5%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+P+II ILADDLGWND+SFHGS QI TPN+DALAYNG+ILN YV +CTPSR+ALMTG
Sbjct: 39 KQPNIIFILADDLGWNDLSFHGSDQIQTPNLDALAYNGVILNSQYVLPVCTPSRAALMTG 98
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YPIH GMQ + EP LP KLLP YLK+ GY T +GKWHLG++++ +TPT+RG
Sbjct: 99 MYPIHNGMQGLPLEASEPRALP-AGKLLPSYLKDLGYTTRMVGKWHLGYYQKEFTPTYRG 157
Query: 148 FDSHYGYWQGLQDYYDHSC-----KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
FDSH GYW G+ YYD+ + G DMR N+ G Y+T+++T+ A+
Sbjct: 158 FDSHLGYWNGIVSYYDYILQEDDNRKPRSSLNGFDMRRNITPAYDLQGRYATEMFTDEAV 217
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
++I HNK+ P+FLY++HLAVHAGN + +AP E + KFL I+DP RRT+AGMV +LDE
Sbjct: 218 HLIRSHNKNTPLFLYMSHLAVHAGNPGKFLEAPQEAINKFLHIADPNRRTFAGMVQKLDE 277
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SVG +IAAL + ML NSI++F ADNGAP+ G + N GSN+PLRG+K T W+GG RGV+
Sbjct: 278 SVGRLIAALEESNMLHNSIIVFSADNGAPTIGGYPNWGSNYPLRGIKETLWEGGTRGVSF 337
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEI 381
IWSP +K T +VS+EL HISDWLPTL AAAG + N +LDGVNQW +++ ++R+EI
Sbjct: 338 IWSPLIKNTPRVSNELIHISDWLPTLYAAAGGNMENVKNLDGVNQWPSISENLPSERNEI 397
Query: 382 LHNIDNVDNPQKYYAALRVDDLKY----VAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
L NID + AA+R++ KY V GT NNG DE++G+ N+ + +P
Sbjct: 398 LLNIDEILQT----AAIRMNSRKYHWKLVVGTVNNGTYDEYFGEGTNKFPENNPP----- 448
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH- 496
FNA+ Q + + L + EK+ LR A +RC+Y +
Sbjct: 449 ---YDFNAVAN----SQVYRTLERLAQKRLLTYPTSKEKMKVLRRDATIRCDYKIINPNR 501
Query: 497 CNSTVKP---CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR-A 552
CN + P CL++I DPCE NNL ++ L L ++ ++VP +K + A
Sbjct: 502 CNPSDGPNQVCLYDIKRDPCELNNLTPFYPNVASYLFRALVQHRESLVPQESKELQQDLA 561
Query: 553 DPARWNNIWVPWYD 566
DP +N W PW D
Sbjct: 562 DPVNFNGTWTPWID 575
>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
Length = 577
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 259/557 (46%), Positives = 344/557 (61%), Gaps = 34/557 (6%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A KP+IIIIL DD+G+NDVSFHGS+QI TPNIDALAYNG++LN+HYV LCTPSR+ L+
Sbjct: 24 ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TGKYPIH GMQH VI+ EPWGLP E+L+P+ ++AGYATH +GKWHLGF+R+ TPT
Sbjct: 84 TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFD H+GY+ G DYYD + + Y GLD R +++ + G Y+T+ +T A V
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKV 203
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I H+K++P+F+ L+HLAVH GN P QAP+EEVAKF I DP+RRTYAGM+S LD+SV
Sbjct: 204 IERHDKNRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSV 263
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G + AL +GML NSIVL +DNGAP+ GIHSN GSN+P RG K +PW+GG+R V A+W
Sbjct: 264 GQTMRALADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRGQKESPWEGGIRSVGALW 323
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
SP L+Q VS++ H DWLPTL AAAG+ + +D LDG+N W L+ A+ + +LH
Sbjct: 324 SPLLQQRGYVSNQAIHAIDWLPTLAAAAGVSLPSDLRLDGLNLWPSLSASAQPQPRNLLH 383
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPKEVLYSKAGIT 442
+D+V Y++ D LKYV G+ G+ D W G+ D E D S +A
Sbjct: 384 VLDDVFG----YSSYTQDTLKYVNGSSFEGRYDHWLGELEDGEEDPNSRDYGQLVRASEV 439
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-----C 497
L + TD++I ++R+ A RC + C
Sbjct: 440 QAVLGS---------------------VAPTDDRIRQMRDEATQRCPIQERDPEDPVYLC 478
Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKP-FDKRADPAR 556
CLF++ +DPCE+ NLA L+ L ++ + T +P P D RADPA
Sbjct: 479 EPLKAACLFDLANDPCERYNLANLYPLQLQLLAQEVEELRLTALPSARVPRADVRADPAL 538
Query: 557 WNNIWVPWYDELDKQKA 573
+ W W + L A
Sbjct: 539 HDGHWEWWNNTLPGSSA 555
>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
Length = 562
Score = 490 bits (1261), Expect = e-135, Method: Compositional matrix adjust.
Identities = 268/570 (47%), Positives = 350/570 (61%), Gaps = 38/570 (6%)
Query: 7 YFFALTCTLL-FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+F L C LL DA +P KP+II ILADDLG+NDV FHGS++IPTPNIDALAY+G
Sbjct: 2 WFLWLICLLLPIIDAAEVEKSPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSG 61
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ILN++YV +CTPSRSALMTGKYPIH GMQH V+ EP GLPL EK+LPQYL E GY
Sbjct: 62 IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYT 121
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+H GKWHLG ++ YTP +RGF SH G+W G QDY DH+ A GLDMR+ QV
Sbjct: 122 SHIAGKWHLGHWKLKYTPLYRGFSSHVGFWSGHQDYNDHT--AVENNQWGLDMRNGTQVA 179
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
G Y+TD+ T+ ++ VIA HN +K P+FLY+AH A H+ N Y P PD +V K
Sbjct: 180 YDLHGHYTTDVITDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSH 239
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I + +RR +A MVS++D SVG ++ LRK MLENSI++F +DNG P+ G + N SN+P
Sbjct: 240 IPNYKRRKFAAMVSKMDNSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 299
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL--- 361
L+G+K+T W+GG+R +WSP LK++Q+VS++ HI DWLPTL AAG + ++L
Sbjct: 300 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGQPALSNLSKQ 359
Query: 362 -DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
DG + W L + + R +LHNID++ AAL V D K V GT+ G D WYG
Sbjct: 360 IDGQSIWRALVQDKASPRLNVLHNIDDIWGS----AALSVGDWKLVKGTNYRGSWDGWYG 415
Query: 421 DTDNEIDK-YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
+ Y + V S+AG ALK + +D R
Sbjct: 416 PAGERDPRLYDWQLVGRSRAGKALEALKM------------LPSRADQQR---------- 453
Query: 480 LREFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
+R A V C ++G C +T PCLF+I DDPCEQ NLA+ +++ L +L +
Sbjct: 454 IRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEVVNALMTELERF 513
Query: 537 KSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
+T VPP NKP D RADP WN W + D
Sbjct: 514 NATAVPPSNKPADPRADPRFWNYTWTNFGD 543
>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
Length = 577
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 252/562 (44%), Positives = 343/562 (61%), Gaps = 42/562 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG++LN+HYV LCTPSR+ L+TGK
Sbjct: 27 KPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGK 86
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPIH GMQH VI+ EPWGLP E+L+P+ ++AGY+TH +GKWHLGF+R+ TPT RGF
Sbjct: 87 YPIHTGMQHFVIITDEPWGLPSRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGF 146
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
D H+GY+ G DYYDH + Y GLD R +++ + G Y+T+ +T A +I +
Sbjct: 147 DHHFGYYNGYIDYYDHQVRMLDRNYSHGLDFRRDLEPCPEANGTYATEAFTSEAKRIIEQ 206
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H+KSKP+F+ ++HLAVH GN P QAP+EEVAKF I DP+RRTYAGM+S LD+SV
Sbjct: 207 HDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQT 266
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
I AL+ + ML NSI+L +DNGAP+ GIHSN GSN+P RG K +PW+GG+R A+WSP
Sbjct: 267 IGALKDNDMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWSPL 326
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
LK+ VS++ H DWLPTL AAG+ + D LDG+N W +L+ + K ++H +D
Sbjct: 327 LKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKPRTMIHVLD 386
Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNAL 446
V Y++ D LKY+ G+ G+ D W G+ L
Sbjct: 387 EVFG----YSSYMKDTLKYINGSSFQGRYDHWLGE------------------------L 418
Query: 447 KTKLQIKQKHAADPKANSSDALRTI----LTDEKILELREFARVRC------NYDNKGAH 496
KT + + +SD + LT+++I ++R A C N
Sbjct: 419 KTNEDHPLSESYEQHVLASDVQSILGNRWLTEDRIRQMRSEATEMCPPVEGLNPLEPQFK 478
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKRADPA 555
C PC F++ DPCE+ NLAE+ L+QL D++ + T +P P+ D RA+P
Sbjct: 479 CEPLKAPCFFDLAKDPCERYNLAETYPLQLQQLADEVEQIRKTAIPSARVPYSDSRANPT 538
Query: 556 RWNNIWVPWYDELDKQKAIETM 577
N W W++ D + T
Sbjct: 539 FHNGNW-EWWNNTDTDSSSGTF 559
>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
Length = 578
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 255/583 (43%), Positives = 352/583 (60%), Gaps = 34/583 (5%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
K+ L + + + A + + KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG
Sbjct: 5 KFHILLVVSAIVSLA-VGSAYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNG 63
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
++LN+HYV LCTPSR+ L+TGKYPIH GMQH VI+ EPWGLP E+L+P+ +EAGY+
Sbjct: 64 ILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFREAGYS 123
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQV 184
TH +GKWHLGF+ + TPT RGFD H+GY+ G DYYDH + Y GLD R +++
Sbjct: 124 THLVGKWHLGFWHKDLTPTRRGFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEP 183
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
+ G Y+T+ +T A +I +H+KSKP+F+ ++HLAVH GN P QAP+EEVAKF
Sbjct: 184 CPEANGTYATEAFTAEAKRIIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPH 243
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I DP+RRTYAGM+S LD+SV + AL+ +GML NSI+L +DNGAP+ GIHSN GSN+P
Sbjct: 244 IRDPKRRTYAGMISSLDKSVAQTVGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYP 303
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDG 363
RG K +PW+GG+R A+WSP LK+ VS++ H DWLPTL AAG+ + D LDG
Sbjct: 304 YRGQKESPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLPLDG 363
Query: 364 VNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
+N W +L+ + K ++H +D V Y++ D LKYV G+ G+ D+W G+ +
Sbjct: 364 INLWPLLSGNEQPKPRTMIHVLDEVFG----YSSYMKDTLKYVNGSSFQGRYDQWLGELE 419
Query: 424 NEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF 483
D P Y ++H S R LT+++I ++R
Sbjct: 420 TSAD--DPLSESY-----------------EQHVLASDVQSILGNRG-LTEDRIRQMRSE 459
Query: 484 ARVRC------NYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
A C N C PC F++ +DPCE+ NLA + L+QL D++ + +
Sbjct: 460 ATELCPPVEGQNPLEPHFKCEPLKAPCFFDLANDPCERYNLARTYPLQLQQLADEVVLIR 519
Query: 538 STMVPPGNKPF-DKRADPARWNNIWVPWYDELDKQKAIETMRL 579
T + D RA+PA N W W + A ++ L
Sbjct: 520 KTAISSARVAHSDSRANPAYHNGNWEWWNNTPSSSSATFSINL 562
>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
Length = 564
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 258/545 (47%), Positives = 332/545 (60%), Gaps = 36/545 (6%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+II ILADDLGWNDV FHGS+QIPTPN+DALAY+G+ILN++YV +CTPSRSALMTGK
Sbjct: 38 RPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVNPICTPSRSALMTGK 97
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPIH GMQH V+ EP GLPL+EKLLPQYLK+ GY+ H +GKWHLG ++ +TP RGF
Sbjct: 98 YPIHTGMQHTVLYAMEPRGLPLSEKLLPQYLKDLGYSNHIVGKWHLGHYQLRFTPMQRGF 157
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
DSH G+W G DH+ A + GLDMR V G Y+T + AI ++ H
Sbjct: 158 DSHTGFWTGHHHMNDHT--AVEHGHWGLDMRRGYDVAYDLHGQYTTHVLGAEAIAIVQGH 215
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
NKS P+FLY+AH AVH+ N Y+ APDE VA I + RR +A M+ LD +VG+++
Sbjct: 216 NKSSPLFLYVAHAAVHSANPYDFLPAPDETVANLGHIENYRRRKFAAMMVELDRTVGSLV 275
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
AL MLEN+I++F +DNG P+ G + N SN PLRG+K+T W+GG+R IWSP L
Sbjct: 276 DALHARDMLENTIIVFSSDNGGPADGFNDNAASNWPLRGVKNTLWEGGLRAAGFIWSPLL 335
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLDGVNQWDVLTKGAKTKRSEILHNID 386
+ +VS ++ + DWLPTL AAG ++ T LDG++ W L+ GA T+R EILHNID
Sbjct: 336 QNVSRVSHQMVQVCDWLPTLYEAAGGNVSALPTDLDGISVWHELSSGAPTRRIEILHNID 395
Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNAL 446
++ AAL V + K V G+ N D WYG AGI
Sbjct: 396 DIWGT----AALTVGNWKLVKGSHYNRTWDGWYG-----------------PAGIR---- 430
Query: 447 KTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH----CNSTVK 502
K K A P + L + T E+I +LR A V C GAH C+ +
Sbjct: 431 DEKAYALDKLANSPAGKVMEELSLLPTPERITQLRREATVSC---GAGAHMASECDPLER 487
Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
PCL+++ DPCE NNLAE +L+ L +LA Y ST VPP N D R +P RWN W
Sbjct: 488 PCLYDVESDPCEYNNLAEEHLHILQSLLARLADYNSTAVPPSNLEDDPRGEPQRWNFTWH 547
Query: 563 PWYDE 567
+ DE
Sbjct: 548 NFGDE 552
>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
Length = 552
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 253/565 (44%), Positives = 339/565 (60%), Gaps = 32/565 (5%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
+ +L L T++ P+II ILADDLGWNDV FHGS+QIPTPN+DALAY+G+I
Sbjct: 3 LVVIAVLILAFTIVLVTSSKPPPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAYSGII 62
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
LN++YV +CTPSR+ALMTG+YPIH GMQH V+ EP GLPL EKLLP+YL+E GY H
Sbjct: 63 LNRYYVTPICTPSRAALMTGRYPIHTGMQHAVLYGMEPRGLPLEEKLLPEYLRELGYKNH 122
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLG + YTP RGFDSH G+W G +DHS T GLDMR V
Sbjct: 123 IVGKWHLGHYTRRYTPLERGFDSHVGFWTGHHHMFDHSAVET--ETWGLDMRRGYDVAYD 180
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
G Y+T + + A+ I H+ P+FLY+AH AVH+ N Y+ APD VA +
Sbjct: 181 LHGKYTTHVIRDEAVARIGNHSIGDPLFLYVAHAAVHSANPYDFLPAPDVTVAGLEHVEP 240
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
RR +A M+S LDESVG ++ AL+ GML+N+I++F +DNG P+ G +SN SN PLRG
Sbjct: 241 YPRRKFAAMLSELDESVGAIVEALKIKGMLDNTIIVFSSDNGGPAEGFNSNAASNWPLRG 300
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVN 365
+K+T W+GG+R IWSP + ++++VS ++ HISDWLPTL AAG +E +LDG+N
Sbjct: 301 VKNTLWEGGVRAAGFIWSPLIAESRRVSQQMIHISDWLPTLLDAAGYNVETLPKTLDGIN 360
Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DN 424
W L G T+R EILHNID++ AAL V++ K + GT+ GQ D WYG D
Sbjct: 361 VWKSLLNGNSTERREILHNIDDIWGS----AALTVNNWKILKGTNYKGQWDNWYGPPGDR 416
Query: 425 EIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFA 484
+ + Y+ V + P + + + T+ I LR A
Sbjct: 417 DPEAYNITAVQHC----------------------PTGKALSEMNLLPTEATIRNLRLNA 454
Query: 485 RVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPG 544
V C+ + + A CN +PCL+++ DPCE +NLAE +L + KL Y +T VPPG
Sbjct: 455 TVHCSKEVR-ATCNPLEQPCLYDVLQDPCELDNLAERFPAILDSMLKKLDAYNATAVPPG 513
Query: 545 NKPFDKRADPARWNNIWVPWYDELD 569
N D R DP W W + D+L+
Sbjct: 514 NLALDTRGDPRFWGYTWHNFGDDLE 538
>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
Length = 533
Score = 476 bits (1226), Expect = e-131, Method: Compositional matrix adjust.
Identities = 258/550 (46%), Positives = 349/550 (63%), Gaps = 38/550 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHII I+ADDLGWNDV +HGS +IPTPNIDALAYNG+ILN++YV +CTPSRSALMTG
Sbjct: 8 KPPHIIFIMADDLGWNDVGYHGSDEIPTPNIDALAYNGIILNRYYVLPVCTPSRSALMTG 67
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
++PIH GMQH V+ E GLPLTEKLLP+YL++ GY+TH +GKWHLGF+++ YTP +RG
Sbjct: 68 RHPIHNGMQHRVLFGVETRGLPLTEKLLPEYLQKLGYSTHIVGKWHLGFYKKEYTPLYRG 127
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
F+SH G+W G QDYYDH+ + E GLDMRH M+ G YST +YT ++ +I
Sbjct: 128 FESHIGFWTGHQDYYDHTAEE--ERLWGLDMRHGMKPAWYLHGEYSTHVYTRESVKIIKN 185
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+N +KP+FLY+AH AVH+GN Y P APD+ V K I + RR YA MVS LD SVGN+
Sbjct: 186 YNSTKPLFLYVAHAAVHSGNKYNPLPAPDKTVDKLDHIQNYNRRRYAAMVSELDTSVGNI 245
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
I+AL+ ML NSIV+F DNG P+ G++ N SN P+RG+K+T W+GG+RGV +WS
Sbjct: 246 ISALKDANMLSNSIVIFSTDNGGPANGLNYNYASNWPMRGVKNTLWEGGVRGVGVVWSSK 305
Query: 328 LKQTQ--KVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAKTKRSEILH 383
K +VS+ + HI DWLPTL AAAG ++ D +LDG N W+ L + R+EILH
Sbjct: 306 FKPNSGGRVSNRMMHICDWLPTLYAAAGGDVRDLGENLDGYNMWNSLLDDLPSPRNEILH 365
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEIDKYSPKEVLYSKAGIT 442
NID++ +AL + K V GT NG D W+G T N Y+ +VL S +
Sbjct: 366 NIDDIFGN----SALTMSQWKLVKGTTYNGFYDSWFGPTGRNPGYFYNIGQVLSSLSNKA 421
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC------NYDNKGAH 496
+ L+T L E I +RE A + C + +++ ++
Sbjct: 422 LSTLQT---------------------VPLNPESIRRIREEATIDCGNHQLISSESRFSN 460
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
CN + CLFN+ +DPCE +N+A+S ++K++E++L + ST V P D R +P
Sbjct: 461 CNPLKESCLFNVDEDPCEIDNVAKSFPQVVKEMEERLLYWNSTAVKPAKVINDLRGNPKF 520
Query: 557 WNNIWVPWYD 566
W +W + D
Sbjct: 521 WGYVWTNFGD 530
>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
Length = 563
Score = 471 bits (1211), Expect = e-130, Method: Compositional matrix adjust.
Identities = 239/562 (42%), Positives = 336/562 (59%), Gaps = 31/562 (5%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L L + + ++P+I++I+ADDLGWNDV FHGS+QIPTPNID +A++G+ L+
Sbjct: 30 LVLLLFIVTSLSDCECHERPNIVLIIADDLGWNDVGFHGSNQIPTPNIDIMAWSGVSLHN 89
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
+YV +CTPSR+ALMTGKYPIH GMQH VI EP GLPLTEK+LPQYLKE GY TH +G
Sbjct: 90 YYVTPICTPSRAALMTGKYPIHTGMQHTVIFAAEPRGLPLTEKILPQYLKELGYKTHLVG 149
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
KWHLG +++ Y P RGFDSH G+W G D YDH+ + + Y G D R + + G
Sbjct: 150 KWHLGSYKKEYLPLNRGFDSHLGFWNGKIDMYDHTNQE--KGYWGFDFRRDFSTAHDLFG 207
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
Y+TD+YT A+ +I HN S P+FL L+H AVH GN EP +AP++ F I D +R
Sbjct: 208 QYATDVYTNEAVKIIKSHNTSSPLFLMLSHSAVHTGNPSEPIRAPEKLFVNFTHIQDFQR 267
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
R +A ++++LDESVG V+AAL+ G+L +SIV+F DNG + G + N SN+PLRG+K+
Sbjct: 268 RKFAAVLTKLDESVGEVVAALKAKGVLNDSIVVFTTDNGGAAAGFNDNAASNYPLRGVKN 327
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDV 369
T W+GG+RG +WSP++ + +V+++ H+ DWLPTL +AAG+ ++ +DGV+QW
Sbjct: 328 TLWEGGVRGAGWLWSPFIDKRSRVATQRMHLVDWLPTLLSAAGMNVSSIKHIDGVDQWCA 387
Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKY 429
L++ + R ++HNID+ + A++ K GT+ G D WYG E Y
Sbjct: 388 LSQDLPSARESLVHNIDD----ESGSASITYKQWKVHKGTNYGGSWDGWYGPAGRE-GAY 442
Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN 489
+L SKA A + + + E IL LR A V C
Sbjct: 443 DTTRLLASKA----------------------AGALLDIGMLPDTEHILRLRSEATVECG 480
Query: 490 YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD 549
D + C PCLFNI +DPCE NLA+ D+L+ + +L T VPP N+P
Sbjct: 481 -DREALPCRPLEAPCLFNIDEDPCETRNLADIHPDVLQVMLKELDRVNRTAVPPNNQPLT 539
Query: 550 KRADPARWNNIWVPWYDELDKQ 571
DP W + + D ++ +
Sbjct: 540 PGGDPKYWGYVITNFGDYINNE 561
>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
Length = 561
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 248/567 (43%), Positives = 344/567 (60%), Gaps = 25/567 (4%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
F LL ++ T+P+ P+I+ ILADDLGWNDV FHGS Q+ TPN+DALAY+G+IL
Sbjct: 6 FRCVAFLLLQLTAVSATSPR-PNIVFILADDLGWNDVGFHGSEQVLTPNLDALAYDGVIL 64
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
+YVQ +CTPSR+ALMTG++PIH GMQ VI EP+GL L E + PQYLK+ GY TH
Sbjct: 65 ENYYVQPICTPSRAALMTGRHPIHTGMQQNVIYSAEPYGLGLNEIIFPQYLKQIGYKTHI 124
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH----SCKATFEPYQGLDMRHNMQV 184
+GKWHLGFF E YTP RGFDSHYGY+ G +DY+ H + + +F GLD R N +V
Sbjct: 125 VGKWHLGFFAEQYTPIERGFDSHYGYYMGAEDYWVHIAGNAHEVSFNASWGLDFRRNGEV 184
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY---EPFQAPDEEVAK 241
G YST+L+T A N+IA HN+S+P+F+Y+A AVH+ N Y +AP + K
Sbjct: 185 VKTAFGQYSTELFTTEAENIIASHNQSEPLFMYVAQQAVHSANPYTGDAELEAPFKYYEK 244
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
F I + +RR +A MVS LD+SVGN+ AL G+LEN+I+ F DNG P+ G N S
Sbjct: 245 FPHIKNEKRRKFAAMVSALDDSVGNITKALHAKGILENTIICFSTDNGGPASGFDFNDAS 304
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS- 360
N PLRG+K + W+GG+RG +WSP +K + S + I D+LPTL AAG +++ S
Sbjct: 305 NWPLRGVKESLWEGGVRGTGFVWSPLMKTSGYTSPAMMQIFDFLPTLLKAAGYDMSKLSP 364
Query: 361 -LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
LDG++QW+++++ R+E+LHNIDN + + ALR++++K V + +S +
Sbjct: 365 DLDGIDQWEIISERKDAVRTEMLHNIDNTEGTR----ALRINEMKVVVSQGGSTRSWPLW 420
Query: 420 GDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
TD Y+ + S L K I + A ++ + LR + + L
Sbjct: 421 YSTDQFYGGYNHTDHSVSA------DLSAKWDIDSEVAPRVNSDLTSVLRNL---GRPLR 471
Query: 480 LREFARVRCNYDNKGAH--CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
++ C A C+ KPCLFNIT DPCE +NLA+ + D+L + D+L Y+
Sbjct: 472 SGTPLKIHCGPVPPSARGSCDVGKKPCLFNITADPCEFHNLADERPDMLSTMLDRLDQYQ 531
Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW 564
ST V P NKP D P++ IW PW
Sbjct: 532 STAVAPRNKPVDPNGLPSKHGGIWEPW 558
>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
Length = 557
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 252/569 (44%), Positives = 336/569 (59%), Gaps = 25/569 (4%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
Y + ++L + ++ + ++P++I ILADD GWNDVSFHGS QIPTPN+DALA++GL
Sbjct: 3 YLHIVALSVLVSSSWASGKQIQQPNLIFILADDFGWNDVSFHGSKQIPTPNLDALAFSGL 62
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
IL +YV LCTPSRSALMTGK+PIH GMQH V+ +GLPL+E LP+YLK+ GY
Sbjct: 63 ILQNYYVTPLCTPSRSALMTGKHPIHTGMQHDVLYGYSRYGLPLSEITLPEYLKDLGYKN 122
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
H +GKWHLG ++ VYTP FRGFDSHYGYW G QDYYDH+ A G DMR N VD
Sbjct: 123 HIVGKWHLGHYKSVYTPLFRGFDSHYGYWTGHQDYYDHT--AVEWNAWGYDMRRNHSVDW 180
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
G Y+T L T+ A +VI +H+ S P+FLY+AHLAVH+ N Y P QAP+E V F I
Sbjct: 181 SAYGKYTTTLLTDEACDVITKHDVSSPLFLYVAHLAVHSANPYSPLQAPEETVEMFSSIE 240
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
+ +RR YA MV LD SVG ++ +L + MLEN++++F DNG P+ G + N SN PL+
Sbjct: 241 NLQRRRYAAMVHELDVSVGKIVKSLGDNNMLENTVIVFSTDNGGPAEGFNQNAASNWPLK 300
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIE--INDTS-- 360
G+K+TPW+GG+R IWSP + ++++ V S L ISDWLPTL AAG D +
Sbjct: 301 GVKNTPWEGGVRAAGLIWSPLIPKSRRGQVMSNLMDISDWLPTLFEAAGQSRICGDVATL 360
Query: 361 --LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
LDGV+ W+ + + R+ +LHNID+ Q YAA+R ++ K V GT G D W
Sbjct: 361 KGLDGVSHWNTVLYDKPSARNHVLHNIDD----QLGYAAIRKENWKLVKGTTYEGSWDGW 416
Query: 419 YGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL 478
YG P + N++ I + D A A+ + + +
Sbjct: 417 YG----------PSGRTNESSTNFINSVDDVSLIYKNAVLDSDAAQYIAMLGMSATQNLY 466
Query: 479 ELREFARVRCNYDNKGAH-CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
+ A + C AH C CLF+I DPCE NNLAE + + L L+ Y
Sbjct: 467 QQLLDAEISCGERPVDAHPCLPLDALCLFDIEQDPCEYNNLAEKMPHITEDLLSLLSWYN 526
Query: 538 STMVPPGNKPFDKRADPARWNNIWVPWYD 566
ST V P N D ++P WN + W D
Sbjct: 527 STAVAPLNTSPDPMSNPKYWNYTYTNWAD 555
>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
Length = 542
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 260/570 (45%), Positives = 339/570 (59%), Gaps = 58/570 (10%)
Query: 7 YFFALTCTLL-FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+F L C LL DA N P KP+II ILADDLG+NDV FHGS++IPTPNIDALAY+G
Sbjct: 2 WFLWLICLLLPIIDAAENEKPPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSG 61
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ILN++YV +CTPSRSALMTGKYPIH GMQH V+ EP GLPL EK+LPQYL E GY
Sbjct: 62 IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYT 121
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+H GKWHLG ++ YTP +RGF SH+ GLDMR+ QV
Sbjct: 122 SHIAGKWHLGHWKLKYTPLYRGFSSHW----------------------GLDMRNGTQVA 159
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
G Y+TD+ T+ ++ VIA HN +K P+FLY+AH A H+ N Y P PD +V K
Sbjct: 160 YDLHGHYTTDVITDQSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSH 219
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I + +RR +A MVS++D+SVG ++ LRK MLENSI++F +DNG P+ G + N SN+P
Sbjct: 220 IPNYKRRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 279
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL--- 361
L+G+K+T W+GG+R +WSP LK++Q+VS++ HI DWLPTL AAG + +L
Sbjct: 280 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIVDWLPTLLEAAGGQPALANLSKQ 339
Query: 362 -DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
DG + W L + R +LHNID++ AAL V D K V GT+ G D WYG
Sbjct: 340 IDGQSIWRALVHDKASPRLNVLHNIDDIWGS----AALSVGDWKLVKGTNYRGSWDGWYG 395
Query: 421 DTDNEIDK-YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
+ Y + V S+AG ALK + +D R
Sbjct: 396 PAGERDPRLYDWQLVGRSRAGKALEALKM------------LPSRADQQR---------- 433
Query: 480 LREFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
+R A V C ++G C +T PCLF+I DDPCEQ NLA+ +++ L +L +
Sbjct: 434 IRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEVVSALMAELERF 493
Query: 537 KSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
+T VPP NKP D RADP WN W + D
Sbjct: 494 NATAVPPSNKPADPRADPRFWNFTWTNFGD 523
>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
Length = 560
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 249/549 (45%), Positives = 348/549 (63%), Gaps = 34/549 (6%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
TA K P+III++ADD+G++D+S G+ + TPNIDALAY+G +L++ Y ++CTPSR AL
Sbjct: 28 TASKLPNIIIVMADDMGFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGAL 87
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
++GKYPIH G QH VIL EPWGL L L+P+ ++AGY+T+ +GKWHLGF R YTPT
Sbjct: 88 LSGKYPIHTGTQHSVILNEEPWGLALNATLMPEIFRDAGYSTNLVGKWHLGFVRPEYTPT 147
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
+RGFD HYGYW G DYY + + Y G D R NMQV+ G+Y TDL T A
Sbjct: 148 YRGFDYHYGYWGGYIDYYQRRSQMPSDNYSMGYDFRRNMQVECTDRGVYVTDLLTNEAER 207
Query: 204 VIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+I E K +P+FL L+HLA H GN +P QAP+EE+ KF+ I+DP RR YA MVS+LDE
Sbjct: 208 IIREQPAKEQPLFLILSHLAPHTGNEIDPLQAPEEELQKFVHINDPNRRLYAAMVSKLDE 267
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SVG VI AL + LENSIV+F +DNGAPS GI +N GSN PL+G K +PW+GG+R A
Sbjct: 268 SVGRVITALARTDQLENSIVIFYSDNGAPSVGIFANTGSNWPLKGQKDSPWEGGLRVAGA 327
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLT--KGAKTKRS 379
IWSP LK + ++ ++ D+LPTL AAGIE++ + LDG++ W L+ K A
Sbjct: 328 IWSPLLKARGNLFTQPIYVGDFLPTLAHAAGIELDSSLHLDGIDLWPQLSGPKDAPHVPR 387
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
EILH +D+V Q +L + KYV GT + GQ D+ T ++D P+ +Y A
Sbjct: 388 EILHQLDDVGRMQ----SLLLGQYKYVNGTTSKGQYDQLL--TYRKLDDLDPRASIYGMA 441
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
+++A K +S ++ I T ++I L A V C+ +G CN+
Sbjct: 442 --------------VRNSAVSKELASFDIQQI-TQDRIDILSRSAAVICSDMQRG--CNA 484
Query: 500 TVKPCLFNITDDPCEQNNLAESQ--TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA-- 555
++ CL++++ DPCEQNNLA S+ +++L + ++ +S V PGN+P ++ ADP+
Sbjct: 485 LLEECLYDLSTDPCEQNNLAYSKQHSEILAIMRQRVQQLRSGAVAPGNRPNERDADPSLH 544
Query: 556 --RWNNIWV 562
W+N V
Sbjct: 545 ECTWDNFGV 553
>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 514
Score = 464 bits (1195), Expect = e-128, Method: Compositional matrix adjust.
Identities = 253/541 (46%), Positives = 327/541 (60%), Gaps = 59/541 (10%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHIIIILADDLGWNDV FHGS+QIPTPNIDALA+ G+ILN +YV +CTPSRSAL+TGKY
Sbjct: 31 PHIIIILADDLGWNDVGFHGSNQIPTPNIDALAFTGIILNNYYVAPVCTPSRSALLTGKY 90
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PIH G+QHGVI P+GL L EKLLP+YL+ Y T +GKWHLG F++ YTP +RGFD
Sbjct: 91 PIHTGLQHGVIHGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKDYTPEYRGFD 150
Query: 150 SHYGYWQGLQDYYDHSCKATFEP-YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
SHYGYW G QDYYDH+ A P + G DMR M V G Y+TDL+T A+ VI H
Sbjct: 151 SHYGYWTGHQDYYDHT--AIENPGFWGYDMRRGMNVTRSDFGYYTTDLFTNEAVKVIKGH 208
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+ +KP+FLYLAHLA H+GN Y P QAP E VAKF I D RR +AGM+S+LDESVG V+
Sbjct: 209 DSNKPLFLYLAHLATHSGNKYSPLQAPAETVAKFNYIKDKNRRLFAGMLSKLDESVGKVV 268
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
AL M+ N ++LF DNG P+ G + N SN PLRG+K T W+GG+RGV IWSP+L
Sbjct: 269 EALADSNMINNCVILFSTDNGGPAGGFNLNAASNWPLRGVKDTLWEGGVRGVGFIWSPFL 328
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ KVS+ + HI+DWLPTL + + + +DG+N W L+KG + R EIL NID
Sbjct: 329 -PSSKVSNAMIHITDWLPTLLSLTNASNSISDIDGINVWPNLSKGLPSVRKEILLNIDT- 386
Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
++ AALR + KY G NN ++WYG + +++ Y E+ S A +K
Sbjct: 387 ---ERKIAALRYKNWKYKKGNANNW--NKWYGPSGRDMN-YDWNELYNSDAAKAIKHIK- 439
Query: 449 KLQIKQKHAADPKANSSDALRTILTDEK-ILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
+L ++K ++EL +V CN + + +
Sbjct: 440 ----------------------VLPNKKLVMELINATQVTCNVKEEEEEEDES------- 470
Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDE 567
+ CE+ + Q+ S MVPP NKP D R++P W+ W W D
Sbjct: 471 -DQNSCER--------VIWHQI--------SKMVPPRNKPVDIRSNPVYWDYTWTNWMDY 513
Query: 568 L 568
L
Sbjct: 514 L 514
>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
Length = 1007
Score = 464 bits (1193), Expect = e-128, Method: Compositional matrix adjust.
Identities = 249/566 (43%), Positives = 335/566 (59%), Gaps = 52/566 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P++I+I+ADDLGWNDVSFH S QI TPNID LAY+G+ILN+HY T S+ ALMTG +
Sbjct: 456 PNVIVIVADDLGWNDVSFHSSKQIFTPNIDVLAYHGVILNRHYCAPFGTASQVALMTGSH 515
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
P+ +G Q +PW L KL+P+Y ++AGYATH IGKW LGF R+ YTPT RGFD
Sbjct: 516 PLSVGTQSASNEPDQPWTLDPELKLMPEYFRDAGYATHLIGKWGLGFSRKDYTPTQRGFD 575
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
SH+G+ DY+DHS + +GLDMR N+ VD G Y+TDL+ A+ +I EH+
Sbjct: 576 SHFGFLGPYIDYWDHSLRLRNTSTRGLDMRRNLDVDYSVNGSYATDLFNGEAVRLIREHD 635
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+ KP+ L L HLA H GN +P QAP EEV KF I D +RR A M+S++DE VG +I
Sbjct: 636 QKKPLLLVLTHLAPHTGNEDDPMQAPAEEVEKFDYIRDEKRRVLAAMISKVDEGVGQIIQ 695
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM--------------------- 308
L + ML+NSIVLF ADNGAP+ G+HSN GSN PLRG+
Sbjct: 696 TLAERDMLDNSIVLFYADNGAPTVGMHSNSGSNFPLRGVCIDNWSTLHIRCTNGCFSFQQ 755
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAAGIE--INDTSLDGVN 365
K +PW+G +R VAAIWSP L T +VS++ H+SDWLPTL AAGIE + +DG N
Sbjct: 756 KYSPWEGAVRTVAAIWSPLLNLTAGRVSNQWIHVSDWLPTLAHAAGIEGIPIGSEIDGRN 815
Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
QW+ L A T R+ +++NID + +Y++ + KYV GT G+ D W G+ ++E
Sbjct: 816 QWEALKNPAITVRNVVMNNIDEL----HHYSSYSRNGWKYVNGTSWEGKFDNWMGELEDE 871
Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
+ + V+ + + L E + LR A
Sbjct: 872 DELSEEEYVVRLAGSVVGRMMPLDL------------------------EHVARLRRAAT 907
Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
V C + + C PCLFN+ DDPCE+NN+A Q D+L++L +++ Y+ T V P N
Sbjct: 908 VECEVEGQVKACQPMESPCLFNLMDDPCERNNVALDQLDILEELREEVERYRQTAVEPRN 967
Query: 546 KPFDKRADPARWNNIWVPWYDELDKQ 571
KP D R+DP +NN W W DE+D +
Sbjct: 968 KPADPRSDPGFYNNTWTWWLDEIDSR 993
>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
Length = 545
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 258/551 (46%), Positives = 328/551 (59%), Gaps = 56/551 (10%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P KPHII ILADDLG+NDV FHGS+QIPTPNIDALAY+G+ILN++YV +CTPSRSALMT
Sbjct: 23 PPKPHIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMT 82
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
GKYPIH GMQH V+ EP GLPL EK+LPQYL + GY +H GKWHLG ++ YTP FR
Sbjct: 83 GKYPIHTGMQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFR 142
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
GF SH W GLDMR+ V G Y+TD+ T+ +++VIA
Sbjct: 143 GFSSHKNLW-------------------GLDMRNGTDVAYNLHGQYTTDVITKHSVSVIA 183
Query: 207 EHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
H+ +K P+FLY+AH A H+GN Y P PD++V K I +RR YA MV+++DESVG
Sbjct: 184 NHDAAKGPLFLYVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVG 243
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
++ LRK MLENSI++F +DNG P+ G + N SN+PLRG+K+T W+GG+R +WS
Sbjct: 244 VIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLRGVKNTLWEGGVRAAGLVWS 303
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAG-----IEIN-DTSLDGVNQWDVLTKGAKTKRS 379
P L++ Q+V E HISDWLPTL AAG N SLDG++ W L + R
Sbjct: 304 PLLEKRQRVVDETMHISDWLPTLIEAAGGIPALANANLSQSLDGMSIWQTLVHAYPSPRK 363
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG-DTDNEIDKYSPKEVLYSK 438
+LHNID++ AAL V D K + GT+ NG+ D WYG + + Y VL S
Sbjct: 364 TVLHNIDDIWGS----AALTVGDWKMIQGTNYNGRWDGWYGPGGERDPHLYDWNLVLKSS 419
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC-NYDNKGAHC 497
+G LK + +D R LRE A V C +G C
Sbjct: 420 SGKAMTGLKM------------LPSRADQQR----------LRETATVTCQGQSTQGTSC 457
Query: 498 NSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
+T PCLF++ DDPCEQ NLAE +L L +L ST VPP N+P D RA P
Sbjct: 458 VATAFSAPCLFHVRDDPCEQFNLAEQYPKVLDLLMAELQHINSTAVPPSNQPDDPRASPM 517
Query: 556 RWNNIWVPWYD 566
+N W + D
Sbjct: 518 YFNYTWTNFGD 528
>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
Length = 545
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 258/551 (46%), Positives = 328/551 (59%), Gaps = 56/551 (10%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P KPHII ILADDLG+NDV FHGS+QIPTPNIDALAY+G+ILN++YV +CTPSRSALMT
Sbjct: 23 PPKPHIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMT 82
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
GKYPIH GMQH V+ EP GLPL EK+LPQYL + GY +H GKWHLG ++ YTP FR
Sbjct: 83 GKYPIHTGMQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFR 142
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
GF SH W GLDMR+ V G Y+TD+ T+ +++VIA
Sbjct: 143 GFSSHKNLW-------------------GLDMRNGTDVAYNLHGQYTTDVITKHSVSVIA 183
Query: 207 EHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
H+ +K P+FLY+AH A H+GN Y P PD++V K I +RR YA MV+++DESVG
Sbjct: 184 NHDAAKGPLFLYVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVG 243
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
++ LRK MLENSI++F +DNG P+ G + N SN+PLRG+K+T W+GG+R +WS
Sbjct: 244 VIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLRGVKNTLWEGGVRAAGLVWS 303
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAG-----IEIN-DTSLDGVNQWDVLTKGAKTKRS 379
P L++ Q+V E HISDWLPTL AAG N SLDG++ W L + R
Sbjct: 304 PLLEKRQRVVDETMHISDWLPTLIEAAGGIPALANANLSQSLDGMSIWQTLVHAYPSPRK 363
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG-DTDNEIDKYSPKEVLYSK 438
+LHNID++ AAL V D K + GT+ NG+ D WYG + + Y VL S
Sbjct: 364 TVLHNIDDIWGS----AALTVGDWKMIQGTNYNGRWDGWYGPGGERDPHLYDWNLVLKSS 419
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC-NYDNKGAHC 497
+G LK + +D R LRE A V C +G C
Sbjct: 420 SGKAMTGLKM------------LPSRADQQR----------LRETATVTCQGQSTQGTSC 457
Query: 498 NSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
+T PCLF++ DDPCEQ NLAE +L L +L ST VPP N+P D RA P
Sbjct: 458 VATAFSAPCLFHVRDDPCEQFNLAEQYPKVLDLLVAELQHINSTAVPPSNQPDDPRASPM 517
Query: 556 RWNNIWVPWYD 566
+N W + D
Sbjct: 518 YFNYTWTNFGD 528
>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
Length = 531
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 256/557 (45%), Positives = 331/557 (59%), Gaps = 56/557 (10%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHII ILADDLG+NDV F GS+QIPTPNIDALAY+GLILN++YV +CTPSRSALMTGK
Sbjct: 20 QPHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGK 79
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPIH GMQH V+ EP GLPL K+LPQYL + GY +H GKWHLG ++ VYTP +RGF
Sbjct: 80 YPIHTGMQHTVLYAAEPRGLPLDLKILPQYLNDLGYTSHIAGKWHLGHWQRVYTPLYRGF 139
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
SH+GYW GLDMR+ +V G YSTD+ T+ ++NVI++H
Sbjct: 140 SSHHGYW-------------------GLDMRNGTEVAYDLHGQYSTDVITQHSLNVISKH 180
Query: 209 NKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
N +K P+FLY+AH AVH+GN Y P D+ V + I +RR YA +V+ +DESVG +
Sbjct: 181 NATKGPLFLYVAHAAVHSGNPYNPLPVKDDAVRRLDTIQHYKRRKYAALVTAMDESVGKI 240
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ LRK MLE SI++F DNG P+ G +SN SN+PLRG+K+T W+GG+RG A +WSP
Sbjct: 241 VEQLRKSRMLETSIIVFSTDNGGPAEGFNSNFASNYPLRGVKNTLWEGGVRGAALLWSPQ 300
Query: 328 LKQTQKVSSELFHISDWLPTLC-----AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
L + + + + HI DWLPTL AA ++ LDG++ W L K + R +L
Sbjct: 301 LTKRPRTAEQTMHIVDWLPTLVEAAGGEAALAKLGTAKLDGISLWQALLKNETSPRKSVL 360
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEIDKYSPKEVLYSKAGI 441
HNID++ AAL V D K V GT+ NG D WYG Y + V S G
Sbjct: 361 HNIDDIWGS----AALSVGDWKLVKGTNYNGSWDGWYGPAGMRNPHDYDWQSVAKSATGQ 416
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN----YDNKGAHC 497
LK PK ++D +R LRE A V C D C
Sbjct: 417 AMQQLKML----------PK--TADMIR----------LREAANVICKDANPKDAPHIAC 454
Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARW 557
N +PCLFN+ +DPCEQ N A+ +L+ L +L + +T VPP NKP D RADP W
Sbjct: 455 NPLSEPCLFNVREDPCEQRNQAKQHPKVLELLLKELQLLNATAVPPANKPDDPRADPRLW 514
Query: 558 NNIWVPWYDELDKQKAI 574
N+ W + D ++ I
Sbjct: 515 NHTWTNFGDYIETNDCI 531
>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 526
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 247/552 (44%), Positives = 334/552 (60%), Gaps = 56/552 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+P+II ILADDLGW+DVSFHGS QIPTPNID LAY+G++L+ +YVQ +CTP+R AL+TG
Sbjct: 25 KQPNIIFILADDLGWDDVSFHGSHQIPTPNIDELAYSGVLLHNYYVQPVCTPTRGALLTG 84
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+YP+H+G+QH VI EP GLPL E LP YLK+ GY+TH +GKWHLGFF + YTPT+RG
Sbjct: 85 RYPMHLGLQHFVITPNEPVGLPLNETTLPTYLKKLGYSTHMVGKWHLGFFAKEYTPTYRG 144
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDSHYGY+ G QDYY H+ A + G D+RHNM + T G Y +L+T A +I +
Sbjct: 145 FDSHYGYFLGHQDYYTHN--ALWNNQWGFDLRHNMDLQRSTFGEYGPELFTTQAEKLIYD 202
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEP---FQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
H+ KP+FLY AH AVH GN+ +AP + ++F I+D +RR YAGMVS LD+SV
Sbjct: 203 HDHKKPLFLYFAHQAVHYGNSGPNGTLLEAPYKYTSRFPHIADHQRRIYAGMVSALDDSV 262
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
GN+ AL+K G+ NSI++F DNG P H + N PLRG K+T W+GG+RG A I
Sbjct: 263 GNITRALKKSGLYNNSIIVFSTDNGGP----HECEAFNTPLRGTKNTLWEGGIRGAAFIH 318
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILH 383
S L++ ++V + H+ DWLPT+ AG ++D +LDG N WD L+ G + R+EILH
Sbjct: 319 SVLLEKPKRVCEGMMHVVDWLPTMYHVAGGNLSDLHNLDGFNMWDTLSSGVSSPRTEILH 378
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
NID P AALRV +LK + G +N+ D + P ++ + K
Sbjct: 379 NID----PITQTAALRVGNLKVIVG--------------ENKSDWFVPPQITFGK----- 415
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
N ++ + +D + + + + Y N C+ KP
Sbjct: 416 -------------------NGTNLTKQFGSDHRGIIVHCSSVPPVAYKN----CDPKQKP 452
Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
CL+NI +DPCE NLA+ D+L+ L KL Y ST V P P D R++P+ N+ W P
Sbjct: 453 CLYNIKNDPCEYYNLADEYPDVLQSLLYKLNDYNSTSVTPWVIPPDPRSNPSLHNDTWSP 512
Query: 564 WYDELDKQKAIE 575
W D D E
Sbjct: 513 WIDLADIAPDFE 524
>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
Length = 544
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 253/581 (43%), Positives = 342/581 (58%), Gaps = 58/581 (9%)
Query: 7 YFFALTCTLLFN-DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+F L C LL + P +P+II ILADDLG+NDV FHGS++IPTPNIDALAY+G
Sbjct: 2 WFLWLICLLLPTLISGEEEIRPAQPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSG 61
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ILN++YV +CTPSRSALMTGKYPIH GMQH V+ EP GLPL EK+LPQYL E GY
Sbjct: 62 IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYT 121
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+H GKWHLG ++ YTP RGF SH+ GLDMR+ QV
Sbjct: 122 SHIAGKWHLGHWKLKYTPLHRGFSSHW----------------------GLDMRNGTQVA 159
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
G Y+TD+ T+ ++ VIA HN +K P+FLY+AH A H+ N Y P PD +V K
Sbjct: 160 YDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVLKMSH 219
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I +RR +A MVS++D+SVG ++ LRK MLENSI++F +DNG P+ G + N SN+P
Sbjct: 220 IPHYKRRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 279
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG----IEINDTS 360
L+G+K+T W+GG+R +WSP LK++Q+VS++ HI DWLPTL AAG + +
Sbjct: 280 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGPPAVANLNQQ 339
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
+DG + W L + + R +LHNID++ AAL V D K V GT+ G D WYG
Sbjct: 340 IDGQSIWRALVQDKASPRLNVLHNIDDIWGS----AALSVGDWKLVKGTNYRGSWDGWYG 395
Query: 421 DT-DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
+ + +Y + V S+ G DAL+ + +
Sbjct: 396 PAGERDPRQYDWQLVAKSRVGKAL----------------------DALKMLPSRADQQR 433
Query: 480 LREFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
+R A V C ++G C +T PCLF+I DDPCEQ NLA+ +++ L +L +
Sbjct: 434 IRAAATVTCPAQSSQGTSCLATAFSAPCLFHIRDDPCEQYNLAKQYPEVVNALMTELERF 493
Query: 537 KSTMVPPGNKPFDKRADPARWNNIWVPWYDELDKQKAIETM 577
+T VPP NKP D RADP WN W + D + + ++T+
Sbjct: 494 NATAVPPSNKPADPRADPRFWNYTWTNFGDYQNTRNDLDTL 534
>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
Length = 542
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 253/564 (44%), Positives = 334/564 (59%), Gaps = 57/564 (10%)
Query: 13 CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
C + F +T +P+II ILADDLG+NDV F GS+QIPTPNIDALAY+GLILN++Y
Sbjct: 18 CPISFALVACQST---QPNIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYY 74
Query: 73 VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
V +CTPSRSALMTGKYPIH GMQH V+ EP GLPL K+LPQYL E GY +H GKW
Sbjct: 75 VNPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLDLKILPQYLNELGYTSHIAGKW 134
Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
HLG ++ VYTP +RGF SH+GYW GLDMR+ ++ G Y
Sbjct: 135 HLGHWKRVYTPLYRGFSSHHGYW-------------------GLDMRNGTEIAYDLHGQY 175
Query: 193 STDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
+TD+ T+ ++ VIA H +K P+FLY+AH AVH+GN Y P A D+ V + I +RR
Sbjct: 176 TTDVITQHSLQVIANHKPAKGPLFLYVAHAAVHSGNPYNPLPASDDAVRRLDKIQHYKRR 235
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
YA +V+ +DESVG ++ LRK MLENSI++F DNG P+ G +SN SN+PLRG+K+T
Sbjct: 236 KYAALVTAMDESVGKIVEQLRKSRMLENSIIVFSTDNGGPAEGFNSNFASNYPLRGVKNT 295
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-----NDTSLDGVNQ 366
W+GG+RG A +WSP L + + + +L H+ DWLPTL AAG ++ LDGV+
Sbjct: 296 LWEGGVRGAALLWSPQLTKRPRTAEQLMHMVDWLPTLVEAAGGKVALSKLEAAKLDGVSL 355
Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
W L + + R+ ILHNID++ AAL V K V G+ NG D WYG
Sbjct: 356 WQALVRNEASPRNSILHNIDDIWGS----AALSVGVWKLVQGSHYNGSWDGWYG------ 405
Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
AG+ N Q+ K AA + L+ + I+ LR+ A V
Sbjct: 406 -----------PAGLR-NPHDYNWQLVTKCAA---GQAMQQLKMLPQQADIIRLRKAATV 450
Query: 487 RCN----YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
C + CN + CLFN+ DDPCEQ NLA+ +L L ++L ++ +T +P
Sbjct: 451 SCKEAKVKPSPDIACNPLKEICLFNVLDDPCEQRNLAKQHPKVLALLLNELQLFNATAIP 510
Query: 543 PGNKPFDKRADPARWNNIWVPWYD 566
P NKP D R DP WN+ W + D
Sbjct: 511 PANKPDDPRGDPRLWNHTWTNFGD 534
>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
Length = 541
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 258/584 (44%), Positives = 343/584 (58%), Gaps = 60/584 (10%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ F F + ++PK P+II ILADDLG+NDV FHGS+QIPTPNIDALAY+G+
Sbjct: 5 WLFIFLILPQFLKGEVGNSSPK-PNIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGI 63
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
ILN++YV +CTPSRSALMTGKYPIH GMQH V+ EP GL L EK+LPQYL + GY +
Sbjct: 64 ILNRYYVTPICTPSRSALMTGKYPIHTGMQHAVLYAAEPRGLSLKEKILPQYLNDLGYTS 123
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
H GKWHLG ++ YTP FRGF SH+ GLDMR+ +V
Sbjct: 124 HIAGKWHLGHWKLKYTPLFRGFSSHW----------------------GLDMRNGTEVAY 161
Query: 187 KTIGIYSTDLYTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
G Y+TD+ T+ A+ VIA HN S P+FLY+AH A H+ N Y P PD EV K I
Sbjct: 162 DLHGRYTTDVITDHAVKVIANHNTTSGPLFLYVAHAACHSSNPYNPLPVPDNEVMKLGHI 221
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
+RR +A MV+++DESVG ++ LRK MLENSI++F +DNG P+ G + N SN+PL
Sbjct: 222 PHYKRRKFAAMVTKMDESVGLIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPL 281
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL---- 361
+G+K+T W+GG+R IWSP LK Q+VS++ HISDWLPTL AAG ++L
Sbjct: 282 KGVKNTLWEGGVRAAGLIWSPLLKSRQRVSTQTIHISDWLPTLLEAAGGAAALSNLSKGI 341
Query: 362 DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD 421
DG + W L + R +++HNID++ A L V D K V GT+ G+ D WYG
Sbjct: 342 DGQSIWQALVHDEPSPRLKVIHNIDDIWGS----AGLSVGDWKLVKGTNYYGKWDGWYGP 397
Query: 422 T-DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
+ + Y +++ S++G N LK + + + L
Sbjct: 398 AGERDPHLYDWQQIPKSRSGKALNGLK----------------------MLPSQAEQQRL 435
Query: 481 REFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
R A V+C +G+ C +T PCLFNI DDPCEQ NLAE +LK L ++L
Sbjct: 436 RAAATVKCLGQSPQGSSCKATAFSAPCLFNIHDDPCEQFNLAEQYPKVLKALMEELKYQN 495
Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW--YDELDKQKAIETMRL 579
ST VPP NKP D+RA P WN W + YD L+ + + +++
Sbjct: 496 STAVPPANKPGDRRASPKYWNYTWTNFGDYDLLNNKPVLGDLKV 539
>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
Length = 556
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 254/548 (46%), Positives = 328/548 (59%), Gaps = 55/548 (10%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPHII ILADDLG+NDV FHGS+QIPTPNIDALAY+GLILN++YV +CTPSRS+LMTGK
Sbjct: 25 KPHIIYILADDLGFNDVGFHGSAQIPTPNIDALAYSGLILNRYYVAPICTPSRSSLMTGK 84
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH GMQH V+ EP GLPL EKLLPQYL + GY +H GKWHLG ++ YTP RGF
Sbjct: 85 YAIHTGMQHTVLYGAEPRGLPLEEKLLPQYLNDLGYTSHIAGKWHLGHWKMPYTPLRRGF 144
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
+SH+G+W GLDMR+ +V + G Y+TD+ T+ +I+VIA H
Sbjct: 145 NSHHGFW-------------------GLDMRNGSRVAYELHGQYTTDVITQHSIDVIANH 185
Query: 209 NKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
SK P+FLY+AH A H+GN Y P D+EV K I +RR YA MVS+LDESVG +
Sbjct: 186 PVSKGPLFLYVAHAAAHSGNPYNPLPVADDEVIKLGRIEHYKRRRYAAMVSKLDESVGII 245
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ LR+ MLENSI++F DNG + G + N SN+PLRG+K+T W+GG+RGV +WS
Sbjct: 246 VDQLRRSNMLENSIIVFATDNGGAAEGFNLNFASNYPLRGVKNTLWEGGVRGVGLLWSQR 305
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAG----IEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
+ +V+ + HI+DWLPT AG + TSLDG + W L + + R +LH
Sbjct: 306 YLKRPRVAEQTMHITDWLPTFVEVAGGKEALANFTTSLDGQSIWQALVQDEASPRKSVLH 365
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGIT 442
NID++ N +AL V D K + GT+ G D WYG + + Y+ ++V S AG
Sbjct: 366 NIDDIWNS----SALTVGDWKLIKGTNYKGAWDGWYGPAGERDPQLYNWQQVAKSLAG-- 419
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC----NYDNKGAHCN 498
K LQ+K + +D R LR A V C + KG+ CN
Sbjct: 420 ----KAMLQLKV------LPSRADQQR----------LRSAATVNCPNQQSNTQKGSVCN 459
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
PCLFN+ DDPCEQ NLA +L L +L + ST +PP N P D R +P WN
Sbjct: 460 PLQTPCLFNLRDDPCEQYNLATQYPLVLNSLLSELKHFNSTAIPPSNLPDDPRGNPKFWN 519
Query: 559 NIWVPWYD 566
W + D
Sbjct: 520 YTWTNFGD 527
>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
Length = 565
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 241/545 (44%), Positives = 333/545 (61%), Gaps = 30/545 (5%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A ++P+IIII+ADD+G++DVSF G + TPNIDALAY+G +L++ Y A+CTPSR A
Sbjct: 21 SAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGA 80
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L++G+YPIH G QH VI EPW L L L+P+ KEAGY+T+ +GKWHLGF R YTP
Sbjct: 81 LLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTP 140
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
T RGFD H+GYW DY+ K Y G D R NM+++ + G+Y TDL T A
Sbjct: 141 TRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAE 200
Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H K +P+FL L+HLA H N +P QAP+EE+ KF I DP RR YA M+S+LD
Sbjct: 201 RLIKGHAGKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLD 260
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+SVG ++++L LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R
Sbjct: 261 QSVGRIMSSLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGGVRVAG 320
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR-- 378
AIWS L+ + S+ +++DWLPTLC AAGI+++ T LDG++ W L+ A
Sbjct: 321 AIWSSKLQARGSIFSQPLYVADWLPTLCRAAGIKLDSTLKLDGIDLWAELSGSADAPHVP 380
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
EILH +D+V AAL++ KYV GT G+ D T E+D P+E Y+
Sbjct: 381 REILHILDDVWR----LAALQLGQWKYVNGTTAAGRYDSLL--TYRELDDLDPRESRYA- 433
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
+ +++A +A S LR LT ++I R A VRC + CN
Sbjct: 434 -------------VTVRNSATSRALSRYDLRR-LTQQRISITRRLAAVRCGDLQRS--CN 477
Query: 499 STVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ CL++I+ DPCEQNNL +E +D+L L ++ +++ PGN+ ADP
Sbjct: 478 PLLEECLYDISTDPCEQNNLVYSEKHSDVLAALRRRVQELRASASRPGNRASMPEADPTL 537
Query: 557 WNNIW 561
W
Sbjct: 538 HTCAW 542
>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 245/554 (44%), Positives = 326/554 (58%), Gaps = 57/554 (10%)
Query: 16 LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA 75
L + F+ TA KKPHI+ I+ADDLGW+DVSFHGS QIPTPNID LA G+ILN +YV
Sbjct: 10 LMSLPFVALTANKKPHIVFIVADDLGWDDVSFHGSGQIPTPNIDGLAKTGVILNNYYVSP 69
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
+CTP+RSA+MTGKYPIH GMQH VIL +P+GL L E L+PQYLK GYATH +GKWHLG
Sbjct: 70 ICTPTRSAIMTGKYPIHTGMQHSVILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLG 129
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
FF+ YTP RGFDS++GYW G DY+DHS + GLD+ + Q G YS+D
Sbjct: 130 FFKYEYTPIQRGFDSYFGYWCGKGDYWDHSNNEKYG--WGLDLHDSEQDVWTEWGHYSSD 187
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
L+ E A+NVI+ HN S P+FLYL AVH+ N +P QAP + + KF +I D RR +A
Sbjct: 188 LFAEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDLIDKFKNIKDERRRIFAA 247
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MVS +D ++ V+ +L+ M NSI++F DNG P+ G SN SN PLRG+K T W+G
Sbjct: 248 MVSSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANGFDSNMASNFPLRGVKRTLWEG 307
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGA 374
G+RG A I SP + + +V +EL H+SDWLPTL AG +I+D +LDG + WD ++ A
Sbjct: 308 GIRGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVAGGDIHDLQNLDGFDLWDSISTDA 367
Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
+ R E++HNID P + AA R + K V + WY
Sbjct: 368 MSPREEMVHNID----PVNWEAAYRFREWKIV--VNQTKYMSGWYP-------------- 407
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNK- 493
++ ++ H A LR+ A V+C +
Sbjct: 408 ------------LPNIEEREPHPA--------------------TLRD-AVVKCGPPPEI 434
Query: 494 GAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRAD 553
+C ++ PCLFNI +DPCE NLA+ + ++L + L YK MVP N P D A+
Sbjct: 435 PVNCTASDGPCLFNIKNDPCEYVNLAKKELEILNNMLIWLEGYKKGMVPIRNTPLDPSAN 494
Query: 554 PARWNNIWVPWYDE 567
PA + +W W D+
Sbjct: 495 PANYGGVWTAWKDQ 508
>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
Length = 554
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 240/546 (43%), Positives = 333/546 (60%), Gaps = 30/546 (5%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+ A ++P+IIII+ADD+G++DVSF G + TPNIDALAY+G +L++ Y A+CTPSR
Sbjct: 20 QSAAARRPNIIIIMADDMGFDDVSFRGGREFVTPNIDALAYHGRLLDRLYAPAMCTPSRG 79
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
AL++G+YPIH G QH VI EPW L L L+P+ KEAGY+T+ +GKWHLGF R YT
Sbjct: 80 ALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYT 139
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
PT RGFD H+GYW DY+ K Y G D R NM ++ + G+Y TDL T A
Sbjct: 140 PTRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTTEA 199
Query: 202 INVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
+I +H +K +P+FL L+HLA H N +P QAP+EE+ KF I DP RR YA M+S+L
Sbjct: 200 ERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKL 259
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D+SVG +I+ L L+NSIV+F +DNGAPS G+ SN GSN PLRG K+TPW+GG+R
Sbjct: 260 DQSVGRIISTLSSTDQLDNSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVA 319
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR- 378
AIWS L+ + S+ +++DWLPTL AAGIE++ + LDG++ W L+ A
Sbjct: 320 GAIWSSKLQARGSIFSQPLYVADWLPTLSRAAGIELDSSLKLDGIDLWPELSGSADAPHV 379
Query: 379 -SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
EILH +D+V +AL++ KYV GT +G+ D T E+D P+E Y+
Sbjct: 380 PREILHILDDV----WRLSALQMGQWKYVNGTTADGRYDSVL--TYRELDDLDPRESRYA 433
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
+ +++A +A S LR LT ++I R A VRC + C
Sbjct: 434 --------------VTVRNSATSRALSRYDLRR-LTQQRISLTRRLAAVRCGDLQRS--C 476
Query: 498 NSTVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
N ++ CL++I+ DPCEQNNL +E +D+L L ++ +++ PGN+ ADP
Sbjct: 477 NPLLEECLYDISTDPCEQNNLVYSERHSDVLTALRRRVRELRASASRPGNRASMPEADPT 536
Query: 556 RWNNIW 561
W
Sbjct: 537 LHTCAW 542
>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
Length = 559
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 249/557 (44%), Positives = 330/557 (59%), Gaps = 56/557 (10%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A KP+IIIIL DD+G+NDVSFHGS+QI TPNIDALAYNG++LN+HYV LCTPSR+ L+
Sbjct: 24 ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TGKYPIH GMQH VI+ EPWGLP E+L+P+ ++AGYATH +GKWHLGF+R+ TPT
Sbjct: 84 TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFD H+GY+ G DYYD + + Y GLD R +++ + G Y+T+ +T A V
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKV 203
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I H+KS+P+F+ L+HLAVH GN P QAP+EEVAKF I DP+RRTYAGM+S LD+SV
Sbjct: 204 IERHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSV 263
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G + AL +GML NSIVL +DNGAP+ GIHSN GSN+P RG
Sbjct: 264 GQTMRALADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRG----------------- 306
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
Q VS++ H DWLPTL AAAG+ + +D LDG+N W L+ A+ + +LH
Sbjct: 307 -----QRGYVSNQAIHAIDWLPTLAAAAGVSLPSDLRLDGLNLWPSLSASAQPQPRNLLH 361
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPKEVLYSKAGIT 442
+D+V Y++ D LKYV G+ G+ D W G+ D E D S +A
Sbjct: 362 VLDDVFG----YSSYTQDTLKYVNGSSFEGRYDHWLGELEDGEEDPNSKDYGQLVRA--- 414
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-----C 497
+++Q A TD++I ++R+ A RC + C
Sbjct: 415 -----SEVQAVLGSVAP-------------TDDRIRQMRDEATQRCPIQERDPEDPVYLC 456
Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKP-FDKRADPAR 556
CLF++ +DPCE+ NLA L+ L ++ + T +P P D RADPA
Sbjct: 457 EPLKAACLFDLANDPCERYNLANLYPLQLQLLAQEVEELRLTALPSARVPRADVRADPAL 516
Query: 557 WNNIWVPWYDELDKQKA 573
+ W W + L A
Sbjct: 517 HDGHWEWWNNTLPGSSA 533
>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 562
Score = 451 bits (1160), Expect = e-124, Method: Compositional matrix adjust.
Identities = 239/580 (41%), Positives = 357/580 (61%), Gaps = 55/580 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNT--------TAPKKPHIIIILADDLGWNDVSFHGSSQIPTP 56
RK FF L +++F + K+P+II+I+ADDLGWNDVSFHG++QIPTP
Sbjct: 6 RKTFFLLATSVIFVTGCIYAWIQDSKVWERVKRPNIIVIMADDLGWNDVSFHGANQIPTP 65
Query: 57 NIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLP 116
NIDALAYNG+IL +HYV +CTPSR+A +TG+YPI GMQ + GEP +PL LLP
Sbjct: 66 NIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPLKAGEPRAIPLNNTLLP 125
Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL 176
+YL++ GYATH +GKWH+G++ + +TPT RGFD+ +GY+ G Y++H+ K + + G
Sbjct: 126 EYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYSGYISYFNHTIKQ--DDHIGY 183
Query: 177 DMRHNMQVDNKTIGI-----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
D+ ++ ++K + I Y+TDL TE A N+I H++ KP++L L HLA H+ + E
Sbjct: 184 DLHYD---NSKNLSIDYNFEYTTDLITERAENIIKNHDRRKPLYLQLCHLAAHSSDAKEV 240
Query: 232 FQAPDEEV--AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
+ DE+ A I D RR YAG+V+ +DESVG VI AL + MLENSI++F++DNG
Sbjct: 241 MEVRDEQETNATLKYIEDYNRRKYAGVVTAMDESVGRVIKALGQSSMLENSIIVFISDNG 300
Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
A + G+ N GSN+PLRG+K T ++GG+RGVA ++S ++ + ++S+EL HI+DWLPT
Sbjct: 301 AQTEGLLENYGSNYPLRGLKFTLFEGGIRGVACVYSRLIQNSSRISNELMHITDWLPTFY 360
Query: 350 AAAG--IEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA 407
+AAG +E + ++DGV+QWD + G ++KR +L NID V++ ++ + K +
Sbjct: 361 SAAGGNLENLEENMDGVDQWDTIVSGKESKRESVLLNIDEVED----VSSALIGKYKLII 416
Query: 408 GTDNNGQSDEWYGDTDNEID--KYSPKEVLYSKAGIT-FNALKTKLQIKQKHAADPKANS 464
N Q +++YGD + +Y+ VL+S AG T F+A L IK
Sbjct: 417 -NGKNIQYNDYYGDNGTSVSYPEYNVSSVLHSIAGSTIFHATNFTLNIKD---------- 465
Query: 465 SDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTD 524
+++LR ARV CN + C V CLF+I +DPCE +L+ +
Sbjct: 466 ------------VIKLRNKARVVCNNFTSYSKC---VDKCLFDIYNDPCETTDLSNEHPE 510
Query: 525 LLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
++K+L +A Y+ ++ N P D R+ P +N W+PW
Sbjct: 511 IVKKLNYFIAEYEKVLLHQTNAPVDPRSYPENFNGTWMPW 550
>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
Length = 528
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 251/560 (44%), Positives = 327/560 (58%), Gaps = 59/560 (10%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A +PHII ILADDLG+NDV F GS+QIPTPNIDALAY+GLILN++YV +CTPSRSALM
Sbjct: 17 AIAQPHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALM 76
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
T KYPIH GMQH V+ EP GLPL K+LPQYL + GY +H GKWHLG ++ VYTP +
Sbjct: 77 TAKYPIHTGMQHTVLYAAEPRGLPLNLKILPQYLNDLGYTSHIAGKWHLGHWKRVYTPLY 136
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
RGF SH+ GLDMR+ ++ G Y+TD+ T+ ++NVI
Sbjct: 137 RGFSSHW----------------------GLDMRNGTELAYDLHGQYTTDVITQHSLNVI 174
Query: 206 AEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
A+HN +K P+FLY+AH AVH+GN Y P A D+ V + I D +RR YA +V+ +D+SV
Sbjct: 175 AKHNSTKGPLFLYVAHAAVHSGNPYNPLPAKDDIVRRLGTIQDYKRRKYAALVTAMDDSV 234
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G ++ LRK ML+NSI++F DNG P+ G +SN SN+PLRG+K+T W+GG+RG +W
Sbjct: 235 GRIVEQLRKSHMLDNSIIIFSTDNGGPAEGFNSNFASNYPLRGVKNTLWEGGVRGAGLVW 294
Query: 325 SPWLKQTQKVSSELFHISDWLPT-----LCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
S L + +V+ + HI+DWLPT AA ++N+ LDG++ W L K + R
Sbjct: 295 SAQLSKRPRVADQTMHIADWLPTLLEAAGGEAALAQLNNAQLDGMSVWQALLKNEVSPRK 354
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE-IDKYSPKEVLYSK 438
+LHNID++ AAL V D K V GT+ NG D WYG + Y V S
Sbjct: 355 SVLHNIDDIWGS----AALSVGDWKLVKGTNYNGSWDGWYGPAGTRNLRDYDWYLVAKSL 410
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN----YDNKG 494
AG LK PK ++D +R RE A V C D
Sbjct: 411 AGQALQQLKVL----------PK--TADMMRA----------REAANVICKDARLKDAPN 448
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
CN PCLFNI +DPCEQ N A+ +L+ + +L + ST VPP NKP D P
Sbjct: 449 ISCNPLKAPCLFNIREDPCEQRNQADVYPKVLELMLKELQLINSTAVPPANKPDDPLGYP 508
Query: 555 ARWNNIWVPWYDELDKQKAI 574
WN W + D +D I
Sbjct: 509 RFWNYTWTNFGDYVDTNDCI 528
>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
Length = 564
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 239/545 (43%), Positives = 331/545 (60%), Gaps = 30/545 (5%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A ++P+IIII+ADD+G++DVSF G + TPNIDALAY+G +L++ Y A+CTPSR A
Sbjct: 21 SAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGA 80
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L++G+YPIH G QH VI EPW L L L+P+ KEAGY+T+ +GKWHLGF R YTP
Sbjct: 81 LLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTP 140
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
T RGFD H+GYW DY+ K Y G D R NM+++ + G+Y TDL T A
Sbjct: 141 TRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAE 200
Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I +H +K +P+FL L+HLA H N +P QAP+EE+ KF I DP RR YA M+S+LD
Sbjct: 201 RLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLD 260
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+SVG +I AL LENSIV+F +DNGAPS G+ SN GSN PLRG K+TPW+GG+R
Sbjct: 261 QSVGRIITALSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVAG 320
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR-- 378
AIWS L+ + + +++DWLPTL AA IE++ + LDG++ W L+ A
Sbjct: 321 AIWSSGLQARGSIFRQPLYVADWLPTLSRAADIELDSSLKLDGIDLWPELSGSADAPHVP 380
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
EILH +D+V +AL++ KYV GT +G+ D T E+D P++ Y+
Sbjct: 381 REILHILDDV----WRLSALQMGQWKYVNGTTASGRYDSVL--TYRELDDLDPRDSRYA- 433
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
+ +++A +A S LR LT ++I R A VRC + CN
Sbjct: 434 -------------VTVRNSATSRALSRYDLRR-LTQQRISLTRRLAAVRCGDLQRS--CN 477
Query: 499 STVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ CL++I DPCEQNNL +E +D+L L ++ +++ PGN+ ADP
Sbjct: 478 PLLEECLYDILSDPCEQNNLVYSERHSDVLTALRRRVQELRASASRPGNRASMPEADPTL 537
Query: 557 WNNIW 561
W
Sbjct: 538 HTCAW 542
>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
Length = 565
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 238/545 (43%), Positives = 332/545 (60%), Gaps = 30/545 (5%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A ++P+IIII+ADD+G++DVSF G + TPNIDALAY+G +L++ Y A+CTPSR A
Sbjct: 21 SAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGA 80
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L++G+YPIH G QH VI EPW L L L+P+ K AGY+T+ +GKWHLGF R YTP
Sbjct: 81 LLSGRYPIHTGTQHFVISNKEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTP 140
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
T RGFD H+GYW DY+ K Y G D R NM+++ + G Y TDL T A
Sbjct: 141 TRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGAYVTDLLTAEAE 200
Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I +H +K +P+FL L+HLA H N +P QAP+EE+ KF I DP RR YA ++S+LD
Sbjct: 201 RLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIISKLD 260
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+SVG +++ L LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R
Sbjct: 261 QSVGRIMSTLAATDQLENSIVIFYSDNGAPSLGMFANTGSNFPLRGQKNTPWEGGVRVAG 320
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR-- 378
AIWS L+ + S+ +++DWLPTL AAGIE++ + LDG++ W L+ A
Sbjct: 321 AIWSSKLQARGSIFSQPLYVADWLPTLSHAAGIELDSSLKLDGIDLWPELSGSADAPHVP 380
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
EILH +D++ AAL++ KYV GT G+ D T E+D P+E Y+
Sbjct: 381 REILHILDDIWR----VAALQLGQWKYVNGTTAAGRYDSLL--TYRELDDLDPRESRYA- 433
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
+ +++A +A S LR LT ++I R A VRC + CN
Sbjct: 434 -------------VTVRNSATSRALSRYDLRR-LTQQRISITRRLAAVRCGDLQRS--CN 477
Query: 499 STVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ CL++I+ DPCEQNNL +E +D+L L ++ +++ PGN+ ADPA
Sbjct: 478 PLLEECLYDISTDPCEQNNLVYSERHSDVLAALRRRVRELRASASRPGNRASMPEADPAL 537
Query: 557 WNNIW 561
W
Sbjct: 538 HTCAW 542
>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
Length = 619
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 243/549 (44%), Positives = 339/549 (61%), Gaps = 34/549 (6%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
TA K P+III++ADD+G++D+S G+ + TPNIDALAY+G +L++ Y ++CTPSR AL
Sbjct: 28 TANKLPNIIIVMADDMGFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGAL 87
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
++GKYPIH G QH VI EPWGL L L+P+ +EAGY+T+ +GKWHLGF R YTPT
Sbjct: 88 LSGKYPIHTGTQHYVIGNEEPWGLALNTTLMPEIFREAGYSTNLVGKWHLGFSRPEYTPT 147
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD+HYGYW DYY + Y G D R NMQV+ G+Y TDL T A
Sbjct: 148 HRGFDNHYGYWGAYIDYYQRRSQMPLGNYSVGYDFRRNMQVECTDRGVYVTDLLTNEAER 207
Query: 204 VIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+I E K +P+FL L+HLA H GNT +P QAP+EE+ KF I DP RR YA MVS+LDE
Sbjct: 208 IIREQPAKEQPLFLILSHLAPHTGNTNKPLQAPEEELRKFTHIKDPNRRLYAAMVSKLDE 267
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SVG VI AL + LENSIV+F +DNGAPS G+ SN GSN PL+G K++PW+GG+R A
Sbjct: 268 SVGRVITALARTDQLENSIVIFYSDNGAPSVGMFSNTGSNWPLKGQKNSPWEGGLRVAGA 327
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLT--KGAKTKRS 379
IWSP LK + ++ ++ D+LPTL AAGIE++ + LDG++ W L+ K A
Sbjct: 328 IWSPLLKARGNLFTQPIYVGDFLPTLAHAAGIELDSSLHLDGIDLWPQLSGPKDAPHVPR 387
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
EILH +D+V Q + KYV GT GQ D+ T E+D P+ Y
Sbjct: 388 EILHQLDDVWRVQSLFQG----QYKYVNGTTLAGQFDQVL--TYRELDDLDPRASRYV-- 439
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
++ +++ +A + L+ I T +I + A V C+ +G CN+
Sbjct: 440 ------------VEVRNSPASRALARYDLQRI-TQARINTISRSAAVICSDMQRG--CNA 484
Query: 500 TVKPCLFNITDDPCEQNNLAESQ--TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA-- 555
++ CL++++ DPCEQNNLA S+ +++L + ++ ++ + P N+P ++ ADP+
Sbjct: 485 LLEECLYDLSTDPCEQNNLAYSKQHSEILTSMRQRVQQLRAGAMAPNNRPSERDADPSLH 544
Query: 556 --RWNNIWV 562
W+N V
Sbjct: 545 ECTWDNFGV 553
>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 559
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 214/407 (52%), Positives = 293/407 (71%), Gaps = 10/407 (2%)
Query: 16 LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA 75
+ ++ L T++ +PHII+I+ADDLGW+DVSFHGS QIPTPNID LAY+G++L+ +YVQ
Sbjct: 16 ILSECCLATSS--QPHIIVIIADDLGWDDVSFHGSDQIPTPNIDELAYSGVLLHNYYVQP 73
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
+CTP+R+ALMTG++PIH+G+Q GVI+ P+GLPL E ++PQYLK GY TH +GKWHLG
Sbjct: 74 ICTPTRAALMTGRHPIHLGLQDGVIVASHPYGLPLNETIMPQYLKPLGYDTHIVGKWHLG 133
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP-YQGLDMRHNMQVDNKTIGIYST 194
FF YTP +RGFD+H+GY+ G + YYDH+ + EP Y GLD R+N ++ G YST
Sbjct: 134 FFAWQYTPLYRGFDTHFGYYNGEEGYYDHTAE---EPKYIGLDFRNNTELFKSAYGEYST 190
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
+L+T A +I HNK+KP+FLYLAH AVH+GN+Y P +AP + ++F I D RRT+A
Sbjct: 191 ELFTSYAEKIIHNHNKNKPLFLYLAHQAVHSGNSYSPLEAPYKYTSRFPYIQDERRRTFA 250
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
GMVS LD+SVGN+ AL+ G+ EN I+LF DNG P+ G +N SN PLRG+K T W+
Sbjct: 251 GMVSALDDSVGNITRALKHSGLYENCIILFSTDNGGPAAGFDANYASNWPLRGIKHTLWE 310
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTK 372
GG+RG I SP L++ ++S+++ H+ DWLPT+ AG ++ T+LDG N WD L++
Sbjct: 311 GGVRGDGFIHSPLLEKPGRISTDMIHVCDWLPTIYHIAGGNVSSLPTNLDGFNVWDTLSR 370
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
G + R+EILHNID N AALRV + K + GT+ +G+ D WY
Sbjct: 371 GTPSPRTEILHNIDPHKNHNA--AALRVGEFKIILGTEEHGEWDGWY 415
>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
Length = 575
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 240/566 (42%), Positives = 337/566 (59%), Gaps = 43/566 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+IIIILADD+G++DVSF G + TPNIDALA++G IL++ Y A+CTPSR AL++G+
Sbjct: 36 RPNIIIILADDMGFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGR 95
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPIH G QH VI EPWGL L L+P+ ++AGY+T+ IGKWHLGF R YTPT RGF
Sbjct: 96 YPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGF 155
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
D HYGYW DYY K Y G D R NM+++ + G+Y TDL T A VI E
Sbjct: 156 DYHYGYWGAYIDYYQRRSKMPARNYSLGYDFRRNMELECRDRGVYVTDLLTNEAERVIRE 215
Query: 208 H-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
+ +P+FL L+HLA H N +P QAP+EE+ KF I DP RR YA MVS+LD+SVG
Sbjct: 216 REGQEEPLFLVLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGR 275
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+++AL G LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R AIWS
Sbjct: 276 IVSALNSTGQLENSIVIFYSDNGAPSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAIWSA 335
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR--SEILH 383
L+ + ++ +++DWLP+L AAGIE+ T LDG++ W L EILH
Sbjct: 336 QLQARGNIFTQPIYVADWLPSLAHAAGIELPHTLELDGIDLWPQLAGPVDAPHVPREILH 395
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
+D+ + + ++ KY+ GT + GQ D+ T E+D P+E Y
Sbjct: 396 ILDD----EWRVTSFQMGHWKYINGTTSAGQYDQVL--TYRELDDLDPRESRYV------ 443
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
+ +++ +A + +R +T ++++ LR A VRC +G CN V+
Sbjct: 444 --------VTVRNSPTSRALARHDIRR-MTQQRVMTLRRSAAVRCGDLQRG--CNPLVEE 492
Query: 504 CLFNITDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
C+++I DDPCE+NNL S +++L L ++ + + PGN+ +R+DP W
Sbjct: 493 CVYDIWDDPCEKNNLVYSTRHSEVLAALRQRVQLLREGASAPGNREGIRRSDPTLHKCTW 552
Query: 562 VPWYDELDKQKAIETMRLHKPLLSPT 587
+T + KP L+P
Sbjct: 553 -------------DTFEVEKPALAPV 565
>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
Length = 554
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 239/546 (43%), Positives = 331/546 (60%), Gaps = 30/546 (5%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+ A ++P+IIII+ADD+G++DVSF G + TPNIDALAY+G +L++ Y A+CTPSR
Sbjct: 20 QSAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRG 79
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
AL++G+YPIH G QH VI EPW L L L+P+ KEAGY+T+ +GKWHLGF R YT
Sbjct: 80 ALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYT 139
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
PT RGFD H+GYW DY+ K Y G D R NM ++ + G+Y TDL T A
Sbjct: 140 PTRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTTEA 199
Query: 202 INVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
+I +H +K +P+FL L+HLA H N +P QAP+EE+ KF I D RR YA M+S+L
Sbjct: 200 ERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDSNRRKYAAMISKL 259
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D+SVG +I+ L LENSIV+F +DNGAPS G+ SN GSN PLRG K+TPW+GG+R
Sbjct: 260 DQSVGRIISTLSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVA 319
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR- 378
AIW+ L+ + S+ +++DWLPTL AAGIE++ + LDG++ W L+ A
Sbjct: 320 GAIWNSKLQARGSIFSQPLYVADWLPTLSRAAGIELDSSLKLDGIDLWPELSGTADAPHV 379
Query: 379 -SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
EILH +D+V +AL++ KYV GT G+ D T E+D P+E Y+
Sbjct: 380 PREILHILDDV----WRLSALQMGQWKYVNGTTAAGRYDSVL--TYRELDDLDPRESRYA 433
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
+ +++A +A S LR LT ++I R A VRC + C
Sbjct: 434 --------------VTVRNSATSRALSRYDLRR-LTQQRISLTRRLAAVRCGDLQRS--C 476
Query: 498 NSTVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
N ++ CL++I+ DPCEQNNL +E +D+L L ++ +++ PGN+ ADP
Sbjct: 477 NPLLEECLYDISTDPCEQNNLVYSERHSDVLTALRRRVQELRASASRPGNRASMPEADPT 536
Query: 556 RWNNIW 561
W
Sbjct: 537 LHTCAW 542
>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
Length = 498
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 244/541 (45%), Positives = 322/541 (59%), Gaps = 49/541 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII+++ADDLGW+D+SFHGS QIPTPN+DALA +G+ILN +YV + TPSR++ MTGKY
Sbjct: 1 PHIIMVVADDLGWDDISFHGSPQIPTPNLDALANSGVILNNYYVSPMDTPSRASFMTGKY 60
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PIH+G+QH + +P+G+PLTEK LP++L+E GY THA+GKW LGFF + YTPT+RGFD
Sbjct: 61 PIHMGVQHDTLHNRQPFGVPLTEKFLPEFLREMGYQTHAVGKWQLGFFAKEYTPTYRGFD 120
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
S +G+W +DYY+H + G+D+R N+ V N G+Y T+L A VI H+
Sbjct: 121 SFFGFWTSHEDYYNHVAN---DGGYGIDLRRNLDVSNDHTGVYGTELLAREADEVIENHS 177
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
KP+FLYLAH AVH GN EP QAP V KF I+D RRT+AGMVS LDES+ ++
Sbjct: 178 GDKPLFLYLAHQAVHVGNMDEPLQAPKRHVDKFKYITDERRRTFAGMVSSLDESMRQLVT 237
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
+L++ G+ +NSI++F DNG + G+ + GSN PLRG K+T W+GG+RGVA + SP +K
Sbjct: 238 SLKRKGLYQNSIIIFTTDNGGAAGGLDMSAGSNFPLRGNKNTLWEGGVRGVAFVHSPLIK 297
Query: 330 QTQKVSSELFHISDWLPTL-CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ +V L H SDWLPTL A G +DG + WD ++KG + R E+LH ID +
Sbjct: 298 RPGRVYDGLMHASDWLPTLHLLAGGSPKLLGQIDGYDLWDGISKGTPSPRYEVLHGIDTM 357
Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG--DTDNEIDKYSPKEVLYSKAGITFNAL 446
Q AALRV D K + D SD WY + ++E+D Y K L S A +
Sbjct: 358 ---QGNRAALRVGDYKVILNQDLKFYSD-WYKRPEGEHELD-YIRKPRLLSNATV----- 407
Query: 447 KTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLF 506
IK H +L L CN KPCLF
Sbjct: 408 --DCTIKSPHP-------------LLYTHAPL------------------CNPQKKPCLF 434
Query: 507 NITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
NI DPCE +NLA + LK L D+L Y M P D A+P R +W PW D
Sbjct: 435 NIKWDPCEYHNLAAFMPNTLKVLLDRLKFYAEGMKPVIYPRADDLANPDRLGGVWTPWKD 494
Query: 567 E 567
+
Sbjct: 495 D 495
>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
Length = 565
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 238/545 (43%), Positives = 332/545 (60%), Gaps = 30/545 (5%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A ++P+IIII+ADD+G++DVSF G + TPNIDALAY+G +L++ Y A+CTPSR A
Sbjct: 21 SAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGA 80
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L++G+YPIH G QH VI EPW L L L+P+ K AGY+T+ +GKWHLGF R YTP
Sbjct: 81 LLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTP 140
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
T RGFD H+GYW DY+ K Y G D R NM+++ + G Y TDL T A
Sbjct: 141 TRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGAYVTDLLTAEAE 200
Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I +H +K +P+FL L+HLA H N +P QAP+EE+ KF I DP RR YA ++S+LD
Sbjct: 201 RLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIISKLD 260
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+SVG +++ L LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R
Sbjct: 261 QSVGRIMSTLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGGVRVAG 320
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR-- 378
AIWS L+ + S+ +++DWLPTL AAGIE++ + LDG++ W L+ A
Sbjct: 321 AIWSSKLQARGSIFSQPLYVADWLPTLSHAAGIELDSSLKLDGIDLWPELSGSADAPHVP 380
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
EILH +D++ AAL++ KYV GT G+ D T E+D P+E Y+
Sbjct: 381 REILHILDDIWR----VAALQLGQWKYVNGTTAAGRYDSLL--TYRELDDLDPRESRYA- 433
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
+ +++A +A S LR LT ++I R A VRC + CN
Sbjct: 434 -------------VTVRNSATSRALSRYDLRR-LTQQRISITRRLAAVRCGDLQRS--CN 477
Query: 499 STVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ CL++I+ DPCEQNNL +E +D+L L ++ +++ PGN+ ADPA
Sbjct: 478 PLLEECLYDISTDPCEQNNLVYSERHSDVLAALRRRVRELRASASRPGNRASMPEADPAL 537
Query: 557 WNNIW 561
W
Sbjct: 538 HTCAW 542
>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
Length = 575
Score = 442 bits (1138), Expect = e-121, Method: Compositional matrix adjust.
Identities = 238/566 (42%), Positives = 335/566 (59%), Gaps = 43/566 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+IIIILADD+G++DVSF G + TPNIDALA++G IL++ Y A+CTPSR AL++G+
Sbjct: 36 RPNIIIILADDMGFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGR 95
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPIH G QH VI EPWGL L L+P+ ++AGY+T+ IGKWHLGF R YTPT RGF
Sbjct: 96 YPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGF 155
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
D HYGYW DYY K Y G D R NM+++ + G+Y TDL T A VI E
Sbjct: 156 DYHYGYWGAYIDYYQRRSKMPARNYSLGYDFRRNMELECRDRGVYVTDLLTNEAERVIRE 215
Query: 208 H-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
+ +P+FL L+HLA H N +P QAP+EE+ KF I DP RR YA MVS+LD+SVG
Sbjct: 216 REGQEQPLFLVLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGR 275
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+++AL G LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R AIWS
Sbjct: 276 IVSALNSTGQLENSIVIFYSDNGAPSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAIWSA 335
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR--SEILH 383
L+ + ++ +++DWLP+L AAGI + T LDG++ W L EILH
Sbjct: 336 QLQARGNIFTQPIYVADWLPSLAHAAGIGLPHTLELDGIDLWPQLAGPVDAPHVPREILH 395
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
+D+ + + ++ KY+ GT + GQ D+ T E+D P+E Y
Sbjct: 396 ILDD----EWRVTSFQMGHWKYINGTTSAGQYDQVL--TYRELDDLDPRESRYV------ 443
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
+ +++ +A + +R +T ++++ LR A VRC +G CN V+
Sbjct: 444 --------VTVRNSPTSRALARHDIRR-MTQQRVMTLRRSAAVRCGDLQRG--CNPLVEE 492
Query: 504 CLFNITDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
C+++I DDPCE+NNL S +++L L ++ + + PGN+ +R+DP W
Sbjct: 493 CVYDIWDDPCEKNNLVYSTRHSEVLAALRQRVQMLREGASAPGNRQGIRRSDPTLHKCTW 552
Query: 562 VPWYDELDKQKAIETMRLHKPLLSPT 587
+ + KP L+P
Sbjct: 553 -------------DVFEVEKPALAPV 565
>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
Length = 500
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 241/558 (43%), Positives = 324/558 (58%), Gaps = 70/558 (12%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
+ C LL + L + P+++ ILADDLGW+D+S HGS +IPTPNID LA +G++LN
Sbjct: 8 IACFLLLSS--LRNSFAGHPNVVFILADDLGWDDISLHGSQEIPTPNIDLLATDGILLNN 65
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
+YVQ +CTPSR+ALMTG++P+H+G+QH VI+ +P+GL L E LLPQYLK GY+TH +G
Sbjct: 66 YYVQPICTPSRAALMTGRHPVHLGLQHDVIVWAQPYGLGLNETLLPQYLKTLGYSTHMVG 125
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
KWHLGF+ + +TPT RGFDSH GY+ G +DYYDH+ T + + GLD H+ +VD G
Sbjct: 126 KWHLGFYDKEHTPTKRGFDSHLGYYTGCEDYYDHTWGFTKQDW-GLDFWHDREVDRSAFG 184
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP--FQAPDEEVAKFLDISDP 248
YST+++T A VIAEH+ SKP+FLYLA AVH+GN QAP + V F+ I
Sbjct: 185 QYSTEVFTSEAERVIAEHDVSKPLFLYLAQQAVHSGNPGNKVRLQAPWKYVKNFMGIKSE 244
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
ERR +AGMVS LD+SV NV AL + G+L ++I++F DNG P+ N N PLRG
Sbjct: 245 ERRVFAGMVSALDDSVSNVTKALHEKGILNDTIIIFSTDNGGPANRQDYNDACNWPLRGS 304
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT--SLDGVNQ 366
K T W+GG+RG IWSP L+ + VS L I DW+PT+ AAG +++ SLDG++Q
Sbjct: 305 KRTMWEGGVRGNGFIWSPLLENSGYVSEHLMQIVDWVPTVLEAAGFDMSKLPPSLDGLSQ 364
Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
W L++ + R +LHNID ++ ALR D+K V G GDT+
Sbjct: 365 WKALSRHESSPRKVLLHNIDPINGD----GALRTTDMKIVFGG----------GDTERYP 410
Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
Y P+E L K SS A
Sbjct: 411 GWYEPEEALIRSPS--------------------KRPSSTA------------------- 431
Query: 487 RCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK 546
C +KPC+FNIT DPCE N+A+ D+L++L K+ + T VPP N
Sbjct: 432 ----------CKPWLKPCVFNITADPCEYYNIADFHPDILQELLQKVDEFNQTAVPPRNV 481
Query: 547 PFDKRADPARWNNIWVPW 564
P D + P N +W PW
Sbjct: 482 PSDPKGLPINNNGVWGPW 499
>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 546
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 230/548 (41%), Positives = 338/548 (61%), Gaps = 48/548 (8%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+II+I+ADDLGWNDVSFHG++QIPTPNIDALAYNG+IL +HYV +CTPSR+A +TG+
Sbjct: 23 RPNIIVIMADDLGWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGR 82
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPI GMQ + GEP +PL LLP+YL++ GYATH +GKWH+G++ + +TPT RGF
Sbjct: 83 YPIRTGMQGYPLKAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGF 142
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI-----YSTDLYTEAAIN 203
D+ +GY+ G Y++H+ K + G D+ ++ ++K + + Y TDL TE A N
Sbjct: 143 DTFFGYYNGYISYFNHTIKQN--NHVGYDLHYH---NSKNLSVAYNFEYITDLITERAEN 197
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV--AKFLDISDPERRTYAGMVSRLD 261
+I H++ KP++L L+HLAVH+ + E + DE+ A I D RR YAG+V+ +D
Sbjct: 198 IIKNHDRRKPLYLQLSHLAVHSSDAKEVMEVRDEQETNATLEYIEDYNRRKYAGVVTAMD 257
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
ESVG VI AL + MLENSI++FM+DNGA + G+ N GSN+PLRG+K T ++GG+RGVA
Sbjct: 258 ESVGRVIKALGQSSMLENSIIVFMSDNGAQTEGLLENYGSNYPLRGLKFTLFEGGIRGVA 317
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKTKRS 379
++S ++ + ++S+EL HI+DWLPT +AAG +E + ++DGV+QWD + G ++KR
Sbjct: 318 CVYSRLIQNSSRISNELIHITDWLPTFYSAAGGNLENLEENMDGVDQWDTIVSGRESKRG 377
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID--KYSPKEVLYS 437
+L NID V+N KY N Q ++YGD + +Y+ VL+S
Sbjct: 378 SVLLNIDEVENVSSALLG------KYKLINGKNIQYSDYYGDNGTSVSYPEYNVSSVLHS 431
Query: 438 KAGIT-FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH 496
G T F+ L I E +++LR ARV CN
Sbjct: 432 FVGSTIFHTTNFTLNI----------------------EDVIKLRNKARVVCNNFTSYPK 469
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
C V CLF++ +DPCE +L++ +++K+L + Y+ ++ N P D + P
Sbjct: 470 C---VDKCLFDVYNDPCETTDLSDEYPEIVKKLNYFITEYEKVLLHQTNAPVDPHSYPEN 526
Query: 557 WNNIWVPW 564
+N W+PW
Sbjct: 527 FNGTWMPW 534
>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
Length = 525
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 241/537 (44%), Positives = 325/537 (60%), Gaps = 24/537 (4%)
Query: 36 LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM 95
+ADDLG++DVSF GS+ TPNIDALAY+G+ILN Y A+CTPSRSAL+TGKYPI GM
Sbjct: 1 MADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTPAMCTPSRSALLTGKYPISTGM 60
Query: 96 QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
QH VI+ +PWGLPL E + + ++ GY T +GKWHLG + +TPT RGFD+H GY
Sbjct: 61 QHYVIVNDQPWGLPLNETTMAEIFQQNGYYTSLLGKWHLGMSQRNFTPTKRGFDTHLGYL 120
Query: 156 QGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN-KSKP 213
DYYD + + + Y +G D R N++ + +G Y TD+ ++AA+ +I +HN +KP
Sbjct: 121 GAYIDYYDQTYLQSSQNYSRGHDFRDNLEASHDKVGHYVTDILSDAAVELIEKHNVTAKP 180
Query: 214 MFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRK 273
+FL L+HLA HA N +P QAP EE+++F I + R YA MVSRLD+SVG VI AL
Sbjct: 181 LFLLLSHLAPHAANDNDPLQAPMEELSQFEYIQNKSHRYYAAMVSRLDKSVGRVIEALAN 240
Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK 333
ML NSIVLF++DNG PS G H+N SN+PLRG K TPW+GG+R AAIWS ++
Sbjct: 241 REMLRNSIVLFLSDNGGPSVGEHANTASNYPLRGQKHTPWEGGIRSSAAIWSTEFERLGS 300
Query: 334 VSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ 392
V + +I D LPTLC+AAGI ++ + LDG+N W L G ++ EI+H+ID VD PQ
Sbjct: 301 VWKQRIYIGDLLPTLCSAAGISLDPSMHLDGLNLWSALKYGYESVEREIVHSIDAVD-PQ 359
Query: 393 KYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQI 452
+ A +R K + GT N+GQ D W G D+ P+ V Y + I + +LQ+
Sbjct: 360 PHLAYMR-GKWKIINGTTNSGQYDGWLG--QRITDEVDPRSVHYDQL-IRNTTVWQQLQL 415
Query: 453 KQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDP 512
A P+ + I ELR+ A++ C C + PC F+I DP
Sbjct: 416 L---ADKPQ-----------QERNITELRQQAQILCPTPANAKPCMPLIAPCFFDIEVDP 461
Query: 513 CEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDE 567
CEQNNL + T + L +++ Y P NKP D + DP + N W W DE
Sbjct: 462 CEQNNLYDEYRNTSIFLDLWERIDQYARNAHLPNNKPADPKCDPRYYRNEWTWWQDE 518
>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 588
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 241/563 (42%), Positives = 327/563 (58%), Gaps = 34/563 (6%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F AL LL + + ++PHI+ ILADDLGWNDVS++G QI TPNIDALA+NG+
Sbjct: 47 FIALAAMLLTS----RRSPVRQPHIVFILADDLGWNDVSYNGCPQIRTPNIDALAWNGIR 102
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L ++Y Q +CTPSR+ALMTG+YPIH GMQH VIL+ EP GLPL KLLPQ+L + GY +
Sbjct: 103 LQRYYTQPMCTPSRAALMTGRYPIHTGMQHFVILQNEPRGLPLKFKLLPQWLGDLGYVSQ 162
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVD 185
+GKWHLGF+++ YTPT RGF H G W G DYY H K F + GLD R +
Sbjct: 163 MLGKWHLGFYKKEYTPTMRGFQKHIGSWGGFVDYYSHIRFNKIGFS-HSGLDFRQGLSEG 221
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
+ G Y T+ TEAA VI H KP+FLYLAHLA H N ++P Q P + K+ DI
Sbjct: 222 REFDGQYYTEFMTEAATRVIENHPLEKPLFLYLAHLAPHGANRHDPLQVPKKYSDKYHDI 281
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
R YAGMVS LDESVG V+ AL K GML +++++F +DNG + G + N S+ P
Sbjct: 282 GHWNRTMYAGMVSALDESVGAVVEALGKRGMLSDTVLVFSSDNGGDTNGENPNYASSWPF 341
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVS-SELFHISDWLPTLCAAAGIEINDTS-LDG 363
+G K T W+GG+ IWSP + + +FHISDWLPTL AG + +D +DG
Sbjct: 342 KGQKRTLWEGGIHVPGFIWSPLFSGMRGFDYNNIFHISDWLPTLYQLAGGDPSDLGDIDG 401
Query: 364 VNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
++ D L++ ++T R E+L NID ++N +A+ K V+G G DEW+
Sbjct: 402 ISHLDSLSRRSETPRKELLINIDPIEN----VSAIIEGHFKLVSGVVRGGVLDEWFQVPG 457
Query: 424 NEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF 483
N IT++ + + + + A N+ + D
Sbjct: 458 N----------------ITWDYNRARQECETSLVARVLRNAGHDVACGSEDGSFP----- 496
Query: 484 ARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP 543
++C + C TV PCLF+++ DPCE NN+AE ++L++L KL Y+ T VPP
Sbjct: 497 TPIKCGKRDPSKPCVPTVAPCLFDLSKDPCEYNNIAEQHNEVLQRLLGKLEGYRETSVPP 556
Query: 544 GNKPFDKRADPARWNNIWVPWYD 566
GN P D++AD A NN+W W D
Sbjct: 557 GNLPSDEQADAALHNNVWSSWGD 579
>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
Length = 567
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 245/551 (44%), Positives = 334/551 (60%), Gaps = 35/551 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
TT K+P+IIIILADD+G++DVSF G + TPNIDALAY+G IL+ Y A+CTPSR A
Sbjct: 22 TTTTKRPNIIIILADDMGFDDVSFRGGREFLTPNIDALAYHGRILDNLYAPAMCTPSRGA 81
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L++G+YP H G QH VI EPWGL L L+P+ KEAGY+T+ IGKWHLGF YTP
Sbjct: 82 LLSGRYPAHTGTQHFVISNEEPWGLTLNATLMPEIFKEAGYSTNLIGKWHLGFASPEYTP 141
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
T RGFD HYGYW DYY + Y G D R NM ++ + G+Y TDL T+ A
Sbjct: 142 TRRGFDYHYGYWGAYIDYYQRRSQMPVANYSMGYDFRRNMDLECQNRGVYITDLLTQEAE 201
Query: 203 NVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+VI E + + FL L+HLA H N +P QAP+EE+ KF I DP RR YA MV++LD
Sbjct: 202 HVIREKAAANETFFLMLSHLATHTANDNDPLQAPEEEIRKFAYIKDPRRRKYAAMVAKLD 261
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+SVG +I L + LENSIV+F +DNGAPS G +N GSN PLRG K+TPW+GG+R
Sbjct: 262 QSVGRIITTLAQTEQLENSIVIFYSDNGAPSVGQFANTGSNWPLRGQKNTPWEGGIRVAG 321
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS--LDGVNQWDVL--TKGAKTK 377
AIWS L+ ++ +++DWLPTL AAGI++ ++S LDG++ W L +K A
Sbjct: 322 AIWSTQLQARGNKFTQPIYVADWLPTLAHAAGIDLPESSLQLDGLDLWPQLAGSKDAAHV 381
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
EILH +D+V + +++ + KY+ GT +G D+ T E+D P+ Y
Sbjct: 382 PREILHALDDV----WHVSSMLLGHWKYINGTTASGTYDQVL--TYRELDDLDPRNSRYV 435
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
++++ A+ A+ D R L +KI LR A VRC +G C
Sbjct: 436 ------------VEVRNSPASRALAH-FDLQR--LGQQKITSLRHSASVRCGNLQRG--C 478
Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTD--LLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
N V+ CL++I DPCEQNNL S+ +L L +L ++ + +PPGN+P +++DP+
Sbjct: 479 NPLVEECLYDIAADPCEQNNLIYSKQHEAVLMALRKRLQEFRLSALPPGNRPSMRQSDPS 538
Query: 556 ----RWNNIWV 562
W+N V
Sbjct: 539 LHECTWDNFLV 549
>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
Length = 583
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 244/561 (43%), Positives = 330/561 (58%), Gaps = 34/561 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
T KP+IIII+ADDLG++DVSF GS+ TPNIDALAY+G+ILN Y +CTPSR+A
Sbjct: 29 TEISGKPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAPMCTPSRAA 88
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TGKYPI+ GMQH VI+ +PWGLPL E + + GY T IGKWHLG + YTP
Sbjct: 89 LLTGKYPINTGMQHYVIVNDQPWGLPLNETTMADIFRGNGYRTSLIGKWHLGMSQRNYTP 148
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
T RGFD H GY DYY+ S + + Y+G D R N++ +++ + Y TD+ T+AA+
Sbjct: 149 TLRGFDYHLGYLGAYVDYYNQSYEQVSKGYRGHDFRENLKPNHEHVDKYVTDILTDAAVR 208
Query: 204 VIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
I +H N SKP+FL L HLA HA N +P QAP +E++ F I D R YA MVSRL
Sbjct: 209 EIDDHAAKNNSKPLFLLLNHLAPHAANDADPMQAPADELSGFEYIRDETHRYYAAMVSRL 268
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D+SVG+V+ AL + ML+NSI+LF++DNG P+ G H+ SN+PLRG K++PW+GG+R
Sbjct: 269 DKSVGHVVDALARQDMLQNSILLFLSDNGGPTLGEHATTASNYPLRGQKNSPWEGGIRSS 328
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRS 379
AAIWS ++ V + +I D LPTL AAAGI + D LDG+N W L G ++
Sbjct: 329 AAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPDLRLDGLNLWSALKYGYESVER 388
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPK-EVLYS 437
EI+H ID + P+ + A R K V+GT N+G+ D W G+ E+D S + E L
Sbjct: 389 EIVHVIDE-EAPEPHLAYTR-GKWKVVSGTTNSGRYDGWLGERVTTEVDPRSQEYEALIQ 446
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH- 496
+ LQ+++ + I LR+ AR+ C + H
Sbjct: 447 NTSVW-------LQLQKVSGG---------------ERNISGLRQQARIECPATTEDEHT 484
Query: 497 -CNSTVKPCLFNITDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRAD 553
C PCLF+I DPCEQNNL E T + + L ++ + + PP NKP D +
Sbjct: 485 PCLPLEAPCLFDIEADPCEQNNLYEEYKNTTVFQDLWQRVQQFTAQAHPPNNKPGDPSCN 544
Query: 554 PARWNNIWVPWYDELDKQKAI 574
PA ++N W W DE A
Sbjct: 545 PAFYHNEWTWWQDETPSNSAF 565
>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 536
Score = 437 bits (1123), Expect = e-119, Method: Compositional matrix adjust.
Identities = 239/548 (43%), Positives = 337/548 (61%), Gaps = 30/548 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+++ I+ADDLGWNDV FHGS+QIPTPNIDALAYNG+ILN HY Q+ TPSR+AL+TGK
Sbjct: 7 KPNVVFIVADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGK 66
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP+ +G+Q I E LP K++ +Y K+ GYATH +GKWHLG R TPTFRGF
Sbjct: 67 YPMKLGLQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGF 125
Query: 149 DSHYGYWQGLQDYYDHSC--KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
D +G++ G YYD+ K + Y G D+R + G Y+TDL+ E A++VI
Sbjct: 126 DHFFGFYNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQ 185
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
+HN + P+F+ +AHLAVH GN + +AP E V KF I DP RRTYA MVS+LD+S+G
Sbjct: 186 KHNVNTPLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGA 245
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V AL ML+N+IV+F++DNGAP+ G H N GSN+PLRG+K T ++GG+R VA IWSP
Sbjct: 246 VFEALEAKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSP 305
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKTKRSEILHN 384
L Q+ +VS++L HI+DWLPTL A G + + D LDG++QW L + R++I N
Sbjct: 306 LLVQSSRVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLPSARNDIPLN 365
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
ID + AALR K + GT NG + ++G NE + +E Y+ + I +
Sbjct: 366 IDE----KTRNAALRFSYWKLIVGTSGNGSYNGYFGAPLNE----NIEEQQYNTSAINDS 417
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC-NYDNKGAHCN-STVK 502
+ +I +K +P L++ LR A ++C + K C+ ++
Sbjct: 418 PVG---RIAKKINYNP-----------LSETDFDGLRRVATLKCLDAKAKRNPCDPASGA 463
Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD-KRADPARWNNIW 561
CL+NI +DPCE+N+LA+ ++++++ L Y+ +VP D + ADP + W
Sbjct: 464 VCLYNIPNDPCEENDLAKFFPSVVRRIKRTLVDYRQGLVPQIETSIDIENADPKLFQFAW 523
Query: 562 VPWYDELD 569
PW D D
Sbjct: 524 NPWRDCSD 531
>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
Length = 558
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 239/551 (43%), Positives = 339/551 (61%), Gaps = 30/551 (5%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A +P+++ I+ADDLGWNDV FHGS+QIPTPNIDALAYNG+ILN HY Q+ TPSR+AL+
Sbjct: 26 ATGQPNVVFIVADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALL 85
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TGKYP+ +G+Q I E LP K++ +Y K+ GYATH +GKWHLG R TPTF
Sbjct: 86 TGKYPMKLGLQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTF 144
Query: 146 RGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD +G++ G YYD+ + K + Y G D+R + G Y+TDL+ E A++
Sbjct: 145 RGFDHFFGFYNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVD 204
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
VI +HN + P+F+ +AHLAVH GN + +AP E V KF I DP RRTYA MVS+LD+S
Sbjct: 205 VIQKHNVNTPLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDS 264
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+G V AL ML+N+IV+F++DNGAP+ G H N GSN+PLRG+K T ++GG+R VA I
Sbjct: 265 IGAVFEALEAKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGVRTVACI 324
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKTKRSEI 381
WSP L Q+ +VS++L HI+DWLPTL A G + + D LDG++QW L + R++I
Sbjct: 325 WSPLLVQSSRVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLPSARNDI 384
Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
NID + AALR K + GT NG + ++G NE + +E Y+ + I
Sbjct: 385 PLNIDE----KTRNAALRFSYWKLIVGTSGNGSYNGYFGAPLNE----NIEEQQYNTSAI 436
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC-NYDNKGAHCN-S 499
+ + +I +K +P L++ LR A ++C + K C+ +
Sbjct: 437 NDSPVG---RIAKKINYNP-----------LSETDFDGLRRVATLKCLDAKAKRNPCDPA 482
Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD-KRADPARWN 558
+ CL+NI +DPCE+N+LA+ ++++++ L Y+ +VP D + ADP +
Sbjct: 483 SGAVCLYNIPNDPCEENDLAKFFPSVVRRIKRTLVDYRQGLVPQIETSIDIENADPKLFQ 542
Query: 559 NIWVPWYDELD 569
W PW D D
Sbjct: 543 FAWNPWRDCSD 553
>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
Length = 570
Score = 434 bits (1116), Expect = e-119, Method: Compositional matrix adjust.
Identities = 241/563 (42%), Positives = 323/563 (57%), Gaps = 32/563 (5%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F L C A ++P+IIII+ADD+G++DVSF G + TPNIDALAY+G +
Sbjct: 8 FLLLLCLRGSKANGAEMEATRRPNIIIIMADDMGFDDVSFRGGREFITPNIDALAYHGRL 67
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L++ Y A+CTPSR AL++G+YPIH G QH VI EPW L L+P+ + AGY+T+
Sbjct: 68 LDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVISNEEPWALDSNATLMPEIFQRAGYSTN 127
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDN 186
+GKWHLGF R YTPT RGFD HYGYW DYY K Y G D R NM+++
Sbjct: 128 LVGKWHLGFSRPEYTPTHRGFDYHYGYWGAYIDYYQRRSKMPVANYSLGYDFRRNMELEC 187
Query: 187 KTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
+ G Y TDL T A +I E K KP+FL L+HLA H N +P QAP+EE+ KF I
Sbjct: 188 RDRGTYVTDLLTTEAERLIKEQAGKDKPLFLMLSHLATHTANEDDPLQAPEEEIRKFSYI 247
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
DP RR YA MVS+LD+SVG +I+ L LENSIV+F +DNGAPS G+ +N GSN PL
Sbjct: 248 KDPNRRKYAAMVSKLDQSVGRIISTLASTDQLENSIVIFYSDNGAPSVGMFANTGSNWPL 307
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND---TSLD 362
RG K+TPW+GG+R A+WS L+ + ++ ++ DWLPTL AA IE+ LD
Sbjct: 308 RGQKNTPWEGGVRVAGAVWSSKLEARGSIFTQPIYVGDWLPTLAHAADIELEPKILAGLD 367
Query: 363 GVNQWDVLTKGAKTKR--SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
G++ W L+ A EILH++D+V +L + KYV GT + GQ D
Sbjct: 368 GIDLWPQLSGAADAPHVPREILHSLDDVWK----VGSLLMGHWKYVNGTTSAGQYDSVL- 422
Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
T E D P+E Y+ + +++ +A + LR LT ++I L
Sbjct: 423 -TYREFDDIDPRESRYA--------------VTIRNSPASRALTRLDLRR-LTQQRIAGL 466
Query: 481 REFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLA--ESQTDLLKQLEDKLAIYKS 538
R A V C + CN ++ CLF+I+ DPCEQNNLA E +D+L L + ++
Sbjct: 467 RRSAAVHCGDFQRS--CNPLLEDCLFDISADPCEQNNLAYSERHSDVLAALRKRFRELRA 524
Query: 539 TMVPPGNKPFDKRADPARWNNIW 561
+ PGN+ ADP W
Sbjct: 525 SASRPGNRASMPSADPTWHTCTW 547
>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
Length = 574
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 235/549 (42%), Positives = 331/549 (60%), Gaps = 34/549 (6%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
T K P++III+ADD+G++D+S G + TPNIDAL ++G +L++ Y A+CTPSR AL
Sbjct: 30 TGNKLPNVIIIMADDMGFDDISLRGGREFLTPNIDALGFHGRLLDRLYAPAMCTPSRGAL 89
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
++GKYPIH G QH VI EPW L L L+P+ + AGY+T+ +GKWHLGF R YTPT
Sbjct: 90 LSGKYPIHTGTQHFVISNEEPWSLMLNTTLMPEIFRSAGYSTNLVGKWHLGFARPEYTPT 149
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD HYGYW DYY + + Y G D R NM+V+ G+Y TDL T A
Sbjct: 150 HRGFDYHYGYWGAYIDYYQRRSQMPEKTYIVGYDFRRNMEVECTDRGVYVTDLLTNEAER 209
Query: 204 VIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+I E K KP+FL + HLA H N +P QAP+EEV KFL I DP R YA MVS+LD+
Sbjct: 210 IIQETAAKQKPLFLMINHLATHTANDNDPLQAPEEEVQKFLHIKDPNHRKYAAMVSKLDQ 269
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SVG VI AL + LENSIV+F +DNGAPS G+ +N GSN PL+G K++PW+GG+R
Sbjct: 270 SVGRVITALARAEQLENSIVIFYSDNGAPSVGMFANTGSNWPLKGQKNSPWEGGVRVAGV 329
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL--TKGAKTKRS 379
IWS L + ++ ++ D+LPTL AAGIE+ ++ LDG++ W L +K A
Sbjct: 330 IWSQLLTARGHLFTQPIYVGDFLPTLAHAAGIELPESLQLDGIDLWPQLAGSKDAPHVPR 389
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
EILH++D+V ++++L + KYV GT GQ D+ T E+D+ P+ Y
Sbjct: 390 EILHSLDDV----SHFSSLLLGQWKYVNGTTLAGQFDQQL--THRELDELDPRARRYV-- 441
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
++ +++ KA + L+ LT ++ LR A VRC +G CN+
Sbjct: 442 ------------VEVRNSPASKALARYDLQR-LTQHRMHTLRRLATVRCGDMQRG--CNA 486
Query: 500 TVKPCLFNITDDPCEQNNLA--ESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP--- 554
++ CL++++ DPCE NNLA E ++L + ++ ++ +PPGN+P A+P
Sbjct: 487 LLEECLYDLSVDPCELNNLAYSERHAEILATMRQRVQQLRAEALPPGNRPGVSYANPNLH 546
Query: 555 -ARWNNIWV 562
W+N V
Sbjct: 547 ECTWDNFEV 555
>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
Length = 478
Score = 431 bits (1107), Expect = e-118, Method: Compositional matrix adjust.
Identities = 240/524 (45%), Positives = 313/524 (59%), Gaps = 58/524 (11%)
Query: 7 YFFALTCTLLFN-DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+F L C LL N A + P +P+II ILADDLG+NDV FHGS+ IPTPNIDALAY+G
Sbjct: 2 WFLWLICLLLPNMIAGEDEKPPAQPNIIFILADDLGFNDVGFHGSADIPTPNIDALAYSG 61
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ILN++YV +CTPSRSALMTGKYPIH GMQH V+ EP GLPL EK+LPQYL E GYA
Sbjct: 62 IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYA 121
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+H GKWHLG ++ YTP +RGF SH+ GLDMR+ +V
Sbjct: 122 SHIAGKWHLGHWKLKYTPLYRGFSSHW----------------------GLDMRNGTEVA 159
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
G Y+TD+ T+ ++ VIA HN +K P+FLY+AH A H+ N Y P PD +V K
Sbjct: 160 YDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMAH 219
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I +RR +A MVS++D+SVG ++ LRK MLENSI++F +DNG P+ G + N SN+P
Sbjct: 220 IPHYKRRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 279
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG----IEINDTS 360
L+G+K+T W+GG+R +WSP LK++Q+VS++ HI DWLPTL AAG +
Sbjct: 280 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGPAAVANLSKQ 339
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
+DG + W L + + R +LHNID++ AAL V D K V GT+ G D WYG
Sbjct: 340 IDGQSIWRALVQDKASPRLNVLHNIDDIWGS----AALSVGDWKLVKGTNYRGSWDGWYG 395
Query: 421 DTDNEIDK-YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
+ Y + V S+AG ALK + +D R
Sbjct: 396 PAGERNPRLYDWQMVARSRAGKALEALKM------------LPSQADQQR---------- 433
Query: 480 LREFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAE 520
+R A V C +KG C +T PCLF+I DDPCEQ NLA+
Sbjct: 434 IRAAATVSCPAQSSKGTSCLATAFSAPCLFHIRDDPCEQYNLAK 477
>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
Length = 585
Score = 430 bits (1105), Expect = e-117, Method: Compositional matrix adjust.
Identities = 242/547 (44%), Positives = 322/547 (58%), Gaps = 31/547 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+IIII+ADDLG++DVSF GS+ TPNIDALAY+G+ILN YV +CTPSR+AL+TGK
Sbjct: 34 KPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGK 93
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPI+ GMQH VI+ +PWGLPL E + + +E GY T +GKWHLG + +TPT RGF
Sbjct: 94 YPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGF 153
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D H GY DYY S + + Y G D R +++ + +G Y TDL T+AA+ I +H
Sbjct: 154 DRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDH 213
Query: 209 ---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
N S+P+FL L HLA HA N +P QAP EEV++F IS+ R YA MVSRLD+SVG
Sbjct: 214 GSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVG 273
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
+VI AL + ML+NSI+LF++DNG P+ G HS SN+PLRG K++PW+G +R AAIWS
Sbjct: 274 SVIDALARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSPWEGALRSSAAIWS 333
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
++ V + +I D LPTL AAAGI + LDG+N W L G ++ EI+H
Sbjct: 334 TEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHV 393
Query: 385 ID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
ID +V P Y + K ++GT N G D W G E + P+ V Y +
Sbjct: 394 IDEDVAEPHLSYTRGK---WKVISGTTNQGLYDGWLG--HRETSEVDPRAVEYEE---LV 445
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVK 502
LQ++Q + I ELR+ +R+ C G C
Sbjct: 446 RNTSVWLQLQQ---------------VSFGERNISELRDQSRIECPDPATGVKPCLPLEG 490
Query: 503 PCLFNITDDPCEQNNL-AESQ-TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
PCLF+I DPCE++NL AE Q + + L ++ + PP NKP D DP ++N
Sbjct: 491 PCLFDIEADPCERSNLYAEYQNSTIFLDLWSRIQQFAKQAHPPNNKPGDPNCDPRFYHNE 550
Query: 561 WVPWYDE 567
W W DE
Sbjct: 551 WTWWQDE 557
>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 502
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 239/557 (42%), Positives = 327/557 (58%), Gaps = 68/557 (12%)
Query: 14 TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
T + + KPHII+I+ADDLGWND+SFHGS++IPTPNID LA NG+IL+ +YV
Sbjct: 4 TFVLTSFIMIVIHADKPHIIMIVADDLGWNDISFHGSNEIPTPNIDRLANNGVILDNYYV 63
Query: 74 QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
+CTPSRSA+MTG+YPIH GMQ I P+G+ L EK LPQYLK+ GY TH +GKWH
Sbjct: 64 LPICTPSRSAIMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYLKQQGYKTHGVGKWH 123
Query: 134 LGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS 193
LGFF + YTPT+RGFDS+YG + G DY++HS T Y GLD+ N G YS
Sbjct: 124 LGFFAKQYTPTYRGFDSYYGSYLGKGDYWNHSNTET---YSGLDLHDNENGVFSQDGNYS 180
Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE-PFQAPDEEVAKFLDISDPERRT 252
T++YT AI+ I HN S+P+FLYLA+ AVH+ NT E P QAP E + KF I +RR
Sbjct: 181 TEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRK 240
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
YA M+ +D VG V AL K ML+NSI++F DNG P+ G N +N PLRG+K+T
Sbjct: 241 YAAMLGYMDYGVGRVHDALAKKKMLDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKATL 300
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
++GG+RGV+ ++S L ++ +VS EL HI+DWLPTL AG +++D LDG +QW L
Sbjct: 301 FEGGVRGVSFVYSK-LIESPRVSHELIHITDWLPTLVNLAGGKVSDGFLDGFDQWATLQN 359
Query: 373 GAKTKRSEILHNIDNVDNPQKYY--AALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
++R+E+L NID +K + ALRV K + + + W D +
Sbjct: 360 KQSSQRNEVLLNID-----EKVWKNEALRVGSWKIIK------EGNYW--------DGWY 400
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P ++ +F+ L + V+C +
Sbjct: 401 PPPSFNEQSNNSFSYLSST------------------------------------VKCGH 424
Query: 491 DNKGA--HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP-GNKP 547
D HC+S CLF+I +DPCE N+L++ ++L +L ++L Y+ +MVPP N
Sbjct: 425 DIPIVINHCDSY---CLFHIDEDPCEINDLSKKFPEVLAELINRLNTYRQSMVPPRNNMT 481
Query: 548 FDKRADPARWNNIWVPW 564
D R+DP N +W PW
Sbjct: 482 IDPRSDPKLHNGVWQPW 498
>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
purpuratus]
Length = 514
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 230/543 (42%), Positives = 321/543 (59%), Gaps = 56/543 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PHII I+ADDLGW+DVS HGSSQIPTPNID LA +G+ L +YV LCTPSRSA+MTG
Sbjct: 9 QPPHIIFIVADDLGWDDVSLHGSSQIPTPNIDTLAQDGVTLTNYYVSPLCTPSRSAIMTG 68
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
++PIH G+Q GVI P+GL L EK + QYLK GY+THA+GKWHLG+F + YTPT+RG
Sbjct: 69 RHPIHTGLQFGVISPEAPYGLGLEEKTMAQYLKTLGYSTHAVGKWHLGYFAKEYTPTWRG 128
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS +G++ G DYY H + G D+ N +V G YSTD++ + A +I
Sbjct: 129 FDSFFGFYNGRGDYYTH--EEVQSEVSGYDLHKNGKVYRPAFGQYSTDIFNQEAEQIIKA 186
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEP-FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
HN S+P+FLYLAH AVHAG + QAPD+ +F I RR YA MVS LD+SVG+
Sbjct: 187 HNASQPLFLYLAHQAVHAGVYPDRLLQAPDKYYQRFPHIETEGRRMYAAMVSALDDSVGS 246
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSF----GIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
+ LR G+ +NSI++F DNG ++ G H+ SN PLRG K T W+GG+RG A
Sbjct: 247 ISQTLRDAGLYDNSIIVFTTDNGGATYDFFDGTHA---SNWPLRGGKHTLWEGGVRGTAF 303
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEI 381
+ SP +K+ ++ S ++ H+ DWLPTL + AG + + + DG +QWD L+ A + R+EI
Sbjct: 304 VNSPLIKKPRRFSDQMMHVCDWLPTLHSVAGGDPKELKNSDGFDQWDALSNNAPSPRNEI 363
Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
LHNI+ P + YAA+RV D K G +++ Y P+ + A
Sbjct: 364 LHNIE----PIEKYAAIRVGDFKLHYGAFGGKIGSKYFSGW------YPPEGATSNMAEY 413
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
N+ + + KQ +A+ C+Y ++
Sbjct: 414 VPNSFRVRCPSKQANAS---------------------------TNCDYGDR-------- 438
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
KPCL+NI DDPCE NN+A+ D+++ + +L Y+ + V P N+P D +++P N+W
Sbjct: 439 KPCLYNIRDDPCEYNNIADWNQDIVEAMTARLREYRRSAVEPRNQPLDPQSNPKLHGNVW 498
Query: 562 VPW 564
PW
Sbjct: 499 EPW 501
>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
Length = 512
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 231/539 (42%), Positives = 319/539 (59%), Gaps = 39/539 (7%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GWNDVS+HG QI TPNIDALA+NG+ L ++Y Q LCTPSR+AL+TG+YPI++G+QH VI
Sbjct: 1 GWNDVSYHGCPQIRTPNIDALAWNGIRLRRYYAQPLCTPSRAALLTGRYPINMGLQHSVI 60
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
EP GLPL++ LLPQ+L + GY TH +GKWH+GFF++ YTPT RGF+ H G+W D
Sbjct: 61 YNEEPRGLPLSDTLLPQWLADLGYVTHHLGKWHIGFFKKEYTPTMRGFERHVGFWGAYID 120
Query: 161 YYDHSCKATFEPYQ--GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
YY H KA P + GLDMR N+ + G Y T L T+ A+ VI H KP+FLYL
Sbjct: 121 YYKHE-KAYLGPTRSPGLDMRRNLFLARNDTGRYVTQLLTKEALEVIENHPVDKPLFLYL 179
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
AHLA H+ +P Q PD+ + ++ DI ER YAGMVS LDESVG V AL K+GML
Sbjct: 180 AHLAPHSAGPQDPLQVPDKYIEQYRDIGSTERMKYAGMVSALDESVGLVFEALAKNGMLS 239
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE- 337
N++ +F +DNGA + G ++N S+ PLRG K TPW+GG+R A +WS L + +
Sbjct: 240 NTLFVFTSDNGADAAGPNANYASSWPLRGQKYTPWEGGVRAAALLWSHALGPRVRAGGDY 299
Query: 338 --LFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTK-----RSEILHNIDNVD 389
LFHI+DWLPTL AG + + +DGV+ + L A + R+E+L NID +D
Sbjct: 300 EHLFHITDWLPTLYQLAGGDPAHLGDIDGVSHVNALVGDASPQPPQPPRTEVLVNIDPID 359
Query: 390 NPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTK 449
N Y+A+ K+VAG G ++WY N +D S + + +A LK+
Sbjct: 360 N----YSAIIDGRFKFVAGVAERGIYEQWYPILGN-VDWNSTEPLEQCEASPVARVLKS- 413
Query: 450 LQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNIT 509
+ A P + A + C + C TV PCLF++
Sbjct: 414 --FGPEPACGPSGGA-------------------APIDCGVRDPAKACQPTVAPCLFDLR 452
Query: 510 DDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDEL 568
+DPCE N+++ + +L+K L KL Y+ +PP N P D +DP ++N W+PW ++L
Sbjct: 453 EDPCEYNDVSAERPELVKSLLTKLRRYEDASMPPANIPHDPLSDPGLYDNAWIPWREKL 511
>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
Length = 585
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 241/547 (44%), Positives = 322/547 (58%), Gaps = 31/547 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+IIII+ADDLG++DVSF GS TPNIDALAY+G+ILN YV +CTPSR+AL+TGK
Sbjct: 34 KPNIIIIMADDLGFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGK 93
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPI+ GMQH VI+ +PWGLPL E + + +E GY T +GKWHLG + +TPT RGF
Sbjct: 94 YPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGF 153
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D H+GY DYY S + + Y G D R N++ + +G Y TD+ T+AA+ I +H
Sbjct: 154 DRHFGYLGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDHVGHYVTDVLTDAAVKEIEDH 213
Query: 209 ---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
N S+P+FL L HLA HA N +P QAP EEV++F IS+ R YA MVSRLD+SVG
Sbjct: 214 GSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKAHRYYAAMVSRLDKSVG 273
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
+VI AL + ML+NSI+LF++DNG P+ G HS SN+PLRG K++PW+G +R AAIWS
Sbjct: 274 SVIDALARQEMLQNSIILFLSDNGGPTEGQHSTTASNYPLRGQKNSPWEGALRSSAAIWS 333
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
++ V + +I D LPTL AAAGI + LDG+N W L G ++ EI+H
Sbjct: 334 TEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHV 393
Query: 385 ID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
ID +V P Y + K ++GT N G D W G E + P V Y +
Sbjct: 394 IDEDVAVPHLSYTRGK---WKVISGTTNEGLYDGWLG--HRETSEVDPGAVEYEE---LV 445
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVK 502
LQ++Q + + I ELR+ +R+ C G C
Sbjct: 446 RNTSVWLQLQQVTSG---------------ERNISELRDKSRIECPDPATGVKPCLPLEG 490
Query: 503 PCLFNITDDPCEQNNL-AESQ-TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
PCLF+I DPCE++NL AE Q + + L +++ + P NKP D DP ++N
Sbjct: 491 PCLFDIEADPCERSNLYAEYQNSTIFLDLWERIQQFAKQAHSPNNKPGDPNCDPRFYHNE 550
Query: 561 WVPWYDE 567
W W DE
Sbjct: 551 WTWWQDE 557
>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
Length = 584
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 248/575 (43%), Positives = 329/575 (57%), Gaps = 38/575 (6%)
Query: 2 TWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
TW F + C L N A +P+IIII+ADDLG++DVSF GS+ TPNIDAL
Sbjct: 12 TWLLAGF--VLCIALSNGII---AASDRPNIIIIMADDLGFDDVSFRGSNNFLTPNIDAL 66
Query: 62 AYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE 121
AY+G+ILN YV +CTPSR+AL+TGKYPI+ GMQH VI+ +PWGLPL E + + +E
Sbjct: 67 AYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRE 126
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
GY T +GKWHLGF + +TPT RGFD H GY DYY S + + Y G D R N
Sbjct: 127 NGYRTSLLGKWHLGFSQRNFTPTQRGFDRHLGYLGAYVDYYTQSYEQQSKGYNGHDFRDN 186
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
++ + +G Y TD+ T+AA+ I +H N S+P+FL L HLA HA N P QAP EE
Sbjct: 187 LKSSHDQVGHYITDVLTDAAVKEIEDHASKNSSQPLFLLLNHLAPHAANDDNPMQAPAEE 246
Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
V++F I + R YA MVS+LD+SVG+VI AL + ML+NSIVLF++DNG P+ G HS
Sbjct: 247 VSRFEYIRNKTHRYYAAMVSKLDQSVGSVIDALARQEMLQNSIVLFLSDNGGPTQGQHST 306
Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
SN+PLRG K +PW+G +R AAIWS ++ V + +I D LPTL AAAGI +
Sbjct: 307 TASNYPLRGQKDSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDP 366
Query: 359 T-SLDGVNQWDVLTKGAKTKRSEILHNID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
LDG+N W L G ++ EI+H ID +V P Y + K ++GT N G D
Sbjct: 367 ALHLDGLNLWSALKYGYESVEREIVHVIDEDVAVPHLSYTRGK---WKVISGTTNQGLYD 423
Query: 417 EWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEK 476
W G E + P+ V Y + L + Q+ L+ + + E+
Sbjct: 424 GWLG--HRETSEVDPRAVEYEE-------LVRNTSVWQQ------------LQQVTSGER 462
Query: 477 -ILELREFARVRCNYDNKGAH-CNSTVKPCLFNITDDPCEQNNL-AE-SQTDLLKQLEDK 532
I ELR+ A + C G C PCLF+I DPCE++NL AE + L +
Sbjct: 463 NISELRDQASIECPNPATGVRPCLPLEAPCLFDIEADPCERSNLYAEYKNSTTFLDLWAR 522
Query: 533 LAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDE 567
+ + PP NKP D DP ++N W W DE
Sbjct: 523 IQQFAKQAQPPNNKPADPDCDPRFFHNEWTWWQDE 557
>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
Length = 585
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 240/547 (43%), Positives = 321/547 (58%), Gaps = 31/547 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+IIII+ADDLG++DVSF GS TPNIDALAY+G+ILN YV +CTPSR+AL+TGK
Sbjct: 34 KPNIIIIMADDLGFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGK 93
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPI+ GMQH VI+ +PWGLP+ E + + +E GY T +GKWHLG + +TPT RGF
Sbjct: 94 YPINTGMQHYVIVNDQPWGLPINETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGF 153
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D H+GY DYY S + + Y G D R N++ + +G Y TD+ T+AA+ I +H
Sbjct: 154 DRHFGYLGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDYVGHYVTDVLTDAAVKEIEDH 213
Query: 209 ---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
N S+P+FL L HLA HA N +P QAP EEV++F IS+ R YA MVSRLD+SVG
Sbjct: 214 GSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVG 273
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
+VI AL + ML+NSI+LF++DNG P+ G HS SN+PLRG K++PW+G +R AAIWS
Sbjct: 274 SVIDALARQEMLQNSIILFLSDNGGPTEGQHSTTASNYPLRGQKNSPWEGALRSSAAIWS 333
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
++ V + +I D LPTL AAAGI + LDG+N W L G ++ EI+H
Sbjct: 334 TEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHV 393
Query: 385 ID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
ID +V P Y + K ++GT N G D W G E + P+ V Y +
Sbjct: 394 IDEDVAVPHLSYTRGK---WKVISGTTNEGLYDGWLG--HRETSEVDPRAVEYEE---LI 445
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVK 502
LQ++Q + + I ELR +R+ C G C
Sbjct: 446 RNTSVWLQLQQVSSG---------------ERNISELRYQSRIECPDPATGVKPCLPLEG 490
Query: 503 PCLFNITDDPCEQNNL-AESQ-TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
PCLF+I DPCE++NL AE Q + + L ++ + P NKP D DP ++N
Sbjct: 491 PCLFDIEADPCERSNLYAEYQNSTIFLDLWARIQQFAKQAHSPNNKPGDPNCDPRFYHNE 550
Query: 561 WVPWYDE 567
W W DE
Sbjct: 551 WTWWQDE 557
>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 599
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 244/585 (41%), Positives = 334/585 (57%), Gaps = 47/585 (8%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+ L +LL + +FL K+PHII+I+ADDLGWNDV FHGS QIPTPNIDALAYNG
Sbjct: 10 RILIILVSSLLSDCSFLRGPNKKQPHIILIVADDLGWNDVGFHGSIQIPTPNIDALAYNG 69
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ILN+HYVQ CTPSR+AL+TGKYPI G+Q I+ G P LPL EK+LPQYLK+ GY+
Sbjct: 70 VILNRHYVQPTCTPSRAALLTGKYPIRYGLQGFPIIAGVPLALPLNEKILPQYLKDLGYS 129
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-V 184
TH +GKWHLG + +TP RGFDSH+GYW G Y + + G D R +
Sbjct: 130 THLVGKWHLGANKNQHTPIKRGFDSHFGYWNGFISYRNSTHSTGL--MVGKDARRGFERA 187
Query: 185 DNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAG----NTYEPFQAPDEEV 239
++ + Y+TD++T+ A VI N KPMFL ++HLAVH G N E ++
Sbjct: 188 GDEMVDRYATDIFTDEANKVIKLCKNHDKPMFLMVSHLAVHTGVPGPNILEVSNKTHNDI 247
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS---FGIH 296
+F I + ERR YAGM++ LDESVG++I +L +GMLE+SIVLF++DNGAP+ +
Sbjct: 248 -RFDYIENKERRLYAGMLTSLDESVGSIIESLDNNGMLEDSIVLFISDNGAPADDPIWGY 306
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
N GSN PLRG K DGG+RGVAAIWSPWLK+ ++ LFHI+DWLPTL AAG +
Sbjct: 307 GNSGSNWPLRGEKGAVLDGGVRGVAAIWSPWLKKKHRIFENLFHITDWLPTLYTAAGGDF 366
Query: 357 NDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ-------KYYAALRVDDLKYVAG 408
D +DGV+QW+ LT+ +K RS +L NID + K + R D+ Y
Sbjct: 367 EDLGQIDGVDQWESLTETSKRTRSMVLINIDETSGEEALIFNHWKVVKSNRTSDIAYYL- 425
Query: 409 TDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDAL 468
++ G+ N Y+ V S AG + + + + +D +
Sbjct: 426 --------KYSGEPGNVGPDYNMTAVAESLAGSRLSKINCLV------------SKADCM 465
Query: 469 RTILTDEKILELREFARV------RCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQ 522
T+ T + +R A++ R + + A PCLF+I DPCE N+A
Sbjct: 466 DTLTTYDLFYNVRSQAKIDGKCSERISDPDPNARYECFDSPCLFDIQRDPCEYQNIASRN 525
Query: 523 TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDE 567
+ L D LA +K+ M D ++DP ++ W W ++
Sbjct: 526 PEALNMTADILAQFKNEMTMQNYPETDPKSDPRFFDGYWDTWLEK 570
>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
Length = 591
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 237/548 (43%), Positives = 316/548 (57%), Gaps = 31/548 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+IIII+ADDLG++DVSF GS+ TPNIDALAY+G+ILN YV +CTPSR+AL+TGK
Sbjct: 34 KPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGK 93
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPI+ GMQH VI+ +PWGLPL E + + +E GY T +GKWHLGF + +TPT RGF
Sbjct: 94 YPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGF 153
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D H+GY DYY S + + Y G D R N++ + +G Y TD+ T+AA+ I +H
Sbjct: 154 DRHFGYLGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDHVGRYITDVLTDAAVKEIEDH 213
Query: 209 ---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
N S+P+FL L HLA HA N P QAP EEV++F I + R YA MVS+LD+SVG
Sbjct: 214 GSKNSSQPLFLLLNHLAPHAANDDNPMQAPAEEVSRFEYIGNKTHRYYAAMVSKLDQSVG 273
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
VI AL + ML+NSIVLF++DNG P+ G HS SN+PLRG K +PW+G +R AAIWS
Sbjct: 274 YVIDALARQEMLQNSIVLFLSDNGGPTQGQHSTTASNYPLRGQKDSPWEGALRSSAAIWS 333
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
++ V + +I D LPTL A AGI + LDG+N W L G ++ EI+H
Sbjct: 334 TEFERLGSVWKQQIYIGDLLPTLAAGAGISPDPALHLDGLNLWSALKYGYESVEREIVHV 393
Query: 385 ID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
ID +V P Y + K ++GT N G D W G E + P+ +Y +
Sbjct: 394 IDEDVAMPHLSYTRGK---WKVISGTTNQGLYDGWLG--HRETSEVDPRAAVYEELVRNT 448
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVK 502
+ + Q+ + I ELR+ A + C G C
Sbjct: 449 SVWRQLQQVS------------------FGERNISELRDQASIECPDTANGVRPCLPLEA 490
Query: 503 PCLFNITDDPCEQNNL-AE-SQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
PCLF+I DPCE++NL AE + L ++ + PP NKP D DP ++N
Sbjct: 491 PCLFDIEADPCERSNLYAEYKNSTTFLDLWARIQQFAKQSHPPNNKPGDPDCDPRFYHNE 550
Query: 561 WVPWYDEL 568
W W DE+
Sbjct: 551 WTWWQDEM 558
>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 579
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 234/550 (42%), Positives = 325/550 (59%), Gaps = 54/550 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII+ +ADDLGWNDV FHGS+QIPTPNIDALAYNG+ILN+HYV TPSR A TG Y
Sbjct: 31 PHIIVFMADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLY 90
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI IGMQ I GEP GLPL K+LP++L+ GY T IGKWH+G+ YTP RGFD
Sbjct: 91 PIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGYTTKLIGKWHMGYHTPQYTPLHRGFD 150
Query: 150 SHYGYWQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
+ +G++ YYD+ ++ ++G D H ++ + Y TDL+T+ AI +I
Sbjct: 151 TFFGFYNSHITYYDYEYSNQNMTGYDMHRGDDPAHGIKRE------YVTDLFTKEAIKII 204
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-KFLDISDPERRTYAGMVSRLDESV 264
H +P++L ++HLAVHA P + PD+ + + + I +P RR YA MVS+LDESV
Sbjct: 205 ENHELPRPLYLQISHLAVHA-----PIEQPDDSSSDEIIQIREPNRRKYAKMVSKLDESV 259
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G ++ AL + GML +S++LF+ DNGA S G + N GSN+PLRG K T ++GG+RGVAA+W
Sbjct: 260 GRIVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSNYPLRGTKYTLYEGGVRGVAALW 319
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
S L++ +V +L HI+DWLPTL +AAG ++ D +DG++QW VL++ R ++L
Sbjct: 320 SSRLEKAARVFKKLIHITDWLPTLYSAAGGDLKDLGEIDGIDQWRVLSEEQGRGREKLLL 379
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA---G 440
NID V K A+ K + G+ NG D++YGD+ ++ EV+ A
Sbjct: 380 NIDEV---MKTEGAI-YSRFKLLRGSIGNGYYDKYYGDSGRTLETPPYTEVVLKSAVSQS 435
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN------YDNKG 494
IT+ H P S +++LR A V+C+ Y +
Sbjct: 436 ITY------------HLGGPVTQPS----------TMIQLRREATVQCHPNVTYYYRHSF 473
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
CN V CLF+I +DPCE N+AE+ + + L+ L Y ++ KP D ADP
Sbjct: 474 TFCN--VTECLFDIVNDPCETKNIAEAYARIARDLDLYLEQYGRVLMKQIRKPVDWLADP 531
Query: 555 ARWNNIWVPW 564
R NN W PW
Sbjct: 532 RRRNNTWEPW 541
>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 575
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 253/578 (43%), Positives = 327/578 (56%), Gaps = 29/578 (5%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F L L D L K+PHI++I+ADDLGWNDV FHGS QIPTPNIDALAYNG I
Sbjct: 11 IFILIVNLQLTDYVLLRELKKRPHIVVIVADDLGWNDVGFHGSIQIPTPNIDALAYNGAI 70
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
LN+HYVQ CTPSR+AL+TGKYPI G+Q I G+ LP EK+LPQYLKE GY+TH
Sbjct: 71 LNRHYVQPTCTPSRAALLTGKYPIRYGLQGPPIASGKASALPTNEKILPQYLKELGYSTH 130
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLG +++ +TPT RGFDSH+GYW G Y + S AT G+D R +
Sbjct: 131 LVGKWHLGHYQKRFTPTKRGFDSHFGYWNGYISYRN-STHAT-RTMSGIDARRGFERAGN 188
Query: 188 TI--GIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNT----YEPFQAPDEEVA 240
+ Y+TD++TE A +I + ++ MFL ++HLAVH+GN+ E + A
Sbjct: 189 EMDRDRYATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSGPNHLEVLNKTYNDEA 248
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS---FGIHS 297
F I + RR YAGMV+ LD+SVG +I +L +H MLENSIV+F++DNGAP+ H
Sbjct: 249 -FGYIENENRRLYAGMVTSLDDSVGKIIESLHEHEMLENSIVVFISDNGAPTDDPLWGHD 307
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN 357
N GSN PLRG K++ +GG+RGVAAIWSPWLK+ VS+ LFHI+DWLPTL AAG
Sbjct: 308 NFGSNWPLRGKKASVLEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTLYTAAGGNFE 367
Query: 358 DTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG--TDNNGQ 414
D +DGVNQW LT +K RS+IL NID + AL + K V T N
Sbjct: 368 DLGKIDGVNQWKSLTDTSKQVRSKILINIDETRGEE----ALIFNQWKVVKSNRTSNMVF 423
Query: 415 SDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTD 474
+ ++ GD N KY+ V+ S G + + Q D N+S A
Sbjct: 424 NLKYAGDPGNAGPKYNMSGVVASTVGKRLSKINCLTQ-------DSCMNASTARDVFYVS 476
Query: 475 EKILELR-EFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKL 533
+ ++ + R + N CLF+I DPCE N+A+ L L
Sbjct: 477 RSLAKVDGQTCPERVSNPNPDQQYECFDGYCLFDILQDPCEYRNVAKQNPQTLDATIHML 536
Query: 534 AIYKSTMVPPGNKP-FDKRADPARWNNIWVPWYDELDK 570
+K M P D ADP R+ W PW + DK
Sbjct: 537 EQFKKEMTMQAPDPIIDPDADPRRFAGYWEPWLEPFDK 574
>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
Length = 576
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 232/544 (42%), Positives = 323/544 (59%), Gaps = 30/544 (5%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
T K P+IIII+ADD+G++D+S G + TPNIDALA++G +L++ Y A+CTPSR AL
Sbjct: 30 TGSKLPNIIIIMADDMGFDDLSIRGGREFLTPNIDALAFHGRLLDRLYAPAMCTPSRGAL 89
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
++GKYPIH G QH VI EPW L L L+P+ + AGY+T+ +GKWHLG+ R +TPT
Sbjct: 90 LSGKYPIHTGTQHFVISNQEPWSLKLNTTLMPEIFRAAGYSTNLVGKWHLGYARPEFTPT 149
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD HYGYW DYY + + Y G D R NM+V+ G+Y TDL T A
Sbjct: 150 HRGFDYHYGYWGAYIDYYQRRSQMPDKTYIMGYDFRRNMEVECADRGVYMTDLLTNEAER 209
Query: 204 VIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
VI E K +P+FL + HLAVH GN +P Q P+EE+ KF I DP R YA MVS+LD+
Sbjct: 210 VIQETAAKQQPLFLMINHLAVHTGNDNDPLQVPEEELQKFTHIKDPNHRKYAAMVSKLDK 269
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SVG VI AL + LENSIV+F +DNGAPS G+ +N GSN PL+G K++PW+GG+R A
Sbjct: 270 SVGRVIGALARAEQLENSIVIFYSDNGAPSVGMFANTGSNWPLKGQKNSPWEGGLRVAGA 329
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLT--KGAKTKRS 379
IWSP LK V ++ ++ D+LPTL AAGIE+ D+ LDG++ W L+ K A
Sbjct: 330 IWSPLLKARSSVFTQPIYVGDFLPTLAHAAGIELPDSLQLDGIDLWPQLSGPKEAPHVPR 389
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
EILH++D+V N ++L + KYV GT GQ D+ E+D P+ Y
Sbjct: 390 EILHSLDDVWN----VSSLLLGHWKYVNGTTLAGQYDQQL--VQRELDDLDPRANRY--- 440
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
++ L + D + + ++ I+ D A+V C +G CN+
Sbjct: 441 ---VVEVRNSLASRALARFDQQRLTQHRMKNIMID---------AKVDCGDMQRG--CNA 486
Query: 500 TVKPCLFNITDDPCEQNNLA--ESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARW 557
++ CL++++ DPCE+NNLA E ++L + ++ ++ VP N P A+P
Sbjct: 487 LLEECLYDMSVDPCERNNLAYSEQHAEILATMRQRIKQLRAGAVPIQNLPGVAYANPTLH 546
Query: 558 NNIW 561
W
Sbjct: 547 ECTW 550
>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
Length = 564
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 237/549 (43%), Positives = 323/549 (58%), Gaps = 52/549 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII+ +ADDLGWNDV FHGS+QIPTPNIDALAYNG+ILN+HYV TPSR A TG Y
Sbjct: 31 PHIIVFMADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLY 90
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI IGMQ I GEP GLPL K+LP++L+ GY T IGKWH+GF YTP RGFD
Sbjct: 91 PIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHRGFD 150
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTEAAINVIA 206
+ +G++ YYD+ + + + G DM D+ G+ Y+TDL+T AI +I
Sbjct: 151 TFFGFYNSHITYYDY--EYSNQNMTGYDMHCG---DDPAYGMKREYATDLFTNEAIKIIE 205
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-KFLDISDPERRTYAGMVSRLDESVG 265
H +P++L ++HLAVHA P + PD+ + + I +P RR YA MVS+LDESVG
Sbjct: 206 NHELPRPLYLQISHLAVHA-----PIEQPDDSSRDEIVQIREPNRRKYAKMVSKLDESVG 260
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
V+ AL + GML +S++LF+ DNGA S G + N GSN+PLRG K T ++GG+RGVAA+WS
Sbjct: 261 RVVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSNYPLRGTKYTLYEGGVRGVAALWS 320
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
L++ +V +L HI+DWLPTL +AAG ++ D +DG++QW VL++G R ++L N
Sbjct: 321 SRLEKGARVFKKLIHITDWLPTLYSAAGGDLKDLGKIDGIDQWRVLSEGQGHGREKLLLN 380
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA---GI 441
ID V + K + G+ NG D++YGD+ ++ EV+ A I
Sbjct: 381 IDEVMITE----GAIYSRFKLLRGSVGNGYYDKYYGDSGRTLETPPYTEVVLKSAVSQSI 436
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN------YDNKGA 495
T+ H P S +++LR A V+C+ Y +
Sbjct: 437 TY------------HLGGPVTQPS----------TMVQLRREATVQCHPNMSYYYRHSFT 474
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
CN V CLF+I +DPCE N+AE+ + + L+ L Y ++ KP D ADP
Sbjct: 475 FCN--VTECLFDIVNDPCETKNIAEAYARIARDLDLYLEHYGRVLMKQIRKPVDWLADPK 532
Query: 556 RWNNIWVPW 564
R NN W PW
Sbjct: 533 RRNNTWEPW 541
>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
Length = 558
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 229/566 (40%), Positives = 324/566 (57%), Gaps = 43/566 (7%)
Query: 14 TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
+L+ N + + +PHII+I+ DD+G ND+ ++QIPTPNIDAL YNG++L+++YV
Sbjct: 11 SLIQNSSQVCRVRRPRPHIIVIMGDDMGHNDIGLR-TNQIPTPNIDALGYNGVVLDRYYV 69
Query: 74 QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
Q CTPSR+A +TG YPI MQ I+ GE LPL L+PQ+LK GY TH +GKWH
Sbjct: 70 QNACTPSRAAFLTGNYPIRSAMQGLPIVAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWH 129
Query: 134 LGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS 193
LG TPT +GFDSH+GYW G YYD+ +G D+ + + G Y+
Sbjct: 130 LGSAYRSSTPTEKGFDSHFGYWNGFTGYYDYFTDFNSTAIEGFDLHDRFETERGYQGQYA 189
Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK--FLDISDPERR 251
T ++TE A+++I HN ++P+FL + HLA HAG P+E A+ + I DP RR
Sbjct: 190 TRVFTERALDIIEGHNTTRPLFLLMTHLAAHAGRDGTELGVPNEVEAQRTYSYIQDPRRR 249
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
YA +V+ LD S+G V+ L + MLENSI+LF +DNGAP+ G ++N GSN PLRG+K T
Sbjct: 250 LYAEIVAELDRSIGQVVRKLSERQMLENSIILFFSDNGAPTVGPYTNSGSNWPLRGIKLT 309
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVL 370
++GG+RG A I+SP LK+ V+ EL H+SDWLPT AAAG + D +DGVNQW L
Sbjct: 310 NFEGGIRGTATIFSPLLKKRGYVNKELIHVSDWLPTFYAAAGGNLADLGPIDGVNQWPTL 369
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDD--LKYVAGTDNNGQSDEWYGDTDNEIDK 428
+ + RSEIL NI+ DN ++ D+ K V G G D ++GD+ D
Sbjct: 370 SLDTPSPRSEILVNINEQDN----TTSIITDNGRFKLVTGAFEGGTYDGYFGDSGRSPDT 425
Query: 429 --YSPKEVLYSKAGITFNALK----TKLQIKQKHAA-DPKANSSDALRTILTDEKILELR 481
Y P VL S+ I L T+ QI+ A D +D+ R L
Sbjct: 426 PPYDPFAVLQSETNIAIQELTQTPITRQQIRVTRAQIDLSWCRNDSFRPPL--------- 476
Query: 482 EFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
+C+ +PCLF++ +DPCE N+ E ++ ++L ++A + +V
Sbjct: 477 --------------NCS---QPCLFDLENDPCETTNIIEENPEIAERLLRRIADFFEVLV 519
Query: 542 PPGNKPFDKRADPARWNNIWVPWYDE 567
P G+ D A+P R+NN W W D+
Sbjct: 520 PQGDSTIDPNANPQRYNNTWCTWLDD 545
>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 529
Score = 421 bits (1083), Expect = e-115, Method: Compositional matrix adjust.
Identities = 239/557 (42%), Positives = 316/557 (56%), Gaps = 32/557 (5%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
+ L A + ++ P+II ILADDLGW DVSF G QIPTPN+D LA G+ILN
Sbjct: 1 MCLILALQTAVFSKSSTVPPNIIFILADDLGWADVSFRGDPQIPTPNLDVLASQGIILNN 60
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
+YVQ LC PSR ALM+G YPIH G+QH V GEPWGLP ++P+YLK GYATH IG
Sbjct: 61 YYVQPLCAPSRGALMSGLYPIHTGLQHLVPGPGEPWGLPTNLTIMPEYLKNLGYATHMIG 120
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
KWHLG+ +E YTPT RGFDS YGY G +DYYDH+ + GLD N
Sbjct: 121 KWHLGYHKESYTPTRRGFDSFYGYLNGGEDYYDHTI--LWSNASGLDFWENTTPVRNEGN 178
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
YST+L+T+ A ++I H+ +KPMFLY +H AVH G+ +AP +A F I + R
Sbjct: 179 HYSTELFTKKAQSLIKHHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNR 238
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
+AG V LD+SVG V+ AL K GML NSIV+F DNG +G+ N G N PLRG K
Sbjct: 239 SIHAGAVYELDKSVGLVMEALNKRGMLSNSIVIFSTDNGGLPWGVEPNSGYNWPLRGSKE 298
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDV 369
T W+GG RG A +WSP L ++ ++S+++ HI+DWLPTL +AAG ++ ++DG + W
Sbjct: 299 TNWEGGARGAAFVWSPLLFKSGRLSNQMMHITDWLPTLYSAAGGNVSTLGNIDGKDMWKA 358
Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG--DTDNEID 427
L++ ++ R E+L NID ++N +AL V K V G+ N G D +D
Sbjct: 359 LSEDLESPRQEVLINIDPIENS----SALIVGRHKVVLGSFNEGSHDMRMKAPGGSRPVD 414
Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
+ +L S+ G LK ++Q LT R A VR
Sbjct: 415 GLD-QMMLSSRTG---KVLKDFYNVRQ-----------------LTVRP--NWRNEAVVR 451
Query: 488 CNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKP 547
C+ + + P F++ DPCE NNLA S L++L K+ Y MVPP NKP
Sbjct: 452 CDRYAPPNNFVAASPPYYFDLEHDPCELNNLAASNVTELEELIKKIKEYAKGMVPPANKP 511
Query: 548 FDKRADPARWNNIWVPW 564
D R P + +W PW
Sbjct: 512 LDPRGLPKYNHGLWGPW 528
>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 233/551 (42%), Positives = 323/551 (58%), Gaps = 56/551 (10%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
N + P+I++I+ADDLGWNDVSFHG+ +IPTPNIDALAYNG+ILN+HYV LCTPSR+
Sbjct: 7 NDVPKQPPNIVVIMADDLGWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPLCTPSRT 66
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
A +TGKYPI GMQ V+ EP G+PL + LLP+YL++ GYATH +GKWH+G+ + YT
Sbjct: 67 AFLTGKYPIRTGMQGYVLQPAEPRGIPLNDTLLPEYLRKLGYATHLVGKWHVGYHTKNYT 126
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTE 199
PT RGFD+ GY+ G Y++H+ + Y G D H + +N+TI Y TD+ T+
Sbjct: 127 PTRRGFDTFLGYYNGYIHYFNHTILDEEQKYLGYDF-HRIVGENRTIEYRYDYITDIITD 185
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD--EEVAKFLDISDPERRTYAGMV 257
N+I HN +KP++L ++H A H+G Q D E A I D RR YA +V
Sbjct: 186 EVENIIFSHNPAKPLYLQVSHDAAHSGGIGIEMQVRDWKETNATLGYIEDINRRKYASVV 245
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
+ LDESVG +I ALRK ML+NSI++F++DNGA + G N GSN+PLRG+K + ++GG+
Sbjct: 246 ATLDESVGRIIDALRKTDMLKNSIIVFISDNGAQTEGFLQNYGSNYPLRGLKFSLFEGGV 305
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKT 376
RG A I+SP + + +VS++LFHI+DWLPTL +AAG +D LDG +QW + +
Sbjct: 306 RGAAYIYSPLIDRLSRVSTQLFHITDWLPTLYSAAGGNSSDLKQLDGFDQWSAIKSAEDS 365
Query: 377 KRSEILHNIDNVDNP--QKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
KR IL NID D QKYY YS
Sbjct: 366 KRKSILINIDERDTSEYQKYY--------------------------------NYSGNNA 393
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT-ILTDEKILELREFARVRCNYDNK 493
LY K +NA+ A P A++ ++ T +L KI++LR+ A + C N
Sbjct: 394 LYPK----YNAINI--------LASPVASAIASISTSMLNANKIMQLRKEATIVCK--NF 439
Query: 494 GAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRAD 553
N T + CLF++ +DPCE +L+ +++L + YKS +V N P D
Sbjct: 440 MDFSNCTNRSCLFDVKEDPCETTDLSAKYPKEVERLNMFIDRYKSVLVKQPNTPIDPAGY 499
Query: 554 PARWNNIWVPW 564
+NN W+PW
Sbjct: 500 AYHFNNTWIPW 510
>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 579
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 233/548 (42%), Positives = 322/548 (58%), Gaps = 48/548 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII+ +ADDLGWNDVSFHGS+QIPTPNIDAL YNG+ILN+HYV TPSR+A TG+Y
Sbjct: 31 PHIIVFMADDLGWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQY 90
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI IGMQ I GEP GLPL K+LP++L+ GY T IGKWH+G++ YTP RGFD
Sbjct: 91 PIRIGMQGADIRGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFD 150
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTEAAINVIA 206
+ G++ YYD++ + G DM D+ G+ Y+TD++T AIN+I
Sbjct: 151 TFLGFYNSYISYYDYNYSN--QNMSGYDMHRG---DDPAYGMNREYATDMFTSEAINIIE 205
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAP-----DEEVAKFLDISDPERRTYAGMVSRLD 261
H ++P++L L+HLAVH+ P + P D E + I +P RR YA MVS+LD
Sbjct: 206 NHELNRPLYLQLSHLAVHS-----PLEQPANVYNDREP---IHIREPNRRKYAKMVSKLD 257
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
ESVG + AL + GML++S++LF+ DNGA S G + N GSN+PLRG K T ++GG+RGVA
Sbjct: 258 ESVGRIFHALGEKGMLKDSLILFLTDNGAASVGRYRNYGSNYPLRGTKYTLYEGGVRGVA 317
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
A+WS L++ +VS+EL HI+DWLPTL +AAG ++ D +DG++QW +L++G R +
Sbjct: 318 ALWSSRLQRAARVSNELVHITDWLPTLYSAAGGDLKDLGEIDGIDQWRLLSEGRGRGRDK 377
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
+L NID V K A+ K + GT +G D +YGD+ ++ + K+
Sbjct: 378 LLLNIDEV---LKTEGAI-YSRFKLLRGTIESGYYDGYYGDSGKTLETLPTYTDIVMKST 433
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN----YDNKGAH 496
++ + H P S +R LR+ A V+CN Y +
Sbjct: 434 VS--------ESLTYHLGGPVTQPSTMVR----------LRQDATVQCNANFTYYRRQML 475
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
V CLF+I +DPCE N+A++ + + L L Y +V KP D ADP
Sbjct: 476 TPCNVTECLFDIVNDPCETRNIADTYVRIARDLNYYLEQYSQILVKQTKKPIDWSADPRN 535
Query: 557 WNNIWVPW 564
N W PW
Sbjct: 536 KNGTWGPW 543
>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 533
Score = 420 bits (1080), Expect = e-115, Method: Compositional matrix adjust.
Identities = 234/553 (42%), Positives = 319/553 (57%), Gaps = 48/553 (8%)
Query: 22 LNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
L +A KKP +II +L DDLGW+DVSFHGS QIPTPN+D LA +G+ILN +YVQ CTPS
Sbjct: 17 LGVSASKKPPNIIFMLVDDLGWDDVSFHGSPQIPTPNMDVLAGDGVILNNYYVQHFCTPS 76
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R+ALMTG YPIH G+Q VI +P+GLPL K++P++ K+ GY TH IGKWHLG+FR+
Sbjct: 77 RAALMTGLYPIHNGLQDFVIDVAQPYGLPLYLKVMPEFFKDMGYETHMIGKWHLGYFRKE 136
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
YTPT+RGFDS YGY+ G +DYY+HS Q ++R V K I Y + TE
Sbjct: 137 YTPTYRGFDSFYGYYNGAEDYYNHSITKVIS--QSYNIRQG-SVTKKRIENYIKN--TEL 191
Query: 201 AINVIAEHNK---SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
+ + + S+P+FLYLA+ +VH P +AP+E + KF I + R +AG V
Sbjct: 192 VLLSLTFYLSILFSQPLFLYLAYQSVHG-----PLEAPEENIMKFPYIGEENRTIFAGAV 246
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LD+SVG V+ L + ML+N+I++ DNG FG+ SN+G N PLRG K T W+GG
Sbjct: 247 DALDQSVGEVLETLYEADMLDNTIIVLSTDNGGLPFGVQSNRGYNFPLRGAKGTLWEGGT 306
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL---DGVNQWDVLTKGA 374
RG A IWSP L Q ++VS ++ HI+DWLPTL +AAG+ N SL DG + W L+
Sbjct: 307 RGSAFIWSPLLNQNRRVSDQMLHITDWLPTLYSAAGMRGNPVSLGKVDGYDMWTQLSYNL 366
Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
+ R E+L+NID P +ALR + K V GT +G+ D+ + T Y E+
Sbjct: 367 PSSRYEVLYNID----PTDLNSALRYTNYKLVLGTYEDGEFDQRFRTTGGS-RPYGDLEI 421
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL---ELREFARVRCNYD 491
A ++ ++ L+ L R+ A V C +
Sbjct: 422 -----------------------AMAQSKAARVLKGFYNRGNFLLPRNWRQKATVNCVSN 458
Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
N + S P LF++T DPCE NN+A ++ L+ L+ +L Y S+ + P NKP D R
Sbjct: 459 NGPKNFVSQAPPYLFDLTKDPCELNNIASAEAGLVDFLKQRLQSYASSSIAPLNKPTDPR 518
Query: 552 ADPARWNNIWVPW 564
P N W PW
Sbjct: 519 GYPENLNGTWGPW 531
>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
Length = 563
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 223/545 (40%), Positives = 330/545 (60%), Gaps = 40/545 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P+II+I+ADDLGWNDVSFHG+ QIPTPNIDALAYNG+IL +HYV +CTPSR+A +TG
Sbjct: 37 QRPNIIVIMADDLGWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTG 96
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+YPI GMQ + GE +PL LLP+YL++ GYATH +GKWH+G++ + +TP +RG
Sbjct: 97 RYPIRTGMQGYPLKAGEERAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRG 156
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM--QVDNKTIGIYSTDLYTEAAINVI 205
FD+ GY+ G Y+ H+ + + G D+ +++ + K Y TDL TE A ++I
Sbjct: 157 FDTFLGYYSGYITYFKHTIEQNL--HVGYDLHYDVAGNLSVKYSHEYMTDLITERAEDII 214
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV--AKFLDISDPERRTYAGMVSRLDES 263
HN+SKP++L L+H+A H+ + + DEE A I D +RR AG+V+ +DES
Sbjct: 215 FNHNRSKPLYLQLSHVAAHSSDAKANMEVRDEEETNATLGYIEDFDRRKLAGVVTAMDES 274
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VG V+ ALR+ MLENSI++FM+DNGA + G+ +N GSN+PLRG+K T ++GG+RGVA +
Sbjct: 275 VGRVVQALRQTNMLENSIIVFMSDNGAQTVGLLNNYGSNYPLRGLKFTLFEGGVRGVACV 334
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAKTKRSEI 381
+SP +K ++S++L H++DWLPT +AAG + D +LDGV+QW + KT+R +
Sbjct: 335 YSPAIKNPSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLDGVDQWATIVSEKKTRRKNV 394
Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD--TDNEIDKYSPKEVLYSKA 439
L NI D Q AL + K + G N ++YGD T +Y+ VL+S A
Sbjct: 395 LLNI---DEKQALSGAL-MGRYKLINGA--NVSYGDYYGDSGTSESYPEYNVSNVLHSAA 448
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
G A+ S+A E+I+ELR A V C+ C
Sbjct: 449 GSAL------------------ASVSNA---TFNSERIIELRRAATVVCDNFTSYPRC-- 485
Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
+ CLF++ DPCE +++ +++ +L +A Y + + + D R+ P +N
Sbjct: 486 -LDRCLFDVYKDPCETTDVSREHPEVVDELNRFIAEYFAVLKTQTSGLVDPRSYPEHFNG 544
Query: 560 IWVPW 564
W+PW
Sbjct: 545 TWMPW 549
>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
Length = 548
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 243/560 (43%), Positives = 327/560 (58%), Gaps = 33/560 (5%)
Query: 36 LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM 95
+ADD+G++DVSF GS+ TPNIDALAY+G+ILN Y A+CTPSR+AL+TGKYPI+ GM
Sbjct: 1 MADDMGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGM 60
Query: 96 QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
QH VI+ +PWGLP EK + + +E GY T +GKWHLG + +TPT RGFD H GY
Sbjct: 61 QHYVIVNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYL 120
Query: 156 QGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKS 211
DYYD + + + Y +G D R N+ V + +G Y TD+ ++AA+ +I +H N S
Sbjct: 121 GAYVDYYDQTYQQNGKNYARGHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180
Query: 212 KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAAL 271
+P+FL L+HLA HA N +P QAP EE+AKF I + R YA MVSRLD SVG VI AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRYYAAMVSRLDRSVGRVIDAL 240
Query: 272 RKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT 331
+ ML++SIVLF++DNG P+ G HS+ SN+PLRG K++PW+GG+R AAIWS ++
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSSTASNYPLRGQKNSPWEGGLRSSAAIWSTEFERL 300
Query: 332 QKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDN 390
V + +I D LPTL AAAGI + D LDG+N W L G ++ EI+HNID +
Sbjct: 301 GTVWKQQTYIGDLLPTLAAAAGIAPDPDLKLDGMNLWSALKYGYESLEREIVHNIDM--D 358
Query: 391 PQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPKEVLYSKAGITFNALKTK 449
P Y A K V GT N D W G+ E+D P+ Y + I ++
Sbjct: 359 PSAPYLAYTRGKWKVVNGTTNAALFDGWLGERVTREVD---PRSANYEEL-IRNTSVWQH 414
Query: 450 LQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH--CNSTVKPCLFN 507
LQ+ L + I LR+ A++ C + GA C + PCLF+
Sbjct: 415 LQL---------------LMGSQNERNITSLRQQAQIECP-EPPGATKPCLPLLAPCLFD 458
Query: 508 ITDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWY 565
I DPCE+NNL E T + L ++ + S P N P D DP ++N W W
Sbjct: 459 IDADPCERNNLYEDYRNTTIFLDLLQRIEQFASQAHAPNNMPEDPDCDPRFYHNEWTWWQ 518
Query: 566 DELDKQKAIE-TMRLHKPLL 584
DE AI +++ PLL
Sbjct: 519 DEQASGAAIMWSVKRLNPLL 538
>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
Length = 593
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 242/582 (41%), Positives = 325/582 (55%), Gaps = 55/582 (9%)
Query: 4 ARKYFFALTCTLLFNDA--FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
R L C LLF DA F +T PHII+ + DDLGWNDVSFHGS+QIPTPNIDAL
Sbjct: 7 GRSNLLVLCCILLFGDALGFYDTA----PHIIVFMVDDLGWNDVSFHGSNQIPTPNIDAL 62
Query: 62 AYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE 121
YNG+ILN+HYV TPSR+A TG YPI IGMQ I GEP GLPL ++LP+YL+
Sbjct: 63 GYNGIILNRHYVLPSSTPSRAAFFTGLYPIRIGMQGEGIQGGEPRGLPLNIRILPEYLRG 122
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
GY T IGKWH+G+ +TP RGFD+ G++ YYD+ + +++ G DM
Sbjct: 123 LGYTTKLIGKWHVGYHTPQHTPLHRGFDAFLGFYNSHVSYYDY--RYSYQNMSGYDMHRG 180
Query: 182 MQVDNKTIGI---YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
DN G+ Y+TDL+T+ A+ +I H +P++L ++HLAVHA P ++PD++
Sbjct: 181 ---DNPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQISHLAVHA-----PIESPDDD 232
Query: 239 -----VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
+F I + RR YA MV+RLDESVG ++ L GML +S++LF+ DNGAP+
Sbjct: 233 HRNSNRERFKHIPEVNRRNYASMVTRLDESVGRIVHTLGNRGMLRDSLILFLTDNGAPTI 292
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
G N GSN PLRGMK T ++GG+RGVA +WSP L++ +V +EL HI+DWLPT +AAG
Sbjct: 293 GKFRNWGSNLPLRGMKYTLYEGGVRGVAVLWSPRLRKAARVCNELVHITDWLPTFYSAAG 352
Query: 354 IEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
++ D +DGVNQW +L G R +L NID V + K + G+
Sbjct: 353 GDLKDLGEIDGVNQWRMLNDGEPVARQSLLLNIDEVSRTE----GAIYKSFKLLRGSIEG 408
Query: 413 GQSDEWYGDTDN------EIDKYSPKEVLYSKAGITFNALKTKL-QIKQKHAADPKANSS 465
G D ++ + E K P EV T LK+ + Q H P S
Sbjct: 409 GHYDGYHTIREPSWIEHVEHKKNEPTEV----PSYTETVLKSLVSQSITYHLGGPVTQPS 464
Query: 466 DALRTILTDEKILELREFARVRCNYD---NKGAHCNSTVKPCLFNITDDPCEQNNLAESQ 522
I +LR+ A V C + N+ CN V CLF+I +DPCE N+A+
Sbjct: 465 ----------TINQLRQEATVHCRSNGSYNRFTTCN--VTECLFDINNDPCETKNIAQQY 512
Query: 523 TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
+ ++L+ L Y + P D ADP R N W PW
Sbjct: 513 PRIARELDLFLEKYGRNVTRQMKVPVDWLADPRRTNGTWEPW 554
>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
Length = 563
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 218/544 (40%), Positives = 326/544 (59%), Gaps = 40/544 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+II+I+ADDLGWNDVSFHG+ QIPTPNIDALAYNG+IL +HYV +CTPSR+A +TG+
Sbjct: 38 RPNIIVIMADDLGWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGR 97
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YPI GMQ + GE +PL LLP+YL++ GYATH +GKWH+G++ + +TP +RGF
Sbjct: 98 YPIRTGMQGHPLDPGEVRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGF 157
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM--QVDNKTIGIYSTDLYTEAAINVIA 206
D+ GY+ G Y++H+ + + G D+ +++ + K Y TDL TE A ++I
Sbjct: 158 DTFLGYYSGFMTYFNHTIEQNH--HVGYDLHYDVAGNLSVKYSHEYMTDLITERAEDIIL 215
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV--AKFLDISDPERRTYAGMVSRLDESV 264
HN SKP++L L+H+A H+ N + + DEE A I D +RR AG+V+ LDESV
Sbjct: 216 NHNHSKPLYLQLSHIAAHSSNINKTVEVRDEEETNATLGYIEDFDRRKLAGVVTALDESV 275
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G V+ AL++ MLENSI+LFM+DNGA + G+ N GSN+PLRGMK T ++GG+RG A ++
Sbjct: 276 GRVVQALKQANMLENSIILFMSDNGAQTVGLLDNSGSNYPLRGMKFTLFEGGVRGAACVY 335
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAKTKRSEIL 382
SP +K ++S++L H++DWLPT +AAG + D +LDGV+QW + T+R IL
Sbjct: 336 SPAIKNPSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLDGVDQWATIVSEENTRRKNIL 395
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT--DNEIDKYSPKEVLYSKAG 440
NID ++ + + K + G Q ++YGD+ +Y+ VL+S AG
Sbjct: 396 LNIDEKED----LSGALMGRYKLINGA--KLQYGDYYGDSGMSESYPEYNVSNVLHSAAG 449
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
++ L E+I+ELR A V C+ C
Sbjct: 450 FAVASVSN---------------------VTLNSERIIELRRAAMVVCDNFTSYPKC--- 485
Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
+ CLF++ +DPCE +++ +++ ++ +A Y + ++ D R+ P +N
Sbjct: 486 LDRCLFDVYNDPCETTDVSREHPEVVDEINRFIAEYFTVLMTQTPTKTDPRSYPENFNGT 545
Query: 561 WVPW 564
W+PW
Sbjct: 546 WMPW 549
>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 575
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 235/547 (42%), Positives = 324/547 (59%), Gaps = 48/547 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII+I+ADDLGWNDV FHGS+QIPTPNIDAL YNG+ILN+HYV TPSRSA +TG Y
Sbjct: 27 PHIIVIMADDLGWNDVGFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRSAFLTGLY 86
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI IGMQ I GEP GLPL K+LP++L++ GY T IGKWH+G+ YTP RGFD
Sbjct: 87 PIRIGMQGDGIRGGEPRGLPLDIKILPEHLRDLGYTTKLIGKWHMGYHTPQYTPLHRGFD 146
Query: 150 SHYGYWQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
G++ YYD+ + ++ ++G D H + + Y+TDL+T+ A+ +I
Sbjct: 147 FFLGFYNSHISYYDYHYSNQNMSGYDLHRGDDPAHGINRE------YATDLFTKEAVRMI 200
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV--AKFLDISDPERRTYAGMVSRLDES 263
H +P++L ++HLAVHA P + P +E +F I +P RR YA MVSRLD+S
Sbjct: 201 ETHELPRPLYLQISHLAVHA-----PLEQPRDEYNDGRFSYIREPNRRKYARMVSRLDDS 255
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VG ++ AL + GML +S++LF+ DNGA S G + N GSN+PLRG K T ++GG+R VAA+
Sbjct: 256 VGRIVHALGEKGMLRDSLILFLTDNGAASVGRYRNWGSNYPLRGTKYTLYEGGVRSVAAL 315
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEIL 382
WSP L++ +VS++L HI+DWLPTL +AAG ++ D +DG++ W VL++G R ++L
Sbjct: 316 WSPRLQKVARVSNQLIHITDWLPTLYSAAGGDLKDLGQIDGIDLWRVLSEGYGRGRDKLL 375
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDN--EIDKYSPKEVLYSKAG 440
NID + QK A+ K + G+ NG D +YGD+ EI Y+ VL S
Sbjct: 376 LNIDEI---QKTEGAI-YSRFKLLRGSVENGYYDGFYGDSGRTLEIPPYT-DVVLRSSVS 430
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN---YDNKGAHC 497
Q H P S +++LR A V+C Y + A
Sbjct: 431 ----------QSITYHLGGPVTQPS----------TMIQLRRDATVQCQNMTYHYRHAFT 470
Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARW 557
V CLF+I +DPCE N+A++ + + L+ L Y ++ P D ADP R
Sbjct: 471 TCNVTECLFDIVNDPCETKNVADAYPRIARDLDLYLEHYGHILMRQIKHPVDWLADPRRR 530
Query: 558 NNIWVPW 564
NN W PW
Sbjct: 531 NNTWEPW 537
>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
Length = 548
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 238/548 (43%), Positives = 318/548 (58%), Gaps = 30/548 (5%)
Query: 36 LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM 95
+ADD+G++DVSF GS+ TPNIDALAY+G+ILN Y A+CTPSR+AL+TGKYPI+ GM
Sbjct: 1 MADDMGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGM 60
Query: 96 QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
QH VI+ +PWGLP EK + + +E GY T +GKWHLG + +TPT RGFD H GY
Sbjct: 61 QHYVIVNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYL 120
Query: 156 QGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKS 211
DYYD + + + Y +G D R N+ V + +G Y TD+ ++AA+ +I +H N S
Sbjct: 121 GAYVDYYDQTYQQNGKNYARGHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180
Query: 212 KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAAL 271
+P+FL L+HLA HA N +P QAP EE+AKF I + R YA MVSRLD SVG VI AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRHYAAMVSRLDRSVGRVIDAL 240
Query: 272 RKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT 331
+ ML++SIVLF++DNG P+ G HS SN+PLRG K++PW+GG+R AAIWS ++
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSTTASNYPLRGQKNSPWEGGLRSSAAIWSTEFERL 300
Query: 332 QKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDN 390
V + +I D LPTL AAAGI + D LDG+N W L G ++ EI+HNID +
Sbjct: 301 GTVWKQQTYIGDLLPTLAAAAGIAPDPDLKLDGMNLWSALKYGYESLEREIVHNIDM--D 358
Query: 391 PQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPKEVLYSKAGITFNALKTK 449
P Y A K V GT N D W G+ E+D P+ Y + I ++
Sbjct: 359 PSAPYLAYTRGKWKVVNGTTNAALFDGWLGERVTREMD---PRSANYEEL-IRNTSVWQH 414
Query: 450 LQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVKPCLFNI 508
LQ+ L + I LR+ A++ C + C + PCLF+I
Sbjct: 415 LQL---------------LMGSQNERNITSLRQQAQIECPEPSGATKPCLPLLAPCLFDI 459
Query: 509 TDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
DPCE+NNL E T + L ++ + S P N P D DP ++N W W D
Sbjct: 460 DADPCERNNLYEDYRNTTIFLDLLQRIEQFASQAHAPNNMPEDPDCDPRFYHNEWTWWQD 519
Query: 567 ELDKQKAI 574
E AI
Sbjct: 520 EQASGAAI 527
>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
Length = 542
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 237/566 (41%), Positives = 326/566 (57%), Gaps = 51/566 (9%)
Query: 16 LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA 75
+F N+ P PHII ILADD GW+DV FHGS +IPTPN+DALA +G+IL+ HY Q
Sbjct: 12 FLTGSFGNSHRP--PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQP 69
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
LCTPSR +L+TGK+PI IG+Q GVI +P+GL L EKLLP+YLK GY +H +GKWHLG
Sbjct: 70 LCTPSRGSLLTGKHPIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLG 129
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
FF + YTP RGFDSHYG++ +DY H GLD N Q D G YST
Sbjct: 130 FFADEYTPMRRGFDSHYGFYGASEDYMTH-----IGGMGGLDFWLNGQPDRSGQGHYSTT 184
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP---FQAPDEEVAKFLDISDPERRT 252
L+T A ++AEHN+++PMFLY +H AVH + E AP + KF I+D E +
Sbjct: 185 LFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQH 244
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---GSNHPLRGMK 309
AGM+S LDESVGN+ +L +GMLEN+I++F +DNGAP H N SN+PLRG K
Sbjct: 245 MAGMISALDESVGNLTKSLHTNGMLENTIIIFSSDNGAP----HENTDVCSSNYPLRGAK 300
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD--GVNQW 367
++ W+GG R A +WSP LK++ VS ++ HISDWLPTL AAG ++ D GV+QW
Sbjct: 301 TSVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLEAAGYNMSALPGDLYGVSQW 360
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV----AGTDNNGQSDEWYGDTD 423
L + + R+ +L+ N D+ ++ ALRV D+K + A ++G + +
Sbjct: 361 QALQENGPSARNSMLY---NADHKKQGSWALRVGDMKLLQAEGATQLHSGWQQPFAINLQ 417
Query: 424 NEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF 483
+++ P Y +Q +Q A +R T + F
Sbjct: 418 SQLPSNEPNNTNY-------------VQEEQSQVASILEELGRGMRNSTT-------KYF 457
Query: 484 ARVRCNYDNKGA---HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTM 540
V C + + G C T +PCL+N+T DPCE +NLA+ +++++LE L
Sbjct: 458 --VNCGFISSGVTSHSCIDTGRPCLYNVTADPCEYHNLAKDLPEVVERLEASLDDLLKMY 515
Query: 541 VPPGNKPFDKRADPARWNNIWVPWYD 566
+ P NKP D A P +W+PW +
Sbjct: 516 IEPINKPADPAAYPKNHGGVWMPWIE 541
>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
Length = 581
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 241/595 (40%), Positives = 340/595 (57%), Gaps = 53/595 (8%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
A+ C L + P PHI+IILADD+GWNDVSFHG+++IPTPNIDALAYNG+ILN
Sbjct: 15 AIVCFLGYGTLAWGKAIPLPPHIVIILADDMGWNDVSFHGANEIPTPNIDALAYNGVILN 74
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
++Y +CTPSRSALMTG+YPI GMQ + EP G+PL L+P+ ++ GY T +
Sbjct: 75 KYYTMPICTPSRSALMTGRYPIRDGMQGTPMRPAEPRGIPLNVSLMPEQMRRLGYETRLV 134
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNK 187
GKWHLG+ E YTP RGFD+ +GY+ G YYD+ T E G D+ H + D+
Sbjct: 135 GKWHLGYTTEDYTPVRRGFDTFFGYYNGFISYYDYWIGWNDTNE-VTGYDL-HRDESDSF 192
Query: 188 TIG---IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY--EPFQA--PDEEVA 240
+ Y TDL T+ A +I + +KP+FL ++HLAVHAG+ +P + D+ A
Sbjct: 193 ELAHSSEYFTDLITDEAEKIIRNNKNAKPLFLEISHLAVHAGSKVHDDPLEVRRTDDVNA 252
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
F I D + R YAGM++ LDESVG V+ AL++ MLENSI++FM+DNGAP+ G+++N G
Sbjct: 253 SFPYIEDYQHRKYAGMMAALDESVGRVVKALKEAEMLENSIIIFMSDNGAPTVGLYNNTG 312
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
SN+P+RG+K ++G R A I+SP +K +VS EL HI DWLPTL AAG D
Sbjct: 313 SNYPMRGIKGGMFEGAARAAACIFSPLIKAHSRVSEELMHIVDWLPTLYTAAGGNPMDLQ 372
Query: 361 --------LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
LDGV+QW + G + R +L NID Q + AA+ + K V G
Sbjct: 373 SQFDGALPLDGVSQWSSIVAGGPSSRQSLLVNIDEA---QGFEAAI-IGRHKLVKGMT-- 426
Query: 413 GQSDEWYGDTDNE--IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT 470
+ D +YG++ N+ Y+ K+VL S AG + L A P A
Sbjct: 427 -KEDGYYGNSGNDPSFPAYNVKKVLSSTAGASIGKLAG--------FASPSA-------- 469
Query: 471 ILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
+ L LR+ + + C A+C+ T CLF+++ DPCE +L+ ++K+LE
Sbjct: 470 ----RRALWLRQKSVITCKPFTSAANCSGT---CLFDLSKDPCETRDLSSKLPLIVKKLE 522
Query: 531 DKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD-ELDKQKAI-ETMRLHKPL 583
L Y+ ++P N P D P +N +++PW E D + + H+PL
Sbjct: 523 SFLGEYRRVLMPQTNSPQDACGLPKYFNGVYMPWRTIEFDNVFGLTNSTTCHRPL 577
>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
Length = 549
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 238/563 (42%), Positives = 330/563 (58%), Gaps = 40/563 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII ILADD GW+DV FHGS +IPTPN+DALA +G+IL+ HY Q LCTPSR +L+TGK+
Sbjct: 1 PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKH 60
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI IG+Q GVI +P+GL L EKLLP+YLK GY +H +GKWHLGFF + YTP RGFD
Sbjct: 61 PIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFD 120
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
SHYG++ +DY H GLD N Q D G YST L+T A ++AEHN
Sbjct: 121 SHYGFYGASEDYMTH-----IGGMGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHN 175
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQ---APDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
+++PMFLY +H AVH + E AP + KF I+D E + AGM+S LDESVGN
Sbjct: 176 QTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGN 235
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---GSNHPLRGMKSTPWDGGMRGVAAI 323
+ +L +GMLEN+I++F +DNGAP H N SN+PLRG K++ W+GG R A +
Sbjct: 236 LTKSLHTNGMLENTIIIFSSDNGAP----HENTDVCSSNYPLRGAKTSVWEGGTRVPAFV 291
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD--GVNQWDVLTKGAKTKRSEI 381
WSP LK++ VS ++ HISDWLPTL AAG ++ D GV+QW L + + R+ +
Sbjct: 292 WSPLLKKSGYVSKQMMHISDWLPTLLEAAGYNMSALPGDLYGVSQWQALQENGPSARNSM 351
Query: 382 LHNIDNVDNPQKYYAALRVDDLKYV----AGTDNNGQSDEWYGDTDNEIDKYSPKEVLY- 436
L+ N D+ ++ ALRV D+K + A ++G + + +++ P Y
Sbjct: 352 LY---NADHKKQGSWALRVGDMKLLQAEGATQLHSGWQQPFAINLQSQLPSNEPNNTNYV 408
Query: 437 ----SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV------ 486
S+ +T L K++H A +S + + I + R+ R
Sbjct: 409 QEEQSQVASILEEHET-LAGKRRHEATFWL-ASTLCKQLAEPAVIQQARQNKRRSGRTVP 466
Query: 487 RCNYDNKGA---HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP 543
C + + G C +T +PCL+N+T DPCE NLAE +++++LE +L + P
Sbjct: 467 NCGFASTGVAPRSCVNTGRPCLYNVTADPCEYYNLAEYLPEVVERLETRLDDLLKRSIEP 526
Query: 544 GNKPFDKRADPARWNNIWVPWYD 566
NKP D A P +W+PW +
Sbjct: 527 INKPADPAAYPKNHGGVWMPWIE 549
>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
Length = 613
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 240/599 (40%), Positives = 340/599 (56%), Gaps = 54/599 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ P+II ILADDLGWNDVSFHGSSQIPTP+IDALA G+IL +YV +CTP+RSA+MTG
Sbjct: 26 QPPNIIFILADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTG 85
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK-------------WHL 134
K+PIH G++HGVI P+GL L EKL+PQYL+E GY TH +GK + L
Sbjct: 86 KHPIHTGLEHGVIGVSHPYGLGLEEKLMPQYLRELGYRTHMVGKVSLEHGVIGVSHPYGL 145
Query: 135 G--------FFREV-----------YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
G + RE+ TP+ RGF+S YGY+ G+ DYY H + G
Sbjct: 146 GLEEKLMHQYLRELGYRTHMVGKESLTPSHRGFESFYGYYAGMGDYYTHEITSDGN-MTG 204
Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGN-TYEPFQA 234
D + V G YST+++TE +I +HN +P+++YLAH AVH+ N + QA
Sbjct: 205 FDFHMDGSVHKPVFGQYSTEIFTERTQEIILKHNPKEPLYIYLAHQAVHSANYDGQRLQA 264
Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
P E +F +I+ RR YA MV+ LD+S+GN+ +L++ + N++++F DNG P+ G
Sbjct: 265 PHEYYKRFPNITHENRRKYAAMVAALDDSIGNITQSLKESSLYNNTLIVFTTDNGGPANG 324
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
N SN PLRGMK T W+GG+RGV +W +++ ++S + H+ DW+PTL AG
Sbjct: 325 FDFNYASNWPLRGMKHTTWEGGLRGVGFLWGALIEKPGRMSDGMMHVCDWVPTLYGLAGG 384
Query: 355 EINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNG 413
+ LDG++ W +L++G ++ R EILH ID P +A+R+ D K V G G
Sbjct: 385 NTSTLQHLDGIDVWPMLSRGKQSPRQEILHTID----PLLDLSAIRIGDYKLVQGQRGTG 440
Query: 414 QSDEWYGDTDNEIDKYSPK-EVLYS-KAGITFNALKT---KLQIKQKHAADPKANSSDAL 468
++ WY + K SP+ E+L++ + +A++ KL Q+ + +
Sbjct: 441 RNG-WY--PPEGVGKQSPRQEILHTIDPLLDLSAIRIGDYKLVQGQRGTGRNGWYPPEGV 497
Query: 469 RTILTDEKILELREFARVRCNYD--NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLL 526
D K + F V C Y N +CN KPCLFNI DPCE NN+A D++
Sbjct: 498 SLADLDSKPVP-NAFV-VSCPYKPANASTNCNPYKKPCLFNIRHDPCEFNNIANWNQDIV 555
Query: 527 KQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLS 585
+ L ++ Y++TMVP NK D R+ P +N WVPW D D A T R KP +
Sbjct: 556 ELLLTRIEEYRATMVPIRNKEPDPRSYPKYHDNCWVPWVD--DTSTAPGT-RFTKPFFN 611
>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
Length = 502
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 232/525 (44%), Positives = 309/525 (58%), Gaps = 35/525 (6%)
Query: 11 LTCTLLFNDAFL-----NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
L C + +F N+ P P+I+ ILADDLGW+DVSFHGS Q+PTPNIDALA +G
Sbjct: 2 LRCAFVLTLSFFLECSGNSEIP--PNIVFILADDLGWDDVSFHGSRQVPTPNIDALASDG 59
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+IL+ +YV LC+PSR ALMTGK+PI IG+Q GVI+ +P GL L EKLLP+YL GY
Sbjct: 60 IILDNYYVHTLCSPSRGALMTGKHPIQIGLQRGVIMPAQPSGLGLKEKLLPEYLNTLGYK 119
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+H +GKWHLG E YTP RGF+SH+G++QG + Y H C + GLD N + D
Sbjct: 120 SHMVGKWHLGMCAEEYTPMHRGFESHFGFYQGCESYTTHMCGNS-----GLDFWLNEEPD 174
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
+ G YST L+T A ++AEH+ + PMFLYLAH AVH GN + F APD+ K I
Sbjct: 175 HSAGGQYSTSLFTAKAEQLLAEHDTASPMFLYLAHQAVHVGNQDQKFYAPDKYTDKLSFI 234
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK-GSNHP 304
SD RR AGM++ LDESVG V AL GMLEN+I++F +DNGAP G + GSN P
Sbjct: 235 SDDRRRQMAGMLTALDESVGKVTKALHAKGMLENTIIVFSSDNGAPHIGNSDHTVGSNFP 294
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLD 362
LRG K + W+GG R A +WSP K++ +S +L HISDWLPTL AAG + + T+L
Sbjct: 295 LRGGKPSIWEGGSRVPAFVWSPLFKKSGYISKQLMHISDWLPTLLQAAGYDTSALPTNLY 354
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT 422
GV+QW L + + R +L+N+ + + ALRV D+K + + ++ WY
Sbjct: 355 GVSQWQTLQENGPSPRDSVLYNV--IAGCKDKSWALRVGDMKLLRAQGKSKKNAGWYQPF 412
Query: 423 DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELRE 482
++I PK S+ + + +L Q + LR + EK
Sbjct: 413 SSDILS-QPK----SRDPAIISVVPAELLQSQ---------VTPILRELGRGEKSSSTEH 458
Query: 483 FARVRCNYDNKGA--HCNSTVKPCLFNITDDPCEQNNLAESQTDL 525
R+ C G +C KPCLFNIT DPCE +NLA ++
Sbjct: 459 --RIDCGAPPAGVVYNCIGMDKPCLFNITADPCEYHNLANDMPEI 501
>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 578
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 237/574 (41%), Positives = 335/574 (58%), Gaps = 48/574 (8%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
RK+ LT L + + +PHII+ +ADDLGWNDV FHG++QIPTPNIDAL YN
Sbjct: 2 RKFIALLTIAGLLGSSCADFEG--RPHIIVFMADDLGWNDVGFHGATQIPTPNIDALGYN 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
G+ILN+HYV C+P+R+A +TGKYPI +GMQ I GEP GLP+ + LP+YL+ GY
Sbjct: 60 GIILNKHYVLPSCSPTRAAFLTGKYPIRMGMQGAGIAGGEPRGLPVHVQTLPEYLQGLGY 119
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
T+ IGKWH+G+ + P RGFD YG++ YYD+ + + G DM N +
Sbjct: 120 ETNLIGKWHVGYHTPKHLPNRRGFDYFYGFYNSHIGYYDY--RYSQGNMSGFDMHINGET 177
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA---K 241
T G+Y+TD +T+AAI+VI H+ PM+L ++HLA HA P P E+ +
Sbjct: 178 AYGTDGVYATDRFTQAAIDVIYRHDLESPMYLQVSHLAPHA-----PMDVPFEDNPYDDE 232
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
F IS+P+RR YA MV+RLD+S+G ++++L GML+NS+++FM+DNG + G N S
Sbjct: 233 FRHISEPKRRAYAKMVARLDDSLGRIVSSLGDRGMLKNSVIVFMSDNGGATIGKFRNWAS 292
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS- 360
N PLRG K T ++GG+R VAA+WSP L +VS +LFH++DWLPTL AG ++ D
Sbjct: 293 NWPLRGTKYTLFEGGVRSVAAMWSPKLLSKGRVSEQLFHVTDWLPTLFQVAGGDMRDLGP 352
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
+DGV+QWDVL+ G R +IL NID + AA+ K V G+ +G +Y
Sbjct: 353 IDGVSQWDVLSIGTGNVRDKILLNIDEYTQTE---AAID-KRFKIVRGSLYHG----YYD 404
Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL-E 479
+ + EI + K +Y+ + I +A+ + ++H P +T E ++ E
Sbjct: 405 NVEGEIGR-GHKNPMYNTSRILKSAVSDAI---REHLGVP-----------VTQESVMWE 449
Query: 480 LREFARVRCNYDNKGAH---------CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
L+ A V C + CN T CLF++ DPCE NN+A+S L++LE
Sbjct: 450 LQRQATVLCRPNMTNMQYTSRYTLPSCNQTA--CLFDLGIDPCETNNVAKSYPAQLQELE 507
Query: 531 DKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
L Y + + P D ADP RWN W PW
Sbjct: 508 LFLDKYSNLLTRQIRMPVDYMADPRRWNYTWQPW 541
>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 502
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 217/503 (43%), Positives = 295/503 (58%), Gaps = 29/503 (5%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+PHI+ ILADDLGWNDVS+HGS QI TPNIDALA+NG+ LN++Y Q LCTPSRSA +TG
Sbjct: 24 KQPHIVFILADDLGWNDVSYHGSPQILTPNIDALAWNGIRLNRYYTQPLCTPSRSAFLTG 83
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YP++ GMQH VIL EP GLPL KLLPQ+L + GY + +GKWHLG+++E YTPT RG
Sbjct: 84 CYPMNTGMQHSVILTTEPRGLPLHYKLLPQWLGDFGYVSRMLGKWHLGYYKEEYTPTMRG 143
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPY--QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
F SH G W+G DYY H +++ + G D R +MQ + G Y T + TE A+ +I
Sbjct: 144 FQSHVGSWEGFSDYYSHIMDFSWQTWSISGHDFRRDMQKSKEDDGRYYTHVMTEEALKII 203
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
+H KP+FLY+AHLAVH+GN EP +AP + ++DI P R YAGM+S LDESVG
Sbjct: 204 KDHPNEKPLFLYIAHLAVHSGNQPEPLKAPTKYTDPYMDIGHPSRTLYAGMLSVLDESVG 263
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
V AL + G+L +IV+ DNGA S+ GS P RG K T W+GG+R A +WS
Sbjct: 264 AVFEALGRRGILNETIVVLTTDNGAGITTQFSSFGSGWPFRGQKGTAWEGGVRVPAVVWS 323
Query: 326 PWLKQTQ-KVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEILH 383
P + + V LFH+SDWLPT AG + N +DG++Q D L K R+EI+
Sbjct: 324 PLFSEHRGAVVESLFHVSDWLPTFYELAGGDPSNLGEIDGISQLDTLRHREKLPRTEIVL 383
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
NID ++N +A+ V K + G D G D+WY N + + +A + +
Sbjct: 384 NIDPIEN----VSAIIVYPFKLMQGDDQGGHYDDWYPFMGN-VTRTQKDAREGCEASVVY 438
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
+ KH+ D + + V+C + C VKP
Sbjct: 439 RVM--------KHSGFVVTCGQDPVTYATS------------VKCGEKDPTKACKPIVKP 478
Query: 504 CLFNITDDPCEQNNLAESQTDLL 526
CLF+++ DPCE N+++ ++
Sbjct: 479 CLFDLSKDPCEYNDISAENIEVF 501
>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 594
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 227/536 (42%), Positives = 312/536 (58%), Gaps = 29/536 (5%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
++++ILADD+GWNDVSF GS QIPTPN+DALA G+IL HYVQ CTPSR+AL+TG YP
Sbjct: 84 NVVVILADDMGWNDVSFTGSGQIPTPNLDALASAGVILQNHYVQPFCTPSRAALLTGMYP 143
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH GMQH VI EPWGLPL KLLPQ+LK+ GY TH IGKWHLG F++ + PT RGFDS
Sbjct: 144 IHSGMQHYVIRSREPWGLPLDFKLLPQHLKDLGYRTHLIGKWHLGQFKKEFLPTRRGFDS 203
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
H GY+ G DY+ H+ T++ LD + + ++ Y+T L+T+ A +I +H+
Sbjct: 204 HLGYYNGYIDYFTHN--HTYKRDSALDFFKDEVPYHSEE---YATRLFTDRAEEIIRDHD 258
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
P+FLY AHLAVH +PFQAP E + KF + D R T+A M++ LD SVG V+
Sbjct: 259 VDNPLFLYFAHLAVHRATDRDPFQAPQETIDKFSYVGDRNRTTFAAMLAELDVSVGRVVE 318
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL K G+L+N+I+LF +DNG + N G+N PLRG K T ++GG R A +WSP ++
Sbjct: 319 ALAKKGILDNTIILFSSDNGGQATAPMENTGTNFPLRGQKRTLFEGGTRVPAFVWSPLIR 378
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++V ++ HI DWLPT+ + AG + N +DG+N W ++ G + R+EIL+NID
Sbjct: 379 RPRRVFYDMVHIVDWLPTIYSLAGGDPRNLGRIDGMNVWGAVSDGMASPRTEILYNID-- 436
Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
P + A+RV K + G N EWY + +S E K+ +
Sbjct: 437 --PVQRDLAIRVGRYKLLYGPMYN--LTEWYDNRGFGYASHSVLERYMDKSAVA------ 486
Query: 449 KLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNI 508
++ + K D R + RE RV C + CN + +PCLF++
Sbjct: 487 --RVLRDLGYWWKGREEDYSR-----RPTWQWREETRVHCGNPSSATACNPSEQPCLFDL 539
Query: 509 TDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
DPCE NNL T + + L +L + + P NKP D P N W PW
Sbjct: 540 ATDPCEYNNLYGQLTQVARYLLSRLEAHARSARPINNKPDDYSFQP---NATWGPW 592
>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
Length = 633
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 218/448 (48%), Positives = 283/448 (63%), Gaps = 33/448 (7%)
Query: 7 YFFALTCTLL-FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+F L C LL DA N P KP+II ILADDLG+NDV FHGS++IPTPNIDALAY+G
Sbjct: 2 WFLWLICLLLPIIDAAENEKPPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSG 61
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ILN++YV +CTPSRSALMTGKYPIH GMQH V+ EP GLPL EK+LPQYL E GY
Sbjct: 62 IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYT 121
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+H GKWHLG ++ YTP +RGF SH+ GLDMR+ QV
Sbjct: 122 SHIAGKWHLGHWKLKYTPLYRGFSSHW----------------------GLDMRNGTQVA 159
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
G Y+TD+ TE ++ VIA HN +K P+FLY+AH A H+ N Y P PD +V K
Sbjct: 160 YDLHGHYTTDVITEHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSH 219
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I + +RR +A MVS++D+SVG ++ LRK MLENSI++F +DNG P+ G + N SN+P
Sbjct: 220 IPNYKRRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 279
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL--- 361
L+G+K+T W+GG+R +WSP LK++Q+VS++ HI DWLPTL AAG + +L
Sbjct: 280 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIVDWLPTLLEAAGGQPALANLSKQ 339
Query: 362 -DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
DG + W L + + R +LHNID++ AA+ V D K V GT+ G D WYG
Sbjct: 340 IDGQSIWRALVQDKASPRLNVLHNIDDIWGS----AAISVGDWKLVKGTNYRGSWDGWYG 395
Query: 421 DTDNEIDK-YSPKEVLYSKAGITFNALK 447
+ Y + V S+AG LK
Sbjct: 396 PAGERDPRLYDWQLVGRSRAGKALEGLK 423
>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
Length = 581
Score = 410 bits (1054), Expect = e-112, Method: Compositional matrix adjust.
Identities = 234/555 (42%), Positives = 323/555 (58%), Gaps = 50/555 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII+ +ADDLGWNDVSFHGS+QIPTPNIDAL YNG+ILN+HYV TPSR+A TG+Y
Sbjct: 31 PHIIVFMADDLGWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQY 90
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI IGMQ I GEP GLPL K+LP++L+ GY T IGKWH+G++ YTP RGFD
Sbjct: 91 PIRIGMQGADIRGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFD 150
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTEAAINVIA 206
+ G++ + + + G DM D+ G+ Y+TD++T AIN+I
Sbjct: 151 TFLGFYNSYIS--YYDYSYSNQNMSGYDMHRG---DDPAYGMNREYATDMFTREAINIIE 205
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAP-----DEEVAKFLDISDPERRTYAGMVSRLD 261
H ++P++L L+HLAVHA P + P D E + I +P RR YA MVS+LD
Sbjct: 206 NHELNRPLYLQLSHLAVHA-----PLEQPMNVYNDREP---IHIREPNRRKYAKMVSKLD 257
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
ESVG + AL + GML++S++LF+ DNGA S G + N GSN+PLRG K T ++GG+RGVA
Sbjct: 258 ESVGRIFHALGEKGMLKDSLILFLTDNGAASVGRYRNYGSNYPLRGTKYTLYEGGVRGVA 317
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRS- 379
A+WS L++ +VS+EL HI+DWLPTL +AAG ++ D +DG++QW +L++G+ R
Sbjct: 318 ALWSSRLQKAARVSNELVHITDWLPTLYSAAGGDLKDLGKIDGIDQWRLLSEGSGRGRGR 377
Query: 380 -EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
++L NID V K A+ K + GT+ NG D +YGD+ ++ + K
Sbjct: 378 DKLLLNIDEV---LKTEGAI-YSRFKLLRGTNENGYYDGYYGDSGKTLETLPTYTDIVMK 433
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN----YDNKG 494
+ ++ + H P S +R LR+ A V+CN Y +
Sbjct: 434 STVS--------ESLTYHLGGPVTQPSTMVR----------LRQDATVQCNANFTYYRRQ 475
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
V CLF+I +DPCE N+AE+ + + L L Y +V KP D ADP
Sbjct: 476 MLTPCNVTECLFDIVNDPCETRNIAETYVRIARDLNYFLEEYSHILVKQMKKPIDWSADP 535
Query: 555 ARWNNIWVPWYDELD 569
N+ W PW D
Sbjct: 536 RNRNDTWGPWVKSGD 550
>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
Length = 472
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 217/487 (44%), Positives = 289/487 (59%), Gaps = 32/487 (6%)
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
MTGKYPIH GMQH V+ EP GLPLTEKLLPQYLKE GY H GKWHLG + +TP
Sbjct: 1 MTGKYPIHTGMQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHIYGKWHLGSYTRKHTPL 60
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFDSH G+W G +DH+ T GLDMR V G Y+T + + ++
Sbjct: 61 ERGFDSHVGFWTGHHHMFDHTAVETNA--WGLDMRRGFDVAYDLHGYYTTHVIRDESVAA 118
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I HN S+PMFLY++H A H+ N Y+ APDE V + IS+ RR +A M++ LD+SV
Sbjct: 119 IRAHNTSQPMFLYVSHAATHSANPYDFLPAPDETVERLAGISNYSRRKFAAMLTELDQSV 178
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G +I AL + GML+NSI++F DNG P+ G ++N SN PLRG K+T W+GG+RG IW
Sbjct: 179 GAIIVALSERGMLDNSIIVFSTDNGGPAEGFNNNAASNWPLRGTKNTLWEGGVRGAGCIW 238
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAKTKRSEIL 382
SP +++ ++VS ++ HISDWLPTL AAG ++N ++LDG++ W L G T+R EIL
Sbjct: 239 SPLIEEKRRVSHQIMHISDWLPTLLDAAGYDLNKLPSNLDGISVWPELRNGDVTRRHEIL 298
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGI 441
HNID++ AAL VDD K V GT+ G+ D WYG D++ Y+ + G
Sbjct: 299 HNIDDIWGS----AALTVDDWKVVKGTNYEGKWDAWYGPAGDHDTKSYNLSSIFGCPTGK 354
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
+ LK + +E+I++LR A V+C + CN
Sbjct: 355 ALSKLK----------------------MLPPEEEIIKLRREATVQCANGTRNK-CNPLE 391
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
PCLF++ +DPCE NLA+ D+L+ L KL+ Y +T VPPGN P D R DP W W
Sbjct: 392 SPCLFDLFNDPCEFENLADQYPDILEVLLKKLSDYNATAVPPGNMPLDSRGDPRFWGYTW 451
Query: 562 VPWYDEL 568
+ DEL
Sbjct: 452 HNFGDEL 458
>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 554
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 227/541 (41%), Positives = 302/541 (55%), Gaps = 30/541 (5%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A +PHI+ ILADDLGW+DVSFHGSSQIPTPN+D LA +G+ILN +YV CTPSR+ALM
Sbjct: 32 AKMRPHIVFILADDLGWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALM 91
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG YPI GMQ I EPWGLP ++LPQYLKE GY TH +GKWHLG ++E TPT
Sbjct: 92 TGLYPIRTGMQGMPIDVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTC 151
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
RGFDS YGY+ G DY+ H+ ++E + GLD N + IG YST ++T+ A +I
Sbjct: 152 RGFDSFYGYYYGESDYFAHTI--SYENHTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYII 209
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
KSKP+ L + H A H E QAP E + KF I + R YAGMV LD+SVG
Sbjct: 210 ENRTKSKPLLLVITHQATHCALERERLQAPQENIDKFPYIGEKNRTIYAGMVDALDQSVG 269
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
V+ AL GMLEN+I++F +DNG +G H+++G N PLRG K T W+GG R A +WS
Sbjct: 270 QVVKALSDAGMLENTIIVFSSDNGGGPWGEHNSRGFNWPLRGAKGTVWEGGTRAAAFVWS 329
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHN 384
P L + ++VS +L HI+DWLPT + AG LDG + W L+ G ++ R E+L+N
Sbjct: 330 PLLARRRRVSHQLMHITDWLPTFYSIAGGNAAKLVELDGHDMWRPLSHGRRSPRVELLYN 389
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
D KY + + + +Y D G+ + D P+ L A +
Sbjct: 390 YD-----YKYTKSAALRNHRYKLVLDETGR----FADRYKVPGGSRPRHDLDYLAAKSTA 440
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
A + K+K PK+ R AR+ C K S
Sbjct: 441 ASVLRDFYKKKQLNFPKS-----------------WRRKARLTCGKQKKINFSWSATSVY 483
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP-GNKPFDKRADPARWNNIWVP 563
LF++ DPCE NNLA +++ L+ +L Y + +P N +D R+ P W P
Sbjct: 484 LFDVVTDPCELNNLASKLPNVVASLKKRLDAYGAAALPERNNDTYDPRSFPENHGGTWAP 543
Query: 564 W 564
W
Sbjct: 544 W 544
>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
Length = 591
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 223/545 (40%), Positives = 317/545 (58%), Gaps = 30/545 (5%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII+I+ADDLGW+DV FHGS QI TPNIDAL YNG+ILN+HYV TPSR+A TG+Y
Sbjct: 30 PHIIVIMADDLGWDDVGFHGSDQIRTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQY 89
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI +GMQ I GEP G+PL ++LP++L++ GY T IGKWHLG++ +TP RGFD
Sbjct: 90 PIRMGMQGEDIQGGEPRGIPLNVRILPEFLRDLGYMTKLIGKWHLGYYTPQHTPLRRGFD 149
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
S G++ YY++ K +F+ G DM + Y TD +T+ AI +I ++
Sbjct: 150 SFLGFYNSHVSYYNY--KYSFQNMSGYDMHRGDAPAYGSTDKYVTDFFTDEAIKIIEYYD 207
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEV---AKFLDISDPERRTYAGMVSRLDESVGN 266
S+P++L ++HLAVHA P + P + ++FL I + RR YA MV RLD SVG
Sbjct: 208 PSRPLYLQISHLAVHA-----PLEGPQDYNHYDSQFLHIREINRRKYAKMVLRLDNSVGR 262
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
++ AL GML+NS++LF+ DNGA G N GSN+PLRGMK T ++GG+RGVA +WSP
Sbjct: 263 IVQALGNKGMLKNSLILFLTDNGAAPIGKFRNYGSNYPLRGMKYTLYEGGVRGVALLWSP 322
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
L++T +VS EL H++DWLPTL + AG ++ D +DG++QW +L + R +L NI
Sbjct: 323 RLRKTARVSDELMHVTDWLPTLYSIAGGDVRDLGEIDGIDQWCMLNGSLPSARDRLLLNI 382
Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNA 445
D V + K V G+ G D ++ D I++ P ++ T
Sbjct: 383 DEVSKTE----GAIYKQFKLVRGSIEGGLYDGYFYD---HIERNVPIMLIMPSDPKTI-- 433
Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTD-----EKILELREFARVRCNYDNKG-AHCNS 499
+ + ++ S+ + + L D I++LR A V C N+ CN
Sbjct: 434 --VQEDVPPYTETILRSVVSETITSFLGDPVTQPSTIIQLRREATVNCRSRNRSFLTCN- 490
Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
V CLF++ +DPCE NN+AE + + ++L+ L Y + P D ADP + NN
Sbjct: 491 -VTECLFDVNNDPCETNNIAEQYSRISRELDKSLEQYWHIVKRQMKIPVDWLADPRKTNN 549
Query: 560 IWVPW 564
W PW
Sbjct: 550 TWEPW 554
>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
Length = 557
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 235/570 (41%), Positives = 342/570 (60%), Gaps = 49/570 (8%)
Query: 8 FFALTCTLLFNDAFLNTTAPKK--PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
F C ++ + N+ A K P+I++I+ADDLGWNDVSF+G +IPTPNID+LAYNG
Sbjct: 13 LFIFVCDVILKNISRNSQATSKQPPNIVVIMADDLGWNDVSFNGGDEIPTPNIDSLAYNG 72
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ILN+HYV +CTPSR+A TG+YPI GMQ + EP +PL LLPQYL++ GYA
Sbjct: 73 VILNRHYVLPICTPSRTAFFTGQYPIRSGMQGYPLQGAEPRSIPLNNILLPQYLRKLGYA 132
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
TH +GKWH+G+ +TPT RGFD+ GY+ G +Y+ H+ + G D+ H D
Sbjct: 133 THLVGKWHVGYQTNNHTPTNRGFDTFLGYYSGYIEYFSHNLVENGQ--SGYDI-HRSVGD 189
Query: 186 NKTIGI---YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT-----YEPFQAPDE 237
N TI Y TDL T+ A N+I+ HN +KP++L LAHLA HA ++A ++
Sbjct: 190 NHTIEYRYDYMTDLITDEAENIISSHNPAKPLYLQLAHLAPHASTVDDVIEVRSWKATND 249
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG-IH 296
+ DI+ RR +A V +DES+G V+ ALR+ ML+NSI++FM+DNGAP+ I
Sbjct: 250 TLGYIRDIN---RRKFASAVVTMDESIGRVVDALRRADMLKNSIIVFMSDNGAPTKDQIL 306
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
N GSN+PLRGMK + ++G +RGVA I+SP + +VS++LFHI+DWLPTL AAAG +
Sbjct: 307 YNFGSNYPLRGMKQSFFEGAVRGVACIYSPLIDFPSRVSTQLFHITDWLPTLYAAAGGDA 366
Query: 357 ND-TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQS 415
+D +LDG++QW ++ +R +L NI+ + + AAL + K + + +
Sbjct: 367 SDLKALDGIDQWSMIKNADNGRRQFLLVNINEKTDSK---AAL-IGRYKLIR---DQSEY 419
Query: 416 DEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANS-SDALRTILTD 474
++YG T K+ Y + +T N L A P A++ +D R+ L
Sbjct: 420 TKYYGSTG--------KDPSYCEYNVT-NVL-----------ASPVASAIADISRSSLDI 459
Query: 475 EKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLA 534
+KI +LRE +RV C ++ + C T + CLF+I DPCE N++ +LK+L +
Sbjct: 460 DKIKQLREKSRVVCRV-SRFSRC--TKRTCLFDIKKDPCETTNVSSKYPKILKKLNALID 516
Query: 535 IYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
Y+S +V NKP D P +N+ W+PW
Sbjct: 517 SYQSILVNQTNKPVDPAGFPCNFNDTWMPW 546
>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 405 bits (1042), Expect = e-110, Method: Compositional matrix adjust.
Identities = 234/558 (41%), Positives = 329/558 (58%), Gaps = 41/558 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ P+I++I+ADDLGWNDVSFHG+ +IPTPNIDALAYNG+ILN+HYV +CTPSR+A +TG
Sbjct: 2 QPPNIVVIIADDLGWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPICTPSRTAFLTG 61
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
KYPI GMQ + EP G+ L LLP+YL++ GYATH +GKWH+G+ Y PT RG
Sbjct: 62 KYPIRTGMQGYPLQGAEPRGILLNNILLPEYLQKLGYATHLVGKWHVGYHTRNYGPTHRG 121
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGI---YSTDLYTEAAIN 203
FD+ GY+ G Y++H+ +E Q G D+ H + D+ I Y TDL T+ A N
Sbjct: 122 FDTFAGYYNGYIQYFNHTL---YESEQLGYDL-HRIIGDDHKIEYRYDYMTDLITDEAEN 177
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD--EEVAKFLDISDPERRTYAGMVSRLD 261
+I+ HN +KP++L +AHLA H+ + E + + E A I D RR YA +V+ LD
Sbjct: 178 IISSHNPAKPLYLQVAHLAAHSSDAEEEMEVRNWKETNATLGYIEDINRRKYASVVATLD 237
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
ESVG VI AL+K ML NSI++F++DNGA + G N GSN+PLRG+K + ++GG+RG A
Sbjct: 238 ESVGRVIDALKKTDMLNNSIIVFISDNGAQTEGFLQNYGSNYPLRGLKFSLFEGGIRGAA 297
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSE 380
I+SP + + +VS++LFHI+DWLPTL +AAG +D LDG +QW + +KRS
Sbjct: 298 CIYSPLIDRPSRVSNQLFHITDWLPTLYSAAGGNSSDLKQLDGFDQWSAIKSAKNSKRSS 357
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
IL NID +N + AAL + K V T + ++Y YS LY K
Sbjct: 358 ILINIDEKENSE---AAL-IGHYKLVTDT---SEYQKYY--------NYSGNNALYPKYN 402
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
+T L + + N+S KI++LR+ A + C N N T
Sbjct: 403 VT-TILASPAASAIASISISTLNAS----------KIMQLRKEATIICK--NFTDFSNCT 449
Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
+ CLF++ DPCE +L +K+L + YKS ++ N P D P +N+
Sbjct: 450 NRSCLFDLNKDPCETTDLFSKHPKDVKRLNMFIDRYKSVLMNQTNAPVDPAGFPRNFNDT 509
Query: 561 WVPWYDELDKQKAIETMR 578
W+PW D ++I +R
Sbjct: 510 WMPWLP--DDYQSISILR 525
>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 510
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 233/545 (42%), Positives = 304/545 (55%), Gaps = 41/545 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+P+I+ IL DDLGW DVSFHGS+QIPTPNID LA +G+ILN +YV CTPSR+ALMTG
Sbjct: 1 KRPNIVFILGDDLGWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPTCTPSRAALMTG 60
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YPIH GMQ +I PWGLPL K+LPQ+ ++ GY + IGKWHLGFF+ Y P RG
Sbjct: 61 LYPIHTGMQSDIIEPAAPWGLPLENKILPQHFRDLGYDVNMIGKWHLGFFKTPYVPIKRG 120
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FD+ +G++ G DYY+H+ + QG+ + + I L T++ +NV +
Sbjct: 121 FDTFFGFYTGSNDYYNHTSGSE-NNEQGVSLVATLTAAWGRIAQGVEPLLTDSPLNVYPQ 179
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
P F Y +H AVH+ EPFQAP V KF I + R YAGM+ LDESVG V
Sbjct: 180 -----PFFCYFSHHAVHSALMAEPFQAPARNVLKFPYIGESNRTIYAGMLDALDESVGRV 234
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ AL GMLE++I++F +DNG G SN+G N PLRG K T W+G +R A +WSP
Sbjct: 235 VEALDNAGMLEDTIIVFSSDNGGAPCGEASNQGFNWPLRGAKFTLWEGSVRVPAFVWSPK 294
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
+ +VS++L HISDWLPTL AAG +++ +DGV+ W LT + +KR EIL NID
Sbjct: 295 FLKQSRVSNQLMHISDWLPTLYTAAGGDVSALGVIDGVDMWHSLTHSSASKRKEILLNID 354
Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY---GDT--DNEIDKYSPKEVLYSKAGI 441
N + LR K V G +G+ ++ Y G T N ID+ +
Sbjct: 355 ASFN----ISGLRYKQYKLVVGGGFDGELNDHYYFTGGTRPTNNIDELRNES-------- 402
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
T + K KQ+ + P L E V C D +
Sbjct: 403 TMARVLKKFYEKQRRSWSP-----------------LTWSENVAVDCRRDRLTENFVPMQ 445
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
P LFNI DPCE NNLA+++ L+ L KL Y ST+VP +P D RA P IW
Sbjct: 446 PPYLFNIEQDPCELNNLAKTKVHTLRFLMKKLDAYNSTVVPSLFQPDDPRAYPEYHGGIW 505
Query: 562 VPWYD 566
PW D
Sbjct: 506 SPWLD 510
>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
Length = 545
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 221/550 (40%), Positives = 309/550 (56%), Gaps = 45/550 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P+I+ ILADDLGW+D+S HGS QIPTPNIDALA +G++LN +YVQ +CTPSR+AL+TG
Sbjct: 25 QRPNIVYILADDLGWDDISLHGSEQIPTPNIDALAADGILLNNYYVQPICTPSRAALLTG 84
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
K+P+H+G+QH I +GL L E++LP+YL GY +H +GKWHLG+F +TPT+RG
Sbjct: 85 KHPVHLGLQHNTIPAPSAYGLGLNERILPEYLNTLGYDSHMVGKWHLGYFTPQHTPTYRG 144
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
F SH+GY G +DY DH+ F GLD ++ ++ + G YST+++T A
Sbjct: 145 FKSHFGYLNGCEDYLDHTLAYDFSTLGMDGWGLDFWNDTKIHRTSFGQYSTEIFTTRAEE 204
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGN---TYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
+I N +P+FLY++H AVH+GN QAP + KF I ERR AGM+S L
Sbjct: 205 LI-RSNTGEPLFLYMSHQAVHSGNPGLNGSKLQAPWKYFNKFNYIQSDERRRLAGMISAL 263
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D+SVGN+ L GML N+I++F DNG P+ N SN PLRG K T W+GG+RG
Sbjct: 264 DDSVGNITRTLHAKGMLNNTIIVFSTDNGGPADRFDDNCASNWPLRGGKRTMWEGGLRGN 323
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD--GVNQWDVLTKGAKTKR 378
+WSP LK + VS I DWLPTL AAG ++ D G++QW L++ + + R
Sbjct: 324 GFVWSPLLKHSNYVSEHFMSIMDWLPTLLDAAGYDLGKLPRDVEGMSQWKSLSENSPSPR 383
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
+LHNID P LR+ D+K V G N WY P E+ +
Sbjct: 384 DYLLHNID----PLTGEGGLRLGDMKVVFGGGNTKAHRGWY-----------PPEMRWGN 428
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTD--EKILELREFARVRCN--YDNKG 494
++ P L I D ++ + +V C +
Sbjct: 429 ETVS----------------APSHLFETDLWDIFHDLGRPPVQKKISVQVECGPAPPDVT 472
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
+C S ++PCLFNI+ DPCE +NL+E Q D+L +L + +T+ PP + D P
Sbjct: 473 TNCKSWIEPCLFNISADPCEYHNLSEDQPDMLGKLLSIVKAINATVTPPYDNSDDPNGYP 532
Query: 555 ARWNNIWVPW 564
R +W PW
Sbjct: 533 GRHGGVWKPW 542
>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 571
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 228/573 (39%), Positives = 329/573 (57%), Gaps = 58/573 (10%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F+ T +F + +PHI++I+ADD+GWNDV FHGS++IPTPNIDALAY G+I
Sbjct: 10 LFSFILTPIFGPIY------GQPHIVVIIADDMGWNDVGFHGSNEIPTPNIDALAYGGVI 63
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
LN+HY CTPSR+A +TG++PI +G+Q + EP G+PL E+LLP+YL+E GY T
Sbjct: 64 LNRHYALPTCTPSRTAFLTGRHPIRMGLQGIPMNVAEPRGVPLHERLLPEYLRELGYVTR 123
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ---- 183
+GKWHLG++ + +TPT RGFDS GY+ G+ Y++H+ T + + G+D +
Sbjct: 124 LVGKWHLGYYTDKHTPTRRGFDSFVGYYGGVITYFNHT--VTKDKHTGIDYHWDTSGKIE 181
Query: 184 -VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD-EEVAK 241
DN Y TD ++ A VI H++ KP+FL LAH+A HA +P + + EV
Sbjct: 182 PFDNDQ---YVTDFISDQAEAVIKNHDRKKPLFLQLAHVAAHASENRDPIEVRNMTEVND 238
Query: 242 FLD-ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-GIHSNK 299
L I D RR YAG+V+ +D+SVG V+ AL+ ML NSI++FM+DNG+P+ ++N
Sbjct: 239 TLSYIPDINRRKYAGVVTAMDDSVGRVVKALKDANMLSNSIIIFMSDNGSPTAEAPYTNY 298
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
GSN+PLRG+K+T ++GG+R A ++SP LK +VS ELFHI+DW PTL AG +++
Sbjct: 299 GSNYPLRGIKATVFEGGVRVPACVFSPRLKDRFRVSDELFHITDWFPTLYKLAGGDLSKI 358
Query: 360 S-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
LDGV+QW ++ K+ R +L NID V NP+ + K + G + + D++
Sbjct: 359 QDLDGVDQWSSISGSQKSNRESLLVNIDEVSNPEAAISGY----YKLIRGIN---RYDDY 411
Query: 419 YGDTDNEIDKYSPKE-----VLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILT 473
YG N+ YSPK VL S AG +L + QK
Sbjct: 412 YGKDGND---YSPKTYDVTGVLSSLAGRAIASLGNQYLPPQK------------------ 450
Query: 474 DEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKL 533
+I ELR A +RC + C T CLF+I DPCE N+ + ++ L +
Sbjct: 451 --RITELRNKATLRCEKKDDRPSCRDT---CLFDIVKDPCETTNIIDEHPEMEIVLNTYI 505
Query: 534 AIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
YK+ ++ N D P +N W+PW D
Sbjct: 506 QGYKAVLMNQTNTYIDPAGYPENFNGYWMPWMD 538
>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 527
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 220/536 (41%), Positives = 310/536 (57%), Gaps = 34/536 (6%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+II +L D LGW+DVSFHGS QIPTPN+DALA +G+ILN +YVQ +CTPSR+ALMTG Y
Sbjct: 23 PNIIFMLVDGLGWDDVSFHGSPQIPTPNMDALAADGIILNNYYVQPVCTPSRAALMTGMY 82
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PIH G+QHGV+L EP GLPL K++P+Y K+ GY TH IGKW+LG++ + YTPT+RGFD
Sbjct: 83 PIHTGLQHGVLLAAEPNGLPLEFKIMPEYFKDLGYETHLIGKWNLGYYMKEYTPTYRGFD 142
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
S YG++ +DY+ H+ + ++ + M + +Y T L + ++++
Sbjct: 143 SFYGFYNYEEDYFTHNLEF-------VNQSNAMVWRPSSFCVYLT-LSPDTGFDLLSASI 194
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+ P FLYL+H +VH + +P QAP+E +AKF I D R YAGMV LDES G+V+
Sbjct: 195 ERGPFFLYLSHQSVHGASGNDPLQAPEENIAKFPYIGDERRTKYAGMVDALDESAGDVLE 254
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL + GML N+I++ + NG S G+ SN GSN PLRG K W+GG R A IWSP L
Sbjct: 255 ALYEAGMLANTIIVMSSANGGLSSGVESNAGSNFPLRGGKGALWEGGTRASAFIWSPLLY 314
Query: 330 QTQKVSSELFHISDWLPTLCAAA-GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
Q +VS ++ HI+DWLPTL AAA G N +LDG N W+ L+ + R E+L+NID +
Sbjct: 315 QKNRVSDQMMHITDWLPTLYAAAQGNPFNLGTLDGQNLWNHLSYNLPSPRYELLYNIDPI 374
Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
D +A+R + K V G + + +E T Y ++ +++ K
Sbjct: 375 DQT----SAIRFLNYKLVLGESSGSEENEHRLKTPGNSRPYRDLDIAMAQS-------KA 423
Query: 449 KLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNI 508
++ H R IL + R+ A + C D++ + P LF++
Sbjct: 424 ARVLRNFHN-----------RGILFMDS--NWRQKAALNCG-DDERENVVPGAPPYLFDL 469
Query: 509 TDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
DPCE N+ + + L +L IY+ T + P NK D R P N W PW
Sbjct: 470 AKDPCEMRNIVLIEPGWVSILTLRLHIYEQTSIAPSNKQIDPRGFPENLNGTWAPW 525
>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 476
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 209/515 (40%), Positives = 296/515 (57%), Gaps = 58/515 (11%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
FF L C L + T P P+II ++ADDLGWNDVSFHGSSQIPTP+IDALA G+I
Sbjct: 13 FFLLGCML----ECVYTAQP--PNIIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVI 66
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +YV +CTP+RSA+MTGK+PIH G+Q+ VI+ EP+GL E ++PQYL+ GY TH
Sbjct: 67 LTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTH 126
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLGFF+E TP+ RGF+S+YGY+ G+QDY+ H ++T G D N +
Sbjct: 127 MVGKWHLGFFKESLTPSHRGFESYYGYYGGMQDYFTH--ESTEHTLTGFDFHVNGSIYKP 184
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGN-TYEPFQAPDEEVAKFLDIS 246
G YST++YTE +I HN +P+++YLAH AVH+ N + QAP + +F +I+
Sbjct: 185 VFGQYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSANYNGQRLQAPYKYYERFPNIT 244
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
+ RR +A MVS LD+S+GN+ L++ + N++++F DNG P+ G +N +N PLR
Sbjct: 245 NENRRKFAAMVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGPAHGFDANYANNWPLR 304
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVN 365
G+K T W+GG+RG +W +++ + S + H+ DW+PTL AG + LDG++
Sbjct: 305 GVKDTTWEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAGGNTSTLQHLDGID 364
Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
W +L++ + R EILHNID V N +A+R+ D K V G + NG +WY
Sbjct: 365 VWPMLSRAEPSPREEILHNIDPVRN----VSAIRIGDYKLVQGQNYNGSWSDWY------ 414
Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
P+ SS + D K +
Sbjct: 415 ---------------------------------PPEGESS-----VDVDSKPVPNAFVVS 436
Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAE 520
N +C+ KPCLFNI DPCE NN+A+
Sbjct: 437 CPSKPANASTNCDPKEKPCLFNIRHDPCEFNNIAD 471
>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 470
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 206/486 (42%), Positives = 282/486 (58%), Gaps = 34/486 (6%)
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
P+ QH VILE EPWGLPL E LLP++L + GY +HA+GKWHLG+F++ YTPT+RGF
Sbjct: 5 PLLARTQHNVILEPEPWGLPLNEILLPEHLNKLGYTSHAVGKWHLGYFKKAYTPTYRGFK 64
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
S YG+W G QDYY H +ATF ++G DMR ++ D ++G YST L+T+ A ++I +HN
Sbjct: 65 SFYGFWNGYQDYYTHMVQATFASFEGFDMRRDLNPDWSSVGKYSTHLFTKEATDIITKHN 124
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
S P+FLYLAHLA HAG P QAP E++ F I D RR YAGM+ LD SV V
Sbjct: 125 NSVPLFLYLAHLAPHAGTYENPLQAPQEDINSFQSIKDKYRRKYAGMMKNLDNSVAEVFN 184
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL + ML+N+I +F++DNGAP+ GIH N GSN PL+G K+TPW+GG+R A +WS L
Sbjct: 185 ALHTNNMLDNTIFVFVSDNGAPTNGIHRNYGSNWPLKGEKATPWEGGIRTAAFVWSKLLS 244
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ + + + L HISDWLPTL AAG D ++DG++ W+ + K+ R +ILHNID++
Sbjct: 245 KKKTLPNPLMHISDWLPTLYQAAGGNTQDLGAIDGISMWESFSNNIKSPRKQILHNIDDI 304
Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
YAA+R + KY+ G+ G D W G +Y+ +L S T
Sbjct: 305 TG----YAAIRDTNFKYIKGSTFLGYLDYWSGSLSPSTIRYNVDAILNS----------T 350
Query: 449 KLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNI 508
I + D LT I LR + VRC+++ C +KPCLF+I
Sbjct: 351 TAGILSQINGDS-----------LTPSLIESLRNSSTVRCDFEKDMKPCKPFIKPCLFHI 399
Query: 509 TDDPCEQNNLAESQTDLLKQ--------LEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
DPCE NL +++ E L ++ + P N K A+PA +NN
Sbjct: 400 VKDPCELVNLNYKPNSKMRKFVEAKIDYFETSLGKFRESASKPMNVRGTKDANPALYNNT 459
Query: 561 WVPWYD 566
W+ W D
Sbjct: 460 WISWDD 465
>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
Length = 485
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 195/384 (50%), Positives = 258/384 (67%), Gaps = 9/384 (2%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GWNDVSFHGS QIPTPN+D+LAY+G+IL +YV +CTP+RSA+MTG++PIH G+QHGVI
Sbjct: 3 GWNDVSFHGSDQIPTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQHGVI 62
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
P+GLPL E +LPQYLK GYATH +GKWHLG +TPTFRGFDS++GY G +
Sbjct: 63 SGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTGKDN 122
Query: 161 YYDHSCKATFEP----YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFL 216
YYDH+ + P Y+GLD+R+ + G YST+L+ A +I H+ SKP+FL
Sbjct: 123 YYDHTDDESNSPEELGYKGLDLRNGTEPVWTENGTYSTELFATEAERIITSHDTSKPLFL 182
Query: 217 YLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGM 276
YL H AVH+GN P QAP + + KF I P RRT+A MVS LD++VGNV AL GM
Sbjct: 183 YLPHQAVHSGNPDNPLQAPQKYIDKFPHIQHPGRRTFAAMVSALDDAVGNVTKALSARGM 242
Query: 277 LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
LENS+++F DNG P+ G N SN PLRG+K+T W+GG+ G + SP +KQ ++ +
Sbjct: 243 LENSVIIFTTDNGGPAAGFDQNYASNWPLRGVKNTLWEGGVHGTGFVHSPLIKQPKRTTH 302
Query: 337 ELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYY 395
EL H+ D LPT+ AG + + +LDG N W+ +++G ++ R E+LHNID P++
Sbjct: 303 ELLHVCDLLPTIYELAGGDSTELKNLDGTNVWETISRGVQSPRVEVLHNID----PKRKT 358
Query: 396 AALRVDDLKYVAGTDNNGQSDEWY 419
AALR D K + G G D WY
Sbjct: 359 AALRYGDYKIILGEAYKGAWDGWY 382
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 32/60 (53%)
Query: 510 DDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDELD 569
+DPCE NN+A+ L+ L +L Y T VP N+P D R +P +W PW + D
Sbjct: 398 EDPCEFNNIADWNKPLVNFLMGRLEAYSETAVPIRNQPTDPRGNPKYHGGVWSPWINLTD 457
>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
Length = 531
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 224/581 (38%), Positives = 326/581 (56%), Gaps = 60/581 (10%)
Query: 11 LTCTLLFNDAFLNTTA----PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ C L F+ T++ KKP+IIII+ADDLG+NDVSFHGSSQIPTPN+ +A G+
Sbjct: 1 MVCVLGLLLYFILTSSKPSTAKKPNIIIIIADDLGYNDVSFHGSSQIPTPNLAKMATRGI 60
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
IL++ Y Q+ CTPSR+AL+TG+YPI GMQ + GE LPL +P + + GY T
Sbjct: 61 ILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPLKAGENRSLPLNMPTMPLHFQNLGYKT 120
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNMQ 183
H +GKWHLG + TP +GFDSH+GYW G Y+D+ + + +GLD+ +
Sbjct: 121 HLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVGYFDYVSFSKMDNGTLVKGLDLHDQFE 180
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA--K 241
+ G Y+T+L+TE +++VI H+ P+FL ++HLA H G PD + +
Sbjct: 181 PVWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHE 240
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
F I DP RR YAG+VS LD S+G ++A L + ML+NSIVLF +DNGA + G++ N GS
Sbjct: 241 FSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVGMYENSGS 300
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS- 360
N PLRG+K + ++GG+R A I+SP + VS L HISDWLPTL +AAG ++
Sbjct: 301 NWPLRGVKFSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAGGDVAHLGQ 360
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
+DG++QWD LT + R+EIL NID VD + +A +R D K + G+ + G D++YG
Sbjct: 361 IDGIDQWDALTNNNPSNRTEILINIDEVD---ENFAIIR-DKFKLIQGSYHEGTFDQYYG 416
Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
D+ + +P +P ++D
Sbjct: 417 DSGRGPENPTP---------------------------NPNHTTTDL------------- 436
Query: 481 REFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTM 540
C ++ N T K CLF++ DPCE N+ ES+ ++ QL +K+A + +
Sbjct: 437 -----SWCRAPDQTPILNCT-KGCLFDLDKDPCETTNIIESEPEIANQLYEKIAQFWKEL 490
Query: 541 VPPGNKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHK 581
VP NK D ++DP +NN W W ++ K R+ +
Sbjct: 491 VPQRNKDTDPKSDPIFYNNTWCTWLEDASCHKTTNIERISR 531
>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 593
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 228/578 (39%), Positives = 304/578 (52%), Gaps = 65/578 (11%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A +PHI+ ILADDLGW+DVSFHGSSQIPTPN+D LA +G+ILN +YV CTPSR+ALM
Sbjct: 32 AKMRPHIVFILADDLGWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALM 91
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG YPI GMQ I EPWGLP ++LPQYLKE GY TH +GKWHLG ++E TPT
Sbjct: 92 TGLYPIRTGMQGMPIDVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTC 151
Query: 146 RGFDSHYGYWQGLQDYYDHS------------------CKA------------------- 168
RGFDS YGY+ G DY+ H+ C+
Sbjct: 152 RGFDSFYGYYYGESDYFAHTISYVRHLSWAFFLTRKCXCRGFDSFYGYYYGESDYFAHTI 211
Query: 169 TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
++E + GLD N + IG YST ++T+ A +I KSKP+ L + H A H
Sbjct: 212 SYENHTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALE 271
Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
E QAP E + KF I + R YAGMV LD+SVG V+ AL GMLEN+I++F +DN
Sbjct: 272 RERLQAPQENIDKFPYIGEKNRTIYAGMVDALDQSVGQVVKALSDAGMLENTIIVFSSDN 331
Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
G +G H+++G N PLRG K T W+GG R A +WSP L + ++VS +L HI+DWLPT
Sbjct: 332 GGGPWGEHNSRGFNWPLRGAKGTVWEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTF 391
Query: 349 CAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA 407
+ AG LDG + W L+ G ++ R E+L+N D KY + + + +Y
Sbjct: 392 YSIAGGNAAKLVELDGHDMWRPLSHGRRSPRVELLYNYD-----YKYTKSAALRNHRYKL 446
Query: 408 GTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDA 467
D G+ + D P+ L A + A + K+K PK+
Sbjct: 447 VLDETGR----FADRYKVPGGSRPRHDLDYLAAKSTAASVLRDFYKKKQLNFPKS----- 497
Query: 468 LRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLK 527
R AR+ C K S LF++ DPCE NNLA +++
Sbjct: 498 ------------WRRKARLTCGKQKKINFSWSATSVYLFDVVTDPCELNNLASKLPNVVA 545
Query: 528 QLEDKLAIYKSTMVPP-GNKPFDKRADPARWNNIWVPW 564
L+ +L Y + +P N +D R+ P W PW
Sbjct: 546 SLKKRLDAYGAAALPERNNDTYDPRSFPENHGGTWAPW 583
>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 511
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 221/492 (44%), Positives = 285/492 (57%), Gaps = 47/492 (9%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GW+DVSFHGS+QIPTPN+D LA +G+ILN +YVQ CTPSR+ALMTG YPIH GMQHGV+
Sbjct: 2 GWDDVSFHGSAQIPTPNMDTLAADGIILNNYYVQPACTPSRAALMTGLYPIHTGMQHGVL 61
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
EP+GLPL+ ++PQYLK GY TH +GKW+LG ++ YTPTFRGFDS YGY+ ++D
Sbjct: 62 SPAEPYGLPLSVSIMPQYLKNLGYETHIVGKWNLGNYKLSYTPTFRGFDSFYGYYSAVED 121
Query: 161 YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
YY+H+ ++ GLD N Q GIYST LYTE +I + SKP FLYL +
Sbjct: 122 YYNHT--VLWDNQTGLDFWLNTQPLRNVSGIYSTQLYTERTKFLIENRDVSKPFFLYLPY 179
Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
AVH GN +P QAP E + KF I + R +AGM LDES+G V AL ML+N+
Sbjct: 180 QAVHCGNFDDPLQAPQENIDKFPYIGEENRTIFAGMTDALDESIGEVFRALFDADMLDNT 239
Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
I++ ADNG FG SN+G N PLRG K T W+GG R A +WSP LK+ VS ++ H
Sbjct: 240 IIVLSADNGGLPFGFESNRGYNFPLRGGKGTLWEGGCRASAFVWSPLLKRKGVVSDQMMH 299
Query: 341 ISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
I+DW+PTL +AAG LDG + W L G+ + R EIL+NID +D +ALR
Sbjct: 300 ITDWVPTLYSAAGGNPQTLGKLDGTDMWVPLAYGSASPRYEILYNIDPIDR----NSALR 355
Query: 400 VDDLKYVAGTDNNGQSDEW---------YGDTDNEIDKYSPKEVLYSKAGITFNALKTKL 450
+ K V G N G D+ YGD D + + +VL G + LK
Sbjct: 356 YTNYKLVLGIANGGAYDQRYLTTGGSRPYGDLDLLMAQSEAAQVLREYYGTGY--LKFPD 413
Query: 451 QIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITD 510
Q +Q+ A + C + + S P LF++T
Sbjct: 414 QWRQR----------------------------ATLTCG-NKVPKNFVSLSPPYLFDLTA 444
Query: 511 DPCEQNNLAESQ 522
DPCE NN+A SQ
Sbjct: 445 DPCEINNIANSQ 456
>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 548
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 230/560 (41%), Positives = 317/560 (56%), Gaps = 42/560 (7%)
Query: 13 CTLLFNDAFLNTTAPKK---PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
C+L+F L T A K PHI+ ILADDLGW+DVSFHGSSQIPTPNID LA +G+ L+
Sbjct: 21 CSLVF----LATVALAKKMPPHIVFILADDLGWDDVSFHGSSQIPTPNIDVLAADGITLH 76
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
+YVQ +CTPSR+ALMTG YPI GMQH VI EPWGLPL KL+P++LK+ GY+TH +
Sbjct: 77 NYYVQPMCTPSRAALMTGLYPIRTGMQHWVIRSPEPWGLPLELKLMPEHLKDLGYSTHLV 136
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
GK + + + + ++ + GLD R+ + +
Sbjct: 137 GKVLFDLKKFIVSVNRLCINESTEVCHTFVSAVTLCIYFVYKSHAGLDFRNGEEPFHNDT 196
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
G Y+T L+T+ AI++I +HN++KP+FLYL+HLA H EP QAPDE V KF I + +
Sbjct: 197 GQYATTLFTDRAISIIEQHNQTKPLFLYLSHLAPHGATHDEPLQAPDENVEKFDYIGEED 256
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
R YAGMV LD SVG V+ AL + GMLENSIV F +DNGA FG SN+G N PLRG+K
Sbjct: 257 RTIYAGMVDALDVSVGRVVEALSRAGMLENSIVAFSSDNGAVPFGFRSNRGFNWPLRGIK 316
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWD 368
++ W+GG+R +WSP L+++ ++S+++ HI+DWLPTL AAAG N LDG N W+
Sbjct: 317 ASVWEGGVRVPGFVWSPLLRKSARLSTQMMHITDWLPTLYAAAGGNTKNLGQLDGHNMWE 376
Query: 369 VLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK 428
L++G ++ R+EIL NID P A LR K + N G TD +
Sbjct: 377 ALSRGWRSPRTEILLNID----PITGSAGLRFGKHKLLIPGVN--------GTTDAHVPT 424
Query: 429 YS-PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF---A 484
P+ + T +Q ++ ++ LR + I R + A
Sbjct: 425 TGYPRPT---------EDMDTMMQ---------QSLAAKVLRRFYNVDGIWFRRGWENSA 466
Query: 485 RVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPG 544
V C ++ S LF+I DPCE NLA ++ ++ + KL Y + M+PP
Sbjct: 467 EVDCGPAEWQSNFVSGRVYNLFDIERDPCEMINLAATRRGTVRMMMRKLQSYAAAMIPPR 526
Query: 545 NKPFDKRADPARWNNIWVPW 564
NK D R+ P + +W W
Sbjct: 527 NKEEDPRSFPEKHGGVWTTW 546
>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 528
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 218/561 (38%), Positives = 304/561 (54%), Gaps = 83/561 (14%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A P+I++I+ DDLGW+D+S HGS QIPTPNID LA G++L +Y QA+CTPSR ALM
Sbjct: 17 AASSPNIVLIVIDDLGWDDISLHGSDQIPTPNIDKLAAEGVLLENYYTQAICTPSRGALM 76
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TGKYPIH+G+Q+ VI +P+GLP K++PQYL Y +H IGKWHLG R PT
Sbjct: 77 TGKYPIHLGLQYDVIQGAQPYGLPTDFKIMPQYLSGTCYKSHIIGKWHLGHSRSELLPTR 136
Query: 146 RGFDSHYGYWQGLQDYYDH----SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGF SH+G+ G DY++H S E Y GLD+ N K G Y+ DL+T+ A
Sbjct: 137 RGFHSHFGFRLGHSDYFNHWGEESSPVKNEMYAGLDLWSNEVPIKKYHGTYANDLFTKRA 196
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-DISDPERRTYAGMVSRL 260
I+++ HNK+ P+FLYLAH AVH G+ P QAP V KF I D +R+ YA MVS +
Sbjct: 197 ISILETHNKTTPLFLYLAHQAVHVGDGENPLQAPLSFVKKFEGKIYDEKRKHYAAMVSAM 256
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D S+G ++ + +G EN+I+ DNG P+ G+ S+ SNHPLRG K T W+GG RG
Sbjct: 257 DASIGELMHGISTNGFAENTIIFLTNDNGGPTNGMASSGSSNHPLRGCKYTLWEGGTRGS 316
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
A W P K + HI+D LPTL A+G + + DG + WD L+ A+ ++
Sbjct: 317 ALFWYP-KKLGAGTYKGMAHITDVLPTLLTASGNAVAG-NFDGYDLWDALSTNAQPRKET 374
Query: 381 I---------------LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
+ L+NID P + A+R D K + NG+ WY
Sbjct: 375 VEPQRTYGNEHDFHSLLYNID----PVTHSWAVRSGDFKLITDNALNGEYSGWY------ 424
Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
P S + RT
Sbjct: 425 ----------------------------------PTVESVNLTRT--------------- 435
Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
V+C +K C+ + +PCLFN+ DPCE+N++ + + D++++L KL Y+++M P N
Sbjct: 436 VKCGSASKA--CDGSKRPCLFNLAADPCEENDIFDERPDMVRKLMVKLNEYQTSMRPMEN 493
Query: 546 KPFDKRADPARWNNIWVPWYD 566
+P ADPAR++ +W+PW D
Sbjct: 494 RPNSAAADPARFDFVWMPWED 514
>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 513
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 221/569 (38%), Positives = 313/569 (55%), Gaps = 72/569 (12%)
Query: 6 KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
+ F L C F ++ + K P++++I+ DDLGW+D+ HGSSQIPTPNID LA
Sbjct: 3 RILFFLFCLRSFGTCDISGPSGETKAPNVVLIVVDDLGWDDIGLHGSSQIPTPNIDKLAE 62
Query: 64 NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
G++L+ +Y Q +CTPSR++LMTGKYP+ +G+QH VI P+GLP K++PQYL +
Sbjct: 63 EGVVLDNYYTQPICTPSRASLMTGKYPVRLGLQHDVISAATPFGLPSNFKIMPQYLHDKN 122
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH----SCKATFEPYQGLDMR 179
Y H +GKWHLG R + PT RGF H+GY G D+Y H + GLD+
Sbjct: 123 YDCHIVGKWHLGHSRSEFLPTRRGFKDHFGYRLGSSDHYSHYGADDSDVPGSLFYGLDLW 182
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
HN + G YSTD+YT + +++ HNKS+P+FLYLA+ AVHAGN + QAP V
Sbjct: 183 HNEVPAKEFNGKYSTDIYTHRSTDILRMHNKSRPLFLYLAYQAVHAGNPDQALQAPQSIV 242
Query: 240 AKF-LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
+F I + RR YA MVS +D ++GNV+ A+R +G N++V F DNG P +
Sbjct: 243 DRFSSSIRNDRRRRYAAMVSAVDTAIGNVMGAIRANGFAGNTLVFFTNDNGGPINANDRS 302
Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
GSN+PLR K T W+GG+RG W+P + + K L HI D LPT+ +AAG+ +
Sbjct: 303 PGSNYPLRAGKFTLWEGGVRGTGIFWAPQVLKPGKFGG-LSHIVDVLPTILSAAGMS-SV 360
Query: 359 TSLDGVNQWDVLT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
LDGV+ W L+ K ++ R+E+L NID VD + AA R+ D K + +NG D
Sbjct: 361 PELDGVSLWKSLSEKRSENPRTELLLNIDPVD----HTAAYRLGDFKLIK---SNGLMDG 413
Query: 418 WYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKI 477
W+ D AN S RTI
Sbjct: 414 WFRTPD-------------------------------------AANHS---RTI------ 427
Query: 478 LELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
+C D+ C +PCLFNI +DPCEQ+++ D++K++ ++L Y+
Sbjct: 428 ---------KCKLDDPVFICRGVDRPCLFNIAEDPCEQDDVYADNPDIVKEIMERLHRYR 478
Query: 538 STMVPPGNKPFDKRADPARWNNIWVPWYD 566
M P +K + ADPAR++ IW+PW D
Sbjct: 479 HEMHPVEDKNVTRLADPARFDYIWMPWED 507
>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 479
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 205/516 (39%), Positives = 292/516 (56%), Gaps = 57/516 (11%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
FF L C L + T P P+II ++ADDLGWNDVSFHGSSQIPTP+IDALA G+I
Sbjct: 13 FFLLGCML----ECVYTAQP--PNIIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVI 66
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +YV +CTP+RSA+MTGK+PIH G+Q+ VI+ EP+GL E ++PQYL+ GY TH
Sbjct: 67 LTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTH 126
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE-PYQGLDMRHNMQVDN 186
+GKWHLGF+ + +TP RGF+S +GY+ G QDY+ H + + G D N +
Sbjct: 127 MVGKWHLGFYSKEHTPIERGFESTFGYYLGQQDYFTHETQVKRKHTLTGFDFHVNGSIYK 186
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGN-TYEPFQAPDEEVAKFLDI 245
G YST++YTE +I HN +P+++YLAH AVH+ N + QAP + +F +I
Sbjct: 187 PVFGQYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSANYNGQRLQAPYKYYERFPNI 246
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
++ RR +A MVS LD+S+GN+ L++ + N++++F DNG P+ G +N +N PL
Sbjct: 247 TNENRRKFAAMVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGPAHGFDANYANNWPL 306
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGV 364
RG+K T W+GG+RG +W +++ + S + H+ DW+PTL AG + LDG+
Sbjct: 307 RGVKDTTWEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAGGNTSTLQHLDGI 366
Query: 365 NQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDN 424
+ W +L++ + R EILHNID V N +A+R+ D K V G + NG +WY
Sbjct: 367 DVWPMLSRAEPSPREEILHNIDPVRN----VSAIRIGDYKLVQGQNYNGSWSDWY----- 417
Query: 425 EIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFA 484
P+ SS + D K +
Sbjct: 418 ----------------------------------PPEGESS-----VDVDSKPVPNAFVV 438
Query: 485 RVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAE 520
N +C+ KPCLFNI DPCE NN+A+
Sbjct: 439 SCPSKPANASTNCDPKEKPCLFNIRHDPCEFNNIAD 474
>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
Length = 563
Score = 391 bits (1004), Expect = e-106, Method: Compositional matrix adjust.
Identities = 226/599 (37%), Positives = 326/599 (54%), Gaps = 77/599 (12%)
Query: 10 ALTCTLLFNDAFLNTTA----PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+ C L F+ T++ KKP+IIII+ADDLG+NDVSFHGSSQIPTPN+ +A G
Sbjct: 15 VMVCVLGLLLYFILTSSKPSTAKKPNIIIIIADDLGYNDVSFHGSSQIPTPNLAKMATRG 74
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+IL++ Y Q+ CTPSR+AL+TG+YPI GMQ + GE LPL +P + + GY
Sbjct: 75 IILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPLKAGENRSLPLNMPTMPLHFQNLGYK 134
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNM 182
TH +GKWHLG + TP +GFDSH+GYW G Y+D+ + + +GLD+
Sbjct: 135 THLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVGYFDYVSFSKMDNGTLVKGLDLHDQF 194
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-- 240
+ + G Y+T+L+TE +++VI H+ P+FL ++HLA H G PD +
Sbjct: 195 EPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNH 254
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
+F I DP RR YAG+VS LD S+G ++A L + ML+NSIVLF +DNGA + G++ N G
Sbjct: 255 EFSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVGMYENSG 314
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
SN PLRG+K + ++GG+R A I+SP + VS L HISDWLPTL +AAG ++
Sbjct: 315 SNWPLRGVKFSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAGGDVAHLG 374
Query: 361 -LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN-------- 411
+DG++QWD LT + R+EIL NID VD + +A +R D K + GT N
Sbjct: 375 QIDGIDQWDALTNNNPSNRTEILINIDEVD---ENFAIIR-DKFKLIQGTPNYYYQQTLL 430
Query: 412 ---------NGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKA 462
G D++YGD+ + +P +P
Sbjct: 431 FNCKSGSYHEGTFDQYYGDSGRGPENPTP---------------------------NPNH 463
Query: 463 NSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQ 522
++D C ++ N T K CLF++ DPCE N+ ES+
Sbjct: 464 TTTDL------------------SWCRAPDQTPILNCT-KGCLFDLDKDPCETTNIIESE 504
Query: 523 TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHK 581
++ QL +K+A + +VP NK D ++DP +NN W W ++ K R+ +
Sbjct: 505 PEIANQLYEKIAQFWKELVPQRNKDTDPKSDPIFYNNTWCTWLEDASCHKTTNIERISR 563
>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
Length = 380
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 189/365 (51%), Positives = 255/365 (69%), Gaps = 7/365 (1%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A KPHII+I+ADDLGW+DVSFHGS QIPTPN+D LA G+ILN +YV +CTP+R++LM
Sbjct: 1 AQNKPHIIVIVADDLGWDDVSFHGSPQIPTPNLDYLATRGVILNNYYVSPICTPTRASLM 60
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TGKYPIH+GMQH VI +P+GLPL E LPQYL+ GY T IGKWHLGFF + YTPT+
Sbjct: 61 TGKYPIHLGMQHFVIYAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTY 120
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFE-PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFDS YG W DY++H+ +FE + G DMR+NM+ Y+T+++T A+ V
Sbjct: 121 RGFDSFYGMWSAKADYWNHT---SFENGFWGTDMRNNMEPVTTDKDKYATEVFTREALKV 177
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP-ERRTYAGMVSRLDES 263
I HNKS+P+FLY+AH A H+ N ++P QAP+++V KF + D ER+ YA MV+ +D+S
Sbjct: 178 IENHNKSEPLFLYIAHQAPHSANPHDPLQAPEDKVKKFSGVIDKIERQQYAAMVTCVDDS 237
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+G V AL K+ ML NS++LF DNG G + N+GSN+PLRG K W+GG+RG A I
Sbjct: 238 IGEVFRALEKNRMLNNSVILFTTDNGGAPHGFNRNQGSNYPLRGGKDMMWEGGVRGTAFI 297
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLDGVNQWDVLTKGAKTKRSEI 381
+S +K +VS++L ++DW+PTL AG + +DG N W+ ++ + R E+
Sbjct: 298 YSDLIKHKGRVSTDLIDVTDWVPTLYYLAGGTPGYLEPHMDGKNVWETISNKEPSPRKEV 357
Query: 382 LHNID 386
LHNID
Sbjct: 358 LHNID 362
>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
Length = 784
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 220/552 (39%), Positives = 303/552 (54%), Gaps = 70/552 (12%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GW+DVSFHGS QIPTPNID+LA +G+ILN +YV +++ MTGKY H+G QHGV+
Sbjct: 1 GWDDVSFHGSPQIPTPNIDSLAKSGVILNNYYVSPSSFATKTEFMTGKYATHLGTQHGVL 60
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL-- 158
+P+GLP TEK+LPQYLKEAGY +A+GKW LG+++E P RGFD YG GL
Sbjct: 61 HNKQPFGLPHTEKILPQYLKEAGYNNYAVGKWALGYYKEEMLPWKRGFDFFYG---GLTS 117
Query: 159 --QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFL 216
+DYY HS F+ GLD+R N +V + G Y T++YT A+N+I +N +KP+FL
Sbjct: 118 SGKDYYTHSA---FDENYGLDLRRNNEVIHNETGNYITEVYTREAVNIIKNYNDNKPLFL 174
Query: 217 YLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGM 276
Y+AH AVH GN +P QAP+ + K I + +R+ +AGMV LDESV N+ AL + G+
Sbjct: 175 YVAHQAVHTGNADDPLQAPESYLKKLNHIKNIKRKLFAGMVLALDESVLNITIALAQKGL 234
Query: 277 LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
L+N++++F D G G + SN PLRG K T W+GG+R VA ++S +K +VS
Sbjct: 235 LDNTVIVFTTDAGGAVGGQELSSASNFPLRGSKLTVWEGGIRAVAFVYSDLIKNKGRVSL 294
Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYA 396
E++H +DWL T AGI +DG + W +++G + R EILH D + K
Sbjct: 295 EMYHSTDWLLTTLGLAGINTKGIKVDGFDVWKSISEGEVSPRFEILHQFDPL---TKNLC 351
Query: 397 ALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKH 456
+LRV D K V D G +WY
Sbjct: 352 SLRVGDYKLVINQD-IGFYGDWY------------------------------------- 373
Query: 457 AADPKANSSDALRTILTDEKILELREFARVRC--NY-----DNKGAHCNSTVKPCLFNIT 509
P+ LR + K +E A+V C NY + C+ KPC+FNI
Sbjct: 374 ---PRPAEIGELRYL----KRIETLPNAKVTCDKNYPHPFLEMHAPPCDPMKKPCMFNIQ 426
Query: 510 DDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDELD 569
DPCE +NLAE + LK L D+L Y+ T+V P D +ADP + +W PW +
Sbjct: 427 WDPCEFHNLAEFMPNTLKVLLDRLQFYRLTLVKPTYPSVDSQADPENSDGVWSPWVNS-- 484
Query: 570 KQKAIETMRLHK 581
+I M H+
Sbjct: 485 ---SIALMTFHQ 493
>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 557
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 208/497 (41%), Positives = 302/497 (60%), Gaps = 29/497 (5%)
Query: 27 PKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
PKKP ++++ILADD+GWND SFHGS++IPTPN+DALA +G+IL HY Q +CTP+R+AL+
Sbjct: 76 PKKPTNVVMILADDMGWNDASFHGSAEIPTPNLDALASSGVILQSHYAQPMCTPTRAALL 135
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG YP H GMQ+ VI GEPWGLPL K+LP YL EA Y +H +GKWHLG + PT
Sbjct: 136 TGLYPFHTGMQNFVIRTGEPWGLPLDYKILPHYLDEAYYHSHLVGKWHLGMHNPAFLPTA 195
Query: 146 RGFDSHYGYWQGLQDYYDHS-CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
R F++H GY+ G DY+ H + GLD H + + G Y+T L+T+ A+N+
Sbjct: 196 RHFNTHVGYYNGFIDYFTHEHISPGNDSLIGLDW-HINEENENEEG-YATHLFTKRAVNL 253
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I H ++P+F+ L+HLA HAG +PFQAP E + KF I D R+ YA MV LD+SV
Sbjct: 254 IENHKSTEPLFILLSHLAPHAGCKRDPFQAPRESIEKFAHIKDQNRKVYAAMVDELDQSV 313
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G V+ AL + +L++++++F++DNG + G+ +N GSN+P RG K T ++GG R A +W
Sbjct: 314 GQVVEALYRKRLLDSTMIVFLSDNGGQTTGVMNNTGSNYPFRGQKRTLFEGGTRVSAFVW 373
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHN 384
S + + ++ S L H++DWLPT+ AG+ LDG +QW +L++GA + R+E+L+N
Sbjct: 374 STDIVKKPRIESGLMHVTDWLPTILKRAGLSY-PRELDGRDQWSMLSEGAPSARAEVLYN 432
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQS--DEWYGDTDNEIDKYSPKEVLYSKAGIT 442
++ VD AALR D K + G NG S ++W D ++ + +T
Sbjct: 433 LNYVDQS----AALRNDRFKLIVGP-RNGVSLLNDW--DQLHKGKHSRAQLDGMMDESLT 485
Query: 443 FNALK-TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
LK L K +H A+ S R+ A ++C +K C+
Sbjct: 486 ARTLKEAGLWWKMRHGDTLSADHS-------------TWRDKAEIKCGSHDKTL-CDVHR 531
Query: 502 KPCLFNITDDPCEQNNL 518
PCLF++ +DPCE NN+
Sbjct: 532 SPCLFDLENDPCELNNV 548
>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
Length = 584
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 220/569 (38%), Positives = 305/569 (53%), Gaps = 100/569 (17%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
KKPHIIII+ADDLGW+DVSFHGS QIPTP ID A G+ILN +YV +CTPSR+++MTG
Sbjct: 25 KKPHIIIIVADDLGWDDVSFHGSPQIPTPYIDFYANRGVILNNYYVSPMCTPSRASMMTG 84
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
KYPI++GM WHLGFF + YTP +RG
Sbjct: 85 KYPINLGM------------------------------------WHLGFFTKEYTPVYRG 108
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS YG+W DY++HS + G+D+R NM+ G Y T+L+T A+ VI
Sbjct: 109 FDSFYGFWNAKTDYWNHSSYEN--NFWGVDLRDNMEPVQSEDGTYGTELFTREAVKVIEA 166
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD---------ISDPERRTYAGMVS 258
H+ S P+FLY+AH AVH N EP QAP +++ L I D +R+ YA MV+
Sbjct: 167 HDTSTPLFLYVAHQAVHTANPNEPLQAPQDKIDVSLKQRQQRFKGTIDDDQRQVYAAMVT 226
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LD+SVG++ AAL K ML +S+V+F DNG +G++ N+GSN PLRG K W+GG++
Sbjct: 227 SLDQSVGDIFAALSKRHMLRDSVVIFTTDNGGAPYGLNWNRGSNFPLRGGKDMLWEGGVK 286
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKT 376
GVA ++S +KQ +VS EL ++DW+PT+ AG E ++DG N W +++GA +
Sbjct: 287 GVAFVYSDLIKQKGRVSKELIDVTDWVPTIYHLAGGTAEFLVPNMDGKNVWSTISEGAPS 346
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
R EILHNID P + +A LR K V G D+ + WY D+Y
Sbjct: 347 PRDEILHNID----PWRKFAGLRKGKYKIVQGMDDTYKGVGWY-------DRYP------ 389
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC--NYDNKG 494
HA L+ K EL A + C +D +
Sbjct: 390 ------------------GHA--------------LSSMKQPELLPGAVIDCKKTFDEER 417
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
+S K CLF++ +DPCE ++L+ ++L +++ +L YK+ +PP P +K A+P
Sbjct: 418 KCDSSDGKFCLFDMEEDPCEYHDLSNQLPEVLAEMKTRLEYYKNIALPPWFPPINKAANP 477
Query: 555 ARWNNIWVPWYDELDKQKAIETMRLHKPL 583
A + W PW DK I R K L
Sbjct: 478 ANFGGFWSPWKRLKDKSSVINVKRALKKL 506
>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
Length = 513
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 219/539 (40%), Positives = 308/539 (57%), Gaps = 42/539 (7%)
Query: 36 LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM 95
+ADD+G NDV F+GS QIPTP+IDALAYNG+IL++ Y Q CTPSR+AL+TG+YPI +GM
Sbjct: 1 MADDVGRNDVGFYGSGQIPTPSIDALAYNGIILDRFYTQCSCTPSRAALLTGQYPIRLGM 60
Query: 96 QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
Q I GE LPL +PQYLK GY TH +GKWHLG+ TP RGFDSH+GYW
Sbjct: 61 QGLPIRAGENKSLPLDVVTMPQYLKRLGYKTHLVGKWHLGYAHIEDTPLQRGFDSHFGYW 120
Query: 156 QGLQDYYDHSCKATFE-----PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK 210
G Y++++ A +E +G D+ + + G Y+TDL+T A+ +I EHN
Sbjct: 121 NGFVGYFNYT--AVYELANDTMVKGFDLFDGVVPAWQERGKYATDLFTHKAMKIIDEHNS 178
Query: 211 SKPMFLYLAHLAVHAGNTYEPFQAPDEEVA--KFLDISDPERRTYAGMVSRLDESVGNVI 268
KP+FL LAHLA H G PD A +F I +P+RR YA +VSRLD+SVG ++
Sbjct: 179 EKPLFLVLAHLAGHTGEDGVELGVPDVAQAETRFSFIKNPKRRLYADIVSRLDDSVGQIV 238
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
L + +L+NSI+LF++DNGA + G++ N GSN PLRG+K T ++GG+R AAI+SP
Sbjct: 239 EKLDEKNLLDNSIILFLSDNGAQTIGVYENSGSNWPLRGLKLTEFEGGIRAPAAIYSPLF 298
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDN 387
Q VS EL HI+D+LPT AAAG + +DGVNQW+VL++ KR EIL N +
Sbjct: 299 HQKGYVSKELIHITDFLPTFYAAAGGNLEYLGKIDGVNQWEVLSQQKPGKRDEILVNFNE 358
Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALK 447
+DN +A+ + K V G + +++GD+ +P++ Y+ F A+
Sbjct: 359 IDN----VSAIIMGQYKLVQGRLAFPEYGDYHGDSGR-----NPEDPPYN-----FTAVL 404
Query: 448 TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
+K I Q++ KI LR + K + CLFN
Sbjct: 405 SK-NINQQNR-----------------NKIESLRNQTDLSWCRGEKNQPLIDCTEGCLFN 446
Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
+ DPCE N+ + + + +L +L + +VP FD ++P +N W W D
Sbjct: 447 LESDPCETTNIIQEEKIIAGKLLGRLFEFSQILVPQQVAIFDPESNPEFYNGTWCTWLD 505
>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
Length = 531
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 225/556 (40%), Positives = 312/556 (56%), Gaps = 46/556 (8%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F ++ C + + T K PHI+ IL DDLGW D S HGS+QIPTPN+DALA G++
Sbjct: 7 FISVACV---AGSSVKTPFRKPPHIVFILVDDLGWADTSLHGSAQIPTPNLDALASTGVL 63
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
LN +YVQ LC+PSR ALMTG YP H G++ ++ + GLPL K+LP++LK+ GY TH
Sbjct: 64 LNNYYVQPLCSPSRGALMTGLYPAHNGIRMPLV-GAQVAGLPLQFKILPEHLKDLGYETH 122
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLG+F YTPT+RGFD+ +G+ G DYY + E + GLD +
Sbjct: 123 IVGKWHLGYFNLNYTPTYRGFDTFFGFHNGPIDYYRGIMEQ--EGHVGLDFWNGTSALPL 180
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
Y+T A ++IA N SKP+FLYLAH AVH+ + E QAP E KF I D
Sbjct: 181 KERTYATARLQNHAKSIIANRNTSKPLFLYLAHQAVHSVYSPEFLQAPVENTKKFPYIRD 240
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLR 306
R+ AGM+ LD+SVG ++ L K G+LE++I++F DNG P N+G N PLR
Sbjct: 241 SSRKILAGMMDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRADPNRGFNWPLR 300
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVN 365
G+K T W+GG+RG A IWSP L+ +++VS++L HISDWLPTL +AAG ++ LDG++
Sbjct: 301 GIKGTVWEGGIRGTAFIWSPRLEVSRRVSTQLMHISDWLPTLYSAAGGDVQTLGPLDGID 360
Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
W L+ G + R E+L +N + AALR + K V GT ++ S +
Sbjct: 361 MWQSLSTGGPSPRIEVLLEFNN----ETDRAALRYLNHKIVLGTFSDEVSKRY------- 409
Query: 426 IDKYSPKEVLYS-KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL------ 478
EV+ + GI +AL+ + + L+ + ++
Sbjct: 410 -------EVIGGVRPGIDLDALQR------------NSKAGRVLKRFYGNPQLFLHGFDK 450
Query: 479 ELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKS 538
R A VRCN + + NS+ K LF++ DPCE NLA+S LL L KL Y
Sbjct: 451 SWRRRATVRCNEHGR-TNFNSSSKVFLFDLESDPCELRNLADSNVTLLTDLFVKLGAYLR 509
Query: 539 TMVPPGNKPFDKRADP 554
T+ P NK D + P
Sbjct: 510 TLRPEINKEIDPKGFP 525
>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
Length = 324
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 177/304 (58%), Positives = 223/304 (73%), Gaps = 1/304 (0%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F + LLF T P P+IIIIL DD+G+NDVSFHGS+QI TPNIDA AYNG+I
Sbjct: 6 FTTILSALLFLRQVAGVTQPTNPNIIIILIDDMGFNDVSFHGSNQILTPNIDAFAYNGVI 65
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
LN++YV LCTPSR+AL+TGKYPIH GMQH V + EPWGLPL E+L+PQ+ ++AGY+T
Sbjct: 66 LNRYYVPNLCTPSRAALLTGKYPIHNGMQHFVQIPDEPWGLPLGERLMPQFFRDAGYSTQ 125
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDN 186
+GKWHLGF+R+ +TP RGFD H+GY+ G DYYDH+ Y G D R ++Q +
Sbjct: 126 LVGKWHLGFWRQDHTPIMRGFDHHFGYYNGYIDYYDHTHYMLDRNYTAGADFRRDLQRCH 185
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
G Y+T+ +T+ A +I +H+ S+P+F+ L+HLAVH GN +P QAP EEVAKF+ IS
Sbjct: 186 SDNGTYATEAFTKEARRIIEQHDLSRPLFMVLSHLAVHTGNENQPMQAPYEEVAKFVHIS 245
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
DP+RRTYAGMVS LD+SVG I AL GML NSIVL +DNGAP+ GIHSN GSNHP R
Sbjct: 246 DPKRRTYAGMVSSLDKSVGQTIRALSDRGMLNNSIVLLYSDNGAPTLGIHSNSGSNHPFR 305
Query: 307 GMKS 310
G+ +
Sbjct: 306 GVST 309
>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
Length = 500
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 221/526 (42%), Positives = 307/526 (58%), Gaps = 34/526 (6%)
Query: 11 LTCTLLFNDAFLNTTA-----PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
L+C L + A + T+ P PHI+ IL DDLGW D S HGS+QIPTPN+DALA G
Sbjct: 2 LSCLLFISVACVTGTSAKTPYPTPPHIVFILVDDLGWADTSLHGSAQIPTPNLDALASTG 61
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
++LN +YVQ LC+PSR+ALMTG YP H G++ ++ + GLPL K+LP++LK+ GY
Sbjct: 62 VLLNNYYVQPLCSPSRAALMTGLYPAHNGIRM-PLMGAQVAGLPLQFKILPEHLKDLGYE 120
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
TH +GKWHLG YTPT+RGFD+ +G++ G DYY + E + GLD + +
Sbjct: 121 THMVGKWHLGHSSLNYTPTYRGFDTFFGFYNGPIDYYHGIMEQ--EGHIGLDFWNGTRAL 178
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
IY+T + + A +IA N SKP+FLYLAH AVH+ E QAP E KF +
Sbjct: 179 PLEERIYATTRFQDHANYIIANRNASKPLFLYLAHQAVHSAYEPEFLQAPGENTKKFPFL 238
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHP 304
D R++ AGMV LD+SVG ++ L K G+LE++I++F DNG P N+G N P
Sbjct: 239 GDASRKSLAGMVDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRSDPNRGFNWP 298
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDG 363
LRG+K T W+GG+RG A IWSP L+ + +VS++L HISDWLPTL +AAG + N LDG
Sbjct: 299 LRGIKGTVWEGGIRGTAFIWSPRLELSSRVSTQLMHISDWLPTLYSAAGGNVQNLGRLDG 358
Query: 364 VNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
++ W L+ G + R+E+L +N + AALR + K V G +N S+ +
Sbjct: 359 IDMWQSLSTGGPSPRTEVLLEFNN----ETDRAALRYLNHKIVLGIYSNEVSERY----- 409
Query: 424 NEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF 483
+ V + GI + L+ + + A N LR D+ R
Sbjct: 410 --------EVVGGVRPGIDLDVLQRNSK-AGRVLARFYGNPQLFLRGF--DKS---WRSR 455
Query: 484 ARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQL 529
A VRCN +N + NS+ + LF++ DPCE +NLA+S LL L
Sbjct: 456 ATVRCN-ENGPTNFNSSSQVFLFDLETDPCELHNLADSNLTLLTDL 500
>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
Length = 488
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 219/556 (39%), Positives = 304/556 (54%), Gaps = 76/556 (13%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
AL+ LL + L+ PHI+ IL DDLGW D+ +HGS I TPNI+ LA +G+IL+
Sbjct: 6 ALSLVLL-SAQILSEANAIPPHIVFILVDDLGWFDLGYHGSV-IRTPNINQLAGDGIILD 63
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
+YVQ LCTP+RSALMTGKYPIH+G QHGVIL G+P GLPL LP+ LK+ GYATH +
Sbjct: 64 NYYVQPLCTPTRSALMTGKYPIHLGTQHGVILPGQPMGLPLDSSTLPEQLKQQGYATHIV 123
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
GKWHLGF++E + PT RGFDS YGY+ G +D++ H+ LD R N +
Sbjct: 124 GKWHLGFYKEDFVPTKRGFDSFYGYYCGAEDHFTHNVLGF------LDFRDNDLIVKDQK 177
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
G Y T +T+ A++ I HN S P+FLYL VH P QAP E + K+ I D
Sbjct: 178 GTYGTRAFTKRAVDTIHRHNSSSPLFLYLPFQNVHG-----PVQAPPEYIDKYSFIKDKT 232
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
RRT+A MV +DE++GNV +AL+ G+ EN++++F DNG GIH+ G N+PLRG K
Sbjct: 233 RRTHAAMVDIMDEAIGNVTSALKSAGLWENTLLVFSTDNG----GIHTAGGYNYPLRGEK 288
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDV 369
+T W+GG+RG + P + + + L H++DW PTL AG + D SLDGV+ WD
Sbjct: 289 NTLWEGGVRGAGFVSGPMAPRHGMIYNGLMHVTDWYPTLVHLAGGSMQD-SLDGVDLWDA 347
Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKY 429
L + + R EILHNID L++D
Sbjct: 348 LKQDTASPRKEILHNID-----------LKID---------------------------- 368
Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN 489
P ++ G+ + KL +K +A T L ELRE + N
Sbjct: 369 VPTAFVFE--GVALRSRDMKLLLKVPNA------------TWLVPP---ELRE-PGITAN 410
Query: 490 YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD 549
A ++ L+NIT DP E+++L+ D++ +++ ++ Y+ +VPP K D
Sbjct: 411 EGLGSAKMQEVIEVALYNITADPTERHDLSGKFPDIVNEMKKRVDFYRGGLVPPIIKKND 470
Query: 550 KRAD-PARWNNIWVPW 564
+A A N W PW
Sbjct: 471 PKAVLTAIKNGAWSPW 486
>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 490
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 216/521 (41%), Positives = 292/521 (56%), Gaps = 50/521 (9%)
Query: 11 LTCTLLFNDAFLNTTA----PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ C +L + FL T K+P+I+ IL DDLGW DVSFHGS+QIPTPNID LA +G+
Sbjct: 3 VLCRILLDAVFLLGTILQCHAKRPNIVFILGDDLGWGDVSFHGSTQIPTPNIDVLAGDGV 62
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
ILN +YV LCTPSR+ALMTG YPI GMQ I PWGLPL K+LPQ+ K+ GY
Sbjct: 63 ILNNYYVLPLCTPSRAALMTGLYPIRNGMQLTSIQAAGPWGLPLENKILPQHFKDLGYDV 122
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
+ IGKWHLGFF+ Y P RGFD+ +G++ G DYY+H+ ++ + L + ++QV+
Sbjct: 123 NMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSNDYYNHTSGSSHR--KILAVTSSVQVNT 180
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPM-------FLYLAHLAVHAGNTYEPFQAPDEEV 239
G S L+ ++V H P+ F Y++H AVH E FQAP V
Sbjct: 181 LEKGRLS--LWGPRELSVCFLHQIYSPLNFYLQPFFCYISHQAVHHALNAEMFQAPARNV 238
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
KF I +P R YAGM+ LDES+G V+ AL G+LE++I++F +DNG +G+ SN+
Sbjct: 239 LKFSYIGEPNRTIYAGMLDALDESMGRVVEALSSAGILEDTIIVFSSDNGGSPYGLESNR 298
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
G N PLRG K T W+G +R A +W+P L + +VS++L HISDWLPTL AAG +
Sbjct: 299 GYNWPLRGAKFTLWEGSVRVPAFVWNPKLLKKSRVSNQLMHISDWLPTLYTAAGGNASAL 358
Query: 360 -SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
+DGV+ W LT G+ +KR EIL NID++ N + LR K V G +G+ D+
Sbjct: 359 GGIDGVDMWHSLTHGSASKRKEILLNIDSLYN----ISGLRYKQYKLVVGGGFDGELDDH 414
Query: 419 Y---GDT--DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILT 473
Y G T N ID+ + T + +L KQ + P A S +
Sbjct: 415 YYFPGGTRPTNNIDQLRNQS--------TMARVLKELYEKQDRSWSPLAWSKN------- 459
Query: 474 DEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCE 514
V C + + P LFNI +DPCE
Sbjct: 460 ----------VAVDCRRNRLTENFVPMQPPYLFNIEEDPCE 490
>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
Length = 354
Score = 365 bits (937), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 188/363 (51%), Positives = 243/363 (66%), Gaps = 17/363 (4%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHII ILADD GW+DV FHGS +IPTPN+DALA +G+IL+ HY Q LCTPSR +L+TGK+
Sbjct: 1 PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKH 60
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI IG+Q GVI +P+GL L EKLLP+YLK GY +H +GKWHLGFF + YTP RGFD
Sbjct: 61 PIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFD 120
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
SHYG++ +DY H GLD N Q D G YST L+T A ++AEHN
Sbjct: 121 SHYGFYGASEDYMTH-----IGGMGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHN 175
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQ---APDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
+++PMFLY +H AVH + E AP + KF I+D E + AGM+S LDESVGN
Sbjct: 176 QTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGN 235
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---GSNHPLRGMKSTPWDGGMRGVAAI 323
+ +L +GMLEN+I++F +DNGAP H N SN+PLRG KS+ W+GG R A +
Sbjct: 236 LTKSLHANGMLENTIIIFSSDNGAP----HGNTDVCSSNYPLRGAKSSVWEGGTRVPAFV 291
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD--GVNQWDVLTKGAKTKRSEI 381
WSP LK++ VS ++ HISDWLPTL AAG +++ D GV+QW L + + R+ +
Sbjct: 292 WSPLLKKSGYVSKQMMHISDWLPTLLKAAGYDMSALPGDLYGVSQWQALQENGPSARNSM 351
Query: 382 LHN 384
L+N
Sbjct: 352 LYN 354
>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
Length = 492
Score = 364 bits (934), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 213/544 (39%), Positives = 305/544 (56%), Gaps = 73/544 (13%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KPHI+ ILADD GWND+ +H SS I TPN+D LA G+ L +Y+Q +C+PSR+ LMTG
Sbjct: 14 RKPHIVFILADDYGWNDIGYH-SSLIQTPNLDRLAQEGVKLENYYIQPICSPSRAQLMTG 72
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y I GMQH V++ P GLPL E LPQ LKE+GYATH +GKWHLG F++ Y PT+RG
Sbjct: 73 RYQIRYGMQHSVLMSDRPHGLPLGEVTLPQVLKESGYATHIVGKWHLGHFKKEYLPTWRG 132
Query: 148 FDSHYGYWQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
FD+ +G+ G +DY+ H T E Y+ D + G YST ++ +I+
Sbjct: 133 FDTFFGFLGGGEDYFTHRIPNEIVETPETYRAFDFWDGSKPCLSENGSYSTHVFARKSID 192
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+I+ H+K KPMFLYL AVHA P +AP+E + K+ I RTYAG+ + +DE+
Sbjct: 193 LISRHDKDKPMFLYLPFQAVHA-----PLEAPEEFINKYTHIRSKNMRTYAGVTTAMDEA 247
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VGN+ AL+ HGM EN++++F DNGA S GSN PLRG K+T W+GG+RGV +
Sbjct: 248 VGNITRALKDHGMWENTVLIFSTDNGA------SKAGSNWPLRGFKNTLWEGGVRGVGFV 301
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEIL 382
SP LK Q++S L HISDW PT+ AG T LDG + WD +++G + R+EIL
Sbjct: 302 SSPLLKVKQRISDALLHISDWFPTIVRIAGGSTFGTKPLDGYDVWDTISEGTPSPRTEIL 361
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
HNID + +A T+++ ++I +S T
Sbjct: 362 HNIDPL-----------------IAATNSDA----------SKIHGHS-----------T 383
Query: 443 FN-ALKTKLQIKQ-KHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
FN ++K ++++ K N L+T D KI E R + +
Sbjct: 384 FNTSVKAAIRVRDWKLLTGNPGNGDWQLQTF--DGKIKEKSTGVRGKQLW---------- 431
Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
LFNI DP E+ ++++ +++++L KL+ Y T VP D R++PA ++
Sbjct: 432 ----LFNIRSDPQERKDVSDEYPEVVQRLLTKLSAYNQTAVPVYWPDIDPRSNPALHGDL 487
Query: 561 WVPW 564
PW
Sbjct: 488 VGPW 491
>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
Length = 528
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 222/567 (39%), Positives = 301/567 (53%), Gaps = 60/567 (10%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
A LL FL P PHI+ ILADD GWNDV +HGS +I TPN+D L+ G+ L
Sbjct: 10 IATAIILLSCPLFLEGKQP--PHIVFILADDFGWNDVGYHGS-EIKTPNLDKLSAKGVRL 66
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
+YVQ LCTPSR+ LMTG+Y I GMQH +I +P+ +PL EKLLPQ ++EAGYATH
Sbjct: 67 ENYYVQPLCTPSRNQLMTGRYQIRTGMQHQIIWPCQPYCVPLDEKLLPQLMREAGYATHM 126
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQV 184
+GKWHLG +R+ PT RGFDS++GY G +DY+ H + P +D+R +V
Sbjct: 127 VGKWHLGMYRKDCLPTRRGFDSYFGYLTGSEDYFSHQRCSYVPPLNVTRCAVDLREGEEV 186
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
G YST L+T+ ++IA+ N KP+FLY+A AVHA P Q P+ VA +
Sbjct: 187 ATGYTGTYSTQLFTDRVTSIIAKQNSKKPLFLYVALQAVHA-----PLQVPERYVAPYSF 241
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I DP RR YAGMVS +DE+VGN+ AL++ G+ +N++++F DNG + GSN P
Sbjct: 242 IKDPNRRLYAGMVSAMDEAVGNITLALQETGLWDNTVLVFSTDNGGQTLA----GGSNWP 297
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDG 363
LRG K + W+GG+RGV + SP LKQ VS +L HISDWLPTL AG N T LDG
Sbjct: 298 LRGRKWSLWEGGVRGVGFVTSPLLKQPGTVSRQLIHISDWLPTLVGLAGGSTNGTKPLDG 357
Query: 364 VNQWDVLTKGAKTKRSEILHNIDNV-----DNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
N W+ ++KG + R E+LHNID + P L V + G
Sbjct: 358 YNVWNAISKGYASPRLELLHNIDPLYVDIAPCPGSASQRLPVSNAPVSGG---------- 407
Query: 419 YGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL 478
G T N P S ++ +A +Q K +LT
Sbjct: 408 -GSTSN------PWSWSNSGFNVSIHAAIRSMQWK-----------------LLTGYPGC 443
Query: 479 ELREFARVRCNYDNKGAHCNSTVKPC-LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
+ R F R + + +KP LF++ DP E++ +++ +++ L +L Y
Sbjct: 444 D-RWFTRPGLENTSGTIRSVNPLKPVMLFDVEADPEERDEVSDQHPNVVDNLLRRLGHYL 502
Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW 564
P D R DP W PW
Sbjct: 503 RRSQPITFPDDDPRCDPG--TGAWGPW 527
>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
Length = 458
Score = 361 bits (926), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 185/369 (50%), Positives = 242/369 (65%), Gaps = 15/369 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A + PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS
Sbjct: 40 SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TG+Y IH+G+QH +I+ +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ P
Sbjct: 99 LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
T RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A VIA H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE+VGNV AL+ HG+ N++ +F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
+ SP LKQ S EL HISDWLPTL AG N T LDG N W +++G + R
Sbjct: 330 TGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 389
Query: 379 SEILHNIDN 387
E+LHNID
Sbjct: 390 VELLHNIDQ 398
>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
Length = 520
Score = 360 bits (925), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 210/546 (38%), Positives = 298/546 (54%), Gaps = 54/546 (9%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
T A +PHI+ I+ADD GWNDV +H +S+I TPN+D L+ G+ L +YVQ LC+PSR+
Sbjct: 23 TLAAPRPHIVFIMADDFGWNDVGYH-NSEIKTPNLDLLSAKGVRLQNYYVQPLCSPSRNQ 81
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
LMTG+Y IH GMQH +I +P+ +PL EKLLPQ LKEAGYATH +GKWHLG +++ P
Sbjct: 82 LMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYKKDCLP 141
Query: 144 TFRGFDSHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
T RGFDS++GY+ G +DYY H+ A LD+R +V G YST+L+++
Sbjct: 142 THRGFDSYFGYYLGSEDYYTHTRCYPITALNLTRCALDLRDGEEVATAYKGAYSTELFSQ 201
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A++VIA+HN S+P+FLY+A AVH EP Q P+ V + I D RR YAGMVS
Sbjct: 202 RAVSVIAKHNASQPLFLYVAMQAVH-----EPLQVPERYVTPYSFIKDVSRRKYAGMVSA 256
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE+VGN+ ALR+ G+ +N++++F DNG G GSN PLRG K W+GG+RG
Sbjct: 257 MDEAVGNISLALRQAGLWDNTVLVFSTDNG----GQTRAGGSNWPLRGRKWALWEGGIRG 312
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
V + SP L+ V++EL HISDWLPTL AG + T LDG N W+ ++KG + R
Sbjct: 313 VGFVSSPLLENPGTVNNELMHISDWLPTLVRLAGGSTSGTKPLDGFNMWNTISKGFASPR 372
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
E+LHNID + Y + T G E + + + ++ L K
Sbjct: 373 LELLHNIDTL------YEDTSPCPGRQQQLTLAQGFRGETWARSSFNVSIHAAIRYLNWK 426
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
+ T + + ++ + S D L+ ++
Sbjct: 427 LLTGYPGCPTWFPRPEHNTSESSSTSVDPLKPVM-------------------------- 460
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
LF+I DP E+N ++ ++ L +L + +P D R DP +
Sbjct: 461 ------LFDIQKDPEERNEVSAQFPAVVDYLLTRLQSHLKNSLPINFPDDDPRCDPGP-S 513
Query: 559 NIWVPW 564
W PW
Sbjct: 514 GAWGPW 519
>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
Length = 491
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 199/478 (41%), Positives = 288/478 (60%), Gaps = 48/478 (10%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
L N L ++A KPH++ +LADDLGW+DV FHGS +I TPNID LA NG+IL+ +YVQ
Sbjct: 12 LCLNVVVLQSSA--KPHLLFVLADDLGWSDVGFHGS-KIQTPNIDRLAANGVILDNYYVQ 68
Query: 75 ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
+CTP+R++LMTGKYPIH G+QHG+I G P+GLPL LLPQ L++AGY+TH +GKWHL
Sbjct: 69 PVCTPTRASLMTGKYPIHTGLQHGIIHNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHL 128
Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
GF+ TPT+RGFD+ YG++ G +++Y H LD+R N ++ G YS
Sbjct: 129 GFYNWESTPTYRGFDTFYGFYSGAENHYTHVQD------HYLDLRDNEEIVRDQNGTYSA 182
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
L+T+ A ++ H+ S P+F+Y+A VH+ P QAP E + ++ I DP RRTYA
Sbjct: 183 HLFTKRAEQIVRAHDPSTPLFMYMAFQNVHS-----PVQAPKEYIDRYSFIKDPLRRTYA 237
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
MV+ +D+++GN+ A K G+ EN+I++F DNG G+ N G ++PLRG K T W+
Sbjct: 238 AMVTIMDDALGNLTRAFDKAGLWENTILIFSTDNG----GVPKNGGYDYPLRGRKDTLWE 293
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKG 373
GG+RGVA + L+Q+ L H++DW PTL + AG ++ D LDG + W+ ++ G
Sbjct: 294 GGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAGGSLDEDEDLDGYDVWESISHG 353
Query: 374 AKTKRSEILHNIDNVDNP--------QKYYAALRVDDLK----------YVAGTDNNGQS 415
++ R E+LHNID ++ P LRV D+K ++ D NG
Sbjct: 354 VESPRKELLHNIDTINIPPGDGSLGFSTTGIGLRVGDMKLLMAVPNISYFIPPEDRNGSV 413
Query: 416 DEWYGDTDN-----EIDKYSPKEVLYSKAGI--TFNALKTKLQIKQKH---AADPKAN 463
D WY ++N E+ Y+ Y K + + T+LQ++ +H A P AN
Sbjct: 414 D-WYIHSNNKVPMVEVALYNITADPYEKHDLHDKLPDVVTRLQLRVEHYRKTAVPPAN 470
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 40/65 (61%), Gaps = 1/65 (1%)
Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA-DPARWNN 559
V+ L+NIT DP E+++L + D++ +L+ ++ Y+ T VPP NKP D A A+ N
Sbjct: 426 VEVALYNITADPYEKHDLHDKLPDVVTRLQLRVEHYRKTAVPPANKPKDPYARQVAKQNG 485
Query: 560 IWVPW 564
W PW
Sbjct: 486 AWTPW 490
>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
Length = 534
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 184/369 (49%), Positives = 242/369 (65%), Gaps = 15/369 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A + PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS
Sbjct: 40 SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TG+Y IH+G+QH +I+ +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ P
Sbjct: 99 LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
T RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A VIA H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE+VGNV AL+ HG+ N++ +F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
+ SP LKQ S EL HI+DWLPTL AG N T LDG N W +++G + R
Sbjct: 330 TGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 389
Query: 379 SEILHNIDN 387
E+LHNID
Sbjct: 390 VELLHNIDQ 398
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 475 LFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSHFPPLDPRCDP-KSTGVWSPW 533
>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
Length = 534
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 184/369 (49%), Positives = 242/369 (65%), Gaps = 15/369 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A + PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS
Sbjct: 40 SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TG+Y IH+G+QH +I+ +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ P
Sbjct: 99 LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
T RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A VIA H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE+VGNV AL+ HG+ N++ +F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
+ SP LKQ S EL HI+DWLPTL AG N T LDG N W +++G + R
Sbjct: 330 TGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 389
Query: 379 SEILHNIDN 387
E+LHNID
Sbjct: 390 VELLHNIDQ 398
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 475 LFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSHFPPLDPRCDP-KSTGVWSPW 533
>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
Length = 491
Score = 358 bits (919), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 187/405 (46%), Positives = 253/405 (62%), Gaps = 26/405 (6%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
L+ KPHI++I+ADD G+ND+ +HGS +I TPN+D LA G+ L +YVQ +CTP+R
Sbjct: 16 LSKCQTTKPHILLIIADDFGYNDIGYHGS-EIKTPNLDKLAGEGVKLENYYVQPICTPTR 74
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
S LM+G+Y IH G+QHGVI +P GLPL +LPQ LKE GY+THA+GKWHLGF++E Y
Sbjct: 75 SQLMSGRYQIHTGLQHGVIRPPQPNGLPLDSAILPQKLKEVGYSTHAVGKWHLGFYKEEY 134
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
PT RGFDSH+GY G +DY+ H F G D+R NM N T G+YST L+ + A
Sbjct: 135 LPTNRGFDSHFGYLTGAEDYFKHD--RCFGAMCGTDLRDNMNPANYT-GVYSTHLFAQKA 191
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
V+ HN KP+FLYL +VHA P Q P++ ++ I D +RRTYAGMVS +D
Sbjct: 192 AEVVNNHNTDKPLFLYLPFQSVHA-----PLQVPEQYTKPYMHIQDKQRRTYAGMVSAMD 246
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E++GN+ +K GM N++++F DNG IH+ G+N PLRG K + W+GG+ GV
Sbjct: 247 EAIGNLTDVFKKRGMWNNTLMVFSTDNGGQ---IHAG-GNNFPLRGWKDSLWEGGVHGVG 302
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKT-KRS 379
+ LK+ VS +L H+SDW PTL + G +N T LDG++QW + +G+ R
Sbjct: 303 LVHGQMLKRKGSVSRDLIHVSDWFPTLISMVGGTLNGTKPLDGMDQWKTINEGSSPGPRQ 362
Query: 380 EILHNIDNVDNP-----------QKYYAALRVDDLKYVAGTDNNG 413
ILHNID + P + AA+R+ D K V G NG
Sbjct: 363 SILHNIDPMYPPVGQRVPSSPFDNRIRAAVRMGDWKLVTGFPGNG 407
Score = 43.9 bits (102), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT+DP E N+++ + ++ + + LA Y T VP D +A+P W PW
Sbjct: 431 LFNITEDPNEHNDVSGTYQHVVINMLELLAQYNKTAVPCRYPAMDSKANPKYHGGYWGPW 490
>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
Length = 516
Score = 355 bits (912), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 220/565 (38%), Positives = 307/565 (54%), Gaps = 63/565 (11%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F +T L FL A K+PHI+ ILADD GW DV +H S+I TP +D L+ G+
Sbjct: 6 FLIITGLFLSFPGFL---AAKQPHIVFILADDFGWYDVGYH-LSEIRTPILDKLSSGGVR 61
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +YVQ LCTPSR+ LMTG+Y IH GMQH +I +P+ +PL EKLLPQ +KEAGYATH
Sbjct: 62 LENYYVQPLCTPSRNQLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATH 121
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYD----HSCKATFEPYQGLDMRHNMQ 183
+GKWHLG +++ PT RGFDS+ GY G +DYY H A LD+R
Sbjct: 122 MVGKWHLGMYKKDCLPTRRGFDSYLGYLTGSEDYYTHIRCHPISALNLTRCALDLREAEA 181
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
V G YST+L+++ A+++I +H ++P+FLY+A AVHA P Q P+ VA +
Sbjct: 182 VARSYKGTYSTELFSQRAVSIIEKHTSTEPLFLYVAFQAVHA-----PLQVPERYVAPYS 236
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
I D RR+YAGMVS +DE+VGN+ AL+K G+ EN++++F DNG + + GSN
Sbjct: 237 FIQDHSRRSYAGMVSAMDEAVGNITLALQKRGLWENTVLVFSTDNGGQTL----SGGSNW 292
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LD 362
PLRG K + W+GG+RGV + SP L+Q +S +L HISDWLPTL AG N T LD
Sbjct: 293 PLRGRKWSLWEGGVRGVGFVASPLLEQPGTISRQLIHISDWLPTLVGLAGGSTNGTKPLD 352
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVD-DLKYVAGTDNNGQSDEWYGD 421
G + W+ ++KG + R E+LHNID + N R L VAG +++ +
Sbjct: 353 GFDVWNTISKGFASPRLELLHNIDPLYNDIAPCPGTRRPLTLAQVAG--GASRTNSGFNV 410
Query: 422 TDNEIDKYSPKEVL--YSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
+ + +YS ++L Y I F P N SD + E
Sbjct: 411 SIHAAIRYSNWKLLTGYPGCDIWF--------------PRPGHNGSD------STPSEAE 450
Query: 480 LREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKST 539
L++ R L+NI DP E+N ++ ++ +L +L ++ +
Sbjct: 451 LQKPVR-------------------LYNIEKDPEERNEVSAQFPAVVDRLLARLYHHQKS 491
Query: 540 MVPPGNKPFDKRADPARWNNIWVPW 564
VP D + DP W PW
Sbjct: 492 AVPINFPDDDPKCDPGP-TGAWGPW 515
>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
Length = 520
Score = 353 bits (906), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 191/426 (44%), Positives = 258/426 (60%), Gaps = 31/426 (7%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F L C L F + A KP+I+ ILADD GWND+ +HGS I TPN+D LA G+
Sbjct: 21 FLLLVCVLSGGKTFAQS-ASGKPNIVFILADDYGWNDIGYHGSV-IRTPNLDRLAAEGVK 78
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +Y+Q +CTPSRS LMTG+Y IH G+QH +I +P GLPL E LPQ LKE GY+TH
Sbjct: 79 LENYYIQPICTPSRSQLMTGRYQIHFGLQHSIIWPPQPSGLPLDEVTLPQRLKEGGYSTH 138
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKA---TFEP-YQGLDMRHNMQ 183
+GKWHLGF++E YTP RGFD+ YG+ G +++Y H F P + GLD+R +
Sbjct: 139 IVGKWHLGFYKEEYTPLHRGFDTFYGFLTGSENHYSHRNSGGMPGFRPGWNGLDLRDQDR 198
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
G YST L+ + AI VIA+ +KSKPMFLYL AVHA P QAP + ++ +
Sbjct: 199 PVTDQNGTYSTHLFAKKAIEVIAQQDKSKPMFLYLPFQAVHA-----PLQAPQKYISMYR 253
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
I+D RR YA M + +DE+VGN+ AL+++G+ +N++++F DNG G G+N
Sbjct: 254 HINDYNRRMYAAMTTAMDEAVGNITDALKQYGLWDNTVLVFSTDNG----GQVMAGGNNW 309
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LD 362
PLRG K + W+GG+RGV + SP +K + S L HISDW PTL AG N T LD
Sbjct: 310 PLRGWKGSLWEGGIRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLAGGSTNGTKPLD 369
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDN------PQKY---------YAALRVDDLKYVA 407
G + W+ +++G + R EILHNID + P ++ +AA+R D K +
Sbjct: 370 GHDVWEAISEGKPSPRKEILHNIDPLFRTILYLPPHQWGDDTFNTSMHAAIRSGDWKLLT 429
Query: 408 GTDNNG 413
G NG
Sbjct: 430 GYPGNG 435
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 38/60 (63%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI +DP E+ +L++ +++ L +KLA YK + VP P D RA+PA ++W PW
Sbjct: 460 LFNIREDPEERTDLSKKYPLVVQDLLEKLAAYKKSSVPIFYPPPDPRANPALHGDLWSPW 519
>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
Length = 519
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 182/393 (46%), Positives = 248/393 (63%), Gaps = 27/393 (6%)
Query: 4 ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
+ ++ + T + N A T A ++PH++ ILADD GW D+ +HGS +I TPN+D L+
Sbjct: 6 GKAVYWIIASTFIVNLA--QTLAARQPHVVFILADDFGWYDIGYHGS-EIRTPNLDKLSA 62
Query: 64 NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
G+ L +YVQ LCTPSR+ LMTG+Y IH GMQH +I +P+ +PL EKLLPQ +KEAG
Sbjct: 63 GGVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAG 122
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ--------- 174
YATH +GKWHLG +++ PT RGFD++ GY G +DY+ H F YQ
Sbjct: 123 YATHMVGKWHLGMYKKDCLPTRRGFDTYLGYLTGSEDYFTH-----FRCYQSPSLNLSRC 177
Query: 175 GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA 234
LD+R +V G+YST+L ++ AI++I H KP+F+Y+A AVHA P Q
Sbjct: 178 ALDLRDGEEVATGYKGVYSTELLSQRAISIIERHISQKPLFMYVALQAVHA-----PLQV 232
Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
P+ V + I D RR YAGMVS +DE+VGN+ AL+K G+ N++++F DNG +
Sbjct: 233 PERYVTPYSFIKDTNRRLYAGMVSAMDEAVGNITMALQKEGLWNNTVLVFSTDNGGQTL- 291
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
+ GSN PLRG K + W+GG+RGV + SP LKQ V+ EL HISDWLPTL AG
Sbjct: 292 ---SGGSNWPLRGRKWSLWEGGVRGVGFVASPLLKQPGTVNRELIHISDWLPTLVGLAGG 348
Query: 355 EINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
T LDG + W+ ++KG + R E+LHNID
Sbjct: 349 STTGTKPLDGFDVWNAISKGFASPRLELLHNID 381
>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
Length = 429
Score = 350 bits (897), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 178/366 (48%), Positives = 236/366 (64%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS+ I TP +DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 41 ASRPPHLVFVLADDLGWNDVGFHGSA-IRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH G+QH +IL +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ PT
Sbjct: 100 TGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTR 159
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++TE A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERA 219
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D RR YAGM S +D
Sbjct: 220 TTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMD 274
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL + G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 275 EAVGNVTAALERRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LK+ + EL HISDWLPTL AG N T LDG + W+ +++G+ + R E
Sbjct: 331 FVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTNGTKPLDGFDVWNTISEGSPSPRME 390
Query: 381 ILHNID 386
+LHNID
Sbjct: 391 LLHNID 396
>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
Length = 532
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 182/367 (49%), Positives = 239/367 (65%), Gaps = 15/367 (4%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
+A + PH++ +LADDLGWNDV FHGSS I TP++DALA G+ L+ +YVQ LCTPSRS L
Sbjct: 39 SAGRPPHLVFLLADDLGWNDVGFHGSS-IRTPHLDALAAGGVRLDNYYVQPLCTPSRSQL 97
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
++G+Y IH G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 98 LSGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPT 157
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEA 200
RGFD+++GY G +DYY H + LD R +V +YST+++TE
Sbjct: 158 RRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTER 217
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A +IA H KP+FLYLA +VH EP Q P+E + + I D RR YAGMVS +
Sbjct: 218 ATALIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIEDKNRRHYAGMVSLM 272
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE+VGNV AAL+K G+ +N++ +F DNG + G+N PLRG K T W+GG+RGV
Sbjct: 273 DEAVGNVTAALKKRGLWDNTVFIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGV 328
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRS 379
+ SP LKQ + EL HISDWLPTL AG N T LDG + W +++G+ + R
Sbjct: 329 GFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWKTVSEGSPSPRV 388
Query: 380 EILHNID 386
E+LHNID
Sbjct: 389 ELLHNID 395
>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 574
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 214/551 (38%), Positives = 295/551 (53%), Gaps = 81/551 (14%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+PHI+ IL DD G+ND+ +H + I TP +D LA G+ L +YVQ +CTPSRS LMTG
Sbjct: 44 KQPHIVFILIDDQGFNDIGYHNPT-IKTPTLDKLAAEGVRLENYYVQPICTPSRSQLMTG 102
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LP LP+ L++AGY+TH +GKWHLGF+R+ PT +G
Sbjct: 103 RYQIHTGLQHSIIRPSQPSCLPSHMDTLPERLRQAGYSTHLVGKWHLGFYRKACLPTRKG 162
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FD+ +G G D+Y++ SC G D+ V G YST L+T+ A ++
Sbjct: 163 FDTFFGSLTGSVDHYNYLSCDGPG--VCGYDLHDGEGVAWGQEGKYSTTLFTQRARKILE 220
Query: 207 EHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
HN + KP+FL L+ AVH P Q P + + D+++ RR A MVS +DE+V
Sbjct: 221 SHNPTEKPLFLLLSLQAVHT-----PLQTPKSYIYPYRDMANIARRKLAAMVSTVDEAVR 275
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
NV ALRK+G NS++++ DNGA F GSN PLRG K T W+GG+RGVA + S
Sbjct: 276 NVTYALRKYGFYRNSVIIYSTDNGAQPF----TGGSNWPLRGRKGTYWEGGIRGVAFVHS 331
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ ++VS L HI+DW PTL AG I+ + LDG + W +++G + R EILHN
Sbjct: 332 PLLKRRRRVSKALLHITDWFPTLVGLAGGNISQSRGLDGFDVWPTISEGRDSPRQEILHN 391
Query: 385 IDNVDN--------PQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
ID + AA+RV D K + G +G + P +VL
Sbjct: 392 IDPLHKHVTSESLWDTSIQAAIRVGDWKLLTGYPGHGD--------------WVPPQVLP 437
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH 496
+ G + D R IL+ K
Sbjct: 438 TLPGRWW----------------------DLERNILSPYK-------------------- 455
Query: 497 CNSTVKPC-LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
NST K LFNIT DPCE+++LA+ + D++ QL +LA Y T VP P D RADP+
Sbjct: 456 -NSTYKNIWLFNITADPCERHDLADQRPDVVLQLLARLAYYNQTAVPVYFPPDDPRADPS 514
Query: 556 RWNNIWVPWYD 566
+ WVPW +
Sbjct: 515 QHEGAWVPWVN 525
>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
Length = 598
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 182/411 (44%), Positives = 253/411 (61%), Gaps = 30/411 (7%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
+ ++ KP+I+ ILADD GWND+ +HGS I TPN+D LA G+ L +YVQ LC+PSR
Sbjct: 114 IQESSSGKPNIVFILADDYGWNDIGYHGSV-IRTPNLDRLAAEGVKLENYYVQPLCSPSR 172
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
LMTG+Y I G+QH +I +P GLPL E LPQ LKE GY+TH +GKWHLGF+++ Y
Sbjct: 173 CQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGFYKQDY 232
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATF----EPYQGLDMRHNMQVDNKTIGIYSTDLY 197
TPT RGFD+ YGY G +DY+ H K + + GLD+R + G YST L+
Sbjct: 233 TPTHRGFDTFYGYLTGAEDYWTHRQKGGLPGQPQTWSGLDLRDQNRPVTDQNGTYSTHLF 292
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
AI +IA+ +K+KPMFL+L+ AVH +P QAP+E+++++ ISD RR YA M
Sbjct: 293 ANKAIEIIAQQDKNKPMFLFLSFQAVH-----DPLQAPEEDISRYSHISDTNRRVYAAMT 347
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
+ +D++VGNV AL+++G+ +N++++F DNG G G N PLRG K + W+GG+
Sbjct: 348 TIMDQAVGNVTRALKQYGLWDNTVLIFSTDNG----GRVDRGGINWPLRGWKGSLWEGGV 403
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKT 376
RGV + SP +K + S L HISDW PTL A N T LDG + W+ ++ G +
Sbjct: 404 RGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLASGSTNGTKPLDGHDVWEAISDGKPS 463
Query: 377 KRSEILHNID----NVDNPQKY-----------YAALRVDDLKYVAGTDNN 412
R EILHNID V +P+ + +AA+R D K + G N
Sbjct: 464 PRREILHNIDPMFHTVPSPRPHQWGDRVFNTSVHAAIRSGDWKLLTGYPGN 514
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 38/60 (63%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI +DP E+ +L++ ++++L +KLA Y T VP FD +A+PA NIW PW
Sbjct: 538 LFNIREDPEERTDLSQKHPGVVQELLEKLARYNRTAVPVFYPSFDPQANPALHGNIWGPW 597
>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
Length = 533
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 178/366 (48%), Positives = 236/366 (64%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS+ I TP +DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 41 ASRPPHLVFVLADDLGWNDVGFHGSA-IRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH G+QH +IL +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ PT
Sbjct: 100 TGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTR 159
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++TE A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERA 219
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D RR YAGM S +D
Sbjct: 220 TTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMD 274
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL + G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 275 EAVGNVTAALERRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LK+ + EL HISDWLPTL AG N T LDG + W+ +++G+ + R E
Sbjct: 331 FVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTNGTKPLDGFDVWNTISEGSPSPRME 390
Query: 381 ILHNID 386
+LHNID
Sbjct: 391 LLHNID 396
>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
Length = 520
Score = 347 bits (891), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 179/364 (49%), Positives = 238/364 (65%), Gaps = 15/364 (4%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PH++++LADDLGWNDV FHGS ++ TP++DALA G+ L+ +Y Q LCTPSRS L+TG
Sbjct: 30 QPPHLVLVLADDLGWNDVGFHGS-RLRTPHLDALAAGGVRLDNYYTQPLCTPSRSQLLTG 88
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ PT RG
Sbjct: 89 RYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRG 148
Query: 148 FDSHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
FD+++GY G +DY+ H KA LD R +V + +YS +++ AI+
Sbjct: 149 FDTYFGYLLGSEDYFSHEHCVFIKALNVTRCALDFRDGEEVATEYKNMYSANIFANRAIS 208
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+IA H KP+FLYLA +VH EP Q P+E + + I D RR YAGMVS +DE+
Sbjct: 209 LIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIRDKNRRLYAGMVSLMDEA 263
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VGNV AAL HG+ N++ +F DNG + G+N PLRG K T W+GG+RGV +
Sbjct: 264 VGNVTAALESHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGIRGVGFV 319
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEIL 382
SP LKQ + EL HISDWLPTL AG T +LDG N W+ +++G+ + R E+L
Sbjct: 320 ASPLLKQKGVRTRELIHISDWLPTLVKLAGGSTQGTKALDGFNVWETISEGSPSPRVELL 379
Query: 383 HNID 386
HNID
Sbjct: 380 HNID 383
>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
Length = 528
Score = 347 bits (891), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 179/366 (48%), Positives = 241/366 (65%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS L+
Sbjct: 36 AAPPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH+G+QH +I+ +P +PL EKLLPQ LK+AGYATH +GKWHLG +R+ PT
Sbjct: 95 TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+ A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+IA H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS LD
Sbjct: 215 TTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 269
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AL+ G+ N++++F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 270 EAVGNVTKALKSRGLWNNTVLIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGAG 325
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ S EL HI+DWLPTL AG + T LDG + W+ +++G+ + R E
Sbjct: 326 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPLDGFDVWETISEGSPSPRVE 385
Query: 381 ILHNID 386
+L NID
Sbjct: 386 LLLNID 391
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 469 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 527
>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
Length = 633
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 203/565 (35%), Positives = 306/565 (54%), Gaps = 38/565 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ P II+ + DD+GW+DVSFH + QIPTPNIDA+ + ++LN HYVQA CTPSR AL+TG
Sbjct: 55 RAPDIILFIVDDMGWSDVSFHANGQIPTPNIDAMCSDAVLLNSHYVQASCTPSRGALLTG 114
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
KYPI IG+Q VI G L L +LLPQYL++ GYATH +GKWHLGF+ E Y P RG
Sbjct: 115 KYPIKIGLQEYVIQPGRQEALHLKHRLLPQYLRDLGYATHLVGKWHLGFYAEDYLPENRG 174
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS YG++ G YY+HS + + G D N + D G Y+TD+ T+ ++I
Sbjct: 175 FDSFYGFYNGAGTYYNHSA-SDADGRIGYDWHLNKESDPDAHGKYATDIITQRVKHLIQS 233
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE--------RRTYAGMVSR 259
+ KPMFL ++H+A H G+ + D + I DPE R YAGM+
Sbjct: 234 RDPEKPMFLMISHMAPHGGDNEDELFEVDRQW-----IEDPEIAHIMVESRTKYAGMIRA 288
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GSNHPLRGMKSTPWDGGM 317
LD++ G + AL++ ML+NS++LF +DNG PS S K SN PLRG K + W+GG+
Sbjct: 289 LDDAFGQTVVALKEAEMLQNSVILFTSDNGGPSIVDESYKTGASNWPLRGQKFSMWEGGV 348
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAK 375
R + +WSP LK + ++ L+HI+DWLPT AG E+ D +DG + WD + +
Sbjct: 349 RVSSFLWSPLLKSQRYINKHLYHITDWLPTFYKLAGGELRDLPIDIDGHDIWDSIDRNTD 408
Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGT---DNNGQSDEWYGDTDNEIDKYSPK 432
+ R +L N D A+R + K + G + + D WY +D +
Sbjct: 409 SPRKNLLINYDTGGTEN---FAVRRGNFKLMHGKPPPEVDPWYDSWYATAGGTMDDFMTL 465
Query: 433 EVLYSKAGITFNALKTKLQIKQKHAADPKANSSDAL--RTILTDEKILELREFA---RVR 487
+ K+ +T++ L + +++ +A R + + L+ R + ++
Sbjct: 466 QKAMKKS-VTYDILSRRPDFEERRDDWREAGKVKCKINRGMADSPRGLDYRTYTVDDMLK 524
Query: 488 CNYDNKGAHC-NSTVKPCLFNITDDPCEQNNLAESQTD-LLKQLEDKLAIY---KSTMVP 542
+ C NS +PCLF++ +DPCE + + + ++++L + Y K V
Sbjct: 525 ASVIKNDEICTNSNEQPCLFDLLEDPCELRQYSGADREQIIRELSEIAEQYLRLKEAAVA 584
Query: 543 PGNKPF---DKRADPARWNNIWVPW 564
P + D ADP ++N W+P+
Sbjct: 585 PWSSRMEHTDADADPQKFNGTWMPY 609
>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
Length = 491
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 208/546 (38%), Positives = 301/546 (55%), Gaps = 67/546 (12%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PH++++LADDLGW DV +HGS+ I TP +DAL G+ L ++Y Q LCTPSRS L++G+Y
Sbjct: 1 PHLVLVLADDLGWGDVGWHGSA-IRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRY 59
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
IH G+QH +I +P LPL EKLLP+ L+EAGY TH +GKWHLG +R+ PT RGFD
Sbjct: 60 QIHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYRKECLPTHRGFD 119
Query: 150 SHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
+++GY G +DYY H KA LD R +V +YST+L+TE AI+VI
Sbjct: 120 TYFGYLLGSEDYYTHDRCVFIKAKNVTRCALDFRDGEEVATGFKNVYSTNLFTERAIDVI 179
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A H KP+FLYLA +VH EP + P++ V + I D +RR YAGMV+ +DE+VG
Sbjct: 180 ANHKTEKPLFLYLAFQSVH-----EPLEVPEKYVKPYSSIKDVKRRHYAGMVTLMDEAVG 234
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL+ +G+ +N++ +F DNG + G+N PLRG K T W+GG+RGV + S
Sbjct: 235 NLTDALKTYGLWDNTVFVFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGVGFVAS 290
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ S EL HISDWLPTL AG N T LDG + W+ +++G + R E+LHN
Sbjct: 291 PLLKRKGVESHELIHISDWLPTLVHLAGGHTNGTKPLDGFDVWNTISEGRPSPRVELLHN 350
Query: 385 IDN--VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
ID VD+ + L +D + + ++ D ++ + + ++ ++L G +
Sbjct: 351 IDPMFVDD----FPCLGIDKRTSLPQSSSSPWDDTFFNISIHAAIRHGKWKLLTGYPGCS 406
Query: 443 F----NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
+L + QI+ ++DP
Sbjct: 407 HWFPPPSLYNESQIQ---SSDP-------------------------------------- 425
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
ST + LF+I DP E+ +++E ++++L +L Y VP D DPA
Sbjct: 426 STKRLWLFDIVHDPEEKYDVSEKYPHVVEKLLSRLQYYHKRSVPVFYPDEDPACDPAA-T 484
Query: 559 NIWVPW 564
W PW
Sbjct: 485 GAWGPW 490
>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
Length = 431
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 177/353 (50%), Positives = 231/353 (65%), Gaps = 15/353 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A + PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS
Sbjct: 40 SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TG+Y IH+G+QH +I+ +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ P
Sbjct: 99 LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
T RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A VIA H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE+VGNV AL+ HG+ N++ +F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLT 371
+ SP LKQ S EL HISDWLPTL AG N T LDG N W ++
Sbjct: 330 TGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTNGTKPLDGFNMWKTIS 382
>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
Length = 484
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 200/538 (37%), Positives = 290/538 (53%), Gaps = 84/538 (15%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KP+I++ILADDLGWNDV ++ + +I TPN+D LA NG+ILN Y + C+PSR+AL+TG
Sbjct: 26 QKPNILLILADDLGWNDVGWN-NPEIKTPNLDRLASNGVILNASYALSTCSPSRTALLTG 84
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+YP +G+QHGV+ +G+P+GLPL LLPQ LK GY+THAIGKWHLGF R YTPTFRG
Sbjct: 85 RYPFKLGLQHGVVKKGKPYGLPLNITLLPQKLKHLGYSTHAIGKWHLGFCRWEYTPTFRG 144
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS YG++ G +DYY A +G D R N +V I YST Y A+ +I
Sbjct: 145 FDSFYGFYSGSEDYYKRKTAAI----RGYDFRMNTKVFKPKIKKYSTLDYGRRAVKIIQA 200
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H +++P+FLY+ AVH+ P Q P K+ +I D RR ++GMVS +D ++G
Sbjct: 201 HKRTEPLFLYMPFQAVHS-----PLQVPKSFEFKYRNIVDRNRRIFSGMVSAMDSAIGMT 255
Query: 268 IAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
I AL++ ML N++++F++DNG P FG G+N PLRG K+T W+GG R + +W
Sbjct: 256 IKALKQRNMLTNTMIVFLSDNGGTPFFG-----GNNWPLRGSKATLWEGGTRVPSFVWGK 310
Query: 327 WLKQTQK-VSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
L Q S+E+ H DW PTL + AG + +DGV+QW +L++ ++R E ++ I
Sbjct: 311 MLLQKAGYTSNEMIHAVDWFPTLVSLAG-GTPEKDVDGVDQWQMLSQRQPSRRKEFVYGI 369
Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNA 445
+++ AA+R K + GT G WY + + K
Sbjct: 370 NSITG----RAAIRRGKYKLIEGTA--GHYSGWYPAPSSRVVK----------------- 406
Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCL 505
+K Q+ ++H NST+ L
Sbjct: 407 MKRMKQLNKRH----------------------------------------VNSTL---L 423
Query: 506 FNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
+NI DP E N+L+ L + + +L YK+ V N ++ + P W W P
Sbjct: 424 YNIQVDPIESNDLSAKYPHLTQFMLRRLNRYKAQEVEAFNPRKERYSQPKFWGGNWSP 481
>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
Length = 535
Score = 345 bits (885), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 213/548 (38%), Positives = 292/548 (53%), Gaps = 75/548 (13%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
H++ +LADDLGW+DV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+TG+Y
Sbjct: 48 HLVFVLADDLGWHDVGFHGS-RIRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT RGFD+
Sbjct: 107 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
++GY G +DYY H + LD R +V +YST+++TE A +I+
Sbjct: 167 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIS 226
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H KP+FLYLA +VH EP Q P+E + + I D RR YAGMVS +DE+VGN
Sbjct: 227 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRRYYAGMVSLMDEAVGN 281
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V AAL+ HG+ N++ +F DNG + G+N PLRG K + W+GG+RGV + SP
Sbjct: 282 VTAALKSHGLWNNTVFVFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASP 337
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
LK+ S EL HISDWLPTL AG T LDG + W +++G+ + R E+LHNI
Sbjct: 338 LLKRKGVKSRELVHISDWLPTLVGLAGGSTKGTKPLDGFDVWRTISEGSPSPRMELLHNI 397
Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNA 445
D +V + GQS +P + S G + +
Sbjct: 398 ----------------DPNFVDISPCPGQS-------------LAPAKDDSSHPG--YFS 426
Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC- 504
T L +H + K+L + C + H N +V P
Sbjct: 427 FNTSLHAAIRHG----------------NWKLLT--GYPGCGCWFPPPSQH-NVSVIPSS 467
Query: 505 --------LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
LF+I DP E+++L+ ++KQL +L Y VP D R DP +
Sbjct: 468 DPPTKTLWLFDIDQDPEERHDLSRDHPHVVKQLLSRLQFYHKHSVPVYFPAQDPRCDP-K 526
Query: 557 WNNIWVPW 564
W PW
Sbjct: 527 GTGAWGPW 534
>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
Length = 431
Score = 344 bits (883), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 176/351 (50%), Positives = 230/351 (65%), Gaps = 15/351 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS L+
Sbjct: 42 ATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH+G+QH +I+ +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
VIA H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS +D
Sbjct: 221 TTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AL+ HG+ N++ +F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 276 EAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGTG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLT 371
+ SP LKQ S EL HI+DWLPTL AG N T LDG N W ++
Sbjct: 332 FVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTIS 382
>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 537
Score = 344 bits (883), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 210/568 (36%), Positives = 295/568 (51%), Gaps = 64/568 (11%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
ALT T+L + K PHI+ I+ADD GW DV +H S+ I TPN+D LA G+ L
Sbjct: 20 ALTVTVLICTGLIKGATGKPPHIVFIVADDYGWFDVGYHNST-IKTPNLDLLASRGVKLE 78
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
+YVQ +C+PSRS LMTG+Y IH G+QH VI+ +P LPL E LPQ LKE+GYATH +
Sbjct: 79 NYYVQPICSPSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNETTLPQKLKESGYATHLV 138
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ------GLDMRHNMQ 183
GKWHLGF++ P RGFDS +GY G+QDY+ H +F + G+D N +
Sbjct: 139 GKWHLGFYKNECMPLQRGFDSSFGYLSGMQDYWTHFRSGSFPGFPEGNHWLGIDFWDNNR 198
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
V + G YS ++TE A VI +HN ++P+FLYL +VH P Q P++ + +
Sbjct: 199 VAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHG-----PLQVPEKYMKPYA 253
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
D R+TYAGMV+ +DE+VG V+ +L++ G+ +++++F DNG G G+N
Sbjct: 254 HFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNG----GTPGKSGNNW 309
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDTS 360
PLRG K+T W+GG+ GV I P + + VS HISDW PTL AG +
Sbjct: 310 PLRGTKNTLWEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDWFPTLIEGVAGGNTAGLA 369
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNID---NVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
LD N W+ +TKG + R E+LHNID D+P Y
Sbjct: 370 LDSYNMWNSITKGTPSPRKELLHNIDPYIRADHPFGY----------------------- 406
Query: 418 WYGDTDNEIDKYSPKEVLYSKAGITFNA-LKTKLQIKQKHAADPKANSSDALRTILTDEK 476
D E D P LY K F+ ++ +++ + S +
Sbjct: 407 ---GYDEETDMIYPLSGLYPKMAAEFSTDMRAAIRVGEWKLLTGFPGRSGWYPPPEWNIH 463
Query: 477 ILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
++ E A NK LFNIT DPCE+N+L+ +++ +L +L Y
Sbjct: 464 PIDPVEAA-------NKVT--------WLFNITADPCEKNDLSYQHPEVVTELVGRLEAY 508
Query: 537 KSTMVPPGNKPFDKRADPARWNNIWVPW 564
T VP +ADPA + +W PW
Sbjct: 509 YKTSVPVRFPNQTVKADPALHHGVWGPW 536
>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
(N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
intestinalis]
Length = 522
Score = 344 bits (882), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 208/571 (36%), Positives = 306/571 (53%), Gaps = 79/571 (13%)
Query: 13 CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
C L+F L + +KP+I++I+ADDLG+NDV +H + I TPNI+ LA +G+IL +Y
Sbjct: 11 CFLVFVQV-LYVSGHQKPNILLIVADDLGFNDVGYH-NPDIYTPNINKLAKDGVILESYY 68
Query: 73 VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
VQ +CTPSRS LMTG+Y IH G+QH VI +P LP+ E +LPQ LKEAGY THA+GKW
Sbjct: 69 VQPICTPSRSQLMTGRYQIHTGLQHSVIFAPQPNCLPVDEIILPQKLKEAGYTTHAVGKW 128
Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF------EPYQGLDMRHNMQVDN 186
HLGF+++ PT RGFD+ YGY+ G +DYY A F G D N + +
Sbjct: 129 HLGFYKKECLPTSRGFDTFYGYYCGAEDYYTKQVHANFHFGNKTRRVSGFDFHDNSRTEW 188
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
+ G YS+ LY + A+ +I HN S P+F+YL +VH P Q P + + ++ I
Sbjct: 189 EANGTYSSYLYRDRAVRIIKSHNSSIPLFMYLPFQSVHF-----PLQVPAKYIKRYRHIK 243
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPL 305
D +RRT++ MV+ +DE++G+V+ AL+ G+ ++++V+F DNG + FG G+N PL
Sbjct: 244 DRKRRTFSAMVTAMDEAIGSVVDALKWKGIWQDTLVVFTTDNGGQTLFG-----GNNWPL 298
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGV 364
RG K++ W+GG+RGV + ++ + S+EL HISDW PTL AG + T LD
Sbjct: 299 RGRKASLWEGGVRGVGLVRGYGIRDKGRSSNELVHISDWFPTLLYIAGANASGTKPLDSY 358
Query: 365 NQWDVLT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
N WD + K + R E+LHNID + + Y
Sbjct: 359 NVWDAIANKDVVSPRVELLHNIDPLFT--RPYTP-------------------------- 390
Query: 424 NEIDKYSPKEVLY-SKAGITFNALKTKLQIK--QKHAADPKANSSDALRTILTDEKILEL 480
+PK + Y +K G+ L+ K + H DP + D T
Sbjct: 391 ------TPKLIPYPNKHGVNTTKGHAALRYKHWKLHTGDPGYDKWDLPPT---------- 434
Query: 481 REFARVRCNYDNKGAHCNSTVKPC-------LFNITDDPCEQNNLAESQTDLLKQLEDKL 533
+ CN + NS K LF+I +DP E+ NLA+ ++ L K+
Sbjct: 435 ----KSYCNLSGMHTNTNSKYKSTHLVPSVHLFDIENDPQERKNLAQIYPKVVDLLLGKI 490
Query: 534 AIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
A+Y + VP P+D +++P+ +W PW
Sbjct: 491 AMYNKSAVPINFPPYDPKSNPSLHGGVWEPW 521
>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
Length = 547
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 207/552 (37%), Positives = 306/552 (55%), Gaps = 40/552 (7%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
A L +A KPH+I+I+ADD+GW+D S HGS + TPN+D L +G+ L+++Y ALC+P
Sbjct: 13 AALCVSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSP 72
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG--FF 137
+R+A++TGKY +GMQ + E G+PL E+L+ QYL++AGY T +GKWH+G FF
Sbjct: 73 ARTAVLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFF 132
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE--PYQGLDMRHNMQVDNKTIGIYSTD 195
++ PT+RGF++H+G G DYY+++ + + P GL + ++Q D T G Y TD
Sbjct: 133 EQL--PTYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITD 189
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
+YTE + +I HN S+P++L L H A H GN QAP EEV + RR +A
Sbjct: 190 VYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAA 249
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV +LD+S+G ++A L K GMLEN+I+ F DNGAP+ G+ +N GSN+PLRG+K +PW+G
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEG 309
Query: 316 GMRGVAAIW-----SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
G+RG A IW +P KV H +DW+PTL A G +I LDG+ W +
Sbjct: 310 GIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEKI-PAGLDGIPMWSHI 368
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
+ + R+EI ID+ N ++++ + K V GT + S YG+ I
Sbjct: 369 IENKPSPRTEIF-EIDDYFN----HSSVTLGRHKLVKGTIDESLSKH-YGEDLRGIIGTP 422
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P K KL+ D KA S I D ++ R+ A V C
Sbjct: 423 PD-------------YKQKLR-------DSKAWESLETIGIPLDADVMADRDEAIVTCGN 462
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP-PGNKPFD 549
S CL++I +DPCE +L+E L + L +L ++ ++P G D
Sbjct: 463 VVPKPCSPSAESWCLYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREGQYVAD 522
Query: 550 KRADPARWNNIW 561
++ P +N W
Sbjct: 523 PKSAPKYFNYTW 534
>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
Length = 514
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 209/544 (38%), Positives = 296/544 (54%), Gaps = 63/544 (11%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PH++++LADDLGW DV +HGS+ I TP +DAL G+ L ++Y Q LCTPSRS L++G+Y
Sbjct: 24 PHLVLVLADDLGWGDVGWHGSA-IRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRY 82
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
IH G+QH +I +P LPL EKLLP+ LK+AGY TH +GKWHLG +R+ PT RGFD
Sbjct: 83 QIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFD 142
Query: 150 SHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
+++GY G +DYY H KA LD R +V +YST+L+TE AI++I
Sbjct: 143 TYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLI 202
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A H KP+FLYLA +VH EP + E + + I D +RR YAGMVS +DE+VG
Sbjct: 203 ANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLMDEAVG 257
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL+++G+ N++++F DNG + G+N PLRG K T W+GG+RGV + S
Sbjct: 258 NLTDALKEYGLWNNTVLVFSTDNGGQTMA----GGNNWPLRGRKWTLWEGGVRGVGFVAS 313
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
P LKQ S EL HISDWLPTL AG N T LDG + W +++G + R E+LHN
Sbjct: 314 PLLKQKGVESHELIHISDWLPTLVHLAGGHTNGTKPLDGFDVWKTISEGRPSPRVELLHN 373
Query: 385 IDN--VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVL--YSKAG 440
ID VD+ Y L +D+ + +++ + + + + ++ ++L Y G
Sbjct: 374 IDPMFVDD----YPCLGIDNRTSLPQINSSPWDNTLFNISVHAAIRHGKWKLLTGYPGCG 429
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
F P +S +++ K L L
Sbjct: 430 HWF--------------PPPSLSSESQIQSYDPPMKRLWL-------------------- 455
Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
F+I DP E+ L+E ++K+L +L Y VP D + DPA +
Sbjct: 456 -----FDIVHDPEEKYELSEKYPHVVKKLLSRLQYYYKRSVPVFYPDEDPQCDPAA-TGV 509
Query: 561 WVPW 564
W PW
Sbjct: 510 WGPW 513
>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
Length = 484
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 203/538 (37%), Positives = 289/538 (53%), Gaps = 82/538 (15%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHI+ I+ADDLGWNDV +H + + TPNID LA GLILNQ Y Q LC+PSR A MTG
Sbjct: 24 KSPHILFIVADDLGWNDVGYHNPA-MKTPNIDKLAREGLILNQTYFQPLCSPSRHAFMTG 82
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YP G+QH VI+ +P PL K LPQ LK+ GYATH +GKWHLG TPT+RG
Sbjct: 83 YYPYRAGLQHLVIMPWQPVCSPLNMKFLPQRLKDLGYATHMVGKWHLGMCNWDCTPTYRG 142
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS +G++ DYY H Y+ LD R N + G YST +T A ++I +
Sbjct: 143 FDSFFGFYHAKADYYSHIS------YKYLDYRDNEKPVKNLNGTYSTFTFTSRAQDIIKK 196
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
HN S+P+FLY+A H EP Q P + + + RR Y+GMVS LDE++GN+
Sbjct: 197 HNSSQPLFLYMAFPIPH-----EPLQVPQQYEDMYPNTMSKGRRQYSGMVSALDEAIGNI 251
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
LR+ GM ++++++F +DNG G + G+N+PLRG K T ++GG R V+ +
Sbjct: 252 TDQLRESGMYDDTLIVFTSDNG----GSPMDYGNNYPLRGAKFTVYEGGTRVVSFVHGAG 307
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
L++T + + H DW TL AAAG E D DGV+QW + + + R+E+++N D
Sbjct: 308 LQKTGEAYESMMHAVDWSDTLVAAAGGE-PDMRDDGVSQWTAIAQNSPGPRTELVYNFDP 366
Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALK 447
P++ +AA+RV D K + G G +WY
Sbjct: 367 DSAPEQGHAAIRVGDYKLIHGYP--GLFPDWY---------------------------- 396
Query: 448 TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
++ Q+H + T+ +LE+ E A + LF
Sbjct: 397 ---KVDQQHEFN-------------TEPSVLEMLEHAE----------------RYQLFY 424
Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF--DKRADPARWNNIWVP 563
I DDP EQN+LA+S+ D++K+L +L + +PP N P D ++DP ++N+W P
Sbjct: 425 IKDDPTEQNDLAQSKPDIVKKLSQRLLMLTKNALPP-NYPLVPDPKSDPTHFDNVWSP 481
>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
Length = 535
Score = 342 bits (878), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 178/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDVSFHGS+ I TP++D LA G++L+ +Y Q LCTPSRS L+
Sbjct: 43 ADRPPHLVFVLADDLGWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLL 101
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 102 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H A + LD R QV +YST+++TE A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCALIDSLNVTRCALDFRDGEQVATGYKNMYSTNIFTERA 221
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 222 TALITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMD 276
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ HG+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 277 EAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVG 332
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A G LDG + W +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTKGTKPLDGFDVWKTISEGSPSPRKE 392
Query: 381 ILHNID 386
+LHNID
Sbjct: 393 LLHNID 398
>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
Length = 503
Score = 342 bits (878), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 177/364 (48%), Positives = 238/364 (65%), Gaps = 15/364 (4%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PH++ +LADDLGWNDV FHGS ++ TP++DALA G+ L+ +Y Q LCTPSRS L+TG
Sbjct: 14 RPPHLVFVLADDLGWNDVGFHGS-RLRTPHLDALAAGGVQLDNYYTQPLCTPSRSQLLTG 72
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P +PL EKLLPQ L+EAGYATH +GKWHLG +++ PT RG
Sbjct: 73 RYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLQEAGYATHMVGKWHLGMYQKECLPTRRG 132
Query: 148 FDSHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
FD+++GY G +DYY H KA LD R +V + +YST+++T A +
Sbjct: 133 FDTYFGYLLGSEDYYTHEHCVFIKALNVTRCALDFRDGEEVATQYKNLYSTNIFTNRATS 192
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+IA H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +DE+
Sbjct: 193 LIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHLYAGMVSLMDEA 247
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VGNV AAL+ HG+ N++ +F DNG + G+N PLRG K T W+GG+RGV +
Sbjct: 248 VGNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGVGFV 303
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEIL 382
SP LKQ + EL HISDWLPTL AG + + T LDG + W +++G+ + R E+L
Sbjct: 304 ASPLLKQKGVKNRELIHISDWLPTLVKLAGGDTHGTKPLDGFDVWRTISEGSPSPRMELL 363
Query: 383 HNID 386
HNID
Sbjct: 364 HNID 367
>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
Length = 534
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 176/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 TTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ HG+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 276 EAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDMWKTISEGSPSPRVE 391
Query: 381 ILHNID 386
+LHNID
Sbjct: 392 LLHNID 397
>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
Length = 532
Score = 341 bits (875), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 177/369 (47%), Positives = 236/369 (63%), Gaps = 15/369 (4%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+ A + PH++ +LADDLGWNDV FHGSS I TP++DALA G++L+ +Y Q LCTPSRS
Sbjct: 37 GSGAGRPPHLVFVLADDLGWNDVGFHGSS-IRTPHLDALAAGGVLLDNYYTQPLCTPSRS 95
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
L+TG+Y I +G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+
Sbjct: 96 QLLTGRYQIRMGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECL 155
Query: 143 PTFRGFDSHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
PT RGFD+++GY G +DYY H A LD R +V +YST+++T
Sbjct: 156 PTRRGFDTYFGYLLGSEDYYSHERCTLINALNVTRCALDFRDGEEVATGYKNMYSTNIFT 215
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
E A +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS
Sbjct: 216 ERATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVS 270
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
+DE++GNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+R
Sbjct: 271 LMDEAIGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVR 326
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTK 377
GV + SP LKQ + EL HISDWLPTL AG N T LDG + W+ +++G+ +
Sbjct: 327 GVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWETISEGSPSP 386
Query: 378 RSEILHNID 386
R E+LHNID
Sbjct: 387 RVELLHNID 395
>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
Length = 521
Score = 341 bits (874), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 176/364 (48%), Positives = 235/364 (64%), Gaps = 15/364 (4%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PH++ +LADDLGW DV +HGS QI TP +DAL+ G+ L ++Y+Q LCTPSRS L+TG+Y
Sbjct: 33 PHLVFVLADDLGWQDVGWHGS-QIRTPVLDALSAAGVRLERYYIQPLCTPSRSQLLTGRY 91
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
IH G+QH +I +P +PL EKLLP+ LKEAGY TH +GKWHLG +R PT RGFD
Sbjct: 92 QIHTGLQHEIIWPCQPSCVPLDEKLLPELLKEAGYVTHMVGKWHLGMYRNECLPTRRGFD 151
Query: 150 SHYGYWQGLQDYYDHS-CKATFEP---YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
+++GY G +DYY H C LD+R ++ + +YST+++T+ A ++I
Sbjct: 152 TYFGYLLGSEDYYSHEHCVPIVSKNVTRCALDLRDGEKIADGFKNMYSTNVFTQRAQDLI 211
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A H KP+FLYLA +VH EP Q P++ V + I D +RR YAGMVS LDE+VG
Sbjct: 212 ANHQPEKPLFLYLALQSVH-----EPLQVPEKYVEPYSFIKDEKRRKYAGMVSILDEAVG 266
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL+ G+ N++++F DNG + G+N PLRG K T W+GG+RG + S
Sbjct: 267 NITEALKTRGLWNNTVLIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGAGFVAS 322
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
P LKQ S EL HISDWLPTL AG NDT LDG + W L++G + R E+LHN
Sbjct: 323 PLLKQRGVESHELIHISDWLPTLVKLAGGSTNDTKPLDGFDVWKTLSEGEPSPRKELLHN 382
Query: 385 IDNV 388
ID +
Sbjct: 383 IDPI 386
>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
Length = 414
Score = 340 bits (873), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 176/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL AG IN T LDG + W +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHINGTKPLDGFDVWKTISEGSPSPRVE 391
Query: 381 ILHNID 386
+LHNID
Sbjct: 392 LLHNID 397
>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 340 bits (873), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 206/549 (37%), Positives = 305/549 (55%), Gaps = 40/549 (7%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
A L +A KPH+I+I+ADD+GW+D S HGS + TPN+D L +G+ L+++Y ALC+P
Sbjct: 13 AALCVSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSP 72
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG--FF 137
+R+A++TGKY +GMQ + E G+PL E+L+ QYL++AGY T +GKWH+G FF
Sbjct: 73 ARTAVLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFF 132
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE--PYQGLDMRHNMQVDNKTIGIYSTD 195
++ PT+RGF++H+G G DYY+++ + + P GL + ++Q D T G Y TD
Sbjct: 133 EQL--PTYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITD 189
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
+YTE + +I HN S+P++L L H A H GN QAP EEV + RR +A
Sbjct: 190 VYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAA 249
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV +LD+S+G ++A L K GMLEN+I+ F DNGAP+ G+ +N GSN+PLRG+K +PW+G
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEG 309
Query: 316 GMRGVAAIW-----SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
G+RG A IW +P KV H +DW+PTL A G +I LDG+ W +
Sbjct: 310 GIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEKI-PAGLDGIPMWSHI 368
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
+ + R+EI ID+ N ++++ + K V GT + S YG+ I
Sbjct: 369 IENKPSPRTEIF-EIDDYFN----HSSVTLGRHKLVKGTIDESLSKH-YGEDLRGIIGTP 422
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P K KL+ D KA S I D ++ R+ A V C
Sbjct: 423 PD-------------YKQKLR-------DSKAWESLETIGIPLDADVMADRDEAIVTCGN 462
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP-PGNKPFD 549
S CL++I +DPCE +L+E L + L +L ++ ++P G D
Sbjct: 463 VVPKPCSPSAESWCLYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREGQYVAD 522
Query: 550 KRADPARWN 558
++ P +N
Sbjct: 523 PKSAPKYFN 531
>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 206/549 (37%), Positives = 305/549 (55%), Gaps = 40/549 (7%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
A L +A KPH+I+I+ADD+GW+D S HGS + TPN+D L +G+ L+++Y ALC+P
Sbjct: 13 AALCVSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSP 72
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG--FF 137
+R+A++TGKY +GMQ + E G+PL E+L+ QYL++AGY T +GKWH+G FF
Sbjct: 73 ARTAVLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFF 132
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE--PYQGLDMRHNMQVDNKTIGIYSTD 195
++ PT+RGF++H+G G DYY+++ + + P GL + ++Q D T G Y TD
Sbjct: 133 EQL--PTYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITD 189
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
+YTE + +I HN S+P++L L H A H GN QAP EEV + RR +A
Sbjct: 190 VYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAA 249
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV +LD+S+G ++A L K GMLEN+I+ F DNGAP+ G+ +N GSN+PLRG+K +PW+G
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEG 309
Query: 316 GMRGVAAIW-----SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
G+RG A IW +P KV H +DW+PTL A G +I LDG+ W +
Sbjct: 310 GIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEKI-PAGLDGIPMWSHI 368
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
+ + R+EI ID+ N ++++ + K V GT + S YG+ I
Sbjct: 369 IENKPSPRTEIF-EIDDYFN----HSSVTLGRHKLVKGTIDESLSKH-YGEDLRGIIGTP 422
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P K KL+ D KA S I D ++ R+ A V C
Sbjct: 423 PD-------------YKQKLR-------DSKAWESLETIGIPLDADVMADRDEAIVTCGN 462
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP-PGNKPFD 549
S CL++I +DPCE +L+E L + L +L ++ ++P G D
Sbjct: 463 VVPKPCSPSAESWCLYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREGQYVSD 522
Query: 550 KRADPARWN 558
++ P +N
Sbjct: 523 PKSAPKYFN 531
>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
Length = 481
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 184/415 (44%), Positives = 255/415 (61%), Gaps = 23/415 (5%)
Query: 8 FFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
F L CT+L + + A + HI+ I+ADDLGWND+ FH + I TPNID LA G
Sbjct: 2 FVQLLCTVLVIINLCDDVSAAGRPRHIVFIVADDLGWNDIGFH-NPDIITPNIDKLAREG 60
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
L+LN HYVQ LC+PSR+A M+G YP G+QH VILE +P LPL +LPQ LKE GYA
Sbjct: 61 LLLNHHYVQPLCSPSRAAFMSGYYPFKTGLQHSVILENQPVCLPLNITILPQKLKELGYA 120
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
TH +GKWH GF TPT+RGFDS +GY+ ++DYY H + LD R+N
Sbjct: 121 THIVGKWHNGFCSWNCTPTYRGFDSFFGYYGAMEDYYTHVIRGF------LDYRNNTTPV 174
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
G YST +T+ A ++I HN+S+P+FLYLA+ AV Y P + P + A + +I
Sbjct: 175 WTDNGTYSTLRFTDVATDIIERHNQSQPLFLYLAYQAV-----YGPIEVPAKYEAMYPNI 229
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
RR ++GMVS LDE+VGNV LR+ G+++++++LF ADNG G G+N+PL
Sbjct: 230 KSENRRKFSGMVSALDEAVGNVTKTLRQRGLMDDTLILFTADNG----GGVDESGNNYPL 285
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA-GIEINDTSLDGV 364
RG K T ++GG R V ++ L++T V + H DWLPTL AAA G ++D DG+
Sbjct: 286 RGSKFTVYEGGTRAVGFMYGSGLQKTGTVFDGMIHAVDWLPTLTAAAGGTPVSDR--DGI 343
Query: 365 NQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
N W L+ + + R+E+++N D+ P + +AA+RV D K + G G +WY
Sbjct: 344 NLWPSLSTASPSPRTEVVYNYDSHPQPVQGHAAIRVGDYKLIDGYP--GPFPDWY 396
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 3/61 (4%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF--DKRADPARWNNIWV 562
LFN+ DDP E+N+L+ + D++K+L +LA YK VPP N P D ++PA + N+W
Sbjct: 419 LFNLKDDPNERNDLSNFRPDMVKKLAARLAWYKKQAVPP-NFPETPDDLSNPALYGNVWS 477
Query: 563 P 563
P
Sbjct: 478 P 478
>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 516
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 185/450 (41%), Positives = 261/450 (58%), Gaps = 38/450 (8%)
Query: 8 FFALTCTLLFNDA---FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+FA+ L F L +P+ HI+ ILADD G++D+ +H S I TPN+D LA
Sbjct: 15 WFAMDFRLFFTTLIVLLLRPVSPEPTHIVFILADDYGFHDIGYH-DSIIKTPNLDRLASE 73
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
G+ L +YVQ +CTP+RS LM+G+Y IH G+QHG+I +P LP+ E +PQ LKE+GY
Sbjct: 74 GVKLENYYVQPICTPTRSQLMSGRYQIHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGY 133
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ- 183
ATH +GKWHLG +++ PT RGFD+ +GY G +DYY H+ +++ + G+D R NMQ
Sbjct: 134 ATHIVGKWHLGMYKKECLPTERGFDTFFGYLTGSEDYYTHN--RSYDKFHGMDFRENMQI 191
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
V + G YST ++ E A N+I H+ P+FLYL AVH P Q PD+ +
Sbjct: 192 VQPQYNGQYSTHVFAEKAKNIIKSHDPQIPLFLYLPFQAVHG-----PLQVPDQYEKPYA 246
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
+I++ +RRTYAGMVS +DE+VGN+ L+ G+ +NS+++F DNG G+N
Sbjct: 247 NITNKQRRTYAGMVSCMDEAVGNITQTLKDTGLYDNSVIIFSTDNGGQVIA----GGNNW 302
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTS- 360
PLRG K + W+GGM GV + SP + + S EL H++DW PT+ AG +N T
Sbjct: 303 PLRGWKGSLWEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAGGSVNGTQP 362
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNID------NVDNPQKYY----------AALRVDDLK 404
LDG NQW+ ++ G K+ R E+LHNID N Y AALRV D K
Sbjct: 363 LDGYNQWETISNGKKSPRKELLHNIDPNYIALQTSNNNMIYRTDMFNASIRAALRVGDWK 422
Query: 405 YVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
+ G N W ++ I P ++
Sbjct: 423 ILTGDPGN---SSWIPPPESSITSPHPTDI 449
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 31/60 (51%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI +DP E +L+ + D + +L +KL Y T VP D DP + + W PW
Sbjct: 456 LFNIAEDPNEYKDLSLFRPDKVNELLEKLQQYYKTSVPVRYPDNDPACDPGKHSGAWGPW 515
>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
Length = 534
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL AG IN T LDG + W +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHINGTKPLDGFDVWKTISEGSPSPRVE 391
Query: 381 ILHNID 386
+LHNID
Sbjct: 392 LLHNID 397
>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
Length = 506
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/366 (48%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 14 ASRPPHLVFVLADDLGWNDVGFHGSC-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 72
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 73 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 132
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 133 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 192
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 193 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 247
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 248 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 303
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL AG IN T LDG + W +++G+ + R E
Sbjct: 304 FVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHINGTKPLDGFDVWKTISEGSPSPRVE 363
Query: 381 ILHNID 386
+LHNID
Sbjct: 364 LLHNID 369
>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
Length = 478
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 183/403 (45%), Positives = 244/403 (60%), Gaps = 29/403 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPHI+ ILADD GWND+ +HGS I TPN+D LA G+ L +YVQ +C+PSR LMTG+
Sbjct: 2 KPHIVFILADDYGWNDIGYHGSF-IKTPNLDRLASEGVKLENYYVQPICSPSREQLMTGR 60
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH VI P GLPL E LPQ LKE GY+TH +GKWHLGFFR+ Y P RGF
Sbjct: 61 YQIHYGLQHSVITHDRPHGLPLDEVTLPQKLKEIGYSTHLVGKWHLGFFRQEYLPLRRGF 120
Query: 149 DSHYGYWQGLQDYYDHSCKATF----EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
D+ YG+ G +DY+ H + Y GLD+R + G YST L+ AI++
Sbjct: 121 DTFYGFLTGGEDYWSHRRPNVYSTDASEYHGLDLRDQDEPVLDQNGTYSTHLFQRKAIDI 180
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
IA H+++KPMFLYL+ AVHA P QAP+ + K+ + + R YA MV+ +DE+V
Sbjct: 181 IAHHDRNKPMFLYLSFQAVHA-----PLQAPEGYIEKYKSVQNILIRKYAAMVTAMDEAV 235
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
GNV AL+ G+ +N++++F DNGA + GSN PLRG K+T W+GG+RGV +
Sbjct: 236 GNVTDALKGSGLWDNTVLIFSTDNGA-----RRSAGSNWPLRGWKNTLWEGGIRGVCFVT 290
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
S L++ S L HISDW PTL A +D+ LDG + W+ ++ GAK+ R EIL
Sbjct: 291 SNLLERKGTKSDALIHISDWFPTLIHLAQGSTHDSKPLDGFDVWNSISTGAKSTREEILL 350
Query: 384 NIDNVDNPQKY-------------YAALRVDDLKYVAGTDNNG 413
NID + + YAA+R D K + G + G
Sbjct: 351 NIDPLKCKIGFTCASGGGNFNTLTYAAIRSRDWKLLTGFQDQG 393
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 36/60 (60%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI +DP E+ +L+E D+++ L KL Y T VPP P D RA+P +++ PW
Sbjct: 418 LFNIRNDPQERTDLSEKYPDVVQDLLMKLKTYNRTAVPPFWPPRDPRANPVLHGDVFGPW 477
>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
Length = 556
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/369 (47%), Positives = 234/369 (63%), Gaps = 24/369 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ A + PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS
Sbjct: 71 SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 129
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TG+Y IH+G+QH +I+ +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ P
Sbjct: 130 LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 189
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
T RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+
Sbjct: 190 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 249
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A VIA H K + ++P Q P+E + + I D RR YAGMVS
Sbjct: 250 RATTVIANHPPEK--------------SVHDPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 295
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE+VGNV AL+ HG+ N++ +F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 296 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 351
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
+ SP LKQ S EL HI+DWLPTL AG N T LDG N W +++G + R
Sbjct: 352 TGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 411
Query: 379 SEILHNIDN 387
E+LHNID
Sbjct: 412 VELLHNIDQ 420
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 497 LFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSHFPPLDPRCDP-KSTGVWSPW 555
>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
Length = 408
Score = 338 bits (867), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 234/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 43 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H + KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 222 IALITNHPREKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392
Query: 381 ILHNID 386
+LHNID
Sbjct: 393 LLHNID 398
>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
Length = 522
Score = 338 bits (867), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 236/366 (64%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A +PHI+ +LADDLGWNDV +H S I TP++DAL+ G+ L +Y Q LCTPSRS L+
Sbjct: 30 ATGQPHIVFVLADDLGWNDVGYH-DSNIFTPHLDALSAQGVRLENYYTQPLCTPSRSQLL 88
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH G+QH +I +P +PL EKLLP+ L+EAGY TH +GKWHLG FR+ PT
Sbjct: 89 TGRYQIHTGLQHQIIWPCQPSCIPLDEKLLPELLREAGYVTHMVGKWHLGMFRKECLPTR 148
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+ +GY G +DYY H + + LD R + +YST+++TE A
Sbjct: 149 RGFDTFFGYLLGSEDYYTHKRCVHIDALKVTRCALDFRDGEDIAAGYENMYSTNVFTERA 208
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+N+IA H KP+FLYLA +VH EP Q P+E + + I + R+ YAGMV+ +D
Sbjct: 209 VNLIANHPAQKPLFLYLALQSVH-----EPLQVPEEYLQPYDFIQNKNRQHYAGMVTLMD 263
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AL+K+G+ N++ +F DNG + G+N PLRG K T W+GG+RGV
Sbjct: 264 EAVGNVTRALQKYGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGIRGVG 319
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LK+ + SSEL HISDWLPTL AG N T LDG + W +++G+ + R E
Sbjct: 320 FVSSPLLKRKRVKSSELIHISDWLPTLVNLAGGRTNGTKPLDGFDVWKTISEGSPSPRME 379
Query: 381 ILHNID 386
+LHNID
Sbjct: 380 LLHNID 385
>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 374
Score = 337 bits (865), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 172/354 (48%), Positives = 228/354 (64%), Gaps = 4/354 (1%)
Query: 21 FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
FL + K ++ + GW+DVS HGSSQI TPNID LA G+ L +YV +CTP+
Sbjct: 23 FLKHDSADKFDGCVMSEIEQGWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPT 82
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
RSA+MTGK+PIH GMQH I EPWGL L EK + Q+LK GY+THA+GKWHLG+F E
Sbjct: 83 RSAIMTGKHPIHTGMQHDTIGADEPWGLGLDEKTMAQHLKSLGYSTHAVGKWHLGYFAED 142
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
Y PT RGFDS +GY+ G DYY H + T + G D+ N +V G YST+++TE
Sbjct: 143 YIPTRRGFDSFFGYYNGRGDYYTH--EDTEGGFGGYDLHKNGEVHWPDFGQYSTEIFTEE 200
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A +I HN S+P+FLYLAH AVHAG + +AP + F +I RR +A MV+
Sbjct: 201 AQQIIKTHNASQPLFLYLAHQAVHAGVYGKDLVEAPHKYYQMFPNIKTEGRRMFAAMVAG 260
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
LD+SVGN+ L+ G+ NSI++F DNG P+ G N SN PLRG K T W+GG+RG
Sbjct: 261 LDDSVGNISQTLQDAGLYNNSIIVFTTDNGGPTNGYDGNHASNWPLRGCKHTLWEGGVRG 320
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTK 372
A + SP +++ ++ S + H+ DWLPTL AG + + T+LDG +QWD L+K
Sbjct: 321 TAFVNSPLIEKPRRFSDRMMHVCDWLPTLYGVAGGDPKELTNLDGFDQWDALSK 374
>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
Length = 415
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 43 ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392
Query: 381 ILHNID 386
+LHNID
Sbjct: 393 LLHNID 398
>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
Length = 413
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 41 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDMWKTISEGSPSPRIE 390
Query: 381 ILHNID 386
+LHNID
Sbjct: 391 LLHNID 396
>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 332 FVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391
Query: 381 ILHNID 386
+LHNID
Sbjct: 392 LLHNID 397
>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
Length = 415
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 43 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 333 FVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392
Query: 381 ILHNID 386
+LHNID
Sbjct: 393 LLHNID 398
>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
Length = 533
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 41 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDMWKTISEGSPSPRIE 390
Query: 381 ILHNID 386
+LHNID
Sbjct: 391 LLHNID 396
>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
Length = 535
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 43 ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392
Query: 381 ILHNID 386
+LHNID
Sbjct: 393 LLHNID 398
>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391
Query: 381 ILHNID 386
+LHNID
Sbjct: 392 LLHNID 397
>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
Length = 413
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 41 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 390
Query: 381 ILHNID 386
+LHNID
Sbjct: 391 LLHNID 396
>gi|350400025|ref|XP_003485710.1| PREDICTED: arylsulfatase B-like, partial [Bombus impatiens]
Length = 301
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 157/230 (68%), Positives = 184/230 (80%), Gaps = 3/230 (1%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KPHIIIILADDLGWNDVSFHGS QIPTPNIDALAYNG+ILN HYV ALCTPSRSALMTG
Sbjct: 72 QKPHIIIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTG 131
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
K PIH+GMQH V+ EP GLPL+EKLLPQYL+E GY THA+GKWHLG+F++ YTPT+RG
Sbjct: 132 KNPIHLGMQHSVLYPTEPRGLPLSEKLLPQYLQEIGYKTHAVGKWHLGYFKKQYTPTYRG 191
Query: 148 FDSHYGYWQGLQDYYDHSCK---ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
FDSH+GYW GL+DYY H + + + Y+G DMR N+ V T G Y+TDL+T A+ +
Sbjct: 192 FDSHFGYWNGLEDYYTHIAQEPDSQYNEYKGFDMRRNLTVAWDTAGKYATDLFTNEAVRL 251
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
I EH+ +PMFLYLAHLAVH GN + +APDEE+AKF I DPERR A
Sbjct: 252 INEHDTERPMFLYLAHLAVHKGNENQLLRAPDEEIAKFSYILDPERRIQA 301
>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 336 bits (861), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 332 FVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391
Query: 381 ILHNID 386
+LHNID
Sbjct: 392 LLHNID 397
>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 336 bits (861), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 43 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 333 FVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392
Query: 381 ILHNID 386
+LHNID
Sbjct: 393 LLHNID 398
>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
Length = 533
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 41 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 390
Query: 381 ILHNID 386
+LHNID
Sbjct: 391 LLHNID 396
>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 43 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392
Query: 381 ILHNID 386
+LHNID
Sbjct: 393 LLHNID 398
>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391
Query: 381 ILHNID 386
+LHNID
Sbjct: 392 LLHNID 397
>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 335 bits (860), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 41 ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLMKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 390
Query: 381 ILHNID 386
+LHNID
Sbjct: 391 LLHNID 396
>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
Length = 534
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASQPPHLVFLLADDLGWNDVGFHGS-RIHTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
I +I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVG 331
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391
Query: 381 ILHNID 386
+LHNID
Sbjct: 392 LLHNID 397
>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
Length = 599
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 206/552 (37%), Positives = 292/552 (52%), Gaps = 70/552 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+ TA +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 70 SVTALSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 128
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 129 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 188
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN G+YST +YT
Sbjct: 189 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GVYSTQMYT 244
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 245 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLS 299
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 300 CLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 354
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKT 376
R + + SP LK V EL HI+DW PTL + A +I++ LDG + W+ +++G ++
Sbjct: 355 RAIGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDENIQLDGYDVWETISEGLRS 414
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
R +ILHNID + K NG Y
Sbjct: 415 PRVDILHNIDPIYTKAK------------------NGSWAAGY----------------- 439
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
GI A+++ +++K SD + + F+ + N + N+
Sbjct: 440 ---GIWNTAIQSAIRVKHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 486
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P
Sbjct: 487 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPR 546
Query: 556 RWNNIWVPWYDE 567
+W PWY E
Sbjct: 547 LNGGVWGPWYKE 558
>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
Length = 598
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 205/549 (37%), Positives = 291/549 (53%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+T + +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STASTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPSRYFEHYRSIVNINRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
R +ILHNID + K NG L +
Sbjct: 415 RVDILHNIDPIYTKAK------------------NGS--------------------LAA 436
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERIT 486
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ Y T VP P D R++P
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQYNKTAVPVRYPPKDPRSNPRL 546
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 547 NGGVWGPWY 555
>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
Length = 502
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 175/366 (47%), Positives = 226/366 (61%), Gaps = 15/366 (4%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A PHI+ +LADDLGWNDV FHGS +I TP +D L+ G+ L +Y Q LCTPSRS L+
Sbjct: 9 ATSPPHIVFVLADDLGWNDVGFHGS-EILTPTLDFLSGQGVRLAGYYTQPLCTPSRSQLL 67
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
+G+Y IH G+QH +I +P PL +KLLP+ LKE GY TH +GKWHLG ++ PT
Sbjct: 68 SGRYQIHTGLQHQIIWPCQPHCHPLEDKLLPELLKERGYVTHMVGKWHLGMYKTDCLPTR 127
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFDS++GYW G +DYY H LD R YST L+T+ A
Sbjct: 128 RGFDSYFGYWTGGEDYYSHERCYLITTLNITRCALDFRDGEVPATDYQMKYSTHLFTDRA 187
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+++I HN KP+FLYLA+ AVH+ P Q PD+ + + I D RR YAGMVS LD
Sbjct: 188 VDLITNHNPEKPLFLYLAYQAVHS-----PLQVPDQYIEPYTSIHDKNRRLYAGMVSVLD 242
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+V NV AL+K G+ N++ +F DNG + + G+N PLRG K T W+GG+RGV
Sbjct: 243 EAVANVTNALKKQGLWNNTVFIFSTDNGGQTL----SGGNNWPLRGRKWTLWEGGIRGVG 298
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ S LKQ S EL HISDWLPTL AG N T LDG + W +++G+ + R E
Sbjct: 299 FVTSSLLKQKGVTSKELIHISDWLPTLVNLAGGSTNSTKPLDGFDVWKTISEGSPSPRVE 358
Query: 381 ILHNID 386
+LHNID
Sbjct: 359 LLHNID 364
>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
Length = 483
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 201/544 (36%), Positives = 288/544 (52%), Gaps = 80/544 (14%)
Query: 21 FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
FL + K PHI+ I+ADD GWNDV F S + TPNID LA +G+ILN YV +CTPS
Sbjct: 16 FLPYVSAKPPHIVFIVADDYGWNDVGFRNPSVL-TPNIDKLARSGMILNSSYVMPVCTPS 74
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R++ MTG+Y G+QH VIL + PL LPQ LKE GYATHAIGKWHLGF +
Sbjct: 75 RNSFMTGQYAFKSGLQHIVILPQQATCAPLNNTFLPQKLKELGYATHAIGKWHLGFCKWE 134
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
TPT+RGFD+ YGY+ G +DYY+ S G D R N N T G YST LY
Sbjct: 135 CTPTYRGFDTFYGYYNGAEDYYNLSIAG------GKDFRDNRTPVNAT-GEYSTILYARR 187
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK-FLDISDPERRTYAGMVSR 259
A ++I +H+ SKPM++YL +VH EP Q E + + + RRTY GMV+
Sbjct: 188 AESIIKDHDASKPMYMYLPFQSVH-----EPLQVSLESSSDIYSKVHTESRRTYLGMVTA 242
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE++GN+ AL+ GM +++++F ADNG G + G+N+PLRG K+T ++GG R
Sbjct: 243 MDEAIGNITKALKTKGMFNDTLIVFTADNG----GWITYAGNNYPLRGGKTTVFEGGTRA 298
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
A + ++++ V + H DW+PT+ +AAG D +LDG++QWD L ++KR+
Sbjct: 299 TAFVSGAGIQKSNYVYPGMIHAVDWMPTVLSAAG-GTPDPTLDGIDQWDSLRTAGESKRT 357
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
E ++NID++ +AA+R+ D K + G G +W Y P +V+
Sbjct: 358 EFIYNIDDLRPEICGHAAIRMGDYKLIDGYP--GAFQDW----------YKPDQVV---- 401
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
SD + D + + R F +D
Sbjct: 402 -------------------------SD----LYEDVSLNDTRPF------FD-------- 418
Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
K LFN+ DDP E +L++ D++K+L+ +L Y +VP + P +
Sbjct: 419 --KMMLFNLKDDPNEHVDLSKKHPDIVKKLKSRLEYYHKQIVPANFPKNSASSAPTNYGG 476
Query: 560 IWVP 563
W P
Sbjct: 477 FWTP 480
>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
Length = 573
Score = 331 bits (849), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 202/554 (36%), Positives = 289/554 (52%), Gaps = 90/554 (16%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+T
Sbjct: 44 PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 102
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT R
Sbjct: 103 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 162
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GFD+ G G DYY + +C G D+ V G YST LY + A +++
Sbjct: 163 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHIL 220
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A HN P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V
Sbjct: 221 ASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 275
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + S
Sbjct: 276 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 331
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHN
Sbjct: 332 PLLKKKRRTSRALVHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHN 391
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
ID + N ++ AA+RV + K + G D YGD +
Sbjct: 392 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 437
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P + L S G +N E++ +R+
Sbjct: 438 PPQTLASFPGSWWNL-----------------------------ERMASIRQAV------ 462
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
LFNI+ DP E+ +LA + D+++ L +LA Y T +P +
Sbjct: 463 -------------WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAANP 509
Query: 551 RADPARWNNIWVPW 564
RA P W PW
Sbjct: 510 RAHPDFNGGAWGPW 523
>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
Length = 503
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 204/551 (37%), Positives = 289/551 (52%), Gaps = 96/551 (17%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+I+ +LADD G++DV +HGS +I TP +DAL+ +G+ L +YVQ +CTP+RS LM+G+
Sbjct: 33 QPNIVFVLADDFGFHDVGYHGS-EIHTPTLDALSASGVRLENYYVQPICTPTRSQLMSGR 91
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QHG+I +P LP L LKE+GYATH +GKWHLGF+++ Y P RGF
Sbjct: 92 YQIHTGLQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNRGF 151
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D+++GY +DY++H+ + + LD+R N G YS L+T AI+V+ H
Sbjct: 152 DTYFGYLNAAEDYFNHNVP--WRQVRYLDLRDNNGPVRNETGQYSAHLFTGKAIDVVQSH 209
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
N SKP+FLYLA+ +VHA P + P++ K+ +I+D RRT+AGMVS LDE V N+
Sbjct: 210 NTSKPLFLYLAYQSVHA-----PLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLT 264
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
AL+ G+ N++++F DNG IH+ G+N+PLRG K++ W+GG GV + L
Sbjct: 265 QALKDKGLWNNTVLIFSTDNGGQ---IHAG-GNNYPLRGWKASLWEGGFHGVGFVSGGAL 320
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDN 387
K++ VS L H+SDW PTL AG +N T LDG NQWD ++ + R +LHNID
Sbjct: 321 KRSGAVSKGLIHVSDWFPTLVTLAGGNLNGTKPLDGFNQWDTISNETPSPREILLHNID- 379
Query: 388 VDNPQK------------YYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVL 435
+ PQK AA+RV D K + G NG W D + Y E+
Sbjct: 380 ILYPQKGVPLYSNTWDTRVRAAIRVGDYKLITGDPGNG---SWVPPPDGHL--YFVPEIQ 434
Query: 436 YSKAGIT--FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNK 493
S A FN ADP ++ L+ EK LE+ ++ ++N
Sbjct: 435 ESAAKNVWLFNI-----------TADPNEHND------LSSEKPLEVLRLLQILVQFNN- 476
Query: 494 GAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRAD 553
T VPP D R D
Sbjct: 477 ---------------------------------------------TAVPPRYPAPDPRCD 491
Query: 554 PARWNNIWVPW 564
PA ++W PW
Sbjct: 492 PALHGDVWGPW 502
>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
Length = 492
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 173/364 (47%), Positives = 231/364 (63%), Gaps = 15/364 (4%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q L TPSRS L+TG
Sbjct: 2 RPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLTG 60
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT RG
Sbjct: 61 RYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRG 120
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
FD+++GY G +DYY H + LD R +V +YST+++T+ AI
Sbjct: 121 FDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIA 180
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +DE+
Sbjct: 181 LITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEA 235
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV +
Sbjct: 236 VGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFV 291
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEIL 382
SP LKQ + EL HISDWLPTL A N T LDG + W +++G+ + R E+L
Sbjct: 292 ASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELL 351
Query: 383 HNID 386
HNID
Sbjct: 352 HNID 355
>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
Length = 569
Score = 331 bits (848), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 201/554 (36%), Positives = 290/554 (52%), Gaps = 90/554 (16%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+T
Sbjct: 44 PQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLT 102
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT R
Sbjct: 103 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 162
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GFD+ G G DYY + +C G D+ V G YST LY + A +++
Sbjct: 163 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHIL 220
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V
Sbjct: 221 ASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 275
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + S
Sbjct: 276 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 331
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ Q+ S L HI+DW PTL AG + LDG + W +++G + R+EILHN
Sbjct: 332 PLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHN 391
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
ID + N ++ AA+RV + K + G D YGD +
Sbjct: 392 IDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 437
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P + L + G +N E++ +R+
Sbjct: 438 PPQTLATFPGSWWNL-----------------------------ERMASVRQAV------ 462
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
LFNI+ DP E+ +LA + D+++ L +LA Y T +P +
Sbjct: 463 -------------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP 509
Query: 551 RADPARWNNIWVPW 564
RA P W PW
Sbjct: 510 RAHPEFNGGAWGPW 523
>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
Length = 569
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 201/554 (36%), Positives = 290/554 (52%), Gaps = 90/554 (16%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+T
Sbjct: 44 PQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLT 102
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT R
Sbjct: 103 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 162
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GFD+ G G DYY + +C G D+ V G YST LY + A +++
Sbjct: 163 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHIL 220
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V
Sbjct: 221 ASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 275
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + S
Sbjct: 276 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 331
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ Q+ S L HI+DW PTL AG + LDG + W +++G + R+EILHN
Sbjct: 332 PLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHN 391
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
ID + N ++ AA+RV + K + G D YGD +
Sbjct: 392 IDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 437
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P + L + G +N E++ +R+
Sbjct: 438 PPQTLATFPGSWWNL-----------------------------ERMASVRQAV------ 462
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
LFNI+ DP E+ +LA + D+++ L +LA Y T +P +
Sbjct: 463 -------------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP 509
Query: 551 RADPARWNNIWVPW 564
RA P W PW
Sbjct: 510 RAHPDFNGGAWGPW 523
>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
Length = 478
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 180/405 (44%), Positives = 240/405 (59%), Gaps = 33/405 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+I+ ILADD GW+D+ +HGS I TP +D LA G+ L +YVQ +C+PSR LMTG+Y
Sbjct: 1 PNIVFILADDYGWDDIGYHGSF-IQTPKLDRLAKEGVKLENYYVQPICSPSRCQLMTGRY 59
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
I G+QH VI P GLPL E LPQ LKE GY+T+ +GKWHLGFFR+ + P RGFD
Sbjct: 60 QIRYGLQHSVITSDRPHGLPLDEVTLPQKLKENGYSTYVVGKWHLGFFRKEHMPLQRGFD 119
Query: 150 SHYGYWQGLQDYYDHSCKATFE----PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
YGY G +DY+ H + + GLD+R + G YST L+ + AI +I
Sbjct: 120 KFYGYLTGGEDYWTHRRPNLYAKDPLAFHGLDLRDQDKPVLDQNGTYSTHLFAKKAIEII 179
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
H +SKPMFLYL AVH+ P +AP E V K+ D+ +P +TYA MVS +DE+VG
Sbjct: 180 QNHERSKPMFLYLPFQAVHS-----PLEAPTEYVEKYKDVKNPLVKTYAAMVSAMDEAVG 234
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
NV AL++ + +N++++F DNGA N GSN PLRG K+T W+GG+RGV + S
Sbjct: 235 NVTDALKRTRLWDNTVLIFSTDNGA-----RRNAGSNWPLRGWKNTLWEGGVRGVGFVTS 289
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
L++ ++ L HISDW PTL + N T LDG + WD ++ G + R EILHN
Sbjct: 290 KLLRKGKRKCDALIHISDWFPTLLRISRSVSNGTKPLDGFDIWDTISSGTPSPRKEILHN 349
Query: 385 I----------------DNVDNPQKYYAALRVDDLKYVAGTDNNG 413
I D++ NP YAA+R D K + G G
Sbjct: 350 IDPLHPKTDLSSLDSLHDHIFNPS-VYAAIRSGDWKLLTGDQGLG 393
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 38/60 (63%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI DP E+N+L+ D++++L ++LA Y T +PP D RA+PA +++ PW
Sbjct: 418 LFNIRSDPQERNDLSVQYPDVVQELLERLAAYNKTALPPFWPSRDPRANPALHGDLFGPW 477
>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 532
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 207/551 (37%), Positives = 294/551 (53%), Gaps = 47/551 (8%)
Query: 7 YF-FALTCTLLFNDAFL-NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
YF F ++ N A + +TA + PHII IL DDLGW D FHGS QI TPN+DALAY+
Sbjct: 3 YFKFQQIFIIVLNLAVVCKSTAQQPPHIITILIDDLGWTDTGFHGSDQIKTPNMDALAYS 62
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
G+ILN+HYV CTPSRSAL+TG YPI GMQ + G+ LPL+ KL P++LK GY
Sbjct: 63 GMILNRHYVLPSCTPSRSALLTGLYPIRTGMQGMPLKGGDVRNLPLSFKLKPEFLKNLGY 122
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKAT-----FEPYQGLDMR 179
TH +GKWHLG+ + P RGFDS +GY+ G DY+ T E + G D+
Sbjct: 123 RTHLVGKWHLGYRTINHLPNQRGFDSFFGYYNGYVDYFKFGHNQTVAGEKIEYFYGYDLH 182
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEE 238
N ++ Y+T L+T A +I HN+S+P++LY +HLA H G+ + P D +
Sbjct: 183 RNGEIYQTDKDTYATRLFTREAEKIIKNHNESEPLYLYFSHLATHTGDDDIGMEVPEDAD 242
Query: 239 VAK-FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH- 296
V K + I RR +AG + LD+SVG V+ AL++ ML+NSI+L M+DNG + +
Sbjct: 243 VNKTYGHIKHYGRRAFAGCLEELDKSVGEVMEALKEKNMLDNSIILIMSDNGGHTVSVDL 302
Query: 297 -SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
N SN PL G K T ++GG+R VA IWSP L + ++ + HI+DWLPTL +AAG
Sbjct: 303 PPNWSSNWPLGGTKFTLFEGGVRSVALIWSPLLPKGV-INDDFIHITDWLPTLYSAAGGN 361
Query: 356 INDTSL-DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQ 414
+ + DG++QW+ L + + R IL NID+ + R LK G G
Sbjct: 362 PKELGIFDGIDQWEALKRNSAVARENILLNIDDKTGSEGVIIENRWKVLK--GGYKTYGS 419
Query: 415 SDEWYGDTDN--EIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTIL 472
D +G +N KY P +++ S+ G + LK + L
Sbjct: 420 YDFHWGKPENIHPNPKYDPNDIINSQTGQIVSNLK-------------------HYKNSL 460
Query: 473 TDEKILELREFARVRCN--YDNKGAHCNSTVKP---------CLFNITDDPCEQNNLAES 521
I LR+ + V C +N + ++P CLF+I +D CE N + +
Sbjct: 461 NHSSIETLRKKSTVTCQPLVNNIPMNDYGFIEPSPEVRCKDFCLFDIINDQCEVNPVENN 520
Query: 522 QTDLLKQLEDK 532
+ +E +
Sbjct: 521 ELIKFDNVESR 531
>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
Length = 569
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 200/556 (35%), Positives = 291/556 (52%), Gaps = 90/556 (16%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
+ P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L
Sbjct: 42 SPPQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQL 100
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+TG+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT
Sbjct: 101 LTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPT 160
Query: 145 FRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD+ G G DYY + +C G D+ V G YST LY + A +
Sbjct: 161 RRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASH 218
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
++A H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+
Sbjct: 219 ILANHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEA 273
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
V N+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ +
Sbjct: 274 VRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFV 329
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEIL 382
SP LK+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EIL
Sbjct: 330 HSPLLKRKRRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEIL 389
Query: 383 HNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK 428
HNID + N ++ AA+RV + K + G D YGD
Sbjct: 390 HNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD------- 435
Query: 429 YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC 488
+ P + L + G +N E++ +R+
Sbjct: 436 WIPPQTLATFPGSWWNL-----------------------------ERMASVRQAV---- 462
Query: 489 NYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF 548
LFNI+ DP E+ +LA + D+++ L +LA Y T +P
Sbjct: 463 ---------------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAE 507
Query: 549 DKRADPARWNNIWVPW 564
+ RA P W PW
Sbjct: 508 NPRAHPDFNGGAWGPW 523
>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 175/402 (43%), Positives = 244/402 (60%), Gaps = 32/402 (7%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HI+ ILADD G++D+ +H S I TPN+D LA G+ L +YVQ CTP+RS LM+G+Y
Sbjct: 15 HIVFILADDYGFHDIGYH-DSIIKTPNLDRLASEGVKLENYYVQPKCTPTRSQLMSGRYQ 73
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QHG+I +P LP+ E +PQ LKE+GYATH +GKWHLG +++ PT RGFD+
Sbjct: 74 IHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTERGFDT 133
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAINVIAEHN 209
+GY G +DYY H+ +++ + G+D R NMQ V + G YST ++ E A N+I H+
Sbjct: 134 FFGYLTGSEDYYTHN--RSYDKFHGMDFRENMQIVQPQYNGQYSTHVFAEKAKNIIKSHD 191
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
P+FLYL AVH P Q PD+ + +I++ +RRTYAGMVS +DE+VGN+
Sbjct: 192 PQIPLFLYLPLHAVHG-----PLQVPDQYEKPYTNITNKQRRTYAGMVSCMDEAVGNITQ 246
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
L G+ +NS+++F DNG G + G+N PLRG K + W+GGM GV + SP +
Sbjct: 247 TLEDAGLYDNSVIIFSTDNG----GQVTAGGNNWPLRGWKGSLWEGGMHGVGFVNSPLIN 302
Query: 330 QTQK--VSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
+ S EL H++DW PT+ AG ++ T LDG NQW+ ++ G K+ R E+LHNID
Sbjct: 303 HQARGTTSKELIHVTDWFPTIVHLAGGSVSGTQPLDGYNQWETISNGKKSPRKELLHNID 362
Query: 387 ------NVDNPQKYY----------AALRVDDLKYVAGTDNN 412
N Y AA+RV D K + G N
Sbjct: 363 PKRTTLQTSNNNMIYRKDMFDASIQAAMRVGDWKILTGNPGN 404
Score = 40.4 bits (93), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI +DP E N+L+ S+ D + +L + L Y T VP + DP + + W PW
Sbjct: 430 LFNIAEDPNEYNDLSLSRPDKVNELLEILQQYYKTSVPVHYPDDEPACDPDKNDGAWGPW 489
>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
Length = 599
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545
Query: 556 RWNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562
>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
Length = 599
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545
Query: 556 RWNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562
>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
Length = 598
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545
Query: 556 RWNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562
>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
Length = 569
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 205/578 (35%), Positives = 296/578 (51%), Gaps = 92/578 (15%)
Query: 3 WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
WA+ F A F+ + P PHII IL DD G++DV +HGS I TP +D LA
Sbjct: 22 WAKPSFVADGPGEAXXXXFIYPSRP--PHIIFILTDDQGYHDVGYHGS-DIETPTLDRLA 78
Query: 63 YNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
G+ L +Y+Q +CTPSRS L+TG+Y IH G+QH +I +P LPL + LP L+EA
Sbjct: 79 AKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSIIRPQQPNCLPLDQVTLPHKLQEA 138
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHN 181
GY+TH +GKWHLGF+R+ PT RGFD+ G G DYY + +C G D+
Sbjct: 139 GYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEG 196
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
V G YST LY + A +++A H+ +P+FLY+A AVH P Q+P E + +
Sbjct: 197 ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYR 251
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
+ + + RR YA MV+ +DE+V N+ AL+++G NS+++F +DNG +F + GS
Sbjct: 252 YRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGS 307
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-S 360
N PLRG K T W+GG+RG+ + SP LK+ Q+ S L HI+DW PTL AG +
Sbjct: 308 NWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADG 367
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKYV 406
LDG + W +++G + R+EILHNID + N ++ AA+RV + K +
Sbjct: 368 LDGYDVWPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLL 427
Query: 407 AGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSD 466
G D YGD + P + L + G +N
Sbjct: 428 TG-------DPGYGD-------WIPPQTLATFPGSWWNL--------------------- 452
Query: 467 ALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLL 526
E++ +R+ LFNI+ DP E+ +LA + D++
Sbjct: 453 --------ERMASVRQAV-------------------WLFNISADPYEREDLAGQRPDVV 485
Query: 527 KQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
+ L +LA Y T +P + RA P W PW
Sbjct: 486 RTLLARLAEYNRTAIPVRYPAENPRAHPDFNGGAWGPW 523
>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
Length = 517
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 171/366 (46%), Positives = 233/366 (63%), Gaps = 24/366 (6%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A PH++ +LADDLGWND+ FHGS I TP++DALA G++L+ +YVQ LCTPSRS L+
Sbjct: 36 AAPPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH+G+QH +I+ +P +PL EKLLPQ LK+AGYATH +GKWHLG +R+ PT
Sbjct: 95 TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H A E G LD+R + + IYST+++T+ A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+IA H K + ++P Q P+E + + I D RR YAGMVS LD
Sbjct: 215 TTLIANHPPEK--------------SVHDPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 260
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+VGNV AL+ G+ N++++F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 261 EAVGNVTKALKSRGLWNNTVLIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGAG 316
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
+ SP LKQ S EL HI+DWLPTL AG + T LDG + W+ +++G+ + R E
Sbjct: 317 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPLDGFDVWETISEGSPSPRVE 376
Query: 381 ILHNID 386
+L NID
Sbjct: 377 LLLNID 382
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 458 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 516
>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
Length = 596
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 203/556 (36%), Positives = 292/556 (52%), Gaps = 68/556 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 415 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 436
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIT 486
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546
Query: 557 WNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 547 NGGVWGPWYKEETKKK 562
>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 199/559 (35%), Positives = 297/559 (53%), Gaps = 73/559 (13%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
F+L LL A L ++A P+II+ + DDLGWND S+ GS I TPNID LA G+ L
Sbjct: 4 FSLVLFLLAFSAPLVSSA-NSPNIILFVIDDLGWNDTSYQGS-DIQTPNIDKLAEEGIRL 61
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
Q+YVQ LC+PSRSAL+ GKYP H+G+ HGVI G P+GL L E + +LK+ GY+THA
Sbjct: 62 KQYYVQPLCSPSRSALLAGKYPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGYSTHA 121
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
+GKW LG + +TPT+RGFD+ YGY+ +DYY H LD R+N
Sbjct: 122 VGKWDLGMHKWEFTPTYRGFDTFYGYYDADEDYYTHKVGGY------LDFRNNTDPVKDE 175
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
G YST L+T+A + I + S P F+Y A+ +VH+ P +APD + K P
Sbjct: 176 DGTYSTFLFTKAIEDAINAKSDS-PFFIYGAYQSVHS-----PLEAPDTYLEK-CHSPYP 228
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
R+ + GM+ LDE + N+ + L+ G+L+++I++ DNG G + SN PLRG
Sbjct: 229 NRKIFCGMMLALDEGISNITSLLQTKGLLDDTIIILTTDNG----GQTALGSSNWPLRGN 284
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA-AGIEINDTSLDGVNQW 367
K+T ++GG+RG++ +WS L+++ ++ + HI+DW PT+ AG+ ++ LDG N W
Sbjct: 285 KATVFEGGVRGISFVWSTKLRKSNYDNNAMMHITDWYPTIIEGIAGMTLDTKGLDGFNMW 344
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID 427
D + + R+EILH +D P KY D+ G Q D W
Sbjct: 345 DTINDDTPSPRTEILHQLD----PPKY------DERTQFIGQAALRQGD-W--------- 384
Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
KL I Q + ++ L + +L +
Sbjct: 385 ---------------------KLIIGQPNCSEK-----------LGPHTVYDLCPIGWIH 412
Query: 488 CNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKP 547
N + N ++ LFN+ DDP E+N L+ S +++ +L++ + Y +T + + P
Sbjct: 413 LNGTIEEPEVNPSLT-WLFNVKDDPNERNELSASHPEIVSKLKELIEAYNATHIVQMDPP 471
Query: 548 FDKRADPARWNNIWVPWYD 566
D R+DP ++ NIW PW D
Sbjct: 472 IDPRSDPKKFGNIWTPWLD 490
>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
Length = 382
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 165/375 (44%), Positives = 235/375 (62%), Gaps = 15/375 (4%)
Query: 12 TCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQH 71
T TL+F +L + KPH++ I+ADDLGWNDV F + Q+ TP++D LA G+ILN
Sbjct: 9 TFTLIF--VYLYGSQAAKPHVVFIVADDLGWNDVGFR-NPQVLTPHLDKLAKAGVILNSS 65
Query: 72 YVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
YVQ LC+PSR+ M+G +P H G+Q GVI P +P+ LPQ LKE GY+THA+GK
Sbjct: 66 YVQPLCSPSRNCFMSGYFPYHTGLQDGVIRPASPGFVPIKFTFLPQKLKELGYSTHAVGK 125
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
WHLGF YTPT+RGFD+ GY+ G +DYY H+ + ++ + G D+R N V + G
Sbjct: 126 WHLGFCNLKYTPTYRGFDTFVGYYIGAEDYYKHTRE--YDKFSGYDLRFNTSVYTEAKGK 183
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
YST ++ E A+++I H+ P++LYL AVHA P + P E + I D RR
Sbjct: 184 YSTRVFAERAVDIIKSHDTDTPLYLYLPFQAVHA-----PLEVPPEYENLYKHIHDLPRR 238
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
TY GM+S LDE+V N+ AL + G+++N +++F DNG P G +N PLRG K+T
Sbjct: 239 TYCGMISALDEAVANITNALEETGLIDNLLLVFTTDNGGPFNG----AANNLPLRGCKAT 294
Query: 312 PWDGGMRGVAAIWSP-WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
W+GG +G I+S LK+T +++ + H DW PT AG E D ++DG++Q+D+L
Sbjct: 295 LWEGGTKGAGFIYSKTLLKKTGYLNTGMMHAVDWYPTFVELAGGENTDPNMDGISQYDML 354
Query: 371 TKGAKTKRSEILHNI 385
G +KR E ++NI
Sbjct: 355 INGGPSKRKEFVYNI 369
>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
Length = 596
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 203/556 (36%), Positives = 292/556 (52%), Gaps = 68/556 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 415 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 436
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERIT 486
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546
Query: 557 WNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 547 NGGVWGPWYKEETKKK 562
>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
Length = 597
Score = 328 bits (840), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERI 485
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545
Query: 556 RWNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562
>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
Length = 596
Score = 328 bits (840), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 203/556 (36%), Positives = 292/556 (52%), Gaps = 68/556 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 415 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 436
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIT 486
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546
Query: 557 WNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 547 NGGVWGPWYKEETKKK 562
>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
Length = 575
Score = 328 bits (840), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 196/548 (35%), Positives = 288/548 (52%), Gaps = 63/548 (11%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
A L T + PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTP
Sbjct: 39 AKLQGTPSRPPHIIFILTDDQGFHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTP 97
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
SRS L+TG+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+++
Sbjct: 98 SRSQLITGRYQIHTGLQHSIIRPRQPNCLPLHQVTLPQKLQEAGYSTHMVGKWHLGFYKK 157
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
PT RGFD+ G G DYY + +C G D+ V G YST LY
Sbjct: 158 ECLPTRRGFDTFLGSLTGNVDYYSYDNCDG--PGVCGFDLHEGENVAWDQAGKYSTLLYA 215
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +P+F+Y+A AVH P Q+P E + + + + RR YA MV+
Sbjct: 216 QRVNQILASHNPQQPIFIYVAFQAVHT-----PLQSPREYIYPYRGMGNVARRKYAAMVT 270
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
+D +V N+ AL+K+G NS+++F +DNG +F + GSN PLRG K T W+GG+R
Sbjct: 271 CMDAAVKNITRALKKYGYYSNSVIVFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVR 326
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTK 377
G+ + SP +K+ ++ S L HI+DW PTL AG I+ LDG + W +++G ++
Sbjct: 327 GLGFVHSPLIKKKRRTSRALMHITDWYPTLVKLAGGNISSAEGLDGYDVWGAISEGKESP 386
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
R+EILHNID + N K +G+ GQ
Sbjct: 387 RTEILHNIDPLYNLAK-------------SGSLEEGQ----------------------- 410
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
G+ A++ +++K SD + + ++ N H
Sbjct: 411 --GVWNTAVQASIRVKDFKLLTGDPGYSDWIPP----------QTLTNFPGSWWNLERHT 458
Query: 498 NSTVKPC-LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
+ K LFNIT DP E+++L+ S+ +++K+L +L Y T +P D R +P
Sbjct: 459 DGARKSLWLFNITADPFERHDLSSSKPEVVKELLIRLGYYNRTAIPVRYPAEDPRGNPEL 518
Query: 557 WNNIWVPW 564
W PW
Sbjct: 519 NGGAWGPW 526
>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
Length = 599
Score = 327 bits (839), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545
Query: 556 RWNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562
>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
Length = 599
Score = 327 bits (839), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERI 485
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545
Query: 556 RWNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562
>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
Length = 601
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 204/553 (36%), Positives = 290/553 (52%), Gaps = 68/553 (12%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS +
Sbjct: 74 ATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFI 132
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+ PT
Sbjct: 133 TGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTK 192
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDLYTEAA 201
RGFD+ +G G DYY H K G D+ N DN GIYST +YT+
Sbjct: 193 RGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDSAAWDHDN---GIYSTQMYTQRV 248
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S LD
Sbjct: 249 QQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLD 303
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+V NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R V
Sbjct: 304 EAVSNVTLALKTYGFYNNSILIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIRAVG 359
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSE 380
+ SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 360 FVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP--- 416
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
RVD L + ++ W + G
Sbjct: 417 ------------------RVDILHNIDPIYTKAKNGSWA-----------------AGYG 441
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNS 499
I A+++ ++++ SD + + F+ + N + N+ ++
Sbjct: 442 IWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERITLST 491
Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P
Sbjct: 492 GKSIWLFNITADPYERVDLSSRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGG 551
Query: 560 IWVPWYDELDKQK 572
+W PWY E +K+K
Sbjct: 552 VWGPWYKEENKKK 564
>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
Length = 593
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 203/548 (37%), Positives = 291/548 (53%), Gaps = 70/548 (12%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS +T
Sbjct: 67 PSQPHLIFILADDQGFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPICTPSRSQFIT 125
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
GKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+ PT R
Sbjct: 126 GKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKR 185
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAI 202
GFD+ +G G DYY H K G D+ N DN GIYST +YT+
Sbjct: 186 GFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDHDNGIYSTQMYTQRVQ 241
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
++A HN +KP+FLY+A+ AVH+ P QAP + + I + RR YA M+S LDE
Sbjct: 242 QILASHNPTKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDE 296
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+R V
Sbjct: 297 AIHNVTLALKAYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAVG 351
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSE 380
+ SP LK V EL HI+DW PTL + A +I+ DT LDG + W+ +++G ++ R
Sbjct: 352 FVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDTQLDGYDVWETISEGLRSPR-- 409
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
+D + N Y + + + AG G
Sbjct: 410 ----VDILHNIDPIYT--KAKNGSWAAGY------------------------------G 433
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNS 499
I A+++ ++++ SD + + F+ + N + N+ ++
Sbjct: 434 IWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERITLST 483
Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 484 GKSIWLFNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKDPRSNPRLNGG 543
Query: 560 IWVPWYDE 567
+W PWY E
Sbjct: 544 VWGPWYKE 551
>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
Length = 573
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 200/550 (36%), Positives = 288/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
KP+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PQKPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 KKRRTSRALVHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L S G +N E++ +R+
Sbjct: 442 LASFPGSWWNL-----------------------------ERMASIRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA+ + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAANPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 DFNGGAWGPW 523
>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
Length = 569
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ Q+ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDMWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ +R+
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 DFNGGAWGPW 523
>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
Length = 601
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 203/556 (36%), Positives = 292/556 (52%), Gaps = 70/556 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 71 STTSASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 129
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 130 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 189
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 190 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDSDNGIYSTQMYT 245
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLYLA+ AVH+ P QAP + I + RR YA M+S
Sbjct: 246 QRVQQILASHNPTKPLFLYLAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 300
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 301 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 355
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D +DG + W+ +++G ++
Sbjct: 356 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQIDGYDIWETISEGLRS 415
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 416 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 437
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 438 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 487
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 488 TLSTGKSIWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 547
Query: 556 RWNNIWVPWYDELDKQ 571
+W PWY E K+
Sbjct: 548 LNGGVWGPWYKEETKK 563
>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
Length = 571
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 199/559 (35%), Positives = 290/559 (51%), Gaps = 92/559 (16%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+ P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS
Sbjct: 40 SVAPPQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRS 98
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
L+TG+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+
Sbjct: 99 QLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECL 158
Query: 143 PTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
PT RGFD+ G G DYY + +C G D+ V G YST LY +
Sbjct: 159 PTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRV 216
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+++A H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +D
Sbjct: 217 SHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMD 271
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+V N+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+
Sbjct: 272 EAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLG 327
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKTKRS 379
+ SP LK+ ++ S L HI+DW PTL AG + + D LDG + W +++G + R+
Sbjct: 328 FVHSPLLKRKRRTSRALVHITDWYPTLVGLAGGTVSVAD-GLDGYDVWPAISEGRASPRT 386
Query: 380 EILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
EILHNID + N ++ AA+RV + K + G D YGD
Sbjct: 387 EILHNIDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD---- 435
Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
+ P + L + G +N E++ R+
Sbjct: 436 ---WIPPQTLAAFPGSWWNL-----------------------------ERMASARQAV- 462
Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
LFNI+ DP E+ +LA + D+++ L +L Y T +P
Sbjct: 463 ------------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRY 504
Query: 546 KPFDKRADPARWNNIWVPW 564
+ RA P W PW
Sbjct: 505 PAENPRAHPDFNGGAWGPW 523
>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
Length = 569
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ Q+ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ +R+
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 DFNGGAWGPW 523
>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
Length = 569
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ Q+ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ +R+
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 DFNGGAWGPW 523
>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
Length = 578
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 202/556 (36%), Positives = 291/556 (52%), Gaps = 68/556 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 415 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 436
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERIT 486
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ FNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 487 LSTGKSVWPFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546
Query: 557 WNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 547 NGGVWGPWYKEETKKK 562
>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
Length = 605
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 202/556 (36%), Positives = 292/556 (52%), Gaps = 70/556 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 75 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 133
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 134 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 193
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 194 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 249
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 250 QRVQKILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 304
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 305 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 359
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D +DG + W+ +++G ++
Sbjct: 360 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQIDGYDIWETISEGLRS 419
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 420 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 441
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 442 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 491
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 492 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 551
Query: 556 RWNNIWVPWYDELDKQ 571
+W PWY E K+
Sbjct: 552 LNGGVWGPWYKEETKK 567
>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
Length = 597
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 204/557 (36%), Positives = 294/557 (52%), Gaps = 71/557 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 68 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 126
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 127 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 186
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 187 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 242
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 243 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 297
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 298 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 352
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 353 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 412
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 413 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 434
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 435 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 484
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP + P D R++P
Sbjct: 485 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPV-SIPQDPRSNPR 543
Query: 556 RWNNIWVPWYDELDKQK 572
+W PWY E K+K
Sbjct: 544 LNGGVWGPWYKEETKKK 560
>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
Length = 565
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 199/554 (35%), Positives = 287/554 (51%), Gaps = 90/554 (16%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+T
Sbjct: 38 PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 96
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LPL + LPQ L+E GY+TH +GKWHLGF+R+ PT R
Sbjct: 97 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRR 156
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GFD+ G G DYY + +C G D+ V G YST LY + +++
Sbjct: 157 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHIL 214
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V
Sbjct: 215 ASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 269
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL++HG NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + S
Sbjct: 270 NITWALKRHGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 325
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ ++ S L HI+DW PTL A AG + LDG + W +++G + R+EILHN
Sbjct: 326 PLLKRKRRTSRALVHITDWYPTLVALAGGTASAADGLDGYDVWPAISEGRASPRTEILHN 385
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
ID + N ++ AA+RV + K + G D YGD +
Sbjct: 386 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 431
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P + L + G +N E++ R+
Sbjct: 432 PPQTLAAFPGSWWNL-----------------------------ERMASARQAV------ 456
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
LFNI+ DP E+ +LA + D+++ L +L Y T +P +
Sbjct: 457 -------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP 503
Query: 551 RADPARWNNIWVPW 564
RA P W PW
Sbjct: 504 RAHPDFNGGAWGPW 517
>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
Length = 574
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 199/554 (35%), Positives = 287/554 (51%), Gaps = 90/554 (16%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+T
Sbjct: 47 PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 105
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LPL + LPQ L+E GY+TH +GKWHLGF+R+ PT R
Sbjct: 106 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRR 165
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GFD+ G G DYY + +C G D+ V G YST LY + +++
Sbjct: 166 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHIL 223
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V
Sbjct: 224 ASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 278
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL++HG NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + S
Sbjct: 279 NITWALKRHGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 334
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ ++ S L HI+DW PTL A AG + LDG + W +++G + R+EILHN
Sbjct: 335 PLLKRKRRTSRALVHITDWYPTLVALAGGTASAADGLDGYDVWPAISEGRASPRTEILHN 394
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
ID + N ++ AA+RV + K + G D YGD +
Sbjct: 395 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 440
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P + L + G +N E++ R+
Sbjct: 441 PPQTLAAFPGSWWNL-----------------------------ERMASARQAV------ 465
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
LFNI+ DP E+ +LA + D+++ L +L Y T +P +
Sbjct: 466 -------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP 512
Query: 551 RADPARWNNIWVPW 564
RA P W PW
Sbjct: 513 RAHPDFNGGAWGPW 526
>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
Length = 476
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 178/397 (44%), Positives = 248/397 (62%), Gaps = 28/397 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+I++I+ADDLG +DV +HGS I TPNID LAY G+ L +YVQ +CTPSRS LMTG+Y
Sbjct: 1 PNILLIVADDLGHHDVGYHGSV-IKTPNIDHLAYTGVRLENYYVQPICTPSRSQLMTGRY 59
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
IH G+QH +I +P +PL +LP+ LK+ GY+TH +GKWHLGF+++ P RGFD
Sbjct: 60 QIHTGLQHNIINPFQPNAVPLDLPMLPEVLKQNGYSTHMVGKWHLGFYKDEVLPMNRGFD 119
Query: 150 SHYGYWQGLQDYYDH-SCKA---TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
S+YGY G +DY+ H C A + G+D+R++ +VD G YST L+ E A +V+
Sbjct: 120 SYYGYLTGSEDYFTHRRCGALPGANKTVCGIDLRNDFEVDWNQTGKYSTQLFAEKAEDVV 179
Query: 206 ---AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
A H +P+FLY+A AVHA P Q P+E + + DI DP+RR AGMV+ +DE
Sbjct: 180 RKHAVHQPDQPLFLYVAFQAVHA-----PNQVPNEYLKPY-DIDDPKRRLLAGMVTCMDE 233
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
+VGN+ AA + G+ +N++V+F DNG ++ N G+N PLRG K + W+GGMRGV
Sbjct: 234 AVGNITAAFKDTGLWDNTVVVFTTDNGG---DVNENSGNNWPLRGWKHSLWEGGMRGVGF 290
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEI 381
+ SP LK + L H+SDW PTL AG + S LDG + W + + + R E+
Sbjct: 291 VNSPLLKSFGYPVNGLMHVSDWFPTLLNLAGANTSSISGLDGHDIWKSIAEREDSPRKEL 350
Query: 382 LHNIDNVDNPQ----------KYYAALRVDDLKYVAG 408
LHNID + P+ + +ALRV D K + G
Sbjct: 351 LHNIDPLYPPRGSKLNHTYDVRVRSALRVGDFKIITG 387
Score = 42.7 bits (99), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFN+ +DPCE +L+ + +L + DKL Y +T VP D R++P + PW
Sbjct: 417 LFNVHEDPCETTDLSVALPGVLTMMLDKLLAYNATAVPCVFPDDDPRSNPEEHGGAFGPW 476
>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 491
Score = 325 bits (832), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 193/552 (34%), Positives = 291/552 (52%), Gaps = 87/552 (15%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
++ P+II+ + DDLGWND S+ GS I TPNID LA G+ L Q+YVQ LC+PSRSA
Sbjct: 18 VSSANNPNIILFVIDDLGWNDTSYQGS-DIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSA 76
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+ GKYP H+G+ HGVI G P+GL L E + +LK+ GY+THA+GKW LG + +TP
Sbjct: 77 LLAGKYPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTP 136
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
T+RGFD+ YGY+ +DYY H LD R+N G YST L+T+A +
Sbjct: 137 TYRGFDTFYGYYDADEDYYTHKVGGY------LDFRNNTDPVKDEDGTYSTFLFTKAIED 190
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
I + S P F+Y A+ +VH P +APD + K +I P R+ + GM+ LDE
Sbjct: 191 AINAKSDS-PFFIYGAYQSVHG-----PLEAPDIYLNK-CNIPYPNRKIFCGMMLALDEG 243
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+ N+ + L+ G+L+++I++ DNG G + SN PLRG K+T ++GG+RG++ +
Sbjct: 244 ISNITSLLQTKGLLDDTIIILTTDNG----GQTALGSSNWPLRGNKATVFEGGVRGISFV 299
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAA-AGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
WS L+++ ++ + HI+DW PT+ AG+ ++ LDG N WD + + R+EIL
Sbjct: 300 WSTKLRKSNYDNNAMMHITDWYPTIIEGIAGMTLDTKGLDGFNMWDTINDDTPSPRTEIL 359
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
H + +P +Y DNE + + + L
Sbjct: 360 HQL----DPPRY----------------------------DNERNPFIGQAALRQG---- 383
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVK 502
KL I Q N S+ + L K N H N T++
Sbjct: 384 ----DWKLIIGQ-------PNCSEKIGPHLVGNKC-------------PNGWVHLNGTIE 419
Query: 503 P--------CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFN+ DDP E+N L+ S +++ +L++++ Y +T + + P D R+DP
Sbjct: 420 EPEVNPSLTWLFNVKDDPNERNELSASHPEIVSKLKERIEAYNATHIVQMSPPIDPRSDP 479
Query: 555 ARWNNIWVPWYD 566
++ N+W PW D
Sbjct: 480 KKFGNVWTPWLD 491
>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
Length = 571
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 RKRRTSRALVHITDWYPTLVGLAGGSASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L S G +N E++ +R+
Sbjct: 442 LASFPGSWWNL-----------------------------ERMASVRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 DFNGGAWGPW 523
>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
Length = 573
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 197/553 (35%), Positives = 286/553 (51%), Gaps = 90/553 (16%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG
Sbjct: 47 QQPHIIFILTDDQGYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTG 105
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL + LPQ L++ GYATH +GKWHLGF+R+ PT RG
Sbjct: 106 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRG 165
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FD+ G G DYY + +C G D+ V G YST LY + ++A
Sbjct: 166 FDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILA 223
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N
Sbjct: 224 GHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRN 278
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP
Sbjct: 279 ITGALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSP 334
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
LK+T++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNI
Sbjct: 335 LLKRTRRTSRALLHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNI 394
Query: 386 DNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSP 431
D + N ++ AA+RV + K + G D YGD + P
Sbjct: 395 DPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIP 440
Query: 432 KEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYD 491
+ L + G +N E++ R+
Sbjct: 441 PQTLAAFPGSWWNL-----------------------------ERMASARQAV------- 464
Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
LFNI+ DP E+ +LA + D+++ L +L Y T +P + R
Sbjct: 465 ------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPR 512
Query: 552 ADPARWNNIWVPW 564
A P W PW
Sbjct: 513 AHPDFNGGAWGPW 525
>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
Length = 572
Score = 324 bits (831), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 198/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 RKRRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ +R+
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAAYNRTAIPVRYPAENPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 EFNGGAWGPW 523
>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
Length = 599
Score = 324 bits (830), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 203/552 (36%), Positives = 290/552 (52%), Gaps = 70/552 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TTA +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N + DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDRAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN KP+FLY+A+ AVH+ P QAP + I + RR YA M++
Sbjct: 244 QRVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLA 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALETYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL A +I+ D LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLITLAEGQIDEDVQLDGYDIWETISEGLRS 413
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
R +D + N Y + + + AG
Sbjct: 414 PR------VDILHNIDPIYT--KAKNGSWAAGY--------------------------- 438
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 439 ---GIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERI 485
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 486 TLSTGKTIWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545
Query: 556 RWNNIWVPWYDE 567
+W PWY E
Sbjct: 546 LNGGVWGPWYKE 557
>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
Length = 573
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 200/543 (36%), Positives = 285/543 (52%), Gaps = 68/543 (12%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHI+ ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS +TGK
Sbjct: 49 QPHIVFILADDQGFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGK 107
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R PT RGF
Sbjct: 108 YQIHTGLQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGF 167
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINV 204
D+ +G G DYY H K G D+ N DN GIYST +YT+ +
Sbjct: 168 DTFFGSLLGSGDYYTH-FKCDSPGICGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQI 223
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
+A HN KP+FLY+A+ AVH+ P QAP + + I++ RR YA M++ LDE++
Sbjct: 224 LASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAI 278
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
NV AL+K+G +NSI+++ +DNG GSN PLRG K T W+GG+R V +
Sbjct: 279 NNVTLALKKYGYYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFVH 334
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILH 383
SP LK V EL HI+DW PTL A +I+ D LDG + W+ +++G ++ R
Sbjct: 335 SPLLKNKGSVCKELVHITDWFPTLITLAEGQIDEDIQLDGYDIWETISEGRRSPR----- 389
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
+D + N Y + + + AG GI
Sbjct: 390 -VDILHNIDPIYT--KAKNGSWAAGY------------------------------GIWN 416
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTVK 502
A+++ +++ SD + + F+ V N + N+ ++
Sbjct: 417 TAIQSAIRVNHWKLLTGNPGYSDWVPP----------QAFSNVGPNRWHNERVSWSAGKT 466
Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
LFNIT DP E+ +L+ D++KQL +L+ + T VP P D R++P +W
Sbjct: 467 VWLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNGGVWG 526
Query: 563 PWY 565
PW+
Sbjct: 527 PWF 529
>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
Length = 358
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 166/348 (47%), Positives = 223/348 (64%), Gaps = 15/348 (4%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PH+++ILADDLGWNDV FHGS +I TP++D LA G+ L+ +YVQ LCTPSR+ LMTG
Sbjct: 21 RPPHVVLILADDLGWNDVGFHGS-EIKTPHLDRLAAQGVRLDNYYVQPLCTPSRNQLMTG 79
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y I G+QH +I +P+ +PL EKLLPQ L+E GY TH +GKWHLG F++ PT RG
Sbjct: 80 RYQIRTGLQHQIIWPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLPTHRG 139
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
F S +GY G +DYY H + P LD+R V G YST+L TE A +
Sbjct: 140 FQSFFGYLTGSEDYYTHKRCSPIAPLNVTRCALDLRDGDAVALNYSGRYSTELLTERATH 199
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+I +H +P+FLY+A AVHA P Q PD +A + I DP RR YAGMVS +DE+
Sbjct: 200 IITQHTPDQPLFLYVALQAVHA-----PLQVPDHYIAPYSFIQDPHRRRYAGMVSAMDEA 254
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VGN+ L++ G+ +N++++F DNG + G+N PLRG K T W+GG+RGVA +
Sbjct: 255 VGNITHTLQETGLWDNTVLIFSTDNGGQTL----YGGNNWPLRGRKWTLWEGGVRGVAFV 310
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVL 370
P +++ V+ EL HISDWLPT+ AG N T LDG + W +
Sbjct: 311 SGPLIEKPGAVNRELIHISDWLPTIAGLAGASTNGTKPLDGFDVWQSI 358
>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
Length = 607
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 199/550 (36%), Positives = 286/550 (52%), Gaps = 68/550 (12%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHI+ ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS +TGK
Sbjct: 83 QPHIVFILADDQGFRDVGYHGS-EIKTPTLDKLAAQGVKLENYYVQPICTPSRSQFITGK 141
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+ PT RGF
Sbjct: 142 YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 201
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINV 204
D+ +G G DYY H K G D+ N DN GIYST +YT+ +
Sbjct: 202 DTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQI 257
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
+A H+ KP+FLY+A+ AVH+ P QAP + + I + RR YA M+S LDE++
Sbjct: 258 LASHDPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAI 312
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
NV AL+++G +NSI+++ +DNG GSN PLRG K T W+GG+R V +
Sbjct: 313 NNVTLALKRYGFYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFVH 368
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
SP LK + EL HI+DW PTL A G DT LDG + W+ +++G ++
Sbjct: 369 SPLLKNKGTICKELVHITDWYPTLITLAEGWIDEDTQLDGYDIWETISEGRRSP------ 422
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
RVD L + ++ W + GI
Sbjct: 423 ---------------RVDILHNIDPIYTKAKNGSWA-----------------AGFGIWN 450
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTVK 502
A+++ ++++ SD + + F+ N + N+ ++
Sbjct: 451 TAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNAGPNRWHNERISWSTGKS 500
Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P +W
Sbjct: 501 VWLFNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWG 560
Query: 563 PWYDELDKQK 572
PWY E DK++
Sbjct: 561 PWYKEDDKKR 570
>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
Length = 573
Score = 323 bits (828), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 199/550 (36%), Positives = 286/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 50 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 108
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 109 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 168
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 169 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTLLYAQRASHILASHS 226
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 227 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 281
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NSI++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 282 ALKRYGFYNNSIIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 337
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 338 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 397
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 398 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 443
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ R+
Sbjct: 444 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 464
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 465 ---------WLFNISADPYEREDLAGQRPDVVRALLARLAYYNRTAIPVRYPAENPRAHP 515
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 516 DFNGGAWGPW 525
>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
Length = 573
Score = 323 bits (828), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 201/550 (36%), Positives = 286/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 49 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMLGKWHLGFYRKECLPTRRGFDT 167
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 168 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 225
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ +S+ RR YA MV+ +DE+V N+
Sbjct: 226 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMSNVSRRKYAAMVTCMDEAVRNITW 280
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NSI++F +DNG +F + GSN PLRG K T W+GG+RG+ I SP LK
Sbjct: 281 ALKRYGFYNNSIIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLK 336
Query: 330 QTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL A G LDG + W +++G + R+EILHNID +
Sbjct: 337 RKRRTSHALVHITDWYPTLVGLARGTVSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 396
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 397 YNHARHGSLEGGFGIWNTAIQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 442
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ R+
Sbjct: 443 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 463
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 464 ---------WLFNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENPRAHP 514
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 515 DFNGGAWGPW 524
>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 498
Score = 323 bits (828), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 202/559 (36%), Positives = 300/559 (53%), Gaps = 77/559 (13%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
A +L + + + + +KPH++ I+ADDLGWND+ ++ S I TP +D LA G+I
Sbjct: 11 MLAFALVVLASMSLVICSGSRKPHVVFIVADDLGWNDIGYNNPS-IFTPTLDKLAREGVI 69
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
LNQ YV +CTP R++LM+G Y +G+QH V+ EP GLPL L+PQ +KE GY T+
Sbjct: 70 LNQSYVLPMCTPDRASLMSGYYAYRVGLQHKVLDHAEPAGLPLNFTLIPQRMKEHGYTTY 129
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLGF + YTPT+RGFD YG++ +DY++H+ + LD+R+ +VD
Sbjct: 130 MLGKWHLGFCKWEYTPTYRGFDHFYGFYNAAEDYFNHTTS------KYLDLRNGKEVDWS 183
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
G YST +Y E A IA HNKS PM++YL +VH +AP + + + I D
Sbjct: 184 KNGTYSTYMYAEKATEYIATHNKSTPMYMYLPFQSVHG-----VIEAPQKYLDMYTFIHD 238
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
RR + MV+ +D++V VIAAL ++ + +N++++F DNG P+ +N G+N PLRG
Sbjct: 239 TNRRIKSAMVTAMDDAVKVVIAALERYELWDNTLLVFTTDNGGPA--NPNNAGNNWPLRG 296
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW 367
K T W+GG RGV + L++ V++E+ H++DW PTL AG + D+ +DG+N W
Sbjct: 297 SKLTLWEGGTRGVGFVHGKMLEKKGYVNNEMMHVTDWYPTLLHIAGGKA-DSDMDGLNVW 355
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID 427
D L KG + R E ++NID ++ AA+RV D K + G G D W
Sbjct: 356 DTLNKGYPSPRKEFIYNIDEIEPTG---AAIRVGDYKLIVG--KAGHPDGWI-------- 402
Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
A + D + E+IL + E+ +
Sbjct: 403 ---------------------------------PAPTIDGVW-----EEILPMNEYEGL- 423
Query: 488 CNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP---PG 544
D + + N T LFNI DP E+NNLAE + + +L +L K +VP P
Sbjct: 424 --LDGRLSKDNDTY---LFNIKADPTERNNLAEQMPEKVAELSARLEELKQKLVPAIRPA 478
Query: 545 NKPFDKRADPARWNNIWVP 563
P K +DP ++ W P
Sbjct: 479 PDP--KGSDPRKFGGAWSP 495
>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 784
Score = 323 bits (828), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 182/425 (42%), Positives = 254/425 (59%), Gaps = 37/425 (8%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L L+ ++ +KPHII ILADDLGW+DV +HGS I TPNIDALA G+ L+
Sbjct: 23 LVYVLILEVTDIHVNGRQKPHIIFILADDLGWSDVGYHGSV-IKTPNIDALASEGVKLDN 81
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
+Y+ LCTPSR L++G+Y IH G+QH I +P LP+ E +LPQ LKE GYATH +G
Sbjct: 82 YYMSLLCTPSRGQLLSGRYEIHTGLQHRTIDMMQPLCLPIDETILPQKLKERGYATHMVG 141
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV-DNKTI 189
KWHLGF+R+ P +RGFD+ G++QG+ DYY H+ Y G D R N V K
Sbjct: 142 KWHLGFYRKECLPNYRGFDTFMGFYQGMADYYYHNISTGI--YHGWDFRRNNDVIAQKYA 199
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE--EVAKFLDISD 247
G YST ++ + A +I +HN P+FL+L+ A+H P Q P ++ K L ++
Sbjct: 200 GQYSTHVFADEAQIIIMKHNPEVPLFLFLSFQAIHL-----PLQVPSRYADMYKTLIPNN 254
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
+R+ YAGMV+ +DE+VG V+ AL+ G+ ENS+++F DNG GI G N PLRG
Sbjct: 255 ADRQKYAGMVTCMDEAVGRVVTALKHTGLWENSVLVFSTDNG----GI---LGGNWPLRG 307
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTS-LDGV 364
K++ ++GG+RGV + SP L + V+ EL H++DWLPTL AG ++ T LDG
Sbjct: 308 SKASLYEGGVRGVGFVTSPLLPSRVRGTVNRELLHMTDWLPTLVRLAGGDLMGTKPLDGY 367
Query: 365 NQWDVLTKGAKTKRSEILHNIDNV--------DNPQK--------YYAALRVDDLKYVAG 408
NQW+ +++G + R E+L ID + D+P AA+RV D K + G
Sbjct: 368 NQWETISQGTPSVRDEVLLTIDPLIAKAHHFKDDPYADNDSFNVLIRAAIRVGDWKLITG 427
Query: 409 TDNNG 413
NG
Sbjct: 428 PPGNG 432
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 5/63 (7%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP---PGNKPFDKRADPARWNNIW 561
L+N+ DP E+N+ + + D+++ L ++ Y +T VP P + ADPAR +W
Sbjct: 456 LYNVVADPNERNDYSHDRPDIVRFLLARIGHYYNTSVPVLDPSQEIL--LADPARHGGVW 513
Query: 562 VPW 564
PW
Sbjct: 514 SPW 516
>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
Length = 573
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 199/547 (36%), Positives = 284/547 (51%), Gaps = 68/547 (12%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
T +PHI+ ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 45 TPSSQPHIVFILADDQGFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQF 103
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R PT
Sbjct: 104 ITGKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPT 163
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEA 200
RGFD+ +G G DYY H K G D+ N DN GIYST +YT+
Sbjct: 164 QRGFDTFFGSLLGSGDYYTH-FKCDSPGICGYDLYEN---DNAAWDHDNGIYSTQMYTQK 219
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
++A HN KP+FLY+A+ AVH+ P QAP + + I++ RR YA M++ L
Sbjct: 220 VQQILASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACL 274
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE++ NV AL+K+G +NSI+++ +DNG GSN PLRG K T W+GG+R V
Sbjct: 275 DEAINNVTLALKKYGYYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAV 330
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRS 379
+ SP LK V EL HI+DW PTL A +I+ D LDG + W+ +++G ++
Sbjct: 331 GFVHSPLLKNKGSVCKELVHITDWFPTLITLAEGQIDEDIQLDGYDIWETISEGRRSP-- 388
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
RVD L + ++ W +
Sbjct: 389 -------------------RVDILHNIDPIYTKAKNGSWA-----------------AGY 412
Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCN 498
GI A+++ +++ SD + + F+ + N + N+ +
Sbjct: 413 GIWNTAIQSAIRVNHWKLLTGNPGYSDWVPP----------QAFSNMGPNRWHNERVSWS 462
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
+ LFNIT DP E+ +L+ D++KQL +L+ + T VP P D R++P
Sbjct: 463 AGKTVWLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNG 522
Query: 559 NIWVPWY 565
+W PW+
Sbjct: 523 GVWGPWF 529
>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
Length = 572
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 198/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 48 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 336 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 396 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ +R+
Sbjct: 442 LAAFPGSWWNL-----------------------------ERMASVRQAV---------- 462
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENPRAHP 513
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 514 DFNGGAWGPW 523
>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
Length = 595
Score = 323 bits (827), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 201/548 (36%), Positives = 288/548 (52%), Gaps = 68/548 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TTA +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTAASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
R +D + N Y + + + AG
Sbjct: 415 R------VDILHNIDPIYT--KAKNGSWAAGY---------------------------- 438
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 439 --GIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIS 486
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546
Query: 557 WNNIWVPW 564
+W PW
Sbjct: 547 NGGVWGPW 554
>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
Length = 599
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 202/552 (36%), Positives = 290/552 (52%), Gaps = 70/552 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+ TA +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 70 SVTAVSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 128
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 129 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 188
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 189 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 244
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 245 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLS 299
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 300 CLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 354
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I++ LDG + W+ +++G ++
Sbjct: 355 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDENIQLDGYDVWETISEGLRS 414
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
R +D + N Y + + + AG
Sbjct: 415 PR------VDILHNIDPIYT--KAKNGSWAAGY--------------------------- 439
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 440 ---GIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 486
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P
Sbjct: 487 TLSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPR 546
Query: 556 RWNNIWVPWYDE 567
+W PWY E
Sbjct: 547 LNGGVWGPWYKE 558
>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
Length = 598
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 202/549 (36%), Positives = 289/549 (52%), Gaps = 70/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TTA +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 67 STTATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 185
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 241
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 242 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 297 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 351
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 352 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRS 411
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
R +D + N Y + + + AG
Sbjct: 412 PR------VDILHNIDPIYT--KAKNGSWAAGY--------------------------- 436
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 437 ---GIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 483
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P
Sbjct: 484 TLSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPR 543
Query: 556 RWNNIWVPW 564
+W PW
Sbjct: 544 LNGGVWGPW 552
>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
Length = 600
Score = 322 bits (825), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 201/548 (36%), Positives = 288/548 (52%), Gaps = 68/548 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TTA +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTAASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
R +D + N Y + + + AG
Sbjct: 415 R------VDILHNIDPIYT--KAKNGSWAAGY---------------------------- 438
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 439 --GIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIS 486
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546
Query: 557 WNNIWVPW 564
+W PW
Sbjct: 547 NGGVWGPW 554
>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
Length = 503
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 177/404 (43%), Positives = 244/404 (60%), Gaps = 28/404 (6%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A K+P+I+ ILADD G++DV +HGS+ I TPNID LA+ G+ L +YVQ +CTP+RS L+
Sbjct: 27 ASKQPNILFILADDYGYHDVGYHGSA-IRTPNIDRLAFEGVRLENYYVQPICTPTRSQLL 85
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
+G+Y IH G+QH +I +P LP L L+E GYA H +GKWHLGF++E Y PT
Sbjct: 86 SGRYQIHTGLQHSIIWAAQPNALPKELPTLADKLREEGYANHIVGKWHLGFYKEEYVPTN 145
Query: 146 RGFDSHYGYWQGLQDYYDHS-CKATFEPYQ--GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
RGFDS YGY G + YY+ + C A GLD R N + N++ YST LY E
Sbjct: 146 RGFDSFYGYLTGSEFYYNKTYCLAQINRSDVCGLDFRENDRSINES--EYSTHLYAERTK 203
Query: 203 NVIAEHNKS---KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
++A+H + +P+FLYLA +VH P + P + + I D R+ YAGMVS
Sbjct: 204 QLVADHTSAHPDQPLFLYLALQSVHG-----PLEVPAQYRTPYKHIKDENRQIYAGMVSC 258
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+DE+V NV A +K+G+ +N+I++F DNG G + G+N PLRG K + WDGGMRG
Sbjct: 259 MDEAVKNVTDAFKKYGLWDNTILVFSTDNG----GQVAEGGNNWPLRGWKGSLWDGGMRG 314
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
V + S LKQ V+ +L H+SDW PTL AG + LDG +QW ++ GA+ R+
Sbjct: 315 VGFVHSTLLKQKGAVAHQLMHVSDWFPTLLNQAGGNASSVKLDGFDQWAAISDGAEGPRT 374
Query: 380 EILHNIDNVDNPQ----------KYYAALRVDDLKYVAGTDNNG 413
EILHNID ++ + AALR++ K + G NG
Sbjct: 375 EILHNIDPLNGKRGKRTEEVFDNTVRAALRMNKWKLLTGEPGNG 418
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 35/60 (58%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E +N+A +++K + KLA Y +T VPP P+D R+ P +W PW
Sbjct: 443 LFDIEADPYEISNVAAENPEVVKTMLLKLAAYNATAVPPFYPPYDPRSYPENHGGVWGPW 502
>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
melanoleuca]
Length = 573
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 197/553 (35%), Positives = 286/553 (51%), Gaps = 90/553 (16%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG
Sbjct: 46 QPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTG 104
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RG
Sbjct: 105 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 164
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FD+ G G DYY + +C G D+ V G YST LY + +++A
Sbjct: 165 FDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILA 222
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N
Sbjct: 223 SHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRN 277
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP
Sbjct: 278 ITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSP 333
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
LK+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNI
Sbjct: 334 LLKRKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNI 393
Query: 386 DNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSP 431
D + N ++ AA+RV + K + G D YGD + P
Sbjct: 394 DPLYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIP 439
Query: 432 KEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYD 491
+ L + G +N E++ R+
Sbjct: 440 PQTLAAFPGSWWNL-----------------------------ERMASARQAV------- 463
Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
LFNI+ DP E+ +LA + D+++ L +L Y T +P + R
Sbjct: 464 ------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPR 511
Query: 552 ADPARWNNIWVPW 564
A P W PW
Sbjct: 512 AHPDFNGGAWGPW 524
>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
Length = 600
Score = 322 bits (825), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 201/549 (36%), Positives = 288/549 (52%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+T A +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STAATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDRLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIVYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
R +D + N Y + + + AG
Sbjct: 415 R------VDILHNIDPIYT--KAKNGSWAAGY---------------------------- 438
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 439 --GIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIT 486
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 547 NGGVWGPWY 555
>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 311
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 161/316 (50%), Positives = 208/316 (65%), Gaps = 8/316 (2%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GW+D S HGSSQIPTPN+DA+A +G+ILNQHYVQ LCTPSR+ALMTG+YP H+GMQH VI
Sbjct: 1 GWDDTSIHGSSQIPTPNMDAIAADGIILNQHYVQPLCTPSRAALMTGRYPFHVGMQHSVI 60
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
EPW LPL L+P+Y + GY TH +GKWHLG++ Y P RGFD+ G++ D
Sbjct: 61 KPAEPWALPLNYTLMPEYFRCLGYKTHMVGKWHLGYYDRQYVPIKRGFDTFLGFYNPSLD 120
Query: 161 YYDHSCKATFEPYQGLDMR---HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLY 217
YY+ + T + G D R N + K Y+T YT + +I HNKS PMFL+
Sbjct: 121 YYNQNF--TGNNHTGHDFRCGDQNYWAEEKE---YATYYYTNKTVEIIRRHNKSAPMFLF 175
Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
L+H A H Q P V F I + R +AGMV LD+SVG V+ L+K GML
Sbjct: 176 LSHQAPHVSGGRPLLQVPTHGVRNFSYIGENNRTLFAGMVDALDQSVGIVVDTLQKAGML 235
Query: 278 ENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE 337
++I++F +DNGA +G SN+G N PLRG K T ++GG+RG A +WS LK T++VS++
Sbjct: 236 NDTIIVFCSDNGALPWGFKSNRGYNWPLRGGKFTLYEGGVRGTAFLWSRRLKNTKRVSNQ 295
Query: 338 LFHISDWLPTLCAAAG 353
L HISDWLPTL +AAG
Sbjct: 296 LMHISDWLPTLYSAAG 311
>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
Length = 545
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 195/553 (35%), Positives = 294/553 (53%), Gaps = 50/553 (9%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PHI+ I+ADDLGWNDV +H + I TPN+D +A G+ILN YV +CTPSR++ +
Sbjct: 24 AKRPPHILYIVADDLGWNDVGWH-NPDIKTPNLDRMAGGGVILNSSYVHPICTPSRNSFL 82
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG YP +G+ I + + L LP+ LK+ GY+TH IGKWHLGF E YTPT
Sbjct: 83 TGVYPFRVGLSGTAITPHQARFMSLKTPTLPEKLKKLGYSTHMIGKWHLGFCNERYTPTR 142
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
RGFDS G++ G QDYY H+ +G D R N V YST Y + A+++I
Sbjct: 143 RGFDSFLGFYTGTQDYYKHTTA------KGYDFRFNQTVFYPPKKQYSTKTYAKRAVDII 196
Query: 206 AEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
EH + K P+FLYLA +VH P Q P + ++ +I + +RR Y+GMV+ +D+++
Sbjct: 197 TEHKRKKNPLFLYLAFQSVHT-----PLQVPKKYEKQYNNIKNKDRRVYSGMVTAMDDAI 251
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G V+AA RKH + N IV+F DNG + + G+N PLRG K+T W+GG R V+ ++
Sbjct: 252 GAVMAAFRKHKFMGNLIVVFTTDNGGAAHIV----GNNFPLRGSKTTVWEGGTRAVSFVY 307
Query: 325 SP-WLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
S LK+T LFH +DW PT+ A AG + + LDGVNQW ++ G ++R+E +
Sbjct: 308 SKNLLKKTGYTHDGLFHATDWFPTILAVAGGKPKQEKQLDGVNQWPMIRSGKPSRRTEFV 367
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT----------DNEIDKYSPK 432
+ ID + +A+R+ K + G N G + WY +NE D
Sbjct: 368 YEIDK----SRPVSAIRMGKYKLIVG--NPGDYNNWYRIAKSNSFCPIYDENETDSDEDI 421
Query: 433 EVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDN 492
+ S I NA K +K + P E+ +L++ + + +
Sbjct: 422 FIYESLNNIVRNA---KGYVKWLSSKLPGYRMVRKWHRYRQLERCRKLKKRDKAKWRFP- 477
Query: 493 KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY--KSTMVPPGNKPFDK 550
+ L+++ DP E+ N+A + L+++++ +L Y K+ V +K
Sbjct: 478 ---------RYLLYDLEADPSERKNIAGYKPVLVRKMKARLEHYQKKTKHVAGQSKRKVA 528
Query: 551 RADPARWNNIWVP 563
ADP + +W P
Sbjct: 529 AADPKHHHGVWTP 541
>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
Length = 574
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 196/540 (36%), Positives = 285/540 (52%), Gaps = 63/540 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG
Sbjct: 48 RPPHIIFILTDDQGYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITG 106
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+++ PT RG
Sbjct: 107 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 166
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FD+ G G DYY + +C G D+ V G YST LY + ++A
Sbjct: 167 FDTFLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILA 224
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ +P+F+Y+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N
Sbjct: 225 SHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKN 279
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+K+G +NS+++F DNG +F + GSN PLRG K T W+GG+RG+ + SP
Sbjct: 280 ITWALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSP 335
Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
+K+ ++ S L HI+DW PTL + A G N LDG N W +++G ++ R+EILHNI
Sbjct: 336 LIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPGLDGYNVWPAISEGKESPRTEILHNI 395
Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGITFN 444
D + N K Y +L + + + EW T D + P + L + G +N
Sbjct: 396 DPLYNHAK-YGSLEDGFGIWNTAVQASIRVGEWKLLTGDPGYSDWIPPQTLTNFPGSWWN 454
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
+ +D LR +
Sbjct: 455 LERL----------------TDGLRKSV-------------------------------W 467
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L+E + D+++ L +L Y T +P + RA P W PW
Sbjct: 468 LFNITADPYERYDLSEQRPDIVRALLTRLVHYNRTAIPVRYPAENPRAHPDFNGGAWGPW 527
>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
Length = 573
Score = 321 bits (822), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 197/548 (35%), Positives = 289/548 (52%), Gaps = 63/548 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG
Sbjct: 48 RPPHIIFILTDDQGYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITG 106
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+++ PT RG
Sbjct: 107 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 166
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FD+ G G DYY + +C G D+ V G YST LY + ++A
Sbjct: 167 FDTFLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKILA 224
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ +P+F+Y+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N
Sbjct: 225 SHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKN 279
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+K+G +NS+++F DNG +F + GSN PLRG K T W+GG+RG+ + SP
Sbjct: 280 ITWALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSP 335
Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
+K+ ++ S L HI+DW PTL + A G N LDG N W +++G ++ R+EILHNI
Sbjct: 336 LIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPGLDGYNVWPAISEGKESPRTEILHNI 395
Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGITFN 444
D + N K Y +L + + + EW T D + P + L + G +N
Sbjct: 396 DPLYNHAK-YGSLEDGFGIWNTAVQASIRVGEWKLLTGDPGYSDWIPPQTLTNFPGSWWN 454
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
+ +D LR +
Sbjct: 455 LERL----------------TDGLRKSV-------------------------------W 467
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L++ + D+++ L +L Y T +P + RA P W PW
Sbjct: 468 LFNITADPYERYDLSDQRPDVVRTLLMRLVHYNQTAIPVRYPAENPRAHPDFNGGAWGPW 527
Query: 565 YDELDKQK 572
E D ++
Sbjct: 528 ASEDDAEE 535
>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
caballus]
Length = 523
Score = 321 bits (822), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 173/369 (46%), Positives = 228/369 (61%), Gaps = 19/369 (5%)
Query: 26 APKKP--HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
AP +P ++ +L DD+G +DV H S+ TP +DALA GL+L+ +Y Q LC PSRS
Sbjct: 29 APGRPAAKLVYMLTDDMGXHDVGLH-ESRFSTPRLDALAAGGLLLDNYYTQPLCXPSRSQ 87
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L++G+Y IH G+QH +I +P LPL EKLLPQ LKEAGY TH +GKWHLG +R+ P
Sbjct: 88 LLSGRYQIHTGLQHQIIWPCQPSCLPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 147
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATF-----EPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
T RGFD+++GY G +DYY H + TF LD R +V +YST ++T
Sbjct: 148 TRRGFDTYFGYLLGSEDYYSHE-RCTFIDALNVTRCALDFRDGEEVATGYKNMYSTSVFT 206
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
E A +I H KP+FLYLA +VH +P Q P+E + + I D R YAGMVS
Sbjct: 207 ERATALITNHPPEKPLFLYLALQSVH-----QPLQVPEEYLKPYDFIQDKNRYHYAGMVS 261
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE+VGNV AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+R
Sbjct: 262 LLDEAVGNVTAALKSRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVR 317
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTK 377
GV + SP LKQ + EL HISDWLPTL AG N T LDG N W +++G+ +
Sbjct: 318 GVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGSTNGTKPLDGFNVWKTISEGSPSP 377
Query: 378 RSEILHNID 386
R E+LHNID
Sbjct: 378 RVELLHNID 386
>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
Length = 573
Score = 321 bits (822), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 196/550 (35%), Positives = 285/550 (51%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 49 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 167
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + +++A H+
Sbjct: 168 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHS 225
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+ +
Sbjct: 226 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITS 280
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 281 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 336
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 337 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 396
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 397 YNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 442
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ R+
Sbjct: 443 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 463
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +L Y T +P + RA P
Sbjct: 464 ---------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPRAHP 514
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 515 DFNGGAWGPW 524
>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
Length = 574
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 196/540 (36%), Positives = 285/540 (52%), Gaps = 63/540 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG
Sbjct: 48 RPPHIIFILTDDQGYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITG 106
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+++ PT RG
Sbjct: 107 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 166
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FD+ G G DYY + +C G D+ V G YST LY + ++A
Sbjct: 167 FDTFLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILA 224
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ +P+F+Y+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N
Sbjct: 225 SHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKN 279
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+K+G +NS+++F DNG +F + GSN PLRG K T W+GG+RG+ + SP
Sbjct: 280 ITWALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSP 335
Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
+K+ ++ S L HI+DW PTL + A G N LDG N W +++G ++ R+EILHNI
Sbjct: 336 LIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPGLDGYNVWPAISEGKESPRTEILHNI 395
Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGITFN 444
D + N K Y +L + + + EW T D + P + L + G +N
Sbjct: 396 DPLYNHAK-YGSLEDGFGIWNTAVQASIRVGEWKLLTGDPGYSDWIPPQTLTNFPGSWWN 454
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
+ +D LR +
Sbjct: 455 LERL----------------TDGLRKSV-------------------------------W 467
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L+E + D+++ L +L Y T +P + RA P W PW
Sbjct: 468 LFNITADPYERYDLSEQRPDVVRALLTRLVHYNRTAIPVRYPAENPRAHPDFNGGAWGPW 527
>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
Length = 555
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 199/551 (36%), Positives = 287/551 (52%), Gaps = 68/551 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 67 GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+++G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544
Query: 557 WNNIWVPWYDE 567
+W PWY E
Sbjct: 545 NGGVWGPWYKE 555
>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
Length = 575
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 196/550 (35%), Positives = 285/550 (51%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 51 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 109
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 110 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 169
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + +++A H+
Sbjct: 170 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHS 227
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+ +
Sbjct: 228 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITS 282
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 283 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 338
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 339 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 398
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 399 YNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 444
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ R+
Sbjct: 445 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 465
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +L Y T +P + RA P
Sbjct: 466 ---------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPRAHP 516
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 517 DFNGGAWGPW 526
>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
Length = 564
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 164/360 (45%), Positives = 229/360 (63%), Gaps = 13/360 (3%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHI+ ILADD G+ D+ +HGS +I TP +D LA G+ L +YVQ +C+PSRS +TGK
Sbjct: 46 QPHIVFILADDQGYRDIGYHGS-EIRTPTLDKLASEGVRLENYYVQPICSPSRSQFITGK 104
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LPL LPQ LK+AGY TH +GKWHLGF+++ PT RGF
Sbjct: 105 YQIHTGLQHSIIRPSQPNCLPLDNMTLPQKLKKAGYQTHMVGKWHLGFYKKECMPTQRGF 164
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
DS +G G DYY+H K G D+ N ++ GIYST++YT+ + +++
Sbjct: 165 DSFFGSLLGSGDYYNH-YKCDSPGICGYDLYENNNAAWDQDNGIYSTEMYTQRVLQILSS 223
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
HN +KP+FLY+A+ AVH+ P QAP + + I + RR YA M+S LD++V NV
Sbjct: 224 HNPNKPLFLYIAYQAVHS-----PLQAPGNYLDNYNSILNENRRRYAAMLSCLDDAVNNV 278
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
AL+K+G +NSI+++ +DNG G+N PLRG K T W+GG+R V + SP
Sbjct: 279 TTALKKYGFYDNSIIIYSSDNGGQPMA----GGNNWPLRGSKGTYWEGGIRAVGFVHSPI 334
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNID 386
LK + EL HI+DW PTL AG EI+ + LDG + W+ +++G ++ R EILHNID
Sbjct: 335 LKMKGYICKELVHITDWFPTLVTLAGGEIDEEIQLDGYDIWETISEGKRSPRQEILHNID 394
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 41/68 (60%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L++ D++KQL +LA + T VP P D R++P +W PW
Sbjct: 466 LFNITADPYERIDLSDRYPDIVKQLLRRLAQFNKTAVPVRYPPRDPRSNPKLNGGVWGPW 525
Query: 565 YDELDKQK 572
Y + K+K
Sbjct: 526 YKDHSKRK 533
>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 524
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 215/579 (37%), Positives = 300/579 (51%), Gaps = 86/579 (14%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQ---IPTPNIDALAYN 64
F + C L ++ A + P+++ ILADD G+NDV +HG S I TPN+D LA
Sbjct: 9 FVVIVCLTLLTSG--SSYAEQLPNVVFILADDYGFNDVGYHGRSHGSAILTPNLDMLAGE 66
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
G+ L +YVQ +C+P+RS LM+G+Y IH G+QHGVI +P LPL E LPQ LKE GY
Sbjct: 67 GVKLENYYVQPICSPTRSQLMSGRYQIHTGLQHGVIRPPQPNCLPLDEVTLPQKLKENGY 126
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSC-KATFEPYQGLDMRHNMQ 183
AT+ +GKWH+GF+ + PT RGFDS++ + +D++ SC
Sbjct: 127 ATNMVGKWHIGFYLDACLPTERGFDSYFAW----EDHF--SCLPXXXXXXXXXXXXXXXX 180
Query: 184 VDNKTI-----GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
NKT G YST L+T I+VI H+K+KP+F+YLA+ AVH P + PD
Sbjct: 181 XANKTPVFQYEGQYSTHLFTNKTIDVIERHDKTKPLFIYLAYQAVHF-----PLEVPDSY 235
Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
+ +++I+D RRTYAGMVS +DE VGNV AL++ G+ +N+I++F DNG P I S
Sbjct: 236 MDPYMNITDKNRRTYAGMVSCMDEGVGNVTRALKEAGLYDNTIIIFSTDNGGP---IQSG 292
Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAA-AGIE 355
+N PLRG K + W+GG+ GV + SP L + K V+ EL H+SDWLPT+ A AG
Sbjct: 293 -ANNWPLRGSKGSLWEGGIHGVGFVHSPLLPTSVKGTVNHELIHVSDWLPTIVAGIAGGA 351
Query: 356 INDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQ 414
+N T LDG N W ++ A + R EILHNID++ Y A + + +G+
Sbjct: 352 LNGTKPLDGYNVWSSISGTASSPRKEILHNIDSL------YIAPEILPSYF------DGE 399
Query: 415 SDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQ-KHAADPKANSSD------- 466
W DK+ +++ L++ K P A++SD
Sbjct: 400 DFLWRFLARVLEDKFD-------------TSIRAALRVGDWKIKTGPDASASDVWQPPAE 446
Query: 467 -ALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDL 525
L TI EK K LFNIT DP E+ +L+ D+
Sbjct: 447 SGLHTIYPIEK----------------------PNQKVWLFNITADPLERTDLSGKYPDV 484
Query: 526 LKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
+ + +L Y VP D ADPA W PW
Sbjct: 485 VDEFLQRLEEYYKGSVPVRYPDPDLEADPALHGGSWGPW 523
>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
Length = 572
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 70/550 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 41 GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 99
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 100 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 159
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 160 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 215
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 216 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 270
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+++G NSI+++ +DNG P+ G GSN PLRG K T W+GG+
Sbjct: 271 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 325
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 326 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 385
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
RVD L + ++ W
Sbjct: 386 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 407
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
+ GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 408 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERI 457
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 458 TLSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 517
Query: 556 RWNNIWVPWY 565
+W PWY
Sbjct: 518 LNGGVWGPWY 527
>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 185/454 (40%), Positives = 263/454 (57%), Gaps = 47/454 (10%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+ +CTL+ + +PHII ILADDLGW+DV +HGS I TP+IDALA G
Sbjct: 7 SFLLVASCTLV---------SSSRPHIIFILADDLGWSDVGYHGSV-IKTPHIDALASEG 56
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ L+ +Y LCTPSRS LMTG+Y IH G+QH I +P LP+ E +LPQ LK+ GYA
Sbjct: 57 VKLDNYYTSLLCTPSRSQLMTGRYEIHTGLQHRTIDMMQPLCLPIDETILPQKLKDRGYA 116
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH-NMQV 184
TH +GKWHLGF+R+ P RGFD+ G++Q + DYY H+ + G D R N +
Sbjct: 117 THMVGKWHLGFYRQECLPNNRGFDTFMGFYQAMGDYYYHNVSTG--KFNGWDFRRDNDVI 174
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE--EVAKF 242
+ G YST ++ + A ++I++HN P+FL+L+ A+H P Q P ++
Sbjct: 175 AERYAGQYSTHVFADEARDIISKHNPDVPLFLFLSFQAIHF-----PLQVPSRYADIYNT 229
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
L + +RRTYAGMV+ +DE+VG V+A L+ G+ ENS+++F DNG GIHS G N
Sbjct: 230 LIPNSADRRTYAGMVTCMDEAVGGVVATLKHSGLWENSVLVFSTDNG----GIHS-LGGN 284
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEI-NDT 359
PLRG K++ ++GG+RGVA + SP L + ++ EL H++DW PTL AG + +
Sbjct: 285 WPLRGGKASLYEGGVRGVAFVTSPLLPSRVRGTINKELLHMTDWFPTLVRLAGGHLMGNK 344
Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDN--------VDNPQK--------YYAALRVDDL 403
LDG NQWD +++ + R+EIL I+ VD+P AA+RV D
Sbjct: 345 RLDGYNQWDTISQETPSLRNEILLTINPLKRKPNHFVDDPYADTDSFDILMRAAIRVGDW 404
Query: 404 KYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
K + G NG W + + P+E +S
Sbjct: 405 KLITGPTGNG---SWVTPAEFGFNSLHPEERTHS 435
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/61 (36%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD-KRADPARWNNIWVP 563
L+N+ DP E+++ + + D++ L D++ Y T VP N P + RADPA W P
Sbjct: 438 LYNVIADPNERHDYSRDRPDIVGALLDRIGHYYKTSVPVLNPPQEVTRADPALHGGAWSP 497
Query: 564 W 564
W
Sbjct: 498 W 498
>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
Length = 597
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 67 GTAVTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 242 QRVQQILASHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+++G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQAFSNLGPNRWHNERIT 484
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 545 NGGVWGPWY 553
>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
Length = 570
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 196/550 (35%), Positives = 284/550 (51%), Gaps = 90/550 (16%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 46 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 104
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 105 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 164
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + +++A H+
Sbjct: 165 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHS 222
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 223 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 277
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 278 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 333
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 334 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 393
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 394 YNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 439
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ R+
Sbjct: 440 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 460
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +L Y T +P + RA P
Sbjct: 461 ---------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPRAHP 511
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 512 DFNGGAWGPW 521
>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
Length = 598
Score = 319 bits (818), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 67 GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+++G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 545 NGGVWGPWY 553
>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
Length = 470
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 173/400 (43%), Positives = 238/400 (59%), Gaps = 30/400 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+II ILADD GW+D+ +H + I TPN+D LA G+ L +YVQ +C+PSR LMTG+Y
Sbjct: 1 PNIIFILADDYGWDDIGYH-NHFIHTPNLDRLASEGVKLENYYVQPVCSPSREQLMTGRY 59
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
IH G+QHGVI P GLPL E LPQ LK+ GY T+ +GKWHLGF ++ YTP +RGFD
Sbjct: 60 QIHYGLQHGVIRNDRPHGLPLDEVTLPQRLKDNGYRTYMVGKWHLGFCKKEYTPLYRGFD 119
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
YG+ G +DY+ H ++ +GLD+R + G YST L+ A ++I +H+
Sbjct: 120 KFYGFLTGSEDYWTHR---RYKGVRGLDLRDQDEPVLDENGTYSTHLFARKATDMILKHD 176
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+++PMFLYL AVH P Q P++ + +++ I+ R YA V+ +DE+VGNV
Sbjct: 177 QNQPMFLYLPFQAVHG-----PLQVPEKYLQEYMHINFTVDRIYAAQVTAMDEAVGNVTN 231
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+ G+ EN++++F DNGA + GSN PLRG K+T W+GG+RGV + S LK
Sbjct: 232 ALKNSGLWENTVLIFSTDNGA-----RRHTGSNWPLRGWKNTLWEGGVRGVGFVNSNLLK 286
Query: 330 QTQKVSSELFHISDWLPTLCAAA-GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ K S L HISDW PTL + G LDG + WD ++KG + R E+LHNID
Sbjct: 287 RKGKTSDALIHISDWFPTLLRISDGSTTGTKPLDGFDVWDAISKGKTSPRKELLHNIDPF 346
Query: 389 DNPQ---------------KYYAALRVDDLKYVAGTDNNG 413
+ Q Y+ALR D K + G G
Sbjct: 347 YDDQTGSYSPLSYDNIFNTSSYSALRSGDWKLLTGYQGAG 386
>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
Length = 614
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 67 GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+++G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 545 NGGVWGPWY 553
>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
Length = 571
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 197/549 (35%), Positives = 285/549 (51%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 41 GSTGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 99
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 100 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 159
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 160 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 215
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 216 QRVQQILASHDPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLS 270
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+++G NSI+++ +DNG GSN PLRG K T W+GG+R
Sbjct: 271 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIR 326
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 327 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 386
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 387 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 408
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 409 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 458
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 459 LSTGKSIWLFNITADPYERVDLSGRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 518
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 519 NGGVWGPWY 527
>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 545
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 190/458 (41%), Positives = 259/458 (56%), Gaps = 45/458 (9%)
Query: 20 AFLNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCT 78
F + P++P HI+ ILADD G+ND+ + + + TPN+D LA G+ L+ +YVQ +CT
Sbjct: 48 GFQPSRNPRRPPHIVFILADDYGFNDIGYRNPA-MRTPNLDYLAAEGIKLDNYYVQPICT 106
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
PSR+ LM+GKY IH G+QH +I +P LPL LPQ LKEAGYATH GKWHLGF++
Sbjct: 107 PSRAQLMSGKYQIHTGLQHSIIWPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYK 166
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPY------QGLDMRHNMQVDNKTIGIY 192
+ PT RGFDS G G D++ H+ + PY +GLD R +Q N GIY
Sbjct: 167 KECWPTNRGFDSFLGILLGKGDHFLHTEEGGGGPYPSTWPWEGLDFRDGLQSTNAYSGIY 226
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERR 251
ST + E N+I +H+K KP+FLY++ AVH P Q P+ + F I D +RR
Sbjct: 227 STHVIAERVENIIEKHDKDKPLFLYVSFQAVHT-----PLQVPESYLQPFESSIQDEKRR 281
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
YAGM +DE+VGN+ L+K G+ ++++++F +DNG G SN PLRG K+T
Sbjct: 282 IYAGMTYCMDEAVGNITKKLKKQGLWDDTVLVFSSDNG----GNIDQGASNWPLRGSKTT 337
Query: 312 PWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDTSLDGVNQWD 368
W+GG+R V + SP L + K VS EL ISDW PTL AG ++ T LDG N W+
Sbjct: 338 LWEGGVRAVGFVTSPLLSERMKGTVSRELIDISDWYPTLIEGVAGWTLSGTKLDGYNIWE 397
Query: 369 VLTKGAKTKRSEILHNIDNVDNP-----------------QKYYAALRVDDLKYVAG--T 409
L G + R E+LHNID + P + YAALR D K V G +
Sbjct: 398 TLRSGKPSARVELLHNIDPLITPPSTWPNESIAAAHNSFSTRTYAALRYKDWKIVTGYXS 457
Query: 410 DNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALK 447
NNG WY ++ + E+L K+ FN +
Sbjct: 458 INNG----WYSPAESSKQSVA-SEILPGKSVWLFNITR 490
Score = 38.5 bits (88), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E ++L+ + ++ L ++L Y+S P D +A+PA IW W
Sbjct: 485 LFNITRDPREFHDLSNQEPAIVNFLLERLESYQSGASPVLYPDIDTKANPAGHGGIWDSW 544
>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
Length = 570
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 67 GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+++G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 545 NGGVWGPWY 553
>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
Length = 597
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+T +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 67 GSTGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 185
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 241
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A H+ +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 242 QRVQQILASHDPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLS 296
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+++G NSI+++ +DNG G GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPMAGGSNWPLRGSKGTYWEGGIR 352
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
RVD L + ++ W +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
GI A+++ ++++ SD + + F+ + N + N+
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQAFSNLGPNRWHNERIT 484
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
++ LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSGRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544
Query: 557 WNNIWVPWY 565
+W PWY
Sbjct: 545 NGGVWGPWY 553
>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
Length = 476
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 175/400 (43%), Positives = 235/400 (58%), Gaps = 28/400 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K P+I+ I++DD GW+D+ +HGS +I TP +D LAYNG+ L +YVQ +C+P+RS M+G
Sbjct: 6 KPPNILFIMSDDYGWHDIGYHGS-KIRTPVLDDLAYNGIRLENYYVQPICSPTRSQFMSG 64
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
Y IH G+QH VI + GLPL + ++EAGYATH GKWHLG+++E Y P RG
Sbjct: 65 VYQIHTGLQHNVIWPAQANGLPLEFPTIADKMREAGYATHMAGKWHLGYYKEEYLPHNRG 124
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FD++YGY G +DYYD K+ PY G D R N + + YST LY ++
Sbjct: 125 FDTYYGYLNGCEDYYD---KSYCHPYCGYDFRLNDDI-QWNLTDYSTYLYVSRVNEILLN 180
Query: 208 H---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
H + KP+FLYL +VH EP + P E K+ I D RRTYAGMV+ +DE+V
Sbjct: 181 HKIYSPDKPLFLYLPLQSVH-----EPLEVPKEYSDKYSHIKDNNRRTYAGMVAAMDEAV 235
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G + +K+ M + +++ F DNG IH G+N PLRG K + W+GGMRGV +
Sbjct: 236 GKIRDLFKKYEMWDETVLAFSTDNGGQ---IHRG-GNNWPLRGWKISLWEGGMRGVGFVH 291
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHN 384
SP L + S L H+SDWLPTL AG LDG N WD+LTKG ++ R EILHN
Sbjct: 292 SPLLPNSGGKSKGLMHVSDWLPTLVHVAGGNTTGMKLDGYNVWDMLTKGTESPRKEILHN 351
Query: 385 IDNVDNPQ-----------KYYAALRVDDLKYVAGTDNNG 413
ID + P+ + AA+R D K + G NG
Sbjct: 352 IDPLMEPKGHARPNSTFDTRVRAAIRYGDWKLITGDPGNG 391
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFN+ DP E N+L+ ++ QL ++L Y +T VP P + ++PA + PW
Sbjct: 416 LFNVDADPNETNDLSAVYPQIVDQLLERLCQYNATSVPVRFPPLEPESNPAFHGGAFGPW 475
>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 318 bits (814), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 168/388 (43%), Positives = 236/388 (60%), Gaps = 18/388 (4%)
Query: 21 FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
FL + K PHI+ I+ADD GWNDV F + + TPNID LA++G++LN YV +CTPS
Sbjct: 16 FLPYVSAKPPHILFIVADDYGWNDVGFR-NPDVLTPNIDKLAHSGMVLNSSYVMPVCTPS 74
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R++ MTG Y G+QH IL + PL LPQ LKE GYATHAIGKWHLGF +
Sbjct: 75 RNSFMTGHYAFKSGLQHLAILPKQAACAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWE 134
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
TPT+RGFD+ +G++ G +DYY S G D R N N T G YST LY
Sbjct: 135 CTPTYRGFDTFFGFYNGQEDYYTLSVAG------GKDFRDNRTPVNAT-GEYSTFLYARR 187
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A ++I EH+ SKPM++YL +VH EP Q P + + + RRTY GMV+ +
Sbjct: 188 AESIIKEHDASKPMYMYLPFQSVH-----EPLQVPQKYSDMYSKVHTESRRTYLGMVTAM 242
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE++GN+ AL+ GM +++++F ADNG G + G+N+PLRG K+T ++GG R
Sbjct: 243 DEAIGNITKALKTKGMFNDTLIIFTADNG----GWITFAGNNYPLRGGKATVFEGGTRAT 298
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
A + ++++ V + H DW+PT+ AAG D +LDG++QWD L ++KR+E
Sbjct: 299 AFVSGAGIQKSNYVYPGMIHAVDWMPTVLRAAG-GTPDPTLDGIDQWDSLRTAGESKRTE 357
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAG 408
++NID++ +AA+R+ D K V G
Sbjct: 358 FIYNIDDLRPEICGHAAIRMGDYKLVDG 385
Score = 42.7 bits (99), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 31/62 (50%)
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
K LFN+ DDP E +L++ D++K+L+ +L Y +VP + P + W
Sbjct: 418 KVMLFNLKDDPNEHVDLSKKHPDIVKKLKSRLEYYHKQIVPANFPKNSASSAPTNYGGFW 477
Query: 562 VP 563
P
Sbjct: 478 TP 479
>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
Length = 515
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 173/376 (46%), Positives = 229/376 (60%), Gaps = 19/376 (5%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
+TT+ +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 69 STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF R+
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKECM 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDYDNGIYSTQMYT 243
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN +KP+FLY A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYTAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
V + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414
Query: 378 RSEILHNIDNVDNPQK 393
R +ILHNID QK
Sbjct: 415 RVDILHNIDPYTPRQK 430
>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
Length = 597
Score = 317 bits (811), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 198/554 (35%), Positives = 282/554 (50%), Gaps = 95/554 (17%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P+ PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+T
Sbjct: 75 PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 133
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LPL + LPQ L+E GY+TH +GKWHLGF+R+ PT R
Sbjct: 134 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRR 193
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GFD+ G G DYY + +C G D+ V G YST LY + +++
Sbjct: 194 GFDTFLGSLTGNVDYYTYDNCDGPG--VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHIL 251
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V
Sbjct: 252 ASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 306
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL++HG NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + S
Sbjct: 307 NITWALKRHGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 362
Query: 326 PWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ ++ S L HI+DW PTL A A G LDG + W +++G + R+EILHN
Sbjct: 363 PLLKRKRRTSRALVHITDWYPTLVALAGGTASAADGLDGYDVWPAISEGRASPRTEILHN 422
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
ID + N ++ AA+RV + K + G D YGD +
Sbjct: 423 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 468
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P + L + G +N E++ R+
Sbjct: 469 PPQTLAAFPGSWWNL-----------------------------ERMASARQAV------ 493
Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
LFNI+ DP E+ +LA + D +L Y T +P +
Sbjct: 494 -------------WLFNISADPYEREDLAGQRPDXA-----RLVDYNRTAIPVRYPAENP 535
Query: 551 RADPARWNNIWVPW 564
RA P W PW
Sbjct: 536 RAHPDFNGGAWGPW 549
>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 317 bits (811), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 171/409 (41%), Positives = 243/409 (59%), Gaps = 22/409 (5%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
+TC++ + FL + K PHI+ I+ADDLGWNDV F + + TPNID LA +G+ILN
Sbjct: 8 ITCSIFVH--FLLCVSAKPPHILFIVADDLGWNDVGFR-NPDVLTPNIDKLARSGMILNS 64
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
YV +CTPSR++ MTG Y G+QH I + PL LPQ LKE GYATHAIG
Sbjct: 65 SYVMPVCTPSRNSFMTGHYAFKSGLQHLAINPQQATCAPLNYTFLPQKLKELGYATHAIG 124
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
KWHLGF + TPT+RGFD+ +G++ G +DYY S G D R N N T G
Sbjct: 125 KWHLGFCKWECTPTYRGFDTFFGFYNGQEDYYTLSVAG------GKDFRDNKVPVNAT-G 177
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
YST LY+ A ++I EH+ SKP+++YL +VHA P Q P+ + ++ R
Sbjct: 178 EYSTFLYSRRAESIIKEHDASKPIYMYLPFQSVHA-----PLQVPNRYTDMYTNVHTKSR 232
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
RTY GMV+ +DE++GN+ AL+ G+ +++++F DNG G G+N+PLRG K+
Sbjct: 233 RTYMGMVTAMDEAIGNITKALKTKGIFNDTLIIFTTDNG----GAVKFAGNNYPLRGGKA 288
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
T ++GG R A + +++ S L H DW+PT+ +AAG D+ LDG++QWD L
Sbjct: 289 TLFEGGTRATAFVTGAGIQKFNYEYSGLIHAVDWMPTVLSAAG-GTADSKLDGIDQWDSL 347
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
++KR+E ++NID+ +AA+R+ D K + G G WY
Sbjct: 348 RTARESKRNEFIYNIDDFRPEIGGHAAIRIGDYKLIDGYP--GLHQGWY 394
Score = 42.0 bits (97), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 19/62 (30%), Positives = 31/62 (50%)
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
K LFN+ DDP E +L+ D++K+L+ +L Y + +VP + P + W
Sbjct: 418 KMMLFNLQDDPNEHVDLSRKHPDIVKKLKSRLEYYHNQIVPANFPKNSASSAPTSYGGFW 477
Query: 562 VP 563
P
Sbjct: 478 SP 479
>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
Length = 572
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 171/403 (42%), Positives = 243/403 (60%), Gaps = 28/403 (6%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P++PHII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+T
Sbjct: 45 PQRPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 103
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT R
Sbjct: 104 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 163
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GFD+ G G DYY + +C G D+ V G YST LY + A +++
Sbjct: 164 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHIL 221
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
A HN +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V
Sbjct: 222 ASHNPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 276
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ I S
Sbjct: 277 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFIHS 332
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P LK+ + S L HI+DW PTL AG ++ LDG + W +++G + R+EILHN
Sbjct: 333 PLLKRKRWTSHALVHITDWYPTLVGLAGGTVSAADGLDGYDVWPAVSEGRASPRTEILHN 392
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNG 413
ID + N ++ AA+RV + K + G +G
Sbjct: 393 IDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTGDPGSG 435
Score = 43.1 bits (100), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P W PW
Sbjct: 465 LFNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENPRAHPDFNGGAWGPW 524
>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
Length = 661
Score = 316 bits (809), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 180/412 (43%), Positives = 241/412 (58%), Gaps = 35/412 (8%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
T A +PH+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS
Sbjct: 132 TAATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 190
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+TGKY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+ P
Sbjct: 191 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 250
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYTE 199
T RGFD+ +G G DYY H K G D+ N DN GIYST +YT+
Sbjct: 251 TKRGFDTFFGSLLGSGDYYTH-YKCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYTQ 306
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
++A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S
Sbjct: 307 RVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPVRYFEHYRSIVNINRRRYAAMLSC 361
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
LDE++ NV AL+ +G NSI+++ +DNG G + GSN PLRG K T W+GG+R
Sbjct: 362 LDEAIHNVTLALKAYGFYNNSIIIYSSDNG----GQPAAGGSNWPLRGSKGTYWEGGIRA 417
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
+ + SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++ R
Sbjct: 418 IGFVHSPLLKNKGTVCRELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSPR 477
Query: 379 SEILHNIDNVDNPQK--------------YYAALRVDDLKYVAGTDNNGQSD 416
+ILHNID + K +A+RV K + G N G SD
Sbjct: 478 VDILHNIDPIYTKAKNGSLAAGYGIWNTAIQSAIRVQHWKLLTG--NPGYSD 527
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 35/61 (57%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P +W PW
Sbjct: 557 LFNITADPYERVDLSHRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWGPW 616
Query: 565 Y 565
Y
Sbjct: 617 Y 617
>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
Length = 684
Score = 315 bits (807), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 172/392 (43%), Positives = 236/392 (60%), Gaps = 19/392 (4%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHI+ I+ADDLGWNDV FH + + TPNID LA GLILNQ Y+Q LC+PSR ALMTG
Sbjct: 224 KSPHILFIVADDLGWNDVGFHNPA-MKTPNIDKLAREGLILNQTYLQPLCSPSRHALMTG 282
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YP H G+QH VIL +P PL K LPQ LK+ GYATH +GKWHLGF TPT+RG
Sbjct: 283 YYPYHAGLQHLVILPWQPVCSPLKMKFLPQRLKDIGYATHMVGKWHLGFCSWNCTPTYRG 342
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS +GY+ D+Y H+ Y LD R N + G YST + A ++I +
Sbjct: 343 FDSFFGYYNAQGDHYSHTW------YNYLDYRDNEKPVKNLNGTYSTFTFVSRAQDIIKK 396
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
HN S+P+FLY+A VH +P Q P + + +I RR ++GMVS LDE++GN+
Sbjct: 397 HNSSQPLFLYMAFQNVH-----DPIQVPKQYEDMYPNIKTRGRRQFSGMVSALDEAIGNI 451
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
LR+ GM ++++++F +DNG G G+N+PLRG K T ++GG R V+ +
Sbjct: 452 TDQLRESGMYDDTLIIFTSDNG----GWPKYYGNNYPLRGGKFTVYEGGTRVVSFVHGAG 507
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
L++T + + H DW TL AAG + D DGV+QW + R+E+++N D+
Sbjct: 508 LQKTGEAYESIMHAVDWSDTLVEAAGGKP-DLMGDGVSQWSSIANNFPGPRTELVYNFDS 566
Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
P++ +AA+RV D K + G G +WY
Sbjct: 567 DLAPEEGHAAIRVGDYKLIDGYP--GLYPDWY 596
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 41/61 (67%), Gaps = 3/61 (4%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF--DKRADPARWNNIWV 562
LFN+ DDP E N+L++S+ D++ ++ +L + VPP N P D +++P++++++W
Sbjct: 622 LFNLKDDPTEHNDLSKSRPDIVTEMSQRLRLLTQNAVPP-NYPLVPDPKSNPSKFDDVWS 680
Query: 563 P 563
P
Sbjct: 681 P 681
>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
Length = 564
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 198/542 (36%), Positives = 285/542 (52%), Gaps = 70/542 (12%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
H+I ILADD G+ DV +HGS +I TP +D LA G+ L +YVQ +CTPSRS +TG+Y
Sbjct: 44 HLIFILADDQGFRDVGYHGS-EIRTPTLDRLAAEGVKLENYYVQPMCTPSRSQFITGRYQ 102
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH VI +P LPL LPQ LKEAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 103 IHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMPTQRGFDT 162
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINVIA 206
+G G DYY H K G D+ N DN GIYST +YT+ ++A
Sbjct: 163 FFGSLLGSGDYYTHY-KCDSPRMCGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQILA 218
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
HN KP+FLY+A+ AVH+ P QAP ++ I++ RR YA M++ LDE++ N
Sbjct: 219 SHNPRKPIFLYIAYQAVHS-----PLQAPGMYYERYRSINNINRRRYAAMLACLDEAINN 273
Query: 267 VIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
V AL+++G NS++++ +DNG P+ G G+N PLRG K T W+GG+R V + S
Sbjct: 274 VTLALKRYGYYNNSVIIYSSDNGGQPTAG-----GNNWPLRGSKGTYWEGGIRAVGFVHS 328
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHN 384
P LK V EL HI+DW PTL A +I+ DT LDG + W+ ++ +++RS
Sbjct: 329 PLLKNKGCVCKELIHITDWFPTLITLAEGQIDEDTLLDGYDVWETIS---ESRRSP---- 381
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
R+D L + ++ W + GI
Sbjct: 382 --------------RMDILHNIDPIYTKAKNGSWA-----------------AGFGIWNT 410
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR-CNYDNKGAHCNSTVKP 503
A+++ +++ SD + + F+ V + N+ +
Sbjct: 411 AIQSAIRVNHWKLLTGNPGYSD----------WVPPQSFSNVGPSRWHNERVSWTAGKTV 460
Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFNIT DP E+ +L+ D++KQL +L+ + T VP P D R++P +W P
Sbjct: 461 WLFNITADPYERVDLSSKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPNFNGGVWGP 520
Query: 564 WY 565
W+
Sbjct: 521 WF 522
>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 568
Score = 312 bits (799), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 172/404 (42%), Positives = 240/404 (59%), Gaps = 29/404 (7%)
Query: 27 PKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
PK+P HII IL DD G+ND+ +H S +I +P +D LA G+ L +YVQ LCTPSRS L+
Sbjct: 40 PKQPPHIIFILTDDQGFNDIGYH-SGEIRSPTLDKLASEGVRLENYYVQPLCTPSRSQLI 98
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH G+QH +I +P LPL LPQ L+E GY+TH +GKWHLGF+R+ PT
Sbjct: 99 TGRYQIHTGLQHSIIRPRQPNCLPLDVVTLPQRLQEIGYSTHMVGKWHLGFYRKDCLPTR 158
Query: 146 RGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGF +++G G DYY + SC + G D+ V G YST LYT+ +
Sbjct: 159 RGFHTYFGSLTGSVDYYTYGSCDG--KSLCGFDLHEGESVAWGRAGKYSTHLYTQRVRKI 216
Query: 205 IAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+A H+ S+P+F++L+ AVH P + P E + + +I + RR YA MVS +DE+
Sbjct: 217 LATHDPTSQPLFIFLSLQAVHT-----PLKPPKEYIYPYRNIGNVPRRKYAAMVSIVDEA 271
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
V N+ ALRK+G NS+V+F DNGA GSN PLRG K T W+GG+RGV +
Sbjct: 272 VRNITYALRKYGFYRNSVVIFSTDNGAQPL----TGGSNWPLRGCKGTYWEGGIRGVGFV 327
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEIL 382
SP ++ +++S +L HI+DW PTL AG ++ LDG + W +++G ++ R EIL
Sbjct: 328 HSPLIRHRRRISRDLIHITDWYPTLVGLAGGNVSQHQGLDGFDVWPTISEGKESPRLEIL 387
Query: 383 HNIDNVDNPQK-------------YYAALRVDDLKYVAGTDNNG 413
HNID ++ + AA+RV D K + G NG
Sbjct: 388 HNIDPLNRRSRGSLKDGYGLWDTTVQAAIRVGDWKLLTGDPGNG 431
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 43/68 (63%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFN+T DPCE+++LA + D++++L +LA + T +P P D RA+P+ W PW
Sbjct: 462 LFNVTGDPCERHDLAVHRPDVVRELLARLAFHNRTAIPVRYPPDDTRANPSANGGAWRPW 521
Query: 565 YDELDKQK 572
E D+++
Sbjct: 522 VGEDDEEE 529
>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
Length = 613
Score = 311 bits (798), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 168/399 (42%), Positives = 240/399 (60%), Gaps = 28/399 (7%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 93 HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 151
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 152 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 211
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + A +++A H+
Sbjct: 212 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 269
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 270 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 324
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 325 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 380
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG ++ LDG + W +++G + R+EILHNID +
Sbjct: 381 RKRRTSRALVHITDWYPTLVGLAGGVVSAVDGLDGYDVWPAISEGRASPRTEILHNIDPL 440
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNG 413
N ++ AA+RV + K + G NG
Sbjct: 441 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTGDPGNG 479
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P W PW
Sbjct: 505 LFNISADPYERKDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENPRAHPDFNGGAWGPW 564
>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
Length = 487
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 177/413 (42%), Positives = 244/413 (59%), Gaps = 30/413 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPHI+ I+ADDLGWNDV +H + + TP +D LA+ G+ILNQ YV +CTPSR+A MTG
Sbjct: 28 KPHILFIVADDLGWNDVGWH-NPDVKTPVLDKLAHEGVILNQSYVNYVCTPSRTAFMTGY 86
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP H G QH V L + G+P LP+ LK+ GYATH +GKWHLGF YTPT+RGF
Sbjct: 87 YPYHAGSQHLVFLPQQAQGIPYNFTFLPEKLKDLGYATHMVGKWHLGFCNWKYTPTYRGF 146
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
DS YGY+ +DYY H GLD+R + +V N G Y T +T+ +++I +H
Sbjct: 147 DSFYGYYNADEDYYTHVVAG------GLDLRDDKEVVNTKNGQYGTYFFTDRMVDIIEKH 200
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
P+FLYL VH EP + P+ ++++ + RR GMVS LDE+VGNV
Sbjct: 201 PADTPLFLYLPFQNVH-----EPLEVPERFENIYMNVQNENRRKLLGMVSALDEAVGNVT 255
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
A++ G+ +N++V+F DNG G G+N+PLRG K T W+GG RGVA + L
Sbjct: 256 MAMKNAGLWDNTLVIFTTDNG----GWIIASGNNYPLRGGKVTLWEGGTRGVAFVHGKML 311
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
++T ++E+ H DW PT+ AAA D ++DGV+Q D + G + R+E ++NID+V
Sbjct: 312 QKTGYTNNEMIHAVDWFPTILAAA-GGTADDAMDGVSQLDTIMSGQPSPRTEFVYNIDDV 370
Query: 389 DNPQKYYA-------ALRVDDLKYVAGTDNNGQSDEWYGD---TDNEIDKYSP 431
P K A LRV D K + G G+ D W TD+++D P
Sbjct: 371 -FPTKQGAIRIICTLCLRVGDYKLIEGV--AGKPDGWIHPDNLTDSQLDSGDP 420
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/59 (45%), Positives = 38/59 (64%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFN+ DDP E +NLA+S D LK+++ KLA Y+ ++VP N P D ++ P W W P
Sbjct: 426 LFNLKDDPTEHHNLADSMPDKLKEMQAKLAEYRKSLVPAINPPPDPKSFPKNWGGAWSP 484
>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
Length = 473
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 164/344 (47%), Positives = 223/344 (64%), Gaps = 15/344 (4%)
Query: 48 HGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWG 107
HGS I TP++DALA G++L+ +YVQ LCTPSRS L+TG+Y IH+G+QH +I+ +P
Sbjct: 3 HGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNC 61
Query: 108 LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK 167
+PL EKLLPQ LK+AG +TH +GKWHLG +R+ PT RGFD+++GY G +DYY H
Sbjct: 62 VPLDEKLLPQLLKDAGSSTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEAC 121
Query: 168 ATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAV 223
A E G LD+R + + IYST+++T+ A +IA H KP+FLYLA +V
Sbjct: 122 APIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSV 181
Query: 224 HAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
H +P Q P+E + + I D RR YAGMVS LDE+VGNV AL+ G+ N++++
Sbjct: 182 H-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLI 236
Query: 284 FMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISD 343
F DNG G + G+N PLRG K T W+GG+RG + SP LKQ S EL HI+D
Sbjct: 237 FSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITD 292
Query: 344 WLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
WLPTL AG + T LDG + W+ +++G+ + R E+L NID
Sbjct: 293 WLPTLVNLAGGSTHGTKPLDGFDVWETISEGSPSPRVELLLNID 336
Score = 40.8 bits (94), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 414 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 472
>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
carolinensis]
Length = 580
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 191/538 (35%), Positives = 282/538 (52%), Gaps = 65/538 (12%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
H+I IL DD G++D +HGS I TP +D LA G+ L +Y++ +CT SRS L+TG+Y
Sbjct: 59 HLIFILTDDQGFHDXGYHGSD-IXTPTLDRLAAEGVKLENYYIRPICTLSRSQLITGRYQ 117
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LP + LPQ L+EAGY+TH +GKWHLGF+R+ PT RGFD+
Sbjct: 118 IHTGLQHSIIRPQQPNCLPFNQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 177
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V + G YST LY + ++A HN
Sbjct: 178 FLGSLTGNVDYYTYDNCDG--PGVCGYDLHDGENVAWEQSGKYSTFLYAQRVNKILAAHN 235
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+F+Y+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 236 PKEPIFIYIAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITW 290
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+K+G +NS+++F DNG +F + GSN PLRG K T W+GG+RG+ + SP +K
Sbjct: 291 ALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIK 346
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+++S L HI+DW PTL A +++ LDG N W +++G ++ R+EILHNID +
Sbjct: 347 HKRRISRALIHITDWYPTLVTLAQGNVSEADDLDGYNVWPAISEGKESPRTEILHNIDPL 406
Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
N K YG L S GI A++
Sbjct: 407 YNHAK-------------------------YGS-------------LESGFGIWNTAVQA 428
Query: 449 KLQIK--QKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLF 506
+++ + DP + +T+ T F N + +V LF
Sbjct: 429 SVRVGDWKLLTGDPGYSDWIPPQTLAT---------FPGSWWNLERMTGGMRKSV--WLF 477
Query: 507 NITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
NIT DP E+ +L+E + D+++ L +L Y T +P + RA P W PW
Sbjct: 478 NITADPYERYDLSEQRPDVVRALLLRLVRYNQTAIPVRYPAENPRAHPDFNGGAWGPW 535
>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
Length = 494
Score = 311 bits (796), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 158/360 (43%), Positives = 224/360 (62%), Gaps = 12/360 (3%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPHI+ I+ADDLGWNDV +H + + TP +D LA+ G+ILNQ YV +CTPSR+A MTG
Sbjct: 22 KPHIVFIVADDLGWNDVGWH-NPDVRTPVLDQLAHEGVILNQSYVNYVCTPSRTAFMTGY 80
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+P H+G QH V +P + LP+ LK GYATH +GKWHLGF +TPTFRGF
Sbjct: 81 FPYHVGSQHLVFRPDQPSAILSNFTFLPEKLKSLGYATHMVGKWHLGFCNWKFTPTFRGF 140
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
DS+YGY+ G +DY+ H ++ G+D N V G YS L+++ A +++ +H
Sbjct: 141 DSYYGYYSGAEDYFTH-FRSIRNGTGGIDFHDNKDVVTDQNGTYSAYLFSQRAADIVNKH 199
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+ + P+FLYL VHA P + P + ++ D RR GMVS LDE+VGNV
Sbjct: 200 DPNTPLFLYLPFQNVHA-----PLEVPKRFEDMYANVQDENRRKLLGMVSALDEAVGNVT 254
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
A++K G+ +N++++F DNG G+ G+N+PLRG K+T W+GG RGVA L
Sbjct: 255 MAMKKTGLWDNTLLIFSTDNG----GMVIQSGNNYPLRGGKTTLWEGGTRGVAFAHGKML 310
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
++T ++E+ H DW PTL AAAG D +DGV+Q D + G + R+E ++NID++
Sbjct: 311 QKTGYTNNEMIHAVDWFPTLLAAAG-GTADPGMDGVSQLDTILSGKPSPRTEFVYNIDDI 369
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
+ ++ N G N T LFN+ DDP E +NLA+S D LK+++ KL Y+ T+VP N
Sbjct: 417 MTASHPNSGDPINGT---WLFNLKDDPTEHHNLADSMPDKLKEMQAKLDGYRKTLVPAIN 473
Query: 546 KPFDKRADPARWNNIWVP 563
D ++DP W W P
Sbjct: 474 PKSDPKSDPKNWGGAWSP 491
>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
Length = 540
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 182/425 (42%), Positives = 251/425 (59%), Gaps = 35/425 (8%)
Query: 13 CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAYNGLILNQH 71
C + L+ P PHI+ IL DDLGW+DV +H S + TPNID LA G+ L +
Sbjct: 20 CCQWPGCSLLSMAGP--PHIMFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSY 77
Query: 72 YVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
Y Q +CTPSR ALMTGKYPIH+GMQH VI PWG+P +PQ L+ GY T IGK
Sbjct: 78 YSQPMCTPSRGALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGK 137
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
WHLGFF YTP RGFDS G++ G QD++ HS K F LD R + + N+ G
Sbjct: 138 WHLGFFDWDYTPLRRGFDSFLGFFAGEQDHWRHS-KMGF-----LDFRRDEEPANEYGGQ 191
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
+STD++T+ AIN+ HN S+P+FL L++ AVH P QA +V K +SD +R+
Sbjct: 192 HSTDVFTQEAINIAMRHNASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQ 246
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKS 310
Y GM+ D S+G +I +++G+ N+++++ +DNGA P G G N PLRG KS
Sbjct: 247 NYLGMMGAADWSIGRLIDVYKRNGLWNNTLMIWASDNGAQPGKG----GGYNWPLRGYKS 302
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
+ ++GG+R A + L++ ++LFH++DW PTL AG E+ + +DGV+QW L
Sbjct: 303 SLFEGGVRVPAFVHGEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEV-EPDIDGVDQWPTL 361
Query: 371 TKGAKTKRSEILHNIDNVDNPQK---------YY--AALRVDDLKYVAGTDNNGQSDEWY 419
++G +KR EILHNID N ++ YY AALR +K V + G WY
Sbjct: 362 SEGKPSKREEILHNIDIPANQEEERMAPRGFNYYSGAALRRGHMKLVYKMGDAG----WY 417
Query: 420 GDTDN 424
+N
Sbjct: 418 QLPEN 422
>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 673
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 168/413 (40%), Positives = 245/413 (59%), Gaps = 30/413 (7%)
Query: 17 FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQAL 76
FND + + PHII I+ DD G+ND+ +H SS I TP +D LA G+ L +Y+Q +
Sbjct: 38 FND---RASGVQPPHIIFIMTDDQGFNDIGYH-SSDIKTPTLDKLAAKGVKLENYYIQPI 93
Query: 77 CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
CTPSRS +TG+Y IH G+QH +I +P LP + LPQ L++ GY+TH +GKWHLGF
Sbjct: 94 CTPSRSQFITGRYQIHTGLQHSIIRPRQPNCLPFDQVTLPQRLQQLGYSTHMVGKWHLGF 153
Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
+++ PT RGFD+++G G +YY +S E G D+ V G YST L
Sbjct: 154 YKKECLPTRRGFDTYFGSLTGSVNYYTYSSCDGPE-LCGFDLHEGESVAWDQGGKYSTHL 212
Query: 197 YTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
+T+ ++A H+ +S+P+F++L+ AVH+ P Q+P E + + + + RR YA
Sbjct: 213 FTQRVRKILARHDPQSQPLFIFLSFQAVHS-----PLQSPKEYICPYSGLENVARRKYAA 267
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MVS +DE+V N+ ALRK+G +NS+++F DNG G + GSN PLRG K T W+G
Sbjct: 268 MVSTVDEAVRNITYALRKYGYYQNSVLIFSTDNG----GQPLSGGSNWPLRGRKGTYWEG 323
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGA 374
G+RG+ + SP LK+ ++VS L HI+DW PTL A G E LDG N W+ +++G
Sbjct: 324 GIRGLGFVHSPLLKKRKRVSKALVHITDWYPTLVGLAGGNESMTDGLDGYNVWEAISEGK 383
Query: 375 KTKRSEILHNIDNVDNPQK--------------YYAALRVDDLKYVAGTDNNG 413
++ R EILHNID + N + A++R+ D K + G +G
Sbjct: 384 ESPRLEILHNIDPLFNHARSGSLQQGFGIWNTAVQASIRMGDWKLLTGNPGHG 436
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 32/72 (44%), Positives = 43/72 (59%), Gaps = 2/72 (2%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI++DP E+ +L+E + D++KQL +LA Y T VP D RADP WVPW
Sbjct: 466 LFNISEDPYERFDLSEQRPDVVKQLLARLAYYNRTAVPVRFPSEDPRADPHLNGGAWVPW 525
Query: 565 YDELDKQKAIET 576
DK + IE+
Sbjct: 526 V--ADKMEEIES 535
>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 507
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 168/401 (41%), Positives = 240/401 (59%), Gaps = 31/401 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+I+ ILADDLGW+DV +H S I TPNID LA G+ L +YV LCTP+R+ LMTG+
Sbjct: 28 RPNIVFILADDLGWHDVGYH-DSIIRTPNIDKLAAEGVKLENYYVTPLCTPTRAVLMTGR 86
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH GMQHGV++ EP LP E L+PQ LKE+GY TH +GKWHLGF++ TP RGF
Sbjct: 87 YQIHTGMQHGVLMAQEPRCLPTDEVLMPQKLKESGYTTHMVGKWHLGFYKWACTPNHRGF 146
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAINVIAE 207
D+ +G + DY++H+ D+R Q V + +G YST ++ E A +I +
Sbjct: 147 DTFFGMYLAGGDYFNHTRLCHGRRLAAWDLRDGDQVVAPEYVGKYSTIVFAEKAQEIIKK 206
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-DISDPERRTYAGMVSRLDESVGN 266
H+ + P+FLYL+ AVHA P Q P+ + + DI D RR YAGM + +DE++GN
Sbjct: 207 HDPTNPLFLYLSFQAVHA-----PLQVPERYINMYKDDIRDESRRIYAGMTTCMDEAIGN 261
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V L + G+L+N+I++F +DNG G SN+PLRG K T ++GG+R A ++SP
Sbjct: 262 VTRTLARRGLLDNTIIVFSSDNG----GAKLYGASNYPLRGQKGTLYEGGVRAPAFVYSP 317
Query: 327 WLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
L + + EL ++ DW PT AG ++ LDGVNQW ++ +KR+EI+H
Sbjct: 318 LLSNRVRGTATHELMYVGDWFPTFLHLAGGNLHGVKPLDGVNQWHTVSAAMPSKRTEIVH 377
Query: 384 NIDNV--------DNP--------QKYYAALRVDDLKYVAG 408
NI+ + D P + A++RV + K + G
Sbjct: 378 NIEPLRGKPPRVKDEPWLTNPYFDIRVMASIRVGEWKLITG 418
>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 570
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 170/405 (41%), Positives = 244/405 (60%), Gaps = 29/405 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
P+ PHII I+ DD G+ND+ +H SS I TP +D LA +G+ L +Y+Q +CTPSRS +
Sbjct: 44 VPQPPHIIFIMTDDQGFNDIGYH-SSDIRTPVLDKLAADGVKLENYYIQPICTPSRSQFI 102
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y IH G+QH +I +P LP + LPQ L+E GY+TH +GKWHLGF+++ PT
Sbjct: 103 TGRYQIHTGLQHSIIRPCQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTR 162
Query: 146 RGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFD+++G G +YY + C G D+ V G YST LYT+ +
Sbjct: 163 RGFDTYFGSLTGSVNYYTYDGCDGA--GLCGFDLHEGESVAWGQSGKYSTHLYTQRVRKI 220
Query: 205 IAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+A H+ +S+P+F++L+ AVH P Q PDE + +L + + RR YA MVS +DE+
Sbjct: 221 LATHDPQSQPLFIFLSFQAVHT-----PLQYPDEYIYPYLGLENVARRKYAAMVSAVDEA 275
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
V N+ ALRK+G +NS+++F DNG G + GSN PLRG K T W+GG+RG+ +
Sbjct: 276 VRNITYALRKYGYYQNSVIIFSTDNG----GQPLSGGSNWPLRGRKGTYWEGGIRGLGFV 331
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEIL 382
SP L++ ++VS L HI+DW PTL AG + + T +DG N W+ +++G ++ R EIL
Sbjct: 332 HSPLLRKKRRVSKALVHITDWYPTLVGLAGGDESLTEGVDGYNVWEAISEGKESPRLEIL 391
Query: 383 HNIDNVDNP------QKYY--------AALRVDDLKYVAGTDNNG 413
HNID + N QK Y A++R D K + G G
Sbjct: 392 HNIDPLYNHARTGSLQKGYGIWNTAIQASIRAGDWKLLTGDPGYG 436
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 35/60 (58%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI+ DP E+ +L+E + D++KQL +L Y T VP D RADP WVPW
Sbjct: 466 LFNISGDPYERYDLSEQRPDVVKQLLARLVYYNRTAVPVRYPSEDPRADPYLNGGAWVPW 525
>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
Length = 1542
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 169/398 (42%), Positives = 232/398 (58%), Gaps = 29/398 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHII I+ DD G+ DV +HGS +I TP +D LA G+ L +YVQ LC+PSRS LMTG+
Sbjct: 1026 QPHIIFIMVDDQGFRDVGYHGS-EIKTPTLDRLAAAGVKLENYYVQPLCSPSRSQLMTGR 1084
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LPL LPQ LK AGY+TH +GKWHLGF++ PT RGF
Sbjct: 1085 YQIHTGLQHSIIRPTQPNCLPLENITLPQKLKNAGYSTHMVGKWHLGFYKRACMPTQRGF 1144
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
D+ +G G DYY H K G D+ + + G+YST +YT+ A+N++A
Sbjct: 1145 DTFFGSLLGSGDYYSH-YKCDSPGLCGYDLHEGEEAAWEQDRGVYSTIMYTQKAVNILAS 1203
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
HN +P+FLYLA AVH+ P Q P + ++ I + RR YA MVS LDE+V N+
Sbjct: 1204 HNPKRPIFLYLAFQAVHS-----PLQVPARYLERYKSIPNLHRRKYAAMVSCLDEAVRNL 1258
Query: 268 IAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
AL+++G +N ++++ +DNG P G GSN PLRG K + W+GG+R V + SP
Sbjct: 1259 TLALKQYGYYDNMVMVYSSDNGGQPMAG-----GSNWPLRGSKGSYWEGGIRAVGFVHSP 1313
Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
L + S L HI+DW PTL G + +LDG + W+ +++G + R +ILHNI
Sbjct: 1314 LLVKKGTKSKALIHITDWYPTLVMLGEGTLDENQNLDGYDVWEAISEGRPSPRLDILHNI 1373
Query: 386 DNVDNPQK--------------YYAALRVDDLKYVAGT 409
D + K AALRV D K + G
Sbjct: 1374 DPIYTKAKNGSWKAGYGIWNTAVQAALRVGDWKLLTGV 1411
Score = 45.8 bits (107), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 33/61 (54%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L+ ++K++ KL Y VP P D R++P +W PW
Sbjct: 1447 LFNITADPYERVDLSRQYPHVVKKMLFKLLRYSKNAVPCRYPPKDYRSNPQYNGGVWGPW 1506
Query: 565 Y 565
Y
Sbjct: 1507 Y 1507
>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
Length = 584
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 165/394 (41%), Positives = 237/394 (60%), Gaps = 28/394 (7%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 55 HIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 113
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+E GY+TH +GKWHLGF+++ PT RGFD+
Sbjct: 114 IHTGLQHSIIRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDT 173
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V + G YST LY + A ++A H+
Sbjct: 174 FLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGENVAWEQSGQYSTLLYAQRASQILASHS 231
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 232 PHQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 286
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G +NS+++F DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 287 ALKRYGYYDNSVIIFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLLK 342
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG +++ LDG N W +++G + R EILHNID +
Sbjct: 343 RKRRTSRALVHITDWYPTLVTLAGGSVSEAEGLDGYNVWPAISEGQVSPRMEILHNIDPL 402
Query: 389 DNPQKY--------------YAALRVDDLKYVAG 408
N K+ A++RV + K + G
Sbjct: 403 YNHAKHGSLEGGFGIWNTAVQASIRVGEWKLLTG 436
Score = 43.1 bits (100), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 21/60 (35%), Positives = 31/60 (51%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI+ DP E+ +LA + D+++ L +L Y T +P + RA P W PW
Sbjct: 472 LFNISADPYEREDLAAQRPDVVRSLLARLVYYNRTAIPVRYPAENPRAHPDFNGGAWGPW 531
>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 648
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 178/422 (42%), Positives = 247/422 (58%), Gaps = 35/422 (8%)
Query: 20 AFLNTT--APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALC 77
+F+++T + K PHI+ LADDLGW+DV +H S I TPNID LA G+ L +YV +C
Sbjct: 16 SFVSSTVASTKPPHIVFFLADDLGWHDVGYH-DSIIRTPNIDKLAAEGVKLENYYVTPIC 74
Query: 78 TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
TP+R+ LMTG+Y IH MQHGV++ E LP E L+PQ LKE+GY+TH +GKWHLGF+
Sbjct: 75 TPTRAVLMTGRYQIHTTMQHGVLMAQEQRCLPTDEVLMPQKLKESGYSTHMVGKWHLGFY 134
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH-NMQVDNKTIGIYSTDL 196
+ TP RGFD+ +G++ +Y+ H+ K D+R + V + G YST L
Sbjct: 135 KWDCTPNHRGFDTFFGFYLAGGEYFTHTRKCHGHRLDAWDLRDGDKMVGPEYTGEYSTML 194
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYAG 255
Y A VI + + + PMFLY++ AVHA P + PD + DI D R+ YAG
Sbjct: 195 YARKAQEVIRKQDPNVPMFLYVSFQAVHA-----PLEVPDSYADAYGKDIYDQSRKLYAG 249
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGMKSTPWD 314
M S +DE++GNV L+ GM +N++V+F DN GA FG SN+PLRG K ++
Sbjct: 250 MTSCMDEAIGNVTQTLKDRGMWDNTVVVFSTDNGGATGFG-----ASNYPLRGQKCGHYE 304
Query: 315 GGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLT 371
GG+RG A + SP LK + + +EL ++ DWLPT AG + T LDGVNQW+ ++
Sbjct: 305 GGVRGPAFVSSPLLKPHVRGTKNTELMYVGDWLPTFVHLAGGTTHGTKPLDGVNQWEAIS 364
Query: 372 KGAKTKRSEILHNIDNVD-------------NPQ---KYYAALRVDDLKYVAGTDNNGQS 415
+G ++R EILHNID + NP + AA+RV K + G N
Sbjct: 365 EGLPSRRKEILHNIDPIRAKSPTLKTEPWMLNPYFDIRVMAAIRVGHWKLLTGQKGNANW 424
Query: 416 DE 417
E
Sbjct: 425 QE 426
>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 558
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 171/403 (42%), Positives = 241/403 (59%), Gaps = 31/403 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHII I+ DD G+ND+ +H ++ I TP +D LA G+ L +Y+Q +CTPSRS +TG+
Sbjct: 37 RPHIIFIMTDDQGFNDIGYH-NTDIHTPTLDRLAAAGVKLENYYIQPICTPSRSQFITGR 95
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LP + LPQ L+EAGYATH +GKWHLGF++ PT RGF
Sbjct: 96 YQIHTGLQHSIIRSRQPSCLPFGLRTLPQRLQEAGYATHMVGKWHLGFYKRDCLPTRRGF 155
Query: 149 DSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
++++G G DYY + SC G D+ +V G YST LYT+ ++A
Sbjct: 156 NTYFGSLTGSVDYYTYKSCDG--PKVCGFDLHDGERVAWGQGGRYSTHLYTQRVRKILAA 213
Query: 208 HN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ S+P+F++L+ AVH P Q+P E V + + + RR YAGMVS +DE+V N
Sbjct: 214 HDPSSQPLFIFLSFQAVHT-----PLQSPKEYVYPYRRMGNMFRRKYAGMVSAVDEAVRN 268
Query: 267 VIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
V ALRK+G +N+++ F DNG P FG GSN PLRG K T W+GG+RG+ + S
Sbjct: 269 VTYALRKYGYYKNTVIFFSTDNGGQPLFG-----GSNWPLRGRKGTYWEGGVRGIGFVHS 323
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
P L++ ++VS L HI+DW PTL AG ++ LDG + WD ++ G ++ R EILHN
Sbjct: 324 PLLRRRRRVSRALIHITDWYPTLMRLAGGNVSQAEGLDGFDMWDTISDGKESPRFEILHN 383
Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNG 413
ID + N ++ AA+RV D K + G G
Sbjct: 384 IDPLYNAARHGSLKNGYGIWNTAVQAAVRVGDWKLLTGNPGYG 426
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 39/69 (56%)
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
H + LFN+ DDP E+ +LAE + D++KQL +L+ Y T VP D RADP+
Sbjct: 447 HTEAKKSLWLFNVADDPYERFDLAERRPDVVKQLLARLSFYNRTAVPVRYPSEDPRADPS 506
Query: 556 RWNNIWVPW 564
R W PW
Sbjct: 507 RNGGAWGPW 515
>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 406
Score = 308 bits (788), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 161/376 (42%), Positives = 228/376 (60%), Gaps = 6/376 (1%)
Query: 21 FLNTTAP--KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCT 78
FL T+ + P++++ILADDLGW+DVS+HGS QIPTPNID LA +G+IL +YVQ L T
Sbjct: 18 FLEATSASSQPPNLVMILADDLGWHDVSYHGSDQIPTPNIDVLAMDGIILFHNYVQPLST 77
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQY---LKEAGYATHAIGKWHLG 135
P+R+AL+TG YPIH G Q I +P GL LLPQ L + + A WHLG
Sbjct: 78 PTRAALLTGLYPIHTGTQRLDIGSADPIGLSADFTLLPQLSVTLADNFTSLGARSGWHLG 137
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
F ++ + PT RGFD+ YG + G DY+ H + G ++ + + G Y T
Sbjct: 138 FCKDEFKPTKRGFDTFYGIYNGDSDYWTHFARDNNIDVSGHALKDEKRALVEEAGRYLTS 197
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
L A+ +I K+KP FLY A AVH G + QAP E ++KF ++D +R+ +AG
Sbjct: 198 LLANQAVQLIHNRPKNKPFFLYFAPTAVHCGGSNGSLQAPKEYISKFGYLADYDRQLFAG 257
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
++ LD+SVG ++ AL G L N+++ F DNG G +N N PLRG K T +G
Sbjct: 258 SLAELDKSVGLIVEALYVSGQLNNTVIAFSTDNGGAPVGFSANTSPNWPLRGTKGTVAEG 317
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGA 374
G+RG +WS L +V+ +LFH++DW+PT AAG + D T +DGV+QW+ LT+GA
Sbjct: 318 GVRGPGFLWSSSLTTRGRVTQQLFHVTDWMPTFYTAAGGQAKDLTMIDGVDQWESLTRGA 377
Query: 375 KTKRSEILHNIDNVDN 390
+ R E+L+NID + +
Sbjct: 378 PSPRKEVLYNIDPISD 393
>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 570
Score = 307 bits (787), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 173/415 (41%), Positives = 243/415 (58%), Gaps = 36/415 (8%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ P PHII I+ DD G+ND+ +H SS I TP +D LA +G+ L +Y+Q +CTPSRS
Sbjct: 42 SPVPTSPHIIFIMTDDQGFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQ 100
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L+TG+Y IH G+QH +I +P LP + LPQ L+E GY+TH +GKWHLGF+++ P
Sbjct: 101 LITGRYQIHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLP 160
Query: 144 TFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
T RGFD+++G G +YY + SC G D+ V G YST LYT+
Sbjct: 161 TRRGFDTYFGSLTGSVNYYTYDSCDG--PGMCGFDLHEGESVAWSQKGKYSTHLYTQRVR 218
Query: 203 NVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
++A H+ +S+P+F++L+ AVH P Q P E + + + + RR YA MVS +D
Sbjct: 219 KILATHDPRSQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVD 273
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E+V N+ LRK+G ENSI++F DNG + GSN PLRG K T W+GG+RG+
Sbjct: 274 EAVRNITYGLRKYGYYENSIMIFSTDNGGQPL----SGGSNWPLRGRKGTYWEGGVRGLG 329
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
I SP L++ ++VS L HI+DW PTL A G E + +DG + W+ ++ G ++ R E
Sbjct: 330 FIHSPLLRKKKRVSKALVHITDWYPTLVGLAGGKESHIEGVDGYDVWEAISDGRESPRLE 389
Query: 381 ILHNIDNVDNP------QKYY--------AALRVDDLKYVAGTDNNGQSDEWYGD 421
ILHNID + N QK Y A++R D K + G D YGD
Sbjct: 390 ILHNIDPLYNHARSGSLQKGYGIWNTAVQASIRAGDWKLLTG-------DPGYGD 437
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 35/60 (58%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI+ DP E+ +L+E + D++KQL +L Y T VP D RADP WVPW
Sbjct: 466 LFNISGDPYERLDLSEQRPDVVKQLLARLVYYNRTAVPVRYPTEDLRADPHLNGGAWVPW 525
>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
Length = 575
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 170/411 (41%), Positives = 244/411 (59%), Gaps = 30/411 (7%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
L T + PHII IL DD G++D+ +HGS I TP +D LA G+ L +Y+Q +CTPSR
Sbjct: 42 LPPTFTRPPHIIFILTDDQGYHDIGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSR 100
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
S L+TG+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+++
Sbjct: 101 SQLITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKEC 160
Query: 142 TPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
PT RGFD+ G G DYY + +C G D+ V G YST LY +
Sbjct: 161 LPTRRGFDTFLGSLTGNVDYYTYDNCDG--PDVCGYDLHEGEDVAWDQSGKYSTFLYAQR 218
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
++A H+ +P+F+Y+A AVH P Q+P E + ++ + + RR YA MV+ +
Sbjct: 219 VSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCM 273
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE+V N+ AL+K+G +NS+++F DNG +F + GSN PLRG K T W+GG+RG+
Sbjct: 274 DEAVKNITWALKKYGYYDNSVMVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGI 329
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
+ SP +K+ ++ S L HI+DW PTL + A G N LDG + W +++G + R+
Sbjct: 330 GFVHSPLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVQGLDGYDVWPAISEGKDSPRT 389
Query: 380 EILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSD 416
EILHNID + N KY A++RV + K + G + G SD
Sbjct: 390 EILHNIDPLYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSD 438
Score = 45.4 bits (106), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L+E + D+++ L +L Y T +P + RA P W PW
Sbjct: 468 LFNITADPYERYDLSEQRPDVVRTLLTRLVHYNRTAIPVRYPAENPRAHPDFNGGAWGPW 527
Query: 565 YDELDKQK 572
E D ++
Sbjct: 528 ASEDDGEE 535
>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
Length = 438
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 178/451 (39%), Positives = 263/451 (58%), Gaps = 39/451 (8%)
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
+CTPSR A ++G++P+ GMQ + EP GL L + LLP+YL++ GY TH +GKWH+G
Sbjct: 3 VCTPSRVAFLSGRHPLRTGMQGYPLKAAEPRGLHLNDTLLPEYLRKLGYTTHLLGKWHVG 62
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---Y 192
+ Y PT RGFD+ GY+ G+ Y++H+ + + G D+ H + DN T+ Y
Sbjct: 63 YLTRNYVPTRRGFDTFLGYFNGVIQYFNHTIEENEQV--GYDL-HRIVGDNHTVEYRYDY 119
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD--EEVAKFLDISDPER 250
T+L TE A N+I+ HN KPM+L LAHLA HA N E + D E A I D R
Sbjct: 120 MTNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEAMEVYDWEETNATLGYIQDVNR 179
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG-IHSNKGSNHPLRGMK 309
R +A +V+ LD+S+G V+ AL K ML+N+I+LF++DNGA + G ++ N GSN+PLRG+K
Sbjct: 180 RKFASVVTTLDKSIGRVVDALNKKDMLKNTIILFISDNGAQTEGPLYQNFGSNYPLRGLK 239
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWD 368
+ ++GG+RG A I+SP ++ + KVS++LFHI+DWLPTL +AAG +D LDG++QW
Sbjct: 240 FSLYEGGVRGAACIYSPLIENSSKVSTQLFHITDWLPTLYSAAGGNSSDLYQLDGIDQWS 299
Query: 369 VLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK 428
+ +KR ++ NI++VD Y+A + K + + + Y D K
Sbjct: 300 TIKTQEASKRKSVVINIEDVD-----YSAALMGRYKLIQDKSDFQKHYTNYKGNDPSYPK 354
Query: 429 YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC 488
Y+ +L S AA AN S+ I+T KI ELR+ A V C
Sbjct: 355 YNVTNILESP------------------AASAIANISNH---IVTANKIEELRKEATVVC 393
Query: 489 NYDNKGAHCNSTVKPCLFNITDDPCEQNNLA 519
+ + C S + CLF+I +DPCE +++
Sbjct: 394 E-NLMTSDCES--RTCLFDIYEDPCEVTDIS 421
>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
Length = 527
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 171/416 (41%), Positives = 244/416 (58%), Gaps = 36/416 (8%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
++ P PHII I+ DD G+ND+ +H SS I TP +D LA +G+ L +Y+Q +CTPSRS
Sbjct: 41 RSSVPTPPHIIFIMTDDQGFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRS 99
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
L+TG+Y IH G+QH +I +P LP + LPQ L+E GY+TH +GKWHLGF+++
Sbjct: 100 QLITGRYQIHTGLQHSIIRPRQPNCLPFDQITLPQRLQELGYSTHMVGKWHLGFYKKECL 159
Query: 143 PTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
PT RGFD+++G G +YY + SC G D+ V G YST LYT+
Sbjct: 160 PTRRGFDTYFGSLTGSVNYYTYDSCDG--PGVCGFDLHEGESVAWSQRGKYSTHLYTQRV 217
Query: 202 INVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
++A H+ +S+P+F++L+ AVH P Q P E + + + + RR YA MVS +
Sbjct: 218 RKILATHDPQSQPLFIFLSFQAVHT-----PLQCPQEYIYPYRGLENIARRKYAAMVSAV 272
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE+V N+ LRK+G +NSI++F DNG + GSN PLRG K T W+GG+RG+
Sbjct: 273 DEAVRNITYGLRKYGYYQNSIIIFSTDNGGQPL----SGGSNWPLRGRKGTYWEGGVRGL 328
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
+ SP L++ ++VS L HI+DW PTL A G E + +DG + W+ ++ G ++ R
Sbjct: 329 GFVHSPLLRKKRRVSKALVHITDWYPTLVGLAGGDESHIEGVDGYDVWEAISDGKESPRL 388
Query: 380 EILHNIDNVDNP------QKYY--------AALRVDDLKYVAGTDNNGQSDEWYGD 421
EILHNID + N QK Y A++R D K + G D YGD
Sbjct: 389 EILHNIDPLYNHARSGSLQKGYGIWNTAVQASIRAGDWKLLTG-------DPGYGD 437
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 35/60 (58%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI+ DP E+ +L+E + D++KQL +L Y T VP D RADP WVPW
Sbjct: 466 LFNISGDPYERFDLSEQRPDVVKQLLARLVYYNRTAVPVRYPTEDLRADPHLNGGAWVPW 525
>gi|241834402|ref|XP_002414990.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509202|gb|EEC18655.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 425
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 184/451 (40%), Positives = 248/451 (54%), Gaps = 42/451 (9%)
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
LCTPSRSALMTG YPIH GMQ VIL PWGLPL K+LPQ+ K+ GY + IGKWHLG
Sbjct: 3 LCTPSRSALMTGLYPIHNGMQSTVILPAAPWGLPLENKILPQHFKDLGYDVNMIGKWHLG 62
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
FF+ Y P RGFD+ +G + G DYY+H+ + + + + + V T G+
Sbjct: 63 FFQTPYVPMKRGFDTFFGLYTGSNDYYNHTSGSVSKKQEQITGALCLTVGGITHGV--EP 120
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
L T++ +N + P F Y +H AVH E FQAP V KF I + R YAG
Sbjct: 121 LLTDSPLNFYPQ-----PFFCYFSHQAVHRALDSETFQAPARNVLKFPYIEESNRTIYAG 175
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
M+ LDES+G V+ AL GMLE++I++F +DNG +GI SN+G N PLRG K T W+G
Sbjct: 176 MLDALDESMGRVVEALSSAGMLEDTIIVFSSDNGGGPYGIESNRGFNWPLRGAKFTVWEG 235
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGA 374
G+R A +WSP + +VS++L HISDWLPTL AAG + + ++DGV+ W+ LT +
Sbjct: 236 GVRVPAFVWSPKFLKKSRVSNQLMHISDWLPTLYTAAGGDASALGTIDGVDMWNSLTHRS 295
Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY---GDT--DNEIDKY 429
+KR EIL N +++ N + LR K V G +G+ D+ Y G T N+ID+
Sbjct: 296 ASKRKEILLNSNSLYN----ISGLRYKQYKLVVGGGFDGELDDHYYFPGGTRPTNDIDQL 351
Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN 489
+ T L+ K KQ + P A S++ V C
Sbjct: 352 RNEST-------TARVLR-KFYEKQDRSWSPLAWSNN-----------------VAVDCR 386
Query: 490 YDNKGAHCNSTVKPCLFNITDDPCEQNNLAE 520
+ + P LFNI DPCE NNLA+
Sbjct: 387 RNKLTENFVPMQPPYLFNIEKDPCELNNLAK 417
>gi|156364432|ref|XP_001626352.1| predicted protein [Nematostella vectensis]
gi|156213225|gb|EDO34252.1| predicted protein [Nematostella vectensis]
Length = 270
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 145/271 (53%), Positives = 187/271 (69%), Gaps = 6/271 (2%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
KKP+I++I+ADDLGW+DVSFHGSSQIPTP ID LA G+ILN +YV +CTP+R++LMTG
Sbjct: 1 KKPNIVMIVADDLGWDDVSFHGSSQIPTPTIDKLASEGVILNSYYVSPICTPTRASLMTG 60
Query: 88 KYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
K+P+++GM H + +P+GLPL E PQY+K GY TH IGKWHLGFF + YTPT
Sbjct: 61 KHPMNLGMLIHTHATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPT 120
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
+RGFDS YG+W G +DY+DHS + E G D+R N + G Y T+L+ E A +
Sbjct: 121 YRGFDSFYGFWNGKEDYWDHSSQ---EDVWGTDLRDNEKPVRNESGHYGTELFAERAAQI 177
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I HN++KP++LYLA VH+ N EP QAP + KF IS P+RR YA MVS LDESV
Sbjct: 178 IHLHNQTKPLYLYLAQQGVHSANGNEPLQAPKRLIKKFSHISSPKRRIYAAMVSSLDESV 237
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
V AL + GML N++++F DNG G
Sbjct: 238 ETVHKALSETGMLNNTVLVFTTDNGGAPRGF 268
>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 500
Score = 304 bits (779), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 166/422 (39%), Positives = 246/422 (58%), Gaps = 25/422 (5%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F + ++ N KP+I+ IL DDLG+NDV +HGS +I TPNID LA G+
Sbjct: 5 LFFINLIIISNVCSKKIIQSTKPNIVFILIDDLGYNDVGYHGS-EIYTPNIDKLAREGVR 63
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +YVQ +CTP+RS L++G+Y IH G+QH I +P LP L+EAGYATH
Sbjct: 64 LENYYVQPICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATH 123
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-----SCKATFEPYQGLDMRHNM 182
A+GKWHLGF+++ PT RGFDS++GY G +DY+ H + G+D+ N
Sbjct: 124 AVGKWHLGFYKKECLPTQRGFDSYFGYLTGGEDYWTHHRAGDGLLPNSNHWLGMDLWDNE 183
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
+V + +G YS ++TE ++A+H +P LYL +VH+ P Q P ++
Sbjct: 184 KVAWEYVGNYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHS-----PLQVPSSYEERY 238
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
+I + RR YAGM++ LDE+VG ++ +L++ G+ +N++++F DNG G + G+N
Sbjct: 239 KNIKNTNRRIYAGMMTCLDEAVGKIVHSLQQAGVWDNTVLIFSTDNG----GEVAAGGNN 294
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDT 359
PLRG K + W+GGMRGV + S L + + V+ +L H+SDW PTL AG +++
Sbjct: 295 WPLRGWKRSIWEGGMRGVGFVNSRLLPASVQGTVNKQLIHVSDWFPTLVQGVAGSSLDNI 354
Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYY-----AALRVDDLKYVAG--TDNN 412
+LDG N W ++ + R E+LHN+D N K + AA+RV K + G NN
Sbjct: 355 TLDGFNMWKTISSNTSSPRQELLHNMDPWPNQPKEFNANISAAIRVGHWKLLTGDPGSNN 414
Query: 413 GQ 414
GQ
Sbjct: 415 GQ 416
Score = 42.0 bits (97), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 34/60 (56%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E ++L+ S +++++L KL Y + VP + A+PA +W PW
Sbjct: 440 LFNITADPNEYHDLSTSHPEVVEELMAKLKKYYADSVPVQYPSEVEEANPAYHQGVWGPW 499
>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 172/409 (42%), Positives = 241/409 (58%), Gaps = 38/409 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHI+ IL DD G+ND+ +H + I TP +D LA G+ L +YVQ +CTPSRS L+TG+Y
Sbjct: 44 PHIVFILIDDQGFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLLTGRY 102
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
IH G+QH +I +P LP LP+ L+ AGY+TH +GKWHLGF+R+ PT +GFD
Sbjct: 103 QIHTGLQHSIIRSRQPSCLPRHMDTLPETLRRAGYSTHMVGKWHLGFYRKSCLPTRKGFD 162
Query: 150 SHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
+ +G G DYY + SC G D+ N +V G YST L+T+ A ++ H
Sbjct: 163 TFFGSLTGSVDYYSYGSCNG--PGVCGYDLHDNERVAWGHEGKYSTTLFTQRAHKILKSH 220
Query: 209 NKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+ + +P+FL L+ AVH P Q P V + D+++ +RR +A MVS +DE+V NV
Sbjct: 221 DPADRPLFLLLSLQAVHG-----PLQPPKSFVYLYRDMANVDRRKFAAMVSTVDEAVRNV 275
Query: 268 IAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
ALRK+G +NS++++ DNGA P G GSN PLRG K T W+GG+RGVA + SP
Sbjct: 276 TYALRKYGYYKNSVIIYSTDNGAQPYVG-----GSNWPLRGRKGTYWEGGVRGVAFVHSP 330
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
LK ++VS+ L HI+DW PTL AG I+ + LDG + W +++G ++ R EILHNI
Sbjct: 331 LLKHRRRVSTALLHITDWFPTLVGLAGGNISQSPGLDGFDVWPTISEGKESPRKEILHNI 390
Query: 386 DNVDN-PQK--------------------YYAALRVDDLKYVAGTDNNG 413
+ + P K AA+RV D K + G +G
Sbjct: 391 NPLHRAPSKRGICNHSMLPSGLSAVWDTSVQAAIRVGDWKLLTGDPGDG 439
Score = 52.0 bits (123), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 32/71 (45%), Positives = 43/71 (60%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L++ + D++ QL +LA Y T VP P D +ADP R WVPW
Sbjct: 471 LFNITADPYERQDLSQQRPDVVLQLLGRLAFYNQTAVPVYFPPDDPQADPDRHQGAWVPW 530
Query: 565 YDELDKQKAIE 575
DE D++ E
Sbjct: 531 VDEDDEEGKYE 541
>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
Length = 481
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 170/416 (40%), Positives = 240/416 (57%), Gaps = 35/416 (8%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
L P I+ DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSR
Sbjct: 27 LVADGPGXXXXXFIVTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSR 85
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
S L+TG+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+
Sbjct: 86 SQLLTGRYQIHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKEC 145
Query: 142 TPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
PT RGFD+ G G DYY + +C G D+ V G YST LY +
Sbjct: 146 LPTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQR 203
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A +++A H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +
Sbjct: 204 ASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCM 258
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE+V N+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+
Sbjct: 259 DEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGL 314
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRS 379
+ SP LK+ Q+ S L HI+DW PTL AG + LDG N W +++G + R+
Sbjct: 315 GFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYNVWPAISEGRASPRT 374
Query: 380 EILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGD 421
EILHNID + N ++ AA+RV + K + G D YGD
Sbjct: 375 EILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD 423
>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
Length = 477
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 167/397 (42%), Positives = 233/397 (58%), Gaps = 19/397 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+TA KPHI+ I+ DDLGWNDV + + I TP +D LA +G+ILN YVQ LC+PSR+
Sbjct: 18 STAADKPHIVFIVVDDLGWNDVGWI-NPDIKTPTLDRLARSGVILNSSYVQPLCSPSRNC 76
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
M+GK+P H G+Q V+ +P +P +PQ LK GY TH +GKWHLGF YTP
Sbjct: 77 FMSGKFPYHTGLQDKVVFIEQPKYMPANLTTIPQRLKTLGYDTHMVGKWHLGFCNWKYTP 136
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
T+RGFD+ GY+ G++DY+ H + Y G D R+ V G YST ++ AI+
Sbjct: 137 TYRGFDTFMGYYAGMEDYFTH-VRDEVADYNGYDFRNMTDVYKGAQGEYSTYVFANRAID 195
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+I H+KSKPM+LYL AVH P Q P + ++ I D R+ Y+GMVS LD++
Sbjct: 196 IIMNHDKSKPMYLYLPFQAVHV-----PLQVPTKYSDRYSHIHDLNRKVYSGMVSALDDA 250
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
V N+ L G ++N +++F DNG P + +N PLRG K+T W+GG RG A +
Sbjct: 251 VYNITTVLEDLGFMDNLLLVFTTDNGGPP----TRGANNWPLRGGKNTLWEGGTRGAAFV 306
Query: 324 WS-PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
+S LK+ LFH DW PTL AG + + +DG++Q D++ G + R++ +
Sbjct: 307 YSKTLLKKKGYTHPGLFHAVDWYPTLLDIAGGDSSQLDIDGLSQVDMILNGDSSVRNQFV 366
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
+NI + Y AALR DLK + G N G S WY
Sbjct: 367 YNIYD----DHYAAALRWGDLKLIVG--NPGYS-GWY 396
Score = 42.4 bits (98), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 32/59 (54%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
L+NI+ DP E ++++ DL+ L LA T VP + P D R++P ++ W P
Sbjct: 418 LYNISTDPTEHEDISKDYPDLVNFLLQMLARKNGTRVPAMDPPSDPRSNPVNFDGSWSP 476
>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 1410
Score = 304 bits (778), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 154/368 (41%), Positives = 231/368 (62%), Gaps = 22/368 (5%)
Query: 32 IIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPI 91
I+ I+ DD+GWNDV +HGSS I TP++D LA G L +YV LC+PSR +TGK+ I
Sbjct: 29 IVFIMIDDVGWNDVGYHGSS-ISTPHMDTLAKEGTKLENYYVAHLCSPSRGMFLTGKHMI 87
Query: 92 HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
H+GM+ G+I+ E LP+ E + Q LK Y+THAIGKWH+G++++ P RGFD+
Sbjct: 88 HLGMEGGIIMPFERKCLPVNEATIAQELKLKNYSTHAIGKWHVGYYKKACLPNNRGFDTF 147
Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS 211
+G G D+Y H +E Y R+N+ + + G YSTDLY A NVI H+ S
Sbjct: 148 FGIIGGCADHYTHKNTHWWELY-----RNNISIAQEYQGHYSTDLYAREATNVIRNHDAS 202
Query: 212 KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAAL 271
KPMF+YLA A H P QAP + + + +I DP+RR YA MV+ +D+++GNV+ L
Sbjct: 203 KPMFMYLAFQAAHL-----PLQAPRKYIDMYSNIEDPDRRVYAAMVTCMDDAIGNVVDQL 257
Query: 272 RKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT 331
R+ G+ N++++ DNG G+N PL+G K++ W+GG+RGVA + SP+L
Sbjct: 258 REAGLWNNTVLIVSTDNGGAKIS-----GNNWPLKGSKASLWEGGVRGVAFVTSPFLADH 312
Query: 332 QKVSS--ELFHISDWLPTLCAAAGI---EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
+ +S +L H++DW PT+ AG+ I +LDGVNQW+ +++ +T+R+++L N+
Sbjct: 313 VRGTSNHQLMHVTDWFPTMLHVAGVYSDVIKSMTLDGVNQWNTISENRQTERNDVLINL- 371
Query: 387 NVDNPQKY 394
NV+ K+
Sbjct: 372 NVNRKVKH 379
>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 585
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 169/385 (43%), Positives = 238/385 (61%), Gaps = 18/385 (4%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHI+++LADD+GWNDV ++ + +PTP ++ LA NG+ILN Y Q C+PSR+AL+TG
Sbjct: 106 KPPHIVLVLADDVGWNDVGWN-NDFMPTPILNELASNGVILNNTYSQPACSPSRAALLTG 164
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
KYP + G+QH V+ E P+ LPL LL LKE GY HAIGKWHLGF YTP +RG
Sbjct: 165 KYPANAGIQHLVVQEQHPYYLPLHNTLLSTKLKELGYMNHAIGKWHLGFCNWKYTPLWRG 224
Query: 148 FDSHYGYWQG-LQDYYDHSCKATF--EP-YQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
FDS YG + G L DY H + F EP GLD+R N V G + T L+TE A
Sbjct: 225 FDSFYGIFNGYLSDYSTHIVHSPFIGEPGASGLDLRDNTGVVAHENGTHVTYLFTERAER 284
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+I HN + P+FLY A L H P AP ++++D RR YAGM+S +DE+
Sbjct: 285 IIRNHNPAAPLFLYTALLVPHG-----PLDAP-PGFENSVEVNDTRRREYAGMISAMDEA 338
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VGN+ AL+ GM +++ +F++DNG F G+N+P RG K++ W+G ++ I
Sbjct: 339 VGNITNALKDKGMWNDTLFIFLSDNGGDMF----YAGNNYPFRGNKASLWEGSVKIPGFI 394
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
+ L++ ++EL H +D TL +AAG +D +DG+NQWD + +G +KRSEIL
Sbjct: 395 YGNMLEKKGYSNNELIHFTDLFATLLSAAGGTPDD-DIDGINQWDTVCRGDASKRSEILL 453
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAG 408
++D + + AALR++D KY+ G
Sbjct: 454 HLDTFEATRG--AALRMNDYKYIEG 476
>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 732
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 162/370 (43%), Positives = 227/370 (61%), Gaps = 18/370 (4%)
Query: 23 NTTAPKK---PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
+ TA KK PHI+ IL DD G+ND+ +H + I TP +D LA G+ L +YVQ +CTP
Sbjct: 39 DETAQKKKNQPHIVFILTDDQGFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTP 97
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
SRS L+TG+Y IH G+QH +I +P LP LP+ L+EAGY TH +GKWHLGF+R+
Sbjct: 98 SRSQLITGRYQIHTGLQHSIIRPRQPSCLPSHMDTLPERLREAGYTTHMVGKWHLGFYRK 157
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
PT +GFD+ +G G DYY + SC + G D+ + V G YST L+T
Sbjct: 158 ACLPTRKGFDTFFGSLTGSVDYYSYESCDG--KDLCGYDLHDDEGVAWGQEGKYSTTLFT 215
Query: 199 EAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
+ A ++ H+ + +P+FL L+ AVH P P+ + F D+ + RR +AGMV
Sbjct: 216 QRARKILESHDPAERPLFLLLSFQAVHT-----PLNPPESYIYPFRDMVNVARRNFAGMV 270
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
S +DE+V NV ALRK+G NS++++ DNGA F G+N PLRG K T W+GG+
Sbjct: 271 STVDEAVRNVTYALRKYGYYRNSVIIYSTDNGAQPF----TGGNNWPLRGRKGTYWEGGV 326
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKT 376
RGV + SP L++ ++VS L HI+DW PTL A I+ + LDG + W +++G ++
Sbjct: 327 RGVGFVHSPLLRRRRRVSKALMHITDWFPTLVGLARGNISQSQGLDGFDVWPAISEGRES 386
Query: 377 KRSEILHNID 386
R EILHNID
Sbjct: 387 PRKEILHNID 396
Score = 59.7 bits (143), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 40/60 (66%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+++LA+ + D+++QL +LA Y T VP P D RA+P++ WVPW
Sbjct: 622 LFNITGDPYERHDLADQRPDIVQQLLARLAYYNQTAVPVYFPPDDPRANPSQHRGAWVPW 681
>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 631
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 195/580 (33%), Positives = 304/580 (52%), Gaps = 56/580 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+II++LADDLG++DVSFHG+ QIPTPN+DA+A +G+ILN+HY TPSR A TGK
Sbjct: 41 RPNIIVVLADDLGYDDVSFHGNEQIPTPNLDAMAADGVILNRHYAAMSGTPSRGAFFTGK 100
Query: 89 YPIHIGMQHGVILEGE-PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
P+ IG+ G IL+G GL L ++LP Y+++ GY TH IG+W LGF++E PT RG
Sbjct: 101 LPLRIGLNEGPILKGVYGTGLSLEHEVLPFYMRDLGYETHLIGRWGLGFYKESLLPTNRG 160
Query: 148 FDSHYGYWQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
FD+HYG + Y H + + D+ + + D G Y TDLY
Sbjct: 161 FDTHYGPYSDSASYSSHLSREDAWKSLSVPPAYDLHRDGKPDFSGFGSYVTDLYKGRFER 220
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP---DEEVAKFLDISDPERRTYAGMVSRL 260
+I + + KP+F+ L+H H G ++ P P ++FL I D RR+YA V L
Sbjct: 221 IIEQ--RRKPLFIVLSHQTPH-GASFGPLHQPPPRTNRASQFLHIKDRSRRSYARTVESL 277
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFG--IHSNKGSNHPLRGMKSTPWDGGMR 318
D+S +++ L+++ +L S++LF +DNG +F SN PLRG K+T W+GG+R
Sbjct: 278 DDSFAHMVDVLQRNNILNESVILFSSDNGGTTFKDPYFKTGASNWPLRGQKNTQWEGGVR 337
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWD-VLTKGAKT 376
+ +WS + + VS EL+H DWLPT G +I D +DG + WD ++ + +
Sbjct: 338 VSSFVWSSQITEPY-VSEELYHFVDWLPTFRRLGGGDIGDLKDIDGFDIWDSIVNRTTAS 396
Query: 377 KRSEILHNIDN---VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE 433
R++IL D VD+P +AA+ + KYV+G +W G ++ + Y P
Sbjct: 397 PRTKILLAYDRDFTVDSPG--FAAI-IGKFKYVSG--------KWSGR--DQTNHYVPAS 443
Query: 434 VLYSKAGITFNALKTKLQI----KQKHAADPKANSSDALRTI-----------LTDEKIL 478
YS+ L+ ++ + P A+ + R ++ +KI+
Sbjct: 444 FKYSEDSQVLERLRNDSKVWKNLRDIGRPGPAADWREQARIKCSWGAHLTCNGVSCDKIM 503
Query: 479 ELREFAR----VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLA----ESQTDLLKQLE 530
E +R +R N C+ +PCLF++ +DPCE N+A + + D+ +
Sbjct: 504 ETFRGSRGNVEIRVEPVNVSHACDFAHEPCLFDLENDPCETTNIANDMPQIRDDIGSHII 563
Query: 531 DKLAIYKSTMVPPGNKPFDKRADPA-RWNNIWVPWYDELD 569
+ +ST + FD+ ++PA R +W PW D
Sbjct: 564 NDFDPVESTARLRYDACFDQNSNPANRDPPVWEPWLSGCD 603
>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
Length = 551
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 159/363 (43%), Positives = 222/363 (61%), Gaps = 14/363 (3%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHII IL DD G+ DV +HGS I TP +D LA G+ L +YVQ LC+PSRS LMTG+
Sbjct: 23 QPHIIFILVDDQGFRDVGYHGSD-IKTPTLDRLAAEGVKLENYYVQPLCSPSRSQLMTGR 81
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH VI +P LPL LPQ LK AGY+TH +GKWHLGF++ PT RGF
Sbjct: 82 YQIHTGLQHSVIRAAQPNCLPLENVTLPQKLKNAGYSTHMVGKWHLGFYKRGCLPTQRGF 141
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
D+ +G G DYY H K G D+ + + G+YST+++T+ A++++A
Sbjct: 142 DTFFGSLLGSGDYYSHY-KCQGPSMCGYDLYEGEEAAWEQDRGLYSTEMFTQKAVSILAN 200
Query: 208 HN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
HN + +P+FLYLA+ AVH+ P Q P + ++ I +P RR YA MVS LDE++ N
Sbjct: 201 HNPRKQPLFLYLAYQAVHS-----PLQVPARYLERYKGIPNPYRRKYAAMVSCLDEAIHN 255
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+ +G +N+++++ +DNG GSN PLRG K+T W+GG+R V + SP
Sbjct: 256 LTLALKHYGYYDNTVIVYSSDNGGQPLA----GGSNWPLRGSKATYWEGGIRTVGFVHSP 311
Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
L L HI+DW PTL G D +LDG + W+ +++G + R +ILHNI
Sbjct: 312 LLMNKGTKCRSLIHITDWFPTLVTLGEGTLDEDLNLDGYDVWETISEGRPSPRHDILHNI 371
Query: 386 DNV 388
D +
Sbjct: 372 DPI 374
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 42/73 (57%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L++ ++K++ +LA Y T VP D R++P +W PW
Sbjct: 445 LFNITADPYERVDLSQRYPHIVKKMLMRLAQYNKTAVPVCYPTKDLRSNPQYNGGVWGPW 504
Query: 565 YDELDKQKAIETM 577
+ ++ +Q+ + M
Sbjct: 505 HKDVREQQEEDEM 517
>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 562
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 197/581 (33%), Positives = 287/581 (49%), Gaps = 93/581 (16%)
Query: 28 KKPHIIIILADDLGWNDVSFHGS---SQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
++PH++ +LADD G+ND+ +H S + TP +D+LA G+IL +YVQ +C+P+R L
Sbjct: 32 QRPHVVFVLADDFGFNDIGYHAREHYSDMYTPFLDSLAAKGVILENYYVQPICSPTRGQL 91
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+TG+Y IH G+ HG+I +P+GLPL LL Q L++ GY T+ +GKWHLGFFRE Y P
Sbjct: 92 LTGRYQIHTGLAHGIIRAAQPYGLPLDNILLSQQLRQCGYKTNMVGKWHLGFFREEYLPW 151
Query: 145 FRGFDSHYGYWQ-GLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
RGF + +G+ G+ + + C+ F Y +D R+ N T G YST+L+
Sbjct: 152 NRGFQNFFGFLNGGVNHFTRYHCEPKKTRRFCGYDMIDSRYG--PTNATYGEYSTNLFIR 209
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
+ +I +HNK KPMFLYL+ AVH P Q P++ + +F I D RR YAGMV
Sbjct: 210 KSKEMIDKHNKQKPMFLYLSLQAVHG-----PLQVPNQYLKRFKHIRDKNRRIYAGMVYA 264
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+D + ++ L++ M +N+I +F DNG G + G+N PLRG K T W+GG+RG
Sbjct: 265 MDRGIRQLVKHLKRARMWKNTIFIFSTDNG----GQTTRGGNNWPLRGKKGTLWEGGIRG 320
Query: 320 VAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE--INDTSLDGVNQWDVLTKGAKT 376
V + P T +V+ EL H+SDW PT+ +A + +DG +QW+ L +
Sbjct: 321 VGFVHGKPLQVTTPRVNKELLHVSDWYPTIMSATHCPYVVGTPPIDGYDQWETLRSNKTS 380
Query: 377 KRSEILHNIDNVDNP-----------QKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
KR+EILHNID + AA+R D K + G +EW
Sbjct: 381 KRTEILHNIDPLHKKLTESEFREGFDVSVRAAIRSGDWKLLTGYAG---MNEW------- 430
Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
P E + I QI+ P NS+ +R E E RE
Sbjct: 431 ---VVPPECTHGNITIG--------QIRDDFHVYP--NSTKNVRLFKITEDPHEERE--- 474
Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
++D E NL LL +L++ Y ++ +P
Sbjct: 475 ----------------------VSDQYPEVVNL------LLARLKE----YNTSSLPVDY 502
Query: 546 KPFDKRADPARWNNIWVPW--YDELDKQKAIETMRLHKPLL 584
DK++DP R W+PW K+K R KP L
Sbjct: 503 PHNDKKSDPKRHGGFWLPWRGVGRKYKRKKFRPHRARKPKL 543
>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 588
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 164/369 (44%), Positives = 234/369 (63%), Gaps = 17/369 (4%)
Query: 30 PHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
PH+I+I+ADDLG+NDV +H G S I TPNID +AY+G+ L +YVQ +CTP+RS L+T
Sbjct: 94 PHVIMIIADDLGYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLIT 153
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH GMQH + G P LPL E L Q LK+ GY+THA+GKWHLG+ + P+ R
Sbjct: 154 GRYQIHTGMQHLNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRR 213
Query: 147 GFDSHYGYWQGLQDYYDHSCKATF--EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
GF+S +G G D++ H+ A F + G M +N ++ K G +ST LYT A +
Sbjct: 214 GFESFFGNIMGSADHWSHNKTALFGDKLVMGKSMYYNERIYWKHEGTFSTTLYTNRARQL 273
Query: 205 IAEHNKSKPMFLYLAHLAVHAG-NTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
I + ++KP+FLYL++ AVH N E + P E + I + +RR YAG+V+ LDE+
Sbjct: 274 IRKQPRNKPLFLYLSYEAVHTPLNVPEQYAKPYEGI-----IHNSKRRRYAGLVNILDEA 328
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
V NV AL+ +G+ +NS+++F DNG + G+N PLRG KST W+GG+RGV +
Sbjct: 329 VRNVTEALKYNGLYDNSVIIFTTDNGGRP--KPRSVGNNWPLRGGKSTLWEGGIRGVGFV 386
Query: 324 WSPWLKQTQK--VSSELFHISDWLPTLCA--AAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
SP + + V+ +L H+SDW PT+ A G + + LDG +QW ++KG ++ R
Sbjct: 387 HSPLIPWELRGTVNRQLIHVSDWFPTIVXGIAGGKLVTNKPLDGXHQWKTISKGTESNRH 446
Query: 380 EILHNIDNV 388
EILHNID +
Sbjct: 447 EILHNIDPI 455
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN-KPFDKRADPARWNNIWVP 563
L+NI DP E+ N+A Q ++ +L +L Y++T V P + P D+R +P WVP
Sbjct: 525 LYNIKKDPNERQNMAPYQKKIVYRLLKRLQDYQNTAVTPIHLGPKDERGNPKYHGGAWVP 584
Query: 564 WYDE 567
W D
Sbjct: 585 WMDR 588
>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 522
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 185/480 (38%), Positives = 254/480 (52%), Gaps = 78/480 (16%)
Query: 2 TWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
TW R L LL L T + +PHII ILADDLGW+DV +HGS + TP IDAL
Sbjct: 4 TWLR----CLLSILLGVSLSLATASATRPHIIFILADDLGWDDVGYHGSV-MKTPYIDAL 58
Query: 62 AYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE 121
A G+ L +Y+ +LCTPSRS L+TG+Y IH G+QHG IL +P LPL E LPQ LKE
Sbjct: 59 AAEGVTLENYYMPSLCTPSRSVLLTGRYEIHTGLQHGTILMMQPLCLPLDEITLPQKLKE 118
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
GY TH +GKWHLGF+R+ P RGFDS G++Q + D++ H+ A+ + G D R N
Sbjct: 119 EGYDTHMVGKWHLGFYRKECLPNNRGFDSFLGFYQAMGDHFYHNISASPGHFNGFDFRRN 178
Query: 182 MQ-VDNKTIGIYSTDLYTEAAIN--------------------------VIAEHNKSKPM 214
V ++ G YST ++ IN + + +N +P+
Sbjct: 179 NDVVADQYAGKYSTHIFXXXFINTQTLSFVCVNNVKGVYFRGYLSSFTPITSSYNPQQPL 238
Query: 215 FLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI--SDPERRTYAGMVSRLDESVGNVIAALR 272
FLYL+ AVH P Q P + D+ +D +RR YAGMV+ +DE+VG V L+
Sbjct: 239 FLYLSFQAVHT-----PLQVPSRYAELYNDLIPNDEDRRIYAGMVTCMDEAVGYVTTVLK 293
Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
+ + ENS+V+F LRG K T ++GG+RGV + SP L +
Sbjct: 294 QSALWENSVVIFSTX-----------------LRGGKGTLYEGGVRGVGFVTSPLLHEQV 336
Query: 333 K--VSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV- 388
+ ++ EL HISDW PTL AG ++ LDG NQW+ +++G + R+EIL NID +
Sbjct: 337 RGTINHELIHISDWFPTLTRLAGGDLMGGGRLDGYNQWETISQGTPSLRTEILLNIDPLY 396
Query: 389 -------DNP--------QKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE 433
D+P AA+RV D K + G NG W T++ I P+
Sbjct: 397 TAVRNLRDDPWAESTCFNVSMRAAIRVGDWKLITGGAGNG---SWIPPTESGIAALHPER 453
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR-ADPARWNNIWVP 563
L+N+ DP E+ + + + D+ ++L +L Y T VPP + R ADPA + +W P
Sbjct: 461 LYNVHVDPYERMDYSHERPDIAEELLQRLGHYYRTSVPPNFPQQEVRSADPANYGGVWSP 520
Query: 564 W 564
W
Sbjct: 521 W 521
>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
Length = 513
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 157/337 (46%), Positives = 210/337 (62%), Gaps = 14/337 (4%)
Query: 55 TPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKL 114
TP++DALA G++L+ +Y Q LCTPSRS L+TG+Y I G+QH +I +P +PL EKL
Sbjct: 49 TPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 108
Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ 174
LPQ LKEAGY TH +GKWHLG +R+ PT RGFD+++GY G +DYY H +
Sbjct: 109 LPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALN 168
Query: 175 ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE 230
LD R +V +YST+++T+ AI +I H KP+FLYLA +VH E
Sbjct: 169 VTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----E 223
Query: 231 PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
P Q P+E + + I D R YAGMVS +DE+VGNV AAL+ G+ N++ +F DNG
Sbjct: 224 PLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGG 283
Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
+ G+N PLRG K + W+GG+RGV + SP LKQ + EL HISDWLPTL
Sbjct: 284 QTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVK 339
Query: 351 AAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
A N T LDG + W +++G+ + R E+LHNID
Sbjct: 340 LARGHTNGTKPLDGFDVWKTISEGSPSPRIELLHNID 376
>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
Length = 1941
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 170/409 (41%), Positives = 252/409 (61%), Gaps = 28/409 (6%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ FA+ ++F + T P PHI++ILADDLGWND+ + + QI TP +D LAY G+
Sbjct: 23 FLFAVAA-IMFITLYDKKTNP--PHIVLILADDLGWNDIGW-NNLQIKTPVLDKLAYEGV 78
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
I NQ YVQ LCTP+R+ALMTG YP IGMQH ++L +P GLPL K+LPQ LK+AGY
Sbjct: 79 IFNQTYVQPLCTPTRAALMTGYYPFRIGMQHQMVLPFQPSGLPLHLKILPQKLKQAGYIN 138
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP------YQGLDMRH 180
H +GKWHLG+ YTP RGFDS YG + +H+ K + P Y+G D R
Sbjct: 139 HIVGKWHLGYCNWEYTPLNRGFDSFYG---SFSNSVNHNNKISQLPISDHSKYKGYDFRD 195
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-V 239
N V + G T L+T+ +++I+ H+K PMF+Y G+ PDE+
Sbjct: 196 NTGV-VQNDGQPLTKLFTQRVVDIISNHHKDYPMFMYFPINQPSKGS------EPDEKYT 248
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
A + +++D + R + +S +D++VG+VI AL+ GM E ++++F++DNGA F +
Sbjct: 249 ALYPNVTDDQVRQWYATLSLMDDAVGHVIDALKSRGMWEETLLIFISDNGAFDFM----E 304
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
G+N PLRG +T ++GGMR A +K+T +++EL HI+DW T+ + AGI+ +
Sbjct: 305 GTNLPLRGAGTTLFEGGMRVPAIAHGKMIKKTGYINNELNHITDWHSTILSLAGIQ-PEP 363
Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
LDG+N W +++G + R+EI++NID +D + AALR+ D K + G
Sbjct: 364 DLDGINIWGTVSEGKASARNEIVYNIDEIDECRG--AALRLGDWKVITG 410
Score = 41.2 bits (95), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 18/59 (30%), Positives = 32/59 (54%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFN+ +DP E++N+AE ++ L D+L Y+ +VP + K +P + + P
Sbjct: 451 LFNLKEDPEERHNVAEKYPEVFSALLDRLNEYRKNVVPAIDTSLVKEGNPENYGGVLTP 509
>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 437
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 167/410 (40%), Positives = 241/410 (58%), Gaps = 18/410 (4%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
A+T LL + F TT KP+I+I+ DDLGW DV FHGS QIP PNIDALA +G+IL
Sbjct: 4 LAVTWVLLSHLEFSFTT---KPNIVIVTVDDLGWADVGFHGSRQIPVPNIDALAADGVIL 60
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
N++Y Q SR L+TG YP G+ V+L +P LP ++LP + K GY TH
Sbjct: 61 NKYYAQPWPLSSRIGLLTGIYPYRTGVGR-VMLPCQPVALPSVFRILPTFFKSLGYRTHF 119
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
+G W+LGF+++ +TP RGFD+ Y W G DY+ H + QG D+R N +
Sbjct: 120 VGVWNLGFYKKRFTPVNRGFDTAYAKWTGPGDYWTHDMQTKM---QGFDLRLNDDLMWNQ 176
Query: 189 IGIYSTDLYTEAA--INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
G+YST L+TE A + +P+ L L+H A+H N + Q P E + F I
Sbjct: 177 TGVYSTRLFTERADPTRKLCFFVLHQPLLLILSHQALHTANYHGSLQCPLEHLDSFGFIR 236
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS-NHPL 305
D +RR +AG ++ +D+S+G + AL +L N+I++F++ +G G +N S N PL
Sbjct: 237 DRKRRIFAGALTEVDQSLGQIFEALHNRRLLNNTILVFVSASGGMPVGDFTNNLSFNFPL 296
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS----L 361
RG K T ++GG+R A +WSP L +T++ SS+L H++DWLPTL +AAG +I L
Sbjct: 297 RGTKGTCFEGGVRVPAFVWSPLLNKTRRTSSQLMHVTDWLPTLFSAAGGDIKKLGAVFDL 356
Query: 362 DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN 411
D VN W+ L+ ++ R+EI+HNID + AL+ + K +A T N
Sbjct: 357 DSVNMWEALSGDLQSPRTEIIHNIDLAG----HNFALQEGNYKVIAVTKN 402
>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
Length = 300
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/299 (50%), Positives = 197/299 (65%), Gaps = 4/299 (1%)
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
+CTPSR+AL+TGKYPI+ GMQH VI+ +PWGLPL E + + +E GY T +GKWHLG
Sbjct: 1 MCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLG 60
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
+ +TPT RGFD H GY DYY S + + Y G D R +++ + +G Y TD
Sbjct: 61 LSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTD 120
Query: 196 LYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
L T+AA+ I +H N S+P+FL L HLA HA N +P QAP EEV++F IS+ R
Sbjct: 121 LLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRY 180
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
YA MVSRLD+SVG+VI AL + ML+NSI+LF++DNG P+ G HS SN+PLRG K++P
Sbjct: 181 YAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSP 240
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
W+G +R AAIWS ++ V + +I D LPTL AAAGI + LDG+N W L
Sbjct: 241 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSAL 299
>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 517
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 195/567 (34%), Positives = 287/567 (50%), Gaps = 63/567 (11%)
Query: 8 FFALTCTLLFNDAFLNTTAP-KKPHIIIILADDLGWNDVSFHG---SSQIPTPNIDALAY 63
+ L LLF L A KP+I+ ILADD G+ND+ +H S + TP +D+LA
Sbjct: 1 MYLLRVVLLFGLHVLCVPAELTKPNIVFILADDYGFNDIGYHAVEHHSDMKTPFLDSLAM 60
Query: 64 NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
G+ L +Y+Q +C+PSRS LM+G+Y IH G+QH VI + GLPL +LP+ L + G
Sbjct: 61 AGVRLENYYIQPICSPSRSVLMSGRYQIHTGLQHYVISPQQRNGLPLDNIILPEQLHKCG 120
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNM 182
Y TH +GKWHLGF+++ Y P RGF+S++GY G +DYY C G DM
Sbjct: 121 YDTHMVGKWHLGFYKDEYLPWKRGFNSYFGYLTGGEDYYTKWRCDGKL---CGYDMTSEK 177
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
N T G YS +L+ A I +H+K+KP+FLY+A +VH+ P + P+ F
Sbjct: 178 GPTNATYGQYSANLFANKANEAIDKHDKTKPLFLYVAFQSVHS-----PMEVPESYAKPF 232
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
I + R+ Y GMV+ +DE+V N+ L+ G+ +N+I++F ADNG + + G+N
Sbjct: 233 DYIKNHNRKMYGGMVAAMDEAVKNITEHLQAAGLWDNTILVFSADNGGQTL----SGGNN 288
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK---VSSELFHISDWLPTLCAAAGIEINDT 359
PLRG K T W+GG++GV + L V++E+ HISDW PT+ A +
Sbjct: 289 WPLRGRKLTLWEGGIKGVGFVHGKILNVPNPNYIVNNEMIHISDWFPTIMEATQCPYVEG 348
Query: 360 S--LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
+ LDGVNQWD + + R EIL NID + P +Y +++ G D + +
Sbjct: 349 TQKLDGVNQWDTIFGKTVSARKEILLNIDPLFTPL-FYKETPKQNIQ--MGFDTSVHAGI 405
Query: 418 WYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKI 477
YG+ L T LQ + P+ + +E+
Sbjct: 406 RYGEW----------------------KLLTGLQGDDRWIEPPEWKNK-------AEEEQ 436
Query: 478 LELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
E E + G +T L+NI +DP EQ +++ ++ L KLA Y
Sbjct: 437 QEDTE---------HPGFQKFATKSVQLYNIVNDPYEQVEVSDLYPSVVNDLLSKLAAYN 487
Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW 564
+T VP P D +DP+ W PW
Sbjct: 488 ATAVPVNYPPSDPTSDPSLHGGFWGPW 514
>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
Length = 775
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 185/483 (38%), Positives = 266/483 (55%), Gaps = 40/483 (8%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y + GMQ + E G+PL LLP+YL+ GY TH +GKWH+G+ + + PT RGF
Sbjct: 295 YKLLRGMQGYPLRGAERRGIPLNNTLLPEYLRRLGYTTHLVGKWHVGYHTKNFGPTRRGF 354
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGI---YSTDLYTEAAINV 204
D+ GY+ G+ Y++H+ +E Q G D+ H + +N T+ Y TDL T+ ++
Sbjct: 355 DTFLGYYTGMIQYFNHTL---YESGQLGYDL-HRIVGENHTVEYRYDYMTDLLTDEVESI 410
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD--EEVAKFLDISDPERRTYAGMVSRLDE 262
I+ HN KPM+L L+HLA HA + E + D E F I D RR YA +V+ LD+
Sbjct: 411 ISSHNTEKPMYLQLSHLAPHASDAEEVMEVRDWKETNDTFGYIKDLNRRKYASVVATLDK 470
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SVG V+ AL++ +L N+I++FMADNGA + G +N GSN+PLRG+K T ++G +RGVA
Sbjct: 471 SVGRVVDALKRKDILNNTIIIFMADNGAQTEGYLANYGSNYPLRGVKFTLFEGAVRGVAC 530
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEI 381
I+SP + ++ +VS++LFHI+DWLPTL +AAG +D LDG+NQW + K +R I
Sbjct: 531 IYSPLIDRSSRVSTKLFHITDWLPTLYSAAGGNPSDLDQLDGINQWSTIKKVKIDERRSI 590
Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
L NI DN AAL KY D + D D YS V Y K +
Sbjct: 591 LVNIQE-DNE----AALMG---KYKLVRDRS--------DNQKHYDYYSGNNVSYPKYNV 634
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
T N L + + + +LT KI LR+ A + C + C +
Sbjct: 635 T-NVL----------TSPAASAIASVSAHVLTAAKIKALRKKATIVCKNLTNPSSCKN-- 681
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
+ CLFNI DPCE +L+ T ++++L + YKS ++ N P D P +NN W
Sbjct: 682 RTCLFNIYKDPCEFTDLSSKNTKVVERLSVFIDSYKSVLMKESNTPVDPAGFPCHFNNTW 741
Query: 562 VPW 564
+PW
Sbjct: 742 MPW 744
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 188/316 (59%), Gaps = 28/316 (8%)
Query: 95 MQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGY 154
MQ I GEP GLPL K+LP++L+ GY T+ IGKWHLG+ +TP++RGFD G+
Sbjct: 1 MQGEDIQGGEPRGLPLNVKILPEHLQGLGYTTNLIGKWHLGYHTLQHTPSYRGFDYFCGF 60
Query: 155 WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTEAAINVIAEHNKS 211
+ Y+D+ K +++ G DM D G+ Y TDL+T+ A+ +I H+
Sbjct: 61 YNSHVSYHDY--KYSYQNMSGYDMHCG---DAPAYGLNDKYVTDLFTDKAVKIIENHDSF 115
Query: 212 KPMFLYLAHLAVHAGNTYEPFQAP---DEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+P++L ++HLAVHA P + P D +F+ I + RR YA MVS+LD+SVG ++
Sbjct: 116 RPLYLQISHLAVHA-----PLENPQDYDHSDRRFIHIREQHRRKYARMVSKLDDSVGRIV 170
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
L GML +S++LF+ DNGA G N GSN+PLRG GG+RGV+ +WSP L
Sbjct: 171 HTLGNRGMLRDSLILFLTDNGAAPIGKFRNFGSNYPLRG-------GGVRGVSVLWSPRL 223
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDN 387
+ +V L HISDWLPTL AAA +++D +DGV+QW +L G R ++L NID
Sbjct: 224 NKPARVCDNLIHISDWLPTLYAAADGDLSDLGEIDGVDQWRMLNDGHPIVRDKLLLNIDE 283
Query: 388 VDNPQ----KYYAALR 399
+ + K+Y LR
Sbjct: 284 ISKTEGAIYKHYKLLR 299
>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
Length = 560
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 158/369 (42%), Positives = 231/369 (62%), Gaps = 24/369 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+PH++ IL DD G+ DV +HGS +I TP +D LA G+ L +YVQ LC+PSRS LMTG
Sbjct: 36 KQPHVVFILVDDQGFRDVGYHGS-EIKTPTLDRLAAQGVKLENYYVQPLCSPSRSQLMTG 94
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL LP LK+AGYATH +GKWHLGF++ PT RG
Sbjct: 95 RYQIHTGLQHSIIRATQPNCLPLENVTLPLKLKQAGYATHMVGKWHLGFYKRGCLPTQRG 154
Query: 148 FDSHYGYWQGLQDYYDH-SCKA----TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
FD+ +G G D+Y H C+A ++ Y+G + + G+YST ++T+ AI
Sbjct: 155 FDTFFGSLLGSGDHYSHYKCEAPGMCGYDLYEGEEA-----AWEQDRGLYSTVMFTQKAI 209
Query: 203 NVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+++A+H+ KP+FLYLA+ AVH+ P Q P + ++ IS+ RR YA MVS LD
Sbjct: 210 SILAKHDPHRKPLFLYLAYQAVHS-----PLQVPSRYLERYKGISNVHRRKYAAMVSCLD 264
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
E++ N+ AL+++G +N+++++ +DNG P G GSN PLRG K++ W+GG+R V
Sbjct: 265 EAIRNLTLALKRYGYYDNTVLVYSSDNGGQPLLG-----GSNWPLRGSKASYWEGGIRAV 319
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
+ SP L+ L HI+DW PTL + G +LDG + W+ +++G + R
Sbjct: 320 GFVHSPLLRNKGTKCRSLIHITDWFPTLVSLGEGTLEEHLNLDGYDVWEAISEGRPSPRH 379
Query: 380 EILHNIDNV 388
+ILHNID +
Sbjct: 380 DILHNIDPI 388
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 35/61 (57%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L++ ++K++ +LA Y T VP D R++P +W PW
Sbjct: 459 LFNITADPYERVDLSQRYPQIVKKMLIRLAQYNKTAVPVRYPAKDLRSNPQYNGGVWGPW 518
Query: 565 Y 565
Y
Sbjct: 519 Y 519
>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
Length = 502
Score = 293 bits (749), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 180/486 (37%), Positives = 256/486 (52%), Gaps = 35/486 (7%)
Query: 93 IGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHY 152
+GMQ I GEP GLPL+ ++LP++L+ GY T IGKWHLG+ +TP RGFDS
Sbjct: 1 MGMQGESIQGGEPRGLPLSVRILPEHLRGLGYTTKMIGKWHLGYHTPQHTPLHRGFDSFL 60
Query: 153 GYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK 212
G++ YYD+ K +++ G DM Y TDL+T+ A+ +I H+ S+
Sbjct: 61 GFYNSHVSYYDY--KYSYQNMSGYDMHRGDAPAYGLTDKYVTDLFTDEAVRIIQTHDPSR 118
Query: 213 PMFLYLAHLAVHAGNTYEPFQAP---DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
P++L ++HLAVHA P + P D +F+ I + RR YA MVS+LD+SVG ++
Sbjct: 119 PLYLQISHLAVHA-----PLENPQDYDHYDKRFMHIVEQNRRKYAKMVSKLDDSVGLIVN 173
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
L GML +S++LF+ DNGA G N GSN PLRGMK T ++GG+RGVA +WSP L
Sbjct: 174 TLGSRGMLRDSVILFLTDNGAAPVGKFRNYGSNFPLRGMKYTLYEGGVRGVAVLWSPRLS 233
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ +V + L HI+DWLPTL AAA ++ D +DGVNQW +L G R ++L NID +
Sbjct: 234 KAARVCNNLMHITDWLPTLYAAADGDLRDLGKIDGVNQWRMLNDGYPIVRDKLLLNIDEI 293
Query: 389 DNPQ----KYYAALR--VDDLKYVAGTDNNGQSD--EWYGDTDNEIDKYSPKEVLYSKAG 440
+ K Y LR ++D Y D + + D Y S K + +
Sbjct: 294 SKTEGAIYKQYKLLRGSIEDGHY----DGHHRHDLERGYNILPPFFGNNSRKNIQENVPP 349
Query: 441 ITFNALKTKL-QIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYD-NKGAHCN 498
T L + + Q H P S +++LR A V C ++ + +
Sbjct: 350 YTETILISPVSQSITYHLGGPVTQPS----------TMVQLRNEATVSCRFNVSYYSFVT 399
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
V CLF+I +DPCE N+AE + ++L+ L Y + D ADP R N
Sbjct: 400 CNVTECLFDIENDPCETKNIAEQYPRIARELDQILEQYARNVAMQIKPSVDWLADPRRTN 459
Query: 559 NIWVPW 564
N W PW
Sbjct: 460 NTWEPW 465
>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 519
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 169/427 (39%), Positives = 246/427 (57%), Gaps = 38/427 (8%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
FAL LL + +P+II IL+DD GWND+ +H S I +PNI+AL +G+
Sbjct: 2 IFALLEILLC--FHITVIVGSQPNIIFILSDDHGWNDIGYH-SHIIRSPNINALCNDGVR 58
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +Y+Q CTPSRS LM+G+Y IH G+QH VI +P LPL E L + LKE GYATH
Sbjct: 59 LENYYIQPGCTPSRSQLMSGRYQIHTGLQHSVIRNDQPNCLPLDEVTLAEKLKEVGYATH 118
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDN 186
+GKWHLGF+ PT RGFDS +GY G +DYY H G D+ RH V
Sbjct: 119 LVGKWHLGFYTPSCLPTRRGFDSFFGYLIGQEDYYKHIHDG------GYDLKRHETDVSK 172
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-I 245
+ G Y+T ++T A N+I H+ S P+FLY+++ +VHA Q P+ + I
Sbjct: 173 QYQGDYTTHVFTSEAQNIIKSHDPSTPLFLYMSYQSVHANY----LQVPEHYSNMYNGVI 228
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
D +RR AGMV+ +DE +GN+ L++ + N+I++F +DNG G ++ G+N PL
Sbjct: 229 DDEDRRIVAGMVTCMDEGIGNITQTLKETELWNNTIIIFSSDNG----GDPNDGGNNWPL 284
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTS-LD 362
RG K T W+G + G+ ++S + + + V++E+ H+SDW PT+ AG +N T LD
Sbjct: 285 RGEKGTHWEGAIHGLGFVYSSDIAKDVRGTVNTEMIHVSDWFPTIVNLAGGSLNGTKPLD 344
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNP----QKYY------------AALRVDDLKYV 406
G +QW +++GA++ R+EIL NID + P + ++ A+RV D K +
Sbjct: 345 GFDQWQTISQGAESPRTEILINIDPLVPPPSDMEHFFHGWNGTYNTSMCGAIRVGDWKLI 404
Query: 407 AGTDNNG 413
G G
Sbjct: 405 TGYPGEG 411
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 34/63 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI +DP E +L+E D + +L K+ Y T VP P D ADP+ + IW PW
Sbjct: 436 LFNIAEDPTETTDLSEDNLDKVIELLIKMDEYLLTSVPVVYPPNDPLADPSLNDGIWRPW 495
Query: 565 YDE 567
E
Sbjct: 496 SPE 498
>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 562
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 165/410 (40%), Positives = 234/410 (57%), Gaps = 35/410 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHI+ ++ DD G+ND+ +HGS +I TP +D LA G+ L +YVQ +C+PSRS LMTG+Y
Sbjct: 30 PHIVFVMVDDQGYNDIGYHGS-EIQTPVLDQLAGEGVKLENYYVQPICSPSRSQLMTGRY 88
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
IH G+QH +I +P LP LP+ L+EAGY+TH +GKWHLGF PT RGF
Sbjct: 89 QIHTGLQHSIIRARQPLCLPPDTPTLPERLQEAGYSTHMVGKWHLGFCHPECLPTSRGFQ 148
Query: 150 SHYGYWQGLQDYYD-HSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
S G G D++ SC T G D+ + + G YST LYTE +++ H
Sbjct: 149 SFLGSLTGSGDHFSFQSCDGT--EACGFDLHDGDRPAWELRGNYSTRLYTERVKDILRRH 206
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+ KP+FLY+A AVH P QAP + ++ + + RR YA MVS +DESVG ++
Sbjct: 207 DHRKPLFLYVALQAVHT-----PLQAPGHLLRRYQALGNRPRRHYAAMVSGVDESVGEIV 261
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
+ LR+ G +NS++++ +DNG + G N PLRG K + W+GG+R V + SP L
Sbjct: 262 SELRERGYYDNSVLIYSSDNGGQPL----SGGCNWPLRGGKGSYWEGGVRAVGFVHSPLL 317
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDN 387
K+ VS L HISDW PTL + AG +D++ +DG + W+ ++ G R+EIL NID
Sbjct: 318 KRKGVVSHALIHISDWYPTLLSLAGYRESDSNHVDGQDVWETISSGLPCPRTEILFNIDP 377
Query: 388 VD------NPQ------------KYYAALRVDDLKYVAGTDNNGQSDEWY 419
V NP+ AA+R D K + G +G +W+
Sbjct: 378 VSRRHGDINPKLLTLNGFGIWDTGVRAAIRAGDWKLLTGNIGDG---DWF 424
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 38/63 (60%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +LAE++ +++K L +L+ Y T V P N P D ADP +W PW
Sbjct: 452 LFNITADPYERADLAEARPEVVKVLLTRLSEYNRTAVSPRNPPDDPMADPQLHGGVWTPW 511
Query: 565 YDE 567
+
Sbjct: 512 LGQ 514
>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
purpuratus]
Length = 784
Score = 292 bits (747), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 152/377 (40%), Positives = 226/377 (59%), Gaps = 18/377 (4%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F + ++ N + KP+I+ IL DDLG+NDV +HGS +I TPNID LA G+
Sbjct: 5 LFFINLLIISNVCSKKFSQLTKPNIVFILIDDLGYNDVGYHGS-EIYTPNIDKLAREGVR 63
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +YVQ +CTP+RS L++G+Y IH G+QH I +P LP L+EAGYATH
Sbjct: 64 LENYYVQPICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATH 123
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
A+GKWHLGF+++ PT RGFDS++GY G +DY+ H K L +RH +V +
Sbjct: 124 AVGKWHLGFYKKECLPTQRGFDSYFGYLTGGEDYWTHHRKRPX-----LALRHVDKVAWE 178
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
G YS ++TE ++A+H +P LYL +VH+ P Q P ++ +I +
Sbjct: 179 YGGYYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHS-----PLQVPSSYEERYKNIKN 233
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
RR YAGM++ LDE+VG ++ +L++ G+ +N++++F DNG G + G+N PLRG
Sbjct: 234 TNRRIYAGMMTCLDEAVGKIVHSLQQAGLWDNTVLIFSTDNG----GEVAAGGNNWPLRG 289
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDTSLDGV 364
K + W+GGMRGV + SP L + + V+ +L H+SDW PTL AG +++ +LDG
Sbjct: 290 WKRSIWEGGMRGVGFVNSPLLPASVQGTVNKQLIHVSDWFPTLVQGVAGSSLDNITLDGF 349
Query: 365 NQWDVLTKGAKTKRSEI 381
N W + + + R+ I
Sbjct: 350 NMWKTIREELQKCRTSI 366
>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
Length = 449
Score = 291 bits (744), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 161/389 (41%), Positives = 219/389 (56%), Gaps = 34/389 (8%)
Query: 61 LAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK 120
LA G+ L +Y+Q +C+PSR LMTG+Y I G+QH VI P GLPL E LPQ L+
Sbjct: 2 LASEGVKLENYYIQPICSPSRCQLMTGRYQIRYGLQHSVITSDRPHGLPLDEVTLPQKLR 61
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF----EPYQGL 176
E GY ++ +GKWHLGFFR+ Y P RGFD YGY G +DY+ H + + GL
Sbjct: 62 ENGYRSYIVGKWHLGFFRKEYMPLQRGFDRFYGYLTGGEDYWTHRRPNGYARDPSAFHGL 121
Query: 177 DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
D+R + G YST L+ + AI I H +SKPMFLYL AVH+ P +AP
Sbjct: 122 DLRDQDKPVLDQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHS-----PLEAPT 176
Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
+ V K+ D++ P + YA MV+ +DE+VGNV AL++ G+ +N++++F DNGA
Sbjct: 177 KYVEKYKDVTHPLIKIYAAMVTAMDEAVGNVTDALKRTGLWDNTVLIFSTDNGA-----R 231
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
N GSN PLRG K+T W+GG+RGV + S L++ + S L HISDW PTL +
Sbjct: 232 ENAGSNWPLRGWKNTLWEGGVRGVGFVNSKLLRKGNRKSDALIHISDWFPTLLRISRSVS 291
Query: 357 NDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---------------KYYAALRV 400
N T LDG + WD ++ G + R EILHNID + YAA+R
Sbjct: 292 NGTKPLDGFDIWDTISTGTPSPRKEILHNIDPLHRKTASSSFNLLYDDIFNISVYAAIRS 351
Query: 401 DDLKYVAGTDNNGQSDEWYGDTDNEIDKY 429
D K + G +G + D+D K+
Sbjct: 352 GDWKLLTGDQGHGG----WKDSDTSGSKF 376
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 491 DNKGAHCNSTVKP-----CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
D G+ V P LFNI +DP E+++L++ D++++L KLA Y T +PP
Sbjct: 370 DTSGSKFEYAVDPPTKHLWLFNIRNDPQERSDLSDRYPDVVQELLQKLAAYNKTALPPFW 429
Query: 546 KPFDKRADPARWNNIWVPW 564
D RA+PA +++ PW
Sbjct: 430 PSRDPRANPALHGDLFGPW 448
>gi|241601772|ref|XP_002405049.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215500554|gb|EEC10048.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 451
Score = 291 bits (744), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 183/484 (37%), Positives = 246/484 (50%), Gaps = 41/484 (8%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
I GMQ VI + PWGLPL K++PQ+ K+ GY + IGKWHLGFF+ YTP RGFDS
Sbjct: 1 ILSGMQAEVIRDAAPWGLPLELKIMPQHFKDLGYQVNMIGKWHLGFFQTPYTPPKRGFDS 60
Query: 151 HYGYWQGLQDYYDHSCKATFEP--YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
+GY+ G DYY+H+ P Y + V K YST + + +
Sbjct: 61 FFGYYTGENDYYNHTAGTLNRPPLYGTFKTKQTRCVLAK----YSTHTWGLLEAFPLLLN 116
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
S+P F Y ++ AVHA PFQAP+ V KF I + R YAGM+ LDES+G V+
Sbjct: 117 LHSQPFFCYFSYQAVHAALMNVPFQAPERNVLKFPYIGETNRTLYAGMLDSLDESIGRVV 176
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
L GMLE++I+ F +DNG + SN+G N PLRG K T W+GG+R A +WS
Sbjct: 177 EILGNAGMLEDTIIAFSSDNGGLPYNAFSNRGFNWPLRGAKGTLWEGGVRVPAFVWSSKF 236
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
+ +VS +L HISDWLPTL +AAG ++ N SLDGVN W LT G + R E+L ID
Sbjct: 237 LKKPRVSDQLMHISDWLPTLYSAAGGDVSNLGSLDGVNMWRYLTLGLGSARKEVLLTID- 295
Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG-----DTDNEIDKYSPKEVLYSKAGIT 442
P A LR + K V G + + D+ Y N++ + K+ ++ T
Sbjct: 296 ---PMYNLAGLRYKNYKLVVGEGFDEELDDRYAFPGGLRPTNDVPQLR-KDSKVARVLKT 351
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVK 502
F + + + +P S D LT+ + +
Sbjct: 352 FYGKQNRNWSPLDWSKNP---SVDCRERWLTNNFV---------------------ARRP 387
Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
P LFN+ DPCE NNLA S+ +L + KLA Y +T VPP P D R P +W
Sbjct: 388 PYLFNLARDPCELNNLAGSKRIILHFMMHKLAAYNATAVPPRFLPEDPRGYPEYHGGVWS 447
Query: 563 PWYD 566
PW D
Sbjct: 448 PWLD 451
>gi|242002216|ref|XP_002435751.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215499087|gb|EEC08581.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 433
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 180/480 (37%), Positives = 252/480 (52%), Gaps = 57/480 (11%)
Query: 92 HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL--GFFREVYT-PTFRGF 148
+ GMQH V++E EP GLPL KLLP++L + GY + K L R VY P R
Sbjct: 1 YTGMQHFVLMEMEPRGLPLNFKLLPEWLGDLGYVHNQTTKQTLLHKELRFVYAFPYLR-- 58
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
+G D R + + G Y T+L TE AI +I H
Sbjct: 59 -------------------------RGFDFRRGLLGGREFDGRYYTELMTEEAIRIIETH 93
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
KP+FLYL+HLA H+GN +P Q P + + DI R YAGMVS LDESVG V+
Sbjct: 94 PLEKPLFLYLSHLAPHSGNLRDPLQVPKKFSDMYHDIGSRNRTIYAGMVSALDESVGAVV 153
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
AL + GML +++++F +DNGA S G + N S P +G K TPW+GG+R A IWSP L
Sbjct: 154 EALGRRGMLGDTVLVFSSDNGANSRGENPNYASAWPFKGQKITPWEGGVRAPAIIWSPSL 213
Query: 329 KQTQ-KVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
TQ + S +FHISDWLPTL AG + +D +DG++Q + L G + R E+L NID
Sbjct: 214 SGTQGRDYSNIFHISDWLPTLYQLAGGDPSDLGDIDGISQLEALRHGTEAPRKEVLINID 273
Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNAL 446
+DN A+ K V+GT G D+W+ N + ++
Sbjct: 274 PIDN----VTAIIEGHFKLVSGTVKGGALDDWFQIPGN----------------VAWDTD 313
Query: 447 KTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLF 506
+T+ + + AA NS + T + + + V+C + C T+ PCLF
Sbjct: 314 RTRHECETSVAAQVLRNSGHDV-TCGSGQGSYAI----PVKCGRRDTSKPCTPTLAPCLF 368
Query: 507 NITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
++++DPCE NN+AE ++++++L KLA+YK+T PGN P D RA PA N WV W D
Sbjct: 369 DLSEDPCEYNNVAEQHSEVVRRLLRKLAVYKATSWSPGNVPEDPRAHPALHNGAWVSWGD 428
>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 539
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 157/377 (41%), Positives = 218/377 (57%), Gaps = 14/377 (3%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
LT +L A ++PHI+ ILADDLG+ DV + S + TPNID LA G+ L +
Sbjct: 42 LTRLILLTSYLTVIQATQRPHIVFILADDLGYFDVGYRNGSIVKTPNIDKLAAEGVKLER 101
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVI-LEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
HY Q C PSRS LM G+Y IH G + G + +P LKE GYATH +
Sbjct: 102 HYAQPSCMPSRSCLMMGRYQIHTGFNYKCTDGSGSQLCMHPDTITIPMKLKENGYATHMV 161
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDNKT 188
GKWHLG R P +GFD+ +GY +DYY H A E D+ R+ V +
Sbjct: 162 GKWHLGNIRWECLPNAKGFDTFFGYHGASEDYYTHFSPAGRE---CRDLWRNRDDVAQEY 218
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
G YST ++T A+N+I H+ SKPMFLYL + AVH P Q P++ + D
Sbjct: 219 YGQYSTHIFTNEALNIIENHDVSKPMFLYLPYQAVHG-----PLQVPEKYYNMYERPLDD 273
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
RR YA M + +DE +GNV AL++ GM +N++ +F++DNGA G N PLRG
Sbjct: 274 IRRKYAAMATCMDEGIGNVTNALKQRGMWDNTVFIFVSDNGAQ----FPTGGRNWPLRGG 329
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWD 368
K + ++GG+R V+ + SP +++ Q+ S+E+ H++DWLPTL AG ++D LDGVN WD
Sbjct: 330 KGSVFEGGIRVVSFVTSPLIERPQRSSNEMIHLTDWLPTLTHLAGGTVDDVELDGVNMWD 389
Query: 369 VLTKGAKTKRSEILHNI 385
++KG+ T R E+L I
Sbjct: 390 TISKGSATPRKELLVQI 406
>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 514
Score = 285 bits (729), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 160/384 (41%), Positives = 231/384 (60%), Gaps = 18/384 (4%)
Query: 32 IIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPI 91
III+ GWNDV +H + +PTPN++ LA G+IL+ Y Q +CTPSR ALMTGKYP
Sbjct: 41 IIILTISASGWNDVGWH-NDFMPTPNLNTLAREGVILDNMYSQPICTPSRVALMTGKYPA 99
Query: 92 HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
+GMQH V+L P+ LP L + LKE GY H +GKWHLG YTP +RGFDSH
Sbjct: 100 KVGMQHFVVLPMRPYYLPGNYATLAEKLKEQGYTNHIVGKWHLGSCDWKYTPMWRGFDSH 159
Query: 152 YGYWQGLQDYYDHSCKATFEPYQGL---DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
YG +G+ ++ + T+ P G+ D+R N + G ++T L++E A ++ H
Sbjct: 160 YGCHEGVTSNFE-THMLTWPPVVGVSGRDLRDNTGLVTHENGTHNTMLFSERAERIVKNH 218
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
N P+FLY+ ++A H P QAP + + + + D +RRT+AGM++ +D+ VGN+
Sbjct: 219 NPESPLFLYVPYMAPHF-----PLQAP-QGFEEAVQLDDTDRRTFAGMMAAMDQGVGNIT 272
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
AL+ GM ++++ +F++DNG G GSN+P RG K+T W+GG++ A I L
Sbjct: 273 QALKDKGMWDDTLFVFLSDNG----GDKMFAGSNYPYRGNKATMWEGGVKVPAFIRGSML 328
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
K + V++EL+H +D TL AG D +DG NQWD KG +KRSEIL ++D
Sbjct: 329 KTSGTVNNELYHFTDIFATLLKVAG-GTPDEDIDGKNQWDSFCKGKPSKRSEILLHLD-- 385
Query: 389 DNPQKYYAALRVDDLKYVAGTDNN 412
++P AALR+ D KY+ G N
Sbjct: 386 EHPVTNGAALRMGDYKYMEGVTIN 409
>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
Length = 506
Score = 285 bits (728), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 179/546 (32%), Positives = 277/546 (50%), Gaps = 65/546 (11%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
L+ KKP+++I++ADDLGWND+S H +S + TPN+D L L L +YV +CTP+R
Sbjct: 11 LSVAIAKKPNVLILIADDLGWNDISLH-NSYLSTPNVDGLIQESLHLQSYYVNPICTPTR 69
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
S LM+G+Y IH G+QH VIL +P GLPLT+ + P+ K+ GY TH +GKWHLGF+ E
Sbjct: 70 SVLMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTHMVGKWHLGFYDEKM 129
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
P RGF+SHYGY G + +Y+HS + G+D R N + G YS DL+ +
Sbjct: 130 VPENRGFESHYGYLIGAEGHYNHS--QFMQGQNGVDFRDGGASTNSSWGQYSADLFAKRV 187
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+++ H+ + +++Y+ VH P +AP V +F I D +RR YA M +D
Sbjct: 188 EDLVEAHDVEESLYMYVGLQNVHY-----PLEAPQHYVDQFSWIKDRDRRVYAAMTKSMD 242
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
++VG +I + + G+ E++IV F DNG + G+N PLRG+K+T W+GG+RG+
Sbjct: 243 DTVGRIINSFKNKGIWEDTIVYFTTDNGGSTL----YGGNNWPLRGLKNTLWEGGIRGIG 298
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSE 380
+I P + ++ +L H+ D PTL DG +Q +L +++
Sbjct: 299 SIKIPGVSADKR--DQLMHVVDMFPTLLDLTSCPNTRKVKFDGKSQAKMLVDNDESQSDS 356
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
L NI D L+ G DN W+ D D + +AG
Sbjct: 357 FLINI---------------DPLRKNKGADNR----TWHSDFDVRV-----------QAG 386
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRT--ILTDEKILELREFARVRCNYDNKGAHCN 498
+ + K A SSD T I E + R+ + + N++ +
Sbjct: 387 LRWQQWKLLT----------GAPSSDNYPTGHIYPPEWPQDGRKNS--QSNFE------S 428
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
S + LF+I +DP E ++ T++++++ + L+ Y ST VP D P +
Sbjct: 429 SNLNVRLFDIDNDPLEDFEISNQHTEIVEKMLNMLSSYNSTAVPCRWPNPDPAGLPDQNG 488
Query: 559 NIWVPW 564
W PW
Sbjct: 489 GFWKPW 494
>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
Length = 538
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 176/471 (37%), Positives = 272/471 (57%), Gaps = 28/471 (5%)
Query: 8 FFALTCTLLFNDAFLN---TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
F ++ ++++ F++ A +KP+I+II+ADDLGWNDV +H +S+I TPN+D LA +
Sbjct: 18 FISVAIAAIYSNFFVSDGVRAAAQKPNIVIIMADDLGWNDVGYH-NSEIKTPNLDKLAES 76
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
L++ YV + CTP+R+ALMTG++P GM GVI + GLPL EK + Q LKEAGY
Sbjct: 77 STRLDRFYVTSSCTPTRAALMTGRHPSRYGMSSGVIWPWDKVGLPLEEKTIAQTLKEAGY 136
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ- 183
T +GKWHLG ++E Y PT RGFD HYG++ G DY+ H A + GLD N Q
Sbjct: 137 YTAIVGKWHLGHYKEEYLPTRRGFDYHYGHYCGSIDYFTHQLDAGIQ--GGLDWHRNEQP 194
Query: 184 VDNKTIGIYSTDLYTEAAINVI--AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
V+ + Y+TDL + A+ +I ++NKS P+FLY++ A HA P QA ++++
Sbjct: 195 VEEEG---YATDLLAQEAVKLIRDCDYNKS-PLFLYVSFNAPHA-----PLQAKEKDIKN 245
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
+ +I D RR +A V +DE+VG ++AAL++ + +N+++ F +DNG S +G
Sbjct: 246 YANIQDEGRRIFAAQVQSMDEAVGRIVAALKEKQVWDNTLLFFTSDNGGGS-DQPWTRGD 304
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTS 360
N PLRG K T +DGG+R I P + +V ++F + D PTL AG+ E +
Sbjct: 305 NRPLRGQKGTLYDGGVRVPTIISFPAQLKGGQVIEQVFSVVDLYPTLAKLAGVKESSQNQ 364
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
LDGV+ + + G+ T+R E+L + +P + AAL ++ K V N+ +YG
Sbjct: 365 LDGVDILESIATGS-TRRDEVLIQYQALASPPWFKAALVKENYKLVL---NSTSPSPYYG 420
Query: 421 DTDNEIDKYSPKE---VLYSKAGITFNALKTKLQIKQKHAADPKANSSDAL 468
+ K P E + YS+ L+ +++ +K A P N+S +
Sbjct: 421 IAELFNLKNDPLETRNLAYSEPEKVLE-LQKRVEAHKKEAKSPILNNSTEM 470
>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
Length = 502
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 168/388 (43%), Positives = 225/388 (57%), Gaps = 31/388 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+I+ ILADDLGW DV FHGS I TPNID LA G L Q+YVQ +CTP+R+A MTG+
Sbjct: 59 KPNIVYILADDLGWKDVGFHGSD-IKTPNIDELAEKGARLEQYYVQPMCTPTRAAFMTGR 117
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP GMQ VI +G +GL L + LLP+ LKEAGYAT A GKWHLG + + P RGF
Sbjct: 118 YPFRYGMQTAVIPQGGTYGLALDDHLLPELLKEAGYATAASGKWHLGHAKTAFWPRQRGF 177
Query: 149 DSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
DS YG G D++ H S + Y+ D DN L+ + A+ VI E
Sbjct: 178 DSFYGALLGEIDHFTHKSANGNADWYRNNDALEEEGFDNV--------LFADEAVRVINE 229
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H+++KP+FLYLA + H PFQAP E + + I+D RR YA M+S +D+ +G V
Sbjct: 230 HDQAKPLFLYLAFTSPHT-----PFQAPKEFLERNSHIADESRRNYAAMISVMDDGIGRV 284
Query: 268 IAALRKHGMLENSIVLFMADNGA---PSFGIHSN-KGS----NHPLRGMKSTPWDGGMRG 319
+ AL GM EN++++F +DNG F SN KG N P R K T ++GG R
Sbjct: 285 VEALEMRGMRENTLIVFHSDNGGVVDAFFAGESNVKGKLPADNGPYREGKGTTYEGGTRV 344
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
V + P K ++ + H+ D PTL A AG E+ + LDG++ W VL +G + R
Sbjct: 345 VGLVNWPG-KIEPGTANGMMHVVDMYPTLAALAGAELGKNKPLDGMDMWPVLGEGKPSPR 403
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYV 406
+EI++NI+ + AA+R D K V
Sbjct: 404 TEIVYNIEPMT------AAVRQGDWKLV 425
>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 153/364 (42%), Positives = 219/364 (60%), Gaps = 16/364 (4%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
TA +PHII ILADDLG+ DV + S I TPNID LA G+ L +HY Q C PSR+ L
Sbjct: 7 TATHRPHIIFILADDLGYFDVGYRNRSVIKTPNIDKLAAEGVKLERHYAQPSCLPSRACL 66
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFREVYT 142
M G+Y IH G + + + + + + LP +K+ GY TH IGKWHLG
Sbjct: 67 MMGRYQIHTGYRDECMNDTR-YQRCMNHDIVTLPMKMKQNGYVTHMIGKWHLGNNNWDCL 125
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDNKTIGIYSTDLYTEAA 201
P +GFD+++GY +DYY H D+ R M V +K IG YST ++TE A
Sbjct: 126 PNAKGFDTYFGYNAAAEDYYTHMLSGR---QNCSDLWRDRMDVADKYIGQYSTRIFTEEA 182
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+N+I H+ S+PMF+YLAH AVHA P Q P E + + + RR YA MV+ +D
Sbjct: 183 VNIIENHDISQPMFMYLAHQAVHA-----PLQVPQEYYSLYETTLNDVRRKYAAMVTCMD 237
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E +GNV AL++ GM +N++++F++DNG G S G N PLRG K + ++GG+R V+
Sbjct: 238 EGIGNVTEALKESGMWDNTVLVFVSDNG----GELSAGGRNWPLRGGKGSVFEGGIRTVS 293
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEI 381
+ S +++ Q++S+ + HI+DW PTL AG I+D +DG+N WD ++ G+ R+E+
Sbjct: 294 FVTSSLIERPQRISNAMIHITDWFPTLTHLAGGTIDDEMVDGINMWDTISTGSVEPRTEL 353
Query: 382 LHNI 385
L I
Sbjct: 354 LIKI 357
>gi|291225021|ref|XP_002732506.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 282 bits (722), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 178/542 (32%), Positives = 270/542 (49%), Gaps = 80/542 (14%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHII+I DDLGWNDV +H + I PN+ LA +G+I NQ Y CTPSR+A+MTG
Sbjct: 26 QPHIILIFVDDLGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGL 84
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP G QH ++ P GLPL KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGF
Sbjct: 85 YPFKTGNQHQMLFNLHPSGLPLEFKLLPEKLKEIGYSTHMVGKWHLGFCKDEYLPTNRGF 144
Query: 149 DSHYGYWQ-GLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
DSHYG W G+ DY P +G D R N V K+ G Y + + A +++
Sbjct: 145 DSHYGIWTLGVGDY--DKMNGVLSPSKGYDFRDNTGVVQKSNG-YLALMLGDRAEHIVNT 201
Query: 208 HNKSKPMFLYLA------HLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
H P+FL HLA+ P+E K+ DI D R + G ++ +D
Sbjct: 202 HYPGTPLFLAFTLDIPAKHLAI-----------PEEYENKYADIEDSRTRHFYGKLTMMD 250
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+ +G V+ AL+ G+ ++S+++F+ DNGA + S GSN+P RG+ T ++G R +
Sbjct: 251 DIIGGVVDALKNRGIYDDSLIIFIGDNGA----LSSQAGSNYPFRGIAGTLFEGATRVPS 306
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEI 381
+ +K+T S+EL+ I D T+ AG + + LDG++ W +KG + R+EI
Sbjct: 307 IVSGKGIKKTGYESNELYSIVDIHRTILDVAGAKA-EPELDGLSMWPTWSKGKPSPRNEI 365
Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
++NID D+P AA+R+ D K + G + G TDN + P
Sbjct: 366 IYNID--DDPVSPGAAIRIGDYKLITGHPDLLYPYRDIGITDNYYNYGDP---------- 413
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
L + L + A P + L ++
Sbjct: 414 ----LASGLPTADQAAMAPPPLDVEYLFNVIV---------------------------- 441
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
DP E+N+++ D+++ L D+L ++ ++PP N+ + A +N++W
Sbjct: 442 ---------DPEERNDISADHPDIVQDLRDRLDEHRKLLIPPVNQTTELAGSAANFNDVW 492
Query: 562 VP 563
P
Sbjct: 493 SP 494
>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
Length = 357
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 153/360 (42%), Positives = 216/360 (60%), Gaps = 18/360 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPHI++I+ADDLGW+DVS++ + + PN+ LA G+I NQ Y Q LCTPSR+AL+TGK
Sbjct: 7 KPHILLIVADDLGWSDVSWN-NPNVVMPNLHTLATTGVIFNQTYSQRLCTPSRTALLTGK 65
Query: 89 YPIHIGMQ-HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+P +GMQ + E GLPL E+LLPQ LK+ GYATH +GKWHLG + YTPT RG
Sbjct: 66 FPYRLGMQVQKSMFEKNSHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERG 125
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS YG+ G +DYY H + +GLD + G+YST+ + A N+I++
Sbjct: 126 FDSFYGFHHGGEDYYTHMSE------RGLDFWDGRTSVSDRNGVYSTESFARRAENIISQ 179
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H+ + P+FLYL +VH P Q P + F I D R++ GM + LD+++ V
Sbjct: 180 HDPNTPLFLYLPFQSVHT-----PHQVPSSYLQTFSTIQDDNRKSILGMATALDDAIKRV 234
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
AL+ G+ +N++ +FM+DNG G + SN PLRG K T W+GG R A +
Sbjct: 235 TDALKGKGLWDNTLTIFMSDNG----GNYVQGQSNWPLRGAKGTLWEGGTRVPAFVHGNM 290
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
L++T + H D LPTL + AG D LDG N W ++ GA + R+E +++ID+
Sbjct: 291 LERTGYTYHGMMHGVDILPTLVSVAG-GTEDAGLDGKNMWQSISTGADSPRTEFVYDIDS 349
>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
Length = 621
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 151/336 (44%), Positives = 204/336 (60%), Gaps = 18/336 (5%)
Query: 58 IDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQ 117
+D LA G+ L +YVQ +CTPSRS +TGKY IH G+QH +I +P LPL LPQ
Sbjct: 80 LDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 139
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
LKE GY+TH +GKWHLGF+R+ PT RGFD+ +G G DYY H K G D
Sbjct: 140 KLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYD 198
Query: 178 MRHNMQVDNKTI----GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQ 233
+ N DN G+YST +YT+ ++A H+ KP+FLY+A+ AVH+ P Q
Sbjct: 199 LYEN---DNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQ 250
Query: 234 APDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
AP + I + RR YA M+S LDE++ NV AL+ +G NSI+++ +DNG
Sbjct: 251 APGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNG---- 306
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
G + GSN PLRG K T W+GG+R V + SP LK V EL HI+DW PTL + A
Sbjct: 307 GQPTAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAE 366
Query: 354 IEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+I+ DT LDG + W+ +++G ++ R +ILHNID +
Sbjct: 367 GQIDEDTQLDGYDVWETISEGLRSPRVDILHNIDPI 402
>gi|390341601|ref|XP_796347.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 497
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 185/546 (33%), Positives = 263/546 (48%), Gaps = 93/546 (17%)
Query: 29 KPHIIIILADDLGWNDVSFHG---SSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
+P+++ ILADD G+NDV +HG S + TPN+D LA G+ L ++YVQ +C+P+RS L+
Sbjct: 34 RPNVVFILADDFGYNDVGYHGREHGSMVLTPNLDGLAGEGVKLEKYYVQPICSPTRSQLL 93
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
+G+Y IH G+Q+GVI +P LPL E LPQ LKE YATH +GKWH+GF+++ TPT
Sbjct: 94 SGRYQIHTGLQYGVIRPAQPHCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTPTE 153
Query: 146 RGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFDS++GY G +DYY H S + + +GLD+ N + G YST L+T AI+
Sbjct: 154 RGFDSYFGYLSGAEDYYSHSRSFQIGSKTLKGLDLMANKTPAFQYKGQYSTHLFTSKAID 213
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
VI H +SKP+FLYLA+ AVH+ P Q P + + +I+ RR YAGMVS +DE
Sbjct: 214 VINNHERSKPLFLYLAYQAVHS-----PLQVPSKYEEPYANITSSARRAYAGMVSCMDEG 268
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+GNV AL G+ N+I++F + G + L G T
Sbjct: 269 IGNVTRALVDAGLYNNTIIIF-------------STGMSSSLVGKNMTD----------- 304
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
++T L + + AA DT G++ + R E+LH
Sbjct: 305 -----RETNSCGKLLIVGLACVDIVNVAAVYPTEDTDQKGMD---------PSPRQELLH 350
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
N+D +D D G + + G E K KA +
Sbjct: 351 NLDPLD-----------------VAADFEGDA-KCSGKKTTECRMIPGKFNTSMKAALRV 392
Query: 444 NALKTKLQIKQKHAADPKAN-----SSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
K + K++ D + N L TI +K +
Sbjct: 393 GDWKIQTGSGAKYSGDVQVNLWKPPPESGLHTIYPVQKPKQ------------------- 433
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
K LFNIT DP E+N+L+++ D++ QL +L Y VPP P D ADPA
Sbjct: 434 ---KVWLFNITADPLEENDLSDTYPDVVAQLLHRLQYYYKDSVPPRYPPPDVNADPALHG 490
Query: 559 NIWVPW 564
W PW
Sbjct: 491 GAWGPW 496
>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
Length = 353
Score = 279 bits (714), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 156/358 (43%), Positives = 214/358 (59%), Gaps = 18/358 (5%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHI++I+ADDLGW+DVS++ + + PN+ LA G+I NQ Y Q LCTPSR+AL+TGK+
Sbjct: 1 PHILLIVADDLGWSDVSWN-NPNVVMPNLHTLATTGVIFNQTYCQRLCTPSRTALLTGKF 59
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
P +GMQ I + GLPL E+LLPQ LK+ GYATH IGKWHLG + YTPT RGFD
Sbjct: 60 PYRLGMQR-PIRHKKAHGLPLDEELLPQKLKKLGYATHMIGKWHLGCCKWEYTPTERGFD 118
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
S YGY +G QDYY H GLD + G+YST+ + A N+I++H+
Sbjct: 119 SFYGYHRGSQDYYTHMSDG------GLDFWEGKTAISDQNGVYSTESFATRAENIISQHD 172
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+ P+FLYL VH P Q P + F I D R++ GMV+ LD+++ V
Sbjct: 173 PNTPLFLYLPLQPVHT-----PHQVPSSYLQTFSTIQDHNRKSILGMVAALDDAIKRVTD 227
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+ G+ N++ +FM+DNG G + + SN PLRG K T W+GG R A L+
Sbjct: 228 ALQGKGLWNNTLTIFMSDNG----GDYLDGQSNWPLRGAKGTVWEGGTRVPAFAHGNMLE 283
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
+T + H D LPTL + AG D LDG N W ++ GA++ R+E +++ID+
Sbjct: 284 RTGYTYHGMMHGVDILPTLVSVAG-GTEDAGLDGKNMWQSISTGAESPRTEFVYDIDS 340
>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 190/548 (34%), Positives = 270/548 (49%), Gaps = 74/548 (13%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+I+ +LAD+LGW+D+ +HGS ++ TP++D LA G+ L +YVQ +CTP+RS LM+G+
Sbjct: 22 QPNIVFVLADELGWHDIGYHGS-RVQTPHLDKLASEGIKLENYYVQPMCTPTRSQLMSGR 80
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH VI + LPL E + Q LKEAGY+TH +GKWHLG + + PT RGF
Sbjct: 81 YQIHTGLQHTVINPDQRSCLPLDEVTIAQKLKEAGYSTHMVGKWHLGHYTKGCLPTKRGF 140
Query: 149 DSHYGYWQGLQDYYDHS----CK---ATFEPYQGLDMRHNMQ--VDNKTIGIYSTDLYTE 199
DS +G++ DYY + CK T +G D+ N + V G Y T + +
Sbjct: 141 DSFFGFYNCAVDYYTYEKGKFCKFENETVLRMRGTDLWRNDEEHVAPYYQGHYQTHVLAK 200
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
A +VI +HN SKP+FLYLA AVH P + P + D+ D RR M +
Sbjct: 201 EAEDVIRKHNPSKPLFLYLAFGAVHV-----PLEVPKVYEDMYADVKDNSRRILLAMATC 255
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+D+S+ +V AL+ G+ +N+I +F DNGA + + G + PL+G K++ ++GG+R
Sbjct: 256 MDDSIADVTGALKDTGLWDNTIFIFSTDNGAAT----AYGGCSWPLKGGKTSLFEGGIRA 311
Query: 320 VAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKT 376
V + SP L + + EL H+SDW PTL AG E++ T LDG NQW+ ++ G +
Sbjct: 312 VGFVTSPLLPPDMQGTENMELMHVSDWFPTLVHVAGGELSGTKPLDGYNQWETISLGRPS 371
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
+R+EIL NID + + + D K+ + EW T N Y Y
Sbjct: 372 QRTEILINIDPLIPCEACANHTKWDHKKFDVHVHAAIRMGEWKLITGNPHHNYWVTPAEY 431
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH 496
+ K+ AADP N D L
Sbjct: 432 PNMKNIYLNEPGKMVWLYNIAADP--NEQDDL---------------------------- 461
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
S P + NI L D+LA Y T VP D ADPAR
Sbjct: 462 --SDTFPGIVNI--------------------LLDRLAEYNETAVPVQFPDCDTGADPAR 499
Query: 557 WNNIWVPW 564
W PW
Sbjct: 500 RGGSWGPW 507
>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 505
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 155/362 (42%), Positives = 218/362 (60%), Gaps = 17/362 (4%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P+II ILADDLG+ DV + S + TPNID LA G+ L ++Y Q+ C PSRS LM G
Sbjct: 24 EQPNIIFILADDLGYFDVGYREGSIVKTPNIDKLAAEGVKLERYYAQSSCMPSRSCLMMG 83
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
+Y IH G + L+G+ L + LP L++ GYATH IGKWHLG R+ PT
Sbjct: 84 RYQIHTGFDYRC-LDGQLTRLCMAPDTVTLPMKLRQYGYATHMIGKWHLGHERKECVPTH 142
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAINV 204
+GFD+ +GY ++YY H+ A P + D+ NM+ V G YST YT+ A ++
Sbjct: 143 KGFDTFFGYHGAAENYYTHT--ALGRP-RCHDLWRNMENVAEDYDGQYSTLFYTKEAQDI 199
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE-EVAKFLDISDPERRTYAGMVSRLDES 263
I H+K KP F+YL++ AVH P + P + + D R+ YA M + +DE
Sbjct: 200 IKNHDKKKPFFMYLSYQAVHG-----PLEVPSNYRMPYYTSGVDESRQIYAAMATCMDEG 254
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+GN+ L+ GML N+I++F +DNGA H+ G N PLRG K T W+GG+R +A +
Sbjct: 255 IGNITETLKASGMLNNTILVFSSDNGAQ---FHTG-GRNWPLRGGKGTMWEGGVRVLAFV 310
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
SP LK+++ V+ E+ H SDW PT AG ++D LDGV+QWD ++ GA TKR E+L
Sbjct: 311 NSPLLKKSRTVNHEMIHTSDWFPTFVRLAGGTVDDVELDGVDQWDTISTGAPTKRQELLI 370
Query: 384 NI 385
I
Sbjct: 371 QI 372
>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 571
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 160/406 (39%), Positives = 215/406 (52%), Gaps = 31/406 (7%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P+ PH I I+ DD G+ D+ +HGS I TP +D LA G+ L +YVQ +C+PSRS LMT
Sbjct: 43 PRPPHFIFIMVDDQGYGDIGYHGSD-IHTPVLDRLAAEGVKLENYYVQPICSPSRSQLMT 101
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y IH G+QH +I +P LP LP+ L EAGYATH +GKWHLGF R PT R
Sbjct: 102 GRYQIHTGLQHSIIRPRQPLCLPPDSPTLPERLAEAGYATHMVGKWHLGFCRPSCLPTGR 161
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
GF S G G D++ + SC G D+ + + G YST LY + ++
Sbjct: 162 GFQSFLGTLTGSGDHFSYQSCDGA--EACGFDLHDGDRPAWEMAGNYSTLLYIDRVKQIL 219
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
H+ P+FLYL+ A H P Q PD + + + RR YA M+S LD V
Sbjct: 220 KRHDPHTPLFLYLSLQAAHT-----PLQVPDHFLHLYDSHRNRLRRHYAAMLSCLDSGVA 274
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
V+ L+ G+ ENS++++ +DNG + GSN PLRG K T W+GG+R V + S
Sbjct: 275 QVVQELKTQGLYENSVLIYSSDNGGQPL----SGGSNWPLRGGKGTYWEGGIRAVGFVHS 330
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
P LK+ VS L H+SDW PTL AG + + LDG + W +++G R+EIL NI
Sbjct: 331 PLLKKKGIVSRALIHVSDWYPTLLGLAGAQQSHHGLDGHDVWGSISEGLPCPRTEILFNI 390
Query: 386 DNVDNPQ------------------KYYAALRVDDLKYVAGTDNNG 413
D V AALR D K + G +G
Sbjct: 391 DPVSRKAGEPYDKALILSGFGIWDTAVRAALRAGDWKLLTGNVGDG 436
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 35/60 (58%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFN++ DP E+ +LAE++ +++K L +LA Y T V N P D ADP W PW
Sbjct: 467 LFNVSADPYERYDLAETRPEVVKHLMTRLAEYNQTAVMARNPPDDPMADPELHGGAWSPW 526
>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 473
Score = 278 bits (712), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 156/363 (42%), Positives = 212/363 (58%), Gaps = 46/363 (12%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHI+ ILADDLGW+DV +H S+I TPNID LA G+ L +YV LCTPSR+ LMTG+
Sbjct: 23 RPHILFILADDLGWHDVGYH-DSEIQTPNIDMLAAEGVKLENYYVTPLCTPSRAVLMTGR 81
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH GMQHGV++ P LP E LLPQ LK++GY+TH +GKWHLGF + TP RGF
Sbjct: 82 YLIHSGMQHGVLVAQNPRCLPTDEILLPQMLKDSGYSTHMVGKWHLGFCKFQCTPNHRGF 141
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D+ +G++ A P+ R A H
Sbjct: 142 DTFFGWYNA----------AVISPFISCRRR-------------------SAGCEFRVSH 172
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVGNV 267
+ PMF+YL AVHA P Q PD+ + D I D RRTYAGM S +DE++GN+
Sbjct: 173 T-TVPMFIYLPFQAVHA-----PLQVPDKYSDMYKDTIHDESRRTYAGMTSCMDEAIGNI 226
Query: 268 IAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
L+ G+ N++++F DN GA +FG SN+PLRG K++ ++GG+RG A + SP
Sbjct: 227 TRTLKDKGIWNNTVIVFSTDNGGARTFG-----ASNYPLRGQKASNFEGGIRGPAFVSSP 281
Query: 327 WLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
L + +++EL +I DW PT A G I LDG+NQW+ +++G +KR EI++
Sbjct: 282 LLDPLVRGTINNELMYIGDWFPTFMNLAGGHVIGSKPLDGINQWETISRGVPSKRVEIVY 341
Query: 384 NID 386
NID
Sbjct: 342 NID 344
Score = 46.2 bits (108), Expect = 0.047, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 1/61 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN-KPFDKRADPARWNNIWVP 563
L++I +D E+ NL + D++ L +L + +T VPPG F RADPAR W P
Sbjct: 412 LYHIKEDAEEKVNLVHYRPDIVFMLLQRLHYHNTTAVPPGEVDKFSVRADPARHGMAWNP 471
Query: 564 W 564
W
Sbjct: 472 W 472
>gi|291225029|ref|XP_002732496.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 277 bits (709), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 177/537 (32%), Positives = 273/537 (50%), Gaps = 70/537 (13%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PH+I+I DD+GWNDV +H + I PN+ LA +G+I NQ Y CTPSR+A+MTG
Sbjct: 26 QPHVILIFVDDMGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGL 84
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP G QH ++ P G+PL KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGF
Sbjct: 85 YPFKTGNQHQMVFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGF 144
Query: 149 DSHYGYWQ-GLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
DSHYG W G+ DY P G D R NM V K+ G Y + + A +++
Sbjct: 145 DSHYGLWTLGVGDY--DKMNGVLSPSAGYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNT 201
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H P+FL L + A + + P+E K+ DI D R + G ++ +D+ +G V
Sbjct: 202 HYPGTPLFLAFT-LDIPAKH----LEIPEEYEEKYSDIEDDRTRQFYGKLTMMDDIIGGV 256
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ AL+ GM ++S+++F+ DNGA + S GSN+P RG+ T ++G R + +
Sbjct: 257 VDALKNRGMYDDSLIIFIGDNGA----LSSQSGSNYPFRGIAGTMFEGATRVPSIVSGKG 312
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
+K+T S+EL+ I D T+ AG + + LDG++ W ++ + R+EIL+NID
Sbjct: 313 IKKTGYESNELYSIVDIHRTILDVAGAKA-EPELDGLSMWPTWSEDKPSPRNEILYNID- 370
Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI-DKYSPKEVLYSKAGITFNAL 446
D+P AA+R+ D K + G D Y D + D Y Y+ + L
Sbjct: 371 -DDPVSPGAAIRIGDYKLIT-----GHPDLLYPYRDISLTDNY------YNYGDAPASGL 418
Query: 447 KTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLF 506
+ Q Q HA P + + L ++ D
Sbjct: 419 PSTDQ--QSHA--PAPQNVEYLFNVVVD-------------------------------- 442
Query: 507 NITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
P E+N+++ D+++ L D+L ++ ++PP N+ + +N++W P
Sbjct: 443 -----PEERNDISADHPDIVQDLRDRLDEHRKLLIPPVNQTTELAGAAGNFNDVWSP 494
>gi|354473078|ref|XP_003498763.1| PREDICTED: arylsulfatase B [Cricetulus griseus]
Length = 460
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 178/486 (36%), Positives = 245/486 (50%), Gaps = 70/486 (14%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P +PL EKLLPQ LKEAGYATH +GKWHLG ++ PT RGFD+
Sbjct: 32 IHTGLQHLLIRVCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMCQKECLPTRRGFDT 91
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
++GY G ++YY H A EP LD R + K YST ++++ A +IA
Sbjct: 92 YFGYLLGSENYYTHEVCALIEPLNVTRCALDFRDGEEPATKYNNTYSTHIFSQRATTLIA 151
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H KP+FLYLA +VH +P Q P+E + + I D RR YAGMVS +DE+VGN
Sbjct: 152 NHPPEKPLFLYLALQSVH-----DPLQVPEEYMKPYDFIQDKHRRIYAGMVSLMDEAVGN 206
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V AAL+ HG+ N++ +F DNG + + G+N PLRG K T W+GG+RGV + SP
Sbjct: 207 VTAALKSHGLWNNTVFVFSTDNGGQTL----SGGNNWPLRGRKWTLWEGGIRGVGFVASP 262
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
LKQ S EL HISDWLPTL AG T LDG + W +++G + R E+LHNI
Sbjct: 263 LLKQKGVKSRELIHISDWLPTLVNLAGGSTYGTKPLDGFDVWRTISEGTPSPRVELLHNI 322
Query: 386 -----DNVDNPQKYYAALRVDD--LKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
D+ P K A + D LKY A + + + +Y ++L
Sbjct: 323 DQDFFDSSPCPGKSMAEAKNDSFPLKYSA-----------FNTSIHAGIRYKNWKLLTGH 371
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
G F P ++ +R++ + K L
Sbjct: 372 PGCGF------------WFPPPSQSNISEIRSLDSSSKTL-------------------- 399
Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
LF+I DP E+++L+E ++ +L +L Y VP P D R DP
Sbjct: 400 -----WLFDINQDPEERHDLSEKHPHIVLKLLSRLQYYHEHSVPSYFPPMDPRCDPKD-T 453
Query: 559 NIWVPW 564
+W PW
Sbjct: 454 GVWSPW 459
>gi|260819479|ref|XP_002605064.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
gi|229290394|gb|EEN61074.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
Length = 485
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 181/539 (33%), Positives = 269/539 (49%), Gaps = 87/539 (16%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+I+ I+ADDLGWNDV +H + + TP +D LA G+ILNQ YV +CTPSR+A MTG
Sbjct: 27 KPNILFIVADDLGWNDVGWH-NPDVKTPVLDQLANEGVILNQCYVNYVCTPSRTAFMTGY 85
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+P H+G QH V + G+P LP+ LK+ GYATH +GK
Sbjct: 86 FPYHVGSQHTVFFPQQAQGIPSKFPFLPEKLKQLGYATHMVGK----------------- 128
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
+ V G Y T +T+ ++++ +H
Sbjct: 129 -------------------------------DDKDVVKTRSGQYGTYFFTDRVVDIVEKH 157
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
P+F+YL + VH +P Q P + + D RR GMVS LDE+VGNV
Sbjct: 158 PADTPLFMYLPYQNVH-----QPLQVPQRFEDMYASVQDENRRKLLGMVSALDEAVGNVT 212
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
A++K G+ +N++V+F DNG G S G+N+PLRG K T W+GG RGVA L
Sbjct: 213 MAMKKAGLWDNTLVIFTTDNG----GWISASGNNYPLRGGKVTLWEGGTRGVAFAHGKML 268
Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
++T ++E+ H DW PT+ AAAG D ++DGV+Q D + G + R+E ++NID+
Sbjct: 269 QKTGYTNNEMIHAVDWFPTILAAAG-GTADEAIDGVSQLDTIMSGQPSPRTEFVYNIDD- 326
Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWY---GDTDNEIDKYSPKEVLYSKAGITFNA 445
+ P K A+RV D K + G G+ D W T +++D P
Sbjct: 327 EFPTK-QGAIRVGDYKLIEGY--AGKPDGWILPPNLTGSQLDSGDP-------------V 370
Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR-VRCNYDNKGAHCNSTVKPC 504
+ T L K + DP + T L + K + R R + +++ + +
Sbjct: 371 IGTWL-FNLKVSGDP------VIGTWLFNLKAGQRRPRHRDLALQPESQPDSGDPVIGTW 423
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFN+ DDP E +NLA+S D L++++ KLA Y+ ++VP D +++P W W P
Sbjct: 424 LFNLKDDPTEHHNLADSMPDKLREMQAKLAEYRKSLVPAIFPAPDPKSNPDNWGGAWSP 482
>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 191/572 (33%), Positives = 283/572 (49%), Gaps = 78/572 (13%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+P+I+ I+ DDLG++DV F S QI TPNID ++++++G
Sbjct: 33 SQPNIVFIVIDDLGFDDVGFK-SHQIKTPNID---------------------QASILSG 70
Query: 88 KYPIHIGMQHGV---ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+Y +H HG+ I + +GLPL LPQ LK GY THAIGKWHLGF++ YTPT
Sbjct: 71 RYAMH----HGIVNWIPPKDSYGLPLNHTTLPQLLKNGGYDTHAIGKWHLGFYKWDYTPT 126
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFE------PYQGLDMRHNMQVDNKTIGIYSTDLYT 198
FRGF+S GY+ G ++Y+ H ++ P G + Q+ G YST +++
Sbjct: 127 FRGFNSFLGYYSGGENYFTHKNGPAYDMHRDPLPSCGQNCS---QIAFDLQGQYSTTIFS 183
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMV 257
+ A+ VI +H KP+FLYLA+ AVH EP QAP + + D I D +RRT+AGM+
Sbjct: 184 DEAVRVIDDHIGPKPLFLYLAYQAVH-----EPAQAPQSYIDPYTDLIPDAQRRTFAGML 238
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
S LDE +GNV AL+++G N++++F DNG P G S N+P+RG K ++GG+
Sbjct: 239 SCLDEGIGNVTRALKRNGFYNNTLIVFTTDNGGPILGGDSVGARNYPMRGGKHAVYEGGV 298
Query: 318 RGVAAIWSPW--LKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGA 374
RG+A + P +T L H +DWLPTLC AAG+E + SLDGV+ WD +T
Sbjct: 299 RGIAFVHPPQNTFVETGAGYHNLMHAADWLPTLCDAAGVECHTQQSLDGVSHWDAMTTNT 358
Query: 375 KT--KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGT------------DNNGQSDEWYG 420
+ + +L N ++ + A R D K++ GT + G + YG
Sbjct: 359 NSTLRGDVVLGNATDLCDATNCGFAYRKGDWKFIRGTGGMPFSWSTPMNQSGGNCNLAYG 418
Query: 421 DTDNEIDKYSPKEVLYSKAG-ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
+D S V + A + T A+ +N+ +L
Sbjct: 419 QC---VDGASLGRVTAATAADCCYTCTNTTKCTYSTFLANQTSNNC-----LLFASSTAS 470
Query: 480 LREFARVRCNYDNKGAHCNSTVKPC---LFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
+ + +D K +T+ P LFNI DP E ++A S D+ KQ+ +L
Sbjct: 471 TLDTNCISGAHDPKQVP-YTTLNPLGYQLFNIAQDPNEHTDVASSHADVAKQMNSELDSI 529
Query: 537 KSTMVPPGNK---PFDKRADP-ARWNNIWVPW 564
+ V N PF P R+ + VPW
Sbjct: 530 LQSYVQAQNDPNCPFPTSGWPQTRYGPVMVPW 561
>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 564
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 167/415 (40%), Positives = 237/415 (57%), Gaps = 39/415 (9%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
N+++ +PHI++I+ DDLG +D+ H +S I TP+ D LA +GL L+Q+YV C+P+R+
Sbjct: 57 NSSSFFRPHILMIIMDDLGSHDLGIHENSGIQTPHADQLARDGLYLDQYYVLPYCSPTRA 116
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+L++G+YP+H G H ++ + E GLPL E+ LPQ L+ AGY HA+GKWH+G R T
Sbjct: 117 SLLSGRYPLHTGC-HTIVNDWETQGLPLDEETLPQVLRRAGYQAHAVGKWHVGHSRWTQT 175
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKA----TFEPYQGLDMRHNMQVDNKTI----GIYST 194
PTFRGF S +G++ G QDY H + +E + D R D + G YST
Sbjct: 176 PTFRGFQSFFGFYLGAQDYNTHIKQGERGNAYEMH--WDARGKCGRDCSRLVDERGNYST 233
Query: 195 DLYTEAAINVIAEHNKS--KPMFLYLAHLAVHAGNTYEPFQAPDE-----EVAKFLDISD 247
++T AI VI H + +P+FLYLAH AVH P Q P+ E A + + +D
Sbjct: 234 HVFTREAIRVIENHPQRPHEPLFLYLAHQAVHW-----PDQVPETYRKFYEGATYSNWTD 288
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
+R+TYAGM+S DES+GNV AL+ GM EN++V+F DNG P+ + SN+P RG
Sbjct: 289 -QRKTYAGMLSAADESIGNVTKALQDAGMWENTLVVFTTDNGGPTAVCAAQGSSNYPKRG 347
Query: 308 MKSTPWDGGMRGVAAIWSP-WLK----------QTQKVSSELFHISDWLPTLCAAAGIEI 356
K T ++GG G + P W K +T ++ S++FH+ DWLPTL G
Sbjct: 348 GKCTVYEGGTTGDGFVSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTLARMTGATP 407
Query: 357 NDTSLDGVNQWDVLTK---GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
N LDGVNQWD + + A R E+ N Q Y A+R K + G
Sbjct: 408 NGKPLDGVNQWDSMLQREPSAPPPREEVFVGYAYFGN-QWYGPAIRYKHWKLIQG 461
Score = 40.0 bits (92), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 19/64 (29%), Positives = 34/64 (53%), Gaps = 4/64 (6%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK----PFDKRADPARWNNI 560
L+++ DP E N+A S +++ L+ KL Y ++ VPP + PF + + +
Sbjct: 490 LYDLQSDPSETQNIASSYPLIVQILQGKLIEYHASFVPPISNDPTCPFTGTTNTSTFGPT 549
Query: 561 WVPW 564
W+PW
Sbjct: 550 WLPW 553
>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 158/404 (39%), Positives = 223/404 (55%), Gaps = 31/404 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPH+I I+ DD G+ D+ +HGS + TP +D LA G+ L +YVQ +C+PSRS LMTG+
Sbjct: 43 KPHLIFIMVDDQGYADIGYHGS-DVHTPVLDQLAAEGVKLENYYVQPICSPSRSQLMTGR 101
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LP LP+ L +AGY TH +GKWHLGF R PT RGF
Sbjct: 102 YQIHTGLQHSIIRPRQPLCLPPDIPTLPECLLKAGYHTHMVGKWHLGFCRPSCLPTRRGF 161
Query: 149 DSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
S +G G D++ + SC G D+ + + G YST LY E ++
Sbjct: 162 QSFFGTLTGSGDHFSYQSCDGA--EACGFDLHDGSRPAWEMRGNYSTLLYIERVKQILRN 219
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H+ + P+FLYL+ A H P Q P + + ++ RR YA M+S LD+ V V
Sbjct: 220 HDPNTPLFLYLSLQAAHT-----PLQVPAHFLNPYKSQNNRHRRHYAAMLSCLDDGVAQV 274
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ L+ G+ +NSI+++ +DNG + GSN PLRG K + W+GG+R V + SP
Sbjct: 275 VKELKTTGIYDNSILIYSSDNGGQPL----SGGSNWPLRGGKGSYWEGGIRAVGFVHSPL 330
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
LK+ +S E+ H+SDW PTL AG + LDG + W+ +++G+ + RSEIL NID
Sbjct: 331 LKKKGVISKEMIHVSDWYPTLLGLAGALQSHNGLDGHDVWESISEGSPSPRSEILFNIDP 390
Query: 388 VD-NPQKYY-----------------AALRVDDLKYVAGTDNNG 413
V P + Y AALRV D + G +G
Sbjct: 391 VSRRPGEPYEKALVLNGFGIWDTAVRAALRVGDWNLLTGNVGDG 434
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 24/60 (40%), Positives = 35/60 (58%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFN++ DP E+++LAE +++K L +LA Y T + N P D ADP +W PW
Sbjct: 465 LFNVSGDPYERSDLAELHPEVVKHLLTRLAEYNQTAMIANNPPDDPMADPELHGGVWSPW 524
>gi|291221683|ref|XP_002730830.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 155/382 (40%), Positives = 220/382 (57%), Gaps = 15/382 (3%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++PHI++I DDLGWNDV +H + I P + LA +G+I NQ Y CTPSR+A+MTG
Sbjct: 25 EQPHIVLIFVDDLGWNDVEWH-NPDIKMPVLSKLAADGVIFNQSYTHPTCTPSRAAMMTG 83
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YP G QH +I P G+PL KLLP+ LKE GY TH +GKWHLGF +E Y PT RG
Sbjct: 84 MYPFRTGNQHQMIFNLHPSGVPLEFKLLPEKLKEVGYFTHMVGKWHLGFCKEEYLPTSRG 143
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDSHYG W +YD P +G D R N+ V K+ Y T + E A ++I
Sbjct: 144 FDSHYGLWTLGVGHYD-KMNGVLSPSEGYDFRDNIGVVPKS-DEYLTLMLAERAEHIING 201
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H P+FL + + A + + PD A + DI D R + G +S +D +G V
Sbjct: 202 HYNKHPLFLQFT-MDIPAKH----LEIPDTFEALYCDIEDERTRKFYGKLSLMDHVIGRV 256
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ AL+ M ++++++F+ DNGA + S GSN+P RG+ T ++G R + I
Sbjct: 257 VDALKAREMWDDTVLMFIGDNGA----LASQAGSNYPFRGIAGTLFEGATRVPSLIHGSM 312
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNID 386
L++T V++EL+++ D T+ A AG E LDG+N WD +KG K+ R E ++NID
Sbjct: 313 LRETGYVNNELWNLVDLHRTILALAGAESGSMCELDGMNMWDTFSKGKKSPREEHVYNID 372
Query: 387 NVDNPQKYYAALRVDDLKYVAG 408
D+P AA+R+ D K + G
Sbjct: 373 --DDPISPGAAIRIGDYKLITG 392
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 20/59 (33%), Positives = 36/59 (61%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFN+ DP E+NN+A+ D+++ L D+L ++ ++PP N+ + D A + +W P
Sbjct: 438 LFNVVVDPSEKNNIADQHPDIVQNLRDRLDEHRKLLIPPVNQTTELAGDAANYCGVWSP 496
>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 496
Score = 274 bits (701), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 163/383 (42%), Positives = 223/383 (58%), Gaps = 27/383 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHI+ + DDLGWNDV +H +S I TP ID LA +G+ LN +YV + C PSR+ L++G+
Sbjct: 24 QPHIVFFVVDDLGWNDVGYH-NSYIKTPTIDMLAKSGVRLNNYYVASHCVPSRNMLISGR 82
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+ I IG+QHG I P GLPL E + LKE GYATH IGKWH G + P RGF
Sbjct: 83 HVIDIGLQHGEI-GYYPRGLPLDEFTIADKLKEIGYATHLIGKWHCGCYSNHSLPHNRGF 141
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGL-DMRHNMQ-VDNKTIGIYSTDLYTEAAINVIA 206
D+ +GY D+Y H + GL D+R N + V K G YST +Y A N+IA
Sbjct: 142 DTFFGYLGSSDDHYTHIIMS-----NGLADLRLNDECVGYKYFGDYSTIMYANEAKNIIA 196
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVG 265
+H+++KP+FL LA AVH +P Q PD + + I D ERRTYAGM S +DE++
Sbjct: 197 QHDENKPLFLMLAFSAVH-----KPIQVPDVYMQYYKSTIHDNERRTYAGMASCVDEAIA 251
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
N+ AL GML+NS+++ DNG S G+N PLRG K++ W+GG+RGVA ++S
Sbjct: 252 NITEALENKGMLKNSVIVLTTDNGGGSV------GNNWPLRGRKNSHWEGGVRGVAFVYS 305
Query: 326 PWLKQTQKVS--SELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
L + + SEL HI+DW PTL G +L GV+QW+++ + R E+L
Sbjct: 306 DLLPMDVRGTENSELMHITDWFPTLVKLGGGNPCERKNLYGVDQWEMIRGMEPSARGEVL 365
Query: 383 HNIDNVDNPQKYYAALRVDDLKY 405
+D + Y R KY
Sbjct: 366 IALDTTRKASEEY---RTGPFKY 385
>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
Length = 498
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 170/408 (41%), Positives = 230/408 (56%), Gaps = 36/408 (8%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
L TL A +T KP+I+ I+ADDLGW DV FHGS I TPNID LA G L
Sbjct: 39 GLAATLASGTALAQST---KPNIVFIVADDLGWKDVGFHGSD-IKTPNIDQLAEKGGRLE 94
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
Q Y Q +CTP+R+ALMTG+YP GMQ VI +G +GL L + LLP+ LKEAGYAT A
Sbjct: 95 QFYAQPMCTPTRAALMTGRYPFRYGMQTAVIPQGGTYGLALDDYLLPEMLKEAGYATAAS 154
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKT 188
GKWHLG + P RGFDS YG G D++ H S + Y R+N ++
Sbjct: 155 GKWHLGHADTAFWPRQRGFDSFYGALLGEIDHFTHKSANGNADWY-----RNNEAIEEAG 209
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
+ L+ A+ VI EH++SKP+FLYLA + H PFQAP E + + I+D
Sbjct: 210 ---FDNILFATEAVRVINEHDQSKPLFLYLAFTSPHT-----PFQAPKEYLDRNSHIADE 261
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA---PSFGIHSN-KGS--- 301
RR YA M+S +D+ +G V+ AL GM EN+++LF +DNG F S+ KG
Sbjct: 262 SRRAYAAMISVMDDGIGRVVEALETRGMRENTLILFHSDNGGVVDAFFAGESDVKGKLPA 321
Query: 302 -NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE-LFHISDWLPTLCAAAGIEIN-D 358
N P R K T ++GG R V I W + + S++ + H+ D PTL A AG +
Sbjct: 322 DNGPYRQGKGTTYEGGTRVVGLI--NWPGKIEPGSADGMMHVVDMYPTLAALAGATPGKN 379
Query: 359 TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
LDG++ W +L +G + R+EI++N++ + AA+R D K V
Sbjct: 380 KPLDGMDMWPMLGEGKPSPRTEIVYNVEPM------TAAVRQGDWKLV 421
>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 153/392 (39%), Positives = 228/392 (58%), Gaps = 17/392 (4%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ LT L + L TA ++PHI+ ILADDLG+ DV + S + TPNID LA G+
Sbjct: 4 FVIILTLISLMTNFDLARTA-QRPHIVFILADDLGYFDVGYRNGSIVKTPNIDKLAAEGV 62
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKL--LPQYLKEAGY 124
L +HY Q C PSRS LM G+Y IH G + +G+ L + +P LKE GY
Sbjct: 63 KLERHYAQPSCMPSRSCLMMGRYQIHTGFDYRC-KDGKRSQLCMHPDTITMPMKLKENGY 121
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
ATH IGKWHLG R P +GFD+ +GY ++DY+ H A + R++ +V
Sbjct: 122 ATHMIGKWHLGNIRWECLPNAKGFDTFFGYLSAIEDYFTHYTPAGANCHDFW--RNHDEV 179
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
+ G YST L+T+ A ++I H+ ++PMF+YL++ AVH P Q P + D
Sbjct: 180 ADDYKGQYSTHLFTKEAQDIIKNHDINQPMFMYLSYQAVHG-----PLQVPPHYYDMYND 234
Query: 245 --ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
R+TYA M + +DE +GNV + L++ GM +N++++F++DNGA H+ G N
Sbjct: 235 GPRVSTTRQTYAAMATCMDEGIGNVTSTLKQRGMWDNTVLIFVSDNGAQ---FHTG-GRN 290
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD 362
PLRG K + ++GG+R V+ + SP +++ +++S ++ H+ DW PTL AG ++D LD
Sbjct: 291 WPLRGGKGSVFEGGIRVVSFVTSPLIERPRRISEDIVHLMDWFPTLTHLAGGTVDDVMLD 350
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY 394
GVN WD ++KG + R E+L +I D Y
Sbjct: 351 GVNVWDTISKGTLSPRRELLVHISATDGVTNY 382
>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
caulinodans ORS 571]
gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
caulinodans ORS 571]
Length = 490
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 156/388 (40%), Positives = 217/388 (55%), Gaps = 24/388 (6%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F A L + A +PHI+ ILADDLG+ DV FHGS I TPN+D LA G
Sbjct: 25 FMAAVAGLSALASPARAQAAPRPHILYILADDLGFADVGFHGS-DIKTPNLDKLAATGAT 83
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L Q Y Q +CTP+R+ALMTG+YP+ G+Q GVI G +GL E LLPQ LK GY+T
Sbjct: 84 LGQFYTQPMCTPTRAALMTGRYPLRYGLQTGVIPSGASYGLATDEFLLPQALKSVGYSTA 143
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
IGKWHLG ++ + P RGFD YG G D+Y H + Y R N QV +
Sbjct: 144 LIGKWHLGHAKQDFWPRQRGFDYFYGPLVGEIDHYKHEAHGVVDWY-----RDNKQVVEE 198
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
Y T+L+ A +I H+ P+FLYLA A H PFQAP V ++ +I+D
Sbjct: 199 G---YDTELFGTDAARLIGAHDPKTPLFLYLAFTAPHT-----PFQAPQAYVDRYPNITD 250
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------- 300
P RR YA M+S +D+ +G V+AAL + GM EN++++F +DNG + +G
Sbjct: 251 PARRLYAAMISAMDDQIGKVVAALEQRGMRENTLIVFHSDNGGTRSKMFVGEGAFHGELP 310
Query: 301 -SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
SN P R K T ++GG R VAA+ + + T + + H+ D +PTL AG ++ +
Sbjct: 311 ASNAPYRDGKGTLYEGGTR-VAALANWPGRITPGAADGVMHVVDMMPTLARLAGANLDRS 369
Query: 360 S-LDGVNQWDVLTKGAKTKRSEILHNID 386
LDGV+ W L+ + R +++N++
Sbjct: 370 KPLDGVDVWSALSSARTSPRDGVVYNVE 397
>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
Length = 495
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 161/418 (38%), Positives = 228/418 (54%), Gaps = 39/418 (9%)
Query: 8 FFALTCTLLFNDAFLNTT---------APKKPHIIIILADDLGWNDVSFHGSSQIPTPNI 58
F TC + + LN + + PHI+ I++DD GW DV FHGS I TPN+
Sbjct: 17 FAVFTCCITWKLLTLNQQDLPVAVAQQSSQPPHILFIMSDDQGWKDVGFHGSD-IRTPNL 75
Query: 59 DALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQY 118
D LA G L Q+Y Q +CTPSR+AL+TG+YP G+Q VI +GLP E LLPQ
Sbjct: 76 DQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLVIPSAGKYGLPTDEYLLPQA 135
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
LKEAGY T +GKWHLG Y P RGFD YG G DY+ HS + Y
Sbjct: 136 LKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEIDYFTHSAHGKVDWY----- 190
Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
R+N + + Y T L + A+ +I +HN P+FLYLA A HA P+QAP +
Sbjct: 191 RNNQLIKEEG---YVTTLLGQDAVKLIEKHNPKTPLFLYLAFTAPHA-----PYQAPQKY 242
Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG---- 294
+ ++ I+DP RR YA M++ +D+ +G V+AAL K GM N++++F +DNG P
Sbjct: 243 LDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMRNNTLIVFQSDNGGPRSAQFTG 302
Query: 295 -IHSNKGS----NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
+ ++ G+ N P R K++ ++GG R VA P Q V + HI D PTL
Sbjct: 303 EVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGKIQPGTVVNHPIHIVDMYPTLT 362
Query: 350 AAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
A + + + LDG+N W L++ + RS+++++I+ + AAL +D K V
Sbjct: 363 GLASVSVGKNKPLDGLNIWPALSEAKPSPRSQVVYDIE------PFRAALSQEDWKLV 414
>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
Length = 609
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 179/566 (31%), Positives = 273/566 (48%), Gaps = 112/566 (19%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+PH + ILADD+G++D+ +H ++I TP +D LA G+ L Q+YVQ +CTP+R LMTG
Sbjct: 96 KQPHFLFILADDMGYHDIGYH-QAEILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTG 154
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y I GMQHGV+ +P G+PL EKLLP+ L++ GY T IGKWHLG F E Y P RG
Sbjct: 155 RYQIRYGMQHGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRG 214
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-----GIYSTDLYTEAAI 202
FD G++ G QD+Y H+ F G D R + I YST ++ +
Sbjct: 215 FDHFMGFYTGSQDFYSHN--KCFSGMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELE 272
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+++ N S+P F YL+ AVH+ P QAP + + ++ ++ + +RR Y MV+ +D
Sbjct: 273 KRLSKMNPSEPSFTYLSFQAVHS-----PLQAPRKFIDQYRNLKNRKRRNYNAMVTAMDT 327
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
++ V+ A +K +++I++F +DNG G SN PLRG K + ++GG+R +
Sbjct: 328 AIYKVVKAYKKFKFWDDTILIFSSDNG----GNLKGGASNWPLRGAKGSLFEGGVRTIGF 383
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND---TSLDGVNQWDVL---TKGAKT 376
+ SP L ++DW PTL AG + +D LDGV+Q L + G
Sbjct: 384 VHSP-----------LLPVTDWFPTLLQLAGCKTSDYGGKPLDGVSQVRQLWMGSNGNDH 432
Query: 377 KRSEILHNID------NVDNPQKY------------YAALRVDDLKYVAGTDNNGQSDEW 418
R EILH +D N +P+++ ALR K + G + Q + W
Sbjct: 433 ARDEILHGLDPLNKVKNTVDPRQFPIIKNRTFSVDVEGALRWKQWKLITGKQSPDQ-NRW 491
Query: 419 YGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL 478
+P +Y A F+ + +PK + L ++ D
Sbjct: 492 ----------STPPTTMYGGAKTIFDKV------------EPKKSKLVRLYNVVDD---- 525
Query: 479 ELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKS 538
P E+N++++ ++ + KLA Y
Sbjct: 526 ---------------------------------PFERNDISDEHPQVVNVMLTKLADYYD 552
Query: 539 TMVPPGNKPFDKRADPARWNNIWVPW 564
VPP +D RADPA + +W PW
Sbjct: 553 ASVPPRYPAYDLRADPALGDGVWRPW 578
>gi|291233691|ref|XP_002736785.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 182/488 (37%), Positives = 265/488 (54%), Gaps = 44/488 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHI++ILADDLGWNDV +H + ++ P ++ LA +G+I NQ YVQ CTP+R+ALM+G
Sbjct: 29 KPPHIVLILADDLGWNDVGWH-NPEVKMPVLNQLAADGVIFNQAYVQPTCTPTRAALMSG 87
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YP G QH ++L P GLPL K LPQ LK+ GY TH +GKWHLGF +E + PT RG
Sbjct: 88 YYPFKTGNQHQLLLNLHPGGLPLRFKTLPQRLKDVGYLTHIVGKWHLGFCKEAFLPTNRG 147
Query: 148 FDSHYGYWQGLQDYYDHSCKAT-FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FDS YG GL H K G D N V +T Y + + A+ +I
Sbjct: 148 FDSFYG---GLTLGTSHFSKMNGILSTPGYDFYDNSGVVPQT-NDYLAFMLADRAVKIIN 203
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H + P+F+Y + + V A + + P E A + +I+D R + G +S +D++VG+
Sbjct: 204 GHYQEYPLFMYFS-MDVPAKSP----EVPPEYEALYANITDDRTRRFYGKLSVMDDAVGS 258
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V+ AL+K M ++++++F+ DNGA + S+ GSN PLRG+ +T ++G R A I
Sbjct: 259 VVDALKKRDMWDDTLLIFIGDNGA----LASSSGSNWPLRGIAATLFEGATRVPAFIHGN 314
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
LKQT ++EL+HI D T+ + AG E + +DG++ WD +K + R+EI++NID
Sbjct: 315 MLKQTGYENNELWHIVDLHKTILSLAGAE-SSCEIDGMDMWDTFSKNESSPRNEIVYNID 373
Query: 387 NVDNPQKYYAALRVDDLKYVAGTDN-------NGQSDEW--YGDTDNE-------IDKYS 430
D+P AA+RV D K +AG + QSD W YGD D+
Sbjct: 374 --DDPISPGAAIRVGDYKLIAGNPDLFYPLRLINQSDGWFNYGDVPTSGLPVLPTGDEPP 431
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
P V Y FN + + + + AD ALR L + +I + +V +
Sbjct: 432 PPNVTY-----LFNVIDDPEE--RNNIADDHPEIVQALRQRLDEHRIYYMPAVNQV---F 481
Query: 491 DNKGAHCN 498
D G N
Sbjct: 482 DEAGDPSN 489
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 38/59 (64%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFN+ DDP E+NN+A+ ++++ L +L ++ +P N+ FD+ DP+ ++ ++ P
Sbjct: 438 LFNVIDDPEERNNIADDHPEIVQALRQRLDEHRIYYMPAVNQVFDEAGDPSNFDGVFSP 496
>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
Length = 518
Score = 271 bits (693), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 157/394 (39%), Positives = 230/394 (58%), Gaps = 26/394 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
TT P P+I+++LADDLGWNDVS+H +S + PN+ LA G+IL Y Q CTPSR+A
Sbjct: 26 TTNP--PNIVMVLADDLGWNDVSWH-NSIVQMPNLQDLAERGVILEHAYAQEKCTPSRAA 82
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+TG+YPI+ GMQ V++ + GLP+ KLLP YLK+ GYATH IGKWH+G+ E YTP
Sbjct: 83 FLTGRYPINTGMQEEVVVATQMSGLPIEFKLLPSYLKDQGYATHMIGKWHVGYCDEAYTP 142
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
T RGFDSHYG++ Y ++S + G D R ++ ++ G Y+T +T+ A
Sbjct: 143 TRRGFDSHYGFYNSGISYSNYSSTEGTD--VGYDYRDDLALNLAAEGKYTTTDFTDQAKT 200
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD--ISDPERRTYAGMVSRLD 261
+I H+++ PMFLY+A+ A H PF+ +E D + D R+TY GM+S LD
Sbjct: 201 LIDNHDQTNPMFLYMAYNAPHT-----PFEV-EESYRDIYDGNLRDGNRKTYLGMISALD 254
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGA--PSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
E VG ++ L++ GM N++ +F +DNG P G G+N PLRG K + ++GG R
Sbjct: 255 EQVGQLVDKLKEVGMWSNTVFVFYSDNGGTQPQSG---QSGNNFPLRGKKGSLFEGGYRV 311
Query: 320 ----VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGA 374
+A + L+ SS LFHISD T A AG + +DG+N W ++
Sbjct: 312 PAFLIARTRAGNLELIASTSSTLFHISDMFATFIALAGGDAAVPADIDGINAWPAISGNT 371
Query: 375 KTKRSEILHNI---DNVDNPQKYYAALRVDDLKY 405
+ R+ +++NI + +N + + DL+Y
Sbjct: 372 PSPRTTLVYNIITKNFANNARNTEPKAAIRDLQY 405
>gi|313235834|emb|CBY19819.1| unnamed protein product [Oikopleura dioica]
Length = 518
Score = 271 bits (692), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 178/547 (32%), Positives = 277/547 (50%), Gaps = 58/547 (10%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
+++ ++ DD GW D HGS ++ TPN+DA+A +G++L ++Y Q +C+P+RS+L+TG+YP
Sbjct: 19 NVVYLMIDDFGWGDFGVHGS-KLETPNLDAIARDGILLEKYYTQQVCSPTRSSLLTGRYP 77
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
I GMQH VIL G+ G+P LLPQ LK G+ATH +GKWH G E P RGFDS
Sbjct: 78 IRYGMQHNVILAGQTTGIPKEYALLPQDLKSCGFATHMVGKWHCGHSHEYMLPFNRGFDS 137
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
+YGY QG +D+Y C+A + + G+D N T G Y T++Y+ V+ + +
Sbjct: 138 YYGYLQGAEDHYSRIQCQA--KEWCGVDFCTENGPTNSTWGTYGTEIYSAQVAQVLDKVS 195
Query: 210 K-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD-ESVGNV 267
K KP FLY A VH +P QAP+ KF I DP+RRTYA MVS +D E+VG+
Sbjct: 196 KEEKPFFLYYAMQNVH-----DPLQAPEHYKIKFDWIEDPDRRTYAAMVSIMDEETVGDF 250
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ L + + ++++++ ADNG G + G+N PLR K + ++GG+ A +
Sbjct: 251 MTMLHERNLYDDTLIIITADNG----GETRSGGNNFPLRSQKWSLYEGGVHVNAMLSGGL 306
Query: 328 LK-QTQKVSSELFHISDWLPTLCAAAGIE--INDTSLDGVNQWDVLTKGAKTKRSEILHN 384
+K Q++ L H+SDW T+ A + + + LDG++QW + + A R I+HN
Sbjct: 307 IKDQSKSTFDGLVHVSDWRETILEAMECKPPADKSDLDGISQWQAILQNASPPRQSIVHN 366
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
ID + K V GTD + D I+ S + Y + +
Sbjct: 367 IDPI---------------KTVKGTDERQWVQKLAADASFNINSQS--AIRYGR----WK 405
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
L P + + L D+ + E E T
Sbjct: 406 LLTGDPAFGVPDGNIPPPEDARPVWQKLLDDSMFESSE---------------TLTRLVQ 450
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR----WNNI 560
L++I +DP E+++++++ D++K D L+ + T V P D ++P +
Sbjct: 451 LYDIEEDPFEKHDISDNHPDIVKIGLDLLSRHNETAVWPNFPHTDPNSNPETRSDGFAGF 510
Query: 561 WVPWYDE 567
W PW E
Sbjct: 511 WWPWVQE 517
>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 469
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 154/380 (40%), Positives = 229/380 (60%), Gaps = 29/380 (7%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
L T F A KP+II+ILADDLGWNDV FHGS +I TPN+D LA +G+ L
Sbjct: 14 LTLIITYYFRGANELKAEINKPNIIVILADDLGWNDVGFHGS-EIKTPNLDKLAASGVRL 72
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYA 125
+ YV+++CTP+R+A +TG++P GM GVI +PW GLPL EK + + LKEAGY
Sbjct: 73 ERFYVKSVCTPTRAAFLTGRHPFRYGMSTGVI---KPWDKVGLPLEEKTIAETLKEAGYY 129
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
T +GKWHLG ++E + PT RGFD HYG++ G DY+ H+ + + G H ++
Sbjct: 130 TAILGKWHLGHYQESFLPTSRGFDYHYGHYLGGIDYFTHN-----DDFLGALDWHRNRIH 184
Query: 186 NKTIGIYSTDLYTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
K G Y+TDL + A+ +I HN + +P+FLY+A A H P A E++ +L
Sbjct: 185 LKEEG-YATDLIGQEAVKLINNHNYEQQPLFLYIAFNAPHT-----PLHAKTEDIEDYLT 238
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I D +RR +A V +DE++G +I +L + +N+ V F++DNG + +N+G N P
Sbjct: 239 IDDEKRRVFAAQVQSMDEAIGKIIQSLENQQVCDNTFVFFVSDNGGSV--MRANRGDNRP 296
Query: 305 LRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLD 362
LRG K++ ++GG+R A + + P L Q++ +++F I D PTL AG+E + LD
Sbjct: 297 LRGGKNSLYEGGVRVPAIVSYPPKLSANQEI-NQIFSIVDLYPTLAKLAGVESVEGKQLD 355
Query: 363 GVNQWDVLTKGAKTKRSEIL 382
G+ D+ K ++R E+L
Sbjct: 356 GL---DIFAK--HSRRDELL 370
>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
Length = 782
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 200/323 (61%), Gaps = 15/323 (4%)
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
N YV + ++S T + IH G+QH +I +P +PL EKLLP+ LKEAGYATH
Sbjct: 333 NYGYVPSYLNATKSP-HTKREQIHTGLQHQIIWACQPNCVPLDEKLLPELLKEAGYATHM 391
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQV 184
+GKWHLG +R+ PT RGFDS++GY G +DYY H LD R +V
Sbjct: 392 VGKWHLGMYRKECLPTRRGFDSYFGYLLGSEDYYSHERCVLIRSLNVTRCALDFRDGEEV 451
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
+YST+++ + A+++IA H KP+FLYLA +VH EP Q P+E V +
Sbjct: 452 AVGYKNMYSTNVFAKRAVDLIANHPPDKPLFLYLAFQSVH-----EPLQVPEEYVKPYSF 506
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I + +RR YAGMVS +DE+VGNV AAL++HG+ +N++++F DNG + G+N P
Sbjct: 507 IQNKKRRNYAGMVSIMDEAVGNVTAALKRHGLWDNTVLIFSTDNGGQTLA----GGNNWP 562
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDG 363
LRG K T W+GG+RG+ + SP LKQ + EL HISDWLPTL AG +N T LDG
Sbjct: 563 LRGRKWTLWEGGVRGIGFVASPLLKQKGVKNRELIHISDWLPTLVNLAGGHVNGTKPLDG 622
Query: 364 VNQWDVLTKGAKTKRSEILHNID 386
+ W +++G+ + R E+LHNID
Sbjct: 623 FDVWKTISEGSPSPRVELLHNID 645
>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
Length = 501
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 152/369 (41%), Positives = 213/369 (57%), Gaps = 24/369 (6%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P +P+I+ I+ADD+G+ D F+GS IPTPN+D LA G L Q Y +CTP+R+ALMT
Sbjct: 57 PARPNIVYIVADDMGFGDAGFNGS-DIPTPNLDKLAAEGARLEQFYALPMCTPTRAALMT 115
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+YP+ G+Q GVI + LP+ E LLPQ LK+ GY T +GKWHLG + + P R
Sbjct: 116 GRYPLRYGLQVGVIPAAGTYSLPVDEYLLPQALKDTGYTTAMVGKWHLGHAKPEFWPRQR 175
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
GFD YG G D++ HS + Y R+N ++ + L+ A+ V+
Sbjct: 176 GFDYFYGALVGEIDHFKHSSHGVKDWY-----RNNKPLNETG---FDNTLFGNEAVRVVE 227
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H P+FLYLA A H PFQAP + + +F DI+D RR YA M+S +D+ VG
Sbjct: 228 RHEGKSPLFLYLAFTAPHT-----PFQAPQDYLDRFKDIADDNRRKYAAMISVMDDGVGK 282
Query: 267 VIAALRKHGMLENSIVLFMADNGAPS---FGIHSNKG-----SNHPLRGMKSTPWDGGMR 318
V+AAL K GM +N+I++F +DNG F S G SN P R K T ++GG R
Sbjct: 283 VVAALEKKGMRDNTIIVFHSDNGGVKNSLFAGDSKVGGELPASNGPYRDGKGTLYEGGTR 342
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTK 377
VAA+ + K V + H+ D PTL A A ++ + LDG+N WD L++G +
Sbjct: 343 -VAALANWPGKIKPGVVDSMVHVVDMYPTLAAVADAKLEKNKPLDGMNVWDALSEGKPSP 401
Query: 378 RSEILHNID 386
RSE+++N+D
Sbjct: 402 RSEVVYNVD 410
>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 533
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 173/522 (33%), Positives = 262/522 (50%), Gaps = 76/522 (14%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
+I+ ++ADDLG+NDV FHGS QIPTP +DALA +G+ L ++ +C+PSR+++++G++
Sbjct: 57 NILFLVADDLGFNDVGFHGSKQIPTPRLDALAADGVDLLNYHTHPVCSPSRASMLSGRHA 116
Query: 91 IHIGMQHGVIL---EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
IH HG+ + +G + L L +LLP+ L+ GY THA+GKWHLG PT RG
Sbjct: 117 IH----HGIYMPFAQGTAYHLSLEYELLPEALRRLGYETHAVGKWHLGQNTRAALPTGRG 172
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS GYW G +DY+ H C + D +N G+YS +T+ A++V+A
Sbjct: 173 FDSFLGYWSGAEDYFAHDCAGAY------DFANNETTAWAYDGVYSAYSFTDRAVDVVA- 225
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+ S P FLY+A VH P + P + F + RR +GMVS LDE+VGNV
Sbjct: 226 -SASTPYFLYVAWQNVH-----WPLEVPARYLEPFGHVPAGPRRNVSGMVSALDEAVGNV 279
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
AL+ ++++V+F++DNG P+ G +N PLRG K+T ++GG R V + +P
Sbjct: 280 TGALKARRAYDDAVVVFVSDNGGPTNGDEGTASNNWPLRGGKNTLYEGGTRVVGLLKAPG 339
Query: 328 LKQTQKVSSELFHISDWLPTL--CAAAGIEINDTSL---DGVNQWDVLTKGAKTKRSEIL 382
+ ++ ++L H DW PTL A G + + L DGV+ +DVL +GA + +L
Sbjct: 340 VAKSSASRAKL-HAVDWFPTLYRFAGGGPLVLNPPLAPGDGVDSYDVLARGAPGRDELLL 398
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVA-GTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
N + AL V D K++ GT W T + P+ V+
Sbjct: 399 EAHAPGSNDTVHGEALIVGDWKFLRIGTVRPDSEAGW---TRPPGQEAHPRSVV------ 449
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
A P+ ++A R
Sbjct: 450 -------------DCGAAPEPADANATRR---------------------------QCVA 469
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP 543
PCLF++ DPCE+ + A D+L L+ +L Y++T VPP
Sbjct: 470 APCLFDVAADPCERVDRAVEFPDVLAALDARLDAYRATAVPP 511
>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
Length = 659
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 159/365 (43%), Positives = 203/365 (55%), Gaps = 20/365 (5%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++PHI+ +LADDLGWNDV +H S I TP++DALA G+ L +Y Q LCTPSRS L+TG
Sbjct: 172 RQPHIVFVLADDLGWNDVGYH-DSNIFTPHLDALAAGGVRLENYYTQPLCTPSRSQLLTG 230
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL EKLLP+ L+EAGY TH +GKWHLG FR+ PT RG
Sbjct: 231 RYQIHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMFRKECLPTRRG 290
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
FD+ +GY G +DYY H + LD R V YST+++TE AI+
Sbjct: 291 FDTFFGYLLGSEDYYSHKHCVHIDALNVTRCALDFRDGEDVAEGYNNTYSTNIFTEKAID 350
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+IA+H K L V + P EE FL T ++
Sbjct: 351 LIAKHPAQKGT----EDLIVILDSRGVPTLVSSEEQVSFLCKDQSSDHTSLQII-----H 401
Query: 264 VGNVIAALRKHGMLENSIVLF-MADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
G++ ENS F DNG + G+N PLRG K T W+GG+RGV
Sbjct: 402 FGSIKNVQTPRASSENSSSAFPFPDNGGQTLA----GGNNWPLRGRKWTLWEGGIRGVGF 457
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEI 381
+ SP LKQT S EL HISDWLPTL AG N T LDG + W +++G + R E+
Sbjct: 458 VASPLLKQTGVQSRELIHISDWLPTLVNLAGGHTNSTKPLDGFDVWKTISEGNPSPRMEL 517
Query: 382 LHNID 386
LHNID
Sbjct: 518 LHNID 522
Score = 38.9 bits (89), Expect = 7.7, Method: Compositional matrix adjust.
Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+ +L+ ++K+L D+L Y+ + VP D DP + +W PW
Sbjct: 600 LFDIQQDPEERYDLSSKHPSIVKKLLDRLQYYQKSSVPVFYPDEDPHCDP-KATGVWGPW 658
>gi|260788444|ref|XP_002589260.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
gi|229274435|gb|EEN45271.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
Length = 455
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 194/552 (35%), Positives = 265/552 (48%), Gaps = 115/552 (20%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+I+ ILADD GWND+ +H +S I TPN+D LA G+ L +YVQ +C+PSR LMTG+Y
Sbjct: 1 PNIVFILADDYGWNDIGYH-NSFIRTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRY 59
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
IH G+QH VI+ P GLPL E LPQ LKE GY+T+ +GKWHLGFFR+ Y P RGF+
Sbjct: 60 QIHYGLQHSVIMCDRPHGLPLDEVTLPQRLKENGYSTYMVGKWHLGFFRKEYMPLQRGFE 119
Query: 150 SHYGYWQGLQDYYDHSCKATFEP----YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
+GY G +DY+ H + F + GLD+R + G YST L+ AI +I
Sbjct: 120 RFFGYLTGGEDYWTHRKPSQFSKDPSEFHGLDLRDQDKPVLDQNGTYSTHLFARKAIEMI 179
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
H++SKPMFLYL AVH P +AP+E + DI + RTYA MV+ +DE+VG
Sbjct: 180 LNHDQSKPMFLYLPFQAVHG-----PLEAPEEYKRIYEDIDNSLVRTYAAMVTVMDEAVG 234
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG----GMRGVA 321
NV AL+ GM N+I++F + A + +DG G+R V
Sbjct: 235 NVTGALKHRGMWRNTILIFSTASSAFT---------------QSQIKYDGTISYGIRLVK 279
Query: 322 -AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
I + L Q LF D L + + + + GA + R E
Sbjct: 280 FCILALILSDVQPHYINLFPYLDMLFFMS-----------------FKICSTGAASPRKE 322
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
ILHNID P K + L+ S + YGD N
Sbjct: 323 ILHNID----PLKPSSPLQ--------------SSHKLYGDIFN---------------- 348
Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
+ ALR+ D KIL ++ R + ++G N
Sbjct: 349 ---------------------TSVYAALRS--GDWKILT---GSQDRGGWKHEGPSHNLF 382
Query: 501 VKP--------CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA 552
KP LFNI DP E+ +L++S +++ + KLA Y T V P P D RA
Sbjct: 383 DKPNDPPEKSIWLFNIRSDPEEKKDLSDSHPGIVQDMLIKLAEYNKTAVSPFWPPRDLRA 442
Query: 553 DPARWNNIWVPW 564
+PA ++W PW
Sbjct: 443 NPALHGDLWGPW 454
>gi|350414673|ref|XP_003490384.1| PREDICTED: hypothetical protein LOC100748821, partial [Bombus
impatiens]
Length = 413
Score = 268 bits (685), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 138/325 (42%), Positives = 210/325 (64%), Gaps = 20/325 (6%)
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
+VS+LD+SVG+V+ ALR GMLENSIV+FM+DNGAP+ GI SN+GSN+PLRG+K++PW+G
Sbjct: 1 VVSKLDQSVGDVMDALRSRGMLENSIVVFMSDNGAPTNGILSNQGSNYPLRGIKNSPWEG 60
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGA 374
G RGVAAIWSP ++++++VS+++ +SDWLPTL +AAG++ I ++DG + W L G
Sbjct: 61 GTRGVAAIWSPLIRKSKRVSNQMMFMSDWLPTLLSAAGVDRIQLGNIDGFDLWPSLVSGK 120
Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID----KYS 430
+ RSEI+ NID++ + YAA+R D KY+ G G + W G + + + +Y
Sbjct: 121 ISPRSEIVINIDDLAD----YAAIRRGDFKYIIGRTETGIA--WLGASGDPSEGVSPRYD 174
Query: 431 PKEVLYSKAGITFNAL---KTKLQIKQKHAADPK-----ANSSDALRTILTDEKILELRE 482
P +VLYSK G+ + + K +++ +K + + ++ ++ ILT E I+E+R+
Sbjct: 175 PHKVLYSKTGVAISGIITAKQAMELNKKKKENIRIVYDTSSKTNFQEKILTSEDIIEMRK 234
Query: 483 FARVRCNYDNKGA-HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
A+++CN K C + PCLFNI DPCE NLAE +L E L ++ T++
Sbjct: 235 KAQIKCNVTEKDKIPCEPMIAPCLFNIEQDPCEMVNLAERNPVILTIFETILIRHRLTVI 294
Query: 542 PPGNKPFDKRADPARWNNIWVPWYD 566
PP N D R++P+ WNN W W +
Sbjct: 295 PPSNLDGDPRSNPSLWNNTWTSWAE 319
>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
Length = 571
Score = 267 bits (683), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 190/563 (33%), Positives = 286/563 (50%), Gaps = 67/563 (11%)
Query: 21 FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
F+ + P+ + I+ADDLG+NDVS+H + Q+ TPN+ +A NG+IL + Y QA CTPS
Sbjct: 23 FVAAQNDRPPNFLYIMADDLGYNDVSWH-NPQVLTPNLGKMAKNGVILTESYSQAACTPS 81
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R++ MTG YP IG+Q+ V+ EG +PL LP+ LKEAGY +H +GKWHLG R
Sbjct: 82 RASYMTGYYPFRIGVQNSVVREGMEDYVPLDVDFLPKRLKEAGYVSHLVGKWHLGHCRRD 141
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKAT-----FEPYQG---LDMRHNMQVDNKTIGIY 192
TP RGFD+ G G DYY +A F+P D N + Y
Sbjct: 142 VTPPGRGFDTFLGLLNGYNDYYTKKIRAIASHEDFDPNAPGTIYDFFSNYTLQPSPETDY 201
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA-PDEEVAKFLDISDPERR 251
+TD++T AI +I + +K P FL L + A H P Q P + D+ D RR
Sbjct: 202 TTDIFTNRAIELI-QQSKDTPFFLALHYTAPHW-----PLQKHPGFSDKDYPDVKDLNRR 255
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y ++ +D +G V+ AL++H +L+N++++F +DNG G +N PLR K+T
Sbjct: 256 IYLSSITAMDTGIGKVVDALKQHNLLDNTLIVFQSDNG----GDTHFSANNGPLRDRKTT 311
Query: 312 PWDGGMRGVAAIWSP-WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
++GG++ + + P L T + S +LFHISDW T+ +AAG+E + +DG++QW++L
Sbjct: 312 LYEGGVKVPSMAYGPGLLTNTPRTSDDLFHISDWFTTILSAAGLEAGN--VDGIDQWNLL 369
Query: 371 TKGAKTKRSEILHNI--DNV-----------DNPQKYYAALR---VD----DLKYVAGTD 410
G+K+ R ++NI DN NP+ + A R VD DL
Sbjct: 370 RYGSKSPRQTFIYNIHGDNAAIRAGRYKLIEGNPKAFQARRRRRSVDVLDQDLAVHGSAL 429
Query: 411 NNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT 470
+ DE + D Y E KA + LK QI+ P +
Sbjct: 430 TRHKRDEEFVDRR----PYESDEDKVMKAFSSLTGLKKLEQIR------PTGD------- 472
Query: 471 ILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
T + IL + E ++R + A C+ K L+N+T DP E+ +L + + ++ +
Sbjct: 473 -WTMDSILSVLE--KIRPDQVKVQADCH---KCQLYNLTADPSEKKDLVKEKPAVVLFMH 526
Query: 531 DKLAIYKSTMVPPG-NKPFDKRA 552
L +YK+ V P PF K A
Sbjct: 527 TLLELYKAEEVEPVLAYPFKKDA 549
>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 496
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 159/390 (40%), Positives = 219/390 (56%), Gaps = 27/390 (6%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+I+ ILADDLG +DV + GS +I TP++D LA G L+Q YVQ +C+P+R+AL+TG+
Sbjct: 52 RPNIVFILADDLGSHDVGWRGS-EIKTPHLDELARAGATLDQFYVQPVCSPTRAALLTGR 110
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP G Q GV+ +GLPL E+ LPQ LKEAGY T GKWHLG F+ Y PT RGF
Sbjct: 111 YPFRYGFQTGVVRPWAEYGLPLEERTLPQALKEAGYETAITGKWHLGHFQPAYLPTKRGF 170
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D YG++ G+ DYY H + G D N Q +N G YST+L + A + E
Sbjct: 171 DHQYGHYNGMLDYYTHIR------HGGFDWHRNDQ-ENHDEG-YSTELVGKEAARRVRER 222
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+KS+P+FLY+ VH+ P Q P E + +R+TYAGMV+ LD +VG ++
Sbjct: 223 DKSRPLFLYVPFNGVHS-----PHQVPVEYETPYTQFEG-KRKTYAGMVTALDAAVGRIV 276
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA-AIWSPW 327
A+R ++EN++ +F +DNG P+ SN PLR K T ++GG+R A A W
Sbjct: 277 QAVRDEKLVENTLFIFSSDNGGPN---PKQLTSNGPLRAGKGTVYEGGVRVCAFATWPGK 333
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNID 386
+K V + + HI+DW PTL AG + LDG + W + +GA + IL N
Sbjct: 334 IKPGTSVQAPI-HIADWFPTLLGLAGASLKQPLPLDGRDVWATVAEGAASPHDTILINTT 392
Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
A+R D K V G N D
Sbjct: 393 PSG------GAIRSGDWKLVIGGANRDDED 416
>gi|395545568|ref|XP_003774672.1| PREDICTED: arylsulfatase J-like, partial [Sarcophilus harrisii]
Length = 466
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 170/490 (34%), Positives = 245/490 (50%), Gaps = 67/490 (13%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+ PT RGF
Sbjct: 1 YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 60
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINV 204
D+ +G G DYY H K G D+ N DN GIYST +YT+ +
Sbjct: 61 DTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQI 116
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
+A HN KP+FLY+A+ AVH+ P QAP + + I + RR YA M+S LDE++
Sbjct: 117 LASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAI 171
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
NV AL+++G +NSI+++ +DNG GSN PLRG K T W+GG+R V +
Sbjct: 172 NNVTLALKRYGFYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFVH 227
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
SP LK + EL HI+DW PTL A G D LDG + W+ +++G ++ R +ILH
Sbjct: 228 SPLLKNKGTICKELVHITDWYPTLITLAEGWIDEDIQLDGYDIWETISEGLRSPRVDILH 287
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
NID + + + + AG GI
Sbjct: 288 NIDPI--------YTKAKNGSWAAG------------------------------FGIWN 309
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTVK 502
A+++ +++K SD + + F+ N + N+ ++
Sbjct: 310 TAIQSAIRVKHWKLLTGNPGYSDWVPP----------QSFSNAGPNRWHNERISWSAGKS 359
Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P +W
Sbjct: 360 VWLFNITADPYERVDLSSKYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWG 419
Query: 563 PWYDELDKQK 572
PWY E DK +
Sbjct: 420 PWYKEDDKTR 429
>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
Length = 431
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 140/301 (46%), Positives = 189/301 (62%), Gaps = 14/301 (4%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL EKLLP+ L+EAGY TH +GKWHLG +++ PT RGFD+
Sbjct: 1 IHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYKKECLPTHRGFDT 60
Query: 151 HYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
++GY G +DYY H KA LD R +V +YST+L+TE AI++IA
Sbjct: 61 YFGYLLGSEDYYSHDRCVLIKAKNITRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIA 120
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H KP+FLYLA +VH EP + P+E + + I D +RR YAGMV+ +DE+VGN
Sbjct: 121 HHKTEKPLFLYLAFQSVH-----EPLEVPEEYMKPYSSIKDAKRRHYAGMVTLMDEAVGN 175
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+++G+ N++++F DNG + G+N PLRG K T W+GG+RGV + SP
Sbjct: 176 LTDALKRYGLWNNTVLVFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGVGFVASP 231
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
LKQ S EL HISDWLPTL AG N T LDG + W +++G + R E+LHNI
Sbjct: 232 LLKQKGVESHELIHISDWLPTLVHLAGGHTNGTKPLDGFDVWKAISEGRPSPRVELLHNI 291
Query: 386 D 386
D
Sbjct: 292 D 292
>gi|241680395|ref|XP_002412684.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215506486|gb|EEC15980.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 461
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 163/440 (37%), Positives = 234/440 (53%), Gaps = 35/440 (7%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
+ A ++PHI++I+ADDLGW DVSFHGS QIPTPNID LA +GL L+ +YVQ+ +R
Sbjct: 1 MTAQASRRPHIVVIVADDLGWGDVSFHGSRQIPTPNIDTLAGDGLALHHYYVQSSGAATR 60
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
SALMTG YPIH G++H + +PL+ ++ ++ + H+GF E
Sbjct: 61 SALMTGLYPIHTGIKHASHICISARLIPLSGGIVRCHIVLTIKYAFMVNIPHIGFLFEHI 120
Query: 142 TPTFRGFDSHY---GYWQG-------LQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
+ ++ + GYW LQ + + F + +MR
Sbjct: 121 SKGAHAVNASWWTNGYWSKRFIHAFFLQLFMNAEIFFFFNQWPCREMR------------ 168
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
+ HN S P+ LYLAH AVH GN Y P +AP V KF I D RR
Sbjct: 169 -------SPKWKIAPLHNCSVPLLLYLAHQAVHVGNAYHPLEAPSSYVKKFRYIKDLRRR 221
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
+AGMV+ LD+SVG+V AL++ G+LE++IVL +D+G + G+ GSN PLRG K T
Sbjct: 222 IFAGMVAALDQSVGDVFRALQRKGILEDTIVLLTSDSGGAAGGVEQGVGSNWPLRGTKKT 281
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
W+GG+R V +WS L + +V+S+L H S + P A + +I D S+DG + W VL
Sbjct: 282 LWEGGVRAVGLLWSAKLARG-RVASQLIHRSAYFPLTNADSCGDITDLGSIDGKDLWTVL 340
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
+ R+E+L+NID + AA+RV K V G ++ Q D+WY +++
Sbjct: 341 VDDRPSPRTELLYNIDY----RWKLAAIRVGPYKLVLGASSSDQFDDWYEPIESDATNPY 396
Query: 431 PKEVLYSKAGITFNALKTKL 450
+ KA T AL KL
Sbjct: 397 LESAALRKALFTDCALLAKL 416
>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 265 bits (677), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 159/380 (41%), Positives = 218/380 (57%), Gaps = 27/380 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP++++ILADD+GW DV +H S+I TP++D LA +G L Q YVQ +C+P+R+ALMTG+
Sbjct: 36 KPNVVLILADDMGWGDVGWH-DSEIKTPHLDKLAASGTRLEQFYVQPVCSPTRAALMTGR 94
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP+ G+Q GV+ +GLPL E+ LPQ LKE GY T GKWHLG F+ Y PT RGF
Sbjct: 95 YPMRHGLQVGVVRPWAQYGLPLNERTLPQALKEVGYETAICGKWHLGHFQPEYLPTHRGF 154
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D YG++ G DY+ H G D + +V N+ G YST L A +I H
Sbjct: 155 DHQYGHYNGALDYFTHIRDG------GFDWHRDDRV-NRDEG-YSTHLIGREATRIIGHH 206
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+ SKP+FLY+ AVHA P Q P++ + + +P RRTYAGM++ +DE+VG +
Sbjct: 207 DTSKPLFLYVPFNAVHA-----PHQVPEKYKEPYSKLKEP-RRTYAGMLAAMDEAVGQIA 260
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPW 327
+A+ GM N+I LF +DNG P+ G SN PLRG K T ++GG+R A I W
Sbjct: 261 SAIDGKGMRSNTIFLFSSDNGGPAPG---QVTSNGPLRGQKGTLYEGGVRVPAFISWEGH 317
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNID 386
L+ V++ L HI D PTL AG + + DG + W + + IL N
Sbjct: 318 LRPGTVVNAPL-HIVDLFPTLLTLAGAPLEQSLPTDGRDAWTAIAHHGPSPHDAILLNT- 375
Query: 387 NVDNPQKYYAALRVDDLKYV 406
P A+RV D K +
Sbjct: 376 ---TPDS--GAIRVGDWKLI 390
>gi|440894324|gb|ELR46807.1| Arylsulfatase J, partial [Bos grunniens mutus]
Length = 466
Score = 265 bits (677), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 168/487 (34%), Positives = 247/487 (50%), Gaps = 69/487 (14%)
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+ PT RG
Sbjct: 1 RYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRG 60
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYTEAAIN 203
FD+ +G G DYY H K G D+ N DN G+YST +YT+
Sbjct: 61 FDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GVYSTQMYTQRVQQ 116
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
++A H+ KP+FLY+A+ AVH+ P QAP + I + RR YA M+S LDE+
Sbjct: 117 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 171
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
+ NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+R +
Sbjct: 172 INNVTLALKMYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAIGF 226
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEI 381
+ SP LK V EL HI+DW PTL + A +I++ LDG + W+ +++G ++ R +I
Sbjct: 227 VHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDENIQLDGYDVWETISEGLRSPRVDI 286
Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
LHNID + + + + AG GI
Sbjct: 287 LHNIDPI--------YTKAKNGSWAAG------------------------------YGI 308
Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNST 500
A+++ +++K SD + + F+ + N + N+ ++
Sbjct: 309 WNTAIQSAIRVKHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERITLSTG 358
Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P +
Sbjct: 359 KSVWLFNITADPYERVDLSNKYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGV 418
Query: 561 WVPWYDE 567
W PWY E
Sbjct: 419 WGPWYKE 425
>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 504
Score = 265 bits (676), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 151/383 (39%), Positives = 224/383 (58%), Gaps = 34/383 (8%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
F+ +C L+ + + + HII I+ DDLGWNDV +H + I TP ID LA NG+ L
Sbjct: 13 FSSSCLLI------DGLSSDQAHIIFIVVDDLGWNDVGYH-NLYIKTPTIDRLANNGVKL 65
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
N +Y LC PSR+ LM+G++ HGVI+ G P GLPL E + LKEAGY+TH
Sbjct: 66 NNYYAANLCVPSRNMLMSGRH------VHGVIM-GYPRGLPLNETTIANKLKEAGYSTHL 118
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN-MQVDNK 187
+GKW+ GF+ + + P RGFD+ +G+ +D+Y H + D+R N + V +K
Sbjct: 119 VGKWNCGFYSKEFLPHNRGFDTFFGFVDSKEDHYTHMV------HDISDLRRNDLCVADK 172
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-DIS 246
G YST +Y +I H+ +KP+FL+++ AVH EP Q P +++ I
Sbjct: 173 YYGNYSTIMYGNEGTTIIDNHDTNKPLFLFMSFSAVH-----EPLQVPSVYEKEYIPTID 227
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
D +RR YAGMVS +DE++ N+I L ML+NSI++F +DNG G+N PLR
Sbjct: 228 DTDRRIYAGMVSCVDEAIANIIETLENKDMLKNSIIVFTSDNGGDPL----YSGNNWPLR 283
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIN-DTSLDG 363
G K++ W+GG+R + ++S L + + +EL HI+DW PTL +G I+ + L G
Sbjct: 284 GWKASNWEGGVRALGFVYSEILPTNARGTDNNELMHITDWFPTLVDISGGNIHTEELLYG 343
Query: 364 VNQWDVLTKGAKTKRSEILHNID 386
V+QW V+ AK+ R ++L +D
Sbjct: 344 VSQWGVIRGLAKSPREDVLIALD 366
>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
Length = 491
Score = 264 bits (675), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 151/389 (38%), Positives = 216/389 (55%), Gaps = 27/389 (6%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F A L + +PHI+ ILADDLG+ DV FHGS I TPN+D LA G
Sbjct: 27 FLAAVAGLSLLSSGARAADAPRPHIVYILADDLGFADVGFHGS-DIKTPNLDHLAAQGAR 85
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L Q Y Q CTP+R+A +TG+YP+H G+Q G I G +GL E LLPQ LK+ GY T
Sbjct: 86 LGQFYTQPFCTPTRAAFLTGRYPLHYGLQVGAIPSGAKYGLATDEFLLPQALKDVGYRTA 145
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLG + + P RGFDS YG G D++ H + Y N QV +
Sbjct: 146 LVGKWHLGHADQKFWPRQRGFDSFYGPLVGEIDHFKHEAHGVTDWYH-----DNTQVKEE 200
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
Y T+L+ + A+ +IA H+ P+FLYLA A H PFQAP + ++ I+
Sbjct: 201 G---YDTELFGKEAVRLIAAHDPKTPLFLYLAFTAPHT-----PFQAPQSYLDQYAHIAA 252
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------- 300
P+RR YA M++ +D+ +G+V+AAL GM EN++++F +DNG + + +G
Sbjct: 253 PQRRAYAAMITAMDDQIGHVVAALTSRGMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLP 312
Query: 301 -SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE-LFHISDWLPTLCAAAGIEIND 358
SN P R K + ++GG R VA + W + ++E + H+ D LPTL AG +
Sbjct: 313 ASNAPYRDGKGSLYEGGTRVVA--LANWPGRIAPGAAEGVMHVVDMLPTLAKLAGASLAK 370
Query: 359 TS-LDGVNQWDVLTKGAKTKRSEILHNID 386
+ LDGV+ W L G + R+ I++N++
Sbjct: 371 SKPLDGVDVWPALAAG-QAGRAGIVYNVE 398
>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
Length = 455
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 162/408 (39%), Positives = 226/408 (55%), Gaps = 28/408 (6%)
Query: 1 MTWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDA 60
M R + TL T AP+ P+I+ +LADDLG DVS+ GS I TP +DA
Sbjct: 1 MPAIRSLILTIALTLASVATTFATDAPR-PNIVFLLADDLGGADVSWRGSP-IKTPQLDA 58
Query: 61 LAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK 120
LA +G L Q YVQ +C+P+RSAL+TG+YP+ G+Q GV+ +GLPL E+ L + L+
Sbjct: 59 LANSGAKLEQFYVQPVCSPTRSALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQ 118
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
+AGY T +GKWHLG Y P RGFD YG++ G DY+ H G D
Sbjct: 119 DAGYETAIVGKWHLGHVSPAYLPMARGFDHQYGHYNGALDYFTHDRDG------GHDWHK 172
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ V N+ G Y+T L + A+ VI + +K KP+FLY+ AVH+ P Q P+ A
Sbjct: 173 DDHV-NRDEG-YATHLIAQEAVRVIQDRDKKKPLFLYVPFNAVHS-----PLQVPESYAA 225
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
+ D+ R+ YAGMV+ LDE+VG ++ +++ ML+N++ +F +DNG P G ++ G
Sbjct: 226 PYGDMKK-RRQAYAGMVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPEPGKLTDNG 284
Query: 301 SNHPLRGMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
PLRG K T ++GG+R A A W + KV + L HI DW PTL AG +
Sbjct: 285 ---PLRGGKHTLYEGGVRVCAFASWKGRIAPGSKVEAPL-HIVDWYPTLIELAGGSLQQA 340
Query: 360 S-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
LDG N W +T G + I+ NI + A+RV D K V
Sbjct: 341 KPLDGRNIWPSITTGEPSPHDVIVCNITPTE------GAIRVGDWKLV 382
>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 513
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 159/430 (36%), Positives = 232/430 (53%), Gaps = 40/430 (9%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F C L+ + K+P+I+ ILADD GW+D+ +HGS + +P +D LA G+
Sbjct: 6 LFRAVCFLIL--IRITEGGDKRPNIVFILADDFGWHDIGYHGSI-VRSPYMDFLASEGVK 62
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +YVQ +C+P+R+ LM+G+Y I G+QH VI + LP E + Q +KEAGYATH
Sbjct: 63 LENYYVQPMCSPTRAQLMSGRYQIRYGLQHLVIQPDQRACLPPDEVTIAQKMKEAGYATH 122
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD--------MR 179
+GKWHLGF+R+ P RGFD+ +G+ L +Y + P G R
Sbjct: 123 MVGKWHLGFYRKECLPINRGFDTFFGFLNCLIYHYTYDFGYWHTPESGNKTIMFGWDLFR 182
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
++ V + G YST L+ E A VI H++ PMFLYL AVH P P E
Sbjct: 183 NHDCVAKEHKGEYSTILFAEEAQRVIWNHDQETPMFLYLPFAAVH-----NPLDVPREYE 237
Query: 240 AKFLD-ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
A + I D +RR A MV+ +DE++ N+ AL GM +N++++F DNG G S
Sbjct: 238 AMYEGIIEDEQRRKKAAMVTCMDEAIWNITKALMDTGMWDNTVLIFSTDNG----GATSV 293
Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEI 356
G+N PLRG K + ++GG+RGV + SP L + + + ++L H++DW PT A +
Sbjct: 294 GGNNWPLRGGKRSMFEGGIRGVGFVTSPLLDEAVRGTENNQLIHVTDWFPTFVHLAEGNL 353
Query: 357 NDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVD-----------NPQKYY-----AALR 399
+ T LDG NQWD +++G + R+E L NID ++ P Y+ +R
Sbjct: 354 SGTKPLDGFNQWDTISQGMFSPRNEFLINIDPMEICANPLATNQFRPNPYFDVCVRGGIR 413
Query: 400 VDDLKYVAGT 409
V K V G
Sbjct: 414 VGKWKLVTGV 423
>gi|291225027|ref|XP_002732497.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 461
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 173/526 (32%), Positives = 264/526 (50%), Gaps = 70/526 (13%)
Query: 40 LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGV 99
+GWNDV +H +S I PN+ LA +G+I NQ Y CTPSR+A+MTG YP G QH +
Sbjct: 1 MGWNDVHWH-NSDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQ-GL 158
+ P G+PL KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGFDSHYG W G+
Sbjct: 60 VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 159 QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
DY P G D R NM V K+ Y + + A +++ H P+FL
Sbjct: 120 SDY--DKMNGVLSPSAGYDFRDNMGVVPKS-DDYLALMLGDRAEHIVNTHYPGTPLFLAF 176
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
L + A + + P+E K+ +I D R + G ++ +D+ +G V+ AL+ GM +
Sbjct: 177 T-LDIPAKH----LEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMYD 231
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
+S+++F+ DNGA + S GSN+P RG+ T ++G R + + +K+T S+EL
Sbjct: 232 DSLIIFIGDNGA----LSSQSGSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTGYESNEL 287
Query: 339 FHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAAL 398
+ I D T+ AG E + LDG++ W +KG + R+EIL+NID D+P AA+
Sbjct: 288 YSIVDIHRTILDVAGAEA-EPELDGLSMWPTWSKGKPSPRNEILYNID--DDPVSPGAAI 344
Query: 399 RVDDLKYVAGTDNNGQSDEWYGDTDNEI-DKYSPKEVLYSKAGITFNALKTKLQIKQKHA 457
R+ D K + G D Y D + D Y Y+ + L + Q Q HA
Sbjct: 345 RIGDYKLIT-----GHPDLLYPYRDISLTDNY------YNYGDAPASGLPSTDQ--QSHA 391
Query: 458 ADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNN 517
P + + L I+ D P E+N+
Sbjct: 392 --PAPQNVEYLFNIVVD-------------------------------------PEERND 412
Query: 518 LAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
++ D+++ L +L ++ ++PP N+ + +N++W P
Sbjct: 413 ISADHPDIVQDLRARLDEHRKLLIPPVNQTTELAGAAGNFNDVWSP 458
>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
Length = 431
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 138/303 (45%), Positives = 185/303 (61%), Gaps = 14/303 (4%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+ IH G+QH +I +P +PL EKLLPQ LKEAGYATH +GKWHLG +R+ PT RGF
Sbjct: 1 FKIHTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGF 60
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
D+++GY G +DYY H + LD R +V +YST+++TE A +
Sbjct: 61 DTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATTL 120
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I H KP+FLYLA +VH EP Q P+E + + I D RR YAGM S +DE+V
Sbjct: 121 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDEAV 175
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
GNV AAL + G+ N++ +F DNG + G+N PLRG K + W+GG+RGV +
Sbjct: 176 GNVTAALERRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVA 231
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
SP LK+ + EL HISDWLPTL AG N T LDG + W+ +++G+ + R E+LH
Sbjct: 232 SPLLKRKGVKTRELIHISDWLPTLVKLAGGSTNGTKPLDGFDVWNTISEGSPSPRMELLH 291
Query: 384 NID 386
NID
Sbjct: 292 NID 294
>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 486
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 168/477 (35%), Positives = 239/477 (50%), Gaps = 55/477 (11%)
Query: 3 WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
WA A T A + A ++P+I+ I+ADDLGW DV F+G ++I TPNIDALA
Sbjct: 9 WAAVLLVAFT-------AVASQAADRQPNIVHIVADDLGWKDVGFNGCTEIKTPNIDALA 61
Query: 63 YNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
G +Q YVQ +CTP+R+ LMTG++P G+Q VI +GL +E L+PQ L +A
Sbjct: 62 KGGAKFSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVIPTAAGYGLDTSEYLMPQCLGDA 121
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
GY T IGKWHLG + Y P RGFD YG G DY+ H + + R N
Sbjct: 122 GYKTAIIGKWHLGHADQKYWPKQRGFDYQYGAMIGELDYFTHDEHGVLDWF-----RDNK 176
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
V + Y+T L + A+ I + KP +LYL A H P+QAP E + K+
Sbjct: 177 PVHEQG---YTTTLIGDDAVKYIHGQDGKKPFYLYLTFNAPHT-----PYQAPKEYITKY 228
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS- 301
L+I++P RRTYA MV LDE++G V+AAL + G+ EN+++ F +DNG + + + +
Sbjct: 229 LNIAEPTRRTYAAMVDCLDENIGKVVAALDQKGLRENTLIFFHSDNGGTKDKMFAGQMAD 288
Query: 302 ---------NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
N P R K + ++GG R A P + Q V + H D PT A A
Sbjct: 289 MSKVVLPCDNGPYRNGKGSLFEGGSRVCALANWPGKIKAQTVDG-MIHAVDLYPTFAALA 347
Query: 353 GIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN 411
G I LDG N WD + +G + R+E ++I + + A LR D K +
Sbjct: 348 GASIAKCKPLDGTNVWDTIAEGKPSPRTEFFYSI------EPFRAGLRQGDWKLI----- 396
Query: 412 NGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT------FNALKTKLQIKQKHAADPKA 462
W + +D Y+ E Y K I ++ +++ K AA P A
Sbjct: 397 ------WRTMLPSSVDLYNLAEDPYEKNNIAAAHPDKVATMQARIETASKDAAKPLA 447
>gi|431918040|gb|ELK17268.1| Arylsulfatase I [Pteropus alecto]
Length = 547
Score = 262 bits (669), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 177/550 (32%), Positives = 261/550 (47%), Gaps = 115/550 (20%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 50 HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 108
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + + RGFD+
Sbjct: 109 IHTGLQHSIIRPRQPNCLPLDQSXXXXXXR-------------------------RGFDT 143
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V G YST LY + +++A H+
Sbjct: 144 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHS 201
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 202 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 256
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP LK
Sbjct: 257 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 312
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ ++ S L HI+DW PTL AG + LDG + W +++G + R+EILHNID +
Sbjct: 313 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 372
Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
N ++ AA+RV + K + G D YGD + P +
Sbjct: 373 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 418
Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
L + G +N E++ R+
Sbjct: 419 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 439
Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P
Sbjct: 440 ---------WLFNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENPRAHP 490
Query: 555 ARWNNIWVPW 564
W PW
Sbjct: 491 DFNGGAWGPW 500
>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
Length = 500
Score = 262 bits (669), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 146/409 (35%), Positives = 234/409 (57%), Gaps = 39/409 (9%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY--NGLILNQHYVQALCTPSRS 82
T +KP+++ I+ADDLG+ND+ + S+ + TPN+ LA+ N +L YV LCTP+RS
Sbjct: 26 TLTQKPNVLFIMADDLGFNDLDWKDST-LHTPNLRNLAFHKNTALLTNSYVNQLCTPTRS 84
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
A MTG YP +G Q+GV L EP G+P L + +++ Y+T+ +GKWHLG+ ++ +
Sbjct: 85 AFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFL 144
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSC----KATFEPYQGLDMRHNMQVDNKTI------GIY 192
PT RGFD YG++ Y++HS + +GLD+ + + K++ G+Y
Sbjct: 145 PTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVG-NGKSVPDFSQNGVY 203
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVH------AGNTYEPFQAPDEEVAKFLD-- 244
STDL+T+ A++V+ HN +KP F++L++ AVH + P A EV + D
Sbjct: 204 STDLFTDVAMSVLDNHNTTKPFFMFLSYQAVHPPLQIKSEKKLTPKIATHFEVPRSFDKY 263
Query: 245 ---ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKG 300
+ D RR Y GM++ +D ++G ++ L+ + EN++++F +DNG S FG
Sbjct: 264 CSRVKDRYRRIYCGMLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTSNFG-----A 318
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
SN PLRG K T W+GG + + SP + +FH+ DW T+ + G+EI D+
Sbjct: 319 SNAPLRGEKDTIWEGGTKTTTFVHSPMYIEEGGTREMMFHVVDWHATILSITGLEI-DSY 377
Query: 361 LDGVNQWDVLTKGA-KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
DG+NQW+ + G K +R + ++NIDN + +A+R D K + G
Sbjct: 378 GDGINQWEYIKTGRPKFRRFQFVYNIDN------HGSAIRDGDYKLIVG 420
>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 261 bits (668), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 190/585 (32%), Positives = 278/585 (47%), Gaps = 76/585 (12%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
++ LF F A + H+++I+ADD GW DVSFH + I TPNID L GL L
Sbjct: 3 GISILYLFWYLFSFVYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELT 62
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT-HA 128
+Y Q +CTP+RSAL+TG+YPIH G+Q VI + GL E LLP+YL+ H
Sbjct: 63 NYYTQHICTPTRSALLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHM 122
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVD-- 185
+GKWH+G TP RGF++ GY G +D+Y + G+D H+ +
Sbjct: 123 VGKWHVGHGHSWMTPWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGP 182
Query: 186 -NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
N + G YS DLY + ++ + +K F+Y A VH P QA + + K+
Sbjct: 183 TNSSWGKYSGDLYLQKMSEIVKSIDPTKDSFIYFAPQHVHY-----PLQALERHLVKYSW 237
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I D RR YA MV+ LDE VG+VI K+ +L+N+I F+ADNG G + G+N P
Sbjct: 238 IKDTNRRNYAAMVASLDEIVGDVIDLYHKNRLLKNTIGFFVADNG----GETRDGGNNWP 293
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
L G K T +GG++ ++ + + + LFH+SD LPT+ A I +DG
Sbjct: 294 LSGQKWTLLEGGVKATGFMFGGLVPLGKY--NGLFHVSDILPTVMEAFNCPIK-KKIDGY 350
Query: 365 NQWDVLTKGAKTKRSEILHNID--------------NVDNPQKYYAALRVDDLKYVAGTD 410
QW+ + RS+ILHNID N D + AALR K + G
Sbjct: 351 AQWEAIIYSNDFPRSQILHNIDPTKINKVANDSRTWNSDWDVRVQAALRKGPWKLITGDP 410
Query: 411 NNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT 470
G T +E+ ++ + + G KT+ I Q SSD
Sbjct: 411 ---------GFTSDEVPEFPVSPPEWHQQG----PYKTRGLISQI-----TGKSSDYY-- 450
Query: 471 ILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
+DN H N V+ L+ I +DP E ++++ +++++
Sbjct: 451 -------------------FDN---HRNKLVR--LYKIDEDPTESRDISDQNPSIVEEML 486
Query: 531 DKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWY-DELDKQKAI 574
+ L Y STMVPP D ++P W PW DE + +K +
Sbjct: 487 NILIKYNSTMVPPQLPDLDPFSNPKLHEGFWFPWISDEYNSKKQM 531
>gi|291225017|ref|XP_002732505.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 261 bits (667), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 145/381 (38%), Positives = 218/381 (57%), Gaps = 17/381 (4%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PH+I+I DD+GWNDV +H + I PN+ LA +G+I Q Y CTPSR+A+MTG
Sbjct: 26 QPHVILIFVDDMGWNDVHWH-NPDIAMPNLMDLADDGVIFEQSYTHPTCTPSRAAMMTGL 84
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP G QH ++ P G+PL KLLP+ LKE GYATH +GKWHLGF ++ Y PT RGF
Sbjct: 85 YPFKTGNQHQMVFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGF 144
Query: 149 DSHYGYWQ-GLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
DSHYG W G+ DY P G D R N++V K+ G Y + + A ++
Sbjct: 145 DSHYGLWTLGVGDY--DKLNGVLSPSAGYDFRDNLEVVPKSDG-YLALMLGDRAEEIVNN 201
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H+ P+FL L + A + + P+E + DI D R + G ++ +D+ +G V
Sbjct: 202 HSPETPLFLVFT-LDIPAKH----LEIPEEYEELYADIEDDRTRQFYGKLTMMDDIIGGV 256
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ AL+ M ++S+ +F+ DNGA + S GSN+P RG+ T ++G R + +
Sbjct: 257 VDALKDRDMYDDSLFIFIGDNGA----LSSQAGSNYPFRGIAGTLFEGATRVPSIVSGKG 312
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
+++T S+EL+ I D T+ AG + + LDG++ W ++ + R+EI++NID
Sbjct: 313 IEKTGYKSNELYSIVDIHRTILDVAGAKA-EPELDGLSMWPTWSEDEPSPRNEIIYNID- 370
Query: 388 VDNPQKYYAALRVDDLKYVAG 408
D+P AA+R+ D K + G
Sbjct: 371 -DDPVSPGAAIRIGDYKLITG 390
Score = 46.2 bits (108), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 36/59 (61%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFNI DDP E+++++ D+++ L D+L ++ + PP N+ + A +N++W P
Sbjct: 436 LFNIVDDPEERHDISADNQDIVQDLRDRLDEHRKLLHPPVNQTTELAGSAANFNDVWSP 494
>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
Length = 514
Score = 260 bits (664), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/372 (38%), Positives = 214/372 (57%), Gaps = 18/372 (4%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GWNDVS+H + ++ +P ++ LA G+IL+QHY CTP+R+ALMTG+YP +G+Q I
Sbjct: 36 GWNDVSWH-NERMESPILEQLAKEGVILDQHYALPTCTPTRAALMTGRYPYKLGIQSHGI 94
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
EP GLPL L + LK GY THA GKWHLG+ + TPT RGFD+ G++ G QD
Sbjct: 95 RTLEPNGLPLGVTTLAEELKRTGYTTHAFGKWHLGYCNQSLTPTRRGFDTFRGFYVGGQD 154
Query: 161 YYDHSCKA--TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
Y+ H+ T +G D R+ +VD G+Y+T L ++ I E KPMFLY+
Sbjct: 155 YFSHTLSGGKTSATAKGYDYRNGDEVDYSAKGVYTTTLIANHVLSAIEESQPDKPMFLYV 214
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
A AVHA P Q P + +P+R+ M++ +D +VG ++ L+ M
Sbjct: 215 AFQAVHA-----PLQVPTQYRKMCSIYRNPKRKLLCEMLAVMDHAVGLIVKKLKMKDMWR 269
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
N++++F++DNG G G+N PLRG K+T ++GG R A + P ++ + SS +
Sbjct: 270 NTLLVFISDNG----GQILYGGNNWPLRGNKNTLFEGGTRVPAFVAGPLIRNGGRNSSSI 325
Query: 339 FHISDWLPTLCAAAGIEINDTSL---DGVNQWDVLTKGA-KTKRSEILHNIDNVDNPQKY 394
H+ DW PTL +AAG E ++ L DGV+QW + G R+E ++N+D K
Sbjct: 326 IHVVDWFPTLLSAAGGESAESQLKDMDGVDQWAAIRDGTLDGTRTEFVYNLDCKKG--KL 383
Query: 395 YAALRVDDLKYV 406
A+R DLK++
Sbjct: 384 VGAIRDGDLKFI 395
>gi|390362801|ref|XP_793927.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 739
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 154/363 (42%), Positives = 211/363 (58%), Gaps = 29/363 (7%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QHGVI +P LPL E LPQ LKE+GYAT+ +GKWH+GF+ + TPT RGFDS
Sbjct: 318 IHTGLQHGVIRHAQPNCLPLDEVTLPQKLKESGYATNMVGKWHIGFYLDACTPTERGFDS 377
Query: 151 HYGYWQGLQDYYDHS--CK------ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
+G+ G +D+Y HS C+ + + G D+ N G YST L+T I
Sbjct: 378 FFGFLVGGEDHYTHSGHCEVYNADDGSTKTLSGHDLHANKTPVFDYEGQYSTHLFTNKTI 437
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+ I H+K+KP+F+YLA+ AVH P Q PD + + +I+D RRTYAGMVS +DE
Sbjct: 438 DAIERHDKTKPLFMYLAYQAVHL-----PLQVPDRYMDPYKNIADENRRTYAGMVSCMDE 492
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
+GNV AL+ G+ +N+I++F DNG P I S +N PLRG K W+GG+ GV
Sbjct: 493 GIGNVTRALKDAGLYDNTIIIFSTDNGGP---IQSG-ANNWPLRGSKGNMWEGGIHGVGF 548
Query: 323 IWSPWLKQTQK--VSSELFHISDWLPTLCAA-AGIEINDTS-LDGVNQWDVLTKGAKTKR 378
+ SP L + K V+ +L H+SDWLPT+ A AG +N T LDG + W+ ++ A + R
Sbjct: 549 VHSPLLPTSVKGTVNHDLMHVSDWLPTIVAGIAGGALNGTKPLDGFDVWNSISGTASSPR 608
Query: 379 SEILHNID-------NVDNPQKYYAALRVDDLKYVAGTDNNGQS-DEWYGDTDNEIDKYS 430
+EILHNID N AALRV D K G D + + + W ++ +D
Sbjct: 609 TEILHNIDLKKKQPANEKFDTSIRAALRVGDWKIKTGADYDVTTHNVWMPPPESGLDTIY 668
Query: 431 PKE 433
P E
Sbjct: 669 PTE 671
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 31/60 (51%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP EQ +L+E D++ +L KL Y VP D ADPA W PW
Sbjct: 679 LFNITADPLEQTDLSEKYPDVVNELVQKLEEYYEGSVPMRYPDEDLEADPALRGGSWGPW 738
>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
Length = 519
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/301 (45%), Positives = 183/301 (60%), Gaps = 14/301 (4%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT RGFD+
Sbjct: 91 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 150
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
++GY G +DYY H + LD R +V +YST+++TE A +I
Sbjct: 151 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIT 210
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +DE+VGN
Sbjct: 211 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAVGN 265
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V AAL+ HG+ N++ +F DNG + G+N PLRG K + W+GG+RGV + SP
Sbjct: 266 VTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVGFVASP 321
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
LKQ + EL HISDWLPTL A N + LDG + W+ +++G+ + R E+LHNI
Sbjct: 322 LLKQKGVKNRELIHISDWLPTLVKLARGSTNGSKPLDGFDVWETISEGSPSPRMELLHNI 381
Query: 386 D 386
D
Sbjct: 382 D 382
>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
Length = 339
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 138/324 (42%), Positives = 200/324 (61%), Gaps = 17/324 (5%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+P+I+ I+ADDLGWNDVS HGS QIPTP+IDA+A++G+ L ++VQ +CTP+RS ++G
Sbjct: 31 KRPNIVFIVADDLGWNDVSLHGSPQIPTPHIDAIAHSGVHLTNYHVQPVCTPTRSTFLSG 90
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
++ IH G+ + +G L L+ LLP YLK+ GY T A+GKWHLG E PT RG
Sbjct: 91 RHVIHTGI-YMPFAQGTALRLNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVEKALPTGRG 149
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FD + GYW G +DYY H + + G D + + K YST ++ E A+N I E
Sbjct: 150 FDEYLGYWSGAEDYYTH------DTHGGYDFQDGTECAIKYNNTYSTYIFAERAVNTILE 203
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+ +P+FLY A VH P +AP E VA+F I + ER+ A M S LD++VGN+
Sbjct: 204 ADPEQPLFLYTAFQNVH-----WPLEAPAEYVARFSHIPNSERQYVAAMTSILDDAVGNI 258
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
AL++ + +N+I++F +DNG P +H + + +N+PLRG K+T W+GG + V I
Sbjct: 259 TDALKRSRIADNTILIFTSDNGGP---VHDENTESNNYPLRGGKNTLWNGGTQVVGMIAG 315
Query: 326 PWLKQTQKVSSELFHISDWLPTLC 349
++ + H SDWLP+L
Sbjct: 316 KGIENPGTDCHGMMHASDWLPSLV 339
>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 186/574 (32%), Positives = 272/574 (47%), Gaps = 75/574 (13%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
++ LF F A + H+++I+ADD GW DVSFH + I TPNID L GL L
Sbjct: 3 GISILYLFWYLFSFVYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELT 62
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT-HA 128
+Y Q +CTP+RSAL+TG+YPIH G+Q VI + GL E LLP+YL+ H
Sbjct: 63 NYYTQHICTPTRSALLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHM 122
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVD-- 185
+GKWH+G TP RGF++ GY G +D+Y + G+D H+ +
Sbjct: 123 VGKWHVGHGHSWMTPWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGP 182
Query: 186 -NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
N + G YS DLY + ++ + +K F+Y A VH P QA + + K+
Sbjct: 183 TNSSWGKYSGDLYLQKMSEIVKSIDPTKDSFIYFAPQHVHY-----PLQALERHLVKYSW 237
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I D RR YA MV+ LDE VG+V+ K+ +L+N+I F+ADNG G + G+N P
Sbjct: 238 IKDTNRRNYAAMVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNG----GETRDGGNNWP 293
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
L G K T +GG++ ++ + + + LFH+SD LPT+ A I +DG
Sbjct: 294 LSGQKWTLLEGGVKATGFMFGGLVPLGK--YNGLFHVSDILPTVMEAFNCPIK-KKIDGY 350
Query: 365 NQWDVLTKGAKTKRSEILHNID--------------NVDNPQKYYAALRVDDLKYVAGTD 410
QW+ + RS+ILHNID N D + AALR K + G
Sbjct: 351 AQWEAIIYSNDFPRSQILHNIDPTKINKVANDSRTWNSDWDVRVQAALRKGPWKLITGDP 410
Query: 411 NNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT 470
G T +E+ ++ + + G KT+ I Q SSD
Sbjct: 411 ---------GFTSDEVPEFPVSPPEWHQQG----PYKTRGLISQI-----TGKSSDYY-- 450
Query: 471 ILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
+DN H N V+ L+ I +DP E ++++ +++++
Sbjct: 451 -------------------FDN---HRNKLVR--LYKIDEDPTESRDISDQNPSIVEEML 486
Query: 531 DKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
+ L Y STMVPP D ++P W PW
Sbjct: 487 NILIKYNSTMVPPQLPDLDPFSNPKLHEGFWFPW 520
>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 530
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 148/407 (36%), Positives = 226/407 (55%), Gaps = 32/407 (7%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCTPSRSA 83
+AP+ PHI+ +L DD+GWND+ + + TP + LA NG+ L Q+Y Q+ CT SR+A
Sbjct: 22 SAPETPHIVYVLFDDMGWNDIGYQSTDMAALTPVLSDLAENGVKLTQYYTQSTCTVSRAA 81
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L++G P+H G+ HG I+ P GLPL KLLPQYL+E+GY T+ +GKW +G F E Y P
Sbjct: 82 LLSGVLPMHNGISHGTIVMDSPIGLPLKYKLLPQYLQESGYRTYMVGKWDIGHFNEEYLP 141
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATF--EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD +G++ Y+ H + P D+R+ + Y+TDL+ E A
Sbjct: 142 HNRGFDHFFGFYGADITYFSHISSRGYCANPNCFPDLRNEDETMANASMRYTTDLFRERA 201
Query: 202 INVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDE----EVAKFLDISDPERRTYA 254
+ + H + + P+FLYL+ A H P AP E E ++ ERR +A
Sbjct: 202 VGFVEGHAANHATDPLFLYLSFNAPH-----YPTSAPQEFMRNEAELLAPFTNRERRVFA 256
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH-------SNKGSNHPLRG 307
+V+ D SVG V+ AL+ G +S++LF +DNGA + SN GSN PLRG
Sbjct: 257 AVVNYADRSVGRVVEALKATGAYNDSVILFASDNGAVPTTCNQSHCTSESNTGSNWPLRG 316
Query: 308 MKSTPWDGGMRGVAAIWSPWL---KQTQKVSSELFHISDWLPTLCAAAGIEINDTSL--- 361
MK+T W+GG R A + +P + + + + HI+DW+PT+ G+ + ++L
Sbjct: 317 MKATYWEGGCRVPAFVHAPKYLGDRASGSLYQGIVHITDWIPTI--VRGVLRSSSTLLGA 374
Query: 362 --DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
G + WD + K A R+E+L+ +D+ + A+RVD+ K++
Sbjct: 375 EVSGADHWDSIRKLATPPRTELLYGVDHCVAGGQLTGAIRVDNWKFI 421
>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
Length = 431
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 137/303 (45%), Positives = 184/303 (60%), Gaps = 14/303 (4%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+ IH G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT RGF
Sbjct: 1 FKIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 60
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
D+++GY G +DYY H + LD R +V +YST+++TE A +
Sbjct: 61 DTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATAL 120
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +DE+V
Sbjct: 121 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAV 175
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
GNV AAL+ HG+ N++ +F DNG + G+N PLRG K + W+GG+RGV +
Sbjct: 176 GNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVGFVA 231
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
SP LKQ + EL HISDWLPTL A N + LDG + W+ +++G+ + R E+LH
Sbjct: 232 SPLLKQKGVKNRELIHISDWLPTLVKLARGSTNGSKPLDGFDVWETISEGSPSPRMELLH 291
Query: 384 NID 386
NID
Sbjct: 292 NID 294
>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
Length = 498
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 150/368 (40%), Positives = 210/368 (57%), Gaps = 26/368 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHI+ IL DDLG+ DV +HGS + TPN+D LA G L Q Y Q LCTP+R+ALMTG
Sbjct: 52 RPHILYILVDDLGYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGC 110
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP+ G+Q GVI G +GL E LLPQ LKEAGY T +GKWHLG + Y P RGF
Sbjct: 111 YPMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGF 170
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D YG G D++ H + Y R N V Y T+L+ AI +I EH
Sbjct: 171 DYFYGPLVGEIDHFKHEAHGITDWY-----RDNEMVKEPG---YDTELFGADAIRLIEEH 222
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+ + P+++YL+ A H P+QAPD+ + DI+D R YA M+S +D+ VG V+
Sbjct: 223 DSATPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRNAYAAMISCMDDQVGPVL 277
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGS--------NHPLRGMKSTPWDGGMRGV 320
AL + GM E+++V+F +DNG + + +G+ N PLR K T ++GG R V
Sbjct: 278 QALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVV 337
Query: 321 A-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKR 378
A A W + + + + H+ D LPTL A EI + LDG++ W ++ G + R
Sbjct: 338 ALANWPGRIPAGE--THGMMHVVDMLPTLAGLAQAEIAHAGQLDGMDVWQAISAGKASPR 395
Query: 379 SEILHNID 386
E+++NI+
Sbjct: 396 EEVVYNIE 403
>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
sphaeroides KD131]
gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
KD131]
Length = 509
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 149/368 (40%), Positives = 211/368 (57%), Gaps = 26/368 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHI+ IL DDLG+ DV +HGS + TPN+D LA G L Q Y Q LCTP+R+ALMTG+
Sbjct: 63 RPHILYILVDDLGYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGR 121
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP+ G+Q GVI G +GL E LLPQ LKEAGY T +GKWHLG + Y P RG
Sbjct: 122 YPMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGV 181
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D YG G D++ H + Y R N V Y T+L+ AI +I EH
Sbjct: 182 DYFYGPLVGEIDHFKHEAHGITDWY-----RDNEMVKEPG---YDTELFGADAIRLIEEH 233
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+ + P+++YL+ A H P+QAPD+ + DI+D R+ YA M+S +D+ VG V+
Sbjct: 234 DSATPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRKAYAAMISCMDDQVGLVL 288
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGS--------NHPLRGMKSTPWDGGMRGV 320
AL + GM E+++V+F +DNG + + +G+ N PLR K T ++GG R V
Sbjct: 289 QALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVV 348
Query: 321 A-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKR 378
A A W + + + + H+ D LPTL A EI + LDG++ W ++ G + R
Sbjct: 349 ALANWPGRIPAGE--THGMMHVVDMLPTLAGLAQAEIAHAGQLDGMDVWQAISAGKASPR 406
Query: 379 SEILHNID 386
E+++NI+
Sbjct: 407 EEVVYNIE 414
>gi|291230930|ref|XP_002735418.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 480
Score = 259 bits (661), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 150/386 (38%), Positives = 220/386 (56%), Gaps = 17/386 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
T +PHII+I DD+GWNDV +H + I PN+ LA +G+I NQ Y CTPSR+A
Sbjct: 4 TEKSDQPHIILIFVDDMGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAA 62
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+MTG YP G QH ++ P G+PL KLLP+ KE GY+TH +GKWHLGF ++ Y P
Sbjct: 63 MMTGLYPFKTGNQHQMVFNLHPSGVPLEFKLLPEKFKEVGYSTHMVGKWHLGFCKDEYLP 122
Query: 144 TFRGFDSHYGYWQ-GLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
T RGFDSHYG W G+ DY P G D R NM V K+ G Y + + A
Sbjct: 123 TNRGFDSHYGIWTLGVGDY--DKMNGVLSPSAGYDFRDNMGVVPKSNG-YLALMLGDRAE 179
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+++ H P+FL L + A + + P+E + DI D R + G ++ +D+
Sbjct: 180 HIVNNHYPGTPLFLAFT-LDIPAKH----LEIPEEYEEPYADIEDDRTRQFYGKLTMMDD 234
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
+G V+ AL+ GM ++S+++F+ DNGA + S GSN+P RG+ T ++G R +
Sbjct: 235 IIGGVVDALKNRGMYDDSLIIFIGDNGA----LSSQAGSNYPFRGIAGTLFEGATRVPSI 290
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
+ +K+T S+EL+ I D T+ AG + + LDG++ W ++ + R+EIL
Sbjct: 291 VSGKGIKKTGYESNELYSIVDIHRTILDVAGAKA-EPELDGLSMWPTWSEDKPSPRNEIL 349
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAG 408
+NID D+P AA+R+ D K + G
Sbjct: 350 YNID--DDPVSPGAAIRIGDYKLITG 373
Score = 46.6 bits (109), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 17/59 (28%), Positives = 38/59 (64%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFN+ DP E+N+++ + D++++L D+L +++ ++PP N+ + +N++W P
Sbjct: 419 LFNVVVDPEERNDISADRPDIVQELRDRLDEHRNLLIPPVNQTTELAGAAGNFNDVWSP 477
>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 480
Score = 258 bits (660), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 188/554 (33%), Positives = 253/554 (45%), Gaps = 106/554 (19%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
GW DVSFHGS+QIPTPNID LA +G+IL+ +Y LCTPSR+ALMTG YPIH GM GVI
Sbjct: 5 GWGDVSFHGSTQIPTPNIDVLAGDGVILDNYYALPLCTPSRAALMTGLYPIHTGMHAGVI 64
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
+ PWGL L K++PQ+ ++ GY + IGK H H G
Sbjct: 65 QDAAPWGLTLETKIMPQHFEDLGYEVNMIGKSH-----------------HDG------- 100
Query: 161 YYDHSCKATFEPYQGLDMRH------------NMQVDNKTIGIYSTDLYTEAAINVIAE- 207
C G+D+ H N + KT Y T VI
Sbjct: 101 -----CHNFDSTKSGIDLLHTPLISSIPGQHDNTWMYGKTYNFYRTRAICRLQKRVIQGV 155
Query: 208 ---------HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
S+P F YL+H AVH+ +PFQAP + KF I + R YAG
Sbjct: 156 VVYLIRFLLDLHSQPFFCYLSHQAVHSALMKDPFQAPARNLLKFPYIGETNRTVYAGETK 215
Query: 259 RLDES-VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
+ +G + +L K + +++ +DNG FG+ SN+G N PLRG K+ W+GG+
Sbjct: 216 KTHLPWLGLALVSLVKTKNYVDILLMGFSDNGGSPFGLFSNRGFNWPLRGAKANLWEGGV 275
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKT 376
R A +WS + +VS +L HISDWLPTL +AAG ++ N S+DG++ W LT G +
Sbjct: 276 RVPAFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAGGDVSNLGSMDGIDMWRYLTLGFGS 335
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
R E+L ID P+ AALR + K V G N E+D Y
Sbjct: 336 PRKEVLLTID----PKNNLAALRYKNYKLVVGEGFN-----------EELDGR------Y 374
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH 496
S G L+ + Q + D K+ ARV ++ K
Sbjct: 375 SFPG----GLRPTNDVPQ----------------LRKDSKV------ARVLKSFYGKQNR 408
Query: 497 CNSTVKPCLFNITDDPCEQ----NNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA 552
S + + D C + NN Q + + KLA Y +T VPP +P D RA
Sbjct: 409 NWSPLDWSKNPVVD--CRERWLTNNFVARQPPYFRFMMSKLAAYNATAVPPLFQPQDPRA 466
Query: 553 DPARWNNIWVPWYD 566
P +W PW D
Sbjct: 467 YPEYHGGVWSPWLD 480
>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 486
Score = 258 bits (660), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 156/393 (39%), Positives = 217/393 (55%), Gaps = 33/393 (8%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
AP P+II I+ADDLGW DV F+G I TP++DALA G Q YVQ +CTP+R+ALM
Sbjct: 26 APSAPNIIHIVADDLGWQDVGFNGCKDIQTPHLDALAKGGARFTQFYVQPMCTPTRAALM 85
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+YP G+Q VI +GL E LLPQ L++AGY T IGKWHLG + + P
Sbjct: 86 TGRYPFRYGLQTAVIPSVSTYGLDTGEYLLPQCLQDAGYTTAIIGKWHLGHADKKFWPKQ 145
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
RGF+ YG G DYY HS + + R N V + Y+T+L A+ +
Sbjct: 146 RGFEYQYGAMIGELDYYTHSEHGVLDWF-----RDNEPVHEEG---YTTNLLGADAVKYL 197
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
+ +P +LYLA A H P+QAP E + ++ I+DP RRTYAGMV+ LD+++G
Sbjct: 198 EKQKADRPFYLYLAFNAPHT-----PYQAPQEYIDRYTHIADPTRRTYAGMVACLDDNIG 252
Query: 266 NVIAALRKHGMLENSIVLFMADNGA---PSF-GIHSNKGS------NHPLRGMKSTPWDG 315
V+AAL K G+ EN+++LF +DNG P F G+ ++ N P R K + ++G
Sbjct: 253 RVVAALDKKGLRENTLILFHSDNGGTHNPMFAGVMADVSKIKIPCDNSPYRDGKGSLFEG 312
Query: 316 GMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKG 373
G+R A A W +K L H +D T AG + LDGV+ W + +G
Sbjct: 313 GVRVCALANWPGKIKPA--TVDGLIHATDIFTTFTKLAGASTSRCKPLDGVDAWPTMAEG 370
Query: 374 AKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
+ R E+++N+ + Y AALR D K +
Sbjct: 371 RPSLREEVVYNV------EPYRAALRQGDWKLI 397
>gi|21483362|gb|AAM52656.1| HL07902p [Drosophila melanogaster]
Length = 365
Score = 258 bits (659), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 151/387 (39%), Positives = 208/387 (53%), Gaps = 35/387 (9%)
Query: 216 LYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHG 275
+ L+HLAVH GN P QAP+EEVAKF I DP+RRTYAGM+S LD+SV I AL+ +G
Sbjct: 1 MVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQTIGALKDNG 60
Query: 276 MLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS 335
ML NSI+L +DNGAP+ GIHSN GSN+P RG K +PW+GG+R A+WSP LK+ VS
Sbjct: 61 MLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWSPLLKERGYVS 120
Query: 336 SELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY 394
++ H DWLPTL AAG+ + D LDG+N W +L+ + K ++H +D V
Sbjct: 121 NQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKPRTMIHVLDEVFG---- 176
Query: 395 YAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEIDKYSPKEVLYSKAGITFNALKTKLQIK 453
Y++ D LKYV G+ G+ D+W G+ + NE D +
Sbjct: 177 YSSYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLG--------------------ESY 216
Query: 454 QKHAADPKANSSDALRTILTDEKILELREFARVRC------NYDNKGAHCNSTVKPCLFN 507
++H S R LT ++I ++R A C N C PC F+
Sbjct: 217 EQHVLASDVQSLLGNRG-LTKDRIRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFD 275
Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKRADPARWNNIWVPWYD 566
+ DPCE+ NLA+ L+QL D+L + T +P P D RA+P N W W++
Sbjct: 276 LAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSARVPHSDSRANPTFHNGNW-EWWN 334
Query: 567 ELDKQKAIETMRLHKPLLSPTTITLIY 593
D Q T ++ L + I L+Y
Sbjct: 335 NTDTQSGSGTFSINLWLAMGSIIGLVY 361
>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
Length = 551
Score = 258 bits (659), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 182/301 (60%), Gaps = 14/301 (4%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
I G+QH +I +P +PL EKLLPQ+LKEAGY TH +GKWHLG +R+ PT RGFD+
Sbjct: 123 IRTGLQHQIIWPCQPSCVPLDEKLLPQFLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 182
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
++GY G +DYY H + LD R +V +YST+++T+ A +I
Sbjct: 183 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAATLIT 242
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +DE+VGN
Sbjct: 243 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 297
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV + SP
Sbjct: 298 VTAALKSRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASP 353
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
LKQ + EL HISDWLPTL AG N T LDG + W +++G+ + R E+LHNI
Sbjct: 354 LLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWKTISEGSPSPRVELLHNI 413
Query: 386 D 386
D
Sbjct: 414 D 414
>gi|149726732|ref|XP_001501521.1| PREDICTED: arylsulfatase I-like [Equus caballus]
Length = 476
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 165/493 (33%), Positives = 244/493 (49%), Gaps = 89/493 (18%)
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+R+ PT RG
Sbjct: 8 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 67
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FD+ G G DYY + +C G D+ V G YST LY + +++A
Sbjct: 68 FDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILA 125
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N
Sbjct: 126 SHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRN 180
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+++G NS+++F +DNG +F + GSN PLRG K T W+GG+RG+ + SP
Sbjct: 181 ITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSP 236
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
LK+ ++ S L HI+DW PTL AG + T LDG + W +++G + R+EILHNI
Sbjct: 237 LLKRKRRTSRALVHITDWYPTLVGLAGGTASATDGLDGYDVWPAISEGRASPRTEILHNI 296
Query: 386 DNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSP 431
D + N ++ AA+RV + K + G D YGD + P
Sbjct: 297 DPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIP 342
Query: 432 KEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYD 491
+ L + G +N E++ R+
Sbjct: 343 PQTLAAFPGSWWNL-----------------------------ERMASARQAV------- 366
Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
LFNI+ DP E+ +LA + D+++ L +L Y T +P + R
Sbjct: 367 ------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPR 414
Query: 552 ADPARWNNIWVPW 564
A P W PW
Sbjct: 415 AHPDFNGGAWGPW 427
>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
Length = 551
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 175/517 (33%), Positives = 269/517 (52%), Gaps = 47/517 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+II I DD G+NDV F + I +PN+DALA +G+IL Y C+PSR + M+G+
Sbjct: 25 QPNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFMSGR 83
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y MQHGVIL+ +P L L +LP YLKE GY THA GKWHLG+ R+ TPT RGF
Sbjct: 84 YSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGF 143
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
DS G + G ++Y+H+ AT Y D +VD IG +S DL + E+
Sbjct: 144 DSFSGGFSGEGEFYEHTT-ATGGYY---DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEY 199
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRTYAGMVSRLDESVGN 266
++S P+F+Y+A VH+ P E +A + LD++D +R+ Y G+VS +D +G
Sbjct: 200 DQSSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTD-DRKKYLGLVSGMDYIIGG 253
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
++ +++ G+ EN+ +LF +DNG +G N P RG KST W+GG + + SP
Sbjct: 254 IVDKIKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSP 308
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
L T ++ L H++DWLPT+ AG + + LDGV Q D++ + R+ +++NI
Sbjct: 309 LLGATGVENNGLMHVTDWLPTIVDLAGGSVPASDGLDGVVQTDMVINNGDSARTTMIYNI 368
Query: 386 D-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
D D Q + + +L+Y GQ D + D KY Y +A +
Sbjct: 369 DRESDIGQPKFGEMACRNLQYKLIWGFPGQQDGYGADA-----KYIYDIPFYQEAAQAY- 422
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
I + + D K +LT ++ L + D + A + T
Sbjct: 423 -------IDAEGSTDKKRGGP----YVLTADQTSVLNNYMSSVHVSDQELA--DGTGSML 469
Query: 505 LFNITDDPCEQNNLAES--------QTDLLKQLEDKL 533
LF++ +DP E +L+ES + +L+K ++D +
Sbjct: 470 LFDLINDPNETTDLSESTDPDVVAAKAELIKMIQDAV 506
>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 476
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 152/384 (39%), Positives = 217/384 (56%), Gaps = 30/384 (7%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
LT T F A KP+II+ILADDLGWNDV FHGS +I T N+D LA +G+ L
Sbjct: 14 LTLTITYYFRGANNLKAEINKPNIIVILADDLGWNDVGFHGS-EIKTTNLDKLAVSGVRL 72
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYA 125
+ YV+A+CTP+R+A +TG++P GM + PW GLPL EK + + LKEAGY
Sbjct: 73 ERFYVKAMCTPTRAAFLTGRHPFRYGMS---AINVTPWSETGLPLEEKTIAETLKEAGYY 129
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
T +GKWHLG ++E Y PT RGFD HYG++ DY+ H GLD N
Sbjct: 130 TAILGKWHLGHYQESYLPTSRGFDYHYGHYLAGIDYFTHKSG------DGLDWHRN---- 179
Query: 186 NKTIGI--YSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
N + I YSTDL + A+ +I H+ K P+FLY+A A H P QA E++ +
Sbjct: 180 NNPVYIEGYSTDLIAQDAVQLINNHDYHKNPLFLYIAFNAPHI-----PLQAKAEDLEDY 234
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG---IHSNK 299
L I D +RR +A V +DE++G++I +L+ + N++++F +DNG + +
Sbjct: 235 LTIEDEQRRLFAAQVQSMDEAIGSIIQSLQAKQVWNNTLLVFTSDNGGEIVANDIRFTGR 294
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-D 358
G N PLRG K ++GG+R I P + K ++F I D PT AG++IN +
Sbjct: 295 GDNRPLRGGKRNLYEGGVRVPTIISYPGHLSSGKTVEQMFSIVDLYPTFAKLAGLKINQE 354
Query: 359 TSLDGVNQWDVLTKGAKTKRSEIL 382
+DGV+ + + + T R E+L
Sbjct: 355 QQIDGVDILESIAE-KNTNRDELL 377
>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
Length = 475
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 140/335 (41%), Positives = 190/335 (56%), Gaps = 30/335 (8%)
Query: 56 PNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLL 115
PN+ +L YN +S + IH G+QH +I +P +PL EKLL
Sbjct: 30 PNLGSLVYN-------------VEEQSCPFFRNFQIHTGLQHQIIWPCQPSCIPLDEKLL 76
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ- 174
PQ LKEAGYATH +GKWHLG +R+ PT RGFD+++GY G +DYY H +
Sbjct: 77 PQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTVIDALNV 136
Query: 175 ---GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
LD R +V +YST+++TE A +I H KP+FLYLA +VH EP
Sbjct: 137 TRCALDFRDGEEVATGYKNMYSTNVFTERATTLITTHPPEKPLFLYLALQSVH-----EP 191
Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
Q P+E + + I D RR YAGM S +DE+VGNV AAL G+ N++ +F DNG
Sbjct: 192 LQVPEEYLKPYDFIQDKNRRHYAGMASLMDEAVGNVTAALESRGLWNNTVFIFSTDNGGQ 251
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
+ G+N PLRG K + W+GG+RGV + SP LK+ + EL HISDWLP L
Sbjct: 252 TLA----GGNNWPLRGRKWSLWEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPPLXGT 307
Query: 352 AGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
G + LDG + W+ +++G+ + R E+LHNID
Sbjct: 308 DGTK----PLDGFDVWNTISEGSPSPRMELLHNID 338
>gi|291239589|ref|XP_002739705.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 489
Score = 256 bits (654), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 161/426 (37%), Positives = 234/426 (54%), Gaps = 32/426 (7%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
AL C ++ K+PHI++ILA+DLGWNDV +H ++ I P ++ LA +G+I
Sbjct: 8 LVALVCVIVVESG-------KQPHIVLILAEDLGWNDVGWH-NADIKMPILNQLAADGVI 59
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
NQ YVQ CTPSR+AL+TG YP I QH ++L E GL L K LP+ LK+ GY TH
Sbjct: 60 FNQSYVQPACTPSRAALLTGYYPFKIQRQHQMLLNLEADGLSLDLKTLPEMLKDVGYLTH 119
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLGF +E Y P RGFDS YG W L +S + P G D R N V +
Sbjct: 120 LVGKWHLGFCKEEYLPNKRGFDSFYG-WLTLGTTL-YSKENIIAP--GYDFRDNTGVVQE 175
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
+ Y + E A++++ H K P+FL + +A+ + + P E + DI D
Sbjct: 176 S-DTYLPFMLAERAVDIVMGHYKEYPLFLEFS-MALSG----KFLEVPQEYEDLYSDIED 229
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
+R + G +S +D S+G ++ AL+ GM E+++ +F +DNGA + S GSN PLRG
Sbjct: 230 DRQRKFYGKLSVMDNSIGLLVDALKTRGMWEDTLFIFTSDNGA----LASESGSNWPLRG 285
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW 367
+T ++G R LK T V++EL HI D T+ AG + + +DG++ W
Sbjct: 286 NIATLFEGATRVPTFFHGSMLKNTGYVNNELMHIVDLHKTIIELAGAK-TECDIDGMDLW 344
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN-------NGQSDEWYG 420
++KG + R+E ++NID DN AA+RV D K + G + +SD+WY
Sbjct: 345 KTISKGKPSPRTEFVYNID--DNELSPGAAIRVGDYKLITGHPDLLYPFRMLNRSDDWYN 402
Query: 421 DTDNEI 426
D I
Sbjct: 403 YGDEPI 408
Score = 59.3 bits (142), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/59 (44%), Positives = 37/59 (62%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFNI DDP E+NNLAE + +++L +L Y+ +VPP N+ FD DP + +W P
Sbjct: 428 LFNIIDDPEERNNLAEEYPNKVEELRHRLDEYREHLVPPVNRVFDPAGDPVNFGGVWSP 486
>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
Length = 529
Score = 256 bits (654), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 177/526 (33%), Positives = 272/526 (51%), Gaps = 49/526 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+II I DD G+NDV F + I +PN+DALA +G+IL Y C+PSR + ++G+
Sbjct: 3 QPNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGR 61
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y MQHGVIL+ +P L L +LP YLKE GY THA GKWHLG+ R+ TPT RGF
Sbjct: 62 YSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGF 121
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
DS G + G ++Y+H+ AT Y D +VD IG +S DL + E+
Sbjct: 122 DSFSGGFSGEGEFYEHTT-ATGGYY---DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEY 177
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRTYAGMVSRLDESVGN 266
++S P+F+Y+A VH+ P E +A + LD++D +R+ Y G+VS +D +G
Sbjct: 178 DQSSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTD-DRKKYLGLVSGMDYIIGG 231
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
++ +++ G+ EN+ +LF +DNG +G N P RG KST W+GG + + SP
Sbjct: 232 IVDKIKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSP 286
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
L T ++ L H++DWLPT+ AG + + LDGV Q D++ + R+ +++NI
Sbjct: 287 LLGATGVENNGLMHVTDWLPTIVDLAGGSVPASDGLDGVVQTDMVINNGDSARTTMIYNI 346
Query: 386 D-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
D D Q + + +L+Y GQ D + D KY Y +A +
Sbjct: 347 DRESDIGQPKFGEMACRNLQYKLIWGFPGQQDGYGADA-----KYIYDIPFYQEAAQAY- 400
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
I + + D K +LT ++ L + D + A + T
Sbjct: 401 -------IDAEGSTDKKRGGP----YVLTADQTSVLNNYMSSVHVSDQELA--DGTGSML 447
Query: 505 LFNITDDPCEQNNLAES--------QTDLLKQLEDKLAIYKSTMVP 542
LF++ +DP E +L+ES + +L+K ++D A+ VP
Sbjct: 448 LFDLINDPNETTDLSESTDPDIVAAKAELIKMIQD--AVNDGGYVP 491
>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
Length = 501
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 159/422 (37%), Positives = 233/422 (55%), Gaps = 27/422 (6%)
Query: 21 FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
F K+P+ + I ADDLG++D+ + + TP +D LA G+ +YV C+PS
Sbjct: 13 FQGIHTSKQPNFVWIFADDLGYHDIGLR-NPDLHTPTLDKLATKGVQFKNNYVMHACSPS 71
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R + M+G+YP MQ VI P PL K LP+YLKE GY THA+GKWHLG+ RE
Sbjct: 72 RHSFMSGRYPFTSQMQKDVIFPVSPDCSPLKLKFLPEYLKELGYGTHAVGKWHLGYCRED 131
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
PT RGFDS YG G DY H + D N VD G +S DL+T A
Sbjct: 132 CMPTSRGFDSFYGTLDGEGDYLTHMSAGFY------DWHTNGTVDRSKSGQHSQDLHTAA 185
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL--DISDPERRTYAGMVS 258
++I + KP FLY A A N + P QA +++ K+ ++SD ++ Y G+++
Sbjct: 186 LADIIERQTEEKPFFLYFA-----AQNPHTPLQAKKKDMKKYKNQNLSDTLKK-YYGLIT 239
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LD V + L K +L N+ ++F+ADNGA GI GSN P RG KST W+GG++
Sbjct: 240 GLDGMVKTIKDTLEKKNLLNNTYIIFLADNGA-DVGI----GSNEPFRGTKSTLWEGGVK 294
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTK 377
++ I+SP LK+T + L HI+DW+PTL + AG + ++ +DG+NQ ++T K++
Sbjct: 295 SLSFIYSPLLKKTGYENDGLIHITDWMPTLVSLAGGSVPESEGIDGINQVAMVTDARKSE 354
Query: 378 RSEILHNID---NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
R ++++N+D P A+R + K + G GQ+D YG T + K +V
Sbjct: 355 RKDMVYNLDMESQFGLPSSGQIAVRDERYKLLWGFP--GQND-GYGLTTTYLAKVEEFKV 411
Query: 435 LY 436
+Y
Sbjct: 412 IY 413
>gi|449499600|ref|XP_002193905.2| PREDICTED: arylsulfatase J [Taeniopygia guttata]
Length = 488
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 166/483 (34%), Positives = 237/483 (49%), Gaps = 67/483 (13%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R PT RGF
Sbjct: 23 YQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGF 82
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDLYTEAAINV 204
DS +G G DYY H K G D+ N DN GIYST +YT+ +
Sbjct: 83 DSFFGSLLGSGDYYTH-FKCDSPGICGYDLYENNNAAWDHDN---GIYSTQMYTQKVQQI 138
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
+A HN KP+FLY+A+ AVH+ P QAP + I++ RR YA M++ LDE++
Sbjct: 139 LASHNPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSINNINRRRYAAMLACLDEAI 193
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
NV ALRK+G ENSI+++ +DNG GSN PLRG K T W+GG+R V +
Sbjct: 194 NNVTLALRKYGYYENSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFVH 249
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILH 383
SP LK V EL HI+DW PTL A +I+ D LDG + W+ +++G ++
Sbjct: 250 SPLLKNKGSVCKELVHITDWFPTLITLAEGQIDEDIQLDGYDVWETISEGRRSP------ 303
Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
RVD L + ++ W + GI
Sbjct: 304 ---------------RVDILHNIDPIYTKAKNGSWA-----------------AGYGIWN 331
Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTVK 502
A+++ +++ SD + + F+ N + N+ ++
Sbjct: 332 TAIQSAIRVNHWKLLTGNPGYSD----------WVPPQAFSNAGPNRWHNERVSWSAGKT 381
Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
LFNIT DP E+ +L+ +++KQL +L+ + T VP P D R++P +W
Sbjct: 382 VWLFNITADPYERVDLSARHPEVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNGGVWG 441
Query: 563 PWY 565
PW+
Sbjct: 442 PWF 444
>gi|449265842|gb|EMC76972.1| Arylsulfatase J, partial [Columba livia]
Length = 467
Score = 255 bits (652), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 165/484 (34%), Positives = 239/484 (49%), Gaps = 67/484 (13%)
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R PT RG
Sbjct: 1 RYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAIN 203
FD+ +G G DYY H K G D+ N DN GIYST +YT+
Sbjct: 61 FDTFFGSLLGSGDYYTH-FKCDSPGICGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQ 116
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
++A HN SKP+FLY+A+ AVH+ P QAP + + I++ RR YA M++ LDE+
Sbjct: 117 ILASHNPSKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEA 171
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+ NV AL+K+G +NSI+++ +DNG GSN PLRG K T W+GG+R V +
Sbjct: 172 INNVTLALKKYGYYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFV 227
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEIL 382
SP LK V EL HI+DW PTL A +I+ D LDG + W+ +++G ++
Sbjct: 228 HSPLLKNKGSVCKELVHITDWFPTLITLAEGQIDEDIQLDGYDIWETISEGRRSP----- 282
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
RVD L + ++ W + GI
Sbjct: 283 ----------------RVDILHNIDPIYTKAKNGSWA-----------------AGYGIW 309
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTV 501
A+++ +++ SD + + F+ N + N+ +
Sbjct: 310 NTAIQSAIRVNHWKLLTGNPGYSD----------WVPPQAFSNAGPNRWHNERVSWTAGK 359
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
LFNIT DP E+ +L+ D++KQL +L+ + T VP P D R++P +W
Sbjct: 360 TVWLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNGGVW 419
Query: 562 VPWY 565
PW+
Sbjct: 420 GPWF 423
>gi|432104048|gb|ELK30879.1| Arylsulfatase J [Myotis davidii]
Length = 496
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 193/341 (56%), Gaps = 26/341 (7%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH VI +P LPL LPQ L+EAGYATH +GKWHLGF+R PT RGF
Sbjct: 31 YQIHTGLQHSVIRPAQPNCLPLGHATLPQKLREAGYATHMVGKWHLGFYRRECMPTHRGF 90
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
D+ +G G DYY H K G D+ N + + G+YST +YT ++A
Sbjct: 91 DTFFGSLLGSGDYYTH-YKCDGPGACGYDLYENDRAAWDHDDGVYSTHMYTRRVQQILAA 149
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H+ KP+FLY+A+ AVH+ P QAP + I++ RR YA M+S LDE+VGNV
Sbjct: 150 HDPGKPLFLYIAYQAVHS-----PLQAPSRYFRHYRSIANTHRRRYAAMLSCLDEAVGNV 204
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
ALR HG+ NSI+++ +DNG G S GSN PLRG K T W+GG+R V + SP
Sbjct: 205 TLALRAHGLYNNSILIYSSDNG----GQPSAGGSNWPLRGSKGTYWEGGIRAVGFVHSPL 260
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
L+Q V EL HI+DW PTL A A I D LDG + W+ +++G ++ R +ILHNID
Sbjct: 261 LRQRGTVCRELVHITDWYPTLVALAEGHIEEDAHLDGYDVWETISEGLRSPRVDILHNID 320
Query: 387 NVDNPQKY--------------YAALRVDDLKYVAGTDNNG 413
+ K+ +A+RV K + G G
Sbjct: 321 PIYTKAKHGSWAAGYGIWNTAVQSAIRVQHWKLLTGNPGYG 361
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/63 (38%), Positives = 36/63 (57%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L+ +++QL +L+ + T VP P D R++P +W PW
Sbjct: 392 LFNITADPYERVDLSGRYPGVVRQLLRRLSQFNRTAVPVRYPPGDPRSNPRLNGGVWGPW 451
Query: 565 YDE 567
Y E
Sbjct: 452 YIE 454
>gi|355691424|gb|EHH26609.1| Arylsulfatase B, partial [Macaca mulatta]
Length = 430
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 181/301 (60%), Gaps = 14/301 (4%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT RGFD+
Sbjct: 2 IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
++GY G +DYY H + LD R +V +YST+++T+ A +I
Sbjct: 62 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALIT 121
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +DE+VGN
Sbjct: 122 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 176
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV + SP
Sbjct: 177 VTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASP 232
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
LKQ + EL HISDWLPTL AG N T LDG + W +++G+ + R E+LHNI
Sbjct: 233 LLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWKTISEGSPSPRVELLHNI 292
Query: 386 D 386
D
Sbjct: 293 D 293
>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 500
Score = 255 bits (651), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 178/548 (32%), Positives = 273/548 (49%), Gaps = 80/548 (14%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHII I+ DDLGWNDV ++ + TP ID LA +G+ L +YV + C PSR+ LMTG
Sbjct: 21 KPPHIIFIVVDDLGWNDVGYN-NPVFKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTG 79
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
++ I +G+QH P LPL E + + LK GY+TH +GKWH GF+ + P RG
Sbjct: 80 RHAIQLGIQHDD-YGFHPRSLPLNETTIAEPLKHVGYSTHIVGKWHCGFYSDNCLPHNRG 138
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAAI 202
FD+ +G+ D+Y HS + HN++ + K IG YST ++
Sbjct: 139 FDTFFGFVGAGIDHYTHS--------DHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGK 190
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLD 261
++I H+++KP+FLYL+ AVHA P + P + ++ I D +RRTYA M S +D
Sbjct: 191 DIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRTYAAMTSCVD 245
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E++ N+ AL + ML+NSI++F +DNG + ++ G+N PLRG K T W+GG+R +
Sbjct: 246 EAIANITEALEEKNMLKNSIIVFTSDNGG---ALVNSAGNNWPLRGGKHTSWEGGVRAIG 302
Query: 322 AIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
++ L + + + + L HI+DW PTL AG + L GV+QW+++T K+ R
Sbjct: 303 FVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHVGKKLYGVDQWNMITGKDKSGR 362
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
++L + Q+ Y A Y G N D + +
Sbjct: 363 DDVLITL------QEDYKA----GAHYRTGIYKNNAFD------------------ITTH 394
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALR--TILTDEKILELREFARVRCNYDNKGAH 496
A I K K P SD ++ I +D +I+E +F R
Sbjct: 395 AAIKMG--------KWKLLTGPIGGPSDWVKPQEINSDHRIIEDPDFDATRI-------- 438
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
V+ L+NI DDP E +L+E ++ ++ KL + +P ++ + RAD
Sbjct: 439 ----VR--LYNIEDDPTETTDLSEINQHIVIEMLAKLHNFSKQSLPAQSRIY-PRADFDG 491
Query: 557 WNNIWVPW 564
PW
Sbjct: 492 MGECISPW 499
>gi|402871949|ref|XP_003899908.1| PREDICTED: arylsulfatase B-like, partial [Papio anubis]
Length = 316
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 127/263 (48%), Positives = 171/263 (65%), Gaps = 10/263 (3%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + PH++ +LADDLGWNDV FHGS +I TP++DALA G++L+ +Y Q LCTPSRS L+
Sbjct: 42 ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
RGFD+++GY G +DYY H + LD R +V +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+I H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275
Query: 262 ESVGNVIAALRKHGMLENSIVLF 284
E+VGNV AAL+ G+ N++ +F
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIF 298
>gi|327291280|ref|XP_003230349.1| PREDICTED: arylsulfatase J-like, partial [Anolis carolinensis]
Length = 471
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 167/490 (34%), Positives = 247/490 (50%), Gaps = 69/490 (14%)
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
L++ +Y IH G+QH VI +P LPL LPQ LKEAGY+TH +GKWHLGF+R+
Sbjct: 2 VLVSSRYQIHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECM 61
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYT 198
PT RGFD+ +G G DYY H K G D+ N DN GIYST +YT
Sbjct: 62 PTQRGFDTFFGSLLGSGDYYTHY-KCDSPRMCGYDLYEN---DNAAWDHDNGIYSTQMYT 117
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ ++A HN KP+FLY+A+ AVH+ P QAP ++ I++ RR YA M++
Sbjct: 118 QKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPGTYYERYRSINNINRRRYAAMLA 172
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LDE++ NV AL+++G NS++++ +DNG P+ G G+N PLRG K T W+GG+
Sbjct: 173 CLDEAINNVTLALKRYGYYNNSVIIYSSDNGGQPTAG-----GNNWPLRGSKGTYWEGGI 227
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
R V + SP LK V EL HI+DW PTL A +I+ DT LDG + W+ +++ +
Sbjct: 228 RAVGFVHSPLLKNKGCVCKELIHITDWFPTLITLAEGQIDEDTLLDGYDVWETISE---S 284
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
+RS R+D L + ++ W
Sbjct: 285 RRSP------------------RMDILHNIDPIYTKAKNGSWA----------------- 309
Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR-CNYDNKGA 495
+ GI A+++ +++ SD + + F+ V + N+
Sbjct: 310 AGFGIWNTAIQSAIRVNHWKLLTGNPGYSD----------WVPPQSFSNVGPSRWHNERV 359
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
+ LFNIT DP E+ +L+ D++KQL +L+ + T VP P D R++P
Sbjct: 360 SWTAGKTVWLFNITADPYERVDLSSKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPN 419
Query: 556 RWNNIWVPWY 565
+W PW+
Sbjct: 420 FNGGVWGPWF 429
>gi|355750020|gb|EHH54358.1| Arylsulfatase B, partial [Macaca fascicularis]
Length = 430
Score = 254 bits (650), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 136/301 (45%), Positives = 181/301 (60%), Gaps = 14/301 (4%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
I G+QH +I +P +PL EKLLPQ LKEAGY TH +GKWHLG +R+ PT RGFD+
Sbjct: 2 IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
++GY G +DYY H + LD R +V +YST+++T+ A +I
Sbjct: 62 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALIT 121
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H KP+FLYLA +VH EP Q P+E + + I D R YAGMVS +DE+VGN
Sbjct: 122 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 176
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V AAL+ G+ N++ +F DNG + G+N PLRG K + W+GG+RGV + SP
Sbjct: 177 VTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASP 232
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
LKQ + EL HISDWLPTL AG N T LDG + W +++G+ + R E+LHNI
Sbjct: 233 LLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWKTISEGSPSPRMELLHNI 292
Query: 386 D 386
D
Sbjct: 293 D 293
>gi|390336076|ref|XP_784530.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 545
Score = 254 bits (650), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 154/370 (41%), Positives = 213/370 (57%), Gaps = 39/370 (10%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QHGVI +P LP+ E LPQ LKE GY T+ +GKWH+GF+ + TPT RGF
Sbjct: 122 YWIHTGLQHGVIRHAQPNCLPVDEVTLPQKLKENGYTTNMVGKWHIGFYLDACTPTERGF 181
Query: 149 DSHYGYWQGLQDYYDH--SCKA------TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
DS +G+ G +D+Y SC+A + + G D+ N G YST L+T
Sbjct: 182 DSFFGFLIGAEDHYTRMLSCEAYNANDGSTKTLSGHDLHANKTPVFDYEGQYSTHLFTNK 241
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
++VI H+K+KP+F+YLA+ AVHA P Q PD + + +I+D RRTYAGMVS +
Sbjct: 242 TVDVIERHDKTKPLFIYLAYQAVHA-----PLQVPDSYMDPYKNIADKTRRTYAGMVSCM 296
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH-PLRGMKSTPWDGGMRG 319
DE +GNV AL+ G+ +N+I++F DNG H ++G N+ PLRG K + W+GG+ G
Sbjct: 297 DEGIGNVTQALKDAGLYDNTIIIFSTDNGG-----HIDRGGNNWPLRGSKGSMWEGGIHG 351
Query: 320 VAAIWSPWLKQTQK--VSSELFHISDWLPTLCAA-AGIEINDTS-LDGVNQWDVLTKGAK 375
V + SP L K V+ EL H+SDWLPT+ A AG +N T LDG N W + A
Sbjct: 352 VGFVHSPLLPDAVKGTVNHELMHVSDWLPTIVAGIAGGALNGTKPLDGYNVWSSIRGTAS 411
Query: 376 TKRSEILHNIDNVDNPQK-----------YYAALRVDDLKYVAGTDNNGQS-DEWYGDTD 423
+ ++E+LHNI NP+K AALRV D K G + + +EW +
Sbjct: 412 SPQTELLHNI----NPKKKRPENEKFDTSMKAALRVGDWKIKTGANYDVTVFNEWMPPPE 467
Query: 424 NEIDKYSPKE 433
+ +D P E
Sbjct: 468 SGLDTIYPTE 477
Score = 79.0 bits (193), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 35/62 (56%), Positives = 44/62 (70%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QHGVI +P LP+ E LPQ LKE GY T+ +GKWH+GF+ + TPT RGFDS
Sbjct: 38 IHTGLQHGVIRHAQPNCLPMDEVTLPQKLKENGYTTNMVGKWHIGFYLDACTPTERGFDS 97
Query: 151 HY 152
+
Sbjct: 98 FF 99
Score = 45.8 bits (107), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 30/60 (50%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP EQ +LAE ++ +L KL Y VP D ADPA W PW
Sbjct: 485 LFNITADPLEQTDLAEKYPGVVDELVHKLEEYYKGSVPMRYPDEDLEADPALHGGSWGPW 544
>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
obscuriglobus UQM 2246]
Length = 459
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 151/406 (37%), Positives = 216/406 (53%), Gaps = 26/406 (6%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
F + LL + AP +P ++ +LADDLG D F G +I TPNID +A G L
Sbjct: 4 FTVLLFLLAPASVARAEAPAQPSVVFLLADDLGREDCGFMGGKEIKTPNIDKIAAAGATL 63
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
+ Y Q +C+P+R+A MTG+YP+ G+Q GV+ +GLPL E+ + Q LK+AGY T
Sbjct: 64 DAFYAQPVCSPTRAAFMTGRYPMRHGLQVGVVRPWAQYGLPLDERTVAQGLKDAGYTTAV 123
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
IGKWHLG F Y PT RGFD YG++ G DY+ H G D + +V++
Sbjct: 124 IGKWHLGHFAPEYLPTKRGFDHQYGHYNGALDYFTHIRDG------GFDWHRDDKVNSDE 177
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
YST L + A+ + H KP FLY+ AVHA P Q P+E + + +
Sbjct: 178 --GYSTHLVAKDAVQFVQTHAGKKPFFLYVPFNAVHA-----PHQVPEEYLKPYGHLKG- 229
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
ER+ YAGM++ +DE+VG ++AA+ K G+ +N++ +F++DNG P G+ + SN R
Sbjct: 230 ERQKYAGMLAAMDEAVGQIVAAVEKAGVRKNTLFVFLSDNGGPQPGVVT---SNGKFRAG 286
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG-IEINDTSLDGVNQW 367
K T ++GG+R A + +E H+ DW PTL G +DG++ W
Sbjct: 287 KGTLYEGGVRVAACAAFDGHIRPGTTITEPLHVVDWYPTLLKLGGPTGEQQLPVDGLDIW 346
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA--GTDN 411
VLT G + IL N P A+R D K V GTD+
Sbjct: 347 PVLTGGKPSPHGAILLNT----TPNT--GAVRAGDWKLVVKNGTDD 386
>gi|350587859|ref|XP_003129285.2| PREDICTED: arylsulfatase J [Sus scrofa]
Length = 539
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 167/491 (34%), Positives = 247/491 (50%), Gaps = 69/491 (14%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+R+ PT RGF
Sbjct: 75 YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 134
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDLYTEAAINV 204
D+ +G G DYY H K G D+ N DN GIYST +YT+ +
Sbjct: 135 DTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENENAAWDYDN---GIYSTQMYTQRVQQI 190
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
+A H+ +P+FLY+A+ AVH+ P QAP + I + RR YA M+S LDE++
Sbjct: 191 LASHDPKRPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 245
Query: 265 GNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+R V +
Sbjct: 246 NNVTLALKMYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAVGFV 300
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEIL 382
SP LK V EL HI+DW PTL + A +I+ D LDG + W+ +++G ++ R
Sbjct: 301 HSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRSPR---- 356
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
+D + N Y + + + AG GI
Sbjct: 357 --VDILHNIDPIYT--KAKNGSWAAGY------------------------------GIW 382
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTV 501
A+++ ++++ SD + + F+ + N + N+ ++
Sbjct: 383 NTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERITLSTGK 432
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
LFNIT DP E+ +L+ ++KQL +L+ + T VP P D R++P +W
Sbjct: 433 SVWLFNITADPYERVDLSNRYPGVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 492
Query: 562 VPWYDELDKQK 572
PWY E +K++
Sbjct: 493 GPWYKEENKKQ 503
>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 512
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 151/373 (40%), Positives = 207/373 (55%), Gaps = 21/373 (5%)
Query: 26 APKKPHIIIILADDLGWNDVSFHG---SSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
A KKPH+++ILADDLG+NDV + G S TP +D+LA NG+ L +Y ++C+P+R
Sbjct: 25 ATKKPHVVLILADDLGYNDVGYWGQNHGSAAKTPFLDSLAENGVRLENYYTHSVCSPTRG 84
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
ALMTG+ I IG+ HG+I + GLPL LLP+ L GY T IGKWHLGF Y
Sbjct: 85 ALMTGRNRIDIGLAHGIIHTTQIEGLPLDNVLLPEQLSNCGYNTQMIGKWHLGFSSSKYA 144
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSC-KATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEA 200
P RGF YG+ G ++Y+ A G+D + N+T G YS +Y
Sbjct: 145 PWNRGFHGFYGFLAGSENYWSKWLPMARHSNIGGVDFTDSTTGPTNETWGQYSAHVYASR 204
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A VI H++SKP+FLYL H P AP F DI D +R Y MVS L
Sbjct: 205 ARYVIQHHDQSKPLFLYLPLQTPHT-----PLGAPSHYYEPFKDIEDDDRMKYLSMVSVL 259
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE+V NV L+ GM E+++++F DNG G + G+N P RG K+T ++GG++ V
Sbjct: 260 DETVRNVTNYLKDAGMWEDTLLIFSTDNG----GEVKHGGNNWPYRGTKNTLYEGGVKAV 315
Query: 321 AAIWSPWLKQ---TQKVSSELFHISDWLPTLCAAAGIEI----NDTSLDGVNQWDVLTKG 373
+ L Q+V+ EL H++DW PT+ AAA + L G+NQW L
Sbjct: 316 GFVHGKILGDLSPAQRVNRELIHVTDWYPTIMAAADCPYVSGNSRNKLYGLNQWGALRGE 375
Query: 374 AKTKRSEILHNID 386
++R+EI+H ID
Sbjct: 376 RGSQRTEIVHTID 388
>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
Length = 418
Score = 253 bits (647), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/292 (46%), Positives = 177/292 (60%), Gaps = 14/292 (4%)
Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ 159
I +P +PL EKLLPQ LKEAGY TH +GKWHLG FR+ PT RGFD+++GY G +
Sbjct: 1 IWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMFRKECLPTRRGFDTYFGYLLGSE 60
Query: 160 DYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMF 215
DYY H + LD R +V +YST+++TE A +IA H KP+F
Sbjct: 61 DYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIANHPPEKPLF 120
Query: 216 LYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHG 275
LYLA +VH EP Q P+E + + I D R YAGMVS +DE+VGNV AAL+ HG
Sbjct: 121 LYLALQSVH-----EPLQVPEEYLKPYKFIQDKNRHHYAGMVSLMDEAVGNVTAALKSHG 175
Query: 276 MLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS 335
+ N++ +F DNG + G+N PLRG K + W+GG+RGV + SP LKQ +
Sbjct: 176 LWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVGFVASPLLKQKGIKN 231
Query: 336 SELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
EL HISDWLPTL AG N T LDG + W +++G+ + R E+LHNID
Sbjct: 232 RELIHISDWLPTLVTLAGGSTNGTKPLDGFDVWKTISEGSPSPRMELLHNID 283
>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 469
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 181/549 (32%), Positives = 263/549 (47%), Gaps = 115/549 (20%)
Query: 26 APKKPHIIIILADDLGWNDVSFHG-SSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
+P PHI+ ++ DDLG+NDV +H S I T NIDALA GL L +YV LCTPSRS
Sbjct: 25 SPPTPHIVFVMVDDLGYNDVGYHSDGSAIETDNIDALAAGGLKLESYYVAPLCTPSRSQF 84
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
++GKY IH GMQH VI P LPL + + L +AGYATH +GKWHLGF+++ P
Sbjct: 85 LSGKYLIHNGMQHLVIDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWPL 144
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGF S +G M + T+
Sbjct: 145 NRGFQSFFG----------------------------MLLGQSTV--------------- 161
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDES 263
+ +KPMF+YL++ A H PF+ PDE + I++ RRTYAGMV+ LDES
Sbjct: 162 --DQGGTKPMFMYLSYQAPHL-----PFEVPDEYFVSYRGKINNRNRRTYAGMVTMLDES 214
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+G + L++ G+ +++ +F DNG N G+N PLRG+K ++GG+RGV +
Sbjct: 215 IGKLTDTLKEEGLWNDTVFIFSTDNGGVG---KKNAGNNWPLRGVKGNYFEGGIRGVGFV 271
Query: 324 WSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDT--SLDGVNQWDVLTKG-AKTK 377
P L + +S++L HISDW PTL A + +N T LDGVN WDV+++G +
Sbjct: 272 AGPLLSTNVQGTISTDLMHISDWYPTLVEGVAKVTLNHTELGLDGVNMWDVISQGESGDP 331
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
EI++ N+D + A++ D++ D YS +
Sbjct: 332 DREIVY---NIDPDMQRPASI------------------------DSKYDSYSTIFDISI 364
Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
A I + +++ + AD +D L + + FA V K
Sbjct: 365 HASIR----RGDMKLVTGYQAD-----TDWLAPVESG--------FANVYATDAGKAI-- 405
Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY--KSTMVPPGNKPFDKRADPA 555
L+NIT DP EQ++L+++ D + L +L Y S P N D +DPA
Sbjct: 406 ------WLYNITADPTEQDDLSDTMADEVVDLLGRLQSYYTGSRFEAPANPGIDCDSDPA 459
Query: 556 RWNNIWVPW 564
++W W
Sbjct: 460 LHGDLWTNW 468
>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 843
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/365 (39%), Positives = 209/365 (57%), Gaps = 29/365 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHII I+ DDLGWNDV ++ + TP ID LA +G+ L +YV + C PSR+ LMTG
Sbjct: 364 KPPHIIFIVVDDLGWNDVGYN-NPVFKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTG 422
Query: 88 KYPIHIGMQ-HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
++ I +G+Q HG P LPL E + Q LK+ GY+TH IGKWH GF+ + P R
Sbjct: 423 RHAIQLGIQRHG--FGYHPRSLPLDETTIAQPLKQVGYSTHIIGKWHCGFYSDNCLPHNR 480
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAA 201
GFD+ +G+ ++Y HS + HN++ + + IG YST +Y
Sbjct: 481 GFDTFFGFVGAGIEHYTHS--------DHFNHMHNLRKNDDCIAKQYIGKYSTTIYANEG 532
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRL 260
N+I H+++KP FLYL+ AVH P + P + ++ I D +RRTYA M S +
Sbjct: 533 KNIINAHDQNKPFFLYLSFSAVHT-----PLEVPSSYLKQYESTIYDEDRRTYAAMTSCV 587
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE++ N+ AL + ML+NSI++F +DNG + + GSN PLRG K T W+GG++ V
Sbjct: 588 DEAIANITEALEEKDMLKNSIIVFTSDNGG---AVERSAGSNWPLRGGKHTIWEGGVKAV 644
Query: 321 AAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
++ L + + L HI+DW PTL AG + L GV+QW ++T K++
Sbjct: 645 GFVYGELLPHHSRGTENTGLMHITDWFPTLVTLAGANPHVGKKLYGVDQWKMVTGEGKSR 704
Query: 378 RSEIL 382
R ++L
Sbjct: 705 RDDVL 709
>gi|348564635|ref|XP_003468110.1| PREDICTED: arylsulfatase J-like [Cavia porcellus]
Length = 543
Score = 252 bits (643), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 166/486 (34%), Positives = 245/486 (50%), Gaps = 69/486 (14%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF+++ PT RGF
Sbjct: 78 YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYKKECMPTKRGF 137
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINV 204
D+ +G G DYY H K G D+ N DN GIYST +YT+ +
Sbjct: 138 DTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDHDNGIYSTQMYTQRVQQI 193
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
+A HN +KP+FLY+A+ AVH+ P QAP + I + RR YA M+S LDE++
Sbjct: 194 LASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 248
Query: 265 GNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
NV AL+ +G NSI+++ +DNG P+ G GSN PLRG K T W+GG+R V +
Sbjct: 249 HNVTLALKAYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAVGFV 303
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEIL 382
SP LK V EL HI+DW PTL + A +I++ T LDG + W+ +++G ++ R
Sbjct: 304 HSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEHTQLDGYDVWETISEGFRSPR---- 359
Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
+D + N Y + + + AG GI
Sbjct: 360 --VDILHNIDPIYT--KAKNGSWAAGY------------------------------GIW 385
Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTV 501
A+++ ++++ SD + + F+ + N + N+ ++
Sbjct: 386 NTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERITLSTGK 435
Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
LFNIT DP E+ +L+ ++K+L +L+ + T VP P D R++P +W
Sbjct: 436 SIWLFNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 495
Query: 562 VPWYDE 567
PWY E
Sbjct: 496 GPWYKE 501
>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 726
Score = 251 bits (642), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 186/587 (31%), Positives = 290/587 (49%), Gaps = 92/587 (15%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQI-PTPNIDALAYNGLILNQHYVQALCTPSR 81
+++ P++ I+ DD+GWND+ +H + TPN+D L+ +G+ ++Q+Y ++CTP+R
Sbjct: 105 SSSTADNPNVFFIMIDDMGWNDIGYHSTDLANVTPNLDRLSASGVKVSQYYSMSICTPAR 164
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
+ALMTG+YP+ G+Q+ VI G PWGLPLTEKLLP+Y+ EAGY +H +GKWHLG + +
Sbjct: 165 AALMTGRYPVRYGLQYNVIQPGAPWGLPLTEKLLPEYMNEAGYESHMVGKWHLGSYTHAH 224
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSC----------------KAT--------FEPYQGLD 177
TP RGF++++GY + Y+ H AT F+P G D
Sbjct: 225 TPHRRGFETYFGYLNDEEMYWTHQTWTATINGRKFFDFGFGNATGFYDVIERFDPPPGDD 284
Query: 178 ----MRHNMQVDNKTI---GIYSTDLYTEAAINVIAEH--NKSKPMFLYLAHLAVHAGNT 228
V + ++ G YST+++T+ A+ ++++ + P+FLYL+H AVH
Sbjct: 285 DLVSTGPTSSVYSSSLEIKGDYSTEIFTDRALEILSQKTPHDENPLFLYLSHQAVHDPLG 344
Query: 229 YEPFQA-PDEEVAKFLDISDPE-----RRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
P A EE A + D R +A ++ LD+S+G ++ L G LENSI+
Sbjct: 345 VPPADAFSPEEWAVLTALEDESSDASLRVRFAKVLMYLDKSIGRLVDYLETEGWLENSII 404
Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE---LF 339
+ +DNG G S GSN+PLRG+K + W+GG++ A ++SP + +E L
Sbjct: 405 VVASDNG----GCPSTGGSNYPLRGLKHSNWEGGVKVPAFVYSPSHIPEEHWGTEYRGLM 460
Query: 340 HISDWLPTLCAAAGIEI--NDTSLDGVNQWDVLT------KGAKTKRSEILHNIDNVDNP 391
H++DWLPTL AA E+ + LDGV+ W + ++ E +++
Sbjct: 461 HVTDWLPTLADAANFELLGSHGDLDGVSHWKHIVPAPGDDTATVSESPEAAAAVEDNRLK 520
Query: 392 QKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS------KAGIT--- 442
+ A VD G G S YG E+ + +EVLY+ A IT
Sbjct: 521 RARTATAAVDGGGGGGGGGGGGSS---YG--LGEVSYGARREVLYNFDPYVLGANITDDI 575
Query: 443 -----------FNALKTKLQIKQKHAADPKANSSDALRTILTDE--KILELREFARVRCN 489
F K K + ++ L LTDE EL E ++
Sbjct: 576 GEADFTEVQGAFRQGKWKFMFNSWCSGYYAFDNKTLLADDLTDEDNTCQELGECSQC--- 632
Query: 490 YDNKGAHCNS---TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKL 533
GA+C S LF++ DDP E NNLA ++++ L ++
Sbjct: 633 ----GANCGSLSIDYTDWLFDLEDDPRELNNLANEHPEVVESLRRRV 675
>gi|345326826|ref|XP_003431088.1| PREDICTED: arylsulfatase J [Ornithorhynchus anatinus]
Length = 504
Score = 251 bits (642), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 167/503 (33%), Positives = 251/503 (49%), Gaps = 69/503 (13%)
Query: 77 CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
C + + KY IH G+QH +I +P LPL LPQ LKE GY+TH +GKWHLGF
Sbjct: 24 CPQNEGSCFQQKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGF 83
Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIY 192
+R+ PT RGFD+ +G G DYY H K G D+ N + DN GIY
Sbjct: 84 YRKECMPTQRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDDAAWEYDN---GIY 139
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
ST +YT+ ++A H+ +P+FLY+A+ AVH+ P QAP + I + RR
Sbjct: 140 STQMYTQRVQQILASHDPRRPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRR 194
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKST 311
YA M+S LDE++ NV AL+++G +NSI+++ +DNG P+ G GSN PLRG K T
Sbjct: 195 YAAMLSCLDEAIHNVTLALKRYGFYDNSILIYSSDNGGQPTAG-----GSNWPLRGSKGT 249
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVL 370
W+GG+R V + SP LK V EL HI+DW PTL A ++++ LDG + W+ +
Sbjct: 250 YWEGGIRAVGFVHSPLLKNKGTVCRELVHITDWYPTLITLAEGQLDEGVRLDGYDVWETI 309
Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
++ +RS RVD L + ++ W
Sbjct: 310 ---SEGRRSP------------------RVDILHNIDPIYTKAKNGSWA----------- 337
Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN- 489
+ GI A+++ ++++ S+ + + F+ + N
Sbjct: 338 ------AGFGIWNTAIQSAIRVRHWKLLTGNPGFSE----------WVPPQAFSNLGPNR 381
Query: 490 YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD 549
+ N+ ++ LFNIT DP E+ +L+ ++KQL +L+ + T VP P D
Sbjct: 382 WHNERISWSAGKSVWLFNITADPYERVDLSARYPGVVKQLLRRLSQFNKTAVPVRYPPRD 441
Query: 550 KRADPARWNNIWVPWYDELDKQK 572
R++P +W PWY E +K+K
Sbjct: 442 PRSNPRLNGGVWGPWYREEEKKK 464
>gi|442749327|gb|JAA66823.1| Putative arylsulfatase b precursor [Ixodes ricinus]
Length = 274
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 123/226 (54%), Positives = 156/226 (69%), Gaps = 2/226 (0%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+II ILADDLGW+DVSFHGS QIPTPN+DALA +G+ILNQHY QALCTPSR+ALMTG+Y
Sbjct: 48 PNIIFILADDLGWDDVSFHGSPQIPTPNMDALAADGIILNQHYAQALCTPSRAALMTGRY 107
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PI+ GMQH VI GEPWGLPL +L+P++ + GY TH +GKWHLG F++ + P RGFD
Sbjct: 108 PIYTGMQHFVIQPGEPWGLPLEYRLMPEFFSDLGYKTHMVGKWHLGSFKKDFIPVRRGFD 167
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
S YG++ QDYY+ + T + G D N + YST YTE A+++I HN
Sbjct: 168 SFYGFYNADQDYYNKTL--TEGEHTGYDFWLNEDIHIYPNNRYSTHHYTERAVSLIRSHN 225
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
S+P+FLYL++ G +AP+E V KFL I + R YAG
Sbjct: 226 PSQPLFLYLSYXXXXVGTGPSLLEAPEENVNKFLYIPEKNRTIYAG 271
>gi|403049780|ref|ZP_10904264.1| sulfatase [SAR86 cluster bacterium SAR86D]
Length = 515
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 153/375 (40%), Positives = 212/375 (56%), Gaps = 41/375 (10%)
Query: 21 FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
F + +P++IIIL DDLGW DVS+H + IPTPNID+ NG+ LN+ Y C+P+
Sbjct: 9 FFTLSLCSQPNVIIILTDDLGWGDVSYH-NGFIPTPNIDSFVSNGIELNRFYANPTCSPT 67
Query: 81 RSALMTGKYPIHIGMQHGVIL-----EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
R++L+TG +HI HGVI E GLP K++P+Y KEAGY T GKWHLG
Sbjct: 68 RASLLTG---LHI-FNHGVIRPFMNPSAEQTGLPEHLKIMPEYFKEAGYQTALSGKWHLG 123
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYST 194
+E Y PT RGFDS YG+ G YYDH + ++ G+ + + YST
Sbjct: 124 MHKEEYLPTNRGFDSSYGHMLGGIGYYDHVHTNRMDWHRDGVSLNEDG---------YST 174
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
+L + AIN+I + +P+FLY+A A H P +AP+E+V FL I D R YA
Sbjct: 175 ELIADEAINIIKNKDDDRPLFLYVAFNAPHT-----PIEAPEEDVNNFLYIEDELDRNYA 229
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-----------GIHSNKGSNH 303
+S+LD +G +I +++ G+LE +I++F++DNG P F G+ KGS
Sbjct: 230 ANISKLDIEIGRIINSIKDLGILEETIIIFLSDNG-PVFDINPIVKTIAPGLTKAKGSTA 288
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELF-HISDWLPTLCAAAGIEINDTSL- 361
LRG K++ DGG+R AAIW W +K S+ F I D LPTL A GI ++ L
Sbjct: 289 GLRGSKTSALDGGIRVPAAIW--WKGVLEKSKSDQFIFIQDLLPTLLTATGIGYDENLLI 346
Query: 362 DGVNQWDVLTKGAKT 376
DGV++W L T
Sbjct: 347 DGVDRWSNLITNKIT 361
>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
Length = 318
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/333 (42%), Positives = 196/333 (58%), Gaps = 20/333 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KP+ + I+ADDLG++D+ I TPN+DALA G+IL +YVQALC+PSR ALMTG
Sbjct: 1 RKPNFVFIMADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCSPSRHALMTG 59
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+YP MQ V+L E L KLLPQYLKE GY H IGKWHLG+ RE PT RG
Sbjct: 60 RYPYKSAMQSFVVLPFEAKCTGLEYKLLPQYLKELGYENHLIGKWHLGYCREECLPTSRG 119
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FDS YG G DY++H+ + D N +V ++ G +S DL + + AE
Sbjct: 120 FDSFYGLLDGAGDYWEHTTSGVY------DWHLNDEVFHEAYGNHSQDLELDRLDKLFAE 173
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL--DISDPERRTYAGMVSRLDESVG 265
H+ P+FLY A N + P Q +E ++ + D SD RR Y + S LD VG
Sbjct: 174 HDNKDPLFLYFA-----PQNPHVPSQPTEEYLSMYSADDFSDVRRR-YLALTSALDSMVG 227
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
++ L+ + M++N+ ++FM+DNGA +G N P RG K++ ++GG + + I+S
Sbjct: 228 KIVQHLKDNCMMDNTYLIFMSDNGA-----DPGEGQNTPFRGGKTSLFEGGTKSNSFIYS 282
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
LK+T+ + L HI+DWLPTL AG +I D
Sbjct: 283 QLLKKTEYENDGLMHITDWLPTLVKLAGGDIED 315
>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
Length = 370
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 150/372 (40%), Positives = 208/372 (55%), Gaps = 25/372 (6%)
Query: 21 FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
FL + P+II ILADDLG+NDV F + I TPNID LA G+++ Y +CTPS
Sbjct: 2 FLFPGKNRPPNIIFILADDLGYNDVGFR-NPDIITPNIDKLARKGVVMTNSYSTHVCTPS 60
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R ALMTG+YP GMQ+ VI P L K LPQYLK GY THA+GKWHLG R+
Sbjct: 61 RHALMTGRYPYKTGMQNFVIPGDAPVCSGLEYKFLPQYLKSLGYNTHAVGKWHLGDCRDE 120
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
PT RGFDS YG G Y++H T+ + D +N +D G +S DL +
Sbjct: 121 CLPTERGFDSFYGLLLGGGGYWNH----TYTLFGAYDWFNNKDLDLSANGTHSQDLMVDR 176
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-----DISDPERRTYAG 255
V A HN+ +PMFLY A H P E FL + RR Y G
Sbjct: 177 LSAVFASHNREEPMFLYFAPQNPH---------TPSEATENFLRHYPSSMYTEIRRQYLG 227
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
+VS LD VG ++ + + ML+N+ ++F +DNG + + N P RG K + ++G
Sbjct: 228 LVSGLDWMVGELLELVIANNMLDNTYIIFQSDNGGDAV-----EALNGPYRGGKGSLFEG 282
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGA 374
G + V I+SP LK+ ++ HI+DW+PT+ + AG E+ +DG+NQ ++ KGA
Sbjct: 283 GSKVVNFIYSPLLKKVGYENNGWMHITDWMPTIISLAGGEVPLVEGIDGINQMSMVLKGA 342
Query: 375 KTKRSEILHNID 386
++KR+ +++NID
Sbjct: 343 ESKRTSMVYNID 354
>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
Length = 488
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 143/404 (35%), Positives = 228/404 (56%), Gaps = 41/404 (10%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY--NGLILNQHYVQALCTPSRS 82
TA +KP+++ I+ADDLG++DV + S+ + TPN+ LA+ N +L+ YV LCTP+RS
Sbjct: 26 TATQKPNVLFIMADDLGFSDVDWKDST-LHTPNLRHLAFHKNTALLSNSYVNQLCTPTRS 84
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
A MTG YP +G Q+GV L EP G+P L + +++ Y+T+ +GKWHLG+ ++ +
Sbjct: 85 AFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFL 144
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSC----KATFEPYQGLDMRHNMQV-----DNKTIGIYS 193
PT RGFD YG++ Y++HS + +GLD+ + D G+YS
Sbjct: 145 PTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYS 204
Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE-------EVAKFLDIS 246
TDL+T+ A++V+ HN SKP F++L++ AVH P Q + + A F+ S
Sbjct: 205 TDLFTDVAMSVLDNHNNSKPFFMFLSYQAVHP-----PLQVSQQSKTIGQGKEATFILRS 259
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPL 305
M++ +D ++G ++ L+ + EN++++F +DNG + FG SN PL
Sbjct: 260 HAHS---TRMLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTANFG-----ASNAPL 311
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
RG K T W+GG + + SP + +FH+ DW T+ + G+EI D+ DG+N
Sbjct: 312 RGEKDTIWEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILSITGLEI-DSYGDGIN 370
Query: 366 QWDVLTKGA-KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
QW+ L G K +R + ++NIDN + +A+R D K + G
Sbjct: 371 QWEYLKTGRPKFRRFQFVYNIDN------HGSAIRDGDYKLIVG 408
>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
Length = 497
Score = 249 bits (635), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 155/388 (39%), Positives = 210/388 (54%), Gaps = 33/388 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+I+ I++DD GW DV +HGS I TPNID LA G L Q YVQ +CTPSR+A MTG+Y
Sbjct: 55 PNIVYIISDDSGWKDVGYHGSD-IRTPNIDRLAAEGARLEQFYVQPMCTPSRAAFMTGRY 113
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
P G+Q VI + +GL L E LPQ LK+AGY T GKWHLG + Y P RGFD
Sbjct: 114 PFRYGLQTAVIPQSGTYGLALDEYPLPQVLKDAGYYTAMSGKWHLGHSKTAYWPRQRGFD 173
Query: 150 SHYGYWQGLQDYYDHSCKATFEP--YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
S YG G D++ H A P Y+ DN IG A+ VI +
Sbjct: 174 SFYGALLGEIDHFTHKA-ANGNPDWYRNNKALKEEGYDNILIG--------AEAVRVINK 224
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
H++ KP+FLYLA A H P+QAP E + + I+D RR YA M+S +D+ +G V
Sbjct: 225 HDQQKPLFLYLAFTAPHT-----PYQAPKEYLDRNSHIADESRRKYAAMISVMDDEIGKV 279
Query: 268 IAALRKHGMLENSIVLFMADNG----APSFGIHSNKGS----NHPLRGMKSTPWDGGMRG 319
+AAL K GM +N++++F +DNG A G KG N P R K T ++GG R
Sbjct: 280 VAALEKRGMRDNTLIVFHSDNGGVVNAFFAGESEVKGKLPADNGPYREGKGTTYEGGTRV 339
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
+ P + + + H+ D PTL G + + LDG++ W L +G + R
Sbjct: 340 AGLVNWPGKIKPGTIDGPM-HVVDMYPTLAGLVGATLGKNKPLDGMDMWPTLAEGKPSPR 398
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYV 406
+EI++NI+ + AA+R D K V
Sbjct: 399 TEIVYNIEPMT------AAVRQGDWKLV 420
>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
Length = 486
Score = 248 bits (633), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 146/361 (40%), Positives = 201/361 (55%), Gaps = 36/361 (9%)
Query: 47 FHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW 106
+ S I TPNIDA++ G+ L +YVQ +CTPSRS L++G+Y IH G+QH +I G P
Sbjct: 5 YKSSDIIKTPNIDAISAAGVRLENYYVQPICTPSRSQLLSGRYQIHTGLQHQLIWMGMPS 64
Query: 107 GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYD--- 163
LPL +LLP+ ++ GY T A GKWHLG+ + TP RGF + GY G +DYY
Sbjct: 65 ALPLDTELLPETMRNCGYHTMAAGKWHLGYAKTANTPWGRGFHNFTGYLGGSEDYYKKTR 124
Query: 164 ----HSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAAINVIAEHNKSKPM 214
H C G+D + + V N YS Y A N I +K KP
Sbjct: 125 CIDHHKC--------GIDQNTDGEIFGERVYNADASEYSAFKYIRQAKNYIDGRDKDKPF 176
Query: 215 FLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVGNVIAALRK 273
FLYL +VHA P +AP E +AK+ I D R+ YAGMVS +DE +GN+ + L++
Sbjct: 177 FLYLPMQSVHA-----PLEAPAEYIAKYDGIIEDKPRKIYAGMVSVMDEGIGNITSHLKE 231
Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK 333
G+ N+I +F DNG ++ G+N PLRG K+ W+GG+ G + + + +K
Sbjct: 232 SGLWNNTIFIFSTDNGGQAY----VGGNNLPLRGNKAGYWEGGIHGNGFVAGGYFEDRRK 287
Query: 334 ----VSSELFHISDWLPTLCAAAGIEI--NDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
V++EL HISDW PT+ A E N LDG++QW +L K+ R EILHNID
Sbjct: 288 GSELVNNELIHISDWYPTIMEATSCEKAQNAPDLDGMSQWKMLLDLEKSPRDEILHNIDE 347
Query: 388 V 388
+
Sbjct: 348 M 348
Score = 43.5 bits (101), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF++ DP E N +++ D+++ L D+L+ + ST VP D ++ P W PW
Sbjct: 425 LFDVIKDPTESNEISDYHEDIVEFLLDRLSYWNSTQVPINFPAVDPKSFPEYHQGFWKPW 484
>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
Length = 602
Score = 248 bits (632), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 148/395 (37%), Positives = 217/395 (54%), Gaps = 37/395 (9%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
F KKPH+I +L DDLG++D+ + I +PNIDALA + L L +HYVQ CTP
Sbjct: 35 TFSQKVLEKKPHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTP 93
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
SR+A +TG+Y I +GMQ GVI EP G+PL E LL + K+ GY T GKWHLGF+
Sbjct: 94 SRAAFLTGRYNIRMGMQSGVIRAPEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTY 153
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-------GIY 192
+ P RGFD YG++ G QD+Y H E Y G N V+N TI G Y
Sbjct: 154 KHCPQNRGFDRFYGFYLGSQDFYFHDS-GRLEAYPG-----NGDVENDTILDDFHTNGTY 207
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP---E 249
ST L+ + IN +A+H+ + P+F Y++ VH P Q ++ + + +D E
Sbjct: 208 STKLFVDDFINDLAKHDPAVPLFNYVSFQDVHG-----PLQTVNKFKKHYENKTDTWTHE 262
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
R + ++ LD +G ++ AL++ N++++F +DNG G + SN PLRG K
Sbjct: 263 RILISTKITTLDHHIGRMVTALKEKNYWNNTVLVFTSDNG----GQPNEGASNWPLRGSK 318
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEI---NDTSLDGV 364
T +DGG++ A + SP L+ K + LFH+SDW PTL +G ++ LDG
Sbjct: 319 GTIYDGGLKSRAFVASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSGCDVPRKEGRPLDGK 378
Query: 365 NQWDVLTKGAKTKRSEILHNID-----NVDNPQKY 394
++ K R+E+LH +D + D+P+ +
Sbjct: 379 IH-ELFVYKPKKARNEMLHYLDPLSQFDYDDPRHF 412
>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
Length = 333
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 143/341 (41%), Positives = 195/341 (57%), Gaps = 25/341 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KP+ + ILADDLG++D+ I TPN+DALA G+IL +YVQALCTPSR ALMTG
Sbjct: 1 RKPNFVFILADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMTG 59
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+YP MQ VIL L KLLPQYLK+ GY H +GKWHLG+ R+ PT RG
Sbjct: 60 RYPSASAMQTSVILPMRAKCTGLEYKLLPQYLKDLGYKNHMVGKWHLGYCRDECLPTSRG 119
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-----RHNMQVDNKTIGIYSTDLYTEAAI 202
FD+ YG + G DY+ H+ ++ + D+ + QV + + DL E
Sbjct: 120 FDTFYGLYAGTGDYWSHTFFGKYDWHTNADIDFEANSTHSQVRSSYMNFVLQDLEMERLD 179
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-----KFLDISDPERRTYAGMV 257
V EH+ P+FLY A N + P Q DE + KF DI RR Y G+
Sbjct: 180 KVFDEHDSKDPLFLYFA-----PQNPHTPSQPTDEFLNLYPEDKFNDI----RRKYLGLT 230
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
S LD VG ++ L ++GM+ N+ V+F++DNGA +G N P RG KS+ ++GG
Sbjct: 231 SGLDAMVGKIVEKLIENGMMNNTYVIFVSDNGA-----DPPEGLNTPFRGGKSSLFEGGT 285
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
+ + I+SP LK+T+ + L HI+DWLPTL AG ++ D
Sbjct: 286 KSNSFIYSPLLKKTEYENDGLMHITDWLPTLVRLAGGQVMD 326
>gi|410446533|ref|ZP_11300636.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980205|gb|EKO36956.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 517
Score = 247 bits (631), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 152/379 (40%), Positives = 215/379 (56%), Gaps = 39/379 (10%)
Query: 14 TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
L+F F++ + KP++IIIL DDLGW DVS+HG IPTPNIDALA NG+ LN+ Y
Sbjct: 3 VLIFLILFVSASIYSKPNVIIILTDDLGWGDVSYHGG-HIPTPNIDALAKNGVELNRFYA 61
Query: 74 QALCTPSRSALMTGKYPIHIGMQHGVIL-----EGEPWGLPLTEKLLPQYLKEAGYATHA 128
+C+P+R++L+TG +HI HG+I E +GLP+ K++PQ+ KEAGY T
Sbjct: 62 SPVCSPTRASLLTG---LHI-FNHGIIRPLANPTAEQYGLPVDLKIMPQFFKEAGYQTAL 117
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
GKWHLG E Y PT RGFD YG+ G Y+DH + LD N + +
Sbjct: 118 SGKWHLGMHLEEYWPTNRGFDQSYGHMLGGIGYFDHVHSSR------LDWHRNEEPLFED 171
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
YST+L A+ +I + ++P+FLY+A A H P QAPD+ + F I DP
Sbjct: 172 --GYSTELIANEAVRIIETKDPNRPLFLYVAFNAPHT-----PIQAPDKNIELFSYIEDP 224
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-----------GIHS 297
R YA V+ LD +G +I A+ G+LE +I++F +DNG P F G+
Sbjct: 225 LDRAYAANVNALDAEIGKIIYAIEDQGILEETIIVFFSDNG-PVFDINPIIAVLAPGLVE 283
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
+GS L G K++ +GG+R A+I W L ++ S + + D LPTL +AA IE+
Sbjct: 284 ARGSTAGLPGSKTSAREGGIRVPASIMWKGVLDNSK--SDQYIFVQDLLPTLLSAAEIEV 341
Query: 357 NDTSL-DGVNQWDVLTKGA 374
+D + DG ++W L GA
Sbjct: 342 SDLKVFDGTDKWQNLVTGA 360
>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 506
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 157/441 (35%), Positives = 236/441 (53%), Gaps = 43/441 (9%)
Query: 11 LTCTLLFNDAFLNTT-APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
L C++L ++ + K+PHI++ILA+DLGWNDV +H + + P ++ LA +G+I N
Sbjct: 3 LACSILVALVCVSVVKSDKQPHIVLILAEDLGWNDVGWH-NPDLKMPILNQLAADGVIFN 61
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
Q YVQ CTPSRSAL TG YP I QH ++L E GL L K LP+ LK+ GY TH +
Sbjct: 62 QSYVQPACTPSRSALFTGYYPFKIKRQHQMLLNLEADGLSLDLKTLPEMLKDVGYLTHLV 121
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
GKWHLGF +E Y P RGFDS YG+ D Y + + P G D R N V ++
Sbjct: 122 GKWHLGFCKEEYLPNKRGFDSFYGWLTLGTDLY--TKENVLAP--GYDFRDNTGVVQES- 176
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
Y + E A++++ H K P+FL + +A+ + + P + + DI D
Sbjct: 177 DTYLPFMLAERAVDIVMGHYKEYPLFLEFS-MALPG----KFLEVPQDYEDLYSDIDDDR 231
Query: 250 RRTYAGM-----------------VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
R + G+ +S +D S+G ++ AL+ GM ++++ +F +DNGA
Sbjct: 232 TRKFYGICNLNIVVSKNTKLPNRKLSAMDNSIGLLVEALKTRGMWDDTLFIFTSDNGA-- 289
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
+ GSN PLRG +T ++GG R + LK+ V++EL HI D T+ A
Sbjct: 290 --FATESGSNWPLRGNIATLFEGGTRVPTFFYGSMLKKAGYVNNELIHIVDLHKTIIELA 347
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN- 411
G ++ +DG++ W ++KG + R+E ++NID DN AA+RV D K + G +
Sbjct: 348 G-ATTESDIDGMDMWKTISKGEPSPRTEFVYNID--DNELSPGAAIRVGDYKLITGHPDL 404
Query: 412 ------NGQSDEWYGDTDNEI 426
+SD+WY D I
Sbjct: 405 MYPFRLLNRSDDWYNYGDEPI 425
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/59 (38%), Positives = 36/59 (61%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
LFNI DDP E+NNLA+ + +++L +L Y+ +VPP ++ FD D + +W P
Sbjct: 445 LFNIIDDPEERNNLADVYPNKVEELRHRLDEYREHLVPPESRIFDPSGDAVNFGGVWSP 503
>gi|406832341|ref|ZP_11091935.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 490
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 159/404 (39%), Positives = 220/404 (54%), Gaps = 42/404 (10%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
K+F + T L +F +A ++P+I++I+ADDLG D+ + GS +I TP+IDALA G
Sbjct: 5 KWFLPIFVTTL---SFGAASAAERPNIVVIVADDLGNADLGYRGS-KIKTPHIDALAKGG 60
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ L +Y LCTP+R+ALMTG+YP+ G+Q VI +GLP EK LPQ LKE GY
Sbjct: 61 VRLESYYGLPLCTPARAALMTGRYPMRQGLQTLVIFPSHRYGLPTDEKTLPQALKEVGYH 120
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
T +GKWHLG + + P RGFD YG G DY+ T E +D + N +
Sbjct: 121 TAMVGKWHLGHADKKFWPQNRGFDHFYGNVVGEVDYF------TRERGGVVDWQRNGEFL 174
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
+ Y DL A+ +IA H+K+KP+FLY A LA HA P+QAP ++ + DI
Sbjct: 175 RED--GYYVDLIGTEAVKLIAGHDKAKPLFLYFASLAPHA-----PYQAPKADIDAYNDI 227
Query: 246 SDPE-RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG---- 300
D E RTYAGM+S LD VG V+AAL +GM EN+I+ F DNG + + +
Sbjct: 228 FDNEMHRTYAGMISNLDRQVGRVVAALETNGMRENTIIFFTTDNGGATSALFATGARAPE 287
Query: 301 ----------------SNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISD 343
SN P G K + DGG+ A + W LK + V +E H D
Sbjct: 288 EREASGGVALGSKPPCSNLPFSGGKGSLKDGGVHLPAIVNWPGRLKPS--VVNEPLHHVD 345
Query: 344 WLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
+PTL A AG + + + DG + W L +G + EIL N++
Sbjct: 346 LMPTLLALAGGKGDPSKPFDGKDAWGTLAEGKPSPHEEILINVE 389
>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 495
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 173/526 (32%), Positives = 266/526 (50%), Gaps = 79/526 (15%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHII I+ DDLGWNDV ++ + TP ID LA NG+ L +YV + C PSR+ LMTG
Sbjct: 21 KPPHIIFIVVDDLGWNDVGYN-NPVFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTG 79
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
++ I +G+ + P LPL E + + LK AGY+ H +GKWH G++ + P RG
Sbjct: 80 RHAIQLGIPNDG-FGYHPRSLPLDETTIAEPLKHAGYSNHIVGKWHCGYYADNCLPHNRG 138
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAAI 202
FD+ +G+ D+Y HS + HN++ + K IG YST ++
Sbjct: 139 FDTFFGFVGAGIDHYTHS--------DHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGK 190
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLD 261
++I H+++KP+FLYL+ AVHA P + P + ++ I D +RRTYA M S +D
Sbjct: 191 DIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRTYAAMTSCVD 245
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
E++ N+ AL + ML+NSI++F +DNG + ++ G+N PLRG K T W+GG+R V
Sbjct: 246 EAIANITEALEEKNMLKNSIIVFTSDNGG---ALVNSAGNNWPLRGGKHTSWEGGVRAVG 302
Query: 322 AIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
++ L + + + + L HI+DW PTL AG + L GV+QW+++T K+ R
Sbjct: 303 FVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHVGKKLYGVDQWNMITGEDKSGR 362
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
++L ++ DN K A R K +N D + +
Sbjct: 363 DDVLITLEE-DN--KAGAPYRTGIYK------------------NNAFD-------ITTH 394
Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALR--TILTDEKILELREFARVRCNYDNKGAH 496
A I K K P SD ++ I +D +I+E R+F
Sbjct: 395 AAIKMG--------KWKLLTGPIGGPSDWVKPQEINSDHRIVEDRDF------------- 433
Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
++T L+NI DD E +L+E ++ ++ KL + VP
Sbjct: 434 -DATRMVRLYNIEDDQTETTDLSEINQHIVIEMLAKLHNFSKQSVP 478
>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
Length = 345
Score = 246 bits (628), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 194/352 (55%), Gaps = 18/352 (5%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F T L+ N L +KP+ + I+ADDLG++D+ I TPN+DALA G+I
Sbjct: 4 LFLFTGVLVTNAYLLQKDKVRKPNFVFIMADDLGYHDIGLRNPDVI-TPNLDALASKGVI 62
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +YVQALCTPSR ALM+G+YP MQ VI E L K LPQYLKE GY H
Sbjct: 63 LTNNYVQALCTPSRHALMSGRYPSASAMQSMVIQPMEARCAGLEYKFLPQYLKELGYKNH 122
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLG+ R+ PT RGFD+ YG + G DY+ H E + D N + +
Sbjct: 123 MVGKWHLGYCRDECLPTSRGFDTFYGLYAGAGDYWSH------EIFGKYDWHINGDIHFE 176
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDIS 246
G +S DL E V EH+ P+FLY A N + P Q DE + + D
Sbjct: 177 ANGTHSQDLEMEGLDKVFDEHDSKDPLFLYFA-----PQNPHTPSQPTDEFLNLYPEDKF 231
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
D RR Y G+ S LD VG ++ L +G+++N+ V+F++DNGA +G N P R
Sbjct: 232 DDIRRKYLGLTSGLDAMVGKIVEKLENNGLMDNTYVIFVSDNGA-----DPPEGLNTPFR 286
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
G K + ++GG + + ++SP L +T + L HI+DWLPTL AG ++ D
Sbjct: 287 GGKGSLFEGGTKSNSFVYSPLLNKTGYENDGLMHITDWLPTLVKLAGGKVMD 338
>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
Length = 581
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 145/382 (37%), Positives = 210/382 (54%), Gaps = 32/382 (8%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
F KKPH+I +L DDLG++D+ + I +PNIDALA + L L +HYVQ CTP
Sbjct: 14 TFSQKVLEKKPHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTP 72
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
SR+A +TG+Y I +GMQ GVI EP G+PL E LL + K+ GY T GKWHLGF+
Sbjct: 73 SRAAFLTGRYNIRMGMQSGVIRATEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTY 132
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-------GIY 192
+ P RGFD YG++ G QD+Y H + Y G N V+N TI G Y
Sbjct: 133 KHCPQIRGFDRFYGFYLGSQDFYFHDS-GRLKAYPG-----NGDVENDTILDDLHTNGTY 186
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP---E 249
ST L+ + IN +A+H+ + P+F Y++ VH P Q ++ + + +D E
Sbjct: 187 STKLFVDDFINDLAKHDPAVPLFNYVSFQDVHG-----PLQTVNKFKKHYENKTDTWTHE 241
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
R + ++ LD +G ++ AL++ N++++F +DNG G SN PLRG K
Sbjct: 242 RILISTKITTLDHHIGKMVTALKEKNYWNNTVLVFTSDNG----GQPKEGASNWPLRGSK 297
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEI---NDTSLDGV 364
T +DGG++ A I SP L+ K + LFH+SDW PTL +G ++ LDG
Sbjct: 298 GTIYDGGLKSRAFIASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSGCDVPRKEGRPLDGK 357
Query: 365 NQWDVLTKGAKTKRSEILHNID 386
++ K R+++LH +D
Sbjct: 358 IH-ELFVYKPKKARNKMLHYLD 378
>gi|313236221|emb|CBY11544.1| unnamed protein product [Oikopleura dioica]
Length = 511
Score = 246 bits (627), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 145/392 (36%), Positives = 217/392 (55%), Gaps = 35/392 (8%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
Y F L+ N A ++P+I++I+ADDLG +DVS HG+ I TPN+D LA +G+
Sbjct: 6 YIFGLSA---------NIFAEQQPNIVVIVADDLGKHDVSMHGAD-IYTPNLDMLARDGV 55
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
+LN +YVQ +C+P+R +LMTG+YP +G+QH ++ P GLPL E ++PQY+KE GY T
Sbjct: 56 LLNNYYVQPVCSPTRGSLMTGRYPYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKT 115
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ---GLDMRHNMQ 183
+ +GKW LGFF++ Y P RGFD +G G QDYY Y G D+R
Sbjct: 116 YMVGKWQLGFFKDNYLPWKRGFDEFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQG 175
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
T Y LY + A HNK+ P+++Y+A +VH P QAP + V+ +
Sbjct: 176 PVRDTSMKYQPFLYADKAREKFFAHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYN 230
Query: 244 D---ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
D +S+ + R +AGMV+ LD +VG ++ L+ G+ N+++ F DNG
Sbjct: 231 DNVQMSNGQ-RVFAGMVTALDAAVGEIVDGLKDSGLWNNTLIYFSNDNG-------REDD 282
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIE-IN 357
S+ L+G K+ W GG+ G P + K K ++L H+SD LPT+ +AA +
Sbjct: 283 SDFVLKGGKNDFWQGGITGYGFFSGPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLP 342
Query: 358 DTSLDGVNQWDVL---TKGAKTKRSEILHNID 386
+DG + W L + G KR EI+ ++
Sbjct: 343 PKVIDGHDLWSRLQGDSLGQDEKRKEIIMQMN 374
>gi|313215020|emb|CBY41206.1| unnamed protein product [Oikopleura dioica]
Length = 427
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 142/376 (37%), Positives = 213/376 (56%), Gaps = 26/376 (6%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
N A ++P+I++I+ADDLG +DVS HG+ I TPN+D LA +G++LN +YVQ +C+P+R
Sbjct: 13 NIFAEQQPNIVVIVADDLGKHDVSMHGAD-IYTPNLDMLARDGVLLNNYYVQPVCSPTRG 71
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+LMTG+YP +G+QH ++ P GLPL E ++PQY+KE GY T+ +GKW LGFF++ Y
Sbjct: 72 SLMTGRYPYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYL 131
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ---GLDMRHNMQVDNKTIGIYSTDLYTE 199
P RGFD +G G QDYY Y G D+R T Y LY +
Sbjct: 132 PWKRGFDEFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRDTSMKYQPFLYAD 191
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD---ISDPERRTYAGM 256
A HNK+ P+++Y+A +VH P QAP + V+ + D +S+ + R +AGM
Sbjct: 192 KAREKFFAHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQMSNGQ-RVFAGM 245
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V+ LD +VG ++ L+ G+ N+++ F DNG S+ L+G K+ W GG
Sbjct: 246 VTALDAAVGEIVDGLKDSGLWNNTLIYFSNDNG-------REDDSDFVLKGGKNDFWQGG 298
Query: 317 MRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL--- 370
+ G P + K K ++L H+SD LPT+ +AA + +DG + W L
Sbjct: 299 ITGYGFFSGPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLPPKVIDGHDLWSRLQGD 358
Query: 371 TKGAKTKRSEILHNID 386
+ G KR EI+ ++
Sbjct: 359 SLGQDEKRKEIIMQMN 374
>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
Length = 526
Score = 243 bits (621), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 153/411 (37%), Positives = 224/411 (54%), Gaps = 30/411 (7%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
N + ++P+I+IILADDLGW DV +HGS I TP+ID LA G LN+ Y C+P+R+
Sbjct: 44 NNKSDERPNIVIILADDLGWGDVGYHGS-DIQTPHIDRLAKEGAKLNRFYATPFCSPTRA 102
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
ALMTG+ P+ +G+ + V++ E G+ L E LPQ + AGY T +GKWHLG E +T
Sbjct: 103 ALMTGRDPLKLGVAYSVLMPWENGGVSLDEHFLPQSFQAAGYNTAMVGKWHLGHTIEQHT 162
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAA 201
P RGFD YG+ Y+DH G D + N + VD+ G Y+TD++ A
Sbjct: 163 PNARGFDLFYGHMHTQVSYFDHQIA------NGHDFQENGKPVDHN--GEYATDVHGAQA 214
Query: 202 INVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF---LDISDPERRTYAGMV 257
I + +K+KP LY+ LA H+ P +AP+ K+ LD+ R+TYA MV
Sbjct: 215 ARFITDLRDKTKPFLLYVPFLAPHS-----PMEAPEALKDKYSSRLDLPGSTRKTYAAMV 269
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
+D+++G V+ AL + G+ +N+IV F +DNG G + N P RG K ++GG+
Sbjct: 270 DSMDQAIGKVLTALDEEGVADNTIVFFFSDNG----GFENYGSDNGPYRGGKLEVYEGGI 325
Query: 318 RGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAK 375
R A + W L +V E+ + D LPTL AAG+E + +DGVN+W +T
Sbjct: 326 RVTAVMRWPEKLAAGSEV-DEIVSVMDLLPTLTHAAGVENGTEKKIDGVNRWSTITGEKV 384
Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
+KR L+ N+ K+ + K V D+ S D DN++
Sbjct: 385 SKRKGALYFASNIPVYNKFQLGVIDGKWKLVQNIDHKMTST----DFDNQL 431
>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
Length = 330
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 195/341 (57%), Gaps = 27/341 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
KKP+ + ILADDLG+ND+ I TPN+DALA G+IL +YVQA+C+PSR ALMTG
Sbjct: 1 KKPNFVFILADDLGYNDLGLRDPDVI-TPNMDALASKGVILTNNYVQAVCSPSRHALMTG 59
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+YP MQ VI E L K LPQYLK+ GY H IGKWHLG+ RE PT RG
Sbjct: 60 RYPSASAMQSIVIQPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRG 119
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ--GLDMRH---NMQVDNKTIGIYSTDLYTEAAI 202
FD+ YG + DY++H ++ + G+D + QV ++ IY DL E
Sbjct: 120 FDTFYGLYASSGDYWEHGIMGMYDWHTEAGVDFEARGTHAQVGSRYWHIY--DLEMERLD 177
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-----KFLDISDPERRTYAGMV 257
V EH+ P+FLY A N++ P Q DE + KF DI RR Y G+
Sbjct: 178 KVFDEHDNKDPLFLYFA-----PQNSHTPHQPTDEFINLYSEDKFSDI----RRKYLGLT 228
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
S LD VG ++ L ++ M+ N+ V+F++DNGA +G N P RG K + ++GG
Sbjct: 229 SGLDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPI-----EGRNAPFRGGKGSLFEGGT 283
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
+ + I+SP LK+T+ + L HI+DWLPTL AG ++ D
Sbjct: 284 KSSSFIYSPLLKKTEYENDGLMHITDWLPTLVKLAGGQVKD 324
>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
Length = 548
Score = 242 bits (618), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 146/396 (36%), Positives = 218/396 (55%), Gaps = 26/396 (6%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+II ADD G++D + +S I TPNID L +G+ Y C+PSRS+ ++G+
Sbjct: 28 KPNIIWFFADDYGYHDAGYR-NSDIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGR 86
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y GMQHGVI + + L L YLKE Y THA GKWHLG+ + TPT+RGF
Sbjct: 87 YAYTSGMQHGVISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGF 146
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGL-DMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
D+ G + G YY+H+ +QG+ D + ++D GI+S DL +N + E
Sbjct: 147 DTFSGGYSGEGKYYEHTT------FQGMYDWHNGTKLDRSASGIHSQDLIEADLLNQLDE 200
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVGN 266
N P+F Y A H NT+ P Q E A + D+ P R+ Y G+VS +D +G
Sbjct: 201 -NDGTPLFYY---AAFH--NTHSPMQPKPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGK 254
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V+A L++ M +N+ + F +DNG +H +G N P RG KS+ +DGG + SP
Sbjct: 255 VVAKLKEKDMFDNTYIFFSSDNGGD---VH--EGDNSPKRGAKSSLFDGGCHAHSWASSP 309
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
LK+ + L HI+DWLPT+ AG E+ ++ +DG+ Q D++T +KR+++++NI
Sbjct: 310 LLKKKGVEADGLLHITDWLPTIVKLAGGEVPESDGIDGIEQVDLITNAGPSKRTDMIYNI 369
Query: 386 D---NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
D + P + A+ DL+Y G+SD +
Sbjct: 370 DMELEIGTPD--FGAMAARDLRYKLIWGFEGRSDGY 403
>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 658
Score = 242 bits (618), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 152/409 (37%), Positives = 223/409 (54%), Gaps = 43/409 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+P+I+ +LADDLGW+DV +HGS I TPNID LA G+ L +YV + C PSR L+TG
Sbjct: 202 KRPNIVFVLADDLGWHDVGYHGSI-IDTPNIDHLAAEGVKLENYYVSSWCAPSRVNLLTG 260
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF--REVYTPTF 145
+Y I G+ +G + + + + E L L EAGY T +GKWHL F RE Y P
Sbjct: 261 RYRIRTGL-YGDVCD----FMGIHEITLADKLYEAGYYTAMVGKWHLSGFQHRECY-PAH 314
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
RGF + G+ G Q+Y+ H + Y ++ + + G YST +Y E A +I
Sbjct: 315 RGFQTFLGFHGGSQNYFTHRRGGSNSEYDF--WANDTSIGREYDGRYSTMVYAEEAQRII 372
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYAGMVSRLDESV 264
H +P+FLYL+ AVH+ P P K+ I D +RR YA M S +DE++
Sbjct: 373 RHHRTEQPLFLYLSFQAVHS-----PLLVPSAYEDKYRTGIEDDKRRVYAAMTSCMDEAI 427
Query: 265 GNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
GNV L++ G+ ++ +F +DNG P+FG GSN PLRG K ++GG+R V +
Sbjct: 428 GNVTDTLKETGLWNKTVFIFSSDNGGTPTFG-----GSNWPLRGRKGLWYEGGIRAVGFV 482
Query: 324 WSPWLKQ--TQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
S L T + EL HI+DW PTL A +++ LDG++QW+ +++G KT R E
Sbjct: 483 NSYLLSNHVTGSTNHELIHITDWFPTLLRLAKTDLHRLKPLDGIDQWETISQGKKTNRKE 542
Query: 381 ILHNI------DNVDNPQKY-----------YAALRVDDLKYVAGTDNN 412
+L N N++ ++ +AA+RV D K + G ++
Sbjct: 543 VLLNFLPGNISGNINENREGEATADYFDTMPFAAIRVGDWKLLTGNKDS 591
>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 355
Score = 241 bits (615), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 139/354 (39%), Positives = 207/354 (58%), Gaps = 29/354 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K PHII I+ DDLGWNDV ++ + TP ID LA NG+ L +YV + C PSR+ LMTG
Sbjct: 21 KPPHIIFIVVDDLGWNDVGYN-NPVFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTG 79
Query: 88 KYPIHIGM-QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
++ I +G+ Q G P LPL E + + LK AGY+TH +GKWH G++ + P R
Sbjct: 80 RHAIQLGIPQDG--FGYHPRSLPLDETTIAEPLKHAGYSTHIVGKWHCGYYADNCLPHNR 137
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAA 201
GFD+ +G+ D+Y HS + HN++ + K IG YST ++
Sbjct: 138 GFDTFFGFVGAGIDHYTHS--------DHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEG 189
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRL 260
++I H+++KP+FLYL+ AVHA P + P + ++ I D +RRTYA M S +
Sbjct: 190 KDIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRTYAAMTSCV 244
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DE++ N+ AL + ML+NSI++F +DNG + ++ G+N PLRG K T W+GG+R V
Sbjct: 245 DEAIANITEALEEKNMLKNSIIVFTSDNGG---ALVNSAGNNWPLRGGKHTSWEGGVRAV 301
Query: 321 AAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
++ L + + + + L HI+DW PTL AG + L GV+QW+++T
Sbjct: 302 GFVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHVGKKLYGVDQWNMIT 355
>gi|241176208|ref|XP_002399501.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215495150|gb|EEC04791.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 418
Score = 241 bits (615), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 164/480 (34%), Positives = 230/480 (47%), Gaps = 72/480 (15%)
Query: 94 GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG 153
G+QH I EGE WGLPL L+PQ+ K+ GYATH +GK +G F T R H
Sbjct: 1 GLQHLTIEEGEAWGLPLRFTLMPQHFKKLGYATHMVGK--VGQFIISDTSCRRYVKRH-- 56
Query: 154 YWQGLQDYYDHSCK---ATFEP----YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
Q CK EP + GLD+ N G Y TD+ T+ A+++IA
Sbjct: 57 ----TQKSCSQRCKRERKVKEPCGGDHIGLDLWDNEVAVRTECGNYDTDMLTDKAVSIIA 112
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ +KP+FLYL+H + H+ T P QAP++ +AKF I + R YAGMV +D++ G
Sbjct: 113 SHDTTKPLFLYLSHRSAHSAFTSAPLQAPEKNIAKFGYIGERNRTIYAGMVDSMDDAFGR 172
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
++ AL G+L N+IV+F +DNGA N+G N PLRG K T W+G +R A IWSP
Sbjct: 173 LVEALSGAGVLNNTIVVFSSDNGAVPIIEFPNRGFNWPLRGAKKTLWEGAVRVPAFIWSP 232
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
L+ + +VS ++ HI DWLPT + AG ++ DG + W L++G K+ R E+L NI
Sbjct: 233 LLESSGRVSDQMMHIVDWLPTFYSVAGGNLSHLGEQDGFDMWKALSEGTKSPREEMLLNI 292
Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNA 445
D V N ++LR + K V GT +NG D + T V + + +
Sbjct: 293 DPVLN----TSSLRYKNHKVVLGTYHNGSFDHRFKTTGRRPGWRRRATVRCGECNLAKDN 348
Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCL 505
+ K +S L ++ D PC
Sbjct: 349 V--------------KPSSPPYLFDVVAD----------------------------PCE 366
Query: 506 FNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK-PFDKRADPARWNNIWVPW 564
N NLA ++L+QL +L Y +T V P N D R+ P R N W PW
Sbjct: 367 LN---------NLAPEHPEILRQLLKRLEDYATTAVDPENVYETDARSFPERHNGTWAPW 417
>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
Length = 558
Score = 241 bits (614), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 143/402 (35%), Positives = 216/402 (53%), Gaps = 25/402 (6%)
Query: 29 KPHIIIILADDL-----GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
KP++I ADD G+ D + S+ I TPNID L +G+ Y C+PSRS+
Sbjct: 65 KPNVIWFFADDYDLALPGFQDAGYRNSA-IHTPNIDKLVGDGISFTNAYSSQQCSPSRSS 123
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
++G+YP GMQHGVI + P + L K L YLK+ Y THA+GKWHLG+ + TP
Sbjct: 124 FLSGRYPYKSGMQHGVISDEGPNCMDLKFKFLSDYLKDLNYNTHAVGKWHLGYCNKKCTP 183
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
T+RGFD+ G + G DYY+H+ + D + + D G++S DL EA +
Sbjct: 184 TYRGFDTFSGGYSGEGDYYEHTTHGGW-----YDWHNGTETDLTAAGVHSQDL-VEADMM 237
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-SDPERRTYAGMVSRLDE 262
+ N P F Y A H NT+ P Q E A + D+ + P R+ Y G+VS +D
Sbjct: 238 KNLDENDGTPFFYY---TAFH--NTHSPLQPKPEHKALYNDMDATPSRKKYLGLVSGMDA 292
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
+GNV+A L+ GM +N+ + F +DNG N+G N P RG KS+ ++GG +
Sbjct: 293 VIGNVVAKLKAKGMFDNTYIFFSSDNGGD-----VNEGDNSPWRGAKSSLFEGGCHAHSW 347
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
SP +++ + L H++DWLPT+ AG + D +DG+ Q D++T +KR+ ++
Sbjct: 348 ASSPLIRKKGVEADGLLHMTDWLPTILTLAGGTVPD-GIDGIEQVDLITNAGLSKRTNMI 406
Query: 383 HNID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
+NID ++ + + DL+Y G+SD + D D
Sbjct: 407 YNIDMELETGPPNFGCMAARDLRYKLIWGFEGRSDGYGVDAD 448
>gi|313219945|emb|CBY43646.1| unnamed protein product [Oikopleura dioica]
Length = 1683
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 158/507 (31%), Positives = 247/507 (48%), Gaps = 64/507 (12%)
Query: 61 LAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK 120
L L L +YV +CTP+RS LM+G+Y IH G+QH VIL +P GLPLT+ + P+ K
Sbjct: 1226 LIQESLHLQSYYVNPICTPTRSVLMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFK 1285
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
+ GY TH +GKWHLGF+ E P RGF+SHYGY G + +Y+HS + G+D R
Sbjct: 1286 DCGYRTHMVGKWHLGFYDEKMVPENRGFESHYGYLIGAEGHYNHS--QFMQGQHGVDFRD 1343
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
N + G YS DL+ + +++ H+ + +++Y+ VH P +AP V
Sbjct: 1344 GGASTNSSWGQYSADLFVKRVEDLVEAHDVEESLYMYVGLQNVHY-----PLEAPQHYVD 1398
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
+F I D +RR YA M +D++VG +I + + G+ E++IV F DNG + G
Sbjct: 1399 QFSWIKDRDRRVYAAMTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTL----YGG 1454
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDT 359
+N PLRG+K+T W+GG+RG+ +I P + ++ +L H+ D PTL
Sbjct: 1455 NNWPLRGLKNTLWEGGIRGIGSIKIPGVSADKR--DQLMHVVDMFPTLLDLTSCPNTRKV 1512
Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
DG +Q +L +++ L N VD L+ G DN W+
Sbjct: 1513 KFDGKSQAKMLVDNDESQSDSFLIN---------------VDPLRKNKGADNR----TWH 1553
Query: 420 GDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT--ILTDEKI 477
+ D + +AG+ + K A SSD T I E
Sbjct: 1554 SEFDVRV-----------QAGLRWQQWKLLT----------GAPSSDNYPTGHIYPPEWP 1592
Query: 478 LELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
+ R+ + + N++ +S + LF+I +DP E ++ T++++++ + L+ Y
Sbjct: 1593 QDGRKNS--QSNFE------SSNLNVRLFDIDNDPLEDFEISNQHTEIVEKMLNMLSSYN 1644
Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW 564
ST VP D P + W PW
Sbjct: 1645 STAVPCRWPNPDPAGLPDQNGGFWKPW 1671
>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 635
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 144/383 (37%), Positives = 213/383 (55%), Gaps = 35/383 (9%)
Query: 30 PHIIIILADDLGWNDVSF----HGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
PH+++ILADD+G+ND+ + H ++Q+ TP +D LA G+ L ++Y Q C+PSR AL+
Sbjct: 118 PHLVLILADDMGYNDIGYNRAPHQTNQVSTPFLDELASEGVTLTRYYTQCDCSPSRGALL 177
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG YP G+ HGVI+ WGLPL L+PQ+L Y +HAIGKW +G + + PT
Sbjct: 178 TGLYPASTGLYHGVIVTQSHWGLPLEYHLIPQFLPSR-YRSHAIGKWDVGHYTWNHVPTG 236
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGL--------DMRHNMQVDNKTIG-----IY 192
RGF S+ G++ DYY H A Y D ++ D G Y
Sbjct: 237 RGFHSYVGFYGTDIDYYTHEIGAGCNSYNCSSAIKRCMNDSITDLNYDGAATGDEYYNRY 296
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV----AKFLDISDP 248
STD++T+ A+ ++ + P+FLY+A AVH P +A D + A FL +
Sbjct: 297 STDIFTDRAVELLRTESARNPLFLYVAFNAVHG-----PLEADDATIDRFNASFLYEART 351
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
ER T+A +D +V +++ + G +NSIVLF +DNGA + GSN PLRG
Sbjct: 352 ERATFAAAALLMDRAVESIVKTMHAVGAYDNSIVLFASDNGATL--AQTGGGSNWPLRGS 409
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTSLDGVNQ 366
K TP++GG+R A + SP L ++ + LFH++D LPTL AAG + +DGVNQ
Sbjct: 410 KFTPYEGGVRVPAFLHSPLLGSGRRGITHAGLFHVTDILPTLVHAAGGS-PPSDIDGVNQ 468
Query: 367 WDVLTKG---AKTKRSEILHNID 386
+ L A + R+++L ++D
Sbjct: 469 YHALFGSDATASSPRADVLVHVD 491
>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 614
Score = 240 bits (613), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 151/438 (34%), Positives = 230/438 (52%), Gaps = 45/438 (10%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCT 78
A L A +PH++ ++ DD+G+NDV + S TP +D L +G+ +++ Y Q +CT
Sbjct: 53 AKLAAAAETRPHVVFVMLDDVGFNDVGYASSDLGEMTPFLDGLMADGVRVDRLYGQQVCT 112
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
PSR+A++TGK PIH+ +QH + EPWGLP E L QYLK GY+TH +GKWHLG +
Sbjct: 113 PSRAAMLTGKLPIHLELQHWQVAPSEPWGLPTREATLAQYLKALGYSTHMVGKWHLGHYN 172
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
TP RGFDS YG++ G DY H + D+ + + G + T L
Sbjct: 173 NASTPLNRGFDSFYGFYSGGVDYLTHDPSTGYVWRCYRDLWDDERPVTDAHGQHQTSLMN 232
Query: 199 EAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV----AKFLDISDPERR 251
E AI V+ H KS+P+F Y+++ N + P Q P E + A LDI + +R+
Sbjct: 233 ERAIAVLERHAVEKKSEPVFAYVSY-----PNAHLPLQPPTELLERRNATLLDIPNHDRK 287
Query: 252 TYAGMVSRLDESVGNVI-AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
+A ++ DES+ N+ L+ G+ + ++++ +DNG + GSN+PLRG K
Sbjct: 288 NFAALMMNADESLANLTNRGLKATGLYDTAVIVIASDNGGQVSAM--GGGSNYPLRGEKK 345
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAA---------------G 353
++GG+R A I SP L + + SS +L H+SDWLPTL A G
Sbjct: 346 YLFEGGVRAHAVIHSPLLPRAARGSSYGKLMHMSDWLPTLLQGALGASADELAGLKLPRG 405
Query: 354 IEINDTSLDGVNQWDVLTKGAKTK----RSEILHNIDNVDNPQKYY----AALRVDDLKY 405
+ + ++DGV+ W + R E+L NID +D Y AAL + D K+
Sbjct: 406 VVNDYGAIDGVDHWAHVVGAVDAAAAAPRDELLLNIDYLDGSGDYLGYFRAALIIGDWKF 465
Query: 406 VAGTDNNGQSDEWYGDTD 423
+ N ++ W+ +D
Sbjct: 466 I----YNEKNISWFAPSD 479
>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
Length = 382
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 141/388 (36%), Positives = 214/388 (55%), Gaps = 27/388 (6%)
Query: 3 WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
++R FF + D+ P+II I DD G+NDV F + I +PN+DALA
Sbjct: 2 FSRSNFFPAAANIKRGDS--------PPNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALA 52
Query: 63 YNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
+G+IL Y C+PSR + ++G+Y MQHGVIL+ +P L L +LP YLKE
Sbjct: 53 QSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGVILDNKPQCLGLDYTILPGYLKEL 112
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
GY THA GKWHLG+ R+ TPT RGFDS G + G ++Y+H+ AT Y D
Sbjct: 113 GYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEGEFYEHT-TATGGYY---DWHLGT 168
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
+VD IG +S DL + E+++S P+F+Y+A VH+ P E +A +
Sbjct: 169 EVDYDAIGKHSEDLIGYYVNKTLDEYDQSSPLFMYVAFHNVHS-----PLDPKPEFLALY 223
Query: 243 LDI---SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
D+ +D ++G+VS +D +G ++ +++ G+ EN+ +LF +DNG +
Sbjct: 224 DDLDASTDYSGFFFSGLVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGG-----DVGE 278
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
G N P RG KST W+GG + + SP L + ++ L H++DWLPT+ AG + +
Sbjct: 279 GDNSPHRGGKSTLWEGGCKANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAGGSVPAS 338
Query: 360 -SLDGVNQWDVLTKGAKTKRSEILHNID 386
LDG Q D++ + R+ +++NID
Sbjct: 339 DGLDGFVQTDMVINNGDSARTTMIYNID 366
>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
Length = 452
Score = 237 bits (604), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 137/394 (34%), Positives = 218/394 (55%), Gaps = 43/394 (10%)
Query: 36 LADDLGWNDVSFHGSSQIPTPNIDALAY--NGLILNQHYVQALCTPSRSALMTGKYPIHI 93
+ADDLG++DV + S+ + TPN+ LA+ N +L+ YV LCTP+RSA MTG YP +
Sbjct: 1 MADDLGFSDVDWKDST-LHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRV 59
Query: 94 GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG 153
G Q+GV L EP G+P L + +++ Y+T+ +GKWHLG+ ++ + PT RGFD YG
Sbjct: 60 GTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYG 119
Query: 154 YWQGLQDYYDHSC----KATFEPYQGLDMRHNMQV-----DNKTIGIYSTDLYTEAAINV 204
++ Y++HS + +GLD+ + D G+YSTDL+T+ A++V
Sbjct: 120 FYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSV 179
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY--------AGM 256
+ HN SKP F++L++ AVH P Q + I + T+ M
Sbjct: 180 LDNHNNSKPFFMFLSYQAVHP-----PLQVSQQSKT----IGQGKEATFILRSHAHSTRM 230
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPLRGMKSTPWDG 315
++ +D ++G ++ L+ + EN++++F +DNG + FG SN PLRG K T W+G
Sbjct: 231 LTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTANFG-----ASNAPLRGEKDTIWEG 285
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA- 374
G + + SP + +FH+ DW T+ + G+EI D+ DG+NQW+ L G
Sbjct: 286 GTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILSITGLEI-DSYGDGINQWEYLKTGRP 344
Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
K +R + ++NIDN + +A+R D K + G
Sbjct: 345 KFRRFQFVYNIDN------HGSAIRDGDYKLIVG 372
>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 143/378 (37%), Positives = 208/378 (55%), Gaps = 23/378 (6%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
+F + F + KKPHII ++ DDLG++D+ + I +PNID LA N L + +Y Q
Sbjct: 9 FVFRECFGHGRFSKKPHIIFVMIDDLGFDDLGYVNDDVI-SPNIDFLAKNALHIENYYNQ 67
Query: 75 ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
CTPSR+A MTG+Y I GMQ GVI EP +PL+E LLPQ LK+ GY T GKWHL
Sbjct: 68 PSCTPSRAAFMTGRYNIRYGMQSGVIKPDEPEAIPLSETLLPQALKKCGYNTSMHGKWHL 127
Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
GF+ E + P RGFD +G++ G QDY+ H Y+ R +++D G YST
Sbjct: 128 GFYTEKHCPQNRGFDRFFGFYLGSQDYFYHDSGNNCYLYEPC-YREKVRLDLN--GTYST 184
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
E I + E++ P+F +L+ VH P Q E F + + Y
Sbjct: 185 KAIAEDFIAKLDEYDPETPLFEFLSFQEVHG-----PLQWLSEYSTLF---KEKDWTHYR 236
Query: 255 GMVSR----LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
M+SR +D +G V+ L++ G +++I++ +DNG G SN PLRG K
Sbjct: 237 KMLSRKIVIVDHFIGEVVKKLKEKGFWKDTILVITSDNG----GQTREGASNWPLRGRKG 292
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEINDTSLDGVNQWD 368
++GG+R A I SP L + K SS +FH++DW PTL +G + D++LDG Q D
Sbjct: 293 DVFEGGIRSRAFIHSPKLPNSLKGSSFPHVFHVTDWFPTLLRFSGCQQPDSNLDGEAQ-D 351
Query: 369 VLTKGAKTKRSEILHNID 386
+ + + KR+ +L+ +D
Sbjct: 352 IFHQKPEVKRTHLLNFLD 369
>gi|410932865|ref|XP_003979813.1| PREDICTED: arylsulfatase I-like, partial [Takifugu rubripes]
Length = 316
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 127/303 (41%), Positives = 181/303 (59%), Gaps = 14/303 (4%)
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LP + LPQ L+E GY+TH +GKWHLGF+++ PT RGFD+
Sbjct: 2 IHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDT 61
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
++G G +YY + SC G D+ V G YST LYT+ ++A H+
Sbjct: 62 YFGSLTGSVNYYTYDSCDG--PGMCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILATHD 119
Query: 210 -KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+S+P+F++L+ AVH P Q P E + + + + RR YA MVS +DE+V N+
Sbjct: 120 PRSQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRNIT 174
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
LRK+G ENSI++F DNG + GSN PLRG K T W+GG+RG+ I SP L
Sbjct: 175 YGLRKYGYYENSIMIFSTDNGGQPL----SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLL 230
Query: 329 KQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
++ ++VS L HI+DW PTL A G E + +DG + W+ ++ G ++ R EILHNID
Sbjct: 231 RKKKRVSKALVHITDWYPTLVGLAGGKESHIEGVDGYDVWEAISDGRESPRLEILHNIDP 290
Query: 388 VDN 390
+ N
Sbjct: 291 LYN 293
>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
Length = 496
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 148/412 (35%), Positives = 219/412 (53%), Gaps = 28/412 (6%)
Query: 13 CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
LLF A +KP+I+IILADDLG+ D+ F GS I TPN+D LA G++LN+ Y
Sbjct: 6 LILLFVAFISGAKASEKPNIVIILADDLGYADLGFRGS-DIQTPNLDRLAAEGIVLNRFY 64
Query: 73 VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
+CTP+RSALMT + PI +G + + E G+ E +P+ K+AGY T IGKW
Sbjct: 65 SLPICTPTRSALMTARDPIKLGTAYAGLQPWENGGVSPDEHFMPESFKKAGYQTAMIGKW 124
Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI--G 190
H+G E P RGFD +G+ DYY H+ H++Q + K++
Sbjct: 125 HIGRQYESLVPHHRGFDHFFGHLNTQVDYYTHASAGG----------HDLQENGKSLKRD 174
Query: 191 IYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF---LDIS 246
Y+TD++ + ++ + E + SKP LY+ LA H+ P QAP+ AK+ L+
Sbjct: 175 AYATDIHGDESVRYLKEIRDPSKPFLLYVPFLAPHS-----PMQAPEALEAKYKSRLNFP 229
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
P +RTYA MV +D+++G ++ L + G+ +N+I+LF +DNG G G N P R
Sbjct: 230 VP-KRTYAAMVDSMDQAIGRIMDELDRQGVSDNTIILFFSDNG----GFAGFGGVNKPYR 284
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
G K ++GG+R A + P +++ + D PTL AAA IE+ N+ +DG+N
Sbjct: 285 GGKLETFEGGIRVNAIMRWPGAIPAGSKTAQAISVMDVFPTLAAAADIEMHNEKKIDGIN 344
Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
LTK KR + L N + A+ D K V D+N +S E
Sbjct: 345 ILKALTKAKNIKRPKPLFFASNSPEYNSFRLAMIDGDHKLVQIIDHNNRSTE 396
>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
Length = 370
Score = 236 bits (602), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 133/353 (37%), Positives = 207/353 (58%), Gaps = 26/353 (7%)
Query: 41 GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
G++D+ F I TPNIDALA G+I +YVQ++CTPSR AL+TG+YP MQ+ VI
Sbjct: 10 GYHDLGFRNPDVI-TPNIDALATEGVIFTNNYVQSVCTPSRHALLTGRYPHRSAMQNLVI 68
Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
+ + L K LP+YLK+ GY+THA+GKWH+G+ RE PT RGFDS +G + G
Sbjct: 69 MSNQARCTGLGYKFLPEYLKDLGYSTHAVGKWHVGYCREECLPTHRGFDSFFGLYDGDGY 128
Query: 161 YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
Y++H+ +T P D ++ V + GI+S DL E ++ N +P FLY +
Sbjct: 129 YWNHT--STVIP-GAFDWNNSTGVYLEARGIHSEDLGAERLTAILDGQNAKEPFFLYFSP 185
Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPE------RRTYAGMVSRLDESVGNVIAALRKH 274
H P + A+FL++ PE RR Y G+VS LD VG ++ L+++
Sbjct: 186 QNPH---------TPSQPQAEFLNLY-PENRYPDIRRKYLGLVSGLDRMVGGIVDGLKRN 235
Query: 275 GMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKV 334
G++ N+ ++F++DNGA +G N P RG K + ++GG + + I+SP L +
Sbjct: 236 GLMNNTYIIFVSDNGA-----DPEEGLNDPFRGGKGSLFEGGTKSASFIYSPLLNKAGYE 290
Query: 335 SSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNID 386
++ L HI+DW+PT+ AG I + LDG++Q D++ G ++R +++++ID
Sbjct: 291 NNGLMHITDWMPTILKLAGGSIPEGEQLDGIDQSDMVKNGKASQRKKMVYSID 343
>gi|354488427|ref|XP_003506371.1| PREDICTED: arylsulfatase I [Cricetulus griseus]
Length = 560
Score = 235 bits (600), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 125/285 (43%), Positives = 177/285 (62%), Gaps = 14/285 (4%)
Query: 23 NTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
++ AP +P HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSR
Sbjct: 94 SSAAPSQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSR 152
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
S L+TG+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGFFR+
Sbjct: 153 SQLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKEC 212
Query: 142 TPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
P+ RGFD+ G G DYY + +C G D+ V G YST LY +
Sbjct: 213 LPSCRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGETVAWGLSGQYSTMLYAQR 270
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A +++A H+ P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +
Sbjct: 271 ASHILASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCM 325
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
DE+V N+ AL+++G NS+++F +DNG +F + GSN PL
Sbjct: 326 DEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPL 366
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 78/219 (35%), Gaps = 76/219 (34%)
Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKY 405
+ G + W + +G + R+EILHNID + N ++ AA+RV + K
Sbjct: 357 TFSGGSNWPLXXEGRASPRTEILHNIDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKL 416
Query: 406 VAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSS 465
+ G D YGD + P + L S G +N L I+Q
Sbjct: 417 LTG-------DPGYGD-------WIPPQTLASFPGSWWN-LARMASIRQAV--------- 452
Query: 466 DALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDL 525
LFNI+ DP E+ +LA + D+
Sbjct: 453 --------------------------------------WLFNISADPYEREDLAGQRPDV 474
Query: 526 LKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
++ L +LA Y T +P + RA P W PW
Sbjct: 475 VRTLLARLADYNRTAIPVRYPAANPRAHPDFNGGAWGPW 513
>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 235 bits (599), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 141/378 (37%), Positives = 206/378 (54%), Gaps = 23/378 (6%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
+F + F + KKPHII ++ DDLG++D+ + I +PNID LA N L + +Y Q
Sbjct: 9 FVFRECFGHGRFSKKPHIIFVMIDDLGFDDLGYVNDDVI-SPNIDFLAKNALHIENYYNQ 67
Query: 75 ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
CTPSR+A MTG+Y I GMQ GVI EP +PL+E LLPQ K+ GY T GKWHL
Sbjct: 68 PSCTPSRAAFMTGRYNIRYGMQSGVIKPDEPEAIPLSETLLPQAFKKCGYNTSMHGKWHL 127
Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKAT---FEPYQGLDMRHNMQVDNKTIGI 191
GF+ E + P RGFD +G++ G QDY+ H +EP +R ++ G
Sbjct: 128 GFYTEKHCPQNRGFDRFFGFYLGSQDYFYHDSGNNCYLYEPNGTEKVRLDLN------GT 181
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP-ER 250
YST E I + E++ P+F +L+ VH P Q EV K + ER
Sbjct: 182 YSTKAIAEDFIAKLDEYDPETPLFEFLSFQEVHG-----PLQWLPAEVLKMAGGWEGWER 236
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
+ + + +D +G V+ L++ G +++I++ +DNG G SN PLRG K
Sbjct: 237 KMLSRKIVIVDHFIGEVVKKLKEKGFWKDTILVITSDNG----GQTREGASNWPLRGRKG 292
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEINDTSLDGVNQWD 368
++GG+R A I SP L + K SS +FH++DW PTL +G + D++LDG Q D
Sbjct: 293 DVFEGGIRSRAFIHSPKLPDSLKGSSFQHVFHVTDWFPTLLRFSGCQQPDSNLDGKAQ-D 351
Query: 369 VLTKGAKTKRSEILHNID 386
+ + KR+ +L+ +D
Sbjct: 352 IFHNKPEVKRTHLLNFLD 369
>gi|241789348|ref|XP_002400616.1| sulfatase, putative [Ixodes scapularis]
gi|215510801|gb|EEC20254.1| sulfatase, putative [Ixodes scapularis]
Length = 224
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 117/226 (51%), Positives = 149/226 (65%), Gaps = 3/226 (1%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+II +LADDLGW DVSF G QIPTPN+D LA G+ILN +YV LCTPSR ALM+G Y
Sbjct: 2 PNIIFLLADDLGWADVSFRGDPQIPTPNLDVLASQGIILNNYYVLHLCTPSRGALMSGLY 61
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
PIH G+QH V L EP GLPL ++P++LK GY TH IGKW+LG+++E YTPT RGFD
Sbjct: 62 PIHTGLQHYVQLPAEPHGLPLNVTIMPEHLKNLGYTTHMIGKWNLGYYKESYTPTRRGFD 121
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
S YG+ G +DYYDH+ GLD YSTDL+T+ A+ +I +H+
Sbjct: 122 SFYGFLNGGEDYYDHTILFV---STGLDFWDGTTPVRNQSHHYSTDLFTKKALALIKDHD 178
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
++KPMFLY +H AVH+G+ +AP +A F I + R AG
Sbjct: 179 QAKPMFLYFSHQAVHSGDFKVELEAPASAIAHFPYIKEQNRSIIAG 224
>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 536
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 140/371 (37%), Positives = 202/371 (54%), Gaps = 19/371 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHG----SSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
+PH+ +I+ADDLG+ DVS++G ++ + TP +D LA +G+ L + Y Q CTP+R+AL
Sbjct: 32 RPHVFLIVADDLGYGDVSYNGDGSLTNAVATPYLDRLAADGITLTRFYSQCDCTPARAAL 91
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+TG+YP + GMQH V+ W LP LLP L E GY HAIGKW +G R TPT
Sbjct: 92 LTGRYPSNTGMQHEVVTAQSQWSLPHEFALLPSALPE-GYRKHAIGKWDVGHARAADTPT 150
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFDSH GY+ Y +H+ + DM H+ T YST L+ + A+ +
Sbjct: 151 ARGFDSHLGYYGAEITYDEHAALRSCSNGTIRDMNHDGATLAATEDRYSTHLFADHAMAL 210
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK----FLDISDPERRTYAGMVSRL 260
+ +FLYL AVH +P A V + FLD SD R T+A + L
Sbjct: 211 VDREADEYKLFLYLCFQAVH-----QPLAADAALVKRFADAFLDESDAARSTFAAVALEL 265
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D +V + + G ++++ FM+DNGA S GSN PLRG K T ++GG+R
Sbjct: 266 DRAVERFVEFTKARGAYDDAVFFFMSDNGATL--AQSGGGSNWPLRGSKFTAYEGGVRVP 323
Query: 321 AAIWSPWLKQTQKVSSE--LFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTK 377
A + S + ++ ++ LFH+ D LPTL A A + +DGV+Q D + GA
Sbjct: 324 AFLHSARVPPARRGAAHAGLFHVVDVLPTLAAFAESADRVPAGVDGVDQRDAVLHGAAAP 383
Query: 378 RSEILHNIDNV 388
R+E L + D++
Sbjct: 384 RTEALLHADDL 394
>gi|424863174|ref|ZP_18287087.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
gi|400757795|gb|EJP72006.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
Length = 519
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 141/376 (37%), Positives = 209/376 (55%), Gaps = 39/376 (10%)
Query: 13 CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
L+F+ + L++ ++P+IIII+ DDLGW DVS++G I TPNID LA +GL +N+ Y
Sbjct: 6 LILVFSISLLSS---ERPNIIIIITDDLGWGDVSYNGGP-INTPNIDKLADDGLQMNRFY 61
Query: 73 VQALCTPSRSALMTGKYPIHIGMQHGVIL-----EGEPWGLPLTEKLLPQYLKEAGYATH 127
C+P+R+AL TG +++G+I E +GLPL K+LP+YLKE GY T
Sbjct: 62 SAPTCSPTRAALFTGIN----SLKNGIIRPLNNPTAERYGLPLKHKILPEYLKEIGYQTA 117
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
GKWHLG F + Y P RGF+S YG+ G Y+DH+ LD N ++ +
Sbjct: 118 LSGKWHLGMFSDEYLPRNRGFESTYGHLGGGIGYFDHALSGR------LDWHRNGEILYE 171
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
YST L + AI +I N P+FLY+A A H P QA ++ + DISD
Sbjct: 172 D--GYSTTLIADEAIRIIENKNNETPLFLYVAFNAPHT-----PIQAEEKIINNLSDISD 224
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH----------- 296
+ R YA + LD +G +I A+ G+L +I++F +DNG P F I
Sbjct: 225 KKERVYAANIITLDREIGKIIDAIESKGILNETIIIFFSDNG-PVFDIDPIAATIAPDIL 283
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
+KG+ L+G K + ++GG+R A I+ + + + +S++ F + D LPTL A I
Sbjct: 284 DSKGNTLGLKGSKGSAYEGGIRVPAVIYYKGILE-KSLSNQFFFVDDILPTLFGALNINP 342
Query: 357 NDTSLDGVNQWDVLTK 372
N ++ GVN+W+ L K
Sbjct: 343 NQNNITGVNRWNDLIK 358
>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
Length = 328
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 189/336 (56%), Gaps = 18/336 (5%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+ + ILADDLG+ND+ F + + TPN+D LA G+IL +YVQA+CTPSR ALMTG+Y
Sbjct: 1 PNFVFILADDLGYNDIGFR-NPDVQTPNLDYLANKGVILTNNYVQAVCTPSRHALMTGRY 59
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
P MQ+ VI + L K LPQYLKE GY H IGKW+LG+ RE PT RGFD
Sbjct: 60 PYRSAMQNFVINPDQAKCTALEYKFLPQYLKELGYQNHLIGKWNLGYCREECLPTSRGFD 119
Query: 150 SHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYS-----TDLYTEAAIN 203
S +G G DY++H+ ++ Q L + + I I DL +
Sbjct: 120 SFFGLLDGAGDYWEHTTYGLYDCTGQSLAGMACLCEFTQKISILMPVICFQDLELDRLDK 179
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYAGMVSRLDE 262
+ EH+ +P+FLY A N + P Q DE + K+ D + RR Y + S LD
Sbjct: 180 IFTEHDNKQPLFLYFA-----PQNPHTPSQPTDEFLNKYSADDFEDVRRRYLALTSGLDS 234
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
VG V+ L + M++N+ ++F++DNGA +G N P RG K + ++GG + +
Sbjct: 235 MVGKVVELLNDNHMMDNTYLIFLSDNGA-----DPAEGLNAPFRGGKGSLFEGGTKSSSF 289
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
I+SP LK+T+ + L HI+DWLPTL AG + D
Sbjct: 290 IYSPLLKKTEYENDGLMHITDWLPTLVKLAGGHVED 325
>gi|432963754|ref|XP_004086820.1| PREDICTED: uncharacterized protein LOC101172478 [Oryzias latipes]
Length = 873
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 124/303 (40%), Positives = 180/303 (59%), Gaps = 13/303 (4%)
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y IH G+QH VI +P LPL LPQ L++AGY+TH +GKWHLGF++ PT RGF
Sbjct: 421 YQIHTGLQHSVIRATQPNCLPLENVTLPQKLRDAGYSTHMVGKWHLGFYKRGCLPTQRGF 480
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
DS +G G DYY H K G D+ + + G+YST ++T A++++A
Sbjct: 481 DSFFGSLLGSGDYYSHY-KCEVPGMCGYDLYEGEEAAWEQDRGLYSTVMFTRKAVSILAH 539
Query: 208 HN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
HN ++ P+FLYLA+ AVH+ P Q P + ++ IS+ RR YA MVS LDE++ N
Sbjct: 540 HNPRAHPLFLYLAYQAVHS-----PLQVPARYLERYKGISNLSRRKYAAMVSCLDEAIHN 594
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
+ AL+++G +N+++++ +DNG GSN PLRG K+T W+GG+R V + SP
Sbjct: 595 LTLALKRYGYYDNTVIVYSSDNGGQPLA----GGSNWPLRGSKATYWEGGIRAVGFVHSP 650
Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
L + L HI+DW PTL G D +LDG + W+ +++G + R +ILHNI
Sbjct: 651 LLMKKGTKCRSLIHITDWFPTLVTLGEGTLDEDLNLDGYDVWEAISEGRPSPRLDILHNI 710
Query: 386 DNV 388
D +
Sbjct: 711 DPI 713
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 4/77 (5%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L++ ++K + +LA Y T VP D R++P +W PW
Sbjct: 784 LFNITADPYERVDLSQHYPHIVKTMLLRLAQYNRTAVPVRYPAKDLRSNPQYNGGVWGPW 843
Query: 565 YD----ELDKQKAIETM 577
Y E D+++ T+
Sbjct: 844 YKDERGEPDEEEQYNTL 860
>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 706
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 139/418 (33%), Positives = 227/418 (54%), Gaps = 64/418 (15%)
Query: 29 KPHIIIILADDLGWNDVSFHGSS-QIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
KP++ I+ DD+GWND+ + Q TP++D LA G+ + +Y ++CTP+R++LMTG
Sbjct: 130 KPNVFFIMVDDMGWNDIGYQSVDLQGVTPHLDRLAAGGVKMTNYYTMSICTPARASLMTG 189
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y + G+Q+ VI G PWGLPLTEK+ P+Y+K+AGY TH IGKWH+G + + P+ RG
Sbjct: 190 RYVMRYGLQYSVIQPGAPWGLPLTEKIFPEYMKDAGYETHMIGKWHIGSYTSRHIPSQRG 249
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEP--------------YQGLDMRHNMQVDNKTI--- 189
FD++ GY + Y+ H S A + Y +D Q+ + TI
Sbjct: 250 FDTYLGYLNDEEMYWTHQSWTAKYNGRKFFDFGFGNATGYYDVIDREVPDQLRDGTISDG 309
Query: 190 ---------------------GIYSTDLYTEAAINVIAEHN--KSKPMFLYLAHLAVHAG 226
G YST ++ A+ +I E +P+++YLAH AVH
Sbjct: 310 DDDETMLSATSSYSSSGGFFKGRYSTQVFHNRALEIIHEKTPYDDEPLYMYLAHQAVHDP 369
Query: 227 NTYEPFQ--APDE-EVAKFLDISDPE-----RRTYAGMVSRLDESVGNVIAALRKHGMLE 278
P + PD+ ++ + ++ E R +A ++ LD ++G ++ L +G +E
Sbjct: 370 LGLPPAEHFTPDQLDLLERIETGSEEGTEALRARFAKVLMYLDYTIGELVEYLETNGWME 429
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE- 337
NSI++ +DNG G S GSN+PLRG+K + W+GG + + ++S ++ +E
Sbjct: 430 NSIIIVASDNG----GCPSCGGSNYPLRGIKHSYWEGGTKVPSFVYSKSHIPEERWGTEY 485
Query: 338 --LFHISDWLPTLCAAAGIEINDTS--LDGVNQWDVLTKGAK-----TKRSEILHNID 386
L H++DWLPT+ AA I+++ + DGV+QW ++T A + R E+++N D
Sbjct: 486 DGLMHVTDWLPTIAGAAEIDLDGGAGPFDGVDQWPLITSVAAEAEFGSARDEMMYNFD 543
>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 534
Score = 231 bits (589), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 142/373 (38%), Positives = 204/373 (54%), Gaps = 33/373 (8%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+I+ +LADDLGW+DV +H S I TPNID LA G+ L +YV + C PSR LMTG+
Sbjct: 68 RPNIVFVLADDLGWHDVGYH-DSVIKTPNIDQLAAEGVKLENYYVSSWCAPSRVNLMTGR 126
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL-GFFREVYTPTFRG 147
Y I G+ +G + + + + E L L EAGY T +GKWHL GF PT RG
Sbjct: 127 YRIRTGL-YGDVCD----FMGIHETTLADKLYEAGYYTAMVGKWHLSGFEHAECYPTHRG 181
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
F + GY G Q+Y+ H PY ++ V K G YST ++ + A +I +
Sbjct: 182 FQTFLGYHGGSQNYFTHRRGGPHAPYDFWANDTSIAV--KYEGQYSTMIFADEAQRIIRQ 239
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVGN 266
HN +P+FLYL+ AVH P P ++ I D +RR YA M S +DE++GN
Sbjct: 240 HNTKQPLFLYLSFQAVHV-----PLLVPPSYEDQYRSLIEDDKRRVYAAMASCMDEAIGN 294
Query: 267 VIAALRKHGMLENSIVLFMA-----------DNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
V A L++ G+ N++ +F + DNGA ++ GSN PLRG K ++G
Sbjct: 295 VTATLKEKGLWNNTVFIFSSARLNFKHDFCLDNGATP----TDGGSNWPLRGRKGLWYEG 350
Query: 316 GMRGVAAIWSPWLKQ--TQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTK 372
G+R V + S L T + EL H++DW PTL A +++ LDG++QW+ +++
Sbjct: 351 GIRAVGFVNSHLLSDHVTGTTNHELIHVTDWFPTLLRLAKADLHGLKPLDGIDQWETISR 410
Query: 373 GAKTKRSEILHNI 385
G T R E+L N
Sbjct: 411 GNVTNRKEVLLNF 423
>gi|291244839|ref|XP_002742301.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 263
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 114/248 (45%), Positives = 162/248 (65%), Gaps = 6/248 (2%)
Query: 174 QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQ 233
G D+R N+ V G Y+T+++T A +I +HNK P+FLYLAHLAVH+GN P +
Sbjct: 1 MGYDLRDNVNVTYYMNGSYTTEVFTTKAEQIIKKHNKQNPLFLYLAHLAVHSGNPSNPLE 60
Query: 234 APDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
AP++ +++F I D R+T+AGMVS LDESVGN+ L+ GM N+I++F DNG P+
Sbjct: 61 APEKYMSRFPHIQDKNRKTFAGMVSALDESVGNLTKVLKDSGMYNNTIIIFTTDNGGPAA 120
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
G ++N SN PLRG+K+T W+GG+RG + SP L++ +VS+++ H+ DWLPT+ AG
Sbjct: 121 GFNNNHASNWPLRGIKATLWEGGVRGDGFVHSPLLEKPGRVSTDMIHVCDWLPTIYHIAG 180
Query: 354 IEIND--TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN 411
++ T+LDG N WD L++G+ + R+EILHNID P AALRV K + G
Sbjct: 181 GNVSSLPTNLDGFNVWDTLSRGSVSPRTEILHNID----PILQIAALRVGKYKIIMGECY 236
Query: 412 NGQSDEWY 419
G D WY
Sbjct: 237 RGNWDGWY 244
>gi|156340112|ref|XP_001620356.1| hypothetical protein NEMVEDRAFT_v1g148421 [Nematostella vectensis]
gi|156205165|gb|EDO28256.1| predicted protein [Nematostella vectensis]
Length = 260
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 122/253 (48%), Positives = 160/253 (63%), Gaps = 12/253 (4%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAYNGLILNQHYVQALCTPSRS 82
T+A K PHI+ IL DDLGW+DV +H S + TPNID LA G+ L +Y Q +CTPSR
Sbjct: 19 TSATKPPHILFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRG 78
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
ALMTGKYPIH+GMQH VI PWG+P +PQ L+ GY T IGKWHLGFF YT
Sbjct: 79 ALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDYT 138
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P RGFDS G++ G QD++ HS K F LD R + + N+ G +STD++T+ AI
Sbjct: 139 PLRRGFDSFLGFFAGEQDHWRHS-KMGF-----LDFRRDEEPANEYGGQHSTDVFTQEAI 192
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
N+ HN S+P+FL L++ AVH P QA +V K +SD +R+ Y GM+ D
Sbjct: 193 NIAMRHNASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGAADW 247
Query: 263 SVGNVIAALRKHG 275
S+G +I +++G
Sbjct: 248 SIGRLIDVYKRNG 260
>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 588
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 182/625 (29%), Positives = 286/625 (45%), Gaps = 144/625 (23%)
Query: 36 LADDLGWNDVSFHGSS-QIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIG 94
+ DD+GW+D+ + + TP +D LA G+ + +Y CTP+R++LMTG+Y + G
Sbjct: 1 MIDDMGWDDIGYQSVDLKGVTPVLDKLAAGGVKITNYYTMNTCTPARASLMTGRYTVRYG 60
Query: 95 MQHGVILE-GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG 153
MQ+ V + GEPWG+PL+EK+LP+Y KEAGY TH +GKWHLG + P+ RGFD++ G
Sbjct: 61 MQYNVAINPGEPWGVPLSEKMLPEYFKEAGYGTHLVGKWHLGSHSPEHIPSQRGFDTYMG 120
Query: 154 YWQGLQDYYDHS-----------CKATF---------------EPYQGLDMRHNMQV--- 184
Y G + Y+ H C F + G D R + +
Sbjct: 121 YVGGFEAYWTHETVGVISDGRHVCDFGFGNASGYYDIIDRPSSQDIYGSDHREDQKEAEE 180
Query: 185 ---DNKTIGIYS-------------------------TDLYTEAAINVIAEHN--KSKPM 214
D G++S T ++ E AI ++ + +P+
Sbjct: 181 NRDDGDGEGVHSTTPTKTPWASSFRSNSGVMKGQQYSTQMFQERAIEILETKSPFDEEPI 240
Query: 215 FLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-----ISDPE----RRTYAGMVSRLDESVG 265
+++LAH AVH+ P + E+ LD I D + RR ++ LD+SVG
Sbjct: 241 YMHLAHQAVHSPLGLPPADSFSEDETLLLDEIEGKIGDVQGAQTRRHLTEVLMFLDKSVG 300
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
++I L G +ENSIV+ +DNG S + GSN+PLRG K++ W+GG + A ++S
Sbjct: 301 HLIHYLEDEGWMENSIVVVASDNGGDS----NEGGSNYPLRGKKASYWEGGSKVPAFVYS 356
Query: 326 PWLKQTQKVSSE---LFHISDWLPTLCAAAGIEINDTS--LDGVNQWDVLTKGAKT---- 376
+ SE L H++DWLPTL A AG+E++ ++ LDGVNQW+ + A
Sbjct: 357 TSHIPEARRGSEYDGLMHVTDWLPTLAAGAGLEMSGSAGPLDGVNQWEHIVGSAAAVDGG 416
Query: 377 -KRSEILHNID------NVDNPQKYYAAL---RVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
R+E+L+N D N ++ + +L RV D Y G +G
Sbjct: 417 RPRTELLYNYDPYRLLPNSNHCYCCWCSLCHYRVYDTGYAQGAFRSG------------- 463
Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
KY + TF+ +L ++ + +L +
Sbjct: 464 -KYKMMFNVSCHGWYTFDG------------------------DMLEEDHLTDLSKVCGG 498
Query: 487 RCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDK-----LAIYKSTMV 541
CN + G + LF++ DP E NNL + D+ +++ D+ Y T+
Sbjct: 499 TCN--DGGTCSDEATSDYLFDLDQDPRESNNLIDELPDVAQEMRDRWYEVTFREYTDTVF 556
Query: 542 PPGN-KPFDKRADPARWNNIW-VPW 564
P + K +D A+ NN W +PW
Sbjct: 557 EPLDLKAYDVWAE----NNWWMIPW 577
>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1818
Score = 229 bits (583), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 145/390 (37%), Positives = 211/390 (54%), Gaps = 41/390 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHG----SSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
K PH++ LADD+G+ DV ++G ++++ TP IDALA G+ L+++Y Q CTPSR+A
Sbjct: 42 KPPHVVFFLADDVGFGDVGYNGDPTLTNRVSTPVIDALADAGVKLSRYYTQPDCTPSRAA 101
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
L++GKYP G HGV+ WGLPL LLP+ L A Y +HA+GKW +G P
Sbjct: 102 LLSGKYPATTGTYHGVLNPQSTWGLPLEHALLPEALPGA-YRSHAVGKWDVGHSSAKRLP 160
Query: 144 TFRGFDSHYG----YWQGLQDYYDHSC---KATFEPYQGLDMRHNMQVDNKTI------- 189
RGFDS G ++ + DY H QV T+
Sbjct: 161 EARGFDSFLGFYLCFYGPMIDYSTHEIHDHDLACAGDACAAALAKCQVRGSTVADFSGLG 220
Query: 190 -------GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK- 241
G+Y+TD++ + A+++I P+FLY+A AVH P +A + +VA
Sbjct: 221 GQRRDYDGMYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVHG-----PLEADEADVAAF 275
Query: 242 ---FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
FL SD +R T+A + +D ++G V+ AL+ G EN++V+F +DNGA
Sbjct: 276 EATFLPESDSKRATFAAVALAMDRAIGRVVDALQATGAYENALVVFASDNGA--IPGQMG 333
Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEI 356
GSN PLRG K + W+GG+R A + SP L + V LFH++DWLPT+ AAAG +
Sbjct: 334 GGSNWPLRGGKFSAWEGGVRVPAFVHSPLLPAHMRGAVYDGLFHVADWLPTIAAAAGAAV 393
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
+ S DGV+ + L RSE+L ++D
Sbjct: 394 D--SGDGVSHYRALFADGPAPRSELLVHVD 421
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 136/396 (34%), Positives = 218/396 (55%), Gaps = 42/396 (10%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHG----SSQIPTPNIDALAYNGLILNQHYVQALCT 78
+ A PH+++ LADD G++DV ++ ++Q+ TP +D+LA G+ L ++Y Q CT
Sbjct: 612 DDAASNPPHLLLFLADDAGFDDVGYNSDPSKTNQVQTPFLDSLAAGGVKLARYYTQPDCT 671
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
PSR+AL++G YP GM H +I WGL L +L+PQ L A Y +HA+GKW +G +
Sbjct: 672 PSRAALLSGMYPASSGMYHKMITAQSNWGLDLDLELIPQRLP-AAYRSHAVGKWDVGHYT 730
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH--------------NMQV 184
+ P FRGF S G++ + DYY H T + + +++ +
Sbjct: 731 WSHVPQFRGFRSFLGFYSPIIDYYTHETFDTLQCLEEMELTECEARLASECSSSIKDFNF 790
Query: 185 DNKTI----GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV- 239
D + G YSTD++ A ++I + P+FLY+A AVHA P +A DE+V
Sbjct: 791 DGDPLPLADGTYSTDVFAARARDLIRKEAPKHPLFLYVAFNAVHA-----PLEA-DEDVI 844
Query: 240 ----AKFLDISDPERRTYAGMVSRLDESVGNVIAALRK-HGMLENSIVLFMADNGAPSFG 294
+ FL+ + R ++A + +D ++ V+ L+ G ENS+++F +DNG
Sbjct: 845 ESFASSFLEEARDARASFAAVALGMDRAMEAVVDELKAVDGAYENSVIMFASDNGGLPGQ 904
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAA 352
I GSN PLRG K +P++GG+R A + SP L ++ + LFH++DWLPTL A
Sbjct: 905 IAG--GSNWPLRGQKFSPFEGGVRVPAFVHSPLLPAARRGGAYGGLFHVTDWLPTLVRLA 962
Query: 353 GIEINDTSLDGVNQWDVL--TKGAKTKRSEILHNID 386
G ++DGV+Q++ + A+ R+E+L ++D
Sbjct: 963 G-GAPPRNVDGVDQYEAIFGEHPAEHPRTELLVHVD 997
>gi|395504882|ref|XP_003756775.1| PREDICTED: arylsulfatase I [Sarcophilus harrisii]
Length = 598
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 115/264 (43%), Positives = 164/264 (62%), Gaps = 9/264 (3%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSRS L+TG+Y
Sbjct: 215 HIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 273
Query: 91 IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
IH G+QH +I +P LPL + LPQ L+E GY+TH +GKWHLGF+++ PT RGFD+
Sbjct: 274 IHTGLQHSIIRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDT 333
Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
G G DYY + +C G D+ V + G YST LY + A ++A HN
Sbjct: 334 FLGSLTGNVDYYTYDNCDGP--GVCGYDLHEGESVAWEQSGQYSTLLYAQRASQILASHN 391
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N+
Sbjct: 392 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAVRNITW 446
Query: 270 ALRKHGMLENSIVLFMADNGAPSF 293
AL+++G +NS+++F DN +F
Sbjct: 447 ALKRYGYYDNSVIIFSTDNXLANF 470
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P W PW
Sbjct: 490 LFNISADPYEREDLAAQRPDIVRSLLARLAYYNRTAIPVRYPAENPRAHPDFNGGAWGPW 549
>gi|1030057|emb|CAA63067.1| murine arylsulfatase B [Mus musculus]
Length = 252
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 121/257 (47%), Positives = 156/257 (60%), Gaps = 14/257 (5%)
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LD 177
AGYATH +GKWHLG +R+ PT RGFD+++GY G +DYY H A E G LD
Sbjct: 1 AGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALD 60
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+R + + IYST+++T+ A VIA H KP+FLYLA +VH +P Q P+E
Sbjct: 61 LRDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEE 115
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
+ + I D RR YAGMVS +DE+VGNV AL+ HG+ N++ +F DNG G
Sbjct: 116 YMEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTR 171
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN 357
+ G+N PLRG K T W+GG+RG + SP LKQ S EL HISDWLPTL AG N
Sbjct: 172 SGGNNWPLRGRKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTN 231
Query: 358 DTS-LDGVNQWDVLTKG 373
T LDG N W +++G
Sbjct: 232 GTKPLDGFNMWKTISEG 248
>gi|117558328|gb|AAI27451.1| Arsb protein [Rattus norvegicus]
Length = 523
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 121/268 (45%), Positives = 164/268 (61%), Gaps = 14/268 (5%)
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMR 179
YATH +GKWHLG +R+ PT RGFD+++GY G +DYY H A E G LD+R
Sbjct: 128 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLR 187
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
+ + IYST+++T+ A +IA H KP+FLYLA +VH +P Q P+E +
Sbjct: 188 DGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYM 242
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
+ I D RR YAGMVS LDE+VGNV AL+ G+ N++++F DNG G +
Sbjct: 243 EPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNG----GQTRSG 298
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
G+N PLRG K T W+GG+RG + SP LKQ S EL HI+DWLPTL AG + T
Sbjct: 299 GNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGT 358
Query: 360 S-LDGVNQWDVLTKGAKTKRSEILHNID 386
LDG + W+ +++G+ + R E+L NID
Sbjct: 359 KPLDGFDVWETISEGSPSPRVELLLNID 386
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LF+I DP E+++++ +++ L +L Y VP P D R DP + +W PW
Sbjct: 464 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 522
>gi|344250866|gb|EGW06970.1| Arylsulfatase I [Cricetulus griseus]
Length = 484
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 118/268 (44%), Positives = 168/268 (62%), Gaps = 10/268 (3%)
Query: 23 NTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
++ AP +P HII IL DD G++DV +HGS I TP +D LA G+ L +Y+Q +CTPSR
Sbjct: 41 SSAAPSQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSR 99
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
S L+TG+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGFFR+
Sbjct: 100 SQLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKEC 159
Query: 142 TPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
P+ RGFD+ G G DYY + +C G D+ V G YST LY +
Sbjct: 160 LPSCRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGETVAWGLSGQYSTMLYAQR 217
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A +++A H+ P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +
Sbjct: 218 ASHILASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCM 272
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADN 288
DE+V N+ AL+++G NS+++F +DN
Sbjct: 273 DEAVRNITWALKRYGFYNNSVIIFSSDN 300
Score = 43.5 bits (101), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 22/60 (36%), Positives = 32/60 (53%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNI+ DP E+ +LA + D+++ L +LA Y T +P + RA P W PW
Sbjct: 378 LFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAANPRAHPDFNGGAWGPW 437
>gi|298706912|emb|CBJ29739.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 781
Score = 226 bits (575), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 140/429 (32%), Positives = 220/429 (51%), Gaps = 72/429 (16%)
Query: 18 NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGS--SQIPTPNIDALAYNGLILNQHYVQA 75
D + + KP++ L DD+G+ D+ + + S+I TPN+DALA G+ L+ +Y
Sbjct: 205 GDGGPGSVSTSKPNVFFFLIDDMGFGDMGYQSTDLSEI-TPNLDALAAGGVKLSNYYTMT 263
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
LCTP+R+++MTG+YP+ GMQ+ VI+ G PWGLP +EK+LP+Y+ EAGY +H +GKWHLG
Sbjct: 264 LCTPARASIMTGRYPVRYGMQYSVIMPGSPWGLPTSEKILPEYMNEAGYESHMVGKWHLG 323
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFE----------------------- 171
+R+ P+ RGF + GY G++ YY H + +A+ +
Sbjct: 324 SYRDESLPSQRGFKTFLGYLNGIETYYSHKNPEASVDGQYFFDFGYGNATGYHDVTLQNH 383
Query: 172 ------------PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN--KSKPMFLY 217
P G M + D G YSTD + A ++ P+F+Y
Sbjct: 384 DENVGGPCTDGGPRWGDVMENEDPADVCFTGTYSTDAFVGRAKQIVKSKAPFDEDPLFMY 443
Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFL----DISDPERRT-YAGMVSRLDESVGNVIAALR 272
+AH +VH+ P++ E L D ++ E+RT +AG++ LD+ +G + L
Sbjct: 444 IAHQSVHSPTGPAPYEEFSPEELALLEGVKDATESEKRTAFAGVLLYLDKRIGEFVDMLE 503
Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
+ G LENS+++ +DNGA + GSN+PLRG K + ++GG R L + Q
Sbjct: 504 QEGWLENSVIVVSSDNGA----CPDDGGSNYPLRGSKQSVFEGGSRTGYQPSPALLGKRQ 559
Query: 333 ----KVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL----TKGA------KTKR 378
SS + LC + + +LDGV+QW+ L +GA K R
Sbjct: 560 PAGLNTSSPRY--------LCNPCNLPDSAGALDGVDQWECLKSNGVRGAGSGNTSKPPR 611
Query: 379 SEILHNIDN 387
+E+L+N D+
Sbjct: 612 TELLYNWDS 620
>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1517
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 138/365 (37%), Positives = 195/365 (53%), Gaps = 44/365 (12%)
Query: 18 NDAFLNTTAPKKPHIIIILADDLGWNDVSF-HGSSQIPTPNIDALAYNGLILNQHYVQAL 76
+D L + A PH++ L DDLG++DV + + + TP+IDALA G+ L+++Y
Sbjct: 59 DDRDLGSDAASYPHVVFALMDDLGFDDVGYGNADGAVATPHIDALAKEGVTLSRYYSAFS 118
Query: 77 CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
CTP+R AL+TG P +G+QHG + +PWGLP +LPQ+L + GY +H +GKWHLG
Sbjct: 119 CTPARGALLTGLSPHRLGLQHGQVFPEQPWGLPSKFSILPQHLAKLGYRSHLVGKWHLGH 178
Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDH----SCKATFEP-YQG---------------- 175
F PT RGFDS +G G Q Y H CK + Y+G
Sbjct: 179 FSAERLPTARGFDSFFGGLDGAQYYATHIDAMDCKLPGDVLYRGFEVGDYDSLKAVTAEH 238
Query: 176 ---LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK-----SKPMFLYLAHLAVHAGN 227
D+R N G YST L+ A +I H+K KP+FL L+ AVHA
Sbjct: 239 GCYFDLRENNDRVEDLFGSYSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVHA-- 296
Query: 228 TYEPFQAPD---EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
P AP+ E L++++ RR +A + +D++VGNV+AAL + M NSI++F
Sbjct: 297 ---PVWAPEDTYETHPDLLNVTNGNRRKFAAALRLVDDAVGNVVAALEEARMDANSILIF 353
Query: 285 MADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISD 343
+DNGA P G GSN PLRG K ++GG+R A + +P FH++D
Sbjct: 354 ASDNGANPEHG-----GSNAPLRGSKGYLFEGGVRVPAFVRAPKYLPRGATYDHPFHVTD 408
Query: 344 WLPTL 348
W+PTL
Sbjct: 409 WVPTL 413
>gi|405964717|gb|EKC30169.1| Arylsulfatase B [Crassostrea gigas]
Length = 898
Score = 224 bits (572), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 117/298 (39%), Positives = 179/298 (60%), Gaps = 25/298 (8%)
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
+GKWH+GF+++ Y PT RGFDS++GY G +DYY H + G+D+R N N +
Sbjct: 526 VGKWHVGFYKKEYLPTSRGFDSYFGYLTGSEDYYTH--HRCYGKMCGMDLRENAVPANYS 583
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
GIYST L+ + A +++ +H+ +P+FLYL +VH+ P Q P++ + +L I D
Sbjct: 584 -GIYSTYLFAQKAADIVQKHDSDQPLFLYLPFQSVHS-----PLQVPEKYIQPYLHIKDK 637
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
+RR YAGMV+ +DE++GN+ ++ G+ N++++F DNG G G+N+PLRG
Sbjct: 638 KRRKYAGMVAAMDEAIGNLTEVFKQKGIWNNTLMVFSTDNG----GQILEGGNNYPLRGW 693
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQW 367
K++ W+GGM GV + LK+ VS L H++DW PTL + AG +N T LDGV+QW
Sbjct: 694 KASLWEGGMHGVGFVHGQMLKRKGMVSKALMHVTDWFPTLVSLAGGNLNGTKPLDGVDQW 753
Query: 368 DVLTKGAKT-KRSEILHNIDNVDNP-----------QKYYAALRVDDLKYVAGTDNNG 413
+++G+ +RS ILHNID + P + AA+R D K + G+ NG
Sbjct: 754 KTISEGSTVNQRSSILHNIDPLFQPVGHRLPKSPFDNRIRAAVRFGDWKLITGSPGNG 811
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 36/77 (46%), Positives = 55/77 (71%), Gaps = 1/77 (1%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
+LF + F +PHI+ +LADD G+ND+ +HGS +I TPN+D LA +G+ L +YVQ
Sbjct: 12 VLFFNTFPVQGKSSQPHILFVLADDFGYNDIGYHGS-EIKTPNLDKLAGDGIRLENYYVQ 70
Query: 75 ALCTPSRSALMTGKYPI 91
+CTP+RS L++G+Y +
Sbjct: 71 PICTPTRSQLLSGRYQV 87
Score = 46.2 bits (108), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 23/60 (38%), Positives = 31/60 (51%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E ++A ++ Q+ D LA Y T VP D++A+P W PW
Sbjct: 838 LFNITADPYEHKDVAHINPKVVMQMLDLLAQYNQTAVPCRYPKMDEKANPKYLGGYWGPW 897
>gi|313228866|emb|CBY18017.1| unnamed protein product [Oikopleura dioica]
Length = 482
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 141/387 (36%), Positives = 199/387 (51%), Gaps = 32/387 (8%)
Query: 17 FNDAFLNTT-APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA 75
F AFL + KP++I+ILADDLG+ND+ ++ + I PN+ +LA NG IL+ YVQ
Sbjct: 69 FFAAFLVAKGSSSKPNVILILADDLGFNDMPWNNPAII-APNLHSLAKNGTILSNFYVQP 127
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
+CTPSRSALMT +YPI +G+Q VI +P LPL E + + AGY TH +GKWHLG
Sbjct: 128 VCTPSRSALMTSRYPIRLGLQTDVITAPQPSCLPLDEVTIGNEFQSAGYTTHIVGKWHLG 187
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-----GLDMRHNMQVDNKTIG 190
+ P RGFD+ GY G +DYY + P Q G D N K G
Sbjct: 188 HYCPQCLPNNRGFDTFRGYLTGAEDYYKKTFCIPLVPNQRPAACGFDFYDNENRMPKANG 247
Query: 191 IYSTDL-------------YTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPD 236
YST + +A+ VI H SK P FLYL +VH P P
Sbjct: 248 TYSTYQVLIYLFIHTIILKFADASREVIKSHEGSKTPFFLYLPFQSVHY-----PVMVPK 302
Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
+ + + R+ Y GMV+ +DE+VG+++ L+ G EN++ + DNG G+
Sbjct: 303 NYSDLYPNEKNHTRKEYMGMVTAMDEAVGHIVDELKAAGAYENTLFFYATDNG----GLV 358
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE- 355
G N P RG K+T W GG+R A I P + + + HI D T+ A G +
Sbjct: 359 GAGGRNAPFRGQKATLWQGGLRVPAFISGPGVPSASNYKN-MMHIVDVQMTVLDAIGHQR 417
Query: 356 INDTSLDGVNQWDVLTKGAKTKRSEIL 382
+ +DGV+ W +T G + + ++L
Sbjct: 418 TGEKHVDGVSHWSAITAGDEKETMDLL 444
>gi|323454930|gb|EGB10799.1| hypothetical protein AURANDRAFT_22180 [Aureococcus anophagefferens]
Length = 530
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 137/369 (37%), Positives = 194/369 (52%), Gaps = 24/369 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPT--PNIDALAYNGLILNQHYVQALCTPSRSALM 85
K+PHI +L DDLG+ND +S + P++ ALA + + +Y LCTP+R A M
Sbjct: 18 KQPHIAFMLVDDLGFNDAWAVNTSDVAGAWPHVGALANESVRIATYYTTMLCTPTRGAFM 77
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+ P +G+ HGVI + +GLP E + LK AGYAT +GKWHLG F + PT
Sbjct: 78 TGRLPQRLGLHHGVIGGFQDYGLPANETTIADKLKGAGYATAHVGKWHLGNFDDASEPTR 137
Query: 146 RGFDSHYGYWQGLQDYYDHSCKAT--FEPYQGLDMRHNMQVDN--KTIGIYSTDLYTEAA 201
RGFD+ YGY G +D++ + F+ D VD+ K G +++ L+ + A
Sbjct: 138 RGFDASYGYQNGEEDHFTRILQGLGDFQASSLADGASYYTVDDLLKINGTHNSFLFVDRA 197
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP----DEEVAKFLDIS----DPERRTY 253
+ V+ H+ S P+FLYLA HA P QAP D+ + S D RR +
Sbjct: 198 VAVVGAHDASIPLFLYLALQDTHA-----PIQAPAGYADDAACASIGSSRHYPDGARREF 252
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
GMV DES+ NV AA+ + NS+++ DNG GI G+N PLRG K PW
Sbjct: 253 CGMVRAADESIANVTAAIDANFGKRNSVIVIAGDNG----GIVKGGGNNWPLRGQKGQPW 308
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG 373
+GG+R A I +P + H+ D TL A AG + +DG++ W L G
Sbjct: 309 EGGVRNHALIRAPGRLAPGLYDKGMVHVVDLHATLVALAG-GASAYEIDGLDVWAALRDG 367
Query: 374 AKTKRSEIL 382
+ + RSE+L
Sbjct: 368 SPSPRSEVL 376
>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 484
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 136/378 (35%), Positives = 208/378 (55%), Gaps = 33/378 (8%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
++ +LF+ + ++P+I++IL DDLG+NDV F+GS+ I TPN+D LA +G I
Sbjct: 12 VSAVVLFSSCINAQSQTEQPNILLILCDDLGYNDVGFNGSTDITTPNLDQLAQDGTIFTS 71
Query: 71 HYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWG--LPLTEKLLPQYLKEAGYATH 127
YV C PSR+AL+TG+YP +G Q + G G + + EK + +++AGY T
Sbjct: 72 AYVAHPFCGPSRAALLTGRYPHTLGSQFNLPANGASTGKGISVEEKFMGVPMQKAGYYTG 131
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----------LD 177
AIGKWHLG E Y P RGF+ YG+ G Y+ K ++ + L
Sbjct: 132 AIGKWHLGETAE-YHPNKRGFNDFYGFLGGGHKYFPEEYKLQYKHQKEMGTKNINDYVLP 190
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
+ HN + + Y TD+ + I E H+K KP FLYLA+ A H P +A +
Sbjct: 191 LEHNGAIVEEND--YLTDVLSREGIRFTKEAHDKKKPFFLYLAYNAPHV-----PLEAKE 243
Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
+++ KF DI D +RRTYA MV +D VG ++A+L+K G +N++++F++DNG H
Sbjct: 244 KDLEKFKDIEDIDRRTYAAMVYAVDRGVGEIVASLKKTGQFDNTLIIFLSDNGG-----H 298
Query: 297 SNKGS-NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAG 353
+ G+ N+PL G K W+GG R ++ W K+ +K + +S D PT+
Sbjct: 299 TGHGANNYPLTGRKGDTWEGGFR--VPMFFHWPKKIKKGQKFDYPVSALDLYPTIAHIGQ 356
Query: 354 IEI-NDTSLDGVNQWDVL 370
+I +D LDG N W+ +
Sbjct: 357 AKIGSDKILDGKNIWEAI 374
>gi|328726458|ref|XP_001949756.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 402
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 152/392 (38%), Positives = 204/392 (52%), Gaps = 25/392 (6%)
Query: 192 YSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNT----YEPFQAPDEEVAKFLDIS 246
Y+TD++TE A +I + ++ MFL ++HLAVH+GN+ E + A F I
Sbjct: 22 YATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSGPNHLEVLNKTYNDEA-FGYIE 80
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS---FGIHSNKGSNH 303
+ RR YAGMV+ LD+SVG +I +L +H MLENSIV+FM+DNGAP+ H N GSN
Sbjct: 81 NENRRLYAGMVTSLDDSVGKIIESLHEHEMLENSIVVFMSDNGAPTDDPLWGHDNFGSNW 140
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LD 362
PLRG K++ +GG+RGVAAIWSPWLK+ VS+ LFHI+DWLPTL AAG D +D
Sbjct: 141 PLRGKKASVLEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTLYTAAGGNFEDLGKID 200
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG--TDNNGQSDEWYG 420
GVNQW LT +K RS+IL NID + AL + K V T N + ++ G
Sbjct: 201 GVNQWKSLTDTSKQVRSKILINIDETRGEE----ALIFNQWKVVKSNRTSNMVFNLKYAG 256
Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
D N KY+ V+ S G + + Q D N+S A + ++
Sbjct: 257 DPGNAGPKYNMSGVVASTVGKRLSKINCLTQ-------DSCMNASTARDVFYVSRSLAKV 309
Query: 481 R-EFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKST 539
+ R + N CLF+I DPCE N+A+ L L +K
Sbjct: 310 DGQTCPERVSNPNPDQQYECFDGYCLFDILQDPCEYRNVAKQNPQTLDATIHMLEQFKKE 369
Query: 540 MVPPGNKP-FDKRADPARWNNIWVPWYDELDK 570
M P D ADP R+ W PW + DK
Sbjct: 370 MTMQAPDPIIDPDADPRRFAGYWEPWLEPFDK 401
>gi|291513548|emb|CBK62758.1| Arylsulfatase A and related enzymes [Alistipes shahii WAL 8301]
Length = 467
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 143/356 (40%), Positives = 192/356 (53%), Gaps = 31/356 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P+I+ I+ADDLGW DV +HGS IPTPNIDALA G+ +N+ Y + +P+R+ LMTG
Sbjct: 30 ERPNILFIVADDLGWGDVGYHGSV-IPTPNIDALAARGIEMNRFYTAPVSSPTRAGLMTG 88
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI-GKWHLGFFREVYTPTFR 146
+YP G++ VI +GL E+ L L GYA AI GKWHLG R Y P R
Sbjct: 89 RYPSRFGIRKTVIPPWRDYGLDPEEQTLADMLAANGYAHRAIVGKWHLGHGRRAYYPLNR 148
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
GF YG G DY+ H + + HN + G YSTDL + A+ I
Sbjct: 149 GFTHFYGCLNGALDYFTHEREGELD-------WHNDWESCRDEG-YSTDLIADEAVRCIG 200
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA------KFLDISDPER--RTYAGMVS 258
+ P FLY+A A H PFQAP++E+A KF ++ E+ TY MV+
Sbjct: 201 GYASEGPFFLYVAFNAPHT-----PFQAPEDEIAEHISPEKFAALTPREKDAYTYRAMVT 255
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG--SNHPLRGMKSTPWDGG 316
R+D+ VG ++ L + G++EN++VLFM+DNG GI + G S PLRG K WDGG
Sbjct: 256 RMDKGVGRILEKLERSGLVENTLVLFMSDNG----GIPNLPGGSSCAPLRGHKGREWDGG 311
Query: 317 MRGVAAIWSPWLKQTQKVSSE-LFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
+R V A+ + + V SE L D PT+ A G DGVN LT
Sbjct: 312 LR-VPAVLCRKGRFAEGVRSEQLAAFVDMAPTIAALVGAGEPPRPYDGVNLLPALT 366
>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 144/413 (34%), Positives = 221/413 (53%), Gaps = 32/413 (7%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L C L+ K P+I+ +L DDLG+ DV F + I +PN D LA GL+LN+
Sbjct: 4 LQCLLVCVLCSCFVATAKLPNIVFVLIDDLGFADVGFRNPA-ISSPNFDQLAKTGLVLNR 62
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
HYV C+PSR++L+TG++P H Q +++ G + +LP LK A Y+TH +G
Sbjct: 63 HYVFKYCSPSRASLLTGRWPHH-AHQWNPLMDST-IGTNINMTMLPAKLKAANYSTHMVG 120
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
KWHLGFF Y P RGFD+ G++ +D+ + + +D N D + G
Sbjct: 121 KWHLGFFDPRYLPINRGFDTSTGFFGCCEDHMNEKSGCS------IDYWKNNAPDPRN-G 173
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
Y Y + +V++ +N P+FLYL VHA PFQAP E + + S E+
Sbjct: 174 TYDAYNYRDDLTDVMSNYNTENPLFLYLPLHNVHA-----PFQAPKEWLDIYPANSTCEK 228
Query: 251 R-TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGM 308
R T M+S D G+++ L+K G+ +N+I++ ADN GAP GSN+PL+G
Sbjct: 229 RHTLQAMISVADNVTGHLVELLKKKGIWDNTIMVISADNGGAPC------AGSNYPLKGC 282
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----L 361
K+T ++GG+R VA + L ++++ S HI+DW PT C AG++ +D+ +
Sbjct: 283 KTTFFEGGVRSVAFVNGGLLPESRRGQSTDGFIHIADWYPTFCKLAGVDPDDSGTGKFPV 342
Query: 362 DGVNQWDVLT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNG 413
DG++ W ++T + KT+ EI+ N D + A+ V + K V G G
Sbjct: 343 DGLDVWPIITGENEKTQHEEIVLGY-NFDFRRPGQGAIIVGNYKLVVGLQEIG 394
>gi|410039868|ref|XP_003950702.1| PREDICTED: arylsulfatase I isoform 1 [Pan troglodytes]
Length = 426
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 146/452 (32%), Positives = 218/452 (48%), Gaps = 89/452 (19%)
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLGF+R+ PT RGFD+ G G DYY + +C G D+ V
Sbjct: 2 VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWG 59
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
G YST LY + A +++A H+ +P+FLY+A AVH P Q+P E + ++ + +
Sbjct: 60 LSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGN 114
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
RR YA MV+ +DE+V N+ AL+++G NS+++F +DNG +F + GSN PLRG
Sbjct: 115 VARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRG 170
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQ 366
K T W+GG+RG+ + SP LK+ Q+ S L HI+DW PTL AG + LDG +
Sbjct: 171 RKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDV 230
Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNN 412
W +++G + R+EILHNID + N ++ AA+RV + K + G
Sbjct: 231 WPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG---- 286
Query: 413 GQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTIL 472
D YGD + P + L + G +N
Sbjct: 287 ---DPGYGD-------WIPPQTLATFPGSWWNL--------------------------- 309
Query: 473 TDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDK 532
E++ +R+ LFNI+ DP E+ +LA + D+++ L +
Sbjct: 310 --ERMASVRQAV-------------------WLFNISADPYEREDLAGQRPDVVRTLLAR 348
Query: 533 LAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LA Y T +P + RA P W PW
Sbjct: 349 LAEYNRTAIPVRYPAENPRAHPDFNGGAWGPW 380
>gi|119582144|gb|EAW61740.1| arylsulfatase family, member I [Homo sapiens]
Length = 426
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 146/452 (32%), Positives = 218/452 (48%), Gaps = 89/452 (19%)
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLGF+R+ PT RGFD+ G G DYY + +C G D+ V
Sbjct: 2 VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWG 59
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
G YST LY + A +++A H+ +P+FLY+A AVH P Q+P E + ++ + +
Sbjct: 60 LSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGN 114
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
RR YA MV+ +DE+V N+ AL+++G NS+++F +DNG +F + GSN PLRG
Sbjct: 115 VARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRG 170
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQ 366
K T W+GG+RG+ + SP LK+ Q+ S L HI+DW PTL AG + LDG +
Sbjct: 171 RKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDV 230
Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNN 412
W +++G + R+EILHNID + N ++ AA+RV + K + G
Sbjct: 231 WPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG---- 286
Query: 413 GQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTIL 472
D YGD + P + L + G +N
Sbjct: 287 ---DPGYGD-------WIPPQTLATFPGSWWNL--------------------------- 309
Query: 473 TDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDK 532
E++ +R+ LFNI+ DP E+ +LA + D+++ L +
Sbjct: 310 --ERMASVRQAV-------------------WLFNISADPYEREDLAGQRPDVVRTLLAR 348
Query: 533 LAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LA Y T +P + RA P W PW
Sbjct: 349 LAEYNRTAIPVRYPAENPRAHPDFNGGAWGPW 380
>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1605
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 165/567 (29%), Positives = 254/567 (44%), Gaps = 74/567 (13%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCT 78
L TT KPH++ +L DD+G NDV + + TP ID LA G+ L ++Y +CT
Sbjct: 82 GLLATTDSAKPHVLFVLVDDMGSNDVGYQSHDMVGVTPFIDGLAEQGVRLKEYYSMHMCT 141
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
P+R+ALM+G YP+ IGMQ I PWG+P + + LK GY TH +GKW LG +
Sbjct: 142 PARAALMSGHYPMRIGMQLENIKPDSPWGMPRELTTMAETLKNLGYNTHGVGKWGLGHSQ 201
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP---------------YQGLDMRHNMQ 183
+ P RGFD+ YGY DYY H A FE + +++Q
Sbjct: 202 HGFLPVNRGFDTWYGYLSDEIDYYSHEYPAPFETVEDGATVMASFTDYVFMERSRPYDLQ 261
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQA-PDEEVAK 241
G +S++LYT+ ++ N S+ P+F+Y A H P A D E A+
Sbjct: 262 YMPDLNGTHSSELYTQRVQQIVKSANASREPLFVYYASQMTHGPLDVPPRSAFSDAEWAR 321
Query: 242 F-LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
DI RR +A ++ LD+SVG V+ +++ + +++ +F +DNG G + G
Sbjct: 322 IGADIKGTVRRKFASLLLALDQSVGRVVEEMKRSDVWRDTLFVFASDNG----GCWAQGG 377
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT--QKVSSELFHISDWLPTLC--------- 349
+N+PLRG K W+GG++ A ++S + + + LFH++DW PTL
Sbjct: 378 TNYPLRGGKHFLWEGGVKVPAFVYSELIPSSYRNRAHRGLFHVTDWYPTLVKLVASSNSV 437
Query: 350 ----------AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
+E D LDG QW+ L A R E+ N+ Q Y
Sbjct: 438 LASTNDDDDETTTALEKVD-DLDGFFQWEALMGLADPPRVEVPINV------QSYAFTCT 490
Query: 400 VDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT---KLQIKQKH 456
+ + D+ G D P +L + T AL+ KL + +
Sbjct: 491 GPNGTLLQDADSLGAVS----------DCADPWHILVRQGNATRAALRYGDWKLLVNE-- 538
Query: 457 AADP----KANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDP 512
A+P A +DAL L D + DN G + LF++ +DP
Sbjct: 539 FAEPWYGVPAEGADALGLDLADGSA----DLGDDEGPMDNCGTAVGTEAHTYLFDLKNDP 594
Query: 513 CEQNNLAESQTDLLKQLEDKLAIYKST 539
E +L+ + ++ L+ +L +T
Sbjct: 595 SETMDLSGEKPEVRDALKARLTELMAT 621
>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
Length = 467
Score = 218 bits (556), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 142/386 (36%), Positives = 213/386 (55%), Gaps = 28/386 (7%)
Query: 1 MTWARKYF-FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
MT+ +K+ +L L++ TT KP+II+IL+DD G+ D +G+ +IPTPNI+
Sbjct: 1 MTYLKKWLSISLLVVLIYTTQAQTTT---KPNIIVILSDDAGYADFECYGNKEIPTPNIN 57
Query: 60 ALAYNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQH---GVILEG---EPWGLPLTE 112
LA G + ++ YV A +C PSR+ L+TG+Y G ++ G EG E GL L+E
Sbjct: 58 RLAKEGTLFSKAYVSASVCAPSRAGLLTGRYQQRFGFENNPTGKPREGFKKEDMGLALSE 117
Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
K + +KE GY T A+GKWHLG + + P RGFD YG+ +G +D++ K E
Sbjct: 118 KTIGDRMKEEGYRTLAVGKWHLGNDAKFF-PLKRGFDEFYGFQEGHRDFFSFKKKRA-EK 175
Query: 173 YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEP 231
Y D N ++ + Y TD++T+ A+ I E+ +K +P F+YLA+ AVH P
Sbjct: 176 YALWD---NDKIIPEEEITYLTDMFTDKALKFIDENADKKQPFFIYLAYNAVHT-----P 227
Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
QA ++ KF + R+TY M+S +D ++G V+ L+ G+ +N++V+F+ DNG
Sbjct: 228 LQAKKNDLDKFAQVGSEGRQTYDAMLSNMDYNIGRVMQELKNKGIDDNTLVIFLNDNG-- 285
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
G +N N LRGMK + W+GG+R G W+ + V + D LPT A
Sbjct: 286 --GATTNYSDNGQLRGMKGSVWEGGVRVGYIMRWNGKI-PANVVYDKAVSSLDILPTSLA 342
Query: 351 AAGIEINDTSLDGVNQWDVLTKGAKT 376
AAG D LDGVN + K + T
Sbjct: 343 AAGNTRKDKHLDGVNLLPYIQKNSGT 368
>gi|109897214|ref|YP_660469.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699495|gb|ABG39415.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 500
Score = 218 bits (554), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 133/370 (35%), Positives = 197/370 (53%), Gaps = 27/370 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
+KP+I+ +LADDLG+NDV F+GS+ I TPN+D LA NG+ + YV C PSR+A+MT
Sbjct: 38 EKPNILFVLADDLGYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMT 97
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+YP IG Q + + G+ E + Q +K AGY T A+GKWHLG E Y P
Sbjct: 98 GRYPHKIGAQFNLPEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLGEASE-YHPNKH 156
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPY--QGLD--------MRHNMQVDNKTIGIYSTDL 196
GFD YG+ G +Y+ +A + QG+ + HN + +T Y TD
Sbjct: 157 GFDEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNINMYLTPLEHNGKEVRETE--YITDG 214
Query: 197 YTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
+ A+N + + K KP FLYLA+ A H P QA +E++A F I D +RRTYAG
Sbjct: 215 LSREAVNFVDKAAAKKKPFFLYLAYNAPHV-----PLQAKEEDMAMFSQIKDKKRRTYAG 269
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV +D VG ++ L+K+G +N++++F +DNG G +N+PL+ K + +G
Sbjct: 270 MVYAVDRGVGRIVEQLKKNGQFDNTVIVFTSDNG----GKLGQGANNYPLKEGKGSVQEG 325
Query: 316 GMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKG 373
G R + W +K + S + + D PT G + D LDG + W +
Sbjct: 326 GFRTPMLVHWPKHMKAGSRFSHPVLAL-DLYPTFAGLGGAVLPEDKKLDGKDIWADIQAN 384
Query: 374 AKTKRSEILH 383
+ E ++
Sbjct: 385 TAPHKDEFIY 394
>gi|193784090|dbj|BAG53634.1| unnamed protein product [Homo sapiens]
Length = 426
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 145/452 (32%), Positives = 218/452 (48%), Gaps = 89/452 (19%)
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLGF+R+ PT RGFD+ G + G DYY + +C G D+ V
Sbjct: 2 VGKWHLGFYRKECLPTRRGFDTFLGSFTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWG 59
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
G YST Y + A +++A H+ +P+FLY+A AVH P Q+P E + ++ + +
Sbjct: 60 LSGQYSTMPYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGN 114
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
RR YA MV+ +DE+V N+ AL+++G NS+++F +DNG +F + GSN PLRG
Sbjct: 115 VARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRG 170
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQ 366
K T W+GG+RG+ + SP LK+ Q+ S L HI+DW PTL AG + LDG +
Sbjct: 171 RKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDV 230
Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNN 412
W +++G + R+EILHNID + N ++ AA+RV + K + G
Sbjct: 231 WPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG---- 286
Query: 413 GQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTIL 472
D YGD + P + L + G +N
Sbjct: 287 ---DPGYGD-------WIPPQTLATFPGSWWNL--------------------------- 309
Query: 473 TDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDK 532
E++ +R+ LFNI+ DP E+ +LA + D+++ L +
Sbjct: 310 --ERMASVRQAV-------------------WLFNISADPYEREDLAGQRPDVVRTLLAR 348
Query: 533 LAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LA Y T +P + RA P W PW
Sbjct: 349 LAEYNRTAIPVRYPAENPRAHPDFNGGAWGPW 380
>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
Length = 511
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/356 (36%), Positives = 189/356 (53%), Gaps = 60/356 (16%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PHI+ ++ DD GWND F G+ ++ TP ID L G+ LNQHYVQ +C+P+R+ALMTG+
Sbjct: 13 RPHILFVVVDDHGWNDCGFAGT-RVKTPTIDTLRSEGIALNQHYVQKVCSPTRAALMTGR 71
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP G+Q G L E LLPQY+K AGY T A+GKWHLGF +TPTFRGF
Sbjct: 72 YPHRYGLQF-PFCGGAAMALNSNETLLPQYMKSAGYTTRAVGKWHLGFTEWQFTPTFRGF 130
Query: 149 DSHYGYWQGLQDYYDH-----------------------SC-----KATFEPYQGLDMRH 180
DS YG++ +DY+ H SC KA FE G D +H
Sbjct: 131 DSFYGFYSCAEDYFFHGLGFKNSSGAHVKSLDFHDDARPSCGADCSKAAFEAV-GTDWQH 189
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
YST L+ ++++ H+ S+P+FLY A + +T+ P + P
Sbjct: 190 -----------YSTTLFAGRIVDIVDGHDPSQPLFLYFA-----SQDTHNPGEVPQCYKD 233
Query: 241 KF-LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP-SFGIHSN 298
++ + D RR A ++ +D ++ N+ A +++GM +N ++++ ADNG P + N
Sbjct: 234 RYNTTVPDQVRRKLAAKLTTVDSALANITEAYKRNGMYDNLLIVYTADNGGPITVQSGPN 293
Query: 299 KG--------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL---FHISD 343
KG SN+PLRG K ++GG+R A + P LK ++ + HI D
Sbjct: 294 KGKQEDSIGSSNYPLRGGKHNAYEGGVRSTAFLSGPALKTLARLVRKYQYKLHIGD 349
>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
Length = 507
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 134/392 (34%), Positives = 203/392 (51%), Gaps = 39/392 (9%)
Query: 8 FFALTCTLLFNDAFL-NTTAPKKPHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAY 63
FA+T LL + KPHI+++L DD GW +V +H + ++ TPNID L
Sbjct: 1 MFAVTLALLSMIGLVAGQPVTLKPHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDDLVK 60
Query: 64 NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEP-------WGLPLTEKLLP 116
GL LNQHY C+PSRS+L++G+ PIH+ Q+ P +P +
Sbjct: 61 QGLELNQHYAYRCCSPSRSSLISGRLPIHVSDQNIAPTNYNPNDPISGFSAIPRNMTGIA 120
Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH---SCKATFEPY 173
+ +KEAGYATH +GKW G +TP RGFD+ +GY+ DYY SC T
Sbjct: 121 EKMKEAGYATHQVGKWDAGMATPDHTPKGRGFDTSFGYFHHYNDYYTEVVDSCNGTGV-- 178
Query: 174 QGLDMRHNMQVDNKTIGI----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
+D+ + Q + G Y L+ E ++++++H+ S P+FLY A VH
Sbjct: 179 --VDLWNTDQPAHGINGTGPDKYEEALFRERLLDIVSKHDPSTPLFLYYAPHIVHT---- 232
Query: 230 EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
P Q PDE + KF I D +R Y MV+ LD+ VG++++AL+K GM +N + + +DNG
Sbjct: 233 -PLQVPDEYLNKFSFIDDKDRMYYHAMVNYLDDVVGDLVSALKKKGMWDNLLYVTSSDNG 291
Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPT 347
P + +N+PL+G K + W GG+R A + L K + + H++DW T
Sbjct: 292 GPVYPF--GGANNYPLKGGKLSDWQGGIRVNAFVSGGHLPEKMRGQKTDGYIHLADWYGT 349
Query: 348 LCAAAGIEINDTS--------LDGVNQWDVLT 371
CA AG++ D +D N W +++
Sbjct: 350 FCAIAGVDPTDEKAAKAKLPPVDSYNMWPLIS 381
Score = 45.1 bits (105), Expect = 0.097, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 35/66 (53%), Gaps = 3/66 (4%)
Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN---KPFDKRADPARWNNI 560
CL+NI DDP E+ NLA S D LK++++K Y +T P P +A ++
Sbjct: 442 CLYNIKDDPEERKNLASSMPDKLKEMQEKSKKYLATYFNPDRGHASPLACQAAVNKYGGF 501
Query: 561 WVPWYD 566
W P+ D
Sbjct: 502 WGPFVD 507
>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 539
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 137/409 (33%), Positives = 204/409 (49%), Gaps = 56/409 (13%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
F L+ + + + A KP+II+++ADDLGW DV FHG+ I TP++D +A G L
Sbjct: 7 FCLSVLVFLTSS--SVFAGDKPNIIVMVADDLGWADVGFHGNQIIETPSLDRIAAEGTQL 64
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYA 125
N+ Y +C+P+R+ALMTG+ PI +G+ + I+ PW G+ E LP+ AGY
Sbjct: 65 NRFYTTPICSPTRAALMTGRDPIRLGVAYSTIM---PWHNNGIHPEETFLPELFAGAGYQ 121
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD----MRHN 181
T +GKWHLG ++ Y P RGF+ YG+ H+ F P+ L R+
Sbjct: 122 TAMVGKWHLGHAQQTYHPNARGFEHFYGHL--------HTEVGFFPPFASLGGKDFQRNG 173
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
+ +D++ Y + L + I E + +KP F+Y+ +A H P AP++ AK
Sbjct: 174 VSIDDQG---YESYLLADEVSRYIRERDAAKPFFIYMPFIAPHT-----PLDAPEDLKAK 225
Query: 242 FLDISDPE-----------------------RRTYAGMVSRLDESVGNVIAALRKHGMLE 278
+ D+ D R YA +V +D+S+G V+ L G+ +
Sbjct: 226 YADMEDDRGKSRSKMADNTRFMAKVTRRESARPMYAAVVDGMDQSIGRVLDTLDAEGLTD 285
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSE 337
N+IVLF +DNG + I N PLRG K ++GG+R VA + W + KV S
Sbjct: 286 NTIVLFFSDNGGAVYAI--GGADNAPLRGGKGDTFEGGIRVVATMRWPEKIAPGGKVDS- 342
Query: 338 LFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
+ + D PTL AAAGIE LDG N + G R ++L I
Sbjct: 343 IMSVMDVFPTLLAAAGIEPATRHRLDGRNLLPAIVDGEDVAREDLLFFI 391
>gi|403182689|gb|EJY57565.1| AAEL017303-PA [Aedes aegypti]
Length = 176
Score = 215 bits (548), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 94/140 (67%), Positives = 117/140 (83%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P+I++I+ DDLGWNDVSFHGSSQIPTPNIDALAY G+ILN+HY LCTPSR++LM+G
Sbjct: 28 ERPNIVVIVTDDLGWNDVSFHGSSQIPTPNIDALAYQGIILNRHYTPPLCTPSRASLMSG 87
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
K+PI++GMQH VI EPWGL L +KL+P+Y +EAGY T +GKWHLGFFR+ YTPT RG
Sbjct: 88 KHPINVGMQHHVIESNEPWGLGLDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTRRG 147
Query: 148 FDSHYGYWQGLQDYYDHSCK 167
FDSH+GY DY+DHS +
Sbjct: 148 FDSHFGYIGPYIDYWDHSLQ 167
>gi|323455793|gb|EGB11661.1| hypothetical protein AURANDRAFT_70842 [Aureococcus anophagefferens]
Length = 1489
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 175/550 (31%), Positives = 252/550 (45%), Gaps = 129/550 (23%)
Query: 26 APKKPHIIIILADDLGWNDVSF--HGSSQIP--TPNIDALAYNGLILNQHYVQALCTPSR 81
A + PHI + L DD G D+ + G Q+ TP +D LA +G+ L+++Y Q LCTPSR
Sbjct: 103 ASEYPHIFLSLIDDQGKADMGYGNDGLDQLAEATPFMDLLADDGIKLDKYYSQQLCTPSR 162
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
SAL++G YPIH GMQH VI +G+P KL+P +LK
Sbjct: 163 SALLSGYYPIHTGMQHDVIQPESMFGMPAGHKLMPSFLK--------------------- 201
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
D Y+ H+C + Y DM+ N + +IG+YST L+ +A
Sbjct: 202 -------DDQLWYYN---HKSPHACDGSNCYY---DMQRNGAIAEDSIGVYSTFLFVDAF 248
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP------DEEVAKFLDISDPERRTYAG 255
+V+ + S P+F+YL+ VHA P P E++ I DP RRT+A
Sbjct: 249 TDVLENEDASTPLFMYLSWQNVHA-----PLDPPPASFYTKTELSLVNSIEDPHRRTFAS 303
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
+ LD ++ V+ A++K G +NSI++ +DNG G + G N+P RG K ++G
Sbjct: 304 ITIVLDNAMKKVVGAMKKSGFYDNSILVVASDNG----GCSGSGGYNYPYRGGKQYLYEG 359
Query: 316 GMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEINDT----SLDGVNQW-D 368
G+ A I SP + ++ + +S LFH+SDWLPTL A ++ T LDGV+QW
Sbjct: 360 GIHVNAFIHSPLIPKSARGTSYQGLFHVSDWLPTLVTGA-MKTEPTMLPADLDGVDQWHA 418
Query: 369 VLTKG----AKTKRSEILHNID--------NVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
+L G A R+EILHNID NV + A+RV D+K V G D +
Sbjct: 419 ILGSGRGAPAAYPRTEILHNIDLWSLEIYGNVTKLRTPIQAIRVGDMKLVMGQDASTH-- 476
Query: 417 EWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEK 476
YSP E + GI + P S++ D+K
Sbjct: 477 ------------YSPTETTCPE-GIGYCM--------------PSYESTEDCSYEYGDQK 509
Query: 477 ILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
L+NIT DP E +L+ L+ LE KLA Y
Sbjct: 510 Y---------------------------LYNITADPSELVDLSTVHPQLVSTLEAKLAGY 542
Query: 537 KSTMVPPGNK 546
++TMV P +
Sbjct: 543 EATMVSPAYR 552
>gi|119626708|gb|EAX06303.1| hCG20717, isoform CRA_a [Homo sapiens]
Length = 427
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 147/451 (32%), Positives = 220/451 (48%), Gaps = 69/451 (15%)
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQV 184
+GKWHLGF+R+ PT RGFD+ +G G DYY H K G D+ N
Sbjct: 2 VGKWHLGFYRKECMPTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDY 60
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
DN GIYST +YT+ ++A HN +KP+FLY+A+ AVH+ P QAP +
Sbjct: 61 DN---GIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRS 112
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNH 303
I + RR YA M+S LDE++ NV AL+ +G NSI+++ +DNG P+ G GSN
Sbjct: 113 IININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNW 167
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLD 362
PLRG K T W+GG+R V + SP LK V EL HI+DW PTL + A +I+ D LD
Sbjct: 168 PLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLD 227
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT 422
G + W+ +++G ++ RVD L + ++ W
Sbjct: 228 GYDIWETISEGLRSP---------------------RVDILHNIDPIYTKAKNGSWA--- 263
Query: 423 DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELRE 482
+ GI A+++ ++++ SD + +
Sbjct: 264 --------------AGYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQS 299
Query: 483 FARVRCN-YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
F+ + N + N+ ++ LFNIT DP E+ +L+ ++K+L +L+ + T V
Sbjct: 300 FSNLGPNRWHNERITLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAV 359
Query: 542 PPGNKPFDKRADPARWNNIWVPWYDELDKQK 572
P P D R++P +W PWY E K+K
Sbjct: 360 PVRYPPKDPRSNPRLNGGVWGPWYKEETKKK 390
>gi|254515652|ref|ZP_05127712.1| arylsulfatase B [gamma proteobacterium NOR5-3]
gi|219675374|gb|EED31740.1| arylsulfatase B [gamma proteobacterium NOR5-3]
Length = 507
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 146/430 (33%), Positives = 226/430 (52%), Gaps = 61/430 (14%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+++IILADD+GWNDV +HGS I TP+ID LA GL L++ Y Q C+P+R+AL++G+
Sbjct: 40 RPNVVIILADDMGWNDVGYHGS-DIHTPHIDQLAAEGLELDRFYAQTACSPTRAALLSGQ 98
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+G+ + + + P GL L +K++P Y ++AGY T +GKWHLGF+ Y P RGF
Sbjct: 99 SSQSLGI-YSPLSKLNPTGLALDQKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLARGF 157
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D YG G Y++H GLD + N + + YST L + +I +
Sbjct: 158 DHFYGNLTGGVGYWNHVHGG------GLDWQRNGKTLRQE--GYSTHLQSAEITRLIQQR 209
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
+ KP+FLY A A H P +AP + +A++ I +P RR +A MV+ LD ++G ++
Sbjct: 210 DPEKPLFLYAAFNAPHL-----PNEAPADTLARYAHIENPNRRIHAAMVTELDSAIGQLM 264
Query: 269 AALRKHGMLENSIVLFMADNGA------PS------------FG----------IHSNK- 299
L GMLEN+++ FM+DNG PS FG I +N
Sbjct: 265 ETLSTEGMLENTLIWFMSDNGGLNRTAMPSGLVSMSQRLEDWFGKPLFPKTLEFIRTNAL 324
Query: 300 --GS-NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
GS N P R K + ++GG R + ++ +++ +++ + D LPTL +A I+
Sbjct: 325 DGGSDNSPHRKGKQSIYEGGARVPSFVYWKGRLSPERI-TQMVTVKDVLPTLLSATDIDA 383
Query: 357 ND------TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY-----YAALRVDDLKY 405
N T GV+QW LT+G + + L I+ +D Y AL+ DL+
Sbjct: 384 NGLPTATTTESAGVDQWPGLTRGEFIQAPDYL--INGMDGEALYRFPWKLLALKSGDLEL 441
Query: 406 VAGTDNNGQS 415
++ G+S
Sbjct: 442 YNVDEDPGES 451
>gi|241844556|ref|XP_002415496.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509708|gb|EEC19161.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 328
Score = 215 bits (547), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 136/357 (38%), Positives = 185/357 (51%), Gaps = 36/357 (10%)
Query: 214 MFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRK 273
MFLY +H AVH G+ +AP VA F I R +AG V LD+SVG V+ AL K
Sbjct: 1 MFLYFSHQAVHCGDYKVKLEAPASAVAHFPYIKSQNRSIHAGAVYELDKSVGLVVEALNK 60
Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK 333
GML NSIV+F DNG +GI N G N PLRG K+T W+GG+RG A +WSP L ++ +
Sbjct: 61 RGMLSNSIVVFSTDNGGLPWGIDPNSGYNWPLRGAKATLWEGGVRGAAFVWSPLLFKSGR 120
Query: 334 VSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ 392
+S++L HI+DWLPTL +AAG ++ DG + W+ L++ ++ R EIL NID P
Sbjct: 121 LSNQLMHITDWLPTLYSAAGGNLSTLGDTDGKDMWEALSEDLESPRQEILINID----PT 176
Query: 393 KYYAALRVDDLKYVAGTDNNGQSDEWY-----GDTDNEIDKYSPKEVLYSKAGITFNALK 447
+AL V K V GT GQ D+ T + +D+ +L S+ G
Sbjct: 177 GNTSALIVGRRKVVLGTFQGGQHDQRMKAPGGSRTVDGLDQM----MLSSRTGKVLKDFY 232
Query: 448 TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
Q++ +R +E + VRC+ + + P F+
Sbjct: 233 NVRQLR--------------VRPNWRNETV--------VRCDQYAPRDNFVAASPPYYFD 270
Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
+ DPCE NNLA S LK+L K+ Y MVPPGNKP D R P + +W PW
Sbjct: 271 LEHDPCELNNLAASNVTDLKELIKKIEEYGKGMVPPGNKPLDPRGLPQNNHGLWGPW 327
>gi|241999506|ref|XP_002434396.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215497726|gb|EEC07220.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 458
Score = 215 bits (547), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 163/504 (32%), Positives = 228/504 (45%), Gaps = 85/504 (16%)
Query: 94 GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK--WHLGFFREVYTPTFRGFDSH 151
GMQH +I EP GL L KLLPQ+L + GY TH IGK HL H
Sbjct: 1 GMQHSIIFNEEPRGLSLRFKLLPQWLADRGYTTHMIGKVAMHLSL-------------QH 47
Query: 152 YGYWQGLQDYYDHS----CKATFEPYQ-----------GLDMRHNMQVDNKTIGIYSTDL 196
Y Y Q + H + F P GLD R ++ + G Y T L
Sbjct: 48 YSYEQIKHLHCRHLNSKLAEVFFRPQTLVLLLQPGGDTGLDFRRDLSISPNESGTYITQL 107
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYA- 254
T A VIA H K + H + A D E + F LD ++ A
Sbjct: 108 LTREAEAVIANHPIDK-----VCH-----------YLASDRETSVFALDTVVICTKSIAI 151
Query: 255 --------GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
GMVS LDESVG+V +L GML+N++ +F +DNGA + ++N S+ P +
Sbjct: 152 SFHKCLLPGMVSALDESVGSVFESLNHRGMLDNTVFVFSSDNGADTDSANANTASSWPFK 211
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQ-TQKVSSELFHISDWLPTLCAAAG--IEINDTSLDG 363
G K TPW+GG+R A IWSP V + +FHISDWLPTL AG E +DG
Sbjct: 212 GQKYTPWEGGVRAPAVIWSPLFSGLLGSVYNNMFHISDWLPTLYQLAGGLAEGMADDIDG 271
Query: 364 VNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
++ LT G + R E+L NID V+N Y+A+ K V G + D+WY
Sbjct: 272 LSHLHSLTGGTEPPRHELLINIDPVEN----YSAIIEGHFKLVKGRASGSAFDKWYSIPG 327
Query: 424 NEI-DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELRE 482
N D P++ S + ++ + P + + D + RE
Sbjct: 328 NVTWDSDMPRKQCESSVVV---------RVLRNIGLSPVCGAGRTRYAVPVDCGQRDPRE 378
Query: 483 FARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
+ C CLF+++ DPCE N+++ +++++L KLA YK+T +
Sbjct: 379 KS------------CAPERSACLFDLSKDPCEYNDVSGQHPEVVERLLKKLAGYKATALK 426
Query: 543 PGNKPFDKRADPARWNNIWVPWYD 566
P N P D R++P NN+WV W D
Sbjct: 427 PANLPDDPRSNPGLHNNVWVAWDD 450
>gi|383110963|ref|ZP_09931781.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
gi|313694533|gb|EFS31368.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
Length = 458
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 132/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP +DAL
Sbjct: 2 KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG S G
Sbjct: 224 LYCDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--SAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +PTL G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLCWKKAEKQYKNLSSQVTGFVDLVPTLKELVG 339
>gi|414068777|ref|ZP_11404774.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410808616|gb|EKS14585.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 480
Score = 214 bits (546), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 143/394 (36%), Positives = 205/394 (52%), Gaps = 35/394 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
+KP+II+I+ADDLG+ DV F+GS I TPNID LA +G + YV C PSR+ALMT
Sbjct: 28 QKPNIILIVADDLGYADVGFNGSKDIITPNIDDLAKSGTSFSDAYVAHPFCGPSRAALMT 87
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+YP IG Q + G G+P K + + L E Y T A+GKWH+G + Y P R
Sbjct: 88 GRYPHKIGSQFNLPTRGSNVGVPTDAKFISKLLNENNYFTGALGKWHMGDAPQ-YHPNKR 146
Query: 147 GFDSHYGYWQGLQDYYD------------HSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
GFD +YG+ G +Y+ K FE L+ HN + +T Y T
Sbjct: 147 GFDEYYGFLGGGHNYFPDQYQPQYKKQQAQGLKNIFEYITPLE--HNGKEVKETQ--YIT 202
Query: 195 DLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
D + A+N + + NK P FLYLA+ A H P QA DE++A F +I + +R+TY
Sbjct: 203 DALSREAVNFVDKAVNKKNPFFLYLAYNAPHV-----PLQAKDEDMAMFPNIKNKDRKTY 257
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
AGMV +D VG ++ L+K+ +N++++FM+DNG G S +N PL+ K +
Sbjct: 258 AGMVYAVDRGVGKLVETLKKNNQYDNTLIVFMSDNG----GKLSKGANNFPLKAGKGSTQ 313
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTK 372
+GG R P K S D PT A AG ++ + LDG N W K
Sbjct: 314 EGGFRVPMLFHWPKHVPAGKRFSHPVSALDLYPTFAALAGAKVEENQHLDGTNMWPAFIK 373
Query: 373 GAKTKRSEILHNIDNVDNPQKYY--AALRVDDLK 404
+ E ++ + + +K Y AA+R++ K
Sbjct: 374 NENPHKDEPIYALRH----RKGYSDAAIRMNQWK 403
>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 495
Score = 214 bits (546), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 140/399 (35%), Positives = 212/399 (53%), Gaps = 30/399 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K P+I+ +L DD G+ DV F + I +PN D LA GL+LN+HYV C PSR++L+TG
Sbjct: 22 KLPNIVFVLVDDWGFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCAPSRASLLTG 80
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
++P H+ Q + + G L + P LK A Y+TH +GKWH GFF Y P RG
Sbjct: 81 RWPHHV-YQWNLATDATA-GTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYLPINRG 138
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FD+ G+ G +D+ T +D N D + G Y +Y + ++I
Sbjct: 139 FDTSSGFLCGSEDH------MTQNAICAIDYWKNNAPDPRN-GTYDAYIYRDDLTDIINS 191
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR-TYAGMVSRLDESVGN 266
HN +P+FLYL VH P QAP E + + S ++R T MVS D G+
Sbjct: 192 HNTDEPLFLYLPLHNVHL-----PLQAPKEWLDIYPANSTCDKRHTLQAMVSVADNVTGH 246
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
++ L+K G+ +N+I++ ADNG G H + GSN+PLRG K T ++GG+R +A +
Sbjct: 247 LVELLKKKGLWDNTIMVISADNG----GAHCS-GSNYPLRGCKETLFEGGVRSLAFVNGG 301
Query: 327 WLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDT-----SLDGVNQWDVLT-KGAKTKR 378
L ++++ S HI+DW T C AG++ +D+ S+DG++ W ++T + KT+
Sbjct: 302 LLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFSVDGLDVWPIITGENEKTQH 361
Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
EI+ N D A+ V + K + G N + D
Sbjct: 362 EEIVLGY-NFDFNHPGQGAIIVGNFKLLVGLQNAFECDS 399
>gi|291225025|ref|XP_002732499.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 307
Score = 214 bits (546), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 121/315 (38%), Positives = 177/315 (56%), Gaps = 14/315 (4%)
Query: 40 LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGV 99
+GWNDV +H + I PN+ LA +G+I NQ Y CTPSR+A+MTG YP G QH +
Sbjct: 1 MGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQ-GL 158
P G+PL KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGFDSHYG W G+
Sbjct: 60 AFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 159 QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
DY P G D R NM V K+ Y + + A +++ H P+FL
Sbjct: 120 GDY--DKMDGVLSPSAGYDFRDNMGVVPKS-DDYLALMLGDRAEHIVNTHYPGTPLFLAF 176
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
L + A + + P+E K+ +I D R + G ++ +D+ +G V+ A++ GM +
Sbjct: 177 T-LDIPAKH----LEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDAMKNRGMYD 231
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
+S+++F+ DN G+ S GSN+P RG+ T ++G R + + +K+T S+EL
Sbjct: 232 DSLIIFIGDNE----GLSSQSGSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTGYESNEL 287
Query: 339 FHISDWLPTLCAAAG 353
+ I D T+ AG
Sbjct: 288 YSIVDIHRTILDVAG 302
>gi|221119831|ref|XP_002168522.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 223
Score = 214 bits (546), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 108/225 (48%), Positives = 146/225 (64%), Gaps = 7/225 (3%)
Query: 95 MQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGY 154
MQ I WG+ L EK LPQYLK GY THAIGKWHLGFF + YTPT+RGFDS YGY
Sbjct: 1 MQSDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGY 60
Query: 155 WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINVIAEHNK 210
+ G DY+DHS + + GLD+ ++ +K I G YST +Y+ AI+ I HN
Sbjct: 61 YGGQADYWDHSLAS--NGWWGLDLHYDTPSSSKNIFNQWGNYSTAMYSMEAIDRIRNHNS 118
Query: 211 SKPMFLYLAHLAVHAGNTYE-PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
++PMFLYLA+ AVH+ N E P QAP E V KF I R+ YA M+ +D +G + +
Sbjct: 119 TQPMFLYLAYQAVHSANLREYPLQAPQEWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITS 178
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
L K ML++SI++F +DNG P+ G+++N +N+PLRG K+T ++
Sbjct: 179 ELLKQNMLDDSIIIFTSDNGGPANGLNNNWATNYPLRGTKTTVYE 223
>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
Length = 330
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/328 (39%), Positives = 174/328 (53%), Gaps = 44/328 (13%)
Query: 28 KKPHIIIILADDLG------------------WNDVSFHGSSQIPTPNIDALAYNGLILN 69
KKP+ + ILADDLG +ND+ I TPN+DALA G+IL
Sbjct: 19 KKPNFVFILADDLGGMLASKLKNFQACSFSVGYNDLGLRDPDVI-TPNMDALASKGVILT 77
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
+YVQA+C+PSR ALMTG+YP MQ VI E L K LPQYLK+ GY H I
Sbjct: 78 NNYVQAVCSPSRHALMTGRYPSASAMQSIVIQPMEAKCSGLKYKFLPQYLKDLGYKNHMI 137
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
GKWHLG+ RE PT RGFD+ YG + DY++H + D VD +
Sbjct: 138 GKWHLGYCREECLPTSRGFDTFYGLYASSGDYWEHGIMGMY------DWHTEAGVDFEAR 191
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-----KFLD 244
G ++ DL E V EH+ P+FLY A N++ P Q DE + KF D
Sbjct: 192 GTHAQDLEIERLDKVFDEHDNKDPLFLYFA-----PQNSHTPLQPTDEFINLYPEDKFSD 246
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
I RR Y G+ S LD VG ++ L ++ M+ N+ V+F++DNGA +G N P
Sbjct: 247 I----RRKYLGLTSGLDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPI-----EGRNAP 297
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
RG K + ++GG + + I+SP LK+T+
Sbjct: 298 FRGGKGSLFEGGTKSSSFIYSPLLKKTE 325
>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 494
Score = 214 bits (544), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 139/397 (35%), Positives = 211/397 (53%), Gaps = 30/397 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+II +L DD G+ DV F + I +PN D LA GL+LN+HYV C+PSR++L+TG++
Sbjct: 23 PNIIFVLVDDWGFADVGFKNPA-ISSPNFDHLAKTGLVLNRHYVYMYCSPSRASLLTGRW 81
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
P H + + G L ++P LK A YATH +GKWH GFF Y P RGFD
Sbjct: 82 PHHTHQWN--LGNNSTAGTNLAMTMIPAKLKAANYATHMVGKWHQGFFDPRYLPINRGFD 139
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
+ G+ G +D+ T +D N D + G Y +Y + ++I HN
Sbjct: 140 TSSGFLCGSEDH------MTQNAICAIDYWKNNAPDPRN-GTYDAYIYRDDLTDIINSHN 192
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR-TYAGMVSRLDESVGNVI 268
++P+FLYL VH P QAP E + + S E+R T M+S D G+++
Sbjct: 193 TNEPLFLYLPLHNVHL-----PLQAPKEWLDIYPANSTCEKRHTLQAMISVADNVTGHLV 247
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
L+K G+ +N+I++ ADNG G H + GSN+PLRG K T ++GG+R +A + L
Sbjct: 248 ELLKKKGIWDNTIMVISADNG----GAHCS-GSNYPLRGCKETLFEGGVRSLAFVNGGLL 302
Query: 329 KQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-KGAKTKRSE 380
++++ S HI+DW T C AG++ +D+ +DG++ W ++T + KT+ E
Sbjct: 303 PESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFPVDGLDVWPIITGENEKTQHEE 362
Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
I+ N D A+ V + K + G N + D
Sbjct: 363 IVLGY-NFDFNHPGQGAIIVGNFKLLVGLQNAFECDS 398
>gi|423214938|ref|ZP_17201466.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692201|gb|EIY85439.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
CL03T12C04]
Length = 458
Score = 214 bits (544), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/355 (36%), Positives = 195/355 (54%), Gaps = 33/355 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP +DAL
Sbjct: 2 KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTL 348
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +PTL
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|299147176|ref|ZP_07040243.1| arylsulfatase B [Bacteroides sp. 3_1_23]
gi|298515061|gb|EFI38943.1| arylsulfatase B [Bacteroides sp. 3_1_23]
Length = 458
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/355 (36%), Positives = 195/355 (54%), Gaps = 33/355 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP +DAL
Sbjct: 2 KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTL 348
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +PTL
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|336415441|ref|ZP_08595781.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
3_8_47FAA]
gi|335941037|gb|EGN02899.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
3_8_47FAA]
Length = 458
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/355 (36%), Positives = 195/355 (54%), Gaps = 33/355 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP +DAL
Sbjct: 2 KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTL 348
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +PTL
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334
>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 487
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 133/361 (36%), Positives = 190/361 (52%), Gaps = 32/361 (8%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A +P+I++I+ DD+G+ DV FHG IPTPN+DALA +G+ YV C+P+R+ L
Sbjct: 33 AADRPNILLIVGDDMGYADVGFHGCKDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGL 92
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+TG+Y G H G GLPLTE + LK+ GY T +GKWHLG + P
Sbjct: 93 LTGRYQQRFG--HEFNPSGANTGLPLTEVTIADRLKQVGYTTGLVGKWHLGS-QPAMHPQ 149
Query: 145 FRGFDSHYGYWQGLQDYYD-HSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGF+ G+ G ++D EP + +D Y+TDL+ A++
Sbjct: 150 ERGFEEFIGFLGGAHSFFDAQGILRGHEPVKTID--------------YTTDLFGREAVS 195
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
I +H + KP FLYL+ AVH P A ++ +AK ISD ERRTYA M+ +DE+
Sbjct: 196 FIEKH-RDKPWFLYLSFNAVHT-----PMHATEDRMAKLASISDQERRTYAAMMLAMDEA 249
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
+G V+ L G + ++V+F++DNG P+ G+ N N PLRG K T +GG+R
Sbjct: 250 IGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVTINGSINTPLRGSKRTTLEGGIRVPFV 309
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
+ P S + + D T A AG+E D DGVN L + K+SE+
Sbjct: 310 VSWPGKIAPAVFDSPVIQL-DLTATALAVAGVE-KDVKSDGVNLLPYL----QGKQSEVP 363
Query: 383 H 383
H
Sbjct: 364 H 364
>gi|293372058|ref|ZP_06618453.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
gi|292632962|gb|EFF51547.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
Length = 458
Score = 213 bits (543), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP +DAL
Sbjct: 2 KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +PTL G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339
>gi|372209242|ref|ZP_09497044.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 479
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 140/373 (37%), Positives = 199/373 (53%), Gaps = 31/373 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
K+P+I+IIL DDLG+NDV F+GS I TPN+D LA NG+I+ YV C PSR+++MT
Sbjct: 26 KRPNIVIILCDDLGYNDVGFNGSKDIKTPNLDKLADNGMIMTAGYVAHPFCGPSRTSIMT 85
Query: 87 GKYPIHIGMQHGVI--LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
GKY +G Q + EG +G+PL K + + L+EAGY T A GKWHLG + P
Sbjct: 86 GKYAHTMGAQFNIPSESEGTGYGIPLNNKFISKELQEAGYYTGAFGKWHLG-ADTPFHPN 144
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD----------MRHN-MQVDNKTIGIYS 193
RGFD YG+ G DY K +E + + HN +VD K Y
Sbjct: 145 KRGFDEFYGFLGGGHDYIPEQYKPKYEFLKQRGSKNIRDYIKPLEHNGTEVDEKE---YI 201
Query: 194 TDLYTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
TD + A+N + + + K +P F+YLA+ A H P QA E++A F I D +RRT
Sbjct: 202 TDGLSREAVNFVYKASEKKQPFFMYLAYNAPHV-----PLQAKKEDMAVFKSIKDEKRRT 256
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
YA MV +D VG ++ AL+ + LEN++++F +DNG G +N PL K +
Sbjct: 257 YAAMVYAVDRGVGKLVEALKANQQLENTLLVFFSDNG----GKLGKGANNFPLTEGKGSA 312
Query: 313 WDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
++GG R W +K QK + + + D PT + A + N +L G N L
Sbjct: 313 YEGGHRVPMFFHWPNRIKAGQKFHNPVSAL-DLYPTFASLAKTKPSNAKNLAGTNILPHL 371
Query: 371 TKGAKTKRSEILH 383
KG + E+L+
Sbjct: 372 QKGTNPHKDEMLY 384
>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 139/408 (34%), Positives = 211/408 (51%), Gaps = 44/408 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
+KPHI+++L DD GW +V +H + ++ TPNID L GL L+QHY C+PSRS L
Sbjct: 21 QKPHIVLMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQGLELDQHYAYKFCSPSRSCL 80
Query: 85 MTGKYPIHIGMQHGVILEGEP-------WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
M+G+ PIH+ + P +P + + +KEAGYATH +GKW G
Sbjct: 81 MSGRLPIHVNDLNLAPTNYNPNDPVSGFSAIPRNMTGIAEKMKEAGYATHQVGKWDAGMA 140
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSC--KATFEPYQGL-DMRHNMQVDNKT-IGIYS 193
+TP RGFD+ +GY+ DYY + + P L D H N T Y
Sbjct: 141 TPDHTPKGRGFDTSFGYYHHDNDYYTEVVGPQCSGSPIVDLWDTDHPAHGINGTGPDKYE 200
Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
L+ E ++V+++H+ + P+FLY A H P Q PD+ + KF I D +R+ Y
Sbjct: 201 EGLFKERLMDVVSKHDPNTPLFLYYAPHIAHT-----PLQVPDDYLNKFSFIDDSDRKYY 255
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS-NHPLRGMKSTP 312
MV+ LD+ VG+++AAL++ GM +N + + +DNG P I++ G+ N+PL+G K T
Sbjct: 256 HAMVNYLDDVVGDLVAALKQKGMWDNLLYVTSSDNGGP---IYAGGGANNYPLKGGKLTD 312
Query: 313 WDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIEINDTS--------LD 362
W GG+R A + +L K + + H++DW T CA AG++ D +D
Sbjct: 313 WQGGIRVNAFVSGGYLPEKMRGQKTDGYVHLADWYGTFCAIAGVDPTDEKAAKAKLPPVD 372
Query: 363 GVNQWDVLTKGAKTK-RSEILHNIDNVDNPQKYYAALRVDDLKYVAGT 409
N W ++ T R++I +I+ AL D K + GT
Sbjct: 373 SYNMWPFISGETDTSPRTDIPASIN----------ALISGDYKILIGT 410
Score = 43.5 bits (101), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 3/64 (4%)
Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN---KPFDKRADPARWNNI 560
CL+NI DDP E++NLA S + LK +++KL Y +T P P +A ++
Sbjct: 442 CLYNIKDDPEERHNLASSMPEQLKTMQEKLKKYLATYFNPDRGHVSPLACQAAVNKYGGF 501
Query: 561 WVPW 564
W P+
Sbjct: 502 WGPF 505
>gi|423294191|ref|ZP_17272318.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
CL03T12C18]
gi|392676448|gb|EIY69884.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
CL03T12C18]
Length = 458
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/360 (36%), Positives = 197/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP +DAL
Sbjct: 2 KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETVADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI+ I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITKEAIHCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +PTL G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339
>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 129/346 (37%), Positives = 188/346 (54%), Gaps = 28/346 (8%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
+P++++++ADDLG+ D+ F G+ IPTP++DALA G+ YV C+P+R+ L+TG
Sbjct: 36 RPNVVLLVADDLGYGDLGFQGARDIPTPHLDALAQGGVRCTSGYVSGPYCSPTRAGLLTG 95
Query: 88 KYPIHIGMQHG-----VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+Y G + + GLP+TE L LK AGYAT +GKWHLG +
Sbjct: 96 RYQQRFGHEFNPGGGGGAAAAKNVGLPVTETTLADRLKAAGYATGLVGKWHLGS-EAKFH 154
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P RGFD +G+ G Y+ + + Y+G ++ Y TD ++ A+
Sbjct: 155 PQKRGFDEFFGFLGGQHTYF---ASKSGDVYRGTEVVKEEA--------YLTDAFSREAL 203
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+ I H K P FL L+ AVH P A ++ VA+F I DP+RRTYA M+S LDE
Sbjct: 204 SFIDRH-KDHPFFLQLSFNAVHT-----PMDATEDRVARFSSIEDPKRRTYAAMLSALDE 257
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+ GNVI LR G+ E ++++F +DNG P+ G +N N PLRG K T +GG+R
Sbjct: 258 AAGNVIERLRAAGLEEKTLIIFFSDNGGPTMAGTTTNGSKNDPLRGSKRTTLEGGVRVPF 317
Query: 322 AI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVN 365
+ W L + + + D LPT AAAG++ + LDGVN
Sbjct: 318 VLSWKGTLPAGKVYDQPIIQL-DILPTALAAAGVDAEPEWKLDGVN 362
>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
Length = 473
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/361 (37%), Positives = 197/361 (54%), Gaps = 23/361 (6%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
N A P+I+ IL DDLG+ DV + GS QI TPNID LA G+ L Y +C+P+R+
Sbjct: 36 NNQATGAPNILFILLDDLGYGDVGYLGS-QIQTPNIDNLASQGVTLKHGYAYPICSPTRA 94
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
ALMTGK P++ G+ +E + LP +P+ +EAGY T +GKWHLG +
Sbjct: 95 ALMTGKNPLNFGIDGP--MENDAM-LPEDLTTMPERFQEAGYQTWMVGKWHLGMAKRSAM 151
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P RGFD YG+ G DYY H + GLD ++N + G + T+L T AI
Sbjct: 152 PHNRGFDDFYGFLGGFVDYYTHVY------FGGLDWQNN-DTSLREEG-FVTELLTAKAI 203
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+ I KP F+YL++ A H P Q + + +R+ +A M S +D+
Sbjct: 204 DKITHFKGDKPFFMYLSYSAPHT-----PLQYVPNAINDYSGFESKDRQVFADMTSDVDQ 258
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
+G V+AAL K G+ EN+IV+FM+DNG + +N GS L+G KS+ ++GG+R
Sbjct: 259 DIGEVLAALDKRGIAENTIVVFMSDNGG-NLEAGANNGS---LQGGKSSVYEGGVRVPVL 314
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG-AKTKRSEI 381
I P Q++ ++ +F DW PTL AA I+ DGV+Q+ L + KT + E+
Sbjct: 315 IRWPAQLQSKTMTQPMF-TQDWAPTLLEAANIDYQPNDFDGVSQFVNLKEAQEKTFKREV 373
Query: 382 L 382
+
Sbjct: 374 V 374
>gi|345510992|ref|ZP_08790548.1| arylsulfatase B [Bacteroides sp. D1]
gi|229442597|gb|EEO48388.1| arylsulfatase B [Bacteroides sp. D1]
Length = 498
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 130/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
++ FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP++DAL
Sbjct: 42 KEEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPSLDALVGE 99
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 100 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 156
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 157 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 216
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 217 DKG--------YSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 263
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 264 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 321
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG A + W KQ + +SS++ D +PTL G
Sbjct: 322 VPGS--SSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 379
>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 511
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 132/377 (35%), Positives = 204/377 (54%), Gaps = 34/377 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
+P+I+++L DDLG+ DV F+GS+ I TP +D LA NG I YV C PSRSA++TG
Sbjct: 47 RPNILVVLCDDLGYADVGFNGSTDILTPELDNLAQNGSIFTSAYVAHPFCGPSRSAILTG 106
Query: 88 KYPIHIGMQHGVIL-----EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+YP G + + + + G+P+ E + + L+ AGY T AIGKWHLG + +
Sbjct: 107 RYPHLTGTAYNLFHNSSEDDKDNMGVPVEETYMSKVLQNAGYYTSAIGKWHLGAAPK-FH 165
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----------LDMRHNMQVDNKTIGIY 192
P RGFD YG+ G DY+ + T++ + M HN + N+T Y
Sbjct: 166 PNKRGFDDFYGFLGGGHDYFPSEYQKTYKAQKKAGNPNIRDYVFPMEHNGKPANETE--Y 223
Query: 193 STDLYTEAAI-NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
TD ++ AI N+ K +P F+YLA+ A H P QA E+VAKF I D +RR
Sbjct: 224 ITDGFSREAIKNIKIAAAKKQPFFIYLAYNAPHV-----PLQAKAEDVAKFAHIKDKDRR 278
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
TYA MV +D VG ++ L++ +N++++F++DNG G ++ +N+PL+G K
Sbjct: 279 TYAAMVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNG----GNFNHGANNYPLKGTKGD 334
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEI-NDTSLDGVNQWD 368
W+GG R ++ W K+ +K F +S D PT A ++ LDG N D
Sbjct: 335 TWEGGYR--VPMFFHWPKKIKKDQRFDFPVSSLDLYPTFTGLAEAKLPKGKQLDGKNIMD 392
Query: 369 VLTKGAKTKRSEILHNI 385
+ K + + E+++++
Sbjct: 393 DVLKNTEPYKDEMIYSL 409
>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 551
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 124/379 (32%), Positives = 201/379 (53%), Gaps = 41/379 (10%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P++I+++ADDLGWNDV +HG + I TP++D LA G+ LN+ Y +C+P+R+ALMTG+
Sbjct: 35 RPNVIVMVADDLGWNDVGYHGGN-IDTPSLDKLAEQGVQLNRFYTTPICSPTRAALMTGR 93
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
P+ +G+ +GVIL + G+ E +PQ + AGY T +GKWHLG + Y P RGF
Sbjct: 94 DPMRLGIAYGVILPWDNIGVNPAEHFMPQSFQAAGYQTAMVGKWHLGHAQMTYHPNQRGF 153
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN-MQVDNKTIGIYSTDLYTEAAINVIAE 207
+ YG+ +Y F G D + N + +D++ Y T L + I +
Sbjct: 154 EHFYGHLHTEVGFY-----PPFANVGGKDFQENGVSIDDEG---YETYLLADEVSRYIRD 205
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-----------SDPERR----- 251
++ KP F+Y+ +A H P AP E K+ DI +D RR
Sbjct: 206 RDEEKPFFIYMPFIAPHT-----PLDAPQELQEKYKDIETDLAPARSNQTDSTRRMAKLM 260
Query: 252 -------TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
YA +V +D+++G V+ L + G+ +N+IVLF +DNG ++ N P
Sbjct: 261 MQPSARPMYAAVVDAMDQAIGQVLDTLDEEGLADNTIVLFFSDNGGAAYSY--GGADNAP 318
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDG 363
LRG K ++GG+R V+ + P + + + ++ + D PTL A G+ +N+ DG
Sbjct: 319 LRGGKGETFEGGIRVVSLMRWPGVLEPAQSFDQIMTVMDVFPTLAEATGVTPLNNFPFDG 378
Query: 364 VNQWDVLTKGAKTKRSEIL 382
+ W + +G +R E++
Sbjct: 379 SSLWSSIKEGEIHERDELV 397
>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
Length = 617
Score = 212 bits (540), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 139/419 (33%), Positives = 212/419 (50%), Gaps = 62/419 (14%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P+I+ I+ADDLG+ND+ ++ S + TPNI+ LA NG+IL+ HY Q +C+PSRS +TG
Sbjct: 156 EQPNIVFIMADDLGYNDIGYN-SIEAFTPNINYLAKNGIILDSHYSQPVCSPSRSQFLTG 214
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y GMQH IL +P G+PLTEK+LP+ KE GY+T GKWH G F E Y PT RG
Sbjct: 215 RYSFRYGMQHRNILPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRG 274
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEP-------YQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
FD G + G + H K P + G DMR V+ + S L E
Sbjct: 275 FDKFVGSYSGSSQHSTHE-KCFNSPQRDSPFSFCGYDMREAEVVEARQNLTGSDFLLREI 333
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-----DISDPERRTYAG 255
I + VH T +P DE++ + L + + +R
Sbjct: 334 CIRI-----------------TVHGPLTTDP---RDEKIYEELYERHRNETIYKRTKMLA 373
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV+ LD ++G +I + +K+G +N+I F +DNG G + + +N P RG K+ +G
Sbjct: 374 MVTGLDRAIGRIIESFKKYGKYKNTIFAFASDNG----GSLTYRANNMPFRGGKNDLLEG 429
Query: 316 GMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIND--TSLDGVNQWD-VL 370
G+R ++ + S + + S S LF I+DW PTL A + LDG + ++
Sbjct: 430 GVRTLSFLHSSLIPNDKIGSRYSNLFDITDWFPTLLDAGKCSQKNLPNQLDGKSHFENFF 489
Query: 371 TKGAKTKRSEILHNID-------NVDNPQKYY------------AALRVDDLKYVAGTD 410
+ G + R EI+H++D V++P+ + A+RV K + G +
Sbjct: 490 SAGNASVRDEIVHHLDPLKKADAEVEDPRPFVKVYGFKYDIRMKTAIRVGKYKLITGAE 548
>gi|423290535|ref|ZP_17269384.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
CL02T12C04]
gi|392665922|gb|EIY59445.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
CL02T12C04]
Length = 458
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 131/360 (36%), Positives = 195/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K FF L C AFL ++P+I+I+LADDLGW DV FHGS +I TP +DAL
Sbjct: 2 KKEFFGLLCGCTLLPAFLYAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +PTL G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339
>gi|160885330|ref|ZP_02066333.1| hypothetical protein BACOVA_03329 [Bacteroides ovatus ATCC 8483]
gi|156108952|gb|EDO10697.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
Length = 458
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 130/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP +DAL
Sbjct: 2 KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KF------LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ L + ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTNNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +PTL G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339
>gi|262405390|ref|ZP_06081940.1| arylsulfatase B [Bacteroides sp. 2_1_22]
gi|262356265|gb|EEZ05355.1| arylsulfatase B [Bacteroides sp. 2_1_22]
Length = 458
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 130/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
++ FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP++DAL
Sbjct: 2 KEEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPSLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG A + W KQ + +SS++ D +PTL G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339
>gi|390361685|ref|XP_789654.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like
[Strongylocentrotus purpuratus]
Length = 401
Score = 212 bits (539), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 122/285 (42%), Positives = 168/285 (58%), Gaps = 27/285 (9%)
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF------EP 172
L++AGYATHA+GKWH+G ++ PT RGFDS +G G +Y H +
Sbjct: 5 LRKAGYATHAVGKWHMGLYKTCCWPTNRGFDSFFGLLTGRGGFYTHKHYGGHPGLVDSKN 64
Query: 173 YQGLDMRHNM-QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
+ G D+R N+ QV G+YST L+T+ + N+I HN+SKP+FLY + AVH P
Sbjct: 65 WSGYDLRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSKPLFLYHSFQAVHY-----P 119
Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
+ P + F I+D RRTYAGMV +DE+VGN+ L+K G+ N+I++F +DNGA
Sbjct: 120 LEVPPRYMEDFNYIADERRRTYAGMVKCMDEAVGNLTRTLKKTGLWNNTIIIFSSDNGAN 179
Query: 292 -SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTL 348
++G GSN PLRGMK + W+GG++ V I SP L + + V++ LFH++DW PTL
Sbjct: 180 FNYG-----GSNWPLRGMKRSLWEGGIKSVGFIASPLLPKLVRGTVNNNLFHVTDWFPTL 234
Query: 349 C-AAAGIEINDTSLDGVNQWDVLTKGAKTK------RSEILHNID 386
A + T LDG N W L KT+ R EILHNID
Sbjct: 235 VRGVARGSLKGTHLDGHNLWKHLLNKKKTRGKDSWPRKEILHNID 279
>gi|291225031|ref|XP_002732500.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 286
Score = 211 bits (538), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 119/299 (39%), Positives = 170/299 (56%), Gaps = 14/299 (4%)
Query: 40 LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGV 99
+GWNDV +H + I PN+ LA +G+I NQ Y CTPSR+A+MTG YP G QH +
Sbjct: 1 MGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQ-GL 158
+ P G+PL KLLP+ LKE GYATH +GKWHLGF ++ Y PT RGFDSHYG W G+
Sbjct: 60 VFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 159 QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
DY P G D R NM V K+ G Y + + A +++ H P+FL
Sbjct: 120 GDY--DKLNGVLSPSAGYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNTHYPGTPLFLTF 176
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
L + A + + P+E + DI D R + G ++ +D+ +G V+ AL+ GM +
Sbjct: 177 T-LDIPA----KHLEIPEEYEEAYADIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMYD 231
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE 337
+S+++F+ DNGA S S GSN+P RG+ T ++G + + +K+T S+E
Sbjct: 232 DSLIIFIGDNGALS----SQSGSNYPFRGIAGTLFEGATCVPSIVSGKGIKKTGYESNE 286
>gi|295086308|emb|CBK67831.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
XB1A]
Length = 458
Score = 211 bits (538), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 129/360 (35%), Positives = 196/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
++ FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP++DAL
Sbjct: 2 KEEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPSLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RGF YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG+ A + W KQ + +SS++ D +P L G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPMLKDLVG 339
>gi|449680632|ref|XP_004209636.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 336
Score = 211 bits (538), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 138/379 (36%), Positives = 196/379 (51%), Gaps = 62/379 (16%)
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE-PFQAPDEEVAKFLDISDP 248
G YST +Y+ AI+ I HN ++PMFLYLA+ AVH+ N E P QAP E V KF I
Sbjct: 13 GNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQEWVDKFSHIKHK 72
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
R+ YA M+ +D +G + + L K ML++SI++F +DNG P+ G+++N +N+PLRG
Sbjct: 73 GRQNYAAMLGVMDHWIGQITSELLKQNMLDDSIIIFTSDNGGPANGLNNNWATNYPLRGT 132
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQW 367
K+T ++GG+RG A IWS L + +VS++L HI+DWLPTL +AAG++ +LDG+N W
Sbjct: 133 KTTVYEGGVRGAACIWSKELSKNPRVSNDLMHITDWLPTLLSAAGVDTTLLPNLDGLNLW 192
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYY-AALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
D L + R+E+L NID P Y AALRV D K V N D W
Sbjct: 193 DTLINQVPSPRTEVLINID----PLLYMNAALRVGDWKIVK---QNNFYDGW-------- 237
Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
Y P E+ +++ H + P L D+K +L
Sbjct: 238 --YPPPEI------------SNEIEEYDVHCSPP------PLLPRSCDKKDGKL------ 271
Query: 487 RCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK 546
CLFNI DPCE +L+ S + + + ++L Y +MV
Sbjct: 272 -----------------CLFNIKYDPCEYIDLSASNPHIYQFMLERLNFYNKSMVESRRN 314
Query: 547 PF-DKRADPARWNNIWVPW 564
F D ADP +N +W W
Sbjct: 315 TFRDPLADPKLYNGVWKSW 333
>gi|167520280|ref|XP_001744479.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776810|gb|EDQ90428.1| predicted protein [Monosiga brevicollis MX1]
Length = 314
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 125/325 (38%), Positives = 183/325 (56%), Gaps = 37/325 (11%)
Query: 40 LGWNDVSFHGSS--------QIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPI 91
LG+ DVS+ + IPTP+ID LA +G+ L +YV LC+P+R++L++G+Y
Sbjct: 7 LGFTDVSYKNQTYPDVAFGPMIPTPSIDELALSGVRLESYYVNQLCSPTRTSLLSGRYAY 66
Query: 92 HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
+GM VI++G P LPLT K + ++L+E G+AT A GKW G TPT RGFD+
Sbjct: 67 TLGMNAEVIVDGHPDQLPLTVKTIGEHLQEGGWATFAGGKWDAGMTSWGCTPTCRGFDTF 126
Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ---VDNKTIGIYSTDLYT---EAAINVI 205
G++ DY+ H LD+RH+ D+ G+Y T+L T + I
Sbjct: 127 SGFYNAFNDYFTHRVG------NYLDLRHDFAPDLADDNHTGVYMTELLTSRVQQFITTA 180
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE----RRTYAGMVSRLD 261
AE +S F Y+AH AVHA P Q P + + + PE RR GM+ +D
Sbjct: 181 AETEQST--FAYVAHQAVHA-----PNQVPMSYLEGYCLDTIPEDRPTRRILCGMMRAVD 233
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
ESV N+ A ++ G+ +++++F DNG G GSN+PLRG K+T ++GGMRGV
Sbjct: 234 ESVRNITATYKQLGLWNDTVLIFTTDNG----GNPETGGSNYPLRGQKATTFEGGMRGVG 289
Query: 322 AIWSPWLKQTQK--VSSELFHISDW 344
+ SP L ++Q+ +S EL H+SDW
Sbjct: 290 FVNSPLLNESQRGFISDELIHVSDW 314
>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 508
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 141/433 (32%), Positives = 211/433 (48%), Gaps = 50/433 (11%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFH---GSSQIPTPNIDALA 62
K F L CT A L P P+I+++L DD GW +V +H + ++ TPNID+L
Sbjct: 5 KAAFILLCTT----ACLAVPPPPGPNIVMMLVDDWGWANVGYHRNPPTKEVVTPNIDSLV 60
Query: 63 YNGLILNQHYVQALCTPSRSALMTGKYPIHI---GMQHGVILEGEPW----GLPLTEKLL 115
GL L+QHYV +C+PSRS+LM+G+ PIH+ ++ +P +P +
Sbjct: 61 RQGLELDQHYVFNVCSPSRSSLMSGRLPIHVNDLNIEPDYYNPDDPVSGFSAIPRNMTGI 120
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
Q +K GY TH +GKW G +TP RGFDS +GY+ D+Y + +P
Sbjct: 121 AQKMKLGGYDTHQVGKWDAGMATHTHTPKGRGFDSSFGYFHHANDFY---TEIDGKPCNK 177
Query: 176 LDMRHNMQVDNKTIGI-------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
+ D G+ Y L+ E + V+ EH+ KP+FLY A VHA
Sbjct: 178 TKIVDIWVTDKPGYGLNGTGPDNYEEGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA--- 234
Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
P Q P KF I D +R+ Y MV+ LD+ VG ++ AL+K G+ EN + + +DN
Sbjct: 235 --PLQVPQRYQDKFSFIDDHDRQIYHAMVNYLDDVVGELVDALKKKGLWENLLFVTSSDN 292
Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLP 346
G P + +N+PL+G K++ W GG+R A + +L K K H++DW
Sbjct: 293 GGPVY--PGGGANNYPLKGGKASDWQGGIRSNAFVSGGYLPEKMRGKKLDGYIHLADWYA 350
Query: 347 TLCAAAGIEINDTS--------LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAAL 398
T CA A ++ D + +D +N W +++ T VD P Y L
Sbjct: 351 TFCALANVDPTDEAAAKADLPPIDSLNMWPLISGETMTSP--------RVDIPIS-YKTL 401
Query: 399 RVDDLKYVAGTDN 411
+ K + GT+N
Sbjct: 402 ISGEYKILTGTNN 414
Score = 41.6 bits (96), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 3/64 (4%)
Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKST-MVPPGNKPFDKRADPA--RWNNI 560
CL+NI DP E+ +LA D LK+++ KLA Y+ T P P+ D A +N
Sbjct: 443 CLYNIMTDPEERVDLAAKMPDKLKEMQQKLAQYQKTHFAPDRGSPWPGACDAALNTYNGF 502
Query: 561 WVPW 564
W P+
Sbjct: 503 WGPF 506
>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
queenslandica]
Length = 2318
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 135/369 (36%), Positives = 198/369 (53%), Gaps = 33/369 (8%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPHI+ +L DD G+ D SF + I TPN L NGLILN+HYV C+PSR++ +TG+
Sbjct: 1140 KPHIVFVLVDDWGFADASFRNPA-IKTPNFQYLVDNGLILNRHYVFKYCSPSRASFLTGR 1198
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+P H+ + L G + +LP LK AGYATH +GKWH G + Y P RGF
Sbjct: 1199 FPHHVHQWNPTPLG--MVGTNINMTMLPAKLKNAGYATHMVGKWHQGLYDPAYLPINRGF 1256
Query: 149 DSHYGYWQGLQDYYDHS--CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
D+ G+ Q + +++ + C F D R N D+ IY+ DL T V
Sbjct: 1257 DTSSGFLQAEEGHFNQTIGCAVDFWKNDAPDTR-NGTCDSY---IYNKDLTT-----VFN 1307
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVG 265
EH+ SKP+FLYL VHA P +AP E + + S + RR MVS D G
Sbjct: 1308 EHDASKPLFLYLPLHNVHA-----PIEAPQEWLDLYPANSTCDTRRILQAMVSVADNVTG 1362
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
+V+ LR++ M +N++++ ADNG KGSN+PL+G K T ++GG+R +A
Sbjct: 1363 HVVNLLRENDMWDNTLMVVSADNGGADC-----KGSNYPLKGSKYTFFEGGVRVIAFASG 1417
Query: 326 PWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-KGAKTK 377
+ ++ S HI+DW T C AG++ ND+ +DG++ W ++T + +T+
Sbjct: 1418 GLIPAGRRGKSTDGFIHIADWYTTFCKMAGVDHNDSGKGKFPVDGLDVWPIITGENEETQ 1477
Query: 378 RSEILHNID 386
EI+ D
Sbjct: 1478 HEEIVLGYD 1486
>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 474
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 136/386 (35%), Positives = 211/386 (54%), Gaps = 22/386 (5%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+KY+ LT L F L K+P+I+IIL+DD GW DV F+G++ IPTPN++ALA +
Sbjct: 2 KKYYGYLTLLLAFCCTTLQAQKSKQPNILIILSDDQGWGDVGFNGATDIPTPNLNALAKD 61
Query: 65 GLILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEP----WGLPLTEKLLPQYL 119
G+I +Q Y C+PSR+ L+TG+Y G ++ E + GLPL E ++ + L
Sbjct: 62 GVIFSQGYSSHPYCSPSRAGLLTGRYQQKFGHENNPENEKQNEDTVIGLPLNELMISEVL 121
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR 179
++ Y T AIGKWHLG + + P RGF +G+ G +Y+ + E G+ M+
Sbjct: 122 QQNNYHTCAIGKWHLGNAHK-FLPNQRGFKDWFGFSGGGFNYWGKTTPKNKE--LGV-MK 177
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEE 238
+ V T+ Y TD ++ AIN I +++K+ +P F+YLA+ A HA P QA E
Sbjct: 178 NGKPVPENTL-TYLTDDFSNQAINYIDQYSKTEQPFFMYLAYNAPHA-----PIQATKEY 231
Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
I + ER YA MV+ +D +G VI L++ G+ N++++F +DNG HS+
Sbjct: 232 TNLVTHIENGERAAYAAMVAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGG-----HSH 286
Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-N 357
S+ P RG K ++GG+R I P + + E D PT+ AA ++
Sbjct: 287 GASSAPYRGHKGMLFEGGIREPFVISWPNKIKGNRTYKEPIIALDIFPTILAATNTQVPA 346
Query: 358 DTSLDGVNQWDVLTKGAKTKRSEILH 383
++DGVN +LTK +KT + ++
Sbjct: 347 SKNIDGVNLLPILTKKSKTLKERPMY 372
>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 161/580 (27%), Positives = 260/580 (44%), Gaps = 116/580 (20%)
Query: 28 KKPHIIIILADDLGWNDVSFHGS----SQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ PHI+++L DDLGW DV +H S S I TP ID L G+ L +HYV +C+P+R++
Sbjct: 51 QPPHILMVLVDDLGWADVGYHRSGPHKSDIQTPTIDKLVSQGIALERHYVHKVCSPTRAS 110
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+G+ P+H G+ V+L G+P + Q+L +AGYA+H +GKW +G +TP
Sbjct: 111 FQSGRLPVH-GIDGQVVLCAPRAGIPENMTTVAQHLNKAGYASHFVGKWDVGMATPSHTP 169
Query: 144 TFRGFDSHYGY------------WQGLQDYYDHS--CKA--TFEPYQGLDMRHNMQVDNK 187
RG+++ Y WQG Q+ H CKA F+ + D N
Sbjct: 170 HGRGYNTSLNYFGHANWMWNQDEWQGSQNNVSHRPPCKAPDCFKDFWDTDR----PAHNL 225
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
+ +Y L+ + ++I H+ S+P+FL A H P QAP E +F +I
Sbjct: 226 SGTLYEEQLFVQRITDIIEAHDPSQPLFLTYASKVAH-----YPLQAPIEYQQQFANIEP 280
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI-------HSNKG 300
P RR Y MV LD+++ + ++K GM N++++F +DNG P H ++G
Sbjct: 281 PSRRVYHAMVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHPSRG 340
Query: 301 ----------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTL 348
+N+PLRG K + ++GG+R A + + Q+ S+ + HI+DW T
Sbjct: 341 YMCYNGEAGANNYPLRGGKYSFFEGGIRVNAFVSGGLIPADQRGSNRTGIMHIADWYATF 400
Query: 349 CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
CA AG++ D N + + HN+ Y+AG
Sbjct: 401 CALAGVDPTDHRAAAANLPPIDS-----------HNM-----------------WPYLAG 432
Query: 409 TDNNGQSDEWYGDTDNEIDKYSPKE-VLYSKAGITFNALKTKLQIKQKHAADPKANSSDA 467
++SP++ VL + + F K L A++P AN S
Sbjct: 433 Q-----------------ARFSPRDTVLVDRDVLVFGDYKLLLG-----ASNPMANWSGP 470
Query: 468 LRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLK 527
T +R N + G LFN+ +DP E+++LAE + D+LK
Sbjct: 471 AYPNATTAT----SPISRYTLNCSDPGC---------LFNVVEDPEERHDLAEEKPDILK 517
Query: 528 QLEDKLAIYKSTMVPPGNKPFDK---RADPARWNNIWVPW 564
++ L ++ P P D + +++ + PW
Sbjct: 518 LMKFTLTELAKSIFPYNPGPVDPQCMKTAETKYDGFYGPW 557
>gi|294647729|ref|ZP_06725288.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294809280|ref|ZP_06767994.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|292636934|gb|EFF55393.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294443524|gb|EFG12277.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
Length = 458
Score = 210 bits (534), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 129/360 (35%), Positives = 195/360 (54%), Gaps = 33/360 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
++ FF L C AFL+ ++P+I+I+LADDLGW DV FHGS +I TP++DAL
Sbjct: 2 KEEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPSLDALVGE 59
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
G+ L + Y + TP+R+ LMTG+YP G++ VI PW GL E+ + L
Sbjct: 60 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116
Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
GY A IGKWHLG ++V+ P RG YG+ G DY+D + + + + + H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGLSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ YST+L T+ AI I + K P LY+A+ A H P QA ++++
Sbjct: 177 DKG--------YSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223
Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ D D ++ TY+ MVS +D +G ++ AL+K G+++N+ +F +DNG + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ + S+ PLRG K WDGG A + W KQ + +SS++ D +PTL G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339
>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 474
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 137/381 (35%), Positives = 204/381 (53%), Gaps = 43/381 (11%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPS 80
++T A K+P+I+ ILADD G+ D F GSS+ TP++D LA + +Q YV A+C PS
Sbjct: 28 VSTLAQKQPNIVFILADDAGYADFGFQGSSEFKTPHLDQLASQSIRFSQAYVSAAVCGPS 87
Query: 81 RSALMTGKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
R+ ++TGKY G + + G+ GLPL +KLLP+YLKE GY T GKW
Sbjct: 88 RAGILTGKYQQRFGYEENNVPGYMSASATTGDEMGLPLDQKLLPEYLKEQGYKTALFGKW 147
Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG-- 190
H+G + + PT RGFD+ YG+ G + YY+ + + ++N Q D G
Sbjct: 148 HMG-NADKFHPTKRGFDTFYGFRGGARSYYEFNE----------NNKNNRQEDRLERGFG 196
Query: 191 ------IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
+Y TD EA + I E N+ +P F+YL+ AVH P +A +++ +F +
Sbjct: 197 NFEESKLYLTDALAEATTDFI-EKNQKQPFFVYLSFNAVHT-----PMEARPDDLKQFPN 250
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
+ +R+T A M LD + GNV+A L + G+ EN++++F DNG PS +N SN P
Sbjct: 251 LKG-KRKTLAAMTLGLDRACGNVLAKLDELGVRENTLIVFTNDNGGPS---GTNASSNKP 306
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCA-AAGIEINDTSL 361
L G K+ +GG+R W KQ ++ + + IS D LPT A G E N +
Sbjct: 307 LGGTKANHLEGGIR--VPFLMSWPKQLKENTVYDYPISTLDLLPTFYQIAGGNEKNINDI 364
Query: 362 DGVNQWDVLTKGAKTKRSEIL 382
DGVN ++ + + E L
Sbjct: 365 DGVNLLPFISGKNEARPHETL 385
>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
Length = 492
Score = 209 bits (533), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 141/406 (34%), Positives = 203/406 (50%), Gaps = 68/406 (16%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
F KKPH+I +L DDLG++D+ + I +PNIDALA + L L +HYVQ CTP
Sbjct: 14 TFSQKVLEKKPHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTP 72
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
SR+A +TG+Y I +GMQ GVI EP G+PL E LL + K+ GY T GKWHLGF+
Sbjct: 73 SRAAFLTGRYNIRMGMQSGVIRPSEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTY 132
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH------SCKATFE--------------PYQ---GL 176
+ P RGFD YG++ G QD+Y H S K ++E P+ G
Sbjct: 133 KHCPQNRGFDRFYGFYLGSQDFYFHDSGRLSSHKFSYEDEVSGVNNKRWLQGPHGRPWGY 192
Query: 177 DMRH----NMQVDNKTI-------GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA 225
D R N V+N TI G YST L+ IN +A+H+ + +F Y++ VH
Sbjct: 193 DFREAYPANGDVENDTILDDFHTNGTYSTKLFVNDFINDLAKHDPAVSLFNYVSFQDVHG 252
Query: 226 GNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
P Q ++ + + +D + +++ S+++F
Sbjct: 253 -----PLQTVNKFKKHYKNKTDTWTHERILISTKI------------------TSLLVFT 289
Query: 286 ADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISD 343
+DNG G SN PLRG K T +DGG++ A + SP L+ K + LFH+SD
Sbjct: 290 SDNG----GRPKEGASNWPLRGSKGTIYDGGLKSRAFVASPLLQNRMKGQNYNYLFHVSD 345
Query: 344 WLPTLCAAAGIEI---NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
W PTL +G ++ LDG ++ K R+++LH +D
Sbjct: 346 WFPTLLTLSGCDVPRKEGRPLDGKIH-ELFVYKPKKARNKMLHYLD 390
>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 441
Score = 209 bits (533), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 141/395 (35%), Positives = 214/395 (54%), Gaps = 56/395 (14%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F+L CT L A + P+IIIILADD G +D S +GS Q+ TP+ID++A+NG+
Sbjct: 7 LFSLLCTSLL--------ANEPPNIIIILADDAGSSDFSCYGSKQLLTPHIDSIAHNGIK 58
Query: 68 LNQHYV-QALCTPSRSALMTGKYPIHIG----MQHGVILEGEP--WGLPLTEKLLPQYLK 120
Q Y ++C+PSR+ L+TG+Y G + H +P GLP+TE L LK
Sbjct: 59 FTQAYTASSVCSPSRAGLLTGRYQQTFGHLANIPHSKHSANDPELLGLPVTEITLADSLK 118
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
E GY+TH IGKWHLG + + P RGFD+ YG+ G + Y + G ++R
Sbjct: 119 ELGYSTHCIGKWHLG-EADHFHPNARGFDNFYGFLSGARTY-----------FLGGELRG 166
Query: 181 NMQ--VDNKTIGI----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA 234
+M + NK Y+T+++T+ AI +I E + KP F+YL+H AVH P A
Sbjct: 167 DMDRIMRNKEFAEPSSGYTTEVFTQEAIRIIQEE-QDKPFFIYLSHNAVHG-----PMDA 220
Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
DE++ + D +P R+ Y+G++ LD+ G ++ AL+ EN+++ FM+DNG P+
Sbjct: 221 KDEDIMSY-DFKNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGPT-- 277
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQ-TQKVSSEL-----FHISDWLPTL 348
N SN PLRG K + ++GG R +P+L Q +K+S+ L D T
Sbjct: 278 -THNGSSNWPLRGFKGSEFEGGNR------TPFLLQWPEKISAGLSSDKPIIAYDVFATC 330
Query: 349 CAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
AAG E + D + G++ V+ K +T ++
Sbjct: 331 IQAAGGELVTDRTYHGIDLLPVINKPQETNARKLF 365
>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 441
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 142/380 (37%), Positives = 197/380 (51%), Gaps = 27/380 (7%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+FFA+ L + A A K P+I++I+ADDLG+ DV FHG IPTP+ID+LA +G
Sbjct: 10 FFFAILWMGLASAA----DAAKLPNILVIVADDLGYADVGFHGGKDIPTPHIDSLAASGT 65
Query: 67 ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ YV C+P+R+ L+TG+Y G H G GLPLTE + L+ AGYA
Sbjct: 66 RFSSGYVSGPYCSPTRAGLLTGRYQQRFG--HEFNPGGANKGLPLTETTIADRLQAAGYA 123
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
T +GKWHLG + P RGF +G+ G Y+D Q D++
Sbjct: 124 TGLVGKWHLGT-DPKFHPLKRGFGEFFGFLAGHHTYFDK---------QEADIQRG--TT 171
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
T Y TD + A++ I H ++ P FLYLA AVH P +A D+ + F I
Sbjct: 172 KVTEPGYLTDAFGREAVSFIERH-QNHPFFLYLAFNAVHT-----PLEADDDRLNAFSSI 225
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-GIHSNKGSNHP 304
RRTYA M+S +D+++G V+A+LR+ EN+++ F++DNG P+ G N N P
Sbjct: 226 DATPRRTYAAMLSAMDDAIGKVLASLRETAQDENTLIFFISDNGGPTMPGTTINSSINAP 285
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDG 363
LRG K T +GG+R + P V + D PT AAAG+ I D LDG
Sbjct: 286 LRGSKRTTLEGGIRVPFVVRWPGHVPAGAVYAHPVIQLDIQPTALAAAGVTIQPDWKLDG 345
Query: 364 VNQWDVLTKGAKTKRSEILH 383
VN +T K E L+
Sbjct: 346 VNLLPFVTGQQKNIPHETLY 365
>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
Length = 470
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 131/370 (35%), Positives = 199/370 (53%), Gaps = 28/370 (7%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+ + ++C + A KKP++I+I++DD G+ D +G +IPTPNIDALA N
Sbjct: 7 KAFLLCISCVTVVAQA-----QQKKPNVIVIVSDDAGYADFGCYGGKEIPTPNIDALAKN 61
Query: 65 GLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVI-LEGEPW-----GLPLTEKLLPQ 117
G + YV A +C PSR+ ++TG Y G +H + L +P+ G+ K +
Sbjct: 62 GTLFTDAYVSASVCAPSRAGILTGMYQQRFGFEHNISELPVKPYTLNDVGMDPKIKTIGD 121
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
+K GY T AIGKWH G + Y P RGFD YG+ G + ++ + P L
Sbjct: 122 QMKHNGYRTIAIGKWHQGDLPQ-YFPLKRGFDEFYGFVGGHRSFFGYPGGKA--PSHELA 178
Query: 178 MRHNMQ-VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
+ N + V TIG Y TD++T+ AI+ + E NKSKP F+YLA+ AVH P A
Sbjct: 179 LFDNDKIVPENTIG-YLTDMFTDKAISFVKE-NKSKPFFMYLAYNAVHV-----PMNAKK 231
Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
E + +F +I+DP R+ YA M+ LD+ VG ++ L++ + +N++++F+ DNGA +
Sbjct: 232 ELMDRFPNITDPGRKAYAAMMVSLDDGVGELVKTLKEQNLYDNTLIVFVNDNGAAT---- 287
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA-GIE 355
N N LRG+K + W+GG+R + P KV S++ D +PT A G
Sbjct: 288 GNYADNGKLRGLKGSKWEGGIRVAYIMQYPDQIPAGKVYSKMVSGLDIMPTAVGAGKGEL 347
Query: 356 INDTSLDGVN 365
I+ DG N
Sbjct: 348 ISGQVTDGKN 357
>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 209 bits (532), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 138/396 (34%), Positives = 209/396 (52%), Gaps = 32/396 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K P+I+ +L DD G+ DV F + I +PN D LA GL+LN+HYV C+PSR++ +TG
Sbjct: 21 KVPNIVFVLVDDWGFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCSPSRASFLTG 79
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
++P H Q +++ G L +LP LK A YATH +GKWHLGFF Y P RG
Sbjct: 80 RWPHH-AHQWNPLMDNM-IGTNLNMTMLPAKLKAANYATHMVGKWHLGFFDPRYLPINRG 137
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FD+ G+ G +D+ + + +D N D + G Y Y + +V+
Sbjct: 138 FDTSTGFLGGGEDHMNEKSGCS------IDYWKNNAPDPRN-GTYDAYNYRDDLTDVMNN 190
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR-TYAGMVSRLDESVGN 266
HN P+F YL VH P QAP E + + S E+R MVS D G+
Sbjct: 191 HNADNPLFFYLPLHNVHT-----PLQAPKEWLDIYPANSTCEKRHILQAMVSVADNVTGH 245
Query: 267 VIAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
++ L+K GM +N+I++ ADN GAP GSN+PL+G KST ++GG+R +A +
Sbjct: 246 LVELLKKKGMWDNTIMVISADNGGAPC------SGSNYPLKGCKSTFFEGGIRSLAFVNG 299
Query: 326 PWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-KGAKTK 377
L ++++ S HI+DW T C AG++ +D+ +DG++ W ++T + KT+
Sbjct: 300 GLLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFPVDGLDVWPIITGENEKTQ 359
Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNG 413
EI+ N ++ AL + + K + G G
Sbjct: 360 HEEIVLAY-NFNHSHPEQGALIMGNYKLIVGPQGYG 394
>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 472
Score = 209 bits (532), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 139/373 (37%), Positives = 193/373 (51%), Gaps = 34/373 (9%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F LT + F + KKP+II+ +ADD+G+ D F G++ I TPN+D LA NG
Sbjct: 7 LFTLTIIGI---TFGHAQKNKKPNIIVFMADDMGYADTGFTGATDIQTPNLDNLAKNGAF 63
Query: 68 LNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEP--------WGLPLTEKLLPQY 118
Q Y A C PSR+AL++G+Y QH E P G+ + EKL P+
Sbjct: 64 FKQGYANHAYCGPSRAALLSGRY------QHRFGFETNPAYDPANPHMGIDVGEKLFPKR 117
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYD-HSCKATFEPYQGLD 177
L+EAGY T AIGKWHLG E + P RGFD YG+ G DY+ K +E Y
Sbjct: 118 LQEAGYKTGAIGKWHLGAAAE-FHPLNRGFDYFYGFLGGGHDYFRIDGTKKVWEAYLQPL 176
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+R N + DN G +T L +AA V + NK P FLY+A+ A H P QAP E
Sbjct: 177 VR-NKRADNFE-GYLTTALSNDAAQFV--KDNKENPFFLYVAYNAPHM-----PLQAPKE 227
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
++A++ I D +RR YA MV +D +G VI +L+ G+ EN+++ F++DNG P S
Sbjct: 228 DIARYAHIKDNKRRVYAAMVDVMDRGIGTVIQSLKDSGIYENTLIFFLSDNGGPQSKGGS 287
Query: 298 NKG----SNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
KG N P RG K +DGG+ A W +K V+ + + + A
Sbjct: 288 TKGWNGSDNQPFRGGKGNLYDGGVHVPFFASWPNKIKPGTVVNYPVNSLDIGRTAVALAG 347
Query: 353 GIEINDTSLDGVN 365
G +++GVN
Sbjct: 348 GDATAAPAMEGVN 360
>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 500
Score = 209 bits (532), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 126/375 (33%), Positives = 201/375 (53%), Gaps = 41/375 (10%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P KPHI+ +L DD G+ D SF + I TP+ L NGLILN+HYV C+PSR++ +T
Sbjct: 25 PDKPHIVFVLVDDWGFADASFRNPA-IKTPSFQYLVDNGLILNRHYVFKYCSPSRASFLT 83
Query: 87 GKYPIHIGMQH----GVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
G++P H+ + G++ G + +LP LK AGY+TH +GKWH G + Y
Sbjct: 84 GRFPHHVHQWNPTPPGMV------GTNINMTMLPAKLKTAGYSTHMVGKWHQGLYDPAYL 137
Query: 143 PTFRGFDSHYGYWQGLQDYYDHS--CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
P RGFD+ G+ Q + +++ + C F D R+ G Y + +Y +
Sbjct: 138 PVNRGFDTSSGFLQAGEGHFNQTIGCAVDFWKNHAPDTRN---------GTYDSYIYNKD 188
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYAGMVSR 259
+ ++H+ SKP+FLYL VHA P +AP E + + ++ + RR MVS
Sbjct: 189 LTTIFSKHDASKPLFLYLPLHNVHA-----PIEAPQEWLDLYPVNSTCATRRVLQAMVSV 243
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
D G+V+ L+++ M +N+I++ ADNG KGSN+PL+G K T ++GG+R
Sbjct: 244 ADNVTGHVVNLLKENDMWDNTIMVISADNGGADC-----KGSNYPLKGSKHTFFEGGVRV 298
Query: 320 VAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT- 371
+A + ++ S H++DW T C AG++ ND+ +DG++ W ++T
Sbjct: 299 IAFASGGLIPAARRGKSTDGFIHVADWYTTFCKMAGVDHNDSGKGKFPVDGLDVWPIITG 358
Query: 372 KGAKTKRSEILHNID 386
+ +T+ EI+ D
Sbjct: 359 ENEETQHEEIVLGYD 373
>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 470
Score = 209 bits (531), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 134/364 (36%), Positives = 194/364 (53%), Gaps = 26/364 (7%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQAL-CTPSRS 82
T +KP++II ADDLGW + G+ QIPTP+ID++A NG+ Q +V A C+PSR+
Sbjct: 35 TQTSRKPNVIIFYADDLGWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRA 94
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
L+TG+YP G + I GL L E L L GY T +GKWHLG E Y
Sbjct: 95 GLLTGRYPTRFGHEFNRIANVS--GLDLQETTLADRLHGLGYKTACVGKWHLGDGPE-YR 151
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR-HNMQVDNKTIGIYSTDLYTEAA 201
PT RGFD +G + F P + +D R N + Y+TD Y + +
Sbjct: 152 PTKRGFDEFFG---------TLANTPFFHPTKFVDSRVSNDVAEVSDENFYTTDEYAKRS 202
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+ I + +S P FLYL A HA P QAP + + +F I+DP+R+ +A M+S +D
Sbjct: 203 VEWIGQQQQS-PWFLYLPFNAQHA-----PLQAPQKYLDRFESIADPKRKLFAAMMSAMD 256
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+++G V+ +R+ G EN++V F++DNG P+ G S G PLRG K T ++GG R
Sbjct: 257 DAIGQVLGKVRELGQEENTLVFFISDNGGPTQGTTSQNG---PLRGFKMTTFEGGTRVPF 313
Query: 322 AI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRS 379
+ W L + + + ++ D LPT+ AAG +I+ LDGV+ T K
Sbjct: 314 LVQWKGKLPAGKTYDNPVINL-DVLPTVLTAAGSKIDPAWKLDGVDLVPYFTSSIANKPH 372
Query: 380 EILH 383
E L+
Sbjct: 373 ETLY 376
>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 137/404 (33%), Positives = 207/404 (51%), Gaps = 29/404 (7%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
++F LN +P++I+I+ADDLG+NDV F+G ++IPTP ID++A NG+ Y
Sbjct: 6 IIFILISLNAICASRPNLIVIMADDLGYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTS 65
Query: 75 -ALCTPSRSALMTGKYPIHIGMQHGVI--LEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
++C PSR+ +TG+Y G + L LP +E + + L + GY IGK
Sbjct: 66 YSVCGPSRAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGK 125
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDY--------YDHSCKATFEPYQGLDMRHNMQ 183
WHLG P RGFD +G+ G + + K + Y+ R++
Sbjct: 126 WHLGA-EPSLRPNKRGFDEFFGHLGGGHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTP 184
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
V Y T+ +++ A++ I + N KP FL+L++ A H P QA ++ +A+F
Sbjct: 185 VKTTK---YLTEEFSDEAVSFI-KRNHQKPFFLFLSYNAPHL-----PLQATEKYLARFP 235
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
I DP+R+TYA MVS +D+ V V+ +L++ + +N+IV F++DNG PS H NK N
Sbjct: 236 HIKDPKRKTYAAMVSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPS---HKNKSDNF 292
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLD 362
PL+G KS W+GG R A+ P Q ++V D T+ + A + D LD
Sbjct: 293 PLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIASLAQSPTHADKPLD 352
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
GVN +T G KT+ I D Q Y +R D K V
Sbjct: 353 GVNLIPFIT-GEKTQAPHAQIFIRKFD--QSRY-VVRQGDFKLV 392
>gi|325286699|ref|YP_004262489.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322153|gb|ADY29618.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 494
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 134/375 (35%), Positives = 198/375 (52%), Gaps = 31/375 (8%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
T K+P+I+++L DDLG++DV F+GS+ I TP +D LA G I YV C PSR+A
Sbjct: 37 TETKQPNILLVLCDDLGYSDVGFNGSTDIKTPELDKLANAGTIFTSAYVAHPFCGPSRAA 96
Query: 84 LMTGKYPIHIGMQHGVILEGEPWG--LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
L+TGKYP IG Q + GE G + E+ + + L+E+GY T AIGKWHLG E +
Sbjct: 97 LLTGKYPHTIGSQFNLPANGESLGKGIDTNEQFIAKTLQESGYYTGAIGKWHLGATEEFH 156
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----------LDMRHNMQVDNKTIGI 191
P RGF YG+ G +Y+ +A ++ + L + HN + +T
Sbjct: 157 -PNQRGFTDFYGFLGGGHNYFPEQYQAQYQKQKKAKKKIIRDYILPLEHNGKEVKETE-- 213
Query: 192 YSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
Y TD ++ A + E NK KP FLYLA+ A H P +A +E++ KF I D +R
Sbjct: 214 YLTDAFSREASRFVKEASNKKKPFFLYLAYNAPHV-----PLEAKEEDLEKFSVIKDKDR 268
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
RTYA MV +D + ++ L++ LEN++++F++DNG G + +N+PL G K
Sbjct: 269 RTYAAMVYAVDRGIKQIVNTLKETNQLENTLIIFLSDNG----GNTDHGATNNPLHGRKG 324
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEINDTS-LDGVNQW 367
W+GG R ++ P K K + F I+ D PTL ++ D L G N W
Sbjct: 325 DTWEGGYRVPMFMYCP--KNIAKGHTNNFPITSLDLYPTLAHIGKAKLPDNKVLHGKNVW 382
Query: 368 DVLTKGAKTKRSEIL 382
+ + + E L
Sbjct: 383 ESILNNKNIHKKETL 397
>gi|313236789|emb|CBY12041.1| unnamed protein product [Oikopleura dioica]
gi|313242643|emb|CBY39450.1| unnamed protein product [Oikopleura dioica]
Length = 622
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 130/410 (31%), Positives = 212/410 (51%), Gaps = 33/410 (8%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
FA+ L ++F T +P+I+ IL DDLGW DV ++ TP +D L NG
Sbjct: 8 FAIVAVFLKRESFAQETEDLRPNIVYILVDDLGWADVGWNNKGLESTPFMDKLVKNGTQF 67
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
Q Y C+PSR+ +TG+Y GM I P+G+ +K LP+YLKE GY THA
Sbjct: 68 TQMYSSHRCSPSRAMALTGRYAFRSGMGSFPIAREVPFGMNTQDKTLPEYLKEVGYDTHA 127
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL--DMRHNMQVDN 186
+GKWHLG Y PT RGFD+ YG++ G DY H K + Y + ++D
Sbjct: 128 VGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVDYRGHFIKRSKNFYHDFFDNTIEQHKLDL 187
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
++ G ++TDL+ + I+++ E +SK P ++YLA A H EP +AP + +A+ L+
Sbjct: 188 ESDGQWTTDLFRDRTIDILKEAKRSKTPAYVYLAFNAPH-----EPTRAPADLIARILE- 241
Query: 246 SDPE----RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP--SFGIHSN- 298
P R + V+++D +V + +K E ++++FM+DNG FG +
Sbjct: 242 KHPNLPYTRAEHLAAVNQIDTAVRQIYQQAKKDVKRE-TLIVFMSDNGGAVKKFGGGRSP 300
Query: 299 --------------KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
+ N+P +G K++ ++GG+ + + S K ++ + FHI DW
Sbjct: 301 DNTGHEYDDEESLPRACNYPFKGFKNSLFEGGILSPSFLISTARKFSRTRFNFPFHIIDW 360
Query: 345 LPTLCAAAGIEINDTSLDGVNQWDVL-TKGAKTKRSEILHNIDNVDNPQK 393
PT AGI + D DG+++ L ++ K++R+ ++ + + N +K
Sbjct: 361 APTFLDFAGI-VPDADFDGISRKSELESRWIKSQRTNFVYGVLDFYNEKK 409
>gi|340367653|ref|XP_003382368.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
Length = 504
Score = 206 bits (524), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 133/402 (33%), Positives = 212/402 (52%), Gaps = 44/402 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY------------VQA 75
K P+I+ +L DD G+ DV F + I +PN D LA GL+LN+HY V
Sbjct: 22 KLPNIVFVLVDDWGFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYLAKTGLVLNRHYVFK 80
Query: 76 LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
C+PSR++ +TG++P H + + + + G L ++P LK A Y+TH +GKWH G
Sbjct: 81 YCSPSRASFLTGRWPHHAHQWN--LPQSDMAGTNLNMTMIPAKLKAAKYSTHMVGKWHQG 138
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
FF Y P RGFD+ G+ G +D+ T E +D N D + G Y
Sbjct: 139 FFDPRYLPINRGFDTSSGFLGGGEDH------MTQEAECAIDYWKNDAPDPRN-GTYDAY 191
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYA 254
Y + +++ HN + P+F+YL VHA P QAPDE + + ++ + RRTY
Sbjct: 192 NYGDDLTDIMNNHNVNDPLFIYLPLHNVHA-----PLQAPDEWLNIYSVNSTCENRRTYQ 246
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
MVS D G ++ L+K GM +N+I++ ADNG + G GSN+PL+G K + ++
Sbjct: 247 AMVSVADNVTGYLVQLLKKKGMWDNTIIIISADNGGAACG-----GSNYPLKGCKGSFFE 301
Query: 315 GGMRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQW 367
GG+R +A + L ++++ S HI+DW T C AG++ +D+ +DG++ W
Sbjct: 302 GGVRALAFVNGGLLPESRRGESTDGFIHIADWYTTFCKLAGVDPDDSGTGKFPVDGLDVW 361
Query: 368 DVLT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
++T + KT+ +I+ D ++ A+ + D K + G
Sbjct: 362 PIITGEDEKTQHEDIILGYDFF---SRHTGAIIMGDYKLLVG 400
>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 453
Score = 206 bits (524), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 132/364 (36%), Positives = 199/364 (54%), Gaps = 30/364 (8%)
Query: 11 LTCTLLFNDAFL---NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
L LLF+ + A KP+I++I+ADDLG+ D+ F G IPTP ID LA +G+
Sbjct: 11 LAAALLFSMTAVVGWAADASSKPNIVLIVADDLGYGDLGFQGGRDIPTPRIDGLARSGVT 70
Query: 68 LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHG---VILEGEPWGLPLTEKLLPQYLKEAG 123
+ YV C+P+R+ L+TG+Y G + E +GL L E+ LPQ LK+AG
Sbjct: 71 CSSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNPGNAARVTETFGLSLEERTLPQRLKQAG 130
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
YAT +GKWHLGF + + P RGFD +G+ G Y+ + + +P +R +
Sbjct: 131 YATGIVGKWHLGFAPQ-FQPLERGFDEFFGFLGGAHPYFPDAN--SNDP-----IRRGRE 182
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
++ Y TD + ++ I + NK++P FLYLA AVH P A E + +F
Sbjct: 183 AVVESE--YLTDAFARESVAYI-DRNKNRPFFLYLAFNAVH-----NPQHAKPEHLERFK 234
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
I+D RRTYAGM++ +D++VG V+ L + + ++V+F++DNG P +N +N
Sbjct: 235 SIADERRRTYAGMLTAMDDAVGQVLDKLAAEKLEQKTLVVFISDNGGPP----ANGSTNT 290
Query: 304 PLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPT-LCAAAGIEINDTSL 361
PL G K+T W+GG+R + W L +K + + D LPT L AA + + L
Sbjct: 291 PLNGQKATTWEGGVRVPFVVSWPGQLAAGKKYEEPVIQL-DLLPTILTAAGVVADDAAKL 349
Query: 362 DGVN 365
DGV+
Sbjct: 350 DGVD 353
>gi|423330335|ref|ZP_17308119.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
CL03T12C09]
gi|409231951|gb|EKN24799.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
CL03T12C09]
Length = 459
Score = 206 bits (523), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 143/387 (36%), Positives = 204/387 (52%), Gaps = 25/387 (6%)
Query: 6 KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
K LT +L N LN + A KP+ +II DD+G+ D+S +G+ I TPNID +A
Sbjct: 5 KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
G+ L Q YV A + TPSR+ALMTG+ P+ G+ + V+ GL E + + L
Sbjct: 65 EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
+++GYAT +GKWHLG F Y PT GFD+++G Y + + A P L
Sbjct: 125 QQSGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPL- 182
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ Q++++ T YTE A++ I +H+K +P FLY AH H P
Sbjct: 183 IVDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
A+F S +R Y +V +D SVG V+ ALR++G+ EN+ V+F +DNG P H
Sbjct: 233 TNARFAGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
N GS PL+ K T W+GG R A W P K ++ E+ D PT + AGIE
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D LDGVNQ +L + + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375
>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 454
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 133/356 (37%), Positives = 193/356 (54%), Gaps = 30/356 (8%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
+ TLLF+ + L T KP+I+IILADDLG+ DV +HG +IPTPNID +A G+ +
Sbjct: 3 IFITLLFSCSLLWAT--DKPNILIILADDLGYADVGYHGLEEIPTPNIDRIANEGVQFSA 60
Query: 71 HYVQ-ALCTPSRSALMTGKYPIHIGMQHGV-----ILEGEPWGLPLTEKLLPQYLKEAGY 124
Y ++C P+R+ALM+G Y IG + G+ + E G+P K L QY +EAGY
Sbjct: 61 GYSNGSICGPTRAALMSGVYQQRIGCE-GICGGRKLNEHVVVGMPREVKTLAQYFQEAGY 119
Query: 125 ATHAIGKWHLG---FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
AT GKWHLG F + PT RGFD +G +G Y D + Y +R +
Sbjct: 120 ATGLFGKWHLGGERLFDKTLMPTSRGFDEFFGILEGASLYDDTVNRE--RKY----IRQD 173
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
+D + G Y TD A++ I KP FLYL AVHA P QA ++ + +
Sbjct: 174 TVIDYE--GEYFTDAIGREAVSFITRKG-DKPFFLYLPFTAVHA-----PMQASEKYMQR 225
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
F I+DP RR +A M+S +D+++G V AL G+L+N++++F +DNG +N
Sbjct: 226 FAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKP---DNNYSL 282
Query: 302 NHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
NHPL+G K+ ++GG+R A + W K + + D P+ AA I +
Sbjct: 283 NHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALEAAQITV 338
>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 138/400 (34%), Positives = 204/400 (51%), Gaps = 34/400 (8%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+ + +L DD G+ DVSF + I +PN + LA GLIL++HYV C+PSR++ +TG+
Sbjct: 21 KPNFVFVLVDDWGYADVSFRNPA-IHSPNFEKLAKEGLILDRHYVFKYCSPSRASFLTGR 79
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+P H + E G L ++P LK A Y TH IGKWH G ++E Y P RGF
Sbjct: 80 WPHHAHQWNPP--EDALVGANLKMTMIPAKLKLARYKTHMIGKWHEGLYKEAYLPINRGF 137
Query: 149 DSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
D+ G+ G +++ + C F G D R+ G Y Y + ++I
Sbjct: 138 DTMSGFLGGGENHMNQQVGCATDFWKNDGPDSRN---------GSYDAYTYRDDLTDIIT 188
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVG 265
HN S P FLYL VH P QAPDE + + S + RR Y MVS D G
Sbjct: 189 NHNPSDPFFLYLPLHNVHT-----PLQAPDEWLNIYPANSTCQNRRIYQAMVSVADNVTG 243
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
+++ L+ +GM +N++++ ADNG G GSN+PL+G K T ++GG+R +A +
Sbjct: 244 HLVKLLQDNGMWDNTLLVISADNGGAPCG-----GSNYPLKGSKGTFYEGGVRSLAFVNG 298
Query: 326 PWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-KGAKTK 377
L +K S HI+DW T C AGI+ +D+ +DG+N W ++ + KT
Sbjct: 299 GLLPTNRKGKSTQGFIHIADWYTTFCQLAGIDHDDSGSGKFPVDGMNVWPIINGENEKTL 358
Query: 378 RSEILHNID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
+I D + N + AL V + K + G + D
Sbjct: 359 HEQIPLGYDFDYGNLRPNQGALIVGNNKLIVGPQDTKSCD 398
>gi|323451422|gb|EGB07299.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 512
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 157/554 (28%), Positives = 251/554 (45%), Gaps = 80/554 (14%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPH+I +LADD GWN+ ++ + TP+ DAL G+ L HYV C+P+RS+ ++G+
Sbjct: 20 KPHLIFLLADDEGWNNGFYN--EDVLTPHTDALVAAGVKLTNHYVYKFCSPTRSSFLSGR 77
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
PIH+ ++ E G+PL + + ++ AGY T +GKWH G +TP RGF
Sbjct: 78 LPIHVNQENSAT-EQAGAGVPLGMSTIGETMRAAGYRTAHVGKWHCGMATPAHTPYGRGF 136
Query: 149 DSHYGYWQGLQDYYDH--SCKATFEP-----YQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
DS G++ +D+Y +A E G+D+ + G Y +T AA
Sbjct: 137 DSSIGFFNFGEDHYTQVRGGQALEEGGASGQCSGVDLWRDGAPARGENGTYGGYTFTRAA 196
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+ + E N +KP+FL+ A +H P + P E VA++ + + R T GM + LD
Sbjct: 197 VAAV-EANDTKPLFLFAAFQNLH-----PPLEVPPEYVARYAN--ETLRTTINGMAAFLD 248
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+SVGNV AL+K M ++S++++ DNG G + G + P RG K + + GG+R VA
Sbjct: 249 DSVGNVTKALKKQNMYDSSLIVYTPDNG----GYLTQGGDDAPFRGGKFSDFQGGVR-VA 303
Query: 322 AIWSPWLKQT---QKVSSELFHISDWLPTLCAAAGIEINDTSL------DGVNQWDVLTK 372
A S L S+ L H+ DW+ T A G I ++ D ++ W L++
Sbjct: 304 AFISGGLVPAGLRGTASAALVHVCDWMATFAALGGGAIPADAVAGLPAPDSLDVWPALSR 363
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPK 432
G + R+E+ +I D + + + D + Y G +
Sbjct: 364 GRPSPRTEVPLSILPADGAARAF--VDFDGVGYFVGGE---------------------- 399
Query: 433 EVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDN 492
G+ +K L + D N++ A +A R
Sbjct: 400 -------GLIVGDMKLLLGYQHIGPFDKVTNATCAGVA----------GPWAAGR----- 437
Query: 493 KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP--GNKPFDK 550
G C+ CL+N+T DP E ++LA ++ + L +L ++T+ P G K
Sbjct: 438 PGVPCDCATSGCLYNVTADPTESDDLAAARPGAVAALRTRLEALRATVYAPDRGAKDDAA 497
Query: 551 RADPARWNNIWVPW 564
A A W PW
Sbjct: 498 CAQIATNGGFWGPW 511
>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 389
Score = 205 bits (522), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 132/377 (35%), Positives = 203/377 (53%), Gaps = 40/377 (10%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN-GLILNQHYVQALCT 78
A T KP+++ +L DD G+ DVSF + I +P+ ++LA GLIL++HYV C+
Sbjct: 13 AIAAATVNAKPNLVFVLVDDWGFADVSFRNPA-ISSPHFESLATKEGLILDRHYVFKYCS 71
Query: 79 PSRSALMTGKYPIHIGM----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
PSR++ +TG++P H Q G++ G + ++P LK AGY TH +GKWH
Sbjct: 72 PSRASFLTGRFPHHAHQWNPPQSGLV------GANINMTMIPAKLKTAGYKTHMVGKWHE 125
Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
GF+ + + P RGFD+ G+ G +D+ T E +D N +D + G Y
Sbjct: 126 GFYLKKFLPINRGFDTMSGFLGGGEDH------MTQEIICAVDYWKNDAIDTRN-GTYDD 178
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTY 253
Y E ++ H+ S P FLYL VHA P QAPDE + + ++ + +RRT
Sbjct: 179 YTYVEDLSKILTNHDPSDPFFLYLPLHNVHA-----PLQAPDEFLNLYPVNSTCSDRRTI 233
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGMKSTP 312
MVS D G V+ L+ GM +N++++ ADN GAP +GSN+PL+G K T
Sbjct: 234 QAMVSVADNVTGFVVKMLKDKGMWDNTLIVVSADNGGAPC------EGSNYPLKGSKMTF 287
Query: 313 WDGGMRGVAAIWSPWLKQTQ--KVSSELFHISDWLPTLCAAAGIEINDTS-----LDGVN 365
++GG+R +A + L + K + HI+DW T C AG++ +D+ +DG+N
Sbjct: 288 YEGGVRSLAFVSGGLLPSDRRGKKTEGFIHIADWYTTFCQLAGVDSDDSGAGKFPVDGMN 347
Query: 366 QWDVLT-KGAKTKRSEI 381
W +LT + KT+ +I
Sbjct: 348 VWPILTGENEKTQHEQI 364
>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 540
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 137/425 (32%), Positives = 210/425 (49%), Gaps = 47/425 (11%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
F L L L K+P+III L DDLGW D+S G+ I TP ID+L GL L
Sbjct: 3 FKLFIVSLVTFCALANAEEKRPNIIIFLVDDLGWADISLRGAP-IDTPAIDSLFSEGLTL 61
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYA 125
++ Y +C+P+R+ALMTG+ P+ +G+ + V++ PW G+ E +P+ K AGY
Sbjct: 62 DRFYTTPICSPTRAALMTGRDPLRLGISYSVVM---PWMNNGVHPDEHFMPESFKAAGYQ 118
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN-MQV 184
T +GKWHLG +E++ P RGFD YG+ Y+ F G+D + N + +
Sbjct: 119 TAMVGKWHLGHSQEIFHPNARGFDDFYGHLHTEVGYF-----LPFANQGGVDFQRNGVTI 173
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
++ Y T L + A I +K KP FLY+ +A H+ P +APD+ V K+ +
Sbjct: 174 ADEG---YETFLLADEASRWIKARDKDKPFFLYMPFIAPHS-----PLEAPDDLVKKYEN 225
Query: 245 ISDPE---------------------RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
+ D R YA +V LD+++G V+ L K G+ E +I+L
Sbjct: 226 LEDTRELTRSAAIDRTRTLNGFSPSARPMYAAVVDGLDQAIGQVLDTLTKEGIEEETIIL 285
Query: 284 FMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHIS 342
F +DNG + N PLRG K ++GG+R VAA+ W L + S + +
Sbjct: 286 FSSDNGGAA--YAGGGADNFPLRGGKGDTYEGGIRVVAAMRWKGTLDAGKSFES-IMTVM 342
Query: 343 DWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVD 401
D PTL +AAG+ +N + G + W + G K + ++ N +++ + D
Sbjct: 343 DVFPTLASAAGVSMLNTKEIWGRDLWSAIQNGKNIKLDKEIYFASETPNYGEFHTTVFND 402
Query: 402 DLKYV 406
K V
Sbjct: 403 KWKLV 407
>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 478
Score = 205 bits (521), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 121/345 (35%), Positives = 185/345 (53%), Gaps = 17/345 (4%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
+P+II+I+ADD+G+ D F G++ I TPN+D LA +G+I NQ YV A C PSR+ + G
Sbjct: 21 RPNIIVIMADDMGYADAGFTGATDILTPNLDKLAESGVIFNQGYVTHAFCGPSRAGFLAG 80
Query: 88 KYPIHIGMQHGVILE-GEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
+Y G +H + P G+ + E L P L++ GY T IGKWHLG Y P
Sbjct: 81 RYQHRFGFEHNTPYDPANPLAGIDVRETLFPARLQDVGYTTGIIGKWHLGASSPFY-PLN 139
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
RGFD YG+ G DY++ + + N +V N Y T + A+ +
Sbjct: 140 RGFDYFYGFLTGGHDYFEIDVTQPVKSAYLQGLFRNKRVAN--FEGYLTTALSRDAVQFV 197
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
+ NK P FL+L++ A H +P QAP E++A++ I D +RR YA MV +D +G
Sbjct: 198 ND-NKENPFFLFLSYNAPH-----QPLQAPQEDIARYAHIKDKKRRVYAAMVDVMDRGIG 251
Query: 266 NVIAALRKHGMLENSIVLFMADNGAP-SFGIHSNKG---SNHPLRGMKSTPWDGGMR-GV 320
V+ AL ++G+ EN+++ F++DNG P S H KG SN P RG K+ ++GG+
Sbjct: 252 EVVTALDRNGLRENTLIFFLSDNGGPVSSKKHPGKGNGSSNGPFRGGKTDFYEGGVHVPF 311
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
A W ++ Q + + + A G ++ +++GVN
Sbjct: 312 IASWPAKIQPGQVYDKPVISLDIGRTAVELAGGDPLSGNAMEGVN 356
>gi|255014752|ref|ZP_05286878.1| arylsulfatase A [Bacteroides sp. 2_1_7]
gi|410102839|ref|ZP_11297764.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
gi|409237966|gb|EKN30761.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
Length = 459
Score = 204 bits (520), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 143/387 (36%), Positives = 203/387 (52%), Gaps = 25/387 (6%)
Query: 6 KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
K LT +L N LN + A KP+ +II DD+G+ D+S +G+ I TPNID +A
Sbjct: 5 KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
G+ L Q YV A + TPSR+ALMTG+ P+ G+ + V+ GL E + + L
Sbjct: 65 EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
++ GYAT +GKWHLG F Y PT GFD+++G Y + + A P L
Sbjct: 125 QQNGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPSTPL- 182
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ Q++++ T YTE A++ I +H+K +P FLY AH H P
Sbjct: 183 ILDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
A+F S +R Y +V +D SVG V+ ALR++G+ EN+ V+F +DNG P H
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
N GS PL+ K T W+GG R A W P K ++ E+ D PT + AGIE
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D LDGVNQ +L + + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375
>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 468
Score = 204 bits (520), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 194/348 (55%), Gaps = 25/348 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
+P+I++I+ DD+G++D+ HG IPTP++DALA +G+ YV C+P+R+ L+TG
Sbjct: 29 RPNIVVIVGDDMGYHDLGVHGCKDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGLLTG 88
Query: 88 KYPIHIGMQH--GVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
+Y G + G GE GLPL+E L LK+ GY T +GKWHLG E P
Sbjct: 89 RYQQRFGHEFNPGPTPTGE-IGLPLSETTLADRLKKVGYKTGMVGKWHLGN-DEKRHPLS 146
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFD +G+ G + Y+ A+ G + R VD K Y TD + A+
Sbjct: 147 RGFDEFFGFLGGARTYFATPGNAS----AGTKLLRGREVVDEKE---YLTDAFAREAVAY 199
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I + +K+ P FLYL AVH P +A + + +F +SDP+R+ Y M+S +D++V
Sbjct: 200 I-DRSKASPFFLYLTFNAVHT-----PMEASQKYLDRFTAVSDPKRQKYCAMMSAMDDAV 253
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI- 323
G V+A L + +LEN+++ F++DNG P+ +N G N PLRG K+T W+GG+R +
Sbjct: 254 GQVVAKLEREKLLENTLIFFVSDNGGPT---AANTGDNTPLRGFKATTWEGGIRVPYFVS 310
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
W + + + I D++PT AAAG + + DGVN LT
Sbjct: 311 WKGKIPAGKTYDQPVIQI-DFVPTALAAAGAPAAEKT-DGVNLLPYLT 356
>gi|262383796|ref|ZP_06076932.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294694|gb|EEY82626.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 459
Score = 204 bits (520), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 143/387 (36%), Positives = 203/387 (52%), Gaps = 25/387 (6%)
Query: 6 KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
K LT +L N LN + A KP+ +II DD+G+ D+S +G+ I TPNID +A
Sbjct: 5 KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
G+ L Q YV A + TPSR+ALMTG+ P+ G+ + V+ GL E + + L
Sbjct: 65 EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
++ GYAT +GKWHLG F Y PT GFD+++G Y + + A P L
Sbjct: 125 QQNGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPSTPL- 182
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ Q++++ T YTE A++ I +H+K +P FLY AH H P
Sbjct: 183 ILDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
A+F S +R Y +V +D SVG V+ ALR++G+ EN+ V+F +DNG P H
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
N GS PL+ K T W+GG R A W P K ++ E+ D PT + AGIE
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D LDGVNQ +L + + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375
>gi|150008924|ref|YP_001303667.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937348|gb|ABR44045.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
Length = 459
Score = 204 bits (520), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 143/387 (36%), Positives = 203/387 (52%), Gaps = 25/387 (6%)
Query: 6 KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
K LT +L N LN + A KP+ +II DD+G+ D+S +G+ I TPNID +A
Sbjct: 5 KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
G+ L Q YV A + TPSR+ALMTG+ P+ G+ + V+ GL E + + L
Sbjct: 65 EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
+++GYAT +GKWHLG F Y PT GFD+++G Y + + A P L
Sbjct: 125 QQSGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPL- 182
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ Q++++ T YTE A++ I H+K +P FLY AH H P
Sbjct: 183 IVDGKQIESEPDQGELTRRYTEKAVSFIKNHSK-EPFFLYFAHTFPH---------IPLY 232
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
A+F S +R Y +V +D SVG V+ ALR++G+ EN+ V+F +DNG P H
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
N GS PL+ K T W+GG R A W P K ++ E+ D PT + AGIE
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D LDGVNQ +L + + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375
>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 136/401 (33%), Positives = 210/401 (52%), Gaps = 50/401 (12%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K P+I+ +L DD G+ DV F + I +PN D LA GL+LN+HYV C PSR++L+TG
Sbjct: 22 KLPNIVFVLVDDWGFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCAPSRASLLTG 80
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
++P H+ Q + + G L + P LK A Y+TH +GKWH GFF Y P RG
Sbjct: 81 RWPHHV-YQWNLATDATA-GTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYLPINRG 138
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQG--LDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
FD+ GY G D+++ QG +D N D + G Y + Y + N++
Sbjct: 139 FDTSSGYLCGWVDHFNQK--------QGCAVDCWKNNAPDPRN-GTYDSYYYRDDLTNIV 189
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-----SDPERRTYAGMVSRL 260
H+ + P+F+YL VH P +AP E +LDI + R T MVS
Sbjct: 190 NNHDANNPLFIYLPLHNVHT-----PLEAPKE----WLDIYPANSTCKTRHTLQAMVSVA 240
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D G+++ L++ M +N+I++ ADNG SN+PLRG K T ++GG+R +
Sbjct: 241 DNVTGHLVELLKEKEMWDNTIMVISADNGGDQC-----VASNYPLRGCKYTFFEGGVRSL 295
Query: 321 AAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-K 372
A + L ++++ S HISDW T C AG++ +D+ +DG++ W ++T +
Sbjct: 296 AFVNGGLLPESRRGQSTDGFIHISDWYTTFCKLAGVDPDDSGTGKFPVDGLDVWPIITGE 355
Query: 373 GAKTKRSEIL--HNIDNVDNPQKYYAALRVDDLKYVAGTDN 411
KT+ EI+ +N + + A+ VD+ K + G+ N
Sbjct: 356 NEKTQHQEIVLGYNFSS-------HGAIIVDNYKLIVGSQN 389
>gi|298375848|ref|ZP_06985804.1| sulfatase family protein [Bacteroides sp. 3_1_19]
gi|298266885|gb|EFI08542.1| sulfatase family protein [Bacteroides sp. 3_1_19]
Length = 459
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 143/387 (36%), Positives = 203/387 (52%), Gaps = 25/387 (6%)
Query: 6 KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
K LT +L N LN + A KP+ +II DD+G+ D+S +G+ I TPNID +A
Sbjct: 5 KSLSVLTAAVLSNSLSLNAASDAVNKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
G+ L Q YV A + TPSR+ALMTG+ P+ G+ + V+ GL E + + L
Sbjct: 65 EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
++ GYAT +GKWHLG F Y PT GFD+++G Y + + A P L
Sbjct: 125 QQNGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPL- 182
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ Q++++ T YTE A++ I +H+K +P FLY AH H P
Sbjct: 183 IVDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
A+F S +R Y +V +D SVG V+ ALR++G+ EN+ V+F +DNG P H
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTKHE 289
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
N GS PL+ K T W+GG R A W P K ++ E+ D PT + AGIE
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D LDGVNQ +L + + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375
>gi|256841082|ref|ZP_05546589.1| arylsulfatase A [Parabacteroides sp. D13]
gi|256736925|gb|EEU50252.1| arylsulfatase A [Parabacteroides sp. D13]
Length = 459
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 142/387 (36%), Positives = 204/387 (52%), Gaps = 25/387 (6%)
Query: 6 KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
K LT +L N LN + A KP+ +II DD+G+ D+S +G+ I TPNID +A
Sbjct: 5 KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
G+ L Q YV A + TPSR+ALMTG+ P+ G+ + V+ GL E + + L
Sbjct: 65 EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
+++GYAT +GKWHLG F Y PT GFD+++G Y + + A P L
Sbjct: 125 QQSGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPL- 182
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ +++++ T YTE A++ I +H+K +P FLY AH H P
Sbjct: 183 ILDGKEIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
A+F S +R Y +V +D SVG V+ ALR++G+ EN+ V+F +DNG P H
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
N GS PL+ K T W+GG R A W P K ++ E+ D PT + AGIE
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D LDGVNQ +L + + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375
>gi|340369111|ref|XP_003383092.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 197/384 (51%), Gaps = 48/384 (12%)
Query: 30 PHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
PHI+++L DD GW +V +H + ++ TPNID L GL LNQHY C+PSRS+L++
Sbjct: 25 PHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDKLVKEGLELNQHYAFRFCSPSRSSLIS 84
Query: 87 GKYPIHIGMQHGVILEGEP-------WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
G+ PIH+ + P +P + + ++ AGYATH +GKW G
Sbjct: 85 GRLPIHVNDLNLAPTNYNPNDPVSGFSAIPRNMTGIAEKMRGAGYATHQVGKWDAGMATS 144
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH---SCK-------ATFEPYQGLDMRHNMQVDNKTI 189
+TP RGF + +GY+ DYY CK T +P G++ DN
Sbjct: 145 DHTPKGRGFQTSFGYFHHANDYYTEIAGKCKIPIVDLWETDKPAHGIN---GTGPDN--- 198
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
Y L+ E ++V++ H+ S P+FLY A VHA P++ PD + KF I +
Sbjct: 199 --YEEGLFKERVLSVVSNHDPSTPLFLYYAPHIVHA-----PYEVPDRYLNKFSFIDNHL 251
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS-NHPLRGM 308
R+ Y MV+ LD+ VG + AL+ GM +N + + +DNG P +H G+ N+PL+G
Sbjct: 252 RQYYHAMVNYLDDVVGELTDALKNKGMWDNLLFVTSSDNGGP---VHEGTGANNYPLKGG 308
Query: 309 KSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIEINDTS------ 360
K+T W GG+R A + +L K + + H++DW T CA AG++ D
Sbjct: 309 KTTDWQGGVRVNAFVSGGYLPEKMRGQKTDGYIHLADWYGTFCAIAGVDPTDEKAAKAKL 368
Query: 361 --LDGVNQWDVLTKGAKTK-RSEI 381
+D N W +++ T R+++
Sbjct: 369 PPVDSYNMWPLISGETDTSPRTDV 392
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 2/57 (3%)
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA 552
HC CL+NI DDP E+NN+A + ++LK+++DKL Y++T P P A
Sbjct: 436 HCGD--DGCLYNIKDDPEERNNIATKEPEMLKKMQDKLKNYQATYFNPDRGPVSPDA 490
>gi|29348898|ref|NP_812401.1| arylsulfatase [Bacteroides thetaiotaomicron VPI-5482]
gi|383125065|ref|ZP_09945724.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
gi|29340804|gb|AAO78595.1| arylsulfatase B precursor [Bacteroides thetaiotaomicron VPI-5482]
gi|251837419|gb|EES65514.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
Length = 458
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 123/337 (36%), Positives = 186/337 (55%), Gaps = 32/337 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P+I+I+LADDLGW DV FHGS +I TP ++ALA G++L++ Y + TP+R+ LMTG
Sbjct: 24 ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLNALAAEGVVLDRFYTAPISTPTRAGLMTG 82
Query: 88 KYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYATHA-IGKWHLGFFREVYTP 143
+YP G++ VI PW GL E+ L L GY+ A IGKWHLG R+V+ P
Sbjct: 83 RYPNRFGIRTTVI---PPWREDGLDENEETLADMLARNGYSNRAIIGKWHLGHTRKVHYP 139
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGF YG+ G DY+DH + + + + ++ YST+L T+ A+
Sbjct: 140 INRGFSHFYGHLNGAIDYFDHMREGELDWHNDWETCYDKG--------YSTELITQEAVR 191
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD------ISDPERRTYAGMV 257
I + K P LY+A+ A H P QA ++++ + D + +R TY MV
Sbjct: 192 CINTYEKEGPFLLYVAYNAPHT-----PLQAQEKDIELYCDDFGSLTPKEQKRVTYQAMV 246
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
S +D +G ++ AL+K G+++N+ ++F +DNG P+ S+ G LRG K WDGG+
Sbjct: 247 SCMDRGIGTIVDALKKKGIMDNTFLIFFSDNG-PAGVPGSSSGK---LRGRKFDEWDGGV 302
Query: 318 RGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
R A W + +SS++ D +PTL G
Sbjct: 303 RVPAVFYWKRAESNYKNLSSQVTGFVDIVPTLKELVG 339
>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/402 (31%), Positives = 207/402 (51%), Gaps = 47/402 (11%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPHI+ I+ DD G+++ ++ TPN+D LA +GL+L+Q Y CTPSRS+ ++G+
Sbjct: 40 KPHIVFIMIDDWGFHNFGIRNQTEAKTPNMDKLARDGLLLDQAYSYFWCTPSRSSFLSGR 99
Query: 89 YPIHIGMQHGVILEGEPW----------GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
P+H+ H + W G+P +P ++++AGY TH +GKW G
Sbjct: 100 LPLHV--FHSNRVSSASWDSQHPDTAGVGIPRNMTTIPAFMRKAGYKTHMVGKWDAGIAT 157
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHS------------CKATFEPYQGLDMRHNMQVDN 186
++P RGFDS Y+ +YY ++ C+ +D+ ++ + +
Sbjct: 158 PQHSPLGRGFDSSLHYFNHDNNYYAYNYTDTVSVQFPVKCQLLKYVTGFVDLWNSTEPAD 217
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
IG Y ++ + A++VI++H+ S P+FLY A HA P Q P + +F I
Sbjct: 218 APIGTYEEHVFRDHALDVISKHDASTPLFLYYASHIAHA-----PLQVPQAYLDRFQHIP 272
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF----GIHSN---K 299
DP RRTY MV D+ VGN+ AAL+ GM EN++++ +DNG + G+ ++
Sbjct: 273 DPIRRTYQAMVLAADDVVGNITAALKAKGMWENTLLVVHSDNGGAIYQCLDGMDASLCGG 332
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEIN 357
SN PLRG K +P+ GG+R A + +L ++ V + L ++DW+ T C AG+
Sbjct: 333 ASNWPLRGGKLSPFQGGIRVNAFVNGGFLPSHRRGQVETGLVVLADWMATFCELAGVACE 392
Query: 358 D--------TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNP 391
D ++D + W +L+ GA + + + I +P
Sbjct: 393 DPIAKKANLPAIDSQSLWSLLS-GANSTSPRVEYPITFDQSP 433
>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
siliculosus]
Length = 597
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 144/387 (37%), Positives = 206/387 (53%), Gaps = 54/387 (13%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSS--QIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
++ +P++I IL DD+G NDV + + Q+ TP ++ LA G+IL+ +Y +CTPSR+
Sbjct: 69 SSSGQPNVIFILVDDVGTNDVGYESTDLWQL-TPFMNTLAAEGVILDDYYSNEICTPSRA 127
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+LMTG+ GMQ GV+ + WGLP+ E L + + AGY+TH GKWHLG +
Sbjct: 128 SLMTGRDSFRTGMQFGVVEDSAAWGLPIDEVTLAERFQAAGYSTHMTGKWHLGVYSNANY 187
Query: 143 PTFRGFDSHYGYWQGLQDYYDH-------------SCKATFEPYQGLDMRHNMQVDN--- 186
P RGFD+ GY G + YY H +C F Y D N +
Sbjct: 188 PFSRGFDTFLGYTGGGEGYYTHRECVTPEFEGGQYTCYQDFG-YGDKDGYINFTTNTTRK 246
Query: 187 --KTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAP------ 235
+ G YST + T+ AI V EH + S P+FLY+AH AVH EP +P
Sbjct: 247 GPEMTGKYSTTVITDRAIEVAREHVEKSPSDPLFLYVAHQAVH-----EPLDSPPPGSFT 301
Query: 236 DEEVA--KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PS 292
DEEV + LD S ERR A +V LD+ V + AL G+L+N++++F +DNGA PS
Sbjct: 302 DEEVELLEGLDYSHRERRALASIVYYLDKEVRRLHTALDALGVLDNAVIVFASDNGACPS 361
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS------PWLKQTQKVSSELFHISDWLP 346
G GSN+PLRG K T + GG+R ++S P + + S + H DW P
Sbjct: 362 KG-----GSNYPLRGFKHTTFQGGVRVPGFVYSKSADLIPEEARGTRYSG-MMHSMDWTP 415
Query: 347 TLCAAA-GIEINDTS--LDGVNQWDVL 370
TL + G+ +++ L G +QWD +
Sbjct: 416 TLGSVVPGLPLDEMGKELSGFDQWDAI 442
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 2/72 (2%)
Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
C+ V LF+I DP E NLAES + ++ ++ + V PG KP D A
Sbjct: 526 QCDGVVFDFLFHIETDPLETENLAESMPEKYAEMVERFDVATKDEVVPGFKPQDPSGMEA 585
Query: 556 --RWNNIWVPWY 565
++NN V W+
Sbjct: 586 WTKFNNWVVAWH 597
>gi|390356461|ref|XP_793612.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 181
Score = 203 bits (516), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 92/157 (58%), Positives = 119/157 (75%), Gaps = 6/157 (3%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
FF L C L + T P P++I ++ADDLGWNDVSFHGSSQIPTP+IDALA G+I
Sbjct: 13 FFLLGCML----ECVYTAQP--PNVIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVI 66
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L +YV +CTP+RSA+MTGK+PIH G+Q+ VI+ EP+GL E ++PQYL+ GY TH
Sbjct: 67 LTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTH 126
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH 164
+GKWHLGFF+E TP+ RGF+S+YGY+ G+QDY+ H
Sbjct: 127 MVGKWHLGFFKESLTPSHRGFESYYGYYGGMQDYFTH 163
>gi|449671825|ref|XP_002165184.2| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 160
Score = 203 bits (516), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 93/154 (60%), Positives = 117/154 (75%), Gaps = 3/154 (1%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KPHII+I+ADDLGWND+ FHGS +I TPNID LA NG++L+ +YV +CTPSRSA+MTG
Sbjct: 8 EKPHIILIVADDLGWNDIGFHGSKEISTPNIDRLATNGVVLDNYYVLPICTPSRSAIMTG 67
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+YPIH GMQ I P+G+ L EK LPQYLK+ GY T+ +GKWHLGFF + YTPT+RG
Sbjct: 68 RYPIHTGMQQDTIYGPNPYGVSLNEKFLPQYLKQQGYKTYGVGKWHLGFFAKEYTPTYRG 127
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
FDS+YG + G DY++HS TF GLD+ N
Sbjct: 128 FDSYYGSYLGKGDYWNHSNTETF---TGLDLHDN 158
>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
Length = 466
Score = 203 bits (516), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 129/380 (33%), Positives = 207/380 (54%), Gaps = 26/380 (6%)
Query: 14 TLLFNDAFLNT---TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
LF L T +A ++P+II+IL+DD G+ D +GS IPTP+IDALA G+
Sbjct: 8 VFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGGIRFTN 67
Query: 71 HYVQA-LCTPSRSALMTGKYPIHIGMQHGVI------LEGEPWGLPLTEKLLPQYLKEAG 123
YV A +C PSR+ L+ G+Y G +H V + + GL T + + ++ G
Sbjct: 68 SYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQIQDIGLSDTVRTIADQMQSNG 127
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
Y T AIGKWH G + P +GF+ +G+ G + ++ + + + + + +
Sbjct: 128 YETMAIGKWHQGN-ETKHHPLHKGFNHFFGFIGGHRSFF--PIRTAIKQEEKI-LNDYTE 183
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
VD K + Y TD++T+ AI+ + + + KP F+YL++ AVH P +A +++A+F
Sbjct: 184 VDEKDV-YYLTDMFTDKAISYMRQ-KRDKPYFIYLSYNAVHT-----PVEATPQKLAQFA 236
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
+ + RR+YAG++S +D+ +G ++ L+K G LEN++++F+ DNGA + +N N
Sbjct: 237 RLKNAHRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT----NNGADNG 292
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA-GIEINDTSLD 362
PLRG+K + W+GG+R + P Q K L D +PT AAA G +D LD
Sbjct: 293 PLRGLKGSKWEGGIRVPMIMHWPKQLQAGKDYKGLVSSLDLVPTCVAAAQGTVRSDMQLD 352
Query: 363 GVNQWDVLTKGAKTKRSEIL 382
GVN LT K + + L
Sbjct: 353 GVNILPFLTGKNKQEPHDFL 372
>gi|346464573|gb|AEO32131.1| hypothetical protein [Amblyomma maculatum]
Length = 418
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 148/432 (34%), Positives = 206/432 (47%), Gaps = 32/432 (7%)
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
Y PT RGFD+ YGY+ + Y+ K F G D N + G Y T L T+
Sbjct: 1 YLPTRRGFDTFYGYYNTAKYYFTQ--KLNFYNICGRDYWDNETLVTDKDGTYDTHLITDR 58
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A+++I+ H+ KP+F+Y+A LA H T AP +AKF I D R AG V L
Sbjct: 59 AVDIISSHDPDKPLFIYMAPLAAHGQTTNLTVDAPQRNLAKFPYIGDHNRTLLAGAVDAL 118
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
DESVG ++AAL + ML NS+++F +DNG +G SN GSN PLRG K T W+GG+R
Sbjct: 119 DESVGRIVAALHQRRMLANSVLVFTSDNGGIPWGDFSNTGSNWPLRGTKGTMWEGGVRLP 178
Query: 321 AAIWSPWLKQTQK-VSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTK- 377
A +WS L + V H+ DWLPTL +AAG I D ++DG++ W VL+ +
Sbjct: 179 AVVWSSMLAPGSRVVVPHPMHLVDWLPTLYSAAGGHIGDLGNIDGIDLWPVLSSAKTSSK 238
Query: 378 -----RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW-YGDTDNEIDKYSP 431
R+E L N D +A R D K VA T + + +GD + + +P
Sbjct: 239 STSWPRTEFLINYDT----DTGLSAYRDGDYKLVAVTTPKSKDARYPFGDPGLQRNVPTP 294
Query: 432 KEV---LYSKAGI----TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFA 484
+ K I T A + Q D + +++L E R A
Sbjct: 295 GGIPPDRCRKGAIKRLNTLMASSLAWRTLQGFYGDGEIGRAESLAP--------EWRVQA 346
Query: 485 RVRCNYDNKGAHCNSTVKPC--LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
VRCN + ++ LF+I DPCE NNLA S ++ +L KL Y + +
Sbjct: 347 SVRCNPRKETVPGLEQLREGNYLFDIKRDPCELNNLAHSLPSVVAKLSKKLQSYVAITMA 406
Query: 543 PGNKPFDKRADP 554
P N D P
Sbjct: 407 PANITMDPAGLP 418
>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
Length = 466
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 131/383 (34%), Positives = 209/383 (54%), Gaps = 30/383 (7%)
Query: 13 CTLLFNDAFLNT---TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
LF L T +A ++P+II+IL+DD G+ D +GS IPTP+IDALA G+
Sbjct: 7 IVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGGVRFT 66
Query: 70 QHYVQA-LCTPSRSALMTGKYPIHIGMQHGVI------LEGEPWGLPLTEKLLPQYLKEA 122
YV A +C PSR+ L+ G+Y G +H V + + GL T + + ++
Sbjct: 67 NSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQMQDIGLSDTVRTIADQMQSN 126
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
GY T AIGKWH G + P +GF+ +G+ G + ++ + + + + +
Sbjct: 127 GYETMAIGKWHQGN-ETKHHPLHKGFNHFFGFIGGHRSFF--PIRTAIKQEEKI-LNDYT 182
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
+VD K + Y TD++T+ AI+ + + + KP F+YL++ AVH P +A +++A+F
Sbjct: 183 EVDEKDV-YYLTDMFTDKAISYMRQ-KRDKPYFIYLSYNAVHT-----PVEATPQKLAQF 235
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
+ D +RR+YAG++S +D+ +G ++ L+K G LEN++++F+ DNGA + +N N
Sbjct: 236 AHLKDAQRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT----NNGADN 291
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQ--TQKVSSELFHISDWLPTLCAAA-GIEINDT 359
PLRG+K + W+GG+R + W KQ K L D +PT AAA G +D
Sbjct: 292 GPLRGLKGSKWEGGIR--VPMIMHWPKQLPAGKDYKGLVSSLDLVPTCVAAAQGTVRSDM 349
Query: 360 SLDGVNQWDVLTKGAKTKRSEIL 382
LDGVN LT K + + L
Sbjct: 350 QLDGVNILPFLTGKNKQEPHDFL 372
>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 465
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 133/376 (35%), Positives = 194/376 (51%), Gaps = 46/376 (12%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
+T + F L A KP+I+++ ADD G+ D F GS + TPN+D LA +G+
Sbjct: 3 VITFLICFFGIHLVVNAQNKPNIVLLFADDAGYMDFGFQGSKVMKTPNLDKLAKSGVTFT 62
Query: 70 QHYVQ-ALCTPSRSALMTGKYPIHIG---------MQHGVILEGEPWGLPLTEKLLPQYL 119
Q YV C PSR+ +MTGKY G M L+G+ GLPL +KL+ YL
Sbjct: 63 QGYVSDPTCGPSRAGMMTGKYQARFGYEEINVPGYMSSHSALKGDEMGLPLDQKLMSNYL 122
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYY-------DHSCKATFEP 172
K+ GY T GKWHLG + + P RGFD YG+ G + Y+ D + F
Sbjct: 123 KDLGYKTAVYGKWHLG-NADRFHPLNRGFDEFYGFRGGARSYFAYKNPTGDRKMETNFGQ 181
Query: 173 YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
Y+ + H Y+TD++ E A++ I E NK P F+YL+ AVH P
Sbjct: 182 YE--EPNH-----------YATDVFAEKAVHFI-ERNKEHPFFIYLSFNAVHT-----PM 222
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
+A + ++A+F +++ +R+ A M LD + G V+ L++ G+ N+IV+F DNG PS
Sbjct: 223 EATEADLAQFPNLTG-KRQQLAAMTLALDRACGKVLNKLKELGIDRNTIVVFSNDNGGPS 281
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCA 350
N +N PL G KS +GG+R W K+ +K S + + D LPT A
Sbjct: 282 ---DKNASNNKPLSGTKSNHLEGGIR--VPFLMSWPKEIKKKSRFNYPVVTFDLLPTFYA 336
Query: 351 AAGIEIND-TSLDGVN 365
AAG ++ D ++GVN
Sbjct: 337 AAGGDLKDLKDINGVN 352
>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
Length = 540
Score = 202 bits (514), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 126/390 (32%), Positives = 210/390 (53%), Gaps = 43/390 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+PHII L DDLGWN SFH +++I TP + L NG+ L HY C+P+R++ +TG
Sbjct: 28 KQPHIIFFLTDDLGWNAPSFH-NNEIITPTLHHLHANGVELYSHYTYMFCSPTRASFLTG 86
Query: 88 KYPIHIGMQHGVILE-GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
++P M + +L GL L+ L LK+A Y+TH IGKWHLG +++ YTP +R
Sbjct: 87 RFPYKHEMTNTNLLPPTRMLGLDLSYTTLADKLKQANYSTHHIGKWHLGMYKKEYTPRYR 146
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
GFD+ +G+ G +++Y + F +D+ G ++ ++T+AA+++I
Sbjct: 147 GFDTTFGFLTGGENHY---TQRAFVSPPAVDLWDEEAPAYGMNGTWTGKMFTDAALDIIR 203
Query: 207 EHNK-------SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE---RRTYAGM 256
+ + + P+F+Y A VHA P Q+P +A+ + + + + T+ GM
Sbjct: 204 NNAQLRNATGDAPPLFIYFALHDVHA-----PTQSP-VRLAQLYNFTGDDAGLKSTFYGM 257
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---------------GS 301
+S +DE+V NV AL+ GM ++++++ +DNG+P+ G + GS
Sbjct: 258 ISGVDEAVRNVTDALKAAGMWNDTLLVWTSDNGSPTMGSFNGGCKNVVPDSRGGACMCGS 317
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIEIND- 358
N+PLRG K T ++GG+R + P L K L H+ DW TLC+ AG++ D
Sbjct: 318 NYPLRGSKHTNFEGGVRTPTFVSGPVLPAHMRGKRLDGLMHVCDWYSTLCSVAGVDPADP 377
Query: 359 ---TSLDGVNQWDVLTKG-AKTKRSEILHN 384
LD ++ W ++ +++ R I+H+
Sbjct: 378 GGIAPLDAIDMWPYISGAVSESPRDFIVHD 407
>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
Length = 534
Score = 201 bits (512), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 187/399 (46%), Gaps = 56/399 (14%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
L A KPH++ +L DD+G+ DV + S + +P +D LA GL L +HY C PSR
Sbjct: 19 LARAADTKPHLLFVLGDDVGFGDVGYS-SPDVISPTLDRLAAEGLKLGRHYAYMWCAPSR 77
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLGFFREV 140
+AL+TG YP G+ G LPL LLP L++ AGY T A+GKWHLGF E
Sbjct: 78 AALLTGYYPSTTGVYS---TSGAQNALPLEFALLPGLLRDRAGYRTAAVGKWHLGFMSEA 134
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDH------SCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
P RGFD +G+ G +D+Y C F+ + R D G YS
Sbjct: 135 DLPERRGFDGFFGFLDGGEDHYSRVGAGAPGCDRVFDLWDS-RRRGPATDDPSAFGRYSA 193
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHA----------------------------- 225
+LY EAA + I H+ ++P+FLY A H+
Sbjct: 194 ELYGEAAADAIRGHDAAEPLFLYAAFQVAHSPLEVPPAKYFDAYADAALCANASTKGFAC 253
Query: 226 --GNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
P++AP+ + +R A MV+ LD +V N+ AL M + +++
Sbjct: 254 ARPGADAPYRAPNHDCGC-------DRLVVAAMVTALDAAVANMTTALASKNM-ADFVLV 305
Query: 284 FMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISD 343
+ DNG P H N G LRG K T ++GG+R A + SP L + + L H+ D
Sbjct: 306 YSGDNGGPEIEGHWNGG----LRGGKWTWFEGGVRPAAFVHSPLLTRRGWHNGTL-HLVD 360
Query: 344 WLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
WLPT A AG +D + DG +QW L GA R L
Sbjct: 361 WLPTFLALAGAARDDDAFDGADQWASLAFGAAPAREATL 399
>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
Length = 913
Score = 201 bits (512), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 129/389 (33%), Positives = 205/389 (52%), Gaps = 38/389 (9%)
Query: 18 NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALC 77
N A L + A KPHI+ L DDLGW+DV +H S + TP + ALA +G++L++ Y C
Sbjct: 399 NAAPLPSDAAPKPHIVYFLVDDLGWHDVPWHNPS-LKTPTLAALAADGVVLDRFYAYRFC 457
Query: 78 TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+PSRS+L++G+YP+H+ Q+ + + G+ + + + LK AGYATH +GKWH G
Sbjct: 458 SPSRSSLLSGRYPMHVN-QYNMAGDALGGGVHVNMTTIAKKLKGAGYATHQLGKWHAGQS 516
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYS 193
P RGFD+ GY G +D++ + P G+ ++ + G Y
Sbjct: 517 SADLVPAARGFDTSLGYLNGAEDHWTQA-----RPACGVGNFVDLYATDGPAFGKNGTYG 571
Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE---- 249
+Y +AA+++IA+H+ S P+F+Y A HA P Q PD A+ DP+
Sbjct: 572 AQIYHDAALDIIADHDASVPLFVYFAFQINHA-----PMQVPD-AYARVYPCDDPKTCGV 625
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
R TY M + D+ V N AAL+ GM ++++++ +DNG PS G ++ +N PLRG K
Sbjct: 626 RSTYQAMTAMADDVVRNATAALKAKGMWDDTLIVLTSDNGGPS-GTDADSSNNFPLRGGK 684
Query: 310 STPWDGGMRGVAAIWSPWL-----KQTQKVSSELFHISDWLPTLCAAAGIEINDT----- 359
+ +GG+R + +L +T + HI DW T AG++ D
Sbjct: 685 YSDLEGGVRAAGLVAGGFLPDAARGRTLDGAGAYVHICDWYATFARLAGVDAADDASDAD 744
Query: 360 -----SLDGVNQWDVLTKGA-KTKRSEIL 382
++D ++ WD L+ A + R+EI+
Sbjct: 745 GRPLPAVDSLDVWDFLSGAADASPRTEIV 773
>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 489
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 136/409 (33%), Positives = 206/409 (50%), Gaps = 53/409 (12%)
Query: 7 YFFALTCTLLFNDAFLNTTAP---KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
Y FA+ L A ++ + PHI+ I+ADDLGWNDV FHGS I TPNID LA
Sbjct: 20 YAFAILGCFLGAGAKVSAETDVEMQPPHIVHIIADDLGWNDVGFHGS-DIRTPNIDRLAR 78
Query: 64 NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
++L++ YV +C+P+R+ ++TG YP G+ GV+ + GLP + P++L + G
Sbjct: 79 ESVVLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPTKKHGLPSELETTPEHLAKLG 138
Query: 124 YATHAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
Y A+ GKWHLG ++ P G YG++ G DY+ + ++ D H
Sbjct: 139 YDHRAMFGKWHLGLASTLFHPLRHGMTEFYGHYNGAIDYFSRERFGQLDWHRNHDSVHEE 198
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA-------- 234
YST+L A ++ I H +P++ Y+A A H+ P QA
Sbjct: 199 G--------YSTELVGNAVVDFIDRHAGQQPLYAYVAFNAPHS-----PLQALRSDLDDY 245
Query: 235 -----------PDEEVAKFLDI--------SDPERRTYAGMVSRLDESVGNVIAALRKHG 275
D ++AK + + R+T+A M + +D +G ++ A+ ++G
Sbjct: 246 GFDPTGKLAPNTDRKIAKRENALEYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNG 305
Query: 276 MLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKV 334
M EN++V+F +DNGA P G GSN PLRG K T W+GG+R VA + P
Sbjct: 306 MRENTLVVFHSDNGADPKHG-----GSNEPLRGNKFTTWEGGVRVVATMRWPREFPAGVT 360
Query: 335 SSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT-KGAKTKRSEIL 382
+ D LPT+ AAAG + + + DGVN L+ K A +R+ +L
Sbjct: 361 YESVASYVDLLPTMVAAAGGQPPERT-DGVNLLPFLSGKAAPPERTMLL 408
>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 445
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 133/372 (35%), Positives = 203/372 (54%), Gaps = 34/372 (9%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
YFF L L + + +P+I++I+ADDLG+ D+S +G+ I TPN+D LA G+
Sbjct: 5 YFFLLMSAWLLSCGGEKESLSMRPNIVLIMADDLGYGDLSCYGNEYINTPNLDLLASEGV 64
Query: 67 IL-NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEG-EPWGLPLTEKLLPQYLKEAGY 124
+ + H ++C+P+R+ALMTGKY G++ V + GL L E L + LK+ GY
Sbjct: 65 LFTDYHSNGSVCSPTRAALMTGKYQQRTGVEGVVTAKSHRDVGLALAEVTLAEELKQLGY 124
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
T GKWHLG + + + PT +GFD G+ G DY+ H + + LD +++
Sbjct: 125 NTGMFGKWHLG-YDKAFNPTLQGFDEFVGFVSGNVDYHGHIDQEGY-----LDWWDGVKI 178
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHN---KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
N+ Y+TDL +E + I EHN K P FLYL H A H+ P+Q ++V +
Sbjct: 179 KNEK--GYTTDLISEYGVKFIQEHNPEVKRAPFFLYLPHEAPHS-----PYQRRIDKVLR 231
Query: 242 FLD-------ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ I D Y MV +DE VG ++ +L++ G EN+IV+F++DNGA +
Sbjct: 232 EIGTAGTQEVIQDSISSIYKEMVEVMDEGVGRIMQSLKETGQYENTIVIFISDNGANHY- 290
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-PWLKQTQKVSSELFHISDWLPTLCAAAG 353
G N LRG K+ P++GG R V AI+S P + V+++ D LPTL G
Sbjct: 291 -----GDNGGLRGFKAGPYEGGSR-VPAIFSFPKEVKGGSVNNQTVLSMDLLPTLLDFIG 344
Query: 354 IEINDTSLDGVN 365
+ + TS+DG++
Sbjct: 345 RKPS-TSVDGIS 355
>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
Length = 545
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/387 (31%), Positives = 197/387 (50%), Gaps = 42/387 (10%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
+ C +L A + +KP+I+I++ADDLGW DV +HG I TP++D LA G+ LN+
Sbjct: 17 MMCVILGVPA---SAQSQKPNILIMVADDLGWADVGYHGGD-IDTPSLDRLAQQGVRLNR 72
Query: 71 HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
Y +C+P+R+ALMTG+ PI +G+ +GVI + G+ E +P+ + AGY T IG
Sbjct: 73 FYTTPICSPTRAALMTGRDPIRLGVTYGVIFPWDNIGVHPDEHFMPETFQAAGYQTAIIG 132
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
KWHLG + Y P RGF+ YG+ +Y + +Q R+ + +D++
Sbjct: 133 KWHLGHAQMTYHPNNRGFEHFYGHLHTEVGFYPPFSNQGGKDFQ----RNGVSIDDQG-- 186
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD--- 247
Y T L + I E ++ +P +Y+ +A H P AP E K+ DI
Sbjct: 187 -YETYLLADEVSRYIRERDRDRPFLVYMPFIAPHT-----PLDAPVELQDKYKDIETDLP 240
Query: 248 --------------------PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMAD 287
R YA +V +D+++G V+ L + G+ +N+IVLF +D
Sbjct: 241 MARSRQTDDTRLISRVMLQPSARPMYAAVVDAMDQAIGRVLDTLDQEGISDNTIVLFFSD 300
Query: 288 NGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPT 347
NG ++ +N PLRG K ++GG+R + + P + + ++ ++ + D PT
Sbjct: 301 NGGAAYSY--GGANNAPLRGGKGETFEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPT 358
Query: 348 LCAAAGIEI-NDTSLDGVNQWDVLTKG 373
L AA + N+ +LDG + W L G
Sbjct: 359 LVDAADVRPGNNFALDGRSMWTALKSG 385
>gi|301311962|ref|ZP_07217884.1| sulfatase family protein [Bacteroides sp. 20_3]
gi|423339482|ref|ZP_17317223.1| hypothetical protein HMPREF1059_03148 [Parabacteroides distasonis
CL09T03C24]
gi|300830064|gb|EFK60712.1| sulfatase family protein [Bacteroides sp. 20_3]
gi|409230863|gb|EKN23724.1| hypothetical protein HMPREF1059_03148 [Parabacteroides distasonis
CL09T03C24]
Length = 459
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 142/387 (36%), Positives = 202/387 (52%), Gaps = 25/387 (6%)
Query: 6 KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
K LT +L N LN + A KP+ +II DD+G+ D+S +G+ I TPNID +A
Sbjct: 5 KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
G+ L Q YV A + TPSR+ALMTG+ P+ G+ + V+ GL E + + L
Sbjct: 65 EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
++ GYAT +GKWHLG F Y PT GFD+++G Y + + A P L
Sbjct: 125 QQNGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVRNKGAHARNFPPTPL- 182
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ Q++++ T YTE A++ I +H+K +P FLY AH H P
Sbjct: 183 ILDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
A+F S +R Y +V +D SVG V+ ALR++G+ EN+ V+F +DNG P H
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
N GS PL+ K T W+GG R A W K ++ E+ D PT + AGIE
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWM-LGKINPAINDEIMTSMDLYPTFLSMAGIEQP 348
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D LDGVNQ +L + + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375
>gi|187735676|ref|YP_001877788.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
gi|187425728|gb|ACD05007.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
Length = 465
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 144/433 (33%), Positives = 217/433 (50%), Gaps = 41/433 (9%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
+ C L A P++I+ILADDLG+ D+ GS QI TP++D LA G+ ++
Sbjct: 10 ILCGLFCGTAVAQPRLSSPPNMIVILADDLGYGDLGCTGSKQIKTPSLDRLAREGVFCSR 69
Query: 71 HYVQA-LCTPSRSALMTGKYPIHIGM------QHGVILEGEPWGLPLTEKLLPQYLKEAG 123
YV A +C+PSR L+TG++P G+ Q + E +GLP TEKL+P+YL G
Sbjct: 70 AYVTAPMCSPSRMGLLTGRFPKRYGITTNPNIQMDYLPESH-YGLPQTEKLIPEYLAPCG 128
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
Y + GKWHLG + YTP RGF +G+ G + Y+ +A +GL+ +M
Sbjct: 129 YRSAVFGKWHLGHTKG-YTPPERGFTHWWGFLGGSRHYFPVKKEA-----EGLN--PSMI 180
Query: 184 VDN---KTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
V N KT Y TD T+ A+ + E K KP F+++++ A H N +P E++
Sbjct: 181 VSNFTDKTDITYLTDDITDRAVEFLQEAGKDKKPFFMFVSYNAPHWPNEAKP-----EDI 235
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
AKF ++ + ERR Y MV +D +G ++ AL+ G+ +++IV+F++DNG G
Sbjct: 236 AKFRNVQNGERRVYCAMVYAMDRGIGRILDALKADGLEKDTIVVFLSDNG----GAPEAS 291
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQ--TQKVSSELFHISDWLPTLCAAAGIEIN 357
N P RG K ++GG+R I P K+ V + D LP L A G I
Sbjct: 292 SCNAPFRGAKRQHFEGGVRVPFIIRYPADKRLVPGSVCRQPVSSVDLLPALLKANGRHI- 350
Query: 358 DTSLDGVNQWDVL-TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
LDG++ +++ KGA R+ Y +A+ D+KY+ D Q
Sbjct: 351 PRKLDGMDILELVGNKGAPVPRTFFWCT--------DYTSAVLTGDMKYLLVPDRAPQFY 402
Query: 417 EWYGDTDNEIDKY 429
D + D Y
Sbjct: 403 NVADDPQEQRDLY 415
>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 472
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 136/412 (33%), Positives = 205/412 (49%), Gaps = 52/412 (12%)
Query: 9 FALTCTLLFNDAFLNTTA---PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
FA+ F A ++ A + PHI+ I+ADDLGWNDV FHGS +I TPNID LA
Sbjct: 5 FAIVACFAFVGANVSAEANGGSQPPHIVHIIADDLGWNDVGFHGS-EIRTPNIDRLANES 63
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ L++ YV +C+P+R+ ++TG YP G GV+ + GLP + P++L + GY
Sbjct: 64 VTLDRFYVTPICSPTRAGVLTGLYPFRFGFWGGVVSPTKKHGLPPQLETTPEHLSKLGYD 123
Query: 126 THAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
A+ GKWHLG ++ P G YG++ G DY+ + ++ D H
Sbjct: 124 HRAMFGKWHLGLASTLFHPLQHGMTEFYGHYNGAIDYFSRERFGQLDWHRNFDSVHEEG- 182
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA---------- 234
YST+L A ++ I + + P++ Y+A A H+ P QA
Sbjct: 183 -------YSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDEYGF 230
Query: 235 -PDEEVAKFLD--ISDPE--------------RRTYAGMVSRLDESVGNVIAALRKHGML 277
P+ ++A D I+ E R+T+A M + +D +G ++ A+ ++GM
Sbjct: 231 DPNNKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMR 290
Query: 278 ENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
EN++V+F +DNGA P G GSN PLRG K T W+GG+R VA + P
Sbjct: 291 ENTLVVFHSDNGADPKHG-----GSNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYD 345
Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ D LPT+ AAAG + + DG+N L+ A IL + + V
Sbjct: 346 SVTSYVDLLPTMVAAAGGPPPEET-DGINLLPFLSGKASPSERTILLDAETV 396
>gi|298715187|emb|CBJ27859.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 610
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 138/390 (35%), Positives = 206/390 (52%), Gaps = 59/390 (15%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCTPSRSA 83
+A +PH+I IL DD+G ND+ + + TP +D+LA +G++L+ +Y LCTPSR++
Sbjct: 65 SASGQPHVIFILIDDVGTNDMGYRSTDLWELTPFLDSLASSGVVLDNYYTNQLCTPSRAS 124
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
LMTG+ GMQHG++ PWGLP E L K AGY+TH GKWHLG F + P
Sbjct: 125 LMTGRDSFRTGMQHGIVDYSAPWGLPFEEVTLADRFKAAGYSTHMTGKWHLGVFSDASYP 184
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEP-YQG-----------------LDMRHNMQVD 185
RGFD+ GY G + YY+HS F P ++G +D N +
Sbjct: 185 FSRGFDTFLGYTGGGEGYYNHS--TCFTPTFEGGEYSCLKDFGYGDEDGYIDYTTNTTKE 242
Query: 186 NKT-IGIYSTDLYTEAAINVIAEHNKSK----PMFLYLAHLAVHAGNTYEPFQAP----- 235
+ YST + T+ AI+V EH + P+FLY+A+ A H P AP
Sbjct: 243 GPAMVDNYSTTIMTDRAIDVAREHTGTASSDDPLFLYVAYQAAHT-----PLDAPPPGTF 297
Query: 236 ----DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
+E +A+ + +P+R +A ++ LD+ V + AL G+L+N++++ +DNGA
Sbjct: 298 TEEEEETLARVVS-DEPDRHVFARIIYYLDKEVRRLHDALEALGVLDNAVIVLASDNGAC 356
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE--------LFHISD 343
S S+ GSNHPLRG K+T + GG+R A ++S K T+ + E + H +D
Sbjct: 357 S----SSGGSNHPLRGFKNTTFQGGVRVPAFVYS---KSTELIPEEARGTTYSGMMHSTD 409
Query: 344 WLPTLC-AAAGIEINDTS--LDGVNQWDVL 370
W TL G+ + L G +QWD +
Sbjct: 410 WTQTLGRVVPGLPLAGMGKELSGFDQWDAI 439
>gi|260816809|ref|XP_002603280.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
gi|229288598|gb|EEN59291.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
Length = 377
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/224 (46%), Positives = 141/224 (62%), Gaps = 13/224 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KPHI++I+ADDLGW+DVS++ + + TPN+ LA G+I NQ Y Q C+PSR+AL+TGK
Sbjct: 24 KPHILLIVADDLGWSDVSWN-NPYVVTPNLHTLATTGVIFNQTYAQPTCSPSRTALLTGK 82
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+P +GMQ V+ +P GLPL E+LLPQ LK+ GYATH +GKWHLG + YTPT RGF
Sbjct: 83 FPFRLGMQR-VMDSKKPHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGF 141
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
DS YGY G QDYY H +GLD + G+YST+ + A N+I++H
Sbjct: 142 DSFYGYHHGSQDYYTHKSA------RGLDFWDGKTSISDQNGVYSTESFATRAENIISQH 195
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
+ + P+FLYL +VH P Q P + F I D R++
Sbjct: 196 DPNTPLFLYLPFQSVHT-----PHQVPSSYLQTFSTIQDDNRKS 234
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 84/210 (40%), Gaps = 72/210 (34%)
Query: 356 INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQS 415
I D LDG N W ++ GA + R+E +++ID+ N +Y A+R+ D K V G N
Sbjct: 235 ILDPGLDGKNMWQSISTGADSPRNEFVYDIDSGAN--RY--AIRIGDYKLVTG--NRAPK 288
Query: 416 DEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDE 475
D+W Y+P E+ + GI
Sbjct: 289 DDW----------YAPAEM--QQMGI---------------------------------- 302
Query: 476 KILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKL 533
L E A D+ G P LFNI +DP E++NL A + + + +L
Sbjct: 303 ----LTEIA------DDTG--------PFLFNIKEDPLEKSNLYNAAEYASIQRAMRRRL 344
Query: 534 AIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
K VP + D ADP+ + N+W+P
Sbjct: 345 NELKPFEVPFPDLADDPAADPSNYGNVWMP 374
>gi|449680623|ref|XP_004209633.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 327
Score = 199 bits (507), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 134/374 (35%), Positives = 196/374 (52%), Gaps = 63/374 (16%)
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT-YEPFQAPDEEVAKFLDISDPERRTYA 254
+YT AI+ I HN S+P+FLYLA AVH+ N+ EP QAP E + KF I +RR YA
Sbjct: 1 MYTSKAISYITNHNCSEPLFLYLAFQAVHSSNSEVEPLQAPQEWIDKFSYIKHEQRRKYA 60
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
M++ +D +G V AL + ML+NS+++F DNG PS G ++N +N PLRG+K+T ++
Sbjct: 61 AMLAYMDYGIGRVHEALIEKKMLDNSVIIFTTDNGGPSNGFNNNWANNFPLRGVKATLFE 120
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKG 373
GG+RGV ++S ++++ +VS +L H++DWLPTL + AG ++D S LDG +QW L
Sbjct: 121 GGVRGVGFVYSKLIEKS-RVSHDLIHVTDWLPTLVSLAGGNVSDISFLDGFDQWATLQNK 179
Query: 374 AKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE 433
++R+EIL NID + ALRV K + N D WY
Sbjct: 180 QSSQRNEILLNIDEHVWKNE---ALRVGSWKLIKEGLKNW--DGWY-------------- 220
Query: 434 VLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNK 493
P N S DE + + ++C +D
Sbjct: 221 ------------------------PPPSYNES-------MDESL----NSSILKCGHDFP 245
Query: 494 GA--HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP-GNKPFDK 550
HC+ CLF++ +DPCE +L++ D+L + +L IYK +MVPP N D
Sbjct: 246 VVINHCDDY---CLFHVVEDPCEIVDLSKKFPDVLATMMYRLNIYKQSMVPPRNNMTIDS 302
Query: 551 RADPARWNNIWVPW 564
R+DP N +W PW
Sbjct: 303 RSDPKLHNGVWQPW 316
>gi|298706368|emb|CBJ29377.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 653
Score = 199 bits (506), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 165/590 (27%), Positives = 271/590 (45%), Gaps = 85/590 (14%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
F ++ + P++++++ DDLGW DV FH ++ TPN+DA+ G+ L+ Y CTP
Sbjct: 27 GFGSSATTENPNVLVVVVDDLGWKDVGFHDTT-FSTPNLDAMVAEGVELSTFYTAPTCTP 85
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
SR+ LMTG+Y IGMQ V+ EP G+PLTE + + L+ AGY+T A+GKWHLG
Sbjct: 86 SRAQLMTGRYSYRIGMQDSVLHTTEPRGVPLTETFVGEKLQAAGYSTAAVGKWHLGMHMP 145
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEP---------YQGLDMRHNMQV--DNK 187
+ P RGFD YG G +Y H S F P + G ++ +V DN+
Sbjct: 146 QFLPVERGFDDFYGILTGGGGHYTHMSVSPEFTPRGGSGKTRTFSGPNIVEGSEVSDDNQ 205
Query: 188 TIGIYSTDLYTEAAINVIAEHN--KSKPMFLYLAHLAVH----------AGNTYEPFQAP 235
+ST+LYT+ A + + + +P FLYL++ A+H G + E A
Sbjct: 206 DTRTHSTELYTKKAAEYVEAMSAKEDQPWFLYLSYQAIHDPIETDDSWINGRSCESISAE 265
Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
+ + + + RR GMV+++D +G + L G +N++V+F +DNG G+
Sbjct: 266 NSSIGETDETDYDNRRIACGMVAQMDNGLGALRGLLENLGEWDNTVVIFFSDNG----GL 321
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS-------SELFHISDWLPTL 348
S+ N P RG K W+GG+ A + ++ + + L H++D T+
Sbjct: 322 PSHGSVNRPFRGGKGDYWEGGVHVPAFVSGGFVSAALRRTGVEPYRYGHLTHVTDVHATI 381
Query: 349 CAAAGIEIND------TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDD 402
AG+ D LDG++ WD L + R ++ N+++ + A+R +
Sbjct: 382 LGLAGVPGADDGGNGEAELDGIDLWDSLVETKAPARQAVVINLNSPNFASS--GAVRWSN 439
Query: 403 LKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE-VLYSKAGITFNALKTKLQIKQKHAADPK 461
K + + PKE V+YS+ +++KL ++ +
Sbjct: 440 YKLIRNPE--------------------PKETVIYSR-------VQSKL--VEEEGVVSE 470
Query: 462 ANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQ-NNLAE 520
+ + A+ T+ + E + + +D + S V+ C + P E NL
Sbjct: 471 STLASAISTVQAQIHLSEPKMYM-----FDLEANPSESVVEGCGTDRALGPVESCANLYS 525
Query: 521 --SQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR--ADPARWNNIWVPWYD 566
+ D K+LE + V P + D ADP+ W WVPW D
Sbjct: 526 IPAFRDARKKLEGIIRKADRESVAPTLRWEDDGPLADPSNWGG-WVPWRD 574
>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 472
Score = 199 bits (506), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 135/414 (32%), Positives = 205/414 (49%), Gaps = 52/414 (12%)
Query: 7 YFFALTCTLLFNDAFLNTT---APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
+ FA+ F A ++ + PHI+ I+ADDLGWNDV FHGS +I TPNID LA
Sbjct: 3 FPFAIMTCFAFVGANVSAETNGGSRPPHIVHIIADDLGWNDVGFHGS-EIRTPNIDRLAS 61
Query: 64 NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
+ L++ YV +C+P+R+ ++TG YP G+ GV+ + GLP + P++L + G
Sbjct: 62 ESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPTKKHGLPPQLETTPEHLSKLG 121
Query: 124 YATHAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
Y A+ GKWHLG ++ P G YG++ G DY+ + ++ D H
Sbjct: 122 YDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAIDYFSRERFGQLDWHRDFDSVHEE 181
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA-------- 234
YST+L A ++ I + + P++ Y+A A H+ P QA
Sbjct: 182 G--------YSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDEY 228
Query: 235 -----------PDEEVA---KFLDI-----SDPERRTYAGMVSRLDESVGNVIAALRKHG 275
D ++A K LD + R+T+A M + +D +G ++ A+ ++G
Sbjct: 229 GFDPTGKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNG 288
Query: 276 MLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKV 334
M EN++V+F +DNGA P G GSN PLRG K T W+GG+R VA + P
Sbjct: 289 MRENTLVVFHSDNGADPKHG-----GSNEPLRGNKFTTWEGGVRVVAMMRWPRELPAGIA 343
Query: 335 SSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ D LPT+ AAAG + + DG+N L+ A IL + + V
Sbjct: 344 YDSVTSYVDLLPTMVAAAGGPPPEET-DGINLLPFLSGKASPPERTILLDAETV 396
>gi|443716207|gb|ELU07840.1| hypothetical protein CAPTEDRAFT_136127, partial [Capitella teleta]
Length = 311
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 176/310 (56%), Gaps = 18/310 (5%)
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
SR + ++G+Y MQHGVIL+ +P L L +LP YLKE GY THA GKWHLG+ R+
Sbjct: 1 SRGSFLSGRYSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRD 60
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
TPT RGFDS G + G ++Y+H+ AT Y D +VD IG +S DL
Sbjct: 61 ECTPTHRGFDSFSGGFSGEGEFYEHT-TATGGYY---DWHLGTEVDYDAIGKHSEDLIGY 116
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRTYAGMV 257
+ E+++S P+F+Y+A VH+ P E +A + LD++D +R+ Y G+V
Sbjct: 117 YVNKTLDEYDQSSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTD-DRKKYLGLV 170
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
S +D +G ++ +++ G+ EN+ +LF +DNG +G N P RG KST W+GG
Sbjct: 171 SGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGD-----VGEGDNSPHRGGKSTLWEGGC 225
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKT 376
+ + SP L + ++ L H++DWLPT+ AG + LDG Q D++ +
Sbjct: 226 KANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAGGSVPASDGLDGFVQTDMVINNGDS 285
Query: 377 KRSEILHNID 386
R+ +++NID
Sbjct: 286 ARTTMIYNID 295
>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 470
Score = 198 bits (504), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 131/372 (35%), Positives = 194/372 (52%), Gaps = 30/372 (8%)
Query: 8 FFALTCTLLFNDAFLNTTAPKK------PHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
F L L+F+ + + P P+ ++I+ADDLG+ D+SF GS+Q+ TP+ID L
Sbjct: 7 FLFLQSLLIFSLSAQIISTPSTSGRNSHPNFVLIVADDLGYGDLSFTGSTQVKTPHIDEL 66
Query: 62 AYNGLILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEP------WGLPLTEKL 114
A +G+ + YV A+C+PSR+ L+TG+ + G + + +P GLP+ K
Sbjct: 67 AASGVFFPEGYVSSAVCSPSRAGLLTGRNQVSFGYDNN-LANSQPGFDPAFLGLPVNVKT 125
Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ 174
+ +LK+ GY T +GKWHLG+ + ++P RGFD +GY G DY++ S E +
Sbjct: 126 VGDHLKKLGYVTGLVGKWHLGY-EDQFSPLNRGFDEFWGYLGGGHDYFEAS-----EAKR 179
Query: 175 GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA 234
G + Y TD + IN I H K +P FLY + A H P QA
Sbjct: 180 GYKAKIKCNYKTPQEITYITDDKGDECINFIQRH-KDEPFFLYASFNAPHT-----PMQA 233
Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
E++A + I D +RRTYA MV RLD +VG ++ A+ + +LEN+I +F++DNG P
Sbjct: 234 TAEDLAIYQHIEDRKRRTYAAMVHRLDINVGRIVEAVEEADLLENTIFVFISDNGGP--- 290
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
+ +N N P G K T +GG+R I P + KV D PT A AG
Sbjct: 291 VMTNGSINAPYNGKKGTLLEGGIRVPFFISWPGHLEPGKVFEHPVSSLDLTPTFVALAGG 350
Query: 355 EINDT-SLDGVN 365
E + DG+N
Sbjct: 351 ESKEEDQFDGLN 362
>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 467
Score = 198 bits (504), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 153/418 (36%), Positives = 196/418 (46%), Gaps = 51/418 (12%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F L + L D TAP +P+ I ILADDLGW DV FH +PTPN+D LA GL
Sbjct: 24 LFGLVVSSLAAD-----TAPLRPNFIFILADDLGWGDVGFH-HGNVPTPNLDHLAGEGLE 77
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
L QHYV +C+P+R A ++G+Y + V P L + LK GY T
Sbjct: 78 LMQHYVYPVCSPTRCAFLSGRY----ASRFSVTTPQNPRAFRWDTVTLARALKSVGYDTA 133
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
GKWHLG E + P GFD YG G +DH Y+ + D K
Sbjct: 134 LCGKWHLGSKPE-WGPQKFGFDHSYGSLAGGVGPWDHH-------YKIGEFTQTWHRDGK 185
Query: 188 TI--GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LD 244
I + TDL T+ A+ + E KP FLY+ AVH P + PDE + +
Sbjct: 186 LIEEQGHVTDLITKEAVEWL-ESRTDKPFFLYVPFTAVHI-----PIREPDEILQRVPAS 239
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK----- 299
I+ P R Y V LD+SVG ++ AL K G N++V+F +DNGA G+ +N
Sbjct: 240 ITKPSLRHYGANVMHLDDSVGKILVALEKTGKAGNTLVIFGSDNGAIP-GVENNDPLYPP 298
Query: 300 ---------GSNHPLRGMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLC 349
GSN PL GMK ++GG+ A A W LK + + L HI+DW+PT C
Sbjct: 299 DHYPPGPAGGSNEPLHGMKGEVYEGGIHTAAVARWPGQLKPGKFLG--LAHITDWMPTFC 356
Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
A AG + D DG N W LT K I V P ALR D K V
Sbjct: 357 ALAGYKPEKDLKWDGQNIWPQLTGAEPVKPRTIY-----VAGPGFRSKALRDGDWKLV 409
>gi|323450479|gb|EGB06360.1| hypothetical protein AURANDRAFT_13369, partial [Aureococcus
anophagefferens]
Length = 304
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 175/314 (55%), Gaps = 31/314 (9%)
Query: 77 CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
CTPSR +L++G+YP+ +G+Q GV+ WGL +LLP+ L+ AGY T +GKWHLG
Sbjct: 1 CTPSRGSLLSGRYPVRLGLQRGVVSAASKWGLDPAIRLLPETLQTAGYRTALVGKWHLGH 60
Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDH------SCKATFEPYQG---LDMRHNMQVDNK 187
+ E TP RGF YG+ G D++ H +C + G LD + N +
Sbjct: 61 YNESLTPARRGFGESYGFLGGFVDHFTHVAEAPPACASETPATDGTCALDWQVNGLPRPR 120
Query: 188 TIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDE----EVAKF 242
G+Y+ +AA ++A H K +P+FLY A H EP PD+ ++
Sbjct: 121 ATGVYAAYACRDAARRLVASHEGKEEPLFLYYALPTPH-----EPITPPDDVETYDLLPK 175
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
LD++ RR YA +V+ D ++G++ AAL+KH + N++++ ++DNGA + G GSN
Sbjct: 176 LDVAG-RRRDYAAVVAATDGAIGDLQAALQKHALWPNTLLVLLSDNGASAAG-----GSN 229
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDT- 359
PLRG K ++GG+R A + L Q + SS L H++DW PTL AAG+ +
Sbjct: 230 APLRGAKGGYFEGGVRTRALLAGSLLPQNVRGSSHEGLVHVADWAPTLLNAAGLRPDLVA 289
Query: 360 ---SLDGVNQWDVL 370
++DGV+QWD L
Sbjct: 290 GFDAMDGVDQWDAL 303
>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 498
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 130/368 (35%), Positives = 192/368 (52%), Gaps = 39/368 (10%)
Query: 4 ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
A F L +F P+I++ILADD+G+ D+ GS + TPN+D LA
Sbjct: 8 AVSRFVFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67
Query: 64 NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
+G++ +Q YV A+C+PSR+ L+TG+ P G + + E + GLP +EK L
Sbjct: 68 SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTL 127
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
+L AGYAT IGKWHLG E++ P RGFD G G Y+ + K E
Sbjct: 128 ADHLGAAGYATALIGKWHLGM-GEMHHPNRRGFDHFCGMLTGSHHYFPTTMKHVIE---- 182
Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPF 232
R+ +VD + Y TD +T+ + I +H N +P F++ ++ A H P
Sbjct: 183 ---RNGKRVDGFS-SEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNAPHT-----PM 233
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
A + ++A+F +I + +RRTYA M+ LD VG + L + G EN++++F +DNG
Sbjct: 234 HATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNG--- 290
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-----PWLKQTQKVSSELFHISDWLPT 347
G +N N PLRG+K + +GG+R V IW+ P + V S L D LPT
Sbjct: 291 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVISSL----DLLPT 344
Query: 348 LCAAAGIE 355
C+AAG E
Sbjct: 345 FCSAAGAE 352
>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
Length = 472
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 193/363 (53%), Gaps = 23/363 (6%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP++I+I++DD G+ D +G QIPTPNIDA+A G YV A +C PSR+ ++TG
Sbjct: 28 KPNVIVIVSDDAGYVDFGCYGGKQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTG 87
Query: 88 KYPIHIGMQH---GVILEG---EPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
+Y G +H V+ G G+ +E+ + ++ GY T AIGKWH G +
Sbjct: 88 RYQQRFGFEHNTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQG-DEPKH 146
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
P RGF+ YG+ G +D++ + K T E + +N ++ + Y TD++T+ A
Sbjct: 147 FPLNRGFNEFYGFTGGHRDFFAYKGKRTNEH----ALYNNKEIVPENEITYLTDMFTDKA 202
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+ I NK KP F+YL++ AVH P A + + ++ I+D RR YA M++ LD
Sbjct: 203 TSFITA-NKDKPFFMYLSYNAVHT-----PMNAKKDLMERYASIADTGRRAYAAMMTSLD 256
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
+ +G V+A L+ + + +N++++F+ DNG G N N PLRGMK + W+GG+R
Sbjct: 257 DGIGKVMATLKANQLDKNTLIIFINDNG----GATVNSSDNGPLRGMKGSKWEGGIRVAM 312
Query: 322 AIWSPWLKQTQKVSSELFHISDWLPT-LCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
+ P K S D LPT + A G + LDGVN L+ G K E
Sbjct: 313 MMKWPGHIAANKTDSRPVSSLDILPTAIGAGKGKQKGTKKLDGVNLLPYLSAGNKKTPHE 372
Query: 381 ILH 383
L+
Sbjct: 373 ALY 375
>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
Length = 579
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 133/412 (32%), Positives = 204/412 (49%), Gaps = 51/412 (12%)
Query: 8 FFALTCTLLF--NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
F +TC L N + + PHI+ I+ADDLGWNDV FHGS +I TPNID LA
Sbjct: 112 FAIMTCFALVGANVSAETNGGSQPPHIVHIIADDLGWNDVGFHGS-EIRTPNIDRLASES 170
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ L++ YV +C+P+R+ ++TG YP G+ GV+ + GLP + P++L + GY
Sbjct: 171 VTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPSKKHGLPPQLETAPEHLSKLGYD 230
Query: 126 THAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
A+ GKWHLG ++ P G YG++ G DY+ + ++ D H
Sbjct: 231 HRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAIDYFSRERFGQLDWHRDFDSVHEEG- 289
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA---------- 234
YST+L A ++ I + + P++ Y+A A H+ P QA
Sbjct: 290 -------YSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDEYGF 337
Query: 235 -PDEEVAKFLD--ISDPE--------------RRTYAGMVSRLDESVGNVIAALRKHGML 277
P+ ++A D I+ E R+T+A M + +D +G ++ A+ ++GM
Sbjct: 338 DPNNKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMR 397
Query: 278 ENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
EN++V+F +DNGA P G G+N PLRG K T W+GG+R VA + P
Sbjct: 398 ENTLVVFHSDNGADPKHG-----GNNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYD 452
Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ D LP++ AAG + + DG+N L+ A IL + + V
Sbjct: 453 SVTSYVDLLPSMVGAAGSPPPEET-DGINLLPFLSGKASPPERTILLDAETV 503
>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 462
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 134/395 (33%), Positives = 200/395 (50%), Gaps = 42/395 (10%)
Query: 1 MTWARKYF-FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
M W+R+ F F + T+LF+ A KP+I+ IL+DD G+ D FHGS + TP +D
Sbjct: 1 MKWSRRVFLFFASFTMLFS-----GWAQDKPNIVFILSDDAGYADFGFHGSKEFKTPELD 55
Query: 60 ALAYNGLILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVI--------LEGEPWGLPL 110
A N + +Q YV A+C PSR+ L+TGKY G + + L G+ GLPL
Sbjct: 56 KFAKNAVRFSQAYVSAAVCGPSRAGLLTGKYQQKFGFEENNVPGLMSKNGLTGDDMGLPL 115
Query: 111 TEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF 170
+K + YLKE GY T GKWH G + + PT RGFD YG+ G + Y
Sbjct: 116 DQKTIADYLKEQGYRTALFGKWHQG-NADRFHPTKRGFDEFYGFRGGARSYMPFGADNEL 174
Query: 171 EPYQGLDMRHNMQVDNKTIGI-----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA 225
R+ +++ G Y T+ + AI I E N+ P F+YLA AVH
Sbjct: 175 -------TRNEDRLERGFGGFQEHEGYLTEELADEAIAFI-ERNQKNPFFVYLAFNAVHT 226
Query: 226 GNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
P +A +E++ K +S +R+T A M +D + G V+ ++ G+ +N++++F
Sbjct: 227 -----PMEAKEEDLKKVKGLSG-KRKTLAAMTIAMDRACGKVLDRIKALGLDKNTLIVFT 280
Query: 286 ADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWL 345
DNG PS +N+ SN PL G K+ +GG+R + P + V D L
Sbjct: 281 NDNGGPS---DANESSNLPLSGTKANHLEGGIRVPFLMSWPGKLKPNTVYENPISTLDLL 337
Query: 346 PTLCAAAGIEINDTSLDGVNQWDVL--TKGAKTKR 378
PT AAG ++N ++DG++ D+L +G K R
Sbjct: 338 PTFVTAAGGDVN--AIDGLDGVDLLPYLQGKKQGR 370
>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 461
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 133/381 (34%), Positives = 206/381 (54%), Gaps = 35/381 (9%)
Query: 1 MTWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDA 60
+ WA F L + + A + P+I+ ILADDLG+ D+S GS+ + TP+ID+
Sbjct: 11 LLWAVTSFLPLPKS----SSASQPAANRPPNIVFILADDLGYGDLSCFGSTDLKTPHIDS 66
Query: 61 LAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILE--GEPWG-LPLTEKLLP 116
L G+ Y ++C+PSR+AL+TG+YP +G+ GVI + + WG L + LLP
Sbjct: 67 LIGAGMRFTNFYANSSVCSPSRAALLTGRYPERVGVP-GVIRDEVQDSWGYLASSATLLP 125
Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL 176
YL++ GY + IGKWHLG P RGF YG +G+ D Y + G
Sbjct: 126 TYLRKQGYHSANIGKWHLGL-ESPNLPNERGFQEFYGLLEGMMDDYVVKLR------HGQ 178
Query: 177 D-MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQA 234
+ +RHN QV + ++TD++T+AA+ + + K P FLYLA+ A H +P Q
Sbjct: 179 NFLRHNGQVIDPPG--HATDVFTDAAVRYLNDRKAKKDPFFLYLAYTAPH-----DPLQP 231
Query: 235 PD---EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
P V K DP+R ++ LD+++G V+A LR +G L N++V+F +DNG
Sbjct: 232 PVAYLNRVQKRQPGIDPQRAKLVALIEHLDDNIGRVLATLRANGQLANTLVVFTSDNG-- 289
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
G K +N RG+K ++GG+R A+WS ++ + S L + DW PTL
Sbjct: 290 --GWGPGKANNGQFRGVKGQFYEGGIRIPTGAMWSGHIEAGSE-SGALLQLMDWFPTLLQ 346
Query: 351 AAGIEINDTSLDGVNQWDVLT 371
AAG + D ++G++ +++T
Sbjct: 347 AAGASVPD-GINGLSFLNLMT 366
>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 498
Score = 196 bits (498), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 129/368 (35%), Positives = 199/368 (54%), Gaps = 41/368 (11%)
Query: 4 ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
+R F +LT + +AP P+I++ILADD+G+ D+ GS + TPN+D LA
Sbjct: 10 SRLVFLSLTVFFATKASRAAESAP--PNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67
Query: 64 NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
+G++ +Q YV A+C+PSR+ L+TG+ P G + + E + GLP++EK L
Sbjct: 68 SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDERYATRPELLGLPVSEKTL 127
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
+L AGYAT IGKWHLG E++ P RGFD G G Y+ + E
Sbjct: 128 GDHLGAAGYATALIGKWHLG-MGEMHHPNRRGFDHFCGMLTGSHHYFPTTMNHVIE---- 182
Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK---PMFLYLAHLAVHAGNTYEPF 232
R+ +V++ + Y TD +T+ + I +H +K P F++ ++ A H P
Sbjct: 183 ---RNGQRVEDFS-NEYLTDFFTDEGLRFIDQHEAAKPDQPWFVFYSYNAPHT-----PM 233
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
A + ++A+F +I + +RRTYA M+ LD VG + L + G +N++++F +DNG
Sbjct: 234 HATEADLARFANIQNKKRRTYAAMMFALDRGVGRIRKHLEETGQWKNTLLVFFSDNG--- 290
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-----PWLKQTQKVSSELFHISDWLPT 347
G +N N PLRG+K + +GG+R V IW+ P K+ V S L D LPT
Sbjct: 291 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGKRYDGVISSL----DLLPT 344
Query: 348 LCAAAGIE 355
C+AAG +
Sbjct: 345 FCSAAGAQ 352
>gi|323451693|gb|EGB07569.1| hypothetical protein AURANDRAFT_2707, partial [Aureococcus
anophagefferens]
Length = 351
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 131/349 (37%), Positives = 186/349 (53%), Gaps = 29/349 (8%)
Query: 28 KKPHIIIILADDLGWNDVSF----HGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
+ P++I+ LADDLG NDV + G +I +P IDALA L+L++ Y Q +CTP+R+A
Sbjct: 9 RPPNLIVFLADDLGRNDVGYAHRDGGPGRIASPRIDALAAESLVLDRFYAQPMCTPTRAA 68
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
LMTG++ G+ + V+L + GLP E + + L++AGY+TH IGKWHLGF ++ P
Sbjct: 69 LMTGRHAYRTGLAYFVLLANQGTGLPAAEVTVAERLRDAGYSTHMIGKWHLGFAKKAMLP 128
Query: 144 TFRGFDSHYGYWQGLQDYYDHSC-----------KAT-FEPYQGLDMRHN-------MQV 184
T RGFD +GY G DY+ H +AT EP M H+ +
Sbjct: 129 TSRGFDRFFGYCLGSSDYWLHQSPEWVPGVPSTDRATGAEPPTTGGMGHDLWDGATPLPR 188
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
KT ++S DL+ A A + KP+FLY A A HA P + A
Sbjct: 189 TPKTENVHSADLFAARATETFATAPRDKPLFLYYASQAPHAHFVGAPPRHVAAAAAMGFA 248
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
DP R A +V+ LDE VGNV+ +R EN+++ FM DNG P G + S+ P
Sbjct: 249 AGDP-RADVAALVAALDEQVGNVVDGVRASRGWENTVLWFMGDNG-PEPGTGA---SSWP 303
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
LRG K T ++GG+R + +++P + SS H+ D PTL AG
Sbjct: 304 LRGAKRTLFEGGVREPSFVYAPG-RVAPGASSAPVHVIDVAPTLLGLAG 351
>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 498
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 129/368 (35%), Positives = 192/368 (52%), Gaps = 39/368 (10%)
Query: 4 ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
A F L +F P+I++ILADD+G+ D+ GS + TPN+D LA
Sbjct: 8 AVSRFVFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67
Query: 64 NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
+G++ +Q YV A+C+PSR+ L+TG+ P G + + E + GLP +EK L
Sbjct: 68 SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPKSEKTL 127
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
+L AGYAT IGKWHLG E++ P RGFD G G Y+ + E
Sbjct: 128 ADHLGAAGYATALIGKWHLGM-GEMHHPNRRGFDHFCGMLTGSHHYFPTTMNHVIE---- 182
Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPF 232
R+ +V+N + Y TD +T+ + I +H N +P F++ ++ A H P
Sbjct: 183 ---RNGKRVENFS-SEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PM 233
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
A + ++A+F +I + +RRTYA M+ LD VG + L + G EN++++F +DNG
Sbjct: 234 HATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNG--- 290
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-----PWLKQTQKVSSELFHISDWLPT 347
G +N N PLRG+K + +GG+R V IW+ P + V S L D LPT
Sbjct: 291 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVISSL----DLLPT 344
Query: 348 LCAAAGIE 355
C+AAG E
Sbjct: 345 FCSAAGAE 352
>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
Length = 488
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 125/350 (35%), Positives = 189/350 (54%), Gaps = 21/350 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRS 82
T A +P+II+ILADDLG+ D+ F GS +I TPNIDALA+ G++ + YV C PSR+
Sbjct: 29 TKADSRPNIIVILADDLGYGDLGFTGSREIKTPNIDALAHKGVVFSNAYVTHPYCGPSRA 88
Query: 83 ALMTGKYPIHIGMQ-HGVILEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
L+TG+Y GM+ + +P+ GLP+ E + +++AGY T IGKWH+G
Sbjct: 89 GLLTGRYQARFGMEINAAHSPDDPFMGLPVDEPTFAKRMQKAGYKTAVIGKWHMGSHPN- 147
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTE 199
+ P RGFD YG+ G DY+ S K + E Y R+ N+ + +T + E
Sbjct: 148 FHPNNRGFDYFYGFLGGGHDYFPESVKVSNEEYSIPLSRNGKPAQLNEYL---TTAISKE 204
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
AA + S+P +Y+A+ A H +P +A +++AK+ I D RRTYA MV
Sbjct: 205 AAEFAM---TTSQPFMMYVAYNAPH-----QPLEATQKDLAKYQHIEDINRRTYAAMVDS 256
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGA---PSFGIHSNKGSNHPLRGMKSTPWDGG 316
+D+ +G ++ L++ G EN+++ F++DNG S+ +SN SN P R K + +GG
Sbjct: 257 MDQGIGRLVNGLKQSGKFENTLIFFLSDNGGVYPESWMPNSNWASNKPFRRGKVSLTEGG 316
Query: 317 MR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
+ A W + L D T AG +++D LDGVN
Sbjct: 317 VHVPFIAHWPAGFGNKGQFDG-LVSSLDIAATSLGLAGADVHDAKLDGVN 365
>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 463
Score = 195 bits (495), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 138/383 (36%), Positives = 205/383 (53%), Gaps = 45/383 (11%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
K+ +AL L+ AF + KKP+II++L DD GW DV F+G++ IPTPN+D LA G
Sbjct: 4 KFLYALALALM--TAFSSLAQEKKPNIIVVLTDDQGWADVGFNGATDIPTPNLDRLASEG 61
Query: 66 LILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGE---PWGLPLTEKLLPQYLKE 121
++ + YV C+PSR+ L+TG+Y G + + E G PL+EK++P+ LKE
Sbjct: 62 IVFDNAYVSHPYCSPSRAGLLTGRYQARFGHDCNMPYDSENDDTVGTPLSEKMIPEALKE 121
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYY---DHSCKATF---EPYQG 175
GY T AIGKWHLG ++ P +GFD +G+ G +Y+ D K EP
Sbjct: 122 HGYRTSAIGKWHLGDHPSLH-PIHQGFDHWFGFAGGGMNYWGIPDGPIKTIVRNGEPVPQ 180
Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
++R Y TD +T+ AI+ I + + KP F+YLA+ A HA P QA
Sbjct: 181 NELR------------YLTDDFTDEAIDFITKKD-DKPFFMYLAYNAPHA-----PDQAT 222
Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
+ + + I R YA MV+ +D +VG + + L +GM EN+I++F++DNG G
Sbjct: 223 KQYLEQTKHIEYGGRSIYAAMVNAVDANVGRIDSTLVANGMKENTILVFLSDNGGR--GE 280
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQ---KVSSELFHISDWLPTLCAA 351
H++ N P RG K ++GG++ I W LK Q K+ S L D PT A
Sbjct: 281 HAD---NRPYRGHKGMLFEGGIKVPFFITWPKKLKGGQHYEKMVSSL----DLFPTFLEA 333
Query: 352 AGIEI-NDTSLDGVNQWDVLTKG 373
AG ++ ++ LDG + +G
Sbjct: 334 AGGKVKKESQLDGTTLLPYILEG 356
>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 489
Score = 195 bits (495), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 134/412 (32%), Positives = 202/412 (49%), Gaps = 51/412 (12%)
Query: 8 FFALTCTLLFNDAFLNTT--APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
F +TC L + T + PHI+ I+ADDLGWNDV FHGS +I TPNID LA
Sbjct: 22 FAIMTCFSLVGENVSAETNGGSQPPHIVHIIADDLGWNDVGFHGS-EIRTPNIDRLASES 80
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ L++ YV +C+P+R+ ++TG YP G+ GV+ + GLP + P++L + GY
Sbjct: 81 VTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPTKKHGLPTLLETTPEHLSKLGYD 140
Query: 126 THAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
A+ GKWHLG ++ P G YG++ G DY+ + ++ D H
Sbjct: 141 HRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAIDYFSRERFGQLDWHRDFDSVHEEG- 199
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA---------- 234
YST+L A ++ I + + ++ Y+A A H+ P QA
Sbjct: 200 -------YSTELVGNAVVDFIDRNANAGAVYAYVAFNAPHS-----PLQALRSDLDEYGF 247
Query: 235 ---------PDEEVA---KFLDI-----SDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
D ++A K LD + R+T+A M + +D +G ++ A+ ++GM
Sbjct: 248 DPNNKLSPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMR 307
Query: 278 ENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
EN++V+F +DNGA P G GSN PLRG K T W+GG+R VA + P
Sbjct: 308 ENTLVVFHSDNGADPKHG-----GSNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYD 362
Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
+ D LP + AAAG + + DG+N L+ A IL + + V
Sbjct: 363 SVTSYVDLLPIMVAAAGGPPPEET-DGINLLPFLSGKASPPERTILLDAETV 413
>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
Length = 520
Score = 194 bits (494), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 126/364 (34%), Positives = 190/364 (52%), Gaps = 31/364 (8%)
Query: 4 ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
A F L +F P+I++ILADD+G+ D+ GS + TPN+D LA
Sbjct: 30 AVSRFVFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 89
Query: 64 NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
+G++ +Q YV A+C+PSR+ L+T + P G + + E + GLP +EK L
Sbjct: 90 SGVLCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTL 149
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
+L AGYAT IGKWHLG E++ P RGFD G G Y+ + K E
Sbjct: 150 ADHLGAAGYATALIGKWHLGM-GEMHHPNRRGFDHFCGMLTGSHHYFPATMKHVIE---- 204
Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPF 232
R+ +VD+ + Y TD +T+ + I +H N +P F++ ++ A H P
Sbjct: 205 ---RNGKRVDDFS-SEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PM 255
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
A + ++A+F +I + +RRTYA M+ LD VG + L + G EN++++F +DNG
Sbjct: 256 HATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNG--- 312
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-PWLKQTQKVSSELFHISDWLPTLCAA 351
G +N N PLRG+K + +GG+R V IW+ P + + D LPT C+A
Sbjct: 313 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSA 370
Query: 352 AGIE 355
AG E
Sbjct: 371 AGAE 374
>gi|431899642|gb|ELK07596.1| Arylsulfatase J [Pteropus alecto]
Length = 637
Score = 194 bits (494), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 122/320 (38%), Positives = 165/320 (51%), Gaps = 29/320 (9%)
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH VI +P LPL LPQ L+EAGYATH +GKWHLGF+R PT RG
Sbjct: 109 QYQIHTGLQHSVIRPTQPNCLPLDAATLPQKLREAGYATHMVGKWHLGFYRTECLPTRRG 168
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIA 206
FDS +G G DYY H K G D+ N + + G YST +YT+ ++A
Sbjct: 169 FDSFFGSLLGSGDYYTHY-KCDSPGVCGYDLHENDRAAWGRGGGAYSTQMYTQRVQRILA 227
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ KP+FLY+A+ AVH+ P QAP + + I++ RR YA M+S LDE+V N
Sbjct: 228 SHDPGKPLFLYIAYQAVHS-----PLQAPGRYLGHYRSIANARRRRYAAMLSCLDEAVAN 282
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V AL+ HG +NS++++ +DNG G + GSN PLRG K T W+GG+R V + SP
Sbjct: 283 VTLALKAHGFYDNSVIVYSSDNG----GQPAAGGSNWPLRGRKGTYWEGGVRAVGFVHSP 338
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQ------------------WD 368
L+Q W+
Sbjct: 339 LLRQRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLWE 398
Query: 369 VLTKGAKTKRSEILHNIDNV 388
VL++G ++ R +ILHNID V
Sbjct: 399 VLSEGQRSPRVDILHNIDPV 418
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 36/63 (57%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
LFNIT DP E+ +L++ +++QL +L+ + T VP P D R +P +W PW
Sbjct: 488 LFNITADPYERVDLSDRYPGVVRQLLRRLSRFNRTAVPVRYPPRDPRGNPQLHGGVWGPW 547
Query: 565 YDE 567
Y +
Sbjct: 548 YHQ 550
>gi|326433598|gb|EGD79168.1| hypothetical protein PTSG_09899 [Salpingoeca sp. ATCC 50818]
Length = 515
Score = 194 bits (492), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/372 (32%), Positives = 194/372 (52%), Gaps = 29/372 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGS--------SQIPTPNIDALAYNGLILNQHYVQALCTP 79
+P+I++I+ DD+GW + H + S + TP + L G+ L +HY C+P
Sbjct: 26 SQPNIVLIIGDDVGWANTELHCNISDPAQVCSDVKTPFLLQLKDEGVHLLRHYAYKFCSP 85
Query: 80 SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
+RSALM+G+ PIH+ + ++G G+ L +LP+ L GYATH +GKWHLG
Sbjct: 86 TRSALMSGRLPIHVNEVNPTYIKGG--GVSLNFTMLPEKLSRVGYATHHVGKWHLGMSSH 143
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
PT RGFDS GY G +D++ + T G+D+ N Y L+
Sbjct: 144 SRVPTGRGFDSSLGYLLGAEDHWKQT--VTQGNATGVDLWLNEGPAYLRNNEYGDLLFGR 201
Query: 200 AAINVIAEHNKS---KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
A+ VI +H K+ P+FLY+ H+ P + P E ++ I +R GM
Sbjct: 202 RAVEVIHDHAKNHRDSPLFLYVGLQVNHS-----PAEVPKEYQDRYPGIQ-VDRAIVNGM 255
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
S +D+++G + AL + M E+++V++++DNG S GI++N G+N+PLRG K T ++GG
Sbjct: 256 SSVMDDAIGKITTALWETTMWEDTLVVYISDNGGAS-GINANNGNNYPLRGGKHTDFEGG 314
Query: 317 MRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEIND-----TSLDGVNQWDV 369
+R A + +L Q+ S+ H DW TL AG D +DG++ W +
Sbjct: 315 VRTAALVSGGFLPDDQRGSTRDGYMHACDWYTTLAKLAGASPADDVEGLPGVDGMDMWPM 374
Query: 370 LTKGAKTKRSEI 381
+ K+ R E+
Sbjct: 375 IANNEKSPRHEV 386
Score = 40.8 bits (94), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 5/66 (7%)
Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA-----RWN 558
CLFNI +DP E +++ DL +L+D+L+ ST G DPA R++
Sbjct: 443 CLFNIKEDPTEHHDIKFDNLDLFWKLKDRLSKLASTAYQTGEDSEGDCEDPAHVMETRYH 502
Query: 559 NIWVPW 564
W P+
Sbjct: 503 GFWGPF 508
>gi|341889947|gb|EGT45882.1| CBN-SUL-3 protein [Caenorhabditis brenneri]
Length = 432
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 132/402 (32%), Positives = 202/402 (50%), Gaps = 87/402 (21%)
Query: 22 LNTTAP--KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY--NGLILNQHYVQALC 77
+N +P +KP+++ I+ADDLG+ND+ + S+ + TPN+ LA+ N +L YV LC
Sbjct: 23 VNGQSPPTQKPNVLFIMADDLGFNDLDWKDST-LHTPNLRNLAFHKNTALLTNSYVNQLC 81
Query: 78 TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
TP+RSA MTG YP +G Q GV L EP G+P L + +++ Y+T+ +GKWHLG+
Sbjct: 82 TPTRSAFMTGYYPFRVGTQAGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYC 141
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSC----KATFEPYQGLDMRHNMQVDNKTI---- 189
++ + PT RGFD YG++ Y++HS + +GLD+ + + K++
Sbjct: 142 KKEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELRRVVKGLDLFEEVG-NGKSVPDFS 200
Query: 190 --GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
G+YSTDL+T+ A++VI H NT +PF F+ +S
Sbjct: 201 QNGVYSTDLFTDVAMSVIDNH------------------NTTKPF---------FMFLS- 232
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
Y + H L+N G +FG SN PLRG
Sbjct: 233 -----YQAV-----------------HPPLQNG--------GTSNFG-----ASNAPLRG 257
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW 367
K T W+GG + I SP + +FH+ DW T+ A G+EI D+ DG+NQW
Sbjct: 258 EKDTIWEGGTKTTTFIHSPMYVEEGGEREMMFHVVDWHATVLAITGLEI-DSYGDGINQW 316
Query: 368 DVLTKGA-KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
+ + G K +R + ++NIDN + +A+R D K + G
Sbjct: 317 EYIKTGRPKFRRFQFVYNIDN------HGSAIRDGDYKLIVG 352
>gi|326428223|gb|EGD73793.1| arylsulfatase [Salpingoeca sp. ATCC 50818]
Length = 697
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 184/364 (50%), Gaps = 40/364 (10%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
PHI+ L DDLG+ + + TP+ID LA NG +L + Y C+P+RS+L++G+
Sbjct: 75 PHIVYFLVDDLGYANTGVTNDEPL-TPHIDELARNGTMLRRFYTYKYCSPTRSSLLSGRL 133
Query: 90 PIHIGMQHGVILEGEP-----WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
PIH+ E P G+ L + L+ AGYATH +GKWH G P
Sbjct: 134 PIHVNQ------ENRPPPVPGGGIDQNMTTLAEVLQRAGYATHQVGKWHCGMSSPDRLPV 187
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQG-LDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGF+S +GY G + +Y + QG +D+ + Q G Y T +Y A
Sbjct: 188 HRGFNSSFGYLSGAEHHYS-------QIRQGYVDLWRDTQPAYGENGTYGTFMYAREAER 240
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
+I H+ + P+FLYLA VH+ P + PD + K+ D+ R+ MVS +DE+
Sbjct: 241 IIHAHDPTTPLFLYLAFQNVHS-----PLEVPDRFLDKYPDVDYRPRKHCLAMVSAVDEA 295
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+ NV ALR+ G+ ++++++F +DNG P + +N+P RG K+ ++GG+R VA +
Sbjct: 296 INNVTLALRRAGLFDDTLIIFSSDNGGP-----HDHANNYPFRGAKTADFEGGVRAVAFV 350
Query: 324 WSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEIND--------TSLDGVNQWDVLTKG 373
+ + ++ L H+ DW TL AG + D +DG + W +LT
Sbjct: 351 SGGVVPKNMVGTNVHGLMHVCDWYSTLARLAGEDPTDHRAARSNLPPIDGFDMWPMLTGA 410
Query: 374 AKTK 377
T
Sbjct: 411 NSTS 414
>gi|443700719|gb|ELT99563.1| hypothetical protein CAPTEDRAFT_110993 [Capitella teleta]
Length = 339
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 114/304 (37%), Positives = 167/304 (54%), Gaps = 20/304 (6%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+II ADD G+ D + +S I TPNID L +G+ Y C+PSRSA ++G+
Sbjct: 31 KPNIIWFFADDYGYQDAGYR-NSDIHTPNIDKLVADGISFTNAYSAQQCSPSRSAFLSGR 89
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y GMQHGVI + + + L + YLKE Y THA GKWHLG+ + TPT+RGF
Sbjct: 90 YAYTSGMQHGVIGDTKAHCMDLKYNFISDYLKELKYNTHASGKWHLGYCNKECTPTYRGF 149
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGL-DMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
D+ G + G +YY+H+ +QG+ D + ++D GI+S D+ +N + +
Sbjct: 150 DTFSGGYSGEGEYYEHT------TFQGMYDWHNGTELDRSASGIHSQDMIEADLLNQL-D 202
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-SDPERRTYAGMVSRLDESVGN 266
N P+F Y A H NT+ P E A + D+ + P RR Y G+VS +D +G
Sbjct: 203 KNDGTPLFYY---AAFH--NTHSPMHPKPEHEALYDDLDAPPTRRKYLGLVSGMDAVIGK 257
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V+ L++ M +N+ + F +DNG +H +G N P RG KS+ +DGG + SP
Sbjct: 258 VVTKLKEKDMFDNTYIFFSSDNGGD---VH--EGDNSPKRGAKSSLFDGGCHAHSWASSP 312
Query: 327 WLKQ 330
LK+
Sbjct: 313 LLKK 316
>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 454
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 131/365 (35%), Positives = 190/365 (52%), Gaps = 40/365 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K+P+++++LADDLGW DV F+G ++ TPN+D LA G + Y A +C PSR+ALMT
Sbjct: 38 KRPNVLLLLADDLGWGDVGFNGRTEWATPNLDRLAARGTTFKRFYTAAVVCAPSRAALMT 97
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE---VYT- 142
G+Y IH G+ LP E L + K GY T GKWH G R+ Y
Sbjct: 98 GRYTIHDGVSRNND------DLPAREVTLAEAFKTHGYDTALFGKWHHGQPRDGSKTYVH 151
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P +GFD +G+ K +E Y + H ++ K + YS D++ + AI
Sbjct: 152 PMDQGFDEFFGF---------TDAKHAWEKYP-EQLWHGREL--KPVSGYSDDMFADHAI 199
Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA----KF--LDISDPERRTYAG 255
+ + H K P FLY+ + NT+ +AP EEVA KF +D S P R TYA
Sbjct: 200 DFLKRHKEKPTPFFLYVPFI-----NTHFNIEAPAEEVALHQGKFTEVDPSKPIRATYAA 254
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP----SFGIHSNKGSNHPLRGMKST 311
MV++LD++VG ++ AL G+ +++V+F++D+GA + G SN P RG K T
Sbjct: 255 MVTQLDKNVGRIMTALEGLGLAADTLVVFVSDHGATFESGNLGASDYHDSNRPFRGQKRT 314
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
W+GG+R P +S+E+ H++D PTL AA G +DG + W
Sbjct: 315 LWEGGIRVPGVACWPGHVPAGVISNEVVHMTDLFPTLLAATGAHPEAAWHVDGADLWPTW 374
Query: 371 TKGAK 375
T AK
Sbjct: 375 TGKAK 379
>gi|149175233|ref|ZP_01853855.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148845842|gb|EDL60183.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 459
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 130/376 (34%), Positives = 206/376 (54%), Gaps = 35/376 (9%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F A++C L+ A A + P+I++I+ADDLG+ D++ +G+ Q+ TP+ID LA + L
Sbjct: 15 FIAVSCLLIRFSA--AEAAQQPPNIVLIMADDLGYGDLACYGNKQVKTPHIDRLAASALK 72
Query: 68 LNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEP---WGLPLTEKLLPQYLKEAG 123
H A+CTP+R+A++TG+Y G Q L G+ GLP + + LK+ G
Sbjct: 73 FTDFHSAGAMCTPTRAAMLTGQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAELLKQQG 132
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
YAT GKWHLG ++ + PT +GFD G G D++ H ++ E D HN +
Sbjct: 133 YATACFGKWHLG-YQPPWLPTNQGFDLFRGLTSGDGDHHTHVDRSGNE-----DWWHNNE 186
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-------- 235
+ + Y+ DL ++ ++ + E N+++P FLY+ HLA+H P+Q P
Sbjct: 187 ISMEK--GYTADLLSKYSVAFM-EANRTRPFFLYVPHLAIHF-----PWQGPQDPPHRKA 238
Query: 236 --DEEVAKFLDISDPERRT--YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA- 290
D K+ I DP + M+ LD+SVG +++AL++ + +N++V+F +DNG
Sbjct: 239 GQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIFTSDNGGY 298
Query: 291 PSFGIH-SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
++G + N SN PLRG K+T ++GG R I P + T V+ + H D LPTL
Sbjct: 299 LTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVI-TAGVTDQTAHSVDLLPTLA 357
Query: 350 AAAGIEINDTSLDGVN 365
AAGI + DG++
Sbjct: 358 QAAGISATNFQTDGLD 373
>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 460
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 127/397 (31%), Positives = 208/397 (52%), Gaps = 29/397 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
A ++P+I+II DD G NDV +GS +IPTP+ID LA GL+ Q+Y A+CTPSR +
Sbjct: 24 AAERPNILIIFTDDQGINDVGCYGS-EIPTPHIDQLAKEGLLFRQYYSASAICTPSRFGI 82
Query: 85 MTGKYPIH-----IGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
+TG+ P +G + + G+ E + L++ GY T +GKWHLG E
Sbjct: 83 LTGRNPTRSQDQLLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGHGTE 142
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
+ PT GFD G+ G DY+ + + Y + RH + Y+TDL TE
Sbjct: 143 SFLPTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYH--NQRHVSENG------YATDLITE 194
Query: 200 AAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEP--------FQAPDEEVAKFLDISDPER 250
A + + + + KP FL+L++ A H G + P QA +++ + I D R
Sbjct: 195 EAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRVGTIKDKVR 254
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
R +A M LD+ +G V+++L+ +G+ +N++V+FM D+G G + G+N P RG K+
Sbjct: 255 REFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHG----GDYVYGGNNQPFRGAKA 310
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
T ++GG+R I P + ++E+ D PT+C A ++ + +LDG + +L
Sbjct: 311 TLFEGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFANVDTDGLTLDGKDISGLL 370
Query: 371 TKGAKTKRSEILHNID-NVDNPQKYYAALRVDDLKYV 406
T+ E+ + + + + ++ALR D KY+
Sbjct: 371 TRQTPVGTRELYWQLGPHAELKRGRWSALRQGDWKYI 407
>gi|406834549|ref|ZP_11094143.1| Cerebroside-sulfatase [Schlesneria paludicola DSM 18645]
Length = 470
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 128/381 (33%), Positives = 193/381 (50%), Gaps = 19/381 (4%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F AL C A A KP++I+IL DD G D + G+ + TP+ID LA G+
Sbjct: 25 FLALFCFTASGLASAADQATGKPNVIVILTDDQGTIDANCFGAKDLETPHIDGLAARGVR 84
Query: 68 LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
L Q Y A +C+PSR+ L+TG+YP+ G+ V G+P T+ + + K AGYAT
Sbjct: 85 LTQFYAAAPVCSPSRAGLLTGRYPVRGGVPGNVSSTAGVAGMPTTQVTMAETFKAAGYAT 144
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
IGKWHLG + E P +GF + +G+ G D + H F +QG + RH++ D
Sbjct: 145 AHIGKWHLG-YSEDTMPLAQGFQTSFGHMGGCIDNFSH-----FFYWQGPN-RHDLHKDG 197
Query: 187 KTI---GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
+ + G + DL E A I N+ +P F+Y A H P+Q + ++
Sbjct: 198 REVQRPGRFFPDLMVEEASQFIGT-NRERPFFVYFALNGPH-----YPYQGDPRWLERYK 251
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG-APSFGIHSNKGSN 302
D+ P R YA VS +DE +G ++ + + G+ E +IV+F +D+G + H GS
Sbjct: 252 DLPYP-RNLYAAFVSSIDERIGGLLKRVDELGLREKTIVVFQSDHGHSTEERAHLGGGSA 310
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD 362
RG K + ++GG+R A I P + S+L H DWLPTL A A + + + +D
Sbjct: 311 GIYRGAKFSLFEGGIRVPAIISWPGRLPQGEARSQLAHGCDWLPTLAALAAVPLVEPDID 370
Query: 363 GVNQWDVLTKGAKTKRSEILH 383
GVN +L ++H
Sbjct: 371 GVNLVPILNSADAATPHGLVH 391
>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 498
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 128/368 (34%), Positives = 190/368 (51%), Gaps = 39/368 (10%)
Query: 4 ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
A F L +F P+I++ILADD+G+ D+ GS + TPN+D LA
Sbjct: 8 AVSRFVFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67
Query: 64 NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
+G++ +Q YV A+C+PSR+ L+TG+ P G + + E + GLP +EK L
Sbjct: 68 SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTL 127
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
+L AGYAT IGKWHLG E++ P RGFD G G Y+ + E
Sbjct: 128 ADHLGAAGYATALIGKWHLG-MGEMHHPNRRGFDHFCGMLTGGHHYFPTTMNHVIE---- 182
Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPF 232
R+ +V+N + Y TD +T+ + I +H N +P F++ ++ A H P
Sbjct: 183 ---RNGKRVENFS-SEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNAPHT-----PM 233
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
A + ++A+F +I +RRTYA M+ LD VG + L + G N++++F +DNG
Sbjct: 234 HATEADLARFANIQTQKRRTYAAMMYALDRGVGRIREHLEESGQWGNTLLVFFSDNG--- 290
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-----PWLKQTQKVSSELFHISDWLPT 347
G +N N PLRG+K + +GG+R V IW+ P + V S L D LPT
Sbjct: 291 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVVSSL----DLLPT 344
Query: 348 LCAAAGIE 355
C+AAG E
Sbjct: 345 FCSAAGAE 352
>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 468
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 132/382 (34%), Positives = 200/382 (52%), Gaps = 34/382 (8%)
Query: 13 CTLLFNDAF--LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
C LL F L+T+A KK +I+ + ADD G+ D F GS PTPN+D LA G++ Q
Sbjct: 7 CLLLVALVFFSLSTSAQKKKNIVFLFADDAGYFDFGFQGSKTFPTPNLDQLAKEGMVFKQ 66
Query: 71 HY-VQALCTPSRSALMTGKYPIHIGMQHGVI---------LEGEPWGLPLTEKLLPQYLK 120
Y A+C PSR+ L+TG+Y G + + L G+ GLPL EK + YL
Sbjct: 67 AYTTAAVCGPSRAGLLTGRYQQRFGFEENNVPGYMSKSSKLLGDDMGLPLDEKTMADYLG 126
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMR 179
+ GY + +GKWH+G + Y P RGF YG+ G + +Y + K A +P L++
Sbjct: 127 KLGYQSIVLGKWHMG-NADRYHPLKRGFTEFYGFRGGARSFYPLTQKQAADKPEDRLEIG 185
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
+ + K Y T +AA + I + NK KP F+Y++ AVH+ P QA E++
Sbjct: 186 YKKYQEPKK---YLTYDLADAACDFI-DRNKKKPFFMYVSFNAVHS-----PMQATQEDL 236
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
K ++ +R+ A M LD++ G ++ L++ G+ ++++++F DNG P K
Sbjct: 237 KKIKGLTG-KRKVLAAMTLALDKACGQILEKLKEEGLEDDTLIVFSNDNGGPD----GTK 291
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEIN 357
N+PL G KS +GG+R + P K Q SS + +S D LPT AG + +
Sbjct: 292 TCNYPLSGCKSNNLEGGIRVPFVMKLP--KVIQPGSSYEYPVSTLDLLPTFVNIAGGKAD 349
Query: 358 DTS-LDGVNQWDVLTKGAKTKR 378
+ LDGV+ +T G TKR
Sbjct: 350 KIAQLDGVDLLPYIT-GKDTKR 370
>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 134/359 (37%), Positives = 191/359 (53%), Gaps = 42/359 (11%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQ--IPTPNIDALAY 63
K F AL C A KP+II+IL+DD G+ DVS++ I TP+ DALA
Sbjct: 7 KLFLALLCVNFV------ALADSKPNIIVILSDDQGYADVSYNPEHDDYISTPHTDALAK 60
Query: 64 NGLILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-GLPLTEKLLPQYLKE 121
+G+I ++ Y ++C+ +RS LMTG+Y ++G+ GE G L K +P YLKE
Sbjct: 61 SGVIFHRGYTSGSVCSTTRSGLMTGRYQ----QRYGIYTAGEGGTGTDLNAKFIPNYLKE 116
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW-QGLQDYY----DHSCKATFEPYQGL 176
AGY + A GKWHLG + Y P RGFD YG+ +G D++ ++ K Y+GL
Sbjct: 117 AGYKSMAFGKWHLGHEMK-YHPLHRGFDDFYGFMGRGAHDFFRLEKEYDGKFGGPIYRGL 175
Query: 177 DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
+ +D+K Y T TE + I E NK KP F Y+A+ AVH P QAP
Sbjct: 176 E-----PIDDKG---YLTTRITEETVKFI-EENKDKPFFAYVAYNAVHT-----PAQAPA 221
Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
E++ D R M+ LD VG ++ L+KH + EN+I+++++DNG G
Sbjct: 222 EDIKAV--SGDETRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNG----GAK 275
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
S +N PLRG+K +DGG+R + W +K Q S + + D LPTL AAG+
Sbjct: 276 SMVANNKPLRGVKHDIYDGGIRVPFLMSWPAQIKAGQDTQSPVISL-DILPTLLDAAGL 333
>gi|423219918|ref|ZP_17206414.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
CL03T12C61]
gi|392624181|gb|EIY18274.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
CL03T12C61]
Length = 463
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 132/363 (36%), Positives = 191/363 (52%), Gaps = 34/363 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K P+I++IL DD G+ND F GS ++ TPNIDAL G++ +V A + +PSR+ L+T
Sbjct: 30 KNPNILVILIDDAGYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLIT 89
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y G + L GLPL E+ + + K GY T AIGKWHLG R+ P R
Sbjct: 90 GRYGHRFGYE--CNLSDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNR 146
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI--YSTDLYTEAAINV 204
GFD YG G +DY+ ++ K + P N+ ++++ + Y TD ++E A+
Sbjct: 147 GFDLFYGMKAGGRDYF-YNEKKSDRPGD----ERNLLLNDRQVKFEKYLTDAFSEKAVEF 201
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I E S+P +YLA+ AVH P QA DE++AKF R+ A M LD V
Sbjct: 202 INE--SSQPFMMYLAYNAVHT-----PMQATDEDMAKF---EGHPRQKLAAMTYALDRGV 251
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAI 323
G VI L+ G +N+++ F++DNG G +N+ SN+PL+G K ++GG R +
Sbjct: 252 GTVIRGLKDSGKFDNTLIFFLSDNG----GATTNQSSNYPLKGFKGNKFEGGHRVPYFVV 307
Query: 324 WS---PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
W P K + S L D T AA I LDGV+ L KG K + +
Sbjct: 308 WKNGIPAGKPYDGLVSSL----DIFATAIDAASISKTRNRLDGVSLLPYL-KGKKGEPHK 362
Query: 381 ILH 383
+L+
Sbjct: 363 VLY 365
>gi|388257121|ref|ZP_10134301.1| sulfatase [Cellvibrio sp. BR]
gi|387939325|gb|EIK45876.1| sulfatase [Cellvibrio sp. BR]
Length = 484
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 134/397 (33%), Positives = 198/397 (49%), Gaps = 34/397 (8%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSF-HGSSQIPTPNIDALAYNG 65
+ ++C+ L D K+P+I++ILADDLG+NDV F +G ++I TP +DALA G
Sbjct: 13 FMVVISCSALGAD--------KRPNIVVILADDLGYNDVGFTNGQTEIKTPRLDALANEG 64
Query: 66 LILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWGLPLTEKLLPQYLKEA 122
++ YV C PSR+ L+TG+Y GM++ V + + GLPLTEK P L+E
Sbjct: 65 VVFENGYVTHPYCGPSRAGLITGRYQARFGMENNVTYSPDDKYMGLPLTEKTFPARLQEV 124
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
GY T GKWHLG + P RGFD YG+ G +Y Y MR
Sbjct: 125 GYKTAIFGKWHLGGAPH-FQPNERGFDYFYGFLDGGHNYMPGEVHLGAGGYLLPIMR--- 180
Query: 183 QVDNKTIGIY----STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
NK + + +T L +AA + + P F+Y+++ A HA P QAP
Sbjct: 181 ---NKGVAEFDEYLTTALSRDAARYIERTSKEQAPFFIYMSYNAPHA-----PLQAPQNY 232
Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP---SFGI 295
+ K+ I D +RRTYA MV +DE +G +I AL K G L+N+++ F++DNG S+
Sbjct: 233 LEKYAHIKDEKRRTYAAMVDAMDEGIGMLIDALDKSGELDNTLIFFLSDNGGLYPYSWWP 292
Query: 296 HSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
N P R K +GG+ A W + + ++++ L D T A AG
Sbjct: 293 DYTWADNSPFRFGKVALTEGGVHVPFIAHWPKKITKGKRLNG-LVSALDIAATSLALAGA 351
Query: 355 EINDTSLDGVNQWDVLT-KGAKTKRSEILHNIDNVDN 390
D LDGV+ +T K ++ + ++ DN
Sbjct: 352 NTKDAKLDGVDLTAYVTGKSKQSPHKALFWRMEEADN 388
>gi|340369113|ref|XP_003383093.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 374
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 113/321 (35%), Positives = 170/321 (52%), Gaps = 28/321 (8%)
Query: 30 PHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
PHI+++L DD GW +V +H + ++ TPNID L GL LNQHYV C+PSRS+L++
Sbjct: 25 PHIVMMLVDDWGWANVGYHRNPPTCEVVTPNIDKLVKEGLELNQHYVYKFCSPSRSSLIS 84
Query: 87 GKYPIHIGMQHGVILEGEP-------WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
G+ PIH+ Q+ + P +P + + ++ AGYATH +GKW G
Sbjct: 85 GRLPIHVNDQNLLPTNYNPDDPVSGFSAIPRNMTGIAEKMRGAGYATHQVGKWDAGMATS 144
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI-------Y 192
+TP RGF + +GY+ DYY + A Q +D+ + D GI Y
Sbjct: 145 DHTPKGRGFQTSFGYFHHANDYYKETASACNRT-QIVDL---WETDKPAHGINGTGPDNY 200
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
L+ E ++V++ H+ S P+FLY A VH P Q PD + F I+D +R+
Sbjct: 201 EEGLFKERVLDVVSNHDPSTPLFLYYAPHIVHT-----PLQVPDRYLKLFSFINDHDRQY 255
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y MV+ LD+ VG + AL+ GM +N + + +DNG P + +N+PLRG K T
Sbjct: 256 YHAMVNYLDDVVGEITDALKNKGMWDNLLFVTSSDNGGPVY--PGGGANNYPLRGGKVTD 313
Query: 313 WDGGMRGVAAIWSPWLKQTQK 333
W GG+R A + +L + +
Sbjct: 314 WQGGVRVNAFVSGGYLPEKMR 334
>gi|153807102|ref|ZP_01959770.1| hypothetical protein BACCAC_01379 [Bacteroides caccae ATCC 43185]
gi|149130222|gb|EDM21432.1| arylsulfatase [Bacteroides caccae ATCC 43185]
Length = 463
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 132/369 (35%), Positives = 193/369 (52%), Gaps = 34/369 (9%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
+ + K P+I++IL DD G+ND F GS ++ TPNIDAL G++ +V A + +PS
Sbjct: 24 VESKTDKNPNILVILIDDAGYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPS 83
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R+ L+TG+Y G + L GLPL E+ + + K GY T AIGKWHLG R+
Sbjct: 84 RACLITGRYGHRFGYE--CNLSDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDE 140
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI--YSTDLYT 198
P RGFD YG G +DY+ ++ K + P N+ ++++ + Y TD ++
Sbjct: 141 QHPNNRGFDLFYGMKAGGRDYF-YNEKKSDRPGD----ERNLLLNDRQVKFEKYLTDAFS 195
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
E A+ I E S+P +YLA+ AVH P QA DE++AKF R+ A M
Sbjct: 196 EKAVEFINE--SSQPFMMYLAYNAVHT-----PMQATDEDMAKF---EGHPRQKLAAMTY 245
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LD VG VI L+ G +N+++ F++DNG G +N+ SN+PL+G K ++GG R
Sbjct: 246 ALDRGVGTVIRGLKDSGKFDNTLIFFLSDNG----GATTNQSSNYPLKGFKGNKFEGGHR 301
Query: 319 -GVAAIWS---PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA 374
+W P K + S L D T AA I LDGV+ L KG
Sbjct: 302 VPYFVVWKNGIPAGKPYDGLVSSL----DIFATAIDAASISKTRNRLDGVSLLPYL-KGK 356
Query: 375 KTKRSEILH 383
K + ++L+
Sbjct: 357 KGEPHKVLY 365
>gi|408793054|ref|ZP_11204664.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
meyeri serovar Hardjo str. Went 5]
gi|408464464|gb|EKJ88189.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
meyeri serovar Hardjo str. Went 5]
Length = 583
Score = 192 bits (487), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 147/432 (34%), Positives = 218/432 (50%), Gaps = 67/432 (15%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQ----IPTPNIDALAYNGLILNQHYV-QALCT 78
T + P++I+I+ADDLG+NDV+ +G +PTPNID++A +G+ + Y A+CT
Sbjct: 91 TLKRRPPNVIVIMADDLGFNDVTTYGGGYADLGVPTPNIDSIAKDGVRFDSGYSGSAVCT 150
Query: 79 PSRSALMTGKYPIHIGMQH----------GVILEGEP---------------------WG 107
SR+ALMTG+YP G++ G L +P G
Sbjct: 151 VSRAALMTGRYPSRFGVEFTPTPGALARVGADLYADPNRLYPVVIDKEKAEKSKSFNELG 210
Query: 108 LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHY----GYWQGLQDYYD 163
+P +E + + LKE GY + IGKWHLG E+ P +GFD G + + D
Sbjct: 211 MPGSEITIAEVLKERGYHSVHIGKWHLGSTEEM-RPNKQGFDESLFMESGLYLPVDDPNV 269
Query: 164 HSCKATFEPYQGL---DMRHNMQVDNKT---IGIYSTDLYTEAAINVIAEHNKSKPMFLY 217
++ K F+P +MR + + Y TD +T+ A+ VI + NK +P FL+
Sbjct: 270 YNAKQDFDPIDRFLWPNMRFGVSYNGGKWFEPSRYLTDYFTDEAVKVI-KTNKHRPFFLF 328
Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
LAH AVH P QA E+ I D +R Y M+ LD SVG ++A+L++ G+
Sbjct: 329 LAHWAVHT-----PLQASKEDYDALSHIKDHRKRVYLSMIRSLDRSVGKILASLKEEGLD 383
Query: 278 ENSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVS 335
EN+IV+F +DNGAP++ G+ N P RG K T + GG+R A W +K K
Sbjct: 384 ENTIVIFTSDNGAPNYIGLPD---VNSPYRGWKLTLFQGGIRVPYVAKWPGHIKPGTKYQ 440
Query: 336 SELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY 394
+ + +I D LPT+ AAGI++ D +DGVN L KG ++ L D Y
Sbjct: 441 NAITNI-DILPTVANAAGIKLPLDREIDGVNLLPYL-KGQAIQKPRPLFWSDG------Y 492
Query: 395 YAALRVDDLKYV 406
Y ++ D K +
Sbjct: 493 YQTVQADGWKLI 504
>gi|325285341|ref|YP_004261131.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
gi|324320795|gb|ADY28260.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
Length = 460
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 126/351 (35%), Positives = 190/351 (54%), Gaps = 33/351 (9%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
TA K+P+II++L DD GW DV F+G++ IPTPN+D +A G+I + YV C+PSR+
Sbjct: 21 TAQKQPNIIVVLTDDQGWADVGFNGATDIPTPNLDRIASEGVIFSNGYVSHPYCSPSRAG 80
Query: 84 LMTGKYPIHIGMQHGVILEGE---PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
L+TG+Y G + EGE G PL+EKL+ + LKE GY T AIGKWH+G +
Sbjct: 81 LLTGRYQARFGHDCNMPYEGENDATVGTPLSEKLISEALKEQGYRTSAIGKWHIGDHPNL 140
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
+ P +GFD +G+ G +Y+ + Y R+ V + + Y TD +T
Sbjct: 141 HPPA-QGFDHWFGFPGGSMNYWGKATSKIQTIY-----RNTKPVAEEEL-TYLTDDFTNE 193
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD----ISDPERRTYAGM 256
AIN I + +K+ P F+YLA+ A H APD K+L+ I R YA M
Sbjct: 194 AINFINKKDKN-PFFIYLAYNAPH---------APDHATKKYLEQTKHIEYAGRSVYAAM 243
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V+ +D ++G + + L +G+ +N+I++F++DNG N P RG K ++GG
Sbjct: 244 VNAVDANIGKIDSTLVANGIKDNTILVFLSDNGG-----RIQHADNRPYRGHKGMLFEGG 298
Query: 317 MRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLC-AAAGIEINDTSLDGVN 365
++ I W +K+ K + + + D PTL AA G + LDGV+
Sbjct: 299 IKVPFFISWPSKIKEDSKYTKPISSL-DLFPTLLNAAGGNAAIEKQLDGVD 348
>gi|196229618|ref|ZP_03128482.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225944|gb|EDY20450.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 490
Score = 191 bits (486), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 141/394 (35%), Positives = 198/394 (50%), Gaps = 38/394 (9%)
Query: 1 MTWARKYFFALTCTLLFNDAFLNTTAP-KKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
M+ RK F ++ L + + P K+P+II+I++DD G+ D SF GS I TPN+D
Sbjct: 9 MSLLRKAFTSVAALSLASSSVRADDTPTKRPNIIVIVSDDQGYADASFQGSKDILTPNLD 68
Query: 60 ALAYNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-PWG-LPLTEKLLP 116
ALA +G+ + YV A +C+PSR+ LMTG+Y G + ++ E P LP E LLP
Sbjct: 69 ALAKSGVRCTRGYVTAPVCSPSRAGLMTGRYQERFGHHNNIVAEAALPIAHLPSNETLLP 128
Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYY----------DHSC 166
Q L +AGY T +GKWHLG ++ P RGFD +G G DY+ D S
Sbjct: 129 QVLAKAGYYTAMVGKWHLG-LQDGCRPYERGFDEFFGIITGGHDYFVNHPEERAVGDQSY 187
Query: 167 KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE-HNK--SKPMFLYLAHLAV 223
KA E N V G Y TD + A+ +I E H K +P+FLYLA A
Sbjct: 188 KARIE--------RNGPVGEAVPG-YLTDAFGADAVRIIRESHTKRPDQPLFLYLAFNAP 238
Query: 224 HAGNTYEPFQAPDEEVAKF-LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
H P QAP + V + +RRTYA ++ +D SVG V AAL+++GM +++ +
Sbjct: 239 HT-----PTQAPKDLVDTMPATLESKDRRTYAAQITSMDASVGKVRAALKENGMEKDTFI 293
Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
+F +DNG G + N PLR K + ++GG+R P V
Sbjct: 294 VFFSDNG----GANHPYYDNTPLRDHKGSLYEGGIRVPFFAVYPGHIPAGSVCELPVTSL 349
Query: 343 DWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAK 375
D T CA AG + + LD V+ VL A+
Sbjct: 350 DVFATACALAGTKPETSHPLDSVDMLPVLEGNAR 383
>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 473
Score = 191 bits (486), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 189/355 (53%), Gaps = 33/355 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
+KP+I++I ADD G+ D F GS+Q+ TPN+D LA +G+ Q YV + C PSR+ LMT
Sbjct: 27 EKPNIVLIFADDAGFGDFGFQGSTQLKTPNLDKLAQSGVRFTQGYVSDSTCGPSRAGLMT 86
Query: 87 GKYPIHIG---------MQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
GKY G M L+G GLPL +K + YLKE GY T GKWHLG
Sbjct: 87 GKYQQRFGYEEINVPGFMSDNSALKGADMGLPLDQKTMGDYLKEQGYKTAVFGKWHLG-D 145
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDN-KTIGIY 192
+ + P RGFD+ G+ G + Y+++S + E G D + N + Y
Sbjct: 146 ADRFHPLKRGFDTFLGFRGGDRSYFNYSEQ---EMKNGNKHFFDKKLERDFGNYEEPKEY 202
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
TD+ + A I E NK +P F+YLA AVH P ++ +++AKF +++ +R+
Sbjct: 203 LTDVLGKEAAKYI-EQNKDEPFFIYLAFNAVHT-----PLESDPKDLAKFPNLTG-KRKE 255
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
A M LD + G V+ L++ G+ +N+IV+F DNG PS N +N PL G KS
Sbjct: 256 LAAMTLGLDRASGYVLDKLKELGLDDNTIVVFSNDNGGPS---DKNASNNAPLAGTKSNQ 312
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEINDTSLDGVN 365
+GG+R W K + S+ + +S D LPT +AA + + +DGV+
Sbjct: 313 LEGGIR--VPFLISWPKHIKPGSTYDYPVSTLDLLPTFYSAAKGKALGSDIDGVD 365
>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 191 bits (486), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 135/383 (35%), Positives = 199/383 (51%), Gaps = 29/383 (7%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L +L+F A +KP+II+ILADDLG+ D+ F G+ I TP+IDALA +G+ Q
Sbjct: 3 LIFSLIFFTYSTLALAAQKPNIILILADDLGYEDLGFLGAPDIKTPHIDALARSGMNFTQ 62
Query: 71 HYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE------PWGLPLTEKLLPQYLKEAG 123
Y A +C PSR+ L+TG+Y G GE G+PL E+++ LK A
Sbjct: 63 GYQSASVCGPSRAGLLTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKPAA 122
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
Y T IGKWH+G E PT R D +YG+ G Y + P R+N
Sbjct: 123 YTTGVIGKWHMGLSHE-QRPTQRSVDYYYGFLNGAHSYREAKMDMKGAPMTWPIFRNNEP 181
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
V Y+T+++ + +N I + NK KP FLY+++ +VH P++A +++ +
Sbjct: 182 V---PFSGYTTEVFNDEGVNFI-KRNKDKPFFLYMSYNSVHG-----PWEAQPKDLQRSD 232
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---- 299
I RR Y+ M+ +D+ VG +I L+ G+ EN++V+FM+DNGAP+ +H +
Sbjct: 233 HIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPN-NLHEAERAGD 291
Query: 300 --GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIE 355
SN LRG K ++GG+R V I S W + K S+ +S D +PTL +
Sbjct: 292 YLASNGSLRGRKGDTYEGGIR-VPYIMS-WPQVIPKQSTYQHPVSGLDIVPTLIHISQAA 349
Query: 356 INDTSLDGVNQWDVLTKGAKTKR 378
L GVN +T G KT R
Sbjct: 350 PAKKELSGVNLMPYIT-GEKTSR 371
>gi|241244123|ref|XP_002402253.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215496292|gb|EEC05932.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 196
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 101/225 (44%), Positives = 134/225 (59%), Gaps = 44/225 (19%)
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
+G WHLG+++E YTPT RGF+S YGY+ +DYY+H+ LD+
Sbjct: 11 LGAWHLGYYKEAYTPTERGFESFYGYYNFGEDYYNHT----------LDL---------- 50
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
P+FLYL+HLAVHAG Y P QAP+E V KF I
Sbjct: 51 ------------------------PLFLYLSHLAVHAGTQYGPIQAPEENVQKFDYIGVK 86
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
R YAG V LD+SVG V AL K ML+N++++F++DNGA +G SN GSN PLRG
Sbjct: 87 NRSLYAGAVDALDQSVGIVFQALEKKDMLQNTVLVFLSDNGAIPWGEWSNAGSNWPLRGA 146
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
K T W+GG+R A +WSP L++ Q+VS++L H+SDW+PTL + AG
Sbjct: 147 KFTLWEGGVRVPAFVWSPLLRKPQRVSNQLMHVSDWVPTLYSLAG 191
>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 484
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 137/377 (36%), Positives = 188/377 (49%), Gaps = 40/377 (10%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
R F L T+ +N A +KP+II + +DD G+ D F GS + TPN+D LA +
Sbjct: 3 RNLIFILLLTVA-----INLNAQEKPNIIFLFSDDAGYADFGFQGSKIMKTPNLDKLAKS 57
Query: 65 GLILNQHYV-QALCTPSRSALMTGKYPIHIG---------MQHGVILEGEPWGLPLTEKL 114
G Q YV A C PSR+ L+TGKY G M + GLPL +
Sbjct: 58 GAKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEINVPGYMSANSKFLADDMGLPLDQLT 117
Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDY--YDHSCKATFEP 172
+ +LK+ GY T GKWHLG + Y PT RGFD YG+ G ++Y Y+ KA
Sbjct: 118 IADHLKKLGYKTAMYGKWHLG-DADRYHPTKRGFDEFYGFRGGARNYFGYNDVSKAN--- 173
Query: 173 YQGLDMRHNMQVDN-KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
LD R N + Y TD + A++ I E NK P F+YLA AVH P
Sbjct: 174 ---LDNRMERGFGNYQEPTEYVTDALAKEAVSFI-EKNKGNPFFIYLAFNAVHT-----P 224
Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
QA E++ KF +++ +R+ A M LD + G V+ L++ G+ +N+IV+F DNG P
Sbjct: 225 MQATKEDLNKFPNLTG-KRKELAAMTLALDRACGTVLNKLKELGLDKNTIVVFSNDNGGP 283
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLC 349
+ N N PL G KS +GG+R W KQ + + F +S D LPT
Sbjct: 284 T---DKNASLNLPLSGTKSNHLEGGIR--VPFLISWPKQIKSKTVYNFPVSTLDLLPTFY 338
Query: 350 AAAGIEIND-TSLDGVN 365
AA G + D +DGV+
Sbjct: 339 AAGGGNVADLKDIDGVD 355
>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
Length = 472
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 129/356 (36%), Positives = 191/356 (53%), Gaps = 32/356 (8%)
Query: 26 APKKPH-IIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSA 83
+PKK + +++I+ADDLG+ D+ F GS+QI TP++D LA NG+ Q YV A+C+PSR+
Sbjct: 30 SPKKHYNLVLIVADDLGYGDLGFTGSTQIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAG 89
Query: 84 LMTGKYPIHIGMQHGVILEG-EPW------GLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
+TG + G H L G EP G+PL++K + +L + GY IGKWHLG
Sbjct: 90 FITGINQVEFG--HDNNLAGVEPGFDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLGK 147
Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
+ P RGFD +GY G DY++ + K EP + N + + Y T
Sbjct: 148 -EPQFHPLKRGFDEFWGYTGGGHDYFESLPNGKGYKEP-----LESNFKTPDPIT--YIT 199
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
D +++ I H K +P FL+ A A H P QA +E++A + I D +RRTYA
Sbjct: 200 DDVGNESVDFIERH-KDEPFFLFAAFNAPHT-----PMQALEEDLALYQHIEDKKRRTYA 253
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
MV RLD +VG ++ +L + G+ EN++V+F +DNG P+ SN N P RG K +
Sbjct: 254 AMVHRLDLNVGKIMTSLEEQGLSENTLVVFFSDNGGPT---DSNASLNAPYRGQKGILLE 310
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
GG+ + P L + E D +PT A AG +TS+D + D++
Sbjct: 311 GGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPTFLALAG--DTETSMDMFSGVDLI 364
>gi|196230145|ref|ZP_03129008.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225742|gb|EDY20249.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 487
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 190/355 (53%), Gaps = 28/355 (7%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSA 83
A + P++I+I+ADDLG+ DV +G+ TPN D LA+ G H QA+C+ SR+A
Sbjct: 28 AADRPPNVILIVADDLGYADVGVYGAKGFETPNFDRLAHEGRRFTDFHVAQAVCSASRAA 87
Query: 84 LMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
+MTG YP IG++ + EPW G+ E +PQ K GYAT +GKWHLG E
Sbjct: 88 IMTGCYPNRIGIEGAM----EPWYKFGISDQELTMPQMFKRKGYATGMVGKWHLGTPTE- 142
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEP---YQGLDMRHNMQVDNKTIGIYSTDL 196
+ PT RGFD +G + H K F P Y+G D N ++++ + +T
Sbjct: 143 FLPTHRGFDEWFGLPYSNDQWPLHPEKPGKFPPLPLYEG-DKVINPGINHRDMEQLTTQ- 200
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
YTE A+N I + N KP FLY+A H P D KF + +R Y
Sbjct: 201 YTERAVNFI-DRNHDKPFFLYVAQTMPHV-----PLAVSD----KFRGAT--KRGLYGDA 248
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V +D S G ++AAL+K+G+ +++I++F++DNG P ++ GS +PLR K+T WDGG
Sbjct: 249 VEEIDWSTGEILAALKKYGLEKDTIIMFLSDNG-PWLIFGNHAGSAYPLREGKTTTWDGG 307
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
R I P VS+E+ D +PTL G ++ + +DG++ W +L
Sbjct: 308 TRVPFIISWPGHIPAGTVSTEMACAIDLMPTLAKLIGAKMPEHKIDGLDIWPLLA 362
>gi|86141258|ref|ZP_01059804.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
gi|85831817|gb|EAQ50272.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
Length = 461
Score = 189 bits (479), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 189/365 (51%), Gaps = 27/365 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K P+ ++I+ADD GWND SFHGS +I TPN+D LA GL L++ Y C+P+R++L+TG
Sbjct: 37 KTPNFLVIIADDAGWNDFSFHGS-EIQTPNLDQLAGKGLTLDRFYTYPTCSPARASLLTG 95
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+ +G+ + E LP + LPQ L + Y T +GKWHLG E P G
Sbjct: 96 RPASRMGIVAPISGRSE-LNLPDSITTLPQALSKLNYKTALMGKWHLGLKPES-GPEVYG 153
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FD YG+ G D Y H+ K + R+ + K + TDL T++A++ I
Sbjct: 154 FDFSYGFLHGQLDQYAHTYKNGDSTW----YRNGKFISEKG---HVTDLLTQSAVHYIDT 206
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+ +L +A+ A H P Q P E + K+ I D RR YA ++ +D +G +
Sbjct: 207 LQTDQNFYLQVAYSAPHI-----PLQEPQEWLEKYTGIKDSSRRAYAAAMTHMDAGIGEI 261
Query: 268 IAALRKHGMLENSIVLFMADNGAPS-----------FGIHSNKGSNHPLRGMKSTPWDGG 316
+ L+ + +N++VLF++DNGA +G + + GSN PLR K++ ++G
Sbjct: 262 LQKLKDKDLEKNTVVLFVSDNGAQEKWVPNTQYDGKYGPNYSLGSNLPLRDFKTSNYEGA 321
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKT 376
+R V AI S S+ +++DW+PT A E ++++GVN +L +
Sbjct: 322 LR-VPAIISWPENLNSGTSTNYINVTDWMPTFLNWANAEELPSTVEGVNAQHLLKTASVA 380
Query: 377 KRSEI 381
+ I
Sbjct: 381 REKPI 385
>gi|149198444|ref|ZP_01875489.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149138450|gb|EDM26858.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 458
Score = 189 bits (479), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 130/373 (34%), Positives = 200/373 (53%), Gaps = 20/373 (5%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILN 69
+ T +F L + P++IIIL DD G+ DV F+G IPTP++D++A NG+ ++
Sbjct: 1 MNKTFIFLYLILMNSIIAGPNLIIILTDDQGYQDVGFNGCKDIPTPHLDSIAQNGVNCID 60
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWGLPLTEKLLPQYLKEAGYATH 127
H +C PSR+ L+TG+Y G + E GLPL EK + + LKE GY++
Sbjct: 61 AHVSYPVCGPSRAGLLTGRYQDRFGFTTNPTVNPENPIAGLPLEEKNIAEVLKEVGYSSS 120
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYY--DHSCKATFEPYQGLD-MRHNMQV 184
IGKWH+G ++ P RGFD +G+ G DY+ ++ K E + D R ++
Sbjct: 121 IIGKWHMGT-HPIHHPLNRGFDHFFGFLSGGHDYFPAKYNLKDLSEVKRIWDWYRTHLIR 179
Query: 185 DNKTIGI---YSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
D + I + Y TD+ T+AA++ I + ++ KP LYL++ A H P QA ++ +
Sbjct: 180 DRERIQVSEGYLTDILTDAAVDFIDKKASEKKPFMLYLSYNAPHT-----PLQASEKYLK 234
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
+F I D +RRTYA MVS +D+ VG V+A L+KH + E+++V F++DNG P G
Sbjct: 235 RFTHIKDSKRRTYAAMVSAVDDGVGRVMARLKKHKIEEDTLVFFLSDNGGPLKGPMPFT- 293
Query: 301 SNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-D 358
N P K + +GG R A+ W + Q + + D L T+ A ++ +
Sbjct: 294 DNGPFSKGKGSLHEGGTRVPFAVQWKGTIPSGQIYKKPMSSL-DILATITALNKLKPHPK 352
Query: 359 TSLDGVNQWDVLT 371
LDGVN LT
Sbjct: 353 RPLDGVNIIPYLT 365
>gi|336424342|ref|ZP_08604383.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336003446|gb|EGN33530.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 460
Score = 189 bits (479), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 128/384 (33%), Positives = 188/384 (48%), Gaps = 30/384 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+ P++I ILADD+G+ D TPN+D L G ++ Y + +C P+R+AL+T
Sbjct: 6 QHPNVIYILADDMGYGDFGIFSDGSARTPNLDRLVRQGCAMSHCYAASPVCAPARAALLT 65
Query: 87 GKYPIHIGMQHGVILEGEPWG---LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
G+YP + G + E G + L E L + GY T IGKWHLG + Y P
Sbjct: 66 GRYP----HRTGAVDTYEAIGGDRMALREVTLADVYRANGYRTGLIGKWHLGLIGKEYHP 121
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD+ G+ G DYY + LD ++ + G Y TD+ TE +I
Sbjct: 122 CRRGFDTFIGFRGGWSDYYQYK----------LDRNGILEASD---GTYMTDVITEESIR 168
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
I E N+ +P FL+ A+ A H PFQ P+E A F +P T GM++ +DE
Sbjct: 169 FIRE-NREQPFFLHAAYNAPHF-----PFQCPEEYAAPFRKRFNPTLATLYGMIACMDEG 222
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
+G ++ L + G+ +N+IV+F +DNG +G +N+ N L G K ++GG+R A I
Sbjct: 223 IGKLLDVLEETGLSDNTIVIFASDNGPQLYG-DTNR-YNCYLNGQKGEAFEGGIRVPAVI 280
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEIL 382
P FH DW PTL +A +E+ D SLDG N+ D + G+ +
Sbjct: 281 RWPGHIAPDSRCHAFFHGVDWFPTLLSACRLELPDGVSLDGKNRLDEILLGSSSPSEPYY 340
Query: 383 HNIDNVDNPQKYYAALRVDDLKYV 406
+ K AA+R K V
Sbjct: 341 WQWNRFTPVSKCNAAVRCGKWKLV 364
>gi|86142609|ref|ZP_01061048.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
gi|85830641|gb|EAQ49099.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
Length = 477
Score = 189 bits (479), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 134/402 (33%), Positives = 194/402 (48%), Gaps = 43/402 (10%)
Query: 6 KYFFALTCTLLFNDAF-LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
K+ AL +L ++ KP+ +II ADD+G+ D+S +G I TPN+D +A
Sbjct: 7 KFKLALFIAILLGGTITISAQIEDKPNFVIIFADDMGYGDLSSYGHPTIKTPNLDKMAME 66
Query: 65 GLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLK 120
G YV ++CTPSR+A+MTG+ P+ GM V +PW GLP +E + + LK
Sbjct: 67 GQKWTNFYVGSSVCTPSRAAIMTGRLPVRNGMTSKVHRVLQPWSTNGLPQSEITIAEQLK 126
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL---------------QDYYDHS 165
+ GY T A GKWHLG +E Y P GFD YW G+ QDY++
Sbjct: 127 KGGYGTAAFGKWHLG-DKEEYLPVNNGFD----YWYGVPYSNDMDITLEMNEPQDYWNFW 181
Query: 166 CKATFEPYQGLD---MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLA 222
+ + MR+ ++ T YT AI I EH K P F+YLA+
Sbjct: 182 KSEERKEISSFNIPLMRNTEIIERPANQHTITKRYTNEAIEYIKEH-KENPFFIYLAYNL 240
Query: 223 VHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
H P A +E K +R Y +V +D +VG V+ AL++ G+ EN+IV
Sbjct: 241 PHV-----PLFASEEFEGK------SKRGLYGDVVEEIDHNVGRVMEALKEQGIAENTIV 289
Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
+F +DNG P + GS LR K + W+GGMR WSP + +EL
Sbjct: 290 VFTSDNG-PWLVMDEEGGSAGLLRDGKGSTWEGGMREPGIFWSPG-NLKPGIVTELGSTM 347
Query: 343 DWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
D T AG+E+ +D LDG + L +G ++ R + +
Sbjct: 348 DLFTTFSKLAGVELPDDRILDGEDLSSTLFEGKESPRETMFY 389
>gi|149197521|ref|ZP_01874572.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139539|gb|EDM27941.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 188 bits (478), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 131/360 (36%), Positives = 192/360 (53%), Gaps = 23/360 (6%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGS-SQIPTPNIDALAYNGLILNQHYV 73
L F A L+ +A K P+II+ILADDLG+ DVS+HG+ + TP+ID++A +G Y
Sbjct: 9 LCFVLASLSASAAK-PNIIVILADDLGYGDVSYHGTLKETTTPHIDSIAQSGAWFQNGYS 67
Query: 74 QA-LCTPSRSALMTGKYPIHIGMQHGV----ILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
A +C PSR+ L++G+Y G + + + GLPL++KL+P+ L + GYAT
Sbjct: 68 AAPVCGPSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQKLIPEILVKEGYATGM 127
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQG-LQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWH G + + P RGF YG+ G + ++ T + + G R N +V+N
Sbjct: 128 VGKWHDGD-QHKFWPYNRGFQEFYGFNNGAINNWVLKGENHTVDEW-GAVHRENKRVENS 185
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
G Y T+ + A+ I H K++P FLYL+ AVH P QAP +F I
Sbjct: 186 --GEYMTEAFGREAVEFIDRH-KTEPFFLYLSFNAVHG-----PLQAPKSYTNQFKHIKP 237
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
R M+ +D+++G V+ LRK G+ EN+I+ F +DNG G +S G RG
Sbjct: 238 ENRALCLAMLKSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLKGNYSFNGK---YRG 294
Query: 308 MKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVN 365
K+T +DGG+ A+ W + K H D T+ AAAG+EI D LDG N
Sbjct: 295 EKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAGVEIKDEYKLDGRN 354
>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
Length = 471
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 128/376 (34%), Positives = 191/376 (50%), Gaps = 46/376 (12%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F ++ C F ++A +KP+II + +DD G+ D F GS+++ TPN+D LA +G+
Sbjct: 7 FLSICC-------FSISSAQEKPNIIFLFSDDAGFADFGFQGSTEMKTPNLDKLANSGVK 59
Query: 68 LNQHYV-QALCTPSRSALMTGKYPIHIGMQH----GVILEGEPW-----GLPLTEKLLPQ 117
Q YV A C PSR+ L+TGKY G + G + E + GLPL + +
Sbjct: 60 FTQGYVTDATCGPSRAGLITGKYQQRFGYEEINVPGYMSENSKFLADDMGLPLDQLTIGD 119
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
YLK+ GY T GKWHLG + + P RGFD YG+ G + Y+ + ++
Sbjct: 120 YLKKLGYNTAMYGKWHLG-NADRFHPMNRGFDEFYGFRGGARSYFGYDVASS-------- 170
Query: 178 MRHNMQVDNKTIGI-----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
H+ +++ Y TD + AI+ I E NK P F+YLA AVH P
Sbjct: 171 AHHDTKMERGFGNFEEPQEYVTDALADEAISFI-EKNKKNPFFIYLAFNAVHT-----PM 224
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
+A E++ +F +++ +R+ A M LD + G V+ L++ G+ +N+I++F DNG P+
Sbjct: 225 EATKEDLNQFPNLTG-KRKELAAMTLALDRACGRVLDKLKQLGLDKNTIIVFSNDNGGPT 283
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCA 350
N N PL G KS +GG+R W K Q ++ F IS D LPT A
Sbjct: 284 ---DKNASLNLPLSGTKSNHLEGGIR--VPFLMSWPKHIQPKTTYTFPISTMDLLPTFYA 338
Query: 351 AAGIEIND-TSLDGVN 365
A G D +DG+N
Sbjct: 339 AGGGNTKDLQEIDGIN 354
>gi|251798133|ref|YP_003012864.1| sulfatase [Paenibacillus sp. JDR-2]
gi|247545759|gb|ACT02778.1| sulfatase [Paenibacillus sp. JDR-2]
Length = 434
Score = 188 bits (477), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 126/372 (33%), Positives = 195/372 (52%), Gaps = 29/372 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K+P+II+ DDLG+ D+ +GS + TP++D LA G+ Y + +C+PSR++L+T
Sbjct: 2 KRPNIIVFYCDDLGYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLLT 61
Query: 87 GKYPIHIGMQHGVILEGEPW--GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
GKYP G+ IL G+ GL L + L LKE GY T GKWHLG E Y P
Sbjct: 62 GKYPAKAGVTS--ILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASAE-YGPN 118
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAA 201
GFD YG+ G DYY H + G++ H++ + + G Y T+ T A
Sbjct: 119 AHGFDQFYGFRAGCIDYYSHIF--YWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREA 176
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+ I +P F+Y+A+ A H P AP + +F D+ P+RR A M++ +D
Sbjct: 177 TSYIDAAPDDEPYFMYVAYNAPHY-----PMHAPKAYLDRFPDLP-PDRRIMAAMIAAVD 230
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS---------NKGSNHPLRGMKSTP 312
+ VG ++ AL++ G E++I+ F +DNG PS + GS RG K++
Sbjct: 231 DGVGEIVKALKQKGAYEDTIIFFSSDNG-PSTESRNWLDGTEDLYYGGSAGRFRGHKASL 289
Query: 313 WDGGMRGVAAIWSP--WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
++GG+R A + P +Q ++S E+F + D PT+ +GI SLDG + +D L
Sbjct: 290 FEGGIREPAILSYPAGLAEQQGQISDEMFAMMDIFPTMLELSGIGTEGYSLDGHSVFDAL 349
Query: 371 TKGAKTKRSEIL 382
+ A + R ++
Sbjct: 350 SGNALSPRKQLF 361
>gi|414070343|ref|ZP_11406329.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410807260|gb|EKS13240.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 469
Score = 188 bits (477), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 125/354 (35%), Positives = 184/354 (51%), Gaps = 20/354 (5%)
Query: 19 DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALC 77
++F + +KP+I++ILADDLG+ D+ F GS +I TPNIDALA NG YV C
Sbjct: 6 NSFNVNASEQKPNIVVILADDLGYGDLGFTGSKEIKTPNIDALASNGTRFKNAYVTHPYC 65
Query: 78 TPSRSALMTGKYPIHIGMQHGV--ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
PSR L+TG+Y +GM++ V + + + GLPL+E L++ GY T GKWHLG
Sbjct: 66 GPSRVGLLTGRYQARLGMENNVSYMPQDKYMGLPLSENTFANRLQDVGYHTSVFGKWHLG 125
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
+ P RGFD YG+ G +Y + Y L + N QV Y T
Sbjct: 126 GAPH-FQPNKRGFDYFYGFLDGGHNYMPDQVTVGGDGYS-LPLMRNTQV--TEFDEYLTT 181
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
+ A+ I +S P F+YL++ A H P QAP E + K+ I D +RR YA
Sbjct: 182 ALSRDAVKYIHRQQES-PFFMYLSYNAPHT-----PLQAPAEYIEKYKHIEDEDRRVYAA 235
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA---PSFGIHSNKGSNHPLRGMKSTP 312
M+ +DE +G V+ AL+ L+N++++F++DNG S+ S+ N P R K+
Sbjct: 236 MIDVMDEGIGMVVDALKDINQLDNTLIIFLSDNGGVFPESWQPTSDWADNSPFRRGKAAL 295
Query: 313 WDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
+GG+ A W + + ++ + L D T A AG + + LDGVN
Sbjct: 296 LEGGIHVPFIAHWPGVIPKGEEFTG-LVSSLDIAATSVALAGAD--NKKLDGVN 346
>gi|323452297|gb|EGB08171.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 889
Score = 188 bits (477), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 170/600 (28%), Positives = 257/600 (42%), Gaps = 150/600 (25%)
Query: 58 IDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQ 117
++ LA G+ L Y CT SR+AL TGKYP+H+G+Q VI EP G+PL E L Q
Sbjct: 1 MNKLASGGVTLTSFYASPTCTASRAALFTGKYPMHLGLQDSVIHPSEPRGVPLKETFLSQ 60
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE------ 171
LK+AGY T +GKWHLGF + YTP+ RGFD +G G D+ H+ +F
Sbjct: 61 KLKDAGYGTVFVGKWHLGFHQAAYTPSQRGFDEFFGILTGGGDHVAHASTESFAMRGPDY 120
Query: 172 -----PYQGLDMRHNMQ-VDNKTIGI---YSTDLYTEAAINVIAEHNKSK--PMFLYLAH 220
G ++ HN + V G+ +ST LYT + +A++ + P+FL +++
Sbjct: 121 KSKITTMTGSNLWHNGRPVGEDEEGVRDTHSTHLYTAKSRAYVAKYAEMADVPLFLTVSY 180
Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPE------------RRTYAGMVSRLDESVGNVI 268
AVHA P Q PDE V + E RR GMVS +DES+ ++
Sbjct: 181 QAVHA-----PIQVPDEYVDGSAYANGCESLRAGSASATRLRRQLCGMVSMVDESLASIH 235
Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
L+ M EN+++ F++DNG GI + SN LRG K + ++GG+R A + +L
Sbjct: 236 DDLKAEAMWENTLLFFLSDNG----GILRHGSSNLDLRGEKGSYYEGGIRVPAFVSGGYL 291
Query: 329 KQ---TQKVSSELFHISDWLPTLCAAAGIE------------------------------ 355
+ +V L HI+D T AG+
Sbjct: 292 AKLLSAGRVFEGLVHITDLHATALRVAGVSSKEDKRAAPAVAAAKVGHLRRLEPGQQPQS 351
Query: 356 -----INDT--------SLDGVNQWDVLTKGAK---TKRSEILHNIDNVDNPQKYYA--A 397
++D SLDGV+QWD GA ++R+ + HNI N + + A A
Sbjct: 352 YAETALDDVLSHEHLTDSLDGVDQWDAFVSGASAAASQRTHVAHNI----NSELFGAAGA 407
Query: 398 LRVDDLKYVAGTDNNGQSDEWYGD---TDNEIDKYSPKEVLYSK----AGIT--FNALKT 448
LRV + K + YG D+ D+ +V++ K G T F+ +
Sbjct: 408 LRVGEYKLIVEARVTESEIYEYGQHMLQDDNWDQSELSQVIHQKLLRTPGTTSLFDVVAN 467
Query: 449 KLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNI 508
++ ++ D ++ + +L EFA V+
Sbjct: 468 PNELDEEDCDDARSC-----------RNLYDLDEFADVKA-------------------- 496
Query: 509 TDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDEL 568
T L K E + + +ST + + P ADPA + +W PW D+L
Sbjct: 497 --------------TILEKWAELRDSTPESTEIWLDDGPL---ADPALFGGVWTPWRDDL 539
>gi|149197520|ref|ZP_01874571.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139538|gb|EDM27940.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 446
Score = 188 bits (477), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 122/366 (33%), Positives = 188/366 (51%), Gaps = 21/366 (5%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A KP+I+++ ADD+GW DV++HG TP IDA+A G+ Q Y A +C PSR+ +
Sbjct: 16 AADKPNIVLVFADDMGWGDVAYHGVEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSRAGI 75
Query: 85 MTGKYPIHIGMQHGVILEGEP-WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+TG+Y GV+ G+ G+P ++K + + LK AGY + A GKWHLG + + P
Sbjct: 76 LTGRYQ----QLFGVVTNGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQF-P 130
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD+ YG+ G DYY K + + N + + G Y T+ T+ A+
Sbjct: 131 NDRGFDTFYGFHFGAHDYYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVE 190
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
I E NK +P F+Y+A+ +VH+ P+Q PDE +A+ + RR + MV +D+
Sbjct: 191 FI-EENKDQPFFMYVAYNSVHS-----PWQVPDEYLARIPESVPAYRRLFLAMVLAMDDG 244
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS-NKGS-----NHPLRGMKSTPWDGGM 317
VG + A L++ + EN+I +F DNG+P G N+G + RG K ++GG+
Sbjct: 245 VGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRMSMSQGFRGYKGDTYEGGI 304
Query: 318 R-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKT 376
R W +K K + + D PT +AA +E + G + L K
Sbjct: 305 RVPFCMSWPKKIKSGNKFEAPVI-AYDLAPTFLSAASLEYSTKQFSGKDLLPYLEDEQKG 363
Query: 377 KRSEIL 382
+ E L
Sbjct: 364 RPHETL 369
>gi|323451705|gb|EGB07581.1| hypothetical protein AURANDRAFT_27261 [Aureococcus anophagefferens]
Length = 614
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 165/576 (28%), Positives = 264/576 (45%), Gaps = 97/576 (16%)
Query: 19 DAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALC 77
D F +T+A +P++++ + DD G DVS++ ++ + TP + ++ +G +L+ Y C
Sbjct: 32 DEFSSTSADAPRPNMLLTVVDDWGSGDVSYNDAA-LHTPELQRMSEHGFVLDNFYAAPTC 90
Query: 78 TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
TPSR+ LMTG+Y I GMQ VI EP G+PL E+ L Q L +AGY T AIGKWHLG
Sbjct: 91 TPSRAMLMTGRYNIRNGMQDSVIHSTEPRGVPLDERFLSQKLSDAGYRTAAIGKWHLGMH 150
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
R+ YTP RGFD YG G + H K + + D
Sbjct: 151 RDAYTPLKRGFDLFYGILTGGGSHTGHFSKGD------------DKKGDDKKDAKKDDAK 198
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-VAKFLDISDPERRTYAGM 256
A +A K+ P F+YL++ AVH +P D+ V++ IS R T GM
Sbjct: 199 KGGAHRRLAASGKA-PWFVYLSYQAVH-----DPVTVGDDRYVSETAAISTTNRPTLCGM 252
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
++ +D+ + + +L +I + ++DNG G+ ++ +N+PLRG K+ ++GG
Sbjct: 253 IAEIDDGLKTLRLSLENAQAWAQTIFVVLSDNG----GVRAHGSANYPLRGEKAQYYEGG 308
Query: 317 MRGVAAIWSPWLKQTQK--------VSSELFHISDWLPTLCAAAGIEI--NDTSLDGVNQ 366
++ A + ++++ + S+ L HI+D T + AG E +D LDGV+
Sbjct: 309 VKVPAVLAGGYVEEALRRAGGAERLRSASLAHITDVHATFLSLAGYEGGDDDKPLDGVSL 368
Query: 367 WDVLTK------GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
+ L + GA RS++L NI++ A+RV D K + + N S
Sbjct: 369 FSHLVQSASGVAGAAPPRSDVLININSALFAGS--GAIRVGDHKLMVNPEPNESS----- 421
Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
+YSK +++ AA A +D ++I+
Sbjct: 422 --------------IYSK-------------VRKALAAQKGAVPTDTFQSIV-------- 446
Query: 481 REFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTM 540
+ + D GA S LF++ +P E+ + A S L D +A Y
Sbjct: 447 -----AQAHKDALGAPSIS-----LFDLKKNPYERTDCATSDPAESCNLYD-VAAYADQE 495
Query: 541 VPPGNKPFDKR---ADPARWNNIWVPWYDELDKQKA 573
P ++ ADPA + ++W PW D D KA
Sbjct: 496 AQPSTFAWEDDGPLADPALFGSVWAPWRDGDDTPKA 531
>gi|323454325|gb|EGB10195.1| hypothetical protein AURANDRAFT_62802 [Aureococcus anophagefferens]
Length = 2048
Score = 187 bits (476), Expect = 1e-44, Method: Composition-based stats.
Identities = 133/397 (33%), Positives = 199/397 (50%), Gaps = 61/397 (15%)
Query: 16 LFNDAFLN---TTAPKKPHIIIILADDLGWNDVSFHGSS-QIPTPNIDALAYNGLILNQH 71
L + A+L+ + ++ +I++ILADD+G+NDV + + TP +D+LA G+ L +
Sbjct: 180 LLDTAYLDELPSATTQQQNILLILADDMGYNDVGYQSTDLGAATPTLDSLASRGVELASY 239
Query: 72 YVQALCTPSRSALMTGKYPIHIGMQH---GVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
Y CTP+RS+L T Y + GM + G + +GLPL L+ + LK GYATH
Sbjct: 240 YTLPSCTPARSSLFTANYAANNGMGYDGPGTYVIDSSYGLPLGATLMSEVLKGRGYATH- 298
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL---DMRHNMQVD 185
++ Y+ H T +P G D+ + VD
Sbjct: 299 -----------------------------IETYFSHEAYGT-DPNNGTSYCDLMRSNSVD 328
Query: 186 NKTIG-----IYSTDLYTEAAINVIAEH--NKSKPMFLYLAHLAVHAGNTYEP---FQAP 235
+G YSTDLY ++V+ EH P+FLY A A HA P AP
Sbjct: 329 GVKVGECYLDEYSTDLYARRLVDVVGEHATTSDSPLFLYFAAQAPHAPLEEPPPMNATAP 388
Query: 236 DEE-VAKFLDISDP--ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
E +A + D +RRT+A MV LD +V V+ AL +L+N++V F +DNG
Sbjct: 389 QEAMLAALVGAGDSWGKRRTFAKMVMNLDNNVRKVVDALAAAKLLQNTVVAFASDNG--- 445
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQ--TQKVSSELFHISDWLPTLCA 350
G + GSN PLRG K + ++GG+R A +WS L++ V LFH++DW PTL A
Sbjct: 446 -GCPRDGGSNWPLRGTKFSQYEGGVRVPAFLWSSTLQKRGAGTVFEGLFHVADWFPTLVA 504
Query: 351 AAGIEINDTSLDGVNQWDVLTKGAKTK-RSEILHNID 386
+AG+ +DGVNQ+D L + R E+L +++
Sbjct: 505 SAGVRATVGKIDGVNQYDALFDATQNAPRDEVLIHLN 541
>gi|150008923|ref|YP_001303666.1| secreted sulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937347|gb|ABR44044.1| putative secreted sulfatase precursor [Parabacteroides distasonis
ATCC 8503]
Length = 452
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 143/420 (34%), Positives = 218/420 (51%), Gaps = 39/420 (9%)
Query: 6 KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
KY C L F+ A T A P P+I++IL DDLG+ D+S G+++ I TP+ID L
Sbjct: 3 KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62
Query: 64 NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
G+ N H + +PSR++L+TG+YP +G+ GVI + + WG L LLPQ L
Sbjct: 63 EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
K+ GY +GKW+LG TPT RGFD + G+ + D YY H + M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174
Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
R N+ ++D + ++T+++++ AI +++ K +P FLYLA+ A H P Q P
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226
Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
E E K + S PE+R ++ LD +VG V AL+K+G LEN+I++F +DNG
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNG---- 282
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAA 352
G +N P RG K ++GG+R I+ W Q + V +SD PTLC
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLT 340
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKYVAGT 409
+ ++ +DG++ +L + ++H + N + K Y A R D K + T
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKDYKILQNT 399
>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 826
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 132/358 (36%), Positives = 192/358 (53%), Gaps = 33/358 (9%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A +P+I++I+ADDLG++DV F+G +IPTP++D LA +G++ Y C+PSR+ L
Sbjct: 41 AKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100
Query: 85 MTGKYPIHIGMQ---------HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
+TG++ G + HG E P G+PL+E L LKEAGY T AIGKWHLG
Sbjct: 101 LTGRHQQRFGHESNPEPDTQWHG---EDTP-GMPLSETTLADALKEAGYVTGAIGKWHLG 156
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
+ + P RGFD +G+ G Y+ K +P G+ R + VD KT+ + D
Sbjct: 157 DAKPFW-PNHRGFDEWFGFSGGGFSYWGDLGKK--DPLLGV-HRGDEPVDPKTLTHLTDD 212
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
TE A+ I H +S+P FLYLA+ A HA P A + K I R Y
Sbjct: 213 FSTE-AVKFIQRH-ESEPFFLYLAYNAPHA-----PDHATRAHLQKTAHIEYGGRAVYGA 265
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV+ +DE +G V+ +R+ G+ EN++V+F +DNG N P RG K ++G
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMVIFYSDNGG-----RREHAVNFPYRGHKGMLFEG 320
Query: 316 GMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
G+R + W ++ K S + + D PT AAAG++ + + LDG N VLT
Sbjct: 321 GIRVPFLVSWPGTVRSGMKEESPITAL-DLFPTALAAAGMDPSQNDKLDGQNLLPVLT 377
>gi|256841083|ref|ZP_05546590.1| arylsulphatase A [Parabacteroides sp. D13]
gi|256736926|gb|EEU50253.1| arylsulphatase A [Parabacteroides sp. D13]
Length = 452
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 141/419 (33%), Positives = 217/419 (51%), Gaps = 37/419 (8%)
Query: 6 KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
KY C L F+ A T A P P+I++IL DDLG+ D+S G+++ I TP+ID L
Sbjct: 3 KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62
Query: 64 NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
G+ N H + +PSR++L+TG+YP +G+ GVI + + WG L LLPQ L
Sbjct: 63 EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR 179
K+ GY +GKW+LG TPT RGFD + G+ + D Y C+ + MR
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTHCR-----FGNNYMR 175
Query: 180 HNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
N+ ++D + ++T+++++ AI +++ K +P FLYLA+ A H P Q P E
Sbjct: 176 ENLKEIDPQG---HATEVFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQE 227
Query: 238 --EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
E K + S PE+R ++ LD +VG V AL ++G LEN+I++F +DNG G
Sbjct: 228 WLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNG----G 283
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAAG 353
+N P RG K ++GG+R I+ W Q + V +SD PTLC
Sbjct: 284 QDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLTA 341
Query: 354 IEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKYVAGT 409
+ ++ +DG++ +L + ++H + N + K Y A R D K + T
Sbjct: 342 VPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKDYKILQNT 399
>gi|149177395|ref|ZP_01855999.1| arylsulfatase A [Planctomyces maris DSM 8797]
gi|148843728|gb|EDL58087.1| arylsulfatase A [Planctomyces maris DSM 8797]
Length = 474
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 138/395 (34%), Positives = 194/395 (49%), Gaps = 35/395 (8%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
KY L L + + A P++I+I ADDLG+ D+ G I TP +D +A G
Sbjct: 13 KYASPLIALFLLLTSVADLNAAPGPNVIVIYADDLGYGDLGCFGHPTIKTPALDQMAAEG 72
Query: 66 LILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKE 121
+ Q Y A +CTPSR+AL+TG+YPI GM + V+ G+P +E L + LK
Sbjct: 73 MKFTQFYSAAPVCTPSRAALLTGRYPIRSGMCSDKRRVLFPNSGGGIPASEVTLAEALKA 132
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYD--HSCKATFEP---YQ 174
AGY T +GKWHLG + + PT GFDS++G Y + D H +P +
Sbjct: 133 AGYKTACVGKWHLGHLPQ-FLPTNNGFDSYFGIPYSNDMDRVADRKHGRSIFLKPEVKFW 191
Query: 175 GLDMRHNMQV-----DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
+ + N +V D TI T YTE AI +I + NK +P F+YLAH H
Sbjct: 192 NVPLMRNTEVVELPADQTTI----TKRYTEEAIKLI-QQNKQQPFFIYLAHNMPHV---- 242
Query: 230 EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
F++P+ F D S R Y ++ +D SVG V+ L+ G+ +N+IV F +DNG
Sbjct: 243 PLFRSPE-----FQDKS--RRGLYGDVIEEIDWSVGEVLKELKTQGLDQNTIVWFCSDNG 295
Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
P + GS LR K + W+GGMR W P + VS EL D T
Sbjct: 296 -PWLIFNEQGGSAGLLRDGKGSTWEGGMREPTIAWWPGHIPSGSVSQELGSTMDIYTTSI 354
Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
AG + D +DGV+ VL K + R E+ +
Sbjct: 355 KLAGGSVPTDQVVDGVDLTPVLLKKENSPREELAY 389
>gi|422293430|gb|EKU20730.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
gi|422295486|gb|EKU22785.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
Length = 703
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 140/401 (34%), Positives = 210/401 (52%), Gaps = 42/401 (10%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIP----TPNIDALAYNGLILNQHYVQALCT 78
N K PH+++I+ DD G DV ++ S + P TP +D+LA + L ++YV +CT
Sbjct: 55 NPQIAKPPHVLMIVVDDAGVQDVGYNASPESPLRGKTPVLDSLAAESVRLKEYYVHPVCT 114
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK-EAGYATHAIGKWHLGFF 137
P+R+AL+TG+Y +++GM +I + GL + LP+ LK EA Y+TH +GKWHLG
Sbjct: 115 PTRAALLTGRYAVNVGMPFPLIGDAIS-GLDGSIPTLPEMLKSEANYSTHLVGKWHLGAA 173
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
+ P RGFDS YG D+Y + ++ +VD K ++T L+
Sbjct: 174 KAKNRPLARGFDSFYGLLGASFDHYTKKMGEVRDLWKNEAEVPAKEVDEKE---HATTLF 230
Query: 198 TEAAINVIAEHNKS----------KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-DIS 246
+ A+ VI EH+ P+FLYLA+ A HA P QA DE+ K D+
Sbjct: 231 SREAVKVIEEHSARGHAGAKDGDMDPLFLYLAYSAPHA-----PLQA-DEKFMKLCSDVP 284
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
+ RRT+ M+++LDE +GNV A+L+ HGM E+++V+F +DNG G G N+ R
Sbjct: 285 NRHRRTFCAMMAQLDEGIGNVTASLKAHGMWEDTVVVFTSDNG----GYFLVGGFNYNYR 340
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE---LFHISDWLPTLCAAA-----GIEIND 358
G KS W+GG+R A I +P K+ + + H++DW+PTL A G E
Sbjct: 341 GGKSGGWEGGVRVPAFIRAP--KRYNFAPRDFRGMAHVADWMPTLMGMATGREEGRERMG 398
Query: 359 TSLDGVNQWDVLTKGAKTK--RSEILHNIDNVDNPQKYYAA 397
D + +G K K R+E L +D N Y +
Sbjct: 399 DGYDLRKAFRGDEEGGKEKSPRNEALLQLDIFQNTSAYRSG 439
>gi|405952520|gb|EKC20320.1| Arylsulfatase B [Crassostrea gigas]
Length = 500
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 131/415 (31%), Positives = 199/415 (47%), Gaps = 102/415 (24%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A +KP+II I DD+G ND+ ++ + ++ TPN+D LA NG+IL +YV +C+PSR+A M
Sbjct: 26 ANQKPNIIFIAVDDMGNNDIGYN-NPEVDTPNLDNLANNGVILESNYVYPVCSPSRAAFM 84
Query: 86 TGKYPIHIGMQHGVILEGEPWGLP-----LTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
TG+Y IG Q G + +P + + EKL Y GYA H IGKWHLG+ ++
Sbjct: 85 TGRYAHKIGFQRGPVEHKQPAYIESNYKTVAEKLTTNY----GYAAHMIGKWHLGYCKDA 140
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV--------DNKTIGIY 192
TPT RGFDS YG++ G ++YY T+ + D R N+ + + Y
Sbjct: 141 VTPTNRGFDSFYGFYGGQENYY------TYTSARYKDFRDNLTAVTPQNPNYPREDVDGY 194
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL--------- 243
ST Y + AI ++ H+KS P+FLYLA A H+ P QAP E K+
Sbjct: 195 STFEYKKRAIEIVGNHDKSVPLFLYLAFQAPHS-----PLQAPQELKDKYPGDWTYTDWD 249
Query: 244 -DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
R T+ MV+ +D++VG V + G+L+N+I++ + H+
Sbjct: 250 GSTKTTSRATFYAMVTAVDQAVGEVYTEMETQGLLDNAIIVLL----------HA----- 294
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG------IEI 356
S WL PT + AG +I
Sbjct: 295 ----------------------SDWL-----------------PTFISLAGGSADLTTDI 315
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRV---DDLKYVAG 408
+ TS+DGV+Q +L ++ RS+I+ NI +++ P Y ++ + KYV G
Sbjct: 316 DLTSIDGVDQTQMLLNNGESARSKIILNIYDLERPSTIYGIVKQYGPEVWKYVQG 370
>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 491
Score = 186 bits (473), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 138/402 (34%), Positives = 201/402 (50%), Gaps = 36/402 (8%)
Query: 1 MTWARKYFFA-LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
MTW R A C L A L +A ++P+I+IILADDLG++D+ + GS++I +P ID
Sbjct: 1 MTWFRIQLLATFACALT---AVL--SADERPNILIILADDLGYSDLGYTGSTEIESPVID 55
Query: 60 ALAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILE--GEPWGLPLTEKLLP 116
LA NG+I YV C PSR+ L+TG++ GM+ + GLP+ E
Sbjct: 56 KLANNGVIFANGYVTHPYCGPSRAGLITGRHQARFGMEINATYSPFDQHMGLPVDEPTFA 115
Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE----- 171
+ L+ AGY T IGKWHLG + + P RGFD YG+ G DY+ S E
Sbjct: 116 KRLQPAGYRTGIIGKWHLGAAPQFH-PNNRGFDYFYGFLSGGHDYFPESVNTHLELVLPN 174
Query: 172 --PYQGLDMRHNMQV-DNKTIGIY----STDLYTEAAINVIAEHNKSKPMFLYLAHLAVH 224
P G + + + NK + +T L +AA V + +P LYLA+ A H
Sbjct: 175 GKPNYGANEGTLLPLLRNKNAAEFDDYLTTALSKDAARFVTSSE---QPFCLYLAYNAPH 231
Query: 225 AGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
P QAP E +AK+ I DP+RR YA M+ +D +G V+ AL + G L+N+++ F
Sbjct: 232 T-----PLQAPKETIAKYSHIKDPKRRIYAAMIDEMDAGIGLVVDALEQSGKLDNTLIFF 286
Query: 285 MADNGA---PSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFH 340
++DNG ++ +S+ N P + K++ ++GG A W LK + +
Sbjct: 287 LSDNGGCVPQAWDSYSDFADNGPFKDGKTSFYEGGTHVPFIAHWPAKLK--PQTYDNIIS 344
Query: 341 ISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
D T A AG + + LDGVN LT + E L
Sbjct: 345 SLDLAATAVALAGGDDSGKPLDGVNLIPFLTGETTGQPHEAL 386
>gi|348550278|ref|XP_003460959.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cavia porcellus]
Length = 502
Score = 186 bits (473), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 181/363 (49%), Gaps = 26/363 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGK 88
P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+TG+
Sbjct: 10 PNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGR 69
Query: 89 YPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
PI G ++ + G+P +E+LLPQ LKEAGYAT +GKWHLG R +
Sbjct: 70 LPIRNGFYTTNGHARNAYTPQEIVGGIPDSERLLPQLLKEAGYATKIVGKWHLGH-RPQF 128
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLY 197
P GFD +G YD+ + Y+ DM ++ KT T +Y
Sbjct: 129 HPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRNWDMVGRFYEEFPINVKTGESNLTQIY 188
Query: 198 TEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
+ A++ I + ++ P FLY A A HA P FL IS +R Y
Sbjct: 189 LQEALDFIRQQQAAQHPFFLYWAVDATHA---------PVYASRPFLGIS--QRGRYGDA 237
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V +D+SVG +++ LR + ENS V F +DNGA S GSN P K T ++GG
Sbjct: 238 VREIDDSVGKILSLLRTLSIAENSFVFFTSDNGAALISAPSQGGSNGPFLCGKQTTFEGG 297
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAK 375
MR A W P +VS +L I D T A AG++ +D +DG+N + +G
Sbjct: 298 MREPAIAWWPKHIPAGQVSHQLGSIMDLFTTSLALAGLKPPSDRVIDGLNLLPTMLQGQL 357
Query: 376 TKR 378
R
Sbjct: 358 MDR 360
>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 525
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 195/386 (50%), Gaps = 43/386 (11%)
Query: 11 LTCTLLFNDAFLNTTAP----------KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDA 60
++CTL+ A L + ++P+I++++ADD+GW DV + G+ + TPNIDA
Sbjct: 3 VSCTLVSVAALLPFSGSNAGNVQRDKSQRPNIVLVIADDMGWGDVGYQGAVDVSTPNIDA 62
Query: 61 LAYNGLILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWG-LPLTEKLLPQY 118
LA G+ +Q YV ++ PSR+ ++TG Y G + + PW +P + L +
Sbjct: 63 LARRGVQFSQGYVSCSISGPSRAGILTGVYQQRFGFYNNL----HPWAKIPEGQSTLGEM 118
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
+++ GYAT +GKWH+ E +P RGFD YG+W DYY + K E Y +
Sbjct: 119 VRDCGYATGFVGKWHMADSPE-QSPNRRGFDQFYGFWSDTHDYYRSTDKPGVELYDFCPL 177
Query: 179 RHNMQVDN--KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
N ++ G Y TD +T A+ I +H S P L L++ AVH+ P+Q P+
Sbjct: 178 YRNGEIQPPLHESGEYITDCFTREAVEFIDKH-ASSPFLLCLSYNAVHS-----PWQVPE 231
Query: 237 EEVAKF--LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP-SF 293
V + +R+ +A MV LD+ +G V+ +LRK+G+ EN++ + ++DNG+P
Sbjct: 232 HYVNRLEGRRFHHEDRKVFAAMVLALDDGIGRVMESLRKNGLEENTLFILISDNGSPRGQ 291
Query: 294 GIHSNKG------------SNHPLRGMKSTPWDGGMRGVAAI--WSPWLKQTQKVSSELF 339
GI + G S P RG K+ ++GG+R V I W L Q + +
Sbjct: 292 GIECSTGYEYKDRGNTTMSSPGPFRGYKADTYEGGIR-VPYIMSWPSELPQGMVYDNPVI 350
Query: 340 HISDWLPTLCAAAGIEINDTSLDGVN 365
+ + + A G SLDGV+
Sbjct: 351 SLDIFPTVMQAVGGTSRQKYSLDGVS 376
>gi|241601300|ref|XP_002405277.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215502504|gb|EEC11998.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 234
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 97/225 (43%), Positives = 138/225 (61%), Gaps = 4/225 (1%)
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV--DNKT 188
+WHLG++ Y P RGFD+ G++ DYY + TF+ + G D + ++ +K
Sbjct: 12 QWHLGYYDSKYVPVNRGFDTFLGFYSPALDYYTQNF--TFDNHTGHDFWNGDKIYWVDKE 69
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
YST YT + +I HNKS P+FL L+H A H Q P + + F I +
Sbjct: 70 KKQYSTHYYTRKTVQLIQVHNKSTPLFLLLSHQAPHTSGGPTLLQVPKKGIRDFSYIQEE 129
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
R +AGMV LD+SVG+V+ AL+K ML ++I++F DNGA +G SN+GSN PLRG
Sbjct: 130 NRTLFAGMVDALDQSVGSVVDALQKADMLTDTILVFCTDNGALPWGFKSNRGSNWPLRGT 189
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
K T ++GG+R A IWSP + +++VSS+L HI+DWLPTL +AAG
Sbjct: 190 KFTLYEGGVRSTAFIWSPLVNNSKRVSSQLMHITDWLPTLYSAAG 234
>gi|305665421|ref|YP_003861708.1| arylsulfatase A [Maribacter sp. HTCC2170]
gi|88710176|gb|EAR02408.1| arylsulfatase A [Maribacter sp. HTCC2170]
Length = 469
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 130/381 (34%), Positives = 190/381 (49%), Gaps = 31/381 (8%)
Query: 19 DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LC 77
D F + P+ III ADDLG+ D+S +G+ I TPN+D +A+ G YV A +C
Sbjct: 20 DIFGQKANQQTPNFIIIFADDLGYGDLSTYGNPTIHTPNLDRMAHEGQKWTNFYVGASVC 79
Query: 78 TPSRSALMTGKYPIHIGMQHG---VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
TPSR+AL+TG+ P+ GM V+ GLP E L + LK AGY+T AIGKWHL
Sbjct: 80 TPSRAALLTGRLPVRSGMASSKTRVLFPNSKNGLPANEITLAEQLKTAGYSTAAIGKWHL 139
Query: 135 GFFREVYTPTFRGFDSHYGY-------WQGLQDY----YDHSCKATFEPYQGLDMRHNMQ 183
G +E Y PT GFD ++G + G +Y ++ E Y +++
Sbjct: 140 G-HKEQYLPTNNGFDYYFGIPYSNDMDYIGKANYKNLAQNNPSDIKIENYNVPLLQNTKI 198
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
++ T T YT+ I I E NK +P F+YLAH H F + + E
Sbjct: 199 IERPTNQNTITKRYTQETIKFIKE-NKREPFFVYLAHSLPHI----PLFTSKEFE----- 248
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
+ Y +V +D VG ++ AL + G+ +N+IV+F +DNG P ++ GS
Sbjct: 249 --GHSKNGIYGDVVEEIDHGVGEILNALEEEGLDQNTIVVFTSDNG-PWLPFKTHGGSAG 305
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLD 362
LR K + W+GGMR W P L + + + +++ D T AG+EI ND +D
Sbjct: 306 LLRAGKGSTWEGGMREPGIFWGPGLVK-KGLVTDMGSTMDLFTTFSTMAGVEIPNDRIID 364
Query: 363 GVNQWDVLTKGAKTKRSEILH 383
G + L G + R IL+
Sbjct: 365 GHDLSKTLRNGDASPRQHILY 385
>gi|417303628|ref|ZP_12090677.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327540049|gb|EGF26644.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 826
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 131/359 (36%), Positives = 189/359 (52%), Gaps = 35/359 (9%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A +P+I++I+ADDLG++DV F+G +IPTP++D LA +G++ Y C+PSR+ L
Sbjct: 41 AKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100
Query: 85 MTGKYPIHIGMQ---------HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
+TG++ G + HG E P G+PLTE L LKEAGY T AIGKWHLG
Sbjct: 101 LTGRHQQRFGHESNPEPDTQWHG---EDTP-GMPLTETTLADALKEAGYVTGAIGKWHLG 156
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
+ + P RGFD +G+ G Y+ K +P G+ R + VD KT+ + D
Sbjct: 157 DAKPFW-PNRRGFDEWFGFSGGGFSYWGDLGKK--DPLLGV-HRGDEPVDPKTLTHLTDD 212
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
TE A+ I H +++P FLYLA+ A HA P A + K I R Y
Sbjct: 213 FSTE-AVKFIQRH-ETEPFFLYLAYNAPHA-----PDHATRAHLQKTAHIEYGGRAVYGA 265
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV+ +DE +G V+ +R+ G+ EN++++F +DNG N P RG K ++G
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFEG 320
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
G+R W + E I+ D PT AAAG++ + + LDG N VLT
Sbjct: 321 GIR--VPFLVSWPGTVRSGMKEELPITALDLFPTALAAAGMDPSRNDKLDGQNLLPVLT 377
>gi|298375849|ref|ZP_06985805.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|423330336|ref|ZP_17308120.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
CL03T12C09]
gi|298266886|gb|EFI08543.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|409231952|gb|EKN24800.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
CL03T12C09]
Length = 452
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 142/420 (33%), Positives = 217/420 (51%), Gaps = 39/420 (9%)
Query: 6 KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
KY C L F+ A T A P P+I++IL DDLG+ D+S G+++ I TP+ID L
Sbjct: 3 KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62
Query: 64 NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
G+ N H + +PSR++L+TG+YP +G+ GVI + + WG L LLPQ L
Sbjct: 63 EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
K+ GY +GKW+LG TPT RGFD + G+ + D YY H + M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174
Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
R N+ ++D + ++T+++++ AI +++ K +P FLYLA+ A H P Q P
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226
Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
E E K + S PE+R ++ LD +VG V AL ++G LEN+I++F +DNG
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNG---- 282
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAA 352
G +N P RG K ++GG+R I+ W Q + V +SD PTLC
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLT 340
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKYVAGT 409
+ ++ +DG++ +L + ++H + N + K Y A R D K + T
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKDYKILQNT 399
>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 826
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 128/358 (35%), Positives = 192/358 (53%), Gaps = 33/358 (9%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A +P+I++I+ADDLG++DV F+G +IPTP++D LA +G++ Y C+PSR+ L
Sbjct: 41 AKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100
Query: 85 MTGKYPIHIGMQ---------HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
+TG++ G + HG E P G+PL+E L LKEAGY T AIGKWHLG
Sbjct: 101 LTGRHQQRFGHESNPEPDTQWHG---EDTP-GMPLSETTLADALKEAGYVTGAIGKWHLG 156
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
+ + P RGFD +G+ G Y+ K +P G+ R + V+ KT+ + D
Sbjct: 157 DAKPFW-PNHRGFDEWFGFSGGGFSYWGDLGKK--DPLLGV-HRGDEPVEPKTLTHLTDD 212
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
TEA + + N+++P FLYLA+ A HA P A + K I R Y
Sbjct: 213 FSTEAVKFI--QRNETEPFFLYLAYNAPHA-----PDHATRAHLQKTTHIEYGGRAVYGA 265
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV+ +DE +G V+ +R+ G+ EN++++F +DNG N P RG K ++G
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFEG 320
Query: 316 GMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
G+R + W ++ K S + + D PT AAAG++ + + LDG N VLT
Sbjct: 321 GIRVPFLVSWPGTVRSGMKEESPITAL-DLFPTALAAAGMDPSQNDKLDGQNLLPVLT 377
>gi|6863178|gb|AAF30403.1|AF109925_1 sulfatase 2 precursor [Helix pomatia]
Length = 266
Score = 185 bits (469), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 98/239 (41%), Positives = 140/239 (58%), Gaps = 35/239 (14%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
++P+I+I++ADD G+ D+ +HG+ + TPN+D LA G+ L +YVQ +C+P+RS LMTG
Sbjct: 26 QQPNIVIVVADDYGYRDIGYHGA-EFATPNLDKLAAEGVKLENYYVQPICSPTRSQLMTG 84
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P+GLPL + LK GY+THAIGKWHLG +++ YTP +RG
Sbjct: 85 RYQIHTGLQHDIIWPSQPYGLPLQFPTIADMLKSVGYSTHAIGKWHLGLYKKEYTPLYRG 144
Query: 148 FDSHYGYWQGLQDYY---------------DHSCKATFEP-------------YQGLDMR 179
FDS+YGY +G +DYY D S ++ P + G D+R
Sbjct: 145 FDSYYGYLEGGEDYYTYYNCDTFHNRTTPADTSILESYSPKNILLGKHEDENKWCGYDLR 204
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ G YST LYT+ AI++I KP LYLA+ AVH+ P + P E
Sbjct: 205 DMNEPVTDMNGTYSTHLYTKKAIDIINGASTGGKPFLLYLAYQAVHS-----PMEVPAE 258
>gi|284040883|ref|YP_003390813.1| sulfatase [Spirosoma linguale DSM 74]
gi|283820176|gb|ADB42014.1| sulfatase [Spirosoma linguale DSM 74]
Length = 453
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 128/359 (35%), Positives = 200/359 (55%), Gaps = 43/359 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
KKP+I++ILADDLG+ D+S +G+ I TP+ID+L G+ + Y ++C+PSR+AL++
Sbjct: 27 KKPNILLILADDLGYGDLSSYGAPDIRTPHIDSLVRAGMRFSHFYANSSVCSPSRAALLS 86
Query: 87 GKYPIHIGMQHGVI--LEGEPWG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
G+YP +G+ GVI + + WG L + LLP LK+ GY T +GKWHLG P
Sbjct: 87 GRYPEQVGVP-GVIRTMPDDNWGYLSPSAVLLPSILKKNGYYTALVGKWHLG-LEPPNLP 144
Query: 144 TFRGFDSHYGYWQG--LQDYYDHSCKATFEPYQGLDMRHN---MQVDNKTIGI--YSTDL 196
RGFD +G+ +G + DYY H +RH+ M+++ +TI ++TDL
Sbjct: 145 NDRGFDLFHGF-EGDMMDDYYTH-------------LRHDRNYMRLNRQTINPQGHATDL 190
Query: 197 YTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL----DISDPERR 251
+T+ A + + + +S P FLYLA+ A H +P Q P + +AK IS+ +R
Sbjct: 191 FTQWATDYLEQRAGQSNPFFLYLAYNAPH-----DPIQPPADWLAKVKARQPGISE-KRA 244
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
G++ +D+ +G VI LR G+ EN++++F++DNG F + +N PLR K
Sbjct: 245 KLVGLIEHMDDGIGKVIQTLRAKGLYENTLIVFVSDNGGKLF----DGATNGPLRSGKGH 300
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
++GG+R A + P Q S + + D PTL A G IN +DG + +L
Sbjct: 301 MYEGGIRIPACVVWPGKVAAQSQSQQPLLLMDIFPTLAEATGTVIN-YPIDGRSFLSIL 358
>gi|311746665|ref|ZP_07720450.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|311302556|gb|EAZ82500.2| sulfatase family protein [Algoriphagus sp. PR1]
Length = 465
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 137/392 (34%), Positives = 198/392 (50%), Gaps = 30/392 (7%)
Query: 4 ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
R F T L+ N + + +KP+ III DD G+ DV G I TPN+D +A
Sbjct: 6 VRSTLFLFTFLLIANYSLAQS---QKPNFIIIFTDDQGYGDVGTFGHPTIKTPNLDQMAM 62
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
G YV A +CTPSRSA+MTG+ P+ GM V+ GLP TE + + L
Sbjct: 63 EGQKWTNFYVAANVCTPSRSAIMTGRLPVRTGMYSNTRRVLFPDSGGGLPATENTIAKLL 122
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATF-----EP 172
K +GY+T AIGKWHLG E Y PT GFD+++G Y + D + + F E
Sbjct: 123 KTSGYSTAAIGKWHLGHLPE-YLPTSHGFDTYFGIPYSNDMDRINDVTAQEAFASPKPEY 181
Query: 173 YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
+ MR ++ T YTE A++ I + NK +P F+YLAH H P
Sbjct: 182 FNVPLMRDKEIIERPADQTTITKRYTEEAVSYI-KANKDQPFFIYLAHSLPHI-----PL 235
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
A ++ + K ER Y ++ +D SVG +++ L+ G+ +N+ V+F +DNG P
Sbjct: 236 FASEDFLTK------SERGLYGDVIEEIDWSVGQILSTLKSEGIDKNTYVIFTSDNG-PW 288
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
+ GS+ L G K T ++GG+R IW P + Q V S++ D LPT+ + +
Sbjct: 289 LVYNEQGGSSGGLFGGKGTSYEGGVRVPTIIWGPGNIKPQ-VVSKIGSTLDLLPTISSLS 347
Query: 353 GIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
G EI ND DG + VL K+ R E+ +
Sbjct: 348 GTEIPNDRIYDGYDLSPVLRGENKSPREEMFY 379
>gi|410628682|ref|ZP_11339400.1| arylsulfatase B [Glaciecola mesophila KMM 241]
gi|410151686|dbj|GAC26169.1| arylsulfatase B [Glaciecola mesophila KMM 241]
Length = 510
Score = 184 bits (468), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 120/352 (34%), Positives = 190/352 (53%), Gaps = 27/352 (7%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
A ++P+I++ILADDLG+ D+ F GS +I TPNIDALA NG++ YV C PSR+
Sbjct: 52 VAKERPNIVVILADDLGYADLGFTGSKEIFTPNIDALANNGVVFKNGYVTHPYCGPSRAG 111
Query: 84 LMTGKYPIHIGMQ-HGVILEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
L+TG+Y GM+ + +P+ GLP+ E + +++AGY T +GKWH+G +
Sbjct: 112 LLTGRYQARFGMEVNAAHSPDDPYMGLPVEELTFAKRMQQAGYKTAVMGKWHMGSHPN-F 170
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEA 200
P RGFD +G+ G DY+ S K + Y R+ N+ + +T + EA
Sbjct: 171 HPNNRGFDEFFGFLGGGHDYFPESVKVSSAEYSIALSRNGKPAQLNEYL---TTAISKEA 227
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A V A +P +Y+A+ A H+ P QA ++++AK+ I+D +RRTYA M+ +
Sbjct: 228 ARFVSA---TEQPFMMYVAYNAPHS-----PLQATEQDLAKYQHIADLDRRTYAAMIDSM 279
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH-------SNKGSNHPLRGMKSTPW 313
D+ VG ++AAL++ L N+++ F++DNG G++ S+ +N P R K +
Sbjct: 280 DQGVGRIVAALKQSEKLHNTLIFFLSDNG----GVYPEEWMPDSDWANNAPFRRGKVSLT 335
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
+GG+ P + + L D T A A +++ LDGVN
Sbjct: 336 EGGVHVPFIAHWPAGFSLKSEYAGLVSSLDIAATSLALARADVSQAKLDGVN 387
>gi|449137658|ref|ZP_21772978.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448883711|gb|EMB14224.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 810
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 132/381 (34%), Positives = 196/381 (51%), Gaps = 37/381 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
R F + L A + A ++P+I++I+ADDLG++DV F+G +IPTP +D LA
Sbjct: 4 RTKFLSSMLIGLLVLAASSVDAKERPNIVLIVADDLGYSDVGFNGCKEIPTPRLDELAGE 63
Query: 65 GLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEP-----W------GLPLTE 112
G++ Y C+PSR+ L+TG+Y G EG P W G+PL+E
Sbjct: 64 GVVFTNGYASHPYCSPSRAGLLTGRYQQRFGH------EGNPEPDPQWHGDDTPGMPLSE 117
Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
L LKEAGY T AIGKWHLG + + P RGFD +G+ G Y+ + +P
Sbjct: 118 TTLADALKEAGYVTGAIGKWHLGDAKPFW-PNRRGFDEWFGFSGGGLSYWGDLGR--MDP 174
Query: 173 YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
G+ R + VD K++ Y TD ++ A+ I H ++ P FLYLA+ A HA P
Sbjct: 175 LLGV-HRGDEPVDRKSL-TYLTDDFSTEAVKFIQRH-ETDPFFLYLAYNAPHA-----PD 226
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
QA + K I R Y MV+ +DE +G V+ +R+ G+ +N++++F +DNG
Sbjct: 227 QATRAHLEKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLDDNTLIVFYSDNGG-- 284
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAA 351
N P RG K ++GG+R + W L + S + + D PT+ AA
Sbjct: 285 ---RREHAMNLPYRGHKGMLFEGGIRVPFLVSWPGTLPGGVREQSPVTAL-DVFPTVLAA 340
Query: 352 AGIEIN-DTSLDGVNQWDVLT 371
AG++ + + LDG N LT
Sbjct: 341 AGLDPSKNEKLDGQNLLAGLT 361
>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
Length = 543
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 130/357 (36%), Positives = 194/357 (54%), Gaps = 31/357 (8%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A +P+I++I+ADDLG++DV F+G +IPTP++D LA +G++ Y C+PSR+ L
Sbjct: 41 AKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100
Query: 85 MTGKYPIHIGMQHGVILEGEP-W------GLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG++ G HG E + W G+PL+E L LKEAGY T AIGKWHLG
Sbjct: 101 LTGRHQQRFG--HGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGDA 158
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYY-DHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
+ + P RGFD +G+ G Y+ D K +P G+ R + VD KT+ + TD
Sbjct: 159 KPFW-PNRRGFDEWFGFSGGGFSYWGDLGMK---DPLLGV-HRGDEPVDPKTL-THLTDD 212
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
++ A+ I H +++P FLYLA+ A HA P A + K I R Y M
Sbjct: 213 FSTEAVKFIQRH-ETEPFFLYLAYNAPHA-----PDHATRAHLQKTAHIEYGGRAVYGAM 266
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V+ +DE +G V+ +R+ G+ EN++++F +DNG N P RG K ++GG
Sbjct: 267 VAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFEGG 321
Query: 317 MRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
+R + W ++ K S + + D PT AAAG++ + + LDG N VLT
Sbjct: 322 IRVPFLVSWPGTVRSGMKEESPITAL-DLFPTALAAAGMDPSQNDKLDGQNLLPVLT 377
>gi|319954018|ref|YP_004165285.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422678|gb|ADV49787.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 434
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 123/337 (36%), Positives = 179/337 (53%), Gaps = 23/337 (6%)
Query: 34 IILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGKYPIH 92
+IL DD GW DV F+G++ IPTPN+D LA G+I + YV C+PSR+ L+TG+Y
Sbjct: 1 MILTDDQGWADVGFNGATDIPTPNLDRLASEGVIFSNGYVSHPYCSPSRAGLLTGRYQAR 60
Query: 93 IGMQHGVILEGE---PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
G + +G+ G PL+EK++ + LKE GY T AIGKWHLG ++Y P +GFD
Sbjct: 61 FGHDCNMPYDGKNDASVGTPLSEKMISEALKEQGYRTSAIGKWHLGDHPDLYPPA-QGFD 119
Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
+G+ G +Y+ S Y+ N +V + Y TD +T AI I + +
Sbjct: 120 HWFGFPGGGMNYWGESKNEIQTIYR------NRKVVPEEELTYLTDDFTTEAIRFITQKD 173
Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
+ KP F+YLA+ A HA P QA E + K I R YA MV+ +D +VG + +
Sbjct: 174 E-KPFFMYLAYNAPHA-----PDQATKEYLEKTKHIEYAGRSVYAAMVNAVDANVGKIDS 227
Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
L +G+ EN+I++F++DNG N P RG K ++GG++ I P
Sbjct: 228 TLIANGLKENTILVFLSDNGG-----RIEHADNRPYRGHKGMLFEGGIKVPFFITWPTKI 282
Query: 330 QTQKVSSELFHISDWLPT-LCAAAGIEINDTSLDGVN 365
+ + +E D PT L AA G + LDGV+
Sbjct: 283 KEKYAYNEPISSLDLFPTFLNAAGGKAKKERQLDGVD 319
>gi|449677596|ref|XP_004208885.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 193
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/152 (57%), Positives = 109/152 (71%), Gaps = 5/152 (3%)
Query: 39 DLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM--- 95
+LG+NDVSFHGS QIPTPNID +A G+ILN +YV +CTPSRSA+MTG+YPIH G+
Sbjct: 44 NLGFNDVSFHGSKQIPTPNIDKIAKEGVILNNYYVLPICTPSRSAIMTGRYPIHTGIFYH 103
Query: 96 QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
+ I WG+ L EK LPQYLK GY THAIGKWHLGFF + YTPT+RGFDS YGY+
Sbjct: 104 TYDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGYY 163
Query: 156 QGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
G DY+DHS + + GLD+ ++ +K
Sbjct: 164 GGQADYWDHSLAS--NGWWGLDLHYDTPSSSK 193
>gi|301311961|ref|ZP_07217883.1| putative arylsulfatase [Bacteroides sp. 20_3]
gi|423339483|ref|ZP_17317224.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
CL09T03C24]
gi|300830063|gb|EFK60711.1| putative arylsulfatase [Bacteroides sp. 20_3]
gi|409230864|gb|EKN23725.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
CL09T03C24]
Length = 452
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 142/421 (33%), Positives = 218/421 (51%), Gaps = 41/421 (9%)
Query: 6 KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
KY C L F+ A T A P P+I++IL DDLG+ D+S G+++ I TP+ID L
Sbjct: 3 KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62
Query: 64 NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
G+ N H + +PSR++L+TG+YP +G+ GVI + + WG L LLPQ L
Sbjct: 63 EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
K+ GY +GKW+LG TPT RGFD + G+ + D YY H + M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174
Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
R N+ ++D + ++T+++++ AI +++ K +P FLYLA+ A H P Q P
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226
Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
E E K + S PE+R ++ LD +VG V AL ++G LEN+I++F +DNG
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNG---- 282
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAA 352
G +N P RG K ++GG+R I+ W Q + V +SD PTLC
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLT 340
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNP----QKYYAALRVDDLKYVAG 408
+ ++ +DG++ +L + ++H + N Q YYA+ + D K +
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGQAYYAS-QYRDFKILQN 398
Query: 409 T 409
T
Sbjct: 399 T 399
>gi|241756057|ref|XP_002401379.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215508427|gb|EEC17881.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 287
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 111/312 (35%), Positives = 163/312 (52%), Gaps = 31/312 (9%)
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V +D+S+G VI AL K ML NSIV+F DNG G+ SN G N PLRG K+T W+GG
Sbjct: 2 VYEMDQSIGLVIEALHKRNMLGNSIVIFSTDNGGLPTGLGSNYGFNWPLRGFKATLWEGG 61
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAK 375
+RG A IWSP L+++ ++SS++ HI+DWLPTL +AAG +++ +DG++ W+ L +
Sbjct: 62 VRGAAFIWSPLLRRSGRISSQMMHITDWLPTLYSAAGGDVSRLGVIDGIDMWEALCEDLP 121
Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVL 435
+ R EIL NID+ N +AL V + K + P E+
Sbjct: 122 SPRHEILLNIDSAANS----SALIVGNRKVF-------------------LTSSFPDELR 158
Query: 436 YSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL---ELREFARVRCNYDN 492
Y ++ + ++ K+ + LR EK+ R+ V C+ N
Sbjct: 159 YHRSEVP----GGTRPVEGLDEMMLKSRTGTVLRDFYKVEKLSIRPNWRKEVVVNCSQYN 214
Query: 493 KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA 552
G++ + P F+I DPCE NLA + +L +KLA Y +TM+PP NKP D R
Sbjct: 215 PGSNFVANSSPYYFDIQRDPCELENLAAVNATEVDELMEKLAAYAATMIPPANKPMDPRG 274
Query: 553 DPARWNNIWVPW 564
P + +W PW
Sbjct: 275 LPKYNHGLWAPW 286
>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
Length = 553
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 138/400 (34%), Positives = 195/400 (48%), Gaps = 56/400 (14%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
+ P+I++ILADDLG+ND+S G +PTPNID++A G Y A C PSR+ +MT
Sbjct: 69 RPPNIVVILADDLGFNDISHFGGGIVPTPNIDSIARGGANFTSAYSGTAACAPSRAMIMT 128
Query: 87 GKYPIHIGMQ----------------------HGVILEGE------PW---GLPLTEKLL 115
G+Y G + H ++++ E P+ GLP +E L
Sbjct: 129 GRYGTRTGFEFTPTPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFREQGLPGSEITL 188
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL---QDYYD-HSCKATFE 171
+ LK GY IGKWHLG E + P +GFD GL +D D + K F+
Sbjct: 189 AEALKPKGYHNIHIGKWHLGNAPE-FLPNAQGFDESVMLESGLFLPEDSPDVVNAKLPFD 247
Query: 172 PYQGLDMRHNMQVDNKTIGI-------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVH 224
P + MQ G Y TD YT+ AI I E N+++P FLYLAH VH
Sbjct: 248 PIDQF-LWARMQYATSYNGSAWFEPKGYLTDFYTDEAIKAI-EANRNRPFFLYLAHWGVH 305
Query: 225 AGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
P QA + I D R YA M+ LD SVG V+ +L+++G+ EN++V+F
Sbjct: 306 T-----PLQASKADYDALSHIEDERLRVYAAMIVALDRSVGRVLQSLKENGLEENTLVIF 360
Query: 285 MADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISD 343
+DNGAP G N P RG K T ++GG+R A W + + ++ + H+ D
Sbjct: 361 SSDNGAP--GYIGLPDVNKPYRGWKLTFFEGGIRVPFFAKWPARIPAGTERTTPVAHL-D 417
Query: 344 WLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEIL 382
PT+ AAAG E+ D +DG++ +G K I
Sbjct: 418 MFPTIVAAAGGELPADRVIDGIDLLPYAARGEKPAPRPIF 457
>gi|255014751|ref|ZP_05286877.1| putative secreted sulfatase precursor [Bacteroides sp. 2_1_7]
gi|410102840|ref|ZP_11297765.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
gi|409237967|gb|EKN30762.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
Length = 452
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 142/421 (33%), Positives = 218/421 (51%), Gaps = 41/421 (9%)
Query: 6 KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
KY C L F+ A T A P P+I++IL DDLG+ D+S G+++ I TP+ID L
Sbjct: 3 KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62
Query: 64 NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
G+ N H + +PSR++L+TG+YP +G+ GVI + + WG L LLPQ L
Sbjct: 63 EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
K+ GY +GKW+LG TPT RGFD + G+ + D YY H + M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174
Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
R N+ ++D + ++T+++++ AI +++ K +P FLYLA+ A H P Q P
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226
Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
E E K + S PE+R ++ LD +VG V AL ++G LEN+I++F +DNG
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNG---- 282
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAA 352
G +N P RG K ++GG+R I+ W Q + V +SD PTLC
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLT 340
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNP----QKYYAALRVDDLKYVAG 408
+ ++ +DG++ +L + ++H + N Q YYA+ + D K +
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGQAYYAS-QYRDFKILQN 398
Query: 409 T 409
T
Sbjct: 399 T 399
>gi|325107240|ref|YP_004268308.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324967508|gb|ADY58286.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 469
Score = 184 bits (466), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 139/389 (35%), Positives = 198/389 (50%), Gaps = 38/389 (9%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F L C L + + P P+ ++I DDLG+ D+ G I TP +D +A G
Sbjct: 10 FGLLFCNLPGDGQAAEKSTP--PNFVVIFCDDLGYGDLGCFGHPSISTPQLDQMADEGQR 67
Query: 68 LNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAG 123
Q YV A +CTPSR+AL+TG+ PI GM + V+ GLP E + + LK+ G
Sbjct: 68 WTQFYVGASVCTPSRAALLTGRLPIRNGMMSAKRRVLFPDSKGGLPAQELTIAELLKKNG 127
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGY--------WQGLQDYYDHSCK-----ATF 170
YAT AIGKWHLG R + PT GFDS++G +G +Y + + +
Sbjct: 128 YATAAIGKWHLG-HRSEFLPTEHGFDSYWGIPYSNDMDREKGHPNYRQKAAEDPHYFSPI 186
Query: 171 EPYQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
E YQ + + N +V + ++ T YTE AI I + N+ +P FLYLAH H
Sbjct: 187 EQYQ-VPIIDNKEVVERPANQHTITKRYTEKAIEFI-KGNREQPFFLYLAHNLPHI---- 240
Query: 230 EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
P A D FL+ S R Y +V+ +D VG ++ L++ G+ EN++V+F +DNG
Sbjct: 241 -PLYASD----SFLETSP--RGIYGDVVTEIDHGVGQILDTLKETGLAENTLVVFTSDNG 293
Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
P ++ GS LR K ++GGMR +W P + KV +EL D LPT C
Sbjct: 294 -PWLLFDTHGGSAGLLREGKGATFEGGMRVPTVMWWPGTLEP-KVQTELGTTMDLLPTFC 351
Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTK 377
+ AG+ I D LDG + L KG T+
Sbjct: 352 SLAGVGIPEDRVLDG-HDLSSLLKGETTE 379
>gi|156368526|ref|XP_001627744.1| predicted protein [Nematostella vectensis]
gi|156214663|gb|EDO35644.1| predicted protein [Nematostella vectensis]
Length = 157
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 79/139 (56%), Positives = 104/139 (74%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A K PHI+ I DDLGW+DV +H + + TPNID LA G++L +YVQ +CTP+R +
Sbjct: 18 AAKPPHIVFIFVDDLGWSDVGYHNITDLKTPNIDRLAGEGVVLENYYVQPICTPARGTFL 77
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
+G+YPIH G+QH I E EP+GLPL LLPQ LK+AGY+THA+GKWHLGFF + YTP +
Sbjct: 78 SGRYPIHTGLQHSNIHETEPFGLPLDFTLLPQKLKKAGYSTHAVGKWHLGFFEKEYTPLY 137
Query: 146 RGFDSHYGYWQGLQDYYDH 164
RGFD+ +GY+ G +D+Y H
Sbjct: 138 RGFDTFFGYYSGSEDHYTH 156
>gi|262383795|ref|ZP_06076931.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294693|gb|EEY82625.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 452
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 142/421 (33%), Positives = 220/421 (52%), Gaps = 41/421 (9%)
Query: 6 KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
KY C L F+ A T A P P+I++IL DDLG+ D+S G+++ I TP+ID L
Sbjct: 3 KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62
Query: 64 NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
G+ N H + +PSR++L+TG+YP +G+ GVI + + WG L LLPQ L
Sbjct: 63 EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
K+ GY +GKW+LG TPT RGFD + G+ + D YY H + M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174
Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
R N+ ++D + ++T+++++ AI +++ K +P FLYLA+ A H P Q P
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226
Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
E E K + S PE+R ++ LD +VG V AL+K+G LEN+I++F +DNG
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNG---- 282
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELF-HISDWLPTLCAAA 352
G +N P RG K ++GG+R I+ W + + E F +SD PTLC
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNRIRPAVRENFVMLSDMFPTLCDLT 340
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNP----QKYYAALRVDDLKYVAG 408
+ ++ +DG++ +L + +++ + N Q YYA+ + D K +
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVYWVRREGNSRYGGQAYYAS-QYRDFKILQN 398
Query: 409 T 409
T
Sbjct: 399 T 399
>gi|345317091|ref|XP_001516952.2| PREDICTED: arylsulfatase I-like, partial [Ornithorhynchus anatinus]
Length = 264
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 97/230 (42%), Positives = 139/230 (60%), Gaps = 12/230 (5%)
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y IH G+QH +I +P LPL + LPQ L+EAGY+TH +GKWHLGF+++ PT RG
Sbjct: 6 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 65
Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
FD+ G G DYY + +C G D+ V + G YST LY + A ++A
Sbjct: 66 FDTFLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGENVAWEQSGKYSTLLYAQRAGKILA 123
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
H+ +P+FLY+A AVH P Q+P E + ++ + + RR YA MV+ +DE+V N
Sbjct: 124 SHDPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRN 178
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
+ AL+K+G +NS+++F DNG +F + GSN PLRG K T W+GG
Sbjct: 179 ITRALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGG 224
>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 467
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 134/387 (34%), Positives = 193/387 (49%), Gaps = 43/387 (11%)
Query: 16 LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ- 74
F +FL TA ++P+I+ IL+DD G+ D F GS + TP +D LA + +Q YV
Sbjct: 15 FFKVSFL--TAQQQPNIVFILSDDAGYADFGFQGSKEFKTPELDKLAKKSIKFSQAYVSA 72
Query: 75 ALCTPSRSALMTGKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYAT 126
A+C PSR+ ++TGKY G + + L G+ GLPL + + YL++ GY T
Sbjct: 73 AVCGPSRAGILTGKYQQKFGFEENNVPGYMSTSGLVGDEMGLPLDQITIANYLQDLGYKT 132
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDY--YDHSCKAT-----FEPYQGLDMR 179
GKWH G + + PT RGFD YG+ G + Y YD S + E G +
Sbjct: 133 ALFGKWHQG-NADRFHPTKRGFDEFYGFRGGARSYMPYDDSNPLSKNEDRLERGFGNFLE 191
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
H Y TD AI+ I NK P F+YL+ AVH P +A E++
Sbjct: 192 HEG---------YLTDELAHEAISFI-NRNKKHPFFIYLSFNAVHT-----PMEATAEDL 236
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
+F + +R+T A M +D ++G V+ L G+ N++++F DNG PS +N
Sbjct: 237 EQFPHLKG-KRKTLAAMTLAMDRAIGTVLKELETQGLSNNTLIVFTNDNGGPS---DAND 292
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPT-LCAAAGIEI 356
N PL G K+ +GG+R V + S W KQ QK + + IS D PT L AA+G
Sbjct: 293 SDNSPLSGTKANHLEGGIR-VPFLMS-WPKQLQKNKNYNYPISTLDLFPTFLSAASGNSS 350
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
LDGVN L + + E+L+
Sbjct: 351 AMEELDGVNLLPYLKEENTERPHELLY 377
>gi|149196558|ref|ZP_01873612.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149140238|gb|EDM28637.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 443
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 118/346 (34%), Positives = 173/346 (50%), Gaps = 21/346 (6%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A + + ++ILADDLG+ DV F GSSQI TP+ID LA +G+I +Q YV ++C PSR+ L
Sbjct: 5 ASNRLNFVLILADDLGYGDVGFTGSSQIKTPHIDRLAKDGVIFSQGYVSSSVCGPSRAGL 64
Query: 85 MTGKYPIHIGMQHGVI-----LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
MTGK + G + + + E GLP++EK L L E GY +GKWHLG +E
Sbjct: 65 MTGKNQVRFGFDNNLTNYLPQFKDEFHGLPISEKTLATRLAEKGYVNGLVGKWHLG-DKE 123
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
Y P RGF +GY G Y+ +G D Y TD +
Sbjct: 124 QYHPLKRGFHEFWGYLGGGHHYFRSKPNG-----KGYDCPIECNYKTPQPISYITDDKGD 178
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
++ I H K +P FL+ + A HA P A +E++ + I +RR Y M
Sbjct: 179 ECVDFIRRH-KDEPFFLFASFNAPHA-----PMHAKEEDLKLYSHIEGEKRRAYCAMTHN 232
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
LD +VG ++A L+ + +N++V+F++DNG P+ +N N PLRG K +GG+R
Sbjct: 233 LDLNVGKIVAELKAQNIYDNTVVVFLSDNGGPT---PNNASLNAPLRGGKGILLEGGIRV 289
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
+ P Q ++ D + AG + +DGVN
Sbjct: 290 PFTMTWPAKIQAGSSFNKAVSSIDLAASFTELAGAKFRAEEMDGVN 335
>gi|149198313|ref|ZP_01875359.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
gi|149138609|gb|EDM27016.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
Length = 476
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 134/376 (35%), Positives = 190/376 (50%), Gaps = 29/376 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
+P+IIII DD G+ D+S G + + TP ID +A G L YV A +CTPSR+ALMTG
Sbjct: 31 QPNIIIIFTDDQGYADLSCFGGTHVSTPRIDQMAAEGAKLTSFYVAAPVCTPSRAALMTG 90
Query: 88 KYPIHIGMQHG----VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
YP I M G V+L G+ GL E + + LK GY T GKWHLG E + P
Sbjct: 91 TYPKRIDMARGSNFVVLLAGDKKGLNPKEITIAEVLKAVGYKTGMFGKWHLGDQPE-FLP 149
Query: 144 TFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
T +GFD +G Y + Y+ F LD +++D Y T +TE A
Sbjct: 150 TRQGFDEFFGLPYSHDIHPYHPQQSHFKFPSLPLLDGEEVIEMDPD--ADYLTKRFTERA 207
Query: 202 INVIAEHNKSKPMFLYL----AHLAVHAGNTYEPFQAPDEEVAKFLDISD-PERRTYAGM 256
+ I E NK +P FLY+ H +HA Y P+E AK ++ + +T +
Sbjct: 208 VQFI-EKNKDQPFFLYMPHPIPHTPLHASPAYMK-DLPEEIKAKLSKENNYIDYKTRTQL 265
Query: 257 VSR----LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
+SR +D SVG ++ AL+K+G+ +N+IV+F +DNG P+ G + PLRG K+
Sbjct: 266 LSRAIGEVDWSVGQILDALKKNGIDDNTIVIFSSDNG-PAIGKAT------PLRGRKNQT 318
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLT 371
+GGMR I P + E+ D LPT AG EI +D +DG + W V+
Sbjct: 319 LEGGMRVPTVIHWPGKIPAGIANDEIMTTMDLLPTFAKLAGAEIPSDRVIDGKDIWAVIR 378
Query: 372 KGAKTKRSEILHNIDN 387
K+ ++ N
Sbjct: 379 GEEKSPHQAFFYHKGN 394
>gi|359400556|ref|ZP_09193534.1| sulfatase [Novosphingobium pentaromativorans US6-1]
gi|357597898|gb|EHJ59638.1| sulfatase [Novosphingobium pentaromativorans US6-1]
Length = 493
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 130/379 (34%), Positives = 187/379 (49%), Gaps = 41/379 (10%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
+P++++IL DDL W DVS +G+S I TPNIDA+ G + YV A +C+ SR+ L++G
Sbjct: 44 RPNVVVILVDDLAWTDVSANGASTIQTPNIDAIGAAGATFSSGYVTASVCSVSRAGLLSG 103
Query: 88 KYPIHIGMQHGVILEG---EPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+ P G + + G + GLP+ L + LK GY T GKWH G E Y PT
Sbjct: 104 RNPTTFGFMYNITERGAVDKGAGLPVDVPTLAERLKPLGYDTAVFGKWHQGSTPEFY-PT 162
Query: 145 FRGFDSHYGYWQGLQDYYDH------SCKATFEPYQ------------GLDMRHNMQVDN 186
RGFD +G+ G Y + +F+ YQ G DM Q
Sbjct: 163 RRGFDEFFGFLAGESIYATPDTPGIVTTPTSFDKYQPKEREGAQRLFQGPDMEPVAQPHK 222
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
Y T+ TE A++ I K P FLY+A+ A H P Q P + +I
Sbjct: 223 -----YLTEEITEHAVDFIDRKADGKDPFFLYVAYNAPHW-----PLQVPRTYYDRHSEI 272
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP-SFGIHSNKGSNHP 304
+DP RRTY M+ LD VG ++ AL G+ E+++V+F++DNG P FG + HP
Sbjct: 273 ADPARRTYVAMIDALDAGVGQILKALDDKGLREDTLVVFLSDNGCPVQFGYCTQ---THP 329
Query: 305 LRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLD 362
G K T +GGMR A W + Q++++ + + D +PT+ A + LD
Sbjct: 330 WGGGKFTYLEGGMRVPFMASWPGHIAAGQRIATPVSSL-DIVPTVLEAVSPGKTLPEELD 388
Query: 363 GVNQWDVLTKGAKTKRSEI 381
GV+ D L A +RS I
Sbjct: 389 GVDLLDTLAGTAPKERSFI 407
>gi|196232399|ref|ZP_03131252.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196223471|gb|EDY17988.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 452
Score = 182 bits (462), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 135/370 (36%), Positives = 199/370 (53%), Gaps = 39/370 (10%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ ALTC+ DA KP+II+ILADDLG + S G+ I TPNID L + GL
Sbjct: 12 FAVALTCSAFAVDA-------SKPNIIVILADDLGRGEYSDFGTKDIQTPNIDRLCHEGL 64
Query: 67 ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEP---WG-LPLTEKLLPQYLKE 121
+ Y + +C+PSR+ALMTG +P +G+ GVI E + WG L +L PQ LK
Sbjct: 65 TFDNFYANSCVCSPSRAALMTGCFPNRVGVP-GVIREEDQDNNWGWLAQGVRLFPQLLKP 123
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLDMRH 180
AGY + +GKWHLG TPT RGFD G+ + DY+ H Q MRH
Sbjct: 124 AGYQSAIVGKWHLG-INSPNTPTERGFDRFDGFLGDMMDDYWTH-----LRHGQNF-MRH 176
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEV 239
N +V + ++TD++T+ A+ + E +KSK P FLYLA+ A H +P Q + +
Sbjct: 177 NQEVVSPE--GHATDVFTDWALRYLDERSKSKEPFFLYLAYNAPH-----DPIQPKPDWL 229
Query: 240 AKFLDISDP----ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
AK + +P +R ++ LD+ +G V+A L + G+ +N+IVLF +DNG G+
Sbjct: 230 AK-VKAREPGLSEKRAKLVALIEHLDDGIGKVLAKLDETGLAKNTIVLFSSDNG----GV 284
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
SN+ +N P R K+ ++GG+R + P S+ + + D LPT G +
Sbjct: 285 LSNEANNGPWRSGKTHMYEGGLRVPCMVRWPGHVAPGTHSNRVALLMDILPTALEIVG-Q 343
Query: 356 INDTSLDGVN 365
++DGV+
Sbjct: 344 TPPPNIDGVS 353
>gi|402824519|ref|ZP_10873876.1| sulfatase family protein [Sphingomonas sp. LH128]
gi|402261924|gb|EJU11930.1| sulfatase family protein [Sphingomonas sp. LH128]
Length = 455
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 127/389 (32%), Positives = 186/389 (47%), Gaps = 35/389 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
+KP+ I+IL DDLG+ D+ G + IPTPN++ +A G++L +Y Q +CTPSR+ L+T
Sbjct: 30 RKPNFIVILCDDLGYGDIEPTGGTAIPTPNLNRMAREGMVLTDYYAPQNICTPSRAGLLT 89
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+YPI G+ G IL+ + LPL E +P+ L+ AGY + GKWHLG + PT
Sbjct: 90 GRYPIRTGLAWGAILQPDKRILPLDEVTIPKALQPAGYRSGLFGKWHLGHLGPDWQPTRH 149
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
GFD YG Y H K + + + + Y E A+N I
Sbjct: 150 GFDRFYGI------PYSHDMKP-LQIFTDTSPDRRASEEEPVMAELQQRFYEE-AVNFI- 200
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
E NK +P F+ LA A H N P D+ + Y +V +D VG
Sbjct: 201 EANKDQPFFVELALSAPHLANYPNPRHRDDKMAS-----------AYGEVVEEIDAIVGA 249
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK-STPWDGGMRGVAAIWS 325
++ LR G+ +++VLF +DNG P + +GS+ PLR K +DGG R W
Sbjct: 250 LMVKLRTLGLDRDTLVLFTSDNG-PWY-----EGSSGPLRDRKGGAGYDGGYRVPCIAWQ 303
Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHN 384
P + D+LPT CA AG + +DG + +L +GA++ +L
Sbjct: 304 PGTVPAGLRCDSIAMGIDFLPTFCAMAGTPVPAGPVIDGKDISQILLRGAESPHESLLLF 363
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNG 413
D + +R KYVA T +G
Sbjct: 364 DD------EEVTGIRTQRWKYVAQTYYHG 386
>gi|343086301|ref|YP_004775596.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354835|gb|AEL27365.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 481
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 129/386 (33%), Positives = 191/386 (49%), Gaps = 29/386 (7%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
++C F +KP+II+I ADDLG+ D+ G I TPN+D +A+ G
Sbjct: 14 ISCIGFFVTTAEARQGGEKPNIIVIFADDLGYGDLGVFGHPSIKTPNLDKMAFEGQKWTN 73
Query: 71 HYVQA-LCTPSRSALMTGKYPIHIGMQ---HGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
YV A +CTPSR+ L+TG+ PI GM V+ GLP +E + + LK GY T
Sbjct: 74 FYVAAPVCTPSRAGLLTGRLPIRSGMSSDDRRVLFPDSDGGLPQSEITIAKALKGNGYQT 133
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQ-----DYYDHSCKATFEPYQGLDMR 179
AIGKWHLG + Y PT GFDS++G Y + D++ + F+ Y M+
Sbjct: 134 AAIGKWHLG-HKSPYLPTDHGFDSYFGIPYSNDMDKIDKTDHFTLTDNEKFDAYNVPLMK 192
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
+ T T YTE A++ I K +P F+YLAH H P
Sbjct: 193 DAEIAERPTDQRTLTKRYTEEAVSKIKAF-KDEPFFIYLAHNLPH---------IPLFRS 242
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
A F D S Y ++ +D SVG +++ L++ G+ EN++V+F +DNG P ++
Sbjct: 243 AAFKDQS--LGGIYGDVIEEIDWSVGQILSTLKEEGIAENTLVVFTSDNG-PWHVFKTHG 299
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAAGIEIND 358
G+ LRG K ++GGMR W W Q + V ++ D LPT CA +G E+ D
Sbjct: 300 GTAGLLRGAKGGTFEGGMREPTVFW--WPAQIKPGVVMDMGTTMDLLPTFCAISGTELPD 357
Query: 359 TSL-DGVNQWDVLTKGAKTKRSEILH 383
+ DG + +L K++R + +
Sbjct: 358 DRIYDGYDISGLLKGTGKSERETVFY 383
>gi|171910115|ref|ZP_02925585.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 460
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 130/368 (35%), Positives = 184/368 (50%), Gaps = 37/368 (10%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSR 81
+ A + P+I+II ADDLG+ D+ +GS I TP++D +A GL YV + +CTPSR
Sbjct: 23 SQAADRPPNIVIIFADDLGYGDLGCYGSPTIATPHLDQMAAEGLRFTDFYVASEVCTPSR 82
Query: 82 SALMTGKYPIHIGM--QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
+AL+TG+YP+ GM + V+ GLP E LP+ LK GYAT +GKWHLG E
Sbjct: 83 AALLTGRYPVRSGMYGKRRVLFPNSTGGLPAGEITLPEALKARGYATAHVGKWHLG-IHE 141
Query: 140 VYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD------MRHNMQVDNKTIGI 191
P +GFD +G Y + D +T P +D +R+ V+ +
Sbjct: 142 GSRPLDQGFDQSFGLPYSNDMDARPDLPKGSTGSPTPPIDGWNVPLLRNGEVVEKPADQV 201
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
+ T YTE A+ I + KS+P FLY+AH H P A + +
Sbjct: 202 HLTGHYTEEAVKFI-QQKKSQPFFLYMAHSFPHV-----PLFASENFKGR------SRAG 249
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y V LD SVG V+ LRK G+ EN++V F +DNG P + + GS PL+ K +
Sbjct: 250 IYGDTVEELDWSVGQVLQTLRKEGLSENTLVFFTSDNG-PWLIMGAQGGSAGPLKDGKGS 308
Query: 312 PWDGGMRGVAAIW-----SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
W+GGMR W P + +T S +LF S A AG E+ D +DG +
Sbjct: 309 HWEGGMRVPGIAWWPGHIKPGVSRTPANSMDLFVTS------VALAGAEVPKDRVIDGTD 362
Query: 366 QWDVLTKG 373
+L +G
Sbjct: 363 IRPLLLEG 370
>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
Length = 474
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 116/347 (33%), Positives = 176/347 (50%), Gaps = 23/347 (6%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP+I+ IL+DD G+ D F GS++IPTPN+D LA G++ Q YV A +C PSR+ L+TG
Sbjct: 35 KPNIVFILSDDAGYADFGFQGSTEIPTPNLDQLAQEGVVFKQAYVSASVCGPSRAGLLTG 94
Query: 88 KYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
KYP G + + G+ G+ L + + YL E GY T IGKWH G +
Sbjct: 95 KYPQRFGFEENNVPGYMSSSGATGDDMGMRLDQLTMANYLAERGYRTSLIGKWHQG-NED 153
Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
+ P RGFD +G+ G + Y+ + + R+ +Y TD
Sbjct: 154 RFHPLKRGFDEFFGFRGGARSYFPFTQAHPSSRREDFLERNFNNYGESP--LYLTDALAN 211
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
I I + NK +P F +L+ A H P +A ++ +F ++ ER+ YA M+
Sbjct: 212 ETIEFI-KRNKHQPFFTFLSLSAPHY-----PMEAEKSDLDQFPHLTG-ERKIYAAMMLN 264
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
+D ++G V+ L++ G+ EN++V+F DNG PS N N PL G K+ +GG+R
Sbjct: 265 MDRAIGRVLQTLQQEGLSENTLVIFTNDNGGPS---DHNGSINLPLSGTKANLLEGGIRV 321
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVN 365
+ P + + + D LPT AAAG ++ T DGV+
Sbjct: 322 PMIMRWPGVTKPGSSYDPMVSTLDLLPTFYAAAGGDVRQVTDWDGVD 368
>gi|325108643|ref|YP_004269711.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324968911|gb|ADY59689.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 484
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 129/399 (32%), Positives = 192/399 (48%), Gaps = 61/399 (15%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMTGK 88
P+I++I++DD G+ND+ S I TP++D LA G L YV CTPSR++L+TG+
Sbjct: 28 PNILLIVSDDQGYNDLGVLNSDLI-TPHLDRLAAEGTRLTDFYVAWPACTPSRASLLTGR 86
Query: 89 YPIHIGMQHGVILEGEPWG-----------------LPLTEKLLPQYLKEAGYATHAIGK 131
YP G+ + E +G + EKLLP+YLK+ GY + GK
Sbjct: 87 YPQRNGIYDMIRNEAPDYGYKYKPAEYEVSFERIGGMDQREKLLPEYLKKLGYTSAIFGK 146
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
W LG + + PT RGFD YG+ DY+ H E Y M + + G
Sbjct: 147 WDLGSLKR-FLPTNRGFDEFYGFVNTGIDYFTH------ERYGVPSMFRQTSLTEEDRGE 199
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQAPDEEVAKFLDIS 246
Y+T+L+ A+ + S+P LYL A H ++ +P QAP+E A + +
Sbjct: 200 YATELFKREALAFLDRAEASEPFLLYLPFNAPHNSSSLDPRIRSTVQAPEEYKAMYPAV- 258
Query: 247 DPERRT---------------------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
DPE RT Y V+ +D ++G V+ L G+L+N+IV+F
Sbjct: 259 DPEYRTVNRYRYGNPAQVVTPVARQRDYRAAVTCMDAAIGEVLGKLEDKGVLDNTIVIFF 318
Query: 286 ADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS-DW 344
+DNG S N PLRG K+ W+GG+R + P +++ F S +
Sbjct: 319 SDNGG------SGGADNSPLRGHKAQVWEGGIRVPCLVRWPDGGVPAGATNDSFLSSLEL 372
Query: 345 LPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEIL 382
+P+L AAG I D +DG N W LT+ A + R+E+
Sbjct: 373 VPSLVKAAGGTIPEDIVMDGRNWWPALTEAASSPRNEMF 411
>gi|32477899|ref|NP_870893.1| iduronate-sulfatase or arylsulfatase A [Rhodopirellula baltica SH
1]
gi|32448456|emb|CAD77971.1| iduronate-sulfatase or arylsulfatase A [Rhodopirellula baltica SH
1]
Length = 1012
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 140/419 (33%), Positives = 202/419 (48%), Gaps = 33/419 (7%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
T KP+ I+IL DD G+ D+S G+ + TP ID +A G L YV A +CTPSR+
Sbjct: 565 TAETTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRA 624
Query: 83 ALMTGKYPIHI----GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
LMTG YP I G GV+L G+P GL E + + LK AGY T GKWHLG
Sbjct: 625 GLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQP 684
Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
E + PT +GFD +G Y + ++ F P L +++D + T
Sbjct: 685 E-FLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDA--DFLTKR 741
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPF-QAPDEEVAKFLDISD------P 248
TE A++ I E NK +P FLYL H HA + PF + ++V ++ D
Sbjct: 742 LTEQAVSFI-ERNKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYAT 800
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
+ ++ +D SVG ++ ALR +G+ E ++VLF +DNG P ++++ G LRG
Sbjct: 801 RANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGE---LRGH 857
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
K T ++GGMR + P + EL D LPT AG I D +DG + W
Sbjct: 858 KGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIW 917
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
L +T ++ N AA+R K NNG + + Y D +N++
Sbjct: 918 PTLKGETQTPHDAFFYHRGN------QLAAVRSGKWKLHV---NNGVAKQLY-DLENDL 966
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 117/400 (29%), Positives = 173/400 (43%), Gaps = 71/400 (17%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
+ A + P++++I DDLG+ D+ +G++++ TPNID LA G + H A+CTPSR
Sbjct: 33 SVAAERPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSR 92
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWG-LPLTE--------KLLPQYLKEAGYATHAIGKW 132
L+TG+YP+ G+ WG LP T K + + K GYAT +GKW
Sbjct: 93 YGLLTGQYPVRAMGGQGI------WGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKW 146
Query: 133 HLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD----YYDHSCKATF-- 170
HLGF E P GFD ++G + + D YD S +
Sbjct: 147 HLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGG 206
Query: 171 ---EPYQGLDMRHNMQVDNKTIG------IYSTD----LYTEAAINVIAEHNKSKPMFLY 217
P +++ N+ G IY + L TE A+ I E K++P FLY
Sbjct: 207 KPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE-KKNEPFFLY 265
Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
A +H T P +F S + Y V LD VG ++ +L +G+
Sbjct: 266 FATPNIHHPFTPAP---------RFKGTS--QCGLYGDFVHELDWMVGEIVQSLEDNGLT 314
Query: 278 ENSIVLFMADNG-----APSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQT 331
+N++VLF +DNG A I + N L G K W+GG R + A W +K
Sbjct: 315 DNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAG 374
Query: 332 QKVSSELFHISDWLPTLCAAAGIEINDTSL-DGVNQWDVL 370
+ S +L D T A E+ + D +N L
Sbjct: 375 TQ-SDQLISQVDLFATFSALTEQEMPSSEQKDSINMLPAL 413
>gi|149199999|ref|ZP_01877025.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149136872|gb|EDM25299.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 512
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 139/401 (34%), Positives = 202/401 (50%), Gaps = 56/401 (13%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
T A K+P+II+I ADD+G++DV +HG+ +I TPNID++A G+ +Q YV A +C PSR+
Sbjct: 15 TFADKQPNIILIFADDMGYDDVGYHGNKRIITPNIDSIAEQGVQFSQGYVSASVCGPSRA 74
Query: 83 ALMTGKYPIHIGMQHGVILEGEP-------WGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
L+TG Y G G P GLP ++ ++ + LK GY IGKWH+G
Sbjct: 75 GLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLPQSQSMISEELKTLGYTNGMIGKWHMG 134
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTIGI--- 191
F + P RG+D YG+ G DY + + + A + + M+ NK I
Sbjct: 135 FDMSL-RPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVF 193
Query: 192 -----------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
Y TDL+T+ A+N I + N KP FLYLA+ AVH P+Q +
Sbjct: 194 KEKGVKVVDENYLTDLFTDEAVNFI-DRNADKPFFLYLAYNAVH-----HPWQTTQHALD 247
Query: 241 KFLDISDPER-RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-GI-HS 297
K + D + +A MV +DE +G V+ L++ + +N+I++F++DNG+P GI HS
Sbjct: 248 KTAHLKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHS 307
Query: 298 NKGSNHP-----------LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DW 344
K N RG K ++GG+R I W +Q QK + IS D
Sbjct: 308 PKDPNRHRGGFTMSSTGIFRGYKGDTYEGGIRVPFCI--KWPQQIQKGTKYDMPISALDL 365
Query: 345 LPTLCAAAGIEINDT------SLDGVNQWDVLTKGAKTKRS 379
PTL AAG ND + DGV+ L + + KRS
Sbjct: 366 QPTLVKAAG--GNDKKPQKGFAYDGVDILPYLKEDKEIKRS 404
>gi|421614584|ref|ZP_16055638.1| arylsulfatase A [Rhodopirellula baltica SH28]
gi|408494671|gb|EKJ99275.1| arylsulfatase A [Rhodopirellula baltica SH28]
Length = 1012
Score = 181 bits (460), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 139/419 (33%), Positives = 202/419 (48%), Gaps = 33/419 (7%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
T KP+ I+IL DD G+ D+S G+ + TP ID +A G L YV A +CTPSR+
Sbjct: 565 TAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRA 624
Query: 83 ALMTGKYPIHI----GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
LMTG YP I G GV+L G+P GL E + + LK AGY T GKWHLG
Sbjct: 625 GLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQP 684
Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
E + PT +GFD +G Y + ++ F P L +++D + T
Sbjct: 685 E-FLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDA--DFLTKR 741
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPF-QAPDEEVAKFLDISD------P 248
TE A++ I E NK +P FLYL H HA + PF + ++V ++ D
Sbjct: 742 LTEQAVSFI-ERNKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYAT 800
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
+ ++ +D SVG ++ ALR +G+ E ++VLF +DNG P ++++ G LRG
Sbjct: 801 RANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGE---LRGH 857
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
K T ++GGMR + P + + EL D LPT AG D +DG + W
Sbjct: 858 KGTTFEGGMREPTVVRWPGQIPAEHQNDELMTAMDLLPTFAKLAGAATPTDRVIDGKDIW 917
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
L +T ++ N AA+R K NNG + + Y D +N++
Sbjct: 918 PTLKGETQTPHDAFFYHRGN------QLAAVRSGKWKLHV---NNGVAKQLY-DLENDL 966
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 117/420 (27%), Positives = 175/420 (41%), Gaps = 82/420 (19%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-L 66
+A+ +L + A + P++++I DDLG+ D+S +G++++ TPNID LA G
Sbjct: 19 LYAVALVMLLG-CGTSVAAERPPNVVLIFVDDLGYGDLSCYGATKLSTPNIDRLAAKGRR 77
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWG-LPLTEKL--------LPQ 117
+ H A+CTPSR L+TG+YP+ G+ WG LP T L + +
Sbjct: 78 FTDAHSASAVCTPSRYGLLTGQYPVRAMGGQGI------WGPLPTTSGLIIDTNTQTIGK 131
Query: 118 YLKEAGYATHAIGKWHLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD- 160
K GYAT +GKWHLGF E P GFD ++G + + D
Sbjct: 132 VFKNKGYATACLGKWHLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDD 191
Query: 161 -----------------------YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
+ + + + + G H + D KT L
Sbjct: 192 SIFGYDPNDPLVYGGKPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKT-----GTLL 246
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
TE A+ I E K +P FLY A +H T P +F S + Y V
Sbjct: 247 TERAVKWITE-KKDEPFFLYFATPNIHHPFTPAP---------RFKGTS--QCGLYGDFV 294
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNG-----APSFGIHSNKGSNHPLRGMKSTP 312
LD VG ++ +L +G+ +N++VLF +DNG A I + N L G K
Sbjct: 295 HELDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGV 354
Query: 313 WDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL-DGVNQWDVL 370
W+GG R + A W +K + S +L D T A E+ + D +N L
Sbjct: 355 WEGGHRVPLIAKWPGKIKAGTQ-SDQLISQVDLFATFSALTEQEMPSSEQKDSINMLPAL 413
>gi|355689580|gb|AER98880.1| galactosamine -6-sulfate sulfatase [Mustela putorius furo]
Length = 503
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 122/367 (33%), Positives = 180/367 (49%), Gaps = 26/367 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
APK P+++++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL
Sbjct: 8 APKPPNLVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAAL 67
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P E+LLP+ LK AGYA+ +GKWHLG
Sbjct: 68 LTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPAEERLLPELLKGAGYASKIVGKWHLGH- 126
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
R + P RGFD +G YD+ + Y+ +M ++ KT
Sbjct: 127 RPQFHPLKRGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL 186
Query: 194 TDLYTEAAINVIA-EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T LYT+ A++ + +H +P FLY A A HA P FL S R
Sbjct: 187 TQLYTQEALDFVQRQHAARRPFFLYWAIDATHA---------PVYASRPFLGTS--RRGR 235
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y V +D+SVG +++ LR + E++ V F +DNGA GSN P K T
Sbjct: 236 YGDAVREVDDSVGKILSLLRDLRIAEDTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTT 295
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
++GGMR A W P +VS +L I D T + AG+ +D +DG++ +
Sbjct: 296 FEGGMREPAIAWWPGRVPAGQVSHQLGSIMDLFTTSLSLAGLAPPSDRVIDGLDLLPAML 355
Query: 372 KGAKTKR 378
G T R
Sbjct: 356 WGQLTDR 362
>gi|430745365|ref|YP_007204494.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430017085|gb|AGA28799.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 476
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 128/376 (34%), Positives = 203/376 (53%), Gaps = 37/376 (9%)
Query: 3 WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
+A+ FFA+ +AF + +P++++ILADDLG+ D+S +G++ + TPNIDAL
Sbjct: 4 FAQWIFFAVVVLAGVPEAFGASPDAGRPNVLLILADDLGYGDLSSYGAADLKTPNIDALV 63
Query: 63 YNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQY 118
+G+ ++ Y + +C+P+R+AL+TG YP +G+ GVI + WG L LLPQ
Sbjct: 64 ASGVRFDRFYANSPVCSPTRAALLTGCYPDLVGVP-GVIRTHPDDSWGVLSPQAVLLPQV 122
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLD 177
LK AGY T +GKWHLG P+ RGFD +G+ + DY++H G++
Sbjct: 123 LKGAGYHTALVGKWHLG-LSGASLPSRRGFDLFHGFLGDMMDDYHNHR-------RHGIN 174
Query: 178 --MRHNMQVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQA 234
R + ++D K ++TDL+++ AI+ + E + +P FL LA+ H P Q
Sbjct: 175 YMRRDDREIDPKG---HATDLFSQWAIDFLNESKGQDRPFFLELAYNVPHT-----PVQP 226
Query: 235 PDEEVAKFLDIS---DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
P E + K + DP+R ++ LD +G V+AALR +G E ++++F +DNG
Sbjct: 227 PQEWLEKVRRRAPNLDPKRARLVALIEHLDHGIGQVLAALRANGQAERTLIVFTSDNGGQ 286
Query: 292 -SFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
G H P RG K ++GG+R + A+W P + S + D PTLC
Sbjct: 287 LDAGAHCG-----PYRGGKQDMYEGGIRVPLCAVW-PGRIEPGTRSDRVAVSMDLFPTLC 340
Query: 350 AAAGIEINDTSLDGVN 365
AAG + +DG++
Sbjct: 341 EAAGARF-EHPVDGIS 355
>gi|410635995|ref|ZP_11346602.1| arylsulfatase [Glaciecola lipolytica E3]
gi|410144672|dbj|GAC13807.1| arylsulfatase [Glaciecola lipolytica E3]
Length = 499
Score = 181 bits (459), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 129/372 (34%), Positives = 185/372 (49%), Gaps = 23/372 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
++P++I+I DD G+ DV G + TPN+D +A GL+L YV + LC+PSR+ LMT
Sbjct: 43 RQPNVIVIFTDDQGYEDVGVFGGDHVLTPNLDKMAEEGLMLTDFYVPSPLCSPSRAGLMT 102
Query: 87 GKYPIHIGMQHG----VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
G YP + M G V+L + GL E + + LK GYAT GKWHLG E +
Sbjct: 103 GSYPRRVDMATGSNFPVLLAADTKGLNPAEITIAEVLKSVGYATGIFGKWHLGDQPE-FL 161
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI-YSTDLYTEAA 201
PT +GFD +G H +A F+ + L + N V Y T TE A
Sbjct: 162 PTRQGFDEFFGLPYSHDIAPTHKRQAHFK-FPDLPLMENENVIELNPNPEYLTRRITERA 220
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAG--NTYEPFQAPDEEVAKFLDISDPE------RRTY 253
I+ I H + P FLYL H H + E D+ L+ISD + + Y
Sbjct: 221 IDFIERH-QDAPFFLYLPHPMPHGPLHVSQEFLNTVDKATLARLNISDADIAANSKKAMY 279
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
++ LD +VGN++ L++ + N+IV+F +DNG PS G ++ G+ L G K+
Sbjct: 280 PLVIGELDNAVGNILDTLKRLNLDSNTIVIFTSDNG-PS-GRRNSDGTERYLSGHKTLTQ 337
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTK 372
+GGMR +W P VS L D LPT+ AG ++ D +DG N W +L
Sbjct: 338 EGGMRVPTVVWGPSQIPANTVSQALTTSMDILPTVAFLAGADVPTDRVIDGKNIWPIL-- 395
Query: 373 GAKTKRSEILHN 384
A K +E H+
Sbjct: 396 -AAEKDAESPHD 406
>gi|323448512|gb|EGB04410.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 572
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 134/407 (32%), Positives = 196/407 (48%), Gaps = 58/407 (14%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIP-------TPNIDALAYNGLILNQHYVQALCT 78
A PHII I+ADDLGW+ V +H + TP +DAL G+ L +HY C+
Sbjct: 32 AAPPPHIIHIVADDLGWSYVDWHRDPRTADPPDTPLTPRLDALRRAGVTLERHYAFQYCS 91
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEP------W-GLPLTEKLLPQYLKEAGYATHAIGK 131
PSRSAL++G+ PI + +Q+ V P W G+P L + L+ GY T A+GK
Sbjct: 92 PSRSALLSGRNPISVNVQNVVPEVSNPEDPVGGWQGIPTAMTGLAEVLRRRGYRTEAVGK 151
Query: 132 WHLGFFREVYT-PTFRGFDSHYGYWQGLQDYY---DHSC-----------KATFE-PYQG 175
W +G + P RGFD+ GYW DY+ + +C ATF+ P
Sbjct: 152 WDVGMATAAHHHPRARGFDAWTGYWHHANDYWTFAEETCDGEAVQDLWHYNATFDGPATW 211
Query: 176 LDMRHNMQVDNKT-----IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE 230
L + N+T +Y L + A+ + EH+ + P+FLY A VH
Sbjct: 212 LRNGPSCTQGNQTHTAASPCVYEEQLLADHAVRAVEEHDDTAPLFLYWALHLVHM----- 266
Query: 231 PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
P Q PD +A+F I D RR MV D+ VG ++ AL + GML N++V F +DNG
Sbjct: 267 PLQVPDAYLARFDGIPDEPRRFMTAMVRYADDEVGRLVDALERRGMLANAVVTFHSDNGG 326
Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTL 348
G + G+N PL G K + ++GG+R A + + + ++ + S+L + DW T
Sbjct: 327 EIMGAGACGGNNWPLTGGKFSNFEGGIRVNAFVSGGAVPEARRGAALSQLSTLWDWYATY 386
Query: 349 CAAAGIEINDTS------------LDGVNQWDVLTKG-AKTKRSEIL 382
AGI D+ LD V+QW +T A R E++
Sbjct: 387 ---AGIAGADSIDDAAAAAAGLPPLDSVDQWPYVTGANATPPRREVV 430
>gi|296121201|ref|YP_003628979.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013541|gb|ADG66780.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 479
Score = 180 bits (457), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 178/345 (51%), Gaps = 14/345 (4%)
Query: 14 TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
LL + T+ +P+I++I+ADDLG+ D+ G +IPTP++D LA +G+ YV
Sbjct: 22 ALLAAEDVNQTSKSGRPNILVIMADDLGYADLGVQGGCEIPTPHLDQLAASGIRCTNAYV 81
Query: 74 QA-LCTPSRSALMTGKYPIHIGMQHGV-ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
A C+PSR+ +TGKY G + + E GLPL E + L+ GY T IGK
Sbjct: 82 SAPYCSPSRAGFLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGK 141
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIG 190
WH GF ++ + P RGFD +G+ G +Y H KA F DM + + G
Sbjct: 142 WHQGFSKD-HHPQSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHDMIYRGREVEPQEG 200
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-FQAPDEEVAKFLDISDPE 249
Y+TDL+T A+ ++ +KP FLYL++ AVH P Q E K P
Sbjct: 201 -YATDLFTNEALRWMSG-PPNKPWFLYLSYNAVHTPLEIAPHLQKRIPESVKL-----PA 253
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG--IHSNKGSNHPLRG 307
RR Y +++ LD+S+G + L +HG+ E ++++F++DNG + N G NHPLRG
Sbjct: 254 RRGYLSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAYNSGLNHPLRG 313
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
K +GG+R + P + + + D LPT+C A
Sbjct: 314 DKGQTLEGGIRVPFFVSWPGQLPARTIYEQPIISLDLLPTVCQLA 358
>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
Length = 502
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 178/350 (50%), Gaps = 31/350 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMTGK 88
P+I++IL+DD G+ D F GSSQI TPN+D LA + Q YV A+C PSR+ ++TGK
Sbjct: 60 PNIVLILSDDAGYGDFGFQGSSQIRTPNLDNLAVQSTVFTQAYVSAAVCGPSRAGILTGK 119
Query: 89 YPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
Y G + + L G+ GLPL ++ + YL GY+T IGKWH G +
Sbjct: 120 YQQRFGFEENNVPGYMSDSGLTGDDMGLPLNQRTIGDYLTHFGYSTALIGKWHQG-NADK 178
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL----DMRHNMQVDNKTIGIYSTDL 196
+ PT RGF+ YG+ G + Y++ F P + D K Y T
Sbjct: 179 FHPTKRGFEHFYGFRGGARSYFE------FGPNNPVSYPEDRLERGFAHYKESPHYLTQA 232
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
+ AI I + N+ +P FL+L+ AVH P A E++A+F +S +R+ A M
Sbjct: 233 LADEAIKFI-KQNQREPFFLFLSFNAVHT-----PMDANKEDLAQFPQLSG-KRQRVAAM 285
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
+D +G V+A+L + G+ EN++++F DNG PS +N N PL G K+ +GG
Sbjct: 286 TLSMDREIGKVLASLDELGLSENTLLVFTNDNGGPS---DTNASDNTPLSGTKANHLEGG 342
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVN 365
+R + P E D LPT AAAG + + + LDG N
Sbjct: 343 IRVPFLMRWPAGNIGPNTYHEPISTLDLLPTFFAAAGGKKSAVADLDGTN 392
>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 471
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 123/380 (32%), Positives = 185/380 (48%), Gaps = 25/380 (6%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+ F + L + A + + K+P+I+ + +DD G+ D F GS + TPN+D LA
Sbjct: 2 KSVFHRVVAALSISVACTSLSYAKQPNIVFLFSDDAGYADFGFQGSETMKTPNLDQLASE 61
Query: 65 GLILNQHYVQ-ALCTPSRSALMTGKYPIHIG---------MQHGVILEGEPWGLPLTEKL 114
G+ Q YV + C PSR+ +MTG+Y G M ++G G+PL E
Sbjct: 62 GVRFTQGYVSDSTCGPSRAGIMTGRYQQKFGYEEINVPGYMSEHSAIKGAEMGIPLDEVT 121
Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP-- 172
+ Y+K GY T GKWHLG E++ P RGFD YG+ G + Y+ + A
Sbjct: 122 MGDYMKSLGYRTAFYGKWHLGGTDELH-PMHRGFDEFYGFRGGDRSYWAYEVNAPERKSA 180
Query: 173 -YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
+ + H + + G Y TD+ E A N E KP F++L+ AVH P
Sbjct: 181 VFTDKKLEHGIDQFQEHEG-YLTDVLAEKA-NQFIEKAPDKPFFIFLSFNAVHT-----P 233
Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
+A E++AKF + +R+ A M LD + G V+ L++ G+ ++++V+F DNG P
Sbjct: 234 MEATPEDLAKFPQLKG-KRKEVAAMTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGP 292
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
+ N SN+PL G KS +GG+R + P KV + D LPT A
Sbjct: 293 T---DKNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKA 349
Query: 352 AGIEINDTSLDGVNQWDVLT 371
G E + LDGV+ +T
Sbjct: 350 GGGEEVMSELDGVDLMPYIT 369
>gi|194919165|ref|XP_001983032.1| GG19813 [Drosophila erecta]
gi|190647643|gb|EDV45031.1| GG19813 [Drosophila erecta]
Length = 299
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/299 (37%), Positives = 164/299 (54%), Gaps = 28/299 (9%)
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
++ L LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R AIWS
Sbjct: 1 MSTLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGGVRVAGAIWSSK 60
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR--SEILHN 384
L+ + S+ +++DWLPTL AAGIE++ + LDG++ W L+ A EILH
Sbjct: 61 LQARGSIFSQPLYVADWLPTLSHAAGIELDSSLKLDGIDLWPELSGSADAPHVPREILHI 120
Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
+D++ AAL++ KYV GT G+ D T E+D P+E Y+
Sbjct: 121 LDDI----WRVAALQLGQWKYVNGTTAAGRYDSLL--TYRELDDLDPRESRYA------- 167
Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
+ +++A +A S LR LT ++I R A VRC + CN ++ C
Sbjct: 168 -------VTVRNSATSRALSRYDLRR-LTQQRISITRRLAAVRCGDLQRS--CNPLLEEC 217
Query: 505 LFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
L++I+ DPCEQNNL +E +D+L L ++ +++ PGN+ ADPA W
Sbjct: 218 LYDISTDPCEQNNLVYSERHSDVLAALRRRVRELRASASRPGNRASMPEADPALHTCAW 276
>gi|87306602|ref|ZP_01088749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
gi|87290781|gb|EAQ82668.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
Length = 468
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 145/487 (29%), Positives = 220/487 (45%), Gaps = 55/487 (11%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ AL LL ++A + + P I++I++DD G+ D+S G + TP +D LA +G
Sbjct: 9 FVCALASALLVSNAVAAEKSKRPPSIVLIVSDDQGFADLSCIGDNGCRTPRLDQLAASGT 68
Query: 67 ILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-----------------GL 108
L YV CTPSR++LMTG+YP G + E + G
Sbjct: 69 RLTSFYVSWPACTPSRASLMTGRYPQRNGTYDMIRNEAPDYDYLYTPEEYAVTAERILGT 128
Query: 109 PLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKA 168
L E L LK+AGY + GKW G + Y P RGFD +YG+ DY+ H
Sbjct: 129 DLQEVFLADVLKQAGYVSAVFGKWDGGQLKR-YLPLQRGFDQYYGFANTGVDYFTH---- 183
Query: 169 TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
E Y M + Q + G Y TDL+ AI I E N +P FLYL A H+ +
Sbjct: 184 --ERYGVPSMFRDNQPTEEDKGTYLTDLFEREAIRFIDE-NHDRPFFLYLPFNAPHSASN 240
Query: 229 YEP-----FQAPDEEVAKF---LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
+ QAP E + F + R+ Y V R+DE++G V+ L++H + +N+
Sbjct: 241 LDRSIRGFAQAPQEYLDHFPGGESKQEKRRQAYLAAVERMDEAIGKVVDQLQQHQIADNT 300
Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
+++F++DN N PLRG K+ ++GG R + P KVS++
Sbjct: 301 LIIFLSDN------GGGGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKVPAGKVSNQFLT 354
Query: 341 ISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
+ PT+ AA G ++ +D DG + VL GA + R E+ ++ A R
Sbjct: 355 SLEVFPTVIAAIGGKLPDDVIYDGFDMLPVLN-GASSPREEMFWK-------RRGDVAAR 406
Query: 400 VDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAAD 459
V D K+V G D + D + D + +K F+A +++ AAD
Sbjct: 407 VGDWKWVDSAAGKGLFDLAH-DIGEKKDLSKEHPEMLAKLKARFDAWTAEME-----AAD 460
Query: 460 PKANSSD 466
P+ D
Sbjct: 461 PRGPFRD 467
>gi|449135790|ref|ZP_21771228.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448885592|gb|EMB16025.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 476
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 145/414 (35%), Positives = 205/414 (49%), Gaps = 66/414 (15%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KP++I+ILADDLG+ D+S +G+ I TP+ID +A G N YV +C+P+R+ALMTG
Sbjct: 25 QKPNVIVILADDLGYGDLSCYGADDIATPSIDRMAAEGAKFNSFYVSPVCSPTRAALMTG 84
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+ +G+ GV+ GL E LP+ LK+ GYAT IGKWHLG ++++ P G
Sbjct: 85 SHSTRVGI-GGVMFPRNNHGLNPDEITLPELLKDQGYATAIIGKWHLG-NQDMFQPLKHG 142
Query: 148 FDSHYGYWQGL-----QDYYDHS------CKATFEPYQ--GLDMRHNMQV----DNKTIG 190
FD YW G Q YY ++ CK E Y G+ R DN I
Sbjct: 143 FD----YWYGTPASNNQFYYPNTKTYALDCKYR-EGYSCDGILKRETAACPLIEDNIVIE 197
Query: 191 IYS-----TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
+ + T YT I I NK KP F+YLAH H P A + KF+
Sbjct: 198 VPADQTQFTQRYTRETIRFIT-RNKDKPFFVYLAHNMPHI-----PLHASE----KFVGS 247
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
S +R Y + LD S G ++ +L++ G+ N++V+F +DNG P+ G GS PL
Sbjct: 248 S--KRGIYGDTIQELDWSTGEILRSLKELGLDNNTLVIFTSDNG-PNTG---KGGSAGPL 301
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGV 364
+G KS+ +GG+R P + S E D LPTL A G+ +D +DG
Sbjct: 302 KGGKSSTLEGGVRVPFVARWPGMIPAGTESDEPITGMDLLPTLTRIAVGVVPDDRVIDGK 361
Query: 365 NQWDVLTKGAKTKRSEILHNIDNVDNPQK--YY------AALRVDDLKYVAGTD 410
+ W +LT+ + K +P K YY A+RV D KY T+
Sbjct: 362 DIWPLLTRKSDAK------------SPHKAVYYLRGRGVQAIRVGDWKYRVATE 403
>gi|171184398|ref|NP_057931.3| N-acetylgalactosamine-6-sulfatase isoform 1 precursor [Mus
musculus]
gi|124007189|sp|Q571E4.2|GALNS_MOUSE RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|74198064|dbj|BAE35212.1| unnamed protein product [Mus musculus]
gi|148679747|gb|EDL11694.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_b [Mus
musculus]
Length = 520
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 26/366 (7%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
P+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+
Sbjct: 25 PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 84
Query: 86 TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
TG+ PI G ++ + G+P +E LLP+ LK+AGY +GKWHLG R
Sbjct: 85 TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 143
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
+ P GFD +G YD+ K Y+ +M ++ KT T
Sbjct: 144 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 203
Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
LYT+ A++ I +H + P FLY A A HA P +FL S R Y
Sbjct: 204 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 252
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D+SVG +++ L+ G+ +N+ V F +DNGA + GSN P K T +
Sbjct: 253 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 312
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
+GGMR A W P +VS +L I D T + AG++ +D +DG++ + K
Sbjct: 313 EGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTMLK 372
Query: 373 GAKTKR 378
G R
Sbjct: 373 GQMMDR 378
>gi|60359902|dbj|BAD90170.1| mFLJ00319 protein [Mus musculus]
Length = 537
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 26/366 (7%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
P+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+
Sbjct: 42 PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 101
Query: 86 TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
TG+ PI G ++ + G+P +E LLP+ LK+AGY +GKWHLG R
Sbjct: 102 TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 160
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
+ P GFD +G YD+ K Y+ +M ++ KT T
Sbjct: 161 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 220
Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
LYT+ A++ I +H + P FLY A A HA P +FL S R Y
Sbjct: 221 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 269
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D+SVG +++ L+ G+ +N+ V F +DNGA + GSN P K T +
Sbjct: 270 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 329
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
+GGMR A W P +VS +L I D T + AG++ +D +DG++ + K
Sbjct: 330 EGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTMLK 389
Query: 373 GAKTKR 378
G R
Sbjct: 390 GQMMDR 395
>gi|325285326|ref|YP_004261116.1| arylsulfatase [Cellulophaga lytica DSM 7489]
gi|324320780|gb|ADY28245.1| Arylsulfatase [Cellulophaga lytica DSM 7489]
Length = 466
Score = 179 bits (454), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 125/372 (33%), Positives = 192/372 (51%), Gaps = 39/372 (10%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F L +LLF +KP+I+ IL+DD G+ D F GS++ TP +D LA +
Sbjct: 13 FVLLEVSLLF--------GQQKPNIVFILSDDAGYADFGFQGSTEFKTPELDKLAKKSVK 64
Query: 68 LNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQY 118
Q YV A+C PSR+ ++TG+Y G + + L G+ GLP + + Y
Sbjct: 65 FTQAYVSAAVCGPSRAGMLTGQYQQKFGFEENNVPGYMSSSGLLGDQMGLPTNQLTIADY 124
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDY--YDHSCKATFEPYQGL 176
LK+ GY T GKWH+G +++ PT RGFD YG+ G + Y YD + T + L
Sbjct: 125 LKKQGYTTALFGKWHMG-NADLFHPTKRGFDEFYGFRGGSRSYLPYDKNNVLT-KNEDRL 182
Query: 177 DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
+ +++K Y TD AI+ I + NK P F+YL+ AVH P +A
Sbjct: 183 EKGFGNYLEHKG---YLTDKLANEAISFINK-NKKTPFFIYLSFNAVHT-----PMEATK 233
Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
E++ F ++ +R+T A M +D ++G V+ L+K G+ +N++V+F DNG PS
Sbjct: 234 EDLKYFSHLTG-KRKTLAAMTLAMDRAIGTVLDELKKQGLDKNTLVVFTNDNGGPS---D 289
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPT-LCAAAG 353
+N+ +N+PL G K+ +GG+R W + +K ++ + IS D PT L A G
Sbjct: 290 ANQSNNNPLSGTKANHLEGGIR--VPFLMSWPGKLKKNTTYNYPISTLDLFPTFLNVANG 347
Query: 354 IEINDTSLDGVN 365
LDGV+
Sbjct: 348 NSSKLKQLDGVD 359
>gi|260062069|ref|YP_003195149.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88783631|gb|EAR14802.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
Length = 492
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 127/377 (33%), Positives = 185/377 (49%), Gaps = 35/377 (9%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALM 85
P+KP+ II+ ADDLG+ D+S G I T N+D +A G YV A +CTPSR+ L+
Sbjct: 47 PEKPNFIIVFADDLGYGDLSSFGHPTIHTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLL 106
Query: 86 TGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
TG+ P+ G+ + GV G+P +E L + LK+AGYAT +GKWHLG +E Y
Sbjct: 107 TGRLPVRNGLTSNEIGVFFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLG-HKEEYL 165
Query: 143 PTFRGFDSHYGY-----------WQGLQDYYDHSCKA----TFEPYQGLDMRHNMQVDNK 187
P GFD ++G + QDY+ + E Y +R +++
Sbjct: 166 PPNHGFDDYFGIPYSNDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERP 225
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
T Y + A+ I EH K +P F+YLAH H P DE F S
Sbjct: 226 VNQNTITKRYNDEAVKWIREH-KDEPFFMYLAHSLPHV-----PLFTSDE----FRGTS- 274
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
R Y +V +D VG ++ L G+ EN+IV+F +DNG P + GS LR
Sbjct: 275 -ARGLYGDVVEEIDHGVGQIMELLEAEGLAENTIVVFTSDNG-PWLPTGISGGSAGLLRE 332
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQ 366
K T W+GGMR W+P + KV ++ D T + AG+ + +D +DGV+
Sbjct: 333 GKGTTWEGGMREPTIFWAPGMLPA-KVVMDMGSTLDLFNTFSSLAGVPMPDDREMDGVDL 391
Query: 367 WDVLTKGAKTKRSEILH 383
+L A++ R E+ +
Sbjct: 392 SPILFGDAESPRKEMFY 408
>gi|260062070|ref|YP_003195150.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88783632|gb|EAR14803.1| arylsulphatase A [Robiginitalea biformata HTCC2501]
Length = 459
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 144/417 (34%), Positives = 209/417 (50%), Gaps = 39/417 (9%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F++ C P P+I+ IL DDLG+ D+S G++ + +PNIDALA NG+
Sbjct: 20 LFSIGCLAAATGTCYAQERPDAPNILCILVDDLGYGDLSCQGATDLQSPNIDALAANGMR 79
Query: 68 LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEP--WG-LPLTEKLLPQYLKEAG 123
Y + +C+PSR+AL+TG+YP +G+ GVI + WG L L+P L AG
Sbjct: 80 FTNFYANSTVCSPSRAALLTGRYPDLVGVP-GVIRQNPENNWGNLADDAVLIPSELNPAG 138
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLD-MRHN 181
Y T IGKWHLG E TP RGF G+ + DY+DH G++ MR N
Sbjct: 139 YHTGIIGKWHLG-LEEPDTPNDRGFTYFKGFLGDMMDDYWDHR-------RGGINWMRLN 190
Query: 182 M-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
++D K ++TDL+T+ I+ + E + +P FLYLA+ A H P Q P E +
Sbjct: 191 REEIDPKG---HATDLFTDWTIDFLKERQGEEQPFFLYLAYNAPHF-----PIQPPREWL 242
Query: 240 AKFLDISDP----ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
K + +P +R V LD SVG V+ AL+ G+ EN++V+F++DNG G
Sbjct: 243 DKVRE-REPNLTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNG----GA 297
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
SN PLRG K ++GG+R A + S + D PT C AG +
Sbjct: 298 LWYAQSNGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRK 357
Query: 356 INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVD---NPQKYYAALRVDDLKYVAGT 409
+ ++DG++ LT A+ + L+ + Q YYAA R D K + T
Sbjct: 358 PPE-NVDGISLVPTLTGQAQDTANRYLYWVRREGGDYGGQAYYAA-RFGDFKILQNT 412
>gi|313216787|emb|CBY38029.1| unnamed protein product [Oikopleura dioica]
Length = 383
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 90/203 (44%), Positives = 125/203 (61%), Gaps = 8/203 (3%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
K+PH + ILADD+G++D+ +H ++I TP +D LA G+ L Q+YVQ +CTP+R LMTG
Sbjct: 174 KQPHFLFILADDMGYHDIGYH-QAEILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTG 232
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y I GMQHGV+ +P G+PL EKLLP+ L++ GY T IGKWHLG F E Y P RG
Sbjct: 233 RYQIRYGMQHGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRG 292
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-----GIYSTDLYTEAAI 202
FD G++ G QD+Y H+ F G D R + I YST ++ +
Sbjct: 293 FDHFMGFYTGSQDFYSHN--KCFSGMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELE 350
Query: 203 NVIAEHNKSKPMFLYLAHLAVHA 225
+++ N S+P F YL+ AVH+
Sbjct: 351 KRLSKMNPSEPSFTYLSFQAVHS 373
>gi|399033016|ref|ZP_10732099.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
gi|398068627|gb|EJL60037.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
Length = 460
Score = 178 bits (452), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 128/369 (34%), Positives = 194/369 (52%), Gaps = 33/369 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KP+II+I+ADD GWNDV +HGS I TPNID LA NG+ LN+ Y C+PSR++L TG
Sbjct: 33 EKPNIILIIADDAGWNDVEYHGSV-IQTPNIDFLAKNGVELNRFYANPTCSPSRASLFTG 91
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+ +G+ I + + LP + LP+ L + Y T IGKWHLG ++ P G
Sbjct: 92 RPASRMGIV-APISDKSQFKLPDSIATLPKLLHQNNYQTALIGKWHLG-LQQSSGPKAYG 149
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FD YG+ G D Y H K + + R+ +D + ++TDL T AI+ ++E
Sbjct: 150 FDYSYGFLHGQIDQYTHLYKNGDKSW----YRNGEFIDEQG---HATDLITNEAIHWLSE 202
Query: 208 HNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVG 265
S K FL +A+ A H P Q + +++ I D RR +A +S LD S+G
Sbjct: 203 KRDSNKNFFLEVAYSAPHF-----PLQEEQKWKDPYMNSIKDSSRRDFAAAMSHLDYSIG 257
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPS--------FGIHSNK---GSNHPLRGMKSTPWD 314
++ L++ + +N++V+FM+DNGA G+H + G N PLR K++ ++
Sbjct: 258 ILLEKLKQENLDKNTLVIFMSDNGAMENWDSRNEYNGVHPSNTVLGDNSPLRDWKTSNYE 317
Query: 315 GGMRGVAAI--WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
GG+R V I W LK + S I D LP++ AG + S++G N W + +
Sbjct: 318 GGIR-VPCIFYWKDHLKNYK--SESYISIIDLLPSILFLAGDKNLPQSIEGKNVWPAIAE 374
Query: 373 GAKTKRSEI 381
+I
Sbjct: 375 NKAIANQQI 383
>gi|311748006|ref|ZP_07721791.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|126574986|gb|EAZ79344.1| sulfatase family protein [Algoriphagus sp. PR1]
Length = 529
Score = 178 bits (452), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 134/407 (32%), Positives = 196/407 (48%), Gaps = 69/407 (16%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
N P P+I+ IL DDLG+NDVSF GS TPNID +A G+I Y A+C+PSR
Sbjct: 22 NPEPPSNPNIVFILVDDLGYNDVSFMGSEFYETPNIDKVASKGMIFTNGYANSAVCSPSR 81
Query: 82 SALMTGKYPIHIGMQHGV-ILEGEPWG--------LP------LTEKL--LPQYLKEAGY 124
++LMTG++ G+ + +GE W LP L++++ LP+ LK+ GY
Sbjct: 82 ASLMTGRFTAVHGITDWIGAPQGEAWRSYKRYSKLLPAEYNHQLSKEITTLPEALKQEGY 141
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYG-------YWQGLQDYYDHSCKATFEPYQGLD 177
T GKWHLG E PT GFD + G Y G +++ FE +G+
Sbjct: 142 KTFFAGKWHLGSAEENSLPTDHGFDINKGGYHVGGPYSGGYFSPFENPFMEDFEEEKGIS 201
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
+ +M++ +T N+ E NK + YL+ AVHA P Q E
Sbjct: 202 L--SMKLAKET--------------NLFIEQNKEEKFLAYLSFYAVHA-----PIQTSKE 240
Query: 238 EVAKFLDISD---------------PERR-----TYAGMVSRLDESVGNVIAALRKHGML 277
+ +K+ + ++ P R+ YAG++ ++DE+VG V+ L K G+
Sbjct: 241 KWSKYRNKAEAMGIDSTGFEMERVLPARKYQDNPVYAGLIEQMDEAVGAVVDQLEKLGLS 300
Query: 278 ENSIVLFMADNGAPSFGIHSNKGSN-HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
EN+I++F +DNG G N +N PLRG K W+GG R I+ PWL+ + +
Sbjct: 301 ENTIIVFTSDNGGVVSG--DNYSTNLDPLRGGKGYQWEGGTRIPYMIYVPWLENNGQRND 358
Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
D+ PTL AG+ LDG + VL G +RS H
Sbjct: 359 TPVSGVDFYPTLLDFAGVNEISQPLDGQSIRPVLEGGTLEERSIYWH 405
>gi|443724469|gb|ELU12468.1| hypothetical protein CAPTEDRAFT_223869 [Capitella teleta]
Length = 441
Score = 178 bits (452), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 109/314 (34%), Positives = 171/314 (54%), Gaps = 21/314 (6%)
Query: 108 LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK 167
+ L + YLKE Y THA GKWHLG+ + TPT+RGFD+ G + G +YY+H+
Sbjct: 1 MDLKYNFISDYLKELKYNTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEGEYYEHTT- 59
Query: 168 ATFEPYQGL-DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG 226
+QG+ D + ++D GI+S D+ +N + + N P+F Y A H
Sbjct: 60 -----FQGMYDWHNGTELDRSASGIHSQDMIEADLLNQL-DKNDGTPLFYY---AAFH-- 108
Query: 227 NTYEPFQAPDEEVAKFLDI-SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
NT+ P E A + D+ + P RR Y G+VS +D +G V+A L++ M +N+ + F
Sbjct: 109 NTHSPMHPKPEHEALYDDLDAPPTRRKYLGLVSGMDAVIGKVVAKLKEKDMFDNTYIFFS 168
Query: 286 ADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWL 345
+DNG +H +G N P RG KS+ +DGG + SP LK+ + L HI+DWL
Sbjct: 169 SDNGGD---VH--EGDNSPKRGAKSSLFDGGCHAHSWASSPLLKKKGVEADGLLHITDWL 223
Query: 346 PTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNID-NVDNPQKYYAALRVDDLK 404
PT+ AG E+ D +DG+ Q D++T +KR+++++NID ++ + + DL+
Sbjct: 224 PTIVKLAGGEVPD-GIDGIEQVDLITNAGPSKRTDMIYNIDMELETGTPDFGTMAARDLR 282
Query: 405 YVAGTDNNGQSDEW 418
Y G+SD +
Sbjct: 283 YKLIWGFEGRSDGY 296
>gi|444523022|gb|ELV13435.1| Arylsulfatase J [Tupaia chinensis]
Length = 383
Score = 178 bits (452), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 120/386 (31%), Positives = 188/386 (48%), Gaps = 61/386 (15%)
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
GIYST +YT+ ++A HN +KP+FLY+A+ AVH+ P QAP + I +
Sbjct: 19 GIYSTQMYTQRVQQILASHNPAKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININ 73
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGM 308
RR YA M++ LDE++ NV AL+ +G +NSI+++ +DNG P+ G GSN PLRG
Sbjct: 74 RRRYAAMLACLDEAIHNVTLALKTYGFYDNSILIYSSDNGGQPTAG-----GSNWPLRGS 128
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQW 367
K T W+GG+R V + SP LK V EL HI+DW PTL + A +I+ D LDG + W
Sbjct: 129 KGTYWEGGIRAVGFVHSPLLKNRGTVCRELVHITDWYPTLISLAEGQIDEDIQLDGYDIW 188
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID 427
+ +++G ++ RVD L + ++ W
Sbjct: 189 ETISEGLRSP---------------------RVDILHNIDPIYTKAKNGSWA-------- 219
Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
+ GI A+++ ++++ SD + + F+ +
Sbjct: 220 ---------AGYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQAFSNLG 260
Query: 488 -CNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK 546
+ N+ ++ LFNIT DP E+ +L+ ++KQL +L+ + T VP
Sbjct: 261 PTRWHNERVTLSTGKSVWLFNITADPYERVDLSSRYPGIVKQLLRRLSQFNKTAVPVRYP 320
Query: 547 PFDKRADPARWNNIWVPWYDELDKQK 572
P D R++P +W PWY E K+K
Sbjct: 321 PKDPRSNPRLNGGVWGPWYKEEYKKK 346
>gi|431799329|ref|YP_007226233.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430790094|gb|AGA80223.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 454
Score = 178 bits (451), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 186/368 (50%), Gaps = 38/368 (10%)
Query: 13 CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
C++L + T + P+II+I ADDLG+ D+ +G I TP +D +A G+ Q Y
Sbjct: 18 CSVL-----VAQTPKRPPNIIVIFADDLGYGDLGCYGHPTIQTPYLDRMAKEGMRFTQFY 72
Query: 73 VQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
V A +CTPSR+AL+TG+ PI GM + GV+ GLP TE + LK AGY T
Sbjct: 73 VGADVCTPSRAALLTGRLPIRYGMAGEERGVLFPDSSKGLPHTEATMASALKAAGYRTGI 132
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
+GKWHLG E Y PT GFD ++G + H+ Y+ ++ ++ +D +
Sbjct: 133 VGKWHLGHLPE-YLPTTHGFDFYFGIPYSNDMRPNPHNHVPPLPLYRNEEVI-DLDIDQR 190
Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLY----LAHLAVHAGNTYEPFQAPDEEVAKFL 243
+ T YT+ AI I E N+ +P FLY H+ ++A + +E
Sbjct: 191 QL----TKRYTKEAIRFI-EENQERPFFLYYPNNFPHVPLYASSDFE------------- 232
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
+R Y +VS LD SVG ++ L+ + EN++V+F +DNG P GS
Sbjct: 233 --GKSKRGIYGDVVSELDWSVGRILEKLKALDLAENTLVIFTSDNG-PWLSQKEKGGSAG 289
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLD 362
L K++ ++GGMR A W P + ++S+ L D PT AG+ + D +LD
Sbjct: 290 LLFEGKASTYEGGMRVPAIAWWPGTIKPNQISTALVASMDLYPTFLTMAGLNMPQDKTLD 349
Query: 363 GVNQWDVL 370
G N +L
Sbjct: 350 GDNILPIL 357
>gi|7527462|gb|AAF63155.1|AF111346_1 N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
gi|7576473|gb|AAF63858.1| N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
Length = 520
Score = 177 bits (450), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 120/366 (32%), Positives = 181/366 (49%), Gaps = 26/366 (7%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
P+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+
Sbjct: 25 PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 84
Query: 86 TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
TG+ PI G ++ + G+P +E LLP+ LK+AGY +GKWHLG R
Sbjct: 85 TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 143
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
+ P GF+ +G YD+ K Y+ +M ++ KT T
Sbjct: 144 PQFHPLKHGFNEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 203
Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
LYT+ A++ I +H + P FLY A A HA P +FL S R Y
Sbjct: 204 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 252
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D+SVG +++ L+ G+ +N+ V F +DNGA + GSN P K T +
Sbjct: 253 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 312
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
+GGMR A W P +VS +L I D T + AG++ +D +DG++ + K
Sbjct: 313 EGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTMLK 372
Query: 373 GAKTKR 378
G R
Sbjct: 373 GQMMDR 378
>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
Length = 471
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 118/330 (35%), Positives = 171/330 (51%), Gaps = 24/330 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
++P+II ILADDLG D+ +G + I TPN+D LA G+ Q Y A+C+ +R AL+T
Sbjct: 31 EQPNIIFILADDLGHADLGVYGQTDIRTPNLDKLAGQGIRFTQAYANSAVCSATRFALIT 90
Query: 87 GKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
G+Y + G++ + + GLP T LP LK+ GY T IGKWHLG+ + P
Sbjct: 91 GRYQYRLRGGLEEPIAGASDTLGLPRTHPTLPSLLKKQGYGTALIGKWHLGYL-PTFGPL 149
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
G+DS +G + G DY+ H K P D+ + +V IG Y TDL A++
Sbjct: 150 KSGYDSFFGNYGGAIDYFTH--KPGVGPQVKEDL-YEGEVPVHQIGYY-TDLLGARAVDF 205
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVAKFLD----ISDPERRTYAGMVSR 259
+ + KP L L + A H P++ P DE +++ +D + TY MV
Sbjct: 206 VQKQQAGKPFLLSLHYTAPH-----WPWEGPGDEAISRDIDNIFHYDGGDLATYGRMVES 260
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
LD S+G V+ AL G+ EN+IV+F +DNG F PL G K+ +GG+R
Sbjct: 261 LDASIGRVLKALEDKGLAENTIVIFTSDNGGERF------SKTWPLTGQKTELLEGGIRV 314
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
A + P +VS ++ DWLPTL
Sbjct: 315 PAIVRWPAKIAAGQVSEQVAISMDWLPTLL 344
>gi|13278373|gb|AAH04002.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Mus musculus]
Length = 520
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/367 (32%), Positives = 180/367 (49%), Gaps = 26/367 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL
Sbjct: 24 APQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAAL 83
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P +E LLP+ LK+AGY +GKWHLG
Sbjct: 84 LTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH- 142
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
R + P GFD +G YD+ K Y+ +M ++ KT
Sbjct: 143 RPQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANL 202
Query: 194 TDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T LY + A++ I +H + P FLY A A HA P +FL S R
Sbjct: 203 TQLYLQEALDFIRTQHARQGPFFLYWAIDATHA---------PVYASRQFLGTS--LRGR 251
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y V +D+SVG ++ L+ G+ +N+ V F +DNGA + GSN P K T
Sbjct: 252 YGDAVREIDDSVGKILNLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTT 311
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
++GGMR A W P +VS +L I D T + AG++ +D +DG++ +
Sbjct: 312 FEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTML 371
Query: 372 KGAKTKR 378
KG R
Sbjct: 372 KGQMMDR 378
>gi|426242290|ref|XP_004015007.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Ovis aries]
Length = 522
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/367 (32%), Positives = 178/367 (48%), Gaps = 26/367 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL
Sbjct: 26 APQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDQMATEGMLFPNFYTANPLCSPSRAAL 85
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P +E LLP LK AGYA+ +GKWHLG
Sbjct: 86 LTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH- 144
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 145 RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL 204
Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T +Y + A+ I + +P FLY A A HA P FL S +R
Sbjct: 205 TQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA---------PVYASKPFLGTS--QRGR 253
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y + LD+SVG ++ L+ G+ EN+ V F +DNGA GSN P K T
Sbjct: 254 YGDAIRELDDSVGRILRLLQDLGIAENTFVFFTSDNGAALISAPRQGGSNGPFLCGKQTT 313
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
++GGMR A W P +VS +L I D T + AG+E D ++DG++ +
Sbjct: 314 FEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPRDRAIDGLDLLPAML 373
Query: 372 KGAKTKR 378
+G T R
Sbjct: 374 QGRLTDR 380
>gi|4503899|ref|NP_000503.1| N-acetylgalactosamine-6-sulfatase precursor [Homo sapiens]
gi|462148|sp|P34059.1|GALNS_HUMAN RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|618426|gb|AAC51350.1| N-acetylgalactosamine 6-sulphatase [Homo sapiens]
gi|870751|dbj|BAA04535.1| N-acetylgalactosamine 6-sulfate sulfatase [Homo sapiens]
gi|33440495|gb|AAH56151.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
gi|37589093|gb|AAH50684.2| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
gi|119587163|gb|EAW66759.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
mucopolysaccharidosis type IVA), isoform CRA_c [Homo
sapiens]
Length = 522
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
+ AP+ P+I+++L DD+GW D+ +G TPN+D +A GL+ Y LC+PSR+
Sbjct: 25 SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRA 84
Query: 83 ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
AL+TG+ PI G ++ + G+P +E+LLP+ LK+AGY + +GKWHLG
Sbjct: 85 ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
T +Y + A++ I + P FLY A A HA P FL S +R
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y V +D+S+G ++ L+ + +N+ V F +DNGA GSN P K T
Sbjct: 253 RYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
++GGMR A W P +VS +L I D T A AG+ +D ++DG+N L
Sbjct: 313 TFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372
Query: 371 TKGAKTKRSEILHNIDNV 388
+G R + D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390
>gi|189069200|dbj|BAG35538.1| unnamed protein product [Homo sapiens]
Length = 522
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
+ AP+ P+I+++L DD+GW D+ +G TPN+D +A GL+ Y LC+PSR+
Sbjct: 25 SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAGGLLFPNFYSANPLCSPSRA 84
Query: 83 ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
AL+TG+ PI G ++ + G+P +E+LLP+ LK+AGY + +GKWHLG
Sbjct: 85 ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
T +Y + A++ I + P FLY A A HA P FL S +R
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y V +D+S+G ++ L+ + +N+ V F +DNGA GSN P K T
Sbjct: 253 RYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
++GGMR A W P +VS +L I D T A AG+ +D ++DG+N L
Sbjct: 313 TFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372
Query: 371 TKGAKTKRSEILHNIDNV 388
+G R + D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390
>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
Length = 328
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 177/350 (50%), Gaps = 49/350 (14%)
Query: 31 HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN-GLILNQHYVQALCTPSRSALMTGKY 89
H + I+ADDLG+ D+ + I TPNID LA G+IL+ Y C+PSR++ +TG+
Sbjct: 1 HHLQIMADDLGYYDLGYRNPDSI-TPNIDQLATQEGVILDNAYGYRYCSPSRASFLTGRV 59
Query: 90 PIHIGMQHGVILEGEPWGLP------LTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
PIH+ +G P GL L ++P L+ AGY T +GKWH G P
Sbjct: 60 PIHV-------HQGNP-GLAAAGCTNLNYTMIPAQLRRAGYRTAMVGKWHQGASLPECLP 111
Query: 144 TFRGFDSHYGYWQGLQDYYDHS-----CKAT-FEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
RGFD+ +GY G +D+ D + C T F G +R N G YS+ Y
Sbjct: 112 VNRGFDTSFGYLSGEEDHMDQTTNGGQCNVTDFWLDSGPAIRRN--------GTYSSFQY 163
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-----------DIS 246
+A +++I +H +P+ LY A VH P++ P+ AKF S
Sbjct: 164 NDAIVDIIQQHAPEQPLMLYAALQNVHG-----PYEVPERYRAKFPADKNCTETGDNPTS 218
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG-SNHPL 305
R MV+ LD+ VGN+ AL+ +GM N++++F +DNG G + +NHPL
Sbjct: 219 QCTRDFMLAMVNVLDDIVGNITRALKANGMWNNTVLVFSSDNGGAVPGAPQHGSMNNHPL 278
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAG 353
RG K ++GG+R A + SP L ++ + + L HISDW T C AG
Sbjct: 279 RGGKVAYFEGGVRTAAFVASPLLPKSIRGTELRGLIHISDWYATFCHLAG 328
>gi|256421633|ref|YP_003122286.1| sulfatase [Chitinophaga pinensis DSM 2588]
gi|256036541|gb|ACU60085.1| sulfatase [Chitinophaga pinensis DSM 2588]
Length = 452
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 131/396 (33%), Positives = 197/396 (49%), Gaps = 34/396 (8%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
AL+C F A L K+P+++II DD G DV+ +G+ + TPNID LA G++ +
Sbjct: 11 ALSC---FMAAPLFAQQQKRPNVLIIYTDDQGTLDVNCYGAKDLHTPNIDRLAKEGVLFS 67
Query: 70 QHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
Q Y A +C+PSR++L+TG+YP + + E G+P ++ + + K+ GY T
Sbjct: 68 QFYAAAPVCSPSRASLLTGRYPQRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAH 127
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
IGKWH+G+ E P +GFD +G+ G D Y H Y RH++ + +
Sbjct: 128 IGKWHIGYSPET-MPNQQGFDYSFGFMGGCIDNYSHYF------YWAGPNRHDLWRNGQE 180
Query: 189 I---GIYSTDLYTEAAINVIAEHNKS--KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
I G + DL T +N E NK KP FLY A H P Q ++ +
Sbjct: 181 IWEDGKFFADL-TVQEVNGFLEKNKRADKPFFLYWAINMPHY-----PLQGQEKWRQYYK 234
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP----SFGIHSNK 299
D+ P RR YA VS +DE +G V+ L + G+ EN+IV+F +D G SFG
Sbjct: 235 DLPAP-RRMYAAAVSTMDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTEDRSFG---GG 290
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
G P RG K + ++GG+R A I W+ L + + V +L DW PTL + +
Sbjct: 291 GFTGPYRGAKFSLFEGGIRVPAIIRWTGHLPKNE-VRDQLCVNIDWYPTLAGLCKVALPQ 349
Query: 359 TSLDGVNQWDVLTKGAKTKRSEIL--HNIDNVDNPQ 392
+DG + V+T + +I + +NPQ
Sbjct: 350 RKIDGKDIQQVITSSKTSSPHDIFFWQSQGTKENPQ 385
>gi|407938758|ref|YP_006854399.1| sulfatase [Acidovorax sp. KKS102]
gi|407896552|gb|AFU45761.1| sulfatase [Acidovorax sp. KKS102]
Length = 452
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/356 (34%), Positives = 182/356 (51%), Gaps = 26/356 (7%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRS 82
+ + + P+I+ ILADDLGW D+S +G + TP++DALA G+ Q Y A+C+ +R
Sbjct: 10 SASTQAPNIVFILADDLGWADLSVYGQADFTTPHLDALAGEGVRFTQAYANSAVCSATRF 69
Query: 83 ALMTGKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
AL+TG+Y + G++ ++ + GLP LP L++AGY T IGKWHLG
Sbjct: 70 ALITGRYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRDAGYDTALIGKWHLGSL-PT 128
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
+ P G+D +G + G DY+ H + L + +V + +G Y T L +
Sbjct: 129 FGPLKSGYDRFFGNYGGAIDYFTHKPGVGDAVARDL---YEGEVPVERVGYY-TQLLADE 184
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVA----KFLDISDPERRTYAG 255
A I E + +KP FL L A H P+ P DEE++ TYA
Sbjct: 185 ATRWIGERSAAKPFFLSLHFTAPHW-----PWVGPEDEEISLGLKDLFHYDGGNLATYAR 239
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV LD++VG V+ AL+ G+ +N+IV+F +DNG F P G K+ +G
Sbjct: 240 MVRSLDKAVGQVLEALKAQGLADNTIVVFTSDNGGERF------SKTWPFTGQKTELLEG 293
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVL 370
G+R + P + +VS ++ DWLPTL AAAG+ + D DG N VL
Sbjct: 294 GIRVPTLLRWP-ARIAPQVSDQVTATMDWLPTLLAAAGVAPHADYPPDGENILPVL 348
>gi|119587161|gb|EAW66757.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
mucopolysaccharidosis type IVA), isoform CRA_a [Homo
sapiens]
Length = 708
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
+ AP+ P+I+++L DD+GW D+ +G TPN+D +A GL+ Y LC+PSR+
Sbjct: 25 SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRA 84
Query: 83 ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
AL+TG+ PI G ++ + G+P +E+LLP+ LK+AGY + +GKWHLG
Sbjct: 85 ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
T +Y + A++ I + P FLY A A HA P FL S +R
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y V +D+S+G ++ L+ + +N+ V F +DNGA GSN P K T
Sbjct: 253 RYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVL 370
++GGMR A W P +VS +L I D T A AG+ +D ++DG+N L
Sbjct: 313 TFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372
Query: 371 TKGAKTKRSEILHNIDNV 388
+G R + D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390
>gi|440715400|ref|ZP_20895947.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
gi|436439744|gb|ELP33158.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
Length = 1004
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 138/419 (32%), Positives = 200/419 (47%), Gaps = 33/419 (7%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
T KP+ I+IL DD G+ D+S G+ + TP ID +A G L YV A +CTPSR+
Sbjct: 557 TAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRA 616
Query: 83 ALMTGKYPIHI----GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
LMTG YP I G GV+L G+P GL E + + LK AGY T GKWHLG
Sbjct: 617 GLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQP 676
Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
E + PT +GFD +G Y + ++ F L +++D + T
Sbjct: 677 E-FLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPSLPLLQNDTVIEMDPDA--DFLTKR 733
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPF-QAPDEEVAKFLDISD------P 248
TE A++ I E NK +P FLYL H HA + PF + ++V ++ D
Sbjct: 734 LTEQAVSFI-ERNKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYAT 792
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
+ ++ +D SVG ++ ALR +G+ E ++VLF +DNG P ++++ G LRG
Sbjct: 793 RANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGE---LRGH 849
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
K T ++GGMR + P + EL D LPT AG D +DG + W
Sbjct: 850 KGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAGAATPTDRVIDGKDIW 909
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
L +T ++ N AA+R K NNG + + Y D +N++
Sbjct: 910 PTLKGETQTPHDAFFYHRGN------QLAAVRSGKWKLHV---NNGVAKQLY-DLENDL 958
Score = 122 bits (307), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 165/379 (43%), Gaps = 70/379 (18%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
+ A + P++++I DDLG+ D+ +G++++ TPNID LA G + H A+CTPSR
Sbjct: 25 SVAAERPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSR 84
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWG-LPLTE--------KLLPQYLKEAGYATHAIGKW 132
L+TG+YP+ G+ WG LP T K + + K GYAT +GKW
Sbjct: 85 YGLLTGQYPVRAMGGQGI------WGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKW 138
Query: 133 HLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD----YYDHSCKATF-- 170
HLGF E P GFD ++G + + D YD S +
Sbjct: 139 HLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGG 198
Query: 171 ---EPYQGLDMRHNMQVDNKTIG------IYSTD----LYTEAAINVIAEHNKSKPMFLY 217
P +++ N+ G IY + L TE A+ I E K +P FLY
Sbjct: 199 KPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE-KKDEPFFLY 257
Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
A +H T P +F S + Y V LD VG ++ +L +G+
Sbjct: 258 FATPNIHHPFTPAP---------RFKGTS--QCGLYGDFVHELDWMVGEIVQSLEDNGLT 306
Query: 278 ENSIVLFMADNG-----APSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQT 331
+N++VLF +DNG A I + N L G K W+GG R + A W +K
Sbjct: 307 DNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAG 366
Query: 332 QKVSSELFHISDWLPTLCA 350
+ S +L D T A
Sbjct: 367 TQ-SDQLISQVDLFATFSA 384
>gi|223936859|ref|ZP_03628768.1| sulfatase [bacterium Ellin514]
gi|223894428|gb|EEF60880.1| sulfatase [bacterium Ellin514]
Length = 460
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 142/404 (35%), Positives = 192/404 (47%), Gaps = 36/404 (8%)
Query: 1 MTWARKYFFALTCTLLFNDAFL-NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
MT KY L L F A K P+I+IILADDLG+ D+ +G I TPN+D
Sbjct: 1 MTRISKYL--LVVAFLIAATFAPGVFAAKAPNIVIILADDLGYGDLGCYGHPSIRTPNLD 58
Query: 60 ALAYNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQ---HGVILEGEPWGLPLTEKLL 115
+A G+ YV A +CTPSR+ LMTG++PI GM + V+ GLP E +
Sbjct: 59 RMAAEGIRFTDFYVAANVCTPSRAGLMTGRWPIRSGMAGSTNDVLRVFSMGGLPTNEITI 118
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGY-----WQGLQDYYDHSCKATF 170
LK GYAT IGKWHLG + + PT GFD YG + ++ + ++
Sbjct: 119 AAALKSKGYATACIGKWHLGHELQ-FLPTHHGFDYFYGLRFSNDMEPVRGKIPKNASSSL 177
Query: 171 EPYQGLDMRHNMQVDNKTIGIYSTDL------YTEAAINVIAEHNKSKPMFLYLAHLAVH 224
P L+ ++ + N I TDL YTE AI I NK +P FLY H H
Sbjct: 178 HP--KLEWWNSALLQNDKILEQPTDLSTLTRRYTEEAIKFI-HQNKKQPFFLYFPHTFPH 234
Query: 225 AGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
P A D F + S R Y +V LD SVG V+ LRK G+ EN++V F
Sbjct: 235 V-----PLFASD----AFKEHS--ARGLYGDVVEELDWSVGQVLNTLRKEGLAENTLVFF 283
Query: 285 MADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
+DNG P GS PL+ K + W+GGMR A W P K ++ EL D
Sbjct: 284 TSDNG-PWLIRDLAGGSAGPLKDGKGSTWEGGMREPAIAWWPG-KIKPAINHELVCSLDL 341
Query: 345 LPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
T AG+ + D +DG++ +L ++R + + N
Sbjct: 342 FTTSLCLAGVALPPDRVIDGLDMRPMLFGTGHSQRQLMFYYKKN 385
>gi|150008689|ref|YP_001303432.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937113|gb|ABR43810.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
Length = 452
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 130/380 (34%), Positives = 184/380 (48%), Gaps = 26/380 (6%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
KY +T L D + P KP+II+I DD+G+ D+S GS I TPNID +A G
Sbjct: 5 KYILYITGVLASYDGH---SQPTKPNIIVINCDDMGYGDLSCFGSPTIKTPNIDRMAIEG 61
Query: 66 LILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKE 121
+ YV A + +PSR+ L+TG+ + GM Q V+ GLP E + + LK+
Sbjct: 62 QKWSSFYVSASVSSPSRAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEELTIAELLKQ 121
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKA--TFEPYQGLDMR 179
AGY T IGKWHLG E Y P GFD YGY K T PY+ +
Sbjct: 122 AGYHTACIGKWHLGHLPE-YMPLRHGFDYFYGYPYSNDMSRKEQIKLGNTKYPYEYIIYE 180
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
+++ + T TEAAI I + N++ P FLYLAH H P A +
Sbjct: 181 QEKELEREPQQYNLTQQVTEAAIRYI-KSNENSPFFLYLAHPMPHM-----PVYASTDFQ 234
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
K R Y V LD SVG ++ L+ G+ +N++V+F +DNG P
Sbjct: 235 GK------SARGKYGDTVEELDWSVGQILQTLKSEGLDKNTLVIFTSDNG-PWLLCKQEG 287
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-ND 358
GS PL+ K++ ++GG R +W +K +++ D LPT C AGI + +D
Sbjct: 288 GSPGPLKDGKASMFEGGFRVPCIMWGAMVKPG--YITDMASTLDLLPTFCEIAGIPLPSD 345
Query: 359 TSLDGVNQWDVLTKGAKTKR 378
DG++ +VL + KR
Sbjct: 346 RHYDGISLLNVLKDKSTCKR 365
>gi|114145565|ref|NP_001041316.1| N-acetylgalactosamine-6-sulfatase precursor [Rattus norvegicus]
gi|123779981|sp|Q32KJ6.1|GALNS_RAT RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|81158026|tpe|CAI84987.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus
norvegicus]
Length = 524
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 181/373 (48%), Gaps = 26/373 (6%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCT 78
L AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+
Sbjct: 22 GLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCS 81
Query: 79 PSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
PSR+AL+TG+ PI G ++ + G+P +E LLP+ LK+AGY +GK
Sbjct: 82 PSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGK 141
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNK 187
WHLG R + P GFD +G YD+ K Y+ +M ++ K
Sbjct: 142 WHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLK 200
Query: 188 TIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
T T LY + A++ I +H + P FLY A A HA P +FL S
Sbjct: 201 TGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA---------PVYASKQFLGTS 251
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
R Y V +D+SVG +++ L+ G+ +N+ V F +DNGA GSN P
Sbjct: 252 --LRGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFL 309
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVN 365
K T ++GGMR A W P +VS +L I D T + AG++ +D +DG++
Sbjct: 310 CGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLD 369
Query: 366 QWDVLTKGAKTKR 378
+ +G R
Sbjct: 370 LLPTMLQGHIIDR 382
>gi|449132446|ref|ZP_21768529.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448888348|gb|EMB18668.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 1001
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 131/375 (34%), Positives = 187/375 (49%), Gaps = 24/375 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
T KP+ I+IL DD G+ D+S G + TP ID +A G L YV A +CTPSR+
Sbjct: 554 TAGTTKPNFIVILTDDQGYGDLSCFGGKHVATPRIDQMAAEGSRLTSFYVAAPVCTPSRA 613
Query: 83 ALMTGKYPIHIGM----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
LMTG YP IGM GV+L G+ GL E + + LK GY T GKWHLG
Sbjct: 614 GLMTGCYPKRIGMAMGSNFGVLLAGDRKGLHPNEITIAEVLKTVGYRTGMFGKWHLGDQP 673
Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
E + PT +GFD +G Y + ++ F P L+ +N V+ + T
Sbjct: 674 E-FLPTKQGFDEFFGLPYSHDIHPFHPRQNHYQFPPLPLLE--NNTVVEMEPNADLLTKR 730
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQ---APDEEVAKFLDISDPERRT 252
TE A++ I + NK +P FLY+ H HA + PF A D VA + + + T
Sbjct: 731 ITERAVSFI-DRNKDQPFFLYIPHPIPHAPLHVSPPFMKDVADDVVVAINKEDGNIDYAT 789
Query: 253 YAGM----VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
A + ++ +D SVG ++ AL+ +G+ E ++VLF +DNG P + ++ G LRG
Sbjct: 790 RANLFHQAIAEIDWSVGQILDALKSNGLDEKTMVLFTSDNGPPKNTLFASAGQ---LRGH 846
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
K T ++GG+R + P + ++EL D LPT AG I D +DG + W
Sbjct: 847 KGTAFEGGVREPTVVRWPGQVPAGQENNELMSTMDLLPTFAKLAGAAIPTDRVIDGKDIW 906
Query: 368 DVLTKGAKTKRSEIL 382
L KG E+
Sbjct: 907 TTL-KGETQSPHEVF 920
Score = 119 bits (299), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 113/379 (29%), Positives = 169/379 (44%), Gaps = 70/379 (18%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
+ A + P++++I DDLG+ D+S +G++++ TPNID LA G + H A+CTPSR
Sbjct: 22 SAAAERPPNVVLIFVDDLGYGDLSCYGATKLSTPNIDQLAAEGRRFTDAHSASAVCTPSR 81
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWG-LPLTEKL--------LPQYLKEAGYATHAIGKW 132
L+TG+YP+ GV WG LP T L + + K+ GYAT +GKW
Sbjct: 82 YGLLTGQYPVRAMGGQGV------WGPLPTTSGLIIDTNTQTIGKVFKDKGYATACLGKW 135
Query: 133 HLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD----YYDHSCKATF-- 170
HLGF E P GFD ++G + + D YD + +
Sbjct: 136 HLGFKEEPNDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPNDPLVYGG 195
Query: 171 ---EPYQGLDMRHNMQVDNKTIG------IYSTD----LYTEAAINVIAEHNKSKPMFLY 217
P +++ N+ G IY + L T+ A+ I+E K + FLY
Sbjct: 196 KPVSPTPTFTAEASIKSPNRFSGALKAHEIYDDEKTGTLLTDRAVKWISE-KKDELFFLY 254
Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
A +H T P +F S + Y V LD VG ++ +L +G+
Sbjct: 255 FATPNIHHPFTPAP---------RFKGTS--QCGLYGDFVHELDWMVGEIVKSLEANGLT 303
Query: 278 ENSIVLFMADNGA--PSFGIHSNKGSNHP---LRGMKSTPWDGGMR-GVAAIWSPWLKQT 331
+N++VLF +DNGA G + K + P L G K W+GG R + A W +K
Sbjct: 304 DNTLVLFTSDNGAMLNRAGRDAVKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKPG 363
Query: 332 QKVSSELFHISDWLPTLCA 350
+ S +L D T A
Sbjct: 364 TQ-SDQLISQVDLFATFSA 381
>gi|256841279|ref|ZP_05546786.1| arylsulfatase A [Parabacteroides sp. D13]
gi|262383570|ref|ZP_06076706.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|423330544|ref|ZP_17308328.1| hypothetical protein HMPREF1075_00341 [Parabacteroides distasonis
CL03T12C09]
gi|256737122|gb|EEU50449.1| arylsulfatase A [Parabacteroides sp. D13]
gi|262294468|gb|EEY82400.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|409232160|gb|EKN25008.1| hypothetical protein HMPREF1075_00341 [Parabacteroides distasonis
CL03T12C09]
Length = 452
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 130/380 (34%), Positives = 184/380 (48%), Gaps = 26/380 (6%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
KY +T L D + P KP+II+I DD+G+ D+S GS I TPNID +A G
Sbjct: 5 KYILYITGVLASYDGH---SQPTKPNIIVINCDDMGYGDLSCFGSPTIKTPNIDRMAIEG 61
Query: 66 LILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKE 121
+ YV A + +PSR+ L+TG+ + GM Q V+ GLP E + + LK+
Sbjct: 62 QKWSSFYVSASVSSPSRAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEELTIAELLKQ 121
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKA--TFEPYQGLDMR 179
AGY T IGKWHLG E Y P GFD YGY K T PY+ +
Sbjct: 122 AGYHTACIGKWHLGHLPE-YMPLRHGFDYFYGYPYSNDMSRKEQIKLGNTKYPYEYIIYE 180
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
+++ + T TEAAI I + N++ P FLYLAH H P A +
Sbjct: 181 QEKELEREPQQYNLTQQVTEAAIRYI-KSNENSPFFLYLAHPMPHM-----PVYASTDFQ 234
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
K R Y V LD SVG ++ L+ G+ +N++V+F +DNG P
Sbjct: 235 GK------SARGKYGDTVEELDWSVGQILQTLKSEGLDKNTLVIFTSDNG-PWLLCKQEG 287
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-ND 358
GS PL+ K++ ++GG R +W +K +++ D LPT C AGI + +D
Sbjct: 288 GSPGPLKDGKASMFEGGFRVPCIMWGAMVKPG--YITDMASTLDLLPTFCEIAGIPLPSD 345
Query: 359 TSLDGVNQWDVLTKGAKTKR 378
DG++ +VL + KR
Sbjct: 346 RHYDGISLLNVLKDKSTCKR 365
>gi|229822457|ref|YP_002883983.1| Cerebroside-sulfatase [Beutenbergia cavernae DSM 12333]
gi|229568370|gb|ACQ82221.1| Cerebroside-sulfatase [Beutenbergia cavernae DSM 12333]
Length = 458
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 121/328 (36%), Positives = 172/328 (52%), Gaps = 28/328 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
++P+I++I ADDLG+ D+ +GS + TP++D LA G+ L Y+ + +C+PSR ++T
Sbjct: 3 QRPNIVLINADDLGYGDLGCYGSMRNDTPHLDRLAAEGVRLTDFYMASPVCSPSRGGMLT 62
Query: 87 GKYPIHIG----MQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
G YP IG + V+ G+P GL E+ + + L +AGYAT AIGKWH G E +
Sbjct: 63 GCYPPRIGFGEFVGRPVLFPGDPVGLDPAERTMARVLGDAGYATAAIGKWHCGDQPE-FL 121
Query: 143 PTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
PT GFDS++G + + +H G + Q D +++ T+ YT A
Sbjct: 122 PTRHGFDSYFGIPFSNDMGRQREHEDWPPLPLMSGESVVQE-QPDQRSL----TERYTVA 176
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A I E N +P FLYLAH+ VH P P A FL S Y G V+ L
Sbjct: 177 ATRFI-EENAHQPFFLYLAHMYVHV-----PLFVP----APFLAAS--RNGGYGGAVAAL 224
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D S G V+ LR+ G+ EN+IV+F +DNG+ + G GSN PLRG K+ W+GG R
Sbjct: 225 DWSTGVVMDTLRRLGLEENTIVVFTSDNGSRARG---EGGSNDPLRGHKAQTWEGGQRVA 281
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTL 348
+ P V + D LPT
Sbjct: 282 CVVRWPAAIPAGGVCDAVTRSIDLLPTF 309
>gi|325109725|ref|YP_004270793.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969993|gb|ADY60771.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 471
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 124/380 (32%), Positives = 187/380 (49%), Gaps = 24/380 (6%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+ C L + L +P++I+I++DD GW+DV F+G +IPTP++DALA +G
Sbjct: 4 RLVLVCLCVLGVLFSLLPEANAGQPNVIVIVSDDQGWSDVGFNGCKEIPTPHLDALAKSG 63
Query: 66 LILNQHYV-QALCTPSRSALMTGKYPIHIGMQ-----HGVILEGEPWGLPLTEKLLPQYL 119
+ + Y C+PSR+ L+TG+Y G + HG E GLPL+E LL
Sbjct: 64 VAFDCGYASHPYCSPSRAGLLTGRYQQRFGHECNPGAHGNDDAIEMEGLPLSETLLSTVF 123
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR 179
+ AGY T AIGKWHLG + + PT RGF+ +G+ G Y+ K P G+ +R
Sbjct: 124 RNAGYRTGAIGKWHLGDEPQFW-PTERGFEEWFGFSGGGLSYWGDLGKKP--PLHGV-LR 179
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
+ V + + D TEA V E N+++P FLYLA+ A HA P Q E +
Sbjct: 180 NGDVVPKDELTYLTDDFSTEAVKFV--EENRARPFFLYLAYNAPHA-----PDQVTREHL 232
Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
K I R Y MV+ +DE +G V+ AL + + +++ F +DNG +
Sbjct: 233 KKTSHIEYGGRAVYGAMVAAMDEGIGRVVQALDDQNLRDETVIFFYSDNGG-----RATL 287
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
N P RG K ++GG+R + W ++ ++ + + + L AA
Sbjct: 288 ARNFPYRGHKGMLFEGGIRVPFCVSWPEQIEGGRRYEPPVTALDIFPTVLAAAGVDAPAA 347
Query: 359 TSLDGVNQWDVLTKGAKTKR 378
LDGVN L G+K +R
Sbjct: 348 LQLDGVNLLPYLA-GSKQRR 366
>gi|431907857|gb|ELK11464.1| Arylsulfatase B [Pteropus alecto]
Length = 274
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 99/226 (43%), Positives = 134/226 (59%), Gaps = 18/226 (7%)
Query: 64 NGLILN-QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
NG L +H++ L A++T K IH G+QH +I +P +PL EKLLPQ LKEA
Sbjct: 2 NGCTLTLEHFLMEL------AILTLK--IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEA 53
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHS----CKATFEPYQGLDM 178
GY TH +GKWHLG +R+ PT RGFD+++GY G +DYY H A LD
Sbjct: 54 GYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLINALNVTRCALDF 113
Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
R +V +YST+++T+ A +I H KP+FLYLA +VH EP Q P+E
Sbjct: 114 RDGEEVATGYENMYSTNIFTQRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEY 168
Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
+ + I D R YAGMVS +DE+VGNV AAL+ HG+ N++ +F
Sbjct: 169 LKPYDFIQDKNRHYYAGMVSLMDEAVGNVTAALKSHGLWNNTVFIF 214
>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
variabilis ATCC 29413]
gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
ATCC 29413]
Length = 457
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 122/380 (32%), Positives = 199/380 (52%), Gaps = 37/380 (9%)
Query: 16 LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-Q 74
LF+ A T +P+++ IL DD+GW D+S +G + TPN+D LA G+ Y Q
Sbjct: 31 LFSRA---TAQSSRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQ 87
Query: 75 ALCTPSRSALMTGKYP--IHIGMQHGVILEGEP----WGLPLTEKLLPQYLKEAGYATHA 128
+CTP+R A +TG+Y + +G++ + +P G+P + + LK GY T
Sbjct: 88 TVCTPTRIAFLTGRYQARLPVGLREPLGARSQPASNNIGIPANQPTIASLLKANGYETAL 147
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
+GKWH G + + P +GFD ++G+ G +Y+ H+ LD+ N V +
Sbjct: 148 VGKWHAG-YPPNFGPLQKGFDEYFGHLSGGIEYFTHTGTDRI-----LDLYEN-DVPVQR 200
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD- 247
G Y TDL+T+ A+ I + S+P +L L + A H P+Q P+++ + +++
Sbjct: 201 SG-YVTDLFTDRAVEFI-QRPHSRPFYLSLHYNAPH-----WPWQGPNDQASTAFYLTNG 253
Query: 248 ----PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
+ TYA MV LD+ VG V+ AL G +N++V+F +DNG F SN G
Sbjct: 254 YTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGGERF---SNFG--- 307
Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLD 362
P RG K++ ++GG+R A I P + Q +VS+++ D T+ AA G + + D
Sbjct: 308 PFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAATGTSFHPNYPPD 367
Query: 363 GVNQWDVLTKGAKTKRSEIL 382
G N +L +G +++ S L
Sbjct: 368 GQNLLPLL-RGDRSEFSRTL 386
>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
Length = 482
Score = 176 bits (447), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 179/352 (50%), Gaps = 27/352 (7%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSA 83
T+P P+I+ IL+DD G+ D F GS + TPN+D LA G + Q YV A+C PSR+
Sbjct: 35 TSP--PNILFILSDDAGYADFGFQGSDVMRTPNLDKLASQGTVFTQAYVSAAVCGPSRAG 92
Query: 84 LMTGKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
++TGKY G + + L G+ GLPL +K + YL+E GY T IGKWH G
Sbjct: 93 ILTGKYQQRFGYEENNVPGYMSQSGLTGDDMGLPLDQKTMADYLRERGYKTALIGKWHQG 152
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE-PYQGLDMRHNMQVDNKTIGIYST 194
+ + PT RGFD YG+ G + Y+ + P L+ ++K Y T
Sbjct: 153 -NADRFHPTKRGFDEFYGFRGGARSYFGFGAQNPVSYPEDKLERGFAHFQESKR---YLT 208
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
+ + I + N+ P F++L+ AVH P +A ++A+F ++ +R+ A
Sbjct: 209 EALATETVEFI-KRNQKHPFFVFLSFNAVHT-----PMEAKPADLAQFSNLKG-KRQQLA 261
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
M +D +G V+ L + G+ +N++V+F DNG PS +N +N PL G K+ +
Sbjct: 262 AMTLSMDREIGKVLNTLDELGLSDNTLVVFTNDNGGPS---DTNASNNGPLSGTKANHLE 318
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPT-LCAAAGIEINDTSLDGVN 365
GG+R + P K +Q D LPT AA G E ++DGV+
Sbjct: 319 GGIRVPFIMRWPAGKVSQGEYPNPISTLDLLPTFFSAAGGNEQQLAAIDGVD 370
>gi|417305609|ref|ZP_12092563.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327538042|gb|EGF24732.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 1004
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 141/419 (33%), Positives = 201/419 (47%), Gaps = 33/419 (7%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
T KP+ I+IL DD G+ D+S G+ + TP ID +A G L YV A +CTPSR+
Sbjct: 557 TAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRA 616
Query: 83 ALMTGKYPIHI----GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
LMTG YP I G GV+L G+P GL E + + L AGY T GKWHLG
Sbjct: 617 GLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLNTAGYRTGMFGKWHLGDQP 676
Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
E + PT +GFD +G Y + ++ F L +++D + T
Sbjct: 677 E-FLPTKQGFDEFFGIPYSHDIHPFHPRQNXYHFPSLPLLQNDTVIEMDPDA--DFLTKR 733
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQ---APDEEVAKFLDISDPERRT 252
TE A++ I E NK +P FLYL H HA + PF A D A + + + T
Sbjct: 734 LTEQAVSFI-ERNKDQPFFLYLPHPIPHAPLHASPPFMEDVADDVIAAIEKEDGNIDYAT 792
Query: 253 YAGM----VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
A + ++ +D SVG ++ ALR +G+ E ++VLF +DNG P ++++ G LRG
Sbjct: 793 RANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGE---LRGH 849
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
K T ++GGMR + P + EL D LPT AG I D +DG + W
Sbjct: 850 KGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIW 909
Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
L +T ++ N AA+R K NNG + + Y D +N++
Sbjct: 910 PTLKGETQTPHDAFFYHRGN------QLAAVRSGKWKLHV---NNGVAKQLY-DLENDL 958
Score = 123 bits (308), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 114/379 (30%), Positives = 165/379 (43%), Gaps = 70/379 (18%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
+ A + P++++I DDLG+ DV +G++++ TPNID LA G + H A+CTPSR
Sbjct: 25 SVAAERPPNVVLIFVDDLGYGDVGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSR 84
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWG-LPLTEKL--------LPQYLKEAGYATHAIGKW 132
L+TG+YP+ G+ WG LP T L + + K GYAT +GKW
Sbjct: 85 YGLLTGQYPVRAMGGKGI------WGPLPTTSGLIIDTNTQTIGKVFKNKGYATACLGKW 138
Query: 133 HLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD----YYDHSCKATF-- 170
HLGF E P GFD ++G + + D YD S +
Sbjct: 139 HLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGG 198
Query: 171 ---EPYQGLDMRHNMQVDNKTIG------IYSTD----LYTEAAINVIAEHNKSKPMFLY 217
P +++ N+ G IY + L TE A+ I E K +P FLY
Sbjct: 199 KPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE-KKDEPFFLY 257
Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
A +H T P +F S + Y V LD VG ++ +L +G+
Sbjct: 258 FATPNIHHPFTPAP---------RFKGTS--QCGLYGDFVHELDWMVGEIVQSLEDNGLT 306
Query: 278 ENSIVLFMADNG-----APSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQT 331
+N++VLF +DNG A I + N L G K W+GG R + A W +K
Sbjct: 307 DNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAG 366
Query: 332 QKVSSELFHISDWLPTLCA 350
+ S +L D T A
Sbjct: 367 TQ-SDQLISQVDLFATFSA 384
>gi|380796101|gb|AFE69926.1| N-acetylgalactosamine-6-sulfatase precursor, partial [Macaca
mulatta]
Length = 503
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 120/378 (31%), Positives = 180/378 (47%), Gaps = 25/378 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
+ AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+
Sbjct: 6 SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRA 65
Query: 83 ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
AL+TG+ PI G ++ + G+P +E+LLP+ LK+AGY + +GKWHLG
Sbjct: 66 ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 125
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 126 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 184
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
T +Y + A++ I + P FLY A A HA P FL S +R
Sbjct: 185 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 233
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y V +D+SVG ++ L + +N+ V F +DNGA GSN P K T
Sbjct: 234 RYGDAVREIDDSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQAGSNGPFLCGKQT 293
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVL 370
++GGMR A W P +VS +L I D T A AG+ D ++DG+N L
Sbjct: 294 TFEGGMREPALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAGLTPPRDRAIDGLNLLPAL 353
Query: 371 TKGAKTKRSEILHNIDNV 388
+G R + D +
Sbjct: 354 LQGRLMDRPVFYYRGDTL 371
>gi|149038400|gb|EDL92760.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus norvegicus]
Length = 466
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 181/373 (48%), Gaps = 26/373 (6%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCT 78
L AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+
Sbjct: 22 GLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCS 81
Query: 79 PSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
PSR+AL+TG+ PI G ++ + G+P +E LLP+ LK+AGY +GK
Sbjct: 82 PSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGK 141
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNK 187
WHLG R + P GFD +G YD+ K Y+ +M ++ K
Sbjct: 142 WHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLK 200
Query: 188 TIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
T T LY + A++ I +H + P FLY A A HA P +FL S
Sbjct: 201 TGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA---------PVYASKQFLGTS 251
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
R Y V +D+SVG +++ L+ G+ +N+ V F +DNGA GSN P
Sbjct: 252 --LRGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFL 309
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVN 365
K T ++GGMR A W P +VS +L I D T + AG++ +D +DG++
Sbjct: 310 CGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLD 369
Query: 366 QWDVLTKGAKTKR 378
+ +G R
Sbjct: 370 LLPTMLQGHIIDR 382
>gi|449138314|ref|ZP_21773584.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448883087|gb|EMB13631.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 485
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 129/424 (30%), Positives = 195/424 (45%), Gaps = 61/424 (14%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
R + L TL + +A +P+I++I++DD G+ND+ G+ I TP +D LA
Sbjct: 5 RNWMIHLFTTLAMLLVAIKASAADRPNILLIVSDDQGYNDLGQLGNGII-TPALDRLANE 63
Query: 65 GLILNQHYVQ-ALCTPSRSALMTGKYPIHIGM------------------QHGVILEGEP 105
G L YV CTPSR++L+TG+YP G+ Q+ V E
Sbjct: 64 GTRLTNFYVAWPACTPSRASLLTGRYPQRNGIYDMIRNEAPDYGHRYTPDQYAVTFE-RI 122
Query: 106 WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHS 165
G+ E ++P LK AGY + GKW LG R + P RGFD YG+ DY+ H
Sbjct: 123 GGMDEREVIIPAVLKRAGYKSGIYGKWDLGALRRML-PNSRGFDDFYGFVNTGIDYFTH- 180
Query: 166 CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA 225
E Y M N++ + G Y T L+ A+ + EH +P FLY+ A H
Sbjct: 181 -----ERYGVPCMVRNLEPTEEDKGTYCTYLFQREALRFLDEHAGEEPFFLYVPFNAPHN 235
Query: 226 GNTYEP-----FQAPDEEVAKF------------------LDISDPE--RRTYAGMVSRL 260
++ +P QAP+ A + ++ P+ RR Y V+ +
Sbjct: 236 SSSLDPKIRSSVQAPERFKAMYPPVESETRVTDRYRYGTPATVATPQARRRDYRAAVTCM 295
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D ++G ++ L MLE++IV+F +DNG S N PLRG K+ W+GG+R
Sbjct: 296 DAAIGEMLDRLEAKQMLEDTIVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVP 349
Query: 321 AAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKR 378
+ W+ V+ E + LP+ +AAG+E D LDG + W L + + R
Sbjct: 350 CIVRWTAGKIPAGAVNDEFLTSLELLPSFASAAGLEPPADIVLDGFDWWPTLRGESASPR 409
Query: 379 SEIL 382
E+
Sbjct: 410 EEMF 413
>gi|410215590|gb|JAA05014.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410254514|gb|JAA15224.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410288780|gb|JAA22990.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410330541|gb|JAA34217.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
Length = 522
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 119/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
+ AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+
Sbjct: 25 SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRA 84
Query: 83 ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
AL+TG+ PI G ++ + G+P +E+LLP+ LK+AGY + +GKWHLG
Sbjct: 85 ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
T +Y + A++ I + P FLY A A HA P FL S +R
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y V +D+S+G ++ L+ + +N+ V F +DNGA GSN P K T
Sbjct: 253 RYGDAVREIDDSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
++GGMR A W P +VS +L I D T A AG+ +D ++DG+N L
Sbjct: 313 TFEGGMREPALAWWPGHVAAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372
Query: 371 TKGAKTKRSEILHNIDNV 388
+G R + D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390
>gi|182677550|ref|YP_001831696.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
gi|182633433|gb|ACB94207.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
Length = 486
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 137/422 (32%), Positives = 204/422 (48%), Gaps = 43/422 (10%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
T + P+II ILADDLG+ DVS +G + TPNID + G L Q Y A+C+ +R+A
Sbjct: 32 TGDRAPNIIFILADDLGYADVSIYGRPDLSTPNIDGIGLKGARLLQAYANSAVCSATRTA 91
Query: 84 LMTGKYPI--HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
L+TG+Y +G++ I GLP LP LK+AGY T IGKWHLG + +
Sbjct: 92 LLTGRYQYRERVGLEE-PIAGNIHVGLPPQRPTLPSLLKKAGYTTTLIGKWHLGTLPD-F 149
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
P G+D YG+ G DYY H K T + D + Y T+L + A
Sbjct: 150 GPLQSGYDHFYGFRGGAVDYYSH--KGTDDQDDLWDQDTKVHQTG-----YLTELLGDRA 202
Query: 202 INVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV-------AKFLDISDPERRTY 253
I I A +P F+ L H + P++AP +E + D + TY
Sbjct: 203 IETINASAKTGQPFFISL-----HFNAPHWPWEAPGDEAESARVAGTRLFDFDGGSQATY 257
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
GM++ +D +G ++ AL+ +G+ EN+IV+F +DNG F P G K+
Sbjct: 258 RGMIAAMDLQIGRIVQALQANGISENTIVIFTSDNGGERF------ADTWPFTGRKTELL 311
Query: 314 DGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
+GG+R A + W +K Q + + DWLPTL AAAG E + + DG+N L+
Sbjct: 312 EGGLRIPALVSWPARIKADQTIDQVSISM-DWLPTLLAAAGSEPDPNFPSDGINLLPFLS 370
Query: 372 KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSP 431
+ ++ N Q+ A+R D KY+ DN D + D+ +++ +
Sbjct: 371 ESKAAIPRKLFWRYKA--NAQR---AVRDGDYKYLKIRDN----DFLFNVVDDPLERVNL 421
Query: 432 KE 433
KE
Sbjct: 422 KE 423
>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 494
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 133/381 (34%), Positives = 179/381 (46%), Gaps = 41/381 (10%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
AL T+L A L + KP++I+ILADDLG+ DV F GS++I TP +D LA G+I N
Sbjct: 3 ALCTTVLGLIAGLCSIEANKPNLILILADDLGYADVGFTGSTEIQTPVLDRLAAGGVIFN 62
Query: 70 QHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW----GLPLTEKLLPQYLKEAGY 124
YV A C PSR+ L+TG+Y G++ V P+ GLP EK LK++GY
Sbjct: 63 NGYVTHAYCGPSRAGLITGRYQARFGVE--VNFPYAPFDPHSGLPTDEKTFATRLKQSGY 120
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE------------- 171
T IGKWHLG Y P RGFD YG+ G DY + T
Sbjct: 121 RTAMIGKWHLGAAYP-YHPNNRGFDYFYGFLGGAHDYMPENTSTTVPLTLENGKVNHMAN 179
Query: 172 --PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
Y L MR+N+ N Y T + A I P +YL++ A H
Sbjct: 180 AGSYLPL-MRNNV---NAEFDEYLTTALSRDAARFI--EKTEGPFCVYLSYNAPHT---- 229
Query: 230 EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
P QAP + K+ I +RRTYA M+ +D+ +G V+ AL K G LEN+++ F +DNG
Sbjct: 230 -PLQAPKALIEKYAHIESQKRRTYAAMIDSMDQGIGRVVDALEKSGKLENTLIFFFSDNG 288
Query: 290 A--PSFGIHSNK-GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DW 344
P G N P RG K +GG + W K S+ + +S D
Sbjct: 289 GPRPKPGYEGEDFADNTPYRGGKGAMLEGGTH--VPFIAYWPKGIPAGSTYDYPVSALDV 346
Query: 345 LPTLCAAAGIEINDTSLDGVN 365
T + + L+GVN
Sbjct: 347 AATFVELGEGDTSGHPLEGVN 367
>gi|160891517|ref|ZP_02072520.1| hypothetical protein BACUNI_03968 [Bacteroides uniformis ATCC 8492]
gi|317478374|ref|ZP_07937538.1| sulfatase [Bacteroides sp. 4_1_36]
gi|156858924|gb|EDO52355.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
gi|316905533|gb|EFV27323.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 448
Score = 176 bits (446), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 128/380 (33%), Positives = 195/380 (51%), Gaps = 34/380 (8%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
A T + + + + ++P+ ++I DD+G+ D+ +G+ I TPNID +A G+
Sbjct: 12 ATTVSQMMFSSCSSEQEKEQPNFVLIFCDDMGYGDLGCYGNPTIHTPNIDRMASEGMKFT 71
Query: 70 QHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
Q YV A + TPSRSALMTG+ P+ G+ + V+ GL E + + L++ GYA
Sbjct: 72 QFYVGASVSTPSRSALMTGRLPVRNGLYGDKRDVLFPDSKAGLGQDEVTVAKLLQQNGYA 131
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
T +GKWHLG F Y PT GFD +G Y + A+ MR + ++
Sbjct: 132 TACVGKWHLGHFSP-YLPTDHGFDDFFGI-----PYSNDMRPASL-------MRGDSVLE 178
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQAPDEEVAKFLD 244
K T YTE A++ I + NK++P FLYLAH H +T E F+
Sbjct: 179 IKPEQGELTRRYTEYAVDYI-KKNKNRPFFLYLAHTFPHTPLHTNERFEGRS-------- 229
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
R Y +V LD SVG V+ AL+++G+ EN++V+F +DNG P N GS
Sbjct: 230 ----NRGLYGDVVEELDWSVGEVLKALKENGLDENTLVIFTSDNG-PWLVQRLNGGSAGA 284
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDG 363
LR KST W+GG+R V A++ + + D LPT AGI + + +LDG
Sbjct: 285 LRQGKSTCWEGGLR-VPAVFRMPGRIAPCTQQGMMATMDILPTFLNMAGIPLPEGVTLDG 343
Query: 364 VNQWDVLTKGAKTKRSEILH 383
++Q +L + ++ R E+ +
Sbjct: 344 MDQTPMLWEQRESSRDEMYY 363
>gi|149199849|ref|ZP_01876878.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149137020|gb|EDM25444.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 467
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 128/403 (31%), Positives = 195/403 (48%), Gaps = 58/403 (14%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+FF+L+ F D KP++++I ADD G+ D+S GS I TP ID +A G
Sbjct: 10 FFFSLSA---FAD---------KPNVVLIFADDQGYGDLSCFGSETIKTPYIDQMAKEGR 57
Query: 67 ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
V + +CTPSR+AL+TG YP +G+ GV+ GL E + + K GYA
Sbjct: 58 KFTSFMVASPVCTPSRAALLTGSYPKRVGLHKGVLFPQSTTGLNPAEYTIGDHFKSLGYA 117
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYYDHSCKATFEPYQGLDMRHN--- 181
T GKWHLG E P GFD +YG + ++ D+ K P G+D+ N
Sbjct: 118 TACYGKWHLGHHPETL-PISNGFDEYYGIPYSNDMNHPDNKNKPKGGP-GGMDLLWNDPE 175
Query: 182 ------------------MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAV 223
+ VD +T+ T YT+ AI+ I E NK K F+YL H
Sbjct: 176 STLTKWQTPLMEGAKIVEIPVDQRTV----TRRYTDKAISFI-ERNKEKSFFIYLPHSMP 230
Query: 224 HAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
H P PDE I DP+ + Y ++ +D+ VG + L++ G+ +N+IV
Sbjct: 231 HI-----PLYVPDE-------IRDPDPKNAYINVIEHMDDQVGRITKKLKELGLDKNTIV 278
Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
++ DNG P + GS LR K + ++GG R +W+P L ++EL
Sbjct: 279 IYTTDNG-PWLPFKHHGGSAGELRDGKGSTFEGGQRVPCVMWAPGLIPAGTETNELMTTI 337
Query: 343 DWLPTLCAAAGIEI-NDTSLDGVNQWDVLT-KGAKTKRSEILH 383
D LP+L A + + +DG++Q ++T K ++ R+E ++
Sbjct: 338 DLLPSLAALSSSTLPGKNKIDGIDQSALITGKAERSARNEFIY 380
>gi|239626021|ref|ZP_04669052.1| arylsulfatase E [Clostridiales bacterium 1_7_47_FAA]
gi|239520251|gb|EEQ60117.1| arylsulfatase E [Clostridiales bacterium 1_7_47FAA]
Length = 483
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 124/371 (33%), Positives = 187/371 (50%), Gaps = 27/371 (7%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
T KKP+II+ L DD G+ D+S GS+ + TPN+D LA G Y A+C+PSR+
Sbjct: 12 TIMKKPNIIVFLTDDQGYGDLSCMGSTDVCTPNLDILAAGGARFTDFYAGSAVCSPSRAC 71
Query: 84 LMTGKYPIHIGMQHGVILEG--EPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
L+TG+YP G++ IL G GL LK+ GY T +GKWHLG E
Sbjct: 72 LLTGRYPYMTGVRS--ILGGIKTTTGLNPGIPTFASALKDLGYTTGMVGKWHLGAVPEC- 128
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN----KTIGIYSTDLY 197
PT GFD G+ G+ DY+ H + G++ H++ ++ K G YST+L+
Sbjct: 129 RPTHMGFDYFCGFLSGVNDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELF 188
Query: 198 TEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
+ I E K P LY A A H P AP + + +F + + +R+ A M
Sbjct: 189 ARKGLEFIREQVEKDMPFALYCAFNAPHY-----PMHAPYKYLERFKHLPE-DRQIMAAM 242
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI--------HSNKGSNHPLRGM 308
+S +D+ VG ++ L++ G+ ++I+ F +DNG + GS L+G
Sbjct: 243 LSAVDDGVGEIMNYLKRRGIFNDTIIYFQSDNGPSKESRNWLDERKDYYYGGSTGGLKGH 302
Query: 309 KSTPWDGGMRGVAAIWS-PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW 367
K + +DGG+R V AI+S P + +V SE +D PT AAG +D + G +
Sbjct: 303 KFSLFDGGIR-VPAIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNASDYEISGCDIL 361
Query: 368 DVLTKGAKTKR 378
V+T GA+ +
Sbjct: 362 PVMTIGARRDK 372
>gi|47522740|ref|NP_999120.1| N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
gi|75054309|sp|Q8WNQ7.1|GALNS_PIG RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|18028088|gb|AAL55968.1|AF322917_1 N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
Length = 522
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 123/379 (32%), Positives = 184/379 (48%), Gaps = 27/379 (7%)
Query: 15 LLFNDAFLNTT-APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
L+ + A L T AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y
Sbjct: 14 LVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYA 73
Query: 74 -QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
LC+PSR+AL+TG+ PI G ++ + G+P E LLP+ LK AGYA
Sbjct: 74 ANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAGYA 133
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHN 181
+ +GKWHLG R + P GFD +G YD+ + Y+ +M
Sbjct: 134 SKIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEE 192
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
++ KT T +Y + A++ I + P FLY A A HA P
Sbjct: 193 FPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA---------PVYASR 243
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
FL S +R Y V +D+SVG ++ LR + N+ V F +DNGA G
Sbjct: 244 AFLGTS--QRGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALVSAPKQGG 301
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDT 359
SN P K T ++GGMR A W P +VS +L + D T + AG+E +D
Sbjct: 302 SNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPSDR 361
Query: 360 SLDGVNQWDVLTKGAKTKR 378
++DG++ + +G T+R
Sbjct: 362 AIDGLDLLPAMLQGRLTER 380
>gi|395856891|ref|XP_003800850.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Otolemur garnettii]
Length = 526
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 121/367 (32%), Positives = 177/367 (48%), Gaps = 26/367 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
AP+ P+I+++L DD+GW D+ +G TPN+D +A G + Y LC+PSR+AL
Sbjct: 30 APQSPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAAL 89
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P +E LLP+ LKEAGY + +GKWHLG
Sbjct: 90 LTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPRSEHLLPELLKEAGYISKIVGKWHLGH- 148
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 149 RPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNL 208
Query: 194 TDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T +Y + A++ I + P FLY A A HA P FL S +R
Sbjct: 209 TQIYLQEALDFIKRQQAQQHPFFLYWAIDATHA---------PVYASKAFLGTS--QRGR 257
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y V +D+SVG ++ LR + EN+ V F +DNGA GSN P K T
Sbjct: 258 YGDAVREIDDSVGKILDLLRGLSIGENTFVFFTSDNGAALISAPRQGGSNGPFLCGKQTT 317
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
++GGMR A W P +VS +L I D T + AG+E +D ++DG++ L
Sbjct: 318 FEGGMREPAIAWWPGHIPAAQVSHQLGSIMDLFTTSLSLAGLEPPSDRTIDGLDLLPTLL 377
Query: 372 KGAKTKR 378
+G R
Sbjct: 378 QGRLVDR 384
>gi|116621986|ref|YP_824142.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116225148|gb|ABJ83857.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 461
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 127/359 (35%), Positives = 178/359 (49%), Gaps = 39/359 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
++P+I++ILADDLG+ D+ +GS I TPNID LA G Y +C+PSR+ALMT
Sbjct: 26 RQPNIVVILADDLGYGDLGCYGSP-IATPNIDRLAEEGARFTSFYSASPVCSPSRAALMT 84
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+YP + + V+L GLP +E + Q LK AGY T IGKWH+G Y PT R
Sbjct: 85 GRYPTRVEVP--VVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIG-STPGYLPTNR 141
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
GFD +G +S T P VD T+ T +T+ A++ +
Sbjct: 142 GFDEFFG--------VPYSADITPCPLMRGSSVVAPAVDCSTL----TSSFTQEALDFM- 188
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
+ P FLYLAH A H P A + + D+ V LD S G
Sbjct: 189 RRAQDNPFFLYLAHTAPHLPLAASPRFAGQSGLGMYADV-----------VQELDWSTGQ 237
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
V+AAL+ G+ N++V+F +DNG P + +GS LRG K ++GGMR P
Sbjct: 238 VMAALKATGLDSNTLVMFSSDNG-PWY-----QGSQGKLRGRKGETYEGGMREPFLARYP 291
Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
+ + + L D LPTL AG + LDGV+ W VLT +R+E+ ++
Sbjct: 292 GVIPSGIGCAGLATTMDLLPTLARLAGAQTPSNPLDGVDIWPVLTG----ERAEVDRDV 346
>gi|386287174|ref|ZP_10064349.1| sulfatase family protein [gamma proteobacterium BDW918]
gi|385279706|gb|EIF43643.1| sulfatase family protein [gamma proteobacterium BDW918]
Length = 555
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 131/389 (33%), Positives = 192/389 (49%), Gaps = 62/389 (15%)
Query: 28 KKPHIIIILADDLGWNDVSFHG----SSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
+ P++I+ILADDLG ND+S +G + Q+ TPNID LA +G +Q Y A C PSR+
Sbjct: 63 RPPNVIVILADDLGINDISTYGGGLANGQVQTPNIDELAASGANFSQAYSGAGTCAPSRA 122
Query: 83 ALMTGKYPIHIGMQHGVILEG--------------------------------EPWGLPL 110
L+TG+YP G + +G + GLP
Sbjct: 123 MLLTGRYPTRTGFEFTPTPDGMGAMATMIASSHDNGLPPSIYNKAGADKIPSFDEQGLPS 182
Query: 111 TEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH----SC 166
+E L + L++ GY T IGKWHLG + ++P +GF+ GL DH +
Sbjct: 183 SEITLAEILRDRGYYTAHIGKWHLGRGKG-FSPEEQGFEQSLLMHSGLYLPVDHPDVVNA 241
Query: 167 KATFEPYQGL---DMRHNMQVDNKTIGI----YSTDLYTEAAINVIAEHNKSKPMFLYLA 219
K F+P MR+ ++ Y TD +T+ +I +I + NK++P FLYLA
Sbjct: 242 KVDFDPIDQFLWAKMRYAAAFNDGGPVFEPKSYLTDYWTDESIKII-KANKNRPFFLYLA 300
Query: 220 HLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLEN 279
H H P QA E+ DI R YA MV +D SVG ++A L++ G+ +N
Sbjct: 301 HWGAHT-----PLQATREDYEAVGDIKPHRARVYAAMVRAVDRSVGRILATLKEEGLSDN 355
Query: 280 SIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSE 337
+IV+F +DNG + GI N P RG K T ++GG+R + W + + + +
Sbjct: 356 TIVVFSSDNGGAGYIGIPD---INAPYRGWKLTMFEGGIRVPLFMRWPAHIARNTSIDTP 412
Query: 338 LFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
HI D+LPTL AAA ++ D +DG N
Sbjct: 413 AAHI-DFLPTLVAAAQAKLPEDRIIDGAN 440
>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 486
Score = 175 bits (443), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 134/413 (32%), Positives = 201/413 (48%), Gaps = 40/413 (9%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
P KP+I+ ILADD+GW+D+ +G+ TPNID A + Y ++C+PSRS LMT
Sbjct: 23 PDKPNILFILADDMGWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMSVCSPSRSTLMT 82
Query: 87 GKYPIHI-------GMQHG-----VILEGEP-WGLPLTEKLLPQYLKEAGYATHAIGKWH 133
GK+ + G Q G + E E W LP +EK + YLK AGY T IGKWH
Sbjct: 83 GKHAARLHFTIWAEGAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWH 142
Query: 134 LGFFREVYTPTFRGFDSHYG--YWQGLQDY-YDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
LG + + P GFD + G W Q + + +S T P + R+ ++ G
Sbjct: 143 LGDWE--HYPEAHGFDINIGGTNWGAPQTFWWPYSGSGTHGP----EFRYIPHLEYGHPG 196
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV----AKFLDIS 246
Y TD T+ AI VI +H +P F+YLAH AVH P +A +++ AK+ D
Sbjct: 197 EYLTDRLTDEAIKVI-DHAGDQPFFVYLAHHAVHT-----PIEAKADDIQHFDAKYRDGM 250
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------ 300
+ YA M LDE+VG V+ L++ G+ +N++V+F +DNG G+ G
Sbjct: 251 NHRHTIYAAMNKELDENVGRVLEHLKERGLDKNTVVIFASDNGG-YIGVDKVSGKNMPVT 309
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
+N PLR K ++GG+R I P + E ++D L T G +
Sbjct: 310 NNAPLRSGKGALYEGGIRVPLIIRWPGVTPNGATCDEPVILTDMLQTFLHITGQPPATDA 369
Query: 361 LDGVNQWDVLTK-GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
DG++ +L AK R + + + + +A+R D K + ++N
Sbjct: 370 TDGMDISPLLKDPSAKLNRDALFFHYPHYYHTTTPVSAIRARDWKLLEFYEDN 422
>gi|323455471|gb|EGB11339.1| hypothetical protein AURANDRAFT_2359, partial [Aureococcus
anophagefferens]
Length = 367
Score = 174 bits (442), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 119/371 (32%), Positives = 189/371 (50%), Gaps = 24/371 (6%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIP-----TPNIDALAYNGLILNQHYVQALCTPSRSAL 84
PHI+I+LAD++GW V +H +P TP +DAL +GL L++HY C+PSRSAL
Sbjct: 1 PHIVIMLADNVGWAVVEWHRPPALPAREISTPKLDALRASGLELDRHYTYKFCSPSRSAL 60
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF-REVYTP 143
M+G+ P H+ + + G P +L+P L++AGY TH +GKWHLG R + TP
Sbjct: 61 MSGRLPAHVNVYNDDPAMAGA-GAPAHMRLMPAMLRDAGYRTHFVGKWHLGMASRSLQTP 119
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD+ GY+ +YYD + D+ + Y ++ + ++
Sbjct: 120 AARGFDTSLGYFHSTNNYYDGTRAEGCGDAAHADLWDGDGPSHLNGTAYEELVFRDRVVD 179
Query: 204 VIAEHNKSKPMFLYLA-HLAV-------HAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
V+ H+ ++P+F+Y A H + AG + Q KF I D +RR
Sbjct: 180 VVEAHDAARPLFIYYALHTSCVGYDPDGTAGGERDSLQPDRAYYDKFSFIDDDDRRANVA 239
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH-PLRGMKSTPWD 314
MV+ +D+ G+V+ ALR G+ +++L+ +DNG +H GSN PLRG ++
Sbjct: 240 MVALMDDVAGDVVGALRTAGLWAKTLLLWSSDNGG---AVHLGGGSNTWPLRGGYYNNFE 296
Query: 315 GGMRGVAAIWSPWLKQTQ--KVSSELFHISDWLPTLCAAAGIEINDT---SLDGVNQWDV 369
GG+R A + L + K+ +E +I+DW T AG +D ++D ++ WD
Sbjct: 297 GGVRAPAFLAGGALPEVARGKLLTEWIYIADWYATFARLAGASPDDGAVPAVDSLDVWDA 356
Query: 370 LTKGAKTKRSE 380
+ + R E
Sbjct: 357 IAGDGASPRVE 367
>gi|410611985|ref|ZP_11323071.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
gi|410168398|dbj|GAC36960.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
Length = 508
Score = 174 bits (442), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 129/365 (35%), Positives = 185/365 (50%), Gaps = 42/365 (11%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
T+ P KP+I+ I ADDLG+ D+ +GS+ I TPNID +A G ++ Y + +CTPSR+
Sbjct: 44 TSPPDKPNIVFIFADDLGYGDIGAYGSTTINTPNIDKMAAQGAKFDEFYAASPVCTPSRA 103
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
L+TG+YPI G+ H V G+ E + + LK AGYAT +GKWHLG E Y
Sbjct: 104 GLLTGRYPIRQGI-HNVFFPESFQGMDPEEITIAEVLKGAGYATGLVGKWHLG-HHEQYM 161
Query: 143 PTFRGFDSHYGY-----WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
P +GFD +G GL YY ++ FE +VD + Y T Y
Sbjct: 162 PWNQGFDEFFGLPYSNDMGGL--YYFNNKDIDFE-----------EVDQR----YMTKTY 204
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
T+ A+ I +H + +P FLYLAH H + +P+ E Y +V
Sbjct: 205 TDQALQFIDKH-QEQPFFLYLAHNMPHV----PIYASPEFE-------GQSAGGAYGDVV 252
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LD SVG V+ L + + EN++V+F +DNG P + GS PLR K ++GGM
Sbjct: 253 EELDWSVGQVLKRLDELNLSENTLVIFTSDNG-PWLLMGDEGGSAGPLREGKQFTFEGGM 311
Query: 318 R-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLT-KGA 374
R A W + Q K L + DWLPT A + + D +DG + +LT KG
Sbjct: 312 RVPTVAYWPGTISQGTKPQG-LATMMDWLPTFANLANVPVPADRVIDGRDITGLLTGKGT 370
Query: 375 KTKRS 379
+ +++
Sbjct: 371 RAEQT 375
>gi|254444367|ref|ZP_05057843.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
gi|198258675|gb|EDY82983.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
Length = 462
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 181/361 (50%), Gaps = 29/361 (8%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
++A K P+I+ I ADDLG+ND+S +G++ I TP ID+L G+ Y +C+PSR+
Sbjct: 29 SSAEKPPNIVFIFADDLGYNDLSSYGATDIATPAIDSLGEQGIRFTDFYSASPVCSPSRA 88
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
AL+TG+YPI G+ GV G+ E + + L+E GY T +GKWHLG ++ +
Sbjct: 89 ALLTGRYPIRQGIT-GVFWPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLGHHQK-HL 146
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
P GF S++G +S Y + + +VD Y+T YTE A+
Sbjct: 147 PLQNGFHSYFG--------IPYSNDMDMVVYMRGNDVESYEVDQH----YTTRRYTEEAV 194
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
I E NK +P FLYLAH H P F+ S +R Y ++ LD
Sbjct: 195 QFI-EQNKDQPFFLYLAHSMPH---------VPIYASENFVGTS--KRGLYGDVIQELDW 242
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
SV ++ L KH + EN++V+F +DNG P + GS PLR K +DGGMR
Sbjct: 243 SVAQILDTLDKHQLSENTLVVFTSDNG-PWTALKHLGGSAAPLREGKMFTFDGGMRVPCL 301
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEI 381
+ P + S + ++ DW PT A ++ + S+DG++ DVLT +E
Sbjct: 302 VRWPAQIPAGQTSHAMANMMDWFPTFSRIANLDTPKSRSIDGLDITDVLTGSGPRADNEF 361
Query: 382 L 382
Sbjct: 362 F 362
>gi|302535739|ref|ZP_07288081.1| arylsulfatase [Streptomyces sp. C]
gi|302444634|gb|EFL16450.1| arylsulfatase [Streptomyces sp. C]
Length = 495
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 130/413 (31%), Positives = 188/413 (45%), Gaps = 34/413 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
++P+I+ IL DDLGW D+S +GS I TPN+D LA G+ Y A C+P+R +L T
Sbjct: 78 RRPNILFILGDDLGWADLSSYGSPHIKTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYT 137
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+YP + + GL T L LKEAGYAT IGKWH G+ + Y+PT
Sbjct: 138 GRYPARTRGGLAEPIADKSAGLDPTHPTLASLLKEAGYATALIGKWHAGYLPD-YSPTKS 196
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
G+D+ +G + G +YY ++ Y+G +++ Y T + TE A +
Sbjct: 197 GWDTFFGNFGGALEYYSKLGLGGEYDLYEGDAEYKDLR--------YYTRIITERASEYV 248
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER---------RTYAGM 256
+ + +P L L H + A E+ + + D R Y M
Sbjct: 249 SRDHGGRPWLLNLNFTTPHWPWIADGDTAESAEIVRRIKAGDARALWHQDGGSVRKYKQM 308
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V LD SVG V+ ALR+ G EN++V+F +DNG F N PL G K++ +GG
Sbjct: 309 VEDLDRSVGEVLQALRRSGQEENTLVVFSSDNGGERFSY------NWPLSGNKASLQEGG 362
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAK 375
+R + P +VS DW TL AG + LDGV+ L +G K
Sbjct: 363 IRVPNIVRWPARLDGGQVSRVPVFSPDWTATLLEVAGTRPHRAYPLDGVSLAGYLLRGEK 422
Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK 428
++ + + ALR D KY G Q + GD + DK
Sbjct: 423 PAERDLFWRV-------RGERALRRGDWKYYRGKSGRDQLFDLAGDLREQADK 468
>gi|260060774|ref|YP_003193854.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88784904|gb|EAR16073.1| arylsulfatase A (precursor) [Robiginitalea biformata HTCC2501]
Length = 526
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 127/396 (32%), Positives = 192/396 (48%), Gaps = 47/396 (11%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
A + P+I+II DD G++DV +G+ IPTPN+DA+A +GL+L Y Q +C+ SR+ L
Sbjct: 70 ADRPPNIVIIFTDDQGYSDVGVYGARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGL 129
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+TG YP +G+ H ++ P GL E+ L + L++ GY T GKWHLG + + PT
Sbjct: 130 LTGCYPNRVGI-HNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDHPD-FLPT 187
Query: 145 FRGFDSHYG------YW--QGLQD-YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
GFD +G W LQ +D +E + +D + ++ + I
Sbjct: 188 RHGFDEFFGIPYSNDMWPLHPLQGPVFDFGPLPLYEQERVVDTLEDQRLLTRQI------ 241
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
TE +++ I H K +P FLY+ H H P D K R Y
Sbjct: 242 --TERSVDFINRH-KEEPFFLYVPHPQPHV-----PLFVSDAFRGK------SGRGLYGD 287
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
++ +D SVG V+ AL +G+ +++ V+F +DNG P ++ G PLR K T W+G
Sbjct: 288 VIMEIDWSVGQVLGALEDNGLTDDTWVIFTSDNG-PWLAYGNHSGRAEPLREGKGTNWEG 346
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK 375
G+R + P KV E D LPT+ + G +DG N W +L+ GA+
Sbjct: 347 GVREPCIMKFPGRLPRGKVLDEPLMAIDLLPTIASVTGSPQPGREIDGKNAWGLLS-GAE 405
Query: 376 TKRSEILHNIDNVDNPQ-KYYAALRVDDLKYVAGTD 410
+ PQ YY RV++L+ V D
Sbjct: 406 AR------------GPQDAYYFYYRVNELQAVRDGD 429
>gi|440716877|ref|ZP_20897380.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436438070|gb|ELP31646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 485
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 128/404 (31%), Positives = 190/404 (47%), Gaps = 59/404 (14%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRS 82
+A +P+I++I++DD G+ND+ G+ I TP +D LA G L YV CTPSR+
Sbjct: 24 ASAADRPNILLIVSDDQGYNDLGQLGNGII-TPALDRLAKEGTRLTNFYVAWPACTPSRA 82
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLT-----------------EKLLPQYLKEAGYA 125
+L+TG+YP G+ + E +G T E +LP L+ AGY
Sbjct: 83 SLLTGRYPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYK 142
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+ GKW LG + + PT RGFD YG+ DY+ H E Y M N++
Sbjct: 143 SGIFGKWDLGALQRML-PTSRGFDDFYGFVNTGIDYFTH------ERYGVPCMVRNLEPT 195
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQAPDEEVA 240
G Y T L+ A+ + EH ++P FLY+ A H ++ +P QAPD+ A
Sbjct: 196 EADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKA 255
Query: 241 KF------------------LDISDPE--RRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
+ ++ P+ RR Y V+ +D ++G ++ L ML+N+
Sbjct: 256 MYPPVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDNT 315
Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
IV+F +DNG S N PLRG K+ W+GG+R + P K V + F
Sbjct: 316 IVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNNEFL 369
Query: 341 IS-DWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
S + LP+ AAG+E LDG N W L A++ R E+
Sbjct: 370 TSLELLPSFAIAAGVEPPPGVVLDGFNWWPTLRGEAESPRKEMF 413
>gi|72110136|ref|XP_780327.1| PREDICTED: arylsulfatase A-like [Strongylocentrotus purpuratus]
Length = 527
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 127/412 (30%), Positives = 196/412 (47%), Gaps = 38/412 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTGK 88
P+I+I+ ADD+G+ D+ +G TPN+ LA GL+ Q Y A +C+PSR+AL+TG+
Sbjct: 27 PNIVILFADDVGYGDLEVYGHPTSSTPNLTKLAEGGLVFTQFYAGAPVCSPSRAALLTGR 86
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTFRG 147
Y + G+ GV++ GLP E + + +K GY+T +GKWHLG Y PT +G
Sbjct: 87 YAVRSGIYPGVLVPTSTGGLPHNETTIAEIVKTRGYSTSMVGKWHLGVGENGTYLPTNQG 146
Query: 148 FDSHYGYWQGLQDYYDHS---CKATFEP-----------YQGLDMRHNMQVDNKTIGIYS 193
FD + G Y H C F P Y + +N + + + + +
Sbjct: 147 FDEYLGVP------YSHDMCPCYNCFYPNDPCWNTCDLNYTYCPLFNNTDILEQPVNLTT 200
Query: 194 TD-LYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
D Y + A +VI + N P FLY A H P +F + + R
Sbjct: 201 LDATYVQRAKSVIQRNANAGTPFFLYYAFQHAH---------HPQFASKEFRNAT--ARG 249
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
T+ ++ LD VG V+ L G+L+N++VLF +DNG PS + G+ L+ K T
Sbjct: 250 TFGDSLAELDWCVGQVMEQLESSGVLDNTLVLFTSDNG-PSLRNENRGGNAGLLKCGKGT 308
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
++GG R A + P + ++EL D+LPT+ G + +LDGV+ +L
Sbjct: 309 TYEGGQRVPAIAYWPQ-RIAPGRTTELAGNMDFLPTIANLVGATLPSVALDGVDMAPILF 367
Query: 372 KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
+G K+ R + NP+ A+R + K T SDE+ D D
Sbjct: 368 EGKKSARDTFFYYFTE-PNPKYGIYAVRYNQYKAHYYTQGQQNSDEFNHDKD 418
>gi|305665425|ref|YP_003861712.1| arylsulfatase A [Maribacter sp. HTCC2170]
gi|88710180|gb|EAR02412.1| arylsulfatase A (precursor) [Maribacter sp. HTCC2170]
Length = 489
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 132/410 (32%), Positives = 201/410 (49%), Gaps = 43/410 (10%)
Query: 6 KYFFALTCTLLF-------NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNI 58
++ F + LF ND + +P P++++I DD G+ DV GS I TP++
Sbjct: 13 RFQFLFVISFLFILLSCKDNDQKVTKKSPP-PNVVLIFTDDQGYQDVGVFGSPNIKTPHL 71
Query: 59 DALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQ 117
D +A G+ L +Y QA+C+ SR+ L+TG YP IG+ H + G G+ +E + +
Sbjct: 72 DRMASEGVKLTSYYSAQAVCSASRAGLLTGCYPNRIGI-HNALGPGNTHGINSSETTMAE 130
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
LK GY T GKWHLG E + P GFD +G + H + + L
Sbjct: 131 MLKNNGYKTAIFGKWHLGHHPE-FMPNKHGFDEFFGIPYSNDMWPYHPQQGPIFNFPDLP 189
Query: 178 MRHNMQV-DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
+ N V D T T TE +++ I NK P FLY+ H H P
Sbjct: 190 LYENETVIDTLTEQSNLTTQITERSVDFI-NRNKDNPFFLYVPHPQPHV---------PL 239
Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGI 295
KF SD R Y ++ +D SVG +I AL+K+G+ EN+IV+F +DNG S+G
Sbjct: 240 FVSEKFKGKSD--RGLYGDVIMEIDWSVGQIIEALKKNGLEENTIVIFTSDNGPWLSYGN 297
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP-WLKQTQKVSSELFHISDWLPTLCAAAGI 354
H+ GS P R K T W+GG R + P LK + + + I D LPTL +
Sbjct: 298 HA--GSALPFREGKGTAWEGGQREPFIMKYPNKLKAGEVIDVPMMAI-DLLPTLASLTNS 354
Query: 355 EINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQK-YYAALRVDDL 403
++ + S+DG N W++++ +++++PQK Y+ RV++L
Sbjct: 355 KLPELSIDGKNVWNIISG-------------ESIESPQKAYFFYYRVNEL 391
>gi|325110321|ref|YP_004271389.1| arylsulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970589|gb|ADY61367.1| Arylsulfatase [Planctomyces brasiliensis DSM 5305]
Length = 980
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 127/369 (34%), Positives = 187/369 (50%), Gaps = 38/369 (10%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSS-----QIPTPNIDALAYNGLILNQHYVQA-LCTP 79
A +P+I+I ADD G D F G TP++DALA G+ YV A +CTP
Sbjct: 21 AADQPNILIFFADDAGNADFGFQGGGINGDFANLTPHLDALAEGGVRFTNGYVSAAVCTP 80
Query: 80 SRSALMTGKYPIHIGMQ--HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
SR+ ++TG+Y G++ +G I E GLP +E + L++AGY T+AIGKWHLG
Sbjct: 81 SRAGMLTGRYQHRFGVETVYGRIPEA---GLPASEITMADTLRKAGYRTYAIGKWHLGEH 137
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-----GIY 192
+ P RGFD YG G + TF PY+G + +Q + + Y
Sbjct: 138 LHEHLPNQRGFDEFYGALTGAR---------TFFPYRGNNPGSKLQRNGVFLPEPLDQPY 188
Query: 193 STDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
TDL + I +H + + P FLYLA AVH +P + D + DIS P+
Sbjct: 189 FTDLLARQTVAYIDDHVANHANAPFFLYLAFTAVHTPLEADPKRLDDRRIQ---DISPPQ 245
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH---SNKGSNHPLR 306
R+T A M +D++VG V+A LR++ + EN+IV+F++DNG P N N L+
Sbjct: 246 RKTLAAMTIAMDDAVGTVMAKLRENNLTENTIVVFLSDNGGPEDNRSLRAPNWSDNGALQ 305
Query: 307 GMKSTPWDGGMRGVAAI-WSPWLKQ--TQKVSSELFHISDWLPTLCAAAGIEI-NDTSLD 362
G KS ++GG+R I W + + + K ++ D PT AA + D LD
Sbjct: 306 GNKSQNFEGGIRVPFVIHWPRGIARGLSGKSLPDVVTALDLFPTFAEAAKATLPKDRELD 365
Query: 363 GVNQWDVLT 371
G++ +L+
Sbjct: 366 GMSLASLLS 374
>gi|241708194|ref|XP_002413315.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215507129|gb|EEC16623.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 294
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 155/318 (48%), Gaps = 34/318 (10%)
Query: 213 PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALR 272
P+FLYLAH A H+GN + + P+ ++ DI R YAGMVS LDESVG V AL
Sbjct: 1 PLFLYLAHQAPHSGN-FGRLEVPETYTEQYRDIGSKNRTLYAGMVSALDESVGAVFEALD 59
Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
K GML NSIV+F +DNG N GS PLRG K + +GG+R A +WSP L T+
Sbjct: 60 KRGMLSNSIVVFSSDNGGAPDEETYNSGSPWPLRGQKMSLSEGGVRAPALLWSPLLSATE 119
Query: 333 -KVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDN 390
+ LFHI+DWLPT AG + ++DGV+Q LT+ R EIL NID
Sbjct: 120 GSIFDGLFHITDWLPTFYELAGGSPSSLGAIDGVSQLGALTRQTTAPRKEILINID---- 175
Query: 391 PQKYYAALRVDDLKYVAGTDNNGQSDEWY---GDTDNEIDKYSPKEVLYSKAGITFNALK 447
P++ +A+ + K V G G + WY G D E DK P++
Sbjct: 176 PEENCSAIIEGNFKLVLGVAEQGLYENWYPLSGHVDWESDK--PRQECEK---------S 224
Query: 448 TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
++ + H P S V+C + C T CLFN
Sbjct: 225 VVARVLEHHGVRPVCGSRTGAYA-------------TPVQCGARDPSRECVPTKSLCLFN 271
Query: 508 ITDDPCEQNNLAESQTDL 525
++ DPCE NN+A+ ++
Sbjct: 272 LSKDPCEYNNVADQHPEV 289
>gi|296124231|ref|YP_003632009.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296016571|gb|ADG69810.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 467
Score = 174 bits (441), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 139/426 (32%), Positives = 210/426 (49%), Gaps = 49/426 (11%)
Query: 9 FALTCTL-LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F T L L A TT+ ++P+I++IL+DD G + S G + TP+ID++ NG+
Sbjct: 17 FQFTSQLALAQRATTETTSERRPNILLILSDDCGHAEFSIQGHPRYKTPHIDSIGKNGVH 76
Query: 68 LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILE-GEPWGLPLTEKLLPQYLKEAGYA 125
Q YV +C+PSR+ L+ G+Y G + + E GLP +E LLPQ LKE GY
Sbjct: 77 FRQGYVSGCVCSPSRAGLLAGRYQQRFGHEFNIPPAYSETNGLPRSETLLPQLLKEDGYR 136
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
T A+GKWHLG+ + + P RGF +YG+ QG + Y+ +R +
Sbjct: 137 TIALGKWHLGYAPQFH-PMERGFTDYYGFLQGSRSYFPLKKPTRLNQM----LRDRTAIP 191
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
+ G Y TD + AI I + +S+P +YLA A H +P++ A L
Sbjct: 192 EEQFG-YMTDHLADEAIAYIKQW-QSQPWMMYLAFNATH---------SPNDATAVDLQA 240
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
+D + YA M LD +VG V+ AL++ G+ ++++V+F+ DNG G H N GS H
Sbjct: 241 AD-GNKIYA-MTIALDRAVGKVLDALKECGLSKDTLVIFINDNGGA--GGHDN-GSLH-- 293
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--------IEIN 357
G K + W+GG R + P + +V E D PT+ AG I +
Sbjct: 294 -GKKGSTWEGGTRIPFLVQYPAKIPSGQVIDEPVIALDLFPTILDVAGLGDAELKKIPFD 352
Query: 358 DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYA----ALRVDDLKYVAGTDNNG 413
LDG++ +T KT+R VD P + + A+R +LK V+G D+ G
Sbjct: 353 PEKLDGISLIPRMT--GKTQRL--------VDRPLYWKSGKRWAIRQGNLKAVSGNDDQG 402
Query: 414 QSDEWY 419
E +
Sbjct: 403 DQVELF 408
>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 616
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 126/382 (32%), Positives = 192/382 (50%), Gaps = 62/382 (16%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
++P++I IL DD+G ND+ + + TP +D+L+ G+ L ++Y LCTPSR++LMT
Sbjct: 118 QQPNVIFILIDDVGTNDIGYQSTDLWELTPFMDSLSSEGVRLTKYYTNQLCTPSRASLMT 177
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+ GMQ+ V+ + WGLPL E L + K GKWHLG + + + P R
Sbjct: 178 GRDTFRTGMQYEVVEDSGAWGLPLEEVTLAERFK--------TGKWHLGMYSDAHYPFAR 229
Query: 147 GFDSHYGYWQGLQDYYDH-SCKA-TFE----------PYQGLDMRHNMQVDNKT------ 188
GFD+ GY ++ Y H C TFE Y D N + N T
Sbjct: 230 GFDTFLGYMGAVRGYSSHEGCNTPTFEGGEYSCFKDFGYGDKDGYIN-HITNTTRQGPSF 288
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP------DEEVAKF 242
+G YST + T+ AI V EH + P FLY++H AVH+ P P +EE A
Sbjct: 289 VGNYSTTIITDRAIEVAKEHGE-DPFFLYVSHQAVHS-----PVDPPPDDYFTEEEHATL 342
Query: 243 LDI--SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNK 299
+ D R+ +A ++ LD+ + + L G L+N++++ +DNGA P+ G
Sbjct: 343 ARVVSEDAYRQRFARVILFLDKEMRRLHDELDALGALDNTVLVVASDNGACPTAG----- 397
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE--------LFHISDWLPTLCAA 351
GSN+PLRG K T ++GG+R A ++S K T + E + H +DW PT +
Sbjct: 398 GSNYPLRGYKHTIFEGGVRVPAFVYS---KSTDLIPEEARGTRYSGMMHSTDWTPTFGSV 454
Query: 352 A-GIEINDTS--LDGVNQWDVL 370
+ + + L G + WD +
Sbjct: 455 VPDLPLGEMGKELSGFDHWDAI 476
>gi|170726755|ref|YP_001760781.1| sulfatase [Shewanella woodyi ATCC 51908]
gi|169812102|gb|ACA86686.1| sulfatase [Shewanella woodyi ATCC 51908]
Length = 548
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 137/389 (35%), Positives = 190/389 (48%), Gaps = 57/389 (14%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
++T P+I++ILADD+G NDVS G I TPNID LA G L N + A C PSR
Sbjct: 56 DSTKTDLPNIVLILADDMGINDVSTFGGGMIETPNIDKLAAKGALFTNGYSGHANCAPSR 115
Query: 82 SALMTGK---------YPIHIGMQHGVIL------EGEP-----------------WGLP 109
+AL+TG+ PI GM + G P GLP
Sbjct: 116 AALLTGRDATRTGYDTTPIPDGMSRIIAAIENNEDNGRPEMSYSAEADATNPTYDNRGLP 175
Query: 110 LTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH--SCK 167
+E L+P+ LKE+GY T IGKWHLG E+ P +GFD GL DH S
Sbjct: 176 GSEILIPEILKESGYHTMHIGKWHLGRSPEMM-PNAQGFDESLMMDSGLYLPVDHPESVN 234
Query: 168 ATFEPYQGLD------MRHNMQVDNKTI---GIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
A E GLD MR+++ + I Y TD +TE A I E N ++P FLYL
Sbjct: 235 APVES-SGLDRFIWATMRYSVNWNGGEIFKPNGYLTDYFTEEAEKAI-EANANRPFFLYL 292
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
AH H P QA + DI +R YA M+ +D SV V+A L K G+ +
Sbjct: 293 AHWGPH-----NPVQAKRADYEAVGDIQPHNKRVYAAMLRSIDRSVERVMAKLEKQGIAD 347
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSE 337
N+IV+ +DNG + ++ N P RG K+T ++GG+R + W + ++ +
Sbjct: 348 NTIVILSSDNGGADYVAINDL--NKPYRGWKNTFFEGGIRVPFSVTWPNVIDESTVIEEP 405
Query: 338 LFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
+ HI D +PT+ A ++ D +DGV+
Sbjct: 406 VNHI-DLMPTIINMANADLPQDREIDGVD 433
>gi|432851909|ref|XP_004067102.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 1
[Oryzias latipes]
Length = 525
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 126/372 (33%), Positives = 177/372 (47%), Gaps = 26/372 (6%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
NT P+III+L DD+GW D+ G TPN+DA+A G++ Y LC+PSR
Sbjct: 24 NTPNTSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSR 83
Query: 82 SALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
+AL+TG+ PI G ++ + G+ E LLPQ LKE GY +GKWHL
Sbjct: 84 AALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHL 143
Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR----HNMQVDNKTIG 190
G R Y P GFD +G Y+++ + Y +M ++D KT
Sbjct: 144 GH-RPQYLPLENGFDEWFGAPNCHFGPYNNTVRPNIPVYNNSEMLGRYFEEFKIDKKTGE 202
Query: 191 IYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
T +Y EA ++ I+ ++K P FLY A A H+ P FL S +
Sbjct: 203 SNLTQMYLEAGLDFISRQAEAKRPFFLYWAADATHS---------PVYASKPFLGKS--Q 251
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
R Y V LD SVG ++ L++ G+ N+ V F +DNGA + GSN P K
Sbjct: 252 RGRYGDAVMELDYSVGRILTKLQRLGIDNNTFVFFTSDNGAALMSGPNEGGSNGPFLCGK 311
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWD 368
T ++GGMR A W P + VS + ++ D T A AGI D LDG++
Sbjct: 312 ETTFEGGMREPAIAWWPGRIKGGTVSFQPANVMDLFTTSLALAGIGPPGDRVLDGLDLTP 371
Query: 369 VLTKGAKTKRSE 380
VL + T +S
Sbjct: 372 VLFNHSHTLQSR 383
>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 489
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 128/404 (31%), Positives = 198/404 (49%), Gaps = 46/404 (11%)
Query: 4 ARKYFFALTCTLLF----NDAFLNTT------APKKPHIIIILADDLGWNDVSFHGSSQI 53
R ++F L + F N A T +P++P+I+ I ADDLG+ D+ F GS
Sbjct: 3 GRIFYFTLLIAVCFGSCKNTASKETVLHEEKHSPRQPNIVFIFADDLGFADLGFTGSDTH 62
Query: 54 PTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIG-MQHGVILE-GEPW---- 106
TPN+D LA + ++ Y C PSR ++MTGKYP +G + HG + P
Sbjct: 63 LTPNLDKLAKESVYFDRAYSSHPTCAPSRMSIMTGKYPARLGAVSHGKLGGVAHPGPNDN 122
Query: 107 GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSC 166
GLP+TE + + LK+ GY T IGKWH+G + P RGFD + C
Sbjct: 123 GLPMTETTIGEALKKEGYTTAHIGKWHIG--KGENNPGTRGFDVDIASNE-------FCC 173
Query: 167 KATFE-PYQGLDMRHNM-----QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
++ P++ + + + ++++ G + TD E A+ I ++ KP FL ++
Sbjct: 174 PGSYMYPFESNNEKQRVASKIPDLEDRKPGDFLTDALAEEAVKFIHSTDE-KPFFLNMSF 232
Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPERR----TYAGMVSRLDESVGNVIAALRKHGM 276
AVH P A E+V K+ + P+ R TYAG+V LD++VG ++ AL + G+
Sbjct: 233 YAVHT-----PITAIPEKVEKYKRLIGPDARQKNPTYAGLVEHLDDAVGAILKALEEKGI 287
Query: 277 LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
++N+I++F +DNG I N PLR K + ++GG R + P + Q VS
Sbjct: 288 IDNTIIVFTSDNGGE---ILHGITDNFPLRDGKGSSYEGGTRVPLLVKWPGVTQVNTVSH 344
Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
E D+ PT + AG + +DG + L K K K E
Sbjct: 345 ERIIGFDYYPTFLSMAGANPDLHDIDG-RDFSPLLKNPKEKLEE 387
>gi|320105193|ref|YP_004180784.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319752475|gb|ADV64235.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 481
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 129/393 (32%), Positives = 194/393 (49%), Gaps = 45/393 (11%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSALMTGK 88
PHI++I+ DDLG+ D+ +G+ I TP ID+LA +G L+ H +CTP+R+AL+TG+
Sbjct: 65 PHIVLIMTDDLGYADLGCYGAPDIATPRIDSLARDGARLSHFHSPGPVCTPTRAALLTGR 124
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+P +G++ + GLP+ E +L + LKE GY T +GKWHLG +R + P GF
Sbjct: 125 WPQRVGLEWALSASDTEPGLPVEEPILSRPLKEVGYRTVMVGKWHLG-YRPEFGPNAHGF 183
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI--YSTDLYTEAAINVIA 206
D +G G D+Y H + ++ + + + K + + YSTDL ++ A+ I
Sbjct: 184 DEFFGLLSGNVDHYSH---------REINGKEDWYENTKPVRVEGYSTDLLSDRAVAAIQ 234
Query: 207 E-----HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE------RRTYAG 255
+ ++ +P++LY+A+ AVH PFQ P + DI D R Y
Sbjct: 235 KTAAQPPDQRQPLWLYVAYNAVH-----WPFQPP----GRPQDIRDRSTWFNGTRADYVK 285
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
MV +D VG ++ AL GM ++++V+F DNG N P K T W+G
Sbjct: 286 MVESIDAGVGRILDALEAGGMADHTLVIFTNDNGG------ERLSDNGPFFHHKGTVWEG 339
Query: 316 GMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKG 373
G R I W + + + D T+ AAA I N + LDGVN +L
Sbjct: 340 GHRVPCLIRWPGRIAAGTEFGQPTIGM-DLTATILAAAKIPPNPERPLDGVNLVPILRGE 398
Query: 374 AKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
+ ID + QK AALR KYV
Sbjct: 399 VPCPDRALFWRIDRPNRKQK--AALR-GRWKYV 428
>gi|255690672|ref|ZP_05414347.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
gi|260623696|gb|EEX46567.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
Length = 459
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 120/377 (31%), Positives = 190/377 (50%), Gaps = 39/377 (10%)
Query: 6 KYFFALTCTLLFNDAF--LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
++F A TC L AF + A K+P+ +II+ADD+G+ DV +G+ I TPNID +A
Sbjct: 6 RHFVAATCALA---AFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEYIKTPNIDQIAR 62
Query: 64 NGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVIL----EGEPWGLPLTEKLLPQY 118
G++ H ++ +P+R L+TG+Y G++ +++ + + GLP E +
Sbjct: 63 EGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLEKVLLVPRDDKDKEVGLPSEEITFAKI 122
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLD 177
L + GY T IGKWHLG+ ++ + F GF G+ G DY H + + + GL+
Sbjct: 123 LGDNGYRTALIGKWHLGYLQKHHPMNF-GFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLE 181
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
M K + Y+T L T + + I E NK KP LY+AH A H+ P Q PDE
Sbjct: 182 M--------KDMSGYTTTLLTTLSEDYIKE-NKDKPFCLYIAHAAPHS-----PMQGPDE 227
Query: 238 EVAKFLDISDPERRT-------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
+ + + ++ + Y MV LD SVG ++ L+K+ + EN+ V+F +DNG
Sbjct: 228 KAVRTEATPEGDKNSDRSNKEIYKDMVEELDWSVGRILETLKKYKLDENTFVVFFSDNGP 287
Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
+ +N GS +G K +PW+GG R + P + + D PT+
Sbjct: 288 ----VINNGGSAGGYKGAKGSPWEGGHRVPGICYMPGTIKEGTTCEQTVMSFDLFPTMLD 343
Query: 351 AAGIEINDT--SLDGVN 365
A I +D+ LDG +
Sbjct: 344 MADIHYDDSKKKLDGTS 360
>gi|334145238|ref|YP_004538448.1| sulfatase family protein [Novosphingobium sp. PP1Y]
gi|333937122|emb|CCA90481.1| sulfatase family protein [Novosphingobium sp. PP1Y]
Length = 425
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 131/389 (33%), Positives = 187/389 (48%), Gaps = 34/389 (8%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
+ A + ++I+IL DDLG+ D+S G+ I TPNID +A G +Q Y A LCTPS
Sbjct: 1 MAANARRPRNVIVILCDDLGYGDLSITGARGIKTPNIDRMAREGRTFSQFYAAANLCTPS 60
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R+ L+TG+YP+ G+ VIL + LP +E +P LK A YAT GKWHLG
Sbjct: 61 RAGLLTGRYPVRTGLGDKVILYNDDRVLPTSEVTIPTALKTAEYATGLFGKWHLGHRGPD 120
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
+ PT GFD + G+ +D S G + QV + + ++ E
Sbjct: 121 WLPTHHGFDR----FVGIPYSHDMSPLVLVRAEAG---KEAHQVPTEITPL--QQIFCEE 171
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A I E N +P F+ LA A H P + P A F S Y V +
Sbjct: 172 AEQFITE-NAERPFFVELALSAPHL-----PCRPP----APFAGRSG--AGAYGDSVEEV 219
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK-STPWDGGMRG 319
D+ VG ++A LRK G+ +++VLF +DNG P + +GS+ PLR K +DG +
Sbjct: 220 DDKVGRILALLRKLGIERDTLVLFTSDNG-PWY-----EGSSGPLRQRKGGGGYDGASKV 273
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
WSP + D+LPT CA AG+ + + LDG + ++ GA+
Sbjct: 274 PMIAWSPGSVSASSHCDAIAMAIDFLPTCCALAGVPLPEVELDGKDISSLIFDGAEQSPH 333
Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAG 408
E L DN D A+R K+V+G
Sbjct: 334 EELVLFDNEDP-----VAIRTQHWKFVSG 357
>gi|403072042|pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
gi|403072043|pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
gi|403072044|pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
With Galnac
gi|403072045|pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
With Galnac
Length = 502
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/376 (31%), Positives = 180/376 (47%), Gaps = 25/376 (6%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
AP+ P+I+++L DD+GW D+ +G TPN+D +A GL+ Y L +PSR+AL
Sbjct: 1 APQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAAL 60
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P +E+LLP+ LK+AGY + +GKWHLG
Sbjct: 61 LTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH- 119
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 120 RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL 179
Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
T +Y + A++ I + P FLY A A HA P FL S +R Y
Sbjct: 180 TQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRY 228
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D+S+G ++ L+ + +N+ V F +DNGA GSN P K T +
Sbjct: 229 GDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTF 288
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTK 372
+GGMR A W P +VS +L I D T A AG+ +D ++DG+N L +
Sbjct: 289 EGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQ 348
Query: 373 GAKTKRSEILHNIDNV 388
G R + D +
Sbjct: 349 GRLMDRPIFYYRGDTL 364
>gi|301608268|ref|XP_002933701.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Xenopus
(Silurana) tropicalis]
Length = 520
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 121/385 (31%), Positives = 182/385 (47%), Gaps = 26/385 (6%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
+AL L L +P+I+++L DD+GW D+ G TPN+D +A G++
Sbjct: 6 MYALALGFLNPVLVLGGMNGTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMASEGML 65
Query: 68 LNQHYV-QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYL 119
Y LC+PSR+AL+TG+ PI G ++ + G+P +E L P+ L
Sbjct: 66 FPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNSHARNAYTPQEIMGGIPDSEVLFPELL 125
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM- 178
K+AGY IGKWHLG R Y P RGFD +G YD+ + Y+ DM
Sbjct: 126 KKAGYVNKIIGKWHLGH-RPSYHPLRRGFDEWFGSPNCHFGPYDNKQRPNIPVYRDWDMV 184
Query: 179 ---RHNMQVDNKTIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQA 234
+ +D+KT T +Y + A+ I + +P FLY + A HA
Sbjct: 185 GRYYEDFNIDHKTGESNLTQIYLQEALEFINRQAGDQQPFFLYWSIDATHA--------- 235
Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
P FL S R Y V +D S+G ++ L + +++++ V F +DNGA
Sbjct: 236 PVYASRPFLGTS--RRGLYGDAVREIDFSIGKILGLLNQKDIIKDTFVFFTSDNGAALIS 293
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
GSN P K T ++GGMR W P +VS +L I D T + AG+
Sbjct: 294 APMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIMDLFTTSLSLAGV 353
Query: 355 EI-NDTSLDGVNQWDVLTKGAKTKR 378
EI D ++DG++ V+ + +T+R
Sbjct: 354 EIPKDRAIDGIDLSPVILESKETER 378
>gi|338212321|ref|YP_004656376.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
gi|336306142|gb|AEI49244.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
Length = 446
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 126/360 (35%), Positives = 194/360 (53%), Gaps = 36/360 (10%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F C LLF + + + +KP+I++IL DD+GW +V +GS TP++D LA G+
Sbjct: 7 FIRHCCFLLFFTS--SAFSQQKPNIVMILVDDMGWGEVGIYGSHYCQTPHLDNLAKQGMR 64
Query: 68 LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEG--EPWGLPLTEKL-LPQYLKEAG 123
Y + +C+P+R+ALMTG+YP +G+ GVI WG + + LPQ LK AG
Sbjct: 65 FTNFYANSTVCSPTRAALMTGRYPDLVGVP-GVIRGNVENSWGYFSPKAVTLPQVLKTAG 123
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLD-MRHN 181
Y T +GKWHLG E + P GF +G+ + + DYY H +G + MR N
Sbjct: 124 YRTAMVGKWHLGLEPENH-PNRHGFTHFHGFLEDMMDDYYTHL-------REGKNWMRLN 175
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
QV + + ++TDL+T+ AI + K+ +P FLYLA+ A H P Q P E +
Sbjct: 176 GQVIDPS--GHATDLFTQWAIEYLNTQKKNPQPFFLYLAYNAPHF-----PVQPPREWLE 228
Query: 241 KFLDISDP----ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
K + +P +R ++ LDES+G V+ ALR++G +N++++F +DNG G+
Sbjct: 229 K-VKKRNPNLSEKRAKLTALIEHLDESIGKVMEALRQNGQADNTLIVFSSDNG----GLL 283
Query: 297 SNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
S++ N +G K T ++GG+R A+W + K + D PT C AAG +
Sbjct: 284 SDEADNGIWKGGKQTMYEGGIRVPTIAVWKNKIIPGTKTDFRALTM-DLFPTFCEAAGTK 342
>gi|313232711|emb|CBY19381.1| unnamed protein product [Oikopleura dioica]
Length = 453
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 144/493 (29%), Positives = 221/493 (44%), Gaps = 73/493 (14%)
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
MTG+YP+ G+Q GVI EP G+PL E LLPQ +E GY+T GKWHLG++ + + P
Sbjct: 1 MTGRYPMRYGLQRGVIRPDEPEGIPLREILLPQGFRECGYSTLMSGKWHLGYYTKAHRPQ 60
Query: 145 FRGFDSHYGYWQGLQDYYDH-----SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
RGFD YG++ G QD+Y H S A+F +G + + + G YST L +
Sbjct: 61 NRGFDRFYGFYVGSQDFYFHRTGFKSDGASFWSEEGYSTGEEI-IRHDAWGHYSTTLLVD 119
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP--DEEVAKFLDISDPERRTYAGMV 257
AI + N P+F YL+ HA P Q P E+ K + ER+ + V
Sbjct: 120 DAIKQLRNRNPGNPLFHYLSIQDPHA-----PEQRPLRLEQSIKHFALRG-ERKIFVTKV 173
Query: 258 SRLDESVGNVIAALRKHGM--LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
+D +G + L++ + N+IV+F +DNG P SN PLRG K +DG
Sbjct: 174 KSMDLEIGRYLDVLKRIYIPDYNNTIVIFSSDNGGP----RKLGASNWPLRGGKGCIFDG 229
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI--NDTSLDGVNQWD-VLTK 372
R A I P +K K +FH DW+PTL ++ + LDGV+ +L
Sbjct: 230 AFRSRALIHIPGMK--HKNYPHMFHAVDWMPTLMEMLSCKLPKDHLPLDGVSHMKTILGM 287
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPK 432
+ R E+L +ID + QK + R D Y+
Sbjct: 288 TDRAPREEMLLDIDPEEVLQKTASDAR--DFSYL-------------------------- 319
Query: 433 EVLYSKAGITFNALKTKLQIKQ-KHAADPKANSSDALRTILTDEKILELREFARVRCNYD 491
K+G + ++ L+ K+ K P + + + ++ + NY
Sbjct: 320 ----KKSGFD-SRMRAALRYKEWKIITGPPCSGTKC--GVQSENR------------NYY 360
Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
+ + + L+NIT DP E +L++ ++ K L KL++Y ++ DK+
Sbjct: 361 KAEEYVSKSRSVRLYNITADPRESYDLSDKYPEVTKALLLKLSVYHEEQSLFMDRMRDKK 420
Query: 552 ADPARWNNIWVPW 564
A P + W PW
Sbjct: 421 ALPETSEHTWFPW 433
>gi|329744562|ref|NP_001193258.1| N-acetylgalactosamine-6-sulfatase precursor [Bos taurus]
gi|296478055|tpg|DAA20170.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Bos taurus]
Length = 522
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 120/367 (32%), Positives = 176/367 (47%), Gaps = 26/367 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
A + P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL
Sbjct: 26 ALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAAL 85
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P +E LLP LK AGYA+ +GKWHLG
Sbjct: 86 LTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH- 144
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 145 RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL 204
Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T +Y + A+ I + +P FLY A A HA P FL S +R
Sbjct: 205 TQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA---------PIYASKPFLGTS--QRGR 253
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y + LD+SVG ++ LR + EN+ V F +DNGA GSN P K T
Sbjct: 254 YGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALISAPRQGGSNGPFLCGKQTT 313
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
++GGMR A W P +VS +L I D T + AG+E D ++DG++ +
Sbjct: 314 FEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPRDRAIDGLDLLPAML 373
Query: 372 KGAKTKR 378
+G T R
Sbjct: 374 QGRLTDR 380
>gi|340377674|ref|XP_003387354.1| PREDICTED: arylsulfatase A-like [Amphimedon queenslandica]
Length = 522
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 125/397 (31%), Positives = 188/397 (47%), Gaps = 42/397 (10%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
A KP+III+ ADDLG+ D+ +G TPN++++A GL+L Q Y +C+PSR+AL
Sbjct: 21 AVNKPNIIILFADDLGYGDLQIYGHPTSSTPNLNSMASEGLVLTQFYSANPVCSPSRAAL 80
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTP 143
MTG+YP G+ GV+ GLP E + + LK AGY T +GKWHLG Y P
Sbjct: 81 MTGRYPPRTGIWPGVLNPNSIGGLPHNETTIAEALKTAGYKTGMVGKWHLGVGENGTYLP 140
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-------VDNKTIGIY-STD 195
T GFD Y+ G+ Y + C T Y ++ V ++ +TD
Sbjct: 141 TNHGFD----YYLGIP-YSNDECPCTICYYPNAPCNSPLKIPCNPSFVSCPVCPVFENTD 195
Query: 196 L-------------YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
+ Y+ AA+ I + P FLY+A VH +P Q DE+
Sbjct: 196 IIEQPAHLPTLSTKYSNAALGFIRANAGKNPFFLYMAFTHVH-----QP-QFSDEDYTNT 249
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
R + ++ LD VG +++ L+ + +N+ V F +DNG PS + G+
Sbjct: 250 T-----LRGPFGDALAELDNEVGLIMSTLKDTQVDDNTFVFFASDNG-PSLRVQQQGGNA 303
Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD 362
LR K + WDGGMR A W P K Q ++EL D PT+ + G + +D
Sbjct: 304 GLLRCGKGSTWDGGMREPAIAWWPG-KIRQGKTAELAATVDLFPTIMSITGAKPGSAFID 362
Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
G++ +L + R ++ + +P Y A+R
Sbjct: 363 GIDMSPILFDSKPSNRESYVY-FPAIYDPNIGYHAIR 398
>gi|372221524|ref|ZP_09499945.1| sulfatase [Mesoflavibacter zeaxanthinifaciens S86]
Length = 461
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 119/375 (31%), Positives = 194/375 (51%), Gaps = 35/375 (9%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
N+ ++P+I++++ADD GWNDV +HGS +I TP +D+LA NG L + YV C+PSR+
Sbjct: 29 NSLPKEQPNILLVIADDAGWNDVGYHGS-KIKTPVLDSLANNGAKLERFYVAPTCSPSRA 87
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
AL+TG +G+ + + + LP + LP+ +K+ GY T GKWHLG E
Sbjct: 88 ALLTGIPASRLGIVAPIAGKSK-IALPDSLVTLPKAMKKLGYRTALFGKWHLGLTPE-NG 145
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ--GLDMRHNMQVDNKTIGIYSTDLYTEA 200
P GFD+ YG+ G D Y H K + G ++ + V TDL T+A
Sbjct: 146 PQAYGFDTSYGFLHGQIDQYTHEYKNGDPSWHKNGKFLKEDGHV---------TDLLTDA 196
Query: 201 AINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-SDPERRTYAGMVS 258
AI + K++ P F+ LA+ A H P Q + + I +D R YA ++
Sbjct: 197 AIAYFNQETKTETPSFVTLAYSAPHF-----PLQEEERYKRPYRTIFTDSSRVDYAAAMT 251
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-----------PSFGIHSNKGSNHPLRG 307
LD + G ++ +L++ G L+N+++LFM+DNGA G ++ G N PL+
Sbjct: 252 HLDIAFGRLLNSLKRSGKLDNTLILFMSDNGAMKNWYPKDQYQGKHGPNTTLGDNTPLKD 311
Query: 308 MKSTPWDGGMRGVAAIWSPWLKQ-TQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQ 366
K+T ++G +R A ++ W K + + D LPTL G + +++G +
Sbjct: 312 YKTTNYEGAIRVPAFVY--WKNHFNGKTLDDFIGVIDVLPTLVELTGASVKPNTIEGSSF 369
Query: 367 WDVLTKGAKTKRSEI 381
++L ++TK ++
Sbjct: 370 LNLLNGASQTKIPQL 384
>gi|320102821|ref|YP_004178412.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319750103|gb|ADV61863.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 480
Score = 172 bits (437), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 122/371 (32%), Positives = 184/371 (49%), Gaps = 41/371 (11%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L TL + ++ +P+I++ILADDLGW D+S++G Q TP++D LA G ++
Sbjct: 17 LAATLGGEASGVHARETDRPNIVLILADDLGWGDLSYNGRIQWSTPHLDDLAARGARFDR 76
Query: 71 HYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPW----GLPLTEKLLPQYLKEAGYA 125
Y +C PSR+AL+TGK PIH G W LP E + + LK GY
Sbjct: 77 FYTAGVVCAPSRAALLTGKNPIHCGT----------WRNDHDLPAEEVTIAEALKAKGYH 126
Query: 126 THAIGKWHLGF-FREVYT---PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
T GKWH G RE T P +GFD +G+ + P + R
Sbjct: 127 TALYGKWHQGRPTREGGTRTHPLDQGFDETFGFLSATHAWEKF-------PNHLFNGRDR 179
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
++V G Y+ D +T+ + IA H + P FLYL + H +AP++E+A
Sbjct: 180 VEVPQ---GRYADDWFTDKGVEFIARHKDDPTPFFLYLPLTSTHFH-----IEAPEDEIA 231
Query: 241 KFLD-ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG----I 295
K SDP + YA +V+R+D VG ++ A+R G+ ++++++F +D+GA G
Sbjct: 232 KHRSTFSDPIQTAYAAIVTRMDAQVGRIVQAVRDAGLEDSTLIVFTSDHGATFEGGSKEA 291
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
S+ SNHP RG K T ++GG+R + P + ++ S D LPT AG
Sbjct: 292 SSSLDSNHPFRGQKRTVYEGGIRVPGIMTYPGVIPAGRIVSTPVQTIDLLPTFLELAGGR 351
Query: 356 IN-DTSLDGVN 365
D +DG+N
Sbjct: 352 PEPDWKVDGLN 362
>gi|285808548|gb|ADC36070.1| sulfatase [uncultured bacterium 213]
Length = 478
Score = 172 bits (437), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 121/361 (33%), Positives = 186/361 (51%), Gaps = 30/361 (8%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
TAP+ P+I+ I+ADDLG+ DVS +G + TPN+D +A G+ Q Y A+C+ +R+A
Sbjct: 34 TAPRPPNIVFIMADDLGYADVSCYGRPDLNTPNVDRVALKGVRFLQAYANSAVCSATRTA 93
Query: 84 LMTGKYP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
L+TG+Y + IG++ + + G GLP LP L++AGY T +GKWHLG + +
Sbjct: 94 LITGRYQYRLPIGLEEPLGI-GRDVGLPPEHPTLPSLLRKAGYRTTLLGKWHLGALPK-F 151
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
P G+D YG+ G DYY H+ + + V + G Y TDL A
Sbjct: 152 GPLQSGYDHFYGFRGGSVDYYTHAGPDQRD------DLWDDDVPLRQSG-YLTDLLGSRA 204
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER-------RTYA 254
++VI ++ S FL +++H + P++AP +E S R +TY
Sbjct: 205 VDVINGYSHSDRPFL----VSLHFSAPHWPWEAPGDEAESNRIRSTTLRHYDGGTQKTYQ 260
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
M+ ++D +G V+ AL +G +N+IV+F +DNG F P G K+ +
Sbjct: 261 RMIEQMDLQIGRVLGALDANGATDNTIVVFTSDNGGERF------ADTWPFTGRKTELLE 314
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKG 373
GG+R A I P + + ++ DWLPTL AAAG+ + DG+N +LT
Sbjct: 315 GGLRIPALISWPARIRQGATTDQVAMSMDWLPTLLAAAGVAPDRAFPPDGLNLLPLLTAN 374
Query: 374 A 374
A
Sbjct: 375 A 375
>gi|325108489|ref|YP_004269557.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324968757|gb|ADY59535.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 455
Score = 172 bits (437), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 117/346 (33%), Positives = 180/346 (52%), Gaps = 29/346 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
++P++++I +DD G D +GS + TP ID LA G+ Q Y +C+ SR+ L+T
Sbjct: 30 ERPNVLVIYSDDQGTIDAGCYGSDDLTTPAIDHLAETGIRFTQMYAPSCICSASRAGLLT 89
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+ P+ G+ V + G+P +E L K+AGY T +GKWHLG + P +
Sbjct: 90 GRLPVRAGVPANVSSKKGAEGMPASEVTLGDVFKQAGYRTAHVGKWHLG-YTPATMPLGQ 148
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
GFDS +G+ G D Y H F + G + RH++ D K + G + DL +
Sbjct: 149 GFDSSFGHMGGCIDNYSH-----FFYWNGPN-RHDLWRDGKEVFHDGQFFPDLLVDEMQR 202
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
++ E ++ +P+F+Y A H P+Q D+ + D+ P RR YA VS LDE
Sbjct: 203 LLTEKSE-QPLFVYWAINVPHY-----PYQGTDKWRNYYADLETP-RREYAAFVSTLDEK 255
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPS-----FGIHSNKGSNHPLRGMKSTPWDGGMR 318
+G V+A L + G EN+IV+F +D+G + FG GS P RG K ++GG+R
Sbjct: 256 LGEVLATLEETGQRENTIVVFQSDHGHSTEERAFFG----GGSAGPYRGAKGCLFEGGIR 311
Query: 319 GVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
+ I W L Q + V ++ DWLPTL ++++ LDG
Sbjct: 312 VPSVISWPAGLPQGE-VRDQMAVGCDWLPTLVELTKVDVDLPLLDG 356
>gi|410631133|ref|ZP_11341814.1| arylsulfatase B [Glaciecola arctica BSs20135]
gi|410149335|dbj|GAC18681.1| arylsulfatase B [Glaciecola arctica BSs20135]
Length = 454
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 125/367 (34%), Positives = 185/367 (50%), Gaps = 35/367 (9%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
T+ P+I+IILADD+G+ D+S +GS +I TP ID LA G Q Y A +C+PSR+
Sbjct: 32 TSAVPPNIVIILADDMGFGDLSINGSGKIKTPAIDQLARTGFNFTQFYASANICSPSRAG 91
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
LMTG+YPI + + VI G+ GLP TE L + A Y T IGKWHLG F E + P
Sbjct: 92 LMTGRYPIRTSLGYNVITAGDTHGLPKTEDTLAELALRADYRTMLIGKWHLGRFPE-HAP 150
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
GF++ YG HS A F Y G D V+ T+ T YTE ++
Sbjct: 151 MKHGFETFYG--------VPHSNDMANFALYSG-DNIVEQPVEQSTL----TLRYTEKSV 197
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+ I + N P L+++H H P A ++ K + Y +V LD
Sbjct: 198 SFI-DANAKHPFLLFVSHTMPHI-----PLYASEDFAGK------SDAGVYGDVVEELDY 245
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
S G ++AAL+++G+ +NSI++ +DNG P F +G+ L+G K + ++G R
Sbjct: 246 STGKIVAALKRNGIFDNSIIIITSDNG-PFF-----EGATAGLKGGKGSTYEGAYRVPLI 299
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
I P V+ + D LPT+ A + + +DG + +L +GA+ E L
Sbjct: 300 ISWPAGIANPAVTDAIAMNIDILPTIAEAIESKPAASIIDGRSLLPLL-RGAEKSPHEYL 358
Query: 383 HNIDNVD 389
+ +N D
Sbjct: 359 YYFNNED 365
>gi|421613760|ref|ZP_16054831.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495346|gb|EKJ99933.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 485
Score = 172 bits (435), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 127/404 (31%), Positives = 190/404 (47%), Gaps = 59/404 (14%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRS 82
+A +P+I++I +DD G+ND+ G+ I TP +D LA G L YV CTPSR+
Sbjct: 24 ASAADRPNILLIASDDQGYNDLGQLGNGII-TPALDRLAKEGTRLTNFYVAWPACTPSRA 82
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLT-----------------EKLLPQYLKEAGYA 125
+L+TG+YP G+ + E +G T E +LP L+ AGY
Sbjct: 83 SLLTGRYPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAMLRPAGYK 142
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+ GKW LG + + PT RGFD YG+ DY+ H E Y M N++
Sbjct: 143 SGIYGKWDLGALQRML-PTSRGFDDFYGFVNTGIDYFTH------ERYGVPCMVRNLEPT 195
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQAPDEEVA 240
G Y T L+ A+ + EH ++P FLY+ A H ++ +P QAPD+ A
Sbjct: 196 EADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKA 255
Query: 241 KF------------------LDISDPE--RRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
+ ++ P+ RR Y V+ +D ++G ++ L ML+ +
Sbjct: 256 MYPPVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDET 315
Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
+V+F +DNG S N PLRG K+ W+GG+R + P K V ++ F
Sbjct: 316 MVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNDEFL 369
Query: 341 IS-DWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
S + LP+ AAAG+E LDG N W L A++ R E+
Sbjct: 370 TSLELLPSFAAAAGVEPPPGVVLDGFNWWPTLRGEAESPRKEMF 413
>gi|418530519|ref|ZP_13096442.1| sulfatase [Comamonas testosteroni ATCC 11996]
gi|371452238|gb|EHN65267.1| sulfatase [Comamonas testosteroni ATCC 11996]
Length = 440
Score = 172 bits (435), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 126/365 (34%), Positives = 180/365 (49%), Gaps = 36/365 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
P+I+ ILADDLGW D+ +G++ TP++D LA G+ NQ Y A+C+ +R AL+TG+
Sbjct: 4 PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63
Query: 89 YP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
Y + G++ + G GLP LP LK+AGY T IGKWHLG Y P
Sbjct: 64 YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQLS 122
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
G+D +G G DY+ H +P G ++ N+ + G Y+ L EA
Sbjct: 123 GYDYFFGNHSGAIDYFTH------KPGVGEQFSSDLWQGNEPVERTGYYTYILGDEATRY 176
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-----VAKFLDISDPERRTYAGMVS 258
V + KP FL L A H P++ PD+E + + Y +V
Sbjct: 177 VHERKGQDKPFFLSLHFTAPHW-----PWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVE 231
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LD++VG V+ AL G +N+IV+F +DNG F P G K+ +GG+R
Sbjct: 232 ALDKAVGQVLQALDDSGQADNTIVIFTSDNGGERF------SKTWPFTGQKTELLEGGIR 285
Query: 319 GVAAI-WSPWLK-QTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT-KGA 374
+ WS ++ Q Q+ + F DWLPTL AAAG + D DG N +L A
Sbjct: 286 VPTLLRWSARIQPQVQEQVTASF---DWLPTLLAAAGARQHPDFPSDGQNILPILEGHAA 342
Query: 375 KTKRS 379
+T+RS
Sbjct: 343 QTERS 347
>gi|296122982|ref|YP_003630760.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296015322|gb|ADG68561.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 466
Score = 172 bits (435), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 127/349 (36%), Positives = 168/349 (48%), Gaps = 27/349 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP+I++I ADDLG+ D+ +GS I TP++D +A GL Y A +CTPSR+AL+TG
Sbjct: 35 KPNIVVIFADDLGYGDLGCYGSPTIRTPHLDEMAAEGLRFTDFYSAAEVCTPSRAALLTG 94
Query: 88 KYPIHIGM--QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
+ PI GM V+ GLP E + + LKE GYAT IGKWHLG P
Sbjct: 95 RLPIRSGMCGARRVLFPNSKGGLPPAEITIAEALKEKGYATAQIGKWHLG-IHPGSRPLD 153
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD------MRHNMQVDNKTIGIYSTDLY 197
+GFD +G Y + D +T P LD +R+ V+ T Y
Sbjct: 154 QGFDQSFGLPYSNDMDARADLPKGSTGSPNPPLDGWNVALLRNGEVVEQPANQTTLTKRY 213
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
TE AI I E K+ P FLY+ H H P A + K Y V
Sbjct: 214 TEEAIKFITE-KKNVPFFLYMPHTFPHV-----PMFASQDFKGK------SRAGIYGDAV 261
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LD SVG V+ AL + G+ EN++V F +DNG P + GS L+ K + W+GGM
Sbjct: 262 EELDWSVGQVLGALCREGIAENTLVFFSSDNG-PWLIMGDQGGSAGLLKDGKGSTWEGGM 320
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
R W P + V+S+L D PT A AG + D DGV+
Sbjct: 321 RVPGIAWMP-SRIKPGVTSQLASAMDVFPTALALAGASLPKDVVFDGVD 368
>gi|149175125|ref|ZP_01853748.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
8797]
gi|148846103|gb|EDL60443.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
8797]
Length = 413
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 117/344 (34%), Positives = 179/344 (52%), Gaps = 35/344 (10%)
Query: 36 LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSALMTGKYPIHIG 94
+ADDLG+ D+S +GS TP++D LA NG+ H A+C+P+R+ L+TG+Y G
Sbjct: 1 MADDLGYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAG 60
Query: 95 MQHGVIL----EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
+ GV+ + GL E L Q L++AGY T GKWHLG+ R+ Y PTFRGF
Sbjct: 61 ID-GVVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGYQRQ-YNPTFRGFQQ 118
Query: 151 HYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK 210
GY G DY+ H + D HN +++ + G Y T L + A+ I + +
Sbjct: 119 FVGYVSGNVDYFAH-----LDGTGVFDWWHNAELNREEQG-YVTHLINDHALEFIRQQ-Q 171
Query: 211 SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK---FLDISDPERR----TYAGMVSRLDES 263
KP F+Y+AH AVH+ P+Q P ++ + DI +R+ Y M + +D+
Sbjct: 172 EKPFFVYIAHEAVHS-----PYQGPHDQPMRKEGGGDIKSAKRKDIANAYREMNTEMDKG 226
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA-A 322
+G ++ L++ + E + + F++DNGA + GSN LRG K + W+GG R A A
Sbjct: 227 IGQIVDVLKEVNLTEKTFIFFLSDNGA------NKNGSNGKLRGFKGSLWEGGHRVPAIA 280
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
W + + + I D +PT+ A +I LDGV+
Sbjct: 281 CWPGRIPEGTVCDEPVISI-DLMPTILELANAKIPAGHKLDGVS 323
>gi|299530773|ref|ZP_07044188.1| sulfatase [Comamonas testosteroni S44]
gi|298721289|gb|EFI62231.1| sulfatase [Comamonas testosteroni S44]
Length = 440
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 124/364 (34%), Positives = 179/364 (49%), Gaps = 34/364 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
P+I+ ILADDLGW D+ +G++ TP++D LA G+ NQ Y A+C+ +R AL+TG+
Sbjct: 4 PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63
Query: 89 YP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
Y + G++ + G GLP LP LK+AGY T IGKWHLG Y P
Sbjct: 64 YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQLS 122
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
G+D +G G DY+ H +P G ++ N+ + G Y+ L EA
Sbjct: 123 GYDYFFGNHSGAIDYFTH------KPGVGEQFSSDLWQGNEPVERTGYYTYILGDEATRY 176
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-----VAKFLDISDPERRTYAGMVS 258
V + KP FL L A H P++ PD+E + + Y +V
Sbjct: 177 VHERKGQDKPFFLSLHFTAPHW-----PWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVE 231
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LD++VG V+ AL G +N+IV+F +DNG F P G K+ +GG+R
Sbjct: 232 ALDKAVGQVLQALDDSGQADNTIVIFTSDNGGERF------SKTWPFTGQKTELLEGGIR 285
Query: 319 GVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT-KGAK 375
+ WS ++ +V ++ DWLPTL AAAG + D DG N +L A+
Sbjct: 286 VPTLLRWSARIQ--PQVQEQITASFDWLPTLLAAAGARQHPDFPSDGQNILPILEGHAAQ 343
Query: 376 TKRS 379
T+RS
Sbjct: 344 TERS 347
>gi|432851911|ref|XP_004067103.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 2
[Oryzias latipes]
Length = 523
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 122/357 (34%), Positives = 171/357 (47%), Gaps = 26/357 (7%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
NT P+III+L DD+GW D+ G TPN+DA+A G++ Y LC+PSR
Sbjct: 24 NTPNTSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSR 83
Query: 82 SALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
+AL+TG+ PI G ++ + G+ E LLPQ LKE GY +GKWHL
Sbjct: 84 AALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHL 143
Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR----HNMQVDNKTIG 190
G R Y P GFD +G Y+++ + Y +M ++D KT
Sbjct: 144 GH-RPQYLPLENGFDEWFGAPNCHFGPYNNTVRPNIPVYNNSEMLGRYFEEFKIDKKTGE 202
Query: 191 IYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
T +Y EA ++ I+ ++K P FLY A A H+ P FL S +
Sbjct: 203 SNLTQMYLEAGLDFISRQAEAKRPFFLYWAADATHS---------PVYASKPFLGKS--Q 251
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
R Y V LD SVG ++ L++ G+ N+ V F +DNGA + GSN P K
Sbjct: 252 RGRYGDAVMELDYSVGRILTKLQRLGIDNNTFVFFTSDNGAALMSGPNEGGSNGPFLCGK 311
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVN 365
T ++GGMR A W P + VS + ++ D T A AGI D LDG++
Sbjct: 312 ETTFEGGMREPAIAWWPGRIKGGTVSFQPANVMDLFTTSLALAGIGPPGDRVLDGLD 368
>gi|430746414|ref|YP_007205543.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018134|gb|AGA29848.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 590
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 179/354 (50%), Gaps = 44/354 (12%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
A + P++++ILADD GW D+S HG++ + TPNID+LA +G + + YV +C P+R+ +
Sbjct: 20 AVEPPNVVVILADDQGWGDLSVHGNTNLKTPNIDSLARDGALFERFYVCPVCAPTRAEFL 79
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG+Y G++ GV GE L L EK + + K AGYAT A GKWH G + Y P
Sbjct: 80 TGRYHPRGGVR-GVTSGGE--RLDLNEKTIAETFKSAGYATGAFGKWHNG-TQFPYHPNA 135
Query: 146 RGFDSHYGYWQG-LQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFD +YG+ G +Y+D + P QG + TD T+ AI+
Sbjct: 136 RGFDEYYGFTSGHWGEYFDPPLEHNGRPVQG--------------NGFITDDLTDHAISF 181
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF------LDISDPER------RT 252
I + +K +P F YL H+ P Q PD KF L D ER R
Sbjct: 182 I-KASKDRPFFCYLPFNTPHS-----PMQVPDRFYDKFKNAALKLRAHDAEREDLMMTRA 235
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
M +DE+VG V+ L + + +IVL+ +DNG S+ N +RG K +
Sbjct: 236 ALAMCENIDENVGRVLKTLEDLSLDQKTIVLYFSDNGPNSWRW------NGGMRGRKGST 289
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
+GG+R I P + + +++ D LPTL AGI + +D LDGV+
Sbjct: 290 DEGGVRSPLLIRWPKQIRPETRVAKISAAIDLLPTLTDLAGIPVKSDKPLDGVS 343
>gi|325109524|ref|YP_004270592.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969792|gb|ADY60570.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 486
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 125/362 (34%), Positives = 184/362 (50%), Gaps = 36/362 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
+KP+II+ILADDLG+ D++ +G+ I TP ID +A G+ N YV A+C+P+R++LMTG
Sbjct: 28 EKPNIIVILADDLGYGDLACYGAKDIATPAIDRMATEGVKCNSFYVSAVCSPTRASLMTG 87
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+ I +G+ GV+ GL E LP+ LK+ GYAT IGKWHLG +++ P G
Sbjct: 88 SHSIRVGI-GGVMFPRNNHGLNPDEITLPELLKDQGYATAIIGKWHLG-NEDMFQPMNHG 145
Query: 148 FDSHYG--------YWQGLQDY---------YDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
FD YG Y+ ++ Y Y + T E +R N+ ++
Sbjct: 146 FDYWYGTPASNNQFYYPTIKKYAADCVFREGYTRNGILTRETAACPLIRDNVVIEVPADQ 205
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
T YT I I E N +P F+YLAH H P A + +F+ SD R
Sbjct: 206 SQFTQRYTRETIRFITE-NHEQPFFIYLAHNMPHI-----PLHASE----RFVGSSD--R 253
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
Y + LD S G ++ AL++ G+ N++V+F +DNG P+ G GS PL+G K
Sbjct: 254 GIYGDTIQELDWSTGEILRALKELGLDSNTLVIFTSDNG-PNTG---KGGSAGPLKGGKG 309
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDV 369
+ ++GG+R P S E D PTL AG ++ +D +DG + +
Sbjct: 310 STYEGGVRVPFVARWPGAIPAGIESDEAMTGMDLPPTLTRLAGGDVPDDRVIDGKDISPL 369
Query: 370 LT 371
LT
Sbjct: 370 LT 371
>gi|32475143|ref|NP_868137.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
gi|32445684|emb|CAD78415.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
Length = 485
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 187/404 (46%), Gaps = 59/404 (14%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRS 82
+A +P+I++I++DD G+ND+ G+ I TP +D LA G L YV CTPSR+
Sbjct: 24 ASAADRPNILLIVSDDQGYNDLGQLGNGII-TPALDRLAKEGTRLTNFYVAWPACTPSRA 82
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLT-----------------EKLLPQYLKEAGYA 125
+L+TG+YP G+ + E +G T E +LP L+ AGY
Sbjct: 83 SLLTGRYPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYK 142
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
+ GKW LG + + PT RGFD YG+ DY+ H E Y M N++
Sbjct: 143 SGIFGKWDLGALQRML-PTSRGFDDFYGFVNTGIDYFTH------ERYGVPCMVRNLEPT 195
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQAPDEEVA 240
G Y T L+ A+ + EH ++P FLY+ A H ++ P QAPD+ A
Sbjct: 196 EADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLVPTIRSSVQAPDQFKA 255
Query: 241 KF------------------LDISDPE--RRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
+ ++ P+ RR Y V+ +D ++G ++ L ML+ +
Sbjct: 256 MYPPVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEILDRLEAKQMLDET 315
Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELF 339
IV+F +DNG S N PLRG K+ W+GG+R + W V+ E
Sbjct: 316 IVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGQIPAGVVNDEFL 369
Query: 340 HISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
+ LP+ AAAG+E LDG + W L A++ R E+
Sbjct: 370 TSLELLPSFAAAAGVEPPPGVVLDGFDWWPTLRGEAESPRKEMF 413
>gi|325110588|ref|YP_004271656.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324970856|gb|ADY61634.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 458
Score = 171 bits (433), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 121/375 (32%), Positives = 181/375 (48%), Gaps = 32/375 (8%)
Query: 3 WARKYFFALTCTLLFNDAFLNTTAPKKPH--IIIILADDLGWNDVSFHGSSQIPTPNIDA 60
W + F L T + T A K H +++I+ADDLG+ D+S +G + TP +D
Sbjct: 8 WFTRCFLLLAVTTVALTVSQGTAAEKPSHRNVVVIMADDLGYGDLSCYGCERYQTPFLDQ 67
Query: 61 LAYNGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVILEG-EPWGLPLTEKLLPQY 118
+A G+ + H +C+P+R+AL+TG+Y G+ + G GLPL +
Sbjct: 68 MAAEGMKWSDFHSNGNVCSPTRAALLTGQYQQRWGVDRVITAAGHRDRGLPLEAVTFAET 127
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
+ GYAT GKWH G+ + + P +GFD GY G D+ H +A D
Sbjct: 128 FSQHGYATAIYGKWHQGY-QARFNPVRQGFDHFVGYVSGNVDFQSHIDQAG-----QADW 181
Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
HN ++ + Y T L T+ A+ I E NK +P LYL H H P+Q P++
Sbjct: 182 WHNDELTEEAG--YVTHLITQYAVKFI-EDNKEQPFCLYLPHETPHY-----PYQGPNDL 233
Query: 239 V-----AKFLD--ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
KF++ D + YA M+ +D +G V+ L+KHG+ E ++VLF +DNG
Sbjct: 234 ADRTVGGKFVNHGSRDDKAAAYAEMMIEMDRGIGEVLQTLQKHGLAEQTLVLFFSDNGGT 293
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
S G N LRG K + W+GG R W+P + +L D PTL
Sbjct: 294 SL------GDNGGLRGTKGSDWEGGHRVPMIAWNPETVPAGRDCDQLAITHDIFPTLADY 347
Query: 352 AGIEI-NDTSLDGVN 365
A + + D SLDG++
Sbjct: 348 ADLPLPKDQSLDGIS 362
>gi|351729940|ref|ZP_08947631.1| sulfatase [Acidovorax radicis N35]
Length = 446
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 129/397 (32%), Positives = 193/397 (48%), Gaps = 36/397 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
+ P+I+ ILADDLGW D+S +G + TP++DALA G+ Q Y A+C+ +R AL+T
Sbjct: 7 QPPNIVFILADDLGWADLSVYGQADFSTPHLDALAGEGVRFTQAYANSAVCSATRFALIT 66
Query: 87 GKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
G+Y + G++ ++ + GLP LP L+ AGY T IGKWHLG + P
Sbjct: 67 GRYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRNAGYDTALIGKWHLGSL-PTFGPL 125
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGIYSTDLYTEAA 201
G++ +G + G DY+ H +P G + ++ +V + +G Y+ L EA+
Sbjct: 126 KSGYNRFFGNYGGAIDYFTH------KPGVGEAVPRDLYEGEVPVERVGYYTQLLADEAS 179
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVA----KFLDISDPERRTYAGM 256
+ SKP FL L A H P+ P DEEV+ TY M
Sbjct: 180 GWLRGRSAGSKPFFLSLHFTAPHW-----PWVGPEDEEVSLGLKDLFHYDGGNLHTYGKM 234
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V LD +VG V+ AL++ G+ +N+IV+F +DNG F P G K+ +GG
Sbjct: 235 VRSLDGAVGQVLHALQEQGLADNTIVVFTSDNGGERF------SKTWPFTGQKTELLEGG 288
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAK 375
+R + P + +VS ++ DWLPTL AAAG+ + D DG N VL A+
Sbjct: 289 IRVPTLLRWP-ARIAPQVSDQVTATMDWLPTLLAAAGVAPDADYPSDGENILPVLEGHAR 347
Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
+ + + A+R KY+ DN
Sbjct: 348 VHPRTLFWRYKS-----QRQRAVREGRFKYLKINDNE 379
>gi|346225028|ref|ZP_08846170.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
gi|346226998|ref|ZP_08848140.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
Length = 488
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 181/374 (48%), Gaps = 34/374 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP+ I+I ADDLG+ D+ G I TPN+D +A+ G YV A +CTPSR+ L+TG
Sbjct: 35 KPNFIVIFADDLGYGDLGIFGHPTINTPNLDRMAFEGQKWTNFYVAAPVCTPSRAGLLTG 94
Query: 88 KYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+ PI GM + V+ GLP +E + + LKE GY T AIGKWHLG + + PT
Sbjct: 95 RLPIRSGMCSDKRRVLFPDSNGGLPQSEITIARALKENGYQTAAIGKWHLG-HKSPFLPT 153
Query: 145 FRGFDSHYGY----------WQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIG 190
GFDS++G G +Y K F Y +R++ ++
Sbjct: 154 DHGFDSYFGIPYSNDMDRVPGTGTGNYITSQIKLAEEENFHAYNVPILRNDKIIERPADQ 213
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
T YTE A+ I + K P F+YLAH H P +F D+S
Sbjct: 214 RTITKRYTEEAVAKI-KSMKDGPFFIYLAHNLPH---------IPLFRSEEFKDVS--LA 261
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
Y ++ +D SVG ++ L + G+ EN++V+F +DNG P ++ G+ LRG K
Sbjct: 262 GFYGDVIEEIDWSVGQILKTLEEEGLDENTLVVFTSDNG-PWHIFETHGGTAGLLRGAKG 320
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL-DGVNQWDV 369
++GGMR W P K Q V ++ D PT C +G ++ D + DG + +
Sbjct: 321 GTFEGGMREPTIFWWPG-KLKQGVVMDMATTMDLFPTFCNLSGTKLPDDRIYDGYDISPL 379
Query: 370 LTKGAKTKRSEILH 383
L K+ R + +
Sbjct: 380 LLGTGKSPRDVVFY 393
>gi|315644664|ref|ZP_07897795.1| sulfatase [Paenibacillus vortex V453]
gi|315279923|gb|EFU43222.1| sulfatase [Paenibacillus vortex V453]
Length = 439
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 111/373 (29%), Positives = 195/373 (52%), Gaps = 27/373 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K+P+ +I+ DDLG+ D+ +GS + TP++D LA G+ Y + +C+PSR++L+T
Sbjct: 2 KQPNFVIVYCDDLGYGDLGCYGSDSVRTPHLDGLADEGVRFTNWYSNSPVCSPSRASLLT 61
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
GKYP+ G+ + + GLP E L + LK AGY T GKWHLG +E +P
Sbjct: 62 GKYPVRAGVGEILGAKRGSHGLPAAEVTLAKALKPAGYRTALYGKWHLGLSKET-SPNAH 120
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
GFD +G+ G D+Y H + G++ H++ + + G Y T+L TE +++
Sbjct: 121 GFDEFFGFKAGCVDFYSHIF--YWGQGHGVNPLHDLWENETEVWENGRYMTELITERSVD 178
Query: 204 VIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
I + + P FL+ ++ A H P AP E + +F + +R+ A M++ +D+
Sbjct: 179 FIKRSREQEAPFFLFASYNAPHY-----PMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDD 232
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH--------SNKGSNHPLRGMKSTPWD 314
VG ++ AL++ G E++++ F +DNG S + GS RG K++ ++
Sbjct: 233 GVGEIVEALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDIYYGGSAGIFRGHKASLFE 292
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-----INDTSLDGVNQWDV 369
GG+R A + P + +V E+ + D +PT AG++ + +LDG + ++
Sbjct: 293 GGIREPAILNWPNGWEGGQVRDEVAAMMDIVPTFLDLAGVDLAAGPLQGITLDGSSLKEM 352
Query: 370 LTKGAKTKRSEIL 382
L +G + ++
Sbjct: 353 LFRGKPSPHKQLF 365
>gi|114326198|ref|NP_001041585.1| N-acetylgalactosamine-6-sulfatase precursor [Canis lupus
familiaris]
gi|122138594|sp|Q32KH5.1|GALNS_CANFA RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|81158068|tpe|CAI85008.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Canis lupus
familiaris]
Length = 522
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 179/376 (47%), Gaps = 26/376 (6%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
P+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+
Sbjct: 27 PQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 86
Query: 86 TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
TG+ PI G ++ + G+P E +LP+ LKEAGY + +GKWHLG R
Sbjct: 87 TGRLPIRNGFYTTNRHARNAYTPQEIVGGIPDQEHVLPELLKEAGYVSKIVGKWHLGH-R 145
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
+ P GFD +G YD+ + Y+ +M ++ KT T
Sbjct: 146 PQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLT 205
Query: 195 DLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
+Y + A++ I + +P FLY A A HA P FL S +R Y
Sbjct: 206 QVYLQEALDFIKRQQAAQRPFFLYWAIDATHA---------PVYASRPFLGTS--QRGRY 254
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D SVG +++ L+ + EN+ V F +DNGA + GSN P K T +
Sbjct: 255 GDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALISAPNQGGSNGPFLCGKQTTF 314
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
+GGMR A W P +VS +L I D T + AG+ +D +DG++ +
Sbjct: 315 EGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRVIDGLDLLPAMLG 374
Query: 373 GAKTKRSEILHNIDNV 388
G T R + D +
Sbjct: 375 GQLTDRPIFYYRGDTL 390
>gi|332663784|ref|YP_004446572.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332332598|gb|AEE49699.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 580
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 138/461 (29%), Positives = 206/461 (44%), Gaps = 103/461 (22%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F A + + P KP+II+I+ DDLG++D +GS +I TPNID LAY GL
Sbjct: 11 FLAFSASAQKKGKTTPAPTPSKPNIILIMVDDLGYSDFGAYGS-EIQTPNIDKLAYGGLR 69
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQ-------YLK 120
L + Y ++C P+R++L+TG+YP G+ + GLP + L Q L+
Sbjct: 70 LKEFYNNSICAPTRASLITGQYPHKAGLGY----FNTNLGLPAYQGWLNQESLTFGEVLQ 125
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
+ GY T+ GKWH+G +Y P RGF+ YG+ G +YYD S P +++
Sbjct: 126 QGGYNTYLTGKWHVG-NDSLYWPNQRGFNKFYGFIGGASNYYDISPYPEKAP--PVELVE 182
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
N Q N G Y TD T A++ I E +K KP FLYLA A H P QAP E++A
Sbjct: 183 NNQRINLAPGKYLTDEITNHALSYINE-SKDKPFFLYLAFNAPHW-----PLQAPAEDIA 236
Query: 241 KF---------------------LDISDPERR---------------------------T 252
K+ L I+DP ++
Sbjct: 237 KYKGKYKIGWDSLRAQRYRRQLALGIADPRQKVAERDPEVVAWESLTFDEKELWQRKMEV 296
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS--------NKG---- 300
YA MV +D+SVG V+ L+K +N++++ ++DNGA G++ N G
Sbjct: 297 YAAMVDHVDQSVGRVVEELKKLKKDDNTLIVLISDNGAQG-GLNQLGQRRRQRNSGPIGS 355
Query: 301 --------------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLP 346
SN P R K+T +GG+ W P + ++ H+ D P
Sbjct: 356 AGSYDYQEQNWAYVSNTPFRNYKATSHEGGISSPFIAWFPKKIKAGTITKGTGHLIDLAP 415
Query: 347 TLCAAAGIE-------INDTSLDGVNQWDVLTKGAKTKRSE 380
T AG+ + +L GV+ ++L +G R+E
Sbjct: 416 TFYDVAGVNYPTNYRGVQTNTLPGVSLKNLLFEGNDLVRNE 456
>gi|410907337|ref|XP_003967148.1| PREDICTED: arylsulfatase A-like [Takifugu rubripes]
Length = 501
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 124/423 (29%), Positives = 195/423 (46%), Gaps = 47/423 (11%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCT 78
L + P+ +++ ADDLG+ D+ +G TPN+D LA GL Y + +C+
Sbjct: 13 VLLGSCTSSPPNFVLLFADDLGFGDLGCYGHPTSLTPNLDRLAAGGLRFTDFYCTSPVCS 72
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF-F 137
PSR++L+TG+Y G+ GV+ G GLPL E + + LK GYAT A+GKWHLG
Sbjct: 73 PSRASLLTGRYQTRSGVYPGVLYPGSRGGLPLNETTIAEVLKPLGYATAAVGKWHLGIGA 132
Query: 138 REVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
+ PT +GFD + G Y + + +C GL V T+ + +
Sbjct: 133 NGTFLPTRQGFDQYLGIPYSHDMGPCQNLTCFPPDVKCFGL-----CDVGTVTVPLMYNE 187
Query: 196 LYTEAAINVIAEHN---------------KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ + +N + N K +P FLY H Y F P
Sbjct: 188 VIKQQPVNFLDLENAYRDFASDFISTSAKKRQPFFLYFPSHHTH----YPQFAGPGAAGM 243
Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
R + + LD ++G+++ L G+L N+++LF +DNG P S G
Sbjct: 244 SL-------RGPFGDALLELDNTIGSLLETLEGTGVLNNTLILFTSDNG-PELMRMSRGG 295
Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK-VSSELFHISDWLPTLCAAAGIEINDT 359
+ PLR K T ++GGMR A + W Q V+ E+ D LPT+ + AG ++
Sbjct: 296 NAGPLRCGKGTTYEGGMREPAIAY--WRGIIQPGVTHEMASTLDILPTIASLAGAKLPQV 353
Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYA--ALRVDDLK---YVAGTDNNGQ 414
LDGV+ ++L K+KR ++ NP + Y A+R++ K Y G ++G
Sbjct: 354 MLDGVDMSNILFSQGKSKRETMMF---YPTNPSETYGVFAIRLEKYKAHFYTQGASHSGT 410
Query: 415 SDE 417
+ +
Sbjct: 411 TPD 413
>gi|404447701|ref|ZP_11012695.1| sulfatase [Indibacter alkaliphilus LW1]
gi|403766287|gb|EJZ27159.1| sulfatase [Indibacter alkaliphilus LW1]
Length = 486
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 183/350 (52%), Gaps = 28/350 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
K P+++II DD G+NDV GS I TPN+D +A G+ + YV QA+C+ SR+AL+T
Sbjct: 39 KLPNVVIIFTDDQGYNDVGVFGSKDISTPNLDNMAKEGVRFSNFYVAQAVCSASRAALLT 98
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G Y +G+ HG + GL E + + LK GY T GKWHLG E + P +
Sbjct: 99 GTYSNRLGI-HGALDHSSNHGLNPDETTIAEMLKPLGYQTAIFGKWHLGHHPE-FLPVQQ 156
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEP----YQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
GFD ++G + +H + P Y+ ++ + D + T + TE ++
Sbjct: 157 GFDEYFGIPYSNDMWPNHPETKNYYPPLPLYENDEVVDTLHHDQSML----TTVLTEKSL 212
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+ I E NK P FLYLAH H P D+ K ER Y ++ +D
Sbjct: 213 DFI-ERNKETPFFLYLAHPMPHV-----PLFVSDKFEGK------SERGLYGDVIMEIDW 260
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
SVG V+ +L+ G+ EN++V+F++DNG S+ +H+ GS +PL+ K T WDGG++ +
Sbjct: 261 SVGQVLESLKSKGLEENTLVIFLSDNGPWLSYSLHA--GSAYPLKEGKGTSWDGGVKIPS 318
Query: 322 AI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
+ W + + + ++ I D LPT+ G ++ + +DG + W ++
Sbjct: 319 IMQWKGKIPEGKVQTNPAMSI-DILPTIAEITGAKLPELPIDGKSIWKMI 367
>gi|432862125|ref|XP_004069735.1| PREDICTED: arylsulfatase A-like [Oryzias latipes]
Length = 500
Score = 171 bits (432), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 124/408 (30%), Positives = 195/408 (47%), Gaps = 42/408 (10%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCT 78
+ L + P+ +++ ADDLG+ D+ G TPN+D LA GL Y + +C+
Sbjct: 12 SLLCVISASPPNFVLLFADDLGFGDLGCFGHPSSLTPNLDRLAAGGLRFTDFYCTSPVCS 71
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF-F 137
PSR++L+TG+Y G+ GV+ G GLPL E + + LK GYAT A+GKWHLG
Sbjct: 72 PSRASLLTGRYQTRSGVYPGVLYPGSIGGLPLNETTIAEVLKPLGYATAAMGKWHLGLGA 131
Query: 138 REVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEP-----------YQGLDMRHNMQV 184
+ PT +GFD + G Y + ++ +C F P + + HN +
Sbjct: 132 NGTFLPTRQGFDQYLGIPYSHDMGPCHNLTC---FPPDVKCFGFCDLGTVTVPLMHNEVI 188
Query: 185 DNKTIGIYSTD-LYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
+ + + Y++ A N IA +K +P FLY H Y + P
Sbjct: 189 KQQPVDFLDLERAYSDFATNFIASSASKKQPFFLYYPSHHTH----YPQYAGPGAAGHSL 244
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
R + + +D ++GN++A L + G++ N++V F +DNG P + GS
Sbjct: 245 -------RGPFGDALMEMDNTIGNLMATLEQTGVINNTLVFFTSDNG-PELMRRARGGSA 296
Query: 303 HPLRGMKSTPWDGGMR--GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
L+ K T ++GGMR G+ A W +K V+ E+ D LPT A AG ++
Sbjct: 297 GLLKCGKGTTYEGGMREPGI-AYWPGTIK--PGVTHEMASTLDILPTFAALAGAKLPPVM 353
Query: 361 LDGVNQWDVLTKGAKTKRSEIL-HNIDNVDNPQKYYAALRVDDLKYVA 407
LDGV+ D+L K+KR ++ + +D P + Y + KY A
Sbjct: 354 LDGVDMMDILFNQGKSKREAMMFYPVD----PTEQYGLFAIRFGKYKA 397
>gi|255038246|ref|YP_003088867.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254951002|gb|ACT95702.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 481
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 138/420 (32%), Positives = 202/420 (48%), Gaps = 61/420 (14%)
Query: 5 RKYFFA---LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
RKY LT + L A P++P+I+ ILADDLG+ DV F+G I TPNID L
Sbjct: 2 RKYLLLIPLLTSSFLTQRADAQAPKPQRPNIVFILADDLGYGDVGFNGQKLIKTPNIDKL 61
Query: 62 AYNGLILNQHYV-QALCTPSRSALMTGKYPIH--IGMQHGVILEGEPWGLPLTEKL--LP 116
A G+I NQ Y ++C PSRS+L+TG++ H I GV EG+ P+ + + L
Sbjct: 62 AKEGMIFNQFYAGTSVCAPSRSSLLTGQHTGHTYIRGNKGVEPEGQ---QPIADSVTTLA 118
Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYY------DHSCKAT 169
+ LK++GY T A GKW LG P +GFD YGY Q L Y D+S K
Sbjct: 119 EVLKKSGYVTAAFGKWGLGPVGSEGDPNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKIL 178
Query: 170 FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
E +GL + NK Y+ DL + A++ + + +P FL+L ++ HA
Sbjct: 179 LEGNKGL-------IHNKE---YAPDLIQKKALSFVNAQDGKQPFFLFLPYILPHA---- 224
Query: 230 EPFQAPDEEVAKFLDISDPER-----------------------RTYAGMVSRLDESVGN 266
PD+ + ++ E+ T+A MV+RLD VG
Sbjct: 225 -ELVVPDDSLFRYYKGKFEEKPHKGADYGPGANGGGYASQDFPHATFAAMVARLDLYVGQ 283
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GSNHPLRGMKSTPWDGGMRG-VAAI 323
V+ AL+K G+ +N++V+F +DNG G + S RG+K ++GG+R AA
Sbjct: 284 VMNALKKKGLDKNTLVIFTSDNGPHVEGGADPRFFNSGAGFRGVKRDLYEGGIREPFAAR 343
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
W +K K S + D LPT A ++DG++ D L A K+ + L+
Sbjct: 344 WPAAIKPGSK-SDYIGAFWDILPTFAELANAPA-PRNIDGISFTDALKGKAIQKKHDYLY 401
>gi|296122901|ref|YP_003630679.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296015241|gb|ADG68480.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 519
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 172/335 (51%), Gaps = 24/335 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+P+II+++ DD G+ D+S HG+ + TP++D L + Q +V C P+R+++MT +
Sbjct: 45 RPNIILMMTDDQGYGDLSLHGNPVVKTPHLDQLGRQSVRFEQFHVSPTCAPTRASIMTSR 104
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
+ G+ H ILE E L L +LPQ+LK AGY T GKWHLG + Y P RGF
Sbjct: 105 HEFSSGVTH-TILERE--RLSLKATILPQFLKRAGYTTGIFGKWHLG-DEDAYQPGKRGF 160
Query: 149 DS---HYGYWQGLQDYYDHSC-KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
D H G G+ Y SC A Y +RHN + Y T ++ + AI
Sbjct: 161 DEVFIHGG--GGIGQSYPGSCGDAPLNKYFNPVIRHNGKF--VATNGYCTKVFVDQAITW 216
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I+ ++P F Y+ A HA P P E +L+ + + GM++ D+ +
Sbjct: 217 ISSQPDNQPFFCYITPNAPHA-----PLDCPKEYYEPYLEHVPEDVARFYGMITHWDDQL 271
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G ++ AL + +++IV+FM DNG+ + H + G +R K TP++GG+R V A W
Sbjct: 272 GRLLKALEDRDISKDTIVIFMTDNGSATGAKHFSAG----MRANKGTPYEGGIR-VPAFW 326
Query: 325 SPWLKQTQ-KVSSELFHISDWLPTLCAAAGIEIND 358
S W Q +V E+ D LPTL A + + D
Sbjct: 327 S-WAGHWQPQVRQEVTCHYDILPTLTELANVPVAD 360
>gi|149177363|ref|ZP_01855968.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148843888|gb|EDL58246.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 466
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 180/343 (52%), Gaps = 32/343 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K+P+I++I AD+LG+ D+ +G+ + TP +D LA G+ L Y + CT SR+ L+T
Sbjct: 33 KRPNILLITADNLGYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLT 92
Query: 87 GKYPIHIGMQHGVIL-EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
G+YP IG+ H + E GL +E L+P+YLK+ GY T GKW++G F PT
Sbjct: 93 GRYPQRIGLNHQLSADENYGDGLRKSEVLIPEYLKQQGYRTACFGKWNVG-FSPGSRPTE 151
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI--YSTDLYTEAAIN 203
RGFD +G+ G DYY H Y G RH++ K + + YSTDL+ +AA
Sbjct: 152 RGFDEFFGFAAGNIDYYHHY-------YAG---RHDLWRGLKEVFVEGYSTDLFADAACQ 201
Query: 204 VIAEHNKSKPMFLYLAHLAVH--AGNTYEP-----FQAPDEEVAKF---LDISDPERRTY 253
I+ + +P F+YL A H + +P +QAPD K+ +P+ R Y
Sbjct: 202 YISAES-DQPFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQER-Y 259
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---GSNHPLRGMKS 310
+V+ LD ++G V+ L G+ + +IV++ +DNGA F + SN PLR
Sbjct: 260 RAVVTALDSAIGRVLKQLDTSGLRDQTIVIWYSDNGA--FMLKERGLEVASNKPLRDGGV 317
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
T W+GG+R A I P + V+ D LPTL AG
Sbjct: 318 TLWEGGIRVPAIIRYPGHLKAGTVNQSPLISLDILPTLITLAG 360
>gi|374374259|ref|ZP_09631918.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
gi|373233701|gb|EHP53495.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
Length = 476
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 139/405 (34%), Positives = 196/405 (48%), Gaps = 46/405 (11%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
FFA TLL A KP+I++ILADDLG+ DV +G +I TP+IDALA GL
Sbjct: 7 FFASFLTLLIRLP-TRLAAQTKPNIVLILADDLGYGDVGVYGQQKIKTPHIDALAKEGLR 65
Query: 68 LNQHYV-QALCTPSRSALMTGKYPIHIGMQH--GVILEGEPWGLPLTEKLLPQYLKEAGY 124
+ Y A+C PSRS+L+TG++ H ++ GV EG+ W LP + LK AGY
Sbjct: 66 FTRFYCGTAVCAPSRSSLLTGQHTGHTPIRGNIGVKPEGQ-WPLPGNTVTIATLLKSAGY 124
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGY-WQGL-QDYY-DHSCKATFEPYQGLDMRHN 181
+T GKW LGF P +GFD YGY Q L DY+ DH + R
Sbjct: 125 STGCFGKWGLGFIGTSGDPLKQGFDRFYGYNCQTLAHDYFPDHLWNN--------EQRVE 176
Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG----------NTY-- 229
+ ++ IYS L E A+ I +H + +P FL LA+ HA N Y
Sbjct: 177 LSNTHEKQTIYSAQLIHEQAMQFIRQHQR-QPFFLLLAYTLPHAALQLPKDDTLYNYYKK 235
Query: 230 ----EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
+P + K + YA MV +LD VG V L+ G+ +N+++LF
Sbjct: 236 LFNEKPVPVAEHWNGKGYEPQAYPHAAYAAMVGKLDAYVGEVQQQLKVLGLDKNTMILFA 295
Query: 286 ADNGA-------PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
+DNG P+F +SN G LRG+K ++GG+R W P Q S+++
Sbjct: 296 SDNGPHREGGNDPAF-FNSNGG----LRGIKRDLYEGGIRSPFIAWWPGTIQAGTQSNQV 350
Query: 339 FHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
D LPT A AGI + + DG++ L K ++ + L+
Sbjct: 351 GAFWDLLPTFAALAGIGMPANN-DGISMLPTLLARGKQQQHDHLY 394
>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
queenslandica]
Length = 523
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 182/368 (49%), Gaps = 27/368 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
+P P++I +L DD+GW D+ +G TPN+D +A G++L Y LC+PSR+A+
Sbjct: 29 SPTTPNMIFMLMDDMGWGDLGVYGHPVKETPNLDKMALEGMLLPDFYSANPLCSPSRAAM 88
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P +E L P+ L++ GYAT IGKWHLG
Sbjct: 89 LTGRLPIRNGFYTTNAHARNAYTPQDIVGGIPDSEILYPELLQKNGYATMIIGKWHLG-Q 147
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR----HNMQVDNKTIGIYS 193
+ Y P GFD +G +D + Y+ M + +++KT
Sbjct: 148 QTHYHPLKHGFDEFFGSTNCHFGPFDGKEQPNMPVYRNATMAGRYYQDFPINHKTGESNL 207
Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T YT+ AI I ++ + KP FLY A H P FL S +R
Sbjct: 208 TVEYTQEAIKFINKNAANKKPFFLYWTPDATH---------TPLFASKDFLGTS--QRGL 256
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y V LD SVG ++ L+ G+ ++++V+F +DNG ++ S GSN P K T
Sbjct: 257 YGDAVKELDYSVGQILDTLKTLGIDKDTLVIFSSDNGGATYAKESG-GSNSPFLCGKETT 315
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLT 371
++GGMR W P Q +VS +L + DW T AGI E +D +DG++ +
Sbjct: 316 FEGGMREPTIAWWPGTIQPGQVSHQLGSLMDWYSTALDLAGISEPSDRIIDGISLAPLFK 375
Query: 372 KGAKTKRS 379
KG +T R+
Sbjct: 376 KGQETNRT 383
>gi|301788958|ref|XP_002929896.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like, partial
[Ailuropoda melanoleuca]
Length = 519
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 119/378 (31%), Positives = 178/378 (47%), Gaps = 26/378 (6%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-V 73
+L F PK P+I+++L DD+GW D+ +G TPN+D +A G++ Y
Sbjct: 12 VLSAAGFGAMGTPKPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSA 71
Query: 74 QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
LC+PSR+AL+TG+ PI G ++ + G+P E LLP+ LK AGYA+
Sbjct: 72 NPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAGYAS 131
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNM 182
+GKWHLG R + P GFD +G YD+ + Y+ +M
Sbjct: 132 KIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEF 190
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
++ +T T +Y + A++ + + +P FLY A HA P
Sbjct: 191 PINLQTGEANLTQVYLQEALDFMKRQQVAQRPFFLYWAIDGTHA---------PVYASRP 241
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
FL S R Y V +D+SVG +++ LR + EN+ V F +DNGA GS
Sbjct: 242 FLGTS--RRGRYGDAVREIDDSVGKILSLLRDLRITENTFVFFTSDNGAALISAPKQGGS 299
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTS 360
N P K T ++GGMR A W P +VS +L + D T A AG+ +D
Sbjct: 300 NGPFLCGKQTTFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLALAGLAPPSDRV 359
Query: 361 LDGVNQWDVLTKGAKTKR 378
+DG++ + +G T R
Sbjct: 360 IDGLDLLPAMLRGQLTDR 377
>gi|223936271|ref|ZP_03628184.1| sulfatase [bacterium Ellin514]
gi|223895133|gb|EEF61581.1| sulfatase [bacterium Ellin514]
Length = 488
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 131/403 (32%), Positives = 202/403 (50%), Gaps = 32/403 (7%)
Query: 1 MTWARKYFFALTCTLLFNDAFLNTT---APKKPHIIIILADDLGWNDVSFHGSSQIPTPN 57
+ W+ F L L +DA +T AP++P+II+ILADDLG+ D+ +G +QI TPN
Sbjct: 13 LLWSMVSFSGLLT--LTSDAQTSTNRPPAPRRPNIILILADDLGYGDLGCYGQTQIKTPN 70
Query: 58 IDALAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHG--VILEGEPWGLPLTEKL 114
ID LA +G+ Y +C PSR+ LMTGK H+ ++ + L GE E
Sbjct: 71 IDKLAEDGMKFTSFYAGSTVCAPSRATLMTGKNTGHVNIRGNADLSLNGE-------ELT 123
Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ--DYY-DHSCKATFE 171
+ + LK AGYAT IGKW LG P +GFD + GY +Q DYY H ++ +
Sbjct: 124 IAKILKLAGYATGCIGKWGLGNEGSPGLPGRQGFDEYLGYLDQVQAHDYYPTHLFRSDSK 183
Query: 172 PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI-----AEHNKSKPMFLYLAHLAVHAG 226
+ D+K G+YS D +T++A+N + ++ NK + FLYL + HA
Sbjct: 184 GEESKIALTENDADHK--GLYSNDFFTQSALNYLRINKPSKLNKHRSFFLYLPYTLPHAN 241
Query: 227 NTYEPFQAPDEEVAKFLDISDPE----RRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
N EV ++ + + A M++RLD VG ++ L+K + EN++V
Sbjct: 242 NELGNRTGNGMEVPSTEPYTNEQWPQVEKNKAAMITRLDHYVGEIMDYLKKSKLDENTVV 301
Query: 283 LFMADNGAPSFGIHSNK--GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
+F +DNG G + K S LRG+K ++GG+R + P + +S
Sbjct: 302 IFASDNGPHKEGGVNPKYFNSAGGLRGIKRDLYEGGIRVPFIVRWPARVKAGSISDAPLA 361
Query: 341 ISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
D+LPT A + T++DG++ L A+T R + L+
Sbjct: 362 FWDFLPTAAEIARTS-SPTNIDGISFLPTLLGKAQTNRHQYLY 403
>gi|449692603|ref|XP_004213101.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 181
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 90/181 (49%), Positives = 116/181 (64%), Gaps = 4/181 (2%)
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
WHLGFF + YTPT+RGFDS+YG + G DY++HS E Y GLD+ N G
Sbjct: 1 WHLGFFAKQYTPTYRGFDSYYGSYLGKGDYWNHSNT---ETYSGLDLHDNENGVFSQDGN 57
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE-PFQAPDEEVAKFLDISDPER 250
YST++YT AI+ I HN S+P+FLYLA+ AVH+ NT E P QAP E + KF I +R
Sbjct: 58 YSTEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQR 117
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
R YA M+ +D VG V AL + ML+NSI++F DNG P+ G N +N PLRG+K+
Sbjct: 118 RKYAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKA 177
Query: 311 T 311
T
Sbjct: 178 T 178
>gi|325109525|ref|YP_004270593.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969793|gb|ADY60571.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 453
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 131/416 (31%), Positives = 206/416 (49%), Gaps = 60/416 (14%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F L +LF L A ++P+I+ IL DD G+ D+ +GS++I TP+ID LA G+
Sbjct: 15 FETLLSLVLFAGGQL-AAADQRPNIVFILTDDQGYGDLGCYGSTEIATPHIDRLAAEGMK 73
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
YV C+P+R+A MTG + +G+ VI + GL E + + L+ AGY T
Sbjct: 74 FESFYVHNRCSPTRAAFMTGCHAQRVGLGK-VIYRKDKTGLSADEITVAELLQGAGYRTG 132
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-N 186
+GKWHLG + E ++P GFD+ YG+ + +D + + + R ++ D +
Sbjct: 133 IVGKWHLGEY-EQFSPVHHGFDTFYGFME-----WDDNRTTAID-----NNRTIVERDVS 181
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
KT G++S L A I I E N+ +P FLY A H ++P + +F S
Sbjct: 182 KTDGVHSPKLLA-AGIEFITE-NQDRPFFLYYASPLPH--TRWKPHE-------RFAGSS 230
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
E+ TY +V +D VG ++ L K G+ +N++V+F +DNG P + + GS PLR
Sbjct: 231 --EQGTYGDVVQEIDWQVGELMQTLDKLGLTDNTLVVFASDNG-PQLNVEGH-GSAGPLR 286
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
K T ++GG+R + P + V++E+ I D LPT C AG+++ +D +DG +
Sbjct: 287 DGKWTNFEGGIRVPCLMRWPSVVPAGSVNNEIVGIIDMLPTFCEIAGVDVPSDRVIDGRS 346
Query: 366 ----------------------------QWDVLTKGAK--TKRSEILHNIDNVDNP 391
QW +L KGAK +S+ + D V P
Sbjct: 347 ILPYMRGESVHPPIHDTFVVPGATIRHQQWKLLIKGAKPGGSKSDRVGKTDRVPAP 402
>gi|423293951|ref|ZP_17272078.1| hypothetical protein HMPREF1070_00743 [Bacteroides ovatus
CL03T12C18]
gi|392677172|gb|EIY70591.1| hypothetical protein HMPREF1070_00743 [Bacteroides ovatus
CL03T12C18]
Length = 461
Score = 170 bits (430), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 119/377 (31%), Positives = 188/377 (49%), Gaps = 39/377 (10%)
Query: 6 KYFFALTCTLLFNDAF--LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
++F A TC L AF + A K+P+ +II+ADD+G+ DV +G+ I TPNID +A
Sbjct: 6 RHFVAATCALA---AFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEYIKTPNIDQMAR 62
Query: 64 NGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVIL----EGEPWGLPLTEKLLPQY 118
G++ H ++ +P+R L+TG+Y G++ +++ + + GL E +
Sbjct: 63 EGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLEKVLLVPRDDKDKEVGLQSEEITFAKI 122
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLD 177
L + GY T IGKWHLG+ ++ + F GF G+ G DY H + + + GL+
Sbjct: 123 LGDNGYRTALIGKWHLGYLQKHHPMNF-GFQKFVGFKSGNVDYQSHRNRYGDVDWWDGLE 181
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
M K + Y+T L T + + I E NK KP LY+AH A H+ P Q P+E
Sbjct: 182 M--------KNMPGYTTTLLTTLSEDYIKE-NKDKPFCLYIAHAAPHS-----PMQGPNE 227
Query: 238 EVAKF-------LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
+ + + P + Y MV LD SVG ++ L+K+ + +N+ V+F +DNG
Sbjct: 228 KAIRTEATLEGDKNSDRPNKEIYKDMVEELDWSVGRILETLKKYKLDKNTFVVFFSDNGP 287
Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
+ +N GS RG K PW+GG R + P + + + D PT+
Sbjct: 288 ----VINNGGSTGGYRGAKGAPWEGGHRVPGICYMPGTVKGGTMCKQTVMSFDLFPTMLD 343
Query: 351 AAGIEINDT--SLDGVN 365
A I +D+ LDG +
Sbjct: 344 MANIHYDDSKKKLDGAS 360
>gi|440908775|gb|ELR58760.1| N-acetylgalactosamine-6-sulfatase, partial [Bos grunniens mutus]
Length = 525
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 120/367 (32%), Positives = 178/367 (48%), Gaps = 27/367 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
A + P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL
Sbjct: 30 ALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAAL 89
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P +E LLP LK AGYA+ +GKWHLG
Sbjct: 90 LTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLG-H 148
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
R + P GFD +G YD+ + Y+ +M ++ KT
Sbjct: 149 RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL 208
Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T +Y + A+ I + +P FLY A A H AP FL S +R
Sbjct: 209 TQIYLQEALEFIQRQQAAHRPFFLYWAVDATH---------APVYASKPFLGTS--QRGR 257
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y + LD+SVG ++ LR + EN+ V F +DNGA + +GSN P K T
Sbjct: 258 YGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGA-ALISAPRQGSNGPFLCGKQTT 316
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
++GGMR A W P +VS +L I D T + AG+E D ++DG++ +
Sbjct: 317 FEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPRDRAIDGLDLLPAML 376
Query: 372 KGAKTKR 378
+G T R
Sbjct: 377 QGRLTDR 383
>gi|343087074|ref|YP_004776369.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342355608|gb|AEL28138.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 510
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/333 (35%), Positives = 173/333 (51%), Gaps = 25/333 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
P+II+++ DD G+ D+S HGS + TPN+D L + L V C P+RSA+M+G++
Sbjct: 33 PNIILVMTDDQGYGDISAHGSPDVFTPNMDKLKSQSIRLEDFQVSPTCAPTRSAIMSGRH 92
Query: 90 PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
P G+ H ILE E L LT LPQ LK GY + GKWHLG +E Y P RGFD
Sbjct: 93 PFKNGITH-TILERERMALGLT--TLPQVLKRGGYTSGIFGKWHLGDEQE-YQPDSRGFD 148
Query: 150 SHYGYWQ-GLQDYYDHSC-----KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
+ + G+ Y SC F P ++HN V KT G + +T+A
Sbjct: 149 EVFIHGAGGIGQAYAGSCADVPDNGYFNPI----IKHN-GVFVKTEGFCTDVFFTQALSW 203
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDE 262
+ + + +P F Y+ A H PF AP++ KF+D S PE + + GM+ +D+
Sbjct: 204 IKQKSSAEEPFFAYITTNAPHG-----PFIAPEKYKQKFIDQSYPEDAQGFYGMIENVDD 258
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS--NHPLRGMKSTPWDGGMRGV 320
++G ++ L G+ +N+I++FM DNG G ++ G N +RG K + ++GG R
Sbjct: 259 NLGVLMDKLDAWGIADNTILIFMTDNGKTYGGYNTVHGETYNAGMRGFKGSVYEGGTRVP 318
Query: 321 AAI-WSPWLKQTQKVSSELFHISDWLPTLCAAA 352
I W KQ Q++ L H D LPT A
Sbjct: 319 FFIRWPKVFKQGQEIDVMLNHY-DILPTFAEIA 350
>gi|347528892|ref|YP_004835639.1| putative arylsulfatase [Sphingobium sp. SYK-6]
gi|345137573|dbj|BAK67182.1| putative arylsulfatase [Sphingobium sp. SYK-6]
Length = 458
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 184/373 (49%), Gaps = 27/373 (7%)
Query: 3 WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
++R+ F L A + + PHI+ I+ADDLG+ D+SF G ++ TP+ID++A
Sbjct: 2 FSRRAFLGTAAALPLA-ARVEAAPARPPHIVFIMADDLGYADLSFTGRTEYATPHIDSIA 60
Query: 63 YNGLILNQHYV-QALCTPSRSALMTGKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYL 119
G + Y ++C+P+R AL TG+Y + G+ + EP GLP + L
Sbjct: 61 EQGCFFSDAYANSSVCSPTRMALATGRYQYRLRGGLDEPIGGGPEPVGLPPEHPTIASRL 120
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR 179
+EAGY T +GKWHLG ++P G+D +G G DY+ HS + + D+
Sbjct: 121 REAGYRTVLVGKWHLG-SPPRFSPLLSGYDRFFGIHGGGADYFTHS--QSMQGGGSSDLY 177
Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEE 238
V ++ Y TD+ TE A+ I ++ KP+FL L + A H P++ P++E
Sbjct: 178 DGSSVTDRH--GYLTDILTERAVAEIDLAARTRKPLFLSLHYTAPH-----WPWEGPEDE 230
Query: 239 -----VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
+ L+ T+A MV LD+ VG V+AAL GM +++IV+F +DNG F
Sbjct: 231 ALAAQIKSLLNYDGGSDSTFAKMVQALDKGVGKVLAALEAQGMTQDTIVVFTSDNGGERF 290
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
P GMK +GG+R + P S+++ D++PT A AG
Sbjct: 291 ------SKVWPFDGMKGDLLEGGIRVPMMMRWPGKIPAGSRSAQIAMTMDYMPTFLATAG 344
Query: 354 IEINDT-SLDGVN 365
+ DG+N
Sbjct: 345 GSADPAFPSDGIN 357
>gi|171912352|ref|ZP_02927822.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 491
Score = 169 bits (428), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 132/383 (34%), Positives = 186/383 (48%), Gaps = 33/383 (8%)
Query: 10 ALTCTLL-FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
AL L F+ A + A P++++I ADDLG+ D+ G+ TPN+D +A G+
Sbjct: 13 ALAGVLFSFHTATAASGAQGSPNVVVIFADDLGYGDLGCFGAKGQATPNLDRMAAEGVKF 72
Query: 69 NQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
+ YV Q +C+ SR ALMTG YP +G++ G + G G+ E L + +K+ GYAT
Sbjct: 73 ERFYVAQPVCSASRMALMTGCYPNRVGIK-GALGPGAKVGISKEETTLAELVKQNGYATA 131
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ---V 184
A GKWHLG + + P GFD + G + H P Q R V
Sbjct: 132 AFGKWHLGDDPQ-FLPVRHGFDEYLGLPYSNDMWPYHPELVNLTPEQRKKRRGFPALPLV 190
Query: 185 DNKTIGIYS---------TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
D I + T YT+ A+ I NK KP LYLAH H P
Sbjct: 191 DGDRIILPEVTTVEQTRLTTWYTQRAVKFI-NTNKDKPFLLYLAHSMPHV-----PLFVS 244
Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFG 294
D+ + Y ++ +D SVG V+ AL+++G+ EN++V+F +DNG S+G
Sbjct: 245 DKYKG------NSANGVYGDVIEEIDWSVGEVLKALKENGLDENTLVIFTSDNGPWLSYG 298
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
HS GS PLR K T W+GG+R P + KV +E D LPT+ A G
Sbjct: 299 THS--GSPGPLREGKGTVWEGGVREPFVARWPGKIPSGKVVTEPAMTIDILPTVAALTGS 356
Query: 355 EINDTSLDGVNQWDVL--TKGAK 375
++ D +DG N W +L T GAK
Sbjct: 357 KLPDHKIDGRNIWPLLSATDGAK 379
>gi|443704175|gb|ELU01350.1| hypothetical protein CAPTEDRAFT_214223 [Capitella teleta]
Length = 336
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 97/256 (37%), Positives = 140/256 (54%), Gaps = 15/256 (5%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+II ADD G+ D + +S I TPNID L +G+ Y C+PSRS+ ++G+
Sbjct: 93 KPNIIWFFADDYGYQDAGYR-NSDIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGR 151
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y GMQHGVI + + L L YLKE Y THA GKWHLG+ + TPT+RGF
Sbjct: 152 YAYTSGMQHGVISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGF 211
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGL-DMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
D+ G + G YY+H+ +QG+ D + ++D GI+S DL +N + E
Sbjct: 212 DTFSGGYSGEGKYYEHTT------FQGMYDWHNGTKLDRSASGIHSQDLIEADLLNQLDE 265
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVGN 266
N P+F Y A NT+ P Q E A + D+ P R+ Y G+VS +D +G
Sbjct: 266 -NDGTPLFYYAAFQ-----NTHSPMQPKPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGK 319
Query: 267 VIAALRKHGMLENSIV 282
V+A L++ M +N+++
Sbjct: 320 VVAKLKEKDMFDNTVM 335
>gi|149199736|ref|ZP_01876767.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149137141|gb|EDM25563.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 585
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 182/365 (49%), Gaps = 48/365 (13%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
KKP+III L DD GW D+S +G+ I TPNID+LA++G + YVQ +C+P+R+ L+TG
Sbjct: 20 KKPNIIIFLTDDQGWGDLSINGNKDISTPNIDSLAHDGALFENFYVQPVCSPTRAELLTG 79
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+Y G++ GE + L E+ + K+AGYAT A GKWH G + Y P RG
Sbjct: 80 RYAFRSGVR-STSEGGERFN--LDEQTIADVFKKAGYATGAFGKWHSG-MQYPYHPNGRG 135
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
FD YG+ C + Y + HN ++ K G + D +T A++ I E
Sbjct: 136 FDEFYGF-----------CSGHWGNYYSPMLEHNGKI-VKGQG-FCVDDFTSKAMDFI-E 181
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY-------------- 253
NK KP +YL + H+ P Q PD K+ D + + TY
Sbjct: 182 ENKEKPFLVYLPYNTPHS-----PMQVPDRFWEKYQDKTLTMKNTYNKKWGHGEDFVKAA 236
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG--APSFGIHSNKGSNHPLRGMKST 311
M +D +VG V+ L + + +N+IVL+ +DNG +P + N ++G K +
Sbjct: 237 LAMCENIDWNVGRVLNKLDELQIADNTIVLYFSDNGPNSPRW--------NGEMKGKKGS 288
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
+GG+R + P +V E+ D LPTL + AGIE + LDG++ +L
Sbjct: 289 VDEGGVRSPLHLRWPQKVAQGQVIKEVTGAIDLLPTLSSMAGIEFQSKKKLDGLDFSSLL 348
Query: 371 TKGAK 375
T K
Sbjct: 349 TGKEK 353
>gi|311745970|ref|ZP_07719755.1| arylsulfatase [Algoriphagus sp. PR1]
gi|311302452|gb|EAZ80458.2| arylsulfatase [Algoriphagus sp. PR1]
Length = 476
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 132/406 (32%), Positives = 188/406 (46%), Gaps = 74/406 (18%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
+P+KP+II ILADDLG+ D+ F G I TPNID LA G+ + Y + +C PSRS+
Sbjct: 14 ASPEKPNIIFILADDLGYGDLGFLGQEYIETPNIDRLAKEGMFFSDFYSGSTVCAPSRSS 73
Query: 84 LMTGKYPIHIGMQHGVILEGEPWG-LPLTEKLL--PQYLKEAGYATHAIGKWHLGFFREV 140
+TG + H ++ E +P G P+ + +L + +K+AGY T A GKW LGF
Sbjct: 74 FLTGMHTGHTPIRGNS--EVQPEGQFPMPDSVLTVAKVMKQAGYVTGAFGKWGLGFIGST 131
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV------DNKTIGIYST 194
P +GF+ YGY +C+ Y + HN + D T G Y+
Sbjct: 132 GEPASQGFEEFYGY----------NCQRYAHRYYPAYLWHNDEKIDLPGNDWTTKGDYAP 181
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV-----AKFLDIS--- 246
DL E + I E NK P FL++ + HA APD+E+ A F D
Sbjct: 182 DLIQEKTLEFIDE-NKENPFFLFMPIVTPHA-----ELAAPDDEIMEKYRAMFPDEKPYV 235
Query: 247 -------DPERR------------TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMAD 287
P+ R T+A MV R+D VG V+ L +G+ EN++++F +D
Sbjct: 236 AGNGADYGPDMRIPGYQSQPYPHATFAAMVERIDRYVGEVLQKLESNGLSENTVIIFTSD 295
Query: 288 NGAPSFGIHSNKG-------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
NGA H G SN RG K ++GG+R W P K S H
Sbjct: 296 NGA-----HREGGADPEFFDSNGKFRGFKRDLYEGGVRAPMIAWWP---NKIKAGSTTDH 347
Query: 341 IS---DWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
+S D LPT A G +I S+DG++ L ++ E L+
Sbjct: 348 VSAFWDLLPTFAAIGGEKIT-KSIDGISFLPTLLGEGNQQKHEFLY 392
>gi|149177301|ref|ZP_01855906.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148843826|gb|EDL58184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 501
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 125/427 (29%), Positives = 188/427 (44%), Gaps = 67/427 (15%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F +L AF A P+II+I++DD G+ D+ GS +I TP++D LA G
Sbjct: 16 FISLALLFCLCSAFGLKAAETPPNIIMIVSDDQGYRDLGSFGSEEIMTPHLDRLAKEGAK 75
Query: 68 LNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-----------------GLP 109
L YV CTPSR +L+TG+YP G+ + E + G+
Sbjct: 76 LTSFYVTWPACTPSRGSLLTGRYPQRNGIYDMIRNEAPDFGHKYKPAEYEVTFERIGGMD 135
Query: 110 LTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKAT 169
+ EKLLP LK AGY + GKW LG + + P RGFD YG+ DY+ H
Sbjct: 136 VREKLLPALLKPAGYVSAIYGKWDLGIHKR-FLPLARGFDDFYGFTNTGIDYFTH----- 189
Query: 170 FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
E Y M N Q + G Y T L+ A+ I E N KP FLYL A H ++
Sbjct: 190 -ERYGVPSMYRNNQPTEEDKGTYCTYLFQREAVRFIKE-NHQKPFFLYLPFNAPHGASSL 247
Query: 230 EP-----FQAPDEEVAKFLDISD-----------------------------PERRTYAG 255
+P QAP++ + + D R Y
Sbjct: 248 DPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRERPDGPVIHQGVSASKRRLEYVA 307
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
++ +D+++G V+ L ++ + +N+IV+F +DNG S N PL+G K ++G
Sbjct: 308 SITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGG------SGGADNSPLKGKKGMMFEG 361
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGA 374
G+R + P + V+ EL + +PT A I + + +DG + VL
Sbjct: 362 GIRVPCLVRYPAKIKPGTVNDELLTSLELVPTFLKEAAIPLPENVVIDGYDMLPVLMGKT 421
Query: 375 KTKRSEI 381
+ R+E+
Sbjct: 422 TSPRNEM 428
>gi|294053769|ref|YP_003547427.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613102|gb|ADE53257.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 490
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 119/365 (32%), Positives = 173/365 (47%), Gaps = 40/365 (10%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A +P+ I I+ADDLG+ D+ F+GS + TP +D LA+NG+I YV C PSR+ L
Sbjct: 19 ADNRPNFIFIVADDLGYGDLGFNGSKVVETPTLDRLAHNGVIFQNGYVTHPYCGPSRAGL 78
Query: 85 MTGKYPIHIGMQHGVILEGEPW----GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
+TG+Y GM+ + L P+ GLPL EK + ++ AGY T IGKWHLG E
Sbjct: 79 ITGRYQARFGME--INLTYSPFDLHQGLPLDEKTFAERVRPAGYRTGMIGKWHLG-ASEP 135
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDH-----------SCKATFEPYQGLDMRHNMQVDNKTI 189
+ P RGFD YG+ G DY+ + K + +G +
Sbjct: 136 FHPNSRGFDYFYGFLSGGHDYFPEMVNTHPKLLLPNGKPHYSHNEGCTQPLLRNKNAAEF 195
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
Y T ++ A +++ KP LYLA+ A H P AP E V K+ + +
Sbjct: 196 NEYLTTALSKDAARFVSD--SEKPFLLYLAYNAPHG-----PLHAPKELVEKYSKLEKDK 248
Query: 250 RR-TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG-------S 301
RR TY M+ +D+ +G ++ AL + G L+N+++ FM+DNG G+ S G
Sbjct: 249 RRATYLAMIDSMDQGIGLIVEALEQSGKLDNTLIFFMSDNG----GVQSKPGHEYETWAD 304
Query: 302 NHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
N P R K + +GG A W Q + + D T +G +
Sbjct: 305 NGPFRNGKGSMREGGSHVPFIAHWPKGFPQGITYPHPVSSL-DLTATAVELSGGDATGKP 363
Query: 361 LDGVN 365
LDGVN
Sbjct: 364 LDGVN 368
>gi|340619489|ref|YP_004737942.1| sulfatase [Zobellia galactanivorans]
gi|339734286|emb|CAZ97663.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 483
Score = 169 bits (428), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 126/392 (32%), Positives = 201/392 (51%), Gaps = 64/392 (16%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
+ T+L F T K+P+I+ IL DDLGW D++ +GS+ TPNID LA G+ +
Sbjct: 13 VLLTVLGPTVFAQTD--KQPNIVFILVDDLGWMDIAANGSTFYETPNIDQLAKEGIRFTK 70
Query: 71 HYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEP---WGLPLTEKLLPQYLKEAGYAT 126
Y + +C+P+R++++TGK P I + + G P LPL E L P+ L++AGY T
Sbjct: 71 AYAASPICSPTRASILTGKNPARIDLTQWIGGPGNPDYLRNLPLEEVLFPELLQDAGYKT 130
Query: 127 HAIGKWHLGFF--REVYTPTFRGFDSHY-GYWQG---LQDYYDHSCKATFEPYQGLDMRH 180
+GKWHL E + P +GFD + G+++G +++ Y F P+ ++
Sbjct: 131 AFMGKWHLNNVAGEEKFWPDKQGFDVNVAGHFRGGLYIKNKY-------FSPWNIPNL-- 181
Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+N G Y TD + AI+ I E++K+ P LY + +VHA PF AP + V
Sbjct: 182 ----ENGPEGEYMTDRLAKEAIDFIDENSKA-PFLLYFSLYSVHA-----PFDAPADRVE 231
Query: 241 KF------LDISDPER-------------------RTYAGMVSRLDESVGNVIAALRKHG 275
K+ L ++D ER TYA MV +D +VG ++ L++ G
Sbjct: 232 KYEKKKKALALTDKERFAEEKNAQKPFKYRIKQDHPTYAAMVESMDMAVGKILGKLQEKG 291
Query: 276 MLENSIVLFMADNGAPSFGIHSNKG---SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
+ +N++V+F +DNG G+ +++G +N PLR K ++GG+R A I P +
Sbjct: 292 IADNTVVIFFSDNG----GLSTSEGIPTANTPLRAGKGWLYEGGIREPAIIRWPGTIKPG 347
Query: 333 KVSSELFHISDWLPTLCAAAGIEIN-DTSLDG 363
VS + D+ PT+ G + D +DG
Sbjct: 348 TVSDAVITSMDFYPTILEMTGQALRPDLHVDG 379
>gi|383780433|ref|YP_005464999.1| putative sulfatase [Actinoplanes missouriensis 431]
gi|381373665|dbj|BAL90483.1| putative sulfatase [Actinoplanes missouriensis 431]
Length = 464
Score = 169 bits (427), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 126/375 (33%), Positives = 184/375 (49%), Gaps = 39/375 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
K+P+I+ ILADDLGW D+S +GS +I TPN+D LA +G+ Q Y A+C+P+R AL T
Sbjct: 46 KRPNILFILADDLGWADLSSYGSPEIRTPNLDRLARSGVRFTQGYSAGAVCSPTRVALYT 105
Query: 87 GKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
G+ P G+ + E G+P T L LK+ GYAT IGKWH GF ++P
Sbjct: 106 GRLPGRTPGGLPEPIGSVNERDGIPATHPTLASLLKKTGYATALIGKWHGGFLPN-FSPL 164
Query: 145 FRGFDSHYGYWQGLQDYYD-HSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
G+D +G + G DYY +S ++ ++G + + G Y TD+ + A +
Sbjct: 165 RTGWDYFFGNYSGGVDYYSKYSHTGKYDLFEG-------ETEVPDDGRYYTDVIADKAED 217
Query: 204 VIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAP-----DEEVAKFLDISDPER------- 250
I + K KP L L + H P++AP EE+ + + +
Sbjct: 218 WITKAKKQGKPWLLNLNFTSPHW-----PWEAPGDRKISEEITAKIKAGNTQAIFHNDGG 272
Query: 251 --RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
TY MV LD +G V+ ALR+ G EN++V+F +DNG F N PL G
Sbjct: 273 SLETYTKMVENLDARIGQVLNALRRSGQEENTLVVFASDNGGERFSY------NWPLTGN 326
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQW 367
K + +GG+R + P ++ +VS E DW TL AG + D LDG +
Sbjct: 327 KGSLSEGGIRVPTILRWPARIRSGQVSHEPVVTHDWTATLLEIAGAAPSPDHPLDGASFA 386
Query: 368 DVLTKGAKTKRSEIL 382
L +GA +I
Sbjct: 387 GYLLRGADAPERDIF 401
>gi|264677229|ref|YP_003277135.1| sulfatase [Comamonas testosteroni CNB-2]
gi|262207741|gb|ACY31839.1| sulfatase [Comamonas testosteroni CNB-2]
Length = 440
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 126/363 (34%), Positives = 175/363 (48%), Gaps = 32/363 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
P+I+ ILADDLGW D+ +G++ TP++D LA G+ NQ Y A+C+ +R AL+TG+
Sbjct: 4 PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63
Query: 89 YP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
Y + G++ + G GLP LP LK+AGY T IGKWHLG Y P
Sbjct: 64 YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPSYGPQRS 122
Query: 147 GFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
G+D +G G DY+ H F P D+ + +T G Y+ L EA V
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSP----DLWQGSEPVERT-GYYTYILGDEATRYV 177
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-----VAKFLDISDPERRTYAGMVSR 259
+ KP FL L A H P++ PD+E + + Y +V
Sbjct: 178 HERKGQDKPFFLSLHFTAPHW-----PWEGPDDEHVAHNIKDLFHYDGGSLKKYGEIVEA 232
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
LD++VG V+ AL G +N+IV+F +DNG F P G K+ +GG+R
Sbjct: 233 LDKAVGQVLQALDDTGQADNTIVIFTSDNGGERF------SKTWPFTGQKTELLEGGIRV 286
Query: 320 VAAIWSPWLKQTQKVSSELFHIS-DWLPTLCAAAGIEIN-DTSLDGVNQWDVLT-KGAKT 376
+ W + Q E +S DWLPTL AAAG + D DG N +L A T
Sbjct: 287 PTLL--RWSARIQPQVQEQVTVSFDWLPTLLAAAGARPHPDFPSDGQNILPILEGHAANT 344
Query: 377 KRS 379
+RS
Sbjct: 345 ERS 347
>gi|354581367|ref|ZP_09000271.1| sulfatase [Paenibacillus lactis 154]
gi|353201695|gb|EHB67148.1| sulfatase [Paenibacillus lactis 154]
Length = 446
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 117/375 (31%), Positives = 194/375 (51%), Gaps = 31/375 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K+P+ I I DDLG+ D+ +GS + TPN+D+LA G+ Y + +C+PSR++L+T
Sbjct: 9 KQPNFIDIYCDDLGYGDLGCYGSDTVTTPNLDSLAGEGIRFTNWYSNSPVCSPSRASLLT 68
Query: 87 GKYPIHIGMQHGVILEGEPW--GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
GKYP G+ G IL + GLP TE L + LK AGY T GKWHLG E +P
Sbjct: 69 GKYPARAGV--GEILGAKRGLDGLPSTEVTLAKALKPAGYRTALYGKWHLGVSEET-SPN 125
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAA 201
GFD +G+ G D+Y H + G++ H++ + + G Y T+L TE +
Sbjct: 126 AHGFDEFFGFKAGCIDFYSHIF--YWGQGHGVNPLHDLWENETEVWENGRYMTELITERS 183
Query: 202 INVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
++ I + + P FL++++ A H P AP E + +F + +R+ A M++ +
Sbjct: 184 VDFIKRSREQEDPFFLFVSYNAPHY-----PMHAPKEYMDRFAHLP-WDRQVMAAMIAAV 237
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH--------SNKGSNHPLRGMKSTP 312
D+ VG ++ AL+ G EN+++ F +DNG S + G+ RG K++
Sbjct: 238 DDGVGEIVKALKAAGCYENTVIFFSSDNGPSSESRNWLDGTEDVYYGGTAGIFRGHKASL 297
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-----INDTSLDGVNQW 367
++GG+R A + P + +V E+ + D +PT AG++ + LDG +
Sbjct: 298 FEGGIREPAILSWPNGLEGGQVRDEVGVMMDIVPTFLDLAGVDPATRPLEGAELDGSSLK 357
Query: 368 DVLTKGAKTKRSEIL 382
++L G + ++
Sbjct: 358 EMLLMGEPSPHKQLF 372
>gi|223938010|ref|ZP_03629909.1| sulfatase [bacterium Ellin514]
gi|223893411|gb|EEF59873.1| sulfatase [bacterium Ellin514]
Length = 500
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 143/471 (30%), Positives = 216/471 (45%), Gaps = 83/471 (17%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSAL 84
A +P+ + ILADDLGW DV F+GS+ TPN+D LA G+ Y ++C+P+R+++
Sbjct: 35 AADRPNFVFILADDLGWKDVGFNGSTFYETPNLDRLAREGMRFTDAYAACSVCSPTRASI 94
Query: 85 MTGKYPIHIGMQHGVILEGEP-------------WGLPLTEKLLPQYLKEAGYATHAIGK 131
MTGKYP + + L G P LP E L + L+E GY T IGK
Sbjct: 95 MTGKYPARLHLTD--WLPGRPDKPDQILKHPKIITELPAAEITLAKALQEGGYKTAFIGK 152
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
WHLG + P GFD + G G + S F PY+ N + + +G
Sbjct: 153 WHLGGLG--HWPEQAGFDINIG---GCGMGHPSSY---FSPYK------NPTLKDGPVGE 198
Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-------DEEVAKFLD 244
Y D T+ A+ I E+ K P LYL+H +VH P QA ++V +
Sbjct: 199 YLADRLTDEAVKFI-ENTKGTPFLLYLSHYSVHT-----PLQAKKGLIEKYQKKVMQLPP 252
Query: 245 ISDPERRT--------------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
PE T YA M+ LDESVG V+ L++ G+ +N++++F +DNG
Sbjct: 253 TKGPEFVTEGNTNARQVQNQPIYAAMMQSLDESVGRVLDKLKELGLDKNTVIIFTSDNG- 311
Query: 291 PSFGIHSNKG---SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPT 347
G+ + +G SN PLR K P++GG+R + P + + VS +D+ PT
Sbjct: 312 ---GLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVVKWPGVTKAASVSDHQVMSTDYYPT 368
Query: 348 LCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
L AG+ + LDG++ L +R H + +++R D K +
Sbjct: 369 LLEIAGLPARPEQHLDGISFTPALRGKEMGERPLFWHYPHYSNQGGAPSSSIRKGDWKLI 428
Query: 407 AGTDNNGQSDEWYGDTDNEIDKYS------PKEVLYSKAGITFNALKTKLQ 451
EWY +N I+ ++ K L S + + LK++LQ
Sbjct: 429 ----------EWY--EENRIELFNLRLDVGEKNDLASTSALKREELKSELQ 467
>gi|423293952|ref|ZP_17272079.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
CL03T12C18]
gi|392677173|gb|EIY70592.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
CL03T12C18]
Length = 453
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 125/421 (29%), Positives = 212/421 (50%), Gaps = 37/421 (8%)
Query: 1 MTWARKYFFALTCTLLFNDAFLNTTAPKK-PHIIIILADDLGWNDVSFHGSSQIPTPNID 59
M + AL T+ A + T K P++++IL DDLG D+S + + TPNID
Sbjct: 1 MKIEKIALLALGTTMTSGAALASNTVDKATPNVLLILVDDLGLGDLSCQYAKDLSTPNID 60
Query: 60 ALAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEG--EPWG-LPLTEKLL 115
+ G+ L+ Y ++ +PSR+AL+TG++P +G+ GVI + WG + +
Sbjct: 61 RIFETGVRLDNFYANSSVSSPSRAALLTGRFPAMVGVP-GVIRPSIDQNWGYFDPSAVTM 119
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQ 174
P+ LK GY T IGKWHLG ++ P RGF+ +G+ + DYY H +
Sbjct: 120 PEVLKNGGYRTALIGKWHLG-WKSPNLPNERGFEHFHGFLADMMDDYYTHQRQGG----- 173
Query: 175 GLDMRHNMQVDNKTIGI--YSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEP 231
+ M +++K I ++T+L+T +++ I + K K P FLYLA+ A H+ P
Sbjct: 174 -----NYMYLNDKEITPKGHATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----P 223
Query: 232 FQAPDEEVAKF--LDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
Q P E V K D S P +R ++ LD+++G V+ AL + G L N++V+F +DN
Sbjct: 224 LQPPVEWVNKVQERDKSLPVKRARLIALIEHLDDNIGKVMQALEESGQLSNTLVIFASDN 283
Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
G G + +N P RG K ++GG+ A+ P + + + + + D +PT+
Sbjct: 284 G----GDRGSMANNGPTRGAKGDMFEGGIHVPCALNMPGVFEGGRRDNHFVVMMDLMPTI 339
Query: 349 CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKY 405
C + + +DG++ D + + + + + N PQ K +A+R ++ K
Sbjct: 340 CDFVNVPVRH-EIDGISVLDAIKRKKQNTEDRFVFWLRNEGGPQFGGKSQSAVRYNEYKL 398
Query: 406 V 406
+
Sbjct: 399 L 399
>gi|221068581|ref|ZP_03544686.1| sulfatase [Comamonas testosteroni KF-1]
gi|220713604|gb|EED68972.1| sulfatase [Comamonas testosteroni KF-1]
Length = 440
Score = 169 bits (427), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 129/364 (35%), Positives = 180/364 (49%), Gaps = 34/364 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
P+I+ ILADDLGW D+ +G++ TP++D LA G+ NQ Y A+C+ +R AL+TG+
Sbjct: 4 PNIVFILADDLGWADLGVYGATDFKTPHLDHLASQGVRFNQAYANSAVCSATRIALITGR 63
Query: 89 YP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
Y + G++ + G GLP LP LK+AGY T IGKWHLG Y P
Sbjct: 64 YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQRS 122
Query: 147 GFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
G+D +G G DY+ H F P D+ + +T G Y+ L EA V
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSP----DLWQGNEPVERT-GYYTYILGDEATRYV 177
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVAK----FLDISDPERRTYAGMVSR 259
++ KP FL L A H P++ P DE VA+ + Y +V
Sbjct: 178 HERKDQDKPFFLSLHFTAPHW-----PWEGPGDEHVARSIKDLFHYDGGSLKKYGEIVEA 232
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
LD++VG V+ AL G +N+IV+F +DNG F P G K+ +GG+R
Sbjct: 233 LDKAVGQVLQALDDTGQADNTIVIFTSDNGGERF------SKTWPFTGQKTELLEGGIRV 286
Query: 320 VAAI-WSPWLK-QTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTK-GAK 375
+ WS ++ Q Q+ + F DWLPTL AAAG + D DG N +L A
Sbjct: 287 PTLLRWSARIRPQVQEQVTASF---DWLPTLLAAAGARPHPDFPSDGQNILPILQGIAAN 343
Query: 376 TKRS 379
T+RS
Sbjct: 344 TERS 347
>gi|431797524|ref|YP_007224428.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430788289|gb|AGA78418.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 442
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 134/426 (31%), Positives = 207/426 (48%), Gaps = 57/426 (13%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
K +F L + A + A ++P+II+I++DD G+ D SF + TPNID +A G
Sbjct: 2 KRYFTLLLFIWLPLALV--VAQERPNIIVIVSDDAGYADFSFQSTKLTTTPNIDRIAQEG 59
Query: 66 LILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVI------LEGEPWGLPLTEKLLPQY 118
YV A +C+PSR+ L++G G + I ++ +G+P K + +Y
Sbjct: 60 AKFTNAYVTASVCSPSRAGLLSGTNQPEFGHIYNYIKGVEYSIQQHEYGIPHNIKTIGEY 119
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
L+ GYAT A GKWH GF E + P +GFD +G+ G Y H+ KA +D+
Sbjct: 120 LQPLGYATAAFGKWHEGFAEE-FQPEQKGFDHFWGFLWGANHY--HTGKA-------IDV 169
Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
+ + + T Y TD T+ A++ + H + P FLY+A+ AVH+ P +A E
Sbjct: 170 IDDGKPVDPTTIPYMTDAITDEALSFASAHEED-PFFLYIAYNAVHS-----PLEAKPEH 223
Query: 239 VA----KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+A KF D DP+R A M LDE+VG + L G L+N+I+ F+ DNG G
Sbjct: 224 IALYEGKFPD--DPKREILAAMTHSLDENVGKIFNRLEAMGELDNTIIFFINDNG----G 277
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
N PL+G K ++GG+R A+ P + + V + D LP++ A G
Sbjct: 278 SKHISADNSPLKGYKGQLYEGGIRVPFAVRWPGKIKGKTVCHSITSSLDILPSILDAVGY 337
Query: 355 EINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ------KYYAALRVDDLKYVAG 408
+ L+G + VL E++HN N ++ + +Y A+R K +
Sbjct: 338 TARN--LEGQS---VL---------ELIHNPQNFEHRELFWHTSQYGGAIRAGQWKLI-- 381
Query: 409 TDNNGQ 414
D NG
Sbjct: 382 IDQNGH 387
>gi|126304968|ref|XP_001376926.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Monodelphis
domestica]
Length = 520
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 119/385 (30%), Positives = 184/385 (47%), Gaps = 27/385 (7%)
Query: 9 FALTCTLLFNDAFLNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
L +LLF A L A +P +I+ +L DD+GW D+ G TP++D +A G++
Sbjct: 6 LVLGVSLLFGWAPLGAGATSQPPNIVFLLMDDMGWGDLGVFGEPSRETPHLDQMAAEGML 65
Query: 68 LNQHY-VQALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYL 119
Y LC+PSR+AL+TG+ PI G ++ + G+P +E LLP+ L
Sbjct: 66 FPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDSEFLLPELL 125
Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM- 178
K+AGY +GKWHLG R + P GFD +G YD+ Y+ +M
Sbjct: 126 KKAGYVNKIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKAMPNIPVYRNWEMV 184
Query: 179 ---RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQA 234
+ +++KT T +Y + A++ I + +P FLY A A HA
Sbjct: 185 GRFYEDFPINHKTGEANLTQIYLKEAVDFIKKQQAHQQPFFLYWAIDATHA--------- 235
Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
P FL S +R Y V +D+S+G ++ L+ G+ EN+ V F +DNGA
Sbjct: 236 PVYASKSFLGTS--QRGRYGDAVREIDDSIGKILKLLQDLGIKENTFVFFTSDNGAALIS 293
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
GSN P K T ++GG+R A W P +VS +L ++ D T + AG+
Sbjct: 294 APKEGGSNGPFLCGKQTTFEGGLREPAIAWWPGHIPAGQVSHQLGNVMDLFTTGLSLAGL 353
Query: 355 E-INDTSLDGVNQWDVLTKGAKTKR 378
+ +D +DG++ + +G R
Sbjct: 354 KPPSDREIDGIDLLPAIVQGKVIDR 378
>gi|343082868|ref|YP_004772163.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342351402|gb|AEL23932.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 462
Score = 168 bits (426), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 121/416 (29%), Positives = 201/416 (48%), Gaps = 48/416 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
+KP++++I++DD G++D+ G+ I TP +D LA G+ L YV + CTPSRS L+T
Sbjct: 23 QKPNVLLIISDDQGYHDLGSFGNKDILTPYLDQLAAEGVRLTNFYVTSSACTPSRSGLLT 82
Query: 87 GKYPIHIG----MQHGVILEGEPW-------------GLPLTEKLLPQYLKEAGYATHAI 129
G+YP G ++ ++ G + G L E + + LKE GY I
Sbjct: 83 GRYPQRNGTFELFRNNMVNYGHQYTDYEYSVSPERILGTDLREIFISELLKEGGYTNGYI 142
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
GKW LG + Y P +G+D +YG+ DY+ H E Y M Q+ +
Sbjct: 143 GKWDLGQLKR-YLPMQQGYDKYYGFANTGIDYFTH------ERYGSPSMIDGNQLSLEDK 195
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-FQAPDEEVAKFLDI--- 245
G Y+T L+ A+ I ++ KP FL LA A H+ + +P + + K+L +
Sbjct: 196 GRYTTALFEREALEFI-NGSQGKPFFLTLAFNAPHSASNLDPEIKGTVQATEKYLKMYPK 254
Query: 246 SDPER----RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
S+ +R R Y V+++D+S+G ++ +++ G+ EN++V+F++DNG S
Sbjct: 255 SNTKRGEKQRAYKAAVTQMDQSIGTILERIKEMGLEENTLVIFLSDNGGMSL------AD 308
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTS 360
N+PL+G K+ ++GG+R + P + + + P + AAAG+E+ D
Sbjct: 309 NYPLKGGKAQFFEGGVRVPCLVKWPEKIDAGQTNDAFLSSLELFPMIVAAAGLELPKDVI 368
Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
DG + VL + R+ + A RV + K+V NG D
Sbjct: 369 YDGFDMLPVLQGKSGVDRAALYWEFRGDQ-------AARVGNWKWVQSAKGNGLFD 417
>gi|254483658|ref|ZP_05096879.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
gi|214036091|gb|EEB76777.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
Length = 558
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 133/406 (32%), Positives = 195/406 (48%), Gaps = 64/406 (15%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGS--SQIPTPNIDALAYNGLILNQHYVQA-LCTP 79
+ + + P+II+I+ADDLG+ND+S G + TP ID LA G + Q Y A C P
Sbjct: 59 SASGEQPPNIILIVADDLGYNDISTFGGGVGGMQTPGIDRLAAEGAVFTQSYSGASTCAP 118
Query: 80 SRSALMTGKYPIHIGMQH-----------GVIL-----------------EGEP----WG 107
SR+ +MTG+YP G + GV+ E +P G
Sbjct: 119 SRAMMMTGRYPTRTGFEFTPTPAGMAKIVGVVAASMDSGLPVSVYDEALEESQPPYERKG 178
Query: 108 LPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTFRGFDSHYGYWQGLQDYYDH-- 164
LP +E + + LK GY T IGKWHLG RE P +GFD GL D
Sbjct: 179 LPASEVTIAETLKAKGYYTAHIGKWHLG--RENGMAPHDQGFDDSLLMQSGLYLPEDDPN 236
Query: 165 --SCKATFEPY-------QGLDMRHNM-QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPM 214
+ K +F+P G +N D Y TD +T+ +I VI + NK++P
Sbjct: 237 VVNAKVSFDPIDRFLWAGMGFSASYNSGDADKFKPAGYLTDYWTDESIKVI-KANKNRPF 295
Query: 215 FLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKH 274
FLYLAH H P Q+ E+ I +R YAGM+ +D SV ++ L
Sbjct: 296 FLYLAHWGPHT-----PLQSTREDYDALSGIEPHRKRVYAGMIRAVDRSVSRILDTLEAE 350
Query: 275 GMLENSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQ 332
G+ ++++V+F +DNG + GI N P RG K T ++GG+R + W +
Sbjct: 351 GIADDTVVVFTSDNGGAGYIGIQD---VNSPFRGWKITMFEGGLRVPMFVKWPKRIAAGS 407
Query: 333 KVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLT-KGAKT 376
V + + HI D +PTL AAAG + ++DGV+ ++T +GA++
Sbjct: 408 SVDTPVAHI-DMMPTLAAAAGASQPQGVAIDGVSLLPLVTGEGAQS 452
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/34 (55%), Positives = 25/34 (73%)
Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKS 538
L+N+ DDP EQNNLA S D LK+LE LA +++
Sbjct: 489 LYNLADDPTEQNNLAASYPDKLKELEALLAAHQA 522
>gi|220919599|ref|YP_002494902.1| sulfatase [Methylobacterium nodulans ORS 2060]
gi|219952019|gb|ACL62410.1| sulfatase [Methylobacterium nodulans ORS 2060]
Length = 482
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 130/404 (32%), Positives = 201/404 (49%), Gaps = 47/404 (11%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A P+I+ I+ADDLG+ DV +G + TP+ID +A G+ Q Y A+C+ +R+AL
Sbjct: 33 AAGPPNIVFIMADDLGYADVGCYGRPDLKTPHIDRIAARGVRFTQGYANSAVCSATRTAL 92
Query: 85 MTGKYP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+TG+Y + +G++ V GLP LP L+ AGY T +GKWHLG + +
Sbjct: 93 ITGRYQYRLPVGLEEPVTTRFT-GGLPAEHPTLPGLLRRAGYRTALVGKWHLGSVAK-FG 150
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGIYSTDLYTE 199
P RG+D+ +G+ G DY+ H +G D + ++ V + +G Y+T L
Sbjct: 151 PLQRGYDTFWGFRGGTIDYFSH---------RGTDDKPDLWDGDVRVEEVG-YATHLIGR 200
Query: 200 AAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR------- 251
A+ I ++ +P FL L H + P++ P++E A+ I+ R
Sbjct: 201 RAVETIGTAVRAGRPFFLSL-----HFNAPHWPWETPEDE-AESRRIAGSNLRHFDGGTQ 254
Query: 252 -TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
TY +V+ +D +G V+ AL +HG+ +++IV+F +DNG F P G K+
Sbjct: 255 ATYHRIVAEMDAQIGRVLDALEQHGIAQDTIVVFTSDNGGERF------SDTWPFTGRKT 308
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDV 369
+GG+R A + P VS++ DWLPTL AAAG + DG++ +
Sbjct: 309 ELLEGGLRIPAIVSWPRRLPAGVVSAQPMITMDWLPTLLAAAGASPDPVYPPDGIDLLPI 368
Query: 370 LT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
LT + A+ RS N AALR DLKY+ DN
Sbjct: 369 LTGEAAERPRSFYWRYRANAQ------AALREGDLKYLKIRDNT 406
>gi|386819433|ref|ZP_10106649.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386424539|gb|EIJ38369.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 489
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 119/360 (33%), Positives = 180/360 (50%), Gaps = 21/360 (5%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSALMT 86
KKP+++ IL DD G+ DV G+ I TP++D +A G+ +++ Q +C+ SR++ +T
Sbjct: 40 KKPNVVFILTDDQGFEDVGCFGAPAIKTPHLDQMAKEGIKLIDFSATQPVCSASRASYLT 99
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G YP IG+ H ++ GL E + LKE GY T GKWHLG E + PT +
Sbjct: 100 GCYPNRIGI-HNALMPDAKIGLNPEEITIADMLKEQGYNTAIFGKWHLGDAPE-FMPTKQ 157
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV-DNKTIGIYSTDLYTEAAINVI 205
GFD ++G + H + + L + N +V D T TE A++ I
Sbjct: 158 GFDEYFGIPYSNDMWPHHPQQGPIFNFDPLPLYQNEKVIDTLNDQSLLTTQITEHAVSFI 217
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
+ NK+ P FLY+AH H P D+ K ER Y ++ +D SVG
Sbjct: 218 -QKNKNNPFFLYVAHPQPHV-----PLFVSDKFKGK------SERGLYGDVIMEIDWSVG 265
Query: 266 NVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAI 323
++ L+ G+ EN++V+F +DNG S+G HS GS P R K T W+GG R A
Sbjct: 266 EILKTLKAEGIDENTLVIFTSDNGPWLSYGNHS--GSAKPFREGKGTSWEGGQRVPFIAR 323
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
+ L Q + +++ + I D PT+ A + +DG N WD+LT+ T E +
Sbjct: 324 FPGKLPQNKTITTPVMGI-DIFPTIAALTNSTLPKKKIDGKNVWDILTEKTTTTPHEAFY 382
>gi|218262868|ref|ZP_03477199.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
DSM 18315]
gi|218223078|gb|EEC95728.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
DSM 18315]
Length = 461
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 111/314 (35%), Positives = 164/314 (52%), Gaps = 30/314 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+KP+I+I+LADD G+ D F GS+ I TPNID LA G I +V A + +PSRS ++T
Sbjct: 29 RKPNILILLADDAGYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLT 88
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y G + + G+ G+P E+LLP LK GY T IGKWHLG P +
Sbjct: 89 GRYGQRYGYECNLDKPGD--GIPDDEELLPALLKRYGYRTGCIGKWHLG-SEPSQRPNAK 145
Query: 147 GFDSHYGYWQGLQDY-YDHSCK---ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
GFD+ YG G + Y YD + YQ +N Q + Y TD A
Sbjct: 146 GFDTFYGLLAGHRSYFYDPETSDKDGNLQQYQ-----YNGQ--KLSFDGYFTDELASKAR 198
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+AE +P LY++ A H+ P +A +E++A+F R+ YA M+ LD
Sbjct: 199 QFVAE--SEQPFMLYMSFTAPHS-----PNEAAEEDLARF---EGQPRQKYAAMMYALDR 248
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVA 321
VG ++ L+ G +++I+ F++DNG G +N+ SN PL+G K ++GG R
Sbjct: 249 GVGKIVDELKAAGKFDDTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPFF 304
Query: 322 AIWSPWLKQTQKVS 335
+W K+ Q+ +
Sbjct: 305 VVWGDRFKRDQRFT 318
>gi|343086062|ref|YP_004775357.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354596|gb|AEL27126.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 444
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 117/348 (33%), Positives = 168/348 (48%), Gaps = 30/348 (8%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSR 81
NT +P P+II+ DD G+ D+ +G+ TP++D LA G+ YV A +CTPSR
Sbjct: 29 NTISP--PNIIVFFTDDQGYADLGVYGAEDFETPHLDQLASEGIRFTNFYVPATVCTPSR 86
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
+ L+TG+YP + V+ GL + + LK AGY+T IGKWHLG ++ Y
Sbjct: 87 AGLLTGQYPKRSNLHEAVLFPYSEGGLSPQAFTMAELLKGAGYSTACIGKWHLG-HKDEY 145
Query: 142 TPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQV-DNKTIGIYSTDLYT 198
P +GFD+ YG Y + +YY + P L N +V +N + Y T YT
Sbjct: 146 MPYNQGFDTFYGVPYSNDMDNYYYKNIDFQSPP---LPFYENTKVIENGSDQRYLTKRYT 202
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
E + I + KP F+YLAH H F +P E + Y ++
Sbjct: 203 EETVKRIKNRGE-KPFFIYLAHNMPHT----PLFASPAFE-------GKSKNGLYGDVIM 250
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LD S G +I AL++ G+ EN+I +F +DNG KGS PLRG K+ W+GG R
Sbjct: 251 ELDWSAGEIINALKEEGIYENTIFVFTSDNGPA-------KGSAKPLRGKKAQTWEGGQR 303
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
I P V+ E D PT A +I + +DG++
Sbjct: 304 VPGIITWPGSIPRGVVTDEFVSTLDLFPTFAKLASTKIPEEIKIDGMD 351
>gi|329928435|ref|ZP_08282305.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
gi|328937871|gb|EGG34277.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
Length = 443
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 181/350 (51%), Gaps = 22/350 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K+P+ I+I DDLG+ D+ +GS + TP++D LA G+ Y + +C+PSR++L+T
Sbjct: 6 KQPNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLT 65
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
GKYP G+ + + GLP E L + LK AGY T GKWHLG E +P
Sbjct: 66 GKYPARAGVGEILGAKRGSHGLPADEVTLAKALKPAGYRTALFGKWHLGLSEET-SPNAH 124
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
GFD +G+ G D+Y H + G++ H++ + + G Y T+L TE +++
Sbjct: 125 GFDEFFGFKAGCVDFYSHIF--YWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVD 182
Query: 204 VIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
I + + P FL+ ++ A H P AP E + +F + +R+ A M++ +D+
Sbjct: 183 FIQRSREQEAPFFLFASYNAPHY-----PMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDD 236
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH--------SNKGSNHPLRGMKSTPWD 314
VG ++ AL++ G E++++ F +DNG S + GS RG K++ ++
Sbjct: 237 GVGEIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFE 296
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
GG+R A + P + +V E+ + D PT AG++ L GV
Sbjct: 297 GGIREPAILSWPNGWEGGQVRDEVVAMMDIAPTFLDLAGVDPAAGPLQGV 346
>gi|149196020|ref|ZP_01873076.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149140867|gb|EDM29264.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 462
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 176/367 (47%), Gaps = 26/367 (7%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPS 80
L A KP++I+I+ADDLG+ DV F+G IPTP ID++A NG+ + Y ++C PS
Sbjct: 15 LIVQASSKPNLIVIMADDLGYADVGFNGCKDIPTPGIDSIANNGVKFSSGYTSYSVCGPS 74
Query: 81 RSALMTGKYPIHIGMQHGVI--LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
R+ +TG+Y G + L LP +E + + L++ GY IGKWHLG
Sbjct: 75 RAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLQQVGYHCGIIGKWHLG-AE 133
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYY--------DHSCKATFEPYQGLDMRHNMQVDNKTIG 190
P RGF+ +G+ G Y+ K + Y R++ V
Sbjct: 134 PSLRPNQRGFNEFFGHLGGGHAYFPEKLRIIKTEDVKNEMDSYASYITRNDTPVKTTK-- 191
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
Y TD +++ AI + E N +P FL+L++ A H P QA + + +F I D R
Sbjct: 192 -YLTDEFSDEAIRFV-EKNYEQPFFLFLSYNAPHT-----PLQATQKYLDRFPHIEDQNR 244
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
RTY MVS +D+ + ++ L + +++IV F++DNG P+ N N LRG K
Sbjct: 245 RTYCAMVSAMDDGIVRLMKKLDALKIKDDTIVFFLSDNGGPT---TKNNSDNGDLRGKKG 301
Query: 311 TPWDGGMRGVAAIWSPW-LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDV 369
++GG R A+ P LK Q + + + A D LDGVN
Sbjct: 302 DVFEGGFRVPFAMQYPRELKPGQTYDLPVSSLDIFATIASLAQSPTHADKPLDGVNLLPY 361
Query: 370 LTKGAKT 376
L KG K+
Sbjct: 362 L-KGEKS 367
>gi|240977207|ref|XP_002402636.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215491217|gb|EEC00858.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 159
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 77/157 (49%), Positives = 113/157 (71%)
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
YTE A+++I HN S+P+FLYL++ A H G +AP+E + KFL I + R TYAGM
Sbjct: 1 YTERAVSLIRSHNPSQPLFLYLSYQAPHVGTGPSLLEAPEENMKKFLHIPEKNRTTYAGM 60
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V LD+S+G ++ AL++ ++ N+IV+F +DNGA +G+ SN+G N PLRG K T W+GG
Sbjct: 61 VDALDQSIGTIMEALQEADLIGNTIVVFTSDNGALPWGVKSNRGYNWPLRGGKFTLWEGG 120
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+R IWSP L+ +++VS+++ HI+DWLPTL +AAG
Sbjct: 121 LRTTTFIWSPLLEMSRRVSNQMMHITDWLPTLYSAAG 157
>gi|373956153|ref|ZP_09616113.1| sulfatase [Mucilaginibacter paludis DSM 18603]
gi|373892753|gb|EHQ28650.1| sulfatase [Mucilaginibacter paludis DSM 18603]
Length = 468
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 121/348 (34%), Positives = 180/348 (51%), Gaps = 32/348 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
++P+II ILADDLG+ DV +G ++I TPNID LA G L Y A +C+PSR L+T
Sbjct: 33 QQPNIIFILADDLGYGDVGVYGQTKILTPNIDLLAREGTKLTDFYAGAPVCSPSRGVLLT 92
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEK-----LLPQ------YLKEAGYATHAIGKWHLG 135
G H ++ + ++G G ++K LLPQ L+ +GY T IGKWH+
Sbjct: 93 GLNTGHATIRGNMAIKGGLAGNKGSDKIRRAGLLPQDSTIGDLLQRSGYTTGLIGKWHVD 152
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNMQVDNKTIGIY 192
F + TP GFD YG+ Y + +T+ P Y+ + Q N Y
Sbjct: 153 GFDTLATPQRHGFDYFYGWLVS----YPQTYASTYWPDTWYRDGKLVAVPQNQNGQKNYY 208
Query: 193 STDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
++++ T+ AI +A+H KS KP FL + H N + P AP + + D + +++
Sbjct: 209 TSEIITDDAIRYLAKHKKSKKPFFLMVNH-----SNPHSPLDAPHNTIYEGKDWTS-DQK 262
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------SNHPL 305
TYA MV+ LD SVG + L + G+ +N+IV+F +DNG S S S+ PL
Sbjct: 263 TYAAMVTYLDSSVGKIRKYLIESGLDKNTIVIFTSDNGPRSEPTTSLTAVSDFFHSSGPL 322
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
RG K +GG+R +W+ ++ VS + +D LPTL AG
Sbjct: 323 RGYKRDVTEGGIRVPMIVWNKNRIKSDAVSHTPGYFADILPTLAGIAG 370
>gi|149196405|ref|ZP_01873460.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
gi|149140666|gb|EDM29064.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
Length = 472
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 186/368 (50%), Gaps = 29/368 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A KP+II IL DDLG+ +V ++G I TP +D LA G+ HY A+C PSR++L
Sbjct: 17 AQMKPNIIYILCDDLGYGEVGYNGQKMIQTPELDKLASKGMRFTDHYCGNAVCAPSRASL 76
Query: 85 MTGKYPIHIGMQHGVILEGEPWG---LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
+TGK+P H ++ G P G +P + L + +K AGYAT IGKW LG F
Sbjct: 77 ITGKHPGHAFIRANS--PGYPDGQTPIPADSETLGKLMKRAGYATACIGKWGLGGFHNAG 134
Query: 142 TPTFRGFDSHYGY--WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
P +GFD YGY + +YY E + + N YS DL T
Sbjct: 135 NPHKQGFDHFYGYTDQRKAHNYYPEYLWRNGE-------KEMLNNKNGEENDYSHDLMTV 187
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP-ERRTYAGMVS 258
A+ I E K +P FLYLA+L H +Q PD +A++ D P E + +A M S
Sbjct: 188 DALKYI-EEKKDQPFFLYLAYLIPHV-----KYQVPD--LAQYKDKDWPKEMKIHAAMTS 239
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GSNHPLRGMKSTPWDGG 316
R+D +G + L + G+ +N++++F +DNGA +S K ++ L+G+K + +DGG
Sbjct: 240 RMDRDIGTIARRLEELGIADNTLIMFNSDNGAHGKS-NSEKFFNTSGDLKGLKRSMYDGG 298
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW-DVLTKGAK 375
+R + P Q VS + D +PT G E DG++ +L K ++
Sbjct: 299 VRSPMIAYWPGTIQAGSVSDHISAFWDMMPTFSELTG-EPFKGETDGISMLPTLLGKDSE 357
Query: 376 TKRSEILH 383
K+ + L+
Sbjct: 358 QKQHKYLY 365
>gi|406661522|ref|ZP_11069640.1| Arylsulfatase [Cecembia lonarensis LW9]
gi|405554671|gb|EKB49747.1| Arylsulfatase [Cecembia lonarensis LW9]
Length = 477
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 122/342 (35%), Positives = 175/342 (51%), Gaps = 30/342 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
P+II+I DD G++DV +G+S I TP++D LA GL YV QA+C+ SR+AL+TG
Sbjct: 31 PNIILIFTDDQGYHDVGVYGASDIETPHLDQLASEGLQFTNFYVAQAVCSASRAALLTGT 90
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP +G+ HG + GL E + LK GYAT GKWHLG E + PT +GF
Sbjct: 91 YPNRLGI-HGALDHSSKHGLHPEEATIADLLKPLGYATAVFGKWHLGHHPE-FLPTNQGF 148
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS--TDLYTEAAINVIA 206
D ++G + +H + Y L + N +V + S T +TE +I+ I
Sbjct: 149 DEYFGIPYSNDMWPNHPQTKDY--YPPLPIYQNDKVVDTIWNDQSMFTTWFTEKSIDFI- 205
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
E NK +P FLYLAH H P D KF S+ R Y ++ +D SVG
Sbjct: 206 ERNKDRPFFLYLAHPMPHV-----PLFVSD----KFKGKSN--RGLYGDVIMEIDWSVGQ 254
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI--- 323
++ L+++ + EN+I++F++DNG P + GS +PLR K T WDGG++ A +
Sbjct: 255 IMQTLKQNDLDENTIIIFLSDNG-PWLSYSEHSGSAYPLREGKGTSWDGGVKVPAIMRWK 313
Query: 324 --WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
W L Q + D LP L G E+ +DG
Sbjct: 314 GTWPSGLVQKNPAMT-----IDILPMLAHLTGAELPVKKIDG 350
>gi|114799529|ref|YP_761144.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114739703|gb|ABI77828.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 459
Score = 168 bits (425), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 126/355 (35%), Positives = 181/355 (50%), Gaps = 38/355 (10%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A K P+IIII+ADDLGW D+S +G++ I TPNID + G+ L Y + +C+PSR+AL
Sbjct: 35 AAKPPNIIIIMADDLGWGDISLNGAALIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAAL 94
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+TG+YPI GMQH VI GLP E + + LK AGY T +GKWHLG +E Y PT
Sbjct: 95 LTGRYPIRSGMQH-VIFPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLG-HQEEYWPT 152
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
+GFD YG Y + A F+ Y+G ++ + D + + Y +AA
Sbjct: 153 NQGFDWFYGV------PYSNDM-APFDLYRGKEIIES-PADQSQLSLN----YAKAAKEF 200
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
I E + KP FLY A H P P++ + Y +V +D +
Sbjct: 201 I-EDSSDKPFFLYYAETFPHI-----PLFVPEDRSGT------SDAGLYGDVVETVDAGI 248
Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
G V+ L + G+ ++++++F +DNG P F +GS RG K +GG R
Sbjct: 249 GIVLDTLDEAGVADDTLIIFTSDNG-PWF-----EGSAGEFRGRKGETHEGGFR--VPFL 300
Query: 325 SPWLKQTQK--VSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKT 376
+ W K VS E+ D LPT + +G + D +DG + +LT GA T
Sbjct: 301 ARWPGHIPKGSVSHEMAMNIDLLPTAASLSGATLPADRVIDGKDLTSLLTAGAPT 355
>gi|300114943|ref|YP_003761518.1| sulfatase [Nitrosococcus watsonii C-113]
gi|299540880|gb|ADJ29197.1| sulfatase [Nitrosococcus watsonii C-113]
Length = 463
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 128/413 (30%), Positives = 195/413 (47%), Gaps = 62/413 (15%)
Query: 2 TWARKYFFALTCTLLFNDAFLNTTAP------------KKPHIIIILADDLGWNDVSFHG 49
T+ R++F TL F A + + P +P+I++I+ADD+G+ DV +G
Sbjct: 5 TYTRRHFLK---TLGFAGASMALSWPIDSLSLVAERERGRPNIVLIVADDMGYGDVGCYG 61
Query: 50 SSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-- 106
+ I TPN+DALA G H LCTP+R+AL+TG Y +G+Q +I + + +
Sbjct: 62 NQHIKTPNLDALAKRGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLQ--IIPKDQRYAM 119
Query: 107 --GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH 164
+PL E + LK GY+T IGKWHLG R ++P+ +GFD ++G Y H
Sbjct: 120 AKAMPLAEITFAEALKAVGYSTALIGKWHLG-DRPSFSPSRQGFDEYFGI------PYSH 172
Query: 165 SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY-TEAAINVIAEHNKSKPMFLY----LA 219
+ + L + N ++ + Y TE A+ I++H KS P LY +
Sbjct: 173 DMHPWRKSFPPLPLMRNEEIIELNPDLDDMTQYCTEEAVQFISKH-KSNPFLLYMPHPMP 231
Query: 220 HLAVHAGNTY---------EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAA 270
H VH + Q D++ KFL Y+ + +D SVG VI
Sbjct: 232 HQPVHVSKRFAQRFSKEQLAAIQGEDKKSRKFL---------YSAAIEEIDWSVGEVIKT 282
Query: 271 LRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLK 329
++ G+ + ++V F +DNG P+ GS PLRG K W+GG R A W +
Sbjct: 283 IKSLGIEDKTLVAFTSDNG-PAI------GSAGPLRGKKGALWEGGHRVPFIACWRGKI- 334
Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
Q V E+ D PT+ A AG + + +DGVN +L +G K +
Sbjct: 335 QPNVVIDEIAMSMDLFPTMAAMAGASLPEKKIDGVNLLPLLCEGGKLSERTVF 387
>gi|423226077|ref|ZP_17212543.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392630595|gb|EIY24583.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 483
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 181/367 (49%), Gaps = 31/367 (8%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
T +KP+ + I DD+G++DVS +G ++ TPNIDALA G+ Y + + +PSR+
Sbjct: 29 TKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAG 88
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+TG+YP +G+Q GV G+ E + + LK GYAT IGKWHLG RE Y P
Sbjct: 89 FLTGRYPARMGIQ-GVFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SREKYLP 146
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
+GFD ++G +S + + Y + +D + T YTE A++
Sbjct: 147 LQQGFDEYFG--------IPYSNDMSAQVYLRGNEVEEFHIDINNV----TKKYTEEAVD 194
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
I +P FL+LAH +H P DE K Y V +D S
Sbjct: 195 YI-RRKADQPFFLFLAHSMMHV-----PIYVSDEFAGK------SGAGIYGDAVLEVDWS 242
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAA 322
VG ++ LR+ G+ +N++V+F +DNG P G PLR K+T ++GG+R A
Sbjct: 243 VGRIMETLRELGLDDNTLVVFTSDNG-PWLQEGPLGGRALPLREGKTTAFEGGVRVPCIA 301
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
W +K V++++ + DW PT+ A +G + D LDG + VL K +
Sbjct: 302 YWKGQIKPV--VNTDVVSLLDWFPTVTALSGGILPDVRLDGYDLTAVLNGTGKRASEDYA 359
Query: 383 HNIDNVD 389
+ +N D
Sbjct: 360 YFRNNRD 366
>gi|296121751|ref|YP_003629529.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296014091|gb|ADG67330.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 536
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 139/402 (34%), Positives = 194/402 (48%), Gaps = 52/402 (12%)
Query: 14 TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
+L N+ L + +P+++ ILADDLGW +V G S+IPTPNID LA G+ L +HY
Sbjct: 23 SLWANELTLISHQSPRPNVVFILADDLGWGEVGCFGQSKIPTPNIDRLASRGVKLTRHYS 82
Query: 74 QA-LCTPSRSALMTGKYPIHI---GMQHGVI-----LEGEPWGLPLTEKLL--PQYLKEA 122
A C PSR LMTGK+ H G Q + EG+ PL++K L + ++A
Sbjct: 83 GAPTCAPSRCVLMTGKHLGHAEIRGNQQAKVKLPQFTEGQ---HPLSDKALTIARQFQKA 139
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYY--------DHSCKATFEPY 173
GYAT A GKW LG P +GFD +GY Q L Y S +P
Sbjct: 140 GYATGAFGKWGLGPVGSTGEPNRQGFDEFFGYNCQALAHSYFPKALWKNAESIVNNEKPV 199
Query: 174 QGLDMRHNMQVDNKTIGI--YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG----- 226
G + +V + Y+ L A++ I H++ +P FLYL H
Sbjct: 200 PGHKKQPEGEVTMEAYQGENYAPRLIMAEALSFIDRHHQ-QPFFLYLPFTEPHVAMQPPP 258
Query: 227 NTYEPFQAP-DEEVAK----FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSI 281
E F DE V + +L P R YA M+ LD VG+VI +L KHG+LE ++
Sbjct: 259 KIVEEFPVEWDERVYRGDGGYLPHPRP-RAAYAAMIRDLDNHVGDVITSLEKHGLLEKTL 317
Query: 282 VLFMADNGA------PSFGIHSNK----GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT 331
++F +DNGA P F + S L+G K + ++GG+R V AI S W Q
Sbjct: 318 IVFTSDNGATHASANPDFHVGGADPLFFNSTRELKGFKGSIYEGGLR-VPAIVS-WPGQI 375
Query: 332 QKVSS--ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
++ + DW PTLC A + + + LDGVN +LT
Sbjct: 376 PPATTINTPSYFPDWFPTLCNATQLPLPE-GLDGVNLLPLLT 416
>gi|119475675|ref|ZP_01616028.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
gi|119451878|gb|EAW33111.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
Length = 479
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 187/376 (49%), Gaps = 43/376 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
+ P++III ADD+G+ D+ +G I +PN+D +A G+ Y ++CTPSR+ L+T
Sbjct: 36 QSPNVIIIFADDMGYGDIGAYGHPTIRSPNLDQMAAEGIKWTNFYAASSVCTPSRAGLLT 95
Query: 87 GKYPIHIGMQHG---VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
G+ P+ GM H V+ GLP TE + + LKE Y T +GKWHLG + P
Sbjct: 96 GRLPVRSGMAHDQIRVLFPTSTGGLPTTEITIAKALKEKDYRTALVGKWHLG-----HLP 150
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD-----------MRHNMQVDNKTIGIY 192
F+ D + + G+ DH K Q + M++ ++
Sbjct: 151 GFQPLDHGFDEYFGIPYSNDHDLKKELSYIQTITHAKDGDFNVPLMQNRSIIERPANQNT 210
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T YT+ A++ I + N ++P FLYLAH H P A D+ F SD R
Sbjct: 211 ITKRYTQEAVSFIKK-NSNQPFFLYLAHSMPHV-----PLFASDQ----FRGSSD--RGL 258
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y ++ +D SVG V++ L + G+ EN++V+F +DNG P + ++ GS L+ K T
Sbjct: 259 YGDVIEEIDWSVGQVLSTLSEQGISENTLVVFTSDNG-PWLIMGAHGGSAGLLKSGKGTS 317
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHIS----DWLPTLCAAAGIEI-NDTSLDGVNQW 367
++GGMR A W P +K+ + H + D PT+ + AGI++ +D S DG +
Sbjct: 318 YEGGMREPAIFWWP-----EKIKPAVAHNTASTLDLFPTIMSIAGIDMPSDRSYDGYDLS 372
Query: 368 DVLTKGAKTKRSEILH 383
+ + +R I +
Sbjct: 373 PTMFEQKSNERKNIFY 388
>gi|374619517|ref|ZP_09692051.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374302744|gb|EHQ56928.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 556
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 128/370 (34%), Positives = 175/370 (47%), Gaps = 58/370 (15%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSS----QIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
+ P+II+I+ADDLG+ND+S G ++ TP+ID LA +G++ Q Y A C PSR+
Sbjct: 63 RPPNIILIVADDLGYNDISTFGGGLDGGRVKTPHIDQLAADGVVFTQSYSGAGTCAPSRA 122
Query: 83 ALMTGKYPIHIGMQHGVILEG--------------------------------EPWGLPL 110
LMTG+YP G + G E GLP
Sbjct: 123 MLMTGRYPTRTGFEFTPTPSGMAPMLSRISAEMGRGTPSMIYDAALDESKPPYEQQGLPP 182
Query: 111 TEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL----QDYYDHSC 166
E + + LKE GYAT IGKWHLG ++ P +GFD GL D +
Sbjct: 183 EEVTIAEILKERGYATFHIGKWHLGR-QDGMAPHEQGFDQSLLMASGLFLPEDDPNVVNA 241
Query: 167 KATFEPY-QGLDMR-------HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
K F+P Q L R ++ D G Y TD +T+ +IN+I NK++P FLYL
Sbjct: 242 KLDFDPIDQFLWARMAFANSFNSGDQDRFEPGGYLTDYWTDESINII-NANKNRPFFLYL 300
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
H VH P QA E+ DI+ +R YA M+ LD SVG + AL + G+ +
Sbjct: 301 GHWGVHT-----PLQATREDYEAVGDITPHRKRVYAAMIRSLDRSVGRINKALEELGIAD 355
Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
N++V+F DNG P G N P RG K + ++GG+R + P VS E
Sbjct: 356 NTLVVFTNDNGGP--GYIGLPDINKPFRGWKISQFEGGIRVPLMMKWPARISPGTVSEEP 413
Query: 339 FHISDWLPTL 348
D +PTL
Sbjct: 414 VAHIDVMPTL 423
>gi|423214667|ref|ZP_17201195.1| hypothetical protein HMPREF1074_02727 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692573|gb|EIY85810.1| hypothetical protein HMPREF1074_02727 [Bacteroides xylanisolvens
CL03T12C04]
Length = 459
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 118/376 (31%), Positives = 188/376 (50%), Gaps = 37/376 (9%)
Query: 6 KYFFALTCTLLFNDAF--LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
++F A TC L AF + A K+P+ +II+ADD+G+ DV +G+ I TPNID +A
Sbjct: 6 RHFVAATCALA---AFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEYIRTPNIDQMAR 62
Query: 64 NGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVIL----EGEPWGLPLTEKLLPQY 118
G++ H ++ +P+R L+TG+Y G++ +++ + + GLP E +
Sbjct: 63 KGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLERVLLVPRNDKDKEVGLPSEEITFAKV 122
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
+ + GY T IGKWHLG+ ++ + F GF G+ G DY H + Y +D
Sbjct: 123 MGDNGYRTALIGKWHLGYLQKHHPMNF-GFQKFVGFKSGNVDYQSHRNR-----YGDMDW 176
Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
++V K + Y+T L T + + I E NK KP LY+AH A H+ P Q P E+
Sbjct: 177 WDGLEV--KDMPGYTTTLLTTLSEDYIKE-NKDKPFCLYIAHAAPHS-----PMQGPHEK 228
Query: 239 VAKFLDISDPERRT-------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
+ + ++ + Y MV LD SVG ++ L+K+ + EN+ V+F +DNG
Sbjct: 229 AIRTEATPEGDKNSDRSNKEIYKDMVEELDWSVGRILETLKKYKLDENTFVVFFSDNGP- 287
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
+ +N GS +G K +PW+G R + P + + D PT+
Sbjct: 288 ---VINNGGSAGGYKGAKGSPWEGAHRVPGICYMPGTIKEGTTCEQTVMSFDLFPTMLDM 344
Query: 352 AGIEINDT--SLDGVN 365
A I ND+ LDG +
Sbjct: 345 ADIHYNDSKKKLDGTS 360
>gi|381205411|ref|ZP_09912482.1| sulfatase [SAR324 cluster bacterium JCVI-SC AAA005]
Length = 448
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 126/412 (30%), Positives = 203/412 (49%), Gaps = 44/412 (10%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
K ++I+I+ DD G+ D+S G++ TPN+D LA G Y + +C+PSR+AL+TG
Sbjct: 2 KKNVIVIMTDDQGYGDLSCMGATDFKTPNLDRLANEGARFKCWYSNSPVCSPSRAALLTG 61
Query: 88 KYPIHIGMQHGVILEG--EPWGLPLTEKLLPQYLKEA-GYATHAIGKWHLGFFREVYTPT 144
+YP + G++ IL+G + GLP LP+ L++ GY T GKWHLG E P
Sbjct: 62 RYPANAGVR--AILDGHRKATGLPKETLTLPKILRDRLGYHTAMSGKWHLGL-AEGCRPQ 118
Query: 145 FRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEA 200
GF+ +G+ G D++ H G++ H++ V+NK + G Y T++ TE
Sbjct: 119 DHGFEQWFGFLAGCVDFFSHIFYWGMNREGPGMNPTHDLWVNNKEVWKNGEYLTEVITEY 178
Query: 201 AINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
+I + K SK FLY+ + A H P AP + + +F + +R+ A MVS
Sbjct: 179 SIQALRRAKKDSKSFFLYIPYNAPHY-----PMHAPQKYMDRFGGLP-WDRQVMAAMVSA 232
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGA-------------PSFGIHSNKGSNHPLR 306
+D+ +G + L + +E++I F +DNG P +G GS+ +
Sbjct: 233 VDDGIGEIFNELARLDFIEDTITFFTSDNGPSRESRNWLDGTLDPYYG-----GSSGGFK 287
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQ 366
G K + ++GG+R + P + K+ E F D LPT+C ++I+D LDG++Q
Sbjct: 288 GHKFSLFEGGVRVPGIVHYPGVVPASKIVEEPFCSFDMLPTICEWLHLDISDMELDGISQ 347
Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA-GTDNNGQSDE 417
L ++ +D+ ++R D K V G GQ DE
Sbjct: 348 AGFLEGKQPALERDLFWEMDD-------QTSIRRGDWKLVLNGQIVEGQPDE 392
>gi|410912979|ref|XP_003969966.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Takifugu
rubripes]
Length = 519
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 176/363 (48%), Gaps = 26/363 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSRSALMTGK 88
P+III+L DD+GW D+ G TPN+DA+A G L+LN + LC+PSR+AL+TG+
Sbjct: 27 PNIIIMLMDDMGWGDLGAFGQPSKETPNLDAMAAQGMLLLNFYTANPLCSPSRAALLTGR 86
Query: 89 YPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
P+ G ++ + G+ E LLPQ LK+ GY +GKWHLG R Y
Sbjct: 87 LPVRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQMLKKRGYFNKIVGKWHLGH-RPQY 145
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLY 197
P GFD +G Y++S + Y+ M ++D KT T +Y
Sbjct: 146 LPLEHGFDEWFGAPNCHFGPYNNSVRPNIPVYRNSWMLGRYYEEFKIDKKTGESNLTQMY 205
Query: 198 TEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
++ I ++ KP FLY A A HA P A + + K +R Y
Sbjct: 206 LLEGLDFIQSQAEAQKPFFLYWAPDATHA-----PVYASKDFLGK------SQRGRYGDA 254
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V LD SVG +++ LR+ G+ N+ V F +DNGA + GSN P K T ++GG
Sbjct: 255 VMELDYSVGQILSLLRRLGIDSNTFVFFTSDNGAALMSGPNESGSNGPFLCGKETTFEGG 314
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAK 375
MR A W P + VS +L ++ D T A AG+ +D ++DG++ VL
Sbjct: 315 MREPAIAWWPGHIREGTVSFQLANVMDLFTTSLALAGMSPPDDRTIDGMDLTPVLLHNTV 374
Query: 376 TKR 378
R
Sbjct: 375 LNR 377
>gi|423341845|ref|ZP_17319560.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
CL02T12C29]
gi|409219938|gb|EKN12897.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
CL02T12C29]
Length = 461
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 163/311 (52%), Gaps = 30/311 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+KP+I+I+LADD G+ D F GS+ I TPNID LA G I +V A + +PSRS ++T
Sbjct: 29 RKPNILILLADDAGYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLT 88
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y G + + G+ G+P E+LLP LK GY T IGKWHLG + P +
Sbjct: 89 GRYGQRYGYECNLDKPGD--GIPDDEELLPALLKRYGYRTGCIGKWHLG-SKPFQRPNAK 145
Query: 147 GFDSHYGYWQGLQDY-YDHSCK---ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
GFD+ YG G + Y YD + YQ +N Q + Y TD A
Sbjct: 146 GFDTFYGLLAGHRSYFYDPETSDKDGNLQQYQ-----YNGQ--KLSFDGYFTDELASKAR 198
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+AE +P LY++ A H+ P +A +E++A+F R+ YA M+ LD
Sbjct: 199 QFVAE--SEQPFMLYMSFTAPHS-----PNEATEEDLARF---EGQPRQKYAAMMYALDR 248
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVA 321
VG ++ L+ G +++I+ F++DNG G +N+ SN PL+G K ++GG R
Sbjct: 249 GVGKIVDELKAVGKFDDTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPFF 304
Query: 322 AIWSPWLKQTQ 332
+W K+ Q
Sbjct: 305 VVWGDRFKRDQ 315
>gi|224537481|ref|ZP_03678020.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520919|gb|EEF90024.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
DSM 14838]
Length = 525
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 181/367 (49%), Gaps = 31/367 (8%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
T +KP+ + I DD+G++DVS +G ++ TPNIDALA G+ Y + + +PSR+
Sbjct: 71 TKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAG 130
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+TG+YP +G+Q GV G+ E + + LK GYAT IGKWHLG RE Y P
Sbjct: 131 FLTGRYPARMGIQ-GVFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SREKYLP 188
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
+GFD ++G +S + + Y + +D + T YTE A++
Sbjct: 189 LQQGFDEYFG--------IPYSNDMSAQVYLRGNEVEEFHIDINNV----TKKYTEEAVD 236
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
I +P FL+LAH +H P DE K Y V +D S
Sbjct: 237 YI-RRKADQPFFLFLAHSMMHV-----PIYVSDEFAGK------SGAGIYGDAVLEVDWS 284
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAA 322
VG ++ LR+ G+ +N++V+F +DNG P G PLR K+T ++GG+R A
Sbjct: 285 VGRIMETLRELGLDDNTLVVFTSDNG-PWLQEGPLGGRALPLREGKTTAFEGGVRVPCIA 343
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
W +K V++++ + DW PT+ A +G + D LDG + VL K +
Sbjct: 344 YWKGQIKPV--VNTDVVSLLDWFPTVTALSGGILPDVRLDGYDLTAVLNGTGKRASEDYA 401
Query: 383 HNIDNVD 389
+ +N D
Sbjct: 402 YFRNNRD 408
>gi|305667515|ref|YP_003863802.1| N-acetylgalactosamine 6-sulfatase [Maribacter sp. HTCC2170]
gi|88709563|gb|EAR01796.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Maribacter sp. HTCC2170]
Length = 596
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 120/356 (33%), Positives = 187/356 (52%), Gaps = 45/356 (12%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP++++I+ DD GW D+SF+G++ + TPNIDA+A NG YVQ +C+P+R+ L+TGK
Sbjct: 36 KPNVVLIMTDDQGWGDLSFNGNTNLSTPNIDAIAKNGASFQNFYVQPVCSPTRAELLTGK 95
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y +G+ + GE + E + + K+AGY T A GKWH G + Y P RGF
Sbjct: 96 YAARLGV-YSTSTGGERFN--SKETTIAEIFKKAGYKTTAYGKWHSG-MQPPYHPNSRGF 151
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
D +YG+ G Y F P + HN ++ K G DL T ++ I E
Sbjct: 152 DDYYGFTSGHWGNY-------FSPM----LEHNGEIV-KGEGFLVDDL-TNKGLDFITE- 197
Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF----LDI--------SDPERRTYAGM 256
NK+ P FLYL + H+ P Q P+E +F LD+ S+ R M
Sbjct: 198 NKNNPFFLYLPYNTPHS-----PMQVPNEYWERFEKKKLDMRYQGNEEESENFTRAALAM 252
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V +D ++G + L++ G+ EN+I+++++DNG G N G +RG K + +GG
Sbjct: 253 VENIDFNMGRLTNKLKELGLEENTIIVYLSDNGPN--GWRWNGG----MRGRKGSTDEGG 306
Query: 317 MRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
+R I W + + +K+ S++ D LPTL + AG IN ++ ++ D+ T
Sbjct: 307 VRSPFFIQWKNTIPKNKKI-SQIAGAIDILPTLTSLAG--INQPTIKSIDGKDLKT 359
>gi|410030097|ref|ZP_11279927.1| sulfatase [Marinilabilia sp. AK2]
Length = 476
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/343 (36%), Positives = 176/343 (51%), Gaps = 32/343 (9%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
P+IIII DD G++DV G+S I TP++D LA G+ YV QA+C+ SR+AL+TG
Sbjct: 30 PNIIIIFTDDQGYHDVGVFGASDIATPHLDQLAAEGVQFTNFYVAQAVCSASRAALLTGV 89
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
Y +G+ HG + +GL E + LK GYAT GKWHLG + E + PT +GF
Sbjct: 90 YSNRLGI-HGALDHMSRYGLHPEEATIADILKPLGYATAMFGKWHLGHYPE-FLPTNQGF 147
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS--TDLYTEAAINVIA 206
D + G + +H + Y L + N +V + S T L+TE +I+ I
Sbjct: 148 DEYLGIPYSNDMWPNHPQTKDY--YPPLPLYQNDKVIDTIWNDQSMFTTLFTEKSIDFI- 204
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
E NK +P FLYLAH H P D KF S+ R Y ++ +D SVG
Sbjct: 205 ERNKDRPFFLYLAHPMPHV-----PLFVSD----KFKGKSN--RGLYGDVIMEIDWSVGQ 253
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI--- 323
++ L+ + + EN+I++F++DNG P + GS +PLR K T WDGG++ V I
Sbjct: 254 IMETLKHYDLDENTIIIFLSDNG-PWLSYSGHSGSAYPLREGKGTSWDGGVK-VPVIMRW 311
Query: 324 ---WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
W+ L Q S D LPTL G + + +DG
Sbjct: 312 KGTWASGLVQPNPAMS-----IDILPTLAYLTGASLPEKKIDG 349
>gi|375143552|ref|YP_005005993.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
gi|361057598|gb|AEV96589.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
Length = 482
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 131/364 (35%), Positives = 183/364 (50%), Gaps = 42/364 (11%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
KP+II ILADDLG+ D+ +G I TPN+DA+A G Q Y A+C PSRS+ +TG
Sbjct: 34 KPNIIFILADDLGYGDLGCYGQKLIQTPNLDAMAKQGTRFTQFYAGTAVCAPSRSSFLTG 93
Query: 88 KYPIH--IGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
++ H I GV EG+ W +P + + + LK+AGYAT GKW LG P
Sbjct: 94 QHTGHTPIRGNKGVQPEGQ-WPIPDSAVTIAEVLKKAGYATGDFGKWGLGPVASSGDPIK 152
Query: 146 RGFDSHYGY-WQGL-QDYY-DHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
+GFD+ YGY Q L DYY DH +E +D N YS DL + A+
Sbjct: 153 QGFDAFYGYNCQTLAHDYYPDH----LWENDTRIDFAANTPEHPTD---YSADLIHQKAL 205
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD----EEVAKFLD---ISDPE------ 249
I +H K P F++L++ HA Q PD E+ K L+ I P+
Sbjct: 206 QFIDQH-KGNPFFVFLSYTLPHAA-----LQVPDDSLFEKYKKLLNEKPIPVPKWNGKNY 259
Query: 250 ------RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GS 301
R YA MVSRLD VG V+ L++ G+ +N++V+F +DNG G + S
Sbjct: 260 APQAYPRAAYAAMVSRLDVYVGQVLQKLKEAGIDKNTLVIFSSDNGPHKEGGYDPDYFNS 319
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL 361
N PLRG+K ++GGMR W P + S + D+LPT A + ++
Sbjct: 320 NGPLRGIKRDLYEGGMREPMIAWWPGKIKAGATSDYVGAFWDFLPTFAELAK-ALKPANI 378
Query: 362 DGVN 365
DG++
Sbjct: 379 DGIS 382
>gi|355563805|gb|EHH20367.1| hypothetical protein EGK_03210 [Macaca mulatta]
Length = 507
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/384 (32%), Positives = 186/384 (48%), Gaps = 29/384 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL L YV A LCTPSR+AL+T
Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPASLCTPSRAALLT 78
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ +GM GV++ GLPL E L + L GY T GKWHLG E + P
Sbjct: 79 GRLPVRMGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPH 138
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
+GF G Y + +C P G + + N+ V+ + + +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLE 198
Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
Y A +++A+ + +P FLY A H Y F F + S R +
Sbjct: 199 ARYVAFARDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 247
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
+ LD +VG ++ A+R G+LE ++V+F ADNG + + S G + LR K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 306
Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
+GG+R A A W + V+ EL D LPTL A AG + + +LDG + +L
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364
Query: 373 GAKTKRSEILHNIDNVDNPQKYYA 396
K+ R + + D + +A
Sbjct: 365 TGKSPRQSLFFYPSDPDEVRGVFA 388
>gi|241143299|ref|XP_002404958.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215493668|gb|EEC03309.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 166
Score = 167 bits (423), Expect = 1e-38, Method: Composition-based stats.
Identities = 83/161 (51%), Positives = 110/161 (68%), Gaps = 1/161 (0%)
Query: 213 PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALR 272
P+FLYL+HLAVHAG Y P QAP E + KF I R YAG V LD+SVG V AL
Sbjct: 1 PLFLYLSHLAVHAGTQYGPIQAPKENLRKFDYIGVKNRSLYAGAVDALDQSVGIVFQALE 60
Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
K ML+N++++F++DNGA +G SN GSN PLRG K T W+GG+R A +W P L++ Q
Sbjct: 61 KKDMLQNTMLVFISDNGAIPWGEWSNAGSNWPLRGAKFTLWEGGVRVPAFVWGPLLRKPQ 120
Query: 333 KVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTK 372
+VS++L H+SDW+PTL + AG I D +DG+N W L++
Sbjct: 121 RVSNQLMHVSDWVPTLYSLAGGNIADLGVIDGLNMWQALSR 161
>gi|414070344|ref|ZP_11406330.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410807261|gb|EKS13241.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 470
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 119/394 (30%), Positives = 195/394 (49%), Gaps = 28/394 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
+KP+I++++ADD G++D F GS + TP ID LA ++ Q YV A+C PSR+ L T
Sbjct: 25 EKPNILLVIADDAGYHDFGFQGSEVMQTPTIDKLASQSVVFEQAYVTAAVCGPSRAGLYT 84
Query: 87 GKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
GKY G + + G+ GLP T+ + ++LKE GY T GKWH G
Sbjct: 85 GKYQQRFGFEENNVPGYMSKSGFTGDKMGLPFTQVTMAEHLKELGYHTGLFGKWHQG-NH 143
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
+ Y PT RGFD+ YG+ +G + Y+ +S + + Y M + + + G + L T
Sbjct: 144 DDYHPTKRGFDNFYGFREGARGYFAYSNEEQ-QAYPSQKMERDFKHYIEHEGYLTDALAT 202
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
+ + + +P F L+ AVHA P QA + ++A+F ++ +R+ A M
Sbjct: 203 QTSHFIGQSVVNKQPFFAVLSFSAVHA-----PMQATNADLAQFPHLTG-KRKILAAMNF 256
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
+D ++ V+ L + +N++V+F+ DNG P+ N +N+PL G K+ +GG+R
Sbjct: 257 AMDRAIAVVLDKLNALKISDNTLVVFINDNGGPT---DQNYANNYPLSGTKANHLEGGIR 313
Query: 319 GVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKT 376
+ W L + D LPT AA ++ + LDGV+ L +
Sbjct: 314 VPMLLKWPKRLSSNNTRYTYPVSSLDLLPTFIHAANGKVTEQMQLDGVDLMPFLLSKSTQ 373
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTD 410
+ + L+ ++ AA+R DLK + D
Sbjct: 374 RPHQTLY------WKKESRAAIRDKDLKLLRFPD 401
>gi|343086188|ref|YP_004775483.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354722|gb|AEL27252.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 491
Score = 167 bits (423), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 121/351 (34%), Positives = 179/351 (50%), Gaps = 29/351 (8%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSALM 85
P P+I+II DD G+ DV G+ I TPN+D +A NG+ N H A+C+ SR++++
Sbjct: 40 PSPPNIVIIFTDDQGYEDVGVFGAKDIRTPNLDRMANNGVQFTNFHVANAVCSASRASIL 99
Query: 86 TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
TG Y +G+ G + + GL E + + LK GYAT +GKWHLG E + PT
Sbjct: 100 TGCYSNRLGI-FGALSDRSKHGLSPEETTIAEMLKPKGYATGIVGKWHLGHL-EPFLPTK 157
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAI 202
+GFD +G Y + Y+ + Y L + N +D T T YTE A+
Sbjct: 158 QGFDEFFGLPYSNDMWPYHPERP----DDYPPLPLYQNETIIDTLTEQSMLTTWYTEKAV 213
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
+ I + N +P FLYLAH H P D+ K E Y ++ +D
Sbjct: 214 DFI-DRNSHQPFFLYLAHNMPHV-----PLFVSDKFKGK------SEAGIYGDVIMEIDW 261
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
SVG V+ AL+K+G+ EN++V+F +DNG S+G H+ GS + K T W+GG+R V
Sbjct: 262 SVGQVMEALKKNGLEENTLVIFTSDNGPWLSYGTHA--GSTGIFKEGKGTSWEGGIR-VP 318
Query: 322 AIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
AI W + K V ++ D LPT+ G + ++DG N W ++
Sbjct: 319 AIMQ-WKGKIPKGEVQEKMAMTIDLLPTIAHITGAPLPQKTIDGKNIWPLI 368
>gi|343086300|ref|YP_004775595.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354834|gb|AEL27364.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 454
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 133/416 (31%), Positives = 211/416 (50%), Gaps = 36/416 (8%)
Query: 9 FALTCTLLFNDAFLNTTAP--KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
LTC +L T +KP+I+II DD G++DVS++G+ I TP+IDA+ +G+
Sbjct: 9 LGLTCMMLGLGIVHAQTIKNNEKPNILIIFTDDQGYHDVSYYGTEDIQTPHIDAIRNDGM 68
Query: 67 ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEP---WGLPLTEK-LLPQYLKE 121
+ Y + +C P+R++LM+G+YP H+G+ ++ P WG E LLP LK+
Sbjct: 69 QFDYFYTNSPVCAPTRASLMSGRYPDHVGVPG--LVRSNPANNWGYLNPETILLPAMLKK 126
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
GY T IGKW+LG P GFD +G+ + + D Y T + MR N
Sbjct: 127 GGYHTAHIGKWNLGLMHP-NLPNDHGFDYFHGWLEDMMDDY-----LTHRRHGKNYMRLN 180
Query: 182 MQ-VDNKTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
+ +D K ++TDL+T+ ++N I E K +P FLYLA+ A H P Q P V
Sbjct: 181 EEIIDPKG---HATDLFTQWSVNYIKEREKEERPFFLYLAYNAPHF-----PVQPPKSWV 232
Query: 240 AKF--LDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
K + PE+R + LD+ +G VI AL++ G EN++++F +DNG G
Sbjct: 233 EKVNKREPGLPEKRANLIAFIEHLDDGIGQVINALKETGQYENTLIIFTSDNG----GHL 288
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
+ +N LR K + ++GG+R AI P + S ++ D PTL AG+++
Sbjct: 289 PDLANNGLLRDGKQSMYEGGLRVPTAISWPAKIKAGSTSDQVNLSMDIFPTLTEIAGVDL 348
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKYVAGT 409
D +++G + L + + L+ + + K Y ALR+ D K + +
Sbjct: 349 -DHAINGRSFLPTLLGQNMKEEARPLYFVRREGGTRYGGKAYHALRLGDWKLLQNS 403
>gi|223936836|ref|ZP_03628745.1| sulfatase [bacterium Ellin514]
gi|223894405|gb|EEF60857.1| sulfatase [bacterium Ellin514]
Length = 477
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 135/392 (34%), Positives = 185/392 (47%), Gaps = 52/392 (13%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILN 69
L LL L+T A KP+I+ ILADDLG+ DV+ +GS TPNID LA +G+ +
Sbjct: 4 LLSLLLMAVFCLSTKAANKPNIVFILADDLGYTDVACYGSKYYETPNIDKLAKDGIKFTD 63
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW------------GLPLTEKLLPQ 117
H C P+R++LM+G+Y G+ ++ W LPL + L Q
Sbjct: 64 GHTCGPNCQPTRASLMSGQYGPRTGVYTVGSIDRFAWQTRSLHPVENVTKLPLDKITLAQ 123
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
LK+AGYAT GKWHLG +E + P RGFD + L H D
Sbjct: 124 SLKKAGYATGMFGKWHLGEDKE-HHPAQRGFD------EALVSMGVH-----------FD 165
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
N +VD Y D T+ A++ I H K +P FLYL H AVH +P QA E
Sbjct: 166 FVTNPKVDYPK-DEYLADFLTDKALDFIKRH-KDEPFFLYLPHYAVH-----KPLQAKKE 218
Query: 238 EVAKF---LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
+ KF + TYA M++ +DESVG V+A L + + +N++V+F +DNG G
Sbjct: 219 LIQKFSAKQGVDGHHNPTYAAMIASVDESVGRVVALLDELKLSDNTLVIFSSDNGG--VG 276
Query: 295 IHSNKG--------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLP 346
+ +G N+PLRG K ++GG R P KV + D P
Sbjct: 277 GYQREGIKKAGDVTDNNPLRGGKGMLYEGGHRVPYIFRWPGKIPAGKVCDQPIISIDLYP 336
Query: 347 TLCAAAGIEINDT-SLDGVNQWDVLTKGAKTK 377
TL AG + + LDG + VL G K
Sbjct: 337 TLLELAGAKAPEKYPLDGTSYLKVLKSGGMKK 368
>gi|355785121|gb|EHH65972.1| hypothetical protein EGM_02852 [Macaca fascicularis]
Length = 507
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 125/384 (32%), Positives = 184/384 (47%), Gaps = 29/384 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL L YV A LCTPSR+AL+T
Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPAPLCTPSRAALLT 78
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ GM GV++ GLPL E L + L GY T GKWHLG E + P
Sbjct: 79 GRLPVRTGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPH 138
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
+GF G Y + +C P G + + N+ V+ + + +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLE 198
Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
Y A +++A+ + +P FLY A H Y F + AK R +
Sbjct: 199 ARYVAFARDLMADAQRQDRPFFLYYASHPTH----YPQFSG--QNFAK-----RSGRGPF 247
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
+ LD +VG ++ A+R G+LE ++V+F ADNG P S G + LR K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNG-PETMRMSRGGCSGLLRCGKGTTY 306
Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
+GG+R A A W + V+ EL D LPTL A AG + + +LDG + +L
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364
Query: 373 GAKTKRSEILHNIDNVDNPQKYYA 396
K+ R + + D + +A
Sbjct: 365 TGKSPRQSLFFYPSDPDEVRGVFA 388
>gi|168701882|ref|ZP_02734159.1| Arylsulfatase [Gemmata obscuriglobus UQM 2246]
Length = 467
Score = 167 bits (422), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 124/378 (32%), Positives = 193/378 (51%), Gaps = 38/378 (10%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP+I++I+ADDLG ++ +G ++I TP+ID LA G + Y + +C PSR LMTG
Sbjct: 25 KPNIVLIVADDLGCFELGCYGQTKIKTPHIDKLAQGGAKFTRFYSGSPVCAPSRCVLMTG 84
Query: 88 KYPIHIGMQHGVILEGEPWG-LPLTEK--LLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
K+ H +++ V E +P G P+ + + LK GYAT A+GKW LG F +P
Sbjct: 85 KHSGHATVRNNV--EAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLGMFDTAGSPL 142
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
GFD +GY + + H + + ++++ N D KT ++ DL+ E A+
Sbjct: 143 KHGFDLFFGY-NCQRHAHSHYPTYIYRNDKRVELKGN---DGKTGKQFTQDLFEEEALGF 198
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI--SDPE------------- 249
I E NK+KP FLYL H Q P++ + ++ DP
Sbjct: 199 I-EANKAKPFFLYLPFTVPHVA-----VQVPEDSLNEYKGQLGDDPAYDGKKGYQPHPAP 252
Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI----HSNKGSNHPL 305
YA MV+R+D SVG V+ L G+ +N++VLF +DNG P+ + S S L
Sbjct: 253 HAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDNG-PTHNVGGADSSFFNSAGKL 311
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
RG+K + ++GG+R + P + S + D LPTLCA AG + +++DG++
Sbjct: 312 RGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTLCAFAGTKA-PSAIDGIS 370
Query: 366 QWDVLTKGAKTKRSEILH 383
+L KG K + L+
Sbjct: 371 FLPLL-KGEKQPTHDFLY 387
>gi|109094666|ref|XP_001113032.1| PREDICTED: arylsulfatase A isoform 2 [Macaca mulatta]
gi|109094668|ref|XP_001113064.1| PREDICTED: arylsulfatase A isoform 3 [Macaca mulatta]
gi|109094670|ref|XP_001113092.1| PREDICTED: arylsulfatase A isoform 4 [Macaca mulatta]
Length = 507
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 125/384 (32%), Positives = 184/384 (47%), Gaps = 29/384 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL L YV A LCTPSR+AL+T
Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPASLCTPSRAALLT 78
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ GM GV++ GLPL E L + L GY T GKWHLG E + P
Sbjct: 79 GRLPVRTGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPH 138
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
+GF G Y + +C P G + + N+ V+ + + +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLE 198
Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
Y A +++A+ + +P FLY A H Y F F + S R +
Sbjct: 199 ARYVAFARDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QNFAERSG--RGPF 247
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
+ LD +VG ++ A+R G+LE ++V+F ADNG P S G + LR K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNG-PETMRMSRGGCSGLLRCGKGTTY 306
Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
+GG+R A A W + V+ EL D LPTL A AG + + +LDG + +L
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364
Query: 373 GAKTKRSEILHNIDNVDNPQKYYA 396
K+ R + + D + +A
Sbjct: 365 TGKSPRQSLFFYPSDPDEVRGVFA 388
>gi|427388367|ref|ZP_18884250.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
12058]
gi|425724950|gb|EKU87824.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
12058]
Length = 457
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 186/348 (53%), Gaps = 32/348 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
+KP+I++I+ DDLG D+S + + TPNID L +G+ LN Y ++ +PSR+ L+T
Sbjct: 35 QKPNILLIMVDDLGLGDLSCQYARDLQTPNIDKLFSDGVRLNNFYANSSVSSPSRAGLLT 94
Query: 87 GKYPIHIGMQHGVILE-GEPWGLPLTEK-LLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
G YP +G+ + E WG E LLPQ +K+A Y T IGKWHLG E P
Sbjct: 95 GCYPDMVGVPGVIRTETTASWGYLSPEAILLPQMMKKAEYNTAIIGKWHLG-LEEPNLPN 153
Query: 145 FRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGFD +G+ + D YY H + MR N +V T ++TDL++ A++
Sbjct: 154 SRGFDFFHGFLGDMMDNYYTHLRRGHNY------MRLNEEVIEPT--GHATDLFSSWALD 205
Query: 204 VIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD----ISDPERRTYAGMVS 258
KSK P FLYLA+ A H P Q P E + + + IS+ +R ++
Sbjct: 206 YFDTMRKSKVPFFLYLAYNAPHV-----PLQPPTEWLERVREREPGISE-KRAKLVALIE 259
Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
LD ++G V AL + G L+N++++F +DNG + G +N G P+RG K ++GG++
Sbjct: 260 HLDYNIGRVYKALEESGQLDNTLIIFCSDNGGDA-GAEANNG---PVRGSKGDMYEGGIK 315
Query: 319 GVAAIWSPWLKQTQKVS-SELFHISDWLPTLCAAAGIEINDTSLDGVN 365
A++ W + S +L +SD PTLC I +N +DG++
Sbjct: 316 VPCAVY--WRNHLKPRSIDDLVIMSDIFPTLCDLVSIPVNH-RIDGIS 360
>gi|440718008|ref|ZP_20898478.1| arylsulfatase A [Rhodopirellula baltica SWK14]
gi|436436890|gb|ELP30586.1| arylsulfatase A [Rhodopirellula baltica SWK14]
Length = 479
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 132/425 (31%), Positives = 203/425 (47%), Gaps = 44/425 (10%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+ A+T LF ++ +P++++ILADDLG+ D+S +G+ I TPNID +A G
Sbjct: 2 RTIVAITFVCLFVAQVCAESSEGRPNVVVILADDLGYGDLSCYGAEDISTPNIDRMATEG 61
Query: 66 LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+ YV ++C+P+R++LMTG + +G+ GV+ GL E LP+ LK GYA
Sbjct: 62 AKFSSFYVSSVCSPTRASLMTGSHSTRVGI-GGVLFPRNNHGLNSDEITLPELLKNQGYA 120
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYG--------YWQGLQDYYDHSCKATFEPYQGLD 177
T IGKWHLG +++ P GFD YG Y+ ++ Y +G+
Sbjct: 121 TAIIGKWHLG-NEDMFQPLNHGFDYWYGTPSSNSQFYYPTIKKYAADCVFREGFTREGIL 179
Query: 178 MRHNMQV----DNKTIGIYS-----TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
R DN I + + T YT I I + NK K F+YL H H
Sbjct: 180 KRETAACPLIEDNIVIEVPADQTQFTQRYTRETIRFITQ-NKDKSFFVYLGHNMPHI--- 235
Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
P A + KF+ S +R Y + LD S G ++ AL++ G+ +N++V+F +DN
Sbjct: 236 --PLHASE----KFVGSS--KRGIYGDTIQELDWSTGEILRALKELGLDQNTLVIFTSDN 287
Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPT 347
G P+ G GS L+G K + +GG+R A W + + + + D LPT
Sbjct: 288 G-PNTG---KGGSAGVLKGGKGSTLEGGVRVPFVARWPGTIPAGTEFDEAITGM-DLLPT 342
Query: 348 LCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRS-EILHNIDNVDNPQKYYAALRVDDLKY 405
L AG E+ +D +DG + +L K E ++ + + +RV D KY
Sbjct: 343 LTKLAGGEVPDDRVIDGKDIGSLLAGKPDAKSPHEAIYYLRG-----RGVDGIRVGDWKY 397
Query: 406 VAGTD 410
TD
Sbjct: 398 RMATD 402
>gi|87309449|ref|ZP_01091584.1| arylsulfatase A (precursor) [Blastopirellula marina DSM 3645]
gi|87287757|gb|EAQ79656.1| arylsulfatase A (precursor) [Blastopirellula marina DSM 3645]
Length = 478
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 123/374 (32%), Positives = 179/374 (47%), Gaps = 28/374 (7%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
+LT +L F + P+I++I DD+G+ D+ G+ PTP++D LA G I
Sbjct: 5 SLTLSLGFATVTVAAEKNATPNIVVIFIDDMGYADIGPFGAKDYPTPHLDQLAQEGTICT 64
Query: 70 QHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
YV QA+C+ SR+ L+TG Y IG+ G + G+ E L + K+ GYAT
Sbjct: 65 DFYVTQAVCSASRAGLLTGCYNNRIGIL-GALGPQSKIGISAEETTLAEICKQKGYATAC 123
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEP----YQGLDMRHNM 182
GKWHLG E + P GFD + G Y + Y+ T + Y L +
Sbjct: 124 YGKWHLGHHEE-FLPLQHGFDDYVGLPYSNDMWPYHPELRHLTKDQQQKRYPDLPLYEKN 182
Query: 183 QVDNKTIGIYS----TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
++ + + T LYTE A+ I + N ++P FLY+ H VH P D+
Sbjct: 183 EIIDTEVTPEDQRNLTTLYTEKAVKFI-DDNHAQPFFLYVPHSMVHV-----PLYVSDKF 236
Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHS 297
K + +V +D SVG +I ALR++ + EN++V+F ADNG S+G H+
Sbjct: 237 AGK------SGAGLFGDVVMEVDWSVGQIIEALRRNMLDENTLVIFTADNGPWLSYGDHA 290
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN 357
GS PLR K T +DGG R W P K S D LPT+ G ++
Sbjct: 291 --GSAGPLREGKGTMFDGGCREPTIFWQPGTIPAGKTCSTPMMTIDILPTVADLIGAKLP 348
Query: 358 DTSLDGVNQWDVLT 371
D +DG N W ++T
Sbjct: 349 DHKIDGKNIWPIVT 362
>gi|281346853|gb|EFB22437.1| hypothetical protein PANDA_020197 [Ailuropoda melanoleuca]
Length = 520
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 120/380 (31%), Positives = 179/380 (47%), Gaps = 28/380 (7%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-V 73
+L F PK P+I+++L DD+GW D+ +G TPN+D +A G++ Y
Sbjct: 11 VLSAAGFGAMGTPKPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSA 70
Query: 74 QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
LC+PSR+AL+TG+ PI G ++ + G+P E LLP+ LK AGYA+
Sbjct: 71 NPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAGYAS 130
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNM 182
+GKWHLG R + P GFD +G YD+ + Y+ +M
Sbjct: 131 KIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEF 189
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
++ +T T +Y + A++ + + +P FLY A H AP
Sbjct: 190 PINLQTGEANLTQVYLQEALDFMKRQQVAQRPFFLYWAIDGTH---------APVYASRP 240
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP--SFGIHSNK 299
FL S R Y V +D+SVG +++ LR + EN+ V F +DNGA S
Sbjct: 241 FLGTS--RRGRYGDAVREIDDSVGKILSLLRDLRITENTFVFFTSDNGAALISAPKQGQC 298
Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-IND 358
GSN P K T ++GGMR A W P +VS +L + D T A AG+ +D
Sbjct: 299 GSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLALAGLAPPSD 358
Query: 359 TSLDGVNQWDVLTKGAKTKR 378
+DG++ + +G T R
Sbjct: 359 RVIDGLDLLPAMLRGQLTDR 378
>gi|300773187|ref|ZP_07083056.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
gi|300759358|gb|EFK56185.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
Length = 443
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/361 (31%), Positives = 174/361 (48%), Gaps = 25/361 (6%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+Y L LL A N++A +P+ III DD+G+ DV +G+ I TPN+D +A G
Sbjct: 5 RYLITLFICLL---AVFNSSAQTQPNFIIIYVDDMGYGDVGINGNPNIETPNLDRMAMEG 61
Query: 66 LILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
+ + +Y CT SR AL+TGKYP G + V+ + G+ E + + LKE GY
Sbjct: 62 MRFSNYYSASPACTASRYALLTGKYPSRAGFRW-VLNPTDQIGIHQQESTIAERLKEKGY 120
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
T GKWHLG R+ + P GFD + G Y + + L +++
Sbjct: 121 RTAIYGKWHLGSTRKEFLPLANGFDEYVGL-----PYSNDMIPPKYPDIALLSGYDTLEL 175
Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
+ + T LYTE AI I + N +P F+YL + H P A ++ + K
Sbjct: 176 NPDQSKL--TRLYTEKAIAFITK-NAKQPFFIYLPYAMPHT-----PLHASEDFLGK--- 224
Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
+R Y +V LD +G ++ L+++ + + + V+F +DNG P + N GS
Sbjct: 225 ---SKRGLYGDVVQELDHHIGRLLTFLKENKLDQQTYVVFTSDNG-PWLIQNQNGGSAGL 280
Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
R K + W+GGMR +W V +E+F D LPT+ A AGI +DG
Sbjct: 281 FRDGKGSTWEGGMREPFFLWGHHTIPKGYVENEVFTALDMLPTITALAGISAGPNKIDGT 340
Query: 365 N 365
N
Sbjct: 341 N 341
>gi|302370951|ref|NP_001180574.1| N-acetylgalactosamine-6-sulfatase isoform 2 precursor [Mus
musculus]
gi|26329565|dbj|BAC28521.1| unnamed protein product [Mus musculus]
Length = 440
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 157/313 (50%), Gaps = 25/313 (7%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
P+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+
Sbjct: 25 PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 84
Query: 86 TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
TG+ PI G ++ + G+P +E LLP+ LK+AGY +GKWHLG R
Sbjct: 85 TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 143
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
+ P GFD +G YD+ K Y+ +M ++ KT T
Sbjct: 144 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 203
Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
LYT+ A++ I +H + P FLY A A HA P +FL S R Y
Sbjct: 204 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 252
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D+SVG +++ L+ G+ +N+ V F +DNGA + GSN P K T +
Sbjct: 253 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 312
Query: 314 DGGMRGVAAIWSP 326
+GGMR A W P
Sbjct: 313 EGGMREPAIAWWP 325
>gi|116620007|ref|YP_822163.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116223169|gb|ABJ81878.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 443
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 116/342 (33%), Positives = 176/342 (51%), Gaps = 35/342 (10%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A +P++++++ DDLG +D+ + G++ + TP+IDALA GL Y A +C P+RSA+
Sbjct: 23 AADRPNVLVVVLDDLGCHDLGYLGAADLKTPHIDALAARGLKFRNWYSNAPVCAPARSAI 82
Query: 85 MTGKYPIHIGM-QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+TG++P G+ +G L G+P L LK +GY T GKWHLG E P
Sbjct: 83 LTGRFPASAGVPDNGPALA---HGIPT----LASVLKGSGYQTGCFGKWHLGSTDET-AP 134
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEA 200
T GFDS YG+ G DYY H Y G D H++ + I G Y T+ +
Sbjct: 135 TGHGFDSFYGFHSGCVDYYSHRF------YWG-DNYHDLWHNRTEIFEDGRYLTERIADE 187
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A I +++P Y+A A H P AP + A+F +++ PER+TYA M++ +
Sbjct: 188 AAGFIG---RNRPFLGYVAFNAPH-----YPMHAPAQYKARFPNLA-PERQTYAAMIAAV 238
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPS------FGIHSNKGSNHPLRGMKSTPWD 314
D+ +G + AL G EN+++ F+ DNGA + G + G N +G K + +D
Sbjct: 239 DDGIGQIQRALETTGAAENTLMFFIGDNGATTEKRAGLNGDFATAGDNGVFKGYKFSLFD 298
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
GGM + P + + EL D LPT+C A G +
Sbjct: 299 GGMHVPGFVSWPAGIRKGGWTDELAMSMDILPTICRATGAPL 340
>gi|50753991|ref|XP_414208.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gallus gallus]
Length = 513
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 117/371 (31%), Positives = 178/371 (47%), Gaps = 28/371 (7%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPS 80
L AP P+++++L DD+GW D+ G TPN+D +A G++ Y LC+PS
Sbjct: 15 LGCAAP--PNVVLLLMDDMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAANPLCSPS 72
Query: 81 RSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
R+AL+TG+ P+ G ++ + G+ +E LLP+ LK+AGY IGKWH
Sbjct: 73 RAALLTGRLPVRNGFYTTNAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWH 132
Query: 134 LGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTI 189
LG R + P GFD +G YD+ Y+ +M + ++D +T
Sbjct: 133 LGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRALPNIPVYRDWEMIGRYYEDFKIDLRTG 191
Query: 190 GIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
T +Y + A++ I++ S+ P FLY A A HA P FL S
Sbjct: 192 EANLTQIYLQEALDFISKQQASQQPFFLYWAIDATHA---------PVYASKHFLGTS-- 240
Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
+R Y V +D+SVG ++ L+K G+ EN+ V F +DNGA GSN P
Sbjct: 241 QRGRYGDAVREIDDSVGKILKHLQKLGISENTFVFFTSDNGAALISAPKQGGSNGPFLCG 300
Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQW 367
K T ++GGMR A W P VS +L + D T + G++ +D +DG++
Sbjct: 301 KQTTFEGGMREPAIAWWPGHIPAGSVSRQLGSVMDLFTTSLSLVGLQPPSDRQIDGIDLL 360
Query: 368 DVLTKGAKTKR 378
+ +G R
Sbjct: 361 PAILQGKLIDR 371
>gi|149195951|ref|ZP_01873007.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149140798|gb|EDM29195.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 464
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 119/363 (32%), Positives = 180/363 (49%), Gaps = 46/363 (12%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+++I DD G DV+ +GS + TPN+D LA +G+ Q Y +C P+R+ LMTG+
Sbjct: 31 KPNLVIFFTDDQGTLDVNCYGSKDLYTPNMDKLAEDGVRFTQAYAHQVCCPARAMLMTGR 90
Query: 89 YPIHIGMQHGVILEGEPWG-----LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
+P + H +G+ G + L E L + LK++GY T GKWHLG + Y P
Sbjct: 91 HPQRSNVNHWT--QGDAKGPKTRNMNLEEYTLAEALKDSGYKTALFGKWHLGAHLD-YGP 147
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEA 200
T +GFD YG G D Y+H F +G H++ K + G Y +L T+
Sbjct: 148 TKQGFDEFYGIRGGFIDNYNH----YFLHGEGF---HDLYEGTKEVFDEGKYFPNLVTDR 200
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
A+N I + NK+ P FL+LA H P QA + ++ ++ P R++YA M+S
Sbjct: 201 ALNFI-DRNKNNPFFLFLAFNIPHY-----PEQADPKFDERYKNMKMP-RQSYAKMISTT 253
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH----------------- 303
D+ +G +++ L++HG+ +N+I++FM+DNG H K NH
Sbjct: 254 DDHMGQIMSKLQEHGIYDNTIIIFMSDNGHSRERNHI-KFDNHKSGLAKNTKYGALGGGG 312
Query: 304 ---PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
RG KS ++GG+R A I P V + DW+PT+ IE
Sbjct: 313 NTGKWRGNKSNFYEGGIRVPAIITFPNKLPKGAVRDQAITAMDWMPTVLELCNIEPPKIK 372
Query: 361 LDG 363
DG
Sbjct: 373 FDG 375
>gi|148679746|gb|EDL11693.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_a [Mus
musculus]
Length = 462
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 157/313 (50%), Gaps = 25/313 (7%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
P+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+
Sbjct: 47 PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 106
Query: 86 TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
TG+ PI G ++ + G+P +E LLP+ LK+AGY +GKWHLG R
Sbjct: 107 TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 165
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
+ P GFD +G YD+ K Y+ +M ++ KT T
Sbjct: 166 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 225
Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
LYT+ A++ I +H + P FLY A A HA P +FL S R Y
Sbjct: 226 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 274
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
V +D+SVG +++ L+ G+ +N+ V F +DNGA + GSN P K T +
Sbjct: 275 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 334
Query: 314 DGGMRGVAAIWSP 326
+GGMR A W P
Sbjct: 335 EGGMREPAIAWWP 347
>gi|404486643|ref|ZP_11021833.1| hypothetical protein HMPREF9448_02276 [Barnesiella intestinihominis
YIT 11860]
gi|404336461|gb|EJZ62922.1| hypothetical protein HMPREF9448_02276 [Barnesiella intestinihominis
YIT 11860]
Length = 478
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 130/420 (30%), Positives = 194/420 (46%), Gaps = 62/420 (14%)
Query: 5 RKYFFALTC-TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
+K F ++ +L + + + +K ++I ILADDLG+ D+ +G I TPNID LA
Sbjct: 2 KKVFLSVALLGVLISSSCTTKSKKQKMNVIYILADDLGYGDLGCYGQKIIKTPNIDRLAE 61
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQ----HGVILEGEPWGLPLTEKLLPQY 118
G++ QHY + PSRS LMTG++ H ++ HG E EP LP L +
Sbjct: 62 EGMLFTQHYSGCTVSAPSRSCLMTGQHTGHTYIRGNQSHGTEGE-EP--LPGNTYTLARM 118
Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
+K+AGYAT A GKW LG+ PT GFD +GY +C+ Y +
Sbjct: 119 MKDAGYATGAFGKWGLGYPCSEGDPTNLGFDEFFGY----------NCQRQAHHYYPYHL 168
Query: 179 RHNMQV------DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA------- 225
HN + + Y+ DL E A+ I + N+SKP FLYL ++ HA
Sbjct: 169 WHNQEKVMLPGNEGSKTETYAQDLIQEKALQFIVD-NQSKPFFLYLPYILPHAELVSPED 227
Query: 226 ------------GNTYEP---FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAA 270
G YE + P + + +P +A MV+R D VG ++
Sbjct: 228 SILAMYKGKIEEGKPYEGVDDIKNPYYKYGGYCSSENP-HADFASMVTRFDAYVGEIMQT 286
Query: 271 LRKHGMLENSIVLFMADNGAPSFGIHSNKGSN-------HPLRGMKSTPWDGGMRGVAAI 323
L++ G+ +N+IV F +DNG H G++ P RG+K ++GG+R
Sbjct: 287 LKRLGLDKNTIVFFASDNGP-----HQEGGADPDFFNSYGPFRGIKRDMYEGGIRVPLIA 341
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
W+P + S + D +PTL AG EI DG++ L G K+ E L+
Sbjct: 342 WAPGKVEAGTCSDHISAFWDVMPTLADIAGGEIKG-ECDGISFAPTLFGGKGQKQHEYLY 400
>gi|261404208|ref|YP_003240449.1| sulfatase [Paenibacillus sp. Y412MC10]
gi|261280671|gb|ACX62642.1| sulfatase [Paenibacillus sp. Y412MC10]
Length = 452
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 109/350 (31%), Positives = 181/350 (51%), Gaps = 22/350 (6%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K+P+ I+I DDLG+ D+ +GS + TP++D LA G+ Y + +C+PSR++L+T
Sbjct: 15 KQPNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLT 74
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
GKYP G+ + + GLP E L + LK AGY T GKWHLG E +P
Sbjct: 75 GKYPARAGVGEILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLSEET-SPNAH 133
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
GFD +G+ G D+Y H + G++ H++ + + G Y T+L TE +++
Sbjct: 134 GFDEFFGFKAGCVDFYSHIF--YWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVD 191
Query: 204 VIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
I + + P FL+ ++ A H P AP + + +F + +R+ A M++ +D+
Sbjct: 192 FIQRSREQEAPFFLFASYNAPHY-----PMHAPQKYMDRFAHLP-WDRQVMAAMIAAVDD 245
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH--------SNKGSNHPLRGMKSTPWD 314
VG ++ AL++ G E++++ F +DNG S + GS RG K++ ++
Sbjct: 246 GVGKIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFE 305
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
GG+R A + P + +V E+ + D PT AG++ L GV
Sbjct: 306 GGIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLAGVDPAAGPLQGV 355
>gi|348516447|ref|XP_003445750.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Oreochromis
niloticus]
Length = 525
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 120/360 (33%), Positives = 170/360 (47%), Gaps = 26/360 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
P+III+L DD+GW D+ G TPN+DA+A G++L Y LC+PSR+AL+TG+
Sbjct: 31 PNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAEGMLLPNFYTANPLCSPSRAALLTGR 90
Query: 89 YPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
PI G ++ + G+ E LLPQ LK GY + +GKWHLG R Y
Sbjct: 91 LPIRNGFYTTNAHARNAYTPQEIVGGISKDEILLPQLLKTKGYVSKIVGKWHLGH-RPQY 149
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLY 197
P GFD +G Y+ K Y +M + ++D T T +Y
Sbjct: 150 LPLKNGFDEWFGSPNCHFGPYNDQSKPNIPVYNNSEMLGRFYEDFKIDRNTGESNLTQIY 209
Query: 198 TEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
++ I K++ P FLY A A HA P +FL S +R Y
Sbjct: 210 LMEGLDFILRQTKAQQPFFLYWAVDATHA---------PVYASKRFLGKS--QRGRYGDA 258
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V LD S+G ++ LR G+ N+ V F +DNGA + GSN P K T ++GG
Sbjct: 259 VMELDYSIGQILKWLRTLGIDNNTFVFFTSDNGAAVMSGPNESGSNGPFLCGKETTFEGG 318
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAK 375
MR A W P + V+ +L ++ D T AGI +D LDG++ VL ++
Sbjct: 319 MREPAIAWWPGHIKEGSVNFQLANVMDLFTTSLVLAGINPPDDRILDGLDLTPVLLNSSQ 378
>gi|423239862|ref|ZP_17220977.1| hypothetical protein HMPREF1065_01600 [Bacteroides dorei
CL03T12C01]
gi|392644851|gb|EIY38585.1| hypothetical protein HMPREF1065_01600 [Bacteroides dorei
CL03T12C01]
Length = 464
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 131/402 (32%), Positives = 187/402 (46%), Gaps = 46/402 (11%)
Query: 12 TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
TC +LF+ A L+ TTA K P++I I+ADDLG D+ +G QI TPNID +A
Sbjct: 4 TCKILFSAALLSSGSMMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
NG+ QHY + + PSR AL+TGK+ H ++ + G LP E +
Sbjct: 64 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
K Y T +GKW +G P GFD YGY L + HS F +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
+ +D K YS DL E A+N I E N KP FLY + HA Y
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEVMTEY 232
Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
E PF + R YA MV+ LD+SVG +I L++ G+ +++I++
Sbjct: 233 EGEFCETPFGGSKDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 288
Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
F +DNG S G H S SN P RG K ++GG+R I P + V++ +
Sbjct: 289 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 348
Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
D+LPT+ +I ++DG++ LT K + ++
Sbjct: 349 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGIQKEHDCIY 389
>gi|414344125|ref|YP_006985646.1| sulfatase [Gluconobacter oxydans H24]
gi|411029460|gb|AFW02715.1| sulfatase [Gluconobacter oxydans H24]
Length = 478
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 172/349 (49%), Gaps = 30/349 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
K P+II ILADDLG+ DVS G+ TP+ID +A+ G+ L + Y A+C+ +R+AL+T
Sbjct: 43 KAPNIIFILADDLGYADVSCFGTPGFTTPSIDRIAHEGIKLTRAYANSAVCSATRTALIT 102
Query: 87 GKYP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
G+Y I G++ + GLP LP+ L+ AGY T +GKWHLG E + P
Sbjct: 103 GRYQDRIACGLEEPIASSSGKIGLPPGLPTLPEQLRRAGYKTALVGKWHLGNPPE-FGPL 161
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
G+D YG G DY+ H K P + R + + + Y T L + A+ +
Sbjct: 162 KSGYDHFYGISGGAADYFTHQGK----PGEDQFYRDDHLIHDHG---YLTGLLGDEAVRL 214
Query: 205 IAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV------AKFLDISDPERRTYAGMV 257
+ +++ P FL L H + P++ P++E K D R+TY M+
Sbjct: 215 VETFASETSPFFLSL-----HFNAPHWPWEGPEDEAESKRLKGKIFDYDGGTRKTYGRMI 269
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
+D+ +G V+A L + + EN+IV+F +DNG F P GMK +GG+
Sbjct: 270 KAMDDQIGRVLATLDSYHLTENTIVVFTSDNGGERF------SDTWPFTGMKGELLEGGL 323
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVN 365
R A + P S + DWLP+ AA G + T DG++
Sbjct: 324 RVPAVLRWPARVAAGSESDQTLITMDWLPSFLAATGATPDATFPSDGIS 372
>gi|291238687|ref|XP_002739253.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 512
Score = 166 bits (419), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 117/395 (29%), Positives = 193/395 (48%), Gaps = 37/395 (9%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+F+ + +F F++ +P+I+I+ ADD+G+ D+ +G TP++D LA NGL
Sbjct: 2 FFWRIQLFAVFVQYFIDKCKTAEPNIVILFADDVGYGDLGCYGHPTTSTPHLDKLAANGL 61
Query: 67 ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+L Q Y + +C+PSR++L+TG+ P G+ GV + GLPL E + + LK Y
Sbjct: 62 LLTQFYAGSPVCSPSRASLLTGRLPPRTGVYPGVFQANDSGGLPLNETTIAELLKTVNYQ 121
Query: 126 THAIGKWHLGFFR-EVYTPTFRGFDSHYG-------------YWQGLQDYYDHSCKATFE 171
T IGKWHLG + Y PT GFD ++G +W SC + +
Sbjct: 122 TGMIGKWHLGVGENQKYLPTNFGFDYYFGIPYTHAQCPCVTCFWP------RDSCYGSCK 175
Query: 172 PY-QGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNT 228
PY G + +N ++ ++ + + + + Y AA + IA++ + KP FLY A H
Sbjct: 176 PYFPGCPLFYNDEIVDQPVDLLTLNEQYANAARDFIAKNAQEKKPFFLYYAFQHCH---- 231
Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
P KF + + R + ++ +D SVG VI L++ + +N++V F +DN
Sbjct: 232 -----HPQFAGKKFRNST--SRGVFGDSLAEIDWSVGQVIKELKQQDIDKNTLVFFSSDN 284
Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
G PS + G L+ K T ++GG R + P + + + S EL D LPT+
Sbjct: 285 G-PSIQWKTRGGVAGLLKCGKGTTYEGGQRVPGIAYWPGMIKPGR-SMELASTLDLLPTI 342
Query: 349 CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
+ + +LDGV+ +L K R +
Sbjct: 343 AKMVNGILPNVTLDGVDMGPILFTQDKGLRDTFFY 377
>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
[Ciona intestinalis]
Length = 513
Score = 165 bits (418), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 124/392 (31%), Positives = 174/392 (44%), Gaps = 27/392 (6%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L+C LL + KP+II +L DD+GW D+ +G TPN+D +A G +
Sbjct: 2 LSCFLLLLMCGYSLAESSKPNIIFMLMDDMGWGDLGINGQPSKETPNLDNMAKEGTLFTD 61
Query: 71 HY-VQALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEA 122
Y LC+PSR+AL+TG+ PI G +G + G+P E L+ + L A
Sbjct: 62 FYSANPLCSPSRAALLTGRLPIRNGFYTSNYHGHNGYTPQHIVGGIPDHEILVSELLSSA 121
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM---- 178
GY IGKWHLG +E Y P GF +G YD Y +M
Sbjct: 122 GYTNKLIGKWHLGQ-QEQYLPLKHGFHEWFGSPNCHFGPYDDKTTPNIPVYNNTEMVGRY 180
Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDE 237
+++ T Y + A++ I ++ KP FLY A A H+ P
Sbjct: 181 YEEFAIESHKYLSNMTQYYIQEALDFIERMERNEKPFFLYWAPDATHS---------PVY 231
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
F S R Y V LD VG +I L++ G+ +N++VLF +DNGA G
Sbjct: 232 SSPMFRGAS--RRGPYGDAVMELDYGVGVIIQKLKQLGLDKNTLVLFSSDNGAAMIGSAF 289
Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
GSN P K T ++GG+R W P Q +VS ++ + DW T A I +
Sbjct: 290 ENGSNGPFLCGKQTTFEGGIREPTIAWGPGFVQQGQVSHQVGSLMDWFATALDLASIAVP 349
Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
D +DGV+ VL A+ R I H N
Sbjct: 350 KDRIIDGVSLLPVLNGAAEFDRP-IFHYRGNA 380
>gi|284041117|ref|YP_003391047.1| sulfatase [Spirosoma linguale DSM 74]
gi|283820410|gb|ADB42248.1| sulfatase [Spirosoma linguale DSM 74]
Length = 486
Score = 165 bits (418), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 125/397 (31%), Positives = 196/397 (49%), Gaps = 34/397 (8%)
Query: 20 AFLNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALC 77
A++ AP P ++++ DDLG+ D+S G+ TPN+D +A G N QA+C
Sbjct: 27 AWVEKPAPATPPNVVLFFMDDLGYGDLSVTGALDYTTPNLDKMAAEGTRFTNFLAAQAVC 86
Query: 78 TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+ SR+AL+TG YP +G+ +G + P GL E+ L + LKE GYAT GKWHLG
Sbjct: 87 SASRAALLTGCYPNRLGL-YGALGPNSPIGLNPNEETLAELLKERGYATGMFGKWHLGDN 145
Query: 138 REVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRH-NMQVDNKTIGIYST 194
++ + P +GFD +YG Y + + +A + P + +D ++ + T
Sbjct: 146 KQ-FLPMQQGFDEYYGVPYSHDMWPLHPAQAQAKYPPLRWIDGNEPGPEIKDLNDAGKIT 204
Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
TE A++ I H K KP FLY+ H H P A+F S R +
Sbjct: 205 GTITEKAVSFIRNHKK-KPFFLYVPHPLPH---------VPLATSARFKGQS--ARGIFG 252
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
+++ LD SVG ++ L++ G+ +N++V+F++DNG P + GS+ R K T ++
Sbjct: 253 DVLTELDWSVGQIMNELKQQGLDKNTLVIFISDNG-PWLNYGDHAGSSGGFREGKGTSFE 311
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA 374
GG R + P + +VS++L D LPT+ G + +DGV+ W L KG
Sbjct: 312 GGHRVPCLVRWPGVVPAGRVSNKLLTALDILPTVANVCGARLPKQRIDGVD-WVALLKG- 369
Query: 375 KTKRSEILHNIDNVDNPQ-KYYAALRVDDLKYVAGTD 410
DN P+ K+Y R + L+ V D
Sbjct: 370 -----------DNSVTPRDKFYYYYRKNSLEAVRQGD 395
>gi|393784381|ref|ZP_10372546.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
CL02T12C01]
gi|392666157|gb|EIY59674.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
CL02T12C01]
Length = 456
Score = 165 bits (418), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 116/350 (33%), Positives = 192/350 (54%), Gaps = 30/350 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
P+I++I+ DDLG D+S + + TPNID L + G+ ++ Y ++ +PSR+ L+TG
Sbjct: 36 PNILLIMVDDLGLGDLSCQYAKDVQTPNIDRLFHQGVRMDNFYANSSVSSPSRAGLLTGC 95
Query: 89 YPIHIGMQHGVILEGEP---WG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+P +G+ GVI +P WG L LLP+ +K+AGY T IGKWHLG P
Sbjct: 96 FPDMVGVP-GVI-RTDPKGSWGYLSPDATLLPEMMKKAGYQTAIIGKWHLG-LESPNLPN 152
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
RGFD +G+ + D Y+ + F Y L+ ++D + ++TDL+T+ A +
Sbjct: 153 ERGFDFFHGFLGDMMDDYNTHLRRGFN-YMRLNTE---EIDPQG---HATDLFTDWASDY 205
Query: 205 IAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAP---DEEVAKFLDISDPERRTYAGMVSRL 260
I + K KP FLYLA+ A H+ P Q P +++V K R ++ +
Sbjct: 206 IFKARKQDKPFFLYLAYNAPHS-----PLQPPVEWEKKVRKRHPNISETRGKLVALIEHM 260
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D+ VG VI +L K G L+N++++F +DNG G ++ +N P+RG K +DGG++
Sbjct: 261 DDGVGRVINSLEKSGQLDNTLIIFCSDNG----GDRKSEANNGPVRGDKGDMYDGGIKVA 316
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
+++ + ++V++ L +SD PTLC + +N +DG++ +L
Sbjct: 317 CSLYWKGHLEHRRVNN-LVMMSDIFPTLCDLVQLPVNH-RIDGISVLPLL 364
>gi|149174645|ref|ZP_01853271.1| sulfatase [Planctomyces maris DSM 8797]
gi|148846755|gb|EDL61092.1| sulfatase [Planctomyces maris DSM 8797]
Length = 405
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 180/352 (51%), Gaps = 25/352 (7%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
+ + +KP++III DD G D++ +G+ + TP++D++A G+ Q Y A +C+PS
Sbjct: 1 MGAISSEKPNVIIIFTDDQGSVDLNCYGAKDLITPHMDSIARRGIRFTQFYASAPVCSPS 60
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
R+ ++TG++P G+ V G+P + + + +++AGY T IGKWHLG+ E
Sbjct: 61 RAGMLTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLGYTPET 120
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLY 197
P +GF++ +G+ G D Y H F + G + RH++ + K + G + DL
Sbjct: 121 -MPHGQGFETSFGHMGGCIDNYSH-----FFYWNGPN-RHDLWENGKEVWRDGAFFPDLM 173
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
E + I + KP FLY A H P Q ++ + +S P R YA V
Sbjct: 174 VEQCQDYIRKAG-DKPFFLYWAINVPHY-----PLQGKEKWRKTYAHLSSP-RDKYAAFV 226
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAP----SFGIHSNKGSNHPLRGMKSTPW 313
S +D+ +G V+A L + E +I++F +D+G +FG GS P RG K + +
Sbjct: 227 STMDDCIGEVLATLDACQLREKTIIIFQSDHGHSHEERTFG---GGGSAGPYRGAKFSLF 283
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
+GG+R A I P +V +L DWLPT+ A G + LDG N
Sbjct: 284 EGGIRVPAMISWPGTIAEGEVRDQLATGCDWLPTISALTGAPLPAHHLDGKN 335
>gi|410638700|ref|ZP_11349253.1| sulfatase [Glaciecola lipolytica E3]
gi|410141228|dbj|GAC16458.1| sulfatase [Glaciecola lipolytica E3]
Length = 522
Score = 165 bits (417), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 129/383 (33%), Positives = 191/383 (49%), Gaps = 52/383 (13%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
A KKP++I IL DDLG D+ +G I TPN+ +A G++ +QHY +C PSR+AL
Sbjct: 40 AAKKPNVIYILVDDLGIGDIEPYGQELINTPNLQRMADQGMVFSQHYAGNPVCAPSRAAL 99
Query: 85 MTGKYPIHIGMQHGVILEG----EPWG-LPLT--EKLLPQYLKEAGYATHAIGKWHLGFF 137
MTG++ H ++ L G E +G +PL + + + +K+AGY T IGKW LG
Sbjct: 100 MTGQHSGHNQIRGNFELGGFADEEEFGQMPLQPGTETVAKLMKKAGYTTALIGKWGLGGP 159
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSC------------KATFEPYQGLDMRHNMQVD 185
PT +GFD +GY Q + + +P++GL +H+ D
Sbjct: 160 GSYGVPTKQGFDYFFGYLDQKQAHNHYPTHLWQNETWFPLNNEYLDPHKGLG-KHDDPND 218
Query: 186 NKTI-----GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
+ G ++ T+ A++ I ++ K +P FLYL++ A HA QAPDEE+A
Sbjct: 219 EASYEKFKRGDFAQQRLTDEALSYI-DNKKDEPFFLYLSYAAPHAA-----LQAPDEEIA 272
Query: 241 KFLDISDP---------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
K+ D R T A M+S +DE VG V+ L+K G+ EN++V+F
Sbjct: 273 KYNHFEDTPYDGVADGGYVPSFRPRATRAAMISHIDEGVGLVLDKLKKLGLDENTLVIFS 332
Query: 286 ADNGAPSFGIHSNK---GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
+DNG PSF ++ SN RG K +DGG+R P KVS +
Sbjct: 333 SDNG-PSFEGGADLEFFDSNAQYRGFKRDLYDGGIRMPTIARWPTKVPAGKVSDHVSAFW 391
Query: 343 DWLPTLCAAAGIEINDTSLDGVN 365
D LPTL IE ++DG++
Sbjct: 392 DVLPTLADLVEIEA-PKNIDGIS 413
>gi|423214668|ref|ZP_17201196.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692574|gb|EIY85811.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
CL03T12C04]
Length = 453
Score = 165 bits (417), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 117/367 (31%), Positives = 191/367 (52%), Gaps = 31/367 (8%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
NT P++++IL DDLG D+S + + TPNID + G+ L+ Y ++ +PSR
Sbjct: 24 NTVDKSTPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSR 83
Query: 82 SALMTGKYPIHIGMQHGVILEG--EPWG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
+AL+TG++P +G+ GVI + WG + +P+ LK GY T IGKWHLG +
Sbjct: 84 AALLTGRFPAMVGVP-GVIRPSVDQNWGYFDPSAVTMPEVLKNGGYRTALIGKWHLG-WE 141
Query: 139 EVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
P RGFD +G+ + DYY H + Y ++ ++D K ++T+L+
Sbjct: 142 SPNLPNERGFDHFHGFLADMMDDYYTHRRQGGNYMYL-----NDKEIDPKG---HATELF 193
Query: 198 TEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRT-Y 253
T +++ I + K K P FLYLA+ A H+ P Q P E V K D S P +R
Sbjct: 194 TSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPVKRARL 248
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
++ LD ++G VI +L + G L N++V+F +DNG G + +N P RG K +
Sbjct: 249 IALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNG----GDRGSMANNGPTRGAKGDMF 304
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG 373
+GG+ A+ P + + + + + D +PT+C + + +DG++ D +
Sbjct: 305 EGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTICDFVNVPVKH-EIDGISVLDAIK-- 361
Query: 374 AKTKRSE 380
KT+ +E
Sbjct: 362 GKTQNTE 368
>gi|336424148|ref|ZP_08604192.1| hypothetical protein HMPREF0994_00198 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336009236|gb|EGN39231.1| hypothetical protein HMPREF0994_00198 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 454
Score = 164 bits (416), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 127/366 (34%), Positives = 176/366 (48%), Gaps = 30/366 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K P+II+I DDLG+ DV +GS TP ID LA NG+ Y+ + +C+PSR A+MT
Sbjct: 6 KCPNIILINCDDLGYGDVGCYGSGVNRTPGIDYLAQNGMRCTSFYMASPVCSPSRGAMMT 65
Query: 87 GKYPIHIGMQ----HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
G YP IG + V+ G+ GL E + Q L+ AGY T +GKWH G +E +
Sbjct: 66 GCYPPRIGFESFEGEWVLFPGQGVGLNSDEVTIAQILQRAGYRTKLVGKWHCG-DQEEFL 124
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH--NMQVDNKTIGIYSTDLYTEA 200
PT GFD +YG D K F P L +Q D ++I T+ Y E
Sbjct: 125 PTNHGFDEYYGLPYS-NDMGRQVFKEDFPPLPLLSGTGIVELQPDQRSI----TERYVEQ 179
Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
I E NK+ FLYLAH+ VH P A +E V K ++ + V +
Sbjct: 180 CRTFIRE-NKNNKFFLYLAHMHVHL-----PLYAGEEFVKKSIN------GDFGACVEAI 227
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D SV ++ LR G+ E+++V+F +DNG G + GSN PL+G K T W+GG R
Sbjct: 228 DWSVRVLLHELRSLGLEEDTLVIFTSDNG----GRGDHGGSNAPLKGNKGTTWEGGQRVP 283
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAA-GIEINDTSLDGVNQWDVLTKGAKTKRS 379
+ P + EL D+ TL G D D ++ V+ +G KT R
Sbjct: 284 CIFYWPGHIAPGQECRELICSLDFYRTLAELGHGTMPADRKNDSLDIRRVILEGGKTPRD 343
Query: 380 EILHNI 385
+ I
Sbjct: 344 TFFYYI 349
>gi|423348778|ref|ZP_17326457.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
CL03T12C32]
gi|409212738|gb|EKN05774.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
CL03T12C32]
Length = 461
Score = 164 bits (416), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 165/315 (52%), Gaps = 32/315 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
++P+I+I+LADD G+ D F G++ I TPNID LA G I +V A + +PSRS ++T
Sbjct: 29 QRPNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLT 88
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y G + + G+ GLP E+LLP LK Y T IGKWHLG P +
Sbjct: 89 GRYGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAK 145
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGI--YSTDLYTEAA 201
GFD+ YG G + Y+ ++P + D N+ Q + + + Y TD A
Sbjct: 146 GFDTFYGLLAGHRSYF-------YDP-ETSDKDGNLQQYQYNGRKLSFDSYFTDELASKA 197
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+ E +P LY++ A H+ P +A +E++A+F R+ YA M+ LD
Sbjct: 198 QQFVTE--SEQPFMLYMSFTAPHS-----PNEATEEDLARF---EGQPRQKYAAMMYALD 247
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GV 320
VG ++ L+ G +N+I+ F++DNG G +N+ SN PL+G K ++GG R
Sbjct: 248 RGVGKIVDELKAAGKFDNTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPF 303
Query: 321 AAIWSPWLKQTQKVS 335
+W K+ Q+ +
Sbjct: 304 FVVWGDRFKRDQRFT 318
>gi|149199917|ref|ZP_01876945.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149136986|gb|EDM25411.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 499
Score = 164 bits (416), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 125/388 (32%), Positives = 188/388 (48%), Gaps = 41/388 (10%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
F L +L F F T + P+ I I+ DD G+ D+ +G I TPNID +A G+
Sbjct: 3 FRTLIISLSFLLGF--TAKAEMPNFIFIMTDDQGYGDLGCYGHPIIKTPNIDKMADRGVR 60
Query: 68 LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
Y + C+P+R++LMTG + +G+ ++ GL +P+ LKE GY T
Sbjct: 61 FTDFYARHKCSPARASLMTGAFNFRVGVG-SIVYPNSTTGLIKEVVTIPEMLKEKGYTTA 119
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYG-----------YWQGLQDYYDHSCKATFEPY--- 173
IGKWHLG Y P +GFD ++G + + + + S + T E Y
Sbjct: 120 LIGKWHLG-HTAGYLPRDQGFDYYFGVPGTNHGDAKTHKLPVAEGFKPSGEFTIEDYWAD 178
Query: 174 --QGLDMRHNMQVDNKTIGIYSTDL------YTEAAINVIAEHNKSKPMFLYLAHLAVHA 225
+G+ + + N + + TD+ YT A+ I E NK KP FLY AH
Sbjct: 179 KGKGVHGNSTILMKNDNVIEWPTDITQLTKRYTHDAVRYIKE-NKDKPFFLYFAH----- 232
Query: 226 GNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
G + P+ A F SD Y M+ +D SVG VI AL+++G+ + +I+ F
Sbjct: 233 GTPHHPYTVD----AAFRGKSD--HGLYGDMIEEIDWSVGEVIKALQENGIEKKTIIAFT 286
Query: 286 ADNGAPSF--GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISD 343
+DNGA S H+ KGSN PL+G K + +GG+R + P K ++E+ + D
Sbjct: 287 SDNGADSKPNKEHAEKGSNLPLKGWKGSSEEGGVRVPFVLSWPGTLPEGKKTNEIASLMD 346
Query: 344 WLPTLCAAAGIEIN-DTSLDGVNQWDVL 370
PT A AGIE +DG N + ++
Sbjct: 347 IFPTYAALAGIEPEVPQKIDGNNIFPIM 374
>gi|423725143|ref|ZP_17699283.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
CL09T00C40]
gi|409234771|gb|EKN27595.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
CL09T00C40]
Length = 461
Score = 164 bits (416), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 165/315 (52%), Gaps = 32/315 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
++P+I+I+LADD G+ D F G++ I TPNID LA G I +V A + +PSRS ++T
Sbjct: 29 QRPNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLT 88
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y G + + G+ GLP E+LLP LK Y T IGKWHLG P +
Sbjct: 89 GRYGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAK 145
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGI--YSTDLYTEAA 201
GFD+ YG G + Y+ ++P + D N+ Q + + + Y TD A
Sbjct: 146 GFDTFYGLLAGHRSYF-------YDP-ETSDKDGNLQQYQYNGRKLSFDGYFTDELASKA 197
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+ E +P LY++ A H+ P +A +E++A+F R+ YA M+ LD
Sbjct: 198 QQFVTE--SEQPFMLYMSFTAPHS-----PNEATEEDLARF---EGQPRQKYAAMMYALD 247
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GV 320
VG ++ L+ G +N+I+ F++DNG G +N+ SN PL+G K ++GG R
Sbjct: 248 RGVGKIVDELKAAGKFDNTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPF 303
Query: 321 AAIWSPWLKQTQKVS 335
+W K+ Q+ +
Sbjct: 304 FVVWGDRFKRDQRFT 318
>gi|154495003|ref|ZP_02034008.1| hypothetical protein PARMER_04049 [Parabacteroides merdae ATCC
43184]
gi|154085553|gb|EDN84598.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
Length = 464
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 165/315 (52%), Gaps = 32/315 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
++P+I+I+LADD G+ D F G++ I TPNID LA G I +V A + +PSRS ++T
Sbjct: 32 QRPNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLT 91
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y G + + G+ GLP E+LLP LK Y T IGKWHLG P +
Sbjct: 92 GRYGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAK 148
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGI--YSTDLYTEAA 201
GFD+ YG G + Y+ ++P + D N+ Q + + + Y TD A
Sbjct: 149 GFDTFYGLLAGHRSYF-------YDP-ETSDKDGNLQQYQYNGRKLSFDGYFTDELASKA 200
Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
+ E +P LY++ A H+ P +A +E++A+F R+ YA M+ LD
Sbjct: 201 QQFVTE--SEQPFMLYMSFTAPHS-----PNEATEEDLARF---EGQPRQKYAAMMYALD 250
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GV 320
VG ++ L+ G +N+I+ F++DNG G +N+ SN PL+G K ++GG R
Sbjct: 251 RGVGKIVDELKAAGKFDNTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPF 306
Query: 321 AAIWSPWLKQTQKVS 335
+W K+ Q+ +
Sbjct: 307 FVVWGDRFKRDQRFT 321
>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
Length = 457
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 121/382 (31%), Positives = 177/382 (46%), Gaps = 43/382 (11%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
A LL + A + AP KP+I+ IL DD+G D +G++ TP+ID LA G+
Sbjct: 11 IAAILVLLASGALHSDAAPTKPNIVFILIDDMGCKDAGCYGATNFSTPHIDRLANQGMRF 70
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGE--PWG----------LPLTEKLLP 116
Y +C+P+R++LMTGK+P + + + + G P G LPL EK +
Sbjct: 71 TDAYAAPVCSPTRASLMTGKHPARLHLTNFIPQIGRQLPAGKLIPPGFNHVLPLDEKTIA 130
Query: 117 QYLKEAGYATHAIGKWHLG-FFREVYTPTFRGFD-----SHYGYWQGLQDYYDHSCKATF 170
Q L GY IGKWHLG Y P RGFD H+G + + D
Sbjct: 131 QELHADGYQCAMIGKWHLGEEHGPEYRPQNRGFDRVVLSEHHGIFNYFYPFVDQQKWPYA 190
Query: 171 EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE 230
P G G Y D T+ AI+ + E N+ +P FLYL+H +VH
Sbjct: 191 GPLPG------------NPGDYLPDRLTDEAIDFVRE-NRERPFFLYLSHWSVHGR---- 233
Query: 231 PFQAPDEEVAKFLDISDPER-RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
+ AP+ +AK+ + ER YA M+ +D SVG ++A L + + +N++ +FM+DNG
Sbjct: 234 -YFAPESLIAKYRERGLEERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMSDNG 292
Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
S PLRG K + ++GG+R + P + + S D PT
Sbjct: 293 G------ERITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISHDLFPTFL 346
Query: 350 AAAGIEINDTSLDGVNQWDVLT 371
A D LDG + +LT
Sbjct: 347 DFAERSYRDNKLDGHSIAGLLT 368
>gi|311747710|ref|ZP_07721495.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|126575698|gb|EAZ80008.1| sulfatase family protein [Algoriphagus sp. PR1]
Length = 481
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 179/348 (51%), Gaps = 31/348 (8%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRS 82
T P KP+I++I ADD+G+ D+ +G++Q TPN+D +A +G+ Q YV A+C+ SR+
Sbjct: 32 TEIPSKPNIVLIFADDMGYGDLGVYGATQWETPNLDKMASDGVRFTQFYVPHAVCSASRA 91
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
AL+TG Y + + G + GL E + + LK GYAT +GKWHLG + +
Sbjct: 92 ALLTGTYANRLEI-FGALDHSAKHGLNPEETTIAEMLKANGYATGIVGKWHLG-HQAPFL 149
Query: 143 PTFRGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDL 196
PT +GFDS+YG D + H K + P L + N V D++++ T
Sbjct: 150 PTEQGFDSYYGLPYS-NDMWPHHPEVKGYYPP---LPLYENTAVIDTLDDQSM---LTTN 202
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
YTE A+ I E++K KP FLYLAH H P D+ K E Y +
Sbjct: 203 YTEKALEFI-ENSKDKPFFLYLAHSMTHV-----PLYVSDKFKGK------SEHGLYGDV 250
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDG 315
+ +D SVG V L + G+ EN+IV+F +DNG S+G H+ G L+ K T WDG
Sbjct: 251 MMEVDWSVGQVRNKLDELGLAENTIVIFTSDNGPWLSYGGHA--GLTGGLKEGKGTSWDG 308
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
G+R P KV ++ D LPTL G ++ + +DG
Sbjct: 309 GIREPGIFVWPDHFPAGKVETQAAMTIDILPTLAEITGSKLPELPIDG 356
>gi|430744284|ref|YP_007203413.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430016004|gb|AGA27718.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 501
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/360 (31%), Positives = 178/360 (49%), Gaps = 24/360 (6%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
++ +F + +L + ++ +P +P+I++++ADD+G+ D S G+ I TPN+DA A
Sbjct: 3 QREWFVVASLVLVIGSRVHAASPPRPNIVLVMADDVGYGDFSCLGNPIIQTPNVDAFAKQ 62
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
+ +V C P+R+AL+TG++ G+ H ILE E L L L Q LK GY
Sbjct: 63 AVRFTDFHVSPTCAPTRAALLTGRHEFKSGVTH-TILERER--LSLKATTLAQVLKSKGY 119
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQG--LQDYYDHSCKATFEPYQGLDMRHNM 182
AT GKWHLG + Y P RGFD + + G Q Y A Y + HN
Sbjct: 120 ATGVFGKWHLG-DEDAYQPGQRGFDEVFIHGAGGIGQTYAGSGGDAPGNSYFNPAILHNG 178
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
+ KT G Y TD++ A+ I + K +P F Y+ A H+ P P+E +
Sbjct: 179 TFE-KTTG-YCTDVFFRQALTWIDQERKGDEPFFAYITPNAAHS-----PLDCPEEYAKR 231
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
+ GM+ +D++ G ++A L+ G+ +++V+FM DNG G +
Sbjct: 232 HAGQVPENVAKFYGMIENIDDNFGTLVAKLKDWGIENDTLVIFMTDNG----GTAGTRTY 287
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPW---LKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
N +RG K TP+ GG R V + W W K +V + H+ D LPTL G +++
Sbjct: 288 NAGMRGTKGTPYQGGTR-VPSFWR-WPSGFKGDVEVGALTAHL-DILPTLAEIVGSPLSE 344
>gi|383114003|ref|ZP_09934770.1| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
gi|382948682|gb|EFS34104.2| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
Length = 496
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/363 (34%), Positives = 192/363 (52%), Gaps = 25/363 (6%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGS-SQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
P KP+I+IILADD G+ V+ + +I TPNID LA +G+ Q Y L +P+R+ L
Sbjct: 34 PDKPNIVIILADDQGYGGVNCYPHIKKIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGL 93
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
MTGKY G +G+ G+P +KLL +YL E GY T IGKWHLG + + P
Sbjct: 94 MTGKYQQSFGF-YGLSTP-HVGGIPQDQKLLSEYLVENGYNTACIGKWHLGDYIRSH-PN 150
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEP-YQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
RGF + +G+ GL DYYD +++ Y GL + ++ T YST YT+ A++
Sbjct: 151 NRGFQTFFGFINGLHDYYDPLVGGSWDGVYNGLAFTLD-NMEPVTEMEYSTYEYTKRAVD 209
Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE---RRTYAGMVSRL 260
I + N P FLYL + A+H+ P QAP+E + + L I+ E M L
Sbjct: 210 FI-QKNADHPFFLYLPYNAIHS-----PLQAPEELIGE-LAINPQEIGKDDIARAMTFAL 262
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D+ VG V+ L + G+ +N+I+ +++DNGA + S+K RG K + ++GG+R
Sbjct: 263 DQGVGKVVETLEQLGLRDNTIIFYLSDNGAVEY---SDKWE---FRGRKGSYYEGGIRVP 316
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
+ P + ++ D PT+ AG+ D + GVN L+ +T+ +
Sbjct: 317 FIVSYPAKLAKGTIYNKPVMSIDIAPTVMELAGLSHAD--MHGVNLLPYLSGKDRTEPHD 374
Query: 381 ILH 383
+L+
Sbjct: 375 VLY 377
>gi|343084600|ref|YP_004773895.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342353134|gb|AEL25664.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 472
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 185/366 (50%), Gaps = 41/366 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
+ P+I+ IL DDLGW D+ +GS TPNID L G+ Y +C+PSR++++T
Sbjct: 27 RPPNIVYILIDDLGWKDLGCYGSEFYETPNIDKLRDQGMKFTAAYSASPVCSPSRASILT 86
Query: 87 GKYPIHIGM-------------QHGVILEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKW 132
GK P +IG + G I+ + + + L EK++P+ L ++GY + +IGKW
Sbjct: 87 GKNPANIGFTGHITAIGKHRYPEEGRIIPPDDYMHVSLEEKMIPEILLQSGYTSASIGKW 146
Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF-EPYQGLDMRHNM--QVDNKTI 189
H+G E + PT +GF + Y+H T+ P++ + + +DN+
Sbjct: 147 HVG-EEEKFFPTHQGFAINIA-------GYEHGSPPTYWGPFESEKSWNPVIKNLDNREE 198
Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDIS-- 246
G Y T+ T+ AIN I E NK P FLYL+H AVH P +APD + K+ L +
Sbjct: 199 GQYLTNRLTDEAINFIDE-NKEGPFFLYLSHYAVHT-----PLEAPDSLIKKYELKLEGQ 252
Query: 247 -DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
+ + YA M+ +D +VG ++ +L G+ N+IV+ +DNG +N PL
Sbjct: 253 MEQKNAIYAAMIENMDWNVGRLLKSLDSLGLEGNTIVILGSDNGG-----EGRVTNNVPL 307
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
R K ++GG+R I P + S D LPT+ AG+ + +DGV+
Sbjct: 308 REGKGYIYEGGIRVPLVIKWPGKVKPNSSSDVPVITDDMLPTIVEMAGLSNENQDIDGVS 367
Query: 366 QWDVLT 371
+LT
Sbjct: 368 LLPLLT 373
>gi|334139745|ref|YP_004532943.1| sulfatase [Novosphingobium sp. PP1Y]
gi|333937767|emb|CCA91125.1| sulfatase [Novosphingobium sp. PP1Y]
Length = 472
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 103/294 (35%), Positives = 150/294 (51%), Gaps = 26/294 (8%)
Query: 42 WNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVI 100
W DVS +G + +PTPNID +A G+ + YV A +C SR+ LMTG+ P G + +
Sbjct: 39 WADVSTYGRTDVPTPNIDRIAKTGVAFSSGYVAASVCAVSRAGLMTGRMPQRFGFTYNIN 98
Query: 101 LEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ 159
+G+ GLP+ +K + L+ GY T A GKWHLG R+ Y PT RGFD +G+ G
Sbjct: 99 DKGDVGAGLPVGQKTIADRLQPLGYRTAAFGKWHLGADRQFY-PTNRGFDEFFGFLAGET 157
Query: 160 DYYDHSCKATFEPYQGLDM--------RHNM-----QVDNKTIGIYSTDLYTEAAINVIA 206
+Y D +D H M Y T+ T+ A++ I
Sbjct: 158 NYVDPKTPGIVTTPTKVDKYEIGPGEGNHAMVEGPDARPADDFSKYLTNQITDRAVDFIN 217
Query: 207 EHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
+K P F Y+A+ A H P Q P +F ++ DP RRTY M++ +D+ VG
Sbjct: 218 RSADAKQPFFSYVAYNAPH-----WPLQVPQAYYDRFANVKDPVRRTYIAMIAAMDDGVG 272
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPLRGMKSTPWDGGMR 318
++ L + G+ EN+IV+F++DNG P FG + +NHP K T +GG R
Sbjct: 273 QILDTLEERGLRENTIVVFLSDNGCPERFGFCN---TNHPWGSGKFTYLEGGTR 323
>gi|405970955|gb|EKC35816.1| N-acetylgalactosamine-6-sulfatase [Crassostrea gigas]
Length = 511
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/385 (32%), Positives = 183/385 (47%), Gaps = 38/385 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTG 87
KP+ I++L DD+GW D+ G TP +D +A G++ Y LC+PSR+AL+TG
Sbjct: 19 KPNFILMLMDDMGWGDLGVFGEPNKETPYLDQMAAEGMLFPDFYSANPLCSPSRAALLTG 78
Query: 88 KYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
+ PI G ++ + G+P E LLP+ L++AGY + +GKWHLG +
Sbjct: 79 RLPIRNGFYTTNGHARNAYTPQNIVGGIPDEEILLPELLQKAGYKSKLVGKWHLG-HQAK 137
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD---NKTIGIYSTDLY 197
Y P GFD +G YD+ Y+ +M D K T LY
Sbjct: 138 YLPLKHGFDEWFGAPNCHFGPYDNVHTPNIPVYRNEEMAGRYYQDFKIEKNGESNLTQLY 197
Query: 198 TEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
T+ A+ I HNKSKP FLY A A H EP A FL S+ R Y +
Sbjct: 198 TKEAVEFITRMHNKSKPFFLYWAVDATH-----EPLYA----SKPFLGTSN--RGLYGDV 246
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
V LD +VG ++A+LR + N++V F +DNG ++ + GSN P K T ++GG
Sbjct: 247 VRELDSAVGKILASLRSLKIDNNTLVFFSSDNGGATYA-KQHGGSNGPFLCGKETTFEGG 305
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAK 375
MR W P + +VS ++ + D T A + + D +DG+
Sbjct: 306 MREPTIAWWPGHIKKGQVSHQVGSLMDLYTTFVDFAEVAMPTDRVIDGI----------- 354
Query: 376 TKRSEILHNIDNVDNPQKYYAALRV 400
+ R+ +L+N NV P YY ++
Sbjct: 355 SLRASLLNNT-NVQRPLFYYRGNQI 378
>gi|417305610|ref|ZP_12092564.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327538043|gb|EGF24733.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 455
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 178/355 (50%), Gaps = 41/355 (11%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A + P+ I+I ADDLG+ D+S + SS + TP++D LA G +V A +C+PSR+AL
Sbjct: 16 ADRPPNFILIFADDLGYGDISCYDSSGVKTPHLDQLAAEGFRSTDFFVPANVCSPSRAAL 75
Query: 85 MTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
+TG+YP+ GM ++ + + + +G E +P+ L AGY + +GKWHLG E
Sbjct: 76 LTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGMELEGS 135
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD---MRHNMQVDNKTIGIYS-TDLY 197
P GFD ++G + +EP +G + + QV+ K + T Y
Sbjct: 136 HPLDAGFDEYFG------------IPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKRY 183
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
T+ I+ I + K P F+Y++H VH N +P +PD F+ S E+ Y +
Sbjct: 184 TDEVIDFI-KRQKDDPFFVYVSHHIVH--NPLKP--SPD-----FVGTS--EKGKYGDFI 231
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LD S G ++ +R G+ EN++V+F +DNG + GS+ L G K +GG
Sbjct: 232 KELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP------TRNGSSGELSGGKYCTMEGGH 285
Query: 318 RGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
R W+ + Q VS D LP C AG+ I +D +DG N VL
Sbjct: 286 RVPGMFRWTSKIAPNQ-VSDVTLTSMDLLPLFCQLAGVPIPDDRQIDGKNILPVL 339
>gi|344292944|ref|XP_003418184.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Loxodonta
africana]
Length = 513
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 120/367 (32%), Positives = 182/367 (49%), Gaps = 26/367 (7%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
AP+ P+I+++L DD+GW D+ +G TPN+D +A G++ Y LC+PSR+AL
Sbjct: 27 APQPPNIVLMLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAAL 86
Query: 85 MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
+TG+ PI G ++ + G+P +E LLP+ LK+A YAT +GKWHLG
Sbjct: 87 LTGRLPIRNGFYTTNGHARNAYTPQDIVGGIPDSEHLLPELLKKANYATKIVGKWHLGH- 145
Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
R + P GFD +G YD+ K Y+ +M ++ KT
Sbjct: 146 RPQFHPLKHGFDEWFGSPNCHFGPYDNRAKPNIPVYRDWEMVGRFYEEFPINLKTGEANL 205
Query: 194 TDLYTEAAINVIA-EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T +Y + A++ I + ++ +P FLY A A HA P FL S +R
Sbjct: 206 TQIYLQEALDFIKRQQSQQRPFFLYWAIDATHA---------PVYASKPFLGTS--QRGR 254
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
Y V +D+SVG ++ LR G+ E++ + F +DNGA GSN P K T
Sbjct: 255 YGDAVREVDDSVGQLLQLLRGLGIAEHTFIFFTSDNGAALVSAPKEAGSNGPFLCGKQTT 314
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
++GG+R A W P +VS +L + D T + AG+E +D +DG++ VL
Sbjct: 315 FEGGVREPAIAWWPGHIPAGQVSHQLGSLMDLFTTSLSLAGLEPPSDREIDGLDLLPVLL 374
Query: 372 KGAKTKR 378
+G R
Sbjct: 375 RGHLIDR 381
>gi|149199915|ref|ZP_01876943.1| arylsulphatase A [Lentisphaera araneosa HTCC2155]
gi|149136984|gb|EDM25409.1| arylsulphatase A [Lentisphaera araneosa HTCC2155]
Length = 489
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 131/426 (30%), Positives = 204/426 (47%), Gaps = 46/426 (10%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
R++ +L C L F ++ A +KP+I+ L DDLG+ D+ +G+ TP ID LA
Sbjct: 6 RQFVTSLAC-LAFFTGVVSLQAQQKPNILFYLTDDLGYGDIGCYGAEGQYTPAIDQLAKE 64
Query: 65 GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGE-PWGLPLTEKLLPQYLKEAG 123
G + YV C+PSR+A MTG Y +G+ + E P GL +E LP+ +K AG
Sbjct: 65 GTKFSSFYVHQRCSPSRAAFMTGSYAHRVGLPQVIYKHREGPIGLNPSEITLPELMKTAG 124
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
Y T +GKWHLG ++ + P G+D YG+ + ++ S ++ R +
Sbjct: 125 YNTALVGKWHLGEWKP-FHPLNHGYDYFYGFLKVIEGSEKPSL---------IENRKELA 174
Query: 184 VD-NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
KT G +AAIN + +H K+ P FL + HA P+ P E+
Sbjct: 175 SKIQKTEG--QAPGMVKAAINFMTKHKKN-PFFLVYSDPMPHA-----PY-FPSEQFK-- 223
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--- 299
+R Y ++ +D +++ AL + G+ EN+IV+F +DNG P + K
Sbjct: 224 ---GTSKRGNYGEVIHEIDWQFKHLMDALDELGLKENTIVVFTSDNGPP---VERQKKYD 277
Query: 300 -GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-N 357
G + PLR K T ++GG+R I P + S + I D LPT C AG+++ N
Sbjct: 278 VGLSGPLRDGKWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGIIDMLPTFCELAGVDVPN 337
Query: 358 DTSLDGVNQWDVLTKGAKTK---RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQ 414
D +DGVN L ++K ++I+ + N KYYA + NN +
Sbjct: 338 DRVIDGVNILPQLLGDQESKALRETQIVPGATIIHNGWKYYAKQQ--------NPYNNKK 389
Query: 415 SDEWYG 420
++W G
Sbjct: 390 PEDWNG 395
>gi|423288159|ref|ZP_17267010.1| hypothetical protein HMPREF1069_02053 [Bacteroides ovatus
CL02T12C04]
gi|392671048|gb|EIY64524.1| hypothetical protein HMPREF1069_02053 [Bacteroides ovatus
CL02T12C04]
Length = 481
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 139/419 (33%), Positives = 198/419 (47%), Gaps = 67/419 (15%)
Query: 8 FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
FAL C L +P+++ ILADDLG+ DVS +G + TPNIDALA G+
Sbjct: 9 LFAL-CMLASCSVSEKNKLESRPNVVFILADDLGYGDVSCYGQEKFTTPNIDALASEGMK 67
Query: 68 LNQHYVQALCT---PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEK--LLPQYLKEA 122
Q Y A CT PSR +LMTG + H ++ L+GE LP+ E + + K+
Sbjct: 68 FTQFY--AGCTVSAPSRCSLMTGFHTGHAQVRGNRELKGEGQ-LPMAEGTYTIARMFKDN 124
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
GYAT A GKW LGF P +GFD +GY +C+ Y + HN
Sbjct: 125 GYATGAFGKWGLGFPGSEGDPNNQGFDEFFGY----------NCQRQAHRYYPSHLWHNQ 174
Query: 183 QV------DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA----------- 225
+ D K +Y+ DL E A++ I NK KP F ++ + HA
Sbjct: 175 EKVILEGNDTKNKTVYAPDLIHEKALDFI-RTNKDKPFFAFIPVIQPHAELLVPEDSIIE 233
Query: 226 ---GNTYE-PFQAPDEEVAKFLD--------ISDPE-RRTYAGMVSRLDESVGNVIAALR 272
G E PF A D+E A++ D S PE T+A MVSR+D+ VG+V L+
Sbjct: 234 KYRGKYPETPFVA-DKEGAEYGDPDFDVKAYCSQPEPHATFAAMVSRVDKYVGDVTGLLK 292
Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKG-------SNHPLRGMKSTPWDGGMRGVAAI-W 324
+ G+ +N+IV+F +DNG H G SN G K + DGG+R I W
Sbjct: 293 ELGIDDNTIVIFSSDNGP-----HLEGGADPDFWNSNGDFSGYKRSMTDGGIRVPMIIKW 347
Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
+K V + D++PT G++++ T DG++ V+T + K E L+
Sbjct: 348 GDRIK-AGSVEQHIGAFYDFMPTFADLLGVDVDKT--DGISLLPVITGSGEQKAHEFLY 403
>gi|291223783|ref|XP_002731888.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 516
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 124/422 (29%), Positives = 202/422 (47%), Gaps = 41/422 (9%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
++ KP+I+++ ADD+G+ D++ +G TPN+D +A NG++ Y A LC+PSR+A
Sbjct: 29 SSGSKPNIVVLFADDVGYGDLACYGHPASYTPNLDGMAKNGMLFTSFYSAANLCSPSRAA 88
Query: 84 LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYT 142
L+TG+Y I G+ V+ GLP E + LK+ GY T IGKW LG ++ V+
Sbjct: 89 LLTGRYHIRSGIYPFVLGPDSTGGLPRNETTIANILKQQGYHTAMIGKWDLGIGKDMVHF 148
Query: 143 PTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD-----------------MRHNMQ 183
P GFD +YG Y G C+ F P++G + NM+
Sbjct: 149 PMHYGFDKYYGVPYVHGAC-----PCETCFYPHEGCQQSVIGGDKCGFDKTYCPLFQNMR 203
Query: 184 VDNKTIGIYS-TDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
+ + + + + + Y + A + I + N + FLYLA+ H P +
Sbjct: 204 IIQQPVDLLTLAEKYVDFAKDFITDSANAGESFFLYLAYNHCH---------FPQFAGKR 254
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
F + + +R Y ++ +D SVG ++ LR + N++V F +DNGA + N GS
Sbjct: 255 FHNFT--QRGIYGDSLAEIDWSVGEILQQLRHSQIENNTLVFFTSDNGA-ALRDWYNGGS 311
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL 361
+ L+ K+T ++GG R A + P S + D LPT+ +G + ++
Sbjct: 312 SGMLKCGKATTYEGGHRVPAIAYWPGQITAGVSSRDFLTTLDLLPTITKLSGSVLPKVTI 371
Query: 362 DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD 421
DG++ VL AK+ R +L+ D Y A+R K T+ + QS + D
Sbjct: 372 DGMDMSSVLFGQAKSARDNLLYYPAYPDKKIGAY-AIRYKHYKAHFYTEGSPQSGMYNAD 430
Query: 422 TD 423
TD
Sbjct: 431 TD 432
>gi|402884716|ref|XP_003905821.1| PREDICTED: arylsulfatase A [Papio anubis]
Length = 507
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 128/404 (31%), Positives = 192/404 (47%), Gaps = 30/404 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL L YV +LCTPSR+AL+T
Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPVSLCTPSRAALLT 78
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ +GM GV+ GLPL E L + L GY T GKWHLG E + P
Sbjct: 79 GRLPVRMGMYPGVLEPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPH 138
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
+GF G Y ++ +C P G + + N+ V+ + + +
Sbjct: 139 QGFHRFLGIPYSHDQGPCWNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLE 198
Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
Y A +++A+ + +P FLY A H Y F F + S R +
Sbjct: 199 ARYVAFARDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QNFAERSG--RGPF 247
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
+ LD +VG ++ A+R G+LE ++V+F ADNG + + S G + LR K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 306
Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
+GG+R A A W + V+ EL D LPTL A AG + + +LDG + +L
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
K+ R + D + + A+R K T + SD
Sbjct: 365 TGKSPRQSLFFYPSYPDEVRGVF-AVRSGKYKAHFFTQGSAHSD 407
>gi|340617020|ref|YP_004735473.1| sulfatase [Zobellia galactanivorans]
gi|339731817|emb|CAZ95082.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
Length = 452
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 127/383 (33%), Positives = 188/383 (49%), Gaps = 42/383 (10%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
AL C+ D+ KP+II ILADDLG+ D+ +G I TP+ID +A GL
Sbjct: 1 MALFCSCENTDSAYGKVKGSKPNIIFILADDLGYGDLGCYGQETISTPHIDRMASEGLRF 60
Query: 69 NQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
QHY A +C PSR++LMTG+ H+G +E LT + + LK+AGY T
Sbjct: 61 TQHYAGATVCAPSRNSLMTGQ---HMGNTTIKSMEKPIKDSDLT---VAEVLKDAGYRTG 114
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ--DYYDHSCKATFEPYQ-GLDMRHNMQV 184
IGKW LG +GFD +GY+ ++ +YY E + ++ +
Sbjct: 115 VIGKWGLGNVGTSGYANAQGFDYSFGYYDQIRAHNYYPDYLMENGEKFPLKNEVVYVSDS 174
Query: 185 DNKTIGI---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
N +GI YS DL+TE A++ I K +P FLYLA+ HA N
Sbjct: 175 TNYAVGIGNAAIKKKEYSNDLFTEKALDFIKTDAK-EPFFLYLAYTIPHANNESFLINEH 233
Query: 236 DEEVAKFLDISD-----PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
EV + I D E++ A M++RLD VG ++ L+K+ + N++V+F +DNG
Sbjct: 234 GMEVPDY-GIYDKEPWPSEKKAGAAMITRLDSYVGQILDLLKKNNLDGNTLVIFTSDNGP 292
Query: 291 PSFGIHSNKG-------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK-VSSELFHIS 342
H G SN PLRGMK ++GG+R + W + +K + ++
Sbjct: 293 -----HQEGGWQVDYFDSNGPLRGMKRDLYEGGIR--VPFIAKWPGKIEKGETDQVATFW 345
Query: 343 DWLPTLCAAAGIEINDTSLDGVN 365
D+LPT C +G + S DG++
Sbjct: 346 DFLPTACELSG-QDKPISTDGIS 367
>gi|149178470|ref|ZP_01857059.1| arylsulfatase A (precursor) [Planctomyces maris DSM 8797]
gi|148842683|gb|EDL57057.1| arylsulfatase A (precursor) [Planctomyces maris DSM 8797]
Length = 491
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 127/383 (33%), Positives = 190/383 (49%), Gaps = 36/383 (9%)
Query: 9 FALTCTLLFNDAFLNTT----APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
F + C L+F N A + P+ +II DD G+ DV GS I TPN+D +A
Sbjct: 22 FLVIC-LVFLSLLSNVADARPARQHPNFVIIFTDDQGYQDVGVFGSPNIKTPNLDQMAKE 80
Query: 65 GLILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
G+ Y QA+C+ SR AL+TG YP +G++ G + G+ E + + +K G
Sbjct: 81 GIRFTDFYAAQAVCSASRVALLTGCYPNRVGIR-GALGPQSKIGINAEETTIAEVVKPQG 139
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHN 181
YAT GKWHLG E + PT GFD ++G Y + ++ + K + L + N
Sbjct: 140 YATAIYGKWHLGHLPE-FLPTRHGFDEYFGLPYSNDMWPFHPTAGKR----FPDLPLIEN 194
Query: 182 MQVDN-KTIGIYSTDL---YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
V N K G L YTE A++ I + N KP FLY+ H H P D+
Sbjct: 195 ETVINPKVTGKEQAQLSTWYTERAVSFINK-NHDKPFFLYVPHSMPHV-----PLFVSDK 248
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIH 296
K E+ Y ++ +D SVG + AL+++G+ EN++V+F +DNG S+G H
Sbjct: 249 FKGK------SEQGLYGDVIMEIDWSVGQIRQALKENGIEENTLVIFTSDNGPWLSYGDH 302
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
+ GS PLR K T WDGG R + P V ++ D LPT+ +G ++
Sbjct: 303 A--GSALPLREGKGTAWDGGQREPCIMAWPGQIPAGSVCHQMAMTIDILPTIAYLSGGKV 360
Query: 357 -NDTSLDGVNQWDVLT--KGAKT 376
D +DG N W +++ +GA++
Sbjct: 361 PQDRIIDGKNIWPLMSGEQGAQS 383
>gi|294053911|ref|YP_003547569.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613244|gb|ADE53399.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 469
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 119/349 (34%), Positives = 179/349 (51%), Gaps = 30/349 (8%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
+P++I+I+ADDLG+ D+ + GS IPTPNID LA G+ YV C PSR+ +TG
Sbjct: 31 QPNVIVIMADDLGYGDLGYTGSKHIPTPNIDRLANEGVECTYGYVTHQYCGPSRAGFLTG 90
Query: 88 KYPIHIGMQ-------HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
+Y G + H I G+P +E+L + L+ GY T +GKWH+G +
Sbjct: 91 RYQQRFGFETNPPYDRHNTIA-----GVPASERLFAERLQAVGYKTGIVGKWHIGS-HSI 144
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATF-EPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
+ P RGFD +G+ G D++ + E Y MR+ VD + Y T T+
Sbjct: 145 HHPNNRGFDFFFGFLGGGHDFFRVDTREPMDEGYLDPMMRNGSSVD---VEGYLTTQLTD 201
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
AI I E N+ P FL+L++ A HA P QAP+E +AKF + ERR Y+ MV
Sbjct: 202 EAIGFI-ERNEKDPFFLFLSYNAPHA-----PLQAPEESIAKFSHVEGKERRVYSAMVYE 255
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG--SNHPLRGMKSTPWDGGM 317
+D +G +++ L + E +I+ F++DNG P S SN RG K +DGG+
Sbjct: 256 MDLEIGRILSMLEDRDLAEETIIFFLSDNGGPPHWDKSQDSYTSNGAFRGYKGDTYDGGV 315
Query: 318 R-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
A W L ++ +S + + D T A G + ++ ++GVN
Sbjct: 316 HVPFLAYWPGTLPAGKQFTSPVVSL-DIARTAVALGGGD--ESEMEGVN 361
>gi|149701806|ref|XP_001488119.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Equus caballus]
Length = 491
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 113/359 (31%), Positives = 173/359 (48%), Gaps = 26/359 (7%)
Query: 34 IILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIH 92
I+ + ++GW D+ +G TPN+D +A G++ Y LC+PSR+AL+TG+ PI
Sbjct: 3 ILKSVNMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIR 62
Query: 93 IGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
G ++ + G+P +E+LLP+ LKEAGY + +GKWHLG R + P
Sbjct: 63 NGFYTTSGHARNAYTPQEIVGGIPDSERLLPELLKEAGYVSKIVGKWHLGH-RPQFHPLK 121
Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAA 201
GFD +G YD+ + Y+ +M ++ KT T +Y + A
Sbjct: 122 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEA 181
Query: 202 INVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
++ I ++ P FLY A A HA P FL S +R Y V +
Sbjct: 182 LDFIRRQQAARRPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRYGDAVREI 230
Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
D+SVG ++ +L+ G+ E++ V F +DNGA GSN P K T ++GGMR
Sbjct: 231 DDSVGKILHSLQDLGIAESTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTTFEGGMREP 290
Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKR 378
A W P +VS +L I D T + AG+E D ++DG++ + +G R
Sbjct: 291 AVAWWPGRIPAGQVSHQLGSIMDLFTTSLSLAGLEPPRDRAIDGLSLLPAMLQGQLMDR 349
>gi|430741911|ref|YP_007201040.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013631|gb|AGA25345.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 469
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 127/378 (33%), Positives = 177/378 (46%), Gaps = 58/378 (15%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQAL-CTPSRSAL 84
A +KP+I+ ILADDLG+ DV+ +GS TP+ID LA GL Y C P+R+AL
Sbjct: 28 ADRKPNIVFILADDLGFTDVACYGSRYYETPHIDKLAAQGLKFTDGYTCGPNCQPTRAAL 87
Query: 85 MTGKYPIHIGMQHGVILEGEPW------------GLPLTEKLLPQYLKEAGYATHAIGKW 132
M+G+Y G+ ++ W GLPL + + Q L+ AGYAT GKW
Sbjct: 88 MSGQYGPRTGIYTVGSIDRFDWKSRPLRPVDNAQGLPLGKVTIAQTLQAAGYATGMFGKW 147
Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
HLG + Y P RGFD +A Q D +V G Y
Sbjct: 148 HLGQQGD-YHPRRRGFD-----------------EAIVSMGQHFDFTTAPEV-TYPAGTY 188
Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS------ 246
D T+ A++ I H K +P FLYL H VH+ P+QA + V KF + +
Sbjct: 189 LADFLTDKAVDFITRH-KDEPFFLYLPHFGVHS-----PYQAKKDLVKKFAEKAAAGGHH 242
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------ 300
DP TYA M++ +DESVG V+ L + G+ ++++V+F +DNG G + +G
Sbjct: 243 DP---TYAAMLASVDESVGRVVKTLDELGLSDDTLVIFSSDNGG--VGGYEREGIGKAGA 297
Query: 301 --SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEIN 357
N PLRG K ++GG+R P V + D PTL A G
Sbjct: 298 VTDNAPLRGGKGMLYEGGIRAPYLFRWPGKIPAGTVCDRAINSVDLYPTLVELAEGKAPE 357
Query: 358 DTSLDGVNQWDVLTKGAK 375
LDG + +LT G +
Sbjct: 358 SDPLDGTSYLRLLTSGGQ 375
>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
Length = 544
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 124/407 (30%), Positives = 192/407 (47%), Gaps = 71/407 (17%)
Query: 29 KPHIIIILADDLGWNDVSFH----GSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSA 83
+P+II++LADD+G+ND+S H + T NIDALA +G++ + Y A C PSR++
Sbjct: 59 RPNIILVLADDMGYNDISIHNGGAADGTLQTKNIDALAKSGILFTRGYAANATCAPSRAS 118
Query: 84 LMTGKYPIHIG----------------------------MQHGVILEGEPW---GLPLTE 112
+MTGKYP G + V+ P+ G+P +
Sbjct: 119 IMTGKYPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREVVSNMPPFMEQGMPTEQ 178
Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
+ + L++AGY T IGKWHLG + P +GF G L DH
Sbjct: 179 ITIAEVLRDAGYYTAHIGKWHLGHEYGM-DPMSQGFQDSLGLVGPLYLPEDHP------- 230
Query: 173 YQGLDMRHNMQVDNKTIGI-----------------YSTDLYTEAAINVIAEHNKSKPMF 215
++ + + ++D G+ Y TD YT+ A+ VI E+NK++P F
Sbjct: 231 -DVVNAKFDTRIDKMIWGMGQYSANFNGGDLFAPDKYVTDYYTDEALKVI-ENNKNRPFF 288
Query: 216 LYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHG 275
LYL+H A+H P QA + + + + Y+GM++ LD SVG +I L++
Sbjct: 289 LYLSHWAIH-----NPLQALRSDFEQMSHMHGHNLQVYSGMINSLDRSVGKIIEKLKELD 343
Query: 276 MLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS 335
+ ++++F +DNG ++ I N N P RG K + +DGG+R I P K S
Sbjct: 344 IYGKTLIIFTSDNGGANY-IELND-INKPYRGWKISFFDGGIRVPYIISWPDEINPGKKS 401
Query: 336 SELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
H D PT+ AAGIE + LDGV+ + + +K + L
Sbjct: 402 ENAVHHFDIFPTILKAAGIE-STNELDGVDLMPFIKNDSSSKPHKTL 447
>gi|325107642|ref|YP_004268710.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324967910|gb|ADY58688.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 749
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 112/339 (33%), Positives = 170/339 (50%), Gaps = 26/339 (7%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP++I +L DD G+ D+ +G++++ TP IDALA G +Y A +C+PSR+ L+TG
Sbjct: 28 KPNVIFVLVDDQGYYDLGCYGATEVETPEIDALAAEGTRFTDYYAAAPICSPSRAGLLTG 87
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YP +G V G+ E L + + GYAT IGKWHLG F E + P +G
Sbjct: 88 CYPRRVGNHIWVHRADSDTGIHPNELTLAELFHQNGYATACIGKWHLG-FHEPFLPQNQG 146
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIA 206
FD ++G L D FE G+ + N QV + T YT+ AI+ +
Sbjct: 147 FDHYFGLLHNL----DPVETVYFEEQGGVPLLRNDQVVQRPADPAELTKQYTDEAISWME 202
Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
+H + +P FLYL H +H P DE F S+ Y + LD ++G
Sbjct: 203 QH-RDQPFFLYLPHTMLH-----NPLGVSDE----FRGSSN--WGEYGDAIQELDHNMGR 250
Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
++ L++ + +N++V++ +DNG + L+G K + W+ G+R A W P
Sbjct: 251 LMQTLKRLEIDKNTVVVYASDNGR-----GPGRTDQQKLQGRKLSTWEAGIRVPAIAWGP 305
Query: 327 WLK-QTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDG 363
L Q+ VSS + DW PTL + AG E+ D +DG
Sbjct: 306 GLGLQSGAVSSTVVSAMDWYPTLASLAGFEVPADRVIDG 344
>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 443
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 121/355 (34%), Positives = 178/355 (50%), Gaps = 31/355 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
++P++++IL DDLG+ D+ +GS I TP +D +A +G+ L Y A +CTP+R+ALMT
Sbjct: 32 QQPNVVLILVDDLGYGDLGCYGSESIRTPRLDRMAASGMKLTSFYAAAPICTPTRAALMT 91
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G Y +G+ + + E G+ +E L + +++ GY T +GKWHLG Y PT
Sbjct: 92 GCYATRVGLPTPLHVYDE-IGINESEFTLGEAMQQCGYETVCVGKWHLGHQPRFY-PTEH 149
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS----TDLYTEAAI 202
GF+ ++G G KA D D + I T+ TE A+
Sbjct: 150 GFNHYWGTPLGHMFNRPAVGKAIG------DTSDLFLDDTREIPFPEDADLTERLTEKAV 203
Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
I E + +P FL+LAH H EP A + KF D S + Y +V LD
Sbjct: 204 EFI-EAKRDRPFFLFLAHPMPH-----EPLAASE----KFADRS--QAGLYGDVVECLDW 251
Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
S G V+ AL + G EN++V F++DNG P G G+ LRG K P++GGMR
Sbjct: 252 STGEVLDALERTGQAENTLVAFLSDNG-PKKG----HGTTAGLRGFKHDPYEGGMRVPCI 306
Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLC-AAAGIEINDTSLDGVNQWDVLTKGAKT 376
I + L + S E+ ++ DW PTL A G +++ DG N +LT T
Sbjct: 307 IAAQGLIPAGQTSDEICNVMDWYPTLVNLAGGTTVSEQVRDGKNLTPLLTGETAT 361
>gi|325109298|ref|YP_004270366.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969566|gb|ADY60344.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 463
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 187/362 (51%), Gaps = 35/362 (9%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
+ +L +LF + A KP++++IL DDLG+ D+S +G++ + +PNID L G
Sbjct: 11 QLLMSLVAVVLFGVSS-TAQAADKPNVLLILVDDLGYGDLSCYGATDLQSPNIDKLVSRG 69
Query: 66 LILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYLKE 121
L Y +C+P+R+A+++GKYP +G+ GVI WG L +LLP L+
Sbjct: 70 LKFTNFYANCPVCSPTRAAILSGKYPDRVGVP-GVIRTHADNSWGYLAPEAELLPSLLQP 128
Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLD-MR 179
AGY + IGKWHLG P RGFD GY + DYYDH G++ MR
Sbjct: 129 AGYHSAIIGKWHLG-LEAPNRPNDRGFDHFKGYLGDMMDDYYDHR-------RHGINYMR 180
Query: 180 HNMQ-VDNKTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPD- 236
N Q +D + ++TDL+TE + + + E + +P FLYLA+ A H P Q P
Sbjct: 181 ENEQEIDPEG---HATDLFTEWSCDYLKEQADNEQPFFLYLAYNAPHT-----PIQPPQD 232
Query: 237 --EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
E+V K +R ++ +D+ +G V+ L + G ++++V+F +DNG
Sbjct: 233 WLEKVKKREAGIGEQRAKLVALIEHMDDGIGQVLTCLEQTGQADDTLVIFTSDNGG-QLN 291
Query: 295 IHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ +N G+ R K + ++GG++ A +W ++ Q + + D+ T+ AG
Sbjct: 292 VGANNGAT---RDGKQSMYEGGLKVPTAVVWPGKIEAGQTTEFQALSM-DFCSTILDVAG 347
Query: 354 IE 355
+E
Sbjct: 348 VE 349
>gi|403282838|ref|XP_003932845.1| PREDICTED: arylsulfatase A [Saimiri boliviensis boliviensis]
Length = 509
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 129/398 (32%), Positives = 191/398 (47%), Gaps = 38/398 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCTPSR+AL+T
Sbjct: 21 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 80
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ +GM GV++ GLPL E L + L GY T GKWHLG E + P
Sbjct: 81 GRLPVRMGMYPGVLVPSSRGGLPLEEVTLAEVLAAQGYLTGMAGKWHLGVGPEGAFLPPH 140
Query: 146 RGFD-------SH-YGYWQGLQDY-----YDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
+GF SH G Q L + D C P + + N+ V+ + +
Sbjct: 141 QGFHRFLGIPYSHDQGPCQNLTCFPPDTPCDGGCDQGLVP---IPLLANLSVEAQPPWLP 197
Query: 193 STDL-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
+ Y A +++A+ + +P FLY A H Y F F + S R
Sbjct: 198 GLEARYIAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--R 246
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
+ + LD +VG ++ A+ G+LE ++V+F ADNG + + S G + LR K
Sbjct: 247 GPFGDSLMELDAAVGALMTAIGDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKG 305
Query: 311 TPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDV 369
T ++GG+R A A W + T V+ EL D LPTL A AG + + +LDG + D+
Sbjct: 306 TTYEGGVREPALAFWPGHI--TPGVTHELSSSLDLLPTLAALAGAPLPNVTLDGFDLSDL 363
Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA 407
L K+ R + P + + V + KY A
Sbjct: 364 LLGTGKSPRQSLFF---YPSYPDEVHGVFAVRNGKYKA 398
>gi|332663783|ref|YP_004446571.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332332597|gb|AEE49698.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 550
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 132/461 (28%), Positives = 206/461 (44%), Gaps = 96/461 (20%)
Query: 9 FALTCTLLFND--AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
F + LF+ A L KKP+II+I+ DD+G++D+ +GS +I TPNID LAY GL
Sbjct: 7 FIIALLALFSSIIAPLVQAQTKKPNIILIMVDDMGYSDLGAYGS-EIQTPNIDKLAYEGL 65
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEP--WGLPLTEKL-LPQYLKEAG 123
L + Y ++C P+R++L+TG+YP G+ + + G P G E L + L++AG
Sbjct: 66 RLREFYNNSICAPTRASLITGQYPHKAGVGYFDVNLGIPPYQGYLNKESLTFGEVLRQAG 125
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
Y+T GKWH+G ++ P RGFD +G G +Y+D Y + + N +
Sbjct: 126 YSTLLSGKWHVG-NDSLHWPKQRGFDRFFGVIGGGSNYFDAEPMPLGRQYPVVILEDNQR 184
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
K Y TD T A+ + E NK KP FLYLA+ A H P QA E++AK+
Sbjct: 185 QKPKANSYYFTDEITNHAVQFLDEQNKMDKPFFLYLAYTAPHW-----PLQALPEDIAKY 239
Query: 243 -----------------------------------LDISDPE---------------RRT 252
D+ P+
Sbjct: 240 RGKYDTGWDALRKERLARQVKLGIITQAQADAAAARDVEVPDWDNLTFEEKQLWKAKMEV 299
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP--------------------S 292
YA M+ R D+ VG V+A L++ +N++++F++DNGAP +
Sbjct: 300 YAAMLDRADQGVGKVLAKLKELKKDDNTLIIFISDNGAPAEDMAHRDKNHAGRNTGPVGT 359
Query: 293 FGIHSNKG------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLP 346
G ++G SN P R KS ++GG+ W P + ++ H+ D P
Sbjct: 360 AGSFESQGKPWSYVSNTPFRSFKSFAYEGGISSPFIAWFPGKIKAGSIAKGTAHLIDLAP 419
Query: 347 TLCAAAGIE-------INDTSLDGVNQWDVLTKGAKTKRSE 380
T AG++ ++ +L G + ++L K +R E
Sbjct: 420 TFYELAGLKYPSTYNGVSTNALQGKSLSNLLFKTEAWERGE 460
>gi|372210171|ref|ZP_09497973.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 651
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 135/449 (30%), Positives = 201/449 (44%), Gaps = 94/449 (20%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQ-------IPTPN 57
+KY F + + +T +P+II+I+ADDLG+ DV F+ + IPTP
Sbjct: 3 KKYLFFIFLLTVVGQYNAQSTKGSRPNIIVIVADDLGYADVGFNRDANFPAEKGVIPTPE 62
Query: 58 IDALAYNGLIL-NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILE-GEPWGLPLTEKLL 115
+D LA NG+I N H C PSR+ALMTG P IG+Q+ + + G+PL E
Sbjct: 63 LDQLANNGIICTNGHVAHPFCGPSRAALMTGVQPSRIGVQYNLPNDINTSLGIPLEETYF 122
Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF----- 170
P+ L++ Y T A GKWHLGF + Y P RGFD +G+ G + Y++ + +
Sbjct: 123 PKILQQNNYHTAAFGKWHLGFTQGKYQPLDRGFDYFFGFLGGGKAYFEREYEDLYYRRLG 182
Query: 171 ------EPYQGLDMRHNMQV--DNKTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHL 221
YQ R V D Y TD+ T+ AIN IAE+ KS P F+Y+A+
Sbjct: 183 GSNPVTNEYQDPLQRQRDYVAKDEFNQDEYLTDILTDEAINYIAENKTKSDPFFMYVAYN 242
Query: 222 AVHAGNTYEPFQAPDEEVAKF----------LDISDPERRTYAGMVSRLDESVGNV---- 267
A H P QAP EE+A+F + SD + + A + +++ + V
Sbjct: 243 APHT-----PLQAPAEEIAQFKEENPNFESLVRNSDYLKESGAYELDKVERELKKVKDYD 297
Query: 268 ------------IAALRKH---GMLENSIV------------------------------ 282
+A R+H E+ IV
Sbjct: 298 YNSLSSSEKEEKVADYREHLLDKFTESRIVYATMVSNMDKNIGRIVAELKKDQKVFDNTM 357
Query: 283 -LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFH 340
+F++DNG ++ S NHPL +K + ++GG R I W +K +K ++
Sbjct: 358 IVFLSDNGGYTY---SKGAVNHPLYALKGSVFEGGHRVPFFIHWPNKIKSPKKYKYQISS 414
Query: 341 ISDWLPTLCAAAGIEINDT-SLDGVNQWD 368
+ D PTL AG + T LDG++ D
Sbjct: 415 L-DLYPTLVDLAGASVPSTKKLDGLSFMD 442
>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
Length = 454
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 125/360 (34%), Positives = 180/360 (50%), Gaps = 35/360 (9%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
+TTA P+I+ ILADDLGW D+ +G+S TPN+D LA G+ Q Y A+C+ +R
Sbjct: 7 DTTA--APNILFILADDLGWADLGVYGASDFATPNLDRLAAQGVRFTQAYANSAVCSATR 64
Query: 82 SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
AL+TG+Y + + GLP LP L+EAGY T IGKWHLG Y
Sbjct: 65 IALITGRYQYRLPAGLEEPIARSDIGLPPEHPTLPSLLREAGYDTALIGKWHLG-KPPTY 123
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGIYSTDLYT 198
P G+D +G G DY+ H +P G + ++ V + G Y+ L
Sbjct: 124 GPLKSGYDRFFGNIGGALDYFTH------KPGVGAQVPRDLWEGDVPVERTGYYTNILGD 177
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVAKFLDISD------PERR 251
EA+ V A ++ KP FL L A H P++ P DE V++ +I+D
Sbjct: 178 EASAYVRAREDEKKPFFLSLHFTAPHW-----PWEGPGDEAVSR--EITDLFHYDGGNLA 230
Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
Y +V LD +VG V+AAL + G +N++V+F +DNG F P G K+
Sbjct: 231 KYGELVEALDAAVGQVLAALDETGQADNTLVVFTSDNGGERF------SKTWPFTGQKTE 284
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVL 370
+GG+R + P + +V +++ +DWLPTL AAA + + D DG N VL
Sbjct: 285 LLEGGLRVPTLLRWP-ARVRPQVQAQVTITADWLPTLLAAAQVWPHPDYPSDGENILPVL 343
>gi|404450774|ref|ZP_11015752.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
gi|403763554|gb|EJZ24502.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
Length = 469
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 118/369 (31%), Positives = 178/369 (48%), Gaps = 52/369 (14%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+KP+I+ I+ADDLG D+ +G + IPTPN+D +A G+ N Y + +C PSR++LMT
Sbjct: 29 EKPNIVFIIADDLGIGDIGPYGQTIIPTPNLDKMASTGMKFNDFYAGSTVCAPSRASLMT 88
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+ H G H ++ + L K+LP+ LKEAGY GKW LG +P
Sbjct: 89 GQ---HTG--HTLVRGNGEYPLESGRKILPERLKEAGYTNAMFGKWGLGLANTHSSPEKV 143
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI--GIYSTDLYTEAAINV 204
G+D G+ H A ++ LD ++ + G Y+ DL+ E +I
Sbjct: 144 GWDHFVGHL--------HHVSAHYQKPDSLDCIIEGEISRMPVPKGKYANDLFVENSIEF 195
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD----------PE----- 249
I +P FLYL+ HA P+ FLD S PE
Sbjct: 196 IETQTDEQPFFLYLSLTVPHA-----ELLVPERYFQPFLDESGKSIFPSEKAHPEGLHYG 250
Query: 250 -----RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG------IHSN 298
+ YA +VS +D+ VG V++AL K G+ EN++V+F +DNG G +
Sbjct: 251 AQPYPKAAYAALVSSIDDYVGRVMSALEKIGIAENTLVIFTSDNGTHVEGGRRMEDVDYF 310
Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW--LPTLCAAAGIEI 356
K S H RG+K +DGG++ + W + ++ S+ F + W PTL AGIE+
Sbjct: 311 KSSAH-FRGVKRDLFDGGIK--VPLMVNWPGKIKEGSTSDFRGAFWDLFPTLSHLAGIEV 367
Query: 357 NDTSLDGVN 365
+ DG++
Sbjct: 368 DRDDTDGIS 376
>gi|408672294|ref|YP_006872042.1| sulfatase [Emticicia oligotrophica DSM 17448]
gi|387853918|gb|AFK02015.1| sulfatase [Emticicia oligotrophica DSM 17448]
Length = 478
Score = 162 bits (411), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 125/375 (33%), Positives = 190/375 (50%), Gaps = 35/375 (9%)
Query: 5 RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+K F ++ +L + T P+II+I DD+G+ D+S +G+ TPNID LA
Sbjct: 2 KKIFLSILTIILLQSS---TKKELSPNIILIFMDDMGYGDLSCYGAIDYRTPNIDKLASE 58
Query: 65 GL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
G+ N QA+C+ SR+AL+TG YP IG+ G ++ G+ E L + LK+ G
Sbjct: 59 GVRFTNFLAAQAVCSASRAALLTGCYPNRIGIS-GALMPTSKNGINPEETTLAEMLKQKG 117
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGL-QDYYDHSCKATFEPYQGLDMRH 180
YAT GKWHLG + + P GFD + G Y + YYD + + +
Sbjct: 118 YATGIFGKWHLGHHK-TFLPLQHGFDEYVGLPYSNDMWPVYYDGKPATSTQHMSAYPVLP 176
Query: 181 NMQVDNKTIGIYSTD-------LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQ 233
+Q ++K I + D LYTE A+ I ++NK KP FLYL H H P
Sbjct: 177 LIQNNDKKEEIKTLDDQAKLTTLYTEKAVEFIKKNNK-KPFFLYLPHSMPHV-----PIA 230
Query: 234 APDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PS 292
D+ K ++ YA ++ +D SVG ++ L+++G+ +N++V+F +DNG +
Sbjct: 231 VSDKFKGK------SKQGLYADLMMEIDWSVGEIMKTLKENGLDKNTLVIFTSDNGPWLN 284
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCA 350
FG H+ GS+ R K T ++GG R V I W T + V ++L D LPT+
Sbjct: 285 FGNHA--GSSGGFREGKGTSFEGGQR-VPCIMR-WKGTTPEGVVCNQLASTIDVLPTIAR 340
Query: 351 AAGIEINDTSLDGVN 365
G E+ +DGVN
Sbjct: 341 ITGAELPKNKIDGVN 355
>gi|406830821|ref|ZP_11090415.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 497
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 128/360 (35%), Positives = 174/360 (48%), Gaps = 29/360 (8%)
Query: 10 ALTCTLLFN-DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
A C LLF L P+II+I+ DD G+ D S HG+ + TP++D L G+
Sbjct: 25 ASICLLLFTWGGRLPRLHAAPPNIIVIITDDQGYGDFSCHGNPVLKTPHLDRLHDEGVRF 84
Query: 69 NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
+V C P+RSA+MTG++ G+ H ILE E L LT LPQ LK GY T
Sbjct: 85 LDFHVSPTCAPTRSAIMTGRHEFRNGVTH-TILERER--LTLTATTLPQVLKTVGYTTGI 141
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYYDHSC-KATFEPYQGLDMRHNMQVDN 186
GKWHLG Y P RGFD + + G+ Y SC A Y + HN +
Sbjct: 142 FGKWHLG-DEPAYRPGQRGFDETFIHGGGGIGQTYPGSCGDAPGNTYFDPAIMHNGMFE- 199
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA-PDEEVAKFLDI 245
KT G Y TD++ A+ I +P F Y+A A H P Q P++E +
Sbjct: 200 KTQG-YCTDVFFGQAMTWINTVKGRQPFFCYIATNAPHG-----PLQVRPEDEQRYRGKV 253
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP-SFGIHSNKGSNHP 304
+D + GM+S +D++VG ++A L G+ ++V+FM DNG IH N
Sbjct: 254 ADSNVAKFFGMISNIDDNVGRLLAKLTDWGIERETLVVFMNDNGGTVGVAIH-----NAG 308
Query: 305 LRGMKSTPWDGGMRGVAAIWS-PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
+RG K TPW GG R A+ W P + V S H+ D PT+ EI + LDG
Sbjct: 309 MRGQKGTPWLGGTR-AASFWRWPGTLKPVDVRSPCAHL-DVFPTIA-----EIAEIKLDG 361
>gi|354504055|ref|XP_003514094.1| PREDICTED: arylsulfatase A [Cricetulus griseus]
gi|344258436|gb|EGW14540.1| Arylsulfatase A [Cricetulus griseus]
Length = 518
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 139/448 (31%), Positives = 210/448 (46%), Gaps = 39/448 (8%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMTGK 88
P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCTPSR+AL+TG+
Sbjct: 21 PNILLIFADDLGYGDLGAYGHPSSSTPNLDQLANGGLRFTDFYVPVSLCTPSRAALLTGR 80
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTFRG 147
P+ GM GV+ GLPL E L + L GY T GKWHLG E + P +G
Sbjct: 81 LPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQG 140
Query: 148 FD-------SH-YGYWQGLQDY-YDHSCKATF-EPYQGLDMRHNMQVDNKTIGIYSTDL- 196
F SH G Q L + D C + + Y + + N+ V+ + + +
Sbjct: 141 FHRFLGIPYSHDQGPCQNLTCFPPDIPCSGSCDQGYVPIPLLANLSVEAQPPWLPGLEAR 200
Query: 197 YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
Y + +++A+ +S+P FLY A H Y F F SD R +
Sbjct: 201 YVSFSRDLMADAQRQSRPFFLYYASHHTH----YPQFSG-----ESFSQQSD--RGPFGD 249
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
+ LD ++G ++ A+ G+L ++V+F ADNG P SN G + LR K T ++G
Sbjct: 250 SLMELDAAIGALMTAVGDLGLLGETLVIFTADNG-PELMRMSNGGCSGLLRCGKGTTFEG 308
Query: 316 GMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA 374
G+R A A W + T V+ EL D LPTL A G + + +LDGV+ +L
Sbjct: 309 GVREPALAFWPGHI--TPGVTHELASSLDLLPTLAALTGAPLPNVTLDGVDISPLLLGTG 366
Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA-----GTDNNGQSDEWYGDTDNEIDKY 429
K+ R + P + + V D KY A G+ ++ + + N + +
Sbjct: 367 KSPRKSVFF---YPPYPDEVHGVFAVRDGKYKAHFFTQGSAHSDTTSDSACHAANPLTSH 423
Query: 430 SPKEV--LYSKAGITFNALKTKLQIKQK 455
P + L G +N L+T +I +
Sbjct: 424 EPPLLYDLSEDPGENYNLLETMDEISPE 451
>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 508
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 131/404 (32%), Positives = 183/404 (45%), Gaps = 59/404 (14%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGS----SQIPTPNIDALAYNGLILNQHYV-QALCTPS 80
A + P++I ILADDLG ND+S G ++ TPNID LA G + + Y C PS
Sbjct: 15 AVRPPNVIFILADDLGINDISTFGGGMADGRVQTPNIDRLAAEGALFSTAYSGTGTCAPS 74
Query: 81 RSALMTGKYPIHIGMQHGVILEG-----------------------------EPW---GL 108
R+ LMTG+YP G ++ G P+ GL
Sbjct: 75 RAMLMTGRYPTRTGFEYTPTPPGMSRIVPMFANDMKTGLPPTEQVKENEKLMPPFAEQGL 134
Query: 109 PLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL----QDYYDH 164
P E L + LK+ GY T IGKWHLG + P +GFD GL D
Sbjct: 135 PTEEVTLAEVLKDRGYHTVHIGKWHLG-NTSPFRPNDQGFDESLDMASGLFLPPGDPRGV 193
Query: 165 SCKATFEP-----YQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
+ F+P + +D + D G Y TD +T+ ++ VI + NK++P FLYL
Sbjct: 194 EARLDFDPIDKFLWARMDFAASYNGSDWFEPGGYLTDYWTDESLKVI-DANKNRPFFLYL 252
Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
AH VH P QA E+ DI R YA M+ +D SVG ++ L + G+ +
Sbjct: 253 AHWGVHT-----PLQATKEDYEAVGDIEPHRLRVYAAMLRAVDRSVGRIMEKLEEEGLAD 307
Query: 279 NSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE 337
N+I++F +DNG + GI N P RG K T ++GG+R + P Q V E
Sbjct: 308 NTIIVFSSDNGGAGYIGIPE---VNAPYRGWKITLFEGGIRVPLFMKWPGTIQPGTVIDE 364
Query: 338 LFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSE 380
D +PTL AA G + + +DG N + T R +
Sbjct: 365 PVAHIDLMPTLAAATGARLPSSVEIDGQNLLPLATGTGPITRPD 408
>gi|421614583|ref|ZP_16055637.1| arylsulfatase A [Rhodopirellula baltica SH28]
gi|408494670|gb|EKJ99274.1| arylsulfatase A [Rhodopirellula baltica SH28]
Length = 478
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 115/356 (32%), Positives = 177/356 (49%), Gaps = 41/356 (11%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
A + P+ ++I ADDLG+ D+S + SS + TP++D LA G +V A +C+PSR+A
Sbjct: 38 VADRPPNFVLIFADDLGYGDISCYDSSGVKTPHLDQLAAEGFRSTDFFVPANVCSPSRAA 97
Query: 84 LMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
L+TG+YP+ GM ++ + + + +G E +P+ L AGY + +GKWHLG E
Sbjct: 98 LLTGRYPMRCGMPVARNENVAKYKAYGFSPDEITIPELLGPAGYRSLMVGKWHLGMELEG 157
Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD---MRHNMQVDNKTIGIYS-TDL 196
P GFD + G + +EP +G + + QV+ K + T
Sbjct: 158 SHPLDAGFDEYLG------------IPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKR 205
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
YT+ I+ I E K P F+Y++H VH N +P +PD F+ S E+ Y
Sbjct: 206 YTDEVIDFI-ERQKDDPFFIYVSHHIVH--NPLKP--SPD-----FIGTS--EKGKYGDF 253
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
+ LD S G ++ +R G+ EN++V+F +DNG + GS+ L G K +GG
Sbjct: 254 IKELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP------TRNGSSGELSGGKYCTMEGG 307
Query: 317 MRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
R W+ + Q VS D LP C AG+ I +D +DG + VL
Sbjct: 308 HRVPGMFRWTSKIAPNQ-VSDVTLTSMDLLPLFCELAGVPIPDDRQIDGKSILPVL 362
>gi|423315320|ref|ZP_17293250.1| hypothetical protein HMPREF1058_03862 [Bacteroides vulgatus
CL09T03C04]
gi|423315328|ref|ZP_17293258.1| hypothetical protein HMPREF1058_03870 [Bacteroides vulgatus
CL09T03C04]
gi|392679557|gb|EIY72938.1| hypothetical protein HMPREF1058_03862 [Bacteroides vulgatus
CL09T03C04]
gi|392679565|gb|EIY72946.1| hypothetical protein HMPREF1058_03870 [Bacteroides vulgatus
CL09T03C04]
Length = 464
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 129/399 (32%), Positives = 188/399 (47%), Gaps = 40/399 (10%)
Query: 12 TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
T +LF+ A L+ TTA K P++I I+ADDLG D+ +G QI TPNID +A
Sbjct: 4 TRKILFSAALLSSGLTMARTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
NG+ QHY + + PSR AL+TGK+ H ++ + G LP E +
Sbjct: 64 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
K Y T +GKW +G P GFD YGY L + HS F +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGVPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPF 232
+ +D K YS DL E A+N I E N KP FLY + HA G +
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEAMTEY 232
Query: 233 QAPDEEVA------KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMA 286
+ E + +P R YA MV+ LD+SVG +I L++ G+ +++I++F +
Sbjct: 233 EGEFCETPFGGSRDGYKSQQNP-RAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTS 291
Query: 287 DNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
DNG S G H S SN P RG K ++GG+R I P + V++ + D+
Sbjct: 292 DNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDF 351
Query: 345 LPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
LPT+ +I ++DG++ LT K + ++
Sbjct: 352 LPTIGELVQADI-PQNIDGISYLPTLTGKGTQKEHDCIY 389
>gi|448409003|ref|ZP_21574483.1| sulfatase [Halosimplex carlsbadense 2-9-1]
gi|445673783|gb|ELZ26342.1| sulfatase [Halosimplex carlsbadense 2-9-1]
Length = 496
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 126/390 (32%), Positives = 181/390 (46%), Gaps = 59/390 (15%)
Query: 27 PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQAL-CTPSRSALM 85
P++P+I+++ DDLGW D F GS TPN+DALA G++ Y A C PSR+
Sbjct: 47 PERPNIVVVYVDDLGWLDTGFQGSQYYDTPNLDALADEGVVATSGYANAPNCAPSRACFN 106
Query: 86 TGKYPIHIGMQHGVILEGEPWGL--------PLTEKL------LPQYLKEAGYATHAIGK 131
TG+Y G+ G + + P P +E+L + + L +AGYAT +GK
Sbjct: 107 TGQYTPRHGV--GTVTDPPPRRPEARRLETPPNSERLGTDNATIAEVLSDAGYATGFVGK 164
Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
WHLG + P RGFD + G D F PY+ N+ D G
Sbjct: 165 WHLGEADSRHGPLARGFDRNVG-----GDIVGRPPAGYFSPYR----LPNVTDD----GG 211
Query: 192 YSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS---- 246
Y D T+ AI +A+H P FL + +VH P QAPD+ V + D+
Sbjct: 212 YLPDRLTDDAIEFVADHAGGPDPFFLLFSTYSVHG-----PLQAPDDAVDAYRDVDCWNG 266
Query: 247 --DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG---- 300
+P TY MVS LD +VG ++AAL + G+ E ++V+F +DNG G + G
Sbjct: 267 QCNP---TYGAMVSNLDANVGRLLAALDERGIREETLVVFTSDNGG--LGDYDEIGLPRD 321
Query: 301 -----SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
S PLRG K T ++GG+R + P + + SS +D+ PT AG
Sbjct: 322 WKRVTSQGPLRGGKRTLYEGGIRVPTVVSWPGVVPGGRRSSTPIAGTDFFPTFVDLAGAH 381
Query: 356 ---INDTSLDGVNQWDVLTKGAKTKRSEIL 382
+D +DG + LT G R +
Sbjct: 382 DAVPDDHVVDGRSLVPWLTDGTPPDRDALF 411
>gi|150002770|ref|YP_001297514.1| arylsulfatase [Bacteroides vulgatus ATCC 8482]
gi|149931194|gb|ABR37892.1| arylsulfatase precursor [Bacteroides vulgatus ATCC 8482]
Length = 464
Score = 162 bits (409), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 129/399 (32%), Positives = 188/399 (47%), Gaps = 40/399 (10%)
Query: 12 TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
T +LF+ A L+ TTA K P++I I+ADDLG D+ +G QI TPNID +A
Sbjct: 4 TRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
NG+ QHY + + PSR AL+TGK+ H ++ + G LP E +
Sbjct: 64 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
K Y T +GKW +G P GFD YGY L + HS F +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPF 232
+ +D K YS DL E A+N I E N KP FLY + HA G +
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEAMTEY 232
Query: 233 QAPDEEVA------KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMA 286
+ E + +P R YA MV+ LD+SVG +I L++ G+ +++I++F +
Sbjct: 233 EGEFCETPFGGSRDGYKSQQNP-RAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTS 291
Query: 287 DNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
DNG S G H S SN P RG K ++GG+R I P + V++ + D+
Sbjct: 292 DNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDF 351
Query: 345 LPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
LPT+ +I ++DG++ LT K + ++
Sbjct: 352 LPTIGELVQADI-PQNIDGISYLPTLTGKGTQKEHDCIY 389
>gi|212690903|ref|ZP_03299031.1| hypothetical protein BACDOR_00391 [Bacteroides dorei DSM 17855]
gi|237712609|ref|ZP_04543090.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
gi|212666135|gb|EEB26707.1| arylsulfatase [Bacteroides dorei DSM 17855]
gi|229453930|gb|EEO59651.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
Length = 464
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 187/402 (46%), Gaps = 46/402 (11%)
Query: 12 TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
T +LF+ A L+ TTA K P++I I+ADDLG D+ +G QI TPNID +A
Sbjct: 4 TRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
NG+ QHY + + PSR AL+TGK+ H ++ + G LP+ E +
Sbjct: 64 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPVGEVTVAD 123
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
K Y T +GKW +G P GFD YGY L + HS F +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
+ +D K YS DL E A+N I E N KP FLY + HA Y
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEVMTEY 232
Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
E PF + R YA MV+ LD+SVG +I L++ G+ +++I++
Sbjct: 233 EGEFCETPFGGSKDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 288
Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
F +DNG S G H S SN P RG K ++GG+R I P + V++ +
Sbjct: 289 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 348
Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
D+LPT+ +I ++DG++ LT K + ++
Sbjct: 349 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGIQKEHDCIY 389
>gi|167762012|ref|ZP_02434139.1| hypothetical protein BACSTE_00358 [Bacteroides stercoris ATCC
43183]
gi|167700104|gb|EDS16683.1| arylsulfatase [Bacteroides stercoris ATCC 43183]
Length = 463
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 118/347 (34%), Positives = 174/347 (50%), Gaps = 27/347 (7%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPS 80
L T KP+II ILADD+G+ D+S +G+ I TPNID LA G Q Y + +PS
Sbjct: 27 LVTDGGDKPNIIFILADDMGYCDLSCYGNKYIETPNIDRLAATGTAFTQCYAGSGISSPS 86
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK-----------EAGYATHAI 129
R ALMTGK + ++ + G GL T+ + +L+ AGY T +
Sbjct: 87 RCALMTGKNTGNTTIRDNFCIAGGIEGLKGTKTIRRMHLQPNDTTIATVLGAAGYRTCLV 146
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR--HNMQVDNK 187
KWHL F TP RGFD YG+ +S + PY + N++ +
Sbjct: 147 NKWHLDGFNPEATPLNRGFDEFYGWLISTA----YSNDPYYYPYWRFNNEKLENVKENEG 202
Query: 188 TIGI-YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
I ++TDL TE AI I NK+ P FLYLA+ A H EP+ DE +
Sbjct: 203 DKHIKHNTDLSTEDAIKFI-NRNKNNPFFLYLAYDAPH-----EPYNI-DETTWYDDEAW 255
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
D + YA +++ +D ++G ++A L + G+ EN++V+F +DNGA G L+
Sbjct: 256 DMNTKRYASLITHMDRAIGRLLAELDRLGLRENTLVIFASDNGAAKQAPLEELGCKGSLK 315
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
GMK ++GG+R + P QK+++ + + D +PTL A AG
Sbjct: 316 GMKGQLYEGGIRVPFIVNQPGKVPVQKLNN-IIYFPDVMPTLAALAG 361
>gi|32477898|ref|NP_870892.1| arylsulphatase A [Rhodopirellula baltica SH 1]
gi|32448455|emb|CAD77970.1| arylsulphatase A [Rhodopirellula baltica SH 1]
Length = 478
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 115/355 (32%), Positives = 177/355 (49%), Gaps = 41/355 (11%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A + P+ ++I ADDLG+ D+S + SS + TP++D LA G +V A +C+PSR+AL
Sbjct: 39 ADRPPNFVLIFADDLGYGDISCYDSSGVKTPHLDQLAAEGFRSKDFFVPANVCSPSRAAL 98
Query: 85 MTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
+TG+YP+ GM ++ + + + +G E +P+ L AGY + +GKWHLG E
Sbjct: 99 LTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGMELEGS 158
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD---MRHNMQVDNKTIGIYS-TDLY 197
P GFD + G + +EP +G + + QV+ K + T Y
Sbjct: 159 HPLDAGFDEYLG------------IPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKRY 206
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
T+ I+ I E K P F+Y++H VH N +P +PD F+ S E+ Y +
Sbjct: 207 TDEVIDFI-ERQKDDPFFIYVSHHIVH--NPLKP--SPD-----FVGTS--EKGKYGDFI 254
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LD S G ++ +R G+ EN++V+F +DNG + GS+ L G K +GG
Sbjct: 255 KELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP------TRNGSSGELSGGKYCTMEGGH 308
Query: 318 RGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
R W+ + Q VS D LP C AG+ I +D +DG + VL
Sbjct: 309 RVPGMFRWTSKIAPNQ-VSDVTLTSMDLLPLFCELAGVPIPDDRQIDGKSILPVL 362
>gi|168334944|ref|ZP_02693063.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Epulopiscium sp.
'N.t. morphotype B']
Length = 594
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 125/391 (31%), Positives = 189/391 (48%), Gaps = 41/391 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
KKP++IIIL DD G+ D++ HG+ + TPN+D L N + L + LC PSR+AL++G
Sbjct: 3 KKPNVIIILTDDQGYGDMAAHGNPWLKTPNLDWLNDNAISLESFHTDPLCAPSRAALISG 62
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
KY G+ L G + P K + Y K+ GYAT G WHLG Y P RG
Sbjct: 63 KYSFGAGVYS--TLNGRYYMDP-NIKTIANYFKDGGYATAMFGXWHLGDVYP-YRPENRG 118
Query: 148 FDSHYGYWQGL----QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
FD + G+ DY+++ D + + N Y TD++ ++A
Sbjct: 119 FDVATIFGGGVIGETPDYWNNDY---------FDDTYQVNGKNTKFEGYCTDVWFDSATK 169
Query: 204 VIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT-YAGMVSRLD 261
I + +KP F YL A H PF K++D+ P +R + GM+ +D
Sbjct: 170 FIDDQLAATKPFFCYLPTNAPHG-----PFNIAPSYYQKYIDMGVPPKRAKFFGMIEGID 224
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK------GSNHPLRGMKSTPWDG 315
E++G ++ LR +++I++F DNG + G ++K G N +RG K T ++G
Sbjct: 225 ENIGKLMDHLRAKNAFDDTIIIFFGDNGTAT-GCDTDKNGWVKDGFNAGMRGKKGTTYEG 283
Query: 316 GMRGVAAIWSP--WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG 373
R I +P L + +KV L D LPTL G+ + +LDGV+ L G
Sbjct: 284 AHRNTCFITTPKNILGEARKVYG-LTAQFDLLPTLIDICGLPEGE-NLDGVSMAQALKAG 341
Query: 374 A---KTKRSEILHNIDNVDNPQKY--YAALR 399
R ++HN+ +D PQKY Y L+
Sbjct: 342 ETHLNEDRVIMIHNM-QLDMPQKYKDYTVLK 371
>gi|255690671|ref|ZP_05414346.1| twin-arginine translocation pathway signal [Bacteroides finegoldii
DSM 17565]
gi|260623695|gb|EEX46566.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
Length = 453
Score = 162 bits (409), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 117/367 (31%), Positives = 190/367 (51%), Gaps = 31/367 (8%)
Query: 23 NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
NT P++++IL DDLG D+S + + TPNID + G+ L+ Y ++ +PSR
Sbjct: 24 NTVDKAIPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSR 83
Query: 82 SALMTGKYPIHIGMQHGVILEG--EPWG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
+AL+TG +P +G+ GVI + WG + +P+ LK GY T IGKWHLG +
Sbjct: 84 AALLTGCFPAMVGVP-GVIRPSIDQNWGYFGPSAVTMPEVLKNGGYRTALIGKWHLG-WE 141
Query: 139 EVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
P RGFD +G+ + DYY H + Y ++ ++D K ++T+L+
Sbjct: 142 SPNLPNERGFDHFHGFLADMMDDYYTHRRQGGNYMY-----LNDKEIDPKG---HATELF 193
Query: 198 TEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRT-Y 253
T +++ I + K K P FLYLA+ A H+ P Q P E V K D S P +R
Sbjct: 194 TSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPVKRARL 248
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
++ LD ++G VI +L + G L N++V+F +DNG G + +N P RG K +
Sbjct: 249 IALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNG----GDRGSMANNGPTRGAKGDMF 304
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG 373
+GG+ A+ P + + + + + D +PT+C + + +DG++ D +
Sbjct: 305 EGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTICDFVNVPVKH-EIDGISVLDAIK-- 361
Query: 374 AKTKRSE 380
KT+ +E
Sbjct: 362 GKTQNTE 368
>gi|338210219|ref|YP_004654266.1| Cerebroside-sulfatase [Runella slithyformis DSM 19594]
gi|336304032|gb|AEI47134.1| Cerebroside-sulfatase [Runella slithyformis DSM 19594]
Length = 490
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 121/354 (34%), Positives = 185/354 (52%), Gaps = 28/354 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSALMTGK 88
P+II+I DDLG+ D+S +G+ Q TPN+D LA G+ N QA+C+ SR+AL+TG
Sbjct: 34 PNIILIFMDDLGYGDLSCYGALQYRTPNLDRLAGEGIRFTNFLAAQAVCSASRAALLTGC 93
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP +G+ G + GL E + + LK+ GYAT GKWHLG R + PT +GF
Sbjct: 94 YPNRVGIS-GALFPNAKVGLNPEETTIAELLKDKGYATGMFGKWHLG-DRPEFLPTRQGF 151
Query: 149 DSHYG--YWQGL-QDYYDHSCKATF-----EPYQGLDMRHNMQV--DNKTIGIYS--TDL 196
D + G Y + +YD S +T + + L + HN + KT+ + T +
Sbjct: 152 DEYVGLPYSNDMWPVHYDGSPISTTSDNRKKNFPFLPLLHNSDTLQEIKTLDDQAKLTGI 211
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
YTE A+N I H KS P F+YL H H P AKF S ++ TY +
Sbjct: 212 YTERAVNFINRHKKS-PFFVYLPHSMPH---------VPIAASAKFKGKS--KQGTYGDV 259
Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
+ +D SVG ++ AL+++G+ +N++V+F +DNG P ++ GS+ LR K ++GG
Sbjct: 260 LMEIDWSVGEIMKALKENGLDKNTVVIFTSDNG-PWLNYGNHAGSSGGLREGKGNSFEGG 318
Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
R + + V ++L D LPT+ G ++ +DGV+ +L
Sbjct: 319 QRVPCIVRWKGITPAGLVCNKLTSTIDLLPTIAHICGTKLPARPIDGVDILPLL 372
>gi|326438104|gb|EGD83674.1| hypothetical protein PTSG_04280 [Salpingoeca sp. ATCC 50818]
Length = 693
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 127/412 (30%), Positives = 198/412 (48%), Gaps = 51/412 (12%)
Query: 28 KKPHIIIILADDLGWNDV-------SFHGSSQ-IPTPNIDALAYNGLILNQHYVQ-ALCT 78
+KP+ I++ ADDLG N++ + G++ I TPN+ LA G++ + Y +C+
Sbjct: 44 RKPNFIVLFADDLGVNEIRLDNVPNGYTGNNHTIATPNLQRLANEGMVFHNWYSAFHVCS 103
Query: 79 PSRSALMTGKYPIHIG-----MQHGVILEGEPW-----------GLPLTEKLLPQYLKEA 122
PSR+++MTG+Y I G +++ G P GLPL E + + L
Sbjct: 104 PSRASMMTGRYSIRSGIGSSPLKYAPDAPGGPLPSQVFSAEAVGGLPLNETTIAEALLPQ 163
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW----QGLQDY--YDHSCKATFEPYQGL 176
GYAT AIGKWHLG R++Y PT RGF+ +YG G+ + +D++ A EP
Sbjct: 164 GYATMAIGKWHLG-QRDIYLPTARGFEEYYGIPFSQDMGMSFWLPFDYNGIAPMEPTPIP 222
Query: 177 DMRHNMQVDNKTIGIYSTDLYTEAAIN-VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
++ N V+ D Y + A V+ K +P FLYL VH N+ P
Sbjct: 223 KLKGNAVVEQPAALDKLADEYAQVAKEFVMRNAQKDRPFFLYLPFNHVHTPNSCSP---- 278
Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
+F S + V +D +G V+ AL+ G+ +N++VLF +DNGAP +
Sbjct: 279 -----RFCSTS--KSGPVGDAVQEMDWVIGEVMDALKHAGVDDNTLVLFTSDNGAP---L 328
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
+++ N PLR K++ W+GG + W P S L D T+ AG +
Sbjct: 329 ANDEQGNLPLRDGKASTWEGGFKAPGIAWWPNRIPAGSQSKALVSTLDVFTTVITMAGGD 388
Query: 356 I-NDTSLDGVNQWDVLTKGAKTKRSEIL--HNIDNVDNPQKYYAALRVDDLK 404
+ D +DG++ +L G+K I+ HN V+ + Y A+R D K
Sbjct: 389 VPTDRIIDGIDLSPILFNGSKQGHDCIMFYHNAAAVNASGELY-AMRCGDHK 439
>gi|224536571|ref|ZP_03677110.1| hypothetical protein BACCELL_01446 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521827|gb|EEF90932.1| hypothetical protein BACCELL_01446 [Bacteroides cellulosilyticus
DSM 14838]
Length = 450
Score = 161 bits (408), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 121/388 (31%), Positives = 190/388 (48%), Gaps = 38/388 (9%)
Query: 13 CTL-LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQH 71
CTL L +A T +KP+++ I DDLGW+DV ++GS+ + TPNID LA G+
Sbjct: 13 CTLPLGVNALAAQTQQEKPNVLFIAVDDLGWSDVGYNGSTLVATPNIDRLASMGVSFTDG 72
Query: 72 YVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
YV A + PSR+ +++G Y GMQ L+ +P K LP+ ++EAGY T +G
Sbjct: 73 YVTAPISGPSRNGMVSGMYSQKYGMQINADLKFAQ--VPAQHKTLPETMREAGYRTALVG 130
Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
KWH+ V FD Y +Y+ S P + +N+
Sbjct: 131 KWHVCRDANVV------FDEVYDRIDISSNYFPDSTGVYDGPRLPILAVKESTYENE--- 181
Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
Y TD T AI + + + + P FLYL + AVH P+QA + + +I D
Sbjct: 182 -YMTDKLTSHAIQFMNKQSNATPFFLYLGYNAVH-----NPWQAEKKYYDRLSNIKDEYM 235
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA--------------PSFGIH 296
R A +++ DE++G ++ L +++N+I++F++DNG PSF
Sbjct: 236 RVQASLIACADENIGILLNYLENRKLIDNTIIVFVSDNGPAKGGPELKTWEGYDPSF--E 293
Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
G LRG K ++GG+R I + P L++ KV ++ D+ PT+C G E
Sbjct: 294 YVFGQMKTLRGHKVDLYEGGIRTPMIIAYKPLLREG-KVFRDMISTLDFYPTICEMTGSE 352
Query: 356 IND-TSLDGVNQWDVLTKGAKTKRSEIL 382
I + T+LDGV+ L + K + ++L
Sbjct: 353 IPEGTTLDGVSLVRYLREETKQQPHDML 380
>gi|294778194|ref|ZP_06743624.1| arylsulfatase [Bacteroides vulgatus PC510]
gi|294447960|gb|EFG16530.1| arylsulfatase [Bacteroides vulgatus PC510]
Length = 466
Score = 161 bits (408), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 186/402 (46%), Gaps = 46/402 (11%)
Query: 12 TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
T +LF+ A L+ TTA K P++I I+ADDLG D+ +G QI TPNID +A
Sbjct: 6 TRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 65
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
NG+ QHY + + PSR AL+TGK+ H ++ + G LP E +
Sbjct: 66 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 125
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
K Y T +GKW +G P GFD YGY L + HS F +
Sbjct: 126 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 178
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
+ +D K YS DL E A+N I E N KP FLY + HA Y
Sbjct: 179 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEAMTEY 234
Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
E PF + R YA MV+ LD+SVG +I L++ G+ +++I++
Sbjct: 235 EGEFCETPFGGSRDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 290
Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
F +DNG S G H S SN P RG K ++GG+R I P + V++ +
Sbjct: 291 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 350
Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
D+LPT+ +I ++DG++ LT K + ++
Sbjct: 351 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGTQKEHDCIY 391
>gi|326802120|ref|YP_004319939.1| sulfatase [Sphingobacterium sp. 21]
gi|326552884|gb|ADZ81269.1| sulfatase [Sphingobacterium sp. 21]
Length = 471
Score = 161 bits (407), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 131/397 (32%), Positives = 192/397 (48%), Gaps = 65/397 (16%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
KKP+II ILADDLG+ DV F+G I TPN+D LA G+ NQ Y A+C PSRS+LM+
Sbjct: 24 KKPNIIFILADDLGYGDVGFNGQRLINTPNLDRLAKEGIRFNQFYAGTAVCAPSRSSLMS 83
Query: 87 GKYPIHIGMQH--GVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
G++ H ++ GV EG+ + +P + + + + LK+AGY T A GKW LG P
Sbjct: 84 GQHTGHTFIRGNLGVEPEGQ-YPIPDSVETIAETLKKAGYQTGAFGKWGLGPVGSEGDPN 142
Query: 145 FRGFDSHYGY-WQGLQDYY------DHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
+GFD YGY Q L Y D+S K + D+ ++ Q Y+ DL
Sbjct: 143 KQGFDRFYGYNCQSLAHRYYPTHLWDNSEKVVLT--ENGDLEYHKQ--------YAPDLI 192
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER------- 250
E A+ I EH K +P FL+L ++ HA PD+ + ++ E+
Sbjct: 193 QEQALGFIEEH-KDRPFFLFLPYILPHA-----ELLVPDDSIFQYYKGKFEEKPFKGFDY 246
Query: 251 ----------------RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA---- 290
T+A MV+RLD VG + LR G+ EN++++F +DNG
Sbjct: 247 GKGAKSGGYASQQYPHSTFAAMVARLDLYVGQIEEKLRSLGLDENTLIVFTSDNGPHKEG 306
Query: 291 ---PSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLP 346
P+F S+ G LRG+K ++GG+R A W ++ Q+ S + D P
Sbjct: 307 GADPNF-FQSSGG----LRGIKRDLYEGGIRVPFVAKWPKAIRAGQQ-SEFIGAFWDIFP 360
Query: 347 TLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
T G S DG++ + L K+ E L+
Sbjct: 361 TFAELGGA-ATAASTDGISFVNTLKGQGTQKQHEYLY 396
>gi|440750638|ref|ZP_20929879.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
gi|436480856|gb|ELP37068.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
Length = 478
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/348 (33%), Positives = 171/348 (49%), Gaps = 30/348 (8%)
Query: 24 TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRS 82
T A +P++I+I DD+G+ D+ +G+ I TPN+D +A NG+ YV QA+C+ SR+
Sbjct: 27 TLAQDRPNVILIFVDDMGYGDLGAYGAEDIATPNLDQMAENGVKFTNFYVSQAVCSASRA 86
Query: 83 ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
+L+TG Y +G+ HG + GL E + + LKE GY T GKWHLG +
Sbjct: 87 SLLTGTYANRLGI-HGALDHTAKHGLNPEEITIAEMLKEQGYQTAMFGKWHLG-HHPPFL 144
Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL-------DMRHNMQVDNKTIGIYSTD 195
PT +GFD YG + H + P L D HN Q +T+
Sbjct: 145 PTKQGFDEFYGIPYSNDMWPQHPQTKNYYPPLPLFFNDNVVDTLHNDQSR------LTTE 198
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
L TE A+N I + K +P FLYLAH H P KF S +R Y
Sbjct: 199 L-TERAVNFI-QRKKDQPFFLYLAHPMPH---------VPLFVSEKFRGKS--KRGLYGD 245
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
++ +D SVG ++ LR +G+ E ++V+F++DNG P + GS +PLR K T WDG
Sbjct: 246 VLMEIDWSVGQILKTLRDNGLEEKTLVIFISDNG-PWLSFSGHAGSAYPLREGKGTSWDG 304
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
G++ A ++ V + D LPT+ ++ +DG
Sbjct: 305 GVKVPALMYWKGKIPAGTVQTNPAMTIDMLPTIAKLTRADLPKLPIDG 352
>gi|294011191|ref|YP_003544651.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
gi|292674521|dbj|BAI96039.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
Length = 468
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/353 (32%), Positives = 179/353 (50%), Gaps = 30/353 (8%)
Query: 25 TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
+ P++P+I+ I+ADD+G D+ +GS I TP IDA+A G+ Y + +C+P+R A
Sbjct: 33 SGPRRPNILYIMADDMGHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIA 92
Query: 84 LMTGKYP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
L+TG+Y IG++ + G+ LP + LP L++ GYAT +GKWH+G
Sbjct: 93 LLTGRYQGRFRIGLEEPIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVGEL-PAS 151
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKAT-FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
+P G+D +G G DY+ H+ E + + R +Q Y TDL
Sbjct: 152 SPLDHGYDYFFGIASGGTDYFAHATTINGHEMGKLFENRTEIQRPG-----YLTDLLGAK 206
Query: 201 AINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER------RTY 253
AI+ + + + +P F+ L A H P+Q P + A+ +DP + R Y
Sbjct: 207 AIDRMQQAARQDRPFFISLHFTAPH-----WPWQGPG-DAAQGSAAADPRQMDGGNVRAY 260
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
A M+ +D +VG V+A L G+ E++IV+F +DNG F + PL GMK
Sbjct: 261 AAMMEGMDANVGRVLAELAALGLEEDTIVIFTSDNGGERF------SDSWPLTGMKGELL 314
Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVN 365
+GG+R + P + S+++ D LPTL +AAG + LDG++
Sbjct: 315 EGGIRVPLIVSWPAHLPKGRDSAQVTMSMDALPTLLSAAGGDPGRVEGLDGID 367
>gi|291225019|ref|XP_002732502.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 197
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 79/160 (49%), Positives = 99/160 (61%), Gaps = 2/160 (1%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
+PH+I+I DD+GWNDV +H + I PN+ LA +G+I NQ Y CTPSR+A+M G
Sbjct: 26 QPHVILIFVDDMGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMKGL 84
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP G QH ++ P G+PL KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGF
Sbjct: 85 YPFKTGNQHQMVFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYQPTNRGF 144
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
DSHYG W YD P G D R NM V K+
Sbjct: 145 DSHYGLWTLGVGNYD-KMNGVLSPSAGYDFRDNMGVVPKS 183
>gi|440715401|ref|ZP_20895948.1| arylsulfatase A [Rhodopirellula baltica SWK14]
gi|436439745|gb|ELP33159.1| arylsulfatase A [Rhodopirellula baltica SWK14]
Length = 455
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/354 (31%), Positives = 175/354 (49%), Gaps = 39/354 (11%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A + P+ ++I ADDLG+ D+S + SS + TP++D LA G +V A +C+PSR+AL
Sbjct: 16 ADRPPNFVLIFADDLGYGDISCYDSSGVKTPHLDQLAAEGFRSTDFFVPANVCSPSRAAL 75
Query: 85 MTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
+TG+YP+ GM ++ + + + +G E +P+ L AGY + +GKWHLG E
Sbjct: 76 LTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGMELEGS 135
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD---MRHNMQVDNKTIGIYS-TDLY 197
P GFD + G + +EP +G + + QV+ K + T Y
Sbjct: 136 HPLDAGFDEYLG------------IPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKRY 183
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
T+ I+ I + K P F+Y++H VH N +P +PD F+ S E+ Y +
Sbjct: 184 TDEVIDFI-KRQKDDPFFVYVSHHIVH--NPLKP--SPD-----FVGTS--EKGKYGDFI 231
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
LD S G ++ +R G+ EN++V+F +DNG + GS+ L G K +GG
Sbjct: 232 KELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP------TRNGSSGELSGGKYCTMEGGH 285
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
R P +VS D LP C AG+ I +D +DG + VL
Sbjct: 286 RVPGMFRWPSKIAPNQVSDVTLTSMDLLPLFCELAGVPIPDDRRIDGKSILPVL 339
>gi|265752292|ref|ZP_06088085.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
gi|423229021|ref|ZP_17215426.1| hypothetical protein HMPREF1063_01246 [Bacteroides dorei
CL02T00C15]
gi|423244861|ref|ZP_17225935.1| hypothetical protein HMPREF1064_02141 [Bacteroides dorei
CL02T12C06]
gi|263237084|gb|EEZ22554.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
gi|392634774|gb|EIY28686.1| hypothetical protein HMPREF1063_01246 [Bacteroides dorei
CL02T00C15]
gi|392640902|gb|EIY34693.1| hypothetical protein HMPREF1064_02141 [Bacteroides dorei
CL02T12C06]
Length = 464
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 186/402 (46%), Gaps = 46/402 (11%)
Query: 12 TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
T +LF+ A L+ TTA K P++I I+ADDLG D+ +G QI TPNID +A
Sbjct: 4 TRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
NG+ QHY + + PSR AL+TGK+ H ++ + G LP E +
Sbjct: 64 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
K Y T +GKW +G P GFD YGY L + HS F +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
+ +D K YS DL E A+N I E N KP FLY + HA Y
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEVMTEY 232
Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
E PF + R YA MV+ LD+SVG +I L++ G+ +++I++
Sbjct: 233 EGEFCETPFGGSKDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 288
Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
F +DNG S G H S SN P RG K ++GG+R I P + V++ +
Sbjct: 289 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 348
Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
D+LPT+ +I ++DG++ LT K + ++
Sbjct: 349 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGIQKEHDCIY 389
>gi|395801059|ref|ZP_10480329.1| sulfatase [Flavobacterium sp. F52]
gi|395436775|gb|EJG02699.1| sulfatase [Flavobacterium sp. F52]
Length = 484
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/371 (34%), Positives = 177/371 (47%), Gaps = 34/371 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL-NQHYVQALCTPSRSALMTG 87
KP+IIII DD+G+ D +G + + TP+ + +A G+ L N + QA+CT SR+AL+TG
Sbjct: 27 KPNIIIINMDDMGYGDTEPYGMTGVNTPHFNQIAEEGMRLTNFNAGQAICTASRAALLTG 86
Query: 88 KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
YP IGM GV+L G L E+ + LK+ GY T GKWHLG Y PT
Sbjct: 87 CYPNRIGMS-GVLLPGAKHALNPNEETIASLLKKNGYKTANYGKWHLGNVLP-YWPTNYD 144
Query: 148 FDSHYG------YWQGLQDYYDH-----SCKATFEPYQGLDMRHNMQVDNKTIGIYS--- 193
FD +G W D Y +++F P L + N V + I
Sbjct: 145 FDEFFGIPYSHDVWPIDYDGYSKVTDSKDMRSSFPP---LPLISNTTVIDTIENIKDASK 201
Query: 194 -TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
T L+TE AI I + NK P FLYL H HA P D KF SD
Sbjct: 202 LTTLFTEKAIGFI-KKNKKNPFFLYLTHPLPHA-----PLAVSD----KFKGKSD--LGL 249
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
+ ++ LD S+G +I L K G+ +N++++ +DNG P N GS+ R KST
Sbjct: 250 FGDVIMELDWSIGEIINTLDKEGIAKNTVLIITSDNG-PWLVFGDNAGSSGGFREGKSTT 308
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
W+GG R I P + V+S L D LPT+ G + +DG+N VLT
Sbjct: 309 WEGGTRVPFLIRWPGKIKGGSVNSSLMTNMDLLPTIAKITGASLPKNQIDGLNFLPVLTG 368
Query: 373 GAKTKRSEILH 383
+ EI +
Sbjct: 369 KTASGPREIFY 379
>gi|345512484|ref|ZP_08792010.1| arylsulfatase [Bacteroides dorei 5_1_36/D4]
gi|345456366|gb|EEO45525.2| arylsulfatase [Bacteroides dorei 5_1_36/D4]
Length = 464
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 186/402 (46%), Gaps = 46/402 (11%)
Query: 12 TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
T +LF+ A L+ TTA K P++I I+ADDLG D+ +G QI TPNID +A
Sbjct: 4 TRKILFSAALLSSGFTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
NG+ QHY + + PSR AL+TGK+ H ++ + G LP E +
Sbjct: 64 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
K Y T +GKW +G P GFD YGY L + HS F +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
+ +D K YS DL E A+N I E N KP FLY + HA Y
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEVMTEY 232
Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
E PF + R YA MV+ LD+SVG +I L++ G+ +++I++
Sbjct: 233 EGEFCETPFGGSKDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 288
Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
F +DNG S G H S SN P RG K ++GG+R I P + V++ +
Sbjct: 289 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 348
Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
D+LPT+ +I ++DG++ LT K + ++
Sbjct: 349 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGIQKEHDCIY 389
>gi|340619477|ref|YP_004737930.1| sulfatase [Zobellia galactanivorans]
gi|339734274|emb|CAZ97651.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
Length = 488
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 129/378 (34%), Positives = 190/378 (50%), Gaps = 52/378 (13%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
L T KP+I+ ILADDLG+ D+S +G + TPNID LA +G++ QHY + +C PS
Sbjct: 27 LKTAIATKPNIVYILADDLGYGDLSSYGQQKFSTPNIDKLAKDGILFTQHYSGSTVCAPS 86
Query: 81 RSALMTGKYPIHIGMQHG--VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
RSAL+TG + H ++ ++ EG+ + LP L + +K+AGY T A GKW LGF
Sbjct: 87 RSALLTGMHTGHTVVRGNKEILPEGQ-YPLPDNTYTLAEAMKKAGYITGAFGKWGLGFPG 145
Query: 139 EVYTPTFRGFDSHYGY-WQGL-QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
P +GFD+ +GY Q L +YY A + L + N +T Y+ +L
Sbjct: 146 SEGDPLNQGFDAFFGYNCQRLGHNYYPRHLWANKD---SLVLEGNSGFKKET---YAPEL 199
Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE------- 249
+ + I E NK P FLY+A + HA T AP+E + K+ PE
Sbjct: 200 IHDKTLEFI-EMNKDNPFFLYVASIIPHAELT-----APEEIMEKYKGKFAPETVYEGVD 253
Query: 250 ----------------RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
+A MV LD VG ++A L K G+ +N++V+F +DNG +
Sbjct: 254 DGPEYKLGPYGSQKEPHAAFAAMVHLLDAQVGEIMATLEKLGIADNTLVVFTSDNGPHTE 313
Query: 294 GIHSNK--GSNHPLRGMKSTPWDGGMR-GVAAIWSPWLK---QTQKVSSELFHISDWLPT 347
G + S+ P RG K ++GG+R + A W +K +T+ VS+ D PT
Sbjct: 314 GGADPEYFNSSGPFRGTKRDLYEGGIRVPMVAAWPGKIKPGSRTELVSA----FWDVFPT 369
Query: 348 LCAAAGIEINDTSLDGVN 365
AGIE + LDG++
Sbjct: 370 FSDIAGIETPE-GLDGIS 386
>gi|296122626|ref|YP_003630404.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296014966|gb|ADG68205.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 470
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 136/406 (33%), Positives = 197/406 (48%), Gaps = 49/406 (12%)
Query: 8 FFALTCTLLFNDA-FLNTTAPK---KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
+FAL LLF A F A + KP++++I DDLG D+ GSS TP IDALA
Sbjct: 6 YFAL---LLFMGAPFFPVEAKEMADKPNVLLIFIDDLGKTDIGIEGSSFYETPRIDALAK 62
Query: 64 NGLILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
+G Q Y +C+P+R+ALMTGK P +G+ + E + LP +E + Q +EA
Sbjct: 63 SGARFTQFYSAHPVCSPTRAALMTGKMPQRLGITDWIRPESD-VALPQSEVTIGQAFQEA 121
Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPY---QGLDMR 179
GY T +GKWHLG + + P RGFD G G Q + + PY Q D
Sbjct: 122 GYHTAYLGKWHLGHKPQQH-PAARGFDWTKGVNHGGQP------SSYYFPYKNPQKPDAP 174
Query: 180 HNM-QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPFQ 233
+N+ + Y TD+ T +AI + + ++++P FL LAH AVH N E +Q
Sbjct: 175 NNVPDFEKCQPEDYLTDVLTSSAIEHLQQRDRTRPFFLCLAHYAVHTPIQPPKNLVEKYQ 234
Query: 234 ----------APDEEVAKFLDISDPER--RTYAGMVSRLDESVGNVIAALRKHGMLENSI 281
+P E + + IS ++ YA MV LD VG ++ L+ G+L+ +I
Sbjct: 235 VKLATQKNPKSPGEGIQEGSAISRSQQDHPAYAAMVENLDTQVGRLLDELKTQGILDQTI 294
Query: 282 VLFMADNGAPSF--GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELF 339
V+F +DNG G N PLR K ++GG+R I P K+S ++
Sbjct: 295 VVFTSDNGGLCTLNGKSPGPTCNLPLRAGKGWTYEGGIRIPTYISWPG-----KISPQVL 349
Query: 340 HI----SDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
I D PTL + I T +DG++ +LTK + SE
Sbjct: 350 DIPAYTCDIYPTLLSLCQIPPRPTQHVDGISLAGLLTKSSSLPESE 395
>gi|417411296|gb|JAA52089.1| Putative arylsulfatase a, partial [Desmodus rotundus]
Length = 508
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 131/445 (29%), Positives = 206/445 (46%), Gaps = 33/445 (7%)
Query: 30 PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMTGK 88
P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCTPSR+AL+TG+
Sbjct: 22 PNIVLIFADDLGYGDLGAYGHPSSTTPNLDQLATAGLRFTDFYVPVSLCTPSRAALLTGR 81
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTFRG 147
P+ G+ GV+ G GLPL E L + L GY T GKWHLG E + P +G
Sbjct: 82 LPVRSGLYPGVLEPGSQGGLPLEEVTLAEVLATQGYLTGMAGKWHLGVGPEGAFLPPRQG 141
Query: 148 FDSHYG--YWQGLQDYYDHSCKATFEPYQGL--------DMRHNMQVDNKTIGIYSTDL- 196
F G Y + +C P G+ + N+ V+ + + +
Sbjct: 142 FHRFLGIPYSHDQGPCQNLTCFPPATPCDGICDQGLVPVPLLANLSVEAQPPWLPGLEAR 201
Query: 197 YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
Y A +++A+ + +P FLY A H Y F R +
Sbjct: 202 YVAFARDLMADAQRQGRPFFLYYASHHTH----YPQFSGQSSA-------GRSGRGPFGD 250
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
+ LD +VG ++AAL+ G+L ++V+F ADNG P S+ G + LR K T ++G
Sbjct: 251 SLMELDAAVGALMAALQDLGLLRQTLVIFTADNG-PETMRMSHGGCSGLLRCGKGTTFEG 309
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK 375
G+R A + P T V+ EL D LPTL A AG + + +LDGV+ +L K
Sbjct: 310 GVREPAVAFWPG-HITPGVTHELASSLDLLPTLAALAGAPLPNVTLDGVDLSSLLLGTGK 368
Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLK---YVAGTDNNGQSDEWYGDTDNEIDKYSPK 432
+ R ++ D + + A+R K + G+ ++ + + N + + P
Sbjct: 369 SPRQTMVFYPAYPDEVRGVF-AVRKGKYKAHFFTQGSAHSDTTADPACHASNPLTAHEPP 427
Query: 433 EV--LYSKAGITFNALKTKLQIKQK 455
+ L G +N L++ ++ +
Sbjct: 428 LLFDLSEDPGENYNLLRSGAEVAPE 452
>gi|386381787|ref|ZP_10067486.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
gi|385670744|gb|EIF93788.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
Length = 426
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/425 (28%), Positives = 192/425 (45%), Gaps = 38/425 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
++P+++ IL DDLGW D+S +G+ +I TPN+D LA G+ Y A C+P+R +L T
Sbjct: 10 RRPNVLFILGDDLGWADLSSYGAPEIRTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYT 69
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+YP + GL T L L++AGY T IGKWH G+ + Y+PT
Sbjct: 70 GRYPGRTKGGLAEPIADRSVGLEPTHPTLASLLRDAGYRTALIGKWHCGYLPD-YSPTRS 128
Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
G++ +G + G +YY S ++ Y+G +++ Y T + TE A+ +
Sbjct: 129 GWEEFFGNFGGALEYYSKLSSGGEYDLYEGDAEYRDLR--------YYTRILTERAVEYV 180
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT-----------YA 254
+ P L L H + + E+ + + D RR Y
Sbjct: 181 GRDHGRTPWLLNLNFTTPHWPWIADGDEETSAEIVRRIKAGD--RRALWHQDGGSLAKYR 238
Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
+V LD SVG V+AAL++ G E+++V F +DNG F N PL G K + D
Sbjct: 239 QLVEDLDRSVGKVLAALKRSGQEEDTLVFFASDNGGERFSY------NWPLSGNKGSLLD 292
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKG 373
GG+R + + P + +VS + DW TL G + LDGV+ L G
Sbjct: 293 GGIRVPSILRWPARIEGGQVSGVPVYSPDWTATLLDVGGARPHPAYPLDGVSLKGYLLHG 352
Query: 374 AKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE 433
A+ ++ + + ALR + KY G D + D + DK S +
Sbjct: 353 AEVPARDLFWRV-------RGERALRRGEWKYYRGKDGRDRLYRLSEDVREQADKASVRP 405
Query: 434 VLYSK 438
L ++
Sbjct: 406 ELLTE 410
>gi|171912351|ref|ZP_02927821.1| aryl-sulphate sulphohydrolase [Verrucomicrobium spinosum DSM 4136]
Length = 491
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 126/386 (32%), Positives = 182/386 (47%), Gaps = 55/386 (14%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPS 80
L+ P +P+I+ ILADDLG+ DV+ GS TPNID +A G+ Q Y C P+
Sbjct: 23 LHAATPARPNIVFILADDLGYTDVATLGSKYYETPNIDQMAAQGVKFTQGYSCGPNCQPT 82
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPW------------GLPLTEKLLPQYLKEAGYATHA 128
R+ALM+G+Y G+ ++ PW LPL + L Q LK AGYAT
Sbjct: 83 RAALMSGQYGPRTGVYTVGGIDRFPWQTRPLRPVDNVVNLPLEKVTLAQTLKNAGYATAM 142
Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
GKWHLG E Y P+ RGFD G +D + K
Sbjct: 143 FGKWHLGN-GEAYHPSKRGFDEAI-VSNGRHINFDTNPKVAVPE---------------- 184
Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD---I 245
Y D T+ A++ I H +S P FLYL H VH+ P++A + +A+F D +
Sbjct: 185 -DAYLADFLTDRAVDFIQRHKES-PFFLYLPHYGVHS-----PYEAKADLIARFKDKPAV 237
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG----- 300
YA M++ +DESVG ++ L + +N++V+F +DNG G + +G
Sbjct: 238 GGHHSPVYAAMLASVDESVGRILKLLDDLKLSQNTLVVFSSDNGG--VGGYEREGIDAKG 295
Query: 301 --SNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-- 355
N PLRG K ++GG+R W +K Q V++ + D PTL AG +
Sbjct: 296 ITDNAPLRGGKGMLYEGGVRVPYVFQWKGHIKAGQ-VNTTPINSVDLYPTLVELAGAQAP 354
Query: 356 -INDTSLDGVNQWDVLTKGAKTKRSE 380
+ + LDGV+ + L + T R
Sbjct: 355 AVAEQPLDGVSYVNELIEPGATPRGR 380
>gi|167524284|ref|XP_001746478.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775240|gb|EDQ88865.1| predicted protein [Monosiga brevicollis MX1]
Length = 520
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/400 (28%), Positives = 184/400 (46%), Gaps = 76/400 (19%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQ----IPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
+PHI +++ DD GW +V +H S + + TP IDAL G+ LN+HYV CTP+
Sbjct: 30 RPHIFMVIVDDFGWAEVGYHRSEEYKQDVQTPTIDALVQEGIELNRHYVHYACTPN---- 85
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
P G G+P + ++L+ AGYATH +GKW +G +TP
Sbjct: 86 -----PCQPGC-----------GMPQNMTGVAEHLQSAGYATHFVGKWDVGMTTPQHTPK 129
Query: 145 FRGFDSHYGY------------WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
RG+D+ Y WQG Y+H ++ P + + +K
Sbjct: 130 GRGYDTSLNYFGHGNWMWSEAEWQG---SYNH--RSDLPPCEPPACFKDFWETDKPAAEL 184
Query: 193 STDLYTEAAI-----NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
+ Y EA +++ +H+ S P+FL A H P QAP E KF I +
Sbjct: 185 NGTQYEEAIFRDRVYSILEQHDPSTPLFLTYASKVAHY-----PLQAPIEYQEKFSFIEE 239
Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA------------PSFG- 294
P RR Y MV+ LD+++ N + L+ GM N++++ +DNG S G
Sbjct: 240 PHRRVYHAMVAFLDDNLANFTSLLKSKGMWNNTLMVLTSDNGGYVKSFLGDCDNNTSHGY 299
Query: 295 --IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCA 350
+ G+N+PL+G K + +GG+R A + ++ + ++ + E+ H++DW T A
Sbjct: 300 ACFNGEAGANYPLQGGKYSTLEGGVRVNAFVSGGFVPEERRGTREDEMVHVADWYATFAA 359
Query: 351 AAGIEIND--------TSLDGVNQWDVLTKGAKTKRSEIL 382
AG++ D +D +N W +L+ + + L
Sbjct: 360 LAGVDPTDHRAAQHNLPPIDSLNIWPLLSGANVSSPRQTL 399
>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
Length = 470
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 123/390 (31%), Positives = 185/390 (47%), Gaps = 40/390 (10%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
KKP+II ILADDLG+ D+S +G I TP ID + +G+ L Q Y + +C+ +R+A+MT
Sbjct: 34 KKPNIIFILADDLGYADLSSYGHPTIRTPAIDKIGNDGVRLLQAYSNSPVCSATRTAIMT 93
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y + + L G GLP ++ LP LK+AGY T IGKWHLG + + Y P
Sbjct: 94 GQYQYRLALGLEEPLAGRDIGLPPSQTTLPSLLKQAGYETTLIGKWHLGAYPK-YGPLKS 152
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
G+D YG+ YY+H G D + K Y TDL + + +I
Sbjct: 153 GYDHFYGFRGSALSYYNH----------GKDFWEDDAPVEKA--GYFTDLLGDKTVELIQ 200
Query: 207 EHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK--------FLDISDPERRTYAGMV 257
+ + +P F +VH + P++AP E+ K + Y +V
Sbjct: 201 KSDSCERPFF-----ASVHFNAPHWPWEAPGEQGKKEAARKKNSLAAFDAGTQDVYRQIV 255
Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
R+D VG ++ AL + EN+IV+F +DNG F + P G KS +GG+
Sbjct: 256 ERMDFQVGRILQALTDKKIRENTIVVFTSDNGGERF------SNTWPFSGKKSELLEGGI 309
Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKT 376
R + I P + + S ++ DWLPTL AA ++ DG++ +L
Sbjct: 310 RIPSLICWPAKIPSGQTSEQVTLSMDWLPTLLEAAETNVDQNHQSDGLSILPMLKGDEGV 369
Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
K + N Q+ A R+ D KY+
Sbjct: 370 KSRTVFWRYKG--NAQR---AARMGDYKYL 394
>gi|423330334|ref|ZP_17308118.1| hypothetical protein HMPREF1075_00131 [Parabacteroides distasonis
CL03T12C09]
gi|409231950|gb|EKN24798.1| hypothetical protein HMPREF1075_00131 [Parabacteroides distasonis
CL03T12C09]
Length = 468
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 185/394 (46%), Gaps = 43/394 (10%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
K+ L L F+ A + T A ++P++II+ DD G+ D+ +GS++ TP+ID +A G
Sbjct: 2 KFISKLISVLAFSGAAVATQAAERPNVIIVFIDDFGYGDLGCYGSTKHRTPHIDQMAKEG 61
Query: 66 LILNQHYV-QALCTPSRSALMTGKYPIHIGMQ------------HGVILEGEPWGLPLTE 112
+ L YV ++ TPSRSAL+TG YP + M V+ GL E
Sbjct: 62 IRLTDFYVGSSVSTPSRSALLTGCYPRRVSMHVNADPTPLMSKGRQVLFPASHKGLNPGE 121
Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
+ + +KE GYAT IGKWHLG + + PT +GFD +YG Y + P
Sbjct: 122 ITIAELMKEQGYATACIGKWHLG-DQLPFLPTRQGFDYYYGI------PYSNDMD---RP 171
Query: 173 YQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
Y L + +V +G S T YT + I H +S P F+YL H H P
Sbjct: 172 YCPLPLMEQEEVIVAPVGHDSLTIRYTNKTVEFIKSHKES-PFFIYLCHNMTH-----NP 225
Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
A K + Y LD S+G ++ L++ G+ +N++V+F +DNGA
Sbjct: 226 LAASPAFKGK------SQNGLYGDATEELDWSMGVLLETLKEEGLDQNTLVIFTSDNGAD 279
Query: 292 S-FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
FG G+N PLRG K T ++GG R + P + + L D+LPTL
Sbjct: 280 EHFG-----GTNRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAH 334
Query: 351 AAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
G + +D +DG N +L + +E +
Sbjct: 335 YCGYAVPSDRVIDGHNVSGILEGESMASPTETFY 368
>gi|254482912|ref|ZP_05096148.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
gi|214036784|gb|EEB77455.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
Length = 470
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 130/399 (32%), Positives = 197/399 (49%), Gaps = 40/399 (10%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQI---PTPNIDALAYNGL 66
A TL+F+ F T A KP+I++++ D+ G+ +V +G + PTPNID++A G
Sbjct: 12 AYCSTLVFSVLFSVTAAASKPNIVMVVMDNFGYGEVGVYGGGMLRGAPTPNIDSIATEGF 71
Query: 67 ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
L V+A CTPSRS+LMTG+Y I + G +G+ E L + L +AGYAT
Sbjct: 72 QLTNFNVEAECTPSRSSLMTGRYGIRTRQRPNDEPRGIWYGITPWEITLAEMLSDAGYAT 131
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSC---KATFEPYQGLDMRHNMQ 183
GKWHLG Y PT +GFD W G+ + D + +F+ G++ H M+
Sbjct: 132 GMFGKWHLGDEEGRY-PTDQGFDE----WYGIPNSSDQAFWPDSDSFQKDAGVEFTHVME 186
Query: 184 VDN----KTIGIYSTDL-------YTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEP 231
K +Y + T+ AI+ I K+ KP F YL + H EP
Sbjct: 187 SKRGQKPKKKDVYGREKRKTIDREITDRAIDFIKRKAKAGKPFFAYLPYTQTH-----EP 241
Query: 232 FQA-PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
A PD + + ++A ++++ D VG ++ + G +N+I +F +DNG
Sbjct: 242 VDAHPDFKGST-------GNGSFADVLAQTDSYVGELLKTIDNLGFKDNTIFIFTSDNGR 294
Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
GI + G P RG P++G +R I P +KVS+++ H+ D PT+
Sbjct: 295 E--GIKRSFGFTGPWRGTMFAPYEGSLRVPFLIRYPDKIPAKKVSNDIVHLIDIFPTIAK 352
Query: 351 AAGIEI-NDTSLDGVNQWDVLT-KGAKTKRSEILHNIDN 387
+G EI D LDGV+Q D LT K K+ R ++ I N
Sbjct: 353 LSGGEIPQDRILDGVDQTDFLTGKSEKSARESVIIYIGN 391
>gi|432091635|gb|ELK24657.1| Arylsulfatase A [Myotis davidii]
Length = 557
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 124/380 (32%), Positives = 184/380 (48%), Gaps = 31/380 (8%)
Query: 18 NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-AL 76
ND L P P+I++I ADDLG+ D+ +G TPN+D LA GL YV +L
Sbjct: 61 NDNGLAAAYP--PNIVLIFADDLGYGDLGSYGHPTSTTPNLDQLATAGLRFTDFYVPVSL 118
Query: 77 CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
CTPSR+AL+TG+ P+ G+ GV+ G GLPL E L + L + GY T GKWHLG
Sbjct: 119 CTPSRAALLTGRLPVRSGLYPGVLEPGSRGGLPLEEVTLAEILADQGYLTGMAGKWHLGV 178
Query: 137 FRE-VYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGL--------DMRHNMQVD 185
E + P +GF G Y + +C P G+ + N+ V+
Sbjct: 179 GPEGTFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCHGICDQGLVPVPLLANLSVE 238
Query: 186 NKTIGIYSTDL-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
+ + + Y A +++ + H + +P FLY A H Y F F
Sbjct: 239 AQPPWLPGLEARYVAFAHDLMVDAHRQGRPFFLYYASHHTH----YPQFSG-----QNFA 289
Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
S R + + LD +VG ++ A+ G+L++++V+F ADNG + + S G +
Sbjct: 290 GHSG--RGPFGDSLMELDSAVGALMTAVGDLGLLKDTLVIFTADNGPETMRM-SRGGCSG 346
Query: 304 PLRGMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD 362
PLR K T ++GG+R A A W + T V+ EL D LPTL A AG + + +LD
Sbjct: 347 PLRCGKGTTFEGGVREPALAFWPGHI--TPGVTHELASSLDLLPTLAALAGAPLPNVTLD 404
Query: 363 GVNQWDVLTKGAKTKRSEIL 382
G + +L K+ R +
Sbjct: 405 GFDLSPLLLGKGKSPRQTMF 424
>gi|67972647|ref|NP_033843.2| arylsulfatase A precursor [Mus musculus]
gi|341940244|sp|P50428.2|ARSA_MOUSE RecName: Full=Arylsulfatase A; Short=ASA; AltName:
Full=Cerebroside-sulfatase; Flags: Precursor
gi|12835776|dbj|BAB23356.1| unnamed protein product [Mus musculus]
gi|15030075|gb|AAH11284.1| Arylsulfatase A [Mus musculus]
gi|67867472|gb|AAH98075.1| Arylsulfatase A [Mus musculus]
gi|74208915|dbj|BAE21207.1| unnamed protein product [Mus musculus]
gi|148672389|gb|EDL04336.1| arylsulfatase A, isoform CRA_b [Mus musculus]
Length = 506
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 132/390 (33%), Positives = 189/390 (48%), Gaps = 33/390 (8%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
AL L A L+T +P P+I++I ADDLG+ D+ +G TPN+D LA GL
Sbjct: 1 MALGTLFLALAAGLSTASP--PNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRF 58
Query: 69 NQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
YV +LCTPSR+AL+TG+ P+ GM GV+ GLPL E L + L GY T
Sbjct: 59 TDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTG 118
Query: 128 AIGKWHLGFFRE-VYTPTFRGFD-------SH-YGYWQGLQDY-YDHSCKATFEPYQGL- 176
GKWHLG E + P +GF SH G Q L + D CK + QGL
Sbjct: 119 MAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCD--QGLV 176
Query: 177 --DMRHNMQVDNKTIGIYSTDL-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPF 232
+ N+ V+ + + + Y + +++A+ + +P FLY A H Y F
Sbjct: 177 PIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTH----YPQF 232
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
F S R + + LD +VG ++ + G+LE ++V+F ADNG P
Sbjct: 233 SG-----QSFTKRSG--RGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNG-PE 284
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
SN G + LR K T ++GG+R A ++ P T V+ EL D LPTL A
Sbjct: 285 LMRMSNGGCSGLLRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALT 343
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
G + + +LDGV+ +L K+ R +
Sbjct: 344 GAPLPNVTLDGVDISPLLLGTGKSPRKSVF 373
>gi|332265013|ref|XP_003281524.1| PREDICTED: arylsulfatase A isoform 1 [Nomascus leucogenys]
Length = 509
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 191/404 (47%), Gaps = 30/404 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCTPSR+AL+T
Sbjct: 21 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDRLAAGGLRFTDFYVPVSLCTPSRAALLT 80
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ +GM GV++ GLPL E + + L GY T GKWHLG E + P
Sbjct: 81 GRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPH 140
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
+GF G Y + +C P G + + N+ V+ + + +
Sbjct: 141 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLE 200
Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
Y A +++A+ + +P FLY A H Y F F + S R +
Sbjct: 201 ARYVAFARDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 249
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
+ LD +VG ++ A+R G+LE ++V+F ADNG + + S G + LR K T +
Sbjct: 250 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 308
Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
+GG+R A A W + V+ EL D LPT+ A AG + + +LDG + +L
Sbjct: 309 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTVAALAGAPLPNVTLDGFDLSPLLLG 366
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
K+ R + D + + A+R K T + SD
Sbjct: 367 TGKSPRQSLFFYPSYPDEVRGVF-AVRSGKYKAHFFTQGSAHSD 409
>gi|262383797|ref|ZP_06076933.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294695|gb|EEY82627.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 468
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 124/394 (31%), Positives = 185/394 (46%), Gaps = 43/394 (10%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
K+ L L F+ A + T A ++P++II+ DD G+ D+ +GS++ TP+ID +A G
Sbjct: 2 KFISKLISVLAFSGAAVATQAAERPNVIIVFIDDFGYGDLGCYGSTKHRTPHIDQMAKEG 61
Query: 66 LILNQHYV-QALCTPSRSALMTGKYPIHIGMQ------------HGVILEGEPWGLPLTE 112
+ L YV ++ TPSRSAL+TG YP + M V+ GL E
Sbjct: 62 IRLTDFYVGSSVSTPSRSALLTGCYPRRVSMHVNADPTPLMSKGRQVLFPASHKGLNPGE 121
Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
+ + +KE GYAT IGKWHLG + + PT +GFD +YG Y + P
Sbjct: 122 ITIAELMKEQGYATACIGKWHLG-DQLPFLPTRQGFDYYYGI------PYSNDMD---RP 171
Query: 173 YQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
Y L + +V +G S T YT + I H +S P F+YL H H P
Sbjct: 172 YCPLPLMEQEEVIVAPVGHDSLTIRYTNKTVEFIKSHKES-PFFIYLCHNMTH-----NP 225
Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
A K + Y LD S+G ++ L++ G+ +N++V+F +DNGA
Sbjct: 226 LAASPAFKGK------SQNGLYGDATEELDWSMGVLLETLKEEGLDQNTLVIFTSDNGAD 279
Query: 292 S-FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
FG G+N PLRG K T ++GG R + P + + L D+LPTL
Sbjct: 280 EHFG-----GTNRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAH 334
Query: 351 AAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
G + +D +DG N +L + +E +
Sbjct: 335 YCGYAVPSDRVIDGHNVSGILEGESMASPTETFY 368
>gi|340377676|ref|XP_003387355.1| PREDICTED: arylsulfatase A-like [Amphimedon queenslandica]
Length = 501
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 129/429 (30%), Positives = 189/429 (44%), Gaps = 60/429 (13%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-V 73
+F +N A KP+III+ ADDLG+ D+ +G TPN++++A GL+L Q Y
Sbjct: 10 CIFVSLRVNGAAVNKPNIIILFADDLGYGDLQIYGHPTSSTPNLNSMASEGLVLTQFYSA 69
Query: 74 QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
+C+PSR+ALMTG+YP G+ GV+ GLP E + + LK AGY T +GKWH
Sbjct: 70 NPVCSPSRAALMTGRYPPRTGIWPGVLFPNSIGGLPHNETTIAEALKTAGYKTGMVGKWH 129
Query: 134 LGFFRE-VYTPTFRGFDSHYGYWQGLQDYYDHS-CKATFEP---------------YQGL 176
LG Y PT GFD Y+ G+ D C + P Y G
Sbjct: 130 LGVGENGTYLPTNHGFD----YYLGIPYSNDMGPCTICYYPNEACISPLKNPCNGGYSGC 185
Query: 177 DMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
+ N + + + + + Y+ AA I + P FLY+A +T++P Q
Sbjct: 186 PVFENADIIEQPVNFLTLAERYSNAATGFIKANAGKNPFFLYMAF-----SHTHKP-QFS 239
Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
EE T G+ + D V +N+ V F +DNG P+
Sbjct: 240 GEEFTN---------STLRGLSTLKDTQVD------------DNTFVFFTSDNG-PAQHT 277
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
G+ LR K + WDGGMR W P K Q ++EL D PT+ + G +
Sbjct: 278 PPKGGNGGLLRCGKGSTWDGGMREPGIAWWPG-KIRQGRTAELAATVDLFPTIMSITGAK 336
Query: 356 INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLK---YVAGTDNN 412
+DG++ +L + R ++ + NP Y A+R K Y +G
Sbjct: 337 SGPAFIDGIDMSPILFDSKPSNRESYVY-FPSSYNPSTGYHAIRWKQYKAHYYTSG---- 391
Query: 413 GQSDEWYGD 421
G D+ Y D
Sbjct: 392 GVCDKIYPD 400
>gi|332666885|ref|YP_004449673.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
gi|332335699|gb|AEE52800.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
Length = 443
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 140/465 (30%), Positives = 215/465 (46%), Gaps = 53/465 (11%)
Query: 13 CTLLFNDAFL--NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILN 69
C LF F+ +T + P+II I+ DDLG+ D+S +G TPNID LA G+ +N
Sbjct: 12 CLPLFVLTFMVYSTVHAQHPNIIYIMTDDLGYGDLSGYGRKDFLTPNIDKLAAQGIKFVN 71
Query: 70 QHYVQALCTPSRSALMTGKYPIH--IGMQHGVILEGEPWGLPLTEKL--LPQYLKEAGYA 125
+ LCTP+R+A MTG+YP +G+ + + LT + ++ +GY
Sbjct: 72 AYSAAPLCTPTRTAFMTGRYPARTPVGLMEPLTPSKRDSTVGLTAAFPSVATLMRASGYE 131
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
T IGKWHLGF + +P GFD +G G DY H H++ +
Sbjct: 132 TALIGKWHLGFLPQ-NSPVKNGFDYFFGIHSGAADYISHKTGPAGRRI------HDLYEN 184
Query: 186 NKTI--GIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
++ + Y TDL+T+ A+ + + HN KP FL L + A H P+Q P+++ +
Sbjct: 185 DQAVYPEGYLTDLFTQKAVTFLKQKHN--KPFFLTLTYNAAHW-----PWQGPNDK--PY 235
Query: 243 LDISD----PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
+D D YA M++RLDE +G +I L + + +IV+F DNG +
Sbjct: 236 VDSVDLRIGGSAAVYAAMMNRLDEGIGEIIKTLDEEQLSNQTIVIFTNDNGGEKY----- 290
Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS-DWLPTLCAAAGIEIN 357
N L K W+GG+R A + P K T S++ I+ DW T+ +A G +++
Sbjct: 291 -SDNGGLAKAKGALWEGGIRVPAIVKWPG-KITAGSSTQQVAITMDWTKTILSAGGAKMD 348
Query: 358 -DTSLDGVNQWDVLTKGAK-TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQS 415
+ LDG++ +LT K R+ + A+R+ D KY+ D G+
Sbjct: 349 KNFPLDGIDLLPILTGDKKIIPRTIYWRTFQRLKQ-----KAIRMGDWKYL--QDEKGEY 401
Query: 416 DEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADP 460
N I K L +K I F LK K +K P
Sbjct: 402 ------LFNLIADQEEKNDLKAKEEIIFKRLKKKYADWEKTVLQP 440
>gi|340618178|ref|YP_004736631.1| sulfatase [Zobellia galactanivorans]
gi|339732975|emb|CAZ96350.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
Length = 494
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 134/415 (32%), Positives = 201/415 (48%), Gaps = 50/415 (12%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A +KP+II ILADDLG+ D+S +G + TP ID LA G+ QHY A +C P+RS+L
Sbjct: 42 AQQKPNIIYILADDLGYGDLSSYGQEKFETPRIDELAQRGMKFTQHYSGAPVCAPARSSL 101
Query: 85 MTGKYPIHIGMQHGVILEGEPWG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
MTG++ H ++ + LEGE LP + + LK+AGY T A GKW LGF P
Sbjct: 102 MTGQHTGHTPIRGNMELEGEGQTPLPGNSVTIAEMLKKAGYITGAFGKWGLGFIGTEGDP 161
Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV------DNKTIGIYSTDLY 197
+GFD YGY +C+ Y + HN + D Y+ D
Sbjct: 162 VAQGFDEFYGY----------NCQRMSHRYYPTHLWHNEEKVLLEGNDWTKTETYAPDKI 211
Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHA------GNTYEPFQAPDEE---VAKFLDISD- 247
A +N I E NK KP F Y+ + HA + ++ F+ EE A+ L SD
Sbjct: 212 QAATLNFI-ETNKEKPFFAYVPFVLPHAELISPNDSIFDKFKDKYEETPYTAENLYTSDY 270
Query: 248 ----------PERR---TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
P+ + +A MV R+D VG ++ L + G+ +N+IV+F +DNG + G
Sbjct: 271 GPDIVPQEYAPQLKPHAAFASMVYRMDVYVGQILDKLDELGIADNTIVMFTSDNGPHAEG 330
Query: 295 IHSNK--GSNHPLRGMKSTPWDGGMRG-VAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
+ S+ L+G+K ++GG+R A+W +K S + D +PTL
Sbjct: 331 GADPEFFNSSGGLKGIKRDLYEGGIRSPFLAVWPNKIK-AGSTSDHVSAFWDLMPTLAEI 389
Query: 352 AGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
++ +S DG++ L K+ E L+ ++ N +K A+R D K V
Sbjct: 390 TNTDVPASS-DGISFLPTLLGKKDQKQHEYLYWEFSIQNGRK---AVRKGDWKGV 440
>gi|403260898|ref|XP_003922887.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Saimiri boliviensis
boliviensis]
Length = 482
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 168/362 (46%), Gaps = 25/362 (6%)
Query: 40 LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM--- 95
+GW D+ +G TPN+D +A G + Y LC+PSR+AL+TG+ PI G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60
Query: 96 ----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
++ + G+P +E+LLP+ LKEAGY T +GKWHLG R + P GFD
Sbjct: 61 NAHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLGH-RPQFHPLKHGFDEW 119
Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
+G YD+ + Y+ +M ++ KT T +Y + A++ I
Sbjct: 120 FGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKR 179
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+ +P FLY A A HA P FL S R Y V +D+S+G +
Sbjct: 180 QARRRPFFLYWAVDATHA---------PVYASKPFLGTS--RRGRYGDAVREMDDSIGKM 228
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ L+ + +++ V F +DNGA GSN P K T ++GGMR A W P
Sbjct: 229 LQLLQDLRVADSTFVFFTSDNGAALISAPEQGGSNSPFLCGKQTTFEGGMREPALAWWPG 288
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
+VS +L I D T A AG+ +D ++DG+N L +G R + D
Sbjct: 289 HITPGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPALLQGRVMDRPVFYYRGD 348
Query: 387 NV 388
+
Sbjct: 349 TL 350
>gi|372210892|ref|ZP_09498694.1| N-acetylgalactosamine-6-sulfatase [Flavobacteriaceae bacterium S85]
Length = 463
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 117/346 (33%), Positives = 172/346 (49%), Gaps = 35/346 (10%)
Query: 6 KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
K + + F LN TA +KP++I ILADDLG+ D+S +G + TPNID LA G
Sbjct: 5 KTVLRIVSLVAFLTLGLNVTAQEKPNVIYILADDLGYGDLSCYGQKILSTPNIDRLAKKG 64
Query: 66 LILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
+ HY A+C+PSR+ LMTG+ P H+ + + EP G+ +LP+ K AGY
Sbjct: 65 MKFTDHYSGNAVCSPSRAVLMTGQNPAHVYITGNS--KAEP-GMDSEMTVLPELFKSAGY 121
Query: 125 ATHAIGKWHLG--FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
T A GKW LG ++ P +GFD YG+ + T Y M HN
Sbjct: 122 VTGAFGKWGLGDTTAKDNRNPLHQGFDEFYGW----------KTQVTAHTYYPTKMVHNG 171
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLY----LAHLAVHA---GNTYEPFQA 234
Q G Y D+ E A + I E++K+K P F Y + H A+HA + +
Sbjct: 172 QEVPIKKGTYVHDVIMEHAFDFIKENSKNKQPFFCYIPTAIPHAAMHAPKKLHKKWRKKL 231
Query: 235 PDEEVAKFLDISDPERR--------TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMA 286
P + +DPE + MV LD VG ++ L+K+ + +N++++F +
Sbjct: 232 PQFDTIIGTYYADPEPTPDVINPIAGFGAMVEHLDNQVGQILKMLKKYKVDKNTLIIFTS 291
Query: 287 DNGAPSFGIHSNK--GSNHPLRGMKSTPWDGGMRG-VAAIWSPWLK 329
DNGA G H+ + SN PLRG K ++GG+R + A W +K
Sbjct: 292 DNGAHHEGGHNPEFWNSNGPLRGGKRDVYEGGIRAPMLAHWPTKIK 337
>gi|432342876|ref|ZP_19592103.1| arylsulfatase [Rhodococcus wratislaviensis IFP 2016]
gi|430772100|gb|ELB87901.1| arylsulfatase [Rhodococcus wratislaviensis IFP 2016]
Length = 459
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 117/375 (31%), Positives = 177/375 (47%), Gaps = 37/375 (9%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+++ IL D++GW D +G + PTP ID LA G+ N + V+A CTP+RSA+MTG+
Sbjct: 6 KPNVVFILVDNVGWGDFGVYGGTT-PTPRIDKLASEGIRFNDYNVEAQCTPTRSAIMTGR 64
Query: 89 YPIHIGMQHGVILEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
+P+ G + + GE G+ E + L EAGYAT GKWHLG E PT +G
Sbjct: 65 HPVRSG-TYKLSWPGEGMSGMAPWEYTIADLLSEAGYATALYGKWHLG-NTEGRLPTDQG 122
Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDM--------------RHNMQVDNKTIGIYS 193
FD W G++D +D + + YQ L + M +D I
Sbjct: 123 FDE----WWGIKDSWDEAGYTAWPLYQELGLPAPMIWEGRKGEQSTPAMPLDLDVRPIVD 178
Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
+ I ++ + KP F+YL + +H P + + S
Sbjct: 179 EKYLVPKTVEFIKKNAAAKKPFFVYLGYSQMHPPMIANP---------NYANTSTARSGV 229
Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG-MKST 311
YA ++ +D+ VG ++ A++ G+ +N+I++ +DN H GSN P RG +T
Sbjct: 230 YADCLAEMDDRVGQIVDAVKAAGIEDNTIIVLSSDNAGGGALPHVGPGSNGPWRGNFFNT 289
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG---IEINDTSLDGVNQWD 368
P++G MR A I P V+++ DWLPTL G + D +DGV+
Sbjct: 290 PFEGSMRVSAMIRWPGAIPAGVVTAQTLAAVDWLPTLAGMVGASELVPRDRPIDGVDASA 349
Query: 369 -VLTKGAKTKRSEIL 382
+L K KT R L
Sbjct: 350 FMLGKSDKTGRDSYL 364
>gi|348551622|ref|XP_003461629.1| PREDICTED: arylsulfatase A [Cavia porcellus]
Length = 507
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/389 (32%), Positives = 185/389 (47%), Gaps = 53/389 (13%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCT 78
A L TT+P P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCT
Sbjct: 13 AGLATTSP--PNIVLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCT 70
Query: 79 PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
PSR+AL+TG+ P+ +GM GV+ GLPL E L + L GY T GKWHLG
Sbjct: 71 PSRAALLTGRLPVRMGMYPGVLGPKSKGGLPLEEVTLAEVLASHGYLTGMAGKWHLGV-- 128
Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDH---SCK--ATFEPYQGLD-----------MRHNM 182
P H+G+ + L Y H C+ F P D + N+
Sbjct: 129 ---GPGGAFLPPHHGFHRFLGIPYSHDQGPCQNLTCFPPATSCDGTCDQDEVPIPLLANL 185
Query: 183 QVDNKTIGI---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQ 233
++ + + ++ DL T+A + +P FLY A H Y F
Sbjct: 186 TIEAQPPWLPGLEARYVAFARDLMTDA-------QRQGRPFFLYYASHHTH----YPQFS 234
Query: 234 APDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
F+ S R + + LD +VG ++ ALR G+L +++V+F +DNG +
Sbjct: 235 G-----QSFVGRSG--RGPFGDSLMELDAAVGALMLALRDLGLLRDTLVIFTSDNGPETM 287
Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
+ S G++ LR K T ++GG+R A + P T V+ EL D LPTL A +G
Sbjct: 288 RM-SRGGNSGLLRCGKGTTFEGGVREPAVAFWPG-HITPGVTHELASSLDLLPTLAALSG 345
Query: 354 IEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
+ + +LDG++ VL K+ R +
Sbjct: 346 ASLPNVTLDGMDISSVLMSSGKSPRESLF 374
>gi|340619560|ref|YP_004738013.1| sulfatase [Zobellia galactanivorans]
gi|339734357|emb|CAZ97734.1| Sulfatase, family S1-14 [Zobellia galactanivorans]
Length = 483
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/405 (31%), Positives = 189/405 (46%), Gaps = 36/405 (8%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ L L FN + K+P+++++ DD G+ DV +G++ TP+ID LA G
Sbjct: 6 FALVLVTLLSFNFSIAQNQDSKRPNVVLVFIDDEGYGDVGVYGATGFETPHIDQLAAKGT 65
Query: 67 ILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
+Y Q +C+ SR+ LMTG YP IG G + GL E + + KE Y
Sbjct: 66 RFTNYYSAQPVCSASRAGLMTGCYPNRIGFS-GALFPRHEIGLNKDELTMAEMFKEQDYK 124
Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-- 183
T GKWH+G+ +E + P GFD G D + H AT E R
Sbjct: 125 TACFGKWHMGWQKE-FLPLQHGFDEFVGLPYS-NDMWPHDS-ATGEHLSAEKNRAKFPDL 181
Query: 184 ---VDNKTIGIYS--------TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
NKTI T LYTE A++ I N ++P FLY+ H H P
Sbjct: 182 PIIEGNKTIAYVKSLKDQDKLTTLYTEKAVDFI-NRNANEPFFLYVPHTMGHI-----PL 235
Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
D+ K E+ Y ++ +D SVG + AL+++G+ +N+I +F DNG P
Sbjct: 236 GVSDKFRGK------SEQGLYGDVMMEIDWSVGEIDKALKQNGIADNTIFIFTTDNG-PW 288
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
++ GS+ LR K+T W+GG R I P VS +L D LPT A
Sbjct: 289 LNFGNHAGSSGGLREGKTTSWEGGQRVPFIIRWPGKTPEGTVSDKLACAVDLLPTFAAIT 348
Query: 353 GIEINDTSLDGVNQWDVLTKG--AKTKRSEIL--HNIDNVDNPQK 393
+++D +DGV+ L KG ++T R+ IL H +N++ +K
Sbjct: 349 NGKLSDNKIDGVD-ISPLWKGQFSETPRNTILYYHGKNNLNGVRK 392
>gi|363727329|ref|XP_424471.3| PREDICTED: arylsulfatase A [Gallus gallus]
Length = 506
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 132/411 (32%), Positives = 198/411 (48%), Gaps = 32/411 (7%)
Query: 22 LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPS 80
L A P +++LADDLG+ D+ +G TPN+D +A GL Y A+C+PS
Sbjct: 18 LRGAAGGPPSFVLLLADDLGFGDLGSYGHPSSATPNLDRMAARGLRFTDFYSSSAVCSPS 77
Query: 81 RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF-FRE 139
R+AL+TG++ + G+ GV G GLPL+E + + LK GYAT +GKWHLG R
Sbjct: 78 RAALLTGRFQMRSGIYPGVFYPGSRGGLPLSEVTIAEVLKAKGYATAIVGKWHLGLGARG 137
Query: 140 VYTPTFRGFD-------SH-YGYWQGLQDY-YDHSCKATFEPYQGL---DMRHNMQVDNK 187
+ P +GFD SH G Q L + D C T + QGL + N + +
Sbjct: 138 SFLPIHQGFDHFLGVPYSHDQGPCQNLTCFPPDIKCFGTCD--QGLVPVPLFWNQSIVQQ 195
Query: 188 TIGIYS-TDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
+ LY + A + IA+ + P LY A + +T+ P A E +
Sbjct: 196 PVSFPDLVPLYNKFARDFIADCARRGVPFLLYYA-----SHHTHYPQFASQEYAGR---- 246
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
+R + +S D SVG ++ AL+++G+ + V F +DNG PS + GS+ L
Sbjct: 247 --SQRGPFGDALSEFDGSVGQLLQALQENGLENTTFVFFTSDNG-PSTMRMARGGSSGLL 303
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
+ K T ++GGMR A + P + V+ EL D LPTL A AG + + SLDG +
Sbjct: 304 KCGKGTTYEGGMREPAVAYWPG-RIAPGVTHELASTLDILPTLTALAGAALPNVSLDGYD 362
Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
+L + K+ R + + D P A+R+ K T + SD
Sbjct: 363 LSPLLFESGKSPRQTMFFYPPSPD-PLHGPFAVRLGKYKAHYFTQGSFHSD 412
>gi|372210887|ref|ZP_09498689.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 476
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 128/426 (30%), Positives = 198/426 (46%), Gaps = 60/426 (14%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
K+P+II ILADD+G D+ +G +I TPN+DALA G+ QHY A+C PSRS+LMT
Sbjct: 29 KQPNIIYILADDMGIGDLGTYGQKKIKTPNLDALAAGGIKFTQHYSGSAVCAPSRSSLMT 88
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEK--LLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
G++ H ++ + L+G PL E L + LK+AGY T GKW LGF P
Sbjct: 89 GQHTGHTPIRGNMELKGIGGQQPLPESSITLAEMLKKAGYTTGMFGKWGLGFGEG--DPN 146
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN------KTIGIYSTDLYT 198
+GFD YG Y D + P + HN ++D+ + +Y+ D
Sbjct: 147 KQGFDEFYG-------YNDQKLAHRYYPTH---LNHNQEIDSLEGNDWTSTEVYAQDKIQ 196
Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI------------- 245
+ + I + NK KP F Y+ + HA + V KF ++
Sbjct: 197 QKTLEFI-QTNKDKPFFAYVPMVLPHAELISPQDSILQQYVGKFDEVPYKNHPYLSDYGP 255
Query: 246 ---------SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
+ YA MV+R+D +G +++ L+K G+ N+I++F +DNGA H
Sbjct: 256 NMVYYKYCSQETPYAVYASMVTRMDAYIGQIVSKLKKLGIDNNTIIMFSSDNGA-----H 310
Query: 297 SNKGSN-------HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
+ G+N LRG+K ++GG+R + P + V+ + D +PTL
Sbjct: 311 AAGGANPEFFDSTDGLRGIKRDLYEGGIRAPFIVAWPGTIPSGVVTDHISAFWDVMPTLA 370
Query: 350 AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGT 409
GI+ S DG++ L + E ++ N+ +K A+R+ K V
Sbjct: 371 DIVGIKAPKQS-DGISFLPTLLNQKQQNIHEYMYWEYNIKGGRK---AVRLGKWKGVIYK 426
Query: 410 DNNGQS 415
NN +S
Sbjct: 427 ANNKKS 432
>gi|241998988|ref|XP_002434137.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215495896|gb|EEC05537.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 167
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 77/158 (48%), Positives = 107/158 (67%)
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
L+T+ A+++I +H+ +KPMFLY +H AVH G+ +AP +A F I + R +AG
Sbjct: 1 LFTKKALSLIKDHDPAKPMFLYFSHQAVHCGDYKVKLEAPASAIAHFPYIKEQNRSIHAG 60
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
V LD+SVG V+ AL K GML+NSI++F DNG +G N G N PLRG K T W+G
Sbjct: 61 AVFELDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGKEPNSGYNWPLRGSKFTLWEG 120
Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
G+RG A +WSP L +T ++S++L HI+DWLPT +AAG
Sbjct: 121 GVRGAAFVWSPLLSKTGRLSNQLMHITDWLPTFYSAAG 158
>gi|326432029|gb|EGD77599.1| steroid sulfatase [Salpingoeca sp. ATCC 50818]
Length = 537
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 121/391 (30%), Positives = 185/391 (47%), Gaps = 47/391 (12%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+ P+ ++++ DDLG+ D+ G + TP++D+L +G+ ++Q V A +CTPSR+ +T
Sbjct: 30 QPPNFLVLMVDDLGYGDLGCFGRKNVSTPHVDSLCRDGVKMDQFLVPASVCTPSRAGFLT 89
Query: 87 GKYPIHIGMQHGVI------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF---- 136
G+ P GM V+ G+P G P +E LP+ L+E GYAT GKWHLG
Sbjct: 90 GRLPRRFGMTANVLPWRVMATTGQPTGFPDSEITLPEVLREKGYATGMSGKWHLGISNLT 149
Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL---DMRHNMQVDNKTI---- 189
+ P GFDS W GL H C +PY GL +R M + N T+
Sbjct: 150 HHWAHLPKQHGFDS----WLGLPYTNMHGC----DPYHGLQNESLRDCMVMANNTVVSQP 201
Query: 190 GIYS--TDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
IYS T T+ AI+ I E ++P F Y+A++ VH F +P KF ++S
Sbjct: 202 TIYSNLTQTLTDHAIHFITEKAALNQPFFFYMAYVHVHT----PLFSSP-----KFKNVS 252
Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
+ V +D+SVG ++ AL + + N++VL +DNG S +G L+
Sbjct: 253 --AGGVFGDNVEEMDDSVGQILRALDEAHIANNTLVLLTSDNGPFLEEGWSRRGRTGGLK 310
Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
G K ++GG+R P + V E D +PT A A D +DG N
Sbjct: 311 GGKGQTYEGGIRVPGIARWPGVIPAGTVLHEAVTTLDLMPTFAALADTTAPTDRIIDGAN 370
Query: 366 QWDVLTKGAKTKRSEIL------HNIDNVDN 390
W+++ R + + HN+ N
Sbjct: 371 VWEMMVHPHNATRPDPIIWHYCGHNVTAARN 401
>gi|319643165|ref|ZP_07997794.1| arylsulfatase [Bacteroides sp. 3_1_40A]
gi|345520543|ref|ZP_08799930.1| arylsulfatase [Bacteroides sp. 4_3_47FAA]
gi|317385206|gb|EFV66156.1| arylsulfatase [Bacteroides sp. 3_1_40A]
gi|345456873|gb|EET15372.2| arylsulfatase [Bacteroides sp. 4_3_47FAA]
Length = 464
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 128/399 (32%), Positives = 187/399 (46%), Gaps = 40/399 (10%)
Query: 12 TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
T +LF+ A L+ TT K P++I I+ADDLG D+ +G QI TPNID +A
Sbjct: 4 TRKILFSVALLSSGLTMAQTTTVEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63
Query: 64 NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
NG+ QHY + + PSR AL+TGK+ H ++ + G LP E +
Sbjct: 64 NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123
Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
K Y T +GKW +G P GFD YGY L + HS F +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGVPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176
Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPF 232
+ +D K YS DL E A+N I E N KP FLY + HA G +
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEAMTEY 232
Query: 233 QAPDEEVA------KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMA 286
+ E + +P R YA MV+ LD+SVG +I L++ G+ +++I++F +
Sbjct: 233 EGEFCETPFGGSRDGYKSQQNP-RAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTS 291
Query: 287 DNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
DNG S G H S SN P RG K ++GG+R I P + V++ + D+
Sbjct: 292 DNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDF 351
Query: 345 LPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
LPT+ +I ++DG++ LT K + ++
Sbjct: 352 LPTIGELVQADI-PQNIDGISYLPTLTGKGIQKEHDCIY 389
>gi|255035424|ref|YP_003086045.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254948180|gb|ACT92880.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 457
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 109/356 (30%), Positives = 172/356 (48%), Gaps = 25/356 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSALMT 86
+ P+I+ ILADDLG+ D+ HG + TPNIDALA G+I H +C+PSRS L+T
Sbjct: 21 QSPNIVFILADDLGYGDIGAHGQKLLRTPNIDALAKEGMIFTDIHAGAPVCSPSRSVLIT 80
Query: 87 GKYPIHIGMQHGVILEGEPWG-----------LPLTEKLLPQYLKEAGYATHAIGKWHLG 135
G + H ++ + G G L + + + + ++GY T GKWHL
Sbjct: 81 GLHTGHTTIRGNATIRGGIVGNKGKQTVRRANLAAGDFTVGKLMAQSGYTTALTGKWHLD 140
Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
+ + TP RGFD G+ Y + G+ ++ Q +N G Y+ D
Sbjct: 141 GYDTLATPIHRGFDQFSGWLIAYPGTYANGYWPAKRYVNGV-LKDVEQNENGRKGYYADD 199
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
L T+ ++ +A +K F+ L ++ + + P A D K D + + Y
Sbjct: 200 LTTDESLAFLAAQKDAKKPFV----LMINYNSPHSPLDAADSSAYKDRDWPQ-DMKIYGA 254
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------SNHPLRGMK 309
V LDE+VG + L + G+ +N+IV F +DNG S G SN LRG K
Sbjct: 255 QVHHLDENVGKIKKYLTESGLAKNTIVFFCSDNGPRSEGTPQQTAIAEFFDSNGRLRGYK 314
Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
++GG+R +W+P + + VSSE + +D +PT AG +++ T+ DG +
Sbjct: 315 RDMYEGGIRVPMVVWAPGIVKPGSVSSEPAYFADIMPTFADIAGSKVSYTT-DGAS 369
>gi|424046834|ref|ZP_17784396.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Vibrio cholerae HENC-03]
gi|408884894|gb|EKM23622.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Vibrio cholerae HENC-03]
Length = 500
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 138/432 (31%), Positives = 196/432 (45%), Gaps = 58/432 (13%)
Query: 20 AFLNTTAPKKPHIIIILADDLGWNDVS-FH-GSSQIPTPNIDALAYNGLILNQHYVQALC 77
+F T A +P+I +I DD+G ++VS +H G TPNID++A G++L +Y Q C
Sbjct: 17 SFAATAAQDQPNIFVIFTDDIGVSNVSAYHRGLMSSQTPNIDSIAEKGMLLTDYYAQPSC 76
Query: 78 TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
T RSA +TG+ P+ GM H V L G P GL LP+ LKE GY T GK HLG
Sbjct: 77 TAGRSAFLTGQLPVRTGM-HTVGLPGGPVGLNQNTPTLPELLKELGYVTGQFGKNHLG-D 134
Query: 138 REVYTPTFRGFDSHYGYWQGLQ--DYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST- 194
R+ + PT GFD ++G+ L +Y + E +Q R+ + G S
Sbjct: 135 RDEFLPTMHGFDEYWGWLYHLNAMEYTEDPDWPKDEAFQKFAPRNVIHAKADGKGGQSIK 194
Query: 195 --------------DLYTEAAINVIAEHNKS-KPMFLYL----AHLAVHAGNTYEPFQAP 235
D AI+ I K+ KP F + H+ H YE
Sbjct: 195 DDGALTIERMRTLDDEVNRHAIDFIERAVKAEKPFFTWYCPSRGHVWTHLSPKYEAMLGT 254
Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
+ + + + D LD+ VG ++A L + G+ +N+IV+F ADNG P
Sbjct: 255 NGWGLQEVVMKD------------LDDHVGEMLAKLEELGVADNTIVIFTADNG-PEIMT 301
Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA--- 352
+ G P G K T W+GG+R I P KVS+ +F D LPTL AA
Sbjct: 302 WPDGGMT-PFHGEKGTTWEGGVRAPFLISWPEKIPANKVSNGMFDGMDLLPTLVEAAGGP 360
Query: 353 ---------GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDL 403
G + + LDG NQ ++LT K+ R+EI + + A+RV D
Sbjct: 361 ADLKQQMLKGYKGHKAHLDGYNQMEMLTNNGKSNRNEIFY------YERDKLQAVRVGDW 414
Query: 404 KYVAGTDNNGQS 415
K N+G S
Sbjct: 415 KAHFVVQNHGWS 426
>gi|32475083|ref|NP_868077.1| N-acetylgalactosamine-6-sulfate sulfatase [Rhodopirellula baltica
SH 1]
gi|32445623|emb|CAD75624.1| N-acetylgalactosamine-6-sulfate sulfatase [Rhodopirellula baltica
SH 1]
Length = 484
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 131/451 (29%), Positives = 202/451 (44%), Gaps = 52/451 (11%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
+ C+ L + +P+I++ILADDLG+ D+ +G+ + TP +D LA G+
Sbjct: 16 WMLVALCSHACVPTLLRADSNDRPNIVLILADDLGYGDLGCYGNDEQATPVLDRLATQGV 75
Query: 67 ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEG--------------EPWGLPLT 111
Q Y C+P+R+AL+TG+Y H+G I G GLP
Sbjct: 76 RWTQAYANGPECSPTRAALLTGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPAN 135
Query: 112 EKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATF 170
L + L GY T GKWHLG + ++P GFD G DYY + AT+
Sbjct: 136 RPTLAKRLSSVGYETALFGKWHLG-YEAKFSPMMHGFDEALYCIGGAMDYYHYLDSVATY 194
Query: 171 EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTY 229
++ HN + + Y TD T+ A+ I + N + KP FLYL + A H
Sbjct: 195 ------NLFHNGRPISGE--GYFTDTITDQAVRFIGDRNANDKPFFLYLPYTAPHT---- 242
Query: 230 EPFQAPDEEVAKFLDISDPERR-------TYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
P+QAP E L I P + Y MV +DE +G V+ A+ + M + ++V
Sbjct: 243 -PYQAPGESPVDPLPIDSPLWKQNADPPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLV 301
Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHI 341
+F +DNG ++ N PLRG K ++GG+R + A W L + VS ++
Sbjct: 302 IFASDNGG------TSASRNEPLRGFKGQAFEGGIRVPLIARWPGHLPEG-VVSDQVTIT 354
Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVD 401
D ++ AAAG I T D + DVL+ A + + + ++ +R
Sbjct: 355 FDLTASMLAAAG--ITPTQEDAMEGIDVLSLAANDEPVQPRTLYWRKPRDPQVWSGMRDG 412
Query: 402 DLKYV----AGTDNNGQSDEWYGDTDNEIDK 428
+ KYV A D EW + ++I +
Sbjct: 413 NWKYVRQEKATVDGRTSIQEWLFNLADDISE 443
>gi|254282184|ref|ZP_04957152.1| arylsulfatase A [gamma proteobacterium NOR51-B]
gi|219678387|gb|EED34736.1| arylsulfatase A [gamma proteobacterium NOR51-B]
Length = 535
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 124/427 (29%), Positives = 209/427 (48%), Gaps = 67/427 (15%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQI---PTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
++P+I++ILADD+GW ++ +G ++ PTP +D +A G+ HY + CTP+R AL
Sbjct: 68 RRPNILVILADDIGWGELGSYGGGKLKGAPTPALDQMADEGMRFLSHYTEPSCTPTRVAL 127
Query: 85 MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
MTG++P+ G+ V+ G+ GL E L + L EAGYAT GKWHLG +E + P
Sbjct: 128 MTGRHPVRTGLDE-VLFPGQVKGLVADEVTLAEVLSEAGYATGMFGKWHLGELQE-HQPQ 185
Query: 145 FRGFDSHYGY----------WQGLQDYYD-----------------HSCKATFE-PYQGL 176
++GFD Y Y W+ +YD + + TF+ P G+
Sbjct: 186 YQGFD--YAYYNLYNGAIWPWRENATHYDTENDTGITGPPYFIDIPEAYEETFDIPLHGI 243
Query: 177 DMRHNM-----QVDNKTIGIYST---DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
MR ++D ++ ++T +L E + +H P F Y A NT
Sbjct: 244 -MRAKRNTPAEEIDPLSLSRFNTFDNELTDEVIAFMRDQHEAGIPFFAYFAT------NT 296
Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
+ F PD + + +LD S+ + R + + D+++ + +L G+ EN++VL+++DN
Sbjct: 297 QQVFSCPDVD-SPYLDKSNCQAR----QLVQHDKNMARLFESLDNMGIDENTLVLWISDN 351
Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
G P + + G + LRG KS ++GG+R P + ++ H+SDW T+
Sbjct: 352 G-PMNKFYPSTGFSW-LRGYKSEVYEGGVRTPGIAKWPGSIAPGQTPIDIVHVSDWYTTI 409
Query: 349 CAAAGIEI---NDTSLDGVNQWDVLTKGAKTKRSE-------ILHNIDNVDNPQKYYAAL 398
AG + +D +DGV+Q +L G R + I + + P +A+
Sbjct: 410 ANLAGAKAAIPDDRVIDGVDQRSLLFNGEGYSRRDYVFFYRYIAYKNLSSTGPASMLSAI 469
Query: 399 RVDDLKY 405
R+ D+K+
Sbjct: 470 RMGDIKF 476
>gi|354465430|ref|XP_003495183.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cricetulus
griseus]
Length = 493
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 114/356 (32%), Positives = 169/356 (47%), Gaps = 26/356 (7%)
Query: 37 ADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM 95
A+ +GW D+ +G TPN+D +A G++ Y LC+PSR+AL+TG+ PI G
Sbjct: 8 AEVMGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGF 67
Query: 96 -------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
++ + G+P +E LLP+ LK+AGY +GKWHLG R + P GF
Sbjct: 68 YTSNGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQFHPLKHGF 126
Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINV 204
D +G YD+ K Y+ +M ++ KT T LY + A++
Sbjct: 127 DEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEALDF 186
Query: 205 I-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
I +H + P FLY A A HA P FL S R Y V +D+S
Sbjct: 187 IRTQHARQSPFFLYWAIDATHA---------PVYASRPFLGTS--HRGRYGDAVREIDDS 235
Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
VG +++ L+ G+ +N+ V F +DNGA GSN P K T ++GGMR A
Sbjct: 236 VGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFLCGKQTTFEGGMREPAIA 295
Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKR 378
W P +VS +L I D T + AG++ +D +DG++ + +G R
Sbjct: 296 WWPGHIAAGQVSHQLGSIMDLFTTSLSIAGLKPPSDRIIDGLDLLPTMLQGHVMDR 351
>gi|395819498|ref|XP_003783120.1| PREDICTED: arylsulfatase A [Otolemur garnettii]
Length = 507
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 127/395 (32%), Positives = 185/395 (46%), Gaps = 45/395 (11%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
AL LL A L T +P P++++I ADDLG+ D+ +G TPN+D LA GL
Sbjct: 3 ALGAFLLALVAGLATASP--PNVVLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFT 60
Query: 70 QHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
YV +LCTPSR+AL+TG+ P+ +GM GV+ GLPL E L + L GY T
Sbjct: 61 DFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLKPSSQGGLPLEEVTLAEVLAAKGYLTGM 120
Query: 129 IGKWHLGFFRE-VYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LD 177
GKWHLG E ++ P GF G Y + +C P G +
Sbjct: 121 AGKWHLGVGPEGIFLPPHHGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIP 180
Query: 178 MRHNMQVDNKTIGI---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
+ N+ V+ + + ++ DL T+A + +P FLY A H
Sbjct: 181 LLANLTVEVQPPWLPGLEARYVAFARDLMTDA-------QRQGRPFFLYYASHHTH---- 229
Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
Y F F + S R + + LD +VG ++ A+ G+LE ++V+F ADN
Sbjct: 230 YPQFSG-----QSFAEHSG--RGPFGDSLMELDAAVGALMTAMEDLGLLEETVVIFTADN 282
Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPT 347
G + + S G + LR K T ++GG+R A A W + T V+ EL D LPT
Sbjct: 283 GPETMRM-SRGGCSGLLRCGKGTTFEGGVREPALAFWPGHI--TPGVTHELASTLDLLPT 339
Query: 348 LCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
L A G + + +LDG + +L K+ R +
Sbjct: 340 LAALTGAPLPNVTLDGFDLSPLLLGTGKSPRQSLF 374
>gi|291232668|ref|XP_002736267.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 518
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 118/373 (31%), Positives = 177/373 (47%), Gaps = 28/373 (7%)
Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV- 73
L+F+ + + P KP++I++L DD+GW D+ G+ TPN+D +A G ++ Y
Sbjct: 12 LMFHAFYSTNSNPSKPNVILMLMDDMGWGDLGVLGNPAKETPNLDRMASEGALMTDFYAP 71
Query: 74 QALCTPSRSALMTGKYPIHIGM----QHGVILEGEPW---GLPLTEKLLPQYLKEAGYAT 126
LC+PSR++L+TG+ PI G H + G+P +E +LP+ L +AGY +
Sbjct: 72 NPLCSPSRASLLTGRLPIRNGFYTTNDHARCSYTPQYIVGGIPDSEIVLPELLNKAGYRS 131
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNM 182
IGKWHLG + Y P GFD ++G YD+ + Y+ DM
Sbjct: 132 KIIGKWHLG-HQTQYHPLKHGFDEYFGAPNCHVGPYDNKKQPNIPVYRDADMIGRYYEEF 190
Query: 183 QVDNKTIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
++D K+ T ++ E AI I +H + FLY A H +P
Sbjct: 191 KID-KSGESNLTQMFIEEAIAFIEKQHQTGEQFFLYWTPDASH---------SPHYASKS 240
Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
FL S +R Y V LD VG ++ L++ + N+ V F +DNGA G + GS
Sbjct: 241 FLGTS--QRELYGDAVMELDYGVGQILGKLKELQIENNTFVFFSSDNGAALSG-GAYVGS 297
Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTS 360
N P K T ++GGMR + W P + KVS +L I D T AG+ + D
Sbjct: 298 NGPFLCGKHTTFEGGMREPSIAWWPGRIKPGKVSHQLGTIMDLFTTALDIAGVTLPTDRI 357
Query: 361 LDGVNQWDVLTKG 373
+DG + L G
Sbjct: 358 IDGTSLLPALVNG 370
>gi|148223113|ref|NP_001088386.1| galactosamine (N-acetyl)-6-sulfate sulfatase precursor [Xenopus
laevis]
gi|54038351|gb|AAH84661.1| LOC495239 protein [Xenopus laevis]
Length = 520
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 120/388 (30%), Positives = 182/388 (46%), Gaps = 34/388 (8%)
Query: 11 LTCTLLFNDAFLNTTAP------KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
+ C+L AFLN +P+I+++L DD+GW D+ G TPN+D +A
Sbjct: 5 IMCSLAL--AFLNPVLVLGGMNVTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMASE 62
Query: 65 GLILNQHYV-QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLP 116
G++ Y LC+PSR+AL+TG+ PI G ++ + G+ +E L P
Sbjct: 63 GMLFPNFYTANPLCSPSRAALLTGRLPIRNGFFTTNSHARNAYTPQEIVGGISDSEILFP 122
Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL 176
+ LK+AGY IGKWHLG R Y P GFD +G YD+ Y+
Sbjct: 123 ELLKKAGYINKIIGKWHLGH-RPSYHPLRHGFDEWFGSPNCHFGPYDNKQIPNIPLYRDW 181
Query: 177 DM----RHNMQVDNKTIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEP 231
+M +++KT T +Y + A+ I ++ +P FLY + A HA
Sbjct: 182 NMVGRYYEEFNINHKTGESNLTQIYLQEALEFIYSQAVAQQPFFLYWSIDATHA------ 235
Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
P FL S+ R Y V +D S+G ++ L G+ +++ V F +DNGA
Sbjct: 236 ---PVYASRPFLGTSN--RGLYGDAVREIDFSIGKILNLLNLTGIAKDTFVFFTSDNGAA 290
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
GSN P K T ++GGMR W P +VS +L I D T +
Sbjct: 291 LISAPMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIMDLFTTSLSL 350
Query: 352 AGIEI-NDTSLDGVNQWDVLTKGAKTKR 378
AG+EI D ++DG++ V+ + +T+R
Sbjct: 351 AGLEIPRDRAIDGIDLSPVILESKETER 378
>gi|449138380|ref|ZP_21773646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448882925|gb|EMB13473.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 467
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 137/444 (30%), Positives = 205/444 (46%), Gaps = 64/444 (14%)
Query: 7 YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
YF A T L D +P+I++ILADDLG+ D+ +G+ TP +D LA G+
Sbjct: 5 YFHACVPTSLRADPN------DRPNIVLILADDLGYGDLGCYGNDAQATPTLDRLARQGV 58
Query: 67 ILNQHYVQAL-CTPSRSALMTGKYPIHIGMQHGVILEG--------------EPWGLPLT 111
Q Y C+P+R+AL+TG+Y H+G I G GLP
Sbjct: 59 RWTQAYANGPECSPTRAALLTGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVSELGLPAD 118
Query: 112 EKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATF 170
L + L + GY T GKWHLG+ + ++P GF+ G DYY + AT+
Sbjct: 119 RPTLAKRLSDVGYETALFGKWHLGYEAQ-FSPMQHGFEEALYCIGGAMDYYHYLDSVATY 177
Query: 171 EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTY 229
+ R+ + + Y TD T+ A+ + + N S +P FLYL + A H
Sbjct: 178 NLF-----RNGRPISGEG---YFTDTITDHAVQYVRDRNDSEQPFFLYLPYTAPHT---- 225
Query: 230 EPFQAPDEEVAKFLDI-------SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
P+Q P + L + SD Y MV +DE +G V+ A+ + GM + ++V
Sbjct: 226 -PYQPPGQSPVDPLPLDSPLWKQSDDPPGVYRSMVRHMDEGIGKVLHAIEESGMTDRTLV 284
Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHI 341
+F +DNG ++ N PLRG K ++GG+R + A W L + VS ++
Sbjct: 285 IFASDNGG------TSASRNEPLRGFKGQTFEGGIRVPLIARWPGHLPE-DTVSQQVTIT 337
Query: 342 SDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSE--ILHNIDNVDNPQKYYAAL 398
D ++ AAAGI + +++G+ DVL+ A + E LH D PQ + +
Sbjct: 338 FDLTASMLAAAGITPAQEDAMEGI---DVLSLAASGEPPEPRTLHWRKPRD-PQ-VWGGI 392
Query: 399 RVDDLKYV----AGTDNNGQSDEW 418
R LKYV A TD EW
Sbjct: 393 RDGKLKYVRQEKATTDGRTTIQEW 416
>gi|397468273|ref|XP_003805816.1| PREDICTED: N-acetylgalactosamine-6-sulfatase isoform 1 [Pan
paniscus]
Length = 482
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/362 (30%), Positives = 169/362 (46%), Gaps = 25/362 (6%)
Query: 40 LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM--- 95
+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+TG+ PI G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60
Query: 96 ----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
++ + G+P +E+LLP+ LK+AGY + +GKWHLG R + P GFD
Sbjct: 61 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH-RPQFHPLKHGFDEW 119
Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
+G YD+ + Y+ +M ++ KT T +Y + A++ I
Sbjct: 120 FGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKR 179
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+ P FLY A A HA P FL S +R Y V +D+S+G +
Sbjct: 180 QARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRYGDAVREIDDSIGKM 228
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ L+ + +N+ V F +DNGA GSN P K T ++GGMR A W P
Sbjct: 229 LELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPG 288
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
+VS +L I D T A AG+ +D ++DG+N L +G R + D
Sbjct: 289 HVAAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGD 348
Query: 387 NV 388
+
Sbjct: 349 TL 350
>gi|149196404|ref|ZP_01873459.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
gi|149140665|gb|EDM29063.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
Length = 456
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 116/346 (33%), Positives = 186/346 (53%), Gaps = 24/346 (6%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
KP+II I+ DD+G+ + +G I TP +D +A GL L +Y A+C PSR +LMTG
Sbjct: 19 KPNIIFIMCDDMGYGQLGSYGQKMIKTPRLDQMAKEGLRLTDYYAGTAVCAPSRCSLMTG 78
Query: 88 KYPIHIGMQHGVILEGEPWG---LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+ H+G + + P G +P + + +KEAGYAT IGKW LG+ P
Sbjct: 79 Q---HVGHTYIRGNKEYPTGQEPIPAETITVAEKMKEAGYATALIGKWGLGYPGSEGEPN 135
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
+GFD +GY + ++H K + L +++N K I YS + T+ A
Sbjct: 136 KQGFDYFFGY-NDQKHAHNHFPKFLLRNEETLTLKNN---SGKEIE-YSQYMLTDEAKGF 190
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP--DEEVAKFLDISDPER-RTYAGMVSRLD 261
I + NK P FLYLA++ H+ Q P DE ++ D S PE+ + +AGM+SRLD
Sbjct: 191 IKK-NKDNPFFLYLAYVIPHS-----RLQIPGDDECYLQYKDESWPEKQKKHAGMISRLD 244
Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH--PLRGMKSTPWDGGMRG 319
+ VG+++ L++ + EN++V+F +DNGA G + N PL G+K + ++GG+R
Sbjct: 245 KDVGSILDLLKEMNLAENTLVVFTSDNGAHREGGARPEFFNDSGPLSGIKRSMYEGGVRV 304
Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
P + + +VS+ + D +PT C G++ + +DG++
Sbjct: 305 PFIAHWPGVIKPGQVSNHIGAHWDLMPTACELGGVQPPE-GIDGIS 349
>gi|440749357|ref|ZP_20928605.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
gi|436482362|gb|ELP38485.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
Length = 484
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 130/394 (32%), Positives = 184/394 (46%), Gaps = 57/394 (14%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
KP+II ILADDLG+ D+ G ++I TPN+D LA G++ HY A +C PSR+ALMTG
Sbjct: 33 KPNIIFILADDLGYGDLGVTGQTKIETPNLDKLAREGMLFTNHYSGATVCAPSRAALMTG 92
Query: 88 KYPIHIGMQHGVILEGEPWG---LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
+ H ++ E +P G +P + LP+ +K AGYAT A GKW LGF P
Sbjct: 93 LHTGHTPIRGNK--EIQPEGQHPMPDSVLTLPKMMKRAGYATGAFGKWGLGFVGTSGDPI 150
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN--KTIGIYSTDLYTEAAI 202
+GFD +GY Q Y F Y L+ N ++ D+ +AA+
Sbjct: 151 NQGFDQFFGY--NCQRY----AHRYFPEYLWLNSEKVFLPGNDWTNKATFAPDVIQKAAL 204
Query: 203 NVIAEHNKSKPMFLYLAHLAVHA------------------GNTYEPFQAPDE------- 237
I E N P F+Y+ + HA G+ E F AP
Sbjct: 205 EFI-EANSGNPFFMYVPMVIPHAELAADEESPIFKKYRERFGDERE-FVAPAGWDYGPGI 262
Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
+ + + P R TYA MV +LD VG +IA L G+ EN+I++F +DNG H
Sbjct: 263 NIPGYQSVKHP-RATYATMVEKLDAHVGEIIAKLEVLGIRENTIIVFASDNGP-----HQ 316
Query: 298 NKG-------SNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
G SN RG K ++GG++ + A W P + VS + D +PT
Sbjct: 317 EGGNDPDFFDSNGIYRGYKRDLYEGGIKTSLIANW-PRKIKAGSVSDHVSAFWDLMPTFA 375
Query: 350 AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
AG+ I D LDG++ L + + E L+
Sbjct: 376 ELAGVSI-DAELDGISMVPTLLGEGRQVQHEYLY 408
>gi|336413808|ref|ZP_08594157.1| hypothetical protein HMPREF1017_01265 [Bacteroides ovatus
3_8_47FAA]
gi|335934825|gb|EGM96808.1| hypothetical protein HMPREF1017_01265 [Bacteroides ovatus
3_8_47FAA]
Length = 472
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 126/391 (32%), Positives = 187/391 (47%), Gaps = 32/391 (8%)
Query: 11 LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
L L+ A + K P+++ I+ADDLG D+ +G ++I TP IDALA G+ Q
Sbjct: 18 LMLPLISGQAQKQDSPSKTPNVVFIMADDLGIGDLGCYGQNRIKTPAIDALAAQGMKFTQ 77
Query: 71 HYVQA-LCTPSRSALMTGKYPIH--IGMQHGVILE-GEPWGLPLT--EKLLPQYLKEAGY 124
HY + + PSR L+TGK+ H I G E G + L L E + + K Y
Sbjct: 78 HYSGSTVSAPSRCVLLTGKHTGHSYIRGNKGYKAEDGRSYDLNLADEEITIGEMFKTKNY 137
Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYW-QG-LQDYYDHSCKATFEPYQGLDMRHNM 182
T +GKW LG P +GFD +GY QG +YY P Q + +
Sbjct: 138 VTACVGKWGLGGPSAEGHPNKQGFDYFFGYLGQGNAHNYY---------PVQLFENNTPV 188
Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPFQAPDE 237
+++K +YS D+ + A+ I + N KP F+YL HA N + E
Sbjct: 189 MLNHK---VYSHDMIMDKALEFI-DKNADKPFFVYLTPTIPHADIIVPNNELFDYDGKFE 244
Query: 238 EV---AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
EV K + R T+A MV+RLD V VI L G+L+N+I++F +DNG G
Sbjct: 245 EVPYPGKGYRPQEKPRATFAAMVTRLDRDVQRVIDLLADKGVLDNTIIIFTSDNGTHKEG 304
Query: 295 IHSNK--GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
H + SN P RGMK ++GG+R + P + VS ++ D++PT+C
Sbjct: 305 GHDPRYFDSNGPFRGMKRDLYEGGIRTPFIVKWPGVIPAGSVSFQVSAFWDFMPTMCELI 364
Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
G ++DG++ LT K K+ + L+
Sbjct: 365 GAPA-PANIDGLSYLPTLTGKGKQKQHDYLY 394
>gi|325110685|ref|YP_004271753.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324970953|gb|ADY61731.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 592
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 123/376 (32%), Positives = 179/376 (47%), Gaps = 45/376 (11%)
Query: 4 ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
R F+L+ L F A +P++I IL DD GW D+S HG+ I TPNID +A
Sbjct: 5 VRISVFSLS-ILAFLSYLCVAEAADRPNVIAILTDDQGWGDLSIHGNLAINTPNIDKMAR 63
Query: 64 NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
G L + +V +C+P+R+ +TG++ + G+ + GE + E ++ + K AG
Sbjct: 64 EGAQLERFFVSPVCSPTRAEFLTGRHHVRCGV-YSTSAGGE--RVNADETMIGEVFKRAG 120
Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
Y T A GKWH G + Y P RGFD YG+ G Y F P + HN
Sbjct: 121 YQTAAFGKWHSG-MQAPYHPNSRGFDEFYGFCSGHWGNY-------FSPM----LEHNG- 167
Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
D T + D T A++ I EH + +P F+YL + H+ P Q P+ K
Sbjct: 168 -DLVTGEGFIIDDLTNRAMSYIDEH-QQEPFFVYLPYNTPHS-----PMQVPERWYKKLA 220
Query: 244 DI----SDP--------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
D+ DP R M +D +VG ++ L + + EN+IVL+ DNG
Sbjct: 221 DLELTQEDPNASEKQLLHTRAALAMCENIDWNVGRLLEKLDQLNLSENTIVLYFCDNGP- 279
Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCA 350
+ K N +RG K + +GG+R I W +K KV ++ D LPTL
Sbjct: 280 -----NGKRYNGGMRGRKGSTDEGGVRSPLFIRWPAKIKPGTKV-PQICSAMDLLPTLAD 333
Query: 351 AAGIEINDTS-LDGVN 365
AGIE+ DT LDG++
Sbjct: 334 MAGIEVTDTKPLDGIS 349
>gi|325111172|ref|YP_004272240.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324971440|gb|ADY62218.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 474
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 174/349 (49%), Gaps = 29/349 (8%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
+ P+ +I+ DD G+ DV G+ TPN+D +A G YV QA+C+ SR++L+T
Sbjct: 21 ETPNFVIVFTDDQGYADVGCFGAEGYETPNLDRMAQEGRKFTSFYVSQAVCSASRTSLLT 80
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G Y + +G+ +G + G+ E L + +K+ GY T A+GKWHLG+ ++ + P
Sbjct: 81 GCYNVRVGI-NGALSHKSTVGISTEEMTLGELVKQQGYTTAAVGKWHLGY-QQKFLPLQH 138
Query: 147 GFDSHYG--YWQGLQDYYDHSCKATFEP-----YQGLDMRHNMQVDNKTIGIYS----TD 195
GFD ++G Y + Y+ T E Y L + ++ + + T
Sbjct: 139 GFDRYFGLPYSNDMWPYHPRLEGMTLEEARVKGYPDLPLYSGNEIIDAQVTPEDQRQLTT 198
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
YTE A++ I + N KP LYLAH H P D KF D ++ R Y
Sbjct: 199 QYTEQALDFI-DDNHDKPFLLYLAHSMPHV-----PLFVSD----KFQDKTEQGR--YGD 246
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWD 314
++ +D SVG V+A L+KHG+ +N++V+F +DNG S+G H+ GS P R K T W+
Sbjct: 247 VIQEIDWSVGQVLARLKKHGLDDNTLVIFTSDNGPWLSYGNHA--GSAGPFREGKGTSWE 304
Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
GG+R + P EL D PT+ A A + D +DG
Sbjct: 305 GGVRVPCIMRWPGKIPAGTSCDELAATIDIFPTVAALADAPLPDHKIDG 353
>gi|449134034|ref|ZP_21769542.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448887354|gb|EMB17735.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 728
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 168/340 (49%), Gaps = 26/340 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
+KP+I++IL DD G+ D+ +G++++ TP ID +A G+ +Y A +C+PSR+ L+T
Sbjct: 4 EKPNIVLILVDDQGYYDLGCYGATEVKTPRIDEMAGGGIRFTDYYAAAPICSPSRAGLLT 63
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G YP +G V G+ E L + K+ GY T IGKWHLG F E + P +
Sbjct: 64 GCYPRRVGNHVWVHRADSNTGIHSDELTLAELFKDNGYKTACIGKWHLG-FHEPFLPQNQ 122
Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVI 205
GFD ++G L D FE G+ ++ + V + T LYT AI+ I
Sbjct: 123 GFDHYFGLLHNL----DPVETVYFEDVGGVPLQRDRDVVKRPADPDELTKLYTNEAIDFI 178
Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
E NK P LYL H +H P +E F S + Y + LD VG
Sbjct: 179 -EANKEGPFLLYLPHTMLH-----NPLGVSEE----FKGTS--QWGEYGDAIQELDHHVG 226
Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
+I L++ + +N++V++ +DNG + +RG K + ++GG+R A W
Sbjct: 227 RIIDVLKRLNIDDNTVVVYASDNGR-----GPGRNPQQKIRGRKLSTYEGGIRVPAIAWG 281
Query: 326 PWLKQTQKVSS-ELFHISDWLPTLCAAAGIEI-NDTSLDG 363
P L +SS + DW PTL A IEI ++ +DG
Sbjct: 282 PGLGLRAGISSPAVVRAMDWYPTLATLADIEIPSERVIDG 321
>gi|365877064|ref|ZP_09416570.1| Cerebroside-sulfatase [Elizabethkingia anophelis Ag1]
gi|442586911|ref|ZP_21005733.1| Cerebroside-sulfatase [Elizabethkingia anophelis R26]
gi|365755338|gb|EHM97271.1| Cerebroside-sulfatase [Elizabethkingia anophelis Ag1]
gi|442563318|gb|ELR80531.1| Cerebroside-sulfatase [Elizabethkingia anophelis R26]
Length = 454
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 127/415 (30%), Positives = 195/415 (46%), Gaps = 42/415 (10%)
Query: 11 LTCTL--LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
+ C L LF F N A K+P+II++L DD+G+ D+ +G+ I TP +D ++ NGL+
Sbjct: 5 VRCALIYLFTVLFSNARAQKQPNIIVVLTDDMGYADIGAYGNPVIKTPFLDQMSRNGLMA 64
Query: 69 NQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
+ V + CTPSR++++TG+Y + I G GLP E + + LK GY T
Sbjct: 65 TNYVVSSPTCTPSRASMLTGRYSSRYDLPWP-IAPGSKQGLPDDEVTIAEMLKANGYNTG 123
Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD----MRHNMQ 183
+GKWHLG + P +GFD +YG Y H KA PY D M N +
Sbjct: 124 MVGKWHLGDQKAENKPNGQGFDFYYGI------LYSHDYKA---PYVNTDIPIRMFRNTK 174
Query: 184 VD-NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
V+ K T LYT+ +IN I + K KP FLYLAH H P A+
Sbjct: 175 VEIEKPADSLLTRLYTKESINYIRQQKKDKPFFLYLAHNMPH---------LPVYYAAQS 225
Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG----------APS 292
+ + + +++ +DE + + L + GM +N+I +F +DNG
Sbjct: 226 SALKNEKGGALGAVITEMDEGLAAIWKVLEEKGMADNTIFVFSSDNGPWTNFPERMEGDG 285
Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
I S+ GS RG K+ ++GG R W +K ++S + ++ D LPT+
Sbjct: 286 VTIASHVGSAGIFRGSKAWSYEGGARVPFIVYWKNKIKPGTVLTSAISNL-DLLPTIARW 344
Query: 352 AGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKY 405
+ + LDG + +L K K SE +H + N A++ + KY
Sbjct: 345 TNSPLPKNRELDGQDISQLLEN--KVKESEYIHRPIFIVNGSTKPEAVKYGNWKY 397
>gi|329957723|ref|ZP_08298198.1| arylsulfatase [Bacteroides clarus YIT 12056]
gi|328522600|gb|EGF49709.1| arylsulfatase [Bacteroides clarus YIT 12056]
Length = 469
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 124/354 (35%), Positives = 180/354 (50%), Gaps = 34/354 (9%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
++P+I+I+L DD G+ D+ +GS + TPN+D LA G Y A TPSR+AL+T
Sbjct: 36 RQPNIVILLTDDQGYGDMKCYGSPHLQTPNLDRLAAEGTRFTNFYAGAAASTPSRAALLT 95
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
G+Y +G+ GV+ + G+ +E L +YLK+ YAT +GKWHLG+ E Y P
Sbjct: 96 GRYAERVGVP-GVVDDTSENGIKRSEITLAEYLKQNDYATGIVGKWHLGYQPE-YLPLRH 153
Query: 147 GFDSHYG--YWQGLQDYY---DHSCKATFEPYQGLDMRHNMQVD--NKTIGIYSTDLYTE 199
GF +G Y + Y+ DH+ Y L + N +V N + +T L TE
Sbjct: 154 GFTEFFGLPYSNDMWPYHPAPDHA-------YPALPLYDNDRVIEYNAPVNQMTTRL-TE 205
Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
AI I +H K++P FLYL + H P D KF S YA ++
Sbjct: 206 RAIRFINDH-KAEPFFLYLPYTQPHV-----PLGVSD----KFKGKSG--EGLYADVIME 253
Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
+D SVG ++ L+ +G+ EN+IV+F +DNG S+G H GSN LR K T +DGG R
Sbjct: 254 IDWSVGEIMKTLKANGLDENTIVMFTSDNGPWLSYGNHG--GSNGGLREGKGTTFDGGQR 311
Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLT 371
+ P + KVS + D PTL ++ + DG + W +LT
Sbjct: 312 VPFIVRMPGVIPAGKVSDQFLSALDITPTLVNLTQSQMPRMNKFDGQDAWGILT 365
>gi|440717773|ref|ZP_20898250.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436437075|gb|ELP30749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 474
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 114/397 (28%), Positives = 195/397 (49%), Gaps = 37/397 (9%)
Query: 9 FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
FA T + + TT P++I++++DD GW DV F+G+ + TPN+DA+A G+
Sbjct: 12 FAFTLAVPATQSIAETTGTNSPNVILLMSDDQGWGDVGFNGNEVVQTPNLDAMASAGVRF 71
Query: 69 NQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
++ Y A LC+P+R + +TG+YP G IL G+ + E + + L++ GYAT
Sbjct: 72 DRFYAAAPLCSPTRGSCLTGRYPFRFG-----ILAAHTGGMRVGEITIAEMLQKRGYATG 126
Query: 128 AIGKWHLGFFR--EVYT------PTFRGFDSHYGYWQGLQDY--------YDHSCKATFE 171
GKWH+G+ + EV T P+ GFD ++ + + +D E
Sbjct: 127 MFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEYFATTSAVPTWDPTITPQDWDSWGNGPGE 186
Query: 172 PYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE 230
P++ G HN + + + + + + I I E N++KP F + A H E
Sbjct: 187 PWKGGFPYVHNGREAKENLSGDDSRVIMDRVIPFI-EANQAKPFFATVWFHAPH-----E 240
Query: 231 PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
P A EE K + +R+ Y G ++ +D+ VG + A LR+ G+ +N++V F +DNG
Sbjct: 241 PVVA-GEEFKKLYPKAGSKRKNYYGCITAMDQQVGRLRAKLRELGIEDNTVVFFCSDNG- 298
Query: 291 PSFGIHSNKG--SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
PS G+ + KG S P +G K T ++GG+ A P + D+LPT+
Sbjct: 299 PSDGL-AKKGVASAGPFKGHKHTMYEGGLLVPACAEWPGTIPAGTSTEVRCSTVDFLPTV 357
Query: 349 CAAAG---IEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
+ G ++ +DG++ ++ AK + ++
Sbjct: 358 ASIVGDSMVQKATRPIDGIDLMPLIRGEAKDRDRDLF 394
>gi|355710478|gb|EHH31942.1| hypothetical protein EGK_13112 [Macaca mulatta]
Length = 482
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 113/362 (31%), Positives = 167/362 (46%), Gaps = 25/362 (6%)
Query: 40 LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM--- 95
+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+TG+ PI G
Sbjct: 1 MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60
Query: 96 ----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
++ + G+P +E+LLP+ LK+AGY + +GKWHLG R + P GFD
Sbjct: 61 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH-RPQFHPLKHGFDEW 119
Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
+G YD+ + Y+ +M ++ KT T +Y + A++ I
Sbjct: 120 FGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKR 179
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+ P FLY A A HA P FL S +R Y V +D+SVG +
Sbjct: 180 QARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRYGDAVREIDDSVGKM 228
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ L + +N+ V F +DNGA GSN P K T ++GGMR A W P
Sbjct: 229 LELLHDLHVADNTFVFFTSDNGAALISAPEQAGSNGPFLCGKQTTFEGGMREPALAWWPG 288
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
+VS +L I D T A AG+ D ++DG+N L +G R + D
Sbjct: 289 HIAAGQVSHQLGSIMDLFTTSLALAGLTPPRDRAIDGLNLLPALLQGRLMDRPVFYYRGD 348
Query: 387 NV 388
+
Sbjct: 349 TL 350
>gi|393786360|ref|ZP_10374496.1| hypothetical protein HMPREF1068_00776 [Bacteroides nordii
CL02T12C05]
gi|392659989|gb|EIY53606.1| hypothetical protein HMPREF1068_00776 [Bacteroides nordii
CL02T12C05]
Length = 467
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 120/372 (32%), Positives = 180/372 (48%), Gaps = 29/372 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
K P+++ I+ADDLG D+ +G +I TP IDALA G+ QHY + + PSR L+T
Sbjct: 31 KAPNVVFIMADDLGIGDLGCYGQERIKTPAIDALAKQGMKFTQHYSGSTVSAPSRCVLLT 90
Query: 87 GKYPIH--IGMQHGVILE-GEPWGLPLT--EKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
GK+ H I G I E G + L E + + K Y T +GKW LG
Sbjct: 91 GKHTGHSFIRGNKGYIAEDGRKYDFNLADEEVTVGEIFKTKNYTTACVGKWGLGGPSAEG 150
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
P +GFD +GY + P Q + +++D K +YS D+ + A
Sbjct: 151 HPNRQGFDYFFGYL-------GQGNAHNYFPVQLFENNTPVKLDYK---VYSHDIIMDKA 200
Query: 202 INVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPFQAPDEE---VAKFLDISDPERRTY 253
+ + +H ++P FLYL HA N + EE V K+ P R T+
Sbjct: 201 LAFLDKH-ANEPFFLYLTPTIPHADIIVPNNELFDYDGKFEEVPYVGKYRPQEKP-RATF 258
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GSNHPLRGMKST 311
A MV+RLD + VI L + G+LEN+I++F +DNG G H SN P RG+K
Sbjct: 259 AAMVTRLDRDIQRVIDLLAEKGVLENTIIIFTSDNGTHKEGGHDPGYFDSNGPFRGIKRD 318
Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
++GG+R + P + VS ++ D++PT+C G + ++DG++ VLT
Sbjct: 319 LYEGGIRTPFIVRWPGVIPAGSVSYQISTFWDFMPTMCELIGAKA-PANIDGISYLPVLT 377
Query: 372 KGAKTKRSEILH 383
K K + L+
Sbjct: 378 GKGKQKEHDHLY 389
>gi|262382591|ref|ZP_06075728.1| arylsulfatase [Bacteroides sp. 2_1_33B]
gi|262295469|gb|EEY83400.1| arylsulfatase [Bacteroides sp. 2_1_33B]
Length = 471
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 128/396 (32%), Positives = 191/396 (48%), Gaps = 64/396 (16%)
Query: 9 FALTCTLLFNDAFLNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
++ TLL AP++P +++ ILADDLG+ DV +G I TPNID +A G++
Sbjct: 7 LCISGTLLLIGMSGCEKAPERPMNVVYILADDLGYGDVGCYGQQIIKTPNIDRMAKEGML 66
Query: 68 LNQHYVQALCT---PSRSALMTGKYPIHIGMQ--HGVILEGEPWGLPLTEK--LLPQYLK 120
QHY A CT PSR +LMTG + H ++ + EG+ P+ E L + +K
Sbjct: 67 FTQHY--AGCTVSAPSRCSLMTGLHTGHTQVRGNREITPEGQQ---PMREDTYTLGKLMK 121
Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
AGY T GKW LG V P GFD YGY +C+ + + H
Sbjct: 122 SAGYTTGIFGKWGLGNPGSVSIPNKMGFDEFYGY----------NCQRQSHSFYPDHLWH 171
Query: 181 NMQV------DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA--------- 225
N + +N YS DL E A+ I +H K +P F L + HA
Sbjct: 172 NEEKVLFPENENNACKTYSQDLIHEQALKFIRDH-KEQPFFAMLTYTLPHAELNLPHDSI 230
Query: 226 ----GNTYE--PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLEN 279
N++E P+ ++V + S+ ++A MVSRLDE VG+V+A L++ G+ +N
Sbjct: 231 YKMYENSFEETPYIGKFDKVYGGYNTSEKPLASFAAMVSRLDEYVGDVMAELKELGLDKN 290
Query: 280 SIVLFMADNGAPSFGIHSNKGSN-------HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
+IV+F +DNG H G++ P RG+K ++GG+R W P T
Sbjct: 291 TIVIFTSDNGP-----HHEGGADPDFFKSYGPFRGIKRDVYEGGIRIPMVAWCP---GTI 342
Query: 333 KVSSELFHIS---DWLPTLCAAAGIEINDTSLDGVN 365
K ++ HIS D +PTL G + + +DG++
Sbjct: 343 KAGAQSDHISAFWDVMPTLAELTGTVLPE-QMDGIS 377
>gi|114221|sp|P15289.3|ARSA_HUMAN RecName: Full=Arylsulfatase A; Short=ASA; AltName:
Full=Cerebroside-sulfatase; Contains: RecName:
Full=Arylsulfatase A component B; Contains: RecName:
Full=Arylsulfatase A component C; Flags: Precursor
gi|28858|emb|CAA36399.1| arylsulfatase A [Homo sapiens]
gi|28860|emb|CAA36398.1| arylsulphatase a [Homo sapiens]
gi|30016943|gb|AAP03431.1| arylsulfatase A [Homo sapiens]
gi|189054653|dbj|BAG37503.1| unnamed protein product [Homo sapiens]
Length = 507
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 190/404 (47%), Gaps = 30/404 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCTPSR+AL+T
Sbjct: 19 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 78
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ +GM GV++ GLPL E + + L GY T GKWHLG E + P
Sbjct: 79 GRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPH 138
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
+GF G Y + +C P G + + N+ V+ + + +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLE 198
Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
Y A +++A+ + +P FLY A H Y F F + S R +
Sbjct: 199 ARYMAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 247
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
+ LD +VG ++ A+ G+LE ++V+F ADNG + + S G + LR K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 306
Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
+GG+R A A W + V+ EL D LPTL A AG + + +LDG + +L
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
K+ R + D + + A+R K T + SD
Sbjct: 365 TGKSPRQSLFFYPSYPDEVRGVF-AVRTGKYKAHFFTQGSAHSD 407
>gi|87312329|ref|ZP_01094424.1| arylsulfatase A [Blastopirellula marina DSM 3645]
gi|87284951|gb|EAQ76890.1| arylsulfatase A [Blastopirellula marina DSM 3645]
Length = 477
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 126/374 (33%), Positives = 178/374 (47%), Gaps = 40/374 (10%)
Query: 29 KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
KP+ +II DDLG+ D+ GS TPN++A+A G+ L Y +C+PSR+ALMTG
Sbjct: 30 KPNFVIINIDDLGYADIEPFGSEVNRTPNLNAMADEGMKLTCFYAAPVCSPSRAALMTGC 89
Query: 89 YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
YP V+ G G+ E + + +KE GYAT IGKWHLG + + PT +GF
Sbjct: 90 YPKRALTIPHVLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLG-DQPDFLPTRQGF 148
Query: 149 DSHYGY---------WQGLQDYYDHSC---KATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
D +YG G++ Y K +P L + N V + + T+L
Sbjct: 149 DYYYGLPYSNDMGPAADGVKSNYGAPIPQRKGKGQP--PLPLLRNETVLQRVLAKDQTEL 206
Query: 197 ---YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
YTE AI I +H + KP FLYL H AVH P D K Y
Sbjct: 207 VTNYTEEAIQFIRDH-QEKPFFLYLPHSAVHF-----PMYPGDAFRGKN------SHGLY 254
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPLRGMKSTP 312
V +D SVG V+ AL+ G+ + ++V+F +DNG + FG N PLR K+T
Sbjct: 255 NDWVEEVDWSVGQVLQALKDLGLDQRTLVIFTSDNGGQTRFG-----AVNKPLRAGKATT 309
Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLT 371
++GGMR + P S + + D LPTL A G D +DG + +L
Sbjct: 310 YEGGMRVPTIVRWPGKVPAGSSSDAVVGMIDVLPTLVKLAGGTTPTDRKIDGADIGPIL- 368
Query: 372 KGAKTKRSEILHNI 385
A K ++ H++
Sbjct: 369 --AGVKEAKSPHDV 380
>gi|313569791|ref|NP_000478.3| arylsulfatase A isoform a precursor [Homo sapiens]
gi|313569793|ref|NP_001078894.2| arylsulfatase A isoform a precursor [Homo sapiens]
gi|313569795|ref|NP_001078895.2| arylsulfatase A isoform a precursor [Homo sapiens]
gi|313569797|ref|NP_001078896.2| arylsulfatase A isoform a precursor [Homo sapiens]
gi|1399961|gb|AAB03341.1| arylsulfatase A [Homo sapiens]
gi|47678297|emb|CAG30269.1| ARSA [Homo sapiens]
gi|109451022|emb|CAK54372.1| ARSA [synthetic construct]
gi|109451600|emb|CAK54671.1| ARSA [synthetic construct]
gi|220983390|dbj|BAH11167.1| arylsulfatase A [Homo sapiens]
gi|261857504|dbj|BAI45274.1| arylsulfatase A [synthetic construct]
Length = 509
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 125/404 (30%), Positives = 190/404 (47%), Gaps = 30/404 (7%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
+ P+I++I ADDLG+ D+ +G TPN+D LA GL YV +LCTPSR+AL+T
Sbjct: 21 RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 80
Query: 87 GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
G+ P+ +GM GV++ GLPL E + + L GY T GKWHLG E + P
Sbjct: 81 GRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPH 140
Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
+GF G Y + +C P G + + N+ V+ + + +
Sbjct: 141 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLE 200
Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
Y A +++A+ + +P FLY A H Y F F + S R +
Sbjct: 201 ARYMAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 249
Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
+ LD +VG ++ A+ G+LE ++V+F ADNG + + S G + LR K T +
Sbjct: 250 GDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 308
Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
+GG+R A A W + V+ EL D LPTL A AG + + +LDG + +L
Sbjct: 309 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 366
Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
K+ R + D + + A+R K T + SD
Sbjct: 367 TGKSPRQSLFFYPSYPDEVRGVF-AVRTGKYKAHFFTQGSAHSD 409
>gi|449132445|ref|ZP_21768528.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448888347|gb|EMB18667.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 478
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 115/358 (32%), Positives = 176/358 (49%), Gaps = 47/358 (13%)
Query: 26 APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
A + P+ ++I ADDLG+ D+S + SS + TP++DALA G +V A +C+PSR+AL
Sbjct: 39 ADRPPNFVLIFADDLGYGDISCYDSSGVKTPHLDALAEEGFRSTDFFVSANVCSPSRAAL 98
Query: 85 MTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
+TG+YP+ GM ++ + + + +G E +P+ L AGY + +GKWHLG E
Sbjct: 99 LTGRYPMRCGMPVARNENVAKYQDYGFAPEELTIPELLHPAGYRSLMVGKWHLGMEVEGS 158
Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS------TD 195
P GFD ++G + +EP +G + HN K + + T
Sbjct: 159 HPLDAGFDEYFG------------IPSNYEPRRGKN--HNTLYRGKAVEQENVPCEVLTK 204
Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
YT+ I I + + P F+Y++H VH N P + F+ S E+ Y
Sbjct: 205 RYTDEVIEFI-DRQQDDPFFIYVSHHIVH--NPLRPSK-------DFVGTS--EKGKYGD 252
Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
+ LD S G ++ A+R G+ EN++V+F +DNG + GS L G K +G
Sbjct: 253 FIKELDHSTGRIMQAIRDAGIDENTLVVFTSDNGP------TRNGSAGGLNGGKYCTMEG 306
Query: 316 GMRGVAAI--WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
G R V I W+ + Q VS D LP C AG++ +D LDG N +L
Sbjct: 307 GHR-VPGIFRWAGKIAPNQ-VSDATLTSMDLLPLFCGLAGVQPPDDRKLDGKNILPIL 362
>gi|426383228|ref|XP_004058189.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gorilla gorilla
gorilla]
Length = 528
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 112/362 (30%), Positives = 169/362 (46%), Gaps = 25/362 (6%)
Query: 40 LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM--- 95
+GW D+ +G TPN+D +A G++ Y LC+PSR+AL+TG+ PI G
Sbjct: 47 MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 106
Query: 96 ----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
++ + G+P +E+LLP+ LK+AGY + +GKWHLG R + P GFD
Sbjct: 107 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH-RPQFHPLKHGFDEW 165
Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
+G YD+ + Y+ +M ++ KT T +Y + A++ I
Sbjct: 166 FGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKR 225
Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
+ P FLY A A HA P FL S +R Y V +D+S+G +
Sbjct: 226 QARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRYGDAVREIDDSIGKM 274
Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
+ L+ + +N+ V F +DNGA GSN P K T ++GGMR A W P
Sbjct: 275 LELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPG 334
Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
+VS +L I D T A AG+ +D ++DG+N L +G R + D
Sbjct: 335 HVAAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGD 394
Query: 387 NV 388
+
Sbjct: 395 TL 396
>gi|326331907|ref|ZP_08198194.1| twin-arginine translocation pathway signal [Nocardioidaceae
bacterium Broad-1]
gi|325950404|gb|EGD42457.1| twin-arginine translocation pathway signal [Nocardioidaceae
bacterium Broad-1]
Length = 463
Score = 158 bits (400), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 189/404 (46%), Gaps = 47/404 (11%)
Query: 28 KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
+KP++++IL DDLGW D+S +GS I TP++D LA +G+ Y ++C+P+R L T
Sbjct: 47 RKPNLLVILGDDLGWADLSAYGSPDIKTPHLDRLAASGVRFTDAYSASSVCSPTRFGLYT 106
Query: 87 GKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
G+YP + G+ + + G+PL L LKE GYAT GKWH G+ Y+PT
Sbjct: 107 GRYPGRLAGGLPEPISAPNKLHGIPLDHPTLASLLKEQGYATALFGKWHCGYL-PWYSPT 165
Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
G+D +G + G DY+ + G + +V+ + + Y T++ TE A++
Sbjct: 166 RVGWDEFFGNFSGGLDYFSKIN------HNGAHDLYEDEVEVEDLRYY-TEIVTERAVSF 218
Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE--------------EVAKFLDISDPER 250
I E + P L L H P++ P++ E
Sbjct: 219 I-ERDHDTPWLLNLNFTTPHW-----PWEGPEDKAVSDELTARIEGGEKGVLFHTDGGSL 272
Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
TY MV LD+S+G VI AL + G L +++V F +DNG F + P G KS
Sbjct: 273 ATYKTMVEDLDQSIGQVIKALERTGQLRDTVVFFASDNGGERF------SNVWPFTGAKS 326
Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG-IEINDTSLDGVNQWDV 369
+GG+R + P + + ++VS E +DW T AG D LDG + +
Sbjct: 327 GLNEGGIRVPTILSWPGILRGRQVSDEPVVTTDWTATFLELAGATPAEDYPLDGASLVPL 386
Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA--GTDN 411
L G + ++ + + ALR LKY+ GTD+
Sbjct: 387 LFGGRPRETQDLFWRM-------RGQRALRRGSLKYLQLDGTDH 423
>gi|417300776|ref|ZP_12087967.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327542927|gb|EGF29380.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 450
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 174/359 (48%), Gaps = 37/359 (10%)
Query: 10 ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
+L C +L A K +I++ L DD G+ D+ GS + TPNID L G+
Sbjct: 17 SLICAVLSAWPTSVVEATDKLNIVVFLTDDQGYGDLGCFGSDSLETPNIDRLCKQGMKFT 76
Query: 70 QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
YV C+P+R A MTG + G VI + G+ E P+ LK AGY T +
Sbjct: 77 DFYVHQRCSPTRLAFMTGSHAHRAGCTK-VIYNRDRIGIHAGEVTTPELLKTAGYTTGIV 135
Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ----VD 185
GKWHLG + + + PT GFD YG+ L QG + N++ +
Sbjct: 136 GKWHLGEW-DAFNPTRHGFDFFYGFMIDLD--------------QGTGIYRNLKRIESIK 180
Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
+KT G +S L +AAI I E NK +P FLY A H P+ P+E +F
Sbjct: 181 HKTDGKHSPKLL-DAAIGFIKE-NKERPFFLYYASPLPHT-----PW-IPNE---RFKGT 229
Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
S R TY ++ +D VG ++ AL KH + EN++ +F +DNG P GI N G P
Sbjct: 230 S--RRSTYGDVIREIDWQVGELMNALDKHEIAENTLFIFASDNG-PVLGI--NGGDAGPY 284
Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDG 363
R K T ++GG+R + P + +S++ I D LPT CA A ++ D +DG
Sbjct: 285 RDGKWTDFEGGIRVPCIMRWPGTIEPGSTNSQITGIIDLLPTFCAMAEVDPPGDRVIDG 343
>gi|87309613|ref|ZP_01091747.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
gi|87287377|gb|EAQ79277.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
Length = 496
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 131/439 (29%), Positives = 201/439 (45%), Gaps = 62/439 (14%)
Query: 23 NTTAPKK-PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
N+ PK+ P+I+ L DDLGW D+ +GSS TPN+D LA +G+ Y +C+P+
Sbjct: 35 NSAEPKRTPNIVFFLVDDLGWKDIGVYGSSFYHTPNVDGLAASGMRFTNAYAACQVCSPT 94
Query: 81 RSALMTGKYPIHIGM------------QHGVILEGEPWG--LPLTEKLLPQYLKEAGYAT 126
R+++MTGKYP +G+ + L P+ L L E L + LK+ GYAT
Sbjct: 95 RASIMTGKYPQRVGITDYIGAAQPDKWKRNTPLLPAPYQTRLALEETTLAEALKQRGYAT 154
Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
GKWHLG E P +GFD + G G+ + K F PY N ++ +
Sbjct: 155 FFAGKWHLG--PEGNWPEDQGFDVNIG---GIDRGGPYGGKKYFSPY------GNPRLTD 203
Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQAPDEEVAKFLDI 245
G + D + I +H + +P YL+ +VH E + +E+ + +
Sbjct: 204 GPDGEHLPDRLASETVKFIEQH-QDQPFLAYLSFYSVHTPLMAREDLKQKYDEIKQRIRF 262
Query: 246 SDP--------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
+ P E YAGMV +D +VG V+ AL + + +N++V+F +DNG
Sbjct: 263 AGPIWGEEGKSKLRLVQEHSVYAGMVEAMDAAVGKVLDALDRLKLTDNTLVIFTSDNG-- 320
Query: 292 SFGIHSNKG---SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
G+ +++G SN PLRG K ++GG+R + P + S L D+LPT+
Sbjct: 321 --GLSTSEGHPTSNLPLRGGKGWMYEGGIREPLVVRYPGVTSPGSESDALVTSPDFLPTI 378
Query: 349 CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY-YAALRVDDLKYVA 407
A + DGV+ L +G R I + + N AA+R D K +
Sbjct: 379 LAVVDKPGDKIDTDGVSIISAL-EGKPLDRGPIFWHYPHYGNQGGSPTAAVREGDWKLI- 436
Query: 408 GTDNNGQSDEWYGDTDNEI 426
EWY D E+
Sbjct: 437 ---------EWYEDGKVEL 446
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.420
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,071,010,655
Number of Sequences: 23463169
Number of extensions: 448464816
Number of successful extensions: 1007133
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5213
Number of HSP's successfully gapped in prelim test: 4571
Number of HSP's that attempted gapping in prelim test: 967143
Number of HSP's gapped (non-prelim): 20612
length of query: 593
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 445
effective length of database: 8,886,646,355
effective search space: 3954557627975
effective search space used: 3954557627975
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)