BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy1575
(242 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 599
Score = 273 bits (698), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 129/222 (58%), Positives = 159/222 (71%), Gaps = 7/222 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
GWNDVGFHG IPTPNIDALAYNG++LNRHY PTCTPSRAA LTGKYP RYG+ P+
Sbjct: 45 GWNDVGFHGSIQIPTPNIDALAYNGVILNRHYVQPTCTPSRAALLTGKYPIRYGLQGFPI 104
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AGV A+P+ EK+LPQYLK+LGYSTHL+GKWH+G NK + P RGFD+H GYWNG+++
Sbjct: 105 IAGVPLALPLNEKILPQYLKDLGYSTHLVGKWHLGANKNQHTPIKRGFDSHFGYWNGFIS 164
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPLFLQITH 200
Y +S H T VG DARR ER +M +Y TD FTD++ VIK NH +P+FL ++H
Sbjct: 165 YRNSTHSTGLMVGKDARRGFERAGDEMVDRYATDIFTDEANKVIKLCKNHDKPMFLMVSH 224
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVHTG G +L+V + ND F +I N +RRL+A
Sbjct: 225 LAVHTGVPG-----PNILEVSNKTHNDIRFDYIENKERRLYA 261
>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 575
Score = 251 bits (640), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 121/223 (54%), Positives = 153/223 (68%), Gaps = 8/223 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
GWNDVGFHG IPTPNIDALAYNG +LNRHY PTCTPSRAA LTGKYP RYG+ P +
Sbjct: 44 GWNDVGFHGSIQIPTPNIDALAYNGAILNRHYVQPTCTPSRAALLTGKYPIRYGLQGPPI 103
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+G A A+P EK+LPQYLKELGYSTHL+GKWH+G ++ P RGFD+H GYWNGY++
Sbjct: 104 ASGKASALPTNEKILPQYLKELGYSTHLVGKWHLGHYQKRFTPTKRGFDSHFGYWNGYIS 163
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMS-SKYLTDFFTDQSVHVIKSHNHSR-PLFLQIT 199
Y +S H T G+DARR ER +M +Y TD FT+++ +I+S +FL ++
Sbjct: 164 YRNSTHATRTMSGIDARRGFERAGNEMDRDRYATDVFTEEARKIIESSKRENTEMFLMVS 223
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H AVH+G +G L+V + ND F +I N +RRL+A
Sbjct: 224 HLAVHSGNSG-----PNHLEVLNKTYNDEAFGYIENENRRLYA 261
>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
Length = 563
Score = 222 bits (565), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 110/221 (49%), Positives = 148/221 (66%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG N IPTPNID +A++G+ L+ +Y P CTPSRAA +TGKYP G+ T +
Sbjct: 60 GWNDVGFHGSNQIPTPNIDIMAWSGVSLHNYYVTPICTPSRAALMTGKYPIHTGMQHTVI 119
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+TEK+LPQYLKELGY THL+GKWH+G K+E LP NRGFD+H+G+WNG +
Sbjct: 120 FAAEPRGLPLTEKILPQYLKELGYKTHLVGKWHLGSYKKEYLPLNRGFDSHLGFWNGKID 179
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
D ++ G D RR+ A + +Y TD +T+++V +IKSHN S PLFL ++H+
Sbjct: 180 MYDHTNQEKGYWGFDFRRDFST-AHDLFGQYATDVYTNEAVKIIKSHNTSSPLFLMLSHS 238
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVHTG P+ ++ P E+ F HI + RR FA
Sbjct: 239 AVHTGN------PSEPIRAP--EKLFVNFTHIQDFQRRKFA 271
>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
Length = 563
Score = 216 bits (550), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 104/222 (46%), Positives = 146/222 (65%), Gaps = 8/222 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG + IPTPNIDALAYNG++L RHY LP CTPSR AFLTG+YP R G+ P+
Sbjct: 50 GWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AG +A+P+ LLP+YL++LGY+THL+GKWH+G + P RGFD +GY++GY+T
Sbjct: 110 KAGEERAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGYIT 169
Query: 142 YNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y E + VG D ++ + + S +Y+TD T+++ +I +HN S+PL+LQ++H
Sbjct: 170 YFKHTIEQNLHVGYDLHYDVAGNLSVKYSHEYMTDLITERAEDIIFNHNRSKPLYLQLSH 229
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H+ A ++V D EE + T +I + DRR A
Sbjct: 230 VAAHSSDA------KANMEVRDEEETNATLGYIEDFDRRKLA 265
>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
Length = 533
Score = 215 bits (548), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 149/221 (67%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+HG ++IPTPNIDALAYNGI+LNR+Y LP CTPSR+A +TG++P G+ V
Sbjct: 21 GWNDVGYHGSDEIPTPNIDALAYNGIILNRYYVLPVCTPSRSALMTGRHPIHNGMQHRVL 80
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
GV + +P+TEKLLP+YL++LGYSTH++GKWH+G K+E P RGF++H+G+W G+
Sbjct: 81 FGVETRGLPLTEKLLPEYLQKLGYSTHIVGKWHLGFYKKEYTPLYRGFESHIGFWTGHQD 140
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D E + GLD R M + A + +Y T +T +SV +IK++N ++PLFL + HA
Sbjct: 141 YYDHTAEEERLWGLDMRHGM-KPAWYLHGEYSTHVYTRESVKIIKNYNSTKPLFLYVAHA 199
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+G N L PD + HI N +RR +A
Sbjct: 200 AVHSGNKYNP------LPAPDKTVD--KLDHIQNYNRRRYA 232
>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 546
Score = 214 bits (546), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 102/222 (45%), Positives = 149/222 (67%), Gaps = 8/222 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG N IPTPNIDALAYNG++L RHY LP CTPSR AFLTG+YP R G+ P+
Sbjct: 35 GWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AG +A+P+ LLP+YL++LGY+THL+GKWH+G + P RGFD GY+NGY++
Sbjct: 95 KAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYNGYIS 154
Query: 142 YNDSIHETDFAVGLDAR-RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y + + + VG D N + + + +Y+TD T+++ ++IK+H+ +PL+LQ++H
Sbjct: 155 YFNHTIKQNNHVGYDLHYHNSKNLSVAYNFEYITDLITERAENIIKNHDRRKPLYLQLSH 214
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ A +++V D +E + T +I + +RR +A
Sbjct: 215 LAVHSSDAKE------VMEVRDEQETNATLEYIEDYNRRKYA 250
>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 567
Score = 214 bits (546), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 109/226 (48%), Positives = 145/226 (64%), Gaps = 15/226 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWND+ FHG ++IPTPNIDALA+NGIVLN YT P CTPSR A +TGKYP + G+ P
Sbjct: 39 GWNDLSFHGSDEIPTPNIDALAFNGIVLNNLYTQPVCTPSRVALMTGKYPIKLGMQGPPT 98
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P++EKLLP+YL+ELGY+T IGKWH+G K+ P RGFD+H GY+ GY++
Sbjct: 99 YGAEPNGLPLSEKLLPEYLRELGYTTRAIGKWHLGFYKQAYTPTRRGFDSHFGYYTGYVS 158
Query: 142 YNDSIHETDFA-----VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y D + + + G D RRN + A + KY TD FTD++V +IK ++PLF+
Sbjct: 159 YYDYLLQDVYQNFGEFQGFDMRRN-DTIAWDVVGKYATDVFTDEAVRLIKEQPANQPLFM 217
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H AVHTG G L+ P E N F HI +P+RR++A
Sbjct: 218 YLAHVAVHTGNRGK------YLEAPQSEVNK--FNHILDPNRRIYA 255
>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 610
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 109/228 (47%), Positives = 147/228 (64%), Gaps = 18/228 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+ FHG + I TPN+DALAYNG++LN Y LP CTPSRAA +TG YP G+ P+
Sbjct: 52 GWNDLSFHGSDQIQTPNLDALAYNGVILNSQYVLPVCTPSRAALMTGMYPIHNGMQGLPL 111
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +A+P KLLP YLK+LGY+T ++GKWH+G ++E P RGFD+H+GYWNG ++
Sbjct: 112 EASEPRALPAG-KLLPSYLKDLGYTTRMVGKWHLGYYQKEFTPTYRGFDSHLGYWNGIVS 170
Query: 142 YNDSIHETD-------FAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y D I + D G D RRN+ A + +Y T+ FTD++VH+I+SHN + PL
Sbjct: 171 YYDYILQEDDNRKPRSSLNGFDMRRNITP-AYDLQGRYATEMFTDEAVHLIRSHNKNTPL 229
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
FL ++H AVH G G L+ P +E F HI++P+RR FA
Sbjct: 230 FLYMSHLAVHAGNPGK------FLEAP--QEAINKFLHIADPNRRTFA 269
>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
Length = 563
Score = 214 bits (544), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 102/222 (45%), Positives = 146/222 (65%), Gaps = 8/222 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
GWNDV FHG + IPTPNIDALAYNG++L RHY LP CTPSR AFLTG+YP R G+ P+
Sbjct: 50 GWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGHPL 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +A+P+ LLP+YL++LGY+THL+GKWH+G + P RGFD +GY++G++T
Sbjct: 110 DPGEVRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGFMT 169
Query: 142 YNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y + E + VG D ++ + + S +Y+TD T+++ +I +HNHS+PL+LQ++H
Sbjct: 170 YFNHTIEQNHHVGYDLHYDVAGNLSVKYSHEYMTDLITERAEDIILNHNHSKPLYLQLSH 229
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H+ ++V D EE + T +I + DRR A
Sbjct: 230 IAAHSSNINKT------VEVRDEEETNATLGYIEDFDRRKLA 265
>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
Length = 528
Score = 212 bits (540), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 111/235 (47%), Positives = 149/235 (63%), Gaps = 10/235 (4%)
Query: 9 VAKAVPVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLT 68
V + + + L GWNDVGFHG IPTPNIDALAY+G++L+R+Y P CTPSR+A +T
Sbjct: 7 VQRTLSLILLLAVHGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMT 66
Query: 69 GKYPFRYGIDTPVGAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR 127
GKYP G+ V G + +P+ EKLLP+YL+ELGY+TH++GKWH+G K+E P R
Sbjct: 67 GKYPIHTGMQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVGKWHLGFYKKEYTPTYR 126
Query: 128 GFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS 187
GFD H+G+W G+ Y D + GLD RR M+ A + +Y TD FT ++V +I +
Sbjct: 127 GFDTHIGFWTGHHDYFDHTAVENPYWGLDIRRGMQP-AWDLHGQYSTDIFTKEAVRLIDN 185
Query: 188 HNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
HN SRP+FL + HAAVH+G P L PD E F +I + +RR FA
Sbjct: 186 HNSSRPMFLYLAHAAVHSGN------PYNPLPAPD--EEVAKFNNIFDYNRRRFA 232
>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 571
Score = 212 bits (540), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 148/223 (66%), Gaps = 8/223 (3%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
GWNDVGFHG N+IPTPNIDALAY G++LNRHY LPTCTPSR AFLTG++P R G+ P
Sbjct: 36 MGWNDVGFHGSNEIPTPNIDALAYGGVILNRHYALPTCTPSRTAFLTGRHPIRMGLQGIP 95
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ + VP+ E+LLP+YL+ELGY T L+GKWH+G ++ P RGFD+ VGY+ G +
Sbjct: 96 MNVAEPRGVPLHERLLPEYLRELGYVTRLVGKWHLGYYTDKHTPTRRGFDSFVGYYGGVI 155
Query: 141 TYNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
TY + D G+D + + P + +Y+TDF +DQ+ VIK+H+ +PLFLQ+
Sbjct: 156 TYFNHTVTKDKHTGIDYHWDTSGKIEPFDNDQYVTDFISDQAEAVIKNHDRKKPLFLQLA 215
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H A H A + P ++V +M E + T ++I + +RR +A
Sbjct: 216 HVAAH---ASENRDP---IEVRNMTEVNDTLSYIPDINRRKYA 252
>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 562
Score = 210 bits (535), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 100/222 (45%), Positives = 146/222 (65%), Gaps = 8/222 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG N IPTPNIDALAYNG++L RHY LP CTPSR AFLTG+YP R G+ P+
Sbjct: 50 GWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AG +A+P+ LLP+YL++LGY+THL+GKWH+G + P RGFD GY++GY++
Sbjct: 110 KAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYSGYIS 169
Query: 142 YNDSIHETDFAVGLDAR-RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y + + D +G D N + + + +Y TD T+++ ++IK+H+ +PL+LQ+ H
Sbjct: 170 YFNHTIKQDDHIGYDLHYDNSKNLSIDYNFEYTTDLITERAENIIKNHDRRKPLYLQLCH 229
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H+ A +++V D +E + T +I + +RR +A
Sbjct: 230 LAAHSSDAKE------VMEVRDEQETNATLKYIEDYNRRKYA 265
>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
Length = 519
Score = 210 bits (534), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 108/229 (47%), Positives = 146/229 (63%), Gaps = 10/229 (4%)
Query: 15 VTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR 74
+ + + +GWNDVGFHG IPTPNIDALAY+G++L+R+Y CTPSR+A +TGKYP
Sbjct: 2 IWQVVFFEGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTSICTPSRSALMTGKYPIH 61
Query: 75 YGIDTPVGAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G+ + G + +P+ EK+LP+YL+ELGYSTH++GKWH+G K E P RGFD H+
Sbjct: 62 TGMQHSILKGAEPRGLPLHEKILPEYLRELGYSTHIVGKWHLGFYKREYTPTYRGFDTHI 121
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
GYW G+ Y D + GLD RR M + A + +Y TD FT ++V +I +HN SRP
Sbjct: 122 GYWTGHHDYYDHTAVENPYWGLDMRRGM-KPAWDLHGEYSTDVFTKEAVKLINNHNSSRP 180
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+FL + HAAVH+G P L PD E F +I + +RR FA
Sbjct: 181 MFLYLAHAAVHSGN------PYNPLPAPD--EEVAKFNNIFDYNRRRFA 221
>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 545
Score = 210 bits (534), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 100/189 (52%), Positives = 131/189 (69%), Gaps = 2/189 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG IPTPNIDALAY+G++LNR+Y P CTPSR+A +TGKYP G+ V
Sbjct: 39 GWNDVGFHGSGQIPTPNIDALAYSGLILNRYYVSPICTPSRSALMTGKYPIHTGMQRGVL 98
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ EKLLP+YL+ELGY TH++GKWH+G +E P RGF++H+GYW G+
Sbjct: 99 KGAEPRGLPLKEKLLPEYLRELGYRTHIVGKWHLGFYTKEYTPTYRGFESHLGYWTGHQD 158
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + G+D RRNME A + +Y TD FT ++V +IKSHN S+P+FL + HA
Sbjct: 159 YYDHSAVEEPYWGMDMRRNMEP-AWDLHGQYSTDVFTKEAVKLIKSHNASQPMFLYLAHA 217
Query: 202 AVHTGTAGN 210
AVH+ N
Sbjct: 218 AVHSANPYN 226
>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 545
Score = 209 bits (531), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 105/221 (47%), Positives = 144/221 (65%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDVGFHG IPTPNIDALAY+G++L+R+Y P CTPSR+A +TGK+P G+ V
Sbjct: 38 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKHPIHTGMQHGVL 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ EKLLPQYL+ELGYSTH++GKWH+G +E P RGFD+H+G+W+G+
Sbjct: 98 KCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYTPMYRGFDSHIGFWSGHHD 157
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D GLD RR + A + +Y TD FT ++V +I HN SRP+FL ++HA
Sbjct: 158 YFDHSAVESPYWGLDMRRGLNS-AWDLHGQYSTDIFTKEAVKLINDHNASRPMFLYLSHA 216
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+G + N +P +++ F +I N +RR FA
Sbjct: 217 AVHSGNSYNP--------LPAPDQDVAKFTNIFNYERRRFA 249
>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
Length = 545
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 108/221 (48%), Positives = 144/221 (65%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDVGFHG IPTPNIDALAY+G++L+R+Y P CTPSR+A +TGK+P G+ V
Sbjct: 38 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKHPIHTGMQHGVL 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ EKLLPQYL+ELGYSTH++GKWH+G +E P RGFD+H+G+W+G+
Sbjct: 98 KCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYTPMYRGFDSHIGFWSGHHD 157
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D GLD RR + A + +Y TD FT ++V +I HN SRP+FL + HA
Sbjct: 158 YFDHSAVESPYWGLDMRRGLNS-AWDLHGQYSTDIFTKEAVKLINDHNASRPMFLYLPHA 216
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+G + N L VPD ++ F +I N +RR FA
Sbjct: 217 AVHSGNSYNP------LPVPD--QDVAKFTNIFNYERRRFA 249
>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 106/224 (47%), Positives = 147/224 (65%), Gaps = 10/224 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDV FHG ++IPTPNIDALAYNG++LNRHY LP CTPSR AFLTGKYP R G+ V
Sbjct: 25 GWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPLCTPSRTAFLTGKYPIRTGMQGYVL 84
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ + LLP+YL++LGY+THL+GKWH+G + + P RGFD +GY+NGY+
Sbjct: 85 QPAEPRGIPLNDTLLPEYLRKLGYATHLVGKWHVGYHTKNYTPTRRGFDTFLGYYNGYIH 144
Query: 142 Y-NDSI-HETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y N +I E +G D R + E + Y+TD TD+ ++I SHN ++PL+LQ+
Sbjct: 145 YFNHTILDEEQKYLGYDFHRIVGENRTIEYRYDYITDIITDEVENIIFSHNPAKPLYLQV 204
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+H A H+G G +QV D +E + T +I + +RR +A
Sbjct: 205 SHDAAHSGGIGIE------MQVRDWKETNATLGYIEDINRRKYA 242
>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
Length = 544
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 107/221 (48%), Positives = 143/221 (64%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDVGFHG IPTPNIDALAY+G++L+R+Y P CTPSR+A +TGKYP G+ V
Sbjct: 37 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTPSRSALMTGKYPIHTGMQHGVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ EKLLP+YLKELGY TH++GKWH+G ++ P RGFD+H+G+W+G+
Sbjct: 97 KCAEPRGLPLQEKLLPEYLKELGYRTHIVGKWHLGFYTKQYTPTYRGFDSHIGFWSGHQD 156
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D GLD RR ME A + +Y TD FT ++V +I +HN S+PLFL + HA
Sbjct: 157 YFDHTAVESPYWGLDMRRGMEA-AWDLHGQYSTDVFTSEAVKLINNHNDSKPLFLYLAHA 215
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+G P L PD++ F +I + +RR FA
Sbjct: 216 AVHSGN------PYDPLPAPDVDV--AKFTNIFDYNRRRFA 248
>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
Length = 532
Score = 206 bits (524), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 107/222 (48%), Positives = 142/222 (63%), Gaps = 10/222 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
+GWNDVGFHG IPTPNIDALAY+G++L+R+Y P CTPSR+A +TGKYP G+ V
Sbjct: 25 EGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIHIGMQHGV 84
Query: 82 GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G + +P+ EK+LP+YL++LGYSTH++GKWH+G +E P RGF +H G+W G+
Sbjct: 85 LKGAEPRGLPLHEKILPEYLRDLGYSTHIVGKWHLGFYTKEYTPTYRGFASHTGFWTGHQ 144
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D GLD RR+ME A + +Y TD FT +++ +I HN SRPLFL + H
Sbjct: 145 DYFDHTAVESPYWGLDMRRDMEP-AWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLYLAH 203
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AAVH+G P L PD E F +I + +RR FA
Sbjct: 204 AAVHSGN------PYNPLPAPD--EEVAKFDNIFDYNRRRFA 237
>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 206 bits (523), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 100/223 (44%), Positives = 148/223 (66%), Gaps = 10/223 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG ++IPTPNIDALAYNG++LNRHY LP CTPSR AFLTGKYP R G+ P+
Sbjct: 15 GWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPICTPSRTAFLTGKYPIRTGMQGYPL 74
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + + LLP+YL++LGY+THL+GKWH+G + P +RGFD GY+NGY+
Sbjct: 75 QGAEPRGILLNNILLPEYLQKLGYATHLVGKWHVGYHTRNYGPTHRGFDTFAGYYNGYIQ 134
Query: 142 Y-NDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y N +++E++ +G D R + + + + Y+TD TD++ ++I SHN ++PL+LQ+
Sbjct: 135 YFNHTLYESE-QLGYDLHRIIGDDHKIEYRYDYMTDLITDEAENIISSHNPAKPLYLQVA 193
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H A H+ A ++V + +E + T +I + +RR +A
Sbjct: 194 HLAAHSSDAEEE------MEVRNWKETNATLGYIEDINRRKYA 230
>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
Length = 543
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 139/221 (62%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG IPTPNIDALAY+G++L +Y P CTPSR+A +TGKYP G+ V
Sbjct: 34 GWNDVGFHGSGQIPTPNIDALAYSGLILQNYYVTPICTPSRSALMTGKYPIHTGMQHTVL 93
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+TEK+LP+YL+ELGY+ L+GKWH+G +E P RGFD+H+GYW G+
Sbjct: 94 FGAEPRGLPLTEKILPEYLRELGYTNRLVGKWHLGSYTKEYTPLYRGFDSHLGYWTGHQD 153
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + G D RRNM+ A + +Y TD FT ++V +I++HN + PLFL + H
Sbjct: 154 YYDHTAVENPGWGFDMRRNMD-LAYDLHGQYSTDVFTQEAVKIIENHNTTNPLFLYLAHV 212
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ P L PD E F++I + R+ FA
Sbjct: 213 AVHSAN------PYNPLPAPD--ETVEKFSNIPSYKRQRFA 245
>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 543
Score = 204 bits (519), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 144/221 (65%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDVGFHG IPTPNIDALAY+G++L+R+Y P CTPSR+A +TGK+P G+ V
Sbjct: 36 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTPSRSALMTGKHPIHTGMQHGVL 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ EKLLP+Y ++LGYSTH++GKWH+G +E P RGFD+H+G+W+G+
Sbjct: 96 KCAEPRGLPLHEKLLPEYFRDLGYSTHIVGKWHLGFYTKEYTPMYRGFDSHIGFWSGHHD 155
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + GLD RR +E A + +Y TD FT ++V +I +HN SRP+FL + HA
Sbjct: 156 YFDHSAVEEPYWGLDMRRGLEP-AWDLHGQYSTDVFTKEAVKLIDNHNTSRPMFLYLAHA 214
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+G N +P +++ F +I N +RR FA
Sbjct: 215 AVHSGNPYNP--------LPAHDQDVAKFTNIFNYNRRRFA 247
>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
Length = 513
Score = 203 bits (517), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 106/225 (47%), Positives = 140/225 (62%), Gaps = 11/225 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G NDVGF+G IPTP+IDALAYNGI+L+R YT +CTPSRAA LTG+YP R G+ P+
Sbjct: 6 GRNDVGFYGSGQIPTPSIDALAYNGIILDRFYTQCSCTPSRAALLTGQYPIRLGMQGLPI 65
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AG K++P+ +PQYLK LGY THL+GKWH+G E P RGFD+H GYWNG++
Sbjct: 66 RAGENKSLPLDVVTMPQYLKRLGYKTHLVGKWHLGYAHIEDTPLQRGFDSHFGYWNGFVG 125
Query: 142 YND--SIHETDFAVGLDARRNMERYAP--QMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y + +++E + + P Q KY TD FT +++ +I HN +PLFL
Sbjct: 126 YFNYTAVYELANDTMVKGFDLFDGVVPAWQERGKYATDLFTHKAMKIIDEHNSEKPLFLV 185
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H A HTG G L VPD+ + + F+ I NP RRL+A
Sbjct: 186 LAHLAGHTGEDGVE------LGVPDVAQAETRFSFIKNPKRRLYA 224
>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
Length = 559
Score = 202 bits (515), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 113/249 (45%), Positives = 151/249 (60%), Gaps = 27/249 (10%)
Query: 6 GAGVAKAVPVTEKLL--PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSR 63
G G A A P +L G+NDV FHG N I TPNIDALAYNGI+LNRHY CTPSR
Sbjct: 20 GEGEASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSR 79
Query: 64 AAFLTGKYPFRYGI-------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG 116
A LTGKYP G+ D P G +P E+L+P+ ++ GY+THL+GKWH+G
Sbjct: 80 ATLLTGKYPIHTGMQHFVIVTDEPWG------LPRQERLMPELFRDAGYATHLVGKWHLG 133
Query: 117 CNKEELLPFNRGFDNHVGYWNGYLTYNDS---IHETDFAVGLDARRNMERYAPQMSSKYL 173
+++L P RGFD+H GY+NGY+ Y D + + +++ GLD RR++E + Y
Sbjct: 134 FWRKDLTPTMRGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEP-CREAEGTYA 192
Query: 174 TDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
T+ FT ++ VI+ H+ SRPLF+ ++H AVHTG N +Q P EE FAHI
Sbjct: 193 TEAFTTEARKVIERHDKSRPLFMVLSHLAVHTGNEDNP------MQAP--EEEVAKFAHI 244
Query: 234 SNPDRRLFA 242
+P RR +A
Sbjct: 245 RDPKRRTYA 253
>gi|350400025|ref|XP_003485710.1| PREDICTED: arylsulfatase B-like, partial [Bombus impatiens]
Length = 301
Score = 202 bits (515), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 108/226 (47%), Positives = 139/226 (61%), Gaps = 15/226 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV FHG + IPTPNIDALAYNGI+LN HY CTPSR+A +TGK P G+ V
Sbjct: 85 GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 144
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P++EKLLPQYL+E+GY TH +GKWH+G K++ P RGFD+H GYWNG
Sbjct: 145 YPTEPRGLPLSEKLLPQYLQEIGYKTHAVGKWHLGYFKKQYTPTYRGFDSHFGYWNGLED 204
Query: 142 YNDSI-HETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y I E D G D RRN+ A + KY TD FT+++V +I H+ RP+FL
Sbjct: 205 YYTHIAQEPDSQYNEYKGFDMRRNLT-VAWDTAGKYATDLFTNEAVRLINEHDTERPMFL 263
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H AVH G LL+ PD E F++I +P+RR+ A
Sbjct: 264 YLAHLAVHKGNENQ------LLRAPD--EEIAKFSYILDPERRIQA 301
>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 646
Score = 202 bits (513), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 109/223 (48%), Positives = 141/223 (63%), Gaps = 14/223 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+NDVGFHG N+IPTPNIDALAYNG++LN HYT CTPSR+AFLTGKYP G+ V
Sbjct: 34 GFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQALCTPSRSAFLTGKYPIHLGMQHLVI 93
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E +LPQYLK GY+TH IGKWH+G ++E P RGFD+H GYW G
Sbjct: 94 LEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPTYRGFDSHFGYWQGLQD 153
Query: 142 -YNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++H T G D RRNM ++ Q KY T FTD++V +I+ HN P+F+ +
Sbjct: 154 YYKHTVHFTP-EHGYDMRRNMTVDWSAQ--GKYSTTLFTDEAVRLIREHNTENPMFMYLA 210
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H A H+G + LQ PD E F HI++P+RR++A
Sbjct: 211 HLAPHSGNDDDP------LQAPD--EEIAKFGHIADPERRIYA 245
>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 713
Score = 202 bits (513), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 107/226 (47%), Positives = 138/226 (61%), Gaps = 15/226 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV FHG + IPTPNIDALAYNGI+LN HY CTPSR+A +TGK P G+ V
Sbjct: 83 GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 142
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P++EKLLP+YL+E+GY TH +GKWH+G K+E P RGFD+H GYWNG
Sbjct: 143 FPTEPRGLPLSEKLLPEYLREIGYKTHAVGKWHLGYFKKEYTPTYRGFDSHFGYWNGLQD 202
Query: 142 YNDSI-HETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y I E D A G D RRN+ A KY TD FT++++ +I H+ RP+FL
Sbjct: 203 YYTHITQEPDPAFSEFKGFDMRRNLT-VAWDTVGKYSTDLFTNEAIRLINEHDTDRPMFL 261
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H AVH G L + PD E F++I +P+RR+ A
Sbjct: 262 YLAHLAVHKGNEEQ------LFRAPD--EEIAKFSYILDPERRIQA 299
>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
Length = 546
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/237 (45%), Positives = 149/237 (62%), Gaps = 12/237 (5%)
Query: 9 VAKAVPVTEKLLPQ--GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAF 66
VA A P +L GWNDVGFHG + IPTPNIDALAY G++L+R+Y P CTPSR+A
Sbjct: 23 VASARPHIVFILADDLGWNDVGFHGLSQIPTPNIDALAYTGLLLDRYYVSPICTPSRSAL 82
Query: 67 LTGKYPFRYGIDTPV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPF 125
+TGK+P G+ V + +P+ EKLLP+YL+ LGYSTH++GKWH+G +E P
Sbjct: 83 MTGKHPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLRNLGYSTHMVGKWHLGFYTKEYTPT 142
Query: 126 NRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
RGFD+H+G+W+G+ Y D + GLD RR +E A + +Y TD FT ++V +I
Sbjct: 143 YRGFDSHLGFWSGHHDYFDHTAVEEPYWGLDMRRGLEP-AWDLHGQYSTDVFTKEAVRLI 201
Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+HN SRP+FL ++HAAVH+G N +P + + F I + +RR FA
Sbjct: 202 DNHNTSRPMFLYLSHAAVHSGNPYNP--------LPAHDHDVAKFPKILDYNRRRFA 250
>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
Length = 542
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 106/230 (46%), Positives = 144/230 (62%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G NDV FHG N I TPNIDALAYNG++LN+HY CTPSRA LTGKYP G+
Sbjct: 2 GMNDVSFHGSNQILTPNIDALAYNGVLLNKHYVPNLCTPSRATLLTGKYPIHTGMQHWVI 61
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P E+L+P+ +E GYSTHL+GKWH+G +++L P RGFD+H GY
Sbjct: 62 ITDEPWG------LPKKERLMPELFREAGYSTHLVGKWHLGFWRQDLTPTMRGFDHHYGY 115
Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+NGY+ Y D + T+++ GLD RR+ E P+ + Y T+ FT ++ +I+ H+ S+
Sbjct: 116 YNGYIDYYDHQVRLLGTNYSAGLDFRRDFEP-NPKANGTYATEAFTSEAKRIIEEHDKSK 174
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H AVHTG N +Q P EE F+HI +P RR +A
Sbjct: 175 PLFMVLSHLAVHTGNEDNP------MQAP--EEEVAKFSHIKDPKRRTYA 216
>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
Length = 648
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 108/224 (48%), Positives = 138/224 (61%), Gaps = 14/224 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+NDVGFHG N+IPTPNIDALAYNG++LN HYT CTPSR+AFLTGKYP G+ V
Sbjct: 34 GFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQALCTPSRSAFLTGKYPIHLGMQHLVI 93
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E +LPQYLK GY+TH IGKWH+G ++E P RGFD+H GYW G
Sbjct: 94 LEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPTYRGFDSHFGYWQGLQD 153
Query: 142 YNDSIHETDFAV--GLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y F G D RRNM ++ Q KY T FTD++V +I+ HN P+F+ +
Sbjct: 154 YYKHTVHATFTPEHGYDMRRNMTVDWSAQ--GKYSTTLFTDEAVRLIREHNTENPMFMYL 211
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H A H+G + LQ PD E F HI++P+RR++A
Sbjct: 212 AHLAPHSGNDDDP------LQAPD--EEIAKFGHIADPERRIYA 247
>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 575
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 143/221 (64%), Gaps = 13/221 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
GWNDVGFHG N IPTPNIDAL YNGI+LNRHY LP+ TPSR+AFLTG YP R G+ +
Sbjct: 38 GWNDVGFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRSAFLTGLYPIRIGMQGDGI 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ K+LP++L++LGY+T LIGKWH+G + + P +RGFD +G++N +++
Sbjct: 98 RGGEPRGLPLDIKILPEHLRDLGYTTKLIGKWHMGYHTPQYTPLHRGFDFFLGFYNSHIS 157
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + G D R + A ++ +Y TD FT ++V +I++H RPL+LQI+H
Sbjct: 158 YYDYHYSNQNMSGYDLHRG-DDPAHGINREYATDLFTKEAVRMIETHELPRPLYLQISHL 216
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH L+ P E ND F++I P+RR +A
Sbjct: 217 AVHAP-----------LEQPRDEYNDGRFSYIREPNRRKYA 246
>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
Length = 577
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 112/249 (44%), Positives = 151/249 (60%), Gaps = 27/249 (10%)
Query: 6 GAGVAKAVPVTEKLL--PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSR 63
G G A A P +L G+NDV FHG N I TPNIDALAYNGI+LNRHY CTPSR
Sbjct: 20 GEGEASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSR 79
Query: 64 AAFLTGKYPFRYGI-------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG 116
A LTGKYP G+ D P G +P E+L+P+ ++ GY+THL+GKWH+G
Sbjct: 80 ATLLTGKYPIHTGMQHFVIVTDEPWG------LPRQERLMPELFRDAGYATHLVGKWHLG 133
Query: 117 CNKEELLPFNRGFDNHVGYWNGYLTYNDS---IHETDFAVGLDARRNMERYAPQMSSKYL 173
+++L P RGFD+H GY+NGY+ Y D + + +++ GLD RR++E + Y
Sbjct: 134 FWRKDLTPTMRGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEP-CREAEGTYA 192
Query: 174 TDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
T+ FT ++ VI+ H+ +RPLF+ ++H AVHTG N +Q P EE FAHI
Sbjct: 193 TEAFTTEARKVIERHDKNRPLFMVLSHLAVHTGNEDNP------MQAP--EEEVAKFAHI 244
Query: 234 SNPDRRLFA 242
+P RR +A
Sbjct: 245 RDPKRRTYA 253
>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 579
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 140/221 (63%), Gaps = 13/221 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG N IPTPNIDAL YNGI+LNRHY LP+ TPSR AF TG+YP R G+ +
Sbjct: 42 GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADI 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ K+LP++L+ LGY+T LIGKWH+G + P +RGFD +G++N Y++
Sbjct: 102 RGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYIS 161
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + G D R + A M+ +Y TD FT +++++I++H +RPL+LQ++H
Sbjct: 162 YYDYNYSNQNMSGYDMHRG-DDPAYGMNREYATDMFTSEAINIIENHELNRPLYLQLSHL 220
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ L+ P NDR HI P+RR +A
Sbjct: 221 AVHSP-----------LEQPANVYNDREPIHIREPNRRKYA 250
>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
Length = 557
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 96/222 (43%), Positives = 142/222 (63%), Gaps = 8/222 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV F+G ++IPTPNID+LAYNG++LNRHY LP CTPSR AF TG+YP R G+ P+
Sbjct: 48 GWNDVSFNGGDEIPTPNIDSLAYNGVILNRHYVLPICTPSRTAFFTGQYPIRSGMQGYPL 107
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+++P+ LLPQYL++LGY+THL+GKWH+G P NRGFD +GY++GY+
Sbjct: 108 QGAEPRSIPLNNILLPQYLRKLGYATHLVGKWHVGYQTNNHTPTNRGFDTFLGYYSGYIE 167
Query: 142 YNDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y + G D R++ + + + Y+TD TD++ ++I SHN ++PL+LQ+ H
Sbjct: 168 YFSHNLVENGQSGYDIHRSVGDNHTIEYRYDYMTDLITDEAENIISSHNPAKPLYLQLAH 227
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H T + +++V + + T +I + +RR FA
Sbjct: 228 LAPHASTVDD------VIEVRSWKATNDTLGYIRDINRRKFA 263
>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
Length = 581
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 98/243 (40%), Positives = 155/243 (63%), Gaps = 14/243 (5%)
Query: 10 AKAVPVTEKLL-----PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRA 64
KA+P+ ++ GWNDV FHG N+IPTPNIDALAYNG++LN++YT+P CTPSR+
Sbjct: 28 GKAIPLPPHIVIILADDMGWNDVSFHGANEIPTPNIDALAYNGVILNKYYTMPICTPSRS 87
Query: 65 AFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL 123
A +TG+YP R G+ TP+ + +P+ L+P+ ++ LGY T L+GKWH+G E+
Sbjct: 88 ALMTGRYPIRDGMQGTPMRPAEPRGIPLNVSLMPEQMRRLGYETRLVGKWHLGYTTEDYT 147
Query: 124 PFNRGFDNHVGYWNGYLTYND---SIHETDFAVGLDARRN-MERYAPQMSSKYLTDFFTD 179
P RGFD GY+NG+++Y D ++T+ G D R+ + + SS+Y TD TD
Sbjct: 148 PVRRGFDTFFGYYNGFISYYDYWIGWNDTNEVTGYDLHRDESDSFELAHSSEYFTDLITD 207
Query: 180 QSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
++ +I+++ +++PLFL+I+H AVH G+ K+ L+V ++ + +F +I + R
Sbjct: 208 EAEKIIRNNKNAKPLFLEISHLAVHAGS----KVHDDPLEVRRTDDVNASFPYIEDYQHR 263
Query: 240 LFA 242
+A
Sbjct: 264 KYA 266
>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 710
Score = 200 bits (509), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 107/226 (47%), Positives = 139/226 (61%), Gaps = 15/226 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG + IPTPNIDALAYNG++LN HY CTPSR+A LTGKYP G+ +
Sbjct: 55 GWNDVSFHGSDQIPTPNIDALAYNGVILNSHYVSALCTPSRSALLTGKYPIHTGMQHLVI 114
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ EK+LPQYLKE GY+TH IGKWH G ++ E P RGFD+H GYW G
Sbjct: 115 LEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGFDSHFGYWQGLQD 174
Query: 142 YND----SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
Y S + + +G D RRNM A KY TD FTD++V +I+ H + P+FL
Sbjct: 175 YYTHEVGSSNPKEGFLGFDMRRNMS-LARDTYGKYSTDLFTDEAVRLIEEHRPEAGPMFL 233
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H A H+ GN P LQ PD E F+++ +P+RR++A
Sbjct: 234 YLAHLAPHS---GNDNEP---LQAPD--EEVAKFSYVEDPERRIYA 271
>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
Length = 881
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 114/231 (49%), Positives = 141/231 (61%), Gaps = 27/231 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G NDVGFHG N IPTPNIDALAY+GI+LNRHY+ P CTPSRAA +TG++P G+
Sbjct: 45 GLNDVGFHGSNQIPTPNIDALAYDGIILNRHYSAPMCTPSRAALMTGRHPMNVGMQHYVI 104
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G+ EKLLPQY +E GY THLIGKWH+G E LP NRGFD H+GY
Sbjct: 105 DSDEPWGLGLQ------EKLLPQYFREAGYRTHLIGKWHLGFYAEPYLPTNRGFDTHIGY 158
Query: 136 WNGYLTYNDSIHETDFAV--GLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHS- 191
Y+ Y I + D A G D R+N+ Y P + Y TD+FT+ +V +I+SHN +
Sbjct: 159 LGPYIDYWSYISKMDSATFEGYDLRQNLAVNYKP--NGTYATDYFTEAAVEIIRSHNRTG 216
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ L + H A HT GN P LQ P EE FA+I + DRR +A
Sbjct: 217 ERMLLVLNHLAPHT---GNDDAP---LQAP--EETIEKFAYIRDTDRRTYA 259
>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
Length = 564
Score = 199 bits (507), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 103/221 (46%), Positives = 134/221 (60%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG IPTPN+DALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 50 GWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P++EKLLPQYLK+LGYS H++GKWH+G + P RGFD+H G+W G+
Sbjct: 110 YAMEPRGLPLSEKLLPQYLKDLGYSNHIVGKWHLGHYQLRFTPMQRGFDSHTGFWTGHHH 169
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
ND GLD RR + A + +Y T +++ +++ HN S PLFL + HA
Sbjct: 170 MNDHTAVEHGHWGLDMRRGYD-VAYDLHGQYTTHVLGAEAIAIVQGHNKSSPLFLYVAHA 228
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ P L PD E HI N RR FA
Sbjct: 229 AVHSAN------PYDFLPAPD--ETVANLGHIENYRRRKFA 261
>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
Length = 562
Score = 199 bits (507), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 110/223 (49%), Positives = 141/223 (63%), Gaps = 13/223 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGFHG +IPTPNIDALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ EK+LPQYL ELGY++H+ GKWH+G K + P RGF +HVG+W+G+
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHVGFWSGHQD 156
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
YND + GLD RN + A + Y TD TD SV VI +HN ++ PLFL + H
Sbjct: 157 YNDHTAVENNQWGLDM-RNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPLFLYVAH 215
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRT-FAHISNPDRRLFA 242
AA H+ P L VPD ND +HI N RR FA
Sbjct: 216 AACHSSN------PYNPLPVPD---NDVIKMSHIPNYKRRKFA 249
>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
Length = 579
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/230 (46%), Positives = 145/230 (63%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G NDV FHG N I TPNIDALAYNGI+LN+HY CTPSRA LTGKYP G+
Sbjct: 39 GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P E+L+P+ ++ GYSTHL+GKWH+G +++L P RGFD+H GY
Sbjct: 99 ITDEPWG------LPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152
Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+NGY+ Y D + + +++ GLD RR++E P+ + Y T+ FT ++ +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRMLDRNYSAGLDFRRDLEP-CPEANGTYATEAFTSEAKRIIEQHDKSK 211
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H AVHT GN P +Q P EE F HI +P RR +A
Sbjct: 212 PLFMVLSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253
>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
Length = 578
Score = 198 bits (504), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 144/230 (62%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G NDV FHG N I TPNIDALAYNGI+LN+HY CTPSRA LTGKYP G+
Sbjct: 39 GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P E+L+P+ +E GYSTHL+GKWH+G ++L P RGFD+H GY
Sbjct: 99 ITDEPWG------LPQRERLMPEIFREAGYSTHLVGKWHLGFWHKDLTPTRRGFDHHFGY 152
Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+NGY+ Y D + + +++ GLD RR++E P+ + Y T+ FT ++ +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRMLDRNYSAGLDFRRDLEP-CPEANGTYATEAFTAEAKRIIEQHDKSK 211
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H AVHT GN P +Q P EE F HI +P RR +A
Sbjct: 212 PLFMVMSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253
>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
Length = 579
Score = 198 bits (504), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/230 (46%), Positives = 145/230 (63%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G NDV FHG N I TPNIDALAYNGI+LN+HY CTPSRA LTGKYP G+
Sbjct: 39 GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P E+L+P+ ++ GYSTHL+GKWH+G +++L P RGFD+H GY
Sbjct: 99 ITDEPWG------LPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152
Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+NGY+ Y D + + +++ GLD RR++E P+ + Y T+ FT ++ +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRLLDRNYSAGLDFRRDLEP-CPEANGTYATEAFTSEAKRIIEQHDKSK 211
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H AVHT GN P +Q P EE F HI +P RR +A
Sbjct: 212 PLFMVLSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253
>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
Length = 579
Score = 198 bits (504), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 106/230 (46%), Positives = 145/230 (63%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G NDV FHG N I TPNIDALAYNGI+LN+HY CTPSRA LTGKYP G+
Sbjct: 39 GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIYTGMQHFVI 98
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P E+L+P+ ++ GYSTHL+GKWH+G +++L P RGFD+H GY
Sbjct: 99 ITDEPWG------LPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152
Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+NGY+ Y D + + +++ GLD RR++E P+ + Y T+ FT ++ +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRLLDRNYSAGLDFRRDLEP-CPEANGTYATEAFTSEAKRIIEQHDKSK 211
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H AVHT GN P +Q P EE F HI +P RR +A
Sbjct: 212 PLFMVLSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253
>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 671
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 105/226 (46%), Positives = 136/226 (60%), Gaps = 15/226 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDV FHG + IPTPNIDALAYNGI+LN HY CTPSR+A +TGK P G+ V
Sbjct: 39 GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 98
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P++EKLLP+YL+E+GY TH +GKWH+G K+E P RGFD+H GYWNG
Sbjct: 99 FPAEPRGLPLSEKLLPEYLREVGYKTHAVGKWHLGYFKKEYTPTYRGFDSHFGYWNGLQD 158
Query: 142 YNDSIHETDFAV-----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y I + V G D RRN+ A KY TD FT+++V +I HN +P+FL
Sbjct: 159 YYTHITQEPDPVYSEYKGFDMRRNLT-VAWDTVGKYSTDLFTNEAVRLINEHNIDQPMFL 217
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H A H G L + PD E F++I +P+RR+ A
Sbjct: 218 YLAHLAPHKGNEEQ------LFRAPD--EEIAKFSYILDPERRIQA 255
>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
Length = 550
Score = 197 bits (502), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 107/230 (46%), Positives = 146/230 (63%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G+NDV FHG N I TPNIDALAYNG++LN+ Y CTPSRA LTGKYP G+
Sbjct: 38 GFNDVSFHGSNQILTPNIDALAYNGVLLNKLYVPNLCTPSRATLLTGKYPIHTGMQHYVI 97
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P E+L+P++ ++ GYST LIGKWH+G +++L P RGFD+H GY
Sbjct: 98 ITDEPWG------LPKQERLMPEFFRDAGYSTQLIGKWHLGFWEKDLTPTMRGFDHHYGY 151
Query: 136 WNGYLTYND-SIH--ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+NGY+ Y D ++H ++ G+D RR+++ PQ + Y TD FT ++ VI+ H+ SR
Sbjct: 152 YNGYIDYYDHTLHMLTKNYTKGVDFRRDLDP-CPQDNGTYATDAFTAEAKRVIEQHDKSR 210
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H AVHTG N +Q P EE FAHI++P RR +A
Sbjct: 211 PLFMVLSHLAVHTGNEDNP------MQAP--EEEVAKFAHITDPKRRTYA 252
>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
Length = 577
Score = 197 bits (500), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 106/230 (46%), Positives = 145/230 (63%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G NDV FHG N I TPNIDALAYNGI+LN+HY CTPSRA LTGKYP G+
Sbjct: 39 GMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVI 98
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P E+L+P+ ++ GYSTHL+GKWH+G +++L P RGFD+H GY
Sbjct: 99 ITDEPWG------LPSRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHFGY 152
Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+NGY+ Y D + + +++ GLD RR++E P+ + Y T+ FT ++ +I+ H+ S+
Sbjct: 153 YNGYIDYYDHQVRMLDRNYSHGLDFRRDLEP-CPEANGTYATEAFTSEAKRIIEQHDKSK 211
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H AVHT GN P +Q P EE F HI +P RR +A
Sbjct: 212 PLFMVMSHLAVHT---GNEDSP---MQAP--EEEVAKFPHIRDPKRRTYA 253
>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
Length = 583
Score = 196 bits (498), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 101/223 (45%), Positives = 130/223 (58%), Gaps = 12/223 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG N IPTPNIDALAY G++L HY CTPSRAA LTGKYP G+ +
Sbjct: 52 GWNDVSFHGSNQIPTPNIDALAYYGVLLKNHYVAALCTPSRAALLTGKYPIHLGMQHEAI 111
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ EKLLPQYLK++ Y TH++GKWH+G K E P RGFD H GYWNG
Sbjct: 112 FPSEPRGLPLEEKLLPQYLKDMNYVTHIVGKWHLGYYKMEYTPLYRGFDTHFGYWNGLQD 171
Query: 142 Y--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y + + +G+D RRN A KY D +TD++V +I +HN P+FL +
Sbjct: 172 YYSHKTAEPYTLNIGMDMRRNF-TVAWDTMGKYSVDLYTDEAVRLINTHNTDNPMFLYLA 230
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H G A +LP L E F++I +P R+ +A
Sbjct: 231 QIAPHAGNAN--QLPQAL------PEEIEKFSYIIDPKRKRYA 265
>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 689
Score = 196 bits (498), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 104/226 (46%), Positives = 136/226 (60%), Gaps = 15/226 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV FHG + IPTPNIDA+AYNGI+LN HY CTPSR A +TGK P G+ V
Sbjct: 70 GWNDVSFHGSDQIPTPNIDAIAYNGIILNSHYVAALCTPSRTALMTGKNPIHLGMQHSVL 129
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P++EKLLP+YL+E+GY TH +GKWH+G + E P RGFD H GYWNG
Sbjct: 130 LPSEPRGLPLSEKLLPEYLREVGYRTHAVGKWHLGYFRREYTPTFRGFDTHFGYWNGLQD 189
Query: 142 YNDSIHET---DFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y I + F +GLD RRN+ A KY TD FT+++V +I H+ P+FL
Sbjct: 190 YYTHITQEPDPQFGEFMGLDMRRNLT-AAWDTQGKYSTDLFTEEAVRLINEHDKDDPMFL 248
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H A H G P LL+ D E+ F++I +P+RR+ A
Sbjct: 249 YLAHLAPHKGN------PNRLLRASD--EDIARFSYILDPERRIQA 286
>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 532
Score = 196 bits (498), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 101/228 (44%), Positives = 141/228 (61%), Gaps = 15/228 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW D GFHG + I TPN+DALAY+G++LNRHY LP+CTPSR+A LTG YP R G+ P+
Sbjct: 39 GWTDTGFHGSDQIKTPNMDALAYSGMILNRHYVLPSCTPSRSALLTGLYPIRTGMQGMPL 98
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P++ KL P++LK LGY THL+GKWH+G LP RGFD+ GY+NGY+
Sbjct: 99 KGGDVRNLPLSFKLKPEFLKNLGYRTHLVGKWHLGYRTINHLPNQRGFDSFFGYYNGYVD 158
Query: 142 Y-----NDSI--HETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y N ++ + ++ G D RN E Y + Y T FT ++ +IK+HN S PL
Sbjct: 159 YFKFGHNQTVAGEKIEYFYGYDLHRNGEIYQTDKDT-YATRLFTREAEKIIKNHNESEPL 217
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+L +H A HTG ++VP+ + ++T+ HI + RR FA
Sbjct: 218 YLYFSHLATHTGDDDIG------MEVPEDADVNKTYGHIKHYGRRAFA 259
>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
Length = 573
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 109/230 (47%), Positives = 136/230 (59%), Gaps = 26/230 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GWNDV FHG N IPTPNIDALAY+GI+LNRHY P CTPSRA+ +TGK+P G+
Sbjct: 39 GWNDVSFHGSNQIPTPNIDALAYDGIILNRHYVPPLCTPSRASLMTGKHPMNIGMQDHVI 98
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G + +KL+PQY +E GY THL+GKWH+G + P RGFD+H GY
Sbjct: 99 ISDEPWGLG------LDQKLMPQYFREAGYRTHLVGKWHLGFFRRAYTPTYRGFDSHFGY 152
Query: 136 WNGYLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
Y+ Y D ++ET A GLD RRN + Y TD F D++V +I SHN S+
Sbjct: 153 LGPYIDYWDHSLQMNETS-ARGLDMRRNTA-VNYDANGTYATDLFNDEAVRLIDSHNRSK 210
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL +TH A HTG + LQ P + F +I +P RR A
Sbjct: 211 PLFLVLTHLAPHTGNEDDP------LQAP--ADEIAKFDYIQDPKRRTLA 252
>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
Length = 324
Score = 195 bits (495), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 104/230 (45%), Positives = 143/230 (62%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G+NDV FHG N I TPNIDA AYNG++LNR+Y CTPSRAA LTGKYP G+
Sbjct: 39 GFNDVSFHGSNQILTPNIDAFAYNGVILNRYYVPNLCTPSRAALLTGKYPIHNGMQHFVQ 98
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P+ E+L+PQ+ ++ GYST L+GKWH+G +++ P RGFD+H GY
Sbjct: 99 IPDEPWG------LPLGERLMPQFFRDAGYSTQLVGKWHLGFWRQDHTPIMRGFDHHFGY 152
Query: 136 WNGYLTYNDSIH---ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+NGY+ Y D H + ++ G D RR+++R + Y T+ FT ++ +I+ H+ SR
Sbjct: 153 YNGYIDYYDHTHYMLDRNYTAGADFRRDLQRCHSD-NGTYATEAFTKEARRIIEQHDLSR 211
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H AVHT GN P +Q P E F HIS+P RR +A
Sbjct: 212 PLFMVLSHLAVHT---GNENQP---MQAP--YEEVAKFVHISDPKRRTYA 253
>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 579
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 137/221 (61%), Gaps = 14/221 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
GWNDVGFHG N IPTPNIDALAYNGI+LNRHY LP+ TPSR AF TG YP R G+ +
Sbjct: 42 GWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGI 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ K+LP++L+ LGY+T LIGKWH+G + + P +RGFD G++N ++T
Sbjct: 102 RGGEPRGLPLHIKILPEHLRGLGYTTKLIGKWHMGYHTPQYTPLHRGFDTFFGFYNSHIT 161
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + G D R + A + +Y+TD FT +++ +I++H RPL+LQI+H
Sbjct: 162 YYDYEYSNQNMTGYDMHRG-DDPAHGIKREYVTDLFTKEAIKIIENHELPRPLYLQISHL 220
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH ++ PD +D I P+RR +A
Sbjct: 221 AVHAP-----------IEQPDDSSSDEII-QIREPNRRKYA 249
>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
Length = 591
Score = 193 bits (490), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 96/221 (43%), Positives = 137/221 (61%), Gaps = 12/221 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DVGFHG + I TPNIDAL YNGI+LNRHY LP+ TPSR AF TG+YP R G+ +
Sbjct: 41 GWDDVGFHGSDQIRTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRMGMQGEDI 100
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ ++LP++L++LGY T LIGKWH+G + P RGFD+ +G++N +++
Sbjct: 101 QGGEPRGIPLNVRILPEFLRDLGYMTKLIGKWHLGYYTPQHTPLRRGFDSFLGFYNSHVS 160
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + + G D R + A + KY+TDFFTD+++ +I+ ++ SRPL+LQI+H
Sbjct: 161 YYNYKYSFQNMSGYDMHRG-DAPAYGSTDKYVTDFFTDEAIKIIEYYDPSRPLYLQISHL 219
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH G D D F HI +RR +A
Sbjct: 220 AVHAPLEGPQ----------DYNHYDSQFLHIREINRRKYA 250
>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
Length = 593
Score = 192 bits (489), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 99/224 (44%), Positives = 138/224 (61%), Gaps = 16/224 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
GWNDV FHG N IPTPNIDAL YNGI+LNRHY LP+ TPSRAAF TG YP R G+ +
Sbjct: 42 GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRAAFFTGLYPIRIGMQGEGI 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ ++LP+YL+ LGY+T LIGKWH+G + + P +RGFD +G++N +++
Sbjct: 102 QGGEPRGLPLNIRILPEYLRGLGYTTKLIGKWHVGYHTPQHTPLHRGFDAFLGFYNSHVS 161
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + G D R + A ++++Y TD FTD+++ +I+ H RPL+LQI+H
Sbjct: 162 YYDYRYSYQNMSGYDMHRG-DNPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQISHL 220
Query: 202 AVHTGTAGNAKLPTGLLQVPD---MEENDRTFAHISNPDRRLFA 242
AVH ++ PD N F HI +RR +A
Sbjct: 221 AVHAP-----------IESPDDDHRNSNRERFKHIPEVNRRNYA 253
>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 514
Score = 192 bits (487), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 108/223 (48%), Positives = 135/223 (60%), Gaps = 13/223 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG N IPTPNIDALA+ GI+LN +Y P CTPSR+A LTGKYP G+ V
Sbjct: 42 GWNDVGFHGSNQIPTPNIDALAFTGIILNNYYVAPVCTPSRSALLTGKYPIHTGLQHGVI 101
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G A + + EKLLP+YL+ L Y T +GKWH+G K++ P RGFD+H GYW G+
Sbjct: 102 HGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKDYTPEYRGFDSHYGYWTGHQD 161
Query: 142 YND--SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y D +I F G D RR M Y TD FT+++V VIK H+ ++PLFL +
Sbjct: 162 YYDHTAIENPGFW-GYDMRRGMNVTRSDFGY-YTTDLFTNEAVKVIKGHDSNKPLFLYLA 219
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H A H+ GN P LQ P E F +I + +RRLFA
Sbjct: 220 HLATHS---GNKYSP---LQAP--AETVAKFNYIKDKNRRLFA 254
>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
Length = 558
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 100/224 (44%), Positives = 140/224 (62%), Gaps = 13/224 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G ND+G N IPTPNIDAL YNG+VL+R+Y CTPSRAAFLTG YP R + P+
Sbjct: 38 GHNDIGLR-TNQIPTPNIDALGYNGVVLDRYYVQNACTPSRAAFLTGNYPIRSAMQGLPI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AG +++P+ L+PQ+LK LGY TH++GKWH+G P +GFD+H GYWNG+
Sbjct: 97 VAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWHLGSAYRSSTPTEKGFDSHFGYWNGFTG 156
Query: 142 YNDSIHETDF-AVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y D + TDF + ++ +R+ + +Y T FT++++ +I+ HN +RPLFL +
Sbjct: 157 YYD--YFTDFNSTAIEGFDLHDRFETERGYQGQYATRVFTERALDIIEGHNTTRPLFLLM 214
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
TH A H G G L VP+ E RT+++I +P RRL+A
Sbjct: 215 THLAAHAGRDGTE------LGVPNEVEAQRTYSYIQDPRRRLYA 252
>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
Length = 564
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 99/221 (44%), Positives = 134/221 (60%), Gaps = 14/221 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
GWNDVGFHG N IPTPNIDALAYNGI+LNRHY LP+ TPSR AF TG YP R G+ +
Sbjct: 42 GWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGI 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ K+LP++L+ LGY T LIGKWH+G + + P +RGFD G++N ++T
Sbjct: 102 RGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHRGFDTFFGFYNSHIT 161
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + G D + A M +Y TD FT++++ +I++H RPL+LQI+H
Sbjct: 162 YYDYEYSNQNMTGYDMHCG-DDPAYGMKREYATDLFTNEAIKIIENHELPRPLYLQISHL 220
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH ++ PD D I P+RR +A
Sbjct: 221 AVHAP-----------IEQPDDSSRDE-IVQIREPNRRKYA 249
>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
Length = 563
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 141/225 (62%), Gaps = 11/225 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
G+NDV FHG + IPTPN+ +A GI+L+R YT TCTPSR A LTG+YP R G+ P+
Sbjct: 50 GYNDVSFHGSSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPL 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AG +++P+ +P + + LGY THL+GKWH+G +E P +GFD+H GYWNG++
Sbjct: 110 KAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVG 169
Query: 142 YND--SIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y D S + D + +++ P S +Y T+ FT++S+ VI+ H+ PLFL
Sbjct: 170 YFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLV 229
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HTG G+ L VPD+++ + F++I +P RRL+A
Sbjct: 230 VSHLAAHTGQNGSE------LGVPDVDQTNHEFSYIQDPRRRLYA 268
>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 536
Score = 190 bits (483), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 102/228 (44%), Positives = 142/228 (62%), Gaps = 21/228 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
GWNDVGFHG N IPTPNIDALAYNGI+LN HY+ TPSRAA LTGKYP + G+ P +
Sbjct: 19 GWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKYPMKLGLQGPSI 78
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+++P K++ +Y K++GY+THL+GKWH+G ++ P RGFD+ G++NG+ +
Sbjct: 79 TPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGFYNGFTS 137
Query: 142 YND-----SIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y D I++ +++ G D RR+ P + KY TD F + +V VI+ HN + PL
Sbjct: 138 YYDYVSNWKINDKEYS-GFDLRRDT---VPSWNDAGKYATDLFAEHAVDVIQKHNVNTPL 193
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
F+ I H AVH G G L+ P +E F HI +P+RR +A
Sbjct: 194 FMMIAHLAVHVGNEGK------WLEAP--QETVNKFKHIRDPNRRTYA 233
>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
Length = 531
Score = 190 bits (483), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 141/225 (62%), Gaps = 11/225 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
G+NDV FHG + IPTPN+ +A GI+L+R YT TCTPSR A LTG+YP R G+ P+
Sbjct: 35 GYNDVSFHGSSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPL 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AG +++P+ +P + + LGY THL+GKWH+G +E P +GFD+H GYWNG++
Sbjct: 95 KAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVG 154
Query: 142 YND--SIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y D S + D + +++ P S +Y T+ FT++S+ VI+ H+ PLFL
Sbjct: 155 YFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLV 214
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HTG G+ L VPD+++ + F++I +P RRL+A
Sbjct: 215 VSHLAAHTGQNGSE------LGVPDVDQTNHEFSYIQDPRRRLYA 253
>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 578
Score = 190 bits (483), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 105/226 (46%), Positives = 136/226 (60%), Gaps = 22/226 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG IPTPNIDAL YNGI+LN+HY LP+C+P+RAAFLTGKYP R G+ G
Sbjct: 36 GWNDVGFHGATQIPTPNIDALGYNGIILNKHYVLPSCSPTRAAFLTGKYPIRMGMQ---G 92
Query: 83 AGVA----KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
AG+A + +PV + LP+YL+ LGY T+LIGKWH+G + + LP RGFD G++N
Sbjct: 93 AGIAGGEPRGLPVHVQTLPEYLQGLGYETNLIGKWHVGYHTPKHLPNRRGFDYFYGFYNS 152
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
++ Y D + G D N E A Y TD FT ++ VI H+ P++LQ+
Sbjct: 153 HIGYYDYRYSQGNMSGFDMHINGET-AYGTDGVYATDRFTQAAIDVIYRHDLESPMYLQV 211
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEEN--DRTFAHISNPDRRLFA 242
+H A H + VP E+N D F HIS P RR +A
Sbjct: 212 SHLAPHAP-----------MDVP-FEDNPYDDEFRHISEPKRRAYA 245
>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
Length = 558
Score = 190 bits (482), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 102/228 (44%), Positives = 142/228 (62%), Gaps = 21/228 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
GWNDVGFHG N IPTPNIDALAYNGI+LN HY+ TPSRAA LTGKYP + G+ P +
Sbjct: 41 GWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKYPMKLGLQGPSI 100
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+++P K++ +Y K++GY+THL+GKWH+G ++ P RGFD+ G++NG+ +
Sbjct: 101 TPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGFYNGFTS 159
Query: 142 YND-----SIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y D I++ +++ G D RR+ P + KY TD F + +V VI+ HN + PL
Sbjct: 160 YYDYVSNWKINDKEYS-GFDLRRDT---VPSWNDAGKYATDLFAEHAVDVIQKHNVNTPL 215
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
F+ I H AVH G G L+ P +E F HI +P+RR +A
Sbjct: 216 FMMIAHLAVHVGNEGK------WLEAP--QETVNKFKHIRDPNRRTYA 255
>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
Length = 552
Score = 188 bits (478), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/222 (45%), Positives = 134/222 (60%), Gaps = 12/222 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG IPTPN+DALAY+GI+LNR+Y P CTPSRAA +TG+YP G+ V
Sbjct: 36 GWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVTPICTPSRAALMTGRYPIHTGMQHAVL 95
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-YL 140
G+ + +P+ EKLLP+YL+ELGY H++GKWH+G P RGFD+HVG+W G +
Sbjct: 96 YGMEPRGLPLEEKLLPEYLRELGYKNHIVGKWHLGHYTRRYTPLERGFDSHVGFWTGHHH 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
++ S ET+ GLD RR + A + KY T D++V I +H+ PLFL + H
Sbjct: 156 MFDHSAVETE-TWGLDMRRGYD-VAYDLHGKYTTHVIRDEAVARIGNHSIGDPLFLYVAH 213
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AAVH+ P L PD+ H+ RR FA
Sbjct: 214 AAVHSAN------PYDFLPAPDVTV--AGLEHVEPYPRRKFA 247
>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 570
Score = 186 bits (473), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 100/223 (44%), Positives = 133/223 (59%), Gaps = 13/223 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV FHG N I TPNIDALAYNGI+LN HY CTPSRA+ +TGKYP G+ V
Sbjct: 58 GWNDVSFHGSNQIQTPNIDALAYNGIILNSHYVPALCTPSRASLMTGKYPTSLGMQHLVI 117
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E L+P+Y + GY+TH +GKWH+G K+E P RGFD+H G+WNG+
Sbjct: 118 LSPEPWGLPLNETLMPEYFNKNGYATHAVGKWHLGFFKKEYTPIYRGFDSHFGHWNGFQD 177
Query: 142 YNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQIT 199
Y D +D G D RRN E Y+ Q Y TD FT +++ +I +HN + PLFL ++
Sbjct: 178 YYDHTTMSDSLKGYDMRRNFEVDYSYQ--GMYTTDVFTKEAIKIIDNHNSQKGPLFLYLS 235
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H A H+G P Q P E+ I++P R+++A
Sbjct: 236 HLAPHSGN------PDNPFQAP--EDEISKHECINDPGRKIYA 270
>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
Length = 657
Score = 186 bits (471), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 105/227 (46%), Positives = 131/227 (57%), Gaps = 39/227 (17%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GWNDVGFHG N IPTPNIDALAY GI+LNRHYT P CTPSRAA +TG+ P G+
Sbjct: 42 GWNDVGFHGSNQIPTPNIDALAYGGIILNRHYTAPMCTPSRAAIMTGRNPISVGMQHYVI 101
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G + +K++P+Y +E GY THL+GKWH+G ++ P RGFD+H Y
Sbjct: 102 DSDEPWGLG------LDQKIMPEYFREAGYRTHLVGKWHLGFFAQQYTPTMRGFDSHTNY 155
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
GY D H + AV DA + Y TD FTD + +I HN S PLF
Sbjct: 156 -TGY----DMRH--NLAVDYDA-----------NGTYATDHFTDAASRIIDKHNPSEPLF 197
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + H A HTG + LQ P EE + F HIS+ +RR++A
Sbjct: 198 LMVNHLAPHTGNDNDP------LQAP--EERIKKFEHISDENRRIYA 236
>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
Length = 557
Score = 186 bits (471), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 98/221 (44%), Positives = 127/221 (57%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV FHG IPTPN+DALA++G++L +Y P CTPSR+A +TGK+P G+ V
Sbjct: 37 GWNDVSFHGSKQIPTPNLDALAFSGLILQNYYVTPLCTPSRSALMTGKHPIHTGMQHDVL 96
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G ++ +P++E LP+YLK+LGY H++GKWH+G K P RGFD+H GYW G+
Sbjct: 97 YGYSRYGLPLSEITLPEYLKDLGYKNHIVGKWHLGHYKSVYTPLFRGFDSHYGYWTGHQD 156
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D A G D RRN KY T TD++ VI H+ S PLFL + H
Sbjct: 157 YYDHTAVEWNAWGYDMRRN-HSVDWSAYGKYTTTLLTDEACDVITKHDVSSPLFLYVAHL 215
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ P LQ P EE F+ I N RR +A
Sbjct: 216 AVHSAN------PYSPLQAP--EETVEMFSSIENLQRRRYA 248
>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
Length = 675
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 106/232 (45%), Positives = 136/232 (58%), Gaps = 29/232 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GWNDVGFHG N IPTPNIDALAY+GI+LNRHYT P CTPSRA+ +TGK P G+
Sbjct: 43 GWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKNPINIGMQHYVI 102
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G + +K++P+Y KE GY THL+GKWH+G + ++ P RGFD HVGY
Sbjct: 103 VSDEPWGLG------LDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGY 156
Query: 136 WNGYLTYNDSIHETDFA-----VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH 190
Y+ Y D + F+ G D R N+ + Y TD FT + +I+ H+
Sbjct: 157 LGPYVDYWD--YTLKFSPPKSFQGYDMRNNLN-VDYDSNGTYATDHFTKAASSIIERHDT 213
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL + H A H A N P LQ P EE+ R F +IS+ RR++A
Sbjct: 214 KDPLFLVVNHLAPH---AANDDDP---LQAP--EEDIRKFDYISDERRRIYA 257
>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
Length = 581
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 139/221 (62%), Gaps = 13/221 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG N IPTPNIDAL YNGI+LNRHY LP+ TPSR AF TG+YP R G+ +
Sbjct: 42 GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADI 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ K+LP++L+ LGY+T LIGKWH+G + P +RGFD +G++N Y++
Sbjct: 102 RGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYIS 161
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + G D R + A M+ +Y TD FT +++++I++H +RPL+LQ++H
Sbjct: 162 YYDYSYSNQNMSGYDMHRG-DDPAYGMNREYATDMFTREAINIIENHELNRPLYLQLSHL 220
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH L+ P NDR HI P+RR +A
Sbjct: 221 AVHAP-----------LEQPMNVYNDREPIHIREPNRRKYA 250
>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
Length = 607
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 106/232 (45%), Positives = 136/232 (58%), Gaps = 29/232 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GWNDVGFHG N IPTPNIDALAY+GI+LNRHYT P CTPSRA+ +TGK P G+
Sbjct: 43 GWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKNPINIGMQHYVI 102
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G + +K++P+Y KE GY THL+GKWH+G + ++ P RGFD HVGY
Sbjct: 103 VSDEPWGLG------LDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGY 156
Query: 136 WNGYLTYNDSIHETDFA-----VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH 190
Y+ Y D + F+ G D R N+ + Y TD FT + +I+ H+
Sbjct: 157 LGPYVDYWD--YTLKFSPPKSFQGYDMRNNLN-VDYDSNGTYATDHFTKAASSIIERHDT 213
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL + H A H A N P LQ P EE+ R F +IS+ RR++A
Sbjct: 214 KDPLFLVVNHLAPH---AANDDDP---LQAP--EEDIRKFDYISDERRRIYA 257
>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
Length = 585
Score = 184 bits (467), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 104/232 (44%), Positives = 140/232 (60%), Gaps = 28/232 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG------ 76
GWNDV FHG IPTPNIDALAY+GI+LNRHY P CTPSRA+ +TGK+P G
Sbjct: 42 GWNDVSFHGSLQIPTPNIDALAYSGIILNRHYAPPLCTPSRASLMTGKHPINIGMQHHVI 101
Query: 77 -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
+D P G G + +KL+P+Y +E GY T L+GKWH+G ++ P RGFD+H GY
Sbjct: 102 EVDEPWGLG------LDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTMRGFDSHYGY 155
Query: 136 WNGYLTYND-SIHETDFAV-GLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHS- 191
Y+ Y D S+ ++ + GLD RRN++ Y+ + Y TD FT ++V +I HN +
Sbjct: 156 IGPYIDYWDHSLQMSNTSTRGLDMRRNLQVDYSAR--GTYATDLFTREAVRLIHDHNQTS 213
Query: 192 -RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL +TH A HTG + +Q P EE+ F+ I +P RR+ A
Sbjct: 214 ANPLFLVVTHLAPHTGNEDDP------MQAP--EEDVELFSFIKDPKRRVLA 257
>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 559
Score = 181 bits (458), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 95/221 (42%), Positives = 133/221 (60%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GW+DV FHG + IPTPNID LAY+G++L+ +Y P CTP+RAA +TG++P G+ D +
Sbjct: 39 GWDDVSFHGSDQIPTPNIDELAYSGVLLHNYYVQPICTPTRAALMTGRHPIHLGLQDGVI 98
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ E ++PQYLK LGY TH++GKWH+G + P RGFD H GY+NG
Sbjct: 99 VASHPYGLPLNETIMPQYLKPLGYDTHIVGKWHLGFFAWQYTPLYRGFDTHFGYYNGEEG 158
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D E +GLD R N E + +Y T+ FT + +I +HN ++PLFL + H
Sbjct: 159 YYDHTAEEPKYIGLDFRNNTELFK-SAYGEYSTELFTSYAEKIIHNHNKNKPLFLYLAHQ 217
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ GN+ P L+ P + F +I + RR FA
Sbjct: 218 AVHS---GNSYSP---LEAP--YKYTSRFPYIQDERRRTFA 250
>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
Length = 634
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 103/231 (44%), Positives = 136/231 (58%), Gaps = 27/231 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GWNDVGFHG N I TP+IDALAY+G++LNRHY+ P CTPSRAA +TG++P G+
Sbjct: 45 GWNDVGFHGSNQIATPHIDALAYDGVILNRHYSAPMCTPSRAALMTGRHPINVGMQHYVI 104
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G + ++++PQY + GY TH+IGKWH+G E +P NRGFD H+GY
Sbjct: 105 DSDEPWGLG------LDQRIMPQYFRAAGYRTHMIGKWHLGFFTEHYIPTNRGFDTHIGY 158
Query: 136 WNGYLTYNDSIHETDFAV--GLDARRN-MERYAPQMSSKYLTDFFTDQSVHVIKSHNHS- 191
Y+ Y + + + G D R+N YA + Y TD+FT + +I H S
Sbjct: 159 LGPYVDYWSYVSKMNSGTFEGYDMRQNQFVNYA--ANGTYATDYFTSAARDIIAQHGKSG 216
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P+ L + H A H AGN P LQ P E DR FA+I N DRR +A
Sbjct: 217 QPMLLVMNHLAPH---AGNDDDP---LQAP-QETIDR-FAYIGNRDRRTYA 259
>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
Length = 633
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 130/222 (58%), Gaps = 8/222 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GW+DV FH IPTPNIDA+ + ++LN HY +CTPSR A LTGKYP + G+ + +
Sbjct: 68 GWSDVSFHANGQIPTPNIDAMCSDAVLLNSHYVQASCTPSRGALLTGKYPIKIGLQEYVI 127
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +A+ + +LLPQYL++LGY+THL+GKWH+G E+ LP NRGFD+ G++NG T
Sbjct: 128 QPGRQEALHLKHRLLPQYLRDLGYATHLVGKWHLGFYAEDYLPENRGFDSFYGFYNGAGT 187
Query: 142 -YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
YN S + D +G D N E P KY TD T + H+I+S + +P+FL I+H
Sbjct: 188 YYNHSASDADGRIGYDWHLNKES-DPDAHGKYATDIITQRVKHLIQSRDPEKPMFLMISH 246
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H G + L +V D AHI R +A
Sbjct: 247 MAPHGGDNEDE-----LFEVDRQWIEDPEIAHIMVESRTKYA 283
>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 594
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 97/221 (43%), Positives = 128/221 (57%), Gaps = 12/221 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV F G IPTPN+DALA G++L HY P CTPSRAA LTG YP G+ +
Sbjct: 94 GWNDVSFTGSGQIPTPNLDALASAGVILQNHYVQPFCTPSRAALLTGMYPIHSGMQHYVI 153
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ KLLPQ+LK+LGY THLIGKWH+G K+E LP RGFD+H+GY+NGY+
Sbjct: 154 RSREPWGLPLDFKLLPQHLKDLGYRTHLIGKWHLGQFKKEFLPTRRGFDSHLGYYNGYID 213
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y H LD ++ P S +Y T FTD++ +I+ H+ PLFL H
Sbjct: 214 YFTHNHTYKRDSALDFFKDE---VPYHSEEYATRLFTDRAEEIIRDHDVDNPLFLYFAHL 270
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH T + Q P +E F+++ + +R FA
Sbjct: 271 AVHRATDRDP------FQAP--QETIDKFSYVGDRNRTTFA 303
>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
Length = 478
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 130/223 (58%), Gaps = 33/223 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGFHG DIPTPNIDALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 37 GFNDVGFHGSADIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ EK+LPQYL ELGY++H+ GKWH+G K + P RGF +H W
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYASHIAGKWHLGHWKLKYTPLYRGFSSH---W----- 148
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD R E A + Y TD TD SV VI SHN ++ PLFL + H
Sbjct: 149 ------------GLDMRNGTE-VAYDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAH 195
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRT-FAHISNPDRRLFA 242
AA H+ P L VPD ND AHI + RR FA
Sbjct: 196 AACHSSN------PYNPLPVPD---NDVIKMAHIPHYKRRKFA 229
>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
Length = 485
Score = 177 bits (448), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 95/228 (41%), Positives = 134/228 (58%), Gaps = 16/228 (7%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGWNDV FHG + IPTPN+D+LAY+G++L +Y P CTP+R+A +TG++P G+ V
Sbjct: 2 QGWNDVSFHGSDQIPTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQHGV 61
Query: 82 GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+G +P+ E +LPQYLK LGY+TH++GKWH+G + E P RGFD++ GY G
Sbjct: 62 ISGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTGKD 121
Query: 141 TYNDSIHETDFA------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y D + + GLD R E + + Y T+ F ++ +I SH+ S+PL
Sbjct: 122 NYYDHTDDESNSPEELGYKGLDLRNGTEPVWTE-NGTYSTELFATEAERIITSHDTSKPL 180
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
FL + H AVH+G N LQ P ++ F HI +P RR FA
Sbjct: 181 FLYLPHQAVHSGNPDNP------LQAP--QKYIDKFPHIQHPGRRTFA 220
>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
Length = 542
Score = 176 bits (447), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 105/223 (47%), Positives = 132/223 (59%), Gaps = 33/223 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGFHG +IPTPNIDALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ EK+LPQYL ELGY++H+ GKWH+G K + P RGF +H W
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSH---W----- 148
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD RN + A + Y TD TDQSV VI +HN ++ PLFL + H
Sbjct: 149 ------------GLDM-RNGTQVAYDLHGHYTTDVITDQSVKVIANHNATKGPLFLYVAH 195
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRT-FAHISNPDRRLFA 242
AA H+ P L VPD ND +HI N RR FA
Sbjct: 196 AACHSSN------PYNPLPVPD---NDVIKMSHIPNYKRRKFA 229
>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
Length = 561
Score = 176 bits (447), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 98/232 (42%), Positives = 132/232 (56%), Gaps = 23/232 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG + TPN+DALAY+G++L +Y P CTPSRAA +TG++P G+ V
Sbjct: 37 GWNDVGFHGSEQVLTPNLDALAYDGVILENYYVQPICTPSRAALMTGRHPIHTGMQQNV- 95
Query: 83 AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ A P + E + PQYLK++GY TH++GKWH+G E+ P RGFD+H GY+ G
Sbjct: 96 --IYSAEPYGLGLNEIIFPQYLKQIGYKTHIVGKWHLGFFAEQYTPIERGFDSHYGYYMG 153
Query: 139 YLTYNDSI----HETDF--AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
Y I HE F + GLD RRN E +Y T+ FT ++ ++I SHN S
Sbjct: 154 AEDYWVHIAGNAHEVSFNASWGLDFRRNGEVVKTAF-GQYSTELFTTEAENIIASHNQSE 212
Query: 193 PLFLQITHAAVHTGT--AGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ + AVH+ G+A+L + F HI N RR FA
Sbjct: 213 PLFMYVAQQAVHSANPYTGDAELEAPF-------KYYEKFPHIKNEKRRKFA 257
>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 502
Score = 176 bits (445), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 97/225 (43%), Positives = 134/225 (59%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV +HG I TPNIDALA+NGI LNR+YT P CTPSR+AFLTG YP G+ V
Sbjct: 37 GWNDVSYHGSPQILTPNIDALAWNGIRLNRYYTQPLCTPSRSAFLTGCYPMNTGMQHSVI 96
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ KLLPQ+L + GY + ++GKWH+G KEE P RGF +HVG W G+
Sbjct: 97 LTTEPRGLPLHYKLLPQWLGDFGYVSRMLGKWHLGYYKEEYTPTMRGFQSHVGSWEGFSD 156
Query: 142 YNDSIHETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y I + + G D RR+M++ + + +Y T T++++ +IK H + +PLFL
Sbjct: 157 YYSHIMDFSWQTWSISGHDFRRDMQK-SKEDDGRYYTHVMTEEALKIIKDHPNEKPLFLY 215
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
I H AVH+ GN P L+ P + + I +P R L+A
Sbjct: 216 IAHLAVHS---GNQPEP---LKAPTKYTD--PYMDIGHPSRTLYA 252
>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
Length = 544
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 104/223 (46%), Positives = 132/223 (59%), Gaps = 33/223 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGFHG +IPTPNIDALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ EK+LPQYL ELGY++H+ GKWH+G K + P +RGF +H W
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLHRGFSSH---W----- 148
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD RN + A + Y TD TD SV VI SHN ++ PLFL + H
Sbjct: 149 ------------GLDM-RNGTQVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAH 195
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEEND-RTFAHISNPDRRLFA 242
AA H+ P L VPD ND +HI + RR FA
Sbjct: 196 AACHSSN------PYNPLPVPD---NDVLKMSHIPHYKRRKFA 229
>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 526
Score = 174 bits (442), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 96/227 (42%), Positives = 131/227 (57%), Gaps = 19/227 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GW+DV FHG + IPTPNID LAY+G++L+ +Y P CTP+R A LTG+YP G+
Sbjct: 38 GWDDVSFHGSHQIPTPNIDELAYSGVLLHNYYVQPVCTPTRGALLTGRYPMHLGLQHFVI 97
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
+ PVG +P+ E LP YLK+LGYSTH++GKWH+G +E P RGFD+H GY
Sbjct: 98 TPNEPVG------LPLNETTLPTYLKKLGYSTHMVGKWHLGFFAKEYTPTYRGFDSHYGY 151
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
+ G+ Y + G D R NM+ +Y + FT Q+ +I H+H +PLF
Sbjct: 152 FLGHQDYYTHNALWNNQWGFDLRHNMDLQRSTF-GEYGPELFTTQAEKLIYDHDHKKPLF 210
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L H AVH G +G P G L + R F HI++ RR++A
Sbjct: 211 LYFAHQAVHYGNSG----PNGTLLEAPYKYTSR-FPHIADHQRRIYA 252
>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
Length = 633
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 103/223 (46%), Positives = 131/223 (58%), Gaps = 33/223 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGFHG +IPTPNIDALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ EK+LPQYL ELGY++H+ GKWH+G K + P RGF +H W
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSH---W----- 148
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD RN + A + Y TD T+ SV VI +HN ++ PLFL + H
Sbjct: 149 ------------GLDM-RNGTQVAYDLHGHYTTDVITEHSVKVIANHNATKGPLFLYVAH 195
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRT-FAHISNPDRRLFA 242
AA H+ P L VPD ND +HI N RR FA
Sbjct: 196 AACHSSN------PYNPLPVPD---NDVIKMSHIPNYKRRKFA 229
>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
Length = 512
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 93/206 (45%), Positives = 126/206 (61%), Gaps = 15/206 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV +HG I TPNIDALA+NGI L R+Y P CTPSRAA LTG+YP G+ + +
Sbjct: 1 GWNDVSYHGCPQIRTPNIDALAWNGIRLRRYYAQPLCTPSRAALLTGRYPINMGLQHSVI 60
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+++ LLPQ+L +LGY TH +GKWHIG K+E P RGF+ HVG+W Y+
Sbjct: 61 YNEEPRGLPLSDTLLPQWLADLGYVTHHLGKWHIGFFKKEYTPTMRGFERHVGFWGAYID 120
Query: 142 YNDSIHETDF-----AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y HE + + GLD RRN+ A + +Y+T T +++ VI++H +PLFL
Sbjct: 121 YYK--HEKAYLGPTRSPGLDMRRNL-FLARNDTGRYVTQLLTKEALEVIENHPVDKPLFL 177
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPD 222
+ H A H+ P LQVPD
Sbjct: 178 YLAHLAPHSAG------PQDPLQVPD 197
>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/221 (42%), Positives = 128/221 (57%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DV FHG IPTPNID LA G++LN +Y P CTP+R+A +TGKYP G+ + +
Sbjct: 35 GWDDVSFHGSGQIPTPNIDGLAKTGVILNNYYVSPICTPTRSAIMTGKYPIHTGMQHSVI 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + + E L+PQYLK LGY+TH +GKWH+G K E P RGFD++ GYW G
Sbjct: 95 LAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLGFFKYEYTPIQRGFDSYFGYWCGKGD 154
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D + + GLD + E+ Y +D F +++V+VI +HN S PLFL +
Sbjct: 155 YWDHSNNEKYGWGLDL-HDSEQDVWTEWGHYSSDLFAEKAVNVISTHNASVPLFLYLPFQ 213
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ A L PD+ + F +I + RR+FA
Sbjct: 214 AVHS-----ANFIQPLQAPPDLIDK---FKNIKDERRRIFA 246
>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 513
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 98/227 (43%), Positives = 129/227 (56%), Gaps = 15/227 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW+D+G HG + IPTPNID LA G+VL+ +YT P CTPSRA+ +TGKYP R G+ V
Sbjct: 40 GWDDIGLHGSSQIPTPNIDKLAEEGVVLDNYYTQPICTPSRASLMTGKYPVRLGLQHDVI 99
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
A +P K++PQYL + Y H++GKWH+G ++ E LP RGF +H GY G
Sbjct: 100 SAATPFGLPSNFKIMPQYLHDKNYDCHIVGKWHLGHSRSEFLPTRRGFKDHFGYRLGSSD 159
Query: 139 -YLTY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y Y +DS GLD N E A + + KY TD +T +S +++ HN SRPLF
Sbjct: 160 HYSHYGADDSDVPGSLFYGLDLWHN-EVPAKEFNGKYSTDIYTHRSTDILRMHNKSRPLF 218
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + + AVH G P LQ P DR + I N RR +A
Sbjct: 219 LYLAYQAVHAGN------PDQALQAP-QSIVDRFSSSIRNDRRRRYA 258
>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 554
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/221 (41%), Positives = 126/221 (57%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DV FHG + IPTPN+D LA +G++LN +Y P CTPSRAA +TG YP R G+ P+
Sbjct: 47 GWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLYPIRTGMQGMPI 106
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P ++LPQYLKE GY THL+GKWH+G KE L P RGFD+ GY+ G
Sbjct: 107 DVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFDSFYGYYYGESD 166
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + GLD N + ++ + Y T FT ++ ++I++ S+PL L ITH
Sbjct: 167 YFAHTISYENHTGLDFWLNKKPVWSEIGT-YSTSVFTKRAQYIIENRTKSKPLLLVITHQ 225
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H L LQ P +EN F +I +R ++A
Sbjct: 226 ATHCA------LERERLQAP--QENIDKFPYIGEKNRTIYA 258
>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
Length = 500
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/227 (42%), Positives = 133/227 (58%), Gaps = 19/227 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+D+ HG +IPTPNID LA +GI+LN +Y P CTPSRAA +TG++P G+ V
Sbjct: 36 GWDDISLHGSQEIPTPNIDLLATDGILLNNYYVQPICTPSRAALMTGRHPVHLGLQHDV- 94
Query: 83 AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ A P + E LLPQYLK LGYSTH++GKWH+G +E P RGFD+H+GY+ G
Sbjct: 95 --IVWAQPYGLGLNETLLPQYLKTLGYSTHMVGKWHLGFYDKEHTPTKRGFDSHLGYYTG 152
Query: 139 YLTYND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y D + D+ + R ++R A +Y T+ FT ++ VI H+ S+PLF
Sbjct: 153 CEDYYDHTWGFTKQDWGLDFWHDREVDRSA---FGQYSTEVFTSEAERVIAEHDVSKPLF 209
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + AVH+G GN LQ P + + F I + +RR+FA
Sbjct: 210 LYLAQQAVHSGNPGNKV----RLQAP--WKYVKNFMGIKSEERRVFA 250
>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
purpuratus]
Length = 514
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 126/227 (55%), Gaps = 21/227 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GW+DV HG + IPTPNID LA +G+ L +Y P CTPSR+A +TG++P G+
Sbjct: 22 GWDDVSLHGSSQIPTPNIDTLAQDGVTLTNYYVSPLCTPSRSAIMTGRHPIHTGLQFGVI 81
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
+ P G G + EK + QYLK LGYSTH +GKWH+G +E P RGFD+ G+
Sbjct: 82 SPEAPYGLG------LEEKTMAQYLKTLGYSTHAVGKWHLGYFAKEYTPTWRGFDSFFGF 135
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
+NG Y G D +N + Y P +Y TD F ++ +IK+HN S+PLF
Sbjct: 136 YNGRGDYYTHEEVQSEVSGYDLHKNGKVYRPAF-GQYSTDIFNQEAEQIIKAHNASQPLF 194
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + H AVH G P LLQ PD + + F HI RR++A
Sbjct: 195 LYLAHQAVHAGV-----YPDRLLQAPD--KYYQRFPHIETEGRRMYA 234
>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
Length = 545
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 99/222 (44%), Positives = 129/222 (58%), Gaps = 28/222 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGFHG IPTPNIDALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 37 GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ EK+LPQYL +LGY++H+ GKWH+G K E P RGF +H W
Sbjct: 97 YAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFSSHKNLW----- 151
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD RN A + +Y TD T SV VI +H+ ++ PLFL + H
Sbjct: 152 ------------GLDM-RNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFLYVAH 198
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AA H+G P L VPD ++ HI + RR +A
Sbjct: 199 AAGHSGN------PYNPLPVPD--DDVMKLDHILHYKRRRYA 232
>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
Length = 545
Score = 170 bits (431), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 99/222 (44%), Positives = 129/222 (58%), Gaps = 28/222 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGFHG IPTPNIDALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 37 GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ EK+LPQYL +LGY++H+ GKWH+G K E P RGF +H W
Sbjct: 97 YAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFSSHKNLW----- 151
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD RN A + +Y TD T SV VI +H+ ++ PLFL + H
Sbjct: 152 ------------GLDM-RNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFLYVAH 198
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AA H+G P L VPD ++ HI + RR +A
Sbjct: 199 AAGHSGN------PYNPLPVPD--DDVMKLDHILHYKRRRYA 232
>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 374
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 134/227 (59%), Gaps = 15/227 (6%)
Query: 20 LPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID- 78
+ QGW+DV HG + I TPNID LA G+ L +Y P CTP+R+A +TGK+P G+
Sbjct: 40 IEQGWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGKHPIHTGMQH 99
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+GA + + EK + Q+LK LGYSTH +GKWH+G E+ +P RGFD+ GY+NG
Sbjct: 100 DTIGADEPWGLGLDEKTMAQHLKSLGYSTHAVGKWHLGYFAEDYIPTRRGFDSFFGYYNG 159
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y T+ D+ E F G D +N E + P +Y T+ FT+++ +IK+HN S+PLF
Sbjct: 160 RGDYYTHEDT--EGGFG-GYDLHKNGEVHWPDF-GQYSTEIFTEEAQQIIKTHNASQPLF 215
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + H AVH G G L++ P + + F +I RR+FA
Sbjct: 216 LYLAHQAVHAGVYGKD-----LVEAP--HKYYQMFPNIKTEGRRMFA 255
>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
Length = 556
Score = 169 bits (428), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 89/190 (46%), Positives = 117/190 (61%), Gaps = 20/190 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+NDVGFHG IPTPNIDALAY+G++LNR+Y P CTPSR++ +TGKY G+ V
Sbjct: 37 GFNDVGFHGSAQIPTPNIDALAYSGLILNRYYVAPICTPSRSSLMTGKYAIHTGMQHTVL 96
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P+ EKLLPQYL +LGY++H+ GKWH+G K P RGF++H G+W
Sbjct: 97 YGAEPRGLPLEEKLLPQYLNDLGYTSHIAGKWHLGHWKMPYTPLRRGFNSHHGFW----- 151
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD RN R A ++ +Y TD T S+ VI +H S+ PLFL + H
Sbjct: 152 ------------GLDM-RNGSRVAYELHGQYTTDVITQHSIDVIANHPVSKGPLFLYVAH 198
Query: 201 AAVHTGTAGN 210
AA H+G N
Sbjct: 199 AAAHSGNPYN 208
>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 311
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 88/221 (39%), Positives = 132/221 (59%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+D HG + IPTPN+DA+A +GI+LN+HY P CTPSRAA +TG+YPF G+ + +
Sbjct: 1 GWDDTSIHGSSQIPTPNMDAIAADGIILNQHYVQPLCTPSRAALMTGRYPFHVGMQHSVI 60
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A+P+ L+P+Y + LGY TH++GKWH+G + +P RGFD +G++N L
Sbjct: 61 KPAEPWALPLNYTLMPEYFRCLGYKTHMVGKWHLGYYDRQYVPIKRGFDTFLGFYNPSLD 120
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + + G D R + Y + +Y T ++T+++V +I+ HN S P+FL ++H
Sbjct: 121 YYNQNFTGNNHTGHDFRCGDQNYWAE-EKEYATYYYTNKTVEIIRRHNKSAPMFLFLSHQ 179
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H + G LLQVP R F++I +R LFA
Sbjct: 180 APHV-SGGRP-----LLQVP--THGVRNFSYIGENNRTLFA 212
>gi|156364432|ref|XP_001626352.1| predicted protein [Nematostella vectensis]
gi|156213225|gb|EDO34252.1| predicted protein [Nematostella vectensis]
Length = 270
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 92/224 (41%), Positives = 128/224 (57%), Gaps = 14/224 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DV FHG + IPTP ID LA G++LN +Y P CTP+RA+ +TGK+P G+
Sbjct: 14 GWDDVSFHGSSQIPTPTIDKLASEGVILNSYYVSPICTPTRASLMTGKHPMNLGMLIHTH 73
Query: 83 AGV----AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
A V +P+ E PQY+K LGY TH IGKWH+G ++E P RGFD+ G+WNG
Sbjct: 74 ATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPTYRGFDSFYGFWNG 133
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y D + D G D R N E+ S Y T+ F +++ +I HN ++PL+L +
Sbjct: 134 KEDYWDHSSQED-VWGTDLRDN-EKPVRNESGHYGTELFAERAAQIIHLHNQTKPLYLYL 191
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
VH + N P LQ P + + F+HIS+P RR++A
Sbjct: 192 AQQGVH---SANGNEP---LQAP--KRLIKKFSHISSPKRRIYA 227
>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
Length = 542
Score = 168 bits (425), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 126/222 (56%), Gaps = 28/222 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGF G IPTPNIDALAY+G++LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 43 GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 102
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ K+LPQYL ELGY++H+ GKWH+G K P RGF +H GYW
Sbjct: 103 YAAEPRGLPLDLKILPQYLNELGYTSHIAGKWHLGHWKRVYTPLYRGFSSHHGYW----- 157
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD R E A + +Y TD T S+ VI +H ++ PLFL + H
Sbjct: 158 ------------GLDMRNGTE-IAYDLHGQYTTDVITQHSLQVIANHKPAKGPLFLYVAH 204
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AAVH+G N +P ++ R I + RR +A
Sbjct: 205 AAVHSGNPYNP--------LPASDDAVRRLDKIQHYKRRKYA 238
>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 557
Score = 166 bits (421), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 91/224 (40%), Positives = 130/224 (58%), Gaps = 15/224 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
GWND FHG +IPTPN+DALA +G++L HY P CTP+RAA LTG YPF G+ +
Sbjct: 91 GWNDASFHGSAEIPTPNLDALASSGVILQSHYAQPMCTPTRAALLTGLYPFHTGMQNFVI 150
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P+ K+LP YL E Y +HL+GKWH+G + LP R F+ HVGY+NG++
Sbjct: 151 RTGEPWGLPLDYKILPHYLDEAYYHSHLVGKWHLGMHNPAFLPTARHFNTHVGYYNGFID 210
Query: 142 YNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y H + D +GLD N E + Y T FT ++V++I++H + PLF+ +
Sbjct: 211 YFTHEHISPGNDSLIGLDWHINEEN---ENEEGYATHLFTKRAVNLIENHKSTEPLFILL 267
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+H A H G + Q P E+ FAHI + +R+++A
Sbjct: 268 SHLAPHAGCKRDP------FQAP--RESIEKFAHIKDQNRKVYA 303
>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
Length = 531
Score = 166 bits (421), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 94/222 (42%), Positives = 127/222 (57%), Gaps = 28/222 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGF G IPTPNIDALAY+G++LNR+Y P CTPSR+A +TGKYP G+ T +
Sbjct: 32 GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 91
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ K+LPQYL +LGY++H+ GKWH+G + P RGF +H GYW
Sbjct: 92 YAAEPRGLPLDLKILPQYLNDLGYTSHIAGKWHLGHWQRVYTPLYRGFSSHHGYW----- 146
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD R E A + +Y TD T S++VI HN ++ PLFL + H
Sbjct: 147 ------------GLDMRNGTE-VAYDLHGQYSTDVITQHSLNVISKHNATKGPLFLYVAH 193
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AAVH+G N +P ++ R I + RR +A
Sbjct: 194 AAVHSGNPYNP--------LPVKDDAVRRLDTIQHYKRRKYA 227
>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
Length = 541
Score = 166 bits (420), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 100/222 (45%), Positives = 125/222 (56%), Gaps = 31/222 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G+NDVGFHG IPTPNIDALAY+GI+LNR+Y P CTPSR+A +TGKYP G+ V
Sbjct: 38 GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVTPICTPSRSALMTGKYPIHTGMQHAVL 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + + + EK+LPQYL +LGY++H+ GKWH+G K + P RGF +H W
Sbjct: 98 YAAEPRGLSLKEKILPQYLNDLGYTSHIAGKWHLGHWKLKYTPLFRGFSSH---W----- 149
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQITH 200
GLD R E A + +Y TD TD +V VI +HN S PLFL + H
Sbjct: 150 ------------GLDMRNGTE-VAYDLHGRYTTDVITDHAVKVIANHNTTSGPLFLYVAH 196
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AA H+ P L VPD E HI + RR FA
Sbjct: 197 AACHSSN------PYNPLPVPDNEV--MKLGHIPHYKRRKFA 230
>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
Length = 500
Score = 166 bits (420), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 90/220 (40%), Positives = 123/220 (55%), Gaps = 9/220 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW D HG IPTPN+DALA G++LN +Y P C+PSRAA +TG YP GI P+
Sbjct: 37 GWADTSLHGSAQIPTPNLDALASTGVLLNNYYVQPLCSPSRAALMTGLYPAHNGIRMPLM 96
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+P+ K+LP++LK+LGY TH++GKWH+G + P RGFD G++NG + Y
Sbjct: 97 GAQVAGLPLQFKILPEHLKDLGYETHMVGKWHLGHSSLNYTPTYRGFDTFFGFYNGPIDY 156
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
I E + +GLD N R P Y T F D + ++I + N S+PLFL + H A
Sbjct: 157 YHGIMEQEGHIGLDF-WNGTRALPLEERIYATTRFQDHANYIIANRNASKPLFLYLAHQA 215
Query: 203 VHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
VH+ A P LQ P EN + F + + R+ A
Sbjct: 216 VHS-----AYEPE-FLQAPG--ENTKKFPFLGDASRKSLA 247
>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
Length = 619
Score = 166 bits (420), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 92/225 (40%), Positives = 130/225 (57%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G++D+ G + TPNIDALAY+G +L+R Y CTPSR A L+GKYP G +
Sbjct: 44 GFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKYPIHTGTQHYVI 103
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + + L+P+ +E GYST+L+GKWH+G ++ E P +RGFDNH GYW Y+
Sbjct: 104 GNEEPWGLALNTTLMPEIFREAGYSTNLVGKWHLGFSRPEYTPTHRGFDNHYGYWGAYID 163
Query: 142 YNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + +++VG D RRNM+ Y+TD T+++ +I+ +PLFL
Sbjct: 164 YYQRRSQMPLGNYSVGYDFRRNMQVECTDRGV-YVTDLLTNEAERIIREQPAKEQPLFLI 222
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT GN P LQ P EE R F HI +P+RRL+A
Sbjct: 223 LSHLAPHT---GNTNKP---LQAP--EEELRKFTHIKDPNRRLYA 259
>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 476
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 132/230 (57%), Gaps = 27/230 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GWNDV FHG + IPTP+IDALA G++L +Y P CTP+R+A +TGK+P G+
Sbjct: 40 GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G E ++PQYL+ LGY TH++GKWH+G KE L P +RGF+++ GY
Sbjct: 100 IADEPYGLG------TNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGY 153
Query: 136 WNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+ G Y T+ + H G D N Y P + +Y T+ +T+++ +I++HN
Sbjct: 154 YGGMQDYFTHESTEHT---LTGFDFHVNGSIYKP-VFGQYSTEIYTEKTQEIIRNHNPQE 209
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PL++ + H AVH+ + LQ P + F +I+N +RR FA
Sbjct: 210 PLYIYLAHQAVHSANYNGQR-----LQAP--YKYYERFPNITNENRRKFA 252
>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
Length = 498
Score = 164 bits (416), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 91/227 (40%), Positives = 131/227 (57%), Gaps = 23/227 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
GW+D+ FHG IPTPN+DALA +G++LN +Y P TPSRA+F+TGKYP G+
Sbjct: 12 GWDDISFHGSPQIPTPNLDALANSGVILNNYYVSPMDTPSRASFMTGKYPIHMGVQHDTL 71
Query: 79 ---TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G VP+TEK LP++L+E+GY TH +GKW +G +E P RGFD+ G+
Sbjct: 72 HNRQPFG------VPLTEKFLPEFLREMGYQTHAVGKWQLGFFAKEYTPTYRGFDSFFGF 125
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
W + Y + + D G+D RRN++ + + Y T+ ++ VI++H+ +PLF
Sbjct: 126 WTSHEDYYNHV-ANDGGYGIDLRRNLD-VSNDHTGVYGTELLAREADEVIENHSGDKPLF 183
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + H AVH GN P LQ P + F +I++ RR FA
Sbjct: 184 LYLAHQAVHV---GNMDEP---LQAPKRHVD--KFKYITDERRRTFA 222
>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
Length = 784
Score = 164 bits (415), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 95/225 (42%), Positives = 134/225 (59%), Gaps = 18/225 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DV FHG IPTPNID+LA +G++LN +Y P+ ++ F+TGKY G V
Sbjct: 1 GWDDVSFHGSPQIPTPNIDSLAKSGVILNNYYVSPSSFATKTEFMTGKYATHLGTQHGVL 60
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P TEK+LPQYLKE GY+ + +GKW +G KEE+LP+ RGFD ++ G LT
Sbjct: 61 HNKQPFGLPHTEKILPQYLKEAGYNNYAVGKWALGYYKEEMLPWKRGFD----FFYGGLT 116
Query: 142 YNDSIHET----DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ + T D GLD RRN E + + Y+T+ +T ++V++IK++N ++PLFL
Sbjct: 117 SSGKDYYTHSAFDENYGLDLRRNNEVIHNE-TGNYITEVYTREAVNIIKNYNDNKPLFLY 175
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H AVHTG A + LQ P E + HI N R+LFA
Sbjct: 176 VAHQAVHTGNADDP------LQAP--ESYLKKLNHIKNIKRKLFA 212
>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
Length = 564
Score = 162 bits (411), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 90/225 (40%), Positives = 130/225 (57%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALAY+G +L+R Y CTPSR A L+G+YP G V
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ + L+P+ KE GYST+L+GKWH+G ++ E P RGFD H GYW Y+
Sbjct: 98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + ++++G D RRNME + Y+TD T ++ +IK H + +PLFL
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNME-LECRDRGVYVTDLLTAEAERLIKDHADKEQPLFLM 216
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE + F++I +P+RR +A
Sbjct: 217 LSHLAAHTANEDDP------LQAP--EEEIQKFSYIKDPNRRKYA 253
>gi|442749327|gb|JAA66823.1| Putative arylsulfatase b precursor [Ixodes ricinus]
Length = 274
Score = 162 bits (411), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 86/221 (38%), Positives = 130/221 (58%), Gaps = 10/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DV FHG IPTPN+DALA +GI+LN+HY CTPSRAA +TG+YP G+ +
Sbjct: 59 GWDDVSFHGSPQIPTPNMDALAADGIILNQHYAQALCTPSRAALMTGRYPIYTGMQHFVI 118
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P+ +L+P++ +LGY TH++GKWH+G K++ +P RGFD+ G++N
Sbjct: 119 QPGEPWGLPLEYRLMPEFFSDLGYKTHMVGKWHLGSFKKDFIPVRRGFDSFYGFYNADQD 178
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + G D N E +++Y T +T+++V +I+SHN S+PLFL +++
Sbjct: 179 YYNKTLTEGEHTGYDFWLN-EDIHIYPNNRYSTHHYTERAVSLIRSHNPSQPLFLYLSYX 237
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
GT LL+ P EEN F +I +R ++A
Sbjct: 238 XXXVGTG------PSLLEAP--EENVNKFLYIPEKNRTIYA 270
>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 479
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 128/230 (55%), Gaps = 24/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GWNDV FHG + IPTP+IDALA G++L +Y P CTP+R+A +TGK+P G+
Sbjct: 40 GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G E ++PQYL+ LGY TH++GKWH+G +E P RGF++ GY
Sbjct: 100 IADEPYGLG------TNETIMPQYLRSLGYRTHMVGKWHLGFYSKEHTPIERGFESTFGY 153
Query: 136 WNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+ G Y T+ + G D N Y P + +Y T+ +T+++ +I++HN
Sbjct: 154 YLGQQDYFTHETQVKRKHTLTGFDFHVNGSIYKP-VFGQYSTEIYTEKTQEIIRNHNPQE 212
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PL++ + H AVH+ + LQ P + F +I+N +RR FA
Sbjct: 213 PLYIYLAHQAVHSANYNGQR-----LQAP--YKYYERFPNITNENRRKFA 255
>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
Length = 565
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALAY+G +L+R Y CTPSR A L+G+YP G V
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ + L+P+ K GYST+L+GKWH+G ++ E P RGFD H GYW Y+
Sbjct: 98 SNEEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + ++++G D RRNME + Y+TD T ++ +IK H + +PLFL
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNMELECRDRGA-YVTDLLTAEAERLIKDHADKEQPLFLM 216
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE + FA+I +P+RR +A
Sbjct: 217 LSHLAAHTANKDDP------LQAP--EEEIQKFAYIKDPNRRKYA 253
>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
Length = 565
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALAY+G +L+R Y CTPSR A L+G+YP G V
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ + L+P+ KE GYST+L+GKWH+G ++ E P RGFD H GYW Y+
Sbjct: 98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + ++++G D RRNME + Y+TD T ++ +IK H +PLFL
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNME-LECRDRGVYVTDLLTAEAERLIKGHAGKEQPLFLM 216
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE + F++I +P+RR +A
Sbjct: 217 LSHLAAHTANEDDP------LQAP--EEEIQKFSYIKDPNRRKYA 253
>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
Length = 565
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 90/225 (40%), Positives = 129/225 (57%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALAY+G +L+R Y CTPSR A L+G+YP G V
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ + L+P+ K GYST+L+GKWH+G ++ E P RGFD H GYW Y+
Sbjct: 98 SNKEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + ++++G D RRNME + Y+TD T ++ +IK H + +PLFL
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNMELECRDRGA-YVTDLLTAEAERLIKDHADKEQPLFLM 216
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE + FA+I +P+RR +A
Sbjct: 217 LSHLAAHTANKDDP------LQAP--EEEIQKFAYIKDPNRRKYA 253
>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
Length = 542
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 81/190 (42%), Positives = 116/190 (61%), Gaps = 17/190 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DVGFHG IPTPN+DALA +GI+L+ HY+ P CTPSR + LTGK+P + G+ V
Sbjct: 35 GWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV- 93
Query: 83 AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ A P + EKLLP+YLK LGY +H++GKWH+G +E P RGFD+H G++
Sbjct: 94 --IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGA 151
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y+T+ + DF + R+ + + Y T FT ++ ++ HN + P+F
Sbjct: 152 SEDYMTHIGGMGGLDFWLNGQPDRSGQGH-------YSTTLFTTKAEQLLAEHNQTEPMF 204
Query: 196 LQITHAAVHT 205
L +H AVHT
Sbjct: 205 LYFSHQAVHT 214
>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
Length = 549
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 81/190 (42%), Positives = 116/190 (61%), Gaps = 17/190 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DVGFHG IPTPN+DALA +GI+L+ HY+ P CTPSR + LTGK+P + G+ V
Sbjct: 12 GWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV- 70
Query: 83 AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ A P + EKLLP+YLK LGY +H++GKWH+G +E P RGFD+H G++
Sbjct: 71 --IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGA 128
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y+T+ + DF + R+ + + Y T FT ++ ++ HN + P+F
Sbjct: 129 SEDYMTHIGGMGGLDFWLNGQPDRSGQGH-------YSTTLFTTKAEQLLAEHNQTEPMF 181
Query: 196 LQITHAAVHT 205
L +H AVHT
Sbjct: 182 LYFSHQAVHT 191
>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
Length = 354
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 81/190 (42%), Positives = 116/190 (61%), Gaps = 17/190 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DVGFHG IPTPN+DALA +GI+L+ HY+ P CTPSR + LTGK+P + G+ V
Sbjct: 12 GWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV- 70
Query: 83 AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ A P + EKLLP+YLK LGY +H++GKWH+G +E P RGFD+H G++
Sbjct: 71 --IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGA 128
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y+T+ + DF + R+ + + Y T FT ++ ++ HN + P+F
Sbjct: 129 SEDYMTHIGGMGGLDFWLNGQPDRSGQGH-------YSTTLFTTKAEQLLAEHNQTEPMF 181
Query: 196 LQITHAAVHT 205
L +H AVHT
Sbjct: 182 LYFSHQAVHT 191
>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 502
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 96/223 (43%), Positives = 129/223 (57%), Gaps = 14/223 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI--DTP 80
GWND+ FHG N+IPTPNID LA NG++L+ +Y LP CTPSR+A +TG+YP G+ DT
Sbjct: 31 GWNDISFHGSNEIPTPNIDRLANNGVILDNYYVLPICTPSRSAIMTGRYPIHTGMQQDTI 90
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G V + EK LPQYLK+ GY TH +GKWH+G ++ P RGFD++ G + G
Sbjct: 91 FGPN-PYGVGLNEKFLPQYLKQQGYKTHGVGKWHLGFFAKQYTPTYRGFDSYYGSYLGKG 149
Query: 141 TY-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y N S ET GLD N E Y T+ +T +++ I +HN S PLFL +
Sbjct: 150 DYWNHSNTET--YSGLDLHDN-ENGVFSQDGNYSTEMYTAEAISCINNHNSSEPLFLYLA 206
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ AVH+ A LQ P +E F++I + RR +A
Sbjct: 207 YQAVHS-----ANTEEDPLQAP--QEWIDKFSYIKHEQRRKYA 242
>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
Length = 554
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 130/225 (57%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALAY+G +L+R Y CTPSR A L+G+YP G V
Sbjct: 38 GFDDVSFRGGREFVTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ + L+P+ KE GYST+L+GKWH+G ++ E P RGFD H GYW Y+
Sbjct: 98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + ++++G D RRNM+ + Y+TD T ++ +IK H + +PLFL
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNMD-LECRDRGVYVTDLLTTEAERLIKDHADKEQPLFLM 216
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE + F++I +P+RR +A
Sbjct: 217 LSHLAAHTANEDDP------LQAP--EEEIQKFSYIKDPNRRKYA 253
>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 588
Score = 160 bits (404), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 80/186 (43%), Positives = 112/186 (60%), Gaps = 5/186 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV ++G I TPNIDALA+NGI L R+YT P CTPSRAA +TG+YP G+ +
Sbjct: 76 GWNDVSYNGCPQIRTPNIDALAWNGIRLQRYYTQPMCTPSRAALMTGRYPIHTGMQHFVI 135
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ KLLPQ+L +LGY + ++GKWH+G K+E P RGF H+G W G++
Sbjct: 136 LQNEPRGLPLKFKLLPQWLGDLGYVSQMLGKWHLGFYKKEYTPTMRGFQKHIGSWGGFVD 195
Query: 142 YNDSIHETDFAV---GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y I GLD R+ + + +Y T+F T+ + VI++H +PLFL +
Sbjct: 196 YYSHIRFNKIGFSHSGLDFRQGLSE-GREFDGQYYTEFMTEAATRVIENHPLEKPLFLYL 254
Query: 199 THAAVH 204
H A H
Sbjct: 255 AHLAPH 260
>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 593
Score = 159 bits (403), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 97/260 (37%), Positives = 133/260 (51%), Gaps = 49/260 (18%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DV FHG + IPTPN+D LA +G++LN +Y P CTPSRAA +TG YP R G+ P+
Sbjct: 47 GWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLYPIRTGMQGMPI 106
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY------ 135
+P ++LPQYLKE GY THL+GKWH+G KE L P RGFD+ GY
Sbjct: 107 DVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFDSFYGYYYGESD 166
Query: 136 -------------WNGYLTYN------DSIH-----ETDFAV---------GLDARRNME 162
W +LT DS + E+D+ GLD N +
Sbjct: 167 YFAHTISYVRHLSWAFFLTRKCXCRGFDSFYGYYYGESDYFAHTISYENHTGLDFWLNKK 226
Query: 163 RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPD 222
++ + Y T FT ++ ++I++ S+PL L ITH A H L LQ P
Sbjct: 227 PVWSEIGT-YSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCA------LERERLQAP- 278
Query: 223 MEENDRTFAHISNPDRRLFA 242
+EN F +I +R ++A
Sbjct: 279 -QENIDKFPYIGEKNRTIYA 297
>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
Length = 528
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 92/222 (41%), Positives = 124/222 (55%), Gaps = 31/222 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVGF G IPTPNIDALAY+G++LNR+Y P CTPSR+A +T KYP G+ T +
Sbjct: 32 GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTAKYPIHTGMQHTVL 91
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P+ K+LPQYL +LGY++H+ GKWH+G K P RGF +H W
Sbjct: 92 YAAEPRGLPLNLKILPQYLNDLGYTSHIAGKWHLGHWKRVYTPLYRGFSSH---W----- 143
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
GLD R E A + +Y TD T S++VI HN ++ PLFL + H
Sbjct: 144 ------------GLDMRNGTE-LAYDLHGQYTTDVITQHSLNVIAKHNSTKGPLFLYVAH 190
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AAVH+G N +P ++ R I + RR +A
Sbjct: 191 AAVHSGNPYNP--------LPAKDDIVRRLGTIQDYKRRKYA 224
>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
Length = 554
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 129/225 (57%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALAY+G +L+R Y CTPSR A L+G+YP G V
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ + L+P+ KE GYST+L+GKWH+G ++ E P RGFD H GYW Y+
Sbjct: 98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + ++++G D RRNM+ + Y+TD T ++ +IK H + +PLFL
Sbjct: 158 YFQRRSKMPVANYSLGYDFRRNMD-LECRDRGVYVTDLLTTEAERLIKDHADKEQPLFLM 216
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE + F++I + +RR +A
Sbjct: 217 LSHLAAHTANEDDP------LQAP--EEEIQKFSYIKDSNRRKYA 253
>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
Length = 575
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 128/225 (56%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALA++G +L+R Y CTPSR A L+G+YP G V
Sbjct: 48 GFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 107
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + + L+P+ ++ GYST+LIGKWH+G ++ E P RGFD H GYW Y+
Sbjct: 108 SNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYID 167
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQ 197
Y + ++++G D RRNME + Y+TD T+++ VI+ +PLFL
Sbjct: 168 YYQRRSKMPARNYSLGYDFRRNME-LECRDRGVYVTDLLTNEAERVIREREGQEQPLFLV 226
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE R FA+I +P+RR +A
Sbjct: 227 LSHLATHTANEDDP------LQAP--EEEIRKFAYIKDPNRRKYA 263
>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
Length = 575
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 127/225 (56%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALA++G +L+R Y CTPSR A L+G+YP G V
Sbjct: 48 GFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 107
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + + L+P+ ++ GYST+LIGKWH+G ++ E P RGFD H GYW Y+
Sbjct: 108 SNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYID 167
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQ 197
Y + ++++G D RRNME + Y+TD T+++ VI+ PLFL
Sbjct: 168 YYQRRSKMPARNYSLGYDFRRNME-LECRDRGVYVTDLLTNEAERVIREREGQEEPLFLV 226
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE R FA+I +P+RR +A
Sbjct: 227 LSHLATHTANEDDP------LQAP--EEEIRKFAYIKDPNRRKYA 263
>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
Length = 531
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 91/221 (41%), Positives = 122/221 (55%), Gaps = 11/221 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP-V 81
GW D HG IPTPN+DALA G++LN +Y P C+PSR A +TG YP GI P V
Sbjct: 37 GWADTSLHGSAQIPTPNLDALASTGVLLNNYYVQPLCSPSRGALMTGLYPAHNGIRMPLV 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
GA VA +P+ K+LP++LK+LGY TH++GKWH+G P RGFD G+ NG +
Sbjct: 97 GAQVA-GLPLQFKILPEHLKDLGYETHIVGKWHLGYFNLNYTPTYRGFDTFFGFHNGPID 155
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y I E + VGLD N P Y T + + +I + N S+PLFL + H
Sbjct: 156 YYRGIMEQEGHVGLDF-WNGTSALPLKERTYATARLQNHAKSIIANRNTSKPLFLYLAHQ 214
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ + LQ P EN + F +I + R++ A
Sbjct: 215 AVHSVYSPE------FLQAP--VENTKKFPYIRDSSRKILA 247
>gi|410446533|ref|ZP_11300636.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980205|gb|EKO36956.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 517
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 91/222 (40%), Positives = 127/222 (57%), Gaps = 23/222 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DV +HG + IPTPNIDALA NG+ LNR Y P C+P+RA+ LTG + F +GI P+
Sbjct: 30 GWGDVSYHGGH-IPTPNIDALAKNGVELNRFYASPVCSPTRASLLTGLHIFNHGIIRPLA 88
Query: 83 AGVAK--AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
A+ +PV K++PQ+ KE GY T L GKWH+G + EE P NRGFD G+ G +
Sbjct: 89 NPTAEQYGLPVDLKIMPQFFKEAGYQTALSGKWHLGMHLEEYWPTNRGFDQSYGHMLGGI 148
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D +H + LD RN E P Y T+ +++V +I++ + +RPLFL +
Sbjct: 149 GYFDHVHSSR----LDWHRNEE---PLFEDGYSTELIANEAVRIIETKDPNRPLFLYVAF 201
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A HT +Q PD +N F++I +P R +A
Sbjct: 202 NAPHTP-----------IQAPD--KNIELFSYIEDPLDRAYA 230
>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
Length = 570
Score = 157 bits (398), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 127/225 (56%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALAY+G +L+R Y CTPSR A L+G+YP G V
Sbjct: 41 GFDDVSFRGGREFITPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 100
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ L+P+ + GYST+L+GKWH+G ++ E P +RGFD H GYW Y+
Sbjct: 101 SNEEPWALDSNATLMPEIFQRAGYSTNLVGKWHLGFSRPEYTPTHRGFDYHYGYWGAYID 160
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + ++++G D RRNME + Y+TD T ++ +IK +PLFL
Sbjct: 161 YYQRRSKMPVANYSLGYDFRRNMELECRDRGT-YVTDLLTTEAERLIKEQAGKDKPLFLM 219
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE R F++I +P+RR +A
Sbjct: 220 LSHLATHTANEDDP------LQAP--EEEIRKFSYIKDPNRRKYA 256
>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
Length = 560
Score = 157 bits (396), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 87/225 (38%), Positives = 127/225 (56%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+ G + TPNIDALAY+G +L+R Y CTPSR A L+GKYP G V
Sbjct: 44 GFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKYPIHTGTQHSVI 103
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + L+P+ ++ GYST+L+GKWH+G + E P RGFD H GYW GY+
Sbjct: 104 LNEEPWGLALNATLMPEIFRDAGYSTNLVGKWHLGFVRPEYTPTYRGFDYHYGYWGGYID 163
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPLFLQ 197
Y + ++++G D RRNM+ Y+TD T+++ +I+ +PLFL
Sbjct: 164 YYQRRSQMPSDNYSMGYDFRRNMQVECTD-RGVYVTDLLTNEAERIIREQPAKEQPLFLI 222
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HTG + LQ P EE + F HI++P+RRL+A
Sbjct: 223 LSHLAPHTGNEIDP------LQAP--EEELQKFVHINDPNRRLYA 259
>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 511
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 87/222 (39%), Positives = 125/222 (56%), Gaps = 10/222 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW+DV FHG IPTPN+D LA +GI+LN +Y P CTPSRAA +TG YP G+ V
Sbjct: 1 QGWDDVSFHGSAQIPTPNMDTLAADGIILNNYYVQPACTPSRAALMTGLYPIHTGMQHGV 60
Query: 82 -GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+P++ ++PQYLK LGY TH++GKW++G K P RGFD+ GY++
Sbjct: 61 LSPAEPYGLPLSVSIMPQYLKNLGYETHIVGKWNLGNYKLSYTPTFRGFDSFYGYYSAVE 120
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y + D GLD N + +S Y T +T+++ +I++ + S+P FL + +
Sbjct: 121 DYYNHTVLWDNQTGLDFWLNTQPLR-NVSGIYSTQLYTERTKFLIENRDVSKPFFLYLPY 179
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH G + LQ P +EN F +I +R +FA
Sbjct: 180 QAVHCGNFDDP------LQAP--QENIDKFPYIGEENRTIFA 213
>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
Length = 545
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 93/227 (40%), Positives = 128/227 (56%), Gaps = 14/227 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+D+ HG IPTPNIDALA +GI+LN +Y P CTPSRAA LTGK+P G+ +
Sbjct: 38 GWDDISLHGSEQIPTPNIDALAADGILLNNYYVQPICTPSRAALLTGKHPVHLGLQHNTI 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A A + + E++LP+YL LGY +H++GKWH+G + P RGF +H GY NG
Sbjct: 98 PAPSAYGLGLNERILPEYLNTLGYDSHMVGKWHLGYFTPQHTPTYRGFKSHFGYLNGCED 157
Query: 142 YNDSIHETDFAV------GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y D DF+ GLD + + + +Y T+ FT ++ +I+S N PLF
Sbjct: 158 YLDHTLAYDFSTLGMDGWGLDFWNDTKIHRTSF-GQYSTEIFTTRAEELIRS-NTGEPLF 215
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L ++H AVH+G G L LQ P N F +I + +RR A
Sbjct: 216 LYMSHQAVHSGNPG---LNGSKLQAPWKYFN--KFNYIQSDERRRLA 257
>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 529
Score = 156 bits (395), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 81/185 (43%), Positives = 108/185 (58%), Gaps = 2/185 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DV F G+ IPTPN+D LA GI+LN +Y P C PSR A ++G YP G+ V
Sbjct: 31 GWADVSFRGDPQIPTPNLDVLASQGIILNNYYVQPLCAPSRGALMSGLYPIHTGLQHLVP 90
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G G +P ++P+YLK LGY+TH+IGKWH+G +KE P RGFD+ GY NG
Sbjct: 91 GPGEPWGLPTNLTIMPEYLKNLGYATHMIGKWHLGYHKESYTPTRRGFDSFYGYLNGGED 150
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D A GLD N + + Y T+ FT ++ +IK H+ ++P+FL +H
Sbjct: 151 YYDHTILWSNASGLDFWENTTPVRNE-GNHYSTELFTKKAQSLIKHHDPAKPMFLYFSHQ 209
Query: 202 AVHTG 206
AVH G
Sbjct: 210 AVHCG 214
>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
Length = 1007
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 88/226 (38%), Positives = 127/226 (56%), Gaps = 18/226 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV FH I TPNID LAY+G++LNRHY P T S+ A +TG +P G +
Sbjct: 467 GWNDVSFHSSKQIFTPNIDVLAYHGVILNRHYCAPFGTASQVALMTGSHPLSVGTQS--- 523
Query: 83 AGVAKAVPVT----EKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
A P T KL+P+Y ++ GY+THLIGKW +G ++++ P RGFD+H G+
Sbjct: 524 ASNEPDQPWTLDPELKLMPEYFRDAGYATHLIGKWGLGFSRKDYTPTQRGFDSHFGFLGP 583
Query: 139 YLTYND-SIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y+ Y D S+ + + GLD RRN++ ++ Y TD F ++V +I+ H+ +PL L
Sbjct: 584 YIDYWDHSLRLRNTSTRGLDMRRNLD-VDYSVNGSYATDLFNGEAVRLIREHDQKKPLLL 642
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+TH A HTG + +Q P E F +I + RR+ A
Sbjct: 643 VLTHLAPHTGNEDDP------MQAP--AEEVEKFDYIRDEKRRVLA 680
>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
Length = 380
Score = 155 bits (392), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 91/219 (41%), Positives = 121/219 (55%), Gaps = 12/219 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DV FHG IPTPN+D LA G++LN +Y P CTP+RA+ +TGKYP G+ +
Sbjct: 16 GWDDVSFHGSPQIPTPNLDYLATRGVILNNYYVSPICTPTRASLMTGKYPIHLGMQHFVI 75
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ E LPQYL+ GY T IGKWH+G +E P RGFD+ G W+
Sbjct: 76 YAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTYRGFDSFYGMWSAKAD 135
Query: 142 Y-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y N + E F G D R NME KY T+ FT +++ VI++HN S PLFL I H
Sbjct: 136 YWNHTSFENGFW-GTDMRNNMEPVTTD-KDKYATEVFTREALKVIENHNKSEPLFLYIAH 193
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
A H+ P LQ P E+ + F+ + + R
Sbjct: 194 QAPHSAN------PHDPLQAP--EDKVKKFSGVIDKIER 224
>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
Length = 548
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 93/227 (40%), Positives = 128/227 (56%), Gaps = 16/227 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G N+ TPNIDALAY+G++LN YT CTPSRAA LTGKYP G+ V
Sbjct: 6 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGMQHYVI 65
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P EK + + +E GY T L+GKWH+G ++ P RGFD+H+GY Y+
Sbjct: 66 VNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYLGAYVD 125
Query: 142 YNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NHSRPLF 195
Y D ++ ++A G D R N+ Q+ Y+TD +D +V +I+ H N S+PLF
Sbjct: 126 YYDQTYQQNGKNYARGHDFRLNLNVTHDQV-GHYVTDVLSDAAVELIEQHSGSNSSQPLF 184
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L ++H A H A NA P +Q P E F +I N R +A
Sbjct: 185 LLLSHLAPH---AANADDP---MQAP--AEELAKFEYIRNETHRYYA 223
>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
Length = 502
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/226 (38%), Positives = 125/226 (55%), Gaps = 23/226 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DV FHG +PTPNIDALA +GI+L+ +Y C+PSR A +TGK+P + G+ V
Sbjct: 35 GWDDVSFHGSRQVPTPNIDALASDGIILDNYYVHTLCSPSRGALMTGKHPIQIGLQRGV- 93
Query: 83 AGVAKAVP----VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ A P + EKLLP+YL LGY +H++GKWH+G EE P +RGF++H G++ G
Sbjct: 94 --IMPAQPSGLGLKEKLLPEYLNTLGYKSHMVGKWHLGMCAEEYTPMHRGFESHFGFYQG 151
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFL 196
+Y + GLD N E P S+ +Y T FT ++ ++ H+ + P+FL
Sbjct: 152 CESYTTHMCGNS---GLDFWLNEE---PDHSAGGQYSTSLFTAKAEQLLAEHDTASPMFL 205
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ H AVH G PD + +F IS+ RR A
Sbjct: 206 YLAHQAVHVGNQDQK------FYAPDKYTDKLSF--ISDDRRRQMA 243
>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
Length = 576
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/225 (39%), Positives = 126/225 (56%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+ G + TPNIDALA++G +L+R Y CTPSR A L+GKYP G V
Sbjct: 46 GFDDLSIRGGREFLTPNIDALAFHGRLLDRLYAPAMCTPSRGALLSGKYPIHTGTQHFVI 105
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ ++ + L+P+ + GYST+L+GKWH+G + E P +RGFD H GYW Y+
Sbjct: 106 SNQEPWSLKLNTTLMPEIFRAAGYSTNLVGKWHLGYARPEFTPTHRGFDYHYGYWGAYID 165
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPLFLQ 197
Y + + + +G D RRNME Y+TD T+++ VI+ + +PLFL
Sbjct: 166 YYQRRSQMPDKTYIMGYDFRRNMEVECAD-RGVYMTDLLTNEAERVIQETAAKQQPLFLM 224
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
I H AVHTG + LQVP EE + F HI +P+ R +A
Sbjct: 225 INHLAVHTGNDNDP------LQVP--EEELQKFTHIKDPNHRKYA 261
>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 516
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 125/221 (56%), Gaps = 15/221 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G++D+G+H ++ I TPN+D LA G+ L +Y P CTP+R+ ++G+Y G+ +
Sbjct: 51 GFHDIGYH-DSIIKTPNLDRLASEGVKLENYYVQPICTPTRSQLMSGRYQIHTGLQHGII 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E +PQ LKE GY+TH++GKWH+G K+E LP RGFD GY G
Sbjct: 110 WPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTERGFDTFFGYLTGSED 169
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D G+D R NM+ PQ + +Y T F +++ ++IKSH+ PLFL +
Sbjct: 170 YYTHNRSYDKFHGMDFRENMQIVQPQYNGQYSTHVFAEKAKNIIKSHDPQIPLFLYLPFQ 229
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH G LQVPD E + +A+I+N RR +A
Sbjct: 230 AVH-----------GPLQVPDQYE--KPYANITNKQRRTYA 257
>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 528
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 76/194 (39%), Positives = 112/194 (57%), Gaps = 6/194 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+D+ HG + IPTPNID LA G++L +YT CTPSR A +TGKYP G+ V
Sbjct: 32 GWDDISLHGSDQIPTPNIDKLAAEGVLLENYYTQAICTPSRGALMTGKYPIHLGLQYDVI 91
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P K++PQYL Y +H+IGKWH+G ++ ELLP RGF +H G+ G+
Sbjct: 92 QGAQPYGLPTDFKIMPQYLSGTCYKSHIIGKWHLGHSRSELLPTRRGFHSHFGFRLGHSD 151
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSK-----YLTDFFTDQSVHVIKSHNHSRPLFL 196
Y + E V + ++ ++ ++ K Y D FT +++ ++++HN + PLFL
Sbjct: 152 YFNHWGEESSPVKNEMYAGLDLWSNEVPIKKYHGTYANDLFTKRAISILETHNKTTPLFL 211
Query: 197 QITHAAVHTGTAGN 210
+ H AVH G N
Sbjct: 212 YLAHQAVHVGDGEN 225
>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
Length = 548
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 93/227 (40%), Positives = 128/227 (56%), Gaps = 16/227 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G N+ TPNIDALAY+G++LN YT CTPSRAA LTGKYP G+ V
Sbjct: 6 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGMQHYVI 65
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P EK + + +E GY T L+GKWH+G ++ P RGFD+H+GY Y+
Sbjct: 66 VNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYLGAYVD 125
Query: 142 YNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NHSRPLF 195
Y D ++ ++A G D R N+ Q+ Y+TD +D +V +I+ H N S+PLF
Sbjct: 126 YYDQTYQQNGKNYARGHDFRLNLNVTHDQV-GHYVTDVLSDAAVELIEQHSGSNSSQPLF 184
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L ++H A H A NA P +Q P E F +I N R +A
Sbjct: 185 LLLSHLAPH---AANADDP---MQAP--AEELAKFEYIRNETHRHYA 223
>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 469
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 93/214 (43%), Positives = 123/214 (57%), Gaps = 16/214 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG ++I TPN+D LA +G+ L R Y CTP+RAAFLTG++PFRYG+ T V
Sbjct: 46 GWNDVGFHG-SEIKTPNLDKLAASGVRLERFYVKSVCTPTRAAFLTGRHPFRYGMSTGVI 104
Query: 83 AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
K +P+ EK + + LKE GY T ++GKWH+G +E LP +RGFD H G++ G
Sbjct: 105 KPWDKVGLPLEEKTIAETLKEAGYYTAILGKWHLGHYQESFLPTSRGFDYHYGHYLGGID 164
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH-SRPLFLQ 197
Y T+ND DF LD RN + Y TD ++V +I +HN+ +PLFL
Sbjct: 165 YFTHND-----DFLGALDWHRNRIHLKEE---GYATDLIGQEAVKLINNHNYEQQPLFLY 216
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFA 231
I A HT + L + D E R FA
Sbjct: 217 IAFNAPHTPLHAKTEDIEDYLTIDD--EKRRVFA 248
>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
Length = 528
Score = 153 bits (386), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 124/229 (54%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVG+HG ++I TPN+D L+ G+ L +Y P CTPSR +TG+Y R G+ +
Sbjct: 40 GWNDVGYHG-SEIKTPNLDKLSAKGVRLENYYVQPLCTPSRNQLMTGRYQIRTGMQHQII 98
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ ++E GY+TH++GKWH+G +++ LP RGFD++ GY G
Sbjct: 99 WPCQPYCVPLDEKLLPQLMREAGYATHMVGKWHLGMYRKDCLPTRRGFDSYFGYLTGSED 158
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
+Y ++ T AV L R E A + Y T FTD+ +I N +P
Sbjct: 159 YFSHQRCSYVPPLNVTRCAVDL---REGEEVATGYTGTYSTQLFTDRVTSIIAKQNSKKP 215
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + AVH LQVP E ++ I +P+RRL+A
Sbjct: 216 LFLYVALQAVHAP-----------LQVP--ERYVAPYSFIKDPNRRLYA 251
>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
Length = 567
Score = 153 bits (386), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 89/225 (39%), Positives = 122/225 (54%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DV F G + TPNIDALAY+G +L+ Y CTPSR A L+G+YP G V
Sbjct: 39 GFDDVSFRGGREFLTPNIDALAYHGRILDNLYAPAMCTPSRGALLSGRYPAHTGTQHFVI 98
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + + L+P+ KE GYST+LIGKWH+G E P RGFD H GYW Y+
Sbjct: 99 SNEEPWGLTLNATLMPEIFKEAGYSTNLIGKWHLGFASPEYTPTRRGFDYHYGYWGAYID 158
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQ 197
Y + ++++G D RRNM+ Q Y+TD T ++ HVI+ + FL
Sbjct: 159 YYQRRSQMPVANYSMGYDFRRNMDLEC-QNRGVYITDLLTQEAEHVIREKAAANETFFLM 217
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A HT + LQ P EE R FA+I +P RR +A
Sbjct: 218 LSHLATHTANDNDP------LQAP--EEEIRKFAYIKDPRRRKYA 254
>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
Length = 484
Score = 153 bits (386), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 90/221 (40%), Positives = 128/221 (57%), Gaps = 18/221 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVG++ +I TPN+D LA NG++LN Y L TC+PSR A LTG+YPF+ G+ V
Sbjct: 39 GWNDVGWNNP-EIKTPNLDRLASNGVILNASYALSTCSPSRTALLTGRYPFKLGLQHGVV 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P+ LLPQ LK LGYSTH IGKWH+G + E P RGFD+ G+++G
Sbjct: 98 KKGKPYGLPLNITLLPQKLKHLGYSTHAIGKWHLGFCRWEYTPTFRGFDSFYGFYSGSED 157
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y +T G D R N + + P++ KY T + ++V +I++H + PLFL +
Sbjct: 158 YYK--RKTAAIRGYDFRMNTKVFKPKI-KKYSTLDYGRRAVKIIQAHKRTEPLFLYMPFQ 214
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH+ LQVP E + +I + +RR+F+
Sbjct: 215 AVHSP-----------LQVPKSFEFK--YRNIVDRNRRIFS 242
>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
Length = 525
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/231 (39%), Positives = 131/231 (56%), Gaps = 26/231 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++DV F G N+ TPNIDALAY+G++LN YT CTPSR+A LTGKYP G+
Sbjct: 6 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTPAMCTPSRSALLTGKYPISTGMQHYVI 65
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P+ E + + ++ GY T L+GKWH+G ++ P RGFD H+GY
Sbjct: 66 VNDQPWG------LPLNETTMAEIFQQNGYYTSLLGKWHLGMSQRNFTPTKRGFDTHLGY 119
Query: 136 WNGYLTYNDSIH---ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HS 191
Y+ Y D + +++ G D R N+E ++ Y+TD +D +V +I+ HN +
Sbjct: 120 LGAYIDYYDQTYLQSSQNYSRGHDFRDNLEASHDKV-GHYVTDILSDAAVELIEKHNVTA 178
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+PLFL ++H A H A N P LQ P MEE + F +I N R +A
Sbjct: 179 KPLFLLLSHLAPH---AANDNDP---LQAP-MEELSQ-FEYIQNKSHRYYA 221
>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 124/221 (56%), Gaps = 15/221 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G++D+G+H ++ I TPN+D LA G+ L +Y P CTP+R+ ++G+Y G+ +
Sbjct: 25 GFHDIGYH-DSIIKTPNLDRLASEGVKLENYYVQPKCTPTRSQLMSGRYQIHTGLQHGII 83
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E +PQ LKE GY+TH++GKWH+G K+E LP RGFD GY G
Sbjct: 84 WPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTERGFDTFFGYLTGSED 143
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y D G+D R NM+ PQ + +Y T F +++ ++IKSH+ PLFL +
Sbjct: 144 YYTHNRSYDKFHGMDFRENMQIVQPQYNGQYSTHVFAEKAKNIIKSHDPQIPLFLYLPLH 203
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH G LQVPD E + + +I+N RR +A
Sbjct: 204 AVH-----------GPLQVPDQYE--KPYTNITNKQRRTYA 231
>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
Length = 431
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFHG + I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ +S++ T A+ L R+ E A + ++ Y T+ FT ++ VI +H +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + I + RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268
>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
Length = 408
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 58 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPREKP 233
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269
>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
Length = 584
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 93/232 (40%), Positives = 123/232 (53%), Gaps = 27/232 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++DV F G N+ TPNIDALAY+G++LN Y P CTPSRAA LTGKYP G+
Sbjct: 46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P+ E + + +E GY T L+GKWH+G ++ P RGFD H+GY
Sbjct: 106 VNDQPWG------LPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHLGY 159
Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
Y+ Y +E G D R N++ Q+ Y+TD TD +V I+ H N
Sbjct: 160 LGAYVDYYTQSYEQQSKGYNGHDFRDNLKSSHDQV-GHYITDVLTDAAVKEIEDHASKNS 218
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
S+PLFL + H A H N +Q P EE R F +I N R +A
Sbjct: 219 SQPLFLLLNHLAPHAANDDNP------MQAP-AEEVSR-FEYIRNKTHRYYA 262
>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
Length = 431
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFHG + I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ +S++ T A+ L R+ E A + ++ Y T+ FT ++ VI +H +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + I + RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268
>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
Length = 458
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFHG + I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ +S++ T A+ L R+ E A + ++ Y T+ FT ++ VI +H +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + I + RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268
>gi|291225025|ref|XP_002732499.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 307
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 122/222 (54%), Gaps = 18/222 (8%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GWNDV +H DI PN+ LA +G++ N+ YT PTCTPSRAA +TG YPF+ G +
Sbjct: 1 MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
+ VP+ KLLP+ LKE+GYSTH++GKWH+G K+E LP NRGFD+H G W G
Sbjct: 60 AFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y+ + G D R NM P+ S YL D++ H++ +H PLFL T
Sbjct: 120 GDYDKMDGVLSPSAGYDFRDNM-GVVPK-SDDYLALMLGDRAEHIVNTHYPGTPLFLAFT 177
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
+P L++P EE + +A I + R F
Sbjct: 178 -----------LDIPAKHLEIP--EEYEEKYAEIEDDRTRQF 206
>gi|402871949|ref|XP_003899908.1| PREDICTED: arylsulfatase B-like, partial [Papio anubis]
Length = 316
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 57 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT ++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRATALITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268
>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 57 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268
>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
Length = 413
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 56 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267
>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
Length = 415
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 58 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269
>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
Length = 534
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFHG + I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ +S++ T A+ L R+ E A + ++ Y T+ FT ++ VI +H +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + I + RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268
>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 57 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268
>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
Length = 413
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 56 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267
>gi|291225027|ref|XP_002732497.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 461
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 90/222 (40%), Positives = 123/222 (55%), Gaps = 18/222 (8%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GWNDV +H +DI PN+ LA +G++ N+ YT PTCTPSRAA +TG YPF+ G +
Sbjct: 1 MGWNDVHWH-NSDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
+ VP+ KLLP+ LKE+GYSTH++GKWH+G K+E LP NRGFD+H G W G
Sbjct: 60 VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y+ + G D R NM P+ S YL D++ H++ +H PLFL T
Sbjct: 120 SDYDKMNGVLSPSAGYDFRDNM-GVVPK-SDDYLALMLGDRAEHIVNTHYPGTPLFLAFT 177
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
+P L++P EE + +A I + R F
Sbjct: 178 -----------LDIPAKHLEIP--EEYEEKYAEIEDDRTRQF 206
>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
Length = 534
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 92/229 (40%), Positives = 131/229 (57%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFHG + I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ +S++ T A+ L R+ E A + ++ Y T+ FT ++ VI +H +P
Sbjct: 176 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + I + RR++A
Sbjct: 233 LFLYLAFQSVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 268
>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
Length = 415
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 58 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269
>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
Length = 545
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 126/222 (56%), Gaps = 21/222 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVG+H DI TPN+D +A G++LN Y P CTPSR +FLTG YPFR G+ T +
Sbjct: 39 GWNDVGWHNP-DIKTPNLDRMAGGGVILNSSYVHPICTPSRNSFLTGVYPFRVGLSGTAI 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A+ + + LP+ LK+LGYSTH+IGKWH+G E P RGFD+ +G++ G
Sbjct: 98 TPHQARFMSLKTPTLPEKLKKLGYSTHMIGKWHLGFCNERYTPTRRGFDSFLGFYTGTQD 157
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITH 200
Y H T A G D R N + P +Y T + ++V +I H + PLFL +
Sbjct: 158 YYK--HTT--AKGYDFRFNQTVFYPP-KKQYSTKTYAKRAVDIITEHKRKKNPLFLYLAF 212
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VHT LQVP ++ ++ + +I N DRR+++
Sbjct: 213 QSVHTP-----------LQVP--KKYEKQYNNIKNKDRRVYS 241
>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
Length = 533
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 56 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267
>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 58 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269
>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 149 bits (377), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 57 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268
>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
Length = 502
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 125/229 (54%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG ++I TP +D L+ G+ L +YT P CTPSR+ L+G+Y G+ +
Sbjct: 24 GWNDVGFHG-SEILTPTLDFLSGQGVRLAGYYTQPLCTPSRSQLLSGRYQIHTGLQHQII 82
Query: 83 AGVAKAV-PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
P+ +KLLP+ LKE GY TH++GKWH+G K + LP RGFD++ GYW G
Sbjct: 83 WPCQPHCHPLEDKLLPELLKERGYVTHMVGKWHLGMYKTDCLPTRRGFDSYFGYWTGGED 142
Query: 142 YNDSIHETDFAV--------GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y HE + + LD R+ E A KY T FTD++V +I +HN +P
Sbjct: 143 YYS--HERCYLITTLNITRCALDF-RDGEVPATDYQMKYSTHLFTDRAVDLITNHNPEKP 199
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + + AVH+ LQVPD T H N RRL+A
Sbjct: 200 LFLYLAYQAVHSP-----------LQVPDQYIEPYTSIHDKN--RRLYA 235
>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
Length = 429
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG + I TP +DALA G++L+ +YT P CTPSR+ LTG+Y G+ +
Sbjct: 56 GWNDVGFHG-SAIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT+++ +I +H +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATTLITNHPPEKP 231
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +RR +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDRNRRYYA 267
>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
Length = 533
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 56 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267
>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 57 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268
>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 58 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269
>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 149 bits (377), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 56 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 114
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 115 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 231
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 267
>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
Length = 535
Score = 149 bits (376), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 58 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 233
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 269
>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
Length = 574
Score = 149 bits (376), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 85/225 (37%), Positives = 122/225 (54%), Gaps = 14/225 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+ G + TPNIDAL ++G +L+R Y CTPSR A L+GKYP G V
Sbjct: 46 GFDDISLRGGREFLTPNIDALGFHGRLLDRLYAPAMCTPSRGALLSGKYPIHTGTQHFVI 105
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ ++ + L+P+ + GYST+L+GKWH+G + E P +RGFD H GYW Y+
Sbjct: 106 SNEEPWSLMLNTTLMPEIFRSAGYSTNLVGKWHLGFARPEYTPTHRGFDYHYGYWGAYID 165
Query: 142 Y---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPLFLQ 197
Y + E + VG D RRNME Y+TD T+++ +I+ + +PLFL
Sbjct: 166 YYQRRSQMPEKTYIVGYDFRRNMEVECTD-RGVYVTDLLTNEAERIIQETAAKQKPLFLM 224
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
I H A HT + LQ P EE + F HI +P+ R +A
Sbjct: 225 INHLATHTANDNDP------LQAP--EEEVQKFLHIKDPNHRKYA 261
>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
Length = 534
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 57 GWNDVGFHGSR-IHTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268
>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
Length = 414
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 57 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT ++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRATALITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268
>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
Length = 591
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 92/232 (39%), Positives = 122/232 (52%), Gaps = 27/232 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++DV F G N+ TPNIDALAY+G++LN Y P CTPSRAA LTGKYP G+
Sbjct: 46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P+ E + + +E GY T L+GKWH+G ++ P RGFD H GY
Sbjct: 106 VNDQPWG------LPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHFGY 159
Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
Y+ Y +E G D R N++ + +Y+TD TD +V I+ H N
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDHV-GRYITDVLTDAAVKEIEDHGSKNS 218
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
S+PLFL + H A H N +Q P EE R F +I N R +A
Sbjct: 219 SQPLFLLLNHLAPHAANDDNP------MQAP-AEEVSR-FEYIGNKTHRYYA 262
>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
Length = 533
Score = 149 bits (375), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG + I TP +DALA G++L+ +YT P CTPSR+ LTG+Y G+ +
Sbjct: 56 GWNDVGFHG-SAIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT+++ +I +H +P
Sbjct: 175 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATTLITNHPPEKP 231
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +RR +A
Sbjct: 232 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDRNRRYYA 267
>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
Length = 534
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 57 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT ++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRATALITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 268
>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
Length = 585
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 125/232 (53%), Gaps = 27/232 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++DV F G N+ TPNIDALAY+G++LN Y P CTPSRAA LTGKYP G+
Sbjct: 46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P+ E + + +E GY T L+GKWH+G ++ P RGFD H+GY
Sbjct: 106 VNDQPWG------LPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGY 159
Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
Y+ Y +E G D R +++ + Y+TD TD +V I+ H N
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHV-GHYVTDLLTDAAVKEIEDHGSKNS 218
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
S+PLFL + H A H A N P +Q P EE R F +ISN R +A
Sbjct: 219 SQPLFLLLNHLAPH---AANDDDP---MQAP-AEEVSR-FEYISNKTHRYYA 262
>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
Length = 532
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 129/229 (56%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG + I TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ +
Sbjct: 55 GWNDVGFHGSS-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRMGLQHQII 113
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 114 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 173
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T ++++ T A+ R+ E A + Y T+ FT+++ +I +H +P
Sbjct: 174 YYSHERCTLINALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALITNHPPEKP 230
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 231 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 266
>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
Length = 535
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 90/229 (39%), Positives = 127/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDV FHG N I TP++D LA G++L+ +YT P CTPSR+ LTG+Y G+ +
Sbjct: 58 GWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y DS++ T A+ R+ E+ A + Y T+ FT+++ +I SH +P
Sbjct: 177 YYSHERCALIDSLNVTRCALDF---RDGEQVATGYKNMYSTNIFTERATALITSHPPEKP 233
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHYYA 269
>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
Length = 491
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 87/225 (38%), Positives = 121/225 (53%), Gaps = 25/225 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ND+G+HG ++I TPN+D LA G+ L +Y P CTP+R+ ++G RY I T +
Sbjct: 35 GYNDIGYHG-SEIKTPNLDKLAGEGVKLENYYVQPICTPTRSQLMSG----RYQIHTGLQ 89
Query: 83 AGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
GV + +P+ +LPQ LKE+GYSTH +GKWH+G KEE LP NRGFD+H GY
Sbjct: 90 HGVIRPPQPNGLPLDSAILPQKLKEVGYSTHAVGKWHLGFYKEEYLPTNRGFDSHFGYLT 149
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
G Y G D R NM + Y T F ++ V+ +HN +PLFL
Sbjct: 150 GAEDYFKHDRCFGAMCGTDLRDNMN--PANYTGVYSTHLFAQKAAEVVNNHNTDKPLFLY 207
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +VH LQVP E+ + + HI + RR +A
Sbjct: 208 LPFQSVHAP-----------LQVP--EQYTKPYMHIQDKQRRTYA 239
>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
Length = 506
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 95/254 (37%), Positives = 137/254 (53%), Gaps = 35/254 (13%)
Query: 4 PVGAGVAKAVP------VTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP 57
P+G+G + P + + L GWNDVGFHG I TP++DALA G++L+ +YT P
Sbjct: 7 PLGSGAEASRPPHLVFVLADDL---GWNDVGFHGSC-IRTPHLDALAAGGVLLDNYYTQP 62
Query: 58 TCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG 116
CTPSR+ LTG+Y R G+ + VP+ EKLLPQ LKE GY+TH++GKWH+G
Sbjct: 63 LCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLG 122
Query: 117 CNKEELLPFNRGFDNHVGYWNGYLTYN--------DSIHETDFAVGLDARRNMERYAPQM 168
++E LP RGFD + GY G Y D+++ T A+ R+ E A
Sbjct: 123 MYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDF---RDGEEVATGY 179
Query: 169 SSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDR 228
+ Y T+ FT ++ +I +H +PLFL + +VH LQVP EE +
Sbjct: 180 KNMYSTNIFTKRATALITNHPPEKPLFLYLALQSVHEP-----------LQVP--EEYLK 226
Query: 229 TFAHISNPDRRLFA 242
+ I + +R +A
Sbjct: 227 PYDFIQDKNRHHYA 240
>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
Length = 520
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 88/227 (38%), Positives = 125/227 (55%), Gaps = 22/227 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG + TP++DALA G+ L+ +YT P CTPSR+ LTG+Y G+ +
Sbjct: 43 GWNDVGFHGSR-LRTPHLDALAAGGVRLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 102 WPCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 161
Query: 142 YNDSIHETDFAVGLDAR------RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y H F L+ R+ E A + + Y + F ++++ +I +H +PLF
Sbjct: 162 YFSHEHCV-FIKALNVTRCALDFRDGEEVATEYKNMYSANIFANRAISLIANHPPEKPLF 220
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + +VH LQVP EE + + I + +RRL+A
Sbjct: 221 LYLALQSVHEP-----------LQVP--EEYLKPYDFIRDKNRRLYA 254
>gi|241789348|ref|XP_002400616.1| sulfatase, putative [Ixodes scapularis]
gi|215510801|gb|EEC20254.1| sulfatase, putative [Ixodes scapularis]
Length = 224
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 80/186 (43%), Positives = 110/186 (59%), Gaps = 5/186 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DV F G+ IPTPN+D LA GI+LN +Y L CTPSR A ++G YP G+ V
Sbjct: 13 GWADVSFRGDPQIPTPNLDVLASQGIILNNYYVLHLCTPSRGALMSGLYPIHTGLQHYVQ 72
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ ++P++LK LGY+TH+IGKW++G KE P RGFD+ G+ NG
Sbjct: 73 LPAEPHGLPLNVTIMPEHLKNLGYTTHMIGKWNLGYYKESYTPTRRGFDSFYGFLNGGED 132
Query: 142 YNDSIHETDF-AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D H F + GLD Q S Y TD FT +++ +IK H+ ++P+FL +H
Sbjct: 133 YYD--HTILFVSTGLDFWDGTTPVRNQ-SHHYSTDLFTKKALALIKDHDQAKPMFLYFSH 189
Query: 201 AAVHTG 206
AVH+G
Sbjct: 190 QAVHSG 195
>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
Length = 503
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 89/226 (39%), Positives = 126/226 (55%), Gaps = 20/226 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG + TP++DALA G+ L+ +YT P CTPSR+ LTG+Y G+ +
Sbjct: 27 GWNDVGFHGSR-LRTPHLDALAAGGVQLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 85
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ L+E GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 86 WPCQPSCVPLDEKLLPQLLQEAGYATHMVGKWHLGMYQKECLPTRRGFDTYFGYLLGSED 145
Query: 139 YLTYNDSIHETDFAVGLDAR--RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y T+ + V A R+ E A Q + Y T+ FT+++ +I +H +PLFL
Sbjct: 146 YYTHEHCVFIKALNVTRCALDFRDGEEVATQYKNLYSTNIFTNRATSLIANHPPEKPLFL 205
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +VH LQVP EE + + I + +R L+A
Sbjct: 206 YLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHLYA 238
>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
Length = 585
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 123/232 (53%), Gaps = 27/232 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++DV F G ++ TPNIDALAY+G++LN Y P CTPSRAA LTGKYP G+
Sbjct: 46 GFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P+ E + + +E GY T L+GKWH+G ++ P RGFD H GY
Sbjct: 106 VNDQPWG------LPINETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGY 159
Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
Y+ Y +E G D R N+ + Y+TD TD +V I+ H N
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDNL-KSTHDYVGHYVTDVLTDAAVKEIEDHGSKNS 218
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
S+PLFL + H A H A N P +Q P EE R F +ISN R +A
Sbjct: 219 SQPLFLLLNHLAPH---AANDDDP---MQAP-AEEVSR-FEYISNKTHRYYA 262
>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
Length = 585
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 94/232 (40%), Positives = 124/232 (53%), Gaps = 27/232 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++DV F G ++ TPNIDALAY+G++LN Y P CTPSRAA LTGKYP G+
Sbjct: 46 GFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P+ E + + +E GY T L+GKWH+G ++ P RGFD H GY
Sbjct: 106 VNDQPWG------LPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGY 159
Query: 136 WNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
Y+ Y +E G D R N++ + Y+TD TD +V I+ H N
Sbjct: 160 LGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDHV-GHYVTDVLTDAAVKEIEDHGSKNS 218
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
S+PLFL + H A H A N P +Q P EE R F +ISN R +A
Sbjct: 219 SQPLFLLLNHLAPH---AANDDDP---MQAP-AEEVSR-FEYISNKAHRYYA 262
>gi|291225029|ref|XP_002732496.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 147 bits (370), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 122/222 (54%), Gaps = 18/222 (8%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GWNDV +H DI PN+ LA +G++ N+ YT PTCTPSRAA +TG YPF+ G +
Sbjct: 37 MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 95
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
+ VP+ KLLP+ LKE+GYSTH++GKWH+G K+E LP NRGFD+H G W G
Sbjct: 96 VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 155
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y+ + G D R NM P+ S YL D++ H++ +H PLFL T
Sbjct: 156 GDYDKMNGVLSPSAGYDFRDNM-GVVPK-SDGYLALMLGDRAEHIVNTHYPGTPLFLAFT 213
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
+P L++P EE + ++ I + R F
Sbjct: 214 -----------LDIPAKHLEIP--EEYEEKYSDIEDDRTRQF 242
>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
Length = 535
Score = 147 bits (370), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 128/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y G+ +
Sbjct: 58 GWHDVGFHGSR-IRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T D+++ T A+ R+ E A + Y T+ FT+++ +I +H +P
Sbjct: 177 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALISNHPPEKP 233
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +RR +A
Sbjct: 234 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIHDKNRRYYA 269
>gi|291225031|ref|XP_002732500.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 286
Score = 146 bits (369), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 122/222 (54%), Gaps = 18/222 (8%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GWNDV +H DI PN+ LA +G++ N+ YT PTCTPSRAA +TG YPF+ G +
Sbjct: 1 MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
+ VP+ KLLP+ LKE+GY+TH++GKWH+G K+E LP NRGFD+H G W G
Sbjct: 60 VFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y+ + G D R NM P+ S YL D++ H++ +H PLFL T
Sbjct: 120 GDYDKLNGVLSPSAGYDFRDNM-GVVPK-SDGYLALMLGDRAEHIVNTHYPGTPLFLTFT 177
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
+P L++P EE + +A I + R F
Sbjct: 178 -----------LDIPAKHLEIP--EEYEEAYADIEDDRTRQF 206
>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
Length = 534
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 126/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P CTPSR+ LTG+Y G+ +
Sbjct: 57 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 115
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y D+++ T A+ R+ E A + Y T+ FT ++ +I +H +P
Sbjct: 176 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRATTLITNHPPEKP 232
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 233 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHYYA 268
>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 490
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 126/232 (54%), Gaps = 27/232 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DV FHG IPTPNID LA +G++LN +Y LP CTPSRAA +TG YP R G+ T +
Sbjct: 37 GWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPLCTPSRAALMTGLYPIRNGMQLTSI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ K+LPQ+ K+LGY ++IGKWH+G K +P RGFD G++ G
Sbjct: 97 QAAGPWGLPLENKILPQHFKDLGYDVNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSND 156
Query: 142 Y----NDSIHETDFAVGLDARRN------MERYAP-QMSSKYLTDFFTDQSVHVIKSHNH 190
Y + S H AV + N + + P ++S +L ++ + ++
Sbjct: 157 YYNHTSGSSHRKILAVTSSVQVNTLEKGRLSLWGPRELSVCFLHQIYSPLNFYL------ 210
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P F I+H AVH A NA+ + Q P N F++I P+R ++A
Sbjct: 211 -QPFFCYISHQAVH--HALNAE----MFQAP--ARNVLKFSYIGEPNRTIYA 253
>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 125/227 (55%), Gaps = 19/227 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+D HG + TPN+D L +G+ L+R+YT C+P+R A LTGKY G+ P+
Sbjct: 34 GWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVLTGKYAHTVGMQGMPL 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ E+L+ QYL++ GY T ++GKWH+G E LP RGF+NH G G++
Sbjct: 94 SNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTYRGFENHFGVRGGFID 153
Query: 142 YNDSIHETDFAVGLDARRN-----MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLF 195
Y +E + LD R + P +++ Y+TD +T++S +I++HN S PL+
Sbjct: 154 Y----YEYNAQEQLDGRPVTGLCLFDDLQPDWTTEGYITDVYTEKSTTIIENHNVSEPLY 209
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L +TH A H G + LQ P E R H+ RR+FA
Sbjct: 210 LLLTHHAPHNGNEDAS------LQAP--PEEVRAQRHVELHPRRIFA 248
>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
Length = 547
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 125/227 (55%), Gaps = 19/227 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+D HG + TPN+D L +G+ L+R+YT C+P+R A LTGKY G+ P+
Sbjct: 34 GWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVLTGKYAHTVGMQGMPL 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ E+L+ QYL++ GY T ++GKWH+G E LP RGF+NH G G++
Sbjct: 94 SNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTYRGFENHFGVRGGFID 153
Query: 142 YNDSIHETDFAVGLDARRN-----MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLF 195
Y +E + LD R + P +++ Y+TD +T++S +I++HN S PL+
Sbjct: 154 Y----YEYNAQEQLDGRPVTGLCLFDDLQPDWTTEGYITDVYTEKSTTIIENHNVSEPLY 209
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L +TH A H G + LQ P E R H+ RR+FA
Sbjct: 210 LLLTHHAPHNGNEDAS------LQAP--PEEVRAQRHVELHPRRIFA 248
>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 146 bits (369), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 125/227 (55%), Gaps = 19/227 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+D HG + TPN+D L +G+ L+R+YT C+P+R A LTGKY G+ P+
Sbjct: 34 GWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVLTGKYAHTVGMQGMPL 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ E+L+ QYL++ GY T ++GKWH+G E LP RGF+NH G G++
Sbjct: 94 SNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTYRGFENHFGVRGGFID 153
Query: 142 YNDSIHETDFAVGLDARRN-----MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLF 195
Y +E + LD R + P +++ Y+TD +T++S +I++HN S PL+
Sbjct: 154 Y----YEYNAQEQLDGRPVTGLCLFDDLQPDWTTEGYITDVYTEKSTTIIENHNVSEPLY 209
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L +TH A H G + LQ P E R H+ RR+FA
Sbjct: 210 LLLTHHAPHNGNEDAS------LQAP--PEEVRAQRHVELHPRRIFA 248
>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
Length = 520
Score = 146 bits (368), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 127/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVG+H ++I TPN+D L+ G+ L +Y P C+PSR +TG+Y G+ +
Sbjct: 40 GWNDVGYH-NSEIKTPNLDLLSAKGVRLQNYYVQPLCSPSRNQLMTGRYQIHTGMQHQII 98
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G K++ LP +RGFD++ GY+ G
Sbjct: 99 WPCQPYCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYKKDCLPTHRGFDSYFGYYLGSED 158
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ +++ T A+ L R+ E A Y T+ F+ ++V VI HN S+P
Sbjct: 159 YYTHTRCYPITALNLTRCALDL---RDGEEVATAYKGAYSTELFSQRAVSVIAKHNASQP 215
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + AVH LQVP E ++ I + RR +A
Sbjct: 216 LFLYVAMQAVHEP-----------LQVP--ERYVTPYSFIKDVSRRKYA 251
>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
Length = 538
Score = 146 bits (368), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 85/224 (37%), Positives = 127/224 (56%), Gaps = 23/224 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+H ++I TPN+D LA + L+R Y +CTP+RAA +TG++P RYG+ + V
Sbjct: 54 GWNDVGYHN-SEIKTPNLDKLAESSTRLDRFYVTSSCTPTRAALMTGRHPSRYGMSSGVI 112
Query: 83 AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
K +P+ EK + Q LKE GY T ++GKWH+G KEE LP RGFD H G++ G +
Sbjct: 113 WPWDKVGLPLEEKTIAQTLKEAGYYTAIVGKWHLGHYKEEYLPTRRGFDYHYGHYCGSID 172
Query: 142 YNDSIHETDFAV--GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQI 198
Y H+ D + GLD RN + P Y TD ++V +I+ ++++ PLFL +
Sbjct: 173 Y--FTHQLDAGIQGGLDWHRNEQ---PVEEEGYATDLLAQEAVKLIRDCDYNKSPLFLYV 227
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ A H E++ + +A+I + RR+FA
Sbjct: 228 SFNAPHAPLQAK-------------EKDIKNYANIQDEGRRIFA 258
>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 527
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 88/221 (39%), Positives = 120/221 (54%), Gaps = 16/221 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW+DV FHG IPTPN+DALA +GI+LN +Y P CTPSRAA +TG YP G+ V
Sbjct: 34 GWDDVSFHGSPQIPTPNMDALAADGIILNNYYVQPVCTPSRAALMTGMYPIHTGLQHGVL 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ K++P+Y K+LGY THLIGKW++G +E P RGFD+ G++N
Sbjct: 94 LAAEPNGLPLEFKIMPEYFKDLGYETHLIGKWNLGYYMKEYTPTYRGFDSFYGFYNYEED 153
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y H +F + N + P YLT D ++ + P FL ++H
Sbjct: 154 Y--FTHNLEFV----NQSNAMVWRPSSFCVYLT-LSPDTGFDLLSASIERGPFFLYLSHQ 206
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH G +GN LQ P EEN F +I + R +A
Sbjct: 207 SVH-GASGNDP-----LQAP--EENIAKFPYIGDERRTKYA 239
>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
Length = 528
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 129/229 (56%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFHG + I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ +
Sbjct: 51 GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLI 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LK+ GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 110 MTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 169
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ + ++ T A+ L R+ E A + + Y T+ FT ++ +I +H +P
Sbjct: 170 YYTHEACAPIECLNGTRCALDL---RDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKP 226
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + I + RR++A
Sbjct: 227 LFLYLAFQSVHDP-----------LQVP--EEYMEPYDFIQDKHRRIYA 262
>gi|291221683|ref|XP_002730830.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/205 (41%), Positives = 118/205 (57%), Gaps = 16/205 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV +H DI P + LA +G++ N+ YT PTCTPSRAA +TG YPFR G +
Sbjct: 38 GWNDVEWHNP-DIKMPVLSKLAADGVIFNQSYTHPTCTPSRAAMMTGMYPFRTGNQHQMI 96
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ VP+ KLLP+ LKE+GY TH++GKWH+G KEE LP +RGFD+H G W +
Sbjct: 97 FNLHPSGVPLEFKLLPEKLKEVGYFTHMVGKWHLGFCKEEYLPTSRGFDSHYGLWTLGVG 156
Query: 142 YNDSIHET-DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
+ D ++ + G D R N+ P+ S +YLT +++ H+I H + PLFLQ T
Sbjct: 157 HYDKMNGVLSPSEGYDFRDNI-GVVPK-SDEYLTLMLAERAEHIINGHYNKHPLFLQFT- 213
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEE 225
+P L++PD E
Sbjct: 214 ----------MDIPAKHLEIPDTFE 228
>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 784
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 121/221 (54%), Gaps = 13/221 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DVG+HG + I TPNIDALA G+ L+ +Y CTPSR L+G+Y G+ +
Sbjct: 53 GWSDVGYHG-SVIKTPNIDALASEGVKLDNYYMSLLCTPSRGQLLSGRYEIHTGLQHRTI 111
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E +LPQ LKE GY+TH++GKWH+G ++E LP RGFD +G++ G
Sbjct: 112 DMMQPLCLPIDETILPQKLKERGYATHMVGKWHLGFYRKECLPNYRGFDTFMGFYQGMAD 171
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y T G D RRN + A + + +Y T F D++ +I HN PLFL ++
Sbjct: 172 YYYHNISTGIYHGWDFRRNNDVIAQKYAGQYSTHVFADEAQIIIMKHNPEVPLFLFLSFQ 231
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A+H LP LQVP + +N DR+ +A
Sbjct: 232 AIH--------LP---LQVPSRYADMYKTLIPNNADRQKYA 261
>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
Length = 358
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 126/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG ++I TP++D LA G+ L+ +Y P CTPSR +TG+Y R G+ +
Sbjct: 34 GWNDVGFHG-SEIKTPHLDRLAAQGVRLDNYYVQPLCTPSRNQLMTGRYQIRTGLQHQII 92
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ L+E GY TH++GKWH+G +++ LP +RGF + GY G
Sbjct: 93 WPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLPTHRGFQSFFGYLTGSED 152
Query: 139 YLTYNDS-----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ ++ T A+ L R+ + A S +Y T+ T+++ H+I H +P
Sbjct: 153 YYTHKRCSPIAPLNVTRCALDL---RDGDAVALNYSGRYSTELLTERATHIITQHTPDQP 209
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + AVH LQVPD +F I +P RR +A
Sbjct: 210 LFLYVALQAVHAP-----------LQVPDHYIAPYSF--IQDPHRRRYA 245
>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
Length = 532
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 127/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG + I TP++DALA G+ L+ +Y P CTPSR+ L+G+Y G+ +
Sbjct: 55 GWNDVGFHGSS-IRTPHLDALAAGGVRLDNYYVQPLCTPSRSQLLSGRYQIHTGLQHQII 113
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 114 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 173
Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y D+++ T A+ R+ E A + Y T+ FT+++ +I +H +P
Sbjct: 174 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATALIANHPPEKP 230
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +RR +A
Sbjct: 231 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIEDKNRRHYA 266
>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
(N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
intestinalis]
Length = 522
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 87/228 (38%), Positives = 125/228 (54%), Gaps = 22/228 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVG+H DI TPNI+ LA +G++L +Y P CTPSR+ +TG+Y G+ + +
Sbjct: 38 GFNDVGYHNP-DIYTPNINKLAKDGVILESYYVQPICTPSRSQLMTGRYQIHTGLQHSVI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +PV E +LPQ LKE GY+TH +GKWH+G K+E LP +RGFD GY+ G
Sbjct: 97 FAPQPNCLPVDEIILPQKLKEAGYTTHAVGKWHLGFYKKECLPTSRGFDTFYGYYCGAED 156
Query: 142 YNDSIHETDFAVGLDARR-------NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y +F G RR + R + + Y + + D++V +IKSHN S PL
Sbjct: 157 YYTKQVHANFHFGNKTRRVSGFDFHDNSRTEWEANGTYSSYLYRDRAVRIIKSHNSSIPL 216
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
F+ + +VH P LQVP + + + HI + RR F+
Sbjct: 217 FMYLPFQSVH--------FP---LQVP--AKYIKRYRHIKDRKRRTFS 251
>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
Length = 438
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 71/189 (37%), Positives = 114/189 (60%), Gaps = 8/189 (4%)
Query: 56 LPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWH 114
+P CTPSR AFL+G++P R G+ P+ A + + + + LLP+YL++LGY+THL+GKWH
Sbjct: 1 MPVCTPSRVAFLSGRHPLRTGMQGYPLKAAEPRGLHLNDTLLPEYLRKLGYTTHLLGKWH 60
Query: 115 IGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNM-ERYAPQMSSKYL 173
+G +P RGFD +GY+NG + Y + E + VG D R + + + + Y+
Sbjct: 61 VGYLTRNYVPTRRGFDTFLGYFNGVIQYFNHTIEENEQVGYDLHRIVGDNHTVEYRYDYM 120
Query: 174 TDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
T+ T+++ ++I SHN +P++LQ+ H A H A A ++V D EE + T +I
Sbjct: 121 TNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEA------MEVYDWEETNATLGYI 174
Query: 234 SNPDRRLFA 242
+ +RR FA
Sbjct: 175 QDVNRRKFA 183
>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
Length = 518
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 108/184 (58%), Gaps = 3/184 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GWNDV +H + + PN+ LA G++L Y CTPSRAAFLTG+YP G+ + V
Sbjct: 41 GWNDVSWH-NSIVQMPNLQDLAERGVILEHAYAQEKCTPSRAAFLTGRYPINTGMQEEVV 99
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ KLLP YLK+ GY+TH+IGKWH+G E P RGFD+H G++N ++
Sbjct: 100 VATQMSGLPIEFKLLPSYLKDQGYATHMIGKWHVGYCDEAYTPTRRGFDSHYGFYNSGIS 159
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y++ VG D R ++ KY T FTDQ+ +I +H+ + P+FL + +
Sbjct: 160 YSNYSSTEGTDVGYDYRDDLALNL-AAEGKYTTTDFTDQAKTLIDNHDQTNPMFLYMAYN 218
Query: 202 AVHT 205
A HT
Sbjct: 219 APHT 222
>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
Length = 492
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 127/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGFHG I TP++DALA G++L+ +YT P TPSR+ LTG+Y R G+ +
Sbjct: 15 GWNDVGFHGSR-IRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLTGRYQIRTGLQHQII 73
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 74 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 133
Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y D+++ T A+ R+ E A + Y T+ FT +++ +I +H +P
Sbjct: 134 YYSHERCTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP 190
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 191 LFLYLALQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 226
>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 631
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 125/230 (54%), Gaps = 21/230 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
G++DV FHG IPTPN+DA+A +G++LNRHY + TPSR AF TGK P R G++ P+
Sbjct: 53 GYDDVSFHGNEQIPTPNLDAMAADGVILNRHYAAMSGTPSRGAFFTGKLPLRIGLNEGPI 112
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
GV + + ++LP Y+++LGY THLIG+W +G KE LLP NRGFD H G ++
Sbjct: 113 LKGVYGTGLSLEHEVLPFYMRDLGYETHLIGRWGLGFYKESLLPTNRGFDTHYGPYSDSA 172
Query: 141 TYNDSIHETDFAVGL------DARRNMERYAPQMS--SKYLTDFFTDQSVHVIKSHNHSR 192
+Y+ + D L D R+ + P S Y+TD + + +I+ +
Sbjct: 173 SYSSHLSREDAWKSLSVPPAYDLHRDGK---PDFSGFGSYVTDLYKGRFERIIEQRR--K 227
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLF+ ++H H + G L Q P F HI + RR +A
Sbjct: 228 PLFIVLSHQTPHGASFGP------LHQPPPRTNRASQFLHIKDRSRRSYA 271
>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
Length = 382
Score = 144 bits (364), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 86/221 (38%), Positives = 115/221 (52%), Gaps = 16/221 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GWNDVGF + TP++D LA G++LN Y P C+PSR F++G +P+ G+ D +
Sbjct: 36 GWNDVGFRNPQ-VLTPHLDKLAKAGVILNSSYVQPLCSPSRNCFMSGYFPYHTGLQDGVI 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ LPQ LKELGYSTH +GKWH+G + P RGFD VGY+ G
Sbjct: 95 RPASPGFVPIKFTFLPQKLKELGYSTHAVGKWHLGFCNLKYTPTYRGFDTFVGYYIGAED 154
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y E D G D R N Y + KY T F +++V +IKSH+ PL+L +
Sbjct: 155 YYKHTREYDKFSGYDLRFNTSVYT-EAKGKYSTRVFAERAVDIIKSHDTDTPLYLYLPFQ 213
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH L+VP EN + HI + RR +
Sbjct: 214 AVHAP-----------LEVPPEYEN--LYKHIHDLPRRTYC 241
>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
Length = 583
Score = 144 bits (363), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 93/232 (40%), Positives = 125/232 (53%), Gaps = 27/232 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++DV F G N+ TPNIDALAY+G++LN YT P CTPSRAA LTGKYP G+
Sbjct: 46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G +P+ E + + GY T LIGKWH+G ++ P RGFD H+GY
Sbjct: 106 VNDQPWG------LPLNETTMADIFRGNGYRTSLIGKWHLGMSQRNYTPTLRGFDYHLGY 159
Query: 136 WNGYLTYNDSIHE--TDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
Y+ Y + +E + G D R N++ + KY+TD TD +V I H N+
Sbjct: 160 LGAYVDYYNQSYEQVSKGYRGHDFRENLKPNHEHV-DKYVTDILTDAAVREIDDHAAKNN 218
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
S+PLFL + H A H A N P +Q P E + F +I + R +A
Sbjct: 219 SKPLFLLLNHLAPH---AANDADP---MQAPADELSG--FEYIRDETHRYYA 262
>gi|291225021|ref|XP_002732506.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 88/221 (39%), Positives = 120/221 (54%), Gaps = 18/221 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV +H DI PN+ LA +G++ N+ YT PTCTPSRAA +TG YPF+ G +
Sbjct: 38 GWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQML 96
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GYL 140
+ +P+ KLLP+ LKE+GYSTH++GKWH+G K+E LP NRGFD+H G W G
Sbjct: 97 FNLHPSGLPLEFKLLPEKLKEIGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGIWTLGVG 156
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y+ + G D R N Q S+ YL D++ H++ +H PLFL T
Sbjct: 157 DYDKMNGVLSPSKGYDFRDNTG--VVQKSNGYLALMLGDRAEHIVNTHYPGTPLFLAFT- 213
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
+P L +P EE + +A I + R F
Sbjct: 214 ----------LDIPAKHLAIP--EEYENKYADIEDSRTRHF 242
>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
Length = 598
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 125/227 (55%), Gaps = 22/227 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+G+HG + I TPN+D LA G+ L +Y P C+PSR +TG+Y RYG+ + +
Sbjct: 133 GWNDIGYHG-SVIRTPNLDRLAAEGVKLENYYVQPLCSPSRCQLMTGRYQIRYGLQHSLI 191
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ E LPQ LKE GYSTH++GKWH+G K++ P +RGFD GY G
Sbjct: 192 WPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGFYKQDYTPTHRGFDTFYGYLTGAED 251
Query: 139 YLTYNDS---IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y T+ + GLD R+ R + Y T F ++++ +I + ++P+F
Sbjct: 252 YWTHRQKGGLPGQPQTWSGLDL-RDQNRPVTDQNGTYSTHLFANKAIEIIAQQDKNKPMF 310
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L ++ AVH LQ P EE+ ++HIS+ +RR++A
Sbjct: 311 LFLSFQAVHDP-----------LQAP--EEDISRYSHISDTNRRVYA 344
>gi|403049780|ref|ZP_10904264.1| sulfatase [SAR86 cluster bacterium SAR86D]
Length = 515
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 86/222 (38%), Positives = 122/222 (54%), Gaps = 15/222 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DV +H IPTPNID+ NGI LNR Y PTC+P+RA+ LTG + F +G+ P
Sbjct: 29 GWGDVSYH-NGFIPTPNIDSFVSNGIELNRFYANPTCSPTRASLLTGLHIFNHGVIRPFM 87
Query: 83 AGVAKAVPVTE--KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
A+ + E K++P+Y KE GY T L GKWH+G +KEE LP NRGFD+ G+ G +
Sbjct: 88 NPSAEQTGLPEHLKIMPEYFKEAGYQTALSGKWHLGMHKEEYLPTNRGFDSSYGHMLGGI 147
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y D +H R + R ++ Y T+ D+++++IK+ + RPLFL +
Sbjct: 148 GYYDHVHTN--------RMDWHRDGVSLNEDGYSTELIADEAINIIKNKDDDRPLFLYVA 199
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTF-AHISNPDRRL 240
A HT + L + D E DR + A+IS D +
Sbjct: 200 FNAPHTPIEAPEEDVNNFLYIED--ELDRNYAANISKLDIEI 239
>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
Length = 522
Score = 144 bits (362), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 87/226 (38%), Positives = 125/226 (55%), Gaps = 20/226 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVG+H N I TP++DAL+ G+ L +YT P CTPSR+ LTG+Y G+ +
Sbjct: 45 GWNDVGYHDSN-IFTPHLDALSAQGVRLENYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 103
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ EKLLP+ L+E GY TH++GKWH+G ++E LP RGFD GY G
Sbjct: 104 WPCQPSCIPLDEKLLPELLREAGYVTHMVGKWHLGMFRKECLPTRRGFDTFFGYLLGSED 163
Query: 139 YLTYNDSIHETDFAVGLDAR--RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y T+ +H V A R+ E A + Y T+ FT+++V++I +H +PLFL
Sbjct: 164 YYTHKRCVHIDALKVTRCALDFRDGEDIAAGYENMYSTNVFTERAVNLIANHPAQKPLFL 223
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +VH LQVP EE + + I N +R+ +A
Sbjct: 224 YLALQSVHEP-----------LQVP--EEYLQPYDFIQNKNRQHYA 256
>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
Length = 339
Score = 143 bits (361), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 89/244 (36%), Positives = 128/244 (52%), Gaps = 31/244 (12%)
Query: 10 AKAVPVTEKLLPQ---------GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCT 60
A VP +E P GWNDV HG IPTP+IDA+A++G+ L ++ P CT
Sbjct: 22 ADGVPGSEAKRPNIVFIVADDLGWNDVSLHGSPQIPTPHIDAIAHSGVHLTNYHVQPVCT 81
Query: 61 PSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE 120
P+R+ FL+G++ GI P G A + ++ LLP YLK+LGY T +GKWH+G N E
Sbjct: 82 PTRSTFLSGRHVIHTGIYMPFAQGTALRLNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVE 141
Query: 121 ELLPFNRGFDNHVGYWNGYLTY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFT 178
+ LP RGFD ++GYW+G Y +D+ DF G + A + ++ Y T F
Sbjct: 142 KALPTGRGFDEYLGYWSGAEDYYTHDTHGGYDFQDGTEC-------AIKYNNTYSTYIFA 194
Query: 179 DQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDR 238
+++V+ I + +PLFL VH P L+ P E F+HI N +R
Sbjct: 195 ERAVNTILEADPEQPLFLYTAFQNVH--------WP---LEAP--AEYVARFSHIPNSER 241
Query: 239 RLFA 242
+ A
Sbjct: 242 QYVA 245
>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
Length = 481
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 86/221 (38%), Positives = 119/221 (53%), Gaps = 20/221 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFH DI TPNID LA G++LN HY P C+PSRAAF++G YPF+ G+ + +
Sbjct: 37 GWNDIGFHNP-DIITPNIDKLAREGLLLNHHYVQPLCSPSRAAFMSGYYPFKTGLQHSVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ +LPQ LKELGY+TH++GKWH G P RGFD+ GY+
Sbjct: 96 LENQPVCLPLNITILPQKLKELGYATHIVGKWHNGFCSWNCTPTYRGFDSFFGYYGAMED 155
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y T G RN + Y T FTD + +I+ HN S+PLFL + +
Sbjct: 156 Y-----YTHVIRGFLDYRNNTTPVWTDNGTYSTLRFTDVATDIIERHNQSQPLFLYLAYQ 210
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AV+ G ++VP E + +I + +RR F+
Sbjct: 211 AVY-----------GPIEVPAKYE--AMYPNIKSENRRKFS 238
>gi|291225017|ref|XP_002732505.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 142 bits (359), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 87/222 (39%), Positives = 122/222 (54%), Gaps = 18/222 (8%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GWNDV +H DI PN+ LA +G++ + YT PTCTPSRAA +TG YPF+ G +
Sbjct: 37 MGWNDVHWHNP-DIAMPNLMDLADDGVIFEQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 95
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
+ VP+ KLLP+ LKE+GY+TH++GKWH+G K+E LP NRGFD+H G W G
Sbjct: 96 VFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 155
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y+ + G D R N+E P+ S YL D++ ++ +H+ PLFL T
Sbjct: 156 GDYDKLNGVLSPSAGYDFRDNLE-VVPK-SDGYLALMLGDRAEEIVNNHSPETPLFLVFT 213
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
+P L++P EE + +A I + R F
Sbjct: 214 -----------LDIPAKHLEIP--EEYEELYADIEDDRTRQF 242
>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/183 (41%), Positives = 106/183 (57%), Gaps = 8/183 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGF D+ TPNID LA++G+VLN Y +P CTPSR +F+TG Y F+ G+ +
Sbjct: 36 GWNDVGFRNP-DVLTPNIDKLAHSGMVLNSSYVMPVCTPSRNSFMTGHYAFKSGLQHLAI 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A P+ LPQ LKELGY+TH IGKWH+G K E P RGFD G++NG
Sbjct: 95 LPKQAACAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFFGFYNG--- 151
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
+ + A G D R N R + +Y T + ++ +IK H+ S+P+++ +
Sbjct: 152 -QEDYYTLSVAGGKDFRDN--RTPVNATGEYSTFLYARRAESIIKEHDASKPMYMYLPFQ 208
Query: 202 AVH 204
+VH
Sbjct: 209 SVH 211
>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
Length = 483
Score = 142 bits (358), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 76/183 (41%), Positives = 106/183 (57%), Gaps = 8/183 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGF + + TPNID LA +G++LN Y +P CTPSR +F+TG+Y F+ G+ +
Sbjct: 36 GWNDVGFRNPS-VLTPNIDKLARSGMILNSSYVMPVCTPSRNSFMTGQYAFKSGLQHIVI 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A P+ LPQ LKELGY+TH IGKWH+G K E P RGFD GY+NG
Sbjct: 95 LPQQATCAPLNNTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFYGYYNGAED 154
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + A G D R N R + +Y T + ++ +IK H+ S+P+++ +
Sbjct: 155 Y----YNLSIAGGKDFRDN--RTPVNATGEYSTILYARRAESIIKDHDASKPMYMYLPFQ 208
Query: 202 AVH 204
+VH
Sbjct: 209 SVH 211
>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 476
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 121/222 (54%), Gaps = 23/222 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
GWNDVGFHG ++I T N+D LA +G+ L R Y CTP+RAAFLTG++PFRYG+ V
Sbjct: 46 GWNDVGFHG-SEIKTTNLDKLAVSGVRLERFYVKAMCTPTRAAFLTGRHPFRYGMSAINV 104
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ EK + + LKE GY T ++GKWH+G +E LP +RGFD H G++ +
Sbjct: 105 TPWSETGLPLEEKTIAETLKEAGYYTAILGKWHLGHYQESYLPTSRGFDYHYGHYLAGID 164
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQITH 200
Y H++ GLD RN P Y TD +V +I +H+ H PLFL I
Sbjct: 165 Y--FTHKS--GDGLDWHRNNN---PVYIEGYSTDLIAQDAVQLINNHDYHKNPLFLYIAF 217
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H A+ D+E+ + I + RRLFA
Sbjct: 218 NAPHIPLQAKAE---------DLED----YLTIEDEQRRLFA 246
>gi|390341601|ref|XP_796347.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 497
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 124/232 (53%), Gaps = 30/232 (12%)
Query: 23 GWNDVGFHGEND---IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP----FRY 75
G+NDVG+HG + TPN+D LA G+ L ++Y P C+P+R+ L+G+Y +Y
Sbjct: 46 GYNDVGYHGREHGSMVLTPNLDGLAGEGVKLEKYYVQPICSPTRSQLLSGRYQIHTGLQY 105
Query: 76 GIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
G+ P +P+ E LPQ LKE Y+TH++GKWHIG K+ P RGFD++ GY
Sbjct: 106 GVIRPAQP---HCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTPTERGFDSYFGY 162
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVIKSHNH 190
+G Y H F +G + ++ A Q +Y T FT +++ VI +H
Sbjct: 163 LSGAEDYYS--HSRSFQIGSKTLKGLDLMANKTPAFQYKGQYSTHLFTSKAIDVINNHER 220
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
S+PLFL + + AVH+ LQVP E +A+I++ RR +A
Sbjct: 221 SKPLFLYLAYQAVHSP-----------LQVPSKYE--EPYANITSSARRAYA 259
>gi|291230930|ref|XP_002735418.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 480
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 80/180 (44%), Positives = 107/180 (59%), Gaps = 5/180 (2%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GWNDV +H DI PN+ LA +G++ N+ YT PTCTPSRAA +TG YPF+ G +
Sbjct: 20 MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 78
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
+ VP+ KLLP+ KE+GYSTH++GKWH+G K+E LP NRGFD+H G W G
Sbjct: 79 VFNLHPSGVPLEFKLLPEKFKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGIWTLGV 138
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y+ + G D R NM P+ S+ YL D++ H++ +H PLFL T
Sbjct: 139 GDYDKMNGVLSPSAGYDFRDNM-GVVPK-SNGYLALMLGDRAEHIVNNHYPGTPLFLAFT 196
>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 533
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/184 (42%), Positives = 105/184 (57%), Gaps = 11/184 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+NDVGFHG IPTP +DALA +G+ L ++T P C+PSRA+ L+G++ +GI P
Sbjct: 67 GFNDVGFHGSKQIPTPRLDALAADGVDLLNYHTHPVCSPSRASMLSGRHAIHHGIYMPFA 126
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
G A + + +LLP+ L+ LGY TH +GKWH+G N LP RGFD+ +GYW+G Y
Sbjct: 127 QGTAYHLSLEYELLPEALRRLGYETHAVGKWHLGQNTRAALPTGRGFDSFLGYWSGAEDY 186
Query: 143 --NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
+D DFA N E A Y FTD++V V+ S S P FL +
Sbjct: 187 FAHDCAGAYDFA-------NNETTAWAYDGVYSAYSFTDRAVDVVAS--ASTPYFLYVAW 237
Query: 201 AAVH 204
VH
Sbjct: 238 QNVH 241
>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/183 (40%), Positives = 106/183 (57%), Gaps = 8/183 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVGF D+ TPNID LA +G++LN Y +P CTPSR +F+TG Y F+ G+ +
Sbjct: 36 GWNDVGFRNP-DVLTPNIDKLARSGMILNSSYVMPVCTPSRNSFMTGHYAFKSGLQHLAI 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A P+ LPQ LKELGY+TH IGKWH+G K E P RGFD G++NG
Sbjct: 95 NPQQATCAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFFGFYNGQED 154
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + A G D R N + + +Y T ++ ++ +IK H+ S+P+++ +
Sbjct: 155 Y----YTLSVAGGKDFRDN--KVPVNATGEYSTFLYSRRAESIIKEHDASKPIYMYLPFQ 208
Query: 202 AVH 204
+VH
Sbjct: 209 SVH 211
>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 82/221 (37%), Positives = 122/221 (55%), Gaps = 13/221 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DVG+HG + I TP+IDALA G+ L+ +YT CTPSR+ +TG+Y G+ +
Sbjct: 33 GWSDVGYHG-SVIKTPHIDALASEGVKLDNYYTSLLCTPSRSQLMTGRYEIHTGLQHRTI 91
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E +LPQ LK+ GY+TH++GKWH+G ++E LP NRGFD +G++
Sbjct: 92 DMMQPLCLPIDETILPQKLKDRGYATHMVGKWHLGFYRQECLPNNRGFDTFMGFYQAMGD 151
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y T G D RR+ + A + + +Y T F D++ +I HN PLFL ++
Sbjct: 152 YYYHNVSTGKFNGWDFRRDNDVIAERYAGQYSTHVFADEARDIISKHNPDVPLFLFLSFQ 211
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A+H P LQVP + ++ DRR +A
Sbjct: 212 AIH--------FP---LQVPSRYADIYNTLIPNSADRRTYA 241
>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
Length = 519
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 126/229 (55%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW D+G+HG ++I TPN+D L+ G+ L +Y P CTPSR +TG+Y G+ +
Sbjct: 41 GWYDIGYHG-SEIRTPNLDKLSAGGVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQII 99
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ +KE GY+TH++GKWH+G K++ LP RGFD ++GY G
Sbjct: 100 WPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCLPTRRGFDTYLGYLTGSED 159
Query: 139 YLTY-----NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ + S++ + A+ L R+ E A Y T+ + +++ +I+ H +P
Sbjct: 160 YFTHFRCYQSPSLNLSRCALDL---RDGEEVATGYKGVYSTELLSQRAISIIERHISQKP 216
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LF+ + AVH LQVP E ++ I + +RRL+A
Sbjct: 217 LFMYVALQAVHAP-----------LQVP--ERYVTPYSFIKDTNRRLYA 252
>gi|403182689|gb|EJY57565.1| AAEL017303-PA [Aedes aegypti]
Length = 176
Score = 140 bits (353), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 70/140 (50%), Positives = 90/140 (64%), Gaps = 16/140 (11%)
Query: 12 AVPVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKY 71
V VT+ L GWNDV FHG + IPTPNIDALAY GI+LNRHYT P CTPSRA+ ++GK+
Sbjct: 33 VVIVTDDL---GWNDVSFHGSSQIPTPNIDALAYQGIILNRHYTPPLCTPSRASLMSGKH 89
Query: 72 PFRYGI-------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLP 124
P G+ + P G G + +KL+P+Y +E GY T L+GKWH+G ++ P
Sbjct: 90 PINVGMQHHVIESNEPWGLG------LDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTP 143
Query: 125 FNRGFDNHVGYWNGYLTYND 144
RGFD+H GY Y+ Y D
Sbjct: 144 TRRGFDSHFGYIGPYIDYWD 163
>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
Length = 491
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 28/230 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DVG+HG + I TP +DAL G+ L R+YT P CTPSR+ L+G+Y G+ +
Sbjct: 12 GWGDVGWHG-SAIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRYQIHTGLQHQII 70
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ EKLLP+ L+E GY TH++GKWH+G ++E LP +RGFD + GYL
Sbjct: 71 WPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYRKECLPTHRGFDTYF----GYLL 126
Query: 142 YNDSIHETDFAVGLDAR---------RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
++ + D V + A+ R+ E A + Y T+ FT++++ VI +H +
Sbjct: 127 GSEDYYTHDRCVFIKAKNVTRCALDFRDGEEVATGFKNVYSTNLFTERAIDVIANHKTEK 186
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL + +VH L+VP E+ + ++ I + RR +A
Sbjct: 187 PLFLYLAFQSVHEP-----------LEVP--EKYVKPYSSIKDVKRRHYA 223
>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 139 bits (351), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 74/184 (40%), Positives = 108/184 (58%), Gaps = 9/184 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DVG+H +++I TP++D LA +G L + Y P C+P+RAA +TG+YP R+G+ V
Sbjct: 48 GWGDVGWH-DSEIKTPHLDKLAASGTRLEQFYVQPVCSPTRAALMTGRYPMRHGLQVGVV 106
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A+ +P+ E+ LPQ LKE+GY T + GKWH+G + E LP +RGFD+ G++NG L
Sbjct: 107 RPWAQYGLPLNERTLPQALKEVGYETAICGKWHLGHFQPEYLPTHRGFDHQYGHYNGALD 166
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y I + F D R N + Y T ++ +I H+ S+PLFL +
Sbjct: 167 YFTHIRDGGFDWHRDDRVNRD-------EGYSTHLIGREATRIIGHHDTSKPLFLYVPFN 219
Query: 202 AVHT 205
AVH
Sbjct: 220 AVHA 223
>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
Length = 491
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 83/221 (37%), Positives = 121/221 (54%), Gaps = 20/221 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DVGFHG I TPNID LA NG++L+ +Y P CTP+RA+ +TGKYP G+ +
Sbjct: 36 GWSDVGFHGSK-IQTPNIDRLAANGVILDNYYVQPVCTPTRASLMTGKYPIHTGLQHGII 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P+ LLPQ L++ GYSTH++GKWH+G E P RGFD G+++G
Sbjct: 95 HNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYNWESTPTYRGFDTFYGFYSG--A 152
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
N H D + L R+ E + Y FT ++ ++++H+ S PLF+ +
Sbjct: 153 ENHYTHVQDHYLDL---RDNEEIVRDQNGTYSAHLFTKRAEQIVRAHDPSTPLFMYMAFQ 209
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
VH+ +Q P E DR ++ I +P RR +A
Sbjct: 210 NVHSP-----------VQAPK-EYIDR-YSFIKDPLRRTYA 237
>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 533
Score = 139 bits (351), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 84/227 (37%), Positives = 121/227 (53%), Gaps = 27/227 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GW+DV FHG IPTPN+D LA +G++LN +Y CTPSRAA +TG YP G+ D +
Sbjct: 37 GWDDVSFHGSPQIPTPNMDVLAGDGVILNNYYVQHFCTPSRAALMTGLYPIHNGLQDFVI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ K++P++ K++GY TH+IGKWH+G ++E P RGFD+ GY+NG
Sbjct: 97 DVAQPYGLPLYLKVMPEFFKDMGYETHMIGKWHLGYFRKEYTPTYRGFDSFYGYYNGAED 156
Query: 142 -YNDSIHET-----DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
YN SI + + G ++ +E Y L S+ S S+PLF
Sbjct: 157 YYNHSITKVISQSYNIRQGSVTKKRIENYIKNTELVLL-------SLTFYLSILFSQPLF 209
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + + +VH G L+ P EEN F +I +R +FA
Sbjct: 210 LYLAYQSVH-----------GPLEAP--EENIMKFPYIGEENRTIFA 243
>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
Length = 478
Score = 139 bits (350), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 75/189 (39%), Positives = 109/189 (57%), Gaps = 9/189 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+G+HG I TPN+D LA G+ L +Y P C+PSR +TG+Y YG+ + +
Sbjct: 14 GWNDIGYHGSF-IKTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRYQIHYGLQHSVI 72
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E LPQ LKE+GYSTHL+GKWH+G ++E LP RGFD G+ G
Sbjct: 73 THDRPHGLPLDEVTLPQKLKEIGYSTHLVGKWHLGFFRQEYLPLRRGFDTFYGFLTGGED 132
Query: 142 Y----NDSIHETDFAV--GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y +++ TD + GLD R E Q + Y T F +++ +I H+ ++P+F
Sbjct: 133 YWSHRRPNVYSTDASEYHGLDLRDQDEPVLDQ-NGTYSTHLFQRKAIDIIAHHDRNKPMF 191
Query: 196 LQITHAAVH 204
L ++ AVH
Sbjct: 192 LYLSFQAVH 200
>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
Length = 502
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 87/223 (39%), Positives = 116/223 (52%), Gaps = 25/223 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG +DI TPNID LA G L ++Y P CTP+RAAF+TG+YPFRYG+ T V
Sbjct: 71 GWKDVGFHG-SDIKTPNIDELAEKGARLEQYYVQPMCTPTRAAFMTGRYPFRYGMQTAVI 129
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + + + LLP+ LKE GY+T GKWH+G K P RGFD+ G G +
Sbjct: 130 PQGGTYGLALDDHLLPELLKEAGYATAASGKWHLGHAKTAFWPRQRGFDSFYGALLGEI- 188
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
D D RN + + L F D++V VI H+ ++PLFL +
Sbjct: 189 --DHFTHKSANGNADWYRNNDALEEEGFDNVL---FADEAVRVINEHDQAKPLFLYLAFT 243
Query: 202 AVHTGTAGNAKLPTGLLQVPD--MEENDRTFAHISNPDRRLFA 242
+ HT Q P +E N +HI++ RR +A
Sbjct: 244 SPHTP-----------FQAPKEFLERN----SHIADESRRNYA 271
>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
Length = 520
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 123/226 (54%), Gaps = 20/226 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+G+HG + I TPN+D LA G+ L +Y P CTPSR+ +TG+Y +G+ + +
Sbjct: 53 GWNDIGYHG-SVIRTPNLDRLAAEGVKLENYYIQPICTPSRSQLMTGRYQIHFGLQHSII 111
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ E LPQ LKE GYSTH++GKWH+G KEE P +RGFD G+ G
Sbjct: 112 WPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGFYKEEYTPLHRGFDTFYGFLTGSEN 171
Query: 139 YLTYNDSIHETDFAVGLDA--RRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
+ ++ +S F G + R+ +R + Y T F +++ VI + S+P+FL
Sbjct: 172 HYSHRNSGGMPGFRPGWNGLDLRDQDRPVTDQNGTYSTHLFAKKAIEVIAQQDKSKPMFL 231
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ AVH LQ P ++ + HI++ +RR++A
Sbjct: 232 YLPFQAVHAP-----------LQAP--QKYISMYRHINDYNRRMYA 264
>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
Length = 353
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/217 (38%), Positives = 121/217 (55%), Gaps = 19/217 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DV ++ N + PN+ LA G++ N+ Y CTPSR A LTGK+P+R G+ P+
Sbjct: 12 GWSDVSWNNPN-VVMPNLHTLATTGVIFNQTYCQRLCTPSRTALLTGKFPYRLGMQRPIR 70
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
A +P+ E+LLPQ LK+LGY+TH+IGKWH+GC K E P RGFD+ GY G Y
Sbjct: 71 HKKAHGLPLDEELLPQKLKKLGYATHMIGKWHLGCCKWEYTPTERGFDSFYGYHRGSQDY 130
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
H +D GLD + Q + Y T+ F ++ ++I H+ + PLFL +
Sbjct: 131 --YTHMSDG--GLDFWEGKTAISDQ-NGVYSTESFATRAENIISQHDPNTPLFLYLPLQP 185
Query: 203 VHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
VHT ++P+ LQ TF+ I + +R+
Sbjct: 186 VHT----PHQVPSSYLQ---------TFSTIQDHNRK 209
>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
Length = 495
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 88/223 (39%), Positives = 114/223 (51%), Gaps = 24/223 (10%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW DVGFHG +DI TPN+D LA G L ++Y+ P CTPSRAA LTG+YP RYG+ T V
Sbjct: 58 QGWKDVGFHG-SDIRTPNLDQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLV 116
Query: 82 GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
K +P E LLPQ LKE GY T ++GKWH+G + P RGFD G G +
Sbjct: 117 IPSAGKYGLPTDEYLLPQALKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEI 176
Query: 141 TY-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y S H +D RN + Y+T +V +I+ HN PLFL +
Sbjct: 177 DYFTHSAHGK-----VDWYRNNQLIK---EEGYVTTLLGQDAVKLIEKHNPKTPLFLYLA 228
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H Q P + + I++P+RR +A
Sbjct: 229 FTAPHAP-----------YQAPQKYLDQ--YKTIADPNRRAYA 258
>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 548
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 126/230 (54%), Gaps = 26/230 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG-----I 77
GW+DV FHG + IPTPNID LA +GI L+ +Y P CTPSRAA +TG YP R G I
Sbjct: 48 GWDDVSFHGSSQIPTPNIDVLAADGITLHNYYVQPMCTPSRAALMTGLYPIRTGMQHWVI 107
Query: 78 DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
+P G +P+ KL+P++LK+LGYSTHL+GK K+ ++ NR N
Sbjct: 108 RSPEPWG----LPLELKLMPEHLKDLGYSTHLVGKVLFDL-KKFIVSVNRLCINEST--E 160
Query: 138 GYLTYNDSIHETDFAV-----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
T+ ++ + V GLD R E + + +Y T FTD+++ +I+ HN ++
Sbjct: 161 VCHTFVSAVTLCIYFVYKSHAGLDFRNGEEPFHND-TGQYATTLFTDRAISIIEQHNQTK 219
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL ++H A H T LQ PD EN F +I DR ++A
Sbjct: 220 PLFLYLSHLAPHGATHDEP------LQAPD--ENVEKFDYIGEEDRTIYA 261
>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 562
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/234 (35%), Positives = 130/234 (55%), Gaps = 33/234 (14%)
Query: 23 GWNDVGFHGE---NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
G+ND+G+H +D+ TP +D+LA G++L +Y P C+P+R LTG RY I T
Sbjct: 45 GFNDIGYHAREHYSDMYTPFLDSLAAKGVILENYYVQPICSPTRGQLLTG----RYQIHT 100
Query: 80 PVGAGVAKA-----VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+ G+ +A +P+ LL Q L++ GY T+++GKWH+G +EE LP+NRGF N G
Sbjct: 101 GLAHGIIRAAQPYGLPLDNILLSQQLRQCGYKTNMVGKWHLGFFREEYLPWNRGFQNFFG 160
Query: 135 YWNG----YLTYNDSIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
+ NG + Y+ +T G D + RY P ++ +Y T+ F +S +I H
Sbjct: 161 FLNGGVNHFTRYHCEPKKTRRFCGYDMIDS--RYGPTNATYGEYSTNLFIRKSKEMIDKH 218
Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
N +P+FL ++ AVH G LQVP+ + + F HI + +RR++A
Sbjct: 219 NKQKPMFLYLSLQAVH-----------GPLQVPN--QYLKRFKHIRDKNRRIYA 259
>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 519
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/222 (36%), Positives = 120/222 (54%), Gaps = 17/222 (7%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
GWND+G+H + I +PNI+AL +G+ L +Y P CTPSR+ ++G+Y G+ +
Sbjct: 32 HGWNDIGYH-SHIIRSPNINALCNDGVRLENYYIQPGCTPSRSQLMSGRYQIHTGLQHSV 90
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ E L + LKE+GY+THL+GKWH+G LP RGFD+ GY G
Sbjct: 91 IRNDQPNCLPLDEVTLAEKLKEVGYATHLVGKWHLGFYTPSCLPTRRGFDSFFGYLIGQE 150
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y IH+ G D +R+ + Q Y T FT ++ ++IKSH+ S PLFL +++
Sbjct: 151 DYYKHIHDG----GYDLKRHETDVSKQYQGDYTTHVFTSEAQNIIKSHDPSTPLFLYMSY 206
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP+ N I + DRR+ A
Sbjct: 207 QSVH----------ANYLQVPEHYSNMYNGV-IDDEDRRIVA 237
>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
Length = 521
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 85/227 (37%), Positives = 118/227 (51%), Gaps = 22/227 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DVG+HG + I TP +DAL+ G+ L R+Y P CTPSR+ LTG+Y G+ +
Sbjct: 44 GWQDVGWHG-SQIRTPVLDALSAAGVRLERYYIQPLCTPSRSQLLTGRYQIHTGLQHEII 102
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLP+ LKE GY TH++GKWH+G + E LP RGFD + GY G
Sbjct: 103 WPCQPSCVPLDEKLLPELLKEAGYVTHMVGKWHLGMYRNECLPTRRGFDTYFGYLLGSED 162
Query: 142 YNDSIHETDFA------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y H LD R+ E+ A + Y T+ FT ++ +I +H +PLF
Sbjct: 163 YYSHEHCVPIVSKNVTRCALDL-RDGEKIADGFKNMYSTNVFTQRAQDLIANHQPEKPLF 221
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + +VH LQVP E+ ++ I + RR +A
Sbjct: 222 LYLALQSVHEP-----------LQVP--EKYVEPYSFIKDEKRRKYA 255
>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 537
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 121/229 (52%), Gaps = 24/229 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DVG+H + I TPN+D LA G+ L +Y P C+PSR+ +TG+Y G+ +
Sbjct: 51 GWFDVGYH-NSTIKTPNLDLLASRGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHFVI 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ E LPQ LKE GY+THL+GKWH+G K E +P RGFD+ GY +G
Sbjct: 110 IAPQPNCLPLNETTLPQKLKESGYATHLVGKWHLGFYKNECMPLQRGFDSSFGYLSGMQD 169
Query: 142 YNDSIHETDFA--------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y F +G+D N R A + + Y FT+++ VI+ HN ++P
Sbjct: 170 YWTHFRSGSFPGFPEGNHWLGIDFWDN-NRVAWEYTGNYSQFVFTERAQRVIQQHNPNQP 228
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH G LQVP E+ + +AH + R+ +A
Sbjct: 229 LFLYLPLQSVH-----------GPLQVP--EKYMKPYAHFQDVGRQTYA 264
>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
Length = 572
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 76/188 (40%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 58 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 117 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 176
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SHN RPLFL
Sbjct: 177 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHNPQRPLFLY 233
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 234 VAFQAVHT 241
>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
caulinodans ORS 571]
gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
caulinodans ORS 571]
Length = 490
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 89/221 (40%), Positives = 121/221 (54%), Gaps = 22/221 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G+ DVGFHG +DI TPN+D LA G L + YT P CTP+RAA +TG+YP RYG+ T V
Sbjct: 58 GFADVGFHG-SDIKTPNLDKLAATGATLGQFYTQPMCTPTRAALMTGRYPLRYGLQTGVI 116
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+G + + E LLPQ LK +GYST LIGKWH+G K++ P RGFD G G +
Sbjct: 117 PSGASYGLATDEFLLPQALKSVGYSTALIGKWHLGHAKQDFWPRQRGFDYFYGPLVGEID 176
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
+ HE V D R+ ++ + Y T+ F + +I +H+ PLFL +
Sbjct: 177 HYK--HEAHGVV--DWYRDNKQV---VEEGYDTELFGTDAARLIGAHDPKTPLFLYLAFT 229
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A HT Q P DR + +I++P RRL+A
Sbjct: 230 APHTP-----------FQAP-QAYVDR-YPNITDPARRLYA 257
>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
Length = 514
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/192 (38%), Positives = 110/192 (57%), Gaps = 15/192 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DVG+HG + I TP +DAL G+ L R+YT P CTPSR+ L+G+Y G+ +
Sbjct: 35 GWGDVGWHG-SAIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRYQIHTGLQHQII 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ EKLLP+ LK+ GY TH++GKWH+G ++E LP RGFD + GYL
Sbjct: 94 WPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFDTYF----GYLL 149
Query: 142 YNDSIHETDFAVGLDAR---------RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
++ + D V + A+ R+ E A + Y T+ FT++++ +I +H +
Sbjct: 150 GSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIANHKTEK 209
Query: 193 PLFLQITHAAVH 204
PLFL + +VH
Sbjct: 210 PLFLYLAFQSVH 221
>gi|424863174|ref|ZP_18287087.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
gi|400757795|gb|EJP72006.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
Length = 519
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/221 (38%), Positives = 121/221 (54%), Gaps = 13/221 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DV ++G I TPNID LA +G+ +NR Y+ PTC+P+RAA TG + GI P+
Sbjct: 31 GWGDVSYNG-GPINTPNIDKLADDGLQMNRFYSAPTCSPTRAALFTGINSLKNGIIRPLN 89
Query: 83 AGVAK--AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
A+ +P+ K+LP+YLKE+GY T L GKWH+G +E LP NRGF++ G+ G +
Sbjct: 90 NPTAERYGLPLKHKILPEYLKEIGYQTALSGKWHLGMFSDEYLPRNRGFESTYGHLGGGI 149
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D + LD RN E Y T D+++ +I++ N+ PLFL +
Sbjct: 150 GYFDHA----LSGRLDWHRNGEIL---YEDGYSTTLIADEAIRIIENKNNETPLFLYVAF 202
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTF-AHISNPDRRL 240
A HT K+ L + D +E R + A+I DR +
Sbjct: 203 NAPHTPIQAEEKIINNLSDISDKKE--RVYAANIITLDREI 241
>gi|260816809|ref|XP_002603280.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
gi|229288598|gb|EEN59291.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
Length = 377
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 81/217 (37%), Positives = 124/217 (57%), Gaps = 19/217 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DV ++ + TPN+ LA G++ N+ Y PTC+PSR A LTGK+PFR G+ +
Sbjct: 36 GWSDVSWNNPY-VVTPNLHTLATTGVIFNQTYAQPTCSPSRTALLTGKFPFRLGMQRVMD 94
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ +P+ E+LLPQ LK+LGY+TH++GKWH+G K E P RGFD+ GY +G Y
Sbjct: 95 SKKPHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGFDSFYGYHHGSQDY 154
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
H++ A GLD + Q + Y T+ F ++ ++I H+ + PLFL + +
Sbjct: 155 --YTHKS--ARGLDFWDGKTSISDQ-NGVYSTESFATRAENIISQHDPNTPLFLYLPFQS 209
Query: 203 VHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
VHT ++P+ LQ TF+ I + +R+
Sbjct: 210 VHTP----HQVPSSYLQ---------TFSTIQDDNRK 233
>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
Length = 613
Score = 137 bits (345), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 87/254 (34%), Positives = 129/254 (50%), Gaps = 42/254 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDV FHG + IPTP+IDALA G++L +Y P CTP+R+A +TGK+P G++ V
Sbjct: 39 GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLEHGVI 98
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWH-----IGCN------------------ 118
G + + EKL+PQYL+ELGY TH++GK IG +
Sbjct: 99 GVSHPYGLGLEEKLMPQYLRELGYRTHMVGKVSLEHGVIGVSHPYGLGLEEKLMHQYLRE 158
Query: 119 ---------KEELLPFNRGFDNHVGYWNGYLT-YNDSIHETDFAVGLDARRNMERYAPQM 168
KE L P +RGF++ GY+ G Y I G D + + P +
Sbjct: 159 LGYRTHMVGKESLTPSHRGFESFYGYYAGMGDYYTHEITSDGNMTGFDFHMDGSVHKP-V 217
Query: 169 SSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDR 228
+Y T+ FT+++ +I HN PL++ + H AVH+ + LQ P E +
Sbjct: 218 FGQYSTEIFTERTQEIILKHNPKEPLYIYLAHQAVHSANYDGQR-----LQAP--HEYYK 270
Query: 229 TFAHISNPDRRLFA 242
F +I++ +RR +A
Sbjct: 271 RFPNITHENRRKYA 284
>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
Length = 481
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 45 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 103
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 104 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 163
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 164 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 220
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 221 VAFQAVHT 228
>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
Length = 569
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
Length = 569
Score = 137 bits (344), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
Length = 569
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
Length = 569
Score = 136 bits (343), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
Length = 569
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
Length = 572
Score = 136 bits (343), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
Length = 487
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 84/218 (38%), Positives = 119/218 (54%), Gaps = 20/218 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+H D+ TP +D LA+ G++LN+ Y CTPSR AF+TG YP+ G V
Sbjct: 40 GWNDVGWHNP-DVKTPVLDKLAHEGVILNQSYVNYVCTPSRTAFMTGYYPYHAGSQHLVF 98
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A+ +P LP+ LK+LGY+TH++GKWH+G + P RGFD+ GY+N
Sbjct: 99 LPQQAQGIPYNFTFLPEKLKDLGYATHMVGKWHLGFCNWKYTPTYRGFDSFYGYYNADED 158
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + A GLD R + E + + +Y T FFTD+ V +I+ H PLFL +
Sbjct: 159 YYTHV----VAGGLDLRDDKEVVNTK-NGQYGTYFFTDRMVDIIEKHPADTPLFLYLPFQ 213
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
VH L+VP+ EN + ++ N +RR
Sbjct: 214 NVHEP-----------LEVPERFEN--IYMNVQNENRR 238
>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
Length = 573
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 59 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 117
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 118 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 177
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 178 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTLLYAQRASHILASHSPRRPLFLY 234
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 235 VAFQAVHT 242
>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
Length = 571
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPRRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
Length = 573
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 58 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 117 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMLGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 176
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 177 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 233
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 234 VAFQAVHT 241
>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 507
Score = 136 bits (342), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 121/230 (52%), Gaps = 30/230 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW+DVG+H ++ I TPNID LA G+ L +Y P CTP+RA +TG+Y G+ V
Sbjct: 40 GWHDVGYH-DSIIRTPNIDKLAAEGVKLENYYVTPLCTPTRAVLMTGRYQIHTGMQHGVL 98
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A + +P E L+PQ LKE GY+TH++GKWH+G K P +RGFD G YL
Sbjct: 99 MAQEPRCLPTDEVLMPQKLKESGYTTHMVGKWHLGFYKWACTPNHRGFDTFFGM---YLA 155
Query: 142 YNDSIHETDFAVG-----LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
D + T G D R + AP+ KY T F +++ +IK H+ + PLFL
Sbjct: 156 GGDYFNHTRLCHGRRLAAWDLRDGDQVVAPEYVGKYSTIVFAEKAQEIIKKHDPTNPLFL 215
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPD----MEENDRTFAHISNPDRRLFA 242
++ AVH LQVP+ M ++D I + RR++A
Sbjct: 216 YLSFQAVHAP-----------LQVPERYINMYKDD-----IRDESRRIYA 249
>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
melanoleuca]
Length = 573
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 81/214 (37%), Positives = 116/214 (54%), Gaps = 14/214 (6%)
Query: 2 DTPVGAGVAKAVPVTEKLL------PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
D P AGV + P + QG++DVG+HG +DI TP +D LA G+ L +Y
Sbjct: 32 DGPGEAGVEQPXPSQPPHIIFILTDDQGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYI 90
Query: 56 LPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWH 114
P CTPSR+ LTG+Y G+ + + +P+ + LPQ L+E GYSTH++GKWH
Sbjct: 91 QPICTPSRSQLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWH 150
Query: 115 IGCNKEELLPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSK 171
+G ++E LP RGFD +G G Y TY++ + G D E A +S +
Sbjct: 151 LGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQ 207
Query: 172 YLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
Y T + + H++ SH+ RPLFL + AVHT
Sbjct: 208 YSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT 241
>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
Length = 573
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SHN PLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|313235834|emb|CBY19819.1| unnamed protein product [Oikopleura dioica]
Length = 518
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/225 (36%), Positives = 117/225 (52%), Gaps = 22/225 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW D G HG + TPN+DA+A +GI+L ++YT C+P+R++ LTG+YP RYG+ +
Sbjct: 29 GWGDFGVHGSK-LETPNLDAIARDGILLEKYYTQQVCSPTRSSLLTGRYPIRYGMQHNVI 87
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AG +P LLPQ LK G++TH++GKWH G + E +LPFNRGFD++ GY G
Sbjct: 88 LAGQTTGIPKEYALLPQDLKSCGFATHMVGKWHCGHSHEYMLPFNRGFDSYYGYLQGAED 147
Query: 142 YNDSIH-ETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVI-KSHNHSRPLFLQ 197
+ I + G+D P S+ Y T+ ++ Q V+ K +P FL
Sbjct: 148 HYSRIQCQAKEWCGVDF---CTENGPTNSTWGTYGTEIYSAQVAQVLDKVSKEEKPFFLY 204
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
VH LQ P E F I +PDRR +A
Sbjct: 205 YAMQNVHDP-----------LQAP--EHYKIKFDWIEDPDRRTYA 236
>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
Length = 516
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 120/229 (52%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DVG+H ++I TP +D L+ G+ L +Y P CTPSR +TG+Y G+ +
Sbjct: 36 GWYDVGYH-LSEIRTPILDKLSSGGVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQII 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+ EKLLPQ +KE GY+TH++GKWH+G K++ LP RGFD+++GY G
Sbjct: 95 WPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCLPTRRGFDSYLGYLTGSED 154
Query: 142 YN--------DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y +++ T A+ L R E A Y T+ F+ ++V +I+ H + P
Sbjct: 155 YYTHIRCHPISALNLTRCALDL---REAEAVARSYKGTYSTELFSQRAVSIIEKHTSTEP 211
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + AVH LQVP E ++ I + RR +A
Sbjct: 212 LFLYVAFQAVHAP-----------LQVP--ERYVAPYSFIQDHSRRSYA 247
>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 510
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 78/213 (36%), Positives = 115/213 (53%), Gaps = 14/213 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DV FHG IPTPNID LA +G++LN +Y LPTCTPSRAA +TG YP G+ + +
Sbjct: 14 GWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPTCTPSRAALMTGLYPIHTGMQSDII 73
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
A +P+ K+LPQ+ ++LGY ++IGKWH+G K +P RGFD G++ G
Sbjct: 74 EPAAPWGLPLENKILPQHFRDLGYDVNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSND 133
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y + + V L A A ++ + TD ++V + +P F
Sbjct: 134 YYNHTSGSENNEQGVSLVATLTA---AWGRIAQGVEPLLTDSPLNV-----YPQPFFCYF 185
Query: 199 THAAVHTGTAGNA-KLPT-GLLQVPDMEENDRT 229
+H AVH+ + P +L+ P + E++RT
Sbjct: 186 SHHAVHSALMAEPFQAPARNVLKFPYIGESNRT 218
>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
Length = 597
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 88 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 146
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+ELGYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 147 IRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 206
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + + H++ SH+ +PLFL
Sbjct: 207 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 263
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 264 VAFQAVHT 271
>gi|449677596|ref|XP_004208885.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 193
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/152 (48%), Positives = 91/152 (59%), Gaps = 6/152 (3%)
Query: 10 AKAVPVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTG 69
A V K + G+NDV FHG IPTPNID +A G++LN +Y LP CTPSR+A +TG
Sbjct: 33 ANIVSSNFKEINLGFNDVSFHGSKQIPTPNIDKIAKEGVILNNYYVLPICTPSRSAIMTG 92
Query: 70 KYPFRYGI-----DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLP 124
+YP GI DT + A A V + EK LPQYLK +GY TH IGKWH+G +E P
Sbjct: 93 RYPIHTGIFYHTYDT-IFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTP 151
Query: 125 FNRGFDNHVGYWNGYLTYNDSIHETDFAVGLD 156
RGFD+ GY+ G Y D ++ GLD
Sbjct: 152 TYRGFDSFYGYYGGQADYWDHSLASNGWWGLD 183
>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
Length = 565
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 51 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 109
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+ELGYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 110 IRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 169
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + + H++ SH+ +PLFL
Sbjct: 170 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 226
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 227 VAFQAVHT 234
>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
Length = 574
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 60 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 118
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+ELGYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 178
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + + H++ SH+ +PLFL
Sbjct: 179 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 235
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 236 VAFQAVHT 243
>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
Length = 575
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 60 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 118
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 178
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + + H++ SH+ RPLFL
Sbjct: 179 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 235
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 236 VAFQAVHT 243
>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
Length = 503
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 79/224 (35%), Positives = 128/224 (57%), Gaps = 22/224 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G++DVG+HG ++I TP +DAL+ +G+ L +Y P CTP+R+ ++G+Y G+ +
Sbjct: 45 GFHDVGYHG-SEIHTPTLDALSASGVRLENYYVQPICTPTRSQLMSGRYQIHTGLQHGII 103
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+ A+P L LKE GY+TH++GKWH+G K+E LP+NRGFD + GY N
Sbjct: 104 NSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNRGFDTYFGYLNAAED 163
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y +N + + LD R N + + +Y FT +++ V++SHN S+PLFL +
Sbjct: 164 YFNHNVPWRQVRY---LDLRDNNGPVRNE-TGQYSAHLFTGKAIDVVQSHNTSKPLFLYL 219
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +VH L+VP E+ + + +I++ +RR FA
Sbjct: 220 AYQSVHAP-----------LEVP--EKYEHKYRNITDKNRRTFA 250
>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
Length = 573
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 58 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 117 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 176
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + + H++ SH+ RPLFL
Sbjct: 177 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 234 VAFQAVHT 241
>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
Length = 570
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 75/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 55 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 113
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 114 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 173
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + + H++ SH+ RPLFL
Sbjct: 174 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 230
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 231 VAFQAVHT 238
>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
Length = 573
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ +PLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
Length = 484
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 81/205 (39%), Positives = 111/205 (54%), Gaps = 18/205 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+H + TPNID LA G++LN+ Y P C+PSR AF+TG YP+R G+ V
Sbjct: 37 GWNDVGYHNPA-MKTPNIDKLAREGLILNQTYFQPLCSPSRHAFMTGYYPYRAGLQHLVI 95
Query: 83 AGVAKAV-PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
P+ K LPQ LK+LGY+TH++GKWH+G + P RGFD+ G+++
Sbjct: 96 MPWQPVCSPLNMKFLPQRLKDLGYATHMVGKWHLGMCNWDCTPTYRGFDSFFGFYHAKAD 155
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y I LD R N E+ ++ Y T FT ++ +IK HN S+PLFL +
Sbjct: 156 YYSHISYK----YLDYRDN-EKPVKNLNGTYSTFTFTSRAQDIIKKHNSSQPLFLYMAFP 210
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEEN 226
+P LQVP E+
Sbjct: 211 -----------IPHEPLQVPQQYED 224
>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
Length = 569
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ +H+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
Length = 572
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ +H+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
Length = 613
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 109/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 102 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 160
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 161 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 220
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ +PLFL
Sbjct: 221 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTMLYAQRASHILASHSPRQPLFLY 277
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 278 VAFQAVHT 285
>gi|395504882|ref|XP_003756775.1| PREDICTED: arylsulfatase I [Sarcophilus harrisii]
Length = 598
Score = 135 bits (339), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 224 QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 282
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E+GYSTH++GKWH+G K+ LP RGFD +G G
Sbjct: 283 IRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDTFLGSLTGNV 342
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A + S +Y T + ++ ++ SHN +PLFL
Sbjct: 343 DYYTYDNC--DGPGVCGYDLHEG-ESVAWEQSGQYSTLLYAQRASQILASHNPRQPLFLY 399
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 400 VAFQAVHT 407
>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
Length = 501
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 77/186 (41%), Positives = 104/186 (55%), Gaps = 11/186 (5%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D GF+G +DIPTPN+D LA G L + Y LP CTP+RAA +TG+YP RYG+ V
Sbjct: 70 MGFGDAGFNG-SDIPTPNLDKLAAEGARLEQFYALPMCTPTRAALMTGRYPLRYGLQVGV 128
Query: 82 -GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
A ++PV E LLPQ LK+ GY+T ++GKWH+G K E P RGFD G G +
Sbjct: 129 IPAAGTYSLPVDEYLLPQALKDTGYTTAMVGKWHLGHAKPEFWPRQRGFDYFYGALVGEI 188
Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
+ S H D RN + P + + F +++V V++ H PLFL +
Sbjct: 189 DHFKHSSHGVK-----DWYRNNK---PLNETGFDNTLFGNEAVRVVERHEGKSPLFLYLA 240
Query: 200 HAAVHT 205
A HT
Sbjct: 241 FTAPHT 246
>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
Length = 556
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 126/229 (55%), Gaps = 35/229 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFHG + I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ +
Sbjct: 88 GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHLGLQHYLI 146
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 147 MTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 206
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ +S++ T A+ L R+ E A + ++ Y T+ FT ++ VI +H +
Sbjct: 207 YYTHEACAPIESLNGTRCALDL---RDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEK- 262
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP EE + I + RR++A
Sbjct: 263 --------SVHDP-----------LQVP--EEYMEPYGFIQDKHRRIYA 290
>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
Length = 569
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LP L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPQQPNCLPLDQVTLPHKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ RPLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|260788444|ref|XP_002589260.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
gi|229274435|gb|EEN45271.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
Length = 455
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 79/227 (34%), Positives = 119/227 (52%), Gaps = 22/227 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+G+H + I TPN+D LA G+ L +Y P C+PSR +TG+Y YG+ + +
Sbjct: 12 GWNDIGYHN-SFIRTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRYQIHYGLQHSVI 70
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E LPQ LKE GYST+++GKWH+G ++E +P RGF+ GY G
Sbjct: 71 MCDRPHGLPLDEVTLPQRLKENGYSTYMVGKWHLGFFRKEYMPLQRGFERFFGYLTGGED 130
Query: 142 YNDSIHETDFAV------GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y + F+ GLD R+ ++ + Y T F +++ +I +H+ S+P+F
Sbjct: 131 YWTHRKPSQFSKDPSEFHGLDL-RDQDKPVLDQNGTYSTHLFARKAIEMILNHDQSKPMF 189
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + AVH G L+ P EE R + I N R +A
Sbjct: 190 LYLPFQAVH-----------GPLEAP--EEYKRIYEDIDNSLVRTYA 223
>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 486
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 85/222 (38%), Positives = 114/222 (51%), Gaps = 23/222 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DVGF+G DI TP++DALA G + Y P CTP+RAA +TG+YPFRYG+ T V
Sbjct: 41 GWQDVGFNGCKDIQTPHLDALAKGGARFTQFYVQPMCTPTRAALMTGRYPFRYGLQTAVI 100
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
V+ + E LLPQ L++ GY+T +IGKWH+G ++ P RGF+ G G L
Sbjct: 101 PSVSTYGLDTGEYLLPQCLQDAGYTTAIIGKWHLGHADKKFWPKQRGFEYQYGAMIGELD 160
Query: 142 -YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y S H LD R+ E P Y T+ +V ++ RP +L +
Sbjct: 161 YYTHSEHGV-----LDWFRDNE---PVHEEGYTTNLLGADAVKYLEKQKADRPFYLYLAF 212
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A HT Q P E DR + HI++P RR +A
Sbjct: 213 NAPHTP-----------YQAP-QEYIDR-YTHIADPTRRTYA 241
>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 437
Score = 134 bits (337), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 83/222 (37%), Positives = 114/222 (51%), Gaps = 12/222 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DVGFHG IP PNIDALA +G++LN++Y P SR LTG YP+R G+ +
Sbjct: 33 GWADVGFHGSRQIPVPNIDALAADGVILNKYYAQPWPLSSRIGLLTGIYPYRTGVGRVML 92
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
A+P ++LP + K LGY TH +G W++G K+ P NRGFD W G Y
Sbjct: 93 PCQPVALPSVFRILPTFFKSLGYRTHFVGVWNLGFYKKRFTPVNRGFDTAYAKWTGPGDY 152
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQS--VHVIKSHNHSRPLFLQITH 200
+T G D R N + Q + Y T FT+++ + +PL L ++H
Sbjct: 153 WTHDMQTKMQ-GFDLRLNDDLMWNQ-TGVYSTRLFTERADPTRKLCFFVLHQPLLLILSH 210
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A+HT G LQ P +E D +F I + RR+FA
Sbjct: 211 QALHTANY------HGSLQCP-LEHLD-SFGFIRDRKRRIFA 244
>gi|354488427|ref|XP_003506371.1| PREDICTED: arylsulfatase I [Cricetulus griseus]
Length = 560
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 112 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 170
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 171 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLPSCRGFDTFLGSLTGNV 230
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ PLFL
Sbjct: 231 DYYTYDNC--DGPGVCGFDLHEG-ETVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLY 287
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 288 VAFQAVHT 295
>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
Length = 488
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 74/183 (40%), Positives = 103/183 (56%), Gaps = 7/183 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW D+G+HG + I TPNI+ LA +GI+L+ +Y P CTP+R+A +TGKYP G V
Sbjct: 36 GWFDLGYHG-SVIRTPNINQLAGDGIILDNYYVQPLCTPTRSALMTGKYPIHLGTQHGVI 94
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P+ LP+ LK+ GY+TH++GKWH+G KE+ +P RGFD+ GY+ G
Sbjct: 95 LPGQPMGLPLDSSTLPEQLKQQGYATHIVGKWHLGFYKEDFVPTKRGFDSFYGYYCGA-- 152
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
H T +G R+ + Y T FT ++V I HN S PLFL +
Sbjct: 153 ---EDHFTHNVLGFLDFRDNDLIVKDQKGTYGTRAFTKRAVDTIHRHNSSSPLFLYLPFQ 209
Query: 202 AVH 204
VH
Sbjct: 210 NVH 212
>gi|156340112|ref|XP_001620356.1| hypothetical protein NEMVEDRAFT_v1g148421 [Nematostella vectensis]
gi|156205165|gb|EDO28256.1| predicted protein [Nematostella vectensis]
Length = 260
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 76/190 (40%), Positives = 111/190 (58%), Gaps = 17/190 (8%)
Query: 23 GWNDVGFHG-ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG----- 76
GW+DVG+H + + TPNID LA G+ L +Y+ P CTPSR A +TGKYP G
Sbjct: 36 GWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRGALMTGKYPIHLGMQHFV 95
Query: 77 IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
I+ G+ + P +PQ L+ LGY T +IGKWH+G + P RGFD+ +G++
Sbjct: 96 INITSPWGMPRRFPT----IPQKLRTLGYRTSMIGKWHLGFFDWDYTPLRRGFDSFLGFF 151
Query: 137 NGYLTYNDSIHETDFAVG-LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G + H +G LD RR+ E A + ++ TD FT +++++ HN S+PLF
Sbjct: 152 AG-----EQDHWRHSKMGFLDFRRD-EEPANEYGGQHSTDVFTQEAINIAMRHNASQPLF 205
Query: 196 LQITHAAVHT 205
L +++AAVHT
Sbjct: 206 LLLSYAAVHT 215
>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
Length = 478
Score = 134 bits (336), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 107/190 (56%), Gaps = 9/190 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+D+G+HG I TP +D LA G+ L +Y P C+PSR +TG+Y RYG+ + +
Sbjct: 12 GWDDIGYHGSF-IQTPKLDRLAKEGVKLENYYVQPICSPSRCQLMTGRYQIRYGLQHSVI 70
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ E LPQ LKE GYST+++GKWH+G ++E +P RGFD GY G
Sbjct: 71 TSDRPHGLPLDEVTLPQKLKENGYSTYVVGKWHLGFFRKEHMPLQRGFDKFYGYLTGGED 130
Query: 142 YNDSIHETDFAV------GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y +A GLD R+ ++ + Y T F +++ +I++H S+P+F
Sbjct: 131 YWTHRRPNLYAKDPLAFHGLDL-RDQDKPVLDQNGTYSTHLFAKKAIEIIQNHERSKPMF 189
Query: 196 LQITHAAVHT 205
L + AVH+
Sbjct: 190 LYLPFQAVHS 199
>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 496
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/184 (40%), Positives = 104/184 (56%), Gaps = 9/184 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G +DVG+ G ++I TP++D LA G L++ Y P C+P+RAA LTG+YPFRYG T V
Sbjct: 64 GSHDVGWRG-SEIKTPHLDELARAGATLDQFYVQPVCSPTRAALLTGRYPFRYGFQTGVV 122
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A+ +P+ E+ LPQ LKE GY T + GKWH+G + LP RGFD+ G++NG L
Sbjct: 123 RPWAEYGLPLEERTLPQALKEAGYETAITGKWHLGHFQPAYLPTKRGFDHQYGHYNGMLD 182
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y I G D RN + Y T+ ++ ++ + SRPLFL +
Sbjct: 183 YYTHIRHG----GFDWHRNDQE---NHDEGYSTELVGKEAARRVRERDKSRPLFLYVPFN 235
Query: 202 AVHT 205
VH+
Sbjct: 236 GVHS 239
>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
Length = 571
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 116 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 175
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + + H++ SH+ +PLFL
Sbjct: 176 DYYTYDNC--DGPGVCGFDLHEG-ENVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLY 232
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 233 VAFQAVHT 240
>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
Length = 498
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 86/221 (38%), Positives = 112/221 (50%), Gaps = 21/221 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG +DI TPNID LA G L + Y P CTP+RAA +TG+YPFRYG+ T V
Sbjct: 67 GWKDVGFHG-SDIKTPNIDQLAEKGGRLEQFYAQPMCTPTRAALMTGRYPFRYGMQTAVI 125
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + + + LLP+ LKE GY+T GKWH+G P RGFD+ G G +
Sbjct: 126 PQGGTYGLALDDYLLPEMLKEAGYATAASGKWHLGHADTAFWPRQRGFDSFYGALLGEI- 184
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
D D RN E + + F ++V VI H+ S+PLFL +
Sbjct: 185 --DHFTHKSANGNADWYRNNEAIE---EAGFDNILFATEAVRVINEHDQSKPLFLYLAFT 239
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ HT Q P E DR +HI++ RR +A
Sbjct: 240 SPHTP-----------FQAPK-EYLDRN-SHIADESRRAYA 267
>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
Length = 540
Score = 133 bits (335), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 76/190 (40%), Positives = 111/190 (58%), Gaps = 17/190 (8%)
Query: 23 GWNDVGFHG-ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG----- 76
GW+DVG+H + + TPNID LA G+ L +Y+ P CTPSR A +TGKYP G
Sbjct: 46 GWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRGALMTGKYPIHLGMQHFV 105
Query: 77 IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
I+ G+ + P +PQ L+ LGY T +IGKWH+G + P RGFD+ +G++
Sbjct: 106 INITSPWGMPRRFPT----IPQKLRTLGYRTSMIGKWHLGFFDWDYTPLRRGFDSFLGFF 161
Query: 137 NGYLTYNDSIHETDFAVG-LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G + H +G LD RR+ E A + ++ TD FT +++++ HN S+PLF
Sbjct: 162 AG-----EQDHWRHSKMGFLDFRRD-EEPANEYGGQHSTDVFTQEAINIAMRHNASQPLF 215
Query: 196 LQITHAAVHT 205
L +++AAVHT
Sbjct: 216 LLLSYAAVHT 225
>gi|344250866|gb|EGW06970.1| Arylsulfatase I [Cricetulus griseus]
Length = 484
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 59 QGYHDVGYHG-SDIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 117
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 118 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLPSCRGFDTFLGSLTGNV 177
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + ++ H++ SH+ PLFL
Sbjct: 178 DYYTYDNC--DGPGVCGFDLHEG-ETVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLY 234
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 235 VAFQAVHT 242
>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
Length = 477
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 120/221 (54%), Gaps = 15/221 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GWNDVG+ DI TP +D LA +G++LN Y P C+PSR F++GK+P+ G+ D V
Sbjct: 35 GWNDVGWINP-DIKTPTLDRLARSGVILNSSYVQPLCSPSRNCFMSGKFPYHTGLQDKVV 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
K +P +PQ LK LGY TH++GKWH+G + P RGFD +GY+ G
Sbjct: 94 FIEQPKYMPANLTTIPQRLKTLGYDTHMVGKWHLGFCNWKYTPTYRGFDTFMGYYAGMED 153
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + + RNM +Y T F ++++ +I +H+ S+P++L +
Sbjct: 154 YFTHVRDEVADYNGYDFRNMTDVYKGAQGEYSTYVFANRAIDIIMNHDKSKPMYLYLPFQ 213
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH LQVP + +DR ++HI + +R++++
Sbjct: 214 AVHVP-----------LQVP-TKYSDR-YSHIHDLNRKVYS 241
>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 530
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 121/235 (51%), Gaps = 29/235 (12%)
Query: 22 QGWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI--- 77
GWND+G+ + TP + LA NG+ L ++YT TCT SRAA L+G P GI
Sbjct: 37 MGWNDIGYQSTDMAALTPVLSDLAENGVKLTQYYTQSTCTVSRAALLSGVLPMHNGISHG 96
Query: 78 ----DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
D+P+G +P+ KLLPQYL+E GY T+++GKW IG EE LP NRGFD+
Sbjct: 97 TIVMDSPIG------LPLKYKLLPQYLQESGYRTYMVGKWDIGHFNEEYLPHNRGFDHFF 150
Query: 134 GYWNGYLTYNDSIHETDFAVGLDA---RRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-- 188
G++ +TY I + + RN + S +Y TD F +++V ++ H
Sbjct: 151 GFYGADITYFSHISSRGYCANPNCFPDLRNEDETMANASMRYTTDLFRERAVGFVEGHAA 210
Query: 189 NHSR-PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
NH+ PLFL ++ A H T+ + M A +N +RR+FA
Sbjct: 211 NHATDPLFLYLSFNAPHYPTSAPQEF---------MRNEAELLAPFTNRERRVFA 256
>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
Length = 476
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 121/230 (52%), Gaps = 26/230 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G +DVG+HG I TPNID LAY G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 12 GHHDVGYHGSV-IKTPNIDHLAYTGVRLENYYVQPICTPSRSQLMTGRYQIHTGLQHNII 70
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
AVP+ +LP+ LK+ GYSTH++GKWH+G K+E+LP NRGFD++ GY G
Sbjct: 71 NPFQPNAVPLDLPMLPEVLKQNGYSTHMVGKWHLGFYKDEVLPMNRGFDSYYGYLTGSED 130
Query: 139 YLTYNDS---IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS---HNHSR 192
Y T+ G+D R + E Q + KY T F +++ V++ H +
Sbjct: 131 YFTHRRCGALPGANKTVCGIDLRNDFEVDWNQ-TGKYSTQLFAEKAEDVVRKHAVHQPDQ 189
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL + AVH ++P L+ D I +P RRL A
Sbjct: 190 PLFLYVAFQAVHAPN----QVPNEYLKPYD----------IDDPKRRLLA 225
>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
Length = 684
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 85/227 (37%), Positives = 117/227 (51%), Gaps = 32/227 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
GWNDVGFH + TPNID LA G++LN+ Y P C+PSR A +TG YP+ G+
Sbjct: 237 GWNDVGFHNPA-MKTPNIDKLAREGLILNQTYLQPLCSPSRHALMTGYYPYHAGLQHLVI 295
Query: 79 ---TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
PV + P+ K LPQ LK++GY+TH++GKWH+G P RGFD+ GY
Sbjct: 296 LPWQPVCS------PLKMKFLPQRLKDIGYATHMVGKWHLGFCSWNCTPTYRGFDSFFGY 349
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
+N D T + LD R N E+ ++ Y T F ++ +IK HN S+PLF
Sbjct: 350 YNA---QGDHYSHTWYNY-LDYRDN-EKPVKNLNGTYSTFTFVSRAQDIIKKHNSSQPLF 404
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + VH +QVP E+ + +I RR F+
Sbjct: 405 LYMAFQNVHDP-----------IQVPKQYED--MYPNIKTRGRRQFS 438
>gi|390356461|ref|XP_793612.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 181
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 61/127 (48%), Positives = 82/127 (64%), Gaps = 13/127 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
GWNDV FHG + IPTP+IDALA G++L +Y P CTP+R+A +TGK+P G+
Sbjct: 40 GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D P G G E ++PQYL+ LGY TH++GKWH+G KE L P +RGF+++ GY
Sbjct: 100 IADEPYGLG------TNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGY 153
Query: 136 WNGYLTY 142
+ G Y
Sbjct: 154 YGGMQDY 160
>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
Length = 513
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 118/215 (54%), Gaps = 25/215 (11%)
Query: 37 TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKL 95
TP++DALA G++L+ +YT P CTPSR+ LTG+Y R G+ + VP+ EKL
Sbjct: 49 TPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 108
Query: 96 LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------YLTYNDSIH 147
LPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G T D+++
Sbjct: 109 LPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALN 168
Query: 148 ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGT 207
T A+ R+ E A + Y T+ FT +++ +I +H +PLFL + +VH
Sbjct: 169 VTRCALDF---RDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVHEP- 224
Query: 208 AGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LQVP EE + + I + +R +A
Sbjct: 225 ----------LQVP--EEYLKPYDFIQDKNRHHYA 247
>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
Length = 564
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 106/186 (56%), Gaps = 3/186 (1%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ D+G+HG ++I TP +D LA G+ L +Y P C+PSR+ F+TGKY G+ +
Sbjct: 57 QGYRDIGYHG-SEIRTPTLDKLASEGVRLENYYVQPICSPSRSQFITGKYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LK+ GY TH++GKWH+G K+E +P RGFD+ G G
Sbjct: 116 IRPSQPNCLPLDNMTLPQKLKKAGYQTHMVGKWHLGFYKKECMPTQRGFDSFFGSLLGSG 175
Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
YN ++ G D N Q + Y T+ +T + + ++ SHN ++PLFL I
Sbjct: 176 DYYNHYKCDSPGICGYDLYENNNAAWDQDNGIYSTEMYTQRVLQILSSHNPNKPLFLYIA 235
Query: 200 HAAVHT 205
+ AVH+
Sbjct: 236 YQAVHS 241
>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
Length = 472
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/177 (42%), Positives = 98/177 (55%), Gaps = 10/177 (5%)
Query: 67 LTGKYPFRYGIDTPVGAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPF 125
+TGKYP G+ V G+ + +P+TEKLLPQYLKELGY H+ GKWH+G + P
Sbjct: 1 MTGKYPIHTGMQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHIYGKWHLGSYTRKHTPL 60
Query: 126 NRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
RGFD+HVG+W G+ D A GLD RR + A + Y T D+SV I
Sbjct: 61 ERGFDSHVGFWTGHHHMFDHTAVETNAWGLDMRRGFD-VAYDLHGYYTTHVIRDESVAAI 119
Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++HN S+P+FL ++HAA H+ P L PD E A ISN RR FA
Sbjct: 120 RAHNTSQPMFLYVSHAATHSAN------PYDFLPAPD--ETVERLAGISNYSRRKFA 168
>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
Length = 491
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 85/224 (37%), Positives = 116/224 (51%), Gaps = 28/224 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
G+ DVGFHG +DI TPN+D LA G L + YT P CTP+RAAFLTG+YP YG+ +
Sbjct: 60 GFADVGFHG-SDIKTPNLDHLAAQGARLGQFYTQPFCTPTRAAFLTGRYPLHYGLQVGAI 118
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+G + E LLPQ LK++GY T L+GKWH+G ++ P RGFD+ G G +
Sbjct: 119 PSGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHADQKFWPRQRGFDSFYGPLVGEID 178
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKSHNHSRPLFLQI 198
+ HE A + Y K Y T+ F ++V +I +H+ PLFL +
Sbjct: 179 HFK--HE--------AHGVTDWYHDNTQVKEEGYDTELFGKEAVRLIAAHDPKTPLFLYL 228
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A HT Q P + +AHI+ P RR +A
Sbjct: 229 AFTAPHT-----------PFQAPQSYLDQ--YAHIAAPQRRAYA 259
>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 120/230 (52%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+D+G+HG + TP++D LA GI L +Y P CTP+R+ ++G+Y G+ T +
Sbjct: 34 GWHDIGYHGSR-VQTPHLDKLASEGIKLENYYVQPMCTPTRSQLMSGRYQIHTGLQHTVI 92
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN---G 138
+P+ E + Q LKE GYSTH++GKWH+G + LP RGFD+ G++N
Sbjct: 93 NPDQRSCLPLDEVTIAQKLKEAGYSTHMVGKWHLGHYTKGCLPTKRGFDSFFGFYNCAVD 152
Query: 139 YLTYNDSI-----HETDFAV-GLDARRNMERY-APQMSSKYLTDFFTDQSVHVIKSHNHS 191
Y TY +ET + G D RN E + AP Y T ++ VI+ HN S
Sbjct: 153 YYTYEKGKFCKFENETVLRMRGTDLWRNDEEHVAPYYQGHYQTHVLAKEAEDVIRKHNPS 212
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
+PLFL + AVH L+VP + E+ +A + + RR+
Sbjct: 213 KPLFLYLAFGAVHVP-----------LEVPKVYED--MYADVKDNSRRIL 249
>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
Length = 511
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 73/199 (36%), Positives = 99/199 (49%), Gaps = 18/199 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWND GF G + TP ID L GI LN+HY C+P+RAA +TG+YP RYG+ P
Sbjct: 25 GWNDCGFAGTR-VKTPTIDTLRSEGIALNQHYVQKVCSPTRAALMTGRYPHRYGLQFPFC 83
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
G A A+ E LLPQY+K GY+T +GKWH+G + + P RGFD+ G+++ Y
Sbjct: 84 GGAAMALNSNETLLPQYMKSAGYTTRAVGKWHLGFTEWQFTPTFRGFDSFYGFYSCAEDY 143
Query: 143 --------NDS---IHETDF------AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
N S + DF + G D + Y T F + V ++
Sbjct: 144 FFHGLGFKNSSGAHVKSLDFHDDARPSCGADCSKAAFEAVGTDWQHYSTTLFAGRIVDIV 203
Query: 186 KSHNHSRPLFLQITHAAVH 204
H+ S+PLFL H
Sbjct: 204 DGHDPSQPLFLYFASQDTH 222
>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
caballus]
Length = 523
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 123/229 (53%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G +DVG H E+ TP +DALA G++L+ +YT P C PSR+ L+G+Y G+ +
Sbjct: 46 GXHDVGLH-ESRFSTPRLDALAAGGLLLDNYYTQPLCXPSRSQLLSGRYQIHTGLQHQII 104
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 105 WPCQPSCLPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 164
Query: 139 -----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
T+ D+++ T A+ R+ E A + Y T FT+++ +I +H +P
Sbjct: 165 YYSHERCTFIDALNVTRCALDF---RDGEEVATGYKNMYSTSVFTERATALITNHPPEKP 221
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LFL + +VH LQVP EE + + I + +R +A
Sbjct: 222 LFLYLALQSVHQP-----------LQVP--EEYLKPYDFIQDKNRYHYA 257
>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 513
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 71/193 (36%), Positives = 110/193 (56%), Gaps = 13/193 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+D+G+HG + + +P +D LA G+ L +Y P C+P+RA ++G+Y RYG+ V
Sbjct: 37 GWHDIGYHG-SIVRSPYMDFLASEGVKLENYYVQPMCSPTRAQLMSGRYQIRYGLQHLVI 95
Query: 83 AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN---- 137
+A +P E + Q +KE GY+TH++GKWH+G ++E LP NRGFD G+ N
Sbjct: 96 QPDQRACLPPDEVTIAQKMKEAGYATHMVGKWHLGFYRKECLPINRGFDTFFGFLNCLIY 155
Query: 138 ------GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
GY +S ++T G D RN + A + +Y T F +++ VI +H+
Sbjct: 156 HYTYDFGYWHTPESGNKT-IMFGWDLFRNHDCVAKEHKGEYSTILFAEEAQRVIWNHDQE 214
Query: 192 RPLFLQITHAAVH 204
P+FL + AAVH
Sbjct: 215 TPMFLYLPFAAVH 227
>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
Length = 573
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/188 (38%), Positives = 107/188 (56%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 59 QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 117
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L++LGY+TH++GKWH+G ++E LP RGFD +G G
Sbjct: 118 IRPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 177
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A +S +Y T + + ++ H+ RPLFL
Sbjct: 178 DYYTYDNC--DGPGVCGFDLHEG-ESVAWGLSGQYSTLLYAQRVSRILAGHSPRRPLFLY 234
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 235 VAFQAVHT 242
>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
Length = 584
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 64 QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 122
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E+GYSTH++GKWH+G K+ LP RGFD +G G
Sbjct: 123 IRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDTFLGSLTGNV 182
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A + S +Y T + ++ ++ SH+ +PLFL
Sbjct: 183 DYYTYDNC--DGPGVCGYDLHEG-ENVAWEQSGQYSTLLYAQRASQILASHSPHQPLFLY 239
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 240 VAFQAVHT 247
>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 726
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 73/222 (32%), Positives = 120/222 (54%), Gaps = 44/222 (19%)
Query: 23 GWNDVGFHGENDIP--TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-T 79
GWND+G+H D+ TPN+D L+ +G+ ++++Y++ CTP+RAA +TG+YP RYG+
Sbjct: 123 GWNDIGYHS-TDLANVTPNLDRLSASGVKVSQYYSMSICTPARAALMTGRYPVRYGLQYN 181
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
+ G +P+TEKLLP+Y+ E GY +H++GKWH+G P RGF+ + GY N
Sbjct: 182 VIQPGAPWGLPLTEKLLPEYMNEAGYESHMVGKWHLGSYTHAHTPHRRGFETYFGYLNDE 241
Query: 140 LTYNDSIHETDFAVGLDARR--------------NMERYAP------------------- 166
Y H+T + ++ R+ +ER+ P
Sbjct: 242 EMY--WTHQT-WTATINGRKFFDFGFGNATGFYDVIERFDPPPGDDDLVSTGPTSSVYSS 298
Query: 167 --QMSSKYLTDFFTDQSVHVI--KSHNHSRPLFLQITHAAVH 204
++ Y T+ FTD+++ ++ K+ + PLFL ++H AVH
Sbjct: 299 SLEIKGDYSTEIFTDRALEILSQKTPHDENPLFLYLSHQAVH 340
>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
Length = 515
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G N++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKECMPTRRGFDTFFGSLLGSG 204
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 263 TAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
obscuriglobus UQM 2246]
Length = 459
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 70/184 (38%), Positives = 100/184 (54%), Gaps = 8/184 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G D GF G +I TPNID +A G L+ Y P C+P+RAAF+TG+YP R+G+ V
Sbjct: 36 GREDCGFMGGKEIKTPNIDKIAAAGATLDAFYAQPVCSPTRAAFMTGRYPMRHGLQVGVV 95
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A+ +P+ E+ + Q LK+ GY+T +IGKWH+G E LP RGFD+ G++NG L
Sbjct: 96 RPWAQYGLPLDERTVAQGLKDAGYTTAVIGKWHLGHFAPEYLPTKRGFDHQYGHYNGALD 155
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y I + F D + N + Y T +V +++H +P FL +
Sbjct: 156 YFTHIRDGGFDWHRDDKVNSD-------EGYSTHLVAKDAVQFVQTHAGKKPFFLYVPFN 208
Query: 202 AVHT 205
AVH
Sbjct: 209 AVHA 212
>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 570
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 84/226 (37%), Positives = 121/226 (53%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H +DI TP +D LA +G+ L +Y P CTPSR+ F+TG+Y G+ +
Sbjct: 58 QGFNDIGYH-SSDIRTPVLDKLAADGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P + LPQ L+ELGYSTH++GKWH+G K+E LP RGFD + G G
Sbjct: 117 IRPCQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSV 176
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
Y TY+ + G D E A S KY T +T + ++ +H+ S+PLF+
Sbjct: 177 NYYTYDGC--DGAGLCGFDLHEG-ESVAWGQSGKYSTHLYTQRVRKILATHDPQSQPLFI 233
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++ AVHT LQ PD E + + N RR +A
Sbjct: 234 FLSFQAVHTP-----------LQYPD--EYIYPYLGLENVARRKYA 266
>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
Length = 497
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 93/247 (37%), Positives = 120/247 (48%), Gaps = 32/247 (12%)
Query: 5 VGAGVAKAVPVTEKLL-----PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTC 59
GAG A+A ++ GW DVG+HG +DI TPNID LA G L + Y P C
Sbjct: 43 AGAGEARAQGAAPNIVYIISDDSGWKDVGYHG-SDIRTPNIDRLAAEGARLEQFYVQPMC 101
Query: 60 TPSRAAFLTGKYPFRYGIDTPVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCN 118
TPSRAAF+TG+YPFRYG+ T V + + E LPQ LK+ GY T + GKWH+G +
Sbjct: 102 TPSRAAFMTGRYPFRYGLQTAVIPQSGTYGLALDEYPLPQVLKDAGYYTAMSGKWHLGHS 161
Query: 119 KEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSK---YLTD 175
K P RGFD+ G G E D A N + Y + K Y
Sbjct: 162 KTAYWPRQRGFDSFYGALLG---------EIDHFTHKAANGNPDWYRNNKALKEEGYDNI 212
Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISN 235
++V VI H+ +PLFL + A HT Q P E DR +HI++
Sbjct: 213 LIGAEAVRVINKHDQQKPLFLYLAFTAPHTP-----------YQAPK-EYLDRN-SHIAD 259
Query: 236 PDRRLFA 242
RR +A
Sbjct: 260 ESRRKYA 266
>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
Length = 492
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 117/226 (51%), Gaps = 20/226 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWND+G+H + I TPN+D LA G+ L +Y P C+PSRA +TG+Y RYG+ V
Sbjct: 27 GWNDIGYH-SSLIQTPNLDRLAQEGVKLENYYIQPICSPSRAQLMTGRYQIRYGMQHSVL 85
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+ +P+ E LPQ LKE GY+TH++GKWH+G K+E LP RGFD G+ G
Sbjct: 86 MSDRPHGLPLGEVTLPQVLKESGYATHIVGKWHLGHFKKEYLPTWRGFDTFFGFLGGGED 145
Query: 139 YLTYN--DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y T+ + I ET + + + Y T F +S+ +I H+ +P+FL
Sbjct: 146 YFTHRIPNEIVETPETYRAFDFWDGSKPCLSENGSYSTHVFARKSIDLISRHDKDKPMFL 205
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ AVH L+ P EE + HI + + R +A
Sbjct: 206 YLPFQAVHAP-----------LEAP--EEFINKYTHIRSKNMRTYA 238
>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
Length = 517
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 85/229 (37%), Positives = 124/229 (54%), Gaps = 35/229 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+GFHG + I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ +
Sbjct: 51 GWNDLGFHG-SVIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLI 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
VP+ EKLLPQ LK+ GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 110 MTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 169
Query: 139 YLTYN-----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T+ + ++ T A+ L R+ E A + + Y T+ FT ++ +I +H +
Sbjct: 170 YYTHEACAPIECLNGTRCALDL---RDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEK- 225
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP EE + I + RR++A
Sbjct: 226 --------SVHDP-----------LQVP--EEYMEPYDFIQDKHRRIYA 253
>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 673
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 72/187 (38%), Positives = 105/187 (56%), Gaps = 5/187 (2%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H +DI TP +D LA G+ L +Y P CTPSR+ F+TG+Y G+ +
Sbjct: 58 QGFNDIGYH-SSDIKTPTLDKLAAKGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P + LPQ L++LGYSTH++GKWH+G K+E LP RGFD + G G +
Sbjct: 117 IRPRQPNCLPFDQVTLPQRLQQLGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSV 176
Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQI 198
Y S + G D E A KY T FT + ++ H+ S+PLF+ +
Sbjct: 177 NYYTYSSCDGPELCGFDLHEG-ESVAWDQGGKYSTHLFTQRVRKILARHDPQSQPLFIFL 235
Query: 199 THAAVHT 205
+ AVH+
Sbjct: 236 SFQAVHS 242
>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
Length = 601
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 88 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 146
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 147 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 206
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 207 DYYTHYKC--DSPGMCGYDLYENDSAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLY 264
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 265 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 296
>gi|291239589|ref|XP_002739705.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 489
Score = 129 bits (325), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 82/222 (36%), Positives = 119/222 (53%), Gaps = 23/222 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+H DI P ++ LA +G++ N+ Y P CTPSRAA LTG YPF+ +
Sbjct: 34 GWNDVGWHNA-DIKMPILNQLAADGVIFNQSYVQPACTPSRAALLTGYYPFKIQRQHQML 92
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A + + K LP+ LK++GY THL+GKWH+G KEE LP RGFD+ + G+LT
Sbjct: 93 LNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLGFCKEEYLPNKRGFDS----FYGWLT 148
Query: 142 YNDSIH--ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
+++ E A G D R N Q S YL +++V ++ H PLFL+ +
Sbjct: 149 LGTTLYSKENIIAPGYDFRDNTG--VVQESDTYLPFMLAERAVDIVMGHYKEYPLFLEFS 206
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
A L L+VP +E + ++ I + +R F
Sbjct: 207 MA-----------LSGKFLEVP--QEYEDLYSDIEDDRQRKF 235
>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
Length = 601
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 88 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 146
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 147 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 206
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++PLFL
Sbjct: 207 DYYTHYKC--DSPGMCGYDLYENDNAAWDSDNGIYSTQMYTQRVQQILASHNPTKPLFLY 264
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ + AVH+ LQ P R F H I N +RR +A
Sbjct: 265 LAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 296
>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
Length = 584
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 78/222 (35%), Positives = 107/222 (48%), Gaps = 39/222 (17%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DV FHG IPTP ID A G++LN +Y P CTPSRA+ +TGKYP
Sbjct: 38 GWDDVSFHGSPQIPTPYIDFYANRGVILNNYYVSPMCTPSRASMMTGKYPIN-------- 89
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+G WH+G +E P RGFD+ G+WN Y
Sbjct: 90 ---------------------------LGMWHLGFFTKEYTPVYRGFDSFYGFWNAKTDY 122
Query: 143 -NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
N S +E +F G+D R NME + Y T+ FT ++V VI++H+ S PLFL + H
Sbjct: 123 WNHSSYENNFW-GVDLRDNMEPVQSE-DGTYGTELFTREAVKVIEAHDTSTPLFLYVAHQ 180
Query: 202 AVHTGTAGN-AKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVHT + P + V + R I + R+++A
Sbjct: 181 AVHTANPNEPLQAPQDKIDVSLKQRQQRFKGTIDDDQRQVYA 222
>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
Length = 597
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 85 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 143
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 144 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 203
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 204 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 261
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 262 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 293
>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
Length = 599
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN ++P+FL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
Length = 596
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 585
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/182 (43%), Positives = 109/182 (59%), Gaps = 11/182 (6%)
Query: 23 GWNDVGFHGEND-IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
GWNDVG++ ND +PTP ++ LA NG++LN Y+ P C+PSRAA LTGKYP GI
Sbjct: 119 GWNDVGWN--NDFMPTPILNELASNGVILNNTYSQPACSPSRAALLTGKYPANAGIQHLV 176
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
V +P+ LL LKELGY H IGKWH+G + P RGFD+ G +NGYL
Sbjct: 177 VQEQHPYYLPLHNTLLSTKLKELGYMNHAIGKWHLGFCNWKYTPLWRGFDSFYGIFNGYL 236
Query: 141 T-YNDSIHETDF-----AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
+ Y+ I + F A GLD R N A + + ++T FT+++ +I++HN + PL
Sbjct: 237 SDYSTHIVHSPFIGEPGASGLDLRDNTGVVAHE-NGTHVTYLFTERAERIIRNHNPAAPL 295
Query: 195 FL 196
FL
Sbjct: 296 FL 297
>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
Length = 599
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
Length = 596
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN ++P+FL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
Length = 597
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN ++P+FL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
Length = 578
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN ++P+FL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
Length = 596
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN ++P+FL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 648
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DVG+H ++ I TPNID LA G+ L +Y P CTP+RA +TG RY I T +
Sbjct: 39 GWHDVGYH-DSIIRTPNIDKLAAEGVKLENYYVTPICTPTRAVLMTG----RYQIHTTMQ 93
Query: 83 AGV-----AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW- 136
GV + +P E L+PQ LKE GYSTH++GKWH+G K + P +RGFD G++
Sbjct: 94 HGVLMAQEQRCLPTDEVLMPQKLKESGYSTHMVGKWHLGFYKWDCTPNHRGFDTFFGFYL 153
Query: 137 --NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y T+ H D R + P+ + +Y T + ++ VI+ + + P+
Sbjct: 154 AGGEYFTHTRKCHGHRLD-AWDLRDGDKMVGPEYTGEYSTMLYARKAQEVIRKQDPNVPM 212
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
FL ++ AVH L+VPD D I + R+L+A
Sbjct: 213 FLYVSFQAVHAP-----------LEVPD-SYADAYGKDIYDQSRKLYA 248
>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
Length = 661
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 105/188 (55%), Gaps = 7/188 (3%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 148 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 206
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 207 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 266
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 267 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 324
Query: 198 ITHAAVHT 205
I + AVH+
Sbjct: 325 IAYQAVHS 332
>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
Length = 599
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
Length = 598
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSG 204
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLY 262
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
Length = 599
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN ++P+FL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 1410
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 83/225 (36%), Positives = 117/225 (52%), Gaps = 27/225 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+HG + I TP++D LA G L +Y C+PSR FLTGK+ G++ +
Sbjct: 38 GWNDVGYHG-SSISTPHMDTLAKEGTKLENYYVAHLCSPSRGMFLTGKHMIHLGMEGGII 96
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
K +PV E + Q LK YSTH IGKWH+G K+ LP NRGFD G G
Sbjct: 97 MPFERKCLPVNEATIAQELKLKNYSTHAIGKWHVGYYKKACLPNNRGFDTFFGIIGG--C 154
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
+ H+ L RN A + Y TD + ++ +VI++H+ S+P+F+ +
Sbjct: 155 ADHYTHKNTHWWEL--YRNNISIAQEYQGHYSTDLYAREATNVIRNHDASKPMFMYLAFQ 212
Query: 202 AVHTGTAGNAKLPTGLLQVP----DMEENDRTFAHISNPDRRLFA 242
A H LP LQ P DM +++I +PDRR++A
Sbjct: 213 AAH--------LP---LQAPRKYIDM------YSNIEDPDRRVYA 240
>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
Length = 551
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 117/230 (50%), Gaps = 29/230 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG +DI TP +D LA G+ L +Y P C+PSR+ +TG+Y G+ +
Sbjct: 34 QGFRDVGYHG-SDIKTPTLDRLAAEGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSV 92
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------ 134
+ A +P+ LPQ LK GYSTH++GKWH+G K LP RGFD G
Sbjct: 93 IRAAQPNCLPLENVTLPQKLKNAGYSTHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSG 152
Query: 135 -YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSR 192
Y++ Y S+ D G +A +R Y T+ FT ++V ++ +HN +
Sbjct: 153 DYYSHYKCQGPSMCGYDLYEGEEAAWEQDR------GLYSTEMFTQKAVSILANHNPRKQ 206
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL + + AVH+ LQVP + I NP RR +A
Sbjct: 207 PLFLYLAYQAVHSP-----------LQVP--ARYLERYKGIPNPYRRKYA 243
>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
Length = 593
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 105/188 (55%), Gaps = 7/188 (3%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 80 QGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 138
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 139 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 198
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 199 DYYTHYKC--DSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLY 256
Query: 198 ITHAAVHT 205
I + AVH+
Sbjct: 257 IAYQAVHS 264
>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
Length = 605
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 92 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 150
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 151 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 210
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN ++P+FL
Sbjct: 211 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQKILASHNPTKPIFLY 268
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 269 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 300
>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
Length = 473
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 81/217 (37%), Positives = 118/217 (54%), Gaps = 25/217 (11%)
Query: 35 IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTE 93
I TP++DALA G+VL+ +Y P CTPSR+ LTG+Y G+ + VP+ E
Sbjct: 7 IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDE 66
Query: 94 KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG---YLTYN-----DS 145
KLLPQ LK+ G STH++GKWH+G ++E LP RGFD + GY G Y T+ +
Sbjct: 67 KLLPQLLKDAGSSTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIEC 126
Query: 146 IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
++ T A+ L R+ E A + + Y T+ FT ++ +I +H +PLFL + +VH
Sbjct: 127 LNGTRCALDL---RDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVHD 183
Query: 206 GTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LQVP EE + I + RR++A
Sbjct: 184 P-----------LQVP--EEYMEPYDFIQDKHRRIYA 207
>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 570
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 107/189 (56%), Gaps = 9/189 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H +DI TP +D LA +G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 58 QGFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P + LPQ L+ELGYSTH++GKWH+G K+E LP RGFD + G G
Sbjct: 117 IRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSV 176
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
Y TY+ + G D E A KY T +T + ++ +H+ S+PLF+
Sbjct: 177 NYYTYDSC--DGPGMCGFDLHEG-ESVAWSQKGKYSTHLYTQRVRKILATHDPRSQPLFI 233
Query: 197 QITHAAVHT 205
++ AVHT
Sbjct: 234 FLSFQAVHT 242
>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
Length = 575
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 106/188 (56%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 59 QGFHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 117
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G K+E LP RGFD +G G
Sbjct: 118 IRPRQPNCLPLHQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 177
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y +Y++ + G D E A + KY T + + ++ SHN +P+F+
Sbjct: 178 DYYSYDNC--DGPGVCGFDLHEG-ENVAWDQAGKYSTLLYAQRVNQILASHNPQQPIFIY 234
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 235 VAFQAVHT 242
>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
Length = 564
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 78/223 (34%), Positives = 114/223 (51%), Gaps = 16/223 (7%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TG+Y G+ +
Sbjct: 53 QGFRDVGYHG-SEIRTPTLDRLAAEGVKLENYYVQPMCTPSRSQFITGRYQIHTGLQHSV 111
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 112 IRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMPTQRGFDTFFGSLLGSG 171
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN +P+FL I
Sbjct: 172 DYYTHYKCDSPRMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIA 231
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ AVH+ LQ P M + I+N +RR +A
Sbjct: 232 YQAVHSP-----------LQAPGMYY--ERYRSINNINRRRYA 261
>gi|291225019|ref|XP_002732502.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 197
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/163 (44%), Positives = 96/163 (58%), Gaps = 5/163 (3%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GWNDV +H DI PN+ LA +G++ N+ YT PTCTPSRAA + G YPF+ G +
Sbjct: 37 MGWNDVHWHNP-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMKGLYPFKTGNQHQM 95
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN-GY 139
+ VP+ KLLP+ LKE+GYSTH++GKWH+G K+E P NRGFD+H G W G
Sbjct: 96 VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYQPTNRGFDSHYGLWTLGV 155
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
Y+ + G D R NM P+ S YL +QS+
Sbjct: 156 GNYDKMNGVLSPSAGYDFRDNM-GVVPK-SDDYLALMLGEQSI 196
>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
Length = 527
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 73/189 (38%), Positives = 107/189 (56%), Gaps = 9/189 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H +DI TP +D LA +G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 58 QGFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P + LPQ L+ELGYSTH++GKWH+G K+E LP RGFD + G G
Sbjct: 117 IRPRQPNCLPFDQITLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSV 176
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
Y TY+ + G D E A KY T +T + ++ +H+ S+PLF+
Sbjct: 177 NYYTYDSC--DGPGVCGFDLHEG-ESVAWSQRGKYSTHLYTQRVRKILATHDPQSQPLFI 233
Query: 197 QITHAAVHT 205
++ AVHT
Sbjct: 234 FLSFQAVHT 242
>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
Length = 574
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/188 (38%), Positives = 105/188 (55%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 60 QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 118
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G K+E LP RGFD +G G
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 178
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A S KY T + + ++ SH+ P+F+
Sbjct: 179 DYYTYDNC--DGPGVCGYDLHEG-ENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 236 VAFQAVHT 243
>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
Length = 599
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 82/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SHN +P+FL
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDRAAWDYDNGIYSTQMYTQRVQQILASHNPRKPIFLY 262
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|449671825|ref|XP_002165184.2| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 160
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 64/124 (51%), Positives = 81/124 (65%), Gaps = 10/124 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI--DTP 80
GWND+GFHG +I TPNID LA NG+VL+ +Y LP CTPSR+A +TG+YP G+ DT
Sbjct: 21 GWNDIGFHGSKEISTPNIDRLATNGVVLDNYYVLPICTPSRSAIMTGRYPIHTGMQQDTI 80
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------ 134
G V + EK LPQYLK+ GY T+ +GKWH+G +E P RGFD++ G
Sbjct: 81 YGPN-PYGVSLNEKFLPQYLKQQGYKTYGVGKWHLGFFAKEYTPTYRGFDSYYGSYLGKG 139
Query: 135 -YWN 137
YWN
Sbjct: 140 DYWN 143
>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
Length = 574
Score = 128 bits (321), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/188 (38%), Positives = 105/188 (55%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 60 QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 118
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G K+E LP RGFD +G G
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 178
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A S KY T + + ++ SH+ P+F+
Sbjct: 179 DYYTYDNC--DGPGVCGYDLHEG-ENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 236 VAFQAVHT 243
>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 406
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 118/230 (51%), Gaps = 23/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DV +HG + IPTPNID LA +GI+L +Y P TP+RAA LTG YP G +
Sbjct: 40 GWHDVSYHGSDQIPTPNIDVLAMDGIILFHNYVQPLSTPTRAALLTGLYPIHTGTQRLDI 99
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGK---WHIGCNKEELLPFNRGFDNHVGYWNG 138
G+ + LLPQ L + +G WH+G K+E P RGFD G +NG
Sbjct: 100 GSADPIGLSADFTLLPQLSVTLADNFTSLGARSGWHLGFCKDEFKPTKRGFDTFYGIYNG 159
Query: 139 YLTYNDSIHETDFA------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
DS + T FA V A ++ +R + + +YLT +Q+V +I + ++
Sbjct: 160 -----DSDYWTHFARDNNIDVSGHALKDEKRALVEEAGRYLTSLLANQAVQLIHNRPKNK 214
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
P FL AVH G + G LQ P +E F ++++ DR+LFA
Sbjct: 215 PFFLYFAPTAVHCGGS------NGSLQAP--KEYISKFGYLADYDRQLFA 256
>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
Length = 573
Score = 127 bits (320), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 72/188 (38%), Positives = 105/188 (55%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 60 QGYHDVGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 118
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G K+E LP RGFD +G G
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 178
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A S KY T + + ++ SH+ P+F+
Sbjct: 179 DYYTYDNC--DGPGVCGYDLHEG-EDVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 236 VAFQAVHT 243
>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
Length = 573
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 115/226 (50%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 60 QGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGKYQIHTGLQHSI 118
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G + E +P RGFD G G
Sbjct: 119 IRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDTFFGSLLGSG 178
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN +P+FL I
Sbjct: 179 DYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIA 238
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P + F H I+N +RR +A
Sbjct: 239 YQAVHSP-----------LQAP-----GKYFEHYRSINNINRRRYA 268
>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
Length = 571
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 58 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 117 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 176
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SH+ ++P+FL
Sbjct: 177 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFLY 234
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 235 IAYQAVHSP-----------LQAP-----GRYFEHYRSIVNINRRRYA 266
>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
Length = 597
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 84 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 202
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SH+ ++P+FL I
Sbjct: 203 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFLYIA 262
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIVNINRRRYA 292
>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
Length = 598
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/226 (35%), Positives = 115/226 (50%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SH+ +P+FL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----SRYFEHYRSIVNINRRRYA 294
>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
Length = 575
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 71/188 (37%), Positives = 105/188 (55%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++D+G+HG +DI TP +D LA G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 60 QGYHDIGYHG-SDIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSI 118
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G K+E LP RGFD +G G
Sbjct: 119 IRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNV 178
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D E A S KY T + + ++ SH+ P+F+
Sbjct: 179 DYYTYDNC--DGPDVCGYDLHEG-EDVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 236 VAFQAVHT 243
>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
Length = 573
Score = 127 bits (319), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 70/186 (37%), Positives = 101/186 (54%), Gaps = 3/186 (1%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 60 QGFRDVGYHG-SEIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGKYQIHTGLQHSI 118
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G + E +P RGFD G G
Sbjct: 119 IRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDTFFGSLLGSG 178
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SHN +P+FL I
Sbjct: 179 DYYTHFKCDSPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIA 238
Query: 200 HAAVHT 205
+ AVH+
Sbjct: 239 YQAVHS 244
>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 487
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 77/213 (36%), Positives = 115/213 (53%), Gaps = 20/213 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
G+ DVGFHG DIPTPN+DALA +G+ Y T P C+P+RA LTG+Y R+G + P
Sbjct: 48 GYADVGFHGCKDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQRFGHEFNP 107
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
GA +P+TE + LK++GY+T L+GKWH+G ++ + P RGF+ +G+ G
Sbjct: 108 SGANT--GLPLTEVTIADRLKQVGYTTGLVGKWHLG-SQPAMHPQERGFEEFIGFLGGAH 164
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
++ DA+ + + P + Y TD F ++V I+ H +P FL ++
Sbjct: 165 SF------------FDAQGILRGHEPVKTIDYTTDLFGREAVSFIEKH-RDKPWFLYLSF 211
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
AVHT L + D E RT+A +
Sbjct: 212 NAVHTPMHATEDRMAKLASISDQER--RTYAAM 242
>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
Length = 595
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 115/226 (50%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SH+ +PLFL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPLFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
Length = 600
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 81/226 (35%), Positives = 115/226 (50%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SH+ +PLFL I
Sbjct: 205 DYYTHYKCDSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPLFLYIA 264
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 265 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
Length = 659
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 70/172 (40%), Positives = 100/172 (58%), Gaps = 7/172 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWNDVG+H N I TP++DALA G+ L +YT P CTPSR+ LTG+Y G+ +
Sbjct: 185 GWNDVGYHDSN-IFTPHLDALAAGGVRLENYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 243
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ EKLLP+ L+E GY TH++GKWH+G ++E LP RGFD GY G
Sbjct: 244 WPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMFRKECLPTRRGFDTFFGYLLGSED 303
Query: 139 YLTYNDSIHETDFAVGLDAR--RNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
Y ++ +H V A R+ E A ++ Y T+ FT++++ +I H
Sbjct: 304 YYSHKHCVHIDALNVTRCALDFRDGEDVAEGYNNTYSTNIFTEKAIDLIAKH 355
>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 564
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 68/191 (35%), Positives = 107/191 (56%), Gaps = 9/191 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G +D+G H + I TP+ D LA +G+ L+++Y LP C+P+RA+ L+G+YP G T V
Sbjct: 75 GSHDLGIHENSGIQTPHADQLARDGLYLDQYYVLPYCSPTRASLLSGRYPLHTGCHTIVN 134
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ +P+ E+ LPQ L+ GY H +GKWH+G ++ P RGF + G++ G Y
Sbjct: 135 DWETQGLPLDEETLPQVLRRAGYQAHAVGKWHVGHSRWTQTPTFRGFQSFFGFYLGAQDY 194
Query: 143 NDSIHETD----FAVGLDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKSHNHS--RP 193
N I + + + + DAR R ++ + Y T FT +++ VI++H P
Sbjct: 195 NTHIKQGERGNAYEMHWDARGKCGRDCSRLVDERGNYSTHVFTREAIRVIENHPQRPHEP 254
Query: 194 LFLQITHAAVH 204
LFL + H AVH
Sbjct: 255 LFLYLAHQAVH 265
>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
Length = 1941
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 69/180 (38%), Positives = 101/180 (56%), Gaps = 9/180 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GWND+G++ I TP +D LAY G++ N+ Y P CTP+RAA +TG YPFR G+ V
Sbjct: 54 GWNDIGWNNLQ-IKTPVLDKLAYEGVIFNQTYVQPLCTPTRAALMTGYYPFRIGMQHQMV 112
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ K+LPQ LK+ GY H++GKWH+G E P NRGFD+ G ++ +
Sbjct: 113 LPFQPSGLPLHLKILPQKLKQAGYINHIVGKWHLGYCNWEYTPLNRGFDSFYGSFSNSVN 172
Query: 142 YNDSIHETDFA-----VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
+N+ I + + G D R N Q + LT FT + V +I +H+ P+F+
Sbjct: 173 HNNKISQLPISDHSKYKGYDFRDNTG--VVQNDGQPLTKLFTQRVVDIISNHHKDYPMFM 230
>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
Length = 599
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 87 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 145
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 146 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 205
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SH+ +P+FL
Sbjct: 206 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLY 263
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 264 IAYQAVHSP-----------LQAP-----GRYFEHYRSIVNINRRRYA 295
>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
Length = 599
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 87 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 145
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 146 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 205
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SH+ +P+FL
Sbjct: 206 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFLY 263
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 264 IAYQAVHSP-----------LQAP-----GRYFEHYRSIVNINRRRYA 295
>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
Length = 607
Score = 126 bits (317), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/186 (37%), Positives = 102/186 (54%), Gaps = 3/186 (1%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 94 QGFRDVGYHG-SEIKTPTLDKLAAQGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 152
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 153 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 212
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SH+ +P+FL I
Sbjct: 213 DYYTHYKCDSPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHDPRKPIFLYIA 272
Query: 200 HAAVHT 205
+ AVH+
Sbjct: 273 YQAVHS 278
>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
purpuratus]
Length = 784
Score = 126 bits (317), Expect = 7e-27, Method: Composition-based stats.
Identities = 74/224 (33%), Positives = 115/224 (51%), Gaps = 25/224 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVG+HG ++I TPNID LA G+ L +Y P CTP+R+ L+G+Y G+ + +
Sbjct: 38 GYNDVGYHG-SEIYTPNIDKLAREGVRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSYI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P L+E GY+TH +GKWH+G K+E LP RGFD++ GY G
Sbjct: 97 RPAQPLCLPTNLPTFADKLREAGYATHAVGKWHLGFYKKECLPTQRGFDSYFGYLTGGED 156
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y T+ H + L R++++ A + Y FT++ ++ H +P L +
Sbjct: 157 YWTH----HRKRPXLAL---RHVDKVAWEYGGYYSAFVFTEKIQQIVAQHPVEQPFLLYL 209
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH+ LQVP E + +I N +RR++A
Sbjct: 210 PFQSVHSP-----------LQVPSSYE--ERYKNIKNTNRRIYA 240
>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
Length = 597
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 84 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G +++ +P RGFD G G
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ SH+ ++PLFL +
Sbjct: 203 DYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPLFLYVA 262
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292
>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
Length = 598
Score = 126 bits (316), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 84 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 202
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SH+ +P+FL
Sbjct: 203 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLY 260
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 261 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292
>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 506
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 117/222 (52%), Gaps = 23/222 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+H D+ P ++ LA +G++ N+ Y P CTPSR+A TG YPF+ +
Sbjct: 34 GWNDVGWHNP-DLKMPILNQLAADGVIFNQSYVQPACTPSRSALFTGYYPFKIKRQHQML 92
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A + + K LP+ LK++GY THL+GKWH+G KEE LP RGFD+ + G+LT
Sbjct: 93 LNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLGFCKEEYLPNKRGFDS----FYGWLT 148
Query: 142 YNDSIH--ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
++ E A G D R N Q S YL +++V ++ H PLFL+ +
Sbjct: 149 LGTDLYTKENVLAPGYDFRDNTG--VVQESDTYLPFMLAERAVDIVMGHYKEYPLFLEFS 206
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
A LP L+VP ++ + ++ I + R F
Sbjct: 207 MA-----------LPGKFLEVP--QDYEDLYSDIDDDRTRKF 235
>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
Length = 357
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 122/221 (55%), Gaps = 25/221 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG--IDTP 80
GW+DV ++ N + PN+ LA G++ N+ Y+ CTPSR A LTGK+P+R G +
Sbjct: 19 GWSDVSWNNPN-VVMPNLHTLATTGVIFNQTYSQRLCTPSRTALLTGKFPYRLGMQVQKS 77
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ + +P+ E+LLPQ LK+LGY+TH++GKWH+G K E P RGFD+ G+ +G
Sbjct: 78 MFEKNSHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGFDSFYGFHHGGE 137
Query: 141 TYNDSIHET--DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y + E DF D R ++ + Y T+ F ++ ++I H+ + PLFL +
Sbjct: 138 DYYTHMSERGLDF---WDGRTSVS----DRNGVYSTESFARRAENIISQHDPNTPLFLYL 190
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
+VHT ++P+ LQ TF+ I + +R+
Sbjct: 191 PFQSVHT----PHQVPSSYLQ---------TFSTIQDDNRK 218
>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
Length = 600
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 81/228 (35%), Positives = 117/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 86 QGFRDVGYHG-SEIKTPTLDRLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G ++E +P RGFD G G
Sbjct: 145 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSG 204
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ SH+ +P+FL
Sbjct: 205 DYYTHYKC--DSPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLY 262
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
I + AVH+ LQ P R F H I N +RR +A
Sbjct: 263 IAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 294
>gi|298706912|emb|CBJ29739.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 781
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 113/228 (49%), Gaps = 43/228 (18%)
Query: 23 GWNDVGFHGENDIP--TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-T 79
G+ D+G+ D+ TPN+DALA G+ L+ +YT+ CTP+RA+ +TG+YP RYG+ +
Sbjct: 228 GFGDMGYQ-STDLSEITPNLDALAAGGVKLSNYYTMTLCTPARASIMTGRYPVRYGMQYS 286
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
+ G +P +EK+LP+Y+ E GY +H++GKWH+G ++E LP RGF +GY NG
Sbjct: 287 VIMPGSPWGLPTSEKILPEYMNEAGYESHMVGKWHLGSYRDESLPSQRGFKTFLGYLNGI 346
Query: 140 LTYN---------DSIHETDFAVG----------------------------LDARRNME 162
TY D + DF G D N +
Sbjct: 347 ETYYSHKNPEASVDGQYFFDFGYGNATGYHDVTLQNHDENVGGPCTDGGPRWGDVMENED 406
Query: 163 RYAPQMSSKYLTDFFTDQSVHVIKSHN--HSRPLFLQITHAAVHTGTA 208
+ Y TD F ++ ++KS PLF+ I H +VH+ T
Sbjct: 407 PADVCFTGTYSTDAFVGRAKQIVKSKAPFDEDPLFMYIAHQSVHSPTG 454
>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 522
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 85/250 (34%), Positives = 121/250 (48%), Gaps = 43/250 (17%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DVG+HG + + TP IDALA G+ L +Y CTPSR+ LTG+Y G+ +
Sbjct: 39 GWDDVGYHG-SVMKTPYIDALAAEGVTLENYYMPSLCTPSRSVLLTGRYEIHTGLQHGTI 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ E LPQ LKE GY TH++GKWH+G ++E LP NRGFD+ +G++
Sbjct: 98 LMMQPLCLPLDEITLPQKLKEEGYDTHMVGKWHLGFYRKECLPNNRGFDSFLGFYQAMGD 157
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH--------------- 183
+ +N S F G D RRN + A Q + KY T F ++
Sbjct: 158 HFYHNISASPGHFN-GFDFRRNNDVVADQYAGKYSTHIFXXXFINTQTLSFVCVNNVKGV 216
Query: 184 -----------VIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH 232
+ S+N +PLFL ++ AVHT LQVP
Sbjct: 217 YFRGYLSSFTPITSSYNPQQPLFLYLSFQAVHTP-----------LQVPSRYAELYNDLI 265
Query: 233 ISNPDRRLFA 242
++ DRR++A
Sbjct: 266 PNDEDRRIYA 275
>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 498
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 74/219 (33%), Positives = 119/219 (54%), Gaps = 20/219 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWND+G++ + I TP +D LA G++LN+ Y LP CTP RA+ ++G Y +R G+ V
Sbjct: 44 GWNDIGYNNPS-IFTPTLDKLAREGVILNQSYVLPMCTPDRASLMSGYYAYRVGLQHKVL 102
Query: 83 AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ L+PQ +KE GY+T+++GKWH+G K E P RGFD+ G++N
Sbjct: 103 DHAEPAGLPLNFTLIPQRMKEHGYTTYMLGKWHLGFCKWEYTPTYRGFDHFYGFYNAAED 162
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + H T + L RN + + Y T + +++ I +HN S P+++ +
Sbjct: 163 YFN--HTTSKYLDL---RNGKEVDWSKNGTYSTYMYAEKATEYIATHNKSTPMYMYLPFQ 217
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRL 240
+VH G+++ P + TF H +N RR+
Sbjct: 218 SVH-----------GVIEAPQKYLDMYTFIHDTN--RRI 243
>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
Length = 572
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 58 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 116
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G +++ +P RGFD G G
Sbjct: 117 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 176
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ +H+ ++PLFL
Sbjct: 177 DYYTHYKC--DSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLY 234
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ + AVH+ LQ P R F H I N +RR +A
Sbjct: 235 VAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 266
>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
Length = 614
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 84 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G +++ +P RGFD G G
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ +H+ ++PLFL +
Sbjct: 203 DYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVA 262
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292
>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
Length = 598
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 79/228 (34%), Positives = 118/228 (51%), Gaps = 26/228 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 84 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ LKE+GYSTH++GKWH+G +++ +P RGFD G G
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y T+ ++ G D N + Y T +T + ++ +H+ ++PLFL
Sbjct: 203 DYYTHYKC--DSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLY 260
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ + AVH+ LQ P R F H I N +RR +A
Sbjct: 261 VAYQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292
>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
Length = 555
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 84 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G +++ +P RGFD G G
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ +H+ ++PLFL +
Sbjct: 203 DYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVA 262
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292
>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
Length = 570
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 116/226 (51%), Gaps = 22/226 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P CTPSR+ F+TGKY G+ +
Sbjct: 84 QGFRDVGYHG-SEIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSI 142
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LKE+GYSTH++GKWH+G +++ +P RGFD G G
Sbjct: 143 IRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSG 202
Query: 141 TYNDSIH-ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ G D N + Y T +T + ++ +H+ ++PLFL +
Sbjct: 203 DYYTHYKCDSPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVA 262
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH---ISNPDRRLFA 242
+ AVH+ LQ P R F H I N +RR +A
Sbjct: 263 YQAVHSP-----------LQAP-----GRYFEHYRSIININRRRYA 292
>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 562
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 108/188 (57%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+HG ++I TP +D LA G+ L +Y P C+PSR+ +TG+Y G+ +
Sbjct: 40 QGYNDIGYHG-SEIQTPVLDQLAGEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSI 98
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ A +P LP+ L+E GYSTH++GKWH+G E LP +RGF + +G G
Sbjct: 99 IRARQPLCLPPDTPTLPERLQEAGYSTHMVGKWHLGFCHPECLPTSRGFQSFLGSLTGSG 158
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ ++ + A G D + +R A ++ Y T +T++ +++ H+H +PLFL
Sbjct: 159 DHFSFQSC--DGTEACGFDL-HDGDRPAWELRGNYSTRLYTERVKDILRRHDHRKPLFLY 215
Query: 198 ITHAAVHT 205
+ AVHT
Sbjct: 216 VALQAVHT 223
>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
Length = 476
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/226 (32%), Positives = 117/226 (51%), Gaps = 25/226 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+D+G+HG I TP +D LAYNGI L +Y P C+P+R+ F++G Y G+ +
Sbjct: 19 GWHDIGYHGSK-IRTPVLDDLAYNGIRLENYYVQPICSPTRSQFMSGVYQIHTGLQHNVI 77
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ + ++E GY+TH+ GKWH+G KEE LP NRGFD + GY NG
Sbjct: 78 WPAQANGLPLEFPTIADKMREAGYATHMAGKWHLGYYKEEYLPHNRGFDTYYGYLNGCED 137
Query: 142 YNDSIH-----ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y D + DF + D + N+ Y+ + + + + + ++ +PLFL
Sbjct: 138 YYDKSYCHPYCGYDFRLNDDIQWNLTDYSTYLYVSRVNEILLNHKI-----YSPDKPLFL 192
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +VH L+VP +E ++HI + +RR +A
Sbjct: 193 YLPLQSVHEP-----------LEVP--KEYSDKYSHIKDNNRRTYA 225
>gi|254515652|ref|ZP_05127712.1| arylsulfatase B [gamma proteobacterium NOR5-3]
gi|219675374|gb|EED31740.1| arylsulfatase B [gamma proteobacterium NOR5-3]
Length = 507
Score = 124 bits (312), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/220 (34%), Positives = 116/220 (52%), Gaps = 21/220 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+HG +DI TP+ID LA G+ L+R Y C+P+RAA L+G+ GI +P+
Sbjct: 52 GWNDVGYHG-SDIHTPHIDQLAAEGLELDRFYAQTACSPTRAALLSGQSSQSLGIYSPLS 110
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ + +K++P Y ++ GY T ++GKWH+G + E P RGFD+ G G + Y
Sbjct: 111 KLNPTGLALDQKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLARGFDHFYGNLTGGVGY 170
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
+ +H GLD +RN + + Y T + + +I+ + +PLFL A
Sbjct: 171 WNHVH----GGGLDWQRNGKTLRQE---GYSTHLQSAEITRLIQQRDPEKPLFLYAAFNA 223
Query: 203 VHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H LP + P + +AHI NP+RR+ A
Sbjct: 224 PH--------LPN---EAP--ADTLARYAHIENPNRRIHA 250
>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
Length = 1542
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/187 (37%), Positives = 101/187 (54%), Gaps = 5/187 (2%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P C+PSR+ +TG+Y G+ +
Sbjct: 1037 QGFRDVGYHG-SEIKTPTLDRLAAAGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSI 1095
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ LPQ LK GYSTH++GKWH+G K +P RGFD G G
Sbjct: 1096 IRPTQPNCLPLENITLPQKLKNAGYSTHMVGKWHLGFYKRACMPTQRGFDTFFGSLLGSG 1155
Query: 141 TYNDSIHETDF--AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y S ++ D G D E Q Y T +T ++V+++ SHN RP+FL +
Sbjct: 1156 DYY-SHYKCDSPGLCGYDLHEGEEAAWEQDRGVYSTIMYTQKAVNILASHNPKRPIFLYL 1214
Query: 199 THAAVHT 205
AVH+
Sbjct: 1215 AFQAVHS 1221
>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 463
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 78/189 (41%), Positives = 109/189 (57%), Gaps = 12/189 (6%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDT- 79
QGW DVGF+G DIPTPN+D LA GIV + Y + P C+PSRA LTG+Y R+G D
Sbjct: 36 QGWADVGFNGATDIPTPNLDRLASEGIVFDNAYVSHPYCSPSRAGLLTGRYQARFGHDCN 95
Query: 80 -PVGAGVAKAV--PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
P + V P++EK++P+ LKE GY T IGKWH+G + L P ++GFD+ G+
Sbjct: 96 MPYDSENDDTVGTPLSEKMIPEALKEHGYRTSAIGKWHLG-DHPSLHPIHQGFDHWFGFA 154
Query: 137 NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
G + Y I + + RN E PQ +YLTD FTD+++ I + +P F+
Sbjct: 155 GGGMNYW-GIPDGPIKTIV---RNGEP-VPQNELRYLTDDFTDEAIDFITKKD-DKPFFM 208
Query: 197 QITHAAVHT 205
+ + A H
Sbjct: 209 YLAYNAPHA 217
>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 486
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 80/223 (35%), Positives = 111/223 (49%), Gaps = 25/223 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
GW DVGF+G +I TPNIDALA G ++ Y CTP+RA +TG++P+RYG+ T
Sbjct: 40 GWKDVGFNGCTEIKTPNIDALAKGGAKFSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVI 99
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
P AG + +E L+PQ L + GY T +IGKWH+G ++ P RGFD G G
Sbjct: 100 PTAAGY--GLDTSEYLMPQCLGDAGYKTAIIGKWHLGHADQKYWPKQRGFDYQYGAMIGE 157
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
L Y + V LD R+ + P Y T D +V I + +P +L +T
Sbjct: 158 LDY---FTHDEHGV-LDWFRDNK---PVHEQGYTTTLIGDDAVKYIHGQDGKKPFYLYLT 210
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A HT Q P +E + +I+ P RR +A
Sbjct: 211 FNAPHTP-----------YQAP--KEYITKYLNIAEPTRRTYA 240
>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 496
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 68/183 (37%), Positives = 98/183 (53%), Gaps = 5/183 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+H I TP ID LA +G+ LN +Y C PSR ++G++ G+
Sbjct: 36 GWNDVGYHNSY-IKTPTIDMLAKSGVRLNNYYVASHCVPSRNMLISGRHVIDIGLQHGEI 94
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ +P+ E + LKE+GY+THLIGKWH GC LP NRGFD GY +
Sbjct: 95 GYYPRGLPLDEFTIADKLKEIGYATHLIGKWHCGCYSNHSLPHNRGFDTFFGYLG---SS 151
Query: 143 NDSIHETDFAVGL-DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
+D + GL D R N E + Y T + +++ ++I H+ ++PLFL + +
Sbjct: 152 DDHYTHIIMSNGLADLRLNDECVGYKYFGDYSTIMYANEAKNIIAQHDENKPLFLMLAFS 211
Query: 202 AVH 204
AVH
Sbjct: 212 AVH 214
>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 539
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 73/206 (35%), Positives = 111/206 (53%), Gaps = 22/206 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G+ DVG+ + + TPNID LA G+ L RHY P+C PSR+ + G+Y G +
Sbjct: 72 GYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYAQPSCMPSRSCLMMGRYQIHTGFNYKCT 131
Query: 82 -GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G+G + +P LKE GY+TH++GKWH+G + E LP +GFD GY
Sbjct: 132 DGSGSQLCMHPDTITIPMKLKENGYATHMVGKWHLGNIRWECLPNAKGFDTFFGYHGASE 191
Query: 141 TYNDSIHETDFA-VGLDAR---RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y T F+ G + R RN + A + +Y T FT++++++I++H+ S+P+FL
Sbjct: 192 DY-----YTHFSPAGRECRDLWRNRDDVAQEYYGQYSTHIFTNEALNIIENHDVSKPMFL 246
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPD 222
+ + AVH G LQVP+
Sbjct: 247 YLPYQAVH-----------GPLQVPE 261
>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
Length = 498
Score = 124 bits (310), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/185 (40%), Positives = 105/185 (56%), Gaps = 11/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G+ DVG+HG +D+ TPN+D LA G L + YT P CTP+RAA +TG YP RYG+ T V
Sbjct: 64 GYADVGYHG-SDVKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGCYPMRYGLQTGVI 122
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+G + E LLPQ LKE GY T L+GKWH+G ++ P RGFD G G +
Sbjct: 123 PSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGFDYFYGPLVGEID 182
Query: 142 YNDSIHETDFAVGL-DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
+ HE A G+ D R+ E Y T+ F ++ +I+ H+ + PL++ ++
Sbjct: 183 HFK--HE---AHGITDWYRDNEMVK---EPGYDTELFGADAIRLIEEHDSATPLYMYLSF 234
Query: 201 AAVHT 205
A HT
Sbjct: 235 TAPHT 239
>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
Length = 503
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 122/230 (53%), Gaps = 28/230 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G++DVG+HG + I TPNID LA+ G+ L +Y P CTP+R+ L+G+Y G+ + +
Sbjct: 42 GYHDVGYHG-SAIRTPNIDRLAFEGVRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSII 100
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A A+P L L+E GY+ H++GKWH+G KEE +P NRGFD+ GY G
Sbjct: 101 WAAQPNALPKELPTLADKLREEGYANHIVGKWHLGFYKEEYVPTNRGFDSFYGYLTGSEF 160
Query: 142 YND------SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS---R 192
Y + I+ +D GLD R N S+Y T + +++ ++ H + +
Sbjct: 161 YYNKTYCLAQINRSD-VCGLDFRENDRSIN---ESEYSTHLYAERTKQLVADHTSAHPDQ 216
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL + +VH G L+VP + + HI + +R+++A
Sbjct: 217 PLFLYLALQSVH-----------GPLEVP--AQYRTPYKHIKDENRQIYA 253
>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 504
Score = 123 bits (309), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 78/221 (35%), Positives = 113/221 (51%), Gaps = 25/221 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+H I TP ID LA NG+ LN +Y C PSR ++G++ V
Sbjct: 39 GWNDVGYHNLY-IKTPTIDRLANNGVKLNNYYAANLCVPSRNMLMSGRHVH------GVI 91
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT- 141
G + +P+ E + LKE GYSTHL+GKW+ G +E LP NRGFD G+ +
Sbjct: 92 MGYPRGLPLNETTIANKLKEAGYSTHLVGKWNCGFYSKEFLPHNRGFDTFFGFVDSKEDH 151
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y +H+ D RRN A + Y T + ++ +I +H+ ++PLFL ++ +
Sbjct: 152 YTHMVHDIS-----DLRRNDLCVADKYYGNYSTIMYGNEGTTIIDNHDTNKPLFLFMSFS 206
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH LQVP + E + I + DRR++A
Sbjct: 207 AVHEP-----------LQVPSVYEKEY-IPTIDDTDRRIYA 235
>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 514
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 122/230 (53%), Gaps = 14/230 (6%)
Query: 9 VAKAVPVTEKLLPQGWNDVGFHGEND-IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFL 67
+A + + GWNDVG+H ND +PTPN++ LA G++L+ Y+ P CTPSR A +
Sbjct: 36 IAMICIIILTISASGWNDVGWH--NDFMPTPNLNTLAREGVILDNMYSQPICTPSRVALM 93
Query: 68 TGKYPFRYGIDTPVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
TGKYP + G+ V + +P L + LKE GY+ H++GKWH+G + P
Sbjct: 94 TGKYPAKVGMQHFVVLPMRPYYLPGNYATLAEKLKEQGYTNHIVGKWHLGSCDWKYTPMW 153
Query: 127 RGFDNHVGYWNGYLTYNDSIHETDF--AVGLDAR--RNMERYAPQMSSKYLTDFFTDQSV 182
RGFD+H G G +T N H + VG+ R R+ + + T F++++
Sbjct: 154 RGFDSHYGCHEG-VTSNFETHMLTWPPVVGVSGRDLRDNTGLVTHENGTHNTMLFSERAE 212
Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEND-RTFA 231
++K+HN PLFL + + A H + P G + +++ D RTFA
Sbjct: 213 RIVKNHNPESPLFLYVPYMAPHFP----LQAPQGFEEAVQLDDTDRRTFA 258
>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
sphaeroides KD131]
gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
KD131]
Length = 509
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 74/185 (40%), Positives = 105/185 (56%), Gaps = 11/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G+ DVG+HG +D+ TPN+D LA G L + YT P CTP+RAA +TG+YP RYG+ T V
Sbjct: 75 GYADVGYHG-SDVKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGRYPMRYGLQTGVI 133
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+G + E LLPQ LKE GY T L+GKWH+G ++ P RG D G G +
Sbjct: 134 PSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGVDYFYGPLVGEID 193
Query: 142 YNDSIHETDFAVGL-DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
+ HE A G+ D R+ E Y T+ F ++ +I+ H+ + PL++ ++
Sbjct: 194 HFK--HE---AHGITDWYRDNEMVK---EPGYDTELFGADAIRLIEEHDSATPLYMYLSF 245
Query: 201 AAVHT 205
A HT
Sbjct: 246 TAPHT 250
>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 104/202 (51%), Gaps = 16/202 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ DVG+ + + TPNID LA G+ L RHY P+C PSR+ + G+Y G D
Sbjct: 37 GYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYAQPSCMPSRSCLMMGRYQIHTGFDYRCK 96
Query: 83 AGVAKAVPV--TEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G + + +P LKE GY+TH+IGKWH+G + E LP +GFD GY +
Sbjct: 97 DGKRSQLCMHPDTITMPMKLKENGYATHMIGKWHLGNIRWECLPNAKGFDTFFGYLSAIE 156
Query: 141 TYNDSIHETDFAVGL-DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y H T D RN + A +Y T FT ++ +IK+H+ ++P+F+ ++
Sbjct: 157 DY--FTHYTPAGANCHDFWRNHDEVADDYKGQYSTHLFTKEAQDIIKNHDINQPMFMYLS 214
Query: 200 HAAVHTGTAGNAKLPTGLLQVP 221
+ AVH G LQVP
Sbjct: 215 YQAVH-----------GPLQVP 225
>gi|313215020|emb|CBY41206.1| unnamed protein product [Oikopleura dioica]
Length = 427
Score = 123 bits (308), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 104/188 (55%), Gaps = 8/188 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G +DV HG DI TPN+D LA +G++LN +Y P C+P+R + +TG+YP+R G+
Sbjct: 31 GKHDVSMHGA-DIYTPNLDMLARDGVLLNNYYVQPVCSPTRGSLMTGRYPYRLGLQHENL 89
Query: 83 AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
G A +P+ E ++PQY+KE GY T+++GKW +G K+ LP+ RGFD G G
Sbjct: 90 VGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYLPWKRGFDEFFGQLLGGQD 149
Query: 139 YLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y + + ++ G D R + S KY + D++ +HN + PL++
Sbjct: 150 YYSRRKCLKLRNYGNLCGYDLRTE-QGPVRDTSMKYQPFLYADKAREKFFAHNKTDPLYM 208
Query: 197 QITHAAVH 204
+ +VH
Sbjct: 209 YVAFQSVH 216
>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
Length = 514
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 112/233 (48%), Gaps = 18/233 (7%)
Query: 14 PVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPF 73
PV GWNDV +H E + +P ++ LA G++L++HY LPTCTP+RAA +TG+YP+
Sbjct: 27 PVVPARFKPGWNDVSWHNER-MESPILEQLAKEGVILDQHYALPTCTPTRAALMTGRYPY 85
Query: 74 RYGIDT-PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
+ GI + + +P+ L + LK GY+TH GKWH+G + L P RGFD
Sbjct: 86 KLGIQSHGIRTLEPNGLPLGVTTLAEELKRTGYTTHAFGKWHLGYCNQSLTPTRRGFDTF 145
Query: 133 VGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN 189
G++ G Y ++ S +T RN + Y T + + I+
Sbjct: 146 RGFYVGGQDYFSHTLSGGKTSATAKGYDYRNGDEVDYSAKGVYTTTLIANHVLSAIEESQ 205
Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P+FL + AVH LQVP + + + NP R+L
Sbjct: 206 PDKPMFLYVAFQAVHAP-----------LQVP--TQYRKMCSIYRNPKRKLLC 245
>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 558
Score = 122 bits (307), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 70/189 (37%), Positives = 106/189 (56%), Gaps = 9/189 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H DI TP +D LA G+ L +Y P CTPSR+ F+TG+Y G+ +
Sbjct: 48 QGFNDIGYH-NTDIHTPTLDRLAAAGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSI 106
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ + +P + LPQ L+E GY+TH++GKWH+G K + LP RGF+ + G G
Sbjct: 107 IRSRQPSCLPFGLRTLPQRLQEAGYATHMVGKWHLGFYKRDCLPTRRGFNTYFGSLTGSV 166
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
Y TY + G D + ER A +Y T +T + ++ +H+ S+PLF+
Sbjct: 167 DYYTYKSC--DGPKVCGFDL-HDGERVAWGQGGRYSTHLYTQRVRKILAAHDPSSQPLFI 223
Query: 197 QITHAAVHT 205
++ AVHT
Sbjct: 224 FLSFQAVHT 232
>gi|313236221|emb|CBY11544.1| unnamed protein product [Oikopleura dioica]
Length = 511
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 104/188 (55%), Gaps = 8/188 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G +DV HG DI TPN+D LA +G++LN +Y P C+P+R + +TG+YP+R G+
Sbjct: 31 GKHDVSMHGA-DIYTPNLDMLARDGVLLNNYYVQPVCSPTRGSLMTGRYPYRLGLQHENL 89
Query: 83 AGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
G A +P+ E ++PQY+KE GY T+++GKW +G K+ LP+ RGFD G G
Sbjct: 90 VGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYLPWKRGFDEFFGQLLGGQD 149
Query: 139 YLTYNDSIHETDFA--VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y + + ++ G D R + S KY + D++ +HN + PL++
Sbjct: 150 YYSRRKCLKLRNYGNLCGYDLRTE-QGPVRDTSMKYQPFLYADKAREKFFAHNKTDPLYM 208
Query: 197 QITHAAVH 204
+ +VH
Sbjct: 209 YVAFQSVH 216
>gi|406832341|ref|ZP_11091935.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 490
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/216 (36%), Positives = 111/216 (51%), Gaps = 9/216 (4%)
Query: 26 DVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGAGV 85
D+G+ G + I TP+IDALA G+ L +Y LP CTP+RAA +TG+YP R G+ T V
Sbjct: 40 DLGYRG-SKIKTPHIDALAKGGVRLESYYGLPLCTPARAALMTGRYPMRQGLQTLVIFPS 98
Query: 86 AK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYND 144
+ +P EK LPQ LKE+GY T ++GKWH+G ++ P NRGFD+ G G + Y
Sbjct: 99 HRYGLPTDEKTLPQALKEVGYHTAMVGKWHLGHADKKFWPQNRGFDHFYGNVVGEVDY-- 156
Query: 145 SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
+ +D +RN E Y D ++V +I H+ ++PLFL A H
Sbjct: 157 --FTRERGGVVDWQRNGEFL---REDGYYVDLIGTEAVKLIAGHDKAKPLFLYFASLAPH 211
Query: 205 TGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRL 240
+ D E + ISN DR++
Sbjct: 212 APYQAPKADIDAYNDIFDNEMHRTYAGMISNLDRQV 247
>gi|313216787|emb|CBY38029.1| unnamed protein product [Oikopleura dioica]
Length = 383
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/198 (34%), Positives = 110/198 (55%), Gaps = 18/198 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G+H + +I TP +D LA G+ L ++Y P CTP+R +TG+Y RYG+
Sbjct: 187 GYHDIGYH-QAEILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRYQIRYGMQ---- 241
Query: 83 AGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
GV + VP+ EKLLP+ L++ GY+T +IGKWH+G E+ LP NRGFD+ +G++
Sbjct: 242 HGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFDHFMGFYT 301
Query: 138 G---YLTYNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
G + ++N DF + + R+ +++ Y T F D+ + N S
Sbjct: 302 GSQDFYSHNKCFSGMCGYDFREATAGQPEVIRW--DLNNTYSTGVFADELEKRLSKMNPS 359
Query: 192 RPLFLQITHAAVHTGTAG 209
P F ++ AVH+ G
Sbjct: 360 EPSFTYLSFQAVHSPLQG 377
>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
carolinensis]
Length = 580
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 104/188 (55%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++D G+HG +DI TP +D LA G+ L +Y P CT SR+ +TG+Y G+ +
Sbjct: 68 QGFHDXGYHG-SDIXTPTLDRLAAEGVKLENYYIRPICTLSRSQLITGRYQIHTGLQHSI 126
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P + LPQ L+E GYSTH++GKWH+G ++E LP RGFD +G G
Sbjct: 127 IRPQQPNCLPFNQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNV 186
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y TY++ + G D + E A + S KY T + + ++ +HN P+F+
Sbjct: 187 DYYTYDNC--DGPGVCGYDL-HDGENVAWEQSGKYSTFLYAQRVNKILAAHNPKEPIFIY 243
Query: 198 ITHAAVHT 205
I AVHT
Sbjct: 244 IAFQAVHT 251
>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 505
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 101/188 (53%), Gaps = 11/188 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ DVG+ + + TPNID LA G+ L R+Y +C PSR+ + G+Y G D
Sbjct: 37 GYFDVGYREGSIVKTPNIDKLAAEGVKLERYYAQSSCMPSRSCLMMGRYQIHTGFDYRCL 96
Query: 83 AGVAKAVPVTEKL--LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G + + LP L++ GY+TH+IGKWH+G ++E +P ++GFD GY
Sbjct: 97 DGQLTRLCMAPDTVTLPMKLRQYGYATHMIGKWHLGHERKECVPTHKGFDTFFGYHGAAE 156
Query: 141 TYNDSIHETDFAVGL----DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y T A+G D RNME A +Y T F+T ++ +IK+H+ +P F+
Sbjct: 157 NYY-----THTALGRPRCHDLWRNMENVAEDYDGQYSTLFYTKEAQDIIKNHDKKKPFFM 211
Query: 197 QITHAAVH 204
+++ AVH
Sbjct: 212 YLSYQAVH 219
>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
Length = 571
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/192 (35%), Positives = 103/192 (53%), Gaps = 12/192 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
G+NDV +H + TPN+ +A NG++L Y+ CTPSRA+++TG YPFR G+ ++ V
Sbjct: 43 GYNDVSWHNPQ-VLTPNLGKMAKNGVILTESYSQAACTPSRASYMTGYYPFRIGVQNSVV 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G+ VP+ LP+ LKE GY +HL+GKWH+G + ++ P RGFD +G NGY
Sbjct: 102 REGMEDYVPLDVDFLPKRLKEAGYVSHLVGKWHLGHCRRDVTPPGRGFDTFLGLLNGYND 161
Query: 142 -YNDSIHETDFAVGLDARRNMERY--------APQMSSKYLTDFFTDQSVHVIKSHNHSR 192
Y I D Y P + Y TD FT++++ +I+ +
Sbjct: 162 YYTKKIRAIASHEDFDPNAPGTIYDFFSNYTLQPSPETDYTTDIFTNRAIELIQQSKDT- 220
Query: 193 PLFLQITHAAVH 204
P FL + + A H
Sbjct: 221 PFFLALHYTAPH 232
>gi|149198444|ref|ZP_01875489.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149138450|gb|EDM26858.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 458
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 121/233 (51%), Gaps = 26/233 (11%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ DVGF+G DIPTP++D++A NG+ ++ H + P C PSRA LTG+Y R+G T
Sbjct: 30 QGYQDVGFNGCKDIPTPHLDSIAQNGVNCIDAHVSYPVCGPSRAGLLTGRYQDRFGFTTN 89
Query: 81 VGAGVAKAV---PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
+ P+ EK + + LKE+GYS+ +IGKWH+G + P NRGFD+ G+ +
Sbjct: 90 PTVNPENPIAGLPLEEKNIAEVLKEVGYSSSIIGKWHMGTHPIH-HPLNRGFDHFFGFLS 148
Query: 138 GYLTYNDSIHE----TDFAVGLDARRN---MERYAPQMSSKYLTDFFTDQSVHVI-KSHN 189
G Y + + ++ D R +R Q+S YLTD TD +V I K +
Sbjct: 149 GGHDYFPAKYNLKDLSEVKRIWDWYRTHLIRDRERIQVSEGYLTDILTDAAVDFIDKKAS 208
Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P L +++ A HT + E+ + F HI + RR +A
Sbjct: 209 EKKPFMLYLSYNAPHTPLQAS-------------EKYLKRFTHIKDSKRRTYA 248
>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
Length = 470
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 79/222 (35%), Positives = 114/222 (51%), Gaps = 19/222 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW+D+G+H I TPN+D LA G+ L +Y P C+PSR +TG+Y YG+ V
Sbjct: 12 GWDDIGYHNHF-IHTPNLDRLASEGVKLENYYVQPVCSPSREQLMTGRYQIHYGLQHGVI 70
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ E LPQ LK+ GY T+++GKWH+G K+E P RGFD G+ G
Sbjct: 71 RNDRPHGLPLDEVTLPQRLKDNGYRTYMVGKWHLGFCKKEYTPLYRGFDKFYGFLTGSED 130
Query: 142 YNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y H V GLD R E + + Y T F ++ +I H+ ++P+FL +
Sbjct: 131 Y--WTHRRYKGVRGLDLRDQDEPVLDE-NGTYSTHLFARKATDMILKHDQNQPMFLYLPF 187
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH G LQVP E+ + + HI+ R++A
Sbjct: 188 QAVH-----------GPLQVP--EKYLQEYMHINFTVDRIYA 216
>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 500
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 119/228 (52%), Gaps = 23/228 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+NDVG+HG ++I TPNID LA G+ L +Y P CTP+R+ L+G+Y G+ + +
Sbjct: 38 GYNDVGYHG-SEIYTPNIDKLAREGVRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSYI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P L+E GY+TH +GKWH+G K+E LP RGFD++ GY G
Sbjct: 97 RPAQPLCLPTNLPTFADKLREAGYATHAVGKWHLGFYKKECLPTQRGFDSYFGYLTGGED 156
Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y T++ + + ++ +G+D N E+ A + Y FT++ ++ H +P
Sbjct: 157 YWTHHRAGDGLLPNSNHWLGMDLWDN-EKVAWEYVGNYSAFVFTEKIQQIVAQHPVEQPF 215
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + +VH+ LQVP E + +I N +RR++A
Sbjct: 216 LLYLPFQSVHSP-----------LQVPSSYE--ERYKNIKNTNRRIYA 250
>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
Length = 609
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/194 (35%), Positives = 109/194 (56%), Gaps = 18/194 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G+H + +I TP +D LA G+ L ++Y P CTP+R +TG+Y RYG+
Sbjct: 109 GYHDIGYH-QAEILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRYQIRYGMQ---- 163
Query: 83 AGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
GV + VP+ EKLLP+ L++ GY+T +IGKWH+G E+ LP NRGFD+ +G++
Sbjct: 164 HGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFDHFMGFYT 223
Query: 138 G---YLTYNDSIHET---DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
G + ++N DF + + R+ +++ Y T F D+ + N S
Sbjct: 224 GSQDFYSHNKCFSGMCGYDFREATAGQPEVIRW--DLNNTYSTGVFADELEKRLSKMNPS 281
Query: 192 RPLFLQITHAAVHT 205
P F ++ AVH+
Sbjct: 282 EPSFTYLSFQAVHS 295
>gi|196229618|ref|ZP_03128482.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225944|gb|EDY20450.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 490
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 118/231 (51%), Gaps = 24/231 (10%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ D F G DI TPN+DALA +G+ R Y T P C+PSRA +TG+Y R+G
Sbjct: 49 QGYADASFQGSKDILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMTGRYQERFGHHNN 108
Query: 81 VGAGVAKAV---PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
+ A A + P E LLPQ L + GY T ++GKWH+G ++ P+ RGFD G
Sbjct: 109 IVAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGL-QDGCRPYERGFDEFFGIIT 167
Query: 138 GYLTYNDSIHETDFAVGLDA-RRNMERYAP--QMSSKYLTDFFTDQSVHVIKSHNHSR-- 192
G Y + H + AVG + + +ER P + YLTD F +V +I+ + R
Sbjct: 168 GGHDYFVN-HPEERAVGDQSYKARIERNGPVGEAVPGYLTDAFGADAVRIIRESHTKRPD 226
Query: 193 -PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL + A HT T + P L+ D A + + DRR +A
Sbjct: 227 QPLFLYLAFNAPHTPT----QAPKDLV--------DTMPATLESKDRRTYA 265
>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 588
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 87/144 (60%), Gaps = 12/144 (8%)
Query: 23 GWNDVGFHGEN-DIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI--DT 79
GW+D+G+ + TP +D LA G+ + +YT+ TCTP+RA+ +TG+Y RYG+ +
Sbjct: 6 GWDDIGYQSVDLKGVTPVLDKLAAGGVKITNYYTMNTCTPARASLMTGRYTVRYGMQYNV 65
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
+ G VP++EK+LP+Y KE GY THL+GKWH+G + E +P RGFD ++GY G+
Sbjct: 66 AINPGEPWGVPLSEKMLPEYFKEAGYGTHLVGKWHLGSHSPEHIPSQRGFDTYMGYVGGF 125
Query: 140 LTY---------NDSIHETDFAVG 154
Y +D H DF G
Sbjct: 126 EAYWTHETVGVISDGRHVCDFGFG 149
>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 843
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/216 (32%), Positives = 118/216 (54%), Gaps = 10/216 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG++ TP ID LA +G+ L +Y C PSR +TG++ + GI
Sbjct: 377 GWNDVGYNNPV-FKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIQRHGF 435
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+++P+ E + Q LK++GYSTH+IGKWH G + LP NRGFD G+ + +
Sbjct: 436 GYHPRSLPLDETTIAQPLKQVGYSTHIIGKWHCGFYSDNCLPHNRGFDTFFGFVGAGIEH 495
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
H F + R+N + A Q KY T + ++ ++I +H+ ++P FL ++ +A
Sbjct: 496 --YTHSDHFNHMHNLRKNDDCIAKQYIGKYSTTIYANEGKNIINAHDQNKPFFLYLSFSA 553
Query: 203 VHTGTAGNAKLPTGLLQVPD---MEENDRTFAHISN 235
VHT ++P+ L+ + +E+ RT+A +++
Sbjct: 554 VHTP----LEVPSSYLKQYESTIYDEDRRTYAAMTS 585
>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 81/224 (36%), Positives = 115/224 (51%), Gaps = 18/224 (8%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H I TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 54 QGFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLLTGRYQIHTGLQHSI 112
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ + +P LP+ L+ GYSTH++GKWH+G ++ LP +GFD G G +
Sbjct: 113 IRSRQPSCLPRHMDTLPETLRRAGYSTHMVGKWHLGFYRKSCLPTRKGFDTFFGSLTGSV 172
Query: 141 TYNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQI 198
Y V G D N ER A KY T FT ++ ++KSH+ + RPLFL +
Sbjct: 173 DYYSYGSCNGPGVCGYDLHDN-ERVAWGHEGKYSTTLFTQRAHKILKSHDPADRPLFLLL 231
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ AVH G LQ P + + ++N DRR FA
Sbjct: 232 SLQAVH-----------GPLQPP--KSFVYLYRDMANVDRRKFA 262
>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 121/230 (52%), Gaps = 25/230 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+NDVGF+G +IPTP ID++A NG+ YT C PSRA F+TG+Y R+G +
Sbjct: 32 GYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFITGRYQQRFGFERNP 91
Query: 82 GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ A+P +E + + L ++GY +IGKWH+G + L P RGFD G+ G
Sbjct: 92 QWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLGA-EPSLRPNKRGFDEFFGHLGG 150
Query: 139 ---YLTYNDSI-HETDFAVGLDARRN--MERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
++ + I H + LD+ R+ P ++KYLT+ F+D++V IK NH +
Sbjct: 151 GHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTPVKTTKYLTEEFSDEAVSFIK-RNHQK 209
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
P FL +++ A H L + E+ F HI +P R+ +A
Sbjct: 210 PFFLFLSYNAPH-------------LPLQATEKYLARFPHIKDPKRKTYA 246
>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
Length = 560
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 112/224 (50%), Gaps = 17/224 (7%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ DVG+HG ++I TP +D LA G+ L +Y P C+PSR+ +TG+Y G+ +
Sbjct: 48 QGFRDVGYHG-SEIKTPTLDRLAAQGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSI 106
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ A +P+ LP LK+ GY+TH++GKWH+G K LP RGFD G G
Sbjct: 107 IRATQPNCLPLENVTLPLKLKQAGYATHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSG 166
Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFLQI 198
Y+ E G D E Q Y T FT +++ ++ H+ H +PLFL +
Sbjct: 167 DHYSHYKCEAPGMCGYDLYEGEEAAWEQDRGLYSTVMFTQKAISILAKHDPHRKPLFLYL 226
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ AVH+ LQVP + ISN RR +A
Sbjct: 227 AYQAVHSP-----------LQVP--SRYLERYKGISNVHRRKYA 257
>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
Length = 370
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 74/184 (40%), Positives = 98/184 (53%), Gaps = 5/184 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+NDVGF DI TPNID LA G+V+ Y+ CTPSR A +TG+YP++ G+ V
Sbjct: 22 GYNDVGFRNP-DIITPNIDKLARKGVVMTNSYSTHVCTPSRHALMTGRYPYKTGMQNFVI 80
Query: 83 AGVAKAVPVTE-KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G A E K LPQYLK LGY+TH +GKWH+G ++E LP RGFD+ G G
Sbjct: 81 PGDAPVCSGLEYKFLPQYLKSLGYNTHAVGKWHLGDCRDECLPTERGFDSFYGLLLGGGG 140
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + + A ++++ A S+ D D+ V SHN P+FL
Sbjct: 141 YWNHTYTLFGAYDWFNNKDLDLSANGTHSQ---DLMVDRLSAVFASHNREEPMFLYFAPQ 197
Query: 202 AVHT 205
HT
Sbjct: 198 NPHT 201
>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
Length = 473
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 63/183 (34%), Positives = 100/183 (54%), Gaps = 10/183 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ DVG+ G + I TPNID LA G+ L Y P C+P+RAA +TGK P +GID P+
Sbjct: 54 GYGDVGYLG-SQIQTPNIDNLASQGVTLKHGYAYPICSPTRAALMTGKNPLNFGIDGPME 112
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+P +P+ +E GY T ++GKWH+G K +P NRGFD+ G+ G++ Y
Sbjct: 113 NDA--MLPEDLTTMPERFQEAGYQTWMVGKWHLGMAKRSAMPHNRGFDDFYGFLGGFVDY 170
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
+ + GLD + N + ++T+ T +++ I +P F+ ++++A
Sbjct: 171 YTHV----YFGGLDWQNNDTSLREE---GFVTELLTAKAIDKITHFKGDKPFFMYLSYSA 223
Query: 203 VHT 205
HT
Sbjct: 224 PHT 226
>gi|313225802|emb|CBY07276.1| unnamed protein product [Oikopleura dioica]
Length = 207
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 62/130 (47%), Positives = 82/130 (63%), Gaps = 4/130 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+ND+G++ TPNI+ LA NGI+L+ HY+ P C+PSR+ FLTG+Y FRYG+ +
Sbjct: 6 GYNDIGYNSVEAF-TPNINYLAKNGIILDSHYSQPVCSPSRSQFLTGRYSFRYGMQHRNI 64
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+TEK+LP+ KE GYST GKWH G E LP +RGFD VG ++G +
Sbjct: 65 LPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRGFDKFVGSYSG--S 122
Query: 142 YNDSIHETDF 151
S HE F
Sbjct: 123 SQHSTHEKCF 132
>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
Length = 617
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 62/130 (47%), Positives = 82/130 (63%), Gaps = 4/130 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+ND+G++ TPNI+ LA NGI+L+ HY+ P C+PSR+ FLTG+Y FRYG+ +
Sbjct: 169 GYNDIGYNSIEAF-TPNINYLAKNGIILDSHYSQPVCSPSRSQFLTGRYSFRYGMQHRNI 227
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
VP+TEK+LP+ KE GYST GKWH G E LP +RGFD VG ++G +
Sbjct: 228 LPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRGFDKFVGSYSG--S 285
Query: 142 YNDSIHETDF 151
S HE F
Sbjct: 286 SQHSTHEKCF 295
>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
Length = 494
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 116/221 (52%), Gaps = 21/221 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDVG+H D+ TP +D LA+ G++LN+ Y CTPSR AF+TG +P+ G V
Sbjct: 34 GWNDVGWHNP-DVRTPVLDQLAHEGVILNQSYVNYVCTPSRTAFMTGYFPYHVGSQHLVF 92
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
A+ LP+ LK LGY+TH++GKWH+G + P RGFD++ GY++G
Sbjct: 93 RPDQPSAILSNFTFLPEKLKSLGYATHMVGKWHLGFCNWKFTPTFRGFDSYYGYYSGAED 152
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y T+ SI G+D N + Q + Y F+ ++ ++ H+ + PLFL +
Sbjct: 153 YFTHFRSIRNG--TGGIDFHDNKDVVTDQ-NGTYSAYLFSQRAADIVNKHDPNTPLFLYL 209
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
VH L+VP E+ +A++ + +RR
Sbjct: 210 PFQNVHAP-----------LEVPKRFED--MYANVQDENRR 237
>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 119 bits (299), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 67/191 (35%), Positives = 103/191 (53%), Gaps = 17/191 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ DVG+ + I TPNID LA G+ L RHY P+C PSRA + G+Y G
Sbjct: 23 GYFDVGYRNRSVIKTPNIDKLAAEGVKLERHYAQPSCLPSRACLMMGRYQIHTGYRDECM 82
Query: 83 AGVAKAVPVTEKL--LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ + LP +K+ GY TH+IGKWH+G N + LP +GFD + GY
Sbjct: 83 NDTRYQRCMNHDIVTLPMKMKQNGYVTHMIGKWHLGNNNWDCLPNAKGFDTYFGY----- 137
Query: 141 TYNDSIHETDFAVGLDARRN---MERYAPQMSSKYL----TDFFTDQSVHVIKSHNHSRP 193
++ E + L R+N + R ++ KY+ T FT+++V++I++H+ S+P
Sbjct: 138 ---NAAAEDYYTHMLSGRQNCSDLWRDRMDVADKYIGQYSTRIFTEEAVNIIENHDISQP 194
Query: 194 LFLQITHAAVH 204
+F+ + H AVH
Sbjct: 195 MFMYLAHQAVH 205
>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
Length = 328
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/191 (35%), Positives = 98/191 (51%), Gaps = 9/191 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
G+ND+GF D+ TPN+D LA G++L +Y CTPSR A +TG+YP+R + +
Sbjct: 12 GYNDIGFRNP-DVQTPNLDYLANKGVILTNNYVQAVCTPSRHALMTGRYPYRSAMQNFVI 70
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AK + K LPQYLKELGY HLIGKW++G +EE LP +RGFD+ G +G
Sbjct: 71 NPDQAKCTALEYKFLPQYLKELGYQNHLIGKWNLGYCREECLPTSRGFDSFFGLLDGAGD 130
Query: 142 YNDSIHETDFAVGLDARRNM-------ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y + + + M ++ + M D D+ + H++ +PL
Sbjct: 131 YWEHTTYGLYDCTGQSLAGMACLCEFTQKISILMPVICFQDLELDRLDKIFTEHDNKQPL 190
Query: 195 FLQITHAAVHT 205
FL HT
Sbjct: 191 FLYFAPQNPHT 201
>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 524
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 80/249 (32%), Positives = 128/249 (51%), Gaps = 30/249 (12%)
Query: 6 GAGVAKAVPVTEKLLPQ--GWNDVGFHGEND---IPTPNIDALAYNGIVLNRHYTLPTCT 60
G+ A+ +P +L G+NDVG+HG + I TPN+D LA G+ L +Y P C+
Sbjct: 21 GSSYAEQLPNVVFILADDYGFNDVGYHGRSHGSAILTPNLDMLAGEGVKLENYYVQPICS 80
Query: 61 PSRAAFLTGKYPFRYGIDTPVGAGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHI 115
P+R+ ++G RY I T + GV + +P+ E LPQ LKE GY+T+++GKWHI
Sbjct: 81 PTRSQLMSG----RYQIHTGLQHGVIRPPQPNCLPLDEVTLPQKLKENGYATNMVGKWHI 136
Query: 116 GCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAP--QMSSKYL 173
G + LP RGFD++ W + + P Q +Y
Sbjct: 137 GFYLDACLPTERGFDSYFA-WEDHFSCLPXXXXXXXXXXXXXXXXXANKTPVFQYEGQYS 195
Query: 174 TDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
T FT++++ VI+ H+ ++PLF+ + + AVH P L+VPD + + +I
Sbjct: 196 THLFTNKTIDVIERHDKTKPLFIYLAYQAVH--------FP---LEVPDSYMD--PYMNI 242
Query: 234 SNPDRRLFA 242
++ +RR +A
Sbjct: 243 TDKNRRTYA 251
>gi|319954018|ref|YP_004165285.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422678|gb|ADV49787.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 434
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 73/190 (38%), Positives = 107/190 (56%), Gaps = 14/190 (7%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QGW DVGF+G DIPTPN+D LA G++ + Y + P C+PSRA LTG+Y R+G D
Sbjct: 7 QGWADVGFNGATDIPTPNLDRLASEGVIFSNGYVSHPYCSPSRAGLLTGRYQARFGHDCN 66
Query: 81 V---GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
+ G A P++EK++ + LKE GY T IGKWH+G + +L P +GFD+ G+
Sbjct: 67 MPYDGKNDASVGTPLSEKMISEALKEQGYRTSAIGKWHLG-DHPDLYPPAQGFDHWFGFP 125
Query: 137 NGYLTY-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G + Y +S +E RN + P+ YLTD FT +++ I + +P F
Sbjct: 126 GGGMNYWGESKNEIQTIY-----RN-RKVVPEEELTYLTDDFTTEAIRFI-TQKDEKPFF 178
Query: 196 LQITHAAVHT 205
+ + + A H
Sbjct: 179 MYLAYNAPHA 188
>gi|291233691|ref|XP_002736785.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 100/178 (56%), Gaps = 4/178 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+H ++ P ++ LA +G++ N+ Y PTCTP+RAA ++G YPF+ G +
Sbjct: 42 GWNDVGWHNP-EVKMPVLNQLAADGVIFNQAYVQPTCTPTRAALMSGYYPFKTGNQHQLL 100
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ K LPQ LK++GY TH++GKWH+G KE LP NRGFD+ G +
Sbjct: 101 LNLHPGGLPLRFKTLPQRLKDVGYLTHIVGKWHLGFCKEAFLPTNRGFDSFYGGLTLGTS 160
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
+ ++ G D N PQ ++ YL D++V +I H PLF+ +
Sbjct: 161 HFSKMNGILSTPGYDFYDN-SGVVPQ-TNDYLAFMLADRAVKIINGHYQEYPLFMYFS 216
>gi|149196020|ref|ZP_01873076.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149140867|gb|EDM29264.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 462
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 75/221 (33%), Positives = 119/221 (53%), Gaps = 25/221 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ DVGF+G DIPTP ID++A NG+ + YT C PSRA F+TG+Y R+G +
Sbjct: 34 GYADVGFNGCKDIPTPGIDSIANNGVKFSSGYTSYSVCGPSRAGFITGRYQQRFGFERNP 93
Query: 82 GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ A+P +E + + L+++GY +IGKWH+G + L P RGF+ G+ G
Sbjct: 94 QWNLTDPNSALPKSEMTIAESLQQVGYHCGIIGKWHLGA-EPSLRPNQRGFNEFFGHLGG 152
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYA--------PQMSSKYLTDFFTDQSVHVIKS 187
Y I +T+ D + M+ YA P ++KYLTD F+D+++ ++
Sbjct: 153 GHAYFPEKLRIIKTE-----DVKNEMDSYASYITRNDTPVKTTKYLTDEFSDEAIRFVEK 207
Query: 188 HNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDR 228
N+ +P FL +++ A HT K L + P +E+ +R
Sbjct: 208 -NYEQPFFLFLSYNAPHTPLQATQKY---LDRFPHIEDQNR 244
>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 588
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 107/190 (56%), Gaps = 7/190 (3%)
Query: 23 GWNDVGFH---GENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID- 78
G+NDVG+H G + I TPNID +AY+G+ L +Y P CTP+R+ +TG+Y G+
Sbjct: 105 GYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLITGRYQIHTGMQH 164
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ G +P+ E L Q LK+ GYSTH +GKWH+G ++ LP RGF++ G G
Sbjct: 165 LNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRRGFESFFGNIMG 224
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
+ ++N + D V + ER + + T +T+++ +I+ ++PLF
Sbjct: 225 SADHWSHNKTALFGDKLVMGKSMYYNERIYWKHEGTFSTTLYTNRARQLIRKQPRNKPLF 284
Query: 196 LQITHAAVHT 205
L +++ AVHT
Sbjct: 285 LYLSYEAVHT 294
>gi|313236789|emb|CBY12041.1| unnamed protein product [Oikopleura dioica]
gi|313242643|emb|CBY39450.1| unnamed protein product [Oikopleura dioica]
Length = 622
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 108/202 (53%), Gaps = 5/202 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
GW DVG++ + TP +D L NG + Y+ C+PSRA LTG+Y FR G+ + P+
Sbjct: 40 GWADVGWNNKGLESTPFMDKLVKNGTQFTQMYSSHRCSPSRAMALTGRYAFRSGMGSFPI 99
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
V + +K LP+YLKE+GY TH +GKWH+G LP +RGFD G+++G +
Sbjct: 100 AREVPFGMNTQDKTLPEYLKEVGYDTHAVGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVD 159
Query: 142 YNDSIHETDFAVGLDARRN-MERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSR-PLFLQ 197
Y + D N +E++ + S ++ TD F D+++ ++K S+ P ++
Sbjct: 160 YRGHFIKRSKNFYHDFFDNTIEQHKLDLESDGQWTTDLFRDRTIDILKEAKRSKTPAYVY 219
Query: 198 ITHAAVHTGTAGNAKLPTGLLQ 219
+ A H T A L +L+
Sbjct: 220 LAFNAPHEPTRAPADLIARILE 241
>gi|156368526|ref|XP_001627744.1| predicted protein [Nematostella vectensis]
gi|156214663|gb|EDO35644.1| predicted protein [Nematostella vectensis]
Length = 157
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 55/117 (47%), Positives = 75/117 (64%), Gaps = 1/117 (0%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DVG+H D+ TPNID LA G+VL +Y P CTP+R FL+G+YP G+ + +
Sbjct: 33 GWSDVGYHNITDLKTPNIDRLAGEGVVLENYYVQPICTPARGTFLSGRYPIHTGLQHSNI 92
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+P+ LLPQ LK+ GYSTH +GKWH+G ++E P RGFD GY++G
Sbjct: 93 HETEPFGLPLDFTLLPQKLKKAGYSTHAVGKWHLGFFEKEYTPLYRGFDTFFGYYSG 149
>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 732
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/188 (36%), Positives = 103/188 (54%), Gaps = 7/188 (3%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H I TP +D LA G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 59 QGFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLITGRYQIHTGLQHSI 117
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P LP+ L+E GY+TH++GKWH+G ++ LP +GFD G G +
Sbjct: 118 IRPRQPSCLPSHMDTLPERLREAGYTTHMVGKWHLGFYRKACLPTRKGFDTFFGSLTGSV 177
Query: 141 TY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQ 197
Y +S D G D + E A KY T FT ++ +++SH+ + RPLFL
Sbjct: 178 DYYSYESCDGKDLC-GYDLHDD-EGVAWGQEGKYSTTLFTQRARKILESHDPAERPLFLL 235
Query: 198 ITHAAVHT 205
++ AVHT
Sbjct: 236 LSFQAVHT 243
>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
Length = 455
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 99/184 (53%), Gaps = 9/184 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G DV + G + I TP +DALA +G L + Y P C+P+R+A LTG+YP RYG+ V
Sbjct: 40 GGADVSWRG-SPIKTPQLDALANSGAKLEQFYVQPVCSPTRSALLTGRYPMRYGLQVGVV 98
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A +P+ E+ L + L++ GY T ++GKWH+G LP RGFD+ G++NG L
Sbjct: 99 RPWADYGLPLDERTLAEALQDAGYETAIVGKWHLGHVSPAYLPMARGFDHQYGHYNGALD 158
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y H+ D ++ R Y T ++V VI+ + +PLFL +
Sbjct: 159 Y--FTHDRDGGHDWHKDDHVNR-----DEGYATHLIAQEAVRVIQDRDKKKPLFLYVPFN 211
Query: 202 AVHT 205
AVH+
Sbjct: 212 AVHS 215
>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 574
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/187 (36%), Positives = 102/187 (54%), Gaps = 5/187 (2%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H I TP +D LA G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 56 QGFNDIGYHNPT-IKTPTLDKLAAEGVRLENYYVQPICTPSRSQLMTGRYQIHTGLQHSI 114
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P LP+ L++ GYSTHL+GKWH+G ++ LP +GFD G G +
Sbjct: 115 IRPSQPSCLPSHMDTLPERLRQAGYSTHLVGKWHLGFYRKACLPTRKGFDTFFGSLTGSV 174
Query: 141 T-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQI 198
YN + G D + E A KY T FT ++ +++SHN + +PLFL +
Sbjct: 175 DHYNYLSCDGPGVCGYDL-HDGEGVAWGQEGKYSTTLFTQRARKILESHNPTEKPLFLLL 233
Query: 199 THAAVHT 205
+ AVHT
Sbjct: 234 SLQAVHT 240
>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 454
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 75/229 (32%), Positives = 114/229 (49%), Gaps = 30/229 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G+ DVG+HG +IPTPNID +A G+ + Y+ + C P+RAA ++G Y R G +
Sbjct: 31 GYADVGYHGLEEIPTPNIDRIANEGVQFSAGYSNGSICGPTRAALMSGVYQQRIGCEGIC 90
Query: 82 GAG-----VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNK---EELLPFNRGFDNHV 133
G V +P K L QY +E GY+T L GKWH+G + + L+P +RGFD
Sbjct: 91 GGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGERLFDKTLMPTSRGFDEFF 150
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
G G Y+D+++ + D + E +Y TD ++V I + +P
Sbjct: 151 GILEGASLYDDTVNRERKYIRQDTVIDYE-------GEYFTDAIGREAVSFI-TRKGDKP 202
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
FL + AVH + E+ + FAHI++P+RR+FA
Sbjct: 203 FFLYLPFTAVHAPMQAS-------------EKYMQRFAHIADPNRRVFA 238
>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 453
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 73/187 (39%), Positives = 101/187 (54%), Gaps = 15/187 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+GF G DIPTP ID LA +G+ + Y + P C+P+RA LTG+Y R+G +
Sbjct: 44 GYGDLGFQGGRDIPTPRIDGLARSGVTCSSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 103
Query: 82 G--AGVAK--AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
G A V + + + E+ LPQ LK+ GY+T ++GKWH+G + P RGFD G+
Sbjct: 104 GNAARVTETFGLSLEERTLPQRLKQAGYATGIVGKWHLGF-APQFQPLERGFDEFFGFLG 162
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
G Y + D RR E + S+YLTD F +SV I N +RP FL
Sbjct: 163 GAHPYFPDANSND-----PIRRGREAV---VESEYLTDAFARESVAYI-DRNKNRPFFLY 213
Query: 198 ITHAAVH 204
+ AVH
Sbjct: 214 LAFNAVH 220
>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 517
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 118/226 (52%), Gaps = 22/226 (9%)
Query: 23 GWNDVGFHG---ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
G+ND+G+H +D+ TP +D+LA G+ L +Y P C+PSR+ ++G+Y G+
Sbjct: 35 GFNDIGYHAVEHHSDMKTPFLDSLAMAGVRLENYYIQPICSPSRSVLMSGRYQIHTGLQH 94
Query: 80 PVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
V + + +P+ +LP+ L + GY TH++GKWH+G K+E LP+ RGF+++ GY G
Sbjct: 95 YVISPQQRNGLPLDNIILPEQLHKCGYDTHMVGKWHLGFYKDEYLPWKRGFNSYFGYLTG 154
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y G D P ++ +Y + F +++ I H+ ++PLFL
Sbjct: 155 GEDYYTKWRCDGKLCGYDMTSEK---GPTNATYGQYSANLFANKANEAIDKHDKTKPLFL 211
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +VH+ ++VP E + F +I N +R+++
Sbjct: 212 YVAFQSVHSP-----------MEVP--ESYAKPFDYIKNHNRKMYG 244
>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
Length = 452
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 79/218 (36%), Positives = 120/218 (55%), Gaps = 22/218 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAY--NGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
G++DV + ++ + TPN+ LA+ N +L+ Y CTP+R+AF+TG YPFR G
Sbjct: 6 GFSDVDWK-DSTLHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 64
Query: 81 VGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW--- 136
V + A VP L + +++L YST+L+GKWH+G K+E LP NRGFD G++
Sbjct: 65 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 124
Query: 137 NGYLTYN-DSIHETDFAV--GLDARRNM--ERYAPQMSSK--YLTDFFTDQSVHVIKSHN 189
GY ++ D H V GLD + + P S Y TD FTD ++ V+ +HN
Sbjct: 125 TGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 184
Query: 190 HSRPLFLQITHAAVH--------TGTAGNAKLPTGLLQ 219
+S+P F+ +++ AVH + T G K T +L+
Sbjct: 185 NSKPFFMFLSYQAVHPPLQVSQQSKTIGQGKEATFILR 222
>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
Length = 488
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/218 (36%), Positives = 120/218 (55%), Gaps = 22/218 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAY--NGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
G++DV + ++ + TPN+ LA+ N +L+ Y CTP+R+AF+TG YPFR G
Sbjct: 42 GFSDVDWK-DSTLHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 100
Query: 81 VGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW--- 136
V + A VP L + +++L YST+L+GKWH+G K+E LP NRGFD G++
Sbjct: 101 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 160
Query: 137 NGYLTYN-DSIHETDFAV--GLDARRNM--ERYAPQMSSK--YLTDFFTDQSVHVIKSHN 189
GY ++ D H V GLD + + P S Y TD FTD ++ V+ +HN
Sbjct: 161 TGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 220
Query: 190 HSRPLFLQITHAAVH--------TGTAGNAKLPTGLLQ 219
+S+P F+ +++ AVH + T G K T +L+
Sbjct: 221 NSKPFFMFLSYQAVHPPLQVSQQSKTIGQGKEATFILR 258
>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 500
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 116/216 (53%), Gaps = 10/216 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG++ TP ID LA +G+ L +Y C PSR +TG++ + GI
Sbjct: 34 GWNDVGYNNPV-FKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIQHDDY 92
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+++P+ E + + LK +GYSTH++GKWH G + LP NRGFD G+ + +
Sbjct: 93 GFHPRSLPLNETTIAEPLKHVGYSTHIVGKWHCGFYSDNCLPHNRGFDTFFGFVGAGIDH 152
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
H F + R+N + A + KY T F ++ +I +H+ ++PLFL ++ +A
Sbjct: 153 --YTHSDHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSA 210
Query: 203 VHTGTAGNAKLPTGLLQVPDM---EENDRTFAHISN 235
VH ++P+ L+ + +E+ RT+A +++
Sbjct: 211 VH----APLEVPSSYLKQYESTIHDEDRRTYAAMTS 242
>gi|299473382|emb|CBN77780.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 623
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 83/122 (68%), Gaps = 2/122 (1%)
Query: 23 GWNDVGFHGEN-DIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GWND+G+ + TPN++ +A +G+ L+++Y++ CTP+RAA +TG+YP RYG V
Sbjct: 6 GWNDIGYQSTDMHAVTPNLNRIAESGVKLSQYYSMSICTPARAALMTGRYPVRYGFQYKV 65
Query: 82 -GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G +P+TEKL PQ++ + GY++H++GKWH+G + + +P RGF+ ++GY G
Sbjct: 66 INVGAPWGLPLTEKLFPQFMNDAGYTSHMVGKWHLGSHTFDHMPHLRGFETYLGYTQGRE 125
Query: 141 TY 142
TY
Sbjct: 126 TY 127
>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 355
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 70/216 (32%), Positives = 115/216 (53%), Gaps = 10/216 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG++ TP ID LA NG+ L +Y C PSR +TG++ + GI
Sbjct: 34 GWNDVGYNNP-VFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIPQDGF 92
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+++P+ E + + LK GYSTH++GKWH G + LP NRGFD G+ + +
Sbjct: 93 GYHPRSLPLDETTIAEPLKHAGYSTHIVGKWHCGYYADNCLPHNRGFDTFFGFVGAGIDH 152
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
H F + R+N + A + KY T F ++ +I +H+ ++PLFL ++ +A
Sbjct: 153 --YTHSDHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSA 210
Query: 203 VHTGTAGNAKLPTGLLQVPDM---EENDRTFAHISN 235
VH ++P+ L+ + +E+ RT+A +++
Sbjct: 211 VHAPL----EVPSSYLKQYESTIHDEDRRTYAAMTS 242
>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 465
Score = 117 bits (292), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 77/225 (34%), Positives = 111/225 (49%), Gaps = 25/225 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG---ID 78
G+ D GF G + TPN+D LA +G+ + Y + PTC PSRA +TGKY R+G I+
Sbjct: 34 GYMDFGFQGSKVMKTPNLDKLAKSGVTFTQGYVSDPTCGPSRAGMMTGKYQARFGYEEIN 93
Query: 79 TP-------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
P G +P+ +KL+ YLK+LGY T + GKWH+G N + P NRGFD
Sbjct: 94 VPGYMSSHSALKGDEMGLPLDQKLMSNYLKDLGYKTAVYGKWHLG-NADRFHPLNRGFDE 152
Query: 132 HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMS--SKYLTDFFTDQSVHVIKSHN 189
G+ G +Y + T R ME Q + Y TD F +++VH I+ N
Sbjct: 153 FYGFRGGARSYFAYKNPT-------GDRKMETNFGQYEEPNHYATDVFAEKAVHFIE-RN 204
Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
P F+ ++ AVHT L Q P++ + A ++
Sbjct: 205 KEHPFFIYLSFNAVHTPMEATE---ADLAQFPNLTGKRQQLAAMT 246
>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 571
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 107/205 (52%), Gaps = 19/205 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ D+G+HG +DI TP +D LA G+ L +Y P C+PSR+ +TG+Y G+ +
Sbjct: 56 QGYGDIGYHG-SDIHTPVLDRLAAEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSI 114
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P LP+ L E GY+TH++GKWH+G + LP RGF + +G G
Sbjct: 115 IRPRQPLCLPPDSPTLPERLAEAGYATHMVGKWHLGFCRPSCLPTGRGFQSFLGTLTGSG 174
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ +Y + A G D + +R A +M+ Y T + D+ ++K H+ PLFL
Sbjct: 175 DHFSYQSC--DGAEACGFDL-HDGDRPAWEMAGNYSTLLYIDRVKQILKRHDPHTPLFLY 231
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPD 222
++ A HT LQVPD
Sbjct: 232 LSLQAAHTP-----------LQVPD 245
>gi|341889947|gb|EGT45882.1| CBN-SUL-3 protein [Caenorhabditis brenneri]
Length = 432
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 74/195 (37%), Positives = 110/195 (56%), Gaps = 14/195 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAY--NGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
G+ND+ + ++ + TPN+ LA+ N +L Y CTP+R+AF+TG YPFR G
Sbjct: 44 GFNDLDWK-DSTLHTPNLRNLAFHKNTALLTNSYVNQLCTPTRSAFMTGYYPFRVGTQAG 102
Query: 81 VGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW--- 136
V + A VP L + +++L YST+L+GKWH+G K+E LP NRGFD G++
Sbjct: 103 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 162
Query: 137 NGYLTYN-DSIHETDFAV--GLDARRNM--ERYAPQMSSK--YLTDFFTDQSVHVIKSHN 189
GY ++ D H V GLD + + P S Y TD FTD ++ VI +HN
Sbjct: 163 TGYFNHSADQYHRELRRVVKGLDLFEEVGNGKSVPDFSQNGVYSTDLFTDVAMSVIDNHN 222
Query: 190 HSRPLFLQITHAAVH 204
++P F+ +++ AVH
Sbjct: 223 TTKPFFMFLSYQAVH 237
>gi|29348898|ref|NP_812401.1| arylsulfatase [Bacteroides thetaiotaomicron VPI-5482]
gi|383125065|ref|ZP_09945724.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
gi|29340804|gb|AAO78595.1| arylsulfatase B precursor [Bacteroides thetaiotaomicron VPI-5482]
gi|251837419|gb|EES65514.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
Length = 458
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 70/185 (37%), Positives = 99/185 (53%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP ++ALA G+VL+R YT P TP+RA +TG+YP R+GI T V
Sbjct: 37 GWGDVGFHG-SEIKTPCLNALAAEGVVLDRFYTAPISTPTRAGLMTGRYPNRFGIRTTVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ L L GYS +IGKWH+G ++ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETLADMLARNGYSNRAIIGKWHLGHTRKVHYPINRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D + E + LD + E Y T+ T ++V I ++ P L + +
Sbjct: 156 DYFDHMREGE----LDWHNDWETC---YDKGYSTELITQEAVRCINTYEKEGPFLLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
Length = 370
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 100/192 (52%), Gaps = 4/192 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G++D+GF D+ TPNIDALA G++ +Y CTPSR A LTG+YP R + V
Sbjct: 10 GYHDLGFRNP-DVITPNIDALATEGVIFTNNYVQSVCTPSRHALLTGRYPHRSAMQNLVI 68
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ + K LP+YLK+LGYSTH +GKWH+G +EE LP +RGFD+ G ++G
Sbjct: 69 MSNQARCTGLGYKFLPEYLKDLGYSTHAVGKWHVGYCREECLPTHRGFDSFFGLYDGDGY 128
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y + H + G N + + D ++ ++ N P FL +
Sbjct: 129 YWN--HTSTVIPGAFDWNNSTGVYLEARGIHSEDLGAERLTAILDGQNAKEPFFLYFSPQ 186
Query: 202 AVHTGTAGNAKL 213
HT + A+
Sbjct: 187 NPHTPSQPQAEF 198
>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
Length = 318
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 96/175 (54%), Gaps = 7/175 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G D+ TPN+DALA G++L +Y C+PSR A +TG+YP++ + + V
Sbjct: 14 GYHDIGLRNP-DVITPNLDALASKGVILTNNYVQALCSPSRHALMTGRYPYKSAMQSFVV 72
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
AK + KLLPQYLKELGY HLIGKWH+G +EE LP +RGFD+ G +G
Sbjct: 73 LPFEAKCTGLEYKLLPQYLKELGYENHLIGKWHLGYCREECLPTSRGFDSFYGLLDGAGD 132
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y + + L+ E Y + D D+ + H++ PLFL
Sbjct: 133 YWEHTTSGVYDWHLNDEVFHEAYG-----NHSQDLELDRLDKLFAEHDNKDPLFL 182
>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 441
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 72/185 (38%), Positives = 104/185 (56%), Gaps = 16/185 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ DVGFHG DIPTP+ID+LA +G + Y + P C+P+RA LTG+Y R+G +
Sbjct: 40 GYADVGFHGGKDIPTPHIDSLAASGTRFSSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 99
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G G K +P+TE + L+ GY+T L+GKWH+G + + P RGF G+ G+ T
Sbjct: 100 G-GANKGLPLTETTIADRLQAAGYATGLVGKWHLGTDP-KFHPLKRGFGEFFGFLAGHHT 157
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSS-KYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E D ++R +++ YLTD F ++V I+ H + P FL +
Sbjct: 158 YFDK-QEAD----------IQRGTTKVTEPGYLTDAFGREAVSFIERH-QNHPFFLYLAF 205
Query: 201 AAVHT 205
AVHT
Sbjct: 206 NAVHT 210
>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
Length = 333
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 98/190 (51%), Gaps = 8/190 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G D+ TPN+DALA G++L +Y CTPSR A +TG+YP + T V
Sbjct: 14 GYHDIGLRNP-DVITPNLDALASKGVILTNNYVQALCTPSRHALMTGRYPSASAMQTSVI 72
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ AK + KLLPQYLK+LGY H++GKWH+G ++E LP +RGFD G + G
Sbjct: 73 LPMRAKCTGLEYKLLPQYLKDLGYKNHMVGKWHLGYCRDECLPTSRGFDTFYGLYAGTGD 132
Query: 142 Y--NDSIHETDFAVGLDARRNMERYAPQMSSKY----LTDFFTDQSVHVIKSHNHSRPLF 195
Y + + D+ D Q+ S Y L D ++ V H+ PLF
Sbjct: 133 YWSHTFFGKYDWHTNADIDFEANSTHSQVRSSYMNFVLQDLEMERLDKVFDEHDSKDPLF 192
Query: 196 LQITHAAVHT 205
L HT
Sbjct: 193 LYFAPQNPHT 202
>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
Length = 330
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/194 (37%), Positives = 104/194 (53%), Gaps = 18/194 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ND+G + D+ TPN+DALA G++L +Y C+PSR A +TG+YP + + V
Sbjct: 14 GYNDLGLR-DPDVITPNMDALASKGVILTNNYVQAVCSPSRHALMTGRYPSASAMQSIVI 72
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------- 134
+ AK + K LPQYLK+LGY H+IGKWH+G +EE LP +RGFD G
Sbjct: 73 QPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRGFDTFYGLYASSGD 132
Query: 135 YW-NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKY--LTDFFTDQSVHVIKSHNHS 191
YW +G + D T+ V +AR Q+ S+Y + D ++ V H++
Sbjct: 133 YWEHGIMGMYD--WHTEAGVDFEARGT----HAQVGSRYWHIYDLEMERLDKVFDEHDNK 186
Query: 192 RPLFLQITHAAVHT 205
PLFL HT
Sbjct: 187 DPLFLYFAPQNSHT 200
>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 568
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/189 (35%), Positives = 104/189 (55%), Gaps = 9/189 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ND+G+H +I +P +D LA G+ L +Y P CTPSR+ +TG+Y G+ +
Sbjct: 54 QGFNDIGYH-SGEIRSPTLDKLASEGVRLENYYVQPLCTPSRSQLITGRYQIHTGLQHSI 112
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ LPQ L+E+GYSTH++GKWH+G +++ LP RGF + G G
Sbjct: 113 IRPRQPNCLPLDVVTLPQRLQEIGYSTHMVGKWHLGFYRKDCLPTRRGFHTYFGSLTGSV 172
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL 196
Y TY + G D E A + KY T +T + ++ +H+ S+PLF+
Sbjct: 173 DYYTYGSC--DGKSLCGFDLHEG-ESVAWGRAGKYSTHLYTQRVRKILATHDPTSQPLFI 229
Query: 197 QITHAAVHT 205
++ AVHT
Sbjct: 230 FLSLQAVHT 238
>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
Length = 500
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/195 (37%), Positives = 110/195 (56%), Gaps = 14/195 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAY--NGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
G+ND+ + ++ + TPN+ LA+ N +L Y CTP+R+AF+TG YPFR G
Sbjct: 42 GFNDLDWK-DSTLHTPNLRNLAFHKNTALLTNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 100
Query: 81 VGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW--- 136
V + A VP L + +++L YST+L+GKWH+G K+E LP NRGFD G++
Sbjct: 101 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 160
Query: 137 NGYLTYN-DSIHETDFAV--GLDARRNM--ERYAPQMSSK--YLTDFFTDQSVHVIKSHN 189
GY ++ D H V GLD + + P S Y TD FTD ++ V+ +HN
Sbjct: 161 TGYFNHSADQYHRELKRVVKGLDLFEEVGNGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 220
Query: 190 HSRPLFLQITHAAVH 204
++P F+ +++ AVH
Sbjct: 221 TTKPFFMFLSYQAVH 235
>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 706
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 77/117 (65%), Gaps = 2/117 (1%)
Query: 23 GWNDVGFHGEN-DIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
GWND+G+ + TP++D LA G+ + +YT+ CTP+RA+ +TG+Y RYG+ +
Sbjct: 142 GWNDIGYQSVDLQGVTPHLDRLAAGGVKMTNYYTMSICTPARASLMTGRYVMRYGLQYSV 201
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
+ G +P+TEK+ P+Y+K+ GY TH+IGKWHIG +P RGFD ++GY N
Sbjct: 202 IQPGAPWGLPLTEKIFPEYMKDAGYETHMIGKWHIGSYTSRHIPSQRGFDTYLGYLN 258
>gi|325109725|ref|YP_004270793.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969993|gb|ADY60771.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 471
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/191 (36%), Positives = 108/191 (56%), Gaps = 12/191 (6%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QGW+DVGF+G +IPTP++DALA +G+ + Y + P C+PSRA LTG+Y R+G +
Sbjct: 38 QGWSDVGFNGCKEIPTPHLDALAKSGVAFDCGYASHPYCSPSRAGLLTGRYQQRFGHECN 97
Query: 81 VGAG------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
GA + +P++E LL + GY T IGKWH+G ++ + P RGF+ G
Sbjct: 98 PGAHGNDDAIEMEGLPLSETLLSTVFRNAGYRTGAIGKWHLG-DEPQFWPTERGFEEWFG 156
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
+ G L+Y + + G+ RN + P+ YLTD F+ ++V ++ N +RP
Sbjct: 157 FSGGGLSYWGDLGKKPPLHGV--LRNGD-VVPKDELTYLTDDFSTEAVKFVE-ENRARPF 212
Query: 195 FLQITHAAVHT 205
FL + + A H
Sbjct: 213 FLYLAYNAPHA 223
>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 495
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 114/216 (52%), Gaps = 10/216 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG++ TP ID LA NG+ L +Y C PSR +TG++ + GI
Sbjct: 34 GWNDVGYNNPV-FKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIPNDGF 92
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+++P+ E + + LK GYS H++GKWH G + LP NRGFD G+ + +
Sbjct: 93 GYHPRSLPLDETTIAEPLKHAGYSNHIVGKWHCGYYADNCLPHNRGFDTFFGFVGAGIDH 152
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAA 202
H F + R+N + A + KY T F ++ +I +H+ ++PLFL ++ +A
Sbjct: 153 --YTHSDHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSA 210
Query: 203 VHTGTAGNAKLPTGLLQVPDM---EENDRTFAHISN 235
VH ++P+ L+ + +E+ RT+A +++
Sbjct: 211 VHAPL----EVPSSYLKQYESTIHDEDRRTYAAMTS 242
>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
Length = 496
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 73/188 (38%), Positives = 102/188 (54%), Gaps = 16/188 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ D+GF G +DI TPN+D LA GIVLNR Y+LP CTP+R+A +T + P + G
Sbjct: 34 GYADLGFRG-SDIQTPNLDRLAAEGIVLNRFYSLPICTPTRSALMTARDPIKLGT---AY 89
Query: 83 AGVA----KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
AG+ V E +P+ K+ GY T +IGKWHIG E L+P +RGFD+ G+ N
Sbjct: 90 AGLQPWENGGVSPDEHFMPESFKKAGYQTAMIGKWHIGRQYESLVPHHRGFDHFFGHLNT 149
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRPLFLQ 197
+ Y H + A G D + N + Y TD D+SV +K + S+P L
Sbjct: 150 QVDY--YTHAS--AGGHDLQENGKSLK---RDAYATDIHGDESVRYLKEIRDPSKPFLLY 202
Query: 198 ITHAAVHT 205
+ A H+
Sbjct: 203 VPFLAPHS 210
>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 468
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 79/226 (34%), Positives = 114/226 (50%), Gaps = 29/226 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
G++D+G HG DIPTP++DALA +G+ Y + P C+P+RA LTG+Y R+G +
Sbjct: 41 GYHDLGVHGCKDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 100
Query: 79 --TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
TP G +P++E L LK++GY T ++GKWH+G N E+ P +RGFD G+
Sbjct: 101 GPTPTG---EIGLPLSETTLADRLKKVGYKTGMVGKWHLG-NDEKRHPLSRGFDEFFGFL 156
Query: 137 NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
G TY + L R + +YLTD F ++V I S P FL
Sbjct: 157 GGARTYFATPGNASAGTKLLRGREVVD-----EKEYLTDAFAREAVAYIDRSKAS-PFFL 210
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+T AVHT + K DR F +S+P R+ +
Sbjct: 211 YLTFNAVHTPMEASQKY------------LDR-FTAVSDPKRQKYC 243
>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
Length = 502
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 98/162 (60%), Gaps = 11/162 (6%)
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ G + +P++ ++LP++L+ LGY+T +IGKWH+G + + P +RGFD+ +G++N ++
Sbjct: 8 IQGGEPRGLPLSVRILPEHLRGLGYTTKMIGKWHLGYHTPQHTPLHRGFDSFLGFYNSHV 67
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
+Y D + G D R + A ++ KY+TD FTD++V +I++H+ SRPL+LQI+H
Sbjct: 68 SYYDYKYSYQNMSGYDMHRG-DAPAYGLTDKYVTDLFTDEAVRIIQTHDPSRPLYLQISH 126
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH L D + D+ F HI +RR +A
Sbjct: 127 LAVH----------APLENPQDYDHYDKRFMHIVEQNRRKYA 158
>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 545
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 117/233 (50%), Gaps = 31/233 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+ND+G+ + TPN+D LA GI L+ +Y P CTPSRA ++GKY G+ + +
Sbjct: 70 GFNDIGYRNPA-MRTPNLDYLAAEGIKLDNYYVQPICTPSRAQLMSGKYQIHTGLQHSII 128
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ LPQ LKE GY+TH+ GKWH+G K+E P NRGFD+ +G L
Sbjct: 129 WPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYKKECWPTNRGFDSFLGI---LLG 185
Query: 142 YNDSIHETDFA-----------VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH 190
D T+ GLD R ++ S Y T ++ ++I+ H+
Sbjct: 186 KGDHFLHTEEGGGGPYPSTWPWEGLDFRDGLQS-TNAYSGIYSTHVIAERVENIIEKHDK 244
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTF-AHISNPDRRLFA 242
+PLFL ++ AVHT LQVP E + F + I + RR++A
Sbjct: 245 DKPLFLYVSFQAVHTP-----------LQVP--ESYLQPFESSIQDEKRRIYA 284
>gi|291513548|emb|CBK62758.1| Arylsulfatase A and related enzymes [Alistipes shahii WAL 8301]
Length = 467
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/185 (38%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GW DVG+HG + IPTPNIDALA GI +NR YT P +P+RA +TG+YP R+GI T +
Sbjct: 43 GWGDVGYHG-SVIPTPNIDALAARGIEMNRFYTAPVSSPTRAGLMTGRYPSRFGIRKTVI 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYS-THLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ L L GY+ ++GKWH+G + P NRGF + G NG L
Sbjct: 102 PPWRDYGLDPEEQTLADMLAANGYAHRAIVGKWHLGHGRRAYYPLNRGFTHFYGCLNGAL 161
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y E + LD + E Y TD D++V I + P FL +
Sbjct: 162 DYFTHEREGE----LDWHNDWESC---RDEGYSTDLIADEAVRCIGGYASEGPFFLYVAF 214
Query: 201 AAVHT 205
A HT
Sbjct: 215 NAPHT 219
>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 82/227 (36%), Positives = 113/227 (49%), Gaps = 32/227 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
G+ D+GF G DIPTP++DALA G+ Y + P C+P+RA LTG+Y R+G + P
Sbjct: 48 GYGDLGFQGARDIPTPHLDALAQGGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 107
Query: 81 VGAGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
G G A +PVTE L LK GY+T L+GKWH+G ++ + P RGFD G+
Sbjct: 108 GGGGGAAAAKNVGLPVTETTLADRLKAAGYATGLVGKWHLG-SEAKFHPQKRGFDEFFGF 166
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G TY S D R E + YLTD F+ +++ I H P F
Sbjct: 167 LGGQHTYFASKSG-------DVYRGTEVVKEEA---YLTDAFSREALSFIDRHK-DHPFF 215
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
LQ++ AVHT + E+ F+ I +P RR +A
Sbjct: 216 LQLSFNAVHT-------------PMDATEDRVARFSSIEDPKRRTYA 249
>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
Length = 775
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 97/165 (58%), Gaps = 9/165 (5%)
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
P+ + +P+ LLP+YL+ LGY+THL+GKWH+G + + P RGFD +GY+ G
Sbjct: 305 PLRGAERRGIPLNNTLLPEYLRRLGYTTHLVGKWHVGYHTKNFGPTRRGFDTFLGYYTGM 364
Query: 140 LTY-NDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y N +++E+ +G D R + E + + Y+TD TD+ +I SHN +P++LQ
Sbjct: 365 IQYFNHTLYESG-QLGYDLHRIVGENHTVEYRYDYMTDLLTDEVESIISSHNTEKPMYLQ 423
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
++H A H A +++V D +E + TF +I + +RR +A
Sbjct: 424 LSHLAPHASDAEE------VMEVRDWKETNDTFGYIKDLNRRKYA 462
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 59/162 (36%), Positives = 94/162 (58%), Gaps = 11/162 (6%)
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ G + +P+ K+LP++L+ LGY+T+LIGKWH+G + + P RGFD G++N ++
Sbjct: 6 IQGGEPRGLPLNVKILPEHLQGLGYTTNLIGKWHLGYHTLQHTPSYRGFDYFCGFYNSHV 65
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
+Y+D + G D + A ++ KY+TD FTD++V +I++H+ RPL+LQI+H
Sbjct: 66 SYHDYKYSYQNMSGYDMHCG-DAPAYGLNDKYVTDLFTDKAVKIIENHDSFRPLYLQISH 124
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVH L D + +DR F HI RR +A
Sbjct: 125 LAVH----------APLENPQDYDHSDRRFIHIREQHRRKYA 156
>gi|325285341|ref|YP_004261131.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
gi|324320795|gb|ADY28260.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
Length = 460
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 72/188 (38%), Positives = 106/188 (56%), Gaps = 12/188 (6%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QGW DVGF+G DIPTPN+D +A G++ + Y + P C+PSRA LTG+Y R+G D
Sbjct: 36 QGWADVGFNGATDIPTPNLDRIASEGVIFSNGYVSHPYCSPSRAGLLTGRYQARFGHDCN 95
Query: 81 V---GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
+ G A P++EKL+ + LKE GY T IGKWHIG + L P +GFD+ G+
Sbjct: 96 MPYEGENDATVGTPLSEKLISEALKEQGYRTSAIGKWHIG-DHPNLHPPAQGFDHWFGFP 154
Query: 137 NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
G + Y + RN + A + + YLTD FT+++++ I + + P F+
Sbjct: 155 GGSMNYWGKATSKIQTI----YRNTKPVAEEELT-YLTDDFTNEAINFINKKDKN-PFFI 208
Query: 197 QITHAAVH 204
+ + A H
Sbjct: 209 YLAYNAPH 216
>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 64/183 (34%), Positives = 102/183 (55%), Gaps = 4/183 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G++D+G+ +D+ +PNID LA N + + +Y P+CTPSRAAF+TG+Y RYG+ + V
Sbjct: 35 GFDDLGYVN-DDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVI 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+A+P++E LLPQ K+ GY+T + GKWH+G E+ P NRGFD G++ G
Sbjct: 94 KPDEPEAIPLSETLLPQAFKKCGYNTSMHGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQD 153
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y H++ L E+ ++ Y T + + + ++ PLF ++
Sbjct: 154 Y--FYHDSGNNCYLYEPNGTEKVRLDLNGTYSTKAIAEDFIAKLDEYDPETPLFEFLSFQ 211
Query: 202 AVH 204
VH
Sbjct: 212 EVH 214
>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 474
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 73/222 (32%), Positives = 110/222 (49%), Gaps = 16/222 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
G+ D GF G ++ TP++D LA I ++ Y + C PSRA LTGKY R+G
Sbjct: 47 GYADFGFQGSSEFKTPHLDQLASQSIRFSQAYVSAAVCGPSRAGILTGKYQQRFGYEENN 106
Query: 77 ----IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
+ G +P+ +KLLP+YLKE GY T L GKWH+G N ++ P RGFD
Sbjct: 107 VPGYMSASATTGDEMGLPLDQKLLPEYLKEQGYKTALFGKWHMG-NADKFHPTKRGFDTF 165
Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
G+ G +Y + +E + + R + S YLTD + + I+ N +
Sbjct: 166 YGFRGGARSYYE-FNENNKNNRQEDRLERGFGNFEESKLYLTDALAEATTDFIEK-NQKQ 223
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
P F+ ++ AVHT P L Q P+++ +T A ++
Sbjct: 224 PFFVYLSFNAVHTPMEAR---PDDLKQFPNLKGKRKTLAAMT 262
>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/183 (35%), Positives = 103/183 (56%), Gaps = 4/183 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G++D+G+ +D+ +PNID LA N + + +Y P+CTPSRAAF+TG+Y RYG+ + V
Sbjct: 35 GFDDLGY-VNDDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVI 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+A+P++E LLPQ LK+ GY+T + GKWH+G E+ P NRGFD G++ G
Sbjct: 94 KPDEPEAIPLSETLLPQALKKCGYNTSMHGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQD 153
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y H++ L E+ ++ Y T + + + ++ PLF ++
Sbjct: 154 Y--FYHDSGNNCYLYEPCYREKVRLDLNGTYSTKAIAEDFIAKLDEYDPETPLFEFLSFQ 211
Query: 202 AVH 204
VH
Sbjct: 212 EVH 214
>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 441
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 72/195 (36%), Positives = 105/195 (53%), Gaps = 24/195 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYG--IDT 79
G +D +G + TP+ID++A+NGI + YT + C+PSRA LTG+Y +G +
Sbjct: 32 GSSDFSCYGSKQLLTPHIDSIAHNGIKFTQAYTASSVCSPSRAGLLTGRYQQTFGHLANI 91
Query: 80 PVGAGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
P A +PVTE L LKELGYSTH IGKWH+G + P RGFDN G
Sbjct: 92 PHSKHSANDPELLGLPVTEITLADSLKELGYSTHCIGKWHLG-EADHFHPNARGFDNFYG 150
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYA-----PQMSSKYLTDFFTDQSVHVIKSHN 189
+ +G TY +G + R +M+R + SS Y T+ FT +++ +I+
Sbjct: 151 FLSGARTY---------FLGGELRGDMDRIMRNKEFAEPSSGYTTEVFTQEAIRIIQ-EE 200
Query: 190 HSRPLFLQITHAAVH 204
+P F+ ++H AVH
Sbjct: 201 QDKPFFIYLSHNAVH 215
>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 473
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 56/117 (47%), Positives = 76/117 (64%), Gaps = 2/117 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW+DVG+H +++I TPNID LA G+ L +Y P CTPSRA +TG+Y G+ V
Sbjct: 35 GWHDVGYH-DSEIQTPNIDMLAAEGVKLENYYVTPLCTPSRAVLMTGRYLIHSGMQHGVL 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
A + +P E LLPQ LK+ GYSTH++GKWH+G K + P +RGFD G++N
Sbjct: 94 VAQNPRCLPTDEILLPQMLKDSGYSTHMVGKWHLGFCKFQCTPNHRGFDTFFGWYNA 150
>gi|86141258|ref|ZP_01059804.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
gi|85831817|gb|EAQ50272.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
Length = 461
Score = 114 bits (284), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 68/183 (37%), Positives = 98/183 (53%), Gaps = 10/183 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWND FHG ++I TPN+D LA G+ L+R YT PTC+P+RA+ LTG+ R GI P+
Sbjct: 50 GWNDFSFHG-SEIQTPNLDQLAGKGLTLDRFYTYPTCSPARASLLTGRPASRMGIVAPIS 108
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+P + LPQ L +L Y T L+GKWH+G K E P GFD G+ +G L
Sbjct: 109 GRSELNLPDSITTLPQALSKLNYKTALMGKWHLGL-KPESGPEVYGFDFSYGFLHGQLDQ 167
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
++ + R +S K ++TD T +VH I + + +LQ+ ++
Sbjct: 168 YAHTYK-------NGDSTWYRNGKFISEKGHVTDLLTQSAVHYIDTLQTDQNFYLQVAYS 220
Query: 202 AVH 204
A H
Sbjct: 221 APH 223
>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 474
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 72/234 (30%), Positives = 119/234 (50%), Gaps = 36/234 (15%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG---- 76
QGW DVGF+G DIPTPN++ALA +G++ ++ Y+ P C+PSRA LTG+Y ++G
Sbjct: 37 QGWGDVGFNGATDIPTPNLNALAKDGVIFSQGYSSHPYCSPSRAGLLTGRYQQKFGHENN 96
Query: 77 -------IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
DT +G +P+ E ++ + L++ Y T IGKWH+G N + LP RGF
Sbjct: 97 PENEKQNEDTVIG------LPLNELMISEVLQQNNYHTCAIGKWHLG-NAHKFLPNQRGF 149
Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN 189
+ G+ G Y + +G+ + P+ + YLTD F++Q+++ I ++
Sbjct: 150 KDWFGFSGGGFNYWGKTTPKNKELGV---MKNGKPVPENTLTYLTDDFSNQAINYIDQYS 206
Query: 190 HS-RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +P F+ + + A H + +E HI N +R +A
Sbjct: 207 KTEQPFFMYLAYNAPHA-------------PIQATKEYTNLVTHIENGERAAYA 247
>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
Length = 330
Score = 113 bits (282), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 70/192 (36%), Positives = 99/192 (51%), Gaps = 23/192 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ND+G + D+ TPN+DALA G++L +Y C+PSR A +TG+YP + + V
Sbjct: 50 GYNDLGLR-DPDVITPNMDALASKGVILTNNYVQAVCSPSRHALMTGRYPSASAMQSIVI 108
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------- 134
+ AK + K LPQYLK+LGY H+IGKWH+G +EE LP +RGFD G
Sbjct: 109 QPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRGFDTFYGLYASSGD 168
Query: 135 YW-NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
YW +G + D T+ V +AR + D ++ V H++ P
Sbjct: 169 YWEHGIMGMYD--WHTEAGVDFEAR-----------GTHAQDLEIERLDKVFDEHDNKDP 215
Query: 194 LFLQITHAAVHT 205
LFL HT
Sbjct: 216 LFLYFAPQNSHT 227
>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 491
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/221 (34%), Positives = 110/221 (49%), Gaps = 22/221 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWND + G +DI TPNID LA GI L ++Y P C+PSR+A L GKYP+ G+ V
Sbjct: 35 GWNDTSYQG-SDIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSALLAGKYPYHLGLAHGVI 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + + E + +LK+ GYSTH +GKW +G +K E P RGFD GY++
Sbjct: 94 TNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTPTYRGFDTFYGYYDA--- 150
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
++ + LD R N + + Y T FT I + + S P F+ +
Sbjct: 151 -DEDYYTHKVGGYLDFRNNTDPVKDE-DGTYSTFLFTKAIEDAINAKSDS-PFFIYGAYQ 207
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH G L+ PD+ N +I P+R++F
Sbjct: 208 SVH-----------GPLEAPDIYLNK---CNIPYPNRKIFC 234
>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
Length = 602
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 100/185 (54%), Gaps = 4/185 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G++D+G+ D+ +PNIDALA + + L +HY P+CTPSRAAFLTG+Y R G+ + V
Sbjct: 56 GFDDLGYVNR-DVISPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRYNIRMGMQSGVI 114
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY-- 139
A + +P+ E LL + K+ GY T L GKWH+G + P NRGFD G++ G
Sbjct: 115 RAPEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQNRGFDRFYGFYLGSQD 174
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
++DS + D + + Y T F D ++ + H+ + PLF ++
Sbjct: 175 FYFHDSGRLEAYPGNGDVENDTILDDFHTNGTYSTKLFVDDFINDLAKHDPAVPLFNYVS 234
Query: 200 HAAVH 204
VH
Sbjct: 235 FQDVH 239
>gi|405952520|gb|EKC20320.1| Arylsulfatase B [Crassostrea gigas]
Length = 500
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 73/221 (33%), Positives = 113/221 (51%), Gaps = 28/221 (12%)
Query: 1 IDTPVGAGVAKAVPVTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCT 60
I +GA K + + G ND+G++ ++ TPN+D LA NG++L +Y P C+
Sbjct: 19 IQLSLGAANQKPNIIFIAVDDMGNNDIGYNNP-EVDTPNLDNLANNGVILESNYVYPVCS 77
Query: 61 PSRAAFLTGKYPFRYGIDT-PVGAGVAKAVP-----VTEKLLPQYLKELGYSTHLIGKWH 114
PSRAAF+TG+Y + G PV + V EKL Y GY+ H+IGKWH
Sbjct: 78 PSRAAFMTGRYAHKIGFQRGPVEHKQPAYIESNYKTVAEKLTTNY----GYAAHMIGKWH 133
Query: 115 IGCNKEELLPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSS- 170
+G K+ + P NRGFD+ G++ G Y TY + ++ D R N+ PQ +
Sbjct: 134 LGYCKDAVTPTNRGFDSFYGFYGGQENYYTYTSARYK-------DFRDNLTAVTPQNPNY 186
Query: 171 ------KYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
Y T + +++ ++ +H+ S PLFL + A H+
Sbjct: 187 PREDVDGYSTFEYKKRAIEIVGNHDKSVPLFLYLAFQAPHS 227
>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 478
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 79/232 (34%), Positives = 115/232 (49%), Gaps = 36/232 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--T 79
G+ D GF G DI TPN+D LA +G++ N+ Y T C PSRA FL G+Y R+G + T
Sbjct: 33 GYADAGFTGATDILTPNLDKLAESGVIFNQGYVTHAFCGPSRAGFLAGRYQHRFGFEHNT 92
Query: 80 PVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
P A + V E L P L+++GY+T +IGKWH+G + P NRGFD Y+ G
Sbjct: 93 PYDPANPLAGIDVRETLFPARLQDVGYTTGIIGKWHLGAS-SPFYPLNRGFD----YFYG 147
Query: 139 YLTYNDSIHETD--------FAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH 190
+LT E D + GL + + + YLT + +V + + N
Sbjct: 148 FLTGGHDYFEIDVTQPVKSAYLQGLFRNKRVANF-----EGYLTTALSRDAVQFV-NDNK 201
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
P FL +++ A H LQ P +E+ +AHI + RR++A
Sbjct: 202 ENPFFLFLSYNAPHQP-----------LQAP--QEDIARYAHIKDKKRRVYA 240
>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 489
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG +DI TPNID LA +VL+R Y P C+P+RA LTG YPFR+GI V
Sbjct: 57 GWNDVGFHG-SDIRTPNIDRLARESVVLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 115
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ K +P + P++L +LGY + GKWH+G P G G++NG +
Sbjct: 116 SPTKKHGLPSELETTPEHLAKLGYDHRAMFGKWHLGLASTLFHPLRHGMTEFYGHYNGAI 175
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y F LD RN + + Y T+ + V I H +PL+ +
Sbjct: 176 DY---FSRERFGQ-LDWHRNHDSVHEE---GYSTELVGNAVVDFIDRHAGQQPLYAYVAF 228
Query: 201 AAVHT 205
A H+
Sbjct: 229 NAPHS 233
>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1517
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 82/259 (31%), Positives = 120/259 (46%), Gaps = 56/259 (21%)
Query: 23 GWNDVGF-HGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---- 77
G++DVG+ + + + TP+IDALA G+ L+R+Y+ +CTP+R A LTG P R G+
Sbjct: 82 GFDDVGYGNADGAVATPHIDALAKEGVTLSRYYSAFSCTPARGALLTGLSPHRLGLQHGQ 141
Query: 78 ---DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+ P G +P +LPQ+L +LGY +HL+GKWH+G E LP RGFD+ G
Sbjct: 142 VFPEQPWG------LPSKFSILPQHLAKLGYRSHLVGKWHLGHFSAERLPTARGFDSFFG 195
Query: 135 YWNGYLTYNDSIHETD-----------FAVG---------------LDARRNMERYAPQM 168
+G Y I D F VG D R N +R
Sbjct: 196 GLDGAQYYATHIDAMDCKLPGDVLYRGFEVGDYDSLKAVTAEHGCYFDLRENNDRVEDLF 255
Query: 169 SSKYLTDFFTDQSVHVIKSHNH-----SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDM 223
S Y T F ++ +I +H+ +PLFL ++ AVH + D
Sbjct: 256 GS-YSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVH----------APVWAPEDT 304
Query: 224 EENDRTFAHISNPDRRLFA 242
E +++N +RR FA
Sbjct: 305 YETHPDLLNVTNGNRRKFA 323
>gi|298706368|emb|CBJ29377.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 653
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 73/117 (62%), Gaps = 2/117 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GW DVGFH + TPN+DA+ G+ L+ YT PTCTPSRA +TG+Y +R G+ D+ +
Sbjct: 48 GWKDVGFH-DTTFSTPNLDAMVAEGVELSTFYTAPTCTPSRAQLMTGRYSYRIGMQDSVL 106
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ VP+TE + + L+ GYST +GKWH+G + + LP RGFD+ G G
Sbjct: 107 HTTEPRGVPLTETFVGEKLQAAGYSTAAVGKWHLGMHMPQFLPVERGFDDFYGILTG 163
>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
Length = 474
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 72/198 (36%), Positives = 99/198 (50%), Gaps = 25/198 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
G+ D GF G +IPTPN+D LA G+V + Y + C PSRA LTGKYP R+G +
Sbjct: 47 GYADFGFQGSTEIPTPNLDQLAQEGVVFKQAYVSASVCGPSRAGLLTGKYPQRFGFEENN 106
Query: 79 -----TPVGA-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
+ GA G + + + + YL E GY T LIGKWH G N++ P RGFD
Sbjct: 107 VPGYMSSSGATGDDMGMRLDQLTMANYLAERGYRTSLIGKWHQG-NEDRFHPLKRGFDEF 165
Query: 133 VGYWNGYLTY------NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK 186
G+ G +Y + S DF RN Y S YLTD ++++ IK
Sbjct: 166 FGFRGGARSYFPFTQAHPSSRREDF-----LERNFNNYGE--SPLYLTDALANETIEFIK 218
Query: 187 SHNHSRPLFLQITHAAVH 204
+ H +P F ++ +A H
Sbjct: 219 RNKH-QPFFTFLSLSAPH 235
>gi|149199999|ref|ZP_01877025.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149136872|gb|EDM25299.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 512
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 107/209 (51%), Gaps = 31/209 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGI-DTP 80
G++DVG+HG I TPNID++A G+ ++ Y + C PSRA LTG Y R+G + P
Sbjct: 32 GYDDVGYHGNKRIITPNIDSIAEQGVQFSQGYVSASVCGPSRAGLLTGVYQQRFGCGENP 91
Query: 81 VGAGV-------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G+G +P ++ ++ + LK LGY+ +IGKWH+G + L P RG+D
Sbjct: 92 NGSGYPNQMKYPMAGLPQSQSMISEELKTLGYTNGMIGKWHMGFDM-SLRPNQRGYDFFY 150
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNME-------RYAPQMSSK--------YLTD 175
G+ NG Y + E FA G RN E +Y K YLTD
Sbjct: 151 GFINGSHDYTEWTQE--FAKGKSRWPIFRNEEMEPANKAQYIDVFKEKGVKVVDENYLTD 208
Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
FTD++V+ I N +P FL + + AVH
Sbjct: 209 LFTDEAVNFI-DRNADKPFFLYLAYNAVH 236
>gi|326801926|ref|YP_004319745.1| Cerebroside-sulfatase [Sphingobacterium sp. 21]
gi|326552690|gb|ADZ81075.1| Cerebroside-sulfatase [Sphingobacterium sp. 21]
Length = 454
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 106/206 (51%), Gaps = 6/206 (2%)
Query: 6 GAGVAKAVPVTEKLLP--QGWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPS 62
G+G A+ P +L G++D+G +G I TP +D++A NG+ + T P+CTPS
Sbjct: 18 GSGSAQERPNIILVLADDMGYSDLGCYGSPSISTPFLDSMAANGVRATDFMVTSPSCTPS 77
Query: 63 RAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
RA+ LTG+Y RY + P+G G +P E + + LKE+GY T L+GKWH+G +
Sbjct: 78 RASLLTGRYASRYNLPDPIGPGSTLGLPDEEITIAEMLKEVGYRTALVGKWHLGDKHDFN 137
Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
P +GFD+ G + Y D +TD + + RN + + L+ ++++ +
Sbjct: 138 YPTGQGFDSFFGMLYSH-DYRDPYVKTDTTIKI--FRNPKPAIQGPADSNLSRIYSEEVI 194
Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTA 208
IK +P FL H H A
Sbjct: 195 RFIKEQRKDQPFFLYYAHNMPHLPVA 220
>gi|423294191|ref|ZP_17272318.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
CL03T12C18]
gi|392676448|gb|EIY69884.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
CL03T12C18]
Length = 458
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 66/185 (35%), Positives = 95/185 (51%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP +DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETVADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++H I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIHCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 536
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 103/199 (51%), Gaps = 11/199 (5%)
Query: 23 GWNDVGFHGE----NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
G+ DV ++G+ N + TP +D LA +GI L R Y+ CTP+RAA LTG+YP G+
Sbjct: 44 GYGDVSYNGDGSLTNAVATPYLDRLAADGITLTRFYSQCDCTPARAALLTGRYPSNTGMQ 103
Query: 79 TPVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
V ++ ++P LLP L E GY H IGKW +G + P RGFD+H+GY+
Sbjct: 104 HEVVTAQSQWSLPHEFALLPSALPE-GYRKHAIGKWDVGHARAADTPTARGFDSHLGYYG 162
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSS---KYLTDFFTDQSVHVIKSHNHSRPL 194
+TY++ H + R+M +++ +Y T F D ++ ++ L
Sbjct: 163 AEITYDE--HAALRSCSNGTIRDMNHDGATLAATEDRYSTHLFADHAMALVDREADEYKL 220
Query: 195 FLQITHAAVHTGTAGNAKL 213
FL + AVH A +A L
Sbjct: 221 FLYLCFQAVHQPLAADAAL 239
>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 614
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 115/231 (49%), Gaps = 25/231 (10%)
Query: 23 GWNDVGFHGENDIP--TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-T 79
G+NDVG+ +D+ TP +D L +G+ ++R Y CTPSRAA LTGK P +
Sbjct: 74 GFNDVGY-ASSDLGEMTPFLDGLMADGVRVDRLYGQQVCTPSRAAMLTGKLPIHLELQHW 132
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
V +P E L QYLK LGYSTH++GKWH+G P NRGFD+ G+++G
Sbjct: 133 QVAPSEPWGLPTREATLAQYLKALGYSTHMVGKWHLGHYNNASTPLNRGFDSFYGFYSGG 192
Query: 140 LTYNDSIHETDFAVGLDARRNM---ERYAPQMSSKYLTDFFTDQSVHVIKSH---NHSRP 193
+ Y H+ R++ ER ++ T ++++ V++ H S P
Sbjct: 193 VDY--LTHDPSTGYVWRCYRDLWDDERPVTDAHGQHQTSLMNERAIAVLERHAVEKKSEP 250
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPD--MEENDRTFAHISNPDRRLFA 242
+F +++ NA LP LQ P +E + T I N DR+ FA
Sbjct: 251 VFAYVSYP--------NAHLP---LQPPTELLERRNATLLDIPNHDRKNFA 290
>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
Length = 449
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 92/170 (54%), Gaps = 8/170 (4%)
Query: 43 LAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQYLK 101
LA G+ L +Y P C+PSR +TG+Y RYG+ + + + +P+ E LPQ L+
Sbjct: 2 LASEGVKLENYYIQPICSPSRCQLMTGRYQIRYGLQHSVITSDRPHGLPLDEVTLPQKLR 61
Query: 102 ELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG---YLTY---NDSIHETDFAVGL 155
E GY ++++GKWH+G ++E +P RGFD GY G Y T+ N + GL
Sbjct: 62 ENGYRSYIVGKWHLGFFRKEYMPLQRGFDRFYGYLTGGEDYWTHRRPNGYARDPSAFHGL 121
Query: 156 DARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
D R+ ++ + Y T F +++ I SH S+P+FL + AVH+
Sbjct: 122 DL-RDQDKPVLDQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHS 170
>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 826
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 105/190 (55%), Gaps = 12/190 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
G++DVGF+G +IPTP++D LA +G+V Y + P C+PSRA LTG++ R+G
Sbjct: 56 GYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHESNP 115
Query: 77 -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
DT +P++E L LKE GY T IGKWH+G + + P +RGFD G+
Sbjct: 116 EPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNHRGFDEWFGF 174
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G +Y + + D +G+ R E P+ + +LTD F+ ++V I+ H S P F
Sbjct: 175 SGGGFSYWGDLGKKDPLLGV--HRGDEPVDPKTLT-HLTDDFSTEAVKFIQRHE-SEPFF 230
Query: 196 LQITHAAVHT 205
L + + A H
Sbjct: 231 LYLAYNAPHA 240
>gi|441597518|ref|XP_003266414.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Nomascus
leucogenys]
Length = 431
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 56/128 (43%), Positives = 79/128 (61%), Gaps = 5/128 (3%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG++DVG+HG +DI TP +D LA G+ L +Y P CTPSR+ LTG+Y G+ +
Sbjct: 57 QGYHDVGYHG-SDIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSI 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P+ + LPQ L+E GYSTH++GKWH+G ++E LP RGFD G G
Sbjct: 116 IRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFXGSLTGNV 175
Query: 139 -YLTYNDS 145
Y TY++
Sbjct: 176 DYYTYDNC 183
>gi|417303628|ref|ZP_12090677.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327540049|gb|EGF26644.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 826
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 104/190 (54%), Gaps = 12/190 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
G++DVGF+G +IPTP++D LA +G+V Y + P C+PSRA LTG++ R+G
Sbjct: 56 GYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHESNP 115
Query: 77 -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
DT +P+TE L LKE GY T IGKWH+G + + P RGFD G+
Sbjct: 116 EPDTQWHGEDTPGMPLTETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGF 174
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G +Y + + D +G+ R E P+ + +LTD F+ ++V I+ H + P F
Sbjct: 175 SGGGFSYWGDLGKKDPLLGV--HRGDEPVDPKTLT-HLTDDFSTEAVKFIQRHE-TEPFF 230
Query: 196 LQITHAAVHT 205
L + + A H
Sbjct: 231 LYLAYNAPHA 240
>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
Length = 581
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 99/185 (53%), Gaps = 4/185 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G++D+G+ D+ +PNIDALA + + L +HY P+CTPSRAAFLTG+Y R G+ + V
Sbjct: 35 GFDDLGYVNR-DVISPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRYNIRMGMQSGVI 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY-- 139
A + +P+ E LL + K+ GY T L GKWH+G + P RGFD G++ G
Sbjct: 94 RATEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQIRGFDRFYGFYLGSQD 153
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
++DS + D + + Y T F D ++ + H+ + PLF ++
Sbjct: 154 FYFHDSGRLKAYPGNGDVENDTILDDLHTNGTYSTKLFVDDFINDLAKHDPAVPLFNYVS 213
Query: 200 HAAVH 204
VH
Sbjct: 214 FQDVH 218
>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
Length = 529
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 102/184 (55%), Gaps = 5/184 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+NDVGF N I +PN+DALA +GI+L YT P C+PSR +FL+G+Y ++ + V
Sbjct: 15 GFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGVI 73
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + + +LP YLKELGY TH GKWH+G ++E P +RGFD+ G ++G
Sbjct: 74 LDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEGE 133
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
+ + T ++ A S+ L ++ +++ + ++ S PLF+ +
Sbjct: 134 FYEHTTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKT---LDEYDQSSPLFMYVAFH 190
Query: 202 AVHT 205
VH+
Sbjct: 191 NVHS 194
>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
Length = 382
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 102/184 (55%), Gaps = 5/184 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+NDVGF N I +PN+DALA +GI+L YT P C+PSR +FL+G+Y ++ + V
Sbjct: 32 GFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGVI 90
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + + +LP YLKELGY TH GKWH+G ++E P +RGFD+ G ++G
Sbjct: 91 LDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEGE 150
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
+ + T ++ A S+ L ++ +++ + ++ S PLF+ +
Sbjct: 151 FYEHTTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKT---LDEYDQSSPLFMYVAFH 207
Query: 202 AVHT 205
VH+
Sbjct: 208 NVHS 211
>gi|323451693|gb|EGB07569.1| hypothetical protein AURANDRAFT_2707, partial [Aureococcus
anophagefferens]
Length = 351
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 99/209 (47%), Gaps = 27/209 (12%)
Query: 23 GWNDVGFH----GENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
G NDVG+ G I +P IDALA +VL+R Y P CTP+RAA +TG++ +R G+
Sbjct: 22 GRNDVGYAHRDGGPGRIASPRIDALAAESLVLDRFYAQPMCTPTRAALMTGRHAYRTGLA 81
Query: 79 TPVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
V A +P E + + L++ GYSTH+IGKWH+G K+ +LP +RGFD GY
Sbjct: 82 YFVLLANQGTGLPAAEVTVAERLRDAGYSTHMIGKWHLGFAKKAMLPTSRGFDRFFGYCL 141
Query: 138 GYLTY--------NDSIHETDFAVG-------------LDARRNMERYAPQMSSKYLTDF 176
G Y + TD A G D + R P+ + + D
Sbjct: 142 GSSDYWLHQSPEWVPGVPSTDRATGAEPPTTGGMGHDLWDGATPLPR-TPKTENVHSADL 200
Query: 177 FTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
F ++ + +PLFL A H
Sbjct: 201 FAARATETFATAPRDKPLFLYYASQAPHA 229
>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
Length = 467
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 102/200 (51%), Gaps = 29/200 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDT-P 80
G+ D +G +IPTPNI+ LA G + ++ Y + C PSRA LTG+Y R+G + P
Sbjct: 39 GYADFECYGNKEIPTPNINRLAKEGTLFSKAYVSASVCAPSRAGLLTGRYQQRFGFENNP 98
Query: 81 VGA---GVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
G G K + ++EK + +KE GY T +GKWH+G N + P RGFD G
Sbjct: 99 TGKPREGFKKEDMGLALSEKTIGDRMKEEGYRTLAVGKWHLG-NDAKFFPLKRGFDEFYG 157
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYA--------PQMSSKYLTDFFTDQSVHVIK 186
+ G H F+ ++ E+YA P+ YLTD FTD+++ I
Sbjct: 158 FQEG--------HRDFFSF---KKKRAEKYALWDNDKIIPEEEITYLTDMFTDKALKFID 206
Query: 187 SH-NHSRPLFLQITHAAVHT 205
+ + +P F+ + + AVHT
Sbjct: 207 ENADKKQPFFIYLAYNAVHT 226
>gi|295086308|emb|CBK67831.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
XB1A]
Length = 458
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 96/185 (51%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP++DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E + Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETCHDK---GYSTELITQEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|262405390|ref|ZP_06081940.1| arylsulfatase B [Bacteroides sp. 2_1_22]
gi|262356265|gb|EEZ05355.1| arylsulfatase B [Bacteroides sp. 2_1_22]
Length = 458
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 95/185 (51%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP++DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITQEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|449137658|ref|ZP_21772978.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448883711|gb|EMB14224.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 810
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 99/190 (52%), Gaps = 12/190 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
G++DVGF+G +IPTP +D LA G+V Y + P C+PSRA LTG+Y R+G
Sbjct: 40 GYSDVGFNGCKEIPTPRLDELAGEGVVFTNGYASHPYCSPSRAGLLTGRYQQRFGHEGNP 99
Query: 77 -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D +P++E L LKE GY T IGKWH+G + + P RGFD G+
Sbjct: 100 EPDPQWHGDDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGF 158
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G L+Y + D +G+ + + S YLTD F+ ++V I+ H + P F
Sbjct: 159 SGGGLSYWGDLGRMDPLLGV---HRGDEPVDRKSLTYLTDDFSTEAVKFIQRH-ETDPFF 214
Query: 196 LQITHAAVHT 205
L + + A H
Sbjct: 215 LYLAYNAPHA 224
>gi|313241546|emb|CBY33792.1| unnamed protein product [Oikopleura dioica]
Length = 336
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 96/184 (52%), Gaps = 26/184 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
GW+DV ++ + TP + L + I L+ Y+ CTPSRA+ LTGKY +R+G+ T P+
Sbjct: 41 GWSDVSWNNKKIKATPFLGQLEKHSITLSSSYSTHRCTPSRASLLTGKYAWRFGLGTDPI 100
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A A + + EKLLP+ L++ GYSTH +GKWH+G LP NRGFD G+ G L
Sbjct: 101 DANTAAGLDLKEKLLPEILRKNGYSTHHVGKWHLGHCNSSYLPHNRGFDTFYGHTGGVLN 160
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH-------VIKSHNHSRPL 194
Y + + AVG + KYL F D +H +H+R L
Sbjct: 161 Y----FQHNRAVG--------------NCKYLDYFENDTPIHEKTGVYSTFDFGDHARKL 202
Query: 195 FLQI 198
+ +I
Sbjct: 203 YNKI 206
>gi|354581367|ref|ZP_09000271.1| sulfatase [Paenibacillus lactis 154]
gi|353201695|gb|EHB67148.1| sulfatase [Paenibacillus lactis 154]
Length = 446
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 101/187 (54%), Gaps = 6/187 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G + + TPN+D+LA GI Y+ P C+PSRA+ LTGKYP R G+ +
Sbjct: 22 GYGDLGCYGSDTVTTPNLDSLAGEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEIL 81
Query: 82 GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
GA +P TE L + LK GY T L GKWH+G + EE P GFD G+ G +
Sbjct: 82 GAKRGLDGLPSTEVTLAKALKPAGYRTALYGKWHLGVS-EETSPNAHGFDEFFGFKAGCI 140
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIK-SHNHSRPLFLQ 197
+ I G++ ++ ++ + +Y+T+ T++SV IK S P FL
Sbjct: 141 DFYSHIFYWGQGHGVNPLHDLWENETEVWENGRYMTELITERSVDFIKRSREQEDPFFLF 200
Query: 198 ITHAAVH 204
+++ A H
Sbjct: 201 VSYNAPH 207
>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
Length = 621
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 101/208 (48%), Gaps = 21/208 (10%)
Query: 40 IDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTEKLLPQ 98
+D LA G+ L +Y P CTPSR+ F+TGKY G+ + + +P+ LPQ
Sbjct: 80 LDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 139
Query: 99 YLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIH-ETDFAVGLDA 157
LKE+GYSTH++GKWH+G ++E +P RGFD G G Y ++ G D
Sbjct: 140 KLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCDSPGMCGYDL 199
Query: 158 RRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGL 217
N + Y T +T + ++ SH+ +P+FL I + AVH+
Sbjct: 200 YENDNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----------P 248
Query: 218 LQVPDMEENDRTFAH---ISNPDRRLFA 242
LQ P R F H I N +RR +A
Sbjct: 249 LQAP-----GRYFEHYRSIININRRRYA 271
>gi|345510992|ref|ZP_08790548.1| arylsulfatase B [Bacteroides sp. D1]
gi|229442597|gb|EEO48388.1| arylsulfatase B [Bacteroides sp. D1]
Length = 498
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 95/185 (51%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP++DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 77 GWGDVGFHG-SEIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 135
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 136 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 195
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 196 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITQEAIRCIDAYEKEGPFMLYVAY 248
Query: 201 AAVHT 205
A HT
Sbjct: 249 NAPHT 253
>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 826
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 105/190 (55%), Gaps = 12/190 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
G++DVGF+G +IPTP++D LA +G+V Y + P C+PSRA LTG++ R+G
Sbjct: 56 GYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHESNP 115
Query: 77 -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
DT +P++E L LKE GY T IGKWH+G + + P +RGFD G+
Sbjct: 116 EPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNHRGFDEWFGF 174
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G +Y + + D +G+ R E P+ + +LTD F+ ++V I+ N + P F
Sbjct: 175 SGGGFSYWGDLGKKDPLLGV--HRGDEPVEPKTLT-HLTDDFSTEAVKFIQ-RNETEPFF 230
Query: 196 LQITHAAVHT 205
L + + A H
Sbjct: 231 LYLAYNAPHA 240
>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
Length = 551
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 102/184 (55%), Gaps = 5/184 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+NDVGF N I +PN+DALA +GI+L YT P C+PSR +F++G+Y ++ + V
Sbjct: 37 GFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFMSGRYSYKSAMQHGVI 95
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + + +LP YLKELGY TH GKWH+G ++E P +RGFD+ G ++G
Sbjct: 96 LDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEGE 155
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
+ + T ++ A S+ L ++ +++ + ++ S PLF+ +
Sbjct: 156 FYEHTTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKT---LDEYDQSSPLFMYVAFH 212
Query: 202 AVHT 205
VH+
Sbjct: 213 NVHS 216
>gi|365877064|ref|ZP_09416570.1| Cerebroside-sulfatase [Elizabethkingia anophelis Ag1]
gi|442586911|ref|ZP_21005733.1| Cerebroside-sulfatase [Elizabethkingia anophelis R26]
gi|365755338|gb|EHM97271.1| Cerebroside-sulfatase [Elizabethkingia anophelis Ag1]
gi|442563318|gb|ELR80531.1| Cerebroside-sulfatase [Elizabethkingia anophelis R26]
Length = 454
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 61/183 (33%), Positives = 96/183 (52%), Gaps = 4/183 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIV-LNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G I TP +D ++ NG++ N + PTCTPSRA+ LTG+Y RY + P+
Sbjct: 37 GYADIGAYGNPVIKTPFLDQMSRNGLMATNYVVSSPTCTPSRASMLTGRYSSRYDLPWPI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + +P E + + LK GY+T ++GKWH+G K E P +GFD + G +
Sbjct: 97 APGSKQGLPDDEVTIAEMLKANGYNTGMVGKWHLGDQKAENKPNGQGFDFYYGILYSH-D 155
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y TD + + +E P + LT +T +S++ I+ +P FL + H
Sbjct: 156 YKAPYVNTDIPIRMFRNTKVEIEKP--ADSLLTRLYTKESINYIRQQKKDKPFFLYLAHN 213
Query: 202 AVH 204
H
Sbjct: 214 MPH 216
>gi|440716880|ref|ZP_20897383.1| arylsulfatase [Rhodopirellula baltica SWK14]
gi|436438073|gb|ELP31649.1| arylsulfatase [Rhodopirellula baltica SWK14]
Length = 616
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/187 (36%), Positives = 93/187 (49%), Gaps = 24/187 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW D+ HG I TP +DALA L+R Y P C P+RAA LTG+YP R G+
Sbjct: 72 QGWGDLAAHGNPKISTPTLDALANESARLDRFYVSPVCAPTRAALLTGRYPERSGV---- 127
Query: 82 GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
AGV + + E L + + GY T GKWH G + L P +GFD G+ G
Sbjct: 128 -AGVTGRREVMRAEETTLAELYRSAGYVTGCFGKWHNGA-QMPLHPNGQGFDEFFGFCGG 185
Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y+D++ E RN P ++ Y+TD TD +V I++H H RP F
Sbjct: 186 HFNLYDDALLE----------RN---GTPVQTNGYITDVLTDAAVDFIQNH-HDRPFFCY 231
Query: 198 ITHAAVH 204
+ A H
Sbjct: 232 VPFNAPH 238
>gi|417303299|ref|ZP_12090357.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327540271|gb|EGF26857.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 616
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/187 (36%), Positives = 93/187 (49%), Gaps = 24/187 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW D+ HG I TP +DALA L+R Y P C P+RAA LTG+YP R G+
Sbjct: 72 QGWGDLAAHGNPKISTPTLDALANKSARLDRFYVSPVCAPTRAALLTGRYPERSGV---- 127
Query: 82 GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
AGV + + E L + + GY T GKWH G + L P +GFD G+ G
Sbjct: 128 -AGVTGRREVMRAEEITLAELYRSAGYVTGCFGKWHNGA-QMPLHPNGQGFDEFFGFCGG 185
Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y+D++ E RN P ++ Y+TD TD +V I++H H RP F
Sbjct: 186 HFNLYDDALLE----------RN---GTPVQTNGYITDVLTDAAVEFIQNH-HDRPFFCY 231
Query: 198 ITHAAVH 204
+ A H
Sbjct: 232 VPFNAPH 238
>gi|300773469|ref|ZP_07083338.1| cerebroside-sulfatase [Sphingobacterium spiritivorum ATCC 33861]
gi|300759640|gb|EFK56467.1| cerebroside-sulfatase [Sphingobacterium spiritivorum ATCC 33861]
Length = 449
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 98/194 (50%), Gaps = 11/194 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G++D+G +G I TP +D +A G+ + T P+CTPSRA+ LTG+Y RY + P+
Sbjct: 23 GYSDLGCYGNPSIATPFLDKMAAKGVRATDYMVTSPSCTPSRASLLTGRYASRYNLPDPI 82
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL- 140
G G +P E + + LKE GY T LIGKWH+G + E LP +GFD Y+ G L
Sbjct: 83 GPGAKNGLPAQEVTIAEMLKEKGYRTALIGKWHLG-DHGEYLPNKQGFD----YFYGMLY 137
Query: 141 --TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y D +TD + + RN + + L+ +T++ I P FL
Sbjct: 138 SHDYRDPYVKTDTTIKI--FRNQTPVVTRPADSALSRIYTEEVKQYISQQKKGEPFFLYY 195
Query: 199 THAAVHTGTAGNAK 212
H H A +A+
Sbjct: 196 AHNMPHLPVAFSAE 209
>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 110/223 (49%), Gaps = 26/223 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWND + G +DI TPNID LA GI L ++Y P C+PSR+A L GKYP+ G+ V
Sbjct: 35 GWNDTSYQG-SDIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSALLAGKYPYHLGLAHGVI 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G + + E + +LK+ GYSTH +GKW +G +K E P RGFD GY++
Sbjct: 94 TNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTPTYRGFDTFYGYYDA--- 150
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
++ + LD R N + + Y T FT I + + S P F+ +
Sbjct: 151 -DEDYYTHKVGGYLDFRNNTDPVKDE-DGTYSTFLFTKAIEDAINAKSDS-PFFIYGAYQ 207
Query: 202 AVHTGTAGNAKLPTGLLQVPD--MEENDRTFAHISNPDRRLFA 242
+VH+ L+ PD +E+ H P+R++F
Sbjct: 208 SVHSP-----------LEAPDTYLEK-----CHSPYPNRKIFC 234
>gi|410611985|ref|ZP_11323071.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
gi|410168398|dbj|GAC36960.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
Length = 508
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 95/184 (51%), Gaps = 14/184 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G I TPNID +A G + Y P CTPSRA LTG+YP R GI
Sbjct: 61 GYGDIGAYGSTTINTPNIDKMAAQGAKFDEFYAASPVCTPSRAGLLTGRYPIRQGIHNVF 120
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ + E + + LK GY+T L+GKWH+G + E+ +P+N+GFD G L
Sbjct: 121 FPESFQGMDPEEITIAEVLKGAGYATGLVGKWHLG-HHEQYMPWNQGFDEFFG-----LP 174
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y++ + GL N + ++ +Y+T +TDQ++ I H +P FL + H
Sbjct: 175 YSNDMG------GLYYFNNKDIDFEEVDQRYMTKTYTDQALQFIDKH-QEQPFFLYLAHN 227
Query: 202 AVHT 205
H
Sbjct: 228 MPHV 231
>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 534
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/224 (33%), Positives = 109/224 (48%), Gaps = 25/224 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DVG+H ++ I TPNID LA G+ L +Y C PSR +TG+Y R G+
Sbjct: 80 GWHDVGYH-DSVIKTPNIDQLAAEGVKLENYYVSSWCAPSRVNLMTGRYRIRTGL----Y 134
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-GCNKEELLPFNRGFDNHVGYWNG--- 138
V + + E L L E GY T ++GKWH+ G E P +RGF +GY G
Sbjct: 135 GDVCDFMGIHETTLADKLYEAGYYTAMVGKWHLSGFEHAECYPTHRGFQTFLGYHGGSQN 194
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y T+ + D N A + +Y T F D++ +I+ HN +PLFL +
Sbjct: 195 YFTHRRGGPHAPY----DFWANDTSIAVKYEGQYSTMIFADEAQRIIRQHNTKQPLFLYL 250
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ AVH +P LL P E+ R+ I + RR++A
Sbjct: 251 SFQAVH--------VP--LLVPPSYEDQYRSL--IEDDKRRVYA 282
>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 473
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/226 (29%), Positives = 105/226 (46%), Gaps = 19/226 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGI---- 77
G+ D GF G + TPN+D LA +G+ + Y + TC PSRA +TGKY R+G
Sbjct: 40 GFGDFGFQGSTQLKTPNLDKLAQSGVRFTQGYVSDSTCGPSRAGLMTGKYQQRFGYEEIN 99
Query: 78 ------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
D G +P+ +K + YLKE GY T + GKWH+G + + P RGFD
Sbjct: 100 VPGFMSDNSALKGADMGLPLDQKTMGDYLKEQGYKTAVFGKWHLG-DADRFHPLKRGFDT 158
Query: 132 HVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
+G+ G Y Y++ + D + + + +YLTD ++ I+
Sbjct: 159 FLGFRGGDRSYFNYSEQEMKNGNKHFFDKKLERDFGNYEEPKEYLTDVLGKEAAKYIE-Q 217
Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
N P F+ + AVHT + P L + P++ + A ++
Sbjct: 218 NKDEPFFIYLAFNAVHTPLESD---PKDLAKFPNLTGKRKELAAMT 260
>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 484
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 77/234 (32%), Positives = 112/234 (47%), Gaps = 28/234 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG--IDT 79
G+NDVGF+G DI TPN+D LA +G + Y P C PSRAA LTG+YP G +
Sbjct: 42 GYNDVGFNGSTDITTPNLDQLAQDGTIFTSAYVAHPFCGPSRAALLTGRYPHTLGSQFNL 101
Query: 80 PV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
P GA K + V EK + +++ GY T IGKWH+G E P RGF++ G+ G
Sbjct: 102 PANGASTGKGISVEEKFMGVPMQKAGYYTGAIGKWHLG-ETAEYHPNKRGFNDFYGFLGG 160
Query: 139 YLTYNDSIHETDFAVGLD-ARRNMERY--------APQMSSKYLTDFFTDQSVHVIK-SH 188
Y ++ + + +N+ Y A + YLTD + + + K +H
Sbjct: 161 GHKYFPEEYKLQYKHQKEMGTKNINDYVLPLEHNGAIVEENDYLTDVLSREGIRFTKEAH 220
Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +P FL + + A H K D+E+ F I + DRR +A
Sbjct: 221 DKKKPFFLYLAYNAPHVPLEAKEK---------DLEK----FKDIEDIDRRTYA 261
>gi|383110963|ref|ZP_09931781.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
gi|313694533|gb|EFS31368.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
Length = 458
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 95/185 (51%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP +DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E + Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETCHDK---GYSTELITKEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|227536643|ref|ZP_03966692.1| sulfatase family protein [Sphingobacterium spiritivorum ATCC 33300]
gi|227243444|gb|EEI93459.1| sulfatase family protein [Sphingobacterium spiritivorum ATCC 33300]
Length = 461
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 98/194 (50%), Gaps = 11/194 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G++D+G +G I TP +D +A G+ + T P+CTPSRA+ LTG+Y RY + P+
Sbjct: 35 GYSDLGCYGNPSISTPFLDKMAAKGVRATDYMVTSPSCTPSRASLLTGRYASRYNLPDPI 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL- 140
G G +P E + + LKE GY T LIGKWH+G + E LP +GFD Y+ G L
Sbjct: 95 GPGAKNGLPAQEVTIAEMLKEKGYHTALIGKWHLG-DHGEYLPNKQGFD----YFYGMLY 149
Query: 141 --TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y D +TD + + RN + + L+ +T++ I P FL
Sbjct: 150 SHDYRDPYVKTDTTIKI--FRNQTPVVTRPADSALSRIYTEEVKQYISQQKKGEPFFLYY 207
Query: 199 THAAVHTGTAGNAK 212
H H A +A+
Sbjct: 208 AHNMPHLPVAFSAE 221
>gi|299147176|ref|ZP_07040243.1| arylsulfatase B [Bacteroides sp. 3_1_23]
gi|298515061|gb|EFI38943.1| arylsulfatase B [Bacteroides sp. 3_1_23]
Length = 458
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP +DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
Length = 506
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 111/223 (49%), Gaps = 20/223 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWND+ H + TPN+D L + L +Y P CTP+R+ ++G+Y G+ V
Sbjct: 30 GWNDISLHNSY-LSTPNVDGLIQESLHLQSYYVNPICTPTRSVLMSGRYQIHTGLQHAVI 88
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P+T+ + P+ K+ GY TH++GKWH+G E+++P NRGF++H GY G
Sbjct: 89 LGAQPNGLPLTDPVQPEIFKDCGYRTHMVGKWHLGFYDEKMVPENRGFESHYGYLIGAEG 148
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
+ + G+D R + A SS +Y D F + ++++H+ L++ +
Sbjct: 149 HYNHSQFMQGQNGVDFR---DGGASTNSSWGQYSADLFAKRVEDLVEAHDVEESLYMYVG 205
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
VH L+ P + F+ I + DRR++A
Sbjct: 206 LQNVHYP-----------LEAPQHYVD--QFSWIKDRDRRVYA 235
>gi|423214938|ref|ZP_17201466.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692201|gb|EIY85439.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
CL03T12C04]
Length = 458
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP +DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 472
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 111/232 (47%), Gaps = 36/232 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D GF G DI TPN+D LA NG + Y C PSRAA L+G+Y R+G +T
Sbjct: 37 GYADTGFTGATDIQTPNLDNLAKNGAFFKQGYANHAYCGPSRAALLSGRYQHRFGFETNP 96
Query: 82 GAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
A + V EKL P+ L+E GY T IGKWH+G E P NRGFD Y+ G
Sbjct: 97 AYDPANPHMGIDVGEKLFPKRLQEAGYKTGAIGKWHLGA-AAEFHPLNRGFD----YFYG 151
Query: 139 YLTYNDSIHETDFAVGLDARRNMERY-APQMSSK-------YLTDFFTDQSVHVIKSHNH 190
+L D ++ E Y P + +K YLT ++ + +K N
Sbjct: 152 FLGGGHDYFRID-----GTKKVWEAYLQPLVRNKRADNFEGYLTTALSNDAAQFVKD-NK 205
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
P FL + + A H +P LQ P +E+ +AHI + RR++A
Sbjct: 206 ENPFFLYVAYNAPH--------MP---LQAP--KEDIARYAHIKDNKRRVYA 244
>gi|423290535|ref|ZP_17269384.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
CL02T12C04]
gi|392665922|gb|EIY59445.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
CL02T12C04]
Length = 458
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP +DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|336415441|ref|ZP_08595781.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
3_8_47FAA]
gi|335941037|gb|EGN02899.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
3_8_47FAA]
Length = 458
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP +DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|293372058|ref|ZP_06618453.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
gi|292632962|gb|EFF51547.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
Length = 458
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP +DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|160885330|ref|ZP_02066333.1| hypothetical protein BACOVA_03329 [Bacteroides ovatus ATCC 8483]
gi|156108952|gb|EDO10697.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
Length = 458
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP +DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRGF + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITKEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/187 (34%), Positives = 97/187 (51%), Gaps = 16/187 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP---FRYGIDT 79
G+ DVGF I +PN D LA G+VLNRHY C PSRA+ LTG++P +++ + T
Sbjct: 35 GFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCAPSRASLLTGRWPHHVYQWNLAT 93
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
AG + + P LK YSTH++GKWH G LP NRGFD GY G+
Sbjct: 94 DATAGTN----LNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYLPINRGFDTSSGYLCGW 149
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
+ + + +D +N AP + Y + ++ D +++ +H+ + PLF+ +
Sbjct: 150 VDHFNQKQ----GCAVDCWKNN---APDPRNGTYDSYYYRDDLTNIVNNHDANNPLFIYL 202
Query: 199 THAAVHT 205
VHT
Sbjct: 203 PLHNVHT 209
>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 470
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/159 (39%), Positives = 89/159 (55%), Gaps = 15/159 (9%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIH 147
+P+ E LLP++L +LGY++H +GKWH+G K+ P RGF + G+WNGY Y +
Sbjct: 22 GLPLNEILLPEHLNKLGYTSHAVGKWHLGYFKKAYTPTYRGFKSFYGFWNGYQDYYTHMV 81
Query: 148 ETDFAV--GLDARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAV 203
+ FA G D RR++ P SS KY T FT ++ +I HN+S PLFL + H A
Sbjct: 82 QATFASFEGFDMRRDLN---PDWSSVGKYSTHLFTKEATDIITKHNNSVPLFLYLAHLAP 138
Query: 204 HTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
H GT N LQ P +E+ +F I + RR +A
Sbjct: 139 HAGTYENP------LQAP--QEDINSFQSIKDKYRRKYA 169
>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
Length = 543
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 69/190 (36%), Positives = 103/190 (54%), Gaps = 12/190 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG----- 76
G++DVGF+G +IPTP++D LA +G+V Y + P C+PSRA LTG++ R+G
Sbjct: 56 GYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQRFGHGSNP 115
Query: 77 -IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
DT +P++E L LKE GY T IGKWH+G + + P RGFD G+
Sbjct: 116 EPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGF 174
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G +Y + D +G+ R E P+ + +LTD F+ ++V I+ H + P F
Sbjct: 175 SGGGFSYWGDLGMKDPLLGV--HRGDEPVDPKTLT-HLTDDFSTEAVKFIQRHE-TEPFF 230
Query: 196 LQITHAAVHT 205
L + + A H
Sbjct: 231 LYLAYNAPHA 240
>gi|372221524|ref|ZP_09499945.1| sulfatase [Mesoflavibacter zeaxanthinifaciens S86]
Length = 461
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 67/183 (36%), Positives = 97/183 (53%), Gaps = 9/183 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVG+HG + I TP +D+LA NG L R Y PTC+PSRAA LTG R GI P+
Sbjct: 47 GWNDVGYHG-SKIKTPVLDSLANNGAKLERFYVAPTCSPSRAALLTGIPASRLGIVAPIA 105
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
A+P + LP+ +K+LGY T L GKWH+G E P GFD G+ +G +
Sbjct: 106 GKSKIALPDSLVTLPKAMKKLGYRTALFGKWHLGLTPEN-GPQAYGFDTSYGFLHGQI-- 162
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQITHA 201
+ HE + G + ++ + ++TD TD ++ + P F+ + ++
Sbjct: 163 DQYTHE--YKNGDPSWHKNGKFLKE--DGHVTDLLTDAAIAYFNQETKTETPSFVTLAYS 218
Query: 202 AVH 204
A H
Sbjct: 219 APH 221
>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
variabilis ATCC 29413]
gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
ATCC 29413]
Length = 457
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 70/190 (36%), Positives = 95/190 (50%), Gaps = 16/190 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRY--GIDT 79
GW D+ +G D TPN+D LA G+ Y T CTP+R AFLTG+Y R G+
Sbjct: 53 GWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQTVCTPTRIAFLTGRYQARLPVGLRE 112
Query: 80 PVGAGVAKA-----VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
P+GA A +P + + LK GY T L+GKWH G P +GFD + G
Sbjct: 113 PLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWHAGY-PPNFGPLQKGFDEYFG 171
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
+ +G + Y TD + L E P S Y+TD FTD++V I+ HSRP
Sbjct: 172 HLSGGIEYFTHTG-TDRILDL-----YENDVPVQRSGYVTDLFTDRAVEFIQ-RPHSRPF 224
Query: 195 FLQITHAAVH 204
+L + + A H
Sbjct: 225 YLSLHYNAPH 234
>gi|294647729|ref|ZP_06725288.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294809280|ref|ZP_06767994.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|292636934|gb|EFF55393.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294443524|gb|EFG12277.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
Length = 458
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 94/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GW DVGFHG ++I TP++DAL G+ L R YT P TP+RA +TG+YP R+G+ + V
Sbjct: 37 GWGDVGFHG-SEIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ E+ + L GY +IGKWH+G K+ P NRG + G+ NG +
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGLSHFYGHLNGAI 155
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + LD + E Y T+ T +++ I ++ P L + +
Sbjct: 156 DYFDLTREGE----LDWHNDWETC---HDKGYSTELITQEAIRCIDAYEKEGPFMLYVAY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 NAPHT 213
>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 480
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 78/224 (34%), Positives = 106/224 (47%), Gaps = 35/224 (15%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW DV FHG IPTPNID LA +G++L+ +Y LP CTPSRAA +TG YP G+ V
Sbjct: 4 QGWGDVSFHGSTQIPTPNIDVLAGDGVILDNYYALPLCTPSRAALMTGLYPIHTGMHAGV 63
Query: 82 GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKW-HIGCNKEELLPFNRGFD--------- 130
A + + K++PQ+ ++LGY ++IGK H GC+ + G D
Sbjct: 64 IQDAAPWGLTLETKIMPQHFEDLGYEVNMIGKSHHDGCHNFD--STKSGIDLLHTPLISS 121
Query: 131 ---NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS 187
H W TYN + T L R Q YL F D
Sbjct: 122 IPGQHDNTWMYGKTYN--FYRTRAICRLQKR------VIQGVVVYLIRFLLDL------- 166
Query: 188 HNHSRPLFLQITHAAVHTGTAGNA-KLPT-GLLQVPDMEENDRT 229
HS+P F ++H AVH+ + + P LL+ P + E +RT
Sbjct: 167 --HSQPFFCYLSHQAVHSALMKDPFQAPARNLLKFPYIGETNRT 208
>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/188 (32%), Positives = 100/188 (53%), Gaps = 8/188 (4%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
QG+ D+G+HG +D+ TP +D LA G+ L +Y P C+PSR+ +TG+Y G+ +
Sbjct: 54 QGYADIGYHG-SDVHTPVLDQLAAEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSI 112
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
+ +P LP+ L + GY TH++GKWH+G + LP RGF + G G
Sbjct: 113 IRPRQPLCLPPDIPTLPECLLKAGYHTHMVGKWHLGFCRPSCLPTRRGFQSFFGTLTGSG 172
Query: 139 -YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ +Y + A G D + R A +M Y T + ++ ++++H+ + PLFL
Sbjct: 173 DHFSYQSC--DGAEACGFDL-HDGSRPAWEMRGNYSTLLYIERVKQILRNHDPNTPLFLY 229
Query: 198 ITHAAVHT 205
++ A HT
Sbjct: 230 LSLQAAHT 237
>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
siliculosus]
Length = 597
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 68/205 (33%), Positives = 100/205 (48%), Gaps = 23/205 (11%)
Query: 23 GWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
G NDVG+ + TP ++ LA G++L+ +Y+ CTPSRA+ +TG+ FR G+
Sbjct: 85 GTNDVGYESTDLWQLTPFMNTLAAEGVILDDYYSNEICTPSRASLMTGRDSFRTGMQFGV 144
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG------ 134
V A +P+ E L + + GYSTH+ GKWH+G PF+RGFD +G
Sbjct: 145 VEDSAAWGLPIDEVTLAERFQAAGYSTHMTGKWHLGVYSNANYPFSRGFDTFLGYTGGGE 204
Query: 135 ------------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
+ G T D ++ N R P+M+ KY T TD+++
Sbjct: 205 GYYTHRECVTPEFEGGQYTCYQDFGYGDKDGYINFTTNTTRKGPEMTGKYSTTVITDRAI 264
Query: 183 HVIKSH---NHSRPLFLQITHAAVH 204
V + H + S PLFL + H AVH
Sbjct: 265 EVAREHVEKSPSDPLFLYVAHQAVH 289
>gi|187735676|ref|YP_001877788.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
gi|187425728|gb|ACD05007.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
Length = 465
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 71/228 (31%), Positives = 109/228 (47%), Gaps = 25/228 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G I TP++D LA G+ +R Y T P C+PSR LTG++P RYGI T
Sbjct: 40 GYGDLGCTGSKQIKTPSLDRLAREGVFCSRAYVTAPMCSPSRMGLLTGRFPKRYGITTNP 99
Query: 82 GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
+ +P TEKL+P+YL GY + + GKWH+G K P RGF + G+
Sbjct: 100 NIQMDYLPESHYGLPQTEKLIPEYLAPCGYRSAVFGKWHLGHTK-GYTPPERGFTHWWGF 158
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPL 194
G Y E A GL+ + + + YLTD TD++V ++ + +P
Sbjct: 159 LGGSRHYFPVKKE---AEGLNPSMIVSNFTDKTDITYLTDDITDRAVEFLQEAGKDKKPF 215
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
F+ +++ A H E+ F ++ N +RR++
Sbjct: 216 FMFVSYNAPHWPNEAKP-------------EDIAKFRNVQNGERRVYC 250
>gi|301308937|ref|ZP_07214882.1| arylsulfatase-like protein [Bacteroides sp. 20_3]
gi|423338414|ref|ZP_17316156.1| hypothetical protein HMPREF1059_02081 [Parabacteroides distasonis
CL09T03C24]
gi|300832963|gb|EFK63588.1| arylsulfatase-like protein [Bacteroides sp. 20_3]
gi|409233843|gb|EKN26675.1| hypothetical protein HMPREF1059_02081 [Parabacteroides distasonis
CL09T03C24]
Length = 589
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 112/248 (45%), Gaps = 57/248 (22%)
Query: 5 VGAGVAKAVPVTEKLLP---------QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
VGAG A +K LP QGW D+GF G + TPNID +A+ G +L Y
Sbjct: 13 VGAGCIPAF--AQKQLPNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYV 70
Query: 56 LPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
P +P+RA FLTG+Y R G+++ G G + + EK + +Y +E GY+T L GKWH
Sbjct: 71 CPVSSPTRAEFLTGRYHVRSGVNSTTGGG--ERFNLGEKTIAEYFREAGYATSLFGKWHS 128
Query: 116 GCNKEELLPFNRGFDNHVG--------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQ 167
G + P RGF+ G YWN L +N I +
Sbjct: 129 G-TQYPYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGE----------------- 170
Query: 168 MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN- 226
++ D TD+++ I+ H P F+ +++ H+ +QVPD N
Sbjct: 171 ---GFIIDDLTDKALDYIRDHKE-HPFFMFLSYNTPHSP-----------MQVPDSWWNR 215
Query: 227 --DRTFAH 232
DRT +
Sbjct: 216 VKDRTLSQ 223
>gi|298377639|ref|ZP_06987590.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|298265342|gb|EFI07004.1| arylsulfatase [Bacteroides sp. 3_1_19]
Length = 589
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 112/248 (45%), Gaps = 57/248 (22%)
Query: 5 VGAGVAKAVPVTEKLLP---------QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
VGAG A +K LP QGW D+GF G + TPNID +A+ G +L Y
Sbjct: 13 VGAGCIPAF--AQKQLPNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYV 70
Query: 56 LPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
P +P+RA FLTG+Y R G+++ G G + + EK + +Y +E GY+T L GKWH
Sbjct: 71 CPVSSPTRAEFLTGRYHVRSGVNSTTGGG--ERFNLGEKTIAEYFREAGYATSLFGKWHS 128
Query: 116 GCNKEELLPFNRGFDNHVG--------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQ 167
G + P RGF+ G YWN L +N I +
Sbjct: 129 G-TQYPYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGE----------------- 170
Query: 168 MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN- 226
++ D TD+++ I+ H P F+ +++ H+ +QVPD N
Sbjct: 171 ---GFIIDDLTDKALDYIRDHKE-HPFFMFLSYNTPHSP-----------MQVPDSWWNR 215
Query: 227 --DRTFAH 232
DRT +
Sbjct: 216 VKDRTLSQ 223
>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
Length = 300
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 74/196 (37%), Positives = 99/196 (50%), Gaps = 27/196 (13%)
Query: 59 CTPSRAAFLTGKYPFRYGI-------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIG 111
CTPSRAA LTGKYP G+ D P G +P+ E + + +E GY T L+G
Sbjct: 2 CTPSRAALLTGKYPINTGMQHYVIVNDQPWG------LPLNETTMAEIFRENGYRTSLLG 55
Query: 112 KWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFA--VGLDARRNMERYAPQMS 169
KWH+G ++ P RGFD H+GY Y+ Y +E G D R +++ +
Sbjct: 56 KWHLGLSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHV- 114
Query: 170 SKYLTDFFTDQSVHVIKSH---NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN 226
Y+TD TD +V I+ H N S+PLFL + H A H A N P +Q P EE
Sbjct: 115 GHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPH---AANDDDP---MQAP-AEEV 167
Query: 227 DRTFAHISNPDRRLFA 242
R F +ISN R +A
Sbjct: 168 SR-FEYISNKTHRYYA 182
>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
Length = 345
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 96/186 (51%), Gaps = 11/186 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G D+ TPN+DALA G++L +Y CTPSR A ++G+YP + + V
Sbjct: 37 GYHDIGLRNP-DVITPNLDALASKGVILTNNYVQALCTPSRHALMSGRYPSASAMQSMVI 95
Query: 83 AGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ A+ + K LPQYLKELGY H++GKWH+G ++E LP +RGFD G + G
Sbjct: 96 QPMEARCAGLEYKFLPQYLKELGYKNHMVGKWHLGYCRDECLPTSRGFDTFYGLYAGAGD 155
Query: 142 Y--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ++ + D+ + D + + + D + V H+ PLFL
Sbjct: 156 YWSHEIFGKYDWHINGDIHF-------EANGTHSQDLEMEGLDKVFDEHDSKDPLFLYFA 208
Query: 200 HAAVHT 205
HT
Sbjct: 209 PQNPHT 214
>gi|150010519|ref|YP_001305262.1| N-acetylgalactosamine 6-sulfatase [Parabacteroides distasonis ATCC
8503]
gi|149938943|gb|ABR45640.1| N-acetylgalactosamine 6-sulfatase [Parabacteroides distasonis ATCC
8503]
Length = 589
Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 112/248 (45%), Gaps = 57/248 (22%)
Query: 5 VGAGVAKAVPVTEKLLP---------QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
VGAG A +K LP QGW D+GF G + TPNID +A+ G +L Y
Sbjct: 13 VGAGCIPAF--AQKQLPNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYV 70
Query: 56 LPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
P +P+RA FLTG+Y R G+++ G G + + EK + +Y +E GY+T L GKWH
Sbjct: 71 CPVSSPTRAEFLTGRYHVRSGVNSTTGGG--ERFNLGEKTIAEYFREAGYATSLFGKWHS 128
Query: 116 GCNKEELLPFNRGFDNHVG--------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQ 167
G + P RGF+ G YWN L +N I +
Sbjct: 129 G-TQYPYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGE----------------- 170
Query: 168 MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN- 226
++ D TD+++ I+ H P F+ +++ H+ +QVPD N
Sbjct: 171 ---GFIIDDLTDKALDYIRDHKE-HPFFMFLSYNTPHSP-----------MQVPDSWWNR 215
Query: 227 --DRTFAH 232
DRT +
Sbjct: 216 VKDRTLSQ 223
>gi|374373208|ref|ZP_09630868.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
gi|373234181|gb|EHP53974.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
Length = 454
Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 60/183 (32%), Positives = 95/183 (51%), Gaps = 4/183 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIV-LNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G++D G +G I TP +D +A +G + N + P+CTPSRA+ LTG+Y RY + P+
Sbjct: 37 GYSDPGCYGNPVIQTPFLDKIARSGFMSTNYIVSSPSCTPSRASLLTGRYASRYNLPDPI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G G +P E + + LK GY T ++GKWH+G + P +GFD G
Sbjct: 97 GPGSKLGLPDAEVTMAEMLKAAGYKTAMVGKWHLGDQHDYNYPTGQGFDRFYGMLYSQ-D 155
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y +TD + + R E + P+ S+ LT +T +S+ +I+ +P FL + +
Sbjct: 156 YRAPYVKTDTVIKIFRNRTPEIFKPEDST--LTQLYTKESIKIIREQRPGQPFFLYLAYN 213
Query: 202 AVH 204
H
Sbjct: 214 MPH 216
>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
Length = 470
Score = 106 bits (265), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 77/234 (32%), Positives = 113/234 (48%), Gaps = 36/234 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D G +G +IPTPNIDALA NG + Y + C PSRA LTG Y R+G + +
Sbjct: 38 GYADFGCYGGKEIPTPNIDALAKNGTLFTDAYVSASVCAPSRAGILTGMYQQRFGFEHNI 97
Query: 82 GAGVAKAVPVTE-------KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
K + + K + +K GY T IGKWH G + + P RGFD G
Sbjct: 98 SELPVKPYTLNDVGMDPKIKTIGDQMKHNGYRTIAIGKWHQG-DLPQYFPLKRGFDEFYG 156
Query: 135 YWNGYLTY------NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
+ G+ ++ HE A + ++ P+ + YLTD FTD+++ +K
Sbjct: 157 FVGGHRSFFGYPGGKAPSHEL-------ALFDNDKIVPENTIGYLTDMFTDKAISFVK-E 208
Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
N S+P F+ + + AVH NAK E DR F +I++P R+ +A
Sbjct: 209 NKSKPFFMYLAYNAVHVPM--NAK----------KELMDR-FPNITDPGRKAYA 249
>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 498
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 77/232 (33%), Positives = 112/232 (48%), Gaps = 33/232 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G + TPN+D LA +G++ ++ Y C+PSRA LTG+ P R+G + +
Sbjct: 45 GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104
Query: 82 GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
A + +PV+EK L +L GY+T LIGKWH+G E P RGFD+
Sbjct: 105 NASDERYATRPELLGLPVSEKTLGDHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 163
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR- 192
G G S H + RN +R S++YLTDFFTD+ + I H ++
Sbjct: 164 GMLTG------SHHYFPTTMNHVIERNGQR-VEDFSNEYLTDFFTDEGLRFIDQHEAAKP 216
Query: 193 --PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
P F+ ++ A HT E + FA+I N RR +A
Sbjct: 217 DQPWFVFYSYNAPHTPMHAT-------------EADLARFANIQNKKRRTYA 255
>gi|196231680|ref|ZP_03130537.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224152|gb|EDY18665.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 474
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 82/260 (31%), Positives = 122/260 (46%), Gaps = 57/260 (21%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
G+ + G +G DIPTPNID L +G+ + Y + P C SRAA +TG+Y R+G + P
Sbjct: 39 GYGEPGCYGGKDIPTPNIDKLVASGVRFSSGYVSAPFCAASRAALMTGRYQTRFGFEYNP 98
Query: 81 VGAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
+GA A +PV EK + L+++GY+T L+GKWH+G P RGFD G+
Sbjct: 99 IGAKNADPGTGLPVNEKTVADRLRDVGYATGLVGKWHLG-GTAPFHPQRRGFDEFFGFLH 157
Query: 136 ---------WNGYLT-----------------------YNDSIHETDFAVGLDARRNMER 163
W+G T ++ +HE + A DA + R
Sbjct: 158 EGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGHTVWSTDLHENEPAY--DADNPLLR 215
Query: 164 YAPQMSSKY-LTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPD 222
+ + K LTD FT ++ I H ++P FL + + AVH+ G
Sbjct: 216 NSQPVEEKANLTDAFTREACSFIDRH-QAQPWFLYLAYNAVHSPLQGEDTY--------- 265
Query: 223 MEENDRTFAHISNPDRRLFA 242
ME+ F+HI + RR+FA
Sbjct: 266 MEK----FSHIGDIQRRIFA 281
>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
Length = 471
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 69/200 (34%), Positives = 101/200 (50%), Gaps = 27/200 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D GF G ++ TPN+D LA +G+ + Y T TC PSRA +TGKY R+G +
Sbjct: 33 GFADFGFQGSTEMKTPNLDKLANSGVKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEIN 92
Query: 82 GAGVAK----------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
G +P+ + + YLK+LGY+T + GKWH+G N + P NRGFD
Sbjct: 93 VPGYMSENSKFLADDMGLPLDQLTIGDYLKKLGYNTAMYGKWHLG-NADRFHPMNRGFDE 151
Query: 132 HVGYWNGYLTY------NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
G+ G +Y + + H+T G N E PQ +Y+TD D+++ I
Sbjct: 152 FYGFRGGARSYFGYDVASSAHHDTKMERGFG---NFEE--PQ---EYVTDALADEAISFI 203
Query: 186 KSHNHSRPLFLQITHAAVHT 205
+ N P F+ + AVHT
Sbjct: 204 EK-NKKNPFFIYLAFNAVHT 222
>gi|325286699|ref|YP_004262489.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322153|gb|ADY29618.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 494
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/234 (32%), Positives = 111/234 (47%), Gaps = 28/234 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG--IDT 79
G++DVGF+G DI TP +D LA G + Y P C PSRAA LTGKYP G +
Sbjct: 53 GYSDVGFNGSTDIKTPELDKLANAGTIFTSAYVAHPFCGPSRAALLTGKYPHTIGSQFNL 112
Query: 80 PV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
P G + K + E+ + + L+E GY T IGKWH+G EE P RGF + G+ G
Sbjct: 113 PANGESLGKGIDTNEQFIAKTLQESGYYTGAIGKWHLGAT-EEFHPNQRGFTDFYGFLGG 171
Query: 139 YLTY--------NDSIHETDFAVGLDARRNMERYAPQM-SSKYLTDFFTDQSVHVIK-SH 188
Y + + D +E ++ ++YLTD F+ ++ +K +
Sbjct: 172 GHNYFPEQYQAQYQKQKKAKKKIIRDYILPLEHNGKEVKETEYLTDAFSREASRFVKEAS 231
Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
N +P FL + + A H + + EE+ F+ I + DRR +A
Sbjct: 232 NKKKPFFLYLAYNAPH-------------VPLEAKEEDLEKFSVIKDKDRRTYA 272
>gi|313215712|emb|CBY16310.1| unnamed protein product [Oikopleura dioica]
Length = 350
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/121 (42%), Positives = 77/121 (63%), Gaps = 2/121 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G++D+G+ D+ +PNID LA N + ++Y P+CTPSRAA +TG+Y RYG+ + V
Sbjct: 130 GYDDLGYVNP-DVKSPNIDYLANNALHFEKYYNQPSCTPSRAALMTGRYNIRYGLQSGVI 188
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+A+P++E LLP+ K+ GY+T + GKWH+G EE P RGFD G++ G
Sbjct: 189 KPDEPEAIPLSETLLPKAFKKCGYNTSMHGKWHLGYYTEEHCPQKRGFDRFFGFYLGSQD 248
Query: 142 Y 142
Y
Sbjct: 249 Y 249
>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
Length = 501
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 61/187 (32%), Positives = 96/187 (51%), Gaps = 13/187 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G D+ TP +D LA G+ +Y + C+PSR +F++G+YPF + V
Sbjct: 33 GYHDIGLRNP-DLHTPTLDKLATKGVQFKNNYVMHACSPSRHSFMSGRYPFTSQMQKDVI 91
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
V+ P+ K LP+YLKELGY TH +GKWH+G +E+ +P +RGFD+ G +G
Sbjct: 92 FPVSPDCSPLKLKFLPEYLKELGYGTHAVGKWHLGYCREDCMPTSRGFDSFYGTLDGEGD 151
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
YLT+ + G R +++ + + L D +I+ +P FL
Sbjct: 152 YLTHMSAGFYDWHTNGTVDRSKSGQHSQDLHTAALAD--------IIERQTEEKPFFLYF 203
Query: 199 THAAVHT 205
HT
Sbjct: 204 AAQNPHT 210
>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 498
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 79/232 (34%), Positives = 112/232 (48%), Gaps = 33/232 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G + TPN+D LA +G++ ++ Y C+PSRA LTG+ P R+G + +
Sbjct: 45 GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104
Query: 82 GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
A +P +EK L +L GY+T LIGKWH+G E P RGFD+
Sbjct: 105 NASDENYATRPELLGLPKSEKTLADHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 163
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KSHNH 190
G G S H + RN +R SS+YLTDFFTD+ + I KS N
Sbjct: 164 GMLTG------SHHYFPTTMNHVIERNGKR-VENFSSEYLTDFFTDEGLRFIDQHKSANP 216
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P F+ ++ A HT + E + FA+I N RR +A
Sbjct: 217 DQPWFVFFSYNAPHT-------------PMHATEADLARFANIQNQKRRTYA 255
>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1605
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 105/209 (50%), Gaps = 39/209 (18%)
Query: 23 GWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---- 77
G NDVG+ + + TP ID LA G+ L +Y++ CTP+RAA ++G YP R G+
Sbjct: 103 GSNDVGYQSHDMVGVTPFIDGLAEQGVRLKEYYSMHMCTPARAALMSGHYPMRIGMQLEN 162
Query: 78 ---DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
D+P G +P + + LK LGY+TH +GKW +G ++ LP NRGFD G
Sbjct: 163 IKPDSPWG------MPRELTTMAETLKNLGYNTHGVGKWGLGHSQHGFLPVNRGFDTWYG 216
Query: 135 YWNGYLTYNDSIHE------------------TDFAVGLDARRNMERYAPQMSSKYLTDF 176
Y + + Y HE TD+ +R +Y P ++ + ++
Sbjct: 217 YLSDEIDYYS--HEYPAPFETVEDGATVMASFTDYVFMERSRPYDLQYMPDLNGTHSSEL 274
Query: 177 FTDQSVHVIKSHNHSR-PLFL----QITH 200
+T + ++KS N SR PLF+ Q+TH
Sbjct: 275 YTQRVQQIVKSANASREPLFVYYASQMTH 303
>gi|313228866|emb|CBY18017.1| unnamed protein product [Oikopleura dioica]
Length = 482
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 73/209 (34%), Positives = 100/209 (47%), Gaps = 33/209 (15%)
Query: 25 NDVGFHGENDIP-------TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI 77
+D+GF ND+P PN+ +LA NG +L+ Y P CTPSR+A +T +YP R G+
Sbjct: 91 DDLGF---NDMPWNNPAIIAPNLHSLAKNGTILSNFYVQPVCTPSRSALMTSRYPIRLGL 147
Query: 78 DTPV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
T V A +P+ E + + GY+TH++GKWH+G + LP NRGFD GY
Sbjct: 148 QTDVITAPQPSCLPLDEVTIGNEFQSAGYTTHIVGKWHLGHYCPQCLPNNRGFDTFRGYL 207
Query: 137 NGYLTYNDSI-------HETDFAVGLDARRNMERYAPQMSSKYLTD-------------F 176
G Y ++ A G D N R P+ + Y T
Sbjct: 208 TGAEDYYKKTFCIPLVPNQRPAACGFDFYDNENR-MPKANGTYSTYQVLIYLFIHTIILK 266
Query: 177 FTDQSVHVIKSHNHSR-PLFLQITHAAVH 204
F D S VIKSH S+ P FL + +VH
Sbjct: 267 FADASREVIKSHEGSKTPFFLYLPFQSVH 295
>gi|325109241|ref|YP_004270309.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969509|gb|ADY60287.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 485
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/256 (30%), Positives = 112/256 (43%), Gaps = 51/256 (19%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
G+ ++G G IPTP+ID+LA NG+ Y T C+PSRA LTG+Y R+G + P
Sbjct: 53 GYGELGCQGNPQIPTPHIDSLAANGVRFRCGYVTAAYCSPSRAGLLTGRYQSRFGYEQNP 112
Query: 81 VGAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
GA +P+ EK L + L + GY+T L+GKWH+G P RGFD G+ +
Sbjct: 113 TGARNEDPELGLPLEEKTLARRLHDAGYATGLVGKWHLG-GTARFHPLRRGFDEFFGFLH 171
Query: 138 G--------YLTYNDSIHETDFAVGLDARRNMERY-----------------------AP 166
Y + + G R ER P
Sbjct: 172 EGHFFVPPPYEGVSTFLRRRALPNGKTGRWGDERLMLSTHMGHDEPAYDANNPILRGGQP 231
Query: 167 QMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN 226
+ YLTD FT ++ I + N RP FL + + AVH+ G +
Sbjct: 232 VEEAAYLTDAFTREACDFI-ARNQDRPFFLYLAYNAVHSPLQG-------------ADAY 277
Query: 227 DRTFAHISNPDRRLFA 242
+ FAHI++ RR+FA
Sbjct: 278 MQQFAHIADQQRRIFA 293
>gi|262384881|ref|ZP_06078013.1| N-acetylgalactosamine 6-sulfatase [Bacteroides sp. 2_1_33B]
gi|262293597|gb|EEY81533.1| N-acetylgalactosamine 6-sulfatase [Bacteroides sp. 2_1_33B]
Length = 589
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 77/248 (31%), Positives = 111/248 (44%), Gaps = 57/248 (22%)
Query: 5 VGAGVAKAVPVTEKLLP---------QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT 55
VGAG A +K LP QGW D+GF G + TPNID +A+ G +L Y
Sbjct: 13 VGAGCIPAF--AQKQLPNIIVMLSDDQGWGDLGFTGNTFVQTPNIDRIAHEGTILENFYV 70
Query: 56 LPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
P +P+RA FLTG+Y R G+++ G G + EK + +Y +E GY+T L GKWH
Sbjct: 71 CPVSSPTRAEFLTGRYHVRSGVNSTTGGG--ERFNQGEKTIAEYFREAGYATSLFGKWHS 128
Query: 116 GCNKEELLPFNRGFDNHVG--------YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQ 167
G + P RGF+ G YWN L +N I +
Sbjct: 129 G-TQYPYHPNARGFEEFYGFCSGHWGNYWNPVLEHNGEIISGE----------------- 170
Query: 168 MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN- 226
++ D TD+++ I+ H P F+ +++ H+ +QVPD N
Sbjct: 171 ---GFIIDDLTDKALDYIRDHKE-HPFFMFLSYNTPHSP-----------MQVPDSWWNR 215
Query: 227 --DRTFAH 232
DRT +
Sbjct: 216 VKDRTLSQ 223
>gi|296122626|ref|YP_003630404.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296014966|gb|ADG68205.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 470
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 95/190 (50%), Gaps = 15/190 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D+G G + TP IDALA +G + Y+ P C+P+RAA +TGK P R GI +
Sbjct: 40 GKTDIGIEGSSFYETPRIDALAKSGARFTQFYSAHPVCSPTRAALMTGKMPQRLGITDWI 99
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD-----NHVGYW 136
A+P +E + Q +E GY T +GKWH+G +K + P RGFD NH G
Sbjct: 100 RPESDVALPQSEVTIGQAFQEAGYHTAYLGKWHLG-HKPQQHPAARGFDWTKGVNHGGQP 158
Query: 137 NG-YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
+ Y Y + DA N+ + YLTD T ++ ++ + +RP F
Sbjct: 159 SSYYFPYKNPQKP-------DAPNNVPDFEKCQPEDYLTDVLTSSAIEHLQQRDRTRPFF 211
Query: 196 LQITHAAVHT 205
L + H AVHT
Sbjct: 212 LCLAHYAVHT 221
>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 97/190 (51%), Gaps = 9/190 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DV FH I TPNID L G+ L +YT CTP+R+A LTG+YP G+ T V
Sbjct: 34 GWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQHICTPTRSALLTGRYPIHTGLQTNVI 93
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYST-HLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
A + + E LLP+YL+ H++GKWH+G + P+ RGF+ GY G
Sbjct: 94 AISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHVGHGHSWMTPWKRGFETFSGYLAGAE 153
Query: 139 -YLTYNDSIHETDFAVGLD-ARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPL 194
+ T + +TD+ G+D + + P SS KY D + + ++KS + ++
Sbjct: 154 DHYTREWCMEQTDWC-GVDYSEHSASLSGPTNSSWGKYSGDLYLQKMSEIVKSIDPTKDS 212
Query: 195 FLQITHAAVH 204
F+ VH
Sbjct: 213 FIYFAPQHVH 222
>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/190 (34%), Positives = 97/190 (51%), Gaps = 9/190 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DV FH I TPNID L G+ L +YT CTP+R+A LTG+YP G+ T V
Sbjct: 34 GWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQHICTPTRSALLTGRYPIHTGLQTNVI 93
Query: 83 A-GVAKAVPVTEKLLPQYLKELGYST-HLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-- 138
A + + E LLP+YL+ H++GKWH+G + P+ RGF+ GY G
Sbjct: 94 AISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHVGHGHSWMTPWKRGFETFSGYLAGAE 153
Query: 139 -YLTYNDSIHETDFAVGLD-ARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSRPL 194
+ T + +TD+ G+D + + P SS KY D + + ++KS + ++
Sbjct: 154 DHYTREWCMEQTDWC-GVDYSEHSASLSGPTNSSWGKYSGDLYLQKMSEIVKSIDPTKDS 212
Query: 195 FLQITHAAVH 204
F+ VH
Sbjct: 213 FIYFAPQHVH 222
>gi|149197521|ref|ZP_01874572.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139539|gb|EDM27941.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 72/225 (32%), Positives = 113/225 (50%), Gaps = 23/225 (10%)
Query: 23 GWNDVGFHGE-NDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
G+ DV +HG + TP+ID++A +G Y+ P C PSRA L+G+Y R+G
Sbjct: 34 GYGDVSYHGTLKETTTPHIDSIAQSGAWFQNGYSAAPVCGPSRAGLLSGRYQQRFGYYDN 93
Query: 81 VG-----AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
+G V +P+++KL+P+ L + GY+T ++GKWH G ++ + P+NRGF G+
Sbjct: 94 IGPFTLNKDVEAGLPLSQKLIPEILVKEGYATGMVGKWHDG-DQHKFWPYNRGFQEFYGF 152
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
NG + N + + V + E + S +Y+T+ F ++V I H + P F
Sbjct: 153 NNGAIN-NWVLKGENHTVDEWGAVHRENKRVENSGEYMTEAFGREAVEFIDRHK-TEPFF 210
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRL 240
L ++ AVH G LQ P N F HI +R L
Sbjct: 211 LYLSFNAVH-----------GPLQAPKSYTN--QFKHIKPENRAL 242
>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 498
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 78/232 (33%), Positives = 110/232 (47%), Gaps = 33/232 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G + TPN+D LA +G++ ++ Y C+PSRA LTG+ P R+G + +
Sbjct: 45 GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104
Query: 82 GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
A +P +EK L +L GY+T LIGKWH+G E P RGFD+
Sbjct: 105 NASDENYATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 163
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH---NH 190
G G S H + RN +R SS+YLTDFFTD+ + I H N
Sbjct: 164 GMLTG------SHHYFPTTMKHVIERNGKR-VDGFSSEYLTDFFTDEGLRFIDQHESANP 216
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P F+ ++ A HT E + FA+I N RR +A
Sbjct: 217 DQPWFVFFSYNAPHTPMHAT-------------EADLARFANIQNQKRRTYA 255
>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 551
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/194 (34%), Positives = 95/194 (48%), Gaps = 28/194 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDVG+HG N I TP++D LA G+ LNR YT P C+P+RAA +TG+ P R GI V
Sbjct: 47 GWNDVGYHGGN-IDTPSLDKLAEQGVQLNRFYTTPICSPTRAALMTGRDPMRLGIAYGVI 105
Query: 82 ----GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
GV A E +PQ + GY T ++GKWH+G + P RGF++ G+ +
Sbjct: 106 LPWDNIGVNPA----EHFMPQSFQAAGYQTAMVGKWHLGHAQMTYHPNQRGFEHFYGHLH 161
Query: 138 ---GYLTYNDSIHETDF---AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
G+ ++ DF V +D Y T D+ I+ +
Sbjct: 162 TEVGFYPPFANVGGKDFQENGVSID------------DEGYETYLLADEVSRYIRDRDEE 209
Query: 192 RPLFLQITHAAVHT 205
+P F+ + A HT
Sbjct: 210 KPFFIYMPFIAPHT 223
>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
Length = 526
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/185 (36%), Positives = 98/185 (52%), Gaps = 9/185 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DVG+HG +DI TP+ID LA G LNR Y P C+P+RAA +TG+ P + G+ V
Sbjct: 62 GWGDVGYHG-SDIQTPHIDRLAKEGAKLNRFYATPFCSPTRAALMTGRDPLKLGVAYSVL 120
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
V + E LPQ + GY+T ++GKWH+G E+ P RGFD G+ + ++
Sbjct: 121 MPWENGGVSLDEHFLPQSFQAAGYNTAMVGKWHLGHTIEQHTPNARGFDLFYGHMHTQVS 180
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRPLFLQITH 200
Y D H+ A G D + N + + +Y TD Q+ I + ++P L +
Sbjct: 181 YFD--HQ--IANGHDFQENGK--PVDHNGEYATDVHGAQAARFITDLRDKTKPFLLYVPF 234
Query: 201 AAVHT 205
A H+
Sbjct: 235 LAPHS 239
>gi|291231158|ref|XP_002735532.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 191
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 51/127 (40%), Positives = 75/127 (59%), Gaps = 5/127 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYG-IDTPV 81
GWND+G+H TPN+++LA +GI L +Y P CTPSR LTG+Y RYG + +
Sbjct: 35 GWNDIGYHNP-IFQTPNLNSLAADGIKLENYYVAPVCTPSRGQLLTGRYAMRYGLVHRNI 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW---NG 138
+P+ E LP+ +K+ GY+TH++GKWH G +P RGFD+ G++
Sbjct: 94 RPAQRMCLPLDEVTLPEKMKQAGYATHMVGKWHQGFYTPACIPTQRGFDSFFGFYICTED 153
Query: 139 YLTYNDS 145
Y T++ S
Sbjct: 154 YFTHSAS 160
>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
Length = 482
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 106/223 (47%), Gaps = 18/223 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
G+ D GF G + + TPN+D LA G V + Y + C PSRA LTGKY R+G +
Sbjct: 49 GYADFGFQGSDVMRTPNLDKLASQGTVFTQAYVSAAVCGPSRAGILTGKYQQRFGYEENN 108
Query: 79 -----TPVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
+ G G +P+ +K + YL+E GY T LIGKWH G N + P RGFD
Sbjct: 109 VPGYMSQSGLTGDDMGLPLDQKTMADYLRERGYKTALIGKWHQG-NADRFHPTKRGFDEF 167
Query: 133 VGYWNGYLTYNDSIHETDFAVGLDA-RRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
G+ G +Y + + D R + Q S +YLT+ ++V IK N
Sbjct: 168 YGFRGGARSYFGFGAQNPVSYPEDKLERGFAHF--QESKRYLTEALATETVEFIK-RNQK 224
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
P F+ ++ AVHT P L Q +++ + A ++
Sbjct: 225 HPFFVFLSFNAVHTPMEAK---PADLAQFSNLKGKRQQLAAMT 264
>gi|320105193|ref|YP_004180784.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319752475|gb|ADV64235.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 481
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 72/215 (33%), Positives = 113/215 (52%), Gaps = 28/215 (13%)
Query: 5 VGAGVAKAVP-----VTEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNR-HYTLPT 58
+G+G A+P +T+ L G+ D+G +G DI TP ID+LA +G L+ H P
Sbjct: 56 LGSGTNDALPHIVLIMTDDL---GYADLGCYGAPDIATPRIDSLARDGARLSHFHSPGPV 112
Query: 59 CTPSRAAFLTGKYPFRYGIDTPVGAG-VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGC 117
CTP+RAA LTG++P R G++ + A +PV E +L + LKE+GY T ++GKWH+G
Sbjct: 113 CTPTRAALLTGRWPQRVGLEWALSASDTEPGLPVEEPILSRPLKEVGYRTVMVGKWHLG- 171
Query: 118 NKEELLPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLT 174
+ E P GFD G +G + ++ + + D+ E P Y T
Sbjct: 172 YRPEFGPNAHGFDEFFGLLSGNVDHYSHREINGKEDW---------YENTKPVRVEGYST 222
Query: 175 DFFTDQSVHVIKSH-----NHSRPLFLQITHAAVH 204
D +D++V I+ + +PL+L + + AVH
Sbjct: 223 DLLSDRAVAAIQKTAAQPPDQRQPLWLYVAYNAVH 257
>gi|313212372|emb|CBY36360.1| unnamed protein product [Oikopleura dioica]
gi|313214813|emb|CBY41065.1| unnamed protein product [Oikopleura dioica]
Length = 174
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 74/119 (62%), Gaps = 5/119 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+ D+G++ ++ PN+D LA NGI+ + YT P CTPSRA F+TG+Y R G+ +
Sbjct: 53 GYADIGYNSDHAF-MPNMDFLANNGIIFDSFYTQPVCTPSRAQFMTGRYTNRLGLQHRNI 111
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ +P+ EK +P+Y +E GYST +IGKWH+G LP NRGF V + GY+
Sbjct: 112 LSAQPSGIPLDEKTVPEYFRECGYSTEMIGKWHLGLFTSNFLPHNRGF---VSGFQGYI 167
>gi|313214045|emb|CBY42606.1| unnamed protein product [Oikopleura dioica]
Length = 191
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 98/185 (52%), Gaps = 5/185 (2%)
Query: 40 IDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PVGAGVAKAVPVTEKLLPQ 98
+D L NG + Y+ C+PSRA LTG+Y FR G+ + P+ V + +K LP+
Sbjct: 1 MDKLVKNGTQFTQMYSSHRCSPSRAMALTGRYAFRSGMGSFPIAREVPFGMNTQDKTLPE 60
Query: 99 YLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDAR 158
YLKE+GY TH +GKWH+G LP +RGFD G+++G + Y + D
Sbjct: 61 YLKEVGYDTHAVGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVDYRGHFIKRSKNFYHDFF 120
Query: 159 RN-MERYAPQMSS--KYLTDFFTDQSVHVIKSHNHSR-PLFLQITHAAVHTGTAGNAKLP 214
N +E++ + S ++ TD F D+++ ++K S+ P ++ + A H T A L
Sbjct: 121 DNTIEQHKLDLESDGQWTTDLFRDRTIDILKEAKRSKTPAYVYLAFNAPHEPTRAPADLI 180
Query: 215 TGLLQ 219
+L+
Sbjct: 181 ARILE 185
>gi|405970955|gb|EKC35816.1| N-acetylgalactosamine-6-sulfatase [Crassostrea gigas]
Length = 511
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/196 (33%), Positives = 97/196 (49%), Gaps = 15/196 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G GE + TP +D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 31 GWGDLGVFGEPNKETPYLDQMAAEGMLFPDFYSANPLCSPSRAALLTGRLPIRNGFYTTN 90
Query: 82 G--------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P E LLP+ L++ GY + L+GKWH+G ++ + LP GFD
Sbjct: 91 GHARNAYTPQNIVGGIPDEEILLPELLQKAGYKSKLVGKWHLG-HQAKYLPLKHGFDEWF 149
Query: 134 GYWNGYLTYNDSIHETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSH 188
G N + D++H + V + R + + LT +T ++V I + H
Sbjct: 150 GAPNCHFGPYDNVHTPNIPVYRNEEMAGRYYQDFKIEKNGESNLTQLYTKEAVEFITRMH 209
Query: 189 NHSRPLFLQITHAAVH 204
N S+P FL A H
Sbjct: 210 NKSKPFFLYWAVDATH 225
>gi|421613763|ref|ZP_16054834.1| arylsulfatase A [Rhodopirellula baltica SH28]
gi|408495349|gb|EKJ99936.1| arylsulfatase A [Rhodopirellula baltica SH28]
Length = 616
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 92/187 (49%), Gaps = 24/187 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW D+ H I TP +DALA L+R Y P C P+RAA LTG+YP R G+
Sbjct: 72 QGWGDLAAHRNPKISTPTLDALANESARLDRFYVSPVCAPTRAALLTGRYPERSGV---- 127
Query: 82 GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
AGV + + E L + + GY+T GKWH G + L P +GF+ G+ G
Sbjct: 128 -AGVTGRREVMRAEETTLAELYRAAGYATGCFGKWHNGA-QMPLHPNGQGFNEFFGFCGG 185
Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y+D++ E RN P + Y+TD TD +V I++H H RP F
Sbjct: 186 HFNLYDDALLE----------RN---GTPVQTKGYITDVLTDAAVEFIQNH-HDRPFFCY 231
Query: 198 ITHAAVH 204
+ A H
Sbjct: 232 VPFNAPH 238
>gi|32475139|ref|NP_868133.1| arylsulfatase [Rhodopirellula baltica SH 1]
gi|32445680|emb|CAD78411.1| arylsulfatase homolog b1498 [Rhodopirellula baltica SH 1]
Length = 656
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/187 (35%), Positives = 92/187 (49%), Gaps = 24/187 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW D+ H I TP +DALA L+R Y P C P+RAA LTG+YP R G+
Sbjct: 112 QGWGDLAAHRNPKISTPTLDALANESARLDRFYVSPVCAPTRAALLTGRYPERSGV---- 167
Query: 82 GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
AGV + + E L + + GY+T GKWH G + L P +GF+ G+ G
Sbjct: 168 -AGVTGRREVMRAEETTLAELYRSAGYATGCFGKWHNGA-QMPLHPNGQGFNEFFGFCGG 225
Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y+D++ E RN P + Y+TD TD +V I++H H RP F
Sbjct: 226 HFNLYDDALLE----------RN---GTPVQTKGYITDVLTDAAVEFIQNH-HDRPFFCY 271
Query: 198 ITHAAVH 204
+ A H
Sbjct: 272 VPFNAPH 278
>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 525
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 76/229 (33%), Positives = 111/229 (48%), Gaps = 29/229 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCT---PSRAAFLTGKYPFRYGIDT 79
GW DVG+ G D+ TPNIDALA G+ ++ Y +C+ PSRA LTG Y R+G
Sbjct: 43 GWGDVGYQGAVDVSTPNIDALARRGVQFSQGYV--SCSISGPSRAGILTGVYQQRFGFYN 100
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
+ +P + L + +++ GY+T +GKWH+ + E+ P RGFD G+W+
Sbjct: 101 NLHPWA--KIPEGQSTLGEMVRDCGYATGFVGKWHMADSPEQ-SPNRRGFDQFYGFWSDT 157
Query: 140 LTYNDS-----IHETDFAVGLDARRNMERYAP-QMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y S + DF RN E P S +Y+TD FT ++V I H S P
Sbjct: 158 HDYYRSTDKPGVELYDFC---PLYRNGEIQPPLHESGEYITDCFTREAVEFIDKHASS-P 213
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L +++ AVH+ QVP+ N + DR++FA
Sbjct: 214 FLLCLSYNAVHSP-----------WQVPEHYVNRLEGRRFHHEDRKVFA 251
>gi|313237610|emb|CBY12754.1| unnamed protein product [Oikopleura dioica]
Length = 168
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/109 (44%), Positives = 70/109 (64%), Gaps = 2/109 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+ D+G++ ++ PN+D LA NGI+ + YT P CTPSRA F+TG+Y R G+ +
Sbjct: 53 GYADIGYNSDHAF-MPNMDFLANNGIIFDSFYTQPVCTPSRAQFMTGRYTNRLGLQHRNI 111
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
+ +P+ EK +P+Y +E GYST +IGKWH+G LP NRGF+
Sbjct: 112 LSAQPSGIPLDEKTVPEYFRECGYSTEMIGKWHLGLFTSNFLPHNRGFN 160
>gi|399033016|ref|ZP_10732099.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
gi|398068627|gb|EJL60037.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
Length = 460
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 66/183 (36%), Positives = 97/183 (53%), Gaps = 9/183 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDV +HG + I TPNID LA NG+ LNR Y PTC+PSRA+ TG+ R GI P+
Sbjct: 46 GWNDVEYHG-SVIQTPNIDFLAKNGVELNRFYANPTCSPSRASLFTGRPASRMGIVAPIS 104
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+P + LP+ L + Y T LIGKWH+G ++ P GFD G+ +G +
Sbjct: 105 DKSQFKLPDSIATLPKLLHQNNYQTALIGKWHLGL-QQSSGPKAYGFDYSYGFLHGQIDQ 163
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQITHA 201
+++ RN E Q + TD T++++H + S + FL++ ++
Sbjct: 164 YTHLYKNG---DKSWYRNGEFIDEQ---GHATDLITNEAIHWLSEKRDSNKNFFLEVAYS 217
Query: 202 AVH 204
A H
Sbjct: 218 APH 220
>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 495
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 70/187 (37%), Positives = 92/187 (49%), Gaps = 18/187 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP---FRYGIDT 79
G+ DVGF I +PN D LA G+VLNRHY C PSRA+ LTG++P +++ + T
Sbjct: 35 GFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCAPSRASLLTGRWPHHVYQWNLAT 93
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
AG + + P LK YSTH++GKWH G LP NRGFD G+ G
Sbjct: 94 DATAGTN----LNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYLPINRGFDTSSGFLCG- 148
Query: 140 LTYNDSIHETDFAV-GLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
H T A+ +D +N AP + Y + D +I SHN PLFL
Sbjct: 149 ----SEDHMTQNAICAIDYWKNN---APDPRNGTYDAYIYRDDLTDIINSHNTDEPLFLY 201
Query: 198 ITHAAVH 204
+ VH
Sbjct: 202 LPLHNVH 208
>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 539
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 93/187 (49%), Gaps = 13/187 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DVGFHG I TP++D +A G LNR YT P C+P+RAA +TG+ P R G+ + +
Sbjct: 37 GWADVGFHGNQIIETPSLDRIAAEGTQLNRFYTTPICSPTRAALMTGRDPIRLGVAYSTI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN---G 138
+ E LP+ GY T ++GKWH+G ++ P RGF++ G+ + G
Sbjct: 97 MPWHNNGIHPEETFLPELFAGAGYQTAMVGKWHLGHAQQTYHPNARGFEHFYGHLHTEVG 156
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
+ S+ DF +RN Q YL D+ I+ + ++P F+ +
Sbjct: 157 FFPPFASLGGKDF------QRNGVSIDDQGYESYL---LADEVSRYIRERDAAKPFFIYM 207
Query: 199 THAAVHT 205
A HT
Sbjct: 208 PFIAPHT 214
>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 489
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 86/159 (54%), Gaps = 6/159 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG ++I TPNID LA + L+R Y P C+P+RA LTG YPFR+GI V
Sbjct: 57 GWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 115
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ K +P + P++L +LGY + GKWH+G P + G G++NG +
Sbjct: 116 SPTKKHGLPTLLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 175
Query: 141 TY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFF 177
Y + + D+ D+ E Y+ ++ + DF
Sbjct: 176 DYFSRERFGQLDWHRDFDSVHE-EGYSTELVGNAVVDFI 213
>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
Length = 534
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 91/190 (47%), Gaps = 10/190 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ DVG+ D+ +P +D LA G+ L RHY C PSRAA LTG YP G+ + G
Sbjct: 38 GFGDVGYS-SPDVISPTLDRLAAEGLKLGRHYAYMWCAPSRAALLTGYYPSTTGVYSTSG 96
Query: 83 AGVAKAVPVTEKLLPQYLKE-LGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
A A+P+ LLP L++ GY T +GKWH+G E LP RGFD G+ +G
Sbjct: 97 A--QNALPLEFALLPGLLRDRAGYRTAAVGKWHLGFMSEADLPERRGFDGFFGFLDGGED 154
Query: 139 -YLTYNDSIHETD--FAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y D F + RR P +Y + + + + I+ H+ + PLF
Sbjct: 155 HYSRVGAGAPGCDRVFDLWDSRRRGPATDDPSAFGRYSAELYGEAAADAIRGHDAAEPLF 214
Query: 196 LQITHAAVHT 205
L H+
Sbjct: 215 LYAAFQVAHS 224
>gi|325110321|ref|YP_004271389.1| arylsulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970589|gb|ADY61367.1| Arylsulfatase [Planctomyces brasiliensis DSM 5305]
Length = 980
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/224 (33%), Positives = 103/224 (45%), Gaps = 18/224 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ G +G+ TP++DALA G+ N + + CTPSRA LTG+Y R+G++T
Sbjct: 41 GFQGGGINGDFANLTPHLDALAEGGVRFTNGYVSAAVCTPSRAGMLTGRYQHRFGVETVY 100
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G +P +E + L++ GY T+ IGKWH+G + E LP RGFD G G T
Sbjct: 101 GRIPEAGLPASEITMADTLRKAGYRTYAIGKWHLGEHLHEHLPNQRGFDEFYGALTGART 160
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH--NHSR-PLFLQI 198
+ G +RN + Y TD Q+V I H NH+ P FL +
Sbjct: 161 F---FPYRGNNPGSKLQRNGVFLPEPLDQPYFTDLLARQTVAYIDDHVANHANAPFFLYL 217
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
AVHT + K +DR IS P R+ A
Sbjct: 218 AFTAVHTPLEADPK-----------RLDDRRIQDISPPQRKTLA 250
>gi|410628682|ref|ZP_11339400.1| arylsulfatase B [Glaciecola mesophila KMM 241]
gi|410151686|dbj|GAC26169.1| arylsulfatase B [Glaciecola mesophila KMM 241]
Length = 510
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 117/243 (48%), Gaps = 30/243 (12%)
Query: 9 VAKAVPVTEKLLPQ--GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAA 65
VAK P +L G+ D+GF G +I TPNIDALA NG+V Y T P C PSRA
Sbjct: 52 VAKERPNIVVILADDLGYADLGFTGSKEIFTPNIDALANNGVVFKNGYVTHPYCGPSRAG 111
Query: 66 FLTGKYPFRYGIDTPVGAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
LTG+Y R+G++ +PV E + +++ GY T ++GKWH+G +
Sbjct: 112 LLTGRYQARFGMEVNAAHSPDDPYMGLPVEELTFAKRMQQAGYKTAVMGKWHMGSHP-NF 170
Query: 123 LPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
P NRGFD G+ G Y + + ++++ L RN + P ++YLT +
Sbjct: 171 HPNNRGFDEFFGFLGGGHDYFPESVKVSSAEYSIALS--RNGK---PAQLNEYLTTAISK 225
Query: 180 QSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRR 239
++ + + +P + + + A H+ E++ + HI++ DRR
Sbjct: 226 EAARFVSA--TEQPFMMYVAYNAPHSPLQAT-------------EQDLAKYQHIADLDRR 270
Query: 240 LFA 242
+A
Sbjct: 271 TYA 273
>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
Length = 913
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 98/192 (51%), Gaps = 15/192 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DV +H + + TP + ALA +G+VL+R Y C+PSR++ L+G+YP G
Sbjct: 422 GWHDVPWHNPS-LKTPTLAALAADGVVLDRFYAYRFCSPSRSSLLSGRYPMHVNQYNMAG 480
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ V V + + LK GY+TH +GKWH G + +L+P RGFD +GY NG
Sbjct: 481 DALGGGVHVNMTTIAKKLKGAGYATHQLGKWHAGQSSADLVPAARGFDTSLGYLNG---A 537
Query: 143 NDSIHETDFAVGLDARRNMERYAPQ-----MSSKYLTDFFTDQSVHVIKSHNHSRPLFL- 196
D + A G+ ++ YA + Y + D ++ +I H+ S PLF+
Sbjct: 538 EDHWTQARPACGVG--NFVDLYATDGPAFGKNGTYGAQIYHDAALDIIADHDASVPLFVY 595
Query: 197 ---QITHAAVHT 205
QI HA +
Sbjct: 596 FAFQINHAPMQV 607
>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 540
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 101/210 (48%), Gaps = 8/210 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW D+ G I TP ID+L G+ L+R YT P C+P+RAA +TG+ P R GI + V
Sbjct: 35 GWADISLRGA-PIDTPAIDSLFSEGLTLDRFYTTPICSPTRAALMTGRDPLRLGISYSVV 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ V E +P+ K GY T ++GKWH+G ++E P RGFD+ G+ + +
Sbjct: 94 MPWMNNGVHPDEHFMPESFKAAGYQTAMVGKWHLGHSQEIFHPNARGFDDFYGHLHTEVG 153
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y G+D +RN A + Y T D++ IK+ + +P FL +
Sbjct: 154 YFLPFANQG---GVDFQRNGVTIADE---GYETFLLADEASRWIKARDKDKPFFLYMPFI 207
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFA 231
A H+ L + D E R+ A
Sbjct: 208 APHSPLEAPDDLVKKYENLEDTRELTRSAA 237
>gi|329928435|ref|ZP_08282305.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
gi|328937871|gb|EGG34277.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
Length = 443
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/187 (34%), Positives = 99/187 (52%), Gaps = 6/187 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G + + TP++D LA GI Y+ P C+PSRA+ LTGKYP R G+ +
Sbjct: 19 GYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEIL 78
Query: 82 GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
GA +P E L + LK GY T L GKWH+G + EE P GFD G+ G +
Sbjct: 79 GAKRGSHGLPADEVTLAKALKPAGYRTALFGKWHLGLS-EETSPNAHGFDEFFGFKAGCV 137
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVI-KSHNHSRPLFLQ 197
+ I A G++ ++ ++ + +Y+T+ T++SV I +S P FL
Sbjct: 138 DFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVDFIQRSREQEAPFFLF 197
Query: 198 ITHAAVH 204
++ A H
Sbjct: 198 ASYNAPH 204
>gi|298715187|emb|CBJ27859.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 610
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 97/207 (46%), Gaps = 24/207 (11%)
Query: 23 GWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
G ND+G+ + TP +D+LA +G+VL+ +YT CTPSRA+ +TG+ FR G+
Sbjct: 81 GTNDMGYRSTDLWELTPFLDSLASSGVVLDNYYTNQLCTPSRASLMTGRDSFRTGMQHGI 140
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD---------- 130
V +P E L K GYSTH+ GKWH+G + PF+RGFD
Sbjct: 141 VDYSAPWGLPFEEVTLADRFKAAGYSTHMTGKWHLGVFSDASYPFSRGFDTFLGYTGGGE 200
Query: 131 ---NHVGYWNGYLTYNDSIHETDFAVG-----LDARRNMERYAPQMSSKYLTDFFTDQSV 182
NH + + DF G +D N + P M Y T TD+++
Sbjct: 201 GYYNHSTCFTPTFEGGEYSCLKDFGYGDEDGYIDYTTNTTKEGPAMVDNYSTTIMTDRAI 260
Query: 183 HVIKSH----NHSRPLFLQITHAAVHT 205
V + H + PLFL + + A HT
Sbjct: 261 DVAREHTGTASSDDPLFLYVAYQAAHT 287
>gi|296121201|ref|YP_003628979.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013541|gb|ADG66780.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 479
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/189 (37%), Positives = 101/189 (53%), Gaps = 10/189 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYG--IDT 79
G+ D+G G +IPTP++D LA +GI N + + P C+PSRA FLTGKY R+G +
Sbjct: 49 GYADLGVQGGCEIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAGFLTGKYQTRFGHEFNP 108
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG- 138
VG +P+ E + L+ GY T LIGKWH G +K+ P +RGFD G+ G
Sbjct: 109 HVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQGFSKDH-HPQSRGFDEFFGFLVGG 167
Query: 139 --YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
YL + + A D PQ Y TD FT++++ + S ++P FL
Sbjct: 168 HNYLLHKEVKARFGTAHSHDMIYRGREVEPQ--EGYATDLFTNEALRWM-SGPPNKPWFL 224
Query: 197 QITHAAVHT 205
+++ AVHT
Sbjct: 225 YLSYNAVHT 233
>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 658
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 72/224 (32%), Positives = 107/224 (47%), Gaps = 25/224 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW+DVG+HG + I TPNID LA G+ L +Y C PSR LTG+Y R G+
Sbjct: 215 GWHDVGYHG-SIIDTPNIDHLAAEGVKLENYYVSSWCAPSRVNLLTGRYRIRTGL----Y 269
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-GCNKEELLPFNRGFDNHVGYWNG--- 138
V + + E L L E GY T ++GKWH+ G E P +RGF +G+ G
Sbjct: 270 GDVCDFMGIHEITLADKLYEAGYYTAMVGKWHLSGFQHRECYPAHRGFQTFLGFHGGSQN 329
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y T+ +++ D N + +Y T + +++ +I+ H +PLFL +
Sbjct: 330 YFTHRRGGSNSEY----DFWANDTSIGREYDGRYSTMVYAEEAQRIIRHHRTEQPLFLYL 385
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ AVH+ L VP E D+ I + RR++A
Sbjct: 386 SFQAVHSP-----------LLVPSAYE-DKYRTGIEDDKRRVYA 417
>gi|330505678|ref|YP_004382547.1| putative sulfatase [Pseudomonas mendocina NK-01]
gi|328919964|gb|AEB60795.1| probable sulfatase precursor [Pseudomonas mendocina NK-01]
Length = 629
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 77/229 (33%), Positives = 112/229 (48%), Gaps = 26/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G ND+ G+ PTP +DAL+ + + L RHYT TC+PSRA+ ++G++P G G
Sbjct: 48 GNNDIASWGDGRAPTPTLDALSASAVRLRRHYTDSTCSPSRASLISGRHPVSVGFQAD-G 106
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELLPFNRGFDNHVGYWNGYL 140
G++ + VT LP+ L+ LGY T +GKWH+G EE+ P +GFD YW G L
Sbjct: 107 LGLSPDL-VT---LPKSLRSLGYRTLHVGKWHLGEALEYEEIQPGQQGFD----YWFGML 158
Query: 141 TYNDSIHETDFAVGLDARRNMERY---------APQMSSKYLTDFFTDQSVHVIKSHNHS 191
N + + G RR AP YL D TD++V ++KS
Sbjct: 159 --NHFVLQGPGPDGRPVRRQPTHINPWLQDNGSAPAQHQGYLDDILTDKAVELVKSGVGE 216
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRL 240
+P F+ + + HT + T Q PD E + FA +S D +
Sbjct: 217 KPWFINLWLFSPHTPYQPSPAFST---QFPDTPEG-KYFAILSQLDHNM 261
>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
Length = 520
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 78/232 (33%), Positives = 110/232 (47%), Gaps = 33/232 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G + TPN+D LA +G++ ++ Y C+PSRA LT + P R+G + +
Sbjct: 67 GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNL 126
Query: 82 GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
A +P +EK L +L GY+T LIGKWH+G E P RGFD+
Sbjct: 127 NASDENYATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 185
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KSHNH 190
G G S H + RN +R SS+YLTDFFTD+ + I KS N
Sbjct: 186 GMLTG------SHHYFPATMKHVIERNGKR-VDDFSSEYLTDFFTDEGLRFIDQHKSANP 238
Query: 191 SRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P F+ ++ A HT E + FA+I N RR +A
Sbjct: 239 DQPWFVFFSYNAPHTPMHAT-------------EADLARFANIQNQKRRTYA 277
>gi|6863178|gb|AAF30403.1|AF109925_1 sulfatase 2 precursor [Helix pomatia]
Length = 266
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 65/215 (30%), Positives = 102/215 (47%), Gaps = 34/215 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
G+ D+G+HG + TPN+D LA G+ L +Y P C+P+R+ +TG+Y G+ +
Sbjct: 39 GYRDIGYHGA-EFATPNLDKLAAEGVKLENYYVQPICSPTRSQLMTGRYQIHTGLQHDII 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--- 138
+P+ + LK +GYSTH IGKWH+G K+E P RGFD++ GY G
Sbjct: 98 WPSQPYGLPLQFPTIADMLKSVGYSTHAIGKWHLGLYKKEYTPLYRGFDSYYGYLEGGED 157
Query: 139 -YLTYN-DSIH-------------------------ETDFAVGLDARRNMERYAPQMSSK 171
Y YN D+ H + + G D R+M M+
Sbjct: 158 YYTYYNCDTFHNRTTPADTSILESYSPKNILLGKHEDENKWCGYDL-RDMNEPVTDMNGT 216
Query: 172 YLTDFFTDQSVHVIK-SHNHSRPLFLQITHAAVHT 205
Y T +T +++ +I + +P L + + AVH+
Sbjct: 217 YSTHLYTKKAIDIINGASTGGKPFLLYLAYQAVHS 251
>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 472
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 63/187 (33%), Positives = 97/187 (51%), Gaps = 14/187 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG ++I TPNID LA + L+R Y P C+P+RA LTG YPFR+GI V
Sbjct: 40 GWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 98
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ K +P + P++L +LGY + GKWH+G P + G G++NG +
Sbjct: 99 SPTKKHGLPPQLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 158
Query: 141 TY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y + + D+ D+ E Y+ ++ + DF I + ++ P++ +
Sbjct: 159 DYFSRERFGQLDWHRDFDSVHE-EGYSTELVGNAVVDF--------IDRNANAGPVYAYV 209
Query: 199 THAAVHT 205
A H+
Sbjct: 210 AFNAPHS 216
>gi|355689580|gb|AER98880.1| galactosamine -6-sulfate sulfatase [Mustela putorius furo]
Length = 503
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 100/202 (49%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 23 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 82
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P E+LLP+ LK GY++ ++GKWH+G ++ + P RGFD
Sbjct: 83 GHARNAYTPQEIVGGIPAEERLLPELLKGAGYASKIVGKWHLG-HRPQFHPLKRGFDEWF 141
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVI-KS 187
G N + D+ + V D R E + + + LT +T +++ + +
Sbjct: 142 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQLYTQEALDFVQRQ 201
Query: 188 HNHSRPLFL----QITHAAVHT 205
H RP FL THA V+
Sbjct: 202 HAARRPFFLYWAIDATHAPVYA 223
>gi|313212712|emb|CBY36647.1| unnamed protein product [Oikopleura dioica]
Length = 260
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 91/158 (57%), Gaps = 6/158 (3%)
Query: 59 CTPSRAAFLTGKYPFRYGIDT-PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGC 117
C+PSRA FLTG+Y FRYG+ + P+ + EKLLP+YLKE+GY TH +GKWH+G
Sbjct: 38 CSPSRAQFLTGRYAFRYGLGSDPISFENPIGMSTKEKLLPEYLKEVGYETHAVGKWHLGY 97
Query: 118 NKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVG--LDARRNMERYAPQMSSKYLTD 175
E P NRGFD +G++ G + Y+ H T A+G L+ N E + P+ ++ +
Sbjct: 98 CNESFQPHNRGFDTFLGHYGGGVDYH--THATQGALGSYLNHFLNGEPHIPEDGFEFASY 155
Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKL 213
+++++ V++ N +P F+ + A H A L
Sbjct: 156 AWSNRTRKVLRE-NTDKPNFVYLAFNAPHEKVAAPQDL 192
>gi|300773187|ref|ZP_07083056.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
gi|300759358|gb|EFK56185.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
Length = 443
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 96/185 (51%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ DVG +G +I TPN+D +A G+ + +Y+ P CT SR A LTGKYP R G +
Sbjct: 37 GYGDVGINGNPNIETPNLDRMAMEGMRFSNYYSASPACTASRYALLTGKYPSRAGFRWVL 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ E + + LKE GY T + GKWH+G ++E LP GFD +VG L
Sbjct: 97 NPTDQIGIHQQESTIAERLKEKGYRTAIYGKWHLGSTRKEFLPLANGFDEYVG-----LP 151
Query: 142 Y-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y ND I + L + + P S LT +T++++ I + N +P F+ + +
Sbjct: 152 YSNDMIPPKYPDIALLSGYDTLELNPDQSK--LTRLYTEKAIAFI-TKNAKQPFFIYLPY 208
Query: 201 AAVHT 205
A HT
Sbjct: 209 AMPHT 213
>gi|291227581|ref|XP_002733761.1| PREDICTED: arylsulfatase A-like, partial [Saccoglossus kowalevskii]
Length = 158
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 52/113 (46%), Positives = 70/113 (61%), Gaps = 2/113 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPV 81
GWNDVG+H + I TP++D +A +G+ L +Y CTP+R FLTGK+ + + +
Sbjct: 36 GWNDVGYH-NSSISTPHMDTIANDGVKLESYYVGHVCTPTRGMFLTGKHMINLRLYNGII 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
G K +PV E + Q L+E Y+TH IGKWH+G KEE LP NRGFD G
Sbjct: 95 GGHDPKCLPVNEVTVAQKLREYNYATHAIGKWHLGYYKEECLPINRGFDTFFG 147
>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 467
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 96/185 (51%), Gaps = 16/185 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DVGFH N +PTPN+D LA G+ L +HY P C+P+R AFL+G+Y R+ + TP
Sbjct: 52 GWGDVGFHHGN-VPTPNLDHLAGEGLELMQHYVYPVCSPTRCAFLSGRYASRFSVTTPQN 110
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ VT L + LK +GY T L GKWH+G +K E P GFD+ G G +
Sbjct: 111 PRAFRWDTVT---LARALKSVGYDTALCGKWHLG-SKPEWGPQKFGFDHSYGSLAGGVGP 166
Query: 143 ND---SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
D I E D + E+ ++TD T ++V ++S +P FL +
Sbjct: 167 WDHHYKIGEFTQTWHRDGKLIEEQ-------GHVTDLITKEAVEWLESRT-DKPFFLYVP 218
Query: 200 HAAVH 204
AVH
Sbjct: 219 FTAVH 223
>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 511
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 75/237 (31%), Positives = 110/237 (46%), Gaps = 31/237 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ DVGF+G DI TP +D LA NG + Y P C PSR+A LTG+YP G +
Sbjct: 59 GYADVGFNGSTDILTPELDNLAQNGSIFTSAYVAHPFCGPSRSAILTGRYPHLTGTAYNL 118
Query: 82 GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
++ VPV E + + L+ GY T IGKWH+G + P RGFD+ G+
Sbjct: 119 FHNSSEDDKDNMGVPVEETYMSKVLQNAGYYTSAIGKWHLGA-APKFHPNKRGFDDFYGF 177
Query: 136 WNGYLTYNDSIHETDFAVGLDARR-NMERYA--------PQMSSKYLTDFFTDQSVHVIK 186
G Y S ++ + A N+ Y P ++Y+TD F+ +++ IK
Sbjct: 178 LGGGHDYFPSEYQKTYKAQKKAGNPNIRDYVFPMEHNGKPANETEYITDGFSREAIKNIK 237
Query: 187 -SHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ +P F+ + + A H A E+ FAHI + DRR +A
Sbjct: 238 IAAAKKQPFFIYLAYNAPHVPLQAKA-------------EDVAKFAHIKDKDRRTYA 281
>gi|260824685|ref|XP_002607298.1| hypothetical protein BRAFLDRAFT_88247 [Branchiostoma floridae]
gi|229292644|gb|EEN63308.1| hypothetical protein BRAFLDRAFT_88247 [Branchiostoma floridae]
Length = 178
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 9/124 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTP-------SRAAFLTGKYPFRY 75
GWNDVG+H D+ TP +D LA G++LN+ Y CTP SR AF+TG +P+
Sbjct: 39 GWNDVGWHNP-DVKTPVLDQLANEGVILNQSYVNYVCTPFPVVKSRSRTAFMTGYFPYHV 97
Query: 76 GIDTPVGAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
G V A+ +P LP+ LK+LGY+TH++GKWH+G P RGFD+ G
Sbjct: 98 GTQHQVFFPFQAQGIPSNFSFLPEKLKDLGYATHMVGKWHLGFCNWNYTPTYRGFDSFFG 157
Query: 135 YWNG 138
Y+NG
Sbjct: 158 YYNG 161
>gi|261404208|ref|YP_003240449.1| sulfatase [Paenibacillus sp. Y412MC10]
gi|261280671|gb|ACX62642.1| sulfatase [Paenibacillus sp. Y412MC10]
Length = 452
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/187 (34%), Positives = 99/187 (52%), Gaps = 6/187 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G + + TP++D LA GI Y+ P C+PSRA+ LTGKYP R G+ +
Sbjct: 28 GYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVGEIL 87
Query: 82 GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
GA +P E L + LK GY T L GKWH+G + EE P GFD G+ G +
Sbjct: 88 GAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLS-EETSPNAHGFDEFFGFKAGCV 146
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVI-KSHNHSRPLFLQ 197
+ I A G++ ++ ++ + +Y+T+ T++SV I +S P FL
Sbjct: 147 DFYSHIFYWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVDFIQRSREQEAPFFLF 206
Query: 198 ITHAAVH 204
++ A H
Sbjct: 207 ASYNAPH 213
>gi|410097286|ref|ZP_11292268.1| hypothetical protein HMPREF1076_01446 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224604|gb|EKN17536.1| hypothetical protein HMPREF1076_01446 [Parabacteroides goldsteinii
CL02T12C30]
Length = 446
Score = 103 bits (257), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 93/186 (50%), Gaps = 10/186 (5%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
G+ D+G G DI TPN+D +A +G+ +Y+ P T SR + LTG+YP R G
Sbjct: 39 MGYGDIGVTGHPDIKTPNLDRMALDGMRFTNYYSASPASTASRYSLLTGRYPVRAGFRWV 98
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ + + E + + LKE GY+T + GKWH+G K+E LP GFD +VG L
Sbjct: 99 LSPDAERGIHPRELTIAELLKEQGYATAIYGKWHLGSTKKEYLPLQNGFDEYVG-----L 153
Query: 141 TY-NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
Y ND I + L + R P S LT +T++++ IK H F+ +
Sbjct: 154 PYSNDMIPPKYPDIALMCGNDTLRMNPDQSE--LTALYTEKAISFIKKHKKEN-FFVYVP 210
Query: 200 HAAVHT 205
+A H
Sbjct: 211 YAMPHV 216
>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 498
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 70/197 (35%), Positives = 100/197 (50%), Gaps = 24/197 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G + TPN+D LA +G++ ++ Y C+PSRA LTG+ P R+G + +
Sbjct: 45 GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104
Query: 82 GAGVAK--------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
A +P +EK L +L GY+T LIGKWH+G E P RGFD+
Sbjct: 105 NASDENYATRPELLGLPTSEKTLADHLGAAGYATALIGKWHLGMG-EMHHPNRRGFDHFC 163
Query: 134 GYWNGYLTYNDSIHETDFAVGLD--ARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH--- 188
G G Y F ++ RN +R SS+YLTDFFTD+ + I H
Sbjct: 164 GMLTGGHHY--------FPTTMNHVIERNGKR-VENFSSEYLTDFFTDEGLRFIDQHESA 214
Query: 189 NHSRPLFLQITHAAVHT 205
N +P F+ ++ A HT
Sbjct: 215 NPDQPWFVFFSYNAPHT 231
>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 462
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 92/198 (46%), Gaps = 23/198 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D GFHG + TP +D A N + ++ Y + C PSRA LTGKY ++G +
Sbjct: 37 GYADFGFHGSKEFKTPELDKFAKNAVRFSQAYVSAAVCGPSRAGLLTGKYQQKFGFEENN 96
Query: 82 GAGVAK---------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
G+ +P+ +K + YLKE GY T L GKWH G N + P RGFD
Sbjct: 97 VPGLMSKNGLTGDDMGLPLDQKTIADYLKEQGYRTALFGKWHQG-NADRFHPTKRGFDEF 155
Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVIKS 187
G+ G +Y F + RN +R Q YLT+ D+++ I+
Sbjct: 156 YGFRGGARSY------MPFGADNELTRNEDRLERGFGGFQEHEGYLTEELADEAIAFIE- 208
Query: 188 HNHSRPLFLQITHAAVHT 205
N P F+ + AVHT
Sbjct: 209 RNQKNPFFVYLAFNAVHT 226
>gi|334139745|ref|YP_004532943.1| sulfatase [Novosphingobium sp. PP1Y]
gi|333937767|emb|CCA91125.1| sulfatase [Novosphingobium sp. PP1Y]
Length = 472
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 74/236 (31%), Positives = 108/236 (45%), Gaps = 32/236 (13%)
Query: 24 WNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPVG 82
W DV +G D+PTPNID +A G+ + Y + C SRA +TG+ P R+G +
Sbjct: 39 WADVSTYGRTDVPTPNIDRIAKTGVAFSSGYVAASVCAVSRAGLMTGRMPQRFGFTYNIN 98
Query: 83 --AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
V +PV +K + L+ LGY T GKWH+G ++ + P NRGFD G+ G
Sbjct: 99 DKGDVGAGLPVGQKTIADRLQPLGYRTAAFGKWHLGADR-QFYPTNRGFDEFFGFLAGET 157
Query: 141 TYND---------SIHETDFAVGLDARRNMERYAPQMS-----SKYLTDFFTDQSVHVI- 185
Y D + +G + P SKYLT+ TD++V I
Sbjct: 158 NYVDPKTPGIVTTPTKVDKYEIGPGEGNHAMVEGPDARPADDFSKYLTNQITDRAVDFIN 217
Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
+S + +P F + + A H LQVP DR FA++ +P RR +
Sbjct: 218 RSADAKQPFFSYVAYNAPHWP-----------LQVP-QAYYDR-FANVKDPVRRTY 260
>gi|315644664|ref|ZP_07897795.1| sulfatase [Paenibacillus vortex V453]
gi|315279923|gb|EFU43222.1| sulfatase [Paenibacillus vortex V453]
Length = 439
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 64/187 (34%), Positives = 98/187 (52%), Gaps = 6/187 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G + + TP++D LA G+ Y+ P C+PSRA+ LTGKYP R G+ +
Sbjct: 15 GYGDLGCYGSDSVRTPHLDGLADEGVRFTNWYSNSPVCSPSRASLLTGKYPVRAGVGEIL 74
Query: 82 GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
GA +P E L + LK GY T L GKWH+G +K E P GFD G+ G +
Sbjct: 75 GAKRGSHGLPAAEVTLAKALKPAGYRTALYGKWHLGLSK-ETSPNAHGFDEFFGFKAGCV 133
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIK-SHNHSRPLFLQ 197
+ I G++ ++ ++ + +Y+T+ T++SV IK S P FL
Sbjct: 134 DFYSHIFYWGQGHGVNPLHDLWENETEVWENGRYMTELITERSVDFIKRSREQEAPFFLF 193
Query: 198 ITHAAVH 204
++ A H
Sbjct: 194 ASYNAPH 200
>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
Length = 492
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 52/117 (44%), Positives = 74/117 (63%), Gaps = 2/117 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G++D+G+ D+ +PNIDALA + + L +HY P+CTPSRAAFLTG+Y R G+ + V
Sbjct: 35 GFDDLGYVNR-DVISPNIDALAKDALHLKKHYVQPSCTPSRAAFLTGRYNIRMGMQSGVI 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ +P+ E LL + K+ GY T L GKWH+G + P NRGFD G++ G
Sbjct: 94 RPSEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQNRGFDRFYGFYLG 150
>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 472
Score = 103 bits (256), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 93/185 (50%), Gaps = 10/185 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG ++I TPNID LA + L+R Y P C+P+RA LTG YPFR+G V
Sbjct: 40 GWNDVGFHG-SEIRTPNIDRLANESVTLDRFYVTPICSPTRAGVLTGLYPFRFGFWGGVV 98
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ K +P + P++L +LGY + GKWH+G P G G++NG +
Sbjct: 99 SPTKKHGLPPQLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLQHGMTEFYGHYNGAI 158
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y F LD RN + + Y T+ + V I + ++ P++ +
Sbjct: 159 DY---FSRERFGQ-LDWHRNFDSVHEE---GYSTELVGNAVVDFIDRNANAGPVYAYVAF 211
Query: 201 AAVHT 205
A H+
Sbjct: 212 NAPHS 216
>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 512
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 71/225 (31%), Positives = 111/225 (49%), Gaps = 34/225 (15%)
Query: 23 GWNDVGFHGEND---IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
G+NDVG+ G+N TP +D+LA NG+ L +YT C+P+R A +TG R ID
Sbjct: 40 GYNDVGYWGQNHGSAAKTPFLDSLAENGVRLENYYTHSVCSPTRGALMTG----RNRIDI 95
Query: 80 PVGAGVA-----KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+ G+ + +P+ LLP+ L GY+T +IGKWH+G + + P+NRGF G
Sbjct: 96 GLAHGIIHTTQIEGLPLDNVLLPEQLSNCGYNTQMIGKWHLGFSSSKYAPWNRGFHGFYG 155
Query: 135 -------YWNGYL--TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
YW+ +L + +I DF N + +Y + ++ +VI
Sbjct: 156 FLAGSENYWSKWLPMARHSNIGGVDFTDSTTGPTN------ETWGQYSAHVYASRARYVI 209
Query: 186 KSHNHSRPLFLQITHAAVHT--GTAGNAKLPTGLLQVPDMEENDR 228
+ H+ S+PLFL + HT G + P D+E++DR
Sbjct: 210 QHHDQSKPLFLYLPLQTPHTPLGAPSHYYEP-----FKDIEDDDR 249
>gi|149177349|ref|ZP_01855954.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
8797]
gi|148843874|gb|EDL58232.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
8797]
Length = 472
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 75/237 (31%), Positives = 109/237 (45%), Gaps = 38/237 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID-TP 80
G+ ++G G IPTP+ID+LA +GI + Y T P C+PSRA LTG+ P R+G + P
Sbjct: 37 GYGELGCQGNPQIPTPHIDSLASHGIRFTQAYVTAPNCSPSRAGLLTGRIPTRFGYEFNP 96
Query: 81 VGA---GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW- 136
+GA +P E+ + + L + GY+T LIGKWH+G + PF GFD G+
Sbjct: 97 IGARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLG-GTADYHPFRHGFDEFFGFMH 155
Query: 137 --------------------------NGYLTYNDSIHETDFAV---GLDARRNMERYA-P 166
G + I+ T DA + R P
Sbjct: 156 EGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHMGYDEPDYDANNPIIRGGQP 215
Query: 167 QMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDM 223
++YLTD FT ++V I H +P FL + + AVH+ G K Q+ D+
Sbjct: 216 VNETEYLTDAFTREAVSFINRH-QDKPFFLYLAYNAVHSPLQGKKKDIQHFTQIEDI 271
>gi|313246966|emb|CBY35811.1| unnamed protein product [Oikopleura dioica]
Length = 388
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 91/158 (57%), Gaps = 6/158 (3%)
Query: 59 CTPSRAAFLTGKYPFRYGIDT-PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGC 117
C+PSRA FLTG+Y FRYG+ + P+ + EKLLP+YLKE+GY TH +GKWH+G
Sbjct: 21 CSPSRAQFLTGRYAFRYGLGSDPISFENPIGMSTKEKLLPEYLKEVGYETHAVGKWHLGY 80
Query: 118 NKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVG--LDARRNMERYAPQMSSKYLTD 175
E P NRGFD +G++ G + Y+ H T A+G L+ N E + P+ ++ +
Sbjct: 81 CNESFQPHNRGFDTFLGHYGGGVDYH--THATQGALGSYLNHFLNGEPHIPEDGFEFASY 138
Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKL 213
+++++ V++ N +P F+ + A H A L
Sbjct: 139 AWSNRTRKVLRE-NTDKPNFVYLAFNAPHEKVAAPQDL 175
>gi|423219918|ref|ZP_17206414.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
CL03T12C61]
gi|392624181|gb|EIY18274.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
CL03T12C61]
Length = 463
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 100/185 (54%), Gaps = 9/185 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ND GF G ++ TPNIDAL G+V + H +PSRA +TG+Y R+G + +
Sbjct: 43 GYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGRYGHRFGYECNL 102
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ E+ + + K GY T IGKWH+G +++E P NRGFD G G
Sbjct: 103 -SDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNRGFDLFYGMKAGGRD 160
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMS-SKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y + ++D RN+ Q+ KYLTD F++++V I + S+P + + +
Sbjct: 161 YFYNEKKSDRP---GDERNLLLNDRQVKFEKYLTDAFSEKAVEFI--NESSQPFMMYLAY 215
Query: 201 AAVHT 205
AVHT
Sbjct: 216 NAVHT 220
>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
Length = 545
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/184 (34%), Positives = 91/184 (49%), Gaps = 8/184 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW DVG+HG DI TP++D LA G+ LNR YT P C+P+RAA +TG+ P R G+ V
Sbjct: 44 GWADVGYHG-GDIDTPSLDRLAQQGVRLNRFYTTPICSPTRAALMTGRDPIRLGVTYGVI 102
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
V E +P+ + GY T +IGKWH+G + P NRGF++ G+ + +
Sbjct: 103 FPWDNIGVHPDEHFMPETFQAAGYQTAIIGKWHLGHAQMTYHPNNRGFEHFYGHLHTEVG 162
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
+ G D +RN Q YL D+ I+ + RP + +
Sbjct: 163 FYPPFSNQG---GKDFQRNGVSIDDQGYETYL---LADEVSRYIRERDRDRPFLVYMPFI 216
Query: 202 AVHT 205
A HT
Sbjct: 217 APHT 220
>gi|116621986|ref|YP_824142.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116225148|gb|ABJ83857.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 461
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/192 (35%), Positives = 93/192 (48%), Gaps = 16/192 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G + I TPNID LA G Y+ P C+PSRAA +TG+YP R + +
Sbjct: 39 GYGDLGCYG-SPIATPNIDRLAEEGARFTSFYSASPVCSPSRAALMTGRYPTRVEVPVVL 97
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G G A +P +E + Q LK GY T IGKWHIG + LP NRGFD G +
Sbjct: 98 GPGDA-GLPDSEITMAQVLKSAGYRTSCIGKWHIG-STPGYLPTNRGFDEFFG-----VP 150
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y+ I R AP + LT FT +++ ++ P FL + H
Sbjct: 151 YSADITPCPLM------RGSSVVAPAVDCSTLTSSFTQEALDFMR-RAQDNPFFLYLAHT 203
Query: 202 AVHTGTAGNAKL 213
A H A + +
Sbjct: 204 APHLPLAASPRF 215
>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
Length = 502
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 76/230 (33%), Positives = 108/230 (46%), Gaps = 19/230 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
G+ D GF G + I TPN+D LA V + Y + C PSRA LTGKY R+G +
Sbjct: 71 GYGDFGFQGSSQIRTPNLDNLAVQSTVFTQAYVSAAVCGPSRAGILTGKYQQRFGFEENN 130
Query: 79 -----TPVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
+ G G +P+ ++ + YL GYST LIGKWH G N ++ P RGF++
Sbjct: 131 VPGYMSDSGLTGDDMGLPLNQRTIGDYLTHFGYSTALIGKWHQG-NADKFHPTKRGFEHF 189
Query: 133 VGYWNGYLTYNDSIHETDFAVGLDA-RRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
G+ G +Y + + D R Y + S YLT D+++ IK N
Sbjct: 190 YGFRGGARSYFEFGPNNPVSYPEDRLERGFAHY--KESPHYLTQALADEAIKFIK-QNQR 246
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDME-ENDRTFAHISNPDRRL 240
P FL ++ AVHT N + L Q P + + R A + DR +
Sbjct: 247 EPFFLFLSFNAVHTPMDANKE---DLAQFPQLSGKRQRVAAMTLSMDREI 293
>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 470
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 77/221 (34%), Positives = 106/221 (47%), Gaps = 24/221 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
GW + G G IPTP+ID++A NG+ + + T C+PSRA LTG+YP R+G +
Sbjct: 52 GWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGRYPTRFGHEFNR 111
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A V+ + + E L L LGY T +GKWH+G + E P RGFD G L
Sbjct: 112 IANVS-GLDLQETTLADRLHGLGYKTACVGKWHLG-DGPEYRPTKRGFDEFF----GTLA 165
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
H T F +D+R + + + Y TD + +SV I S P FL +
Sbjct: 166 NTPFFHPTKF---VDSRVSNDVAEVSDENFYTTDEYAKRSVEWIGQQQQS-PWFLYLPFN 221
Query: 202 AVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
A H LQ P + DR F I++P R+LFA
Sbjct: 222 AQHAP-----------LQAP-QKYLDR-FESIADPKRKLFA 249
>gi|153807102|ref|ZP_01959770.1| hypothetical protein BACCAC_01379 [Bacteroides caccae ATCC 43185]
gi|149130222|gb|EDM21432.1| arylsulfatase [Bacteroides caccae ATCC 43185]
Length = 463
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 100/185 (54%), Gaps = 9/185 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ND GF G ++ TPNIDAL G+V + H +PSRA +TG+Y R+G + +
Sbjct: 43 GYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGRYGHRFGYECNL 102
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +P+ E+ + + K GY T IGKWH+G +++E P NRGFD G G
Sbjct: 103 -SDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNRGFDLFYGMKAGGRD 160
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMS-SKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y + ++D RN+ Q+ KYLTD F++++V I + S+P + + +
Sbjct: 161 YFYNEKKSDRP---GDERNLLLNDRQVKFEKYLTDAFSEKAVEFI--NESSQPFMMYLAY 215
Query: 201 AAVHT 205
AVHT
Sbjct: 216 NAVHT 220
>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
Length = 579
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/187 (33%), Positives = 97/187 (51%), Gaps = 14/187 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWNDVGFHG ++I TPNID LA + L+R Y P C+P+RA LTG YPFR+GI V
Sbjct: 147 GWNDVGFHG-SEIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 205
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTH-LIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ K +P + P++L +LGY + GKWH+G P + G G++NG +
Sbjct: 206 SPSKKHGLPPQLETAPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 265
Query: 141 TY--NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y + + D+ D+ E Y+ ++ + DF I + ++ P++ +
Sbjct: 266 DYFSRERFGQLDWHRDFDSVHE-EGYSTELVGNAVVDF--------IDRNANAGPVYAYV 316
Query: 199 THAAVHT 205
A H+
Sbjct: 317 AFNAPHS 323
>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/184 (36%), Positives = 92/184 (50%), Gaps = 10/184 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ DVGF I +PN D LA G+VLNRHY C+PSRA+FLTG++P P+
Sbjct: 34 GFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCSPSRASFLTGRWPHHAHQWNPLM 92
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ + +LP LK Y+TH++GKWH+G LP NRGFD G+ G
Sbjct: 93 DNMI-GTNLNMTMLPAKLKAANYATHMVGKWHLGFFDPRYLPINRGFDTSTGFLGG---G 148
Query: 143 NDSIHETDFAVGLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
D ++E +D +N AP + Y + D V+ +HN PLF +
Sbjct: 149 EDHMNEKS-GCSIDYWKNN---APDPRNGTYDAYNYRDDLTDVMNNHNADNPLFFYLPLH 204
Query: 202 AVHT 205
VHT
Sbjct: 205 NVHT 208
>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 471
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/226 (30%), Positives = 105/226 (46%), Gaps = 19/226 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG---ID 78
G+ D GF G + TPN+D LA G+ + Y + TC PSRA +TG+Y ++G I+
Sbjct: 38 GYADFGFQGSETMKTPNLDQLASEGVRFTQGYVSDSTCGPSRAGIMTGRYQQKFGYEEIN 97
Query: 79 TP-------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
P G +P+ E + Y+K LGY T GKWH+G +EL P +RGFD
Sbjct: 98 VPGYMSEHSAIKGAEMGIPLDEVTMGDYMKSLGYRTAFYGKWHLGGT-DELHPMHRGFDE 156
Query: 132 HVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
G+ G Y Y + E AV D + Q YLTD +++ I+
Sbjct: 157 FYGFRGGDRSYWAYEVNAPERKSAVFTDKKLEHGIDQFQEHEGYLTDVLAEKANQFIEKA 216
Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
+P F+ ++ AVHT P L + P ++ + A ++
Sbjct: 217 -PDKPFFIFLSFNAVHTPMEAT---PEDLAKFPQLKGKRKEVAAMT 258
>gi|449138311|ref|ZP_21773581.1| arylsulfatase [Rhodopirellula europaea 6C]
gi|448883084|gb|EMB13628.1| arylsulfatase [Rhodopirellula europaea 6C]
Length = 585
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 65/187 (34%), Positives = 90/187 (48%), Gaps = 24/187 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW D+ HG + I TP +DALA L+R Y P C P+RAA LTG+YP R G+
Sbjct: 40 QGWGDLASHGNSKISTPTLDALANQSARLDRFYVSPVCAPTRAALLTGRYPERTGV---- 95
Query: 82 GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
AGV + + E L + + GY+T GKWH G + L P +GFD G+ G
Sbjct: 96 -AGVTGRREVMRAEETTLAEMFQAAGYATGCFGKWHNGA-QMPLHPNGQGFDEFFGFCGG 153
Query: 139 YLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y+D++ E RN P + Y+TD TD ++ + H P F
Sbjct: 154 HFNLYDDALLE----------RN---GTPVQTKGYITDVLTDAAIEFVNVH-RDHPFFCY 199
Query: 198 ITHAAVH 204
+ A H
Sbjct: 200 VPLNAPH 206
>gi|87307004|ref|ZP_01089150.1| arylsulfatase [Blastopirellula marina DSM 3645]
gi|87290377|gb|EAQ82265.1| arylsulfatase [Blastopirellula marina DSM 3645]
Length = 542
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 102/199 (51%), Gaps = 23/199 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----D 78
G++D+G+HG +I TPNIDALA++G+ ++ Y C P+RA +TG YP + GI +
Sbjct: 40 GFSDLGYHG-GEIATPNIDALAHSGVRFSQFYNNGRCCPTRATLMTGLYPHQTGIGHMTE 98
Query: 79 TPVGAGVAKAVPVTEK--------LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
+P A P T + + + L++ GY+T + GKWH+G N + P RGF+
Sbjct: 99 SPGEANYGSGKPPTYQGYLNRNCVTIAEALQQQGYATLMSGKWHLGENDKSRWPLQRGFE 158
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKS 187
+ G +G Y + +G N + P+ ++ Y TD FTD ++ +K
Sbjct: 159 KYFGCLSGATLYFFPDGDRKMTLG-----NQQIAEPESTTDQPFYTTDAFTDYAIRFLKE 213
Query: 188 HN--HSRPLFLQITHAAVH 204
RP+FL + + A H
Sbjct: 214 EQAGQQRPMFLYLAYTAPH 232
>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1818
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/222 (33%), Positives = 107/222 (48%), Gaps = 54/222 (24%)
Query: 23 GWNDVGFHGE----NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR---- 74
G+ DVG++G+ N + TP IDALA G+ L+R+YT P CTPSRAA L+GKYP
Sbjct: 55 GFGDVGYNGDPTLTNRVSTPVIDALADAGVKLSRYYTQPDCTPSRAALLSGKYPATTGTY 114
Query: 75 YGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+G+ P +P+ LLP+ L Y +H +GKW +G + + LP RGFD+ +G
Sbjct: 115 HGVLNPQS---TWGLPLEHALLPEALPG-AYRSHAVGKWDVGHSSAKRLPEARGFDSFLG 170
Query: 135 YWNGYLTY-----NDSIHE---------------------------TDFAVGLDARRNME 162
+ YL + + S HE DF+ GL +R
Sbjct: 171 F---YLCFYGPMIDYSTHEIHDHDLACAGDACAAALAKCQVRGSTVADFS-GLGGQRR-- 224
Query: 163 RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
Y TD F D++V +I++ PLFL + AVH
Sbjct: 225 ----DYDGMYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVH 262
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 67/209 (32%), Positives = 101/209 (48%), Gaps = 29/209 (13%)
Query: 23 GWNDVGFHGE----NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI- 77
G++DVG++ + N + TP +D+LA G+ L R+YT P CTPSRAA L+G YP G+
Sbjct: 630 GFDDVGYNSDPSKTNQVQTPFLDSLAAGGVKLARYYTQPDCTPSRAALLSGMYPASSGMY 689
Query: 78 DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
+ A + + +L+PQ L Y +H +GKW +G +P RGF + +G+++
Sbjct: 690 HKMITAQSNWGLDLDLELIPQRLPA-AYRSHAVGKWDVGHYTWSHVPQFRGFRSFLGFYS 748
Query: 138 GYLTYNDSIHET-DFAVGLDARRNME---RYAPQMSSK-----------------YLTDF 176
+ Y HET D L+ E R A + SS Y TD
Sbjct: 749 PIIDYY--THETFDTLQCLEEMELTECEARLASECSSSIKDFNFDGDPLPLADGTYSTDV 806
Query: 177 FTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
F ++ +I+ PLFL + AVH
Sbjct: 807 FAARARDLIRKEAPKHPLFLYVAFNAVHA 835
>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
Length = 507
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 70/236 (29%), Positives = 110/236 (46%), Gaps = 32/236 (13%)
Query: 23 GWNDVGFHGE---NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPF------ 73
GW +VG+H ++ TPNID L G+ LN+HY C+PSR++ ++G+ P
Sbjct: 35 GWANVGYHRNPPTREVVTPNIDDLVKQGLELNQHYAYRCCSPSRSSLISGRLPIHVSDQN 94
Query: 74 ----RYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
Y + P+ A+P + + +KE GY+TH +GKW G + P RGF
Sbjct: 95 IAPTNYNPNDPISG--FSAIPRNMTGIAEKMKEAGYATHQVGKWDAGMATPDHTPKGRGF 152
Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMS----SKYLTDFFTDQSVHVI 185
D GY++ Y Y + ++ G+ N ++ A ++ KY F ++ + ++
Sbjct: 153 DTSFGYFHHYNDYYTEVVDSCNGTGVVDLWNTDQPAHGINGTGPDKYEEALFRERLLDIV 212
Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
H+ S PLFL VHT LQVPD N F+ I + DR +
Sbjct: 213 SKHDPSTPLFLYYAPHIVHT-----------PLQVPDEYLN--KFSFIDDKDRMYY 255
>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 75/229 (32%), Positives = 117/229 (51%), Gaps = 27/229 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYP--FRYGIDT 79
G+ D+GF G DI TP+IDALA +G+ + Y + C PSRA LTG+Y F G +
Sbjct: 33 GYEDLGFLGAPDIKTPHIDALARSGMNFTQGYQSASVCGPSRAGLLTGRYQQLFGSGENP 92
Query: 80 PVGAGVAK-----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
P ++K +P+ E+++ LK Y+T +IGKWH+G + E+ P R D + G
Sbjct: 93 PETGELSKRFPDAGIPLDEQMIFDLLKPAAYTTGVIGKWHMGLSHEQ-RPTQRSVDYYYG 151
Query: 135 YWNGYLTYNDSIHETDFA-VGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
+ NG +Y ++ + A + RN E P S Y T+ F D+ V+ IK N +P
Sbjct: 152 FLNGAHSYREAKMDMKGAPMTWPIFRNNE---PVPFSGYTTEVFNDEGVNFIK-RNKDKP 207
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
FL +++ +VH K D++ +D HI RR+++
Sbjct: 208 FFLYMSYNSVHGPWEAQPK---------DLQRSD----HIKKKWRRIYS 243
>gi|114326210|ref|NP_001041587.1| arylsulfatase E precursor [Canis lupus familiaris]
gi|81158056|tpe|CAI85002.1| TPA: arylsulfatase E [Canis lupus familiaris]
Length = 585
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/137 (43%), Positives = 77/137 (56%), Gaps = 14/137 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N I TPNID LA +G++L +H + CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAFLTGRYPLRSGMVSSN 104
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
G GV+ +P E + LK+ GY+T LIGKWH+G N E P N GFD
Sbjct: 105 GYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCESSNDHCHHPLNHGFD 164
Query: 131 NHVGYWNGYLTYNDSIH 147
+ G + D IH
Sbjct: 165 HFYGM--PFSMMGDCIH 179
>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 469
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 50/114 (43%), Positives = 70/114 (61%), Gaps = 2/114 (1%)
Query: 23 GWNDVGFHGEND-IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TP 80
G+NDVG+H + I T NIDALA G+ L +Y P CTPSR+ FL+GKY G+
Sbjct: 40 GYNDVGYHSDGSAIETDNIDALAAGGLKLESYYVAPLCTPSRSQFLSGKYLIHNGMQHLV 99
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+ V + +P+ + + L + GY+THL+GKWH+G K+E P NRGF + G
Sbjct: 100 IDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWPLNRGFQSFFG 153
>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 468
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 93/194 (47%), Gaps = 14/194 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID--- 78
G+ D GF G PTPN+D LA G+V + YT C PSRA LTG+Y R+G +
Sbjct: 37 GYFDFGFQGSKTFPTPNLDQLAKEGMVFKQAYTTAAVCGPSRAGLLTGRYQQRFGFEENN 96
Query: 79 -------TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
+ G +P+ EK + YL +LGY + ++GKWH+G N + P RGF
Sbjct: 97 VPGYMSKSSKLLGDDMGLPLDEKTMADYLGKLGYQSIVLGKWHMG-NADRYHPLKRGFTE 155
Query: 132 HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
G+ G ++ + + A + R + Q KYLT D + I N
Sbjct: 156 FYGFRGGARSFY-PLTQKQAADKPEDRLEIGYKKYQEPKKYLTYDLADAACDFI-DRNKK 213
Query: 192 RPLFLQITHAAVHT 205
+P F+ ++ AVH+
Sbjct: 214 KPFFMYVSFNAVHS 227
>gi|149177301|ref|ZP_01855906.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148843826|gb|EDL58184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 501
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/224 (33%), Positives = 105/224 (46%), Gaps = 27/224 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGI--- 77
QG+ D+G G +I TP++D LA G L Y T P CTPSR + LTG+YP R GI
Sbjct: 48 QGYRDLGSFGSEEIMTPHLDRLAKEGAKLTSFYVTWPACTPSRGSLLTGRYPQRNGIYDM 107
Query: 78 ---------------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
+ V + V EKLLP LK GY + + GKW +G +K
Sbjct: 108 IRNEAPDFGHKYKPAEYEVTFERIGGMDVREKLLPALLKPAGYVSAIYGKWDLGIHK-RF 166
Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
LP RGFD+ G+ N + Y HE G+ + + + Y T F ++V
Sbjct: 167 LPLARGFDDFYGFTNTGIDY--FTHER---YGVPSMYRNNQPTEEDKGTYCTYLFQREAV 221
Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEEN 226
IK NH +P FL + A H ++ + ++ G Q P+ +N
Sbjct: 222 RFIK-ENHQKPFFLYLPFNAPHGASSLDPRIRGG-AQAPEKYKN 263
>gi|291231643|ref|XP_002735773.1| PREDICTED: steroid sulfatase-like [Saccoglossus kowalevskii]
Length = 572
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 53/120 (44%), Positives = 73/120 (60%), Gaps = 8/120 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP- 80
G D+G +G + I TPNID LA G+ L + P CTPSRAAFLTG+YP R G+ T
Sbjct: 40 GIGDLGCYGNDTIRTPNIDLLASEGVKLTHNIVPTPICTPSRAAFLTGRYPIRSGLGTSS 99
Query: 81 --VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE----ELLPFNRGFDNHVG 134
+ AG + +P E + + LK++GY+T ++GKWH+G + E E P N+GFD G
Sbjct: 100 AFICAGCSAGMPTQEVTIAEMLKDVGYATAILGKWHLGIHSEEQNNEFHPLNQGFDYFYG 159
>gi|149196558|ref|ZP_01873612.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149140238|gb|EDM28637.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 443
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 74/227 (32%), Positives = 101/227 (44%), Gaps = 26/227 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDT-- 79
G+ DVGF G + I TP+ID LA +G++ ++ Y + C PSRA +TGK R+G D
Sbjct: 20 GYGDVGFTGSSQIKTPHIDRLAKDGVIFSQGYVSSSVCGPSRAGLMTGKNQVRFGFDNNL 79
Query: 80 ----PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P +P++EK L L E GY L+GKWH+G +KE+ P RGF GY
Sbjct: 80 TNYLPQFKDEFHGLPISEKTLATRLAEKGYVNGLVGKWHLG-DKEQYHPLKRGFHEFWGY 138
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G Y G D PQ S Y+TD D+ V I+ H P F
Sbjct: 139 LGGGHHY---FRSKPNGKGYDCPIECNYKTPQPIS-YITDDKGDECVDFIRRHK-DEPFF 193
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + A H EE+ + ++HI RR +
Sbjct: 194 LFASFNAPHAPMHAK-------------EEDLKLYSHIEGEKRRAYC 227
>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
Length = 486
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 89/177 (50%), Gaps = 6/177 (3%)
Query: 35 IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPVGAGVAKAVPVTE 93
I TPNIDA++ G+ L +Y P CTPSR+ L+G+Y G+ + G+ A+P+
Sbjct: 11 IKTPNIDAISAAGVRLENYYVQPICTPSRSQLLSGRYQIHTGLQHQLIWMGMPSALPLDT 70
Query: 94 KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETD-FA 152
+LLP+ ++ GY T GKWH+G K P+ RGF N GY G Y D
Sbjct: 71 ELLPETMRNCGYHTMAAGKWHLGYAKTANTPWGRGFHNFTGYLGGSEDYYKKTRCIDHHK 130
Query: 153 VGLDARRNMERYAPQM----SSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
G+D + E + ++ +S+Y + Q+ + I + +P FL + +VH
Sbjct: 131 CGIDQNTDGEIFGERVYNADASEYSAFKYIRQAKNYIDGRDKDKPFFLYLPMQSVHA 187
>gi|372210171|ref|ZP_09497973.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 651
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 101/208 (48%), Gaps = 25/208 (12%)
Query: 23 GWNDVGFHGEND-------IPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFR 74
G+ DVGF+ + + IPTP +D LA NGI+ N H P C PSRAA +TG P R
Sbjct: 39 GYADVGFNRDANFPAEKGVIPTPELDQLANNGIICTNGHVAHPFCGPSRAALMTGVQPSR 98
Query: 75 YGI--DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
G+ + P + +P+ E P+ L++ Y T GKWH+G + + P +RGFD
Sbjct: 99 IGVQYNLPNDINTSLGIPLEETYFPKILQQNNYHTAAFGKWHLGFTQGKYQPLDRGFDYF 158
Query: 133 VGYWNGYLTYNDSIHE--------------TDFAVGLDARRNMERYAPQMSSKYLTDFFT 178
G+ G Y + +E ++ L +R+ +YLTD T
Sbjct: 159 FGFLGGGKAYFEREYEDLYYRRLGGSNPVTNEYQDPLQRQRDYVAKDEFNQDEYLTDILT 218
Query: 179 DQSVHVI-KSHNHSRPLFLQITHAAVHT 205
D++++ I ++ S P F+ + + A HT
Sbjct: 219 DEAINYIAENKTKSDPFFMYVAYNAPHT 246
>gi|403260898|ref|XP_003922887.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Saimiri boliviensis
boliviensis]
Length = 482
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 96/200 (48%), Gaps = 19/200 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G + Y+ P C+PSRAA LTG+ P R G T
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 61
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LKE GY T ++GKWH+G ++ + P GFD
Sbjct: 62 AHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLG-HRPQFHPLKHGFDEWF 120
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 121 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 180
Query: 189 NHSRPLFL----QITHAAVH 204
RP FL THA V+
Sbjct: 181 ARRRPFFLYWAVDATHAPVY 200
>gi|414070344|ref|ZP_11406330.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410807261|gb|EKS13241.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 470
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 103/206 (50%), Gaps = 20/206 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
G++D GF G + TP ID LA +V + Y T C PSRA TGKY R+G +
Sbjct: 38 GYHDFGFQGSEVMQTPTIDKLASQSVVFEQAYVTAAVCGPSRAGLYTGKYQQRFGFEENN 97
Query: 79 -----TPVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
+ G G +P T+ + ++LKELGY T L GKWH G N ++ P RGFDN
Sbjct: 98 VPGYMSKSGFTGDKMGLPFTQVTMAEHLKELGYHTGLFGKWHQG-NHDDYHPTKRGFDNF 156
Query: 133 VGYWN---GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSH 188
G+ GY Y++ + + ++ R+ + Y YLTD Q+ H I +S
Sbjct: 157 YGFREGARGYFAYSNEEQQAYPSQKME--RDFKHYIEH--EGYLTDALATQTSHFIGQSV 212
Query: 189 NHSRPLFLQITHAAVHT-GTAGNAKL 213
+ +P F ++ +AVH A NA L
Sbjct: 213 VNKQPFFAVLSFSAVHAPMQATNADL 238
>gi|254444367|ref|ZP_05057843.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
gi|198258675|gb|EDY82983.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
Length = 462
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 96/183 (52%), Gaps = 14/183 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ND+ +G DI TP ID+L GI Y+ P C+PSRAA LTG+YP R GI
Sbjct: 46 GYNDLSSYGATDIATPAIDSLGEQGIRFTDFYSASPVCSPSRAALLTGRYPIRQGITGVF 105
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ E + + L+E GY T L+GKWH+G +++ LP GF ++ G +
Sbjct: 106 WPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLGHHQKH-LPLQNGFHSYFG-----IP 159
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y++ + + G D +E Y ++ Y T +T+++V I+ N +P FL + H+
Sbjct: 160 YSNDMDMVVYMRGND----VESY--EVDQHYTTRRYTEEAVQFIE-QNKDQPFFLYLAHS 212
Query: 202 AVH 204
H
Sbjct: 213 MPH 215
>gi|323452509|gb|EGB08383.1| hypothetical protein AURANDRAFT_37517, partial [Aureococcus
anophagefferens]
Length = 235
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/148 (37%), Positives = 81/148 (54%), Gaps = 10/148 (6%)
Query: 23 GWNDVGFHG-----ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI 77
GWND G+H E TP +D LA +G+ L +YT P C+PSRA +TG+Y R GI
Sbjct: 38 GWNDAGYHNGGRPNEGWTSTPTLDRLAASGVKLESYYTAPICSPSRAQIMTGRYQIRVGI 97
Query: 78 DTPV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
GA +P+ E + L L Y T + GKWH+G ++ LP +RGFD H G++
Sbjct: 98 QHGCYGASQGTGLPLGEVTIADALSRLDYETWMFGKWHLGFDEAAFLPTSRGFDYHYGHY 157
Query: 137 NGYL-TYNDSIHETDFA---VGLDARRN 160
+ + +N ++ +T VGLD R+
Sbjct: 158 DACVNAWNHTVGKTGTEKPRVGLDWHRD 185
>gi|388257121|ref|ZP_10134301.1| sulfatase [Cellvibrio sp. BR]
gi|387939325|gb|EIK45876.1| sulfatase [Cellvibrio sp. BR]
Length = 484
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/226 (32%), Positives = 108/226 (47%), Gaps = 22/226 (9%)
Query: 23 GWNDVGF-HGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
G+NDVGF +G+ +I TP +DALA G+V Y T P C PSRA +TG+Y R+G++
Sbjct: 39 GYNDVGFTNGQTEIKTPRLDALANEGVVFENGYVTHPYCGPSRAGLITGRYQARFGMENN 98
Query: 81 VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
V +P+TEK P L+E+GY T + GKWH+G P RGFD G+ +
Sbjct: 99 VTYSPDDKYMGLPLTEKTFPARLQEVGYKTAIFGKWHLG-GAPHFQPNERGFDYFYGFLD 157
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFT-DQSVHVIKSHNHSRPLFL 196
G +N E G M +YLT + D + ++ ++ P F+
Sbjct: 158 G--GHNYMPGEVHLGAGGYLLPIMRNKGVAEFDEYLTTALSRDAARYIERTSKEQAPFFI 215
Query: 197 QITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+++ A H LQ P + +AHI + RR +A
Sbjct: 216 YMSYNAPHAP-----------LQAP--QNYLEKYAHIKDEKRRTYA 248
>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 635
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 103/210 (49%), Gaps = 36/210 (17%)
Query: 22 QGWNDVGF----HGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI 77
G+ND+G+ H N + TP +D LA G+ L R+YT C+PSR A LTG YP G+
Sbjct: 128 MGYNDIGYNRAPHQTNQVSTPFLDELASEGVTLTRYYTQCDCSPSRGALLTGLYPASTGL 187
Query: 78 DTPVGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
V + +P+ L+PQ+L Y +H IGKW +G +P RGF ++VG++
Sbjct: 188 YHGVIVTQSHWGLPLEYHLIPQFLPSR-YRSHAIGKWDVGHYTWNHVPTGRGFHSYVGFY 246
Query: 137 NGYLTY----------------------NDSIHETDFAVGLDARRNMERYAPQMSSKYLT 174
+ Y NDSI + ++ D + Y ++Y T
Sbjct: 247 GTDIDYYTHEIGAGCNSYNCSSAIKRCMNDSITDLNY----DGAATGDEY----YNRYST 298
Query: 175 DFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
D FTD++V ++++ + PLFL + AVH
Sbjct: 299 DIFTDRAVELLRTESARNPLFLYVAFNAVH 328
>gi|149701806|ref|XP_001488119.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Equus caballus]
Length = 491
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 67/207 (32%), Positives = 101/207 (48%), Gaps = 20/207 (9%)
Query: 18 KLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG 76
K + GW D+G +GE TPN+D +A G++ YT P C+PSRAA LTG+ P R G
Sbjct: 5 KSVNMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNG 64
Query: 77 IDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
T G + +P +E+LLP+ LKE GY + ++GKWH+G ++ + P G
Sbjct: 65 FYTTSGHARNAYTPQEIVGGIPDSERLLPELLKEAGYVSKIVGKWHLG-HRPQFHPLKHG 123
Query: 129 FDNHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVH 183
FD G N + D+ + V D R E + + + LT + +++
Sbjct: 124 FDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALD 183
Query: 184 VIKSHNHS-RPLFL----QITHAAVHT 205
I+ + RP FL THA V+
Sbjct: 184 FIRRQQAARRPFFLYWAVDATHAPVYA 210
>gi|323454643|gb|EGB10513.1| hypothetical protein AURANDRAFT_62515 [Aureococcus anophagefferens]
Length = 981
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 97/204 (47%), Gaps = 27/204 (13%)
Query: 25 NDVGFHG---ENDIPTP-NIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
+DVG + +D+P P NI L G+ L +Y C+P+RAA L+GK+ + G
Sbjct: 83 DDVGMNDLWQSSDLPVPENIATLVAEGVELTAYYGQSMCSPARAALLSGKFVHKIGFSDK 142
Query: 81 VG------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
G A +VP+ L+P+ LK GY TH IGKW+IG E LP+ RGFD VG
Sbjct: 143 WGPKREVTAFSNYSVPLGHVLMPEALKRNGYGTHGIGKWNIGHCNEAYLPWMRGFDTFVG 202
Query: 135 YW--------------NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQ 180
Y + YL ND++ DF + R + + + Y T+ F +
Sbjct: 203 YLTDGIGYTDHVADGPSSYLYDNDALDLYDF---VSHERGVTKNGSAYAGAYTTEIFNAR 259
Query: 181 SVHVIKSHNHSRPLFLQITHAAVH 204
+ +++ PLFL + H VH
Sbjct: 260 AETILREEPSDAPLFLWLAHHGVH 283
>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
Length = 472
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 74/228 (32%), Positives = 108/228 (47%), Gaps = 26/228 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D G +G IPTPNIDA+A G Y + C PSRA LTG+Y R+G +
Sbjct: 40 GYVDFGCYGGKQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTGRYQQRFGFEHNT 99
Query: 82 GAGVAKAVPVT-------EKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+A +T E+ + ++ GY T IGKWH G ++ + P NRGF+ G
Sbjct: 100 SNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQG-DEPKHFPLNRGFNEFYG 158
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
+ G + D A N + P+ YLTD FTD++ I + N +P
Sbjct: 159 FTGG---HRDFFAYKGKRTNEHALYNNKEIVPENEITYLTDMFTDKATSFITA-NKDKPF 214
Query: 195 FLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
F+ +++ AVHT NAK D+ E +A I++ RR +A
Sbjct: 215 FMYLSYNAVHTPM--NAK--------KDLMER---YASIADTGRRAYA 249
>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 100/188 (53%), Gaps = 11/188 (5%)
Query: 22 QGWNDVGFHGEND--IPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID 78
QG+ DV ++ E+D I TP+ DALA +G++ +R YT C+ +R+ +TG+Y RYGI
Sbjct: 35 QGYADVSYNPEHDDYISTPHTDALAKSGVIFHRGYTSGSVCSTTRSGLMTGRYQQRYGIY 94
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW-N 137
T G + K +P YLKE GY + GKWH+G ++ + P +RGFD+ G+
Sbjct: 95 TAGEGGTG--TDLNAKFIPNYLKEAGYKSMAFGKWHLG-HEMKYHPLHRGFDDFYGFMGR 151
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
G + E D G R +E P YLT T+++V I+ N +P F
Sbjct: 152 GAHDFFRLEKEYDGKFGGPIYRGLE---PIDDKGYLTTRITEETVKFIE-ENKDKPFFAY 207
Query: 198 ITHAAVHT 205
+ + AVHT
Sbjct: 208 VAYNAVHT 215
>gi|323144144|ref|ZP_08078781.1| arylsulfatase [Succinatimonas hippei YIT 12066]
gi|322416091|gb|EFY06788.1| arylsulfatase [Succinatimonas hippei YIT 12066]
Length = 472
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 105/211 (49%), Gaps = 36/211 (17%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D G +G + TP IDALA G Y + P C+P+RA+ LTGKYP R G+ +
Sbjct: 16 GWTDTGCYGSSFYETPRIDALALEGARFTDAYASCPVCSPTRASILTGKYPARLGLTQWI 75
Query: 82 GA---GVAKAVPVTEKL------LPQYLKELGYSTHLIGKWHIGCNKEELL---PFNRGF 129
G G VP + L L + LK+ GY T +GKWH+ + EE P GF
Sbjct: 76 GGHSEGKLADVPYIDHLSTDEISLAKALKQGGYKTWHVGKWHLSKHNEERFDTYPDKHGF 135
Query: 130 DNHVGY------WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH 183
D ++G +NGY + G++ + P+ +YLTD TD+++
Sbjct: 136 DVNIGGCHFGHPFNGYFS----------PYGIETLED----GPE--GEYLTDRLTDEAIK 179
Query: 184 VIK-SHNHSRPLFLQITHAAVHTGTAGNAKL 213
+IK S N +P F+ ++H AVHT + +L
Sbjct: 180 LIKGSKNDDKPWFMYLSHYAVHTPIECHEEL 210
>gi|313239626|emb|CBY14523.1| unnamed protein product [Oikopleura dioica]
gi|313245438|emb|CBY40171.1| unnamed protein product [Oikopleura dioica]
Length = 309
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 60/189 (31%), Positives = 97/189 (51%), Gaps = 10/189 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DV ++ E + TPN++ + G Y+ TC+PSRAA LTG + +R G+D P
Sbjct: 89 GWADVSWNNEF-VKTPNLERIRKQGRTFTNLYSHSTCSPSRAALLTGIFAWRLGLDGAPF 147
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ +L+P K+L Y H IGKWH G + L P RGFD+ G+++G +
Sbjct: 148 NPTKVNGIPLGVELIPAKFKKLNYENHFIGKWHGGFCHQNLTPTERGFDSFYGFYSGAVN 207
Query: 142 YNDSIHETDF---AVGLDARR---NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y HE+ + LD R E+ + + Y T FT++++ I + + +
Sbjct: 208 Y--LTHESKYDAKGAALDYREVKDGKEKILKEKNGVYTTADFTERALEKIDNFDENGGNL 265
Query: 196 LQITHAAVH 204
L +++ A H
Sbjct: 266 LFVSYNAPH 274
>gi|443734044|gb|ELU18180.1| hypothetical protein CAPTEDRAFT_89708, partial [Capitella teleta]
Length = 113
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 48/108 (44%), Positives = 70/108 (64%), Gaps = 4/108 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
GWNDVGFHG + TPN+DALAY+G++L +Y P CTPSRAA +TG++P G+ V
Sbjct: 1 GWNDVGFHGSEQVLTPNLDALAYDGVILENYYVQPICTPSRAALMTGRHPIHTGMQHGVI 60
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
+ + + E++LP+YL+++GY TH +GK C + F+ GF
Sbjct: 61 ISSQPYGLDLKERILPEYLRDIGYKTHAVGKVCFICIADC---FDWGF 105
>gi|260060774|ref|YP_003193854.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88784904|gb|EAR16073.1| arylsulfatase A (precursor) [Robiginitalea biformata HTCC2501]
Length = 526
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 92/188 (48%), Gaps = 7/188 (3%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
QG++DVG +G DIPTPN+DA+A +G++L Y P C+ SRA LTG YP R GI
Sbjct: 84 QGYSDVGVYGARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGLLTGCYPNRVGIHNA 143
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWNG 138
+ + E+ L + L++ GY T + GKWH+G + + LP GFD G Y N
Sbjct: 144 LMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLG-DHPDFLPTRHGFDEFFGIPYSND 202
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMS-SKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ + F G ER + + LT T++SV I H P FL
Sbjct: 203 MWPLH-PLQGPVFDFGPLPLYEQERVVDTLEDQRLLTRQITERSVDFINRHKEE-PFFLY 260
Query: 198 ITHAAVHT 205
+ H H
Sbjct: 261 VPHPQPHV 268
>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 494
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 69/189 (36%), Positives = 94/189 (49%), Gaps = 22/189 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP-----FRYGI 77
G+ DVGF I +PN D LA G+VLNRHY C+PSRA+ LTG++P + G
Sbjct: 34 GFADVGFKNPA-ISSPNFDHLAKTGLVLNRHYVYMYCSPSRASLLTGRWPHHTHQWNLGN 92
Query: 78 DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
++ G +A ++P LK Y+TH++GKWH G LP NRGFD G+
Sbjct: 93 NSTAGTNLAMT------MIPAKLKAANYATHMVGKWHQGFFDPRYLPINRGFDTSSGFLC 146
Query: 138 GYLTYNDSIHETDFAV-GLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G H T A+ +D +N AP + Y + D +I SHN + PLF
Sbjct: 147 G-----SEDHMTQNAICAIDYWKNN---APDPRNGTYDAYIYRDDLTDIINSHNTNEPLF 198
Query: 196 LQITHAAVH 204
L + VH
Sbjct: 199 LYLPLHNVH 207
>gi|380791197|gb|AFE67474.1| arylsulfatase E precursor, partial [Macaca mulatta]
Length = 232
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 71/237 (29%), Positives = 111/237 (46%), Gaps = 32/237 (13%)
Query: 23 GWNDVGFHG----ENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
GW DVG+H ++DI TP ID L GI L RHY C+P+RA+F +G+ P +GID
Sbjct: 64 GWADVGYHRSGPHKSDIQTPTIDKLVSQGIALERHYVHKVCSPTRASFQSGRLPV-HGID 122
Query: 79 TPVGAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
V +A +P + Q+L + GY++H +GKW +G P RG++ + Y
Sbjct: 123 GQVVLCAPRAGIPENMTTVAQHLNKAGYASHFVGKWDVGMATPSHTPHGRGYNTSLNYFG 182
Query: 136 -----WNG--YLTYNDSIHETDFAVGLDARRNM---ERYAPQMS-SKYLTDFFTDQSVHV 184
WN + +++ D ++ +R A +S + Y F + +
Sbjct: 183 HANWMWNQDEWQGSQNNVSHRPPCKAPDCFKDFWDTDRPAHNLSGTLYEEQLFVQRITDI 242
Query: 185 IKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
I++H+ S+PLFL H LQ P E + FA+I P RR++
Sbjct: 243 IEAHDPSQPLFLTYASKVAHYP-----------LQAP--IEYQQQFANIEPPSRRVY 286
>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 96/188 (51%), Gaps = 18/188 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR-YGIDTPV 81
G+ DV F I +PN + LA G++L+RHY C+PSRA+FLTG++P + + P
Sbjct: 33 GYADVSFRNPA-IHSPNFEKLAKEGLILDRHYVFKYCSPSRASFLTGRWPHHAHQWNPPE 91
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
A V + +T ++P LK Y TH+IGKWH G KE LP NRGFD G+ G
Sbjct: 92 DALVGANLKMT--MIPAKLKLARYKTHMIGKWHEGLYKEAYLPINRGFDTMSGFLGGGEN 149
Query: 142 Y-NDSI-HETDFAV--GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ N + TDF G D+R + Y + D +I +HN S P FL
Sbjct: 150 HMNQQVGCATDFWKNDGPDSR----------NGSYDAYTYRDDLTDIITNHNPSDPFFLY 199
Query: 198 ITHAAVHT 205
+ VHT
Sbjct: 200 LPLHNVHT 207
>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 94/188 (50%), Gaps = 20/188 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYP-----FRYGI 77
G+ DVGF I +PN D LA G+VLNRHY C+PSRA+ LTG++P + +
Sbjct: 34 GFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCSPSRASLLTGRWPHHAHQWNPLM 92
Query: 78 DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
D+ +G + +LP LK YSTH++GKWH+G LP NRGFD G+
Sbjct: 93 DSTIGTNI------NMTMLPAKLKAANYSTHMVGKWHLGFFDPRYLPINRGFDTSTGF-- 144
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQ-MSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
+ D ++E +D +N AP + Y + D V+ ++N PLFL
Sbjct: 145 -FGCCEDHMNEKS-GCSIDYWKNN---APDPRNGTYDAYNYRDDLTDVMSNYNTENPLFL 199
Query: 197 QITHAAVH 204
+ VH
Sbjct: 200 YLPLHNVH 207
>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 470
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 70/227 (30%), Positives = 105/227 (46%), Gaps = 26/227 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+ F G + TP+ID LA +G+ Y + C+PSRA LTG+ +G D +
Sbjct: 46 GYGDLSFTGSTQVKTPHIDELAASGVFFPEGYVSSAVCSPSRAGLLTGRNQVSFGYDNNL 105
Query: 82 GAG------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
+PV K + +LK+LGY T L+GKWH+G +++ P NRGFD GY
Sbjct: 106 ANSQPGFDPAFLGLPVNVKTVGDHLKKLGYVTGLVGKWHLGY-EDQFSPLNRGFDEFWGY 164
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G +D ++ G A+ PQ + Y+TD D+ ++ I+ H P F
Sbjct: 165 LGG---GHDYFEASEAKRGYKAKIKCNYKTPQEIT-YITDDKGDECINFIQRHK-DEPFF 219
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
L + A HT A E+ + HI + RR +A
Sbjct: 220 LYASFNAPHTPMQATA-------------EDLAIYQHIEDRKRRTYA 253
>gi|372209242|ref|ZP_09497044.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 479
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 67/234 (28%), Positives = 108/234 (46%), Gaps = 28/234 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+NDVGF+G DI TPN+D LA NG+++ Y P C PSR + +TGKY G +
Sbjct: 39 GYNDVGFNGSKDIKTPNLDKLADNGMIMTAGYVAHPFCGPSRTSIMTGKYAHTMGAQFNI 98
Query: 82 ---GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
G +P+ K + + L+E GY T GKWH+G + P RGFD G+ G
Sbjct: 99 PSESEGTGYGIPLNNKFISKELQEAGYYTGAFGKWHLGAD-TPFHPNKRGFDEFYGFLGG 157
Query: 139 YLTYNDSIHETDFA-VGLDARRNMERYAPQM--------SSKYLTDFFTDQSVH-VIKSH 188
Y ++ + + +N+ Y + +Y+TD + ++V+ V K+
Sbjct: 158 GHDYIPEQYKPKYEFLKQRGSKNIRDYIKPLEHNGTEVDEKEYITDGLSREAVNFVYKAS 217
Query: 189 NHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P F+ + + A H + + +E+ F I + RR +A
Sbjct: 218 EKKQPFFMYLAYNAPH-------------VPLQAKKEDMAVFKSIKDEKRRTYA 258
>gi|406833280|ref|ZP_11092874.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 1053
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 71/200 (35%), Positives = 96/200 (48%), Gaps = 23/200 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +G TPNID LA GI + Y P P+RAA LTG+YP R + V
Sbjct: 118 GWADLGCYGSKFHKTPNIDRLAQRGIRFTQAYAAAPIGQPTRAAILTGRYPQRMNLTASV 177
Query: 82 GAG------------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
A VAKA+P+ E + + LK GY+T IGKWH+G E P +GF
Sbjct: 178 AADPHDSKRRLTPPDVAKALPLEEVTIAEALKAAGYATGCIGKWHLG--GEGFGPKEQGF 235
Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDAR-RNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
D V Y +DFA +D + + + +YLTD ++ +KSH
Sbjct: 236 DVSVAAAATGAIY------SDFAPYVDVDGKPIPGLEQAPAGEYLTDRLALEAAKFVKSH 289
Query: 189 NHSRPLFLQITHAAVHTGTA 208
++P FL + H AVH A
Sbjct: 290 -QAKPFFLYLPHFAVHLPAA 308
>gi|71280931|ref|YP_269082.1| sulfatase [Colwellia psychrerythraea 34H]
gi|71146671|gb|AAZ27144.1| sulfatase family protein [Colwellia psychrerythraea 34H]
Length = 492
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 97/199 (48%), Gaps = 21/199 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D+ +G N TPNID LA +G+ + Y P C PSR A +G YP RYG+ P
Sbjct: 42 GRQDLSTYGSNFYETPNIDQLAADGMKFDNAYAAHPRCVPSRVAIFSGSYPTRYGV--PQ 99
Query: 82 GAGVAKA-VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV--GYWNG 138
G V K +P++ ++LKE GY T IGKWH+G KE P +GFD+ + G+W
Sbjct: 100 GERVGKHHLPLSAVTFGEHLKEAGYQTGYIGKWHLG--KEGGDPTKQGFDSSIMAGHWGA 157
Query: 139 ----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPL 194
Y Y + ++ G E +YLTD TD+++ I+ +P
Sbjct: 158 PPSYYFPYT-KMSKSGKNKGFAKVEGSEE-------EYLTDRLTDEALTFIE-QKKDQPF 208
Query: 195 FLQITHAAVHTGTAGNAKL 213
L + H AVHT G L
Sbjct: 209 LLVLAHYAVHTPIEGKPAL 227
>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 500
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 86/171 (50%), Gaps = 9/171 (5%)
Query: 35 IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEK 94
I TP+ L NG++LNRHY C+PSRA+FLTG++P P G+ +
Sbjct: 50 IKTPSFQYLVDNGLILNRHYVFKYCSPSRASFLTGRFPHHVHQWNPTPPGMV-GTNINMT 108
Query: 95 LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVG 154
+LP LK GYSTH++GKWH G LP NRGFD +G+L +
Sbjct: 109 MLPAKLKTAGYSTHMVGKWHQGLYDPAYLPVNRGFDTS----SGFLQAGEGHFNQTIGCA 164
Query: 155 LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRPLFLQITHAAVH 204
+D +N +AP + + ++ + I S H+ S+PLFL + VH
Sbjct: 165 VDFWKN---HAPDTRNGTYDSYIYNKDLTTIFSKHDASKPLFLYLPLHNVH 212
>gi|332223751|ref|XP_003261032.1| PREDICTED: arylsulfatase E isoform 2 [Nomascus leucogenys]
Length = 614
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPMRSGMVSSI 133
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193
Query: 131 NHVG 134
+ G
Sbjct: 194 HFYG 197
>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
Length = 488
Score = 99.8 bits (247), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 108/218 (49%), Gaps = 17/218 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+GF G +I TPNIDALA+ G+V + Y T P C PSRA LTG+Y R+G++
Sbjct: 46 GYGDLGFTGSREIKTPNIDALAHKGVVFSNAYVTHPYCGPSRAGLLTGRYQARFGMEINA 105
Query: 82 GAGVAK---AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+PV E + +++ GY T +IGKWH+G + P NRGFD G+ G
Sbjct: 106 AHSPDDPFMGLPVDEPTFAKRMQKAGYKTAVIGKWHMGSHP-NFHPNNRGFDYFYGFLGG 164
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
Y + + ++++ L RN + P ++YLT + ++ + S+P
Sbjct: 165 GHDYFPESVKVSNEEYSIPLS--RNGK---PAQLNEYLTTAISKEAAEF--AMTTSQPFM 217
Query: 196 LQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
+ + + A H K + D+ N RT+A +
Sbjct: 218 MYVAYNAPHQPLEATQKDLAKYQHIEDI--NRRTYAAM 253
>gi|332223749|ref|XP_003261031.1| PREDICTED: arylsulfatase E isoform 1 [Nomascus leucogenys]
Length = 589
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPMRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|410988042|ref|XP_004000297.1| PREDICTED: arylsulfatase E [Felis catus]
Length = 585
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + I TPNID LA +G++L +H + CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDIGCYGNDTIRTPNIDRLARDGVMLTQHLAAASVCTPSRAAFLTGRYPLRSGMVSSN 104
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
G GV+ +P E + LK+ GY+T LIGKWH+G N E P N GFD
Sbjct: 105 GYRVLQWTGVSGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCESSNDHCHHPLNHGFD 164
Query: 131 NHVG 134
+ G
Sbjct: 165 HFYG 168
>gi|410340669|gb|JAA39281.1| arylsulfatase E (chondrodysplasia punctata 1) [Pan troglodytes]
Length = 599
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 58 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 117
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 118 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 177
Query: 131 NHVG 134
+ G
Sbjct: 178 HFYG 181
>gi|194227646|ref|XP_001495573.2| PREDICTED: arylsulfatase E [Equus caballus]
Length = 623
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/195 (35%), Positives = 101/195 (51%), Gaps = 22/195 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G DVG +G I TPNID LA +G++L +H + CTPSRAAFLTG+YP R G+ +
Sbjct: 83 GVGDVGCYGNTTIRTPNIDRLAKDGVMLTQHIAAASVCTPSRAAFLTGRYPVRSGMVSSN 142
Query: 82 GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
G V + +P E + LK+ GY+T LIGKWH+G N + P N GFD
Sbjct: 143 GYRVLQWTAASGGLPTNETTFAKILKDTGYATGLIGKWHLGLNCQSSNDHCHHPLNHGFD 202
Query: 131 NHVGYWNGYLTYNDSIHE--TDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
+ G + D +H ++ VGL+ + N + Q+ + F T + +H++
Sbjct: 203 HFYGM--PFSMMGDCVHWELSEKRVGLENKLN---FCSQIMAIAALTFTTGKLIHLMAG- 256
Query: 189 NHSRPLFLQITHAAV 203
S L + T AA+
Sbjct: 257 --SWALVIWSTVAAI 269
>gi|414070343|ref|ZP_11406329.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410807260|gb|EKS13240.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 469
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 74/225 (32%), Positives = 104/225 (46%), Gaps = 23/225 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+GF G +I TPNIDALA NG N + T P C PSR LTG+Y R G++ V
Sbjct: 28 GYGDLGFTGSKEIKTPNIDALASNGTRFKNAYVTHPYCGPSRVGLLTGRYQARLGMENNV 87
Query: 82 G---AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+P++E L+++GY T + GKWH+G P RGFD G+ +G
Sbjct: 88 SYMPQDKYMGLPLSENTFANRLQDVGYHTSVFGKWHLG-GAPHFQPNKRGFDYFYGFLDG 146
Query: 139 YLTYN-DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y D + L RN + +YLT + +V I S P F+
Sbjct: 147 GHNYMPDQVTVGGDGYSLPLMRNTQVTE---FDEYLTTALSRDAVKYIHRQQES-PFFMY 202
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+++ A HT LQ P E + HI + DRR++A
Sbjct: 203 LSYNAPHTP-----------LQAP--AEYIEKYKHIEDEDRRVYA 234
>gi|426395032|ref|XP_004063784.1| PREDICTED: arylsulfatase E isoform 2 [Gorilla gorilla gorilla]
Length = 614
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193
Query: 131 NHVG 134
+ G
Sbjct: 194 HFYG 197
>gi|120659872|gb|AAI30439.1| Arylsulfatase E (chondrodysplasia punctata 1) [Homo sapiens]
gi|313883184|gb|ADR83078.1| arylsulfatase E (chondrodysplasia punctata 1) [synthetic construct]
Length = 589
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G DVG +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDVGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
queenslandica]
Length = 2318
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 62/183 (33%), Positives = 88/183 (48%), Gaps = 10/183 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ D F I TPN L NG++LNRHY C+PSRA+FLTG++P P
Sbjct: 1152 GFADASFRNP-AIKTPNFQYLVDNGLILNRHYVFKYCSPSRASFLTGRFPHHVHQWNPTP 1210
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
G+ + +LP LK GY+TH++GKWH G LP NRGFD +G+L
Sbjct: 1211 LGMV-GTNINMTMLPAKLKNAGYATHMVGKWHQGLYDPAYLPINRGFDTS----SGFLQA 1265
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV-HVIKSHNHSRPLFLQITHA 201
+ +D +N AP + + ++ + V H+ S+PLFL +
Sbjct: 1266 EEGHFNQTIGCAVDFWKND---APDTRNGTCDSYIYNKDLTTVFNEHDASKPLFLYLPLH 1322
Query: 202 AVH 204
VH
Sbjct: 1323 NVH 1325
>gi|297709347|ref|XP_002831396.1| PREDICTED: arylsulfatase E isoform 2 [Pongo abelii]
Length = 614
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193
Query: 131 NHVG 134
+ G
Sbjct: 194 HFYG 197
>gi|297303268|ref|XP_002806170.1| PREDICTED: arylsulfatase E isoform 2 [Macaca mulatta]
Length = 614
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193
Query: 131 NHVG 134
+ G
Sbjct: 194 HFYG 197
>gi|426395030|ref|XP_004063783.1| PREDICTED: arylsulfatase E isoform 1 [Gorilla gorilla gorilla]
Length = 589
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|332666885|ref|YP_004449673.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
gi|332335699|gb|AEE52800.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
Length = 443
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 72/199 (36%), Positives = 96/199 (48%), Gaps = 33/199 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+ +G D TPNID LA GI +N + P CTP+R AF+TG+YP R TPV
Sbjct: 42 GYGDLSGYGRKDFLTPNIDKLAAQGIKFVNAYSAAPLCTPTRTAFMTGRYPAR----TPV 97
Query: 82 G-------------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
G G+ A P L ++ GY T LIGKWH+G + P G
Sbjct: 98 GLMEPLTPSKRDSTVGLTAAFPSVATL----MRASGYETALIGKWHLGFLPQN-SPVKNG 152
Query: 129 FDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMS---SKYLTDFFTDQSVHVI 185
FD G +G Y H+T G RR + Y + YLTD FT ++V +
Sbjct: 153 FDYFFGIHSGAADYIS--HKT----GPAGRRIHDLYENDQAVYPEGYLTDLFTQKAVTFL 206
Query: 186 KSHNHSRPLFLQITHAAVH 204
K H++P FL +T+ A H
Sbjct: 207 K-QKHNKPFFLTLTYNAAH 224
>gi|297709345|ref|XP_002831395.1| PREDICTED: arylsulfatase E isoform 1 [Pongo abelii]
Length = 589
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|301770871|ref|XP_002920858.1| PREDICTED: arylsulfatase E-like [Ailuropoda melanoleuca]
Length = 592
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 72/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N I TPNID LA +G++L +H + CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDIGCYGNNSIRTPNIDRLAEDGVMLTQHVAAASVCTPSRAAFLTGRYPLRSGMVSSN 104
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G GV +P E + LK+ GY+T LIGKWH+G N + P N GFD
Sbjct: 105 GYRVLQWTGVPGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCDSSSDHCHHPLNHGFD 164
Query: 131 NHVG 134
+ G
Sbjct: 165 HFYG 168
>gi|444722183|gb|ELW62881.1| N-acetylgalactosamine-6-sulfatase [Tupaia chinensis]
Length = 764
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 95/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 73 GWGDLGVYGEPSRETPNLDQMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRTGFYTTN 132
Query: 81 -------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK GY + ++GKWH+G ++ + P GFD
Sbjct: 133 AHARNAYTPQEIVGGIPSSEHLLPELLKGAGYVSKIVGKWHLG-HRPQFHPLRHGFDEWF 191
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 192 GAPNCHFGPYDNKARPNIPVYRDWEMVGRFYEEFPISLKTGEANLTQIYLQEALDFIKRQ 251
Query: 189 NHSRPLFLQ----ITHAAVHT 205
RP FL THA V+
Sbjct: 252 AGRRPFFLHWAIDATHAPVYA 272
>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 445
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/189 (33%), Positives = 97/189 (51%), Gaps = 13/189 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+ +G I TPN+D LA G++ + H C+P+RAA +TGKY R G++ V
Sbjct: 39 GYGDLSCYGNEYINTPNLDLLASEGVLFTDYHSNGSVCSPTRAALMTGKYQQRTGVEGVV 98
Query: 82 GAGVAKAV--PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
A + V + E L + LK+LGY+T + GKWH+G +K P +GFD VG+ +G
Sbjct: 99 TAKSHRDVGLALAEVTLAEELKQLGYNTGMFGKWHLGYDK-AFNPTLQGFDEFVGFVSGN 157
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN---HSRPLFL 196
+ Y+ I + + D + + Y TD ++ V I+ HN P FL
Sbjct: 158 VDYHGHIDQEGYLDWWDGVK------IKNEKGYTTDLISEYGVKFIQEHNPEVKRAPFFL 211
Query: 197 QITHAAVHT 205
+ H A H+
Sbjct: 212 YLPHEAPHS 220
>gi|284039849|ref|YP_003389779.1| sulfatase [Spirosoma linguale DSM 74]
gi|283819142|gb|ADB40980.1| sulfatase [Spirosoma linguale DSM 74]
Length = 533
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 100/195 (51%), Gaps = 19/195 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G +G ++ TPN+D LA GI L Y C P+RA+ LTG+YP G+ V
Sbjct: 52 GFSDIGCYG-GEVNTPNLDKLAAGGIKLRSFYNNARCCPTRASLLTGQYPHTVGMGLMVT 110
Query: 83 AGVAKAVPVTEK--------LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
A P + + + + LKE GYST+++GKWH+G + E P RGF+++ G
Sbjct: 111 MPNAAIQPGSYQGFLDARYPTIAERLKETGYSTYMLGKWHVG-ERPEHWPLKRGFEHYFG 169
Query: 135 YWNGYLTYNDSI--HETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KSHN 189
+G +Y + I + + LD + + P Y+TD FTD +V + K
Sbjct: 170 LISGASSYYEIIPAEKGKRFIVLDDK----EFTPPADGFYMTDAFTDYAVQYLNQQKQEQ 225
Query: 190 HSRPLFLQITHAAVH 204
+P F+ + + A H
Sbjct: 226 ADKPFFMYLAYTAPH 240
>gi|62510430|sp|Q60HH5.1|ARSE_MACFA RecName: Full=Arylsulfatase E; Short=ASE; Flags: Precursor
gi|52782187|dbj|BAD51940.1| arylsulfatase E [Macaca fascicularis]
Length = 588
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|62897927|dbj|BAD96903.1| arylsulfatase E precursor variant [Homo sapiens]
Length = 589
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|414068777|ref|ZP_11404774.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410808616|gb|EKS14585.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 480
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 76/233 (32%), Positives = 111/233 (47%), Gaps = 28/233 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG--IDT 79
G+ DVGF+G DI TPNID LA +G + Y P C PSRAA +TG+YP + G +
Sbjct: 41 GYADVGFNGSKDIITPNIDDLAKSGTSFSDAYVAHPFCGPSRAALMTGRYPHKIGSQFNL 100
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
P G VP K + + L E Y T +GKWH+G + + P RGFD + G+ G
Sbjct: 101 PT-RGSNVGVPTDAKFISKLLNENNYFTGALGKWHMG-DAPQYHPNKRGFDEYYGFLGGG 158
Query: 140 LTY-NDSIHETDFAVGLDARRNMERYAPQM--------SSKYLTDFFTDQSVHVI-KSHN 189
Y D +N+ Y + ++Y+TD + ++V+ + K+ N
Sbjct: 159 HNYFPDQYQPQYKKQQAQGLKNIFEYITPLEHNGKEVKETQYITDALSREAVNFVDKAVN 218
Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
P FL + + A H +P LQ D E+ F +I N DR+ +A
Sbjct: 219 KKNPFFLYLAYNAPH--------VP---LQAKD--EDMAMFPNIKNKDRKTYA 258
>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 484
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 94/197 (47%), Gaps = 21/197 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D GF G + TPN+D LA +G + Y T TC PSRA +TGKY R+G +
Sbjct: 34 GYADFGFQGSKIMKTPNLDKLAKSGAKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEIN 93
Query: 82 GAGVAKA----------VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
G A +P+ + + +LK+LGY T + GKWH+G + + P RGFD
Sbjct: 94 VPGYMSANSKFLADDMGLPLDQLTIADHLKKLGYKTAMYGKWHLG-DADRYHPTKRGFDE 152
Query: 132 HVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
G+ G Y YND LD R Q ++Y+TD ++V I+
Sbjct: 153 FYGFRGGARNYFGYNDVSK-----ANLDNRMERGFGNYQEPTEYVTDALAKEAVSFIEK- 206
Query: 189 NHSRPLFLQITHAAVHT 205
N P F+ + AVHT
Sbjct: 207 NKGNPFFIYLAFNAVHT 223
>gi|157266309|ref|NP_000038.2| arylsulfatase E precursor [Homo sapiens]
gi|77416850|sp|P51690.2|ARSE_HUMAN RecName: Full=Arylsulfatase E; Short=ASE; Flags: Precursor
gi|62897959|dbj|BAD96919.1| arylsulfatase E precursor variant [Homo sapiens]
gi|119619123|gb|EAW98717.1| arylsulfatase E (chondrodysplasia punctata 1), isoform CRA_a [Homo
sapiens]
Length = 589
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|355757152|gb|EHH60677.1| Arylsulfatase E [Macaca fascicularis]
Length = 589
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|109129828|ref|XP_001116129.1| PREDICTED: arylsulfatase E isoform 1 [Macaca mulatta]
gi|355704585|gb|EHH30510.1| Arylsulfatase E [Macaca mulatta]
Length = 589
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|109897214|ref|YP_660469.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699495|gb|ABG39415.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 500
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 75/235 (31%), Positives = 111/235 (47%), Gaps = 32/235 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+NDVGF+G DI TPN+D LA NG+ + Y P C PSRAA +TG+YP + G +
Sbjct: 51 GYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMTGRYPHKIGAQFNL 110
Query: 82 GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ V E + Q +K GY T +GKWH+G E P GFD G+ G
Sbjct: 111 PEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLG-EASEYHPNKHGFDEFYGFLGGGH 169
Query: 141 TYNDSIHETDF----AVGLDARRNMERYAPQM--------SSKYLTDFFTDQSVHVI-KS 187
Y E + A G+ N+ Y + ++Y+TD + ++V+ + K+
Sbjct: 170 NYFPEQFEAAYNKRVAQGM---TNINMYLTPLEHNGKEVRETEYITDGLSREAVNFVDKA 226
Query: 188 HNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P FL + + A H +P LQ EE+ F+ I + RR +A
Sbjct: 227 AAKKKPFFLYLAYNAPH--------VP---LQAK--EEDMAMFSQIKDKKRRTYA 268
>gi|348550278|ref|XP_003460959.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cavia porcellus]
Length = 502
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 100/202 (49%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 21 GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 80
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P +E+LLPQ LKE GY+T ++GKWH+G ++ + P GFD
Sbjct: 81 GHARNAYTPQEIVGGIPDSERLLPQLLKEAGYATKIVGKWHLG-HRPQFHPLKHGFDEWF 139
Query: 134 GYWNGYLTYNDSIHETDFAVGLD---ARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V + R E + + + LT + +++ I+
Sbjct: 140 GSPNCHFGPYDNKARPNIPVYRNWDMVGRFYEEFPINVKTGESNLTQIYLQEALDFIRQQ 199
Query: 189 NHSR-PLFL----QITHAAVHT 205
++ P FL THA V+
Sbjct: 200 QAAQHPFFLYWAVDATHAPVYA 221
>gi|430746414|ref|YP_007205543.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018134|gb|AGA29848.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 590
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 96/202 (47%), Gaps = 29/202 (14%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW D+ HG ++ TPNID+LA +G + R Y P C P+RA FLTG+Y R G+
Sbjct: 34 QGWGDLSVHGNTNLKTPNIDSLARDGALFERFYVCPVCAPTRAEFLTGRYHPRGGVRGVT 93
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL- 140
G + + + EK + + K GY+T GKWH G + P RGFD + G+ +G+
Sbjct: 94 SGG--ERLDLNEKTIAETFKSAGYATGAFGKWHNG-TQFPYHPNARGFDEYYGFTSGHWG 150
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D E + P + ++TD TD ++ IK+ + RP F +
Sbjct: 151 EYFDPPLEHN-------------GRPVQGNGFITDDLTDHAISFIKA-SKDRPFFCYLPF 196
Query: 201 AAVHTGTAGNAKLPTGLLQVPD 222
H+ +QVPD
Sbjct: 197 NTPHSP-----------MQVPD 207
>gi|354465430|ref|XP_003495183.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cricetulus
griseus]
Length = 493
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 102/208 (49%), Gaps = 20/208 (9%)
Query: 16 TEKLLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFR 74
TE+ GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R
Sbjct: 5 TERAEVMGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIR 64
Query: 75 YGIDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
G T G + +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P
Sbjct: 65 NGFYTSNGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLK 123
Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQS 181
GFD G N + D+ + + V D R E + + + LT + ++
Sbjct: 124 HGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEA 183
Query: 182 VHVIKS-HNHSRPLFL----QITHAAVH 204
+ I++ H P FL THA V+
Sbjct: 184 LDFIRTQHARQSPFFLYWAIDATHAPVY 211
>gi|440908775|gb|ELR58760.1| N-acetylgalactosamine-6-sulfatase, partial [Bos grunniens mutus]
Length = 525
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 71/221 (32%), Positives = 105/221 (47%), Gaps = 24/221 (10%)
Query: 8 GVAKAVPVTEKLL----PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPS 62
GVA+A+ LL GW D+G +GE TPN+D +A G++ YT P C+PS
Sbjct: 26 GVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPS 85
Query: 63 RAAFLTGKYPFRYGIDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWH 114
RAA LTG+ P R G T G + +P +E LLP LK GY++ ++GKWH
Sbjct: 86 RAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWH 145
Query: 115 IGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS- 170
+G ++ + P GFD G N + D+ + V D R E + + +
Sbjct: 146 LG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTG 204
Query: 171 -KYLTDFFTDQSVHVIKSHNHS-RPLFL----QITHAAVHT 205
LT + +++ I+ + RP FL THA V+
Sbjct: 205 EANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHAPVYA 245
>gi|338213632|ref|YP_004657687.1| arylsulfatase [Runella slithyformis DSM 19594]
gi|336307453|gb|AEI50555.1| Arylsulfatase [Runella slithyformis DSM 19594]
Length = 535
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/196 (32%), Positives = 99/196 (50%), Gaps = 21/196 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G +G ++ TPNID +A NGI L Y C P+RA+ LTG+YP G+ V
Sbjct: 54 GFSDIGCYG-GEVNTPNIDQMAANGIKLRSFYNNARCCPTRASLLTGQYPHTVGMGLMVT 112
Query: 83 AGVAKAVPVTEK--------LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
A P + + + + LK+ GY T+++GKWH+G + + P RGFDN+ G
Sbjct: 113 MPNAAIQPGSYQGFLDDRYPTIAEQLKKTGYHTYMLGKWHVG-ERPQHWPLKRGFDNYFG 171
Query: 135 YWNGYLTYNDSI---HETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KSH 188
+G +Y + I F V D + + P Y+TD FTD +V + K
Sbjct: 172 LISGASSYYEIIPAEKGKRFMVLDD-----KEFTPPSDGFYVTDAFTDYAVQYLNKQKQE 226
Query: 189 NHSRPLFLQITHAAVH 204
+P F+ + + A H
Sbjct: 227 AADKPFFMYLAYTAPH 242
>gi|410056148|ref|XP_003317386.2| PREDICTED: arylsulfatase E [Pan troglodytes]
Length = 750
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 53/121 (43%), Positives = 72/121 (59%), Gaps = 12/121 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 129 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 188
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 189 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 248
Query: 131 N 131
+
Sbjct: 249 H 249
>gi|116251005|ref|YP_766843.1| arylsulfatase [Rhizobium leguminosarum bv. viciae 3841]
gi|115255653|emb|CAK06734.1| putative arylsulfatase [Rhizobium leguminosarum bv. viciae 3841]
Length = 503
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 51/119 (42%), Positives = 72/119 (60%), Gaps = 6/119 (5%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW D G +G + TPN+D LA G++L Y+ PTCTP+R+A LTG+ P R G+
Sbjct: 51 GWGDPGLYGGGEAVGAATPNMDRLAREGLMLTSTYSQPTCTPTRSAILTGRLPVRTGLTR 110
Query: 80 PVGAG--VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
P+ AG + K E LP+ L E+GY+T L GKWH+G + E + P + GFD G++
Sbjct: 111 PILAGDKITKNPWAEEASLPKLLGEVGYATVLCGKWHVG-DVEGMRPHDVGFDEFYGFY 168
>gi|323452121|gb|EGB07996.1| hypothetical protein AURANDRAFT_64538 [Aureococcus anophagefferens]
Length = 1591
Score = 98.2 bits (243), Expect = 2e-18, Method: Composition-based stats.
Identities = 58/191 (30%), Positives = 95/191 (49%), Gaps = 13/191 (6%)
Query: 25 NDVGFHG---ENDIPTPN-IDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
+DVG + D+P P + L +G+ L +Y C+P+RA +TGK+ + G
Sbjct: 1103 DDVGLNDLWRSTDLPKPTEMSKLVRDGVELTSYYGQSLCSPARATLMTGKFAHKIGFSDQ 1162
Query: 81 VGAGVAK-------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G GV + +VP+ +LPQ +K LGY TH IGKW+IG + +P+ RGFD V
Sbjct: 1163 QG-GVREVTAYSNFSVPLGHDMLPQGMKRLGYQTHAIGKWNIGHCNVKYMPWQRGFDTFV 1221
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
GY+ + Y D + +T ++ + + Y T FT+++ V+ P
Sbjct: 1222 GYFTDGIGYTDHVSDTANTYTVN-DGGLAFNGSEYEGTYTTALFTERAEKVLHDAPEDAP 1280
Query: 194 LFLQITHAAVH 204
LF+ + + +H
Sbjct: 1281 LFMWLAYHGMH 1291
>gi|440717770|ref|ZP_20898247.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436437072|gb|ELP30746.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 480
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/193 (34%), Positives = 94/193 (48%), Gaps = 27/193 (13%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QG+ DVG G DI TP +DA+A G+ L Y P C PSRAA +TG YP R
Sbjct: 23 QGYQDVGCFGSPDIRTPRLDAMAKEGMKLTSFYAQPICGPSRAALMTGCYPLRV-----A 77
Query: 82 GAGVAKAV-PVT---EKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNH 132
G K + P+ E + + LK GY+T GKW + + + +LLP +GFD
Sbjct: 78 ERGHTKQIHPILHEGEITIAEVLKTKGYATACFGKWDLAKHAQSGFFPDLLPTGQGFD-- 135
Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
Y+ G T ND + + RN E P+ LT +TD+++ I+ N ++
Sbjct: 136 --YFYGTPTSNDRV--------ANLYRNEELIEPESDMATLTRRYTDEAISFIEK-NQNQ 184
Query: 193 PLFLQITHAAVHT 205
P F+ I H HT
Sbjct: 185 PFFVYIPHTMPHT 197
>gi|254516321|ref|ZP_05128380.1| steryl-sulfatase [gamma proteobacterium NOR5-3]
gi|219674744|gb|EED31111.1| steryl-sulfatase [gamma proteobacterium NOR5-3]
Length = 500
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/205 (31%), Positives = 99/205 (48%), Gaps = 25/205 (12%)
Query: 23 GWNDVGFHGEND----IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
G+ D+G +G +PTP ID LA G++L + + P CTP+RAA LTG+Y R G+
Sbjct: 49 GYGDLGVYGSGGELRGMPTPRIDQLASEGMMLTQFFVEPGCTPTRAALLTGRYSQRAGLG 108
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN-HVGYW- 136
+ + AG + +E L + K GY+T + GKWH+G K+ LP N+GFD HVG
Sbjct: 109 SIIIAGTPSTLQDSEVTLAELFKSQGYATAMTGKWHLGGEKQS-LPINQGFDEWHVGILQ 167
Query: 137 --NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSS--------------KYLTDFFTDQ 180
+G L Y D + + F+ A+ + + +++ +
Sbjct: 168 TTDGVL-YPDGMRRSGFSEAAIAKSQTAIWESEPGKDVVKKVRPYDLEYRRHIEGDIAEA 226
Query: 181 SVHVIKSHNHSR-PLFLQITHAAVH 204
SV IK + P FL + + VH
Sbjct: 227 SVKYIKEQAKEKEPFFLYVGWSHVH 251
>gi|329744562|ref|NP_001193258.1| N-acetylgalactosamine-6-sulfatase precursor [Bos taurus]
gi|296478055|tpg|DAA20170.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Bos taurus]
Length = 522
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 102/217 (47%), Gaps = 20/217 (9%)
Query: 8 GVAKAVPVTEKLL----PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPS 62
GVA+A+ LL GW D+G +GE TPN+D +A G++ YT P C+PS
Sbjct: 22 GVARALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPS 81
Query: 63 RAAFLTGKYPFRYGIDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWH 114
RAA LTG+ P R G T G + +P +E LLP LK GY++ ++GKWH
Sbjct: 82 RAALLTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWH 141
Query: 115 IGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS- 170
+G ++ + P GFD G N + D+ + V D R E + + +
Sbjct: 142 LG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTG 200
Query: 171 -KYLTDFFTDQSVHVIKSHNHS-RPLFLQITHAAVHT 205
LT + +++ I+ + RP FL A H
Sbjct: 201 EANLTQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA 237
>gi|417302808|ref|ZP_12089892.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327540882|gb|EGF27442.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 480
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 114/261 (43%), Gaps = 62/261 (23%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
G+ + G G +IPTP IDALA +G+ Y + C+PSRA FL+G+Y R+G D P
Sbjct: 46 GYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNP 105
Query: 81 VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
G +P +K ++L+ GY T LIGKWH+G + +P ++GFD G+
Sbjct: 106 TGERNNHPNAGLPPQQKTFVEHLQSAGYQTSLIGKWHLGTRPPQ-VPTSKGFDRFFGFLH 164
Query: 136 --------------W---------NGYLTYND--------SIHETDFAVG---LDARRNM 161
W G N I+E D+ G LD +
Sbjct: 165 EGHFYVPGPPFENVWTMLRDNTLPTGQFETNQRTIRGNYARINEPDYDAGNPMLDGSEPI 224
Query: 162 ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
+ + YLTD TD+++ I + S P + +++ AVH+ +
Sbjct: 225 DHW------NYLTDTITDKAIDAI-TQTASNPFAMVVSYNAVHSPMQASL---------- 267
Query: 222 DMEENDRTFAHISNPDRRLFA 242
E+ HI +P RR+FA
Sbjct: 268 ---EDHAAMEHIDDPQRRIFA 285
>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
Length = 328
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/189 (34%), Positives = 92/189 (48%), Gaps = 16/189 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALA-YNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G+ + I TPNID LA G++L+ Y C+PSRA+FLTG+ P P
Sbjct: 11 GYYDLGYRNPDSI-TPNIDQLATQEGVILDNAYGYRYCSPSRASFLTGRVPIHVHQGNP- 68
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
G A + ++P L+ GY T ++GKWH G + E LP NRGFD GY +G
Sbjct: 69 GLAAAGCTNLNYTMIPAQLRRAGYRTAMVGKWHQGASLPECLPVNRGFDTSFGYLSGEED 128
Query: 142 YND------SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
+ D + TDF LD+ + R S +Y D V +I+ H +PL
Sbjct: 129 HMDQTTNGGQCNVTDFW--LDSGPAIRRNGTYSSFQY-----NDAIVDIIQQHAPEQPLM 181
Query: 196 LQITHAAVH 204
L VH
Sbjct: 182 LYAALQNVH 190
>gi|32473691|ref|NP_866685.1| N-acetylgalactosamine-4-sulfatase precursor [Rhodopirellula baltica
SH 1]
gi|32444227|emb|CAD74224.1| N-acetylgalactosamine-4-sulfatase precursor [Rhodopirellula baltica
SH 1]
Length = 480
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 114/261 (43%), Gaps = 62/261 (23%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
G+ + G G +IPTP IDALA +G+ Y + C+PSRA FL+G+Y R+G D P
Sbjct: 46 GYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNP 105
Query: 81 VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
G +P +K ++L+ GY T LIGKWH+G + +P ++GFD G+
Sbjct: 106 TGERNNHPNAGLPPQQKTFVEHLQSAGYQTSLIGKWHLGTRPSQ-VPTSKGFDRFFGFLH 164
Query: 136 --------------W---------NGYLTYNDS--------IHETDFAVG---LDARRNM 161
W G N I+E D+ G LD +
Sbjct: 165 EGHFYVPGPPFENVWTMLRDNTLPTGRFETNQKTIRGNYARINEPDYDAGNPMLDGSEPI 224
Query: 162 ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
E + YLTD TD+++ I + S+P + +++ AVH+ +
Sbjct: 225 EHW------NYLTDSITDKAIDAI-TQTASKPFAMVVSYNAVHSPMQASL---------- 267
Query: 222 DMEENDRTFAHISNPDRRLFA 242
E+ I +P RR+FA
Sbjct: 268 ---EDHAAMELIDDPQRRIFA 285
>gi|325108643|ref|YP_004269711.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324968911|gb|ADY59689.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 484
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 69/220 (31%), Positives = 104/220 (47%), Gaps = 27/220 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI--- 77
QG+ND+G +D+ TP++D LA G L Y P CTPSRA+ LTG+YP R GI
Sbjct: 38 QGYNDLGVLN-SDLITPHLDRLAAEGTRLTDFYVAWPACTPSRASLLTGRYPQRNGIYDM 96
Query: 78 ---------------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
+ V + EKLLP+YLK+LGY++ + GKW +G K
Sbjct: 97 IRNEAPDYGYKYKPAEYEVSFERIGGMDQREKLLPEYLKKLGYTSAIFGKWDLGSLK-RF 155
Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
LP NRGFD G+ N + Y HE G+ + + +Y T+ F +++
Sbjct: 156 LPTNRGFDEFYGFVNTGIDY--FTHER---YGVPSMFRQTSLTEEDRGEYATELFKREAL 210
Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPD 222
+ S P L + A H ++ + ++ +Q P+
Sbjct: 211 AFLDRAEASEPFLLYLPFNAPHNSSSLDPRI-RSTVQAPE 249
>gi|47230520|emb|CAF99713.1| unnamed protein product [Tetraodon nigroviridis]
Length = 554
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 99/202 (49%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G G+ TPN+DA+A G++ YT P C+PSRAA LTG+ P R G T
Sbjct: 18 GWGDLGVFGQPSKETPNLDAMAAQGMLFPNFYTANPLCSPSRAALLTGRLPVRNGFYTTN 77
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + + E LLPQ LK+ GY + ++GKWH+G ++ + LP GFD +
Sbjct: 78 GHARNAYTPQEIVGGISKDEILLPQMLKKRGYISKIVGKWHLG-HRPQYLPLEHGFDEWL 136
Query: 134 GYWNGYL-TYNDSIHET----DFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
G N + YN+S+ + + L R +M LT + +S+ ++
Sbjct: 137 GAPNCHFGPYNNSVKPNIPVYNNSEMLGRYYEEFRIDRKMGESNLTQMYLLESLDFVRRQ 196
Query: 189 NHS-RPLFL----QITHAAVHT 205
+ RP FL THA V+
Sbjct: 197 AEAQRPFFLYWAPDATHAPVYA 218
>gi|300114943|ref|YP_003761518.1| sulfatase [Nitrosococcus watsonii C-113]
gi|299540880|gb|ADJ29197.1| sulfatase [Nitrosococcus watsonii C-113]
Length = 463
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 93/187 (49%), Gaps = 12/187 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYG---ID 78
G+ DVG +G I TPN+DALA G + H P CTP+RAA LTG Y R G I
Sbjct: 53 GYGDVGCYGNQHIKTPNLDALAKRGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLQIIP 112
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+AKA+P+ E + LK +GYST LIGKWH+G ++ P +GFD + G
Sbjct: 113 KDQRYAMAKAMPLAEITFAEALKAVGYSTALIGKWHLG-DRPSFSPSRQGFDEYFG---- 167
Query: 139 YLTYNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y+ +H + L RN E +T + T+++V I H S P L
Sbjct: 168 -IPYSHDMHPWRKSFPPLPLMRNEEIIELNPDLDDMTQYCTEEAVQFISKHK-SNPFLLY 225
Query: 198 ITHAAVH 204
+ H H
Sbjct: 226 MPHPMPH 232
>gi|149197520|ref|ZP_01874571.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139538|gb|EDM27940.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 446
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 89/185 (48%), Gaps = 6/185 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
GW DV +HG D TP IDA+A G+ + Y + C PSRA LTG+Y +G+ T
Sbjct: 31 GWGDVAYHGVEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSRAGILTGRYQQLFGVVT-- 88
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-WNGYL 140
K +P ++K + + LK GY + GKWH+G K + P +RGFD G+ + +
Sbjct: 89 NGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQ-FPNDRGFDTFYGFHFGAHD 147
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y G + YLT+ TD +V I+ N +P F+ + +
Sbjct: 148 YYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVEFIEE-NKDQPFFMYVAY 206
Query: 201 AAVHT 205
+VH+
Sbjct: 207 NSVHS 211
>gi|421613374|ref|ZP_16054460.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula baltica SH28]
gi|408495968|gb|EKK00541.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula baltica SH28]
Length = 480
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 75/261 (28%), Positives = 115/261 (44%), Gaps = 62/261 (23%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
G+ + G G +IPTP IDALA +G+ Y + C+PSRA FL+G+Y R+G D P
Sbjct: 46 GYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNP 105
Query: 81 VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
G +P +K ++L+ GY T LIGKWH+G + +P ++GFD G+
Sbjct: 106 TGERNNHPIAGLPPQQKTFIEHLQSAGYLTSLIGKWHLGTRPPQ-VPTSKGFDRFFGFLH 164
Query: 136 --------------WN---------GYLTYND--------SIHETDFAVG---LDARRNM 161
W G N I+E D+ G LD +
Sbjct: 165 EGHFYVPGPPFENVWTMLRDNTLPAGQFKTNQRTIRGNYARINEPDYDAGNPMLDGSEPI 224
Query: 162 ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
+ + YLTD TD+++ I + S+P + +++ AVH+ +
Sbjct: 225 DHW------NYLTDTITDKAIDSI-TQTPSKPFAMVVSYNAVHSPMQASL---------- 267
Query: 222 DMEENDRTFAHISNPDRRLFA 242
E+ HI +P RR+FA
Sbjct: 268 ---EDHAAMEHIDDPQRRIFA 285
>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 467
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 71/229 (31%), Positives = 104/229 (45%), Gaps = 30/229 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGID--- 78
G+ D GF G + TP +D LA I ++ Y + C PSRA LTGKY ++G +
Sbjct: 38 GYADFGFQGSKEFKTPELDKLAKKSIKFSQAYVSAAVCGPSRAGILTGKYQQKFGFEENN 97
Query: 79 ------TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
T G +P+ + + YL++LGY T L GKWH G N + P RGFD
Sbjct: 98 VPGYMSTSGLVGDEMGLPLDQITIANYLQDLGYKTALFGKWHQG-NADRFHPTKRGFDEF 156
Query: 133 VGYWNG---YLTYND----SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
G+ G Y+ Y+D S +E G E YLTD +++ I
Sbjct: 157 YGFRGGARSYMPYDDSNPLSKNEDRLERGFGNFLEHE--------GYLTDELAHEAISFI 208
Query: 186 KSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHIS 234
+ N P F+ ++ AVHT A+ L Q P ++ +T A ++
Sbjct: 209 -NRNKKHPFFIYLSFNAVHTPMEATAE---DLEQFPHLKGKRKTLAAMT 253
>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 72/243 (29%), Positives = 109/243 (44%), Gaps = 39/243 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPF--------- 73
G+++ G + + TPN+D LA +G++L++ Y+ CTPSR++FL+G+ P
Sbjct: 52 GFHNFGIRNQTEAKTPNMDKLARDGLLLDQAYSYFWCTPSRSSFLSGRLPLHVFHSNRVS 111
Query: 74 --RYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN 131
+ P AGV +P +P ++++ GY TH++GKW G + P RGFD+
Sbjct: 112 SASWDSQHPDTAGV--GIPRNMTTIPAFMRKAGYKTHMVGKWDAGIATPQHSPLGRGFDS 169
Query: 132 HVGYW---NGYLTYN--DSI--------HETDFAVGLDARRNMERYAPQMSSKYLTDFFT 178
+ Y+ N Y YN D++ + G N A Y F
Sbjct: 170 SLHYFNHDNNYYAYNYTDTVSVQFPVKCQLLKYVTGFVDLWNSTEPADAPIGTYEEHVFR 229
Query: 179 DQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDR 238
D ++ VI H+ S PLFL H LQVP DR F HI +P R
Sbjct: 230 DHALDVISKHDASTPLFLYYASHIAHAP-----------LQVPQAYL-DR-FQHIPDPIR 276
Query: 239 RLF 241
R +
Sbjct: 277 RTY 279
>gi|332263239|ref|XP_003280658.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Nomascus leucogenys]
Length = 528
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 48 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 107
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ VP +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 108 AHARNAYTPQEIVGGVPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 166
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 167 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 226
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 227 ARRHPFFLYWAVDATHAPVYA 247
>gi|351712929|gb|EHB15848.1| N-acetylgalactosamine-6-sulfatase [Heterocephalus glaber]
Length = 482
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 99/202 (49%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA L+G+ P R G T
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLSGRPPIRSGFYTTN 61
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP LKE GY+T ++GKWH+G ++ + P GFD
Sbjct: 62 AHARNAYTPQEIVGGIPDSERLLPSLLKEAGYATKIVGKWHLG-HRPQFHPLKHGFDEWF 120
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + + V D R E + + + LT + +++ IK
Sbjct: 121 GSPNCHFGPYDNKAKPNIPVYKDWEMVGRFYEEFPINVKTGESNLTQIYLQEALDFIKRQ 180
Query: 189 NHS-RPLFL----QITHAAVHT 205
+ RP FL THA V+
Sbjct: 181 QAARRPFFLYWAVDATHAPVYA 202
>gi|87309449|ref|ZP_01091584.1| arylsulfatase A (precursor) [Blastopirellula marina DSM 3645]
gi|87287757|gb|EAQ79656.1| arylsulfatase A (precursor) [Blastopirellula marina DSM 3645]
Length = 478
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 95/201 (47%), Gaps = 28/201 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G D PTP++D LA G + Y T C+ SRA LTG Y R GI +
Sbjct: 36 GYADIGPFGAKDYPTPHLDQLAQEGTICTDFYVTQAVCSASRAGLLTGCYNNRIGILGAL 95
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWNGY 139
G + E L + K+ GY+T GKWH+G + EE LP GFD++VG Y N
Sbjct: 96 GPQSKIGISAEETTLAEICKQKGYATACYGKWHLG-HHEEFLPLQHGFDDYVGLPYSNDM 154
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQM----------------SSKYLTDFFTDQSVH 183
Y+ + L + +RY P + + LT +T+++V
Sbjct: 155 WPYHPELRH------LTKDQQQKRY-PDLPLYEKNEIIDTEVTPEDQRNLTTLYTEKAVK 207
Query: 184 VIKSHNHSRPLFLQITHAAVH 204
I NH++P FL + H+ VH
Sbjct: 208 FIDD-NHAQPFFLYVPHSMVH 227
>gi|449136003|ref|ZP_21771428.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448885345|gb|EMB15791.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 480
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 91/192 (47%), Gaps = 25/192 (13%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QG+ DVG G DI TP +DA+A +G+ Y P C PSRAA +TG YP R
Sbjct: 23 QGYQDVGCFGSPDIRTPRLDAMAKDGMKFTSFYAQPICGPSRAALMTGCYPLRVA----E 78
Query: 82 GAGVAKAVPV---TEKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNHV 133
+ + P+ E + + LK GY+T GKW + + + +LLP +GFD
Sbjct: 79 RGHIKQIHPILHEDEITIAEVLKTKGYATACFGKWDLAKHTQTDFFPDLLPTGQGFD--- 135
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y+ G T ND + + RN E P LT +TD+++ I+ N +P
Sbjct: 136 -YFYGTPTSNDRV--------ANLYRNKELIEPDSDMATLTQRYTDEAISFIE-QNQDQP 185
Query: 194 LFLQITHAAVHT 205
F+ I H HT
Sbjct: 186 FFVYIPHTMPHT 197
>gi|432851909|ref|XP_004067102.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 1
[Oryzias latipes]
Length = 525
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 93/202 (46%), Gaps = 24/202 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G G+ TPN+DA+A GI+ YT P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN-- 131
G + + E LLPQ LKE GY ++GKWH+G ++ + LP GFD
Sbjct: 102 GHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHLG-HRPQYLPLENGFDEWF 160
Query: 132 -----HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYA--PQMSSKYLTDFFTDQSVHV 184
H G +N + N ++ +G R E + + LT + + +
Sbjct: 161 GAPNCHFGPYNNTVRPNIPVYNNSEMLG----RYFEEFKIDKKTGESNLTQMYLEAGLDF 216
Query: 185 IKSHNHS-RPLFLQITHAAVHT 205
I + RP FL A H+
Sbjct: 217 ISRQAEAKRPFFLYWAADATHS 238
>gi|344292944|ref|XP_003418184.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Loxodonta
africana]
Length = 513
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 99/202 (49%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P +E LLP+ LK+ Y+T ++GKWH+G ++ + P GFD
Sbjct: 102 GHARNAYTPQDIVGGIPDSEHLLPELLKKANYATKIVGKWHLG-HRPQFHPLKHGFDEWF 160
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIK-S 187
G N + D+ + + V D R E + + + LT + +++ IK
Sbjct: 161 GSPNCHFGPYDNRAKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220
Query: 188 HNHSRPLFL----QITHAAVHT 205
+ RP FL THA V+
Sbjct: 221 QSQQRPFFLYWAIDATHAPVYA 242
>gi|403255190|ref|XP_003920329.1| PREDICTED: arylsulfatase E isoform 2 [Saimiri boliviensis
boliviensis]
Length = 614
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 71/124 (57%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA G+ L +H + + CTPSRAAFLTG+YP R G+ +
Sbjct: 74 GIGDIGCYGNNTMRTPNIDHLAEFGVKLTQHVSAASLCTPSRAAFLTGRYPIRSGMVSST 133
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G GV +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 134 GHRVLQWTGVPGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193
Query: 131 NHVG 134
+ G
Sbjct: 194 SFYG 197
>gi|254482499|ref|ZP_05095738.1| sulfatase domain protein [marine gamma proteobacterium HTCC2148]
gi|214037190|gb|EEB77858.1| sulfatase domain protein [marine gamma proteobacterium HTCC2148]
Length = 602
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/186 (34%), Positives = 94/186 (50%), Gaps = 10/186 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ND+ + +D PTP +DA+A G+ RHY +CT SR A LTG+YP R G P
Sbjct: 27 GYNDLAINNGSDSPTPRLDAIAAQGVRFTRHYAESSCTASRVALLTGRYPARVGAH-PYL 85
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-YLT 141
G+ + LP L GY H++GKWH G + E P +GFD+ G+ N YL
Sbjct: 86 NGIDHEL----MTLPDALGSEGYIRHMVGKWHTGDSHRESRPEYQGFDHWFGFINQLYLR 141
Query: 142 --YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
+ + + ++ E Q +LTD TD+++ VIK + P FL ++
Sbjct: 142 GPHRSANYRRGKPTYINPWLENELGDLQQYEGHLTDILTDRALDVIKREQN--PWFLYLS 199
Query: 200 HAAVHT 205
+ A HT
Sbjct: 200 YYAPHT 205
>gi|403255188|ref|XP_003920328.1| PREDICTED: arylsulfatase E isoform 1 [Saimiri boliviensis
boliviensis]
Length = 589
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 71/124 (57%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA G+ L +H + + CTPSRAAFLTG+YP R G+ +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDHLAEFGVKLTQHVSAASLCTPSRAAFLTGRYPIRSGMVSST 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G GV +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GHRVLQWTGVPGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 SFYG 172
>gi|432851911|ref|XP_004067103.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 2
[Oryzias latipes]
Length = 523
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 93/202 (46%), Gaps = 24/202 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G G+ TPN+DA+A GI+ YT P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN-- 131
G + + E LLPQ LKE GY ++GKWH+G ++ + LP GFD
Sbjct: 102 GHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHLG-HRPQYLPLENGFDEWF 160
Query: 132 -----HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYA--PQMSSKYLTDFFTDQSVHV 184
H G +N + N ++ +G R E + + LT + + +
Sbjct: 161 GAPNCHFGPYNNTVRPNIPVYNNSEMLG----RYFEEFKIDKKTGESNLTQMYLEAGLDF 216
Query: 185 IKSHNHS-RPLFLQITHAAVHT 205
I + RP FL A H+
Sbjct: 217 ISRQAEAKRPFFLYWAADATHS 238
>gi|251798133|ref|YP_003012864.1| sulfatase [Paenibacillus sp. JDR-2]
gi|247545759|gb|ACT02778.1| sulfatase [Paenibacillus sp. JDR-2]
Length = 434
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 96/186 (51%), Gaps = 5/186 (2%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G + + TP++D LA GI Y+ P C+PSRA+ LTGKYP + G+ + +
Sbjct: 15 GYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLLTGKYPAKAGVTSIL 74
Query: 82 GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G K + + + L LKE GY T L GKWH+G + E P GFD G+ G +
Sbjct: 75 GGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASA-EYGPNAHGFDQFYGFRAGCI 133
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--SSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y I G++ ++ R ++ + +Y+T+ T ++ I + P F+ +
Sbjct: 134 DYYSHIFYWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREATSYIDAAPDDEPYFMYV 193
Query: 199 THAAVH 204
+ A H
Sbjct: 194 AYNAPH 199
>gi|189053665|dbj|BAG35917.1| unnamed protein product [Homo sapiens]
Length = 589
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 72/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGPPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|126304968|ref|XP_001376926.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Monodelphis
domestica]
Length = 520
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 73/231 (31%), Positives = 103/231 (44%), Gaps = 30/231 (12%)
Query: 3 TPVGAGVAKAVPVTEKLL--PQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTC 59
P+GAG P LL GW D+G GE TP++D +A G++ YT P C
Sbjct: 17 APLGAGATSQPPNIVFLLMDDMGWGDLGVFGEPSRETPHLDQMAAEGMLFPNFYTANPLC 76
Query: 60 TPSRAAFLTGKYPFRYGIDTPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIG 111
+PSRAA LTG+ P R G T G + +P +E LLP+ LK+ GY ++G
Sbjct: 77 SPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVG 136
Query: 112 KWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAP----- 166
KWH+G ++ + P GFD G N + D+ + V RN E
Sbjct: 137 KWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNKAMPNIPV----YRNWEMVGRFYEDF 191
Query: 167 ----QMSSKYLTDFFTDQSVHVIKSHN-HSRPLFL----QITHAAVHTGTA 208
+ LT + ++V IK H +P FL THA V+ +
Sbjct: 192 PINHKTGEANLTQIYLKEAVDFIKKQQAHQQPFFLYWAIDATHAPVYASKS 242
>gi|791004|emb|CAA58556.1| ARSE [Homo sapiens]
Length = 589
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 73/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GF+
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFE 168
Query: 131 NHVG 134
+ G
Sbjct: 169 HFYG 172
>gi|430745365|ref|YP_007204494.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430017085|gb|AGA28799.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 476
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/194 (34%), Positives = 95/194 (48%), Gaps = 24/194 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI---- 77
G+ D+ +G D+ TPNIDAL +G+ +R Y P C+P+RAA LTG YP G+
Sbjct: 42 GYGDLSSYGAADLKTPNIDALVASGVRFDRFYANSPVCSPTRAALLTGCYPDLVGVPGVI 101
Query: 78 ----DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
D G +AV LLPQ LK GY T L+GKWH+G + LP RGFD
Sbjct: 102 RTHPDDSWGVLSPQAV-----LLPQVLKGAGYHTALVGKWHLGLSGAS-LPSRRGFDLFH 155
Query: 134 GYWNGYLT--YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
G+ + +N H ++ D + + +A + S++ DF + S
Sbjct: 156 GFLGDMMDDYHNHRRHGINYMRRDDREIDPKGHATDLFSQWAIDFLNE-------SKGQD 208
Query: 192 RPLFLQITHAAVHT 205
RP FL++ + HT
Sbjct: 209 RPFFLELAYNVPHT 222
>gi|254435647|ref|ZP_05049154.1| sulfatase, putative [Nitrosococcus oceani AFC27]
gi|207088758|gb|EDZ66030.1| sulfatase, putative [Nitrosococcus oceani AFC27]
Length = 463
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 94/187 (50%), Gaps = 12/187 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYG---ID 78
G+ DVG +G I TPN+DALA G + H P CTP+RAA LTG Y R G I
Sbjct: 53 GYGDVGCYGNQHIKTPNLDALAKKGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLHIIP 112
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+AKA+ + E + LK +GYST L+GKWH+G ++ LP +GFD + G
Sbjct: 113 KDQRYAMAKAMSLEEITFAEALKSVGYSTALVGKWHLG-DRPAFLPPRQGFDEYFG---- 167
Query: 139 YLTYNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y+ +H + L R E +LT + T+++V I S N RP L
Sbjct: 168 -IPYSHDMHPWRKSFPPLPLMRGEEIVELNPDLDHLTQYCTEEAVKFI-SKNKDRPFLLY 225
Query: 198 ITHAAVH 204
+ H H
Sbjct: 226 MPHPMPH 232
>gi|77164258|ref|YP_342783.1| sulfatase [Nitrosococcus oceani ATCC 19707]
gi|76882572|gb|ABA57253.1| Sulfatase [Nitrosococcus oceani ATCC 19707]
Length = 440
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 68/187 (36%), Positives = 94/187 (50%), Gaps = 12/187 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYG---ID 78
G+ DVG +G I TPN+DALA G + H P CTP+RAA LTG Y R G I
Sbjct: 30 GYGDVGCYGNQHIKTPNLDALAKKGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLHIIP 89
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+AKA+ + E + LK +GYST L+GKWH+G ++ LP +GFD + G
Sbjct: 90 KDQRYAMAKAMSLEEITFAEALKSVGYSTALVGKWHLG-DRPAFLPPRQGFDEYFG---- 144
Query: 139 YLTYNDSIHETDFAV-GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
+ Y+ +H + L R E +LT + T+++V I S N RP L
Sbjct: 145 -IPYSHDMHPWRKSFPPLPLMRGEEIVELNPDLDHLTQYCTEEAVKFI-SKNKDRPFLLY 202
Query: 198 ITHAAVH 204
+ H H
Sbjct: 203 MPHPMPH 209
>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
Length = 472
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 72/229 (31%), Positives = 107/229 (46%), Gaps = 30/229 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+GF G I TP++D LA NG+ + Y + C+PSRA F+TG +G D +
Sbjct: 46 GYGDLGFTGSTQIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAGFITGINQVEFGHDNNL 105
Query: 82 GAGVA-------KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
AGV +P+++K + +L +LGY LIGKWH+G + + P RGFD G
Sbjct: 106 -AGVEPGFDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLG-KEPQFHPLKRGFDEFWG 163
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNME-RYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y G Y +S+ G + +E + Y+TD ++SV I+ H P
Sbjct: 164 YTGGGHDYFESLPN-----GKGYKEPLESNFKTPDPITYITDDVGNESVDFIERHK-DEP 217
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
FL A HT +EE+ + HI + RR +A
Sbjct: 218 FFLFAAFNAPHTPMQA-------------LEEDLALYQHIEDKKRRTYA 253
>gi|402909420|ref|XP_003917419.1| PREDICTED: arylsulfatase E [Papio anubis]
Length = 687
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 72/124 (58%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 147 GIGDIGCYGNTTMRTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 206
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 207 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 266
Query: 131 NHVG 134
+ G
Sbjct: 267 HFYG 270
>gi|119587161|gb|EAW66757.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
mucopolysaccharidosis type IVA), isoform CRA_a [Homo
sapiens]
Length = 708
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 221 ARHHPFFLYWAVDATHAPVYA 241
>gi|422293430|gb|EKU20730.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
gi|422295486|gb|EKU22785.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
Length = 703
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 72/229 (31%), Positives = 107/229 (46%), Gaps = 25/229 (10%)
Query: 23 GWNDVGFHGENDIP----TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
G DVG++ + P TP +D+LA + L +Y P CTP+RAA LTG+Y G+
Sbjct: 73 GVQDVGYNASPESPLRGKTPVLDSLAAESVRLKEYYVHPVCTPTRAALLTGRYAVNVGMP 132
Query: 79 TPVGAGVAKAVPVTEKLLPQYLK-ELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
P+ + + LP+ LK E YSTHL+GKWH+G K + P RGFD+ G
Sbjct: 133 FPLIGDAISGLDGSIPTLPEMLKSEANYSTHLVGKWHLGAAKAKNRPLARGFDSFYGLLG 192
Query: 138 GYLT-YNDSIHETDFAVGLDARRN-MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHS--- 191
Y + E D +N E A ++ K + T F+ ++V VI+ H+
Sbjct: 193 ASFDHYTKKMGEVR-----DLWKNEAEVPAKEVDEKEHATTLFSREAVKVIEEHSARGHA 247
Query: 192 -------RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHI 233
PLFL + ++A H + K VP+ + RTF +
Sbjct: 248 GAKDGDMDPLFLYLAYSAPHAPLQADEKFMKLCSDVPN--RHRRTFCAM 294
>gi|426242290|ref|XP_004015007.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Ovis aries]
Length = 522
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 97/202 (48%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ YT P C+PSRAA LTG+ P R G T
Sbjct: 41 GWGDLGVYGEPSRETPNLDQMATEGMLFPNFYTANPLCSPSRAALLTGRLPIRSGFYTTN 100
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P +E LLP LK GY++ ++GKWH+G ++ + P GFD
Sbjct: 101 GHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLG-HRPQFHPLKHGFDEWF 159
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ I+
Sbjct: 160 GSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANLTQIYLQEALEFIQRQ 219
Query: 189 NHS-RPLFL----QITHAAVHT 205
+ RP FL THA V+
Sbjct: 220 QAAHRPFFLYWAVDATHAPVYA 241
>gi|380796101|gb|AFE69926.1| N-acetylgalactosamine-6-sulfatase precursor, partial [Macaca
mulatta]
Length = 503
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 23 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 82
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 83 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 141
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 142 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 201
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 202 ARHHPFFLYWAVDATHAPVYA 222
>gi|441523101|ref|ZP_21004735.1| arylsulfatase [Gordonia sihwensis NBRC 108236]
gi|441457320|dbj|GAC62696.1| arylsulfatase [Gordonia sihwensis NBRC 108236]
Length = 783
Score = 97.1 bits (240), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 64/197 (32%), Positives = 103/197 (52%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++DVG G +IPTPNID LA +G L+ ++T P C+P+RAA LTG P R G +
Sbjct: 56 GYSDVGPFGA-EIPTPNIDRLARSGFRLSNYHTTPVCSPARAALLTGVNPHRAGYGSVAN 114
Query: 80 --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE-------ELLPFNRGFD 130
P G+ + LP+ L+E GY+T +GKWH+ + + + P RGFD
Sbjct: 115 SDPGFPGLRLELADDVLTLPEILRESGYATFAVGKWHLVRDADMSPGRSRKSWPLQRGFD 174
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---KS 187
++ G G +S + + ++ +++ Y Y+TD TD+++ I ++
Sbjct: 175 SYYGSLEGL----NSFFNPNQLIADNSVVDVDEYP---DGYYVTDDLTDRAIDQITALRA 227
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL H A+H
Sbjct: 228 HDSDKPFFLYFAHIAMH 244
>gi|4503899|ref|NP_000503.1| N-acetylgalactosamine-6-sulfatase precursor [Homo sapiens]
gi|462148|sp|P34059.1|GALNS_HUMAN RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|618426|gb|AAC51350.1| N-acetylgalactosamine 6-sulphatase [Homo sapiens]
gi|870751|dbj|BAA04535.1| N-acetylgalactosamine 6-sulfate sulfatase [Homo sapiens]
gi|33440495|gb|AAH56151.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
gi|37589093|gb|AAH50684.2| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
gi|119587163|gb|EAW66759.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
mucopolysaccharidosis type IVA), isoform CRA_c [Homo
sapiens]
Length = 522
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 221 ARHHPFFLYWAVDATHAPVYA 241
>gi|336424342|ref|ZP_08604383.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336003446|gb|EGN33530.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 460
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 62/184 (33%), Positives = 87/184 (47%), Gaps = 16/184 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG-IDTP 80
G+ D G + TPN+D L G ++ Y P C P+RAA LTG+YP R G +DT
Sbjct: 19 GYGDFGIFSDGSARTPNLDRLVRQGCAMSHCYAASPVCAPARAALLTGRYPHRTGAVDTY 78
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G + + E L + GY T LIGKWH+G +E P RGFD +G+ G+
Sbjct: 79 EAIG-GDRMALREVTLADVYRANGYRTGLIGKWHLGLIGKEYHPCRRGFDTFIGFRGGWS 137
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y + LD +E Y+TD T++S+ I+ N +P FL +
Sbjct: 138 DY--------YQYKLDRNGILE----ASDGTYMTDVITEESIRFIRE-NREQPFFLHAAY 184
Query: 201 AAVH 204
A H
Sbjct: 185 NAPH 188
>gi|406661522|ref|ZP_11069640.1| Arylsulfatase [Cecembia lonarensis LW9]
gi|405554671|gb|EKB49747.1| Arylsulfatase [Cecembia lonarensis LW9]
Length = 477
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 94/186 (50%), Gaps = 4/186 (2%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
QG++DVG +G +DI TP++D LA G+ Y C+ SRAA LTG YP R GI
Sbjct: 41 QGYHDVGVYGASDIETPHLDQLASEGLQFTNFYVAQAVCSASRAALLTGTYPNRLGIHGA 100
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ + E + LK LGY+T + GKWH+G + E LP N+GFD + G
Sbjct: 101 LDHSSKHGLHPEEATIADLLKPLGYATAVFGKWHLG-HHPEFLPTNQGFDEYFGIPYSND 159
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYL-TDFFTDQSVHVIKSHNHSRPLFLQIT 199
+ + D+ L +N + + + + T +FT++S+ I+ N RP FL +
Sbjct: 160 MWPNHPQTKDYYPPLPIYQNDKVVDTIWNDQSMFTTWFTEKSIDFIE-RNKDRPFFLYLA 218
Query: 200 HAAVHT 205
H H
Sbjct: 219 HPMPHV 224
>gi|395508489|ref|XP_003758543.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Sarcophilus harrisii]
Length = 492
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 97/207 (46%), Gaps = 19/207 (9%)
Query: 20 LPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID 78
L GW D+G GE TP++D +A G++ YT P C+PSRAA LTG+ P R G
Sbjct: 31 LQMGWGDLGVFGEPSKETPHLDQMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFY 90
Query: 79 TPVGAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
T + +P +E LLP+ LK+ GY ++GKWH+G ++ + P GFD
Sbjct: 91 TTNAHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVGKWHLG-HRPQFHPLKHGFD 149
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVI 185
G N + D+ + V + R E + + + LT + ++V I
Sbjct: 150 EWFGAPNCHFGPYDNKARPNIPVYRNWEMVGRFFEDFPINLKTGEANLTQIYLQEAVDFI 209
Query: 186 KSHNHSRPLFL----QITHAAVHTGTA 208
K H +P FL THA V+ +
Sbjct: 210 KQQAHQQPFFLYWAVDATHAPVYASKS 236
>gi|410215590|gb|JAA05014.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410254514|gb|JAA15224.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410288780|gb|JAA22990.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410330541|gb|JAA34217.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
Length = 522
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 221 ARHHPFFLYWAVDATHAPVYA 241
>gi|426383228|ref|XP_004058189.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gorilla gorilla
gorilla]
Length = 528
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 48 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 107
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 108 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 166
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 167 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 226
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 227 ARHHPFFLYWAVDATHAPVYA 247
>gi|189069200|dbj|BAG35538.1| unnamed protein product [Homo sapiens]
Length = 522
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVYGEPSRETPNLDRMAAGGLLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 221 ARHHPFFLYWAVDATHAPVYA 241
>gi|149199717|ref|ZP_01876749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
gi|149137234|gb|EDM25655.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
Length = 486
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 93/197 (47%), Gaps = 13/197 (6%)
Query: 27 VGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----DTP-- 80
+ +G DI TPNIDALA G++ N Y++P+CTPSR LTGKYPFR G D P
Sbjct: 48 ISCYGAEDIKTPNIDALAAGGMIFNNAYSMPSCTPSRTTLLTGKYPFRTGYVNHWDVPRW 107
Query: 81 -VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR-GFDNHVGYWNG 138
+G K P T + +K+LGY T GKW + + E L + GFD+ W G
Sbjct: 108 GIGYFDWKQKPNTT--FARLMKDLGYRTFATGKWQLNDFRLEPLAMQKHGFDDWA-MWTG 164
Query: 139 YLTYNDSIHETDFAVG-LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQ 197
T D HE +A N + + ++ D +TD ++ ++ N +P+ +
Sbjct: 165 CETSKDKTHEKKSTQRYWNAHINTKEGSKTYKGQFGPDLYTDHLINFMRK-NKDKPMCIY 223
Query: 198 ITHAAVHTGTAGNAKLP 214
HT A P
Sbjct: 224 YPMVLPHTPVAATPDEP 240
>gi|296234833|ref|XP_002762635.1| PREDICTED: arylsulfatase E [Callithrix jacchus]
Length = 449
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 71/124 (57%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA G+ L +H + + CTPSRAAFLTG+YP R G+ + V
Sbjct: 49 GIGDIGCYGNNTMRTPNIDRLAEFGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSV 108
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
G G + +P E + LKE GY+T LIGKWH+G N E P + GFD
Sbjct: 109 GYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESAGDHCHHPLHHGFD 168
Query: 131 NHVG 134
G
Sbjct: 169 YFYG 172
>gi|355710478|gb|EHH31942.1| hypothetical protein EGK_13112 [Macaca mulatta]
Length = 482
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 61
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 62 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 120
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 121 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 180
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 181 ARHHPFFLYWAVDATHAPVYA 201
>gi|344237970|gb|EGV94073.1| N-acetylgalactosamine-6-sulfatase [Cricetulus griseus]
Length = 483
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 99/201 (49%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 2 GWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTSN 61
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GFD
Sbjct: 62 GHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 120
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + + LT + +++ I++
Sbjct: 121 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQ 180
Query: 188 HNHSRPLFL----QITHAAVH 204
H P FL THA V+
Sbjct: 181 HARQSPFFLYWAIDATHAPVY 201
>gi|167523060|ref|XP_001745867.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775668|gb|EDQ89291.1| predicted protein [Monosiga brevicollis MX1]
Length = 221
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 59/185 (31%), Positives = 97/185 (52%), Gaps = 14/185 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+ F G I TPNIDAL G++ + Y C+PSRA+ L+G+Y +G+ +
Sbjct: 41 GYDDLYFRGHQ-IRTPNIDALQEEGLLFTQMYMQDVCSPSRASILSGRYAMHHGVTDWIP 99
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ + + + L ++E GY T +GKWH+G K P RGF++ +GY++G Y
Sbjct: 100 PRDSYGLMLNDTTLADKMREAGYDTRAVGKWHMGFYKWAYTPTFRGFNSFLGYYSGGEDY 159
Query: 143 NDSIHETDFAVGLDARRNMERY--------APQMSSKYLTDFFTDQSVHVIKSHNHSR-P 193
HETD A D R+ R+ A + +Y T F+++++ +I + P
Sbjct: 160 --FTHETDNAY--DMHRDEGRHCGPNCSIPAWDLKGQYSTTIFSEEAIRIINQRQAADPP 215
Query: 194 LFLQI 198
LFL +
Sbjct: 216 LFLYL 220
>gi|397468273|ref|XP_003805816.1| PREDICTED: N-acetylgalactosamine-6-sulfatase isoform 1 [Pan
paniscus]
Length = 482
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 96/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 61
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 62 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 120
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 121 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 180
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 181 ARHHPFFLYWAVDATHAPVYA 201
>gi|371776857|ref|ZP_09483179.1| sulfatase [Anaerophaga sp. HS1]
Length = 542
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 101/206 (49%), Gaps = 39/206 (18%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G +G +I TPNID LA GI + Y C PSRA+ LTG YP R GI+
Sbjct: 41 GYSDLGCYG-GEIHTPNIDQLASQGIRFTQMYNTARCCPSRASLLTGHYPHRAGIN---- 95
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNK------EEL-------------- 122
G+ + + + + LKE GY T + GKWH+ K E+L
Sbjct: 96 -GMGVNLSMNTATIAEVLKENGYHTGMTGKWHLSETKPLDDPTEQLRWLAHRVDYGSFSP 154
Query: 123 ---LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
P NRGFD H G G + Y D F++ + + E P+ Y+TDF T+
Sbjct: 155 LENYPCNRGFDEHWGVIWGVVNYFDP-----FSLVHNEKPIKE--VPE--DFYMTDFITE 205
Query: 180 QSVHVIKSHNH-SRPLFLQITHAAVH 204
+S+ +I S++ +P FL + H A H
Sbjct: 206 KSIELIDSYSKDDKPFFLYVAHTAPH 231
>gi|406661473|ref|ZP_11069592.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
gi|405554747|gb|EKB49822.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
Length = 478
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 92/184 (50%), Gaps = 14/184 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G +DI TPNID +A GI + P C+PSRA LTG+ P R GI+T
Sbjct: 58 GYGDLGCFGASDIATPNIDRIAAEGIKFTSFLSASPVCSPSRAGLLTGRMPQRMGINTVF 117
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ E + + LK GY T ++GKWH+G + E LP N+GF + G +
Sbjct: 118 FPESLTGMDPEEITIAEILKTKGYRTGIVGKWHLG-HLERFLPLNQGFYEYFG-----IP 171
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y++ + + R E A + +Y+T +T++S+ I + +P FL + H
Sbjct: 172 YSNDMASVVYM------RGNEVEAYHVDQRYMTRTYTEESLKFIDASG-DQPFFLYLAHN 224
Query: 202 AVHT 205
H
Sbjct: 225 MPHV 228
>gi|340619482|ref|YP_004737935.1| sulfatase [Zobellia galactanivorans]
gi|339734279|emb|CAZ97656.1| Sulfatase, family S1-17 [Zobellia galactanivorans]
Length = 586
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 97/192 (50%), Gaps = 21/192 (10%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTP 80
QG+ D+G++G + TP IDA A + + P C P+RAA +TG++P R G+ DT
Sbjct: 39 QGFGDLGYYGNPHVKTPTIDAFARESVRFDEFIVSPVCAPTRAALMTGRHPLRTGVRDTY 98
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G + +T L + LK+ GY+T ++GKWH+G N P ++GFD + + +G +
Sbjct: 99 RGGAIMSTNEIT---LAEMLKQEGYATGMVGKWHLGDNYPS-RPQDQGFDFTLRHLSGGI 154
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQM--------SSKYLTDFFTDQSVHVIKSHNHSR 192
T A+R+ + P + S Y +D FTD ++ I N ++
Sbjct: 155 GQPGDWPNT-------AKRDSSYFNPVLWKNGEMFQSEGYCSDVFTDVAIDFI-DQNKAK 206
Query: 193 PLFLQITHAAVH 204
P FL + + A H
Sbjct: 207 PFFLYLAYNAPH 218
>gi|449136530|ref|ZP_21771910.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
gi|448884847|gb|EMB15319.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
Length = 480
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 76/257 (29%), Positives = 114/257 (44%), Gaps = 54/257 (21%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
G+ + G G +IPTP IDALA +G+ Y + C+PSRA F++G+Y R+G D P
Sbjct: 46 GYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAGFMSGRYQSRFGYDLNP 105
Query: 81 VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
G +P +K ++L+ GY T LIGKWH+G + +P ++GFD G+
Sbjct: 106 TGERNNHPNAGLPPQQKTFVEHLQSAGYHTSLIGKWHLGTRPPQ-VPTSKGFDRFFGFLH 164
Query: 136 --------------WNGYLTYNDSIHETDFAV------GLDARRNMERY----------A 165
W + ++S+ F G AR N Y
Sbjct: 165 EGHFYVPGPPYENVWT--MLRDNSLPAGQFETNQRTIRGNYARINEPAYDTGNPVLDGGE 222
Query: 166 PQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEE 225
P YLTD TD++V I S S P + +++ AVH+ + E
Sbjct: 223 PIDDWNYLTDTITDKAVDTI-SQAASNPFAMVVSYNAVHSPMQASL-------------E 268
Query: 226 NDRTFAHISNPDRRLFA 242
+ HI++P RR+FA
Sbjct: 269 DHAAMDHIADPQRRIFA 285
>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
Length = 457
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 93/201 (46%), Gaps = 31/201 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRY------- 75
G D G +G + TP+ID LA G+ Y P C+P+RA+ +TGK+P R
Sbjct: 43 GCKDAGCYGATNFSTPHIDRLANQGMRFTDAYAAPVCSPTRASLMTGKHPARLHLTNFIP 102
Query: 76 --GIDTPVGA----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNK-EELLPFNRG 128
G P G G +P+ EK + Q L GY +IGKWH+G E P NRG
Sbjct: 103 QIGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKWHLGEEHGPEYRPQNRG 162
Query: 129 FD-----NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH 183
FD H G +N + + D + +A L P YL D TD+++
Sbjct: 163 FDRVVLSEHHGIFNYFYPFVDQ-QKWPYAGPL----------PGNPGDYLPDRLTDEAID 211
Query: 184 VIKSHNHSRPLFLQITHAAVH 204
++ N RP FL ++H +VH
Sbjct: 212 FVRE-NRERPFFLYLSHWSVH 231
>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
Length = 466
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/231 (29%), Positives = 111/231 (48%), Gaps = 32/231 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGI---- 77
G+ D G +G DIPTP+IDALA G+ N + T C PSRA L G+Y R G
Sbjct: 38 GYEDFGCYGSQDIPTPHIDALAKGGVRFTNSYVTASVCAPSRAGLLMGQYQQRSGFEHNV 97
Query: 78 -DTPVGAGVAKAVPVTE--KLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
D P + + +++ + + ++ GY T IGKWH G N+ + P ++GF++ G
Sbjct: 98 SDLPADGYQMQDIGLSDTVRTIADQMQSNGYETMAIGKWHQG-NETKHHPLHKGFNHFFG 156
Query: 135 YWNGYLTY---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
+ G+ ++ +I + + + N + YLTD FTD+++ ++
Sbjct: 157 FIGGHRSFFPIRTAIKQEEKIL------NDYTEVDEKDVYYLTDMFTDKAISYMR-QKRD 209
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P F+ +++ AVHT P L Q FAH+ + RR +A
Sbjct: 210 KPYFIYLSYNAVHTPVEAT---PQKLAQ----------FAHLKDAQRRSYA 247
>gi|374619517|ref|ZP_09692051.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374302744|gb|EHQ56928.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 556
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 113/237 (47%), Gaps = 59/237 (24%)
Query: 23 GWNDVG-FHGEND---IPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI 77
G+ND+ F G D + TP+ID LA +G+V + Y+ TC PSRA +TG+YP R G
Sbjct: 76 GYNDISTFGGGLDGGRVKTPHIDQLAADGVVFTQSYSGAGTCAPSRAMLMTGRYPTRTGF 135
Query: 78 D-TPVGAGVA--------------------------------KAVPVTEKLLPQYLKELG 104
+ TP +G+A + +P E + + LKE G
Sbjct: 136 EFTPTPSGMAPMLSRISAEMGRGTPSMIYDAALDESKPPYEQQGLPPEEVTIAEILKERG 195
Query: 105 YSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLD-------A 157
Y+T IGKWH+G ++ + P +GFD + +G D + + + D A
Sbjct: 196 YATFHIGKWHLG-RQDGMAPHEQGFDQSLLMASGLFLPEDDPNVVNAKLDFDPIDQFLWA 254
Query: 158 RR---------NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
R + +R+ P YLTD++TD+S+++I + N +RP FL + H VHT
Sbjct: 255 RMAFANSFNSGDQDRFEP---GGYLTDYWTDESINIINA-NKNRPFFLYLGHWGVHT 307
>gi|332663783|ref|YP_004446571.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332332597|gb|AEE49698.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 550
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 98/191 (51%), Gaps = 14/191 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
G++D+G +G ++I TPNID LAY G+ L Y C P+RA+ +TG+YP + G+ D
Sbjct: 41 GYSDLGAYG-SEIQTPNIDKLAYEGLRLREFYNNSICAPTRASLITGQYPHKAGVGYFDV 99
Query: 80 PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+G + E L + L++ GYST L GKWH+G N P RGFD G G
Sbjct: 100 NLGIPPYQGYLNKESLTFGEVLRQAGYSTLLSGKWHVG-NDSLHWPKQRGFDRFFGVIGG 158
Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNH-SRP 193
Y D+ + V L+ + +R P+ +S Y TD T+ +V + N +P
Sbjct: 159 GSNYFDAEPMPLGRQYPVVILE---DNQRQKPKANSYYFTDEITNHAVQFLDEQNKMDKP 215
Query: 194 LFLQITHAAVH 204
FL + + A H
Sbjct: 216 FFLYLAYTAPH 226
>gi|149177395|ref|ZP_01855999.1| arylsulfatase A [Planctomyces maris DSM 8797]
gi|148843728|gb|EDL58087.1| arylsulfatase A [Planctomyces maris DSM 8797]
Length = 474
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 96/200 (48%), Gaps = 22/200 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDT-- 79
G+ D+G G I TP +D +A G+ + Y+ P CTPSRAA LTG+YP R G+ +
Sbjct: 48 GYGDLGCFGHPTIKTPALDQMAAEGMKFTQFYSAAPVCTPSRAALLTGRYPIRSGMCSDK 107
Query: 80 -----PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
P G +P +E L + LK GY T +GKWH+G + + LP N GFD++ G
Sbjct: 108 RRVLFPNSGG---GIPASEVTLAEALKAAGYKTACVGKWHLG-HLPQFLPTNNGFDSYFG 163
Query: 135 --YWNGYLTYNDSIHETDFAVGLDAR-------RNMERYAPQMSSKYLTDFFTDQSVHVI 185
Y N D H + + + RN E +T +T++++ +I
Sbjct: 164 IPYSNDMDRVADRKHGRSIFLKPEVKFWNVPLMRNTEVVELPADQTTITKRYTEEAIKLI 223
Query: 186 KSHNHSRPLFLQITHAAVHT 205
+ N +P F+ + H H
Sbjct: 224 Q-QNKQQPFFIYLAHNMPHV 242
>gi|423226077|ref|ZP_17212543.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392630595|gb|EIY24583.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 483
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 105/210 (50%), Gaps = 22/210 (10%)
Query: 5 VGAGVAKAVPV-TEK-------LLPQGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL 56
VG +A P +EK + G++DV +GE TPNIDALA GI Y
Sbjct: 19 VGVSCTEATPTKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAA 78
Query: 57 -PTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
P +PSRA FLTG+YP R GI + E + + LK GY+T IGKWH+
Sbjct: 79 SPISSPSRAGFLTGRYPARMGIQGVFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHL 138
Query: 116 GCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTD 175
G ++E+ LP +GFD + G + Y++ + + G +E + +++ +T
Sbjct: 139 G-SREKYLPLQQGFDEYFG-----IPYSNDMSAQVYLRG----NEVEEFHIDINN--VTK 186
Query: 176 FFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
+T+++V I+ +P FL + H+ +H
Sbjct: 187 KYTEEAVDYIR-RKADQPFFLFLAHSMMHV 215
>gi|343086062|ref|YP_004775357.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354596|gb|AEL27126.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 444
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 70/193 (36%), Positives = 95/193 (49%), Gaps = 17/193 (8%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTP 80
QG+ D+G +G D TP++D LA GI Y T CTPSRA LTG+YP R +
Sbjct: 44 QGYADLGVYGAEDFETPHLDQLASEGIRFTNFYVPATVCTPSRAGLLTGQYPKRSNLHEA 103
Query: 81 VGAGVAKAVPVTE-KLLPQ------YLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
V P +E L PQ LK GYST IGKWH+G +K+E +P+N+GFD
Sbjct: 104 V------LFPYSEGGLSPQAFTMAELLKGAGYSTACIGKWHLG-HKDEYMPYNQGFDTFY 156
Query: 134 GYWNGYLTYNDSIHETDF-AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
G N DF + L N + +YLT +T+++V IK+ +
Sbjct: 157 GVPYSNDMDNYYYKNIDFQSPPLPFYENTKVIENGSDQRYLTKRYTEETVKRIKNRGE-K 215
Query: 193 PLFLQITHAAVHT 205
P F+ + H HT
Sbjct: 216 PFFIYLAHNMPHT 228
>gi|410912979|ref|XP_003969966.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Takifugu
rubripes]
Length = 519
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 99/206 (48%), Gaps = 28/206 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G G+ TPN+DA+A G++L YT P C+PSRAA LTG+ P R G T
Sbjct: 38 GWGDLGAFGQPSKETPNLDAMAAQGMLLLNFYTANPLCSPSRAALLTGRLPVRNGFYTTN 97
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN-- 131
G + + E LLPQ LK+ GY ++GKWH+G ++ + LP GFD
Sbjct: 98 GHARNAYTPQEIVGGISKDEILLPQMLKKRGYFNKIVGKWHLG-HRPQYLPLEHGFDEWF 156
Query: 132 -----HVGYWNGYLTYNDSIHETDFAVGLDARRNMERYA--PQMSSKYLTDFFTDQSVHV 184
H G +N + N ++ + +G R E + + LT + + +
Sbjct: 157 GAPNCHFGPYNNSVRPNIPVYRNSWMLG----RYYEEFKIDKKTGESNLTQMYLLEGLDF 212
Query: 185 IKSHNHS-RPLFL----QITHAAVHT 205
I+S + +P FL THA V+
Sbjct: 213 IQSQAEAQKPFFLYWAPDATHAPVYA 238
>gi|297284666|ref|XP_002802639.1| PREDICTED: hypothetical protein LOC697850 [Macaca mulatta]
Length = 1113
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/200 (31%), Positives = 95/200 (47%), Gaps = 19/200 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
GW D+G +GE TPN+D +A G + Y+ P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 81 -------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 160
Query: 134 GYWNGYLTYNDSIHETDFAVGLD---ARRNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 220
Query: 189 NHSRPLFL----QITHAAVH 204
P FL THA V+
Sbjct: 221 ARHHPFFLYWAVDATHAPVY 240
>gi|440718712|ref|ZP_20899155.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SWK14]
gi|436436039|gb|ELP29830.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SWK14]
Length = 480
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 114/261 (43%), Gaps = 62/261 (23%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID-TP 80
G+ + G G +IPTP IDALA +G+ Y + C+PSRA FL+G+Y R+G D P
Sbjct: 46 GYGETGMMGNAEIPTPAIDALAQSGVRCTSGYVTSSYCSPSRAGFLSGRYQSRFGYDLNP 105
Query: 81 VGAG---VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY-- 135
G +P +K ++L+ GY T LIGKWH+G + +P ++GFD G+
Sbjct: 106 TGERNNHPNAGLPPQQKTFVEHLQSAGYQTSLIGKWHLGTRPPQ-VPTSKGFDRFFGFLH 164
Query: 136 --------------W---------NGYLTYND--------SIHETDFAVG---LDARRNM 161
W G N I+E D+ G LD +
Sbjct: 165 EGHFYVPGPPFENVWTMLRDNTLPTGRFETNQRTIRGNYARINEPDYDAGNPMLDDSEPI 224
Query: 162 ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
+ + YLTD T +++ I + S+P + +++ AVH+ +
Sbjct: 225 DHW------NYLTDTITAKAIDAI-TQTASKPFAMVVSYNAVHSPMQASL---------- 267
Query: 222 DMEENDRTFAHISNPDRRLFA 242
E+ HI +P RR+FA
Sbjct: 268 ---EDHAAMEHIGDPQRRIFA 285
>gi|223940482|ref|ZP_03632332.1| sulfatase [bacterium Ellin514]
gi|223890844|gb|EEF57355.1| sulfatase [bacterium Ellin514]
Length = 635
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 94/195 (48%), Gaps = 19/195 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ D+G G TPN+D +A G+ L Y P CTPSRA LTG Y R + +
Sbjct: 36 GYGDIGPFGSTLNRTPNLDRMAKEGMKLTSFYAAPLCTPSRAQILTGCYAKRVSLPKVLS 95
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ E+ + + LK GY+T IGKWH+G + E LP GFD+++G L Y
Sbjct: 96 PRSEVGLNTNEQTVAKLLKRQGYATMAIGKWHVG-DAPENLPTRHGFDHYLG-----LPY 149
Query: 143 NDSIHETDFAVGLDARRNMERYAPQMSSKY------------LTDFFTDQSVHVIKSHNH 190
++ + + A+R P + + LT+ +TD++V I++ N
Sbjct: 150 SNDMGGEEPGKDQPAKRGARPPLPLVRDEQVIEVVKPADQDRLTERYTDEAVKFIRA-ND 208
Query: 191 SRPLFLQITHAAVHT 205
+P FL + H AVH
Sbjct: 209 KQPFFLYLAHTAVHA 223
>gi|344308474|ref|XP_003422902.1| PREDICTED: arylsulfatase E [Loxodonta africana]
Length = 626
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G DVG +G + TPNI+ LA +G+ L +H PTCTPSRAAFLTG+YP R G+ +
Sbjct: 86 GIGDVGCYGNTTLRTPNINRLAEDGVTLTQHIAAAPTCTPSRAAFLTGRYPLRSGMVSSR 145
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G V+ +P +E + LK GY+T LIGKWH+G N E P N GFD
Sbjct: 146 GNRVLQWTAVSGGLPESETTFAKILKNEGYATGLIGKWHLGLNCESPSDHCHHPLNHGFD 205
Query: 131 NHVG 134
G
Sbjct: 206 YFYG 209
>gi|326384383|ref|ZP_08206064.1| arylsulfatase [Gordonia neofelifaecis NRRL B-59395]
gi|326196981|gb|EGD54174.1| arylsulfatase [Gordonia neofelifaecis NRRL B-59395]
Length = 784
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 104/198 (52%), Gaps = 25/198 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++D+G G +IPTPNID +A +G L+ ++T P C+P+RAA LTG P R G +
Sbjct: 57 GYSDIGPFGA-EIPTPNIDRIAASGYRLSNYHTTPVCSPARAALLTGVNPHRAGYGSVAN 115
Query: 80 --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE-------ELLPFNRGFD 130
P G+ + LP+ L+E GY+T +GKWH+ + + + P RGFD
Sbjct: 116 SDPGFPGLRLELADDVLTLPEILRESGYATFAVGKWHLVRDADMSPGRSRKSWPLQRGFD 175
Query: 131 NHVGYWNGYLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK--- 186
++ G G + +N + D +V +D E Y Y+TD TD+++ IK
Sbjct: 176 SYYGSLEGLNSFFNPNQLIADNSV-VDVDEYPEGY-------YVTDDLTDRAIDQIKALR 227
Query: 187 SHNHSRPLFLQITHAAVH 204
+H+ +P FL H A+H
Sbjct: 228 AHDADKPFFLYFAHIAMH 245
>gi|313217411|emb|CBY38513.1| unnamed protein product [Oikopleura dioica]
Length = 449
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 75/140 (53%), Gaps = 7/140 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW DV ++ E + TPN++ + G Y+ TC+PSRAA LTG Y +R G+D P
Sbjct: 89 GWADVSWNNEF-VKTPNLERIRKQGRTFTNLYSHSTCSPSRAALLTGIYAWRLGLDGAPF 147
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+P+ +L+P K+L Y H IGKWH G + L P RGFD+ G+++G +
Sbjct: 148 NPTKVNGIPLGVELIPAKFKKLNYENHFIGKWHGGFCHQNLTPTERGFDSFYGFYSGAVN 207
Query: 142 YNDSIHETDF---AVGLDAR 158
Y HE+ + LD R
Sbjct: 208 Y--LTHESKYDAKGAALDYR 225
>gi|223985528|ref|ZP_03635584.1| hypothetical protein HOLDEFILI_02890 [Holdemania filiformis DSM
12042]
gi|223962505|gb|EEF66961.1| hypothetical protein HOLDEFILI_02890 [Holdemania filiformis DSM
12042]
Length = 470
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 96/205 (46%), Gaps = 28/205 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGID--- 78
GW D+ G + TP+ID L G+ ++ Y P C+PSRA+ L+GKYP R +
Sbjct: 16 GWMDLSCQGSSFYETPHIDQLRREGMAFDQAYAACPVCSPSRASILSGKYPARLKVTDWI 75
Query: 79 ----------TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
+ A K + V+E + + +E GY T +GKWH+G KE P + G
Sbjct: 76 DHENYHPCRGKLIDAPYIKELSVSEFSMAKAFQEAGYQTWHVGKWHLG--KEATYPEHHG 133
Query: 129 FD-NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS 187
FD N G W G+ G + +ME + +YLTD ++ +I+S
Sbjct: 134 FDVNLGGSWWGHPK-----------KGYFSPYHMENLSDGPEGEYLTDRIGAEAAALIRS 182
Query: 188 HNHSRPLFLQITHAAVHTGTAGNAK 212
+ RP FL + H AVHT A+
Sbjct: 183 RDPQRPFFLNLWHYAVHTPLQAKAE 207
>gi|224537481|ref|ZP_03678020.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520919|gb|EEF90024.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
DSM 14838]
Length = 525
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 96/184 (52%), Gaps = 14/184 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G++DV +GE TPNIDALA GI Y P +PSRA FLTG+YP R GI
Sbjct: 87 GYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAGFLTGRYPARMGIQGVF 146
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ E + + LK GY+T IGKWH+G ++E+ LP +GFD + G +
Sbjct: 147 YPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SREKYLPLQQGFDEYFG-----IP 200
Query: 142 YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHA 201
Y++ + + G +E + +++ +T +T+++V I+ +P FL + H+
Sbjct: 201 YSNDMSAQVYLRG----NEVEEFHIDINN--VTKKYTEEAVDYIR-RKADQPFFLFLAHS 253
Query: 202 AVHT 205
+H
Sbjct: 254 MMHV 257
>gi|148679746|gb|EDL11693.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_a [Mus
musculus]
Length = 462
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 98/202 (48%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 61 GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 120
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GFD
Sbjct: 121 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 179
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + LT +T +++ I++
Sbjct: 180 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 239
Query: 188 HNHSRPLFL----QITHAAVHT 205
H P FL THA V+
Sbjct: 240 HARQSPFFLYWAIDATHAPVYA 261
>gi|409196554|ref|ZP_11225217.1| sulfatase [Marinilabilia salmonicolor JCM 21150]
Length = 542
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/206 (32%), Positives = 97/206 (47%), Gaps = 39/206 (18%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G +G +I TPNIDALA G+ + + C PSRA+ LTG YP + GID
Sbjct: 41 GYSDLGCYG-GEIQTPNIDALATGGVRFTQMHNTARCCPSRASLLTGHYPHKAGID---- 95
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNK------EEL-------------- 122
G+ + + + + LKE GY T + GKWH+ K E+L
Sbjct: 96 -GMGVNLSMNTATIAEVLKENGYHTGMTGKWHLSETKPVNDPDEQLRWMAHQVNYGPFSP 154
Query: 123 ---LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
P NRGFD H G G + + D N E Y+TDF T+
Sbjct: 155 LENYPCNRGFDEHWGVIWGVVNFFDP---------FSLVHNEEPIKEVPDDFYMTDFVTE 205
Query: 180 QSVHVIKSHNH-SRPLFLQITHAAVH 204
+SV++I +++ +P FL + H A H
Sbjct: 206 KSVNLIDTYSKDDKPFFLYVAHTAPH 231
>gi|288870334|ref|ZP_06113738.2| sulfatase family protein [Clostridium hathewayi DSM 13479]
gi|288867589|gb|EFC99887.1| sulfatase family protein [Clostridium hathewayi DSM 13479]
Length = 471
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 95/203 (46%), Gaps = 34/203 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGI---- 77
GW D+ G TPNID L G+V N + + P C+PSRA+ LTGKYP R G+
Sbjct: 17 GWRDLACTGSTFYETPNIDRLCRQGMVFANSYASCPVCSPSRASCLTGKYPARLGVTDWI 76
Query: 78 ----------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR 127
+ A K +P E + Q LK+ GY T +GKWH+G E P +
Sbjct: 77 DMEGTSHPLKGKLIDAPYIKHLPEGEYTIAQALKDAGYDTWHVGKWHLG--GREFYPEHF 134
Query: 128 GFDNHVG--YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
GFD ++G W H D G + +E + +YLTD TD++V ++
Sbjct: 135 GFDVNIGGCSWG---------HPHD---GYFSPYGIETLSEGPEGEYLTDRITDEAVRLL 182
Query: 186 KSHNHS---RPLFLQITHAAVHT 205
+ +P ++ + H AVHT
Sbjct: 183 RKRQACGSRKPFYMNLCHYAVHT 205
>gi|146302379|ref|YP_001196970.1| sulfatase [Flavobacterium johnsoniae UW101]
gi|146156797|gb|ABQ07651.1| sulfatase [Flavobacterium johnsoniae UW101]
Length = 551
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/191 (34%), Positives = 94/191 (49%), Gaps = 12/191 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
G++D+G +G ++I TPN+D LA G+ L Y C P+RA+ LTG+Y + G+ D
Sbjct: 42 GYSDLGNYG-SEIKTPNLDKLASEGLRLREFYNNSICAPTRASLLTGQYQHKAGVGFFDV 100
Query: 80 PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+G + E L L + + GYST L GKWH+G + P RGFD G G
Sbjct: 101 NLGLPAYQGYLNKESLTLGEVFRSGGYSTLLSGKWHVGSEDQAQWPNQRGFDKFYGILKG 160
Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRP 193
Y D+ +T + V L RN E P+ S Y TD + +V + N ++P
Sbjct: 161 ASNYFDTKPLPFGKTPYPVKL--IRNNEELHPKDDSYYFTDEIGNNAVTFLDEQNKENKP 218
Query: 194 LFLQITHAAVH 204
FL + A H
Sbjct: 219 FFLYLAFTAPH 229
>gi|421612348|ref|ZP_16053456.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408496803|gb|EKK01354.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 482
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 93/193 (48%), Gaps = 27/193 (13%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QG+ DVG G DI TP +DA+A G+ Y P C PSRAA +TG YP R
Sbjct: 25 QGYQDVGCFGSPDIRTPRLDAMAKGGMKFTSFYAQPICGPSRAALMTGCYPMRV-----A 79
Query: 82 GAGVAKAV-PV---TEKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNH 132
G K + P+ E + + LK GY++ GKW + + + +LLP +GFD
Sbjct: 80 ERGHTKQIHPILHEDEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFPDLLPTGQGFD-- 137
Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
Y+ G T ND + + RN E P+ LT +TD+++ I+ N ++
Sbjct: 138 --YFYGTPTSNDRV--------ANLYRNEELIEPESDMATLTRRYTDEAISFIEK-NQNQ 186
Query: 193 PLFLQITHAAVHT 205
P F+ I H HT
Sbjct: 187 PFFVYIPHTMPHT 199
>gi|171910116|ref|ZP_02925586.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 480
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 87/186 (46%), Gaps = 7/186 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D+G G TP IDALA +G+ + Y+ P C+P+RAA +TGK P R GI +
Sbjct: 37 GSQDLGVEGSKFYETPAIDALAASGVRFSSFYSAHPVCSPTRAALMTGKVPQRVGITDYI 96
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY-- 139
A+P E + + GY T +GKWH+G + P GF G
Sbjct: 97 KPKSGVALPTAETTIGEAFAAQGYQTGYVGKWHLG-EADADQPAQHGFQWTAAVNRGGQP 155
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
+Y + D G D ++ P YLTD T +S+ +K + ++P FL +
Sbjct: 156 ASYYYPYRKKD---GKDTLWDVPDLEPGTEGDYLTDALTGKSLEFLKQRDTTKPFFLCFS 212
Query: 200 HAAVHT 205
H AVHT
Sbjct: 213 HYAVHT 218
>gi|302370951|ref|NP_001180574.1| N-acetylgalactosamine-6-sulfatase isoform 2 precursor [Mus
musculus]
gi|26329565|dbj|BAC28521.1| unnamed protein product [Mus musculus]
Length = 440
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 39 GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GFD
Sbjct: 99 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 157
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + LT +T +++ I++
Sbjct: 158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 217
Query: 188 HNHSRPLFL----QITHAAVH 204
H P FL THA V+
Sbjct: 218 HARQSPFFLYWAIDATHAPVY 238
>gi|355757044|gb|EHH60652.1| hypothetical protein EGM_12064 [Macaca fascicularis]
Length = 482
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 95/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G + Y+ P C+PSRAA LTG+ P R G T
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 61
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 62 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKVVGKWHLG-HRPQFHPLKHGFDEWF 120
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 121 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 180
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 181 ARHHPFFLYWAVDATHAPVYA 201
>gi|403069089|ref|ZP_10910421.1| arylsulfatase [Oceanobacillus sp. Ndiop]
Length = 513
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/190 (35%), Positives = 96/190 (50%), Gaps = 22/190 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++D+ +G +I TPN+D LA NG+ + Y C PSRA+ LTG YP + GI
Sbjct: 16 GFSDLSSYG-GEISTPNLDQLANNGLRFTQFYNSARCCPSRASLLTGLYPHQAGIGEMTE 74
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
+TP G K VT L + LKE GY T+L GKWH+G E +P RGFD+ G
Sbjct: 75 DRETPGYRGYLKNQCVT---LAEVLKEGGYHTYLSGKWHVG----ERMPTERGFDDFYGL 127
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSHNHSRPL 194
G+ ++ D + G R + Y TD TD ++ I +S + +P
Sbjct: 128 LGGFASFWDKENYVRLPEGRPERSYSD------GEFYATDAITDHALDFIEESRSDEQPY 181
Query: 195 FLQITHAAVH 204
FL +++ A H
Sbjct: 182 FLYLSYNAPH 191
>gi|60359902|dbj|BAD90170.1| mFLJ00319 protein [Mus musculus]
Length = 537
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 56 GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 115
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GFD
Sbjct: 116 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 174
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + LT +T +++ I++
Sbjct: 175 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 234
Query: 188 HNHSRPLFL----QITHAAVH 204
H P FL THA V+
Sbjct: 235 HARQSPFFLYWAIDATHAPVY 255
>gi|344338189|ref|ZP_08769122.1| sulfatase [Thiocapsa marina 5811]
gi|343802243|gb|EGV20184.1| sulfatase [Thiocapsa marina 5811]
Length = 531
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 54/119 (45%), Positives = 70/119 (58%), Gaps = 6/119 (5%)
Query: 23 GWNDVGFHGENDIP---TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW D G +G TPNID LA G+ L Y+ PTCTP+R+A LTG+ P R G+
Sbjct: 79 GWGDPGVYGGGAAIGAATPNIDRLAGEGLTLTSTYSQPTCTPTRSAILTGRLPVRTGLTR 138
Query: 80 PVGAG--VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
P+ AG +AK E LP+ L E GY+T L GKWH+G E + P + GFD + GY+
Sbjct: 139 PILAGDTLAKNPWADEISLPKLLGEAGYTTVLTGKWHVG-EAEGMRPQDIGFDEYYGYY 196
>gi|305665652|ref|YP_003861939.1| arylsulfatase [Maribacter sp. HTCC2170]
gi|88710408|gb|EAR02640.1| arylsulfatase [Maribacter sp. HTCC2170]
Length = 589
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 99/211 (46%), Gaps = 34/211 (16%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTP 80
QG+ D+G+ G + TPNID+ A I +N Y P C P+RA+ +TG+Y R GI DT
Sbjct: 42 QGYGDLGYTGNPHVKTPNIDSFASESIRMNNFYVSPVCAPTRASLMTGRYSLRTGIRDTY 101
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G + + VT + + LK+ Y T + GKWH+G N P ++GFD + + +G +
Sbjct: 102 NGGAIMASNEVT---IAEMLKQANYKTGVFGKWHLGDNYPS-RPNDQGFDESLIHLSGGM 157
Query: 141 TYNDSIHETDFAVGLDARR---------NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
DF R N ER + Y +D F + ++ I+ NH
Sbjct: 158 G-----QVGDFTTYFQKERSYFDPVLWHNGER---ESYEGYCSDIFAENAIDFIEK-NHD 208
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPD 222
+P F ++ A HT LQVPD
Sbjct: 209 QPFFCYLSFNAPHTP-----------LQVPD 228
>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
Length = 470
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 63/186 (33%), Positives = 96/186 (51%), Gaps = 17/186 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRY--GIDT 79
G+ D+ +G I TP ID + +G+ L + Y+ P C+ +R A +TG+Y +R G++
Sbjct: 47 GYADLSSYGHPTIRTPAIDKIGNDGVRLLQAYSNSPVCSATRTAIMTGQYQYRLALGLEE 106
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
P+ AG +P ++ LP LK+ GY T LIGKWH+G + P G+D+ G+
Sbjct: 107 PL-AGRDIGLPPSQTTLPSLLKQAGYETTLIGKWHLGA-YPKYGPLKSGYDHFYGFRGSA 164
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSHNHSRPLFLQI 198
L+Y + H DF E AP + Y TD D++V +I KS + RP F +
Sbjct: 165 LSYYN--HGKDF---------WEDDAPVEKAGYFTDLLGDKTVELIQKSDSCERPFFASV 213
Query: 199 THAAVH 204
A H
Sbjct: 214 HFNAPH 219
>gi|171184398|ref|NP_057931.3| N-acetylgalactosamine-6-sulfatase isoform 1 precursor [Mus
musculus]
gi|124007189|sp|Q571E4.2|GALNS_MOUSE RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|74198064|dbj|BAE35212.1| unnamed protein product [Mus musculus]
gi|148679747|gb|EDL11694.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_b [Mus
musculus]
Length = 520
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 39 GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GFD
Sbjct: 99 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 157
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + LT +T +++ I++
Sbjct: 158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 217
Query: 188 HNHSRPLFL----QITHAAVH 204
H P FL THA V+
Sbjct: 218 HARQSPFFLYWAIDATHAPVY 238
>gi|221043426|dbj|BAH13390.1| unnamed protein product [Homo sapiens]
Length = 614
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 71/124 (57%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID LA G+ L +H + + CTPSRAAFLTG+YP R G+ + +
Sbjct: 74 GIGDIGCYGNNTMRTPNIDRLAEAGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSSI 133
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G G + +P E + LK GY+T LIGKWH+G N E P + GFD
Sbjct: 134 GYRVLQWTGASGGLPTNETTFAKILKGKGYATGLIGKWHLGLNCESASDHCHHPLHHGFD 193
Query: 131 NHVG 134
+ G
Sbjct: 194 HFYG 197
>gi|32471071|ref|NP_864064.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
gi|32396773|emb|CAD71738.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
Length = 490
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 93/193 (48%), Gaps = 27/193 (13%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QG+ DVG G DI TP +DA+A G+ Y P C PSRAA +TG YP R
Sbjct: 33 QGYEDVGCFGSPDIRTPRLDAMAKGGMKFTSFYAQPICGPSRAALMTGCYPMRV-----A 87
Query: 82 GAGVAKAV-PV---TEKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNH 132
G K + P+ E + + LK GY++ GKW + + + +LLP +GFD
Sbjct: 88 ERGHTKQIHPILHEDEVTIAEVLKTKGYASACFGKWDLAKHAQSGFFSDLLPTGQGFD-- 145
Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
Y+ G T ND + + RN E P+ LT +TD+++ I+ N ++
Sbjct: 146 --YFYGTPTSNDRV--------ANLYRNEELIEPESDMATLTRRYTDEAISFIEK-NQNQ 194
Query: 193 PLFLQITHAAVHT 205
P F+ I H HT
Sbjct: 195 PFFVYIPHTMPHT 207
>gi|431796258|ref|YP_007223162.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430787023|gb|AGA77152.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 603
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 73/244 (29%), Positives = 113/244 (46%), Gaps = 31/244 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTP 80
QG+ D GF G + TP +D LA + ++ Y P C P+RA+ +TG+Y R GI DT
Sbjct: 50 QGYGDFGFTGNPHVQTPVLDGLAEESMFFDQFYVSPVCAPTRASLMTGRYSLRTGIRDTY 109
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G + VT + + LK+ GY T + GKWH+G N P ++GFD V + +G +
Sbjct: 110 NGGAIMATEEVT---IAEMLKDAGYRTGIFGKWHLGDNYPS-RPMDQGFDESVIHLSGGM 165
Query: 141 TYNDSIHETDFAVGLDARRN--MERYAPQMSSK-YLTDFFTDQSVHVIKSHN---HSRPL 194
I T + G + + + Q S K Y TD FT +++ + H+ +P
Sbjct: 166 GQVGDI--TTYYQGDSSYFDPVLWHNGQQESYKGYCTDIFTQEAIAFVSDHDGGEKRQPF 223
Query: 195 FLQITHAAVHT----------------GTAG--NAKLPTGLLQVPDMEENDRTFAHISNP 236
F+ ++ A HT T+G A +P+ + D E R +A + N
Sbjct: 224 FVYLSLNAPHTPLQVPDEYYQKYKDIDPTSGLDEAMMPSQEMTESDKEHARRVYAMVENI 283
Query: 237 DRRL 240
D L
Sbjct: 284 DDNL 287
>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
Length = 466
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/231 (30%), Positives = 108/231 (46%), Gaps = 32/231 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D G +G DIPTP+IDALA GI N + T C PSRA L G+Y R G + V
Sbjct: 38 GYEDFGCYGSQDIPTPHIDALAKGGIRFTNSYVTASVCAPSRAGLLMGQYQQRSGFEHNV 97
Query: 82 GAGVAKAVPV-------TEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
A + T + + ++ GY T IGKWH G N+ + P ++GF++ G
Sbjct: 98 SDLPADGYQIQDIGLSDTVRTIADQMQSNGYETMAIGKWHQG-NETKHHPLHKGFNHFFG 156
Query: 135 YWNGYLTY---NDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
+ G+ ++ +I + + + N + YLTD FTD+++ ++
Sbjct: 157 FIGGHRSFFPIRTAIKQEEKIL------NDYTEVDEKDVYYLTDMFTDKAISYMR-QKRD 209
Query: 192 RPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+P F+ +++ AVHT P L Q FA + N RR +A
Sbjct: 210 KPYFIYLSYNAVHTPVEAT---PQKLAQ----------FARLKNAHRRSYA 247
>gi|149198313|ref|ZP_01875359.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
gi|149138609|gb|EDM27016.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
Length = 476
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 71/201 (35%), Positives = 96/201 (47%), Gaps = 29/201 (14%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ D+ G + TP ID +A G L Y P CTPSRAA +TG YP R ID
Sbjct: 42 QGYADLSCFGGTHVSTPRIDQMAAEGAKLTSFYVAAPVCTPSRAALMTGTYPKR--IDMA 99
Query: 81 VG-------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G AG K + E + + LK +GY T + GKWH+G ++ E LP +GFD
Sbjct: 100 RGSNFVVLLAGDKKGLNPKEITIAEVLKAVGYKTGMFGKWHLG-DQPEFLPTRQGFDEFF 158
Query: 134 GYWNGYLTYNDSIH-----ETDFAVG----LDARRNMERYAPQMSSKYLTDFFTDQSVHV 184
G L Y+ IH ++ F LD +E + YLT FT+++V
Sbjct: 159 G-----LPYSHDIHPYHPQQSHFKFPSLPLLDGEEVIEM---DPDADYLTKRFTERAVQF 210
Query: 185 IKSHNHSRPLFLQITHAAVHT 205
I+ N +P FL + H HT
Sbjct: 211 IEK-NKDQPFFLYMPHPIPHT 230
>gi|408674712|ref|YP_006874460.1| sulfatase [Emticicia oligotrophica DSM 17448]
gi|387856336|gb|AFK04433.1| sulfatase [Emticicia oligotrophica DSM 17448]
Length = 518
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 107/225 (47%), Gaps = 30/225 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G +G ++I TPN+D LA NG+ L Y C P+RA+ LTGKY G+ V
Sbjct: 43 GFSDIGCYG-SEISTPNLDKLAANGLKLRNFYNAGRCCPTRASLLTGKYSHAVGMGNMVS 101
Query: 83 AGVAK------------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
K +VP + + LK++GY T++ GKWH+G + P RGF+
Sbjct: 102 FEDQKVPKDNYQGYLEPSVPT----IAEDLKKVGYHTYMTGKWHVG-ESPDYWPLKRGFE 156
Query: 131 NHVGYWNGYLTYNDSIHE--TDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
+ G +G +Y + + E F V D + Y Y TD FTD+++ ++S
Sbjct: 157 RYFGLISGASSYFEVLQEKRKRFVVQDD-----KEYVLPKDGYYATDAFTDKAIEFLESS 211
Query: 189 N-HSRPLFLQITHAA----VHTGTAGNAKLPTGLLQVPDMEENDR 228
+ + P FL + + A +H AK LQ D DR
Sbjct: 212 DKQNNPFFLYLAYTAPHFPLHAYEEDIAKYENFYLQGWDKTRTDR 256
>gi|340368306|ref|XP_003382693.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
Length = 230
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 92/189 (48%), Gaps = 22/189 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPF---RYGIDT 79
G+ DV F I +PN LA G++L+RHY C+P+R +FLTG++P +Y I
Sbjct: 36 GFADVSFRNPA-IKSPNFQKLAETGLILDRHYVYRYCSPTRVSFLTGRWPHHAHQYNIKP 94
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG- 138
G + +LP LK +GY TH++GKWH G + + LP NRGFD G+ +G
Sbjct: 95 NFQIGTN----INMTMLPAKLKTVGYKTHMVGKWHQGFFQPKFLPINRGFDTSSGFLSGA 150
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
+ Y D +D RN + Y + + Y + D + +H +P+F
Sbjct: 151 EDHFTQYRD--------CAIDYWRN-DTYDTR-NGTYDAYTYKDDLTKIFDAHETQKPMF 200
Query: 196 LQITHAAVH 204
L + VH
Sbjct: 201 LYLPLHNVH 209
>gi|119619126|gb|EAW98720.1| arylsulfatase E (chondrodysplasia punctata 1), isoform CRA_d [Homo
sapiens]
Length = 599
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 73/125 (58%), Gaps = 13/125 (10%)
Query: 23 GWNDVGFHGENDI-PTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTP 80
G D+G +G N + TPNID LA +G+ L +H + + CTPSRAAFLTG+YP R G+ +
Sbjct: 58 GIGDIGCYGNNTMRQTPNIDRLAEDGVKLTQHISAASLCTPSRAAFLTGRYPVRSGMVSS 117
Query: 81 VG------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGF 129
+G G + +P E + LKE GY+T LIGKWH+G N E P + GF
Sbjct: 118 IGYRVLQWTGASGGLPTNETTFAKILKEKGYATGLIGKWHLGLNCESASDHCHHPLHHGF 177
Query: 130 DNHVG 134
D+ G
Sbjct: 178 DHFYG 182
>gi|281349830|gb|EFB25414.1| hypothetical protein PANDA_009654 [Ailuropoda melanoleuca]
Length = 581
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 72/125 (57%), Gaps = 13/125 (10%)
Query: 23 GWNDVGFHGENDI-PTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTP 80
G D+G +G N I TPNID LA +G++L +H + CTPSRAAFLTG+YP R G+ +
Sbjct: 41 GIGDIGCYGNNSIRQTPNIDRLAEDGVMLTQHVAAASVCTPSRAAFLTGRYPLRSGMVSS 100
Query: 81 VG------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGF 129
G GV +P E + LK+ GY+T LIGKWH+G N + P N GF
Sbjct: 101 NGYRVLQWTGVPGGLPTNETTFAKILKDRGYATGLIGKWHLGLNCDSSSDHCHHPLNHGF 160
Query: 130 DNHVG 134
D+ G
Sbjct: 161 DHFYG 165
>gi|149177363|ref|ZP_01855968.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148843888|gb|EDL58246.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 466
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 100/209 (47%), Gaps = 21/209 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGI---- 77
G+ D+G +G + TP +D LA G+ L YT PTCT SRA LTG+YP R G+
Sbjct: 46 GYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGRYPQRIGLNHQL 105
Query: 78 --DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
D G G+ K +E L+P+YLK+ GY T GKW++G + P RGFD G+
Sbjct: 106 SADENYGDGLRK----SEVLIPEYLKQQGYRTACFGKWNVGFSPGS-RPTERGFDEFFGF 160
Query: 136 WNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLF 195
G + Y + +A D R ++ Y TD F D + I S +P F
Sbjct: 161 AAGNIDY----YHHYYAGRHDLWRGLKEV---FVEGYSTDLFADAACQYI-SAESDQPFF 212
Query: 196 LQITHAAVHTGTAGNAKLPTG-LLQVPDM 223
+ + A H + N + G Q PD+
Sbjct: 213 IYLPFNAPHFPSQRNKQPGQGNEWQAPDL 241
>gi|417301290|ref|ZP_12088451.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327542405|gb|EGF28888.1| N-acetylgalactosamine-6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 482
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 91/189 (48%), Gaps = 19/189 (10%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QG+ DVG G DI TP +DA+A G+ Y P C PSRAA +TG YP R +
Sbjct: 25 QGYQDVGCFGSPDIRTPRLDAMAKGGMKFTSFYAQPICGPSRAALMTGCYPMRVAERGHI 84
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE-----ELLPFNRGFDNHVGYW 136
+ + E + + LK GY++ GKW + + + +LLP +GFD Y+
Sbjct: 85 KQ-IHPILHEDEVTIAEVLKTNGYASACFGKWDLAKHAQSGFFPDLLPTGQGFD----YF 139
Query: 137 NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
G T ND + + RN E P+ LT +TD+++ I+ N ++P F+
Sbjct: 140 YGTPTSNDRV--------ANLYRNEELIEPESDMATLTRRYTDEAISFIEK-NQNQPFFV 190
Query: 197 QITHAAVHT 205
I H HT
Sbjct: 191 YIPHTMPHT 199
>gi|421612351|ref|ZP_16053459.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408496806|gb|EKK01357.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 474
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 13/139 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QGW DVGF+G + TPN+DA+A G+ +R Y P C+P+R + LTG+YPFR+GI
Sbjct: 43 QGWGDVGFNGNEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAA 102
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL--------PFNRGFDNH 132
G+ V E + + L++ GY+T + GKWHIG K + + P + GFD +
Sbjct: 103 HTGGMR----VGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEY 158
Query: 133 VGYWNGYLTYNDSIHETDF 151
+ T++ +I D+
Sbjct: 159 FATTSAVPTWDPTITPQDW 177
>gi|114326198|ref|NP_001041585.1| N-acetylgalactosamine-6-sulfatase precursor [Canis lupus
familiaris]
gi|122138594|sp|Q32KH5.1|GALNS_CANFA RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|81158068|tpe|CAI85008.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Canis lupus
familiaris]
Length = 522
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 96/201 (47%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 41 GWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 100
Query: 81 -------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P E +LP+ LKE GY + ++GKWH+G ++ + P GFD
Sbjct: 101 RHARNAYTPQEIVGGIPDQEHVLPELLKEAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 159
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 160 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQVYLQEALDFIKRQ 219
Query: 189 NHS-RPLFL----QITHAAVH 204
+ RP FL THA V+
Sbjct: 220 QAAQRPFFLYWAIDATHAPVY 240
>gi|32471068|ref|NP_864061.1| N-acetylgalactosamine-6-sulfatase [Rhodopirellula baltica SH 1]
gi|32396770|emb|CAD71735.1| N-acetylgalactosamine-6-sulfatase [Rhodopirellula baltica SH 1]
Length = 474
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 13/139 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QGW DVGF+G + TPN+DA+A G+ +R Y P C+P+R + LTG+YPFR+GI
Sbjct: 43 QGWGDVGFNGNEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAA 102
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL--------PFNRGFDNH 132
G+ V E + + L++ GY+T + GKWHIG K + + P + GFD +
Sbjct: 103 HTGGMR----VGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEY 158
Query: 133 VGYWNGYLTYNDSIHETDF 151
+ T++ +I D+
Sbjct: 159 FATTSAVPTWDPTITPQDW 177
>gi|417301293|ref|ZP_12088454.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327542408|gb|EGF28891.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 474
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 13/139 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QGW DVGF+G + TPN+DA+A G+ +R Y P C+P+R + LTG+YPFR+GI
Sbjct: 43 QGWGDVGFNGNEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAA 102
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL--------PFNRGFDNH 132
G+ V E + + L++ GY+T + GKWHIG K + + P + GFD +
Sbjct: 103 HTGGMR----VGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEY 158
Query: 133 VGYWNGYLTYNDSIHETDF 151
+ T++ +I D+
Sbjct: 159 FATTSAVPTWDPTITPQDW 177
>gi|313226814|emb|CBY21959.1| unnamed protein product [Oikopleura dioica]
Length = 582
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 73/125 (58%), Gaps = 13/125 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI---D 78
G DVGF+G + I T NID LA +G++L++H CTPSRAAFLTG+ P RYGI
Sbjct: 32 GSGDVGFNGNSTIGTTNIDQLAEDGVILDQHLAPASVCTPSRAAFLTGRLPIRYGIAANG 91
Query: 79 TPVGAGVAKA----VPVTEKLLPQYLKELGYSTHLIGKWHIGCN-----KEELLPFNRGF 129
T V + A +P +E + L++ GY T L+GKWH+G N + PFN GF
Sbjct: 92 TRVRVNIWNATPNGLPRSELTFAKVLQKEGYKTALVGKWHLGMNHNNNHDQNYHPFNHGF 151
Query: 130 DNHVG 134
D+ G
Sbjct: 152 DSWFG 156
>gi|440717773|ref|ZP_20898250.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436437075|gb|ELP30749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 474
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 78/139 (56%), Gaps = 13/139 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QGW DVGF+G + TPN+DA+A G+ +R Y P C+P+R + LTG+YPFR+GI
Sbjct: 43 QGWGDVGFNGNEVVQTPNLDAMASAGVRFDRFYAAAPLCSPTRGSCLTGRYPFRFGILAA 102
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL--------PFNRGFDNH 132
G+ V E + + L++ GY+T + GKWHIG K + + P + GFD +
Sbjct: 103 HTGGMR----VGEITIAEMLQKRGYATGMFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEY 158
Query: 133 VGYWNGYLTYNDSIHETDF 151
+ T++ +I D+
Sbjct: 159 FATTSAVPTWDPTITPQDW 177
>gi|395856891|ref|XP_003800850.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Otolemur garnettii]
Length = 526
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 95/205 (46%), Gaps = 20/205 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G + Y+ P C+PSRAA LTG+ P R G T
Sbjct: 45 GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 104
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LKE GY + ++GKWH+G ++ + P GFD
Sbjct: 105 AHARNAYTPQEIVGGIPRSEHLLPELLKEAGYISKIVGKWHLG-HRPQFHPLKHGFDEWF 163
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 164 GSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNLTQIYLQEALDFIKRQ 223
Query: 189 N-HSRPLFL----QITHAAVHTGTA 208
P FL THA V+ A
Sbjct: 224 QAQQHPFFLYWAIDATHAPVYASKA 248
>gi|45383412|ref|NP_989703.1| arylsulfatase H precursor [Gallus gallus]
gi|33330173|gb|AAQ10453.1| arylsulfatase [Gallus gallus]
Length = 590
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 56/124 (45%), Positives = 69/124 (55%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID--- 78
G DVG +G + I TPNID LA G+ L +H T P CTPSRAAFLTG+YP R G+D
Sbjct: 46 GIGDVGCYGNDTIRTPNIDRLAREGVKLTQHITAAPLCTPSRAAFLTGRYPIRSGMDAVN 105
Query: 79 ---TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
G + +P E + L++ GYST LIGKWH+G N E P N GF+
Sbjct: 106 NYRVIFWNGGSGGLPPNETTFAKILQQQGYSTGLIGKWHLGVNCEHRNDHCHHPLNHGFE 165
Query: 131 NHVG 134
G
Sbjct: 166 YFYG 169
>gi|325109298|ref|YP_004270366.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969566|gb|ADY60344.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 463
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 93/191 (48%), Gaps = 18/191 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYG----I 77
G+ D+ +G D+ +PNID L G+ Y P C+P+RAA L+GKYP R G I
Sbjct: 45 GYGDLSCYGATDLQSPNIDKLVSRGLKFTNFYANCPVCSPTRAAILSGKYPDRVGVPGVI 104
Query: 78 DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
T P E LLP L+ GY + +IGKWH+G P +RGFD+ GY
Sbjct: 105 RTHADNSWGYLAPEAE-LLPSLLQPAGYHSAIIGKWHLGLEAPN-RPNDRGFDHFKGYLG 162
Query: 138 GYLT--YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH-NHSRPL 194
+ Y+ H ++ R N + P+ + TD FT+ S +K ++ +P
Sbjct: 163 DMMDDYYDHRRHGINY-----MRENEQEIDPE---GHATDLFTEWSCDYLKEQADNEQPF 214
Query: 195 FLQITHAAVHT 205
FL + + A HT
Sbjct: 215 FLYLAYNAPHT 225
>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 489
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 100/198 (50%), Gaps = 25/198 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG-IDTP 80
G+ D+GF G + TPN+D LA + +R Y+ PTC PSR + +TGKYP R G +
Sbjct: 50 GFADLGFTGSDTHLTPNLDKLAKESVYFDRAYSSHPTCAPSRMSIMTGKYPARLGAVSHG 109
Query: 81 VGAGVA------KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
GVA +P+TE + + LK+ GY+T IGKWHIG K E P RGFD +
Sbjct: 110 KLGGVAHPGPNDNGLPMTETTIGEALKKEGYTTAHIGKWHIG--KGENNPGTRGFDVDIA 167
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYA---PQMSSK----YLTDFFTDQSVHVIKS 187
N+ + ++ +R A P + + +LTD +++V I S
Sbjct: 168 -------SNEFCCPGSYMYPFESNNEKQRVASKIPDLEDRKPGDFLTDALAEEAVKFIHS 220
Query: 188 HNHSRPLFLQITHAAVHT 205
+ +P FL ++ AVHT
Sbjct: 221 TD-EKPFFLNMSFYAVHT 237
>gi|87306992|ref|ZP_01089138.1| arylsulfatase [Blastopirellula marina DSM 3645]
gi|87290365|gb|EAQ82253.1| arylsulfatase [Blastopirellula marina DSM 3645]
Length = 710
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 94/191 (49%), Gaps = 17/191 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++D+G +G +IPTPNIDALA G + Y C PSRA+ +TG YP + GI
Sbjct: 40 GYSDLGCYG-GEIPTPNIDALAKRGARFTQVYNSARCCPSRASLMTGLYPTQAGIGDFTT 98
Query: 78 DTPV---GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
D P G G + L + LK GY + +GKWH+ E P RGFD G
Sbjct: 99 DRPSPDRGPGYLGRLNEQCVTLAEVLKPAGYGCYYVGKWHM---HPETGPIRRGFDEFYG 155
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RP 193
Y ++ ++ ++ + L A R E PQ Y TD F D ++ I+ S +P
Sbjct: 156 Y---ARDHSHDQYDAEYYIRLPAGREKEIDPPQ-QDYYATDVFNDYALEFIRQGQQSDKP 211
Query: 194 LFLQITHAAVH 204
FL + H++ H
Sbjct: 212 WFLFLGHSSPH 222
>gi|313242390|emb|CBY34540.1| unnamed protein product [Oikopleura dioica]
Length = 582
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 73/125 (58%), Gaps = 13/125 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI---D 78
G DVGF+G + I T NID LA +G++L++H CTPSRAAFLTG+ P RYGI
Sbjct: 32 GSGDVGFNGNSTIGTTNIDQLAEDGVILDQHLAPASVCTPSRAAFLTGRLPIRYGIAANG 91
Query: 79 TPVGAGVAKA----VPVTEKLLPQYLKELGYSTHLIGKWHIGCN-----KEELLPFNRGF 129
T V + A +P +E + L++ GY T L+GKWH+G N + PFN GF
Sbjct: 92 TRVRVNIWNATPNGLPRSELTFAKVLQKEGYKTALVGKWHLGMNHNNNHDQNYHPFNHGF 151
Query: 130 DNHVG 134
D+ G
Sbjct: 152 DSWFG 156
>gi|149199736|ref|ZP_01876767.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149137141|gb|EDM25563.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 585
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 67/118 (56%), Gaps = 3/118 (2%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW D+ +G DI TPNID+LA++G + Y P C+P+RA LTG+Y FR G+ +
Sbjct: 32 QGWGDLSINGNKDISTPNIDSLAHDGALFENFYVQPVCSPTRAELLTGRYAFRSGVRSTS 91
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
G + + E+ + K+ GY+T GKWH G + P RGFD G+ +G+
Sbjct: 92 EGG--ERFNLDEQTIADVFKKAGYATGAFGKWHSGM-QYPYHPNGRGFDEFYGFCSGH 146
>gi|304309759|ref|YP_003809357.1| sulfatase [gamma proteobacterium HdN1]
gi|301795492|emb|CBL43690.1| probable sulfatase precursor [gamma proteobacterium HdN1]
Length = 661
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 67/193 (34%), Positives = 96/193 (49%), Gaps = 21/193 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G ND+ G+ TPN+D + + L R Y TC+PSRA+ LTG+YP R G P+
Sbjct: 76 GVNDIASWGDGSAQTPNLDKFSSESVRLRRDYGDSTCSPSRASLLTGQYPARVGF-LPIA 134
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE--ELLPFNRGFDNHVGYWNGYL 140
G++ +P LP LK LGYST +GKWH+G E E+ P GFD YW G+L
Sbjct: 135 LGLSPDLPT----LPGSLKSLGYSTFHVGKWHLGEALEYPEIQPSYHGFD----YWMGFL 186
Query: 141 TYNDSIHETDFAVGLDARRN-------MERYAPQMSSK-YLTDFFTDQSVHVIKSHNHSR 192
+ + D L +R E AP + YL D D++V +I+ + +
Sbjct: 187 NHF-VLQGPDETGKLVSRVPTHINPWLQENGAPPVQRMGYLDDLLVDKAVELIE-NTGEK 244
Query: 193 PLFLQITHAAVHT 205
P F+ + + HT
Sbjct: 245 PWFINLWLYSPHT 257
>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
Length = 553
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 73/227 (32%), Positives = 100/227 (44%), Gaps = 46/227 (20%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGID-TP 80
G+ND+ G +PTPNID++A G Y+ C PSRA +TG+Y R G + TP
Sbjct: 82 GFNDISHFGGGIVPTPNIDSIARGGANFTSAYSGTAACAPSRAMIMTGRYGTRTGFEFTP 141
Query: 81 VGAGV------------------------AKAVPVTEKLLP-------QYLKELGYSTHL 109
G+ AKA P E+ LP + LK GY
Sbjct: 142 TPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFREQGLPGSEITLAEALKPKGYHNIH 201
Query: 110 IGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMS 169
IGKWH+G N E LP +GFD V +G DS + + D Q +
Sbjct: 202 IGKWHLG-NAPEFLPNAQGFDESVMLESGLFLPEDSPDVVNAKLPFDPIDQFLWARMQYA 260
Query: 170 SK-----------YLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
+ YLTDF+TD+++ I++ N +RP FL + H VHT
Sbjct: 261 TSYNGSAWFEPKGYLTDFYTDEAIKAIEA-NRNRPFFLYLAHWGVHT 306
>gi|149175125|ref|ZP_01853748.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
8797]
gi|148846103|gb|EDL60443.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
8797]
Length = 413
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 93/189 (49%), Gaps = 14/189 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+ +G + TP++D LA NGI + H + C+P+RA LTG+Y R GID V
Sbjct: 6 GYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAGIDGVV 65
Query: 82 GAGVAK----AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
A K + E L Q L++ GY T + GKWH+G + + P RGF VGY +
Sbjct: 66 YANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGYQR-QYNPTFRGFQQFVGYVS 124
Query: 138 GYLTYNDSIHETD-FAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
G + Y + T F +A N E Y+T D ++ I+ +P F+
Sbjct: 125 GNVDYFAHLDGTGVFDWWHNAELNREEQG------YVTHLINDHALEFIRQQ-QEKPFFV 177
Query: 197 QITHAAVHT 205
I H AVH+
Sbjct: 178 YIAHEAVHS 186
>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 616
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 66/205 (32%), Positives = 93/205 (45%), Gaps = 33/205 (16%)
Query: 23 GWNDVGFHGENDIP-TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G ND+G+ + TP +D+L+ G+ L ++YT CTPSRA+ +TG+ FR G+ V
Sbjct: 131 GTNDIGYQSTDLWELTPFMDSLSSEGVRLTKYYTNQLCTPSRASLMTGRDTFRTGMQYEV 190
Query: 82 GAGV-AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
A +P+ E L + K GKWH+G + PF RGFD +GY
Sbjct: 191 VEDSGAWGLPLEEVTLAERFK--------TGKWHLGMYSDAHYPFARGFDTFLGYMGAVR 242
Query: 141 TYNDSIHE---------------TDFAVG-----LDARRNMERYAPQMSSKYLTDFFTDQ 180
Y S HE DF G ++ N R P Y T TD+
Sbjct: 243 GY--SSHEGCNTPTFEGGEYSCFKDFGYGDKDGYINHITNTTRQGPSFVGNYSTTIITDR 300
Query: 181 SVHVIKSHNHSRPLFLQITHAAVHT 205
++ V K H P FL ++H AVH+
Sbjct: 301 AIEVAKEHGED-PFFLYVSHQAVHS 324
>gi|321446094|gb|EFX60811.1| hypothetical protein DAPPUDRAFT_17868 [Daphnia pulex]
Length = 125
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 52/115 (45%), Positives = 68/115 (59%), Gaps = 4/115 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNR-HYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+ +G I TP++DALA G+ H CTPSRA LTG+YP R G+D P+
Sbjct: 12 GWGDLACYGGTAIKTPHLDALAGRGVRFTESHACDSVCTPSRAGLLTGRYPKRMGLDFPL 71
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELLPFNRGFDNHVG 134
AG A + E LPQ LK GY T ++GKWH+G + L P N GFD+++G
Sbjct: 72 NAG-ATGLNAFETTLPQALKLRGYHTAMVGKWHLGDYTKDKGLNPTNFGFDSYLG 125
>gi|146275662|ref|YP_001165822.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
gi|145322353|gb|ABP64296.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
Length = 462
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 68/214 (31%), Positives = 101/214 (47%), Gaps = 27/214 (12%)
Query: 11 KAVPVTEKLLPQ------------GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LP 57
+A+ VT K P+ G+ D G I TP ID++ G++L + Y+ P
Sbjct: 22 QALAVTRKAAPERPNIVFIMADDLGYADTSATGSRHIRTPAIDSIGAGGVMLRQGYSSTP 81
Query: 58 TCTPSRAAFLTGKYPFRY--GIDTPVG--AGVAKAVPVTEKLLPQYLKELGYSTHLIGKW 113
C+P+R A LTG Y R+ G++ P+G A VP+ + +K LGY T L+GKW
Sbjct: 82 ICSPTRTALLTGCYAQRFAIGVEEPLGPNAPAGIGVPLDRPTIASVMKALGYRTSLVGKW 141
Query: 114 HIGCNKEELLPFNRGFDNHVGYWNG---YLTYNDSIHETDFAVGLDARRNMERYAPQMSS 170
H+G P G+D+ +G G Y + + VGL E A +
Sbjct: 142 HLG-EPPAHGPLKHGYDHFLGIVEGGADYFVHRMVMSGKPAGVGL-----AEDDAQTDRT 195
Query: 171 KYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
YLTD F D++V VI+ ++P FL + A H
Sbjct: 196 GYLTDIFGDEAVRVIE-EGGNQPFFLSLHFTAPH 228
>gi|453364754|dbj|GAC79720.1| putative arylsulfatase [Gordonia malaquae NBRC 108250]
Length = 783
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 102/197 (51%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++D+G G +IPTPNID +A G L+ ++T P C+P+RAA LTG P R G +
Sbjct: 56 GYSDIGPFGA-EIPTPNIDRIAATGYRLSNYHTTPVCSPARAALLTGVNPHRAGYGSVAN 114
Query: 80 --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKE-------ELLPFNRGFD 130
P G+ + LP+ L+E GY+T +GKWH+ + + + P RGFD
Sbjct: 115 SDPGFPGLRLELADDVLTLPEILRESGYATFAVGKWHLVRDADMSPGRSRKSWPLQRGFD 174
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQS---VHVIKS 187
++ G G +S + + ++ +++ Y Y+TD TD++ V +++
Sbjct: 175 SYYGSLEGL----NSFFHPNQLIADNSVVDVDEYP---EGYYVTDDLTDRAIGQVKALRA 227
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL H A+H
Sbjct: 228 HDADKPFFLYFAHIAMH 244
>gi|453077746|ref|ZP_21980484.1| arylsulfatase [Rhodococcus triatomae BKS 15-14]
gi|452758328|gb|EME16720.1| arylsulfatase [Rhodococcus triatomae BKS 15-14]
Length = 769
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 101/198 (51%), Gaps = 25/198 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
G++D+G G ++I TPN+D LA +G+ L ++T P C+PSRAA LTG P R G
Sbjct: 50 GYSDIGPFG-SEIETPNLDRLAASGVRLTNYHTTPLCSPSRAALLTGVNPHRAGYGFVAN 108
Query: 79 -TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI--------GCNKEELLPFNRGF 129
P G+ + + LP+ L+ GY+T+ +GKWH+ G ++ P RGF
Sbjct: 109 ADPGFPGLRLELSDDTQTLPEILRAGGYATYAVGKWHLVRDANIRPGSGRDS-WPTQRGF 167
Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK--- 186
D + G G +S + V ++ +++ Y YLTD TD++V +K
Sbjct: 168 DRYYGSLEGL----NSFFHPNQLVSDNSAVDVDEYP---EGYYLTDDLTDKAVTYLKDLR 220
Query: 187 SHNHSRPLFLQITHAAVH 204
+H +P FL H A+H
Sbjct: 221 AHEPDKPFFLYFAHVAMH 238
>gi|323454250|gb|EGB10120.1| hypothetical protein AURANDRAFT_62683 [Aureococcus anophagefferens]
Length = 555
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 78/144 (54%), Gaps = 21/144 (14%)
Query: 23 GWNDVGFHGENDIP----TPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID 78
GW+D+ D+P +P I LA G+ L +Y CTP+RAA LTGK+ R G
Sbjct: 13 GWDDL--WESRDLPPAVVSPTIFRLAKEGVKLTSYYGQSYCTPARAALLTGKFVHRLGFA 70
Query: 79 TP---------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
+P V +VP+ +LLP +L+ LGY+TH +GKW++G E LP+ RGF
Sbjct: 71 SPEADYWGPLEVVGDANYSVPLGHELLPAHLRNLGYATHGVGKWNVGHCATEYLPWKRGF 130
Query: 130 DNHVGYWNGYLTYNDSIHETDFAV 153
D +GY ++D IH T AV
Sbjct: 131 DTFLGY------FSDGIHYTTHAV 148
>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 104/218 (47%), Gaps = 37/218 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++DVGF + I TPNID +A+ L+G+Y +GI +
Sbjct: 46 GFDDVGFK-SHQIKTPNID---------------------QASILSGRYAMHHGIVNWIP 83
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ +P+ LPQ LK GY TH IGKWH+G K + P RGF++ +GY++G Y
Sbjct: 84 PKDSYGLPLNHTTLPQLLKNGGYDTHAIGKWHLGFYKWDYTPTFRGFNSFLGYYSGGENY 143
Query: 143 NDSIHETDFAVGLD----ARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
+ + + D +N + A + +Y T F+D++V VI H +PLFL +
Sbjct: 144 FTHKNGPAYDMHRDPLPSCGQNCSQIAFDLQGQYSTTIFSDEAVRVIDDHIGPKPLFLYL 203
Query: 199 THAAVHTGTAGNAKLPTGLLQ-----VPDMEENDRTFA 231
+ AVH A+ P + +PD + RTFA
Sbjct: 204 AYQAVHE----PAQAPQSYIDPYTDLIPDAQR--RTFA 235
>gi|114145565|ref|NP_001041316.1| N-acetylgalactosamine-6-sulfatase precursor [Rattus norvegicus]
gi|123779981|sp|Q32KJ6.1|GALNS_RAT RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|81158026|tpe|CAI84987.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus
norvegicus]
Length = 524
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 43 GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 102
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GFD
Sbjct: 103 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 161
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + + LT + +++ I++
Sbjct: 162 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQ 221
Query: 188 HNHSRPLFL----QITHAAVH 204
H P FL THA V+
Sbjct: 222 HARQSPFFLYWAIDATHAPVY 242
>gi|301623486|ref|XP_002941046.1| PREDICTED: arylsulfatase D-like [Xenopus (Silurana) tropicalis]
Length = 569
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 82/165 (49%), Gaps = 15/165 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G +VG +G N + TPNID LA G+ L H + CTPSRAAFLTG+YP R G+
Sbjct: 35 GIGEVGCYGNNTLRTPNIDRLAREGVRLTHHIAAASLCTPSRAAFLTGRYPIRSGMTGHE 94
Query: 82 G-------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN---KEELL--PFNRGF 129
G + V+ +P E + L+E GY+T +IGKWH+G N K + P N GF
Sbjct: 95 GGYLVLMWSAVSGGLPTNETTFAKILQEQGYTTGIIGKWHLGVNCRSKNDFCYHPLNHGF 154
Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLT 174
D G Y ND + + R ++ YA + LT
Sbjct: 155 DYFYGL--TYTLINDCEESMPSEIHVPFRAKLQFYAQLFAMTLLT 197
>gi|399029424|ref|ZP_10730306.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
gi|398072706|gb|EJL63910.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
Length = 546
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 65/191 (34%), Positives = 94/191 (49%), Gaps = 12/191 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
G++D+G +G ++I TPN+D LA G L Y C P+RA+ LTG+Y + G+ D
Sbjct: 38 GYSDLGNYG-SEIKTPNLDKLAAEGTRLREFYNNSICAPTRASLLTGQYQHKAGVGYFDV 96
Query: 80 PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+G + E L L + + GYST L GKWH+G + P RGFD G G
Sbjct: 97 NLGLPAYQGYLNKESLTLGEVFRSGGYSTILSGKWHVGSEDKSQWPNQRGFDKFYGILKG 156
Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRP 193
Y D+ +T + V L RN E P+ S Y TD + +V ++ N ++P
Sbjct: 157 ASNYFDTKPLPFGKTPYPVSL--IRNNEVLHPKDDSYYFTDEIGNNAVTFLEEQNKENKP 214
Query: 194 LFLQITHAAVH 204
FL + A H
Sbjct: 215 FFLYLAFTAPH 225
>gi|301788958|ref|XP_002929896.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like, partial
[Ailuropoda melanoleuca]
Length = 519
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 93/202 (46%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 38 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 97
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P E LLP+ LK GY++ ++GKWH+G ++ + P GFD
Sbjct: 98 GHARNAYTPQEIVGGIPDGEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLKHGFDEWF 156
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVIKSH 188
G N + D+ + V D Y Q LT + +++ +K
Sbjct: 157 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLQTGEANLTQVYLQEALDFMKRQ 216
Query: 189 N-HSRPLFLQI----THAAVHT 205
RP FL THA V+
Sbjct: 217 QVAQRPFFLYWAIDGTHAPVYA 238
>gi|281346853|gb|EFB22437.1| hypothetical protein PANDA_020197 [Ailuropoda melanoleuca]
Length = 520
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 93/202 (46%), Gaps = 20/202 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 37 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 96
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P E LLP+ LK GY++ ++GKWH+G ++ + P GFD
Sbjct: 97 GHARNAYTPQEIVGGIPDGEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLKHGFDEWF 155
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVIKSH 188
G N + D+ + V D Y Q LT + +++ +K
Sbjct: 156 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLQTGEANLTQVYLQEALDFMKRQ 215
Query: 189 N-HSRPLFLQI----THAAVHT 205
RP FL THA V+
Sbjct: 216 QVAQRPFFLYWAIDGTHAPVYA 237
>gi|229822462|ref|YP_002883988.1| sulfatase [Beutenbergia cavernae DSM 12333]
gi|229568375|gb|ACQ82226.1| sulfatase [Beutenbergia cavernae DSM 12333]
Length = 478
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 100/201 (49%), Gaps = 24/201 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G G TP+IDALA +G Y P C+P+RA+ LTGKYP R G+ +
Sbjct: 27 GWRDLGCFGSTFYETPHIDALAASGTRFTHSYAAAPVCSPTRASLLTGKYPARVGVTNWI 86
Query: 82 GAGVAKA---------VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
G A +P E L + L+ GY T +GKWH+G + LP + GFD +
Sbjct: 87 GGHAIGALRDVPYFHGLPQDEYALARALRAGGYRTWHVGKWHLGGGRH--LPEHHGFDLN 144
Query: 133 VGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
VG G + + + + +G +E AP ++LTD TD +V +++S + +
Sbjct: 145 VG---GSASGSPVSYYAPYGIG-----ALED-APD--GEFLTDRLTDVAVDLVRSSDDA- 192
Query: 193 PLFLQITHAAVHTGTAGNAKL 213
P L + H AVHT A L
Sbjct: 193 PFLLNLWHYAVHTPIEAPAHL 213
>gi|254511428|ref|ZP_05123495.1| arylsulfatase [Rhodobacteraceae bacterium KLH11]
gi|221535139|gb|EEE38127.1| arylsulfatase [Rhodobacteraceae bacterium KLH11]
Length = 545
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 45/96 (46%), Positives = 65/96 (67%), Gaps = 1/96 (1%)
Query: 35 IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEK 94
I TP+I+ LA G+ L R YT P+CTP+R A LTG++P R G++ A V + +P +E
Sbjct: 90 IETPSINQLATEGMSLMRMYTEPSCTPTRTAMLTGRHPIRAGVEEVKVALVGEGLPASEV 149
Query: 95 LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
LP+ LK++GY+T +GKWH G + E+ P N+GFD
Sbjct: 150 TLPEILKQVGYNTAHVGKWHQG-DIEQSYPHNQGFD 184
>gi|47522740|ref|NP_999120.1| N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
gi|75054309|sp|Q8WNQ7.1|GALNS_PIG RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|18028088|gb|AAL55968.1|AF322917_1 N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
Length = 522
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 65/205 (31%), Positives = 96/205 (46%), Gaps = 20/205 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y P C+PSRAA LTG+ P R G T
Sbjct: 41 GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYAANPLCSPSRAALLTGRLPIRTGFYTTN 100
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G + +P E LLP+ LK GY++ ++GKWH+G ++ + P GFD
Sbjct: 101 GHARNAYTPQEIVGGIPDPEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLKHGFDEWF 159
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 160 GSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNLTQIYLQEALDFIKRQ 219
Query: 189 NHS-RPLFL----QITHAAVHTGTA 208
+ P FL THA V+ A
Sbjct: 220 QATHHPFFLYWAIDATHAPVYASRA 244
>gi|149196937|ref|ZP_01873990.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149140047|gb|EDM28447.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 462
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 65/219 (29%), Positives = 103/219 (47%), Gaps = 41/219 (18%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ND+ +G I +P ID LA G+ L +Y P C+ SRAA LTG+YP G+
Sbjct: 33 QGYNDLSCYGSKTIKSPRIDQLAEEGLKLTSYYVASPVCSASRAALLTGRYPKLVGV--- 89
Query: 81 VGAGVA------KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
GV K + + + + LK +GY+T +GKWH+G ++ E LP N+GFD++ G
Sbjct: 90 --PGVFFPNRGHKGLDPKHQTIAKLLKSVGYATKAVGKWHLG-DELEFLPTNQGFDSYYG 146
Query: 135 Y-------------WNGYLTYNDSIHETDFAVGLDARR----NMERYAPQM--------- 168
++ Y + + + +A + M+ P M
Sbjct: 147 IPYSNDMTPAFSMKYSENCLYREGVDQEALKKAFEANKIKPVGMKDKVPLMRNDECIEMP 206
Query: 169 -SSKYLTDFFTDQSVHVI-KSHNHSRPLFLQITHAAVHT 205
+T FTD+S+ I +S ++P FL + H+ HT
Sbjct: 207 ADQSTITKRFTDESIKFIDESTASNKPFFLYLAHSMPHT 245
>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
Length = 782
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 60/170 (35%), Positives = 88/170 (51%), Gaps = 24/170 (14%)
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ A VP+ EKLLP+ LKE GY+TH++GKWH+G ++E LP RGFD++ GY G
Sbjct: 363 IWACQPNCVPLDEKLLPELLKEAGYATHMVGKWHLGMYRKECLPTRRGFDSYFGYLLGSE 422
Query: 141 TYND--------SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
Y S++ T A+ R+ E A + Y T+ F ++V +I +H +
Sbjct: 423 DYYSHERCVLIRSLNVTRCALDF---RDGEEVAVGYKNMYSTNVFAKRAVDLIANHPPDK 479
Query: 193 PLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
PLFL + +VH LQVP EE + ++ I N RR +A
Sbjct: 480 PLFLYLAFQSVHEP-----------LQVP--EEYVKPYSFIQNKKRRNYA 516
>gi|332665095|ref|YP_004447883.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332333909|gb|AEE51010.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 531
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 96/191 (50%), Gaps = 14/191 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G +G + TPN+D LA GI L Y C P+RA+ LTG Y G+ V
Sbjct: 54 GYSDIGCYG-GEAQTPNLDKLATKGIKLRSFYNAGRCCPTRASLLTGNYSHAAGMGNMVS 112
Query: 83 AGVAKAVPVTEK--------LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
K P + + ++L+++GY T++ GKWH+G + E P RGFD + G
Sbjct: 113 FDDQKVTPGPYQGYLDPNTPTIAEHLRQVGYHTYMTGKWHVG-ERPEHWPLKRGFDRYFG 171
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRP 193
+G ++ + + E + ++ E PQ Y TD FTD+++ I+ S+P
Sbjct: 172 LISGASSFFEILQEKRKRYMV--LQDQEWVLPQ-EGFYATDAFTDRAIEFIQGQAPQSKP 228
Query: 194 LFLQITHAAVH 204
FL + + A H
Sbjct: 229 FFLYLAYTAPH 239
>gi|225012438|ref|ZP_03702874.1| sulfatase [Flavobacteria bacterium MS024-2A]
gi|225003415|gb|EEG41389.1| sulfatase [Flavobacteria bacterium MS024-2A]
Length = 471
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 65/192 (33%), Positives = 90/192 (46%), Gaps = 15/192 (7%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ D+G +G DI TPN+D LA G +Y T P C+ SRA+ LTG YP R GI
Sbjct: 35 QGFGDLGVYGATDIKTPNLDRLAGEGARFTSYYATQPVCSASRASILTGCYPDRIGIHNA 94
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
G + E L + LKE GY+T + GKWH+G + E P GFD + G +
Sbjct: 95 YSPGSKVGLNPEETTLAELLKEKGYATGIFGKWHLG-DAPEFQPRKHGFDEYYG-----I 148
Query: 141 TYNDSI---HETDFAV----GLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y++ + H AV + N +LT TD+++ IK N P
Sbjct: 149 LYSNDMWPKHPQQGAVFNFPDIKLYENETPLRVLEDQTFLTGALTDRAIDFIKK-NKENP 207
Query: 194 LFLQITHAAVHT 205
F+ + H H
Sbjct: 208 FFVYLPHPQPHV 219
>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
Length = 540
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 98/207 (47%), Gaps = 26/207 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GWN FH N+I TP + L NG+ L HYT C+P+RA+FLTG++P+++ +
Sbjct: 41 GWNAPSFH-NNEIITPTLHHLHANGVELYSHYTYMFCSPTRASFLTGRFPYKHEMTN--- 96
Query: 83 AGVAKAVPVTEKL--------LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+P T L L LK+ YSTH IGKWH+G K+E P RGFD G
Sbjct: 97 ---TNLLPPTRMLGLDLSYTTLADKLKQANYSTHHIGKWHLGMYKKEYTPRYRGFDTTFG 153
Query: 135 YWNGYLT-YNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR- 192
+ G Y + AV L + E A M+ + FTD ++ +I+++ R
Sbjct: 154 FLTGGENHYTQRAFVSPPAVDL---WDEEAPAYGMNGTWTGKMFTDAALDIIRNNAQLRN 210
Query: 193 ------PLFLQITHAAVHTGTAGNAKL 213
PLF+ VH T +L
Sbjct: 211 ATGDAPPLFIYFALHDVHAPTQSPVRL 237
>gi|443700719|gb|ELT99563.1| hypothetical protein CAPTEDRAFT_110993 [Capitella teleta]
Length = 339
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 51/118 (43%), Positives = 69/118 (58%), Gaps = 4/118 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G+ D G+ +DI TPNID L +GI Y+ C+PSR+AFL+G+Y + G+ V
Sbjct: 43 GYQDAGYR-NSDIHTPNIDKLVADGISFTNAYSAQQCSPSRSAFLSGRYAYTSGMQHGVI 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG-CNKEELLPFNRGFDNHVGYWNG 138
G A + + + YLKEL Y+TH GKWH+G CNK E P RGFD G ++G
Sbjct: 102 GDTKAHCMDLKYNFISDYLKELKYNTHASGKWHLGYCNK-ECTPTYRGFDTFSGGYSG 158
>gi|285808548|gb|ADC36070.1| sulfatase [uncultured bacterium 213]
Length = 478
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 91/186 (48%), Gaps = 11/186 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRY--GIDT 79
G+ DV +G D+ TPN+D +A G+ L + C+ +R A +TG+Y +R G++
Sbjct: 50 GYADVSCYGRPDLNTPNVDRVALKGVRFLQAYANSAVCSATRTALITGRYQYRLPIGLEE 109
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
P+G G +P LP L++ GY T L+GKWH+G + P G+D+ G+ G
Sbjct: 110 PLGIGRDVGLPPEHPTLPSLLRKAGYRTTLLGKWHLGA-LPKFGPLQSGYDHFYGFRGGS 168
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-RPLFLQI 198
+ Y A + P S YLTD ++V VI ++HS RP + +
Sbjct: 169 VDY------YTHAGPDQRDDLWDDDVPLRQSGYLTDLLGSRAVDVINGYSHSDRPFLVSL 222
Query: 199 THAAVH 204
+A H
Sbjct: 223 HFSAPH 228
>gi|296140673|ref|YP_003647916.1| sulfatase [Tsukamurella paurometabola DSM 20162]
gi|296028807|gb|ADG79577.1| sulfatase [Tsukamurella paurometabola DSM 20162]
Length = 766
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 97/203 (47%), Gaps = 35/203 (17%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G +G ++IPTP++DALA GI H+T P C+P+RAA LTG P R G
Sbjct: 50 GFSDIGPYG-SEIPTPHLDALAARGIRSVNHHTTPVCSPARAALLTGINPHRAGY----- 103
Query: 83 AGVAKAVPVTEKL----------LPQYLKELGYSTHLIGKWHIGCNK-------EELLPF 125
A VA + P L LP+ L+E GY+T+ +GKWH+ + P
Sbjct: 104 ASVANSDPGYPNLRLSLADDVLTLPEILREAGYATYAVGKWHLAKDSRLGPDADRGSWPL 163
Query: 126 NRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHV 184
RGFD++ G G N H R N + Y+TD TD +
Sbjct: 164 QRGFDHYYGSLEG---LNSFFHPNQL-----VRDNTADPVTEYPDDFYVTDALTDTATSW 215
Query: 185 IK---SHNHSRPLFLQITHAAVH 204
+K +H+ +P FL H A+H
Sbjct: 216 LKDLRAHDADKPFFLYFAHIAMH 238
>gi|149038400|gb|EDL92760.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus norvegicus]
Length = 466
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 43 GWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 102
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GFD
Sbjct: 103 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 161
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + + LT + +++ I++
Sbjct: 162 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEALDFIRTQ 221
Query: 188 HNHSRPLFL----QITHAAVH 204
H P FL THA V+
Sbjct: 222 HARQSPFFLYWAIDATHAPVY 242
>gi|241267368|ref|XP_002406367.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215496881|gb|EEC06521.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 158
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 83/145 (57%), Gaps = 3/145 (2%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIH 147
A+P+ L+P+Y + LGY TH++GKWH+G + +P RGFD +G++N L Y +
Sbjct: 13 ALPLDYTLMPEYFRRLGYKTHMVGKWHLGYYDRKYVPLKRGFDTFIGFYNPSLDYYNQNF 72
Query: 148 ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH-TG 206
+ G D R Y + +Y T ++T ++V +I+ H+ S P+FL ++H A H +G
Sbjct: 73 TGNNHTGHDFRCGDRNYWAE-EKEYATYYYTRKTVEIIRCHDKSTPMFLFLSHQAPHVSG 131
Query: 207 TAGNAKLPT-GLLQVPDMEENDRTF 230
++PT G+ V + EN+RT
Sbjct: 132 GRPLLQVPTHGVRNVSYIGENNRTL 156
>gi|422371415|ref|ZP_16451795.1| arylsulfatase [Escherichia coli MS 16-3]
gi|315296831|gb|EFU56120.1| arylsulfatase [Escherichia coli MS 16-3]
Length = 551
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 68/116 (58%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+YP +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYPIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|148223780|ref|NP_001086368.1| MGC82105 protein precursor [Xenopus laevis]
gi|49522125|gb|AAH75173.1| MGC82105 protein [Xenopus laevis]
Length = 569
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 61/171 (35%), Positives = 85/171 (49%), Gaps = 15/171 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G +VG +G N + TPNID LA G+ L H + CTPSRAAFLTG+YP R G+
Sbjct: 35 GIGEVGCYGNNTLRTPNIDRLAREGVKLTHHIAASSLCTPSRAAFLTGRYPIRSGMTGHD 94
Query: 82 G-------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN---KEELL--PFNRGF 129
G + V+ +P E + L+E GY+T +IGKWH+G N +++ P N GF
Sbjct: 95 GGYLVLMWSAVSGGLPTNETTFAKILQEQGYTTGIIGKWHLGVNCRSRDDFCHHPLNHGF 154
Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQ 180
D + G Y ND + + R + YA + LT T +
Sbjct: 155 DYYYGLL--YTLINDCQASMPSEIHVAFRAQLLFYAQLFAVTLLTAMVTKR 203
>gi|118084193|ref|XP_416855.2| PREDICTED: arylsulfatase D [Gallus gallus]
Length = 596
Score = 93.6 bits (231), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 56/127 (44%), Positives = 68/127 (53%), Gaps = 18/127 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
G DVG +G N I TPNID LA G+ L +H P CTPSRAAFLTG+YP R G+ +
Sbjct: 53 GIGDVGCYGNNTIRTPNIDRLAREGVKLTQHIAAAPLCTPSRAAFLTGRYPIRSGMASSN 112
Query: 81 --------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNR 127
G+G +P E + L++ GY+T LIGKWH G N E P N
Sbjct: 113 RYRALQWNAGSG---GLPANETTFARLLQQQGYTTGLIGKWHQGVNCESFSDHCHHPLNH 169
Query: 128 GFDNHVG 134
GFD G
Sbjct: 170 GFDYFYG 176
>gi|149196006|ref|ZP_01873062.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
gi|149140853|gb|EDM29250.1| putative exported uslfatase [Lentisphaera araneosa HTCC2155]
Length = 713
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 92/206 (44%), Gaps = 33/206 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDT-- 79
GWND+ +G TP++D +A G Y P C+P+RA+ L GKYP R G+
Sbjct: 251 GWNDIACYGSQFYETPHLDKMAKEGFRFTDAYAANPVCSPTRASILLGKYPSRVGLSNHS 310
Query: 80 ----PVGAG-------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL---LPF 125
P G G V +P+ + L + LKE+GY T IGKWH+ + + P
Sbjct: 311 GSSGPKGPGHKLTPVPVKGNMPLEDITLAEALKEVGYKTAHIGKWHLQAHHDTSRNHFPE 370
Query: 126 NRGFD-NHVGYWNG-----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
GFD N G+ G Y Y H + N+ A YLTD TD
Sbjct: 371 KHGFDLNIAGHRMGQPGSFYFPYKSKQHPS---------TNVPDMADGQEGDYLTDKLTD 421
Query: 180 QSVHVIKSHNHSRPLFLQITHAAVHT 205
+++H IK N P FL + VHT
Sbjct: 422 KAIHYIKE-NKDTPFFLNFWYYTVHT 446
>gi|7527462|gb|AAF63155.1|AF111346_1 N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
gi|7576473|gb|AAF63858.1| N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
Length = 520
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 98/201 (48%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 39 GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GF+
Sbjct: 99 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFNEWF 157
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + LT +T +++ I++
Sbjct: 158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYTQEALDFIQTQ 217
Query: 188 HNHSRPLFL----QITHAAVH 204
H P FL THA V+
Sbjct: 218 HARQSPFFLYWAIDATHAPVY 238
>gi|33601723|ref|NP_889283.1| sulfatase [Bordetella bronchiseptica RB50]
gi|412337890|ref|YP_006966645.1| sulfatase [Bordetella bronchiseptica 253]
gi|33576160|emb|CAE33239.1| probable sulfatase [Bordetella bronchiseptica RB50]
gi|408767724|emb|CCJ52480.1| probable sulfatase [Bordetella bronchiseptica 253]
Length = 464
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW ++G +G I PTP IDALA G C P+R+A +TG++P R G
Sbjct: 19 GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
V AG+ + + E+ L Q E GY+T + GKWH+G +KE P +RGFD W G
Sbjct: 79 SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133
Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
T N+S+ AVG D A R ERY +M + + + T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189
Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
I H P FL + +H
Sbjct: 190 CEFIGRHAGKAPFFLYVPLTQLH 212
>gi|453072694|ref|ZP_21975742.1| arylsulfatase [Rhodococcus qingshengii BKS 20-40]
gi|452757342|gb|EME15747.1| arylsulfatase [Rhodococcus qingshengii BKS 20-40]
Length = 773
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 100/197 (50%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G G ++I TP +D LA GI + ++T P C+PSRAA LTG P R G
Sbjct: 55 GYSDIGPFG-SEIETPTLDRLAAQGIRMTNYHTTPLCSPSRAALLTGLNPHRAGYGFVAN 113
Query: 83 A-----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
A G+ + + LP+ L+ GY+T+ +GKWH+ + + P RGFD
Sbjct: 114 ADPGYPGLRLELADDVQTLPEILRGAGYATYAVGKWHLVRDANLAPGRSRDSWPTQRGFD 173
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK---S 187
+ G G +S + + + ++ +++ Y YLTD TD++V IK +
Sbjct: 174 RYYGSLEGL----NSFYYPNQLISDNSVVDVDEYP---EGYYLTDDLTDKAVGYIKDLRA 226
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL H A+H
Sbjct: 227 HDQDKPFFLYFAHVAMH 243
>gi|410420178|ref|YP_006900627.1| sulfatase [Bordetella bronchiseptica MO149]
gi|408447473|emb|CCJ59148.1| probable sulfatase [Bordetella bronchiseptica MO149]
Length = 464
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW ++G +G I PTP IDALA G C P+R+A +TG++P R G
Sbjct: 19 GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
V AG+ + + E+ L Q E GY+T + GKWH+G +KE P +RGFD W G
Sbjct: 79 SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133
Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
T N+S+ AVG D A R ERY +M + + + T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189
Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
I H P FL + +H
Sbjct: 190 CEFIGRHAGKAPFFLYVPLTQLH 212
>gi|260804443|ref|XP_002597097.1| hypothetical protein BRAFLDRAFT_215750 [Branchiostoma floridae]
gi|229282360|gb|EEN53109.1| hypothetical protein BRAFLDRAFT_215750 [Branchiostoma floridae]
Length = 577
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 50/103 (48%), Positives = 62/103 (60%), Gaps = 7/103 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGI---D 78
G DVG G + I TPNID++A G L +H P CTPSRAAFLTG+YP RYG+ D
Sbjct: 34 GIGDVGCFGNDTIRTPNIDSIAAKGAKLTQHLAAAPVCTPSRAAFLTGRYPIRYGMAGRD 93
Query: 79 TP---VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN 118
P V + +P +E PQ K+ GY T L+GKWH+G N
Sbjct: 94 LPMAFVQLAIPSGLPRSEVTFPQLAKDHGYQTALLGKWHLGLN 136
>gi|33597308|ref|NP_884951.1| sulfatase [Bordetella parapertussis 12822]
gi|427814649|ref|ZP_18981713.1| probable sulfatase [Bordetella bronchiseptica 1289]
gi|33573735|emb|CAE38032.1| probable sulfatase [Bordetella parapertussis]
gi|410565649|emb|CCN23207.1| probable sulfatase [Bordetella bronchiseptica 1289]
Length = 464
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW ++G +G I PTP IDALA G C P+R+A +TG++P R G
Sbjct: 19 GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
V AG+ + + E+ L Q E GY+T + GKWH+G +KE P +RGFD W G
Sbjct: 79 SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133
Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
T N+S+ AVG D A R ERY +M + + + T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189
Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
I H P FL + +H
Sbjct: 190 CEFIGRHAGKAPFFLYVPLTQLH 212
>gi|404216649|ref|YP_006670870.1| Arylsulfatase A-related enzyme [Gordonia sp. KTR9]
gi|403647448|gb|AFR50688.1| Arylsulfatase A-related enzyme [Gordonia sp. KTR9]
Length = 797
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 103/197 (52%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++D+ G +I TPN+D LA NGI L+ ++T P C+P+RAA LTG P R G +
Sbjct: 72 GYSDIAPFGA-EIDTPNLDRLARNGIRLSNYHTTPVCSPARAALLTGLNPHRAGYGSVAN 130
Query: 80 --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI------GCNKEE-LLPFNRGFD 130
P G+ + LP+ L+E GY+T+ +GKWH+ G ++ P RGFD
Sbjct: 131 SDPGFPGLRLELADDVLALPEILRESGYATYAVGKWHLVRDANMGPGRDRGSWPLQRGFD 190
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
++ G G +S + + ++ ++E Y Y+TD TD+++ IKS
Sbjct: 191 SYYGSLEGL----NSFFYPNQLIADNSVVDVETYP---EDYYVTDDLTDRAIGQIKSLRA 243
Query: 188 HNHSRPLFLQITHAAVH 204
+ ++P FL H A+H
Sbjct: 244 QDPTKPFFLYFAHIAMH 260
>gi|296231792|ref|XP_002761310.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Callithrix jacchus]
Length = 458
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 77/162 (47%), Gaps = 31/162 (19%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G + Y+ P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LKE GY T ++GKWH+G ++ + P GFD
Sbjct: 102 AHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLG-HRPQFHPLKHGFDEWF 160
Query: 134 GY---------------------WNGYLTYNDSIHETDFAVG 154
G W Y D++ E D ++G
Sbjct: 161 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYGDAVREMDDSIG 202
>gi|221119831|ref|XP_002168522.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 223
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 89/165 (53%), Gaps = 10/165 (6%)
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ A A V + EK LPQYLK +GY TH IGKWH+G +E P RGFD+ GY+ G
Sbjct: 6 IFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGYYGGQA 65
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKSHNHSRPLFLQ 197
Y D ++ GLD + + + ++ Y T ++ +++ I++HN ++P+FL
Sbjct: 66 DYWDHSLASNGWWGLDLHYDTPSSSKNIFNQWGNYSTAMYSMEAIDRIRNHNSTQPMFLY 125
Query: 198 ITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ + AVH+ A L LQ P +E F+HI + R+ +A
Sbjct: 126 LAYQAVHS-----ANLREYPLQAP--QEWVDKFSHIKHKGRQNYA 163
>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
Length = 471
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 91/184 (49%), Gaps = 10/184 (5%)
Query: 26 DVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRY--GIDTPV- 81
D+G +G+ DI TPN+D LA GI + Y C+ +R A +TG+Y +R G++ P+
Sbjct: 47 DLGVYGQTDIRTPNLDKLAGQGIRFTQAYANSAVCSATRFALITGRYQYRLRGGLEEPIA 106
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
GA +P T LP LK+ GY T LIGKWH+G P G+D+ G + G +
Sbjct: 107 GASDTLGLPRTHPTLPSLLKKQGYGTALIGKWHLGY-LPTFGPLKSGYDSFFGNYGGAID 165
Query: 142 YNDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y H+ VG + ++ E P Y TD ++V ++ +P L + +
Sbjct: 166 Y--FTHKP--GVGPQVKEDLYEGEVPVHQIGYYTDLLGARAVDFVQKQQAGKPFLLSLHY 221
Query: 201 AAVH 204
A H
Sbjct: 222 TAPH 225
>gi|395840581|ref|XP_003793133.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase E [Otolemur
garnettii]
Length = 811
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 50/106 (47%), Positives = 65/106 (61%), Gaps = 7/106 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G DVG +G I TPNID LA +G++L +H + CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDVGCYGNRTIRTPNIDRLAEDGVMLTQHIAAASVCTPSRAAFLTGRYPVRSGMVSSD 104
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEE 121
G AGV+ +P E + L++ GY T LIGKWH+G N E
Sbjct: 105 GDRVLQWAGVSGGLPTNETTFAKILQDKGYVTGLIGKWHLGLNCES 150
>gi|226184211|dbj|BAH32315.1| probable arylsulfatase [Rhodococcus erythropolis PR4]
Length = 773
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 100/197 (50%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
G++D+G G ++I TP +D LA GI + ++T P C+PSRAA LTG P R G
Sbjct: 55 GYSDIGPFG-SEIETPTLDRLASQGIRMTNYHTTPLCSPSRAALLTGLNPHRAGYGFVAN 113
Query: 79 -TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
P G+ + + LP+ L+ GY+T+ +GKWH+ + + P RGFD
Sbjct: 114 ADPGYPGLRLELADDVQTLPEILRGAGYATYAVGKWHLVRDANLAPGRSRDSWPTQRGFD 173
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK---S 187
+ G G +S + + + ++ +++ Y YLTD TD++V IK +
Sbjct: 174 RYYGSLEGL----NSFYYPNQLISDNSVVDVDEYP---EGYYLTDDLTDKAVGYIKDLRA 226
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL H A+H
Sbjct: 227 HDQDKPFFLYFAHVAMH 243
>gi|311746665|ref|ZP_07720450.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|311302556|gb|EAZ82500.2| sulfatase family protein [Algoriphagus sp. PR1]
Length = 465
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 99/202 (49%), Gaps = 27/202 (13%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP-TCTPSRAAFLTGKYPFRYGIDT- 79
QG+ DVG G I TPN+D +A G Y CTPSR+A +TG+ P R G+ +
Sbjct: 39 QGYGDVGTFGHPTIKTPNLDQMAMEGQKWTNFYVAANVCTPSRSAIMTGRLPVRTGMYSN 98
Query: 80 ------PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
P G +P TE + + LK GYST IGKWH+G + E LP + GFD +
Sbjct: 99 TRRVLFPDSGG---GLPATENTIAKLLKTSGYSTAAIGKWHLG-HLPEYLPTSHGFDTYF 154
Query: 134 G--YWNGYLTYNDSIHETDFAVG---------LDARRNMERYAPQMSSKYLTDFFTDQSV 182
G Y N ND + FA + + +ER A Q + +T +T+++V
Sbjct: 155 GIPYSNDMDRINDVTAQEAFASPKPEYFNVPLMRDKEIIERPADQTT---ITKRYTEEAV 211
Query: 183 HVIKSHNHSRPLFLQITHAAVH 204
IK+ N +P F+ + H+ H
Sbjct: 212 SYIKA-NKDQPFFIYLAHSLPH 232
>gi|348555459|ref|XP_003463541.1| PREDICTED: steryl-sulfatase-like [Cavia porcellus]
Length = 580
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 53/125 (42%), Positives = 68/125 (54%), Gaps = 12/125 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + TPNID LA G+ L +H P CTPSRAAF+TG+YP R G+ +
Sbjct: 33 GIGDLGCYGNQTLRTPNIDRLAGGGVKLTQHLAASPLCTPSRAAFMTGRYPIRLGMASHS 92
Query: 82 GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
GV + +P +E + LKE GYST LIGKWH+G N P GFD
Sbjct: 93 RMGVYLFTASSGGLPTSEVTFARLLKEQGYSTALIGKWHLGINCYNTTDFCHHPLRHGFD 152
Query: 131 NHVGY 135
G+
Sbjct: 153 YFYGF 157
>gi|345327068|ref|XP_001514429.2| PREDICTED: arylsulfatase E [Ornithorhynchus anatinus]
Length = 629
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/124 (43%), Positives = 69/124 (55%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + + TPNID LA G+ L +H + + CTPSRAAFLTG+YP R G+ +
Sbjct: 85 GIGDLGCYGNDTLRTPNIDRLAQEGVRLTQHISAASVCTPSRAAFLTGRYPIRSGMVSSD 144
Query: 82 GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G V + +P E + L+E GYST LIGKWH G N E P N GFD
Sbjct: 145 GYRVLRWTACSGGLPANETTFGEILQEQGYSTGLIGKWHQGLNCERSWDHCHHPLNHGFD 204
Query: 131 NHVG 134
G
Sbjct: 205 YFFG 208
>gi|410635289|ref|ZP_11345904.1| N-acetylgalactosamine-6-sulfatase [Glaciecola lipolytica E3]
gi|410145262|dbj|GAC13109.1| N-acetylgalactosamine-6-sulfatase [Glaciecola lipolytica E3]
Length = 493
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 95/194 (48%), Gaps = 15/194 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP--TCTPSRAAFLTGKYPFRYGIDTP 80
G+ D+ G + I TPNID++ G +R + +P C+PSRA+ LTG+YP R G+
Sbjct: 53 GYGDISSFGADGIRTPNIDSIGQEGFT-SRDFFIPANVCSPSRASLLTGRYPMRNGMPVA 111
Query: 81 VGAGVAKAVPV------TEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
V K V E +P+ LK GY + ++GKWH+G ++ P + GFD H+G
Sbjct: 112 VNPLSEKHVSSHFGLHPDEITIPEMLKPAGYRSLMVGKWHLGFQQKGSHPLDAGFDEHLG 171
Query: 135 YWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS 191
Y Y ++ + + D + R E ++ + +T +TD+ + I+
Sbjct: 172 LLGNY--YKARENDPRYPILKDNQTLYRGHEAVKEEIELEEVTQRYTDEVISFIEREKDG 229
Query: 192 RPLFLQITHAAVHT 205
P F+ H VH+
Sbjct: 230 -PFFVYFAHNIVHS 242
>gi|326913667|ref|XP_003203156.1| PREDICTED: arylsulfatase D-like [Meleagris gallopavo]
Length = 576
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/127 (44%), Positives = 68/127 (53%), Gaps = 18/127 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
G DVG +G N I TPNID LA G+ L +H P CTPSRAAFLTG+YP R G+ +
Sbjct: 33 GIGDVGCYGNNTIRTPNIDRLAREGVKLTQHIAAAPLCTPSRAAFLTGRYPIRSGMASSN 92
Query: 81 --------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNR 127
G+G +P E + L++ GY+T LIGKWH G N E P N
Sbjct: 93 QYRALQWNAGSG---GLPANETTFARILQQQGYTTGLIGKWHQGVNCESFNDHCHHPLNH 149
Query: 128 GFDNHVG 134
GFD G
Sbjct: 150 GFDYFYG 156
>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
Length = 431
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 87/163 (53%), Gaps = 24/163 (14%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYN---- 143
+P+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G Y
Sbjct: 19 CIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 78
Query: 144 ----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
D+++ T A+ R+ E A + Y T+ FT+++ +I +H +PLFL +
Sbjct: 79 CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATTLITNHPPEKPLFLYLA 135
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP EE + + I + +RR +A
Sbjct: 136 LQSVHEP-----------LQVP--EEYLKPYDFIQDRNRRYYA 165
>gi|149178470|ref|ZP_01857059.1| arylsulfatase A (precursor) [Planctomyces maris DSM 8797]
gi|148842683|gb|EDL57057.1| arylsulfatase A (precursor) [Planctomyces maris DSM 8797]
Length = 491
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 94/193 (48%), Gaps = 16/193 (8%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ DVG G +I TPN+D +A GI Y C+ SR A LTG YP R GI
Sbjct: 56 QGYQDVGVFGSPNIKTPNLDQMAKEGIRFTDFYAAQAVCSASRVALLTGCYPNRVGIRGA 115
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+G + E + + +K GY+T + GKWH+G + E LP GFD + G L
Sbjct: 116 LGPQSKIGINAEETTIAEVVKPQGYATAIYGKWHLG-HLPEFLPTRHGFDEYFG-----L 169
Query: 141 TYNDSIHETDFAVG-----LDARRNMERYAPQMSSK---YLTDFFTDQSVHVIKSHNHSR 192
Y++ + G L N P+++ K L+ ++T+++V I + NH +
Sbjct: 170 PYSNDMWPFHPTAGKRFPDLPLIENETVINPKVTGKEQAQLSTWYTERAVSFI-NKNHDK 228
Query: 193 PLFLQITHAAVHT 205
P FL + H+ H
Sbjct: 229 PFFLYVPHSMPHV 241
>gi|421612498|ref|ZP_16053605.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
gi|408496794|gb|EKK01346.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
Length = 487
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 104/200 (52%), Gaps = 19/200 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ND+G +G +I TPN+D LA G Y+ C+PSRAA LTG YP R G+
Sbjct: 57 QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 116
Query: 81 VGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
V +K + E + +LK GY+T +GKWH+G +K E LP + GFD++ G Y N
Sbjct: 117 VLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 175
Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
G ++ +D + AV L ++ E + + +T +TD+++
Sbjct: 176 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTITRRYTDRAIEF 235
Query: 185 IKSHNHSRPLFLQITHAAVH 204
+++ N +P FL + H+ H
Sbjct: 236 VEA-NQDKPFFLYLPHSMPH 254
>gi|336118326|ref|YP_004573095.1| arylsulfatase [Microlunatus phosphovorus NM-1]
gi|334686107|dbj|BAK35692.1| arylsulfatase [Microlunatus phosphovorus NM-1]
Length = 785
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 114/230 (49%), Gaps = 38/230 (16%)
Query: 5 VGAGVAKAVPV--TEKLLPQG-------------WNDVGFHGENDIPTPNIDALAYNGIV 49
+G ++++VP E+ PQG + D+G +G ++I TP++D LA +G+
Sbjct: 25 IGRTISESVPAWPAERTAPQGSPNVIVIVVDDLGYADLGPYG-SEIATPHLDRLAADGVR 83
Query: 50 LNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGA-----GVAKAVPVTEKLLPQYLKELG 104
++T P C+PSRAA LTG P + G P + +P L + L+E G
Sbjct: 84 FTNYHTTPLCSPSRAALLTGLNPHKAGFAFPANSDPGYPAYTFTLPDNAPTLAETLRERG 143
Query: 105 YSTHLIGKWHIGCNK-------EELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDA 157
Y+T +GKWH+ ++ + P RGFD + G G+ S+H V ++
Sbjct: 144 YATFALGKWHLTGDRLQHDGASKASWPCQRGFDRYFGALEGFT----SLHAPHRLVWDNS 199
Query: 158 RRNMERYAPQMSSKYLTDFFTDQSVHVI---KSHNHSRPLFLQITHAAVH 204
++ + + YLTD T++++ +I ++ + +P FL + HAAVH
Sbjct: 200 PYPVQEFP---ADYYLTDDLTERAIEMISTLRAADADKPFFLYLAHAAVH 246
>gi|334145238|ref|YP_004538448.1| sulfatase family protein [Novosphingobium sp. PP1Y]
gi|333937122|emb|CCA90481.1| sulfatase family protein [Novosphingobium sp. PP1Y]
Length = 425
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 89/184 (48%), Gaps = 11/184 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGI-DTP 80
G+ D+ G I TPNID +A G ++ Y CTPSRA LTG+YP R G+ D
Sbjct: 20 GYGDLSITGARGIKTPNIDRMAREGRTFSQFYAAANLCTPSRAGLLTGRYPVRTGLGDKV 79
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ + +P +E +P LK Y+T L GKWH+G + LP + GFD VG +
Sbjct: 80 ILYNDDRVLPTSEVTIPTALKTAEYATGLFGKWHLGHRGPDWLPTHHGFDRFVG-----I 134
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y+ H+ V + A E + L F +++ I + N RP F+++
Sbjct: 135 PYS---HDMSPLVLVRAEAGKEAHQVPTEITPLQQIFCEEAEQFI-TENAERPFFVELAL 190
Query: 201 AAVH 204
+A H
Sbjct: 191 SAPH 194
>gi|427822349|ref|ZP_18989411.1| probable sulfatase [Bordetella bronchiseptica Bbr77]
gi|410587614|emb|CCN02660.1| probable sulfatase [Bordetella bronchiseptica Bbr77]
Length = 464
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW ++G +G I PTP IDALA G C P+R+A +TG++P R G
Sbjct: 19 GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
V AG+ + + E+ L Q E GY+T + GKWH+G +KE P +RGFD W G
Sbjct: 79 SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133
Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
T N+S+ AVG D A R ERY +M + + + T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189
Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
I H P FL + +H
Sbjct: 190 CEFIGRHAGKVPFFLYVPLTQLH 212
>gi|392969626|ref|ZP_10335041.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387841820|emb|CCH57099.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 477
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 97/197 (49%), Gaps = 26/197 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+ +G TP++D+LA GI + Y+ P C+PSRAA LTGK+P R + +
Sbjct: 48 GYMDLRCYGNPYNETPHLDSLARRGIRFTQAYSACPVCSPSRAAILTGKHPARLHLTNFI 107
Query: 82 G------------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
G A + +P +E L + LK+ GY T ++GKWH+G + L P +GF
Sbjct: 108 GGERVDTTSSLLPAEWRRYLPASETTLAELLKQQGYVTGMVGKWHLGNTGDSLTPTAQGF 167
Query: 130 DNHVGYW-NGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
D NG YN SI + V D + +YLTD TD ++ I +
Sbjct: 168 DYERQISKNGLDYYNYSIASNNKTVFEDTGK-----------EYLTDKLTDYALEFIDQN 216
Query: 189 NH-SRPLFLQITHAAVH 204
+PLFL + ++A H
Sbjct: 217 KAGQKPLFLYLAYSAPH 233
>gi|345330079|ref|XP_001507106.2| PREDICTED: arylsulfatase D-like [Ornithorhynchus anatinus]
Length = 607
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/125 (43%), Positives = 66/125 (52%), Gaps = 13/125 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYG----- 76
G D+G +G I TPNID LA G+ L +H P CTPSRA+FLTG+YP R G
Sbjct: 42 GIGDLGCYGNTTIRTPNIDRLAKEGVRLTQHLAAAPLCTPSRASFLTGRYPIRSGRMESE 101
Query: 77 --IDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGF 129
I V G + +P E + L++ GY+T LIGKWH G N E P N GF
Sbjct: 102 ELIRVIVWNGASGGLPANETTFARILQQQGYTTGLIGKWHQGVNCESRTDYCHHPLNHGF 161
Query: 130 DNHVG 134
D G
Sbjct: 162 DYFFG 166
>gi|429202673|ref|ZP_19194044.1| arylsulfatase [Streptomyces ipomoeae 91-03]
gi|428661782|gb|EKX61267.1| arylsulfatase [Streptomyces ipomoeae 91-03]
Length = 769
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 92/197 (46%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G G ++PTP +DALA G+ L ++TLP C+PSRAA LTG P R G
Sbjct: 49 GYSDIGPFGA-EVPTPVLDALAEQGVRLTNYHTLPLCSPSRAALLTGANPHRVGYAMVAN 107
Query: 83 A-----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-------LPFNRGFD 130
A G + L + L+ GY+T+ +GKWH+ + P +GFD
Sbjct: 108 ADPGFPGYGMEIADDFPTLAETLRGAGYATYAVGKWHLARDASSSAAADRSNWPLQKGFD 167
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
+ G G LD E Y Y TD TDQ++ ++KS
Sbjct: 168 QYYGVLEGLTNLFHPHQLVRDNSPLDIDEFPEGY-------YYTDDITDQAIAMVKSLRA 220
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL + H AVH
Sbjct: 221 HDPDKPFFLYLAHNAVH 237
>gi|427819008|ref|ZP_18986071.1| probable sulfatase [Bordetella bronchiseptica D445]
gi|410570008|emb|CCN18144.1| probable sulfatase [Bordetella bronchiseptica D445]
Length = 464
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 94/203 (46%), Gaps = 30/203 (14%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW ++G +G I PTP IDALA G C P+R+A +TG++P R G
Sbjct: 19 GWGELGCYGGGAIRGAPTPRIDALAAQGTQFLNFNVESDCVPTRSALMTGRHPVRTGAMQ 78
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
V AG+ + + E+ L Q E GY+T + GKWH+G +KE P +RGFD W G
Sbjct: 79 SVPAGLPQGLVPWERTLAQAFSEQGYATAMYGKWHLG-DKEGRYPKDRGFDE----WYGI 133
Query: 140 -LTYNDSIHETDFAVGLD-----------------ARRNMERYAPQMSSKYLTDFFTDQS 181
T N+S+ AVG D A R ERY +M + + + T +S
Sbjct: 134 PRTTNESMFME--AVGFDPDVVEVPYVMEGRKGSPAERR-ERYDLEMRRR-IDEVLTQRS 189
Query: 182 VHVIKSHNHSRPLFLQITHAAVH 204
I H P FL + +H
Sbjct: 190 CEFIGRHAGKVPFFLYVPLTQLH 212
>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
Length = 475
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 87/163 (53%), Gaps = 24/163 (14%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYN---- 143
+P+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G Y
Sbjct: 68 CIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 127
Query: 144 ----DSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
D+++ T A+ R+ E A + Y T+ FT+++ +I +H +PLFL +
Sbjct: 128 CTVIDALNVTRCALDF---RDGEEVATGYKNMYSTNVFTERATTLITTHPPEKPLFLYLA 184
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP EE + + I + +RR +A
Sbjct: 185 LQSVHEP-----------LQVP--EEYLKPYDFIQDKNRRHYA 214
>gi|395804314|ref|ZP_10483554.1| sulfatase [Flavobacterium sp. F52]
gi|395433413|gb|EJF99366.1| sulfatase [Flavobacterium sp. F52]
Length = 550
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/191 (32%), Positives = 93/191 (48%), Gaps = 12/191 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
G++D+G +G ++I TPN+D LA G L Y C P+RA+ LTG+Y + G+ D
Sbjct: 42 GYSDLGNYG-SEIKTPNLDRLAKEGTRLREFYNNSICAPTRASLLTGQYQHKAGVGYFDV 100
Query: 80 PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+G + E L L + + GYST + GKWH+G + P RGFD G G
Sbjct: 101 NLGLPAYQGYLNKESLTLGEVFRSGGYSTLMSGKWHVGSEDQSQWPNQRGFDKFYGILKG 160
Query: 139 YLTYNDS----IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN-HSRP 193
Y D+ T + V + RN E P+ S Y TD + +V ++ N ++P
Sbjct: 161 ASNYFDTKPLPFGTTPYPVKM--IRNNEELHPKDDSYYFTDEIGNNAVTFLEEQNKENKP 218
Query: 194 LFLQITHAAVH 204
FL + A H
Sbjct: 219 FFLYLAFTAPH 229
>gi|357393955|ref|YP_004908796.1| putative arylsulfatase [Kitasatospora setae KM-6054]
gi|311900432|dbj|BAJ32840.1| putative arylsulfatase [Kitasatospora setae KM-6054]
Length = 778
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 93/197 (47%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID---- 78
G++D+G G +++PTP +D LA G+ L ++T+P C+P+RAA LTG P R G
Sbjct: 54 GYSDIGPFG-SEVPTPTLDGLAERGVKLANYHTMPLCSPARAALLTGLNPHRVGYSFVAN 112
Query: 79 -TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-------GCNKEELLPFNRGFD 130
P G + L Q L + GY+T+ +GKWH+ + P +GFD
Sbjct: 113 ADPGFPGYGMEIAGDIPTLAQTLHDAGYATYAVGKWHLTRDSASSAADNRANWPLQKGFD 172
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
+ G G + LD + Y Y TD TDQ++ ++KS
Sbjct: 173 QYYGVLEGLTSLFHPHQLVRDNSPLDIDEFPDGY-------YYTDDITDQAIGMVKSLRA 225
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL + H AVH
Sbjct: 226 HDADKPFFLYLAHNAVH 242
>gi|13278373|gb|AAH04002.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Mus musculus]
Length = 520
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 63/201 (31%), Positives = 97/201 (48%), Gaps = 20/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P C+PSRAA LTG+ P R G T
Sbjct: 39 GWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIRNGFYTTN 98
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E LLP+ LK+ GY+ ++GKWH+G ++ + P GFD
Sbjct: 99 AHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLG-HRPQFHPLKHGFDEWF 157
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYA--PQMSSKYLTDFFTDQSVHVIKS- 187
G N + D+ + + V D R E + + LT + +++ I++
Sbjct: 158 GSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLTQLYLQEALDFIRTQ 217
Query: 188 HNHSRPLFL----QITHAAVH 204
H P FL THA V+
Sbjct: 218 HARQGPFFLYWAIDATHAPVY 238
>gi|317479852|ref|ZP_07938971.1| sulfatase [Bacteroides sp. 4_1_36]
gi|316903981|gb|EFV25816.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 541
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 94/198 (47%), Gaps = 23/198 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++DVG +G +IPTPNID LA G+ + Y P+RA+ LTG YP + GI
Sbjct: 48 GYSDVGCYG-GEIPTPNIDRLAQKGVRYTQFYNSGRSCPTRASLLTGLYPQQAGIGAMSE 106
Query: 80 ----------PVGAGVAKAVPVTEK---LLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
P GV + + + + LKE GY T++ GKWH+G + +E P
Sbjct: 107 DPGIKKGEKHPENRGVHGYMGFLNRNCVTIAEVLKEAGYHTYMTGKWHVGMHGKEKWPLQ 166
Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK 186
RGF++ G G +Y + + LD N APQ S Y TD FTD ++ I
Sbjct: 167 RGFEHFYGILAGASSYLKP--QGGRGLTLD---NTNLPAPQ-SPYYTTDAFTDYAIRFID 220
Query: 187 SHNHSRPLFLQITHAAVH 204
P FL + + A H
Sbjct: 221 EQTDDNPFFLYLAYNAPH 238
>gi|417301368|ref|ZP_12088525.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327542298|gb|EGF28785.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 470
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 104/200 (52%), Gaps = 19/200 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ND+G +G +I TPN+D LA G Y+ C+PSRAA LTG YP R G+
Sbjct: 38 QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 97
Query: 81 VGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
V +K + E + +LK GY+T +GKWH+G +K E LP + GFD++ G Y N
Sbjct: 98 VLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 156
Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
G ++ +D + AV L ++ E + + +T +TD+++
Sbjct: 157 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEF 216
Query: 185 IKSHNHSRPLFLQITHAAVH 204
+++ N +P FL + H+ H
Sbjct: 217 VEA-NQDKPFFLYLPHSMPH 235
>gi|149198650|ref|ZP_01875694.1| arylsulfatase (aryl-sulfate sulphohydrolase) [Lentisphaera araneosa
HTCC2155]
gi|149138365|gb|EDM26774.1| arylsulfatase (aryl-sulfate sulphohydrolase) [Lentisphaera araneosa
HTCC2155]
Length = 569
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/195 (31%), Positives = 96/195 (49%), Gaps = 19/195 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ D+G +G +I TPN+D LA G+ + Y C P+RA+ LTG YP + GI +
Sbjct: 34 GYTDIGSYG-GEIDTPNLDGLAKEGLRFTQFYNTGRCCPTRASLLTGLYPHQAGIGHMMS 92
Query: 83 ----AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-----KEEL--LPFNRGFDN 131
G + T + + LK YST+++GKWH+ N KE P NRGFD+
Sbjct: 93 DRGTDGYRGDLNKTSVTIAEVLKPAAYSTYMVGKWHVTKNLLNDDKESQYNWPLNRGFDH 152
Query: 132 HVGYWNGYLTYND--SIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHN 189
G +G ++ D S+ D + N Y P+ + Y TD +D ++ I H+
Sbjct: 153 FYGTIHGAGSFFDPNSLTRDDKYI---TPENDPEYQPE--TYYYTDAISDNAIKYINEHD 207
Query: 190 HSRPLFLQITHAAVH 204
+P F+ + + A H
Sbjct: 208 SQKPFFMYVAYTAAH 222
>gi|390167238|ref|ZP_10219235.1| putative arylsulfatase A [Sphingobium indicum B90A]
gi|389590183|gb|EIM68184.1| putative arylsulfatase A [Sphingobium indicum B90A]
Length = 468
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 95/190 (50%), Gaps = 15/190 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYP--FRYGIDT 79
G D+G +G DI TP IDA+A G+ Y C+P+R A LTG+Y FR G++
Sbjct: 49 GHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEE 108
Query: 80 PVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
P+ G ++P + LP L++LGY+T L+GKWH+G P + G+D G +G
Sbjct: 109 PIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVG-ELPASSPLDHGYDYFFGIASG 167
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK-SHNHSRPL 194
Y + +I+ + + R ++R YLTD +++ ++ + RP
Sbjct: 168 GTDYFAHATTINGHEMGKLFENRTEIQR------PGYLTDLLGAKAIDRMRLAARQDRPF 221
Query: 195 FLQITHAAVH 204
F+ + A H
Sbjct: 222 FISLHFTAPH 231
>gi|323451705|gb|EGB07581.1| hypothetical protein AURANDRAFT_27261 [Aureococcus anophagefferens]
Length = 614
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 45/116 (38%), Positives = 65/116 (56%), Gaps = 7/116 (6%)
Query: 24 WNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI-DTPVG 82
+ND H TP + ++ +G VL+ Y PTCTPSRA +TG+Y R G+ D+ +
Sbjct: 61 YNDAALH------TPELQRMSEHGFVLDNFYAAPTCTPSRAMLMTGRYNIRNGMQDSVIH 114
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ + VP+ E+ L Q L + GY T IGKWH+G +++ P RGFD G G
Sbjct: 115 STEPRGVPLDERFLSQKLSDAGYRTAAIGKWHLGMHRDAYTPLKRGFDLFYGILTG 170
>gi|346644762|ref|NP_001231049.1| arylsulfatase E precursor [Sus scrofa]
Length = 585
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 52/120 (43%), Positives = 70/120 (58%), Gaps = 12/120 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + I TPNID LA +G++L +H + CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDLGCYGNHTIRTPNIDRLAADGVMLTQHLAAASLCTPSRAAFLTGRYPLRSGMVSST 104
Query: 82 GAGVAKAV------PVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G+ V + V P E + LK+ GY T L+GKWH+G N + P N GFD
Sbjct: 105 GSRVLQWVAASGGLPPNETTFAKILKDKGYVTGLVGKWHLGLNCDSSEDHCHHPLNHGFD 164
>gi|332663784|ref|YP_004446572.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332332598|gb|AEE49699.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 580
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 99/188 (52%), Gaps = 12/188 (6%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI---DT 79
G++D G +G ++I TPNID LAY G+ L Y C P+RA+ +TG+YP + G+ +T
Sbjct: 44 GYSDFGAYG-SEIQTPNIDKLAYGGLRLKEFYNNSICAPTRASLITGQYPHKAGLGYFNT 102
Query: 80 PVGAGVAKAVPVTEKL-LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+G + E L + L++ GY+T+L GKWH+G N P RGF+ G+ G
Sbjct: 103 NLGLPAYQGWLNQESLTFGEVLQQGGYNTYLTGKWHVG-NDSLYWPNQRGFNKFYGFIGG 161
Query: 139 YLTYNDSIHETDFAVGLDARRNMER--YAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
Y D + A ++ N +R AP KYLTD T+ ++ I + + +P FL
Sbjct: 162 ASNYYDISPYPEKAPPVELVENNQRINLAP---GKYLTDEITNHALSYI-NESKDKPFFL 217
Query: 197 QITHAAVH 204
+ A H
Sbjct: 218 YLAFNAPH 225
>gi|403072042|pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
gi|403072043|pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
gi|403072044|pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
With Galnac
gi|403072045|pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
With Galnac
Length = 502
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 62/201 (30%), Positives = 95/201 (47%), Gaps = 19/201 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +GE TPN+D +A G++ Y+ P +PSRAA LTG+ P R G T
Sbjct: 16 GWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAALLTGRLPIRNGFYTTN 75
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E+LLP+ LK+ GY + ++GKWH+G ++ + P GFD
Sbjct: 76 AHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 134
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSS--KYLTDFFTDQSVHVIKSH 188
G N + D+ + V D R E + + + LT + +++ IK
Sbjct: 135 GSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKRQ 194
Query: 189 NHSRPLFL----QITHAAVHT 205
P FL THA V+
Sbjct: 195 ARHHPFFLYWAVDATHAPVYA 215
>gi|160890611|ref|ZP_02071614.1| hypothetical protein BACUNI_03056 [Bacteroides uniformis ATCC 8492]
gi|156859610|gb|EDO53041.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
Length = 520
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 94/198 (47%), Gaps = 23/198 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++DVG +G +IPTPNID LA G+ + Y P+RA+ LTG YP + GI
Sbjct: 27 GYSDVGCYG-GEIPTPNIDRLAQKGVRYTQFYNSGRSCPTRASLLTGLYPQQAGIGAMSE 85
Query: 80 ----------PVGAGVAKAVPVTEK---LLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
P GV + + + + LKE GY T++ GKWH+G + +E P
Sbjct: 86 DPGIKKGEKHPENRGVHGYMGFLNRNCVTIAEVLKEAGYHTYMTGKWHVGMHGKEKWPLQ 145
Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK 186
RGF++ G G +Y + + LD N APQ S Y TD FTD ++ I
Sbjct: 146 RGFEHFYGILAGASSYLKP--QGGRGLTLD---NTNLPAPQ-SPYYTTDAFTDYAIRFID 199
Query: 187 SHNHSRPLFLQITHAAVH 204
P FL + + A H
Sbjct: 200 EQTDDNPFFLYLAYNAPH 217
>gi|291231637|ref|XP_002735770.1| PREDICTED: steroid sulfatase-like [Saccoglossus kowalevskii]
Length = 584
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/125 (44%), Positives = 71/125 (56%), Gaps = 13/125 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G DVG G + I TPNID LA G+ LN + PTCTPSRAAFLTG+YP R G+ + +
Sbjct: 35 GIGDVGCFGNDTIRTPNIDRLAAEGVKLNHNMVPAPTCTPSRAAFLTGRYPIRMGLASRL 94
Query: 82 GA---GVAKAV---PVTEKLLPQYLKELGYSTHLIGKWHIGCNK------EELLPFNRGF 129
G A+ P +E + LKE GY+T ++GKWH+G + E P N+GF
Sbjct: 95 AGTMFGYNSAIGGMPSSEITFAELLKEAGYTTAVLGKWHLGLHSFSFGRNFEFHPLNQGF 154
Query: 130 DNHVG 134
D G
Sbjct: 155 DFFYG 159
>gi|443716274|gb|ELU07882.1| hypothetical protein CAPTEDRAFT_217757 [Capitella teleta]
Length = 324
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 72/118 (61%), Gaps = 6/118 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G G + + TP++D++ NG+ L+ + CTPSRAA +T +Y R G+ + +
Sbjct: 34 GIGDIGAFGNDTLRTPHVDSICENGVKLDHDLAAASLCTPSRAALMTSRYAIRTGMSSVI 93
Query: 82 GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL----LPFNRGFDNHVG 134
+ ++ + +P +E LPQ L+E GY+T LIGKWH+G N++ L P RGFD G
Sbjct: 94 TSLMSPQGLPTSEHTLPQMLQEKGYATALIGKWHLGWNRQLLDQYYSPLKRGFDYFFG 151
>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 508
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 68/249 (27%), Positives = 108/249 (43%), Gaps = 57/249 (22%)
Query: 23 GWNDVGFHGE---NDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR----- 74
GW +VG+H ++ TPNID+L G+ L++HY C+PSR++ ++G+ P
Sbjct: 36 GWANVGYHRNPPTKEVVTPNIDSLVRQGLELDQHYVFNVCSPSRSSLMSGRLPIHVNDLN 95
Query: 75 -----YGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
Y D PV A+P + Q +K GY TH +GKW G P RGF
Sbjct: 96 IEPDYYNPDDPVSG--FSAIPRNMTGIAQKMKLGGYDTHQVGKWDAGMATHTHTPKGRGF 153
Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTD-------------- 175
D+ GY++ H DF +D + + ++ ++TD
Sbjct: 154 DSSFGYFH---------HANDFYTEIDGKPCNKT---KIVDIWVTDKPGYGLNGTGPDNY 201
Query: 176 ---FFTDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAH 232
F +Q + V+ H+ +PLFL VH LQVP ++ F+
Sbjct: 202 EEGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA-----------PLQVPQRYQD--KFSF 248
Query: 233 ISNPDRRLF 241
I + DR+++
Sbjct: 249 IDDHDRQIY 257
>gi|406833313|ref|ZP_11092907.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 613
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 70/233 (30%), Positives = 102/233 (43%), Gaps = 35/233 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
GW D G + TPNID++A G+ L+R + P C P+RA FLTG+Y R G+ V
Sbjct: 38 GWGDYSHSGNQQVSTPNIDSIAKGGVSLDRFFVCPVCAPTRAEFLTGRYHPRGGVRG-VS 96
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY----WNG 138
G+ + + + EK L + GY+T GKWH G ++ P RGFD + GY W
Sbjct: 97 TGLER-LDLDEKTLADAFQAAGYATGAFGKWHNG-SQWPYHPTARGFDEYFGYTAGHWGE 154
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
Y F L+ M R + Y+ D TD+++ I +H +P +
Sbjct: 155 Y-----------FDAPLEDHGEMVR-----TKGYIVDVCTDRALQFIDAHQQ-KPFLCYV 197
Query: 199 THAAVHTGTAG--------NAKLPTGLLQVPDMEENDRT---FAHISNPDRRL 240
H+ A + + L PD E + T A I N DR +
Sbjct: 198 PFTTPHSPWAAPESDWMRFRDRPLSQLASEPDQEVPEHTRCALAMIENQDRNV 250
>gi|87312329|ref|ZP_01094424.1| arylsulfatase A [Blastopirellula marina DSM 3645]
gi|87284951|gb|EAQ76890.1| arylsulfatase A [Blastopirellula marina DSM 3645]
Length = 477
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/199 (30%), Positives = 105/199 (52%), Gaps = 21/199 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFR-YGIDTPV 81
G+ D+ G TPN++A+A G+ L Y P C+PSRAA +TG YP R I +
Sbjct: 42 GYADIEPFGSEVNRTPNLNAMADEGMKLTCFYAAPVCSPSRAALMTGCYPKRALTIPHVL 101
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWNGY 139
G A+ + E + + +KE GY+T +IGKWH+G ++ + LP +GFD + G Y N
Sbjct: 102 FPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLG-DQPDFLPTRQGFDYYYGLPYSNDM 160
Query: 140 LTYNDSIHETDFAVGLDARRN--------------MERYAPQMSSKYLTDFFTDQSVHVI 185
D + ++++ + R+ ++R + ++ +T+ +T++++ I
Sbjct: 161 GPAADGV-KSNYGAPIPQRKGKGQPPLPLLRNETVLQRVLAKDQTELVTN-YTEEAIQFI 218
Query: 186 KSHNHSRPLFLQITHAAVH 204
+ H +P FL + H+AVH
Sbjct: 219 RDH-QEKPFFLYLPHSAVH 236
>gi|294011191|ref|YP_003544651.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
gi|292674521|dbj|BAI96039.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
Length = 468
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/190 (31%), Positives = 95/190 (50%), Gaps = 15/190 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYP--FRYGIDT 79
G D+G +G DI TP IDA+A G+ Y C+P+R A LTG+Y FR G++
Sbjct: 49 GHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEE 108
Query: 80 PVG-AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
P+ G ++P + LP L++LGY+T L+GKWH+G P + G+D G +G
Sbjct: 109 PIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVG-ELPASSPLDHGYDYFFGIASG 167
Query: 139 ---YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV-HVIKSHNHSRPL 194
Y + +I+ + + R ++R YLTD +++ + ++ RP
Sbjct: 168 GTDYFAHATTINGHEMGKLFENRTEIQR------PGYLTDLLGAKAIDRMQQAARQDRPF 221
Query: 195 FLQITHAAVH 204
F+ + A H
Sbjct: 222 FISLHFTAPH 231
>gi|443704175|gb|ELU01350.1| hypothetical protein CAPTEDRAFT_214223 [Capitella teleta]
Length = 336
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 50/118 (42%), Positives = 69/118 (58%), Gaps = 4/118 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ D G+ +DI TPNID L +GI Y+ C+PSR++FL+G+Y + G+ V
Sbjct: 105 GYQDAGYR-NSDIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRYAYTSGMQHGVI 163
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIG-CNKEELLPFNRGFDNHVGYWNG 138
+ A + + L YLKEL Y+TH GKWH+G CNK E P RGFD G ++G
Sbjct: 164 SDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNK-ECTPTYRGFDTFSGGYSG 220
>gi|346992478|ref|ZP_08860550.1| sulfatase [Ruegeria sp. TW15]
Length = 546
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 45/96 (46%), Positives = 63/96 (65%), Gaps = 1/96 (1%)
Query: 35 IPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEK 94
I TP+I+ A G+ L R YT P+CTP+R A LTG++P R G+ A V + +P +E
Sbjct: 90 IETPSINQFATEGLSLMRMYTEPSCTPTRTAMLTGRHPVRAGVSEVKVALVGEGLPASEV 149
Query: 95 LLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD 130
LP+ LKE+GY+T +GKWH G + E+ P N+GFD
Sbjct: 150 TLPEILKEVGYNTVHVGKWHQG-DIEQAYPHNQGFD 184
>gi|126337083|ref|XP_001362844.1| PREDICTED: arylsulfatase E [Monodelphis domestica]
Length = 583
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 52/124 (41%), Positives = 71/124 (57%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N + TPNID+LA+ G+ L +H + CTPSRAA LTG+YP R G+ +
Sbjct: 43 GIGDIGCYGNNTMRTPNIDSLAHEGVKLTQHLAAASVCTPSRAALLTGRYPIRSGMVSDN 102
Query: 82 GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G V + +P E + L++ GY+T LIGKWH+G N E + P N GFD
Sbjct: 103 GYRVLQWTAASGGLPSNETTFAKILQKEGYATGLIGKWHLGLNCESSIDHCHHPLNHGFD 162
Query: 131 NHVG 134
G
Sbjct: 163 FFYG 166
>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
Length = 551
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 24/163 (14%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------Y 139
VP+ EKLLPQ+LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 139 CVPLDEKLLPQFLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 198
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
T D+++ T A+ R+ E A + Y T+ FT ++ +I +H +PLFL +
Sbjct: 199 CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTKRAATLITNHPPEKPLFLYLA 255
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP EE + + I + +R +A
Sbjct: 256 LQSVHEP-----------LQVP--EEYLKPYDFIQDKNRHHYA 285
>gi|395527024|ref|XP_003765652.1| PREDICTED: arylsulfatase E [Sarcophilus harrisii]
Length = 585
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 69/124 (55%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G N I TPNID LA G+ +H + CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDIGCYGNNTIRTPNIDRLAKEGVKFTQHIAAASVCTPSRAAFLTGRYPIRSGMTSYN 104
Query: 82 GAGVAK------AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G V + +P E + L++ GY+T LIGKWH+G N E + P N GFD
Sbjct: 105 GLPVLQWTATSGGLPSNETTFAKILQKEGYTTGLIGKWHLGLNCESRIDHCHHPLNHGFD 164
Query: 131 NHVG 134
G
Sbjct: 165 FFYG 168
>gi|115906036|ref|XP_797340.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 162
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 42/91 (46%), Positives = 58/91 (63%), Gaps = 1/91 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGID-TPV 81
GW+DV HG + I TPNID LA G+ L +Y P CTP+R+A +TG++P G+ +
Sbjct: 44 GWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGRHPIHTGMQHDTI 103
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGK 112
GA + + EK + Q+LK LGYSTH +GK
Sbjct: 104 GAAEPWGLGLDEKTMAQHLKSLGYSTHAVGK 134
>gi|296470448|tpg|DAA12563.1| TPA: arylsulfatase E [Bos taurus]
Length = 583
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G DVG +G I TPNID LA +G+ L +H P CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDVGCYGNTTIRTPNIDRLAADGVRLTQHLAAAPLCTPSRAAFLTGRYPLRSGMVSSQ 104
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEE---LLPFNRGFD 130
G V+ +P +E + LK GY+T LIGKWH+G C + P N GFD
Sbjct: 105 GLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLIGKWHLGLSCASPDDHCHHPLNHGFD 164
Query: 131 NHVG 134
+ G
Sbjct: 165 HFYG 168
>gi|147901243|ref|NP_001091457.1| arylsulfatase E precursor [Bos taurus]
gi|146186636|gb|AAI40584.1| ARSE protein [Bos taurus]
gi|152941128|gb|ABS45001.1| arylsulfatase E precursor [Bos taurus]
Length = 583
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/124 (44%), Positives = 70/124 (56%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G DVG +G I TPNID LA +G+ L +H P CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDVGCYGNTTIRTPNIDRLAADGVRLTQHLAAAPLCTPSRAAFLTGRYPLRSGMVSSQ 104
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEE---LLPFNRGFD 130
G V+ +P +E + LK GY+T LIGKWH+G C + P N GFD
Sbjct: 105 GLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLIGKWHLGLSCASPDDHCHHPLNHGFD 164
Query: 131 NHVG 134
+ G
Sbjct: 165 HFYG 168
>gi|433607608|ref|YP_007039977.1| Sulfatase [Saccharothrix espanaensis DSM 44229]
gi|407885461|emb|CCH33104.1| Sulfatase [Saccharothrix espanaensis DSM 44229]
Length = 760
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 98/206 (47%), Gaps = 42/206 (20%)
Query: 23 GWNDVG-FHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G F GE + TPN+DALA G+ L ++T P C+PSRAA LTG P R G P
Sbjct: 48 GYADIGPFGGE--VATPNLDALAAGGLRLTNYHTTPLCSPSRAALLTGLNPHRAGFAFPA 105
Query: 82 GA-----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-------GCNKEELLPFNRGF 129
A + +P L + L++ GY+T +GKWH+ P RGF
Sbjct: 106 NADPGYPAYSFQLPDDAPSLAESLRDAGYATFAVGKWHLTRDAASHDAADRSSWPVQRGF 165
Query: 130 DNHVGYWNGY--------LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQS 181
D + G G L +++S ++ D+ G +LTD TD++
Sbjct: 166 DRYFGSLEGLTNLHHPHRLVWDNSPYDGDYPDGY----------------FLTDDLTDRA 209
Query: 182 VHVI---KSHNHSRPLFLQITHAAVH 204
V +I ++++ +P FL H A+H
Sbjct: 210 VRMIDTLRANDPDKPFFLYFAHHAMH 235
>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
Length = 548
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 51/122 (41%), Positives = 71/122 (58%), Gaps = 4/122 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D G+ +DI TPNID L +GI Y+ C+PSR++FL+G+Y + G+ V
Sbjct: 40 GYHDAGYR-NSDIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRYAYTSGMQHGVI 98
Query: 83 AGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIG-CNKEELLPFNRGFDNHVGYWNGYL 140
+ A + + L YLKEL Y+TH GKWH+G CNK E P RGFD G ++G
Sbjct: 99 SDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNK-ECTPTYRGFDTFSGGYSGEG 157
Query: 141 TY 142
Y
Sbjct: 158 KY 159
>gi|325106428|ref|YP_004276082.1| sulfatase [Pedobacter saltans DSM 12145]
gi|324975276|gb|ADY54260.1| sulfatase [Pedobacter saltans DSM 12145]
Length = 535
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/195 (32%), Positives = 102/195 (52%), Gaps = 25/195 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G G +D+ TPN+D +A G+ + Y C PSRA+ LTG Y + G+ V
Sbjct: 37 GYSDIGCFG-SDVQTPNLDEMASKGLKMANFYNASRCCPSRASLLTGLYAHQAGVGDMVN 95
Query: 83 A-------GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
A G VT + + L++ GY+T + GKWH+G NKE P RGFD + G
Sbjct: 96 ARPYPAYQGYLNKTSVT---IAEVLQKNGYNTIMGGKWHVGQNKEN-WPLQRGFDKYFGL 151
Query: 136 WNGYLTYNDSI-----HETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI-KSHN 189
+G +Y ++ + A+G E + P ++ Y TD +TD ++ I ++ N
Sbjct: 152 IDGANSYFENRPYRPNQKLTIALG------NEEFTPG-ANYYSTDAYTDYALRFIEETKN 204
Query: 190 HSRPLFLQITHAAVH 204
+++P FL + + A H
Sbjct: 205 NNKPFFLYLAYQAPH 219
>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
Length = 544
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 75/250 (30%), Positives = 119/250 (47%), Gaps = 51/250 (20%)
Query: 2 DTPVGAGVAKAVPVTEKLLPQGWNDVGFH----GENDIPTPNIDALAYNGIVLNRHYTL- 56
DTPV + V + G+ND+ H + + T NIDALA +GI+ R Y
Sbjct: 52 DTPVDDNRPNIILVLADDM--GYNDISIHNGGAADGTLQTKNIDALAKSGILFTRGYAAN 109
Query: 57 PTCTPSRAAFLTGKYPFRYGID-TPVGA------------------------GVAKAVPV 91
TC PSRA+ +TGKYP R+G + TP+ A V+ P
Sbjct: 110 ATCAPSRASIMTGKYPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREVVSNMPPF 169
Query: 92 TEKLLP-------QYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYND 144
E+ +P + L++ GY T IGKWH+G ++ + P ++GF + +G D
Sbjct: 170 MEQGMPTEQITIAEVLRDAGYYTAHIGKWHLG-HEYGMDPMSQGFQDSLGLVGPLYLPED 228
Query: 145 --SIHETDFAVGLDAR-RNMERYAPQMS-------SKYLTDFFTDQSVHVIKSHNHSRPL 194
+ F +D M +Y+ + KY+TD++TD+++ VI+ +N +RP
Sbjct: 229 HPDVVNAKFDTRIDKMIWGMGQYSANFNGGDLFAPDKYVTDYYTDEALKVIE-NNKNRPF 287
Query: 195 FLQITHAAVH 204
FL ++H A+H
Sbjct: 288 FLYLSHWAIH 297
>gi|126337066|ref|XP_001381279.1| PREDICTED: steryl-sulfatase-like [Monodelphis domestica]
Length = 813
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 67/124 (54%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D G +G + TPNID +A G+ +H P CTPSRAAFLTG+YP R G+ +
Sbjct: 266 GIGDPGCYGNTTLRTPNIDRIAKGGVKFTQHLAASPLCTPSRAAFLTGRYPIRSGMASRS 325
Query: 82 GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELL---PFNRGFD 130
GV + +P E + LK GYST LIGKWH+G CN + P N GFD
Sbjct: 326 KVGVFLFSASSGGLPTNEITFAKLLKNQGYSTALIGKWHLGINCNSRDDFCHHPLNHGFD 385
Query: 131 NHVG 134
+ G
Sbjct: 386 HFYG 389
>gi|392941987|ref|ZP_10307629.1| arylsulfatase A family protein [Frankia sp. QA3]
gi|392285281|gb|EIV91305.1| arylsulfatase A family protein [Frankia sp. QA3]
Length = 796
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 98/202 (48%), Gaps = 33/202 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV- 81
G++D+G G +IPTP +D LA G+ L ++T+P C+P+RAA LTG P R G
Sbjct: 72 GYSDIGPFGA-EIPTPALDRLAERGVRLTNYHTMPLCSPARAALLTGLNPHRVGYAMVAN 130
Query: 82 --------GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI-------GCNKEELLPFN 126
G +A VP L Q L + GY+T+ +GKWH+ + P
Sbjct: 131 ADPGFPGYGMEIADDVPT----LAQLLHDAGYATYAVGKWHLTRDSASNAADDRRNWPLQ 186
Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQM-SSKYLTDFFTDQSVHVI 185
+GFD + G G LT H+ R N ++ + Y TD TDQ++ ++
Sbjct: 187 KGFDQYYGVLEG-LTSLFHPHQL-------VRDNSPLQVDELPAGYYYTDDITDQAISMV 238
Query: 186 ---KSHNHSRPLFLQITHAAVH 204
++H+ +P FL + H AVH
Sbjct: 239 TSLRAHDPEKPFFLYLAHNAVH 260
>gi|325107642|ref|YP_004268710.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324967910|gb|ADY58688.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 749
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 12/188 (6%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ D+G +G ++ TP IDALA G +Y P C+PSRA LTG YP R G
Sbjct: 39 QGYYDLGCYGATEVETPEIDALAAEGTRFTDYYAAAPICSPSRAGLLTGCYPRRVGNHIW 98
Query: 81 V-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
V A + E L + + GY+T IGKWH+G + E LP N+GFD++ G
Sbjct: 99 VHRADSDTGIHPNELTLAELFHQNGYATACIGKWHLGFH-EPFLPQNQGFDHYFG----- 152
Query: 140 LTYNDSIHETDF---AVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFL 196
L +N ET + G+ RN + LT +TD+++ ++ H +P FL
Sbjct: 153 LLHNLDPVETVYFEEQGGVPLLRNDQVVQRPADPAELTKQYTDEAISWMEQH-RDQPFFL 211
Query: 197 QITHAAVH 204
+ H +H
Sbjct: 212 YLPHTMLH 219
>gi|87306602|ref|ZP_01088749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
gi|87290781|gb|EAQ82668.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
Length = 468
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/219 (31%), Positives = 99/219 (45%), Gaps = 27/219 (12%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYG---- 76
QG+ D+ G+N TP +D LA +G L Y + P CTPSRA+ +TG+YP R G
Sbjct: 42 QGFADLSCIGDNGCRTPRLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNGTYDM 101
Query: 77 ----------IDTP----VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL 122
+ TP V A + E L LK+ GY + + GKW G +
Sbjct: 102 IRNEAPDYDYLYTPEEYAVTAERILGTDLQEVFLADVLKQAGYVSAVFGKWD-GGQLKRY 160
Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSV 182
LP RGFD + G+ N + Y HE G+ + + + YLTD F +++
Sbjct: 161 LPLQRGFDQYYGFANTGVDY--FTHER---YGVPSMFRDNQPTEEDKGTYLTDLFEREAI 215
Query: 183 HVIKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVP 221
I NH RP FL + A H+ + + + G Q P
Sbjct: 216 RFI-DENHDRPFFLYLPFNAPHSASNLDRSI-RGFAQAP 252
>gi|410030097|ref|ZP_11279927.1| sulfatase [Marinilabilia sp. AK2]
Length = 476
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/186 (33%), Positives = 92/186 (49%), Gaps = 4/186 (2%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP 80
QG++DVG G +DI TP++D LA G+ Y C+ SRAA LTG Y R GI
Sbjct: 40 QGYHDVGVFGASDIATPHLDQLAAEGVQFTNFYVAQAVCSASRAALLTGVYSNRLGIHGA 99
Query: 81 VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL 140
+ + E + LK LGY+T + GKWH+G + E LP N+GFD ++G
Sbjct: 100 LDHMSRYGLHPEEATIADILKPLGYATAMFGKWHLG-HYPEFLPTNQGFDEYLGIPYSND 158
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYL-TDFFTDQSVHVIKSHNHSRPLFLQIT 199
+ + D+ L +N + + + + T FT++S+ I+ N RP FL +
Sbjct: 159 MWPNHPQTKDYYPPLPLYQNDKVIDTIWNDQSMFTTLFTEKSIDFIE-RNKDRPFFLYLA 217
Query: 200 HAAVHT 205
H H
Sbjct: 218 HPMPHV 223
>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
Length = 431
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 87/164 (53%), Gaps = 26/164 (15%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIH 147
+P+ EKLLP+ L+E GY TH++GKWH+G K+E LP +RGFD + GYL ++ +
Sbjct: 17 CLPLDEKLLPELLQEAGYVTHMVGKWHLGMYKKECLPTHRGFDTYF----GYLLGSEDYY 72
Query: 148 ETDFAVGLDAR---------RNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQI 198
D V + A+ R+ E A + Y T+ FT++++ +I H +PLFL +
Sbjct: 73 SHDRCVLIKAKNITRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIAHHKTEKPLFLYL 132
Query: 199 THAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH L+VP EE + ++ I + RR +A
Sbjct: 133 AFQSVHEP-----------LEVP--EEYMKPYSSIKDAKRRHYA 163
>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
queenslandica]
Length = 523
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 63/198 (31%), Positives = 94/198 (47%), Gaps = 16/198 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTP- 80
GW D+G +G TPN+D +A G++L Y+ P C+PSRAA LTG+ P R G T
Sbjct: 44 GWGDLGVYGHPVKETPNLDKMALEGMLLPDFYSANPLCSPSRAAMLTGRLPIRNGFYTTN 103
Query: 81 -------VGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E L P+ L++ GY+T +IGKWH+G + P GFD
Sbjct: 104 AHARNAYTPQDIVGGIPDSEILYPELLQKNGYATMIIGKWHLG-QQTHYHPLKHGFDEFF 162
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAP-----QMSSKYLTDFFTDQSVHVI-KS 187
G N + D + + V +A Y + LT +T +++ I K+
Sbjct: 163 GSTNCHFGPFDGKEQPNMPVYRNATMAGRYYQDFPINHKTGESNLTVEYTQEAIKFINKN 222
Query: 188 HNHSRPLFLQITHAAVHT 205
+ +P FL T A HT
Sbjct: 223 AANKKPFFLYWTPDATHT 240
>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 486
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 69/214 (32%), Positives = 95/214 (44%), Gaps = 39/214 (18%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT-PV 81
GW+D+G +G + TPNID A + Y + C+PSR+ +TGK+ R
Sbjct: 37 GWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMSVCSPSRSTLMTGKHAARLHFTIWAE 96
Query: 82 GAGVAKA-------------VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
GA A +P +EK + YLK GY T LIGKWH+G E P G
Sbjct: 97 GAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWHLG--DWEHYPEAHG 154
Query: 129 FDNHVG--YWNGYLTY-----NDSIHETDFAVGLDARRNMERYAPQMS----SKYLTDFF 177
FD ++G W T+ H +F RY P + +YLTD
Sbjct: 155 FDINIGGTNWGAPQTFWWPYSGSGTHGPEF-----------RYIPHLEYGHPGEYLTDRL 203
Query: 178 TDQSVHVIKSHNHSRPLFLQITHAAVHTGTAGNA 211
TD+++ VI H +P F+ + H AVHT A
Sbjct: 204 TDEAIKVI-DHAGDQPFFVYLAHHAVHTPIEAKA 236
>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
Length = 519
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 24/163 (14%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------Y 139
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 107 CVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 166
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
T D+++ T A+ R+ E A + Y T+ FT+++ +I +H +PLFL +
Sbjct: 167 CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALITNHPPEKPLFLYLA 223
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP EE + + I + +R +A
Sbjct: 224 LQSVHEP-----------LQVP--EEYLKPYNFIQDKNRHYYA 253
>gi|410635995|ref|ZP_11346602.1| arylsulfatase [Glaciecola lipolytica E3]
gi|410144672|dbj|GAC13807.1| arylsulfatase [Glaciecola lipolytica E3]
Length = 499
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 64/192 (33%), Positives = 97/192 (50%), Gaps = 13/192 (6%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDT- 79
QG+ DVG G + + TPN+D +A G++L Y P C+PSRA +TG YP R + T
Sbjct: 55 QGYEDVGVFGGDHVLTPNLDKMAEEGLMLTDFYVPSPLCSPSRAGLMTGSYPRRVDMATG 114
Query: 80 ---PV-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
PV A K + E + + LK +GY+T + GKWH+G ++ E LP +GFD G
Sbjct: 115 SNFPVLLAADTKGLNPAEITIAEVLKSVGYATGIFGKWHLG-DQPEFLPTRQGFDEFFGL 173
Query: 136 WNGY---LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR 192
+ T+ H + L N+ P + +YLT T++++ I+ H +
Sbjct: 174 PYSHDIAPTHKRQAHFKFPDLPLMENENVIELNP--NPEYLTRRITERAIDFIERHQDA- 230
Query: 193 PLFLQITHAAVH 204
P FL + H H
Sbjct: 231 PFFLYLPHPMPH 242
>gi|426256630|ref|XP_004021940.1| PREDICTED: arylsulfatase E [Ovis aries]
Length = 583
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 71/124 (57%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G DVG +G + + TPNID LA +G+ L +H P CTPSRAAFLTG+YP R G+ +
Sbjct: 45 GIGDVGCYGNSTLRTPNIDRLAADGVRLTQHLAAAPVCTPSRAAFLTGRYPLRSGMVSSQ 104
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEE---LLPFNRGFD 130
G V+ +P +E + LK GY+T L+GKWH+G C + P N GFD
Sbjct: 105 GLRVLQWTAVSGGLPPSEITFAKILKAKGYTTGLVGKWHLGLSCASPDDHCHHPLNHGFD 164
Query: 131 NHVG 134
+ G
Sbjct: 165 HFYG 168
>gi|187735071|ref|YP_001877183.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
gi|187425123|gb|ACD04402.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
Length = 542
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 94/188 (50%), Gaps = 11/188 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----D 78
GW+D G +G ++IPTP +D LA G++ R YT C+PSRA+ +TG P + + D
Sbjct: 46 GWSDPGCYG-SEIPTPALDTLARQGMLATRLYTASRCSPSRASIMTGCEPHKVDVGLLDD 104
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
G + LP+ LK+ GY T+L GKWH+G + P++RGFD G G
Sbjct: 105 DSGRPGYRGRLNPGIPTLPELLKKAGYRTYLSGKWHLGKVRGS-YPWDRGFDRSRGLLGG 163
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSR-PLFLQ 197
Y + ++ F G + + P+ Y+TD T ++ I SR P FL
Sbjct: 164 AADYYRPMPDSPF--GENGKLLRPEDLPE--DFYMTDDITKTALAYIGDAAKSRQPFFLY 219
Query: 198 ITHAAVHT 205
+ + A HT
Sbjct: 220 VAYTAPHT 227
>gi|344297991|ref|XP_003420678.1| PREDICTED: steryl-sulfatase [Loxodonta africana]
Length = 578
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 68/124 (54%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D G +G + TPNID LA G+ L +H P CTPSRAAF+TG+YP R G+ +
Sbjct: 33 GIGDPGCYGNKTLRTPNIDRLAQGGVKLTQHLAASPLCTPSRAAFMTGRYPIRSGMASSS 92
Query: 82 GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELL---PFNRGFD 130
GV + +P +E + LK GYST LIGKWH+G CN + P + GFD
Sbjct: 93 RIGVYIVTASSGGLPTSEITFARLLKNQGYSTALIGKWHLGSNCNSKSDFCHHPLSHGFD 152
Query: 131 NHVG 134
G
Sbjct: 153 YFYG 156
>gi|332662522|ref|YP_004445310.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
gi|332331336|gb|AEE48437.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
Length = 449
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 90/197 (45%), Gaps = 27/197 (13%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGI-VLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTP 80
G+ D+ +G D TPN+D LA GI +N + P C P+RAAF+TG+YP + TP
Sbjct: 41 MGYGDLSCYGRKDYTTPNLDKLASQGIKFVNAYSAAPVCNPTRAAFMTGRYPAK----TP 96
Query: 81 VGA-------------GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR 127
+G G+ P L + GY T LIGKWH+G + P
Sbjct: 97 IGLIEPLTQSKRDSTFGLTAEFPSIATL----MSASGYETALIGKWHLGFLPQH-SPVKN 151
Query: 128 GFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS 187
GFD G + D I G A E P YLT+ F+ ++V IK
Sbjct: 152 GFDYFFGI---HSAAADYISHKSGLPGNRAHDLYENDTPVYPEGYLTNLFSQKAVAYIK- 207
Query: 188 HNHSRPLFLQITHAAVH 204
H++P FL IT+ AVH
Sbjct: 208 QKHNKPFFLTITYNAVH 224
>gi|229490602|ref|ZP_04384440.1| arylsulfatase [Rhodococcus erythropolis SK121]
gi|229322422|gb|EEN88205.1| arylsulfatase [Rhodococcus erythropolis SK121]
Length = 773
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 100/197 (50%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G G ++I TP +D LA GI + ++T P C+PSRAA LTG P R G
Sbjct: 55 GYSDIGPFG-SEIETPTLDRLAAQGIRMTNYHTTPLCSPSRAALLTGLNPHRAGYGFVAN 113
Query: 83 A-----GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
A G+ + + LP+ L+ GY+T+ +GKWH+ + + P RGFD
Sbjct: 114 ADPGYPGLRLELADDVQTLPEILRGAGYATYAVGKWHLVRDANLAPGRSRDSWPTQRGFD 173
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK---S 187
+ G G +S + + + ++ +++ Y YLT+ TD++V IK +
Sbjct: 174 RYYGSLEGL----NSFYYPNQLISDNSVVDVDEYP---EGYYLTEDLTDKAVGYIKDLRA 226
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL H A+H
Sbjct: 227 HDQDKPFFLYFAHVAMH 243
>gi|410988054|ref|XP_004000303.1| PREDICTED: steryl-sulfatase [Felis catus]
Length = 578
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/120 (44%), Positives = 69/120 (57%), Gaps = 12/120 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + TPNID LA G+ L +H P CTPSRAAF+TG+YP R G+ +
Sbjct: 33 GIGDLGCYGNKTLRTPNIDRLAEGGVKLTQHLAASPLCTPSRAAFMTGRYPIRSGMASEF 92
Query: 82 GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--C-NKEELL--PFNRGFD 130
GV + +P +E + LK GYST LIGKWH+G C NK + P + GFD
Sbjct: 93 LVGVYLFSASSGGLPTSEITFAKLLKGQGYSTALIGKWHLGTNCHNKSDFCHHPLSHGFD 152
>gi|145391|gb|AAC32036.1| putative arylsulfatase [Escherichia coli]
Length = 475
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/116 (44%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWHIG NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGLTT-LPQLLHDQGYVTQAIGKWHIGENKES-QPQNVGFDDFRGF 210
>gi|119475675|ref|ZP_01616028.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
gi|119451878|gb|EAW33111.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
Length = 479
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 102/202 (50%), Gaps = 27/202 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGID--- 78
G+ D+G +G I +PN+D +A GI Y + CTPSRA LTG+ P R G+
Sbjct: 49 GYGDIGAYGHPTIRSPNLDQMAAEGIKWTNFYAASSVCTPSRAGLLTGRLPVRSGMAHDQ 108
Query: 79 ----TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
P G +P TE + + LKE Y T L+GKWH+G + P + GFD + G
Sbjct: 109 IRVLFPTSTG---GLPTTEITIAKALKEKDYRTALVGKWHLG-HLPGFQPLDHGFDEYFG 164
Query: 135 --YWNGY-----LTYNDSI---HETDFAVGLDARRN-MERYAPQMSSKYLTDFFTDQSVH 183
Y N + L+Y +I + DF V L R+ +ER A Q + +T +T ++V
Sbjct: 165 IPYSNDHDLKKELSYIQTITHAKDGDFNVPLMQNRSIIERPANQNT---ITKRYTQEAVS 221
Query: 184 VIKSHNHSRPLFLQITHAAVHT 205
IK N ++P FL + H+ H
Sbjct: 222 FIKK-NSNQPFFLYLAHSMPHV 242
>gi|198275209|ref|ZP_03207740.1| hypothetical protein BACPLE_01368 [Bacteroides plebeius DSM 17135]
gi|198271792|gb|EDY96062.1| arylsulfatase [Bacteroides plebeius DSM 17135]
Length = 509
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 96/204 (47%), Gaps = 25/204 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
GW DVG++G TPNID LA G++ Y + +PSR + +TGKYP R GI +
Sbjct: 42 GWADVGYNGSRFYETPNIDRLASEGMIFTDGYAAASISSPSRVSLMTGKYPARTGITDWI 101
Query: 82 ------------------GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL- 122
+ +P+ E + + KE GY+T+ +GKWH C ++ L
Sbjct: 102 PGYQYGLKPEQLKQYKMLAPEMPLNMPLEEVTMAEAFKEHGYATYHVGKWH--CAEDSLY 159
Query: 123 LPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQM-SSKYLTDFFTDQS 181
P +GFD ++G W + I + G Y P ++LTD D+S
Sbjct: 160 YPQYQGFDVNIGGW--LKGSPNGIRRSQGGKGAYCSPYRNPYLPDGPEGEFLTDRLGDES 217
Query: 182 VHVIKSHNHSRPLFLQITHAAVHT 205
+ +IK+ + +P FL + AVHT
Sbjct: 218 IKLIKNSSADKPFFLYLAFYAVHT 241
>gi|422831041|ref|ZP_16879191.1| hypothetical protein ESNG_03696 [Escherichia coli B093]
gi|371602932|gb|EHN91614.1| hypothetical protein ESNG_03696 [Escherichia coli B093]
Length = 270
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|325109524|ref|YP_004270592.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969792|gb|ADY60570.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 486
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/205 (29%), Positives = 93/205 (45%), Gaps = 30/205 (14%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ D+ +G DI TP ID +A G+ N Y C+P+RA+ +TG + R GI +
Sbjct: 41 GYGDLACYGAKDIATPAIDRMATEGVKCNSFYVSAVCSPTRASLMTGSHSIRVGIGGVMF 100
Query: 83 AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTY 142
+ E LP+ LK+ GY+T +IGKWH+G N++ P N GFD YW G
Sbjct: 101 PRNNHGLNPDEITLPELLKDQGYATAIIGKWHLG-NEDMFQPMNHGFD----YWYGTPAS 155
Query: 143 NDS-----------------------IHETDFAVGLDARRNMERYAPQMSSKYLTDFFTD 179
N+ I + A R N+ P S++ T +T
Sbjct: 156 NNQFYYPTIKKYAADCVFREGYTRNGILTRETAACPLIRDNVVIEVPADQSQF-TQRYTR 214
Query: 180 QSVHVIKSHNHSRPLFLQITHAAVH 204
+++ I + NH +P F+ + H H
Sbjct: 215 ETIRFI-TENHEQPFFIYLAHNMPH 238
>gi|294053911|ref|YP_003547569.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613244|gb|ADE53399.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 469
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 109/233 (46%), Gaps = 38/233 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDT-- 79
G+ D+G+ G IPTPNID LA G+ Y T C PSRA FLTG+Y R+G +T
Sbjct: 43 GYGDLGYTGSKHIPTPNIDRLANEGVECTYGYVTHQYCGPSRAGFLTGRYQQRFGFETNP 102
Query: 80 PVGA-GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
P VP +E+L + L+ +GY T ++GKWHIG + P NR G
Sbjct: 103 PYDRHNTIAGVPASERLFAERLQAVGYKTGIVGKWHIGSHSIH-HPNNR----------G 151
Query: 139 YLTYNDSIHETDFAVGLDARRNMER--YAPQMSS-------KYLTDFFTDQSVHVIKSHN 189
+ + + +D R M+ P M + YLT TD+++ I+ N
Sbjct: 152 FDFFFGFLGGGHDFFRVDTREPMDEGYLDPMMRNGSSVDVEGYLTTQLTDEAIGFIE-RN 210
Query: 190 HSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
P FL +++ A P LQ P EE+ F+H+ +RR+++
Sbjct: 211 EKDPFFLFLSYNA-----------PHAPLQAP--EESIAKFSHVEGKERRVYS 250
>gi|449134034|ref|ZP_21769542.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448887354|gb|EMB17735.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 728
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 95/185 (51%), Gaps = 6/185 (3%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ D+G +G ++ TP ID +A GI +Y P C+PSRA LTG YP R G
Sbjct: 16 QGYYDLGCYGATEVKTPRIDEMAGGGIRFTDYYAAAPICSPSRAGLLTGCYPRRVGNHVW 75
Query: 81 V-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
V A + E L + K+ GY T IGKWH+G + E LP N+GFD++ G +
Sbjct: 76 VHRADSNTGIHSDELTLAELFKDNGYKTACIGKWHLGFH-EPFLPQNQGFDHYFGLLHN- 133
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
L ++++ D G+ +R+ + LT +T++++ I++ N P L +
Sbjct: 134 LDPVETVYFEDVG-GVPLQRDRDVVKRPADPDELTKLYTNEAIDFIEA-NKEGPFLLYLP 191
Query: 200 HAAVH 204
H +H
Sbjct: 192 HTMLH 196
>gi|390368732|ref|XP_784356.2| PREDICTED: N-acetylgalactosamine-6-sulfatase-like
[Strongylocentrotus purpuratus]
Length = 482
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 65/195 (33%), Positives = 96/195 (49%), Gaps = 18/195 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +G TPN+D +A GI+L Y P +PSRAA LTG+ P R G T
Sbjct: 7 GWGDLGIYGNPAKETPNLDQMAAEGILLPDFYAANPLGSPSRAALLTGRLPIRNGFYTTN 66
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
G V +P +E LLP+ LK GY + ++GKWH+G + + LP GFD
Sbjct: 67 GHAHNAWSQQIVKGGIPDSEILLPKLLKLSGYKSKIVGKWHLG-HLPQYLPLKHGFDEWF 125
Query: 134 GYWNGYLTY--NDSIHETDFAVGLDARRNMERYAPQMSSKY-LTDFFTDQSVHVI-KSHN 189
G N ++ N ++ +G R E++ + + + LT + + ++ I KS
Sbjct: 126 GAPNCHIKSLPNIPVYRDSEMIG----RYFEQFIIEKNGESNLTQLYIKEGLNFIEKSAE 181
Query: 190 HSRPLFLQITHAAVH 204
+P FL T A H
Sbjct: 182 AKQPFFLYWTPDATH 196
>gi|218262868|ref|ZP_03477199.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
DSM 18315]
gi|218223078|gb|EEC95728.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
DSM 18315]
Length = 461
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/190 (32%), Positives = 89/190 (46%), Gaps = 19/190 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNG-IVLNRHYTLPTCTPSRAAFLTGKYPFRYG----I 77
G+ D GF G DI TPNID LA G I + H +PSR+ LTG+Y RYG +
Sbjct: 42 GYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECNL 101
Query: 78 DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWN 137
D P +P E+LLP LK GY T IGKWH+G + P +GFD G
Sbjct: 102 DKP-----GDGIPDDEELLPALLKRYGYRTGCIGKWHLGSEPSQ-RPNAKGFDTFYGLLA 155
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH--NHSRPLF 195
G+ +Y ++ + + D N+++Y +FTD+ + +P
Sbjct: 156 GHRSY---FYDPETS---DKDGNLQQYQYNGQKLSFDGYFTDELASKARQFVAESEQPFM 209
Query: 196 LQITHAAVHT 205
L ++ A H+
Sbjct: 210 LYMSFTAPHS 219
>gi|397733173|ref|ZP_10499895.1| sulfatase family protein [Rhodococcus sp. JVH1]
gi|396930984|gb|EJI98171.1| sulfatase family protein [Rhodococcus sp. JVH1]
Length = 790
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 103/197 (52%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++D+G G ++I TPN++ LA +G L+ ++T C+P+RAA LTG P R G +
Sbjct: 65 GYSDIGPFG-SEIDTPNLNRLADSGYRLSNYHTTSVCSPARAALLTGLNPHRAGYGSVAN 123
Query: 80 --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
P G+ + L + L+ GY+TH +GKWH+ + + P RGFD
Sbjct: 124 FDPGFPGLRMELADDALSLAEILRANGYATHAVGKWHLARDTNLAPGRTRDSWPLQRGFD 183
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
++ G G +S + + + ++ ++E Y S Y+TD TD++V IKS
Sbjct: 184 SYYGSLEGL----NSFYYPNELISDNSVVDVEEYP---SDYYVTDDITDKAVSRIKSLRA 236
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL +H A+H
Sbjct: 237 HDADKPFFLYFSHIAMH 253
>gi|111020297|ref|YP_703269.1| arylsulfatase [Rhodococcus jostii RHA1]
gi|110819827|gb|ABG95111.1| arylsulfatase [Rhodococcus jostii RHA1]
Length = 790
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 103/197 (52%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++D+G G ++I TPN++ LA +G L+ ++T C+P+RAA LTG P R G +
Sbjct: 65 GYSDIGPFG-SEIDTPNLNRLADSGYRLSNYHTTSVCSPARAALLTGLNPHRAGYGSVAN 123
Query: 80 --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
P G+ + L + L+ GY+TH +GKWH+ + + P RGFD
Sbjct: 124 FDPGFPGLRMELADDALSLAEILRANGYATHAVGKWHLARDTNLAPGRTRDSWPLQRGFD 183
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
++ G G +S + + + ++ ++E Y S Y+TD TD++V IKS
Sbjct: 184 SYYGSLEGL----NSFYYPNELISDNSVVDVEEYP---SDYYVTDDITDKAVSRIKSLRA 236
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL +H A+H
Sbjct: 237 HDADKPFFLYFSHIAMH 253
>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
Length = 431
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 24/163 (14%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------Y 139
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 19 CVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHER 78
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
T D+++ T A+ R+ E A + Y T+ FT+++ +I +H +PLFL +
Sbjct: 79 CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALITNHPPEKPLFLYLA 135
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP EE + + I + +R +A
Sbjct: 136 LQSVHEP-----------LQVP--EEYLKPYNFIQDKNRHYYA 165
>gi|395527008|ref|XP_003765645.1| PREDICTED: steryl-sulfatase [Sarcophilus harrisii]
Length = 585
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 67/124 (54%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D G +G + TPNID +A G+ +H P CTPSRAAFLTG+YP R G+ +
Sbjct: 38 GIGDPGCYGNTTLRTPNIDRIAKGGVKFTQHLAASPLCTPSRAAFLTGRYPVRSGMASRS 97
Query: 82 GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--CNKEELL---PFNRGFD 130
GV + +P E + LK GYST LIGKWH+G CN + P N GFD
Sbjct: 98 KVGVFLFSASSGGLPANEITFAKLLKNQGYSTALIGKWHLGINCNSRDDFCHHPLNHGFD 157
Query: 131 NHVG 134
+ G
Sbjct: 158 HFYG 161
>gi|348516447|ref|XP_003445750.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Oreochromis
niloticus]
Length = 525
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 97/212 (45%), Gaps = 42/212 (19%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G G+ TPN+DA+A G++L YT P C+PSRAA LTG+ P R G T
Sbjct: 42 GWGDLGVFGQPSKETPNLDAMAAEGMLLPNFYTANPLCSPSRAALLTGRLPIRNGFYTTN 101
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ + E LLPQ LK GY + ++GKWH+G ++ + LP GFD
Sbjct: 102 AHARNAYTPQEIVGGISKDEILLPQLLKTKGYVSKIVGKWHLG-HRPQYLPLKNGFDEWF 160
Query: 134 GYWNGYL-TYNDSIHETDFAVGLDARRNMERY-APQMSSKYLTDFFTDQSV--------- 182
G N + YND ++ N+ Y +M ++ DF D++
Sbjct: 161 GSPNCHFGPYNDQ-----------SKPNIPVYNNSEMLGRFYEDFKIDRNTGESNLTQIY 209
Query: 183 ------HVIKSHNHSRPLFL----QITHAAVH 204
+++ +P FL THA V+
Sbjct: 210 LMEGLDFILRQTKAQQPFFLYWAVDATHAPVY 241
>gi|196229912|ref|ZP_03128776.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196226238|gb|EDY20744.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 588
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/222 (31%), Positives = 103/222 (46%), Gaps = 34/222 (15%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
Q W D+ +G ++ TPNID+LA G +L+ + P C+P+RA FLTG+Y R G+
Sbjct: 37 QAWGDLSINGNTNLSTPNIDSLATTGALLDHFFVCPVCSPTRAEFLTGRYHLRGGVHG-- 94
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYL- 140
+ + + + E+ + + K GY+T GKWH G + P RGFD + G+ +G+
Sbjct: 95 VSSGGERLNLDERTIAEAFKAAGYATGAFGKWHNGM-QYPYHPNARGFDEYYGFCSGHWG 153
Query: 141 TYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITH 200
Y D+ E + + S +L D FT ++ I+ N RP F +
Sbjct: 154 DYFDAPIEHNGQI-------------VQSHGFLIDDFTQHAMDFIE-QNKDRPFFCYVPF 199
Query: 201 AAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
HT LQVP DR F SN D +L A
Sbjct: 200 NTPHTP-----------LQVP-----DRWFDKFSNMDLKLRA 225
>gi|147906969|ref|NP_001086084.1| arylsulfatase D precursor [Xenopus laevis]
gi|49257838|gb|AAH74170.1| MGC81982 protein [Xenopus laevis]
Length = 586
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 51/117 (43%), Positives = 67/117 (57%), Gaps = 7/117 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + I TPNID LA G+ L +H + P CTPSRAAF+TG+YP R G++
Sbjct: 40 GIGDIGCYGNDTIRTPNIDRLAKEGLKLKQHISAAPLCTPSRAAFVTGRYPIRSGMELGS 99
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNH 132
G AG + +P E L++ GYST LIGKWH+G N F +NH
Sbjct: 100 GGRIIFWAGSSAGLPPNETTFATILQQQGYSTGLIGKWHLGVNCASRNDFCHHPNNH 156
>gi|336429765|ref|ZP_08609725.1| hypothetical protein HMPREF0994_05731 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336002095|gb|EGN32220.1| hypothetical protein HMPREF0994_05731 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 472
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 92/199 (46%), Gaps = 31/199 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI---- 77
GW D+G G TPNID + G+ + Y P C+PSRA+FL+G+YP R G+
Sbjct: 15 GWRDLGCSGSTFYETPNIDQMCREGMRFDCAYAACPVCSPSRASFLSGQYPARIGVTDWI 74
Query: 78 ----------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR 127
+ A K +P + + L+ GY T +GKWH+G LP N
Sbjct: 75 DESGTFHPLKGKLIDAPYLKHMPENTITVAERLRNAGYQTWHVGKWHLGGGN--YLPENF 132
Query: 128 GFDNHVG--YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI 185
GFD ++G W G+ +Y G + ++ +YLTD TD+++ +I
Sbjct: 133 GFDVNIGGCEW-GHPSY-----------GYFSPYHIPTLEDGPEGEYLTDRLTDEAIDLI 180
Query: 186 KSHNHSRPLFLQITHAAVH 204
+ +P FL H AVH
Sbjct: 181 RKAPDDKPFFLNFCHYAVH 199
>gi|449275706|gb|EMC84474.1| Steryl-sulfatase, partial [Columba livia]
Length = 552
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 67/124 (54%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + TPNID LA G+ L +H P CTPSRAAFLTG+YP R G+
Sbjct: 14 GIGDLGCYGNRTLRTPNIDRLAEEGVTLTQHIAASPLCTPSRAAFLTGRYPIRSGMAAFS 73
Query: 82 GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
GV + +P E + LK+ GY+T LIGKWH+G N E P + GFD
Sbjct: 74 RVGVFLFSASSGGLPSEEITFTKLLKQRGYATALIGKWHLGMNCESSNDFCHHPLSHGFD 133
Query: 131 NHVG 134
G
Sbjct: 134 YFYG 137
>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 460
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 71/232 (30%), Positives = 112/232 (48%), Gaps = 22/232 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTP 80
QG NDVG +G ++IPTP+ID LA G++ ++Y+ CTPSR LTG+ P R D
Sbjct: 38 QGINDVGCYG-SEIPTPHIDQLAKEGLLFRQYYSASAICTPSRFGILTGRNPTR-SQDQL 95
Query: 81 VGAGV-------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+GA + + + E + L++ GY T L+GKWH+G E LP GFD
Sbjct: 96 LGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGHGTESFLPTAHGFDLFR 155
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHS-R 192
G+ G + Y + D N + + Y TD T+++ H +K + +
Sbjct: 156 GHTGGCIDY----FTMTYGNIPDWYHNQRHVS---ENGYATDLITEEAEHFLKDQQTTDK 208
Query: 193 PLFLQITHAAVHTGTAGN--AKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
P FL +++ A H G + + P ++Q ++ + I + RR FA
Sbjct: 209 PFFLFLSYNAPHFGKGWSPGDQSPVNIMQA--RGDDLKRVGTIKDKVRREFA 258
>gi|229587773|ref|YP_002869892.1| arylsulfatase [Pseudomonas fluorescens SBW25]
gi|229359639|emb|CAY46482.1| arylsulfatase (ec 3.1.6.1) (aryl-sulfate sulphohydrolase)
[Pseudomonas fluorescens SBW25]
Length = 536
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 103/201 (51%), Gaps = 24/201 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G++D+G G +I TP++DALA NG+ L +T PTC+P+R+ LTG GI T
Sbjct: 16 GFSDLGAFG-GEISTPHLDALALNGLRLTDFHTAPTCSPTRSMLLTGTDHHIAGIGTMAE 74
Query: 83 AGVAKAVP-------VTEKL--LPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFD--- 130
A + + + +K+ LP+ L+E GY T + GKWH+G EL P RGF+
Sbjct: 75 ALTPELIGKPGYEGYLNDKVVALPELLREAGYQTLMSGKWHLGLTA-ELAPHARGFERSF 133
Query: 131 -------NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVH 183
NH G+ Y + + ++ A+ ++ R +E+ Y +D F D+ +H
Sbjct: 134 SLLPGAANHYGFEPTYDEHTPGLLKSTPALYIEDDRFVEQLPKDF---YSSDAFGDKLLH 190
Query: 184 VIKSHNHSRPLFLQITHAAVH 204
+K + +RP F + +A H
Sbjct: 191 YLKERDQARPFFAYLPFSAPH 211
>gi|348549768|ref|XP_003460705.1| PREDICTED: arylsulfatase E-like, partial [Cavia porcellus]
Length = 613
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 78/151 (51%), Gaps = 12/151 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + TPNID LA +G+ L H + CTPSRAAFLTG+YP R G+ +
Sbjct: 73 GIGDLGCYGNGTLRTPNIDRLAEHGVKLTHHIAAASVCTPSRAAFLTGRYPIRSGMVSYN 132
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELL-----PFNRGFD 130
G GV +P +E + LK+ GY+T LIGKWH+G N E P + GFD
Sbjct: 133 GYRVLQWTGVPGGLPASEVTFAKLLKDSGYTTGLIGKWHLGLNCETSSDHCHHPLSHGFD 192
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNM 161
+ G + ++ VGL R +
Sbjct: 193 HFYGMPFSMMADCQQWALSERRVGLQNRLRL 223
>gi|149638294|ref|XP_001514413.1| PREDICTED: arylsulfatase D-like [Ornithorhynchus anatinus]
Length = 576
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 53/124 (42%), Positives = 69/124 (55%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G DVG +G + + TPNID LA G+ L +H P CTPSRAAFLTG++ FR G++
Sbjct: 32 GIGDVGCYGNDTLRTPNIDRLAKEGVKLTQHLAAAPLCTPSRAAFLTGRHAFRSGMEASN 91
Query: 82 G------AGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
G G + +P+ E + L++ GY+T LIGKWH G N E P N GFD
Sbjct: 92 GYRALQWNGGSGGLPINETTFAKILQQQGYATGLIGKWHQGVNCESRNDSCHHPLNHGFD 151
Query: 131 NHVG 134
G
Sbjct: 152 FFYG 155
>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
Length = 418
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 58/163 (35%), Positives = 86/163 (52%), Gaps = 24/163 (14%)
Query: 88 AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG--------Y 139
VP+ EKLLPQ LKE GY+TH++GKWH+G ++E LP RGFD + GY G
Sbjct: 8 CVPLDEKLLPQLLKEAGYTTHMVGKWHLGMFRKECLPTRRGFDTYFGYLLGSEDYYSHER 67
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
T D+++ T A+ R+ E A + Y T+ FT+++ +I +H +PLFL +
Sbjct: 68 CTLIDALNVTRCALDF---RDGEEVATGYKNMYSTNIFTERATALIANHPPEKPLFLYLA 124
Query: 200 HAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+VH LQVP EE + + I + +R +A
Sbjct: 125 LQSVHEP-----------LQVP--EEYLKPYKFIQDKNRHHYA 154
>gi|226362295|ref|YP_002780073.1| arylsulfatase [Rhodococcus opacus B4]
gi|226240780|dbj|BAH51128.1| arylsulfatase [Rhodococcus opacus B4]
Length = 787
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 63/197 (31%), Positives = 103/197 (52%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++D+G G ++I TPN++ LA +G L+ ++T C+P+RAA LTG P R G +
Sbjct: 62 GYSDIGPFG-SEIDTPNLNRLADSGYRLSNYHTTSVCSPARAALLTGLNPHRAGYGSVAN 120
Query: 80 --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
P G+ + L + L+ GY+TH +GKWH+ + + P RGFD
Sbjct: 121 FDPGFPGLRMELADDALSLAEILRANGYATHAVGKWHLARDTNLAPGRTRDSWPLQRGFD 180
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
++ G G +S + + + ++ ++E Y S Y+TD TD++V IKS
Sbjct: 181 SYYGSLEGL----NSFYYPNELISDNSVVDVEEYP---SDYYVTDDITDKAVARIKSLRA 233
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL +H A+H
Sbjct: 234 HDADKPFFLYFSHIAMH 250
>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 75/277 (27%), Positives = 112/277 (40%), Gaps = 74/277 (26%)
Query: 7 AGVAKAVPVTEK-------LLPQGWNDVGFHGE---NDIPTPNIDALAYNGIVLNRHYTL 56
AG+ PV +K + GW +VG+H ++ TPNID L G+ L++HY
Sbjct: 11 AGLVAGQPVRQKPHIVLMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQGLELDQHYAY 70
Query: 57 PTCTPSRAAFLTGKYPF----------RYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYS 106
C+PSR+ ++G+ P Y + PV A+P + + +KE GY+
Sbjct: 71 KFCSPSRSCLMSGRLPIHVNDLNLAPTNYNPNDPVSG--FSAIPRNMTGIAEKMKEAGYA 128
Query: 107 THLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAP 166
TH +GKW G + P RGFD GY++ H+ D+ E P
Sbjct: 129 THQVGKWDAGMATPDHTPKGRGFDTSFGYYH---------HDNDYYT--------EVVGP 171
Query: 167 QMS----------------------SKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVH 204
Q S KY F ++ + V+ H+ + PLFL H
Sbjct: 172 QCSGSPIVDLWDTDHPAHGINGTGPDKYEEGLFKERLMDVVSKHDPNTPLFLYYAPHIAH 231
Query: 205 TGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLF 241
T LQVPD N F+ I + DR+ +
Sbjct: 232 T-----------PLQVPDDYLN--KFSFIDDSDRKYY 255
>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 461
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/189 (32%), Positives = 91/189 (48%), Gaps = 16/189 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+ G D+ TP+ID+L G+ Y + C+PSRAA LTG+YP R G+ +
Sbjct: 47 GYGDLSCFGSTDLKTPHIDSLIGAGMRFTNFYANSSVCSPSRAALLTGRYPERVGVPGVI 106
Query: 82 GAGVAKA---VPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
V + + + LLP YL++ GY + IGKWH+G LP RGF G G
Sbjct: 107 RDEVQDSWGYLASSATLLPTYLRKQGYHSANIGKWHLGLESPN-LPNERGFQEFYGLLEG 165
Query: 139 YLTYNDSIHETDFAVGLDARRNMERYAPQMSSK--YLTDFFTDQSVHVIKSHNHSR-PLF 195
+ D+ V L +N R+ Q+ + TD FTD +V + + P F
Sbjct: 166 MM--------DDYVVKLRHGQNFLRHNGQVIDPPGHATDVFTDAAVRYLNDRKAKKDPFF 217
Query: 196 LQITHAAVH 204
L + + A H
Sbjct: 218 LYLAYTAPH 226
>gi|424860530|ref|ZP_18284476.1| arylsulfatase [Rhodococcus opacus PD630]
gi|356659002|gb|EHI39366.1| arylsulfatase [Rhodococcus opacus PD630]
Length = 790
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 103/197 (52%), Gaps = 23/197 (11%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT--- 79
G++D+G G ++I TPN++ LA +G L+ ++T C+P+RAA LTG P R G +
Sbjct: 65 GYSDIGPFG-SEIDTPNLNRLADSGYRLSNYHTTSVCSPARAALLTGLNPHRAGYGSVAN 123
Query: 80 --PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCN-------KEELLPFNRGFD 130
P G+ + L + L+ GY+TH +GKWH+ + + P RGFD
Sbjct: 124 FDPGFPGLRMELADDALSLAEILRANGYATHAVGKWHLARDTNLAPGRTRDSWPLQRGFD 183
Query: 131 NHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS--- 187
++ G G +S + + + ++ ++E Y S Y+TD TD+++ IKS
Sbjct: 184 SYYGSLEGL----NSFYYPNELISDNSVVDVEEYP---SDYYVTDDITDKAISRIKSLRA 236
Query: 188 HNHSRPLFLQITHAAVH 204
H+ +P FL +H A+H
Sbjct: 237 HDADKPFFLYFSHIAMH 253
>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 454
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 62/196 (31%), Positives = 82/196 (41%), Gaps = 38/196 (19%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP-TCTPSRAAFLTGKYPFRYGIDTPV 81
GW DVGF+G + TPN+D LA G R YT C PSRAA +TG+Y G+
Sbjct: 51 GWGDVGFNGRTEWATPNLDRLAARGTTFKRFYTAAVVCAPSRAALMTGRYTIHDGVSRN- 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIG----CNKEELLPFNRGFDNHVGYWN 137
+P E L + K GY T L GKWH G +K + P ++GFD G+
Sbjct: 110 ----NDDLPAREVTLAEAFKTHGYDTALFGKWHHGQPRDGSKTYVHPMDQGFDEFFGF-- 163
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYAPQM--------SSKYLTDFFTDQSVHVIKSHN 189
DA+ E+Y Q+ S Y D F D ++ +K H
Sbjct: 164 -----------------TDAKHAWEKYPEQLWHGRELKPVSGYSDDMFADHAIDFLKRHK 206
Query: 190 HS-RPLFLQITHAAVH 204
P FL + H
Sbjct: 207 EKPTPFFLYVPFINTH 222
>gi|449138001|ref|ZP_21773306.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448883380|gb|EMB13908.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 470
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 103/200 (51%), Gaps = 19/200 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ND+G +G +I TPN+D LA G Y+ C+PSRAA LTG YP R G+
Sbjct: 38 QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 97
Query: 81 VGAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
V + + E + +LK GY+T +GKWH+G +K E LP + GFD++ G Y N
Sbjct: 98 VLFPQSNYGLHPEEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 156
Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
G ++ +D + AV L ++ E + + +T +TD+++
Sbjct: 157 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTITRRYTDRAIEF 216
Query: 185 IKSHNHSRPLFLQITHAAVH 204
+++ N +P FL + H+ H
Sbjct: 217 VEA-NQDKPFFLYLPHSMPH 235
>gi|326798263|ref|YP_004316082.1| sulfatase [Sphingobacterium sp. 21]
gi|326549027|gb|ADZ77412.1| sulfatase [Sphingobacterium sp. 21]
Length = 559
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 61/198 (30%), Positives = 94/198 (47%), Gaps = 24/198 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++D+G +G +I TP++D+LA +G+ + Y C PSRA+ +TG YP + I
Sbjct: 49 GYSDLGCYG-GEIQTPHLDSLAASGLRFTQFYNAARCCPSRASLMTGLYPHQAAIGHMTN 107
Query: 78 --------DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF 129
D V + P T L + LK GY+T + GKWH+G ++E P RGF
Sbjct: 108 PSEHFTQHDYHVPGYRGELSPQTHTL-AEVLKTAGYTTLMTGKWHLGMERKEQWPLQRGF 166
Query: 130 DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVI---K 186
D++ G +G Y + D + + + Y TD FTD ++ I K
Sbjct: 167 DHYYGILDGASNYFQPAQPRGITLDNDTLKVDD------PNFYTTDAFTDHAIQFIDQSK 220
Query: 187 SHNHSRPLFLQITHAAVH 204
+ RP FL + + A H
Sbjct: 221 QQDGERPFFLYLAYTAPH 238
>gi|443709810|gb|ELU04315.1| hypothetical protein CAPTEDRAFT_117141 [Capitella teleta]
Length = 562
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 72/118 (61%), Gaps = 6/118 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G G + + TP++D++ NG+ L+ + CTPSRAA +T +Y R G+ + +
Sbjct: 34 GIGDIGAFGNDTLRTPHVDSICENGVKLDHDLAAASLCTPSRAALMTSRYAIRTGMSSVI 93
Query: 82 GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL----LPFNRGFDNHVG 134
+ ++ + +P +E LPQ L+E GY+T LIGKWH+G N++ L P RGFD G
Sbjct: 94 TSLMSPQGLPTSEHTLPQMLQEKGYATALIGKWHLGWNRQLLDQYYSPLKRGFDYFFG 151
>gi|223936836|ref|ZP_03628745.1| sulfatase [bacterium Ellin514]
gi|223894405|gb|EEF60857.1| sulfatase [bacterium Ellin514]
Length = 477
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 68/198 (34%), Positives = 95/198 (47%), Gaps = 37/198 (18%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ DV +G TPNID LA +GI +T P C P+RA+ ++G+Y R G+ T V
Sbjct: 34 GYTDVACYGSKYYETPNIDKLAKDGIKFTDGHTCGPNCQPTRASLMSGQYGPRTGVYT-V 92
Query: 82 GA---------------GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
G+ V K +P+ + L Q LK+ GY+T + GKWH+G +KE P
Sbjct: 93 GSIDRFAWQTRSLHPVENVTK-LPLDKITLAQSLKKAGYATGMFGKWHLGEDKEH-HPAQ 150
Query: 127 RGFDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIK 186
RGFD + V D N + P+ +YL DF TD+++ IK
Sbjct: 151 RGFDEAL---------------VSMGVHFDFVTNPKVDYPK--DEYLADFLTDKALDFIK 193
Query: 187 SHNHSRPLFLQITHAAVH 204
H P FL + H AVH
Sbjct: 194 RHK-DEPFFLYLPHYAVH 210
>gi|443716273|gb|ELU07881.1| hypothetical protein CAPTEDRAFT_43570, partial [Capitella teleta]
Length = 492
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 72/118 (61%), Gaps = 6/118 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G G + + TP++D++ NG+ L+ + CTPSRAA +T +Y R G+ + +
Sbjct: 14 GIGDIGAFGNDTLRTPHVDSICENGVKLDHDLAAASLCTPSRAALMTSRYAIRTGMSSVI 73
Query: 82 GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL----LPFNRGFDNHVG 134
+ ++ + +P +E LPQ L+E GY+T LIGKWH+G N++ L P RGFD G
Sbjct: 74 TSLMSPQGLPTSEHTLPQMLQEKGYATALIGKWHLGWNRQLLDQYYSPLKRGFDYFFG 131
>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
[Ciona intestinalis]
Length = 513
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 65/203 (32%), Positives = 98/203 (48%), Gaps = 26/203 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G +G+ TPN+D +A G + Y+ P C+PSRAA LTG+ P R G T
Sbjct: 32 GWGDLGINGQPSKETPNLDNMAKEGTLFTDFYSANPLCSPSRAALLTGRLPIRNGFYTSN 91
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGF---- 129
G + +P E L+ + L GY+ LIGKWH+G +E+ LP GF
Sbjct: 92 YHGHNGYTPQHIVGGIPDHEILVSELLSSAGYTNKLIGKWHLG-QQEQYLPLKHGFHEWF 150
Query: 130 ---DNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYL---TDFFTDQSVH 183
+ H G ++ T N ++ VG R E +A + S KYL T ++ +++
Sbjct: 151 GSPNCHFGPYDDKTTPNIPVYNNTEMVG----RYYEEFAIE-SHKYLSNMTQYYIQEALD 205
Query: 184 VI-KSHNHSRPLFLQITHAAVHT 205
I + + +P FL A H+
Sbjct: 206 FIERMERNEKPFFLYWAPDATHS 228
>gi|419117362|ref|ZP_13662369.1| sulfatase family protein [Escherichia coli DEC5A]
gi|419134020|ref|ZP_13678843.1| sulfatase family protein [Escherichia coli DEC5D]
gi|377957343|gb|EHV20878.1| sulfatase family protein [Escherichia coli DEC5A]
gi|377970376|gb|EHV33738.1| sulfatase family protein [Escherichia coli DEC5D]
Length = 531
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/168 (36%), Positives = 86/168 (51%), Gaps = 15/168 (8%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 77 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF---- 190
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDF-FTDQSVHVIK 186
+S+ + + DA N E S+Y+ F+ VH ++
Sbjct: 191 ----NSVSDM-YTEWRDAHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 233
>gi|291285214|ref|YP_003502032.1| Arylsulfatase precursor [Escherichia coli O55:H7 str. CB9615]
gi|387509248|ref|YP_006161504.1| arylsulfatase [Escherichia coli O55:H7 str. RM12579]
gi|419128593|ref|ZP_13673461.1| sulfatase family protein [Escherichia coli DEC5C]
gi|419139161|ref|ZP_13683950.1| sulfatase family protein [Escherichia coli DEC5E]
gi|209753344|gb|ACI74979.1| HemY protein [Escherichia coli]
gi|290765087|gb|ADD59048.1| Arylsulfatase precursor [Escherichia coli O55:H7 str. CB9615]
gi|374361242|gb|AEZ42949.1| arylsulfatase [Escherichia coli O55:H7 str. RM12579]
gi|377969336|gb|EHV32714.1| sulfatase family protein [Escherichia coli DEC5C]
gi|377980212|gb|EHV43478.1| sulfatase family protein [Escherichia coli DEC5E]
Length = 551
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 61/168 (36%), Positives = 86/168 (51%), Gaps = 15/168 (8%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF---- 210
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDF-FTDQSVHVIK 186
+S+ + + DA N E S+Y+ F+ VH ++
Sbjct: 211 ----NSVSDM-YTEWRDAHVNPEVALSPDRSEYIKQLPFSKDDVHAVR 253
>gi|171910115|ref|ZP_02925585.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 460
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 92/201 (45%), Gaps = 25/201 (12%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP-TCTPSRAAFLTGKYPFR---YGID 78
G+ D+G +G I TP++D +A G+ Y CTPSRAA LTG+YP R YG
Sbjct: 41 GYGDLGCYGSPTIATPHLDQMAAEGLRFTDFYVASEVCTPSRAALLTGRYPVRSGMYGKR 100
Query: 79 TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+ +P E LP+ LK GY+T +GKWH+G + E P ++GFD G
Sbjct: 101 RVLFPNSTGGLPAGEITLPEALKARGYATAHVGKWHLGIH-EGSRPLDQGFDQSFG---- 155
Query: 139 YLTY-NDSIHETDFAVGLDAR-------------RNMERYAPQMSSKYLTDFFTDQSVHV 184
L Y ND D G RN E +LT +T+++V
Sbjct: 156 -LPYSNDMDARPDLPKGSTGSPTPPIDGWNVPLLRNGEVVEKPADQVHLTGHYTEEAVKF 214
Query: 185 IKSHNHSRPLFLQITHAAVHT 205
I+ S+P FL + H+ H
Sbjct: 215 IQ-QKKSQPFFLYMAHSFPHV 234
>gi|440713850|ref|ZP_20894444.1| arylsulfatase [Rhodopirellula baltica SWK14]
gi|436441359|gb|ELP34602.1| arylsulfatase [Rhodopirellula baltica SWK14]
Length = 1571
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 97/191 (50%), Gaps = 17/191 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++D+G +G +I TPNIDALA +G+ L + Y C PSRA+ +TG YP + GI
Sbjct: 25 GYSDLGCYG-GEISTPNIDALAADGVKLTQVYNSARCCPSRASLMTGLYPTQAGIGDFTT 83
Query: 78 ---DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+ G G + + + LK GY + +GKWH+ + P RGFD+ G
Sbjct: 84 REPNRTRGQGYLGRLRDDCVTMAEVLKPEGYGCYYVGKWHM---HPKTGPIKRGFDDFYG 140
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRP 193
Y N ++ ++ D+ + L R ++ P Y TD F D ++ I+ + ++P
Sbjct: 141 YTN---DHSHDQYDADYYIRLPENR-VKEIDPPADQFYATDVFNDYAIEFIRQGQSTNKP 196
Query: 194 LFLQITHAAVH 204
FL + H++ H
Sbjct: 197 WFLFLGHSSPH 207
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 32/92 (34%), Positives = 53/92 (57%), Gaps = 6/92 (6%)
Query: 25 NDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPVGA 83
+D+ +G +PTPN++ LA G+V + Y T+ +C+PSR + +TG+YP G
Sbjct: 721 DDLSVYGNAFVPTPNLERLASKGLVFDNAYLTISSCSPSRCSMITGRYPHNTG-----AP 775
Query: 84 GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
+ +P T++ Q L+E GY T + GK H+
Sbjct: 776 ELHTTLPETQRTFVQSLREAGYHTVISGKNHM 807
>gi|114799529|ref|YP_761144.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114739703|gb|ABI77828.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 459
Score = 90.9 bits (224), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 47/113 (41%), Positives = 63/113 (55%), Gaps = 2/113 (1%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLP-TCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+ +G I TPNID + GI L Y C+PSRAA LTG+YP R G+ +
Sbjct: 50 GWGDISLNGAALIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAALLTGRYPIRSGMQHVI 109
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+P E + + LK GY T ++GKWH+G ++EE P N+GFD G
Sbjct: 110 FPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLG-HQEEYWPTNQGFDWFYG 161
>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 491
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 71/238 (29%), Positives = 112/238 (47%), Gaps = 36/238 (15%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGID--- 78
G++D+G+ G +I +P ID LA NG++ N + T P C PSRA +TG++ R+G++
Sbjct: 37 GYSDLGYTGSTEIESPVIDKLANNGVIFANGYVTHPYCGPSRAGLITGRHQARFGMEINA 96
Query: 79 --TPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYW 136
+P + +PV E + L+ GY T +IGKWH+G + P NRGFD G+
Sbjct: 97 TYSPFDQHM--GLPVDEPTFAKRLQPAGYRTGIIGKWHLGA-APQFHPNNRGFDYFYGFL 153
Query: 137 NGYLTYNDSIHETDFAVGLDARR-----NMERYAPQMSSK-------YLTDFFTDQSVHV 184
+G Y T + L + N P + +K YLT + +
Sbjct: 154 SGGHDYFPESVNTHLELVLPNGKPNYGANEGTLLPLLRNKNAAEFDDYLTTALSKDAARF 213
Query: 185 IKSHNHSRPLFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
+ S +P L + + A HT LQ P +E ++HI +P RR++A
Sbjct: 214 VTS--SEQPFCLYLAYNAPHTP-----------LQAP--KETIAKYSHIKDPKRRIYA 256
>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
Length = 454
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 62/187 (33%), Positives = 90/187 (48%), Gaps = 11/187 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRY--GIDT 79
GW D+G +G +D TPN+D LA G+ + Y C+ +R A +TG+Y +R G++
Sbjct: 23 GWADLGVYGASDFATPNLDRLAAQGVRFTQAYANSAVCSATRIALITGRYQYRLPAGLEE 82
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
P+ A +P LP L+E GY T LIGKWH+G P G+D G G
Sbjct: 83 PI-ARSDIGLPPEHPTLPSLLREAGYDTALIGKWHLG-KPPTYGPLKSGYDRFFGNIGGA 140
Query: 140 LTYNDSIHETDFAVGLDARRNM-ERYAPQMSSKYLTDFFTDQ-SVHVIKSHNHSRPLFLQ 197
L Y H+ VG R++ E P + Y T+ D+ S +V + +P FL
Sbjct: 141 LDY--FTHKP--GVGAQVPRDLWEGDVPVERTGYYTNILGDEASAYVRAREDEKKPFFLS 196
Query: 198 ITHAAVH 204
+ A H
Sbjct: 197 LHFTAPH 203
>gi|82779021|ref|YP_405370.1| arylsulfatase [Shigella dysenteriae Sd197]
gi|81243169|gb|ABB63879.1| arylsulfatase [Shigella dysenteriae Sd197]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 494
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 73/229 (31%), Positives = 104/229 (45%), Gaps = 18/229 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ DVGF G +I TP +D LA G++ N Y T C PSRA +TG+Y R+G++
Sbjct: 34 GYADVGFTGSTEIQTPVLDRLAAGGVIFNNGYVTHAYCGPSRAGLITGRYQARFGVEVNF 93
Query: 82 GAGVA---KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG 138
+P EK LK+ GY T +IGKWH+G P NRGFD G+ G
Sbjct: 94 PYAPFDPHSGLPTDEKTFATRLKQSGYRTAMIGKWHLGA-AYPYHPNNRGFDYFYGFLGG 152
Query: 139 YLTYNDSIHETDFAVGLDARR-----NMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRP 193
Y T + L+ + N Y P M + +F D+ + S + +R
Sbjct: 153 AHDYMPENTSTTVPLTLENGKVNHMANAGSYLPLMRNNVNAEF--DEYLTTALSRDAAR- 209
Query: 194 LFLQITHAAVHTGTAGNAKLPTGLLQVPDMEENDRTFAHISNPDRRLFA 242
F++ T + NA P LQ P + +AHI + RR +A
Sbjct: 210 -FIEKTEGPFCVYLSYNA--PHTPLQAP--KALIEKYAHIESQKRRTYA 253
>gi|16131653|ref|NP_418245.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
MG1655]
gi|170083285|ref|YP_001732605.1| acrylsulfatase-like protein [Escherichia coli str. K-12 substr.
DH10B]
gi|238902878|ref|YP_002928674.1| acrylsulfatase-like enzyme [Escherichia coli BW2952]
gi|300950438|ref|ZP_07164359.1| arylsulfatase [Escherichia coli MS 116-1]
gi|300955197|ref|ZP_07167593.1| arylsulfatase [Escherichia coli MS 175-1]
gi|386282536|ref|ZP_10060184.1| arylsulfatase [Escherichia sp. 4_1_40B]
gi|386597667|ref|YP_006094067.1| sulfatase [Escherichia coli DH1]
gi|387623453|ref|YP_006131081.1| acrylsulfatase-like protein [Escherichia coli DH1]
gi|388479449|ref|YP_491641.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
W3110]
gi|417265077|ref|ZP_12052456.1| arylsulfatase [Escherichia coli 2.3916]
gi|417279133|ref|ZP_12066443.1| arylsulfatase [Escherichia coli 3.2303]
gi|417294264|ref|ZP_12081543.1| arylsulfatase [Escherichia coli B41]
gi|417636751|ref|ZP_12286956.1| arylsulfatase [Escherichia coli STEC_S1191]
gi|417945692|ref|ZP_12588922.1| arylsulfatase [Escherichia coli XH140A]
gi|417977667|ref|ZP_12618448.1| arylsulfatase [Escherichia coli XH001]
gi|418305431|ref|ZP_12917225.1| arylsulfatase [Escherichia coli UMNF18]
gi|419150829|ref|ZP_13695474.1| sulfatase family protein [Escherichia coli DEC6B]
gi|419938621|ref|ZP_14455447.1| arylsulfatase [Escherichia coli 75]
gi|422818955|ref|ZP_16867167.1| arylsulfatase [Escherichia coli M919]
gi|423703323|ref|ZP_17677755.1| arylsulfatase [Escherichia coli H730]
gi|432629424|ref|ZP_19865388.1| arylsulfatase [Escherichia coli KTE77]
gi|432663050|ref|ZP_19898677.1| arylsulfatase [Escherichia coli KTE111]
gi|432687632|ref|ZP_19922919.1| arylsulfatase [Escherichia coli KTE156]
gi|432689129|ref|ZP_19924394.1| arylsulfatase [Escherichia coli KTE161]
gi|432739299|ref|ZP_19974026.1| arylsulfatase [Escherichia coli KTE42]
gi|432878167|ref|ZP_20095616.1| arylsulfatase [Escherichia coli KTE154]
gi|433050271|ref|ZP_20237590.1| arylsulfatase [Escherichia coli KTE120]
gi|442591326|ref|ZP_21009811.1| Arylsulfatase [Escherichia coli O10:K5(L):H4 str. ATCC 23506]
gi|450252901|ref|ZP_21902275.1| arylsulfatase [Escherichia coli S17]
gi|114256|sp|P25549.2|ASLA_ECOLI RecName: Full=Arylsulfatase; Short=AS; AltName: Full=Aryl-sulfate
sulphohydrolase; Flags: Precursor
gi|148200|gb|AAA67597.1| unknown [Escherichia coli str. K-12 substr. MG1655]
gi|1790233|gb|AAC76804.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
MG1655]
gi|85676250|dbj|BAE77500.1| acrylsulfatase-like enzyme [Escherichia coli str. K12 substr.
W3110]
gi|169891120|gb|ACB04827.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
DH10B]
gi|238859923|gb|ACR61921.1| acrylsulfatase-like enzyme [Escherichia coli BW2952]
gi|260451356|gb|ACX41778.1| sulfatase [Escherichia coli DH1]
gi|300317879|gb|EFJ67663.1| arylsulfatase [Escherichia coli MS 175-1]
gi|300450227|gb|EFK13847.1| arylsulfatase [Escherichia coli MS 116-1]
gi|315138377|dbj|BAJ45536.1| acrylsulfatase-like enzyme [Escherichia coli DH1]
gi|339417529|gb|AEJ59201.1| arylsulfatase [Escherichia coli UMNF18]
gi|342362592|gb|EGU26709.1| arylsulfatase [Escherichia coli XH140A]
gi|344192660|gb|EGV46749.1| arylsulfatase [Escherichia coli XH001]
gi|345384819|gb|EGX14677.1| arylsulfatase [Escherichia coli STEC_S1191]
gi|359333933|dbj|BAL40380.1| acrylsulfatase-like enzyme [Escherichia coli str. K-12 substr.
MDS42]
gi|377988755|gb|EHV51930.1| sulfatase family protein [Escherichia coli DEC6B]
gi|385537513|gb|EIF84384.1| arylsulfatase [Escherichia coli M919]
gi|385708462|gb|EIG45474.1| arylsulfatase [Escherichia coli H730]
gi|386120386|gb|EIG69015.1| arylsulfatase [Escherichia sp. 4_1_40B]
gi|386221259|gb|EII43703.1| arylsulfatase [Escherichia coli 2.3916]
gi|386237910|gb|EII74850.1| arylsulfatase [Escherichia coli 3.2303]
gi|386252452|gb|EIJ02144.1| arylsulfatase [Escherichia coli B41]
gi|388409969|gb|EIL70230.1| arylsulfatase [Escherichia coli 75]
gi|431160114|gb|ELE60632.1| arylsulfatase [Escherichia coli KTE77]
gi|431196490|gb|ELE95416.1| arylsulfatase [Escherichia coli KTE111]
gi|431218879|gb|ELF16304.1| arylsulfatase [Escherichia coli KTE156]
gi|431234376|gb|ELF29777.1| arylsulfatase [Escherichia coli KTE161]
gi|431278972|gb|ELF69943.1| arylsulfatase [Escherichia coli KTE42]
gi|431417407|gb|ELG99870.1| arylsulfatase [Escherichia coli KTE154]
gi|431561779|gb|ELI35141.1| arylsulfatase [Escherichia coli KTE120]
gi|441608564|emb|CCP95648.1| Arylsulfatase [Escherichia coli O10:K5(L):H4 str. ATCC 23506]
gi|449314180|gb|EMD04354.1| arylsulfatase [Escherichia coli S17]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|422836217|ref|ZP_16884265.1| arylsulfatase [Escherichia coli E101]
gi|371609566|gb|EHN98103.1| arylsulfatase [Escherichia coli E101]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|171912352|ref|ZP_02927822.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 491
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 91/197 (46%), Gaps = 16/197 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G G TPN+D +A G+ R Y P C+ SR A +TG YP R GI +
Sbjct: 45 GYGDLGCFGAKGQATPNLDRMAAEGVKFERFYVAQPVCSASRMALMTGCYPNRVGIKGAL 104
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWNGY 139
G G + E L + +K+ GY+T GKWH+G + + LP GFD ++G Y N
Sbjct: 105 GPGAKVGISKEETTLAELVKQNGYATAAFGKWHLG-DDPQFLPVRHGFDEYLGLPYSNDM 163
Query: 140 LTY-----NDSIHETDFAVG------LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH 188
Y N + + G +D R + + LT ++T ++V I +
Sbjct: 164 WPYHPELVNLTPEQRKKRRGFPALPLVDGDRIILPEVTTVEQTRLTTWYTQRAVKFINT- 222
Query: 189 NHSRPLFLQITHAAVHT 205
N +P L + H+ H
Sbjct: 223 NKDKPFLLYLAHSMPHV 239
>gi|432578071|ref|ZP_19814516.1| arylsulfatase [Escherichia coli KTE56]
gi|431111494|gb|ELE15393.1| arylsulfatase [Escherichia coli KTE56]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|416812738|ref|ZP_11890780.1| arylsulfatase [Escherichia coli O55:H7 str. 3256-97]
gi|419123311|ref|ZP_13668247.1| sulfatase family protein [Escherichia coli DEC5B]
gi|320655339|gb|EFX23281.1| arylsulfatase [Escherichia coli O55:H7 str. 3256-97 TW 07815]
gi|377960957|gb|EHV24432.1| sulfatase family protein [Escherichia coli DEC5B]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|331655483|ref|ZP_08356476.1| arylsulfatase [Escherichia coli M718]
gi|331046804|gb|EGI18888.1| arylsulfatase [Escherichia coli M718]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|301646135|ref|ZP_07246034.1| arylsulfatase [Escherichia coli MS 146-1]
gi|331644533|ref|ZP_08345653.1| arylsulfatase [Escherichia coli H736]
gi|432634707|ref|ZP_19870604.1| arylsulfatase [Escherichia coli KTE81]
gi|432706534|ref|ZP_19941627.1| arylsulfatase [Escherichia coli KTE171]
gi|301075604|gb|EFK90410.1| arylsulfatase [Escherichia coli MS 146-1]
gi|331036205|gb|EGI08440.1| arylsulfatase [Escherichia coli H736]
gi|431175847|gb|ELE75834.1| arylsulfatase [Escherichia coli KTE81]
gi|431239856|gb|ELF34322.1| arylsulfatase [Escherichia coli KTE171]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|386616609|ref|YP_006136275.1| arylsulfatase [Escherichia coli UMNK88]
gi|404377193|ref|ZP_10982332.1| arylsulfatase [Escherichia sp. 1_1_43]
gi|419177522|ref|ZP_13721328.1| sulfatase family protein [Escherichia coli DEC7B]
gi|421777517|ref|ZP_16214112.1| hypothetical protein ECAD30_36210 [Escherichia coli AD30]
gi|422769204|ref|ZP_16822925.1| sulfatase [Escherichia coli E1520]
gi|226838702|gb|EEH70730.1| arylsulfatase [Escherichia sp. 1_1_43]
gi|323934189|gb|EGB30620.1| sulfatase [Escherichia coli E1520]
gi|332345778|gb|AEE59112.1| arylsulfatase [Escherichia coli UMNK88]
gi|378028430|gb|EHV91048.1| sulfatase family protein [Escherichia coli DEC7B]
gi|408457431|gb|EKJ81227.1| hypothetical protein ECAD30_36210 [Escherichia coli AD30]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|149198127|ref|ZP_01875174.1| sulfatase family protein [Lentisphaera araneosa HTCC2155]
gi|149138729|gb|EDM27135.1| sulfatase family protein [Lentisphaera araneosa HTCC2155]
Length = 484
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 63/194 (32%), Positives = 92/194 (47%), Gaps = 17/194 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G ND+ +G TP++D LA +G+ YT P C P+R A LTGKYP R+ + P
Sbjct: 31 GVNDLSCNGSTFYETPHMDQLAADGVKFTNAYTAFPRCLPARQALLTGKYPSRFDVQ-PY 89
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDN--HVGYWNGY 139
+ +P E + LKE GY T IGKWH+G ++ P +GFD+ H G+
Sbjct: 90 ---PKQHLPFEEVTFGEALKEEGYETSYIGKWHLGHKGQD--PSKQGFDHIVHTGHAGAT 144
Query: 140 LTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
++ + ++ ++E YLTD D++ IKS +P L +
Sbjct: 145 KSFF-------YPFPVEKGHSVENPVKGKEGDYLTDILRDEACEFIKS-KADKPFLLVMA 196
Query: 200 HAAVHTGTAGNAKL 213
H AVHT G L
Sbjct: 197 HYAVHTPLEGRPDL 210
>gi|432951060|ref|ZP_20144803.1| arylsulfatase [Escherichia coli KTE197]
gi|431477526|gb|ELH57294.1| arylsulfatase [Escherichia coli KTE197]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|196230145|ref|ZP_03129008.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225742|gb|EDY20249.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 487
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 67/209 (32%), Positives = 97/209 (46%), Gaps = 34/209 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVL-NRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ DVG +G TPN D LA+ G + H C+ SRAA +TG YP R GI+ +
Sbjct: 44 GYADVGVYGAKGFETPNFDRLAHEGRRFTDFHVAQAVCSASRAAIMTGCYPNRIGIEGAM 103
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ E +PQ K GY+T ++GKWH+G E LP +RGFD W G
Sbjct: 104 EPWYKFGISDQELTMPQMFKRKGYATGMVGKWHLG-TPTEFLPTHRGFDE----WFGLPY 158
Query: 142 YNDS---------------IHETDFAV--GLDARRNMERYAPQMSSKYLTDFFTDQSVHV 184
ND ++E D + G++ R+ME+ LT +T+++V+
Sbjct: 159 SNDQWPLHPEKPGKFPPLPLYEGDKVINPGIN-HRDMEQ---------LTTQYTERAVNF 208
Query: 185 IKSHNHSRPLFLQITHAAVHTGTAGNAKL 213
I NH +P FL + H A + K
Sbjct: 209 I-DRNHDKPFFLYVAQTMPHVPLAVSDKF 236
>gi|432452061|ref|ZP_19694315.1| arylsulfatase [Escherichia coli KTE193]
gi|433035723|ref|ZP_20223410.1| arylsulfatase [Escherichia coli KTE112]
gi|430977211|gb|ELC94062.1| arylsulfatase [Escherichia coli KTE193]
gi|431545828|gb|ELI20473.1| arylsulfatase [Escherichia coli KTE112]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGLTT-LPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|301029014|ref|ZP_07192169.1| arylsulfatase [Escherichia coli MS 196-1]
gi|299878025|gb|EFI86236.1| arylsulfatase [Escherichia coli MS 196-1]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|293417264|ref|ZP_06659889.1| arylsulfatase [Escherichia coli B185]
gi|416778747|ref|ZP_11876078.1| arylsulfatase [Escherichia coli O157:H7 str. G5101]
gi|416790105|ref|ZP_11880971.1| arylsulfatase [Escherichia coli O157:H- str. 493-89]
gi|416801879|ref|ZP_11885859.1| arylsulfatase [Escherichia coli O157:H- str. H 2687]
gi|419077994|ref|ZP_13623490.1| sulfatase family protein [Escherichia coli DEC3F]
gi|420283149|ref|ZP_14785379.1| arylsulfatase [Escherichia coli TW06591]
gi|425263807|ref|ZP_18655783.1| arylsulfatase [Escherichia coli EC96038]
gi|445014625|ref|ZP_21330719.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA48]
gi|209753338|gb|ACI74976.1| HemY protein [Escherichia coli]
gi|291431032|gb|EFF04027.1| arylsulfatase [Escherichia coli B185]
gi|320639283|gb|EFX08905.1| arylsulfatase [Escherichia coli O157:H7 str. G5101]
gi|320644668|gb|EFX13718.1| arylsulfatase [Escherichia coli O157:H- str. 493-89]
gi|320649993|gb|EFX18496.1| arylsulfatase [Escherichia coli O157:H- str. H 2687]
gi|377917014|gb|EHU81083.1| sulfatase family protein [Escherichia coli DEC3F]
gi|390779048|gb|EIO46785.1| arylsulfatase [Escherichia coli TW06591]
gi|408177243|gb|EKI04058.1| arylsulfatase [Escherichia coli EC96038]
gi|444620232|gb|ELV94241.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA48]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|255653026|ref|NP_001157425.1| steryl-sulfatase precursor [Equus caballus]
Length = 578
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 53/120 (44%), Positives = 68/120 (56%), Gaps = 12/120 (10%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D G +G + TPNID LA G+ L +H P CTPSRAAF+TG+YP R G+ +
Sbjct: 33 GIGDPGCYGNKTLRTPNIDRLAEGGVKLTQHLAASPLCTPSRAAFMTGRYPIRSGMASQS 92
Query: 82 GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIG--C-NKEELL--PFNRGFD 130
GV + +P +E + LK GYST LIGKWH+G C NK + P + GFD
Sbjct: 93 KVGVFLFSASSGGLPTSEITFAKLLKNQGYSTALIGKWHLGTNCHNKTDFCHHPLSHGFD 152
>gi|387609603|ref|YP_006098459.1| arylsulfatase [Escherichia coli 042]
gi|284923903|emb|CBG37002.1| arylsulfatase [Escherichia coli 042]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|432943494|ref|ZP_20140329.1| arylsulfatase [Escherichia coli KTE196]
gi|433045335|ref|ZP_20232807.1| arylsulfatase [Escherichia coli KTE117]
gi|431466713|gb|ELH46730.1| arylsulfatase [Escherichia coli KTE196]
gi|431551968|gb|ELI25931.1| arylsulfatase [Escherichia coli KTE117]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|424818211|ref|ZP_18243362.1| arylsulfatase-like enzyme [Escherichia fergusonii ECD227]
gi|325499231|gb|EGC97090.1| arylsulfatase-like enzyme [Escherichia fergusonii ECD227]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|416833512|ref|ZP_11900392.1| arylsulfatase [Escherichia coli O157:H7 str. LSU-61]
gi|320666088|gb|EFX33102.1| arylsulfatase [Escherichia coli O157:H7 str. LSU-61]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|432866678|ref|ZP_20089015.1| arylsulfatase [Escherichia coli KTE146]
gi|431400801|gb|ELG84165.1| arylsulfatase [Escherichia coli KTE146]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|423341845|ref|ZP_17319560.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
CL02T12C29]
gi|409219938|gb|EKN12897.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
CL02T12C29]
Length = 461
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 89/194 (45%), Gaps = 27/194 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNG-IVLNRHYTLPTCTPSRAAFLTGKYPFRYG----I 77
G+ D GF G DI TPNID LA G I + H +PSR+ LTG+Y RYG +
Sbjct: 42 GYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECNL 101
Query: 78 DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNR----GFDNHV 133
D P +P E+LLP LK GY T IGKWH+G PF R GFD
Sbjct: 102 DKP-----GDGIPDDEELLPALLKRYGYRTGCIGKWHLGSK-----PFQRPNAKGFDTFY 151
Query: 134 GYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSH--NHS 191
G G+ +Y ++ + + D N+++Y +FTD+ +
Sbjct: 152 GLLAGHRSY---FYDPETS---DKDGNLQQYQYNGQKLSFDGYFTDELASKARQFVAESE 205
Query: 192 RPLFLQITHAAVHT 205
+P L ++ A H+
Sbjct: 206 QPFMLYMSFTAPHS 219
>gi|417588940|ref|ZP_12239701.1| arylsulfatase [Escherichia coli STEC_C165-02]
gi|432491631|ref|ZP_19733489.1| arylsulfatase [Escherichia coli KTE213]
gi|432841656|ref|ZP_20075110.1| arylsulfatase [Escherichia coli KTE140]
gi|433205551|ref|ZP_20389292.1| arylsulfatase [Escherichia coli KTE95]
gi|345331076|gb|EGW63537.1| arylsulfatase [Escherichia coli STEC_C165-02]
gi|431016987|gb|ELD30504.1| arylsulfatase [Escherichia coli KTE213]
gi|431384928|gb|ELG68918.1| arylsulfatase [Escherichia coli KTE140]
gi|431715513|gb|ELJ79661.1| arylsulfatase [Escherichia coli KTE95]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|366158865|ref|ZP_09458727.1| arylsulfatase [Escherichia sp. TW09308]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|331685518|ref|ZP_08386102.1| arylsulfatase [Escherichia coli H299]
gi|450195629|ref|ZP_21892583.1| arylsulfatase [Escherichia coli SEPT362]
gi|331077219|gb|EGI48433.1| arylsulfatase [Escherichia coli H299]
gi|449316170|gb|EMD06291.1| arylsulfatase [Escherichia coli SEPT362]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|416900367|ref|ZP_11929642.1| arylsulfatase [Escherichia coli STEC_7v]
gi|417116418|ref|ZP_11967279.1| arylsulfatase [Escherichia coli 1.2741]
gi|422335426|ref|ZP_16416425.1| arylsulfatase [Escherichia coli 4_1_47FAA]
gi|422784491|ref|ZP_16837271.1| sulfatase [Escherichia coli TW10509]
gi|422803393|ref|ZP_16851881.1| sulfatase [Escherichia coli M863]
gi|432768176|ref|ZP_20002565.1| arylsulfatase [Escherichia coli KTE50]
gi|432964607|ref|ZP_20153677.1| arylsulfatase [Escherichia coli KTE202]
gi|433065269|ref|ZP_20252170.1| arylsulfatase [Escherichia coli KTE125]
gi|323964045|gb|EGB59535.1| sulfatase [Escherichia coli M863]
gi|323974382|gb|EGB69510.1| sulfatase [Escherichia coli TW10509]
gi|327250650|gb|EGE62356.1| arylsulfatase [Escherichia coli STEC_7v]
gi|373243576|gb|EHP63078.1| arylsulfatase [Escherichia coli 4_1_47FAA]
gi|386138962|gb|EIG80117.1| arylsulfatase [Escherichia coli 1.2741]
gi|431321440|gb|ELG09041.1| arylsulfatase [Escherichia coli KTE50]
gi|431467324|gb|ELH47334.1| arylsulfatase [Escherichia coli KTE202]
gi|431577842|gb|ELI50465.1| arylsulfatase [Escherichia coli KTE125]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|432891395|ref|ZP_20104113.1| arylsulfatase [Escherichia coli KTE165]
gi|431429800|gb|ELH11635.1| arylsulfatase [Escherichia coli KTE165]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|432817589|ref|ZP_20051339.1| arylsulfatase [Escherichia coli KTE115]
gi|431360005|gb|ELG46626.1| arylsulfatase [Escherichia coli KTE115]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|432394470|ref|ZP_19637286.1| arylsulfatase [Escherichia coli KTE21]
gi|430913861|gb|ELC34980.1| arylsulfatase [Escherichia coli KTE21]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|331649624|ref|ZP_08350706.1| arylsulfatase [Escherichia coli M605]
gi|417664426|ref|ZP_12314005.1| arylsulfatase [Escherichia coli AA86]
gi|432399749|ref|ZP_19642522.1| arylsulfatase [Escherichia coli KTE25]
gi|432725267|ref|ZP_19960180.1| arylsulfatase [Escherichia coli KTE17]
gi|432729876|ref|ZP_19964748.1| arylsulfatase [Escherichia coli KTE18]
gi|432743565|ref|ZP_19978278.1| arylsulfatase [Escherichia coli KTE23]
gi|432988296|ref|ZP_20176975.1| arylsulfatase [Escherichia coli KTE217]
gi|433113077|ref|ZP_20298924.1| arylsulfatase [Escherichia coli KTE150]
gi|330908100|gb|EGH36619.1| arylsulfatase [Escherichia coli AA86]
gi|331041494|gb|EGI13642.1| arylsulfatase [Escherichia coli M605]
gi|430912911|gb|ELC34083.1| arylsulfatase [Escherichia coli KTE25]
gi|431262486|gb|ELF54476.1| arylsulfatase [Escherichia coli KTE17]
gi|431270646|gb|ELF61808.1| arylsulfatase [Escherichia coli KTE18]
gi|431280856|gb|ELF71765.1| arylsulfatase [Escherichia coli KTE23]
gi|431502009|gb|ELH80902.1| arylsulfatase [Escherichia coli KTE217]
gi|431624566|gb|ELI93182.1| arylsulfatase [Escherichia coli KTE150]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|432604635|ref|ZP_19840861.1| arylsulfatase [Escherichia coli KTE66]
gi|431136569|gb|ELE38427.1| arylsulfatase [Escherichia coli KTE66]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|15804389|ref|NP_290429.1| arylsulfatase [Escherichia coli O157:H7 str. EDL933]
gi|15833985|ref|NP_312758.1| arylsulfatase [Escherichia coli O157:H7 str. Sakai]
gi|168750392|ref|ZP_02775414.1| arylsulfatase [Escherichia coli O157:H7 str. EC4113]
gi|168753693|ref|ZP_02778700.1| arylsulfatase [Escherichia coli O157:H7 str. EC4401]
gi|168768077|ref|ZP_02793084.1| arylsulfatase [Escherichia coli O157:H7 str. EC4486]
gi|168775653|ref|ZP_02800660.1| arylsulfatase [Escherichia coli O157:H7 str. EC4196]
gi|168780695|ref|ZP_02805702.1| arylsulfatase [Escherichia coli O157:H7 str. EC4076]
gi|168786634|ref|ZP_02811641.1| arylsulfatase [Escherichia coli O157:H7 str. EC869]
gi|168801140|ref|ZP_02826147.1| arylsulfatase [Escherichia coli O157:H7 str. EC508]
gi|195938087|ref|ZP_03083469.1| arylsulfatase [Escherichia coli O157:H7 str. EC4024]
gi|208807165|ref|ZP_03249502.1| arylsulfatase [Escherichia coli O157:H7 str. EC4206]
gi|208812341|ref|ZP_03253670.1| arylsulfatase [Escherichia coli O157:H7 str. EC4045]
gi|208818746|ref|ZP_03259066.1| arylsulfatase [Escherichia coli O157:H7 str. EC4042]
gi|209399246|ref|YP_002273316.1| arylsulfatase [Escherichia coli O157:H7 str. EC4115]
gi|217324531|ref|ZP_03440615.1| arylsulfatase [Escherichia coli O157:H7 str. TW14588]
gi|254795796|ref|YP_003080633.1| acrylsulfatase-like enzyme [Escherichia coli O157:H7 str. TW14359]
gi|261225573|ref|ZP_05939854.1| acrylsulfatase-like enzyme [Escherichia coli O157:H7 str. FRIK2000]
gi|261255619|ref|ZP_05948152.1| acrylsulfatase-like enzyme [Escherichia coli O157:H7 str. FRIK966]
gi|387885028|ref|YP_006315330.1| arylsulfatase [Escherichia coli Xuzhou21]
gi|416307618|ref|ZP_11654659.1| Arylsulfatase [Escherichia coli O157:H7 str. 1044]
gi|416319752|ref|ZP_11662304.1| Arylsulfatase [Escherichia coli O157:H7 str. EC1212]
gi|416326910|ref|ZP_11666985.1| Arylsulfatase [Escherichia coli O157:H7 str. 1125]
gi|419043232|ref|ZP_13590209.1| sulfatase family protein [Escherichia coli DEC3A]
gi|419053675|ref|ZP_13600540.1| sulfatase family protein [Escherichia coli DEC3B]
gi|419065757|ref|ZP_13612456.1| sulfatase family protein [Escherichia coli DEC3D]
gi|419089105|ref|ZP_13634453.1| sulfatase family protein [Escherichia coli DEC4B]
gi|419094926|ref|ZP_13640200.1| sulfatase family protein [Escherichia coli DEC4C]
gi|420284129|ref|ZP_14786350.1| arylsulfatase [Escherichia coli TW10246]
gi|420289833|ref|ZP_14792003.1| arylsulfatase [Escherichia coli TW11039]
gi|420306826|ref|ZP_14808811.1| arylsulfatase [Escherichia coli TW10119]
gi|420312194|ref|ZP_14814119.1| arylsulfatase [Escherichia coli EC1738]
gi|420317848|ref|ZP_14819716.1| arylsulfatase [Escherichia coli EC1734]
gi|421826575|ref|ZP_16261927.1| arylsulfatase [Escherichia coli FRIK920]
gi|421833433|ref|ZP_16268710.1| arylsulfatase [Escherichia coli PA7]
gi|424086526|ref|ZP_17822995.1| arylsulfatase [Escherichia coli FDA517]
gi|424149999|ref|ZP_17881358.1| arylsulfatase [Escherichia coli PA15]
gi|424163724|ref|ZP_17886776.1| arylsulfatase [Escherichia coli PA24]
gi|424257376|ref|ZP_17892318.1| arylsulfatase [Escherichia coli PA25]
gi|424336064|ref|ZP_17898254.1| arylsulfatase [Escherichia coli PA28]
gi|424452330|ref|ZP_17903957.1| arylsulfatase [Escherichia coli PA32]
gi|424465025|ref|ZP_17915332.1| arylsulfatase [Escherichia coli PA39]
gi|424477749|ref|ZP_17927048.1| arylsulfatase [Escherichia coli PA42]
gi|424483530|ref|ZP_17932496.1| arylsulfatase [Escherichia coli TW07945]
gi|424489726|ref|ZP_17938248.1| arylsulfatase [Escherichia coli TW09098]
gi|424503046|ref|ZP_17949915.1| arylsulfatase [Escherichia coli EC4203]
gi|424509319|ref|ZP_17955671.1| arylsulfatase [Escherichia coli EC4196]
gi|424516725|ref|ZP_17961296.1| arylsulfatase [Escherichia coli TW14313]
gi|424522852|ref|ZP_17966941.1| arylsulfatase [Escherichia coli TW14301]
gi|424528725|ref|ZP_17972420.1| arylsulfatase [Escherichia coli EC4421]
gi|424565822|ref|ZP_18006808.1| arylsulfatase [Escherichia coli EC4437]
gi|425106679|ref|ZP_18508978.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 5.2239]
gi|425134377|ref|ZP_18535213.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 8.2524]
gi|425140970|ref|ZP_18541336.1| arylsulfatase [Escherichia coli 10.0833]
gi|425182830|ref|ZP_18580511.1| arylsulfatase [Escherichia coli FRIK1999]
gi|425214470|ref|ZP_18609857.1| arylsulfatase [Escherichia coli PA4]
gi|425220598|ref|ZP_18615545.1| arylsulfatase [Escherichia coli PA23]
gi|425227243|ref|ZP_18621694.1| arylsulfatase [Escherichia coli PA49]
gi|425233401|ref|ZP_18627425.1| arylsulfatase [Escherichia coli PA45]
gi|425239322|ref|ZP_18633027.1| arylsulfatase [Escherichia coli TT12B]
gi|425245557|ref|ZP_18638849.1| arylsulfatase [Escherichia coli MA6]
gi|425297274|ref|ZP_18687384.1| arylsulfatase [Escherichia coli PA38]
gi|425356984|ref|ZP_18743030.1| arylsulfatase [Escherichia coli EC1850]
gi|425362933|ref|ZP_18748565.1| arylsulfatase [Escherichia coli EC1856]
gi|425369198|ref|ZP_18754261.1| arylsulfatase [Escherichia coli EC1862]
gi|425395119|ref|ZP_18778210.1| arylsulfatase [Escherichia coli EC1868]
gi|425401173|ref|ZP_18783863.1| arylsulfatase [Escherichia coli EC1869]
gi|425407269|ref|ZP_18789474.1| arylsulfatase [Escherichia coli EC1870]
gi|425413627|ref|ZP_18795373.1| arylsulfatase [Escherichia coli NE098]
gi|425419942|ref|ZP_18801197.1| arylsulfatase [Escherichia coli FRIK523]
gi|425431239|ref|ZP_18811832.1| arylsulfatase [Escherichia coli 0.1304]
gi|428955719|ref|ZP_19027493.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 88.1042]
gi|428961741|ref|ZP_19033004.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 89.0511]
gi|428968345|ref|ZP_19039033.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 90.0091]
gi|428974127|ref|ZP_19044422.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 90.0039]
gi|428980562|ref|ZP_19050355.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 90.2281]
gi|428986322|ref|ZP_19055695.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 93.0055]
gi|428992434|ref|ZP_19061406.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 93.0056]
gi|428998330|ref|ZP_19066905.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 94.0618]
gi|429004718|ref|ZP_19072762.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 95.0183]
gi|429023077|ref|ZP_19089577.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 96.0428]
gi|429047250|ref|ZP_19111946.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 96.0107]
gi|444933149|ref|ZP_21252147.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.0814]
gi|444938616|ref|ZP_21257339.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.0815]
gi|444955357|ref|ZP_21273413.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.0848]
gi|444988025|ref|ZP_21304792.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA11]
gi|444998582|ref|ZP_21315071.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA13]
gi|445004127|ref|ZP_21320506.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA2]
gi|445009545|ref|ZP_21325764.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA47]
gi|445020547|ref|ZP_21336501.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA8]
gi|445031363|ref|ZP_21347018.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.1781]
gi|445061276|ref|ZP_21373782.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.0670]
gi|452967387|ref|ZP_21965614.1| arylsulfatase [Escherichia coli O157:H7 str. EC4009]
gi|12518665|gb|AAG58993.1|AE005611_3 arylsulfatase [Escherichia coli O157:H7 str. EDL933]
gi|13364207|dbj|BAB38154.1| arylsulfatase [Escherichia coli O157:H7 str. Sakai]
gi|187768864|gb|EDU32708.1| arylsulfatase [Escherichia coli O157:H7 str. EC4196]
gi|188015437|gb|EDU53559.1| arylsulfatase [Escherichia coli O157:H7 str. EC4113]
gi|189001398|gb|EDU70384.1| arylsulfatase [Escherichia coli O157:H7 str. EC4076]
gi|189359385|gb|EDU77804.1| arylsulfatase [Escherichia coli O157:H7 str. EC4401]
gi|189362713|gb|EDU81132.1| arylsulfatase [Escherichia coli O157:H7 str. EC4486]
gi|189373327|gb|EDU91743.1| arylsulfatase [Escherichia coli O157:H7 str. EC869]
gi|189376690|gb|EDU95106.1| arylsulfatase [Escherichia coli O157:H7 str. EC508]
gi|208726966|gb|EDZ76567.1| arylsulfatase [Escherichia coli O157:H7 str. EC4206]
gi|208733618|gb|EDZ82305.1| arylsulfatase [Escherichia coli O157:H7 str. EC4045]
gi|208738869|gb|EDZ86551.1| arylsulfatase [Escherichia coli O157:H7 str. EC4042]
gi|209160646|gb|ACI38079.1| arylsulfatase [Escherichia coli O157:H7 str. EC4115]
gi|209753340|gb|ACI74977.1| HemY protein [Escherichia coli]
gi|209753342|gb|ACI74978.1| HemY protein [Escherichia coli]
gi|209753346|gb|ACI74980.1| HemY protein [Escherichia coli]
gi|217320752|gb|EEC29176.1| arylsulfatase [Escherichia coli O157:H7 str. TW14588]
gi|254595196|gb|ACT74557.1| acrylsulfatase-like enzyme [Escherichia coli O157:H7 str. TW14359]
gi|320191108|gb|EFW65758.1| Arylsulfatase [Escherichia coli O157:H7 str. EC1212]
gi|326344255|gb|EGD68015.1| Arylsulfatase [Escherichia coli O157:H7 str. 1125]
gi|326347917|gb|EGD71631.1| Arylsulfatase [Escherichia coli O157:H7 str. 1044]
gi|377889357|gb|EHU53821.1| sulfatase family protein [Escherichia coli DEC3B]
gi|377900988|gb|EHU65312.1| sulfatase family protein [Escherichia coli DEC3A]
gi|377903743|gb|EHU68033.1| sulfatase family protein [Escherichia coli DEC3D]
gi|377926648|gb|EHU90578.1| sulfatase family protein [Escherichia coli DEC4B]
gi|377937826|gb|EHV01599.1| sulfatase family protein [Escherichia coli DEC4C]
gi|386798486|gb|AFJ31520.1| arylsulfatase [Escherichia coli Xuzhou21]
gi|390638282|gb|EIN17795.1| arylsulfatase [Escherichia coli FDA517]
gi|390697452|gb|EIN71872.1| arylsulfatase [Escherichia coli PA15]
gi|390717573|gb|EIN90355.1| arylsulfatase [Escherichia coli PA24]
gi|390718159|gb|EIN90917.1| arylsulfatase [Escherichia coli PA25]
gi|390724290|gb|EIN96850.1| arylsulfatase [Escherichia coli PA28]
gi|390737522|gb|EIO08810.1| arylsulfatase [Escherichia coli PA32]
gi|390758531|gb|EIO27972.1| arylsulfatase [Escherichia coli PA39]
gi|390764824|gb|EIO34019.1| arylsulfatase [Escherichia coli PA42]
gi|390786076|gb|EIO53604.1| arylsulfatase [Escherichia coli TW07945]
gi|390796617|gb|EIO63888.1| arylsulfatase [Escherichia coli TW10246]
gi|390800063|gb|EIO67176.1| arylsulfatase [Escherichia coli TW09098]
gi|390803137|gb|EIO70161.1| arylsulfatase [Escherichia coli TW11039]
gi|390813562|gb|EIO80172.1| arylsulfatase [Escherichia coli TW10119]
gi|390822474|gb|EIO88593.1| arylsulfatase [Escherichia coli EC4203]
gi|390827584|gb|EIO93340.1| arylsulfatase [Escherichia coli EC4196]
gi|390840744|gb|EIP04747.1| arylsulfatase [Escherichia coli TW14313]
gi|390842854|gb|EIP06687.1| arylsulfatase [Escherichia coli TW14301]
gi|390847768|gb|EIP11292.1| arylsulfatase [Escherichia coli EC4421]
gi|390890102|gb|EIP49788.1| arylsulfatase [Escherichia coli EC4437]
gi|390897906|gb|EIP57206.1| arylsulfatase [Escherichia coli EC1738]
gi|390905781|gb|EIP64706.1| arylsulfatase [Escherichia coli EC1734]
gi|408061394|gb|EKG95913.1| arylsulfatase [Escherichia coli PA7]
gi|408063893|gb|EKG98380.1| arylsulfatase [Escherichia coli FRIK920]
gi|408094561|gb|EKH27578.1| arylsulfatase [Escherichia coli FRIK1999]
gi|408125024|gb|EKH55664.1| arylsulfatase [Escherichia coli PA4]
gi|408134768|gb|EKH64584.1| arylsulfatase [Escherichia coli PA23]
gi|408136831|gb|EKH66561.1| arylsulfatase [Escherichia coli PA49]
gi|408143728|gb|EKH73002.1| arylsulfatase [Escherichia coli PA45]
gi|408152108|gb|EKH80557.1| arylsulfatase [Escherichia coli TT12B]
gi|408157151|gb|EKH85317.1| arylsulfatase [Escherichia coli MA6]
gi|408211269|gb|EKI35821.1| arylsulfatase [Escherichia coli PA38]
gi|408271069|gb|EKI91218.1| arylsulfatase [Escherichia coli EC1850]
gi|408274160|gb|EKI94185.1| arylsulfatase [Escherichia coli EC1856]
gi|408282170|gb|EKJ01508.1| arylsulfatase [Escherichia coli EC1862]
gi|408303342|gb|EKJ20804.1| arylsulfatase [Escherichia coli EC1868]
gi|408315829|gb|EKJ32128.1| arylsulfatase [Escherichia coli EC1869]
gi|408321282|gb|EKJ37321.1| arylsulfatase [Escherichia coli EC1870]
gi|408323022|gb|EKJ38991.1| arylsulfatase [Escherichia coli NE098]
gi|408333985|gb|EKJ48893.1| arylsulfatase [Escherichia coli FRIK523]
gi|408341923|gb|EKJ56359.1| arylsulfatase [Escherichia coli 0.1304]
gi|408544793|gb|EKK22239.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 5.2239]
gi|408575638|gb|EKK51291.1| arylsulfatase [Escherichia coli 10.0833]
gi|408578549|gb|EKK54066.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 8.2524]
gi|427201292|gb|EKV71685.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 88.1042]
gi|427201431|gb|EKV71813.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 89.0511]
gi|427217561|gb|EKV86619.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 90.0091]
gi|427221289|gb|EKV90150.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 90.2281]
gi|427224246|gb|EKV92963.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 90.0039]
gi|427237712|gb|EKW05236.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 93.0056]
gi|427238127|gb|EKW05647.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 93.0055]
gi|427242462|gb|EKW09869.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 94.0618]
gi|427255779|gb|EKW22020.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 95.0183]
gi|427273038|gb|EKW37738.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 96.0428]
gi|427295797|gb|EKW58879.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 96.0107]
gi|444534967|gb|ELV15132.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.0814]
gi|444545275|gb|ELV24202.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.0815]
gi|444559302|gb|ELV36536.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.0848]
gi|444589438|gb|ELV64773.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA11]
gi|444603250|gb|ELV77960.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA13]
gi|444612439|gb|ELV86732.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA2]
gi|444619015|gb|ELV93076.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA47]
gi|444626740|gb|ELW00530.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli PA8]
gi|444637079|gb|ELW10455.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.1781]
gi|444666662|gb|ELW38722.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Escherichia coli 99.0670]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|331675268|ref|ZP_08376019.1| arylsulfatase [Escherichia coli TA280]
gi|432855803|ref|ZP_20083494.1| arylsulfatase [Escherichia coli KTE144]
gi|331067554|gb|EGI38958.1| arylsulfatase [Escherichia coli TA280]
gi|431397088|gb|ELG80549.1| arylsulfatase [Escherichia coli KTE144]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|300930037|ref|ZP_07145468.1| arylsulfatase [Escherichia coli MS 187-1]
gi|300462052|gb|EFK25545.1| arylsulfatase [Escherichia coli MS 187-1]
Length = 551
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|386621472|ref|YP_006141052.1| Arysulfatase [Escherichia coli NA114]
gi|387831691|ref|YP_003351628.1| arylsulfatase [Escherichia coli SE15]
gi|417285665|ref|ZP_12072956.1| arylsulfatase [Escherichia coli TW07793]
gi|425302701|ref|ZP_18692579.1| sulfatase [Escherichia coli 07798]
gi|432424203|ref|ZP_19666739.1| arylsulfatase [Escherichia coli KTE178]
gi|432502356|ref|ZP_19744104.1| arylsulfatase [Escherichia coli KTE216]
gi|432561066|ref|ZP_19797718.1| arylsulfatase [Escherichia coli KTE49]
gi|432696664|ref|ZP_19931854.1| arylsulfatase [Escherichia coli KTE162]
gi|432708193|ref|ZP_19943267.1| arylsulfatase [Escherichia coli KTE6]
gi|432923069|ref|ZP_20125775.1| arylsulfatase [Escherichia coli KTE173]
gi|432929759|ref|ZP_20130711.1| arylsulfatase [Escherichia coli KTE175]
gi|432983306|ref|ZP_20172072.1| arylsulfatase [Escherichia coli KTE211]
gi|433098628|ref|ZP_20284793.1| arylsulfatase [Escherichia coli KTE139]
gi|433108057|ref|ZP_20294015.1| arylsulfatase [Escherichia coli KTE148]
gi|281180848|dbj|BAI57178.1| arylsulfatase [Escherichia coli SE15]
gi|333971973|gb|AEG38778.1| Arysulfatase [Escherichia coli NA114]
gi|386250906|gb|EII97073.1| arylsulfatase [Escherichia coli TW07793]
gi|408210360|gb|EKI34925.1| sulfatase [Escherichia coli 07798]
gi|430941426|gb|ELC61573.1| arylsulfatase [Escherichia coli KTE178]
gi|431025678|gb|ELD38776.1| arylsulfatase [Escherichia coli KTE216]
gi|431088262|gb|ELD94158.1| arylsulfatase [Escherichia coli KTE49]
gi|431230664|gb|ELF26439.1| arylsulfatase [Escherichia coli KTE162]
gi|431254637|gb|ELF47905.1| arylsulfatase [Escherichia coli KTE6]
gi|431434482|gb|ELH16131.1| arylsulfatase [Escherichia coli KTE173]
gi|431439906|gb|ELH21237.1| arylsulfatase [Escherichia coli KTE175]
gi|431487956|gb|ELH67597.1| arylsulfatase [Escherichia coli KTE211]
gi|431612056|gb|ELI81311.1| arylsulfatase [Escherichia coli KTE139]
gi|431623625|gb|ELI92256.1| arylsulfatase [Escherichia coli KTE148]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|170681296|ref|YP_001746117.1| arylsulfatase [Escherichia coli SMS-3-5]
gi|218701288|ref|YP_002408917.1| arylsulfatase-like enzyme [Escherichia coli IAI39]
gi|218707435|ref|YP_002414954.1| arylsulfatase-like enzyme [Escherichia coli UMN026]
gi|251787058|ref|YP_003001362.1| arylsulfatase [Escherichia coli BL21(DE3)]
gi|253775576|ref|YP_003038407.1| sulfatase [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
gi|254163742|ref|YP_003046850.1| acrylsulfatase-like enzyme [Escherichia coli B str. REL606]
gi|254290492|ref|YP_003056240.1| acrylsulfatase-like protein [Escherichia coli BL21(DE3)]
gi|300900653|ref|ZP_07118810.1| arylsulfatase [Escherichia coli MS 198-1]
gi|300939985|ref|ZP_07154612.1| arylsulfatase [Escherichia coli MS 21-1]
gi|301025769|ref|ZP_07189282.1| arylsulfatase [Escherichia coli MS 69-1]
gi|386626692|ref|YP_006146420.1| acrylsulfatase-like protein [Escherichia coli O7:K1 str. CE10]
gi|417142458|ref|ZP_11985033.1| arylsulfatase [Escherichia coli 97.0259]
gi|417310376|ref|ZP_12097190.1| Arylsulfatase [Escherichia coli PCN033]
gi|419918842|ref|ZP_14437018.1| arylsulfatase [Escherichia coli KD2]
gi|419937453|ref|ZP_14454349.1| arylsulfatase [Escherichia coli 576-1]
gi|422789242|ref|ZP_16841973.1| sulfatase [Escherichia coli H489]
gi|422794128|ref|ZP_16846819.1| sulfatase [Escherichia coli TA007]
gi|422977438|ref|ZP_16977390.1| arylsulfatase [Escherichia coli TA124]
gi|432355836|ref|ZP_19599096.1| arylsulfatase [Escherichia coli KTE2]
gi|432404201|ref|ZP_19646943.1| arylsulfatase [Escherichia coli KTE26]
gi|432428468|ref|ZP_19670947.1| arylsulfatase [Escherichia coli KTE181]
gi|432463169|ref|ZP_19705299.1| arylsulfatase [Escherichia coli KTE204]
gi|432478164|ref|ZP_19720148.1| arylsulfatase [Escherichia coli KTE208]
gi|432520017|ref|ZP_19757195.1| arylsulfatase [Escherichia coli KTE228]
gi|432540185|ref|ZP_19777075.1| arylsulfatase [Escherichia coli KTE235]
gi|432545634|ref|ZP_19782456.1| arylsulfatase [Escherichia coli KTE236]
gi|432551113|ref|ZP_19787861.1| arylsulfatase [Escherichia coli KTE237]
gi|432619113|ref|ZP_19855210.1| arylsulfatase [Escherichia coli KTE75]
gi|432624169|ref|ZP_19860181.1| arylsulfatase [Escherichia coli KTE76]
gi|432633749|ref|ZP_19869665.1| arylsulfatase [Escherichia coli KTE80]
gi|432643401|ref|ZP_19879221.1| arylsulfatase [Escherichia coli KTE83]
gi|432668396|ref|ZP_19903964.1| arylsulfatase [Escherichia coli KTE116]
gi|432682583|ref|ZP_19917933.1| arylsulfatase [Escherichia coli KTE143]
gi|432716430|ref|ZP_19951443.1| arylsulfatase [Escherichia coli KTE9]
gi|432772575|ref|ZP_20006886.1| arylsulfatase [Escherichia coli KTE54]
gi|432795059|ref|ZP_20029130.1| arylsulfatase [Escherichia coli KTE78]
gi|432796570|ref|ZP_20030603.1| arylsulfatase [Escherichia coli KTE79]
gi|432889599|ref|ZP_20102871.1| arylsulfatase [Escherichia coli KTE158]
gi|432915470|ref|ZP_20120725.1| arylsulfatase [Escherichia coli KTE190]
gi|433021056|ref|ZP_20209132.1| arylsulfatase [Escherichia coli KTE105]
gi|433055431|ref|ZP_20242583.1| arylsulfatase [Escherichia coli KTE122]
gi|433070166|ref|ZP_20256927.1| arylsulfatase [Escherichia coli KTE128]
gi|433160958|ref|ZP_20345771.1| arylsulfatase [Escherichia coli KTE177]
gi|433180675|ref|ZP_20365046.1| arylsulfatase [Escherichia coli KTE82]
gi|170519014|gb|ACB17192.1| arylsulfatase [Escherichia coli SMS-3-5]
gi|218371274|emb|CAR19108.1| arylsulfatase-like enzyme [Escherichia coli IAI39]
gi|218434532|emb|CAR15458.1| arylsulfatase-like enzyme [Escherichia coli UMN026]
gi|242379331|emb|CAQ34142.1| arylsulfatase [Escherichia coli BL21(DE3)]
gi|253326620|gb|ACT31222.1| sulfatase [Escherichia coli 'BL21-Gold(DE3)pLysS AG']
gi|253975643|gb|ACT41314.1| acrylsulfatase-like enzyme [Escherichia coli B str. REL606]
gi|253979799|gb|ACT45469.1| acrylsulfatase-like enzyme [Escherichia coli BL21(DE3)]
gi|300355853|gb|EFJ71723.1| arylsulfatase [Escherichia coli MS 198-1]
gi|300395824|gb|EFJ79362.1| arylsulfatase [Escherichia coli MS 69-1]
gi|300455159|gb|EFK18652.1| arylsulfatase [Escherichia coli MS 21-1]
gi|323959055|gb|EGB54724.1| sulfatase [Escherichia coli H489]
gi|323969359|gb|EGB64658.1| sulfatase [Escherichia coli TA007]
gi|338768019|gb|EGP22825.1| Arylsulfatase [Escherichia coli PCN033]
gi|349740428|gb|AEQ15134.1| acrylsulfatase-like enzyme [Escherichia coli O7:K1 str. CE10]
gi|371593286|gb|EHN82169.1| arylsulfatase [Escherichia coli TA124]
gi|386155482|gb|EIH11837.1| arylsulfatase [Escherichia coli 97.0259]
gi|388389333|gb|EIL50867.1| arylsulfatase [Escherichia coli KD2]
gi|388397635|gb|EIL58607.1| arylsulfatase [Escherichia coli 576-1]
gi|430872049|gb|ELB95668.1| arylsulfatase [Escherichia coli KTE2]
gi|430922521|gb|ELC43273.1| arylsulfatase [Escherichia coli KTE26]
gi|430950294|gb|ELC69680.1| arylsulfatase [Escherichia coli KTE181]
gi|430985119|gb|ELD01726.1| arylsulfatase [Escherichia coli KTE204]
gi|431001673|gb|ELD17249.1| arylsulfatase [Escherichia coli KTE208]
gi|431047436|gb|ELD57436.1| arylsulfatase [Escherichia coli KTE228]
gi|431066676|gb|ELD75300.1| arylsulfatase [Escherichia coli KTE235]
gi|431070527|gb|ELD78830.1| arylsulfatase [Escherichia coli KTE236]
gi|431075966|gb|ELD83482.1| arylsulfatase [Escherichia coli KTE237]
gi|431150628|gb|ELE51678.1| arylsulfatase [Escherichia coli KTE75]
gi|431155700|gb|ELE56446.1| arylsulfatase [Escherichia coli KTE76]
gi|431166920|gb|ELE67223.1| arylsulfatase [Escherichia coli KTE80]
gi|431176984|gb|ELE76924.1| arylsulfatase [Escherichia coli KTE83]
gi|431197016|gb|ELE95883.1| arylsulfatase [Escherichia coli KTE116]
gi|431216855|gb|ELF14447.1| arylsulfatase [Escherichia coli KTE143]
gi|431269839|gb|ELF61140.1| arylsulfatase [Escherichia coli KTE9]
gi|431323462|gb|ELG10960.1| arylsulfatase [Escherichia coli KTE54]
gi|431335466|gb|ELG22604.1| arylsulfatase [Escherichia coli KTE78]
gi|431347741|gb|ELG34619.1| arylsulfatase [Escherichia coli KTE79]
gi|431413193|gb|ELG95987.1| arylsulfatase [Escherichia coli KTE158]
gi|431435072|gb|ELH16685.1| arylsulfatase [Escherichia coli KTE190]
gi|431526493|gb|ELI03242.1| arylsulfatase [Escherichia coli KTE105]
gi|431565331|gb|ELI38466.1| arylsulfatase [Escherichia coli KTE122]
gi|431578355|gb|ELI50962.1| arylsulfatase [Escherichia coli KTE128]
gi|431673056|gb|ELJ39287.1| arylsulfatase [Escherichia coli KTE177]
gi|431697635|gb|ELJ62737.1| arylsulfatase [Escherichia coli KTE82]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|432871692|ref|ZP_20091722.1| arylsulfatase [Escherichia coli KTE147]
gi|431407654|gb|ELG90863.1| arylsulfatase [Escherichia coli KTE147]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|26250538|ref|NP_756578.1| arylsulfatase [Escherichia coli CFT073]
gi|91213321|ref|YP_543307.1| arylsulfatase [Escherichia coli UTI89]
gi|117626058|ref|YP_859381.1| arylsulfatase-like enzyme [Escherichia coli APEC O1]
gi|218560864|ref|YP_002393777.1| arylsulfatase-like enzyme [Escherichia coli S88]
gi|222158494|ref|YP_002558633.1| Arylsulfatase [Escherichia coli LF82]
gi|227888617|ref|ZP_04006422.1| arylsulfatase [Escherichia coli 83972]
gi|237702808|ref|ZP_04533289.1| arylsulfatase [Escherichia sp. 3_2_53FAA]
gi|300985749|ref|ZP_07177575.1| arylsulfatase [Escherichia coli MS 45-1]
gi|331660144|ref|ZP_08361080.1| arylsulfatase [Escherichia coli TA206]
gi|386601825|ref|YP_006103331.1| arylsulfatase [Escherichia coli IHE3034]
gi|386606378|ref|YP_006112678.1| arylsulfatase [Escherichia coli UM146]
gi|386631737|ref|YP_006151457.1| arylsulfatase [Escherichia coli str. 'clone D i2']
gi|386636657|ref|YP_006156376.1| arylsulfatase [Escherichia coli str. 'clone D i14']
gi|386641433|ref|YP_006108231.1| arylsulfatase [Escherichia coli ABU 83972]
gi|387619093|ref|YP_006122115.1| arylsulfatase-like enzyme [Escherichia coli O83:H1 str. NRG 857C]
gi|417087757|ref|ZP_11954615.1| arylsulfatase [Escherichia coli cloneA_i1]
gi|419702641|ref|ZP_14230230.1| arylsulfatase [Escherichia coli SCI-07]
gi|419943286|ref|ZP_14459846.1| arylsulfatase [Escherichia coli HM605]
gi|422361501|ref|ZP_16442123.1| arylsulfatase [Escherichia coli MS 110-3]
gi|422364128|ref|ZP_16444656.1| arylsulfatase [Escherichia coli MS 153-1]
gi|422381318|ref|ZP_16461486.1| arylsulfatase [Escherichia coli MS 57-2]
gi|422752092|ref|ZP_16805997.1| sulfatase [Escherichia coli H252]
gi|422757517|ref|ZP_16811335.1| sulfatase [Escherichia coli H263]
gi|422842088|ref|ZP_16890054.1| arylsulfatase [Escherichia coli H397]
gi|432360252|ref|ZP_19603463.1| arylsulfatase [Escherichia coli KTE4]
gi|432365052|ref|ZP_19608205.1| arylsulfatase [Escherichia coli KTE5]
gi|432408873|ref|ZP_19651574.1| arylsulfatase [Escherichia coli KTE28]
gi|432414063|ref|ZP_19656715.1| arylsulfatase [Escherichia coli KTE39]
gi|432434023|ref|ZP_19676445.1| arylsulfatase [Escherichia coli KTE187]
gi|432438756|ref|ZP_19681132.1| arylsulfatase [Escherichia coli KTE188]
gi|432458941|ref|ZP_19701114.1| arylsulfatase [Escherichia coli KTE201]
gi|432493051|ref|ZP_19734879.1| arylsulfatase [Escherichia coli KTE214]
gi|432506691|ref|ZP_19748408.1| arylsulfatase [Escherichia coli KTE220]
gi|432526272|ref|ZP_19763383.1| arylsulfatase [Escherichia coli KTE230]
gi|432555880|ref|ZP_19792596.1| arylsulfatase [Escherichia coli KTE47]
gi|432571073|ref|ZP_19807577.1| arylsulfatase [Escherichia coli KTE53]
gi|432576042|ref|ZP_19812509.1| arylsulfatase [Escherichia coli KTE55]
gi|432590252|ref|ZP_19826602.1| arylsulfatase [Escherichia coli KTE58]
gi|432595012|ref|ZP_19831322.1| arylsulfatase [Escherichia coli KTE60]
gi|432600055|ref|ZP_19836323.1| arylsulfatase [Escherichia coli KTE62]
gi|432605236|ref|ZP_19841445.1| arylsulfatase [Escherichia coli KTE67]
gi|432653453|ref|ZP_19889189.1| arylsulfatase [Escherichia coli KTE87]
gi|432756755|ref|ZP_19991298.1| arylsulfatase [Escherichia coli KTE22]
gi|432780960|ref|ZP_20015175.1| arylsulfatase [Escherichia coli KTE59]
gi|432785785|ref|ZP_20019960.1| arylsulfatase [Escherichia coli KTE63]
gi|432789824|ref|ZP_20023950.1| arylsulfatase [Escherichia coli KTE65]
gi|432818588|ref|ZP_20052309.1| arylsulfatase [Escherichia coli KTE118]
gi|432824720|ref|ZP_20058383.1| arylsulfatase [Escherichia coli KTE123]
gi|432847019|ref|ZP_20079530.1| arylsulfatase [Escherichia coli KTE141]
gi|432901390|ref|ZP_20111476.1| arylsulfatase [Escherichia coli KTE192]
gi|432976023|ref|ZP_20164854.1| arylsulfatase [Escherichia coli KTE209]
gi|432997582|ref|ZP_20186161.1| arylsulfatase [Escherichia coli KTE218]
gi|433002177|ref|ZP_20190694.1| arylsulfatase [Escherichia coli KTE223]
gi|433010000|ref|ZP_20198410.1| arylsulfatase [Escherichia coli KTE229]
gi|433030748|ref|ZP_20218592.1| arylsulfatase [Escherichia coli KTE109]
gi|433060323|ref|ZP_20247353.1| arylsulfatase [Escherichia coli KTE124]
gi|433089526|ref|ZP_20275883.1| arylsulfatase [Escherichia coli KTE137]
gi|433117730|ref|ZP_20303508.1| arylsulfatase [Escherichia coli KTE153]
gi|433127432|ref|ZP_20312972.1| arylsulfatase [Escherichia coli KTE160]
gi|433141506|ref|ZP_20326742.1| arylsulfatase [Escherichia coli KTE167]
gi|433151458|ref|ZP_20336453.1| arylsulfatase [Escherichia coli KTE174]
gi|433165816|ref|ZP_20350540.1| arylsulfatase [Escherichia coli KTE179]
gi|433170813|ref|ZP_20355427.1| arylsulfatase [Escherichia coli KTE180]
gi|433209948|ref|ZP_20393610.1| arylsulfatase [Escherichia coli KTE97]
gi|433214827|ref|ZP_20398400.1| arylsulfatase [Escherichia coli KTE99]
gi|442603424|ref|ZP_21018314.1| Arylsulfatase [Escherichia coli Nissle 1917]
gi|26110968|gb|AAN83152.1|AE016769_267 Arylsulfatase [Escherichia coli CFT073]
gi|91074895|gb|ABE09776.1| arylsulfatase [Escherichia coli UTI89]
gi|115515182|gb|ABJ03257.1| arylsulfatase-like enzyme [Escherichia coli APEC O1]
gi|218367633|emb|CAR05416.1| arylsulfatase-like enzyme [Escherichia coli S88]
gi|222035499|emb|CAP78244.1| Arylsulfatase [Escherichia coli LF82]
gi|226902979|gb|EEH89238.1| arylsulfatase [Escherichia sp. 3_2_53FAA]
gi|227834456|gb|EEJ44922.1| arylsulfatase [Escherichia coli 83972]
gi|294491821|gb|ADE90577.1| arylsulfatase [Escherichia coli IHE3034]
gi|300407975|gb|EFJ91513.1| arylsulfatase [Escherichia coli MS 45-1]
gi|307555925|gb|ADN48700.1| arylsulfatase [Escherichia coli ABU 83972]
gi|307628862|gb|ADN73166.1| arylsulfatase [Escherichia coli UM146]
gi|312948354|gb|ADR29181.1| arylsulfatase-like enzyme [Escherichia coli O83:H1 str. NRG 857C]
gi|315284686|gb|EFU44131.1| arylsulfatase [Escherichia coli MS 110-3]
gi|315293143|gb|EFU52495.1| arylsulfatase [Escherichia coli MS 153-1]
gi|323949318|gb|EGB45208.1| sulfatase [Escherichia coli H252]
gi|323954005|gb|EGB49803.1| sulfatase [Escherichia coli H263]
gi|324007464|gb|EGB76683.1| arylsulfatase [Escherichia coli MS 57-2]
gi|331052712|gb|EGI24747.1| arylsulfatase [Escherichia coli TA206]
gi|355349486|gb|EHF98691.1| arylsulfatase [Escherichia coli cloneA_i1]
gi|355422636|gb|AER86833.1| arylsulfatase [Escherichia coli str. 'clone D i2']
gi|355427556|gb|AER91752.1| arylsulfatase [Escherichia coli str. 'clone D i14']
gi|371602152|gb|EHN90863.1| arylsulfatase [Escherichia coli H397]
gi|380346174|gb|EIA34473.1| arylsulfatase [Escherichia coli SCI-07]
gi|388421298|gb|EIL80915.1| arylsulfatase [Escherichia coli HM605]
gi|430873064|gb|ELB96643.1| arylsulfatase [Escherichia coli KTE4]
gi|430883010|gb|ELC06017.1| arylsulfatase [Escherichia coli KTE5]
gi|430925914|gb|ELC46510.1| arylsulfatase [Escherichia coli KTE28]
gi|430932513|gb|ELC52934.1| arylsulfatase [Escherichia coli KTE39]
gi|430950092|gb|ELC69482.1| arylsulfatase [Escherichia coli KTE187]
gi|430959635|gb|ELC77946.1| arylsulfatase [Escherichia coli KTE188]
gi|430978961|gb|ELC95750.1| arylsulfatase [Escherichia coli KTE201]
gi|431030675|gb|ELD43681.1| arylsulfatase [Escherichia coli KTE214]
gi|431034586|gb|ELD46512.1| arylsulfatase [Escherichia coli KTE220]
gi|431047332|gb|ELD57333.1| arylsulfatase [Escherichia coli KTE230]
gi|431080812|gb|ELD87602.1| arylsulfatase [Escherichia coli KTE47]
gi|431096853|gb|ELE02308.1| arylsulfatase [Escherichia coli KTE53]
gi|431104181|gb|ELE08784.1| arylsulfatase [Escherichia coli KTE55]
gi|431117359|gb|ELE20598.1| arylsulfatase [Escherichia coli KTE58]
gi|431125512|gb|ELE27914.1| arylsulfatase [Escherichia coli KTE60]
gi|431127282|gb|ELE29584.1| arylsulfatase [Escherichia coli KTE62]
gi|431144258|gb|ELE45965.1| arylsulfatase [Escherichia coli KTE67]
gi|431186570|gb|ELE86110.1| arylsulfatase [Escherichia coli KTE87]
gi|431299643|gb|ELF89214.1| arylsulfatase [Escherichia coli KTE22]
gi|431323810|gb|ELG11276.1| arylsulfatase [Escherichia coli KTE59]
gi|431325691|gb|ELG13072.1| arylsulfatase [Escherichia coli KTE63]
gi|431334993|gb|ELG22137.1| arylsulfatase [Escherichia coli KTE65]
gi|431373409|gb|ELG59015.1| arylsulfatase [Escherichia coli KTE118]
gi|431377662|gb|ELG62788.1| arylsulfatase [Escherichia coli KTE123]
gi|431392061|gb|ELG75664.1| arylsulfatase [Escherichia coli KTE141]
gi|431422034|gb|ELH04229.1| arylsulfatase [Escherichia coli KTE192]
gi|431485157|gb|ELH64821.1| arylsulfatase [Escherichia coli KTE209]
gi|431501773|gb|ELH80749.1| arylsulfatase [Escherichia coli KTE218]
gi|431504449|gb|ELH83075.1| arylsulfatase [Escherichia coli KTE223]
gi|431520843|gb|ELH98162.1| arylsulfatase [Escherichia coli KTE229]
gi|431540067|gb|ELI15698.1| arylsulfatase [Escherichia coli KTE109]
gi|431565570|gb|ELI38649.1| arylsulfatase [Escherichia coli KTE124]
gi|431600472|gb|ELI70142.1| arylsulfatase [Escherichia coli KTE137]
gi|431630329|gb|ELI98666.1| arylsulfatase [Escherichia coli KTE153]
gi|431639991|gb|ELJ07757.1| arylsulfatase [Escherichia coli KTE160]
gi|431655359|gb|ELJ22392.1| arylsulfatase [Escherichia coli KTE167]
gi|431666969|gb|ELJ33591.1| arylsulfatase [Escherichia coli KTE174]
gi|431683098|gb|ELJ48737.1| arylsulfatase [Escherichia coli KTE179]
gi|431683712|gb|ELJ49340.1| arylsulfatase [Escherichia coli KTE180]
gi|431728000|gb|ELJ91727.1| arylsulfatase [Escherichia coli KTE97]
gi|431731386|gb|ELJ94889.1| arylsulfatase [Escherichia coli KTE99]
gi|441715848|emb|CCQ04291.1| Arylsulfatase [Escherichia coli Nissle 1917]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|415773843|ref|ZP_11486390.1| arylsulfatase [Escherichia coli 3431]
gi|417615460|ref|ZP_12265908.1| arylsulfatase [Escherichia coli STEC_EH250]
gi|417620470|ref|ZP_12270871.1| arylsulfatase [Escherichia coli G58-1]
gi|418960318|ref|ZP_13512209.1| arylsulfatase [Escherichia coli J53]
gi|422773882|ref|ZP_16827563.1| sulfatase [Escherichia coli E482]
gi|425275090|ref|ZP_18666469.1| sulfatase [Escherichia coli TW15901]
gi|425285668|ref|ZP_18676680.1| sulfatase [Escherichia coli TW00353]
gi|315618503|gb|EFU99089.1| arylsulfatase [Escherichia coli 3431]
gi|323938937|gb|EGB35156.1| sulfatase [Escherichia coli E482]
gi|345357636|gb|EGW89828.1| arylsulfatase [Escherichia coli STEC_EH250]
gi|345369687|gb|EGX01669.1| arylsulfatase [Escherichia coli G58-1]
gi|384376925|gb|EIE34825.1| arylsulfatase [Escherichia coli J53]
gi|408189606|gb|EKI15317.1| sulfatase [Escherichia coli TW15901]
gi|408197795|gb|EKI23046.1| sulfatase [Escherichia coli TW00353]
Length = 531
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 77 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190
>gi|415838432|ref|ZP_11520403.1| arylsulfatase [Escherichia coli RN587/1]
gi|417282000|ref|ZP_12069300.1| arylsulfatase [Escherichia coli 3003]
gi|425280249|ref|ZP_18671461.1| sulfatase [Escherichia coli ARS4.2123]
gi|323189479|gb|EFZ74759.1| arylsulfatase [Escherichia coli RN587/1]
gi|386246329|gb|EII88059.1| arylsulfatase [Escherichia coli 3003]
gi|408197402|gb|EKI22665.1| sulfatase [Escherichia coli ARS4.2123]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|309784517|ref|ZP_07679155.1| arylsulfatase [Shigella dysenteriae 1617]
gi|308927623|gb|EFP73092.1| arylsulfatase [Shigella dysenteriae 1617]
Length = 531
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 77 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190
>gi|293413242|ref|ZP_06655904.1| conserved hypothetical protein [Escherichia coli B354]
gi|291468190|gb|EFF10687.1| conserved hypothetical protein [Escherichia coli B354]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|425269797|ref|ZP_18661408.1| arylsulfatase [Escherichia coli 5412]
gi|408180246|gb|EKI06871.1| arylsulfatase [Escherichia coli 5412]
Length = 531
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 77 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190
>gi|331665445|ref|ZP_08366344.1| arylsulfatase [Escherichia coli TA143]
gi|331057343|gb|EGI29332.1| arylsulfatase [Escherichia coli TA143]
Length = 531
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 77 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190
>gi|218550978|ref|YP_002384769.1| arylsulfatase-like protein [Escherichia fergusonii ATCC 35469]
gi|218358519|emb|CAQ91166.1| arylsulfatase-like enzyme [Escherichia fergusonii ATCC 35469]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|419912589|ref|ZP_14431039.1| arylsulfatase [Escherichia coli KD1]
gi|433200564|ref|ZP_20384444.1| arylsulfatase [Escherichia coli KTE94]
gi|388391448|gb|EIL52915.1| arylsulfatase [Escherichia coli KD1]
gi|431716610|gb|ELJ80717.1| arylsulfatase [Escherichia coli KTE94]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|343084600|ref|YP_004773895.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342353134|gb|AEL25664.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 472
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 66/205 (32%), Positives = 98/205 (47%), Gaps = 33/205 (16%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGID--- 78
GW D+G +G TPNID L G+ Y+ P C+PSRA+ LTGK P G
Sbjct: 40 GWKDLGCYGSEFYETPNIDKLRDQGMKFTAAYSASPVCSPSRASILTGKNPANIGFTGHI 99
Query: 79 TPVGA----GVAKAVP--------VTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFN 126
T +G + +P + EK++P+ L + GY++ IGKWH+G +E+ P +
Sbjct: 100 TAIGKHRYPEEGRIIPPDDYMHVSLEEKMIPEILLQSGYTSASIGKWHVG-EEEKFFPTH 158
Query: 127 RGFD-NHVGYWNG-----YLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQ 180
+GF N GY +G + + LD R +YLT+ TD+
Sbjct: 159 QGFAINIAGYEHGSPPTYWGPFESEKSWNPVIKNLDNRE---------EGQYLTNRLTDE 209
Query: 181 SVHVIKSHNHSRPLFLQITHAAVHT 205
+++ I N P FL ++H AVHT
Sbjct: 210 AINFI-DENKEGPFFLYLSHYAVHT 233
>gi|422808164|ref|ZP_16856590.1| sulfatase [Escherichia fergusonii B253]
gi|324111024|gb|EGC05011.1| sulfatase [Escherichia fergusonii B253]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|433002698|ref|ZP_20191206.1| arylsulfatase [Escherichia coli KTE227]
gi|433155988|ref|ZP_20340912.1| arylsulfatase [Escherichia coli KTE176]
gi|431521739|gb|ELH98978.1| arylsulfatase [Escherichia coli KTE227]
gi|431669827|gb|ELJ36193.1| arylsulfatase [Escherichia coli KTE176]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|416333496|ref|ZP_11670723.1| Arylsulfatase [Escherichia coli WV_060327]
gi|320197610|gb|EFW72222.1| Arylsulfatase [Escherichia coli WV_060327]
Length = 551
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|189404399|ref|ZP_03007442.1| arylsulfatase [Escherichia coli O157:H7 str. EC4501]
gi|419112683|ref|ZP_13657724.1| sulfatase family protein [Escherichia coli DEC4F]
gi|420272332|ref|ZP_14774678.1| arylsulfatase [Escherichia coli PA22]
gi|420277929|ref|ZP_14780207.1| arylsulfatase [Escherichia coli PA40]
gi|420300900|ref|ZP_14802942.1| arylsulfatase [Escherichia coli TW09109]
gi|423728023|ref|ZP_17701804.1| arylsulfatase [Escherichia coli PA31]
gi|424080129|ref|ZP_17817068.1| arylsulfatase [Escherichia coli FDA505]
gi|424092938|ref|ZP_17828845.1| arylsulfatase [Escherichia coli FRIK1996]
gi|424099629|ref|ZP_17834866.1| arylsulfatase [Escherichia coli FRIK1985]
gi|424105821|ref|ZP_17840535.1| arylsulfatase [Escherichia coli FRIK1990]
gi|424112461|ref|ZP_17846671.1| arylsulfatase [Escherichia coli 93-001]
gi|424118395|ref|ZP_17852214.1| arylsulfatase [Escherichia coli PA3]
gi|424124595|ref|ZP_17857876.1| arylsulfatase [Escherichia coli PA5]
gi|424130759|ref|ZP_17863645.1| arylsulfatase [Escherichia coli PA9]
gi|424137072|ref|ZP_17869492.1| arylsulfatase [Escherichia coli PA10]
gi|424143628|ref|ZP_17875464.1| arylsulfatase [Escherichia coli PA14]
gi|424458495|ref|ZP_17909576.1| arylsulfatase [Escherichia coli PA33]
gi|424471258|ref|ZP_17921040.1| arylsulfatase [Escherichia coli PA41]
gi|424496418|ref|ZP_17943938.1| arylsulfatase [Escherichia coli TW09195]
gi|424534867|ref|ZP_17978199.1| arylsulfatase [Escherichia coli EC4422]
gi|424540955|ref|ZP_17983883.1| arylsulfatase [Escherichia coli EC4013]
gi|424547101|ref|ZP_17989416.1| arylsulfatase [Escherichia coli EC4402]
gi|424553297|ref|ZP_17995108.1| arylsulfatase [Escherichia coli EC4439]
gi|424559500|ref|ZP_18000878.1| arylsulfatase [Escherichia coli EC4436]
gi|424571948|ref|ZP_18012466.1| arylsulfatase [Escherichia coli EC4448]
gi|424578107|ref|ZP_18018125.1| arylsulfatase [Escherichia coli EC1845]
gi|424583930|ref|ZP_18023560.1| arylsulfatase [Escherichia coli EC1863]
gi|425158660|ref|ZP_18557907.1| arylsulfatase [Escherichia coli PA34]
gi|425164979|ref|ZP_18563850.1| arylsulfatase [Escherichia coli FDA506]
gi|425170725|ref|ZP_18569183.1| arylsulfatase [Escherichia coli FDA507]
gi|425176770|ref|ZP_18574874.1| arylsulfatase [Escherichia coli FDA504]
gi|425189128|ref|ZP_18586383.1| arylsulfatase [Escherichia coli FRIK1997]
gi|425195857|ref|ZP_18592612.1| arylsulfatase [Escherichia coli NE1487]
gi|425202335|ref|ZP_18598528.1| arylsulfatase [Escherichia coli NE037]
gi|425208713|ref|ZP_18604495.1| arylsulfatase [Escherichia coli FRIK2001]
gi|425257550|ref|ZP_18650031.1| arylsulfatase [Escherichia coli CB7326]
gi|425313969|ref|ZP_18703121.1| arylsulfatase [Escherichia coli EC1735]
gi|425319950|ref|ZP_18708712.1| arylsulfatase [Escherichia coli EC1736]
gi|425326088|ref|ZP_18714400.1| arylsulfatase [Escherichia coli EC1737]
gi|425332401|ref|ZP_18720199.1| arylsulfatase [Escherichia coli EC1846]
gi|425338577|ref|ZP_18725901.1| arylsulfatase [Escherichia coli EC1847]
gi|425344871|ref|ZP_18731744.1| arylsulfatase [Escherichia coli EC1848]
gi|425350712|ref|ZP_18737155.1| arylsulfatase [Escherichia coli EC1849]
gi|425375503|ref|ZP_18760127.1| arylsulfatase [Escherichia coli EC1864]
gi|425388390|ref|ZP_18771933.1| arylsulfatase [Escherichia coli EC1866]
gi|189365812|gb|EDU84228.1| arylsulfatase [Escherichia coli O157:H7 str. EC4501]
gi|377952239|gb|EHV15835.1| sulfatase family protein [Escherichia coli DEC4F]
gi|390637152|gb|EIN16708.1| arylsulfatase [Escherichia coli FRIK1996]
gi|390637579|gb|EIN17122.1| arylsulfatase [Escherichia coli FDA505]
gi|390655840|gb|EIN33752.1| arylsulfatase [Escherichia coli FRIK1985]
gi|390656638|gb|EIN34498.1| arylsulfatase [Escherichia coli 93-001]
gi|390659504|gb|EIN37266.1| arylsulfatase [Escherichia coli FRIK1990]
gi|390674022|gb|EIN50230.1| arylsulfatase [Escherichia coli PA3]
gi|390677315|gb|EIN53370.1| arylsulfatase [Escherichia coli PA5]
gi|390680688|gb|EIN56515.1| arylsulfatase [Escherichia coli PA9]
gi|390691949|gb|EIN66669.1| arylsulfatase [Escherichia coli PA10]
gi|390696242|gb|EIN70731.1| arylsulfatase [Escherichia coli PA14]
gi|390711207|gb|EIN84190.1| arylsulfatase [Escherichia coli PA22]
gi|390736938|gb|EIO08254.1| arylsulfatase [Escherichia coli PA31]
gi|390741168|gb|EIO12258.1| arylsulfatase [Escherichia coli PA33]
gi|390755740|gb|EIO25271.1| arylsulfatase [Escherichia coli PA40]
gi|390761899|gb|EIO31170.1| arylsulfatase [Escherichia coli PA41]
gi|390804528|gb|EIO71494.1| arylsulfatase [Escherichia coli TW09109]
gi|390821974|gb|EIO88123.1| arylsulfatase [Escherichia coli TW09195]
gi|390858190|gb|EIP20598.1| arylsulfatase [Escherichia coli EC4422]
gi|390862478|gb|EIP24661.1| arylsulfatase [Escherichia coli EC4013]
gi|390866610|gb|EIP28560.1| arylsulfatase [Escherichia coli EC4402]
gi|390874882|gb|EIP35966.1| arylsulfatase [Escherichia coli EC4439]
gi|390880252|gb|EIP40944.1| arylsulfatase [Escherichia coli EC4436]
gi|390891496|gb|EIP51124.1| arylsulfatase [Escherichia coli EC4448]
gi|390915602|gb|EIP74111.1| arylsulfatase [Escherichia coli EC1845]
gi|390915802|gb|EIP74302.1| arylsulfatase [Escherichia coli EC1863]
gi|408065071|gb|EKG99547.1| arylsulfatase [Escherichia coli PA34]
gi|408075209|gb|EKH09447.1| arylsulfatase [Escherichia coli FDA506]
gi|408080203|gb|EKH14287.1| arylsulfatase [Escherichia coli FDA507]
gi|408088389|gb|EKH21761.1| arylsulfatase [Escherichia coli FDA504]
gi|408100742|gb|EKH33224.1| arylsulfatase [Escherichia coli FRIK1997]
gi|408105667|gb|EKH37814.1| arylsulfatase [Escherichia coli NE1487]
gi|408112477|gb|EKH44127.1| arylsulfatase [Escherichia coli NE037]
gi|408118660|gb|EKH49779.1| arylsulfatase [Escherichia coli FRIK2001]
gi|408170353|gb|EKH97562.1| arylsulfatase [Escherichia coli CB7326]
gi|408223502|gb|EKI47271.1| arylsulfatase [Escherichia coli EC1735]
gi|408235040|gb|EKI58027.1| arylsulfatase [Escherichia coli EC1736]
gi|408237773|gb|EKI60619.1| arylsulfatase [Escherichia coli EC1737]
gi|408243000|gb|EKI65548.1| arylsulfatase [Escherichia coli EC1846]
gi|408251826|gb|EKI73540.1| arylsulfatase [Escherichia coli EC1847]
gi|408256119|gb|EKI77512.1| arylsulfatase [Escherichia coli EC1848]
gi|408262776|gb|EKI83690.1| arylsulfatase [Escherichia coli EC1849]
gi|408288447|gb|EKJ07270.1| arylsulfatase [Escherichia coli EC1864]
gi|408304492|gb|EKJ21917.1| arylsulfatase [Escherichia coli EC1866]
Length = 531
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 77 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190
>gi|305667515|ref|YP_003863802.1| N-acetylgalactosamine 6-sulfatase [Maribacter sp. HTCC2170]
gi|88709563|gb|EAR01796.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Maribacter sp. HTCC2170]
Length = 596
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 46/118 (38%), Positives = 66/118 (55%), Gaps = 3/118 (2%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPV 81
QGW D+ F+G ++ TPNIDA+A NG Y P C+P+RA LTGKY R G+ +
Sbjct: 47 QGWGDLSFNGNTNLSTPNIDAIAKNGASFQNFYVQPVCSPTRAELLTGKYAARLGVYSTS 106
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGY 139
G + E + + K+ GY T GKWH G + P +RGFD++ G+ +G+
Sbjct: 107 TGG--ERFNSKETTIAEIFKKAGYKTTAYGKWHSGM-QPPYHPNSRGFDDYYGFTSGH 161
>gi|194438593|ref|ZP_03070681.1| arylsulfatase [Escherichia coli 101-1]
gi|293407428|ref|ZP_06651348.1| arylsulfatase [Escherichia coli FVEC1412]
gi|298383168|ref|ZP_06992762.1| arylsulfatase [Escherichia coli FVEC1302]
gi|442596910|ref|ZP_21014711.1| Arylsulfatase [Escherichia coli O5:K4(L):H4 str. ATCC 23502]
gi|194422397|gb|EDX38396.1| arylsulfatase [Escherichia coli 101-1]
gi|291425539|gb|EFE98577.1| arylsulfatase [Escherichia coli FVEC1412]
gi|298276404|gb|EFI17923.1| arylsulfatase [Escherichia coli FVEC1302]
gi|441654658|emb|CCQ00624.1| Arylsulfatase [Escherichia coli O5:K4(L):H4 str. ATCC 23502]
Length = 531
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 77 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190
>gi|291232668|ref|XP_002736267.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Saccoglossus kowalevskii]
Length = 518
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 95/197 (48%), Gaps = 15/197 (7%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
GW D+G G TPN+D +A G ++ Y P C+PSRA+ LTG+ P R G T
Sbjct: 38 GWGDLGVLGNPAKETPNLDRMASEGALMTDFYAPNPLCSPSRASLLTGRLPIRNGFYTTN 97
Query: 82 GAG--------VAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHV 133
+ +P +E +LP+ L + GY + +IGKWH+G ++ + P GFD +
Sbjct: 98 DHARCSYTPQYIVGGIPDSEIVLPELLNKAGYRSKIIGKWHLG-HQTQYHPLKHGFDEYF 156
Query: 134 GYWNGYLTYNDSIHETDFAVGLDAR---RNMERYAPQMSSKY-LTDFFTDQSVHVI-KSH 188
G N ++ D+ + + V DA R E + S + LT F ++++ I K H
Sbjct: 157 GAPNCHVGPYDNKKQPNIPVYRDADMIGRYYEEFKIDKSGESNLTQMFIEEAIAFIEKQH 216
Query: 189 NHSRPLFLQITHAAVHT 205
FL T A H+
Sbjct: 217 QTGEQFFLYWTPDASHS 233
>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
Length = 558
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 49/122 (40%), Positives = 72/122 (59%), Gaps = 4/122 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDTPVG 82
G+ D G+ + I TPNID L +GI Y+ C+PSR++FL+G+YP++ G+ V
Sbjct: 82 GFQDAGYR-NSAIHTPNIDKLVGDGISFTNAYSSQQCSPSRSSFLSGRYPYKSGMQHGVI 140
Query: 83 AGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIG-CNKEELLPFNRGFDNHVGYWNGYL 140
+ + + K L YLK+L Y+TH +GKWH+G CNK + P RGFD G ++G
Sbjct: 141 SDEGPNCMDLKFKFLSDYLKDLNYNTHAVGKWHLGYCNK-KCTPTYRGFDTFSGGYSGEG 199
Query: 141 TY 142
Y
Sbjct: 200 DY 201
>gi|32476258|ref|NP_869252.1| arylsulfatase A [Rhodopirellula baltica SH 1]
gi|32446802|emb|CAD76638.1| arylsulfatase A [Rhodopirellula baltica SH 1]
Length = 489
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 102/200 (51%), Gaps = 19/200 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ND+G +G +I TPN+D LA G Y+ C+PSRAA LTG YP R G+
Sbjct: 57 QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 116
Query: 81 V-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
V + E + +LK GY+T +GKWH+G +K E LP + GFD++ G Y N
Sbjct: 117 VLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 175
Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
G ++ +D + AV L ++ E + + +T +TD+++
Sbjct: 176 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEF 235
Query: 185 IKSHNHSRPLFLQITHAAVH 204
+++ N +P FL + H+ H
Sbjct: 236 VEA-NQDKPFFLYLPHSMPH 254
>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 443
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/194 (31%), Positives = 95/194 (48%), Gaps = 8/194 (4%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D+G +G I TP +D +A +G+ L Y P CTP+RAA +TG Y R G+ TP+
Sbjct: 45 GYGDLGCYGSESIRTPRLDRMAASGMKLTSFYAAAPICTPTRAALMTGCYATRVGLPTPL 104
Query: 82 GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNGYLT 141
+ +E L + +++ GY T +GKWH+G ++ P GF NH YW L
Sbjct: 105 HVYDEIGINESEFTLGEAMQQCGYETVCVGKWHLG-HQPRFYPTEHGF-NH--YWGTPLG 160
Query: 142 YNDSIHETDFAVG--LDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQIT 199
+ + A+G D + R P LT+ T+++V I++ RP FL +
Sbjct: 161 HMFNRPAVGKAIGDTSDLFLDDTREIPFPEDADLTERLTEKAVEFIEA-KRDRPFFLFLA 219
Query: 200 HAAVHTGTAGNAKL 213
H H A + K
Sbjct: 220 HPMPHEPLAASEKF 233
>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 389
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 74/137 (54%), Gaps = 5/137 (3%)
Query: 5 VGAGVAKAVPVTEKLLPQGWN--DVGFHGENDIPTPNIDALAYN-GIVLNRHYTLPTCTP 61
+ A A P +L W DV F I +P+ ++LA G++L+RHY C+P
Sbjct: 14 IAAATVNAKPNLVFVLVDDWGFADVSFRNPA-ISSPHFESLATKEGLILDRHYVFKYCSP 72
Query: 62 SRAAFLTGKYPFRYGIDTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEE 121
SRA+FLTG++P P +G+ A + ++P LK GY TH++GKWH G ++
Sbjct: 73 SRASFLTGRFPHHAHQWNPPQSGLVGA-NINMTMIPAKLKTAGYKTHMVGKWHEGFYLKK 131
Query: 122 LLPFNRGFDNHVGYWNG 138
LP NRGFD G+ G
Sbjct: 132 FLPINRGFDTMSGFLGG 148
>gi|306815164|ref|ZP_07449317.1| arylsulfatase [Escherichia coli NC101]
gi|432383695|ref|ZP_19626619.1| arylsulfatase [Escherichia coli KTE15]
gi|432389603|ref|ZP_19632481.1| arylsulfatase [Escherichia coli KTE16]
gi|432516187|ref|ZP_19753401.1| arylsulfatase [Escherichia coli KTE224]
gi|432613801|ref|ZP_19849957.1| arylsulfatase [Escherichia coli KTE72]
gi|432648469|ref|ZP_19884253.1| arylsulfatase [Escherichia coli KTE86]
gi|432658034|ref|ZP_19893730.1| arylsulfatase [Escherichia coli KTE93]
gi|432701313|ref|ZP_19936456.1| arylsulfatase [Escherichia coli KTE169]
gi|432747772|ref|ZP_19982433.1| arylsulfatase [Escherichia coli KTE43]
gi|432907621|ref|ZP_20116004.1| arylsulfatase [Escherichia coli KTE194]
gi|432940617|ref|ZP_20138518.1| arylsulfatase [Escherichia coli KTE183]
gi|432974071|ref|ZP_20162913.1| arylsulfatase [Escherichia coli KTE207]
gi|432987644|ref|ZP_20176354.1| arylsulfatase [Escherichia coli KTE215]
gi|433040814|ref|ZP_20228399.1| arylsulfatase [Escherichia coli KTE113]
gi|433084725|ref|ZP_20271169.1| arylsulfatase [Escherichia coli KTE133]
gi|433103396|ref|ZP_20289464.1| arylsulfatase [Escherichia coli KTE145]
gi|433146435|ref|ZP_20331564.1| arylsulfatase [Escherichia coli KTE168]
gi|433190604|ref|ZP_20374689.1| arylsulfatase [Escherichia coli KTE88]
gi|305851533|gb|EFM51987.1| arylsulfatase [Escherichia coli NC101]
gi|430902979|gb|ELC24724.1| arylsulfatase [Escherichia coli KTE16]
gi|430903083|gb|ELC24827.1| arylsulfatase [Escherichia coli KTE15]
gi|431037897|gb|ELD48867.1| arylsulfatase [Escherichia coli KTE224]
gi|431146038|gb|ELE47637.1| arylsulfatase [Escherichia coli KTE72]
gi|431177479|gb|ELE77403.1| arylsulfatase [Escherichia coli KTE86]
gi|431188145|gb|ELE87644.1| arylsulfatase [Escherichia coli KTE93]
gi|431239692|gb|ELF34164.1| arylsulfatase [Escherichia coli KTE169]
gi|431289672|gb|ELF80413.1| arylsulfatase [Escherichia coli KTE43]
gi|431427116|gb|ELH09159.1| arylsulfatase [Escherichia coli KTE194]
gi|431459667|gb|ELH39959.1| arylsulfatase [Escherichia coli KTE183]
gi|431478375|gb|ELH58123.1| arylsulfatase [Escherichia coli KTE207]
gi|431493817|gb|ELH73409.1| arylsulfatase [Escherichia coli KTE215]
gi|431548007|gb|ELI22297.1| arylsulfatase [Escherichia coli KTE113]
gi|431597311|gb|ELI67218.1| arylsulfatase [Escherichia coli KTE133]
gi|431615727|gb|ELI84849.1| arylsulfatase [Escherichia coli KTE145]
gi|431657075|gb|ELJ24043.1| arylsulfatase [Escherichia coli KTE168]
gi|431701561|gb|ELJ66476.1| arylsulfatase [Escherichia coli KTE88]
Length = 494
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 40 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 99
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 100 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 153
>gi|218692077|ref|YP_002400289.1| arylsulfatase-like enzyme [Escherichia coli ED1a]
gi|218429641|emb|CAR10603.2| arylsulfatase-like enzyme [Escherichia coli ED1a]
Length = 551
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYITQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|440714613|ref|ZP_20895192.1| arylsulfatase A [Rhodopirellula baltica SWK14]
gi|436440809|gb|ELP34113.1| arylsulfatase A [Rhodopirellula baltica SWK14]
Length = 470
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 102/200 (51%), Gaps = 19/200 (9%)
Query: 22 QGWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTP 80
QG+ND+G +G +I TPN+D LA G Y+ C+PSRAA LTG YP R G+
Sbjct: 38 QGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQH 97
Query: 81 V-GAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG--YWN 137
V + E + +LK GY+T +GKWH+G +K E LP + GFD++ G Y N
Sbjct: 98 VLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLGHHK-ETLPTSNGFDSYYGIPYSN 156
Query: 138 ----------GYLTYNDSIHETDFAVGL---DARRNMERYAPQMSSKYLTDFFTDQSVHV 184
G ++ +D + AV L ++ E + + +T +TD+++
Sbjct: 157 DMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDRAIEF 216
Query: 185 IKSHNHSRPLFLQITHAAVH 204
+++ N +P FL + H+ H
Sbjct: 217 VEA-NQDKPFFLYLPHSMPH 235
>gi|419156315|ref|ZP_13700868.1| sulfatase family protein, partial [Escherichia coli DEC6C]
gi|377992619|gb|EHV55765.1| sulfatase family protein, partial [Escherichia coli DEC6C]
Length = 370
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 77 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 136
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 137 PPMYGQPGGLQGL-TTLPQLLHDQGYVTQAIGKWHMGENKES-QPQNVGFDDFRGF 190
>gi|110644124|ref|YP_671854.1| arylsulfatase [Escherichia coli 536]
gi|191174275|ref|ZP_03035784.1| arylsulfatase [Escherichia coli F11]
gi|215489127|ref|YP_002331558.1| acrylsulfatase-like protein [Escherichia coli O127:H6 str. E2348/69]
gi|300979327|ref|ZP_07174511.1| arylsulfatase [Escherichia coli MS 200-1]
gi|312969473|ref|ZP_07783675.1| arylsulfatase [Escherichia coli 2362-75]
gi|417758226|ref|ZP_12406286.1| sulfatase family protein [Escherichia coli DEC2B]
gi|418999243|ref|ZP_13546819.1| sulfatase family protein [Escherichia coli DEC1A]
gi|419004605|ref|ZP_13552112.1| sulfatase family protein [Escherichia coli DEC1B]
gi|419010286|ref|ZP_13557693.1| sulfatase family protein [Escherichia coli DEC1C]
gi|419015988|ref|ZP_13563321.1| sulfatase family protein [Escherichia coli DEC1D]
gi|419020913|ref|ZP_13568209.1| sulfatase family protein [Escherichia coli DEC1E]
gi|419031516|ref|ZP_13578655.1| sulfatase family protein [Escherichia coli DEC2C]
gi|419037120|ref|ZP_13584190.1| sulfatase family protein [Escherichia coli DEC2D]
gi|419042214|ref|ZP_13589228.1| sulfatase family protein [Escherichia coli DEC2E]
gi|422373936|ref|ZP_16454231.1| arylsulfatase [Escherichia coli MS 60-1]
gi|432443330|ref|ZP_19685662.1| arylsulfatase [Escherichia coli KTE189]
gi|432448474|ref|ZP_19690769.1| arylsulfatase [Escherichia coli KTE191]
gi|432473153|ref|ZP_19715188.1| arylsulfatase [Escherichia coli KTE206]
gi|432585327|ref|ZP_19821717.1| arylsulfatase [Escherichia coli KTE57]
gi|432715659|ref|ZP_19950682.1| arylsulfatase [Escherichia coli KTE8]
gi|432734553|ref|ZP_19969374.1| arylsulfatase [Escherichia coli KTE45]
gi|432761638|ref|ZP_19996125.1| arylsulfatase [Escherichia coli KTE46]
gi|432804034|ref|ZP_20037983.1| arylsulfatase [Escherichia coli KTE84]
gi|433016118|ref|ZP_20204444.1| arylsulfatase [Escherichia coli KTE104]
gi|433025709|ref|ZP_20213674.1| arylsulfatase [Escherichia coli KTE106]
gi|433080012|ref|ZP_20266526.1| arylsulfatase [Escherichia coli KTE131]
gi|433122417|ref|ZP_20308070.1| arylsulfatase [Escherichia coli KTE157]
gi|433325279|ref|ZP_20402423.1| arylsulfatase [Escherichia coli J96]
gi|110345716|gb|ABG71953.1| arylsulfatase [Escherichia coli 536]
gi|190905458|gb|EDV65088.1| arylsulfatase [Escherichia coli F11]
gi|215267199|emb|CAS11647.1| acrylsulfatase-like enzyme [Escherichia coli O127:H6 str. E2348/69]
gi|300308080|gb|EFJ62600.1| arylsulfatase [Escherichia coli MS 200-1]
gi|312286020|gb|EFR13938.1| arylsulfatase [Escherichia coli 2362-75]
gi|324014744|gb|EGB83963.1| arylsulfatase [Escherichia coli MS 60-1]
gi|377838924|gb|EHU04028.1| sulfatase family protein [Escherichia coli DEC1C]
gi|377838996|gb|EHU04098.1| sulfatase family protein [Escherichia coli DEC1A]
gi|377841721|gb|EHU06782.1| sulfatase family protein [Escherichia coli DEC1B]
gi|377852838|gb|EHU17750.1| sulfatase family protein [Escherichia coli DEC1D]
gi|377855891|gb|EHU20754.1| sulfatase family protein [Escherichia coli DEC1E]
gi|377870201|gb|EHU34889.1| sulfatase family protein [Escherichia coli DEC2B]
gi|377872176|gb|EHU36825.1| sulfatase family protein [Escherichia coli DEC2C]
gi|377874253|gb|EHU38882.1| sulfatase family protein [Escherichia coli DEC2D]
gi|377885985|gb|EHU50474.1| sulfatase family protein [Escherichia coli DEC2E]
gi|430962751|gb|ELC80603.1| arylsulfatase [Escherichia coli KTE189]
gi|430970859|gb|ELC87904.1| arylsulfatase [Escherichia coli KTE191]
gi|430995319|gb|ELD11616.1| arylsulfatase [Escherichia coli KTE206]
gi|431114313|gb|ELE17857.1| arylsulfatase [Escherichia coli KTE57]
gi|431251061|gb|ELF45079.1| arylsulfatase [Escherichia coli KTE8]
gi|431270540|gb|ELF61703.1| arylsulfatase [Escherichia coli KTE45]
gi|431305314|gb|ELF93643.1| arylsulfatase [Escherichia coli KTE46]
gi|431345125|gb|ELG32052.1| arylsulfatase [Escherichia coli KTE84]
gi|431526204|gb|ELI02963.1| arylsulfatase [Escherichia coli KTE104]
gi|431530145|gb|ELI06830.1| arylsulfatase [Escherichia coli KTE106]
gi|431592977|gb|ELI63542.1| arylsulfatase [Escherichia coli KTE131]
gi|431638384|gb|ELJ06419.1| arylsulfatase [Escherichia coli KTE157]
gi|432346351|gb|ELL40835.1| arylsulfatase [Escherichia coli J96]
Length = 551
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 51/116 (43%), Positives = 67/116 (57%), Gaps = 5/116 (4%)
Query: 23 GWNDVGFHGENDI---PTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGIDT 79
GW DVGF+G PTP+IDA+A G++L Y+ P+ +P+RA LTG+Y +GI
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIHHGILM 156
Query: 80 PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVGY 135
P G + LPQ L + GY T IGKWH+G NKE P N GFD+ G+
Sbjct: 157 PPMYGQPGGLQGL-TTLPQLLHDQGYITQAIGKWHMGENKES-QPQNVGFDDFRGF 210
>gi|421611816|ref|ZP_16052946.1| arylsulfatase [Rhodopirellula baltica SH28]
gi|408497377|gb|EKK01906.1| arylsulfatase [Rhodopirellula baltica SH28]
Length = 1553
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 60/191 (31%), Positives = 96/191 (50%), Gaps = 17/191 (8%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++D+G +G +I TPNIDALA +G+ L + Y C PSRA+ +TG YP + GI
Sbjct: 7 GYSDLGCYG-GEISTPNIDALAADGVKLTQVYNSARCCPSRASLMTGLYPTQAGIGDFTA 65
Query: 78 ---DTPVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRGFDNHVG 134
+ G G + + + LK GY + +GKWH+ + P RGFD G
Sbjct: 66 REPNRTRGQGYLGRLRDDCVTMAEVLKPEGYGCYYVGKWHM---HPKTGPIKRGFDEFYG 122
Query: 135 YWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKS-HNHSRP 193
Y N ++ ++ D+ + L R ++ P Y TD F D ++ I+ + ++P
Sbjct: 123 YTN---DHSHDQYDADYYIRLPENR-VKEIDPPADQFYATDVFNDYAIEFIRQGQSTNKP 178
Query: 194 LFLQITHAAVH 204
FL + H++ H
Sbjct: 179 WFLFLGHSSPH 189
Score = 60.8 bits (146), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/92 (33%), Positives = 53/92 (57%), Gaps = 6/92 (6%)
Query: 25 NDVGFHGENDIPTPNIDALAYNGIVLNRHY-TLPTCTPSRAAFLTGKYPFRYGIDTPVGA 83
+D+ +G +PTPN++ LA G+V + Y T+ +C+PSR + +TG+YP G
Sbjct: 703 DDLSVYGNAFVPTPNLERLASKGLVFDNAYLTISSCSPSRCSMITGRYPHNTG-----AP 757
Query: 84 GVAKAVPVTEKLLPQYLKELGYSTHLIGKWHI 115
+ +P T++ Q L++ GY T + GK H+
Sbjct: 758 ELHTTLPETQRTFVQSLRDAGYHTVISGKNHM 789
>gi|149179303|ref|ZP_01857864.1| arylsulfatase [Planctomyces maris DSM 8797]
gi|148841844|gb|EDL56246.1| arylsulfatase [Planctomyces maris DSM 8797]
Length = 506
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 95/202 (47%), Gaps = 27/202 (13%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPTCTPSRAAFLTGKYPFRYGI----- 77
G++D+G +G +I TPNIDALA G+ ++ Y C P+RA +TG +P + GI
Sbjct: 38 GFSDIGCYG-GEIETPNIDALAAGGVRFSQFYNSGRCCPTRATLMTGLHPQQTGIGWMTN 96
Query: 78 ---DT------PVGAGVAKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEELLPFNRG 128
DT P G VT L + LK GY+T + GKWH+G N ++ P RG
Sbjct: 97 PPGDTRGYSKPPAYQGYLNRKCVT---LAEVLKPAGYATLMTGKWHLGFNAQDRWPLQRG 153
Query: 129 FDNHVGYWNGYLTYNDSIHETDFAVGLDARRNMERYAPQMSSK-YLTDFFTDQSVHVIKS 187
FD G +G + + G ++E A + Y TD +TD ++ +
Sbjct: 154 FDKFFGCVSGATRFFHPVVPRGMTFG---NEDIETPASTTDRRFYTTDAYTDYAIRFLNE 210
Query: 188 HNHS-----RPLFLQITHAAVH 204
H + +P FL + + A H
Sbjct: 211 HQQAKETQDKPFFLYLAYTAPH 232
>gi|443718583|gb|ELU09136.1| hypothetical protein CAPTEDRAFT_144340 [Capitella teleta]
Length = 557
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 45/118 (38%), Positives = 73/118 (61%), Gaps = 6/118 (5%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTLPT-CTPSRAAFLTGKYPFRYGIDTPV 81
G D+G G + + TP++D++ NG+ L+ + CTPSRAA +T +Y R G+++ +
Sbjct: 34 GIGDIGAFGNDTLRTPHVDSICENGVKLDHDLAAASLCTPSRAALMTSRYAIRSGMESVI 93
Query: 82 GAGVA-KAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL----LPFNRGFDNHVG 134
+ ++ + +P +E LPQ L+E GY+T LIGKWH+G N++ + P RGFD G
Sbjct: 94 LSLMSPQGLPASEYTLPQMLQEQGYATALIGKWHLGWNRQLMDHYYSPLKRGFDFFFG 151
>gi|326913618|ref|XP_003203133.1| PREDICTED: steryl-sulfatase-like, partial [Meleagris gallopavo]
Length = 485
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 52/124 (41%), Positives = 66/124 (53%), Gaps = 12/124 (9%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYTL-PTCTPSRAAFLTGKYPFRYGIDTPV 81
G D+G +G + PNID LA G+ L +H P CTPSRAAFLTG+YP R G+
Sbjct: 64 GIGDLGCYGNRTLRLPNIDRLAKEGVTLTQHLAASPLCTPSRAAFLTGRYPIRSGMAAFS 123
Query: 82 GAGV------AKAVPVTEKLLPQYLKELGYSTHLIGKWHIGCNKEEL-----LPFNRGFD 130
GV + +P E + LK+ GY+T LIGKWH+G N E P + GFD
Sbjct: 124 RVGVFLFSASSGGLPSEEITFSKVLKQRGYATALIGKWHLGMNCESSNDFCHHPLSHGFD 183
Query: 131 NHVG 134
G
Sbjct: 184 YFYG 187
>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 508
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 75/238 (31%), Positives = 105/238 (44%), Gaps = 63/238 (26%)
Query: 23 GWNDVGFHG----ENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGI 77
G ND+ G + + TPNID LA G + + Y+ TC PSRA +TG+YP R G
Sbjct: 30 GINDISTFGGGMADGRVQTPNIDRLAAEGALFSTAYSGTGTCAPSRAMLMTGRYPTRTGF 89
Query: 78 D-TPVGAGVAKAVPV-----------------TEKLLPQY---------------LKELG 104
+ TP G+++ VP+ EKL+P + LK+ G
Sbjct: 90 EYTPTPPGMSRIVPMFANDMKTGLPPTEQVKENEKLMPPFAEQGLPTEEVTLAEVLKDRG 149
Query: 105 YSTHLIGKWHIGCNKEELLPFNRGFDNHVGYWNG-YLTYNDS----------------IH 147
Y T IGKWH+G N P ++GFD + +G +L D
Sbjct: 150 YHTVHIGKWHLG-NTSPFRPNDQGFDESLDMASGLFLPPGDPRGVEARLDFDPIDKFLWA 208
Query: 148 ETDFAVGLDARRNMERYAPQMSSKYLTDFFTDQSVHVIKSHNHSRPLFLQITHAAVHT 205
DFA + E YLTD++TD+S+ VI + N +RP FL + H VHT
Sbjct: 209 RMDFAASYNGSDWFE------PGGYLTDYWTDESLKVIDA-NKNRPFFLYLAHWGVHT 259
>gi|429093555|ref|ZP_19156139.1| Arylsulfatase [Cronobacter dublinensis 1210]
gi|426741528|emb|CCJ82252.1| Arylsulfatase [Cronobacter dublinensis 1210]
Length = 502
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 62/197 (31%), Positives = 101/197 (51%), Gaps = 7/197 (3%)
Query: 23 GWNDVGFHGENDIPTPNIDALAYNGIVLNRHYT-LPTCTPSRAAFLTGKYPFRYGIDTPV 81
G+ D G +G + TPNID+LA G+ +Y P C+PSRA LTG+ PFR GI + +
Sbjct: 47 GYGDTGIYGHPIVKTPNIDSLAQQGMRFTEYYAPAPLCSPSRAGLLTGRTPFRTGIRSWI 106
Query: 82 GAGVAK-AVPVTEKLLPQYLKELGYSTHLIGKWHI--GCNK-EELLPFNRGFDNHVGYWN 137
+G A+ EK + YLKE GY T ++GK H+ G ++ ++ + GFD +
Sbjct: 107 PSGGKNVALGRNEKTIASYLKEQGYDTAMMGKLHLNAGADRTDQPQAKDMGFDYSLVNAA 166
Query: 138 GYLTYNDSIHETDFAVGLDARRNMERYA-PQMSSKYLT-DFFTDQSVHVIKSHNHSRPLF 195
G++T + +T G+ R P + K ++ + + +++H + S ++P F
Sbjct: 167 GFVTSDLDKVKTRPRYGVVYPNGFYRNGQPIGTVKQMSGELVSSEAIHWLDSRKDNKPFF 226
Query: 196 LQITHAAVHTGTAGNAK 212
L + VHT A K
Sbjct: 227 LYVAFTEVHTPLASPQK 243
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.138 0.431
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,355,981,890
Number of Sequences: 23463169
Number of extensions: 198194790
Number of successful extensions: 386081
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3408
Number of HSP's successfully gapped in prelim test: 5416
Number of HSP's that attempted gapping in prelim test: 369900
Number of HSP's gapped (non-prelim): 9560
length of query: 242
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 104
effective length of database: 9,121,278,045
effective search space: 948612916680
effective search space used: 948612916680
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 75 (33.5 bits)