BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy12301
         (632 letters)

Database: pdbaa 
           62,578 sequences; 14,973,337 total letters

Searching..................................................done



>pdb|1FSU|A Chain A, 4-Sulfatase (Human)
          Length = 492

 Score =  267 bits (682), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 155/408 (37%), Positives = 226/408 (55%), Gaps = 33/408 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+ TPSR+ L+TG+Y I TG+Q   I
Sbjct: 15  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLTGRYQIRTGLQHQII 73

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 74  WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 133

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPX 268
           YY H    L D  +  V     D R     A      Y+T++FTK A+ LI + P +KP 
Sbjct: 134 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKP- 190

Query: 269 XXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
                             + P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 191 ----LFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 246

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 247 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 297

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   A G T+     +DG D W ++   +PS R   +  +D 
Sbjct: 298 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID- 355

Query: 449 WSSLLLNTPSRRNSVLINIDEKKR----------TAAVRLDSWKLVLG 486
             + + ++P  RNS+    D+              AA+R  +WKL+ G
Sbjct: 356 -PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 402



 Score = 37.4 bits (85), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 5/64 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KP                   + P+E +  + +I D NR 
Sbjct: 169 YSTNIFTKRAIALITNHPPEKP-----LFLYLALQSVHEPLQVPEEYLKPYDFIQDKNRH 223

Query: 75  TYAA 78
            YA 
Sbjct: 224 HYAG 227



 Score = 34.7 bits (78), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 18/63 (28%), Positives = 34/63 (53%), Gaps = 2/63 (3%)

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +LF++  DP E+++++   P I ++L   L+++ +  VP      D  + DPK     W 
Sbjct: 432 WLFDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQD-PRCDPKA-TGVWG 489

Query: 628 PWI 630
           PW+
Sbjct: 490 PWM 492


>pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
 pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
 pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
 pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
          Length = 502

 Score =  142 bits (359), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 114/368 (30%), Positives = 163/368 (44%), Gaps = 63/368 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +G     TPN+D +A  G++  N Y A P+ +PSRA+L+TG+ PI  G     
Sbjct: 16  GWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAALLTGRLPIRNGFY--- 72

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ PL  GF+
Sbjct: 73  TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 131

Query: 201 SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEY------------ 246
             FG  N     YD               +  R N+     W+ VG Y            
Sbjct: 132 EWFGSPNCHFGPYD---------------NKARPNIPVYRDWEMVGRYYEEFPINLKTGE 176

Query: 247 --ATDLFTKEAVQLIEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQFQYITDPNR 304
              T ++ +EA+  I+ Q    P                    AP      F  +    R
Sbjct: 177 ANLTQIYLQEALDFIKRQARHHP---------FFLYWAVDATHAPVYASKPF--LGTSQR 225

Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
             Y   V+++DDS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+
Sbjct: 226 GRYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPF 280

Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
              K T +EGG++ PA+ W P      +VS Q+  I D L T   A  G T      IDG
Sbjct: 281 LCGKQTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDG 339

Query: 425 LDQWSSLL 432
           L+   +LL
Sbjct: 340 LNLLPTLL 347


>pdb|1HDH|A Chain A, Arylsulfatase From Pseudomonas Aeruginosa
 pdb|1HDH|B Chain B, Arylsulfatase From Pseudomonas Aeruginosa
          Length = 536

 Score = 92.4 bits (228), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 106/413 (25%), Positives = 165/413 (39%), Gaps = 101/413 (24%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ---- 147
           G++D+   G  EI TPN+DALA  G+ L + +     +P+R+ L+TG      G+     
Sbjct: 16  GFSDIGAFG-GEIATPNLDALAIAGLRLTDFHTASTXSPTRSMLLTGTDHHIAGIGTMAE 74

Query: 148 --GPPIWGAEPRGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
              P + G       L ER   LPE LRE GY T   GKWHLG  + E TP  RGFE  F
Sbjct: 75  ALTPELEGKPGYEGHLNERVVALPELLREAGYQTLMAGKWHLG-LKPEQTPHARGFERSF 133

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGH-----DMRRNLSTAWDTVGEYATDLFTKEAVQL 258
             L G  ++Y        S    L G      +  R L T  +  G Y++D F  + +Q 
Sbjct: 134 SLLPGAANHYGFEPPYDESTPRILKGTPALYVEDERYLDTLPE--GFYSSDAFGDKLLQY 191

Query: 259 IEDQPVDKPXXXXXXXXXXXXXXXXXXXEAPQETINQF--QYITDPN--RRTYAAMVKKL 314
           ++++   +P                   +AP+E + ++  +Y   P   R+   A +K+L
Sbjct: 192 LKERDQSRP-----FFAYLPFSAPHWPLQAPREIVEKYRGRYDAGPEALRQERLARLKEL 246

Query: 315 --------------------------------------------DDSVGTVISALQRKGM 330
                                                       D ++G V+  L+R+G 
Sbjct: 247 GLVEADVEAHPVLALTREWEALEDEERAKSARAMEVYAAMVERMDWNIGRVVDYLRRQGE 306

Query: 331 LENSIIIFMSDNGAP------------------------TVEYRETSNYRNW-------G 359
           L+N+ ++FMSDNGA                         ++E    +N   W        
Sbjct: 307 LDNTFVLFMSDNGAEGALLEAFPKFGPDLLGFLDRHYDNSLENIGRANSYVWYGPRWAQA 366

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           +  P R  K    +GG++VPA++  P++ +   +S     + D  PTL   AG
Sbjct: 367 ATAPSRLYKAFTTQGGIRVPALVRYPRLSRQGAISHAFATVMDVTPTLLDLAG 419


>pdb|1P49|A Chain A, Structure Of Human Placental EstroneDHEA SULFATASE
          Length = 562

 Score = 84.3 bits (207), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 50/124 (40%), Positives = 68/124 (54%), Gaps = 11/124 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILN-NMYAQPVCTPSRASLMTGKYPIHTGMQ--- 147
           G  D   +G+  I TPNID LA  G+ L  ++ A P+ TPSRA+ MTG+YP+ +GM    
Sbjct: 17  GIGDPGCYGNKTIRTPNIDRLASGGVKLTQHLAASPLXTPSRAAFMTGRYPVRSGMASWS 76

Query: 148 --GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT-----PLYRGFE 200
             G  ++ A   G+P  E    + L++ GYST  IGKWHLG      T     PL+ GF 
Sbjct: 77  RTGVFLFTASSGGLPTDEITFAKLLKDQGYSTALIGKWHLGMSCHSKTDFCHHPLHHGFN 136

Query: 201 SHFG 204
             +G
Sbjct: 137 YFYG 140



 Score = 57.0 bits (136), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 64/123 (52%), Gaps = 6/123 (4%)

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
            Y   V+++D SVG +++ L    +  +++I F SD GA  VE   +    + GSN  Y+
Sbjct: 286 VYGDAVEEMDWSVGQILNLLDELRLANDTLIYFTSDQGA-HVEEVSSKGEIHGGSNGIYK 344

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
           G K   WEGG++VP IL  P++ Q  +   +     D  PT+   AG   + LP +  ID
Sbjct: 345 GGKANNWEGGIRVPGILRWPRVIQAGQKIDEPTSNMDIFPTVAKLAG---APLPEDRIID 401

Query: 424 GLD 426
           G D
Sbjct: 402 GRD 404


>pdb|1E2S|P Chain P, Crystal Structure Of An Arylsulfatase A Mutant C69a
          Length = 489

 Score = 70.5 bits (171), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/116 (37%), Positives = 60/116 (51%), Gaps = 4/116 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV--CTPSRASLMTGKYPIHTGMQGP 149
           G+ DL  +G     TPN+D LA  G+   + Y  PV   TPSRA+L+TG+ P+  GM   
Sbjct: 14  GYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLATPSRAALLTGRLPVRMGMYPG 72

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFG 204
            +  +   G+PL E  + E L   GY T   GKWHLG      + P ++GF    G
Sbjct: 73  VLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 128



 Score = 67.8 bits (164), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 84/196 (42%), Gaps = 35/196 (17%)

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R  +   + +LD +VGT+++A+   G+LE +++IF +DNG       ET      G + 
Sbjct: 225 GRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGP------ETMRMSRGGCSG 278

Query: 363 PYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             R  K T +EGGV+ PA+  W   I   P V+ ++    D LPTL   AG   + LP  
Sbjct: 279 LLRCGKGTTYEGGVREPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---APLP-- 331

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRL 478
                             N  +DG D    LL    S R S+       DE +   AVR 
Sbjct: 332 ------------------NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRT 373

Query: 479 DSWKLVLGTQENGTMD 494
             +K    TQ +   D
Sbjct: 374 GKYKAHFFTQGSAHSD 389


>pdb|1E33|P Chain P, Crystal Structure Of An Arylsulfatase A Mutant P426l
          Length = 489

 Score = 70.1 bits (170), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/116 (37%), Positives = 60/116 (51%), Gaps = 4/116 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVC--TPSRASLMTGKYPIHTGMQGP 149
           G+ DL  +G     TPN+D LA  G+   + Y  PV   TPSRA+L+TG+ P+  GM   
Sbjct: 14  GYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLXTPSRAALLTGRLPVRMGMYPG 72

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFG 204
            +  +   G+PL E  + E L   GY T   GKWHLG      + P ++GF    G
Sbjct: 73  VLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 128



 Score = 67.8 bits (164), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 84/196 (42%), Gaps = 35/196 (17%)

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R  +   + +LD +VGT+++A+   G+LE +++IF +DNG       ET      G + 
Sbjct: 225 GRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGP------ETMRMSRGGCSG 278

Query: 363 PYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             R  K T +EGGV+ PA+  W   I   P V+ ++    D LPTL   AG   + LP  
Sbjct: 279 LLRCGKGTTYEGGVREPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---APLP-- 331

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRL 478
                             N  +DG D    LL    S R S+       DE +   AVR 
Sbjct: 332 ------------------NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRT 373

Query: 479 DSWKLVLGTQENGTMD 494
             +K    TQ +   D
Sbjct: 374 GKYKAHFFTQGSAHSD 389


>pdb|1N2K|A Chain A, Crystal Structure Of A Covalent Intermediate Of Endogenous
           Human Arylsulfatase A
 pdb|1N2L|A Chain A, Crystal Structure Of A Covalent Intermediate Of Endogenous
           Human Arylsulfatase A
 pdb|1AUK|A Chain A, Human Arylsulfatase A
          Length = 489

 Score = 70.1 bits (170), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/116 (37%), Positives = 60/116 (51%), Gaps = 4/116 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVC--TPSRASLMTGKYPIHTGMQGP 149
           G+ DL  +G     TPN+D LA  G+   + Y  PV   TPSRA+L+TG+ P+  GM   
Sbjct: 14  GYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLXTPSRAALLTGRLPVRMGMYPG 72

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFG 204
            +  +   G+PL E  + E L   GY T   GKWHLG      + P ++GF    G
Sbjct: 73  VLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 128



 Score = 68.2 bits (165), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 84/196 (42%), Gaps = 35/196 (17%)

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R  +   + +LD +VGT+++A+   G+LE +++IF +DNG       ET      G + 
Sbjct: 225 GRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGP------ETMRMSRGGCSG 278

Query: 363 PYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             R  K T +EGGV+ PA+  W   I   P V+ ++    D LPTL   AG   + LP  
Sbjct: 279 LLRCGKGTTYEGGVREPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---APLP-- 331

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRL 478
                             N  +DG D    LL    S R S+       DE +   AVR 
Sbjct: 332 ------------------NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRT 373

Query: 479 DSWKLVLGTQENGTMD 494
             +K    TQ +   D
Sbjct: 374 GKYKAHFFTQGSAHSD 389


>pdb|1E1Z|P Chain P, Crystal Structure Of An Arylsulfatase A Mutant C69s
          Length = 489

 Score = 70.1 bits (170), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/116 (37%), Positives = 60/116 (51%), Gaps = 4/116 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVC--TPSRASLMTGKYPIHTGMQGP 149
           G+ DL  +G     TPN+D LA  G+   + Y  PV   TPSRA+L+TG+ P+  GM   
Sbjct: 14  GYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLSTPSRAALLTGRLPVRMGMYPG 72

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFG 204
            +  +   G+PL E  + E L   GY T   GKWHLG      + P ++GF    G
Sbjct: 73  VLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 128



 Score = 67.8 bits (164), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 84/196 (42%), Gaps = 35/196 (17%)

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R  +   + +LD +VGT+++A+   G+LE +++IF +DNG       ET      G + 
Sbjct: 225 GRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGP------ETMRMSRGGCSG 278

Query: 363 PYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             R  K T +EGGV+ PA+  W   I   P V+ ++    D LPTL   AG   + LP  
Sbjct: 279 LLRCGKGTTYEGGVREPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---APLP-- 331

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRL 478
                             N  +DG D    LL    S R S+       DE +   AVR 
Sbjct: 332 ------------------NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRT 373

Query: 479 DSWKLVLGTQENGTMD 494
             +K    TQ +   D
Sbjct: 374 GKYKAHFFTQGSAHSD 389


>pdb|1E3C|P Chain P, Crystal Structure Of An Arylsulfatase A Mutant C69s Soaked
           In Synthetic Substrate
          Length = 489

 Score = 70.1 bits (170), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/116 (37%), Positives = 60/116 (51%), Gaps = 4/116 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVC--TPSRASLMTGKYPIHTGMQGP 149
           G+ DL  +G     TPN+D LA  G+   + Y  PV   TPSRA+L+TG+ P+  GM   
Sbjct: 14  GYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYV-PVSLSTPSRAALLTGRLPVRMGMYPG 72

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF-RREYTPLYRGFESHFG 204
            +  +   G+PL E  + E L   GY T   GKWHLG      + P ++GF    G
Sbjct: 73  VLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQGFHRFLG 128



 Score = 68.2 bits (165), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/196 (29%), Positives = 84/196 (42%), Gaps = 35/196 (17%)

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R  +   + +LD +VGT+++A+   G+LE +++IF +DNG       ET      G + 
Sbjct: 225 GRGPFGDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGP------ETMRMSRGGCSG 278

Query: 363 PYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             R  K T +EGGV+ PA+  W   I   P V+ ++    D LPTL   AG   + LP  
Sbjct: 279 LLRCGKGTTYEGGVREPALAFWPGHIA--PGVTHELASSLDLLPTLAALAG---APLP-- 331

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRL 478
                             N  +DG D    LL    S R S+       DE +   AVR 
Sbjct: 332 ------------------NVTLDGFDLSPLLLGTGKSPRQSLFFYPSYPDEVRGVFAVRT 373

Query: 479 DSWKLVLGTQENGTMD 494
             +K    TQ +   D
Sbjct: 374 GKYKAHFFTQGSAHSD 389


>pdb|3ED4|A Chain A, Crystal Structure Of Putative Arylsulfatase From
           Escherichia Coli
 pdb|3ED4|B Chain B, Crystal Structure Of Putative Arylsulfatase From
           Escherichia Coli
 pdb|3ED4|C Chain C, Crystal Structure Of Putative Arylsulfatase From
           Escherichia Coli
 pdb|3ED4|D Chain D, Crystal Structure Of Putative Arylsulfatase From
           Escherichia Coli
          Length = 502

 Score = 68.6 bits (166), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 39/101 (38%), Positives = 55/101 (54%)

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y A +  LD  VG V+  ++  G  +N+I+IF SDNG  T E R+       G     RG
Sbjct: 270 YYANISYLDAQVGKVLDKIKAMGEEDNTIVIFTSDNGPVTREARKVYELNLAGETDGLRG 329

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
            K+ LWEGG++VPAI+   +      VS   ++  DW+PTL
Sbjct: 330 RKDNLWEGGIRVPAIIKYGKHLPQGMVSDTPVYGLDWMPTL 370



 Score = 63.9 bits (154), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 37/97 (38%), Positives = 53/97 (54%), Gaps = 6/97 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL+ +G   + TPNID LA  G+   + YA  P+ +PSRA L+TG+ P  TG++   
Sbjct: 39  GYGDLATYGHQIVKTPNIDRLAQEGVKFTDYYAPAPLSSPSRAGLLTGRMPFRTGIRS-- 96

Query: 151 IWGAEPRGVPL--TERFLPEYLRELGYSTKAIGKWHL 185
            W    + V L   E  +   L+  GY T  +GK HL
Sbjct: 97  -WIPSGKDVALGRNELTIANLLKAQGYDTAMMGKLHL 132



 Score = 35.4 bits (80), Expect = 0.093,   Method: Compositional matrix adjust.
 Identities = 20/57 (35%), Positives = 31/57 (54%), Gaps = 5/57 (8%)

Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLY-ELLKYH----RRTLVPQSHEQPDLV 615
           N P YL+NL +D  E  N+   +PDI  Q+Y + LKY       +L+    ++P+ V
Sbjct: 435 NKPKYLYNLKSDRYETLNLIGKKPDIEKQMYGKFLKYKTDIDNDSLMKARGDKPEAV 491


>pdb|2QZU|A Chain A, Crystal Structure Of The Putative Sulfatase Yidj From
           Bacteroides Fragilis. Northeast Structural Genomics
           Consortium Target Bfr123
          Length = 491

 Score = 59.7 bits (143), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 52/100 (52%), Gaps = 3/100 (3%)

Query: 100 GSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRG 158
           G   + TP++D LA  GI   N + + PV +P+R  L TG YPI + + G       P G
Sbjct: 47  GKEPVKTPHLDKLASEGINFTNAISSYPVSSPARGXLXTGXYPIGSKVTGNCNSETAPYG 106

Query: 159 VPLTE--RFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           V L++  R   + L++ GY+   IGKWHL    + Y   Y
Sbjct: 107 VELSQNARCWSDVLKDQGYNXGYIGKWHLDAPYKPYVDTY 146



 Score = 50.4 bits (119), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 63/132 (47%), Gaps = 19/132 (14%)

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
           N R Y A +  +D++VG +I AL++  + +N+I++F SD+G     +             
Sbjct: 275 NIRNYYACITGVDENVGRIIEALKQNNLFDNTIVVFTSDHGICXGAHENAG--------- 325

Query: 363 PYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQM-MHISDWLPTLYTAAGGDTSRLPL 420
                K+  +E   ++P IL W  QI+  PR S  + +  +D  PTL +  G  +  +P 
Sbjct: 326 -----KDIFYEESXRIPXILSWPDQIK--PRKSDPLXIAFADLYPTLLSXXGF-SKEIPE 377

Query: 421 NIDGLDQWSSLL 432
            +   D  + +L
Sbjct: 378 TVQTFDLSNEVL 389


>pdb|2W8S|B Chain B, Crystal Structure Of A Catalytically Promiscuous
           Phosphonate Monoester Hydrolase From Burkholderia
           Caryophylli
 pdb|2W8S|C Chain C, Crystal Structure Of A Catalytically Promiscuous
           Phosphonate Monoester Hydrolase From Burkholderia
           Caryophylli
 pdb|2W8S|D Chain D, Crystal Structure Of A Catalytically Promiscuous
           Phosphonate Monoester Hydrolase From Burkholderia
           Caryophylli
          Length = 514

 Score = 42.4 bits (98), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 40/83 (48%), Gaps = 14/83 (16%)

Query: 104 IPTPNIDALAYNGIILNNMYAQ--PVCTPSRASLMTGKYPI-HTGMQGPPIWGAEPRGVP 160
           + TPN+D L   G+   N      P C P+RASL+TG Y + H  +Q           VP
Sbjct: 31  LKTPNLDRLCREGLTFRNHVTTCVPXCGPARASLLTGLYLMNHRAVQNT---------VP 81

Query: 161 LTERF--LPEYLRELGYSTKAIG 181
           L +R   L + LR +GY    IG
Sbjct: 82  LDQRHLNLGKALRAIGYDPALIG 104



 Score = 39.7 bits (91), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNG 343
           R TY  ++ ++DD +G V + L   G  ++++IIF SD+G
Sbjct: 287 RATYCGLITEIDDCLGRVFAYLDETGQWDDTLIIFTSDHG 326


>pdb|2W8S|A Chain A, Crystal Structure Of A Catalytically Promiscuous
           Phosphonate Monoester Hydrolase From Burkholderia
           Caryophylli
          Length = 513

 Score = 42.4 bits (98), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 40/83 (48%), Gaps = 14/83 (16%)

Query: 104 IPTPNIDALAYNGIILNNMY--AQPVCTPSRASLMTGKYPI-HTGMQGPPIWGAEPRGVP 160
           + TPN+D L   G+   N      P C P+RASL+TG Y + H  +Q           VP
Sbjct: 30  LKTPNLDRLCREGLTFRNHVTTCVPXCGPARASLLTGLYLMNHRAVQNT---------VP 80

Query: 161 LTERF--LPEYLRELGYSTKAIG 181
           L +R   L + LR +GY    IG
Sbjct: 81  LDQRHLNLGKALRAIGYDPALIG 103



 Score = 39.7 bits (91), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 15/40 (37%), Positives = 26/40 (65%)

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNG 343
           R TY  ++ ++DD +G V + L   G  ++++IIF SD+G
Sbjct: 286 RATYCGLITEIDDCLGRVFAYLDETGQWDDTLIIFTSDHG 325


>pdb|2VQR|A Chain A, Crystal Structure Of A Phosphonate Monoester Hydrolase
           From Rhizobium Leguminosarum: A New Member Of The
           Alkaline Phosphatase Superfamily
          Length = 514

 Score = 41.6 bits (96), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 40/83 (48%), Gaps = 14/83 (16%)

Query: 104 IPTPNIDALAYNGIILNNMYAQ--PVCTPSRASLMTGKYPI-HTGMQGPPIWGAEPRGVP 160
           + TPN+D L   G+   N      P C P+RASL+TG Y + H  +Q           VP
Sbjct: 31  LKTPNLDRLCREGVTFRNHVTTCVPXCGPARASLLTGLYLMNHRAVQ---------NTVP 81

Query: 161 LTERF--LPEYLRELGYSTKAIG 181
           L +R   L + LR +GY    IG
Sbjct: 82  LDQRHLNLGKALRGVGYDPALIG 104



 Score = 40.4 bits (93), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 16/40 (40%), Positives = 26/40 (65%)

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNG 343
           R TY  ++ ++DD +G V S L   G  ++++IIF SD+G
Sbjct: 287 RATYCGLITEVDDCLGRVFSYLDETGQWDDTLIIFTSDHG 326


>pdb|3B5Q|A Chain A, Crystal Structure Of A Putative Sulfatase (Np_810509.1)
           From Bacteroides Thetaiotaomicron Vpi-5482 At 2.40 A
           Resolution
 pdb|3B5Q|B Chain B, Crystal Structure Of A Putative Sulfatase (Np_810509.1)
           From Bacteroides Thetaiotaomicron Vpi-5482 At 2.40 A
           Resolution
          Length = 482

 Score = 37.0 bits (84), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 44/100 (44%), Gaps = 15/100 (15%)

Query: 109 IDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ---GPPIWGAEPRGVPLTER 164
           ID +A  G+I +N Y   P+  PSRA+L +G  P  T ++     P+    P  VP    
Sbjct: 45  IDEVASRGVIFSNAYVGCPLSQPSRAALWSGXXPHQTNVRSNSSEPVNTRLPENVPT--- 101

Query: 165 FLPEYLRELGYSTKAIGKWH-----LGFFRREYTPLYRGF 199
            L     E GY     GK H      GF  +E  P+ + F
Sbjct: 102 -LGSLFSESGYEAVHFGKTHDXGSLRGFKHKE--PVAKPF 138



 Score = 32.7 bits (73), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 29/116 (25%), Positives = 45/116 (38%), Gaps = 19/116 (16%)

Query: 303 NRRTYAAM----VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
           N R Y A      K +   V +V+ AL       N+I++  +D+G     +R  + + ++
Sbjct: 242 NYRHYIAAFQHYTKXVSKQVDSVLKALYSTPAGRNTIVVIXADHGDGXASHRXVTKHISF 301

Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQI--QQNPRVSLQMMHISDWLPTLYTAAG 412
                        ++    VP I   P I  Q+ P   L      D LPTL   AG
Sbjct: 302 -------------YDEXTNVPFIFAGPGIKQQKKPVDHLLTQPTLDLLPTLCDLAG 344


>pdb|2IUC|B Chain B, Structure Of Alkaline Phosphatase From The Antarctic
           Bacterium Tab5
          Length = 375

 Score = 33.1 bits (74), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 24/103 (23%), Positives = 44/103 (42%), Gaps = 13/103 (12%)

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP-YRGVKN 369
           +   DD++GT ++  +  G   N+++I  SD+          SN R  GS Y  Y  +  
Sbjct: 274 INDFDDAIGTALAFAKSDG---NTLVIVTSDHETGGFTLAAKSNKREDGSEYSDYTEIGP 330

Query: 370 TLWEGG---VKVPAILWSP------QIQQNPRVSLQMMHISDW 403
           T   GG     +P   + P       I +N  +  +++ ++ W
Sbjct: 331 TFSTGGHSATLIPVFAYGPGSEEFIGIYENNEIFHKILKVTKW 373


>pdb|2ZIV|B Chain B, Crystal Structure Of The Mus81-Eme1 Complex
          Length = 351

 Score = 32.7 bits (73), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 6/32 (18%)

Query: 372 WEGGVKVP------AILWSPQIQQNPRVSLQM 397
           W GGVKV       A++W  QIQQ  RVSL+M
Sbjct: 246 WAGGVKVDLAGRGLALVWRRQIQQLNRVSLEM 277


>pdb|2ZIU|B Chain B, Crystal Structure Of The Mus81-Eme1 Complex
 pdb|2ZIX|B Chain B, Crystal Structure Of The Mus81-Eme1 Complex
          Length = 341

 Score = 32.3 bits (72), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 20/32 (62%), Gaps = 6/32 (18%)

Query: 372 WEGGVKVP------AILWSPQIQQNPRVSLQM 397
           W GGVKV       A++W  QIQQ  RVSL+M
Sbjct: 236 WAGGVKVDLAGRGLALVWRRQIQQLNRVSLEM 267


>pdb|2GSN|A Chain A, Structure Of Xac Nucleotide
           PyrophosphatasePHOSPHODIESTERASE
 pdb|2GSN|B Chain B, Structure Of Xac Nucleotide
           PyrophosphatasePHOSPHODIESTERASE
 pdb|2GSO|A Chain A, Structure Of Xac Nucleotide
           PyrophosphatasePHOSPHODIESTERASE IN COMPLEX WITH
           VANADATE
 pdb|2GSO|B Chain B, Structure Of Xac Nucleotide
           PyrophosphatasePHOSPHODIESTERASE IN COMPLEX WITH
           VANADATE
 pdb|2GSU|A Chain A, Structure Of Xac Nucleotide
           Pyrophosphatase/phosphodiesterase In Complex With Amp
 pdb|2GSU|B Chain B, Structure Of Xac Nucleotide
           Pyrophosphatase/phosphodiesterase In Complex With Amp
 pdb|2RH6|A Chain A, Structure Of Xac Npp For Evaluation Of Refinement
           Methodology
 pdb|2RH6|B Chain B, Structure Of Xac Npp For Evaluation Of Refinement
           Methodology
          Length = 393

 Score = 30.8 bits (68), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 17/57 (29%), Positives = 32/57 (56%)

Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
           E +++  +   P  R YA  V+ +D ++G +++ +QR G    + II +SD+G   V
Sbjct: 168 EHVDEAGHDHGPESRQYADAVRAVDAAIGRLLAGMQRDGTRARTNIIVVSDHGMAEV 224


>pdb|2ZIW|B Chain B, Crystal Structure Of The Mus81-Eme1 Complex
          Length = 341

 Score = 30.0 bits (66), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 16/31 (51%), Positives = 19/31 (61%), Gaps = 6/31 (19%)

Query: 372 WEGGVKVP------AILWSPQIQQNPRVSLQ 396
           W GGVKV       A++W  QIQQ  RVSL+
Sbjct: 236 WAGGVKVDLAGRGLALVWRRQIQQLNRVSLE 266


>pdb|1UZ8|B Chain B, Anti-Lewis X Fab Fragment In Complex With Lewis X
 pdb|1UZ8|H Chain H, Anti-Lewis X Fab Fragment In Complex With Lewis X
          Length = 212

 Score = 29.6 bits (65), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 41/95 (43%), Gaps = 8/95 (8%)

Query: 413 GDTSRLPLNIDGLD---QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE 469
           G + +L     G D    W S +   P +    I  ++  SS +  TPS ++  +I+ D 
Sbjct: 15  GGSQKLSCAASGFDFSGYWMSWVRQAPGKGLEWIGEINPDSSTINYTPSLKDKFIISRDN 74

Query: 470 KKRT-----AAVRLDSWKLVLGTQENGTMDGYYGQ 499
            K T     + VR +   L    +E GT   Y+GQ
Sbjct: 75  AKNTLYLQMSKVRSEDTALYYCARETGTRFDYWGQ 109


>pdb|1UZ6|F Chain F, Anti-Lewis X Fab Fragment Uncomplexed
 pdb|1UZ6|H Chain H, Anti-Lewis X Fab Fragment Uncomplexed
 pdb|1UZ6|P Chain P, Anti-Lewis X Fab Fragment Uncomplexed
 pdb|1UZ6|W Chain W, Anti-Lewis X Fab Fragment Uncomplexed
          Length = 217

 Score = 29.6 bits (65), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 26/95 (27%), Positives = 41/95 (43%), Gaps = 8/95 (8%)

Query: 413 GDTSRLPLNIDGLD---QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE 469
           G + +L     G D    W S +   P +    I  ++  SS +  TPS ++  +I+ D 
Sbjct: 15  GGSQKLSCAASGFDFSGYWMSWVRQAPGKGLEWIGEINPDSSTINYTPSLKDKFIISRDN 74

Query: 470 KKRT-----AAVRLDSWKLVLGTQENGTMDGYYGQ 499
            K T     + VR +   L    +E GT   Y+GQ
Sbjct: 75  AKNTLYLQMSKVRSEDTALYYCARETGTRFDYWGQ 109


  Database: pdbaa
    Posted date:  Mar 3, 2013 10:34 PM
  Number of letters in database: 14,973,337
  Number of sequences in database:  62,578
  
Lambda     K      H
   0.317    0.134    0.412 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,674,138
Number of Sequences: 62578
Number of extensions: 770326
Number of successful extensions: 1569
Number of sequences better than 100.0: 24
Number of HSP's better than 100.0 without gapping: 16
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 1502
Number of HSP's gapped (non-prelim): 43
length of query: 632
length of database: 14,973,337
effective HSP length: 105
effective length of query: 527
effective length of database: 8,402,647
effective search space: 4428194969
effective search space used: 4428194969
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 54 (25.4 bits)