BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy8678
         (394 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449474288|ref|XP_004174968.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 1
           [Taeniopygia guttata]
          Length = 284

 Score =  397 bits (1021), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 196/331 (59%), Positives = 229/331 (69%), Gaps = 50/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+PG  F MGT +P + +DGE P+R V L +F++DQ+EVSN  FQ FV+ATGYVTEAE
Sbjct: 1   MVLIPGGVFTMGTQEPEIQQDGEGPARRVHLSSFHMDQYEVSNQDFQSFVNATGYVTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE  +A I Q                              AVA   W 
Sbjct: 61  KFGDSFVFEGMLSEAVKADIHQ------------------------------AVAAAPWW 90

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   SW HPEG DS+I  RM+HPV+HVSWNDAVA+CTW
Sbjct: 91  ---LPVKG-----------------ASWRHPEGPDSSISSRMDHPVLHVSWNDAVAFCTW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGLENRLFPWGN L P+G+H AN+WQGEFPTNNTA DGY  TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLENRLFPWGNKLQPKGQHYANIWQGEFPTNNTAEDGYKGTAP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYNMVGN WEWT+DWW VHH P   +NPKGP++GTD+VKKGGSY+C++ Y
Sbjct: 191 VSAFPPNAYGLYNMVGNAWEWTSDWWAVHHSPQELHNPKGPSSGTDRVKKGGSYMCHKSY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           CYR+RCAARSQNTPDSSA NLGFRCAAD  P
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADSAP 281


>gi|355722650|gb|AES07642.1| sulfatase modifying factor 1 [Mustela putorius furo]
          Length = 374

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 187/334 (55%), Positives = 229/334 (68%), Gaps = 50/334 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91  MVLIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 151 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 180

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DST+ HR++HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVLHRLDHPVLHVSWNDAVAYCTW 220

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 280

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEKTINPKGPPSGKDRVKKGGSYMCHKSY 340

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           CYR+RCAARSQNTPDSSA NLGFRCAAD+ PTTG
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADRQPTTG 374


>gi|73985046|ref|XP_541796.2| PREDICTED: sulfatase-modifying factor 1 [Canis lupus familiaris]
          Length = 374

 Score =  388 bits (996), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 188/334 (56%), Positives = 229/334 (68%), Gaps = 50/334 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MGT+ P + +DGE P+R V++DAFY+D +EVSN  F++FV++TGY+TEAE
Sbjct: 91  MVLIPAGVFTMGTDDPQIKQDGEAPARRVSIDAFYMDAYEVSNADFEKFVNSTGYLTEAE 150

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 151 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 180

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DST  HR +HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTTLHRPDHPVLHVSWNDAVAYCTW 220

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 280

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGNVWEWT+DWW VHH    ++NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEKTHNPKGPPSGKDRVKKGGSYMCHKSY 340

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           CYR+RCAARSQNTPDSSA NLGFRCAAD+ PTTG
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADRQPTTG 374


>gi|449270327|gb|EMC81015.1| Sulfatase-modifying factor 1 [Columba livia]
          Length = 284

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 190/331 (57%), Positives = 231/331 (69%), Gaps = 50/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV++PG  F MGT++P + +DGE+P R V +++FY+D++EVSN  F+ FV++TGYVTEAE
Sbjct: 1   MVVIPGGVFTMGTDEPAIQQDGEWPVRKVHVNSFYMDRYEVSNEDFERFVNSTGYVTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSEE +A+I Q                              AVA   W 
Sbjct: 61  KFGDSFVFEGMLSEEVKAEIHQ------------------------------AVAAAPWW 90

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS I +RM+HPV+HVSWNDAVA+CTW
Sbjct: 91  ---LPVKG-----------------ANWKHPEGPDSNISNRMDHPVLHVSWNDAVAFCTW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGLENRLFPWGN L P+G+H AN+WQG FPTNNTA DGY  TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLENRLFPWGNKLQPKGQHYANIWQGVFPTNNTAEDGYKGTAP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWTADWW VHH     +NPKGP++GTD+VKKGGSY+C++ Y
Sbjct: 191 VTAFPPNGYGLYNIVGNAWEWTADWWAVHHSTEEVHNPKGPSSGTDRVKKGGSYMCHKSY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           CYR+RCAARSQNTPDSSA NLGFRCAAD  P
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADASP 281


>gi|115497162|ref|NP_001069544.1| sulfatase-modifying factor 1 precursor [Bos taurus]
 gi|122144266|sp|Q0P5L5.1|SUMF1_BOVIN RecName: Full=Sulfatase-modifying factor 1; AltName:
           Full=C-alpha-formylglycine-generating enzyme 1; Flags:
           Precursor
 gi|112361989|gb|AAI19886.1| Sulfatase modifying factor 1 [Bos taurus]
 gi|296474978|tpg|DAA17093.1| TPA: sulfatase modifying factor 1 precursor [Bos taurus]
          Length = 374

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 189/344 (54%), Positives = 232/344 (67%), Gaps = 54/344 (15%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P+PP +    MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV
Sbjct: 85  PSPPTK----MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFV 140

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           ++TGY+TEAEKFGD+FVFE +LSE+ ++ I Q                            
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKSDIQQ---------------------------- 172

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
             AVA   W    LP +                   +W HPEG DST+ HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVS 210

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           WNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
             DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKK
Sbjct: 271 GEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSAEETINPKGPPSGKDRVKK 330

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           GGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD  PTTG
Sbjct: 331 GGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAADHLPTTG 374


>gi|344276480|ref|XP_003410036.1| PREDICTED: sulfatase-modifying factor 1 [Loxodonta africana]
          Length = 374

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/334 (55%), Positives = 228/334 (68%), Gaps = 50/334 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91  MVRIPAGVFTMGTDDPQIKQDGEAPARRVMIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSEE +  I Q                              AVA   W 
Sbjct: 151 KFGDSFVFEGMLSEEVKTDIQQ------------------------------AVAAAPWW 180

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCTW 220

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGLENRLFPWGN L P+G+H AN+WQGEFP +NT  DG+  TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLENRLFPWGNKLQPKGQHYANIWQGEFPVSNTGEDGFRGTAP 280

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGNVWEWT+DWW +HH    + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNVWEWTSDWWTIHHSAEETLNPKGPPSGKDRVKKGGSYMCHKSY 340

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           CYR+RCAARSQNTPDSSA NLGFRCAAD+ PT G
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADRLPTAG 374


>gi|426339251|ref|XP_004033571.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 1
           [Gorilla gorilla gorilla]
          Length = 383

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 192/346 (55%), Positives = 232/346 (67%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 86  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 145

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 146 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 181

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR++HPV
Sbjct: 182 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRLDHPV 215

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 216 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 275

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 276 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWAVHHSAEETLNPKGPPSGKD 335

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 336 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 381


>gi|363738801|ref|XP_003642070.1| PREDICTED: sulfatase-modifying factor 1-like [Gallus gallus]
          Length = 353

 Score =  384 bits (985), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/331 (56%), Positives = 231/331 (69%), Gaps = 50/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MGT++P + +DGE+P+R V +++FY+DQ+EVSN +F+ FV++TGY+TEAE
Sbjct: 70  MVAIPGGVFTMGTDEPEIQQDGEWPARRVHVNSFYMDQYEVSNQEFERFVNSTGYLTEAE 129

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSEE +A+I Q                              AVA   W 
Sbjct: 130 KFGDSFVFEGMLSEEVKAEIHQ------------------------------AVAAAPWW 159

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W  PEG  S+I  RM+HPV+HVSWNDAVA+CTW
Sbjct: 160 ---LPVKG-----------------ANWRQPEGPGSSILSRMDHPVLHVSWNDAVAFCTW 199

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEYGCRGGLE RLFPWGN L P+G+H AN+WQG FPTNNTA DGY  TAP
Sbjct: 200 AGKRLPTEAEWEYGCRGGLEKRLFPWGNKLQPKGQHYANIWQGVFPTNNTAEDGYKGTAP 259

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW VHH    ++NPKGP++GTD+VKKGGSY+C++ Y
Sbjct: 260 VTAFPPNGYGLYNIVGNAWEWTSDWWAVHHSADEAHNPKGPSSGTDRVKKGGSYMCHKSY 319

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           CYR+RCAARSQNTPDSSA NLGFRCAAD  P
Sbjct: 320 CYRYRCAARSQNTPDSSASNLGFRCAADALP 350


>gi|297670799|ref|XP_002813542.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Pongo abelii]
          Length = 374

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 191/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR++HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIRHRLDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372


>gi|157821111|ref|NP_001102109.1| sulfatase-modifying factor 1 precursor [Rattus norvegicus]
 gi|149036859|gb|EDL91477.1| sulfatase modifying factor 1 (predicted) [Rattus norvegicus]
          Length = 372

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 190/345 (55%), Positives = 226/345 (65%), Gaps = 50/345 (14%)

Query: 3   LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           L   P       MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN  F++
Sbjct: 77  LTSGPRSLALTKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTVDAFYMDAYEVSNADFEK 136

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV++TGY+TEAEKFGD+FVFE +LSE  +A+I Q                          
Sbjct: 137 FVNSTGYLTEAEKFGDSFVFEGMLSEPVKAQIHQ-------------------------- 170

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
               AVA   W    LP +                    W HPEG DSTI HR NHPV+H
Sbjct: 171 ----AVAAAPWW---LPVKG-----------------ADWRHPEGPDSTILHRSNHPVLH 206

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           VSWNDAVAYC W G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  
Sbjct: 207 VSWNDAVAYCAWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVT 266

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKV 302
           NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGPT+G D+V
Sbjct: 267 NTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSAEETLNPKGPTSGKDRV 326

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           KKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD  PT 
Sbjct: 327 KKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 371


>gi|354473114|ref|XP_003498781.1| PREDICTED: sulfatase-modifying factor 1-like [Cricetulus griseus]
          Length = 353

 Score =  383 bits (983), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 187/333 (56%), Positives = 226/333 (67%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN  F++FV++TGY+TEAE
Sbjct: 70  MVPIPAGVFTMGTDDPQIKQDGEAPARRVTVDAFYMDAYEVSNADFEKFVNSTGYLTEAE 129

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +A I Q                              AVA   W 
Sbjct: 130 KFGDSFVFEGMLSEQVKAHIHQ------------------------------AVAAAPWW 159

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS+I HR+NHPV+HVSWNDAVAYC W
Sbjct: 160 ---LPVKG-----------------ANWRHPEGPDSSILHRLNHPVLHVSWNDAVAYCAW 199

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 200 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAP 259

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 260 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPTSGKDRVKKGGSYMCHKSY 319

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAAD  PT 
Sbjct: 320 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 352


>gi|38202250|ref|NP_877437.2| sulfatase-modifying factor 1 isoform 1 precursor [Homo sapiens]
 gi|62298562|sp|Q8NBK3.3|SUMF1_HUMAN RecName: Full=Sulfatase-modifying factor 1; AltName:
           Full=C-alpha-formylglycine-generating enzyme 1; Flags:
           Precursor
 gi|30840149|gb|AAO34683.1| C-alpha-formyglycine-generating enzyme [Homo sapiens]
 gi|32527651|gb|AAP86217.1| sulfatase modifying factor 1 [Homo sapiens]
 gi|84569991|gb|AAI10863.1| Sulfatase modifying factor 1 [Homo sapiens]
 gi|111309238|gb|AAI21123.1| Sulfatase modifying factor 1 [Homo sapiens]
 gi|119584311|gb|EAW63907.1| sulfatase modifying factor 1, isoform CRA_b [Homo sapiens]
 gi|220983392|dbj|BAH11168.1| sulfatase modifying factor 1 [Homo sapiens]
          Length = 374

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 192/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372


>gi|426249246|ref|XP_004018361.1| PREDICTED: sulfatase-modifying factor 1 [Ovis aries]
          Length = 347

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 188/344 (54%), Positives = 231/344 (67%), Gaps = 54/344 (15%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P+PP +    MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV
Sbjct: 58  PSPPTK----MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFV 113

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           ++TGY+TEAEKFGD+FVFE +LSE+ ++ I Q                            
Sbjct: 114 NSTGYLTEAEKFGDSFVFEGMLSEQVKSDIQQ---------------------------- 145

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
             AVA   W    LP +                   +W HPEG DST+ HR +HPV+HVS
Sbjct: 146 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVS 183

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           WNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT
Sbjct: 184 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 243

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
             DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKK
Sbjct: 244 GEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETINPKGPPSGKDRVKK 303

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           GGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD  PT G
Sbjct: 304 GGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAADHLPTMG 347


>gi|349732234|ref|NP_001231861.1| sulfatase-modifying factor precursor [Sus scrofa]
          Length = 374

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 184/334 (55%), Positives = 227/334 (67%), Gaps = 50/334 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91  MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LS++ ++ I Q                              AVA   W 
Sbjct: 151 KFGDSFVFEGILSDQVKSDIQQ------------------------------AVAAAPWW 180

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVVHRPDHPVLHVSWNDAVAYCTW 220

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 280

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW +HH    + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNAWEWTSDWWTIHHAAEETINPKGPPSGKDRVKKGGSYMCHKSY 340

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           CYR+RCAARSQNTPDSSA NLGFRCAAD  PTTG
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADHQPTTG 374


>gi|74207135|dbj|BAE30762.1| unnamed protein product [Mus musculus]
          Length = 381

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 186/333 (55%), Positives = 227/333 (68%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R VT+D FY+D +EVSN  F++FV++TGY+TEAE
Sbjct: 98  MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 157

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 158 KFGDSFVFEGMLSEQVKTHIHQ------------------------------AVAAAPWW 187

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW
Sbjct: 188 ---LPVKG-----------------ANWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTW 227

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQG+FP +NT  DG+  TAP
Sbjct: 228 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAP 287

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGNVWEWT+DWW VHH    ++NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 288 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSY 347

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAAD  PT 
Sbjct: 348 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 380


>gi|74192741|dbj|BAE34887.1| unnamed protein product [Mus musculus]
          Length = 372

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 186/333 (55%), Positives = 227/333 (68%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R VT+D FY+D +EVSN  F++FV++TGY+TEAE
Sbjct: 89  MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 148

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 149 KFGDSFVFEGMLSEQVKTHIHQ------------------------------AVAAAPWW 178

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW
Sbjct: 179 ---LPVKG-----------------ANWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTW 218

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQG+FP +NT  DG+  TAP
Sbjct: 219 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAP 278

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGNVWEWT+DWW VHH    ++NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 279 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSY 338

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAAD  PT 
Sbjct: 339 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 371


>gi|144094256|ref|NP_666049.2| sulfatase-modifying factor 1 precursor [Mus musculus]
 gi|33301732|sp|Q8R0F3.2|SUMF1_MOUSE RecName: Full=Sulfatase-modifying factor 1; AltName:
           Full=C-alpha-formylglycine-generating enzyme 1; Flags:
           Precursor
 gi|74144496|dbj|BAE36091.1| unnamed protein product [Mus musculus]
 gi|74148140|dbj|BAE36238.1| unnamed protein product [Mus musculus]
 gi|148666991|gb|EDK99407.1| sulfatase modifying factor 1 [Mus musculus]
          Length = 372

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 186/333 (55%), Positives = 227/333 (68%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R VT+D FY+D +EVSN  F++FV++TGY+TEAE
Sbjct: 89  MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 148

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 149 KFGDSFVFEGMLSEQVKTHIHQ------------------------------AVAAAPWW 178

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW
Sbjct: 179 ---LPVKG-----------------ANWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTW 218

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQG+FP +NT  DG+  TAP
Sbjct: 219 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAP 278

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGNVWEWT+DWW VHH    ++NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 279 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSY 338

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAAD  PT 
Sbjct: 339 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 371


>gi|344235930|gb|EGV92033.1| Sulfatase-modifying factor 1 [Cricetulus griseus]
          Length = 284

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 187/333 (56%), Positives = 226/333 (67%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN  F++FV++TGY+TEAE
Sbjct: 1   MVPIPAGVFTMGTDDPQIKQDGEAPARRVTVDAFYMDAYEVSNADFEKFVNSTGYLTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +A I Q                              AVA   W 
Sbjct: 61  KFGDSFVFEGMLSEQVKAHIHQ------------------------------AVAAAPWW 90

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS+I HR+NHPV+HVSWNDAVAYC W
Sbjct: 91  ---LPVKG-----------------ANWRHPEGPDSSILHRLNHPVLHVSWNDAVAYCAW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 191 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPTSGKDRVKKGGSYMCHKSY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAAD  PT 
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 283


>gi|380811964|gb|AFE77857.1| sulfatase-modifying factor 1 isoform 1 precursor [Macaca mulatta]
          Length = 374

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 189/342 (55%), Positives = 228/342 (66%), Gaps = 53/342 (15%)

Query: 8   PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81  PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           ++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                            
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ---------------------------- 172

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
             AVA   W    LP +                   +W HPEG DSTI HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIRHRPDHPVLHVS 210

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           WNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
             DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKK
Sbjct: 271 GEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKK 330

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           GGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 331 GGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372


>gi|149728333|ref|XP_001496626.1| PREDICTED: sulfatase-modifying factor 1 [Equus caballus]
          Length = 374

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 184/334 (55%), Positives = 227/334 (67%), Gaps = 50/334 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91  MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ ++ I Q                              AVA   W 
Sbjct: 151 KFGDSFVFEGMLSEQVKSDIQQ------------------------------AVAAAPWW 180

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DST+ HR +HPV+HVSWNDAVAYC W
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCAW 220

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 280

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCHKSY 340

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           CYR+RCAARSQNTPDSSA NLGFRCAAD+ PT+G
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADRLPTSG 374


>gi|67463712|pdb|1Y1E|X Chain X, Human Formylglycine Generating Enzyme
 gi|67463715|pdb|1Y1H|X Chain X, Human Formylglycine Generating Enzyme, Oxidised Cys
           Refined As Hydroperoxide
 gi|67463716|pdb|1Y1I|X Chain X, Hyuman Formylglycine Generating Enzyme, Reduced Form
          Length = 311

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 192/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 5   PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 64

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 65  EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 100

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 101 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 134

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 135 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 194

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 195 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 254

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 255 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 300


>gi|383417677|gb|AFH32052.1| sulfatase-modifying factor 1 isoform 1 precursor [Macaca mulatta]
          Length = 374

 Score =  381 bits (978), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 188/342 (54%), Positives = 228/342 (66%), Gaps = 53/342 (15%)

Query: 8   PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81  PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           ++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                            
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ---------------------------- 172

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
             AVA   W    LP +                   +W HPEG DST+ HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVRHRPDHPVLHVS 210

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           WNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
             DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKK
Sbjct: 271 GEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKK 330

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           GGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 331 GGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372


>gi|16553976|dbj|BAB71625.1| unnamed protein product [Homo sapiens]
          Length = 284

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 187/332 (56%), Positives = 225/332 (67%), Gaps = 50/332 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F++FV++TGY+TEAE
Sbjct: 1   MVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 61  KFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAAPWW 90

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DSTI HR +HPV+HVSWNDAVAYCTW
Sbjct: 91  ---LPVKG-----------------ANWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGSY+C+  Y
Sbjct: 191 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           CYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADRLPT 282


>gi|20073363|gb|AAH26981.1| Sulfatase modifying factor 1 [Mus musculus]
          Length = 284

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 186/332 (56%), Positives = 227/332 (68%), Gaps = 50/332 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R VT+D FY+D +EVSN  F++FV++TGY+TEAE
Sbjct: 1   MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 61  KFGDSFVFEGMLSEQVKTHIHQ------------------------------AVAAAPWW 90

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW
Sbjct: 91  ---LPVKG-----------------ANWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQG+FP +NT  DG+  TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGNVWEWT+DWW VHH    ++NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 191 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           CYR+RCAARSQNTPDSSA NLGFRCAAD  PT
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADHLPT 282


>gi|397522432|ref|XP_003831271.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Pan paniscus]
          Length = 374

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 191/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAA++ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAANRLPT 372


>gi|332815962|ref|XP_526122.3| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Pan troglodytes]
 gi|410218334|gb|JAA06386.1| sulfatase modifying factor 1 [Pan troglodytes]
 gi|410259882|gb|JAA17907.1| sulfatase modifying factor 1 [Pan troglodytes]
 gi|410308068|gb|JAA32634.1| sulfatase modifying factor 1 [Pan troglodytes]
 gi|410336755|gb|JAA37324.1| sulfatase modifying factor 1 [Pan troglodytes]
          Length = 374

 Score =  380 bits (976), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 191/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAA++ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAANRLPT 372


>gi|403270412|ref|XP_003927176.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Saimiri
           boliviensis boliviensis]
          Length = 376

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 189/346 (54%), Positives = 230/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F
Sbjct: 79  PGPVPEERPLAHSKMVFIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDTYEVSNAEF 138

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 139 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 174

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DST+ HR +HPV
Sbjct: 175 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPV 208

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 209 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLNNRLFPWGNKLQPKGQHYANIWQGEFP 268

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW V+H    + NPKGP +G D
Sbjct: 269 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVYHSVEETLNPKGPPSGKD 328

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 329 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRPPT 374


>gi|62858375|ref|NP_001016416.1| sulfatase modifying factor 1 precursor [Xenopus (Silurana)
           tropicalis]
 gi|89273863|emb|CAJ82122.1| sulfatase modifying factor 1 [Xenopus (Silurana) tropicalis]
          Length = 367

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 186/329 (56%), Positives = 223/329 (67%), Gaps = 50/329 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + MVL+P   F MGT++P + +DGE P+R V LD+FY+D +EVSN +F++FV ATG+VTE
Sbjct: 86  EQMVLIPSGVFTMGTDEPGIPQDGESPARKVRLDSFYMDTYEVSNLEFEKFVKATGHVTE 145

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEKFGD+FVFE L+SE+ ++ I Q                              AVA   
Sbjct: 146 AEKFGDSFVFEGLISEDVKSTIDQ------------------------------AVAAAP 175

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W    LP +                    W HPEG DS I +RM+HPV+HVSWNDA A+C
Sbjct: 176 WW---LPVKG-----------------ADWRHPEGPDSNILNRMDHPVLHVSWNDASAFC 215

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           TW G RLPTEAEWEY  RGGLENRLFPWGN L P+GEH ANVWQGEFPT NT  DGY+ T
Sbjct: 216 TWAGKRLPTEAEWEYASRGGLENRLFPWGNKLEPKGEHYANVWQGEFPTQNTGEDGYVQT 275

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           +PV ++  N + LYNMVGN WEWT+DWW+ HH    +YNPKGP+TG DKVKKGGSY+C+E
Sbjct: 276 SPVTAFPANGYSLYNMVGNAWEWTSDWWDTHHTADETYNPKGPSTGKDKVKKGGSYMCHE 335

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            YCYR+RCAARSQNTPDSSA NLGFRC A
Sbjct: 336 SYCYRYRCAARSQNTPDSSASNLGFRCVA 364


>gi|296225740|ref|XP_002807643.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 1
           [Callithrix jacchus]
          Length = 586

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 189/346 (54%), Positives = 229/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MVL+P   F MGT+ P + +DGE P+R VT+DAFY+D +EV+N +F
Sbjct: 289 PGPVPGERPLAHSKMVLIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVNNAEF 348

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 349 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 384

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG  STI HR +HPV
Sbjct: 385 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPGSTIRHRPDHPV 418

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYC W G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 419 LHVSWNDAVAYCMWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 478

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 479 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 538

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 539 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 584


>gi|326928166|ref|XP_003210252.1| PREDICTED: sulfatase-modifying factor 1-like [Meleagris gallopavo]
          Length = 291

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 186/333 (55%), Positives = 231/333 (69%), Gaps = 50/333 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K MV +PG  F MGT++P + +DGE+P+R V +++FY+DQ+EVSN +F+ FV++TGY+TE
Sbjct: 6   KLMVAIPGGVFTMGTDEPEIQQDGEWPARRVHVNSFYMDQYEVSNQEFERFVNSTGYLTE 65

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEKFGD+FVFE +LSEE +A+I Q                              AVA   
Sbjct: 66  AEKFGDSFVFEGMLSEEVKAEIHQ------------------------------AVAAAP 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W    LP +                   +W  PEG  S+I  RM+HPV+HVSWNDAVA+C
Sbjct: 96  WW---LPVKG-----------------ANWRQPEGPGSSILSRMDHPVLHVSWNDAVAFC 135

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           TW G RLPTEAEWEY CRGGLE RLFPWGN L P+G+H AN+WQG FP+NNTA DGY  T
Sbjct: 136 TWAGKRLPTEAEWEYSCRGGLEKRLFPWGNKLQPKGQHYANIWQGVFPSNNTAEDGYKGT 195

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           APV ++  N +GLYN+VGN WEWT+DWW VHH    ++NPKGP++GTD+VKKGGSY+C++
Sbjct: 196 APVTAFPPNGYGLYNIVGNAWEWTSDWWAVHHSADEAHNPKGPSSGTDRVKKGGSYMCHK 255

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
            YCYR+RCAARSQNTPDSSA NLGFRCAAD  P
Sbjct: 256 SYCYRYRCAARSQNTPDSSASNLGFRCAADALP 288


>gi|395824557|ref|XP_003785529.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Otolemur
           garnettii]
          Length = 374

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 189/346 (54%), Positives = 229/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVERY---KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P PVER      MVL+P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F
Sbjct: 77  PGPIPVERPLAPTKMVLIPAGVFTMGTDDPQIKQDGEAPARRVAVDAFYMDAYEVSNAEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DS + HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSNVLHRQDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 267 VTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSIEETVNPKGPPSGKD 326

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSS+ NLGFRCAAD  PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSSSNLGFRCAADHLPT 372


>gi|402859535|ref|XP_003894210.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Papio anubis]
          Length = 374

 Score =  380 bits (975), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 189/342 (55%), Positives = 228/342 (66%), Gaps = 53/342 (15%)

Query: 8   PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81  PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           ++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                            
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ---------------------------- 172

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
             AVA   W    LP +                   +W HPEG DSTI HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIWHRPDHPVLHVS 210

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           WNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
             DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKK
Sbjct: 271 GEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKK 330

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           GGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 331 GGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372


>gi|410951604|ref|XP_003982484.1| PREDICTED: sulfatase-modifying factor 1 [Felis catus]
          Length = 376

 Score =  379 bits (974), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 184/334 (55%), Positives = 226/334 (67%), Gaps = 50/334 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 93  MVLIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 152

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 153 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 182

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW
Sbjct: 183 ---LPVKG-----------------ANWRHPEGPDSTVRHRPDHPVLHVSWNDAVAYCTW 222

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 223 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 282

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW VHH    + NP+   +G D+VKKGGSY+C++ Y
Sbjct: 283 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEKTLNPEPAPSGKDRVKKGGSYMCHKSY 342

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           CYR+RCAARSQNTPDSSA NLGFRCAAD+ PTTG
Sbjct: 343 CYRYRCAARSQNTPDSSASNLGFRCAADRQPTTG 376


>gi|22761561|dbj|BAC11634.1| unnamed protein product [Homo sapiens]
          Length = 374

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 190/346 (54%), Positives = 230/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DA Y+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDALYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQG+FP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGDFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372


>gi|85544097|pdb|2AII|X Chain X, Wild-Type Formylglycine Generating Enzyme Reacted With
           Iodoacetamide
          Length = 286

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 186/334 (55%), Positives = 225/334 (67%), Gaps = 50/334 (14%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F++FV++TGY+T
Sbjct: 3   HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLT 62

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAEKFGD+FVFE +LSE+ +  I Q                              AVA  
Sbjct: 63  EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 92

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W    LP +                   +W HPEG DSTI HR +HPV+HVSWNDAVAY
Sbjct: 93  PWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPVLHVSWNDAVAY 132

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  
Sbjct: 133 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 192

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGSY+C+
Sbjct: 193 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCH 252

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
             YCYR+RCAARSQNTPDSSA NLGFRCAAD+ P
Sbjct: 253 RSYCYRYRCAARSQNTPDSSASNLGFRCAADRLP 286


>gi|291408728|ref|XP_002720667.1| PREDICTED: CG7049-like [Oryctolagus cuniculus]
          Length = 380

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 186/333 (55%), Positives = 222/333 (66%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN  F++FV++TGY+TEAE
Sbjct: 97  MVLIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNADFEKFVNSTGYLTEAE 156

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+FVFE +LSE  +  I Q                              AVA   W 
Sbjct: 157 TFGDSFVFEGMLSERVKTDIQQ------------------------------AVAAAPWW 186

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                    W HPEG DSTI HR +HPV+HVSWNDAVAYCTW
Sbjct: 187 ---LPVKG-----------------ADWRHPEGPDSTIRHRPHHPVLHVSWNDAVAYCTW 226

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L PRG+H AN+WQG+FP  NT  DG+  TAP
Sbjct: 227 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPRGQHYANLWQGDFPVTNTGEDGFRGTAP 286

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYNMVGN WEWT+DWW  HH    + NPKGP  G D+VKKGGSY+C++ Y
Sbjct: 287 VDAFPPNGYGLYNMVGNAWEWTSDWWTRHHSVEETLNPKGPPLGKDRVKKGGSYMCHKSY 346

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAAD+ P+T
Sbjct: 347 CYRYRCAARSQNTPDSSASNLGFRCAADRLPST 379


>gi|67463713|pdb|1Y1F|X Chain X, Human Formylglycine Generating Enzyme With Cysteine
           Sulfenic Acid
 gi|67463717|pdb|1Y1J|X Chain X, Human Formylglycine Generating Enzyme, Sulfonic
           AcidDESULFURATED FORM
          Length = 311

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 191/346 (55%), Positives = 230/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 5   PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 64

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 65  EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 100

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 101 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 134

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 135 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 194

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 195 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 254

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+ +  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 255 RVKKGGSYMXHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 300


>gi|85544095|pdb|2AFT|X Chain X, Formylglycine Generating Enzyme C336s Mutant
 gi|85544098|pdb|2AIJ|X Chain X, Formylglycine Generating Enzyme C336s Mutant Covalently
           Bound To Substrate Peptide Ctpsr
 gi|85544100|pdb|2AIK|X Chain X, Formylglycine Generating Enzyme C336s Mutant Covalently
           Bound To Substrate Peptide Lctpsra
 gi|146386507|pdb|2HI8|X Chain X, Human Formylglycine Generating Enzyme, C336s Mutant,
           Bromide Co- Crystallization
 gi|146386509|pdb|2HIB|X Chain X, Human Formylglycine Generating Enzyme, C336s Mutant,
           Iodide Co- Crystallization
          Length = 286

 Score =  376 bits (966), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 185/334 (55%), Positives = 224/334 (67%), Gaps = 50/334 (14%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F++FV++TGY+T
Sbjct: 3   HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLT 62

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAEKFGD+FVFE +LSE+ +  I Q                              AVA  
Sbjct: 63  EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 92

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W    LP +                   +W HPEG DSTI HR +HPV+HVSWNDAVAY
Sbjct: 93  PWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPVLHVSWNDAVAY 132

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  
Sbjct: 133 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 192

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGSY+ +
Sbjct: 193 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMSH 252

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
             YCYR+RCAARSQNTPDSSA NLGFRCAAD+ P
Sbjct: 253 RSYCYRYRCAARSQNTPDSSASNLGFRCAADRLP 286


>gi|351703202|gb|EHB06121.1| Sulfatase-modifying factor 1 [Heterocephalus glaber]
          Length = 374

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 182/333 (54%), Positives = 226/333 (67%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91  MVLIPAGVFTMGTDDPQIKQDGEAPARRVAVDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 151 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 180

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DST++HR +HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVQHRPDHPVLHVSWNDAVAYCTW 220

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+ +H AN+WQGEFP  NT  DG+  TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKDQHYANLWQGEFPMTNTGEDGFQGTAP 280

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGNVWEWT+DWW ++H    + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNVWEWTSDWWTIYHSVEETLNPKGPPSGKDRVKKGGSYMCHKSY 340

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAA+  PT 
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAANHLPTA 373


>gi|126336367|ref|XP_001374411.1| PREDICTED: sulfatase-modifying factor 1-like [Monodelphis
           domestica]
          Length = 396

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 181/333 (54%), Positives = 227/333 (68%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +P   F MGT++P + +DGE P+R V +++FY+D +EVSN +F+ FV++TGYVTEAE
Sbjct: 114 MAHIPAGVFTMGTDEPQIKQDGEGPARRVRINSFYMDLYEVSNAEFERFVNSTGYVTEAE 173

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVF+ +LS++ ++ I Q                              AVA   W 
Sbjct: 174 KFGDSFVFDSMLSDQVKSDIHQ------------------------------AVAAAPWW 203

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS+I HR +HPV+HVSWNDAVAYCTW
Sbjct: 204 ---LPVKG-----------------ANWRHPEGPDSSILHRRDHPVLHVSWNDAVAYCTW 243

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGLENRLFPWGN L P+G+H AN+WQGEFP +NT  DGY  TAP
Sbjct: 244 AGKRLPTEAEWEYSCRGGLENRLFPWGNKLQPKGQHYANIWQGEFPVSNTGEDGYQGTAP 303

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWT+DWW VHH    + +PKGP +G+D+VKKGGSY+C++ Y
Sbjct: 304 VTAFPPNGYGLYNIVGNAWEWTSDWWTVHHSADETLDPKGPPSGSDRVKKGGSYMCHKSY 363

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAAD+ P T
Sbjct: 364 CYRYRCAARSQNTPDSSASNLGFRCAADRLPDT 396


>gi|116063450|gb|AAI23111.1| LOC432261 protein [Xenopus laevis]
          Length = 357

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 181/329 (55%), Positives = 224/329 (68%), Gaps = 50/329 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + MVL+P   F MGT++P + +DGE P+R V LD+FY+D +EVSN++F++FV ATG+VTE
Sbjct: 76  EQMVLIPSGVFTMGTDEPGIPQDGESPARKVHLDSFYMDMYEVSNSEFEKFVKATGHVTE 135

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE FGD+FVFE L++EE ++ I Q                              AVA   
Sbjct: 136 AETFGDSFVFEGLITEEVKSTIHQ------------------------------AVAAAP 165

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W    LP +                    W HPEG DS I +RM+HP++HVSWNDA A+C
Sbjct: 166 WW---LPVKG-----------------ADWRHPEGPDSNILNRMDHPILHVSWNDASAFC 205

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           TW G RLPTEAEWEY CRGGLENR+FPWGN L P+G+H ANVWQG+FPT NT  DGY+ T
Sbjct: 206 TWAGKRLPTEAEWEYACRGGLENRIFPWGNKLEPKGQHYANVWQGDFPTQNTGEDGYVQT 265

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           +PV ++  N +GLYNMVGN WEWT+DWW+ HH     +NPKGP+TG D+VKKGGSY+C+E
Sbjct: 266 SPVTAFPANGYGLYNMVGNAWEWTSDWWDTHHTADEVHNPKGPSTGNDRVKKGGSYMCHE 325

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            YCYR+RCAARSQNTPDSSA NLGFRC A
Sbjct: 326 SYCYRYRCAARSQNTPDSSASNLGFRCVA 354


>gi|432092475|gb|ELK25090.1| Sulfatase-modifying factor 1 [Myotis davidii]
          Length = 284

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 187/333 (56%), Positives = 221/333 (66%), Gaps = 50/333 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN  F+ FV+ATGY+TEAE
Sbjct: 1   MVSIPAGVFTMGTDDPQIRQDGEAPARRVTVDAFYMDAYEVSNADFERFVNATGYLTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE LLSE+ +A I Q                              AVA   W 
Sbjct: 61  KFGDSFVFEGLLSEQVKAGIQQ------------------------------AVAAAPWW 90

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   SW HPEG DST+ HR +HPV+HVSWNDAVAYC W
Sbjct: 91  ---LPVKG-----------------ASWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCAW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL +RLFPWGN L PRG+H AN+WQGEFP  +T  DG+  TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLHDRLFPWGNKLQPRGQHYANLWQGEFPVTDTGEDGFRGTAP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GLYN+VGN WEWTADWW V H    + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 191 VDAFPPNGYGLYNIVGNAWEWTADWWAVSHSAEEALNPKGPPSGKDRVKKGGSYMCHKSY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           CYR+RCAARSQNTPDSSA NLGFRCAAD  P T
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADHLPIT 283


>gi|67463714|pdb|1Y1G|X Chain X, Human Formylglycine Generating Enzyme, Double Sulfonic
           Acid Form
 gi|73535823|pdb|1Z70|X Chain X, 1.15a Resolution Structure Of The Formylglycine Generating
           Enzyme Fge
          Length = 311

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 190/346 (54%), Positives = 229/346 (66%), Gaps = 54/346 (15%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 5   PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 64

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 65  EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 100

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 101 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 134

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 135 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 194

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 195 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 254

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +VKKGGSY+ +  Y YR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 255 RVKKGGSYMXHRSYXYRYRCAARSQNTPDSSASNLGFRCAADRLPT 300


>gi|47937785|gb|AAH72353.1| LOC432261 protein, partial [Xenopus laevis]
          Length = 340

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 179/329 (54%), Positives = 223/329 (67%), Gaps = 50/329 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + M+L+P   F MGT++P + +DGE P+R V LD+FY+D +EVSN++F++FV ATG+VTE
Sbjct: 59  EQMILIPSGVFTMGTDEPGIPQDGESPARKVHLDSFYMDMYEVSNSEFEKFVKATGHVTE 118

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE FGD+FVFE L++EE ++ I Q                              AVA   
Sbjct: 119 AETFGDSFVFEGLITEEVKSTIHQ------------------------------AVAAAP 148

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W    LP +                    W HPEG DS I +RM+HP++HVSWNDA A+C
Sbjct: 149 WW---LPVKG-----------------ADWRHPEGPDSNILNRMDHPILHVSWNDASAFC 188

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           TW G RLPTEAEWEY CRGGLENR+FPWGN L P+G+H ANVWQG+FPT NT  DGY+  
Sbjct: 189 TWAGKRLPTEAEWEYACRGGLENRIFPWGNKLEPKGQHYANVWQGDFPTQNTGEDGYVQI 248

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           +PV ++  N +GLYNMVGN WEWT+DWW+ HH     +NPKGP+TG D+VKKGGSY+C+E
Sbjct: 249 SPVTAFPANGYGLYNMVGNAWEWTSDWWDTHHTADEVHNPKGPSTGNDRVKKGGSYMCHE 308

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            YCYR+RCAARSQNTPDSSA NLGFRC A
Sbjct: 309 SYCYRYRCAARSQNTPDSSASNLGFRCVA 337


>gi|85544096|pdb|2AFY|X Chain X, Formylglycine Generating Enzyme C341s Mutant
          Length = 286

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 184/334 (55%), Positives = 223/334 (66%), Gaps = 50/334 (14%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F++FV++TGY+T
Sbjct: 3   HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLT 62

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAEKFGD+FVFE +LSE+ +  I Q                              AVA  
Sbjct: 63  EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 92

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W    LP +                   +W HPEG DSTI HR +HPV+HVSWNDAVAY
Sbjct: 93  PWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPVLHVSWNDAVAY 132

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  
Sbjct: 133 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 192

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGSY+ +
Sbjct: 193 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMXH 252

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
             Y YR+RCAARSQNTPDSSA NLGFRCAAD+ P
Sbjct: 253 RSYSYRYRCAARSQNTPDSSASNLGFRCAADRLP 286


>gi|225714898|gb|ACO13295.1| Sulfatase-modifying factor 1 precursor [Esox lucius]
          Length = 378

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 187/332 (56%), Positives = 221/332 (66%), Gaps = 50/332 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV +PG  F MGT+ P +  DGE P R V L+ FY+D HEVSN QFQ F +ATGYVTEA
Sbjct: 97  EMVFVPGGEFLMGTDDPGIPPDGESPQRKVHLNHFYMDVHEVSNHQFQSFTNATGYVTEA 156

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           EKFGD+FVFE LLSEE +++ISQ                              AVA    
Sbjct: 157 EKFGDSFVFEGLLSEEVKSQISQ------------------------------AVA---- 182

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                   A W    RG           W HPEG DS+I  R++HPV+HVSW DAVAYCT
Sbjct: 183 -------AAPWWLPVRGA---------DWRHPEGPDSSIADRLDHPVLHVSWGDAVAYCT 226

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWE+ CRGGL++RL+PWGN L P+G+H AN+WQGEFPT+NT  DGY+ T+
Sbjct: 227 WAHKRLPTEAEWEFACRGGLQDRLYPWGNKLNPKGQHYANLWQGEFPTHNTGEDGYIKTS 286

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV S+  N +GLYNMVGN WEWTADWW VHH      NP GP +GTDKVKKGGSY+C++ 
Sbjct: 287 PVRSFPPNGYGLYNMVGNAWEWTADWWTVHHTTDRQPNPAGPPSGTDKVKKGGSYMCHKS 346

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           YCYR+RCAARSQNTPDSSA NLGFRC + + P
Sbjct: 347 YCYRYRCAARSQNTPDSSAPNLGFRCVSPEQP 378


>gi|63101761|gb|AAH95053.1| Zgc:136465 protein [Danio rerio]
          Length = 388

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 181/331 (54%), Positives = 219/331 (66%), Gaps = 50/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +VLL G  F MGT+ P + +DGE P R V L AFY+++HEV+N QFQ F + TGY+TEAE
Sbjct: 108 LVLLQGGWFLMGTDDPGIPQDGEGPQRKVKLGAFYIEEHEVTNQQFQHFTNQTGYITEAE 167

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+FVFE LLSEE ++ +S                     H V    W   V    WR
Sbjct: 168 RFGDSFVFEGLLSEEVKSTLS---------------------HAVAAAPWWSPVKGADWR 206

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                        HPEG DSTI +RMNHP +HVSW+DA AYC W
Sbjct: 207 -----------------------------HPEGPDSTIHNRMNHPALHVSWDDARAYCQW 237

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
              RLPTEAEWE  CRGGL++R++PWGN L PRG+H AN+WQG+FP +NTA DGY +T+P
Sbjct: 238 AKRRLPTEAEWELACRGGLQDRMYPWGNKLMPRGQHYANLWQGDFPNHNTAEDGYANTSP 297

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           VMS+  N FGLY+MVGN WEWTADWW VHH     +NPKGP +GTD+VKKGGSY+C++ Y
Sbjct: 298 VMSFPANGFGLYDMVGNAWEWTADWWTVHHSAEDKFNPKGPESGTDRVKKGGSYMCHKSY 357

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           CYR+RCAARSQNTPDSSA NLGFRCA+D  P
Sbjct: 358 CYRYRCAARSQNTPDSSASNLGFRCASDVDP 388


>gi|332231567|ref|XP_003264966.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Nomascus
           leucogenys]
          Length = 374

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 183/335 (54%), Positives = 221/335 (65%), Gaps = 50/335 (14%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+T
Sbjct: 88  HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLT 147

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAEKFGD+FVFE +LSE+ +  I Q                              AVA  
Sbjct: 148 EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 177

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W    LP +                   +W HPE  DSTI HR +HPV+HVSWNDA AY
Sbjct: 178 PWW---LPVKG-----------------ANWRHPEWPDSTIRHRPDHPVLHVSWNDAFAY 217

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  
Sbjct: 218 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 277

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGSY+C+
Sbjct: 278 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCH 337

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
             YCYR+RCAARSQNTPDSSA NLGFRCAAD  PT
Sbjct: 338 RSYCYRYRCAARSQNTPDSSASNLGFRCAADCLPT 372


>gi|213511532|ref|NP_001133607.1| sulfatase-modifying factor 1 precursor [Salmo salar]
 gi|209154660|gb|ACI33562.1| Sulfatase-modifying factor 1 precursor [Salmo salar]
          Length = 378

 Score =  370 bits (949), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 181/331 (54%), Positives = 222/331 (67%), Gaps = 50/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+PG  F MGT+ P + +DGE P R V LD FY+D HEVSN  FQ F++ATGY+TEAE
Sbjct: 98  MVLIPGGEFLMGTDNPGIPQDGEGPQRRVHLDHFYMDAHEVSNRHFQSFINATGYITEAE 157

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+FVFE +LSEE +++ISQ                              AVA     
Sbjct: 158 RFGDSFVFEGVLSEEVKSQISQ------------------------------AVA----- 182

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W    RG           W HPEG DS+I  R++HPV+HVSW DAVAYC+W
Sbjct: 183 ------AAPWWLPVRGA---------DWRHPEGPDSSITGRLDHPVLHVSWQDAVAYCSW 227

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
              RLPTEAEWEY CRGGL++RL+PWGN L P+G+H AN+WQG+FPT+N+  DGY  T+P
Sbjct: 228 AHKRLPTEAEWEYACRGGLQDRLYPWGNKLKPKGQHYANLWQGKFPTHNSEEDGYTKTSP 287

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V S+  N +GLYNMVGN WEWT+DWW VHH     +NP GP +GTD+VKKGGSY+C++ Y
Sbjct: 288 VKSFPANGYGLYNMVGNAWEWTSDWWTVHHTTDERHNPAGPPSGTDRVKKGGSYMCHKSY 347

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           CYR+RCAARSQNTPDSSA NLGFRC + + P
Sbjct: 348 CYRYRCAARSQNTPDSSASNLGFRCVSQEQP 378


>gi|327266132|ref|XP_003217860.1| PREDICTED: sulfatase-modifying factor 1-like [Anolis carolinensis]
          Length = 369

 Score =  369 bits (947), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 184/350 (52%), Positives = 231/350 (66%), Gaps = 56/350 (16%)

Query: 2   VLLPAPPVERYKD------MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEV 55
           V LP  P    KD      M  +P   F MGT++P + +DGE+P+R + ++ F++D +EV
Sbjct: 66  VHLPQQPGNGPKDKDEGRQMASIPAGVFTMGTDEPAIQQDGEWPARKIHINHFFMDLYEV 125

Query: 56  SNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRM 115
           SN +F++FV+ATGY+TEAE+FGD+FVFE +LSE+ +++I Q                   
Sbjct: 126 SNREFEKFVNATGYLTEAERFGDSFVFEGMLSEQVKSEIHQ------------------- 166

Query: 116 HHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHR 175
                      AVA   W    LP +                    W HP+G DS I  R
Sbjct: 167 -----------AVAAAPWW---LPVKG-----------------ADWKHPDGPDSNIVDR 195

Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
           M+HPV+HVSWNDAVA+CTW G RLPTEAEWEY CRGGLENRLFPWGN L P G+H AN+W
Sbjct: 196 MDHPVLHVSWNDAVAFCTWAGKRLPTEAEWEYSCRGGLENRLFPWGNKLHPNGQHYANIW 255

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP 295
           QG+FPT+NT  DGY  TAPV ++  N +GLYN+VGN WEWT+DWW++ H    + NPKGP
Sbjct: 256 QGDFPTSNTREDGYKGTAPVTAFPPNSYGLYNIVGNAWEWTSDWWSISHSADTAQNPKGP 315

Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
            TGTD+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD  P
Sbjct: 316 LTGTDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAADTLP 365


>gi|242014044|ref|XP_002427708.1| Sulfatase-modifying factor 1 precursor, putative [Pediculus humanus
           corporis]
 gi|212512143|gb|EEB14970.1| Sulfatase-modifying factor 1 precursor, putative [Pediculus humanus
           corporis]
          Length = 365

 Score =  367 bits (941), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 180/341 (52%), Positives = 222/341 (65%), Gaps = 50/341 (14%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R   MV L G  F MGT+KPI + DGE P R+V +  FYLD++EVSN +F +FVS TGY 
Sbjct: 75  RTNQMVYLKGGKFEMGTDKPIFVADGEGPKRSVLIKPFYLDKYEVSNAEFSKFVSDTGYK 134

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TE E FGD+FVFE LLS+  +  ++Q                              AVA 
Sbjct: 135 TEVEGFGDSFVFENLLSDAVKDTVTQ------------------------------AVAR 164

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W    LP +                   +W+HPEG DS+I  +M+HPVVHVSWNDA A
Sbjct: 165 APWW---LPVKG-----------------ANWMHPEGPDSSISEKMDHPVVHVSWNDAKA 204

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W   RLPTEAEWEY CR G ++RLFPWGNN  P+GE+RAN+WQG FP+++   DGY+
Sbjct: 205 YCDWAQKRLPTEAEWEYACRAGKKDRLFPWGNNFQPKGEYRANIWQGAFPSHDEGLDGYV 264

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            T+PV ++ EN FGL NM+GNVWEWT+D+W++ H+  P  NP GPT GTDKVKKGGS+LC
Sbjct: 265 GTSPVTAFHENDFGLKNMIGNVWEWTSDFWDIKHNKEPVENPTGPTKGTDKVKKGGSFLC 324

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDK 351
           ++ YCYR+RCAARSQNTPDSSA NLGFRCAADK P    D+
Sbjct: 325 HKSYCYRYRCAARSQNTPDSSASNLGFRCAADKLPDYLIDE 365


>gi|432859552|ref|XP_004069151.1| PREDICTED: sulfatase-modifying factor 1-like [Oryzias latipes]
          Length = 368

 Score =  366 bits (940), Expect = 9e-99,   Method: Compositional matrix adjust.
 Identities = 179/329 (54%), Positives = 222/329 (67%), Gaps = 50/329 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  + G  F MGT++P +  DGE P R V +D+F++D HEV+N QFQ FV ATGY+TEAE
Sbjct: 88  MAWISGGEFLMGTDQPGIAADGEGPQRPVHVDSFHMDIHEVTNRQFQSFVRATGYITEAE 147

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE ++SE+ +++ISQ                              AVA   W 
Sbjct: 148 KFGDSFVFEGIVSEDVKSQISQ------------------------------AVAAAPWW 177

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS I  RM+HPVV VSWNDA+AYC+W
Sbjct: 178 ---LPVKG-----------------ANWKHPEGPDSNITDRMDHPVVQVSWNDALAYCSW 217

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
              RLPTEAEWE  CRGGL++RL+PWGN L P+G+H AN+WQGEFP +N+A DGY  T+P
Sbjct: 218 VNKRLPTEAEWEVACRGGLKDRLYPWGNKLNPKGQHYANLWQGEFPAHNSAEDGYTKTSP 277

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           VMS+  N FGLY+MVGN WEWT+DWW+VHH     +NP GP +GTDKVKKGGSY+C++ Y
Sbjct: 278 VMSFPGNAFGLYDMVGNAWEWTSDWWSVHHTTDHQHNPTGPPSGTDKVKKGGSYMCHKSY 337

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           CYR+RCAARSQNTPDSSA NLGFRCAA++
Sbjct: 338 CYRYRCAARSQNTPDSSASNLGFRCAAEQ 366


>gi|209730894|gb|ACI66316.1| Sulfatase-modifying factor 1 precursor [Salmo salar]
          Length = 379

 Score =  366 bits (939), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 181/332 (54%), Positives = 222/332 (66%), Gaps = 51/332 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+PG  F MGT+ P + +DGE P R V LD FY+D HEVSN  FQ F++ATGY+TEAE
Sbjct: 98  MVLIPGGEFLMGTDNPGIPQDGEGPQRRVHLDHFYMDAHEVSNRHFQSFINATGYITEAE 157

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+FVFE +LSEE +++ISQ                              AVA     
Sbjct: 158 RFGDSFVFEGVLSEEVKSQISQ------------------------------AVA----- 182

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIE-HRMNHPVVHVSWNDAVAYCT 193
                  A W    RG           W HPEG DS+I   R++HPV+HVSW DAVAYC+
Sbjct: 183 ------AAPWWLPVRGA---------DWRHPEGPDSSITGSRLDHPVLHVSWQDAVAYCS 227

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWEY CRGGL++RL+PWGN L P+G+H AN+WQG+FPT+N+  DGY  T+
Sbjct: 228 WAHKRLPTEAEWEYACRGGLQDRLYPWGNKLKPKGQHYANLWQGKFPTHNSEEDGYTKTS 287

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV S+  N +GLYNMVGN WEWT+DWW VHH     +NP GP +GTD+VKKGGSY+C++ 
Sbjct: 288 PVKSFPANGYGLYNMVGNAWEWTSDWWTVHHTTDERHNPAGPPSGTDRVKKGGSYMCHKS 347

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           YCYR+RCAARSQNTPDSSA NLGFRC + + P
Sbjct: 348 YCYRYRCAARSQNTPDSSASNLGFRCVSQEQP 379


>gi|225707352|gb|ACO09522.1| Sulfatase-modifying factor 1 precursor [Osmerus mordax]
          Length = 368

 Score =  365 bits (938), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 176/331 (53%), Positives = 224/331 (67%), Gaps = 50/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+ G  F MGT+ P +  DGE P R+V L+ F++D HEV+N QF+ F++ATGYVTEAE
Sbjct: 88  MVLISGGEFLMGTDNPCIPPDGEGPQRSVNLEPFFMDVHEVTNHQFKSFINATGYVTEAE 147

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+FVF+ +LSE  +++ISQ                              AVA   W 
Sbjct: 148 TFGDSFVFDGVLSEATKSQISQ------------------------------AVAAAPWW 177

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                    W HPEG DS+I  R +HPV+HVSW DAVAYC+W
Sbjct: 178 ---LPVKG-----------------ADWKHPEGPDSSITDRTDHPVLHVSWRDAVAYCSW 217

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
              RLPTEAEWE+ CRGGL++RL+PWGN LTP+G+H +N+WQG+FPT+N+  DGY+ T+P
Sbjct: 218 AQKRLPTEAEWEHACRGGLKDRLYPWGNKLTPKGQHYSNLWQGDFPTHNSGEDGYIKTSP 277

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           VMS+  N +GLY+MVGN WEWT+DWW+VHH     +NPKGP +GTDKVKKGGSY+C++ Y
Sbjct: 278 VMSFPANGYGLYDMVGNAWEWTSDWWSVHHTADRQHNPKGPPSGTDKVKKGGSYMCHKSY 337

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           CYR+RCAARSQNTPDSSA NLGFRC + + P
Sbjct: 338 CYRYRCAARSQNTPDSSASNLGFRCVSQEPP 368


>gi|410919449|ref|XP_003973197.1| PREDICTED: sulfatase-modifying factor 1-like [Takifugu rubripes]
          Length = 362

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 178/327 (54%), Positives = 220/327 (67%), Gaps = 50/327 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+PG  F MGT++P +  DGE P R V +D+FY+D HEV+N QFQ FV+ TGYVTEAE
Sbjct: 82  MVLIPGGEFLMGTDEPSIPADGEGPQRPVHVDSFYMDIHEVTNQQFQTFVADTGYVTEAE 141

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LS+  + +I+Q                              AVA   W 
Sbjct: 142 KFGDSFVFEGILSDSVKKEITQ------------------------------AVAAAPWW 171

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DS I  R++HPV+HVSWNDA+ YC+W
Sbjct: 172 ---LPVKG-----------------ANWKHPEGQDSNITDRLDHPVLHVSWNDALTYCSW 211

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
              RLPTEAEWE  CRGGL+NRL+PWGN L P+G+H AN+WQG+FP +NTA DGY+ T+P
Sbjct: 212 LHKRLPTEAEWECACRGGLQNRLYPWGNKLNPKGQHYANLWQGDFPNHNTAEDGYVKTSP 271

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           VMS+  N FGLY++VGNVWEWT+DWW VHH      NP GP +G+DKVKKGGSY+C++ Y
Sbjct: 272 VMSFPPNGFGLYDIVGNVWEWTSDWWMVHHTAEKLQNPTGPQSGSDKVKKGGSYMCHKSY 331

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAA 341
           CYR+RCAARSQNTPDSSA NLGFRC +
Sbjct: 332 CYRYRCAARSQNTPDSSASNLGFRCVS 358


>gi|427782167|gb|JAA56535.1| Putative sulfatase-modifying factor 1-like protein [Rhipicephalus
           pulchellus]
          Length = 376

 Score =  363 bits (933), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 179/335 (53%), Positives = 215/335 (64%), Gaps = 50/335 (14%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R   MVL+ G  F MGT+KP+ + DGE P+R V +  FY+D HEVSN +F+ FV ATG+V
Sbjct: 88  RTNQMVLVRGGQFEMGTDKPVFVADGENPARPVRVSDFYIDVHEVSNAEFERFVKATGHV 147

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE FGD+FV +  +SEE +  I+Q                              AVA 
Sbjct: 148 TEAETFGDSFVLDSAISEETKKGITQ------------------------------AVAA 177

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W    LP +                    W HPEG DS I  RM+HPVVHVSWNDA A
Sbjct: 178 APWW---LPVKG-----------------ADWRHPEGPDSHIHDRMDHPVVHVSWNDAAA 217

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           +C W G RL TEAEWEY CR GL+NRLFPWGNN TP+GE RAN+W+G+FP  NTA DGY+
Sbjct: 218 FCKWSGKRLLTEAEWEYACRAGLKNRLFPWGNNWTPKGETRANIWEGKFPVTNTAEDGYV 277

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            TAPV ++   KFGL N++GNVWEWT DWWN  H   P  NPKGP +GTDKVKKGGS++C
Sbjct: 278 GTAPVTAFPATKFGLKNIIGNVWEWTNDWWNTRHTSEPQDNPKGPPSGTDKVKKGGSFMC 337

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           ++ YCYR+RCAARSQNTPD+SA N+G RC ADK P
Sbjct: 338 HKAYCYRYRCAARSQNTPDTSAYNVGIRCGADKLP 372


>gi|348507653|ref|XP_003441370.1| PREDICTED: sulfatase-modifying factor 1-like [Oreochromis
           niloticus]
          Length = 387

 Score =  363 bits (932), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 181/334 (54%), Positives = 217/334 (64%), Gaps = 50/334 (14%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E    MV + G  F MGT+ P +  DGE P R V +D+F +D HEV+N QFQ FV ATGY
Sbjct: 102 ETQSPMVQISGGEFLMGTDNPGIPPDGEGPQRLVYVDSFNMDIHEVTNQQFQSFVLATGY 161

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VTEAEKFGD+FVFE LLSE  +++I+Q                              AVA
Sbjct: 162 VTEAEKFGDSFVFEGLLSEPVKSQITQ------------------------------AVA 191

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W    LP +                   SW HPEG DS I  R++HPV+HVSW DAV
Sbjct: 192 SAPWW---LPVKG-----------------ASWRHPEGPDSNITDRLHHPVLHVSWTDAV 231

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AYC+W   RLPTEAEWEY CRGGL++RL+PWGN L P+G+H AN+WQG+FPT+N+  DGY
Sbjct: 232 AYCSWANKRLPTEAEWEYACRGGLKDRLYPWGNKLNPKGQHFANLWQGDFPTHNSGEDGY 291

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
           + T+PVMS+  N FGLY+MVGN WEWT DWW VHH   P  NP GP +G DKVKKGGSY+
Sbjct: 292 IKTSPVMSFPANAFGLYDMVGNAWEWTTDWWTVHHTTDPQRNPTGPPSGKDKVKKGGSYM 351

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           C+  YCYR+RCAARSQNTPDSSA NLGFRC + +
Sbjct: 352 CHRSYCYRYRCAARSQNTPDSSASNLGFRCVSQE 385


>gi|308322299|gb|ADO28287.1| sulfatase-modifying factor 1 [Ictalurus furcatus]
          Length = 281

 Score =  358 bits (920), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 179/331 (54%), Positives = 215/331 (64%), Gaps = 50/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVLL G  F MGT+ P + +DGE   R V LD FY+D+HEV+N QFQ F + T +VTEAE
Sbjct: 1   MVLLQGGWFLMGTDDPGIPQDGEGSQRRVWLDPFYIDEHEVTNLQFQLFTNTTHHVTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+FVFE +LSEE                     ++  + H V    W   V    WR
Sbjct: 61  RFGDSFVFEGMLSEE---------------------VKSTLKHAVAAAPWWSPVKGADWR 99

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                        HPEG DS+I HRM+HPV+HVSW DA  YC W
Sbjct: 100 -----------------------------HPEGPDSSITHRMDHPVLHVSWADAQVYCEW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE+ CRGGL++RL+PWGN L PR +H AN+WQGEFP +NTA DG+  T+P
Sbjct: 131 AGRRLPTEAEWEFSCRGGLKDRLYPWGNKLMPREQHYANLWQGEFPNHNTAEDGFAQTSP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V  +  N FGLY+MVGNVWEWTADWW+VHH    +YNPKGP TG+D+VKKGGSY+C++ Y
Sbjct: 191 VKKFPANAFGLYDMVGNVWEWTADWWSVHHTRDHAYNPKGPETGSDRVKKGGSYMCHKSY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           CYR+RCAARSQNTPDSSA NLGFRCAA   P
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAAAATP 281


>gi|340367863|ref|XP_003382472.1| PREDICTED: sulfatase-modifying factor 1-like [Amphimedon
           queenslandica]
          Length = 360

 Score =  357 bits (916), Expect = 6e-96,   Method: Compositional matrix adjust.
 Identities = 177/327 (54%), Positives = 217/327 (66%), Gaps = 50/327 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++VL+PG  F+MG +KP++ +DGE PSR+VT+ +FY+D++EVSN ++ +FVS TGYVTE 
Sbjct: 81  ELVLIPGGVFKMGLDKPVIPEDGETPSRSVTITSFYMDKYEVSNDEYGKFVSDTGYVTET 140

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG++FV E  +S+E +  I+Q                              AVA   W
Sbjct: 141 ESFGNSFVVEYFISKEVQEGITQ------------------------------AVANAPW 170

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP +                   SW  PEGIDS I+ RM+HPVVHVSWNDA AYC 
Sbjct: 171 W---LPVDN-----------------ASWRQPEGIDSDIKGRMSHPVVHVSWNDADAYCK 210

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY CR GL++RLFPWGNN TP GEH  N+WQG+FP NNT  DGY    
Sbjct: 211 WAGKRLPTEAEWEYACRAGLKDRLFPWGNNPTPHGEHWMNIWQGKFPINNTLDDGYAGPG 270

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV  +  NKFGLYNMVGNVWEW +DWW  HH      +PKGP TG++KVKKGGSY+C+++
Sbjct: 271 PVDIFPPNKFGLYNMVGNVWEWVSDWWTTHHSKKDQKDPKGPPTGSEKVKKGGSYMCHKE 330

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCA 340
           YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 331 YCYRYRCAARSQNTPDSSAHNLGFRCA 357


>gi|47229531|emb|CAG06727.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 359

 Score =  354 bits (909), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 175/332 (52%), Positives = 218/332 (65%), Gaps = 50/332 (15%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E    MVL+PG  F MGT+ P +  DGE P R V +D+FY+D HEV+N QFQ FV+ATGY
Sbjct: 77  EARSPMVLVPGGAFLMGTDDPGIPADGESPQRLVYVDSFYMDTHEVTNRQFQMFVNATGY 136

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VTEAEKFGD+FVFE +LS++ + +I+Q                              AVA
Sbjct: 137 VTEAEKFGDSFVFEGILSDKVKKEITQ------------------------------AVA 166

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W    LP +                   +W HPEG+ S+I  R++HPV+HVSW DA 
Sbjct: 167 AAPWW---LPVKG-----------------ANWKHPEGLGSSITDRLDHPVLHVSWTDAR 206

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
            YC+W   RLPTEAEWE  CRGGL++RL+PWGN L P+G+H AN+WQG+FP  NT  DG+
Sbjct: 207 TYCSWLDKRLPTEAEWECACRGGLKDRLYPWGNKLNPKGKHYANLWQGDFPNYNTGEDGF 266

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
           + T+PVMS+  N FGLY++VGNVWEWT+DWW VHH      NP GP +G DKVKKGGSY+
Sbjct: 267 VKTSPVMSFPPNGFGLYDIVGNVWEWTSDWWTVHHSAENQSNPTGPQSGNDKVKKGGSYM 326

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           C++ YCYR+RCAARSQNTPDSSA NLGFRC +
Sbjct: 327 CHKSYCYRYRCAARSQNTPDSSASNLGFRCVS 358


>gi|260809998|ref|XP_002599791.1| hypothetical protein BRAFLDRAFT_57613 [Branchiostoma floridae]
 gi|229285073|gb|EEN55803.1| hypothetical protein BRAFLDRAFT_57613 [Branchiostoma floridae]
          Length = 372

 Score =  352 bits (904), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 183/337 (54%), Positives = 213/337 (63%), Gaps = 52/337 (15%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R   MV + G  F MGT+     +DGE P+R V +D+FY D +EVSN +F  FV  TG+V
Sbjct: 75  RTNQMVFVRGGDFTMGTDDVRFPQDGEGPARPVRVDSFYADVYEVSNAEFDIFVKETGFV 134

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAEKFGD+FV E L+SEE +++I+Q                              AVA 
Sbjct: 135 TEAEKFGDSFVLENLISEEVKSEITQ------------------------------AVAG 164

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W    LP +                    W HPEG+DS I +RM+HPVVHVSWNDAVA
Sbjct: 165 APWW---LPVKG-----------------ADWRHPEGVDSDIINRMDHPVVHVSWNDAVA 204

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YCTW   RLPTEAEWEY  RGGLENRL+PWGNN  P+ EHR N+WQG+FP  NT  DGY 
Sbjct: 205 YCTWAKKRLPTEAEWEYASRGGLENRLYPWGNNEMPKKEHRMNIWQGKFPEENTEEDGYY 264

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP--TTGTDKVKKGGSY 308
           STAPV SYK NKFGLYN  GNVWEW +DW +V H      NPKGP  +   DKVKKGGSY
Sbjct: 265 STAPVTSYKPNKFGLYNTAGNVWEWVSDWLSVRHTTELQVNPKGPAKSATNDKVKKGGSY 324

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +C++ YCYR+R AARSQNTPDSSA NLGFRC ADK P
Sbjct: 325 MCHKDYCYRYRNAARSQNTPDSSASNLGFRCFADKLP 361


>gi|307188144|gb|EFN72976.1| Sulfatase-modifying factor 1 [Camponotus floridanus]
          Length = 279

 Score =  351 bits (901), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 169/326 (51%), Positives = 210/326 (64%), Gaps = 50/326 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +   T+ +GTN P+++ DGE P R V LD+FY+D+ EVSN +F  FV  TGYVTEAE
Sbjct: 1   MIKINAGTYFIGTNNPVIVADGEGPKREVILDSFYIDKFEVSNEEFATFVKNTGYVTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+FVFE LL+++ + KI++                               VA   W 
Sbjct: 61  NFGDSFVFEDLLTQDMKNKITKT------------------------------VAQAPWW 90

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              +P +                   SW HPEG DS I +RM+HPV+HVSWNDA+AYC W
Sbjct: 91  ---MPVKQ-----------------ASWQHPEGPDSNITYRMDHPVIHVSWNDAIAYCNW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
              RLPTEAEWE  CRGGL +RL+PWGN   P  +H+AN+WQG FPT NT  DGY  T+P
Sbjct: 131 MKKRLPTEAEWEVACRGGLTDRLYPWGNKFMPNNQHKANIWQGNFPTKNTEKDGYKGTSP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V  + +NK+GL+N+VGNVWEWTADWW   H      NP GP++G DKVKKGGSYLC++ Y
Sbjct: 191 VTMFPQNKYGLHNIVGNVWEWTADWWITRHSSDQQTNPVGPSSGNDKVKKGGSYLCHKSY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCA 340
           CYR+RCAARSQNTPD+SAGNLGFRCA
Sbjct: 251 CYRYRCAARSQNTPDTSAGNLGFRCA 276


>gi|72006588|ref|XP_782973.1| PREDICTED: sulfatase-modifying factor 1 [Strongylocentrotus
           purpuratus]
          Length = 417

 Score =  349 bits (895), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 178/335 (53%), Positives = 206/335 (61%), Gaps = 50/335 (14%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R   M  + G TF+MGT+K  +  DGE PSR VTLD +Y D +EVSN++F+ FV+ T YV
Sbjct: 120 RTNQMNFIEGGTFQMGTDKAKIYLDGESPSRLVTLDPYYFDVYEVSNSEFELFVNTTSYV 179

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAEKFGD+FV E  +SEE +  ISQV                              VA 
Sbjct: 180 TEAEKFGDSFVLEARISEEVKKDISQV------------------------------VAA 209

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W    LP +                    W HPEG DS+I  RM+HPV H+SWNDA A
Sbjct: 210 APWW---LPVKG-----------------AEWRHPEGPDSSISSRMDHPVTHMSWNDATA 249

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLPTEAEWE   RGGL+NRLFPWGN L P+  HR N+WQGEFP  NTA DGY 
Sbjct: 250 YCQWAGKRLPTEAEWENAARGGLKNRLFPWGNKLMPKDHHRVNIWQGEFPKVNTAEDGYE 309

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            T PV +++ N +GLYN VGN WEW ADWWN  H P    NP GP  GTDKVKKGGSY+C
Sbjct: 310 GTCPVTAFEPNGYGLYNTVGNAWEWVADWWNTVHSPESQNNPVGPDEGTDKVKKGGSYMC 369

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +  YCYR+RC ARSQN+PDSSA NLGFRCAA   P
Sbjct: 370 HISYCYRYRCEARSQNSPDSSACNLGFRCAATNLP 404


>gi|307194482|gb|EFN76774.1| Sulfatase-modifying factor 1 [Harpegnathos saltator]
          Length = 344

 Score =  348 bits (892), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 171/331 (51%), Positives = 214/331 (64%), Gaps = 50/331 (15%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E+   MV +   T+ +GTN PI + DGE P R V LD+FY+D++EVSN +F  FVS+TGY
Sbjct: 61  EQTGKMVEIIAGTYFVGTNNPIFVVDGEGPKREVLLDSFYIDKYEVSNEEFAAFVSSTGY 120

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            TEAE+FGD+FVFE LLS+  +  I Q                              AVA
Sbjct: 121 TTEAERFGDSFVFEGLLSQNTKNTIKQ------------------------------AVA 150

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W    LP +                   +W HPEG DS I  RM+HPVVHVSWNDA 
Sbjct: 151 EAPWW---LPVKK-----------------ATWQHPEGPDSNITSRMDHPVVHVSWNDAT 190

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AYC+W G RLPTEAEWE  CRG + +R +PWGN + P+ +H+ N+WQGEFP  NT  DGY
Sbjct: 191 AYCSWLGKRLPTEAEWEAACRGNISDRHYPWGNKIMPKDQHKVNIWQGEFPLRNTMEDGY 250

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
            ST+PV ++ +NK+GL+N+VGNVWEWT+DWW  +H      NP GP +GT+KVKKGGSYL
Sbjct: 251 KSTSPVTTFPQNKYGLHNIVGNVWEWTSDWWMTNHSHDQQSNPTGPPSGTNKVKKGGSYL 310

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           C++ YC+R+RCAARSQNTPD+SAGNLGFRCA
Sbjct: 311 CHKNYCHRYRCAARSQNTPDTSAGNLGFRCA 341


>gi|325974319|dbj|BAJ83907.1| sulfatase modifying factor 1 [Hemicentrotus pulcherrimus]
          Length = 423

 Score =  347 bits (891), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 177/335 (52%), Positives = 204/335 (60%), Gaps = 50/335 (14%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R   M  + G TFRMGT+K  +  DGE PSR VTLD +Y D +EVSN++F+ FV+ T Y+
Sbjct: 126 RTNQMNFIEGGTFRMGTDKAKIYLDGESPSRLVTLDPYYFDVYEVSNSEFELFVNTTSYI 185

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAEKFGD+FV E  +SEE +  ISQV                              VA 
Sbjct: 186 TEAEKFGDSFVLEARISEEVKKDISQV------------------------------VAA 215

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W    LP +                    W HPEG DS+I  RM+HPV H+SWNDA A
Sbjct: 216 APWW---LPVKG-----------------AEWRHPEGPDSSISSRMDHPVTHISWNDATA 255

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLPTEAEWE   RGGL NRLFPWGN L P+  HR N+WQGEFP  NTA DGY 
Sbjct: 256 YCQWAGKRLPTEAEWENAARGGLNNRLFPWGNKLMPKDHHRVNIWQGEFPKVNTAEDGYE 315

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            T PV +++ N +GLYN VGN WEW ADWW   H P    NP GP  GTDKVKKGGSY+C
Sbjct: 316 GTCPVTAFEPNGYGLYNTVGNAWEWVADWWTTVHSPESQNNPVGPDEGTDKVKKGGSYMC 375

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +  YCYR+RC ARSQN+PDSSA NLGFRCAA   P
Sbjct: 376 HISYCYRYRCEARSQNSPDSSACNLGFRCAATNLP 410


>gi|345485300|ref|XP_001600441.2| PREDICTED: sulfatase-modifying factor 1-like [Nasonia vitripennis]
          Length = 416

 Score =  347 bits (891), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 172/340 (50%), Positives = 214/340 (62%), Gaps = 50/340 (14%)

Query: 2   VLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           ++ P    +   +MV + G ++ +GTN P+ + DGE P R V L +FY+D++EVSN  FQ
Sbjct: 125 IIKPIYKKDLTSNMVRIKGGSYFIGTNDPVFVSDGEGPRREVELGSFYIDKYEVSNADFQ 184

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV+A+ Y TEAEKFGD+FVFE LLS++ + +I +                         
Sbjct: 185 EFVTASSYKTEAEKFGDSFVFEGLLSKDIKNEIKE------------------------- 219

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
                AVA           +A W    +G +         W  PEG DS I+ RM+HPVV
Sbjct: 220 -----AVA-----------QAPWWLPVKGAV---------WSQPEGRDSNIKDRMDHPVV 254

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
           HVSWNDAVAYC W G RLPTEAEWE  CRGGL +RLFPWGN L P  EHR N+WQGEFP 
Sbjct: 255 HVSWNDAVAYCKWIGKRLPTEAEWEVTCRGGLNDRLFPWGNKLMPNNEHRTNIWQGEFPL 314

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK 301
           NNT  DG+  T PV  +  N +G++N++GNVWEWT+DWW   H      NP G   G+DK
Sbjct: 315 NNTVEDGFDRTNPVSYFPPNAYGVHNIIGNVWEWTSDWWKTKHTDEKQTNPTGAPNGSDK 374

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           VKKGGSYLC++ YCYR+RCAARSQNTPD+SAGNLGFRCA 
Sbjct: 375 VKKGGSYLCHKSYCYRYRCAARSQNTPDTSAGNLGFRCAV 414


>gi|297670801|ref|XP_002813543.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Pongo abelii]
          Length = 354

 Score =  342 bits (877), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 178/346 (51%), Positives = 214/346 (61%), Gaps = 74/346 (21%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR++HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIRHRLDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NP        
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                        YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 352



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/32 (87%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348


>gi|257470977|ref|NP_001158147.1| sulfatase-modifying factor 1 isoform 3 precursor [Homo sapiens]
          Length = 354

 Score =  341 bits (875), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 179/346 (51%), Positives = 214/346 (61%), Gaps = 74/346 (21%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NP        
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                        YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 352



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/32 (87%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348


>gi|383865019|ref|XP_003707974.1| PREDICTED: sulfatase-modifying factor 1-like [Megachile rotundata]
          Length = 355

 Score =  341 bits (874), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 172/329 (52%), Positives = 204/329 (62%), Gaps = 51/329 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+M  +   +F +GT+ PI + DGE P R V+LD+FY+D+ EVSN  F  FV AT Y TE
Sbjct: 74  KNMAKIEEGSFTVGTDDPIFVADGEGPKRRVSLDSFYMDELEVSNRDFAMFVDATAYRTE 133

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEKFGD+FVFE LL +E                     I   +   V    W   +    
Sbjct: 134 AEKFGDSFVFEGLLEQE---------------------IRANISSVVARAPWWVQIKNAN 172

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W+                             HPEG  S I HRM+HPVVHVSWNDA+AYC
Sbjct: 173 WQ-----------------------------HPEGPGSNITHRMDHPVVHVSWNDALAYC 203

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWE  CRGGL +RL+PWGN LTP G++RAN WQGEFP+ N   DGY  T
Sbjct: 204 KWLGKRLPTEAEWEVACRGGLSDRLYPWGNKLTPNGQYRANTWQGEFPSINLKEDGYEGT 263

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           +PV  +  NK+GL NMVGNVWEWT+DWW V       S NP GP+ GTDKVKKGGSY+C+
Sbjct: 264 SPVSKFPPNKYGLRNMVGNVWEWTSDWWTVEVAKRGGSSNPNGPSDGTDKVKKGGSYMCH 323

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           E YC+R+RCAARSQNTPD+SAGNLGFRCA
Sbjct: 324 ESYCFRYRCAARSQNTPDTSAGNLGFRCA 352


>gi|66533071|ref|XP_392816.2| PREDICTED: sulfatase-modifying factor 1-like isoform 2 [Apis
           mellifera]
          Length = 367

 Score =  340 bits (873), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 163/333 (48%), Positives = 210/333 (63%), Gaps = 51/333 (15%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +   K+M+ +    F +GTN PI I DGE P + + L++FY+D+ EVSN  F +FV AT 
Sbjct: 82  INHLKNMIKIKAGIFGIGTNDPIFIADGEGPKQQIYLNSFYIDKTEVSNHDFSKFVDATN 141

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y TEAE+FGD+FVF  LL ++++  I+ V                               
Sbjct: 142 YKTEAERFGDSFVFRGLLKQKDQKNITLV------------------------------- 170

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                       +A W    +           +WLHPEG +S I++RM+HPV+HVSWNDA
Sbjct: 171 ----------VAQAPWWLQVKNA---------NWLHPEGPESDIKNRMDHPVIHVSWNDA 211

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           +AYC W G RLPTEAEWE  CRGGL +RL+PWGN L P G++RAN WQG+FP NNT  DG
Sbjct: 212 IAYCNWLGKRLPTEAEWEVACRGGLSDRLYPWGNKLIPNGQYRANTWQGDFPNNNTKEDG 271

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-NPKGPTTGTDKVKKGGS 307
           Y  T+PV  +  NK+GLYNM+GNVWEWT DWW +       + NP GP+ G +KVKKGGS
Sbjct: 272 YEGTSPVREFPPNKYGLYNMIGNVWEWTFDWWTIEITTRGGFVNPTGPSQGINKVKKGGS 331

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           YLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 332 YLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 364


>gi|193659919|ref|XP_001952215.1| PREDICTED: sulfatase-modifying factor 1-like [Acyrthosiphon pisum]
          Length = 332

 Score =  340 bits (871), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 165/329 (50%), Positives = 211/329 (64%), Gaps = 50/329 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+L+ G  F +GT+KP ++ DGE PSR V LD +++D++EVSN +F++FV    Y TEAE
Sbjct: 54  MILIKGQNFTIGTDKPEIVSDGESPSRTVYLDDYFIDKYEVSNLEFKKFVEIMEYQTEAE 113

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+FVF+  LS+     I+Q   D                                  
Sbjct: 114 VFGDSFVFQLFLSKTTLKSITQAVKD---------------------------------- 139

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W    +G          +W +PEG DS I  R  HPVVHVSWNDA A+C W
Sbjct: 140 -------APWWVPVKGA---------NWKNPEGKDSDISDRELHPVVHVSWNDATAFCNW 183

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+EAEWE  CRGGL++RL+PWGN L P  +HR N+WQGEFP  NTA DG+ +TAP
Sbjct: 184 AGKRLPSEAEWEAACRGGLKDRLYPWGNKLLPSNKHRTNIWQGEFPFKNTAEDGWATTAP 243

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++ +N +GL+NMVGNVWEWT+D+W+V H  + + NP GPT G ++VKKGGSYLC+++Y
Sbjct: 244 VNTFLKNGYGLHNMVGNVWEWTSDFWSVDHDQSFTINPSGPTEGKERVKKGGSYLCHKEY 303

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           CYR+RCAARSQNTPDSSA NLGFRCA DK
Sbjct: 304 CYRYRCAARSQNTPDSSASNLGFRCAKDK 332


>gi|111306476|gb|AAI21124.1| SUMF1 protein [Homo sapiens]
          Length = 354

 Score =  339 bits (870), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 178/346 (51%), Positives = 213/346 (61%), Gaps = 74/346 (21%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R  T+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRATIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NP        
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                        YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 352



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/32 (87%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348


>gi|397522434|ref|XP_003831272.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Pan paniscus]
          Length = 354

 Score =  339 bits (869), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 178/346 (51%), Positives = 214/346 (61%), Gaps = 74/346 (21%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NP        
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                        YCYR+RCAARSQNTPDSSA NLGFRCAA++ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAANRLPT 352



 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 27/32 (84%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSSA NLGFRCAA+
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAN 348


>gi|332815964|ref|XP_003309635.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Pan troglodytes]
          Length = 354

 Score =  339 bits (869), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 178/346 (51%), Positives = 214/346 (61%), Gaps = 74/346 (21%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NP        
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                        YCYR+RCAARSQNTPDSSA NLGFRCAA++ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAANRLPT 352



 Score = 61.2 bits (147), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 27/32 (84%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSSA NLGFRCAA+
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAN 348


>gi|403270414|ref|XP_003927177.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Saimiri
           boliviensis boliviensis]
          Length = 356

 Score =  338 bits (868), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 176/346 (50%), Positives = 213/346 (61%), Gaps = 74/346 (21%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F
Sbjct: 79  PGPVPEERPLAHSKMVFIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDTYEVSNAEF 138

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 139 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 174

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DST+ HR +HPV
Sbjct: 175 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPV 208

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 209 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLNNRLFPWGNKLQPKGQHYANIWQGEFP 268

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW V+H    + NP        
Sbjct: 269 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVYHSVEETLNPS------- 321

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                        YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 322 -------------YCYRYRCAARSQNTPDSSASNLGFRCAADRPPT 354



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/32 (87%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 319 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 350


>gi|402859537|ref|XP_003894211.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Papio anubis]
          Length = 354

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 176/342 (51%), Positives = 211/342 (61%), Gaps = 73/342 (21%)

Query: 8   PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81  PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           ++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                            
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ---------------------------- 172

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
             AVA   W    LP +                   +W HPEG DSTI HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIWHRPDHPVLHVS 210

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           WNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
             DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NP            
Sbjct: 271 GEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS----------- 319

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                    YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 320 ---------YCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 352



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/32 (87%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348


>gi|395824561|ref|XP_003785531.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Otolemur
           garnettii]
          Length = 354

 Score =  338 bits (867), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 176/346 (50%), Positives = 212/346 (61%), Gaps = 74/346 (21%)

Query: 5   PAP-PVERY---KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P PVER      MVL+P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F
Sbjct: 77  PGPIPVERPLAPTKMVLIPAGVFTMGTDDPQIKQDGEAPARRVAVDAFYMDAYEVSNAEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DS + HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSNVLHRQDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NP        
Sbjct: 267 VTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSIEETVNPS------- 319

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                        YCYR+RCAARSQNTPDSS+ NLGFRCAAD  PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSSSNLGFRCAADHLPT 352



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 27/32 (84%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSS+ NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSSSNLGFRCAAD 348


>gi|347965878|ref|XP_003435826.1| AGAP013122-PA [Anopheles gambiae str. PEST]
 gi|333470313|gb|EGK97587.1| AGAP013122-PA [Anopheles gambiae str. PEST]
          Length = 359

 Score =  337 bits (865), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 174/334 (52%), Positives = 212/334 (63%), Gaps = 53/334 (15%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           +R++DM L+PG  + +GTN+PI +KD E P+R  T+  FYLDQ+EVSN QF+ FV  TGY
Sbjct: 76  KRFEDMSLIPGGEYVIGTNEPIFVKDRESPARPATIRDFYLDQYEVSNAQFKAFVDQTGY 135

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VTEAEKFGD+FVF+ LLSE  R +                                    
Sbjct: 136 VTEAEKFGDSFVFQQLLSEPVRQQ------------------------------------ 159

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG-IDSTIEHRMNHPVVHVSWNDA 188
           Y  +R A  P    W Y  RG          SW HPEG +   I  R++HPVVHVSWNDA
Sbjct: 160 YEDFRVAAAP----WWYKVRGA---------SWQHPEGDVSRDISDRLDHPVVHVSWNDA 206

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           VAYC W+G RLPTEAEWE  CRGG + +LFPWGN L P+ +H  N+WQGEFP +N   DG
Sbjct: 207 VAYCAWKGKRLPTEAEWEAACRGGRKQKLFPWGNKLMPKEQHMMNIWQGEFPDSNLKEDG 266

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
           Y +T PV S+++N F LYN+VGNVWEWTAD W+       +   K  +   ++VKKGGSY
Sbjct: 267 YETTCPVTSFRQNPFELYNIVGNVWEWTADLWDAKD---AAIERKPGSDPPNRVKKGGSY 323

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           LC+E YCYR+RCAARSQNT DSSAGNLGFRCAAD
Sbjct: 324 LCHESYCYRYRCAARSQNTEDSSAGNLGFRCAAD 357


>gi|350420524|ref|XP_003492537.1| PREDICTED: sulfatase-modifying factor 1-like [Bombus impatiens]
          Length = 369

 Score =  336 bits (862), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 170/334 (50%), Positives = 210/334 (62%), Gaps = 53/334 (15%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           ++R K+M  +   TF +GTN+P+ + DGE P + + L +FY+D+ EVSN  F +FVS T 
Sbjct: 84  IDRLKNMAKIEAGTFTIGTNEPVFVADGEGPKQQIYLHSFYIDKMEVSNRDFAKFVSVTN 143

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y TEAEKFGD+FVFE LL E+ R                                     
Sbjct: 144 YRTEAEKFGDSFVFEGLLDEKVR------------------------------------- 166

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           A  T   AR P    W    +           +W  PEG  S I++RM+HPVVHVSWNDA
Sbjct: 167 ANVTLVVARAP----WWLQVKDA---------NWQQPEGPTSDIKNRMDHPVVHVSWNDA 213

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           +AYC W G RLPTEAEWE  CRGGL ++L+PWGNNL P G +RAN WQG+FP+NNT  D 
Sbjct: 214 IAYCEWLGKRLPTEAEWEVACRGGLSDKLYPWGNNLIPNGRYRANTWQGDFPSNNTKEDK 273

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP--APSYNPKGPTTGTDKVKKGG 306
           Y ST+PV  +  NK+GL NM+GNVWEWT DWW         P+ NP GP+ GTDKVKKGG
Sbjct: 274 YESTSPVTEFPPNKYGLRNMIGNVWEWTFDWWTTETRTRGGPN-NPSGPSRGTDKVKKGG 332

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           SYLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 333 SYLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 366


>gi|340709728|ref|XP_003393454.1| PREDICTED: sulfatase-modifying factor 1-like isoform 2 [Bombus
           terrestris]
          Length = 358

 Score =  336 bits (861), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 164/332 (49%), Positives = 210/332 (63%), Gaps = 60/332 (18%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           ++R K+M  +   T+ +GTN+P+ + DGE P + + L +FY+D+ EVSN  F +FV+ T 
Sbjct: 84  IDRLKNMAKIEAGTYTIGTNEPVFVADGEGPKQQIYLHSFYIDKMEVSNRDFAKFVNVTN 143

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y TEAEKFGD+FVFE LL E+ RA ++ V                               
Sbjct: 144 YRTEAEKFGDSFVFEGLLDEKVRANVTLV------------------------------- 172

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                       +A W    +           +W HPEG  S I++RM+HPVVHVSWNDA
Sbjct: 173 ----------VAQAPWWLQIKNA---------NWQHPEGPTSDIKNRMDHPVVHVSWNDA 213

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           +AYC W   RLPTEAEWE  CRGGL +RL+PWGNNL P G +RAN WQG+FP+NNT  D 
Sbjct: 214 IAYCEWLEKRLPTEAEWEVACRGGLSDRLYPWGNNLIPNGRYRANTWQGDFPSNNTKEDK 273

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
           Y ST+PV  +  NK+GL+NM+GNVWEWT DWW  +          GP++GTDKVKKGGSY
Sbjct: 274 YESTSPVTEFPPNKYGLHNMIGNVWEWTFDWWTTN----------GPSSGTDKVKKGGSY 323

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           LC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 324 LCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 355



 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 38/47 (80%), Positives = 46/47 (97%)

Query: 344 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           GP++GTDKVKKGGSYLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 309 GPSSGTDKVKKGGSYLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 355


>gi|340709726|ref|XP_003393453.1| PREDICTED: sulfatase-modifying factor 1-like isoform 1 [Bombus
           terrestris]
          Length = 369

 Score =  335 bits (859), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 166/334 (49%), Positives = 213/334 (63%), Gaps = 53/334 (15%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           ++R K+M  +   T+ +GTN+P+ + DGE P + + L +FY+D+ EVSN  F +FV+ T 
Sbjct: 84  IDRLKNMAKIEAGTYTIGTNEPVFVADGEGPKQQIYLHSFYIDKMEVSNRDFAKFVNVTN 143

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y TEAEKFGD+FVFE LL E       +VR ++                           
Sbjct: 144 YRTEAEKFGDSFVFEGLLDE-------KVRANV--------------------------- 169

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                    +  +A W    +           +W HPEG  S I++RM+HPVVHVSWNDA
Sbjct: 170 -------TLVVAQAPWWLQIKNA---------NWQHPEGPTSDIKNRMDHPVVHVSWNDA 213

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           +AYC W   RLPTEAEWE  CRGGL +RL+PWGNNL P G +RAN WQG+FP+NNT  D 
Sbjct: 214 IAYCEWLEKRLPTEAEWEVACRGGLSDRLYPWGNNLIPNGRYRANTWQGDFPSNNTKEDK 273

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP--APSYNPKGPTTGTDKVKKGG 306
           Y ST+PV  +  NK+GL+NM+GNVWEWT DWW         P+ NP GP++GTDKVKKGG
Sbjct: 274 YESTSPVTEFPPNKYGLHNMIGNVWEWTFDWWTTETRKRGGPN-NPNGPSSGTDKVKKGG 332

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           SYLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 333 SYLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 366


>gi|380028488|ref|XP_003697932.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 1-like
           [Apis florea]
          Length = 280

 Score =  334 bits (857), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 164/327 (50%), Positives = 208/327 (63%), Gaps = 51/327 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +    F +GTN PI I DGE P + + L++FY+D+ EVSN  F +FV+ T Y TEAE
Sbjct: 1   MIKIKTGIFAIGTNNPIFIADGEGPKQQIYLNSFYIDKTEVSNRDFAKFVNVTKYKTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+FVF  LL ++ +  ++ V                     V  + W        W 
Sbjct: 61  RFGDSFVFGSLLKQKSQENVTLV---------------------VAQVPW--------WL 91

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
             +                       +WLHPEG +S I++RM+HPVVHVSWNDA+AYC W
Sbjct: 92  QVKN---------------------ANWLHPEGPESDIKNRMDHPVVHVSWNDAIAYCNW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE  CRGGL +RL+PWGN L P G++RAN WQG+FP NNT  DGY  T+P
Sbjct: 131 LGKRLPTEAEWEVACRGGLSDRLYPWGNXLIPNGQYRANTWQGDFPNNNTKEDGYEGTSP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           V  +  NK+GLYNM+GNVWEWT DWW +       S NP GP+ GT+KVKKGGSYLC++ 
Sbjct: 191 VREFPPNKYGLYNMIGNVWEWTFDWWTIEITTRGGSVNPTGPSHGTNKVKKGGSYLCHKS 250

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCA 340
           YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 251 YCFRYRCAARSQNTPDTTAGNLGFRCA 277


>gi|198427283|ref|XP_002131627.1| PREDICTED: similar to sulfatase modifying factor 1 [Ciona
           intestinalis]
          Length = 377

 Score =  331 bits (849), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 176/339 (51%), Positives = 213/339 (62%), Gaps = 51/339 (15%)

Query: 7   PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           P V+R+  M L+P   F MG+N   L KDGE P R V LD FY+D HE SN +F  FV A
Sbjct: 78  PNVKRFNQMKLIPKGEFTMGSNAK-LTKDGEDPERRVFLDDFYMDVHETSNIEFAHFVKA 136

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           T YVTEAEKF ++FV E L+S+E  +KI+Q                              
Sbjct: 137 TNYVTEAEKFKNSFVLEMLISKETLSKITQ------------------------------ 166

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
            VA   W    LP +                    WLHPEG DS I  R+NHPV+HVSWN
Sbjct: 167 QVASSPWW---LPVDG-----------------ADWLHPEGPDSNISKRLNHPVIHVSWN 206

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA +YC W G RLPTEAEWE   RGGLENRLFPWGN L P G++  N+WQGEFP  NTA 
Sbjct: 207 DAKSYCKWAGKRLPTEAEWEKAARGGLENRLFPWGNKLNPNGKYMINIWQGEFPKGNTAE 266

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
           DGY +TAPV S+  NKFGL+N VGNVWEW  DW++ +H    ++NPKGP +G  KVKKGG
Sbjct: 267 DGYAATAPVDSFPPNKFGLFNTVGNVWEWVEDWYSTYHELERAHNPKGPKSGKVKVKKGG 326

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           SY+C++++CYR+RC+AR +NTPDSS+ NLGFRCAAD  P
Sbjct: 327 SYMCHKEFCYRYRCSARGENTPDSSSSNLGFRCAADTLP 365


>gi|328793659|ref|XP_003251911.1| PREDICTED: sulfatase-modifying factor 1-like isoform 1 [Apis
           mellifera]
          Length = 356

 Score =  330 bits (846), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 160/332 (48%), Positives = 206/332 (62%), Gaps = 60/332 (18%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +   K+M+ +    F +GTN PI I DGE P + + L++FY+D+ EVSN  F +FV AT 
Sbjct: 82  INHLKNMIKIKAGIFGIGTNDPIFIADGEGPKQQIYLNSFYIDKTEVSNHDFSKFVDATN 141

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y TEAE+FGD+FVF  LL ++++  I+ V                               
Sbjct: 142 YKTEAERFGDSFVFRGLLKQKDQKNITLV------------------------------- 170

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                       +A W    +           +WLHPEG +S I++RM+HPV+HVSWNDA
Sbjct: 171 ----------VAQAPWWLQVKNA---------NWLHPEGPESDIKNRMDHPVIHVSWNDA 211

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           +AYC W G RLPTEAEWE  CRGGL +RL+PWGN L P G++RAN WQG+FP NNT  DG
Sbjct: 212 IAYCNWLGKRLPTEAEWEVACRGGLSDRLYPWGNKLIPNGQYRANTWQGDFPNNNTKEDG 271

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
           Y  T+PV  +  NK+GLYNM+GNVWEWT DWW +          +  T G +KVKKGGSY
Sbjct: 272 YEGTSPVREFPPNKYGLYNMIGNVWEWTFDWWTI----------EITTRGINKVKKGGSY 321

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           LC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 322 LCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 353



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 35/45 (77%), Positives = 42/45 (93%)

Query: 346 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           T G +KVKKGGSYLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 309 TRGINKVKKGGSYLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 353


>gi|332231569|ref|XP_003264967.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Nomascus
           leucogenys]
          Length = 354

 Score =  330 bits (846), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 170/335 (50%), Positives = 204/335 (60%), Gaps = 70/335 (20%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+T
Sbjct: 88  HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLT 147

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAEKFGD+FVFE +LSE+ +  I Q                              AVA  
Sbjct: 148 EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 177

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W    LP +                   +W HPE  DSTI HR +HPV+HVSWNDA AY
Sbjct: 178 PWW---LPVKG-----------------ANWRHPEWPDSTIRHRPDHPVLHVSWNDAFAY 217

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  
Sbjct: 218 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 277

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NP                   
Sbjct: 278 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------------------ 319

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
             YCYR+RCAARSQNTPDSSA NLGFRCAAD  PT
Sbjct: 320 --YCYRYRCAARSQNTPDSSASNLGFRCAADCLPT 352



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/32 (87%), Positives = 29/32 (90%)

Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           N  YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348


>gi|156408243|ref|XP_001641766.1| predicted protein [Nematostella vectensis]
 gi|156228906|gb|EDO49703.1| predicted protein [Nematostella vectensis]
          Length = 290

 Score =  327 bits (838), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 167/340 (49%), Positives = 210/340 (61%), Gaps = 51/340 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G  F MGTNKP ++ DGE P+R V L  FY+D +EVSN +F+ FV+ATG+ TEAE
Sbjct: 1   MVRIEGGEFTMGTNKPYIVVDGEGPARKVRLRPFYMDIYEVSNREFELFVNATGHKTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+FV    +S+E                     IE  +H  V    W        W 
Sbjct: 61  VFGDSFVLGSRVSKE---------------------IEKDIHQAVAAAPW--------W- 90

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +  +                 W  PEG D+ I  RMNHPV+H+SWNDAVA+C W
Sbjct: 91  ---LPVKGAY-----------------WRRPEGPDTDIRERMNHPVLHISWNDAVAFCKW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAE+EY  +GG ++ L+PWGN L   G+H AN+WQG FP  NTA DGY  T P
Sbjct: 131 SGKRLPTEAEFEYALKGGSDDTLYPWGNELLVNGKHMANLWQGRFPVENTAEDGYEGTCP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++ +N +GLYN++GN WEWT+DWWNV H      NP GPTTG DKVK+GGSY+C+E Y
Sbjct: 191 VTAFPQNGYGLYNIIGNAWEWTSDWWNVRHTSDFQDNPTGPTTGKDKVKRGGSYMCHESY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKK 354
           CYR+R AARSQN+PD++A NLGFRCA+D  P  G   +KK
Sbjct: 251 CYRYRSAARSQNSPDTTASNLGFRCASDS-PPPGVTIIKK 289


>gi|291222785|ref|XP_002731393.1| PREDICTED: sulfatase modifying factor 1-like isoform 1
           [Saccoglossus kowalevskii]
          Length = 293

 Score =  326 bits (835), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 166/328 (50%), Positives = 199/328 (60%), Gaps = 50/328 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  + G TF MGT+ P ++ DGE P+R V LD F+ D++EVSN +FQ FV   GYVTEAE
Sbjct: 1   MSYIKGGTFTMGTDTPGVLFDGEGPARKVKLDPFFADKYEVSNREFQLFVDDKGYVTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+FV E  +SE+  + +++V                     V    W   V    W+
Sbjct: 61  NFGDSFVLEARVSEKTLSTVTEV---------------------VAQTPWWLKVIGADWK 99

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                        HPEG D+ I  RM+HPV+HVSWNDAV +C W
Sbjct: 100 -----------------------------HPEGPDTNITDRMDHPVLHVSWNDAVEFCKW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAE+EY  RGG ENR +PWGN L P G+   NVWQG FPT N+A DG+  T+P
Sbjct: 131 IGKRLPTEAEFEYASRGGKENRTYPWGNRLRPNGKWLMNVWQGVFPTENSAEDGFAGTSP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           VM+Y  N FGL+NMVGN WEWT DWW   H      NPKGP +G DKVKKGGSY+C+  Y
Sbjct: 191 VMAYPPNGFGLHNMVGNAWEWTNDWWGTSHTADLYVNPKGPASGIDKVKKGGSYMCHASY 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           CYR+RCAARSQNTPDSSA NLGFRC AD
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCFAD 278


>gi|405962419|gb|EKC28098.1| Sulfatase-modifying factor 1 [Crassostrea gigas]
          Length = 539

 Score =  325 bits (833), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 164/333 (49%), Positives = 205/333 (61%), Gaps = 51/333 (15%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           + R  +M+ +   TF MGT++P+   DGE P+R VT+  FYLD+HEVSN +FQ FV+ T 
Sbjct: 68  IPRTNEMLFIERGTFTMGTDEPVFPVDGEGPARKVTVYGFYLDKHEVSNAEFQRFVAETK 127

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           YVTEAE FG++F  E  +SEE    ++Q                              +V
Sbjct: 128 YVTEAESFGNSFCLENYVSEEVLKNVTQ------------------------------SV 157

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           A   W    +P                      W HP G DS+I+  M+HPVVHVSWNDA
Sbjct: 158 AAAPWW---VPING-----------------ADWRHPNGPDSSIKDIMDHPVVHVSWNDA 197

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           V YC W G RLPTEAE+E  C+G L++RLFPWGN   P+G+H  N+W G+FP+ NTA DG
Sbjct: 198 VKYCEWAGKRLPTEAEFERACKGNLKSRLFPWGNKEEPKGKHWMNIWHGKFPSENTANDG 257

Query: 249 YLSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
           Y +TAPV  + E N FG+ N++GNVWEWT DWW     P P  NPKGP  GTDKVKKGGS
Sbjct: 258 YHTTAPVTEFPEQNIFGVKNIIGNVWEWTQDWWETKFTPTPKKNPKGPRFGTDKVKKGGS 317

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           ++C++ YCYR+RC ARSQNTPDSSA NLGFRCA
Sbjct: 318 FMCHKNYCYRYRCDARSQNTPDSSAVNLGFRCA 350


>gi|443707651|gb|ELU03164.1| hypothetical protein CAPTEDRAFT_155214 [Capitella teleta]
          Length = 384

 Score =  323 bits (828), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 162/337 (48%), Positives = 203/337 (60%), Gaps = 51/337 (15%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           ER   MV LP  +F MGTN PI+  DGE P+R V +  F+ D +E SN +F+ FV++TGY
Sbjct: 75  ERTNQMVYLPTGSFMMGTNNPIIPPDGEGPARRVHITGFWYDVYETSNAEFELFVNSTGY 134

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            TEAE FGD+FV E  +SE+ +  ISQ                              AVA
Sbjct: 135 KTEAEGFGDSFVIEAFISEKIKEDISQ------------------------------AVA 164

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W    LP +                    W HP G D+ ++  M+HPV+HVSWNDAV
Sbjct: 165 SAPWW---LPVKG-----------------ADWRHPFGPDTDLKGIMDHPVLHVSWNDAV 204

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AYC W   RLPTEAE+EY CR G ++RLFPWG+N+ P+ +HR N+WQG+FP  NTA DG+
Sbjct: 205 AYCQWAQKRLPTEAEFEYACRDGKDDRLFPWGSNMLPKNQHRMNIWQGDFPNTNTAEDGF 264

Query: 250 LSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
             T PV  +K + K G  N++GN WEW  DWW   HH   + NP GP  GTDK KKGGSY
Sbjct: 265 ERTCPVTEFKPQTKLGQKNLIGNAWEWVQDWWQTEHHTGTTKNPVGPKQGTDKTKKGGSY 324

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +C + +CYR+RC ARSQNTPDSSA NLGFRCA++K P
Sbjct: 325 MCIKTHCYRYRCGARSQNTPDSSASNLGFRCASNKLP 361


>gi|37181290|gb|AAQ88459.1| AAPA3037 [Homo sapiens]
          Length = 426

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 164/316 (51%), Positives = 201/316 (63%), Gaps = 54/316 (17%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q                        
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                 AVA   W    LP +                   +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
             NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326

Query: 301 KVKKGGSYLCNEQYCY 316
           +VKKGGSY+C+    Y
Sbjct: 327 RVKKGGSYMCHRSQEY 342


>gi|440905735|gb|ELR56080.1| Sulfatase-modifying factor 1, partial [Bos grunniens mutus]
          Length = 338

 Score =  322 bits (824), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 158/308 (51%), Positives = 200/308 (64%), Gaps = 54/308 (17%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P+PP +    MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV
Sbjct: 85  PSPPTK----MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFV 140

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           ++TGY+TEAEKFGD+FVFE +LSE+ ++ I Q                            
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKSDIQQ---------------------------- 172

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
             AVA   W    LP +                   +W HPEG DST+ HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVS 210

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           WNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
             DG+  TAPV ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKK
Sbjct: 271 GEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSAEETINPKGPPSGKDRVKK 330

Query: 305 GGSYLCNE 312
           GGSY+C++
Sbjct: 331 GGSYMCHK 338


>gi|189236927|ref|XP_969670.2| PREDICTED: similar to sulfatase-modifying factor 1 [Tribolium
           castaneum]
          Length = 267

 Score =  315 bits (806), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 167/328 (50%), Positives = 199/328 (60%), Gaps = 63/328 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+   TF MGT+KP+   D E P+RNV++D+FYLD+ EVSN +F EFV  TGY TEAE
Sbjct: 1   MVLIEAATFEMGTDKPVFESDHEGPARNVSVDSFYLDKFEVSNGKFSEFVLKTGYKTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+F+FE L+ E ER K    R                                    
Sbjct: 61  EFGDSFIFEMLVPESEREKYKDFR------------------------------------ 84

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                 +A W    +G          +W HPEG  STI  R++HPV+HVSW DA  YC +
Sbjct: 85  ----AVQAPWWIKMKGV---------TWKHPEGEGSTINDRLDHPVIHVSWTDAKKYCEF 131

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE  CR GL  +L+PWGN LTP+GEH AN+WQG+FP  NTA DGYLSTAP
Sbjct: 132 VGKRLPTEAEWEMACRAGLRQKLYPWGNKLTPKGEHWANIWQGDFPLTNTAEDGYLSTAP 191

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V S+  NKFGL++M GNVWEWT D W          +P        KVKKGGSYLC+E Y
Sbjct: 192 VDSFPANKFGLHHMAGNVWEWTQDNW--------LNDPDA------KVKKGGSYLCHESY 237

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+R+RCAARS NT DSSAGNLGFRCA D
Sbjct: 238 CWRYRCAARSFNTKDSSAGNLGFRCAGD 265



 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 36/43 (83%), Positives = 39/43 (90%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           KVKKGGSYLC+E YC+R+RCAARS NT DSSAGNLGFRCA DV
Sbjct: 224 KVKKGGSYLCHESYCWRYRCAARSFNTKDSSAGNLGFRCAGDV 266


>gi|312385017|gb|EFR29611.1| hypothetical protein AND_01279 [Anopheles darlingi]
          Length = 350

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 164/335 (48%), Positives = 205/335 (61%), Gaps = 52/335 (15%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           +RY+ M L+P   + +GT++PI   D E P+R VT+ AF+LDQ+EVSN QF+EFV  TGY
Sbjct: 64  KRYEHMSLIPAGEYLIGTDQPIFPADKESPARPVTMAAFFLDQYEVSNKQFKEFVDQTGY 123

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           +TEAEKFGD+FVF+  L  E R +                                    
Sbjct: 124 ITEAEKFGDSFVFQRFLPAEVRKQ------------------------------------ 147

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGID--STIEHRMNHPVVHVSWND 187
           Y  +R A  P    W Y  +G          SW  PE     S  + R++HPVVHVSWND
Sbjct: 148 YEDFRVAAAP----WWYKVQGA---------SWRFPEADKERSVEQDRLDHPVVHVSWND 194

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           AVAYC W+G RLPTEAEWE  CRGG + +LFPWGN L P+ +H+ N+WQG FP ++ A D
Sbjct: 195 AVAYCHWKGKRLPTEAEWEVACRGGRKQKLFPWGNKLLPKDQHQMNIWQGAFPDSDLAED 254

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
           G   T PV  +++N + LYN+VGNVWEWTAD W+      P+   K      ++VKKGGS
Sbjct: 255 GCDGTCPVGKFRQNPYDLYNIVGNVWEWTADLWDAAEG-KPTAERKPGADPPNRVKKGGS 313

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YLC+E YCYR+RCAARSQNT DSSAGNLGFRCAAD
Sbjct: 314 YLCHESYCYRYRCAARSQNTEDSSAGNLGFRCAAD 348


>gi|321479101|gb|EFX90057.1| hypothetical protein DAPPUDRAFT_94253 [Daphnia pulex]
          Length = 286

 Score =  308 bits (789), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 160/331 (48%), Positives = 209/331 (63%), Gaps = 50/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV++ G  + +GT++P+ I DGE P+R+V LD +++D HEVSN  F +FV  + +VTEAE
Sbjct: 1   MVMVRGGIYEIGTDEPVFIADGESPARSVYLDDYFIDTHEVSNQDFADFVEISNFVTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFG +FVFE +LS+E +++IS+V      +             PVV+ SW          
Sbjct: 61  KFGSSFVFEGILSDETKSEISKVVASAPWW------------MPVVNASWK--------- 99

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                         PEG +S+I  +MNHP +H+SWNDAV++C +
Sbjct: 100 -----------------------------FPEGPNSSISDKMNHPAIHISWNDAVSFCDF 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+EAEWE  C+GGL+ RL+PWGN L P G+H  N+WQG+FP  NTA DGY+ TAP
Sbjct: 131 YGKRLPSEAEWEVACQGGLKQRLYPWGNKLNPYGKHWTNIWQGKFPGENTAEDGYVGTAP 190

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  NK+GLY MVGNVWEWT DWW V H      NP GP  G DKVKKGGSY+C++  
Sbjct: 191 VDAFPPNKYGLYQMVGNVWEWTMDWWQVDHSSELLRNPMGPKKGKDKVKKGGSYMCHQST 250

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           C+R+RCAARSQNTPDSS+ NLGFRCA    P
Sbjct: 251 CFRYRCAARSQNTPDSSSSNLGFRCAMSSSP 281



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 26/38 (68%), Positives = 32/38 (84%)

Query: 357 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADVS 394
           SY+C++  C+R+RCAARSQNTPDSS+ NLGFRCA   S
Sbjct: 243 SYMCHQSTCFRYRCAARSQNTPDSSSSNLGFRCAMSSS 280


>gi|332375104|gb|AEE62693.1| unknown [Dendroctonus ponderosae]
          Length = 332

 Score =  304 bits (778), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 162/329 (49%), Positives = 194/329 (58%), Gaps = 63/329 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M L+    F MGTNKP+   D E P+RNVT++ +FYLD +EVSN QF +FV  T Y TEA
Sbjct: 64  MALIGKGIFEMGTNKPVFPSDFEGPARNVTIENSFYLDIYEVSNQQFYDFVRTTNYKTEA 123

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FGD+FVFE  L E +R +   +R                                   
Sbjct: 124 EQFGDSFVFEMSLPENQRNEHQDIR----------------------------------- 148

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                  +A W             P   W HPEG  STIE RMNHPV HVSWNDAVAYC 
Sbjct: 149 -----AAQAPWWIK---------LPDAYWKHPEGPKSTIEDRMNHPVAHVSWNDAVAYCE 194

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           + G RLPTEAEWE  CRGGL  +++PWGN L P+G+H AN+WQGEFP  NTA DGY+ T 
Sbjct: 195 YVGKRLPTEAEWEMACRGGLRQKMYPWGNKLQPKGQHWANIWQGEFPKENTAEDGYIFTC 254

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV  +  N+FGLYNM GNVWEW  D W          +P+       +VKKGGS+LC++ 
Sbjct: 255 PVDKFPPNQFGLYNMAGNVWEWVQDDWQT--------DPQ-----NSRVKKGGSFLCHQS 301

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YC+R+RCAARS NT DSSA NLGFRCAAD
Sbjct: 302 YCWRYRCAARSFNTKDSSAANLGFRCAAD 330



 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 32/43 (74%), Positives = 38/43 (88%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +VKKGGS+LC++ YC+R+RCAARS NT DSSA NLGFRCAAD 
Sbjct: 289 RVKKGGSFLCHQSYCWRYRCAARSFNTKDSSAANLGFRCAADA 331


>gi|334134870|ref|ZP_08508371.1| hypothetical protein HMPREF9413_2437 [Paenibacillus sp. HGF7]
 gi|333607372|gb|EGL18685.1| hypothetical protein HMPREF9413_2437 [Paenibacillus sp. HGF7]
          Length = 326

 Score =  304 bits (778), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 158/330 (47%), Positives = 199/330 (60%), Gaps = 52/330 (15%)

Query: 15  MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+LLPG +F MGT +      DGE P   V L  FY+D H VSN QF+EF   TGYVTEA
Sbjct: 46  MILLPGGSFDMGTEDGEGFPADGEGPVHEVHLKPFYVDPHAVSNAQFREFARQTGYVTEA 105

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   +S E    ++QV                                    
Sbjct: 106 ERFGWSFVFHSFVSPETARNVTQV------------------------------------ 129

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                P++  W Y  RG          SW HPEG DS +  RM+HPV+HVSWNDA AYC 
Sbjct: 130 -----PSQTPWWYVVRGA---------SWDHPEGPDSDLRDRMDHPVLHVSWNDAAAYCK 175

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RL TEAEWEY  RGGL  + +PWG+ L P G HR N+WQG+FP  +  +DGY  TA
Sbjct: 176 WAGKRLLTEAEWEYAARGGLVRKRYPWGDELKPGGGHRCNIWQGKFPLKDNGSDGYRGTA 235

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           PV +Y  N +GLYN+ GNVWEW +DW++  ++  +P+ NP+GP +G  +V +GGSYLC++
Sbjct: 236 PVHAYPPNGYGLYNVSGNVWEWCSDWFSPAYYACSPADNPQGPASGDARVMRGGSYLCHK 295

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC R+R AARS+NTPDSSAGN+GFRCAAD
Sbjct: 296 SYCNRYRTAARSKNTPDSSAGNIGFRCAAD 325


>gi|195170615|ref|XP_002026107.1| GL16088 [Drosophila persimilis]
 gi|194110987|gb|EDW33030.1| GL16088 [Drosophila persimilis]
          Length = 339

 Score =  303 bits (776), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 157/328 (47%), Positives = 193/328 (58%), Gaps = 59/328 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+PG +  +GT+ P    D E P R V L  FYLD+HEVSN +F+EFVSAT Y TEAE
Sbjct: 70  MSLIPGGSVHIGTDIPHFQADRESPERLVKLKDFYLDKHEVSNAKFEEFVSATKYTTEAE 129

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+F+F+ LLS  E+ ++   R                                    
Sbjct: 130 RFGDSFLFKTLLSAAEQKRLEDYR------------------------------------ 153

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W Y   G           W +P G+DS++     HPVVHVSW DAVAYCTW
Sbjct: 154 ----VASAPWWYKVSGV---------GWRNPNGVDSSLNGLEQHPVVHVSWRDAVAYCTW 200

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+EAEWE  CRGG + +LFPWGN L P+ +H  N+WQG+FP  NTA DGY  T P
Sbjct: 201 AGKRLPSEAEWEAACRGGKQRKLFPWGNKLMPKDQHWLNIWQGDFPDGNTADDGYEGTCP 260

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V  Y++N + LYNMVGNVWEWTAD W+              +   ++VKKGGSYLC++ Y
Sbjct: 261 VDRYRQNVYDLYNMVGNVWEWTADLWDT----------SDVSESPNRVKKGGSYLCHKSY 310

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+R+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 311 CFRYRCAARSQNTEDSSAGNLGFRCAKD 338



 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YC+R+RCAARSQNT DSSAGNLGFRCA D 
Sbjct: 296 NRVKKGGSYLCHKSYCFRYRCAARSQNTEDSSAGNLGFRCAKDA 339


>gi|68051697|gb|AAY85112.1| GH26754p [Drosophila melanogaster]
          Length = 337

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 161/329 (48%), Positives = 196/329 (59%), Gaps = 59/329 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DM LLPG T  MGT+KP    D E P R V L+ FY+D++EVSN  F +FV  T Y TEA
Sbjct: 67  DMSLLPGGTVYMGTDKPHFPADREAPERQVKLNDFYIDKYEVSNEAFAKFVLHTNYTTEA 126

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E++GD+F+F+ LLS  E+  +   R                    V    W   VA   W
Sbjct: 127 ERYGDSFLFKSLLSPLEQKNLEDFR--------------------VASAVWWYKVAGVNW 166

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R                             HP G+DS I+H   HPVVHVSW DAV YC 
Sbjct: 167 R-----------------------------HPNGVDSDIDHLGRHPVVHVSWRDAVEYCK 197

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLP+EAEWE  CRGG E +LFPWGN L PR EH  N+WQG+FP  N A DG+  T+
Sbjct: 198 WAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFEYTS 257

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV ++++N + L+NMVGNVWEWTAD W+V+     S NP       ++VKKGGSYLC++ 
Sbjct: 258 PVDAFRQNIYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLCHKS 307

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 308 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 336



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 294 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 337


>gi|20130397|ref|NP_612003.1| CG7049 [Drosophila melanogaster]
 gi|7291920|gb|AAF47337.1| CG7049 [Drosophila melanogaster]
          Length = 336

 Score =  302 bits (774), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 161/329 (48%), Positives = 196/329 (59%), Gaps = 59/329 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DM LLPG T  MGT+KP    D E P R V L+ FY+D++EVSN  F +FV  T Y TEA
Sbjct: 66  DMSLLPGGTVYMGTDKPHFPADREAPERQVKLNDFYIDKYEVSNEAFAKFVLHTNYTTEA 125

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E++GD+F+F+ LLS  E+  +   R                    V    W   VA   W
Sbjct: 126 ERYGDSFLFKSLLSPLEQKNLEDFR--------------------VASAVWWYKVAGVNW 165

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R                             HP G+DS I+H   HPVVHVSW DAV YC 
Sbjct: 166 R-----------------------------HPNGVDSDIDHLGRHPVVHVSWRDAVEYCK 196

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLP+EAEWE  CRGG E +LFPWGN L PR EH  N+WQG+FP  N A DG+  T+
Sbjct: 197 WAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFEYTS 256

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV ++++N + L+NMVGNVWEWTAD W+V+     S NP       ++VKKGGSYLC++ 
Sbjct: 257 PVDAFRQNIYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLCHKS 306

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 307 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 335



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 293 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 336


>gi|17862416|gb|AAL39685.1| LD26422p [Drosophila melanogaster]
          Length = 303

 Score =  302 bits (774), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 161/329 (48%), Positives = 196/329 (59%), Gaps = 59/329 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DM LLPG T  MGT+KP    D E P R V L+ FY+D++EVSN  F +FV  T Y TEA
Sbjct: 33  DMSLLPGGTVYMGTDKPHFPADREAPERQVKLNDFYIDKYEVSNEAFAKFVLHTNYTTEA 92

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E++GD+F+F+ LLS  E+  +   R                    V    W   VA   W
Sbjct: 93  ERYGDSFLFKSLLSPLEQKNLEDFR--------------------VASAVWWYKVAGVNW 132

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R                             HP G+DS I+H   HPVVHVSW DAV YC 
Sbjct: 133 R-----------------------------HPNGVDSDIDHLGRHPVVHVSWRDAVEYCK 163

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLP+EAEWE  CRGG E +LFPWGN L PR EH  N+WQG+FP  N A DG+  T+
Sbjct: 164 WAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFEYTS 223

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV ++++N + L+NMVGNVWEWTAD W+V+     S NP       ++VKKGGSYLC++ 
Sbjct: 224 PVDAFRQNIYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLCHKS 273

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 274 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 302



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 260 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 303


>gi|261406171|ref|YP_003242412.1| Non-specific serine/threonine protein kinase [Paenibacillus sp.
           Y412MC10]
 gi|261282634|gb|ACX64605.1| Non-specific serine/threonine protein kinase [Paenibacillus sp.
           Y412MC10]
          Length = 328

 Score =  302 bits (773), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 162/336 (48%), Positives = 199/336 (59%), Gaps = 54/336 (16%)

Query: 12  YKDMVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           + DM+ LPG TF MGTN      +DGE P+R+VT+  F +  H V+N ++Q FV ATGYV
Sbjct: 43  HTDMIKLPGGTFTMGTNSNEGFPRDGEGPARSVTVSGFEISPHAVTNGEYQRFVEATGYV 102

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE+FG +FVFE L SEE +A ++QV                                 
Sbjct: 103 TEAERFGWSFVFELLASEETKAAVAQV--------------------------------- 129

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                   P E  W     G           W  PEG DS+IE RM+HPVVH+SWNDA A
Sbjct: 130 --------PQEVPWWLVVEGAY---------WAAPEGADSSIEDRMDHPVVHISWNDAEA 172

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLPTEAEWEY  RGGLE + +PWG+ L   GEH+ N+WQG+FP  N A+DGY+
Sbjct: 173 YCQWAGVRLPTEAEWEYAARGGLEGKTYPWGDLLKQDGEHQCNIWQGKFPVKNNASDGYI 232

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNP-KGPTTGTDKVKKGGSY 308
            TAPV++YK N +GLYNM GNVWEW  DW++  +H    + NP  G  TG  +  +GGSY
Sbjct: 233 GTAPVIAYKPNGYGLYNMSGNVWEWCGDWFSPSYHQQTTAINPFYGEPTGR-RSMRGGSY 291

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           LC+  YC R+R AARS NTPDSS GN GFR   D G
Sbjct: 292 LCHRSYCNRYRVAARSGNTPDSSTGNCGFRVVRDGG 327


>gi|157116189|ref|XP_001652787.1| hypothetical protein AaeL_AAEL007497 [Aedes aegypti]
 gi|108876575|gb|EAT40800.1| AAEL007497-PA [Aedes aegypti]
          Length = 343

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 158/329 (48%), Positives = 193/329 (58%), Gaps = 60/329 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+ G  + +GTN+P  + D E P R V L+AFYLD++EVSN +F EF+  T Y+TEAE
Sbjct: 72  MSLIHGGKYLIGTNEPFFVGDHEGPEREVDLEAFYLDRYEVSNREFAEFIEKTAYITEAE 131

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+FVF+  L  + R K    R                             VA   W 
Sbjct: 132 RFGDSFVFQEFLDPKVREKYKDYR-----------------------------VAAAVW- 161

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST-IEHRMNHPVVHVSWNDAVAYCT 193
                    W             P  SW +PEG     IE RM+HPVVHVSWNDAVAYC 
Sbjct: 162 ---------W----------FKIPGASWRYPEGDKQRGIESRMDHPVVHVSWNDAVAYCK 202

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLP+EAEWE  CRGG + +LFPWGN + P+ EH  N+WQGEFP  N   DG   T 
Sbjct: 203 WLDKRLPSEAEWEAACRGGRKGKLFPWGNKMMPKDEHYMNIWQGEFPEGNRGEDGCEGTC 262

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV  +++NK+ LYNMVGNVWEWTAD W+   +  P           ++VKKGGSYLC++ 
Sbjct: 263 PVDRFRQNKYDLYNMVGNVWEWTADLWDAKENVKPP----------NRVKKGGSYLCHKS 312

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YCYR+RCAARSQNT DS+AGNLGFRCAAD
Sbjct: 313 YCYRYRCAARSQNTEDSTAGNLGFRCAAD 341



 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 36/45 (80%), Positives = 42/45 (93%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADVS 394
           ++VKKGGSYLC++ YCYR+RCAARSQNT DS+AGNLGFRCAADV 
Sbjct: 299 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSTAGNLGFRCAADVD 343


>gi|170055860|ref|XP_001863770.1| sulfatase-modifying factor 1 [Culex quinquefasciatus]
 gi|167875738|gb|EDS39121.1| sulfatase-modifying factor 1 [Culex quinquefasciatus]
          Length = 348

 Score =  301 bits (772), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 162/334 (48%), Positives = 198/334 (59%), Gaps = 61/334 (18%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E++  M L+PG  + +GT +PI  +D E   R V ++AFY+D++EVSN  F+EFV AT Y
Sbjct: 73  EKFNAMSLIPGGKYVIGTREPIFPEDREH-ERKVAINAFYMDKYEVSNGDFREFVEATAY 131

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            +EAE+FGD+FVF   L EE R K     HD +                           
Sbjct: 132 ESEAERFGDSFVFHGFLDEEVRKKF----HDFRV-------------------------- 161

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDS-TIEHRMNHPVVHVSWNDA 188
                       A W Y   G          SW HPEG  S  +E R++HPVVHVSWNDA
Sbjct: 162 ----------AAAPWWYKVTGA---------SWRHPEGDPSRNVEDRLDHPVVHVSWNDA 202

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           VAYC W G RLPTEAEWE  CRGG + +LFPWGN + P  +H  N+WQGEFP  + A DG
Sbjct: 203 VAYCDWAGKRLPTEAEWEVACRGGRKQKLFPWGNKMMPNDKHYMNIWQGEFPEADLAEDG 262

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
              T PV  + +N FGL++MVGNVWEWTAD W+     +P           ++VKKGGSY
Sbjct: 263 CKGTCPVDRFSQNTFGLHSMVGNVWEWTADRWDEEEKVSPP----------NRVKKGGSY 312

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           LC+E YCYR+RCAARSQNT DSSAGNLGFRCAAD
Sbjct: 313 LCHESYCYRYRCAARSQNTEDSSAGNLGFRCAAD 346



 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/43 (86%), Positives = 41/43 (95%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           ++VKKGGSYLC+E YCYR+RCAARSQNT DSSAGNLGFRCAAD
Sbjct: 304 NRVKKGGSYLCHESYCYRYRCAARSQNTEDSSAGNLGFRCAAD 346


>gi|198462565|ref|XP_002135327.1| GA28486 [Drosophila pseudoobscura pseudoobscura]
 gi|198150880|gb|EDY73954.1| GA28486 [Drosophila pseudoobscura pseudoobscura]
          Length = 339

 Score =  301 bits (771), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 157/328 (47%), Positives = 192/328 (58%), Gaps = 59/328 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+PG +  +GT+ P    D E P R V L  FYLD+HEVSN +F EFVSAT Y TEAE
Sbjct: 70  MSLIPGGSVHIGTDIPHFQADRESPERLVKLKDFYLDKHEVSNAKFGEFVSATKYTTEAE 129

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+F+F+ LLS  E+ ++   R                                    
Sbjct: 130 RFGDSFLFKTLLSAAEQKRLEDYR------------------------------------ 153

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W Y   G           W +P G+DS++     HPVVHVSW DAVAYCTW
Sbjct: 154 ----VASAPWWYKVSGV---------GWRNPNGVDSSLNGLEQHPVVHVSWRDAVAYCTW 200

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+EAEWE  CRGG + +LFPWGN L P+ +H  N+WQG+FP  NTA DGY  T P
Sbjct: 201 AGKRLPSEAEWEAACRGGKQRKLFPWGNKLMPKDQHWLNIWQGDFPDGNTADDGYEGTCP 260

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V  Y++N + LYNMVGNVWEWTAD W+              +   ++VKKGGSYLC++ Y
Sbjct: 261 VDRYRQNVYDLYNMVGNVWEWTADLWDT----------SDVSESPNRVKKGGSYLCHKSY 310

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+R+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 311 CFRYRCAARSQNTEDSSAGNLGFRCAKD 338



 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YC+R+RCAARSQNT DSSAGNLGFRCA D 
Sbjct: 296 NRVKKGGSYLCHKSYCFRYRCAARSQNTEDSSAGNLGFRCAKDA 339


>gi|195586666|ref|XP_002083094.1| GD13538 [Drosophila simulans]
 gi|194195103|gb|EDX08679.1| GD13538 [Drosophila simulans]
          Length = 343

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 160/332 (48%), Positives = 196/332 (59%), Gaps = 59/332 (17%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           +  DM LLPG T  MGT+KP    D E P R V L  FY+D++EVSN  F +FV  T Y 
Sbjct: 70  KIADMSLLPGGTVYMGTDKPHFPADREGPERQVKLKDFYIDKYEVSNEAFAKFVLHTNYT 129

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE++GD+F+F+ LLS  E+  +   R                    V    W   VA 
Sbjct: 130 TEAERYGDSFLFKSLLSPSEQKNLEDFR--------------------VASAVWWYKVAG 169

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             WR                              P G+DS I+H   HPVVHVSW DAV 
Sbjct: 170 VNWR-----------------------------RPNGVDSNIDHLGRHPVVHVSWRDAVE 200

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLP+EAEWE  CRGG E++LFPWGN L PR EH  N+WQG+FP  N A DG+ 
Sbjct: 201 YCKWAGKRLPSEAEWEAACRGGKEHKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFE 260

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            T+PV ++++N + L+NMVGNVWEWTAD W+V+     S NP       ++VKKGGSYLC
Sbjct: 261 YTSPVDAFRQNMYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLC 310

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           ++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 311 HKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 342



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 300 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 343


>gi|390452773|ref|ZP_10238301.1| Non-specific serine/threonine protein kinase [Paenibacillus peoriae
           KCTC 3763]
          Length = 335

 Score =  301 bits (770), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 161/338 (47%), Positives = 205/338 (60%), Gaps = 53/338 (15%)

Query: 7   PPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           P +ER   M+ L G TFRMGT+ +    +DGE P+R VTL  FY+D++ V+N QF +FV 
Sbjct: 48  PTLER-TGMIRLEGGTFRMGTDSREGFEQDGEGPAREVTLSPFYIDRYAVTNAQFTDFVK 106

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
            T YVTEAE+FG++FVF  LL  E+R    Q R  M                 V+   W 
Sbjct: 107 DTNYVTEAERFGNSFVFHLLLPSEQR----QNRATM-----------------VMQTPWW 145

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
             V    W+                             HPEG  S ++ RM+HPVVHVSW
Sbjct: 146 HVVEGADWK-----------------------------HPEGKSSDVKERMDHPVVHVSW 176

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           NDA+AYC W G RLPTEAEWEY  RGGL+   +PWG+ LTP G+H  N+WQG+FP +N A
Sbjct: 177 NDALAYCKWTGKRLPTEAEWEYAARGGLDQARYPWGDELTPEGKHMCNIWQGKFPISNDA 236

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGPTTGTDKVKK 304
            DGY+ TAPV +Y+ N FGLYN+ GNVWEW AD +N ++H   S  NP G + G  +  +
Sbjct: 237 TDGYVGTAPVDAYEPNGFGLYNVAGNVWEWCADEFNPYYHLQSSMINPLGVSRGNVRSMR 296

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGSYLC++ YC R+R AAR++NTPDSS+GN GFRC AD
Sbjct: 297 GGSYLCHDSYCNRYRVAARNKNTPDSSSGNTGFRCVAD 334


>gi|313226833|emb|CBY21978.1| unnamed protein product [Oikopleura dioica]
          Length = 779

 Score =  300 bits (769), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 156/336 (46%), Positives = 198/336 (58%), Gaps = 52/336 (15%)

Query: 7   PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDA-FYLDQHEVSNTQFQEFVS 65
           P  +RY     +P  TF MG N+ ++  DGE P R VT+ + F++D++EV+N++F  FV 
Sbjct: 482 PRHQRYNQEARIPAGTFTMGDNEELMPGDGEAPERRVTISSDFFIDKYEVTNSEFYRFVH 541

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
            TGYVTE+EKFGD+F F   + E+ +                      ++   V    W 
Sbjct: 542 ETGYVTESEKFGDSFCFIKYIPEKTK---------------------EKIELEVKETPW- 579

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
                  W    LP +                   SWLHP G ++ +    +HP VH+SW
Sbjct: 580 -------W----LPVQG-----------------ASWLHPVGPENDLTGIWDHPAVHISW 611

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           NDA  YC W   RLPTEAEWEY  RGGLE RLF WGN   P+G+H  N+WQG+FP+ NT 
Sbjct: 612 NDATEYCKWADKRLPTEAEWEYAARGGLEGRLFSWGNKDKPKGQHYMNIWQGDFPSVNTE 671

Query: 246 ADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
            DG+ STAPV  Y  +NK+GL+NM+GNVWEWTAD W   H      +PKGP TGTD+VKK
Sbjct: 672 EDGHGSTAPVDYYPSQNKYGLFNMLGNVWEWTADNWGTRHLGDAVTDPKGPPTGTDRVKK 731

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GGS+LC  QYCYR+R AAR  NTPDS+A NLGFRCA
Sbjct: 732 GGSFLCTTQYCYRYRNAARHYNTPDSAAQNLGFRCA 767


>gi|313240480|emb|CBY32814.1| unnamed protein product [Oikopleura dioica]
          Length = 779

 Score =  299 bits (766), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 156/336 (46%), Positives = 197/336 (58%), Gaps = 52/336 (15%)

Query: 7   PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDA-FYLDQHEVSNTQFQEFVS 65
           P  +RY     +P  TF MG N+ ++  DGE P R VT+ + F++D++EV+N++F  FV 
Sbjct: 482 PRHQRYNQEARIPAGTFTMGDNEELMPGDGEAPERRVTISSDFFIDKYEVTNSEFYRFVH 541

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
            TGYVTE+EKFGD+F F   + E+ +                      ++   V    W 
Sbjct: 542 ETGYVTESEKFGDSFCFIKYIPEKTK---------------------EKIELEVKETPW- 579

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
                  W    LP +                   SWLHP G ++ +    +HP VH+SW
Sbjct: 580 -------W----LPVQG-----------------ASWLHPVGPENDLTGIWDHPAVHISW 611

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           NDA  YC W   RLPTEAEWEY  RGGLE RLF WGN   P+G+H  N+WQG+FPT NT 
Sbjct: 612 NDATEYCKWADKRLPTEAEWEYAARGGLEGRLFSWGNKDKPKGQHYMNIWQGDFPTVNTE 671

Query: 246 ADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
            DG+ STAPV  Y  +N +GL+NM+GNVWEWTAD W   H      +PKGP TGTD+VKK
Sbjct: 672 EDGHGSTAPVDYYPSQNIYGLFNMLGNVWEWTADNWGTRHSGDAVTDPKGPPTGTDRVKK 731

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GGS+LC  QYCYR+R AAR  NTPDS+A NLGFRCA
Sbjct: 732 GGSFLCTTQYCYRYRNAARHYNTPDSAAQNLGFRCA 767



 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 43/69 (62%), Positives = 49/69 (71%), Gaps = 5/69 (7%)

Query: 331 SAGNLGFRCAAD-----KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 385
           +A N G R + D     KGP TGTD+VKKGGS+LC  QYCYR+R AAR  NTPDS+A NL
Sbjct: 703 TADNWGTRHSGDAVTDPKGPPTGTDRVKKGGSFLCTTQYCYRYRNAARHYNTPDSAAQNL 762

Query: 386 GFRCAADVS 394
           GFRCA   S
Sbjct: 763 GFRCARTAS 771


>gi|297670803|ref|XP_002813544.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Pongo abelii]
          Length = 349

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 133/186 (71%), Positives = 153/186 (82%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPEG DSTI HR++HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTIRHRLDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341

Query: 341 ADKGPT 346
           AD+ PT
Sbjct: 342 ADRLPT 347


>gi|195336229|ref|XP_002034744.1| GM14301 [Drosophila sechellia]
 gi|194127837|gb|EDW49880.1| GM14301 [Drosophila sechellia]
          Length = 344

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 159/332 (47%), Positives = 195/332 (58%), Gaps = 59/332 (17%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           +  DM LLPG    MGT+KP    D E P R V L  FY+D++EVSN  F +FV  T Y 
Sbjct: 71  KIADMSLLPGGRVYMGTDKPHFPADREGPERQVKLKDFYIDKYEVSNEAFAKFVLHTNYT 130

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE++GD+F+F+ LLS  E+  +   R                    V    W   VA 
Sbjct: 131 TEAERYGDSFLFKSLLSPSEQKNLEDFR--------------------VASAVWWYKVAG 170

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             WR                              P G+DS I+H   HPVVHVSW DAV 
Sbjct: 171 VNWR-----------------------------RPNGVDSNIDHLGRHPVVHVSWRDAVE 201

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLP+EAEWE  CRGG E++LFPWGN L PR EH  N+WQG+FP  N A DG+ 
Sbjct: 202 YCKWAGKRLPSEAEWEAACRGGKEHKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFE 261

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            T+PV ++++N + L+NMVGNVWEWTAD W+V+     S NP       ++VKKGGSYLC
Sbjct: 262 YTSPVDAFRQNMYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLC 311

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           ++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 312 HKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 343



 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 34/43 (79%), Positives = 40/43 (93%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 301 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 343


>gi|194746826|ref|XP_001955855.1| GF24897 [Drosophila ananassae]
 gi|190623137|gb|EDV38661.1| GF24897 [Drosophila ananassae]
          Length = 342

 Score =  298 bits (764), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 157/328 (47%), Positives = 189/328 (57%), Gaps = 59/328 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+PG  F +GTNKP    D E P R   L+ FYLD++EVSN  F  FV AT Y TEAE
Sbjct: 73  MSLIPGGIFTVGTNKPHFHADREGPERQEKLNDFYLDKYEVSNDAFSIFVEATNYTTEAE 132

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+F+F+ LL  +E+  +  +R                                    
Sbjct: 133 HFGDSFMFKTLLGADEQKAMEHLR------------------------------------ 156

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W Y   G          SW HP G D  +E    HPVVHV+W DAVAYCTW
Sbjct: 157 ----VASAPWWYKVSGV---------SWRHPNGKDKGLEGLGQHPVVHVTWRDAVAYCTW 203

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+EAEWE  CRGG E +LFPWGN L PR +H  N+WQG+FP  N A DG+  T P
Sbjct: 204 IGKRLPSEAEWEVACRGGKERKLFPWGNKLMPREQHWLNIWQGDFPEGNLAEDGFEFTCP 263

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++++N F LYNMVGNVWEWTAD W+ +            +   ++VKKGGSYLC++ Y
Sbjct: 264 VDAFRQNAFDLYNMVGNVWEWTADLWDTN----------DVSESPNRVKKGGSYLCHKSY 313

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           CYR+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 314 CYRYRCAARSQNTEDSSAGNLGFRCAKD 341



 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 35/44 (79%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA D 
Sbjct: 299 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKDA 342


>gi|269128996|ref|YP_003302366.1| Non-specific serine/threonine protein kinase [Thermomonospora
           curvata DSM 43183]
 gi|268313954|gb|ACZ00329.1| Non-specific serine/threonine protein kinase [Thermomonospora
           curvata DSM 43183]
          Length = 303

 Score =  298 bits (764), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 166/340 (48%), Positives = 197/340 (57%), Gaps = 56/340 (16%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           P E  K MV +PG TFRMG   P    +DGE P R V L  F +D++ VSN QF  FV A
Sbjct: 13  PQEIAKGMVAIPGGTFRMGGEDPDAFPEDGEGPVRTVRLSPFLIDRYAVSNRQFAAFVKA 72

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TGYVT+AE++G +FVF   ++                              PV+     D
Sbjct: 73  TGYVTDAERYGWSFVFHAHVAPGT---------------------------PVM-----D 100

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
           AV            EA W     G           W  PEG  S+I  R NHPVVHVSWN
Sbjct: 101 AVV----------PEAPWWVAVPGAY---------WKAPEGPGSSITDRPNHPVVHVSWN 141

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DAVAY TW G RLPTEAEWE   RGGL+   +PWGN LTPRG HR N+WQG FP ++T  
Sbjct: 142 DAVAYATWAGKRLPTEAEWEMAARGGLDQARYPWGNELTPRGRHRCNIWQGTFPVHDTGE 201

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGTDKV 302
           DGY  TAPV ++  N +GLYN+ GNVWEW ADWW+   H    PA   +P+GP TGT +V
Sbjct: 202 DGYTGTAPVNAFAPNGYGLYNVAGNVWEWCADWWSADWHATESPATRIDPRGPETGTARV 261

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            KGGS+LC+E YC R+R AAR+ NTPDSSA + GFRCAAD
Sbjct: 262 TKGGSFLCHESYCNRYRVAARTCNTPDSSAAHTGFRCAAD 301


>gi|329924928|ref|ZP_08279875.1| hypothetical protein HMPREF9412_1633 [Paenibacillus sp. HGF5]
 gi|328940312|gb|EGG36641.1| hypothetical protein HMPREF9412_1633 [Paenibacillus sp. HGF5]
          Length = 300

 Score =  298 bits (763), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 160/336 (47%), Positives = 197/336 (58%), Gaps = 54/336 (16%)

Query: 12  YKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           + DM++LPG TF MGT +     +DGE P+R+VT+  F +  H V+N ++Q FV AT YV
Sbjct: 15  HTDMIMLPGGTFTMGTYSNEGFPRDGEGPARSVTVSGFEISPHAVTNGEYQRFVEATDYV 74

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE+FG +FVFE L SE+ +A ++QV                                 
Sbjct: 75  TEAERFGWSFVFELLASEKTKAAVAQV--------------------------------- 101

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                   P E  W     G           W  PEG DS+IE RM+HPVVH+SWNDA A
Sbjct: 102 --------PQEVPWWLVVEGAY---------WAAPEGADSSIEGRMDHPVVHISWNDAAA 144

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLPTEAEWEY  RGGLE R +PWG+ L   GEH+ N+WQG+FP  N A+DGY+
Sbjct: 145 YCQWAGVRLPTEAEWEYAARGGLEGRTYPWGDLLKQDGEHQCNIWQGKFPVKNNASDGYI 204

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--GPTTGTDKVKKGGSY 308
            TAPV +YK N +GLYNM GNVWEW  DW++  +H   S N    G  TG  +  +GGSY
Sbjct: 205 GTAPVDAYKPNGYGLYNMSGNVWEWCGDWFSPSYHQQTSANNPFYGEPTGR-RSMRGGSY 263

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           LC+  YC R+R AARS NTPDSS GN GFR   D G
Sbjct: 264 LCHRSYCNRYRVAARSGNTPDSSTGNCGFRVVRDDG 299


>gi|444707484|gb|ELW48757.1| Sulfatase-modifying factor 1 [Tupaia chinensis]
          Length = 415

 Score =  298 bits (762), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 165/368 (44%), Positives = 215/368 (58%), Gaps = 55/368 (14%)

Query: 4   LPAP-PVERYKD---MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
           +P P P ER  +   MV +P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +
Sbjct: 76  VPGPVPGERQPEATKMVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAE 135

Query: 60  FQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV 119
           F++FV++TGY+TEAEKFGD+FVFE +LSE+ +  I Q   D + F G+ S      + P 
Sbjct: 136 FEKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTGIQQAMED-EDFSGV-SAARTESNVPP 193

Query: 120 VHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHP 179
            +IS         W                  L ++L P G  L  + +   I+     P
Sbjct: 194 ANISL-------FWVA----------------LRSQLLPGGYLL--KALTGDIQK---GP 225

Query: 180 VVHVSWNDAVAYCTWRGA-RLPTEAEWEYGCRGGLENR--------------------LF 218
           ++       + + T  G  RLPT    E GC   L                       LF
Sbjct: 226 ILLSYTGQIIQFSTSPGTMRLPTARGRESGCPQKLSGNTAVEEAYKIVQNDADVPSILLF 285

Query: 219 PWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
           PWGN L PRG+H AN+WQGEFP  NTA DG+  TAPV ++  N +GLYN+VGN WEWT+D
Sbjct: 286 PWGNKLQPRGQHYANIWQGEFPVTNTAEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSD 345

Query: 279 WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           WW V+H    + NPKGP +G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFR
Sbjct: 346 WWTVYHSVEETLNPKGPPSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFR 405

Query: 339 CAADKGPT 346
           CAAD+ PT
Sbjct: 406 CAADRLPT 413


>gi|195489995|ref|XP_002092959.1| GE21047 [Drosophila yakuba]
 gi|194179060|gb|EDW92671.1| GE21047 [Drosophila yakuba]
          Length = 336

 Score =  298 bits (762), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 157/332 (47%), Positives = 197/332 (59%), Gaps = 59/332 (17%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           +  DM L+PG T  +GT++P    D E P R V L+ FY+D++EVSN  F +FV  T Y 
Sbjct: 63  KIADMSLVPGGTVYVGTDQPHFPADREAPERQVKLNDFYIDKYEVSNDAFAKFVLQTNYT 122

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE+FGD+F+F+ LLS  E+  +   R                    V    W   VA 
Sbjct: 123 TEAERFGDSFLFKSLLSPSEQKDLEDFR--------------------VASAVWWYKVAG 162

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             WR                             HP G+DS +++   HPVVHVSW DAV 
Sbjct: 163 VNWR-----------------------------HPNGVDSNLDNLGRHPVVHVSWRDAVE 193

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLP+EAEWE  CRGG E +LFPWGN L PR EH  N+WQG+FP  N A DG+ 
Sbjct: 194 YCEWAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFE 253

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            T+PV ++++N + L+NMVGNVWEWTAD W+V+     S NP       ++VKKGGSYLC
Sbjct: 254 YTSPVDAFRQNLYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLC 303

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           ++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 304 HKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 335



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 293 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 336


>gi|251794995|ref|YP_003009726.1| non-specific serine/threonine protein kinase [Paenibacillus sp.
           JDR-2]
 gi|247542621|gb|ACS99639.1| Non-specific serine/threonine protein kinase [Paenibacillus sp.
           JDR-2]
          Length = 332

 Score =  296 bits (759), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 154/326 (47%), Positives = 195/326 (59%), Gaps = 52/326 (15%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ +PG TF MGT+ K     DGE P  +VT+++FY+D+  V+N QF  F++ATGYVTEA
Sbjct: 50  MLFIPGGTFLMGTDDKEGFAADGEGPVHSVTVNSFYMDETAVTNEQFGRFIAATGYVTEA 109

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           ++FG ++VF   +SE E AK+  V                                    
Sbjct: 110 QRFGWSYVFHLFVSEAEAAKVKTV------------------------------------ 133

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                P  A W Y   G           W HPEG DS IE R +HPV+HVSWNDA+AYC 
Sbjct: 134 -----PQTAPWWYPVEGA---------DWNHPEGPDSHIEGREDHPVIHVSWNDAMAYCE 179

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGGL  + +PWG+ L   GEH  N+WQG+FP  N A+DGYL TA
Sbjct: 180 WAGRRLPTEAEWEYAGRGGLAQKRYPWGDELKQGGEHNCNIWQGKFPVKNHASDGYLGTA 239

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           P  S++ N +GLYNM GNVWEW +DW++  ++    + NP GP TG  +V +GGSYLC+ 
Sbjct: 240 PAKSFRPNGYGLYNMAGNVWEWCSDWFSPSYYQSGENDNPLGPRTGEGRVMRGGSYLCHS 299

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFR 338
            YC R+R AARS+NTPDSS GN GFR
Sbjct: 300 SYCNRYRVAARSKNTPDSSTGNCGFR 325


>gi|94536908|ref|NP_001035430.1| sulfatase-modifying factor 1 precursor [Danio rerio]
 gi|92096356|gb|AAI15143.1| Zgc:136465 [Danio rerio]
 gi|182890128|gb|AAI64368.1| Zgc:136465 protein [Danio rerio]
          Length = 335

 Score =  296 bits (759), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 157/328 (47%), Positives = 186/328 (56%), Gaps = 85/328 (25%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +VLL G  F MGT+ P + +DGE P R V LDAFY+++HEV+N QFQ F + TGY+TEAE
Sbjct: 90  LVLLQGGWFLMGTDDPGIPQDGEGPQRKVKLDAFYIEEHEVTNQQFQHFTNQTGYITEAE 149

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+FVFE LLSEE ++ +S                     H V    W   V    WR
Sbjct: 150 RFGDSFVFEGLLSEEVKSTLS---------------------HAVAAAPWWSPVKGADWR 188

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                        HPEG DSTI +RMNHP +HVSW+DA AYC W
Sbjct: 189 -----------------------------HPEGPDSTIHNRMNHPALHVSWDDARAYCQW 219

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
              RLPTEAEWE  CRGGL++R                                      
Sbjct: 220 AKRRLPTEAEWELACRGGLQDRCIS----------------------------------- 244

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           VMS+  N FGLY+MVGN WEWTADWW VHH      NPKGP +GTD+VKKGGSY+C++ Y
Sbjct: 245 VMSFPANGFGLYDMVGNAWEWTADWWTVHHSAEDKLNPKGPESGTDRVKKGGSYMCHKSY 304

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           CYR+RCAARSQNTPDSSA NLGFRCA+D
Sbjct: 305 CYRYRCAARSQNTPDSSASNLGFRCASD 332


>gi|194378504|dbj|BAG63417.1| unnamed protein product [Homo sapiens]
          Length = 349

 Score =  296 bits (759), Expect = 9e-78,   Method: Compositional matrix adjust.
 Identities = 133/186 (71%), Positives = 152/186 (81%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341

Query: 341 ADKGPT 346
           AD+ PT
Sbjct: 342 ADRLPT 347


>gi|257470975|ref|NP_001158146.1| sulfatase-modifying factor 1 isoform 2 precursor [Homo sapiens]
 gi|119584310|gb|EAW63906.1| sulfatase modifying factor 1, isoform CRA_a [Homo sapiens]
          Length = 349

 Score =  296 bits (759), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 133/186 (71%), Positives = 152/186 (81%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341

Query: 341 ADKGPT 346
           AD+ PT
Sbjct: 342 ADRLPT 347


>gi|449685727|ref|XP_002155473.2| PREDICTED: sulfatase-modifying factor 1-like [Hydra magnipapillata]
          Length = 401

 Score =  296 bits (758), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 150/326 (46%), Positives = 193/326 (59%), Gaps = 50/326 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +V   G  F +GT+KP+   DGE P+R V++  F +D +EVSN +F+EFV  TGY TEAE
Sbjct: 117 LVEFQGGDFIIGTDKPVFPDDGESPAREVSIQPFAIDMYEVSNGEFREFVKETGYKTEAE 176

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            +G++FV E  LSEE    I++                              AV    W 
Sbjct: 177 TYGNSFVMEYFLSEEVNKGITE------------------------------AVQGAPWW 206

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W +PEG  S + +R++HPVVHVSWNDA+AYC W
Sbjct: 207 ---LPVKG-----------------ANWKNPEGPGSNLINRLSHPVVHVSWNDALAYCRW 246

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G RLPTEAEWE+ C  G ++ LFPWGN   P G  +AN+W G+FP  NTA DGY  TAP
Sbjct: 247 KGKRLPTEAEWEFACSSGEDDMLFPWGNEFRPHGTFKANIWTGKFPNENTAEDGYNGTAP 306

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V  + +N   L NMVGNVWEW ADWW+  H      +P GP +G  KVKKGGS++C++ Y
Sbjct: 307 VNMFVQNNKKLCNMVGNVWEWVADWWHTDHSKEHQKDPYGPPSGDKKVKKGGSFMCSKNY 366

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCA 340
           CYR+RCA+RS NTPDS + NLGFRCA
Sbjct: 367 CYRYRCASRSYNTPDSGSSNLGFRCA 392


>gi|402859539|ref|XP_003894212.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Papio anubis]
          Length = 349

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 133/186 (71%), Positives = 152/186 (81%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTIWHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341

Query: 341 ADKGPT 346
           AD+ PT
Sbjct: 342 ADRLPT 347


>gi|271964559|ref|YP_003338755.1| non-specific serine/threonine protein kinase [Streptosporangium
           roseum DSM 43021]
 gi|270507734|gb|ACZ86012.1| Non-specific serine/threonine protein kinase [Streptosporangium
           roseum DSM 43021]
          Length = 308

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 159/337 (47%), Positives = 192/337 (56%), Gaps = 52/337 (15%)

Query: 7   PPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           P   R   +V L G TFRMGT+ +     DGE P R VTL  F L    V+N +F  FV 
Sbjct: 22  PARRRASGLVRLEGGTFRMGTDDRDGFPSDGEGPVRAVTLRPFRLASTAVTNARFAAFVK 81

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           ATGY T+AE+FG ++VF  L+ E  RA                                 
Sbjct: 82  ATGYETDAERFGWSYVFHLLVPEAMRAS-------------------------------- 109

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
                     A  P    W  G  G          +W  PEG  ST+  R +HPVVHVSW
Sbjct: 110 ----------APSPAATPWWLGIEGA---------AWHAPEGPGSTVADRQDHPVVHVSW 150

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
            DA AYC W G RLPTEAEWEY  RGGL+   +PWG+ LTPRG HR N+WQG FP +NT 
Sbjct: 151 QDAQAYCAWSGGRLPTEAEWEYAARGGLDQARYPWGDELTPRGGHRCNIWQGRFPVHNTE 210

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKG 305
            DG+L+TAPV +Y+ N +GL+NMVGNVWEW ADW+ V H   P  +P+GP  G  +V +G
Sbjct: 211 EDGHLATAPVKAYRPNGYGLHNMVGNVWEWCADWFAVDHLARPLEDPRGPDDGESRVLRG 270

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GSYLC++ YC R+R AARS NTPDSS+GN GFR AAD
Sbjct: 271 GSYLCHDSYCNRYRVAARSSNTPDSSSGNTGFRLAAD 307


>gi|379722178|ref|YP_005314309.1| Non-specific serine/threonine protein kinase [Paenibacillus
           mucilaginosus 3016]
 gi|378570850|gb|AFC31160.1| Non-specific serine/threonine protein kinase [Paenibacillus
           mucilaginosus 3016]
          Length = 404

 Score =  295 bits (755), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 161/333 (48%), Positives = 190/333 (57%), Gaps = 52/333 (15%)

Query: 12  YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           +  M+ LPG TFRMGT+ +    +DGE PSR VTLD+FYLD   V+N QF  FV  TGY 
Sbjct: 121 HAGMIRLPGGTFRMGTDDREGFPQDGEGPSRLVTLDSFYLDACAVTNEQFLAFVRDTGYK 180

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE +G +FVF   + E  R+ +                               DAV  
Sbjct: 181 TEAELYGWSFVFHLFVPERLRSSV-------------------------------DAVV- 208

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                      A W +   G          SW  PEG  + I  RM+HPVVHVSWNDA A
Sbjct: 209 ---------QSAPWWWKVDGA---------SWHAPEGPGTGIGERMDHPVVHVSWNDAQA 250

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G RLPTEAEWEY  RGGLE R + WG+ L P G+HR N+WQG FP  N A DGY 
Sbjct: 251 YARWAGKRLPTEAEWEYAARGGLEGRRYAWGDTLLPDGQHRCNIWQGTFPQVNEALDGYA 310

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
            TAP  S+  N FG Y+M GNVWEW ADW++  +H   PS NP GP  G  K  KGGSYL
Sbjct: 311 GTAPARSFPPNGFGFYHMAGNVWEWCADWFSPAYHRTTPSLNPAGPPQGQTKTLKGGSYL 370

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+  YC R+R AAR+ NTPDSS G++GFRCAAD
Sbjct: 371 CHSSYCNRYRVAARTSNTPDSSTGHMGFRCAAD 403


>gi|386724920|ref|YP_006191246.1| Non-specific serine/threonine protein kinase [Paenibacillus
           mucilaginosus K02]
 gi|384092045|gb|AFH63481.1| Non-specific serine/threonine protein kinase [Paenibacillus
           mucilaginosus K02]
          Length = 397

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 161/333 (48%), Positives = 190/333 (57%), Gaps = 52/333 (15%)

Query: 12  YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           +  M+ LPG TFRMGT+ +    +DGE PSR VTLD+FYLD   V+N QF  FV  TGY 
Sbjct: 114 HAGMIRLPGGTFRMGTDDREGFPQDGEGPSRLVTLDSFYLDACAVTNEQFLAFVRDTGYK 173

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE +G +FVF   + E  R+ +                               DAV  
Sbjct: 174 TEAELYGWSFVFHLFVPERLRSSV-------------------------------DAVV- 201

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                      A W +   G          SW  PEG  + I  RM+HPVVHVSWNDA A
Sbjct: 202 ---------QSAPWWWKVDGA---------SWHAPEGPGTGIGERMDHPVVHVSWNDAQA 243

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G RLPTEAEWEY  RGGLE R + WG+ L P G+HR N+WQG FP  N A DGY 
Sbjct: 244 YARWAGKRLPTEAEWEYAARGGLEGRRYAWGDTLLPDGQHRCNIWQGTFPQVNEALDGYA 303

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
            TAP  S+  N FG Y+M GNVWEW ADW++  +H   PS NP GP  G  K  KGGSYL
Sbjct: 304 GTAPARSFPPNGFGFYHMAGNVWEWCADWFSPAYHRTTPSLNPAGPPQGQTKTLKGGSYL 363

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+  YC R+R AAR+ NTPDSS G++GFRCAAD
Sbjct: 364 CHSSYCNRYRVAARTSNTPDSSTGHMGFRCAAD 396


>gi|337749226|ref|YP_004643388.1| Non-specific serine/threonine protein kinase [Paenibacillus
           mucilaginosus KNP414]
 gi|336300415|gb|AEI43518.1| Non-specific serine/threonine protein kinase [Paenibacillus
           mucilaginosus KNP414]
          Length = 404

 Score =  295 bits (755), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 161/333 (48%), Positives = 190/333 (57%), Gaps = 52/333 (15%)

Query: 12  YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           +  M+ LPG TFRMGT+ +    +DGE PSR VTLD+FYLD   V+N QF  FV  TGY 
Sbjct: 121 HAGMIRLPGGTFRMGTDDREGFPQDGEGPSRLVTLDSFYLDACAVTNEQFLAFVRDTGYK 180

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE +G +FVF   + E  R+ +                               DAV  
Sbjct: 181 TEAELYGWSFVFHLFVPERLRSSV-------------------------------DAVV- 208

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                      A W +   G          SW  PEG  + I  RM+HPVVHVSWNDA A
Sbjct: 209 ---------QSAPWWWKVDGA---------SWHAPEGPGTGIGERMDHPVVHVSWNDAQA 250

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G RLPTEAEWEY  RGGLE R + WG+ L P G+HR N+WQG FP  N A DGY 
Sbjct: 251 YARWAGKRLPTEAEWEYAARGGLEGRRYAWGDTLLPDGQHRCNIWQGTFPQVNEALDGYA 310

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
            TAP  S+  N FG Y+M GNVWEW ADW++  +H   PS NP GP  G  K  KGGSYL
Sbjct: 311 GTAPARSFPPNGFGFYHMAGNVWEWCADWFSPAYHRTTPSLNPAGPPQGQTKTLKGGSYL 370

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+  YC R+R AAR+ NTPDSS G++GFRCAAD
Sbjct: 371 CHSSYCNRYRVAARTSNTPDSSTGHMGFRCAAD 403


>gi|403270416|ref|XP_003927178.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Saimiri
           boliviensis boliviensis]
          Length = 351

 Score =  295 bits (754), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 131/186 (70%), Positives = 152/186 (81%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 164 NWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLNNRLFPW 223

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 224 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 283

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            V+H    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 284 TVYHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 343

Query: 341 ADKGPT 346
           AD+ PT
Sbjct: 344 ADRPPT 349


>gi|397522436|ref|XP_003831273.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Pan paniscus]
          Length = 349

 Score =  295 bits (754), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 132/186 (70%), Positives = 152/186 (81%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341

Query: 341 ADKGPT 346
           A++ PT
Sbjct: 342 ANRLPT 347


>gi|354581053|ref|ZP_08999957.1| protein of unknown function DUF323 [Paenibacillus lactis 154]
 gi|353201381|gb|EHB66834.1| protein of unknown function DUF323 [Paenibacillus lactis 154]
          Length = 329

 Score =  295 bits (754), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 158/336 (47%), Positives = 195/336 (58%), Gaps = 52/336 (15%)

Query: 12  YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           + DM+ +P  TF MGTN K     DGE P+R+VT+  F +  + V+N ++Q FV  TGYV
Sbjct: 43  HTDMIAIPEGTFMMGTNSKEGFPSDGEGPARSVTVSGFEISPYAVTNREYQRFVEDTGYV 102

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAEKFG +FVFE L S+E +AK++QV                                 
Sbjct: 103 TEAEKFGWSFVFELLASDETKAKVTQV--------------------------------- 129

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                   P EA W     G           W  PEG  ST+E RM+HPVVHVSWNDA+A
Sbjct: 130 --------PQEAPWWLVVEGAY---------WAAPEGAASTLEGRMDHPVVHVSWNDAMA 172

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLPTEAEWEY  RGGLE+R +PWG+ L   GEH  N+WQG+FP  N A+DGY+
Sbjct: 173 YCEWAGVRLPTEAEWEYAARGGLEDRTYPWGDLLKQDGEHECNIWQGKFPIKNNASDGYI 232

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
            TAPV +YK N +GLYN+ GNVWEW  DW++  +H    + NP        +  +GGSYL
Sbjct: 233 GTAPVDAYKPNGYGLYNVSGNVWEWCGDWFSPSYHQQTSASNPFYAEPTGRRSMRGGSYL 292

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           C+  YC R+R AARS NTPDSS GN GFR   D  P
Sbjct: 293 CHRSYCNRYRVAARSGNTPDSSTGNCGFRVVRDSLP 328


>gi|332815966|ref|XP_003309636.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Pan troglodytes]
 gi|410218332|gb|JAA06385.1| sulfatase modifying factor 1 [Pan troglodytes]
 gi|410259884|gb|JAA17908.1| sulfatase modifying factor 1 [Pan troglodytes]
 gi|410308066|gb|JAA32633.1| sulfatase modifying factor 1 [Pan troglodytes]
 gi|410336753|gb|JAA37323.1| sulfatase modifying factor 1 [Pan troglodytes]
          Length = 349

 Score =  295 bits (754), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 132/186 (70%), Positives = 152/186 (81%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341

Query: 341 ADKGPT 346
           A++ PT
Sbjct: 342 ANRLPT 347


>gi|395824559|ref|XP_003785530.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Otolemur
           garnettii]
          Length = 349

 Score =  295 bits (754), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 130/186 (69%), Positives = 151/186 (81%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPEG DS + HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPW
Sbjct: 162 NWRHPEGPDSNVLHRQDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPW 221

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSS+ NLGFRCA
Sbjct: 282 TVHHSIEETVNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSSSNLGFRCA 341

Query: 341 ADKGPT 346
           AD  PT
Sbjct: 342 ADHLPT 347


>gi|403722089|ref|ZP_10944830.1| hypothetical protein GORHZ_053_00140 [Gordonia rhizosphera NBRC
           16068]
 gi|403206805|dbj|GAB89161.1| hypothetical protein GORHZ_053_00140 [Gordonia rhizosphera NBRC
           16068]
          Length = 318

 Score =  294 bits (753), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 156/328 (47%), Positives = 186/328 (56%), Gaps = 52/328 (15%)

Query: 16  VLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           V LPG  F MGT+ P+    DGE P R V +DAF + +  V+N +F  F+ ATGYVT+AE
Sbjct: 40  VDLPGGEFLMGTDDPVGFPDDGEGPVRPVFVDAFRMAECPVTNDEFAAFIEATGYVTDAE 99

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           ++G +FVF   L  + R                                          R
Sbjct: 100 RYGWSFVFASFLPAQLR------------------------------------------R 117

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           GA  P +A W  G  G +         W  PEG  ST+E R  HPVVHVSWNDA AYC W
Sbjct: 118 GAPRPEQAPWWCGVSGAM---------WSAPEGPGSTVEDRGRHPVVHVSWNDAQAYCAW 168

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            GARLPTEAEWEY  RGGL  + + WG+ L P G H  N+WQG FPT NTA DGY  T P
Sbjct: 169 AGARLPTEAEWEYAARGGLIQKRYVWGDELVPDGVHMCNIWQGSFPTKNTAEDGYRGTCP 228

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N FGLYNM GNVW+W ADWW V H P    NP GP  G  +V +GGSYLC++ Y
Sbjct: 229 VDAFPPNGFGLYNMAGNVWQWCADWWGVDHSPGRCVNPHGPKGGPARVIRGGSYLCHDSY 288

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C R+R AAR+ N PDS+ GN GFRCAAD
Sbjct: 289 CNRYRVAARTANEPDSTTGNTGFRCAAD 316


>gi|344998181|ref|YP_004801035.1| hypothetical protein SACTE_0555 [Streptomyces sp. SirexAA-E]
 gi|344313807|gb|AEN08495.1| protein of unknown function DUF323 [Streptomyces sp. SirexAA-E]
          Length = 321

 Score =  294 bits (752), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 162/340 (47%), Positives = 190/340 (55%), Gaps = 56/340 (16%)

Query: 4   LPAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           L  PPV     M LLPG TF MG  ++     DGE P R V L AF ++ H VSN +F  
Sbjct: 36  LAVPPVT----MPLLPGGTFLMGAQDQDGFASDGEGPVREVRLSAFRVEAHAVSNERFAR 91

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV+ATG+VTEAE+FG ++VF   L    R                               
Sbjct: 92  FVAATGHVTEAERFGWSYVFAGFLPAALR------------------------------- 120

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
                      RGA  P    W  G  G           W  PEG  S ++ R +HPVVH
Sbjct: 121 -----------RGAPRPDGTPWWCGVEGA---------RWNAPEGPGSGLDGREDHPVVH 160

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           VSWNDA AYC W G RLPTEAEWEY  RGGLE R +PWG+ LTP GEHR N+WQG FP  
Sbjct: 161 VSWNDARAYCRWAGTRLPTEAEWEYAARGGLEQRRYPWGDELTPGGEHRCNIWQGRFPVK 220

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKV 302
           NT  DGY  TAPV +Y  N FGL+NM GNVWEW AD W   H      +P+GP +G  +V
Sbjct: 221 NTVEDGYAGTAPVDAYPPNGFGLHNMAGNVWEWCADRWGTDHGDRRRTDPRGPASGGQRV 280

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            +GGSYLC+  YC R+R AAR++NTPDS+AGNLGFRC  D
Sbjct: 281 MRGGSYLCHHSYCNRYRVAARTRNTPDSTAGNLGFRCVRD 320


>gi|196006177|ref|XP_002112955.1| hypothetical protein TRIADDRAFT_26011 [Trichoplax adhaerens]
 gi|190584996|gb|EDV25065.1| hypothetical protein TRIADDRAFT_26011 [Trichoplax adhaerens]
          Length = 294

 Score =  293 bits (751), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 153/334 (45%), Positives = 200/334 (59%), Gaps = 55/334 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G    +GTNK  ++ D E PS    L  FY+D++EVSN +F++F+ ATG+ T+AE
Sbjct: 1   MVFLLGGLTTVGTNKIAILSDSEGPSYRAYLSPFYIDKYEVSNRKFRQFIHATGHRTQAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           + G +F+   +                 + +   S+ E      VV   W        W+
Sbjct: 61  ELGSSFILTAI----------------PKLKSTSSSQEKV----VVDSPW--------WK 92

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
            A+                       +W HPEG++S+I   ++HPVVH+SWNDA AYC W
Sbjct: 93  LAKN---------------------ANWNHPEGLNSSINGILDHPVVHMSWNDANAYCNW 131

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY  RGGL +RLFPWGNN  P+G+HR N+WQG+FP  NTA DGY+ TAP
Sbjct: 132 AGKRLPTEAEWEYAARGGLNDRLFPWGNNPIPKGQHRMNIWQGKFPRENTADDGYIGTAP 191

Query: 255 VM-SYKENKFGLYNMVGNVWEWTADWWN-----VHHHPAPSYNPKGPTTGTDKVKKGGSY 308
           V+ SY  N +G+YN +GNVWEW  DWW+     +      S NPKGP +G  KV+KGGSY
Sbjct: 192 VVNSYLPNAYGIYNTIGNVWEWVYDWWSRRDGQLRSRQDISNNPKGPNSGVKKVQKGGSY 251

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           LC++ YCYR+RCAAR+   PD+SA NLGFRCA D
Sbjct: 252 LCHKSYCYRYRCAARNAAPPDTSAINLGFRCAMD 285


>gi|195375301|ref|XP_002046440.1| GJ12899 [Drosophila virilis]
 gi|194153598|gb|EDW68782.1| GJ12899 [Drosophila virilis]
          Length = 342

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 154/329 (46%), Positives = 191/329 (58%), Gaps = 61/329 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+PG +  +GT++P    D E P R V +  FY+D+ EVSN  F +FV AT Y TEAE
Sbjct: 73  MSLIPGGSVNIGTDEPHFEADREAPERVVKIKDFYVDKFEVSNANFAKFVDATNYTTEAE 132

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+F+F+ +LS  ++A++   R                                    
Sbjct: 133 RFGDSFLFKTMLSPAQQAELEDYR------------------------------------ 156

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W Y   G          SW  P GI S ++    HPVVHVSW DAVAYC W
Sbjct: 157 ----VANALWWYKVSGV---------SWRKPNGISSNLQGLEQHPVVHVSWRDAVAYCAW 203

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+EAEWE  CRGG + +LFPWGN L P+ +H  N+WQGEFP  NTA DGYL T P
Sbjct: 204 AGKRLPSEAEWEVACRGGKQRKLFPWGNKLMPQDKHWLNIWQGEFPDGNTAQDGYLYTCP 263

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT-DKVKKGGSYLCNEQ 313
           V  +++N + LYNMVGNVWEWTAD W             G T+ + ++VKKGGSYLC++ 
Sbjct: 264 VDEFRQNVYDLYNMVGNVWEWTADLWQA-----------GDTSESPNRVKKGGSYLCHKS 312

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YCYR+RCAARSQNT DSSA NLGFRCA D
Sbjct: 313 YCYRYRCAARSQNTEDSSASNLGFRCAKD 341



 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 39/44 (88%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSA NLGFRCA D 
Sbjct: 299 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSASNLGFRCAKDA 342


>gi|281337593|gb|EFB13177.1| hypothetical protein PANDA_008710 [Ailuropoda melanoleuca]
          Length = 312

 Score =  293 bits (750), Expect = 9e-77,   Method: Compositional matrix adjust.
 Identities = 143/278 (51%), Positives = 178/278 (64%), Gaps = 50/278 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 85  MVLIPAGVFTMGTDDPKIKQDGEAPARKVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 144

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVFE +LSE+ +  I Q                              AVA   W 
Sbjct: 145 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 174

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +                   +W HPEG DST+ HR +HPV+HVSWNDA+AYCTW
Sbjct: 175 ---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVSWNDAIAYCTW 214

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAP
Sbjct: 215 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 274

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
           V ++  N +GLYN+VGNVWEWT+DWW VHH    + NP
Sbjct: 275 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEKTLNP 312


>gi|195135258|ref|XP_002012051.1| GI16647 [Drosophila mojavensis]
 gi|193918315|gb|EDW17182.1| GI16647 [Drosophila mojavensis]
          Length = 341

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 153/327 (46%), Positives = 191/327 (58%), Gaps = 61/327 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+PG +  +GT++P  I D E P R V  + FYLD++EVSN  F +FV AT Y TEAE
Sbjct: 72  MSLIPGGSQYIGTDEPHFIADHESPERLVKFNDFYLDKYEVSNANFAKFVEATNYTTEAE 131

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+F+F+ LL  E++ ++   R                                    
Sbjct: 132 RFGDSFIFKTLLKPEQQEELKDYR------------------------------------ 155

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W Y   G          SW  P GIDS +E   +HPVVHVSW DAVAYC W
Sbjct: 156 ----VANAVWWYKVSGV---------SWRRPNGIDSNLEGLEHHPVVHVSWRDAVAYCEW 202

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+EAEWE  CRGG + +LFPWGN L P+ +H  N+WQG+FP  NT  DGY  T P
Sbjct: 203 AGKRLPSEAEWEVACRGGKQRKLFPWGNKLMPKDKHWLNIWQGDFPDGNTEEDGYPFTCP 262

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           V  +++N + LYN+VGNVWEWTAD W      P+P           ++VKKGGSYLC++ 
Sbjct: 263 VDQFRQNNYDLYNIVGNVWEWTADLWQAGDTSPSP-----------NRVKKGGSYLCHKS 311

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCA 340
           YCYR+RCAARSQNT DSSAG+LGFRCA
Sbjct: 312 YCYRYRCAARSQNTEDSSAGSLGFRCA 338



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 37/54 (68%), Positives = 44/54 (81%), Gaps = 2/54 (3%)

Query: 340 AADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           A D  P+   ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAG+LGFRCA  V
Sbjct: 290 AGDTSPSP--NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGSLGFRCAKSV 341


>gi|195012382|ref|XP_001983615.1| GH15994 [Drosophila grimshawi]
 gi|193897097|gb|EDV95963.1| GH15994 [Drosophila grimshawi]
          Length = 341

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 153/329 (46%), Positives = 193/329 (58%), Gaps = 61/329 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+PG +  +GTN+P    D E P R V +  FYLD+HEVSN  F +F     Y TEAE
Sbjct: 72  MSLIPGGSLHIGTNEPHFEADHESPERVVKVQDFYLDKHEVSNANFAKFTEQINYKTEAE 131

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+F+F+ LL+ E++ ++   R                    VV+             
Sbjct: 132 RFGDSFLFKTLLTPEQQTELEDYR--------------------VVN------------- 158

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W Y  +G          SW  P GI+S ++    HPVVHVSW DAVAYCTW
Sbjct: 159 -------ALWWYKVKGV---------SWRKPNGINSNLQGLEQHPVVHVSWRDAVAYCTW 202

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+E+EWE  CRGG + +L+PWGN L P+ +H  N+WQGEFP  NT  DGY  T P
Sbjct: 203 AGKRLPSESEWEVACRGGKQRKLYPWGNKLMPKDQHWLNIWQGEFPDGNTQQDGYQFTCP 262

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT-DKVKKGGSYLCNEQ 313
           V  +++N + LYNMVGNVWEWTAD W             G T+ +  +VKKGGSYLC++ 
Sbjct: 263 VNEFRQNNYDLYNMVGNVWEWTADLWQA-----------GDTSESPARVKKGGSYLCHKS 311

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 312 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 340



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/43 (79%), Positives = 39/43 (90%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 299 RVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 341


>gi|288917453|ref|ZP_06411819.1| Non-specific serine/threonine protein kinase [Frankia sp. EUN1f]
 gi|288351156|gb|EFC85367.1| Non-specific serine/threonine protein kinase [Frankia sp. EUN1f]
          Length = 309

 Score =  292 bits (748), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 160/333 (48%), Positives = 192/333 (57%), Gaps = 53/333 (15%)

Query: 11  RYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           R   M+ LPG  F MGT ++     DGE P R + L  F ++   V+N QF  FV ATGY
Sbjct: 28  RTTRMIKLPGGRFLMGTEDEEGYPADGEGPVREIDLSPFAIEATTVTNAQFATFVKATGY 87

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            T+AE+   +FVFE  LSEE    +S+                   + PVVH        
Sbjct: 88  ATDAERHDFSFVFEGFLSEE----VSR-------------------NSPVVH-------- 116

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
                      E  W +   G           W HPEG  S+I  R NHPVVHVSWNDA 
Sbjct: 117 -----------EVPWWHAVAGAF---------WKHPEGPGSSIASRQNHPVVHVSWNDAQ 156

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AYC W G RLP+EAEWEY  RGGLE R +PWG+ L P+G H+ N+WQG FPT NT ADGY
Sbjct: 157 AYCAWAGTRLPSEAEWEYAARGGLEQRRYPWGDELMPKGRHQCNIWQGVFPTVNTGADGY 216

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSY 308
           L TAP  +Y+ N  GLYN+VGNVWEW ADW++   H   P  NP GP  G  KV +GGS+
Sbjct: 217 LGTAPAKAYRPNGHGLYNVVGNVWEWCADWFSADFHTTGPRTNPTGPPGGLAKVMRGGSH 276

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           +C+  YC R+R AARS NTPDSSAGN+GFR AA
Sbjct: 277 MCHSSYCNRYRVAARSSNTPDSSAGNIGFRVAA 309


>gi|374605458|ref|ZP_09678386.1| non-specific serine/threonine protein kinase [Paenibacillus
           dendritiformis C454]
 gi|374388957|gb|EHQ60351.1| non-specific serine/threonine protein kinase [Paenibacillus
           dendritiformis C454]
          Length = 371

 Score =  291 bits (744), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 154/330 (46%), Positives = 197/330 (59%), Gaps = 52/330 (15%)

Query: 15  MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
            +LL G TF MG+ +      DGE P R VT+  F +  + V+N +F  FV+ATGYVTEA
Sbjct: 90  FILLEGGTFAMGSEDAESFPADGEGPIREVTVGPFRIAPYAVTNEEFARFVAATGYVTEA 149

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG ++VF  L  + E       +H + R +G                 W        W
Sbjct: 150 ERFGWSYVFHLLAPDLEE------KHVIGRPQG---------------TPW--------W 180

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
            G R                        W  PEG  S++  RMNHPV+HVSW+DA AYC 
Sbjct: 181 LGVRQ---------------------ACWHRPEGPGSSVAERMNHPVIHVSWHDAQAYCE 219

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTE+EWE+  RGGLE R +PWG+ L P GEHR N+WQG+FP  N A+DGY+ TA
Sbjct: 220 WAGTRLPTESEWEFAARGGLERRRYPWGDLLKPDGEHRCNIWQGKFPVKNNASDGYIGTA 279

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           PV +++ N +GLYN+ GNVWEW ADW+  +     P+  P+GP TGT++V KGGSYLC++
Sbjct: 280 PVDAFEPNGYGLYNVAGNVWEWCADWFTANPDERGPADQPRGPQTGTERVMKGGSYLCHK 339

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC R+R  ARS+NTPDSS GNLGFRCAAD
Sbjct: 340 SYCNRYRVGARSKNTPDSSTGNLGFRCAAD 369


>gi|336178162|ref|YP_004583537.1| sulfatase-modifying factor protein [Frankia symbiont of Datisca
           glomerata]
 gi|334859142|gb|AEH09616.1| Sulphatase-modifying factor protein [Frankia symbiont of Datisca
           glomerata]
          Length = 278

 Score =  290 bits (742), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 157/329 (47%), Positives = 191/329 (58%), Gaps = 53/329 (16%)

Query: 15  MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV LPG  F MGT +      DGE P R + L+ F ++   V+N QF  FV+ATGYVT+A
Sbjct: 1   MVQLPGGRFLMGTEDNEGYRADGEGPVREIELNPFSIEPTTVTNAQFATFVNATGYVTDA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E++  +FVFE  LSEE                       +R+   V    W  AV+   W
Sbjct: 61  ERYDFSFVFEGFLSEEL----------------------NRVSSAVHTTPWWRAVSGAGW 98

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           +                             HPEG  S++  R NHPVVHVSWNDA +YC 
Sbjct: 99  K-----------------------------HPEGPGSSVASRQNHPVVHVSWNDAQSYCA 129

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGGLE R +PWG+ LTP G HR N+WQG+FP  NT  DGY+ TA
Sbjct: 130 WAGVRLPTEAEWEYAARGGLEQRRYPWGDELTPGGRHRCNIWQGDFPNLNTGEDGYIGTA 189

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           PV SY+ N +G+YN VGNVWEW ADW++   H   P  NP GP  G  KV +GGS++C+ 
Sbjct: 190 PVKSYRPNGYGIYNAVGNVWEWCADWFSATFHVTGPRTNPTGPPDGHAKVMRGGSHMCHA 249

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            YC R+R AARS NTPDSSAGN+GFR AA
Sbjct: 250 SYCNRYRVAARSSNTPDSSAGNIGFRTAA 278


>gi|194864509|ref|XP_001970974.1| GG14686 [Drosophila erecta]
 gi|190652757|gb|EDV50000.1| GG14686 [Drosophila erecta]
          Length = 303

 Score =  290 bits (742), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 154/332 (46%), Positives = 195/332 (58%), Gaps = 59/332 (17%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           +  DM L+PG T  +GT+K     D E P R V L+ FY+D++EVSN  F +FV  T Y 
Sbjct: 30  KIADMSLIPGGTVYVGTDKAHFPADREAPERQVKLNDFYIDKYEVSNDAFAKFVLHTNYT 89

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE+FGD+F+F+ LLS  E+  +   R                    V    W   VA 
Sbjct: 90  TEAERFGDSFLFKTLLSPSEQKDLEDFR--------------------VASAVWWYKVAG 129

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             WR                             HP G+ S +++   HPVVHVSW DAV 
Sbjct: 130 VNWR-----------------------------HPNGVGSNLDNLGRHPVVHVSWRDAVE 160

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLP+EAEWE  CRGG + +LFPWGN L PR EH  N+WQG+FP  N A DG+ 
Sbjct: 161 YCKWAGKRLPSEAEWETACRGGRDRKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFE 220

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            T+PV ++++N + L+N+VGNVWEWTAD W+V+     S NP       ++VKKGGSYLC
Sbjct: 221 YTSPVDAFRQNVYDLHNIVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLC 270

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           ++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 271 HKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 302



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/44 (77%), Positives = 40/44 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 260 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 303


>gi|332231571|ref|XP_003264968.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Nomascus
           leucogenys]
          Length = 349

 Score =  290 bits (741), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 131/186 (70%), Positives = 149/186 (80%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HPE  DSTI HR +HPV+HVSWNDA AYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEWPDSTIRHRPDHPVLHVSWNDAFAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341

Query: 341 ADKGPT 346
           AD  PT
Sbjct: 342 ADCLPT 347


>gi|409730291|ref|ZP_11271870.1| Non-specific serine/threonine protein kinase [Halococcus
           hamelinensis 100A6]
 gi|448723625|ref|ZP_21706141.1| Non-specific serine/threonine protein kinase [Halococcus
           hamelinensis 100A6]
 gi|445787164|gb|EMA37912.1| Non-specific serine/threonine protein kinase [Halococcus
           hamelinensis 100A6]
          Length = 332

 Score =  288 bits (738), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 154/338 (45%), Positives = 193/338 (57%), Gaps = 56/338 (16%)

Query: 11  RYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           R   MV L G  F MGT+  +   +DGE P+R V LD FY+D+  V+N +F  FV  TGY
Sbjct: 43  RTTRMVRLDGGRFTMGTDDGVGFPEDGEGPAREVALDPFYVDRFAVTNAEFLRFVRETGY 102

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            TEAE+FG +FVFE  ++  + + + Q                      V    W  AV 
Sbjct: 103 TTEAERFGWSFVFEDFVAPADESHVMQS---------------------VAAAPWWVAVE 141

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTI--EHRMNHPVVHVSWND 187
             TW                             L PEG  S++  + R+  PV HVSWND
Sbjct: 142 GATW-----------------------------LRPEGPSSSVVGDDRLKQPVAHVSWND 172

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G RLPTEAEWEY  RGGLE + +PWG++L P GEHR N+WQG+FP +NT  D
Sbjct: 173 AQAYADWAGKRLPTEAEWEYAARGGLEGKRYPWGDDLRPDGEHRCNIWQGDFPEHNTGDD 232

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKGPTTGTDKVKK 304
           GYL  APV +Y+ N FGLYN+ GNVWEW ADW++  +H   +Y   NP GP  G  +V +
Sbjct: 233 GYLGPAPVTAYEPNGFGLYNVSGNVWEWCADWFSSEYHTTEAYSHTNPTGPPDGDSRVMR 292

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGSYLC+  +C R+R AARSQNTPDSS GN+GFRC  D
Sbjct: 293 GGSYLCHRSWCNRYRVAARSQNTPDSSTGNIGFRCVVD 330


>gi|195089898|ref|XP_001997488.1| GH11358 [Drosophila grimshawi]
 gi|193905884|gb|EDW04751.1| GH11358 [Drosophila grimshawi]
          Length = 270

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 152/329 (46%), Positives = 192/329 (58%), Gaps = 61/329 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+P  +  +GTN+P    D E P R V +  FYLD+HEVSN  F +F     Y TEAE
Sbjct: 1   MSLIPCGSLHIGTNEPHFEADHESPERVVKVQDFYLDEHEVSNANFAKFTEQINYKTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD+F+F+ LL+ E++ ++   R                    VV+             
Sbjct: 61  RFGDSFLFKTLLTPEQQTELEVYR--------------------VVN------------- 87

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W Y  +G          SW  P GI+S ++    HPVVHVSW DAVAYCTW
Sbjct: 88  -------ALWWYKVKGV---------SWRKPNGINSNLQGLEQHPVVHVSWRDAVAYCTW 131

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+E+EWE  CRGG + +L+PWGN L P+ +H  N+WQGEFP  NT  DGY  T P
Sbjct: 132 AGKRLPSESEWEVACRGGKQRKLYPWGNKLMPKDQHWLNIWQGEFPDGNTQQDGYQFTCP 191

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT-DKVKKGGSYLCNEQ 313
           V  +++N + LYNMVGNVWEWTAD W             G T+ +  +VKKGGSYLC++ 
Sbjct: 192 VNEFRQNNYDLYNMVGNVWEWTADLWQA-----------GDTSESPARVKKGGSYLCHKS 240

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 241 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 269



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 34/43 (79%), Positives = 39/43 (90%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 228 RVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 270


>gi|345012061|ref|YP_004814415.1| sulfatase-modifying factor protein [Streptomyces violaceusniger Tu
           4113]
 gi|344038410|gb|AEM84135.1| Sulphatase-modifying factor protein [Streptomyces violaceusniger Tu
           4113]
          Length = 316

 Score =  283 bits (725), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 150/329 (45%), Positives = 186/329 (56%), Gaps = 52/329 (15%)

Query: 15  MVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV + G  F MG +      +DGE P R V + AF++D + VSN +F EFV ATG+VTEA
Sbjct: 38  MVRVAGGEFLMGADDVEGFPEDGEGPVRTVRVSAFHIDAYAVSNEKFAEFVDATGHVTEA 97

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+ G ++ F   L    R                                          
Sbjct: 98  ERIGWSYAFAGFLPAALR------------------------------------------ 115

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           RGA  P    W  G  G           W HPEG  ST+  R NHPVVH+SWNDA AYC 
Sbjct: 116 RGAPRPARIPWWCGVTGA---------RWDHPEGPGSTLAGRENHPVVHISWNDAAAYCA 166

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGGLE + +PWG++L   G +R N+W+G FPT NT ADGY  TA
Sbjct: 167 WAGKRLPTEAEWEYAARGGLEQKRYPWGDDLDLGGTYRCNIWRGAFPTRNTVADGYRGTA 226

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV +++ N FGLYN  GNVWEW ADWW V H    + +PKGP  G +KV +GGS+LC++ 
Sbjct: 227 PVDAFEPNGFGLYNTSGNVWEWCADWWTVRHQGDTAIDPKGPAIGQEKVIRGGSHLCHKS 286

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YC R+R AAR+ NTPD ++G+ GFRCA D
Sbjct: 287 YCNRYRVAARTANTPDGASGHTGFRCARD 315


>gi|253576938|ref|ZP_04854262.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
 gi|251843669|gb|EES71693.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
          Length = 329

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 154/333 (46%), Positives = 186/333 (55%), Gaps = 52/333 (15%)

Query: 12  YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           + DMV +P  TF MGTN      KDGE P+R V +  F +  + V+N QF+ FV ATGY 
Sbjct: 46  HSDMVTIPDGTFLMGTNTNEGFPKDGEGPAREVRVRGFQMSPYPVTNEQFKMFVDATGYR 105

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE FG +FVF  L SEE + ++  V                                 
Sbjct: 106 TEAESFGWSFVFHLLASEETKRRVKDV--------------------------------- 132

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                   P    W     G           W  PEG DS+IE RM+HPVVHVSWNDA A
Sbjct: 133 --------PQGVPWWLVVEGAY---------WAAPEGPDSSIEDRMDHPVVHVSWNDAEA 175

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           YC W G RLPTEAEWEY  RGGLE + +PWG+ L   GEH+ N+WQG+FP  N A+DGY+
Sbjct: 176 YCRWAGVRLPTEAEWEYAARGGLEGKTYPWGDLLKLDGEHQCNIWQGKFPVKNNASDGYV 235

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYL 309
            TAPV +YK N +GLY M GNVWEW ADW++  +H   P+ NP        +  +GGSYL
Sbjct: 236 GTAPVQTYKPNGYGLYQMSGNVWEWCADWFSPDYHRTMPADNPVYTAPTGSRSMRGGSYL 295

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+  YC R+R AARS NTPDSS GN GFR   D
Sbjct: 296 CHRSYCNRYRVAARSSNTPDSSTGNCGFRVVRD 328


>gi|167526896|ref|XP_001747781.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773885|gb|EDQ87521.1| predicted protein [Monosiga brevicollis MX1]
          Length = 412

 Score =  281 bits (719), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 162/362 (44%), Positives = 194/362 (53%), Gaps = 77/362 (21%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E    ++ L G  F MG++      DGE P+R V +  F +  HEVSN +F  FV+ATGY
Sbjct: 102 EEQAALLHLSGGVFTMGSDDGFFPDDGEGPARKVQVSPFLIGAHEVSNARFAAFVAATGY 161

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            TEAE+FG++FV E  +S    A+IS                               AVA
Sbjct: 162 RTEAERFGNSFVVEQFISPRISAQISS------------------------------AVA 191

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W    LP +                    W HPEG D+ I  R +HP VH+SWNDA 
Sbjct: 192 AAPWW---LPVDG-----------------ADWAHPEGPDTNISGRADHPAVHISWNDAN 231

Query: 190 AYCTW---RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE--FPTN-- 242
           A+C W   RG RLPTEAEWE+  RGGLE+RLFPWGN + P+GE R N WQ E   PT+  
Sbjct: 232 AFCRWSHPRG-RLPTEAEWEFAARGGLEHRLFPWGNKMQPKGEFRMNTWQSEMKIPTDAN 290

Query: 243 -------------------NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH 283
                              NTA DG+  TAPV SYK NKFGLYN VGNVWEWT DW    
Sbjct: 291 VFKHSFLPIHDGHAYYSAKNTAEDGFQLTAPVDSYKPNKFGLYNTVGNVWEWTNDWHTRV 350

Query: 284 HHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           H P    +P+GP  G  KVKKGGS++CN   CYR+R +AR   TPDSSA N+GFRCAAD 
Sbjct: 351 HSPDFVVDPRGPPAGDKKVKKGGSFMCNIFTCYRYRNSARMPLTPDSSAANVGFRCAADV 410

Query: 344 GP 345
            P
Sbjct: 411 SP 412


>gi|195436010|ref|XP_002065971.1| GK12600 [Drosophila willistoni]
 gi|194162056|gb|EDW76957.1| GK12600 [Drosophila willistoni]
          Length = 393

 Score =  281 bits (718), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 149/324 (45%), Positives = 190/324 (58%), Gaps = 59/324 (18%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
            + Y DM L+PG +  +GT++P    D E P   V L  FYLD++EVSN +F++FV AT 
Sbjct: 78  TDDYPDMSLIPGGSVLIGTDQPHFQADREAPEVRVDLKNFYLDKYEVSNRKFKDFVLATE 137

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           YVTEAE+FGD+F+F+ LLS  E+  +   R                              
Sbjct: 138 YVTEAERFGDSFLFKTLLSPAEQKNLEDFR------------------------------ 167

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                        A W Y  +G          SW  P G++S ++   +HPVVHVSW DA
Sbjct: 168 ----------VASAIWWYKVKGV---------SWRTPNGVNSNLDGLEDHPVVHVSWRDA 208

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           VA+CTW G RLPTEAEWE  CRGG + +LFPWGN L PR EH  N+WQG+FP  NTA DG
Sbjct: 209 VAFCTWAGKRLPTEAEWEAACRGGKKRKLFPWGNKLMPRDEHWLNIWQGDFPDGNTADDG 268

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
           Y  T PV  +++N++ LYN+VGNVWEWT+D W  +     S  P       ++VKKGGSY
Sbjct: 269 YEYTCPVNEFRQNEYDLYNIVGNVWEWTSDLWQTND---VSETP-------NRVKKGGSY 318

Query: 309 LCNEQYCYRHRCAARSQNTPDSSA 332
           LC++ YCYR+RCAARSQNT DSSA
Sbjct: 319 LCHKSYCYRYRCAARSQNTEDSSA 342



 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/33 (78%), Positives = 31/33 (93%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 382
           ++VKKGGSYLC++ YCYR+RCAARSQNT DSSA
Sbjct: 310 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSA 342


>gi|318058229|ref|ZP_07976952.1| non-specific serine/threonine protein kinase [Streptomyces sp.
           SA3_actG]
 gi|318081153|ref|ZP_07988485.1| non-specific serine/threonine protein kinase [Streptomyces sp.
           SA3_actF]
          Length = 306

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 156/341 (45%), Positives = 185/341 (54%), Gaps = 53/341 (15%)

Query: 4   LPAPPVERYKD-MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           +PAPP +   +  V L G TF MG  +      DGE P R V L  F +    V+N +F 
Sbjct: 16  VPAPPEDPGPEPSVALDGGTFLMGAEDADGFAADGEGPVREVRLSPFRIGTLAVTNARFA 75

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
            FV+ATG+VTEAE+FG +FVF   L    R                              
Sbjct: 76  AFVAATGHVTEAERFGWSFVFAAFLPGALR------------------------------ 105

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
                       R +  P    W  G  G          +W  PEG  S +  R +HPVV
Sbjct: 106 ------------RVSPRPDATPWWCGVSGA---------TWRAPEGPGSGLAERWDHPVV 144

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
           HV W DA AYC W G RLPTEAEWEY  RGGL  R +PWG+ LTP GEHR N+WQG FPT
Sbjct: 145 HVGWADAAAYCRWAGGRLPTEAEWEYAARGGLAGRRYPWGDELTPDGEHRCNIWQGRFPT 204

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK 301
            NTA DGY  TAP  ++  N  GL+N+ GNVWEW AD W   H   P+ +PKGP  GT +
Sbjct: 205 TNTAEDGYRGTAPAHAFPPNGHGLHNVAGNVWEWCADRWGTDHSARPASDPKGPRRGTSR 264

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           V +GGSYLC+  YC R+R AAR+ NTPDSSAGNLGFRCA D
Sbjct: 265 VMRGGSYLCHHSYCNRYRVAARTSNTPDSSAGNLGFRCAWD 305


>gi|357614993|gb|EHJ69415.1| sulfatase-modifying factor 1 [Danaus plexippus]
          Length = 267

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 149/326 (45%), Positives = 190/326 (58%), Gaps = 62/326 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+L+P  T+++GT++ ++  D E P + V L +FYLD++EVSN  F  F  +T Y TEAE
Sbjct: 1   MILIPSGTYQVGTDEILIDTDNEGPKKFVQLKSFYLDKYEVSNRDFTIFTESTDYKTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+FVF   L+   + ++    HD +  +                             
Sbjct: 61  TFGDSFVFTLFLNNTFKKQL----HDFRVMQ----------------------------- 87

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W Y   G          +W HP G DS +   M+HPV HVSWNDA AYC W
Sbjct: 88  -------APWWYKVYGT---------NWKHPLGPDSDVADLMDHPVTHVSWNDAKAYCAW 131

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RGARLPTEAEWE GCRG   N  +PWG+ L P  +H AN+WQG FP +N+A DGY+ T P
Sbjct: 132 RGARLPTEAEWEAGCRGDHYNITYPWGDKLFPDRQHMANIWQGTFPNHNSAKDGYVGTNP 191

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V  +++NK GL+NM GNVWEWT D W+         NPK      +KVKKGGSYLC+  Y
Sbjct: 192 VHLFEQNKLGLHNMAGNVWEWTEDSWSED-------NPK------EKVKKGGSYLCHRSY 238

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCA 340
           CYR+RC++RS NT DSSAGNLGFRCA
Sbjct: 239 CYRYRCSSRSHNTDDSSAGNLGFRCA 264



 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/41 (78%), Positives = 37/41 (90%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           +KVKKGGSYLC+  YCYR+RC++RS NT DSSAGNLGFRCA
Sbjct: 224 EKVKKGGSYLCHRSYCYRYRCSSRSHNTDDSSAGNLGFRCA 264


>gi|385677476|ref|ZP_10051404.1| hypothetical protein AATC3_16239 [Amycolatopsis sp. ATCC 39116]
          Length = 308

 Score =  280 bits (715), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 158/331 (47%), Positives = 186/331 (56%), Gaps = 53/331 (16%)

Query: 15  MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV LPG  FRMGT+       DGE P R V L  F +D++ VSN  F  FV+ TGYVTEA
Sbjct: 30  MVALPGGEFRMGTDDAEGFPADGEGPVRTVELSPFLIDRYAVSNDDFAAFVADTGYVTEA 89

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+ G ++VF   L  E R +IS  R D+                      W  AV    W
Sbjct: 90  ERIGWSYVFVRFLDRELR-RISP-RPDV--------------------TPWWAAVTGADW 127

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R                              PEG  S +  R +HPVVHVSW+DA AY  
Sbjct: 128 R-----------------------------RPEGPGSDLSGRGDHPVVHVSWHDAGAYAA 158

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG + R +PWG+ L P G H AN+WQG FP  +TA DGY  TA
Sbjct: 159 WAGKRLPTEAEWEYAARGGHDQRRYPWGDELNPGGRHLANIWQGRFPITDTAEDGYAGTA 218

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV ++  N FGL+NM GNVWEW AD W   H P  + +P GP+TGTD+V +GGSYLC+  
Sbjct: 219 PVDAFPPNDFGLHNMAGNVWEWCADRWGTDHPPR-ARDPIGPSTGTDRVTRGGSYLCHVS 277

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           YC R+R AARS NTPDSS GN GFRCA D G
Sbjct: 278 YCNRYRVAARSHNTPDSSGGNTGFRCARDPG 308


>gi|333023180|ref|ZP_08451244.1| putative Non-specific serine/threonine protein kinase [Streptomyces
           sp. Tu6071]
 gi|332743032|gb|EGJ73473.1| putative Non-specific serine/threonine protein kinase [Streptomyces
           sp. Tu6071]
          Length = 387

 Score =  278 bits (711), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 154/341 (45%), Positives = 184/341 (53%), Gaps = 53/341 (15%)

Query: 4   LPAPPVERYKD-MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           +PAPP +   +  V L G TF MG  +      DGE P R V L  F +    V+N +F 
Sbjct: 97  VPAPPEDPGPEPSVALDGGTFLMGAEDADGFAADGEGPVREVRLSPFRIGTLAVTNARFA 156

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
            FV+ATG+VTEAE+FG +FVF   L    R                              
Sbjct: 157 AFVAATGHVTEAERFGWSFVFAAFLPGALR------------------------------ 186

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
                       R +  P    W  G  G          +W  PEG  S +  R +HP V
Sbjct: 187 ------------RVSPRPDATPWWCGVSGA---------TWRAPEGPGSGLAERWDHPAV 225

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
           HV W DA AYC W G RLPTEAEWEY  RGGL  R +PWG+ LTP GEHR N+WQG FPT
Sbjct: 226 HVGWADAAAYCRWAGGRLPTEAEWEYAARGGLAGRRYPWGDELTPDGEHRCNIWQGRFPT 285

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK 301
            NTA DGY  TAP  ++  N  GL+N+ GNVWEW AD W   H   P+ +P+GP  GT +
Sbjct: 286 TNTAEDGYRGTAPAHAFPPNGHGLHNVAGNVWEWCADRWGTDHSARPASDPRGPRRGTSR 345

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           V +GGSYLC+  YC R+R AAR+ NTPDSSAGNLGFRCA D
Sbjct: 346 VMRGGSYLCHHSYCNRYRVAARTSNTPDSSAGNLGFRCAWD 386


>gi|429199588|ref|ZP_19191336.1| sulfatase-modifying factor 1 family protein [Streptomyces ipomoeae
           91-03]
 gi|428664672|gb|EKX63947.1| sulfatase-modifying factor 1 family protein [Streptomyces ipomoeae
           91-03]
          Length = 294

 Score =  276 bits (706), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 155/338 (45%), Positives = 192/338 (56%), Gaps = 53/338 (15%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           +P  +  + M+ LPG  F MGT+  +    DGE P R V L  F +    V+N QF EFV
Sbjct: 6   SPEAQHPRRMLDLPGGRFLMGTDDEVGYPADGEGPVREVELSPFRIAATTVTNEQFAEFV 65

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
            ATG+VTEAE F  +FVFE  L    RA    V                          W
Sbjct: 66  DATGHVTEAEHFDFSFVFEGFLPAHLRAVSPHV----------------------PGTPW 103

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
                   WRG R  T                     W HPEG  S+++  ++HPVVHVS
Sbjct: 104 --------WRGVRGAT---------------------WRHPEGPGSSVDAILDHPVVHVS 134

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           WNDA AYC W G RLPTEAEWEY  RGGLE + +PWG++L P G++R N+W+G FPT NT
Sbjct: 135 WNDARAYCAWSGTRLPTEAEWEYAARGGLEQQRYPWGDDLDPDGQYRCNIWRGTFPTLNT 194

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVK 303
           AADG+ STAP  +++ N FGLYN VGNVWEW AD ++   H   P  +P GP  G  +V 
Sbjct: 195 AADGHRSTAPADAFEPNGFGLYNTVGNVWEWCADPFSPDFHLTGPRTDPTGPPDGPIRVM 254

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           +GGS++C+  YC R+R AARS NTPDSSAGN+GFR AA
Sbjct: 255 RGGSHMCHASYCNRYRVAARSSNTPDSSAGNIGFRVAA 292


>gi|345012332|ref|YP_004814686.1| sulfatase-modifying factor protein [Streptomyces violaceusniger Tu
           4113]
 gi|344038681|gb|AEM84406.1| Sulphatase-modifying factor protein [Streptomyces violaceusniger Tu
           4113]
          Length = 326

 Score =  275 bits (702), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 152/339 (44%), Positives = 185/339 (54%), Gaps = 54/339 (15%)

Query: 13  KDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           + M+ +P  T+ MG +      +DGE P R V L  F +D   V+N QF  FV ++GY T
Sbjct: 36  RAMLRIPAGTYLMGGDDADAFPEDGEGPVRAVRLSPFLIDATAVTNRQFATFVRSSGYRT 95

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           +AE++G +FVF  L+  + R                             H+  +  VA  
Sbjct: 96  DAERYGWSFVFYALVHPDAR-----------------------------HLVRDGTVA-- 124

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                    +A W     G           W  P G  S+     NHPVVHVSW DA AY
Sbjct: 125 ---------QAPWWLAVDGAC---------WRAPYGPGSSWTDLSNHPVVHVSWRDASAY 166

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
            TW G RLPTEAEWE   RGGLE   FPWGN L PRG+HR N+WQGEFP  NT  DGYL 
Sbjct: 167 ATWAGKRLPTEAEWEMAARGGLERARFPWGNELLPRGQHRCNIWQGEFPQVNTGEDGYLG 226

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGTDKVKKGGS 307
           TAPV SY+ N FGLYN  GNVWEW +DWW+   H    P    +P GP  G  KV +GGS
Sbjct: 227 TAPVKSYRANNFGLYNTSGNVWEWCSDWWSTTWHAADRPETRQDPAGPPAGEAKVIRGGS 286

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           YLC+  YC R+R AAR+ NTPDSS G++GFRCAAD  P+
Sbjct: 287 YLCHASYCNRYRVAARTSNTPDSSTGHMGFRCAADLPPS 325


>gi|320106475|ref|YP_004182065.1| hypothetical protein AciPR4_1246 [Terriglobus saanensis SP1PR4]
 gi|319924996|gb|ADV82071.1| protein of unknown function DUF323 [Terriglobus saanensis SP1PR4]
          Length = 318

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 147/333 (44%), Positives = 191/333 (57%), Gaps = 56/333 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           ++MVLL G  F MGT+       DGE P R VT+  F++D++ V N +F++F   T Y T
Sbjct: 38  EEMVLLNGGNFLMGTDYAAGFPADGEGPVRRVTVSPFWIDRYTVRNREFEKFTETTNYQT 97

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDM-KRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           E+E+FG +FVFE  L++E +     +  +  +R EG D                      
Sbjct: 98  ESERFGWSFVFEGDLADELKVSQQVIGAEWWRRIEGAD---------------------- 135

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                                          W HPEG  S I+ R ++PVVHVSWNDA A
Sbjct: 136 -------------------------------WRHPEGPQSHIDSRADYPVVHVSWNDASA 164

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G RLPTEAEWE+  RGGLE  L+PWG++LTP GE+R N WQG FP+ N+A DG+ 
Sbjct: 165 YAVWAGKRLPTEAEWEFAARGGLEQVLYPWGDDLTPEGEYRCNTWQGIFPSYNSAEDGFA 224

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYL 309
           ST P  SY+ N +GLY M GNVWEW ADW++  +H  A   +P GP  G+ +V KGGSYL
Sbjct: 225 STCPSDSYQPNGYGLYCMTGNVWEWCADWFDPEYHVHATHIDPLGPPLGSARVLKGGSYL 284

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C++ YC R+R AARS NTPDS+  N+GFRC  D
Sbjct: 285 CHKSYCNRYRVAARSSNTPDSATTNMGFRCVRD 317


>gi|386722552|ref|YP_006188878.1| Non-specific serine/threonine protein kinase [Paenibacillus
           mucilaginosus K02]
 gi|384089677|gb|AFH61113.1| Non-specific serine/threonine protein kinase [Paenibacillus
           mucilaginosus K02]
          Length = 307

 Score =  273 bits (698), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 147/339 (43%), Positives = 185/339 (54%), Gaps = 54/339 (15%)

Query: 8   PVERYK--DMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P  R++  + +LL G  F MGTN P     DGE P R V ++ F +  + V+N QF  FV
Sbjct: 15  PSTRFQQNEFILLEGGDFLMGTNTPEGFPADGEGPVRKVRVNPFRISPYAVTNKQFSTFV 74

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
            ATGYVTEAE+FG +FVF  L SE+ R ++                              
Sbjct: 75  EATGYVTEAEQFGWSFVFHLLASEQIRQRV------------------------------ 104

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
                         P +  W     G           W  PEG DS+I  RM HPV HVS
Sbjct: 105 -----------VNAPQQVPWWLVVEGAY---------WAKPEGPDSSITDRMEHPVTHVS 144

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           W+DA AYC W G RLPTEAEWE+  RGGL  + +PWG+ L   G+H  N+WQG+FP  N 
Sbjct: 145 WHDAAAYCAWSGTRLPTEAEWEFAARGGLTGKTYPWGDQLKVDGKHMCNIWQGKFPIKNN 204

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-NPKGPTTGTDKVK 303
           A+DGY+ TAPV ++K N +GLYN+ GNVWEW ADW++  +H      NP        +  
Sbjct: 205 ASDGYIGTAPVDAFKPNGYGLYNLSGNVWEWCADWFSPKYHSRTDIDNPLYSEETAKRSI 264

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +GGSYLC+  YC R+R AARS NTPDSS GN+GFR  AD
Sbjct: 265 RGGSYLCHRSYCNRYRIAARSSNTPDSSTGNIGFRVVAD 303


>gi|414344129|ref|YP_006985650.1| hypothetical protein B932_3174 [Gluconobacter oxydans H24]
 gi|411029464|gb|AFW02719.1| hypothetical protein B932_3174 [Gluconobacter oxydans H24]
          Length = 381

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 147/335 (43%), Positives = 194/335 (57%), Gaps = 54/335 (16%)

Query: 12  YKDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           Y+DM L+    +R+G +   I I DGE P R+V +DAF++D++ V+N +F  FV+ATGYV
Sbjct: 90  YEDMALIGAGAYRVGADDSEIHISDGEGPVRSVEVDAFWIDKYAVTNQKFAAFVAATGYV 149

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE  G +FVF         A ++      +R  G                        
Sbjct: 150 TEAETEGWSFVF---------AGVATATGKKRRLRG------------------------ 176

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                  +P +  W    RG           W HPEG  S I  R +HPVVHVSW+DA A
Sbjct: 177 ------SVP-QTPWWLPVRGA---------QWRHPEGPGSHISSRQDHPVVHVSWHDAAA 220

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G RLPTEAEWE   RG +EN  +PWGN L P GEH+ N+WQG FP  NT  DGY+
Sbjct: 221 YAAWVGKRLPTEAEWEIAARGRMENFRYPWGNELLPDGEHQCNIWQGRFPEENTKEDGYV 280

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS----YNPKGPTTGTDKVKKGG 306
           +TAPV ++  N +GLYNMVGN WEW +D+W+   H A +    +NP GP  GTD+V +GG
Sbjct: 281 TTAPVTAFPPNGYGLYNMVGNTWEWCSDFWSPVWHQAENALTRHNPAGPPEGTDRVIRGG 340

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           S++C+  YC R+R +AR+  TPDSS G++GFRCA+
Sbjct: 341 SHMCHHSYCNRYRNSARTHTTPDSSLGHIGFRCAS 375


>gi|302547019|ref|ZP_07299361.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptomyces hygroscopicus ATCC 53653]
 gi|302464637|gb|EFL27730.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptomyces himastatinicus ATCC 53653]
          Length = 327

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 147/328 (44%), Positives = 182/328 (55%), Gaps = 54/328 (16%)

Query: 16  VLLPGDTFRMGTNKPIL-IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           + +PG TF MG++ P     DGE P R VT+  F +D H V+N +F  FV ATGY TEAE
Sbjct: 44  IRIPGGTFLMGSDDPDANPDDGEGPVRAVTVAPFAIDAHCVTNERFAAFVEATGYRTEAE 103

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FG ++VF    S E R                                          R
Sbjct: 104 RFGWSYVFATFASGELR------------------------------------------R 121

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
            +  P    W  G +G           W  PEG  S +  R +HPVVHVSW DA A+C W
Sbjct: 122 ISARPKSTPWWCGVQGAY---------WAAPEGPGSDLSGRSDHPVVHVSWTDASAFCQW 172

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G+RLPTE EWEY  RGGLE + FPWG+ LTP GEHR N+WQG FP++NTA DGYL TAP
Sbjct: 173 EGSRLPTEPEWEYAARGGLEGKRFPWGDELTPGGEHRCNIWQGRFPSHNTAEDGYLGTAP 232

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS--YNPKGPTTGTDKVKKGGSYLCNE 312
             +Y+ N +GLYN+ GNVWEW AD W      A      P  P  G ++V +GGSYLC++
Sbjct: 233 ATAYQPNGYGLYNVAGNVWEWCADRWATTPSRAEGSCTTPADPRQGAERVLRGGSYLCHD 292

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            YC R+R AAR+ N+PDSS+GN GFR A
Sbjct: 293 SYCNRYRVAARTSNSPDSSSGNTGFRVA 320


>gi|21225812|ref|NP_631591.1| hypothetical protein SCO7548 [Streptomyces coelicolor A3(2)]
 gi|11228445|emb|CAC16428.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
          Length = 314

 Score =  268 bits (685), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 149/339 (43%), Positives = 186/339 (54%), Gaps = 53/339 (15%)

Query: 13  KDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  V LPG  F MG         DGE P   V L  F++D+  V+N +F  FV ATG+VT
Sbjct: 24  RGQVRLPGGEFAMGDAFGEGYPADGETPVHTVRLRPFHIDETAVTNARFAAFVKATGHVT 83

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           +AE+FG + VF  +++  +   +                                     
Sbjct: 84  DAERFGSSAVFHLVVAAPDADVLGSA---------------------------------- 109

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                     A W    RG           W  PEG  S I  R NHPVVHVSWNDA AY
Sbjct: 110 --------AGAPWWINVRGA---------HWRRPEGARSDITGRPNHPVVHVSWNDATAY 152

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G RLPTEAEWEY  RGGL  R + WG+ LTP G  R N+WQG FP  NTA DG+LS
Sbjct: 153 ARWAGKRLPTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLS 212

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
           TAPV SY+ N  GL+N  GNVWEW +DW++  ++  +P+ +P GP TG  +V +GGSYLC
Sbjct: 213 TAPVKSYRPNGHGLWNTAGNVWEWCSDWFSPTYYAESPTVDPHGPGTGAARVLRGGSYLC 272

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
           ++ YC R+R AARS NTPDSS+GNLGFRCA D   T+G+
Sbjct: 273 HDSYCNRYRVAARSSNTPDSSSGNLGFRCANDADLTSGS 311


>gi|448406821|ref|ZP_21573253.1| Non-specific serine/threonine protein kinase [Halosimplex
           carlsbadense 2-9-1]
 gi|445676627|gb|ELZ29144.1| Non-specific serine/threonine protein kinase [Halosimplex
           carlsbadense 2-9-1]
          Length = 335

 Score =  268 bits (684), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 145/338 (42%), Positives = 189/338 (55%), Gaps = 56/338 (16%)

Query: 11  RYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           R   MV +    FRMGT+  +   +DGE P+R V  DA+Y+D++ V+N +F EFV  TG+
Sbjct: 46  RTDRMVRVDAGRFRMGTDSDVGFPEDGEGPAREVATDAYYVDRYAVTNAEFLEFVRETGH 105

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            T+AE+FG +FVFE  L+E +R  +                                   
Sbjct: 106 TTDAERFGWSFVFEDFLAESDREHVRD--------------------------------- 132

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTI--EHRMNHPVVHVSWND 187
                  R P  A+W     G           W HP G  S++  +  + HPV HVS  D
Sbjct: 133 -------RAPG-ADWWVAVAGA---------DWFHPRGPSSSVVADDLLTHPVTHVSHRD 175

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G RLPTEAEWE   RGG +   FPWG++L P GEHR NVWQGEFP  NTAAD
Sbjct: 176 AAAYADWAGKRLPTEAEWERAARGGRKGTRFPWGDHLEPDGEHRCNVWQGEFPERNTAAD 235

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKGPTTGTDKVKK 304
           GY +TAPV ++  N  GL+N+ GNVWEW  DW++  +H   +    NP GP  G ++V +
Sbjct: 236 GYRATAPVDAFDPNDSGLFNVCGNVWEWCRDWFDAEYHTTSASDRENPTGPGDGDERVMR 295

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGS+LC+E +C R+R AARS+N PDSS GN+GFRC  D
Sbjct: 296 GGSHLCHESWCNRYRLAARSKNGPDSSTGNIGFRCVVD 333


>gi|322435825|ref|YP_004218037.1| hypothetical protein AciX9_2214 [Granulicella tundricola MP5ACTX9]
 gi|321163552|gb|ADW69257.1| protein of unknown function DUF323 [Granulicella tundricola
           MP5ACTX9]
          Length = 324

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 149/336 (44%), Positives = 184/336 (54%), Gaps = 51/336 (15%)

Query: 7   PPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           PP      ++ LPG  F MGT+ +     DGE P R V L  F +D + V+N  F  FV 
Sbjct: 35  PPSPDASRIISLPGGEFLMGTDYEEAFAADGERPVRRVHLKPFQIDIYPVTNADFAAFVQ 94

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           AT Y TEAE +G +FVF   + + +R  +                               
Sbjct: 95  ATHYKTEAELYGWSFVFMNHIPKLQRGSLV-----------------------------T 125

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
           D VA   W          W   C+        P   W  PEG  S +  R +HPVVHV+W
Sbjct: 126 DTVAGAEW----------W---CQ-------VPGAVWNAPEGPGSDVARRSDHPVVHVTW 165

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           NDA+AYC W G RLPTEAEWEY  RGGL  +L+PWGN L P GEHR N+WQG FP  +TA
Sbjct: 166 NDALAYCDWSGNRLPTEAEWEYAARGGLVQKLYPWGNQLRPNGEHRCNIWQGTFPHEDTA 225

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKK 304
            DGY  T+PV +Y  N FGLY++ GN WEW  DW+ +       + +P GP TGT +V K
Sbjct: 226 EDGYAGTSPVKAYPPNGFGLYSITGNAWEWCMDWFGIPDSVDTVTEDPVGPPTGTVRVMK 285

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GGS+LC++ YC R+R AARS NTPDSS  NLGFRCA
Sbjct: 286 GGSFLCHKSYCNRYRVAARSSNTPDSSTSNLGFRCA 321


>gi|171848817|pdb|2Q17|A Chain A, Formylglycine Generating Enzyme From Streptomyces
           Coelicolor
 gi|171848818|pdb|2Q17|B Chain B, Formylglycine Generating Enzyme From Streptomyces
           Coelicolor
 gi|171848819|pdb|2Q17|C Chain C, Formylglycine Generating Enzyme From Streptomyces
           Coelicolor
 gi|171848820|pdb|2Q17|D Chain D, Formylglycine Generating Enzyme From Streptomyces
           Coelicolor
 gi|171848821|pdb|2Q17|E Chain E, Formylglycine Generating Enzyme From Streptomyces
           Coelicolor
          Length = 346

 Score =  267 bits (683), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 149/339 (43%), Positives = 186/339 (54%), Gaps = 53/339 (15%)

Query: 13  KDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  V LPG  F MG         DGE P   V L  F++D+  V+N +F  FV ATG+VT
Sbjct: 56  RGQVRLPGGEFAMGDAFGEGYPADGETPVHTVRLRPFHIDETAVTNARFAAFVKATGHVT 115

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           +AE+FG + VF  +++  +   +                                     
Sbjct: 116 DAERFGSSAVFHLVVAAPDADVLGSA---------------------------------- 141

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                     A W    RG           W  PEG  S I  R NHPVVHVSWNDA AY
Sbjct: 142 --------AGAPWWINVRGA---------HWRRPEGARSDITGRPNHPVVHVSWNDATAY 184

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G RLPTEAEWEY  RGGL  R + WG+ LTP G  R N+WQG FP  NTA DG+LS
Sbjct: 185 ARWAGKRLPTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLS 244

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
           TAPV SY+ N  GL+N  GNVWEW +DW++  ++  +P+ +P GP TG  +V +GGSYLC
Sbjct: 245 TAPVKSYRPNGHGLWNTAGNVWEWCSDWFSPTYYAESPTVDPHGPGTGAARVLRGGSYLC 304

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
           ++ YC R+R AARS NTPDSS+GNLGFRCA D   T+G+
Sbjct: 305 HDSYCNRYRVAARSSNTPDSSSGNLGFRCANDADLTSGS 343


>gi|289767033|ref|ZP_06526411.1| conserved hypothetical protein [Streptomyces lividans TK24]
 gi|289697232|gb|EFD64661.1| conserved hypothetical protein [Streptomyces lividans TK24]
          Length = 329

 Score =  267 bits (683), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 149/339 (43%), Positives = 186/339 (54%), Gaps = 53/339 (15%)

Query: 13  KDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  V LPG  F MG         DGE P   V L  F++D+  V+N +F  FV ATG+VT
Sbjct: 39  RGQVRLPGGEFAMGDAFGEGYPADGETPVHTVRLRPFHIDETAVTNARFAAFVKATGHVT 98

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           +AE+FG + VF  +++  +   +                                     
Sbjct: 99  DAERFGSSAVFHLVVAAPDADVLGNA---------------------------------- 124

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                     A W    RG           W  PEG  S I  R NHPVVHVSWNDA A+
Sbjct: 125 --------AGAPWWINVRGA---------HWRRPEGARSDITGRPNHPVVHVSWNDATAF 167

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G RLPTEAEWEY  RGGL  R + WG+ LTP G  R N+WQG FP  NTA DG+LS
Sbjct: 168 ARWAGKRLPTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLS 227

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
           TAPV SY+ N  GL+N  GNVWEW +DW++  ++  +P+ +P GP TG  +V +GGSYLC
Sbjct: 228 TAPVKSYRPNGHGLWNTAGNVWEWCSDWFSPTYYAESPTVDPHGPGTGAARVLRGGSYLC 287

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
           ++ YC R+R AARS NTPDSS+GNLGFRCA D G T G+
Sbjct: 288 HDSYCNRYRVAARSSNTPDSSSGNLGFRCANDAGLTYGS 326


>gi|403069090|ref|ZP_10910422.1| non-specific serine/threonine protein kinase [Oceanobacillus sp.
           Ndiop]
          Length = 308

 Score =  267 bits (682), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 147/334 (44%), Positives = 182/334 (54%), Gaps = 66/334 (19%)

Query: 11  RYKD-MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           RY+D MV LPG    MGT +K     D E P RNV ++ FY+D H VSN +F+ FV  T 
Sbjct: 38  RYQDKMVHLPGGDLLMGTEDKEGFPSDREGPIRNVKVEPFYIDNHAVSNREFKRFVDETN 97

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y T AEKFG +FVF   L+ E+     Q++                              
Sbjct: 98  YQTVAEKFGWSFVFYEFLTLEQLNDEEQLQ------------------------------ 127

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                       +  W Y  +            W  PEG  STIE R++HPVVHVSW DA
Sbjct: 128 ------------DVPWWYAVKDAF---------WYQPEGKGSTIERRLDHPVVHVSWYDA 166

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            A+C W G RLPTE EWE+  RGGLE + +PWG+ LTP G+H  N+WQG+FP  N+  DG
Sbjct: 167 TAFCDWAGKRLPTEREWEFAARGGLEQKKYPWGDELTPDGKHFCNIWQGKFPQYNSKEDG 226

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
           YL TAP  S+  N +GLYNM GNVWEW +DW+N  +              T K  +GGSY
Sbjct: 227 YLGTAPAKSFPPNNYGLYNMSGNVWEWCSDWFNQDNQ-------------TQKSMRGGSY 273

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           LC++ YC R+R AARS NT DSSAGN+GFRC  D
Sbjct: 274 LCHQSYCNRYRVAARSSNTMDSSAGNIGFRCVRD 307


>gi|297191055|ref|ZP_06908453.1| sulfatase modifying factor 1 [Streptomyces pristinaespiralis ATCC
           25486]
 gi|197723248|gb|EDY67156.1| sulfatase modifying factor 1 [Streptomyces pristinaespiralis ATCC
           25486]
          Length = 320

 Score =  266 bits (681), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 153/343 (44%), Positives = 188/343 (54%), Gaps = 55/343 (16%)

Query: 4   LPAPPVER--YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           +P P   R   +  V LPG  F MG         DGE P   V L+ F++D+  V+N QF
Sbjct: 24  VPGPAAGRRSLRGQVRLPGGEFAMGDAFGEGYPADGETPVHPVRLEPFHIDETAVTNAQF 83

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
             FV  TG++T+AE++G + VF                                  H VV
Sbjct: 84  ATFVKDTGHITDAERYGSSAVF----------------------------------HLVV 109

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
             +  D +             A W    RG           W  PEG  S I  R NHPV
Sbjct: 110 AAANADILGNAA--------GAPWWINVRGA---------HWRRPEGARSDITDRQNHPV 152

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           VHVSWNDAVAY  W G RLPTEAEWEY  RGGL  R + WG+ LTP G  R N+WQG FP
Sbjct: 153 VHVSWNDAVAYARWAGKRLPTEAEWEYAARGGLAGRRYAWGDELTPDGRWRCNIWQGRFP 212

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGT 299
             NTA DG+L+TAPV +Y+ N FGL+N  GNVWEW  DW++  ++  AP  +P GP TGT
Sbjct: 213 HTNTAEDGHLTTAPVKAYRPNGFGLWNTAGNVWEWCTDWFSPTYYAEAPPDDPCGPQTGT 272

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            +V +GGSYLC++ YC R+R AARS NTP+SS+GNLGFRCA D
Sbjct: 273 ARVMRGGSYLCHDSYCNRYRVAARSSNTPESSSGNLGFRCAND 315


>gi|357394061|ref|YP_004908902.1| putative sulfatase-modifying factor [Kitasatospora setae KM-6054]
 gi|311900538|dbj|BAJ32946.1| putative sulfatase-modifying factor [Kitasatospora setae KM-6054]
          Length = 344

 Score =  266 bits (680), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 150/333 (45%), Positives = 180/333 (54%), Gaps = 58/333 (17%)

Query: 13  KDMVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           + ++ LPG  FRMG   P  I  DGE P R V L  F +    VSN +F  FV  TGYVT
Sbjct: 64  RGLLELPGGAFRMGGQDPDGIAGDGEGPVREVLLGPFAIAATTVSNAEFASFVRETGYVT 123

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           +AE+FG +FVFE  L E  RA+  +V                          W  AV   
Sbjct: 124 DAERFGSSFVFEGFLPERLRARSPRV----------------------AATPWWRAVEGA 161

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            WR                              PEG  S    R  HPVVHVSWNDA AY
Sbjct: 162 AWR-----------------------------TPEGPGSGFAARQQHPVVHVSWNDAQAY 192

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           C W G RLPTEA+WEY  RGG E   +PWG+ L P G    N+WQG+FPT NT    + S
Sbjct: 193 CAWSGTRLPTEAQWEYAARGGSEGTRYPWGDELAPGGRELLNIWQGDFPTRNTGR--HRS 250

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGTDKVKKGGS 307
           TAPV SY+ N  GLYN++GNVWEW AD ++  HH    PA   +P GP TGT +V +GGS
Sbjct: 251 TAPVRSYRPNGHGLYNVLGNVWEWCADHFSPDHHRPDTPATRQDPLGPPTGTARVMRGGS 310

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           ++C+  YC R+R AARS NTPDSS+GN+GFR A
Sbjct: 311 HMCHASYCNRYRLAARSSNTPDSSSGNIGFRVA 343


>gi|209551961|ref|YP_002283877.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
 gi|209539554|gb|ACI59485.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
          Length = 319

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 151/327 (46%), Positives = 179/327 (54%), Gaps = 54/327 (16%)

Query: 14  DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           D V LPG +F MG T    L  DGE P   VTL  F +D   V+N  F  FV+ATGYVTE
Sbjct: 41  DEVPLPGGSFEMGDTFGEGLESDGEGPVHAVTLSPFRIDTTCVTNDHFARFVAATGYVTE 100

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D+ VF P ++ +                                           
Sbjct: 101 AERLSDSAVFAPFVAAD------------------------------------------- 117

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                 P +    YG RG L  R      W HP G  S IE    HPVVH+S  DA AYC
Sbjct: 118 ------PADITGGYGARGWLAVRG---ADWRHPFGRYSAIEDSGLHPVVHISHEDAQAYC 168

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWEY  R G   R FPWGN L P G H ANVWQG FPT NT ADG+L+T
Sbjct: 169 HWAGRSLPTEAEWEYAARAGHAGRRFPWGNELEPEGIHMANVWQGHFPTRNTGADGWLAT 228

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           APV +Y  N FGLY+M+GNVWEW ADW+N  ++  +P+++P G  TG  +V +GGSYLC+
Sbjct: 229 APVGTYFPNDFGLYDMIGNVWEWCADWYNRDYYRNSPAHDPAGAATGDRRVTRGGSYLCH 288

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFR 338
             YC R+R AAR  NTPDS+A NLGFR
Sbjct: 289 RSYCARYRVAARFSNTPDSTACNLGFR 315


>gi|323456776|gb|EGB12642.1| hypothetical protein AURANDRAFT_18589 [Aureococcus anophagefferens]
          Length = 352

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 143/333 (42%), Positives = 188/333 (56%), Gaps = 54/333 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +++ PG+ FRMGT++P +  DGE PSR V L  AF +D +E +N  F  FV ATGYVT++
Sbjct: 66  VLIQPGE-FRMGTDRPRIFVDGEAPSRRVALSRAFLMDVYEATNDDFAAFVDATGYVTDS 124

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   LSE  RA I +                      V  + W   V    W
Sbjct: 125 ERFGWSFVFHLELSEARRAAIDKA---------------------VDGVEWWLPVNGSDW 163

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R  + P                           G D     R  HPVV VSW+DA AYC 
Sbjct: 164 RSPQGP---------------------------GSDVFAAKRGAHPVVQVSWHDAAAYCA 196

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTEAEWEY  R G    ++PWGN LTP G +RAN+WQG+FP +NTA DG+    
Sbjct: 197 WRGGRLPTEAEWEYAARDGRSQTMYPWGNELTPGGGYRANLWQGDFPRSNTADDGFSFAG 256

Query: 254 PVMSYK-ENKFGLYNMVGNVWEWTADWW---NVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
           PV SY  +   G+++++GNVWEW +DWW       H A   +P GP+ G +K+KKGGS+L
Sbjct: 257 PVGSYAPQTATGIHDLIGNVWEWVSDWWYEPEASAHDAVLRDPAGPSAGGEKLKKGGSFL 316

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C++ YC+R+R AAR +N+PDS++ N G RC  D
Sbjct: 317 CHKSYCFRYRSAARHKNSPDSASSNNGMRCVRD 349


>gi|418466952|ref|ZP_13037853.1| hypothetical protein SMCF_743 [Streptomyces coelicoflavus ZG0656]
 gi|371552421|gb|EHN79668.1| hypothetical protein SMCF_743 [Streptomyces coelicoflavus ZG0656]
          Length = 278

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 148/325 (45%), Positives = 183/325 (56%), Gaps = 57/325 (17%)

Query: 20  GDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDT 79
           GD F  G        DGE P   V L  F++D+  V+N +F  FV ATG+VT+AE+FG +
Sbjct: 2   GDAFGEG-----YPADGETPVHAVRLKPFHIDETAVTNARFAAFVKATGHVTDAERFGSS 56

Query: 80  FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP 139
            VF  +++                              P   +  N A            
Sbjct: 57  AVFHLVVAA-----------------------------PGADVLGNAA------------ 75

Query: 140 TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARL 199
             A W    RG           W  PEG  S I  R NHPVVHVSWNDA AY  W G RL
Sbjct: 76  -GAPWWINVRGA---------HWRRPEGARSDITGRQNHPVVHVSWNDATAYTRWAGKRL 125

Query: 200 PTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK 259
           PTEAEWEY  RGGL  R + WG+ LTP G  R N+WQG FP  NTA DG+L+TAPV SY+
Sbjct: 126 PTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLTTAPVKSYR 185

Query: 260 ENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRH 318
            N  GL+N  GNVWEW ADW++  ++  +P+ +P+GP TGT +V +GGSYLC++ YC R+
Sbjct: 186 PNGHGLWNTAGNVWEWCADWFSPTYYADSPTTDPRGPATGTTRVLRGGSYLCHDSYCNRY 245

Query: 319 RCAARSQNTPDSSAGNLGFRCAADK 343
           R AARS NTPDSS+GNLGFRCA D+
Sbjct: 246 RVAARSSNTPDSSSGNLGFRCANDE 270


>gi|291222787|ref|XP_002731394.1| PREDICTED: sulfatase modifying factor 1-like isoform 2
           [Saccoglossus kowalevskii]
          Length = 258

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 121/181 (66%), Positives = 137/181 (75%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W HPEG D+ I  RM+HPV+HVSWNDAV +C W G RLPTEAE+EY  RGG ENR +PWG
Sbjct: 63  WKHPEGPDTNITDRMDHPVLHVSWNDAVEFCKWIGKRLPTEAEFEYASRGGKENRTYPWG 122

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
           N L P G+   NVWQG FPT N+A DG+  T+PVM+Y  N FGL+NMVGN WEWT DWW 
Sbjct: 123 NRLRPNGKWLMNVWQGVFPTENSAEDGFAGTSPVMAYPPNGFGLHNMVGNAWEWTNDWWG 182

Query: 282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
             H      NPKGP +G DKVKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLGFRC A
Sbjct: 183 TSHTADLYVNPKGPASGIDKVKKGGSYMCHASYCYRYRCAARSQNTPDSSASNLGFRCFA 242

Query: 342 D 342
           D
Sbjct: 243 D 243


>gi|390958068|ref|YP_006421825.1| hypothetical protein Terro_2226 [Terriglobus roseus DSM 18391]
 gi|390958409|ref|YP_006422166.1| hypothetical protein Terro_2590 [Terriglobus roseus DSM 18391]
 gi|390412986|gb|AFL88490.1| hypothetical protein Terro_2226 [Terriglobus roseus DSM 18391]
 gi|390413327|gb|AFL88831.1| hypothetical protein Terro_2590 [Terriglobus roseus DSM 18391]
          Length = 321

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 151/333 (45%), Positives = 180/333 (54%), Gaps = 52/333 (15%)

Query: 13  KDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +DMVLLPG  F MG+        DGE P R VT+DAF++D+  V N  F  FV AT YVT
Sbjct: 38  EDMVLLPGGRFLMGSESADSFPDDGEGPVRPVTVDAFWMDRFVVRNRDFAPFVDATQYVT 97

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAE+ G +FVF   L        S   H                                
Sbjct: 98  EAERIGWSFVFAGDLPPNADGSTSVQVHG------------------------------- 126

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                     AEW  G  G          +W HPEG  S I  R  HPVVHVSWNDA AY
Sbjct: 127 ----------AEWWQGIEGA---------TWSHPEGPSSDISTRAEHPVVHVSWNDAAAY 167

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G RLPTEAEWE+  RGGLE   +PWG++LTP G+H  N+WQG+FPT+NTA DG+ S
Sbjct: 168 AAWAGKRLPTEAEWEFAARGGLEQAAYPWGDDLTPGGKHLCNIWQGDFPTSNTAEDGFAS 227

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
           T P  ++  N +GL+ M GN WEW ADW +   H  A  +NP GP  G  +V KGGSYLC
Sbjct: 228 TCPADAFPPNGYGLFGMTGNTWEWIADWSHPTWHQHATRHNPLGPPEGVARVLKGGSYLC 287

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +  YC R+R  ARS NTPDS+  N+ FRC  DK
Sbjct: 288 HRSYCNRYRVGARSSNTPDSATTNISFRCVRDK 320


>gi|403529625|ref|YP_006664364.1| formylglycine-generating enzyme [Arthrobacter sp. Rue61a]
 gi|403231905|gb|AFR31326.1| putative formylglycine-generating enzyme [Arthrobacter sp. Rue61a]
          Length = 337

 Score =  260 bits (664), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 147/323 (45%), Positives = 179/323 (55%), Gaps = 57/323 (17%)

Query: 20  GDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDT 79
           GD F  G       +DGE P   VTLDAF +D   V+N  F  FV ATGY TEAE++G +
Sbjct: 58  GDAFGEG-----YPEDGESPVHLVTLDAFRIDATAVTNNMFGTFVQATGYRTEAEQYGSS 112

Query: 80  FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP 139
            VF  L         S   HD                         D +           
Sbjct: 113 AVFHLL---------STANHD-------------------------DVLGTAA------- 131

Query: 140 TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARL 199
             A W    RG           W HP G  S     ++HPVVHVS+NDA+AYC+W G RL
Sbjct: 132 -GAPWWLNVRGA---------DWAHPTGPASDFREFLDHPVVHVSYNDALAYCSWAGRRL 181

Query: 200 PTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK 259
           P+EAEWEY  RGGLE++ + WGN L P GEHR N+WQG FPT NT  DG+L T+PV S+ 
Sbjct: 182 PSEAEWEYAARGGLESKRYAWGNELMPGGEHRCNIWQGTFPTANTLKDGHLGTSPVRSFP 241

Query: 260 ENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRH 318
            N +GLY M GNVWEW ADW+   ++  +P  NPKGPT G  +V +GGSYLC++ YC R+
Sbjct: 242 CNGYGLYEMAGNVWEWCADWFLPKYYRNSPVNNPKGPTIGAGRVMRGGSYLCHDSYCNRY 301

Query: 319 RCAARSQNTPDSSAGNLGFRCAA 341
           R AAR+ NTPDSS+GN GFR  A
Sbjct: 302 RVAARTSNTPDSSSGNCGFRTVA 324


>gi|374313129|ref|YP_005059559.1| sulfatase-modifying factor protein [Granulicella mallensis
           MP5ACTX8]
 gi|358755139|gb|AEU38529.1| Sulphatase-modifying factor protein [Granulicella mallensis
           MP5ACTX8]
          Length = 322

 Score =  259 bits (662), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 155/344 (45%), Positives = 194/344 (56%), Gaps = 52/344 (15%)

Query: 3   LLPAPPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           L+  P +   +  + LPG TF MGTN +     DGE P R V+L AF +D + V+N  F 
Sbjct: 27  LVYTPGLAPNRATIPLPGGTFLMGTNYERGFPADGEGPVRPVSLSAFDIDTYPVTNADFA 86

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
            F++AT Y TEAE FG +FVF   + EE             RFE     +E         
Sbjct: 87  AFIAATNYRTEAEVFGWSFVFWSHIPEE-------------RFE---EVVE--------- 121

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
               D VA   W          W   C+        P  +W HPEG  S ++ R NHPVV
Sbjct: 122 ----DTVAMTPW----------W---CK-------VPGATWNHPEGPGSDVKDRQNHPVV 157

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
           HVSWNDA AY  W G  LPTEA+WEY  RGGLE +L+PWG+ LTP+G H  N+WQG+FP 
Sbjct: 158 HVSWNDAAAYAAWAGRALPTEAQWEYAARGGLEQKLYPWGDELTPKGRHLCNIWQGQFPR 217

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK 301
            +TA DGY  + PV S+  N +GLY++ GNVWEW ADW+      + S +P GP  G  K
Sbjct: 218 EDTAEDGYAGSCPVDSFPPNGYGLYSVTGNVWEWCADWFGTTF--STSQDPVGPPVGEAK 275

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           V KGGS+LC+  YC R+R AAR+ NTPDSSA N+GFRC     P
Sbjct: 276 VMKGGSFLCHASYCNRYRVAARTSNTPDSSASNIGFRCVGKAHP 319


>gi|377567711|ref|ZP_09796918.1| putative sulfatase-modifying factor [Gordonia terrae NBRC 100016]
 gi|377535109|dbj|GAB42083.1| putative sulfatase-modifying factor [Gordonia terrae NBRC 100016]
          Length = 271

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 139/308 (45%), Positives = 170/308 (55%), Gaps = 52/308 (16%)

Query: 34  KDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAK 93
           +DGE P   V L AF L    V+N QF  FV  TG+VT AE+FG + VF           
Sbjct: 11  QDGEGPVHEVDLPAFRLAATPVTNAQFATFVKETGHVTTAEQFGVSAVFH---------- 60

Query: 94  ISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 153
                     F G  S I HR+                         +  W    +G   
Sbjct: 61  --------LAFAGDSSAILHRLE------------------------QTPWWLAVKGA-- 86

Query: 154 NRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGL 213
                   W HP G  + I  R NHPVVHVSW DA AYC W G RLPTEAEWEY  RGG 
Sbjct: 87  -------DWKHPHGPGTNISDRQNHPVVHVSWADATAYCDWAGTRLPTEAEWEYAARGGT 139

Query: 214 ENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVW 273
           + R +PWG+ LTP G+ R N+WQG FP  NT  DG+L+TAPV SY+ N FGLY M GNVW
Sbjct: 140 DGRRYPWGDELTPHGQWRCNIWQGSFPVENTEDDGHLTTAPVKSYRSNGFGLYQMAGNVW 199

Query: 274 EWTADWW-NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 332
           EW  DW+ N ++  +   +P GPT G  +V +GGSYLC++ YC R+R +ARS NTP+S+A
Sbjct: 200 EWCQDWFDNTYYERSQRVSPTGPTDGERRVMRGGSYLCHDSYCNRYRISARSSNTPESTA 259

Query: 333 GNLGFRCA 340
            N+GFRCA
Sbjct: 260 ANIGFRCA 267


>gi|374983353|ref|YP_004958848.1| Non-specific serine/threonine protein kinase [Streptomyces
           bingchenggensis BCW-1]
 gi|297154005|gb|ADI03717.1| Non-specific serine/threonine protein kinase [Streptomyces
           bingchenggensis BCW-1]
          Length = 279

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 171/321 (53%), Gaps = 54/321 (16%)

Query: 26  GTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPL 85
           G +     +DGE P R V L  F +D   V+N QF  FV A+GY T+AE++G ++VF  L
Sbjct: 3   GDDADAFPEDGEGPVRAVRLSPFLIDATAVTNRQFAAFVRASGYRTDAERYGWSYVFYAL 62

Query: 86  LSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWE 145
           +    R                     H +   V    W  AV                E
Sbjct: 63  VHPGAR---------------------HLVRDTVAQAPWWLAV----------------E 85

Query: 146 YGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEW 205
             C             W  P G  S+     NHPVVHVSW DA AY  W G RLPTEAEW
Sbjct: 86  GAC-------------WQAPYGPGSSWTELSNHPVVHVSWRDASAYAAWAGKRLPTEAEW 132

Query: 206 EYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGL 265
           E   RGGLE   FPWG+ L PRG+HR N+WQG FP  NT  DGYL TAPV +++ N FGL
Sbjct: 133 ERAARGGLERSRFPWGDELLPRGQHRCNIWQGRFPQVNTGEDGYLGTAPVKTFRPNNFGL 192

Query: 266 YNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCA 321
           YN  GNVWEW +DWW+   H    P    +P GP  G  KV +GGSYLC+  YC R+R A
Sbjct: 193 YNTSGNVWEWCSDWWSTTWHAADRPETRQDPAGPPAGEAKVIRGGSYLCHASYCNRYRVA 252

Query: 322 ARSQNTPDSSAGNLGFRCAAD 342
           AR+ NTPDSS G++GFRCAAD
Sbjct: 253 ARTSNTPDSSTGHMGFRCAAD 273


>gi|225709012|gb|ACO10352.1| Sulfatase-modifying factor 1 precursor [Caligus rogercresseyi]
          Length = 313

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 141/331 (42%), Positives = 183/331 (55%), Gaps = 68/331 (20%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +   V + G TF  GTN P+ + D E PS   ++  F +D  +V+N +F+ FV  TGYVT
Sbjct: 45  FTSPVFIAGGTFNKGTNTPVFMADLEGPSTPTSVSDFLMDPTQVTNGEFEAFVRDTGYVT 104

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAE +G++FV    LS       S+V+  + +   L                        
Sbjct: 105 EAEGYGNSFVMGYFLS-------SEVKSSVSQSAQL------------------------ 133

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                     A W     G          SW  PEG  S  + + +HPVVHVSWNDA  Y
Sbjct: 134 ----------APWWLKVPGA---------SWREPEGPGS--DFKGDHPVVHVSWNDANHY 172

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           C W G RLP+EAEWEY CR G E+RLFPWGN  +P+G+  AN+W G+FP+ N A DGY S
Sbjct: 173 CDWAGKRLPSEAEWEYACRSGKEDRLFPWGNKWSPKGQALANIWTGDFPSQNDALDGYAS 232

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           T PV  Y  N +GLY+M+GNVWEWT+D W                 G  +VKKGGS++C+
Sbjct: 233 TNPVGFYPPNAYGLYDMIGNVWEWTSDVWR----------------GKSRVKKGGSFMCH 276

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           + +CYR+RCAARSQNT D+SA NLGFRC +D
Sbjct: 277 KSHCYRYRCAARSQNTEDTSAHNLGFRCVSD 307



 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 31/46 (67%), Positives = 40/46 (86%)

Query: 348 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           G  +VKKGGS++C++ +CYR+RCAARSQNT D+SA NLGFRC +DV
Sbjct: 263 GKSRVKKGGSFMCHKSHCYRYRCAARSQNTEDTSAHNLGFRCVSDV 308


>gi|408682275|ref|YP_006882102.1| Sulfatase modifying factor 1 precursor [Streptomyces venezuelae
           ATCC 10712]
 gi|328886604|emb|CCA59843.1| Sulfatase modifying factor 1 precursor [Streptomyces venezuelae
           ATCC 10712]
          Length = 299

 Score =  258 bits (658), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 149/328 (45%), Positives = 181/328 (55%), Gaps = 53/328 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV LPG  F MGT+     + DGE P R V L  F +D   V+N  F ++V+ TGYVTEA
Sbjct: 20  MVCLPGGEFLMGTDDAEGFRGDGEGPVRTVRLRPFLIDTCAVTNDAFAQYVADTGYVTEA 79

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG ++VF   L    R                                          
Sbjct: 80  ERFGWSYVFAGFLPAALR------------------------------------------ 97

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           RGA  P    W  G  G   NR         PEG  ST++ R +HPVVHVSWNDA AY  
Sbjct: 98  RGAYRPERTPWWCGVGGAAWNR---------PEGPGSTLDGRGDHPVVHVSWNDAAAYAA 148

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGGLE + +PWG+ L P G H  N+W+G FP  NTA DG+  TA
Sbjct: 149 WAGKRLPTEAEWEYAARGGLEQKRYPWGDELDPDGAHHCNIWRGTFPAKNTATDGFRGTA 208

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNE 312
           PV +++ N +GLYN  GNVWEW ADWW   H    P  +P GP TGT KV +GGS+LC+ 
Sbjct: 209 PVDAFEPNGYGLYNTSGNVWEWCADWWTTDHPTHRPLADPTGPATGTAKVIRGGSHLCHR 268

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            YC R+R AAR+ NT DS++G+ GFRCA
Sbjct: 269 SYCNRYRVAARTANTSDSTSGHAGFRCA 296


>gi|270008071|gb|EFA04519.1| hypothetical protein TcasGA2_TC016314 [Tribolium castaneum]
          Length = 292

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 132/267 (49%), Positives = 160/267 (59%), Gaps = 49/267 (18%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMVL+   TF MGT+KP+   D E P+RNV++D+FYLD+ EVSN +F EFV  TGY TEA
Sbjct: 63  DMVLIEAATFEMGTDKPVFESDHEGPARNVSVDSFYLDKFEVSNGKFSEFVLKTGYKTEA 122

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FGD+F+FE L+ E ER K    R                                   
Sbjct: 123 EEFGDSFIFEMLVPESEREKYKDFR----------------------------------- 147

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                  +A W    +G          +W HPEG  STI  R++HPV+HVSW DA  YC 
Sbjct: 148 -----AVQAPWWIKMKGV---------TWKHPEGEGSTINDRLDHPVIHVSWTDAKKYCE 193

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           + G RLPTEAEWE  CR GL  +L+PWGN LTP+GEH AN+WQG+FP  NTA DGYLSTA
Sbjct: 194 FVGKRLPTEAEWEMACRAGLRQKLYPWGNKLTPKGEHWANIWQGDFPLTNTAEDGYLSTA 253

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWW 280
           PV S+  NKFGL++M GNVWEWT D W
Sbjct: 254 PVDSFPANKFGLHHMAGNVWEWTQDNW 280


>gi|441506956|ref|ZP_20988884.1| putative sulfatase-modifying factor [Gordonia aichiensis NBRC
           108223]
 gi|441449021|dbj|GAC46845.1| putative sulfatase-modifying factor [Gordonia aichiensis NBRC
           108223]
          Length = 327

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 142/327 (43%), Positives = 180/327 (55%), Gaps = 57/327 (17%)

Query: 15  MVLLPGDTFRMGTNKPIL-IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ LP   F MG + P     DGE P R+V +DAF +  H V+N +F EFV+++ YVT+A
Sbjct: 56  MISLPTTDFWMGDDGPSSHAADGEGPVRSVRVDAFAISAHAVTNAEFDEFVTSSEYVTDA 115

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   L        + +R    R EG                 W  AV   TW
Sbjct: 116 ERFGWSFVFAGFLP-------AALRRASPRPEG---------------TPWWCAVTDATW 153

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R                              PEG  S +  R +HPVVHVSWNDA A+C 
Sbjct: 154 RA-----------------------------PEGPGSDVSERADHPVVHVSWNDARAFCR 184

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGGL    +PWG+ L      R N+W+G FP  NTAADGY STA
Sbjct: 185 WAGVRLPTEAEWEYAARGGLSRARYPWGDELDEDESFRCNIWRGSFPARNTAADGYRSTA 244

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV +++ N FGLYNMVGNVWEWTAD W V H   P   P+     + K  +GGS++C++ 
Sbjct: 245 PVEAFEPNGFGLYNMVGNVWEWTADAWRVDHSGTPHEAPE-----SRKAMRGGSHMCHDS 299

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCA 340
           YC R+R +AR+ N+ D S+GN+GFR A
Sbjct: 300 YCNRYRVSARTSNSADGSSGNIGFRIA 326



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 31/40 (77%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           K  +GGS++C++ YC R+R +AR+ N+ D S+GN+GFR A
Sbjct: 287 KAMRGGSHMCHDSYCNRYRVSARTSNSADGSSGNIGFRIA 326


>gi|25028125|ref|NP_738179.1| hypothetical protein CE1569 [Corynebacterium efficiens YS-314]
 gi|23493409|dbj|BAC18379.1| conserved hypothetical protein [Corynebacterium efficiens YS-314]
          Length = 334

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 145/336 (43%), Positives = 189/336 (56%), Gaps = 53/336 (15%)

Query: 10  ERYKDMVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           E+ +D V LPG  F MG +       DGE P   V L  F ++   V+N +F  F+ ATG
Sbjct: 49  EQSRDAVTLPGGAFHMGDHHGEGYPADGEGPVHEVHLAPFGINVTTVTNAEFGRFIEATG 108

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y T AE++G + VF     + +RA I      +++  G               + W  AV
Sbjct: 109 YTTTAERYGVSAVFYAAF-QGQRADI------LRQVPG---------------VPWWLAV 146

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                +GA                        +W  P G  ST++   +HPVVHVSW+DA
Sbjct: 147 -----KGA------------------------NWQRPNGPGSTLDGLEDHPVVHVSWDDA 177

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           VAYCTW G RLPTEAEWEY  RGGL+   + WG+NL   G    N+WQG FP  NTAADG
Sbjct: 178 VAYCTWAGGRLPTEAEWEYAARGGLQGARYAWGDNLALDGRWNCNIWQGGFPMENTAADG 237

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGS 307
           YL+TAPV +Y  N +GL+ M GNVWEW  DW++  ++  A S NP+GP TG  +V +GGS
Sbjct: 238 YLTTAPVKTYTPNGYGLWQMAGNVWEWCQDWFDAEYYSRASSINPRGPDTGARRVMRGGS 297

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           YLC++ YC R+R AAR+ NTPDS++GN GFRC  D 
Sbjct: 298 YLCHDSYCNRYRVAARNSNTPDSTSGNTGFRCVFDS 333


>gi|326334164|ref|ZP_08200388.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Nocardioidaceae bacterium Broad-1]
 gi|325948050|gb|EGD40166.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Nocardioidaceae bacterium Broad-1]
          Length = 326

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 184/341 (53%), Gaps = 57/341 (16%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
           PA P +R + +V LPG T+  G +       DGE P   V L  F LD   V+N +F  F
Sbjct: 24  PALPRKRRRTLVDLPGGTYAAGDSFGEGYQADGEGPVHQVELPGFRLDATAVTNEEFARF 83

Query: 64  VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
           V ATG+VT AE+ G + VF      + R  +                             
Sbjct: 84  VKATGHVTTAEREGFSAVFHLAFEGDPRDVL----------------------------- 114

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHV 183
                        R P EA W  G RG          SW  PEG  S +  R NHPVVHV
Sbjct: 115 ------------GRSP-EAPWWLGVRGA---------SWRTPEGPGSNVGRRGNHPVVHV 152

Query: 184 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
           S +DA+AYC W G RLPTEAEWEY  RGG     FPWG++L   G H ANVWQG FP  N
Sbjct: 153 SHDDALAYCDWAGTRLPTEAEWEYAARGGRAGSRFPWGDDLEQDGTHHANVWQGVFPDTN 212

Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTG 298
           TA DG+++TAPV +Y  N FGL+ M+GNVWEW ADW++  ++          +P+GP  G
Sbjct: 213 TADDGFVTTAPVRTYDPNGFGLFQMIGNVWEWCADWFSPTYYNDLVEAGQPSDPRGPAEG 272

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
             +V +GGSYLC++ YC+R+R +ARS + PDS+AGNLGFRC
Sbjct: 273 LTRVMRGGSYLCHDSYCHRYRLSARSSSPPDSTAGNLGFRC 313


>gi|373850102|ref|ZP_09592903.1| Sulphatase-modifying factor protein [Opitutaceae bacterium TAV5]
 gi|372476267|gb|EHP36276.1| Sulphatase-modifying factor protein [Opitutaceae bacterium TAV5]
          Length = 324

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 150/332 (45%), Positives = 183/332 (55%), Gaps = 54/332 (16%)

Query: 15  MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M LLPG  F MG  +      DGE P   V LD F++D   V+N QF +FV ATGY TEA
Sbjct: 41  MRLLPGGVFLMGNERDYGFPSDGEGPVHEVKLDPFWIDATTVTNRQFCDFVDATGYRTEA 100

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   LS ++ A   ++                                    
Sbjct: 101 ERFGWSFVFYGHLSGKQLASALRL------------------------------------ 124

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                 T A  E+ CR  +E       +W HPEG  S I  R  HPVV VSWNDAVAY +
Sbjct: 125 ------TVAGSEWWCR--VEG-----ATWRHPEGPGSHIRKRWQHPVVQVSWNDAVAYAS 171

Query: 194 WRGARLPTEAEWEYGCRGGL--ENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           W G RLPTEAEWEY  R GL   NR FPWG+ L P G H  NVWQG+FP  NT ADGY  
Sbjct: 172 WAGKRLPTEAEWEYAARSGLPHANR-FPWGDELEPGGRHMMNVWQGDFPVRNTEADGYYG 230

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            AP  S++ + +GLYNM GNVWEW  D ++   +  +P+ NP GPT+G  +  +GGSYLC
Sbjct: 231 VAPARSFRASAWGLYNMTGNVWEWCHDRFSGTWYARSPAENPAGPTSGERRSMRGGSYLC 290

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +  YC R+R  AR+ NTPDS+  NLGFRC  D
Sbjct: 291 HASYCNRYRVDARNSNTPDSATTNLGFRCVRD 322


>gi|431913504|gb|ELK15179.1| Sulfatase-modifying factor 1 [Pteropus alecto]
          Length = 355

 Score =  254 bits (650), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 128/278 (46%), Positives = 163/278 (58%), Gaps = 51/278 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MGT+ P +  DGE P+R VT+DAFY+D +EVSN +F++FV++TGY+TE  
Sbjct: 82  MVLIPAGVFTMGTDDPQIKPDGEAPARRVTMDAFYMDAYEVSNAEFEKFVNSTGYLTEVT 141

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
             G                        +   G  +++      P   +S           
Sbjct: 142 GPG------------------------RGRTGAAASVHEPRTAPATCVS----------- 166

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  A W    +G           W HPEG DST+ HR +HPV+HVSWNDA AYC W
Sbjct: 167 -------APWWLPVKGA---------DWRHPEGPDSTVVHRPDHPVLHVSWNDAAAYCAW 210

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEYGCRGGL NRLFPWGN L P+G+H AN+WQGEFP  ++  DG+  TAP
Sbjct: 211 AGKRLPTEAEWEYGCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTDSGEDGFRGTAP 270

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
           V ++  N +GLYN+VGNVWEWT+DWW VHH    ++NP
Sbjct: 271 VDAFPPNAYGLYNVVGNVWEWTSDWWTVHHTVEETFNP 308


>gi|391227612|ref|ZP_10263819.1| hypothetical protein OpiT1DRAFT_00096 [Opitutaceae bacterium TAV1]
 gi|391223105|gb|EIQ01525.1| hypothetical protein OpiT1DRAFT_00096 [Opitutaceae bacterium TAV1]
          Length = 324

 Score =  254 bits (648), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 151/335 (45%), Positives = 181/335 (54%), Gaps = 60/335 (17%)

Query: 15  MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M LLPG  F MG  +      DGE P   V LD F++D   V+N QF +FV ATGY TEA
Sbjct: 41  MRLLPGGVFLMGNERDYGFPSDGEGPVHEVKLDPFWIDATTVTNRQFCDFVDATGYRTEA 100

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   LS ++ A   ++                                    
Sbjct: 101 ERFGWSFVFYGHLSGKQLASALRL------------------------------------ 124

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                 T A  E+ CR  +E       +W HPEG  S I  R  HPVV VSWNDAVAY +
Sbjct: 125 ------TVAGSEWWCR--VEG-----ATWRHPEGPGSHIRKRWQHPVVQVSWNDAVAYAS 171

Query: 194 WRGARLPTEAEWEYGCRGGL--ENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           W G RLPTEAEWEY  R GL   NR FPWG+ L P G H  NVWQG+FP  NT ADGY  
Sbjct: 172 WAGKRLPTEAEWEYAARSGLPHANR-FPWGDELEPGGRHMMNVWQGDFPVRNTEADGYYG 230

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTAD----WWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
            AP  S++ + +GLYNM GNVWEW  D     W  H   +P+ NP GP +G  +  +GGS
Sbjct: 231 VAPARSFRASAWGLYNMTGNVWEWCHDRFSGTWYAH---SPAENPAGPASGERRSMRGGS 287

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YLC+  YC R+R  AR+ NTPDS+  NLGFRC  D
Sbjct: 288 YLCHASYCNRYRVDARNSNTPDSATTNLGFRCVRD 322


>gi|408682476|ref|YP_006882303.1| Sulfatase modifying factor 1 precursor [Streptomyces venezuelae
           ATCC 10712]
 gi|328886805|emb|CCA60044.1| Sulfatase modifying factor 1 precursor [Streptomyces venezuelae
           ATCC 10712]
          Length = 395

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 154/349 (44%), Positives = 185/349 (53%), Gaps = 73/349 (20%)

Query: 1   MVLLPAP------PVERYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQH 53
           + L PAP      P    + ++ LPG  F MGT+ P     DGE P R   +  F +   
Sbjct: 107 LALAPAPRGHVADPERAARRLIDLPGGRFLMGTDDPEGFPADGEGPVREAEVGPFRIAPT 166

Query: 54  EVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
            V+N QF  FV ATG+VTEAE FG +FVF   LS+E  A    V                
Sbjct: 167 TVTNAQFAAFVEATGHVTEAEHFGFSFVFGGFLSDEVAAASPSV---------------- 210

Query: 114 RMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIE 173
                   + W  AV   TWR                             HPEG  S++ 
Sbjct: 211 ------AAVPWWRAVTGATWR-----------------------------HPEGAGSSLA 235

Query: 174 HRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
            R +HPVVHVSWNDA AYC W G RLPTEAEWEY  RGGLE R +PWG++L P G H  N
Sbjct: 236 TRQDHPVVHVSWNDAQAYCAWSGTRLPTEAEWEYAARGGLEQRRYPWGDDLAPDGRHMCN 295

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK 293
           +W+G+FPT NTA DG+  TAP  S++ N FGLYN VGNVWEW AD +             
Sbjct: 296 IWRGDFPTLNTAEDGWAGTAPAKSFRPNGFGLYNAVGNVWEWCADRF------------- 342

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                T +V +GGSYLC++ YC R+R AARS NTPDSS GN+GFR AAD
Sbjct: 343 --ARDTGRVMRGGSYLCHDSYCNRYRVAARSSNTPDSSTGNIGFRVAAD 389


>gi|371778242|ref|ZP_09484564.1| sulfatase-modifying factor protein [Anaerophaga sp. HS1]
          Length = 372

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 143/342 (41%), Positives = 184/342 (53%), Gaps = 48/342 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMVL+PG  F MG  +    K  EFP   V +D+F +D+H V+N QF+EFV ATGYVT A
Sbjct: 64  DMVLIPGGEFEMGAKENQFAKPDEFPVHKVYVDSFLMDRHPVTNAQFREFVEATGYVTTA 123

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D           +  K        K  + L          P   +S +D +A+  W
Sbjct: 124 ERRPDW----------QEIKKQLPPGTPKPPDSLLVPASLVFKSPDHEVSLSDYIAWWEW 173

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  +W HP G  S+IE   +HPVVHVSW DA+AY T
Sbjct: 174 -----------------------VPGANWRHPLGPGSSIEGLDDHPVVHVSWYDAMAYAT 210

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG  + ++PWGN   P    RAN WQG FP  N   DG+  TA
Sbjct: 211 WAGKRLPTEAEWEYAARGGNNDYIYPWGNE--PVSAQRANFWQGNFPYKNVVEDGFQRTA 268

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTD-------- 300
           PV S+  N +GLY+M GNVWEWTADW++  ++        + NP+GP T  D        
Sbjct: 269 PVGSFPANGYGLYDMAGNVWEWTADWYHPDYYAFLAQQVIALNPQGPETSFDPMEPGVPK 328

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           KV +GGS+LCN+ YC  +R +AR +++PDS A +LGFRC  D
Sbjct: 329 KVIRGGSFLCNDSYCAGYRSSARMKSSPDSGALHLGFRCVKD 370


>gi|409357737|ref|ZP_11236107.1| hypothetical protein Dali7_07741 [Dietzia alimentaria 72]
          Length = 313

 Score =  250 bits (639), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 148/348 (42%), Positives = 183/348 (52%), Gaps = 65/348 (18%)

Query: 3   LLPAPPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           ++P     R +   L+ G +F MG +       DGE P   V L  F +D+  V+N QF 
Sbjct: 16  VVPGAGAHRRRRQSLIAGGSFVMGDHFDEGYPDDGERPLHEVRLSPFRMDETAVTNAQFA 75

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
            F   TGYVTEAE+ G + VF  L  +  RA +            LD+T           
Sbjct: 76  TFCKDTGYVTEAEQLGVSAVFH-LAVQASRADV---------LHQLDTT----------- 114

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
                                 W    RG           W HP G  S+I    NHPVV
Sbjct: 115 ---------------------PWWVAVRGA---------DWRHPGGPASSITDLQNHPVV 144

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
           HVSWNDAVAYC W G RLPTEAEWEY  RGGL    FPWG+ LTPRG    NVWQG+FPT
Sbjct: 145 HVSWNDAVAYCAWAGTRLPTEAEWEYAARGGLAGARFPWGDELTPRGRWNLNVWQGDFPT 204

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-----WWNVH-----HHPAPSYN 291
           +NT  DG+L+TAP  +++ N +GL+ MVGNVWEW +D     W+ V        PAP   
Sbjct: 205 HNTLEDGHLTTAPARAFRPNGYGLHQMVGNVWEWCSDTFDPGWYAVSGVDDPSGPAPGEG 264

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           P+G   G  +V +GGSYL ++ YCYR+R AARS NT DS+ GN+GFRC
Sbjct: 265 PEG---GAWRVMRGGSYLGHDSYCYRYRVAARSANTADSATGNIGFRC 309


>gi|408533502|emb|CCK31676.1| Sulfatase-modifying factor 1 [Streptomyces davawensis JCM 4913]
          Length = 314

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 150/334 (44%), Positives = 177/334 (52%), Gaps = 54/334 (16%)

Query: 8   PVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           P      MVLLPG  F MG  +      DGE P R V L  F +D   V N +F  FV+ 
Sbjct: 30  PTAATDGMVLLPGGEFLMGAEDSEGFPDDGEGPVRAVRLAPFRIDACAVRNDEFAAFVAD 89

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TG+VT+AE+   ++VF   L    R                                   
Sbjct: 90  TGHVTDAERLDWSYVFAGFLPAALR----------------------------------- 114

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
                  RGA  P    W     G          +W  PEG  S +  R +HPVVHVSW 
Sbjct: 115 -------RGAPRPERTPWWCAVTGA---------AWDRPEGPGSDLAGRGDHPVVHVSWY 158

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA AY  W G RLPTEAEWEY  RGGLE R +PWG+ L P GEHR N+W+G FPT NTAA
Sbjct: 159 DAAAYAAWAGKRLPTEAEWEYAARGGLEQRRYPWGDELDPDGEHRCNIWRGTFPTKNTAA 218

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
           DGY  TAPV +++ N FGLYN  GNVWEW  DWW   H    S +P GP  GT KV +GG
Sbjct: 219 DGYRGTAPVDAFEPNGFGLYNTSGNVWEWCGDWWTTEHR--GSVDPTGPAEGTAKVIRGG 276

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           S++C+  YC R+R AARS NTPDSS+G+ GFRCA
Sbjct: 277 SHMCHASYCNRYRVAARSTNTPDSSSGHTGFRCA 310


>gi|331694601|ref|YP_004330840.1| sulfatase-modifying factor protein [Pseudonocardia dioxanivorans
           CB1190]
 gi|326949290|gb|AEA22987.1| Sulphatase-modifying factor protein [Pseudonocardia dioxanivorans
           CB1190]
          Length = 302

 Score =  248 bits (633), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 144/328 (43%), Positives = 177/328 (53%), Gaps = 65/328 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +V L G  F MG+++    +DGE P+R   +  F +  H VSN  F  FV ATGY+T AE
Sbjct: 39  LVALAGGEFTMGSDEQRYPEDGEGPTRVARVAPFGIAAHAVSNDDFAAFVDATGYLTTAE 98

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           + G +FVF  LL +              RF    S         VV   W        WR
Sbjct: 99  REGWSFVFVGLLPD--------------RFPPTRS---------VVGAPW--------WR 127

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L                       W  PEG  S +E R +HPVVHVSW DA  YC W
Sbjct: 128 QVHL---------------------ADWRRPEGPRSDLEGRGDHPVVHVSWLDAREYCRW 166

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLP+E EWEY  RGGL+   +PWG+ LTP G+HR NV+QG FP ++T ADGY  TAP
Sbjct: 167 AGGRLPSETEWEYAARGGLDGARYPWGDELTPGGQHRMNVFQGRFPAHDTGADGYRGTAP 226

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++  N +GL+NM GNVWEW AD +             GP     +  +GGSYLC+E Y
Sbjct: 227 VHAFAPNGYGLHNMTGNVWEWCADRF-------------GPGRPGQRNMRGGSYLCHESY 273

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+R+RCAARS NTPDSSAGN+GFR AAD
Sbjct: 274 CWRYRCAARSANTPDSSAGNIGFRMAAD 301



 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 37/60 (61%), Positives = 44/60 (73%)

Query: 333 GNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           GN+   CA   GP     +  +GGSYLC+E YC+R+RCAARS NTPDSSAGN+GFR AAD
Sbjct: 242 GNVWEWCADRFGPGRPGQRNMRGGSYLCHESYCWRYRCAARSANTPDSSAGNIGFRMAAD 301


>gi|259507183|ref|ZP_05750083.1| sulfatase-modifying factor 1 [Corynebacterium efficiens YS-314]
 gi|259165223|gb|EEW49777.1| sulfatase-modifying factor 1 [Corynebacterium efficiens YS-314]
          Length = 271

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 137/310 (44%), Positives = 174/310 (56%), Gaps = 52/310 (16%)

Query: 35  DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
           DGE P   V L  F ++   V+N +F  F+ ATGY T AE++G + VF         A  
Sbjct: 12  DGEGPVHEVHLAPFGINVTTVTNAEFGRFIEATGYTTTAERYGVSAVF--------YAAF 63

Query: 95  SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
              R D+ R               V  + W  AV     +GA                  
Sbjct: 64  QGQRADILR--------------QVPGVPWWLAV-----KGA------------------ 86

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
                 +W  P G  ST++   +HPVVHVSW+DAVAYCTW G RLPTEAEWEY  RGGL+
Sbjct: 87  ------NWQRPNGPGSTLDGLEDHPVVHVSWDDAVAYCTWAGGRLPTEAEWEYAARGGLQ 140

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
              + WG+NL   G    N+WQG FP  NTAADGYL+TAPV +Y  N +GL+ M GNVWE
Sbjct: 141 GARYAWGDNLALDGRWNCNIWQGGFPMENTAADGYLTTAPVKTYTPNGYGLWQMAGNVWE 200

Query: 275 WTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
           W  DW++  ++  A S NP+GP TG  +V +GGSYLC++ YC R+R AAR+ NTPDS++G
Sbjct: 201 WCQDWFDAEYYSRASSINPRGPDTGARRVMRGGSYLCHDSYCNRYRVAARNSNTPDSTSG 260

Query: 334 NLGFRCAADK 343
           N GFRC  D 
Sbjct: 261 NTGFRCVFDS 270


>gi|116670275|ref|YP_831208.1| hypothetical protein Arth_1714 [Arthrobacter sp. FB24]
 gi|116610384|gb|ABK03108.1| protein of unknown function DUF323 [Arthrobacter sp. FB24]
          Length = 301

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 139/333 (41%), Positives = 174/333 (52%), Gaps = 54/333 (16%)

Query: 16  VLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           V +P   F MG         DGE P   V LD+F++D   V+N  F  F++A+GY TEAE
Sbjct: 20  VPIPAGAFSMGDAYGEGYPADGEVPVHRVRLDSFHIDATAVTNEMFAAFIAASGYRTEAE 79

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            +G + VF  LL   +   +                                        
Sbjct: 80  LYGSSAVFHLLLDAPKEDVLG--------------------------------------- 100

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L   A W    RG           W HP G ++  E    HPVVHVS NDA+AYC W
Sbjct: 101 ---LAAGAPWWVNVRGA---------DWAHPGGRNTNWESLRGHPVVHVSHNDALAYCRW 148

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLSTA 253
            G RLPTEAEWEY  RGGL  R + WGN L    G+H  N+WQG+FP +NT+ DGYL TA
Sbjct: 149 AGRRLPTEAEWEYAARGGLGGRRYVWGNELLGSAGQHLCNIWQGQFPVSNTSEDGYLGTA 208

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           PV S+  N FGLY + GNVWEW +DW++  ++  +P  NP GP  G  +V +GGSYLC+ 
Sbjct: 209 PVRSFPPNGFGLYEVSGNVWEWCSDWFDPAYYADSPEANPSGPEKGASRVMRGGSYLCHS 268

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
            YC R+R AARS NTP+SS+GN GFR  A   P
Sbjct: 269 SYCSRYRVAARSSNTPESSSGNCGFRTVAVGTP 301


>gi|383778334|ref|YP_005462900.1| hypothetical protein AMIS_31640 [Actinoplanes missouriensis 431]
 gi|381371566|dbj|BAL88384.1| hypothetical protein AMIS_31640 [Actinoplanes missouriensis 431]
          Length = 328

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 147/342 (42%), Positives = 181/342 (52%), Gaps = 62/342 (18%)

Query: 1   MVLLPAPPVERYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
           +VL P  P     DM+ L G  F MGT+       DGE P R VT+  F +    V+N +
Sbjct: 25  VVLTPGRP--DTGDMIELGGGRFLMGTDDADGFPADGEGPVRAVTVSPFRIAPVAVTNAR 82

Query: 60  FQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV 119
           F EFV+ TGY TEAE+ G +FVF  L                                  
Sbjct: 83  FAEFVADTGYATEAERAGWSFVFAGL---------------------------------- 108

Query: 120 VHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHP 179
                             LP +     G R     R      W HPEG  S +  R +HP
Sbjct: 109 ------------------LPDDFPPTRGVRAAPWWRQVEGADWRHPEGPQSAVGDRADHP 150

Query: 180 VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEF 239
           VVHV+  DAVAYC W G RLPTEAEWE+  RGGLE   FPWG+   P+G  R N+W+G F
Sbjct: 151 VVHVAHTDAVAYCRWAGVRLPTEAEWEFAARGGLEQARFPWGDE-PPQG--RCNIWEGTF 207

Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTG 298
           P  +T + G   TAPV SY+ N +GLY+M GNVWEW +DW++  +H   P  +P GP TG
Sbjct: 208 PGRHTGSPG---TAPVRSYQPNGYGLYDMAGNVWEWCSDWFSPAYHRDGPRADPAGPPTG 264

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           T +V +GGSYLC+E YC R+R AAR+QNTPDSS GN GFR A
Sbjct: 265 TARVIRGGSYLCHESYCNRYRVAARNQNTPDSSTGNTGFRVA 306


>gi|325003656|ref|ZP_08124768.1| non-specific serine/threonine protein kinase [Pseudonocardia sp.
           P1]
          Length = 303

 Score =  244 bits (624), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 139/331 (41%), Positives = 177/331 (53%), Gaps = 65/331 (19%)

Query: 13  KDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  V +PG +F MGT+       DGE P+R V +  + +D   V+N +F EFV+AT YVT
Sbjct: 26  ETAVEVPGGSFLMGTDSAEGYGTDGEGPARRVEVGPYLIDTVAVTNARFAEFVAATSYVT 85

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           E E+ G +FVF   L    R                                        
Sbjct: 86  ETERIGWSFVFVGFLPAALR---------------------------------------- 105

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
             R +  P +  W  G  G          +W  PEG  ST+  R +HPVVH+S  DA AY
Sbjct: 106 --RTSPRPRQTPWWAGVTGA---------TWSAPEGPGSTVGDRADHPVVHLSHADARAY 154

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G RLP+EAEWEY  RGGLE   +PWG+ LTP GEHR N+WQG+FP  NTA DGY +
Sbjct: 155 AAWVGGRLPSEAEWEYAARGGLEGARYPWGDELTPAGEHRCNIWQGDFPVRNTAEDGYRA 214

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           TAPV ++  N +GL+N  GNVWEW AD +             GP +G  +V +GGSYLC+
Sbjct: 215 TAPVTAFAPNGYGLHNTAGNVWEWCADHF-------------GPRSGDARVMRGGSYLCH 261

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           E YC R+R AAR+ NTPDS+AG+LG R A D
Sbjct: 262 ESYCNRYRVAARTANTPDSAAGHLGLRVAWD 292



 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/65 (55%), Positives = 48/65 (73%)

Query: 330 SSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 389
           ++AGN+   CA   GP +G  +V +GGSYLC+E YC R+R AAR+ NTPDS+AG+LG R 
Sbjct: 230 NTAGNVWEWCADHFGPRSGDARVMRGGSYLCHESYCNRYRVAARTANTPDSAAGHLGLRV 289

Query: 390 AADVS 394
           A DV+
Sbjct: 290 AWDVA 294


>gi|359777167|ref|ZP_09280457.1| putative sulfatase-modifying factor [Arthrobacter globiformis NBRC
           12137]
 gi|359305499|dbj|GAB14286.1| putative sulfatase-modifying factor [Arthrobacter globiformis NBRC
           12137]
          Length = 329

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 142/332 (42%), Positives = 179/332 (53%), Gaps = 54/332 (16%)

Query: 18  LPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
           +P  TF MG         DGE P   VTL  F +D   V+N QF  FV +TGY TEAE F
Sbjct: 37  IPAGTFLMGDAFDEGYPADGETPVHQVTLRGFRIDTTAVTNQQFAAFVDSTGYRTEAELF 96

Query: 77  GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
           G + VF  LL++ ++  +  V                                      A
Sbjct: 97  GSSAVFH-LLAQADKKDVLGV-------------------------------------AA 118

Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
             P    W    RG           W HP G  S      +HPVVHVS NDA+AYC W G
Sbjct: 119 GTP----WWLNVRGA---------DWAHPAGPKSGWSDSSDHPVVHVSHNDALAYCRWAG 165

Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPR-GEHRANVWQGEFPTNNTAADGYLSTAPV 255
            RLPTEAEWEY  RGGL  + +PWGN L    G+H  N+WQG FP  NT  DG+L TAPV
Sbjct: 166 RRLPTEAEWEYAGRGGLAGQRYPWGNELLGDGGKHLCNIWQGTFPRANTRDDGHLGTAPV 225

Query: 256 MSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
            S+  N +GLY + GNVWEW +DW+   ++  +P+ +P+GPT G+ +V +GGSYLC++ Y
Sbjct: 226 KSFPPNGYGLYEVGGNVWEWCSDWFLPKYYRNSPASSPQGPTIGSGRVMRGGSYLCHDSY 285

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           C R+R AARS NTP+SS+GN GFR  A + PT
Sbjct: 286 CNRYRVAARSSNTPESSSGNTGFRTVALESPT 317


>gi|313227917|emb|CBY23066.1| unnamed protein product [Oikopleura dioica]
          Length = 318

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 143/338 (42%), Positives = 186/338 (55%), Gaps = 56/338 (16%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVS 65
           P ER+   VL+P   F  GT+ P+L  DGE P+R   +T D FY+D +EV+N++F  FV 
Sbjct: 19  PKERWNKEVLIPKGDFVFGTDSPVLEDDGEGPAREARITRD-FYMDMYEVTNSEFWIFVQ 77

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
            T Y TEAEKFG+++VF+  L +E                 LDS                
Sbjct: 78  KTSYKTEAEKFGNSYVFDLCLPKEV----------------LDS---------------- 105

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
                      ++   A W     G           W HPEG  S++  R  HPVVHVS 
Sbjct: 106 ---------ADQMVQAAPWWVMINGA---------DWAHPEGPTSSLRGRWTHPVVHVSH 147

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGE-HRANVWQGEFPTNNT 244
            DA  YC W G RLPTEAEWE   RG L ++ FPWG++L P     RAN WQGEFP  +T
Sbjct: 148 GDAERYCKWAGKRLPTEAEWERAARGKLVSKDFPWGDDLLPSDSVFRANYWQGEFPDEDT 207

Query: 245 AADGY-LSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKV 302
            ADG+  S APV +   +N+FGL+NM+GNVWEWT D +  +H   P  +P G   GT++V
Sbjct: 208 GADGFTYSPAPVDALGAQNEFGLFNMIGNVWEWTGDKFARYHTHEPRDDPAGSPRGTERV 267

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           KKGGS+L + ++ YR R AAR  N+PDSS  ++GFRC 
Sbjct: 268 KKGGSFLSDTEHSYRIRNAARHHNSPDSSTNDMGFRCV 305


>gi|377557773|ref|ZP_09787408.1| hypothetical protein GOOTI_012_00320 [Gordonia otitidis NBRC
           100426]
 gi|377525082|dbj|GAB32573.1| hypothetical protein GOOTI_012_00320 [Gordonia otitidis NBRC
           100426]
          Length = 262

 Score =  241 bits (616), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 136/317 (42%), Positives = 172/317 (54%), Gaps = 57/317 (17%)

Query: 25  MGTNKPIL-IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE 83
           MG + P     DGE P R+ T+DAF +  H V+N +F EFV+ + YVT+AE+FG +FVF 
Sbjct: 1   MGDDGPSSHAADGEGPIRSTTVDAFAISAHTVTNAEFDEFVTNSEYVTDAERFGWSFVFA 60

Query: 84  PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAE 143
             L        + +R    R EG                 W  AV   TWR         
Sbjct: 61  GFLP-------AALRRASPRPEG---------------TPWWCAVTDATWRA-------- 90

Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
                                PEG  S +  R +HPVVHVSWNDA A+C W G RLPTEA
Sbjct: 91  ---------------------PEGPGSDVSERADHPVVHVSWNDARAFCRWAGVRLPTEA 129

Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
           EWEY  RGGL    +PWG+ L      R N+W+G FP  NTAADGY STAPV +++ N  
Sbjct: 130 EWEYAARGGLSRARYPWGDELDEDESFRCNIWRGSFPARNTAADGYRSTAPVEAFEPNGL 189

Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 323
           GLYNMVGNVWEWTAD W V H   P   P+     +    +GGS++C++ YC R+R +AR
Sbjct: 190 GLYNMVGNVWEWTADPWRVDHSGTPHEAPE-----SRYAMRGGSHMCHDSYCNRYRVSAR 244

Query: 324 SQNTPDSSAGNLGFRCA 340
           + N+ D S+GN+GFR A
Sbjct: 245 TSNSADGSSGNIGFRIA 261



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 19/37 (51%), Positives = 30/37 (81%)

Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           +GGS++C++ YC R+R +AR+ N+ D S+GN+GFR A
Sbjct: 225 RGGSHMCHDSYCNRYRVSARTSNSADGSSGNIGFRIA 261


>gi|391330173|ref|XP_003739538.1| PREDICTED: sulfatase-modifying factor 1-like [Metaseiulus
           occidentalis]
          Length = 345

 Score =  241 bits (615), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 136/333 (40%), Positives = 178/333 (53%), Gaps = 60/333 (18%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           ++ MVL+ G  F MGTN+P+++ DGE P+R   + +F+LD  EVSN  F++FV  TG++T
Sbjct: 64  HEKMVLVKGGEFIMGTNRPVILADGEHPARVKNISSFWLDATEVSNEAFEKFVHETGHLT 123

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           +AE  GD++V    ++ E +A    V+                                 
Sbjct: 124 DAETIGDSYVLGSTIT-ESKAIQGAVQ--------------------------------- 149

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                     A W +   G          SW +PEG  S I  RM HPVVHVSW DA  Y
Sbjct: 150 ---------AAPWWFSVFGA---------SWRNPEGPGSNIIKRMKHPVVHVSWRDAERY 191

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR-ANVWQGEFPTNNTAADGYL 250
           C   G RLP EAEWE  C+  L  + FPW + +T     R AN+WQG+FP  NT  DGY 
Sbjct: 192 CEHYGKRLPNEAEWEMACKRSLGGKRFPWNDEVTDAENQRSANIWQGQFPNTNTKKDGYE 251

Query: 251 STAPVMSYKENK-FGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
            TAPV +Y       L NM+GNVWEWTAD W   H   P          T +VKKGGS++
Sbjct: 252 GTAPVDTYDPYPGLPLKNMIGNVWEWTADAWTTDHGTKPDGEE------TRRVKKGGSFM 305

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+  YC+R+RC ARS+NT D+S+ N+GFRCAAD
Sbjct: 306 CHASYCFRYRCEARSENTLDTSSVNVGFRCAAD 338



 Score = 67.8 bits (164), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 28/45 (62%), Positives = 38/45 (84%)

Query: 349 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           T +VKKGGS++C+  YC+R+RC ARS+NT D+S+ N+GFRCAAD 
Sbjct: 295 TRRVKKGGSFMCHASYCFRYRCEARSENTLDTSSVNVGFRCAADA 339


>gi|315645055|ref|ZP_07898181.1| Non-specific serine/threonine protein kinase [Paenibacillus vortex
           V453]
 gi|315279476|gb|EFU42781.1| Non-specific serine/threonine protein kinase [Paenibacillus vortex
           V453]
          Length = 325

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 110/191 (57%), Positives = 136/191 (71%), Gaps = 1/191 (0%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           R  P   W HPEG  S++ HRM+HPVVHVSWNDA AYC W G RLPTEAEWEY  RGGL 
Sbjct: 132 RAVPGADWAHPEGPGSSLVHRMDHPVVHVSWNDADAYCKWAGVRLPTEAEWEYAARGGLV 191

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
            + +PWG+ L P GE R N+WQG FP  N A+DGY  TAPV +Y+ N +GLY M GNVWE
Sbjct: 192 QKRYPWGDELKPAGEFRCNIWQGRFPVKNHASDGYPGTAPVGAYRPNGYGLYQMSGNVWE 251

Query: 275 WTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
           W +DW++  +     + NPKGP  G ++V +GGSYLC+  YC R+R AAR++NTPDSS+ 
Sbjct: 252 WCSDWYDRSYDVTGHTDNPKGPLQGQERVMRGGSYLCHRSYCNRYRVAARNKNTPDSSSS 311

Query: 334 NLGFRCAADKG 344
           ++GFRC  D G
Sbjct: 312 HVGFRCVKDMG 322


>gi|325106308|ref|YP_004275962.1| hypothetical protein [Pedobacter saltans DSM 12145]
 gi|324975156|gb|ADY54140.1| protein of unknown function DUF323 [Pedobacter saltans DSM 12145]
          Length = 365

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 130/338 (38%), Positives = 186/338 (55%), Gaps = 47/338 (13%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G T+ MG +     +D E+P   VT+  F++D+HEV+N QF  FV ATGY+T AE
Sbjct: 59  MVYISGGTYMMGADNDQASED-EYPKHKVTVSPFWMDEHEVTNAQFAAFVEATGYITTAE 117

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D         EE + ++          + + +++     +  V ++      Y  W 
Sbjct: 118 RKPDW--------EEIKKQLPPGTPKPDDSKLVPASLVFTPPNQAVDLN-----NYAQW- 163

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    WE+           P  +W HP+G DS I+ + N+PVV VSW+DA+AYC W
Sbjct: 164 ---------WEW----------VPGANWKHPDGPDSDIKGKENYPVVQVSWDDAMAYCKW 204

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGL N ++PWGN    +G+ +AN W+G FP  NTA DG+   AP
Sbjct: 205 AGKDLPTEAEWEFAARGGLINNIYPWGNEHISKGKAKANSWEGHFPNINTAKDGFAGLAP 264

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-----NPKGPTTGTD--------K 301
           V SY  N +GLY+M GNVWEW +DW+   ++   +      NPKGP+   D        +
Sbjct: 265 VKSYAPNGYGLYDMAGNVWEWCSDWYRNDYYQTSTQANGINNPKGPSDSHDPDEPYIAKR 324

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           V +GGS++CN+ YC  +R A R + +PDS   N+GFRC
Sbjct: 325 VSRGGSFMCNDSYCSGYRVARRMKTSPDSGTSNMGFRC 362


>gi|319948018|ref|ZP_08022194.1| hypothetical protein ES5_01774 [Dietzia cinnamea P4]
 gi|319438307|gb|EFV93251.1| hypothetical protein ES5_01774 [Dietzia cinnamea P4]
          Length = 336

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 145/357 (40%), Positives = 182/357 (50%), Gaps = 77/357 (21%)

Query: 9   VERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
            +R +  V++P  TF MG +       DGE P   V++ AF +D+  V+N QF  F   T
Sbjct: 33  ADRPRRQVVVPAGTFLMGDHFDEGYPDDGELPVHEVSVRAFRMDECAVTNAQFATFCKDT 92

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           GYVTEAE+ G + VF  L  E +   +            LD+T                 
Sbjct: 93  GYVTEAERVGVSPVFH-LAVEADPGDV---------LHRLDTT----------------- 125

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
                           W    RG           W HP G  S+I    NHPVVHV+WND
Sbjct: 126 ---------------PWWVAVRGA---------DWRHPGGRRSSITGLQNHPVVHVTWND 161

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AYC W GARL TEAEWEY  RGGL    FPWG+ LTPRG    NVWQG FPT+NT  D
Sbjct: 162 ATAYCDWAGARLSTEAEWEYAARGGLAGARFPWGDELTPRGRWNLNVWQGRFPTHNTLDD 221

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------------- 285
           G+L+TAPV +Y+ N +GL+ MVGNVWEW AD ++  ++                      
Sbjct: 222 GHLTTAPVRAYRPNGYGLHQMVGNVWEWCADVFDPGYYAVSPREDPRGPLPGGGPDPDRA 281

Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            A + +P G T    +V +GGSYLC++ YCYR+R AARS NT  S+  NLGFRC  D
Sbjct: 282 AATAADPDGDTW---RVMRGGSYLCHDSYCYRYRVAARSANTAGSATANLGFRCVRD 335


>gi|257067789|ref|YP_003154044.1| hypothetical protein Bfae_05880 [Brachybacterium faecium DSM 4810]
 gi|256558607|gb|ACU84454.1| uncharacterized conserved protein [Brachybacterium faecium DSM
           4810]
          Length = 303

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 142/346 (41%), Positives = 173/346 (50%), Gaps = 63/346 (18%)

Query: 1   MVLLPAPPVE---RYKDMVLLPGDTFRMGTNKPIL-IKDGEFPSRNVTLDAFYLDQHEVS 56
           +  L AP  +   R +  VL+    FRMGT  P     DGE P R V + AF +D   V+
Sbjct: 16  LAALAAPTAQVRARAERTVLVEAGPFRMGTEDPDRNPGDGESPVRTVEVPAFRIDATCVT 75

Query: 57  NTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
             +F  FV+ TGYVTEAE+FG +FVF  LL +E R                         
Sbjct: 76  TAEFAAFVAETGYVTEAEEFGWSFVFGALLEKELR------------------------- 110

Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
                            R +R P    W     G           W HPEG  ST+  R 
Sbjct: 111 -----------------RASRKPPGTPWWRAVEGA---------RWDHPEGPGSTLAGRE 144

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           +HPVVHVS  DA A+  W G RLPTEAEWE   RGGL+   + WG  LTP GEHR N+WQ
Sbjct: 145 DHPVVHVSLRDAEAFARWCGMRLPTEAEWEKAARGGLDQARYAWGEELTPGGEHRCNIWQ 204

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT 296
           G FP  NT  DG+L TAPV S+  N FGLY + GNVWEW +D W                
Sbjct: 205 GAFPVRNTLEDGHLGTAPVRSFPPNGFGLYEVAGNVWEWCSDRWTT--------GSAASL 256

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           TG  +V +GGSYLC++ YC R+R AARS    + ++GN GFR  AD
Sbjct: 257 TGDVRVMRGGSYLCHDSYCNRYRVAARSSTAAEDASGNKGFRLVAD 302


>gi|291297647|ref|YP_003508925.1| Non-specific serine/threonine protein kinase [Stackebrandtia
           nassauensis DSM 44728]
 gi|290566867|gb|ADD39832.1| Non-specific serine/threonine protein kinase [Stackebrandtia
           nassauensis DSM 44728]
          Length = 319

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 137/333 (41%), Positives = 178/333 (53%), Gaps = 52/333 (15%)

Query: 8   PVERYKDMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           P +  + MV +    F MG  +      DGE P R V L  F++D   V+N ++  FV  
Sbjct: 36  PAKVRRGMVRVDVGEFDMGGEDVDANPTDGEGPPRPVRLSPFHIDTVCVTNVEYARFVRE 95

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TG+VT+AE+ G ++VF  L++   R  +            +D+++               
Sbjct: 96  TGFVTDAEREGWSYVFHLLVAPTARGHV------------MDASVP-------------- 129

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
                          A W    RG          SW  P G  S++    NHPV HVSW 
Sbjct: 130 --------------GAPWWLAVRGA---------SWRAPHGPGSSMSSVQNHPVTHVSWR 166

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA AY  W G RLPTEA+WE   RGGL    +PWG+ L  RG HR N+WQG FP +NTA 
Sbjct: 167 DAAAYAAWAGKRLPTEAQWEAAARGGLSGARYPWGDELLVRGAHRCNIWQGSFPRHNTAE 226

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
           DG+L TAPV +++ N +GLY M GNVWEW ADWW          +P GP TGT KV +GG
Sbjct: 227 DGHLGTAPVKTFRPNGYGLYQMAGNVWEWCADWWTTARD--GDSDPTGPATGTSKVIRGG 284

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           SYLC++ YC R+R AAR+ NTPDSS G++GFRC
Sbjct: 285 SYLCHDSYCNRYRVAARTHNTPDSSTGHMGFRC 317


>gi|409196281|ref|ZP_11224944.1| hypothetical protein MsalJ2_04508 [Marinilabilia salmonicolor JCM
           21150]
          Length = 337

 Score =  237 bits (605), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 139/345 (40%), Positives = 188/345 (54%), Gaps = 48/345 (13%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           + + MVL+PG  F MG  +    K  EFP   V +D+F +D+H V+N QF+EFV++TGYV
Sbjct: 27  QIEKMVLIPGGVFDMGAREKQFSKPDEFPVHKVKVDSFLMDKHPVTNAQFREFVASTGYV 86

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           T AEK  D    +  L          V         L +++    H P   +S N+  ++
Sbjct: 87  TTAEKAPDWKELKKQLPPGTPKPPDSVL--------LPASL--VFHSPETRVSLNNHQSW 136

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W                        P   W HP G  STIE + N+PVVH+SW DA A
Sbjct: 137 WGW-----------------------VPGADWKHPLGPGSTIEGKDNYPVVHISWYDAQA 173

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G RLPTEAEWEY  RGG  N ++PWGN L    E +AN WQG FP +N   DG++
Sbjct: 174 YARWAGKRLPTEAEWEYAARGGHNNFVYPWGNELI--TEEKANFWQGNFPYSNKGDDGFI 231

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD----- 300
           S++PV S+  N +GLY+M GNVWEWTADW++  ++        + NP+GP+   D     
Sbjct: 232 SSSPVASFSSNGYGLYDMAGNVWEWTADWYHTAYYEVLGRKGIADNPQGPSVSLDPMEPG 291

Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
              K  +GGS+LCN+ YC  +R +AR ++TPDS   +LGFRC  D
Sbjct: 292 VAKKSIRGGSFLCNDSYCAGYRASARMKSTPDSGMMHLGFRCVKD 336


>gi|290561052|gb|ADD37928.1| Sulfatase-modifying factor 1 [Lepeophtheirus salmonis]
          Length = 314

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 139/332 (41%), Positives = 171/332 (51%), Gaps = 66/332 (19%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV + G  F  GT KP+   DGE PS    +  FY+D +E               VT A
Sbjct: 46  NMVHITGGYFNKGTKKPVFPADGEGPSERTFVTDFYMDAYE---------------VTNA 90

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E F    V    ++E E    S V   M+ F  L   +  ++   V    W   V    W
Sbjct: 91  E-FESFVVANDYVTEAEIFGQSFV---MEYF--LSPEVRSKVALGVQLAPWWLQVDRADW 144

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           +                             HPEG DS+I++RM+HPV+HVSWNDAV YC 
Sbjct: 145 K-----------------------------HPEGPDSSIQNRMDHPVLHVSWNDAVKYCE 175

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWEY CR G E+RLFPWGN   P  +  AN+W GEFPT N  ADGY  T 
Sbjct: 176 WSSKRLPTEAEWEYACRAGKEDRLFPWGNKWNPNDKIWANIWTGEFPTKNDEADGYAGTN 235

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV  +  N +GLY+M+GN WEWT D                   G   VKKGGS++C++ 
Sbjct: 236 PVGKFPPNAYGLYDMIGNAWEWTGD----------------SFRGEQMVKKGGSFMCHKN 279

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +CYR+RCAARSQNT DSSA NLGFRCA+   P
Sbjct: 280 HCYRYRCAARSQNTKDSSAHNLGFRCASHTKP 311



 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 31/44 (70%), Positives = 38/44 (86%)

Query: 348 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
           G   VKKGGS++C++ +CYR+RCAARSQNT DSSA NLGFRCA+
Sbjct: 264 GEQMVKKGGSFMCHKNHCYRYRCAARSQNTKDSSAHNLGFRCAS 307


>gi|403069843|ref|ZP_10911175.1| hypothetical protein ONdio_09646 [Oceanobacillus sp. Ndiop]
          Length = 185

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 111/181 (61%), Positives = 131/181 (72%), Gaps = 2/181 (1%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W  PEG  STIE R++HPV+HVSWNDA+A+C W G RLPTEAEWEY  RGGLE + +PWG
Sbjct: 6   WFQPEGPGSTIESRLDHPVIHVSWNDALAFCQWAGKRLPTEAEWEYAARGGLEQKRYPWG 65

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
           + LTP GEH  N+WQG+FP  NT  DGYL TAP  S+  N FGLYN+ GNVWEW ADW+ 
Sbjct: 66  DELTPNGEHHCNIWQGKFPQENTLDDGYLGTAPAQSFPPNGFGLYNVSGNVWEWCADWFT 125

Query: 282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
             +HP    +       T KV +GGSYLC++ YC R+R AARS NT DSS GN+GFRCAA
Sbjct: 126 --NHPDRKIDELRVNNQTPKVMRGGSYLCHKSYCNRYRVAARSSNTIDSSTGNMGFRCAA 183

Query: 342 D 342
           D
Sbjct: 184 D 184


>gi|381397685|ref|ZP_09923095.1| Sulphatase-modifying factor protein [Microbacterium laevaniformans
           OR221]
 gi|380775253|gb|EIC08547.1| Sulphatase-modifying factor protein [Microbacterium laevaniformans
           OR221]
          Length = 304

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 138/328 (42%), Positives = 176/328 (53%), Gaps = 53/328 (16%)

Query: 18  LPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
           +   TF MG       + DGE P   V L AF +D   V+   F  FV  TGY TEAE F
Sbjct: 1   MSAGTFVMGDASGERNRLDGEVPRHRVDLAAFEIDATTVTVAAFARFVEDTGYRTEAEVF 60

Query: 77  GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
           G + VF  L++  +    +                      P+    W  AV     RGA
Sbjct: 61  GSSAVFHSLVAAADADVSA----------------------PLGGTPWWRAV-----RGA 93

Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
                                    W HP G  S+++   +HPVVHVSWNDA AYC W G
Sbjct: 94  ------------------------DWSHPGGPLSSVDGLGDHPVVHVSWNDAQAYCAWAG 129

Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
            RLPTEAEWEY  RGGL+   +PWG+     G  RAN+WQGEFP+ NTA DG+++TAPV 
Sbjct: 130 RRLPTEAEWEYAARGGLDGATYPWGDAAVDDGGWRANIWQGEFPSVNTAEDGWVATAPVR 189

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYC 315
           S+  N +GL+  VGNVWEW ADW++   +  +P  +P+GP  G+ +V +GGSYLC+  YC
Sbjct: 190 SFAPNGYGLWQPVGNVWEWCADWFSADTYRRSPREDPRGPEQGSARVLRGGSYLCHASYC 249

Query: 316 YRHRCAARSQNTPDSSAGNLGFRCAADK 343
            R+R AARS NTPDSS GN GFR  + +
Sbjct: 250 NRYRNAARSSNTPDSSMGNAGFRTVSAR 277


>gi|433456296|ref|ZP_20414348.1| formylglycine-generating enzyme [Arthrobacter crystallopoietes
           BAB-32]
 gi|432196498|gb|ELK52949.1| formylglycine-generating enzyme [Arthrobacter crystallopoietes
           BAB-32]
          Length = 279

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 135/312 (43%), Positives = 166/312 (53%), Gaps = 53/312 (16%)

Query: 35  DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
           DGE P   V LD+F +D   V+N  F  FV ATGY TE+E +  + VF            
Sbjct: 12  DGEGPVHKVVLDSFRIDAAAVTNEDFSRFVEATGYRTESEHYESSAVF------------ 59

Query: 95  SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
                                     H+    A A      A  P    W    RG    
Sbjct: 60  --------------------------HLQVQAAAADVLGEAAGAP----WWVNVRGA--- 86

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
                  W HP G  S+      HPVV VSWNDA AYC W G RLPTEAEWEY  RGGLE
Sbjct: 87  ------DWAHPTGPLSSWRDIPQHPVVQVSWNDAQAYCRWAGRRLPTEAEWEYAARGGLE 140

Query: 215 NRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVW 273
            + + WGN+L  P GEH  N+WQGEFP +NT  DGYL TAPV ++  N FGLY + GNVW
Sbjct: 141 GKRYAWGNDLAGPAGEHYCNIWQGEFPRSNTLDDGYLGTAPVRTFPPNGFGLYEVSGNVW 200

Query: 274 EWTADWW-NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 332
           EW  DW+   ++  +PS+NP+GPT G  +V +GGS+LC++ YC R+R AARS N P+S++
Sbjct: 201 EWCNDWFLPKYYRNSPSHNPQGPTIGRGRVMRGGSFLCHDSYCNRYRVAARSSNGPESAS 260

Query: 333 GNLGFRCAADKG 344
            N GFR  A  G
Sbjct: 261 SNCGFRTVAAPG 272


>gi|383778954|ref|YP_005463520.1| hypothetical protein AMIS_37840 [Actinoplanes missouriensis 431]
 gi|381372186|dbj|BAL89004.1| hypothetical protein AMIS_37840 [Actinoplanes missouriensis 431]
          Length = 305

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 134/332 (40%), Positives = 180/332 (54%), Gaps = 57/332 (17%)

Query: 14  DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + V +P  TF MG  +      DGE P  +V + AF +D   V+   F+ FV+ATGY T+
Sbjct: 28  EQVSVPAQTFAMGDAHGDGAPGDGEQPVHDVRVPAFTIDATSVTVADFRSFVTATGYRTD 87

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE++G + VF   L++ ++         + + +G                          
Sbjct: 88  AEQYGWSAVFHLALTDPDQV--------IGQMQGT------------------------- 114

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                      W  G  G           W HP G         +HP VHVSWNDA AYC
Sbjct: 115 ----------PWWLGVEGA---------DWAHPGGPSDVAAD--DHPAVHVSWNDAQAYC 153

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLP+EAEWE   R GL +R +PWG+ L P   H  N+WQG FP  NTAADG+++T
Sbjct: 154 AWAGRRLPSEAEWECAARAGLPSRRYPWGDEL-PAAGHALNIWQGSFPAVNTAADGFVTT 212

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
           APV +++ N +GL+  VGNVWEW ADW++  ++  AP  +P+GP  GT +V +GGSYLC+
Sbjct: 213 APVRTFQPNGYGLWQPVGNVWEWVADWFSPAYYAAAPVEDPQGPPDGTARVIRGGSYLCH 272

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           + YC R+R AARS NTPDSS GN GFR A+ K
Sbjct: 273 DSYCNRYRNAARSSNTPDSSTGNTGFRTASRK 304


>gi|290462167|gb|ADD24131.1| Sulfatase-modifying factor 1 [Lepeophtheirus salmonis]
          Length = 314

 Score =  235 bits (600), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 136/338 (40%), Positives = 171/338 (50%), Gaps = 78/338 (23%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV + G  F  GT KP+   DGE PS    +  FY+D +E               VT A
Sbjct: 46  NMVHITGGYFIKGTKKPVFPADGEGPSERTFVTDFYMDAYE---------------VTNA 90

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E       FE  +   +    +++        G    +E+ +                  
Sbjct: 91  E-------FESFVVANDYVTEAEIF-------GQSFVMEYFLSP---------------- 120

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWND 187
                      E   +  L  +L PW        W HPEG DS+I++RM+HPV+HVSWND
Sbjct: 121 -----------EVRSKVALGVQLAPWWLQVDRADWKHPEGPDSSIQNRMDHPVLHVSWND 169

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           AV YC W   RLPTEAEWEY CR G E+RLFPWGN   P  +  AN+W GEFPT N  AD
Sbjct: 170 AVKYCEWSSKRLPTEAEWEYACRAGKEDRLFPWGNKWNPNDKIWANIWTGEFPTKNDEAD 229

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
           GY  T PV  +  N +GLY+M+GN WEWT D                   G   VKKGGS
Sbjct: 230 GYAGTNPVGKFPPNAYGLYDMIGNAWEWTGD----------------SFRGEQMVKKGGS 273

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           ++C++ +CYR+RCAARSQNT DSSA NLGFRCA+   P
Sbjct: 274 FMCHKNHCYRYRCAARSQNTKDSSAHNLGFRCASHTKP 311



 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 31/44 (70%), Positives = 38/44 (86%)

Query: 348 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
           G   VKKGGS++C++ +CYR+RCAARSQNT DSSA NLGFRCA+
Sbjct: 264 GEQMVKKGGSFMCHKNHCYRYRCAARSQNTKDSSAHNLGFRCAS 307


>gi|257868811|ref|ZP_05648464.1| conserved hypothetical protein [Enterococcus gallinarum EG2]
 gi|357049691|ref|ZP_09110908.1| hypothetical protein HMPREF9478_00891 [Enterococcus saccharolyticus
           30_1]
 gi|257802975|gb|EEV31797.1| conserved hypothetical protein [Enterococcus gallinarum EG2]
 gi|355382923|gb|EHG30014.1| hypothetical protein HMPREF9478_00891 [Enterococcus saccharolyticus
           30_1]
          Length = 284

 Score =  234 bits (598), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 133/333 (39%), Positives = 184/333 (55%), Gaps = 58/333 (17%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ +P  T+R+GT+ K    +D E P   V + +F +D+  V+N  F+ F++ATGYVTEA
Sbjct: 1   MIKIPAGTYRIGTDTKAGFEEDQEGPKIQVEMASFEIDETTVTNQMFEAFIAATGYVTEA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   LSE+ + + SQ+                     + ++SW  AVA   W
Sbjct: 61  EQFGWSFVFHYFLSEKTKQR-SQL---------------------IPNMSWWYAVARADW 98

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R                             HPEG DS+IE RM+HPVV VS NDA+A+C 
Sbjct: 99  R-----------------------------HPEGPDSSIEQRMDHPVVQVSRNDALAFCR 129

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWE   +GG EN L+PWG +     ++  N+WQG FP  NT ADG+ STA
Sbjct: 130 WADKRLPTEAEWEIAAKGGTENDLYPWGTDFLAGEDYHCNIWQGAFPKANTKADGFASTA 189

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVK------KGGS 307
           P   Y+ N +GLY ++GNVWEW ++   +      S++         +V       +GGS
Sbjct: 190 PARFYEPNGYGLYQVIGNVWEWCSNPARIPLEQFRSWDSHFFEQRFQQVDDAMYATRGGS 249

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           +LC+E YC R+R AAR+ N+  S+A NLGFRCA
Sbjct: 250 FLCHESYCKRYRIAARNGNSGMSAANNLGFRCA 282


>gi|386847436|ref|YP_006265449.1| hypothetical protein ACPL_2486 [Actinoplanes sp. SE50/110]
 gi|359834940|gb|AEV83381.1| hypothetical protein ACPL_2486 [Actinoplanes sp. SE50/110]
          Length = 309

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 133/327 (40%), Positives = 178/327 (54%), Gaps = 56/327 (17%)

Query: 14  DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           D   +PG TF MG  +      DGE P   VT+  F +D   V+   F+ FV+ATG+ T+
Sbjct: 28  DQARVPGQTFAMGDAHGDGHPADGEQPVHPVTVPGFTIDVTAVTVADFRAFVTATGFRTD 87

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE FG + VF   +++ ER                   +  RM                 
Sbjct: 88  AEHFGWSAVFHSAVADPER-------------------VAGRM----------------- 111

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                      W  G  G           W HP G  +      +HPVVHVSWNDA AYC
Sbjct: 112 -------AGTPWWLGVGGA---------DWAHPGGPSTVALD--DHPVVHVSWNDAQAYC 153

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           TW G RLP+EAEWE   RGGL+   +PWG++L    +   N+WQG+FP  NTA DG+++T
Sbjct: 154 TWAGRRLPSEAEWECAARGGLDRLRYPWGDDLPAGDDWACNIWQGDFPVRNTAEDGWVTT 213

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCN 311
           APV +Y  N +GL+ +VGNVWEW ADW++  ++  +P  +P+GP+ G  +V +GGSYLC+
Sbjct: 214 APVRAYAPNAYGLHQVVGNVWEWCADWFSPRYYAVSPGDDPRGPSRGAARVIRGGSYLCH 273

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFR 338
           + YC R+R AARS NTPDSS GN+GFR
Sbjct: 274 DSYCNRYRNAARSSNTPDSSTGNMGFR 300


>gi|260430125|ref|ZP_05784100.1| sulfatase-modifying factor 1 [Citreicella sp. SE45]
 gi|260419048|gb|EEX12303.1| sulfatase-modifying factor 1 [Citreicella sp. SE45]
          Length = 332

 Score =  234 bits (596), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 131/326 (40%), Positives = 164/326 (50%), Gaps = 53/326 (16%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +PG  F MG  K    +D + P R V +  F +  + V+N  F  FV ATGY + AE+ G
Sbjct: 57  IPGGFFDMGARKSTFAEDLDSPRRKVFVSPFRISPYTVTNADFARFVEATGYRSVAEREG 116

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
            ++VF  L                                                    
Sbjct: 117 WSYVFHLL---------------------------------------------------- 124

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
           L T A+W     G    R      W  PEG  + +  R +HPV HV+W DA+AYC W G 
Sbjct: 125 LETPADWPVSPPGLRWWRRVDGAFWAQPEGPGTDVADRGDHPVTHVAWYDALAYCRWAGV 184

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
           RLP EAEWE   RGGL +  FPWGN L P GE   N WQG FP  N+A DGY+ TAPV S
Sbjct: 185 RLPREAEWERAGRGGLAHLKFPWGNTLVPGGEFAMNTWQGHFPDENSAEDGYVGTAPVGS 244

Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCY 316
           Y  N +GLYNM GNVWEW  DW+     P     +P+GP +GT KV++GGS+LC+  YC 
Sbjct: 245 YAPNGYGLYNMTGNVWEWVQDWFGPRGAPKGVPRDPRGPESGTAKVQRGGSFLCHVSYCD 304

Query: 317 RHRCAARSQNTPDSSAGNLGFRCAAD 342
           R+   +R++N PDSS  N GFR AAD
Sbjct: 305 RYHVHSRTRNDPDSSTSNSGFRIAAD 330


>gi|302538003|ref|ZP_07290345.1| sulfatase-modifying factor 1 [Streptomyces sp. C]
 gi|302446898|gb|EFL18714.1| sulfatase-modifying factor 1 [Streptomyces sp. C]
          Length = 299

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 111/182 (60%), Positives = 126/182 (69%), Gaps = 13/182 (7%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           SW HPEG  S    R NHPVVHVSW+DA AYC W G RLPTEAEWEY  RGGLE R +PW
Sbjct: 129 SWRHPEGPGSHATDRQNHPVVHVSWHDAQAYCAWSGTRLPTEAEWEYAARGGLEQRRYPW 188

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           G+ L P G H  N+WQG+FP  NT ADGY+STAPV S++ N FGLYN VGNVWEW ADW+
Sbjct: 189 GDELMPDGRHMCNIWQGDFPVLNTGADGYVSTAPVKSFRPNGFGLYNTVGNVWEWCADWF 248

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                         P   T +  +GGSYLC++ YC R+R AARS NTPDSS GN+GFR A
Sbjct: 249 T-------------PAEETSRAMRGGSYLCHDSYCNRYRVAARSSNTPDSSTGNIGFRVA 295

Query: 341 AD 342
           AD
Sbjct: 296 AD 297



 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/166 (44%), Positives = 95/166 (57%), Gaps = 11/166 (6%)

Query: 13  KDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           + ++ LPG +F MGT+ P     D E P R V ++ F +    V+N QF  F  +TGY T
Sbjct: 31  RQLLALPGGSFLMGTDDPEGFPADAEGPIREVAVEPFRIAATAVTNAQFATFAKSTGYTT 90

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDM----------KRFEGLDSTIEHRMHHPVVH 121
           EAE+FG +F F   L++E  A  + V              +  EG  S    R +HPVVH
Sbjct: 91  EAERFGFSFTFGAFLADEVAAVSAPVAAVPWWRAVEGASWRHPEGPGSHATDRQNHPVVH 150

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
           +SW+DA AYC W G RLPTEAEWEY  RGGLE R +PWG  L P+G
Sbjct: 151 VSWHDAQAYCAWSGTRLPTEAEWEYAARGGLEQRRYPWGDELMPDG 196


>gi|379057564|ref|ZP_09848090.1| hypothetical protein SproM1_05779 [Serinicoccus profundi MCCC
           1A05965]
          Length = 320

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 137/331 (41%), Positives = 167/331 (50%), Gaps = 58/331 (17%)

Query: 15  MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
            V +PG TF  G  +      DGE P   VT+  F LD   V+N  F  FV+ TG   EA
Sbjct: 36  QVDVPGGTFARGDAHGDGHPGDGETPVHEVTVSGFSLDATTVTNDDFARFVAETGQRIEA 95

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG + VF   L+      +  V                                    
Sbjct: 96  EPFGFSAVFHLALAARPEEVLRPV------------------------------------ 119

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                   A W  G RG           W HP G  S +    +HPVVHVSWNDA+AYC 
Sbjct: 120 ------PGAPWWLGVRGA---------DWRHPGGSRSDLTGLGDHPVVHVSWNDALAYCA 164

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLST 252
           W G RLPTEAEWEY  RGGL +  +PWG++L  P G  R N+WQG+FP  N  ADGYL+T
Sbjct: 165 WAGRRLPTEAEWEYAARGGLAHARYPWGDDLLDPSGAWRCNIWQGDFPRRNDLADGYLTT 224

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVKKGGS 307
           AP  S+  N +G   MVGNVWEW ADW++   +        + +P GP  G  +V +GGS
Sbjct: 225 APATSFTPNGYGAVQMVGNVWEWCADWFDPGTYAQVARAGRAVDPTGPGAGGSRVIRGGS 284

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           YLC++ YC R+R AARS NTPDSS GN GFR
Sbjct: 285 YLCHDSYCNRYRVAARSANTPDSSTGNTGFR 315


>gi|296121197|ref|YP_003628975.1| hypothetical protein Plim_0932 [Planctomyces limnophilus DSM 3776]
 gi|296013537|gb|ADG66776.1| protein of unknown function DUF323 [Planctomyces limnophilus DSM
           3776]
          Length = 338

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 136/344 (39%), Positives = 181/344 (52%), Gaps = 47/344 (13%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  FRMG+N P L  D E P+  V+++ F+LD+HEV+N +F EFV ATGYVT AE
Sbjct: 29  MVWIPGGEFRMGSNDP-LAWDDERPTHPVSVEGFWLDEHEVTNREFAEFVKATGYVTTAE 87

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K       E +L+       +     +     + +  +H        +  ND   +  W 
Sbjct: 88  K---APTVEEILANSPPGTAAPDPAVLVPASLVFTPPDHP-------VPLNDFSQWWVWT 137

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                  P  +W HPEG  S I+ RM+HPVVHVSW+DA AY  W
Sbjct: 138 -----------------------PGANWRHPEGPGSNIDERMDHPVVHVSWDDASAYAQW 174

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE   RGG E   F WG         +AN+W G+FP  NTA DG+  TAP
Sbjct: 175 AGKRLPTEAEWERAARGGKEGMPFVWGTTPPSETSPQANLWNGKFPYENTAKDGFTRTAP 234

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTD--------K 301
           V S+  N +GLY+M GNVWEW  DW++   +      A + NP GP    D        +
Sbjct: 235 VKSFPANDYGLYDMSGNVWEWCHDWYDRELYSKRPQLAVTKNPSGPEKSHDPTQPFQQLR 294

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
            ++GGS+LCN+ YC R+R +AR   +PD+   ++GFRCA    P
Sbjct: 295 SQRGGSFLCNDSYCSRYRPSARHGCSPDTGMSHVGFRCAKSVVP 338


>gi|395531848|ref|XP_003767985.1| PREDICTED: sulfatase-modifying factor 2-like [Sarcophilus harrisii]
          Length = 306

 Score =  232 bits (591), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 141/352 (40%), Positives = 188/352 (53%), Gaps = 70/352 (19%)

Query: 1   MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           + L P P   +   MV LPG TFRMGTN P   KDGE P+R VT+ +F +D++ V+N  F
Sbjct: 19  LGLWPEPGSGQDDSMVQLPGGTFRMGTNSPD-GKDGEGPTREVTVKSFAIDKYPVTNRDF 77

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           +EFV    Y TEAE FG +FVFE L+S+E + K++Q                        
Sbjct: 78  REFVRQKKYQTEAESFGWSFVFEDLVSDELKKKVTQ------------------------ 113

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                  +    W    LP E  +                 W  P G  S I  R++ PV
Sbjct: 114 ------KLESAPWW---LPVEKAF-----------------WRQPAGPGSGIRERLDFPV 147

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDA A+C WRG RLPTE EWE+  RGGL+ R++PWGN       +R N+WQG+FP
Sbjct: 148 LHVSWNDARAFCAWRGKRLPTEEEWEFAARGGLKGRVYPWGNQFQ---LNRTNLWQGKFP 204

Query: 241 TNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
             +TA DG+   +PV ++  +N +GLY+++GNVWEWTA           +Y P  P  G 
Sbjct: 205 QADTAEDGFHGASPVDAFPPQNNYGLYDLMGNVWEWTAS----------TYEP--PIPGQ 252

Query: 300 D-KVKKGGSYLCNEQYC--YRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           D KV +G S++   +    +R R   R  NTPDS++ NLGFRCAA  G   G
Sbjct: 253 DKKVLRGASWIDTAEGSANHRARVTTRMGNTPDSASDNLGFRCAASPGRPLG 304


>gi|375143553|ref|YP_005005994.1| sulfatase-modifying factor protein [Niastella koreensis GR20-10]
 gi|361057599|gb|AEV96590.1| Sulphatase-modifying factor protein [Niastella koreensis GR20-10]
          Length = 382

 Score =  231 bits (590), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 131/343 (38%), Positives = 179/343 (52%), Gaps = 53/343 (15%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDG---EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           Y  MV +PG  ++MG       KDG   E+P   V +D FY+D  EV+N QF EFV ATG
Sbjct: 75  YAGMVKIPGGDYQMGAAD----KDGRPDEYPQHPVHVDGFYMDATEVTNAQFMEFVKATG 130

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           YVT AE+              E  K        K  + L          P   ++ ND  
Sbjct: 131 YVTTAER----------TPNWEELKKQLPSGTPKPADSLLVAASLVFTPPAKPVALNDVS 180

Query: 129 AYCTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
            +  W +GA                         W HPEG  S+I+ + N+PVV VSW+D
Sbjct: 181 QWWQWVKGA------------------------DWKHPEGPASSIKGKENYPVVQVSWDD 216

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A+AY  W G +LPTEAEWE+  R GL N+ F WG+    +G+ +AN+WQG FP +NTA D
Sbjct: 217 AMAYAKWAGKQLPTEAEWEWAARAGLTNQPFTWGSEAVEKGKPKANIWQGHFPDHNTATD 276

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS---YNPKGPTTGTD---- 300
           G+   APV S+  N +GLY++ GNVWEW ADW+   ++   +    NP GP+ G D    
Sbjct: 277 GFARVAPVKSFAPNAYGLYDIAGNVWEWCADWYRSDYYTQATKKAVNPPGPSVGYDPDEP 336

Query: 301 ----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               +V +GGS+LC+  YC  +R +AR + +PD+   + GFRC
Sbjct: 337 TVPKRVVRGGSFLCHASYCASYRVSARMKTSPDTGLEHTGFRC 379


>gi|357384299|ref|YP_004899023.1| sulfatase modifying factor 1 (C-alpha-formyglycine- generating
           enzyme 1) [Pelagibacterium halotolerans B2]
 gi|351592936|gb|AEQ51273.1| sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1) [Pelagibacterium halotolerans B2]
          Length = 305

 Score =  231 bits (589), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 146/357 (40%), Positives = 185/357 (51%), Gaps = 69/357 (19%)

Query: 1   MVLLPAPPVERYK-----------DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFY 49
           M   PAP  E+ +           D + +PG    +GTN P +  DGE P R VTL  F 
Sbjct: 1   MADTPAPSAEKLRQAAPISSGTALDPIRVPGGRSFVGTNSPEIPVDGEGPERPVTLRDFA 60

Query: 50  LDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDS 109
           L+   V+N +F +FV+ATGYVTEAE++G + VF                H  +  E LD+
Sbjct: 61  LEATTVTNARFADFVAATGYVTEAERYGWSAVF----------------HSDR--ESLDA 102

Query: 110 TIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGID 169
                                 T  G++LP    W +   G          SW HPEG D
Sbjct: 103 ---------------------ATRAGSQLP----WWHKVDGV---------SWRHPEGPD 128

Query: 170 STIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGE 229
           S++E R +HPVV VSW DA A+  W G RLPTEAEWE+  RGG E R FPWG+      +
Sbjct: 129 SSLEGREDHPVVQVSWRDANAFAQWVGGRLPTEAEWEHAARGGTERRRFPWGDEEP--ND 186

Query: 230 HRA---NVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-H 285
           H A   N+WQG FP  NTA DG++ TAP  S++ N  G Y+M GNVWEWT D + V    
Sbjct: 187 HDAIFCNIWQGRFPDLNTARDGFVRTAPARSFESNPLGFYSMSGNVWEWTHDAFRVRSVS 246

Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            A           ++KV KGGS+LC+  YCYR+R AAR   TPDS+  N GFR A D
Sbjct: 247 RAAKQRNAYALQHSEKVLKGGSFLCHVSYCYRYRIAARMAMTPDSAGSNTGFRVAYD 303


>gi|256420881|ref|YP_003121534.1| Non-specific serine/threonine protein kinase [Chitinophaga pinensis
           DSM 2588]
 gi|256035789|gb|ACU59333.1| Non-specific serine/threonine protein kinase [Chitinophaga pinensis
           DSM 2588]
          Length = 349

 Score =  231 bits (589), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 179/338 (52%), Gaps = 43/338 (12%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+PG TF MG +      D E P  +V +D+F++D+HEV+N +F  FV ATGY+T AE
Sbjct: 46  MVLIPGGTFTMGADDAEAHPD-EKPRHDVRVDSFWMDEHEVTNAEFAAFVKATGYLTTAE 104

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +        P+  EE   ++     +      L  +I   +  P  H    D V+     
Sbjct: 105 R--------PITKEELMQQLPPGSPEPDSSMLLPGSI---VFTPPSHPVPLDDVS----- 148

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                   +W    +G          SW HP G  + I  + + PVVH+SWNDA A+  W
Sbjct: 149 --------QWWSFIQGA---------SWQHPGGPTTNINGKEHLPVVHISWNDAQAFAKW 191

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY  RGGL+++ +PWG      G  +AN W G FP  NT  DGY   AP
Sbjct: 192 AGKRLPTEAEWEYAARGGLKDQPYPWGPEALTTGRVKANTWNGHFPYQNTQTDGYFGAAP 251

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGPTTGTDK--------VKKG 305
           V S+  N +GLY+M GNVWEW ADW++  ++      NPKGP  G D           +G
Sbjct: 252 VKSFAANAYGLYDMSGNVWEWCADWYDSRYYGLKQGDNPKGPAAGYDPEDPASPKHTIRG 311

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           GS++C ++YC  +R  AR + +P+S   NLGFRC  DK
Sbjct: 312 GSFMCTDEYCSGYRVTARMKTSPESGLENLGFRCVKDK 349


>gi|403526439|ref|YP_006661326.1| formylglycine-generating enzyme [Arthrobacter sp. Rue61a]
 gi|403228866|gb|AFR28288.1| putative formylglycine-generating enzyme [Arthrobacter sp. Rue61a]
          Length = 337

 Score =  231 bits (588), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 142/348 (40%), Positives = 179/348 (51%), Gaps = 66/348 (18%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPI---LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           A P +  KD   +P  TF MG   P       DGE P   V++ AF +    V+N +F E
Sbjct: 43  AVPSKEAKD-AEIPAGTFSMG--DPFNEGYDGDGESPVHEVSVSAFRIGATTVTNQEFAE 99

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV ATGY TE+E +G + VF   +                          H + + V ++
Sbjct: 100 FVDATGYRTESETYGTSAVFHLAVKANT----------------------HDILNRVNNV 137

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
            W     +   RGA                         W HP G  S      +HPV H
Sbjct: 138 PW-----WLNIRGA------------------------DWAHPAGPLSNWADIPDHPVTH 168

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL--TPRGE--HRANVWQGE 238
           VS NDA+AYC W G RLPTEAEWEY  RGGL  + +PWGN L     GE  H  N+WQGE
Sbjct: 169 VSHNDALAYCQWAGRRLPTEAEWEYAARGGLSGQRYPWGNELHNDAPGETAHNCNIWQGE 228

Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW-----WNVHHHPAPSYNPK 293
           FP+ NT  DGYL+TAP  S++ N +GLY   GNVWEW +DW     +   H      +P+
Sbjct: 229 FPSRNTIDDGYLTTAPATSFRPNGYGLYQTSGNVWEWCSDWFLPKYYKTCHAQGTVEDPQ 288

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           GPT G  +V +GGSYLC++ YC R+R AARS NTPDS++GNLGFR  A
Sbjct: 289 GPTIGRGRVMRGGSYLCHDSYCNRYRLAARSSNTPDSASGNLGFRTVA 336


>gi|315504000|ref|YP_004082887.1| hypothetical protein ML5_3220 [Micromonospora sp. L5]
 gi|315410619|gb|ADU08736.1| protein of unknown function DUF323 [Micromonospora sp. L5]
          Length = 319

 Score =  231 bits (588), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 133/328 (40%), Positives = 171/328 (52%), Gaps = 59/328 (17%)

Query: 14  DMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           D++ LPG TF MGT+ P    +D E P R V +  F +    V+N QF  FV ATGY T+
Sbjct: 49  DLIALPGGTFLMGTDDPEGFPEDLEGPVRPVAVAPFEIAATPVTNRQFARFVDATGYRTD 108

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE++G +FVF  LL       +            LD+++                     
Sbjct: 109 AERYGWSFVFHLLLVPSAHPYV------------LDASVP-------------------- 136

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                   +A W     G           W  P+G  S +  R  HPVVHVS  DA AYC
Sbjct: 137 --------QAPWWLAVEGAY---------WAAPDGPGSDVRGREQHPVVHVSHRDAEAYC 179

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RL TEAEWEY  RGGL    + WG+ LTP G HR N+WQG FP  NTA DG+  T
Sbjct: 180 RWSGTRLATEAEWEYAARGGLSQARYAWGDELTPDGRHRCNIWQGAFPAVNTAEDGFAGT 239

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           +PV ++  N +GLY++ GNVWE TA+ W  H         +G  +  + V +GGSYLC++
Sbjct: 240 SPVTTFPPNGYGLYDVAGNVWELTAEPWRDH---------RGRISPGESVIRGGSYLCHD 290

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            YC R+R AARS+ T DSS+GN GFR A
Sbjct: 291 SYCNRYRVAARSRTTVDSSSGNTGFRVA 318



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/48 (54%), Positives = 35/48 (72%)

Query: 343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           +G  +  + V +GGSYLC++ YC R+R AARS+ T DSS+GN GFR A
Sbjct: 271 RGRISPGESVIRGGSYLCHDSYCNRYRVAARSRTTVDSSSGNTGFRVA 318


>gi|442589738|ref|ZP_21008545.1| hypothetical protein D505_17990 [Elizabethkingia anophelis R26]
 gi|442560626|gb|ELR77854.1| hypothetical protein D505_17990 [Elizabethkingia anophelis R26]
          Length = 383

 Score =  231 bits (588), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 135/355 (38%), Positives = 185/355 (52%), Gaps = 60/355 (16%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           PV+    MVL+P  TF MG+N     ++ E P   V++ +FY+D HEV+N +F++FV AT
Sbjct: 65  PVDVTNKMVLIPAGTFMMGSNGDEWSRNWESPQHKVSVKSFYMDTHEVTNAEFEKFVIAT 124

Query: 68  GYVTEAEKFGDTFVFE-------PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           GY+T AEK  D    +       P  S+E+    S V    +   GLD            
Sbjct: 125 GYITTAEKPVDWEALKKELPPDTPKPSDEDLMPGSMVFASPESVNGLDD----------- 173

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
           +  W        WR         W  G              W HP G  S I+ +  HPV
Sbjct: 174 YSQW--------WR---------WVKGA------------DWQHPLGPGSNIKGKEKHPV 204

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           VHVS++DAVAY  W G RLPTEAEWE+  RGGL+N+++PWG+    RG  +AN W G FP
Sbjct: 205 VHVSYDDAVAYAKWIGKRLPTEAEWEWAARGGLDNKIYPWGDEHISRGSQKANYWTGTFP 264

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYNPKGP 295
             NT  DGY  TAPV SY  N +GLY+M GNVWE  +DW++ +++ +      + NP GP
Sbjct: 265 IKNTEKDGYYYTAPVGSYAPNAYGLYDMAGNVWEICSDWFSENYYQSLQGNQITNNPAGP 324

Query: 296 TT--------GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +            +V +GGS+LCN+ YC  +R +AR   + D+   + GFR   D
Sbjct: 325 SKPYYPSEPFANKRVVRGGSFLCNDSYCASYRVSARMPYSEDTGMIHTGFRLVKD 379


>gi|384917530|ref|ZP_10017652.1| Non-specific serine/threonine protein kinase [Citreicella sp. 357]
 gi|384468573|gb|EIE52996.1| Non-specific serine/threonine protein kinase [Citreicella sp. 357]
          Length = 320

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 131/330 (39%), Positives = 162/330 (49%), Gaps = 54/330 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           ++ L G  F MGT K    +D + P R V L  F +     +N +F  F   TGY T AE
Sbjct: 40  LIALDGGFFDMGTRKSRFPEDFDAPRRKVRLSPFRISATACTNAEFARFARETGYRTVAE 99

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
             G ++VF  LL +  R   S                      P     W      C   
Sbjct: 100 VEGWSYVFHLLLDDPARFPTS----------------------PAGLRWWRKVDGAC--- 134

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                      W  PEG  S I  R  HPVVHV+W DA+AYC W
Sbjct: 135 ---------------------------WHAPEGQGSDITGRAAHPVVHVAWFDALAYCHW 167

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LP+EA+WE+  RGGL  R FPWGN +TP G    N WQG FP  NTA DG+  TAP
Sbjct: 168 AGLALPSEAQWEFAARGGLARRKFPWGNEMTPGGAFAMNTWQGSFPDRNTAEDGFTGTAP 227

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           V ++  N FGLYN  GNVWEW  D +       P P  +P+GP  GT +V++GGSYLC+E
Sbjct: 228 VEAFAPNGFGLYNTCGNVWEWVRDLYAPRRPEGPFPLRDPQGPENGTGRVQRGGSYLCHE 287

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC R+   +R++N PDSS GN GFR AA+
Sbjct: 288 SYCDRYHVHSRTRNDPDSSTGNTGFRVAAE 317


>gi|126307458|ref|XP_001366051.1| PREDICTED: sulfatase-modifying factor 2 [Monodelphis domestica]
          Length = 369

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 136/338 (40%), Positives = 182/338 (53%), Gaps = 70/338 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG TF+MGTN P   KDGE P R VT+  F +D++ V+N  F+EFV    Y TEAE
Sbjct: 96  MVQLPGGTFQMGTNSPD-GKDGEGPIREVTVKPFAIDKYPVTNKDFREFVRQKKYQTEAE 154

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E + K++Q                               +    W 
Sbjct: 155 SFGWSFVFEDFVSDELKKKVTQ------------------------------KLESAPWW 184

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R++ PV+HVSWNDA A+C W
Sbjct: 185 ---LPVEKAF-----------------WRQPAGPGSGIRDRLDFPVLHVSWNDAQAFCAW 224

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ R++PWGN   P   +R N+WQG+FP  +TA DG+   +P
Sbjct: 225 RGKRLPTEEEWEFAARGGLKGRVYPWGNQFQP---NRTNLWQGKFPQADTAEDGFHGASP 281

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNE 312
           V ++  +N +GLY+++GNVWEWTA           +Y P  P  G D +V +G S++   
Sbjct: 282 VDAFPPQNNYGLYDLMGNVWEWTAS----------TYGP--PVPGQDRRVLRGASWIDTA 329

Query: 313 QYC--YRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           +    +R R   R  NTPDS++ NLGFRCAA  G  +G
Sbjct: 330 EGSANHRARVTTRMGNTPDSASDNLGFRCAASPGRPSG 367


>gi|443320196|ref|ZP_21049313.1| hypothetical protein GLO73106DRAFT_00035460 [Gloeocapsa sp. PCC
           73106]
 gi|442790078|gb|ELR99694.1| hypothetical protein GLO73106DRAFT_00035460 [Gloeocapsa sp. PCC
           73106]
          Length = 335

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 140/362 (38%), Positives = 193/362 (53%), Gaps = 64/362 (17%)

Query: 3   LLPAPPVERYKD------MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVS 56
           L P  P+  + D      MV +P  TFR+G +K   ++  E  + NVT+D F +D++EV+
Sbjct: 18  LDPMQPLVAFSDNFCSEEMVFIPRGTFRLGDDKSNFLE--EKVAENVTVDNFCIDRYEVT 75

Query: 57  NTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
           N+ F  FV ATGY+T AE+      F  LL +E+R+  S V              E    
Sbjct: 76  NSDFATFVKATGYITVAERPLSKEQFSNLL-DEQRSPGSLV---------FQPPEEEATQ 125

Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
            P  ++SW        W          W  G             +WLHP G DS+IE + 
Sbjct: 126 VP--YLSW--------WH---------WVLGA------------NWLHPYGPDSSIEGKD 154

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           N+PVVH+++ DAVAY  W G  LPTEA+WEY  RGGL+N++F WG+  + +   +AN WQ
Sbjct: 155 NYPVVHMAYEDAVAYAQWAGKSLPTEAQWEYAARGGLKNKIFSWGDEYSAQ---KANTWQ 211

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGP 295
           G FP  NT  DGY+ T+PV S+  N +G+Y++ GNVWEWT+DW+++ H   A   NP GP
Sbjct: 212 GVFPFFNTKEDGYVGTSPVGSFPANNYGIYDITGNVWEWTSDWFSLGHDSQANKVNPTGP 271

Query: 296 TT-----------GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           T            G   V KGGSYLC   YC R+R AAR    PD+   ++GFR   +  
Sbjct: 272 TKAESFDPKKLGEGALHVVKGGSYLCARNYCSRYRPAARESQAPDTGTTHIGFRLVKNLA 331

Query: 345 PT 346
           P+
Sbjct: 332 PS 333


>gi|149276174|ref|ZP_01882318.1| hypothetical protein PBAL39_00602 [Pedobacter sp. BAL39]
 gi|149232694|gb|EDM38069.1| hypothetical protein PBAL39_00602 [Pedobacter sp. BAL39]
          Length = 379

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 140/342 (40%), Positives = 182/342 (53%), Gaps = 45/342 (13%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+MV + G TF MG +     +D E+P  NVT+D F++D  EV+N QF EFV ATGYVT 
Sbjct: 72  KNMVWIDGGTFSMGADNDQAAQD-EYPKHNVTVDGFWIDVTEVTNKQFAEFVKATGYVTT 130

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D         EE + ++       K  E L          P   +S ND   Y  
Sbjct: 131 AERKPDW--------EELKKQVPP--GTPKPDESLLVPASLVFKAPGNEVSLND---YTQ 177

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W         EW  G             +W  P G DS    + ++PVVHVSW DA AYC
Sbjct: 178 WW--------EWTKG------------ANWRKPHGADSEWSAKDDYPVVHVSWEDAQAYC 217

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWE   RG L+N ++PWGN     G+ +AN WQG FP  NT  D Y   
Sbjct: 218 KWAGTRLPTEAEWELAARGRLKNNIYPWGNESVNIGKPKANSWQGTFPVKNTLEDKYYLA 277

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD--------K 301
           APV S+  N +GLY+M GNVWEW AD ++ +++    +   NPKGP +  D         
Sbjct: 278 APVKSFPPNGYGLYDMAGNVWEWCADLYHSNYYQEVKSGVKNPKGPGSSFDPDEPHTEKH 337

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           V +GGS+LCN+ YC  +R A R +++ DSS  ++GFRC  DK
Sbjct: 338 VLRGGSFLCNDSYCSGYRVARRMKSSGDSSMEHVGFRCVRDK 379


>gi|407644341|ref|YP_006808100.1| hypothetical protein O3I_015825 [Nocardia brasiliensis ATCC 700358]
 gi|407307225|gb|AFU01126.1| hypothetical protein O3I_015825 [Nocardia brasiliensis ATCC 700358]
          Length = 294

 Score =  229 bits (585), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 137/328 (41%), Positives = 171/328 (52%), Gaps = 58/328 (17%)

Query: 18  LPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
           +PG  F MG  ++     DGE P   V L  F +D   V+N  F  FV  TGYV++AE+F
Sbjct: 13  VPGGIFEMGDAHRDGFHADGETPVHTVALSPFTIDTTTVTNAAFATFVEETGYVSDAERF 72

Query: 77  GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
           G +                                      PV H+              
Sbjct: 73  GCS--------------------------------------PVFHLVLAAEPGDLLGSDP 94

Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
           R P    W    RG           W HPEG  S+I  R  HPVV VSW DAVAYC W G
Sbjct: 95  RAP----WWIDVRGA---------DWRHPEGRHSSIADRAEHPVVQVSWFDAVAYCEWAG 141

Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
            RLPTEAEWE+  RGGL    +PWG+      GE RAN+WQG FPT+NT  DGYL+TAPV
Sbjct: 142 RRLPTEAEWEFAARGGLPGARYPWGDEAPGAGGEWRANIWQGTFPTDNTQEDGYLTTAPV 201

Query: 256 MSYKENKFGLYNMVGNVWEWTADWW--NVHHHPAPS---YNPKGPTTGTDKVKKGGSYLC 310
            SY+ N +GL+ M GNVWEW +D +  N + H A +    +P+G + G  +V +GGSYLC
Sbjct: 202 RSYRPNGYGLWQMAGNVWEWCSDRFSPNTYGHDARTGHVRDPQGASAGQSRVLRGGSYLC 261

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           ++ YC R+R AARS NTPD++ GN GFR
Sbjct: 262 HDSYCSRYRNAARSSNTPDATMGNAGFR 289


>gi|84501935|ref|ZP_01000093.1| hypothetical protein OB2597_17647 [Oceanicola batsensis HTCC2597]
 gi|84389930|gb|EAQ02564.1| hypothetical protein OB2597_17647 [Oceanicola batsensis HTCC2597]
          Length = 295

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 127/328 (38%), Positives = 162/328 (49%), Gaps = 54/328 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           ++ + G  F MG  +    +D + P R V +  F +     SN  F  FV  TGY T AE
Sbjct: 16  LLSIKGGFFDMGARQSRFAQDYDSPRRKVFVSPFRISATACSNADFAHFVEETGYRTVAE 75

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
             G ++VF  L                                                 
Sbjct: 76  VEGWSYVFHML------------------------------------------------- 86

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP  A W     G    R      W  PEG DS +  R +HPVVHV+W DA+AYCTW
Sbjct: 87  ---LPDLAAWPISPPGLEWWRKVDLACWSAPEGPDSDLSERADHPVVHVAWYDALAYCTW 143

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEA+WE+  RGGL    FPWGN   P G H  N WQG FP  N+A DG++ TAP
Sbjct: 144 AGLILPTEAQWEFAARGGLSRARFPWGNATMPGGHHAMNTWQGAFPATNSAEDGFVGTAP 203

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH--HHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           V +Y  N FGL+N  GNVWEW +D++  H    P P  +PKGP  G  ++++GGSYLC++
Sbjct: 204 VTAYAPNGFGLFNTCGNVWEWASDFYAPHPARGPFPLRDPKGPADGYARIQRGGSYLCHD 263

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            YC R+   +R++N PDSS GN GFR A
Sbjct: 264 SYCDRYHVHSRTRNDPDSSTGNAGFRVA 291


>gi|383765370|ref|YP_005444351.1| putative sulfatase-modifying factor [Phycisphaera mikurensis NBRC
           102666]
 gi|381385638|dbj|BAM02454.1| putative sulfatase-modifying factor [Phycisphaera mikurensis NBRC
           102666]
          Length = 349

 Score =  229 bits (583), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 145/355 (40%), Positives = 180/355 (50%), Gaps = 75/355 (21%)

Query: 15  MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ LPG  F MGT++P     DGE P R VTL  F LD   V+   F  FV+ TG+VTEA
Sbjct: 42  MITLPGGPFLMGTDEPDGFPADGEGPVREVTLSPFRLDATAVTVAAFARFVAETGFVTEA 101

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF+  LS     +++  R                    V   +W  AV     
Sbjct: 102 ERFGWSFVFQGHLSPRFVRRVTARRQ-------------------VPGAAWWVAV----- 137

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
            GA                         W HPEG  S + HR NHP   +SW+DA AY  
Sbjct: 138 DGA------------------------DWAHPEGPRSGVRHRQNHPATQISWHDAAAYAA 173

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+  RGG   + +PWG  L PRG+HR NV+QG FP  +TAADG+    
Sbjct: 174 WAGKRLPTEAEWEFAARGGTSQQRYPWGPTLEPRGKHRCNVFQGRFPEEDTAADGFAGPC 233

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADW----WNVHHHPAPSYNPKGPT----------TGT 299
           PV +Y  N +G +N++GNVWEW ADW    W+     A   +P GP            GT
Sbjct: 234 PVDAYAPNAYGFHNLLGNVWEWCADWFCPVWHAEARDATRTDPIGPVPHDDLDRGGRPGT 293

Query: 300 D------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           D            +V+KGGSYLC+  YC R+R  AR+ NTPDS   N GFRCA D
Sbjct: 294 DGRAGVVHGGFTHRVQKGGSYLCHASYCNRYRLGARTGNTPDSGTTNSGFRCAMD 348


>gi|428777451|ref|YP_007169238.1| hypothetical protein PCC7418_2894 [Halothece sp. PCC 7418]
 gi|428691730|gb|AFZ45024.1| protein of unknown function DUF323 [Halothece sp. PCC 7418]
          Length = 328

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 137/339 (40%), Positives = 185/339 (54%), Gaps = 58/339 (17%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +DMV + G +F+MG N+    +  E  + +V++ +F +D+HEV+N +F +FV ATGYVT 
Sbjct: 31  EDMVFISGGSFQMGDNQSRFWE--EQAAEDVSVSSFCIDRHEVTNAEFAKFVEATGYVTV 88

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+      F P LSEEERA  S V      FE      +               VAY +
Sbjct: 89  AERPLSQENF-PDLSEEERAPGSLV------FEPPPENSQQ--------------VAYLS 127

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W +           P  +W HP G DS I+ + N+PVVH+++ DA AY 
Sbjct: 128 W----------WHWT----------PGANWRHPFGPDSDIQGKDNYPVVHIAYEDAQAYT 167

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEA+WE+  RGGL+N+ F WG+  + +   +AN WQG FP  NT  D YL T
Sbjct: 168 EWAGKTLPTEAQWEFAARGGLKNKKFTWGDEYSAQ---KANTWQGIFPFFNTKEDHYLGT 224

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH------------HPAPSYNPKGPTTGTD 300
           APV S+  N +GLY+M GNVWEWT+DW+ V H                S++PK PT G  
Sbjct: 225 APVESFPANGYGLYDMTGNVWEWTSDWYGVSHADKAHRLNPTGASEEESFDPKKPTDGAM 284

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            V KGGS+LC + YC R+R AAR    PD+   ++GFR 
Sbjct: 285 HVIKGGSHLCAKNYCSRYRPAARESQAPDTGTTHIGFRS 323


>gi|195095886|ref|XP_001997853.1| GH23459 [Drosophila grimshawi]
 gi|193906138|gb|EDW05005.1| GH23459 [Drosophila grimshawi]
          Length = 201

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 108/183 (59%), Positives = 131/183 (71%), Gaps = 12/183 (6%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           SW  P GI+S ++    HPVVHVSW DAVAYCTW G RLP+E+EWE  CRGG + +L+PW
Sbjct: 29  SWRKPNGINSNLQGLEQHPVVHVSWRDAVAYCTWAGKRLPSESEWEVACRGGKQRKLYPW 88

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN L P+ +H  N+WQGEFP  +T  DGY  T PV  +++N + LYNMVGNVWEWTAD W
Sbjct: 89  GNKLMPKDQHWLNIWQGEFPDGDTQQDGYQFTCPVNEFRQNNYDLYNMVGNVWEWTADLW 148

Query: 281 NVHHHPAPSYNPKGPTTGT-DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                        G T+ +  +VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRC
Sbjct: 149 QA-----------GDTSESPARVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRC 197

Query: 340 AAD 342
           A +
Sbjct: 198 AKN 200



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 34/43 (79%), Positives = 39/43 (90%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 159 RVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 201


>gi|311746857|ref|ZP_07720642.1| sulfatase-modifying factor 1 [Algoriphagus sp. PR1]
 gi|311302580|gb|EFQ79230.1| sulfatase-modifying factor 1 [Algoriphagus sp. PR1]
          Length = 363

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 133/361 (36%), Positives = 180/361 (49%), Gaps = 49/361 (13%)

Query: 1   MVLLPAP-PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
           MV+   P P ++ + M  +PG  F MGTN+       E P+ ++ ++ F++D HEV+N +
Sbjct: 34  MVVSRTPVPTQKIEGMSWIPGGEFVMGTNE-AEAYPAEKPAVDLKVEGFWMDTHEVTNDE 92

Query: 60  FQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV 119
           F  FV AT YVT AE+            E E  K        K  E +          P 
Sbjct: 93  FAAFVEATDYVTVAER----------KPEWEELKKQLPPGTPKPDESVLVPGSMVFSPPP 142

Query: 120 VHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHP 179
             +   D   +  W                           +W HPEG DS IE R NHP
Sbjct: 143 YKVPLQDISLWWVWVNG-----------------------ANWKHPEGPDSNIEDRGNHP 179

Query: 180 VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEF 239
           VVHV++ DA AY  W G RLPTE EWE+  RGG++ + F WG+ LTP G++ AN +QG F
Sbjct: 180 VVHVAYEDAQAYAEWAGKRLPTEIEWEFAARGGVDGKRFAWGDELTPNGQYLANTFQGTF 239

Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA------------ 287
           P  N   DG++ ++PV +Y  N FGLY+M+GNVWE T+DW++   +              
Sbjct: 240 PNENLGNDGFVGSSPVKAYAPNNFGLYDMIGNVWELTSDWYDALKYARLAGQAPKLDKDM 299

Query: 288 -PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
            P YNP  P    ++V KGGS+LC E YC  +R +AR     DS   N+GFRC  D  P 
Sbjct: 300 NPCYNPDNP-FAMERVIKGGSFLCAENYCVNYRPSARQGQAFDSGTSNVGFRCVKDPEPK 358

Query: 347 T 347
           T
Sbjct: 359 T 359


>gi|386395503|ref|ZP_10080281.1| hypothetical protein Bra1253DRAFT_00938 [Bradyrhizobium sp.
           WSM1253]
 gi|385736129|gb|EIG56325.1| hypothetical protein Bra1253DRAFT_00938 [Bradyrhizobium sp.
           WSM1253]
          Length = 337

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 137/346 (39%), Positives = 180/346 (52%), Gaps = 62/346 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV +PG TFRMG++        E PS  V +D F++D+  V+N QF++FV+ATG+VTEA
Sbjct: 20  EMVFIPGGTFRMGSDHHY---PEEAPSHRVAVDGFWIDRTPVTNRQFKQFVNATGHVTEA 76

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +   D   +   L+E   A  S V   + R   L                W         
Sbjct: 77  QVVPDPKDYPGALAEMLYAG-SLVFSPLPRITDL--------------TDW--------- 112

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                   ++W    RG          +W HP G  S I+   +HPVVHVS++DA AY  
Sbjct: 113 --------SQWWTFLRGA---------NWRHPYGPGSNIKDLDDHPVVHVSFSDAAAYAH 155

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGLE   + WG+ L P G+H AN+WQG FP  N   DGY  T+
Sbjct: 156 WAGKDLPTEAEWEFAARGGLEAEEYAWGDALVPGGKHMANIWQGNFPVQNLGEDGYERTS 215

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTD---------- 300
           PVM++  N +GLY+M+GNVWEWT+DWW+  H      P   PK P  G +          
Sbjct: 216 PVMAFPPNGYGLYDMIGNVWEWTSDWWSSKHAADAAKPCCIPKNPRGGGEHTSYDPRLPD 275

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                KV KGGS+LC   YC R+R AAR     D+S G++GFRC  
Sbjct: 276 IQIPRKVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTGHVGFRCVV 321


>gi|255530292|ref|YP_003090664.1| Non-specific serine/threonine protein kinase [Pedobacter heparinus
           DSM 2366]
 gi|255343276|gb|ACU02602.1| Non-specific serine/threonine protein kinase [Pedobacter heparinus
           DSM 2366]
          Length = 367

 Score =  228 bits (581), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 129/339 (38%), Positives = 176/339 (51%), Gaps = 45/339 (13%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +P  +F MG        D E+P+  V +D F++D+ EV+N QF  FV ATGYVT+AE
Sbjct: 63  MKFIPAGSFAMGAADNEGRPD-EYPAHEVVMDGFWIDETEVTNAQFARFVKATGYVTQAE 121

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D           E  K        K  + L          P   ++ ND   +  WR
Sbjct: 122 RKPDW----------EELKKQLPPGTPKPADELLQPASLTFSKPTKRVNLNDVSQW--WR 169

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W  G             SW HP+G  STI  + N+PV  VSW DA AY  W
Sbjct: 170 ---------WTVGA------------SWKHPQGPKSTIVGKDNYPVTQVSWIDASAYAKW 208

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY  RGGL+ + +PWG+    +G+ +AN WQGEFP  +   DG+ STAP
Sbjct: 209 AGKRLPTEAEWEYAARGGLQRKKYPWGDESLDKGKVKANTWQGEFPYTDLKTDGFSSTAP 268

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD--------KVK 303
           V S+  N +GLY+M GNVWEWTADW+   ++        NPKGP    D        K+ 
Sbjct: 269 VKSFAANGYGLYDMSGNVWEWTADWYTADYYRGLKGKVNNPKGPAESYDADEPTIPKKII 328

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +GGS++C+  YC  +R +++ +++ D+   N GFRC A+
Sbjct: 329 RGGSFMCHSSYCKGYRVSSKMKSSMDTGLENTGFRCVAN 367


>gi|424876935|ref|ZP_18300594.1| hypothetical protein Rleg5DRAFT_1327 [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393164538|gb|EJC64591.1| hypothetical protein Rleg5DRAFT_1327 [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 317

 Score =  227 bits (579), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 180/337 (53%), Gaps = 55/337 (16%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R  D+V + G    +GT++P+L+ DGE P R VT+  ++L+   V+  +F EFV+ATGYV
Sbjct: 28  RAGDVVAVKGGETFLGTHEPVLVVDGEGPERKVTVSDYFLEAETVTVARFAEFVAATGYV 87

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           ++AEKFG   VF  L+ +++                                   + VAY
Sbjct: 88  SDAEKFGCAAVFSGLMEDKKLVA--------------------------------ENVAY 115

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W  AR+                       W  PEG  S++E+R++HPV  VSWNDAVA
Sbjct: 116 TPW-WARVDGAV-------------------WNMPEGPGSSVENRLDHPVTQVSWNDAVA 155

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           + TW G RLPTEAEWE+  RGG   + +PWG       E   N+WQG FP  NT ADG+ 
Sbjct: 156 FATWVGGRLPTEAEWEHAARGGNRRQRYPWGEAEPTDDEICCNIWQGRFPHQNTLADGFF 215

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVH--HHPAPSYNPKGPTTGTDKVKKGGSY 308
            TAPV S++  + G Y++ GNVWEW +D + V      A   N +      +KV KGGS+
Sbjct: 216 GTAPVRSFEPTEDGFYHLSGNVWEWASDPFRVRSLSRQAKIRNEQSKKNA-EKVLKGGSF 274

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           LC++ YCYR+R AAR   +PDS++ N GFR A D  P
Sbjct: 275 LCHKSYCYRYRIAARIGLSPDSASSNTGFRVAYDTWP 311


>gi|302869538|ref|YP_003838175.1| hypothetical protein Micau_5091 [Micromonospora aurantiaca ATCC
           27029]
 gi|302572397|gb|ADL48599.1| protein of unknown function DUF323 [Micromonospora aurantiaca ATCC
           27029]
          Length = 320

 Score =  227 bits (579), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 132/334 (39%), Positives = 172/334 (51%), Gaps = 59/334 (17%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           P     D++ LPG TF MGT+ P    +D E P R V +  F +    V+N QF  FV A
Sbjct: 44  PFRGTDDLIALPGGTFLMGTDDPEGFPEDLEGPVRPVAVAPFAIAATPVTNRQFARFVDA 103

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TGY T+AE++G +FVF  LL+      +            LD+++               
Sbjct: 104 TGYRTDAERYGWSFVFHLLLAPSAHRYV------------LDASVP-------------- 137

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
                         +A W     G           W  P+G  S +  R  HPVVHVS  
Sbjct: 138 --------------QAPWWLAVEGA---------HWAAPDGPGSDVRGRERHPVVHVSHR 174

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA AYC W G RL TEAEWEY  RGGL    + WG+ LTP G H  N+WQG FP  NTA 
Sbjct: 175 DAEAYCLWSGTRLATEAEWEYAARGGLSQARYAWGDELTPDGRHLCNIWQGAFPAVNTAE 234

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
           DG+  T+PV ++  N +GLY++ GNVWE TA+ W          + +G  +  + V +GG
Sbjct: 235 DGFAGTSPVTAFPPNGYGLYDVAGNVWELTAEPWR---------DRRGRISPGESVIRGG 285

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           SYLC++ YC R+R AARS+ T DSS+GN GFR A
Sbjct: 286 SYLCHDSYCNRYRVAARSRTTVDSSSGNTGFRVA 319



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/48 (54%), Positives = 35/48 (72%)

Query: 343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           +G  +  + V +GGSYLC++ YC R+R AARS+ T DSS+GN GFR A
Sbjct: 272 RGRISPGESVIRGGSYLCHDSYCNRYRVAARSRTTVDSSSGNTGFRVA 319


>gi|218438193|ref|YP_002376522.1| hypothetical protein PCC7424_1204 [Cyanothece sp. PCC 7424]
 gi|218170921|gb|ACK69654.1| protein of unknown function DUF323 [Cyanothece sp. PCC 7424]
          Length = 330

 Score =  227 bits (579), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 133/344 (38%), Positives = 186/344 (54%), Gaps = 59/344 (17%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+MVL+ G TFR+G ++   ++  E  + NVT+D F +D++EV+N  F EFV +T Y+T 
Sbjct: 34  KNMVLISGGTFRIGDDRSNFLE--EKAAENVTVDNFCIDRYEVTNANFAEFVKSTDYITV 91

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+      F P L +E+R+  S V      F+  +   +      V ++SW   V    
Sbjct: 92  AERPLSKKQF-PDLPDEQRSPGSLV------FQPPEEVTQ------VPYLSWWHWVVGAN 138

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR                             HP G DS+IE + N+PVVH+++ DAVAY 
Sbjct: 139 WR-----------------------------HPYGPDSSIEGKNNYPVVHIAYEDAVAYA 169

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEA+WEY  RGGL+N++F WG+  +     +AN WQG FP  NT  DGYL T
Sbjct: 170 QWVGKSLPTEAQWEYAARGGLKNKIFSWGDEYS---AQKANTWQGIFPFFNTKEDGYLGT 226

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------SYNPKGPTTGTD 300
           +PV S+  N +G+Y++ GNVWEWT+DW+ + H                S++PK P  G  
Sbjct: 227 SPVGSFPPNHYGIYDITGNVWEWTSDWFRLGHDSQANKVNPTGPSKEESFDPKKPGEGAL 286

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
            V KGGSYLC   YC R+R AAR    PD+   ++GFR   + G
Sbjct: 287 HVVKGGSYLCAWNYCSRYRPAARESQAPDTGTTHIGFRLVKNLG 330


>gi|257056606|ref|YP_003134438.1| hypothetical protein Svir_26230 [Saccharomonospora viridis DSM
           43017]
 gi|256586478|gb|ACU97611.1| uncharacterized conserved protein [Saccharomonospora viridis DSM
           43017]
          Length = 312

 Score =  227 bits (579), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 141/335 (42%), Positives = 170/335 (50%), Gaps = 54/335 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILI-KDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           + MVLLPG  F MG+  P+   +DGE P R V      LD   +S T     VS   +  
Sbjct: 30  EGMVLLPGGEFSMGSEDPLAYPEDGEGPVRTVR-----LDPFWISRTA----VSNRDFAE 80

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
             +  G         +E ER   S V      F GL                        
Sbjct: 81  FVDATG-------YRTEAERWGWSFV------FAGL------------------------ 103

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                 LP +     G       R      W HPEG  ST++ R +HPVVHVSW+DA  Y
Sbjct: 104 ------LPDDFPPTRGVATAPWWRQVEGADWRHPEGPHSTVDGREDHPVVHVSWSDAQEY 157

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           C W G RLPTEAEWEY  RGGL    FPWG  L P GEHR NVWQG FP +N+ ADG+  
Sbjct: 158 CAWAGLRLPTEAEWEYAARGGLSGNPFPWGAELEPGGEHRMNVWQGVFPRHNSRADGWYG 217

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-SYNPKGPTTGTDKVKKGGSYLC 310
           T PV  +  N FGL+NM GNVWEW ADW++   H      NP+GP  GT +  +GGSYLC
Sbjct: 218 TCPVTEFPPNGFGLHNMTGNVWEWCADWFHPTFHTLDRRTNPRGPEQGTHRSARGGSYLC 277

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +E YC R+R +AR   TPD++ GN+GFRC  D  P
Sbjct: 278 HESYCRRYRVSARQGLTPDTTTGNVGFRCVRDAAP 312


>gi|126663718|ref|ZP_01734714.1| hypothetical protein FBBAL38_11629 [Flavobacteria bacterium BAL38]
 gi|126624301|gb|EAZ94993.1| hypothetical protein FBBAL38_11629 [Flavobacteria bacterium BAL38]
          Length = 381

 Score =  227 bits (578), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 134/342 (39%), Positives = 185/342 (54%), Gaps = 49/342 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG +     +D EFP   V L+ F++D  EV+N QF +FV+ATGYVT AE
Sbjct: 75  MVWIPGGIFSMGGDNDQARQD-EFPKHQVKLNGFFMDITEVTNEQFAKFVAATGYVTTAE 133

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV-HISWNDAVAYCTW 133
           K  D          +E  K  Q+  D  + +         +  P    +S  D   Y  W
Sbjct: 134 KDIDW---------DELKK--QLPSDTPKPDAETLKAASLVFVPTSGEVSLQD---YSQW 179

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W +G             +W HP+G DSTI+ + N+PVVH+SW+DA AYC 
Sbjct: 180 WN--------WSHG------------ANWKHPQGKDSTIDGKDNYPVVHISWDDANAYCK 219

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+  RGGL   ++ WGN      +   N +QG FP  N  +DG++ +A
Sbjct: 220 WAGKRLPTEAEWEFAARGGLSKNVYTWGNEKVDEEKFHCNYFQGNFPYKNEVSDGFIGSA 279

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYNPKGPTTGTD-------- 300
           PV S+  N +GLY+M GNVWEW AD +N  ++ +      + NP GPT   D        
Sbjct: 280 PVKSFPANDYGLYDMSGNVWEWCADKYNNLYYDSFKKVKVTINPIGPTKSYDPDEPLVEK 339

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +V +GGS+LCNE YC  +R +AR +++ DSS  +LGFRC ++
Sbjct: 340 RVMRGGSFLCNESYCSGYRVSARMKSSADSSMEHLGFRCVSN 381


>gi|119960512|ref|YP_947226.1| hypothetical protein AAur_1451 [Arthrobacter aurescens TC1]
 gi|119947371|gb|ABM06282.1| Domain of unknown function (DUF323) protein [Arthrobacter aurescens
           TC1]
          Length = 277

 Score =  227 bits (578), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 135/331 (40%), Positives = 171/331 (51%), Gaps = 65/331 (19%)

Query: 20  GDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDT 79
           GD F  G +      DGE P   V++ AF +    V+N +F EFV ATGY TE+E +G +
Sbjct: 2   GDPFNEGYDG-----DGESPVHEVSVSAFRIGATTVTNQEFAEFVDATGYRTESETYGTS 56

Query: 80  FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP 139
            VF   +       +++V                           N+   +   RGA   
Sbjct: 57  AVFHLAVKANTHDILNRV---------------------------NNVPWWLNIRGA--- 86

Query: 140 TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARL 199
                                 W HP G  S      +HPV HVS NDA+AYC W G RL
Sbjct: 87  ---------------------DWAHPAGPLSNWADIPDHPVTHVSHNDALAYCQWAGRRL 125

Query: 200 PTEAEWEYGCRGGLENRLFPWGNNL--TPRGE--HRANVWQGEFPTNNTAADGYLSTAPV 255
           PTEAEWEY  RGGL  + +PWGN L     GE  H  N+WQGEFP+ NT  DGYL+TAP 
Sbjct: 126 PTEAEWEYAARGGLSGQRYPWGNELHNDAPGETAHNCNIWQGEFPSRNTIDDGYLTTAPA 185

Query: 256 MSYKENKFGLYNMVGNVWEWTADW-----WNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            S++ N +GLY   GNVWEW +DW     +   H      +P+GPT G  +V +GGSYLC
Sbjct: 186 TSFRPNGYGLYQTSGNVWEWCSDWFLPKYYKTCHAQGTVEDPQGPTIGRGRVMRGGSYLC 245

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           ++ YC R+R AARS NTPDS++GNLGFR  A
Sbjct: 246 HDSYCNRYRLAARSSNTPDSASGNLGFRTVA 276


>gi|430743633|ref|YP_007202762.1| hypothetical protein Sinac_2774 [Singulisphaera acidiphila DSM
           18658]
 gi|430015353|gb|AGA27067.1| hypothetical protein Sinac_2774 [Singulisphaera acidiphila DSM
           18658]
          Length = 344

 Score =  227 bits (578), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 134/342 (39%), Positives = 183/342 (53%), Gaps = 51/342 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           ++MV + G  F MGT+  +   D E P+  V +D F++D+HEV+N +F  FV+AT YVT 
Sbjct: 33  REMVWIAGGEFTMGTDSELGWPD-ERPAHRVRVDGFWIDEHEVTNDEFAAFVAATRYVTT 91

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEK       E +L++      S     +     +    E  +          D   +  
Sbjct: 92  AEKAPKA---EEILAQSSPGTPSPPPEALVPGSLVFQPTEEAV----------DLRDFAR 138

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W          R  P  SW HPEG  S ++ R  HPVV VSW+DA AY 
Sbjct: 139 W----------W----------RWTPGASWRHPEGPGSDLQGRGAHPVVQVSWDDANAYA 178

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G +LPTEAEWE+  RGGL+ R + WG++     + +AN+WQG FP  NTAADGY  T
Sbjct: 179 QWAGKQLPTEAEWEFAARGGLDGRAYVWGDSPFSLAQPQANIWQGAFPYRNTAADGYDRT 238

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWN-------VHHHPAPSYNPKGPTTGTD----- 300
           APV S+  N FGL++M GNVWEW ADW+          HH  P+ NP GP   +D     
Sbjct: 239 APVKSFPPNGFGLFDMGGNVWEWCADWYQRDLYRQRAGHH--PTLNPTGPEKSSDPARPF 296

Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
              +V++GGS+LCN+ YC R+R +AR   +PD+   ++GFRC
Sbjct: 297 TPQRVQRGGSFLCNDSYCSRYRPSARHGCSPDTGMSHVGFRC 338


>gi|410456927|ref|ZP_11310775.1| non-specific serine/threonine protein kinase [Bacillus bataviensis
           LMG 21833]
 gi|409927014|gb|EKN64163.1| non-specific serine/threonine protein kinase [Bacillus bataviensis
           LMG 21833]
          Length = 318

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 103/180 (57%), Positives = 129/180 (71%), Gaps = 1/180 (0%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W  PEG  STIE R++HPVVHVSWNDA  YC W G RLPTEAEWEY  RGGL  + +PWG
Sbjct: 138 WNQPEGPGSTIESRLDHPVVHVSWNDANEYCNWAGKRLPTEAEWEYAARGGLVQKRYPWG 197

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
           + L P G+H  N+WQG+FP  N   DGY+ TAP  S+  N +GL+NM GNVWEW +DW++
Sbjct: 198 DRLHPDGKHVCNIWQGKFPHTNHGRDGYIGTAPSKSFPPNHYGLFNMSGNVWEWCSDWFS 257

Query: 282 VHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
             +H      NP+GP  G DK+ KGGSYLC++ YC R+R +ARS  +PDSS+G++GFRC 
Sbjct: 258 ASYHIEGTRNNPQGPPNGMDKIIKGGSYLCHKSYCNRYRVSARSAVSPDSSSGHMGFRCV 317


>gi|426403429|ref|YP_007022400.1| gliding motility-associated lipoprotein GldK [Bdellovibrio
           bacteriovorus str. Tiberius]
 gi|425860097|gb|AFY01133.1| putative gliding motility-associated lipoprotein GldK [Bdellovibrio
           bacteriovorus str. Tiberius]
          Length = 327

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 132/351 (37%), Positives = 176/351 (50%), Gaps = 61/351 (17%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + DM+ +PG  F MG++        E P+  V +D F++D+H V+N  F  FV AT
Sbjct: 16  PSAPFPDMIWIPGGMFTMGSDNHYA---EEAPAHRVKVDGFWMDRHPVTNRDFLAFVQAT 72

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           GY T  EK  D  ++ P  SE  +   S V      F    S ++ R             
Sbjct: 73  GYRTFCEKEPDLNLY-PGASESMKDPASVV------FVKPASPVDLRN------------ 113

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             Y  W+                       P  +W  P+G  S+IE RM+HPVVHVSW D
Sbjct: 114 -PYLWWQ---------------------FIPGANWRSPQGPGSSIESRMDHPVVHVSWQD 151

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
             AY  W G +LPTEAEWE+  RGG E R F WG  L P G H+AN+WQGEFP  N + D
Sbjct: 152 VQAYAKWIGKKLPTEAEWEFAARGGHEGREFAWGEELEPEGRHQANIWQGEFPWQNHSGD 211

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSY 290
           G+  T+PV ++  N +GL++M+GNVWEWT DW++  H                   A S+
Sbjct: 212 GFEGTSPVTAFPPNDYGLFDMIGNVWEWTVDWYSPRHPDEKLKACCIPKNPRGGAEADSF 271

Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           +P+       KV KGGS+LC   YC R+R +AR     D+S  +LGFRC  
Sbjct: 272 DPRQALKIPRKVMKGGSHLCAPNYCKRYRPSARMAQPVDTSTSHLGFRCVV 322


>gi|186682703|ref|YP_001865899.1| hypothetical protein Npun_F2391 [Nostoc punctiforme PCC 73102]
 gi|186465155|gb|ACC80956.1| protein of unknown function DUF323 [Nostoc punctiforme PCC 73102]
          Length = 332

 Score =  226 bits (576), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 134/345 (38%), Positives = 184/345 (53%), Gaps = 68/345 (19%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MVL+PG  F MG +    ++  E  ++NVT+ +F +D++EV+N QF EFV AT YVT A
Sbjct: 38  EMVLIPGGIFNMGADNSGFVE--ERTAKNVTVTSFCIDKYEVTNAQFAEFVKATKYVTVA 95

Query: 74  ------EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
                 E+F D       L +E+R   S V H  K  +G+                    
Sbjct: 96  QCPLPKEQFPD-------LPDEQRLPGSLVFHQPK--QGVKQ------------------ 128

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
           V + +W          W +           P  +W HP G DST+  + N+PVVH+++ D
Sbjct: 129 VQFLSW----------WHWT----------PGANWQHPFGSDSTVVGKENYPVVHIAYED 168

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A+AY  W G  LPTEA+WEY  RGGL+   + WGN  +   E +AN WQG FP  NT AD
Sbjct: 169 ALAYGIWAGKSLPTEAQWEYAARGGLDGATYTWGNQYS---ETKANTWQGIFPFFNTKAD 225

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----------PAPSYNPKGPTT 297
           GY+ TA V S+  N +GLY+M GNVWEWT+DW+++ H            + S++PK PT 
Sbjct: 226 GYIGTAKVGSFSANGYGLYDMTGNVWEWTSDWFSLGHENKAHSVNPTGASKSFDPKKPTE 285

Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
               V KGGSYLC   YC R+R AAR    PD+   ++GFR   +
Sbjct: 286 IALHVIKGGSYLCAPNYCSRYRPAARESQAPDTGTSHIGFRLVKN 330


>gi|374575294|ref|ZP_09648390.1| hypothetical protein Bra471DRAFT_03916 [Bradyrhizobium sp. WSM471]
 gi|374423615|gb|EHR03148.1| hypothetical protein Bra471DRAFT_03916 [Bradyrhizobium sp. WSM471]
          Length = 335

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 136/346 (39%), Positives = 178/346 (51%), Gaps = 62/346 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV +PG TFRMG++        E PS  V +D F++D+  V+N QF++FV ATG+VTEA
Sbjct: 20  EMVFIPGGTFRMGSDHHY---PEEAPSHRVAVDGFWMDRTPVTNRQFKQFVDATGHVTEA 76

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +   D   +   L+E   A                      +  P+ HI+  D   +  W
Sbjct: 77  QVVPDPKDYPGALAEMLYAG-------------------SLVFSPLPHIT--DLTDWSQW 115

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W +  RG          +W HP G  S I+   +HPVVHVS++DA AY  
Sbjct: 116 ----------WTF-LRGA---------NWRHPYGPGSNIKDLDDHPVVHVSFSDAAAYAR 155

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE   RGGLE   + WG+ L P G+H AN+WQG FP  N   DGY  T+
Sbjct: 156 WAGKDLPTEAEWESAARGGLEAEEYAWGDALMPGGKHMANIWQGNFPVQNLGEDGYERTS 215

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTD---------- 300
           PVM++  N +GLY+M+GNVWEWT DWW+  H      P   PK P  G +          
Sbjct: 216 PVMAFPPNGYGLYDMIGNVWEWTVDWWSSKHAADAAKPCCIPKNPRGGGEHASYDPRLPD 275

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                KV KGGS+LC   YC R+R AAR     D+S  ++GFRC  
Sbjct: 276 IQIPRKVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHVGFRCVV 321


>gi|420237896|ref|ZP_14742341.1| hypothetical protein PMI07_00078 [Rhizobium sp. CF080]
 gi|398089558|gb|EJL80073.1| hypothetical protein PMI07_00078 [Rhizobium sp. CF080]
          Length = 317

 Score =  225 bits (574), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 139/350 (39%), Positives = 181/350 (51%), Gaps = 59/350 (16%)

Query: 5   PAPPVERY---KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           PA P+ R    + + +  G+TF +GT+ P++  DGE P R V+L  F+L+   V+  +F 
Sbjct: 20  PAGPISRGVPGEAIAVKGGETF-VGTDDPVISTDGEGPERKVSLQDFHLEIETVTIRRFA 78

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           +FV ATG+VTE+E+FG + VF PLL E   A                          V+ 
Sbjct: 79  QFVEATGHVTESERFGSSAVFAPLLGEARTATGG-----------------------VIE 115

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
             W   +   TWR                              PEG  S IE R++HPV 
Sbjct: 116 TPWWTRIDDATWR-----------------------------QPEGRGSLIEGRLDHPVT 146

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            VSWNDA+A+  W G RLPTEAEWE+  RGG   R +PWG           N+WQG FP 
Sbjct: 147 QVSWNDALAFAQWVGGRLPTEAEWEHAARGGNRRRRYPWGEEEPTDEAIFCNIWQGRFPD 206

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH--HHPAPSYNPKGPTTGT 299
            NT  DGY  TAPV S+  N+ G YN+ GNVWEWTAD + V      A + N       +
Sbjct: 207 TNTLQDGYFGTAPVRSFAPNEAGFYNLAGNVWEWTADPFKVRSLSKQAKARNEIARRF-S 265

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
           DKV KGGS+LC++ YCYR+R AAR   +PDS++ N GFR A D  P + T
Sbjct: 266 DKVLKGGSFLCHKSYCYRYRIAARMGLSPDSASSNTGFRVAYDAKPGSRT 315


>gi|430745772|ref|YP_007204901.1| hypothetical protein Sinac_5052 [Singulisphaera acidiphila DSM
           18658]
 gi|430017492|gb|AGA29206.1| hypothetical protein Sinac_5052 [Singulisphaera acidiphila DSM
           18658]
          Length = 371

 Score =  225 bits (573), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 129/341 (37%), Positives = 180/341 (52%), Gaps = 51/341 (14%)

Query: 7   PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           PP    + MV +P   F MG++   +      P   V+LD F++D+ EV+N QF+ FV A
Sbjct: 66  PPGVAPEGMVWIPAGDFAMGSDDESMSD--ARPVHRVSLDGFWMDRTEVTNRQFERFVRA 123

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TGY T AE+  D   F P   +E     S V                    PV  +S  D
Sbjct: 124 TGYTTVAEQAPDPKAF-PGAPKELLVPGSLV-----------------FSPPVGRVSLED 165

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
            +A+             W Y           P  +W HP G DS+I+   +HPVV V ++
Sbjct: 166 HLAW-------------WRY----------VPGANWKHPSGPDSSIQGLEDHPVVQVCFD 202

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA+AY  W G RLPTEAEWEY  RGGL+ + + WG++L P G+ + N WQG FP  N   
Sbjct: 203 DALAYAKWAGKRLPTEAEWEYAARGGLDGKRYCWGDDLKPGGKWQVNNWQGLFPNENMKE 262

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD------ 300
           DG+  TAP  S+  N +GL +M GNVWEW ADW+  ++ P+ + NP+GP +  D      
Sbjct: 263 DGFDGTAPTGSFAVNGYGLSDMAGNVWEWCADWYQPNYDPSQARNPQGPDSSHDPAEPGI 322

Query: 301 --KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
             +V++GGS+LC++ YC R+   AR +   DS A ++GFRC
Sbjct: 323 PKRVQRGGSFLCSDLYCVRYLPGARGKGATDSGASHVGFRC 363


>gi|406834765|ref|ZP_11094359.1| hypothetical protein SpalD1_24084 [Schlesneria paludicola DSM
           18645]
          Length = 374

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 131/347 (37%), Positives = 183/347 (52%), Gaps = 47/347 (13%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           APP E  + MV +PG  F MG N      D E P+  V +D F++D  EV+N+QF  FV+
Sbjct: 58  APPKETPQGMVWIPGGEFLMGNNASTAWPD-EQPAHRVAVDHFWMDSTEVTNSQFAAFVN 116

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           AT YVT AEK       E +L+             +     + +  EH+       +  N
Sbjct: 117 ATEYVTTAEK---APTAEEILANSPPGTPQPSAQLLVAGSLVFTPTEHQ-------VPLN 166

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
           D   + TW                        P  +W HPEG  S + +R +HPVVHVSW
Sbjct: 167 DIRQWWTWT-----------------------PGANWRHPEGPGSDLRNRDHHPVVHVSW 203

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           +DA AY  W   RLPTEAEWE+  RGGL+ + + WG++     + +AN+W G FP +NTA
Sbjct: 204 DDATAYARWAEKRLPTEAEWEFAARGGLQQQPYAWGSDSFSETKPQANLWTGSFPFDNTA 263

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD 300
            DGY  TAPV ++  N +GLY+M GNVWEW +DW++   +        + NP+GP    D
Sbjct: 264 LDGYARTAPVGTFAPNGYGLYDMAGNVWEWCSDWYDRDLYKDRLKSKITSNPRGPDQSRD 323

Query: 301 KVK--------KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            ++        +GGS+LCN+ YC R+R +AR   TPD+   ++GFRC
Sbjct: 324 PMRPFMSQRSQRGGSFLCNDSYCSRYRPSARHGCTPDTGMSHVGFRC 370


>gi|443326286|ref|ZP_21054945.1| hypothetical protein Xen7305DRAFT_00032370 [Xenococcus sp. PCC
           7305]
 gi|442794085|gb|ELS03513.1| hypothetical protein Xen7305DRAFT_00032370 [Xenococcus sp. PCC
           7305]
          Length = 338

 Score =  224 bits (570), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 130/341 (38%), Positives = 180/341 (52%), Gaps = 62/341 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +DMV +PG +F++G ++  +    E  + +V +D F +DQHE++N QF EFV  TGY+T 
Sbjct: 48  QDMVWIPGGSFKIGDDEHYI---EEHSAPDVRVDGFCMDQHEITNGQFAEFVKETGYITV 104

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+      F P LSEE+RA  S V                                +  
Sbjct: 105 AERPLSATEF-PQLSEEQRAPGSLV--------------------------------FVQ 131

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
             G R+  E  W +   G          +W HPEG+DS+IE + N+PVV V++ DA AY 
Sbjct: 132 PLGLRV-KELGWWHWVTGA---------NWQHPEGVDSSIEGKENYPVVQVAYEDAQAYA 181

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEA+WE+  +GGL+N +F WGN  +     +AN WQG+FP  N   DGY  T
Sbjct: 182 DWAGKSLPTEAQWEFAAKGGLKNAIFTWGNTYSA---DKANTWQGKFPFKNLEKDGYYGT 238

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH------------PAPSYNPKGPTTGTD 300
           APV S+  N +GLY+M GNVWEWT DW+ + H              + S +P+ P     
Sbjct: 239 APVESFAANGYGLYDMAGNVWEWTVDWYRLGHKGKEHQVNPTVADASKSLDPREPGV-AK 297

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            V KGGS+LC + YC R R AAR   +PD+   ++GFR  +
Sbjct: 298 HVIKGGSFLCAQNYCSRFRPAAREAQSPDTGTSHIGFRLVS 338


>gi|395516530|ref|XP_003762440.1| PREDICTED: sulfatase-modifying factor 1 [Sarcophilus harrisii]
          Length = 396

 Score =  224 bits (570), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 121/251 (48%), Positives = 154/251 (61%), Gaps = 46/251 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +P   F MGTN P + +DGE P+R V +++FY+D +EVSN +F++FV++TGYVTEAE
Sbjct: 1   MAHIPAGIFTMGTNDPQIKQDGEGPARRVQINSFYMDLYEVSNAEFEKFVNSTGYVTEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFGD+FVF+ +LS+       QV++      G+D   E   H              C WR
Sbjct: 61  KFGDSFVFDIMLSD-------QVKN------GIDQAAESGPH--------------CNWR 93

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G           G R       F       PE  +  I +RM+HPV+HVSWNDAVAYCTW
Sbjct: 94  GV----------GIR-------FLNPKVEMPELKE--ITNRMDHPVLHVSWNDAVAYCTW 134

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY CRGGLENRL+PWGN L PRG+H AN+WQGEFP +NT  DGY  TAP
Sbjct: 135 AGKRLPTEAEWEYSCRGGLENRLYPWGNKLQPRGQHYANLWQGEFPVSNTGEDGYRGTAP 194

Query: 255 VMSYKENKFGL 265
             +  +++  L
Sbjct: 195 CQAGNQSRQSL 205



 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 40/58 (68%), Positives = 51/58 (87%)

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
           Y PKGP +G+++VKKGGSY+C++ YCYR+RCAARSQNTPDSS+ NLGFRCA+ + P T
Sbjct: 339 YMPKGPHSGSNRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSSSNLGFRCASSRLPDT 396



 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 36/49 (73%), Positives = 46/49 (93%)

Query: 343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
           KGP +G+++VKKGGSY+C++ YCYR+RCAARSQNTPDSS+ NLGFRCA+
Sbjct: 342 KGPHSGSNRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSSSNLGFRCAS 390


>gi|343087302|ref|YP_004776597.1| sulfatase-modifying factor protein [Cyclobacterium marinum DSM 745]
 gi|342355836|gb|AEL28366.1| Sulphatase-modifying factor protein [Cyclobacterium marinum DSM
           745]
          Length = 371

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 127/345 (36%), Positives = 183/345 (53%), Gaps = 48/345 (13%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           ++ MV + G  F  G+      +D E P+    +D F++D+ EV+N QF  FV+ATGYVT
Sbjct: 62  HEGMVWIEGGEFTRGSTDDRGRRD-ELPAHKAKVDGFWIDETEVTNAQFAAFVAATGYVT 120

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AEK  D    +  L         +V         + +++  R   P   +  N+A  + 
Sbjct: 121 IAEKKPDWEEIKKQLPPGTPKPPDEVL--------VAASLTFR--SPNQPVPLNNAAQWW 170

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W                        P  +W  P+G  S+IE + N+PVVHV+W+DAVAY
Sbjct: 171 EWT-----------------------PGANWRMPQGPGSSIEGKENYPVVHVAWDDAVAY 207

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G RLPTEAEWEY  +GGLE++ FPWGN+   +    AN+WQG+FP  + A DG+  
Sbjct: 208 AKWAGKRLPTEAEWEYAAKGGLEDKSFPWGNDPVEKNGFMANIWQGDFPVTDKAEDGFQG 267

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-------------PAPSYNPKGPTTG 298
            APV S++ N +GLY+M GNVWEWT DW+   ++             P  SY+P+ PT  
Sbjct: 268 LAPVKSFEPNGYGLYDMAGNVWEWTNDWYREDYYERITDEVAINPQGPKDSYDPQEPTV- 326

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
             KV KGGS+LCN  YC  +R + + +++ D+   + GFRC + K
Sbjct: 327 PKKVVKGGSFLCNVSYCEGYRVSGKMKSSADTGLEHTGFRCVSSK 371


>gi|383772147|ref|YP_005451213.1| hypothetical protein S23_39060 [Bradyrhizobium sp. S23321]
 gi|381360271|dbj|BAL77101.1| hypothetical protein S23_39060 [Bradyrhizobium sp. S23321]
          Length = 337

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 138/363 (38%), Positives = 185/363 (50%), Gaps = 64/363 (17%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R  +MV +PG TF MG++        E PS  V++D F++D+  V+N QF++FV+ATG+V
Sbjct: 17  RTGEMVFIPGGTFLMGSDHHY---PEEAPSHRVSVDGFWIDRTPVTNRQFKQFVNATGHV 73

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEA+   D   + P   +E     S V   + R   L                W      
Sbjct: 74  TEAQIVPDPKDY-PGAPKEMLYAGSLVFSPLPRITDL--------------TDW------ 112

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                      ++W    RG          +W HP G  STI+   +HPVVHVS +DAVA
Sbjct: 113 -----------SQWWSFLRGA---------NWRHPYGPGSTIKGMDDHPVVHVSHSDAVA 152

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G  LPTEAEWE+  RGGL+   F WG+ L P G+H AN+WQG FP  N   DG+ 
Sbjct: 153 YAGWAGKDLPTEAEWEFAARGGLDGEEFAWGDALMPGGKHMANIWQGNFPVQNLGEDGFE 212

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYNPK 293
            T+PVM++  N +GLY+M+GNVWEWT+DWW+  H                     SY+P 
Sbjct: 213 RTSPVMAFPPNGYGLYDMIGNVWEWTSDWWSARHTAEAAKPCCIPTNPRGGREEASYDPA 272

Query: 294 GPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
            P      KV KGGS+LC   YC R+R AAR     D+S  ++GFRC         TD+ 
Sbjct: 273 LPDIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRCVTRM--PLATDRG 330

Query: 353 KKG 355
           ++G
Sbjct: 331 REG 333


>gi|420262802|ref|ZP_14765443.1| sulfatase-modifying factor 1 [Enterococcus sp. C1]
 gi|394770559|gb|EJF50363.1| sulfatase-modifying factor 1 [Enterococcus sp. C1]
          Length = 287

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 137/338 (40%), Positives = 172/338 (50%), Gaps = 64/338 (18%)

Query: 15  MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  +P  TFR+GTN      +D E P  ++   AF +D   V+N  FQ FV ATGYVTEA
Sbjct: 1   MPTIPAGTFRIGTNTMDGFAEDLEGPQISLDSPAFAVDATTVTNQAFQVFVEATGYVTEA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   LSE       Q +H                                  
Sbjct: 61  ERFGWSFVFHYFLSE-------QTKH---------------------------------- 79

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R  ++P  A W Y   G           W HPEG DSTI  RM+HPVV VS NDA+AYC 
Sbjct: 80  RSQKVPNMAWW-YAVSGA---------DWRHPEGPDSTIAQRMDHPVVQVSRNDAIAYCQ 129

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWE   +GG     +PWG        +  N+WQGEFP +NT ADG+ +TA
Sbjct: 130 WAKKRLPTEAEWEIAAKGGTSFEKYPWGREFLAENTYHCNIWQGEFPKSNTRADGFANTA 189

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-------TGTDKV--KK 304
           P   Y+ N FGLY ++GNVWEW     N    P   +  +          T  D++   +
Sbjct: 190 PAKWYEPNGFGLYQVIGNVWEWCV---NPARIPLTQFQQQTGQTFWIRNQTQDDQMYATR 246

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGS+LC+E YC R+R AAR+ N+  S+A NLGFRC  D
Sbjct: 247 GGSFLCHESYCKRYRIAARNGNSGMSAANNLGFRCVKD 284


>gi|325108698|ref|YP_004269766.1| sulfatase-modifying factor protein [Planctomyces brasiliensis DSM
           5305]
 gi|324968966|gb|ADY59744.1| Sulphatase-modifying factor protein [Planctomyces brasiliensis DSM
           5305]
          Length = 394

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 135/349 (38%), Positives = 177/349 (50%), Gaps = 57/349 (16%)

Query: 15  MVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           MV +PG TF MGT+       P  +K  E P   VT+D F++D  EV+N +F +FV  TG
Sbjct: 72  MVWIPGGTFEMGTDPISSPKNPDKVKQDEIPKHEVTVDGFWMDVAEVTNAEFAKFVDETG 131

Query: 69  YVTEAE---KFGDTFVFEP---LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           YVT AE   K  D     P   L+ EE     S + +     E     +++         
Sbjct: 132 YVTFAEIPPKREDFIGMVPDISLIPEENLVAGSLIFNPDFDRENFRDDVQN--------- 182

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
                                WEY        +  P   W HP G DS+IE   +HPVV+
Sbjct: 183 ---------------------WEYQAW-----KYQPGADWRHPTGPDSSIEGLDDHPVVN 216

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           V + D VAYC W G RLPTEAEWEY   GG + + +PWGN   P GE+  N WQG FPT+
Sbjct: 217 VVYRDCVAYCKWAGKRLPTEAEWEYAAHGGKDVK-YPWGNEFLPDGEYMCNYWQGTFPTD 275

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTD- 300
               DG+L TAPV SY  N +GLY+M GNVWEW  D++   ++  +P  NP+GP    D 
Sbjct: 276 RQNLDGFLDTAPVKSYPPNGYGLYDMAGNVWEWVNDYYRPDYYRRSPKRNPQGPDNSFDP 335

Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                  +V +GGS+LCN   C  +RCAAR ++   S + + GFRC  D
Sbjct: 336 NEPGIEKRVTRGGSFLCNTNNCTGYRCAARMRSDVTSPSYHTGFRCVID 384


>gi|417089844|ref|ZP_11955736.1| hypothetical protein SSUR61_0017 [Streptococcus suis R61]
 gi|353533785|gb|EHC03425.1| hypothetical protein SSUR61_0017 [Streptococcus suis R61]
          Length = 288

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 134/346 (38%), Positives = 172/346 (49%), Gaps = 80/346 (23%)

Query: 15  MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ +PG  +R+GT  K     D E PS  V +++FY+D+  V+N +FQ FV AT Y+TEA
Sbjct: 1   MIKIPGGIYRIGTEQKEGFDADFETPSIEVNIESFYMDETSVTNKEFQVFVDATNYITEA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           EKFG ++VF   +S+  R++                       + V    W  AV     
Sbjct: 61  EKFGWSYVFHLFVSDHLRSR-----------------------NIVKETPWWIAV----- 92

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  SW  PEG  S+I  RM+HPVVH++ NDAVAYC 
Sbjct: 93  ------------------------PGASWKEPEGPGSSISDRMDHPVVHITRNDAVAYCR 128

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLST 252
           W G RLPTEAEWE   +GG E   F WG +L    G HR N WQG FPT NT  DGYL+T
Sbjct: 129 WSGKRLPTEAEWEVAAKGGTEFEKFYWGEDLVLSDGVHRCNTWQGIFPTKNTKEDGYLAT 188

Query: 253 APVMSYKENKFGLYNMVGNVWEWTA----------------DWWNVHHHPAPSYNPKGPT 296
           APV +Y  N  GLY M+GNVWEW                  D+W  +     +       
Sbjct: 189 APVKTYPANPLGLYEMIGNVWEWCVNPGKLDLSTFQIVSGQDFWREYQVIDDAL------ 242

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                  KGGS+LC+  YC R+R  AR+ +T  SS+ N+GFRC  D
Sbjct: 243 ----YAIKGGSFLCHSVYCNRYRMNARNSSTAMSSSQNMGFRCVRD 284


>gi|257875092|ref|ZP_05654745.1| conserved hypothetical protein [Enterococcus casseliflavus EC20]
 gi|257809258|gb|EEV38078.1| conserved hypothetical protein [Enterococcus casseliflavus EC20]
          Length = 287

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 136/338 (40%), Positives = 172/338 (50%), Gaps = 64/338 (18%)

Query: 15  MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  +P  T+R+GTN      +D E P  ++   AF +D   V+N  FQ FV ATGYVTEA
Sbjct: 1   MPTIPAGTYRIGTNTMDGFAEDLEGPQISLDSPAFAIDATTVTNQAFQVFVEATGYVTEA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   LSE       Q +H                                  
Sbjct: 61  ERFGWSFVFHYFLSE-------QTKH---------------------------------- 79

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R  ++P  A W Y   G           W HPEG DSTI  RM+HPVV VS NDA+AYC 
Sbjct: 80  RSQKVPNMAWW-YAVSGA---------DWRHPEGPDSTIAQRMDHPVVQVSRNDAIAYCQ 129

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWE   +GG     +PWG        +  N+WQGEFP +NT ADG+ +TA
Sbjct: 130 WAKKRLPTEAEWEIAAKGGTSFEKYPWGREFLAENTYHCNIWQGEFPRSNTQADGFANTA 189

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-------TGTDKV--KK 304
           P   Y+ N FGLY ++GNVWEW     N    P   +  +          T  D++   +
Sbjct: 190 PAKWYEPNGFGLYQVIGNVWEWCV---NPARIPLTQFQQQTGQTFWARNQTQDDQLYATR 246

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGS+LC+E YC R+R AAR+ N+  S+A NLGFRC  D
Sbjct: 247 GGSFLCHESYCKRYRIAARNGNSGMSAANNLGFRCVKD 284


>gi|406830272|ref|ZP_11089866.1| sulfatase-modifying factor protein [Schlesneria paludicola DSM
           18645]
          Length = 407

 Score =  222 bits (565), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 132/353 (37%), Positives = 176/353 (49%), Gaps = 50/353 (14%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P    + MV +P   F MGT+ P  + + E P+  V L+ +++D HEV+N +FQ+FV AT
Sbjct: 89  PTSAPEGMVWIPPGEFTMGTSGPSALPN-ERPAHQVHLNGYWIDAHEVTNAEFQKFVDAT 147

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           GYVT AE          +  + E  K +      K  + +          P   +  ND 
Sbjct: 148 GYVTTAE----------VKPDWEELKKTVPPGTPKPPDDVLVPGSLVFTAPPTPVPTNDV 197

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             + TW                        P   W HPEG  S+I++R  HPVVH+SW+D
Sbjct: 198 SKWWTW-----------------------IPGACWKHPEGPKSSIDNRSEHPVVHISWDD 234

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G RLPTEAEWE   RGGL  + F WG+         +N+WQG FP  N   D
Sbjct: 235 ANAYAKWAGKRLPTEAEWECASRGGLVGKRFTWGDEPPKESSKLSNIWQGVFPNQNDQLD 294

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH--------------PAPSYNPK 293
           G+  TAPV SY  N +GLY+M GNVWEW +DW+    +              PA S++P 
Sbjct: 295 GWERTAPVKSYPPNGYGLYDMSGNVWEWCSDWYRADAYAQRMAEKVVENPPGPADSWDPG 354

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA-DKGP 345
            P     +V +GGS+LC+  YC  +R AAR   TPD+   +LGFRC   DK P
Sbjct: 355 EP-DAAKRVIRGGSFLCHITYCESYRTAARRGTTPDTGMSHLGFRCVVPDKKP 406


>gi|325569224|ref|ZP_08145429.1| sulfatase-modifying factor 1 [Enterococcus casseliflavus ATCC
           12755]
 gi|325157473|gb|EGC69633.1| sulfatase-modifying factor 1 [Enterococcus casseliflavus ATCC
           12755]
          Length = 287

 Score =  221 bits (564), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 136/338 (40%), Positives = 172/338 (50%), Gaps = 64/338 (18%)

Query: 15  MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  +P  T+R+GTN      +D E P  ++   AF +D   V+N  FQ FV ATGYVTEA
Sbjct: 1   MPTIPAGTYRIGTNTMDGFAEDLEGPQISLDSPAFSIDATTVTNQAFQVFVEATGYVTEA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+FG +FVF   LSE       Q +H                                  
Sbjct: 61  ERFGWSFVFHYFLSE-------QTKH---------------------------------- 79

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R  ++P  A W Y   G           W HPEG DSTI  RM+HPVV VS NDA+AYC 
Sbjct: 80  RSQKIPNMAWW-YTVSGA---------DWRHPEGPDSTIAQRMDHPVVQVSRNDAIAYCQ 129

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWE   +GG     +PWG        +  N+WQGEFP +NT ADG+ +TA
Sbjct: 130 WAKKRLPTEAEWEIAAKGGTTFEKYPWGREFLAENTYHCNIWQGEFPKSNTRADGFANTA 189

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-------TGTDKV--KK 304
           P   Y+ N FGLY ++GNVWEW     N    P   +  +          T  D++   +
Sbjct: 190 PAKWYEPNGFGLYQVIGNVWEWCV---NPARIPLTQFQQQTGQTFWIRNQTQDDQLYATR 246

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGS+LC+E YC R+R AAR+ N+  S+A NLGFRC  D
Sbjct: 247 GGSFLCHESYCKRYRIAARNGNSGMSAANNLGFRCVKD 284



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/40 (57%), Positives = 31/40 (77%)

Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +GGS+LC+E YC R+R AAR+ N+  S+A NLGFRC  D+
Sbjct: 246 RGGSFLCHESYCKRYRIAARNGNSGMSAANNLGFRCVKDL 285


>gi|381182444|ref|ZP_09891251.1| non-specific serine/threonine protein kinase [Listeriaceae
           bacterium TTU M1-001]
 gi|380317681|gb|EIA20993.1| non-specific serine/threonine protein kinase [Listeriaceae
           bacterium TTU M1-001]
          Length = 285

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 125/335 (37%), Positives = 174/335 (51%), Gaps = 58/335 (17%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ +P   + +GTN K     D E P   V L  FY+D+  V+N +FQ+FV+ TGYV++A
Sbjct: 1   MIFIPKGNYEIGTNSKEGFEFDLEGPKVQVKLPNFYIDETTVTNQEFQKFVTKTGYVSDA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           EKFG +FVF   L EE +A   +V                                    
Sbjct: 61  EKFGSSFVFHYFLDEETKAISEKV------------------------------------ 84

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                  ++ W Y   G          +W HP G +S+I+ +M+HPVV VS NDA+A+  
Sbjct: 85  ------NQSGWWYEVAGA---------NWKHPAGPNSSIKEKMDHPVVQVSRNDAIAFSQ 129

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE   +GG +   +PWG  L    E+  N+WQG FP  N   DGY+ TA
Sbjct: 130 WAGKRLPTEAEWEIAAKGGTDRERYPWGEELLENEEYHCNIWQGNFPFENDGLDGYVDTA 189

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP----KGPTTGTDKVK--KGGS 307
           PV +Y+ N +G Y M+GNVWEW  +   ++     SY      K      D++   +GGS
Sbjct: 190 PVRTYEPNLYGCYQMIGNVWEWCLNPARINLSEFNSYTGKHFWKNYQMKDDQMYAIRGGS 249

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC++ YC R+R AAR+ N+  S++ NLGFRC  D
Sbjct: 250 FLCHQSYCKRYRIAARNGNSGMSASNNLGFRCVQD 284


>gi|398819756|ref|ZP_10578305.1| hypothetical protein PMI42_00781 [Bradyrhizobium sp. YR681]
 gi|398229690|gb|EJN15763.1| hypothetical protein PMI42_00781 [Bradyrhizobium sp. YR681]
          Length = 336

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 131/346 (37%), Positives = 182/346 (52%), Gaps = 64/346 (18%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV +PG TFRMG++        E PS  V++D F++D+  V+N QF++FV+AT +VTEA
Sbjct: 20  EMVFIPGGTFRMGSDHHY---PEEAPSHRVSVDGFWIDRTPVTNRQFKQFVNATDHVTEA 76

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                                 QV  D K + G            +  + +  ++ +   
Sbjct: 77  ----------------------QVVPDPKDYPGA-----------LKAMLYAGSLVFAPL 103

Query: 134 RGARLPTE-AEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                PT+ ++W    RG          +W HP G  S I+   +HPVVHVS +DA+AY 
Sbjct: 104 PRITDPTDWSQWWSFMRGA---------NWRHPYGPGSNIKGLDDHPVVHVSHSDALAYA 154

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           +W G  LPTEAEWE+  RGGL+   F WG+ L P G+H AN+WQG+FP  N   DG+  T
Sbjct: 155 SWAGKDLPTEAEWEFAARGGLDGEEFAWGDALMPGGKHMANIWQGKFPVQNLGEDGFERT 214

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD--------- 300
           +PVM++  N +GLY+M+GNVWEWT+DWW+  H      P   P  P  G +         
Sbjct: 215 SPVMAFPPNGYGLYDMIGNVWEWTSDWWSARHASEVAKPCCIPSNPRGGREEASYDPALP 274

Query: 301 ------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                 KV KGGS+LC   YC R+R AAR     D+S  ++GFRC 
Sbjct: 275 DIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHVGFRCV 320


>gi|430842314|ref|ZP_19460229.1| hypothetical protein OGO_02000 [Enterococcus faecium E1007]
 gi|431592164|ref|ZP_19521400.1| hypothetical protein OK5_04253 [Enterococcus faecium E1861]
 gi|430493395|gb|ELA69698.1| hypothetical protein OGO_02000 [Enterococcus faecium E1007]
 gi|430591789|gb|ELB29816.1| hypothetical protein OK5_04253 [Enterococcus faecium E1861]
          Length = 283

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 128/333 (38%), Positives = 170/333 (51%), Gaps = 58/333 (17%)

Query: 15  MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ +P  TF +GTN+P     D E P   +   AF +D+  V+N +F  F+  TGY+TEA
Sbjct: 1   MIAIPEGTFHIGTNQPDGFPTDNEGPKITIQHPAFMIDETTVTNKEFASFIKKTGYITEA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           EKFG +FVF   LSE  + K                                        
Sbjct: 61  EKFGWSFVFHYFLSEVTKKK---------------------------------------- 80

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
             ++L     W Y   G           W HPEG DS+IE R +HPVV VS NDAVAYC 
Sbjct: 81  --SKLVPNMNWWYAVAGA---------DWSHPEGPDSSIEARWDHPVVQVSRNDAVAYCQ 129

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE   +GG E + + WG      G ++ N+WQG+FP  NT ADG+ +TA
Sbjct: 130 WAGKRLPTEAEWEIAAKGGTEFQKYYWGKEFLKDGTYQCNIWQGDFPRTNTLADGFANTA 189

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNV----HHHPAPSYNPKGPTTGTDKV--KKGGS 307
           P   Y+ N++G+Y  +GNVWEW ++   +          +Y         DK    +GGS
Sbjct: 190 PAKWYEANEYGMYQPIGNVWEWCSNPARIDLTEFKEKDSAYYWSHHQIIDDKTYATRGGS 249

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           +LC+E YC R+R +AR+ NT  S+A NLGFRC 
Sbjct: 250 FLCHESYCKRYRLSARNGNTGMSAANNLGFRCV 282


>gi|403308842|ref|XP_003944851.1| PREDICTED: sulfatase-modifying factor 2 [Saimiri boliviensis
           boliviensis]
          Length = 378

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 176/337 (52%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGTN P   +DGE P R VT+  F +D   V+N +F++F+    Y TEAE
Sbjct: 107 MVQLPGGRFLMGTNSPD-SRDGEGPVREVTVKPFAIDIFPVTNKEFRDFIREKKYRTEAE 165

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 166 MFGWSFVFEDFVSDELRNKATQ---------------------PMKPVLW--------W- 195

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 196 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 235

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 236 RGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDRAEDGFHGVSP 292

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 293 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 339

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 340 GSANHRARVTTRMGNTPDSASDNLGFRCAADTGRPRG 376


>gi|224076257|ref|XP_002196123.1| PREDICTED: sulfatase-modifying factor 2 isoform 1 [Taeniopygia
           guttata]
          Length = 310

 Score =  221 bits (562), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 130/335 (38%), Positives = 179/335 (53%), Gaps = 65/335 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG TF+MG+   +  +  E P+R VT+  F LD+H V+N +F+EFV    Y TEAE
Sbjct: 27  MVQLPGGTFQMGSGS-LQRRGEEGPAREVTVKPFALDKHPVTNREFREFVREKKYKTEAE 85

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +SEE + K++         + L+S              W        W 
Sbjct: 86  AFGWSFVFEDFVSEELKKKVT---------QKLESA------------PW--------W- 115

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S+I+ R+++PV+HVSWNDA A+C W
Sbjct: 116 ---LPIEKAF-----------------WRQPSGPGSSIKDRLDYPVLHVSWNDAQAFCAW 155

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G RLP+E EWE+  RGGLE R++PWGN   P   +R N+WQG+FP  +TA DGY   +P
Sbjct: 156 KGKRLPSEEEWEFAARGGLEQRMYPWGNKFQP---NRTNLWQGDFPRGDTAEDGYHGVSP 212

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           V ++  +N +G Y+++GN WEWTA        PAP    +       +V +G S++    
Sbjct: 213 VTAFPAQNSYGFYDLLGNTWEWTAS-----EFPAPGRASRAQKM---QVLRGASWIDTAD 264

Query: 314 YCYRHRC--AARSQNTPDSSAGNLGFRCAADKGPT 346
               HR     R  NTPDS++ NL FRCAAD  P 
Sbjct: 265 GSANHRACVTTRMGNTPDSASDNLSFRCAADVPPV 299


>gi|443325559|ref|ZP_21054249.1| hypothetical protein Xen7305DRAFT_00039650 [Xenococcus sp. PCC
           7305]
 gi|442794840|gb|ELS04237.1| hypothetical protein Xen7305DRAFT_00039650 [Xenococcus sp. PCC
           7305]
          Length = 324

 Score =  221 bits (562), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 133/340 (39%), Positives = 180/340 (52%), Gaps = 56/340 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           ++MVL+PG TF MG++ P  +++       V +D+F +D HEV+N +F +FV  TGY+T 
Sbjct: 30  ENMVLIPGGTFHMGSDHPEFVEENAI--AKVKIDSFCIDTHEVTNAEFAKFVQKTGYLTI 87

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+      F P L E  R   S V      FE  ++ +E               +AY  
Sbjct: 88  AERPLSVQEF-PDLPESARKPGSMV------FEPPEAGLEQ--------------IAYMG 126

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W  G             +W HP G +S I+ + NHPVVH+++ DA+AY 
Sbjct: 127 WW--------HWVTGA------------NWQHPYGPESNIQGKENHPVVHIAYFDALAYA 166

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W   +LPTEA+WEY  RGGLE + + WG+  +      AN WQG FP  NT ADGYL T
Sbjct: 167 KWSNKQLPTEAQWEYAARGGLEGKEYTWGDEYSAT---NANTWQGLFPFFNTKADGYLGT 223

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDK---------V 302
           APV S+  N +GL +M GNVWEWT D++ V   P A   NP+GP  G D          V
Sbjct: 224 APVGSFPPNGYGLSDMAGNVWEWTQDYYQVLRKPMAYKSNPQGPKIGYDPKKPFERNSHV 283

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            KGGS+LC   YC R+R AAR   +PD+   ++GFR   +
Sbjct: 284 IKGGSHLCAPNYCSRYRPAARESQSPDTGTTHIGFRLVKN 323


>gi|375088980|ref|ZP_09735316.1| hypothetical protein HMPREF9703_01398 [Dolosigranulum pigrum ATCC
           51524]
 gi|374560781|gb|EHR32134.1| hypothetical protein HMPREF9703_01398 [Dolosigranulum pigrum ATCC
           51524]
          Length = 290

 Score =  220 bits (561), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 130/329 (39%), Positives = 178/329 (54%), Gaps = 59/329 (17%)

Query: 18  LPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
           +P  +F+MGT  +     D E  S +V L++F +    V+N +F+ F++AT Y+T+AEKF
Sbjct: 8   IPSGSFKMGTICQEGFATDLETGSVDVALESFQIKTTTVTNREFKAFIAATDYITDAEKF 67

Query: 77  GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
           G + VF  L+ EE+R               LDS     M                     
Sbjct: 68  GWSSVFHLLIPEEDR---------------LDSLAVQGM--------------------- 91

Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
                 EW    +G           W  PEG  S+I+ R++HPVVHVS NDA+AYC W G
Sbjct: 92  ------EWWLIIKGA---------CWHAPEGPGSSIDGRLDHPVVHVSRNDALAYCRWEG 136

Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
           +RLPTEA+WEY  RGGLE + +PWG+NL   G++  N+WQG+FP +NT  DGYL TAP  
Sbjct: 137 SRLPTEAQWEYAARGGLEMKRYPWGDNLLIDGKYPLNIWQGQFPISNTEEDGYLGTAPAN 196

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNV------HHHPAPSYNPKGPTT-GTDKVKKGGSYL 309
           +Y  N +GLY MVGNVWEW  +   +       H     +N    ++ G     +GGS+L
Sbjct: 197 NYTPNGYGLYQMVGNVWEWCLNPGKIPLNQFQKHTSEDFFNYFNRSSHGKLFAIRGGSFL 256

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           C++ YC R+R  AR+  T DSSA N+GFR
Sbjct: 257 CHDSYCNRYRVGARNSVTDDSSASNIGFR 285


>gi|126659223|ref|ZP_01730361.1| hypothetical protein CY0110_04588 [Cyanothece sp. CCY0110]
 gi|126619528|gb|EAZ90259.1| hypothetical protein CY0110_04588 [Cyanothece sp. CCY0110]
          Length = 325

 Score =  220 bits (561), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 134/338 (39%), Positives = 182/338 (53%), Gaps = 56/338 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TFRMG++    I+  E P  +VT+D+F +D +E++N+QF +FV  TGY T AE
Sbjct: 33  MTYIPGGTFRMGSDNQDFIE--EKPVDDVTVDSFCIDTYEITNSQFAKFVKETGYQTIAE 90

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      F P L +E+R   S V      F+  +  IE               VAY +W 
Sbjct: 91  RPLSKEQF-PNLPDEQRKPGSLV------FQPPNEGIER--------------VAYLSWW 129

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W  G             +W HP G +S I  + N+PVVH+S++DA+AY  W
Sbjct: 130 --------HWVTGA------------NWQHPYGEESDILSKENYPVVHISYDDALAYAKW 169

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
              +LPTEA+WEY  RGGL+ + + WG+  + +    AN WQG FP  NT  DGYL  AP
Sbjct: 170 ADKQLPTEAQWEYAARGGLQGKDYTWGDQYSAK---NANTWQGIFPFFNTKEDGYLGAAP 226

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVK---------K 304
           V S+  N +GLY+M GNVWEWT DW+ V   + A   NPKGP  G D  K         K
Sbjct: 227 VGSFPANGYGLYDMSGNVWEWTQDWYQVERENMAHKTNPKGPKIGYDPKKPQDKLVHVIK 286

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGS+LC + YC R+R AAR   +PD+   ++GFR   +
Sbjct: 287 GGSHLCAKNYCSRYRPAARESQSPDTGTTHIGFRLVKN 324


>gi|380300787|ref|ZP_09850480.1| hypothetical protein BsquM_01775 [Brachybacterium squillarum M-6-3]
          Length = 342

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 139/352 (39%), Positives = 179/352 (50%), Gaps = 66/352 (18%)

Query: 6   APPVE----RYKDMVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQF 60
           APP +    R  +   +PG  F MG       + DGE P   V + A  +D   V+N  F
Sbjct: 39  APPTDTAERRRIEQAHVPGGEFTMGDATGHRNRGDGETPLHRVAISALDVDVTTVTNADF 98

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
             FV  TGY TEAE+FG + VF   L+ EE    + VR                   P  
Sbjct: 99  ARFVEDTGYRTEAERFGFSAVFHLALAAEE----ADVRG------------------PAA 136

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEY-GCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHP 179
              W        WRG R    A+W + GCR                    S ++   +HP
Sbjct: 137 GTPW--------WRGVR---GADWRHSGCR-------------------RSDLDGLDDHP 166

Query: 180 VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEF 239
           VVH+SW+DA +YC+W G RLPTEAE EY  RGG+E   FPWG     +  +R +VWQGEF
Sbjct: 167 VVHMSWHDAQSYCSWAGRRLPTEAEGEYTARGGIEGATFPWGEEPIDQPPYRVDVWQGEF 226

Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH--------PAPSYN 291
           P  +T ADGYL+TAPV S+  N +GL+  VGNVWEW  D ++   +             +
Sbjct: 227 PRRDTGADGYLTTAPVRSFATNGYGLWQSVGNVWEWCEDGFHPKSYRRGTTAAARTTITD 286

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           P GP  G  +V +GGSYLC+  YC R+R AARS+NTPDSS GN GFR  + +
Sbjct: 287 PTGPEGGGTRVLRGGSYLCHASYCNRYRNAARSRNTPDSSMGNAGFRTVSPR 338



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 29/48 (60%), Positives = 35/48 (72%)

Query: 344 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
           GP  G  +V +GGSYLC+  YC R+R AARS+NTPDSS GN GFR  +
Sbjct: 289 GPEGGGTRVLRGGSYLCHASYCNRYRNAARSRNTPDSSMGNAGFRTVS 336


>gi|344201754|ref|YP_004786897.1| sulfatase-modifying factor protein [Muricauda ruestringensis DSM
           13258]
 gi|343953676|gb|AEM69475.1| Sulphatase-modifying factor protein [Muricauda ruestringensis DSM
           13258]
          Length = 366

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 132/348 (37%), Positives = 180/348 (51%), Gaps = 48/348 (13%)

Query: 8   PVERYKDMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           P E  K MV +PG  F  G      +    E P+  V +D F++D HEV+N QF +F+  
Sbjct: 46  PSEAPKGMVWIPGGEFNQGAVPNDNMAMSHEKPAHKVYVDGFFMDIHEVTNAQFAKFIEE 105

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TGYVT AE+  D    +  L E      +Q  HD     G  S +  +    V ++    
Sbjct: 106 TGYVTMAEREIDWEEMKKQLPEG-----TQKPHDSILQPG--SLVFKKTKSSVPNL---- 154

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
              Y  W         EW+ G             SW HP G DS I+ + +HPVVH+++ 
Sbjct: 155 -YDYSQWW--------EWKIGA------------SWKHPNGPDSNIDGKEDHPVVHIAFE 193

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA AYC W G RLPTEAEWE+  R    +  F WG+++     H AN W+GEFP  NT  
Sbjct: 194 DAQAYCKWAGRRLPTEAEWEFAARAQKTDSKFFWGDDIDQLSSH-ANSWEGEFPVKNTMQ 252

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA------PSYNPKGPTT--- 297
           DG+   AP+MSY +N FGLY+M GNVWEWT+DW+N +++         + NPKG  +   
Sbjct: 253 DGFERRAPIMSYPKNDFGLYDMAGNVWEWTSDWYNTNYYQKLASNGEVAQNPKGAASAFN 312

Query: 298 -----GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                  ++V KGGS+LC+  YC  +R +AR   +PDS   +LGFR  
Sbjct: 313 SNNPYAKERVIKGGSFLCSASYCASYRISARMATSPDSGMEHLGFRTV 360


>gi|134099642|ref|YP_001105303.1| hypothetical protein SACE_3100 [Saccharopolyspora erythraea NRRL
           2338]
 gi|291005452|ref|ZP_06563425.1| hypothetical protein SeryN2_13112 [Saccharopolyspora erythraea NRRL
           2338]
 gi|133912265|emb|CAM02378.1| protein of unknown function DUF323 [Saccharopolyspora erythraea
           NRRL 2338]
          Length = 282

 Score =  220 bits (560), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 137/336 (40%), Positives = 165/336 (49%), Gaps = 66/336 (19%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           P  R  D+VLL G  F MG+       +DGE P R V L+AF + +  VSN +       
Sbjct: 11  PPRRAGDLVLLDGGEFLMGSEDAHAYPEDGEGPVRAVRLEAFRISRTAVSNVE------- 63

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
                    FG+        +E ER   S V      F GL                   
Sbjct: 64  ---------FGEFVAATGYRTEAERWGWSFV------FGGL------------------- 89

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
                      LP +     G       R      W  PEG  S ++ R +HPVVHVSW+
Sbjct: 90  -----------LPDDFPPTRGVAATPWWRQVEGADWRRPEGPGSDVDQRGDHPVVHVSWS 138

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA AYC W G RLPTEAEWEY  RGGL  R FPWG+ L P G H  NVWQGEFP  NT A
Sbjct: 139 DARAYCAWAGLRLPTEAEWEYAARGGLAGRAFPWGDELEPGGRHLMNVWQGEFPRRNTFA 198

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
           DG+L T PV ++  N FGL+N  GNVWEW ADW++             P   T +  +GG
Sbjct: 199 DGWLGTCPVDAFPPNGFGLHNTTGNVWEWCADWFD-------------PGDRTRRSARGG 245

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           SYLC++ YC R+R +AR   TPDS+ GN GFRCA D
Sbjct: 246 SYLCHDSYCRRYRVSARQGLTPDSTTGNTGFRCALD 281



 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/64 (46%), Positives = 40/64 (62%)

Query: 330 SSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 389
           ++ GN+   CA    P   T +  +GGSYLC++ YC R+R +AR   TPDS+ GN GFRC
Sbjct: 219 NTTGNVWEWCADWFDPGDRTRRSARGGSYLCHDSYCRRYRVSARQGLTPDSTTGNTGFRC 278

Query: 390 AADV 393
           A D 
Sbjct: 279 ALDA 282


>gi|402863386|ref|XP_003895998.1| PREDICTED: sulfatase-modifying factor 2 isoform 1 [Papio anubis]
          Length = 320

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 173/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 49  MVQLPGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---------------------PMKPVLW--------W- 137

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 138 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 177

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG FP  + A DG+   +P
Sbjct: 178 RGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGRFPKGDKAEDGFHGVSP 234

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 235 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 281

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 282 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRLPG 318



 Score = 41.2 bits (95), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 269 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 313


>gi|427717852|ref|YP_007065846.1| sulfatase-modifying factor protein [Calothrix sp. PCC 7507]
 gi|427350288|gb|AFY33012.1| Sulphatase-modifying factor protein [Calothrix sp. PCC 7507]
          Length = 357

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 131/340 (38%), Positives = 180/340 (52%), Gaps = 56/340 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + MVL+PG  F+MG++    ++  E    +VT+  F +DQ+EV+N QF  FV ATGYVT 
Sbjct: 47  QGMVLIPGGKFQMGSDNSGFVE--ERSPGDVTVTNFCIDQYEVTNPQFAAFVKATGYVTV 104

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+        PL  E+          D+   + L  ++   M  P V       V++ +
Sbjct: 105 AER--------PLSPEQ--------FPDLPDEQRLPGSLVFEMAKPGVK-----EVSFLS 143

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W  G             +W HP G +S I ++ N+PVVH+++ DA+AY 
Sbjct: 144 WW--------HWTTGA------------NWQHPFGKESAIANKSNYPVVHIAYEDALAYA 183

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEA+WEY  RGGL+ + + WG+  +   E +AN WQG FP  NT ADGY   
Sbjct: 184 KWSGKSLPTEAQWEYAARGGLDGKTYAWGDQYS---EKKANTWQGIFPFFNTKADGYTEI 240

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-------HPAP---SYNPKGPTTGTDKV 302
           APV S+  N +GLY+M GNVWEWTAD + V H       +P     S++PK P      V
Sbjct: 241 APVGSFAPNGYGLYDMTGNVWEWTADLFQVGHDYKSHQINPIALNQSFDPKKPNESALHV 300

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            KGGSYLC   YC R R AAR   +PD+   ++GFR   +
Sbjct: 301 IKGGSYLCAPNYCSRFRPAARESESPDTGTTHIGFRLVKN 340


>gi|223936270|ref|ZP_03628183.1| protein of unknown function DUF323 [bacterium Ellin514]
 gi|223895132|gb|EEF61580.1| protein of unknown function DUF323 [bacterium Ellin514]
          Length = 368

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 126/335 (37%), Positives = 178/335 (53%), Gaps = 53/335 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  + G TF MG+ +    +  E P   VT+  F++D+ EVSN QF++FV +TGY+T AE
Sbjct: 75  MAWIQGGTFYMGSEE---GRPDEKPVHQVTVHGFWMDKTEVSNEQFEKFVQSTGYITMAE 131

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +       +P   +   A  S +      F             P   +   DA A+    
Sbjct: 132 R-------KPNPKDYPDADPSLLVPGSVVFS-----------PPSESVPLTDASAWW--- 170

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                               R  P   W HPEG +STI+ R  HPVVHV+W DA+AY  W
Sbjct: 171 --------------------RYVPGADWRHPEGPNSTIKGREKHPVVHVAWEDAMAYAKW 210

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY  RGGL+ + + WG  +   G  +AN+W+G FP  +T ADG+  TAP
Sbjct: 211 AGKRLPTEAEWEYAARGGLDRQPYVWGKEMNTNGNWQANIWEGHFPNEDTGADGFKGTAP 270

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTD--------KVKKG 305
           V ++K N +GL++M GNVWEW +DW+   ++  +   NPKGP    D        +V++G
Sbjct: 271 VATFKPNGYGLHDMAGNVWEWCSDWYRPDYYAQSVGKNPKGPEDSFDPEEPGAKKRVQRG 330

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GSYLCN+ YC  ++ ++R + TPD+   + GFRC 
Sbjct: 331 GSYLCNDVYCSGYKPSSRMKCTPDTGLSHSGFRCV 365


>gi|75906337|ref|YP_320633.1| hypothetical protein Ava_0112 [Anabaena variabilis ATCC 29413]
 gi|75700062|gb|ABA19738.1| Protein of unknown function DUF323 [Anabaena variabilis ATCC 29413]
          Length = 331

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 136/348 (39%), Positives = 181/348 (52%), Gaps = 64/348 (18%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P PP     +MV++PG TF MG++    I+  E  ++ V + +F +D++EV+N+QF EFV
Sbjct: 15  PCPP-----EMVMIPGGTFTMGSDNSGYIE--ELSAQEVRVSSFCIDKYEVTNSQFAEFV 67

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
            ATGYVT AE+          LS+E+   +     D +R  G            V  I  
Sbjct: 68  KATGYVTVAER---------PLSKEQFPDLP----DEQRLPG----------SLVFAIVQ 104

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
            +A     W          W+ G             +W HP G +S I H+ N+PVVH++
Sbjct: 105 PEAKVLNWWH---------WQTGA------------NWRHPFGKESAIVHQDNYPVVHIA 143

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           + DAVAY  W G  LPTEA+WEY  RGGL+   + WG+  +   E +AN WQG FP  NT
Sbjct: 144 YEDAVAYAQWAGKSLPTEAQWEYAARGGLDAATYAWGDQYS---EKKANTWQGIFPLFNT 200

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDK-- 301
             DGY S APV S+  N +GLY+M GNVWE T+DW+   H+H   S NP GP    D   
Sbjct: 201 KKDGYASIAPVGSFPPNGYGLYDMTGNVWELTSDWFMPGHNHKTHSVNPTGPEQSFDPNK 260

Query: 302 -------VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                  V KGGSYLC   YC R R AAR    PD+   ++GFR   +
Sbjct: 261 PSEQALHVIKGGSYLCAPNYCSRFRPAARESQAPDTGTTHIGFRLVKN 308


>gi|338712716|ref|XP_001493384.3| PREDICTED: sulfatase-modifying factor 2-like [Equus caballus]
          Length = 326

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 133/333 (39%), Positives = 174/333 (52%), Gaps = 70/333 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F+MGTN P   +DGE P R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 55  MVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEAE 113

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K++Q    +                    + W          
Sbjct: 114 TFGWSFVFEDFVSDELRNKVTQQMESV--------------------LWW---------- 143

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E             R F    W  P G  S I  R+  PVVHVSWNDA AYC W
Sbjct: 144 ---LPVE-------------RAF----WRQPAGPGSGIRERLELPVVHVSWNDARAYCAW 183

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 184 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 240

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY++VGNVWEWTA             +P        +V +G S++   +
Sbjct: 241 VNAFPPQNNYGLYDLVGNVWEWTA-------------SPYQAADQDMRVLRGASWIDTAD 287

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
               +R R   R  NTPDS++ NLGFRCA+D G
Sbjct: 288 GSANHRARVTTRMGNTPDSASDNLGFRCASDTG 320



 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCA+D 
Sbjct: 275 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCASDT 319


>gi|354553265|ref|ZP_08972572.1| Sulphatase-modifying factor protein [Cyanothece sp. ATCC 51472]
 gi|353555095|gb|EHC24484.1| Sulphatase-modifying factor protein [Cyanothece sp. ATCC 51472]
          Length = 325

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 181/338 (53%), Gaps = 56/338 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TFRMG+     ++  E P  +VT+D+F +D +E++N QF +FV  TGY T AE
Sbjct: 33  MTYIPGGTFRMGSEHSDFVE--EKPVDDVTVDSFCIDTYEITNGQFAQFVKETGYKTVAE 90

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      F P L +E+R   S V      F+  +  I+               VAY +W 
Sbjct: 91  RPLSKEQF-PNLPDEQRKPGSLV------FQPPEEGIQR--------------VAYLSWW 129

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W  G             +W HP G +S I  + N+PV+H+S+ DA+AY  W
Sbjct: 130 --------HWVTGA------------NWKHPFGEESDIIGKDNYPVIHISYEDALAYAQW 169

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G +LPTEA+WEY  RGGL+ + + WG+  + +    AN WQG FP  NT ADGY+  AP
Sbjct: 170 DGKQLPTEAQWEYAARGGLQEKEYTWGDQYSAK---NANTWQGIFPFFNTKADGYVGAAP 226

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVK---------K 304
           V S+  N +GLY+M GNVWEWT DW+ V   + A   NPKGP  G D  K         K
Sbjct: 227 VGSFPPNGYGLYDMSGNVWEWTQDWYQVERDNMAHKTNPKGPERGYDPKKPQDQLVHVIK 286

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGS+LC + YC R+R AAR   +PD+   ++GFR   +
Sbjct: 287 GGSHLCAKNYCSRYRPAARESQSPDTGTTHIGFRLVKN 324


>gi|172036481|ref|YP_001802982.1| hypothetical protein cce_1566 [Cyanothece sp. ATCC 51142]
 gi|171697935|gb|ACB50916.1| unknown [Cyanothece sp. ATCC 51142]
          Length = 328

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 181/338 (53%), Gaps = 56/338 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TFRMG+     ++  E P  +VT+D+F +D +E++N QF +FV  TGY T AE
Sbjct: 36  MTYIPGGTFRMGSEHSDFVE--EKPVDDVTVDSFCIDTYEITNGQFAQFVKETGYKTVAE 93

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      F P L +E+R   S V      F+  +  I+               VAY +W 
Sbjct: 94  RPLSKEQF-PNLPDEQRKPGSLV------FQPPEEGIQR--------------VAYLSWW 132

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W  G             +W HP G +S I  + N+PV+H+S+ DA+AY  W
Sbjct: 133 --------HWVTGA------------NWKHPFGEESDIIGKDNYPVIHISYEDALAYAQW 172

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G +LPTEA+WEY  RGGL+ + + WG+  + +    AN WQG FP  NT ADGY+  AP
Sbjct: 173 DGKQLPTEAQWEYAARGGLQEKEYTWGDQYSAK---NANTWQGIFPFFNTKADGYVGAAP 229

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVK---------K 304
           V S+  N +GLY+M GNVWEWT DW+ V   + A   NPKGP  G D  K         K
Sbjct: 230 VGSFPPNGYGLYDMSGNVWEWTQDWYQVERDNMAHKTNPKGPERGYDPKKPQDQLVHVIK 289

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGS+LC + YC R+R AAR   +PD+   ++GFR   +
Sbjct: 290 GGSHLCAKNYCSRYRPAARESQSPDTGTTHIGFRLVKN 327


>gi|326797828|ref|YP_004315647.1| sulfatase-modifying factor protein [Sphingobacterium sp. 21]
 gi|326548592|gb|ADZ76977.1| Sulphatase-modifying factor protein [Sphingobacterium sp. 21]
          Length = 364

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 129/341 (37%), Positives = 181/341 (53%), Gaps = 56/341 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G  F+MG+++     D + P   V +  F++D+HEV+N QF +FV ATGY+T AE
Sbjct: 70  MVFIKGGVFQMGSSE---FADTQ-PIHEVKVHDFWMDEHEVTNAQFAKFVEATGYITVAE 125

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  +   F P +S +     S V    K    L++           H+ W          
Sbjct: 126 RALNPADF-PGVSRDLLVPGSAVFTPPKEQVNLNN-----------HLQW---------- 163

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W+Y           P  SW  P+G  S I  + N PVV VS+ DA A+  W
Sbjct: 164 ---------WKY----------VPGASWRQPKGPGSNINGKENEPVVQVSYEDAEAFAKW 204

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY  R G   + + WG+ L P+G+  AN +QG FP +N A DG+   AP
Sbjct: 205 AGKRLPTEAEWEYAARAGQVGKKYYWGDELKPQGKWVANTYQGNFPVSNAAEDGFAELAP 264

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWW--NVHHHPAPSYNPKGPTTGTD--------KVKK 304
           V S+  N +GLY+M GNVWEW AD++  + + H A   NPKGP+   D        +V++
Sbjct: 265 VKSFPANAYGLYDMDGNVWEWCADFYRPDAYEH-AEKDNPKGPSDSYDPAEPGLVKRVQR 323

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           GGS+LCN+QYC R++  +R +    S++ NLGFRC  DK P
Sbjct: 324 GGSFLCNDQYCERYKAGSRGKGEVSSASNNLGFRCVQDKAP 364


>gi|414172785|ref|ZP_11427696.1| hypothetical protein HMPREF9695_01342 [Afipia broomeae ATCC 49717]
 gi|410894460|gb|EKS42250.1| hypothetical protein HMPREF9695_01342 [Afipia broomeae ATCC 49717]
          Length = 336

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 132/351 (37%), Positives = 177/351 (50%), Gaps = 64/351 (18%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E+  DMV +PG TFRMG++        E PS  VT+D F++D+  V+N QF++FV ATG+
Sbjct: 17  EQTADMVWIPGGTFRMGSDH---HYPEEAPSHRVTVDGFWMDKTPVTNRQFKQFVKATGH 73

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT AE                      +  D K + G            + H+ +  ++ 
Sbjct: 74  VTLAE----------------------IPPDPKDYPGA-----------LPHMLYAGSLV 100

Query: 130 YCTWRG-ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           +   R    L   + W    +G           W HP G  + I    NHPVVHVS++DA
Sbjct: 101 FSPPRAPVDLRDWSRWWSFMKGA---------DWRHPYGPKTNINVHDNHPVVHVSYSDA 151

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           +AY  W G  LPTEAEWE+  RGGL+   F WG   TP G H AN WQGEFP  N   DG
Sbjct: 152 LAYAKWAGKDLPTEAEWEFAARGGLDGAEFAWGEEFTPGGVHMANTWQGEFPLQNLNQDG 211

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD----- 300
           Y  T+PV ++  N +G+++M+GNVWEWT+DWW+  H    P     P+ P  G +     
Sbjct: 212 YERTSPVRAFPANGYGIHDMIGNVWEWTSDWWSAGHEADAPKACCIPENPRGGPESASYD 271

Query: 301 ----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                     KV KGGS+LC   YC R+R AAR     D+S  ++GFRC  
Sbjct: 272 SCQPSIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRCVV 322


>gi|332710257|ref|ZP_08430206.1| hypothetical protein LYNGBM3L_49460 [Moorea producens 3L]
 gi|332350948|gb|EGJ30539.1| hypothetical protein LYNGBM3L_49460 [Moorea producens 3L]
          Length = 347

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 139/345 (40%), Positives = 178/345 (51%), Gaps = 71/345 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSR----NVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MVL+PG TF +G+       D  +PS     +VT+D+F +D+HEV+N  +Q FV ATGYV
Sbjct: 47  MVLIPGGTFMIGS-------DTYYPSERSAVDVTVDSFCIDRHEVTNADYQAFVQATGYV 99

Query: 71  TEAEK--FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           T AE+    D F   P L+E +R   S V      F+     I+               V
Sbjct: 100 TVAERPLAKDQF---PGLTEAQRQPGSLV------FQPPAPGIQQ--------------V 136

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           AY +W         +W  G             +W HP G DSTIE    +PVVH+++ DA
Sbjct: 137 AYLSWW--------DWVVGA------------NWQHPYGPDSTIEGFDKYPVVHIAYEDA 176

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           VAY  W G  LPTEA+WEY  RGGL+      G    P   H+AN WQG FP  NT ADG
Sbjct: 177 VAYADWVGKTLPTEAQWEYAARGGLDTDTSIPGEQYAP---HKANTWQGFFPFFNTKADG 233

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH------------PAPSYNPKGPT 296
           YL  APV S+  N +GLY+M+GNVWE TADW+ V H                S++P+ P 
Sbjct: 234 YLGPAPVESFSPNGYGLYDMLGNVWELTADWYRVGHRGKDRSLNPLGPDQGSSFDPEKPD 293

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            G   V KGGS+LC   YC R+R  AR    PD+   ++GFR  A
Sbjct: 294 QGALHVIKGGSFLCARNYCSRYRPEARESQAPDTGTNHVGFRLVA 338


>gi|384218970|ref|YP_005610136.1| hypothetical protein BJ6T_52880 [Bradyrhizobium japonicum USDA 6]
 gi|354957869|dbj|BAL10548.1| hypothetical protein BJ6T_52880 [Bradyrhizobium japonicum USDA 6]
          Length = 336

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 134/356 (37%), Positives = 183/356 (51%), Gaps = 63/356 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV +PG TFRMG++        E PS  V++D F++D+  V+N QF++FV+ATG+VTEA
Sbjct: 20  EMVYIPGGTFRMGSDHHY---PEEAPSHRVSVDGFWIDRTPVTNRQFKQFVNATGHVTEA 76

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +   +   +   L E   A  S V   + R   L                         W
Sbjct: 77  QIVPNPKDYPGALKEMLYAG-SLVFSPLPRITDL-----------------------ADW 112

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                   ++W    RG          +W HP G  S I+   +HPVVHVS++DA AY  
Sbjct: 113 --------SQWWSFMRGA---------NWRHPYGPGSNIKGLDDHPVVHVSYSDAAAYAR 155

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGL+   F WG+ L P G+H AN WQG FP  N   DG+  T+
Sbjct: 156 WAGKELPTEAEWEFAARGGLDGEEFAWGDALMPGGKHMANTWQGNFPVQNLREDGFERTS 215

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTD---------- 300
           PVM++  N +GL++M+GNVWEWT+DWW+  H      P   P+ P  G +          
Sbjct: 216 PVMAFPPNGYGLHDMIGNVWEWTSDWWSSKHTADAAKPCCIPRNPRGGREEASFDPCQPD 275

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDK 351
                +V KGGS+LC   YC R+R AAR     D+S  ++GFRC   +GP    +K
Sbjct: 276 IRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHVGFRCVV-RGPIDTREK 330


>gi|410984718|ref|XP_003998673.1| PREDICTED: sulfatase-modifying factor 2 [Felis catus]
          Length = 300

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 136/338 (40%), Positives = 176/338 (52%), Gaps = 72/338 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F+MGTN P   +DGE P R +T+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 29  MVQLPGGRFQMGTNSPD-GRDGEGPVREMTVKPFAIDIFPVTNKDFREFVREKKYRTEAE 87

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFG +FVFE  +S+E R K++                     H +  + W        W 
Sbjct: 88  KFGWSFVFEDFVSDELRNKVA---------------------HQMESVLW--------W- 117

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E             R F    W  P G  S I  R+  PVVHVSWNDA AYC W
Sbjct: 118 ---LPVE-------------RAF----WRQPAGPGSGIRERLELPVVHVSWNDARAYCAW 157

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL  + +PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 158 RGKRLPTEEEWEFAARGGLTGQAYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 214

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYL--C 310
           V ++  +N +GLY+++GNVWEWTA                 PTT  D +V +G S++   
Sbjct: 215 VNAFPPQNNYGLYDLMGNVWEWTAS--------------PYPTTDQDMRVLRGASWIDTA 260

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           +    +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 261 DGSANHRARVTTRMGNTPDSASDNLGFRCASSAGRLPG 298



 Score = 40.8 bits (94), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 22/50 (44%), Positives = 31/50 (62%), Gaps = 3/50 (6%)

Query: 345 PTTGTD-KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
           PTT  D +V +G S++   +    +R R   R  NTPDS++ NLGFRCA+
Sbjct: 242 PTTDQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAS 291


>gi|375010720|ref|YP_004987708.1| hypothetical protein [Owenweeksia hongkongensis DSM 17368]
 gi|359346644|gb|AEV31063.1| hypothetical protein Oweho_0039 [Owenweeksia hongkongensis DSM
           17368]
          Length = 363

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 131/352 (37%), Positives = 183/352 (51%), Gaps = 56/352 (15%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +E+ + MV +P   +R+G+      K  E PS  V +  FY+D+ EV+N +F+EFV ATG
Sbjct: 50  LEKVEGMVWIPSGVYRVGSEDKHANKS-EGPSHEVKVSGFYMDETEVTNDEFREFVDATG 108

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKI---SQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           Y+T AE+        P+  EE + ++   +   HD     G        +  P    +  
Sbjct: 109 YLTVAER--------PVDWEEIKQQLPPGTPKPHDSILQPG------SMIFQPGPTSNLY 154

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
           D   +  W+                           W HP G +S I+ + ++PVVH+++
Sbjct: 155 DISQWWAWKIG-----------------------ADWQHPHGPNSDIKKKGDYPVVHIAF 191

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
            DA AY  W G RLPTEAEWE   RGG   + F WG+ LTP GE+ AN +QG FP  NTA
Sbjct: 192 EDAQAYAKWAGKRLPTEAEWEIAARGGEPAKPFAWGDKLTPMGEYLANFFQGTFPDGNTA 251

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-------YNPKGPTTG 298
            DGY ++APV SY  N +GLY+M+GNVWEWT+DW+    H            NP GP+T 
Sbjct: 252 LDGYGNSAPVKSYAPNGYGLYDMIGNVWEWTSDWYRPDTHVQNKQAGLKGCINPNGPSTS 311

Query: 299 TD--------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            D        +V KGGS+LC+E+YC  +R +AR     DS   +LGFRC  +
Sbjct: 312 YDPQEPLVPKRVTKGGSFLCSEEYCSNYRPSARMATAYDSGQEHLGFRCVKN 363


>gi|373956332|ref|ZP_09616292.1| Sulphatase-modifying factor protein [Mucilaginibacter paludis DSM
           18603]
 gi|373892932|gb|EHQ28829.1| Sulphatase-modifying factor protein [Mucilaginibacter paludis DSM
           18603]
          Length = 392

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 136/384 (35%), Positives = 179/384 (46%), Gaps = 81/384 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+   TF MG +      D E+P   VT+++F++D  EV+N QF +FV ATGY+T AE
Sbjct: 77  MVLIKSGTFMMGGDNK-QAADDEYPKHKVTVNSFWIDATEVTNAQFAKFVKATGYLTTAE 135

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D           E  K     +  K  E L          P   +S  D   +  W+
Sbjct: 136 RKPDW----------EELKKQLPPNTPKPDESLLVPASLIFKQPGKPVSLTDYSQWWVWQ 185

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                      W HP G  S I+ + N+PVVHVSW DA AYC W
Sbjct: 186 KG-----------------------ADWKHPHGPSSNIKGKDNYPVVHVSWYDAQAYCKW 222

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE+  RGGL N ++PWGN     G  +AN WQG FP  NT  D +   AP
Sbjct: 223 AGKRLPTEAEWEWAARGGLSNNIYPWGNEPVDAGLPKANTWQGNFPNTNTVRDHFFYLAP 282

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-----NPKGPTTGTDKVKKGGSYL 309
           V S+K N +GLY+M GNVWEW AD++N  ++   +      NP+GP           SY 
Sbjct: 283 VASFKPNGYGLYDMAGNVWEWCADYYNNTYYATINKANGVSNPRGP---------ANSYD 333

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHR 369
            +E Y  +                                 +V +GGS+LCN+ YC  +R
Sbjct: 334 PDEPYAKK---------------------------------RVIRGGSFLCNDTYCSGYR 360

Query: 370 CAARSQNTPDSSAGNLGFRCAADV 393
            A R ++T DS   +LGFRC  D 
Sbjct: 361 VARRMKSTEDSGMEHLGFRCVKDA 384


>gi|260061739|ref|YP_003194819.1| hypothetical protein RB2501_09060 [Robiginitalea biformata
           HTCC2501]
 gi|88785871|gb|EAR17040.1| hypothetical protein RB2501_09060 [Robiginitalea biformata
           HTCC2501]
          Length = 380

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 130/341 (38%), Positives = 175/341 (51%), Gaps = 47/341 (13%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P  +F MG        D E P R+V +  F++D+ EV+N +F  FV ATGY T AE
Sbjct: 72  MVYIPSGSFLMGGKSDQASPD-ELPRRSVQVAGFFMDRTEVTNREFAAFVDATGYETVAE 130

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K        P+  E  + ++                       P   +     V + T R
Sbjct: 131 K--------PIDWEAMKTQLP----------------PGTPKPPDSLLQAGSLVFHATDR 166

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L   ++W +           P  +W HPEG +S +E RM+HPVVH++ +DA+AY  W
Sbjct: 167 PVDLRDLSQWWHWT---------PGANWKHPEGPESDLEGRMDHPVVHIALDDALAYARW 217

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE+   GGLE+  +PWGN        +AN WQG FP +N  ADGYL TAP
Sbjct: 218 AGKRLPTEAEWEWASMGGLEDPKYPWGNAPISESADKANFWQGIFPYSNEQADGYLRTAP 277

Query: 255 VMSYKENKFGLYNMVGNVWEW-----TADWWNVHHHPAPSYNPKGPTTGTDK-------- 301
           V +Y  N +GL +M GNVWEW      +D + +   P    NP+GP T  D         
Sbjct: 278 VGTYPANGYGLVDMAGNVWEWCQDKYRSDTYQLETTPTDRMNPRGPDTSFDPSEPLAEKY 337

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           V +GGSYLCN+ YC  +R + R + T DS+ G+ GFRC  D
Sbjct: 338 VIRGGSYLCNDSYCSGYRVSRRMRATRDSAFGHTGFRCVKD 378


>gi|301780842|ref|XP_002925845.1| PREDICTED: sulfatase-modifying factor 2-like [Ailuropoda
           melanoleuca]
          Length = 300

 Score =  218 bits (554), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 133/338 (39%), Positives = 176/338 (52%), Gaps = 70/338 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV LPG  F+MGTN P   +DGE P R VT+  F +D   V+N  F+EFV    Y TEA
Sbjct: 28  NMVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 86

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE  +S+E R K++                     H +  + W        W
Sbjct: 87  EMFGWSFVFEDFVSDELRNKVT---------------------HQMESVLW--------W 117

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  R+  PV+HVSWNDA AYC 
Sbjct: 118 ----LPVE-------------RAF----WRQPAGPGSGIRERLELPVLHVSWNDARAYCA 156

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +L+PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 157 WRGKRLPTEEEWEFAARGGLKGQLYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVS 213

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
           PV ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   
Sbjct: 214 PVNAFPPQNNYGLYDLMGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTA 260

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           +    +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 261 DGSANHRARVTTRMGNTPDSASDNLGFRCASSAGRLRG 298


>gi|383767542|ref|YP_005446524.1| putative sulfatase-modifying factor [Phycisphaera mikurensis NBRC
           102666]
 gi|381387811|dbj|BAM04627.1| putative sulfatase-modifying factor [Phycisphaera mikurensis NBRC
           102666]
          Length = 322

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 134/341 (39%), Positives = 175/341 (51%), Gaps = 46/341 (13%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P PP    + MVL+PG TF MGT+ P  + + E P+R V +DAF+LD+  V+N +F  FV
Sbjct: 26  PGPP----RGMVLVPGGTFAMGTDDPRGMPN-ERPAREVAVDAFFLDRAPVTNAEFAAFV 80

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
            ATGYVTEAE+  D  V    L  +          +M     L  T       PV     
Sbjct: 81  EATGYVTEAERPVDWEV----LKTQVPPGTPAPPPEMLLPGSLVFTPPPPGSGPV---DL 133

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
            D   + +W                           SW HP+G  S ++ R +HPVV VS
Sbjct: 134 RDLSRWWSWTAG-----------------------ASWRHPDGPGSDLDGRGDHPVVQVS 170

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           ++DA AY  W G RLPTEAEWE+  RGG + R F WG    P G   AN + G FP NNT
Sbjct: 171 FHDAEAYAAWAGKRLPTEAEWEFAARGGSDTRFF-WGEQFRPGGRFMANTFTGSFPLNNT 229

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
           AAD +   +PV S+  N +GL +M GNVW+WT+ +          Y   G    + +V K
Sbjct: 230 AADRFADRSPVGSFPANGYGLVDMAGNVWQWTSSF----------YGAGGRVDRSQRVIK 279

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           GGSYLC+  YC  +R  AR   TPD+ +G++GFRCA D  P
Sbjct: 280 GGSYLCHADYCESYRPPARRPLTPDTGSGHVGFRCAWDPPP 320


>gi|50758160|ref|XP_415788.1| PREDICTED: sulfatase-modifying factor 2 [Gallus gallus]
          Length = 311

 Score =  217 bits (553), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 134/334 (40%), Positives = 175/334 (52%), Gaps = 70/334 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F+MG++     +D E P R VT+  F +D+  V+N  F+EFV    Y TEAE
Sbjct: 27  MVRLPGGRFQMGSSS-TQSRDEEGPIREVTVKPFAIDKFPVTNRDFREFVREKKYKTEAE 85

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +SEE + KI+Q          L+S              W        W 
Sbjct: 86  AFGWSFVFEDFVSEELKKKITQ---------KLESA------------PW--------W- 115

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R++HPV+HVSWNDA A+C W
Sbjct: 116 ---LPVEKAF-----------------WRQPAGPGSGIADRLDHPVLHVSWNDAQAFCRW 155

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G RLPTE EWE+  RGGLE RL+PWGN   P   +R N+WQG+FP  +TA DGY   +P
Sbjct: 156 KGKRLPTEEEWEFAARGGLEQRLYPWGNKFQP---NRTNLWQGDFPRGDTAEDGYHGVSP 212

Query: 255 VMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK---VKKGGSYLC 310
           V ++  +N +GLY+++GN WEWTA            Y P GP     +   V +G S++ 
Sbjct: 213 VAAFSPQNSYGLYDLLGNTWEWTAS----------QYTPPGPPRPRAEAMHVLRGASWID 262

Query: 311 NEQYCYRHRCA--ARSQNTPDSSAGNLGFRCAAD 342
                  HR +   R  NTPDS++ NL FRCAAD
Sbjct: 263 TVDGSANHRASITTRMGNTPDSASDNLSFRCAAD 296


>gi|344289891|ref|XP_003416674.1| PREDICTED: sulfatase-modifying factor 2-like [Loxodonta africana]
          Length = 297

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 176/337 (52%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +V LPG  F+MGT+ P   +DGE P R VT+  F +D   V+N  F+EFV A  Y TEAE
Sbjct: 26  VVQLPGGRFQMGTDSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVRANKYRTEAE 84

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +   H MK                   + W          
Sbjct: 85  SFGWSFVFEDFVSDELRKKAT---HQMKSL-----------------LWW---------- 114

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+  PV+HVSWNDA AYC W
Sbjct: 115 ---LPVEKAF-----------------WRQPAGPGSGIRERLELPVLHVSWNDARAYCAW 154

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 155 RGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDRAEDGFHGVSP 211

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P  P +   +V +G S++   +
Sbjct: 212 VNAFPPQNNYGLYDLMGNVWEWTA-------------SPYQPASQDMRVLRGASWIDTAD 258

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 259 GSANHRARVTTRMGNTPDSASDNLGFRCASSAGRPRG 295


>gi|417409307|gb|JAA51164.1| Putative sulfatase-modifying factor 2, partial [Desmodus rotundus]
          Length = 282

 Score =  217 bits (552), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 138/341 (40%), Positives = 177/341 (51%), Gaps = 76/341 (22%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV LPG  F+MGTN     +DGE P R VT+  F +D   V+N  F+EFV    Y TEA
Sbjct: 10  NMVQLPGGRFQMGTNAQDG-RDGEGPVREVTVKPFSIDIFPVTNKDFREFVREKKYRTEA 68

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           EKFG +FVFE  +S+E R K +Q                     PV+   W        W
Sbjct: 69  EKFGWSFVFEDFVSDELRNKATQ------------------QMKPVL---W--------W 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  R+  PVVHVSWNDA AYC 
Sbjct: 100 ----LPVE-------------RAF----WRQPAGPGSGIRERLELPVVHVSWNDARAYCA 138

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 139 WRGKRLPTEEEWEFAARGGLKGQVYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVS 195

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD---KVKKGGSYL 309
           PV ++  +N +GLY++VGNVWEWTA                 P  G D   +V +G S++
Sbjct: 196 PVNAFPPQNNYGLYDLVGNVWEWTAS----------------PYQGADQDMRVLRGASWI 239

Query: 310 --CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
              +    +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 240 DTSDGSANHRARVTTRMGNTPDSASDNLGFRCASSIGRLPG 280


>gi|209731560|gb|ACI66649.1| Sulfatase-modifying factor 2 precursor [Salmo salar]
          Length = 302

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 131/335 (39%), Positives = 177/335 (52%), Gaps = 71/335 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG    MGTN     +DGE P++ VT+D F +D++ V+N+ F+EFV A  Y TEAE
Sbjct: 34  MVQIPGGKMTMGTNSADG-RDGESPTKEVTVDPFRIDKYPVTNSDFREFVRAQKYKTEAE 92

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVF+  +SEE ++K+++                               +    W 
Sbjct: 93  TFGWSFVFQDFVSEELKSKVTE------------------------------KIESAPWW 122

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              +P E             R+F    W  P G  S I  R++ PVV VSWNDA A+C+W
Sbjct: 123 ---MPVE-------------RVF----WRQPAGPGSGIRERLDSPVVQVSWNDAQAFCSW 162

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLP+E EWE+  RGGL+ R +PWGN    +   R+N+WQG FP  +TA DGY   AP
Sbjct: 163 RGRRLPSEEEWEWAARGGLQGRTYPWGNKFQSK---RSNLWQGLFPDGDTAEDGYHGIAP 219

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           V ++  +N FGLY+M+GN WEWT+        P P   P         V +GGS++  E 
Sbjct: 220 VTAFPPQNSFGLYDMMGNAWEWTS-------TPFPGGRPM-------FVLRGGSWIDTED 265

Query: 314 YC--YRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
                + R   R  NTPDS++ NLGFRCAA K  T
Sbjct: 266 GSANRKARITTRMGNTPDSASDNLGFRCAARKDKT 300


>gi|284036352|ref|YP_003386282.1| hypothetical protein Slin_1433 [Spirosoma linguale DSM 74]
 gi|283815645|gb|ADB37483.1| protein of unknown function DUF323 [Spirosoma linguale DSM 74]
          Length = 358

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 125/339 (36%), Positives = 180/339 (53%), Gaps = 54/339 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMVL+PG TF MG ++     D   P  +VT+  F++D+HEV+N +F  FV+ATGY T A
Sbjct: 65  DMVLIPGGTFLMGADE---FPDAR-PVHSVTVKPFWMDEHEVTNAEFARFVAATGYKTVA 120

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+        PL              +   + G+          P   +    AV     
Sbjct: 121 ER--------PL--------------NPADYPGV----------PAGDLVPGSAVFTPPA 148

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           +   L    +W    +G          SW HP G  S+++   N PV+HVS+ DA+AY  
Sbjct: 149 QKVSLANPLQWWEYVKGA---------SWNHPNGPKSSLKGHENEPVIHVSYEDAMAYAK 199

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+  +GG  NR + WG+ L P G+  AN++QG+FP+ NT  DG+   A
Sbjct: 200 WAGKRLPTEAEWEFAAQGGKGNRTYYWGDELKPSGKWIANIYQGDFPSKNTREDGFAGVA 259

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDK--------VKK 304
           PV S+  N +GLY+M GNVWEW  D +   ++  +P  +P+GP    D         V++
Sbjct: 260 PVKSFPANPYGLYDMDGNVWEWCQDLYRPDYYMKSPKMDPQGPPDSYDPEEPGAVKHVQR 319

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           GGS+LC++QYC R++  +R +    S + NLGFRC  +K
Sbjct: 320 GGSFLCSDQYCVRYKAGSRGKGEVSSGSNNLGFRCVRNK 358


>gi|332265464|ref|XP_003281740.1| PREDICTED: sulfatase-modifying factor 2 isoform 1 [Nomascus
           leucogenys]
          Length = 302

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R VT+  F +D   V+N  F++FV    Y TEAE
Sbjct: 31  MVQLQGGRFLMGTNSPD-SRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 89

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 90  MFGWSFVFEDFVSDELRNKATQ---------------------PMKPVLW--------W- 119

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 120 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 159

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLP E EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 160 RGKRLPMEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 216

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 217 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 263

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 264 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 300



 Score = 41.2 bits (95), Expect = 0.80,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 251 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 295


>gi|13474559|ref|NP_106128.1| hypothetical protein mll5466 [Mesorhizobium loti MAFF303099]
 gi|14025313|dbj|BAB51914.1| mll5466 [Mesorhizobium loti MAFF303099]
          Length = 372

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 134/344 (38%), Positives = 174/344 (50%), Gaps = 60/344 (17%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           Y  MV +PG TF MG++        E P+  V +D F++D+  VSN  F+ FV+ATG+VT
Sbjct: 62  YPGMVWIPGGTFLMGSDNHY---PEEAPAHRVRVDGFWMDKFTVSNRDFERFVAATGHVT 118

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AEK  +   +   L +                    S +  +   PV      D   + 
Sbjct: 119 LAEKPANPDDYPGALPD---------------LLAPSSMMFRKPAGPV------DLGNHY 157

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          W Y  RG          +W HP G  STI+   +HPVVHV++ D VAY
Sbjct: 158 NW----------WVY-VRGA---------NWRHPRGPASTIKKVADHPVVHVAYEDVVAY 197

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWE+  RGGL+   + WGN LTP G+H AN+WQG+FP  NT  DGY  
Sbjct: 198 ANWAGKELPTEAEWEFAARGGLDAAEYVWGNELTPAGKHMANIWQGDFPYRNTVDDGYEY 257

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH---------------HPAPSYNPKGPT 296
           TAPV S+  N +GLY+M GNVW+WT DW+  H                H   SY+ + P 
Sbjct: 258 TAPVGSFPANDYGLYDMAGNVWQWTTDWYQDHKAIDSPCCTAVNPRGGHREASYDTRLPD 317

Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                KV KGGS+LC   YC R+R AAR     D++  +LGFRC
Sbjct: 318 VKIPRKVTKGGSHLCAPNYCRRYRPAARMAQPVDTAISHLGFRC 361


>gi|194248088|ref|NP_056226.2| sulfatase-modifying factor 2 isoform b precursor [Homo sapiens]
 gi|119628383|gb|EAX07978.1| sulfatase modifying factor 2, isoform CRA_d [Homo sapiens]
          Length = 320

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 49  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 137

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 138 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 177

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 178 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 234

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 235 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 281

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 282 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 318


>gi|399027142|ref|ZP_10728733.1| hypothetical protein PMI10_00545 [Flavobacterium sp. CF136]
 gi|398075378|gb|EJL66496.1| hypothetical protein PMI10_00545 [Flavobacterium sp. CF136]
          Length = 377

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 131/359 (36%), Positives = 180/359 (50%), Gaps = 67/359 (18%)

Query: 7   PPVERY-KDMVLLPGDTFRMGTNKPI--------LIKDGEFPSRNVTLDAFYLDQHEVSN 57
           PP+ +  K MV +PG  F MG+N           + KD   P   V +D +Y+D+ EV+N
Sbjct: 60  PPINKTPKGMVWIPGGEFSMGSNVEDESLCSIKGVTKDAA-PIHRVYVDGYYMDKTEVTN 118

Query: 58  TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
            QF+ FV ATGY+T AEK                                  TIE   + 
Sbjct: 119 EQFEAFVKATGYITLAEK--------------------------------KPTIEEYPNV 146

Query: 118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
           PV ++    AV   T     L    +W +   GG +        W HPEG  S+I+ +  
Sbjct: 147 PVENLIAGSAVFTPTPAKVDLNNYMQW-WSYVGGAD--------WRHPEGPMSSIKGKEK 197

Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
           +PVV VS+ DA AY  W G RLPTEA+WE+  RGG    L+ WGN L P+G+ +AN++QG
Sbjct: 198 YPVVQVSYEDAAAYAKWAGKRLPTEAQWEFAARGGKTGELYAWGNTLKPKGKFQANIYQG 257

Query: 238 EFPT--NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY----- 290
            FP    +T  DGY+  AP   +  N +GLY++ GNVWEWT DW+   ++   S      
Sbjct: 258 HFPMEKGDTGEDGYIGIAPTAKFAPNGYGLYDVGGNVWEWTNDWYTADYYSTLSQNGEIA 317

Query: 291 -NPKGPTTGTD--------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            NPKGP +  D        KV++GGS+LC +QYC R+    R +    S A ++GFRC 
Sbjct: 318 RNPKGPDSSYDPTEPGLPKKVQRGGSFLCTDQYCTRYMVGTRGKGDYQSPANHIGFRCV 376


>gi|426254649|ref|XP_004020989.1| PREDICTED: sulfatase-modifying factor 2 [Ovis aries]
          Length = 301

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 175/338 (51%), Gaps = 70/338 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV LPG  F+MGTN P   +DGE P R VT+  F +D   V+N  F+EFV    Y TEA
Sbjct: 29  NMVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 87

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE L+S+E R K +Q                 RM             +   W
Sbjct: 88  EVFGWSFVFEDLVSDELRNKATQ-----------------RMQ------------SLLWW 118

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  ++  PVVHVSWNDA AYC 
Sbjct: 119 ----LPVE-------------RAF----WRQPAGPGSGIREKLEFPVVHVSWNDARAYCA 157

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 158 WRGKRLPTEEEWEFAARGGLKGQVYPWGNKFQP---NRTNLWQGKFPKGDKAEDGFHGVS 214

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
           PV ++  +N +GLY+++GN+WEWTA  +                    +V +G S++   
Sbjct: 215 PVNAFPPQNDYGLYDLMGNIWEWTASQYQAADQDM-------------RVLRGASWIDTA 261

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           +    +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 262 DGSANHRARVTTRMGNTPDSASDNLGFRCASSAGRPPG 299


>gi|62460456|ref|NP_001014881.1| sulfatase-modifying factor 2 precursor [Bos taurus]
 gi|73621962|sp|Q58CP2.1|SUMF2_BOVIN RecName: Full=Sulfatase-modifying factor 2; AltName:
           Full=C-alpha-formylglycine-generating enzyme 2; Flags:
           Precursor
 gi|61555735|gb|AAX46752.1| sulfatase modifying factor 2 [Bos taurus]
 gi|151556838|gb|AAI49260.1| Sulfatase modifying factor 2 [Bos taurus]
 gi|296473261|tpg|DAA15376.1| TPA: sulfatase-modifying factor 2 precursor [Bos taurus]
          Length = 301

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 133/338 (39%), Positives = 175/338 (51%), Gaps = 70/338 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV LPG  F+MGT+ P   +DGE P R VT+  F +D   V+N  F+EFV    Y TEA
Sbjct: 29  NMVQLPGGRFQMGTDSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 87

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE L+S+E R K +Q                 RM             +   W
Sbjct: 88  EVFGWSFVFEDLVSDELRNKATQ-----------------RMQ------------SLLWW 118

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  ++  PVVHVSWNDA AYC 
Sbjct: 119 ----LPVE-------------RAF----WRQPAGPGSGIREKLEFPVVHVSWNDARAYCA 157

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 158 WRGKRLPTEEEWEFAARGGLKGQVYPWGNKFQP---NRTNLWQGKFPKGDKAEDGFHGVS 214

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
           PV ++  +N +GLY++VGNVWEWTA  +                    +V +G S++   
Sbjct: 215 PVNAFPPQNDYGLYDLVGNVWEWTASQYQAADQDM-------------RVLRGASWIDTA 261

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           +    +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 262 DGSANHRARVTTRMGNTPDSASDNLGFRCASGAGRPPG 299


>gi|255036936|ref|YP_003087557.1| hypothetical protein Dfer_3179 [Dyadobacter fermentans DSM 18053]
 gi|254949692|gb|ACT94392.1| protein of unknown function DUF323 [Dyadobacter fermentans DSM
           18053]
          Length = 351

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 126/339 (37%), Positives = 176/339 (51%), Gaps = 54/339 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV L G  F MG ++     D   P   VT++ FY+D HEV+N +F  FV AT Y T A
Sbjct: 57  EMVWLSGGKFLMGADE---FPDSR-PMHEVTINGFYMDAHEVTNAEFAAFVKATNYKTVA 112

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  +   + P +  ++    S V         LD+ ++                    W
Sbjct: 113 ERPLNPADY-PGVPADKLVPGSAVFTPTPTRVSLDNPLQ--------------------W 151

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y           P  +W HPEG  S+I+ R N+PVVHVS+ DA AY  
Sbjct: 152 ----------WNY----------VPGANWAHPEGPSSSIKGRENYPVVHVSYEDAAAYAK 191

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+  +GG  N  + WG  L P G+  AN++QG FP  NT  DGYL+ A
Sbjct: 192 WAGKRLPTEAEWEFAAQGGKGNHTYYWGEELKPGGKWVANIYQGSFPDKNTKEDGYLTAA 251

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTD--------KVKK 304
           P+ S+  N +GLY+M GNVWEW  D +   ++  +P  NP+GP+   D        +V++
Sbjct: 252 PIRSFPANPYGLYDMDGNVWEWCEDLYRPDYYQKSPGVNPRGPSDSYDPDEPGAVKRVQR 311

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           GGS+LC++ YC R++  +R +    S + NLGFRC  DK
Sbjct: 312 GGSFLCSDDYCIRYKAGSRGKGEVTSGSNNLGFRCVMDK 350


>gi|282897815|ref|ZP_06305812.1| Protein of unknown function DUF323 [Raphidiopsis brookii D9]
 gi|281197299|gb|EFA72198.1| Protein of unknown function DUF323 [Raphidiopsis brookii D9]
          Length = 341

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 126/343 (36%), Positives = 177/343 (51%), Gaps = 63/343 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + M+L+PG T +MGT+  +  +  E PS+NVT+D+F +D++EV+N +F +FV+ATGY+T 
Sbjct: 47  QGMILIPGGTLKMGTDNSVFAE--ERPSKNVTVDSFCMDKYEVTNAEFAKFVTATGYITV 104

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE       F P L+E++R+  S V                                +  
Sbjct: 105 AELPLSKEQF-PDLTEQQRSPGSLV--------------------------------FKM 131

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
            R   +P  + W +           P  +W HP G  S I    N+PVVH+++ DA+ + 
Sbjct: 132 PRQGEIPFLSWWHW----------LPGANWQHPLGNKSEILD--NYPVVHIAYQDALVFA 179

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEA+WEY  RGGLE+  + WGN  +   E +AN WQG FP  NT  DGY+  
Sbjct: 180 DWLGKSLPTEAQWEYAARGGLEDATYTWGNKYS---EIKANTWQGTFPFLNTKVDGYVGL 236

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-------------PSYNPKGPTTGT 299
           APV S+  N +GLY+M GNVWEWT DW+ V    +              S++P  P    
Sbjct: 237 APVGSFPANGYGLYDMAGNVWEWTVDWFTVEDKHSVNQVSEHEISKTRKSFDPNKPGETL 296

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             V KGGSYLC   YC R+R AAR    PD+   ++G R   +
Sbjct: 297 LHVIKGGSYLCAPNYCSRYRPAARESQAPDTGTNHIGLRLVKN 339


>gi|332664781|ref|YP_004447569.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332333595|gb|AEE50696.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 368

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 130/340 (38%), Positives = 177/340 (52%), Gaps = 54/340 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV +PG TF MG   P  ++D E     V +D F++D  EV+N QFQEFV+ATGY+T+A
Sbjct: 68  NMVWIPGGTFTMGGADPQALQD-EKHLHEVHIDGFWMDVSEVTNAQFQEFVTATGYLTQA 126

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+            E +R  +  +         +DS        P V  SW        +
Sbjct: 127 ERN----------PEAQRMMMPGI--------AIDSL-------PKVPYSWCFQSPKNQY 161

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
              RL  E  W             P   W HP+G  S+I  + NHPVVHVSW DA+AYC 
Sbjct: 162 --GRLQPEDWWT----------AVPGADWQHPQGSGSSIVGKENHPVVHVSWYDAIAYCK 209

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RL TEAEWEY  RGG +   +PWG+ L  +   +AN WQG+FP+     DG++ TA
Sbjct: 210 WAGKRLATEAEWEYAARGGYDKNSYPWGDVLNFK---KANYWQGDFPSLFLNRDGFVKTA 266

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD-------- 300
           PV  +  N +GLY+M GNVWEW  DW++  ++          NP GP    D        
Sbjct: 267 PVKHFAPNPYGLYDMAGNVWEWCLDWYDFDYYVQGKRAKIVSNPLGPKISADPDAKDAPL 326

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           KV +GGS+LC+E YC  +R + R ++ P + + + GFRC 
Sbjct: 327 KVIRGGSFLCSESYCSGYRVSTRMKSAPCTGSEHAGFRCV 366


>gi|296228504|ref|XP_002759839.1| PREDICTED: sulfatase-modifying factor 2 [Callithrix jacchus]
          Length = 275

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 176/340 (51%), Gaps = 73/340 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ---EFVSATGYVT 71
           MV LPG  F MGTN P   +DGE P R VT+  F +D   V+N +F+   +F+    Y T
Sbjct: 1   MVQLPGGRFLMGTNSPD-SRDGEGPVREVTVKPFAIDIFPVTNKEFRYIRDFIREKKYRT 59

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAE FG +FVFE  +S+E R K +Q                     P+  + W       
Sbjct: 60  EAEMFGWSFVFEGFVSDELRNKATQ---------------------PMKPVLW------- 91

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W    LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AY
Sbjct: 92  -W----LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAY 129

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           C WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+  
Sbjct: 130 CAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHG 186

Query: 252 TAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL- 309
            +PV ++  +N +GLY+++GNVWEWTA             +P        +V +G S++ 
Sbjct: 187 VSPVNAFPAQNSYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWID 233

Query: 310 -CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
             +    +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 234 TADGSANHRARVTTRMGNTPDSASDNLGFRCAADTGRPPG 273



 Score = 41.2 bits (95), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 224 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADT 268


>gi|320105153|ref|YP_004180744.1| hypothetical protein Isop_3638 [Isosphaera pallida ATCC 43644]
 gi|319752435|gb|ADV64195.1| protein of unknown function DUF323 [Isosphaera pallida ATCC 43644]
          Length = 411

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 138/346 (39%), Positives = 176/346 (50%), Gaps = 57/346 (16%)

Query: 7   PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           PP    + MV +PG  F  G        D + P R V +D F++DQHEV+N Q+  FV+A
Sbjct: 107 PPQTTPEGMVWIPGGVFLQGDPD---FPDAQ-PRRWVGVDGFWIDQHEVTNRQYAAFVAA 162

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TGY T AE+  D   F P +  ++    S V         LD+                 
Sbjct: 163 TGYQTVAERPLDPRDF-PGVPPDQLQPGSIVFTPPSGPVPLDNVTR-------------- 207

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
                 W          W Y           P   W HPEG  STIE R +HP VH+++ 
Sbjct: 208 ------W----------WIY----------TPGACWRHPEGPGSTIEGREDHPAVHLAYE 241

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTA 245
           DA AY  W G RLPTEAEWEY  RGGL ++ F WG  L P G    AN+WQG FP  N+ 
Sbjct: 242 DAEAYARWAGKRLPTEAEWEYAARGGLVSQPFVWGAELKPEGSPWLANIWQGHFPNENSQ 301

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-SYNPKGPT-------- 296
            DG+  TAPV S+  N FGLY+M GNVWEW ADW++ + +      NP GP         
Sbjct: 302 EDGFARTAPVKSFPPNGFGLYDMAGNVWEWCADWYHPNAYQFDRRLNPSGPPRLASFDPM 361

Query: 297 -TGTDK-VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
             G  K V++GGS+LC +QYC R+R  AR +  P S+A ++GFRCA
Sbjct: 362 EPGVPKRVQRGGSFLCTDQYCTRYRPGARGKGEPGSAASHVGFRCA 407


>gi|171914379|ref|ZP_02929849.1| hypothetical protein VspiD_24410 [Verrucomicrobium spinosum DSM
           4136]
          Length = 290

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 136/333 (40%), Positives = 170/333 (51%), Gaps = 58/333 (17%)

Query: 25  MGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEP 84
           MG N    ++  E P   + +D F++D+HEV+N QF+EFV ATGYVT +E+      F  
Sbjct: 1   MGDN----VRREESPVHEIAVDGFWMDEHEVTNAQFEEFVKATGYVTTSERVPKVEDFP- 55

Query: 85  LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEW 144
                               E L   I+     P   +    AVA        L    EW
Sbjct: 56  --------------------ERLRPMIKPDQLKPGAVVFTPPAVA-----PPNLDNFLEW 90

Query: 145 EYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAE 204
                G           W HPEG  S+I+ RMNHPVV VSW DA AY  W G RLPTEAE
Sbjct: 91  WRYIHGA---------DWRHPEGPSSSIKERMNHPVVCVSWKDAEAYAKWAGKRLPTEAE 141

Query: 205 WEYGCRGGL---ENRLFPWGNNLTPRGEHR---ANVWQGEFPTNNTAADGYLSTAPVMSY 258
           WEY  RGG+   +   F WG N    GE R   AN W G FP  + A DG+  TAPV S+
Sbjct: 142 WEYAARGGVGPTKRNTFTWGEN----GEQRVKMANTWTGSFPVEDRAVDGFKGTAPVKSF 197

Query: 259 KENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTD--------KVKKGGSYL 309
             N+FGLY+M GNVWE+ ADW++  +   +P  NP GP+ G D        KV KGGS+L
Sbjct: 198 PPNEFGLYDMAGNVWEFVADWFHPDYFRNSPKLNPPGPSVGYDPNEPDLPKKVIKGGSFL 257

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C+  YC  +R  +R  N PDSSA + GFRC  +
Sbjct: 258 CSPLYCTGYRPGSRMFNDPDSSADHTGFRCVKN 290


>gi|197102978|ref|NP_001125223.1| sulfatase-modifying factor 2 precursor [Pongo abelii]
 gi|73621963|sp|Q5RCR5.1|SUMF2_PONAB RecName: Full=Sulfatase-modifying factor 2; AltName:
           Full=C-alpha-formylglycine-generating enzyme 2; Flags:
           Precursor
 gi|22761650|dbj|BAC11643.1| unnamed protein product [Homo sapiens]
 gi|32527653|gb|AAP86218.1| sulfatase modifying factor 2 [Homo sapiens]
 gi|37183323|gb|AAQ89461.1| ARHG1968 [Homo sapiens]
 gi|54261539|gb|AAH84539.1| Sulfatase modifying factor 2 [Homo sapiens]
 gi|55727373|emb|CAH90442.1| hypothetical protein [Pongo abelii]
 gi|109730303|gb|AAI11093.1| Sulfatase modifying factor 2 [Homo sapiens]
 gi|119628384|gb|EAX07979.1| sulfatase modifying factor 2, isoform CRA_e [Homo sapiens]
          Length = 301

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 30  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 89  MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 118

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 119 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 158

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 159 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 215

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 216 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 262

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 263 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 299



 Score = 41.2 bits (95), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 250 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 294


>gi|61680510|pdb|1Y4J|A Chain A, Crystal Structure Of The Paralogue Of The Human
           Formylglycine Generating Enzyme
 gi|61680511|pdb|1Y4J|B Chain B, Crystal Structure Of The Paralogue Of The Human
           Formylglycine Generating Enzyme
          Length = 284

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 4   MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 62

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 63  MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 92

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 93  ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 132

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 133 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 189

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 190 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 236

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 237 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 273



 Score = 41.2 bits (95), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 224 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 268


>gi|424875872|ref|ZP_18299531.1| hypothetical protein Rleg5DRAFT_0212 [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393163475|gb|EJC63528.1| hypothetical protein Rleg5DRAFT_0212 [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 315

 Score =  215 bits (548), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 126/326 (38%), Positives = 172/326 (52%), Gaps = 54/326 (16%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           + G    +GT++P+L +DGE P R+V ++AF L    V+N +F  FV ATGYVTEAE+ G
Sbjct: 37  IEGGRVSVGTSRPLLPQDGEGPVRSVRINAFELSVGAVTNEEFGRFVEATGYVTEAERLG 96

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
            ++VF                     ++ LD       ++P   +               
Sbjct: 97  WSYVF---------------------YKFLDG------NNPTSGV--------------- 114

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
               AEW     G          +W  P G  S++E   N PVVH+SW DA  YC W G 
Sbjct: 115 --VGAEWWRKVEGA---------NWAFPSGPGSSLEKLGNFPVVHISWQDASDYCHWAGG 163

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
           RLP+EAEWE+  RGGL+  ++PWG+          N+WQG FP  +TAADGY   AP +S
Sbjct: 164 RLPSEAEWEHAARGGLDQPVYPWGDEQPTETFTPCNIWQGRFPDLDTAADGYAGLAPALS 223

Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCY 316
           ++ N +GL+NMVGN WEWTAD + V     A +           +V KGGSYLC+  YC+
Sbjct: 224 FEPNGYGLHNMVGNCWEWTADPFRVRSLRKAANSANSASKQEWKRVLKGGSYLCHTSYCH 283

Query: 317 RHRCAARSQNTPDSSAGNLGFRCAAD 342
           R+R AAR  +T DS+ G++GFR A D
Sbjct: 284 RYRIAARMGSTGDSTTGHIGFRIAFD 309


>gi|332662774|ref|YP_004445562.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332331588|gb|AEE48689.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 381

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 128/341 (37%), Positives = 172/341 (50%), Gaps = 48/341 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +    F+MG +      D E+P   V +  F++D  EV+N QFQ FV ATGYVT AE
Sbjct: 77  MVEIQAGQFQMGGDNAQARPD-EYPKHAVEVKGFWMDATEVTNAQFQRFVDATGYVTVAE 135

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D          EE  K      ++   E L          P  +I            
Sbjct: 136 RAIDL---------EELKKQVPPGTELPSAEELQPFSLVFTAPPSGNI------------ 174

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
             +  + A+W    +G          +W  PEG  S ++ +  HPVVHV W DA+AYC W
Sbjct: 175 --QTLSPADWWQMVKGA---------NWRQPEGPGSNLKGKDKHPVVHVCWYDAIAYCKW 223

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY  RGG    ++PWG  + P    +AN +QGEFP      DG+   +P
Sbjct: 224 AGKRLPTEAEWEYAGRGGQSEAIYPWG--MEPVSPQKANFFQGEFPVKQLNEDGFQRLSP 281

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH------------PAPSYNPKGPTTGTDKV 302
           V S+  N +GLY+M GNVWEW ADW+   ++            PA SY+P+ P     KV
Sbjct: 282 VKSFAANPYGLYDMAGNVWEWCADWYRPDYYAQNKGTLRNPQGPADSYDPEEP-NAPKKV 340

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
            +GGS+LCN+ YC  +R AAR + +PD+   + GFRC  DK
Sbjct: 341 VRGGSFLCNDSYCSGYRVAARMKTSPDTGLEHTGFRCVRDK 381


>gi|13544049|gb|AAH06159.1| SUMF2 protein, partial [Homo sapiens]
          Length = 296

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 25  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 83

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 84  MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 113

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 114 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 153

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 154 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 210

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 211 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 257

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 258 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 294



 Score = 41.2 bits (95), Expect = 0.84,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 245 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 289


>gi|332706795|ref|ZP_08426856.1| hypothetical protein LYNGBM3L_23730 [Moorea producens 3L]
 gi|332354679|gb|EGJ34158.1| hypothetical protein LYNGBM3L_23730 [Moorea producens 3L]
          Length = 296

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 129/337 (38%), Positives = 186/337 (55%), Gaps = 58/337 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TF+MG++   +    E  + +VT+++F +D++EV+N +F +FV  TGY+T AE
Sbjct: 1   MTFIPGGTFKMGSD---VYYPSERSADDVTVESFCIDKYEVTNAEFAKFVKETGYITVAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +   +  F P L+EE+RA  S V      F+  D+ +      PV ++SW          
Sbjct: 58  RPLSSEQF-PNLTEEQRAPGSLV------FQMPDNGVR-----PVGYLSW---------- 95

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W++           P   W HP G  S I+ + N+PVVHV++ D  AY  W
Sbjct: 96  ---------WKWT----------PGADWQHPFGPKSNIKGKKNYPVVHVAYEDVQAYAKW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  +PTEA+WEY   GGL++  + WG   + +   +AN WQG FP  NT ADG+  TAP
Sbjct: 137 AGKSVPTEAQWEYAAHGGLDDATYSWGEQYSAK---KANTWQGFFPFFNTKADGHTGTAP 193

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTT---------GTDK-VK 303
           V S++ N +GLY++ GNVWEWT+DW+++ H+  A S NP GP           G  K V 
Sbjct: 194 VGSFEPNGYGLYDITGNVWEWTSDWYSIGHNGKAHSLNPTGPNQKESFDPREPGVAKHVI 253

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           KGGSYLC   YC R+R AAR   +PD+   ++GFR  
Sbjct: 254 KGGSYLCAPNYCSRYRPAARESQSPDTGTTHIGFRLV 290


>gi|311251202|ref|XP_003124494.1| PREDICTED: sulfatase-modifying factor 2-like isoform 1 [Sus scrofa]
          Length = 301

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 132/337 (39%), Positives = 175/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F+MGTN P   +DGE P R VT+  F +D   V+N  F++FV    Y TEAE
Sbjct: 30  MVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE L+ +E R+K +                     H +  + W        W 
Sbjct: 89  AFGWSFVFEDLVPDELRSKAT---------------------HQMQSLLW--------W- 118

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E             R F    W  P G  S I  R+  PVVHVSWNDA AYC W
Sbjct: 119 ---LPVE-------------RAF----WRQPAGPGSGIRERLEFPVVHVSWNDARAYCAW 158

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 159 RGKRLPTEEEWEFAARGGLKGQVYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 215

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 216 VNAFPPQNNYGLYDLMGNVWEWTA-------------SPYQAADQDMRVLRGASWIDTAD 262

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 263 GSANHRARVTTRMGNTPDSASDNLGFRCASSAGRPPG 299


>gi|355722653|gb|AES07643.1| sulfatase modifying factor 2 [Mustela putorius furo]
          Length = 299

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 132/338 (39%), Positives = 175/338 (51%), Gaps = 70/338 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV LPG  F+MGTN P   +DGE P R VT+  F +D   V+N  F+EFV    Y TEA
Sbjct: 28  NMVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREEKYRTEA 86

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE  +S+E R K++                     H +  + W        W
Sbjct: 87  EMFGWSFVFEDFVSDELRNKVT---------------------HQMESVLW--------W 117

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  ++  PVVHVSWNDA AYC 
Sbjct: 118 ----LPVE-------------RAF----WRQPAGPGSGIREKLELPVVHVSWNDARAYCA 156

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL  +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 157 WRGKRLPTEEEWEFAARGGLNGQVYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVS 213

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
           PV ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   
Sbjct: 214 PVNAFPPQNNYGLYDLMGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTA 260

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           +    +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 261 DGSANHRARVTTRMGNTPDSASDNLGFRCASSAGRLPG 298


>gi|291234208|ref|XP_002737041.1| PREDICTED: sulfatase-modifying factor 2-like [Saccoglossus
           kowalevskii]
          Length = 326

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 125/337 (37%), Positives = 180/337 (53%), Gaps = 64/337 (18%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           ++++++MV L G  F MGT++P   +DGE PSR VT+  F + ++ V+N  F++FV A  
Sbjct: 21  LKKFEEMVELKGGKFIMGTDEPNA-RDGEGPSRKVTVKPFKIMRYPVTNAAFRQFVKAKR 79

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           + TEAE+F  +FVF+  +SEE +A I++                                
Sbjct: 80  FKTEAERFQWSFVFDAFVSEEVKANITE-------------------------------- 107

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                   R+P    W             P   W  PEG  STI  ++N P VH+S NDA
Sbjct: 108 --------RIPNTPWW----------VPVPRAYWRQPEGQGSTITKKLNWPAVHISANDA 149

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
             YC W+G RLPTE EWEY  RGGLE  L+PWG     +   R N+WQG+FP  NT  DG
Sbjct: 150 KQYCQWKGWRLPTEPEWEYAARGGLECELYPWGARYDNK---RMNIWQGKFPEENTKEDG 206

Query: 249 YLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
           Y  ++PV +++ +N +G+Y+MVGN+WEWT+    +   P    NP+ P      V +GGS
Sbjct: 207 YHGSSPVDAFEPQNDYGMYDMVGNLWEWTS---TIFKSPNQDSNPENPML----VLRGGS 259

Query: 308 YLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAAD 342
           Y+ ++   + H  R   R  NT D+ + NLGFRCA++
Sbjct: 260 YIDSKDGAFNHMARVTTRMGNTADAGSDNLGFRCASN 296



 Score = 40.4 bits (93), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/44 (43%), Positives = 28/44 (63%), Gaps = 2/44 (4%)

Query: 352 VKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
           V +GGSY+ ++   + H  R   R  NT D+ + NLGFRCA++V
Sbjct: 254 VLRGGSYIDSKDGAFNHMARVTTRMGNTADAGSDNLGFRCASNV 297


>gi|343959230|dbj|BAK63472.1| sulfatase-modifying factor 2 precursor [Pan troglodytes]
          Length = 301

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 133/337 (39%), Positives = 174/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 30  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 89  IFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 118

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E             R F    W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 119 ---LPVE-------------RAF----WRQPAGPGSGILERLEHPVLHVSWNDARAYCAW 158

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 159 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 215

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 216 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 262

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 263 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 299



 Score = 41.2 bits (95), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 250 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 294


>gi|431798392|ref|YP_007225296.1| hypothetical protein Echvi_3048 [Echinicola vietnamensis DSM 17526]
 gi|430789157|gb|AGA79286.1| hypothetical protein Echvi_3048 [Echinicola vietnamensis DSM 17526]
          Length = 371

 Score =  214 bits (546), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 182/346 (52%), Gaps = 58/346 (16%)

Query: 15  MVLLPGDTFRMGTNK----PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV +PG  F MGT++    P+     E P+    ++ F++D+ EV+N  F++FV  TGYV
Sbjct: 57  MVWIPGGDFMMGTDEQDAYPV-----ERPAVKRHVEGFWMDETEVTNAAFKKFVDETGYV 111

Query: 71  TEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           T AEK  +   + + L  +  +   S ++     F   D  ++ R           D   
Sbjct: 112 TIAEKKPEWEEMKKQLPPDTPKPADSLLQPGSLVFVAPDRPVDTR-----------DISN 160

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
           +  W                        P  SW HPEG  S ++ RM+HPVVH++++DAV
Sbjct: 161 WWKWT-----------------------PGASWKHPEGPGSDLQGRMDHPVVHIAYDDAV 197

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AY  W G RLPTEAEWE+  RGGL  + + WG+ L P+G+  AN +QG FP  N   DG+
Sbjct: 198 AYAKWAGKRLPTEAEWEFAARGGLIEKRYSWGDELRPQGKFMANTFQGRFPNGNQGKDGF 257

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APS--------YNPKGPT 296
           + TAPV +Y  N +GLY+++GNVWE T DW++          AP         YNP  P 
Sbjct: 258 MGTAPVKNYPSNDYGLYDIIGNVWELTDDWFDAIKFQRLTGNAPKLDANMNQCYNPTNPY 317

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
              ++V KGGSYLC++ YC  +R +AR  +  DS + N+GFRC  D
Sbjct: 318 A-KERVIKGGSYLCSDNYCINYRPSARQGHAYDSGSSNVGFRCVKD 362


>gi|392378971|ref|YP_004986130.1| sulfatase-modifying factor 1 [Azospirillum brasilense Sp245]
 gi|356881338|emb|CCD02323.1| sulfatase-modifying factor 1 [Azospirillum brasilense Sp245]
          Length = 349

 Score =  214 bits (545), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 129/335 (38%), Positives = 169/335 (50%), Gaps = 60/335 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG   R+G+++       E P   V++D F++D+ EV+N QF+ FV ATGYVTEAE
Sbjct: 60  MVWIPGGDTRIGSDQHYRE---EGPVTRVSVDGFWMDRTEVTNAQFRRFVEATGYVTEAE 116

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +                  +     D +R  G    ++     P      +D     TW 
Sbjct: 117 RV-----------------VDGTGADARRPSGSVVFVD-----PTSQAGRSD-----TW- 148

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W          RL P  +W HPEG  S I  R NHPVVHV++NDAVAY  W
Sbjct: 149 ---------W----------RLLPEANWRHPEGSGSDIGGRENHPVVHVTYNDAVAYARW 189

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEA+WE+  RGGL+   + WG   TP+G   AN WQG FP  NT  DGY +TAP
Sbjct: 190 LGRELPTEAQWEHAARGGLDGAPYAWGGEFTPQGVAMANTWQGFFPIQNTRGDGYPATAP 249

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT----------DKVKK 304
           V  +  N +GL +M+GNVWEWT+DW+   H      NP GP               +V K
Sbjct: 250 VGCFTANGYGLRDMIGNVWEWTSDWYRPRHDGLGDVNPTGPRQEDSYDPRQPGVPSRVIK 309

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           GGS+LC + +C R+R A+R        A +LGFR 
Sbjct: 310 GGSFLCAQNFCQRYRPASRHPQEVGVGASHLGFRT 344


>gi|296452916|sp|Q8NBJ7.2|SUMF2_HUMAN RecName: Full=Sulfatase-modifying factor 2; AltName:
           Full=C-alpha-formylglycine-generating enzyme 2; Flags:
           Precursor
          Length = 301

 Score =  214 bits (545), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 130/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DG+ P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 30  MVQLQGGRFLMGTNSPD-SRDGDGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 89  MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 118

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 119 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 158

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 159 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 215

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 216 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 262

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 263 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 299



 Score = 41.2 bits (95), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 250 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 294


>gi|254776894|ref|ZP_05218410.1| hypothetical protein MaviaA2_19816 [Mycobacterium avium subsp.
           avium ATCC 25291]
          Length = 296

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 129/337 (38%), Positives = 173/337 (51%), Gaps = 50/337 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++V LPG +FRMG+         E P   VT++ F +++H V+N QF EFV+ATGYVT 
Sbjct: 3   SELVDLPGGSFRMGSTS---FYPEEAPIHTVTVEPFAIERHPVTNAQFAEFVAATGYVTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++         A  SQ++     F      ++ R                  
Sbjct: 60  AEQPIDPALYP-------GADPSQLQPGAMVFRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR        +W +           P  SW HP G DS I  R +HPVV V++ DA AY 
Sbjct: 96  WR--------QWWH---------WVPGASWRHPSGPDSDIADRADHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     +PWG+  T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAGATT-TYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV  +  N FGL +M+GNVWEWT   ++ HH       P   P GP     ++  KGGS
Sbjct: 198 SPVGVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPSVNQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           +LC  +YC+R+R AARS  + DS+  ++GFRC A+ G
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAEVG 294


>gi|332865303|ref|XP_001160187.2| PREDICTED: sulfatase-modifying factor 2 isoform 10 [Pan
           troglodytes]
 gi|410224492|gb|JAA09465.1| sulfatase modifying factor 2 [Pan troglodytes]
 gi|410302660|gb|JAA29930.1| sulfatase modifying factor 2 [Pan troglodytes]
          Length = 320

 Score =  214 bits (544), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 133/337 (39%), Positives = 172/337 (51%), Gaps = 70/337 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 49  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 108 IFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 137

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E             R F    W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 138 ---LPVE-------------RAF----WRQPAGPGSGILERLEHPVLHVSWNDARAYCAW 177

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 178 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 234

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++    
Sbjct: 235 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 281

Query: 314 YCYRH--RCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               H  R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 282 GSANHQARVTTRMGNTPDSASDNLGFRCAADAGRPPG 318



 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++        H  R   R  NTPDS++ NLGFRCAAD 
Sbjct: 269 RVLRGASWIDTADGSANHQARVTTRMGNTPDSASDNLGFRCAADA 313


>gi|297288257|ref|XP_001105383.2| PREDICTED: sulfatase-modifying factor 2-like isoform 1 [Macaca
           mulatta]
          Length = 303

 Score =  214 bits (544), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 132/340 (38%), Positives = 173/340 (50%), Gaps = 73/340 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ---EFVSATGYVT 71
           MV LPG  F MGTN P   +DGE P R  T+  F +D   V+N  F+   +FV    Y T
Sbjct: 29  MVQLPGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRYIRDFVREKKYRT 87

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAE FG +FVFE  +S+E R K +Q                     P+  + W       
Sbjct: 88  EAEMFGWSFVFEDFVSDELRNKATQ---------------------PMKPVLW------- 119

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W    LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AY
Sbjct: 120 -W----LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAY 157

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           C WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG FP  + A DG+  
Sbjct: 158 CAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGRFPKGDKAEDGFHG 214

Query: 252 TAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL- 309
            +PV ++  +N +GLY+++GNVWEWTA             +P        +V +G S++ 
Sbjct: 215 VSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWID 261

Query: 310 -CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
             +    +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 262 TADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRLPG 301



 Score = 41.2 bits (95), Expect = 0.83,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 252 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 296


>gi|338534920|ref|YP_004668254.1| hypothetical protein LILAB_26420 [Myxococcus fulvus HW-1]
 gi|337261016|gb|AEI67176.1| hypothetical protein LILAB_26420 [Myxococcus fulvus HW-1]
          Length = 334

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 132/346 (38%), Positives = 169/346 (48%), Gaps = 63/346 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF MG++        E PS  VT+  F++D   V+N  FQ FV ATGYVT AE
Sbjct: 31  MVFIPGGTFWMGSDHHY---PEERPSHQVTVTDFWMDSQPVTNADFQRFVEATGYVTVAE 87

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +        PL  ++              F G +  +          +     V   T  
Sbjct: 88  R--------PLNPDD--------------FPGANPAL----------LVPGSLVFKKTLG 115

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L    +W +   G           W HPEG DST+E R  HPVVHV + DA+AY  W
Sbjct: 116 RVDLRDLTQWWFYTAGAC---------WKHPEGPDSTLEGREQHPVVHVCYEDALAYARW 166

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
               LPTEAEWEY  RGG+E +++ WG+   P G+  AN WQGEFP  N   DG+  T+P
Sbjct: 167 ADKALPTEAEWEYAARGGMERKVYVWGDEFAPGGKLMANTWQGEFPWQNLNTDGFEGTSP 226

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKGPTTG---------- 298
           V S+  N FGL++M GNVWEWT+DW+   H            NP+GP T           
Sbjct: 227 VGSFPPNAFGLFDMAGNVWEWTSDWYQERHERHAGKPCCIPVNPRGPATSERSLDPCLPK 286

Query: 299 ---TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                +V KGGS+LC   YC R+R AARS    DS   ++GFRC  
Sbjct: 287 VKIPRRVLKGGSHLCAPNYCLRYRPAARSPQAVDSGTSHIGFRCVV 332


>gi|116626601|ref|YP_828757.1| hypothetical protein Acid_7564 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116229763|gb|ABJ88472.1| protein of unknown function DUF323 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 315

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 135/346 (39%), Positives = 170/346 (49%), Gaps = 63/346 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +PG TFRMG+         E P   V +D F++D HEV+N ++ EFV ATGY+T AE
Sbjct: 11  MIRIPGGTFRMGSEA---FYPEESPVHEVAVDGFWIDCHEVTNQRYGEFVEATGYLTVAE 67

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  +   F P    E                   S +  R   PV      D   Y  W 
Sbjct: 68  RPLNAADF-PGAPPENLVP--------------GSMVFRRTSGPV------DLHNYVNW- 105

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W +           P  SW HP+G  S++E    HPVVHV++ DA AY  W
Sbjct: 106 ---------WAW----------VPGASWKHPQGPGSSLEGIEQHPVVHVAYEDAEAYAGW 146

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGL+ + F WG+   P G   AN WQGEFP  N  ADGY  T+P
Sbjct: 147 AGKELPTEAEWEFAARGGLDGKNFTWGDEDFPGGNAMANSWQGEFPWQNLLADGYEGTSP 206

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH------------------HPAPSYNPKGPT 296
           V S+  N +GLY+M GNVWEWT+DW+   H                   PA SY+P  P 
Sbjct: 207 VGSFPPNGYGLYDMAGNVWEWTSDWFVPRHADEVVRSCCGPAVNPRISSPAKSYDPAQPQ 266

Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                KV KGGS+LC   YC R+R AAR     D+   ++GFRC A
Sbjct: 267 FQIPRKVVKGGSHLCAPNYCLRYRPAARQPQMLDTGMTHIGFRCIA 312


>gi|374368153|ref|ZP_09626207.1| C-alpha-formyglycine- generating protein [Cupriavidus basilensis
           OR16]
 gi|373100317|gb|EHP41384.1| C-alpha-formyglycine- generating protein [Cupriavidus basilensis
           OR16]
          Length = 388

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 129/355 (36%), Positives = 174/355 (49%), Gaps = 60/355 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + M  +PG  F MG+N   L +D E P+    +  F++DQH V+N +F+ FV+ATGY+T 
Sbjct: 50  RGMAWVPGGEFLMGSNH-RLARDNERPAHKTRVHGFWMDQHHVTNAEFRAFVAATGYITT 108

Query: 73  AEKFGD--TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           AEK  D  T   +      +    + V   M  F G D  +                   
Sbjct: 109 AEKKPDWETLKVQVPAGTPQPPDSALVPGGMV-FVGTDRPVP------------------ 149

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                  L   + W          R  P  +W HP+G  STIE + NHPVV VS+ DA A
Sbjct: 150 -------LQYSSRWW---------RFVPGANWRHPDGPGSTIEGKDNHPVVQVSYEDAQA 193

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAA 246
           Y  W G RLPTEAEWE+  RGGLE   + WG+   P G+  ANVWQG+    FP  +  A
Sbjct: 194 YARWAGMRLPTEAEWEFAVRGGLEQATYAWGDQFAPGGKQMANVWQGQQSQPFPVVSAKA 253

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYN 291
            G L T+ V ++  N +GLY+M GN W+W ADW+                    PA S++
Sbjct: 254 GGALGTSAVGTFPANGYGLYDMTGNAWQWGADWYRADQFKRVASIGQVEVYPLGPADSWD 313

Query: 292 PKGPTT---GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           P  P        +V +GGSYLCN+ YC  +R +AR    P +S  +LGFR A D+
Sbjct: 314 PSEPGVPARAPKRVIRGGSYLCNQDYCLSYRPSARQGADPYTSMSHLGFRLAMDE 368


>gi|404447923|ref|ZP_11012917.1| hypothetical protein A33Q_01220 [Indibacter alkaliphilus LW1]
 gi|403766509|gb|EJZ27381.1| hypothetical protein A33Q_01220 [Indibacter alkaliphilus LW1]
          Length = 352

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 132/348 (37%), Positives = 177/348 (50%), Gaps = 49/348 (14%)

Query: 9   VERYKD-MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           + R KD MV +P   F MGTN P      E P+    + AF++D+ EV+N QF EFV+ T
Sbjct: 38  IVRSKDGMVFIPEGEFFMGTNDPESYP-VERPAVKRKVKAFWMDETEVTNAQFAEFVAET 96

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           GYVT AEK  D   +E +  +           +M+    + S    R+    + + W   
Sbjct: 97  GYVTMAEKTPD---WEEMKKQLPPGTPKPPEENMQPGSMVFSAPSSRVPLHDISLWW--- 150

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
                          +W  G             SW HPEG  S IE   NHPVVH+++ D
Sbjct: 151 ---------------KWVLGA------------SWKHPEGPGSDIEGLENHPVVHIAYED 183

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G RLPTE EWEY  + G E++ + WG+   P+    AN +QGEFP NN A D
Sbjct: 184 AEAYAKWAGKRLPTETEWEYAAKSGREHQRYAWGDTYKPQMSFMANTFQGEFPNNNLAED 243

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPS--------YNPKG 294
           G+  TAPV S+  N +GLY+M+GNVWE T+DW++          AP+        YNP  
Sbjct: 244 GFEGTAPVKSFPPNDYGLYDMIGNVWEMTSDWYDAIKFRRLAGNAPALDSSISQCYNPTN 303

Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           P    +KV KGGS+LC + YC  +R +AR     +S   N+GFRC  D
Sbjct: 304 P-YALEKVIKGGSFLCADDYCINYRPSARQGQDYESGTSNIGFRCVRD 350


>gi|395800381|ref|ZP_10479657.1| hypothetical protein FF52_00945 [Flavobacterium sp. F52]
 gi|395437554|gb|EJG03472.1| hypothetical protein FF52_00945 [Flavobacterium sp. F52]
          Length = 376

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 129/355 (36%), Positives = 178/355 (50%), Gaps = 66/355 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPI--------LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           K MV +PG  F MGTN           + KD   P   V +D +Y+D+ EV+N +F++FV
Sbjct: 64  KGMVWIPGGEFSMGTNVEDESLCSIKGVTKDAA-PIHRVYVDGYYMDETEVTNEEFEKFV 122

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
            ATGYVT AE+      F P  +EE+    S V         L++               
Sbjct: 123 KATGYVTVAEQKPTKEEF-PTANEEDLITGSVVFTPTPSAVNLNN--------------- 166

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
                +  W          W Y           P   W HPEG  STI+ +  +PVVHV 
Sbjct: 167 -----FLQW----------WRYE----------PGADWRHPEGPQSTIKGKEKYPVVHVV 201

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN-- 242
           + DA AY  W G RLPTEAEWE+  RGG    L+ WGN+L P+G+ +AN++QG FP    
Sbjct: 202 YEDAAAYAKWAGKRLPTEAEWEFAARGGKTGNLYAWGNDLKPKGKFQANIYQGHFPIKDG 261

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA------PSYNPKGPT 296
           +T  DG+   AP   Y  N +GLY+M GNVWEW  DW++V ++ +       + NP+GP+
Sbjct: 262 DTGEDGFKGIAPTKQYAPNGYGLYDMAGNVWEWVNDWYSVDYYKSLAESGKVTKNPQGPS 321

Query: 297 TGTD--------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
              D        +V +GGS+LC +QYC R+    R +    S+A ++GFRC   K
Sbjct: 322 AYYDPNDPSEIKRVHRGGSFLCTDQYCTRYMVGTRGKGEIRSAANHVGFRCVKSK 376


>gi|390944509|ref|YP_006408270.1| hypothetical protein Belba_2996 [Belliella baltica DSM 15883]
 gi|390417937|gb|AFL85515.1| hypothetical protein Belba_2996 [Belliella baltica DSM 15883]
          Length = 346

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 133/346 (38%), Positives = 172/346 (49%), Gaps = 49/346 (14%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           ++   MVL+ G  F MGTN  +     E P+    +++F++D+ EV+N QF  FV ATGY
Sbjct: 36  KKMDGMVLIQGGDFSMGTND-LNSYPSERPAVLRKVESFWMDETEVTNKQFAAFVEATGY 94

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT AE+  D           E  K +      K  E L          P   +S +D   
Sbjct: 95  VTVAERKPDW----------EEMKQTLPPGTPKPDESLLVAGSLVFVAPDQAVSLDDISQ 144

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
           +  W                        P   W  PEG  S I+ + NHPVVH++  DA 
Sbjct: 145 WWQW-----------------------VPGAYWKQPEGPGSDIQGKENHPVVHIALEDAE 181

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AY TW G RLPTEAEWEY  +GG  N  FPWG+ L P G  RAN +QGEFP  N   DG+
Sbjct: 182 AYATWAGKRLPTEAEWEYAAKGG-NNSRFPWGDELNPSGSFRANTFQGEFPYQNEGIDGF 240

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-----HHHPAPS--------YNPKGPT 296
              APV S+  N FGLY+M+GNVWE T+DW++          AP+        YNP+ P 
Sbjct: 241 EGAAPVRSFAGNDFGLYDMIGNVWELTSDWFDAAAFMREKALAPALDTSISKCYNPENPY 300

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
              +KV KGGS+LC + YC  +R +AR      S   N+GFRC  D
Sbjct: 301 A-IEKVIKGGSFLCADDYCINYRPSARRGQDVYSGTSNVGFRCVKD 345


>gi|255292428|dbj|BAH89546.1| hypothetical protein [uncultured bacterium]
          Length = 266

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 124/302 (41%), Positives = 158/302 (52%), Gaps = 57/302 (18%)

Query: 45  LDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRF 104
           L  F +D   V+  +F+ FV  TGYVTEAE+FG ++VF  L+            +D   F
Sbjct: 18  LKPFSIDATAVTIGRFERFVRDTGYVTEAERFGWSYVFFALM------------NDPDSF 65

Query: 105 EGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLH 164
             L              + W   V   TWR                              
Sbjct: 66  SALPG------------LEWWRKVDGATWR-----------------------------T 84

Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL 224
           PEG +S I+ R  +PV HVSWNDA A+ +W G RLPTEAEWE+   GGL+   FPWGN+ 
Sbjct: 85  PEGPESDIDGRAEYPVTHVSWNDATAFASWAGGRLPTEAEWEFAAAGGLQGARFPWGNDE 144

Query: 225 TPRGEHR-ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH 283
                 +  N+WQG FP  NT ADGYL TAPV S+  N FGLYNMVGN WEWT D + + 
Sbjct: 145 PDDASFQPCNIWQGSFPDRNTVADGYLGTAPVESFAPNGFGLYNMVGNTWEWTVDRFRIK 204

Query: 284 --HHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                A   N     +G  +V KGGSYLC++ YCYR+R AAR+ +TPDS+ G++GFR   
Sbjct: 205 SVSKAAERLNTLARKSGA-RVIKGGSYLCHKSYCYRYRIAARTSSTPDSTTGHMGFRLCY 263

Query: 342 DK 343
           D+
Sbjct: 264 DR 265


>gi|386395696|ref|ZP_10080474.1| hypothetical protein Bra1253DRAFT_01138 [Bradyrhizobium sp.
           WSM1253]
 gi|385736322|gb|EIG56518.1| hypothetical protein Bra1253DRAFT_01138 [Bradyrhizobium sp.
           WSM1253]
          Length = 328

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/343 (38%), Positives = 176/343 (51%), Gaps = 62/343 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +PG TFRMG+++       E P   V +DAF++D H V+N QF+ FV ATG+VT AE
Sbjct: 24  MLYVPGGTFRMGSDRHY---PEEAPVHRVAVDAFWMDCHSVTNRQFKAFVGATGHVTVAE 80

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D   +  +L     A                S +     HPV               
Sbjct: 81  ITPDPKDYPGMLPHMVYAG---------------SLVFSPPDHPV--------------- 110

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L   ++W    +G           W HP G  S I+   +HPVVHV+++DA+AY  W
Sbjct: 111 --SLRDFSQWWTFAKGA---------DWRHPYGPGSKIQGLDDHPVVHVAFSDALAYARW 159

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGL+   F WG+  TP G H AN WQGEFP  NT  DG+  T+P
Sbjct: 160 AGKDLPTEAEWEFAARGGLDGAEFAWGDEFTPDGRHMANTWQGEFPRQNTRDDGFERTSP 219

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY------NPKG--------PTTGT- 299
           V ++  N +GL++++GNVWEWT+DW++  H    +       NP+G        P+T   
Sbjct: 220 VTAFPPNGYGLHDLIGNVWEWTSDWYSPRHEADAAKACCIPENPRGGREADSYDPSTTNV 279

Query: 300 ---DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               KV KGGS+LC   YC R+R AAR     D+S  +LGFRC
Sbjct: 280 KIPRKVIKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHLGFRC 322


>gi|386402426|ref|ZP_10087204.1| hypothetical protein Bra1253DRAFT_08214 [Bradyrhizobium sp.
           WSM1253]
 gi|385743052|gb|EIG63248.1| hypothetical protein Bra1253DRAFT_08214 [Bradyrhizobium sp.
           WSM1253]
          Length = 328

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 132/350 (37%), Positives = 177/350 (50%), Gaps = 64/350 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           ++MV +PG TFRMG+ +       E P   VT+D F++D+  ++N QF++FV+ATGYVT 
Sbjct: 20  REMVYIPGGTFRMGSEQHY---PEEAPVHRVTVDGFWIDRTPITNRQFRDFVNATGYVTF 76

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE   D   +   L    +A                          +V      AV    
Sbjct: 77  AELIPDAKDYPGALPRMLKAG------------------------SLVFTPPQQAVDLRN 112

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
           W         EW            F +G+ W  P G  S +    NHPVVHV++ D  AY
Sbjct: 113 W--------GEW----------WTFKFGANWRRPYGPHSNLRGLENHPVVHVAYRDVEAY 154

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWE+  RGGL+   F WG+ L P G+H AN+WQG FP  N A DG+  
Sbjct: 155 AKWAGKELPTEAEWEFASRGGLDGAEFAWGHELEPGGQHMANIWQGNFPHENLATDGFER 214

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAP----------------SYNPKG 294
           T+PV ++  N + LY+M+GNVWEWT+DW++ +H   AP                SY+P+ 
Sbjct: 215 TSPVTAFSTNGYELYDMIGNVWEWTSDWYSPMHEADAPKACCIPQNPRGGLENASYDPRN 274

Query: 295 PTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           P      KV KGGS+LC   YC R+R AAR     D+SA ++GFRC   K
Sbjct: 275 PKIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSASHVGFRCITRK 324


>gi|118466185|ref|YP_883593.1| sulfatase-modifying factor 1 [Mycobacterium avium 104]
 gi|118167472|gb|ABK68369.1| sulfatase-modifying factor 1 [Mycobacterium avium 104]
          Length = 296

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 128/337 (37%), Positives = 173/337 (51%), Gaps = 50/337 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++  LPG +FRMG+         E P   VT++ F +++H V+N QF EFV+ATGYVT 
Sbjct: 3   SELADLPGGSFRMGSTS---FYPEEAPIHTVTVEPFAIERHPVTNAQFAEFVAATGYVTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++         A  SQ++     F      ++ R                  
Sbjct: 60  AEQPIDPALYP-------GADPSQLQPGAMVFRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR        +W +           P  SW HP G DS +  R +HPVV V++ DA AY 
Sbjct: 96  WR--------QWWH---------WVPGASWRHPFGPDSDVADRADHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     +PWG+  T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAGATT-TYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV  +  N FGL +M+GNVWEWT   ++ HH       P   P GP   G ++  KGGS
Sbjct: 198 SPVGVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPGVNQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           +LC  +YC+R+R AARS  + DS+  ++GFRC A+ G
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAEVG 294


>gi|351697166|gb|EHB00085.1| Sulfatase-modifying factor 2 [Heterocephalus glaber]
          Length = 301

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 134/334 (40%), Positives = 173/334 (51%), Gaps = 70/334 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMV LPG  F MGT+ P   +DGE P R VT+  F +D   V+N  F++FV    Y TEA
Sbjct: 29  DMVWLPGGRFLMGTDAPD-ARDGEGPVREVTVKPFGIDIFPVTNKDFRKFVREKKYRTEA 87

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE  +S E + K +Q    MK               PV+            W
Sbjct: 88  ELFGWSFVFEEFVSSELKNKAAQ---QMK---------------PVLW-----------W 118

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  R+  PV+HVSWNDA AYC 
Sbjct: 119 ----LPVE-------------RAF----WRQPTGPGSGIRDRLEQPVLHVSWNDAQAYCR 157

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 158 WRGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGVS 214

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           PV ++  +N +GL+++VGNVWEWT+             +P  P      V +G S++   
Sbjct: 215 PVNAFPPQNDYGLFDLVGNVWEWTS-------------SPYQPAKQDMHVLRGASWIDTA 261

Query: 313 QYCYRH--RCAARSQNTPDSSAGNLGFRCAADKG 344
                H  R   R  NTPDS++ NLGFRCAA KG
Sbjct: 262 DGSANHQARVTTRMGNTPDSASDNLGFRCAASKG 295


>gi|27379853|ref|NP_771382.1| hypothetical protein bll4742 [Bradyrhizobium japonicum USDA 110]
 gi|27353006|dbj|BAC50007.1| bll4742 [Bradyrhizobium japonicum USDA 110]
          Length = 331

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/343 (38%), Positives = 172/343 (50%), Gaps = 62/343 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +PG TFRMG+++       E P   VT+DAF++D H V+N QF+ F  ATG+VT AE
Sbjct: 24  MLYVPGGTFRMGSDRHY---PEEAPVHRVTVDAFWMDCHPVTNRQFKAFAKATGHVTVAE 80

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D   +  +L     A                S +     HPV               
Sbjct: 81  ITPDPKDYPGMLPHMIYAG---------------SLVFSPPAHPV--------------- 110

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L   ++W    +G           W HP G  S I    +HPVVHV+++DA+AY  W
Sbjct: 111 --SLRDFSQWWTFAKGA---------DWRHPYGPGSDIRGLDDHPVVHVAFSDALAYARW 159

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGL+   F WG+   P G H AN WQGEFP  NT  DGY  T+P
Sbjct: 160 AGKDLPTEAEWEFAARGGLDGAEFAWGDVFAPDGRHMANTWQGEFPRQNTRDDGYERTSP 219

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----------------APSYNPKGPTT 297
           V ++  N +GL++M+GNVWEWT+DW++  H                   A SY+P+    
Sbjct: 220 VTAFPPNGYGLHDMIGNVWEWTSDWYSPRHEADAAKACCIPENPRGGREADSYDPRTTNV 279

Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               KV KGGS+LC   YC R+R AAR     D+S  +LGFRC
Sbjct: 280 KIPRKVIKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHLGFRC 322


>gi|383818630|ref|ZP_09973916.1| hypothetical protein MPHLEI_05012 [Mycobacterium phlei RIVM601174]
 gi|383338486|gb|EID16850.1| hypothetical protein MPHLEI_05012 [Mycobacterium phlei RIVM601174]
          Length = 291

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 130/331 (39%), Positives = 170/331 (51%), Gaps = 50/331 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+ +       E P   VT+  F +++H V+N QF  FV+ TGYVT A
Sbjct: 4   ELVELPGGSFRMGSTQ---FYPEEAPVHTVTVAPFAIERHPVTNAQFAAFVADTGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +P+              D   + G D      +   +V       VA   W
Sbjct: 61  E--------QPI--------------DPALYPGADPA--DLVPGALVFTPTEGPVALRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R        +W          R  P   W HP G DS IE RM+HPVV V++ DA AY  
Sbjct: 97  R--------QWW---------RWVPGACWRHPFGPDSGIEDRMDHPVVQVAYPDAAAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  R G     +PWG++ TP G   AN WQG+FP  NT A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARAGTTT-TYPWGDDPTPDGRLMANTWQGDFPYRNTGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH----HPAPSYNPKG-PTTGTDKVKKGGSY 308
           PV  +  N FGL +M+GNVWEWT   +  HH     P     P G P     +  KGGS+
Sbjct: 199 PVGVFPPNGFGLVDMIGNVWEWTTTRFTGHHRIGEQPKSCCTPSGPPNPAISQALKGGSH 258

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           LC  +YC+R+R AARS  + DS+  ++GFRC
Sbjct: 259 LCAPEYCHRYRAAARSPQSQDSATTHIGFRC 289


>gi|348560086|ref|XP_003465845.1| PREDICTED: sulfatase-modifying factor 2-like [Cavia porcellus]
          Length = 314

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 137/338 (40%), Positives = 175/338 (51%), Gaps = 70/338 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMV L G  F MGT+     +DGE P+R VT+  F +D   V+N  F+EFV    Y TEA
Sbjct: 42  DMVQLLGGQFLMGTDASD-ARDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 100

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE L+S E R   SQ    MK               PV+            W
Sbjct: 101 ELFGWSFVFEDLVSSELRNSASQ---QMK---------------PVLW-----------W 131

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  R+  PV+HVSWNDA AYC 
Sbjct: 132 ----LPVE-------------RAF----WRQPSGPGSGIRDRLELPVLHVSWNDAQAYCA 170

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 171 WRGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDRAEDGFHGVS 227

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
           PV ++  +N +GLY+++GNVWEWTA           SY P        +V +G S++   
Sbjct: 228 PVNAFPPQNDYGLYDLMGNVWEWTAS----------SYQPAEQDM---RVLRGASWIDTA 274

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           +    +R R   R  NTPDS++ NLGFRCAA  G   G
Sbjct: 275 DGSANHRARVTTRMGNTPDSASDNLGFRCAASVGRAFG 312


>gi|338176147|ref|YP_004652957.1| sulfatase-modifying factor 1 [Parachlamydia acanthamoebae UV-7]
 gi|336480505|emb|CCB87103.1| sulfatase-modifying factor 1 [Parachlamydia acanthamoebae UV-7]
          Length = 390

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 126/340 (37%), Positives = 179/340 (52%), Gaps = 49/340 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG++      D E P   V +D F++D   V+N QF+EF+ ATGYVT AE
Sbjct: 41  MVWIPGGEFMMGSDYEDSKPD-EKPQHRVKVDGFWMDATPVTNKQFKEFIDATGYVTTAE 99

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV-VHISWNDAVAYCTW 133
           +        P L+E     ++QV                 +  P  V I  N++ A+  W
Sbjct: 100 R-------APTLNE----IMNQVPPGTPPPSPELLVAASLVFKPSDVPIPLNNSHAWWEW 148

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           +                       P  +W HP G +STIE + +HPVV VSW+DA AY  
Sbjct: 149 K-----------------------PGANWKHPLGPESTIEEKKDHPVVQVSWDDAQAYAK 185

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+   GG ++ ++ WGN+       +AN+WQG FP  +T + GY  T 
Sbjct: 186 WAGKRLPTEAEWEFAAYGGRKDIMYVWGNDEFSEEVPQANIWQGIFPYKSTKSKGYNGTT 245

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTT--------GTD 300
           PV ++K N +GLY+M GNVW+W +D +++ ++        S NP GP T         T 
Sbjct: 246 PVTTFKPNPYGLYDMSGNVWQWCSDLYHMTYYQEEAKKELSINPAGPKTSFDPREPYATK 305

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            V +GGS+LC++ YC  +R  AR + +PD+S  +LGFRC 
Sbjct: 306 HVHRGGSFLCHDSYCKGYRITARMKTSPDTSLNHLGFRCV 345


>gi|74209819|dbj|BAE23615.1| unnamed protein product [Mus musculus]
          Length = 299

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 131/333 (39%), Positives = 172/333 (51%), Gaps = 71/333 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGT+ P   +DGE P+R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 29  MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 87

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S E R +                  E+ M  P VH  W   V    WR
Sbjct: 88  AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 125

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                         P G  S I  ++  PVVHVSWNDA AYC W
Sbjct: 126 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 156

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 157 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 213

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA           +Y P G      +V +G S++   +
Sbjct: 214 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQPAGQDM---RVLRGASWIDTAD 260

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
               +R R   R  NTPDS++ NLGFRCA+  G
Sbjct: 261 GSANHRARVTTRMGNTPDSASDNLGFRCASSAG 293


>gi|77377292|ref|NP_080721.1| sulfatase-modifying factor 2 precursor [Mus musculus]
 gi|377834086|ref|XP_003689431.1| PREDICTED: sulfatase-modifying factor 2 isoform 1 [Mus musculus]
 gi|341942094|sp|Q8BPG6.2|SUMF2_MOUSE RecName: Full=Sulfatase-modifying factor 2; AltName:
           Full=C-alpha-formylglycine-generating enzyme 2; Flags:
           Precursor
 gi|74180529|dbj|BAE34197.1| unnamed protein product [Mus musculus]
 gi|148687550|gb|EDL19497.1| sulfatase modifying factor 2 [Mus musculus]
          Length = 308

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 131/333 (39%), Positives = 172/333 (51%), Gaps = 71/333 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGT+ P   +DGE P+R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 38  MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 96

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S E R +                  E+ M  P VH  W   V    WR
Sbjct: 97  AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 134

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                         P G  S I  ++  PVVHVSWNDA AYC W
Sbjct: 135 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 165

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 166 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 222

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA           +Y P G      +V +G S++   +
Sbjct: 223 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQPAGQDM---RVLRGASWIDTAD 269

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
               +R R   R  NTPDS++ NLGFRCA+  G
Sbjct: 270 GSANHRARVTTRMGNTPDSASDNLGFRCASSAG 302


>gi|26344956|dbj|BAC36127.1| unnamed protein product [Mus musculus]
          Length = 308

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 131/333 (39%), Positives = 172/333 (51%), Gaps = 71/333 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGT+ P   +DGE P+R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 38  MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 96

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S E R +                  E+ M  P VH  W   V    WR
Sbjct: 97  AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 134

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                         P G  S I  ++  PVVHVSWNDA AYC W
Sbjct: 135 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 165

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 166 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 222

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA           +Y P G      +V +G S++   +
Sbjct: 223 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQPAGQDM---RVLRGASWIDTAD 269

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
               +R R   R  NTPDS++ NLGFRCA+  G
Sbjct: 270 GSANHRARVTTRMGNTPDSASDNLGFRCASSAG 302


>gi|414078471|ref|YP_006997789.1| hypothetical protein ANA_C13300 [Anabaena sp. 90]
 gi|413971887|gb|AFW95976.1| hypothetical protein ANA_C13300 [Anabaena sp. 90]
          Length = 336

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 128/337 (37%), Positives = 180/337 (53%), Gaps = 58/337 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF +G++      D E  + ++T+++F +D++E++N +F +FV  TGY+T AE
Sbjct: 43  MVFIPGGTFNIGSDAHY---DSERSAEDITVNSFCMDKYEITNAKFAQFVKETGYITVAE 99

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      F P LS++ R   S V      F+  D   +      V H+SW        W+
Sbjct: 100 RPLPKDQF-PDLSDDSRKPGSLV------FQMADPEAKR-----VRHLSW--------WK 139

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W  G             +W HP G DS I  + N+PVV V++ D  AY  W
Sbjct: 140 ---------WTLGA------------NWRHPFGPDSNINGKDNYPVVQVAYEDVKAYTEW 178

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
               LPTE +WEY  RGGL + ++ WG   +   E +AN WQG FP  NT ADGY+ TAP
Sbjct: 179 AKMSLPTETQWEYAARGGLNHAIYSWGEQYS---EKKANTWQGLFPVFNTKADGYIGTAP 235

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTT---------GTDK-VK 303
           V S++ N +GLY++ GNVWEWT DW+   H H + + NP+GP+          G  K V 
Sbjct: 236 VGSFQPNGYGLYDITGNVWEWTTDWYRPGHDHKSHTSNPQGPSKQESFDPREPGVAKHVI 295

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           KGGSYLC   YC R+R AAR   +PD+   ++GFR  
Sbjct: 296 KGGSYLCAPNYCSRYRPAARESQSPDTGTTHIGFRLV 332


>gi|74147446|dbj|BAE38634.1| unnamed protein product [Mus musculus]
          Length = 303

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 131/333 (39%), Positives = 172/333 (51%), Gaps = 71/333 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGT+ P   +DGE P+R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 33  MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 91

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S E R +                  E+ M  P VH  W   V    WR
Sbjct: 92  AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 129

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                         P G  S I  ++  PVVHVSWNDA AYC W
Sbjct: 130 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 160

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 161 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 217

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA           +Y P G      +V +G S++   +
Sbjct: 218 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQPAGQDM---RVLRGASWIDTAD 264

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
               +R R   R  NTPDS++ NLGFRCA+  G
Sbjct: 265 GSANHRARVTTRMGNTPDSASDNLGFRCASSAG 297


>gi|405374984|ref|ZP_11029278.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1) [Chondromyces apiculatus DSM 436]
 gi|397086652|gb|EJJ17755.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1) [Myxococcus sp. (contaminant ex DSM
           436)]
          Length = 360

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 131/346 (37%), Positives = 164/346 (47%), Gaps = 63/346 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF MG++        E PS  VT+  F+LD   V+N  F  FV AT YVT AE
Sbjct: 57  MVYIPGGTFWMGSDH---HYPEERPSHQVTVTDFWLDSQPVTNADFARFVEATHYVTVAE 113

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  +   F     E         +  + R +  D T                        
Sbjct: 114 RPLNPADFPGAKPELLVPGSLVFKKTLARVDLRDLT------------------------ 149

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                   +W +   G           W HPEG DST E R  HPVVHV + DA+AY TW
Sbjct: 150 --------QWWFYTAGAC---------WKHPEGPDSTWEGREQHPVVHVCFEDALAYATW 192

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
               LPTEAEWEY  RGG+E +++ WG+   P G+  AN WQGEFP  N   DG+ ST+P
Sbjct: 193 ADKALPTEAEWEYAARGGMERKVYVWGDEFAPGGKLMANTWQGEFPWQNLNTDGFESTSP 252

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTG---------- 298
           V S+  N FGL++M GNVWEWT DW+   H            NP+GP T           
Sbjct: 253 VGSFPPNTFGLFDMAGNVWEWTTDWYQERHKSQGGKPCCIPVNPRGPATADRSLDPCLPQ 312

Query: 299 ---TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                +V KGGS+LC   YC R+R AARS    DS   ++GFRC  
Sbjct: 313 VKIPRRVLKGGSHLCAPNYCLRYRPAARSPQAVDSGTSHIGFRCVV 358


>gi|418398948|ref|ZP_12972500.1| hypothetical protein SM0020_02405 [Sinorhizobium meliloti
           CCNWSX0020]
 gi|359507040|gb|EHK79550.1| hypothetical protein SM0020_02405 [Sinorhizobium meliloti
           CCNWSX0020]
          Length = 319

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 130/348 (37%), Positives = 174/348 (50%), Gaps = 61/348 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +E   +MV +PG TF MG+N        E P   VT+D F++D+  V+N QF EFV+ATG
Sbjct: 11  LELAAEMVWVPGATFMMGSNDHY---PEESPVHPVTVDGFWIDETPVTNRQFLEFVNATG 67

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           +VT AE+       +P   +   A  S +R     F      ++                
Sbjct: 68  HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 104

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                 G  +   ++W     G          +W HP G  S+I   ++HPVVHV+++DA
Sbjct: 105 ------GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAVLDHPVVHVAYSDA 146

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTEAEWE   RGGL+   F WG+ L P G H AN WQG FP  N+  DG
Sbjct: 147 RAYAEWVGKELPTEAEWERAARGGLDQAEFAWGDELAPGGNHMANTWQGSFPVENSMDDG 206

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
           +  T+PV SY  N +GLY+M+GNVWEWT D+W+V H  A                  S +
Sbjct: 207 FARTSPVRSYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           P        +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 267 PAARVKVPRRVLKGGSHLCAPDYCRRYRPAARHAQEIDTSTSHVGFRC 314


>gi|410915208|ref|XP_003971079.1| PREDICTED: sulfatase-modifying factor 2-like [Takifugu rubripes]
          Length = 277

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 129/342 (37%), Positives = 175/342 (51%), Gaps = 71/342 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G    MGTN     KDGEFP R+V + +F +DQ+ V+N  F++FV A  Y TEAE
Sbjct: 1   MVDIRGGRMLMGTNAADG-KDGEFPPRDVKVQSFKIDQYPVTNADFRDFVRAQKYKTEAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFG +FVF+  +S+E ++KI+Q                               +    W 
Sbjct: 60  KFGWSFVFQDFVSDELKSKITQ------------------------------RIESAPWW 89

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E             R F    W  P G  S I  R++ PVV VSWNDA A+C W
Sbjct: 90  ---LPVE-------------RAF----WRQPTGPGSGIRERLDFPVVQVSWNDAQAFCQW 129

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G RLPTE EWE+  RGGL+ R +PWGN       +R+N+WQG FP  + A DG+   +P
Sbjct: 130 KGKRLPTEEEWEWAARGGLQGRTYPWGNKFQ---ANRSNLWQGSFPEKDNADDGFHGVSP 186

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+M+GN WEWT+               K P      V +G S++   +
Sbjct: 187 VSAFPPQNSYGLYDMMGNTWEWTS--------------TKFPGAQLMYVLRGASWIDTVD 232

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVK 353
               +R R   R  NTPDS++ NLGFRCAA++G      K K
Sbjct: 233 GSANHRARVTTRMGNTPDSASDNLGFRCAANQGQKQEKKKDK 274


>gi|372223308|ref|ZP_09501729.1| sulfatase-modifying factor protein [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 370

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 132/356 (37%), Positives = 180/356 (50%), Gaps = 51/356 (14%)

Query: 1   MVLLPAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
           +V+   P +E   DMV + G TF  G   +  L    E P   VT++ FY+D HEV+N Q
Sbjct: 41  LVVSKLPNLEPPVDMVWIKGGTFYQGALPEDSLAMQHEKPRHKVTVNGFYMDMHEVTNDQ 100

Query: 60  FQEFVSATGYVTEAEK-FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHP 118
           F EFV+AT YVT AE+      + + L  E  +   S ++     F+   S++       
Sbjct: 101 FAEFVAATNYVTVAERAISWDEMKKQLPPETPKPHDSILQPGSLVFKKSKSSLP------ 154

Query: 119 VVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNH 178
                  +   Y  W         EW+ G             +W HP G  S+IE +  +
Sbjct: 155 -------NLYDYSQWW--------EWKIGA------------NWRHPNGPGSSIEGKGEY 187

Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
           PVV +++ DA+AYC W+G RLPTEAEWEY  R    N ++ W   L     H AN W+GE
Sbjct: 188 PVVQIAYEDALAYCEWKGRRLPTEAEWEYATRAEKINDIYSWPGGLEQLSSH-ANTWEGE 246

Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH------------- 285
           FP  N   DGY   APV SY  N FGLY+M GNVWEWT+DW++++++             
Sbjct: 247 FPVENKLHDGYERRAPVKSYPANAFGLYDMAGNVWEWTSDWYHINYYANVSEIEELINPS 306

Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            P  +YNP  P    +KV KGGS+LC+  YC  +R +AR     DS A +LGFR  
Sbjct: 307 GPEKAYNPSNPYV-QEKVIKGGSFLCSASYCASYRISARMATGLDSGAEHLGFRTV 361


>gi|345801202|ref|XP_536829.3| PREDICTED: sulfatase-modifying factor 2 [Canis lupus familiaris]
          Length = 297

 Score =  211 bits (537), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 130/338 (38%), Positives = 176/338 (52%), Gaps = 72/338 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F+MGTN P   ++GE P R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 26  MVQLPGGRFQMGTNSPD-GRNGEGPVREVTVKPFAIDVFPVTNKDFREFVREKKYRTEAE 84

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  ++ E R K++                     H +  + W        W 
Sbjct: 85  MFGWSFVFEEFVANELRNKVT---------------------HQMESVLW--------W- 114

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E             R F    W  P G +S I+ R+  PV+HVSWNDA AYC W
Sbjct: 115 ---LPVE-------------RAF----WRQPAGPNSGIQERLELPVLHVSWNDARAYCAW 154

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 155 KGKRLPTEEEWEFAARGGLKGQVYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 211

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNE 312
           V ++  +N +GLY+++GNVWEWTA  +              P+   D +V +G S++   
Sbjct: 212 VNAFPPQNNYGLYDLMGNVWEWTASLY--------------PSADQDMRVLRGASWIDTA 257

Query: 313 QYCYRH--RCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
                H  R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 258 DGSANHLARVTTRMGNTPDSASDNLGFRCASSIGRLPG 295


>gi|254432243|ref|ZP_05045946.1| sulfatase-modifying factor 1 [Cyanobium sp. PCC 7001]
 gi|197626696|gb|EDY39255.1| sulfatase-modifying factor 1 [Cyanobium sp. PCC 7001]
          Length = 341

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 134/346 (38%), Positives = 181/346 (52%), Gaps = 63/346 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TFRMG++        E     VT++ F +D+HEV+N +F  FV ATGYVT AE
Sbjct: 47  MEWIPGGTFRMGSDDHY---PDELSVDAVTVEGFCIDRHEVTNAEFDAFVKATGYVTVAE 103

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      + P L++++RA  + V      FE      E               VAY +W 
Sbjct: 104 RPIPKDQY-PDLADDQRAPGALV------FESPAPGSE---------------VAYLSW- 140

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W +           P  +W HP+G DS +E   NHPVV V++ DA AY  W
Sbjct: 141 ---------WHWT----------PGANWRHPQGPDSDLEGLANHPVVQVAFRDAEAYAAW 181

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEA+WE+  RGGL++++F WG   + +   +AN WQGEFP +N   DGY  TAP
Sbjct: 182 AGKALPTEAQWEFAARGGLKDQVFSWGKTYSAK---QANTWQGEFPIHNQEKDGYFGTAP 238

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAP-------SYNPKGPTTGTDKV 302
           V S+  N +GL +M GNVWEWT DW+   H     H  P       S++P+ P      V
Sbjct: 239 VGSFPPNGYGLQDMTGNVWEWTQDWYRPGHQGMADHVNPAVSEQELSFDPREPGI-AKHV 297

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
            KGGS+LC + YC R+R AAR    PD+   ++GFR  +  GPT G
Sbjct: 298 IKGGSFLCAKNYCSRYRPAAREALEPDTGTSHIGFRLVS--GPTEG 341


>gi|338740518|ref|YP_004677480.1| sulfatase-modifying factor [Hyphomicrobium sp. MC1]
 gi|337761081|emb|CCB66914.1| Sulfatase-modifying factor like (SUMF1-related); putative
           C-alpha-formyglycine-generating enzyme or
           methyltransferase or protein kinase [Hyphomicrobium sp.
           MC1]
          Length = 335

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 136/356 (38%), Positives = 179/356 (50%), Gaps = 63/356 (17%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P ER   MV +P  TFRMG+++       E PS  VT+D F++D+  V+N QF++FVS T
Sbjct: 16  PTER-PGMVFVPSGTFRMGSDRHY---SEEAPSHRVTVDGFWIDETPVTNAQFRKFVSET 71

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
            Y T AE         P  +E+    +    H +K    + +   H +          D 
Sbjct: 72  RYTTFAEI--------PPKAEDYPGALP---HMLKAGSLVFTPPRHAV----------DL 110

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             +  W          W++  R G         +W HP G  S  +    HPVVH+++ D
Sbjct: 111 RDWSQW----------WDF--RFG--------ANWRHPYGRGSWTKGLDEHPVVHIAYAD 150

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G  LPTEAEWEY  RGGLE+  F WG+  TP+G H AN WQG FP  N   D
Sbjct: 151 AEAYAMWAGKELPTEAEWEYAARGGLEDAEFAWGDEFTPQGRHMANTWQGTFPLENLKED 210

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APS-----YNPKGPTTGTD- 300
           GY  T+PV ++  N FGLY+M+GNVWEWT+DW++  H P AP       NP+G       
Sbjct: 211 GYERTSPVRAFAANGFGLYDMIGNVWEWTSDWYSTKHAPDAPKACCVPVNPRGAAREQSF 270

Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
                      KV KGGS+LC   YC R+R AAR     D+S  ++GFRC     P
Sbjct: 271 DPCQPNIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRCVVRTRP 326


>gi|359779150|ref|ZP_09282391.1| hypothetical protein ARGLB_113_00760 [Arthrobacter globiformis NBRC
           12137]
 gi|359303599|dbj|GAB16220.1| hypothetical protein ARGLB_113_00760 [Arthrobacter globiformis NBRC
           12137]
          Length = 338

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 126/345 (36%), Positives = 167/345 (48%), Gaps = 67/345 (19%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDG----EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           +M+ +PG TF MG+       DG    E P+  VT+  F++D++ V+N +F  FV+ TG+
Sbjct: 38  NMLWVPGGTFLMGS-------DGHYPEEAPAHRVTVSGFWMDRNTVTNREFARFVAETGH 90

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT AE+  D   + P    E     S V                    P   +   D   
Sbjct: 91  VTLAEQVPDPADY-PYAKPELLVPASSV-----------------FRQPTQRVDLGDPYN 132

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
           +             W Y           P  +W  P+G  S++  R +HPVVHV+W D  
Sbjct: 133 W-------------WTY----------VPAANWRQPQGPGSSLNGRQDHPVVHVAWPDVE 169

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           +Y  W G  LPTEAEWE+  RGGL+   + WG+  TP G   AN WQGEFP  NT  DGY
Sbjct: 170 SYAAWAGKSLPTEAEWEFAARGGLDGATYAWGDEFTPSGRWMANTWQGEFPIQNTEQDGY 229

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA---------------PSYNPKG 294
             TAPV S+  N +GL +M+GNVWEWT+DW+  H   A                S +P  
Sbjct: 230 AGTAPVGSFPANGYGLADMIGNVWEWTSDWYQAHAETAHSCCTVKDPRGGEQNRSNDPHD 289

Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           P     KV KGGS+LC   YC R+R AAR     D++  +LGFRC
Sbjct: 290 PVRIPRKVMKGGSHLCAPNYCQRYRPAARMPQAIDTATSHLGFRC 334


>gi|27380186|ref|NP_771715.1| hypothetical protein bll5075 [Bradyrhizobium japonicum USDA 110]
 gi|27353340|dbj|BAC50340.1| bll5075 [Bradyrhizobium japonicum USDA 110]
          Length = 338

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 135/353 (38%), Positives = 181/353 (51%), Gaps = 66/353 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TFRMG+++       E P+  VT+D F++D+  V+N QF++FV+ATGYVT AE
Sbjct: 32  MFYVPGGTFRMGSDRHY---PEEAPAHLVTVDGFWMDRTPVTNRQFRKFVNATGYVTFAE 88

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH-PVVHISWNDAVAYCTW 133
                                 +  D K + G    + H +    +V      AV    W
Sbjct: 89  ----------------------ITPDAKDYPG---ALPHMLKAGSLVFTPPKRAVDLHNW 123

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                    +W            F +G+ W  P G  S I    +HPVVHV++ D  AY 
Sbjct: 124 --------GQWW----------TFKFGANWRRPYGPRSNISGLDDHPVVHVAYRDVEAYA 165

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+   F WG+ L PRG H AN WQG FP  N AADG+  T
Sbjct: 166 KWAGKELPTEAEWEFAARGGLDGAEFAWGDELFPRGRHMANTWQGNFPHQNLAADGFERT 225

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----------PSYNPKGPTTGT--- 299
           +PV ++  N +GL++M+GNVWEWTADW++  H             PS  P+G +  +   
Sbjct: 226 SPVTAFPANGYGLHDMIGNVWEWTADWYSQKHEADALKVCCIPQNPSGGPEGASYDSCQP 285

Query: 300 -----DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
                 KV KGGS+LC   YC R+R AAR     D+SA ++GFRC A K  T+
Sbjct: 286 NLRIPRKVLKGGSHLCAPSYCRRYRPAARHAEPIDTSASHVGFRCIARKRITS 338


>gi|307152366|ref|YP_003887750.1| hypothetical protein Cyan7822_2503 [Cyanothece sp. PCC 7822]
 gi|306982594|gb|ADN14475.1| protein of unknown function DUF323 [Cyanothece sp. PCC 7822]
          Length = 346

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 126/341 (36%), Positives = 182/341 (53%), Gaps = 58/341 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  + G TF +G+N+       E  + ++T+ +F +D++EV+N +F +FV  TGY+T AE
Sbjct: 43  MAFIRGGTFNIGSNE---YYPSEHAAEDITVSSFCIDKYEVTNQEFAKFVEETGYLTVAE 99

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      F P L +E+RA  S V      F+  D  I                + Y +W 
Sbjct: 100 RPLSPVQF-PDLPDEQRAPGSLV------FQMQDKDIMQ--------------IGYLSW- 137

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W++           P  +W HP G +S I+ + N+PVVHV++ D  AY  W
Sbjct: 138 ---------WKWT----------PGANWKHPFGPESDIKGKDNYPVVHVAYEDVEAYAKW 178

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEA+WEY  RGGL+++ + WG+  + +   +AN WQG FP  N   DGY  TAP
Sbjct: 179 AGLSLPTEAQWEYAARGGLDDKTYSWGDQYSAK---KANTWQGIFPIFNLKVDGYKGTAP 235

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTT---------GTDK-VK 303
           V S++ N +GLY++ GNVWEWT+DW+ + H + A   NP GP+          G  K V 
Sbjct: 236 VGSFEPNGYGLYDITGNVWEWTSDWYRIGHDNKAHQVNPTGPSVEESFDPREPGIAKHVI 295

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           KGGSYLC   YC R+R AAR   +PD+   ++GFR   + G
Sbjct: 296 KGGSYLCAPNYCSRYRPAARESQSPDTGTTHIGFRLIKNLG 336


>gi|92118474|ref|YP_578203.1| hypothetical protein Nham_2985 [Nitrobacter hamburgensis X14]
 gi|91801368|gb|ABE63743.1| protein of unknown function DUF323 [Nitrobacter hamburgensis X14]
          Length = 341

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 142/358 (39%), Positives = 180/358 (50%), Gaps = 62/358 (17%)

Query: 1   MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
            +L PA   ER  DMV +PG TFRMG++        E PS  VT+D F++D+  V+N QF
Sbjct: 9   QLLGPAADGERTADMVWIPGGTFRMGSDH---HYPEEAPSHRVTVDGFWIDKTPVTNRQF 65

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV ATG+VT AE   D   +   L E   A                S +      PV 
Sbjct: 66  KQFVRATGHVTVAEVPPDPKDYPGALPEMIYA---------------GSLVFAPPSGPVD 110

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
              W                 A W    +G           W HP G  S I  + +HPV
Sbjct: 111 LREW-----------------ARWWTFMKGA---------DWRHPYGPKSNINVQDHHPV 144

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           VH+++ DA+AY  W G  LPTEAEWE+  RGGLE   F WGN  TP G H AN WQGEFP
Sbjct: 145 VHIAYADALAYADWAGKALPTEAEWEFAARGGLEETEFAWGNEFTPNGRHMANTWQGEFP 204

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-AP----------- 288
             N A DGY  T+PV ++  N +GL++M+GNVWEWT+DW+   H   AP           
Sbjct: 205 RQNLAEDGYERTSPVTAFPPNGYGLHDMIGNVWEWTSDWYTPKHDADAPKACCIPENPRG 264

Query: 289 -----SYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                SY+P  P+     KV KGGS+LC   YC R+R AAR     D+S  ++GFRC 
Sbjct: 265 GREDASYDPCQPSIKIPRKVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHVGFRCV 322


>gi|41410272|ref|NP_963108.1| hypothetical protein MAP4174 [Mycobacterium avium subsp.
           paratuberculosis K-10]
 gi|440779649|ref|ZP_20958361.1| hypothetical protein D522_23846 [Mycobacterium avium subsp.
           paratuberculosis S5]
 gi|41399106|gb|AAS06724.1| hypothetical protein MAP_4174 [Mycobacterium avium subsp.
           paratuberculosis K-10]
 gi|436719846|gb|ELP44186.1| hypothetical protein D522_23846 [Mycobacterium avium subsp.
           paratuberculosis S5]
          Length = 296

 Score =  210 bits (535), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 127/337 (37%), Positives = 172/337 (51%), Gaps = 50/337 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++  LPG +FRMG+         E P   VT++ F +++H V+N QF EFV+ATGYVT 
Sbjct: 3   SELADLPGGSFRMGSTS---FYPEEAPIHTVTVEPFAIERHPVTNAQFAEFVAATGYVTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++         A  SQ++     F      ++ R                  
Sbjct: 60  AEQPIDPALYP-------GADPSQLQPGAMVFRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR        +W +           P  SW HP G DS +  R +HPVV V++ DA AY 
Sbjct: 96  WR--------QWWH---------WVPGASWRHPFGPDSDVADRADHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     +PWG+  T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAGATT-TYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV  +  N FGL +M+GNVWEWT   ++ HH       P   P GP     ++  KGGS
Sbjct: 198 SPVGVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPSVNQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           +LC  +YC+R+R AARS  + DS+  ++GFRC A+ G
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAELG 294


>gi|443666496|ref|ZP_21133781.1| hypothetical protein C789_4321 [Microcystis aeruginosa DIANCHI905]
 gi|159030099|emb|CAO90991.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443331217|gb|ELS45887.1| hypothetical protein C789_4321 [Microcystis aeruginosa DIANCHI905]
          Length = 322

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 129/352 (36%), Positives = 173/352 (49%), Gaps = 64/352 (18%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           AP    +KD++ +PG TF MG+N     ++G  P   VT+  F++D++ V+N QF++FV 
Sbjct: 10  APGKPPHKDLIWIPGGTFEMGSNSRKYPEEG--PVHTVTVSGFWMDKYLVTNKQFRKFVK 67

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
            TGY+T AEK        P   +   A    +      F   D  ++ R           
Sbjct: 68  ETGYITFAEK-------PPKAEDYPNADPEILVPGSAVFVKPDRPVDPR----------- 109

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
               +C W+                       P  +W HPEG DS+I++R NHPVVH+ +
Sbjct: 110 ---TFCWWQ---------------------YMPGANWRHPEGPDSSIKNRENHPVVHIVY 145

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
            DA+AY  W G  LPTEA+WE+  RGGLE   F WG+   P G+  AN WQG+FP  N  
Sbjct: 146 EDALAYAKWAGKELPTEAQWEFAARGGLEGAEFAWGDEFIPNGKVMANTWQGQFPWQNLK 205

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKGPTTGTD-- 300
                  A V SY  N +GLY+M+GNVWEWT DW    H   P+     PK P  GT+  
Sbjct: 206 PHAPGPEA-VESYPPNGYGLYDMIGNVWEWTQDWDQEQHPKNPTKACCTPKNPLGGTEED 264

Query: 301 --------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                         KV KGGS+LC   YC R+R AAR     D+S  ++GFR
Sbjct: 265 SYNLKLHPSMRKPRKVLKGGSFLCAPNYCARYRPAARHPEDIDTSTNHIGFR 316


>gi|326931322|ref|XP_003211781.1| PREDICTED: sulfatase-modifying factor 2-like [Meleagris gallopavo]
          Length = 285

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 125/334 (37%), Positives = 170/334 (50%), Gaps = 70/334 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F+MG++     +D E P R VT+  F +D+  V+N  F+EFV    Y TEAE
Sbjct: 1   MVQLPGGKFQMGSSS-TQSRDEEGPIREVTVKPFAIDKFPVTNRDFREFVRERKYKTEAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +SEE + K++Q                               +    W 
Sbjct: 60  AFGWSFVFEDFVSEELKKKVTQ------------------------------KLESAPWW 89

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+++PV+HVSWNDA A+C+W
Sbjct: 90  ---LPVEKAF-----------------WRQPAGPGSGIADRLDNPVLHVSWNDAQAFCSW 129

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G RLPTE EWE+  RGGLE RL+PWGN   P   +R N+WQG+FP  + A DGY   +P
Sbjct: 130 KGKRLPTEEEWEFAARGGLEQRLYPWGNKFQP---NRTNLWQGDFPRGDRAEDGYHGVSP 186

Query: 255 VMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT---TGTDKVKKGGSYLC 310
           V ++  +N +G+Y+++GN WEWTA            Y P GP        +  +G S++ 
Sbjct: 187 VAAFSPQNNYGIYDLLGNTWEWTAS----------QYKPPGPPQPHAEAMRALRGASWID 236

Query: 311 NEQYCYRHRC--AARSQNTPDSSAGNLGFRCAAD 342
                  HR     R  NTPDS++ NL FRCAA+
Sbjct: 237 TVDGSANHRACITTRMGNTPDSASDNLSFRCAAN 270


>gi|406660322|ref|ZP_11068455.1| Serine/threonine-protein kinase pkn1 [Cecembia lonarensis LW9]
 gi|405555946|gb|EKB50931.1| Serine/threonine-protein kinase pkn1 [Cecembia lonarensis LW9]
          Length = 359

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 126/341 (36%), Positives = 170/341 (49%), Gaps = 48/341 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MGTN+ +     E P+    +  F++D+ EV+N QF EF+ AT YVT AE
Sbjct: 52  MVWIPGGEFYMGTNE-MESYAVERPAVKKKVKGFWMDETEVTNAQFAEFIQATSYVTVAE 110

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D           E  ++       K  + L          P   ++ ND   +  W 
Sbjct: 111 RAIDW----------EELRLQLPPDTSKPDDELLQPGSLVFDPPSQPVALNDISQWWKWV 160

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                     +W +PEG  S+I+ R +HPVVH+++ DAVAY TW
Sbjct: 161 TG-----------------------ANWKNPEGPGSSIKGREDHPVVHIAYEDAVAYATW 197

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTE EWEY  +GG  +  + WG    P G   AN +QG+FP  N A DG+  TAP
Sbjct: 198 AGKRLPTETEWEYAAKGGRVHERYAWGEAFKPGGVFMANTFQGKFPHQNVAEDGFEGTAP 257

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APS--------YNPKGPTTGTDK 301
           V S+  N FGLY+M+GNVWE T+DW++          AP+        YNP  P    +K
Sbjct: 258 VKSFSPNDFGLYDMIGNVWELTSDWYDAISFARIAGDAPALDSSINLCYNPSNP-FALEK 316

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           V KGGS+LC + YC  +R +AR     DS   N+GFRC  D
Sbjct: 317 VIKGGSFLCADDYCINYRPSARQSQAYDSGTSNIGFRCVKD 357


>gi|374611475|ref|ZP_09684261.1| protein of unknown function DUF323 [Mycobacterium tusciae JS617]
 gi|373549185|gb|EHP75858.1| protein of unknown function DUF323 [Mycobacterium tusciae JS617]
          Length = 292

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 122/333 (36%), Positives = 176/333 (52%), Gaps = 49/333 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +PG +FRMG+ +       E P+   T+ A  +++H V+N QF EF+ ATGY+T A
Sbjct: 4   ELVEIPGGSFRMGSTQ---FYPEEAPAHTATVAAIAVERHPVTNAQFAEFIDATGYLTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++ P ++ E+    + V      F      ++ R           D   + TW
Sbjct: 61  EQPMDPALY-PGVAPEDLVPGALV------FRPTSGPVDLR-----------DWRQWWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  +W  P G+DSTI+ R++HPVV V++ DA AY  
Sbjct: 103 E-----------------------PGANWRQPFGVDSTIDDRLDHPVVQVAYPDAAAYAL 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG+    +PWG +  P G   AN WQG FP  N  A G+  T+
Sbjct: 140 WAGRRLPTEAEWEYAARGGV-TTTYPWGEDAAPEGRLMANTWQGRFPYLNEGALGWAGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTDKVKKGGSYL 309
           PV +++ N FGL +M+GNVWEWT   ++ HH    +      P       ++  KGGS+L
Sbjct: 199 PVGTFEANAFGLLDMIGNVWEWTTTEFSTHHRLDENKGCCTPPATADPAVNQTLKGGSHL 258

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           C  +YC+R+R AARS  + DS+  ++GFRC AD
Sbjct: 259 CAPEYCHRYRPAARSPQSQDSATTHIGFRCVAD 291


>gi|416402665|ref|ZP_11687414.1| protein of unknown function DUF323 [Crocosphaera watsonii WH 0003]
 gi|357261846|gb|EHJ11066.1| protein of unknown function DUF323 [Crocosphaera watsonii WH 0003]
          Length = 268

 Score =  210 bits (535), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 125/312 (40%), Positives = 173/312 (55%), Gaps = 56/312 (17%)

Query: 45  LDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRF 104
           +D+F +D+HEV+N +F++FV ATGY T AE+      F P LSEE+R   S V      F
Sbjct: 1   MDSFCIDRHEVTNAEFRQFVEATGYQTIAERPLSKEQF-PTLSEEQRQAGSLV------F 53

Query: 105 EGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLH 164
           +     IE               VAY +W          W  G             +W H
Sbjct: 54  QPPSEGIEQ--------------VAYLSWW--------HWTVGA------------NWQH 79

Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL 224
           P G +S I+ + N+PV+H++++DAVAY  W+G  LPTEA+WEY  RGGL+++ F WGN  
Sbjct: 80  PYGPNSNIKGKDNYPVIHIAYDDAVAYANWKGKTLPTEAQWEYAGRGGLKDQDFSWGNQY 139

Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
           + +   +AN WQG FP  NT  DG+L TAPV S+  N +GLY+M GNVWE T+DW++V H
Sbjct: 140 SAK---KANTWQGIFPFLNTKQDGHLGTAPVESFPPNGYGLYDMTGNVWELTSDWYSVGH 196

Query: 285 -------HPA-----PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 332
                  +P       S++P  P+ G+  V K GSYLC + YC R+R AAR    PD+  
Sbjct: 197 GGKEHSLNPTGPTKNASFDPTKPSEGSLHVIKRGSYLCAKNYCSRYRPAARESQAPDTGT 256

Query: 333 GNLGFRCAADKG 344
            ++GFR  +  G
Sbjct: 257 THVGFRLVSVDG 268


>gi|150375937|ref|YP_001312533.1| hypothetical protein Smed_3787 [Sinorhizobium medicae WSM419]
 gi|150030484|gb|ABR62600.1| protein of unknown function DUF323 [Sinorhizobium medicae WSM419]
          Length = 319

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 131/348 (37%), Positives = 176/348 (50%), Gaps = 61/348 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +E   +MV +PG TF MG+N        E P+  VT+D F++D+  V+N QF EFV+ATG
Sbjct: 11  LEPAAEMVWVPGATFMMGSNDHY---PEEAPAHPVTVDGFWIDEAPVTNRQFLEFVNATG 67

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           +VT AEK       +P   +   A  S +R     F             P   +  ND  
Sbjct: 68  HVTFAEK-------KPRAEDYPGAPPSNLRAGSLVFTP-----------PKRPLQGNDI- 108

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                        ++W     G          +W HP G  S+I   ++HPVVH++++DA
Sbjct: 109 -------------SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHIAYDDA 146

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTEAEWE   RG L+   F WG+ L P G+H AN WQG FP  N   DG
Sbjct: 147 KAYAEWAGKELPTEAEWELAARGDLDEAEFAWGDELVPDGKHMANTWQGRFPIENAMEDG 206

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY------NPKG-------- 294
           +  T+PV  Y  N +GLY+M+GNVWEWT D+W+V H  A +       NP+         
Sbjct: 207 FARTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADVEASVD 266

Query: 295 PTTGTD---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           P  G     +V KGGS+LC   YC R+R AAR     D++  ++GFRC
Sbjct: 267 PAAGVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRC 314


>gi|282891805|ref|ZP_06300286.1| hypothetical protein pah_c197o126 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|281498389|gb|EFB40727.1| hypothetical protein pah_c197o126 [Parachlamydia acanthamoebae str.
           Hall's coccus]
          Length = 390

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 126/340 (37%), Positives = 179/340 (52%), Gaps = 49/340 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG++      D E P   V +D F++D   V+N QF+EF+ ATGYVT AE
Sbjct: 41  MVWIPGGEFMMGSDYEDSKPD-EKPQHRVKVDGFWMDATPVTNKQFKEFIDATGYVTTAE 99

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV-VHISWNDAVAYCTW 133
           +        P L+E     ++QV                 +  P  V I  N++ A+  W
Sbjct: 100 R-------APTLNE----IMNQVPPGTPSPSPELLVAASLVFKPSDVPIPLNNSHAWWEW 148

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           +                       P  +W HP G +STIE + +HPVV VSW+DA AY  
Sbjct: 149 K-----------------------PGANWKHPLGPESTIEEKKDHPVVQVSWDDAQAYAK 185

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+   GG ++ ++ WGN+       +AN+WQG FP  +T + GY  T 
Sbjct: 186 WAGKRLPTEAEWEFAAYGGRKDIMYVWGNDEFSEEVPQANIWQGIFPYKSTKSKGYNGTT 245

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTT--------GTD 300
           PV ++K N +GLY+M GNVW+W +D +++ ++        S NP GP T         T 
Sbjct: 246 PVTTFKPNPYGLYDMSGNVWQWCSDLYHMTYYQEEAKKELSINPAGPKTSFDPREPYATK 305

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            V +GGS+LC++ YC  +R  AR + +PD+S  +LGFRC 
Sbjct: 306 HVHRGGSFLCHDSYCKGYRITARMKTSPDTSLNHLGFRCV 345


>gi|389696270|ref|ZP_10183912.1| hypothetical protein MicloDRAFT_00060970 [Microvirga sp. WSM3557]
 gi|388585076|gb|EIM25371.1| hypothetical protein MicloDRAFT_00060970 [Microvirga sp. WSM3557]
          Length = 303

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 131/344 (38%), Positives = 168/344 (48%), Gaps = 62/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF MG+N        E P   VT+D F++D   V+N QF+ F+ ATG+VT AE
Sbjct: 1   MVRIPGGTFAMGSNDHY---PEEAPVHRVTVDGFWMDAAPVTNRQFKSFIQATGHVTTAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D                   R +M +   L  T      HPV    W+    +    
Sbjct: 58  TIPDA------------KDYPGARREMLKAGSLLFTPPS---HPVDLRDWSQWWTFA--- 99

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                      +G             +W  P G +S ++   +HPVVHVS  DA+AY  W
Sbjct: 100 -----------FGV------------NWKRPNGPNSNLKGLEDHPVVHVSHTDALAYARW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGL+   F WG+ LTP G   AN WQG+FP  NT  DGY  T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGLDGAEFAWGDELTPDGRQMANTWQGQFPFQNTMLDGYERTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYNPKGPTT 297
           V SY  N +GL +M+GNVWEWT DW+   H                 H   SY+P+ P  
Sbjct: 197 VGSYPANGYGLVDMIGNVWEWTNDWYTPKHPDDAPKACCIPENPRGGHEDSSYDPRQPEI 256

Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
               KV KGGS+LC   YC R+R AAR  +  D+S  ++GFRC 
Sbjct: 257 RIPRKVVKGGSHLCAPNYCRRYRPAARHPHPVDTSTSHIGFRCV 300


>gi|425449405|ref|ZP_18829245.1| Sulfatase-modifying factor 1 [Microcystis aeruginosa PCC 7941]
 gi|425464892|ref|ZP_18844202.1| Sulfatase-modifying factor 1 [Microcystis aeruginosa PCC 9809]
 gi|389763913|emb|CCI09649.1| Sulfatase-modifying factor 1 [Microcystis aeruginosa PCC 7941]
 gi|389832957|emb|CCI22951.1| Sulfatase-modifying factor 1 [Microcystis aeruginosa PCC 9809]
          Length = 322

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 129/352 (36%), Positives = 174/352 (49%), Gaps = 64/352 (18%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           AP    +KD++ +PG TF MG+N     ++G  P   VT+  F++D++ V+N QF++FV 
Sbjct: 10  APGKPPHKDLIWIPGGTFEMGSNSRKYPEEG--PVHTVTVSGFWMDKYLVTNKQFRKFVK 67

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
            TGY+T AEK        P   +   A    +      F   D  ++ R       + W 
Sbjct: 68  ETGYITFAEK-------PPKAEDYPNADPEILVPGSAVFVKPDRPVDPRT------VCW- 113

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
                             W+Y           P  +W HPEG  S+I++R NHPVVH+ +
Sbjct: 114 ------------------WQY----------MPGANWRHPEGTGSSIKNRENHPVVHIVY 145

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
            DA+AY  W G  LPTEA+WE+  RGGLE   F WG+   P G+  AN WQG+FP  N  
Sbjct: 146 EDALAYAKWAGKELPTEAQWEFAARGGLEGAEFAWGDEFMPNGKVMANTWQGQFPWQNLK 205

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKGPTTGTD-- 300
                  A V SY  N +GLY+M+GNVWEWT DW+   H   P+     PK P  GT+  
Sbjct: 206 PHAPGPEA-VESYPPNGYGLYDMIGNVWEWTQDWYQEQHPKNPTKACCTPKNPLGGTEED 264

Query: 301 --------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                         KV KGGS+LC   YC R+R AAR     D+S  ++GFR
Sbjct: 265 SYNLKLHPSMRKPRKVLKGGSFLCAPNYCARYRPAARHPEDIDTSTNHIGFR 316


>gi|443478841|ref|ZP_21068539.1| Sulphatase-modifying factor protein [Pseudanabaena biceps PCC 7429]
 gi|443015808|gb|ELS30618.1| Sulphatase-modifying factor protein [Pseudanabaena biceps PCC 7429]
          Length = 332

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 131/338 (38%), Positives = 175/338 (51%), Gaps = 58/338 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G TFRMG +     +  E  + NV++ +F +D HEV+N +F +FV  TGY T AE
Sbjct: 34  MVFINGGTFRMGADSDRFPE--ERAADNVSVSSFCIDAHEVTNAEFAKFVKETGYKTVAE 91

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                  F P L +E+R   S V    K     D+                  VA+ +W 
Sbjct: 92  IPLSKEEF-PDLPDEQRLAGSLVFQPPKE----DA----------------QTVAFLSWW 130

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W  G             +W HP G DS+I+++ N+PVVH+S+ DA AY  W
Sbjct: 131 --------TWTVGA------------NWQHPFGKDSSIKNKDNYPVVHISYKDAEAYAKW 170

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  +PTEA+WEY  RGGL++  + WG   +   E RAN WQG FP  NT ADGY   AP
Sbjct: 171 IGKSIPTEAQWEYAARGGLDSATYAWGEQYS---EKRANTWQGIFPFFNTKADGYKGLAP 227

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-------HHHPAP-----SYNPKGPTTGTDKV 302
           V S+  N +GLY+M GNVWEWT+D++ V        H+P       S++PK P      V
Sbjct: 228 VGSFPPNGYGLYDMTGNVWEWTSDFFEVGHDRMAHQHNPIASDRHKSFDPKKPDESVLHV 287

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            KGGSYLC   YC R+R AAR   +PD+   ++GFR  
Sbjct: 288 IKGGSYLCAPNYCSRYRPAARESESPDTGTTHIGFRLV 325


>gi|409389219|ref|ZP_11241071.1| putative sulfatase-modifying factor [Gordonia rubripertincta NBRC
           101908]
 gi|403200511|dbj|GAB84305.1| putative sulfatase-modifying factor [Gordonia rubripertincta NBRC
           101908]
          Length = 309

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 128/331 (38%), Positives = 170/331 (51%), Gaps = 60/331 (18%)

Query: 14  DMVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + V +P   F MG       + DGE P  +V + AF +D   V+N  F  FV ATGYVT+
Sbjct: 32  ETVEVPAGRFAMGDALDEGYRTDGEVPVHDVEVGAFSVDVTSVTNAAFAAFVEATGYVTD 91

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE                RA IS V H                              Y T
Sbjct: 92  AE----------------RAGISAVFH-----------------------------GYAT 106

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
             G  +P    W             P  SW HP G DS +  +  HPVVHVS +DA+AYC
Sbjct: 107 APGEPVPETPWW----------LATPGASWRHPGGSDSDLTGKTTHPVVHVSHDDALAYC 156

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           +W G  LPTEA+WEY  RGGL    +PWG+    R E R N+++G FP      +G++ T
Sbjct: 157 SWAGRSLPTEAQWEYAARGGLHGARYPWGDRHPGREEPRCNIFRGVFPDR---PEGHVGT 213

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
             V +++ N +GLY  VGNVWEW AD ++  ++  + S +P GPT GT +V +GGS+LC+
Sbjct: 214 VDVRAFEPNGYGLYQCVGNVWEWCADRFSARYYRTSDSVDPTGPTRGTLRVLRGGSHLCH 273

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           + YC R+R AARS NTP+S+A N+GFR   D
Sbjct: 274 DSYCNRYRVAARSSNTPNSTASNIGFRTVGD 304


>gi|383458407|ref|YP_005372396.1| hypothetical protein COCOR_06443 [Corallococcus coralloides DSM
           2259]
 gi|380730928|gb|AFE06930.1| hypothetical protein COCOR_06443 [Corallococcus coralloides DSM
           2259]
          Length = 333

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 131/347 (37%), Positives = 171/347 (49%), Gaps = 63/347 (18%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           + DMV +PG T+ MG++        E P+  VT+  F++D+  V+N QF  FV ATGYVT
Sbjct: 27  FPDMVWIPGGTYWMGSDHHY---PEEAPAHQVTVSGFWMDRFTVTNEQFARFVEATGYVT 83

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            A++        PL   +      +            S +  +   PV      D     
Sbjct: 84  VAQR--------PLNPADYPGATPETLVP-------GSLVFQKPQGPV------DLGNVA 122

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          W Y           P   W HPEG  S+++HR +HPVVH+++ DA +Y
Sbjct: 123 NW----------WGY----------VPDACWKHPEGRRSSVKHRRDHPVVHIAFEDAASY 162

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWE   RGGL+   F WGN+ TP GEH AN WQG FP  N   DG+  
Sbjct: 163 AAWAGKDLPTEAEWERAARGGLDRNEFCWGNDFTPNGEHLANTWQGYFPWQNLREDGHEG 222

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKGPTT--GTD--- 300
           T PV ++  N +GL+ M GNVWEWT DW+   H            NP+GP T  G+    
Sbjct: 223 TCPVGAFPPNGYGLHEMTGNVWEWTTDWYQARHQGNKGKACCIPVNPRGPATEHGSQDLF 282

Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                   +V KGGS+LC   YC R+R AARS    DS A ++GFRC
Sbjct: 283 TSAITIPRRVLKGGSHLCAPNYCRRYRPAARSPQAVDSGASHIGFRC 329


>gi|118465519|ref|YP_884152.1| sulfatase-modifying factor 1 [Mycobacterium avium 104]
 gi|118166806|gb|ABK67703.1| sulfatase-modifying factor 1 [Mycobacterium avium 104]
          Length = 292

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 127/333 (38%), Positives = 175/333 (52%), Gaps = 50/333 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +++ LPG  F MG+N        E P   VT+  F +++H V+N QF EFV  TGYVT A
Sbjct: 4   ELIDLPGGAFLMGSNS---FYPEESPVHEVTVRPFSIERHPVTNAQFAEFVDQTGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           EK  D   F  + +E           D+K      + +      PV   +W     + TW
Sbjct: 61  EKALDPKDFPGVPAE-----------DLKP----GALVFTPTAGPVDLRAWRQ---WWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P   W  P G  S+IE R++HPVV V++ DA AY +
Sbjct: 103 -----------------------MPGACWRQPFGPGSSIEDRLDHPVVQVAYPDAAAYAS 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLP+EA+WEY  R G  +  + WG+++ P GE  AN WQG+FP  N  A G+  T+
Sbjct: 140 WAGRRLPSEAQWEYAARAGTSS-AYAWGDDVAPGGEIMANTWQGKFPYRNDGARGWTGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---APSYNPKGPT--TGTDKVKKGGSY 308
           PV ++  N FGL +M+GNVWEWT+  ++  HHP   APS  P  PT      +  KGGS+
Sbjct: 199 PVGTFAPNGFGLVDMIGNVWEWTSTRYSAGHHPGRAAPSCCPTSPTGDPAVLQTLKGGSH 258

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           LC  +YC+R+R AARS  + DS+  ++GFRC A
Sbjct: 259 LCAPEYCHRYRPAARSSQSQDSATTHIGFRCIA 291


>gi|384218660|ref|YP_005609826.1| hypothetical protein BJ6T_49760 [Bradyrhizobium japonicum USDA 6]
 gi|354957559|dbj|BAL10238.1| hypothetical protein BJ6T_49760 [Bradyrhizobium japonicum USDA 6]
          Length = 331

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 175/344 (50%), Gaps = 64/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +PG TFRMG+++       E P   VT+DAF++D H V+N QF+ F   TG+VT   
Sbjct: 24  MLYVPGGTFRMGSDR---HYPEEAPVHRVTVDAFWMDCHPVTNRQFKAFAKTTGHVT--- 77

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                              ++++  D K + G+                    + +  + 
Sbjct: 78  -------------------VAEIPPDPKDYPGI--------------------LPHMVYA 98

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           G+ + +   +    R   +   F  G+ W HP G  S I    +HPVVHV+++DA+AY  
Sbjct: 99  GSLMFSPPAYPVDLRDFSQWWTFAKGADWRHPYGPGSDIRGLDDHPVVHVAFSDALAYAR 158

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGL+   F WG+   P G H AN WQGEFP  NT  DGY  T+
Sbjct: 159 WAGKDLPTEAEWEFAARGGLDGAEFAWGDEFAPGGRHMANTWQGEFPRQNTGDDGYERTS 218

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----------------APSYNPKGPT 296
           PV ++  N +GL++++GNVWEWT+DW++  H                   A SY+P+   
Sbjct: 219 PVTAFPPNGYGLHDLIGNVWEWTSDWYSPRHEADAAKACCIPENPRGGREADSYDPRTIN 278

Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                KV KGGS+LC   YC R+R AAR     D+S  +LGFRC
Sbjct: 279 VKIPRKVIKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHLGFRC 322


>gi|448728409|ref|ZP_21710737.1| hypothetical protein C449_01471 [Halococcus saccharolyticus DSM
           5350]
 gi|445796891|gb|EMA47376.1| hypothetical protein C449_01471 [Halococcus saccharolyticus DSM
           5350]
          Length = 330

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 126/343 (36%), Positives = 168/343 (48%), Gaps = 62/343 (18%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +PG TFRMG+ +       E P R V++D F++++   +N +F  FV+ TGY T AE+  
Sbjct: 29  IPGGTFRMGSEE---FYPEERPIREVSIDGFWMNRTPTTNREFAAFVADTGYTTVAER-- 83

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
                +P  ++   A    +      F   D  +  R         W             
Sbjct: 84  -----DPDPTDYPDADPDALVAGSAVFRQPDGPVNRRDPS-----QW------------- 120

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
                 WEY           P   W HP G DS IE RM+HPVVHV++ DA A+  W G 
Sbjct: 121 ------WEY----------VPGAEWRHPLGPDSDIEGRMDHPVVHVAYEDAAAFAEWAGK 164

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
            LPTEA+WE   RGGLE   F WG+   P G+  AN WQGEFP  N   DGY  T+PV S
Sbjct: 165 TLPTEAQWERAARGGLEGTRFVWGDEHVPEGQLMANTWQGEFPQENKTLDGYERTSPVGS 224

Query: 258 YKENKFGLYNMVGNVWEWTADWWNV-----HHHPA-------------PSYNPKGPTTGT 299
           +  N FGLY+M GNVW+WT DW++         P+              S +P+ P+   
Sbjct: 225 FPANGFGLYDMAGNVWDWTQDWYSADPSTNDAEPSCCTPTGSRTVTEEQSVDPRDPSGIP 284

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            KV KGGS+LC   YC+R+R AAR     D+S  ++GFRC  +
Sbjct: 285 RKVLKGGSHLCAPNYCFRYRPAARYPEPVDTSTSHVGFRCVVE 327


>gi|351714038|gb|EHB16957.1| Sulfatase-modifying factor 1 [Heterocephalus glaber]
          Length = 286

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 127/332 (38%), Positives = 164/332 (49%), Gaps = 109/332 (32%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MGT+ P + +DGE P+R V +DAFY D +E+SN +F++FV++TGY+TEAE
Sbjct: 62  MVLIPAGVFIMGTDGPQIKQDGEAPARRVAVDAFYTDAYEISNAEFEKFVNSTGYLTEAE 121

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFG +FVFE +L +                                              
Sbjct: 122 KFGHSFVFEGMLRK---------------------------------------------- 135

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
             RLPTEA WE+ CRGGL+NRLFPWG+ L P+      +H  N                W
Sbjct: 136 --RLPTEAGWEHSCRGGLQNRLFPWGNKLQPKD-----QHYAN---------------LW 173

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G   P     E G +G      FP      P G                          
Sbjct: 174 QG-EFPVTNTGEDGFQGTALVDAFP------PNG-------------------------- 200

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
                   +G YN+VGNVWEWT DWW ++H    + N KGP +G D V+K GSY+C++ Y
Sbjct: 201 --------YGSYNIVGNVWEWTLDWWTIYHSVEETLNLKGPPSGKDLVRKAGSYMCHKSY 252

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           C R+RCAARSQN PDSSA NLGF CAA++ PT
Sbjct: 253 CCRYRCAARSQNAPDSSALNLGFCCAANRLPT 284



 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 90/195 (46%), Positives = 110/195 (56%), Gaps = 50/195 (25%)

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
           RLPTEA WE+ CRGGL+NRLFPWGN L P+ +H AN+WQGEFP  NT  DG+  TA V +
Sbjct: 136 RLPTEAGWEHSCRGGLQNRLFPWGNKLQPKDQHYANLWQGEFPVTNTGEDGFQGTALVDA 195

Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYR 317
           +  N +G YN+VGNVWEWT DWW ++H    + N                          
Sbjct: 196 FPPNGYGSYNIVGNVWEWTLDWWTIYHSVEETLN-------------------------- 229

Query: 318 HRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNT 377
                                    KGP +G D V+K GSY+C++ YC R+RCAARSQN 
Sbjct: 230 ------------------------LKGPPSGKDLVRKAGSYMCHKSYCCRYRCAARSQNA 265

Query: 378 PDSSAGNLGFRCAAD 392
           PDSSA NLGF CAA+
Sbjct: 266 PDSSALNLGFCCAAN 280


>gi|183981065|ref|YP_001849356.1| hypothetical protein MMAR_1043 [Mycobacterium marinum M]
 gi|183174391|gb|ACC39501.1| conserved protein [Mycobacterium marinum M]
          Length = 293

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 124/335 (37%), Positives = 175/335 (52%), Gaps = 50/335 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+         E P  +V++ AF ++++ V+N QF EFV+ATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTS---FYPEEAPIHSVSVHAFAIERYPVTNAQFAEFVAATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++  +  ++ R      R      +  D                        W
Sbjct: 61  EQPLDPALYPGVNPDDLRPGAMVFRPTAGPVDLRD------------------------W 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P  SW HP G DS ++ R +HPVV V++ DA AY  
Sbjct: 97  R-------QWWDWA----------PGASWRHPWGPDSDVDDRADHPVVQVAYPDAAAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RG      + WG+   P G+  AN WQG+FP  N  A G+  T+
Sbjct: 140 WAGRRLPTEAEWEYAARGATTT-TYAWGDEEKPAGQLMANTWQGKFPYRNDGALGWFGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGT-DKVKKGGSY 308
           PV ++  N FGL++M+GNVWEWT   ++ HH     P     P GP   T  +  KGGS+
Sbjct: 199 PVGTFPANGFGLFDMIGNVWEWTTTPFSAHHRIDQPPKACCAPAGPADPTISQTLKGGSH 258

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           LC  +YC+R+R AARS  + D++  ++GFRC AD+
Sbjct: 259 LCAPEYCHRYRPAARSPQSQDTATTHIGFRCVADR 293


>gi|387793100|ref|YP_006258165.1| hypothetical protein Solca_3996 [Solitalea canadensis DSM 3403]
 gi|379655933|gb|AFD08989.1| hypothetical protein Solca_3996 [Solitalea canadensis DSM 3403]
          Length = 392

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 122/350 (34%), Positives = 176/350 (50%), Gaps = 61/350 (17%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDG-------EFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           K MV +PG  F MG   P+ +++G         P   V +D FY+D+ EV+N +F  FV 
Sbjct: 82  KGMVFIPGGEFSMGGVNPVGMENGGQQNMNDARPIHRVYVDGFYMDETEVTNAEFTAFVK 141

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           ATGY+T AE+      F P   EE     S V                    P   +S N
Sbjct: 142 ATGYITIAERKPTREEF-PGAPEENLVAGSVV-----------------FTPPDHDVSLN 183

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
           D   +  W          W+Y  +G           W HP G  S I+ + N+PVVH++W
Sbjct: 184 D---HYQW----------WQY-VKGA---------DWKHPLGPQSNIQGKDNYPVVHIAW 220

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
            DA AY  W G RLPTEAEWE+  RGG   +++ WGN   P G+  AN++QG+FP ++  
Sbjct: 221 EDAQAYAKWAGKRLPTEAEWEFAARGGDFGKMYAWGNQFKPDGKWMANIFQGKFPVSDEG 280

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYNPKGPTTGTD 300
            DG++  AP+  +  + +GLY++ GNVWEW  DW+   +     H   + NP+GP +  D
Sbjct: 281 KDGFVGIAPIKQFSASTYGLYDIAGNVWEWCEDWYRPDYYEELAHRDVARNPQGPDSAFD 340

Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                   KV++GGS+LC +QYC R+    R +    S++ ++G RC  D
Sbjct: 341 PMEPNEKKKVQRGGSFLCTDQYCTRYMVGTRGKGEFRSASNHVGLRCVKD 390


>gi|417747829|ref|ZP_12396287.1| hypothetical protein MAPs_38570 [Mycobacterium avium subsp.
           paratuberculosis S397]
 gi|336460655|gb|EGO39546.1| hypothetical protein MAPs_38570 [Mycobacterium avium subsp.
           paratuberculosis S397]
          Length = 296

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 127/337 (37%), Positives = 171/337 (50%), Gaps = 50/337 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++  LPG +FRMG+         E P   VT++ F +++H V+N QF EFV+ATGYVT 
Sbjct: 3   SELADLPGGSFRMGSTS---FYPEEAPIHTVTVEPFAIERHPVTNAQFAEFVAATGYVTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++         A  SQ++     F      ++ R                  
Sbjct: 60  AEQPIDPALYP-------GADPSQLQPGAMVFRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR        +W +           P  SW HP G DS +  R +HPVV V+  DA AY 
Sbjct: 96  WR--------QWWH---------WVPGASWRHPFGPDSDVADRADHPVVQVAHPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     +PWG+  T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAGATT-TYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV  +  N FGL +M+GNVWEWT   ++ HH       P   P GP     ++  KGGS
Sbjct: 198 SPVGVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPSVNQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           +LC  +YC+R+R AARS  + DS+  ++GFRC A+ G
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAELG 294


>gi|319954217|ref|YP_004165484.1| sulfatase-modifying factor protein [Cellulophaga algicola DSM
           14237]
 gi|319422877|gb|ADV49986.1| Sulphatase-modifying factor protein [Cellulophaga algicola DSM
           14237]
          Length = 376

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 128/343 (37%), Positives = 176/343 (51%), Gaps = 50/343 (14%)

Query: 15  MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV +PG  F+ G   +  +  D E PS  V +D F++D  EV+N QF +FV  TGY+T A
Sbjct: 60  MVWIPGGAFQQGAVPQDKMAMDHEKPSHKVQVDGFFMDISEVTNAQFAKFVKETGYITVA 119

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D    +  L E      +   HD     G  + I  +    V ++       +  W
Sbjct: 120 EREIDWEEMKTQLPEG-----TPKPHDSILKPG--ALIFKKTKKSVPNL-----YDFSQW 167

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                    EW+ G              W HP G  S+I  + N PVV VS+ DA+AYCT
Sbjct: 168 W--------EWKIGA------------DWKHPNGPKSSIVGKDNEPVVQVSYEDALAYCT 207

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE   RG     ++ WG+++  +    AN W+GEFP NN+  DG+   A
Sbjct: 208 WAGRRLPTEAEWERAARGNHAETIYFWGDDVD-KLATMANTWEGEFPVNNSKLDGFERRA 266

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKGPTTG 298
            VMSY  N FGLY+M GNVWEWT DW++ ++                P  +YNP  P   
Sbjct: 267 SVMSYPANDFGLYDMAGNVWEWTTDWYSTNYFKEVANDMVLAKNPQGPTKTYNPNNP-YA 325

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            +KV KGGS+LC+  YC  +R ++R  ++PDS A ++GFR  A
Sbjct: 326 IEKVIKGGSFLCSASYCASYRISSRMGSSPDSGAEHVGFRTVA 368


>gi|154248609|ref|YP_001419567.1| hypothetical protein Xaut_4691 [Xanthobacter autotrophicus Py2]
 gi|154162694|gb|ABS69910.1| protein of unknown function DUF323 [Xanthobacter autotrophicus Py2]
          Length = 306

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 172/343 (50%), Gaps = 62/343 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF MG++   L    E P R V +D F++D+  V+N  F  FV+ TG+V+ AE
Sbjct: 1   MVWIPGGTFHMGSDAHYL---EEAPVREVAVDGFWMDRCPVTNRDFARFVAETGHVSLAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +        P + +   A    +      F   D+                D+  +  W 
Sbjct: 58  QV-------PSIEDYPGADPLLLVAGSLVFVTPDAP--------------GDSADWTRW- 95

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W          R     +W HP+G DS+IE   +HPVVHV   DA+AY  W
Sbjct: 96  ---------W----------RFVAGANWRHPQGPDSSIEGLEDHPVVHVCHGDALAYARW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G  LP+EA+WEY  RGGL+   F WG+ L P G HRAN WQG FP  N A DGY  T+P
Sbjct: 137 QGKELPSEAQWEYAARGGLDRAEFAWGDALVPDGVHRANTWQGHFPDENLALDGYAGTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----------------SYNPKGPTT 297
           V ++  N +GL +M+GNVWEWT DW+  H   A                  S +P+ P +
Sbjct: 197 VTAFPPNGYGLLDMIGNVWEWTDDWYRPHGASAQGKACCLPRNPRGGSAEESRDPEAPGS 256

Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               KV KGGS+LC   YC R+R AAR   T D+++ ++GFRC
Sbjct: 257 ALPRKVLKGGSHLCAPNYCRRYRPAARQPQTIDTASVHIGFRC 299


>gi|410224498|gb|JAA09468.1| sulfatase modifying factor 2 [Pan troglodytes]
          Length = 323

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 133/340 (39%), Positives = 172/340 (50%), Gaps = 73/340 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ---EFVSATGYVT 71
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F+   +FV    Y T
Sbjct: 49  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRYIRDFVREKKYRT 107

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAE FG +FVFE  +S+E R K +Q                     P+  + W       
Sbjct: 108 EAEIFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW------- 139

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W    LP E             R F    W  P G  S I  R+ HPV+HVSWNDA AY
Sbjct: 140 -W----LPVE-------------RAF----WRQPAGPGSGILERLEHPVLHVSWNDARAY 177

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
           C WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+  
Sbjct: 178 CAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHG 234

Query: 252 TAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
            +PV ++  +N +GLY+++GNVWEWTA             +P        +V +G S++ 
Sbjct: 235 VSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWID 281

Query: 311 NEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
                  H  R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 282 TADGSANHQARVTTRMGNTPDSASDNLGFRCAADAGRPPG 321



 Score = 39.7 bits (91), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++        H  R   R  NTPDS++ NLGFRCAAD 
Sbjct: 272 RVLRGASWIDTADGSANHQARVTTRMGNTPDSASDNLGFRCAADA 316


>gi|73542580|ref|YP_297100.1| hypothetical protein Reut_A2895 [Ralstonia eutropha JMP134]
 gi|72119993|gb|AAZ62256.1| Protein of unknown function DUF323 [Ralstonia eutropha JMP134]
          Length = 409

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 128/351 (36%), Positives = 170/351 (48%), Gaps = 58/351 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG++   + +  E P+  V +  F++DQH V+N +F++FV ATGYVT AE
Sbjct: 77  MVHVPGGEFLMGSDS-KMAQANEKPAHKVRIHGFWMDQHHVTNAEFRKFVEATGYVTTAE 135

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           K  D    +  L     R   S +      F G    +  +               Y  W
Sbjct: 136 KSPDWETLKVQLPPGTPRPPASAMVAGAMVFVGTSRPVPLQ--------------DYSRW 181

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y           P   W HP G  ++IE + NHPVV VS+ DA AY  
Sbjct: 182 ----------WRY----------VPGADWRHPGGPGTSIEGKDNHPVVQVSYEDAQAYAK 221

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG    P G+  ANVWQG+    FP  +  A G 
Sbjct: 222 WAGKRLPTEAEWEFAARGGLEQATYAWGEKFAPDGKQMANVWQGQQGQPFPVISPKAGGA 281

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-------------VHHHPAP--SYNPKG 294
           L T+PV ++  N +GLY+M GN W+W ADW+              V   P P  S++P  
Sbjct: 282 LGTSPVGTFPANGYGLYDMTGNAWQWVADWYRADQFRREAAQGRLVDDPPGPLASWDPSE 341

Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
              P     +V +GGS+LCNE YC  +R +AR    P +S  +LGFR   D
Sbjct: 342 PGVPVGAPKRVTRGGSFLCNEDYCLSYRPSARRGTDPYNSMSHLGFRLVMD 392


>gi|348518876|ref|XP_003446957.1| PREDICTED: sulfatase-modifying factor 2-like [Oreochromis
           niloticus]
          Length = 308

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 126/335 (37%), Positives = 174/335 (51%), Gaps = 71/335 (21%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           ++MV +PG    MGT+ P   +DGE PS+ V L  F +D++ V+N  F++FV A  Y TE
Sbjct: 30  EEMVNIPGGKMLMGTSAPDG-RDGESPSKEVELQPFKIDRYPVTNADFRDFVRAQKYKTE 88

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE FG +FVF+  +SEE ++K++Q                               +    
Sbjct: 89  AETFGWSFVFQDFVSEELKSKVTQ------------------------------RIESAP 118

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W    LP E             R+F    W  P G  S I  R++ PVV VSWNDA A+C
Sbjct: 119 WW---LPIE-------------RVF----WRQPSGPGSGIRDRLDFPVVQVSWNDAQAFC 158

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           +W+  RLPTE EWE+  RGGL  R +PWGN       +R N+WQG FP  +TA DGY   
Sbjct: 159 SWKSKRLPTEEEWEWAARGGLRGRTYPWGNKFQ---ANRTNLWQGSFPDGDTAEDGYHGI 215

Query: 253 APVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL-- 309
           +PV +Y  +N +GLY+M+GN WEWT+  ++      P Y           V +G S++  
Sbjct: 216 SPVTAYPPQNSYGLYDMMGNTWEWTSTRFSAAR---PMY-----------VLRGASWIDT 261

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
            +    ++ R   R  NTPDS++ NLGFRCAA  G
Sbjct: 262 VDGSANHKARITTRMGNTPDSASDNLGFRCAASDG 296


>gi|375137877|ref|YP_004998526.1| hypothetical protein [Mycobacterium rhodesiae NBB3]
 gi|359818498|gb|AEV71311.1| hypothetical protein MycrhN_0676 [Mycobacterium rhodesiae NBB3]
          Length = 292

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 122/332 (36%), Positives = 174/332 (52%), Gaps = 49/332 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +V LPG +FRMG  +       E P+   T+ A  +++H V+N QF +F+ ATGY+T AE
Sbjct: 5   LVELPGGSFRMGATQ---FYPEEAPAHTATVAAIAVERHPVTNAQFADFIDATGYLTVAE 61

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D  ++ P +  ++    + V      F      ++ R                  WR
Sbjct: 62  QPMDPALY-PGVPPQDLVPGALV------FRPTSGPVDLR-----------------DWR 97

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W++           P  +W HP G DS I+ R++HPVV V++ DA AY  W
Sbjct: 98  -------QWWDWA----------PGANWRHPFGPDSEIDGRLDHPVVQVAYPDAAAYARW 140

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY  R G     +PWG ++ P G   AN WQG+FP  N+ A G++ T+P
Sbjct: 141 AGRRLPTEAEWEYAARAG-ATTTYPWGEDVAPDGRLMANTWQGDFPYRNSGALGWIGTSP 199

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD----KVKKGGSYLC 310
           V ++  N FGL +M+GNVWEWT   ++ HH  A       P    D    +  KGGS+LC
Sbjct: 200 VGAFAPNAFGLVDMIGNVWEWTTTEFSAHHRLAGRKGCCTPGAAADPTVNQTLKGGSHLC 259

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             +YC+R+R AARS  + DS+  ++GFRC AD
Sbjct: 260 APEYCHRYRAAARSPQSQDSATTHIGFRCVAD 291


>gi|375147002|ref|YP_005009443.1| sulfatase-modifying factor protein [Niastella koreensis GR20-10]
 gi|361061048|gb|AEW00040.1| Sulphatase-modifying factor protein [Niastella koreensis GR20-10]
          Length = 326

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 130/363 (35%), Positives = 175/363 (48%), Gaps = 67/363 (18%)

Query: 3   LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           +L    V+   +MV +PG  F MG++K       E P   VT+D F++D++ V+N +F++
Sbjct: 8   VLSDAKVKDQTNMVFIPGGKFMMGSDK---FYPEEKPVHAVTVDGFWIDKYAVTNEEFKK 64

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEE-ERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           F+  TGYVT AE+        PL +E+   AK   +      F+     ++         
Sbjct: 65  FIDETGYVTVAER--------PLKAEDYPGAKPDMLAPGALVFQKAKGAVDL-------- 108

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
              ND   +  W                        P  SW HP+G  + ++    HPVV
Sbjct: 109 ---NDYFNWWAW-----------------------VPGTSWKHPKGPKTDLKGIEKHPVV 142

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
           HV++ DA AY TW G  LPTEA+WEY  RGGLE   F WGN  T + +  AN WQG FP 
Sbjct: 143 HVAYEDAEAYTTWAGKELPTEAQWEYAARGGLEGMDFTWGNEDTQQTKPMANTWQGIFPY 202

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH----------------- 284
            N   D Y  T+PV S+  N +GLY+M GNVWEWT+DW+  H                  
Sbjct: 203 QNLLIDKYEGTSPVGSFAPNGYGLYDMAGNVWEWTSDWYVAHLDELANKVKTCCTTHVNP 262

Query: 285 ---HPAPSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
               P  SY+P  PT     KV KGGS+LC   YC R+R AAR     D+   +LGFRC 
Sbjct: 263 RVVSPEASYDPCQPTIKIPRKVVKGGSHLCAPNYCLRYRPAARQPQMIDTGMSHLGFRCI 322

Query: 341 ADK 343
             +
Sbjct: 323 VKQ 325


>gi|440747279|ref|ZP_20926538.1| Sulfatase modifying factor 1 precursor [Mariniradius
           saccharolyticus AK6]
 gi|436484199|gb|ELP40203.1| Sulfatase modifying factor 1 precursor [Mariniradius
           saccharolyticus AK6]
          Length = 351

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 130/345 (37%), Positives = 173/345 (50%), Gaps = 54/345 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G +F MGT++ +     E P+    +  F++D+ EV+N QF +FV  TGYVT AE
Sbjct: 42  MVWIEGGSFWMGTDE-LEAYAVERPAVQREVKGFWMDETEVTNAQFSKFVEETGYVTVAE 100

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLD----STIEHRMHHPVVHISWNDAVAY 130
           +          +  EE  K  Q+  D  +         S +    +HPV   S ND   +
Sbjct: 101 R---------KIDWEELKK--QLPPDTPKLPDDQLQPGSLVFSPPNHPVSTQS-NDISQW 148

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W                           SW HPEG +S+I  R NHPVVH+++ DA A
Sbjct: 149 WKWVNG-----------------------ASWKHPEGPNSSIAGRENHPVVHIAFEDAEA 185

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G RLPTEAEWEY  + G E   + WG    P G+  AN +QG+FP  N   D + 
Sbjct: 186 YAKWAGKRLPTEAEWEYAAKAGKEKVRYAWGTEFRPNGQFMANTFQGDFPHQNLVEDNFA 245

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APS--------YNPKGPTT 297
            TAPV S++ N FGLY+M+GNVWE TADW++          AP         YNP  P  
Sbjct: 246 GTAPVKSFQANGFGLYDMIGNVWELTADWYDAIQFSRVAGIAPPLDSAMNKCYNPTNP-Y 304

Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             +KV KGGS+LC + YC  +R +AR  +  DS   N+GFRC  D
Sbjct: 305 AQEKVIKGGSFLCADDYCINYRPSARQGHAYDSGTSNVGFRCVKD 349


>gi|308210762|ref|NP_001016106.2| sulfatase modifying factor 2 precursor [Xenopus (Silurana)
           tropicalis]
          Length = 317

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 130/346 (37%), Positives = 178/346 (51%), Gaps = 64/346 (18%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E   +MV L G  F MGT K    KDGE P R V +  F +D++ V+N  F+EFV A  Y
Sbjct: 31  EDADNMVQLDGGQFDMGT-KASDGKDGESPVRQVKVLPFAVDKYPVTNKDFREFVRAKKY 89

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            TEAE FG +FVFE  +SEE + K++Q                      +    W     
Sbjct: 90  KTEAEAFGWSFVFEDFVSEELKKKVTQ---------------------KLESAPW----- 123

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W    LP E             R F    W  P G  S I+ ++++PVV VSWNDA 
Sbjct: 124 ---W----LPVE-------------RAF----WRQPAGPASGIKEKLDYPVVQVSWNDAQ 159

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AYC W G RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG FP  +TA DGY
Sbjct: 160 AYCKWLGKRLPTEEEWEFAARGGLKGKVYPWGNTFKP---NRTNLWQGTFPGMDTATDGY 216

Query: 250 LSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
              +PV ++  +N++GLY+M+GN WEWTA  +        S   +        V +G S+
Sbjct: 217 HGASPVAAFPAQNEYGLYDMLGNTWEWTATEYT-------SARAQKQNKENMYVLRGASW 269

Query: 309 L--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
           +   +    ++ R   R  NTPDS++ NL FRCA ++  +   DK+
Sbjct: 270 IDTVDGSANHKARVTTRMGNTPDSASDNLSFRCAKNRRKSKTEDKM 315


>gi|113867611|ref|YP_726100.1| hypothetical protein H16_A1600 [Ralstonia eutropha H16]
 gi|113526387|emb|CAJ92732.1| uncharacterized conserved protein [Ralstonia eutropha H16]
          Length = 414

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 126/353 (35%), Positives = 169/353 (47%), Gaps = 58/353 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG++   + +  E P+  V + AF++DQH V+N +F++FV ATGYV+ AE
Sbjct: 75  MVRVPGGEFLMGSDS-KMAQPNEKPAHKVRVHAFWMDQHHVTNAEFRKFVEATGYVSTAE 133

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +  D       L     R   S +      F G    +  +               Y  W
Sbjct: 134 RAPDWETLRVQLPPGTPRPPDSAMVAGGMVFVGTPRPVPLQ--------------DYSRW 179

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y           P   W HP G  S+IE + NHPVV VS+ DA AY  
Sbjct: 180 ----------WRY----------VPGADWRHPTGPGSSIEGKDNHPVVQVSYEDAQAYAK 219

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG    P G+  ANVWQG+    FP  +  A G 
Sbjct: 220 WAGKRLPTEAEWEFAARGGLEQATYAWGEQFAPDGKQMANVWQGQQNQPFPVVSPKAGGA 279

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH------HPAPSYNPKGPTTGTD--- 300
           L T+PV ++  N +GL +M GN W+W +DW+             P  NP GP    D   
Sbjct: 280 LGTSPVGTFPPNGYGLADMTGNAWQWVSDWYRADQFRREAGKAQPIENPVGPMASWDPAE 339

Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
                    +V +GGS+LCNE +C  +R +AR    P +S  +LGFR   D+G
Sbjct: 340 PGVPVNAPKRVTRGGSFLCNEDFCLSYRPSARRGTDPYNSMSHLGFRLVMDEG 392


>gi|291234206|ref|XP_002737040.1| PREDICTED: sulfatase-modifying factor 2-like [Saccoglossus
           kowalevskii]
          Length = 340

 Score =  207 bits (527), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 123/337 (36%), Positives = 176/337 (52%), Gaps = 64/337 (18%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           ++++++MV L G  F MGT++P   +DGE PSR VT+  F + ++ V+N  F++FV A  
Sbjct: 45  LKKFEEMVELKGGKFIMGTDEPNA-RDGEGPSRKVTVKPFKIMRYPVTNAAFRKFVKAKK 103

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           + TEAE+F  +FVF+  +SEE +A I+                                 
Sbjct: 104 FKTEAERFQWSFVFDAFVSEEVKANITD-------------------------------- 131

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                   R+P    W             P   W  PEG  STI  R+N P +H+S NDA
Sbjct: 132 --------RIPNTPWW----------VPVPRAYWRQPEGQGSTITKRLNWPALHISANDA 173

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
             YC W G RLPTE EWEY  RGGLE++ +PWG +   +   R N WQG+FP  NT  DG
Sbjct: 174 KQYCEWNGWRLPTEPEWEYAARGGLESKRYPWGASFDNK---RLNTWQGKFPEENTKEDG 230

Query: 249 YLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
           Y  ++PV +++ +N +G+Y+MVGN+WEWT+  +   +       P         V +GGS
Sbjct: 231 YHGSSPVDAFEPQNDYGMYDMVGNIWEWTSTKFKSQNQGQAVEIPM-------LVLRGGS 283

Query: 308 YLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAAD 342
           Y+  +   + H  R   R  NT D+S+ NLGFRCA+D
Sbjct: 284 YIDTKDGEFNHIVRVTTRMGNTADASSDNLGFRCASD 320



 Score = 42.7 bits (99), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 2/44 (4%)

Query: 352 VKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
           V +GGSY+  +   + H  R   R  NT D+S+ NLGFRCA+DV
Sbjct: 278 VLRGGSYIDTKDGEFNHIVRVTTRMGNTADASSDNLGFRCASDV 321


>gi|429206654|ref|ZP_19197918.1| Sulfatase modifying factor 1 precursor protein [Rhodobacter sp.
           AKP1]
 gi|428190240|gb|EKX58788.1| Sulfatase modifying factor 1 precursor protein [Rhodobacter sp.
           AKP1]
          Length = 297

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 127/334 (38%), Positives = 169/334 (50%), Gaps = 53/334 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ LPG  F MG+N        E P    +++ F++D   V+N  +  FVSATGYVT A 
Sbjct: 1   MIWLPGAVFYMGSNS---HYPEEAPVHLASVEGFWIDVFPVTNADYARFVSATGYVTVA- 56

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                          ER+  ++   D      +  ++  R    V      D   +  W 
Sbjct: 57  ---------------ERSPGAEDYPDADPALLVPGSLVFRRPASV-----EDPGDWTRW- 95

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y               W HP+G  S +  R++HPVVHV   DA+AY  W
Sbjct: 96  ---------WSYQSGA----------CWRHPDGPGSDVAGRLDHPVVHVCHEDALAYAHW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWEY  RGGL+   F WG+ L P G HRAN+WQG+FP  N A DG+  T+P
Sbjct: 137 AGKDLPTEAEWEYAARGGLDRAEFAWGDVLVPEGRHRANIWQGQFPVENLAEDGFEGTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPS-YNPKGPTTG------TDKVKKG 305
           V ++  N +GLY+M+GNVWEWT+D +   H    AP    P  P +G        +V KG
Sbjct: 197 VRAFPPNGYGLYDMIGNVWEWTSDGFRADHGGRSAPGCCAPISPASGGHVLPEARQVLKG 256

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           G +LC   YC R+R AAR   TPDS++G++GFRC
Sbjct: 257 GYHLCAPNYCRRYRPAARQGQTPDSASGHIGFRC 290


>gi|158424484|ref|YP_001525776.1| hypothetical protein AZC_2860 [Azorhizobium caulinodans ORS 571]
 gi|158331373|dbj|BAF88858.1| hypothetical protein AZC_2860 [Azorhizobium caulinodans ORS 571]
          Length = 332

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 134/353 (37%), Positives = 172/353 (48%), Gaps = 63/353 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G TF MG++        E P   V +D F++D+  V+N  F  FV  TG+VT AE
Sbjct: 24  MVWLAGATFHMGSDD---HYPEEAPVHPVRVDGFWIDRFPVTNRDFARFVEDTGHVTVAE 80

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      + PLL        S V      F   ++                         
Sbjct: 81  QVPSPADY-PLLDPHLLVAGSLV------FSPPEAQ------------------------ 109

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
               P  A W+   R            W HP+G  S++E   +HPVVHV++ DA AY  W
Sbjct: 110 ----PGPAAWDQWWR------FVEGADWRHPKGPGSSLEGLEDHPVVHVAFADAQAYARW 159

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
               LPTEAEWEY  RGGL+   F WGN L P G HRAN WQG FP  N A DG+  T+P
Sbjct: 160 CHKALPTEAEWEYAARGGLDRAPFAWGNELVPGGIHRANTWQGAFPVENLAEDGFAGTSP 219

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA---PSYNPKGPTTGTD----------- 300
           V ++  N FGL++M+GNVWEWT DW+   H P    P   P  P  G++           
Sbjct: 220 VDAFPANGFGLHDMIGNVWEWTDDWYQPRHDPPAARPCCVPHNPRGGSEAGSRHADILGA 279

Query: 301 ----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
               KV KGGS+LC   YC R+R AARS  T D+++G++GFRC   +GP   T
Sbjct: 280 ALPRKVLKGGSHLCAPNYCQRYRPAARSPQTIDTASGHIGFRC-VRRGPKQET 331


>gi|430809050|ref|ZP_19436165.1| C-alpha-formyglycine- generating protein [Cupriavidus sp. HMR-1]
 gi|429498565|gb|EKZ97072.1| C-alpha-formyglycine- generating protein [Cupriavidus sp. HMR-1]
          Length = 386

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 170/351 (48%), Gaps = 58/351 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG++   + +  E P+  V + AF++DQH V+N +F++FV ATGY+T AE
Sbjct: 46  MVHIPGGEFLMGSDHK-MSQANERPAHKVQVKAFWMDQHHVTNAEFRKFVEATGYLTTAE 104

Query: 75  KFGDTFVFEPLL-SEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +  D       L +   R   S +      F G +S +  R               Y  W
Sbjct: 105 RKPDWETLRVQLPAGTPRPPDSAMVAGGMVFVGTNSPVPLR--------------EYWRW 150

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P   W HP G  S+IE + NHPVV VS+ DA AY  
Sbjct: 151 ----------W----------RFVPGADWRHPTGPGSSIEGKDNHPVVQVSYEDAQAYAK 190

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG+   P G   ANVWQG+    FP  +  A G 
Sbjct: 191 WAGKRLPTEAEWEFAARGGLEQATYAWGDKFAPDGRQMANVWQGQQVQPFPVVSAKAGGA 250

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH------PAPSYNPKGPTTGTD--- 300
             T+ + ++  N +GLY+M GN W+W ADW+            A   NP GPT   D   
Sbjct: 251 AGTSAIGTFPANGYGLYDMTGNAWQWVADWYRADQFRREATVAAVLQNPTGPTDSWDPTE 310

Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                    +V +GGS+LCNE +C  +R +AR    P +S  +LGFR   D
Sbjct: 311 PGVPVSAPKRVTRGGSFLCNEDFCLSYRPSARRGTDPYTSMSHLGFRLVMD 361


>gi|334342830|ref|YP_004555434.1| sulfatase-modifying factor protein [Sphingobium chlorophenolicum
           L-1]
 gi|334103505|gb|AEG50928.1| Sulphatase-modifying factor protein [Sphingobium chlorophenolicum
           L-1]
          Length = 313

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 132/346 (38%), Positives = 175/346 (50%), Gaps = 67/346 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+PG  F MG+++       E P R V++D+F++D+  V+N QF +FV+ATG++T AE
Sbjct: 6   MRLVPGGVFSMGSDR---FYPEEAPVREVSVDSFWIDETPVTNRQFADFVAATGHITMAE 62

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                    P + +      S        FE     ++                      
Sbjct: 63  T-------PPNVEDYPGMPASMAVAGSLVFEPTAGPVD---------------------- 93

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L   ++W +  R G +        W HP G DS+ E   +HPVVH++W DAVAY  W
Sbjct: 94  ---LADVSQW-WHFRFGAD--------WRHPLGPDSSTEGLEDHPVVHIAWADAVAYAEW 141

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE+  RGGL    F WG+ L P G   AN WQG FP +N   DG+  T+P
Sbjct: 142 AGKRLPTEAEWEFATRGGLHGTDFAWGDELAPAGAMLANYWQGLFPFSNQLLDGWARTSP 201

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH--HHPA------PSYNPKG------------ 294
           V  Y  N +GLY+M+GNVWEWT+DW+  H  H  A      PS NP+G            
Sbjct: 202 VRHYPANGYGLYDMIGNVWEWTSDWFAPHKLHRKAKGSCCIPS-NPRGGLERESYDVSVP 260

Query: 295 -PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            P  G  KV KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 261 TPRIGR-KVLKGGSHLCAPNYCQRYRPAARHAQPIDTSTSHVGFRC 305


>gi|343926957|ref|ZP_08766447.1| putative sulfatase-modifying factor [Gordonia alkanivorans NBRC
           16433]
 gi|343763116|dbj|GAA13373.1| putative sulfatase-modifying factor [Gordonia alkanivorans NBRC
           16433]
          Length = 267

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 123/309 (39%), Positives = 162/309 (52%), Gaps = 59/309 (19%)

Query: 35  DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
           DGE P  +V + AF +D   V+N  F  FV ATGYVT+AE                RA I
Sbjct: 12  DGEVPVHDVEVGAFSVDVTSVTNAAFAAFVEATGYVTDAE----------------RAGI 55

Query: 95  SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
           S V H                              Y T  G  +P    W          
Sbjct: 56  SAVFH-----------------------------GYATAPGEPVPETPWW---------- 76

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
              P  SW HP G DS +  +  HPVVHVS +DA+AYC+W G  LPTEA+WEY  RGGL 
Sbjct: 77  LATPGASWRHPGGPDSDLTGKTTHPVVHVSHDDALAYCSWAGRSLPTEAQWEYAARGGLH 136

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
              +PWG+    R E R N+++G FP      +G++ T  V +++ N +GLY  VGNVWE
Sbjct: 137 GARYPWGDRHPGREEPRCNIFRGVFPDR---PEGHVGTVDVRAFEPNGYGLYQCVGNVWE 193

Query: 275 WTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
           W AD ++  ++  + S +P GPT GT +V +GGS+LC++ YC R+R AARS NTP+S+A 
Sbjct: 194 WCADRFSARYYRTSDSVDPTGPTRGTLRVLRGGSHLCHDSYCNRYRVAARSSNTPNSTAS 253

Query: 334 NLGFRCAAD 342
           N+GFR   D
Sbjct: 254 NIGFRTVGD 262


>gi|376317284|emb|CCG00652.1| sulfatase-modifying factor 1 [uncultured Flavobacteriia bacterium]
          Length = 339

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 119/344 (34%), Positives = 178/344 (51%), Gaps = 52/344 (15%)

Query: 15  MVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV +PG  +  G +K   +    E P  NV +D FY+D  EV+N QF +F+ AT Y+T A
Sbjct: 22  MVWIPGGVYEQGASKNDKMALPHEKPKHNVRIDGFYMDITEVTNAQFLKFIEATNYITTA 81

Query: 74  EKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           E+  D + +   L +   +   S ++     F+   +++ +                +  
Sbjct: 82  EQPIDWSLIASQLPAGTPKPHDSILQPGSLLFKKTKTSVPNLYD-------------FSQ 128

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W         +W  G             +W  P G  S+I  + NHPVVHVS+ DA AYC
Sbjct: 129 WW--------KWSIGT------------NWTQPNGKGSSIIGKENHPVVHVSFEDAQAYC 168

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  RGG +N+++ WG +         N W+GEFP +N+ ADG+  +
Sbjct: 169 KWAGRRLPTEAEWEYAARGGKKNKIYFWG-DFVENLSSNVNSWEGEFPVSNSQADGFEKS 227

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA---------------PSYNPKGPTT 297
           APV ++  N FGL+ + GNVWEWT+DW+NV+++                  +YNP  P  
Sbjct: 228 APVKTFPPNDFGLHEISGNVWEWTSDWYNVNYYATLASNGEVSNNPKGANSAYNPNNPQI 287

Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            ++K+ +GGS+LCN  YC  +R ++R    P++S  +LGFR  A
Sbjct: 288 -SEKIIRGGSFLCNASYCASYRVSSRMATDPNTSLEHLGFRTVA 330


>gi|443489538|ref|YP_007367685.1| Formylglycine-generating sulfatase enzyme [Mycobacterium liflandii
           128FXT]
 gi|442582035|gb|AGC61178.1| Formylglycine-generating sulfatase enzyme [Mycobacterium liflandii
           128FXT]
          Length = 293

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 124/335 (37%), Positives = 175/335 (52%), Gaps = 50/335 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+         E    +V++ AF ++++ V+N QF EFV+ATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTS---FYPEEASIHSVSVHAFAIERYPVTNAQFAEFVAATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++  +  ++ R      R      +  D                        W
Sbjct: 61  EQPLDPALYPGVNPDDLRPGAMVFRPTAGPVDLRD------------------------W 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P  SW HP G DS ++ R +HPVV V++ DAVAY  
Sbjct: 97  R-------QWWDWA----------PGASWRHPWGPDSDVDDRADHPVVQVAYPDAVAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RG      + WG+   P G+  AN WQG+FP  N  A G+  T+
Sbjct: 140 WAGRRLPTEAEWEYAARGATTT-TYAWGDEEKPAGQLMANTWQGKFPYRNDGALGWFGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGT-DKVKKGGSY 308
           PV ++  N FGL++M+GNVWEWT   ++ HH     P     P GP   T  +  KGGS+
Sbjct: 199 PVGTFPANGFGLFDMIGNVWEWTTTPFSAHHRIDQPPKACCAPAGPADPTISQTLKGGSH 258

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           LC  +YC+R+R AARS  + D++  ++GFRC AD+
Sbjct: 259 LCAPEYCHRYRPAARSPQSRDTATTHIGFRCVADR 293


>gi|399576771|ref|ZP_10770526.1| hypothetical protein HSB1_25650 [Halogranum salarium B-1]
 gi|399238215|gb|EJN59144.1| hypothetical protein HSB1_25650 [Halogranum salarium B-1]
          Length = 323

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 130/344 (37%), Positives = 174/344 (50%), Gaps = 60/344 (17%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            D+V +PG  F MG+++       E P+R V +D F++D+  V+   F+EFV ATGYVT 
Sbjct: 18  DDVVWIPGGEFSMGSDE---HYPEERPARQVGVDGFWIDRTPVTTAAFREFVDATGYVTL 74

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+        P   E   A  + +      F   D  ++ R        +W        
Sbjct: 75  AER-------PPRPEEYPDADPALLVPGSAVFVPPDGPVDLRNPS-----NW-------- 114

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                      W+Y           P  +W  P G +S +   ++HPVVHV++ DAVAY 
Sbjct: 115 -----------WQY----------VPDANWRQPLGPESDLSEWLDHPVVHVAYEDAVAYA 153

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAE E   RGGL+   F WG+ LTP GE  AN WQG FP  NT ADG++ T
Sbjct: 154 EWAGKRLPTEAEHERASRGGLDGATFAWGDELTPEGELMANTWQGTFPWQNTGADGHVRT 213

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNV-----------HHHP-----APSYNPKGPT 296
           +PV S+  N +GL + +GNVWEWT+DW+ V             +P     A S +P  P 
Sbjct: 214 SPVGSFPANGYGLVDTIGNVWEWTSDWFAVGGGDSTPTCCTPENPRGGTEAASVDPFDPA 273

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
               KV KGGS+LC   YC R+R AAR     D+S  ++GFRCA
Sbjct: 274 RTPRKVLKGGSHLCAPNYCQRYRPAARYPEPVDTSTSHVGFRCA 317


>gi|373252951|ref|ZP_09541069.1| hypothetical protein NestF_08584 [Nesterenkonia sp. F]
          Length = 327

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 124/331 (37%), Positives = 159/331 (48%), Gaps = 70/331 (21%)

Query: 15  MVLLPGDTFRMGTNKPILI-KDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ L   TFRMG+   +    DGE P R V +  F +    V+  +F  FV ATG+ T+A
Sbjct: 66  MIGLSATTFRMGSEDELAYPADGEGPVREVAVGPFAIGATSVTVAEFAAFVVATGHRTDA 125

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E +GD+ VF   L++  RA    V                                    
Sbjct: 126 ETYGDSMVFAGRLTDALRASTPAV------------------------------------ 149

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                  EA W    RG          +W  P+G DST E R +HPV HVS  DA AY  
Sbjct: 150 ------AEAPWWRQVRGA---------TWFAPDGPDSTAEGREDHPVTHVSHRDAQAYAQ 194

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W GARLPTEAEWEY  RGGL+ + +PWG    P G  R N++ G+FP +     G   T 
Sbjct: 195 WAGARLPTEAEWEYAARGGLDQQPYPWGAVRDPDGVRRMNIFPGDFPED---PHGEAGTM 251

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV S++ N  GLYN  GNVWEWTA  +    +P               V +GGSY+C+E 
Sbjct: 252 PVRSFEPNGLGLYNSTGNVWEWTAGTFGATGNP---------------VARGGSYMCHEN 296

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           YC R+R   R+Q T D++ G+ GFR AAD G
Sbjct: 297 YCRRYRTTGRTQATADTTLGHTGFRLAADVG 327



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 23/44 (52%), Positives = 32/44 (72%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           + V +GGSY+C+E YC R+R   R+Q T D++ G+ GFR AADV
Sbjct: 283 NPVARGGSYMCHENYCRRYRTTGRTQATADTTLGHTGFRLAADV 326


>gi|58476691|gb|AAH89708.1| MGC108306 protein [Xenopus (Silurana) tropicalis]
 gi|213624563|gb|AAI71273.1| MGC108306 protein [Xenopus (Silurana) tropicalis]
          Length = 282

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 129/341 (37%), Positives = 176/341 (51%), Gaps = 64/341 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGT K    KDGE P R V +  F +D++ V+N  F+EFV A  Y TEAE
Sbjct: 1   MVQLDGGQFDMGT-KASDGKDGESPVRQVKVLPFAVDKYPVTNKDFREFVRAKKYKTEAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +SEE + K++Q                      +    W        W 
Sbjct: 60  AFGWSFVFEDFVSEELKKKVTQ---------------------KLESAPW--------W- 89

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E             R F    W  P G  S I+ ++++PVV VSWNDA AYC W
Sbjct: 90  ---LPVE-------------RAF----WRQPAGPASGIKEKLDYPVVQVSWNDAQAYCKW 129

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG FP  +TA DGY   +P
Sbjct: 130 LGKRLPTEEEWEFAARGGLKGKVYPWGNTFKP---NRTNLWQGTFPGMDTATDGYHGASP 186

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N++GLY+M+GN WEWTA  +        S   +        V +G S++   +
Sbjct: 187 VAAFPAQNEYGLYDMLGNTWEWTATEYT-------SARAQKQNKENMYVLRGASWIDTVD 239

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
               ++ R   R  NTPDS++ NL FRCA ++  +   DK+
Sbjct: 240 GSANHKARVTTRMGNTPDSASDNLSFRCAKNRRKSKTEDKM 280


>gi|94313454|ref|YP_586663.1| C-alpha-formyglycine- generating protein [Cupriavidus metallidurans
           CH34]
 gi|93357306|gb|ABF11394.1| Sulfatase-modifying factor (C-alpha-formyglycine- generating
           enzyme) DUF323 [Cupriavidus metallidurans CH34]
          Length = 408

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 126/351 (35%), Positives = 170/351 (48%), Gaps = 58/351 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG++   + +  E P+  V + AF++DQH V+N  F++FV ATGY+T AE
Sbjct: 68  MVHVPGGEFLMGSDHK-MSQANERPAHKVQVKAFWMDQHHVTNADFRKFVEATGYLTTAE 126

Query: 75  KFGDTFVFEPLL-SEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +  D       L +   R   S +      F G +S +  R               Y  W
Sbjct: 127 RKPDWETLRVQLPAGTPRPPDSAMVAGGMVFVGTNSPVPLR--------------EYWRW 172

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P   W HP G  S+IE + NHPVV VS+ DA AY  
Sbjct: 173 ----------W----------RFVPGADWRHPTGPGSSIEGKDNHPVVQVSYEDAQAYAK 212

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG+   P G   ANVWQG+    FP  +  A G 
Sbjct: 213 WAGKRLPTEAEWEFAARGGLEQATYAWGDKFAPDGRQMANVWQGQQVQPFPVVSAKAGGA 272

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
             T+ V ++  N +GLY+M GN W+W ADW+                    PA S++P  
Sbjct: 273 AGTSAVGTFPGNGYGLYDMTGNAWQWVADWYRADQFRREATVAAVLQNPTGPADSWDPTE 332

Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
              P +   +V +GGS+LCNE +C  +R +AR    P +S  +LGFR   D
Sbjct: 333 PGVPVSAPKRVTRGGSFLCNEDFCLSYRPSARRGTDPYTSMSHLGFRLVMD 383


>gi|60552102|gb|AAH91176.1| Sumf2 protein [Rattus norvegicus]
          Length = 271

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 126/333 (37%), Positives = 170/333 (51%), Gaps = 71/333 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGTN P   +DGE P+R VT+  F +D   ++N  F+EFV    Y TEAE
Sbjct: 1   MVHLPGGRFLMGTNAPD-GRDGEGPAREVTVKPFAIDVFPITNKDFREFVREKKYQTEAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S E R +                               N+  A   W 
Sbjct: 60  AFGWSFVFEDFVSPELRKQA------------------------------NEMPAVLWW- 88

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +  +                 W  P G  S I  ++  PVVHVSWNDA AYC W
Sbjct: 89  ---LPVQKAF-----------------WRQPAGPGSGIREKLELPVVHVSWNDAGAYCAW 128

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN       +R N+WQG+FP  + A DG+   +P
Sbjct: 129 RGKRLPTEEEWEFAARGGLKGQVYPWGNQFQ---LNRTNLWQGKFPKGDRAEDGFHGLSP 185

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA           +Y   G      +V +G S++   +
Sbjct: 186 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQAAGQDM---RVLRGASWIDTAD 232

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
               +R R   R  NTPDS++ NLGFRCA+ +G
Sbjct: 233 GSANHRARVTTRMGNTPDSASDNLGFRCASSEG 265


>gi|164607119|ref|NP_001020296.2| sulfatase-modifying factor 2 precursor [Rattus norvegicus]
 gi|149063163|gb|EDM13486.1| sulfatase modifying factor 2 [Rattus norvegicus]
          Length = 307

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 126/333 (37%), Positives = 170/333 (51%), Gaps = 71/333 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGTN P   +DGE P+R VT+  F +D   ++N  F+EFV    Y TEAE
Sbjct: 37  MVHLPGGRFLMGTNAPD-GRDGEGPAREVTVKPFAIDVFPITNKDFREFVREKKYQTEAE 95

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S E R +                               N+  A   W 
Sbjct: 96  AFGWSFVFEDFVSPELRKQA------------------------------NEMPAVLWW- 124

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP +  +                 W  P G  S I  ++  PVVHVSWNDA AYC W
Sbjct: 125 ---LPVQKAF-----------------WRQPAGPGSGIREKLELPVVHVSWNDAGAYCAW 164

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN       +R N+WQG+FP  + A DG+   +P
Sbjct: 165 RGKRLPTEEEWEFAARGGLKGQVYPWGNQFQ---LNRTNLWQGKFPKGDRAEDGFHGLSP 221

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA           +Y   G      +V +G S++   +
Sbjct: 222 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQAAGQDM---RVLRGASWIDTAD 268

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
               +R R   R  NTPDS++ NLGFRCA+ +G
Sbjct: 269 GSANHRARVTTRMGNTPDSASDNLGFRCASSEG 301


>gi|282901004|ref|ZP_06308937.1| protein of unknown function DUF323 [Cylindrospermopsis raciborskii
           CS-505]
 gi|281194095|gb|EFA69059.1| protein of unknown function DUF323 [Cylindrospermopsis raciborskii
           CS-505]
          Length = 333

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/342 (36%), Positives = 180/342 (52%), Gaps = 71/342 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+ G T++MG +      D +F  + VT+ +F +D++EV+N +F +FV  TGY+T A+
Sbjct: 43  MVLINGGTYKMGAD------DSDF--QEVTVGSFCIDKYEVTNAEFAKFVELTGYITVAQ 94

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      F P LS+EER   S V    K+ +                             
Sbjct: 95  RPLSKEQF-PHLSDEERLPGSLVFQAPKQGQ----------------------------- 124

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
             ++P  + W +           P  +W HP G D+T   + N+PVVH+++ DA+AY  W
Sbjct: 125 -KQIPFLSWWYW----------VPDANWQHPLGDDTT--SKENYPVVHIAYQDALAYAKW 171

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTE +WEY  RGGL++  + WGN  +   E +AN WQG FP  NT ADGY+  AP
Sbjct: 172 LGKSLPTEVQWEYAARGGLKSATYTWGNQYS---ETKANTWQGMFPFFNTKADGYVGLAP 228

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP--------------SYNPKGPTTGTD 300
           V S+  N +G+Y+M GNVWEWT+ W+ V   P P              S++P+ PT    
Sbjct: 229 VGSFPPNGYGIYDMAGNVWEWTSSWFKV---PQPHQNGVDYLSVKERTSFDPEKPTEMAL 285

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            V KGGSYLC   YC R+R AAR   +PD+ A ++GFR   +
Sbjct: 286 YVIKGGSYLCAPNYCSRYRPAARESQSPDTGANHIGFRLVKN 327


>gi|400536049|ref|ZP_10799585.1| hypothetical protein MCOL_V216714 [Mycobacterium colombiense CECT
           3035]
 gi|400331092|gb|EJO88589.1| hypothetical protein MCOL_V216714 [Mycobacterium colombiense CECT
           3035]
          Length = 296

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 127/335 (37%), Positives = 170/335 (50%), Gaps = 50/335 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++V LPG +FRMG+         E P   VT+    +++H V+N QF EFV ATGYVT 
Sbjct: 3   SELVDLPGGSFRMGSTS---FYPEEAPIHTVTVGPIAVERHPVTNAQFAEFVDATGYVTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++ P  + ++    + V      F      ++ R                  
Sbjct: 60  AEQPIDPALY-PGANADDLCPGAMV------FRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR        +W          R  P  SW HP G DS +  R +HPVV V++ DA AY 
Sbjct: 96  WR--------QWW---------RWAPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     +PWG+  T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYPWGDEATSGGRLMANTWQGSFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV ++  N FGL +M+GNVWEWTA  +  HH     P     P GP     ++  KGGS
Sbjct: 198 SPVGTFAPNAFGLLDMIGNVWEWTATEFTAHHRLEAPPKSCCTPSGPADPSVNQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC  +YC+R+R  ARS  + DS+  ++GFRC AD
Sbjct: 258 HLCAPEYCHRYRPPARSPQSQDSATTHIGFRCVAD 292


>gi|87301449|ref|ZP_01084289.1| hypothetical protein WH5701_02204 [Synechococcus sp. WH 5701]
 gi|87283666|gb|EAQ75620.1| hypothetical protein WH5701_02204 [Synechococcus sp. WH 5701]
          Length = 332

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 129/346 (37%), Positives = 180/346 (52%), Gaps = 60/346 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TFRMG++        E  +  VT++ F +D+HEV+N +F  FV+ TGYVT AE
Sbjct: 35  MEWIPGGTFRMGSDDHY---PDELSADAVTVEGFCIDRHEVTNAEFDAFVTDTGYVTVAE 91

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      + P L+E++RA  + V                 M      ++    V+Y +W 
Sbjct: 92  RPIPKDQY-PDLAEDQRAPGALV---------------FAMPPAGTKVT---EVSYLSW- 131

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W +           P  +W HP+G +S +E   NHPVV V++ DA AY  W
Sbjct: 132 ---------WHWT----------PGANWRHPQGPESDLEGLANHPVVQVAFRDAEAYAAW 172

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEA+WE+  RGGL++++F WG   + +   +AN WQGEFP +N   DGY  TAP
Sbjct: 173 AGKELPTEAQWEFAARGGLKDQIFSWGKTYSAK---KANTWQGEFPIHNQEKDGYFGTAP 229

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAP-------SYNPKGPTTGTDKV 302
           V S+  N +GL +M GNVWEWT DW+   H     H  P       S++P+ P      V
Sbjct: 230 VGSFPPNGYGLQDMTGNVWEWTQDWYRPGHQGMADHVNPAVTEQELSFDPREPGIAK-HV 288

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
            KGGS+LC + YC R+R AAR    PD+   ++GFR      PT G
Sbjct: 289 IKGGSFLCAKNYCSRYRPAAREALEPDTGTSHIGFRLVTS--PTEG 332


>gi|384539530|ref|YP_005723614.1| hypothetical protein SM11_pD1281 [Sinorhizobium meliloti SM11]
 gi|336038183|gb|AEH84113.1| hypothetical protein SM11_pD1281 [Sinorhizobium meliloti SM11]
          Length = 318

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 170/348 (48%), Gaps = 61/348 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +E   +MV +PG TF MG+N        E P   VT+D F++D   V+N QF EFV+ATG
Sbjct: 10  LEPAAEMVWVPGATFMMGSND---HYPEEAPVHPVTVDGFWIDVTPVTNRQFLEFVNATG 66

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           +VT AE+       +P   +   A  S +R     F      ++                
Sbjct: 67  HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 103

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                 G  +   ++W     G          +W HP G  S+I   ++HPVVHV+++DA
Sbjct: 104 ------GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHVAYSDA 145

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTE EWE   RGGL+   F WG  L P G H AN WQG FP  N+  DG
Sbjct: 146 KAYAEWAGKDLPTETEWELAARGGLDGAEFSWGGELAPGGNHMANTWQGSFPVENSMDDG 205

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
           +  T+PV  Y  N +GLY+M+GNVWEWT D+W+V H  A                  S +
Sbjct: 206 FARTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 265

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           P        +V KGGS+LC   YC R+R AAR     D++  ++GFRC
Sbjct: 266 PAASVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRC 313


>gi|433611510|ref|YP_007194971.1| hypothetical protein C770_GR4pD1269 [Sinorhizobium meliloti GR4]
 gi|429556452|gb|AGA11372.1| hypothetical protein C770_GR4pD1269 [Sinorhizobium meliloti GR4]
          Length = 319

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 127/349 (36%), Positives = 170/349 (48%), Gaps = 61/349 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +E   +MV +PG TF MG+N        E P   VT+D F++D   V+N QF EFV+ATG
Sbjct: 11  LEPAAEMVWVPGTTFMMGSND---HYPEEAPVHPVTVDGFWIDVTPVTNRQFLEFVNATG 67

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           +VT AE+       +P   +   A  S +R     F      ++                
Sbjct: 68  HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 104

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                 G  +   ++W     G          +W HP G  S+I   ++HPVVHV+++DA
Sbjct: 105 ------GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHVAYSDA 146

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTE EWE   RGGL+   F WG  L P G H AN WQG FP  N+  DG
Sbjct: 147 KAYAEWAGKDLPTETEWELAARGGLDGAEFSWGGELAPGGNHMANTWQGSFPVENSMDDG 206

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
           +  T+PV  Y  N +GLY+M+GNVWEWT D+W+V H  A                  S +
Sbjct: 207 FARTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           P        +V KGGS+LC   YC R+R AAR     D++  ++GFRC 
Sbjct: 267 PAASVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRCV 315


>gi|409436150|ref|ZP_11263342.1| Sulfatase-modifying factor 1 [Rhizobium mesoamericanum STM3625]
 gi|408752060|emb|CCM74491.1| Sulfatase-modifying factor 1 [Rhizobium mesoamericanum STM3625]
          Length = 339

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 130/344 (37%), Positives = 171/344 (49%), Gaps = 65/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +PG TF MG+N        E P+  V +D F++D+  V+N +F EFV ATGYVT AE
Sbjct: 18  MIWIPGRTFTMGSNDHY---PEEAPAHPVKVDGFWIDEIPVTNRKFTEFVKATGYVTFAE 74

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D   +     E  RA  S +    K   G D T             W        W 
Sbjct: 75  TRPDPKDYPGAKPEMLRAG-SLLFFQPKTVSGADIT------------QW--------W- 112

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    +++G             +W  P G  S +  +++HPVVHV+W DA AY  W
Sbjct: 113 --------TFKFGA------------NWRRPLGGTSDLRGKLDHPVVHVAWCDAKAYADW 152

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE   RGGLE+  F WG+ L P G   AN+WQG FP  +T   G+  T+P
Sbjct: 153 AGLELPTEAEWELAARGGLEDAEFAWGDELMPAGTLMANIWQGTFPVTSTKPKGHERTSP 212

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
           V S+  N +GLY+M+GN+WEWT D+W+   HP P                  SY+P+ P 
Sbjct: 213 VRSFPPNGYGLYDMIGNIWEWTTDYWS-DRHPEPSAKACCIPNNPRGGGMEASYDPRQPN 271

Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   YC R+R AAR     D+S  +LGFRC
Sbjct: 272 IRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHLGFRC 315


>gi|146299962|ref|YP_001194553.1| hypothetical protein Fjoh_2206 [Flavobacterium johnsoniae UW101]
 gi|146154380|gb|ABQ05234.1| protein of unknown function DUF323 [Flavobacterium johnsoniae
           UW101]
          Length = 384

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 174/350 (49%), Gaps = 66/350 (18%)

Query: 15  MVLLPGDTFRMGTNKPI--------LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           MV +PG  F MG+N           + KD   P   V +D +Y+D+ EV+N +F++FV A
Sbjct: 74  MVWIPGGEFSMGSNVEDESLCSIKGVTKDAA-PIHRVYVDGYYMDKTEVTNEEFEKFVKA 132

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TGYVT AE+      F P  +EE+    S V         L++                 
Sbjct: 133 TGYVTVAEQKPTKEEF-PTANEEDLITGSVVFTPTPSAVNLNN----------------- 174

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
              +  W          W Y           P   W HPEG  S I+ +  +PVVHV + 
Sbjct: 175 ---FLQW----------WRYE----------PGADWKHPEGPQSNIKGKEKYPVVHVVYE 211

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN--NT 244
           DA AY  W G RLPTEAEWE+  RGG    L+ WGN+L P+G+ +AN++QG FP    +T
Sbjct: 212 DAAAYAKWAGKRLPTEAEWEFAARGGKTGNLYAWGNDLKPKGKFQANIYQGHFPIKDGDT 271

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKGPT-- 296
             DG+   AP   +  N +GLY+M GNVWEW  DW++V ++ + +       NP+GP   
Sbjct: 272 GEDGFKGIAPTAQFAPNAYGLYDMAGNVWEWVNDWYSVDYYKSLAESGKTVKNPQGPNAY 331

Query: 297 ------TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                 T   +V +GGS+LC +QYC R+    R +    S+A ++GFRC 
Sbjct: 332 YDPNDPTEIKRVHRGGSFLCTDQYCTRYMVGTRGKGEIRSAANHVGFRCV 381


>gi|73538395|ref|YP_298762.1| hypothetical protein Reut_B4568 [Ralstonia eutropha JMP134]
 gi|72121732|gb|AAZ63918.1| conserved hypothetical protein [Ralstonia eutropha JMP134]
          Length = 390

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 123/351 (35%), Positives = 168/351 (47%), Gaps = 58/351 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG++   + +  E P+  V +  F++D+H V+N QF++FV ATGYVT AE
Sbjct: 55  MAHIPGGEFLMGSDS-NMAQPNEKPAHKVRVHGFWMDRHHVTNAQFRDFVKATGYVTTAE 113

Query: 75  KFGDTFVFEPLL-SEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +  +       L +   R   S +      F G    +  +               Y  W
Sbjct: 114 RKPEWETLRVQLPAGTPRPPDSALVPGGMVFVGTSRPVPLK--------------DYSQW 159

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y           P   W HP G  ++I+ + +HPVV VS+ DA AY  
Sbjct: 160 ----------WRY----------VPGADWRHPGGPGTSIDGKDDHPVVQVSYEDAQAYAR 199

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG    P G   ANVWQG+    FP  +  A G 
Sbjct: 200 WVGKRLPTEAEWEFAARGGLEQATYAWGGTFAPEGRQMANVWQGQQAQPFPVASPKAGGA 259

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
           L T+PV ++ EN +GLY+M GN W+W ADW+                    P  S++P  
Sbjct: 260 LGTSPVGTFPENGYGLYDMTGNAWQWVADWYRADQFRQEAAAGRRVDDPKGPVSSWDPDD 319

Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
              PT    +V +GGS+LCNE YC  +R +AR    P +S  +LGFR   D
Sbjct: 320 AGVPTDAPKRVTRGGSFLCNEDYCLSYRPSARRGTDPYNSMSHLGFRLVMD 370


>gi|384533781|ref|YP_005716445.1| sulfatase-modifying factor protein [Sinorhizobium meliloti BL225C]
 gi|333815957|gb|AEG08624.1| Sulphatase-modifying factor protein [Sinorhizobium meliloti BL225C]
          Length = 319

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 127/349 (36%), Positives = 170/349 (48%), Gaps = 61/349 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +E   +MV +PG TF MG+N        E P   VT+D F++D   V+N QF EFV+ATG
Sbjct: 11  LEPAAEMVWVPGTTFMMGSND---HYPEEAPVHPVTVDGFWIDVTPVTNRQFLEFVNATG 67

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           +VT AE+       +P   +   A  S +R     F      ++                
Sbjct: 68  HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 104

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                 G  +   ++W     G          +W HP G  S+I   ++HPVVHV+++DA
Sbjct: 105 ------GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHVAYSDA 146

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTE EWE   RGGL+   F WG  L P G H AN WQG FP  N+  DG
Sbjct: 147 KAYAEWAGKDLPTETEWELSARGGLDGAEFSWGGELAPGGNHMANTWQGSFPVENSMDDG 206

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
           +  T+PV  Y  N +GLY+M+GNVWEWT D+W+V H  A                  S +
Sbjct: 207 FARTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           P        +V KGGS+LC   YC R+R AAR     D++  ++GFRC 
Sbjct: 267 PAASVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRCV 315


>gi|220914917|ref|YP_002490225.1| hypothetical protein [Methylobacterium nodulans ORS 2060]
 gi|219952668|gb|ACL63058.1| protein of unknown function DUF323 [Methylobacterium nodulans ORS
           2060]
          Length = 324

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 129/359 (35%), Positives = 178/359 (49%), Gaps = 62/359 (17%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           PA   +    MV + G TFRMG+++       E P   V++D F++D+  V+N QF+ FV
Sbjct: 10  PADSPDLLAGMVFVAGGTFRMGSDR---HYPEEAPVHRVSVDGFWIDRTPVTNAQFRAFV 66

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
            ATG+VT AE+       +P   +   A    ++     F+     ++ R         W
Sbjct: 67  RATGHVTMAER-------KPKAEDYPGALPHMLQAGSLVFKPPKGAVDLR--------DW 111

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
           +       WR         + +G             SW  P G  S+I    +HPVVHV+
Sbjct: 112 S-----AWWR---------FRFGA------------SWRRPLGPGSSIAGLDDHPVVHVA 145

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           + DA AY  W G  LPTEAEWEY  RGGL+   F WG+  TP G H AN WQG FP  N 
Sbjct: 146 YADAEAYARWAGKELPTEAEWEYAARGGLDGAEFAWGDEFTPGGRHMANTWQGAFPHQNL 205

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTDK 301
           A DG+  T+PV ++  N +GL++M+GNVWEWT D++   H    P     P+ P  G +K
Sbjct: 206 AEDGFERTSPVTTFPPNGYGLHDMIGNVWEWTTDFYAPKHPTDAPKACCIPQNPRGGQEK 265

Query: 302 ---------------VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
                          V KGGS+LC   YC R+R AAR  +  D+S  ++GFRC   +GP
Sbjct: 266 GSYDPAQPAIRIPRRVVKGGSHLCAPNYCRRYRPAARHPHPVDTSTSHIGFRCVVRRGP 324


>gi|89269079|emb|CAJ81908.1| similar to human sulfatase modifying factor 2 precursor (sumf2)
           [Xenopus (Silurana) tropicalis]
          Length = 307

 Score =  205 bits (521), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 129/346 (37%), Positives = 177/346 (51%), Gaps = 64/346 (18%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E   +MV L G  F MGT K    KDGE P R V +  F +D++ V+N  F+EFV A  Y
Sbjct: 21  EDADNMVQLDGGQFDMGT-KASDGKDGESPVRQVKVLPFAVDKYPVTNKDFREFVRAKKY 79

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            TEAE FG +FVFE  +SEE + K++Q                      +    W     
Sbjct: 80  KTEAEAFGWSFVFEDFVSEELKKKVTQ---------------------KLESAPW----- 113

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W    LP E             R F    W  P G  S I+ ++++PVV VSWNDA 
Sbjct: 114 ---W----LPVE-------------RAF----WRQPAGPASGIKEKLDYPVVQVSWNDAQ 149

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AYC W G RLP E EWE+  RGGL+ +++PWGN   P   +R N+WQG FP  +TA DGY
Sbjct: 150 AYCKWLGKRLPIEEEWEFAARGGLKGKVYPWGNTFKP---NRTNLWQGTFPGMDTATDGY 206

Query: 250 LSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
              +PV ++  +N++GLY+M+GN WEWTA  +        S   +        V +G S+
Sbjct: 207 HGASPVAAFPAQNEYGLYDMLGNTWEWTATEYT-------SARAQKQNKENMYVLRGASW 259

Query: 309 L--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
           +   +    ++ R   R  NTPDS++ NL FRCA ++  +   DK+
Sbjct: 260 IDTVDGSANHKARVTTRMGNTPDSASDNLSFRCAKNRRKSKTEDKM 305


>gi|383771887|ref|YP_005450952.1| hypothetical protein S23_36410 [Bradyrhizobium sp. S23321]
 gi|381360010|dbj|BAL76840.1| hypothetical protein S23_36410 [Bradyrhizobium sp. S23321]
          Length = 307

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 171/348 (49%), Gaps = 62/348 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G TF MG++        E PS    +D F++D+  V+N QF+ FV+ATG+VT A 
Sbjct: 1   MVWISGGTFGMGSDDHY---PEEAPSHRAAVDGFWIDRTPVTNRQFKAFVNATGHVTTA- 56

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                                Q+  D K + G  +  E      +V    +     C W 
Sbjct: 57  ---------------------QIVPDPKEYPG--ARPEMIYAGSLVFSPPSRVADLCDW- 92

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                  ++W    +G          +W  P G  S I+   +HPVVHVS++DA+AY TW
Sbjct: 93  -------SQWWTFMKGA---------TWRRPYGPKSNIKDLDDHPVVHVSYSDAMAYATW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGG +   F WGN L P G H AN WQG FP  N   DG+  T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGFDGEDFAWGNTLNPGGRHMANTWQGIFPLENRCEDGFERTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---APSYNPKGPTTGTD----------- 300
           V ++  N +G+++M+GNVWEWTADWW+  H      P   P+ P  G +           
Sbjct: 197 VKAFPPNGYGVHDMIGNVWEWTADWWSARHEADAQKPCCIPQNPRGGPESASFDSCQPHI 256

Query: 301 ----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
               KV KGGS+LC   YC R+R AAR     D+S  ++GFRC    G
Sbjct: 257 RIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRCVVRNG 304


>gi|296168570|ref|ZP_06850374.1| sulfatase-modifying factor 1 [Mycobacterium parascrofulaceum ATCC
           BAA-614]
 gi|295896633|gb|EFG76272.1| sulfatase-modifying factor 1 [Mycobacterium parascrofulaceum ATCC
           BAA-614]
          Length = 305

 Score =  204 bits (520), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 128/335 (38%), Positives = 169/335 (50%), Gaps = 50/335 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +PG +FRMG          E P   VTL AF +++H V+N QF EFVSATGYVT A
Sbjct: 5   ELVDIPGGSFRMGCTS---FYPEEAPIHTVTLPAFAVERHPVTNAQFAEFVSATGYVTIA 61

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+        P+              D   + G D      +   +V       V    W
Sbjct: 62  ER--------PI--------------DPALYPGADPA--DLVPGAMVFRPTTGPVDLGDW 97

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R        +W          R  P   W HP G  S +  R +HPVV V++ DA AY  
Sbjct: 98  R--------QWW---------RWVPGACWRHPFGPHSDLTDRADHPVVQVAYPDAAAYAR 140

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG    ++ WG+  +P G   AN WQG FP  N  A G+  T+
Sbjct: 141 WAGRRLPTEAEWEYAARGG-SAAIYAWGDEPSPGGRLMANTWQGRFPYRNDGALGWAGTS 199

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGSY 308
           PV ++  N FGL +M+GNVWEWTA  ++ HH     P     P GP      +  KGGS+
Sbjct: 200 PVGTFPPNGFGLLDMIGNVWEWTATEFSAHHRLDQPPKACCTPAGPADPSVSQTLKGGSH 259

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           LC  +YC+R+R AARS  + D++  ++GFRC A +
Sbjct: 260 LCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAQQ 294


>gi|194289632|ref|YP_002005539.1| sulfatase-modifying factor (c-alpha-formyglycine-generating enzyme)
           [Cupriavidus taiwanensis LMG 19424]
 gi|193223467|emb|CAQ69472.1| putative Sulfatase-modifying factor
           (C-alpha-formyglycine-generating enzyme) [Cupriavidus
           taiwanensis LMG 19424]
          Length = 410

 Score =  204 bits (520), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 127/352 (36%), Positives = 166/352 (47%), Gaps = 58/352 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG++   + +  E P+  V + AF++DQH V+N  F++FV ATGYVT AE
Sbjct: 71  MVRVPGGEFLMGSDS-KMAQPNEKPAHKVRVHAFWMDQHHVTNGDFRKFVEATGYVTTAE 129

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +  D       L     R   S +      F G    +  +               Y  W
Sbjct: 130 RAPDWETLRVQLPPGTPRPPDSAMVAGGMVFVGTSRPVPLQ--------------DYSRW 175

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y           P   W HP G  S+IE + NHPVV VS+ DA AY  
Sbjct: 176 ----------WRY----------VPGADWRHPNGPGSSIEGKDNHPVVQVSYEDAQAYAR 215

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG    P G   ANVWQG+    FP  +  A G 
Sbjct: 216 WVGKRLPTEAEWEFAARGGLEQATYAWGEQFAPDGRQMANVWQGQQNQPFPVVSPKAGGA 275

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH------HPAPSYNPKGPTTGTD--- 300
           L T+PV ++  N +GL +M GN W+W ADW+             P  NP GP    D   
Sbjct: 276 LGTSPVGTFPPNGYGLADMTGNAWQWVADWYRADQFRREAGKAQPIENPVGPLASWDPSE 335

Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
                    +V +GGS+LCNE +C  +R +AR    P +S  +LGFR   D+
Sbjct: 336 PGVPVNAPKRVTRGGSFLCNEDFCLSYRPSARRGTDPFNSMSHLGFRLVMDE 387


>gi|333991963|ref|YP_004524577.1| hypothetical protein JDM601_3323 [Mycobacterium sp. JDM601]
 gi|333487931|gb|AEF37323.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
          Length = 285

 Score =  204 bits (520), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 128/334 (38%), Positives = 168/334 (50%), Gaps = 57/334 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +PG  FRMG+         E P   V + AF +++H V+N QF EFV+ATGYVT A
Sbjct: 4   ELVGIPGGVFRMGSTS---FYPEEAPIHTVAVAAFAIERHPVTNAQFAEFVAATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D     P+ + E  A +         F      ++ R                  W
Sbjct: 61  EQEPD-----PVYAAEPGALV---------FTPTAGPVDLR-----------------DW 89

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P  SW HP G  STIEH+ +HPVV V++ DA  Y  
Sbjct: 90  R-------QWWQW----------VPGASWRHPLGPHSTIEHKPDHPVVQVAYPDAAGYAR 132

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLP EAEWEY  R G  +  +PWG    P GE  AN WQG FP  N  A G+  T+
Sbjct: 133 WAGRRLPAEAEWEYAARAGTSS-TYPWGEEEKPGGELMANTWQGAFPYRNDGALGWSGTS 191

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGSY 308
            V S+  N +GL +M+GNVWEWT   +  HH     P     P GP   G  +  KGGS+
Sbjct: 192 SVGSFPPNDWGLLDMIGNVWEWTTTEFCGHHRLDAAPKSCCAPDGPADPGVSQTLKGGSH 251

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           LC   YC+R+R AARS  + D++  ++GFRCAAD
Sbjct: 252 LCAPAYCHRYRAAARSPQSQDTATSHIGFRCAAD 285


>gi|452951552|gb|EME56999.1| hypothetical protein G352_21926 [Rhodococcus ruber BKS 20-38]
          Length = 320

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 132/344 (38%), Positives = 170/344 (49%), Gaps = 62/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  LPG TF MG+N        E P   VT+D F++D H V+   F++FV ATG+VT AE
Sbjct: 17  MQHLPGGTFLMGSNH---FYPEEQPVHRVTVDGFWMDIHPVTVADFRKFVDATGHVTTAE 73

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                    P  +E   A    +      F+  D        HPV     +D   + +W 
Sbjct: 74  -------IAPDPAEFPEAPADLLVPGSLVFDPPD--------HPV---PLDDFTRWWSW- 114

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                  P  +W HPEG  ST + R +HPV HVS +DA+AY  W
Sbjct: 115 ----------------------IPGANWRHPEGPGSTSDDRDDHPVTHVSHDDALAYAKW 152

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGG E R F WG+   P G  RANVWQGEFP  N   DGY  T+P
Sbjct: 153 AGKDLPTEAEWEFAARGGQEGRHFVWGDEDPPSGGPRANVWQGEFPWQNLEEDGYYRTSP 212

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTD------ 300
           V ++  N +GL +M GNVWEWT D++   H     + AP+     P+ P           
Sbjct: 213 VHAFPSNPYGLRDMAGNVWEWTVDYFTGDHASGGKNVAPASACCIPRNPRITVAEQPVDA 272

Query: 301 ----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
               +V KGGS+LC   YC R+R AAR   + D+S G++GFRC 
Sbjct: 273 PRPRRVIKGGSHLCAPNYCLRYRPAARQGQSEDTSTGHIGFRCV 316


>gi|217979245|ref|YP_002363392.1| hypothetical protein Msil_3121 [Methylocella silvestris BL2]
 gi|217504621|gb|ACK52030.1| protein of unknown function DUF323 [Methylocella silvestris BL2]
          Length = 334

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 125/343 (36%), Positives = 173/343 (50%), Gaps = 62/343 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ L G TF MG+++       E P+  V +D F++D+  V+N QF+ FV ATG+ T AE
Sbjct: 21  MIYLDGGTFLMGSDRHY---PEEAPAHRVAVDGFFMDETPVTNAQFRAFVEATGHRTFAE 77

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      ++   +     + H +K      S +     +PV    W+    + T+R
Sbjct: 78  -----------IAPNSKDYPDALPHMLK----PGSLVFSPPDYPVDLRDWSQ---WWTFR 119

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                      +G             SW  P G  S+I+   +HPVVHV++ DA AY  W
Sbjct: 120 -----------FGA------------SWRKPYGPGSSIKGLDDHPVVHVAFRDAEAYARW 156

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LP+EAEWE+  RGGLE   F WG+  TP G H AN WQG+FP  N A DG+  T+P
Sbjct: 157 AGKELPSEAEWEFAARGGLERAEFAWGDEFTPGGRHMANTWQGQFPLQNLAEDGFERTSP 216

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGT------------ 299
           V ++  N +GLY+++GNVWEWT DWW+  H    P     P+ P  G+            
Sbjct: 217 VRTFPPNNYGLYDLIGNVWEWTTDWWSQRHAGDAPKACCVPRNPRGGSMERSYASLQAGA 276

Query: 300 ---DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               KV KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 277 SIPQKVLKGGSHLCAPNYCRRYRPAARHAEAIDTSTSHVGFRC 319


>gi|352096888|ref|ZP_08957602.1| Sulphatase-modifying factor protein [Synechococcus sp. WH 8016]
 gi|351675472|gb|EHA58633.1| Sulphatase-modifying factor protein [Synechococcus sp. WH 8016]
          Length = 282

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 124/335 (37%), Positives = 173/335 (51%), Gaps = 55/335 (16%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           + +  MV++P   F +G++        E P R+V + +F +D   V+N +F  FVS TGY
Sbjct: 3   DTFDRMVVIPSGDFILGSDS---FYPEESPVRSVQIKSFRIDACPVTNAEFARFVSETGY 59

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT +EK  D  ++ P L  +++   S V      F     T++         +SW   + 
Sbjct: 60  VTVSEKPPDPVLY-PNLPPDQQNPESAV------FIPPPPTVDRNQP-----MSWWALIE 107

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              WR                             HP+G DSTI  RM+HPVVH+++ D++
Sbjct: 108 GADWR-----------------------------HPQGPDSTIADRMDHPVVHLAYEDSL 138

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AY  W G RLPT  EWE   RGGL  + + WG  +TP G+  ANVWQG FP  N  ADG+
Sbjct: 139 AYAQWMGKRLPTADEWEVAARGGLVQQHYAWGEEMTPGGQWLANVWQGPFPWKNEQADGW 198

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
             T+PV S+  N +GL +M GNVWEWT+  + V         PKG      ++ KGGS+L
Sbjct: 199 FWTSPVGSFPANGYGLVDMCGNVWEWTSTLFPV---------PKG--EQERRIIKGGSFL 247

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           C E YC+R R AA    T D++  ++GFRCAAD G
Sbjct: 248 CAENYCHRFRPAALMGQTIDTATCHMGFRCAADSG 282



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 30/42 (71%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           ++ KGGS+LC E YC+R R AA    T D++  ++GFRCAAD
Sbjct: 239 RIIKGGSFLCAENYCHRFRPAALMGQTIDTATCHMGFRCAAD 280


>gi|116693744|ref|YP_839277.1| hypothetical protein Bcen2424_5651 [Burkholderia cenocepacia
           HI2424]
 gi|116651744|gb|ABK12384.1| protein of unknown function DUF323 [Burkholderia cenocepacia
           HI2424]
          Length = 421

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 125/353 (35%), Positives = 173/353 (49%), Gaps = 58/353 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DM  +PG TF MG++     +  E P+   T+  F+LD+H+V+N +F+ FV ATGYVT A
Sbjct: 86  DMAWIPGGTFLMGSDA-KAAQPNERPAHRATVRGFWLDRHDVTNAEFRRFVDATGYVTTA 144

Query: 74  EKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           E+  D       L     R   S +      F G    +             +D   Y  
Sbjct: 145 ERKPDWSALRAQLPPGTPRPPDSALVPGALVFVGTAGPVPL-----------DD---YAR 190

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W          R  P  +W HP+G  STI  + ++PVV VS++DA AY 
Sbjct: 191 W----------W----------RFLPGANWRHPDGPGSTIAGKDDYPVVQVSYDDARAYA 230

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADG 248
            W G RLPTEAEWE+  RGGLE   + WGN L P G   AN+W+G+    FP  +  +  
Sbjct: 231 QWAGKRLPTEAEWEFAARGGLEQADYAWGNQLKPAGRQMANIWEGQQPQPFPVVDARSGK 290

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAPSYNPK 293
             + +PV +Y  N +GLY+M GNVW+W ADW+                    PA S++P 
Sbjct: 291 PAAPSPVGTYPPNGYGLYDMAGNVWQWVADWYRADAFLLAVESGRPVADPQGPADSWDPT 350

Query: 294 GPTT---GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
            P        +V +GGS+LCNE YC  +R +AR    P++   +LGFR A D+
Sbjct: 351 EPGVPPHAPKRVIRGGSFLCNESYCQSYRPSARRGVDPEAGMSHLGFRLAMDE 403


>gi|407982456|ref|ZP_11163132.1| formylglycine-generating sulfatase enzyme family protein
           [Mycobacterium hassiacum DSM 44199]
 gi|407375968|gb|EKF24908.1| formylglycine-generating sulfatase enzyme family protein
           [Mycobacterium hassiacum DSM 44199]
          Length = 299

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 128/339 (37%), Positives = 172/339 (50%), Gaps = 55/339 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V L G T+RMG+ +       E P   VT+  F +++H V+N QF EFV+ATGYVT A
Sbjct: 4   ELVELRGATYRMGSTR---FYPEEAPEHTVTVAPFAIERHPVTNAQFAEFVAATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   + P  + ++    + V      F   D  ++ R                  W
Sbjct: 61  ERALDPADY-PGANPDDLVPGALV------FTPTDGPVDLR-----------------DW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST-IEHRMNHPVVHVSWNDAVAYC 192
           R        +W          R  P   W  P G D    + R +HPVV V++ DA+AY 
Sbjct: 97  R--------QWW---------RYVPGAQWRRPFGPDGPGWQDRPDHPVVQVAYVDALAYA 139

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  RGG +   +PWG  L+P G   AN WQG FP  N  A G++ T
Sbjct: 140 RWAGRRLPTEAEWEYAARGGADT-TYPWGEELSPGGRLMANTWQGAFPYRNDGAAGWVGT 198

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------PAPSYNPKGPTTGTDKVK 303
           +PV ++  N +GL +M+GNVWEWT   +  HH          P P   P  P     +V 
Sbjct: 199 SPVGTFPPNGYGLVDMIGNVWEWTTTRFEGHHRIGPAQGAGGPKPCCTPSRPDPSVSQVL 258

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           KGGS+LC  +YC R+R  ARS  T DS+  ++GFRCAAD
Sbjct: 259 KGGSHLCAPEYCRRYRPTARSPQTQDSATTHIGFRCAAD 297


>gi|354494816|ref|XP_003509531.1| PREDICTED: sulfatase-modifying factor 2-like [Cricetulus griseus]
          Length = 302

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 128/354 (36%), Positives = 176/354 (49%), Gaps = 78/354 (22%)

Query: 1   MVLLPAPPVE----RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVS 56
           M LL +P ++    +   MV LPG  F MGT+ P   +DGE P+R VT+ +F +D   V+
Sbjct: 11  MCLLSSPRLQLGNAQAPTMVQLPGGRFLMGTDAPD-GRDGEGPAREVTVKSFTIDVFPVT 69

Query: 57  NTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
           N  F+EFV    Y TEAE FG +FVFE  +S E R + +Q+                   
Sbjct: 70  NKDFREFVREKKYQTEAETFGWSFVFEDFVSPELRNQANQM------------------- 110

Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
                + W   V    WR                              P G  S I  ++
Sbjct: 111 ---PSVLWWLPVQKAFWR-----------------------------QPAGPGSGIREKL 138

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
             PVVHVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQ
Sbjct: 139 ELPVVHVSWNDASAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQ 195

Query: 237 ---GEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
              G+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA  +            
Sbjct: 196 GRPGKFPKGDRAEDGFHGLSPVNAFPPQNNYGLYDLMGNVWEWTASIYQSASQDM----- 250

Query: 293 KGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
                   +V +G S++   +    +R R   R  NTPDS++ NLGFRCA+  G
Sbjct: 251 --------RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCASSTG 296


>gi|47221775|emb|CAG08829.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 284

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 123/332 (37%), Positives = 172/332 (51%), Gaps = 71/332 (21%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV +PG    MGT+     KDGEFP R   + +F +D++ V+N +F++FV +  Y TEA
Sbjct: 21  EMVDIPGGRMLMGTSA-ADGKDGEFPPREAKVQSFKIDKYPVTNAEFRDFVRSQKYKTEA 79

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVF+  +S+E ++KI+Q                               +    W
Sbjct: 80  ETFGWSFVFQDFVSDELKSKITQ------------------------------RIESAPW 109

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  R++ PVVHVSWNDA A+C 
Sbjct: 110 W---LPVE-------------RAF----WRQPAGAGSGIRERLDFPVVHVSWNDAQAFCQ 149

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W+G RLP E EWE+  RGGL+ R +PWGN       +R+N+WQG FP  +TA DG+   +
Sbjct: 150 WKGKRLPGEEEWEWAARGGLQGRTYPWGNKFQ---ANRSNLWQGPFPEGDTAEDGFHGVS 206

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
           PV ++  +N  GLY+M+GN WEWT+        P P   P         V +G S++   
Sbjct: 207 PVAAFPPQNSHGLYDMMGNTWEWTS-------TPFPGAQPM-------YVLRGASWIDTV 252

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +    ++ R   R  NTPDS++ NLGFRCAA+
Sbjct: 253 DGSSNHKARVTTRMGNTPDSASDNLGFRCAAN 284


>gi|387877758|ref|YP_006308062.1| hypothetical protein W7S_21930 [Mycobacterium sp. MOTT36Y]
 gi|386791216|gb|AFJ37335.1| hypothetical protein W7S_21930 [Mycobacterium sp. MOTT36Y]
          Length = 296

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 126/336 (37%), Positives = 170/336 (50%), Gaps = 50/336 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++V LPG +FRMG+         E P   VT+    +++H V+N QF EFV ATGY+T 
Sbjct: 3   SELVDLPGGSFRMGSTS---FYPEEAPIHTVTIGPLAVERHPVTNAQFAEFVDATGYLTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  +F P    ++    + V      F      ++ R                  
Sbjct: 60  AEQPIDPALF-PGADPDDLCPGAMV------FRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR         W++           P  SW HP G DS +  R +HPVV V++ DA AY 
Sbjct: 96  WR-------QWWQW----------VPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     + WG+ +T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEVTSDGRLMANTWQGSFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV ++  N FGL +M+GNVWEWTA  ++ HH     P     P GP      +  KGGS
Sbjct: 198 SPVGTFAPNAFGLVDMIGNVWEWTATEFSPHHRLHSPPKACCTPPGPADPSVSQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +LC  +YC+R+R AARS  + DS+  ++GFRC AD 
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293


>gi|107027544|ref|YP_625055.1| hypothetical protein Bcen_5208 [Burkholderia cenocepacia AU 1054]
 gi|105896918|gb|ABF80082.1| protein of unknown function DUF323 [Burkholderia cenocepacia AU
           1054]
          Length = 377

 Score =  204 bits (518), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 124/353 (35%), Positives = 171/353 (48%), Gaps = 58/353 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DM  +PG TF MG++     +  E P+   T+  F+LD+H+V+N +F+ FV ATGYVT A
Sbjct: 42  DMAWIPGGTFLMGSDA-KAAQPNERPAHRATVRGFWLDRHDVTNAEFRRFVDATGYVTTA 100

Query: 74  EKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           E+  D       L     R   S +      F G    +                     
Sbjct: 101 ERKPDWSALRAQLPPGTPRPPDSALVPGALVFVGTAGPVP-------------------- 140

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                L   A W          R  P  +W HP+G  STI  + ++PVV VS++DA AY 
Sbjct: 141 -----LDDYARWW---------RFLPGANWRHPDGPGSTIAGKDDYPVVQVSYDDARAYA 186

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADG 248
            W G RLPTEAEWE+  RGGLE   + WGN L P G   AN+W+G+    FP  +  +  
Sbjct: 187 QWAGKRLPTEAEWEFAARGGLEQADYAWGNQLKPAGRQMANIWEGQQPQPFPVVDARSGK 246

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAPSYNPK 293
             + +PV +Y  N +GLY+M GNVW+W ADW+                    PA S++P 
Sbjct: 247 PAAPSPVGTYPPNGYGLYDMAGNVWQWVADWYRADAFLLAVESGRPVADPQGPADSWDPT 306

Query: 294 GPTT---GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
            P        +V +GGS+LCNE YC  +R +AR    P++   +LGFR A D+
Sbjct: 307 EPGVPPHAPKRVIRGGSFLCNESYCQSYRPSARRGVDPEAGMSHLGFRLAMDE 359


>gi|406032646|ref|YP_006731538.1| sulfatase-modifying factor 1 [Mycobacterium indicus pranii MTCC
           9506]
 gi|405131193|gb|AFS16448.1| Sulfatase-modifying factor 1 [Mycobacterium indicus pranii MTCC
           9506]
          Length = 296

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 126/336 (37%), Positives = 170/336 (50%), Gaps = 50/336 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++V LPG +FRMG+         E P   VT+    +++H V+N QF EFV ATGY+T 
Sbjct: 3   SELVDLPGGSFRMGSTS---FYPEEAPIHTVTVGPLAVERHPVTNAQFAEFVDATGYLTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++ P    ++    + V      F      ++ R                  
Sbjct: 60  AEQPIDPALY-PGADPDDLCPGAMV------FRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR        +W          R  P  SW HP G DS +  R +HPVV V++ DA AY 
Sbjct: 96  WR--------QWW---------RWVPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     + WG+ +T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEVTSDGRLMANTWQGSFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV ++  N FGL +M+GNVWEWTA  ++ HH     P     P GP      +  KGGS
Sbjct: 198 SPVGTFAPNAFGLIDMIGNVWEWTATEFSPHHRLDSAPKACCTPPGPADPSVSQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +LC  +YC+R+R AARS  + DS+  ++GFRC AD 
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293


>gi|108797997|ref|YP_638194.1| hypothetical protein Mmcs_1024 [Mycobacterium sp. MCS]
 gi|119867093|ref|YP_937045.1| hypothetical protein Mkms_1041 [Mycobacterium sp. KMS]
 gi|108768416|gb|ABG07138.1| protein of unknown function DUF323 [Mycobacterium sp. MCS]
 gi|119693182|gb|ABL90255.1| protein of unknown function DUF323 [Mycobacterium sp. KMS]
          Length = 290

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 122/332 (36%), Positives = 170/332 (51%), Gaps = 49/332 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG TFRMG+ +       E P   VT+  F +++H V+N QF EFV ATGYVT A
Sbjct: 4   ELVELPGGTFRMGSTQ---FYPEEVPVHTVTVADFAIERHPVTNAQFAEFVDATGYVTIA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++ P +   +    + V      F+     ++ R           D   + TW
Sbjct: 61  EQALDPALY-PGVPPADLVPGALV------FQPTAGPVDLR-----------DWRQWWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P   W HP G +S+   R +HPVV V++ DA AY  
Sbjct: 103 S-----------------------PGACWRHPLGPESSWHDRPDHPVVQVAYPDAAAYAK 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEA+WEY    G     +PWG++  P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAQWEYAAHAGAAT-TYPWGDDPQPGGRLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY----NPKGPTTGTDKVKKGGSYL 309
           PV ++  N FGL +M+GNVWEWT   ++ HH P  +      P  P    ++  KGGS+L
Sbjct: 199 PVGTFPANGFGLVDMIGNVWEWTTTRFSAHHRPDGAVESCCGPADPDPSVNQALKGGSHL 258

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           C  +YC+R+R AARS  + DS+  ++GFRC A
Sbjct: 259 CAPEYCHRYRAAARSPQSQDSATTHIGFRCVA 290


>gi|405380290|ref|ZP_11034130.1| hypothetical protein PMI11_04116 [Rhizobium sp. CF142]
 gi|397323155|gb|EJJ27553.1| hypothetical protein PMI11_04116 [Rhizobium sp. CF142]
          Length = 334

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 131/344 (38%), Positives = 171/344 (49%), Gaps = 65/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +PG TF MG+N        E P+  V +D F++D+  V+N +F EFV ATGYVT AE
Sbjct: 18  MIWIPGRTFTMGSND---HYPEEAPAHPVKVDGFWIDEIPVTNRRFTEFVKATGYVTFAE 74

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K  D   +     E  RA  S +    K   G D T             W        W 
Sbjct: 75  KTPDPKDYPGAKPEMLRAG-SLLFFQPKTVSGPDIT------------QW--------W- 112

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    +++G             +W  P G  S +  +++HPVVHV+W DA AY  W
Sbjct: 113 --------TFKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAWCDAKAYADW 152

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE   RGGL++  F WG+ L P G   AN WQG FP  +T   G+  T+P
Sbjct: 153 AGLELPTEAEWELAARGGLDDTEFAWGDELMPDGVPMANTWQGTFPVKSTKPKGHERTSP 212

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
           V S+  N +GLY+M+GNVWEWT D+W+   HP P                  SY+P+ P 
Sbjct: 213 VRSFPPNGYGLYDMIGNVWEWTTDYWS-DRHPEPSAKACCIPDNPRGGGMEASYDPRQPN 271

Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   YC R+R AAR     D+S  +LGFRC
Sbjct: 272 IRIPRRVLKGGSHLCAPSYCRRYRPAARHAEPEDTSTSHLGFRC 315


>gi|254820468|ref|ZP_05225469.1| hypothetical protein MintA_11096 [Mycobacterium intracellulare ATCC
           13950]
 gi|379756362|ref|YP_005345034.1| hypothetical protein OCO_43500 [Mycobacterium intracellulare
           MOTT-02]
 gi|378806578|gb|AFC50713.1| hypothetical protein OCO_43500 [Mycobacterium intracellulare
           MOTT-02]
          Length = 296

 Score =  203 bits (517), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 126/336 (37%), Positives = 169/336 (50%), Gaps = 50/336 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++V LPG +FRMG+         E P   V +  F +++H V+N QF EFV ATGY+T 
Sbjct: 3   SELVDLPGGSFRMGSTS---FYPEEAPIHTVAVGPFAVERHPVTNAQFAEFVDATGYLTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++ P    ++    + V      F      ++ R                  
Sbjct: 60  AEQPIDPALY-PGADPDDLCPGAMV------FRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR        +W          R  P  SW HP G DS +  R +HPVV V++ DA AY 
Sbjct: 96  WR--------QWW---------RWVPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     + WG+  T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEATSAGRLMANTWQGSFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV ++  N FGL +M+GNVWEWTA  ++ HH     P     P GP      +  KGGS
Sbjct: 198 SPVGTFAPNAFGLIDMIGNVWEWTATEFSPHHRLDSAPKACCTPPGPADPSVSQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +LC  +YC+R+R AARS  + DS+  ++GFRC AD 
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293


>gi|16264068|ref|NP_436860.1| hypothetical protein SM_b20334 [Sinorhizobium meliloti 1021]
 gi|15140193|emb|CAC48720.1| HYPOTHETICAL PROTEIN SM_b20334 [Sinorhizobium meliloti 1021]
          Length = 303

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 126/343 (36%), Positives = 167/343 (48%), Gaps = 61/343 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF MG+N        E P   VT+D F++D   V+N QF EFV+ATG+VT AE
Sbjct: 1   MVWVPGATFMMGSNDHY---PEEAPVHPVTVDGFWIDVTPVTNRQFLEFVNATGHVTFAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +       +P   +   A  S +R     F      ++                      
Sbjct: 58  R-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------------- 88

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G  +   ++W     G          +W HP G  S+I   ++HPVVHV+++DA AY  W
Sbjct: 89  GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHVAYSDAKAYAEW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTE EWE   RGGL+   F WG  L P G H AN WQG FP  N+  DG+  T+P
Sbjct: 137 AGKDLPTETEWELAARGGLDGAEFSWGGELAPGGNHMANTWQGSFPVENSMDDGFARTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYNPKGPTT 297
           V  Y  N +GLY+M+GNVWEWT D+W+V H  A                  S +P     
Sbjct: 197 VRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASIDPAASVK 256

Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
              +V KGGS+LC   YC R+R AAR     D++  ++GFRC 
Sbjct: 257 VPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRCV 299


>gi|404259561|ref|ZP_10962870.1| putative sulfatase-modifying factor [Gordonia namibiensis NBRC
           108229]
 gi|403401908|dbj|GAC01280.1| putative sulfatase-modifying factor [Gordonia namibiensis NBRC
           108229]
          Length = 267

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 121/309 (39%), Positives = 161/309 (52%), Gaps = 59/309 (19%)

Query: 35  DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
           DGE P   V L AF +D   V+N  F  FV ATGYVT+AE                RA I
Sbjct: 12  DGEVPVHGVELRAFSVDVTSVTNAAFTAFVEATGYVTDAE----------------RAGI 55

Query: 95  SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
           S V H                             +Y T  G  +P    W          
Sbjct: 56  SAVFH-----------------------------SYATAPGEPVPETPWW---------- 76

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
              P  SW HP G  S +  +  HPVVHVS +DA+AYC+W G RLPTEA+WEY  RGG +
Sbjct: 77  LATPGASWRHPGGPGSDLTGKDTHPVVHVSHDDALAYCSWAGRRLPTEAQWEYAARGGRD 136

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
              +PWG+      + R N+++GEFP   T   G++ T  V +++ N +GLY  VGNVWE
Sbjct: 137 GARYPWGDTRPSPEDPRCNIFRGEFPDRPT---GHVGTVDVRTFEPNGYGLYQCVGNVWE 193

Query: 275 WTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
           W AD ++  ++  + + +P GPT GT +V +GGS+LC++ YC R+R AARS NTP+S+  
Sbjct: 194 WCADRFSARYYRTSATVDPTGPTRGTLRVLRGGSHLCHDSYCNRYRVAARSSNTPNSTTS 253

Query: 334 NLGFRCAAD 342
           N+GFR   D
Sbjct: 254 NIGFRTVGD 262


>gi|225012451|ref|ZP_03702887.1| protein of unknown function DUF323 [Flavobacteria bacterium
           MS024-2A]
 gi|225003428|gb|EEG41402.1| protein of unknown function DUF323 [Flavobacteria bacterium
           MS024-2A]
          Length = 345

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 127/350 (36%), Positives = 178/350 (50%), Gaps = 52/350 (14%)

Query: 7   PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           PP E    MVL+P   + MG        D EFP   V++ AFY+DQ+EV+N +F+ FV A
Sbjct: 33  PPPE---GMVLVPAGIYTMGGRSSQAYVD-EFPKHEVSISAFYMDQNEVTNQEFKLFVEA 88

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           TGY T AE+  D                         +E +  T+   +  P   +    
Sbjct: 89  TGYKTIAERDID-------------------------WEEMKKTVPLGVAKPHDSLLKAG 123

Query: 127 AVAYCTWRG-ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
           ++ +   +G   L   ++W     G          +W HPEG  S++E+RM HPVVH++W
Sbjct: 124 SLVFKQTKGPVNLEDYSQWWRWTTGA---------NWQHPEGSGSSLENRMEHPVVHIAW 174

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           +DA+AY TW   RLPTEAEWE+  +GG    ++PWGN        +AN WQG FP NN  
Sbjct: 175 DDALAYATWANKRLPTEAEWEWAAQGGNSGNIYPWGNTNVEAAFDQANFWQGVFPYNNRE 234

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-----YNPKGPTTGTD 300
            DGY  TAP+ S+  N +GL++M GNVWEW  D ++   +          NP+GP    D
Sbjct: 235 LDGYFGTAPITSFPPNGYGLFDMAGNVWEWCQDKYDFKAYEKAQNKGVVQNPQGPRQYND 294

Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                    V +GGS+LCN+ YC  +R + R  ++ DSS  + GFRC  D
Sbjct: 295 PREPLSPKHVMRGGSFLCNDSYCSGYRVSRRMSSSRDSSFNHTGFRCVQD 344


>gi|16125425|ref|NP_419989.1| hypothetical protein CC_1173 [Caulobacter crescentus CB15]
 gi|221234168|ref|YP_002516604.1| NIRV family protein [Caulobacter crescentus NA1000]
 gi|13422495|gb|AAK23157.1| conserved hypothetical protein [Caulobacter crescentus CB15]
 gi|220963340|gb|ACL94696.1| NIRV family protein [Caulobacter crescentus NA1000]
          Length = 338

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 172/348 (49%), Gaps = 59/348 (16%)

Query: 4   LPAP-PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           LP P P  R   MV L G  F+MG      ++  E P + VT+  F++DQ EV+N  F  
Sbjct: 30  LPVPDPQNRTAGMVRLAGGDFQMGAAP---LRPEEGPPQTVTVPPFWIDQTEVTNAAFAR 86

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV ATGY T AE+        PL    + A+ + V    +R   L               
Sbjct: 87  FVEATGYRTVAER--------PL----DPARYAHVPAAQRRPASL--------------- 119

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
                  +   +GAR    ++W          ++ P   W HPEG  S I  R   PVVH
Sbjct: 120 ------VFVGAKGARSDDPSQWW---------QVIPGADWRHPEGPGSNIRGRDAWPVVH 164

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           ++W DA+AY  W G  LPTEAEWEY  RGGL  + + WG+      + RAN WQG FP  
Sbjct: 165 IAWEDAMAYARWLGRDLPTEAEWEYAARGGLVGKRYTWGDQAQDPAKPRANTWQGVFPAQ 224

Query: 243 NTAADGYLST-APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----------APSY 290
           +   DG+ +  APV  +  N +GL +M GNVWEWT DW+     P           A + 
Sbjct: 225 DLGNDGFKAKPAPVGCFPPNGYGLRDMAGNVWEWTRDWFKPGLDPVSVLETGGPPEARAL 284

Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           +P+ P T    V KGGS+LC + YC+R+R AAR+   PDS A ++GFR
Sbjct: 285 DPEDPNT-PKHVVKGGSFLCADDYCFRYRPAARTPGPPDSGASHVGFR 331


>gi|357023852|ref|ZP_09086020.1| hypothetical protein MEA186_04144 [Mesorhizobium amorphae
           CCNWGS0123]
 gi|355544285|gb|EHH13393.1| hypothetical protein MEA186_04144 [Mesorhizobium amorphae
           CCNWGS0123]
          Length = 314

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 129/343 (37%), Positives = 172/343 (50%), Gaps = 60/343 (17%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +DMV +PG  F MG++        E P+  V +  F++D+  V+N +F+ FV A+ YVT 
Sbjct: 11  EDMVWIPGGEFLMGSDHHY---PEEAPAHRVVVGGFWMDRTTVTNAEFRRFVEASCYVTL 67

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE   +   + P    +  A  S +   ++   G+D +          H +W        
Sbjct: 68  AEHPANAADY-PGAGADSLAPASALF--VRPARGIDRS---------NHYNW-------- 107

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                      W Y  RG          +W HP G  ST++    HPVVH+++ DA AY 
Sbjct: 108 -----------WAY-VRGA---------NWRHPRGPASTLKGLEAHPVVHIAYEDAQAYA 146

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+   F WG+ LTPRG+H AN +QGEFP  N   DGY  T
Sbjct: 147 NWAGKALPTEAEWEFAARGGLDGADFVWGDELTPRGQHMANTYQGEFPFRNDCEDGYEWT 206

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH---------------HPAPSYNPKGPTT 297
           APV S+  N +GLY+M GNVW+WT DW+  H                    SY+P  P  
Sbjct: 207 APVGSFPANGYGLYDMAGNVWQWTTDWYQEHRLIESPCCTMTNPRGGEREASYDPLTPDI 266

Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               KV KGGS+LC   YC R+R AAR     D+S  +LGFRC
Sbjct: 267 RIPRKVTKGGSFLCAPSYCRRYRPAARMAQPVDTSTCHLGFRC 309


>gi|148553970|ref|YP_001261552.1| hypothetical protein Swit_1048 [Sphingomonas wittichii RW1]
 gi|148499160|gb|ABQ67414.1| protein of unknown function DUF323 [Sphingomonas wittichii RW1]
          Length = 304

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 125/343 (36%), Positives = 171/343 (49%), Gaps = 62/343 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG+++       E P+R V +D F++D+  V+N  F  FV+ATG+VT AE
Sbjct: 1   MAFVPGGVFTMGSDR---FYPEEAPARRVRVDPFFIDEAPVTNADFARFVAATGHVTCAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D   +  +L E   A                S +  R   PV               
Sbjct: 58  IAPDPKDYPGMLPELAHAG---------------SAVFTRTDRPV--------------- 87

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
             R P++  W          R      W HP G DS+I+   +HPVVHV+  DA AY  W
Sbjct: 88  DTRDPSQ--WW---------RFVLGADWRHPCGPDSSIDGLDDHPVVHVACADAEAYARW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGLE+  + WG+ L P G   AN WQG FP  NT  DG+  T+P
Sbjct: 137 AGKALPTEAEWEFAARGGLEDADYAWGDELAPDGRMMANYWQGLFPYANTLDDGWERTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWN-------------VHHHPA-----PSYNPKGPT 296
           + S+  N +GL++M+GNVWEWT+DW+              +  +P       S++P+ P 
Sbjct: 197 IRSFPPNGYGLHDMIGNVWEWTSDWYAPPKIERKAKGSCCIPANPRGARRHDSFDPQSPA 256

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 257 RIPRRVLKGGSHLCAPSYCQRYRPAARHPEAIDTSTSHIGFRC 299


>gi|326430702|gb|EGD76272.1| sulfatase modifying factor 2 [Salpingoeca sp. ATCC 50818]
          Length = 299

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 129/336 (38%), Positives = 167/336 (49%), Gaps = 67/336 (19%)

Query: 15  MVLLPGD--TFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           M+ +PG+  TFR+GT K      G  P R VT+D FYLD++ VSN  F+ FV  T YVT+
Sbjct: 20  MIYMPGEIGTFRLGTKKADSTH-GITPQRKVTVDPFYLDRYAVSNADFRAFVRDTKYVTD 78

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           +EK+G +FVF+  +        SQ   + +   G D               W  AV    
Sbjct: 79  SEKYGWSFVFKTFVD-----TASQDLAETQAMPGAD---------------WWLAVPRAY 118

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR                              P G  S I+ R+  PVV VSWNDA  YC
Sbjct: 119 WR-----------------------------KPFGKGSGIKDRLQFPVVQVSWNDAQQYC 149

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  RGGL++  +PWG+      ++R N WQG+FP  N   DGY   
Sbjct: 150 DWAGKRLPTEAEWEYAVRGGLQDSAYPWGSRYE---KNRMNAWQGKFPDTNVLGDGYHGA 206

Query: 253 APVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL-- 309
           APV +Y  +N +G YNM+GNVWEW AD         PS  PK        V +GGSY+  
Sbjct: 207 APVDAYGPQNDYGFYNMLGNVWEWVAD--RFEDPSPPSQEPK-------FVLRGGSYVDT 257

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
            + +  ++ R   R  NT DS++ N+GFRCA    P
Sbjct: 258 VDGKTNHKVRVTTRMGNTADSASDNIGFRCARSATP 293


>gi|170744776|ref|YP_001773431.1| hypothetical protein M446_6750 [Methylobacterium sp. 4-46]
 gi|168199050|gb|ACA20997.1| protein of unknown function DUF323 [Methylobacterium sp. 4-46]
          Length = 307

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 126/346 (36%), Positives = 174/346 (50%), Gaps = 64/346 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TFRMG+++       E P+  V + +F++D+  V+N QF+EFV ATG+VT AE
Sbjct: 1   MRWIPGGTFRMGSDR---HYPEEAPAHRVAVRSFWMDETPVTNRQFREFVRATGHVTFAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +                      R D K + G    + +     +V    +  V    W 
Sbjct: 58  R----------------------RPDPKDYPGALPNLLYAGS--IVFTPPDHPVDLRLW- 92

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                           G+   L     W HP G  ST+    NHPVVHV++ DA+AY  W
Sbjct: 93  ----------------GVWWTLLEGADWRHPYGPRSTLHGLDNHPVVHVTYADALAYARW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGLE   F WG+  TP G H AN WQG+FP  N A DG+  T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGLEGAEFAWGDAFTPAGRHMANTWQGQFPWQNLAQDGFARTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD---------- 300
             ++  N +GL++M+GNVWEWT+D++    HPA +      P+ P  GT+          
Sbjct: 197 AKAFPPNGYGLHDMIGNVWEWTSDYF-ADRHPADAPKACCIPRDPRGGTEERSYDPRLPE 255

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                +V KGGS+LC   +C R+R AAR     D+S  ++GFRC  
Sbjct: 256 IRIARRVLKGGSHLCAPNFCSRYRPAARLPQPVDTSTSHVGFRCVV 301


>gi|386821043|ref|ZP_10108259.1| hypothetical protein JoomaDRAFT_3026 [Joostella marina DSM 19592]
 gi|386426149|gb|EIJ39979.1| hypothetical protein JoomaDRAFT_3026 [Joostella marina DSM 19592]
          Length = 367

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 127/356 (35%), Positives = 186/356 (52%), Gaps = 59/356 (16%)

Query: 9   VERYKD------MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           +E+ KD      MV +PG ++  G  +      + E P+  V +D F++D  EV+N ++ 
Sbjct: 40  IEKPKDIKTPEGMVWIPGGSYTQGARSDDKFALNHEKPAHPVEVDGFFMDVTEVTNDEYA 99

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI---SQVRHDMKRFEGLDSTIEHRMHHP 118
           +FV+ATGY T AE+         +  EE + ++   +   HD     G  S I  +    
Sbjct: 100 KFVAATGYKTVAER--------AIEWEEIKKQVPPGTPKPHDSILQPG--SLIFKKTKTS 149

Query: 119 VVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNH 178
           V ++       +  W         EW+ G              W HP+G DS+IE + ++
Sbjct: 150 VPNL-----YDFSQWW--------EWKIGA------------DWKHPKGPDSSIEGKGDY 184

Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
           PVVH+++ DA+AYC W G RLPTE+EWE    GG+ N ++ WG + + +   +AN W GE
Sbjct: 185 PVVHIAYEDAIAYCEWAGRRLPTESEWELAAHGGVNNSMYAWGEDKS-KLTAQANTWNGE 243

Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPK 293
           FPT N  ADGY + APV S+  N +GLY M GNVWEWT DW+NV+++          NP+
Sbjct: 244 FPTTNNKADGYENKAPVKSFPPNAYGLYEMSGNVWEWTQDWFNVNYYGEMLEQGLVVNPQ 303

Query: 294 GPTTG--------TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           G T           +KV KGGS+LC++ YC  +R +AR     DS   +LGFR  A
Sbjct: 304 GATKAYNPNNRYVQEKVIKGGSFLCSDTYCSSYRISARMATALDSGQEHLGFRTVA 359


>gi|325673474|ref|ZP_08153165.1| sulfatase-modifying factor 1 [Rhodococcus equi ATCC 33707]
 gi|325555495|gb|EGD25166.1| sulfatase-modifying factor 1 [Rhodococcus equi ATCC 33707]
          Length = 324

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 127/356 (35%), Positives = 175/356 (49%), Gaps = 63/356 (17%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P    K+MV +PG TF MG+         E P   V +D F++D H+V+  +F+ FV AT
Sbjct: 11  PSRAPKNMVWVPGGTFWMGSED---FYPEERPVHEVGVDGFWMDTHQVTVAEFRRFVKAT 67

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G+VT AE   D   + P          S V                    P   +S +D 
Sbjct: 68  GHVTTAETAPDPADY-PDADPALLVPGSLV-----------------FTPPAQPVSLDDY 109

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             + +W                        P   W HPEG  S +  R  HPV HVS++D
Sbjct: 110 QQWWSW-----------------------VPGADWRHPEGPGSNVGGRERHPVTHVSYSD 146

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G  LPTEAEWE+  RGGL+ + F WG+  TP G H+AN WQG+FP  N A D
Sbjct: 147 AQAYAAWAGKTLPTEAEWEFAARGGLDRKTFVWGDEFTPDGRHQANTWQGQFPWQNLAED 206

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGT 299
           G++ T+PV +++ N +GL ++ GNVWEWT D+    H     + AP+ +   P+ P    
Sbjct: 207 GFVGTSPVGTFRPNGYGLVDVAGNVWEWTTDFHTADHGASGKNVAPASSCCIPRNPRVEI 266

Query: 300 DKVK----------KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
            +V+          KGGS+LC   YC R+R AAR   T ++S  ++GFRC   +GP
Sbjct: 267 GEVQAGEAYPRRVIKGGSHLCAPNYCLRYRPAARQGETEETSTCHIGFRCIV-RGP 321


>gi|334142809|ref|YP_004536017.1| hypothetical protein [Novosphingobium sp. PP1Y]
 gi|333940841|emb|CCA94199.1| conserved hypothetical protein [Novosphingobium sp. PP1Y]
          Length = 383

 Score =  202 bits (514), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 170/346 (49%), Gaps = 65/346 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +PG TF MG+         E P R V +D F++D+  V+N +F  FV ATGYVT AE
Sbjct: 76  MIHVPGGTFTMGSEN---FYPEEMPLRRVKVDPFWIDETPVTNREFARFVEATGYVTVAE 132

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D   +  +L   ++A                S + H+   PV      D      W 
Sbjct: 133 IAPDPKDYPGMLPGMDKAG---------------SLVFHKTATPV------DTSNPANW- 170

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W +         +F    W HP G DS ++   +HPVVH+++ DA AY  W
Sbjct: 171 ---------WSF---------VFG-ADWCHPLGPDSNLDELWDHPVVHIAYADAEAYAKW 211

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAE+E+  RGGLE + + WG+ L P G   AN WQG FP  N   DG+  T+P
Sbjct: 212 AGKALPTEAEFEFAARGGLEGKDYAWGDELAPDGAMMANYWQGLFPFANQCLDGWERTSP 271

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWN---------------VHHHP-----APSYNPKG 294
           V ++  N +GLY+M+GN WEWTADWW+               V  +P       SY+P  
Sbjct: 272 VRTFPANGYGLYDMIGNTWEWTADWWSERPQTVAKKKPSACCVMENPRGAKLKDSYDPVQ 331

Query: 295 PTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           P      KV KGGS+LC   YC R+R AAR     DS+  ++GFRC
Sbjct: 332 PGVRIGRKVLKGGSHLCAANYCQRYRPAARHPEMIDSATTHIGFRC 377


>gi|227537658|ref|ZP_03967707.1| sulfatase-modifying factor 1 [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227242489|gb|EEI92504.1| sulfatase-modifying factor 1 [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 363

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 121/339 (35%), Positives = 174/339 (51%), Gaps = 57/339 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G ++ MG+      +D + P   V + +F++D+HEV+N QF +FV ATGY+T AE
Sbjct: 69  MVKIEGGSYDMGSAD---FEDAK-PVHKVNVKSFWMDEHEVTNAQFAKFVKATGYLTVAE 124

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D   F P +  E     S V H  K   GL+  ++                    W 
Sbjct: 125 RPLDARDF-PGVDPEMLKPGSAVFHAPKEIRGLNDPLQ--------------------W- 162

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y              +W HPEG +S I+ + N+PVVH+++ DA AY  W
Sbjct: 163 ---------WSYVVGA----------NWKHPEGPESNIKGKENYPVVHIAYQDAEAYAKW 203

Query: 195 RGARLPTEAEWEYGCR-GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
            G RLPTEAEWEY  +     N  + WG   T  G+  AN++QGEFP  +T  DGY+  A
Sbjct: 204 AGKRLPTEAEWEYAAKTANYNNEKYYWGTEKTENGKWAANIYQGEFPLKDTGEDGYVGVA 263

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----------PAPSYNPKGPTTGTDKVK 303
           PV SY  N  GLY+M GNVWEW +D++   ++          P  SY+P+ P     +V+
Sbjct: 264 PVKSYPANTSGLYDMEGNVWEWCSDFYRPDYYKNSAESNPTGPRDSYDPQEPGA-VKRVQ 322

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +GGS+LCN+ YC R++  +R +   +S   N+GFR   D
Sbjct: 323 RGGSFLCNDMYCERYKAGSRGKGEVNSPTNNVGFRLVKD 361


>gi|126433661|ref|YP_001069352.1| hypothetical protein Mjls_1053 [Mycobacterium sp. JLS]
 gi|126233461|gb|ABN96861.1| protein of unknown function DUF323 [Mycobacterium sp. JLS]
          Length = 290

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 121/332 (36%), Positives = 170/332 (51%), Gaps = 49/332 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG TFRMG+ +       E P   V++  F +++H V+N QF EFV ATGYVT A
Sbjct: 4   ELVELPGGTFRMGSTQ---FYPEEVPVHTVSVADFAIERHPVTNAQFAEFVDATGYVTIA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++ P +   +    + V      F+     ++ R           D   + TW
Sbjct: 61  EQALDPALY-PGVPPADLVPGALV------FQPTAGPVDLR-----------DWRQWWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P   W HP G +S+   R +HPVV V++ DA AY  
Sbjct: 103 S-----------------------PGACWRHPFGPESSWHDRPDHPVVQVAYPDAAAYAK 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEA+WEY    G     +PWG++  P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAQWEYAAHAGAAT-TYPWGDDPQPGGRLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY----NPKGPTTGTDKVKKGGSYL 309
           PV ++  N FGL +M+GNVWEWT   ++ HH P  +      P  P    ++  KGGS+L
Sbjct: 199 PVGTFPANGFGLVDMIGNVWEWTTTRFSAHHRPDGAVESCCGPADPDPSVNQALKGGSHL 258

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           C  +YC+R+R AARS  + DS+  ++GFRC A
Sbjct: 259 CAPEYCHRYRAAARSPQSQDSATTHIGFRCVA 290


>gi|379749045|ref|YP_005339866.1| hypothetical protein OCU_43260 [Mycobacterium intracellulare ATCC
           13950]
 gi|379763897|ref|YP_005350294.1| hypothetical protein OCQ_44610 [Mycobacterium intracellulare
           MOTT-64]
 gi|378801409|gb|AFC45545.1| hypothetical protein OCU_43260 [Mycobacterium intracellulare ATCC
           13950]
 gi|378811839|gb|AFC55973.1| hypothetical protein OCQ_44610 [Mycobacterium intracellulare
           MOTT-64]
          Length = 296

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 124/336 (36%), Positives = 167/336 (49%), Gaps = 50/336 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++V LPG +FRMG+         E P   V +  F +++H V+N QF EFV ATGY+T 
Sbjct: 3   SELVDLPGGSFRMGSTS---FYPEEAPIHTVAVGPFAVERHPVTNAQFAEFVDATGYLTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++         A    +      F      ++ R                  
Sbjct: 60  AEQPIDPALYS-------GADPDDLCPGAMVFRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR         W++           P  SW HP G DS +  R +HPVV V++ DA AY 
Sbjct: 96  WR-------QWWQW----------VPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     + WG+ +T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEVTSDGRLMANTWQGSFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV ++  N FGL +M+GNVWEWTA  ++ HH     P     P GP      +  KGGS
Sbjct: 198 SPVGTFAPNAFGLIDMIGNVWEWTATEFSPHHRLDSPPKACCTPPGPADPSVSQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +LC  +YC+R+R AARS  + DS+  ++GFRC AD 
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293


>gi|345321597|ref|XP_001521381.2| PREDICTED: sulfatase-modifying factor 1-like, partial
           [Ornithorhynchus anatinus]
          Length = 222

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 91/133 (68%), Positives = 106/133 (79%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
            +W HPEG DSTI +R +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGLENRLFP
Sbjct: 13  ANWRHPEGPDSTILNRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLENRLFP 72

Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
           WGN L P+G+H AN+WQGEFP +NT  DGY  TAPV ++  N +GLYN+VGN WEWT+DW
Sbjct: 73  WGNKLQPKGQHYANIWQGEFPVSNTGEDGYKGTAPVTAFPPNGYGLYNIVGNAWEWTSDW 132

Query: 280 WNVHHHPAPSYNP 292
           W V+H    S +P
Sbjct: 133 WAVNHSTHESQDP 145


>gi|443307542|ref|ZP_21037329.1| hypothetical protein W7U_17885 [Mycobacterium sp. H4Y]
 gi|442764910|gb|ELR82908.1| hypothetical protein W7U_17885 [Mycobacterium sp. H4Y]
          Length = 296

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 124/336 (36%), Positives = 167/336 (49%), Gaps = 50/336 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            ++V LPG +FRMG+         E P   V +  F +++H V+N QF EFV ATGY+T 
Sbjct: 3   SELVDLPGGSFRMGSTS---FYPEEAPIHTVAVGPFAVERHPVTNAQFAEFVDATGYLTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++         A    +      F      ++ R                  
Sbjct: 60  AEQPIDPALYS-------GADPDDLCPGAMVFRPTPGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR         W++           P  SW HP G DS +  R +HPVV V++ DA AY 
Sbjct: 96  WR-------QWWQW----------VPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  R G     + WG+ +T  G   AN WQG FP  N  A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEVTSDGRLMANTWQGSFPYRNDGALGWVGT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
           +PV ++  N FGL +M+GNVWEWTA  ++ HH     P     P GP      +  KGGS
Sbjct: 198 SPVGTFAPNAFGLVDMIGNVWEWTATEFSPHHRLDSPPKACCTPPGPADPSVSQTLKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +LC  +YC+R+R AARS  + DS+  ++GFRC AD 
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293


>gi|126725775|ref|ZP_01741617.1| hypothetical protein RB2150_06203 [Rhodobacterales bacterium
           HTCC2150]
 gi|126704979|gb|EBA04070.1| hypothetical protein RB2150_06203 [Rhodobacterales bacterium
           HTCC2150]
          Length = 310

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 128/326 (39%), Positives = 168/326 (51%), Gaps = 62/326 (19%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +PG T  +GT +P+L  D E P R   L  F + Q  V+N  F +FV  T YVTEAE+FG
Sbjct: 39  IPGGTSMLGTAQPMLQNDNEEPLRKKQLKPFKMMQAAVTNQMFADFVEDTNYVTEAEEFG 98

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
            +FVF   LS+                  +D T        V    W        WR   
Sbjct: 99  WSFVFYQHLSD-----------------AVDET------QAVEGTQW--------WRRVE 127

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
              +A W          RL          G  S  + + +HPVVHVSW DA A+  W G 
Sbjct: 128 ---DANW----------RLI--------GGPKSETDFKPDHPVVHVSWADARAFAKWAGG 166

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR-ANVWQGEFPTNNTAADGYLSTAPVM 256
           RLP EAEWE+  RGGL +  FPWG       + +  N+WQG FPT+N+ ADGY +TAP +
Sbjct: 167 RLPNEAEWEHAARGGLGDVPFPWGEKEPNDTDFQPCNIWQGTFPTHNSLADGYSTTAPAI 226

Query: 257 SYKENKFGLYNMVGNVWEWTADWW----NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           S+  N +GLYNM GNVWEWT++ +    N     A    PKG      K+ KGGS+LC++
Sbjct: 227 SFDANGYGLYNMCGNVWEWTSEPFKYRSNSKKKQAMDPRPKG-----SKLLKGGSFLCHK 281

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFR 338
            YC+R+R AAR+  TPD++  + GFR
Sbjct: 282 SYCFRYRIAARTGTTPDTATSHQGFR 307


>gi|75910437|ref|YP_324733.1| hypothetical protein Ava_4239 [Anabaena variabilis ATCC 29413]
 gi|75704162|gb|ABA23838.1| Protein of unknown function DUF323 [Anabaena variabilis ATCC 29413]
          Length = 332

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 131/349 (37%), Positives = 172/349 (49%), Gaps = 69/349 (19%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           KDMV +PGDTF MG++        E P+R VT+  F++D++ V+N QFQ FV  T YVT 
Sbjct: 25  KDMVWIPGDTFTMGSDDHY---PEEAPTRRVTVTGFWMDKYAVTNKQFQRFVKETKYVTV 81

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  +   +   L E                  L          P   +  N     C 
Sbjct: 82  AERSPNPEDYPGALPE------------------LLVPASLVFEQPPYRVDLNS----CN 119

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W Y           P  +W +PEG  S+I+ R N+PVVH+++ DA AY 
Sbjct: 120 W----------WNY----------VPGANWRNPEGPGSSIKGRENYPVVHIAYEDAEAYA 159

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEA+WE+  RGGLE+ ++ WGN      +  AN+WQGEFP  N  +      
Sbjct: 160 KWAGKALPTEAQWEFAARGGLESAVYVWGNEFF-HNKLLANIWQGEFPWQNLKSHP-PRP 217

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS--------YNPKGPTTGTD---- 300
            PV SY  N +GLY+M GNVWEWT DW+     P PS         NP+G T        
Sbjct: 218 EPVGSYPPNNYGLYDMTGNVWEWTMDWY--RSQPLPSKPKSCCIPVNPRGGTLEESFDPN 275

Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                   KV KGGS+LC E YC+R+R AAR   T D+S  ++GFRC +
Sbjct: 276 FSDIRIPRKVLKGGSFLCAENYCFRYRPAARHAETIDTSTCHIGFRCVS 324


>gi|357020744|ref|ZP_09082975.1| hypothetical protein KEK_12058 [Mycobacterium thermoresistibile
           ATCC 19527]
 gi|356478492|gb|EHI11629.1| hypothetical protein KEK_12058 [Mycobacterium thermoresistibile
           ATCC 19527]
          Length = 300

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 173/340 (50%), Gaps = 57/340 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D+V +PG TFRMG+   I     E P   VT+  F +++H V+N QF EFV+ATGY T A
Sbjct: 6   DLVEIPGGTFRMGS---IRFYPEEAPVHTVTVADFAIERHPVTNAQFAEFVAATGYRTVA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++  + + + +                 + +      PV    W     + TW
Sbjct: 63  EQPLDPRLYPGVAAADLQPG---------------ALVFTPTRGPVDLRDWRQ---WWTW 104

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDS-TIEHRMNHPVVHVSWNDAVAYC 192
                                   P   W HP G +  + + R +HPVV +++ DA AY 
Sbjct: 105 T-----------------------PGACWRHPFGPEGPSWQDRPDHPVVQIAYPDAQAYA 141

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTE EWEY  R G +  ++PWG+   P G   AN WQG FP  N  A G++ T
Sbjct: 142 RWAGRRLPTEREWEYAARAGADT-VYPWGDEARPGGRLMANTWQGRFPYRNDGAAGWIGT 200

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------PSYNPKGPTTGTDK 301
           +PV ++  N FGL++M+GNVWEWT   ++ HH PA           PS   + P     +
Sbjct: 201 SPVGAFPPNDFGLFDMIGNVWEWTTTRFSGHHRPAEADEPVRSCCTPSAPVERPDPAVSQ 260

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
             KGGS+LC  +YC+R+R +ARS  + DS+A ++GFRCAA
Sbjct: 261 TLKGGSHLCAPEYCHRYRASARSPQSQDSAATHIGFRCAA 300


>gi|359398736|ref|ZP_09191752.1| hypothetical protein NSU_1438 [Novosphingobium pentaromativorans
           US6-1]
 gi|357599974|gb|EHJ61677.1| hypothetical protein NSU_1438 [Novosphingobium pentaromativorans
           US6-1]
          Length = 322

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 169/348 (48%), Gaps = 65/348 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ +PG TF MG+         E P R V +D F++D+  V+N  F  FV ATGYVT AE
Sbjct: 15  MIHIPGGTFTMGSET---FYPEEMPLRRVKVDPFWIDETPVTNRDFARFVEATGYVTVAE 71

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D   +  +L   ++A                S + H+   PV      D      W 
Sbjct: 72  IAPDPKDYPGMLPGMDKAG---------------SLVFHKTATPV------DTSNPANW- 109

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W +         +F    W HP G DS ++   +HPVVH+++ DA AY  W
Sbjct: 110 ---------WSF---------VFG-ADWCHPLGPDSNLDDLWDHPVVHIAYADAEAYAKW 150

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAE+E+  RGGLE + + WG+ L P G   AN WQG FP  N   DG+  T+P
Sbjct: 151 AGKALPTEAEFEFAARGGLEGKDYAWGDELAPDGAMMANYWQGLFPFANQCLDGWERTSP 210

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWN---------------VHHHP-----APSYNPKG 294
           V ++  N +GLY+M+GN WEWTADWW+               V  +P       SY+P  
Sbjct: 211 VRTFPANGYGLYDMIGNTWEWTADWWSERPQTVAKKKPGACCVMENPRGAKLKDSYDPVQ 270

Query: 295 PTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           P      KV KGGS+LC   YC R+R AAR     DS+  ++GFRC  
Sbjct: 271 PGVRIGRKVLKGGSHLCAANYCQRYRPAARHPEMIDSATTHIGFRCVV 318


>gi|15607852|ref|NP_215226.1| Conserved protein [Mycobacterium tuberculosis H37Rv]
 gi|15840119|ref|NP_335156.1| hypothetical protein MT0739 [Mycobacterium tuberculosis CDC1551]
 gi|31791898|ref|NP_854391.1| hypothetical protein Mb0733 [Mycobacterium bovis AF2122/97]
 gi|121636634|ref|YP_976857.1| hypothetical protein BCG_0762 [Mycobacterium bovis BCG str. Pasteur
           1173P2]
 gi|148660487|ref|YP_001282010.1| hypothetical protein MRA_0720 [Mycobacterium tuberculosis H37Ra]
 gi|148821917|ref|YP_001286671.1| hypothetical protein TBFG_10726 [Mycobacterium tuberculosis F11]
 gi|167968267|ref|ZP_02550544.1| hypothetical protein MtubH3_09659 [Mycobacterium tuberculosis
           H37Ra]
 gi|224989106|ref|YP_002643793.1| hypothetical protein JTY_0732 [Mycobacterium bovis BCG str. Tokyo
           172]
 gi|253797655|ref|YP_003030656.1| hypothetical protein TBMG_00726 [Mycobacterium tuberculosis KZN
           1435]
 gi|254231032|ref|ZP_04924359.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
 gi|254363658|ref|ZP_04979704.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
           Haarlem]
 gi|254549673|ref|ZP_05140120.1| hypothetical protein Mtube_04290 [Mycobacterium tuberculosis
           '98-R604 INH-RIF-EM']
 gi|289442113|ref|ZP_06431857.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
 gi|289446272|ref|ZP_06436016.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
 gi|289568656|ref|ZP_06448883.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
 gi|289573320|ref|ZP_06453547.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
 gi|289744436|ref|ZP_06503814.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
 gi|289752759|ref|ZP_06512137.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
 gi|289756800|ref|ZP_06516178.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
 gi|294992990|ref|ZP_06798681.1| hypothetical protein Mtub2_00405 [Mycobacterium tuberculosis 210]
 gi|297633211|ref|ZP_06950991.1| hypothetical protein MtubK4_03765 [Mycobacterium tuberculosis KZN
           4207]
 gi|297730191|ref|ZP_06959309.1| hypothetical protein MtubKR_03820 [Mycobacterium tuberculosis KZN
           R506]
 gi|298524203|ref|ZP_07011612.1| conserved hypothetical protein [Mycobacterium tuberculosis
           94_M4241A]
 gi|306796467|ref|ZP_07434769.1| hypothetical protein TMFG_03384 [Mycobacterium tuberculosis
           SUMu006]
 gi|306806536|ref|ZP_07443204.1| hypothetical protein TMGG_03733 [Mycobacterium tuberculosis
           SUMu007]
 gi|306970925|ref|ZP_07483586.1| hypothetical protein TMJG_02459 [Mycobacterium tuberculosis
           SUMu010]
 gi|307083219|ref|ZP_07492332.1| hypothetical protein TMLG_03206 [Mycobacterium tuberculosis
           SUMu012]
 gi|313657518|ref|ZP_07814398.1| hypothetical protein MtubKV_03810 [Mycobacterium tuberculosis KZN
           V2475]
 gi|339630780|ref|YP_004722422.1| hypothetical protein MAF_07210 [Mycobacterium africanum GM041182]
 gi|375294931|ref|YP_005099198.1| hypothetical protein TBSG_00730 [Mycobacterium tuberculosis KZN
           4207]
 gi|378770468|ref|YP_005170201.1| hypothetical protein BCGMEX_0733 [Mycobacterium bovis BCG str.
           Mexico]
 gi|383306613|ref|YP_005359424.1| hypothetical protein MRGA327_04455 [Mycobacterium tuberculosis
           RGTB327]
 gi|385990185|ref|YP_005908483.1| hypothetical protein [Mycobacterium tuberculosis CCDC5180]
 gi|385993785|ref|YP_005912083.1| hypothetical protein [Mycobacterium tuberculosis CCDC5079]
 gi|385997491|ref|YP_005915789.1| hypothetical protein MTCTRI2_0728 [Mycobacterium tuberculosis
           CTRI-2]
 gi|392385432|ref|YP_005307061.1| unnamed protein product [Mycobacterium tuberculosis UT205]
 gi|392431141|ref|YP_006472185.1| hypothetical protein TBXG_000719 [Mycobacterium tuberculosis KZN
           605]
 gi|397672521|ref|YP_006514056.1| hypothetical protein RVBD_0712 [Mycobacterium tuberculosis H37Rv]
 gi|422811652|ref|ZP_16860053.1| hypothetical protein TMMG_03419 [Mycobacterium tuberculosis
           CDC1551A]
 gi|424805790|ref|ZP_18231221.1| hypothetical protein TBPG_02993 [Mycobacterium tuberculosis W-148]
 gi|424946481|ref|ZP_18362177.1| hypothetical protein NCGM2209_1095 [Mycobacterium tuberculosis
           NCGM2209]
 gi|449062734|ref|YP_007429817.1| hypothetical protein K60_007590 [Mycobacterium bovis BCG str. Korea
           1168P]
 gi|13880269|gb|AAK44970.1| hypothetical protein MT0739 [Mycobacterium tuberculosis CDC1551]
 gi|31617485|emb|CAD93595.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
 gi|121492281|emb|CAL70748.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
           Pasteur 1173P2]
 gi|124600091|gb|EAY59101.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
 gi|134149172|gb|EBA41217.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
           Haarlem]
 gi|148504639|gb|ABQ72448.1| hypothetical protein MRA_0720 [Mycobacterium tuberculosis H37Ra]
 gi|148720444|gb|ABR05069.1| conserved hypothetical protein [Mycobacterium tuberculosis F11]
 gi|224772219|dbj|BAH25025.1| hypothetical protein JTY_0732 [Mycobacterium bovis BCG str. Tokyo
           172]
 gi|253319158|gb|ACT23761.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
           1435]
 gi|289415032|gb|EFD12272.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
 gi|289419230|gb|EFD16431.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
 gi|289537751|gb|EFD42329.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
 gi|289542410|gb|EFD46058.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
 gi|289684964|gb|EFD52452.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
 gi|289693346|gb|EFD60775.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
 gi|289712364|gb|EFD76376.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
 gi|298493997|gb|EFI29291.1| conserved hypothetical protein [Mycobacterium tuberculosis
           94_M4241A]
 gi|308343126|gb|EFP31977.1| hypothetical protein TMFG_03384 [Mycobacterium tuberculosis
           SUMu006]
 gi|308347013|gb|EFP35864.1| hypothetical protein TMGG_03733 [Mycobacterium tuberculosis
           SUMu007]
 gi|308359546|gb|EFP48397.1| hypothetical protein TMJG_02459 [Mycobacterium tuberculosis
           SUMu010]
 gi|308367089|gb|EFP55940.1| hypothetical protein TMLG_03206 [Mycobacterium tuberculosis
           SUMu012]
 gi|323720839|gb|EGB29906.1| hypothetical protein TMMG_03419 [Mycobacterium tuberculosis
           CDC1551A]
 gi|326905066|gb|EGE51999.1| hypothetical protein TBPG_02993 [Mycobacterium tuberculosis W-148]
 gi|328457436|gb|AEB02859.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
           4207]
 gi|339293739|gb|AEJ45850.1| hypothetical protein CCDC5079_0660 [Mycobacterium tuberculosis
           CCDC5079]
 gi|339297378|gb|AEJ49488.1| hypothetical protein CCDC5180_0651 [Mycobacterium tuberculosis
           CCDC5180]
 gi|339330136|emb|CCC25793.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
 gi|341600650|emb|CCC63320.1| conserved hypothetical protein [Mycobacterium bovis BCG str. Moreau
           RDJ]
 gi|344218537|gb|AEM99167.1| hypothetical protein MTCTRI2_0728 [Mycobacterium tuberculosis
           CTRI-2]
 gi|356592789|gb|AET18018.1| Hypothetical protein BCGMEX_0733 [Mycobacterium bovis BCG str.
           Mexico]
 gi|358230996|dbj|GAA44488.1| hypothetical protein NCGM2209_1095 [Mycobacterium tuberculosis
           NCGM2209]
 gi|378543983|emb|CCE36255.1| unnamed protein product [Mycobacterium tuberculosis UT205]
 gi|379026863|dbj|BAL64596.1| hypothetical protein ERDMAN_0784 [Mycobacterium tuberculosis str.
           Erdman = ATCC 35801]
 gi|380720566|gb|AFE15675.1| hypothetical protein MRGA327_04455 [Mycobacterium tuberculosis
           RGTB327]
 gi|392052550|gb|AFM48108.1| hypothetical protein TBXG_000719 [Mycobacterium tuberculosis KZN
           605]
 gi|395137426|gb|AFN48585.1| hypothetical protein RVBD_0712 [Mycobacterium tuberculosis H37Rv]
 gi|440580176|emb|CCG10579.1| hypothetical protein MT7199_0730 [Mycobacterium tuberculosis
           7199-99]
 gi|444894202|emb|CCP43456.1| Conserved protein [Mycobacterium tuberculosis H37Rv]
 gi|449031242|gb|AGE66669.1| hypothetical protein K60_007590 [Mycobacterium bovis BCG str. Korea
           1168P]
          Length = 299

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 169/337 (50%), Gaps = 53/337 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+ +       E P   VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +PL              D   + G+D+         +V       V    W
Sbjct: 61  E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G DS I  R  HPVV V++ DAVAY  
Sbjct: 97  R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG     + WG+   P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
           PV  +  N FGL +M+GNVWEWT   +  HH   P       P    T  D    +  KG
Sbjct: 199 PVGRFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKG 258

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GS+LC  +YC+R+R AARS  + D++  ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295


>gi|399042617|ref|ZP_10737294.1| hypothetical protein PMI09_04863 [Rhizobium sp. CF122]
 gi|398058956|gb|EJL50826.1| hypothetical protein PMI09_04863 [Rhizobium sp. CF122]
          Length = 318

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 173/344 (50%), Gaps = 65/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +V +PG TF MG+N+       E P+  VT+D F++D   V+N QF  FV ATGYVT AE
Sbjct: 17  LVWVPGRTFTMGSNEHY---PEEAPAHPVTVDGFWIDVTPVTNRQFAAFVKATGYVTVAE 73

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D   +   L +            M R   L  T    +  P +   W+         
Sbjct: 74  RTPDAADYPGALPK------------MLRAGSLVFTQPRTIVGPDISQWWS--------- 112

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    +++G              W HP G  S I  +++HPVVHV++ DA AY  W
Sbjct: 113 ---------FKFGA------------DWRHPYGGRSDIRGKLDHPVVHVAYGDARAYADW 151

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LP+EAEWE   RGGL+   F WG+ L P G+  AN WQG FPT ++ + G   T+P
Sbjct: 152 AGKALPSEAEWELAARGGLDEVEFAWGDELVPDGKLMANTWQGTFPTQSSKSAGADRTSP 211

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
           V ++  N +G+++M+GNVWEWT+D+W+   HP P                  SY+P+ P 
Sbjct: 212 VGAFPANGYGIHDMIGNVWEWTSDFWSA-RHPEPAAKSCCIPQNPRGGDRETSYDPRQPE 270

Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 271 IRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 314


>gi|433633745|ref|YP_007267372.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
           140070017]
 gi|432165338|emb|CCK62813.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
           140070017]
          Length = 299

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 169/337 (50%), Gaps = 53/337 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+         E P   VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTS---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTIA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +PL              D   + G+D+         +V       V    W
Sbjct: 61  E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G DS I  R  HPVV V++ DAVAY  
Sbjct: 97  R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG     + WG+   P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
           PV ++  N FGL +M+GNVWEWT   +  HH   P       P    T  D    +  KG
Sbjct: 199 PVGTFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKG 258

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GS+LC  +YC+R+R AARS  + D++  ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295


>gi|270290491|ref|ZP_06196716.1| sulfatase modifying factor 1 [Pediococcus acidilactici 7_4]
 gi|270281272|gb|EFA27105.1| sulfatase modifying factor 1 [Pediococcus acidilactici 7_4]
          Length = 283

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 125/335 (37%), Positives = 170/335 (50%), Gaps = 61/335 (18%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D   + G+TF+MGTNK    +D E P   VT+  F + +  V+N +F +FV  T Y T A
Sbjct: 2   DYQKISGETFKMGTNKGGFPEDFEGPQELVTVTNFEIAKTAVTNREFGQFVKETNYKTVA 61

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E  G++FVF+  + +   AK SQ  H                                  
Sbjct: 62  ETKGNSFVFQAFIPQ---AKQSQYLH---------------------------------- 84

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                PT A W           + P  SW HP G DS IE  ++HPVVHV   DA+A+C 
Sbjct: 85  -----PTGAPWWL---------VVPGASWRHPFGPDSDIEDILDHPVVHVGLEDAIAFCE 130

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W GA+LPTEA+WE   R G     +PWG+ L    ++ AN WQG+FP  N A DG++ TA
Sbjct: 131 WAGAKLPTEAQWECAARSGSTTE-YPWGDELVVDKKYHANTWQGKFPWENDALDGFVGTA 189

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKG-PTTGTDKVK-----KGGS 307
           PV S+  N  GLY M+GNVWEW     N  + P   +N +      +DK+K     +GGS
Sbjct: 190 PVKSFAPNANGLYQMIGNVWEWCR---NPRYVPLLDFNQRVYELKSSDKLKGEFAIRGGS 246

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 247 FLCHCSYCNRYRVAARNGADYQSTSSHLGFRCIKE 281


>gi|118473016|ref|YP_885834.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
 gi|399985837|ref|YP_006566185.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
 gi|118174303|gb|ABK75199.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
 gi|399230397|gb|AFP37890.1| Sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
          Length = 296

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 127/337 (37%), Positives = 174/337 (51%), Gaps = 54/337 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +PG +FRMG+         E P    T+  F +++H V+N QF EFV ATGYVT A
Sbjct: 4   ELVEVPGGSFRMGSTS---FYPEEAPVHTATVGDFAIERHPVTNAQFAEFVEATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++ P + E +    S V      F      ++ R                  W
Sbjct: 61  ERPLDPKLY-PGVPEADLVPGSLV------FRPTSGPVDLR-----------------DW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G       R +HPVV V++ DAVAY T
Sbjct: 97  R-------QWWDWA----------PGACWHHPFGPRREF-CRPDHPVVQVAYPDAVAYAT 138

Query: 194 WRGARLPTEAEWEYGCRGGLENR---LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           W G RLPTEAEWEY  RGG +     L+ WG+ + P G+  AN WQG+FP  N  A G+ 
Sbjct: 139 WAGRRLPTEAEWEYAARGGPKGGVGFLYAWGDEVCPDGQLMANTWQGKFPYRNDGALGWK 198

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKK 304
            T+PV ++  N+ GL +M+GNVWEWTA  ++ HH P    +      P G   G ++V K
Sbjct: 199 GTSPVGTFPPNRLGLVDMIGNVWEWTATKFSAHHRPGDESHVTCCPPPSGGDPGVNQVLK 258

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           GGS+LC  +YC+R+R AARS  + DSS  ++GFRC  
Sbjct: 259 GGSHLCAPEYCHRYRPAARSPQSQDSSTTHIGFRCVV 295


>gi|159038683|ref|YP_001537936.1| hypothetical protein Sare_3137 [Salinispora arenicola CNS-205]
 gi|157917518|gb|ABV98945.1| protein of unknown function DUF323 [Salinispora arenicola CNS-205]
          Length = 311

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 126/334 (37%), Positives = 163/334 (48%), Gaps = 51/334 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +DMV +PG  FRMG+N        E P+R V +D F++DQ++V+   F  FV  TG+VT 
Sbjct: 19  RDMVWIPGGRFRMGSND---FYPEEGPARWVEVDGFWIDQYQVTVADFDRFVRDTGHVTM 75

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE               E A       D+       S +  +   PV   S  D   + +
Sbjct: 76  AE------------VPPEAADYPDADPDLLV---PGSLVFRKAPGPV---SLADYRNWWS 117

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W                        P  +W HPEG  STIE R  HPV HV+++DAVA+ 
Sbjct: 118 WT-----------------------PGATWRHPEGPGSTIEGRDLHPVTHVAYSDAVAFA 154

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           TW G  LPTE+EWE   RGGL + +F WG+   P G   AN WQG FP  N   DGY  T
Sbjct: 155 TWAGKELPTESEWELAARGGLADAVFTWGDEFAPEGRMMANTWQGAFPWQNLLLDGYAGT 214

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK-------VKKG 305
           +PV ++  N + LY+M GNVWEWT D +     P+       P T  +        V KG
Sbjct: 215 SPVGAFPPNGYSLYDMAGNVWEWTRDLFTYPAGPSAKACCTTPVTIDEPGSAIPRHVIKG 274

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           GS+LC   YC R+R AAR     D+S  +LGFRC
Sbjct: 275 GSHLCAPNYCMRYRPAARQGEAIDTSTCHLGFRC 308


>gi|241666809|ref|YP_002984893.1| hypothetical protein Rleg_6898 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
 gi|240862266|gb|ACS59931.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
          Length = 301

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 129/343 (37%), Positives = 169/343 (49%), Gaps = 60/343 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG+++       E P+  V +D F++D   V+N  F  FV ATGYVT AE
Sbjct: 1   MSWIPGGDFLMGSDRHYA---EEAPAHRVRVDGFWMDTCPVTNWAFSAFVEATGYVTTAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  +   + P    E  A  S V    K    L +           H +W          
Sbjct: 58  RPANPADY-PGAIMEMLAPASVVFIKPKSPVDLTN-----------HYNW---------- 95

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W+Y  RG          +W HP G  STI+  + HPVVHV++ DA AY  W
Sbjct: 96  ---------WKY-VRGA---------NWRHPRGPASTIKGLLEHPVVHVAFEDAKAYAAW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGLE   + WG  + P G H AN WQGEFP  N+  DG+  T+P
Sbjct: 137 AGKELPTEAEWEFAARGGLEAADYVWGGEMLPGGRHMANTWQGEFPLQNSREDGFEGTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAPSYNPKGPTTGT 299
           V S+  N +GL++M GNVWEWT DW+  H                  + S++P+ P    
Sbjct: 197 VGSFPANGYGLFDMAGNVWEWTTDWYQEHGMIDSPCCTISNPVGAELSNSHDPRQPQISI 256

Query: 300 D-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
             KV KGGS+LC   YC R+R AAR     D++  +LGFRC A
Sbjct: 257 PRKVMKGGSHLCAPNYCRRYRPAARMAQPVDTATCHLGFRCIA 299


>gi|340625733|ref|YP_004744185.1| hypothetical protein MCAN_07171 [Mycobacterium canettii CIPT
           140010059]
 gi|340003923|emb|CCC43057.1| conserved hypothetical protein [Mycobacterium canettii CIPT
           140010059]
          Length = 299

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 169/337 (50%), Gaps = 53/337 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+         E P   VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTS---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +PL              D   + G+D+         +V       V    W
Sbjct: 61  E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G DS I  R  HPVV V++ DAVAY  
Sbjct: 97  R-------QWWDW----------VPGACWRHPFGRDSDIAGRAGHPVVQVAYPDAVAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG     + WG+   P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTAPYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
           PV ++  N FGL +M+GNVWEWT   +  HH   P       P    T  D    +  KG
Sbjct: 199 PVGTFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKG 258

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GS+LC  +YC+R+R AARS  + D++  ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295


>gi|54400462|ref|NP_001005980.1| sulfatase-modifying factor 2 precursor [Danio rerio]
 gi|53734676|gb|AAH83430.1| Sulfatase modifying factor 2 [Danio rerio]
 gi|157423577|gb|AAI53568.1| Sulfatase modifying factor 2 [Danio rerio]
 gi|182891192|gb|AAI64065.1| Sumf2 protein [Danio rerio]
          Length = 299

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 126/345 (36%), Positives = 173/345 (50%), Gaps = 71/345 (20%)

Query: 1   MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           ++LL         +MV +PG    MGT+     +DGE P+R V L  F +D++ V+N+ F
Sbjct: 13  LILLCTVSCAADDEMVFIPGGKMLMGTSAAD-GRDGESPTRAVALQPFKIDKYPVTNSNF 71

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           +EFV    Y TEAE FG +FVF+  +SEE ++K++Q                        
Sbjct: 72  REFVRLQKYKTEAETFGWSFVFQDFVSEELKSKVTQ------------------------ 107

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                  +    W    LP E             ++F    W  P G  S I+ R++ PV
Sbjct: 108 ------KIESAPWW---LPVE-------------KVF----WRQPAGPGSGIKDRLDCPV 141

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           V VSWNDA AYC W+  RLPTE EWE   RGGLE R +PWGN       +R N+WQG FP
Sbjct: 142 VQVSWNDAQAYCQWKKKRLPTEEEWEMAARGGLEGRSYPWGNKYL---LNRTNLWQGPFP 198

Query: 241 TNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
             ++A DGY   APV +Y  +N +GLY+M+GNVWEWT+                 P   +
Sbjct: 199 DKDSAEDGYHGVAPVTAYPPQNNYGLYDMLGNVWEWTSS--------------SFPGAQS 244

Query: 300 DKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             V +G S++   +    +R R   R  NT DS++ NLGFRCA +
Sbjct: 245 MFVLRGASWIDTADGSANHRARVTTRMGNTADSASDNLGFRCAMN 289


>gi|433625806|ref|YP_007259435.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
           140060008]
 gi|433629796|ref|YP_007263424.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
           140070010]
 gi|432153412|emb|CCK50634.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
           140060008]
 gi|432161389|emb|CCK58731.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
           140070010]
          Length = 299

 Score =  201 bits (510), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 131/337 (38%), Positives = 169/337 (50%), Gaps = 53/337 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+         E P   VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTS---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +PL              D   + G+D+         +V       V    W
Sbjct: 61  E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G DS I  R  HPVV V++ DAVAY  
Sbjct: 97  R-------QWWDW----------VPGACWRHPFGRDSDIAGRAGHPVVQVAYPDAVAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG     + WG+   P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGGTTTP-YAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
           PV ++  N FGL +M+GNVWEWT   +  HH   P       P    T  D    +  KG
Sbjct: 199 PVGTFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKG 258

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GS+LC  +YC+R+R AARS  + D++  ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295


>gi|221640916|ref|YP_002527178.1| hypothetical protein RSKD131_2817 [Rhodobacter sphaeroides KD131]
 gi|221161697|gb|ACM02677.1| Hypothetical Protein RSKD131_2817 [Rhodobacter sphaeroides KD131]
          Length = 297

 Score =  201 bits (510), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 125/334 (37%), Positives = 167/334 (50%), Gaps = 53/334 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ LPG  F MG++        E P    +++ F++D   V+N  +  FVSATGYVT AE
Sbjct: 1   MIWLPGAVFYMGSDS---HYPEEAPVHLASVEGFWIDVFPVTNADYARFVSATGYVTVAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                           R+  ++   D      +  ++  R    V      D   +  W 
Sbjct: 58  ----------------RSPGAEDYPDADPALLVPGSLVFRRPASV-----EDPGDWTRW- 95

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y           P   W HP+G    +  R++HPVVHV   DA AY  W
Sbjct: 96  ---------WSYQ----------PGACWRHPDGPGRDVAGRLDHPVVHVCHEDAQAYAHW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWEY  RGGL+   F  G+ L P G HRAN+WQG+FP  N A DG+  T+P
Sbjct: 137 AGKDLPTEAEWEYAARGGLDRAEFARGDVLVPEGRHRANIWQGQFPVENLAEDGFEGTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPS-YNPKGPTTG------TDKVKKG 305
           V ++  N +GLY+M+GNVWEWT+D +   H    AP    P  P +G        +V KG
Sbjct: 197 VRAFPPNGYGLYDMIGNVWEWTSDGFRADHGGRSAPGCCAPISPASGGHVLPEARQVLKG 256

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           G +LC   YC R+R AAR   TPDS++G++GFRC
Sbjct: 257 GYHLCAPNYCRRYRPAARQGQTPDSASGHIGFRC 290


>gi|375091008|ref|ZP_09737314.1| hypothetical protein HMPREF9709_00176 [Helcococcus kunzii ATCC
           51366]
 gi|374564799|gb|EHR36080.1| hypothetical protein HMPREF9709_00176 [Helcococcus kunzii ATCC
           51366]
          Length = 289

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 116/337 (34%), Positives = 166/337 (49%), Gaps = 67/337 (19%)

Query: 14  DMVLLPGDTFRMGTNK----PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           + V +P  TF+MG       P+   D E P  +V+++ F +    ++N  F+EFV  T Y
Sbjct: 5   NFVNIPSGTFKMGATTDEGHPL---DKETPQVSVSINEFQISDTTITNEHFKEFVDKTNY 61

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            T AE +G +FVF  LL EE +    +V   +                            
Sbjct: 62  TTTAETYGGSFVFHLLLDEETKKNSLKVERTL---------------------------- 93

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
                         W Y  +G          SW HP G DS I+  ++HPVVH+S+ DA+
Sbjct: 94  --------------WWYYVKGA---------SWKHPFGPDSNIDDILDHPVVHISYFDAL 130

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           A+C W   RLP+EAEWEY  R G E + +PWG+      ++ AN++QG+FP  NT  DG+
Sbjct: 131 AFCEWGNYRLPSEAEWEYAARAGSEYK-WPWGDTFLQDEKYMANIFQGDFPHTNTEKDGF 189

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTAD--------WWNVHHHPAPSYNPKGPTTGTDK 301
           L T P   +  NKFGLYN++GNVWEW  +        +  V +   P YN +     +  
Sbjct: 190 LGTCPAKYFSPNKFGLYNVIGNVWEWCMNPRYLDLETFKTVDYKIYPPYNLENVNDNSQF 249

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
             +GGS+LC+  YC R+R  AR+ N+  S+A N GFR
Sbjct: 250 ATRGGSFLCHPSYCNRYRLGARNGNSASSTASNCGFR 286


>gi|91783680|ref|YP_558886.1| hypothetical protein Bxe_A2134 [Burkholderia xenovorans LB400]
 gi|91687634|gb|ABE30834.1| Conserved hypothetical protein [Burkholderia xenovorans LB400]
          Length = 384

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 123/352 (34%), Positives = 170/352 (48%), Gaps = 58/352 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG++     +  E P+  V +  F++D+H V+N +F+ FV ATGYVT AE
Sbjct: 44  MAWVPGGQFLMGSDA-KQAQPNERPAHKVKVHGFWMDRHHVTNAEFRRFVEATGYVTTAE 102

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC-TW 133
           K  D                         +E L   +      P        A+ +  T 
Sbjct: 103 KKPD-------------------------WETLKVQLPPGTPRPPESAMVAGAMVFVGTS 137

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R   L   ++W          R  P  +W HP G +S I  + +HPVV VS+ DA AY  
Sbjct: 138 RPVPLDDYSQWW---------RYVPGANWRHPAGPESNIIGKDDHPVVQVSYEDAQAYAK 188

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG+  +P G+  ANVWQG+    FP  N  A G 
Sbjct: 189 WAGKRLPTEAEWEFAARGGLEQATYAWGDQFSPNGKQMANVWQGQQPQSFPVVNPKAGGA 248

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
           L T+PV ++  N +GL +M GN W+W ADW+                    P+ S++P  
Sbjct: 249 LGTSPVGTFPANGYGLSDMTGNAWQWVADWYRADQFRREAVSTSAIDNPVGPSESWDPAD 308

Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
              P     +V +GGS+LCNE YC  +R +AR    P +S  +LGFR   D+
Sbjct: 309 QGVPVNAPKRVTRGGSFLCNEIYCLSYRPSARRGTDPYNSMSHLGFRLVMDE 360


>gi|312139220|ref|YP_004006556.1| hypothetical protein REQ_18030 [Rhodococcus equi 103S]
 gi|311888559|emb|CBH47871.1| conserved hypothetical protein [Rhodococcus equi 103S]
          Length = 324

 Score =  200 bits (509), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 126/356 (35%), Positives = 174/356 (48%), Gaps = 63/356 (17%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P    K+MV +PG TF MG+         E P   V +D F++D H+V+  +F+ FV AT
Sbjct: 11  PSRAPKNMVWVPGGTFWMGSED---FYPEERPVHEVGVDGFWMDTHQVTVAEFRRFVKAT 67

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G+VT AE   D   + P          S V                    P   +S +D 
Sbjct: 68  GHVTTAETAPDPADY-PDADPALLVPGSLV-----------------FTPPAQPVSLDDY 109

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             + +W                        P   W HPEG  S +  R  HPV HVS++D
Sbjct: 110 QQWWSW-----------------------VPGADWRHPEGPGSNVGGRERHPVTHVSYSD 146

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G  LPTEAEWE+  RGGL+ + F WG+   P G H+AN WQG+FP  N A D
Sbjct: 147 AQAYAAWAGKTLPTEAEWEFAARGGLDRKTFVWGDEFAPDGRHQANTWQGQFPWQNLAED 206

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGT 299
           G++ T+PV +++ N +GL ++ GNVWEWT D+    H     + AP+ +   P+ P    
Sbjct: 207 GFVGTSPVGTFRPNGYGLVDVAGNVWEWTTDFHTADHGASGKNVAPASSCCIPRNPRVEI 266

Query: 300 DKVK----------KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
            +V+          KGGS+LC   YC R+R AAR   T ++S  ++GFRC   +GP
Sbjct: 267 GEVQAGEAYPRRVIKGGSHLCAPNYCLRYRPAARQGETEETSTCHIGFRCIV-RGP 321


>gi|424878448|ref|ZP_18302088.1| hypothetical protein Rleg8DRAFT_6322 [Rhizobium leguminosarum bv.
           trifolii WU95]
 gi|392520940|gb|EIW45669.1| hypothetical protein Rleg8DRAFT_6322 [Rhizobium leguminosarum bv.
           trifolii WU95]
          Length = 315

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 128/343 (37%), Positives = 169/343 (49%), Gaps = 60/343 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG+++       E P+  V +D F++D   V+N  F  FV A+GYVT AE
Sbjct: 15  MSWIPGGDFLMGSDRHYA---EEAPAHRVRVDGFWMDTCPVTNWAFSAFVEASGYVTTAE 71

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  +   + P    E  A  S V    K    L +           H +W          
Sbjct: 72  RPANPADY-PGAIMEMLAPASVVFIKPKSPVDLTN-----------HYNW---------- 109

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W+Y  RG          +W HP G  STI+  + HPVVHV++ DA AY  W
Sbjct: 110 ---------WKY-VRGA---------NWRHPRGPASTIKGLLEHPVVHVAFEDAKAYAAW 150

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGLE   + WG  + P G H AN WQGEFP  N+  DG+  T+P
Sbjct: 151 AGKELPTEAEWEFAARGGLEAADYVWGGEMLPGGRHMANTWQGEFPVRNSREDGFEGTSP 210

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAPSYNPKGPTTGT 299
           V S+  N +GL++M GNVWEWT DW+  H                  + S++P+ P    
Sbjct: 211 VGSFPANGYGLFDMAGNVWEWTTDWYQEHGMIDSPCCTISNPVGAELSNSHDPRQPQISI 270

Query: 300 D-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
             KV KGGS+LC   YC R+R AAR     D++  +LGFRC A
Sbjct: 271 PRKVMKGGSHLCAPNYCRRYRPAARMAQPVDTATCHLGFRCIA 313


>gi|359772684|ref|ZP_09276103.1| hypothetical protein GOEFS_069_00070 [Gordonia effusa NBRC 100432]
 gi|359310147|dbj|GAB18881.1| hypothetical protein GOEFS_069_00070 [Gordonia effusa NBRC 100432]
          Length = 294

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 168/343 (48%), Gaps = 64/343 (18%)

Query: 16  VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
           + +PG  + MG+ +       E P R V + +F +D   ++N QF EFV+ATGYVT AE 
Sbjct: 1   MTIPGGEYLMGSTR---HYPEERPVRPVQVSSFRIDARAITNAQFAEFVAATGYVTLAE- 56

Query: 76  FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR----MHHPVVHISWNDAVAYC 131
                  +PL              D  R+ G D  +        H     +  +D   + 
Sbjct: 57  -------QPL--------------DPSRYPGADPELLKPGGLVFHQTPEPVELHDLTQW- 94

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                       W Y           P  +W  P G  S++    +HPVV V   DA AY
Sbjct: 95  ------------WAY----------VPGANWRAPHGPQSSLHGLDDHPVVQVCHADAAAY 132

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             WRG RLPTEAEWE+  RGGL ++ + WG +      +R N W+GEFP  NTA+DG+ S
Sbjct: 133 ARWRGVRLPTEAEWEFAARGGLASQPYTWGADADDDPPYRVNRWRGEFPWQNTASDGFES 192

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD----------- 300
           TAPV S++ N +GLY+M GNVWEWT  WW  HHH   +    GP                
Sbjct: 193 TAPVGSFEPNGYGLYDMAGNVWEWTDTWWT-HHHAVETSPCCGPDVAMQRSMGPGDDVGR 251

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +V KGGS+LC   YC R+R AAR     DS+  +LGFRCAAD+
Sbjct: 252 RVIKGGSHLCAPNYCLRYRPAARQPEAIDSATCHLGFRCAADE 294


>gi|300024443|ref|YP_003757054.1| hypothetical protein Hden_2938 [Hyphomicrobium denitrificans ATCC
           51888]
 gi|299526264|gb|ADJ24733.1| protein of unknown function DUF323 [Hyphomicrobium denitrificans
           ATCC 51888]
          Length = 333

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 128/347 (36%), Positives = 177/347 (51%), Gaps = 64/347 (18%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +  MV +PG TFRMG++        E P+  V++D+F++D+  ++N +F+ FV AT YVT
Sbjct: 24  HSGMVFVPGGTFRMGSDHHY---PEEAPAHRVSVDSFWIDEAPITNAEFRRFVDATNYVT 80

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AE         P  S+   A    +R     F+     I+ R        +W       
Sbjct: 81  FAE-------LTPNASDYPGALPHMLRAGSLVFKPPAGEIDLR--------NWGQ----- 120

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          WE+  R G         +W  P G  S+I+    HPVVHV++ DA AY
Sbjct: 121 -W----------WEF--RFG--------ANWRLPYGPGSSIKGLDEHPVVHVAYCDAEAY 159

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWE+  RGG+E   F WG+ LTP G++ AN WQG FP  N   DGY  
Sbjct: 160 AHWAGKELPTEAEWEFAARGGIEGAEFAWGDELTPGGKYMANTWQGIFPRENLKLDGYER 219

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-------NPKGPTTGTD---- 300
           T+P+ ++  N +G+++M+GNVWEWT+DW+    HPA +        NP+GP         
Sbjct: 220 TSPIKAFPPNGYGIHDMIGNVWEWTSDWY-APKHPADAQKACCIPKNPRGPGQDQSLDPY 278

Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                   KV KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 279 QPQIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRC 325


>gi|385209534|ref|ZP_10036402.1| hypothetical protein BCh11DRAFT_06664 [Burkholderia sp. Ch1-1]
 gi|385181872|gb|EIF31148.1| hypothetical protein BCh11DRAFT_06664 [Burkholderia sp. Ch1-1]
          Length = 416

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 124/352 (35%), Positives = 167/352 (47%), Gaps = 58/352 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG++     +  E P+  V +  F++D+H V+N +F+ FV ATGYVT AE
Sbjct: 76  MAWVPGGQFLMGSDA-KQAQPNERPAHKVKVHGFWMDRHHVTNAEFRRFVEATGYVTTAE 134

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           K  D    +  L     R   S +      F G    +             +D   Y  W
Sbjct: 135 KKPDWDTLKVQLPPGTPRPPESAMVAGAMVFVGTSRPVPL-----------DD---YSQW 180

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y           P   W HP G DS I  + +HPVV VS+ DA AY  
Sbjct: 181 ----------WRY----------VPGADWRHPTGPDSNIIGKDDHPVVQVSYEDAQAYAK 220

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG+  +P G+  ANVWQG+    FP  N  A G 
Sbjct: 221 WAGKRLPTEAEWEFAARGGLEQATYAWGDQFSPDGKQMANVWQGQQPQSFPVVNPKAGGA 280

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
           L T+P  ++  N +GL +M GN W+W ADW+                    P+ S++P  
Sbjct: 281 LGTSPAGTFPANGYGLSDMTGNAWQWVADWYRADQFRREAVSTSAIDNPVGPSDSWDPAD 340

Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
              P     +V +GGS+LCNE YC  +R +AR    P +S  +LGFR   D+
Sbjct: 341 QGVPVNAPKRVTRGGSFLCNEIYCLSYRPSARRGTDPYNSMSHLGFRLVMDE 392


>gi|433645849|ref|YP_007290851.1| hypothetical protein Mycsm_01020 [Mycobacterium smegmatis JS623]
 gi|433295626|gb|AGB21446.1| hypothetical protein Mycsm_01020 [Mycobacterium smegmatis JS623]
          Length = 300

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 127/381 (33%), Positives = 175/381 (45%), Gaps = 91/381 (23%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+ +       E P    T+ AF +++H V+N Q+ EFV+ T YVT A
Sbjct: 4   ELVELPGGSFRMGSTQ---FYPEEAPLHTATVGAFAVERHPVTNAQYAEFVAETDYVTIA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++                HD+       + +      PV    W    A+   
Sbjct: 61  EQPPDPALYP-----------GAAPHDLVP----GALVFQPSQGPVDLRDWRQWWAW--- 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  SW HP G  S IE R++HPVV V++ DA AY  
Sbjct: 103 -----------------------LPGASWRHPFGAGSGIEDRLDHPVVQVAYPDAAAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG    ++PWG++ TP G+  AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYASRGGAST-VYPWGDDATPGGQLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV ++  N FGL +M+GNVWEWT   ++ HH         GP                  
Sbjct: 199 PVGAFPPNGFGLVDMIGNVWEWTTTQFSAHH------RLDGPIE---------------- 236

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 373
                                    C     P    ++  KGGS+LC  +YC+R+R AAR
Sbjct: 237 ------------------------SCCGPSSPDPTVNQTLKGGSHLCAPEYCHRYRPAAR 272

Query: 374 SQNTPDSSAGNLGFRCAADVS 394
           S  + DS+  ++GFRC AD+S
Sbjct: 273 SPQSQDSATTHIGFRCVADLS 293


>gi|254504616|ref|ZP_05116767.1| conserved domain protein [Labrenzia alexandrii DFL-11]
 gi|222440687|gb|EEE47366.1| conserved domain protein [Labrenzia alexandrii DFL-11]
          Length = 307

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 126/336 (37%), Positives = 161/336 (47%), Gaps = 55/336 (16%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           L G TF MG+ +       E P R VTL  F + + EV+N +F  FV  TGYVT AEK  
Sbjct: 13  LEGGTFAMGSGEHYRE---EAPVRQVTLQPFSIQKTEVTNAKFAAFVEETGYVTTAEKDL 69

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
           D   + P L  +     S V    K    L+           +++ W             
Sbjct: 70  DQSDY-PGLPPDLLKAGSMVFAQPKDAVALND----------INLWW------------- 105

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
                            R  P   W HP G +S+IE   +HPVV VS  DA AY  W G 
Sbjct: 106 -----------------RYVPGAQWRHPRGPESSIEGLEDHPVVQVSIEDAKAYAAWAGG 148

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
           RLPTEAEWE+  RGGL+   F WG +  P    +AN WQG FP+ +TA DG+  +APV S
Sbjct: 149 RLPTEAEWEFAARGGLDGAQFSWGEDYDPSEGWKANTWQGVFPSIDTADDGHHGSAPVGS 208

Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-----------TGTDKVKKGG 306
           Y  N +GLY+M GNVWE+ +DWW   H      NP GP+            G   V KGG
Sbjct: 209 YASNGYGLYDMAGNVWEYVSDWWIPGHPTKDQTNPGGPSKTLAEQFANSAVGARHVVKGG 268

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           S+LC   YC+R+R +AR        + ++GFR A D
Sbjct: 269 SWLCAPSYCFRYRPSARQPAERSLGSNHIGFRIARD 304


>gi|339522051|gb|AEJ84190.1| sulfatase-modifying factor 2 [Capra hircus]
          Length = 301

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 130/338 (38%), Positives = 169/338 (50%), Gaps = 70/338 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV LPG  F+MGTN P   +DGE P R V +  F +D   V+N  F+EFV    Y TEA
Sbjct: 29  NMVQLPGGRFQMGTNSPD-GRDGEGPVREVMVKPFAIDIFPVTNKDFREFVREKKYRTEA 87

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE L+S+E R K  Q                 RM      + W        W
Sbjct: 88  EVFGWSFVFEDLVSDELRNKAPQ-----------------RMQS----LLW--------W 118

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  ++  PVVHVSWNDA A C 
Sbjct: 119 ----LPME-------------RAF----WRQPAGPGSGIREKLEFPVVHVSWNDARACCA 157

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++P GN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 158 WRGKRLPTEEEWEFAARGGLKGQVYPGGNKFQP---NRTNLWQGKFPKGDKAEDGFHGVS 214

Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
           PV ++  +N +GLY+++GNVWEWTA                       +V +G S++   
Sbjct: 215 PVNAFPPQNDYGLYDLMGNVWEWTASQSQAADQDM-------------RVLRGASWVDTA 261

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           +     R R   R  NTPDS++  LGFRCA+  G   G
Sbjct: 262 DGSANPRARVTTRMGNTPDSASDTLGFRCASSAGRPPG 299


>gi|404442113|ref|ZP_11007294.1| sulfatase-modifying factor 1 [Mycobacterium vaccae ATCC 25954]
 gi|403657384|gb|EJZ12157.1| sulfatase-modifying factor 1 [Mycobacterium vaccae ATCC 25954]
          Length = 292

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 124/336 (36%), Positives = 171/336 (50%), Gaps = 51/336 (15%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           + ++V LPG +FRMG+   +     E P   V++  F +++H V+N QF EFVS TGYVT
Sbjct: 2   FTELVELPGGSFRMGS---VDFYPEEAPVHTVSVAPFAIERHPVTNAQFAEFVSDTGYVT 58

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AE+  D   F P +   +    + V      F      ++ R           D   + 
Sbjct: 59  VAERELDPADF-PGVPPADLVPGALV------FRPTPGPVDLR-----------DWRQWW 100

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
           TW                        P   W HP G  S++E R++HPVV V++ DA AY
Sbjct: 101 TW-----------------------VPGACWRHPFGPASSLEDRLDHPVVQVAYPDAAAY 137

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G RLP+EA+WEY  R G     + WG+++ P G+  AN WQG FP  N  A G+  
Sbjct: 138 ARWAGRRLPSEAQWEYAARAGT-TAAYAWGDDVRPDGQLMANTWQGRFPYRNEGALGWSG 196

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTDKVK---KG 305
           T+PV ++  N FGL +M+GNVWEWT   +  HH    PA    P  P  G   V    KG
Sbjct: 197 TSPVGAFPPNGFGLLDMIGNVWEWTTTPYTRHHRLDQPAQGCCPTPPPAGDPTVHQALKG 256

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           GS+LC  +YC+R+R AARS  + DS+  ++GFRC A
Sbjct: 257 GSHLCAPEYCHRYRPAARSSQSQDSATTHIGFRCIA 292


>gi|296447678|ref|ZP_06889596.1| protein of unknown function DUF323 [Methylosinus trichosporium
           OB3b]
 gi|296254819|gb|EFH01928.1| protein of unknown function DUF323 [Methylosinus trichosporium
           OB3b]
          Length = 354

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 123/331 (37%), Positives = 165/331 (49%), Gaps = 52/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  + G  F MG+         E P+  V +  F++D+HEV+N QF  FV+ATG+ T   
Sbjct: 53  MSWVAGGAFLMGSED---FYPEERPAHRVFVGGFWIDRHEVTNAQFAAFVAATGWRT--- 106

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                              +++   D K   GL + +        +  + N   +  +W 
Sbjct: 107 -------------------VAETGLDPKDHPGLPAELTAPGSMVFIMPTTNPGRSPESW- 146

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y   G           W HP G  ++I+ +   PVVHV+W DA AY  W
Sbjct: 147 ---------WRY-VHGA---------DWRHPGGPGTSIDGKDAEPVVHVAWRDARAYSDW 187

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWEY  RGGLE R + WG+ + P G + AN WQG FP  + A+DG+   AP
Sbjct: 188 LGRDLPTEAEWEYAARGGLEGRPYAWGSEVRPDGRYMANSWQGAFPLFDEASDGFHGLAP 247

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT--TGTD-----KVKKGGS 307
           V  Y+ N  GL++M GNVWEWT DWW   H    S +P+GPT   G D     +V KGGS
Sbjct: 248 VGCYEPNGRGLFDMTGNVWEWTGDWWAPGHAAEASADPQGPTRNPGGDEALAARVIKGGS 307

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           +LC+  YC R+R AAR  +  D  A +LGFR
Sbjct: 308 WLCSSNYCGRYRPAARQPHEIDLGASHLGFR 338


>gi|222081471|ref|YP_002540834.1| hypothetical protein Arad_7818 [Agrobacterium radiobacter K84]
 gi|221726150|gb|ACM29239.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
          Length = 322

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/354 (36%), Positives = 176/354 (49%), Gaps = 65/354 (18%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
            +R + M  +PG TF MG+N        E P+  V +D F++++  V+N +F  FV ATG
Sbjct: 12  ADRPEGMAWIPGGTFIMGSNSHY---PEEAPAHPVKVDGFWIEETPVTNRKFMAFVKATG 68

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           YVT AEK  D   +   L E  RA  S V    K   G                      
Sbjct: 69  YVTLAEKAPDPEQYPGALPEMLRAG-SLVFTQPKSVSG---------------------- 105

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWND 187
                     P  ++W           +F +G+ W  P G  S +  +++HPVVHV+W D
Sbjct: 106 ----------PNISQW----------WIFKFGANWRRPLGGLSDLRGKLDHPVVHVAWCD 145

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G  LPTEAEWE   RGGL++  + WG+   P G   AN W G FPT++  A 
Sbjct: 146 AKAYADWAGLELPTEAEWELAARGGLDDVEYAWGSEFQPNGVVMANTWAGTFPTHSLKAK 205

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PA----------------PSY 290
           G+  T+PV ++  N++GLY+M+GNVWEWTAD+W+  H  PA                 S+
Sbjct: 206 GHERTSPVRAFPPNRYGLYDMIGNVWEWTADYWSPRHEKPALHACCIPTNPRGGDQEASF 265

Query: 291 NPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +P+ P      +V KGGS+LC   YC R+R AAR     D+S  ++GFRC   K
Sbjct: 266 DPRQPEILIHRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRCVKRK 319


>gi|254461308|ref|ZP_05074724.1| sulfatase-modifying factor 1 [Rhodobacterales bacterium HTCC2083]
 gi|206677897|gb|EDZ42384.1| sulfatase-modifying factor 1 [Rhodobacteraceae bacterium HTCC2083]
          Length = 313

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 167/343 (48%), Gaps = 58/343 (16%)

Query: 3   LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           ++ A  V+ + D V +P      GT  P +  DGE P R+  +  F      V+N QF+ 
Sbjct: 24  IIAARTVDDF-DTVAIPAGRSVYGTATPQIPNDGEGPMRSKRVRGFTCGVTTVTNAQFKA 82

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           F+ ATGYVTEAE FG +FVF   +++                       E      V  +
Sbjct: 83  FIDATGYVTEAEDFGWSFVFYSDIAK-----------------------EIGPTQAVPGV 119

Query: 123 SWNDAVAYCTWRGARLPT--EAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
            W   V    WR    P    A W+                               +HPV
Sbjct: 120 EWWHRVDGANWRDINGPYTYAAAWD------------------------------PDHPV 149

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL-TPRGEHRANVWQGEF 239
           V +SW DA AY  W G RLPTEAEWE+  RGGL +  FPWG+     +G    N+WQG+F
Sbjct: 150 VQMSWRDARAYAAWVGGRLPTEAEWEHAARGGLGDVKFPWGDQEPEEKGYTPCNIWQGDF 209

Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
           P  NT ADGY  TAP  S++ N +GL+NMVGNVWEWT+D ++V               G 
Sbjct: 210 PNTNTCADGYQRTAPARSFEPNTYGLFNMVGNVWEWTSDAYHVQSLKKDVKRRLVQMKGF 269

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            K+ KGGS+LC++ YCYR+R AARS  +PDS+  + GFR   D
Sbjct: 270 -KLSKGGSFLCHKSYCYRYRIAARSGTSPDSATPHHGFRVVWD 311


>gi|332285115|ref|YP_004417026.1| hypothetical protein PT7_1862 [Pusillimonas sp. T7-7]
 gi|330429068|gb|AEC20402.1| hypothetical protein PT7_1862 [Pusillimonas sp. T7-7]
          Length = 381

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 170/350 (48%), Gaps = 59/350 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M  +PGD F MG N   L +  E P+  + +  F++D H+V+N +F+ FV ATGYVT A
Sbjct: 49  NMAWVPGDEFLMG-NSHKLSQPNERPAHQIRVSGFWMDVHDVTNAEFRRFVEATGYVTTA 107

Query: 74  EKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           E+       +P L     R   S +      F G  S +  R               Y  
Sbjct: 108 EQKPLWEDLKPQLPPGTPRPDDSLLVPGAMVFTGTASEVSLR--------------DYSR 153

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W          R  P  +W HP+G  S+I+ + +HPVV VS+ DA AY 
Sbjct: 154 W----------W----------RFIPGANWRHPQGPGSSIQGKDDHPVVQVSYKDAQAYA 193

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADG 248
            W G RLPTEAEWEY  RGGLE   + WGN L+P G+  AN+W    Q  FP        
Sbjct: 194 KWAGKRLPTEAEWEYAARGGLEQATYSWGNELSPDGKPMANIWDTQQQQPFPVVKDEKI- 252

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPK 293
            + T PV S+  N +GL++M GNVW+WTADW+                    PA S++P 
Sbjct: 253 QVGTTPVGSFPANDYGLFDMAGNVWQWTADWYRADAFTIQAESKNMPVDPTGPADSFDPD 312

Query: 294 G---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
               P +   +V +GGS+LC++ YC  +R +AR    P +S  +LGFR  
Sbjct: 313 DGNVPVSAPKRVTRGGSFLCSDTYCISYRTSARRGTDPLNSMSHLGFRTV 362


>gi|408369867|ref|ZP_11167647.1| sulfatase-modifying factor protein [Galbibacter sp. ck-I2-15]
 gi|407744921|gb|EKF56488.1| sulfatase-modifying factor protein [Galbibacter sp. ck-I2-15]
          Length = 376

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 121/350 (34%), Positives = 179/350 (51%), Gaps = 49/350 (14%)

Query: 15  MVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV + G  F  G  +      D E P+  V +D F++D  EV+N QF EFV  TGY+T A
Sbjct: 59  MVWIAGGHFTQGARQGDSFALDHEKPAHPVAVDGFFMDITEVTNDQFSEFVKQTGYITIA 118

Query: 74  EKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           E+  D    +  L E   +   S ++     F+  ++++ +            D   +  
Sbjct: 119 EREVDWEELKKQLPEGTPKPHDSILQPGSLVFKKTETSVPNLY----------DFSQWWV 168

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W+                          +W HP+G  S+IE + N+PVVH+++ DAVAYC
Sbjct: 169 WKIG-----------------------ANWKHPKGPGSSIEGKGNYPVVHIAFPDAVAYC 205

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G +LPTEA+WE     G+ N+ F WG+ L  + + +AN W GEFPT N   DG+ + 
Sbjct: 206 NWAGRKLPTEAQWELAASAGINNKPFTWGDKLD-QLDVKANTWTGEFPTQNDQVDGFENK 264

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYNPKG------PTT--GT 299
           APV S+  N +GLY+M GNVWEWT+DW+++ +     H   ++NP+G      PT     
Sbjct: 265 APVKSFPPNPYGLYDMAGNVWEWTSDWYHIDYYKQLLHEGLAHNPEGAQKAFNPTNPYAQ 324

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
           +K+ KGGS+LC++ YC  +R +AR     DS   +LGFR         GT
Sbjct: 325 EKIIKGGSFLCSDTYCASYRISARMATAIDSGQEHLGFRTIVTPEMLEGT 374


>gi|115358015|ref|YP_775153.1| hypothetical protein Bamb_3264 [Burkholderia ambifaria AMMD]
 gi|115283303|gb|ABI88819.1| protein of unknown function DUF323 [Burkholderia ambifaria AMMD]
          Length = 330

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 123/341 (36%), Positives = 167/341 (48%), Gaps = 54/341 (15%)

Query: 3   LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           L PA      ++++   G    +GT+ P++  DGE P R V L  F ++++ V+N +F E
Sbjct: 39  LAPANARLPAENIIEFDGGPSWVGTDAPVIKADGEGPRRKVHLKPFGIERYAVTNDRFTE 98

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV+ATGY T+AE+FG ++VF                         D+ +E          
Sbjct: 99  FVAATGYRTDAERFGWSYVF-------------------------DAFLE---------- 123

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
              D VA         P    W          R  P   W  PEG  S+++ R  HPV+H
Sbjct: 124 --PDCVA-------EAPANVPWW---------RSVPGAYWAAPEGPLSSLDGRGMHPVIH 165

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           +SWNDA A+  W G RLPTEAEWE+  RGG + R FPWG+          N+WQG FP  
Sbjct: 166 ISWNDAAAFARWCGGRLPTEAEWEHAARGGADARRFPWGDEEPVEDALCCNIWQGRFPDR 225

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDK 301
            +  D    T  V ++  N  GLYN  GNVWEW  D ++V     A     +   +  + 
Sbjct: 226 YSGGDRAFGTVRVDAFAANPAGLYNCSGNVWEWCVDPFHVRSLSSAGKARDRQAVSEREY 285

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             KGGSYLC+  YCYR+R AAR+    D+SAG+ GFR A D
Sbjct: 286 TLKGGSYLCHRSYCYRYRIAARTGRPTDTSAGHTGFRVAYD 326


>gi|300771046|ref|ZP_07080923.1| sulfatase-modifying factor 1 [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300762319|gb|EFK59138.1| sulfatase-modifying factor 1 [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 363

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 120/339 (35%), Positives = 173/339 (51%), Gaps = 57/339 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G ++ MG+      +D +     V + +F++D+HEV+N QF +FV ATGY+T AE
Sbjct: 69  MVKIEGGSYEMGSAD---FEDAK-QVHKVNIKSFWMDEHEVTNAQFAKFVQATGYLTVAE 124

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D   F P +  E     S V H  K   GL+  ++                    W 
Sbjct: 125 RPLDPRDF-PGVDPEMLKPGSAVFHAPKEVRGLNDPLQ--------------------W- 162

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y              +W HPEG +S I+ + N+PVVH+++ DA AY  W
Sbjct: 163 ---------WSYVVGA----------NWKHPEGPESNIKGKENYPVVHIAYQDAEAYAKW 203

Query: 195 RGARLPTEAEWEYGCR-GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
            G RLPTEAEWEY  +     N  + WG   T  G+  AN++QGEFP  +T  DGY+  A
Sbjct: 204 AGKRLPTEAEWEYAAKTANYNNEKYYWGTEKTENGKWAANIYQGEFPLKDTGEDGYVGAA 263

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----------PAPSYNPKGPTTGTDKVK 303
           PV SY  N  GLY+M GNVWEW +D++   ++          P  SY+P+ P     +V+
Sbjct: 264 PVKSYPANTSGLYDMEGNVWEWCSDFYRPDYYKNSSESNPTGPIDSYDPQEPGA-VKRVQ 322

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +GGS+LCN+ YC R++  +R +   +S   N+GFR   D
Sbjct: 323 RGGSFLCNDMYCERYKAGSRGKGEVNSPTNNVGFRLVKD 361


>gi|378764400|ref|YP_005193016.1| putative sulfatase-modifying factor [Sinorhizobium fredii HH103]
 gi|365184028|emb|CCF00877.1| putative sulfatase-modifying factor [Sinorhizobium fredii HH103]
          Length = 329

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 130/342 (38%), Positives = 170/342 (49%), Gaps = 60/342 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           + V +PG TF MG+++       E P+  V +  F++D   V+N +F  FV ATGYVT A
Sbjct: 23  ETVWIPGGTFLMGSDRHY---PEEAPAHKVKVTGFWMDVCTVTNREFAAFVEATGYVTLA 79

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   + P    +  A  S V      F    + ++ R H+     +W         
Sbjct: 80  ERPVDPKDY-PGAKPDMLAPASVV------FSKPRTRVDLRNHY-----NW--------- 118

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y  RG          +W HP G  S+IE   +HPVVHV++ DA AY T
Sbjct: 119 ----------WAY-LRGA---------NWRHPHGPGSSIERLHDHPVVHVAYEDAEAYAT 158

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W    LPTEAEWE+  RGGL+   + WG+   P G   AN WQGEFP  N   DGY +TA
Sbjct: 159 WARKELPTEAEWEFAARGGLDAAEYVWGDEWAPGGRPMANTWQGEFPLQNLCEDGYETTA 218

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKGPTTG 298
           PV SY  N +GLY+M GNVW+WT+DW+  H                  A S +P  P   
Sbjct: 219 PVASYPANGYGLYDMAGNVWQWTSDWYQDHGKIDSPCCTISNPRGGDQADSIDPLQPHLK 278

Query: 299 TD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
              +V KGGSYLC   YC R+R AAR     D+S  ++GFRC
Sbjct: 279 IPRRVMKGGSYLCAANYCRRYRPAARMAQPVDTSTCHVGFRC 320


>gi|441204861|ref|ZP_20972317.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis MKD8]
 gi|440629327|gb|ELQ91117.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis MKD8]
          Length = 296

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 125/335 (37%), Positives = 174/335 (51%), Gaps = 54/335 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +PG +FRMG+         E P    T++ F +++H V+N QF EF+ ATGY T A
Sbjct: 4   ELVEVPGGSFRMGSTS---FYPEEAPVHTATVEDFAIERHPVTNAQFAEFIEATGYATVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++ P + E +    S V      F      ++ R                  W
Sbjct: 61  ERPLDPKLY-PGVPEADLVPGSLV------FRPTSGPVDLR-----------------DW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G       R +HPVV V++ DAVAY T
Sbjct: 97  R-------QWWDWA----------PGACWHHPFGPRREF-CRPDHPVVQVAYPDAVAYAT 138

Query: 194 WRGARLPTEAEWEYGCRGGLENR---LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           W G RLPTEAEWEY  RGG +     L+ WG+ + P G+  AN WQG+FP  N  A G+ 
Sbjct: 139 WAGRRLPTEAEWEYAARGGPKGGVGFLYAWGDEVCPDGQLMANTWQGKFPYRNDDALGWK 198

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKK 304
            T+PV ++  N+ GL +M+GNVWEWTA  ++ HH P    +      P G   G ++V K
Sbjct: 199 GTSPVGTFPPNRLGLVDMIGNVWEWTATKFSAHHRPGDESHVTCCPPPSGGDPGVNQVLK 258

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           GGS+LC  +YC+R+R AARS  + DSS  ++GFRC
Sbjct: 259 GGSHLCAPEYCHRYRPAARSPQSQDSSTTHIGFRC 293


>gi|433640835|ref|YP_007286594.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
           140070008]
 gi|432157383|emb|CCK54661.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
           140070008]
          Length = 299

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 130/337 (38%), Positives = 168/337 (49%), Gaps = 53/337 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+ +       E P   VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +PL              D   + G+D+         +V       V    W
Sbjct: 61  E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G DS I  R  HPVV V++ DAVAY  
Sbjct: 97  R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG     + WG+   P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
           PV  +  N FGL +M+GNVWEWT   +   H   P       P    T  D    +  KG
Sbjct: 199 PVGRFPANGFGLLDMIGNVWEWTTTEFYPQHRIDPPSTACCAPVKLATAADPTISQTLKG 258

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GS+LC  +YC+R+R AARS  + D++  ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295


>gi|108757085|ref|YP_631849.1| hypothetical protein MXAN_3662 [Myxococcus xanthus DK 1622]
 gi|108460965|gb|ABF86150.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
          Length = 334

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 125/346 (36%), Positives = 167/346 (48%), Gaps = 63/346 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF MG++        E P+  VT+  F++D   V+N  F  FV ATGYVT AE
Sbjct: 31  MVYIPGGTFWMGSDHHY---PEERPTHQVTVTDFWMDSQLVTNADFARFVEATGYVTVAE 87

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +        PL  ++              F G +  +          +     V   T  
Sbjct: 88  R--------PLNPDD--------------FPGANPAL----------LVPGSLVFKKTLA 115

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L    +W +   G           W +PEG  ST + R  +PVVHV + DA+AY  W
Sbjct: 116 RVDLRDLTQWWFYTAGA---------CWKYPEGAGSTWQGREQYPVVHVCFEDALAYALW 166

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWEY  RGGL+ +++ WG+   P G+  AN WQGEFP  N   DG+  T+P
Sbjct: 167 AGKTLPTEAEWEYAARGGLDRKVYVWGDEFAPGGKLMANTWQGEFPWQNLNTDGFEGTSP 226

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTG---------- 298
           V ++  N +GL++M GNVWEWT+DW+   H            NP+GP T           
Sbjct: 227 VGNFPPNAYGLFDMAGNVWEWTSDWYQERHEGNGGKPCCIPVNPRGPATSDRSLDPCLPK 286

Query: 299 ---TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                +V KGGS+LC   YC R+R AARS    DS   ++GFRC  
Sbjct: 287 VKIPRRVLKGGSHLCAPNYCLRYRPAARSPQAVDSGTSHIGFRCVV 332


>gi|34783013|gb|AAH17005.2| SUMF1 protein, partial [Homo sapiens]
          Length = 132

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 88/130 (67%), Positives = 104/130 (80%)

Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
           LFPWGN L P+G+H AN+WQGEFP  NT  DG+  TAPV ++  N +GLYN+VGN WEWT
Sbjct: 1   LFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWT 60

Query: 277 ADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +DWW VHH    + NPKGP +G D+VKKGGSY+C+  YCYR+RCAARSQNTPDSSA NLG
Sbjct: 61  SDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLG 120

Query: 337 FRCAADKGPT 346
           FRCAAD+ PT
Sbjct: 121 FRCAADRLPT 130


>gi|414164694|ref|ZP_11420941.1| hypothetical protein HMPREF9697_02842 [Afipia felis ATCC 53690]
 gi|410882474|gb|EKS30314.1| hypothetical protein HMPREF9697_02842 [Afipia felis ATCC 53690]
          Length = 320

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 125/345 (36%), Positives = 166/345 (48%), Gaps = 62/345 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DM+ L G  + MG+++       E P   V + +F +D+  V+N QF+ FV  TGY+T A
Sbjct: 16  DMIRLTGGVYVMGSDRHY---PEEAPEHRVKVGSFLIDRTPVTNRQFKNFVRQTGYITFA 72

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   +  +L             DM     L  T      H     SW         
Sbjct: 73  ERMPDPKDYPGILP------------DMMFAGSLVFTPPASSSH---RHSW--------- 108

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                   A+W    +G          +W  P G  S I    +HPVVH+++ DA+AY  
Sbjct: 109 --------AQWWTLMKGA---------NWRRPYGPKSNINTLDSHPVVHITFADALAYAQ 151

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGLEN  F WGN L P G + AN WQGEFP  N A DGY  T+
Sbjct: 152 WAGKDLPTEAEWEFAARGGLENADFAWGNELAPGGTYLANTWQGEFPHENLALDGYARTS 211

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY------NPKGP------------ 295
           PV ++  N +G+++M+GNVWEWT DW+   H            NP+G             
Sbjct: 212 PVTAFPPNGYGIHDMIGNVWEWTKDWYGSRHQADAQKACCIPENPRGAREEDSFDAGDRN 271

Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           T    +V KGGS+LC   YC R+R AAR     D+S  ++GFRC 
Sbjct: 272 TRIPRRVVKGGSHLCAPNYCRRYRPAARQAQPIDTSTSHVGFRCV 316


>gi|33601704|ref|NP_889264.1| hypothetical protein BB2727 [Bordetella bronchiseptica RB50]
 gi|33576141|emb|CAE33220.1| putative exported protein [Bordetella bronchiseptica RB50]
          Length = 380

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 165/348 (47%), Gaps = 59/348 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG N   L +  E P+  V +  F++D H+V+N QF+ FV ATGY+T AE
Sbjct: 49  MAWIPGHQFLMG-NDHKLSQPNERPAHQVRVSGFWMDVHDVTNAQFRRFVQATGYLTTAE 107

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +        P L     R   SQ+      F G  S +  R               Y  W
Sbjct: 108 RKPRWEDLRPQLPPGTPRPDDSQLAPGAMVFVGTQSEVSLR--------------DYSRW 153

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P  SW HP+G  S I+ + +HPVV VS+ DA AY  
Sbjct: 154 ----------W----------RFVPGASWRHPQGPGSGIDGKDDHPVVQVSYEDAQAYAR 193

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG+ L P+G+  AN+W    Q  FP         
Sbjct: 194 WAGKRLPTEAEWEFAARGGLEQATYTWGDELLPQGQAMANIWDTRQQQPFPVVKD-EKVQ 252

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
           + T PV S+  N +GLY+M GNVW+WTADW+          +  P  +P GP    D   
Sbjct: 253 VGTTPVGSFAPNGYGLYDMAGNVWQWTADWYRADAFRIQAQYREPPLDPAGPADSFDPDD 312

Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                    +V +GGS+LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 313 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 360


>gi|325287393|ref|YP_004263183.1| sulfatase-modifying factor protein [Cellulophaga lytica DSM 7489]
 gi|324322847|gb|ADY30312.1| Sulphatase-modifying factor protein [Cellulophaga lytica DSM 7489]
          Length = 380

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 127/358 (35%), Positives = 182/358 (50%), Gaps = 54/358 (15%)

Query: 2   VLLPAPPVERYKD-MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
           +L+  P   +  D MV +PG  F  G   +  +    E P+  V +D F++D  EV+N Q
Sbjct: 53  LLVKKPKDVKVPDGMVWIPGGEFIQGAVPQDKMAMQHEKPAHKVAVDGFFMDITEVTNAQ 112

Query: 60  FQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVR---HDMKRFEGLDSTIEHRMH 116
           F +FV+ TGY T AE+  D         EE + ++ +     HD     G  +  + +  
Sbjct: 113 FAKFVNQTGYTTVAERAIDW--------EEMKKQVPEGTPKPHDSILQPGSLTFKKTKTS 164

Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
            P       +   +  W          W  G             +W HP G  S+I+   
Sbjct: 165 VP-------NLFDFSQWW--------HWTIG------------ANWKHPNGPGSSIKGLD 197

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           NHPVV + + DA+AYC W   RLPTEAEWEY   GG    +F WG++ +   ++ AN W+
Sbjct: 198 NHPVVQICYEDAMAYCNWANRRLPTEAEWEYAANGGKSGTIFFWGDDRSVLSKN-ANSWE 256

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP--APSYNPKG 294
           GEFP  N+  DG+   APV SY+ N +GLY+M GNVWEWT+DW+NV+++   A +   K 
Sbjct: 257 GEFPVENSKKDGFEKRAPVKSYQPNGYGLYDMAGNVWEWTSDWYNVNYYKELAATTTKKN 316

Query: 295 PTTGTD-----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           P   TD           KV KGGS+LC++ YC  +R +AR  ++ DSSA + GFR  A
Sbjct: 317 PLGATDAYNPQNLYIKEKVIKGGSFLCSDSYCASYRISARMGSSVDSSAEHTGFRTVA 374


>gi|168700488|ref|ZP_02732765.1| hypothetical protein GobsU_13232 [Gemmata obscuriglobus UQM 2246]
          Length = 283

 Score =  198 bits (503), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 115/316 (36%), Positives = 166/316 (52%), Gaps = 54/316 (17%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P   V + AF +D  EV+  QF  FV ATGYVT AE+                     
Sbjct: 10  EQPLHEVVIAAFEMDATEVTVGQFAAFVKATGYVTTAEQ--------------------- 48

Query: 97  VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
              D +R+   D    +      V +  +  +    W          W Y   GG     
Sbjct: 49  -APDPRRYPSSDPA--YLKPGSAVFVPMDVPLDIPVW----------WRY--VGG----- 88

Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
               +W +PEG  S+++ + N+PVV ++W+DA AY  W G RLPTEAEWE+  RGGL+ +
Sbjct: 89  ---ANWRYPEGPQSSVKGKKNYPVVQITWDDATAYAKWAGKRLPTEAEWEFAARGGLDRK 145

Query: 217 LFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
            + WG++     G+  ANV+QG+FP  +  +DG+   +PV S++ N +GLY+M GN WEW
Sbjct: 146 TYVWGDHKNGADGKWYANVYQGQFPARDAGSDGFAGLSPVKSFQPNGYGLYDMSGNAWEW 205

Query: 276 TADWWNVHHHP-APSYNPKGPTTG--------TDKVKKGGSYLCNEQYCYRHRCAARSQN 326
            ADW++  ++  +P  NP+GP TG          KV++GGS+LC++ YC R+   AR +N
Sbjct: 206 CADWYDPGYYARSPKENPRGPDTGPLVGGERQPQKVRRGGSFLCDDSYCSRYVPGARDKN 265

Query: 327 TPDSSAGNLGFRCAAD 342
             DSSA + GFRC  D
Sbjct: 266 PTDSSANHTGFRCVRD 281


>gi|86143176|ref|ZP_01061578.1| hypothetical protein MED217_08340 [Leeuwenhoekiella blandensis
           MED217]
 gi|85830081|gb|EAQ48541.1| hypothetical protein MED217_08340 [Leeuwenhoekiella blandensis
           MED217]
          Length = 373

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 123/338 (36%), Positives = 171/338 (50%), Gaps = 58/338 (17%)

Query: 19  PGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGD 78
           PGD+F +   KP         +  V +D F++D  EV+N QF+ FV ATGYVT AE+   
Sbjct: 73  PGDSFALPREKP---------AHPVAVDGFFIDITEVTNAQFKAFVEATGYVTLAER--- 120

Query: 79  TFVFEPLLSEEERAKISQ-VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
                P+  EE +  +               S I  R    V  +  ND   Y  W    
Sbjct: 121 -----PVDWEELKKNVPPGTPKPADSVLQPGSLIFDRSVQNVRDL--ND---YTQWW--- 167

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
                EW+ G             +W HP+G  S+IE + ++PVVH+S+ DA+AYC W   
Sbjct: 168 -----EWKIGA------------NWKHPQGPGSSIEGKDDYPVVHISYEDALAYCKWAKR 210

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
           RLPTE EWE    G +   ++ WG++ +     +AN WQG FPT NT  DG+   APV S
Sbjct: 211 RLPTETEWEAAAHGQMPGGVYTWGDDESAL-PTKANTWQGTFPTKNTNEDGFAYVAPVAS 269

Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHH-------------PAPSYNPKGPTTGTDKVKK 304
           +  N  G+Y++ GNVWE T DW++  ++             PA SYNP+ P    +KV K
Sbjct: 270 FDPNSIGIYDLTGNVWELTQDWFSTRYYQEIAGNTTVNPQGPAASYNPENPYQA-EKVIK 328

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GGS+LC++ YC  +R +AR    PDS+  + GFR  AD
Sbjct: 329 GGSFLCDDSYCASYRISARMGQAPDSATDHSGFRTVAD 366


>gi|412337871|ref|YP_006966626.1| hypothetical protein BN112_0543 [Bordetella bronchiseptica 253]
 gi|408767705|emb|CCJ52461.1| putative exported protein [Bordetella bronchiseptica 253]
          Length = 380

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 165/348 (47%), Gaps = 59/348 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG N   L +  E P+  V +  F++D H+V+N QF+ FV ATGY+T AE
Sbjct: 49  MAWIPGHQFLMG-NDHKLSQPNERPAHQVRVSGFWMDVHDVTNAQFRRFVQATGYLTTAE 107

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +        P L     R   SQ+      F G  S +  R               Y  W
Sbjct: 108 RKPRWEDLRPQLPPGTPRPDDSQLAPGAMVFVGTRSEVSLR--------------DYSRW 153

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P  SW HP+G  S I+ + +HPVV VS+ DA AY  
Sbjct: 154 ----------W----------RFVPGASWRHPQGPGSGIDGKDDHPVVQVSYEDAQAYAR 193

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG+ L P+G+  AN+W    Q  FP         
Sbjct: 194 WAGKRLPTEAEWEFAARGGLEQATYTWGDELLPQGQAMANIWDTRQQQPFPVVKD-EKVQ 252

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
           + T PV S+  N +GLY+M GNVW+WTADW+          +  P  +P GP    D   
Sbjct: 253 VGTTPVGSFAPNGYGLYDMAGNVWQWTADWYRADAFRIQAQYREPPLDPAGPADSFDPDD 312

Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                    +V +GGS+LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 313 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 360


>gi|295690082|ref|YP_003593775.1| hypothetical protein Cseg_2708 [Caulobacter segnis ATCC 21756]
 gi|295431985|gb|ADG11157.1| protein of unknown function DUF323 [Caulobacter segnis ATCC 21756]
          Length = 342

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 124/347 (35%), Positives = 165/347 (47%), Gaps = 57/347 (16%)

Query: 4   LPAP-PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           LP P P  R   MV + G  F MG      ++  E P   V +  F++D+ EV+N  F  
Sbjct: 35  LPIPDPAHRTAGMVRVAGGDFLMGGAP---LRSEEGPPETVRVAPFWIDRTEVTNAAFAR 91

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV+ATGY T AE+  D     PL + + R                               
Sbjct: 92  FVAATGYRTVAERPLDPARDAPLSAAQRRPA----------------------------- 122

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
               ++ +   +GAR    ++W          R+ P   W  PEG  S I  +   PVVH
Sbjct: 123 ----SLVFVGEKGARSDDPSQWW---------RVIPGADWRRPEGPGSNIRGKEAWPVVH 169

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           ++W DA+AY  W G  LPTEAEWEY  RGGL  + + WG+      + RAN WQG FP  
Sbjct: 170 IAWEDAMAYARWLGRDLPTEAEWEYAARGGLVAKRYVWGDQSQTPKQPRANTWQGVFPAQ 229

Query: 243 NTAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT----- 296
           +   DG+ +  APV  +  N +GL +M GNVWEWT DW+     PA      GP      
Sbjct: 230 DLGDDGFKARPAPVGCFAPNGYGLSDMAGNVWEWTKDWFKPGLEPASVIEMGGPPEIRAL 289

Query: 297 ----TGTDK-VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                G  K V KGGS+LC + YC+R+R AAR+   PDS A ++GFR
Sbjct: 290 DPDEPGVAKHVIKGGSFLCTDDYCFRYRPAARTPGPPDSGASHIGFR 336


>gi|33597322|ref|NP_884965.1| hypothetical protein BPP2758 [Bordetella parapertussis 12822]
 gi|33573749|emb|CAE38051.1| putative exported protein [Bordetella parapertussis]
          Length = 380

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 165/348 (47%), Gaps = 59/348 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG N   L +  E P+  V +  F++D H+V+N QF+ FV ATGY+T AE
Sbjct: 49  MAWIPGHQFLMG-NDHKLSQPNERPAHQVRVSGFWMDVHDVTNAQFRRFVQATGYLTTAE 107

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +        P L     R   SQ+      F G  S +  R               Y  W
Sbjct: 108 RKPRWEDLRPQLPPGTPRPDDSQLAPGAMVFVGTRSEVSLR--------------DYSRW 153

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P  SW HP+G  S I+ + +HPVV VS+ DA AY  
Sbjct: 154 ----------W----------RFVPGASWRHPQGPGSGIDGKDDHPVVQVSYEDAQAYAR 193

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG+ L P+G+  AN+W    Q  FP         
Sbjct: 194 WAGKRLPTEAEWEFAARGGLEQATYTWGDELLPQGQAMANIWDTRQQQPFPVVKD-EKVQ 252

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
           + T PV S+  N +GLY+M GNVW+WTADW+          +  P  +P GP    D   
Sbjct: 253 VGTTPVGSFAPNGYGLYDMAGNVWQWTADWYRADAFRIQAQYREPPLDPAGPADSFDPDD 312

Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                    +V +GGS+LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 313 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 360


>gi|383827540|ref|ZP_09982634.1| hypothetical protein MXEN_21717 [Mycobacterium xenopi RIVM700367]
 gi|383330240|gb|EID08772.1| hypothetical protein MXEN_21717 [Mycobacterium xenopi RIVM700367]
          Length = 294

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 122/334 (36%), Positives = 169/334 (50%), Gaps = 50/334 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG  FRMG+         E P     +  F +++H V+N QF EFV+ATGY+T A
Sbjct: 4   ELVELPGGAFRMGSTS---FYPEEAPIHTAAVAPFAVERHPVTNVQFAEFVAATGYITVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +P+              D  ++ G +   +  +   +V       V    W
Sbjct: 61  E--------QPI--------------DPAQYPGANP--DDLVPGALVFRPTPGPVDLQDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R        +W +   G           W HP G  S +  R +HPVV V++ DA AY  
Sbjct: 97  R--------QWWHWVPGAC---------WRHPFGPGSDVTDRADHPVVQVAYPDAAAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLP+EAEWEY  RGG     + WG+   P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPSEAEWEYAARGGTTT-TYVWGDEEKPNGRLMANTWQGRFPYRNDGALGWIGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTG-TDKVKKGGSY 308
           PV ++  N FGL +M+GNVWEWTA  ++ HH     P P   P GP     ++  KGGS+
Sbjct: 199 PVGTFPPNGFGLLDMIGNVWEWTATEFSSHHRLDQPPKPCCTPSGPADPRVNQTLKGGSH 258

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           LC  +YC+R+R AARS  + D++  ++GFRC AD
Sbjct: 259 LCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 292


>gi|325974321|dbj|BAJ83908.1| sulfatase modifying factor 2 [Hemicentrotus pulcherrimus]
          Length = 379

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 124/345 (35%), Positives = 177/345 (51%), Gaps = 62/345 (17%)

Query: 4   LPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           +PA PV ++Y++MV LPG  F MG++K    KDGE   R VT+  F ++++ ++   F++
Sbjct: 32  VPADPVYDKYEEMVALPGGKFTMGSSKSDG-KDGESVQRKVTVKPFKINKYPITTASFRK 90

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV A  Y T+AEK+  +F FE  + E                  + +TIE R+       
Sbjct: 91  FVRAKKYKTDAEKYTWSFAFEAFVPE-----------------AIKNTIEQRLPG----A 129

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
            W   V    WR                              PEG  S IE+R+N+P VH
Sbjct: 130 PWWIPVQNAYWR-----------------------------QPEGKGSGIENRLNYPAVH 160

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           +S  DA A+C W G RLPTE EWE+  RGGLE + +PWG       ++R N+WQG+FP  
Sbjct: 161 ISNVDARAFCQWHGWRLPTEPEWEFAARGGLEGKEYPWGKKFE---KNRMNIWQGKFPEE 217

Query: 243 NTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNV---HHHPAPSYNPKGPTTG 298
           NT ADGY    PV ++  +N + LY+M+GNVWEWT+  ++        APS   + P+  
Sbjct: 218 NTKADGYHGVGPVDAFPPQNDYDLYDMIGNVWEWTSTKFSATPSAKDDAPSPEEENPSQ- 276

Query: 299 TDKVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAA 341
              V +GGSY+ ++   + H  R   R  N  D+ + NLGFRCA 
Sbjct: 277 VRFVLRGGSYIDSKDGKFNHATRVTTRMGNEADAGSDNLGFRCAV 321


>gi|410473130|ref|YP_006896411.1| hypothetical protein BN117_2528 [Bordetella parapertussis Bpp5]
 gi|408443240|emb|CCJ49861.1| putative exported protein [Bordetella parapertussis Bpp5]
          Length = 380

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 165/348 (47%), Gaps = 59/348 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG N   L +  E P+  V +  F++D H+V+N QF+ FV ATGY+T AE
Sbjct: 49  MAWIPGHQFLMG-NDHKLSQPNERPAHQVRVSGFWMDVHDVTNAQFRRFVQATGYLTTAE 107

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +        P L     R   SQ+      F G  S +  R               Y  W
Sbjct: 108 RKPRWEDLRPQLPPGTPRPDDSQLAPGAMVFVGTQSEVSLR--------------DYSRW 153

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P  SW HP+G  S I+ + +HPVV VS+ DA AY  
Sbjct: 154 ----------W----------RFVPGTSWRHPQGPGSGIDGKDDHPVVQVSYEDAQAYAR 193

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WG+ L P+G+  AN+W    Q  FP         
Sbjct: 194 WAGKRLPTEAEWEFAARGGLEQATYTWGDELLPQGQAMANIWDTRQQQPFPVVKD-EKVQ 252

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
           + T PV S+  N +GLY+M GNVW+WTADW+          +  P  +P GP    D   
Sbjct: 253 VGTTPVGSFAPNGYGLYDMAGNVWQWTADWYRADAFRIQAQYREPPLDPAGPADSFDPDD 312

Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                    +V +GGS+LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 313 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 360


>gi|88809095|ref|ZP_01124604.1| hypothetical protein WH7805_05366 [Synechococcus sp. WH 7805]
 gi|88787037|gb|EAR18195.1| hypothetical protein WH7805_05366 [Synechococcus sp. WH 7805]
          Length = 284

 Score =  197 bits (500), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 120/329 (36%), Positives = 169/329 (51%), Gaps = 55/329 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV++P  T+R+G+         E P R++ L +F +D   V+N +F  FV+ TGY T +E
Sbjct: 10  MVVIPAGTYRVGSES---FYPEEAPIRSIQLQSFVIDVAPVTNAEFARFVADTGYQTVSE 66

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K  D  ++ P L  E++   S V      F    ++++         +SW   V    WR
Sbjct: 67  KPPDPELY-PNLPPEQQCPESAV------FTPPPASVDRSQP-----LSWWALVEGADWR 114

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                        HP+G  +TI++ M HPVVH+++ DA+AY  W
Sbjct: 115 -----------------------------HPQGPSTTIDNLMEHPVVHLAYEDALAYAEW 145

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPT  EWE   RGGL    + WG+ +TP G+  ANVWQG FP  N   DG+  T+P
Sbjct: 146 AGKRLPTADEWEVAARGGLVGHEYAWGSEMTPEGQWLANVWQGPFPWINEQTDGWFWTSP 205

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V S+  N +GL +M GNVWEWT+  + V         PKG      ++ KGGS+LC E Y
Sbjct: 206 VGSFPANGYGLVDMCGNVWEWTSTLFPV---------PKG--EQERRIIKGGSFLCAENY 254

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           C+R R AA    T D++  ++GFRCAAD 
Sbjct: 255 CHRFRPAALMGQTTDTATCHMGFRCAADS 283



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 30/42 (71%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           ++ KGGS+LC E YC+R R AA    T D++  ++GFRCAAD
Sbjct: 241 RIIKGGSFLCAENYCHRFRPAALMGQTTDTATCHMGFRCAAD 282


>gi|293605445|ref|ZP_06687827.1| sulfatase-modifying factor 1 [Achromobacter piechaudii ATCC 43553]
 gi|292816173|gb|EFF75272.1| sulfatase-modifying factor 1 [Achromobacter piechaudii ATCC 43553]
          Length = 381

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 167/348 (47%), Gaps = 59/348 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG N+  L +  E P+  V +  F++D+H+V+N +F+ FV ATGYVT AE
Sbjct: 50  MAWIPGSEFLMG-NEHKLSQPNERPAHKVRVSGFWMDRHDVTNAEFRRFVQATGYVTTAE 108

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +       +P L     R   S +      F G  S +  R               Y  W
Sbjct: 109 QKPRWEDLQPQLPPGTPRPDDSVLVPGAMVFIGTASEVSLR--------------DYSRW 154

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P  +W HPEG  S++E +  HPVV VS+ DA AY  
Sbjct: 155 ----------W----------RFVPGANWRHPEGPGSSVEGKDEHPVVQVSYEDAQAYAK 194

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
           W G RLPTEAEWE+  RGGL    + WGN L+P+G+  AN+W    Q  FP         
Sbjct: 195 WAGKRLPTEAEWEFAARGGLAQATYSWGNELSPQGQPMANIWDTQQQQPFPVVKDEKI-Q 253

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
           + T PV S+  N +GLY+M GNVW+WTADW+          +  P  +P GP    D   
Sbjct: 254 VGTTPVGSFPANGYGLYDMAGNVWQWTADWYRADAFVVQAQYRQPPTDPAGPADSFDPDD 313

Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                    +V +GGS+LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 314 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNGMSHLGFRT 361


>gi|363421081|ref|ZP_09309170.1| hypothetical protein AK37_10411 [Rhodococcus pyridinivorans AK37]
 gi|359734816|gb|EHK83784.1| hypothetical protein AK37_10411 [Rhodococcus pyridinivorans AK37]
          Length = 311

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 126/349 (36%), Positives = 169/349 (48%), Gaps = 70/349 (20%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           ++M L+ G  F MG+         E P   VT+D F++D H V+  +F+ FV  TGYVT 
Sbjct: 7   QNMTLIAGGMFWMGSED---FYPEERPVHQVTVDGFWMDTHPVTVAEFRHFVKDTGYVTT 63

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTI----EHRMHHPVVHISWNDAV 128
           AE                      V  D   + G D ++          P   +  +D  
Sbjct: 64  AE----------------------VAPDPADYPGADPSLLAPGSLVFTPPPGPVPLDD-- 99

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
            Y  W          W +           P  +W HPEG  ST+  R  HPV HVSW DA
Sbjct: 100 -YTRW----------WSF----------VPGANWRHPEGTGSTVGGRERHPVTHVSWFDA 138

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTEAEWE+  RGGL+ + F WG+   P+G    NVWQGEFP  N   DG
Sbjct: 139 RAYAEWAGKELPTEAEWEFAARGGLDRKAFTWGDEHEPKGRPGGNVWQGEFPWQNLETDG 198

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-------HPAPS----YNPKGPTT 297
           +  T+PV  ++ N +GL++  GNVWEWTAD++  +H        PA S     NP+  T 
Sbjct: 199 FAGTSPVGHFRPNGYGLHDTAGNVWEWTADYFTANHASSGKNVAPASSCCIPTNPRVDTA 258

Query: 298 GTD-------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
             D       +V KGGS+LC   YC R+R AAR  +T +++  ++GFRC
Sbjct: 259 EHDPREAYPRRVIKGGSHLCAPNYCLRYRPAARQGDTEETATCHIGFRC 307


>gi|334319798|ref|YP_004556427.1| sulfatase-modifying factor protein [Sinorhizobium meliloti AK83]
 gi|334097537|gb|AEG55547.1| Sulphatase-modifying factor protein [Sinorhizobium meliloti AK83]
          Length = 326

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 126/346 (36%), Positives = 168/346 (48%), Gaps = 61/346 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           VE   +MV + G TF MG+N        E P   VT+D F++D   V+N QF EFV+ATG
Sbjct: 11  VEPAAEMVWVRGATFMMGSND---HYPEEAPVHPVTVDGFWIDDAPVTNRQFLEFVNATG 67

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           +VT AE+       +P   +   A  S +R     F      ++                
Sbjct: 68  HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 104

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                 G  +   ++W     G          +W  P G  S+I   ++HPVVHV+++DA
Sbjct: 105 ------GTEI---SQWWIFALGA---------NWRQPLGRRSSIGAILDHPVVHVAYSDA 146

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTEAEWE   RGGLE   F WG+ L P G H AN WQG FP  N+  DG
Sbjct: 147 KAYAEWAGKDLPTEAEWELAARGGLEGAEFAWGDELVPGGNHVANTWQGSFPVANSIDDG 206

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
           +  T+PV  Y  N +GLY+M+GNVWEWT D+W+V H  A                  S +
Sbjct: 207 FGRTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 337
           P        +V KGGS+LC   YC R+R AAR     D++  ++GF
Sbjct: 267 PMARVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGF 312


>gi|407723961|ref|YP_006843622.1| sulfatase-modifying factor protein [Sinorhizobium meliloti Rm41]
 gi|407324021|emb|CCM72622.1| sulfatase-modifying factor protein [Sinorhizobium meliloti Rm41]
          Length = 326

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 124/346 (35%), Positives = 168/346 (48%), Gaps = 61/346 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           VE   +MV +PG TF MG+N        E P   VT+D F++D   V+N QF EFV+AT 
Sbjct: 11  VEPAAEMVWVPGATFMMGSNG---HYPEEAPVHPVTVDGFWIDDAPVTNRQFLEFVNATA 67

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           +VT AE+       +P   +   A  S +R     F      +                 
Sbjct: 68  HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPL----------------- 103

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                +G  +   ++W     G          +W  P G  S+I   ++HPVVHV+++DA
Sbjct: 104 -----KGTDI---SQWWIFTLGA---------NWRQPLGRKSSIGTILDHPVVHVAYSDA 146

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTEAEWE   RGGLE   F WG+ L P G + AN WQG FP  N+  DG
Sbjct: 147 KAYADWAGKDLPTEAEWELAARGGLEGAEFAWGDELAPGGNYVANTWQGSFPVANSIDDG 206

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
           +  T+P+  Y  N +GLY+M+GNVWEWT D+W+V H  A                  S +
Sbjct: 207 FARTSPIRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 337
           P        +V KGGS+LC   YC R+R AAR     D++  ++GF
Sbjct: 267 PMARVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGF 312


>gi|87300776|ref|ZP_01083618.1| hypothetical protein WH5701_04990 [Synechococcus sp. WH 5701]
 gi|87284647|gb|EAQ76599.1| hypothetical protein WH5701_04990 [Synechococcus sp. WH 5701]
          Length = 330

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 133/343 (38%), Positives = 168/343 (48%), Gaps = 68/343 (19%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSR---NVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           DMV +P   FR+G         G  P      V L  F L + EV+N +F  FV ATGY 
Sbjct: 34  DMVAIPAGRFRIGAA-------GRLPEEAEATVRLRPFCLGRTEVTNREFASFVEATGYT 86

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           T+AE+   T  F P LS  ER   S V       E            PV  +SW      
Sbjct: 87  TQAEQPLSTEQF-PGLSAAERRPGSVVFRPPSPGE------------PVQELSW------ 127

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                        W +           P   W HPEG  S I+ R++HPVV VS  D  A
Sbjct: 128 -------------WHW----------MPGADWRHPEGPGSAIDERLDHPVVQVSVADVQA 164

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W GA LP+EA+WE+  RGGL++++F WG+   P     AN WQG FP NNTA DGY 
Sbjct: 165 YAAWAGAALPSEAQWEFAARGGLKDQVFSWGDTWKP---GLANTWQGTFPVNNTAEDGYP 221

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH------------HPAPSYNPKGPTTG 298
            TAPV  +  N +GL++M GNVWEWT+DW+   H             PA S +P+ P   
Sbjct: 222 GTAPVGHFPANGYGLFDMTGNVWEWTSDWYQSGHDPIDGQQDPHLEDPAASSDPREPGV- 280

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
              V KGGS+LC   YC R+R AAR   +PD+   ++GFR A+
Sbjct: 281 AKHVIKGGSFLCAPNYCSRYRPAAREAESPDTGTSHIGFRLAS 323


>gi|156391933|ref|XP_001635804.1| predicted protein [Nematostella vectensis]
 gi|156222901|gb|EDO43741.1| predicted protein [Nematostella vectensis]
          Length = 273

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 120/330 (36%), Positives = 168/330 (50%), Gaps = 69/330 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  FRMGTN     KDGE P+R+V + AF +D++ ++N  F+ FV +  + TEAE
Sbjct: 1   MVKLSGGIFRMGTNSEDG-KDGENPARDVRVKAFAIDRYPITNELFRAFVRSKKFKTEAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFG +F F   + E  R+K                     +   V    W   V    WR
Sbjct: 60  KFGWSFAFHTFVPENVRSK---------------------VTQSVQGAPWWIPVNKAYWR 98

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                         P G  +TI+ ++N+PVVH+S+NDA AYC W
Sbjct: 99  -----------------------------QPGGPGTTIKDKLNYPVVHISYNDATAYCNW 129

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RL TEAEWE+  RGGL+ +++PWG       E R N+WQG+FP  NT  DGY   AP
Sbjct: 130 AGKRLATEAEWEFAARGGLQGQVYPWGKKFE---EKRMNIWQGDFPGKNTKLDGYDWLAP 186

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK-VKKGGSYLCNE 312
           V +Y  +N +G+Y+M+GN WEW  D +          +P+G   G  K V +GGSY+ + 
Sbjct: 187 VNAYPPQNSYGMYDMIGNAWEWVGDEFK---------SPEG--RGEKKFVLRGGSYIDSR 235

Query: 313 --QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
             ++ ++ R   R  NT D+ + N+ FRCA
Sbjct: 236 DGKFNHKARVTTRMGNTADAGSDNISFRCA 265


>gi|392414840|ref|YP_006451445.1| hypothetical protein Mycch_0956 [Mycobacterium chubuense NBB4]
 gi|390614616|gb|AFM15766.1| hypothetical protein Mycch_0956 [Mycobacterium chubuense NBB4]
          Length = 293

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 120/331 (36%), Positives = 168/331 (50%), Gaps = 49/331 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D+  L G TFRMG+ +       E P   VT+  F +++H V+N QF  FV ATGY T A
Sbjct: 4   DLAELDGGTFRMGSTQ---FYPEEAPVHTVTVAPFAIERHPVTNAQFAAFVDATGYRTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E   D  +F                        L ++ +  +   +V       V    W
Sbjct: 61  EIAPDPALF------------------------LGASPQDLVPGALVFRPTRGPVDLRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P  +W HP G  S++  +  HPVV V + DA+AY  
Sbjct: 97  R-------QWWQW----------TPGANWRHPFGPGSSVAGKEEHPVVQVCYADALAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+  RGG    ++PWG+   P G+  AN WQG FP  N  A G+  T+
Sbjct: 140 WAGRRLPTEAEWEFAARGG-STTVYPWGDEPAPGGQLMANTWQGAFPYRNDGALGWAGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYN---PKGPTTGTDKVKKGGSYL 309
           PV ++  N +GL +M+GNVWEWT   +  HH    P+ +   P+GP    ++  KGGS+L
Sbjct: 199 PVGAFPLNAYGLADMIGNVWEWTTTRFAGHHRLEGPTESCCPPQGPDPAVNQALKGGSHL 258

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           C  +YC+R+R AARS  + DS+  ++GFRCA
Sbjct: 259 CAPEYCHRYRAAARSPQSQDSATTHIGFRCA 289


>gi|418059687|ref|ZP_12697628.1| Sulphatase-modifying factor protein [Methylobacterium extorquens
           DSM 13060]
 gi|373566748|gb|EHP92736.1| Sulphatase-modifying factor protein [Methylobacterium extorquens
           DSM 13060]
          Length = 327

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 172/344 (50%), Gaps = 63/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ + G TF MG+++       E P   VT+ +F++D   V+N QF++FV  TG+VT AE
Sbjct: 20  MIYIRGGTFHMGSDRHY---PEERPVHRVTVGSFWMDATPVTNHQFRDFVRTTGHVTFAE 76

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K        P   +   A+  Q+      F G  S +    +HPV    W          
Sbjct: 77  K-------RPEPDQYPGARPDQL------FAG--SLVFTPPNHPVDLRMWG--------- 112

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN-HPVVHVSWNDAVAYCT 193
                  A W           L    +W  P G      H ++ HPVVHV++ DA+AY  
Sbjct: 113 -------AWWS----------LLKGANWRRPNGPKGITLHGLDTHPVVHVTYADALAYAR 155

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGL+   F WG+  TPRG H AN WQG+FP  NT  DG+  T+
Sbjct: 156 WAGKDLPTEAEWEFAARGGLDGAEFAWGDTFTPRGRHMANTWQGQFPWQNTEEDGFARTS 215

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD---------- 300
           PV ++  N +GL++M GNVWEWT+D++   H    P P   PK P  G +          
Sbjct: 216 PVKAFAPNGYGLHDMTGNVWEWTSDFFADRHSADAPKPCCIPKDPRGGREDDSYDPCLPD 275

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   +C R+R AAR     D+S  ++GFRC
Sbjct: 276 IRIPRRVLKGGSHLCAPNFCSRYRPAARLPQPVDTSTSHVGFRC 319


>gi|14518334|ref|NP_116817.1| MS129, hypothetical protein [Microscilla sp. PRE1]
 gi|14484969|gb|AAK62851.1| MS129, hypothetical protein [Microscilla sp. PRE1]
          Length = 354

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 133/343 (38%), Positives = 175/343 (51%), Gaps = 51/343 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +P   + MG       KD E+P  NV + AFY+D  EV+N +F+ FV  TGYVT AE
Sbjct: 45  MAYIPAGQYMMGGKSDQAYKD-EYPRHNVKVSAFYMDLTEVTNAEFKRFVDETGYVTIAE 103

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K  D   +E L S+  +         ++      S +  +   PV   S  D   Y  W 
Sbjct: 104 KDID---WEELKSQVPQGTPKPPDSVLQ----AGSLVFKQTDEPV---SLQD---YSQWW 150

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                   EW  G             +W +PEG  STIE RM+HPVVHVS+ D  AY  W
Sbjct: 151 --------EWTIGA------------NWRNPEGPGSTIEDRMDHPVVHVSFEDVQAYADW 190

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE+   GG  +  +PWGN    +   +AN WQG FP  N A DG+  TAP
Sbjct: 191 AGKRLPTEAEWEWAAMGGQNDVKYPWGNESVEQASDKANFWQGNFPHQNYALDGFERTAP 250

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT---TGTD----------- 300
           V S+  N +GLY+M GNVWEW  D ++V+ +   SY  KG T   TG++           
Sbjct: 251 VRSFPANGYGLYDMAGNVWEWCQDKYDVNAY--ESYKQKGLTEDPTGSEHYNDPREPYTP 308

Query: 301 -KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             V +GGS+LCN+ YC  +R + R  ++ DS   + GFRC  D
Sbjct: 309 KHVIRGGSFLCNDSYCSGYRVSRRMSSSRDSGFNHTGFRCVKD 351


>gi|170739501|ref|YP_001768156.1| hypothetical protein M446_1197 [Methylobacterium sp. 4-46]
 gi|168193775|gb|ACA15722.1| protein of unknown function DUF323 [Methylobacterium sp. 4-46]
          Length = 325

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 138/363 (38%), Positives = 177/363 (48%), Gaps = 65/363 (17%)

Query: 2   VLLPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
            LL  PP  +  +DMV LPG TFRMG+N+       E P+  V +D F++D   V+N QF
Sbjct: 5   ALLDRPPAPDAVRDMVFLPGGTFRMGSNR---HYPEEAPAHRVRIDGFWIDPTPVTNAQF 61

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           + FV ATGYVT AE+  D   +   L    +A                S +      PV 
Sbjct: 62  RAFVRATGYVTMAERKPDAADYPGALPHMLQAG---------------SLVFKPPKGPVN 106

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
              W        WR         +++G              W  P G  S+I     HPV
Sbjct: 107 LRDWG-----AWWR---------FKFGVH------------WRRPYGPGSSIAGLDEHPV 140

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           VHV++ DA AY  W    LPTEAEWEY  RGGL+   F WG+ LTP G H AN WQG FP
Sbjct: 141 VHVAYADAEAYAAWADKELPTEAEWEYAARGGLDGAEFAWGDELTPGGRHMANTWQGAFP 200

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA------------- 287
             N A DGY  T+PV ++  N +GL++M GNVWEWT D++ V  HPA             
Sbjct: 201 HQNLAEDGYERTSPVTAFPPNGYGLFDMTGNVWEWTTDFY-VPKHPADAAKACCIPQNPR 259

Query: 288 -----PSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                 S++P  P      +V KGGS+LC   YC R+R AAR  +  D+S  ++GFRC  
Sbjct: 260 GGQADESFDPSQPGIRIPRRVVKGGSHLCAPNYCRRYRPAARHPHPVDTSTSHIGFRCIR 319

Query: 342 DKG 344
             G
Sbjct: 320 RSG 322


>gi|167646635|ref|YP_001684298.1| hypothetical protein Caul_2673 [Caulobacter sp. K31]
 gi|167349065|gb|ABZ71800.1| protein of unknown function DUF323 [Caulobacter sp. K31]
          Length = 351

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 134/346 (38%), Positives = 175/346 (50%), Gaps = 57/346 (16%)

Query: 5   PAP-PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
           P P P      M L+P  TF MG  KPI  ++G  P R   ++AF++D+ EV+N  F  F
Sbjct: 44  PVPRPAPDTGGMALIPAGTFEMGA-KPIRPEEG--PPRLTRVEAFWIDRTEVTNGDFARF 100

Query: 64  VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
           V ATGYVT AE+  D   + P LS ++ A  + V      F G D+              
Sbjct: 101 VKATGYVTLAERPLDPKAY-PGLSGDQLAPSALV------FVGSDAP------------- 140

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHV 183
                     RGA     ++W    RG          +W HP+G  S+I  +   PVV V
Sbjct: 141 ----------RGA---DPSQWWRVVRGA---------NWRHPQGPGSSIVGKDFLPVVQV 178

Query: 184 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
            W DA+AY  W G  LPTEAEWEY  +GG     F WG+      + +ANVWQG FP  +
Sbjct: 179 GWEDAMAYARWLGRDLPTEAEWEYAAQGGKGATRFVWGDAPLDARKPQANVWQGVFPAVD 238

Query: 244 TAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTT----- 297
           T ADGY S TAPV  +  N +GL++M GNVWEWT DW+      A   +P GP +     
Sbjct: 239 TGADGYTSRTAPVGCFSANGYGLHDMAGNVWEWTRDWYRPGLDAAGHDDPGGPGSSMAFD 298

Query: 298 ----GTDK-VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
               G  K V KGGS+LC+  +CYR+R AAR     D+   ++GFR
Sbjct: 299 PGDPGVRKHVIKGGSFLCSPDFCYRYRPAARQAGPADTGESHMGFR 344


>gi|87198560|ref|YP_495817.1| hypothetical protein Saro_0536 [Novosphingobium aromaticivorans DSM
           12444]
 gi|87134241|gb|ABD24983.1| protein of unknown function DUF323 [Novosphingobium aromaticivorans
           DSM 12444]
          Length = 315

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 126/350 (36%), Positives = 170/350 (48%), Gaps = 65/350 (18%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           +R+  M L+ G TF MG+         E P R V +D+F++D+  V+N QF  F+ ATGY
Sbjct: 7   QRHAGMRLIEGGTFLMGSEA---FYPEEAPVRRVRVDSFWIDETPVTNFQFAAFIEATGY 63

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT AE   D   +  ++   +RA                S +  +   PV     + A A
Sbjct: 64  VTVAEVAPDPKDYPGMIPGMDRAG---------------SLVFQKTSGPV-----DMADA 103

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDA 188
              WR                      F +G+ W HP G  S +    +HPVVHV++ DA
Sbjct: 104 SNWWR----------------------FTFGACWKHPLGPGSDLAGLEDHPVVHVAYPDA 141

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY TW G  LPTEAE+E+  RGGL+ + + WG+ L P G   AN WQG FP  N   DG
Sbjct: 142 EAYATWAGKCLPTEAEFEFAARGGLDGQDYAWGSELAPDGRMMANYWQGLFPFANQCLDG 201

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-------APSYNPKGPTTGTD- 300
           +  T+PV S+  N  G+Y+M+GN WEWT+DWW     P        P  NP+G       
Sbjct: 202 WERTSPVRSFPPNGHGVYDMIGNTWEWTSDWWTAKPMPPKAKRSCCPIDNPRGGRLRESF 261

Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                      KV KGGS+LC   YC R+R AAR     D++  ++GFRC
Sbjct: 262 DPDHPGVRIGRKVIKGGSHLCAANYCQRYRPAARHPEMVDTATSHIGFRC 311


>gi|116252022|ref|YP_767860.1| hypothetical protein RL2266 [Rhizobium leguminosarum bv. viciae
           3841]
 gi|115256670|emb|CAK07758.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
           3841]
          Length = 317

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 126/347 (36%), Positives = 174/347 (50%), Gaps = 65/347 (18%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           ++ +V + G TF MG+N+  +    E P+  VT+D F++D   V+N QF  FV ATG+VT
Sbjct: 13  FQGLVWVAGRTFTMGSNEHYV---EEAPAHPVTVDGFWIDATPVTNRQFASFVKATGHVT 69

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AE+      +   L +  RA  S V    K   G D T             W       
Sbjct: 70  VAERAPRVEDYPGALPKMLRAG-SLVFTPPKAVVGPDIT------------QW------- 109

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          +++G              W HP G  + I  +++HPVVH++++DA AY
Sbjct: 110 -W---------SFKFGA------------DWRHPYGGRTDIRGKLDHPVVHIAYSDAKAY 147

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWE   RGGL++  F WG+ L P G   AN WQG FPT +T   G   
Sbjct: 148 AAWAGKDLPTEAEWELAARGGLDDAEFAWGDGLNPEGMLMANTWQGIFPTLSTKPAGADR 207

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPK 293
           T+PV  +  N +G+Y+M+GNVWEWT+D+W+   HP P                  SY+P+
Sbjct: 208 TSPVGYFPSNGYGIYDMIGNVWEWTSDFWST-RHPEPASHACCIPSNPRGGDADASYDPQ 266

Query: 294 GPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            P      +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 267 LPDIRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 313


>gi|332284401|ref|YP_004416312.1| hypothetical protein PT7_1148 [Pusillimonas sp. T7-7]
 gi|330428354|gb|AEC19688.1| hypothetical protein PT7_1148 [Pusillimonas sp. T7-7]
          Length = 364

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 121/357 (33%), Positives = 172/357 (48%), Gaps = 58/357 (16%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           V   ++M  +PG  F MG++   L +  E P+  V +  F++D   V+N QF EFV  TG
Sbjct: 38  VSTPQNMAWVPGGVFLMGSDS-KLAQPNERPTHQVKVHGFWMDVTHVTNDQFAEFVEKTG 96

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y+T AE+  D                         +E +   +   +  P  H+    A+
Sbjct: 97  YLTTAEQKPD-------------------------WETIRVQLPPGVPQPPDHVLQPGAM 131

Query: 129 AYC-TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
            +  T     L   ++W          R  P  +W HP+G  S+IE + +HPVV VS+ D
Sbjct: 132 VFVGTNAQVNLNDYSQWW---------RYVPGANWRHPQGPGSSIEGKGDHPVVQVSYVD 182

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW---QGEFPTNNT 244
             AY TW G RLPTEAEWE+  RGGLE   + WG+ L P GE +AN W   +  FP  + 
Sbjct: 183 VQAYATWAGKRLPTEAEWEFAARGGLEQATYVWGDELMPNGEMQANYWDTTEQPFPVVSP 242

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-------PSYNPKGPTT 297
            A G  +T P  ++  N +GL++M GN W+W +DW+   +          P  NPKGP  
Sbjct: 243 KAGGAANTVPAGTFPPNGYGLHDMTGNAWQWVSDWYRFDYFAMQAQAGQKPIDNPKGPGE 302

Query: 298 GTD------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             D            +V +GGS+LCNE YC  +R +AR  + P S   +LGFR   D
Sbjct: 303 SFDPNEPGVPANAPKRVIRGGSFLCNESYCLSYRPSARRSSDPYSPMSHLGFRLVKD 359


>gi|390367342|ref|XP_781714.3| PREDICTED: sulfatase-modifying factor 2 [Strongylocentrotus
           purpuratus]
          Length = 381

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 121/347 (34%), Positives = 178/347 (51%), Gaps = 64/347 (18%)

Query: 4   LPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           +PA PV ++Y++MV LPG  F MG++K    KDGE   R VT+  F ++++ ++   F++
Sbjct: 32  VPADPVFDKYEEMVALPGGKFMMGSSKSDG-KDGESVQRKVTVKPFKINKYPITAASFRK 90

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV A  Y T+AEK+  +F F+  + E                  + +TIE R+       
Sbjct: 91  FVRAKKYKTDAEKYAWSFAFDAFVPE-----------------AIKNTIEQRLPG----A 129

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
            W   V    WR                              PEG  S IE+R+N+P VH
Sbjct: 130 PWWIPVQNAYWR-----------------------------QPEGKGSGIENRLNYPAVH 160

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           +S  DA A+C W G RLPTE EWE+  RG LE + +PWG       ++R N+WQG+FP  
Sbjct: 161 ISNVDARAFCQWHGWRLPTEPEWEFAARGSLEGKEYPWGKKFE---KNRMNIWQGKFPEK 217

Query: 243 NTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNV-----HHHPAPSYNPKGPT 296
           NT ADGY   +PV ++  +N + LY+M+GNVWEWT+  ++         P+P+   + P+
Sbjct: 218 NTKADGYHGVSPVDAFPPQNDYDLYDMIGNVWEWTSTKFSATPSAKDDAPSPAGEEENPS 277

Query: 297 TGTDKVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAA 341
                V +GGSY+ ++   + H  R   R  N  D+ + NLGFRCA 
Sbjct: 278 Q-VRFVLRGGSYIDSKDGKFNHATRVTTRMGNEADAGSDNLGFRCAV 323


>gi|423018521|ref|ZP_17009242.1| hypothetical protein AXXA_28950 [Achromobacter xylosoxidans AXX-A]
 gi|338778406|gb|EGP42880.1| hypothetical protein AXXA_28950 [Achromobacter xylosoxidans AXX-A]
          Length = 378

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 123/350 (35%), Positives = 170/350 (48%), Gaps = 59/350 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + M  +P   F MG N   + +  E P+  V L  F++D H+V+N +F+ FV ATGYVT 
Sbjct: 45  QGMAWVPAGEFLMG-NDHRMSQPNERPAHRVRLTGFWMDAHDVTNAEFRRFVDATGYVTT 103

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+                           R+E L   +      P+      DA     
Sbjct: 104 AER-------------------------KPRWEDLAVQLPPGTPRPI------DAALVP- 131

Query: 133 WRGARLPTEAEWEYGCRGGLEN-RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
             GA + T  E E   R      R     +W HP+G  S+I+ + +HPVV VS+ DA+AY
Sbjct: 132 --GAMVFTGTEQEVSLRDYSRWWRFVAGANWRHPQGPASSIDGKDDHPVVQVSYEDALAY 189

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAAD 247
             W G RLPTEAEWE+  RGGLE   + WG+ L PRG+  AN+W  +    FP   T   
Sbjct: 190 AAWAGKRLPTEAEWEFAARGGLEQATYIWGDELNPRGQAMANIWDTQQRQPFPV-VTDEK 248

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV------HHHPAPSYNPKGPTTGTD- 300
             + T PV S+  N +GLY+M GNVW+WTADW+          +  P  +P GP    D 
Sbjct: 249 VQVGTTPVASFPANGYGLYDMAGNVWQWTADWYRADAFLAQSQYRQPPLDPAGPKDSFDP 308

Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                      +V +GGS+LC++ YC  +R +AR  N P +   +LGFR 
Sbjct: 309 NDGNVPQGAPKRVVRGGSFLCSDTYCISYRASARRGNDPMNGMSHLGFRT 358


>gi|209547151|ref|YP_002279069.1| hypothetical protein Rleg2_5119 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
 gi|209538395|gb|ACI58329.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
           trifolii WSM2304]
          Length = 322

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 124/346 (35%), Positives = 170/346 (49%), Gaps = 65/346 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +D + + G  F MG+++       E P+  V +D F+LD   V+N QF EFV ATGYVT 
Sbjct: 19  RDQIWIDGGGFLMGSDQHY---PEEAPAHPVKVDGFWLDVAAVTNRQFAEFVEATGYVTL 75

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEK  +   + P    E     S V    KR  G                 W        
Sbjct: 76  AEKASNPEDY-PGAPPEMLKPGSVVFDPPKRVSGKGPP------------QW-------- 114

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR         + +G             +W  P G  S +  +++HPVV V++ DA+AY 
Sbjct: 115 WR---------FRFGA------------NWRRPYGGLSNLRGKLDHPVVQVAFADAMAYA 153

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W+   LPTEAEWE+  +GGLE + F WG+   P G   AN W G FPT N   DG+  T
Sbjct: 154 LWKDKDLPTEAEWEFAAKGGLEGKEFAWGDEFIPDGRFMANTWHGLFPTENLKPDGFDRT 213

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-------YNPKGPTTGTD----- 300
           +PV S+  N +GLY+M+GNVWEWT+D+W    HPAP+        NP+G +         
Sbjct: 214 SPVGSFPPNGYGLYDMIGNVWEWTSDYWTT-RHPAPAEKSCCIPVNPRGGSAEASRDSGQ 272

Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                  +V KGGS+LC + YC R+R AAR     +S+  ++GFRC
Sbjct: 273 PDIRIARRVLKGGSHLCAQNYCRRYRPAARHAQEENSATTHIGFRC 318


>gi|152981439|ref|YP_001354099.1| hypothetical protein mma_2409 [Janthinobacterium sp. Marseille]
 gi|151281516|gb|ABR89926.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
          Length = 331

 Score =  194 bits (494), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 128/351 (36%), Positives = 166/351 (47%), Gaps = 64/351 (18%)

Query: 7   PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           P ++  ++M  +PG  F MG+N        E P+  V +DAF++D++ V+N+ F++FV A
Sbjct: 15  PSLQAPRNMAWIPGGKFLMGSNDHY---PEEAPAHKVVVDAFWMDKYAVTNSDFKQFVDA 71

Query: 67  TGYVT--EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
           TGYVT  E E   D +   P    E     S V         LD   +            
Sbjct: 72  TGYVTMVEREVNPDDY---PGARPELLVPSSVVFQKTATKVALDDPHQ------------ 116

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
                   W          W Y           P   W HP G  S+++    HPVVHV 
Sbjct: 117 --------W----------WHY----------VPGADWKHPRGPQSSLQGLWKHPVVHVG 148

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
             DA AY  W G  LPTEAEWE+  RGGL+   F WG+  TP G +  N WQG+FP  N 
Sbjct: 149 IEDAQAYAEWAGKSLPTEAEWEFAARGGLDGAEFVWGDEFTPGGRYMTNNWQGDFPHQNA 208

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH----HHPAPSYNPKG--PTTG 298
             D Y  T+PV ++  N +GLY+M GNVWEWT DW+  H    H      NP+G  P   
Sbjct: 209 LLDRYEWTSPVGAFPANGYGLYDMAGNVWEWTVDWYQEHSKIQHACCTINNPRGGEPDGS 268

Query: 299 TD----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            D          KV KGGS+LC   YC R+R AAR     D++  +LGFRC
Sbjct: 269 FDQRVKNVQIPRKVMKGGSHLCAPNYCRRYRPAARMAQPIDTATCHLGFRC 319


>gi|145595455|ref|YP_001159752.1| hypothetical protein Strop_2936 [Salinispora tropica CNB-440]
 gi|145304792|gb|ABP55374.1| protein of unknown function DUF323 [Salinispora tropica CNB-440]
          Length = 308

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 127/338 (37%), Positives = 167/338 (49%), Gaps = 57/338 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV +PG  FRMG+         E P R V +D F++DQ+ V+  +F  FV  TG+VT A
Sbjct: 17  EMVWIPGGRFRMGSTD---FYPEEGPVRWVEVDGFWIDQYLVTVREFDRFVRETGHVTMA 73

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E    T    P    E     S V    ++  G  S  ++R        +W         
Sbjct: 74  E-VSPTAADYPDADPELLVPGSLV---FRKASGPVSLADYR--------NW--------- 112

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W +           P  +W HPEG  ST++ R  HPV HV+++DAVA+ T
Sbjct: 113 ----------WSWT----------PGATWRHPEGPGSTVDGRELHPVTHVAYSDAVAFAT 152

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTE+EWE   RGGL+  +F WGN   P G+  AN WQG FP  N   DGY  T+
Sbjct: 153 WAGKELPTESEWELAARGGLDGAVFTWGNEFAPEGKLMANTWQGAFPWQNLLLDGYAGTS 212

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---------NPKGPTTGTDK-VK 303
           PV ++  N   LY+M GNVWEWT D   V  +PA S             GP++   + V 
Sbjct: 213 PVGAFPPNGHHLYDMAGNVWEWTQD---VFTYPAGSLAKACCTTPVTADGPSSAIPRHVI 269

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           KGGS+LC   YC R+R AAR     D+S  +LGFRC +
Sbjct: 270 KGGSHLCAPNYCMRYRPAARQGEAIDTSTCHLGFRCVS 307


>gi|254431410|ref|ZP_05045113.1| conserved domain protein [Cyanobium sp. PCC 7001]
 gi|197625863|gb|EDY38422.1| conserved domain protein [Cyanobium sp. PCC 7001]
          Length = 335

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 132/339 (38%), Positives = 171/339 (50%), Gaps = 61/339 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P  ++R+G       +  E P   V L  F L  HEV+N QF  FV ATGY T AE
Sbjct: 39  MVAIPAGSYRIGAGG---QRPEEGPPALVRLSPFCLATHEVTNRQFAAFVEATGYRTVAE 95

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      F P LS  ERA  S V      F  LD                          
Sbjct: 96  RPLSREQF-PTLSVAERAPGSLV------FRPLDP------------------------- 123

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G  +   A W +           P   W HPEG  S+IEH ++HPVV V++ DA AY  W
Sbjct: 124 GEPIGPMAGWHW----------VPGADWRHPEGPGSSIEHLLDHPVVQVAYEDAEAYAAW 173

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RGA LP+EA+WE+  RGGL +++F WG+       +RAN WQG FP  ++A DG+  TAP
Sbjct: 174 RGADLPSEAQWEFAARGGLRDQVFGWGDTWD---ANRANTWQGPFPYRDSAEDGHAGTAP 230

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAP-------SYNPKGPTTGTDKV 302
           V SY  N +GL++  GNVWEWTADW+   H      P P       S +P+ P      V
Sbjct: 231 VGSYAPNGYGLHDTTGNVWEWTADWFAPGHQLLVDRPDPRLDDAAASSDPREPGV-AKHV 289

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            KGGS+LC+  YC R+R AAR   +PD+   ++G R A+
Sbjct: 290 IKGGSFLCSPSYCSRYRPAAREAESPDTGTSHIGIRLAS 328


>gi|254559464|ref|YP_003066559.1| sulfatase-modifying factor like protein [Methylobacterium
           extorquens DM4]
 gi|254266742|emb|CAX22534.1| Sulfatase-modifying factor like (SUMF1-related); putative
           C-alpha-formyglycine-generating enzyme or
           methyltransferase or protein kinase [Methylobacterium
           extorquens DM4]
          Length = 327

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 125/344 (36%), Positives = 172/344 (50%), Gaps = 63/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ + G TFRMG+++       E P   VT+ +F++D   V+N QF++FV  TG+VT AE
Sbjct: 20  MIYIRGGTFRMGSDRHY---SEERPVHRVTVGSFWMDATPVTNHQFRDFVRTTGHVTFAE 76

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K        P   +   A+  Q+      + G  S +    +HPV    W          
Sbjct: 77  K-------RPEPDQYPGARSDQL------YAG--SLVFTPPNHPVDLRMWG--------- 112

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN-HPVVHVSWNDAVAYCT 193
                  A W           L    +W  P G      H ++ HPVVHV++ DA+AY  
Sbjct: 113 -------AWWS----------LLKGANWRRPNGPKGITLHGLDTHPVVHVTYADALAYAR 155

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGL+   F WG+  TPRG H AN WQG+FP  NT  DG+  T+
Sbjct: 156 WAGKDLPTEAEWEFAARGGLDGAEFAWGDTFTPRGRHMANTWQGQFPWQNTEEDGFARTS 215

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD---------- 300
           PV ++  N +GL++M GNVWEWT+D++   H      P   PK P  G +          
Sbjct: 216 PVKAFVPNGYGLHDMTGNVWEWTSDFFADRHSADAAKPCCIPKDPRGGREDDSYDPCLPD 275

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   +C R+R AAR     D+S  ++GFRC
Sbjct: 276 IRIPRRVLKGGSHLCAPNFCSRYRPAARLPQPVDTSTSHVGFRC 319


>gi|348029893|ref|YP_004872579.1| hypothetical protein GNIT_2489 [Glaciecola nitratireducens FR1064]
 gi|347947236|gb|AEP30586.1| hypothetical protein GNIT_2489 [Glaciecola nitratireducens FR1064]
          Length = 333

 Score =  194 bits (492), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 122/339 (35%), Positives = 166/339 (48%), Gaps = 54/339 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K M  + G T  MG      I + E P     +  FY+D+HEV+N QF +FV ATGY+T 
Sbjct: 37  KGMAFIKGGTLFMGAGA---IYEEETPLVKRKIQDFYMDRHEVTNAQFADFVDATGYITV 93

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           A+K  +   + P +S E     S V      F  L   +E              A ++  
Sbjct: 94  AQKLPNAEHY-PNISPELLKPGSAV------FVKLSEAVE--------------AGSFTN 132

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W +   G           W  P G +S IE + + PV+H++++DA+AY 
Sbjct: 133 W----------WHF-VEGAY---------WKKPFGPNSNIEGKEHFPVIHIAYDDALAYA 172

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W+G RLPTEAE+EY  RGGL+   +  GN LT  G + AN WQG FP  +TA DGY+  
Sbjct: 173 EWKGHRLPTEAEYEYASRGGLDGAKYATGNTLTDDGRYIANTWQGLFPFTDTAEDGYVGL 232

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------PAPSYNPKGPTTGTDKVK 303
           APV  Y +N FG+++++GNVWEWT   +   H          P   Y+ K P      V 
Sbjct: 233 APVGCYPQNHFGIHDLIGNVWEWTQSVYYPRHFDPKQLPQNLPPQGYDQKQPGVAVG-VI 291

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           KGGSYLC E +C R+R AAR          ++GFR   D
Sbjct: 292 KGGSYLCAEDFCMRYRPAARLAQDTGLGTSHIGFRTVKD 330


>gi|422320474|ref|ZP_16401534.1| exported protein [Achromobacter xylosoxidans C54]
 gi|317404770|gb|EFV85153.1| exported protein [Achromobacter xylosoxidans C54]
          Length = 382

 Score =  194 bits (492), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 170/350 (48%), Gaps = 59/350 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + M  +P   F MG N   + +  E P+  V L  F++D H+V+N +F+ FV ATGY+T 
Sbjct: 49  QGMAWIPAGEFLMG-NDHRMSQPNERPAHRVRLTGFWMDAHDVTNAEFRRFVDATGYLTT 107

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+                           R+E L   +      P+      DA     
Sbjct: 108 AER-------------------------KPRWEDLAVQLPPGTPRPI------DAALVP- 135

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
             GA + T  E E   R       F  G+ W HP+G  S+IE + +HPVV VS+ DA AY
Sbjct: 136 --GAMVFTGTEQEVSLRDYSRWWRFVAGANWRHPQGPASSIEGKDDHPVVQVSYEDAQAY 193

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAAD 247
             W G RLPTEAEWE+  RGGLE   + WG+ L PRG+  AN+W  +    FP   T   
Sbjct: 194 AAWAGKRLPTEAEWEFAARGGLEQATYIWGDELAPRGQAMANIWDTQQRQPFPV-VTDEK 252

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV------HHHPAPSYNPKGPTTGTD- 300
             + T PV S+  N +GLY+M GNVW+WTADW+          +  P  +P GP    D 
Sbjct: 253 VQVGTTPVASFPPNGYGLYDMAGNVWQWTADWYRADAFLTQSQYRRPPLDPAGPKDSFDP 312

Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                      +V +GGS+LC++ YC  +R +AR  N P +   +LGFR 
Sbjct: 313 DDGNVPRNAPKRVVRGGSFLCSDTYCISYRASARRGNDPMNGMSHLGFRT 362


>gi|387018858|gb|AFJ51547.1| Sulfatase modifying factor 2 [Crotalus adamanteus]
          Length = 321

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 121/356 (33%), Positives = 171/356 (48%), Gaps = 75/356 (21%)

Query: 7   PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNV-TLDAFYLDQHEVSNTQFQEFVS 65
           P   + +D V LPG    + TN    + DG+   R+   +  F +D++ V+N  F+EFV 
Sbjct: 29  PAFRKDEDTVQLPGGKLEIRTN----VLDGKNEDRSFRIIKPFVMDKYPVTNKDFREFVK 84

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
              Y TEAE FG +FVFE  + EE + K++Q                             
Sbjct: 85  DQKYKTEAEDFGWSFVFENFVPEELKKKVTQ----------------------------- 115

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
             +    W    LP E  +                 W  P G  S+I+ R+++PV+HVSW
Sbjct: 116 -KLESAPWW---LPIEKAF-----------------WRQPSGPGSSIKDRLDYPVLHVSW 154

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           NDA A+C W+  RLPTE EWE+  RGGLENRL+PWGN       +R N+WQG FP  + A
Sbjct: 155 NDAQAFCKWKRKRLPTEEEWEFAARGGLENRLYPWGNKFL---ANRTNLWQGNFPKVDIA 211

Query: 246 ADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVK- 303
            DG+   +PV ++  +N +GLY+++GN WEWT+            Y P GP +     K 
Sbjct: 212 EDGFRGASPVAAFPPQNNYGLYDLLGNTWEWTSS----------EYRPNGPASRQLGQKM 261

Query: 304 ---KGGSYLCNEQYCYRHR--CAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKK 354
              +G S++        H+     R  NTPDSS+ NL FRCA         + V+K
Sbjct: 262 CVLRGASWIDTVDGSANHKAQVTTRMGNTPDSSSDNLSFRCAISLVKQPAQEAVRK 317


>gi|190891643|ref|YP_001978185.1| hypothetical protein RHECIAT_CH0002047 [Rhizobium etli CIAT 652]
 gi|190696922|gb|ACE91007.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
          Length = 339

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 135/361 (37%), Positives = 181/361 (50%), Gaps = 64/361 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TFRMG++K       E P   V++D F++D+  V+N QF+EFV ATGYVT AE
Sbjct: 24  MAYIPGGTFRMGSDK---YYPEEAPVHRVSVDGFWVDRTPVTNRQFREFVEATGYVTFAE 80

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      ++ + R       H +K      S +    +HPV      D      W 
Sbjct: 81  -----------IAPDPRDYPGAPAHLLK----AGSAVFLPPNHPV------DLGDISKW- 118

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                    W            F +G+ W HP G  S+I    +HPVVHV++ DA AY  
Sbjct: 119 ---------W-----------TFTFGANWRHPYGKGSSIRKLDDHPVVHVAYEDAAAYAA 158

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGL+   + WG+ LTP G H AN WQG+FP  N A DG+  T+
Sbjct: 159 WAGKSLPTEAEWEFAARGGLDGAEYAWGDELTPGGRHMANTWQGQFPHENCAGDGFERTS 218

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWN------------VHHHP-----APSYNPKGPT 296
           PV ++  N + +Y+M+GNVWEWT DW++            +  +P     + SY+P  P 
Sbjct: 219 PVTAFPANGYDVYDMIGNVWEWTTDWYSPRNKGDVSKACCIPQNPRGASESDSYDPGMPE 278

Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKG 355
                KV KGGS+LC   YC R+R AAR     D+S  +LGFRC      T   + V   
Sbjct: 279 IRIPRKVIKGGSHLCAPNYCRRYRPAARHPEPVDTSTSHLGFRCVVRNLETEVAEDVNSH 338

Query: 356 G 356
           G
Sbjct: 339 G 339


>gi|229488576|ref|ZP_04382442.1| sulfatase-modifying factor 1 [Rhodococcus erythropolis SK121]
 gi|229324080|gb|EEN89835.1| sulfatase-modifying factor 1 [Rhodococcus erythropolis SK121]
          Length = 336

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 126/346 (36%), Positives = 171/346 (49%), Gaps = 63/346 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+M L+PG T+ MG+         E P   VT+D F++D   V+  +F+ FV ATG+VT 
Sbjct: 28  KNMSLIPGGTYWMGSEN---FYPEERPVHQVTVDGFWMDTTAVTVAEFRRFVKATGHVTT 84

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE   D   +                 D  R   +  ++    + P   +S +D   Y  
Sbjct: 85  AEIDPDAADYP----------------DADRALLVPGSLV--FNSPPGPVSLDD---YTQ 123

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W +           P   W HPEG DS I  R  HPV HVS+ DA AY 
Sbjct: 124 W----------WSFT----------PGADWRHPEGADSNIGGRERHPVTHVSYFDAQAYA 163

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+ + + WG++ +P G    NVWQG+FP  N   DGY  T
Sbjct: 164 AWAGKELPTEAEWEFAARGGLDRQAYVWGDHDSPGGRPGGNVWQGQFPWENLLEDGYERT 223

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTG------ 298
           APV  ++ N +GL++M GNVWEWTAD +   H     + AP+ +   P+ P         
Sbjct: 224 APVGKFRPNGYGLFDMAGNVWEWTADHYTSDHAHSSKNVAPASSCCIPRNPRAEFATEAL 283

Query: 299 -----TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                  +V KGGS+LC   YC R+R AAR   + +SS  ++GFRC
Sbjct: 284 VGEPYARRVIKGGSHLCAPNYCLRYRPAARQGESEESSTCHVGFRC 329


>gi|87200487|ref|YP_497744.1| hypothetical protein Saro_2474 [Novosphingobium aromaticivorans DSM
           12444]
 gi|87136168|gb|ABD26910.1| protein of unknown function DUF323 [Novosphingobium aromaticivorans
           DSM 12444]
          Length = 310

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 130/351 (37%), Positives = 169/351 (48%), Gaps = 66/351 (18%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           VER   M L+ G TF MG+         E P R V +D+F++D+  V+N QF  FV ATG
Sbjct: 2   VER-PGMRLIEGGTFTMGSEA---FYPEEAPLRRVKVDSFWIDEAPVTNAQFAAFVEATG 57

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           YVT AE   D   +  +L   +RA                S +  +   PV      D  
Sbjct: 58  YVTVAEIEPDPKDYPGMLPGMDRAG---------------SLVFQKTAGPV------DMA 96

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWND 187
               W          W            F +G+ W HP G  S+I+   +HPVVHV++ D
Sbjct: 97  DASNW----------WH-----------FTFGACWKHPLGPGSSIDGIEDHPVVHVAYAD 135

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G  LPTEAE+EY  RGGL+   F WG+ L P G   AN WQG FP  N   D
Sbjct: 136 AEAYAKWAGKDLPTEAEFEYAARGGLDGSEFSWGDELAPEGRMMANYWQGLFPFANQCLD 195

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-------------VHHHP-----APS 289
           G+  T+PV ++  N +GLY+M+GN WEWT DWW                 +P       S
Sbjct: 196 GWERTSPVRNFPPNGYGLYDMIGNTWEWTCDWWADKPLTPQRKSACCAISNPRGGKLKDS 255

Query: 290 YNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           ++P  P      KV KGGS+LC   YC R+R AAR     D++  ++GFRC
Sbjct: 256 FDPSQPAMRIGRKVIKGGSHLCAANYCQRYRPAARHPEMVDTATTHIGFRC 306


>gi|359421667|ref|ZP_09213579.1| hypothetical protein GOARA_082_00350 [Gordonia araii NBRC 100433]
 gi|358242383|dbj|GAB11648.1| hypothetical protein GOARA_082_00350 [Gordonia araii NBRC 100433]
          Length = 306

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 126/337 (37%), Positives = 172/337 (51%), Gaps = 57/337 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +PG TF MG++        E P    T++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 16  ELVEIPGGTFHMGSDAHY---PEERPVHERTVEPFSIERHLVTNDQFAEFVADTGYVTVA 72

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+                        D  +F G D      +   +V       V    W
Sbjct: 73  EE----------------------EVDPAQFPGADPA--DLVPGALVFTKTAGPVNLGDW 108

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R        +W          R  P  SW +PEG DS+IE R +HPVV V++ DA AY  
Sbjct: 109 R--------QWW---------RWEPGASWRNPEGPDSSIEGRGDHPVVQVAYRDAAAYAE 151

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTE EWE+  RGG++   F WG++L P G+  AN W G FP  +T   G+  T+
Sbjct: 152 WAGRRLPTETEWEFAARGGVDGLEFAWGDDLFPDGKVMANTWLGNFPYEST---GWGGTS 208

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH------HPAPSYNPK--GPTTGTD--KVK 303
           PV +Y  N +GLY+M+GNVWE T+D +   H      H +    P    PT      +V 
Sbjct: 209 PVGAYPANGYGLYDMIGNVWERTSDAYAPRHVVPGDQHVSADGRPDLLAPTVDPQVMRVT 268

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           KGGS++C  +YC R+R AARS  + DS+  +LGFRCA
Sbjct: 269 KGGSFICAPEYCRRYRPAARSAQSDDSATSHLGFRCA 305


>gi|410623515|ref|ZP_11334327.1| sulfatase-modifying factor 2 [Glaciecola pallidula DSM 14239 = ACAM
           615]
 gi|410156731|dbj|GAC29701.1| sulfatase-modifying factor 2 [Glaciecola pallidula DSM 14239 = ACAM
           615]
          Length = 337

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 120/339 (35%), Positives = 167/339 (49%), Gaps = 54/339 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K M  + G TF MG      +   E P+    +  FY+D+HEV+N QF EF+ ATGYVT 
Sbjct: 43  KGMAFVEGGTFTMGAGA---VYAEEMPAIQRVVKDFYMDRHEVTNLQFAEFIEATGYVTM 99

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           A++  +   + P +  E     S V      F  L   +E              A  +  
Sbjct: 100 AQRVPNADDY-PDIPAELLTPGSAV------FVKLSQAVE--------------ASTFFN 138

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W +   G          +W +P G +S I+ + + PV+H++++DA AY 
Sbjct: 139 W----------WHF-TEGA---------NWKNPLGPNSNIQDKDDFPVIHIAYDDAQAYA 178

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W+G RLPTEAE+EY  RGGL+   +  G+ L   G++ AN WQG FP N+TA DGY+  
Sbjct: 179 KWKGHRLPTEAEYEYASRGGLDGAKYASGSTLLNNGKYIANTWQGLFPFNDTAEDGYVGL 238

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------PAPSYNPKGPTTGTDKVK 303
           APV  Y  N FG+++++GNVWEWT   +   H          PA  Y+ K P      V 
Sbjct: 239 APVGCYPANHFGIHDLIGNVWEWTQSVYYPRHFDPSHLPANLPAKGYDEKQPGIAV-GVV 297

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           KGGSYLC E +C R+R AAR          ++GFR   D
Sbjct: 298 KGGSYLCAEDFCMRYRPAARHAQDTGLGTSHIGFRTVKD 336


>gi|340616167|ref|YP_004734620.1| C-alpha-formylglycine-generating protein [Zobellia galactanivorans]
 gi|339730964|emb|CAZ94228.1| C-alpha-formylglycine-generating protein [Zobellia galactanivorans]
          Length = 381

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/343 (35%), Positives = 170/343 (49%), Gaps = 50/343 (14%)

Query: 15  MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV +PG  F  G   +     D E P+  V +D F++D  EV+N +FQ+FV  TGY T A
Sbjct: 63  MVWIPGAVFSQGAVPQDKAAMDHEKPAHKVAVDGFFMDITEVTNDEFQKFVDETGYKTVA 122

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D    +  L E      +   HD     G  +  + +   P ++        +  W
Sbjct: 123 EREIDWEEMKKQLPEG-----TPKPHDSIMQPGSLTFKQAKSSVPNLY-------DFSQW 170

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W  G             +W +P G  S+I+ + N+PVVH+++ DA+AYC 
Sbjct: 171 WS--------WTIGA------------NWKNPNGPGSSIKGKGNYPVVHIAYEDALAYCE 210

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWE   RG   N ++ WG+    +   +AN W+GEFP  N   DG+   A
Sbjct: 211 WANRRLPTEAEWERAARGNHLNTIYFWGDE-DAKLSDKANTWEGEFPVTNVETDGFGLRA 269

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS---------------YNPKGPTTG 298
           PV SY  N FGLY+M GNVWEWT DW+N +++   S               YN + P   
Sbjct: 270 PVKSYPANDFGLYDMAGNVWEWTNDWYNTNYYKELSTTFPVAKNPLGADQAYNERDP-YA 328

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            +KV KGGS+LCN  YC  +R ++R   + DSS  +LGFR  A
Sbjct: 329 REKVMKGGSFLCNASYCASYRVSSRMATSLDSSLEHLGFRTVA 371


>gi|402491193|ref|ZP_10837981.1| hypothetical protein RCCGE510_25726 [Rhizobium sp. CCGE 510]
 gi|401809592|gb|EJT01966.1| hypothetical protein RCCGE510_25726 [Rhizobium sp. CCGE 510]
          Length = 323

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/358 (35%), Positives = 175/358 (48%), Gaps = 63/358 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           + R   MV +PG TF MG+++       E P+  V +D F++++  V+N  F EFV ATG
Sbjct: 11  LARIDGMVWIPGGTFVMGSDRHY---PEEAPAHPVKVDGFWIEETPVTNRAFAEFVKATG 67

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           YVT AE   D   +     E  RA  S +    K  +  DS I            W    
Sbjct: 68  YVTVAETIPDAKNYPGARPEMLRAG-SLLFVQPKTVD--DSDISQ----------W---- 110

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
               WR         +++G             +W  P G  S ++ +++HPVVHV++ DA
Sbjct: 111 ----WR---------FKFGA------------NWRRPLGGLSDLKGKLDHPVVHVAYADA 145

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTEAEWE+  RGGL +  F WG+ + P     AN WQG FPT +T   G
Sbjct: 146 TAYARWAGLDLPTEAEWEFAARGGLGDVEFAWGSEMMPDDVIMANTWQGVFPTKSTKPKG 205

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYN 291
              ++ V S+  N +GLY+M+GNVWEWT+D+W+  H                  P  SY+
Sbjct: 206 RERSSAVRSFPANGYGLYDMIGNVWEWTSDYWSTRHPEPARHSCCIPNNPRGGDPVASYD 265

Query: 292 PKGPT-TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
            + P  T   +V KGGS+LC   YC R+R AAR     D+S  ++GFRC      T G
Sbjct: 266 RRQPAITIPRRVLKGGSHLCAPNYCRRYRPAARHAEQEDTSTSHVGFRCIRRASSTAG 323


>gi|389864686|ref|YP_006366926.1| sulfatase-modifying factor [Modestobacter marinus]
 gi|388486889|emb|CCH88441.1| Sulfatase-modifying factor [Modestobacter marinus]
          Length = 307

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/336 (36%), Positives = 158/336 (47%), Gaps = 67/336 (19%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P   V +D F++D+H V+ T F  FV+ TGY+T AE+                     
Sbjct: 10  ERPVHRVAVDGFWMDRHPVTATAFARFVADTGYLTVAER--------------------- 48

Query: 97  VRHDMKRFEGLDSTI----EHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGL 152
            R D   + G+D  +          P   +   D  A+             W Y      
Sbjct: 49  -RPDPAAYPGVDPAVLVAGSLVFTPPARPVPLRDPRAW-------------WTY------ 88

Query: 153 ENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGG 212
                P  SW HPEG  ST++ R +HPVVHV+  DA AY  W G  LPTEAEWE+  RGG
Sbjct: 89  ----VPGASWRHPEGPGSTVDGRGDHPVVHVAHEDAEAYARWAGKDLPTEAEWEFAARGG 144

Query: 213 LENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNV 272
           L   ++ WG+   P G   AN WQGEFP  +   DGY  T+PV ++  N +GL++  GNV
Sbjct: 145 LAGAVYTWGDEFAPGGRLMANTWQGEFPWEDLGTDGYKRTSPVGAFPPNGYGLHDTAGNV 204

Query: 273 WEWTADWWNVHHH---PAPSYNPKGP---------TTG------TDKVKKGGSYLCNEQY 314
           WEWT+D +   H    P+    P+ P         TTG        +V KGGS+LC   Y
Sbjct: 205 WEWTSDLYAPRHSGDAPSACCIPRNPRVHTADQTWTTGLPDAAVARRVVKGGSHLCAPNY 264

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTD 350
           C R+R AAR     D+S G+LGFRC       T  D
Sbjct: 265 CLRYRPAARQGEPVDTSTGHLGFRCVLRSSERTAPD 300


>gi|427392884|ref|ZP_18886787.1| hypothetical protein HMPREF9698_00593 [Alloiococcus otitis ATCC
           51267]
 gi|425731089|gb|EKU93916.1| hypothetical protein HMPREF9698_00593 [Alloiococcus otitis ATCC
           51267]
          Length = 281

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 120/334 (35%), Positives = 168/334 (50%), Gaps = 61/334 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +V + G  FRMGT+      +D E PS  V +  F +    ++N +F+EF+  TGYVT A
Sbjct: 3   LVSITGGEFRMGTDDNKGFPEDYEGPSTIVNVPNFEIADTPITNREFKEFIDDTGYVTTA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+ G +FVF  LL EEE++K                  +H +  P               
Sbjct: 63  EQKGCSFVFGKLLDEEEKSKA-----------------QHVLGTP--------------- 90

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W +         L P   W HP G  S + +  +HPVVHVS  DA+AYC 
Sbjct: 91  ----------WWF---------LVPGADWAHPYGPKSNLNNLWDHPVVHVSLKDALAYCD 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W    LPTEAEWEY  R G  +  FPWG++LT    + AN WQG+FP  N+  DGYL+TA
Sbjct: 132 WSETHLPTEAEWEYAARAGTTSE-FPWGDSLTEDNTYHANTWQGKFPEVNSLDDGYLATA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVK-----KGGSY 308
           PV +Y+ N +GLY ++GNVWEW     N  +     +N        +K+      +GGS+
Sbjct: 191 PVYNYEPNNWGLYQVIGNVWEWCR---NPRYTLFEDFNSNEFLVNKNKLVGEYAIRGGSF 247

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           LC+  YC R+R A R+    +S++ +L FRC  +
Sbjct: 248 LCHSCYCNRYRVAGRNGVDNESTSSHLSFRCIKE 281



 Score = 39.7 bits (91), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 16/39 (41%), Positives = 26/39 (66%)

Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           +GGS+LC+  YC R+R A R+    +S++ +L FRC  +
Sbjct: 243 RGGSFLCHSCYCNRYRVAGRNGVDNESTSSHLSFRCIKE 281


>gi|311106407|ref|YP_003979260.1| hypothetical protein AXYL_03225 [Achromobacter xylosoxidans A8]
 gi|310761096|gb|ADP16545.1| hypothetical protein AXYL_03225 [Achromobacter xylosoxidans A8]
          Length = 387

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 119/349 (34%), Positives = 167/349 (47%), Gaps = 61/349 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG++   L +  E P+  V++  F++D ++V+N QF+ FV ATGYVT AE
Sbjct: 56  MAWVPGREFLMGSSH-KLAQPNEMPAHKVSVSGFWMDVNDVTNAQFRRFVEATGYVTTAE 114

Query: 75  ---KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
              K+ D  V  P  +     ++  +      F G +S +  R               Y 
Sbjct: 115 QKPKWEDLQVQLPPGTPRPDERL--LVPGAMVFVGTESEVSLR--------------DYS 158

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          W Y           P  +W HP+G  S+I  + +HPVV VS+ DA AY
Sbjct: 159 RW----------WRY----------VPGANWRHPQGPGSSIAGKDDHPVVQVSYQDAAAY 198

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG--- 248
             W G RLPTE+EWE   RGGLE   + WG  L P+G+  AN+W  + P           
Sbjct: 199 AKWAGKRLPTESEWEMAARGGLEQATYAWGEELLPQGKPMANIWDTQQPQPFPVVKDEKV 258

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD-- 300
            + T PV S+  N +GLY+M GNVW+WTADW+          +  P  NP GP    D  
Sbjct: 259 QVGTMPVGSFSPNGYGLYDMAGNVWQWTADWYRADAFKIQAQYRQPPANPAGPADSYDPD 318

Query: 301 ----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                     +V +GGS+LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 319 DGIVPPGAPKRVTRGGSFLCSDTYCISYRTSARRGTDPMNGMSHLGFRT 367


>gi|442321248|ref|YP_007361269.1| hypothetical protein MYSTI_04287 [Myxococcus stipitatus DSM 14675]
 gi|441488890|gb|AGC45585.1| hypothetical protein MYSTI_04287 [Myxococcus stipitatus DSM 14675]
          Length = 333

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 163/346 (47%), Gaps = 63/346 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG T+ MG++        E P+  VT+  F++D+  V+N  F  FV  TGYVT AE
Sbjct: 27  MEWIPGGTYWMGSDTHY---PEERPTHQVTVTGFWMDRCTVTNADFARFVQTTGYVTVAE 83

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D   F P    E     S V H       L    +                    W 
Sbjct: 84  RPLDAKDF-PGAKPELLVPGSLVFHKTPERVDLKDLSQ--------------------W- 121

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y           P   W  P+G  ST + + +HPVVHV++ DA AY TW
Sbjct: 122 ---------WTY----------TPGACWHRPQGHGSTWKGQEDHPVVHVAFEDARAYATW 162

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LP+EAEWE   R GL+ +++ WG+  +P  +  AN+WQGEFP  N   DGY  T+P
Sbjct: 163 AGKDLPSEAEWERAARAGLDRKVYVWGDEFSPGNQQMANIWQGEFPWQNLMTDGYEGTSP 222

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD-------- 300
           V +Y  N +GL++M GNVWEWT DW+   H            NP+GP +           
Sbjct: 223 VGTYPPNAYGLFDMAGNVWEWTTDWFQERHQGNGGKACCIPVNPRGPPSSQKSFDPCTPE 282

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                +V KGGS+LC   YC R+R AARS    DS   ++GFRC A
Sbjct: 283 VLIPRRVVKGGSHLCASNYCLRYRPAARSPQAEDSGTTHIGFRCIA 328


>gi|359798305|ref|ZP_09300879.1| hypothetical protein KYC_15197 [Achromobacter arsenitoxydans SY8]
 gi|359363851|gb|EHK65574.1| hypothetical protein KYC_15197 [Achromobacter arsenitoxydans SY8]
          Length = 332

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 164/348 (47%), Gaps = 59/348 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG N   + +  E P+  V +  F++D H+V+N +F+ FV ATGYVT AE
Sbjct: 1   MAWIPGTDFLMG-NDHQMSQPNERPAHKVQVSGFWMDVHDVTNAEFRRFVEATGYVTTAE 59

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                    P L     R   S++      F G ++ +  R               Y  W
Sbjct: 60  MKPRWEDLRPQLPPGTPRPHDSELIAGAMVFVGTETEVSLR--------------DYSRW 105

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P  +W HP+G  S I  +  HPVV VS+ DA AY  
Sbjct: 106 ----------W----------RFVPGANWRHPQGPGSDIAGKDEHPVVQVSYEDAQAYAK 145

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
           W G RLPTEAEWE+  RGGLE   + WGN L P+G+  AN+W    Q  FP         
Sbjct: 146 WAGKRLPTEAEWEFAARGGLEQATYTWGNELLPQGKAMANIWDTQQQQPFPVVKDEKI-Q 204

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
           + T PV SY  N +GLY+M GNVW+WTADW+                    PA S++P  
Sbjct: 205 VGTTPVGSYAPNGYGLYDMAGNVWQWTADWYRADAFQIQAQFRQPPKDPAGPADSFDPSD 264

Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
              P +   +V +GGS+LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 265 GNVPVSAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 312


>gi|427702573|ref|YP_007045795.1| hypothetical protein Cyagr_1278 [Cyanobium gracile PCC 6307]
 gi|427345741|gb|AFY28454.1| hypothetical protein Cyagr_1278 [Cyanobium gracile PCC 6307]
          Length = 327

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 130/339 (38%), Positives = 170/339 (50%), Gaps = 62/339 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  +R+G     L ++ E    +V L  F + + EV+N +F  FV+ATGY T+AE
Sbjct: 32  MAPVPGGRYRIGAAG-RLPEEAE---ASVRLQPFCIARTEVTNREFAAFVAATGYRTQAE 87

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +      F P LS  ER   S V       E            P+  +SW          
Sbjct: 88  QPLPQEQF-PELSAAERRPGSVVFRPPSPGE------------PLAELSW---------- 124

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W++           P   W HPEG  S+I+ R++HPVV VS  DA AY  W
Sbjct: 125 ---------WQW----------VPGADWRHPEGPGSSIDDRLDHPVVQVSLADAEAYAAW 165

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LP+EA+WE+  RGGL +R F WGN   P     AN WQGEFP +NTA DG+  TAP
Sbjct: 166 AGTALPSEAQWEFAARGGLRDRAFSWGNTWRP---GLANTWQGEFPVSNTADDGFTGTAP 222

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH------------HPAPSYNPKGPTTGTDKV 302
           V S+  N +GL +M GNVWEWTADW+   H             PA S +P+ P      V
Sbjct: 223 VGSFPPNGYGLQDMTGNVWEWTADWYRPGHDRLAGQDDPVLADPAASSDPREPGV-AKHV 281

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            KGGS+LC   YC R+R AAR   +PD+   ++GFR  +
Sbjct: 282 IKGGSFLCAPNYCSRYRPAAREAESPDTGTSHIGFRLVS 320


>gi|163850229|ref|YP_001638272.1| hypothetical protein Mext_0792 [Methylobacterium extorquens PA1]
 gi|163661834|gb|ABY29201.1| protein of unknown function DUF323 [Methylobacterium extorquens
           PA1]
          Length = 327

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 171/344 (49%), Gaps = 63/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ + G TFRMG+++       E P   VT+ +F++D   V+N QF++FV  TG+VT AE
Sbjct: 20  MIYIRGGTFRMGSDRHY---SEERPVHRVTVGSFWMDATPVTNHQFRDFVRTTGHVTFAE 76

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K        P   +   A+  Q+      + G  S +    +HPV    W          
Sbjct: 77  K-------RPEPDQYPGARSDQL------YAG--SLVFTPPNHPVDLRMWG--------- 112

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN-HPVVHVSWNDAVAYCT 193
                  A W           L    +W  P G      H ++ HPVVHV++ DA+AY  
Sbjct: 113 -------AWWS----------LLKGANWRRPNGPKGITLHGLDTHPVVHVTYADALAYAR 155

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGL+   F WG+  TPRG H AN WQG+FP  NT  DG+  T+
Sbjct: 156 WAGKDLPTEAEWEFAARGGLDGAEFAWGDTFTPRGRHMANTWQGQFPWQNTEEDGFARTS 215

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD---------- 300
           PV ++  N +GL++M GNVWEWT+D++   H      P   PK P  G +          
Sbjct: 216 PVKAFVPNGYGLHDMTGNVWEWTSDFFADRHSADAAKPCCIPKDPRGGREDDSYDPCLPD 275

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   +C R+R  AR     D+S  ++GFRC
Sbjct: 276 IRIPRRVLKGGSHLCAPNFCSRYRPEARLPQPVDTSTSHVGFRC 319


>gi|441513058|ref|ZP_20994890.1| hypothetical protein GOAMI_15_00400 [Gordonia amicalis NBRC 100051]
 gi|441452039|dbj|GAC52851.1| hypothetical protein GOAMI_15_00400 [Gordonia amicalis NBRC 100051]
          Length = 316

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 123/351 (35%), Positives = 169/351 (48%), Gaps = 61/351 (17%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           AP     K+M  +PG  F MG+ +       E P   V +  F++D H V+  +F+ FV 
Sbjct: 5   APTGRAPKNMSWIPGGMFWMGSEE---FYPEERPVHQVGVGGFWMDTHPVTVAEFRRFVK 61

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
            TG+VT AE         P  +E   A  + +      F             P   +  +
Sbjct: 62  DTGHVTTAE-------VAPDPAEYPDADPALLVPGSLVFT-----------PPARRVPLD 103

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
           D   + +W                        P   W HPEG  ST+  R  HPV HVS 
Sbjct: 104 DYRRWWSW-----------------------VPGADWRHPEGPGSTVGGRERHPVTHVSH 140

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
            DA+AY  W G  LPTE+EWE+  RGGL+   F WG+  TP+G + AN WQGEFP  N A
Sbjct: 141 YDALAYARWAGKELPTESEWEFAARGGLDRARFSWGDEYTPKGRYLANTWQGEFPWQNLA 200

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----AP------SYNPKG 294
            DGY+ T+PV S++ N +GL ++ GNVWEWT D +   H P     +P        +P+ 
Sbjct: 201 EDGYVGTSPVGSFRPNGYGLVDVTGNVWEWTTDRFTARHDPNRRGSSPVTACCMPADPRT 260

Query: 295 PTTGTD------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
             +GT       +V KGGS+LC   YC R+R AAR   + ++S  ++GFRC
Sbjct: 261 LASGTGAEAHPRRVIKGGSHLCAPNYCLRYRPAARQGESAETSTCHIGFRC 311


>gi|213514658|ref|NP_001135212.1| sulfatase-modifying factor 2 precursor [Salmo salar]
 gi|209154232|gb|ACI33348.1| Sulfatase-modifying factor 2 precursor [Salmo salar]
          Length = 281

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 115/307 (37%), Positives = 159/307 (51%), Gaps = 69/307 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG    MGTN     +DGE P++ VT+D F +D++ V+N+ F+EFV A  Y TEAE
Sbjct: 34  MVQIPGGKMTMGTNSADG-RDGESPTKEVTVDPFRIDKYPVTNSDFREFVRAQKYKTEAE 92

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVF+  +SEE ++K+++                               +    W 
Sbjct: 93  TFGWSFVFQDFVSEELKSKVTE------------------------------KIESAPWW 122

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              +P E             R+F    W  P G  S I  R++ PVV VSWNDA A+C+W
Sbjct: 123 ---MPVE-------------RVF----WRQPAGPGSGIRERLDSPVVQVSWNDAQAFCSW 162

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLP+E EWE+  RGGL+ R +PWGN    +   R+N+WQG FP  +TA DGY   AP
Sbjct: 163 RGRRLPSEEEWEWAARGGLQGRTYPWGNKFQSK---RSNLWQGLFPDGDTAEDGYHGIAP 219

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           V ++  +N FGLY+M+GN WEWT+        P P   P         V +GGS++  E 
Sbjct: 220 VTAFPPQNSFGLYDMMGNAWEWTS-------TPFPGGRPM-------FVLRGGSWIDTED 265

Query: 314 YCYRHRC 320
               H+ 
Sbjct: 266 GSANHKA 272


>gi|377807502|ref|YP_004978694.1| hypothetical protein BYI23_C001100 [Burkholderia sp. YI23]
 gi|357938699|gb|AET92256.1| hypothetical protein BYI23_C001100 [Burkholderia sp. YI23]
          Length = 356

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 121/346 (34%), Positives = 161/346 (46%), Gaps = 58/346 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNV---TLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           M+ + G  FRMG+       +G    RNV    LD F +D HEV+N QF  FV ATGYVT
Sbjct: 56  MIAIAGGRFRMGST------EGYPEERNVQPVKLDGFLIDAHEVTNAQFAMFVRATGYVT 109

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           EAE+                      + D + F G+      ++    V   W +     
Sbjct: 110 EAER----------------------QRDPRDFPGV---AREQLQPGSVVFVWPEGAQ-- 142

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                +      W          R  P  +W HP G  S I  + NHPVV V++ DA AY
Sbjct: 143 -----QFEQSYRWW---------RFQPGANWRHPYGPGSGIAGKDNHPVVQVTYRDAQAY 188

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAE+EY  RGG E+  +PWGN     G++ AN WQG+FP +N A DG+  
Sbjct: 189 AKWLGHDLPTEAEYEYAARGGRESETYPWGNTPKANGKYMANTWQGDFPAHNEAGDGHKG 248

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD--------KVK 303
           TAPV  Y  N FGL + VGNVW+WT D +   H    + NP                +V 
Sbjct: 249 TAPVGCYPANDFGLLDPVGNVWQWTKDVYQDRHPAEEAQNPLVVRVSAPFDDGRHPARVI 308

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
           KGGS+LC   +C R+R ++R    P     ++GFR      P + +
Sbjct: 309 KGGSFLCAPNFCVRYRPSSRQPQDPTLGTVHIGFRTVLRMSPDSAS 354


>gi|424870496|ref|ZP_18294158.1| hypothetical protein Rleg5DRAFT_1953 [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393166197|gb|EJC66244.1| hypothetical protein Rleg5DRAFT_1953 [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 317

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 123/347 (35%), Positives = 174/347 (50%), Gaps = 65/347 (18%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           ++ +V + G TF MG+N+  +    E P+  VT+D F++D   V+N QF  FV ATG+VT
Sbjct: 13  FQGLVWVAGRTFTMGSNEHYV---EEAPAHPVTVDGFWIDATPVTNRQFASFVKATGHVT 69

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AE+      +   L +  RA  S V    K   G D T             W       
Sbjct: 70  VAERAPRAEDYPGALPKMLRAG-SLVFTPPKVVVGPDIT------------QW------- 109

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          +++G              W HP G  + I  +++HPVVH++++DA AY
Sbjct: 110 -W---------SFKFGA------------DWRHPYGGRTDIRGKLDHPVVHIAYSDAKAY 147

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W    LPTEAEWE   RGGL++  + WG+ L P+G+  AN WQG FP  +T   G   
Sbjct: 148 AAWAEKDLPTEAEWELAARGGLDDAEYAWGDELNPKGKLMANTWQGTFPALSTKPAGADR 207

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPK 293
           T+PV  +  N +G+Y+M+GNVWEWT+D+W+   HP P                  SY+P+
Sbjct: 208 TSPVGYFPSNGYGIYDMIGNVWEWTSDFWST-RHPEPASHSCCIPSNPRGGDADASYDPR 266

Query: 294 GPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            P      +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 267 LPDIRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 313


>gi|389695482|ref|ZP_10183124.1| hypothetical protein MicloDRAFT_00052980 [Microvirga sp. WSM3557]
 gi|388584288|gb|EIM24583.1| hypothetical protein MicloDRAFT_00052980 [Microvirga sp. WSM3557]
          Length = 327

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 127/350 (36%), Positives = 175/350 (50%), Gaps = 62/350 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MVL+PG TFRMG+++       E P   VT+D F++D+  V+N +F+ FV ATG+VT A
Sbjct: 22  NMVLIPGGTFRMGSDR---HYPEEAPVHRVTVDGFWIDRTPVTNREFRRFVGATGHVTVA 78

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   +     E  +A                S +     +PV    W+   ++   
Sbjct: 79  ERTPDPRDYPGARPEMLKAG---------------SLVFSPPDYPVNLKDWSQWWSFT-- 121

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                       +G  G              P G  S+I+   +HPVVHV++ DA AY  
Sbjct: 122 ------------FGANG------------RRPYGRGSSIKSLDDHPVVHVAYADAEAYAA 157

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE+  RGGL+   F WG+  TP G H AN WQG FP  NT  DG+  T+
Sbjct: 158 WAGKDLPTEAEWEFAARGGLDGAEFAWGDEFTPDGRHMANTWQGAFPFQNTKLDGFERTS 217

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYNPKGPT 296
           PV ++  N +GL++M+GNVWEWT+DW+   H                   A SY+P  P 
Sbjct: 218 PVTAFPPNAYGLHDMIGNVWEWTSDWYAPKHPADAPKACCIPENPRGGREADSYDPCQPE 277

Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
                +V KGGS+LC   YC R+R AAR     D+S  ++GFRC   + P
Sbjct: 278 IRIPRRVLKGGSHLCAPSYCRRYRPAARHAQPVDTSTSHVGFRCVVRRRP 327


>gi|299533665|ref|ZP_07047039.1| hypothetical protein CTS44_22744 [Comamonas testosteroni S44]
 gi|298718387|gb|EFI59370.1| hypothetical protein CTS44_22744 [Comamonas testosteroni S44]
          Length = 365

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 124/348 (35%), Positives = 166/348 (47%), Gaps = 52/348 (14%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+MV +P   F MG      + + E P   V L  F++D+  V+N QF EFV ATGYVT 
Sbjct: 51  KNMVWIPSGHFLMGNTSSKALPN-EMPGHKVELQGFWMDKTHVTNRQFSEFVHATGYVTT 109

Query: 73  AEKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           AEK  D    +  L     R   S +      F G    +  R               Y 
Sbjct: 110 AEKTPDWQSLQAQLPVGTPRPDSSVLIPGASVFVGTRQPVSLR--------------DYM 155

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          W Y   GG +        W HP G +S+IE + +HPVV VS+ DA AY
Sbjct: 156 QW----------WRY--VGGAD--------WRHPSGPNSSIEGKDDHPVVQVSYEDAQAY 195

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL- 250
             W G RLPTEAEWEY  RGGL  + + WG    P G   A++W+    +   A    L 
Sbjct: 196 ARWIGKRLPTEAEWEYAARGGLNQQHYAWGTLAEPNGNVMASIWKSRTASFPQAQPKILP 255

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWW-----NVHHHPAPSYNPKGPTTGTD----- 300
            T PV +Y+ N +GL++M GN W+W ADW+     ++        NP+GP+   D     
Sbjct: 256 GTEPVANYEPNGYGLFDMAGNAWQWAADWYRYDAFSIQARMQNISNPQGPSNSFDPDGLR 315

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
                +V +GGS+LCNE YC  +R +AR    P S++ N+GFR   DK
Sbjct: 316 SDAPKRVIRGGSFLCNEDYCEGYRVSARQGQDPWSASNNVGFRLVMDK 363


>gi|262200810|ref|YP_003272018.1| hypothetical protein Gbro_0811 [Gordonia bronchialis DSM 43247]
 gi|262084157|gb|ACY20125.1| protein of unknown function DUF323 [Gordonia bronchialis DSM 43247]
          Length = 299

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 127/336 (37%), Positives = 170/336 (50%), Gaps = 57/336 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG  FRMG+++       E P+ +  + AF +++H V+N  F EFV+ TGYVT A
Sbjct: 9   ELVDLPGGVFRMGSDRHYA---EERPAHDRAVGAFAIERHPVTNAAFAEFVADTGYVTVA 65

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        EP+              D   F G D      +   +V       V    W
Sbjct: 66  E--------EPI--------------DPADFPGADPG--DLVPGALVFTPTAGPVNLADW 101

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           +        +W          R  P  +W  PEG  S +E R +HPVVH+++ DA AY  
Sbjct: 102 Q--------QWW---------RWQPGAAWHRPEGPGSGVEDRPDHPVVHIAYRDAAAYAG 144

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGGL    F WG+ L P G   AN W+G FP  N   +G+ +T+
Sbjct: 145 WAGRRLPTEAEWEYAARGGLAAAEFAWGDELHPDGADLANTWKGRFPYRN---EGWGTTS 201

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH------HPAPSYNPK--GPTTGTD--KVK 303
           PV +   N FGL++M+GNVWE T+D +   H      H      P    PTT     +V 
Sbjct: 202 PVGTCPPNGFGLFDMIGNVWERTSDVFTPRHVRPDDVHVDADGRPDLLAPTTSPRIMRVT 261

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           KGGSY+C  +YC R+R AARS  + DS+  +LGFRC
Sbjct: 262 KGGSYICAPEYCRRYRPAARSAQSDDSATSHLGFRC 297


>gi|419708452|ref|ZP_14235922.1| hypothetical protein OUW_02929 [Mycobacterium abscessus M93]
 gi|382944484|gb|EIC68792.1| hypothetical protein OUW_02929 [Mycobacterium abscessus M93]
          Length = 297

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 168/339 (49%), Gaps = 59/339 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 10  ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 66

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 67  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 90

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 91  RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 140

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG    ++ WG+ + P G   AN WQG+FP  N  A G
Sbjct: 141 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGKFPYRNDGALG 199

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY--NPK-GPTTGTDKVKKG 305
           +  T+PV ++  N FGL +M+GNVWEWT+  +  H   AP    +P   P     +V KG
Sbjct: 200 WHGTSPVGTFPANDFGLLDMIGNVWEWTSTRFQ-HGAGAPKSCCSPSDNPDPSVIQVLKG 258

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           GS+LC  +YC+R+R AARS  + DS+  ++GFRC  + G
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 297


>gi|262204582|ref|YP_003275790.1| hypothetical protein Gbro_4780 [Gordonia bronchialis DSM 43247]
 gi|262087929|gb|ACY23897.1| protein of unknown function DUF323 [Gordonia bronchialis DSM 43247]
          Length = 337

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 126/348 (36%), Positives = 167/348 (47%), Gaps = 69/348 (19%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           KDMV +PG  F MG+++       E P+  V++D F++D++ V+  +F+ FV ATGYVT 
Sbjct: 30  KDMVWVPGGVFWMGSDEHY---PEERPAHQVSVDGFWMDRNPVTVAEFRRFVKATGYVTT 86

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           A++  D   F P    +     S V         LD                     Y  
Sbjct: 87  AQQQPDAADF-PDADPDLLVPGSMVFTPPDGPVPLDD--------------------YRR 125

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W Y           P   W HPEG  S +  R  HPV HVS+ DA AY 
Sbjct: 126 W----------WSY----------VPRADWRHPEGPKSNVGERNRHPVTHVSYFDACAYA 165

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+  ++ W ++  P G    NVWQG+FP  N   DGY  T
Sbjct: 166 RWAGKSLPTEAEWEFAARGGLDRAVYVWVDDREPHGRPGGNVWQGQFPWENLETDGYYGT 225

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--GPT-------------- 296
           +PV  ++ N FGL++M GNVWEWT D    HH P+   + K   PT              
Sbjct: 226 SPVGRFRPNGFGLHDMAGNVWEWTRD----HHTPSHGESGKNVAPTGACCILRNPIQESA 281

Query: 297 -----TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                T   +  KGGS+LC+  YC R+R A R  +T D+S G++GFRC
Sbjct: 282 AHSDETYARRTVKGGSHLCSPNYCDRYRPAVRQGHTEDASTGHIGFRC 329


>gi|169631382|ref|YP_001705031.1| hypothetical protein MAB_4305 [Mycobacterium abscessus ATCC 19977]
 gi|421025292|ref|ZP_15488335.1| hypothetical protein MA3A0731_4709 [Mycobacterium abscessus
           3A-0731]
 gi|421036856|ref|ZP_15499873.1| hypothetical protein MA3A0930S_4565 [Mycobacterium abscessus
           3A-0930-S]
 gi|169243349|emb|CAM64377.1| Conserved hypothetical protein [Mycobacterium abscessus]
 gi|392208815|gb|EIV34387.1| hypothetical protein MA3A0731_4709 [Mycobacterium abscessus
           3A-0731]
 gi|392220708|gb|EIV46232.1| hypothetical protein MA3A0930S_4565 [Mycobacterium abscessus
           3A-0930-S]
          Length = 309

 Score =  191 bits (485), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 167/339 (49%), Gaps = 59/339 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 22  ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 78

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 79  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 102

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 103 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 152

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG    ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 153 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 211

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY--NPK-GPTTGTDKVKKG 305
           +  T+PV ++  N FGL +M+GNVWEWT+  +  H   AP    +P   P     +V KG
Sbjct: 212 WHGTSPVGTFPANDFGLLDMIGNVWEWTSTRFQ-HGAGAPKSCCSPSDNPDPSVIQVLKG 270

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           GS+LC  +YC+R+R AARS  + DS+  ++GFRC  + G
Sbjct: 271 GSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 309


>gi|420911955|ref|ZP_15375267.1| hypothetical protein MA6G0125R_3479 [Mycobacterium abscessus
           6G-0125-R]
 gi|420918414|ref|ZP_15381717.1| hypothetical protein MA6G0125S_4524 [Mycobacterium abscessus
           6G-0125-S]
 gi|420923577|ref|ZP_15386873.1| hypothetical protein MA6G0728S_4207 [Mycobacterium abscessus
           6G-0728-S]
 gi|420929238|ref|ZP_15392517.1| hypothetical protein MA6G1108_4447 [Mycobacterium abscessus
           6G-1108]
 gi|420968928|ref|ZP_15432131.1| hypothetical protein MM3A0810R_4690 [Mycobacterium abscessus
           3A-0810-R]
 gi|420979576|ref|ZP_15442753.1| hypothetical protein MA6G0212_4506 [Mycobacterium abscessus
           6G-0212]
 gi|420984961|ref|ZP_15448128.1| hypothetical protein MA6G0728R_4450 [Mycobacterium abscessus
           6G-0728-R]
 gi|421009763|ref|ZP_15472872.1| hypothetical protein MA3A0119R_4618 [Mycobacterium abscessus
           3A-0119-R]
 gi|421015133|ref|ZP_15478208.1| hypothetical protein MA3A0122R_4716 [Mycobacterium abscessus
           3A-0122-R]
 gi|421030713|ref|ZP_15493743.1| hypothetical protein MA3A0930R_4629 [Mycobacterium abscessus
           3A-0930-R]
 gi|392111305|gb|EIU37075.1| hypothetical protein MA6G0125S_4524 [Mycobacterium abscessus
           6G-0125-S]
 gi|392113949|gb|EIU39718.1| hypothetical protein MA6G0125R_3479 [Mycobacterium abscessus
           6G-0125-R]
 gi|392126226|gb|EIU51977.1| hypothetical protein MA6G1108_4447 [Mycobacterium abscessus
           6G-1108]
 gi|392128230|gb|EIU53980.1| hypothetical protein MA6G0728S_4207 [Mycobacterium abscessus
           6G-0728-S]
 gi|392163854|gb|EIU89543.1| hypothetical protein MA6G0212_4506 [Mycobacterium abscessus
           6G-0212]
 gi|392169957|gb|EIU95635.1| hypothetical protein MA6G0728R_4450 [Mycobacterium abscessus
           6G-0728-R]
 gi|392195369|gb|EIV20988.1| hypothetical protein MA3A0119R_4618 [Mycobacterium abscessus
           3A-0119-R]
 gi|392198205|gb|EIV23819.1| hypothetical protein MA3A0122R_4716 [Mycobacterium abscessus
           3A-0122-R]
 gi|392218595|gb|EIV44120.1| hypothetical protein MA3A0930R_4629 [Mycobacterium abscessus
           3A-0930-R]
 gi|392244584|gb|EIV70062.1| hypothetical protein MM3A0810R_4690 [Mycobacterium abscessus
           3A-0810-R]
          Length = 297

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 167/339 (49%), Gaps = 59/339 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 10  ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 66

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 67  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 90

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 91  RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 140

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG    ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 141 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 199

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY--NPK-GPTTGTDKVKKG 305
           +  T+PV ++  N FGL +M+GNVWEWT+  +  H   AP    +P   P     +V KG
Sbjct: 200 WHGTSPVGTFPANDFGLLDMIGNVWEWTSTRFQ-HGAGAPKSCCSPSDNPDPSVIQVLKG 258

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           GS+LC  +YC+R+R AARS  + DS+  ++GFRC  + G
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 297


>gi|259047396|ref|ZP_05737797.1| sulfatase-modifying factor 1 [Granulicatella adiacens ATCC 49175]
 gi|259035587|gb|EEW36842.1| sulfatase-modifying factor 1 [Granulicatella adiacens ATCC 49175]
          Length = 283

 Score =  190 bits (483), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 121/337 (35%), Positives = 164/337 (48%), Gaps = 63/337 (18%)

Query: 14  DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + + L   TF+MG T     I+D E P     + AF + +  V+N +F+ FV  TGY T 
Sbjct: 2   NYIPLKAGTFQMGDTENRGFIEDHESPCVIKEVQAFEVAETTVTNEEFKRFVDETGYQTT 61

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE  G                              DS + H +  P    +++       
Sbjct: 62  AELIG------------------------------DSYVFHLLVSPEKRENYSHVSGSPW 91

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W                      L P   W HP G +S+IE+ ++HPVVHVS  DA+AYC
Sbjct: 92  WL---------------------LVPGACWKHPFGPESSIENLLDHPVVHVSLKDALAYC 130

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W GA LPTE +WEY  RGG   + +PWG  L   G + AN WQGEFP  N+  DG+L T
Sbjct: 131 EWAGAGLPTETQWEYAARGGTSTQ-YPWGEELEQDGIYHANTWQGEFPNQNSELDGFLGT 189

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----SYNPKGP-TTGTDKVKKGGS 307
           APV S++ N +GLY M+GNVWEW      ++H   P      N K P     +   +GGS
Sbjct: 190 APVKSFEPNGYGLYQMIGNVWEWC-----INHRGIPFEDIEANAKAPLNLEVEYAIRGGS 244

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           +LC+  YC R+R AAR+ +  +S+A +LGFRC    G
Sbjct: 245 FLCHCSYCNRYRVAARNGSPWNSTASHLGFRCIRPLG 281


>gi|421020230|ref|ZP_15483286.1| hypothetical protein MA3A0122S_4471 [Mycobacterium abscessus
           3A-0122-S]
 gi|392205953|gb|EIV31536.1| hypothetical protein MA3A0122S_4471 [Mycobacterium abscessus
           3A-0122-S]
          Length = 292

 Score =  190 bits (483), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 167/339 (49%), Gaps = 59/339 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 5   ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 61

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 62  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 85

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 86  RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 135

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG    ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 136 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 194

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY--NPK-GPTTGTDKVKKG 305
           +  T+PV ++  N FGL +M+GNVWEWT+  +  H   AP    +P   P     +V KG
Sbjct: 195 WHGTSPVGTFPANDFGLLDMIGNVWEWTSTRFQ-HGAGAPKSCCSPSDNPDPSVIQVLKG 253

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           GS+LC  +YC+R+R AARS  + DS+  ++GFRC  + G
Sbjct: 254 GSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 292


>gi|87300701|ref|ZP_01083543.1| hypothetical protein WH5701_04615 [Synechococcus sp. WH 5701]
 gi|87284572|gb|EAQ76524.1| hypothetical protein WH5701_04615 [Synechococcus sp. WH 5701]
          Length = 307

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 123/347 (35%), Positives = 172/347 (49%), Gaps = 66/347 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P  +F MG++        E P+  V ++ F++D+  V+N QF++FV ATG+VT AE
Sbjct: 1   MVWIPAGSFLMGSDD---HYPEESPAHRVVVEGFWIDRSPVTNAQFRKFVKATGHVTLAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D   +   L E                  L +        P   IS  D   +    
Sbjct: 58  RPADPAAYPDALPE------------------LLAPSSIVFVPPPGPISTGDPYRW---- 95

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W+Y     L+       +W HPEG  S+I+ R +HPVVHV+  DA+AY  W
Sbjct: 96  ---------WQY-----LQG-----ANWRHPEGPGSSIKSRDHHPVVHVAHEDALAYAAW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LP+EAEWE   RGG+E   F WG  L P G   AN +QG+FP +N+  DG+  T+P
Sbjct: 137 AGKALPSEAEWERAARGGVEGAEFAWGEELHPGGRPMANTFQGDFPHHNSLLDGWERTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY----------NPKGPT-------- 296
           V S+  N +GL +M+GNVWEWT DW+  H     ++          +P+G +        
Sbjct: 197 VGSFPPNGYGLLDMIGNVWEWTDDWYAGHGATVAAHKQAGGCCTIESPRGASRQESIDTA 256

Query: 297 ----TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               T   KV KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 257 SQHGTIPRKVVKGGSFLCAPSYCRRYRPAARMAQGIDTSTCHMGFRC 303


>gi|260584577|ref|ZP_05852323.1| sulfatase-modifying factor 1 [Granulicatella elegans ATCC 700633]
 gi|260157600|gb|EEW92670.1| sulfatase-modifying factor 1 [Granulicatella elegans ATCC 700633]
          Length = 290

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 117/336 (34%), Positives = 165/336 (49%), Gaps = 68/336 (20%)

Query: 14  DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + V +P   F MG + +   ++D E P+    + AF + +  V+N +F EFV ATGY T 
Sbjct: 2   EFVQIPAGKFLMGDSTRQGFVEDYEGPAVEKEVHAFEMAKTPVTNAEFLEFVQATGYQTL 61

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+ GD++VF  LL EE+                       R ++P V            
Sbjct: 62  AERLGDSYVFHLLLPEEK-----------------------RENYPHV------------ 86

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                  T + W          ++ P   W  P G  S IE   +HPVVH++  DA+AYC
Sbjct: 87  -------TGSPWW---------KVVPNAHWAQPFGEGSNIEGIEDHPVVHIALEDAIAYC 130

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W   +LPTE +WEY  R  +    +PWG +L P   +RAN WQG+FP  N A DG++ T
Sbjct: 131 RWAKVQLPTEVQWEYAARA-MSISTYPWGEDLVPESGYRANTWQGDFPNVNDALDGFVGT 189

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP-KGPTTGTDKVK-------K 304
           APV S++ N FGLY M+GN+WEW +       HP     P        D +K       +
Sbjct: 190 APVYSFEPNDFGLYQMIGNIWEWCS-------HPRGIVLPLVEERISLDSIKPSGEFAIR 242

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GGS+LC+  YC R+R AAR+     S+  ++GFRC 
Sbjct: 243 GGSFLCHCSYCNRYRVAARNGAFVTSTTSHMGFRCV 278


>gi|424858974|ref|ZP_18282988.1| sulfatase-modifying factor 1 [Rhodococcus opacus PD630]
 gi|356661483|gb|EHI41794.1| sulfatase-modifying factor 1 [Rhodococcus opacus PD630]
          Length = 312

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 163/346 (47%), Gaps = 63/346 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+M  +PG TF MG+         E P   VT+D F++D H+V+  +F+ FV ATG+VT 
Sbjct: 7   KNMAWIPGSTFWMGSED---FYPEERPVHQVTVDGFWMDTHQVTVAEFRRFVKATGHVTT 63

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE                      +  D  ++ G D  +          +     V   T
Sbjct: 64  AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 91

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                L     W             P   W HPEG DS +  R  HPV HVSW DA AY 
Sbjct: 92  AGPVPLDDFTRWW---------SFTPGADWRHPEGPDSNVGGRERHPVTHVSWFDARAYA 142

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+ + F WG    P G    NVWQG FP  N   DG+  T
Sbjct: 143 EWAGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGFAGT 202

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTD---- 300
           +PV  ++ N + L +M GNVWEWT D++ V H     + AP+ +   P  P   T     
Sbjct: 203 SPVGHFRPNGYELSDMAGNVWEWTTDYFTVDHSESGKNVAPTSSCCIPTNPRVDTARAEN 262

Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                  +V KGGS+LC   YC R+R +AR  +T ++S  ++GFRC
Sbjct: 263 PDEPYARRVIKGGSHLCAPNYCLRYRPSARQGDTEETSTCHIGFRC 308


>gi|424888891|ref|ZP_18312494.1| hypothetical protein Rleg10DRAFT_2973 [Rhizobium leguminosarum bv.
           trifolii WSM2012]
 gi|393174440|gb|EJC74484.1| hypothetical protein Rleg10DRAFT_2973 [Rhizobium leguminosarum bv.
           trifolii WSM2012]
          Length = 324

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 121/347 (34%), Positives = 166/347 (47%), Gaps = 65/347 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +D + + G  F MG+++       E P   V +D F++D   V+N QF EFV ATGYVT 
Sbjct: 19  RDQIWIDGGGFLMGSDQHY---PEEAPVHPVKVDGFWIDVAPVTNRQFAEFVEATGYVTL 75

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEK  +   + P    E     S V    KR  G                          
Sbjct: 76  AEKASNPEDY-PGAPPEMLKPGSVVFDPPKRVSG-------------------------- 108

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
              +  P   ++ +G             +W  P G  S +  +++HPVV V++ DA+AY 
Sbjct: 109 ---SGPPQWWKFRFGA------------NWRRPYGGLSNLRGKLDHPVVQVAFADAMAYA 153

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            WRG  LPTEAEWE+  +GGLE + F WG+   P G    N W G FPT N   D +  T
Sbjct: 154 LWRGKDLPTEAEWEFAAKGGLEGKEFAWGDEFIPDGRFMGNTWHGLFPTENLKPDSFERT 213

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGT--------- 299
           +PV S+  N +GLY+M+GNVWEWT+D+W    HPAP+      P  P  G+         
Sbjct: 214 SPVGSFPPNGYGLYDMIGNVWEWTSDYWTT-RHPAPAEMSCCIPVNPRAGSAEASRDPGQ 272

Query: 300 ------DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                  +V KGGS+LC   YC R+R AAR     +S+  ++GFRC 
Sbjct: 273 PDIRIARRVLKGGSHLCAPNYCRRYRPAARHAQEENSATTHIGFRCV 319


>gi|419716090|ref|ZP_14243488.1| hypothetical protein S7W_16633 [Mycobacterium abscessus M94]
 gi|382941296|gb|EIC65615.1| hypothetical protein S7W_16633 [Mycobacterium abscessus M94]
          Length = 297

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 166/344 (48%), Gaps = 69/344 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 10  ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 66

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 67  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 90

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 91  RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 140

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG    ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 141 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 199

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------PSYNPKGPTTGTD 300
           +  T+PV ++  N FGL +M+GNVWEWT+       H A        PS NP        
Sbjct: 200 WHGTSPVGTFPANDFGLLDMIGNVWEWTS---TRFQHGADTPKSCCSPSDNPD---PSVI 253

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           +V KGGS+LC  +YC+R+R AARS  + DS+  ++GFRC  + G
Sbjct: 254 QVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 297


>gi|226187115|dbj|BAH35219.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
          Length = 320

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 125/346 (36%), Positives = 171/346 (49%), Gaps = 63/346 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+M L+PG T+ MG+         E P   VT+D F++D   V+  +F+ FV ATG+VT 
Sbjct: 12  KNMSLIPGGTYWMGSED---FYPEERPVHQVTVDGFWMDTTAVTVAEFRRFVKATGHVTT 68

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE         P  ++   A  + +      F           + P   +S +D   Y  
Sbjct: 69  AE-------IAPDAADYPDADPALLVPGSLVF-----------NSPPGPVSLDD---YTQ 107

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W +           P   W HPEG DS I  R  HPV HVS+ DA AY 
Sbjct: 108 W----------WSF----------TPGADWRHPEGSDSNIGGRERHPVTHVSYFDAQAYA 147

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+ + + WG++ +P G    NVWQG+FP  N   DG+  T
Sbjct: 148 AWAGKELPTEAEWEFAARGGLDRQAYVWGDHDSPGGRPGGNVWQGQFPWENLLEDGHERT 207

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTG------ 298
           APV  ++ N +GL++M GNVWEWTAD +   H     + AP+ +   P+ P         
Sbjct: 208 APVGKFRPNGYGLFDMAGNVWEWTADHYTSDHAHSSKNVAPASSCCIPRNPRAEFATEAL 267

Query: 299 -----TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                  +V KGGS+LC   YC R+R AAR   + +SS  ++GFRC
Sbjct: 268 AGEPYARRVIKGGSHLCAPNYCLRYRPAARQGESEESSTCHVGFRC 313


>gi|124022136|ref|YP_001016443.1| hypothetical protein P9303_04261 [Prochlorococcus marinus str. MIT
           9303]
 gi|123962422|gb|ABM77178.1| Uncharacterized conserved protein [Prochlorococcus marinus str. MIT
           9303]
          Length = 287

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 120/334 (35%), Positives = 168/334 (50%), Gaps = 61/334 (18%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDG-------EFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           +MV +P   +R+G ++     DG       E P+R V LD+F +D   V+N QF+ FVSA
Sbjct: 4   EMVTIPAGLYRVGCDR--CYPDGSVRCYPEEAPAREVQLDSFQIDVGPVTNAQFRAFVSA 61

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           T ++T +E   D  ++ P L  EER   S V      F+   +T++         +SW  
Sbjct: 62  TQHLTVSELPPDPTLY-PDLPPEERIPESVV------FQPPPATVDRSKP-----LSWWT 109

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
            VA   WR                             HP+G +STI+   +HPVVHV++ 
Sbjct: 110 LVAGADWR-----------------------------HPQGPESTIDGLDDHPVVHVAYA 140

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA+AY  W G RLP+  EWE   RGGL +  + WGN LTP     AN+WQG FP +N   
Sbjct: 141 DAIAYAHWAGKRLPSAEEWEVAARGGLVDAQYAWGNELTPNNRWMANIWQGPFPWHNEEL 200

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
           DG+  T+PV S+  N +GL ++ GNVWEWT            S  P        +  KGG
Sbjct: 201 DGWFWTSPVGSFPANGYGLLDVCGNVWEWTN-----------SVYPVASGHQERRTIKGG 249

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           S+LC + YC R+R +A    T D++  ++GFRCA
Sbjct: 250 SFLCADNYCVRYRPSALQGQTVDTATCHMGFRCA 283



 Score = 44.7 bits (104), Expect = 0.078,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           +  KGGS+LC + YC R+R +A    T D++  ++GFRCA
Sbjct: 244 RTIKGGSFLCADNYCVRYRPSALQGQTVDTATCHMGFRCA 283


>gi|89069430|ref|ZP_01156785.1| hypothetical protein OG2516_18250 [Oceanicola granulosus HTCC2516]
 gi|89045060|gb|EAR51138.1| hypothetical protein OG2516_18250 [Oceanicola granulosus HTCC2516]
          Length = 314

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 124/320 (38%), Positives = 155/320 (48%), Gaps = 53/320 (16%)

Query: 25  MGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEP 84
           +GT +P + +DGE P R V L  F  D   V+N +F  FV+ATGYVT A           
Sbjct: 45  VGTRRPEIPEDGEGPVRPVRLRPFRCDSFPVTNARFAAFVAATGYVTVA----------- 93

Query: 85  LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEW 144
                ER    QV      F GL +  E           W                   W
Sbjct: 94  -----ERFGWGQV------FAGLMNPAEAAAARAFPGTPW-------------------W 123

Query: 145 EYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAE 204
                     R  P   W  PEG  S I  R +HP VHV W+DA A+ TW G RLP+EAE
Sbjct: 124 ----------RAVPGACWHAPEGPGSGITGRASHPAVHVGWDDARAFATWAGGRLPSEAE 173

Query: 205 WEYGCRGGLENRLFPWGN-NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
           WE+  RGGL+   FPWG+      G    N+WQG FP ++TAADG+  T+PV +Y  N  
Sbjct: 174 WEHAARGGLDQARFPWGDREPDDTGFLPCNIWQGRFPDHDTAADGHAGTSPVGAYPANGA 233

Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAA 322
           GL++M GNVWEWTAD + V    + +          + +V KGGS+LC+  YCYR+R AA
Sbjct: 234 GLHDMAGNVWEWTADPFRVRSAKSAAAARNAHARQENQRVTKGGSFLCHISYCYRYRIAA 293

Query: 323 RSQNTPDSSAGNLGFRCAAD 342
           RS    DS A N G R   D
Sbjct: 294 RSGTAADSGASNTGMRVFYD 313



 Score = 49.7 bits (117), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/42 (54%), Positives = 27/42 (64%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           +V KGGS+LC+  YCYR+R AARS    DS A N G R   D
Sbjct: 272 RVTKGGSFLCHISYCYRYRIAARSGTAADSGASNTGMRVFYD 313


>gi|404447445|ref|ZP_11012507.1| hypothetical protein MVAC_29103 [Mycobacterium vaccae ATCC 25954]
 gi|403648882|gb|EJZ04367.1| hypothetical protein MVAC_29103 [Mycobacterium vaccae ATCC 25954]
          Length = 295

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 119/334 (35%), Positives = 166/334 (49%), Gaps = 56/334 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMV +P  T  +G+++       E P+R VT+D F++ +H+V+N QF EFVSAT Y+T A
Sbjct: 7   DMVWIPAQTAVVGSDE---HYPEEGPAREVTVDGFWMQRHQVTNAQFAEFVSATDYLTVA 63

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV--VHISWNDAVAYC 131
           E+  D  ++     +  +                 S + HR   PV   H+S      + 
Sbjct: 64  ERPLDASLYPGAPPQNLQPG---------------SMVFHRTTGPVDLRHVS-----QWW 103

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
           TW                        P   W HP G  S++ +R  HPVVH++++DA  Y
Sbjct: 104 TWT-----------------------PGACWKHPRGPRSSLRNREAHPVVHIAFDDAATY 140

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWE   RGGL +  + WG+   P+G   AN W GEFP       GY  
Sbjct: 141 AQWAGYELPTEAEWEVAARGGLAHARYTWGDEPEPQGRPLANYWHGEFP--YLPDTGYGQ 198

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKG 305
           + PV S+  N +GL++  GNVWEWT+DW+       P     +Y+P  P      KV KG
Sbjct: 199 STPVGSFPPNGYGLHDTAGNVWEWTSDWYGSTRDIQPCCAAETYDPSQPQFKVPRKVIKG 258

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           GS+LC + YC R+R AAR     D+   ++GFRC
Sbjct: 259 GSFLCADTYCMRYRPAARRPQQVDTGMSHIGFRC 292


>gi|424881460|ref|ZP_18305092.1| hypothetical protein Rleg8DRAFT_3034 [Rhizobium leguminosarum bv.
           trifolii WU95]
 gi|392517823|gb|EIW42555.1| hypothetical protein Rleg8DRAFT_3034 [Rhizobium leguminosarum bv.
           trifolii WU95]
          Length = 317

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 124/347 (35%), Positives = 173/347 (49%), Gaps = 65/347 (18%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           ++ +V + G TF MG+N+       E P+  V +D F++D   V+N QF  FV ATG+VT
Sbjct: 13  FQGLVWVAGRTFTMGSNEHY---PEEAPAHPVAVDGFWIDATPVTNRQFASFVKATGHVT 69

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AE+      +   L +  RA  S V    K   G D T             W       
Sbjct: 70  VAERAPRAADYPGALPKMLRAG-SLVFTPPKAVVGPDIT------------QW------- 109

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          +++G              W HP G  + I  +++HPVVHV+++DA+AY
Sbjct: 110 -W---------SFKFGA------------DWRHPYGGRTDIRGKLDHPVVHVAYSDAMAY 147

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W    LPTEAEWE    GGL++  F WG+ L+P G+  AN WQG FPT +T   G   
Sbjct: 148 AHWAEKDLPTEAEWELAACGGLDDAEFAWGDELSPEGKLMANTWQGTFPTLSTKPAGADR 207

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-------YNPKGPTTGTD---- 300
           T+PV S+  N +G+Y+M+GNVWEWT+D+W+   HP P+        NP+G          
Sbjct: 208 TSPVGSFPSNGYGIYDMIGNVWEWTSDFWST-RHPEPASRSCCIPSNPRGGDADASYDLR 266

Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                   +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 267 LPDIRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 313


>gi|418049943|ref|ZP_12688030.1| Sulphatase-modifying factor protein [Mycobacterium rhodesiae JS60]
 gi|353190848|gb|EHB56358.1| Sulphatase-modifying factor protein [Mycobacterium rhodesiae JS60]
          Length = 290

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 119/331 (35%), Positives = 167/331 (50%), Gaps = 49/331 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D+V++PG +FRMG+         E P    T+ +F +++H V+N QF +FV+ATGYVT A
Sbjct: 4   DLVVVPGGSFRMGSTS---FYPEEAPIHTATVGSFAIERHPVTNAQFADFVAATGYVTIA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++ P ++E +    + V      F      +  R                  W
Sbjct: 61  ERSLDPALY-PGVAEADLQPGALV------FRPTPGPVNLR-----------------DW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P  SW  P G  S I  + +HPVV V++ DA AY  
Sbjct: 97  R-------QWWDWA----------PGASWRTPFGPGSDIADKADHPVVQVAYPDANAYAA 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W   RLPTEAEWEY  R G     + WG+  T  G   AN WQG FP  N  A G++ T+
Sbjct: 140 WADRRLPTEAEWEYAARAG-STGPYAWGDEPTVGGALMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTDKVKKGGSYL 309
           PV ++  N +GL +M+GNVWEWT   ++ HH      +    P  P     +  KGGS+L
Sbjct: 199 PVGTFPPNGYGLVDMIGNVWEWTTTRYSAHHRLDQQNSCCPPPTEPDPAVSQTLKGGSHL 258

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           C  +YC+R+R AARS  + DSS  ++GFRCA
Sbjct: 259 CAPEYCHRYRPAARSPQSQDSSTTHIGFRCA 289


>gi|226363620|ref|YP_002781402.1| hypothetical protein ROP_42100 [Rhodococcus opacus B4]
 gi|226242109|dbj|BAH52457.1| hypothetical protein [Rhodococcus opacus B4]
          Length = 312

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 162/346 (46%), Gaps = 63/346 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+M  +PG TF MG+         E P   VT+D F++D H+V+  +F+ FV  TG+VT 
Sbjct: 7   KNMAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDSHQVTVAEFRRFVKDTGHVTT 63

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE                      +  D  ++ G D  +          +     V   T
Sbjct: 64  AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 91

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                L     W             P   W HPEG  S +  R  HPV HVSW DA AY 
Sbjct: 92  PGPVPLDDFTRWW---------SFTPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAYA 142

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+ + F WG+   P G    NVWQG+FP  N   DG+  T
Sbjct: 143 EWAGKDLPTEAEWEFAARGGLDRKPFVWGDEHEPGGRPGGNVWQGQFPWENLLEDGFAGT 202

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-------PAPS----YNPKGPTTGTD- 300
           +PV  ++ N +GL +M GNVWEWT D++   H        PA S     NP+  T     
Sbjct: 203 SPVGHFRSNGYGLGDMAGNVWEWTTDYFTADHSAGGKNVAPASSCCIPANPRIDTAAAPD 262

Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                  +V KGGS+LC   YC R+R AAR  +T ++S  ++GFRC
Sbjct: 263 PHEPYARRVIKGGSHLCAPNYCLRYRPAARQGDTEETSTCHIGFRC 308


>gi|409996049|ref|YP_006750450.1| sulfatase-modifying factor 1 [Lactobacillus casei W56]
 gi|418009877|ref|ZP_12649664.1| sulfatase modifying factor [Lactobacillus casei Lc-10]
 gi|406357061|emb|CCK21331.1| Sulfatase-modifying factor 1 [Lactobacillus casei W56]
 gi|410555004|gb|EKQ28969.1| sulfatase modifying factor [Lactobacillus casei Lc-10]
          Length = 287

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/329 (37%), Positives = 164/329 (49%), Gaps = 63/329 (19%)

Query: 18  LPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
           + G +F+MGTN  I   +D E P+  V + +F +    V+N  F +F++AT Y T AE+ 
Sbjct: 9   IKGGSFQMGTNDRIGFGEDYEGPTTIVHVPSFSMADTPVTNADFDDFIAATAYQTVAERL 68

Query: 77  GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
           G +FVFE L+ EEER                  T +H              VA   W   
Sbjct: 69  GSSFVFELLIPEEERV-----------------TYQH--------------VAGAPW--- 94

Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
                  W           L P   W HP G +S+     NHPVVHV+  DA+AYC W  
Sbjct: 95  -------W----------LLVPGADWQHPYGAESSNIDLDNHPVVHVALEDALAYCQWSH 137

Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
           ++LPTEA+WEY    G     +PWG +L       AN WQG+FP +NTA DG++ TAPV 
Sbjct: 138 SQLPTEAQWEYAAGAGTATT-YPWGESLVDEHGFHANTWQGDFPNDNTAEDGFVGTAPVK 196

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP------KGPTTGTDKVKKGGSYLC 310
           SY+ N  GLY M+GNVWEW     N  +     +N       K P  G   + +GGS+LC
Sbjct: 197 SYEPNSNGLYQMIGNVWEWCR---NPRYTLLDDFNAEQFKLGKVPAAGEYAI-RGGSFLC 252

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +  YC R+R AAR+     S++ +L FRC
Sbjct: 253 HCSYCNRYRTAARNGVDLQSTSSHLSFRC 281


>gi|330826800|ref|YP_004390103.1| sulfatase-modifying factor protein [Alicycliphilus denitrificans
           K601]
 gi|329312172|gb|AEB86587.1| Sulphatase-modifying factor protein [Alicycliphilus denitrificans
           K601]
          Length = 375

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 120/347 (34%), Positives = 159/347 (45%), Gaps = 56/347 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MG      + + E P   V +  F++D+  V+N QF++FV ATGYVT AE
Sbjct: 60  MVWVPAGRFVMGNTSSKALPN-EKPGHGVRVAGFWMDRTHVTNRQFEQFVRATGYVTTAE 118

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +  D    +  L     R     +      F G    +                  Y  W
Sbjct: 119 RVPDWPSLQAQLPPGTPRPPAGAMVPGALVFAGTSQPV--------------SLADYMRW 164

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y           P  SW HP+G  S +  + +HPVV VS+ DA AY  
Sbjct: 165 ----------WRY----------MPGASWRHPQGPQSDLRGKEDHPVVQVSYEDAQAYAR 204

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE---FPTNNTAADGYL 250
           W G RLPTEAEWEY  RGGL+ + F WG+ L P G   A  W  +   FP    +A    
Sbjct: 205 WAGKRLPTEAEWEYAARGGLDQQRFAWGHLLEPEGRPMARTWSAQPQVFPV--ASAKVMP 262

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNV-----HHHPAPSYNPKGPTTGTD----- 300
            TAPV SY  N +GLY+M GN W+W ADW+               NP+GP    D     
Sbjct: 263 GTAPVASYAPNGYGLYDMAGNAWQWVADWYRYDAFARQARKGDVANPRGPGESFDPEGLR 322

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                +V +GGS+LC+E+YC  +R +AR    P S++ N+GFR   D
Sbjct: 323 ADAPKRVIRGGSFLCSEEYCEGYRVSARQGQDPYSASANVGFRLVLD 369


>gi|191637195|ref|YP_001986361.1| Zgc:136465 protein (Fragment), partial [Lactobacillus casei BL23]
 gi|385818907|ref|YP_005855294.1| Sulfatase-like protein modifying factor 1 [Lactobacillus casei
           LC2W]
 gi|385822077|ref|YP_005858419.1| Sulfatase-like protein modifying factor 1 [Lactobacillus casei
           BD-II]
 gi|190711497|emb|CAQ65503.1| Zgc:136465 protein (Fragment) [Lactobacillus casei BL23]
 gi|327381234|gb|AEA52710.1| Sulfatase-like protein modifying factor 1 [Lactobacillus casei
           LC2W]
 gi|327384404|gb|AEA55878.1| Sulfatase-like protein modifying factor 1 [Lactobacillus casei
           BD-II]
          Length = 284

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/329 (37%), Positives = 164/329 (49%), Gaps = 63/329 (19%)

Query: 18  LPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
           + G +F+MGTN  I   +D E P+  V + +F +    V+N  F +F++AT Y T AE+ 
Sbjct: 6   IKGGSFQMGTNDRIGFGEDYEGPTTIVHVPSFSMADTPVTNADFDDFIAATAYQTVAERL 65

Query: 77  GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
           G +FVFE L+ EEER                  T +H              VA   W   
Sbjct: 66  GSSFVFELLIPEEERV-----------------TYQH--------------VAGAPW--- 91

Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
                  W           L P   W HP G +S+     NHPVVHV+  DA+AYC W  
Sbjct: 92  -------W----------LLVPGADWQHPYGAESSNIDLDNHPVVHVALEDALAYCQWSH 134

Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
           ++LPTEA+WEY    G     +PWG +L       AN WQG+FP +NTA DG++ TAPV 
Sbjct: 135 SQLPTEAQWEYAAGAGTATT-YPWGESLVDEHGFHANTWQGDFPNDNTAEDGFVGTAPVK 193

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP------KGPTTGTDKVKKGGSYLC 310
           SY+ N  GLY M+GNVWEW     N  +     +N       K P  G   + +GGS+LC
Sbjct: 194 SYEPNSNGLYQMIGNVWEWCR---NPRYTLLDDFNAEQFKLGKVPAAGEYAI-RGGSFLC 249

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +  YC R+R AAR+     S++ +L FRC
Sbjct: 250 HCSYCNRYRTAARNGVDLQSTSSHLSFRC 278


>gi|399061236|ref|ZP_10746002.1| hypothetical protein PMI02_02295 [Novosphingobium sp. AP12]
 gi|398036048|gb|EJL29271.1| hypothetical protein PMI02_02295 [Novosphingobium sp. AP12]
          Length = 325

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 133/358 (37%), Positives = 169/358 (47%), Gaps = 72/358 (20%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           A P E   DMV++P  TF MG+         E P R V++DAF +D+  V+N QF  FV 
Sbjct: 11  AKPAE-MDDMVVIPAGTFTMGSET---FYPEEAPLRRVSVDAFRMDRTPVTNRQFAAFVE 66

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           ATG+ T AE   D   +  +L+  +RA                S +  +   PV      
Sbjct: 67  ATGHTTVAEVAPDPKDYPGMLAGMDRAG---------------SLVFGKTTAPV------ 105

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHR--MNHPVVH 182
           D      W          W            F +G+ W  P G+ S IE      HPVV 
Sbjct: 106 DTTNPGNW----------WN-----------FTFGAYWRQPLGVGSDIESLGLWEHPVVQ 144

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           V++ DA AY  W G  LPTEAE+EY  RGGLE R + WG+ L P G   AN WQG FP  
Sbjct: 145 VAYADAEAYAKWAGKDLPTEAEFEYAARGGLEGRDYAWGDELAPGGVMMANYWQGLFPFA 204

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS---------YNPK 293
           N   DG+  T+PV +Y  N +GL +M+GN WEWT DWW+    PA S          NP+
Sbjct: 205 NQMLDGWERTSPVGTYLANGYGLVDMIGNTWEWTRDWWS--EKPAISKPSGGCCALSNPR 262

Query: 294 GPTTGTD------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           G                  KV KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 263 GAKLKDSFDPAQPGVRIGRKVIKGGSHLCAANYCQRYRPAARHPEMIDTSTTHIGFRC 320


>gi|319764583|ref|YP_004128520.1| hypothetical protein Alide_3927 [Alicycliphilus denitrificans BC]
 gi|317119144|gb|ADV01633.1| protein of unknown function DUF323 [Alicycliphilus denitrificans
           BC]
          Length = 375

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 120/347 (34%), Positives = 159/347 (45%), Gaps = 56/347 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MG      + + E P   V +  F++D+  V+N QF++FV ATGYVT AE
Sbjct: 60  MVWVPAGRFVMGNTSSKALPN-EKPGHGVRVAGFWMDRTHVTNRQFEQFVRATGYVTTAE 118

Query: 75  KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           +  D    +  L     R     +      F G    +                  Y  W
Sbjct: 119 RVPDWPSLQAQLPPGTPRPPAGAMVPGALVFAGTSQPV--------------SLADYMRW 164

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W Y           P  SW HP+G  S +  + +HPVV VS+ DA AY  
Sbjct: 165 ----------WRY----------MPGASWRHPQGPQSDLRGKEDHPVVQVSYEDAQAYAR 204

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE---FPTNNTAADGYL 250
           W G RLPTEAEWEY  RGGL+ + F WG+ L P G   A  W  +   FP    +A    
Sbjct: 205 WAGKRLPTEAEWEYAARGGLDQQRFAWGHLLEPEGRPMARTWSAQPQVFPV--ASAKVMP 262

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNV-----HHHPAPSYNPKGPTTGTD----- 300
            TAPV SY  N +GLY+M GN W+W ADW+               NP+GP    D     
Sbjct: 263 GTAPVASYAPNGYGLYDMAGNAWQWVADWYRYDAFARQARKGVVANPRGPGESFDPEGLR 322

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                +V +GGS+LC+E+YC  +R +AR    P S++ N+GFR   D
Sbjct: 323 ADAPKRVIRGGSFLCSEEYCEGYRVSARQGQDPYSASANVGFRLVLD 369


>gi|414580726|ref|ZP_11437866.1| hypothetical protein MA5S1215_3767 [Mycobacterium abscessus
           5S-1215]
 gi|420885390|ref|ZP_15348750.1| hypothetical protein MA5S0421_4067 [Mycobacterium abscessus
           5S-0421]
 gi|420892281|ref|ZP_15355628.1| hypothetical protein MA5S0422_4986 [Mycobacterium abscessus
           5S-0422]
 gi|420896743|ref|ZP_15360082.1| hypothetical protein MA5S0708_3740 [Mycobacterium abscessus
           5S-0708]
 gi|420905749|ref|ZP_15369067.1| hypothetical protein MA5S1212_3495 [Mycobacterium abscessus
           5S-1212]
 gi|392079541|gb|EIU05368.1| hypothetical protein MA5S0422_4986 [Mycobacterium abscessus
           5S-0422]
 gi|392081153|gb|EIU06979.1| hypothetical protein MA5S0421_4067 [Mycobacterium abscessus
           5S-0421]
 gi|392096055|gb|EIU21850.1| hypothetical protein MA5S0708_3740 [Mycobacterium abscessus
           5S-0708]
 gi|392103653|gb|EIU29439.1| hypothetical protein MA5S1212_3495 [Mycobacterium abscessus
           5S-1212]
 gi|392115878|gb|EIU41646.1| hypothetical protein MA5S1215_3767 [Mycobacterium abscessus
           5S-1215]
          Length = 340

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 165/340 (48%), Gaps = 71/340 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  ++ +F +++H V+N QF EFV+ TGYVT A
Sbjct: 22  ELVELPGGSFPMGCN---VFYPEEMPERVSSVGSFAIERHPVTNAQFAEFVTQTGYVTVA 78

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 79  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 102

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 103 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 152

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG  + ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 153 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 211

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
           +  T+PV ++  N FGL +M+GNVWEWT    +    P         +PS NP       
Sbjct: 212 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 264

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            +V KGGS+LC  +YC+R+R AARS  + DS+  ++GFRC
Sbjct: 265 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 304


>gi|116073277|ref|ZP_01470539.1| hypothetical protein RS9916_32542 [Synechococcus sp. RS9916]
 gi|116068582|gb|EAU74334.1| hypothetical protein RS9916_32542 [Synechococcus sp. RS9916]
          Length = 282

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 120/331 (36%), Positives = 166/331 (50%), Gaps = 55/331 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + MV +P   + +G+         E P R++ + AF LD   V+N  F  FV+ TGYVT 
Sbjct: 6   QGMVQIPAGEYCLGSES---FYPEESPVRSIQVRAFSLDVAPVTNADFARFVADTGYVTV 62

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           +EK  D  ++ P L+ EE+   S V      F     T++         +SW   V    
Sbjct: 63  SEKPPDPELY-PNLAPEEQCPESAV------FIPPPPTVDRSQP-----LSWWALVEGAD 110

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR                             HP+G  + I+   +HPVVHV+++DA+AY 
Sbjct: 111 WR-----------------------------HPQGPSTQIDDLPDHPVVHVAYDDALAYA 141

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPT  EWE   RGGL  + + WG+ +TP G   ANVWQG FP  N   DG+  T
Sbjct: 142 DWAGKRLPTADEWEVAARGGLVGQDYAWGSEMTPGGRWLANVWQGPFPWRNEQTDGWFWT 201

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
           +PV S+  N +GL +M GNVWEWT+  + V         PKG      ++ KGGS+LC E
Sbjct: 202 SPVGSFPANGYGLVDMCGNVWEWTSTLFPV---------PKG--EQERRIIKGGSFLCAE 250

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
            YC+R R AA    T D++  ++GFRCA+D 
Sbjct: 251 NYCHRFRPAALMGQTTDTATCHMGFRCASDS 281



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 21/42 (50%), Positives = 30/42 (71%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           ++ KGGS+LC E YC+R R AA    T D++  ++GFRCA+D
Sbjct: 239 RIIKGGSFLCAENYCHRFRPAALMGQTTDTATCHMGFRCASD 280


>gi|420880418|ref|ZP_15343785.1| hypothetical protein MA5S0304_3812 [Mycobacterium abscessus
           5S-0304]
 gi|420900706|ref|ZP_15364037.1| hypothetical protein MA5S0817_3360 [Mycobacterium abscessus
           5S-0817]
 gi|420974532|ref|ZP_15437723.1| hypothetical protein MA5S0921_4771 [Mycobacterium abscessus
           5S-0921]
 gi|392085327|gb|EIU11152.1| hypothetical protein MA5S0304_3812 [Mycobacterium abscessus
           5S-0304]
 gi|392098067|gb|EIU23861.1| hypothetical protein MA5S0817_3360 [Mycobacterium abscessus
           5S-0817]
 gi|392162415|gb|EIU88105.1| hypothetical protein MA5S0921_4771 [Mycobacterium abscessus
           5S-0921]
          Length = 323

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 165/340 (48%), Gaps = 71/340 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  ++ +F +++H V+N QF EFV+ TGYVT A
Sbjct: 5   ELVELPGGSFPMGCN---VFYPEEMPERVSSVGSFAIERHPVTNAQFAEFVTQTGYVTVA 61

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 62  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 85

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 86  RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 135

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG  + ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 136 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 194

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
           +  T+PV ++  N FGL +M+GNVWEWT    +    P         +PS NP       
Sbjct: 195 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 247

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            +V KGGS+LC  +YC+R+R AARS  + DS+  ++GFRC
Sbjct: 248 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 287


>gi|187919798|ref|YP_001888829.1| hypothetical protein Bphyt_5096 [Burkholderia phytofirmans PsJN]
 gi|187718236|gb|ACD19459.1| protein of unknown function DUF323 [Burkholderia phytofirmans PsJN]
          Length = 375

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 128/357 (35%), Positives = 174/357 (48%), Gaps = 63/357 (17%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           KDMV + G  F MG+  P  + + E P+  V +  +++D+++V+N QF  FV+ATGYVT 
Sbjct: 47  KDMVWIAGGNFMMGSEHPRALPN-ERPAHEVKVHGYWIDRYDVTNMQFARFVAATGYVTT 105

Query: 73  AEK---FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           AE+   + D  V  P      R     +      F G D+ I           + ND   
Sbjct: 106 AERKPRWEDLAVQLP--PGTPRPADDTLVPGGLVFTGSDAPI-----------ALND--- 149

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
           Y  W          W++           P  +W HP G  S+I    +HPVV VS+ DA+
Sbjct: 150 YSRW----------WKF----------VPGANWRHPGGPGSSIAGMDHHPVVQVSYEDAL 189

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW-QGEFPTNNTAADG 248
           AY  W   RLPTEAEWEY  RGGLE   + WGN   P G+  AN W     P   TA  G
Sbjct: 190 AYARWAHKRLPTEAEWEYAARGGLEQADYAWGNEFAPAGKKMANTWDDAARPFPVTADRG 249

Query: 249 -----YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAP 288
                 + T+ V S+  N +GLY+M GNVW+W ADW+                  H P  
Sbjct: 250 SHEKVQVGTSVVGSFAPNGYGLYDMAGNVWQWVADWYRADAFRVEAASGHVVSDPHGPVS 309

Query: 289 SYNP-KGPTTGT-DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           SY+P  GPT    ++V +GGS+LC+E YC  +R +AR    P +   +LGFR A D+
Sbjct: 310 SYDPGNGPTAAAPERVTRGGSFLCSETYCISYRTSARRGTDPMNGMSHLGFRLAMDE 366


>gi|418422471|ref|ZP_12995644.1| hypothetical protein MBOL_41900 [Mycobacterium abscessus subsp.
           bolletii BD]
 gi|363996387|gb|EHM17604.1| hypothetical protein MBOL_41900 [Mycobacterium abscessus subsp.
           bolletii BD]
          Length = 309

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 121/340 (35%), Positives = 167/340 (49%), Gaps = 61/340 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  ++  F +++H V+N QF EFV  TGYVT A
Sbjct: 22  ELVELPGGSFPMGCN---VFYPEEMPERVSSVGPFAIERHPVTNAQFAEFVRQTGYVTVA 78

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++         A  S +      F      ++  +H             +  W
Sbjct: 79  ERAPDPDMYP-------GASPSDLVPGAMTFRATAGPVD--LHD------------WQQW 117

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W++           P   W HP G  S I+    HPVV V++ DA AY  
Sbjct: 118 ----------WDW----------VPGAYWRHPFGPRSDIDGAAEHPVVQVAYTDAAAYAR 157

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG  + ++ WG+ + P G   AN WQG FP  N  A G+  T+
Sbjct: 158 WAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALGWHGTS 216

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGTDKVKK 304
           PV ++  N FGL +M+GNVWEWT    +    P         +PS NP        +V K
Sbjct: 217 PVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGAPKSCCSPSDNPD---PSVIQVLK 269

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           GGS+LC  +YC+R+R AARS  + DS+  ++GFRC  + G
Sbjct: 270 GGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 309


>gi|33863783|ref|NP_895343.1| hypothetical protein PMT1516 [Prochlorococcus marinus str. MIT
           9313]
 gi|33635366|emb|CAE21691.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
           9313]
          Length = 291

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/334 (35%), Positives = 169/334 (50%), Gaps = 61/334 (18%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDG-------EFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           +MV +P   +R+G ++     DG       E P+R V LD+F +D   V+N QF+ FVSA
Sbjct: 8   EMVTIPAGLYRVGCDR--CYPDGSVRCYPEETPAREVQLDSFQIDVGPVTNAQFRAFVSA 65

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
           T ++T +E   D  ++ P L+ EER   S V      F+   +T++         +SW  
Sbjct: 66  TQHLTVSELPPDPTLY-PDLAPEERIPESVV------FQPPPATVDRSKP-----LSWWT 113

Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
            +A   WR                             HP+G +STI+   +HPVVHV++ 
Sbjct: 114 LMAGADWR-----------------------------HPQGPESTIDGLDDHPVVHVAYA 144

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA+AY  W G RLP+  EWE   RGGL +  + WGN LTP     AN+WQG FP +N   
Sbjct: 145 DAIAYAHWAGKRLPSAEEWEVAARGGLVDAQYAWGNELTPNNRWMANIWQGPFPWHNEEL 204

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
           DG+  T+PV S+  N +GL ++ GNVWEWT            S  P        +  KGG
Sbjct: 205 DGWFWTSPVGSFPANGYGLLDVCGNVWEWTN-----------SVYPVASGHQERRTIKGG 253

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           S+LC + YC R+R +A    T D++  ++GFRCA
Sbjct: 254 SFLCADNYCVRYRPSALQGQTVDTATCHMGFRCA 287



 Score = 44.7 bits (104), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 27/40 (67%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
           +  KGGS+LC + YC R+R +A    T D++  ++GFRCA
Sbjct: 248 RTIKGGSFLCADNYCVRYRPSALQGQTVDTATCHMGFRCA 287


>gi|420865847|ref|ZP_15329236.1| hypothetical protein MA4S0303_4216 [Mycobacterium abscessus
           4S-0303]
 gi|420870642|ref|ZP_15334024.1| hypothetical protein MA4S0726RA_4155 [Mycobacterium abscessus
           4S-0726-RA]
 gi|420989300|ref|ZP_15452456.1| hypothetical protein MA4S0206_4229 [Mycobacterium abscessus
           4S-0206]
 gi|421041520|ref|ZP_15504528.1| hypothetical protein MA4S0116R_4184 [Mycobacterium abscessus
           4S-0116-R]
 gi|392064563|gb|EIT90412.1| hypothetical protein MA4S0303_4216 [Mycobacterium abscessus
           4S-0303]
 gi|392070112|gb|EIT95959.1| hypothetical protein MA4S0726RA_4155 [Mycobacterium abscessus
           4S-0726-RA]
 gi|392183579|gb|EIV09230.1| hypothetical protein MA4S0206_4229 [Mycobacterium abscessus
           4S-0206]
 gi|392222448|gb|EIV47971.1| hypothetical protein MA4S0116R_4184 [Mycobacterium abscessus
           4S-0116-R]
          Length = 309

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 123/344 (35%), Positives = 165/344 (47%), Gaps = 69/344 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++  LPG +F MG N   +    E P R  +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 22  ELAELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 78

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 79  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 102

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 103 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 152

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG    ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 153 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 211

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------PSYNPKGPTTGTD 300
           +  T+PV ++  N FGL +M+GNVWEWT+       H A        PS NP        
Sbjct: 212 WHGTSPVGTFPANDFGLLDMIGNVWEWTS---TRFQHGADTPKSCCSPSDNPD---PSVI 265

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           +V KGGS+LC  +YC+R+R AARS  + DS+  ++GFRC  + G
Sbjct: 266 QVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 309


>gi|315445967|ref|YP_004078846.1| hypothetical protein Mspyr1_44560 [Mycobacterium gilvum Spyr1]
 gi|315264270|gb|ADU01012.1| uncharacterized conserved protein [Mycobacterium gilvum Spyr1]
          Length = 294

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 121/333 (36%), Positives = 167/333 (50%), Gaps = 50/333 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D+V L G +FRMG+ +       E P+  VT+  F +++H V+N QF EFV  TGY T A
Sbjct: 4   DLVDLEGGSFRMGSTR---FYPEEAPAHTVTVAPFAIERHPVTNAQFAEFVDDTGYRTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++ P  S  +    + V      F      ++ R                  W
Sbjct: 61  ERPPDPALY-PGASPHDLVPGALV------FRPTAGPVDLR-----------------DW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         WE+           P   W HP G DS+++ R  HPVV V ++DA AY  
Sbjct: 97  R-------QWWEWT----------PGADWRHPFGPDSSVDDRPEHPVVQVCYSDAAAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+    G    ++ WG+   P G+  AN WQG FP  N  A G+  T+
Sbjct: 140 WAGRRLPTEAEWEFAAGAG-STAVYAWGDEPAPGGQLMANTWQGAFPYRNDGALGWAGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVKKGGSY 308
           PV ++  N FGL +M+GNVWEWT   +  HH      P     P  P    ++  KGGS+
Sbjct: 199 PVGTFPANGFGLVDMIGNVWEWTTTRFAGHHRVNPPPPPTCCPPADPDPAVNQALKGGSH 258

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           LC  +YC+R+R +ARS  + DS+  ++GFRCAA
Sbjct: 259 LCAPEYCHRYRPSARSPQSQDSATTHIGFRCAA 291


>gi|424775855|ref|ZP_18202844.1| hypothetical protein C660_03890 [Alcaligenes sp. HPC1271]
 gi|422888827|gb|EKU31210.1| hypothetical protein C660_03890 [Alcaligenes sp. HPC1271]
          Length = 369

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 129/390 (33%), Positives = 173/390 (44%), Gaps = 90/390 (23%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +     +V +P   F MG+++       E P   V + AF +DQHEV+N QF +FV+ATG
Sbjct: 60  IANQAGLVPIPSGRFLMGSDEGY---PEEKPVHPVEVQAFQIDQHEVTNAQFAQFVAATG 116

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQV------RHDMKRFEGLDSTIEHRMHHPVVHI 122
           YVT+AE+  + F     + E+ R   S V       HD    EG      H  H P  + 
Sbjct: 117 YVTQAERVPE-FPPHVQVPEQYRQPGSAVFTPPAATHDHDHHEG------HNQHAPSGNY 169

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
           +W                   W +           P  +W HP G D+ +  R NHPVVH
Sbjct: 170 NW-------------------WVW----------VPGANWRHPGGPDTNLNGRDNHPVVH 200

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           +++ DA+AY  W G  LPTE +WE+  RGGLE   +PWGN    RG   AN WQG FP  
Sbjct: 201 IAYEDALAYARWLGRDLPTEEQWEFAARGGLEGATYPWGNTPEVRGRLMANTWQGSFPAQ 260

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKV 302
           N   DGY  TAPV  +  N +GL++ VGNVWEWT                          
Sbjct: 261 NLLRDGYEGTAPVGCFPANNYGLWDSVGNVWEWT-------------------------- 294

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNE 362
                           R A + Q+ P +    L     A+  P TG   V KGGS+LC  
Sbjct: 295 ----------------RSAWQPQHRPLAPKPLLVAATDANGNPATG---VIKGGSFLCAA 335

Query: 363 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
            +C R+R A+R       S  ++GFR   +
Sbjct: 336 NFCVRYRPASRQPQETTMSTQHVGFRTVLN 365


>gi|420875085|ref|ZP_15338461.1| hypothetical protein MA4S0726RB_3743 [Mycobacterium abscessus
           4S-0726-RB]
 gi|421045440|ref|ZP_15508440.1| hypothetical protein MA4S0116S_3292 [Mycobacterium abscessus
           4S-0116-S]
 gi|392066560|gb|EIT92408.1| hypothetical protein MA4S0726RB_3743 [Mycobacterium abscessus
           4S-0726-RB]
 gi|392234893|gb|EIV60391.1| hypothetical protein MA4S0116S_3292 [Mycobacterium abscessus
           4S-0116-S]
          Length = 297

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 123/344 (35%), Positives = 165/344 (47%), Gaps = 69/344 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++  LPG +F MG N   +    E P R  +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 10  ELAELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 66

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 67  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 90

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 91  RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 140

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG    ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 141 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 199

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------PSYNPKGPTTGTD 300
           +  T+PV ++  N FGL +M+GNVWEWT+       H A        PS NP        
Sbjct: 200 WHGTSPVGTFPANDFGLLDMIGNVWEWTS---TRFQHGADTPKSCCSPSDNPD---PSVI 253

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           +V KGGS+LC  +YC+R+R AARS  + DS+  ++GFRC  + G
Sbjct: 254 QVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 297


>gi|365872316|ref|ZP_09411854.1| hypothetical protein MMAS_42560 [Mycobacterium massiliense CCUG
           48898 = JCM 15300]
 gi|421051427|ref|ZP_15514421.1| hypothetical protein MMCCUG48898_4440 [Mycobacterium massiliense
           CCUG 48898 = JCM 15300]
 gi|363993461|gb|EHM14684.1| hypothetical protein MMAS_42560 [Mycobacterium massiliense CCUG
           48898 = JCM 15300]
 gi|392240030|gb|EIV65523.1| hypothetical protein MMCCUG48898_4440 [Mycobacterium massiliense
           CCUG 48898]
          Length = 292

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 164/340 (48%), Gaps = 71/340 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +F MG N   +    E P R  ++  F +++H V+N QF EFV+ TGYVT A
Sbjct: 5   ELVELPGGSFPMGCN---VFYPEEMPERVSSVGPFAIERHPVTNAQFAEFVTQTGYVTVA 61

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 62  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 85

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W HP G  S I+    HPVV V++ DA
Sbjct: 86  RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 135

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG  + ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 136 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 194

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
           +  T+PV ++  N FGL +M+GNVWEWT    +    P         +PS NP       
Sbjct: 195 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 247

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            +V KGGS+LC  +YC+R+R AARS  + DS+  ++GFRC
Sbjct: 248 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 287


>gi|432113885|gb|ELK35996.1| Sulfatase-modifying factor 2, partial [Myotis davidii]
          Length = 525

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 124/337 (36%), Positives = 165/337 (48%), Gaps = 85/337 (25%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F+MGTN P   +DGE P R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 269 MVKLPGGRFQMGTNSPDG-RDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEAE 327

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q      +  G  S I  R+  PVVH+SWNDA AYC W+
Sbjct: 328 TFGWSFVFEDFVSDELRNKATQ----QMKPAGPGSGIRERLELPVVHVSWNDARAYCAWQ 383

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+   P        +R N                W
Sbjct: 384 GKRLPTEEEWEFAARGGLKGQVYPWGNKFQP--------NRTN---------------LW 420

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G                     FP G+                      A DG+   +P
Sbjct: 421 QGK--------------------FPKGDK---------------------AEDGFHGVSP 439

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY++VGNVWEWTA        P    NP        +V +G S++   +
Sbjct: 440 VNAFPPQNNYGLYDLVGNVWEWTA-------SPYQGANPDM------RVLRGASWIDTSD 486

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCA++ G   G
Sbjct: 487 GSANHRARVTTRMGNTPDSASDNLGFRCASNIGRLPG 523



 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 118/232 (50%), Gaps = 65/232 (28%)

Query: 163 LHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGN 222
           + P G  S I  R+  PVVHVSWNDA AYC W+G RLPTE EWE+  RGGL+ +++PWGN
Sbjct: 351 MKPAGPGSGIRERLELPVVHVSWNDARAYCAWQGKRLPTEEEWEFAARGGLKGQVYPWGN 410

Query: 223 NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWN 281
              P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY++VGNVWEWTA    
Sbjct: 411 KFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPPQNNYGLYDLVGNVWEWTA---- 463

Query: 282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
               P    NP        +V +G S++                +T D SA         
Sbjct: 464 ---SPYQGANPDM------RVLRGASWI----------------DTSDGSAN-------- 490

Query: 342 DKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
                                   +R R   R  NTPDS++ NLGFRCA+++
Sbjct: 491 ------------------------HRARVTTRMGNTPDSASDNLGFRCASNI 518


>gi|421484810|ref|ZP_15932376.1| hypothetical protein QWC_19385 [Achromobacter piechaudii HLE]
 gi|400196843|gb|EJO29813.1| hypothetical protein QWC_19385 [Achromobacter piechaudii HLE]
          Length = 373

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 121/350 (34%), Positives = 164/350 (46%), Gaps = 63/350 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +P   F MG N   L +  E P  +V +  F++D H+V+N +F+ FV ATGYVT AE
Sbjct: 42  MAWIPPAEFLMGNNH-KLSQPNELPPHSVRVSGFWMDTHDVTNAEFRRFVEATGYVTTAE 100

Query: 75  ---KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
              K+ D  V  P      R     +      F G +S +  R               Y 
Sbjct: 101 QKPKWEDLKVQLP--PGTPRPDDGLLVPGAMVFVGSESEVSLR--------------DYT 144

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          W          R  P  SW  P+G  S+I  + +HPVV VS+ DA AY
Sbjct: 145 RW----------W----------RFVPGASWRQPQGPGSSIVGKDDHPVVQVSYEDAQAY 184

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAAD 247
             W G RLPTEAEWE+  RGG E   + WGN L+P G+  AN+W    Q  FP       
Sbjct: 185 AKWAGKRLPTEAEWEFAARGGFEQATYAWGNELSPNGKPMANIWDTQQQQPFPVVKDEKI 244

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNP 292
             + T PV ++  N +GLY+M GNVW+WTADW+                    PA S++P
Sbjct: 245 -QVGTTPVGTFPPNAYGLYDMAGNVWQWTADWYRADAFRIQAQYRQPPQDPVGPADSFDP 303

Query: 293 KG---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                P +   +V +GGS+LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 304 DDGNVPASAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNGMSHLGFRT 353


>gi|226357800|ref|YP_002787540.1| sulfatase-modifying factor 1 [Deinococcus deserti VCD115]
 gi|226320043|gb|ACO48036.1| putative sulfatase-modifying factor 1 [Deinococcus deserti VCD115]
          Length = 308

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 123/347 (35%), Positives = 166/347 (47%), Gaps = 57/347 (16%)

Query: 1   MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           M+L PA      ++M+ +PG  F+MG++        E P+ +V ++ F++D H V+N +F
Sbjct: 1   MLLRPA-----RENMLWIPGGDFQMGSDH---HYPEERPAHHVHVEGFWMDPHPVTNAEF 52

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           + FV ATGYVT AE+  D   F P +  +     S V                    P  
Sbjct: 53  RRFVEATGYVTLAERTPDPAQF-PGVPLDVLVPGSVV-----------------FQQPAG 94

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
            +   D   +             W Y           P   W  PEG  ST+  + +HP 
Sbjct: 95  PVPLYDHAGW-------------WAY----------VPGACWHQPEGPHSTLTGQDSHPA 131

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           VHV++ DA+AY  W G  LP EAEWEY  RGGL    F WG+   P+G   AN W G FP
Sbjct: 132 VHVAFEDALAYAAWAGKMLPGEAEWEYAARGGLHGATFAWGDEERPQGRVMANTWHGRFP 191

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTT 297
             N    G+  T+PV +Y  N +GL++M GNVWEWT D +   H   H      P     
Sbjct: 192 WENLDPHGFPRTSPVGTYPANGYGLHDMTGNVWEWTLDPFQPQHVIRHVKSCCTPLNTRA 251

Query: 298 GTD-----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           GT      +V KGGS+LC   YC+R+R A+R     DSS  +LGFRC
Sbjct: 252 GTGGTRERRVIKGGSHLCAPNYCFRYRPASRQGQEVDSSTSHLGFRC 298


>gi|124267866|ref|YP_001021870.1| hypothetical protein Mpe_A2681 [Methylibium petroleiphilum PM1]
 gi|124260641|gb|ABM95635.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 337

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 92/195 (47%), Positives = 120/195 (61%), Gaps = 12/195 (6%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           R  P   W HP G DS ++ R NHPVV VS++DA+AY  W G  LPTEAEWEY  RGGL+
Sbjct: 139 RYVPGADWRHPGGPDSDLDGRGNHPVVQVSYDDALAYARWAGRDLPTEAEWEYAARGGLK 198

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
            + + WG+ + P G HRAN WQGEFP  NT  DGY  T+PV  ++ N FGLY+M GNVW+
Sbjct: 199 GQRYAWGSEVRPEGVHRANTWQGEFPLRNTQEDGYAGTSPVGCFRPNGFGLYDMAGNVWQ 258

Query: 275 WTADWWNVHHH-----------PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 323
           WT+ W+   H             A S +P+ P T   +V KGGS+LC+  YC R+R +AR
Sbjct: 259 WTSSWYRPGHQAGANGRLAETPAAGSGDPRQPGTPV-RVIKGGSHLCSPDYCLRYRPSAR 317

Query: 324 SQNTPDSSAGNLGFR 338
                 ++  ++GFR
Sbjct: 318 QPQATSTATSHVGFR 332


>gi|421290358|ref|ZP_15741108.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA54354]
 gi|421305750|ref|ZP_15756404.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA62331]
 gi|395888043|gb|EJG99057.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA54354]
 gi|395904708|gb|EJH15622.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA62331]
          Length = 282

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 159/335 (47%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E P   V++  F +    V+N +F +FV  TGY T A
Sbjct: 3   MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+   +FVF   + E ER   S                     HP     W      C  
Sbjct: 63  ERQEWSFVFILFVPEAEREGYS---------------------HPAGAPWWLQVPNAC-- 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                       W HP G +S +    +HPVVHV+  DA+A+C 
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|326427798|gb|EGD73368.1| SUMF1 protein [Salpingoeca sp. ATCC 50818]
          Length = 155

 Score =  187 bits (475), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 90/155 (58%), Positives = 108/155 (69%), Gaps = 4/155 (2%)

Query: 190 AYCTWR--GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A+C W     RLPTEAEWE+   GGL+NR FPWGN + P+  H  N WQ   P  N+  D
Sbjct: 1   AFCKWSHPKGRLPTEAEWEFAASGGLKNRTFPWGNKMMPKNRHWMNTWQSTAP--NSMED 58

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
           GY  TAP  +Y  NKFGLYN VGNVWEWT DW+   H  AP+ +P+GP++G  KVKKGGS
Sbjct: 59  GYALTAPGTAYPPNKFGLYNTVGNVWEWTNDWFTNRHSAAPAVDPRGPSSGETKVKKGGS 118

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           Y+C++  CYR+R AAR   TPDSSA N+GFRCAAD
Sbjct: 119 YMCHQFTCYRYRIAARMHITPDSSAANVGFRCAAD 153



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 2/40 (5%)

Query: 129 AYCTWR--GARLPTEAEWEYGCRGGLENRLFPWGSWLHPE 166
           A+C W     RLPTEAEWE+   GGL+NR FPWG+ + P+
Sbjct: 1   AFCKWSHPKGRLPTEAEWEFAASGGLKNRTFPWGNKMMPK 40


>gi|452876422|ref|ZP_21953778.1| sulfatase-modifying factor 1 [Pseudomonas aeruginosa VRFPA01]
 gi|452186780|gb|EME13798.1| sulfatase-modifying factor 1 [Pseudomonas aeruginosa VRFPA01]
          Length = 317

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 135/358 (37%), Positives = 168/358 (46%), Gaps = 69/358 (19%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R  DM  LPG  FRMG+ +       E P R V +D F++D   V+N QF  FV ATGY 
Sbjct: 7   RTPDMRYLPGGVFRMGSER---FYPEERPVRQVRVDPFWIDVTPVTNRQFAAFVEATGYR 63

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           T AEK              E A    +  +M R   L  T       PV  +   D    
Sbjct: 64  TFAEK------------APEPADYPGMPAEMARAGSLVFT-------PVSAVV--DLADP 102

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W          W +        R   W  W  P G  S+I     HPVVHV++ DA A
Sbjct: 103 SRW----------WSF--------RFGAW--WREPLGPGSSIAGLEEHPVVHVAYEDAAA 142

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G  LP+EAEWEY  RGG +   F WG+ L P G+  AN WQG FP  N   DG+ 
Sbjct: 143 YAAWVGKALPSEAEWEYAARGGRDGEEFAWGDTLEPGGQILANYWQGRFPLENLLLDGWE 202

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP---------------------S 289
            T+PV ++  N  GL++M+GNVWEWTADW+     PA                      S
Sbjct: 203 RTSPVGAFPANGHGLFDMIGNVWEWTADWF---RQPAATDRQASSCCASRNPRGGREEDS 259

Query: 290 YNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           Y+P  P      KV KGGS+LC E YC R+R AAR   T D++ G++GFRC   + P 
Sbjct: 260 YDPALPALRIGRKVLKGGSHLCAENYCQRYRPAARIPQTIDTTTGHVGFRCVRREPPA 317


>gi|126436917|ref|YP_001072608.1| hypothetical protein Mjls_4346 [Mycobacterium sp. JLS]
 gi|126236717|gb|ABO00118.1| protein of unknown function DUF323 [Mycobacterium sp. JLS]
          Length = 291

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 123/332 (37%), Positives = 166/332 (50%), Gaps = 51/332 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +D+V +P  +  +G+++       E P+R+VT+D F++  H V+N +F  FV ATGY T 
Sbjct: 3   EDLVRIPAQSATIGSDR---HYPEEAPARDVTVDGFWIQAHAVTNAEFAAFVDATGYRTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+       +P    +  A+  Q            S +  R   PV             
Sbjct: 60  AER-----PLDPANYPDAPAQNLQP----------GSMVFRRTAGPV------------- 91

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                L   ++W Y   G          SW HP G  S+I +R +HPVVHV+++DA AY 
Sbjct: 92  ----DLRHLSQWWYWTPGA---------SWRHPIGPGSSIVNRADHPVVHVAYDDAEAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           +W G  LPTEAEWE   RGGL +  + WG+     G   AN W GEFP       GY  T
Sbjct: 139 SWAGLELPTEAEWEVAARGGLSHAGYTWGDEPEGPGHKLANYWHGEFPWRPD--RGYGRT 196

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----PSYNPKGPTTGTD-KVKKGGS 307
           APV SY  N +GLY+M GNVWEWT DW+      A     SY+P  P      +V KGGS
Sbjct: 197 APVGSYPPNGYGLYDMAGNVWEWTTDWYTDTGGSACCAEDSYDPAQPQFPVPRRVVKGGS 256

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +LC + YC R+R AAR     D+   ++GFRC
Sbjct: 257 FLCADSYCARYRPAARRPQPVDTGMSHIGFRC 288


>gi|367469483|ref|ZP_09469235.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycinee-
           generating enzyme 1) [Patulibacter sp. I11]
 gi|365815446|gb|EHN10592.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycinee-
           generating enzyme 1) [Patulibacter sp. I11]
          Length = 316

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 93/182 (51%), Positives = 118/182 (64%), Gaps = 6/182 (3%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           SW  PEG  STI+ R++HPVVHV+ +DA+A+C W G RLPTE EWE+  RGGL+   +PW
Sbjct: 140 SWRAPEGPGSTIDGRLDHPVVHVTRDDALAFCAWAGLRLPTEVEWEHAARGGLDGARYPW 199

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           G+ L P GEHR NVWQG FP  +   DG+  T PV ++  N FGL+ + GNVWEWTAD W
Sbjct: 200 GDELEPGGEHRCNVWQGAFPGEDRGEDGFRGTCPVDAFPANGFGLHEVSGNVWEWTADPW 259

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                P     P     G   + +GGS+LC+  YC R+R AAR+   P S+ G+ GFRCA
Sbjct: 260 A---RPGAVLRPA--PDGWVAI-RGGSHLCHRSYCERYRVAARTAADPASTTGHQGFRCA 313

Query: 341 AD 342
           AD
Sbjct: 314 AD 315



 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 22/40 (55%), Positives = 29/40 (72%)

Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +GGS+LC+  YC R+R AAR+   P S+ G+ GFRCAAD 
Sbjct: 277 RGGSHLCHRSYCERYRVAARTAADPASTTGHQGFRCAADA 316


>gi|404214088|ref|YP_006668282.1| hypothetical protein KTR9_1487 [Gordonia sp. KTR9]
 gi|403644887|gb|AFR48127.1| hypothetical protein KTR9_1487 [Gordonia sp. KTR9]
          Length = 274

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 122/334 (36%), Positives = 157/334 (47%), Gaps = 67/334 (20%)

Query: 16  VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
           V   GD F  G        DGE P   V L AF +D   V+N  F  FV+ATG+ T+AE 
Sbjct: 7   VFAMGDAFDEGYRT-----DGETPVHEVELSAFSIDATTVTNAAFASFVAATGHRTDAEI 61

Query: 76  FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRG 135
           FG                 S V H      G           PV    W  AV   +WR 
Sbjct: 62  FGG----------------SAVFHTYATAPG----------EPVPGTPWWLAVDGASWR- 94

Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
                                        P G  S ++   +HPVVHVS  DA AYC W 
Sbjct: 95  ----------------------------RPAGPGSVVDGLADHPVVHVSHRDAQAYCAWA 126

Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
           G  LPTEA+WEY  RGGL    FPWG+      + R N+++G+FP   T     + T PV
Sbjct: 127 GRALPTEAQWEYAARGGLHGARFPWGDEPPTADDPRCNIFRGDFPNEPT---DRVGTTPV 183

Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGPTTGTDKVKKGGSYLCNEQY 314
             ++ N  GLY   GNVWEW AD ++  ++      +P GP  G+ +V +GGS+LC++ Y
Sbjct: 184 RMFEPNGHGLYQCAGNVWEWCADTFSARYYRVSDRTDPTGPGRGSARVLRGGSHLCHDSY 243

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
           C+R+R AARS NTP+S+A N+GFR     GP  G
Sbjct: 244 CHRYRVAARSHNTPESTASNIGFRTV---GPPVG 274


>gi|149020874|ref|ZP_01835403.1| hypothetical protein CGSSp23BS72_02514 [Streptococcus pneumoniae
           SP23-BS72]
 gi|182684754|ref|YP_001836501.1| hypothetical protein SPCG_1784 [Streptococcus pneumoniae CGSP14]
 gi|221232543|ref|YP_002511697.1| hypothetical protein SPN23F_18190 [Streptococcus pneumoniae ATCC
           700669]
 gi|387758004|ref|YP_006064983.1| hypothetical protein SPNOXC_15820 [Streptococcus pneumoniae OXC141]
 gi|405760334|ref|YP_006700930.1| Sulfatase modifying factor 1 precursor
           (C-alpha-formyglycine-generating enzyme 1)
           [Streptococcus pneumoniae SPNA45]
 gi|415700292|ref|ZP_11457994.1| hypothetical protein CGSSp4595_1803 [Streptococcus pneumoniae
           459-5]
 gi|417699171|ref|ZP_12348342.1| hypothetical protein SPAR69_1729 [Streptococcus pneumoniae GA41317]
 gi|418103487|ref|ZP_12740559.1| hypothetical protein SPAR143_1767 [Streptococcus pneumoniae NP070]
 gi|418124150|ref|ZP_12761080.1| hypothetical protein SPAR82_1819 [Streptococcus pneumoniae GA44378]
 gi|418128689|ref|ZP_12765581.1| hypothetical protein SPAR144_1769 [Streptococcus pneumoniae NP170]
 gi|418137892|ref|ZP_12774729.1| hypothetical protein SPAR24_1792 [Streptococcus pneumoniae GA11663]
 gi|418149145|ref|ZP_12785907.1| hypothetical protein SPAR34_1638 [Streptococcus pneumoniae GA13856]
 gi|418178921|ref|ZP_12815502.1| hypothetical protein SPAR73_1854 [Streptococcus pneumoniae GA41565]
 gi|418232795|ref|ZP_12859381.1| hypothetical protein SPAR13_1706 [Streptococcus pneumoniae GA07228]
 gi|418237249|ref|ZP_12863815.1| hypothetical protein SPAR59_1804 [Streptococcus pneumoniae GA19690]
 gi|419453975|ref|ZP_13993945.1| hypothetical protein SPAR138_1707 [Streptococcus pneumoniae
           EU-NP03]
 gi|419473875|ref|ZP_14013723.1| hypothetical protein SPAR29_1747 [Streptococcus pneumoniae GA13430]
 gi|419476129|ref|ZP_14015965.1| hypothetical protein SPAR36_1733 [Streptococcus pneumoniae GA14688]
 gi|419482825|ref|ZP_14022612.1| hypothetical protein SPAR65_1746 [Streptococcus pneumoniae GA40563]
 gi|419487274|ref|ZP_14027036.1| hypothetical protein SPAR79_1788 [Streptococcus pneumoniae GA44128]
 gi|419493914|ref|ZP_14033639.1| hypothetical protein SPAR89_1741 [Streptococcus pneumoniae GA47210]
 gi|419506670|ref|ZP_14046331.1| hypothetical protein SPAR111_1709 [Streptococcus pneumoniae
           GA49194]
 gi|421209563|ref|ZP_15666576.1| hypothetical protein AMCSP11_001752 [Streptococcus pneumoniae
           2070005]
 gi|421225638|ref|ZP_15682376.1| hypothetical protein AMCSP06_001777 [Streptococcus pneumoniae
           2070768]
 gi|421234707|ref|ZP_15691324.1| hypothetical protein AMCSP02_001710 [Streptococcus pneumoniae
           2061617]
 gi|421239120|ref|ZP_15695684.1| hypothetical protein AMCSP15_001689 [Streptococcus pneumoniae
           2071247]
 gi|421241250|ref|ZP_15697795.1| hypothetical protein AMCSP17_001655 [Streptococcus pneumoniae
           2080913]
 gi|421245626|ref|ZP_15702124.1| hypothetical protein AMCSP10_001602 [Streptococcus pneumoniae
           2081685]
 gi|421250044|ref|ZP_15706500.1| hypothetical protein AMCSP19_001672 [Streptococcus pneumoniae
           2082239]
 gi|147930515|gb|EDK81498.1| hypothetical protein CGSSp23BS72_02514 [Streptococcus pneumoniae
           SP23-BS72]
 gi|182630088|gb|ACB91036.1| hypothetical protein SPCG_1784 [Streptococcus pneumoniae CGSP14]
 gi|220675005|emb|CAR69583.1| conserved hypothetical protein [Streptococcus pneumoniae ATCC
           700669]
 gi|301800593|emb|CBW33234.1| conserved hypothetical protein [Streptococcus pneumoniae OXC141]
 gi|332199817|gb|EGJ13892.1| hypothetical protein SPAR69_1729 [Streptococcus pneumoniae GA41317]
 gi|353774788|gb|EHD55275.1| hypothetical protein SPAR143_1767 [Streptococcus pneumoniae NP070]
 gi|353795292|gb|EHD75642.1| hypothetical protein SPAR82_1819 [Streptococcus pneumoniae GA44378]
 gi|353798685|gb|EHD79013.1| hypothetical protein SPAR144_1769 [Streptococcus pneumoniae NP170]
 gi|353811481|gb|EHD91723.1| hypothetical protein SPAR34_1638 [Streptococcus pneumoniae GA13856]
 gi|353841872|gb|EHE21924.1| hypothetical protein SPAR73_1854 [Streptococcus pneumoniae GA41565]
 gi|353886108|gb|EHE65892.1| hypothetical protein SPAR13_1706 [Streptococcus pneumoniae GA07228]
 gi|353891687|gb|EHE71441.1| hypothetical protein SPAR59_1804 [Streptococcus pneumoniae GA19690]
 gi|353900200|gb|EHE75758.1| hypothetical protein SPAR24_1792 [Streptococcus pneumoniae GA11663]
 gi|379550393|gb|EHZ15493.1| hypothetical protein SPAR29_1747 [Streptococcus pneumoniae GA13430]
 gi|379558911|gb|EHZ23943.1| hypothetical protein SPAR36_1733 [Streptococcus pneumoniae GA14688]
 gi|379579417|gb|EHZ44324.1| hypothetical protein SPAR65_1746 [Streptococcus pneumoniae GA40563]
 gi|379585643|gb|EHZ50499.1| hypothetical protein SPAR79_1788 [Streptococcus pneumoniae GA44128]
 gi|379592487|gb|EHZ57303.1| hypothetical protein SPAR89_1741 [Streptococcus pneumoniae GA47210]
 gi|379608584|gb|EHZ73330.1| hypothetical protein SPAR111_1709 [Streptococcus pneumoniae
           GA49194]
 gi|379626045|gb|EHZ90671.1| hypothetical protein SPAR138_1707 [Streptococcus pneumoniae
           EU-NP03]
 gi|381314443|gb|EIC55212.1| hypothetical protein CGSSp4595_1803 [Streptococcus pneumoniae
           459-5]
 gi|395573659|gb|EJG34249.1| hypothetical protein AMCSP11_001752 [Streptococcus pneumoniae
           2070005]
 gi|395589125|gb|EJG49447.1| hypothetical protein AMCSP06_001777 [Streptococcus pneumoniae
           2070768]
 gi|395599760|gb|EJG59922.1| hypothetical protein AMCSP02_001710 [Streptococcus pneumoniae
           2061617]
 gi|395600763|gb|EJG60918.1| hypothetical protein AMCSP15_001689 [Streptococcus pneumoniae
           2071247]
 gi|395607628|gb|EJG67725.1| hypothetical protein AMCSP17_001655 [Streptococcus pneumoniae
           2080913]
 gi|395608153|gb|EJG68249.1| hypothetical protein AMCSP10_001602 [Streptococcus pneumoniae
           2081685]
 gi|395613015|gb|EJG73047.1| hypothetical protein AMCSP19_001672 [Streptococcus pneumoniae
           2082239]
 gi|404277223|emb|CCM07732.1| Sulfatase modifying factor 1 precursor
           (C-alpha-formyglycine-generating enzyme 1)
           [Streptococcus pneumoniae SPNA45]
 gi|429319972|emb|CCP33294.1| conserved hypothetical protein [Streptococcus pneumoniae SPN034183]
 gi|429321790|emb|CCP35268.1| conserved hypothetical protein [Streptococcus pneumoniae SPN994039]
 gi|429323610|emb|CCP31309.1| conserved hypothetical protein [Streptococcus pneumoniae SPN994038]
          Length = 282

 Score =  186 bits (473), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 120/335 (35%), Positives = 160/335 (47%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E P   V++  F +    V+N +F +FV  TGY T A
Sbjct: 3   MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+   +FVF   + E ER             EG          HP     W      C  
Sbjct: 63  ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                       W HP G +S +    +HPVVHV+  DA+A+C 
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|418014464|ref|ZP_12654063.1| sulfatase modifying factor [Lactobacillus casei Lpc-37]
 gi|410553708|gb|EKQ27705.1| sulfatase modifying factor [Lactobacillus casei Lpc-37]
          Length = 287

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 122/336 (36%), Positives = 167/336 (49%), Gaps = 66/336 (19%)

Query: 11  RYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           R+K+   + G +F+MGT+  I   +D E P   V + +F +    V+N  F +F++AT Y
Sbjct: 5   RFKE---IKGGSFQMGTDDHIGFDEDYEGPPTIVRVPSFSMADTPVTNADFDDFIAATAY 61

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            T AE+ G +FVFE L+ EEER                  T +H              VA
Sbjct: 62  QTVAERLGSSFVFELLIPEEERV-----------------TYQH--------------VA 90

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W          W           L P   W HP G +S+     NHPVVHV+  DA+
Sbjct: 91  GAPW----------W----------LLVPGADWQHPYGAESSNIDLDNHPVVHVALEDAL 130

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AYC W  ++LPTEA+WEY    G     +PWG +L       AN WQG+FP +NTA DG+
Sbjct: 131 AYCQWSHSQLPTEAQWEYAAGAGTATT-YPWGESLVDEHGFHANTWQGDFPNDNTAEDGF 189

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP------KGPTTGTDKVK 303
           + TAPV SY+ N  GLY ++GNVWEW     N  +     +N       K P  G   + 
Sbjct: 190 VGTAPVKSYEPNSNGLYQIIGNVWEWCR---NPRYTLLDDFNAEQFKLGKVPAAGEYAI- 245

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +GGS+LC+  YC R+R AAR+     S++ +L FRC
Sbjct: 246 RGGSFLCHCSYCNRYRTAARNGVDLQSTSSHLSFRC 281


>gi|254383810|ref|ZP_04999158.1| conserved hypothetical protein [Streptomyces sp. Mg1]
 gi|194342703|gb|EDX23669.1| conserved hypothetical protein [Streptomyces sp. Mg1]
          Length = 309

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 126/335 (37%), Positives = 159/335 (47%), Gaps = 54/335 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TFRMG+         E P R VT+D F++D H V+  +F+ F  ATG+VT AE
Sbjct: 16  MVRVPGGTFRMGSEA---FYPEERPVRPVTVDGFWMDAHPVTVAEFRRFTKATGHVTVAE 72

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D   F       ER  +              + +      PV    W        W 
Sbjct: 73  RELDPRDF----PGAERGALVP-----------GALVFTMTPGPVDLDDWR------RW- 110

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y           P   W  P G DST   R  HPV  V++ DAVAY  W
Sbjct: 111 ---------WSYQ----------PGACWRRPLGGDSTTHGRELHPVTQVAFEDAVAYAAW 151

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP---TNNTAADGYLS 251
            G  LPTEAEWE+  RGGLE  +F WG+  TPRG   AN W G FP      +A      
Sbjct: 152 AGKELPTEAEWEFAARGGLEGAVFVWGDEFTPRGRRMANTWHGAFPWEYLPASAKSPRPG 211

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSY--NPKGPTTGTD----KVKK 304
           T+ V SY  N +GLY+M GNVWEWT D + +    PAP     P     G +    +V K
Sbjct: 212 TSAVRSYPANGYGLYDMAGNVWEWTCDAYADRRADPAPGACCAPGDARAGGERFPRRVTK 271

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           GGS+LC   YC R+R AAR+  + D++  +LGFRC
Sbjct: 272 GGSHLCAPNYCLRYRPAARTGQSEDTATCHLGFRC 306


>gi|254431213|ref|ZP_05044916.1| sulfatase-modifying factor 1 [Cyanobium sp. PCC 7001]
 gi|197625666|gb|EDY38225.1| sulfatase-modifying factor 1 [Cyanobium sp. PCC 7001]
          Length = 337

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 123/354 (34%), Positives = 167/354 (47%), Gaps = 63/354 (17%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P P     +DMV +PG +F MG+N        E P+    ++ F++D+  V+N QF++FV
Sbjct: 24  PCPGRPPARDMVWIPGGSFTMGSNHHY---PEEAPAHQRQVEGFWIDRAPVTNAQFRKFV 80

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
            ATG+VT AE+  D   +   L E                  L +        P   +  
Sbjct: 81  KATGHVTLAERAADPADYPDALPE------------------LLAPASIVFVPPSGPVGT 122

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
            D   +             W+Y           P  +W HPEG  S+I+ R +HPVVHV+
Sbjct: 123 GDPYRW-------------WQY----------IPGANWRHPEGPGSSIKGRDHHPVVHVA 159

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
             DA+AY  W G +LPTEAEWE+   GGL    F WG+ L P G   AN +QGEFP +N+
Sbjct: 160 HEDALAYAVWAGKQLPTEAEWEFAAWGGLGGTEFAWGHELHPGGRAMANTFQGEFPHHNS 219

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN---------PKGP 295
              GY  T+PV ++  N +GL +M+GNVWEWT  W+  H    P                
Sbjct: 220 RLVGYERTSPVGAFAANGYGLVDMIGNVWEWTDTWYGEHGAAGPGGEAAPEGGCCASAAR 279

Query: 296 TTGTD----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               D          KV KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 280 EASIDRNSQHGAIPRKVVKGGSFLCAPSYCRRYRPAARMAQGIDTSTCHMGFRC 333


>gi|381203031|ref|ZP_09910140.1| hypothetical protein SyanX_21071 [Sphingobium yanoikuyae XLDN2-5]
          Length = 320

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 130/352 (36%), Positives = 172/352 (48%), Gaps = 62/352 (17%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           AP      +M+ +PG  F MG++        E P+  V +DAF++D   V+N  F+ FV 
Sbjct: 8   APAATAPDNMIHIPGGRFIMGSDH---HYPEEAPAHPVEVDAFWIDATPVTNRDFRRFVE 64

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           ATG+ T AE         P   +   A+      DM R   L  T   R   PV    W+
Sbjct: 65  ATGHRTTAE-------IAPNPDDYPGAQP-----DMLRPASLVFTPTER---PVPLHDWS 109

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
                  W          W Y               W HP G DS+I+   +HPVVHV++
Sbjct: 110 ------QW----------WRYVFDA----------DWRHPLGPDSSIDDLDDHPVVHVTY 143

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           +DA AY  W G  LPTEAEWE+  RGG++ + F WG+ L P G+  AN WQG FP+ N  
Sbjct: 144 DDAAAYAAWAGKALPTEAEWEFAARGGVDGQEFAWGDELMPDGKAMANFWQGNFPSENLL 203

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----------------- 288
            DGY  T+PV S+  N +GL +M+GNVWEWT D++   H   P                 
Sbjct: 204 IDGYERTSPVGSFPANGYGLSDMIGNVWEWTEDFYAARHQADPSRPCCAPRNPLNQAREA 263

Query: 289 SYNPKGP-TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           SY+P+ P      +V KGGS+LC   YC R+R AAR     DSS  ++GFRC
Sbjct: 264 SYDPQQPHILIPRRVLKGGSHLCAPSYCRRYRPAARHAQAIDSSTSHIGFRC 315


>gi|359425710|ref|ZP_09216805.1| hypothetical protein GOAMR_51_00440 [Gordonia amarae NBRC 15530]
 gi|358239024|dbj|GAB06387.1| hypothetical protein GOAMR_51_00440 [Gordonia amarae NBRC 15530]
          Length = 300

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 123/344 (35%), Positives = 168/344 (48%), Gaps = 64/344 (18%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG  FRMG+++       E P+    + AF +++H V+N Q+  FVS TGYVT A
Sbjct: 5   ELVPLPGGVFRMGSDR---HYPEERPAHERRVGAFAIERHPVTNAQYARFVSDTGYVTVA 61

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        EP+              D   F G D   +  +   +V    +  V    W
Sbjct: 62  E--------EPI--------------DPADFPGADP--KQLIPGALVFTPTDGPVDLGDW 97

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W          R  P  SW  P+G  ST++HR +HPVV +++ DA AY  
Sbjct: 98  --------TRWW---------RWVPGASWRSPQGPGSTVDHRPDHPVVQIAYRDAQAYAH 140

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+   GG     + WG    P  +  AN WQG FP  N    G+ ST+
Sbjct: 141 WAGRRLPTEAEWEFAACGGRSGAEYAWGEEFRPGDQVMANTWQGRFPYLNA---GWGSTS 197

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD------------- 300
           PV ++  N +GL +M+GNVWE  +D +   H  +   +  GP    D             
Sbjct: 198 PVGTFPANGYGLADMIGNVWERVSDIFVPRH--SDISDADGPVVEADGRPDLLAPTTSPR 255

Query: 301 --KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             +V KGGS+LC  +YC R+R AARS  + DS+ G+LGFRCAAD
Sbjct: 256 VMRVTKGGSFLCAPEYCRRYRPAARSAQSDDSATGHLGFRCAAD 299


>gi|145225614|ref|YP_001136292.1| hypothetical protein Mflv_5038 [Mycobacterium gilvum PYR-GCK]
 gi|145218100|gb|ABP47504.1| protein of unknown function DUF323 [Mycobacterium gilvum PYR-GCK]
          Length = 294

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 120/333 (36%), Positives = 167/333 (50%), Gaps = 50/333 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D+V L G +FRMG+ +       E P+  VT+  F ++++ V+N QF EFV  TGY T A
Sbjct: 4   DLVDLEGGSFRMGSTR---FYPEEAPAHTVTVAPFAIERNPVTNAQFAEFVDDTGYRTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D  ++ P  S  +    + V      F      ++ R                  W
Sbjct: 61  ERPPDPALY-PGASPHDLVPGALV------FRPTAGPVDLR-----------------DW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         WE+           P   W HP G DS+++ R  HPVV V ++DA AY  
Sbjct: 97  R-------QWWEWT----------PGADWRHPFGPDSSVDDRPEHPVVQVCYSDAAAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+    G    ++ WG+   P G+  AN WQG FP  N  A G+  T+
Sbjct: 140 WAGRRLPTEAEWEFAAGAG-STAVYAWGDEPAPGGQLMANTWQGAFPYRNDGALGWAGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVKKGGSY 308
           PV ++  N FGL +M+GNVWEWT   +  HH      P     P  P    ++  KGGS+
Sbjct: 199 PVGTFPANGFGLVDMIGNVWEWTTTRFAGHHRVNPPPPPTCCPPADPDPAVNQALKGGSH 258

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           LC  +YC+R+R +ARS  + DS+  ++GFRCAA
Sbjct: 259 LCAPEYCHRYRPSARSPQSQDSATTHIGFRCAA 291


>gi|421236881|ref|ZP_15693478.1| hypothetical protein AMCSP07_001690 [Streptococcus pneumoniae
           2071004]
 gi|395601644|gb|EJG61791.1| hypothetical protein AMCSP07_001690 [Streptococcus pneumoniae
           2071004]
          Length = 282

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 120/335 (35%), Positives = 160/335 (47%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E P   V++  F +    V+N +F +FV  TGY T A
Sbjct: 3   MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFVQFVKETGYKTLA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+   +FVF   + E ER             EG          HP     W      C  
Sbjct: 63  ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                       W HP G +S +    +HPVVHV+  DA+A+C 
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|295134836|ref|YP_003585512.1| sulfatase-modifying factor [Zunongwangia profunda SM-A87]
 gi|294982851|gb|ADF53316.1| sulfatase-modifying factor [Zunongwangia profunda SM-A87]
          Length = 376

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 124/350 (35%), Positives = 178/350 (50%), Gaps = 55/350 (15%)

Query: 9   VERYKDMVLLPGDTFRMGTNK--PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
           +E  + MV + G  F MG ++  P  +   E P+  V +D F++DQ EV+N QF++F + 
Sbjct: 60  IETPEGMVWVSGVKFTMGASEGDPYALPR-EKPAHPVAVDGFFIDQTEVTNAQFKKFTAE 118

Query: 67  TGYVTEAEKFGDTFVFEPLLSEEERAKI---SQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
           TGYVT AE+        P+  E+ + ++   +    D     G  S I  +    + ++ 
Sbjct: 119 TGYVTVAER--------PIDWEQMKKQLPPGTPKPPDSVLQPG--SLIFKKELENIANL- 167

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHV 183
            ND   Y  W         EW+ G              W HP+G  S IE + N+PVVH+
Sbjct: 168 -ND---YGQWW--------EWKTGA------------DWKHPQGPQSDIEGKDNYPVVHI 203

Query: 184 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
           ++ DA+AY +W G  LPTEAEWE    G L   ++ WG++ + +    AN WQGEFP  N
Sbjct: 204 AYEDALAYASWAGRDLPTEAEWEAAAHGKLHGGIYTWGDDES-KLNKEANTWQGEFPVKN 262

Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTT- 297
              DGY   APV S+  N   LY+M GNVWE+T D +N  ++          NPKG TT 
Sbjct: 263 IPEDGYKYAAPVKSFPSNSLNLYDMAGNVWEFTKDNFNTRYYQDALQQGELLNPKGSTTY 322

Query: 298 -------GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                    +KV KGGS+LCN+ YC  +R ++R   + DS   +LGFR  
Sbjct: 323 FNEDNPYQKEKVIKGGSFLCNKSYCASYRISSRMGTSMDSGTDHLGFRTV 372


>gi|384103458|ref|ZP_10004435.1| hypothetical protein W59_18879 [Rhodococcus imtechensis RKJ300]
 gi|383839299|gb|EID78656.1| hypothetical protein W59_18879 [Rhodococcus imtechensis RKJ300]
          Length = 304

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 123/347 (35%), Positives = 163/347 (46%), Gaps = 69/347 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TF MG+         E P   VT+D F++D H+V+  +F+ FV  TG+VT AE
Sbjct: 1   MAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDTHQVTVAEFRRFVKDTGHVTTAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA---YC 131
                                 +  D  ++ G D  +   +   +V I     V    Y 
Sbjct: 58  ----------------------IAPDPAQYPGADPAL--LVPGSLVFIPTPGPVPLDDYT 93

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          W +           P   W HPEG  S +  R  HPV HVSW DA AY
Sbjct: 94  RW----------WSF----------TPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAY 133

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWE+  RGGL+ + F WG    P G    NVWQG FP  N   DGY  
Sbjct: 134 AEWAGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGYAG 193

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTD--- 300
           T+PV  ++ N + L +M GNVWEWT D++ V H     + AP+ +   P  P   T    
Sbjct: 194 TSPVGHFRPNGYDLSDMAGNVWEWTTDYFTVDHSESGKNVAPTSSCCIPTNPRVDTARAE 253

Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                   +V KGGS+LC   YC R+R +AR  +T ++S  ++GFRC
Sbjct: 254 NPDEPYARRVIKGGSHLCAPNYCLRYRPSARQGDTEETSTCHIGFRC 300


>gi|393725914|ref|ZP_10345841.1| hypothetical protein SPAM2_19854 [Sphingomonas sp. PAMC 26605]
          Length = 321

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 132/351 (37%), Positives = 173/351 (49%), Gaps = 63/351 (17%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           V + + M  + G +F+MG+         E P R V ++ F++D H V+N QF EFV ATG
Sbjct: 8   VRQLEGMRHIAGGSFKMGSEA---FYSEERPLRRVKVEDFWIDTHPVTNRQFSEFVEATG 64

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y T AE   D   + P ++  +    S V      FE  D+ +   +H    H  W    
Sbjct: 65  YRTVAEIAPDPANY-PGMAPADAVAGSLV------FERTDALVP--LHD---HTRW---- 108

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                          W +  R G +        W HP G DS I+  ++HPVVHV++ DA
Sbjct: 109 ---------------WAF--RHGAD--------WRHPTGPDSGIDDLLDHPVVHVAFADA 143

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LPTEAEWE+  RGGL+   + WG+ L P G   AN WQG FP  N  ADG
Sbjct: 144 QAYAAWVGKVLPTEAEWEFAARGGLDGVEYAWGDELAPGGRVLANYWQGLFPFANQRADG 203

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH----HPAPSY----NPKG------ 294
              T PV  Y  N +GLY+M+GNVWEWT D + +        APS     +P+G      
Sbjct: 204 GYRTTPVGHYPPNGYGLYDMIGNVWEWTRDCYTLSRKAGGKAAPSCCAVPDPRGGTLRAS 263

Query: 295 -----PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                P     KV KGGS+LC   YC R+R AAR   T D + G++GFRC 
Sbjct: 264 MDLAKPVRIGRKVLKGGSHLCAANYCQRYRPAARHPQTIDGATGHIGFRCV 314


>gi|325276759|ref|ZP_08142471.1| hypothetical protein G1E_24647 [Pseudomonas sp. TJI-51]
 gi|324098107|gb|EGB96241.1| hypothetical protein G1E_24647 [Pseudomonas sp. TJI-51]
          Length = 340

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 167/348 (47%), Gaps = 62/348 (17%)

Query: 4   LPAPPVERYKD-MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           LPAP    ++D MV LPG  F  G+ +     D E P+    + AF++D H V+N QF  
Sbjct: 39  LPAPREGPWRDGMVKLPGGKFSFGSER---YYDEEGPAHLAQVSAFWIDVHPVTNAQFAR 95

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV+ATGY+T AE+  DT   +P L +  R   + V       +G D          V+H 
Sbjct: 96  FVAATGYLTHAERGIDT-ADDPSLPQHLRVPGAMVFK-----QGPD----------VLHP 139

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
            W                              +  P  SW HP+G  S ++   NHPVV 
Sbjct: 140 GW------------------------------QFVPGASWRHPQGPGSDLQGLDNHPVVQ 169

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           V+  DA AY  W G RLP+EA+ EY  RGGL++  F WG    P+G+  AN WQG+FP +
Sbjct: 170 VALEDAQAYARWAGRRLPSEAQLEYAMRGGLQDADFSWGMTELPKGKAMANTWQGQFPYH 229

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTD 300
           N A DG+  T+PV  +  N FGL++  GNVWE T   +   H P      +P GP+    
Sbjct: 230 NAATDGFTGTSPVGCFPANGFGLFDAGGNVWELTRTGYRPGHAPQRDAGLDPPGPSLDDS 289

Query: 301 ----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                      V KGGS+LC+   C R+R +AR       +  ++GFR
Sbjct: 290 HDPADPGVKVAVIKGGSHLCSADRCLRYRPSARQPQPVFMATSHVGFR 337


>gi|397734366|ref|ZP_10501076.1| hypothetical protein JVH1_5559 [Rhodococcus sp. JVH1]
 gi|396930034|gb|EJI97233.1| hypothetical protein JVH1_5559 [Rhodococcus sp. JVH1]
          Length = 312

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 122/346 (35%), Positives = 160/346 (46%), Gaps = 63/346 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+M  +PG TF MG+         E P   VT+D F++D H+V+  +F+ FV  TG+VT 
Sbjct: 7   KNMAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDAHQVTVAEFRRFVKDTGHVTT 63

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE                      +  D  ++ G D  +          +     V   T
Sbjct: 64  AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 91

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                L     W             P   W HPEG  S +  R  HPV HVSW DA AY 
Sbjct: 92  PGPVPLDDYTRWW---------SFTPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAYA 142

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+ + F WG    P G    NVWQG FP  N   DG+  T
Sbjct: 143 EWAGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGFAGT 202

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-------PAPS----YNPKGPTTGTD- 300
           +PV  ++ N + L +M GNVWEWT D++   H        PA S     NP+  T   + 
Sbjct: 203 SPVGHFRPNGYELSDMAGNVWEWTTDYFTADHSASGKNVAPASSCCIPTNPRVETARVEN 262

Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                  +V KGGS+LC   YC R+R AAR  +T ++S  ++GFRC
Sbjct: 263 PDEPYARRVIKGGSHLCAPNYCLRYRPAARQGDTEETSTCHIGFRC 308


>gi|452961455|gb|EME66755.1| hypothetical protein G352_02424 [Rhodococcus ruber BKS 20-38]
          Length = 303

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 96/198 (48%), Positives = 120/198 (60%), Gaps = 16/198 (8%)

Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
           P   W HPEG  ST+  R  HPV HVS+ DA+AY  W G  LPTEAEWE+  RGGLE   
Sbjct: 100 PGAQWRHPEGPGSTVAGRDRHPVTHVSYFDALAYAQWAGKELPTEAEWEFAARGGLERAR 159

Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
           F WG+   PRG +RAN WQG FP  NT  DG++ T+PV SY+ N +GL ++ GNVWEWTA
Sbjct: 160 FVWGDEFAPRGRYRANTWQGRFPWENTGEDGFVGTSPVGSYRPNGYGLVDVAGNVWEWTA 219

Query: 278 DWWNVHH-----HPAPSYN---PKGPTTGTD--------KVKKGGSYLCNEQYCYRHRCA 321
           D +  HH       AP++    P  P TG          +V KGGS+LC   YC R+R A
Sbjct: 220 DHYTAHHGADGRDTAPAHACCIPSNPRTGAGDESGPYPRRVIKGGSHLCAPNYCLRYRPA 279

Query: 322 ARSQNTPDSSAGNLGFRC 339
           AR   + ++S  ++GFRC
Sbjct: 280 ARQGESVETSTCHIGFRC 297


>gi|166796137|gb|AAI59002.1| Unknown (protein for IMAGE:7554678) [Xenopus (Silurana) tropicalis]
 gi|166796992|gb|AAI59072.1| Unknown (protein for IMAGE:7565049) [Xenopus (Silurana) tropicalis]
          Length = 273

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 109/269 (40%), Positives = 143/269 (53%), Gaps = 55/269 (20%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           E   +MV L G  F MGT K    KDGE P R V +  F +D++ V+N  F+EFV A  Y
Sbjct: 18  EDADNMVQLDGGQFDMGT-KASDGKDGESPVRQVKVLPFAVDKYPVTNKDFREFVRAKKY 76

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
            TEAE FG +FVFE  +SEE + K++Q                               + 
Sbjct: 77  KTEAEAFGWSFVFEDFVSEELKKKVTQ------------------------------KLE 106

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
              W    LP E             R F    W  P G  S I+ ++++PVV VSWNDA 
Sbjct: 107 SAPWW---LPVE-------------RAF----WRQPAGPASGIKEKLDYPVVQVSWNDAQ 146

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AYC W G RLP E EWE+  RGGL+ +++PWGN   P   +R N+WQG FP  +TA DGY
Sbjct: 147 AYCKWLGKRLPIEEEWEFAARGGLKGKVYPWGNTFKP---NRTNLWQGTFPGMDTATDGY 203

Query: 250 LSTAPVMSY-KENKFGLYNMVGNVWEWTA 277
              +PV ++  +N++GLY+M+GN WEWTA
Sbjct: 204 HGASPVAAFPAQNEYGLYDMLGNTWEWTA 232


>gi|111021278|ref|YP_704250.1| hypothetical protein RHA1_ro04301 [Rhodococcus jostii RHA1]
 gi|110820808|gb|ABG96092.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
          Length = 312

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 121/346 (34%), Positives = 161/346 (46%), Gaps = 63/346 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+M  +PG TF MG+         E P   VT+D F++D H+V+  +F+ FV  TG+VT 
Sbjct: 7   KNMAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDAHQVTVAEFRRFVKDTGHVTT 63

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE                      +  D  ++ G D  +          +     V   T
Sbjct: 64  AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 91

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                L     W             P   W HPEG  S +  R  HPV HVSW DA AY 
Sbjct: 92  PGPVPLDDYTRWW---------SFTPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAYA 142

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+ + F WG    P G    NVWQG FP  N   DG+  T
Sbjct: 143 EWAGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGFAGT 202

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTDKVK- 303
           +PV  ++ N + L +M GNVWEWT D++   H     + AP+ +   P  P   T +V+ 
Sbjct: 203 SPVGHFRPNSYELSDMAGNVWEWTTDYFTADHSASGKNVAPANSCCIPTNPRVETARVEN 262

Query: 304 ----------KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                     KGGS+LC   YC R+R AAR  +T ++S  ++GFRC
Sbjct: 263 PDEPYARRVIKGGSHLCAPNYCLRYRPAARQGDTEETSTCHIGFRC 308


>gi|420933559|ref|ZP_15396834.1| hypothetical protein MM1S1510930_4400 [Mycobacterium massiliense
           1S-151-0930]
 gi|420936161|ref|ZP_15399430.1| hypothetical protein MM1S1520914_4609 [Mycobacterium massiliense
           1S-152-0914]
 gi|420943824|ref|ZP_15407079.1| hypothetical protein MM1S1530915_3953 [Mycobacterium massiliense
           1S-153-0915]
 gi|420946984|ref|ZP_15410234.1| hypothetical protein MM1S1540310_3957 [Mycobacterium massiliense
           1S-154-0310]
 gi|420953972|ref|ZP_15417214.1| hypothetical protein MM2B0626_4216 [Mycobacterium massiliense
           2B-0626]
 gi|420958146|ref|ZP_15421380.1| hypothetical protein MM2B0107_3556 [Mycobacterium massiliense
           2B-0107]
 gi|420962986|ref|ZP_15426210.1| hypothetical protein MM2B1231_4278 [Mycobacterium massiliense
           2B-1231]
 gi|420994089|ref|ZP_15457235.1| hypothetical protein MM2B0307_3516 [Mycobacterium massiliense
           2B-0307]
 gi|420999865|ref|ZP_15463000.1| hypothetical protein MM2B0912R_4532 [Mycobacterium massiliense
           2B-0912-R]
 gi|421004387|ref|ZP_15467509.1| hypothetical protein MM2B0912S_4219 [Mycobacterium massiliense
           2B-0912-S]
 gi|392138318|gb|EIU64055.1| hypothetical protein MM1S1510930_4400 [Mycobacterium massiliense
           1S-151-0930]
 gi|392141676|gb|EIU67401.1| hypothetical protein MM1S1520914_4609 [Mycobacterium massiliense
           1S-152-0914]
 gi|392145430|gb|EIU71154.1| hypothetical protein MM1S1530915_3953 [Mycobacterium massiliense
           1S-153-0915]
 gi|392152885|gb|EIU78592.1| hypothetical protein MM2B0626_4216 [Mycobacterium massiliense
           2B-0626]
 gi|392154014|gb|EIU79720.1| hypothetical protein MM1S1540310_3957 [Mycobacterium massiliense
           1S-154-0310]
 gi|392178647|gb|EIV04300.1| hypothetical protein MM2B0912R_4532 [Mycobacterium massiliense
           2B-0912-R]
 gi|392180191|gb|EIV05843.1| hypothetical protein MM2B0307_3516 [Mycobacterium massiliense
           2B-0307]
 gi|392193090|gb|EIV18714.1| hypothetical protein MM2B0912S_4219 [Mycobacterium massiliense
           2B-0912-S]
 gi|392245899|gb|EIV71376.1| hypothetical protein MM2B1231_4278 [Mycobacterium massiliense
           2B-1231]
 gi|392247872|gb|EIV73348.1| hypothetical protein MM2B0107_3556 [Mycobacterium massiliense
           2B-0107]
          Length = 309

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 121/340 (35%), Positives = 162/340 (47%), Gaps = 71/340 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG  F MG N   +    E P R  ++  F +++H V+N QF EFV+ TGYVT A
Sbjct: 22  ELVELPGGPFPMGCN---VFYPEEMPERVSSVGPFAIERHPVTNAQFAEFVTQTGYVTVA 78

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 79  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 102

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W  P G  S I+    HPVV V++ DA
Sbjct: 103 RTTAGPVDLHDWQQWWDW----------VPGAYWRQPFGPGSDIDGAAEHPVVQVAYTDA 152

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG  + ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 153 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 211

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
           +  T+PV ++  N FGL +M+GNVWEWT    +    P         +PS NP       
Sbjct: 212 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 264

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            +V KGGS+LC  +YC+R+R AARS  + DS+  ++GFRC
Sbjct: 265 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 304


>gi|359798277|ref|ZP_09300851.1| hypothetical protein KYC_15057 [Achromobacter arsenitoxydans SY8]
 gi|359363823|gb|EHK65546.1| hypothetical protein KYC_15057 [Achromobacter arsenitoxydans SY8]
          Length = 322

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 115/331 (34%), Positives = 156/331 (47%), Gaps = 58/331 (17%)

Query: 32  LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSE-EE 90
           + +  E P+  V L  F++D H+V+N +F+ FV ATGYVT AE+        P L     
Sbjct: 7   MSQPNERPAHKVRLTGFWMDVHDVTNAEFRRFVDATGYVTTAEQKPRWEDLRPQLPPGTP 66

Query: 91  RAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRG 150
           R   S +      F G ++ +  R               Y  W          W      
Sbjct: 67  RPDDSALVAGAMVFVGTETEVSLR--------------DYSRW----------W------ 96

Query: 151 GLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCR 210
               R  P  +W HP+G  S I+ +  HPVV VS+ DA AY  W G RLPTEAEWE+  R
Sbjct: 97  ----RFVPGANWRHPQGPGSNIDGKDEHPVVQVSYEDAQAYAQWTGKRLPTEAEWEFAAR 152

Query: 211 GGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGYLSTAPVMSYKENKFGLY 266
           GG E   + WGN L P+G+  AN+W    Q  FP         + T PV SY  N +GLY
Sbjct: 153 GGYEQATYSWGNELQPQGQAMANIWDTRQQQPFPVVKD-EKVQVGTTPVGSYAPNGYGLY 211

Query: 267 NMVGNVWEWTADWWNVHHH---------------PAPSYNPKG---PTTGTDKVKKGGSY 308
           +M GNVW+WTADW+                    PA S++P     P +   +V +GGS+
Sbjct: 212 DMAGNVWQWTADWYRADAFMVQAQYRQPPTDPAGPADSFDPSDGNVPASAPKRVTRGGSF 271

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           LC++ YC  +R +AR    P +   +LGFR 
Sbjct: 272 LCSDTYCISYRASARRGTDPLNPMSHLGFRT 302


>gi|393774874|ref|ZP_10363207.1| hypothetical protein WSK_4216 [Novosphingobium sp. Rr 2-17]
 gi|392719692|gb|EIZ77224.1| hypothetical protein WSK_4216 [Novosphingobium sp. Rr 2-17]
          Length = 329

 Score =  185 bits (469), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 127/352 (36%), Positives = 171/352 (48%), Gaps = 67/352 (19%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + MV + G TF MG+++       E P+R V +D F++D   V+N QF  FV+ATGYVT 
Sbjct: 16  QGMVWIAGGTFTMGSDRHY---PEEAPTRRVAVDGFWIDAAPVTNRQFAAFVAATGYVTV 72

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE   D   +  +     RA                S + H+   PV     +       
Sbjct: 73  AEIAPDPKDYPGMQPGMARAG---------------SLVFHKTATPV-----DTGNPANW 112

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHR--MNHPVVHVSWNDAVA 190
           WR         +E+G              W HP G DS IE     +HPVV V++ DA A
Sbjct: 113 WR---------FEFGA------------DWRHPLGPDSDIESLDLWDHPVVQVAYADAQA 151

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G  LPTEAE+E+  RGGL+ + + WG+ L P G   AN WQG FP  N   DG+ 
Sbjct: 152 YADWAGKVLPTEAEFEFAARGGLDGKDYAWGDQLAPGGAMMANYWQGLFPFANQLLDGWE 211

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------------------PSY 290
            T+PV S+  N +GL++M+GN WEWT+DWW+    PA                     SY
Sbjct: 212 RTSPVGSFPANGYGLFDMIGNTWEWTSDWWSERSMPADKKPGGSCCIPSNPRGGNLKDSY 271

Query: 291 NPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           +P  P      KV KGGS+LC   YC R+R A+R     D+S  ++GFRC +
Sbjct: 272 DPTQPAVRIGRKVLKGGSHLCAANYCQRYRPASRHPEMIDTSTSHIGFRCVS 323


>gi|397680590|ref|YP_006522125.1| serine/threonine-protein kinase pkn1 [Mycobacterium massiliense
           str. GO 06]
 gi|418251298|ref|ZP_12877495.1| hypothetical protein MAB47J26_20891 [Mycobacterium abscessus 47J26]
 gi|353449123|gb|EHB97522.1| hypothetical protein MAB47J26_20891 [Mycobacterium abscessus 47J26]
 gi|395458855|gb|AFN64518.1| Serine/threonine-protein kinase pkn1 [Mycobacterium massiliense
           str. GO 06]
          Length = 292

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 121/340 (35%), Positives = 162/340 (47%), Gaps = 71/340 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG  F MG N   +    E P R  ++  F +++H V+N QF EFV+ TGYVT A
Sbjct: 5   ELVELPGGPFPMGCN---VFYPEEMPERVSSVGPFAIERHPVTNAQFAEFVTQTGYVTVA 61

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D                     DM      D  +   M                T+
Sbjct: 62  ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 85

Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           R    P +       W++           P   W  P G  S I+    HPVV V++ DA
Sbjct: 86  RTTAGPVDLHDWQQWWDW----------VPGAYWRQPFGPGSDIDGAAEHPVVQVAYTDA 135

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G RLPTEAEWEY  RGG  + ++ WG+ + P G   AN WQG FP  N  A G
Sbjct: 136 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 194

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
           +  T+PV ++  N FGL +M+GNVWEWT    +    P         +PS NP       
Sbjct: 195 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 247

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            +V KGGS+LC  +YC+R+R AARS  + DS+  ++GFRC
Sbjct: 248 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 287


>gi|377571511|ref|ZP_09800629.1| putative sulfatase-modifying factor [Gordonia terrae NBRC 100016]
 gi|377531341|dbj|GAB45794.1| putative sulfatase-modifying factor [Gordonia terrae NBRC 100016]
          Length = 319

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 118/332 (35%), Positives = 161/332 (48%), Gaps = 60/332 (18%)

Query: 14  DMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + V +P   F MG         DGE P   V L+AF +D   V+N  F  FV+ATG+ T+
Sbjct: 34  ETVSVPAGVFDMGDAFGEGYRTDGETPVHEVELNAFSIDTTAVTNAAFASFVAATGHRTD 93

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE FG + VF    +   R                          PV    W  AV   +
Sbjct: 94  AETFGGSAVFHTYATAPGR--------------------------PVPGTPWWLAVDGAS 127

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR                             HP G  ST++   +HPVVHVS  DA AYC
Sbjct: 128 WR-----------------------------HPAGPGSTLDGLADHPVVHVSHRDAQAYC 158

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEA+WEY  RGG     +PWG+      + R  +++G+FP   T   G + T
Sbjct: 159 DWAGRALPTEAQWEYAARGGRRGARYPWGDEPPTADDPRCTIFRGDFPNEPT---GPVGT 215

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGPTTGTDKVKKGGSYLCN 311
            PV +++ N  GLY   GNVWEW AD ++  ++      +P GP  G+ +V +GGS+LC+
Sbjct: 216 TPVRTFEPNGHGLYQCAGNVWEWCADRFSARYYRVSDRTDPSGPARGSARVLRGGSHLCH 275

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           + YC+R+R AARS NTP+S+A N+GFR    +
Sbjct: 276 DSYCHRYRVAARSHNTPESTASNIGFRTVGPR 307


>gi|118469714|ref|YP_885618.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
 gi|399985615|ref|YP_006565963.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
 gi|441204504|ref|ZP_20972070.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis MKD8]
 gi|118171001|gb|ABK71897.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
 gi|399230175|gb|AFP37668.1| Sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
 gi|440629433|gb|ELQ91222.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis MKD8]
          Length = 291

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 123/332 (37%), Positives = 171/332 (51%), Gaps = 49/332 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V L G TFRMG+         E P   V++  F +++H V+N QF EFV+ATGYVT A
Sbjct: 4   ELVALDGGTFRMGSQD---FYPEEAPVHEVSVAPFSIERHPVTNAQFAEFVAATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   F P +  +E    + V      F+  +  +  R           D   + TW
Sbjct: 61  EQELDPAAF-PGVPADELVPGALV------FQPTEGPVNLR-----------DWRQWWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P   W HP+G  S+I+   +HPVV V++ DA AY  
Sbjct: 103 -----------------------VPGACWKHPKGPGSSIDDVPDHPVVQVAYPDAAAYAA 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  R G    ++ WG+++ P G+  AN WQG FP  N  A G+  T+
Sbjct: 140 WAGRRLPTEAEWEYAARAGATT-VYAWGDDVRPDGQLMANTWQGRFPYRNDGALGWTGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD----KVKKGGSYL 309
           PV ++  N FGL +M+GNVWEWT+  +   H   P  +   P  G D    +  KGGS+L
Sbjct: 199 PVGTFPPNGFGLVDMIGNVWEWTSTRYTPRHSRRPEVSGCCPAPGGDPSIHQTLKGGSHL 258

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           C  +YC+R+R AARS  + DS+  ++GFRC A
Sbjct: 259 CAPEYCHRYRPAARSSQSQDSATTHIGFRCVA 290


>gi|427407746|ref|ZP_18897948.1| hypothetical protein HMPREF9718_00422 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425713709|gb|EKU76721.1| hypothetical protein HMPREF9718_00422 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 320

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 130/353 (36%), Positives = 171/353 (48%), Gaps = 62/353 (17%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           AP      +M+ +PG  F MG++        E P+  V +DAF++D   V+N  F+ FV 
Sbjct: 8   APAATAPDNMIHIPGGRFIMGSDH---HYPEEAPAHPVEVDAFWIDATPVTNRDFRRFVE 64

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           ATG+ T AE         P   +   A+      DM R   L  T   R   PV    W+
Sbjct: 65  ATGHRTTAE-------IAPNPDDYPGAQP-----DMLRPASLVFTPTER---PVPLHDWS 109

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
                  W          W Y               W HP G DS+I+   +HPVVHV++
Sbjct: 110 ------QW----------WRYVFDA----------DWRHPLGPDSSIDDLDDHPVVHVTY 143

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           +DA AY  W G  LPTEAEWE+  RGG++ + F WG+ L P G+  AN WQG FP+ N  
Sbjct: 144 DDAAAYAAWAGKALPTEAEWEFAARGGVDGQEFAWGDELLPDGKAMANFWQGNFPSENLL 203

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----------------- 288
            DGY  T+PV S+  N +GL +M+GNVWEWT D++   H   P                 
Sbjct: 204 IDGYERTSPVGSFPANGYGLSDMIGNVWEWTEDFYAARHQADPSRPCCAPRNPLNQAREA 263

Query: 289 SYNPKGP-TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           SY+P  P      +V KGGS+LC   YC R+R AAR     DSS  ++GFRC 
Sbjct: 264 SYDPHQPHILIPRRVLKGGSHLCAPSYCRRYRPAARHAQAIDSSTSHIGFRCV 316


>gi|148997850|ref|ZP_01825414.1| hypothetical protein CGSSp11BS70_03044 [Streptococcus pneumoniae
           SP11-BS70]
 gi|168575014|ref|ZP_02720977.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae MLV-016]
 gi|307068416|ref|YP_003877382.1| hypothetical protein SPAP_1797 [Streptococcus pneumoniae AP200]
 gi|419471684|ref|ZP_14011543.1| hypothetical protein SPAR15_1638 [Streptococcus pneumoniae GA07914]
 gi|419504523|ref|ZP_14044191.1| hypothetical protein SPAR105_1607 [Streptococcus pneumoniae
           GA47760]
 gi|421314663|ref|ZP_15765250.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA47562]
 gi|147756349|gb|EDK63391.1| hypothetical protein CGSSp11BS70_03044 [Streptococcus pneumoniae
           SP11-BS70]
 gi|183578750|gb|EDT99278.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae MLV-016]
 gi|306409953|gb|ADM85380.1| Uncharacterized conserved protein [Streptococcus pneumoniae AP200]
 gi|379546400|gb|EHZ11539.1| hypothetical protein SPAR15_1638 [Streptococcus pneumoniae GA07914]
 gi|379605911|gb|EHZ70661.1| hypothetical protein SPAR105_1607 [Streptococcus pneumoniae
           GA47760]
 gi|395913348|gb|EJH24201.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA47562]
          Length = 282

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 160/335 (47%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E P   V++  F +    V+N +F +FV  TGY T A
Sbjct: 3   MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+   +FVF   + E ER             EG          HP     W      C  
Sbjct: 63  ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                       W HP G ++ +    +HPVVHV+  DA+A+C 
Sbjct: 100 ----------------------------WKHPYGENNNLVGLEDHPVVHVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|152988165|ref|YP_001347985.1| sulfatase-modifying factor 1 [Pseudomonas aeruginosa PA7]
 gi|150963323|gb|ABR85348.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Pseudomonas aeruginosa PA7]
          Length = 317

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 132/358 (36%), Positives = 170/358 (47%), Gaps = 69/358 (19%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R  DM  LPG  FRMG+ +       E P R V +D F++D   V+N QF  FV ATGY 
Sbjct: 7   RTPDMRYLPGGVFRMGSER---FYPEERPVRQVRVDPFWIDVTPVTNRQFAAFVEATGYR 63

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           T AEK  +   +  + +E  RA  S V      F  + + +              D    
Sbjct: 64  TFAEKAPEPADYPGMPAEMARAG-SLV------FTPVSAAV--------------DLADP 102

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W          W +        R   W  W  P G  S+I     HPVVHV++ DA A
Sbjct: 103 SRW----------WSF--------RFGAW--WREPLGPGSSIAGLEEHPVVHVAYEDAAA 142

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G  LP+EAEWEY  RGG +   F WG+ L P G+  AN WQG FP  N   DG+ 
Sbjct: 143 YAAWVGKALPSEAEWEYAARGGRDGEEFAWGDTLEPGGQILANYWQGRFPLENLLLDGWE 202

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP---------------------S 289
            T+PV ++  N  GL++M+GNVWEWTADW+     PA                      S
Sbjct: 203 RTSPVGAFPANGHGLFDMIGNVWEWTADWF---RQPAATDRQASSCCASRNPRGGREEDS 259

Query: 290 YNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           Y+P  P      KV KGGS+LC E YC R+R AAR   + D++ G++GFRC   + P 
Sbjct: 260 YDPALPALRIGRKVLKGGSHLCAENYCQRYRPAARIPQSIDTTTGHVGFRCVRREPPA 317


>gi|67924262|ref|ZP_00517699.1| Protein of unknown function DUF323 [Crocosphaera watsonii WH 8501]
 gi|67853877|gb|EAM49199.1| Protein of unknown function DUF323 [Crocosphaera watsonii WH 8501]
          Length = 224

 Score =  184 bits (468), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 92/196 (46%), Positives = 127/196 (64%), Gaps = 15/196 (7%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W HP G DS I+ + N+PV+H++++DAVAY  W+G  LPTEA+WEY  RGGL+++ F W
Sbjct: 32  NWQHPYGPDSNIKGKDNYPVIHIAYDDAVAYANWKGKTLPTEAQWEYAGRGGLKDQDFSW 91

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           GN  + +   +AN WQG FP  NT  DG+L TAPV S+  N +GLY+M GNVWE T+DW+
Sbjct: 92  GNRYSAK---KANTWQGIFPFLNTKQDGHLGTAPVESFPPNGYGLYDMTGNVWELTSDWY 148

Query: 281 NVHH-------HPA-----PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTP 328
           +V H       +P       S++P  P+ G+  V KGGSYLC + YC R+R AAR    P
Sbjct: 149 SVGHGGKEHSLNPTGPTKNASFDPTKPSEGSLHVIKGGSYLCAKNYCSRYRPAARESQAP 208

Query: 329 DSSAGNLGFRCAADKG 344
           D+   ++GFR  +  G
Sbjct: 209 DTGTTHVGFRLVSVDG 224


>gi|303254682|ref|ZP_07340784.1| hypothetical protein CGSSpBS455_04421 [Streptococcus pneumoniae
           BS455]
 gi|303258747|ref|ZP_07344727.1| hypothetical protein CGSSp9vBS293_06449 [Streptococcus pneumoniae
           SP-BS293]
 gi|303261911|ref|ZP_07347857.1| hypothetical protein CGSSp14BS292_10874 [Streptococcus pneumoniae
           SP14-BS292]
 gi|303263773|ref|ZP_07349695.1| hypothetical protein CGSSpBS397_08169 [Streptococcus pneumoniae
           BS397]
 gi|303265692|ref|ZP_07351591.1| hypothetical protein CGSSpBS457_09620 [Streptococcus pneumoniae
           BS457]
 gi|303268605|ref|ZP_07354397.1| hypothetical protein CGSSpBS458_09836 [Streptococcus pneumoniae
           BS458]
 gi|387759911|ref|YP_006066889.1| hypothetical protein SPNINV200_16260 [Streptococcus pneumoniae
           INV200]
 gi|418140131|ref|ZP_12776956.1| hypothetical protein SPAR28_1772 [Streptococcus pneumoniae GA13338]
 gi|418181162|ref|ZP_12817731.1| hypothetical protein SPAR74_1775 [Streptococcus pneumoniae GA41688]
 gi|418221825|ref|ZP_12848478.1| hypothetical protein SPAR104_1759 [Streptococcus pneumoniae
           GA47751]
 gi|419515310|ref|ZP_14054935.1| hypothetical protein SPAR152_1676 [Streptococcus pneumoniae
           England14-9]
 gi|421296637|ref|ZP_15747346.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA58581]
 gi|301802500|emb|CBW35259.1| conserved hypothetical protein [Streptococcus pneumoniae INV200]
 gi|302598394|gb|EFL65438.1| hypothetical protein CGSSpBS455_04421 [Streptococcus pneumoniae
           BS455]
 gi|302636994|gb|EFL67483.1| hypothetical protein CGSSp14BS292_10874 [Streptococcus pneumoniae
           SP14-BS292]
 gi|302640248|gb|EFL70703.1| hypothetical protein CGSSpBS293_06449 [Streptococcus pneumoniae
           SP-BS293]
 gi|302641884|gb|EFL72239.1| hypothetical protein CGSSpBS458_09836 [Streptococcus pneumoniae
           BS458]
 gi|302644819|gb|EFL75067.1| hypothetical protein CGSSpBS457_09620 [Streptococcus pneumoniae
           BS457]
 gi|302646811|gb|EFL77036.1| hypothetical protein CGSSpBS397_08169 [Streptococcus pneumoniae
           BS397]
 gi|353843234|gb|EHE23279.1| hypothetical protein SPAR74_1775 [Streptococcus pneumoniae GA41688]
 gi|353875135|gb|EHE54989.1| hypothetical protein SPAR104_1759 [Streptococcus pneumoniae
           GA47751]
 gi|353904910|gb|EHE80360.1| hypothetical protein SPAR28_1772 [Streptococcus pneumoniae GA13338]
 gi|379635859|gb|EIA00418.1| hypothetical protein SPAR152_1676 [Streptococcus pneumoniae
           England14-9]
 gi|395895510|gb|EJH06485.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA58581]
          Length = 282

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 160/335 (47%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E P   V++  F +    V+N +F +FV  TGY T A
Sbjct: 3   MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+   +FVF   + E ER             EG          HP     W      C  
Sbjct: 63  ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                       W HP G +S +    +HPVV+V+  DA+A+C 
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVYVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|194384818|dbj|BAG59569.1| unnamed protein product [Homo sapiens]
          Length = 286

 Score =  184 bits (467), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 122/336 (36%), Positives = 161/336 (47%), Gaps = 83/336 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 30  MVQLQGGRFLMGTNSPD-SRDGEGPVRGATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q    MK   G  S I  R+ HPV+H+SWNDA AYC WR
Sbjct: 89  MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 144

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+W  P        +R N                W
Sbjct: 145 GKRLPTEEEWEFAARGGLKGQVYPWGNWFQP--------NRTN---------------LW 181

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G                             P+G+   + + G  P N   A        
Sbjct: 182 QGK---------------------------FPKGDKAEDGFHGVSPVNAFPA-------- 206

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
                +N +GLY+++GNVWEWTA             +P        +V +G S++   + 
Sbjct: 207 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 248

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
              +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 249 SANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 284



 Score = 41.2 bits (95), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 235 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 279


>gi|418183336|ref|ZP_12819893.1| hypothetical protein SPAR78_1744 [Streptococcus pneumoniae GA43380]
 gi|353847765|gb|EHE27785.1| hypothetical protein SPAR78_1744 [Streptococcus pneumoniae GA43380]
          Length = 282

 Score =  184 bits (467), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 159/335 (47%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E P   V++  F +    V+N +F +FV  TGY T A
Sbjct: 3   MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+   +FVF   + E ER             EG          HP     W      C  
Sbjct: 63  ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVPNAC-- 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                       W HP G +S +    +HPVVHV+  DA+A+C 
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NT  DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTGLDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|194248087|ref|NP_001035934.2| sulfatase-modifying factor 2 isoform c precursor [Homo sapiens]
          Length = 305

 Score =  184 bits (467), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 121/336 (36%), Positives = 158/336 (47%), Gaps = 83/336 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 49  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q    MK   G  S I  R+ HPV+H+SWNDA AYC WR
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 163

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+W  P                             
Sbjct: 164 GKRLPTEEEWEFAARGGLKGQVYPWGNWFQP----------------------------- 194

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
                               NR   W     P+G+   + + G  P N   A        
Sbjct: 195 --------------------NRTNLWQGKF-PKGDKAEDGFHGVSPVNAFPA-------- 225

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
                +N +GLY+++GNVWEWTA             +P        +V +G S++   + 
Sbjct: 226 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 267

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
              +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 268 SANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 303



 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 85/232 (36%), Positives = 116/232 (50%), Gaps = 65/232 (28%)

Query: 163 LHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGN 222
           + P G  S I  R+ HPV+HVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWGN
Sbjct: 131 MKPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGN 190

Query: 223 NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWN 281
              P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWE     W 
Sbjct: 191 WFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWE-----WT 242

Query: 282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
              + A   + +        V +G S++                +T D SA         
Sbjct: 243 ASPYQAAEQDMR--------VLRGASWI----------------DTADGSAN-------- 270

Query: 342 DKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
                                   +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 271 ------------------------HRARVTTRMGNTPDSASDNLGFRCAADA 298


>gi|163849970|ref|YP_001638013.1| hypothetical protein Mext_0527 [Methylobacterium extorquens PA1]
 gi|163661575|gb|ABY28942.1| protein of unknown function DUF323 [Methylobacterium extorquens
           PA1]
          Length = 325

 Score =  184 bits (466), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 164/352 (46%), Gaps = 64/352 (18%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
            E    M  + G TFRMG+++       E P   V +D F++D+  V+N QF +FV AT 
Sbjct: 15  AEDTSGMRWISGGTFRMGSDRHY---PEEAPIHRVAVDGFWIDEAPVTNRQFSQFVLATA 71

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y T AE                      +  D K + G            + H+ +  ++
Sbjct: 72  YRTVAE----------------------IEPDPKDYPGA-----------LPHLLFAGSL 98

Query: 129 AYCTWRG-ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
            +    G   L   ++W    +G           W  P G  S+I+   +HPVV V++ D
Sbjct: 99  VFTPTPGPVDLRVCSQWWRFLKGA---------DWRRPYGPGSSIKGLDDHPVVQVAYKD 149

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A AY  W G  LPTEAEWE+  RGGL+   F WG +  P G   AN+WQGEFP  N +  
Sbjct: 150 AEAYSDWAGKALPTEAEWEFAARGGLDGAEFAWGADFMPEGRQMANIWQGEFPHQNLSRR 209

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN---PKGPTTGTD---- 300
            +  T PV ++  N +GL +M+GNVWEWT+DW+   H          P+ P  G +    
Sbjct: 210 AFKRTTPVGAFPPNAYGLVDMIGNVWEWTSDWFAEKHQGDAKKACCIPQNPRGGREADSL 269

Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                      KV KGGS+LC   YC R+R AAR     D+S  ++GFRC A
Sbjct: 270 DPRDLQSLIPRKVVKGGSHLCAPTYCRRYRPAARHPQPIDTSMSHVGFRCVA 321


>gi|419960796|ref|ZP_14476811.1| hypothetical protein WSS_A01770 [Rhodococcus opacus M213]
 gi|414574017|gb|EKT84695.1| hypothetical protein WSS_A01770 [Rhodococcus opacus M213]
          Length = 304

 Score =  184 bits (466), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 160/344 (46%), Gaps = 63/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TF MG+         E P   VT+DAF++D H+V+  +F+ FV  TG+VT AE
Sbjct: 1   MEWIPGGTFWMGSED---FYPEERPVHQVTVDAFWMDSHQVTVAEFRRFVKDTGHVTTAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                                 +  D  ++ G D  +          +     V   T  
Sbjct: 58  ----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPTPG 85

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L     W     G           W HPEG  S +  R  HPV HVSW DA AY  W
Sbjct: 86  PVPLDDYTRWWSFTAGA---------DWRHPEGPGSNVGGRERHPVTHVSWFDARAYAEW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGL+ + F WG    P G    NVWQG FP  N   DGY  T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGYAGTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGT------- 299
           V  ++ N + L +M GNVWEWT D++ V H     + AP+ +   P  P   T       
Sbjct: 197 VGHFRPNGYDLSDMAGNVWEWTTDYFTVDHSESGKNVAPTSSCCIPTNPRVDTARAENPD 256

Query: 300 ----DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   YC R+R +AR  +T ++S  ++GFRC
Sbjct: 257 EPYARRVIKGGSHLCAPNYCLRYRPSARQGDTEETSTCHIGFRC 300


>gi|299115525|emb|CBN75729.1| Zgc:136465 protein [Ectocarpus siliculosus]
          Length = 243

 Score =  184 bits (466), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 89/177 (50%), Positives = 111/177 (62%), Gaps = 15/177 (8%)

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL--------------FPWGNNLTP 226
           + VSWNDA AYC WRG RLP+EAEWE   +G                    +PWG++LTP
Sbjct: 20  IQVSWNDAKAYCRWRGGRLPSEAEWERAAQGDPVEAAGEGEGEERGRAMPRYPWGDDLTP 79

Query: 227 RGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHH 285
            G+HRAN+WQG FPT N A DG+  T+PV ++  +N  GL + VGN WEW  DWW     
Sbjct: 80  EGQHRANIWQGTFPTLNLAEDGFPYTSPVDAFPPQNSLGLRDAVGNAWEWVEDWWTPDRS 139

Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                NP+GP  GT+K KKGGS+LC+  YCYR+R AAR+++  DS   N GFRCAAD
Sbjct: 140 KYKQNNPRGPKRGTEKTKKGGSFLCHHTYCYRYRPAARTKSDIDSGTSNQGFRCAAD 196


>gi|441510337|ref|ZP_20992244.1| hypothetical protein GOACH_22_00020 [Gordonia aichiensis NBRC
           108223]
 gi|441445472|dbj|GAC50205.1| hypothetical protein GOACH_22_00020 [Gordonia aichiensis NBRC
           108223]
          Length = 328

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 124/340 (36%), Positives = 170/340 (50%), Gaps = 58/340 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V L G  F MG+++       E P     +DAF +++H V+N Q+  FV+ATGYVT  
Sbjct: 32  ELVELAGGRFSMGSDR---FYPEERPHHERRVDAFAIERHPVTNAQYAAFVAATGYVT-- 86

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                               I+++  D   F G D  +   +   +V    +  V    W
Sbjct: 87  --------------------IAELPIDPADFPGADPAL--LVPGALVFTPTSGPVDLTDW 124

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W          R  P  SW HP G ++ +    +HPV  +++ DAVAY  
Sbjct: 125 R--------HWW---------RWQPGASWRHPLGPETDLVGYEDHPVTQIAYADAVAYAE 167

Query: 194 WRGARLPTEAEWEYGCRGGLEN-RLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
           W G RLPTEAEWEY  RGG+     + WG+ L P G   AN W G+FP    A+DG+  T
Sbjct: 168 WAGRRLPTEAEWEYAARGGIGGAHEYAWGDELHPDGAILANTWIGDFP---YASDGWGGT 224

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKG------PTTG--TDKV 302
           + V SY  N FGL +M+GNVWE T+D +   H P    + +P G      PT      +V
Sbjct: 225 SRVASYPPNGFGLVDMIGNVWERTSDVFTPRHVPPDLATVSPDGRTNLLAPTASPHVSRV 284

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            KGGS+LC+  YC R+R AARS  + DS+  +LGFRCAAD
Sbjct: 285 TKGGSHLCSPHYCRRYRPAARSPQSDDSATSHLGFRCAAD 324


>gi|297288259|ref|XP_002803310.1| PREDICTED: sulfatase-modifying factor 2-like isoform 2 [Macaca
           mulatta]
 gi|402863390|ref|XP_003896000.1| PREDICTED: sulfatase-modifying factor 2 isoform 3 [Papio anubis]
          Length = 231

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 111/294 (37%), Positives = 148/294 (50%), Gaps = 69/294 (23%)

Query: 58  TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
            Q ++FV    Y TEAE FG +FVFE  +S+E R K +Q                     
Sbjct: 2   AQARDFVREKKYRTEAEMFGWSFVFEDFVSDELRNKATQ--------------------- 40

Query: 118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
           P+  + W        W    LP E  +                 W  P G  S I  R+ 
Sbjct: 41  PMKPVLW--------W----LPVEKAF-----------------WRQPAGPGSGIRERLE 71

Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
           HPV+HVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG
Sbjct: 72  HPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQG 128

Query: 238 EFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT 296
            FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA             +P    
Sbjct: 129 RFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAA 175

Query: 297 TGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 176 EQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRLPG 229


>gi|357975662|ref|ZP_09139633.1| hypothetical protein SpKC8_09213 [Sphingomonas sp. KC8]
          Length = 318

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 127/345 (36%), Positives = 170/345 (49%), Gaps = 64/345 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ + G  FRMG+++       E P+ +V +D F +D   V+N QF  FV+ATG+VT AE
Sbjct: 15  MIAIKGGRFRMGSDR---HYPEEAPAHDVIVDDFAIDATPVTNAQFAAFVAATGHVTFAE 71

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              +   +  +L E        +R     F      IE R           D  A+    
Sbjct: 72  TPPNPDDYPGILPE-------MIRAGSLVFTPPPGPIELR-----------DPFAW---- 109

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W +               W  P G  STI+   +HPVVH+++ DA AY  W
Sbjct: 110 ---------WSFTFDA----------DWRRPYGPGSTIDGLDDHPVVHIAYADAQAYADW 150

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGG+ +  + WG+ L P G H ANVWQG FP  N   DG+  T+P
Sbjct: 151 AGKSLPTEAEWEFAARGGIADADYAWGDELEPGGVHHANVWQGAFPWQNLGGDGWERTSP 210

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSY-----NPKGP--TTGTD---- 300
           V +Y  N +GL++M+GNVWEWTADW+   H    PA S      NP+G   T   D    
Sbjct: 211 VRNYPANGYGLFDMIGNVWEWTADWYADRHAAKDPAASACCVPRNPRGAVETESRDPCMP 270

Query: 301 ------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                 +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 271 DLVVGRRVLKGGSHLCAPNYCQRYRPAARYAQPIDTSTSHVGFRC 315


>gi|452751219|ref|ZP_21950965.1| Sulfatase modifying factor 1 precursor [alpha proteobacterium
           JLT2015]
 gi|451961369|gb|EMD83779.1| Sulfatase modifying factor 1 precursor [alpha proteobacterium
           JLT2015]
          Length = 342

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 133/364 (36%), Positives = 175/364 (48%), Gaps = 71/364 (19%)

Query: 3   LLPAPPVE--RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           +LPA   +  ++  M+ +PG TF MG+++       E P R V +D F +D+  V+N  F
Sbjct: 13  ILPAAAADAPQHTGMMFVPGGTFAMGSDR---FYPEEAPVRTVRVDPFRIDETPVTNADF 69

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
             FV ATGY T AE   D   +  L +E  +A                S + HR   PV 
Sbjct: 70  ARFVDATGYRTVAEIAPDPADYPGLAAEMAKAG---------------SLVFHRTRGPV- 113

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                D      W          W +         +F    W HP G DS+++   +HPV
Sbjct: 114 -----DLHDPSGW----------WSF---------VFD-ADWRHPYGPDSSLDGLDDHPV 148

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           VHV+  DA AY  W G  LPTEAEWE+  RGGL+   + WG+ L P GE  AN WQG FP
Sbjct: 149 VHVAHADAAAYADWAGKALPTEAEWEFAARGGLDGTDYAWGDELAPDGEMLANYWQGAFP 208

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-----------HHHPAPS 289
             N   DGY  T+PV +Y  N +GL++M+GNVWEWT DW+ +              P+P 
Sbjct: 209 FANQLLDGYERTSPVRTYAANGYGLFDMIGNVWEWTEDWFALPPSPKKKTAKRGSGPSPC 268

Query: 290 YNPKGPTTGT--------------DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
             P  P  GT               KV KGGS+LC   YC R+R AAR     DS+  ++
Sbjct: 269 CVPANPRGGTRRQSVDPQDTARIARKVLKGGSHLCAANYCQRYRPAARHAQAIDSATSHI 328

Query: 336 GFRC 339
           GFRC
Sbjct: 329 GFRC 332


>gi|365856093|ref|ZP_09396118.1| cysteine-type sulfatase aerobic maturase [Acetobacteraceae
           bacterium AT-5844]
 gi|363718511|gb|EHM01850.1| cysteine-type sulfatase aerobic maturase [Acetobacteraceae
           bacterium AT-5844]
          Length = 329

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 128/343 (37%), Positives = 165/343 (48%), Gaps = 62/343 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG  F MG++        E P+  V +D F++D   V+N  F  FV+ TG++T AE
Sbjct: 19  MRWIPGGRFTMGSDHHYA---EERPAHPVAVDGFWMDATPVTNAAFARFVADTGHLTLAE 75

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D  ++   + E                  +   +  RM    V+            R
Sbjct: 76  RPLDPALYPGAMPELL----------------VPGALVFRMTPGPVNT-----------R 108

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
             R      W Y           P   W  PEG  ST++ R +HPVVHV+  DA AY  W
Sbjct: 109 DVR----QWWSY----------TPGACWKRPEGPGSTLDGRDDHPVVHVAHEDAAAYAAW 154

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE   RGGLE   F WG  LTP G H AN WQG FP  + AADG+  T+P
Sbjct: 155 AGKALPTEAEWERAARGGLEAAEFAWGEELTPEGRHMANTWQGPFPWRDFAADGFAGTSP 214

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKG-PTTGT-------- 299
           V S+  N +GL ++ GNVWEWT DW+   H   P       +NP+G P  G+        
Sbjct: 215 VGSFPANGYGLSDVTGNVWEWTEDWYAARHEADPGKPCCAPHNPRGAPVEGSYDPHQPAI 274

Query: 300 ---DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               KV KGGS+LC   YC R+R AAR     DS+  +LGFRC
Sbjct: 275 RIPRKVVKGGSFLCAPSYCRRYRPAARQPQMVDSAMSHLGFRC 317


>gi|351699435|gb|EHB02354.1| Sulfatase-modifying factor 1 [Heterocephalus glaber]
          Length = 265

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 78/128 (60%), Positives = 97/128 (75%)

Query: 174 HRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
           HR +HPV+H+SWNDAVAYCTW G RLPTEA+WEY C+ GL+NRL PW N L P+ +H AN
Sbjct: 138 HRPDHPVLHLSWNDAVAYCTWAGKRLPTEAQWEYSCQRGLQNRLLPWVNKLQPKEQHYAN 197

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK 293
           +WQGEFP  NT  DG+  TAP   +  N +GLYN+VGNVWEWT+DWW ++H    + NPK
Sbjct: 198 LWQGEFPVTNTGEDGFQGTAPTDVFPPNGYGLYNIVGNVWEWTSDWWTIYHLVEETLNPK 257

Query: 294 GPTTGTDK 301
           GP +G D+
Sbjct: 258 GPPSGKDR 265


>gi|168486347|ref|ZP_02710855.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC1087-00]
 gi|418185547|ref|ZP_12822087.1| hypothetical protein SPAR91_1736 [Streptococcus pneumoniae GA47283]
 gi|419495982|ref|ZP_14035699.1| hypothetical protein SPAR97_1623 [Streptococcus pneumoniae GA47461]
 gi|419530831|ref|ZP_14070357.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA40028]
 gi|421213727|ref|ZP_15670681.1| hypothetical protein AMCSP12_001614 [Streptococcus pneumoniae
           2070108]
 gi|421216366|ref|ZP_15673274.1| hypothetical protein AMCSP04_002107 [Streptococcus pneumoniae
           2070109]
 gi|421302736|ref|ZP_15753400.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA17484]
 gi|183570564|gb|EDT91092.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC1087-00]
 gi|353848277|gb|EHE28293.1| hypothetical protein SPAR91_1736 [Streptococcus pneumoniae GA47283]
 gi|379571240|gb|EHZ36198.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA40028]
 gi|379594068|gb|EHZ58879.1| hypothetical protein SPAR97_1623 [Streptococcus pneumoniae GA47461]
 gi|395577898|gb|EJG38429.1| hypothetical protein AMCSP04_002107 [Streptococcus pneumoniae
           2070109]
 gi|395578958|gb|EJG39468.1| hypothetical protein AMCSP12_001614 [Streptococcus pneumoniae
           2070108]
 gi|395901358|gb|EJH12294.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA17484]
          Length = 282

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 159/335 (47%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E     V++  F +    V+N +F +FV  TGY T A
Sbjct: 3   MKKIKGGTFMMGTNSEEGFLDDFEGLQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+   +FVF   + E ER             EG          HP     W      C  
Sbjct: 63  ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                       W HP G +S +    +HPVVHV+  DA+A+C 
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|427702661|ref|YP_007045883.1| hypothetical protein Cyagr_1371 [Cyanobium gracile PCC 6307]
 gi|427345829|gb|AFY28542.1| hypothetical protein Cyagr_1371 [Cyanobium gracile PCC 6307]
          Length = 320

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 124/338 (36%), Positives = 165/338 (48%), Gaps = 58/338 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMV +P  +F MG++        E P+  V L  F++D+  V+N QFQ+FV ATG++T A
Sbjct: 25  DMVWIPPGSFEMGSDH---HYPEEAPAHRVALGGFWIDRTPVTNAQFQKFVKATGHITLA 81

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   +   L E                  L +        P   I   D   +   
Sbjct: 82  ERPADPADYPGALPE------------------LLAPASIVFVPPPGPIGTGDPYRW--- 120

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W+Y           P   W HPEG  S+I  R NHPVVHV+  DA AY  
Sbjct: 121 ----------WQY----------RPGACWRHPEGPGSSIRMRENHPVVHVAHGDAAAYAA 160

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G +LP+EAEWE   RGG +   F WG+ L P G   AN +QG+FP +N+  DG+  T+
Sbjct: 161 WAGKQLPSEAEWERAARGGRQGDEFAWGDELHPGGRPMANTFQGDFPHHNSLLDGWERTS 220

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD------------K 301
           PV ++  N +GL +M+GNVWEWT DW+  H   A   NP+G                  K
Sbjct: 221 PVGAFPPNGYGLLDMIGNVWEWTDDWYGGHG--ATVANPEGCCRQASIDPGSQHGHQPRK 278

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 279 VVKGGSFLCAPSYCRRYRPAARMAQGVDTSTCHMGFRC 316


>gi|419511049|ref|ZP_14050690.1| hypothetical protein SPAR142_1717 [Streptococcus pneumoniae NP141]
 gi|379631652|gb|EHZ96229.1| hypothetical protein SPAR142_1717 [Streptococcus pneumoniae NP141]
          Length = 282

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 119/335 (35%), Positives = 159/335 (47%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E     V++  F +    V+N +F +FV  TGY T A
Sbjct: 3   MKKIKGGTFMMGTNYEEGFLDDFEGLQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+   +FVF   + E ER             EG          HP     W      C  
Sbjct: 63  ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                       W HP G +S +    +HPVVHV+  DA+A+C 
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|407276350|ref|ZP_11104820.1| hypothetical protein RhP14_07602 [Rhodococcus sp. P14]
          Length = 293

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 95/198 (47%), Positives = 118/198 (59%), Gaps = 16/198 (8%)

Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
           P   W HPEG  ST+  R  HPV HVS  DA+AY  W G  LPTEAEWE+  RGGL+   
Sbjct: 90  PGAQWRHPEGPGSTVAGRDRHPVTHVSHFDALAYAQWAGKELPTEAEWEFAARGGLDRAR 149

Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
           F WG+   P G +RAN WQG FP  NT  DG++ T+PV SY+ N +GL +M GNVWEWTA
Sbjct: 150 FVWGDEFAPHGRYRANTWQGRFPWENTGEDGFVGTSPVGSYRANGYGLVDMAGNVWEWTA 209

Query: 278 DWWNVHH-----HPAPSYN---PKGPTTGTD--------KVKKGGSYLCNEQYCYRHRCA 321
           D +  HH       AP++    P  P TG          +V KGGS+LC   YC R+R A
Sbjct: 210 DHYTAHHGADGRDTAPAHACCIPSNPRTGAGDESGPYPRRVIKGGSHLCAPNYCLRYRPA 269

Query: 322 ARSQNTPDSSAGNLGFRC 339
           AR   + ++S  ++GFRC
Sbjct: 270 ARQGESVETSTCHIGFRC 287


>gi|402863388|ref|XP_003895999.1| PREDICTED: sulfatase-modifying factor 2 isoform 2 [Papio anubis]
          Length = 305

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 121/336 (36%), Positives = 158/336 (47%), Gaps = 83/336 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 49  MVQLPGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q    MK   G  S I  R+ HPV+H+SWNDA AYC WR
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 163

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+   P                             
Sbjct: 164 GKRLPTEEEWEFAARGGLKGQVYPWGNRFQP----------------------------- 194

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
                               NR   W     P+G+   + + G  P N   A        
Sbjct: 195 --------------------NRTNLWQGRF-PKGDKAEDGFHGVSPVNAFPA-------- 225

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
                +N +GLY+++GNVWEWTA             +P        +V +G S++   + 
Sbjct: 226 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 267

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
              +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 268 SANHRARVTTRMGNTPDSASDNLGFRCAADAGRLPG 303



 Score = 41.2 bits (95), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 254 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 298


>gi|432335937|ref|ZP_19587486.1| hypothetical protein Rwratislav_13708 [Rhodococcus wratislaviensis
           IFP 2016]
 gi|430777142|gb|ELB92516.1| hypothetical protein Rwratislav_13708 [Rhodococcus wratislaviensis
           IFP 2016]
          Length = 304

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 121/344 (35%), Positives = 159/344 (46%), Gaps = 63/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TF MG+         E P   VT+D F++D H+V+  +F+ FV  TG+VT AE
Sbjct: 1   MAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDTHQVTVAEFRRFVKDTGHVTTAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                                 +  D  ++ G D  +          +     V   T  
Sbjct: 58  ----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPTPG 85

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              L     W     G           W HPEG  S +  R  HPV HVSW DA AY  W
Sbjct: 86  PVPLDDYTRWWSFTAGA---------DWRHPEGPGSNVGGRERHPVTHVSWFDARAYAEW 136

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGGL+ + F WG    P G    NVWQG FP  N   DGY  T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGYAGTSP 196

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTD------ 300
           V  ++ N + L +M GNVWEWT D++ V H     + AP+ +   P  P   T       
Sbjct: 197 VGHFRPNGYDLSDMAGNVWEWTTDYFTVDHSESGKNVAPTSSCCIPTNPRVDTARAENSD 256

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   YC R+R +AR  +T ++S  ++GFRC
Sbjct: 257 EPYARRVIKGGSHLCAPNYCLRYRPSARQGDTEETSTCHIGFRC 300


>gi|424876140|ref|ZP_18299799.1| hypothetical protein Rleg5DRAFT_0493 [Rhizobium leguminosarum bv.
           viciae WSM1455]
 gi|393163743|gb|EJC63796.1| hypothetical protein Rleg5DRAFT_0493 [Rhizobium leguminosarum bv.
           viciae WSM1455]
          Length = 324

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 167/344 (48%), Gaps = 65/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF MG++        E P+  V +D F++ Q  V+N QF+EFV ATGYVT AE
Sbjct: 18  MVWIPGRTFTMGSDHHY---PEEAPAHPVKVDGFWMSQTPVTNRQFKEFVEATGYVTVAE 74

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K  D   +               + +M R   L  T    +  P +   W          
Sbjct: 75  KAPDPTDY------------PGAKPEMLRAGSLVFTQPKSVSGPDISQWWT--------- 113

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    +++G             +W  P G  S +  +++HPVVHV++ DA AY  W
Sbjct: 114 ---------FKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAYADAAAYAEW 152

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE   RGGL++  + WG+   P G   AN W G FP ++    G   T+P
Sbjct: 153 AGLDLPTEAEWELAARGGLDDAEYSWGDEFAPGGVPMANTWSGTFPIHSMKPKGQERTSP 212

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-------NPKGPTTGTD------- 300
           V S+  N FGLY+M+GNVWEWT+D+W+   HP P+        NP+G             
Sbjct: 213 VRSFPPNGFGLYDMIGNVWEWTSDYWST-RHPQPAKHSCCIPSNPRGRDAEASSDPHQPE 271

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 272 IRIARRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 315


>gi|397480350|ref|XP_003811449.1| PREDICTED: sulfatase-modifying factor 2 [Pan paniscus]
          Length = 327

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 113/294 (38%), Positives = 150/294 (51%), Gaps = 69/294 (23%)

Query: 58  TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
            Q ++FV    Y TEAE FG +FVFE  +S+E R K +Q                     
Sbjct: 98  AQARDFVREKKYRTEAEIFGWSFVFEDFVSDELRNKATQ--------------------- 136

Query: 118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
           P+  + W        W    LP E             R F    W  P G  S I  R+ 
Sbjct: 137 PMKSVLW--------W----LPVE-------------RAF----WRQPAGPGSGIRERLE 167

Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
           HPV+HVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG
Sbjct: 168 HPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQG 224

Query: 238 EFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT 296
           +FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA             +P    
Sbjct: 225 KFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAA 271

Query: 297 TGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 272 EQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 325



 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 276 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 320


>gi|12652935|gb|AAH00224.1| SUMF2 protein [Homo sapiens]
          Length = 242

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 111/294 (37%), Positives = 149/294 (50%), Gaps = 69/294 (23%)

Query: 58  TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
            Q ++FV    Y TEAE FG +FVFE  +S+E R K +Q                     
Sbjct: 13  AQARDFVREKKYRTEAEMFGWSFVFEDFVSDELRNKATQ--------------------- 51

Query: 118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
           P+  + W        W    LP E  +                 W  P G  S I  R+ 
Sbjct: 52  PMKSVLW--------W----LPVEKAF-----------------WRQPAGPGSGIRERLE 82

Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
           HPV+HVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG
Sbjct: 83  HPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQG 139

Query: 238 EFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT 296
           +FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA             +P    
Sbjct: 140 KFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAA 186

Query: 297 TGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 187 EQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 240


>gi|229819257|ref|YP_002880783.1| hypothetical protein Bcav_0760 [Beutenbergia cavernae DSM 12333]
 gi|229565170|gb|ACQ79021.1| protein of unknown function DUF323 [Beutenbergia cavernae DSM
           12333]
          Length = 329

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 125/369 (33%), Positives = 164/369 (44%), Gaps = 90/369 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  FRMG+         E P    T+ AF +D H V+N +F  FV+ +GYVT AE
Sbjct: 4   MVALPGGAFRMGSRD---FYPDEGPVHERTVGAFEIDVHPVTNAEFAAFVADSGYVTVAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D   + P ++ E+      V      F      ++ R        +W    A+  WR
Sbjct: 61  RPLDAMQY-PGVAPEDLVPGGLV------FTPTPGPVDLR--------NWR---AWWDWR 102

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                  P  +W HP G  S +  +  HPVV VS+ DA AY  W
Sbjct: 103 -----------------------PGATWRHPFGPGSDLAGKHEHPVVQVSFEDATAYADW 139

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWEY  R G +   + WG  + P G   AN WQG FP +NT ADG++ T+ 
Sbjct: 140 AGKALPTEAEWEYAARAGGDPATYAWGEEVRPDGVLMANTWQGRFPYDNTGADGWVGTSA 199

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----------------PAPSYN------ 291
           V S+  N FGL +M+GNVWEWT   +   H                   AP +       
Sbjct: 200 VGSFPANAFGLVDMIGNVWEWTTTPYRPAHEVPGGGGSPSAQRSVLPMAAPGHGTDAEHT 259

Query: 292 ---------------------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS 330
                                P GPT    +  KGGS+LC  +YC R+R AARS  T DS
Sbjct: 260 PSCCAPQASALAGSRTPADPPPSGPT--MTRALKGGSHLCAPEYCLRYRPAARSPQTDDS 317

Query: 331 SAGNLGFRC 339
           +  ++GFRC
Sbjct: 318 ATTHIGFRC 326


>gi|453067931|ref|ZP_21971216.1| hypothetical protein G418_04863 [Rhodococcus qingshengii BKS 20-40]
 gi|452766550|gb|EME24795.1| hypothetical protein G418_04863 [Rhodococcus qingshengii BKS 20-40]
          Length = 297

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 118/322 (36%), Positives = 159/322 (49%), Gaps = 60/322 (18%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P   VT+D F++D   V+  +F+ FV ATG+VT AE         P  ++   A  + 
Sbjct: 10  ERPVHQVTVDGFWMDTTAVTVAEFRRFVKATGHVTTAE-------IAPDAADYPDADPAL 62

Query: 97  VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
           +      F           + P   +S +D   Y  W          W +          
Sbjct: 63  LVPGSLVF-----------NSPPGPVSLDD---YTQW----------WSFT--------- 89

Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
            P   W HPEG DS I  R  HPV HVS+ DA AY  W G  LPTEAEWE+  RGGL+ +
Sbjct: 90  -PGADWRHPEGADSNIGGRERHPVTHVSYFDAQAYAAWAGKELPTEAEWEFAARGGLDRQ 148

Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
            + WG++ +P G    NVWQG+FP  N   DGY  TAPV  ++ N +GL++M GNVWEWT
Sbjct: 149 AYVWGDHDSPGGRPGGNVWQGQFPWENLLEDGYERTAPVGKFRPNGYGLFDMAGNVWEWT 208

Query: 277 ADWWNVHH-----HPAPSYN---PKGPTTG-----------TDKVKKGGSYLCNEQYCYR 317
           AD +   H     + AP+ +   P+ P                +V KGGS+LC   YC R
Sbjct: 209 ADHYTSDHAHSSKNVAPASSCCIPRNPRAEFATEALVGEPYARRVIKGGSHLCAPNYCLR 268

Query: 318 HRCAARSQNTPDSSAGNLGFRC 339
           +R AAR   + +SS  ++GFRC
Sbjct: 269 YRPAARQGESEESSTCHVGFRC 290


>gi|332865306|ref|XP_001160010.2| PREDICTED: sulfatase-modifying factor 2 isoform 6 [Pan troglodytes]
 gi|410224494|gb|JAA09466.1| sulfatase modifying factor 2 [Pan troglodytes]
 gi|410302662|gb|JAA29931.1| sulfatase modifying factor 2 [Pan troglodytes]
          Length = 305

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 122/336 (36%), Positives = 159/336 (47%), Gaps = 83/336 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 49  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q    MK   G  S I  R+ HPV+H+SWNDA AYC WR
Sbjct: 108 IFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGILERLEHPVLHVSWNDARAYCAWR 163

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+W  P        +R N                W
Sbjct: 164 GKRLPTEEEWEFAARGGLKGQVYPWGNWFQP--------NRTN---------------LW 200

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G                             P+G+   + + G  P N   A        
Sbjct: 201 QGK---------------------------FPKGDKAEDGFHGVSPVNAFPA-------- 225

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
                +N +GLY+++GNVWEWTA             +P        +V +G S++     
Sbjct: 226 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 267

Query: 315 CYRH--RCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
              H  R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 268 SANHQARVTTRMGNTPDSASDNLGFRCAADAGRPPG 303



 Score = 39.7 bits (91), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++        H  R   R  NTPDS++ NLGFRCAAD 
Sbjct: 254 RVLRGASWIDTADGSANHQARVTTRMGNTPDSASDNLGFRCAADA 298


>gi|453078020|ref|ZP_21980754.1| hypothetical protein G419_21909 [Rhodococcus triatomae BKS 15-14]
 gi|452757655|gb|EME16057.1| hypothetical protein G419_21909 [Rhodococcus triatomae BKS 15-14]
          Length = 297

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 113/321 (35%), Positives = 153/321 (47%), Gaps = 59/321 (18%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P   V +D F++D H V+  +F+ FV  TG+VT AE   D   + P     +    S 
Sbjct: 10  ERPVHQVHVDGFWMDTHPVTVAEFRRFVKDTGHVTVAETAPDPADY-PDADPADLVPGSL 68

Query: 97  VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
           V                    P   +S +D   + +W                       
Sbjct: 69  V-----------------FTPPSGPVSLDDYRQWWSW----------------------- 88

Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
            P   W HPEG DS    R  HPV HV+++DA+AY  W G  LPTEA+WE+  RGGL+  
Sbjct: 89  VPGADWRHPEGPDSNAGGRERHPVTHVAYSDALAYARWAGKELPTEAQWEFAARGGLDRA 148

Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
           ++ WG+   P G H+AN WQG FP  N   DG+  T+PV S++ N +GL +M GNVW+WT
Sbjct: 149 IYVWGDEFAPGGRHQANTWQGSFPWQNLVEDGFPGTSPVGSFRPNGYGLVDMAGNVWQWT 208

Query: 277 ADWWNVHH-------HPA-PSYNPKGP----------TTGTDKVKKGGSYLCNEQYCYRH 318
            D     H        PA P   P+ P           T   +V KGGS+LC   YC R+
Sbjct: 209 TDHHTADHGASGRNVAPASPCCIPRNPRVEAAVRRDGETYPRRVIKGGSHLCAPNYCLRY 268

Query: 319 RCAARSQNTPDSSAGNLGFRC 339
           R AAR  +T ++S  ++GFRC
Sbjct: 269 RPAARQGDTEETSTCHIGFRC 289


>gi|397734414|ref|ZP_10501122.1| hypothetical protein JVH1_5610 [Rhodococcus sp. JVH1]
 gi|396929722|gb|EJI96923.1| hypothetical protein JVH1_5610 [Rhodococcus sp. JVH1]
          Length = 317

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 121/346 (34%), Positives = 156/346 (45%), Gaps = 63/346 (18%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K+M  + G TF MG+         E P   VT+D F++D H+V+  +F+ FV  TG+VT 
Sbjct: 12  KNMEWISGGTFWMGSED---FYPEERPVHQVTVDGFWMDSHQVTVAEFRRFVKDTGHVTT 68

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE                      +  D  ++ G D  +          +     V   T
Sbjct: 69  AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 96

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                L     W             P   W HPEG  S +  R  HPV HVSW DA AY 
Sbjct: 97  PGPVPLDDYTRWW---------SFTPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAYA 147

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGL+ + F WG    P G    NVWQG FP  N   DG+  T
Sbjct: 148 EWAGKDLPTEAEWEFAARGGLDRKQFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGFAGT 207

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYN---PKGPTTG------ 298
           +PV  Y+ N + L +M GNVWEWTAD +   H        P+++   P  P         
Sbjct: 208 SPVGRYQPNGYRLSDMAGNVWEWTADHYTADHSAGGKNVTPAHSCCIPANPRVDAMQIND 267

Query: 299 -----TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                T +V KGGS+LC   YC R+R +AR   T D+S   +GFRC
Sbjct: 268 PDEPYTRRVIKGGSHLCAPNYCLRYRPSARQGETEDTSTCQIGFRC 313


>gi|398385642|ref|ZP_10543661.1| hypothetical protein PMI04_03371 [Sphingobium sp. AP49]
 gi|397720168|gb|EJK80728.1| hypothetical protein PMI04_03371 [Sphingobium sp. AP49]
          Length = 322

 Score =  181 bits (458), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 122/343 (35%), Positives = 163/343 (47%), Gaps = 62/343 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F MG+++       E P+  V +D F++D   V+N  ++ FV ATG+VT A+
Sbjct: 19  MVHIPGGRFIMGSDR---HYPEEAPAHPVQVDGFWMDATPVTNRDYRRFVEATGHVTSAQ 75

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D   +   L E  R                 S +      PV    W+       W 
Sbjct: 76  ISPDPADYPGALPEMLRPA---------------SLVFMPTPKPVPLNDWS------QW- 113

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y               W HP G DS++    +HPVVHV++ DA+AY  W
Sbjct: 114 ---------WRYVFDA----------DWQHPLGPDSSVAAIDDHPVVHVNYADAIAYAAW 154

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE+  RGG +   + WG+ L P G+  AN WQG FP  N   DGY  T+P
Sbjct: 155 AGKALPTEAEWEFAARGGFDGLEYAWGDELMPDGKALANFWQGAFPAENLLIDGYERTSP 214

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----------------SYNPKGPTT 297
           V  Y  N +GL++M+GNVWEWT D++   H   P                 SY+P+ P  
Sbjct: 215 VGHYPANGYGLHDMIGNVWEWTEDFYAARHQADPARPCCAPRNPRNDASDGSYDPQQPQI 274

Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
               KV KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 275 RIPRKVLKGGSHLCAPNYCQRYRPAARHAQPVDTSTSHIGFRC 317


>gi|120405591|ref|YP_955420.1| hypothetical protein Mvan_4639 [Mycobacterium vanbaalenii PYR-1]
 gi|119958409|gb|ABM15414.1| protein of unknown function DUF323 [Mycobacterium vanbaalenii
           PYR-1]
          Length = 295

 Score =  181 bits (458), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 120/333 (36%), Positives = 160/333 (48%), Gaps = 56/333 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG T  +G++        E P+  VT+D F++   +V+N +F EFV+ATGYVT AE
Sbjct: 8   MVWIPGQTAILGSDS---HYPEEAPAHPVTVDGFWMQPTQVTNLEFAEFVAATGYVTAAE 64

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLD--STIEHRMHHPVVHISWNDAVAYCT 132
           +                  ++   H     E L   S +  R   PV     N    + T
Sbjct: 65  R-----------------PVNPADHPGAPRENLQPGSMVFTRTGGPVDLRHLN---LWWT 104

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W                        P  SW HP G  S+I+ R++HPVVHV + DA +Y 
Sbjct: 105 WT-----------------------PGASWRHPVGPASSIDKRLDHPVVHVGYEDAESYA 141

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE   RGGL    + WG+     G   AN W G+FP       GY  T
Sbjct: 142 AWAGLELPTEAEWETAARGGLTAATYTWGDVPEQPGRQLANYWHGDFPWRPD--RGYGRT 199

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGG 306
            PV S+  N++GL++M GNVWEWT DW+       P     SY+ K P      +V KGG
Sbjct: 200 RPVGSFPPNRYGLFDMAGNVWEWTTDWYGDRRDYQPCCAAESYDAKQPQFEVPRRVVKGG 259

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           S+LC + YC R+R AAR     D+   ++GFRC
Sbjct: 260 SFLCADSYCQRYRPAARRPQPVDTGMSHIGFRC 292


>gi|332028607|gb|EGI68643.1| Sulfatase-modifying factor 1 [Acromyrmex echinatior]
          Length = 177

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 93/216 (43%), Positives = 120/216 (55%), Gaps = 50/216 (23%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +   T+ +GT+ P+ + DGE P R V LD F++D+ EVSN +F  FV+ATGY+TEAE
Sbjct: 1   MVKINAGTYFIGTDNPVFVADGEGPKREVVLDNFFIDKFEVSNKEFMTFVNATGYITEAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FGD+FVFE LL++  + K+++V                                     
Sbjct: 61  NFGDSFVFEGLLTQTTKDKVTKVV------------------------------------ 84

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                ++A W           L    SW HPEG +S I  RM HPV+HVSWNDAVAYC W
Sbjct: 85  -----SQAPWWL---------LVKQTSWQHPEGPNSNITFRMEHPVIHVSWNDAVAYCNW 130

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
            G RLPTEAEWE  CRG L +RL+PWGN L P+ +H
Sbjct: 131 MGKRLPTEAEWEVACRGELSDRLYPWGNKLMPKNQH 166


>gi|345320582|ref|XP_001517991.2| PREDICTED: sulfatase-modifying factor 2-like [Ornithorhynchus
           anatinus]
          Length = 462

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 122/338 (36%), Positives = 164/338 (48%), Gaps = 80/338 (23%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           A P +     V L G TFRMGT+ P   +DGE PSR VT+  F +D+  V+N  F+EFV 
Sbjct: 196 AGPGQEADGTVRLLGGTFRMGTDSPD-GRDGEGPSRTVTVRPFAVDRFPVTNRDFREFVR 254

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
              Y TEAE FG +FVFE  ++ E + K+ +         G  S I+ R+ HPV+H+SWN
Sbjct: 255 EKKYQTEAETFGWSFVFEDFVAPELKEKVKEKLEPA----GPGSGIKERLDHPVLHVSWN 310

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
           DA A+C WRG RLP+E EWE+  RGGL  R++PWG    P        +R N        
Sbjct: 311 DARAFCAWRGKRLPSEEEWEFAARGGLSGRMYPWGDDFQP--------NRSN-------- 354

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
                   W+G                     FP   ++   G H A+   G FP  N+ 
Sbjct: 355 -------LWQGG--------------------FP-KTDVAEDGFHGASP-VGAFPAQNS- 384

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKG 305
                            FGLY+++GNVWEWTA       + +P   P G      +V +G
Sbjct: 385 -----------------FGLYDLMGNVWEWTA-----SEYRSPRPGPAG-----MRVLRG 417

Query: 306 GSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            S++   +    +R R   R  NTPDS++ NLGFRCAA
Sbjct: 418 ASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAA 455


>gi|378717438|ref|YP_005282327.1| putative sulfatase-modifying factor 1 [Gordonia polyisoprenivorans
           VH2]
 gi|375752141|gb|AFA72961.1| putative sulfatase-modifying factor 1 [Gordonia polyisoprenivorans
           VH2]
          Length = 313

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 168/343 (48%), Gaps = 67/343 (19%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D+V +P   F MG+++       E P+    +++F L+ H V+N QF  FV+ TGY T A
Sbjct: 23  DVVAIPTADFEMGSDR---FYAEERPAHPRHVESFSLEVHPVTNAQFAAFVADTGYRTVA 79

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+        PL              D   F G D      +   VV    +  V    W
Sbjct: 80  ER--------PL--------------DPADFPGADPA--ELVPGAVVFTPTDGPVDLTDW 115

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R       A W Y           P  SW HP G DS ++ R++HPV  +++ DA AY  
Sbjct: 116 R-------AWWRY----------VPGTSWRHPLGPDSGVDDRLDHPVTQIAYADADAYAR 158

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE   RGGL+   + WG+ L P G   AN W G FP  N   +G+  T+
Sbjct: 159 WAGRRLPTEAEWECAARGGLDGHDYAWGDELRPDGAVMANTWIGRFPYRN---EGWGGTS 215

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------------APSYNPKGPTTG 298
           PV  Y  N FGL++M+GNVWE TA+ +   H P               APS +P      
Sbjct: 216 PVGYYPPNPFGLFDMIGNVWERTAEVFTPRHVPPDMSTVTAGGRQNLLAPSTSPT----- 270

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
             +V KGGS+LC+ QYC R+R AARS  + DS+  +LGFRC A
Sbjct: 271 VLRVTKGGSHLCSPQYCRRYRPAARSPQSDDSATSHLGFRCVA 313


>gi|417995048|ref|ZP_12635352.1| sulfatase modifying factor [Lactobacillus casei M36]
 gi|417997938|ref|ZP_12638170.1| sulfatase modifying factor [Lactobacillus casei T71499]
 gi|410538820|gb|EKQ13365.1| sulfatase modifying factor [Lactobacillus casei M36]
 gi|410542173|gb|EKQ16631.1| sulfatase modifying factor [Lactobacillus casei T71499]
          Length = 272

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 118/322 (36%), Positives = 158/322 (49%), Gaps = 63/322 (19%)

Query: 25  MGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE 83
           MGT+  I   +D E P   V + +F +    V+N  F +F++AT Y T AE+ G +FVFE
Sbjct: 1   MGTDDHIGFDEDYEGPPTIVRVPSFSMADTPVTNADFDDFIAATAYQTVAERLGSSFVFE 60

Query: 84  PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAE 143
            L+ EEER                  T +H              VA   W          
Sbjct: 61  LLIPEEERV-----------------TYQH--------------VAGAPW---------- 79

Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
           W           L P   W HP G +S+     NHPVVHV+  DA+AYC W  ++LPTEA
Sbjct: 80  W----------LLVPGADWQHPYGAESSNIDLDNHPVVHVALEDALAYCQWSHSQLPTEA 129

Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
           +WEY    G     +PWG +L       AN WQG+FP +NTA DG++ TAPV SY+ N  
Sbjct: 130 QWEYAAGAGTATT-YPWGESLVDEHGFHANTWQGDFPNDNTAEDGFVGTAPVKSYEPNSN 188

Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYNP------KGPTTGTDKVKKGGSYLCNEQYCYR 317
           GLY ++GNVWEW     N  +     +N       K P  G   + +GGS+LC+  YC R
Sbjct: 189 GLYQIIGNVWEWCR---NPRYTLLDDFNAEQFKLGKVPAAGEYAI-RGGSFLCHCSYCNR 244

Query: 318 HRCAARSQNTPDSSAGNLGFRC 339
           +R AAR+     S++ +L FRC
Sbjct: 245 YRTAARNGVDLQSTSSHLSFRC 266


>gi|334342823|ref|YP_004555427.1| sulfatase-modifying factor protein [Sphingobium chlorophenolicum
           L-1]
 gi|334103498|gb|AEG50921.1| Sulphatase-modifying factor protein [Sphingobium chlorophenolicum
           L-1]
          Length = 331

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 130/350 (37%), Positives = 169/350 (48%), Gaps = 62/350 (17%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + M  L G TF MG++        E P+R V +D F++D+  V+N QF  FV ATG+VT 
Sbjct: 25  RGMRWLSGGTFLMGSDHHY---PEEGPARRVRVDGFWMDESPVTNEQFASFVKATGWVTL 81

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE   +   +  +L E            M +   L  T       P   +S ND   + +
Sbjct: 82  AETAPNPSDYPGILPE------------MIQAGSLCFT------SPPGPVSLNDFTQWWS 123

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           +R           +G              W  PEG  ST+  R +HPVVHV+  DA AY 
Sbjct: 124 FR-----------FG------------ACWRAPEGPGSTLAGREDHPVVHVTHADAQAYA 160

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE+  RGGLE   + WG+ L P GE RAN WQG FP  N A DG+  T
Sbjct: 161 AWAGKALPTEAEWEFAARGGLEGEAYAWGSELAPAGEMRANYWQGTFPWENLATDGFEGT 220

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKGPTTGTD------ 300
           +PV  +  N  GL++M+GNVWEWT DW++       S       NP+G            
Sbjct: 221 SPVRIFPANGHGLFDMIGNVWEWTDDWYSDAPPGKKSGDCCIPANPRGALERESVDPCQP 280

Query: 301 ------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
                 KV KGGS+LC   YC R+R AAR     D+S  ++GFRC    G
Sbjct: 281 DIQIGRKVLKGGSHLCAANYCERYRPAARYAQPVDTSTCHVGFRCVKRAG 330


>gi|170721233|ref|YP_001748921.1| hypothetical protein PputW619_2052 [Pseudomonas putida W619]
 gi|169759236|gb|ACA72552.1| protein of unknown function DUF323 [Pseudomonas putida W619]
          Length = 336

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 112/340 (32%), Positives = 161/340 (47%), Gaps = 61/340 (17%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R+  MV LPG  F  G+++     D E P     +  F++D+H V+N QF  FV+ TGY+
Sbjct: 42  RHDGMVRLPGGVFTFGSDR---FYDEEGPPHAAKVSGFWVDRHPVTNAQFARFVAVTGYL 98

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           T+AE+ G +   +P L +  R   + V      F+   S ++                  
Sbjct: 99  TQAER-GISSEADPTLPDTLRVPGAMV------FQQGPSVLD------------------ 133

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                        W++           P  SW HP+G  S +    NHPVV V+  DA A
Sbjct: 134 -----------PGWQF----------VPGASWRHPQGPGSDLAGLENHPVVQVTLEDAQA 172

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
           Y  W G  LP+EA+ EY  RGGL++  F WG    P+G+  AN WQG+FP NN AADG+ 
Sbjct: 173 YARWAGRALPSEAQLEYAMRGGLDDADFSWGTTELPKGKLMANTWQGQFPYNNMAADGFT 232

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTD-------- 300
            T+PV  +  N+FG+++  GNVWE T   +   H      + +P GP             
Sbjct: 233 GTSPVGCFPANRFGVFDAGGNVWELTRTGYRPGHDAQRDAALDPSGPALSASYDPAEPGI 292

Query: 301 --KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
              V KGGS+LC+   C R+R +AR       +  ++GFR
Sbjct: 293 PVAVIKGGSHLCSADRCMRYRPSARQPQPVFIATSHVGFR 332


>gi|441650125|ref|XP_004090992.1| PREDICTED: sulfatase-modifying factor 2 [Nomascus leucogenys]
          Length = 286

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 121/336 (36%), Positives = 160/336 (47%), Gaps = 83/336 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R VT+  F +D   V+N  F++FV    Y TEAE
Sbjct: 30  MVQLQGGRFLMGTNSPD-SRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q    MK   G  S I  R+ HPV+H+SWNDA AYC WR
Sbjct: 89  MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 144

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLP E EWE+  RGGL+ +++PWG+   P        +R N                W
Sbjct: 145 GKRLPMEEEWEFAARGGLKGQVYPWGNRFQP--------NRTN---------------LW 181

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G                             P+G+   + + G  P N   A        
Sbjct: 182 QGK---------------------------FPKGDKAEDGFHGVSPVNAFPA-------- 206

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
                +N +GLY+++GNVWEWTA             +P        +V +G S++   + 
Sbjct: 207 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 248

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
              +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 249 SANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 284



 Score = 41.2 bits (95), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 235 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 279


>gi|148984246|ref|ZP_01817541.1| hypothetical protein CGSSp3BS71_03457 [Streptococcus pneumoniae
           SP3-BS71]
 gi|147923535|gb|EDK74648.1| hypothetical protein CGSSp3BS71_03457 [Streptococcus pneumoniae
           SP3-BS71]
          Length = 271

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 116/325 (35%), Positives = 155/325 (47%), Gaps = 62/325 (19%)

Query: 25  MGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE 83
           MGTN +   + D E P   V++  F +    V+N +F +FV  TGY T AE+   +FVF 
Sbjct: 2   MGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLAERQEWSFVFI 61

Query: 84  PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAE 143
             + E ER             EG          HP     W      C            
Sbjct: 62  LFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC------------ 88

Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
                             W HP G +S +    +HPVVHV+  DA+A+C W G  LPTEA
Sbjct: 89  ------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEA 130

Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
           +WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TAPV  +  N F
Sbjct: 131 QWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDF 189

Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYR 317
           GLY M+GNVWEW     N  +    S+N      PK      +   +GGS+LC+  YC R
Sbjct: 190 GLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNR 246

Query: 318 HRCAARSQNTPDSSAGNLGFRCAAD 342
           +R AAR+     S++ +LGFRC  +
Sbjct: 247 YRVAARNGCISTSTSSHLGFRCLKE 271


>gi|405958020|gb|EKC24189.1| Sulfatase-modifying factor 2 [Crassostrea gigas]
          Length = 373

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 115/333 (34%), Positives = 163/333 (48%), Gaps = 72/333 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT-GYVTEA 73
           M ++ G  F  G N P     GE+P +   + AFY+D H ++N  F +F  A   YVT A
Sbjct: 65  MKVIKGGKFLTGINDP-ESDSGEYPMQEAEVKAFYIDTHPITNAHFWKFKEAKPRYVTTA 123

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+ G ++V +P LS   R +                                        
Sbjct: 124 ERQGWSWVLKPFLSPAARDQ---------------------------------------- 143

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
             AR   E  W    +G          +W  PEG+ ST+  R++HPVVH+S  DA A+CT
Sbjct: 144 --ARNVPEGTWWVPVKGA---------AWNKPEGLGSTVRDRLDHPVVHISMEDAKAFCT 192

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTE EWE+  RGGL+ +++PWG+      ++R N+WQG+FP  NT  D +  T+
Sbjct: 193 WTGKRLPTENEWEFAARGGLQGKIYPWGDRYQ---KNRTNLWQGKFPEGNTKVDHWEGTS 249

Query: 254 PVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVK---KGGSYL 309
           PV  Y+ +N+FGLY+M+GNVWEWT+            Y  +       +V    KGGS++
Sbjct: 250 PVDFYRAQNQFGLYDMLGNVWEWTST----------RYYDRVVDRKLQEVMYVVKGGSFI 299

Query: 310 -CNEQYCYRH-RCAARSQNTPDSSAGNLGFRCA 340
              E +C R  R A R    P+ +A N+GFRC 
Sbjct: 300 DSREGHCNRIVRTAQRMGLQPEYTAQNVGFRCV 332


>gi|311106300|ref|YP_003979153.1| hypothetical protein AXYL_03118 [Achromobacter xylosoxidans A8]
 gi|310760989|gb|ADP16438.1| hypothetical protein AXYL_03118 [Achromobacter xylosoxidans A8]
          Length = 360

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 119/341 (34%), Positives = 153/341 (44%), Gaps = 51/341 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MG++ P   +  E P+  V L  F++D+  V+N  F  FV+ATGY+T AE
Sbjct: 45  MVWIPAGDFLMGSDSP-RSQANERPAHPVRLAGFWIDRDHVTNRDFARFVAATGYLTTAE 103

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D       L          V   +  F G    ++            ND   Y  W 
Sbjct: 104 RTPDWDTIRVHLPAGTPRPAGLVPGALV-FVGNSKPVDL-----------ND---YARW- 147

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W          R  P  SW HP+G  S I  +  HPVV VS+ DA+AY  W
Sbjct: 148 ---------W----------RFAPGASWRHPQGPGSDIAGKEEHPVVQVSYEDALAYAAW 188

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWEY  RGGLE   F WG +L P G   A  W         +      T P
Sbjct: 189 AGKRLPTEAEWEYAARGGLEQADFAWGPDLRPGGRPMARTWDAGLAFPLQSPKVMPGTEP 248

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWW-----NVHHHPAPSYNPKGPTTGTD--------- 300
             SY  N +G+ +M GN W+W ADW+      +        NP GP    D         
Sbjct: 249 AGSYPANGYGVRDMAGNAWQWVADWYRPDAFRLQSQQRRVRNPGGPAAPYDPAMVRPEAP 308

Query: 301 -KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            +V +GGS+LC+E YC  +R +AR    P SSA N+GFR A
Sbjct: 309 KRVIRGGSFLCSEDYCEGYRVSARQGQDPYSSASNVGFRLA 349


>gi|223939387|ref|ZP_03631266.1| protein of unknown function DUF323 [bacterium Ellin514]
 gi|223891991|gb|EEF58473.1| protein of unknown function DUF323 [bacterium Ellin514]
          Length = 425

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 124/206 (60%), Gaps = 20/206 (9%)

Query: 154 NRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEY 207
           N +F W       +W HPEG D+ ++ R  +PVV +++ DAVAY  W G RLPTEAE+E+
Sbjct: 204 NSMFQWWRYQKGANWRHPEGPDTDLKGREKYPVVQIAYPDAVAYAKWAGKRLPTEAEFEF 263

Query: 208 GCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYN 267
             RGG   + + WGN+L P G+  AN+++G FPT +T ADG+   APV  +  N +GLY+
Sbjct: 264 AARGGSAGKTYAWGNDLKPGGKWMANIYEGTFPTKDTGADGFAGIAPVAQFPPNDYGLYD 323

Query: 268 MVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------KVKKGGSYLCNEQ 313
           M GNVW+W +DW+   ++         + NP+GP T  D        +V KGGS+LC +Q
Sbjct: 324 MAGNVWQWCSDWYRADYYTQLAATGTVARNPQGPDTPFDPAEPTEKKRVHKGGSFLCTDQ 383

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRC 339
           YC R+    R +    +++ +LGFRC
Sbjct: 384 YCTRYMVGTRGKGEVTTASNHLGFRC 409


>gi|377557807|ref|ZP_09787436.1| hypothetical protein GOOTI_018_00050 [Gordonia otitidis NBRC
           100426]
 gi|377525030|dbj|GAB32601.1| hypothetical protein GOOTI_018_00050 [Gordonia otitidis NBRC
           100426]
          Length = 326

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 129/369 (34%), Positives = 173/369 (46%), Gaps = 69/369 (18%)

Query: 1   MVLLPAP-----PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEV 55
           +V  P P     P    K+M  +P   F MG++        E P   V +D F++D H V
Sbjct: 2   LVDTPVPTTASGPTRARKNMAYIPAGIFWMGSDD---FYPEERPVHQVAVDGFWMDVHPV 58

Query: 56  SNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRM 115
           +  +F+ FV  TGYVT A +       EP  ++   A  + +R     F           
Sbjct: 59  TVAEFRRFVRDTGYVTTAAR-------EPDPADYPDADPTLLRAGSMVFTP--------- 102

Query: 116 HHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHR 175
             P   +  +D   Y  W          W Y           P   W HPEG  S +  R
Sbjct: 103 --PPGPVPLDD---YRRW----------WSY----------VPGADWEHPEGPGSNVGAR 137

Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
             HPV HVS+ DA AY  W G  LPTEAEWEY  RGGL+ + + WG+ L P G   AN W
Sbjct: 138 NRHPVTHVSYADAQAYAAWAGKELPTEAEWEYAARGGLDRKPYAWGDELHPHGRPAANTW 197

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSY 290
            GEFP  NT  DGY  T+PV  ++ N +GL +++GNVWEWT D +   H     + AP+ 
Sbjct: 198 YGEFPWQNTEEDGYSRTSPVGRFRPNGYGLVDVIGNVWEWTCDHYTTDHAAGDKNVAPAS 257

Query: 291 N---PKGPT------TG------TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
               P+ P       TG        +  KGGS+LC   YC R+R +AR  +T D+S  ++
Sbjct: 258 TCCIPRNPVQRAAVDTGHGVERFARRTIKGGSHLCAPNYCVRYRPSARQSHTEDASTSHI 317

Query: 336 GFRCAADKG 344
           GFRC   +G
Sbjct: 318 GFRCIVREG 326


>gi|311251204|ref|XP_003124495.1| PREDICTED: sulfatase-modifying factor 2-like isoform 2 [Sus scrofa]
          Length = 286

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 122/336 (36%), Positives = 161/336 (47%), Gaps = 83/336 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F+MGTN P   +DGE P R VT+  F +D   V+N  F++FV    Y TEAE
Sbjct: 30  MVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE L+ +E R+K +   H M+   G  S I  R+  PVVH+SWNDA AYC WR
Sbjct: 89  AFGWSFVFEDLVPDELRSKAT---HQMQP-AGPGSGIRERLEFPVVHVSWNDARAYCAWR 144

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+   P        +R N                W
Sbjct: 145 GKRLPTEEEWEFAARGGLKGQVYPWGNQFQP--------NRTN---------------LW 181

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G + P   + E G  G      FP                    P NN           
Sbjct: 182 QG-KFPKGDKAEDGFHGVSPVNAFP--------------------PQNN----------- 209

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
                   +GLY+++GNVWEWTA             +P        +V +G S++   + 
Sbjct: 210 --------YGLYDLMGNVWEWTA-------------SPYQAADQDMRVLRGASWIDTADG 248

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
              +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 249 SANHRARVTTRMGNTPDSASDNLGFRCASSAGRPPG 284


>gi|441510563|ref|ZP_20992468.1| hypothetical protein GOACH_24_00500 [Gordonia aichiensis NBRC
           108223]
 gi|441445345|dbj|GAC50429.1| hypothetical protein GOACH_24_00500 [Gordonia aichiensis NBRC
           108223]
          Length = 323

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 128/361 (35%), Positives = 171/361 (47%), Gaps = 64/361 (17%)

Query: 4   LPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
           +PA  V   K+M  +P   F MG++        E P   V +D F++D H V+  +F+ F
Sbjct: 7   VPAAAVRARKNMAYIPSGVFWMGSDD---FYPEERPVHQVAVDGFWIDTHPVTVAEFRRF 63

Query: 64  VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
           V  TGYVT A +       EP   +   A  + +R     F             P   + 
Sbjct: 64  VRDTGYVTTAAR-------EPDPDDYPDADPTLLRAGSMVFTP-----------PPGPVP 105

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHV 183
            +D   Y  W          W Y           P   W HPEG  S +  R  HPV HV
Sbjct: 106 LDD---YRRW----------WSY----------VPGADWEHPEGPGSNVGARNRHPVTHV 142

Query: 184 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
           S+ DA AY  W G  LPTEAEWEY  RGGL+ + + WG+ L P G   AN W GEFP  N
Sbjct: 143 SYADAQAYAAWAGKELPTEAEWEYAARGGLDRKPYAWGDELHPHGRPAANTWYGEFPWQN 202

Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGP 295
           T  DGY  T+PV  ++ N +GL +++GNVWEWT D +   H     + AP+     P+ P
Sbjct: 203 TEEDGYSRTSPVGRFRPNGYGLVDVIGNVWEWTCDHYTTDHAAGDKNVAPASTCCIPRNP 262

Query: 296 T------TG------TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
                  TG        +  KGGS+LC   YC R+R +AR  +T D+S  ++GFRC   +
Sbjct: 263 VQRAAVDTGHGVERFARRTIKGGSHLCAPNYCVRYRPSARQSHTEDASTSHIGFRCVVRE 322

Query: 344 G 344
           G
Sbjct: 323 G 323


>gi|433649663|ref|YP_007294665.1| hypothetical protein Mycsm_05050 [Mycobacterium smegmatis JS623]
 gi|433299440|gb|AGB25260.1| hypothetical protein Mycsm_05050 [Mycobacterium smegmatis JS623]
          Length = 292

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 112/332 (33%), Positives = 166/332 (50%), Gaps = 52/332 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +P  T  +G+++    ++   P+R VT+D F++  H+V+N++F EF +ATGY+T A
Sbjct: 4   ELVWIPPQTAVLGSDQHYAEEE---PARPVTIDGFWIQTHQVTNSEFAEFAAATGYLTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+         + ++   A    ++     F      ++ R      H++      + TW
Sbjct: 61  ER-------ALVPADYPGAPAENLQPGSMVFTRTQGPVDLR------HLN-----LWWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  SW HP G +S+I+ R +HPVVHV++ DA AY +
Sbjct: 103 T-----------------------PGASWRHPVGPNSSIDKRADHPVVHVAYEDAEAYAS 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE   RGGL    + WG      G+  AN W G+FP       GY  T 
Sbjct: 140 WAGLALPTEAEWETAARGGLGGATYTWGEEPEQPGQSLANYWHGDFPWRPD--RGYGRTT 197

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD-KVKKGGS 307
           PV S+  N +GL +M GNVWEWT DW+             +Y+P+ P      +V KGGS
Sbjct: 198 PVGSFPPNGYGLVDMAGNVWEWTTDWYADTRARQSCCAEDTYDPRQPQFRVPRRVVKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +LC + YC R+R AAR     D+   ++GFRC
Sbjct: 258 FLCADNYCLRYRPAARRPQPVDTGMSHIGFRC 289


>gi|425736218|ref|ZP_18854525.1| non-specific serine/threonine protein kinase [Brevibacterium casei
           S18]
 gi|425478435|gb|EKU45627.1| non-specific serine/threonine protein kinase [Brevibacterium casei
           S18]
          Length = 328

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 121/330 (36%), Positives = 158/330 (47%), Gaps = 41/330 (12%)

Query: 15  MVLLPGDTFRMGTNKPILI-KDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           MV L G  FRMG+   +   +DGE P R V +  F +    V+  +F  FV ATG+ T A
Sbjct: 37  MVGLTGGVFRMGSEDSLAYPEDGEGPVREVDIAPFAIAATTVTRAEFAAFVLATGHRTGA 96

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E +GD+ VF  LL  E                 L+++        V    W   VA  +W
Sbjct: 97  EAYGDSLVFAGLLDAE----------------ALETSPA------VAATPWWRQVAGASW 134

Query: 134 RGARLPTEAEWEYG-CRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                P       G   GG        G+        + I  R +HPV HV+  DA AY 
Sbjct: 135 LHPEGPGRTLVGQGTAAGGEHGNGQERGADARLRADHAFIVARPDHPVTHVTRYDAEAYA 194

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W GARLPTEAEWE+  RGGL+ + +PWG+   P G  R   + G FP + T   G   T
Sbjct: 195 EWVGARLPTEAEWEFAARGGLDQQPYPWGSIREPDGVPRMKTFVGAFPDHPTEPVG---T 251

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
             V +Y  N FGL+NM GNVWEWT+           S+ P         V +GGS+LC+ 
Sbjct: 252 VAVDAYPPNGFGLHNMTGNVWEWTSG----------SFGPD----DARAVIRGGSHLCHA 297

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC R+R +ARS  TP++S G+ GFR A D
Sbjct: 298 SYCRRYRTSARSAVTPETSLGHTGFRLAVD 327



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 22/42 (52%), Positives = 31/42 (73%)

Query: 352 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           V +GGS+LC+  YC R+R +ARS  TP++S G+ GFR A D+
Sbjct: 287 VIRGGSHLCHASYCRRYRTSARSAVTPETSLGHTGFRLAVDL 328


>gi|298682295|gb|ADI95357.1| hypothetical protein [Pseudomonas putida DOT-T1E]
          Length = 302

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 111/333 (33%), Positives = 155/333 (46%), Gaps = 45/333 (13%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P ++  D V +    F MG+++       E P+    +  F++    V+N  F  FV+ T
Sbjct: 11  PAQQEADWVWIEPGRFLMGSDR---HYPEEAPAHWEEVAGFWIQSTPVTNAMFARFVART 67

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           GY+T+AE+  D   + P +++      S V                    P   +S +D 
Sbjct: 68  GYITQAERPLDPRDY-PGVAQAHLRPASLV-----------------FTPPSGTVSLDDV 109

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             +                         L P   W HP G  S ++    HPVVHV   D
Sbjct: 110 SRWWA-----------------------LVPGADWRHPLGPGSDLQGMAQHPVVHVGLED 146

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A+AY  W G  LP E EWEY   GG +   F WG++L P G+H AN WQG FP  N   D
Sbjct: 147 ALAYAQWAGCDLPGEPEWEYAAWGGTQQGEFVWGHHLVPEGKHMANTWQGNFPAENLQTD 206

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVKKGG 306
           GY  T+PV ++  N +GLY+M+GNVWEWT   +   H  P  S      +    KV KGG
Sbjct: 207 GYSRTSPVGAFPANGYGLYDMIGNVWEWTTTVFQPSHASPRKSCCANARSAAQMKVLKGG 266

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           S+LC   YC R+R  ARS    D+++ ++GFRC
Sbjct: 267 SHLCAPNYCQRYRPPARSPQATDTTSSHIGFRC 299


>gi|359767934|ref|ZP_09271714.1| hypothetical protein GOPIP_070_00260 [Gordonia polyisoprenivorans
           NBRC 16320]
 gi|359314511|dbj|GAB24547.1| hypothetical protein GOPIP_070_00260 [Gordonia polyisoprenivorans
           NBRC 16320]
          Length = 313

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 125/343 (36%), Positives = 168/343 (48%), Gaps = 67/343 (19%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +P   F MG+++       E P+    +++F ++ H V+N QF  FV+ TGY T A
Sbjct: 23  ELVAIPTADFEMGSDR---FYAEERPAHIRHVESFSVEVHPVTNAQFATFVADTGYRTVA 79

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+        PL              D   F G D      +   VV    +  V    W
Sbjct: 80  ER--------PL--------------DPADFPGADPA--ELVPGAVVFTPTDGPVDLTDW 115

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R       A W Y           P  SW HP G DS ++ R++HPV  +++ DA AY  
Sbjct: 116 R-------AWWRY----------VPGASWRHPLGPDSGVDDRLDHPVTQIAYADADAYAR 158

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE   RGGL+   + WG+ L P G   AN W G FP  N   +G+  T+
Sbjct: 159 WAGRRLPTEAEWECAARGGLDGHDYAWGDELHPDGAVVANTWIGRFPYRN---EGWGGTS 215

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------------APSYNPKGPTTG 298
           PV  Y  N FGL++M+GNVWE TA+ +   H P               APS +P      
Sbjct: 216 PVGYYPSNPFGLFDMIGNVWERTAEVFTPRHVPPDMSTVTAGGRQNLLAPSTSPT----- 270

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
             +V KGGS+LC+ QYC R+R AARS  + DS+  +LGFRC A
Sbjct: 271 VLRVTKGGSHLCSPQYCRRYRPAARSPQSDDSATSHLGFRCVA 313


>gi|424883781|ref|ZP_18307409.1| hypothetical protein Rleg8DRAFT_0544 [Rhizobium leguminosarum bv.
           trifolii WU95]
 gi|392515442|gb|EIW40175.1| hypothetical protein Rleg8DRAFT_0544 [Rhizobium leguminosarum bv.
           trifolii WU95]
          Length = 326

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 123/344 (35%), Positives = 166/344 (48%), Gaps = 65/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M  +PG TF MG++        E P+  V +D F++    V+N QF++FV ATGYVT AE
Sbjct: 20  MAWIPGRTFTMGSDHHY---PEEAPAHPVKVDGFWISLTPVTNRQFRQFVEATGYVTVAE 76

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K  D   +     E  RA  S V    K   G D           +   W          
Sbjct: 77  KAPDPKDYPGAKPEMLRAG-SLVFTQPKAVNGAD-----------ISQWWT--------- 115

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    +++G             +W  P G  S +  +++HPVVHV++ DA AY  W
Sbjct: 116 ---------FKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAYADAAAYAEW 154

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE   RGGL++  + WG+   P G   AN W G FPT++    G   T+P
Sbjct: 155 AGLDLPTEAEWELAARGGLDDAEYAWGDEFAPGGVPMANTWSGTFPTHSMKPKGQERTSP 214

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-------NPKGPTTGTD------- 300
           V S+  N FGLY+M+GNVWEWT+D+W+   HP P+        NP+G             
Sbjct: 215 VRSFPPNGFGLYDMIGNVWEWTSDYWST-RHPEPAKHSCCIPSNPRGRDAEASLDLRQPE 273

Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 274 IRIARRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 317


>gi|398959331|ref|ZP_10678088.1| hypothetical protein PMI26_05918 [Pseudomonas sp. GM33]
 gi|398145384|gb|EJM34171.1| hypothetical protein PMI26_05918 [Pseudomonas sp. GM33]
          Length = 302

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 112/328 (34%), Positives = 157/328 (47%), Gaps = 46/328 (14%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           ++ +P   F MG+         E P R   +  F++    V+N  F  F+++TGYVT+A 
Sbjct: 17  LIRIPEGRFLMGSQDHY---PEESPVRWEEITGFFMHATPVTNAMFAAFIASTGYVTQA- 72

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                   E  L+ E+   ++    D+ +   L          P   +  +D   + +  
Sbjct: 73  --------ERCLATEDYPGVAP---DLLQPASL------VFMPPDGPVQLDDVSRWWS-- 113

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                L P   W HP G DS ++    HPVVHV   DA AY  W
Sbjct: 114 ---------------------LVPGADWRHPLGPDSNLQGLDQHPVVHVGLEDAHAYAQW 152

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEA+WE+   GG     F WG+ L P G H AN+WQG+FP  N A DGY  T+P
Sbjct: 153 VGCELPTEAQWEFAAWGGRTGSEFVWGDQLVPDGRHMANIWQGDFPWRNLALDGYDRTSP 212

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTDKVKKGGSYLCNE 312
           V ++  N +GL +M+GNVWEWT   +N     +  PS   KG      +V KGGS+LC  
Sbjct: 213 VGAFPANGYGLQDMIGNVWEWTITPFNNPGQGSGKPSCCAKGGGEKNMRVLKGGSHLCAP 272

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            YC R+R  ARS   PD+++ ++GFRC 
Sbjct: 273 DYCRRYRAPARSPQAPDTTSSHIGFRCV 300


>gi|38638565|ref|NP_943151.1| hypothetical protein ND018 [Pseudomonas putida ND6]
 gi|34335353|gb|AAP44251.1| hypothetical protein ND018 [Pseudomonas putida ND6]
          Length = 302

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 115/333 (34%), Positives = 158/333 (47%), Gaps = 45/333 (13%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P ++  D V +    F MG++        E P+    +  F++    V+N  F  FV+ T
Sbjct: 11  PEQQEADWVWIEPGRFLMGSDS---HYPEEAPAHWEEVAGFWIQSTPVTNAMFARFVART 67

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           GYVT+AE+        PL              D + + G+  +  H     +V    + A
Sbjct: 68  GYVTQAER--------PL--------------DPRDYPGVAQS--HLRPASLVFTPPSGA 103

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
           V+        L   + W           L P   W HP G  S ++    HPVVHV   D
Sbjct: 104 VS--------LDDVSRWW---------ALVPGADWRHPLGPGSDLQGMAQHPVVHVGLED 146

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           A+AY  W G  LP E EWEY   GG +   F WG++L P G+H AN WQG FP  N   D
Sbjct: 147 ALAYAQWAGCDLPGEPEWEYAAWGGTQQGEFVWGHHLVPEGKHMANTWQGNFPAENLQTD 206

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVKKGG 306
           GY  T+PV ++  N +GLY+M+GNVWEWT   +   H  P  S      +    KV KGG
Sbjct: 207 GYSRTSPVGAFPANGYGLYDMIGNVWEWTTTVFQPSHASPRKSCCANARSAAQMKVLKGG 266

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           S+LC   YC R+R  ARS    D+++ ++GFRC
Sbjct: 267 SHLCAPNYCQRYRPPARSPQATDTTSSHIGFRC 299


>gi|84497348|ref|ZP_00996170.1| hypothetical protein JNB_14178 [Janibacter sp. HTCC2649]
 gi|84382236|gb|EAP98118.1| hypothetical protein JNB_14178 [Janibacter sp. HTCC2649]
          Length = 333

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 122/381 (32%), Positives = 186/381 (48%), Gaps = 79/381 (20%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG   RMG+++  L    E P+R++ +  F +++H V+N QF  FV+ TGYVT  
Sbjct: 31  ELVDLPGGPLRMGSDEHYL---EEAPARDMVVAPFAMERHPVTNAQFAAFVADTGYVT-- 85

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                               +++   D + + G D ++   +   +V       V    W
Sbjct: 86  --------------------VAEQALDPEAYPGADPSL--LVPGGLVFTPTPGPVPLDRW 123

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                    +W          R  P  SW HP G  S +    +HPVV V+++D +AY  
Sbjct: 124 D--------QWW---------RWVPGASWRHPRGPGSDVSDDDDHPVVMVAYDDTLAYAA 166

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTE E+E+   G   +  F WG+ L+P GE  AN WQG+FP  NT A+G++ T+
Sbjct: 167 WAGRRLPTEPEFEFAA-GAFASTEFAWGDELSPGGELMANTWQGQFPWRNTGANGWVGTS 225

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           PV S+  N++GL +++GNVWEWT+ ++               T   D+V+   +   +E 
Sbjct: 226 PVGSFPANQWGLVDLIGNVWEWTSTYYV--------------TNAEDRVR---AEEIDET 268

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 373
            C    CA R ++    + G L  R            +V KGGS+LC  +YC R+R AAR
Sbjct: 269 SC----CAPRIRDL-SVAPGELHPR------------RVLKGGSHLCAPEYCRRYRPAAR 311

Query: 374 SQNTPDSSAGNLGFRCAADVS 394
           S  + DS+  ++GFR  A  S
Sbjct: 312 SGQSVDSATSHVGFRLVAAAS 332


>gi|196228936|ref|ZP_03127802.1| protein of unknown function DUF323 [Chthoniobacter flavus Ellin428]
 gi|196227217|gb|EDY21721.1| protein of unknown function DUF323 [Chthoniobacter flavus Ellin428]
          Length = 399

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 85/195 (43%), Positives = 118/195 (60%), Gaps = 9/195 (4%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           R  P  +W HPEG  S I+ R  +PVV V W+DA+A+ +W G RLPTEAEWEY  RGG  
Sbjct: 201 RYVPGANWRHPEGPQSNIDGREKYPVVQVCWDDAIAFASWAGKRLPTEAEWEYAARGGAM 260

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
           +  + WG  + P G   AN+WQG FP  + A DG+   APV +++ N +GL +M GN WE
Sbjct: 261 HNPYVWGKEMKPGGRWDANIWQGRFPVEDKAEDGFAGLAPVGTFRTNDYGLADMAGNAWE 320

Query: 275 WTADWWNV-HHHPAPSYNPKGPTTGTD--------KVKKGGSYLCNEQYCYRHRCAARSQ 325
           W ADW++  +++ +P  NP GP    D        +V +GGSYL +E     +R +AR +
Sbjct: 321 WCADWYDATYYNRSPHANPPGPEKSDDPDEPSVPKRVARGGSYLSSENNGAGYRPSARRK 380

Query: 326 NTPDSSAGNLGFRCA 340
             P  ++ +LGFRCA
Sbjct: 381 VPPGYASCDLGFRCA 395


>gi|375140631|ref|YP_005001280.1| hypothetical protein [Mycobacterium rhodesiae NBB3]
 gi|359821252|gb|AEV74065.1| hypothetical protein MycrhN_3546 [Mycobacterium rhodesiae NBB3]
          Length = 290

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 114/332 (34%), Positives = 163/332 (49%), Gaps = 54/332 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D+V +P  T  +G+++       E P+R V +  F +   +V+N ++  FV ATGY+T A
Sbjct: 4   DLVRIPAQTATLGSDR---HYPEEGPARPVQVAGFGIQPRQVTNAEYAGFVGATGYLTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  +        ++   A  + ++     F      ++ R     +++ W       TW
Sbjct: 61  ERPVNP-------ADYPNAPAANLQPGSMVFSRTRGPVDLRH----LNLWW-------TW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  SW HP G  S+I+ R +HPVVH+++ DA AY T
Sbjct: 103 T-----------------------PGASWRHPVGPLSSIDKRADHPVVHIAYEDAEAYAT 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE   RGGL+   + WG+   P  +  AN W G+FP       GY  T 
Sbjct: 140 WAGLSLPTEAEWETAARGGLDQAEYTWGDE--PETQPMANYWHGDFPWRPD--KGYGRTT 195

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGS 307
           PV SY  N +GL++M GNVWEWT DW+       P     SY+P+ P      KV KGGS
Sbjct: 196 PVGSYPPNGYGLFDMAGNVWEWTTDWYGETRDAQPCCASDSYDPRQPQFPVPRKVVKGGS 255

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +LC + YC R+R AAR     D+   ++GFRC
Sbjct: 256 FLCADVYCLRYRPAARRPQPVDTGMSHIGFRC 287


>gi|284044520|ref|YP_003394860.1| hypothetical protein Cwoe_3066 [Conexibacter woesei DSM 14684]
 gi|283948741|gb|ADB51485.1| protein of unknown function DUF323 [Conexibacter woesei DSM 14684]
          Length = 300

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 118/339 (34%), Positives = 152/339 (44%), Gaps = 58/339 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G +FRMG+ +       E P R V +  F +D   V+  +F+ F   TGYVT AE
Sbjct: 3   MVELDGGSFRMGSEE---FYPEERPVREVAVGGFAIDPRLVTVREFRRFADETGYVTLAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K  D  ++ P    E     S V                    P   +S +    +  W 
Sbjct: 60  KPPDPAMY-PGADPELLVPGSLV-----------------FRKPRERVSLDHWRRWWAW- 100

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                  P   W HP G  ++   R  HPVVH++  DAVAY  W
Sbjct: 101 ----------------------VPGADWRHPGGPGTSTRGRDRHPVVHIAHEDAVAYAAW 138

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWEY  RGGL+   F WG++  P G   AN WQGEFP  N  A G+  T+P
Sbjct: 139 AGKALPTEAEWEYAARGGLDGARFAWGDDERPGGRLMANHWQGEFPWQNLKAKGWEGTSP 198

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWW-----------NVHHHPAPSYNPKGPTTGTD--- 300
           V ++  N + LY+M GNVWEWTAD +                       +G   G     
Sbjct: 199 VATFPPNGYDLYDMAGNVWEWTADLFTPAAPTPAAAPAAPAACCAPAPARGQEPGAHIPR 258

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 259 RVIKGGSHLCAPNYCLRYRPAARQGEAIDTSTSHIGFRC 297


>gi|322800832|gb|EFZ21706.1| hypothetical protein SINV_15331 [Solenopsis invicta]
          Length = 237

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 95/214 (44%), Positives = 120/214 (56%), Gaps = 54/214 (25%)

Query: 22  TFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFV 81
           T+ +GTN P+ + DGE P R V LD FY+D+ EVSN +F  FV+ATGY+TEAE FG++FV
Sbjct: 74  TYFIGTNNPVFVADGEGPRREVVLDNFYIDKFEVSNEEFMIFVNATGYITEAESFGESFV 133

Query: 82  FEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTE 141
           FE LL +  + KI++V                              VA   W    LP +
Sbjct: 134 FENLLKQNIKDKINKV------------------------------VAQAPWW---LPVK 160

Query: 142 AEWEYGCRGGLENRLFPWGSWLHPEGIDSTIE----HRMNHPVVHVSWNDAVAYCTWRGA 197
                              SW HPEG +S I      RM+HPV+HVSW+DAVAYC+W G 
Sbjct: 161 Q-----------------ASWQHPEGPNSNITCMYLVRMDHPVIHVSWHDAVAYCSWIGK 203

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
           RLPTEAEWE  CRGGL +RL+PWGN L P  +H+
Sbjct: 204 RLPTEAEWEVACRGGLSDRLYPWGNKLMPNNQHK 237


>gi|241113473|ref|YP_002973308.1| hypothetical protein Rleg_5135 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
 gi|240861681|gb|ACS59347.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
           trifolii WSM1325]
          Length = 324

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 122/345 (35%), Positives = 166/345 (48%), Gaps = 65/345 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG TF MG++        E P+  V +D F++    V+N QF++FV  TGYVT AE
Sbjct: 18  MVWIPGRTFTMGSDHHY---PEEAPAHPVKVDGFWMSLTPVTNRQFKQFVETTGYVTVAE 74

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D   +     E  RA  S V    K   G D           +   W          
Sbjct: 75  IAPDPKDYPGAKPEMLRAG-SLVFTQPKAVNGAD-----------ISQWWT--------- 113

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    +++G             +W  P G  S +  +++HPVVHV++ DA AY  W
Sbjct: 114 ---------FKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAYADAAAYAEW 152

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE   RGGL++  + WG+   P G   AN W G FPT++    G   T+P
Sbjct: 153 AGLDLPTEAEWELAARGGLDDAEYAWGDEFAPGGVPMANTWSGTFPTHSMKPKGQERTSP 212

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
           V S+  N FGLY+++GNVWEWT+D+W+   HP P                  SY+P+ P 
Sbjct: 213 VRSFPPNGFGLYDVIGNVWEWTSDYWST-RHPEPAKHSCCIPSNPRGRDAEASYDPRQPE 271

Query: 297 TG-TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                +V KGGS+LC   YC R+R AAR     D+S  ++GFRC 
Sbjct: 272 IRIARRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRCV 316


>gi|395843048|ref|XP_003794312.1| PREDICTED: sulfatase-modifying factor 2 [Otolemur garnettii]
          Length = 401

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 109/287 (37%), Positives = 146/287 (50%), Gaps = 69/287 (24%)

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           ++FV    Y TEAE FG +FVFE L+S E R K++Q    MK                  
Sbjct: 175 RDFVREKKYRTEAEVFGWSFVFEDLVSSELRNKVTQ---QMK------------------ 213

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
            + W        W    LP E  +                 W  P G  S I  R+  PV
Sbjct: 214 SVLW--------W----LPVEKAF-----------------WRQPAGPGSGIRERLELPV 244

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           VHVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP
Sbjct: 245 VHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQIYPWGNRFQP---NRTNLWQGKFP 301

Query: 241 TNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
             + A DG+   +PV ++  +N +GLY+++GNVWEWTA             +P       
Sbjct: 302 KGDKAEDGFHGVSPVNAFPPQNNYGLYDLLGNVWEWTA-------------SPYKAAGQD 348

Query: 300 DKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
            +V +G S++   +    +R R   R  NTPDS++ NLGFRCA+  G
Sbjct: 349 MRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCASSTG 395


>gi|377561797|ref|ZP_09791227.1| hypothetical protein GOOTI_217_00030 [Gordonia otitidis NBRC
           100426]
 gi|377520992|dbj|GAB36392.1| hypothetical protein GOOTI_217_00030 [Gordonia otitidis NBRC
           100426]
          Length = 305

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 125/340 (36%), Positives = 171/340 (50%), Gaps = 57/340 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V L G  F MG+++       E P+    + AF +++H V+N Q+  FV+ATGYVT A
Sbjct: 9   ELVALDGGLFCMGSDR---FYPEERPAHERRIGAFSIERHPVTNAQYAAFVAATGYVTVA 65

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+                        D   F G D      +   +V    +  V    W
Sbjct: 66  ER----------------------EIDPADFPGADPV--SLVPGALVFTPTSGPVDLADW 101

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R        +W          R  P  SW HP G  S +    +HPV  +++ DAVAY  
Sbjct: 102 R--------QWW---------RWEPGASWKHPLGPRSDLVGYEDHPVTQIAYADAVAYAD 144

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+  RGG+    + WG+ L P GE  AN W G+FP    A+DG+  T+
Sbjct: 145 WIGRRLPTEAEWEFAARGGVAGHEYAWGDELHPDGEILANTWIGDFP---YASDGWAGTS 201

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKG------PTTG--TDKVK 303
            + SY  N FGL +M+GNVWE T++ +   H P    + +P G      PTT     +V 
Sbjct: 202 KIGSYPPNGFGLLDMIGNVWERTSNVFAPRHIPPGLSTVSPDGRQDLLAPTTSPRVRRVT 261

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           KGGS+LC+ QYC R+R AARS  + DS+  +LGFRCAAD 
Sbjct: 262 KGGSHLCSPQYCRRYRPAARSPQSDDSATSHLGFRCAADS 301


>gi|398990931|ref|ZP_10694094.1| hypothetical protein PMI23_04586 [Pseudomonas sp. GM24]
 gi|399012075|ref|ZP_10714403.1| hypothetical protein PMI19_01181 [Pseudomonas sp. GM16]
 gi|398116681|gb|EJM06440.1| hypothetical protein PMI19_01181 [Pseudomonas sp. GM16]
 gi|398142219|gb|EJM31122.1| hypothetical protein PMI23_04586 [Pseudomonas sp. GM24]
          Length = 305

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 85/186 (45%), Positives = 110/186 (59%), Gaps = 1/186 (0%)

Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 215
           L P   W HP+G  S + +  NHPVVHV   DA+AY  W G  LPTE EWEY   GG + 
Sbjct: 118 LVPGADWSHPQGPGSNLRNMDNHPVVHVGLEDAMAYAQWAGCDLPTEQEWEYAAWGGHQA 177

Query: 216 RLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
             F WG+ L P G+H+AN WQG FP  N  ADG+  T+PV ++  N F L++M+GNVWEW
Sbjct: 178 SEFVWGHELAPNGQHQANTWQGAFPHENLEADGFAGTSPVRTFPGNGFDLFDMIGNVWEW 237

Query: 276 TAD-WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
           T   +   H  P  S       +   KV KGGS+LC   YC R+R  ARS  + D+++ +
Sbjct: 238 TTSVFQTTHDAPQKSCCTNAKISAGMKVLKGGSHLCAPNYCQRYRAPARSPQSVDTTSSH 297

Query: 335 LGFRCA 340
           +GFRC 
Sbjct: 298 IGFRCV 303


>gi|336119282|ref|YP_004574059.1| hypothetical protein MLP_36420 [Microlunatus phosphovorus NM-1]
 gi|334687071|dbj|BAK36656.1| hypothetical protein MLP_36420 [Microlunatus phosphovorus NM-1]
          Length = 318

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 121/341 (35%), Positives = 162/341 (47%), Gaps = 63/341 (18%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           + G TF MG+++       E P+  V +D F + +  V+N +F  FV+ATGY+T AE+  
Sbjct: 10  IAGGTFTMGSDR---HYPEEAPTHRVRVDGFRIQRSAVTNAEFAAFVTATGYLTVAERPL 66

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
           D   F P    E     S V      F      ++ R      H+S      +  W    
Sbjct: 67  DPADF-PGAPAENLVAGSMV------FTPTPGPVDLR------HLS-----QWWMW---- 104

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
                               P  SW HP G  S +    +HPVVH+++ DA AY  W GA
Sbjct: 105 -------------------VPGASWRHPLGPGSDLTELADHPVVHIAYEDAAAYANWCGA 145

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG-EFPTNNTAADGYLSTAPVM 256
            LPTEA+WEY  RGGL    F WG+   P G   AN W G +FP  +T   G+L TAPV 
Sbjct: 146 ALPTEAQWEYAARGGLAGAAFTWGDAARPDGRLMANTWDGPDFPWRSTGESGWLRTAPVG 205

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYNPKGPTTGT 299
           S+  N +GL++M GNVWEWT DWW   H                   + S++P+ P    
Sbjct: 206 SFPANGYGLHDMAGNVWEWTQDWWTSRHPEDAAKPCCVPVNPRGGEISDSFDPRQPQFAI 265

Query: 300 D-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
             KV KGGS+LC + YC R+R AAR     D+   ++GFRC
Sbjct: 266 PRKVIKGGSHLCADSYCLRYRPAARRPQPVDTGMSHIGFRC 306


>gi|183984379|ref|YP_001852670.1| hypothetical protein MMAR_4408 [Mycobacterium marinum M]
 gi|183177705|gb|ACC42815.1| conserved protein [Mycobacterium marinum M]
          Length = 293

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 122/332 (36%), Positives = 162/332 (48%), Gaps = 52/332 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D++ +P  T  +G+         E P R V ++AF +  H+V+NTQF EFV ATGY+T A
Sbjct: 4   DLIRIPAQTAVLGSEA---HYPEESPLRQVDVNAFRIQAHQVTNTQFAEFVDATGYLTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   + P    E     S V      F      ++ R      H+S      + TW
Sbjct: 61  ERPLDPADY-PGAPPENLQPGSMV------FTRTAGPVDLR------HLS-----LWWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  SW HP G  S+I  R +HPVVHV++ DA AY  
Sbjct: 103 T-----------------------PGASWRHPAGPHSSIAKRADHPVVHVAYEDAHAYAG 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LP+EAEWE   RGGL    + WG      G+  AN W G+FP       GY  T+
Sbjct: 140 WAGLALPSEAEWEVAARGGLTGAAYTWGEESERPGQRLANFWHGDFPWRPEP--GYGRTS 197

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGS 307
           PV S++ N +GL++M GNVWEWT DW+       P     SY+P+ P      KV KGGS
Sbjct: 198 PVGSFEPNGYGLFDMAGNVWEWTVDWYTDTPSARPAAEADSYDPQQPQFRVPRKVVKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +LC + YC R+R AAR     D+   ++GFRC
Sbjct: 258 FLCADSYCRRYRPAARRPQPIDTGMSHIGFRC 289


>gi|404420723|ref|ZP_11002458.1| sulfatase-modifying factor 1 [Mycobacterium fortuitum subsp.
           fortuitum DSM 46621]
 gi|403659762|gb|EJZ14388.1| sulfatase-modifying factor 1 [Mycobacterium fortuitum subsp.
           fortuitum DSM 46621]
          Length = 291

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 123/334 (36%), Positives = 167/334 (50%), Gaps = 51/334 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
            +++ L G TFRMG+         E P   VT+ AF +++H V+  +F  FV ATGYVT 
Sbjct: 3   SELIELAGGTFRMGSTA---FYPEEAPVHTVTVGAFAIERHPVTTAEFGAFVEATGYVTI 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+  D  ++ P + E +    + V      F   D  ++ R                  
Sbjct: 60  AERPMDPALY-PGVDEADLVPGALV------FRPTDGPVDLR-----------------D 95

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR         W++           P  +W HP G D       +HPVV V++ DA AY 
Sbjct: 96  WR-------QWWDWA----------PGANWRHPFGPDRD-PAAPDHPVVQVAYPDAAAYA 137

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G RLPTEAEWEY  RGG+E   + WG+ + P G   AN WQG FP  N  A G+  T
Sbjct: 138 AWAGRRLPTEAEWEYAARGGVEG-TYAWGDEVNPGGALMANTWQGRFPYRNDGALGWRGT 196

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTG-----TDKVKKGGS 307
           +PV S+  N FGL++M+GNVWEWT   +   H P        P         ++V KGGS
Sbjct: 197 SPVGSFPPNAFGLFDMIGNVWEWTTTLFAGQHRPQADAPACCPPPADPDPMVNQVLKGGS 256

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           +LC  +YC+R+R AARS  + DSS  ++GFRC A
Sbjct: 257 HLCAPEYCHRYRPAARSPQSQDSSTTHIGFRCVA 290



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 23/43 (53%), Positives = 32/43 (74%)

Query: 349 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
            ++V KGGS+LC  +YC+R+R AARS  + DSS  ++GFRC A
Sbjct: 248 VNQVLKGGSHLCAPEYCHRYRPAARSPQSQDSSTTHIGFRCVA 290


>gi|449265959|gb|EMC77086.1| Sulfatase-modifying factor 2, partial [Columba livia]
          Length = 225

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 106/285 (37%), Positives = 144/285 (50%), Gaps = 63/285 (22%)

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
           +EFV    Y TEAE FG +FVFE  +SEE + K++         + L+S           
Sbjct: 1   REFVREKKYKTEAEAFGWSFVFEDFVSEELKKKVT---------QKLES----------- 40

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                       W    LP E                    W  P G  S+I+ R+++PV
Sbjct: 41  ----------APWW---LPIEK-----------------AFWRQPSGPGSSIKDRLDYPV 70

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HVSWNDA  +C W+G RLP E EWE+  RGGLE R++PWGN   P   +R N+WQGEFP
Sbjct: 71  LHVSWNDAQTFCAWKGKRLPVEEEWEFAARGGLERRVYPWGNKFQP---NRTNLWQGEFP 127

Query: 241 TNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
             +TA DGY   +PV ++  +N +GLY+++GN WEWTA       +  P    +      
Sbjct: 128 RVDTADDGYHGVSPVAAFPPQNSYGLYDLLGNTWEWTA-----SEYTTPGLLRQ--RAQK 180

Query: 300 DKVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAAD 342
            +V +G S++        H  R   R  NTPDS++ NL FRCAAD
Sbjct: 181 MQVLRGASWIDTADGSANHKARVTTRMGNTPDSASDNLSFRCAAD 225


>gi|118616111|ref|YP_904443.1| hypothetical protein MUL_0225 [Mycobacterium ulcerans Agy99]
 gi|118568221|gb|ABL02972.1| conserved protein [Mycobacterium ulcerans Agy99]
          Length = 293

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 122/332 (36%), Positives = 162/332 (48%), Gaps = 52/332 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D++ +P  T  +G+         E P R V ++AF +  H+V+NTQF EFV ATGY+T A
Sbjct: 4   DLIRIPAQTAVLGSEA---HYPEESPLRQVDVNAFRIQAHQVTNTQFAEFVDATGYLTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   + P    E     S V      F      ++ R      H+S      + TW
Sbjct: 61  ERPLDPADY-PGAPPENLQPGSMV------FTRTAGPVDLR------HLS-----LWWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  SW HP G  S+I  R +HPVVHV++ DA AY  
Sbjct: 103 T-----------------------PGASWRHPAGPHSSIAKRADHPVVHVAYEDAHAYAG 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LP+EAEWE   RGGL    + WG      G+  AN W G+FP       GY  T+
Sbjct: 140 WAGLALPSEAEWEVAARGGLTGAAYTWGEESERPGQRLANFWHGDFPWRPEP--GYGRTS 197

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGS 307
           PV S++ N +GL++M GNVWEWT DW+       P     SY+P+ P      KV KGGS
Sbjct: 198 PVGSFEPNGYGLFDMAGNVWEWTVDWYIDTPSARPAAEADSYDPQQPQFRVPRKVVKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +LC + YC R+R AAR     D+   ++GFRC
Sbjct: 258 FLCADSYCRRYRPAARRPQPIDTGVSHIGFRC 289


>gi|260906522|ref|ZP_05914844.1| hypothetical protein BlinB_14423 [Brevibacterium linens BL2]
          Length = 342

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 121/329 (36%), Positives = 159/329 (48%), Gaps = 55/329 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M+ L G +F MG+  P+    DGE P R V+++ F +    V+  QF  FV+ATG+ T+A
Sbjct: 67  MIGLSGGSFLMGSEDPLAYPGDGEGPVREVSVNGFAIAATTVTVAQFAAFVAATGHRTDA 126

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E  GD+ VF  LL+E                 GL  T       P     W   V   TW
Sbjct: 127 ETHGDSLVFTGLLAE-----------------GLAETTPSVQATP-----WWRPVTGATW 164

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                P           GL       GS L      +++     HPV HVS+ DA AY  
Sbjct: 165 LQPSGP-----------GLNGVPGVPGSTLE----GASLNDWARHPVTHVSYADAEAYAQ 209

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W GARLPTE EWE+  RGGLE + +PWG+   P+G    N + G FP   T   G   T 
Sbjct: 210 WVGARLPTETEWEFASRGGLEQQPYPWGSVREPQGLANMNTFTGVFPDAPTMPVG---TV 266

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
           P  ++  N FGL+N  GNVWEWT+  ++ H                  V +GGSY+C++ 
Sbjct: 267 PADAFAPNGFGLHNTTGNVWEWTSSHFSDHDR--------------RPVLRGGSYMCHDS 312

Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           YC R+R +ARS  TPD+S G+ GFR A D
Sbjct: 313 YCRRYRTSARSAATPDTSLGHTGFRLAID 341



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 23/42 (54%), Positives = 32/42 (76%)

Query: 352 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           V +GGSY+C++ YC R+R +ARS  TPD+S G+ GFR A D+
Sbjct: 301 VLRGGSYMCHDSYCRRYRTSARSAATPDTSLGHTGFRLAIDL 342


>gi|291412121|ref|XP_002722334.1| PREDICTED: sulfatase modifying factor 2 [Oryctolagus cuniculus]
          Length = 335

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 116/336 (34%), Positives = 159/336 (47%), Gaps = 83/336 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGTN P   +DGE P R VT+  F +D   V+N  F+EF+    Y TEAE
Sbjct: 79  MVQLPGGRFLMGTNAPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFIREKKYRTEAE 137

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K+++         G  S I  R+  PV+H+SWNDA AYC WR
Sbjct: 138 AFGWSFVFEDFVSKELRNKVTKQMQPA----GPGSGISERLELPVLHVSWNDARAYCAWR 193

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+   P        +R N                W
Sbjct: 194 GKRLPTEEEWEFAARGGLKGQVYPWGNRFQP--------NRTN---------------LW 230

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G                             P+G+   + + G  P N            
Sbjct: 231 QGQ---------------------------FPKGDTAEDGFHGVSPVNAFP--------- 254

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
                +N +GLY+++GNVWEWTA       +PA   +         +V +G S++   + 
Sbjct: 255 ----PQNNYGLYDLMGNVWEWTAS-----TYPAADQD--------MRVLRGASWIDTADG 297

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
              +R R   R  NTPDS++ NL FRCA+  G   G
Sbjct: 298 SANHRARVTTRMGNTPDSASDNLSFRCASGAGRPPG 333


>gi|374609240|ref|ZP_09682036.1| protein of unknown function DUF323 [Mycobacterium tusciae JS617]
 gi|373552209|gb|EHP78819.1| protein of unknown function DUF323 [Mycobacterium tusciae JS617]
          Length = 292

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/333 (35%), Positives = 161/333 (48%), Gaps = 52/333 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D+V +P  T  +G+++       E P+R V +D F +    V+NT+F EFV ATGY+T A
Sbjct: 4   DLVWIPKQTATLGSDR---HYPEEAPARPVGVDGFSMMARTVTNTEFAEFVDATGYLTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+  D   +    +E  +                 S +  R   PV     N    + TW
Sbjct: 61  ERPLDPAAYPDAPAENLQPG---------------SMVFTRTQGPVDLRHLN---LWWTW 102

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                                   P  SW HP G  S+I+ R  HPVVH+++ DA AY  
Sbjct: 103 T-----------------------PGASWRHPVGPLSSIDRRAEHPVVHIAYEDAEAYAA 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWE   RGGL+   + WG+      +  AN W G+FP       GY  TA
Sbjct: 140 WAGMSLPTEAEWETAARGGLDQAEYTWGDEPESPDQRMANYWHGDFPWR--PEKGYGRTA 197

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGS 307
           PV SY+ N +GLY+M GNVWEWT DW+       P     S++P+ P      +V KGGS
Sbjct: 198 PVGSYQPNGYGLYDMAGNVWEWTTDWYGETRDNQPCCASDSFDPRQPQFQVPRRVVKGGS 257

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           +LC + YC R+R AAR     D+   ++GFRC 
Sbjct: 258 FLCADIYCLRYRPAARRPQPADTGMSHVGFRCV 290


>gi|108801079|ref|YP_641276.1| hypothetical protein Mmcs_4115 [Mycobacterium sp. MCS]
 gi|119870220|ref|YP_940172.1| hypothetical protein Mkms_4190 [Mycobacterium sp. KMS]
 gi|108771498|gb|ABG10220.1| protein of unknown function DUF323 [Mycobacterium sp. MCS]
 gi|119696309|gb|ABL93382.1| protein of unknown function DUF323 [Mycobacterium sp. KMS]
          Length = 291

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 123/332 (37%), Positives = 164/332 (49%), Gaps = 51/332 (15%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +D+V +P  +  +G+++       E P+R+VT+D F++  H V+N +F  FV ATGY T 
Sbjct: 3   EDLVRIPAQSATIGSDR---HYPEEAPARDVTVDGFWIQAHAVTNAEFAAFVDATGYRTV 59

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+       +P    +  A+  Q            S +  R   PV     N       
Sbjct: 60  AER-----PLDPANYPDAPAQNLQP----------GSMVFRRTAGPVDLRHLN------- 97

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                     +W Y   G          SW HP G  S+I +R +HPVVHV+++DA AY 
Sbjct: 98  ----------QWWYWTPGA---------SWRHPIGPGSSIVNRADHPVVHVAYDDAEAYA 138

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE   RGGL +  + WG+     G   AN W GEFP       GY  T
Sbjct: 139 AWAGLELPTEAEWEVAARGGLPHAGYTWGDEPEGPGHKLANYWHGEFPWRPD--RGYGRT 196

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----PSYNPKGPTTGTD-KVKKGGS 307
           APV SY  N +GLY+M GNVWEWT DW+      A     SY+P  P      +V KGGS
Sbjct: 197 APVGSYPPNGYGLYDMAGNVWEWTTDWYTDTGGSACCAEDSYDPAQPQFPVPRRVVKGGS 256

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +LC + YC R+R AAR     D+   ++GFRC
Sbjct: 257 FLCADSYCARYRPAARRPQPVDTGMSHIGFRC 288


>gi|443492512|ref|YP_007370659.1| Formylglycine-generating sulfatase enzyme [Mycobacterium liflandii
           128FXT]
 gi|442585009|gb|AGC64152.1| Formylglycine-generating sulfatase enzyme [Mycobacterium liflandii
           128FXT]
          Length = 293

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 120/329 (36%), Positives = 159/329 (48%), Gaps = 53/329 (16%)

Query: 21  DTFRMGTNKPILIKDGEFPS----RNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
           D  R+     +L  +  +P     R V ++AF +  H+V+NTQF EFV ATGY+T AE+ 
Sbjct: 4   DLIRIAAQTAVLGSEAHYPEESPLRQVDVNAFRIQVHQVTNTQFAEFVDATGYLTVAERP 63

Query: 77  GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
            D   + P    E     S V      F      ++ R      H+S      + TW   
Sbjct: 64  LDPADY-PGAPPENLQPGSMV------FTRTAGPVDLR------HLS-----LWWTWT-- 103

Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
                                P  SW HP G  S+I  R +HPVVHV++ DA AY  W G
Sbjct: 104 ---------------------PGASWRHPAGPHSSIAKRADHPVVHVAYEDAHAYAGWAG 142

Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
             LP+EAEWE   RGGL    + WG      G+  AN W G+FP       GY  T+PV 
Sbjct: 143 LALPSEAEWEVAARGGLTGAAYTWGEESERPGQRLANFWHGDFPWRPEP--GYGRTSPVG 200

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGSYLC 310
           S++ N +GL++M GNVWEWT DW+       P     SY+P+ P      KV KGGS+LC
Sbjct: 201 SFEPNGYGLFDMAGNVWEWTVDWYTDTPSARPAAEADSYDPQQPQFRVPRKVVKGGSFLC 260

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            + YC R+R AAR     D+   ++GFRC
Sbjct: 261 ADSYCRRYRPAARRPQPIDTGMSHIGFRC 289


>gi|418051829|ref|ZP_12689913.1| Sulphatase-modifying factor protein [Mycobacterium rhodesiae JS60]
 gi|353184521|gb|EHB50048.1| Sulphatase-modifying factor protein [Mycobacterium rhodesiae JS60]
          Length = 290

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 120/328 (36%), Positives = 164/328 (50%), Gaps = 51/328 (15%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V +PG T  +G+++       E P+R+VT+D F +   +V+N  F EFVSATGY+T A
Sbjct: 5   NLVWIPGQTAVLGSDR---HYPEEAPARSVTVDGFSIQDRQVTNAAFTEFVSATGYLTVA 61

Query: 74  EKFGDTFVFEPL-LSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           E+        PL  ++   A    ++     F      ++ R      H+S      + T
Sbjct: 62  ER--------PLNPADYPGAPAVNLQPGSMVFRRTSGPVDLR------HLS-----LWWT 102

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W                        P  SW HP G  S+I+ R  HPVVHV++ DA AY 
Sbjct: 103 WT-----------------------PGASWRHPLGPTSSIDKRAAHPVVHVAYEDAEAYA 139

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWE   RGGL+   + WG +    G+  AN W GEFP       GY  T
Sbjct: 140 AWAGLALPTEAEWEVAARGGLDRADYTWGEHPEFPGQRLANYWHGEFPWR--PEPGYGRT 197

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-TGTDKVKKGGSYLCN 311
               ++  N +GL++M GNVWEWT DW+      A SY+P  P  T   KV KGGS+LC 
Sbjct: 198 TEAGAFPPNDYGLFDMAGNVWEWTTDWYGGTE--AASYDPAQPQFTVPRKVVKGGSFLCA 255

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           + YC R+R AAR     D+   ++GFRC
Sbjct: 256 DSYCMRYRPAARRPQMIDTGMSHIGFRC 283


>gi|374371504|ref|ZP_09629460.1| hypothetical protein OR16_38974 [Cupriavidus basilensis OR16]
 gi|373096914|gb|EHP38079.1| hypothetical protein OR16_38974 [Cupriavidus basilensis OR16]
          Length = 305

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 89/210 (42%), Positives = 113/210 (53%), Gaps = 22/210 (10%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           R  P   W HP G DS I  + +HPVV VS+ DA AY  W G RLPTEAEWE+  RGGLE
Sbjct: 25  RYVPGADWRHPNGPDSNITGKDDHPVVQVSYEDAQAYAKWAGKRLPTEAEWEFAARGGLE 84

Query: 215 NRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGYLSTAPVMSYKENKFGLYNMVG 270
              + WG+   P G   ANVWQG+    FP  +  A G L T+PV ++  N +GL +M G
Sbjct: 85  QATYGWGDQFAPDGRQMANVWQGQQVQPFPVVSAKAGGALGTSPVGTFPANGYGLSDMTG 144

Query: 271 NVWEWTADWWNVHH---------------HPAPSYNPKG---PTTGTDKVKKGGSYLCNE 312
           N W+W ADW+                    P  S++P     PT    +V +GGS+LCNE
Sbjct: 145 NAWQWVADWYRADQFKREAVSAKLIDNPADPGESWDPSEPGVPTAAPRRVTRGGSFLCNE 204

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC  +R +AR    P +S  +LGFR   D
Sbjct: 205 DYCLSYRPSARRGTDPYNSMSHLGFRLVED 234


>gi|326382909|ref|ZP_08204599.1| hypothetical protein SCNU_08218 [Gordonia neofelifaecis NRRL
           B-59395]
 gi|326198499|gb|EGD55683.1| hypothetical protein SCNU_08218 [Gordonia neofelifaecis NRRL
           B-59395]
          Length = 337

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 115/333 (34%), Positives = 150/333 (45%), Gaps = 49/333 (14%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +++MV +P   F MG++        E P+    + AF++D+H V+  +F+ FV ATGYVT
Sbjct: 45  HRNMVRVPAGEFTMGSDD---FFPEERPAHAREVGAFWIDRHPVTVAEFRRFVQATGYVT 101

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AE+  D   +   L  E RA             GL            V +     V   
Sbjct: 102 VAERDPDPARYPDALPGELRAG------------GL------------VFVRPPRQVPLD 137

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            WR         W Y           P   W  P G DS +  R  HPV  V  +DA AY
Sbjct: 138 DWR-------RWWAY----------VPGAQWRRPFGPDSDVRARDRHPVTQVGIDDARAY 180

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG--- 248
             W G  LPTEAEWEY    G  +  + WG+   PRG   AN W G FP+      G   
Sbjct: 181 AEWAGKALPTEAEWEYAAASGRGDTTYAWGDEFAPRGRRMANTWMGAFPSEFVPGRGQSE 240

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK--VKKGG 306
              T PV +Y  N F L +M GNVWEWT D ++  H  +    P     G +   V KGG
Sbjct: 241 VPGTTPVGTYPPNAFDLVDMTGNVWEWTEDRYSDSHRRSTCCPPTRTGDGGEDHYVVKGG 300

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           S+LC   YC R+R AAR     D+S+ ++GFRC
Sbjct: 301 SFLCAPNYCLRYRPAARQSQDSDTSSCHIGFRC 333


>gi|120402330|ref|YP_952159.1| hypothetical protein Mvan_1319 [Mycobacterium vanbaalenii PYR-1]
 gi|119955148|gb|ABM12153.1| protein of unknown function DUF323 [Mycobacterium vanbaalenii
           PYR-1]
          Length = 290

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 116/331 (35%), Positives = 164/331 (49%), Gaps = 49/331 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V L G +FRMG+ +       E P   VT+  + +++H V+N QF  FV+ TGY T A
Sbjct: 4   ELVALDGGSFRMGSTR---FYPEEAPVHTVTVAPYAIERHPVTNAQFASFVADTGYRTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E   D  ++ P  + ++    + V                    PV    W        W
Sbjct: 61  ELAPDPALY-PGAAPQDLVPGALVFRPTA--------------GPVDLTDWRQ------W 99

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     WE+           P   W HP G  S+++ + +HPVV V + DA AY  
Sbjct: 100 ----------WEW----------VPGADWRHPSGPGSSVDDKPDHPVVQVCYADAAAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWE+  R G    ++ WG+   P G   AN WQG FP  N  A G+  T+
Sbjct: 140 WAGRRLPTEAEWEFAARAG-STAVYAWGDEAVPGGALMANTWQGAFPYRNDGALGWSGTS 198

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPK-GPTTGTDKVKKGGSYL 309
           PV ++  N +GL +M+GNVWEWT   +  HH    P  S  P+ GP     +  KGGS+L
Sbjct: 199 PVGTFPANAYGLVDMIGNVWEWTTTRFAGHHVLDAPVNSCCPQAGPDPAVTQTLKGGSHL 258

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           C  +YC+R+R +ARS  + DS+  ++GFRCA
Sbjct: 259 CAPEYCHRYRPSARSPQSQDSATTHIGFRCA 289


>gi|237649822|ref|ZP_04524074.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae CCRI 1974]
 gi|237820749|ref|ZP_04596594.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae CCRI 1974M2]
          Length = 282

 Score =  171 bits (433), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 153/335 (45%), Gaps = 62/335 (18%)

Query: 15  MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           M  + G TF MGTN +   + D E P   V++  F +    V+N +F +FV  TGY    
Sbjct: 3   MKKIKGGTFMMGTNSEEGFLDDFEDPQVAVSVKDFSIADTPVTNQEFAQFVKETGY---- 58

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                                               T+  R     V I +   V     
Sbjct: 59  -----------------------------------KTLAERQEWSFVFILF---VPEAER 80

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
            G   P  A W             P   W HP G +S +    ++PVVHV+  DA+A+C 
Sbjct: 81  EGYPHPAGAPWWLQV---------PNACWKHPYGENSNLVGLEDYPVVHVALEDALAFCN 131

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NT  DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTGLDGFIGTA 190

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
           PV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282


>gi|399065820|ref|ZP_10748072.1| hypothetical protein PMI02_04496, partial [Novosphingobium sp.
           AP12]
 gi|398029111|gb|EJL22600.1| hypothetical protein PMI02_04496, partial [Novosphingobium sp.
           AP12]
          Length = 266

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 110/282 (39%), Positives = 139/282 (49%), Gaps = 43/282 (15%)

Query: 1   MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           M L  +P     + M  +PG  F MG+N        E P R V +  F++D   V+N +F
Sbjct: 1   MTLATSPQHAAARTMEWIPGGRFIMGSNH---HYPEERPERPVEVAGFWIDTTPVTNAEF 57

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
             FV+ATG+ T AE   D   + P    E     S V     R              PV 
Sbjct: 58  ARFVAATGHRTVAETAPDPRDY-PGALPEMLVPASLVFQPPPR--------------PV- 101

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
                           R P    W+Y  R G +        W +P G  S+IE   +HPV
Sbjct: 102 --------------DPRGPASQWWDY--RTGAD--------WRYPTGPGSSIEDFPDHPV 137

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           VHV+  D  AY  W G  LPTEAEWE+  RGG E   F WG++L P G H ANVWQG+FP
Sbjct: 138 VHVALADVEAYLAWSGMSLPTEAEWEFAARGGTEKTEFAWGDDLVPGGIHHANVWQGQFP 197

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
             NT  DG+L T+PV SY  N +GL +M+GNVWEWTADWW++
Sbjct: 198 YANTLDDGWLRTSPVGSYPANGYGLLDMIGNVWEWTADWWSM 239


>gi|170690131|ref|ZP_02881298.1| protein of unknown function DUF323 [Burkholderia graminis C4D1M]
 gi|170144566|gb|EDT12727.1| protein of unknown function DUF323 [Burkholderia graminis C4D1M]
          Length = 357

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 121/336 (36%), Positives = 158/336 (47%), Gaps = 75/336 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G TF +G+ +       E P+ +    +  F++DQ +V+N QF  FV+ATGYVT+
Sbjct: 79  MVHLHGGTFVLGSTRGY---QDERPAGDGRTRVGGFWIDQTDVTNAQFAAFVNATGYVTD 135

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEK G   VF            S  R +M   +                      +A+ T
Sbjct: 136 AEKQGGAAVFH-----------SPTRAEMDARD----------------------LAWWT 162

Query: 133 W-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
           W +GA                        SW HP+G  S I+  MN PV  V+  DA+AY
Sbjct: 163 WVKGA------------------------SWRHPQGEGSGIDRLMNQPVTLVTQADALAY 198

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAADG 248
             W G  LPTEAEWEY  +GG E        +  PR   G+  AN WQG FP  NT  DG
Sbjct: 199 ARWLGRDLPTEAEWEYAAKGGNEGADL----DTAPRNAQGKPAANYWQGVFPVLNTNDDG 254

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPSYNPKGPTT---GTDKVK 303
           ++  APV  Y  N F LY+M+GN WEWT D +   H  H         P+T    T  V 
Sbjct: 255 HVGLAPVGCYAANGFKLYDMIGNAWEWTKDVYTGAHQSHTNGDTAAVAPSTRRHDTPMVI 314

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           KGGS+LC+  YC R+R ++R Q   D  A ++GFR 
Sbjct: 315 KGGSFLCSRDYCVRYRASSREQQEADLPASHIGFRT 350


>gi|425439454|ref|ZP_18819778.1| Sulfatase-modifying factor 1 (fragment) [Microcystis aeruginosa PCC
           9717]
 gi|389720332|emb|CCH95968.1| Sulfatase-modifying factor 1 (fragment) [Microcystis aeruginosa PCC
           9717]
          Length = 223

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 91/203 (44%), Positives = 115/203 (56%), Gaps = 20/203 (9%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           +  P  +W HPEG  S+I++R NHPVVH+ + DA+AY  W G  LPTEA+WE+  RGGLE
Sbjct: 16  QYMPGANWRHPEGTGSSIKNRENHPVVHIVYEDALAYAKWAGKELPTEAQWEFAARGGLE 75

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
              F WG+   P G+  AN WQG+FP  N         A V SY  N +GLY+M+GNVWE
Sbjct: 76  GAEFAWGDEFMPNGKVMANTWQGQFPWQNLKPHAPGPEA-VESYPPNGYGLYDMIGNVWE 134

Query: 275 WTADWWNVHHHPAPSY---NPKGPTTGTD----------------KVKKGGSYLCNEQYC 315
           WT DW+   H   P+     PK P  GT+                KV KGGS+LC   YC
Sbjct: 135 WTQDWYQEQHPKNPTKACCTPKNPLGGTEEDSYNLKLHPSMRKPRKVLKGGSFLCAPNYC 194

Query: 316 YRHRCAARSQNTPDSSAGNLGFR 338
            R+R AAR     D+S  ++GFR
Sbjct: 195 ARYRPAARHPEDIDTSTNHIGFR 217


>gi|26990068|ref|NP_745493.1| hypothetical protein PP_3353 [Pseudomonas putida KT2440]
 gi|24984995|gb|AAN68957.1|AE016527_3 conserved hypothetical protein [Pseudomonas putida KT2440]
          Length = 291

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 111/337 (32%), Positives = 152/337 (45%), Gaps = 61/337 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG  F  G+++     D E P     +  F++D H V+N QF  FV ATGYVT AE
Sbjct: 1   MVHVPGGEFSFGSSR---FYDEEGPPHPAKVSGFWIDVHPVTNAQFARFVKATGYVTHAE 57

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           + G     +P L +  R   + V H     +G D          V+   W          
Sbjct: 58  R-GTRVEDDPALPDALRIPGAMVFH-----QGAD----------VLGPGW---------- 91

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                               +  P  +W HP+G  S++    NHPVV ++  DA AY  W
Sbjct: 92  --------------------QFVPGANWRHPQGPGSSLAGLDNHPVVQIALEDAQAYARW 131

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LP+EA+ EY  RGGL +  F WG    P+G+  AN WQG+FP  N A DG+  T+P
Sbjct: 132 AGRELPSEAQLEYAMRGGLTDADFSWGTTEQPKGKLMANTWQGQFPYRNAAKDGFTGTSP 191

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTD----------KV 302
           V  +  N FGL++  GNVWE T   +   H        +P GP                V
Sbjct: 192 VGCFPANGFGLFDAGGNVWELTRTGYRPGHDAQRDAKLDPSGPALSDSFDPADPGVPVAV 251

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            KGGS+LC+   C R+R +AR       +  ++GFR 
Sbjct: 252 IKGGSHLCSADRCMRYRPSARQPQPVFMTTSHVGFRT 288


>gi|296141015|ref|YP_003648258.1| hypothetical protein [Tsukamurella paurometabola DSM 20162]
 gi|296029149|gb|ADG79919.1| protein of unknown function DUF323 [Tsukamurella paurometabola DSM
           20162]
          Length = 259

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 84/162 (51%), Positives = 112/162 (69%), Gaps = 5/162 (3%)

Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
           HPVVHVS+ DA+AYC W G  LPTEA+WEY  RGGLE+  +PWGN     GE R +++ G
Sbjct: 98  HPVVHVSYTDALAYCGWAGRALPTEAQWEYAARGGLESARYPWGNEHPAPGE-RCHIFTG 156

Query: 238 EFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPT 296
            FP   T   G + T P  S+  N +G+Y  VGNVWEW +D ++  ++  +P+ +P GP 
Sbjct: 157 SFPDAPT---GPVGTIPAQSFAPNGYGVYQAVGNVWEWCSDRYSARYYRVSPTDSPGGPD 213

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            GT +V +GGS+LC++ YCYR+R AARS+NTP SSA N+GFR
Sbjct: 214 RGTARVLRGGSHLCHDSYCYRYRVAARSRNTPSSSASNIGFR 255



 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/131 (41%), Positives = 73/131 (55%), Gaps = 6/131 (4%)

Query: 35  DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE-----PLLSEE 89
           DGE P   V+L  F +D   V+   F  F +ATGY+T+AE+FG + VF+     P    +
Sbjct: 12  DGEQPVHPVSLPDFEIDATTVTVAAFAAFATATGYITDAERFGASAVFQASASAPGTPVD 71

Query: 90  ERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCR 149
                  V     R  G   ++     HPVVH+S+ DA+AYC W G  LPTEA+WEY  R
Sbjct: 72  GTPWWRTVPGADFRTPGGPGSVALD-EHPVVHVSYTDALAYCGWAGRALPTEAQWEYAAR 130

Query: 150 GGLENRLFPWG 160
           GGLE+  +PWG
Sbjct: 131 GGLESARYPWG 141


>gi|377566259|ref|ZP_09795520.1| hypothetical protein GOSPT_114_00280 [Gordonia sputi NBRC 100414]
 gi|377526513|dbj|GAB40685.1| hypothetical protein GOSPT_114_00280 [Gordonia sputi NBRC 100414]
          Length = 304

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 129/339 (38%), Positives = 168/339 (49%), Gaps = 57/339 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V L G  F MG+++       E P+    + AF +++H V+N Q+  FVS TGYVT A
Sbjct: 9   ELVELTGGVFCMGSDR---FYPEERPAHERRVGAFAIERHPVTNAQYAAFVSDTGYVTVA 65

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E                         D   F G D  +   +   +V       V    W
Sbjct: 66  ES----------------------DIDPVDFPGSDPAM--LVPGALVFTPTAGPVDLTDW 101

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R       A W          R  P  SW HP G  + I    +HPV  +++ DAVAY  
Sbjct: 102 R-------AWW----------RWQPGASWRHPLGPATDIVGHDDHPVTQIAYADAVAYAA 144

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG+    + WG+ L P GE  AN W G FP    A+DG+  T+
Sbjct: 145 WAGRRLPTEAEWEYAARGGVVGHEYAWGDELHPEGEILANTWIGNFP---YASDGWSGTS 201

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKG------PTTG--TDKVK 303
            V SY  N FGL +M+GNVWE T+D +   H P    + +P G      PTT     +V 
Sbjct: 202 RVGSYPPNGFGLLDMIGNVWERTSDVFTPRHVPPDLATVSPDGRPDLLTPTTSPQVTRVT 261

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           KGGS+LC+ QYC R+R AARS  + DS+  +LGFRCAAD
Sbjct: 262 KGGSHLCSPQYCRRYRPAARSPQSDDSATSHLGFRCAAD 300


>gi|389864487|ref|YP_006366727.1| sulfatase-modifying factor 1 [Modestobacter marinus]
 gi|388486690|emb|CCH88242.1| Sulfatase-modifying factor 1 [Modestobacter marinus]
          Length = 310

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 88/199 (44%), Positives = 109/199 (54%), Gaps = 10/199 (5%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           R  P   W HP G  ST++    HPVVHV W DA+AY  W G RLPTEAEWE   RGG  
Sbjct: 112 RWVPGADWRHPGGPGSTLDGLERHPVVHVGWEDALAYAAWAGRRLPTEAEWERAARGGAP 171

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
             ++ WG    PRG   AN W+G FP  NT+  G   T+PV  Y  N +GL +M+GNVWE
Sbjct: 172 PTVYAWGVEARPRGRAMANTWEGRFPWENTSPAGRTRTSPVGVYPANGYGLVDMIGNVWE 231

Query: 275 WTADWWNVHHHPAPSYN----------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARS 324
           WT+  W   H   P  +          P   +    +V KGGS+LC   YC+R+R  AR 
Sbjct: 232 WTSSPWTEDHGADPESDQVAHACCGGGPPAVSETDRRVMKGGSHLCAPTYCHRYRPPARQ 291

Query: 325 QNTPDSSAGNLGFRCAADK 343
            +   SS  +LGFRCA D+
Sbjct: 292 GHDVRSSTSHLGFRCAVDR 310


>gi|302556760|ref|ZP_07309102.1| sulfatase-modifying factor 1 [Streptomyces griseoflavus Tu4000]
 gi|302474378|gb|EFL37471.1| sulfatase-modifying factor 1 [Streptomyces griseoflavus Tu4000]
          Length = 243

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 103/261 (39%), Positives = 125/261 (47%), Gaps = 52/261 (19%)

Query: 4   LPAPPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           LPA      +  V LPG  F MG         DGE P   V L  F++D+  V+N +F  
Sbjct: 24  LPAAGKRSTRGQVRLPGGEFAMGDAFGEGYRADGETPVHPVRLTPFHIDETAVTNARFAA 83

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV ATG+VT+AE+ G + VF  ++                               P   I
Sbjct: 84  FVKATGHVTDAERHGSSAVFHLVVDA-----------------------------PAADI 114

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
             N A              A W    RG           W  PEG+ S +  R NHPVVH
Sbjct: 115 LGNAA-------------GAPWWINVRGA---------HWRRPEGVRSDVTGRQNHPVVH 152

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           VSWNDA AY  W G RLPTEAEWEY  RGGL+ R + WG+ LTP G  R N+WQG FP  
Sbjct: 153 VSWNDASAYARWAGKRLPTEAEWEYAARGGLDGRRYAWGDELTPDGRWRCNIWQGRFPHT 212

Query: 243 NTAADGYLSTAPVMSYKENKF 263
           NTA DG+L+TAPV SY+  + 
Sbjct: 213 NTADDGHLTTAPVKSYRPQRL 233


>gi|148547626|ref|YP_001267728.1| hypothetical protein Pput_2405 [Pseudomonas putida F1]
 gi|148511684|gb|ABQ78544.1| protein of unknown function DUF323 [Pseudomonas putida F1]
          Length = 341

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 115/348 (33%), Positives = 157/348 (45%), Gaps = 62/348 (17%)

Query: 4   LPAPPVERYKD-MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           LPA     ++D MV +PG  F  G+++     D E P     +  F++D H V+N QF  
Sbjct: 39  LPAQRQGPWRDGMVHVPGGEFSFGSSR---FYDEEGPPHPAKVSGFWIDVHPVTNAQFAR 95

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV ATGYVT AE+ G     +P L    R   + V H     +G D          V+  
Sbjct: 96  FVKATGYVTHAER-GTRVEDDPALPGALRVPGAMVFH-----QGAD----------VLSP 139

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
            W                              +  P  +W HP+G  S++    NHPVV 
Sbjct: 140 GW------------------------------QFVPGANWRHPQGPGSSLAGLDNHPVVQ 169

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           ++  DA AY  W G  LP+EA+ EY  RGGL +  F WG    P+G+  AN WQG+FP  
Sbjct: 170 IALEDAQAYARWAGRELPSEAQLEYAMRGGLTDADFSWGATEQPKGKLMANTWQGQFPYR 229

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTD 300
           N A DG+  T+PV  +  N  GL++  GNVWE T   +   H        +P GPT    
Sbjct: 230 NAAKDGFTGTSPVGCFPANGLGLFDAGGNVWELTRTGYRPGHDAQRDAKLDPSGPTLSDS 289

Query: 301 ----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                      V KGGS+LC+   C R+R +AR       +  ++GFR
Sbjct: 290 FDPADPGVPVAVIKGGSHLCSADRCMRYRPSARQPQPVFMTTSHVGFR 337


>gi|384097485|ref|ZP_09998606.1| hypothetical protein W5A_02480 [Imtechella halotolerans K1]
 gi|383837453|gb|EID76853.1| hypothetical protein W5A_02480 [Imtechella halotolerans K1]
          Length = 347

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 89/193 (46%), Positives = 116/193 (60%), Gaps = 16/193 (8%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
            SW HP G +S IE + +HPVV VSW DA AYC W G RLPTEAE+EY  RG      +P
Sbjct: 159 ASWKHPYGPNSNIEGKDDHPVVQVSWYDAQAYCHWSGKRLPTEAEFEYLSRGQKTEESYP 218

Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
           WGN+ +     +AN +QG+FP  N+  DGY  TA V S++ N FGLY++ GNVWEWT D 
Sbjct: 219 WGNDFS-IATQKANFFQGDFPNFNSMDDGYNKTAAVKSFESNGFGLYDIAGNVWEWTLDT 277

Query: 280 WNVHHHPAPS----------YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
           +  + +              +NPK      +KV +GGSYLC+E YC  +R AAR  ++PD
Sbjct: 278 YYPNAYQLKKKHEQDFFLTYHNPK-----QEKVVRGGSYLCSESYCTGYRSAARMSSSPD 332

Query: 330 SSAGNLGFRCAAD 342
           +S  + GFR   D
Sbjct: 333 TSLEHTGFRTVLD 345



 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 62/160 (38%), Positives = 88/160 (55%), Gaps = 16/160 (10%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           ++MV + G TF MG N      D EFP    T +  ++D+ EV+N QF++F+ ATGY+T 
Sbjct: 67  ENMVFIKGGTFVMGGNNAQARPD-EFPQHKDTANDIWVDKTEVTNAQFKKFIEATGYITT 125

Query: 73  AEKFGDT---------FVFEPLLSEE--ERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           AE+  +           VF+P   +   +  + +  +H      G +S IE +  HPVV 
Sbjct: 126 AERSFEVEGQSYPPGALVFDPNQPDFWWKFVEGASWKHPY----GPNSNIEGKDDHPVVQ 181

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
           +SW DA AYC W G RLPTEAE+EY  RG      +PWG+
Sbjct: 182 VSWYDAQAYCHWSGKRLPTEAEFEYLSRGQKTEESYPWGN 221



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 30/43 (69%)

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
           +KV +GGSYLC+E YC  +R AAR  ++PD+S  + GFR   D
Sbjct: 303 EKVVRGGSYLCSESYCTGYRSAARMSSSPDTSLEHTGFRTVLD 345


>gi|154251586|ref|YP_001412410.1| hypothetical protein Plav_1132 [Parvibaculum lavamentivorans DS-1]
 gi|154155536|gb|ABS62753.1| protein of unknown function DUF323 [Parvibaculum lavamentivorans
           DS-1]
          Length = 318

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 114/328 (34%), Positives = 154/328 (46%), Gaps = 73/328 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G TF MG      +   E P R V+++AF++D+ EV+N +F EFV+ATGYVT AE
Sbjct: 55  MVRVEGGTFMMGDT----LYPEEGPLREVSVEAFWMDRTEVTNAEFAEFVAATGYVTVAE 110

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA---YC 131
           +  DT                       R  GL    E +    VV +  ND        
Sbjct: 111 RPVDT----------------------ARHPGLPP--EMQKPGAVVFVMPNDVSGTGDIS 146

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
            W          W+Y           P  +W HP G D++IE R + PV  +++ DA AY
Sbjct: 147 QW----------WQY----------IPGANWRHPGGPDTSIEERDHFPVTAIAYEDAQAY 186

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR-ANVWQGEFPTNNTAADGYL 250
             W+G  LPTEA+WE+  R           +   PR + R AN WQG FP  N   DG++
Sbjct: 187 AAWKGRVLPTEAQWEWAAREAK-------ADAPPPRAQPREANTWQGLFPVANNTEDGFV 239

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
             APV  ++ N  GLY+M+GN+WEWT D +               T    +V KGGS+LC
Sbjct: 240 GIAPVGCFEPNALGLYDMIGNLWEWTVDEYE--------------TVPGARVIKGGSWLC 285

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
              YC R+R  AR     D +  +LGFR
Sbjct: 286 APNYCLRYRPGARQPQEADLATTHLGFR 313



 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 1/56 (1%)

Query: 333 GNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 388
           GNL +    D+  T    +V KGGS+LC   YC R+R  AR     D +  +LGFR
Sbjct: 259 GNL-WEWTVDEYETVPGARVIKGGSWLCAPNYCLRYRPGARQPQEADLATTHLGFR 313


>gi|226437577|ref|NP_001139805.1| sulfatase-modifying factor 2 isoform g [Homo sapiens]
 gi|119628385|gb|EAX07980.1| sulfatase modifying factor 2, isoform CRA_f [Homo sapiens]
 gi|119628387|gb|EAX07982.1| sulfatase modifying factor 2, isoform CRA_f [Homo sapiens]
 gi|194390888|dbj|BAG62203.1| unnamed protein product [Homo sapiens]
          Length = 213

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 86/190 (45%), Positives = 115/190 (60%), Gaps = 19/190 (10%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W  P G  S I  R+ HPV+HVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG
Sbjct: 38  WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWG 97

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWW 280
           N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA   
Sbjct: 98  NWFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA--- 151

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                     +P        +V +G S++   +    +R R   R  NTPDS++ NLGFR
Sbjct: 152 ----------SPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 201

Query: 339 CAADKGPTTG 348
           CAAD G   G
Sbjct: 202 CAADAGRPPG 211


>gi|444725750|gb|ELW66304.1| Sulfatase-modifying factor 2 [Tupaia chinensis]
          Length = 285

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 118/332 (35%), Positives = 153/332 (46%), Gaps = 83/332 (25%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGTN P   +DGE P R VT+  F +D   V+N  F++FV    Y TEAE
Sbjct: 29  MVQLPGGRFLMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 87

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S E + K ++      +  G  S I  R+  PVVH+SWNDA  YC W+
Sbjct: 88  MFGWSFVFEDFVSAELKNKATE----QMKPAGPGSGIRERLELPVVHVSWNDARTYCAWQ 143

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+   P        +R N                W
Sbjct: 144 GKRLPTEEEWEFAARGGLQGQVYPWGNRFQP--------NRTN---------------LW 180

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G   P     E G  G      FP                    P NN           
Sbjct: 181 QG-EFPKGDRAEDGFHGLSPVNAFP--------------------PQNN----------- 208

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
                   +GLY++VGNVWEWTA            Y   G      +V +G S++   + 
Sbjct: 209 --------YGLYDLVGNVWEWTAS----------PYQAAGQDM---RVLRGASWIDTADG 247

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
              +R R   R  NTPDS++ NLGFRCA+  G
Sbjct: 248 SANHRARVTTRMGNTPDSASDNLGFRCASSAG 279


>gi|116251674|ref|YP_767512.1| hypothetical protein RL1910 [Rhizobium leguminosarum bv. viciae
           3841]
 gi|115256322|emb|CAK07403.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
           3841]
          Length = 347

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 123/344 (35%), Positives = 170/344 (49%), Gaps = 65/344 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
            + +PG TF +G++K       E P+  VT+D F++ +  V+N QF++F  ATGYVT AE
Sbjct: 41  FIWIPGRTFTVGSDKHY---PEEAPAHPVTVDGFWMAETPVTNRQFKQFAEATGYVTVAE 97

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K  D   +     E  RA  S V    K   G D           +   W          
Sbjct: 98  KAPDPRDYPGAKPEMLRAG-SLVFTQPKTVSGAD-----------ISQWWT--------- 136

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    +++G             +W  P G  S +  +++HPVVHV++ DA AY  W
Sbjct: 137 ---------FKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAYADAAAYAHW 175

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEAEWE   RGGL++  + WG+   P G   AN W G FP ++T   G+  T+P
Sbjct: 176 AGLDLPTEAEWELAARGGLDDAEYAWGDEFAPGGVPMANTWSGTFPIHSTKPKGHERTSP 235

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
           V S+  N FGLY+M+GNVWEWT+D+W+   HP P                  SY+P+ P 
Sbjct: 236 VRSFPPNGFGLYDMIGNVWEWTSDYWST-RHPEPAKHSCCIPSNPRGRDAEASYDPRQPE 294

Query: 297 TG-TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
                +V KGGS+LC   YC R+R AAR     D+S  ++GFRC
Sbjct: 295 IRIARRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 338


>gi|226363895|ref|YP_002781677.1| hypothetical protein ROP_44850 [Rhodococcus opacus B4]
 gi|226242384|dbj|BAH52732.1| hypothetical protein [Rhodococcus opacus B4]
          Length = 310

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 129/342 (37%), Positives = 162/342 (47%), Gaps = 71/342 (20%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +P  TF MG+++       E P+  V +DAF ++ H+V+N QF  FV  TGYVT AE+  
Sbjct: 12  IPAQTFTMGSDR---HYPEEGPAHRVAVDAFAIEAHQVTNAQFTTFVRDTGYVTVAERPL 68

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
           D   F P    E     S V      F      ++ R      H+S  W           
Sbjct: 69  DPADF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 105

Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
                   W  G             SW  P G  STI  R  HPVVHV+  DA AY  W 
Sbjct: 106 --------WTVGA------------SWQRPTGPASTIAGRERHPVVHVAHEDAAAYAQWA 145

Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
           G  LPTEA+WE   RGGL+   F WG+   P  E  AN W G+FP    A  GY +T  V
Sbjct: 146 GRSLPTEAQWEAAARGGLDQAEFTWGD--APASEPSANYWHGDFPWR--ADPGYGTTTAV 201

Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGP--TTGTD------- 300
            SY+ N +GL++M GNVWEWTADW+   H        AP  NP+GP  T   D       
Sbjct: 202 GSYRPNGYGLFDMAGNVWEWTADWYADRHRDTAESCCAPR-NPRGPGMTASFDPRQPQFR 260

Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
              +V KGGS+LC + YC R+R AAR     D+   ++GFRC
Sbjct: 261 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 302


>gi|384106318|ref|ZP_10007225.1| hypothetical protein W59_33398 [Rhodococcus imtechensis RKJ300]
 gi|383833654|gb|EID73104.1| hypothetical protein W59_33398 [Rhodococcus imtechensis RKJ300]
          Length = 350

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 134/386 (34%), Positives = 175/386 (45%), Gaps = 76/386 (19%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +P  TF MG+++       E P+  V +DAF ++ H+V+  Q+  FV  TGYVT AE+  
Sbjct: 13  IPAHTFTMGSDR---HYPEESPAHRVAIDAFAIEAHQVTTAQYAAFVRDTGYVTVAERPL 69

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
           D   F P    E     S V      F      ++ R      H+S  W           
Sbjct: 70  DPADF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 106

Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
                   W  G             SW HP G  STI  R  HPVVHV+  DA AY  W 
Sbjct: 107 --------WTVG------------ASWRHPTGPASTITGREQHPVVHVAHEDAAAYAQWA 146

Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
           G  LPTEA+WE   RGGL++  F WG+         AN W GEFP    A  GY +T  V
Sbjct: 147 GRSLPTEAQWEAAARGGLDHAEFTWGDAPESESPPLANYWHGEFPWR--ADPGYGTTTAV 204

Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------- 300
            SY  N++GL++M GNVWEWTADW++  H        AP  NP+GP              
Sbjct: 205 GSYPPNEYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPDMSESFDPRQPQFR 263

Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGS 357
              +V KGGS+LC + YC R+R AAR     D+   ++GFRC      T   D+++   +
Sbjct: 264 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRCV-----TAYPDQIRTTTA 318

Query: 358 YLCNEQYCYRHRCAAR--SQNTPDSS 381
              +         AA     N+PDSS
Sbjct: 319 SSASTVTTLDVTTAANVAGANSPDSS 344


>gi|332265468|ref|XP_003281742.1| PREDICTED: sulfatase-modifying factor 2 isoform 3 [Nomascus
           leucogenys]
          Length = 213

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 85/190 (44%), Positives = 114/190 (60%), Gaps = 19/190 (10%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W  P G  S I  R+ HPV+HVSWNDA AYC WRG RLP E EWE+  RGGL+ +++PWG
Sbjct: 38  WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPMEEEWEFAARGGLKGQVYPWG 97

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWW 280
           N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA   
Sbjct: 98  NRFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA--- 151

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                     +P        +V +G S++   +    +R R   R  NTPDS++ NLGFR
Sbjct: 152 ----------SPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 201

Query: 339 CAADKGPTTG 348
           CAAD G   G
Sbjct: 202 CAADAGRPPG 211



 Score = 41.2 bits (95), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 162 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 206


>gi|397698240|ref|YP_006536123.1| hypothetical protein T1E_5508 [Pseudomonas putida DOT-T1E]
 gi|397334970|gb|AFO51329.1| hypothetical protein T1E_5508 [Pseudomonas putida DOT-T1E]
          Length = 218

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 84/185 (45%), Positives = 106/185 (57%), Gaps = 1/185 (0%)

Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 215
           L P   W HP G  S ++    HPVVHV   DA+AY  W G  LP E EWEY   GG + 
Sbjct: 31  LVPGADWRHPLGPGSDLQGMAQHPVVHVGLEDALAYAQWAGCDLPGEPEWEYAAWGGTQQ 90

Query: 216 RLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
             F WG++L P G+H AN WQG FP  N   DGY  T+PV ++  N +GLY+M+GNVWEW
Sbjct: 91  GEFVWGHHLVPEGKHMANTWQGNFPAENLQTDGYSRTSPVGAFPANGYGLYDMIGNVWEW 150

Query: 276 TADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
           T   +   H  P  S      +    KV KGGS+LC   YC R+R  ARS    D+++ +
Sbjct: 151 TTTVFQPSHASPRKSCCANARSAAQMKVLKGGSHLCAPNYCQRYRPPARSPQATDTTSSH 210

Query: 335 LGFRC 339
           +GFRC
Sbjct: 211 IGFRC 215


>gi|397734685|ref|ZP_10501389.1| hypothetical protein JVH1_5881 [Rhodococcus sp. JVH1]
 gi|396929473|gb|EJI96678.1| hypothetical protein JVH1_5881 [Rhodococcus sp. JVH1]
          Length = 312

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 125/342 (36%), Positives = 158/342 (46%), Gaps = 69/342 (20%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +P  TF MG+++       E P+  V +DAF ++ H+V+  QF  FV  TGYVT AE+  
Sbjct: 12  IPAQTFTMGSDR---HYPEEGPAHRVAVDAFAIETHQVTTAQFARFVGDTGYVTVAERPL 68

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
           D   F P    E     S V      F      ++ R      H+S  W           
Sbjct: 69  DPAEF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 105

Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
                   W  G             SW HP G  STI  R  HPVVHV+  DA AY  W 
Sbjct: 106 --------WTVGA------------SWRHPTGPGSTITGRETHPVVHVAHEDAAAYAEWA 145

Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
           G  LPTEA+WE   RGGL++  F WG+         AN W GEFP       GY +T  V
Sbjct: 146 GRSLPTEAQWEAAARGGLDHTEFTWGDTAESESPPLANYWHGEFPWRPDP--GYGTTTAV 203

Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------- 300
            SY  N +GL++M GNVWEWTADW++  H        AP  NP+GP              
Sbjct: 204 GSYPPNDYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPGMSESFDARQPQFR 262

Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
              +V KGGS+LC + YC R+R AAR     D+   ++GFRC
Sbjct: 263 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 304


>gi|335037887|ref|ZP_08531187.1| hypothetical protein AGRO_5199 [Agrobacterium sp. ATCC 31749]
 gi|333790728|gb|EGL62125.1| hypothetical protein AGRO_5199 [Agrobacterium sp. ATCC 31749]
          Length = 326

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/183 (49%), Positives = 113/183 (61%), Gaps = 2/183 (1%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W  PEG  S +  R  HPV HVSWNDA A+ TW G RLP+E EWE   RGG  +  +PWG
Sbjct: 124 WDVPEGPGSGLSGRGRHPVTHVSWNDAQAFATWAGGRLPSEIEWETAARGGDADAQYPWG 183

Query: 222 NNLTPRGEHR-ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
           N      +    N+WQGEFP +NTA DG+ STAPV  +  N FGL+NMVGNVWEWTA+ +
Sbjct: 184 NEEPDDLDFNPCNIWQGEFPNSNTAVDGFASTAPVDQFAANAFGLHNMVGNVWEWTAEPF 243

Query: 281 NVHH-HPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            VH             +   DK+ KGGS+LC++ YC+R+R AARS    D+S  ++GFR 
Sbjct: 244 KVHSLRREAKIRNAAASREKDKLMKGGSFLCHKSYCWRYRIAARSSAAADTSIAHVGFRV 303

Query: 340 AAD 342
             D
Sbjct: 304 VFD 306



 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 99/177 (55%), Gaps = 10/177 (5%)

Query: 2   VLLPAPPVERYKDM----VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSN 57
           V LP P   R + M    V +       GTN+P+  +DGE   R   + A+Y+D   V+N
Sbjct: 16  VALPVPHWTRDRMMPERFVRIEAQRTFSGTNRPVFAQDGEAIVRQAQIQAYYIDPFAVTN 75

Query: 58  TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ--VRHDMKRF---EGLDSTIE 112
             F++FV  TG+VT+AE++G +FVF  +     RA ++    R D  R+   EG  S + 
Sbjct: 76  DWFRDFVEETGHVTDAERYGWSFVFLGVGKAHIRAAVAPWWGRIDGARWDVPEGPGSGLS 135

Query: 113 HRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGID 169
            R  HPV H+SWNDA A+ TW G RLP+E EWE   RGG  +  +PWG+   P+ +D
Sbjct: 136 GRGRHPVTHVSWNDAQAFATWAGGRLPSEIEWETAARGGDADAQYPWGN-EEPDDLD 191


>gi|452955888|gb|EME61283.1| hypothetical protein G352_19031 [Rhodococcus ruber BKS 20-38]
          Length = 301

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 92/206 (44%), Positives = 117/206 (56%), Gaps = 20/206 (9%)

Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 215
             P  +W HPEG DS +  R  HPV  VSW DA AY  W G  LPTEAEWE+  RGGL+ 
Sbjct: 88  FVPGANWRHPEGPDSNVGGRERHPVTQVSWFDARAYAQWAGKELPTEAEWEFAARGGLDR 147

Query: 216 RLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
           + F WG    P G    NVWQGEFP  N   DG+  T+PV  ++ N + L +M GNVWEW
Sbjct: 148 KPFVWGEEHEPGGRPGGNVWQGEFPWQNLLEDGFAGTSPVGRFRPNGYHLSDMAGNVWEW 207

Query: 276 TADWWNVHH-----HPAPSY------NPKGPT-TGTD--------KVKKGGSYLCNEQYC 315
           TAD++   H     + AP++      NP+  T TG          +V KGGS+LC   YC
Sbjct: 208 TADYFTGDHSTRGKNMAPTHSCCIPANPRAETATGIQYPHEPYPRRVIKGGSHLCAPNYC 267

Query: 316 YRHRCAARSQNTPDSSAGNLGFRCAA 341
            R+R AAR   T ++S  ++GFRC +
Sbjct: 268 LRYRPAARQGETEETSTCHIGFRCIS 293


>gi|111021544|ref|YP_704516.1| hypothetical protein RHA1_ro04572 [Rhodococcus jostii RHA1]
 gi|110821074|gb|ABG96358.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
          Length = 312

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 125/342 (36%), Positives = 158/342 (46%), Gaps = 69/342 (20%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +P  TF MG+++       E P+  V +DAF ++ H+V+  QF  FV  TGYVT AE+  
Sbjct: 12  IPAQTFTMGSDR---HYPEEGPAHRVAVDAFAIETHQVTTAQFARFVGDTGYVTVAERPL 68

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
           D   F P    E     S V      F      ++ R      H+S  W           
Sbjct: 69  DPAEF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 105

Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
                   W  G             SW HP G  STI  R  HPVVHV+  DA AY  W 
Sbjct: 106 --------WTVGA------------SWRHPTGPGSTITGRETHPVVHVAHEDAAAYAEWA 145

Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
           G  LPTEA+WE   RGGL++  F WG+         AN W GEFP       GY +T  V
Sbjct: 146 GRSLPTEAQWEAAARGGLDHTEFTWGDAAESESPPLANYWHGEFPWRPDP--GYGTTTAV 203

Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------- 300
            SY  N +GL++M GNVWEWTADW++  H        AP  NP+GP              
Sbjct: 204 GSYPPNDYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPGMSESFDARQPQFR 262

Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
              +V KGGS+LC + YC R+R AAR     D+   ++GFRC
Sbjct: 263 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 304


>gi|419961808|ref|ZP_14477811.1| hypothetical protein WSS_A06824 [Rhodococcus opacus M213]
 gi|414572832|gb|EKT83522.1| hypothetical protein WSS_A06824 [Rhodococcus opacus M213]
          Length = 319

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 124/340 (36%), Positives = 160/340 (47%), Gaps = 65/340 (19%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +P  TF MG+++       E P+  V +DAF ++ H+V+  Q+  FV  TGYVT AE+  
Sbjct: 13  IPAHTFTMGSDR---HYPEESPAHRVAIDAFAIEAHQVTTAQYAAFVRDTGYVTVAERPL 69

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
           D+  F P    E     S V      F      ++ R      H+S      + TW    
Sbjct: 70  DSADF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLS-----QWWTWTVG- 110

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
                                  SW HP G  STI  R  HPVVHV+  DA AY  W G 
Sbjct: 111 ----------------------ASWRHPTGPASTITGREQHPVVHVAHEDAAAYAQWAGR 148

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
            LPTEA+WE   RGGL+   F WG+         AN W GEFP    A  GY +T  V S
Sbjct: 149 SLPTEAQWEAAARGGLDYAEFTWGDAPESESPPLANYWHGEFPWR--ADPGYGTTTAVGS 206

Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD----------- 300
           Y  N++GL++M GNVWEWTADW++  H        AP  NP+GP                
Sbjct: 207 YPPNEYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPGMSESFDPRQPQFRVP 265

Query: 301 -KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            +V KGGS+LC + YC R+R AAR     D+   ++GFRC
Sbjct: 266 RRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 305


>gi|432342940|ref|ZP_19592161.1| hypothetical protein Rwratislav_37752 [Rhodococcus wratislaviensis
           IFP 2016]
 gi|430772042|gb|ELB87849.1| hypothetical protein Rwratislav_37752 [Rhodococcus wratislaviensis
           IFP 2016]
          Length = 319

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 125/342 (36%), Positives = 160/342 (46%), Gaps = 69/342 (20%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +P  TF MG+++       E P+  V +DAF ++ H+V+  Q+  FV  TGYVT AE+  
Sbjct: 13  IPAHTFTMGSDR---HYPEESPAHRVAIDAFAIEAHQVTTAQYAAFVRDTGYVTVAERPL 69

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
           D   F P    E     S V      F      ++ R      H+S  W           
Sbjct: 70  DPADF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 106

Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
                   W  G             SW HP G  STI  R  HPVVHV+  DA AY  W 
Sbjct: 107 --------WTVGA------------SWRHPTGPASTITGREQHPVVHVAHEDAAAYAQWA 146

Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
           G  LPTEA+WE   RGGL++  F WG+         AN W GEFP    A  GY +T  V
Sbjct: 147 GRSLPTEAQWEAAARGGLDHAEFTWGDAPESESPPLANYWHGEFPWR--ADPGYGTTTAV 204

Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------- 300
            SY  N++GL++M GNVWEWTADW++  H        AP  NP+GP              
Sbjct: 205 GSYPPNEYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPGMSESFDPCQPQFR 263

Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
              +V KGGS+LC + YC R+R AAR     D+   ++GFRC
Sbjct: 264 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 305


>gi|315646548|ref|ZP_07899666.1| Non-specific serine/threonine protein kinase [Paenibacillus vortex
           V453]
 gi|315278191|gb|EFU41511.1| Non-specific serine/threonine protein kinase [Paenibacillus vortex
           V453]
          Length = 170

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 93/220 (42%), Positives = 114/220 (51%), Gaps = 51/220 (23%)

Query: 25  MGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE 83
           MGT+ K    +DGE P+R V++  F +    V+N +FQ FV  TGYVT+AEK+G ++VFE
Sbjct: 1   MGTDTKEGFPRDGEGPARGVSVSGFEISPFTVTNAEFQRFVETTGYVTDAEKYGWSYVFE 60

Query: 84  PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAE 143
            L SEE +A + QV                                         P    
Sbjct: 61  LLASEETKASVVQV-----------------------------------------PQGVP 79

Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
           W     G           W  PEG DS+IE RM+HPVVHVSWNDA  YC W G RLPTEA
Sbjct: 80  WWLVVEGAY---------WAAPEGKDSSIEGRMDHPVVHVSWNDADVYCQWAGVRLPTEA 130

Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
           EWEY  RGGLE R +PWG+ L   GEH+ N+WQG+FP   
Sbjct: 131 EWEYAARGGLEGRTYPWGDLLKQDGEHQCNIWQGKFPVQK 170


>gi|148995069|ref|ZP_01823997.1| hypothetical protein CGSSp9BS68_00262 [Streptococcus pneumoniae
           SP9-BS68]
 gi|168488488|ref|ZP_02712687.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae SP195]
 gi|417679760|ref|ZP_12329156.1| hypothetical protein SPAR50_1797 [Streptococcus pneumoniae GA17570]
 gi|418126447|ref|ZP_12763352.1| hypothetical protein SPAR86_1818 [Streptococcus pneumoniae GA44511]
 gi|418192233|ref|ZP_12828735.1| hypothetical protein SPAR96_1767 [Streptococcus pneumoniae GA47388]
 gi|418234994|ref|ZP_12861570.1| hypothetical protein SPAR17_1778 [Streptococcus pneumoniae GA08780]
 gi|419508889|ref|ZP_14048540.1| hypothetical protein SPAR114_1782 [Streptococcus pneumoniae
           GA49542]
 gi|421220875|ref|ZP_15677714.1| hypothetical protein AMCSP05_001612 [Streptococcus pneumoniae
           2070425]
 gi|421223130|ref|ZP_15679912.1| hypothetical protein AMCSP14_001555 [Streptococcus pneumoniae
           2070531]
 gi|421279570|ref|ZP_15730376.1| hypothetical protein SPAR44_1768 [Streptococcus pneumoniae GA17301]
 gi|421294634|ref|ZP_15745355.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA56113]
 gi|147926881|gb|EDK77931.1| hypothetical protein CGSSp9BS68_00262 [Streptococcus pneumoniae
           SP9-BS68]
 gi|183572735|gb|EDT93263.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae SP195]
 gi|332072625|gb|EGI83108.1| hypothetical protein SPAR50_1797 [Streptococcus pneumoniae GA17570]
 gi|353795511|gb|EHD75859.1| hypothetical protein SPAR86_1818 [Streptococcus pneumoniae GA44511]
 gi|353855319|gb|EHE35289.1| hypothetical protein SPAR96_1767 [Streptococcus pneumoniae GA47388]
 gi|353886616|gb|EHE66398.1| hypothetical protein SPAR17_1778 [Streptococcus pneumoniae GA08780]
 gi|379610843|gb|EHZ75573.1| hypothetical protein SPAR114_1782 [Streptococcus pneumoniae
           GA49542]
 gi|395586785|gb|EJG47152.1| hypothetical protein AMCSP05_001612 [Streptococcus pneumoniae
           2070425]
 gi|395587110|gb|EJG47472.1| hypothetical protein AMCSP14_001555 [Streptococcus pneumoniae
           2070531]
 gi|395879063|gb|EJG90125.1| hypothetical protein SPAR44_1768 [Streptococcus pneumoniae GA17301]
 gi|395893203|gb|EJH04190.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA56113]
          Length = 282

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/191 (44%), Positives = 113/191 (59%), Gaps = 10/191 (5%)

Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
           P   W HP G +S +    +HPVVHV+  DA+A+C W G  LPTEA+WEY  RGG ++  
Sbjct: 96  PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154

Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
           +PWG+ L   G + AN WQG FP  NTA DG++ TAPV  +  N FGLY M+GNVWEW  
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214

Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
              N  +    S+N      PK      +   +GGS+LC+  YC R+R AAR+     S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271

Query: 332 AGNLGFRCAAD 342
           + +LGFRC  +
Sbjct: 272 SSHLGFRCLKE 282


>gi|149013585|ref|ZP_01834174.1| hypothetical protein CGSSp19BS75_12168 [Streptococcus pneumoniae
           SP19-BS75]
 gi|169833111|ref|YP_001695178.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae Hungary19A-6]
 gi|225859569|ref|YP_002741079.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae 70585]
 gi|410477158|ref|YP_006743917.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae gamPNI0373]
 gi|417687268|ref|ZP_12336542.1| hypothetical protein SPAR68_1832 [Streptococcus pneumoniae GA41301]
 gi|417696957|ref|ZP_12346135.1| hypothetical protein SPAR93_1833 [Streptococcus pneumoniae GA47368]
 gi|418087480|ref|ZP_12724649.1| hypothetical protein SPAR87_1266 [Streptococcus pneumoniae GA47033]
 gi|418092398|ref|ZP_12729538.1| hypothetical protein SPAR84_1773 [Streptococcus pneumoniae GA44452]
 gi|418108230|ref|ZP_12745267.1| hypothetical protein SPAR70_1778 [Streptococcus pneumoniae GA41410]
 gi|418110763|ref|ZP_12747782.1| hypothetical protein SPAR113_1837 [Streptococcus pneumoniae
           GA49447]
 gi|418130957|ref|ZP_12767840.1| hypothetical protein SPAR14_1759 [Streptococcus pneumoniae GA07643]
 gi|418169893|ref|ZP_12806534.1| hypothetical protein SPAR56_1989 [Streptococcus pneumoniae GA19077]
 gi|418187788|ref|ZP_12824311.1| hypothetical protein SPAR92_1763 [Streptococcus pneumoniae GA47360]
 gi|418190031|ref|ZP_12826543.1| hypothetical protein SPAR94_1796 [Streptococcus pneumoniae GA47373]
 gi|418200866|ref|ZP_12837308.1| hypothetical protein SPAR109_1776 [Streptococcus pneumoniae
           GA47976]
 gi|418203023|ref|ZP_12839450.1| hypothetical protein SPAR115_1743 [Streptococcus pneumoniae
           GA52306]
 gi|418230585|ref|ZP_12857184.1| hypothetical protein SPAR136_1835 [Streptococcus pneumoniae
           EU-NP01]
 gi|419423445|ref|ZP_13963658.1| hypothetical protein SPAR76_1818 [Streptococcus pneumoniae GA43264]
 gi|419456157|ref|ZP_13996113.1| hypothetical protein SPAR139_1854 [Streptococcus pneumoniae
           EU-NP04]
 gi|419460599|ref|ZP_14000527.1| hypothetical protein SPAR2_1776 [Streptococcus pneumoniae GA02270]
 gi|419478446|ref|ZP_14018269.1| hypothetical protein SPAR53_1781 [Streptococcus pneumoniae GA18068]
 gi|419489505|ref|ZP_14029254.1| hypothetical protein SPAR83_1782 [Streptococcus pneumoniae GA44386]
 gi|419521748|ref|ZP_14061343.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA05245]
 gi|419524247|ref|ZP_14063820.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA13723]
 gi|421207226|ref|ZP_15664277.1| hypothetical protein AMCSP20_001794 [Streptococcus pneumoniae
           2090008]
 gi|421227986|ref|ZP_15684688.1| hypothetical protein AMCSP08_001806 [Streptococcus pneumoniae
           2072047]
 gi|421230402|ref|ZP_15687065.1| hypothetical protein AMCSP01_001773 [Streptococcus pneumoniae
           2061376]
 gi|421271227|ref|ZP_15722080.1| hypothetical protein SPAR48_1783 [Streptococcus pneumoniae SPAR48]
 gi|421273449|ref|ZP_15724289.1| hypothetical protein SPAR55_1748 [Streptococcus pneumoniae SPAR55]
 gi|421286076|ref|ZP_15736851.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA60190]
 gi|421292663|ref|ZP_15743397.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA56348]
 gi|421312602|ref|ZP_15763204.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA58981]
 gi|444387636|ref|ZP_21185655.1| hypothetical protein PCS125219_01058 [Streptococcus pneumoniae
           PCS125219]
 gi|444389110|ref|ZP_21187027.1| hypothetical protein PCS70012_00124 [Streptococcus pneumoniae
           PCS70012]
 gi|444392650|ref|ZP_21190352.1| hypothetical protein PCS81218_01160 [Streptococcus pneumoniae
           PCS81218]
 gi|444394787|ref|ZP_21192337.1| hypothetical protein PNI0002_00795 [Streptococcus pneumoniae
           PNI0002]
 gi|444398242|ref|ZP_21195725.1| hypothetical protein PNI0006_01827 [Streptococcus pneumoniae
           PNI0006]
 gi|444401187|ref|ZP_21198473.1| hypothetical protein PNI0007_02299 [Streptococcus pneumoniae
           PNI0007]
 gi|444402213|ref|ZP_21199384.1| hypothetical protein PNI0008_00830 [Streptococcus pneumoniae
           PNI0008]
 gi|444406043|ref|ZP_21202867.1| hypothetical protein PNI0009_01995 [Streptococcus pneumoniae
           PNI0009]
 gi|444407861|ref|ZP_21204528.1| hypothetical protein PNI0010_01284 [Streptococcus pneumoniae
           PNI0010]
 gi|444417203|ref|ZP_21213256.1| hypothetical protein PNI0360_00642 [Streptococcus pneumoniae
           PNI0360]
 gi|444419195|ref|ZP_21215074.1| hypothetical protein PNI0427_00093 [Streptococcus pneumoniae
           PNI0427]
 gi|147762817|gb|EDK69771.1| hypothetical protein CGSSp19BS75_12168 [Streptococcus pneumoniae
           SP19-BS75]
 gi|168995613|gb|ACA36225.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae Hungary19A-6]
 gi|225720247|gb|ACO16101.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae 70585]
 gi|332074158|gb|EGI84636.1| hypothetical protein SPAR68_1832 [Streptococcus pneumoniae GA41301]
 gi|332200355|gb|EGJ14428.1| hypothetical protein SPAR93_1833 [Streptococcus pneumoniae GA47368]
 gi|353758496|gb|EHD39088.1| hypothetical protein SPAR87_1266 [Streptococcus pneumoniae GA47033]
 gi|353763098|gb|EHD43655.1| hypothetical protein SPAR84_1773 [Streptococcus pneumoniae GA44452]
 gi|353778507|gb|EHD58975.1| hypothetical protein SPAR70_1778 [Streptococcus pneumoniae GA41410]
 gi|353781384|gb|EHD61829.1| hypothetical protein SPAR113_1837 [Streptococcus pneumoniae
           GA49447]
 gi|353802281|gb|EHD82581.1| hypothetical protein SPAR14_1759 [Streptococcus pneumoniae GA07643]
 gi|353833872|gb|EHE13980.1| hypothetical protein SPAR56_1989 [Streptococcus pneumoniae GA19077]
 gi|353849773|gb|EHE29778.1| hypothetical protein SPAR92_1763 [Streptococcus pneumoniae GA47360]
 gi|353853758|gb|EHE33739.1| hypothetical protein SPAR94_1796 [Streptococcus pneumoniae GA47373]
 gi|353863814|gb|EHE43734.1| hypothetical protein SPAR109_1776 [Streptococcus pneumoniae
           GA47976]
 gi|353866514|gb|EHE46414.1| hypothetical protein SPAR115_1743 [Streptococcus pneumoniae
           GA52306]
 gi|353885466|gb|EHE65255.1| hypothetical protein SPAR136_1835 [Streptococcus pneumoniae
           EU-NP01]
 gi|379530735|gb|EHY95974.1| hypothetical protein SPAR2_1776 [Streptococcus pneumoniae GA02270]
 gi|379539048|gb|EHZ04228.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA05245]
 gi|379555960|gb|EHZ21021.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA13723]
 gi|379564958|gb|EHZ29953.1| hypothetical protein SPAR53_1781 [Streptococcus pneumoniae GA18068]
 gi|379586017|gb|EHZ50871.1| hypothetical protein SPAR76_1818 [Streptococcus pneumoniae GA43264]
 gi|379587047|gb|EHZ51897.1| hypothetical protein SPAR83_1782 [Streptococcus pneumoniae GA44386]
 gi|379627597|gb|EHZ92208.1| hypothetical protein SPAR139_1854 [Streptococcus pneumoniae
           EU-NP04]
 gi|395573948|gb|EJG34533.1| hypothetical protein AMCSP20_001794 [Streptococcus pneumoniae
           2090008]
 gi|395593414|gb|EJG53662.1| hypothetical protein AMCSP01_001773 [Streptococcus pneumoniae
           2061376]
 gi|395594270|gb|EJG54510.1| hypothetical protein AMCSP08_001806 [Streptococcus pneumoniae
           2072047]
 gi|395866423|gb|EJG77552.1| hypothetical protein SPAR48_1783 [Streptococcus pneumoniae SPAR48]
 gi|395873880|gb|EJG84970.1| hypothetical protein SPAR55_1748 [Streptococcus pneumoniae SPAR55]
 gi|395885384|gb|EJG96408.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA60190]
 gi|395891970|gb|EJH02964.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA56348]
 gi|395909456|gb|EJH20332.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA58981]
 gi|406370103|gb|AFS43793.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae gamPNI0373]
 gi|444252514|gb|ELU58977.1| hypothetical protein PCS125219_01058 [Streptococcus pneumoniae
           PCS125219]
 gi|444257710|gb|ELU64043.1| hypothetical protein PCS70012_00124 [Streptococcus pneumoniae
           PCS70012]
 gi|444259405|gb|ELU65719.1| hypothetical protein PNI0002_00795 [Streptococcus pneumoniae
           PNI0002]
 gi|444260899|gb|ELU67207.1| hypothetical protein PNI0006_01827 [Streptococcus pneumoniae
           PNI0006]
 gi|444263040|gb|ELU69275.1| hypothetical protein PCS81218_01160 [Streptococcus pneumoniae
           PCS81218]
 gi|444263733|gb|ELU69880.1| hypothetical protein PNI0007_02299 [Streptococcus pneumoniae
           PNI0007]
 gi|444266835|gb|ELU72766.1| hypothetical protein PNI0008_00830 [Streptococcus pneumoniae
           PNI0008]
 gi|444270417|gb|ELU76190.1| hypothetical protein PNI0009_01995 [Streptococcus pneumoniae
           PNI0009]
 gi|444271457|gb|ELU77208.1| hypothetical protein PNI0010_01284 [Streptococcus pneumoniae
           PNI0010]
 gi|444284369|gb|ELU89517.1| hypothetical protein PNI0360_00642 [Streptococcus pneumoniae
           PNI0360]
 gi|444287356|gb|ELU92287.1| hypothetical protein PNI0427_00093 [Streptococcus pneumoniae
           PNI0427]
          Length = 282

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/191 (44%), Positives = 113/191 (59%), Gaps = 10/191 (5%)

Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
           P   W HP G +S +    +HPVVHV+  DA+A+C W G  LPTEA+WEY  RGG ++  
Sbjct: 96  PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154

Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
           +PWG+ L   G + AN WQG FP  NTA DG++ TAPV  +  N FGLY M+GNVWEW  
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214

Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
              N  +    S+N      PK      +   +GGS+LC+  YC R+R AAR+     S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271

Query: 332 AGNLGFRCAAD 342
           + +LGFRC  +
Sbjct: 272 SSHLGFRCLKE 282


>gi|187919188|ref|YP_001888219.1| hypothetical protein Bphyt_4473 [Burkholderia phytofirmans PsJN]
 gi|187717626|gb|ACD18849.1| protein of unknown function DUF323 [Burkholderia phytofirmans PsJN]
          Length = 361

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 115/336 (34%), Positives = 152/336 (45%), Gaps = 75/336 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G  F  G+    L  + E P+ +    +  F++DQ +V+N QF  FV ATGYV++
Sbjct: 84  MVHLHGGDFVFGSK---LGYEDERPAGDGKTRVAGFWIDQTDVTNAQFAAFVKATGYVSD 140

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           +E+ G   VF     +E  A+                                  +A+ T
Sbjct: 141 SERQGGAVVFHTPTRDEMNAR---------------------------------DLAWWT 167

Query: 133 W-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
           W +GA                        +W HP G  ST+E R+N PV  V+  DA+AY
Sbjct: 168 WVKGA------------------------AWNHPTGPGSTLEGRLNQPVTMVTQADALAY 203

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAADG 248
             W G  LPTEAEWEY  + G E        +  PR   G+  AN WQG FP  NT  DG
Sbjct: 204 ARWLGRDLPTEAEWEYAGKAGREGADL----DTAPRDGHGKPSANYWQGVFPVLNTNEDG 259

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKVK 303
           ++  APV  Y  N F LY+M+GN WEWT D +   H            P      T  V 
Sbjct: 260 HIGLAPVGCYAANDFKLYDMIGNAWEWTKDVYTGPHQSHTNGDTAEVAPLSRKHDTPMVI 319

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           KGGS+LC+  YC R+R A+R Q   D  A ++GFR 
Sbjct: 320 KGGSFLCSRDYCVRYRAASREQQEADLPASHIGFRT 355


>gi|418167413|ref|ZP_12804067.1| hypothetical protein SPAR52_1838 [Streptococcus pneumoniae GA17971]
 gi|353828579|gb|EHE08717.1| hypothetical protein SPAR52_1838 [Streptococcus pneumoniae GA17971]
          Length = 282

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/191 (44%), Positives = 113/191 (59%), Gaps = 10/191 (5%)

Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
           P   W HP G +S +    +HPVVHV+  DA+A+C W G  LPTEA+WEY  RGG ++  
Sbjct: 96  PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154

Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
           +PWG+ L   G + AN WQG FP  NTA DG++ TAPV  +  N FGLY M+GNVWEW  
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214

Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
              N  +    S+N      PK      +   +GGS+LC+  YC R+R AAR+     S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271

Query: 332 AGNLGFRCAAD 342
           + +LGFRC  +
Sbjct: 272 SSHLGFRCLKE 282


>gi|440913314|gb|ELR62778.1| Sulfatase-modifying factor 2, partial [Bos grunniens mutus]
          Length = 308

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 118/340 (34%), Positives = 155/340 (45%), Gaps = 75/340 (22%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV LPG  F+MGT+ P   +DGE P R VT+  F +D   V+N  F+EFV    Y TEA
Sbjct: 37  NMVQLPGGRFQMGTDSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 95

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE L+S+E R K +Q                 RM      + W        W
Sbjct: 96  EVFGWSFVFEDLVSDELRNKATQ-----------------RMQS----LLW--------W 126

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  ++  PVVHVSWNDA AYC 
Sbjct: 127 ----LPVE-------------RAF----WRQPAGPGSGIREKLEFPVVHVSWNDARAYCA 165

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT---AADGYL 250
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG             G +
Sbjct: 166 WRGKRLPTEEEWEFAARGGLKGQVYPWGNKFQP---NRTNLWQGRGRCKGALVPVGRGGV 222

Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL- 309
            +   + +     G   +                  A S   +       +V +G S++ 
Sbjct: 223 QSGETVPFLRRHVGFMQL---------------REVAESGESRAADQDM-RVLRGASWID 266

Query: 310 -CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
             +    +R R   R  NTPDS++ NLGFRCA+  G   G
Sbjct: 267 TADGSANHRARVTTRMGNTPDSASDNLGFRCASGAGRPPG 306



 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 5/65 (7%)

Query: 332 AGNLGFRCAADKGPTTGTD---KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLG 386
            G +  R  A+ G +   D   +V +G S++   +    +R R   R  NTPDS++ NLG
Sbjct: 235 VGFMQLREVAESGESRAADQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLG 294

Query: 387 FRCAA 391
           FRCA+
Sbjct: 295 FRCAS 299


>gi|402819323|ref|ZP_10868891.1| hypothetical protein IMCC14465_01250 [alpha proteobacterium
           IMCC14465]
 gi|402511470|gb|EJW21731.1| hypothetical protein IMCC14465_01250 [alpha proteobacterium
           IMCC14465]
          Length = 338

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 120/337 (35%), Positives = 161/337 (47%), Gaps = 62/337 (18%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNV----TLDAFYLDQHEVSNTQFQEFVSATGY 69
           +MV + G  F +G +         +P   V    TL  FY+D++EV+N QF EFV+ATGY
Sbjct: 45  EMVFIKGGKFIIGDSS-------GYPEERVRQQHTLADFYIDKYEVTNAQFAEFVTATGY 97

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT AE+ G +    P L +      S V     +F  L+                     
Sbjct: 98  VTLAER-GLSKEAYPDLDDALLKPGSAVFSPPDKFYTLN--------------------- 135

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
           +  W          W++           P  +W HPEG  S+IE R ++PVVH+++ DA 
Sbjct: 136 FMNW----------WQF----------VPGANWRHPEGPTSSIETRPHYPVVHIAYADAQ 175

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AY  W+G  LP+EAEWEY  + G + R + WG+   P   + AN WQG FP   T  DGY
Sbjct: 176 AYADWKGHDLPSEAEWEYAAKAGSDAR-YAWGDEAYPDDIYMANTWQGRFPMLGTGEDGY 234

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH----HPAPSYNPKGPTTGTDK---- 301
              APV  + EN  GL +M+GNVWEWT D +        H  P    +  TT  +     
Sbjct: 235 HLIAPVGCFPENNNGLADMIGNVWEWTRDSYKPERETITHSGPDGKLRKNTTQQNMDEVY 294

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           V KGGSYLC + YC R+R AAR          +LGFR
Sbjct: 295 VIKGGSYLCAQNYCLRYRPAARQPQETGLGTNHLGFR 331


>gi|194248090|ref|NP_001123541.1| sulfatase-modifying factor 2 isoform e precursor [Homo sapiens]
 gi|119628380|gb|EAX07975.1| sulfatase modifying factor 2, isoform CRA_a [Homo sapiens]
          Length = 358

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 98/252 (38%), Positives = 128/252 (50%), Gaps = 55/252 (21%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
            MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEA
Sbjct: 48  SMVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEA 106

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE  +S+E R K +Q                     P+  + W        W
Sbjct: 107 EMFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W 137

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC 
Sbjct: 138 ----LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCA 176

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 177 WRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVS 233

Query: 254 PVMSY-KENKFG 264
           PV ++  +N +G
Sbjct: 234 PVNAFPAQNNYG 245


>gi|225855238|ref|YP_002736750.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae JJA]
 gi|225723961|gb|ACO19814.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae JJA]
          Length = 282

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 84/191 (43%), Positives = 112/191 (58%), Gaps = 10/191 (5%)

Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
           P   W HP G +S +    +HPVVHV+  DA+A+C W G  LPTEA+WEY  RGG ++  
Sbjct: 96  PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154

Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
           +PWG+ L   G + AN WQG FP  NT  DG++ TAPV  +  N FGLY M+GNVWEW  
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTGLDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214

Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
              N  +    S+N      PK      +   +GGS+LC+  YC R+R AAR+     S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271

Query: 332 AGNLGFRCAAD 342
           + +LGFRC  +
Sbjct: 272 SSHLGFRCLKE 282


>gi|377834088|ref|XP_003689432.1| PREDICTED: sulfatase-modifying factor 2 isoform 2 [Mus musculus]
          Length = 190

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 84/186 (45%), Positives = 114/186 (61%), Gaps = 19/186 (10%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W  P G  S I  ++  PVVHVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG
Sbjct: 15  WRQPAGPGSGIREKLELPVVHVSWNDAGAYCAWRGRRLPTEEEWEFAARGGLKGQVYPWG 74

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWW 280
           N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA   
Sbjct: 75  NRFQP---NRTNLWQGKFPKGDKAEDGFHGLSPVNAFPPQNNYGLYDLMGNVWEWTAS-- 129

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                   +Y P G      +V +G S++   +    +R R   R  NTPDS++ NLGFR
Sbjct: 130 --------TYQPAGQDM---RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 178

Query: 339 CAADKG 344
           CA+  G
Sbjct: 179 CASSAG 184


>gi|40226435|gb|AAH15600.2| SUMF2 protein, partial [Homo sapiens]
          Length = 334

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 98/252 (38%), Positives = 128/252 (50%), Gaps = 55/252 (21%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
            MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEA
Sbjct: 24  SMVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEA 82

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE  +S+E R K +Q                     P+  + W        W
Sbjct: 83  EMFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W 113

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC 
Sbjct: 114 ----LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCA 152

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 153 WRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVS 209

Query: 254 PVMSY-KENKFG 264
           PV ++  +N +G
Sbjct: 210 PVNAFPAQNNYG 221


>gi|114613483|ref|XP_001160140.1| PREDICTED: sulfatase-modifying factor 2 isoform 9 [Pan troglodytes]
 gi|410302666|gb|JAA29933.1| sulfatase modifying factor 2 [Pan troglodytes]
          Length = 358

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 100/252 (39%), Positives = 129/252 (51%), Gaps = 55/252 (21%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
            MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEA
Sbjct: 48  SMVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEA 106

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E FG +FVFE  +S+E R K +Q                     P+  + W        W
Sbjct: 107 EIFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W 137

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
               LP E             R F    W  P G  S I  R+ HPV+HVSWNDA AYC 
Sbjct: 138 ----LPVE-------------RAF----WRQPAGPGSGILERLEHPVLHVSWNDARAYCA 176

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +
Sbjct: 177 WRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVS 233

Query: 254 PVMSY-KENKFG 264
           PV ++  +N +G
Sbjct: 234 PVNAFPAQNNYG 245


>gi|194396886|ref|YP_002038398.1| hypothetical protein SPG_1694 [Streptococcus pneumoniae G54]
 gi|194356553|gb|ACF55001.1| conserved domain protein [Streptococcus pneumoniae G54]
          Length = 208

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 85/191 (44%), Positives = 114/191 (59%), Gaps = 10/191 (5%)

Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
           P   W HP G +S +    +HPVVHV+  DA+A+C W G  LPTEA+WEY  RGG ++  
Sbjct: 22  PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 80

Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
           +PWG+ L   G + AN WQG FP  NTA+DG++ TAPV  +  N FGLY M+GNVWEW  
Sbjct: 81  YPWGDTLLEGGYYHANTWQGRFPYENTASDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 140

Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
              N  +    S+N      PK      +   +GGS+LC+  YC R+R AAR+     S+
Sbjct: 141 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 197

Query: 332 AGNLGFRCAAD 342
           + +LGFRC  +
Sbjct: 198 SSHLGFRCLKE 208


>gi|387626987|ref|YP_006063163.1| hypothetical protein INV104_15460 [Streptococcus pneumoniae INV104]
 gi|417694666|ref|ZP_12343853.1| hypothetical protein SPAR120_1722 [Streptococcus pneumoniae
           GA47901]
 gi|421268982|ref|ZP_15719850.1| hypothetical protein SPAR95_1746 [Streptococcus pneumoniae SPAR95]
 gi|421308008|ref|ZP_15758649.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA60132]
 gi|444383708|ref|ZP_21181892.1| hypothetical protein PCS8106_02106 [Streptococcus pneumoniae
           PCS8106]
 gi|444386357|ref|ZP_21184416.1| hypothetical protein PCS8203_02222 [Streptococcus pneumoniae
           PCS8203]
 gi|301794773|emb|CBW37227.1| conserved hypothetical protein [Streptococcus pneumoniae INV104]
 gi|332201215|gb|EGJ15286.1| hypothetical protein SPAR120_1722 [Streptococcus pneumoniae
           GA47901]
 gi|395868030|gb|EJG79149.1| hypothetical protein SPAR95_1746 [Streptococcus pneumoniae SPAR95]
 gi|395906908|gb|EJH17805.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
           GA60132]
 gi|444247784|gb|ELU54314.1| hypothetical protein PCS8203_02222 [Streptococcus pneumoniae
           PCS8203]
 gi|444248589|gb|ELU55093.1| hypothetical protein PCS8106_02106 [Streptococcus pneumoniae
           PCS8106]
          Length = 282

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 84/191 (43%), Positives = 112/191 (58%), Gaps = 10/191 (5%)

Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
           P   W HP G +S +    +HPVVHV+  DA+A+C W G  LPTEA+WEY  RGG ++  
Sbjct: 96  PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154

Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
           +PWG+ L   G + AN WQG FP  NTA D ++ TAPV  +  N FGLY M+GNVWEW  
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTALDRFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214

Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
              N  +    S+N      PK      +   +GGS+LC+  YC R+R AAR+     S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271

Query: 332 AGNLGFRCAAD 342
           + +LGFRC  +
Sbjct: 272 SSHLGFRCLKE 282


>gi|424854519|ref|ZP_18278877.1| sulfatase-modifying factor 1 [Rhodococcus opacus PD630]
 gi|356664566|gb|EHI44659.1| sulfatase-modifying factor 1 [Rhodococcus opacus PD630]
          Length = 300

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 119/321 (37%), Positives = 152/321 (47%), Gaps = 62/321 (19%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P+  V +DAF ++ H+V+  Q+  FV  TGYVT AE+  D   F P    E     S 
Sbjct: 10  EGPAHRVAIDAFAIEAHQVTTAQYAAFVRDTGYVTVAERPLDAADF-PGAPAENLQPGSM 68

Query: 97  VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
           V      F      ++ R      H+S      + TW                       
Sbjct: 69  V------FTPTRGPVDLR------HLS-----QWWTWTVG-------------------- 91

Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
               SW HP G  STI  R  HPVVHV+  DA AY  W G  LPTEA+WE   RGGL++ 
Sbjct: 92  ---ASWRHPTGPASTITGREQHPVVHVAHEDAAAYAQWAGRSLPTEAQWEAAARGGLDHA 148

Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
            F WG+         AN W GEFP    A  GY +T  V S+  N++GL++M GNVWEWT
Sbjct: 149 EFTWGDAPESESPPLANYWHGEFPWR--ADPGYGTTTAVGSFPPNEYGLFDMAGNVWEWT 206

Query: 277 ADWWNVHHHP------APSYNPKGPTTGTD------------KVKKGGSYLCNEQYCYRH 318
           ADW++V H        AP  NP+GP                 +V KGGS+LC + YC R+
Sbjct: 207 ADWFSVRHADTAESCCAPR-NPRGPGMSESFDPRQPQFRVPRRVIKGGSFLCADSYCRRY 265

Query: 319 RCAARSQNTPDSSAGNLGFRC 339
           R AAR     D+   ++GFRC
Sbjct: 266 RPAARRPQMVDTGMSHIGFRC 286


>gi|111601485|gb|AAI19481.1| Sumf2 protein [Mus musculus]
          Length = 190

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 83/186 (44%), Positives = 114/186 (61%), Gaps = 19/186 (10%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W  P G  S I  ++  PVVH+SWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWG
Sbjct: 15  WRQPAGPGSGIREKLELPVVHMSWNDAGAYCAWRGRRLPTEEEWEFAARGGLKGQVYPWG 74

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWW 280
           N   P   +R N+WQG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA   
Sbjct: 75  NRFQP---NRTNLWQGKFPKGDKAEDGFHGLSPVNAFPPQNNYGLYDLMGNVWEWTAS-- 129

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                   +Y P G      +V +G S++   +    +R R   R  NTPDS++ NLGFR
Sbjct: 130 --------TYQPAGQDM---RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 178

Query: 339 CAADKG 344
           CA+  G
Sbjct: 179 CASSAG 184


>gi|41472461|gb|AAS07452.1| unknown [Homo sapiens]
          Length = 310

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 97/251 (38%), Positives = 128/251 (50%), Gaps = 55/251 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DG+ P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 1   MVQLQGGRFLMGTNSPD-SRDGDGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 60  MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 89

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVSWNDA AYC W
Sbjct: 90  ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 129

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 130 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 186

Query: 255 VMSY-KENKFG 264
           V ++  +N +G
Sbjct: 187 VNAFPAQNNYG 197


>gi|421787630|ref|ZP_16223975.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-82]
 gi|410406523|gb|EKP58527.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-82]
          Length = 322

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 114/342 (33%), Positives = 155/342 (45%), Gaps = 83/342 (24%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 48  PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQPTSVP-------AFLIDATEVTNAQFQ 100

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F              ++ D                HPV  
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMF--------------IQPD----------------HPVEE 130

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W        W+         +E G             SW HP G++++ +   + PV 
Sbjct: 131 LQW--------WK---------FEKGA------------SWKHPWGLNNSKQPAPHEPVR 161

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTP--RGEHRANVWQGEF 239
            V+WNDA AY  W G  LPTE EWEY  +G  +N      ++++P   G   AN WQGEF
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQN------SDISPTHEGHIVANFWQGEF 215

Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPT 296
           P  N   DG+   +PV  + +N FGLY+++GNVWEWT   +     HH   PS   +   
Sbjct: 216 PYKNIQEDGFKDVSPVGCFSKNPFGLYDLIGNVWEWTQTPYTGSRDHHMGDPSKYRQSHE 275

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                  KGGSYLC   YC R+R AAR     D +  ++GFR
Sbjct: 276 QIMQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 317


>gi|392417851|ref|YP_006454456.1| hypothetical protein Mycch_4055 [Mycobacterium chubuense NBB4]
 gi|390617627|gb|AFM18777.1| hypothetical protein Mycch_4055 [Mycobacterium chubuense NBB4]
          Length = 292

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 116/331 (35%), Positives = 158/331 (47%), Gaps = 52/331 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +V +P  T  +G+++       E P R V +  F++    V+N QF EFVSATGYVT AE
Sbjct: 5   LVHVPAQTAVLGSDE---HYPEEGPQRAVEVAEFWIQATPVTNAQFAEFVSATGYVTVAE 61

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D        +E   A    ++     F      ++ R      H+S      + TW 
Sbjct: 62  RPLDA-------AEHPGAPKENLQPGSMVFTRTAGPVDLR------HLS-----QWWTWT 103

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                  P   W HP G  S+IE R +HPVVHV++ DA AY +W
Sbjct: 104 -----------------------PGACWRHPVGPLSSIEKRADHPVVHVAYEDAEAYASW 140

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G  LPTEA+WE   RGGL    + WG+   P G   AN W G+FP       GY  T P
Sbjct: 141 AGLGLPTEAQWETAARGGLTAATYTWGDEPEPPGRPLANYWHGDFPWRPD--RGYGRTTP 198

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD-KVKKGGSY 308
           V S+  N +GL++  GNVWEWT DW+           + S +P+ P      +V KGGS+
Sbjct: 199 VGSFPPNAYGLFDTAGNVWEWTTDWYADRRDARDCCASASRDPRQPQFPVPRRVIKGGSF 258

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           LC + YC R+R AAR     D+   ++GFRC
Sbjct: 259 LCADSYCRRYRPAARRPQPVDTGMSHIGFRC 289


>gi|421806414|ref|ZP_16242278.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC035]
 gi|410417758|gb|EKP69526.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC035]
          Length = 322

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 112/340 (32%), Positives = 150/340 (44%), Gaps = 79/340 (23%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 48  PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F                               +  HPV  
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEE 130

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W                   W++G +G          +W HP G+++  +   + PV 
Sbjct: 131 LQW-------------------WKFG-KGA---------NWKHPWGLNNPKQPAPHEPVR 161

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            V+WNDA AY +W G  LP+E EWEY  +G  +N         T  G   AN WQGEFP 
Sbjct: 162 MVTWNDAYAYASWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
            N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +     
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQI 277

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                KGGSYLC   YC R+R AAR     D +  ++GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 317


>gi|421624110|ref|ZP_16064986.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC098]
 gi|408702141|gb|EKL47555.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC098]
          Length = 322

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 116/342 (33%), Positives = 149/342 (43%), Gaps = 79/342 (23%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 48  PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F              V+ D                HPV  
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMF--------------VQPD----------------HPVAE 130

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W        WR         +E G             +W HP G ++  +   + PV 
Sbjct: 131 LQW--------WR---------FEKGA------------NWKHPWGPNNPKQPAPHEPVR 161

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            V+WNDA AY  W G  LPTE EWEY  +G  +N         T  G   AN WQGEFP 
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
            N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +     
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGEPSKYRQSHEQI 277

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                KGGSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319


>gi|417551536|ref|ZP_12202614.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-18]
 gi|417565904|ref|ZP_12216778.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC143]
 gi|395557660|gb|EJG23661.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC143]
 gi|400385991|gb|EJP49066.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-18]
          Length = 322

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 113/341 (33%), Positives = 149/341 (43%), Gaps = 77/341 (22%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDA-----FYLDQHEVSNTQFQE 62
           P  +   M+ LP     +GT +      G    R + L A     F +D  EV+N QFQE
Sbjct: 48  PKNKEAGMIKLPQGKIVLGTTQ------GYEDERPLNLQATSVPDFLIDATEVTNAQFQE 101

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           FV  TGYVT+AEK G   +F                               +  HPV  +
Sbjct: 102 FVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEEL 131

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
            W                   W++G +G          +W HP G++++ +   + PV  
Sbjct: 132 QW-------------------WKFG-KGA---------NWKHPWGLNNSKQPAPHEPVRM 162

Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
           V+WNDA AY  W G  LPTE EWEY  +G  +N         T  G   AN WQGEFP  
Sbjct: 163 VTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYK 218

Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGT 299
           N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +      
Sbjct: 219 NVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQIL 278

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
               KGGSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 279 QYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319


>gi|375134016|ref|YP_004994666.1| protein kinase [Acinetobacter calcoaceticus PHEA-2]
 gi|325121461|gb|ADY80984.1| protein kinase [Acinetobacter calcoaceticus PHEA-2]
          Length = 317

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 115/340 (33%), Positives = 151/340 (44%), Gaps = 79/340 (23%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 43  PNNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQPTLVP-------AFLIDATEVTNAQFQ 95

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F              ++ D                HPV  
Sbjct: 96  EFVKQTGYVTDAEKQGGAAMF--------------IQPD----------------HPVEE 125

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W        W+         +E G             SW HP G++++ +   + PV 
Sbjct: 126 LQW--------WK---------FEKGA------------SWKHPWGLNNSKQPAPHEPVR 156

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            V+WNDA AY  W G  LPTE EWEY  +G  +N         T  G   AN WQGEFP 
Sbjct: 157 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 212

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
            N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +     
Sbjct: 213 KNIQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGSRDHHMGDPSKYRQSHEQI 272

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                KGGSYLC   YC R+R AAR     D +  ++GFR
Sbjct: 273 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 312


>gi|239503462|ref|ZP_04662772.1| hypothetical protein AbauAB_14222 [Acinetobacter baumannii AB900]
          Length = 340

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 112/342 (32%), Positives = 149/342 (43%), Gaps = 79/342 (23%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 66  PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 118

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F                               +  HPV  
Sbjct: 119 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEE 148

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W                   W++G +G          +W HP G+++  +   + PV 
Sbjct: 149 LQW-------------------WKFG-KGA---------NWKHPWGLNNPKQPAPHEPVR 179

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            V+WNDA AY  W G  LP+E EWEY  +G  +N         T  G   AN WQGEFP 
Sbjct: 180 MVTWNDAYAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 235

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
            N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +     
Sbjct: 236 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQI 295

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                KGGSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 296 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 337


>gi|402495314|ref|ZP_10842044.1| hypothetical protein AagaZ_13304 [Aquimarina agarilytica ZC1]
          Length = 352

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 82/189 (43%), Positives = 112/189 (59%), Gaps = 8/189 (4%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
            +W HP+G +STIE + ++PVV VSW DA+AYC W G RLPTE E+EY  R   +   + 
Sbjct: 163 ANWKHPKGPESTIEGKDDYPVVQVSWYDAMAYCHWAGKRLPTEEEFEYVARNKGKRIKYS 222

Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
           WGN+   R     N  QG FP+ N   D +   APV S+K+N+ G+Y++ GNVWEWT + 
Sbjct: 223 WGNDFR-RAVEFVNFHQGSFPSQNLKEDNFEGLAPVKSFKQNELGVYDIGGNVWEWTLNS 281

Query: 280 WNVHHHPAPSYNPKGPTTGTD------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
           +    +     N + P    D      KV +GGS+LCNE YC  +R AAR  ++PD+   
Sbjct: 282 YFSDGY-TRKLNFEKPLLANDQSIHQKKVIRGGSFLCNESYCSGYRVAARMNSSPDTGLE 340

Query: 334 NLGFRCAAD 342
           +LGFRC  D
Sbjct: 341 HLGFRCVMD 349


>gi|404423688|ref|ZP_11005320.1| hypothetical protein MFORT_24422 [Mycobacterium fortuitum subsp.
           fortuitum DSM 46621]
 gi|403653443|gb|EJZ08424.1| hypothetical protein MFORT_24422 [Mycobacterium fortuitum subsp.
           fortuitum DSM 46621]
          Length = 294

 Score =  160 bits (406), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 115/335 (34%), Positives = 154/335 (45%), Gaps = 56/335 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY--VT 71
           D+V +P     +G++        E P+R V+   F++  H+V+N  + EFVSATGY  V 
Sbjct: 5   DLVWIPAQVTTLGSDA---HYPEEAPAREVSTAGFWIQPHQVTNAAYTEFVSATGYVTVA 61

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           E     D F   P    E     S V    +R  G        + H  +  +W       
Sbjct: 62  ERPVNPDEF---PGAPPENLVPGSMV---FRRTAG-----PVDLRHLNLWWAWT------ 104

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
                                     P   W HP G  S+++ R  HPVVH+++ DA AY
Sbjct: 105 --------------------------PGACWNHPRGPKSSLKGREQHPVVHIAFEDAQAY 138

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWE   RGG+    + WG+     G   AN W GEFP       GY +
Sbjct: 139 ADWAGLALPTEAEWETAARGGIAGAAYTWGDEPEKPGRQLANYWHGEFP--YLPDTGYGT 196

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKG 305
           T PV S++ N +GL++M GNVWEWT DW+       P     SY+P  P      KV KG
Sbjct: 197 TKPVGSFEPNGYGLFDMAGNVWEWTTDWYGEDRATTPCCAADSYDPNQPQFQIGRKVIKG 256

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GS+LC + YC R+R AAR     D+   ++GFRC 
Sbjct: 257 GSFLCADSYCMRYRPAARRPQMVDTGMSHIGFRCV 291


>gi|445457906|ref|ZP_21446821.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC047]
 gi|444776086|gb|ELX00138.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC047]
          Length = 322

 Score =  160 bits (405), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 116/338 (34%), Positives = 150/338 (44%), Gaps = 71/338 (21%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPS--RNVTLDAFYLDQHEVSNTQFQEFVS 65
           P  +   MV LP     +GT +     + E P   +  ++ AF +D  EV+N QFQEFV 
Sbjct: 48  PKNKEAGMVKLPQGKIVLGTTQGY---EDEHPLNLQATSVPAFLIDATEVTNAQFQEFVK 104

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
            TGYVT+AEK G   +F              V+ D                HPV  + W 
Sbjct: 105 QTGYVTDAEKQGGAAMF--------------VQPD----------------HPVEELQW- 133

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
                  WR         +E G             +W HP G ++  +   + PV  V+W
Sbjct: 134 -------WR---------FEKGA------------NWKHPWGPNNPKQPAPHEPVRMVTW 165

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           NDA AY  W G  LPTE EWEY  +G  +N         T  G   AN WQGEFP  N  
Sbjct: 166 NDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYKNVQ 221

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKV 302
            DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +         
Sbjct: 222 EDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGEPSKYRQSHEQILQYT 281

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            KGGSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 282 IKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319


>gi|421652338|ref|ZP_16092697.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC0162]
 gi|408505464|gb|EKK07185.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC0162]
          Length = 322

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 116/342 (33%), Positives = 149/342 (43%), Gaps = 79/342 (23%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 48  PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F              V+ D                HPV  
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMF--------------VQPD----------------HPVEE 130

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W        WR         +E G             +W HP G ++  +   + PV 
Sbjct: 131 LQW--------WR---------FEKGA------------NWKHPWGPNNPKQPAPHEPVR 161

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            V+WNDA AY  W G  LPTE EWEY  +G  +N         T  G   AN WQGEFP 
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
            N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +     
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGEPSKYRQSHEQI 277

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                KGGSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319


>gi|229820583|ref|YP_002882109.1| hypothetical protein Bcav_2094 [Beutenbergia cavernae DSM 12333]
 gi|229566496|gb|ACQ80347.1| protein of unknown function DUF323 [Beutenbergia cavernae DSM
           12333]
          Length = 306

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 122/339 (35%), Positives = 161/339 (47%), Gaps = 59/339 (17%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           +V + G  F MG++   +    E P+  V +D F +   +V+N +F  FV  T Y T AE
Sbjct: 13  LVRIEGGEFTMGSD---VHYPEEAPAHRVAVDGFEISPTQVTNREFTRFVEDTRYRTVAE 69

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +  D   F P    E  A  S V      F      ++ R      H+S      +  W 
Sbjct: 70  RPLDPAAF-PGAPAENLAPGSMV------FTPTAGPVDLR------HLS-----QWWAWT 111

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                  P  SW  P G  S++  R++HPVVHV++ DA AY  W
Sbjct: 112 -----------------------PGASWRRPLGPGSSVRKRLDHPVVHVAYEDAEAYAAW 148

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTEAEWE   RGGL+   F W +     GE RAN W G+FP    A  GY +TAP
Sbjct: 149 AGLRLPTEAEWELAARGGLDGAEFTWSDEPEAAGERRANYWHGDFPWR--AEPGYGATAP 206

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-------------HPAPSYNPKGPTTGTDK 301
           V SY  N +GL++M GNVWEWT+DW+  H                A S +   P T   +
Sbjct: 207 VGSYPANAYGLHDMAGNVWEWTSDWYRAHDAGGCCIPRNPRGGDAASSVDAAQPFTVPRR 266

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           V KGGS+LC + YC R+R AAR     D+   ++GFRCA
Sbjct: 267 VVKGGSFLCADSYCRRYRPAARRPQQVDTGMSHVGFRCA 305


>gi|407261319|ref|XP_003946222.1| PREDICTED: sulfatase-modifying factor 2 [Mus musculus]
          Length = 349

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 99/251 (39%), Positives = 127/251 (50%), Gaps = 56/251 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV LPG  F MGT+ P   +DGE P+R VT+  F +D   V+N  F+EFV    Y TEAE
Sbjct: 38  MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 96

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S E R +                  E+ M  P VH  W   V    WR
Sbjct: 97  AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 134

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                                         P G  S I  ++  PVVHVSWNDA AYC W
Sbjct: 135 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 165

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           RG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQG+FP  + A DG+   +P
Sbjct: 166 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 222

Query: 255 VMSY-KENKFG 264
           V ++  +N +G
Sbjct: 223 VNAFPPQNNYG 233


>gi|46205251|ref|ZP_00209771.1| COG1262: Uncharacterized conserved protein [Magnetospirillum
           magnetotacticum MS-1]
          Length = 271

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 108/274 (39%), Positives = 137/274 (50%), Gaps = 45/274 (16%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMV LPG TFRMG+++       E P   V +D F++D   V+N QF+ FV ATGYVT A
Sbjct: 19  DMVFLPGGTFRMGSDR---HYPEEAPVHRVRVDGFWIDLTPVTNAQFRAFVRATGYVTMA 75

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E+               R         + R     S +      PV    W+       W
Sbjct: 76  ER---------------RPDPKDYPGALPRMLQAGSLMFKPPKGPVDLRDWS-----AWW 115

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         +++G              W  P G  S+I     HPVVHV++ DA AY  
Sbjct: 116 R---------FKFGVH------------WRRPYGSGSSIAGLDEHPVVHVAFADAEAYAA 154

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G  LPTEAEWEY  RGGL+   F WG+ LTP G H AN WQG FP  N A DG+  T+
Sbjct: 155 WAGKELPTEAEWEYAARGGLDGAEFAWGDELTPGGRHMANTWQGAFPHQNLAEDGFERTS 214

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA 287
           PV ++  N +GL++M GNVWEWT D++ V  HPA
Sbjct: 215 PVTAFPPNGYGLFDMTGNVWEWTTDFY-VPKHPA 247


>gi|424060549|ref|ZP_17798040.1| hypothetical protein W9K_01663 [Acinetobacter baumannii Ab33333]
 gi|404668501|gb|EKB36410.1| hypothetical protein W9K_01663 [Acinetobacter baumannii Ab33333]
          Length = 322

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 113/344 (32%), Positives = 151/344 (43%), Gaps = 83/344 (24%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 48  PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F                               +  HPV  
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEE 130

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W        WR         +E G             +W HP G+++  +   + PV 
Sbjct: 131 LQW--------WR---------FEKGA------------NWKHPWGLNNPKQPAPHEPVR 161

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTP--RGEHRANVWQGEF 239
            V+WNDA AY  W G  LP+E EWEY  +G  +N      +++ P   G   AN WQGEF
Sbjct: 162 MVTWNDAYAYANWLGHDLPSELEWEYAAKGFQQN------SDIGPTYEGHIVANFWQGEF 215

Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPT 296
           P  N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +   
Sbjct: 216 PYKNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHE 275

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                  KGGSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 276 QILQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319


>gi|392384161|ref|YP_005033357.1| sulfatase-modifying factor 1 (fragment), partial [Azospirillum
           brasilense Sp245]
 gi|356880876|emb|CCD01844.1| sulfatase-modifying factor 1 (fragment) [Azospirillum brasilense
           Sp245]
          Length = 211

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/184 (46%), Positives = 107/184 (58%), Gaps = 22/184 (11%)

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
            HV++ DA AY  W G  LPTEAEWE+  RGGL+  ++ WG+   P G H AN WQG+FP
Sbjct: 27  FHVAYEDAEAYARWAGKTLPTEAEWEFAARGGLDGTIYCWGDEFMPDGRHMANTWQGQFP 86

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------PSYNP 292
             N  +DG+  TAPV SY  N +GL++M GNVWEWT DW++   HPA        PS NP
Sbjct: 87  WQNQESDGFERTAPVGSYPANGYGLFDMAGNVWEWTCDWYSA-RHPADAGKPCCIPS-NP 144

Query: 293 KGPTTGTD------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           +G    +             KV KGGS+LC   YC R+R AAR     DS+  ++GFRC 
Sbjct: 145 RGAPRESSLDPAQPNIRIPRKVVKGGSFLCAPSYCERYRPAARQPQMIDSAMSHIGFRCI 204

Query: 341 ADKG 344
             KG
Sbjct: 205 IRKG 208


>gi|421654353|ref|ZP_16094683.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-72]
 gi|421679130|ref|ZP_16119009.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC111]
 gi|408511120|gb|EKK12774.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-72]
 gi|410391622|gb|EKP43989.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC111]
          Length = 268

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 111/335 (33%), Positives = 147/335 (43%), Gaps = 79/335 (23%)

Query: 15  MVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQEFV  TG
Sbjct: 1   MVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQEFVKQTG 53

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           YVT+AEK G   +F                               +  HPV  + W    
Sbjct: 54  YVTDAEKQGGAAMFV------------------------------QPEHPVEELQW---- 79

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
                          W++G +G          +W HP G+++  +   + PV  V+WNDA
Sbjct: 80  ---------------WKFG-KGA---------NWKHPWGLNNPKQPAPHEPVRMVTWNDA 114

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LP+E EWEY  +G  +N         T  G   AN WQGEFP  N   DG
Sbjct: 115 YAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYKNVQEDG 170

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKKG 305
           +   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +          KG
Sbjct: 171 FKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQILQYTIKG 230

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 231 GSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 265


>gi|403673645|ref|ZP_10935936.1| hypothetical protein ANCT1_03131 [Acinetobacter sp. NCTC 10304]
          Length = 322

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 112/340 (32%), Positives = 148/340 (43%), Gaps = 79/340 (23%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 48  PNNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F                               +  HPV  
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEE 130

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W        W+         +E G             +W HP G+++  +   + PV 
Sbjct: 131 LQW--------WK---------FEKGA------------NWKHPWGLNNPKQPAPHEPVR 161

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            V+WNDA AY  W G  LP+E EWEY  +G  +N         T  G   AN WQGEFP 
Sbjct: 162 MVTWNDAYAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTDEGHIVANFWQGEFPY 217

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
            N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +     
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQI 277

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                KGGSYLC   YC R+R AAR     D +  ++GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 317


>gi|445486947|ref|ZP_21457568.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii AA-014]
 gi|444769174|gb|ELW93371.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii AA-014]
          Length = 268

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 112/335 (33%), Positives = 146/335 (43%), Gaps = 79/335 (23%)

Query: 15  MVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQEFV  TG
Sbjct: 1   MVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQEFVKQTG 53

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           YVT+AEK G   +F                               +  HPV  + W    
Sbjct: 54  YVTDAEKQGGAAMFV------------------------------QPEHPVEELKW---- 79

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
               WR         +E G             +W HP G+++  +   + PV  V+WNDA
Sbjct: 80  ----WR---------FEKGA------------NWKHPWGLNNPKQPAPHEPVRMVTWNDA 114

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LP+E EWEY  +G  +N         T  G   AN WQGEFP  N   DG
Sbjct: 115 YAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYKNVQEDG 170

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKKG 305
           +   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +          KG
Sbjct: 171 FKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQILQYTIKG 230

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 231 GSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 265


>gi|417546502|ref|ZP_12197588.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC032]
 gi|421667451|ref|ZP_16107521.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC087]
 gi|421669663|ref|ZP_16109682.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC099]
 gi|400384390|gb|EJP43068.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC032]
 gi|410384726|gb|EKP37232.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC087]
 gi|410388048|gb|EKP40488.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC099]
          Length = 268

 Score =  157 bits (397), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 110/335 (32%), Positives = 147/335 (43%), Gaps = 79/335 (23%)

Query: 15  MVLLPGDTFRMGT------NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQEFV  TG
Sbjct: 1   MVKLPQGKIVLGTPQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQEFVKQTG 53

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           Y+T+AEK G   +F                               +  HPV  + W    
Sbjct: 54  YITDAEKQGGAAMFV------------------------------QPEHPVEELQW---- 79

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
               W+         +E G             +W HP G++++ +   + PV  V+WNDA
Sbjct: 80  ----WK---------FEKGA------------NWKHPWGLNNSKQPAPHEPVRMVTWNDA 114

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
            AY  W G  LP+E EWEY  +G  +N         T  G   AN WQGEFP  N   DG
Sbjct: 115 YAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYKNVQEDG 170

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKKG 305
           +   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +          KG
Sbjct: 171 FKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQILQYTIKG 230

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 231 GSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 265


>gi|340785240|ref|YP_004750705.1| sulfatase modifying factor 1 (C-alpha-formyglycine- generating
           enzyme 1) [Collimonas fungivorans Ter331]
 gi|340550507|gb|AEK59882.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1) [Collimonas fungivorans Ter331]
          Length = 344

 Score =  157 bits (397), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 102/299 (34%), Positives = 135/299 (45%), Gaps = 68/299 (22%)

Query: 48  FYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGL 107
           F++DQ +V+N QF  FV+ATGYV++AE+ G   VF     EE  A+              
Sbjct: 100 FWIDQTDVTNAQFSSFVAATGYVSDAERQGGGAVFHKPTLEELNAR-------------- 145

Query: 108 DSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
                         ++W   VA  +WR                             HP G
Sbjct: 146 -------------DLAWWSWVAGASWR-----------------------------HPAG 163

Query: 168 IDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR 227
             S++E   N PV  V+  DA+AY  W G  LP+EAEWEY  + G +        +  PR
Sbjct: 164 PGSSLEGIGNQPVTMVTQADALAYARWLGRDLPSEAEWEYAGKAGHDGAEL----DTAPR 219

Query: 228 GEH---RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
             H    AN WQG FP  N   DG++  APV  Y  N F LY+MVGN WEWT D ++  H
Sbjct: 220 DHHGRPTANYWQGVFPLVNNNEDGHIGLAPVGCYAANDFKLYDMVGNAWEWTKDIYSGSH 279

Query: 285 H-----PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                    +  P      T  V KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 280 QVHSNGDTAAVAPASRRHDTPMVIKGGSFLCSRDYCVRYRASSREQQEADLPASHIGFR 338


>gi|424741299|ref|ZP_18169658.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii WC-141]
 gi|422944924|gb|EKU39897.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii WC-141]
          Length = 322

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 113/340 (33%), Positives = 147/340 (43%), Gaps = 79/340 (23%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 48  PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F                               + +HPV  
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPNHPVEE 130

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W        W+         +E G             +W HP G ++  +   + PV 
Sbjct: 131 LQW--------WK---------FEKGA------------NWKHPWGPNNPKQPAPHEPVR 161

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            V+WNDA AY  W G  LPTE EWEY  +G  +N         T  G   AN WQGEFP 
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
            N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +     
Sbjct: 218 KNIKEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQI 277

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                KGGSYLC   YC R+R AAR     D +  + GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHAGFR 317


>gi|323456570|gb|EGB12437.1| hypothetical protein AURANDRAFT_3711, partial [Aureococcus
           anophagefferens]
          Length = 210

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 92/248 (37%), Positives = 120/248 (48%), Gaps = 54/248 (21%)

Query: 32  LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEER 91
           L +DG   + +  + AF +D   VSN QF+ FV  T Y+TEAE F  +FV E L +    
Sbjct: 13  LSRDGAKAASHRRVPAFRIDPTAVSNDQFKRFVRDTQYLTEAENFRWSFVLE-LTASNAT 71

Query: 92  AKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGG 151
              S  ++ + R +                                    + W  G  G 
Sbjct: 72  INQSDAKNGLGRVQA-----------------------------------SPWWMGVFGA 96

Query: 152 LENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRG 211
                     W  PEG DS++  R +HP  H+SWNDA AYC W G RLPTE EWE   RG
Sbjct: 97  Y---------WRKPEGPDSSLRGRGDHPATHISWNDASAYCKWAGRRLPTEVEWEMAARG 147

Query: 212 GLENRLFPWGNNLTPRGEH----RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYN 267
           GLE+  FPWG+      +H    R N W+GEFP  N+A DG++   PV +Y  N FG+YN
Sbjct: 148 GLEDEPFPWGD-----ADHDPWTRLNAWEGEFPDENSARDGFVGPGPVDAYAPNAFGIYN 202

Query: 268 MVGNVWEW 275
            +GNVWEW
Sbjct: 203 ALGNVWEW 210


>gi|425747363|ref|ZP_18865371.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii WC-348]
 gi|425493937|gb|EKU60159.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii WC-348]
          Length = 322

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/342 (33%), Positives = 148/342 (43%), Gaps = 79/342 (23%)

Query: 8   PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           P  +   MV LP     +GT       +P+ ++    P       AF +D  EV+N QFQ
Sbjct: 48  PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           EFV  TGYVT+AEK G   +F              V+ D                HPV  
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMF--------------VQPD----------------HPVEE 130

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           + W        WR         +E G             +W HP G ++  +   +  V 
Sbjct: 131 LQW--------WR---------FEKGA------------NWKHPWGPNNPKQPAPHELVR 161

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
            V+WNDA AY  W G  LPTE EWEY  +G  +N         T  G   AN WQGEFP 
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
            N   DG+   APV  + +N FGLY+++GNVWEWT   +     HH   PS   +     
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGEPSKYRQSHEQI 277

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                KGGSYLC   YC R+R AAR     D +  ++GFR  
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319


>gi|167527321|ref|XP_001747993.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163773742|gb|EDQ87380.1| predicted protein [Monosiga brevicollis MX1]
          Length = 360

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 113/314 (35%), Positives = 150/314 (47%), Gaps = 68/314 (21%)

Query: 34  KDGEFPSRNVTLDAFY---LDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEE 90
           +DGE P R  ++       L  + V+N+ F+ FV AT Y TEAE++G +FVF  L S+  
Sbjct: 98  RDGEAPPRKASVRPVSCHELSTNTVNNSAFRAFVRATKYTTEAEQYGWSFVFHLLASDSV 157

Query: 91  RAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRG 150
           RA   +V   +K  E                  W        W    LP           
Sbjct: 158 RA---EVTETIKGSE------------------W--------W----LPV---------- 174

Query: 151 GLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCR 210
                  P   W  P G  S+I+ R ++PV  +SWNDA A+C WRG RLPTEAEWE    
Sbjct: 175 -------PRAYWRMPFGKGSSIKGRNDYPVTQISWNDARAFCKWRGMRLPTEAEWELAAG 227

Query: 211 GGLENRLFPWGNNLTPR-GEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNM 268
           G  +++ +PWG    PR G    NVWQG+FP  NT ADG+L  APV +Y  +   G YN+
Sbjct: 228 GSKQSKRYPWG----PRAGLDHMNVWQGKFPDVNTKADGHLGLAPVDAYGPQTDTGFYNL 283

Query: 269 VGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQN 326
           +GN WEW    +       PS   +        V KGGSY+   + Q  +  R   R  N
Sbjct: 284 LGNGWEWVDQVYEGQPGEPPSQEKR-------VVLKGGSYVDSVDGQTNHPVRVTTRMGN 336

Query: 327 TPDSSAGNLGFRCA 340
           T DS++ N+ FRCA
Sbjct: 337 TVDSASDNMSFRCA 350


>gi|444410026|ref|ZP_21206582.1| cysteine-type sulfatase aerobic maturase domain protein
           [Streptococcus pneumoniae PNI0076]
 gi|444412934|ref|ZP_21209253.1| cysteine-type sulfatase aerobic maturase domain protein
           [Streptococcus pneumoniae PNI0153]
 gi|444421844|ref|ZP_21217512.1| cysteine-type sulfatase aerobic maturase domain protein
           [Streptococcus pneumoniae PNI0446]
 gi|444274173|gb|ELU79828.1| cysteine-type sulfatase aerobic maturase domain protein
           [Streptococcus pneumoniae PNI0153]
 gi|444278414|gb|ELU83866.1| cysteine-type sulfatase aerobic maturase domain protein
           [Streptococcus pneumoniae PNI0076]
 gi|444288796|gb|ELU93685.1| cysteine-type sulfatase aerobic maturase domain protein
           [Streptococcus pneumoniae PNI0446]
          Length = 172

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 78/172 (45%), Positives = 105/172 (61%), Gaps = 10/172 (5%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           +HPVVHV+  DA+A+C W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQ
Sbjct: 5   DHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQ 63

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----- 291
           G FP  NT +DG++ TAPV  +  N FGLY M+GNVWEW     N  +    S+N     
Sbjct: 64  GRFPYENTVSDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYE 120

Query: 292 -PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            PK      +   +GGS+LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 121 LPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 172


>gi|119628382|gb|EAX07977.1| sulfatase modifying factor 2, isoform CRA_c [Homo sapiens]
          Length = 197

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 99/263 (37%), Positives = 129/263 (49%), Gaps = 68/263 (25%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 1   MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q    MK   G  S I  R+ HPV+H+SWNDA AYC WR
Sbjct: 60  MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 115

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G RLPTE EWE+  RGGL+ +++PWG+W  P        +R N                W
Sbjct: 116 GKRLPTEEEWEFAARGGLKGQVYPWGNWFQP--------NRTN---------------LW 152

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G                             P+G+   + + G  P N   A        
Sbjct: 153 QGK---------------------------FPKGDKAEDGFHGVSPVNAFPA-------- 177

Query: 255 VMSYKENKFGLYNMVGNVWEWTA 277
                +N +GLY+++GNVWEWTA
Sbjct: 178 -----QNNYGLYDLLGNVWEWTA 195


>gi|218678281|ref|ZP_03526178.1| hypothetical protein RetlC8_05157 [Rhizobium etli CIAT 894]
          Length = 249

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 90/240 (37%), Positives = 122/240 (50%), Gaps = 41/240 (17%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P+    +D F++D   V+N  F  FVSATGYVT AEK  D   + P    E  A  S 
Sbjct: 8   EAPAHRERVDGFWMDGFTVTNLDFGRFVSATGYVTLAEKPADPANY-PGAKPEMLAPSSI 66

Query: 97  VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
           +      F+   + ++ R H+                          W    RG      
Sbjct: 67  M------FKKPSAPVDMRDHY-------------------------NWWVYVRGA----- 90

Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
               +W HP G  S+I    +HPVVHV+++D  AY  W G +LP+EAEWE+  RGGL+  
Sbjct: 91  ----NWRHPRGSASSIRKLGDHPVVHVAYDDVAAYAKWAGKQLPSEAEWEFAARGGLDGA 146

Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
            + WG+   P G+H AN WQG+FP  N+ ADG+  TAPV S+  N +GLY+M  NVW+WT
Sbjct: 147 EYVWGDEFNPGGKHMANTWQGDFPYRNSLADGFEYTAPVGSFPANGYGLYDMARNVWQWT 206


>gi|299770976|ref|YP_003733002.1| hypothetical protein AOLE_13710 [Acinetobacter oleivorans DR1]
 gi|298701064|gb|ADI91629.1| hypothetical protein AOLE_13710 [Acinetobacter oleivorans DR1]
          Length = 317

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 104/299 (34%), Positives = 134/299 (44%), Gaps = 66/299 (22%)

Query: 43  VTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMK 102
            ++ AF +D  EV+N QFQEFV  TGYVT+AEK G   +F                    
Sbjct: 77  TSVPAFLIDATEVTNAQFQEFVKQTGYVTDAEKQGGAAMFA------------------- 117

Query: 103 RFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSW 162
                      +  HPV  + W        W+         +E G             +W
Sbjct: 118 -----------QPEHPVEELQW--------WK---------FEKGA------------NW 137

Query: 163 LHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGN 222
            HP G ++  +   + PV  V+WNDA AY  W G  LPTE EWEY  +G  +N       
Sbjct: 138 KHPWGPNNPKQPAPHEPVRMVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI---- 193

Query: 223 NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN- 281
             T  G   AN WQGEFP  N   DG+   APV  + +N FGLY+++GNVWEWT   +  
Sbjct: 194 GPTHEGHIVANFWQGEFPYKNIQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTG 253

Query: 282 --VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
              HH   PS   +          KGGSYLC   YC R+R AAR     D +  ++GFR
Sbjct: 254 PRDHHMGDPSKYRQSHEQILQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 312


>gi|170697260|ref|ZP_02888354.1| protein of unknown function DUF323 [Burkholderia ambifaria
           IOP40-10]
 gi|170137880|gb|EDT06114.1| protein of unknown function DUF323 [Burkholderia ambifaria
           IOP40-10]
          Length = 349

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 112/336 (33%), Positives = 146/336 (43%), Gaps = 77/336 (22%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG        +  F++DQ +V+  QF  FV ATGYV
Sbjct: 72  MVHLSGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVKATGYV 126

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           TEAE+ G   VF     +E  A      HD+                      WN     
Sbjct: 127 TEAEQQGGAAVFHVPARDELNA------HDLAW--------------------WN----- 155

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
                        W  G             SW  P G  S+++   N PV  V+  DA+A
Sbjct: 156 -------------WVKGA------------SWRQPRGPGSSVDGLGNLPVTLVTQRDALA 190

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
           Y  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  +TA D
Sbjct: 191 YAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDTAED 246

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
           G+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  V
Sbjct: 247 GHAGLAPVGCYAANGFHLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|408791375|ref|ZP_11202985.1| cysteine-type sulfatase aerobic maturase family protein [Leptospira
           meyeri serovar Hardjo str. Went 5]
 gi|408462785|gb|EKJ86510.1| cysteine-type sulfatase aerobic maturase family protein [Leptospira
           meyeri serovar Hardjo str. Went 5]
          Length = 327

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 106/326 (32%), Positives = 150/326 (46%), Gaps = 74/326 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F  G+N    +   E P  N T+  F++D+ EV+N +F +FV  TGY TEAE
Sbjct: 51  MVLVPSGNFEKGSN----VYPEESPIYNTTVSEFWMDETEVTNDEFAKFVWETGYQTEAE 106

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                     L    E   +   +     F+    + + +++    H+ W          
Sbjct: 107 T--------NLKPNSEEVNLEHYQPGAVVFQ--KPSNQGKINS---HLEW---------- 143

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                    W Y           P  +W HP+G  S+IE + ++PVV VS+ DA +Y  W
Sbjct: 144 ---------WSY----------VPGTNWRHPDGPYSSIEGKGSYPVVAVSYQDANSYAKW 184

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G  LPTE EWE+    G                   AN WQGEFP  +   DG+   AP
Sbjct: 185 KGRTLPTEVEWEWAAANG---------------SREEANTWQGEFPFLDEGKDGFTGIAP 229

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK--VKKGGSYLCNE 312
           V  + +NKFGLY+M+GNVWE+TAD W           P   T    K  + KGGSYLC+ 
Sbjct: 230 VGCFAKNKFGLYDMIGNVWEYTADPW-----------PVDQTLDKSKYHIIKGGSYLCSP 278

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFR 338
            YC R+R  A+       ++ ++GFR
Sbjct: 279 NYCKRYRAQAKQPQEDRLASNHIGFR 304



 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 6/74 (8%)

Query: 320 CAARSQNTPDSSAGNLGFRCAADKGPTTGT-DKVK----KGGSYLCNEQYCYRHRCAARS 374
           C A+++       GN+ +   AD  P   T DK K    KGGSYLC+  YC R+R  A+ 
Sbjct: 232 CFAKNKFGLYDMIGNV-WEYTADPWPVDQTLDKSKYHIIKGGSYLCSPNYCKRYRAQAKQ 290

Query: 375 QNTPDSSAGNLGFR 388
                 ++ ++GFR
Sbjct: 291 PQEDRLASNHIGFR 304


>gi|363581085|ref|ZP_09313895.1| hypothetical protein FbacHQ_06220 [Flavobacteriaceae bacterium
           HQM9]
          Length = 353

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 80/187 (42%), Positives = 107/187 (57%), Gaps = 8/187 (4%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
            +W HP+G  STI  + ++PVV VSW DA+AYC W   RLPTE E+EY  R   +   + 
Sbjct: 164 ANWRHPQGPKSTIIGKDSYPVVQVSWYDAIAYCHWANKRLPTEEEFEYAARNQGKQIKYS 223

Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
           WGN+     E+ AN  QG FP  N   D +   APV S+K N  G+Y++ GNVWEWT + 
Sbjct: 224 WGNDFNKAVEY-ANFHQGNFPLGNLKIDNFEGLAPVKSFKNNALGIYDIGGNVWEWTLNS 282

Query: 280 WNVHHHPAPSYNPKGPTTGTD------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
           +    + +   N + P    D      KV +GGS+LCNE YC  +R AAR   +PD+   
Sbjct: 283 YFSDAY-SKKINFEQPILANDNTLNQKKVIRGGSFLCNESYCSGYRVAARMNASPDTGLE 341

Query: 334 NLGFRCA 340
           +LGFRC 
Sbjct: 342 HLGFRCV 348


>gi|444414754|ref|ZP_21211005.1| cysteine-type sulfatase aerobic maturase domain protein
           [Streptococcus pneumoniae PNI0199]
 gi|444281206|gb|ELU86535.1| cysteine-type sulfatase aerobic maturase domain protein
           [Streptococcus pneumoniae PNI0199]
          Length = 172

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 77/172 (44%), Positives = 104/172 (60%), Gaps = 10/172 (5%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           +HPVVHV+  DA+A+C W G  LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQ
Sbjct: 5   DHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQ 63

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----- 291
           G FP  NT +DG++ TAPV  +  N FGLY M+GN WEW     N  +    S+N     
Sbjct: 64  GRFPYENTVSDGFIGTAPVYEFLPNDFGLYQMIGNDWEWCR---NPRYTLLASFNEDDYE 120

Query: 292 -PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            PK      +   +GGS+LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 121 LPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 172


>gi|167590166|ref|ZP_02382554.1| hypothetical protein BuboB_32820 [Burkholderia ubonensis Bu]
          Length = 347

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 114/339 (33%), Positives = 152/339 (44%), Gaps = 69/339 (20%)

Query: 15  MVLLPGDTFRMGTNK--PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G  F  G+ +  P     G+  +R   +  F++DQ +V+  QF  FV ATGYVT+
Sbjct: 70  MVHLRGGAFVFGSTRGYPDERPAGDGRTR---VGGFWIDQTDVTIAQFAAFVQATGYVTD 126

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           A                ER   + V H   R E                ++  D +A+ +
Sbjct: 127 A----------------ERQGGAAVFHVPTRDE----------------MNARD-LAWWS 153

Query: 133 W-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
           W RGA                        SW HP G  S+++ R N PV  V+  DA+AY
Sbjct: 154 WVRGA------------------------SWWHPAGPGSSVDGRGNLPVTLVTQRDALAY 189

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G  LPTEAEWEY  + G ++           RG+  AN WQG FP  +TA DG+  
Sbjct: 190 ARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPAANYWQGVFPVLDTAEDGHAG 248

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKVKKGG 306
            APV  Y  N F LY+M+GN WEWT D +   H            P      T  V KGG
Sbjct: 249 LAPVGCYAANGFRLYDMIGNAWEWTKDVYTGPHQSHTNGDTAVVAPPARRHDTPMVIKGG 308

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           S+LC+  YC R+R ++R Q   D +A ++GFR      P
Sbjct: 309 SFLCSRDYCVRYRASSREQQEADLAASHIGFRTILRDAP 347


>gi|408674864|ref|YP_006874612.1| Sulphatase-modifying factor protein [Emticicia oligotrophica DSM
           17448]
 gi|387856488|gb|AFK04585.1| Sulphatase-modifying factor protein [Emticicia oligotrophica DSM
           17448]
          Length = 313

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 77/182 (42%), Positives = 109/182 (59%), Gaps = 5/182 (2%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR-LF 218
            +W +P+G D   +   NHPV  VSW DAVAY  W G RLP+E EWE+  R G  +R L+
Sbjct: 135 ANWEYPQGKDQP-KAPDNHPVTQVSWYDAVAYAKWAGKRLPSEIEWEHAARNGTNDRTLY 193

Query: 219 PWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
           P+GN++   G+  AN W G FP  N   DG+  T+PV  + + K GL +M GNVW+W  D
Sbjct: 194 PFGNDIEKNGKALANTWNGVFPDYNKNTDGFFYTSPVGYFGKTKLGLTDMTGNVWQW-CD 252

Query: 279 WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            W + +        K PT   +KV+KGGS+LC   +C+ +R + RS  +P++S  ++GFR
Sbjct: 253 NWKISYEDIN--QGKMPTDTLEKVEKGGSFLCEPGWCHGYRVSGRSFTSPETSLMHVGFR 310

Query: 339 CA 340
           C 
Sbjct: 311 CV 312



 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 56/154 (36%), Positives = 82/154 (53%), Gaps = 9/154 (5%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +PG   ++G    +     E P   V +  F++D   V+  QF+EFV AT YVT AE
Sbjct: 58  MVYVPGGYTQIGVTDGMSF---EKPMFWVEVKPFFMDISPVTVAQFREFVKATKYVTYAE 114

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +FGD   F+     E +    +   + +  +G D   +   +HPV  +SW DAVAY  W 
Sbjct: 115 RFGDGGFFD----VEAKGWTLKKGANWEYPQGKDQP-KAPDNHPVTQVSWYDAVAYAKWA 169

Query: 135 GARLPTEAEWEYGCRGGLENR-LFPWGSWLHPEG 167
           G RLP+E EWE+  R G  +R L+P+G+ +   G
Sbjct: 170 GKRLPSEIEWEHAARNGTNDRTLYPFGNDIEKNG 203


>gi|120436433|ref|YP_862119.1| hypothetical protein GFO_2087 [Gramella forsetii KT0803]
 gi|117578583|emb|CAL67052.1| protein containing DUF323 [Gramella forsetii KT0803]
          Length = 344

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 77/188 (40%), Positives = 105/188 (55%), Gaps = 6/188 (3%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
            +W +P G DSTI+ +  +PVV VSW DA+AY  W G RLPTEAE+EY  R G +  ++ 
Sbjct: 157 ANWKNPTGPDSTIDGKDTYPVVQVSWYDAMAYAKWAGKRLPTEAEFEYFNRAGNDTLIYH 216

Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
           WGN+     E   N +QG+FP  N+  D +   A V S+  N FGLY   GNVWEW  D 
Sbjct: 217 WGNDFEKASEM-VNFFQGDFPRQNSTEDEFEKKASVKSFPANDFGLYETSGNVWEWCLDT 275

Query: 280 -----WNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
                +++       Y  +       KV +GGS+LC+E YC  +R +AR  + PD+   +
Sbjct: 276 YYPDAYSIRDQQTNGYFKEFVNMQQQKVIRGGSFLCSESYCTGYRNSARMSSAPDTGLEH 335

Query: 335 LGFRCAAD 342
            GFRC  D
Sbjct: 336 TGFRCVKD 343


>gi|413961059|ref|ZP_11400288.1| hypothetical protein BURK_014113 [Burkholderia sp. SJ98]
 gi|413931773|gb|EKS71059.1| hypothetical protein BURK_014113 [Burkholderia sp. SJ98]
          Length = 333

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 113/336 (33%), Positives = 145/336 (43%), Gaps = 66/336 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G  F  G+    L  + E P  +    + AF++DQ +V+N QF  FV ATGYVT 
Sbjct: 58  MVHLRGGEFVFGST---LGYEDERPMGDGKTKVGAFWIDQTDVTNAQFAAFVDATGYVT- 113

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
                          E ER   + V H           +  R       ++W        
Sbjct: 114 ---------------EAERQGGAVVFH-----------VPTREEMNARDLAW-------- 139

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W          W  G             SW HP G  ST + R N PV  V+  DA+AY 
Sbjct: 140 W---------SWVKGA------------SWKHPTGSSSTWKGRENRPVTMVTLADALAYA 178

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWEY  + G E            RG+  AN WQG FP  ++A DG+   
Sbjct: 179 HWLGRDLPTEAEWEYAGKAGREGAQLD-AAPRDARGKPSANYWQGVFPVLDSAEDGHAGL 237

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTG----TDKVKKGGSY 308
           APV  Y  N F LY+M+GN WEWT D +     P  + +           T  V KGGS+
Sbjct: 238 APVGCYAANDFRLYDMIGNAWEWTKDPYTGPRQPHSNGDTAAVIPARKHDTAMVIKGGSF 297

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           LC + YC R+R ++R Q   D  A ++GFR     G
Sbjct: 298 LCAQDYCVRYRASSREQQEADLPASHIGFRTVLRDG 333


>gi|295700051|ref|YP_003607944.1| hypothetical protein [Burkholderia sp. CCGE1002]
 gi|295439264|gb|ADG18433.1| protein of unknown function DUF323 [Burkholderia sp. CCGE1002]
          Length = 363

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 113/343 (32%), Positives = 147/343 (42%), Gaps = 85/343 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G  F  G+    L  + E P+ +    +  F++DQ +V+N QF  FV ATGYV  
Sbjct: 86  MVHLHGGDFVFGSK---LGYEDERPAGDGKTHVAGFWIDQTDVTNAQFAAFVRATGYV-- 140

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
                         S+ ER   + V H                                 
Sbjct: 141 --------------SDAERQGGAVVFH--------------------------------- 153

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWL------HPEGIDSTIEHRMNHPVVHVSWN 186
                 PT  E        +  R   W SW+      HP G  S ++ RMN PV  V+  
Sbjct: 154 -----TPTPEE--------MNARDLAWWSWVKGAAWNHPGGPGSNLDGRMNQPVTMVTQA 200

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH---RANVWQGEFPTNN 243
           DA+AY  W G  LPTEAEWEY  + G E        +  PR  H    AN WQG FP  N
Sbjct: 201 DALAYAHWLGRDLPTEAEWEYAGKAGHEGADL----DTAPRDAHGRPSANYWQGVFPVLN 256

Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTG 298
           T+ DG++  +PV  Y  N F LY+M+GN WEWT D ++  H         +  P      
Sbjct: 257 TSEDGHVGLSPVGCYAANDFRLYDMIGNAWEWTKDVYSGPHQSHTNGDTAAVAPLSRRHD 316

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           T  V KGGS+LC   YC R+R A+R Q   D  A ++GFR  +
Sbjct: 317 TPMVIKGGSFLCARDYCVRYRAASREQQEADLPASHIGFRTVS 359


>gi|149927426|ref|ZP_01915681.1| hypothetical protein LMED105_01032 [Limnobacter sp. MED105]
 gi|149823918|gb|EDM83143.1| hypothetical protein LMED105_01032 [Limnobacter sp. MED105]
          Length = 273

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 113/333 (33%), Positives = 144/333 (43%), Gaps = 65/333 (19%)

Query: 17  LLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
           +L G  FRMG+N+       E P R  ++  F +D   V N+ F EFV AT Y+T A   
Sbjct: 1   MLSGGEFRMGSNRHY---PEERPERTESVAEFAIDVFPVRNSDFAEFVEATQYMTAA--- 54

Query: 77  GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
                        ER   S V    +    L++  +           W  AV    W   
Sbjct: 55  -------------ERTGHSHVFQMTQGPVPLNNPDQ-----------WWKAVQGACWYNP 90

Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
           R  TE   ++                               HPVVHV+  DA AY  W G
Sbjct: 91  RPGTELPGDFS-----------------------------EHPVVHVALEDAKAYAAWAG 121

Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
            RLPTEAEWE+  RGG     F WG+   P G+  A VWQG FP    A  G   T  V 
Sbjct: 122 GRLPTEAEWEFAARGGPHYAQFAWGDEFAPNGQRMAQVWQGAFPW-YYAPGGLPGTVAVG 180

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----SYNPKGPTTGTDKVKKGGSYLCNE 312
            +  N +GLY+M+GNVWEWT   +       P    S +P+   + T    KGGS+LC  
Sbjct: 181 QFPANGYGLYDMIGNVWEWTQSTFAAPALDKPAGCCSCSPQ-QDSNTLFTLKGGSHLCAA 239

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +YC R+R AAR      SS  +LGFRCA +  P
Sbjct: 240 EYCLRYRPAARIGIAGASSTSHLGFRCAYNTSP 272


>gi|347527447|ref|YP_004834194.1| hypothetical protein SLG_10620 [Sphingobium sp. SYK-6]
 gi|345136128|dbj|BAK65737.1| conserved hypothetical protein [Sphingobium sp. SYK-6]
          Length = 317

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 82/197 (41%), Positives = 106/197 (53%), Gaps = 15/197 (7%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           R+     W HP+G  S I+ R++ PVVHVS  DA AY  WRG RLP+E E+E   +G L 
Sbjct: 124 RIVQGADWRHPQGPGSRIDDRLDEPVVHVSQADARAYAHWRGHRLPSEEEYERAAQGRL- 182

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
                 G          AN WQG FP  +   DG    APV  YK N +G ++++GNVWE
Sbjct: 183 ------GTLRDQPAPDSANSWQGPFPHEDLKLDGAAGIAPVGCYKANAYGAHDLIGNVWE 236

Query: 275 WTADWWNVHHHPAPS-----YNPKGPTTGTDK---VKKGGSYLCNEQYCYRHRCAARSQN 326
           WT+ W+   H P PS      NP       DK   V KGGS+LC   YC R+R AAR   
Sbjct: 237 WTSSWYTPGHGPEPSPLGAGSNPSADPRQPDKPVRVIKGGSFLCAPNYCARYRPAARHAQ 296

Query: 327 TPDSSAGNLGFRCAADK 343
             D +A ++GFR A+++
Sbjct: 297 AEDETASHIGFRTASNR 313



 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 58/144 (40%), Positives = 82/144 (56%), Gaps = 10/144 (6%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G   R+G ++       E PS    +  F+LD+ EV+N QF  FV+ATG+VT AE
Sbjct: 43  MVWVAGGETRIGDDEGYA---DERPSYTAPVKGFWLDRTEVTNRQFAAFVAATGHVTTAE 99

Query: 75  KFGDTFVFEP------LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           + GD+FVF P        + ++  +I Q   D +  +G  S I+ R+  PVVH+S  DA 
Sbjct: 100 RRGDSFVFVPPAAGTIATAPDQWWRIVQ-GADWRHPQGPGSRIDDRLDEPVVHVSQADAR 158

Query: 129 AYCTWRGARLPTEAEWEYGCRGGL 152
           AY  WRG RLP+E E+E   +G L
Sbjct: 159 AYAHWRGHRLPSEEEYERAAQGRL 182


>gi|338209668|ref|YP_004653715.1| sulfatase-modifying factor protein [Runella slithyformis DSM 19594]
 gi|336303481|gb|AEI46583.1| Sulphatase-modifying factor protein [Runella slithyformis DSM
           19594]
          Length = 366

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 70/184 (38%), Positives = 107/184 (58%), Gaps = 3/184 (1%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR-LF 218
            +W HP G D       +HPV  VSWNDA AY  W G RLP E EWE+  R    +R ++
Sbjct: 183 ANWHHPMGPDFPAAAD-DHPVTQVSWNDANAYAQWAGKRLPCELEWEHAARNARNSRSIY 241

Query: 219 PWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
           PWGN++  +G++ AN+WQG+FP  N   DG+  T+PV  + +   GL +M GNVWEW ++
Sbjct: 242 PWGNDIQTKGQYNANIWQGKFPVKNANEDGFAYTSPVGKFGKTPLGLTDMSGNVWEWQSN 301

Query: 279 WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
               ++        +      ++ ++GGS+LC   +C+ +R + RS +T ++S  ++GFR
Sbjct: 302 -LKFNYEALFRTGTRDGDLSAERAQRGGSFLCEPGWCHGYRVSGRSSSTSETSLFHVGFR 360

Query: 339 CAAD 342
           C  D
Sbjct: 361 CVKD 364



 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 84/154 (54%), Gaps = 8/154 (5%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ + G   ++G+++ +   + E PS    +++F++D H V+  +F++FV AT + TEAE
Sbjct: 105 MIYIQGGVVQIGSDEGL---EQERPSFWAKVNSFWMDTHPVTVGEFRKFVQATHFQTEAE 161

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           KFG+  + +      +R  I +   +     G D        HPV  +SWNDA AY  W 
Sbjct: 162 KFGNAGIID---ESTDRQWILKDGANWHHPMGPDFPAAAD-DHPVTQVSWNDANAYAQWA 217

Query: 135 GARLPTEAEWEYGCRGGLENR-LFPWGSWLHPEG 167
           G RLP E EWE+  R    +R ++PWG+ +  +G
Sbjct: 218 GKRLPCELEWEHAARNARNSRSIYPWGNDIQTKG 251


>gi|255318498|ref|ZP_05359731.1| sulfatase-modifying factor 1 [Acinetobacter radioresistens SK82]
 gi|262378727|ref|ZP_06071884.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
 gi|255304490|gb|EET83674.1| sulfatase-modifying factor 1 [Acinetobacter radioresistens SK82]
 gi|262300012|gb|EEY87924.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
          Length = 327

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 108/338 (31%), Positives = 145/338 (42%), Gaps = 89/338 (26%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV + G TF +G N+    +   + S     D F++D  EV+N QFQ FV AT YVTEAE
Sbjct: 59  MVKIQGGTFNIGNNQSYREEKALYKSERTVTD-FWIDATEVTNAQFQSFVDATAYVTEAE 117

Query: 75  KFGDTFVF-EPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           K G+  VF EP                                 PV  ++W     +   
Sbjct: 118 KQGEAAVFTEP-------------------------------QKPVKELAW-----WSLM 141

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           +GA                        +W  P G +S  +   N PV  ++  DA+AY  
Sbjct: 142 KGA------------------------NWKQPWGPNSHRKIEPNQPVRMITLKDAMAYAD 177

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLST 252
           W    L TE +WEY  +G      F    +++   +H  ANVWQGEFP +N   DGY   
Sbjct: 178 WLDRDLATEEQWEYAAKG------FSHERDVSADLKHVDANVWQGEFPYHNENKDGYRDV 231

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWN------------VHHHPAPSYNPKGPTTGTD 300
           APV  +  N FGLY+M+GNVWE+T   +              HH   P++N         
Sbjct: 232 APVGCFAANGFGLYDMIGNVWEYTKSPFTGTHDDHMGMNQLEHHQDNPAFNF-------- 283

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
              KGGS+LC   YC R+R AAR     D    ++GFR
Sbjct: 284 YTIKGGSFLCASNYCMRYRAAARHSQEIDLGISHVGFR 321


>gi|372209873|ref|ZP_09497675.1| sulfatase-modifying factor protein [Flavobacteriaceae bacterium
           S85]
          Length = 306

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 82/206 (39%), Positives = 116/206 (56%), Gaps = 13/206 (6%)

Query: 140 TEAEWEYGCRGGLENRLFPWG-----SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           TEAE  +G  G  + R   W      +W +P G +   + + NHPV HVSWNDA AY TW
Sbjct: 105 TEAE-NFGDSGIFDLRGIRWNLVKGTTWEYPFGPEGD-KAQNNHPVTHVSWNDANAYATW 162

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
            G RLPTE EWEY  + G E + +PWGN++   G + AN W G+   +    DGYL T+P
Sbjct: 163 AGKRLPTEIEWEYAAKNGKELK-YPWGNSVKMNGNYMANTWDGQTIQDQNVKDGYLFTSP 221

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
           V ++ ++K G+ +MVGNVW+WTA  +  +    P +           V +GGS++ ++  
Sbjct: 222 VGAFPKSKSGINDMVGNVWQWTATVFEPYDKTIP-FQKNAQVV----VTRGGSFMFDQAL 276

Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCA 340
              +    R+QNT +SS  N GFRCA
Sbjct: 277 ELSYTTTFRAQNTVESSLFNTGFRCA 302



 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 88/164 (53%), Gaps = 27/164 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           +DMV   G    +G++    I   E PS    +  FYLD + V+   F++FV AT Y+TE
Sbjct: 50  QDMVFFKGGEITIGSDDRTEI---EAPSFTTIVAPFYLDINLVTVADFRKFVEATNYITE 106

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH---------RMHHPVVHIS 123
           AE FGD+ +F+          +  +R ++ +     +T E+         + +HPV H+S
Sbjct: 107 AENFGDSGIFD----------LRGIRWNLVK----GTTWEYPFGPEGDKAQNNHPVTHVS 152

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
           WNDA AY TW G RLPTE EWEY  + G E + +PWG+ +   G
Sbjct: 153 WNDANAYATWAGKRLPTEIEWEYAAKNGKELK-YPWGNSVKMNG 195


>gi|171319832|ref|ZP_02908915.1| protein of unknown function DUF323 [Burkholderia ambifaria MEX-5]
 gi|171094927|gb|EDT39956.1| protein of unknown function DUF323 [Burkholderia ambifaria MEX-5]
          Length = 349

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/337 (32%), Positives = 148/337 (43%), Gaps = 79/337 (23%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG        +  F++DQ +V+  QF  FV ATGYV
Sbjct: 72  MVHLSGGAFVFGSTRGYADERPVGDGR-----TRIGGFWIDQTDVTIAQFAAFVKATGYV 126

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           T                E ER   + V H   R E                ++  D +A+
Sbjct: 127 T----------------EAERQGGAAVFHVPARDE----------------LNARD-LAW 153

Query: 131 CTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
            +W +GA                        SW  P G  S+++   N PV  V+  DA+
Sbjct: 154 WSWVKGA------------------------SWRQPRGPGSSVDGLDNLPVTLVTQRDAL 189

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAA 246
           AY  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  +TA 
Sbjct: 190 AYAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDTAE 245

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
           DG+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  
Sbjct: 246 DGHAGLAPVGCYAANGFHLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPM 305

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           V KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 306 VIKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|254254946|ref|ZP_04948263.1| hypothetical protein BDAG_04268 [Burkholderia dolosa AUO158]
 gi|124899591|gb|EAY71434.1| hypothetical protein BDAG_04268 [Burkholderia dolosa AUO158]
          Length = 349

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/334 (32%), Positives = 148/334 (44%), Gaps = 73/334 (21%)

Query: 15  MVLLPGDTFRMGTNK--PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G  F  G+ +  P     G+  +R   +  F++DQ +V+  QF  FV ATGYVT+
Sbjct: 72  MVHLRGGAFVFGSTRGYPDERPAGDGRTR---VGGFWIDQTDVTIAQFAAFVHATGYVTD 128

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           A                ER   + V H   R E                ++  D +A+ T
Sbjct: 129 A----------------ERQGGAAVFHVPTRDE----------------LNLRD-LAWWT 155

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           W                           SW HP G  S+++   N PV  V+  DA+AY 
Sbjct: 156 WVDG-----------------------ASWRHPRGRGSSVDGLGNLPVTLVTQRDALAYA 192

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH---RANVWQGEFPTNNTAADGY 249
            W G  LPTEAEWEY  + G ++       +  PR  H    AN WQG FP  +TA DG+
Sbjct: 193 RWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAHGKPAANYWQGAFPVLDTAEDGH 248

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKVKK 304
              APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  V K
Sbjct: 249 AGLAPVGCYAANGFRLYDMIGNAWEWTKDPYTGPHQSHTNGDTAAVAPPTHAHDTPMVIK 308

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           GGS+LC+  YC R+R ++R +   D  A ++GFR
Sbjct: 309 GGSFLCSRDYCVRYRASSRERQEADLGASHIGFR 342


>gi|330820618|ref|YP_004349480.1| hypothetical protein bgla_2g15220 [Burkholderia gladioli BSR3]
 gi|327372613|gb|AEA63968.1| hypothetical protein bgla_2g15220 [Burkholderia gladioli BSR3]
          Length = 349

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 101/302 (33%), Positives = 136/302 (45%), Gaps = 68/302 (22%)

Query: 45  LDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRF 104
           +  F++DQ +V+  QF  FV ATGY+TEA                ER   + V H     
Sbjct: 101 VGGFWIDQTDVTVAQFDSFVRATGYLTEA----------------ERQGGAAVFH----- 139

Query: 105 EGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLH 164
                 +  R       ++W        W          W  G             SW H
Sbjct: 140 ------VPSREELAARDLAW--------W---------SWVKGA------------SWRH 164

Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLF---PWG 221
           P G  S+++ R N PV  V+ +DA+AY  W G  LPTEAEWEY  + G +       P G
Sbjct: 165 PNGPGSSVDGRGNLPVTLVTLHDALAYARWLGRDLPTEAEWEYAGKAGRDGADLDAAPVG 224

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
           ++    G+  AN WQG FP  +TA DG+   APV  Y  N F LY+M+GN WEWT D ++
Sbjct: 225 SD----GKPAANYWQGVFPVIDTAEDGHAGLAPVGCYAANDFRLYDMIGNAWEWTRDAYS 280

Query: 282 VHHHP-----APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
             H         +  P      T  V KGGS+LC   YC R+R ++R     D +A ++G
Sbjct: 281 GPHQSHTNGDTAAVAPASRPHDTPMVIKGGSFLCARDYCVRYRASSRESQEADLAASHIG 340

Query: 337 FR 338
           FR
Sbjct: 341 FR 342


>gi|297285361|ref|XP_002802766.1| PREDICTED: sulfatase-modifying factor 1-like [Macaca mulatta]
          Length = 242

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 66/99 (66%), Positives = 79/99 (79%), Gaps = 2/99 (2%)

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
           GYL+   V ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGS
Sbjct: 144 GYLTE--VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGS 201

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           Y+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 202 YMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 240



 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 36/72 (50%), Positives = 52/72 (72%), Gaps = 3/72 (4%)

Query: 8   PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81  PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140

Query: 65  SATGYVTEAEKF 76
           ++TGY+TE + F
Sbjct: 141 NSTGYLTEVDAF 152


>gi|298708177|emb|CBJ30517.1| Sulfatase-modifying factor 2 precursor [Ectocarpus siliculosus]
          Length = 297

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 137/321 (42%), Gaps = 57/321 (17%)

Query: 35  DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
           DG  P   V +  F +D+H V+N QF  FV  +GYVTEAEKFG +F  E +L  E    I
Sbjct: 17  DGGLPHTQVAVGQFNIDEHCVTNEQFAIFVKDSGYVTEAEKFGWSFALEYMLDAE---TI 73

Query: 95  SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
            +    + R +G +  +                                       G+E 
Sbjct: 74  KEADEGIGRVKGSEHWV---------------------------------------GVEG 94

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
                 +W  PEG  S+++ R N PV  VSW DA AYC+W G RLPTEAEWEY  RGG  
Sbjct: 95  -----ANWRQPEGPRSSLKSRGNVPVAQVSWVDADAYCSWAGRRLPTEAEWEYAARGGQT 149

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
                 G+  T +     N WQG FP  N+  DGY   AP  ++  N   +Y+M+GN WE
Sbjct: 150 LASEHVGDKST-KMWLVGNSWQGHFPNTNSVEDGYAGLAPATAFAPNALEVYDMLGNAWE 208

Query: 275 WTADWWNVHHHPAPSYNPKGPTTGTDKV----KKGGSYLCNEQYCYRHRCAARSQNTPDS 330
           W A        P       G     D V      G S +  +   +      R + T DS
Sbjct: 209 WVAG-----GDPTRRTLRGGSFVDFDPVAAEKNNGSSGIPRQPANHAITPGTRMETTQDS 263

Query: 331 SAGNLGFRCAADKGPTTGTDK 351
            + N  FRCA     + GT +
Sbjct: 264 GSANTSFRCAQSPLKSGGTSR 284


>gi|416911903|ref|ZP_11931660.1| hypothetical protein B1M_06815 [Burkholderia sp. TJI49]
 gi|325528173|gb|EGD05359.1| hypothetical protein B1M_06815 [Burkholderia sp. TJI49]
          Length = 347

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 150/335 (44%), Gaps = 75/335 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G  F  G+ +       E PS +    +  F++DQ +V+  QF  FV ATGY   
Sbjct: 70  MVHLAGGAFVFGSTRGYA---DERPSGDGRTRVGGFWIDQTDVTVAQFAAFVQATGY--- 123

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
                        ++E E+   + V H   R E                ++  D +A+ +
Sbjct: 124 -------------VTEAEQQGGAAVFHVPTRDE----------------LNARD-LAWWS 153

Query: 133 W-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
           W +GA                        SW HP G  STI+   N PV  V+  DA+AY
Sbjct: 154 WVKGA------------------------SWRHPRGPGSTIDGHGNLPVTLVTQRDALAY 189

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAADG 248
             W G  LPTEAEWEY  + G ++       +  PR   G   AN WQG FP  +TA DG
Sbjct: 190 AHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGRPAANYWQGPFPVLDTAEDG 245

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPSYNPKGPTT---GTDKVK 303
           +   APV  Y  N F LY+M+GN WEWT D +   H  H         P T    T  V 
Sbjct: 246 HAGLAPVGCYAANGFRLYDMIGNAWEWTRDVYTGPHQSHTNGDTAAVAPATRRHDTPMVI 305

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 306 KGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 340


>gi|194379762|dbj|BAG58233.1| unnamed protein product [Homo sapiens]
          Length = 242

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 66/99 (66%), Positives = 79/99 (79%), Gaps = 2/99 (2%)

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
           GYL+ A   ++  N +GLYN+VGN WEWT+DWW VHH    + NPKGP +G D+VKKGGS
Sbjct: 144 GYLTEAD--AFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGS 201

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           Y+C+  YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 202 YMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 240



 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 40/76 (52%), Positives = 56/76 (73%), Gaps = 4/76 (5%)

Query: 5   PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
           P P P ER   +  MV +P   F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77  PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136

Query: 61  QEFVSATGYVTEAEKF 76
           ++FV++TGY+TEA+ F
Sbjct: 137 EKFVNSTGYLTEADAF 152


>gi|392964375|ref|ZP_10329796.1| sulfatase modifying factor 1 [Fibrisoma limi BUZ 3]
 gi|387847270|emb|CCH51840.1| sulfatase modifying factor 1 [Fibrisoma limi BUZ 3]
          Length = 320

 Score =  145 bits (366), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 86/234 (36%), Positives = 123/234 (52%), Gaps = 17/234 (7%)

Query: 116 HHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW-----GSWLHPEGIDS 170
            HPV    +   V    ++     TEAE ++G    L+     W       W HP+G + 
Sbjct: 96  EHPVTVAEFRRFVTATKYK-----TEAE-KFGDGAVLDETTRQWTLRKGAKWSHPQGPEQ 149

Query: 171 TIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL-FPWGNNLT-PRG 228
                 NHPV  VSWNDA AY  W G RLP E EWE+  R    +R  +P+GN L    G
Sbjct: 150 PPAPD-NHPVTQVSWNDAQAYARWAGKRLPYEFEWEHAARNATNSRTRYPFGNELINADG 208

Query: 229 EHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP 288
           ++ AN W G FP ++   DG+  T+PV  + +   GL +M GNVW+W  DW     + A 
Sbjct: 209 KYLANTWNGTFPESDAVLDGFHLTSPVGQFGKTPLGLTDMTGNVWQWCEDW---RQNYAD 265

Query: 289 SYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
               + PT  T+KV++GGS+LC   +C+ +R + RS  +P++S  ++GFRC  D
Sbjct: 266 VMAGQPPTNPTEKVQRGGSFLCEPGWCHGYRVSGRSFTSPETSLMHVGFRCVKD 319


>gi|170736356|ref|YP_001777616.1| hypothetical protein Bcenmc03_3975 [Burkholderia cenocepacia MC0-3]
 gi|169818544|gb|ACA93126.1| protein of unknown function DUF323 [Burkholderia cenocepacia MC0-3]
          Length = 349

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 109/336 (32%), Positives = 145/336 (43%), Gaps = 77/336 (22%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG+       +  F++DQ +V+  QF  FV ATGYV
Sbjct: 72  MVHLRGGAFVFGSTRGYADERPVGDGK-----TRVGGFWIDQTDVTIAQFAAFVQATGYV 126

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                           +E E+   + V H           +  R       ++W      
Sbjct: 127 ----------------TEAEQQGGAAVFH-----------VPTRDEMNARDLAW------ 153

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W          W  G             SW HP G  S +E   N PV  V+  DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSDVEGLGNLPVTLVTQRDALA 190

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
           Y  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  ++A D
Sbjct: 191 YARWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDSAED 246

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPSYNPKGPTT---GTDKV 302
           G+   APV  Y  N F LY+M+GN WEWT D +   H  H         P T    T  V
Sbjct: 247 GHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPLTRRHDTPMV 306

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|254249091|ref|ZP_04942411.1| hypothetical protein BCPG_03951 [Burkholderia cenocepacia PC184]
 gi|124875592|gb|EAY65582.1| hypothetical protein BCPG_03951 [Burkholderia cenocepacia PC184]
          Length = 349

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 107/336 (31%), Positives = 144/336 (42%), Gaps = 77/336 (22%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG+       +  F++DQ +V+  QF  FV ATGYV
Sbjct: 72  MVHLRGGAFVFGSTRGYADERPVGDGK-----TRVGGFWIDQTDVTIAQFAAFVQATGYV 126

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                           +E E+   + V H           +  R       ++W      
Sbjct: 127 ----------------TEAEQQGGAAVFH-----------VPTRDEMNARDLAW------ 153

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W          W  G             SW HP G  S +E   N PV  V+  DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSDVEGLGNLPVTLVTQRDALA 190

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
           Y  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  ++A D
Sbjct: 191 YARWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDSAED 246

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
           G+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  V
Sbjct: 247 GHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|304321886|ref|YP_003855529.1| hypothetical protein PB2503_11714 [Parvularcula bermudensis
           HTCC2503]
 gi|303300788|gb|ADM10387.1| hypothetical protein PB2503_11714 [Parvularcula bermudensis
           HTCC2503]
          Length = 308

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 86/189 (45%), Positives = 105/189 (55%), Gaps = 14/189 (7%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           RL P  SW  P G  ST++    HPVVH+S  DA AY  W G RLPTE EWEY    GL 
Sbjct: 121 RLDPEASWRTPGGEGSTLDGLALHPVVHISLADAHAYAEWVGGRLPTEVEWEYAASLGLF 180

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
           +   P      P GE RANVW G FP  N+A DG++ TAPV  Y+  + G Y+M+GNVWE
Sbjct: 181 DPNDPDSGMRGPNGEARANVWTGIFPLINSAEDGFIGTAPVGCYRPGRIGAYDMIGNVWE 240

Query: 275 WTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
           WT         P  +  P+          KGGS+LC++ YC R+R AAR    PD S  +
Sbjct: 241 WT-------DSPFAAGTPR-------FTIKGGSFLCSKNYCRRYRAAAREAFEPDFSTAH 286

Query: 335 LGFRCAADK 343
           LGFR   D+
Sbjct: 287 LGFRIVRDR 295


>gi|344256595|gb|EGW12699.1| Sulfatase-modifying factor 2 [Cricetulus griseus]
          Length = 227

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 90/241 (37%), Positives = 117/241 (48%), Gaps = 59/241 (24%)

Query: 1   MVLLPAPPVE----RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVS 56
           M LL +P ++    +   MV LPG  F MGT+ P   +DGE P+R VT+ +F +D   V+
Sbjct: 11  MCLLSSPRLQLGNAQAPTMVQLPGGRFLMGTDAPD-GRDGEGPAREVTVKSFTIDVFPVT 69

Query: 57  NTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
           N  F+EFV    Y TEAE FG +FVFE  +S E R + +Q+                   
Sbjct: 70  NKDFREFVREKKYQTEAETFGWSFVFEDFVSPELRNQANQMPS----------------- 112

Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
                + W   V    WR                              P G  S I  ++
Sbjct: 113 -----VLWWLPVQKAFWR-----------------------------QPAGPGSGIREKL 138

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
             PVVHVSWNDA AYC WRG RLPTE EWE+  RGGL+ +++PWGN   P   +R N+WQ
Sbjct: 139 ELPVVHVSWNDASAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQ 195

Query: 237 G 237
           G
Sbjct: 196 G 196


>gi|134100962|ref|YP_001106623.1| serine/threonine protein kinase [Saccharopolyspora erythraea NRRL
           2338]
 gi|291003423|ref|ZP_06561396.1| putative serine/threonine protein kinase [Saccharopolyspora
           erythraea NRRL 2338]
 gi|133913585|emb|CAM03698.1| putative serine/threonine protein kinase [Saccharopolyspora
           erythraea NRRL 2338]
          Length = 223

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 65/103 (63%), Positives = 72/103 (69%)

Query: 180 VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEF 239
           +V VSWNDA AYC W G RLPTE EWEY  RGG   R +PWG+ L P G HR N+WQG F
Sbjct: 1   MVQVSWNDAAAYCDWAGLRLPTEVEWEYAARGGPVRRRYPWGDELAPGGGHRCNIWQGHF 60

Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
           PT NTA DG+  TAP  +Y+ N FGLYNM GN WEW AD W V
Sbjct: 61  PTRNTAEDGFRGTAPADAYRPNGFGLYNMAGNFWEWCADRWTV 103


>gi|78063430|ref|YP_373338.1| hypothetical protein Bcep18194_B2583 [Burkholderia sp. 383]
 gi|77971315|gb|ABB12694.1| protein of unknown function DUF323 [Burkholderia sp. 383]
          Length = 349

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 107/336 (31%), Positives = 143/336 (42%), Gaps = 77/336 (22%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG        +  F++DQ +V+  QF  FV ATGYV
Sbjct: 72  MVHLRGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVQATGYV 126

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                           +E E+   + V H           +  R       ++W      
Sbjct: 127 ----------------TEAEQQDGAAVFH-----------VPTRDEMNARDLAW------ 153

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W          W  G             SW HP G  S ++   N PV  V+  DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSGVDGLGNLPVTLVTQRDALA 190

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
           Y  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  +TA D
Sbjct: 191 YARWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGVFPVLDTAED 246

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
           G+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  V
Sbjct: 247 GHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|296137978|ref|YP_003645221.1| hypothetical protein [Tsukamurella paurometabola DSM 20162]
 gi|296026112|gb|ADG76882.1| protein of unknown function DUF323 [Tsukamurella paurometabola DSM
           20162]
          Length = 351

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 81/193 (41%), Positives = 110/193 (56%), Gaps = 16/193 (8%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
           W +P G  ST E+R +HPVVH+S+ DA+AYC W    LPTEA+WE   RG +    +PWG
Sbjct: 160 WRNPFGAGSTAENRPDHPVVHISYFDALAYCAWANRTLPTEAQWECAARGTILGETYPWG 219

Query: 222 NNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
                 GE  + N+++G FP   T   G   T PV S   N  G+Y+ VGNVWEWTAD +
Sbjct: 220 GQPPIDGEPLKVNIFRGAFPDQPTTPVG---TMPVTSLPPNDIGIYHSVGNVWEWTADAY 276

Query: 281 ---------NVHHHPA-PSYNPKGP--TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTP 328
                         PA  + +P+ P  T   D + +GGSYLC++ YC R+R +AR+  TP
Sbjct: 277 VPGLYAQRAATTLGPAGTAVDPRSPGDTDRDDFMLRGGSYLCHDSYCNRYRNSARTHATP 336

Query: 329 DSSAGNLGFRCAA 341
            +S  ++GFR  A
Sbjct: 337 HTSTSHIGFRTTA 349



 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/141 (37%), Positives = 72/141 (51%), Gaps = 14/141 (9%)

Query: 32  LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEER 91
           +  D E P+  V + A+ +D   V+N QF  FV ATGY T A++ G + VF   ++    
Sbjct: 81  IAADEEGPAHPVGVSAYSIDATTVTNAQFARFVDATGYRTTAQQRGSSTVFYSQVAPS-- 138

Query: 92  AKISQVRHDM------------KRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP 139
           A +  V                +   G  ST E+R  HPVVHIS+ DA+AYC W    LP
Sbjct: 139 ATVLDVPDPQAWWWRTVEDAYWRNPFGAGSTAENRPDHPVVHISYFDALAYCAWANRTLP 198

Query: 140 TEAEWEYGCRGGLENRLFPWG 160
           TEA+WE   RG +    +PWG
Sbjct: 199 TEAQWECAARGTILGETYPWG 219


>gi|116691643|ref|YP_837176.1| hypothetical protein Bcen2424_3544 [Burkholderia cenocepacia
           HI2424]
 gi|116649643|gb|ABK10283.1| protein of unknown function DUF323 [Burkholderia cenocepacia
           HI2424]
          Length = 349

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 108/337 (32%), Positives = 149/337 (44%), Gaps = 79/337 (23%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG+       +  F++DQ +V+  QF  FV ATGY 
Sbjct: 72  MVHLRGGAFVFGSTRGYADERPVGDGK-----TRVGGFWIDQTDVTIAQFAAFVQATGY- 125

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                          ++E E+   + V H   R E                ++  D +A+
Sbjct: 126 ---------------VTEAEQQGGAAVFHVPTRDE----------------LNARD-LAW 153

Query: 131 CTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
            +W +GA                        SW HP G  S +E   N PV  V+  DA+
Sbjct: 154 WSWVKGA------------------------SWRHPHGPGSDVEGLGNLPVTLVTQRDAL 189

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAA 246
           AY  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  ++A 
Sbjct: 190 AYARWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDSAE 245

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
           DG+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  
Sbjct: 246 DGHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPM 305

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           V KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 306 VIKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|441500083|ref|ZP_20982253.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1) [Fulvivirga imtechensis AK7]
 gi|441436174|gb|ELR69548.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1) [Fulvivirga imtechensis AK7]
          Length = 290

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 83/218 (38%), Positives = 120/218 (55%), Gaps = 23/218 (10%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE-NRLFP 219
           +W  P G +     + NHP+ HVSWNDA+AY +W G RLP E EWE+  R G   N  + 
Sbjct: 79  TWQQPLGPNGA-PAKDNHPITHVSWNDAMAYASWAGKRLPHEYEWEFAARNGKNTNDKYS 137

Query: 220 WGNNLTP-RGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
           WGN+LT   G  +ANVWQG FP  NT AD +  T+PV ++     G+ +M GNVWEW ++
Sbjct: 138 WGNSLTDNNGVFKANVWQGHFPDTNTVADDFAHTSPVGAFGITSCGMSDMGGNVWEWCSN 197

Query: 279 WWNVH---HHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
            + ++   H+P      K P     KV +GGS++C+   C+ +R +AR   + ++S  ++
Sbjct: 198 IFELYPGNHYPY----QKDPNV---KVIRGGSFMCDSLVCHGYRVSARQFTSRETSNFHM 250

Query: 336 GFRCAAD----------KGPTTGTDKVKKGGSYLCNEQ 363
           GFRCA D          + P   T       SYL N++
Sbjct: 251 GFRCAMDVDRRTSNPKHQTPNQNTTIKTTYKSYLSNDK 288



 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 13/152 (8%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV   G    +G+   +     E P+    +  F++D+H V+  QF+ FV  TGY T+AE
Sbjct: 1   MVYFAGGDITIGSEHGLT---NEAPTFEAEIAPFFIDKHPVTVAQFRAFVQTTGYKTDAE 57

Query: 75  KFGDT--FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           KFG++  F ++ L+ E   +   Q           D+       HP+ H+SWNDA+AY +
Sbjct: 58  KFGNSALFNYKKLIYELVDSVTWQQPLGPNGAPAKDN-------HPITHVSWNDAMAYAS 110

Query: 133 WRGARLPTEAEWEYGCRGGLE-NRLFPWGSWL 163
           W G RLP E EWE+  R G   N  + WG+ L
Sbjct: 111 WAGKRLPHEYEWEFAARNGKNTNDKYSWGNSL 142



 Score = 44.7 bits (104), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 18/43 (41%), Positives = 30/43 (69%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           KV +GGS++C+   C+ +R +AR   + ++S  ++GFRCA DV
Sbjct: 216 KVIRGGSFMCDSLVCHGYRVSARQFTSRETSNFHMGFRCAMDV 258


>gi|161521664|ref|YP_001585091.1| hypothetical protein Bmul_5129 [Burkholderia multivorans ATCC
           17616]
 gi|189352169|ref|YP_001947796.1| hypothetical protein BMULJ_03387 [Burkholderia multivorans ATCC
           17616]
 gi|160345714|gb|ABX18799.1| protein of unknown function DUF323 [Burkholderia multivorans ATCC
           17616]
 gi|189336191|dbj|BAG45260.1| conserved hypothetical protein [Burkholderia multivorans ATCC
           17616]
          Length = 347

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/217 (39%), Positives = 109/217 (50%), Gaps = 19/217 (8%)

Query: 140 TEAEWEYGC-------RGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
           TEAE + G        R  L  R   W       SW HP G  S +E R N PV  V+  
Sbjct: 125 TEAEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVEGRGNLPVTLVTQR 184

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA+AY  W G  LPTEAEWEY  + G ++           RG+  AN WQG FP  +TA 
Sbjct: 185 DALAYARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPAANYWQGPFPVLDTAE 243

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
           DG+   APV  Y  N F LY+M+GN W+WT D +   H         +  P      T  
Sbjct: 244 DGHAGLAPVGCYAANGFRLYDMIGNAWQWTKDVYTGPHQSHTNGDTAAVAPAARRHDTPM 303

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           V KGGS+LC+  YC R+R ++R +   D +A ++GFR
Sbjct: 304 VIKGGSFLCSRDYCVRYRASSRERQEADLAASHIGFR 340



 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 56/147 (38%), Positives = 77/147 (52%), Gaps = 10/147 (6%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G TF  G+ +       E P+ +    +  F++DQ +V+  QF  FV ATGYVTE
Sbjct: 70  MVHLRGGTFVFGSTRGY---QDERPAADGRTRVGGFWIDQTDVTVAQFAAFVQATGYVTE 126

Query: 73  AEKFGDTFVFEPLLSEEERAKI----SQVRHDMKRF-EGLDSTIEHRMHHPVVHISWNDA 127
           AE+ G   VF     +E  A+     S V+    R   G  S +E R + PV  ++  DA
Sbjct: 127 AEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVEGRGNLPVTLVTQRDA 186

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLEN 154
           +AY  W G  LPTEAEWEY  + G ++
Sbjct: 187 LAYARWLGRDLPTEAEWEYAGKAGRDD 213


>gi|221197358|ref|ZP_03570405.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
 gi|221204031|ref|ZP_03577049.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
 gi|421471366|ref|ZP_15919662.1| formylglycine-generating sulfatase enzyme [Burkholderia multivorans
           ATCC BAA-247]
 gi|221176197|gb|EEE08626.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
 gi|221183912|gb|EEE16312.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
 gi|400225650|gb|EJO55794.1| formylglycine-generating sulfatase enzyme [Burkholderia multivorans
           ATCC BAA-247]
          Length = 347

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/218 (38%), Positives = 109/218 (50%), Gaps = 19/218 (8%)

Query: 140 TEAEWEYGC-------RGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
           TEAE + G        R  L  R   W       SW HP G  S +E R N PV  V+  
Sbjct: 125 TEAEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVEGRGNLPVTLVTQR 184

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA+AY  W G  LPTEAEWEY  + G ++           RG+  AN WQG FP  +TA 
Sbjct: 185 DALAYARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPAANYWQGPFPVLDTAE 243

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
           DG+   APV  Y  N F LY+M+GN W+WT D +   H         +  P      T  
Sbjct: 244 DGHAGLAPVGCYAANGFRLYDMIGNAWQWTKDAYTGPHQSHTNGDTAAVAPAARRHDTPM 303

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           V KGGS+LC+  YC R+R ++R +   D +A ++GFR 
Sbjct: 304 VIKGGSFLCSRDYCVRYRASSRERQEADLAASHIGFRT 341



 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 56/147 (38%), Positives = 77/147 (52%), Gaps = 10/147 (6%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G TF  G+ +       E P+ +    +  F++DQ +V+  QF  FV ATGYVTE
Sbjct: 70  MVHLRGGTFVFGSTRGY---QDERPAGDGRTRVGGFWIDQTDVTVAQFAAFVQATGYVTE 126

Query: 73  AEKFGDTFVFEPLLSEEERAKI----SQVRHDMKRF-EGLDSTIEHRMHHPVVHISWNDA 127
           AE+ G   VF     +E  A+     S V+    R   G  S +E R + PV  ++  DA
Sbjct: 127 AEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVEGRGNLPVTLVTQRDA 186

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLEN 154
           +AY  W G  LPTEAEWEY  + G ++
Sbjct: 187 LAYARWLGRDLPTEAEWEYAGKAGRDD 213


>gi|172062487|ref|YP_001810138.1| hypothetical protein BamMC406_3455 [Burkholderia ambifaria MC40-6]
 gi|171995004|gb|ACB65922.1| protein of unknown function DUF323 [Burkholderia ambifaria MC40-6]
          Length = 349

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 107/337 (31%), Positives = 148/337 (43%), Gaps = 79/337 (23%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG        +  F++DQ +V+  QF  FV ATGY 
Sbjct: 72  MVHLSGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVKATGY- 125

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                          ++E E+   + V H   R E                ++  D +A+
Sbjct: 126 ---------------VTEAEQQGGAAVFHVPTRDE----------------LNARD-LAW 153

Query: 131 CTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
            +W +GA                        SW  P G  S+++   N PV  V+  DA+
Sbjct: 154 WSWVKGA------------------------SWRQPRGPGSSVDGHGNLPVTLVTQRDAL 189

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAA 246
           AY  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  +TA 
Sbjct: 190 AYAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDTAE 245

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
           DG+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  
Sbjct: 246 DGHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPM 305

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           V KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 306 VIKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|103485795|ref|YP_615356.1| hypothetical protein Sala_0300 [Sphingopyxis alaskensis RB2256]
 gi|98975872|gb|ABF52023.1| protein of unknown function DUF323 [Sphingopyxis alaskensis RB2256]
          Length = 309

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 100/335 (29%), Positives = 145/335 (43%), Gaps = 73/335 (21%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           P  P    +D   +PG TF +G +  +     E P R VT+  F++  HEV+N +F EFV
Sbjct: 33  PECPAMPAQDYAWIPGATFTLGADAQL---PEEGPPREVTVAGFWIATHEVTNAEFAEFV 89

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
            ATGY T              L+E++   +     DM    G          +P    +W
Sbjct: 90  RATGYKT--------------LAEQDPPPLPGAPPDM-LIPGSAVFTAPTDGNP----NW 130

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
                   WR         W  G              W HP G  + I+ R   PVV + 
Sbjct: 131 --------WR---------WVVGAE------------WRHPAGPATNIDGRGRDPVVQIG 161

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
           + DA+A+  W+G  LP+E +WE     G  +R  P G +    G+ RAN +QG FP  + 
Sbjct: 162 YEDALAFAKWKGKTLPSEEQWELAAASGGADRNVPVGKD----GKPRANFYQGSFPLRDL 217

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK-VK 303
             DG+   APV  +  +  G+++++GNVWEWT                     G+++ V 
Sbjct: 218 GTDGFTGRAPVACFPADAHGVHDLIGNVWEWTTS-----------------AAGSERNVI 260

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           KGGS+LC   YC R+R AAR          ++GFR
Sbjct: 261 KGGSFLCAANYCARYRPAARQFQERSLGTDHIGFR 295


>gi|393720429|ref|ZP_10340356.1| hypothetical protein SechA1_11798 [Sphingomonas echinoides ATCC
           14820]
          Length = 300

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 101/307 (32%), Positives = 132/307 (42%), Gaps = 66/307 (21%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P R VT+  F++D HE++N +F  FV ATGY T AE+        P    E R   S 
Sbjct: 55  EGPPRTVTVPGFWIDTHELTNAEFAAFVKATGYRTMAERKPPVL---PGAPPEMRVPGSA 111

Query: 97  VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
           V        G D               W        WR         W  G +       
Sbjct: 112 V---FAAPSGDDP-------------RW--------WR---------WVVGAQ------- 131

Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
                W HP G + +I  +   PVV +++ DA AY  W G  LP+EA+WEY  RGG    
Sbjct: 132 -----WRHPSGSNESIIGKDTTPVVQIAYEDAEAYARWAGKALPSEAQWEYAARGGQPAL 186

Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
             P   + TP     AN +QG FP  +   DGY   APV  +  N +GLY+M+GNVWEWT
Sbjct: 187 PEPIDAHGTPT----ANYYQGVFPARDLGIDGYRGRAPVGCFPANGYGLYDMIGNVWEWT 242

Query: 277 ADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +                  + G  +V KGGSYLC   YC R+R AAR          ++G
Sbjct: 243 SG--------------VAGSAGDRRVIKGGSYLCAANYCARYRPAARQFQELGLGTDHIG 288

Query: 337 FRCAADK 343
            R   ++
Sbjct: 289 VRLVDNR 295


>gi|115360020|ref|YP_777158.1| hypothetical protein Bamb_5277 [Burkholderia ambifaria AMMD]
 gi|115285308|gb|ABI90824.1| protein of unknown function DUF323 [Burkholderia ambifaria AMMD]
          Length = 349

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 107/337 (31%), Positives = 148/337 (43%), Gaps = 79/337 (23%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG        +  F++DQ +V+  QF  FV ATGY 
Sbjct: 72  MVHLSGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVKATGY- 125

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                          ++E E+   + V H   R E                ++  D +A+
Sbjct: 126 ---------------VTEAEQQGGAAVFHVPTRDE----------------LNARD-LAW 153

Query: 131 CTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
            +W +GA                        SW  P G  S+++   N PV  V+  DA+
Sbjct: 154 WSWVKGA------------------------SWRQPRGPGSSVDGLGNLPVTLVTQRDAL 189

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAA 246
           AY  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  +TA 
Sbjct: 190 AYAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDTAE 245

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
           DG+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  
Sbjct: 246 DGHAGLAPVGCYAANGFHLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPM 305

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           V KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 306 VIKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|385677485|ref|ZP_10051413.1| non-specific serine/threonine protein kinase [Amycolatopsis sp.
           ATCC 39116]
          Length = 136

 Score =  140 bits (354), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 67/133 (50%), Positives = 86/133 (64%), Gaps = 14/133 (10%)

Query: 210 RGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMV 269
           R GL+   +PWG+ LTP GEHR N+WQG FP +NT  DG + TAPV ++  N FGLYN+ 
Sbjct: 17  RRGLDQARYPWGDELTPGGEHRCNIWQGRFPVHNTEEDGQVGTAPVDAFPPNGFGLYNVS 76

Query: 270 GNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
           GNVWEW ADW++  H               ++  +GGSYLC++ YC R+R AAR+ N PD
Sbjct: 77  GNVWEWCADWFDDAH--------------ANRAMRGGSYLCHDSYCNRYRVAARTANMPD 122

Query: 330 SSAGNLGFRCAAD 342
            S+GNLGFR   D
Sbjct: 123 GSSGNLGFRTVTD 135



 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 24/45 (53%), Positives = 33/45 (73%)

Query: 349 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
            ++  +GGSYLC++ YC R+R AAR+ N PD S+GNLGFR   D+
Sbjct: 92  ANRAMRGGSYLCHDSYCNRYRVAARTANMPDGSSGNLGFRTVTDI 136


>gi|221210763|ref|ZP_03583743.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
 gi|221169719|gb|EEE02186.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
          Length = 347

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 84/218 (38%), Positives = 109/218 (50%), Gaps = 19/218 (8%)

Query: 140 TEAEWEYGC-------RGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
           TEAE + G        R  L  R   W       SW HP G  S ++ R N PV  V+  
Sbjct: 125 TEAEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVDGRDNLPVTLVTQR 184

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA+AY  W G  LPTEAEWEY  + G ++           RG+  AN WQG FP  +TA 
Sbjct: 185 DALAYARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPTANYWQGPFPVLDTAE 243

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
           DG+   APV  Y  N F LY+M+GN W+WT D +   H         +  P      T  
Sbjct: 244 DGHAGLAPVGCYAANGFRLYDMIGNAWQWTKDVYTGPHQSHTNGDTAAVAPAARRHDTPM 303

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           V KGGS+LC+  YC R+R ++R +   D +A ++GFR 
Sbjct: 304 VIKGGSFLCSRDYCVRYRASSRERQEADLAASHIGFRT 341



 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/147 (37%), Positives = 77/147 (52%), Gaps = 10/147 (6%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G TF  G+ +       E P+ +    +  F++DQ +V+  QF  FV ATGYVTE
Sbjct: 70  MVHLRGGTFVFGSTRGY---QDERPAGDGRTRVGGFWIDQTDVTVAQFAAFVQATGYVTE 126

Query: 73  AEKFGDTFVFEPLLSEEERAKI----SQVRHDMKRF-EGLDSTIEHRMHHPVVHISWNDA 127
           AE+ G   VF     +E  A+     S V+    R   G  S ++ R + PV  ++  DA
Sbjct: 127 AEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVDGRDNLPVTLVTQRDA 186

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLEN 154
           +AY  W G  LPTEAEWEY  + G ++
Sbjct: 187 LAYARWLGRDLPTEAEWEYAGKAGRDD 213


>gi|195103144|ref|XP_001998099.1| GH11729 [Drosophila grimshawi]
 gi|193891453|gb|EDV90319.1| GH11729 [Drosophila grimshawi]
          Length = 109

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 69/118 (58%), Positives = 83/118 (70%), Gaps = 12/118 (10%)

Query: 226 PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH 285
           P+ +H  N+WQGEFP  NT  DGY  T PV  +++N + LYNMVGNVWEWTAD W     
Sbjct: 2   PKDQHWLNIWQGEFPDGNTQQDGYQFTCPVNEFRQNNYDLYNMVGNVWEWTADLWQA--- 58

Query: 286 PAPSYNPKGPTTGT-DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                   G T+ +  +VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 59  --------GDTSESPARVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 108



 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/43 (79%), Positives = 39/43 (90%)

Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA + 
Sbjct: 67  RVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 109


>gi|444429732|ref|ZP_21224914.1| hypothetical protein GS4_03_00830 [Gordonia soli NBRC 108243]
 gi|443889393|dbj|GAC66635.1| hypothetical protein GS4_03_00830 [Gordonia soli NBRC 108243]
          Length = 287

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 105/323 (32%), Positives = 140/323 (43%), Gaps = 62/323 (19%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P R V ++ F++D   V+  +F  FV  TG+VT                       ++
Sbjct: 10  EQPVRVVEVEPFWIDHAPVTVGEFARFVHETGHVT----------------------TAE 47

Query: 97  VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
           +  D   F   D  +          +     V   T     L     W          R 
Sbjct: 48  IAPDPAEFPDADPDL----------LVAGSLVFTPTSGPVPLDDHRRWW---------RF 88

Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
                W  P G  S +E R +HPV HVS +DA+AY  W G RLP EAEWE+  RGG  + 
Sbjct: 89  VAGACWSRPSGPGSDVEGRADHPVTHVSHSDALAYAAWAGKRLPGEAEWEFAARGGCADA 148

Query: 217 LFPWGNNLTPRGEHR-------------ANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
               G+ LT   E               A V+ G+FP  +T   G  +TA + +   ++ 
Sbjct: 149 Q---GDELTADAEFTWDIDGIDDLDDVPATVFVGDFPWRHTGRRGPGTTA-IYTNPPDRR 204

Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP----TTGTDKVKKGGSYLCNEQYCYRHR 319
           GL +M+GNVWEWTAD     H P    +   P    T    +  KGGS+LC+ QYC R+R
Sbjct: 205 GLVDMIGNVWEWTADAHTRDHRPPAGGSCCAPARPATLARHRTIKGGSHLCSPQYCLRYR 264

Query: 320 CAARSQNTPDSSAGNLGFRCAAD 342
            AAR     DSS G+LGFRC AD
Sbjct: 265 PAARQGQDEDSSTGHLGFRCVAD 287


>gi|399065902|ref|ZP_10748107.1| hypothetical protein PMI02_04531 [Novosphingobium sp. AP12]
 gi|398029080|gb|EJL22576.1| hypothetical protein PMI02_04531 [Novosphingobium sp. AP12]
          Length = 247

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 106/204 (51%), Gaps = 28/204 (13%)

Query: 162 WLHPEGIDSTIEHR--MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
           W HP G    I+     +HPVV +S++DA AY  W    LPTEAE+E+  RGGLE   + 
Sbjct: 33  WRHPLGARDDIDSLGLWDHPVVQISYSDAEAYARWVNKDLPTEAEFEFAARGGLEGADYA 92

Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
           WG+ L P     AN  QG FP  N   DG+  T+PV +Y  N +GL++++GN WEWT DW
Sbjct: 93  WGDELAPGDAMMANYRQGLFPFANQMLDGWERTSPVGTYPANGYGLFDIIGNTWEWTRDW 152

Query: 280 WNVHHHPAP-----------------------SYNPKGPTTGTD-KVKKGGSYLCNEQYC 315
           W+    PA                        S++P  P      KV K GS+LC   YC
Sbjct: 153 WS--EKPAADRGGRKSRSCCSISNPRGGKLKESFDPAQPGVEIGRKVLKVGSHLCAANYC 210

Query: 316 YRHRCAARSQNTPDSSAGNLGFRC 339
            R+R AAR     D+S  ++GFRC
Sbjct: 211 QRYRPAARHPEMIDTSTTHIGFRC 234


>gi|302522996|ref|ZP_07275338.1| sulfatase modifying factor 1 [Streptomyces sp. SPB78]
 gi|302431891|gb|EFL03707.1| sulfatase modifying factor 1 [Streptomyces sp. SPB78]
          Length = 109

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 65/108 (60%), Positives = 77/108 (71%)

Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKG 294
           WQG FPT NTA DGY  TAP  ++  N  GL+N+ GNVWEW AD W   H   P+ +P+G
Sbjct: 1   WQGRFPTTNTAEDGYRGTAPAHAFPPNGHGLHNVAGNVWEWCADRWGTDHSARPASDPRG 60

Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           P  GT +V +GGSYLC+  YC R+R AAR+ NTPDSSAGNLGFRCA D
Sbjct: 61  PRRGTSRVMRGGSYLCHHSYCNRYRVAARTSNTPDSSAGNLGFRCAWD 108


>gi|421865904|ref|ZP_16297578.1| Sulfatase modifying factor 1 precursor
           (C-alpha-formyglycine-generating enzyme 1) [Burkholderia
           cenocepacia H111]
 gi|358074045|emb|CCE48456.1| Sulfatase modifying factor 1 precursor
           (C-alpha-formyglycine-generating enzyme 1) [Burkholderia
           cenocepacia H111]
          Length = 349

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 142/336 (42%), Gaps = 77/336 (22%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG        +  F++DQ +V+  QF  FV ATGY 
Sbjct: 72  MVHLRGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVQATGY- 125

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                          ++E E+   + V H           +  R       ++W      
Sbjct: 126 ---------------VTEAEQQGGAAVFH-----------VPTRDEMNARDLAW------ 153

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W          W  G             SW HP G  S +    N PV  V+  DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSDVTGLGNLPVTLVTQRDALA 190

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
           Y  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  ++A D
Sbjct: 191 YAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDSAED 246

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
           G+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  V
Sbjct: 247 GHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|421479554|ref|ZP_15927240.1| formylglycine-generating sulfatase enzyme [Burkholderia multivorans
           CF2]
 gi|400222768|gb|EJO53124.1| formylglycine-generating sulfatase enzyme [Burkholderia multivorans
           CF2]
          Length = 347

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/218 (38%), Positives = 109/218 (50%), Gaps = 19/218 (8%)

Query: 140 TEAEWEYGC-------RGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
           TEAE + G        R  L  R   W       SW HP G  S ++ R N PV  V+  
Sbjct: 125 TEAEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVDGRGNLPVTLVTQR 184

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           DA+AY  W G  LPTEAEWEY  + G ++           RG+  AN WQG FP  +TA 
Sbjct: 185 DALAYARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPAANYWQGPFPVLDTAE 243

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
           DG+   APV  Y  N F LY+M+GN W+WT D +   H         +  P      T  
Sbjct: 244 DGHAGLAPVGCYAANGFRLYDMIGNAWQWTKDVYTGPHQSHTNGDTAAVAPAERRHDTPM 303

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           V KGGS+LC+  YC R+R ++R +   D +A ++GFR 
Sbjct: 304 VIKGGSFLCSRDYCVRYRASSRERQEADLAASHIGFRT 341



 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 54/147 (36%), Positives = 77/147 (52%), Gaps = 10/147 (6%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV + G TF  G+ +       E P+ +    +  F++DQ +V+  QF  FV ATGYVTE
Sbjct: 70  MVHVRGGTFVFGSTRGY---QDERPAGDGRTRVGGFWIDQTDVTVAQFAAFVQATGYVTE 126

Query: 73  AEKFGDTFVFEPLLSEEERAKI----SQVRHDMKRF-EGLDSTIEHRMHHPVVHISWNDA 127
           AE+ G   VF     +E  A+     S V+    R   G  S ++ R + PV  ++  DA
Sbjct: 127 AEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVDGRGNLPVTLVTQRDA 186

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLEN 154
           +AY  W G  LPTEAEWEY  + G ++
Sbjct: 187 LAYARWLGRDLPTEAEWEYAGKAGRDD 213


>gi|206562431|ref|YP_002233194.1| hypothetical protein BCAM0570 [Burkholderia cenocepacia J2315]
 gi|444359023|ref|ZP_21160361.1| formylglycine-generating sulfatase enzyme [Burkholderia cenocepacia
           BC7]
 gi|444369974|ref|ZP_21169673.1| formylglycine-generating sulfatase enzyme [Burkholderia cenocepacia
           K56-2Valvano]
 gi|198038471|emb|CAR54429.1| putative exported protein [Burkholderia cenocepacia J2315]
 gi|443598221|gb|ELT66596.1| formylglycine-generating sulfatase enzyme [Burkholderia cenocepacia
           K56-2Valvano]
 gi|443602803|gb|ELT70857.1| formylglycine-generating sulfatase enzyme [Burkholderia cenocepacia
           BC7]
          Length = 349

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 142/336 (42%), Gaps = 77/336 (22%)

Query: 15  MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           MV L G  F  G+ +       + DG        +  F++DQ +V+  QF  FV ATGY 
Sbjct: 72  MVHLRGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVQATGY- 125

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                          ++E E+   + V H           +  R       ++W      
Sbjct: 126 ---------------VTEAEQQGGAAVFH-----------VPTRDEMNARDLAW------ 153

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
             W          W  G             SW HP G  S +    N PV  V+  DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSDVTGLGNLPVTLVTQRDALA 190

Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
           Y  W G  LPTEAEWEY  + G ++       +  PR   G+  AN WQG FP  ++A D
Sbjct: 191 YAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDANGKPAANYWQGAFPVLDSAED 246

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
           G+   APV  Y  N F LY+M+GN WEWT D +   H         +  P      T  V
Sbjct: 247 GHAGLAPVGCYAANGFHLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            KGGS+LC+  YC R+R ++R Q   D  A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342


>gi|430804889|ref|ZP_19432004.1| Sulfatase-modifying factor 1 [Cupriavidus sp. HMR-1]
 gi|429503016|gb|ELA01319.1| Sulfatase-modifying factor 1 [Cupriavidus sp. HMR-1]
          Length = 352

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 108/355 (30%), Positives = 141/355 (39%), Gaps = 88/355 (24%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
            R   M  + G  F  GT    L    E P+    + +F++D+ EV+  QF  FV ATGY
Sbjct: 66  SRTAGMARVTGGEFVPGTT---LGYPDERPAGKARVGSFWIDRTEVTVAQFAAFVQATGY 122

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT+A                ER   + V H                              
Sbjct: 123 VTDA----------------ERQGAAVVFHK----------------------------- 137

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPW------GSWLHPEGIDSTIEH-----RMNH 178
                    PT+AE        L  R + W       +W HPEG D+   H     R N 
Sbjct: 138 ---------PTDAE--------LGQRPYAWWTMVTGANWRHPEGPDAANPHGYDHRRDNQ 180

Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
           PV  V+  DA AY  W G  LPTE EWE+  + G  +         T  G   AN WQG 
Sbjct: 181 PVTLVTQADARAYANWLGHDLPTEDEWEFAAKAGRSDAGLETAPQ-TADGTPTANYWQGV 239

Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APS 289
           FP  NT+ DG+   APV  Y  N  GL++M+ N WEWT D +               A +
Sbjct: 240 FPVLNTSRDGFAGLAPVGCYTANALGLFDMIANAWEWTGDAYTGPRQSHANGDTAVVAAA 299

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
             P+ P   +  V KGGS+LC   +C R+R +AR     D    ++GFR     G
Sbjct: 300 SRPRKPAAAS--VIKGGSFLCAPDFCVRYRASARESAESDLPTSHIGFRTVLRDG 352


>gi|436838039|ref|YP_007323255.1| hypothetical protein FAES_4663 [Fibrella aestuarina BUZ 2]
 gi|384069452|emb|CCH02662.1| hypothetical protein FAES_4663 [Fibrella aestuarina BUZ 2]
          Length = 324

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 71/174 (40%), Positives = 101/174 (58%), Gaps = 16/174 (9%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR-LFPWGNNLTPRGEHRANVW 235
           N PV  VSWNDA AY  W G RLP E EWE+  R    ++ L+P+GNNLT  G+  AN W
Sbjct: 158 NMPVTQVSWNDAQAYANWAGKRLPHEIEWEHAARNATNSQTLYPFGNNLTQDGKWLANTW 217

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT-------ADWWNVHHHPAP 288
            G+FP ++  +DG+   APV ++  +  GL +M GNVWEW         DW N       
Sbjct: 218 NGKFPDHDAVSDGFHRAAPVGTFGPSPIGLADMSGNVWEWCENPKVAYTDWLN------- 270

Query: 289 SYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
              P   T  T++V++GGS+LC   +C+ +R + RS +T +++  ++GFRC  D
Sbjct: 271 -QTPPRITEATERVQRGGSFLCEPGWCHGYRVSGRSGSTAETALMHVGFRCVKD 323



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 18/48 (37%), Positives = 34/48 (70%)

Query: 346 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           T  T++V++GGS+LC   +C+ +R + RS +T +++  ++GFRC  D+
Sbjct: 277 TEATERVQRGGSFLCEPGWCHGYRVSGRSGSTAETALMHVGFRCVKDL 324


>gi|294010120|ref|YP_003543580.1| hypothetical protein SJA_C1-01340 [Sphingobium japonicum UT26S]
 gi|292673450|dbj|BAI94968.1| conserved hypothetical protein [Sphingobium japonicum UT26S]
          Length = 293

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 109/326 (33%), Positives = 141/326 (43%), Gaps = 79/326 (24%)

Query: 16  VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
           VL+P  T  +G +      DG  P + V + AF++D+HEV+N QF  FV+ATGY T AE+
Sbjct: 40  VLVPAGTVLLGEDG-----DGR-PGQAVAVPAFWIDRHEVTNRQFAAFVAATGYRTAAEQ 93

Query: 76  FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRG 135
            G + VF                                   P   +  N+A  +     
Sbjct: 94  EGGSAVFV---------------------------------QPTGPVDLNNAGNW----- 115

Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
                   W Y  R            W HP+G  S I  R + PVVHV++ DA AY  W 
Sbjct: 116 --------WRYRQRA----------DWRHPDGPGSDIAGREDWPVVHVNYQDAAAYARWA 157

Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
           GA LP EA+WE   RG       P        G   AN WQG FP  +T  DG++  APV
Sbjct: 158 GAALPDEAQWERAARGDQSGPRDPRSWAYDGEGHPTANSWQGVFPVRDTGQDGHVGLAPV 217

Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYC 315
             +  N FGL++MVGNVWEWT++                   G + + KGGSYLC   YC
Sbjct: 218 GCFDGNGFGLHDMVGNVWEWTSE-----------------RRGGNGLIKGGSYLCAMNYC 260

Query: 316 YRHRCAARSQNTPDSSAGNLGFRCAA 341
              R AA      D    ++GFR  A
Sbjct: 261 ANFRPAAFQAQELDLGTSHIGFRTIA 286


>gi|383645574|ref|ZP_09957980.1| hypothetical protein SeloA3_16162 [Sphingomonas elodea ATCC 31461]
          Length = 342

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 80/205 (39%), Positives = 104/205 (50%), Gaps = 17/205 (8%)

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
           R  P  +W HPEG  S+I      PVV V++ DA AY  W G  LP+E ++E   R GL 
Sbjct: 126 RFVPGANWRHPEGPGSSIAGHEEEPVVAVTFVDAQAYARWAGRALPSEEQFEAAARVGLT 185

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
           +       +  P  + +AN WQG FP  N   DGY   AP   +K N  G Y+++GNVWE
Sbjct: 186 D------PDAEPNPD-QANTWQGSFPNMNAKRDGYTGRAPAGCFKPNGAGAYDLIGNVWE 238

Query: 275 WTADWWNVHHHP---------APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQ 325
           WT  W+   H P          PS++P+ P     +V KGGSYLC   YC R+R AAR  
Sbjct: 239 WTESWYLPGHAPFAVGDGSPGNPSFDPRQPEAKA-RVIKGGSYLCANNYCARYRPAARHA 297

Query: 326 NTPDSSAGNLGFRCAADKGPTTGTD 350
                +A +LGFR  +       TD
Sbjct: 298 QEETLAASHLGFRTVSRSAFDLATD 322



 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 55/152 (36%), Positives = 74/152 (48%), Gaps = 23/152 (15%)

Query: 16  VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
           V +P   F MG+++       E P     + AF +D HEV+N QF  FV+ATGY TEAEK
Sbjct: 45  VRIPASRFVMGSSR---YYPEEGPPTETEVPAFDIDAHEVTNRQFATFVAATGYRTEAEK 101

Query: 76  -FGDTFVFEPLLSEEERAKISQVRHDMKRF------------EGLDSTIEHRMHHPVVHI 122
             G  FVF P        K +    D +R+            EG  S+I      PVV +
Sbjct: 102 PGGGAFVFVP-------PKTAIDEPDPRRWWRFVPGANWRHPEGPGSSIAGHEEEPVVAV 154

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
           ++ DA AY  W G  LP+E ++E   R GL +
Sbjct: 155 TFVDAQAYARWAGRALPSEEQFEAAARVGLTD 186


>gi|87308793|ref|ZP_01090932.1| hypothetical protein DSM3645_11157 [Blastopirellula marina DSM
           3645]
 gi|87288504|gb|EAQ80399.1| hypothetical protein DSM3645_11157 [Blastopirellula marina DSM
           3645]
          Length = 331

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 80/199 (40%), Positives = 104/199 (52%), Gaps = 7/199 (3%)

Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
           W +  + G E  +     W HP G  STI  R ++PV+ VSW DA AY  W G RLPTEA
Sbjct: 133 WVFDAKTG-EASIVDGAQWRHPTGPGSTIIGREDYPVLQVSWYDARAYAEWAGKRLPTEA 191

Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
           EWE+  RGG  +  +P  +     G    NVWQG FP  + A+D Y   APV  +  N F
Sbjct: 192 EWEFAARGGQTSGEYPAQDAAGNIG----NVWQGRFPFADAASDDYAEVAPVGKFAANGF 247

Query: 264 GLYNMVGNVWEWTAD-WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCA- 321
           G Y+M GN  EW +D +    +  A   +P GP  G ++V +GGS+L +EQ         
Sbjct: 248 GFYDMAGNAAEWCSDHYAEDSYQLADRVDPVGPVEGEERVIRGGSWLSSEQDGVSEAMVW 307

Query: 322 ARSQNTPDSSAGNLGFRCA 340
            RS + P S     GFRCA
Sbjct: 308 YRSHSGPRSPNNFTGFRCA 326



 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 58/144 (40%), Positives = 79/144 (54%), Gaps = 7/144 (4%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M+ L G  F +G  KP    D + P   V +    +D+HE +N QF  F+ AT Y+T+AE
Sbjct: 71  MISLAGGLFLLG--KPDGPLDAQ-PVVKVNISPLRMDRHETTNAQFAAFIEATQYLTDAE 127

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           + G ++VF+    E      +Q RH      G  STI  R  +PV+ +SW DA AY  W 
Sbjct: 128 RRGRSWVFDAKTGEASIVDGAQWRHPT----GPGSTIIGREDYPVLQVSWYDARAYAEWA 183

Query: 135 GARLPTEAEWEYGCRGGLENRLFP 158
           G RLPTEAEWE+  RGG  +  +P
Sbjct: 184 GKRLPTEAEWEFAARGGQTSGEYP 207


>gi|426356328|ref|XP_004065371.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 2
           [Gorilla gorilla gorilla]
          Length = 244

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 80/193 (41%), Positives = 105/193 (54%), Gaps = 24/193 (12%)

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR---LF 218
           W    G  S I  R+ HPV+HVSWNDA AYC WRG RLPTE EWE+  RGGL+       
Sbjct: 68  WSLHAGPGSGIRERLEHPVLHVSWNDARAYCVWRGKRLPTEEEWEFAARGGLKGMRTCFG 127

Query: 219 PWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTA 277
             G + T    H     QG+FP  + A DG+   +PV ++  +N +GLY+++GNVWEWTA
Sbjct: 128 TGGKSFTLPFSH-----QGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA 182

Query: 278 DWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNL 335
                        +P        +V +G S++   +    +R R   R  NTPDS++ NL
Sbjct: 183 -------------SPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNL 229

Query: 336 GFRCAADKGPTTG 348
           GFRCAAD G   G
Sbjct: 230 GFRCAADAGRPPG 242



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 31/49 (63%), Positives = 36/49 (73%)

Query: 106 GLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
           G  S I  R+ HPV+H+SWNDA AYC WRG RLPTE EWE+  RGGL+ 
Sbjct: 73  GPGSGIRERLEHPVLHVSWNDARAYCVWRGKRLPTEEEWEFAARGGLKG 121



 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 193 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 237


>gi|427401679|ref|ZP_18892751.1| hypothetical protein HMPREF9710_02347 [Massilia timonae CCUG 45783]
 gi|425719391|gb|EKU82324.1| hypothetical protein HMPREF9710_02347 [Massilia timonae CCUG 45783]
          Length = 334

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 104/329 (31%), Positives = 139/329 (42%), Gaps = 68/329 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNV--TLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV L G  F  G+   +     E P+      +D F++DQ EV+N QF+ FV ATGYV  
Sbjct: 63  MVRLAGGEFVFGS---LAGYPDERPASTARHKVDGFWIDQTEVTNAQFERFVQATGYV-- 117

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
                         +E ER   + V      F+  D+    R  +               
Sbjct: 118 --------------TEAEREGAAVV------FDTPDAAELRRRPY--------------- 142

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
                    A W Y  RG          SW  P    +      N PV  V+  DA+AY 
Sbjct: 143 ---------AWWRY-VRGA---------SWKDPHSAQAPAP---NAPVTLVTQADALAYA 180

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
            W G  LPTEAEWEY  + G ++     G  +   G   AN WQG FP  ++  DG+   
Sbjct: 181 RWLGRDLPTEAEWEYAAKAGRDDAGLD-GAPVDAHGRPTANYWQGPFPLADSGQDGHAGL 239

Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---APSYNPKGPTTGTDKVKKGGSYL 309
           APV  Y  N FGL++M+GN WEWT D ++    P     +            V KGGS+L
Sbjct: 240 APVGCYAANAFGLHDMIGNAWEWTKDGYSGARQPHANGDTAVAAARLPAAPVVIKGGSFL 299

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           C   YC R+R +AR     +  A ++GFR
Sbjct: 300 CAPTYCLRYRASARESQEANLGASHIGFR 328



 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 17/37 (45%), Positives = 23/37 (62%)

Query: 352 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 388
           V KGGS+LC   YC R+R +AR     +  A ++GFR
Sbjct: 292 VIKGGSFLCAPTYCLRYRASARESQEANLGASHIGFR 328


>gi|424740677|ref|ZP_18169056.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-141]
 gi|422945468|gb|EKU40420.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-141]
          Length = 334

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 108/348 (31%), Positives = 145/348 (41%), Gaps = 81/348 (23%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFP--------------SRNVTLDAFYLDQHE 54
           +E+ K+   LP    +  T   +LI DG F                +   +  F++DQ E
Sbjct: 44  IEKCKNYTGLPQGWLKKPTAGMVLIADGHFNFGSEKAYPDELNFGKKQREVKGFWIDQTE 103

Query: 55  VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
           V+  QF  FV ATGY+T+AEK     VF P                       DS     
Sbjct: 104 VTVAQFASFVKATGYITDAEKQKQAAVFSP-----------------------DS----- 135

Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH 174
            HHP     W    +  TW+            G  G L N                    
Sbjct: 136 -HHPQ---QWWQLKSGYTWKTPN---------GSTGALANP------------------- 163

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RAN 233
             N PV +VS NDA  Y  W G  LPTE EWEY  +   +    P     T   +H +AN
Sbjct: 164 --NEPVRYVSKNDAEHYAVWLGHDLPTELEWEYAAKANSKTDT-PLHQAPTDERQHPQAN 220

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP 292
            WQGEFP  N   D +   APV  +  N F L++ +GNVWEWT+  +   H     +Y+ 
Sbjct: 221 YWQGEFPFQNLNQDHFKGVAPVGCFAPNNFKLFDTIGNVWEWTSSPYQGAHDQHMGNYSD 280

Query: 293 --KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
             +   T T  V KGGS+LC + YC R+R ++R     D +A ++GFR
Sbjct: 281 LRQQQITSTQYVIKGGSFLCAQNYCSRYRNSSRYPQDFDLAATHVGFR 328


>gi|421219252|ref|ZP_15676118.1| hypothetical protein AMCSP13_002919 [Streptococcus pneumoniae
           2070335]
 gi|395578724|gb|EJG39236.1| hypothetical protein AMCSP13_002919 [Streptococcus pneumoniae
           2070335]
          Length = 156

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 70/160 (43%), Positives = 94/160 (58%), Gaps = 10/160 (6%)

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
           +A+C W G  LPTEA+WEY  RGG ++  + WG+ L   G + AN WQG FP  NTA DG
Sbjct: 1   MAFCNWSGMSLPTEAQWEYAARGGRQSE-YLWGDTLLEGGYYHANTWQGRFPYENTALDG 59

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKV 302
           ++ TAPV  +  N FGLY M+GNVWEW     N  +    S+N      PK      +  
Sbjct: 60  FIGTAPVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYA 116

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            +GGS+LC+  YC R+R AAR+     S++ +LGFRC  +
Sbjct: 117 IRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 156


>gi|293608773|ref|ZP_06691076.1| predicted protein [Acinetobacter sp. SH024]
 gi|292829346|gb|EFF87708.1| predicted protein [Acinetobacter sp. SH024]
          Length = 327

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 75/184 (40%), Positives = 96/184 (52%), Gaps = 7/184 (3%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
            +W HP G+++  +   + PV  V+WNDA AY  W G  LPTE EWEY  +G  +N    
Sbjct: 145 ANWKHPWGLNNPKQPAPHEPVRMVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI- 203

Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
                T  G   AN WQGEFP  N   DG+   APV  + +N FGLY+++GNVWEWT   
Sbjct: 204 ---GPTHEGHIVANFWQGEFPYKNIQEDGFKDIAPVGCFSKNPFGLYDLIGNVWEWTQTP 260

Query: 280 WN---VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +     HH   PS   +          KGGSYLC   YC R+R AAR     D +  ++G
Sbjct: 261 YTGSRDHHMGDPSKYRQSHEQILQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVG 320

Query: 337 FRCA 340
           FR  
Sbjct: 321 FRTV 324



 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 47/151 (31%), Positives = 65/151 (43%), Gaps = 22/151 (14%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P  +   MV LP     +GT +      G    R + L A          T    F+   
Sbjct: 68  PKNKEAGMVKLPQGKIVLGTTQ------GYEDERPLNLQA----------TSVPAFLIDA 111

Query: 68  GYVTEAEKFGDTFVF----EPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
             VT+AEK G   +F     P+  EE +    +   + K   GL++  +   H PV  ++
Sbjct: 112 TEVTDAEKQGGAAMFVQPDHPV--EELQWWKFEKGANWKHPWGLNNPKQPAPHEPVRMVT 169

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
           WNDA AY  W G  LPTE EWEY  +G  +N
Sbjct: 170 WNDAYAYANWLGHDLPTELEWEYAAKGFQQN 200


>gi|119356728|ref|YP_911372.1| hypothetical protein Cpha266_0900 [Chlorobium phaeobacteroides DSM
           266]
 gi|119354077|gb|ABL64948.1| protein of unknown function DUF323 [Chlorobium phaeobacteroides DSM
           266]
          Length = 826

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 110/339 (32%), Positives = 149/339 (43%), Gaps = 87/339 (25%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           + V +P   F MG+    + + G+     V +  FYL ++ V+   F+EF+ A+GY T+A
Sbjct: 560 NFVRIPEGEFTMGSPASEIDRSGDETQHQVKVSDFYLCKYAVTLAGFKEFIEASGYRTDA 619

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           EK   ++V++   S E+RA I+                                      
Sbjct: 620 EKGDGSYVWDGK-SWEKRAGIN-------------------------------------- 640

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W +G  GG+                    +   NHPVVHVSWNDAVAYC 
Sbjct: 641 ----------WRFGISGGVRP------------------QSEENHPVVHVSWNDAVAYCK 672

Query: 194 W------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           W      +  RLPTEAEWEY CR G     F  G+NLT  GE   N   G +P N     
Sbjct: 673 WISEKTGQQFRLPTEAEWEYACRAGTTTP-FHTGDNLTT-GEANYN---GNYPYNKNPKG 727

Query: 248 GYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDK 301
            Y  +T PV S   N +GLY+M GNVWEW  D +   ++          NP GP TG+ +
Sbjct: 728 EYRENTVPVDSLAPNAWGLYHMHGNVWEWCGDRYGDKYYDECKAKGVVENPVGPETGSIR 787

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           V +GGS+     YC   R A R+ +TP     + GFR A
Sbjct: 788 VLRGGSWGNYAGYC---RSANRNSDTPGDRGSDAGFRLA 823


>gi|427423321|ref|ZP_18913479.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii WC-136]
 gi|425699874|gb|EKU69473.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii WC-136]
          Length = 302

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 75/182 (41%), Positives = 96/182 (52%), Gaps = 7/182 (3%)

Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
            +W HP G+++  +   + PV  V+WNDA AY  W G  LPTE EWEY  +G  +N    
Sbjct: 120 ANWKHPWGLNNPKQPAPHEPVRMVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI- 178

Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
                T  G   AN WQGEFP  N   DG+   APV  + +N FGLY+++GNVWEWT   
Sbjct: 179 ---GPTHEGHIVANFWQGEFPYKNIQEDGFKDIAPVGCFSKNPFGLYDLIGNVWEWTQTP 235

Query: 280 WN---VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +     HH   PS   +          KGGSYLC   YC R+R AAR     D +  ++G
Sbjct: 236 YTGSRDHHMGDPSKYRQSHEQILQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVG 295

Query: 337 FR 338
           FR
Sbjct: 296 FR 297



 Score = 55.1 bits (131), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 47/151 (31%), Positives = 65/151 (43%), Gaps = 22/151 (14%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P  +   MV LP     +GT +      G    R + L A          T    F+   
Sbjct: 43  PKNKEAGMVKLPQGKIVLGTTQ------GYEDERPLNLQA----------TSVPAFLIDA 86

Query: 68  GYVTEAEKFGDTFVF----EPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
             VT+AEK G   +F     P+  EE +    +   + K   GL++  +   H PV  ++
Sbjct: 87  TEVTDAEKQGGAAMFVQPDHPV--EELQWWKFEKGANWKHPWGLNNPKQPAPHEPVRMVT 144

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
           WNDA AY  W G  LPTE EWEY  +G  +N
Sbjct: 145 WNDAYAYANWLGHDLPTELEWEYAAKGFQQN 175


>gi|223994935|ref|XP_002287151.1| hypothetical protein THAPSDRAFT_261211 [Thalassiosira pseudonana
           CCMP1335]
 gi|220976267|gb|EED94594.1| hypothetical protein THAPSDRAFT_261211 [Thalassiosira pseudonana
           CCMP1335]
          Length = 250

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 100/312 (32%), Positives = 141/312 (45%), Gaps = 77/312 (24%)

Query: 36  GEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKIS 95
           G  P R V ++ F++D   V+N ++ +FV AT Y TEAEKFG +FV    L  +E  + +
Sbjct: 1   GATPPRVVYIEPFFIDIAPVTNKEYGKFVRATYYETEAEKFGWSFVLSSFL--QELGESA 58

Query: 96  QVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 155
           +V  + + +  +D                                               
Sbjct: 59  EVDPEAEDWVAVDRAY-------------------------------------------- 74

Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGL-- 213
                 W +PEG  ++ ++R +HPVVHVS  DA  YC W G RLP E EWE   R G   
Sbjct: 75  ------WRNPEGPGTSYKYREHHPVVHVSHRDAAEYCKWIGKRLPGEREWEAAARAGHYG 128

Query: 214 -ENR-LFPWGNNLTPR-GEHRANVW-QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMV 269
             NR LF WG   T    +  AN+W  G FP  N A DG+ +T+PV ++K N  GLY+M 
Sbjct: 129 PNNRTLFIWGEEQTVDVAKEYANLWGAGAFPWENLAEDGWRATSPVKTFKPNALGLYDMT 188

Query: 270 GNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRC--AARSQNT 327
           GNVWEW         H A             ++ +GGSY+ +    + H     AR+   
Sbjct: 189 GNVWEWMRG----GKHKA-------------RIVRGGSYVDSLDGNFNHAATLGARATLH 231

Query: 328 PDSSAGNLGFRC 339
             ++ GN+GFRC
Sbjct: 232 GTTTTGNVGFRC 243


>gi|421807665|ref|ZP_16243525.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC035]
 gi|410416646|gb|EKP68418.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC035]
          Length = 335

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 104/328 (31%), Positives = 138/328 (42%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+T+AE
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K     VF P                               HHP     W     Y TWR
Sbjct: 125 KQKQAAVFSP-----------------------------APHHP--QQWWQLKTGY-TWR 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|288919751|ref|ZP_06414077.1| protein of unknown function DUF323 [Frankia sp. EUN1f]
 gi|288348851|gb|EFC83102.1| protein of unknown function DUF323 [Frankia sp. EUN1f]
          Length = 281

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 101/283 (35%), Positives = 132/283 (46%), Gaps = 48/283 (16%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           K MV +PG TF MG+++       E P   V +D F++D+H V+  +F+ FV ATG+VT 
Sbjct: 27  KGMVRIPGGTFLMGSDE---FYAEERPVHPVAVDGFWMDEHPVTVARFRAFVKATGHVTV 83

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+                    QV  D   + G D      +   +V    +  V    
Sbjct: 84  AER--------------------QV--DPADYPGADPA--GLVPGALVFRMTSGPVDLDD 119

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR         W Y           P   W  P G  ST+  R  HPV  V++ DA AY 
Sbjct: 120 WR-------RWWSYE----------PGACWHRPLGTGSTVHGRERHPVTQVAFADAAAYA 162

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP---TNNTAADGY 249
            W G  LPTEAEWE+  RGGL+  +F WG+  TP+G   AN W G FP      +A    
Sbjct: 163 AWAGKDLPTEAEWEFAARGGLDGAVFAWGDEFTPKGRRMANTWHGAFPWQYVPASAGQPM 222

Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
             T PV S+  N +GLY+M GN WEWTAD +    HP P+  P
Sbjct: 223 PGTTPVGSFPPNGYGLYDMAGNTWEWTADHYT-DRHPDPAPGP 264


>gi|289760839|ref|ZP_06520217.1| hypothetical protein TBIG_03921 [Mycobacterium tuberculosis GM
           1503]
 gi|289708345|gb|EFD72361.1| hypothetical protein TBIG_03921 [Mycobacterium tuberculosis GM
           1503]
          Length = 381

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 93/252 (36%), Positives = 119/252 (47%), Gaps = 45/252 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+ +       E P   VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +PL              D   + G+D+         +V       V    W
Sbjct: 61  E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G DS I  R  HPVV V++ DAVAY  
Sbjct: 97  R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
           W G RLPTEAEWEY  RGG     + WG+   P G   AN WQG FP  N  A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198

Query: 254 PVMSYKENKFGL 265
           PV  +   +  L
Sbjct: 199 PVGRFPGQRVWL 210


>gi|159898226|ref|YP_001544473.1| hypothetical protein Haur_1702 [Herpetosiphon aurantiacus DSM 785]
 gi|159891265|gb|ABX04345.1| protein of unknown function DUF323 [Herpetosiphon aurantiacus DSM
           785]
          Length = 510

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 102/319 (31%), Positives = 152/319 (47%), Gaps = 84/319 (26%)

Query: 35  DGEF----PSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLS--E 88
           D EF     +R V L+++ +D++EV+N QF +FV  T Y+T AE+ G++ + E + +   
Sbjct: 263 DDEFDQNGQARQVFLNSYRIDKYEVTNAQFAQFVEQTQYITYAERQGESMILEVIETAGS 322

Query: 89  EERAKISQVRHDM-KRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYG 147
           +E    S ++    K+  G +S+I+ +  +PV+HI + DAVAYCT +  RLPTEAEWE  
Sbjct: 323 KETLNFSAIKGAFWKQPYGPNSSIDDKADYPVIHIHYEDAVAYCTAKHKRLPTEAEWEKA 382

Query: 148 CRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEY 207
            R G+E   FPWG+                                          EW  
Sbjct: 383 AR-GVEGWRFPWGN------------------------------------------EW-- 397

Query: 208 GCRGGLENRLFPWGNN-LTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLY 266
             + GL N   P  ++ L  RG        G+ P          S +P        +G++
Sbjct: 398 --KSGLSNHAIPLRSHILQVRGLQAI----GQSPQ---------SISP--------YGVH 434

Query: 267 NMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 323
           +++GNV EWTADW+      ++PA S NP+GP  G   VK+GGS+     Y +     + 
Sbjct: 435 DLLGNVSEWTADWYQPSYYQNNPA-SQNPQGPELGNSHVKRGGSWATPPGYLHN----SW 489

Query: 324 SQNTPDSSAGNLGFRCAAD 342
              TPD +   LGFRCAAD
Sbjct: 490 RIGTPDQTTDRLGFRCAAD 508



 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 79/268 (29%), Positives = 114/268 (42%), Gaps = 60/268 (22%)

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
           + Y   +G  +  E     G +  L         W  P G +S+I+ + ++PV+H+ + D
Sbjct: 302 ITYAERQGESMILEVIETAGSKETLNFSAIKGAFWKQPYGPNSSIDDKADYPVIHIHYED 361

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           AVAYCT +  RLPTEAEWE   RG +E   FPWGN      E ++ +     P  +    
Sbjct: 362 AVAYCTAKHKRLPTEAEWEKAARG-VEGWRFPWGN------EWKSGLSNHAIPLRSHILQ 414

Query: 248 GYLSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
                A   S +  + +G+++++GNV EWTADW+       PSY    P           
Sbjct: 415 VRGLQAIGQSPQSISPYGVHDLLGNVSEWTADWYQ------PSYYQNNPA---------- 458

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCY 366
                            SQN                +GP  G   VK+GGS+     Y +
Sbjct: 459 -----------------SQNP---------------QGPELGNSHVKRGGSWATPPGYLH 486

Query: 367 RHRCAARSQNTPDSSAGNLGFRCAADVS 394
                +    TPD +   LGFRCAADV+
Sbjct: 487 N----SWRIGTPDQTTDRLGFRCAADVN 510


>gi|94314341|ref|YP_587550.1| Sulfatase-modifying factor 1 [Cupriavidus metallidurans CH34]
 gi|93358193|gb|ABF12281.1| Sulfatase-modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1); putative exported protein
           [Cupriavidus metallidurans CH34]
          Length = 352

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 106/355 (29%), Positives = 138/355 (38%), Gaps = 88/355 (24%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
            R   M  + G  F  GT    L    E P+    + +F++D+ EV+  QF  FV ATGY
Sbjct: 66  SRTAGMARVTGGEFVPGTT---LGYPDERPAGKTRVGSFWIDRTEVTVAQFAAFVQATGY 122

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT+A                ER   + V H                              
Sbjct: 123 VTDA----------------ERQGAAVVFHK----------------------------- 137

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPW------GSWLHPEGIDSTIEH-----RMNH 178
                    PT+AE        L  R + W       +W HPEG  +   H     R N 
Sbjct: 138 ---------PTDAE--------LGQRPYAWWTMVTGANWRHPEGPAAANSHGYDHRRDNQ 180

Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
           PV  V+  DA AY  W G  LPTE EWE+  + G  +         T  G   AN WQG 
Sbjct: 181 PVTLVTQADARAYANWLGHDLPTEDEWEFAAKAGRSDAGLETAPQ-TAEGTPTANYWQGV 239

Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APS 289
           FP  NT+ DG+   APV  Y  N  GL++M+ N WEWT D +               A +
Sbjct: 240 FPVLNTSRDGFAGLAPVGCYTANALGLFDMIANAWEWTGDAYTGPRQSHANGDTAVVAAA 299

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
              + P      V KGGS+LC   +C R+R +AR     D    ++GFR     G
Sbjct: 300 SRSRKPA--ATSVIKGGSFLCAPDFCVRYRASARESAESDLPTSHIGFRTVLRDG 352


>gi|406833741|ref|ZP_11093335.1| hypothetical protein SpalD1_18927 [Schlesneria paludicola DSM
           18645]
          Length = 289

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 111/353 (31%), Positives = 158/353 (44%), Gaps = 88/353 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIK------------DGEFPSRNVTL-DAFYLDQHEVSNTQFQ 61
           M+ +P   F MG  +  ++             DGE P   V +  AF + Q+EV+  QF 
Sbjct: 1   MIEIPAGEFLMGAEEERIVTLRKFPNSQPKLLDGELPRHKVRITKAFDMGQYEVTLAQFM 60

Query: 62  EFVSATGYVTEAEKFGD-TFVFEP---LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
            F  AT Y  EAE+ G  +F  +P   LL  +E    + +              ++   H
Sbjct: 61  TFRDATKYEIEAERDGQPSFGIDPEGELLKSQEFRPWNPI------------GWKNDTDH 108

Query: 118 PVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST 171
           PV+++SWNDAVA+C W      R  RLPTEAEWEY CR G E R F +G     E +D  
Sbjct: 109 PVIYVSWNDAVAFCEWLSQQEGRSYRLPTEAEWEYACRAGSETR-FSFGD----EPVDL- 162

Query: 172 IEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
                   +   + NDA      R A+                         L PR + R
Sbjct: 163 --------IRFANSNDAT-----RRAK-------------------------LFPRLDQR 184

Query: 232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSY 290
              + G F +    +D Y+ T+PV  +  N FGLY+M GNVWEW +D+++  ++  +P  
Sbjct: 185 K--FGGHFLSK---SDNYVWTSPVGKFAPNSFGLYDMHGNVWEWCSDFFDKDYYKDSPID 239

Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           +PKGP  G  +V +GGSY   E      R A R+     S   N GFR   ++
Sbjct: 240 DPKGPGNGESRVLRGGSY---ENQLMSLRSAIRAGAPASSRGSNTGFRVVRER 289


>gi|169796826|ref|YP_001714619.1| hypothetical protein ABAYE2816 [Acinetobacter baumannii AYE]
 gi|213156782|ref|YP_002318443.1| hypothetical protein AB57_1055 [Acinetobacter baumannii AB0057]
 gi|215484304|ref|YP_002326531.1| hypothetical protein ABBFA_002633 [Acinetobacter baumannii
           AB307-0294]
 gi|301346709|ref|ZP_07227450.1| hypothetical protein AbauAB0_10696 [Acinetobacter baumannii AB056]
 gi|301511822|ref|ZP_07237059.1| hypothetical protein AbauAB05_09590 [Acinetobacter baumannii AB058]
 gi|301594414|ref|ZP_07239422.1| hypothetical protein AbauAB059_01345 [Acinetobacter baumannii
           AB059]
 gi|332853700|ref|ZP_08434930.1| hypothetical protein HMPREF0021_02513 [Acinetobacter baumannii
           6013150]
 gi|332870879|ref|ZP_08439524.1| hypothetical protein HMPREF0020_03177 [Acinetobacter baumannii
           6013113]
 gi|403675978|ref|ZP_10938061.1| hypothetical protein ANCT1_15121 [Acinetobacter sp. NCTC 10304]
 gi|417574664|ref|ZP_12225518.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Canada BC-5]
 gi|421620540|ref|ZP_16061472.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC074]
 gi|421643641|ref|ZP_16084135.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-235]
 gi|421646374|ref|ZP_16086826.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-251]
 gi|421658785|ref|ZP_16099016.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-83]
 gi|421700071|ref|ZP_16139588.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-58]
 gi|421796085|ref|ZP_16232154.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-21]
 gi|421801852|ref|ZP_16237809.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Canada BC1]
 gi|169149753|emb|CAM87644.1| conserved hypothetical protein; putative exported protein
           [Acinetobacter baumannii AYE]
 gi|213055942|gb|ACJ40844.1| hypothetical protein AB57_1055 [Acinetobacter baumannii AB0057]
 gi|213988595|gb|ACJ58894.1| conserved hypothetical protein [Acinetobacter baumannii AB307-0294]
 gi|332728524|gb|EGJ59898.1| hypothetical protein HMPREF0021_02513 [Acinetobacter baumannii
           6013150]
 gi|332731980|gb|EGJ63258.1| hypothetical protein HMPREF0020_03177 [Acinetobacter baumannii
           6013113]
 gi|400210232|gb|EJO41202.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Canada BC-5]
 gi|404570453|gb|EKA75526.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-58]
 gi|408508324|gb|EKK10010.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-235]
 gi|408517761|gb|EKK19299.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-251]
 gi|408700227|gb|EKL45690.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC074]
 gi|408709481|gb|EKL54727.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-83]
 gi|410400281|gb|EKP52460.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-21]
 gi|410405109|gb|EKP57162.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Canada BC1]
          Length = 335

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 103/328 (31%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV  TGY+T+AE
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKETGYITDAE 124

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K     VF P                               HHP     W     Y TWR
Sbjct: 125 KQKQAAVFSP-----------------------------DPHHP--QQWWQLKTGY-TWR 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|365088279|ref|ZP_09327876.1| hypothetical protein KYG_03856 [Acidovorax sp. NO-1]
 gi|363417259|gb|EHL24344.1| hypothetical protein KYG_03856 [Acidovorax sp. NO-1]
          Length = 289

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 107/325 (32%), Positives = 146/325 (44%), Gaps = 91/325 (28%)

Query: 37  EFPSRNVTL-DAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG-DTFVFEPLLSEEERAKI 94
           E P+  VT+  AFYL QHEV+  QF+ FV A+GY  E+E  G   + + P          
Sbjct: 28  EGPAHRVTIGQAFYLGQHEVTVGQFRRFVQASGYRAESEADGTGGYGYNPRYD-----PA 82

Query: 95  SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
           + VR D   FEG D+                      +WR                    
Sbjct: 83  TTVRGDA--FEGRDTRY--------------------SWR-------------------- 100

Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYG 208
                         D       +HPVV+V+WNDA+A   W         RLPTEAEWEY 
Sbjct: 101 --------------DPGFAQGDDHPVVNVTWNDAMALADWLSRTEGHHYRLPTEAEWEYA 146

Query: 209 CRGGLENRLFPWGNNLTPRGE-HRANV-----------WQGEFPTNNTAADGYLSTAPVM 256
           CR G   R +P G++  P+G  H AN            WQ      N   DG+  TAPV 
Sbjct: 147 CRAGTRTR-YPHGDD--PQGLLHTANTFDQAAAPLWPRWQQHALAGN---DGHAFTAPVG 200

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYC 315
           SY  N FGL++M+GN WEW ADW   +++  +P  +P+GP  GT +V++GGS+     + 
Sbjct: 201 SYGANAFGLHDMLGNAWEWVADWHGDNYYAHSPRNDPRGPAHGTVRVRRGGSW---HTWA 257

Query: 316 YRHRCAARSQNTPDSSAGNLGFRCA 340
           +  RC  R+ N+P +    +G R A
Sbjct: 258 FYARCGYRNWNSPQTRYTLVGMRLA 282


>gi|194292611|ref|YP_002008518.1| sulfatase-modifying factor 1 (c-alpha-formyglycine-generating
           enzyme 1); exported protein [Cupriavidus taiwanensis LMG
           19424]
 gi|193226515|emb|CAQ72466.1| Sulfatase-modifying factor 1 precursor
           (C-alpha-formyglycine-generating enzyme 1); putative
           exported protein [Cupriavidus taiwanensis LMG 19424]
          Length = 346

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 142/359 (39%), Gaps = 97/359 (27%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           ER   MV + G  F  GT         E P+  V +  F++DQ EV+  QF  FV ATGY
Sbjct: 61  ERTAGMVRVAGGEFLPGTTHGY---PDERPAGKVRVRPFWIDQTEVTVAQFAAFVDATGY 117

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           +                                       T   R    VV         
Sbjct: 118 I---------------------------------------TDAERQGTAVVF-------- 130

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPW------GSWLHPEG--IDSTIEH-----RM 176
                  R P++AE        L  R + W      G W HP+G  + S  +      R 
Sbjct: 131 -------RQPSDAE--------LRARPYAWWSMVQGGDWRHPQGRAVGSDADRDGWQARA 175

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLF---PWGNNLTPRGEHRAN 233
           + PV  V+  DA+AY  W G  LP+E EWE+  + G  +      P   + TP     AN
Sbjct: 176 SQPVTLVTQADALAYARWLGRELPSEDEWEFAAKAGRSDAALEAAPQSADGTPT----AN 231

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK 293
            WQG FP  N++ DG+   APV  Y  N F LY+M+ N WEWTAD    +  P  S+   
Sbjct: 232 YWQGVFPVLNSSRDGFAGLAPVGCYPANGFALYDMIANAWEWTAD---AYTGPRQSHA-N 287

Query: 294 GPTTGT--------DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
           G T             V KGGS+LC   +C R+R +AR     D    ++GFR     G
Sbjct: 288 GDTAAVAGRQRAVRSAVIKGGSFLCAPDFCVRYRASAREPAENDLPTSHIGFRTVLRDG 346


>gi|417551131|ref|ZP_12202209.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-18]
 gi|417565391|ref|ZP_12216265.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC143]
 gi|395557147|gb|EJG23148.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC143]
 gi|400385586|gb|EJP48661.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-18]
          Length = 326

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 99/340 (29%), Positives = 145/340 (42%), Gaps = 76/340 (22%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           + Y  MV +P  TF++G N     +   + S + T+  F++D  EV+N QFQ FV ATGY
Sbjct: 54  DAYAGMVKIPSGTFKIGNNNSYPEEKALYKS-DRTIKEFWIDATEVTNAQFQSFVKATGY 112

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKIS----QVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           +TEAE  G+  VF P   +E+ A+++        + +   G D+T +   + PV  I+  
Sbjct: 113 LTEAESQGEAAVFTP--PKEQVAELAWWSLMKGANWRHPWGPDATRQILPNEPVRMITLK 170

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
           DA+AY  W G  LPTE +WEY  +G  + R            I + +EH           
Sbjct: 171 DAMAYADWLGRDLPTEEQWEYAAKGFSKER-----------DISANLEH----------- 208

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
              +    W G                     FP+                      N  
Sbjct: 209 ---IDANVWEGQ--------------------FPY---------------------QNEN 224

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVH--HHPAPSYNPKGPTTGTDKV 302
            DG+   APV  +  N FGLY+M+GNVWE+T+  +   H  H      +   P       
Sbjct: 225 KDGFEDVAPVGCFPSNGFGLYDMIGNVWEYTSSPFTGTHDDHMGEQQLHTGHPQVFNSYT 284

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            KGGS+LC   YC R+R +AR     D    ++GFR   +
Sbjct: 285 IKGGSFLCASNYCMRYRASARHSQEADLGISHVGFRTVKN 324



 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 75/239 (31%), Positives = 105/239 (43%), Gaps = 54/239 (22%)

Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 215
           L    +W HP G D+T +   N PV  ++  DA+AY  W G  LPTE +WEY  +G  + 
Sbjct: 140 LMKGANWRHPWGPDATRQILPNEPVRMITLKDAMAYADWLGRDLPTEEQWEYAAKGFSKE 199

Query: 216 RLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
           R      +++   EH  ANVW+G+FP  N   DG+   APV  +  N FGLY+M+GNVWE
Sbjct: 200 R------DISANLEHIDANVWEGQFPYQNENKDGFEDVAPVGCFPSNGFGLYDMIGNVWE 253

Query: 275 WTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
           +T+                 P TGT        ++  +Q    H                
Sbjct: 254 YTSS----------------PFTGTHD-----DHMGEQQLHTGH---------------- 276

Query: 335 LGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
                     P        KGGS+LC   YC R+R +AR     D    ++GFR   ++
Sbjct: 277 ----------PQVFNSYTIKGGSFLCASNYCMRYRASARHSQEADLGISHVGFRTVKNI 325


>gi|91202992|emb|CAJ72631.1| conserved hypothetical protein [Candidatus Kuenenia
           stuttgartiensis]
          Length = 263

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 74/167 (44%), Positives = 98/167 (58%), Gaps = 19/167 (11%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           + PVV V W DAVAY  W G RLPTEAEWE   RG  + RL+PWGN +      R NVW+
Sbjct: 109 DKPVVGVRWEDAVAYAEWAGKRLPTEAEWEKAARGS-DRRLWPWGNEIF--DAERCNVWE 165

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNV-HHHPAPSYNPK 293
            +           + T+PV SYK+ K  +G Y+M GNVWEW  DW++  +++  P  NPK
Sbjct: 166 TK----------KMQTSPVGSYKQGKSPYGCYDMSGNVWEWCTDWYDQNYYYSFPRENPK 215

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP +G  KV +GGS+     + +  RCAAR +  P +    +GFRCA
Sbjct: 216 GPESGQQKVIRGGSW---HYFGHFARCAARYRVLPHAETPQIGFRCA 259



 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 60/167 (35%), Positives = 79/167 (47%), Gaps = 43/167 (25%)

Query: 3   LLPAPPVERYKDMVLLPGDTFRMGTNK--------PILIKDGEFPSRNVTLDAFYLDQHE 54
           LL A   E+Y  MV +P   F MG +K           I DG  P   V LD++Y+D++E
Sbjct: 22  LLAAEEGEQYPGMVYVPAGEFYMGEDKYYDWTFMLAFNIYDG--PEHIVYLDSYYIDKYE 79

Query: 55  VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
           V+N Q+ +F+  TG+                       +I     D +RF   D      
Sbjct: 80  VTNEQYAKFIEETGH-----------------------RIPYCWFD-ERFNTPDK----- 110

Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
              PVV + W DAVAY  W G RLPTEAEWE   RG  + RL+PWG+
Sbjct: 111 ---PVVGVRWEDAVAYAEWAGKRLPTEAEWEKAARGS-DRRLWPWGN 153


>gi|302037191|ref|YP_003797513.1| hypothetical protein NIDE1860 [Candidatus Nitrospira defluvii]
 gi|300605255|emb|CBK41588.1| conserved exported protein of unknown function, similar to
           sulfatase modifying factor [Candidatus Nitrospira
           defluvii]
          Length = 296

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 103/339 (30%), Positives = 139/339 (41%), Gaps = 79/339 (23%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R  DMV +P   FRMG  +       E P R + L AF +D+ EV+N  +QEFV ATGY 
Sbjct: 28  RVDDMVFVPAGEFRMGATEESGGLPDERPLRLIYLSAFRIDRQEVTNAAYQEFVEATGYQ 87

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
             A          P L+  E                 ++ +     HPVV++SW DAVA+
Sbjct: 88  APANS-------APALTLWEH----------------NAPLPGIEQHPVVNVSWLDAVAF 124

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN---- 186
           C W   RLPTEAEWE   R G + R +PWG+                       WN    
Sbjct: 125 CRWANKRLPTEAEWEKAAR-GTDGRTYPWGN----------------------EWNFENG 161

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
           ++ +Y   R  +     EWE     G+                       G   +     
Sbjct: 162 NSASYWARRTVQFADSTEWEAFWVKGV-----------------------GAAISKEKGL 198

Query: 247 DGYLSTAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVK 303
            G + T PV S+    + +G+ +M GN  EW  DW+N + +  A   NP+GP  G  K  
Sbjct: 199 KGEILTLPVGSFPGGASPYGVLDMAGNAAEWVQDWYNPNDYRTAALTNPQGPERGAIKGM 258

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +GGS+L   +     R   R   T DS     GFRCA D
Sbjct: 259 RGGSWL---KPAISLRTTDRDWGTMDSRPSGTGFRCARD 294


>gi|418121846|ref|ZP_12758789.1| hypothetical protein SPAR80_1718 [Streptococcus pneumoniae GA44194]
 gi|419491687|ref|ZP_14031425.1| hypothetical protein SPAR88_1682 [Streptococcus pneumoniae GA47179]
 gi|419532986|ref|ZP_14072501.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA47794]
 gi|421275499|ref|ZP_15726328.1| hypothetical protein SPAR117_1508 [Streptococcus pneumoniae
           GA52612]
 gi|353792682|gb|EHD73054.1| hypothetical protein SPAR80_1718 [Streptococcus pneumoniae GA44194]
 gi|379593049|gb|EHZ57864.1| hypothetical protein SPAR88_1682 [Streptococcus pneumoniae GA47179]
 gi|379605506|gb|EHZ70257.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA47794]
 gi|395873463|gb|EJG84555.1| hypothetical protein SPAR117_1508 [Streptococcus pneumoniae
           GA52612]
          Length = 148

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 67/150 (44%), Positives = 90/150 (60%), Gaps = 10/150 (6%)

Query: 199 LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY 258
           LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA+DG++ TAPV  +
Sbjct: 3   LPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTASDGFIGTAPVYEF 61

Query: 259 KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNE 312
             N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS+LC+ 
Sbjct: 62  LPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHC 118

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC R+R AAR+     S++ +LGFRC  +
Sbjct: 119 SYCNRYRVAARNGCISTSTSSHLGFRCLKE 148


>gi|392409905|ref|YP_006446512.1| hypothetical protein Desti_1536 [Desulfomonile tiedjei DSM 6799]
 gi|390623041|gb|AFM24248.1| hypothetical protein Desti_1536 [Desulfomonile tiedjei DSM 6799]
          Length = 671

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/184 (41%), Positives = 101/184 (54%), Gaps = 15/184 (8%)

Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           + P V +SWNDA A+C W      R  RLPTEAEWE GCRGG     FPWG+ L   G+ 
Sbjct: 270 DQPAVFISWNDAQAFCRWLSRKEKRTYRLPTEAEWEMGCRGGSAWVRFPWGDRLP--GDR 327

Query: 231 RANVWQG--EFPTNNTAA-DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP- 286
             N   G  + P + T   DGY   +PV S+  N  GLY+M GNV EW  D +  +++  
Sbjct: 328 DLNFGDGNPKLPESMTTVNDGYEYVSPVGSFPPNGLGLYDMEGNVMEWVEDRYERNYYET 387

Query: 287 APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           +P  +PKGP +GT ++ KGG++  +   C   RCA R  + P+ S  NLGFR   +    
Sbjct: 388 SPIEDPKGPASGTSRINKGGNWYASPADC---RCAFRGFSGPEMSFFNLGFRVVLEDENQ 444

Query: 347 TGTD 350
             TD
Sbjct: 445 LKTD 448


>gi|418160529|ref|ZP_12797228.1| hypothetical protein SPAR43_1871 [Streptococcus pneumoniae GA17227]
 gi|353822262|gb|EHE02438.1| hypothetical protein SPAR43_1871 [Streptococcus pneumoniae GA17227]
          Length = 148

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 67/150 (44%), Positives = 89/150 (59%), Gaps = 10/150 (6%)

Query: 199 LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY 258
           LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NTA DG++ TAPV  +
Sbjct: 3   LPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEF 61

Query: 259 KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNE 312
             N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS+LC+ 
Sbjct: 62  LPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHC 118

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC R+R AAR+     S++ +LGFRC  +
Sbjct: 119 SYCNRYRVAARNGCISTSTSSHLGFRCLKE 148


>gi|119628390|gb|EAX07985.1| sulfatase modifying factor 2, isoform CRA_j [Homo sapiens]
          Length = 273

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 101/337 (29%), Positives = 136/337 (40%), Gaps = 117/337 (34%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV L G  F MGTN P   +DGE P R  T+  F +D   V+N  F++FV    Y TEAE
Sbjct: 49  MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            FG +FVFE  +S+E R K +Q                     P+  + W        W 
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 137

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
              LP E  +                 W  P G  S I  R+ HPV+HVS          
Sbjct: 138 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVS---------- 167

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
                                                   W G+FP  + A DG+   +P
Sbjct: 168 ----------------------------------------WNGKFPKGDKAEDGFHGVSP 187

Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
           V ++  +N +GLY+++GNVWEWTA             +P        +V +G S++   +
Sbjct: 188 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 234

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
               +R R   R  NTPDS++ NLGFRCAAD G   G
Sbjct: 235 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 271



 Score = 41.2 bits (95), Expect = 0.96,   Method: Compositional matrix adjust.
 Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)

Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           +V +G S++   +    +R R   R  NTPDS++ NLGFRCAAD 
Sbjct: 222 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 266


>gi|297181914|gb|ADI18091.1| uncharacterized conserved protein [uncultured Acidobacteriales
           bacterium HF0200_23L05]
          Length = 272

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 69/167 (41%), Positives = 94/167 (56%), Gaps = 7/167 (4%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           +HPVVH++W+DA + CTW G RLPTEAEWEY  RGG +  +F WGN LT      AN   
Sbjct: 104 DHPVVHLTWDDAASVCTWVGGRLPTEAEWEYAARGGSDGTIFWWGNQLT---RDYANFGH 160

Query: 237 GEFPTNNT-AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP 295
            E     T  AD +++TAPV S+  N FG Y++ GN WEW   W +  +      +P+ P
Sbjct: 161 TECCAGTTGGADVWVNTAPVGSFPANGFGFYDISGNAWEWVDGWIDDQYPNESVIDPQPP 220

Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            TG  +V +G S+L    Y    R + R   + ++   N+G RCA D
Sbjct: 221 ATGQLRVMRGASWL---NYPEVWRLSVRLAFSAEAHTSNIGVRCARD 264



 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 59/168 (35%), Positives = 79/168 (47%), Gaps = 41/168 (24%)

Query: 1   MVLLPAPPVERYKDM--VLLPGDTFRMGT--NKPILIKDGEFPSRNVTLDAFYLDQHEVS 56
           +VL  A  V +  D+  V + G TF+MG   + P  ++  E P   VTL AF L + E++
Sbjct: 20  IVLGKAELVAQEIDLSWVRVEGGTFQMGCVPDDPFCLET-EQPRHEVTLSAFELMETELT 78

Query: 57  NTQFQEFVSATGYVTE-AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRM 115
            +Q+  FV ATG     A  F  T                                    
Sbjct: 79  VSQYGVFVDATGRTPPPAPDFLQT-----------------------------------G 103

Query: 116 HHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWL 163
            HPVVH++W+DA + CTW G RLPTEAEWEY  RGG +  +F WG+ L
Sbjct: 104 DHPVVHLTWDDAASVCTWVGGRLPTEAEWEYAARGGSDGTIFWWGNQL 151


>gi|225857398|ref|YP_002738909.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae P1031]
 gi|225724825|gb|ACO20677.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
           enzyme 1) [Streptococcus pneumoniae P1031]
          Length = 148

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 66/150 (44%), Positives = 89/150 (59%), Gaps = 10/150 (6%)

Query: 199 LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY 258
           LPTEA+WEY  RGG ++  +PWG+ L   G + AN WQG FP  NT +DG++ TAPV  +
Sbjct: 3   LPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTVSDGFIGTAPVYEF 61

Query: 259 KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNE 312
             N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS+LC+ 
Sbjct: 62  LPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHC 118

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC R+R AAR+     S++ +LGFRC  +
Sbjct: 119 SYCNRYRVAARNGCISTSTSSHLGFRCLKE 148


>gi|114799422|ref|YP_760374.1| hypothetical protein HNE_1666 [Hyphomonas neptunium ATCC 15444]
 gi|114739596|gb|ABI77721.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444]
          Length = 256

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 77/182 (42%), Positives = 97/182 (53%), Gaps = 16/182 (8%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           SW  PEG  S+IE + N PV+HVS  DA AY  W G RLP+E EWE+  R GL +     
Sbjct: 87  SWKTPEGAGSSIEGKGNWPVMHVSLADAEAYAAWAGGRLPSEEEWEHAARLGLPDPDRET 146

Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
                  G+ RAN WQG FP  N   DG+   APV  +  ++ GLY+M+GNVWEWT    
Sbjct: 147 SGAFEDDGKPRANTWQGIFPVANAGEDGFAGAAPVGCFPADQLGLYDMIGNVWEWTD--- 203

Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                      P  P   T    KGGSYLC + +C R+R AAR     D S+ ++GFR  
Sbjct: 204 ----------TPFAPGNNT---IKGGSYLCADNFCQRYRPAARHPQEIDFSSNHIGFRIV 250

Query: 341 AD 342
            +
Sbjct: 251 KE 252



 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 55/143 (38%), Positives = 72/143 (50%), Gaps = 12/143 (8%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
            V +PG   + G      I   E P   + +D F +  HEV+N QF EFV+ATGYVT+AE
Sbjct: 4   FVEVPGGVLQKGRGA---IYPEERPEVTLHVDGFRIQAHEVTNDQFAEFVTATGYVTDAE 60

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRF-----EGLDSTIEHRMHHPVVHISWNDAVA 129
           +     V E               H M+       EG  S+IE + + PV+H+S  DA A
Sbjct: 61  RG----VMEDRPGAGSAVFQGARWHLMREASWKTPEGAGSSIEGKGNWPVMHVSLADAEA 116

Query: 130 YCTWRGARLPTEAEWEYGCRGGL 152
           Y  W G RLP+E EWE+  R GL
Sbjct: 117 YAAWAGGRLPSEEEWEHAARLGL 139



 Score = 42.4 bits (98), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 26/40 (65%)

Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
           KGGSYLC + +C R+R AAR     D S+ ++GFR   ++
Sbjct: 214 KGGSYLCADNFCQRYRPAARHPQEIDFSSNHIGFRIVKEL 253


>gi|121596142|ref|YP_988038.1| hypothetical protein Ajs_3855 [Acidovorax sp. JS42]
 gi|120608222|gb|ABM43962.1| protein of unknown function DUF323 [Acidovorax sp. JS42]
          Length = 348

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 105/353 (29%), Positives = 147/353 (41%), Gaps = 83/353 (23%)

Query: 17  LLPGDTFRMGTNKPI--LIKD-------------GEFPSRNVTLD-AFYLDQHEVSNTQF 60
           + PG +FRMG+++P+  L +D              E P+  V +  AFYL Q EV+  QF
Sbjct: 44  IAPG-SFRMGSSEPVDALARDYPTLERWRFEQLADEAPAHTVRITRAFYLGQTEVTVGQF 102

Query: 61  QEFVSATGYVTEAEKFGDT-FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV 119
           + FV  +GYV E+E  G   + + P       A+         R+   +        HPV
Sbjct: 103 RRFVQTSGYVPESEADGTGGYGYNPAYDPAASARGDAFEGRDPRYSWRNPGFAQGDDHPV 162

Query: 120 VHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIE 173
           V+I+W+DA A   W  A      RLPTEAEWEY CR G   R        +P G D    
Sbjct: 163 VNITWHDAHALAAWLSATEGHRYRLPTEAEWEYACRAGTRTR--------YPHGDDPAAL 214

Query: 174 HRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
            R      +V   DA  Y                                          
Sbjct: 215 ARH----ANVFDEDAAPY------------------------------------------ 228

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNP 292
            W    P      DG+  TAPV  Y  N FG+++M+GNVWEW +DW+    +  +P  +P
Sbjct: 229 -WPRWRPQAAPGHDGHAFTAPVGRYAPNAFGVHDMLGNVWEWVSDWYAEEGYAHSPRDDP 287

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
           +GP  G  +V++GGS+     Y    RC  R+ NTP +    +G R   +  P
Sbjct: 288 QGPAAGQVRVRRGGSWHTWPLYA---RCGFRNWNTPQTRYTLVGLRLVREIDP 337


>gi|241766627|ref|ZP_04764477.1| protein of unknown function DUF323 [Acidovorax delafieldii 2AN]
 gi|241363096|gb|EER58719.1| protein of unknown function DUF323 [Acidovorax delafieldii 2AN]
          Length = 344

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 105/364 (28%), Positives = 151/364 (41%), Gaps = 82/364 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPS---RNVTL-------------DAFYLDQHEVSNT 58
            V +P  +F MG+++        FP    +  TL              AFYL QHEV+  
Sbjct: 21  FVRVPAGSFWMGSDESAQTLAQAFPRLEPKRFTLLGDEAPVHRVHIRRAFYLGQHEVTVG 80

Query: 59  QFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
           QF+ F+ A+GY  E+E  G   + + P       A+         R+   +        H
Sbjct: 81  QFRRFIEASGYRPESEADGTGGYGYNPDYDPATTARGDAFEGRNPRYSWRNPGFVQGDDH 140

Query: 118 PVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST 171
           PVV+++WNDA A   W         RLPTEAEWEY CR G   R +P G    P G+   
Sbjct: 141 PVVNVTWNDAQALAAWLSRTEGHRYRLPTEAEWEYACRAGTRTR-YPHGD--APAGL--- 194

Query: 172 IEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
                   V   +  D  A   W                               PR +  
Sbjct: 195 --------VSEANVFDQAAAPYW-------------------------------PRWQQH 215

Query: 232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSY 290
           A       P N    DGY  TAPV  +  N FGLY+M+GN WEW +DW  + ++  +P  
Sbjct: 216 A------LPGN----DGYAFTAPVGRFAPNAFGLYDMLGNAWEWVSDWHSDTYYAQSPPS 265

Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTD 350
           +P+GP  G+ +V++GGS+     + +  RCA R+ N+P +    +G R   +     G  
Sbjct: 266 DPQGPAEGSVRVRRGGSW---HTWAFYARCAFRNWNSPQTRYTLVGMRLVRETDEEDGIR 322

Query: 351 KVKK 354
            V +
Sbjct: 323 DVPR 326


>gi|445491299|ref|ZP_21459614.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii AA-014]
 gi|444764433|gb|ELW88746.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii AA-014]
          Length = 335

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 102/330 (30%), Positives = 138/330 (41%), Gaps = 72/330 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA- 73
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+T+A 
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124

Query: 74  -EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
            +K    F  +P                               HHP     W     Y T
Sbjct: 125 KQKQAAVFSPDP-------------------------------HHP--QQWWQLKTGY-T 150

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR            G  G L N                      + PV +V+ NDA  Y 
Sbjct: 151 WRTPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYA 180

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLS 251
            W G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +  
Sbjct: 181 VWLGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTG 239

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWT-ADWWNVHHHPAPSYNP--KGPTTGTDKVKKGGSY 308
            APV  +K N F L++M+GNVWEWT + +   H     +Y+   +   + T  V KGGS+
Sbjct: 240 IAPVGCFKPNGFKLFDMIGNVWEWTSSSYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSF 299

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           LC E YC R+R ++R     D +A ++GFR
Sbjct: 300 LCAENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|222112297|ref|YP_002554561.1| hypothetical protein Dtpsy_3128 [Acidovorax ebreus TPSY]
 gi|221731741|gb|ACM34561.1| protein of unknown function DUF323 [Acidovorax ebreus TPSY]
          Length = 348

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 112/361 (31%), Positives = 152/361 (42%), Gaps = 99/361 (27%)

Query: 17  LLPGDTFRMGTNKPI----------------LIKDGEFPSRNVTLD-AFYLDQHEVSNTQ 59
           + PG +FRMG+++P+                 + D E P+  V +  AFYL Q EV+  Q
Sbjct: 44  IAPG-SFRMGSSEPVDALARAYPTLERWRFEQLAD-EAPAHTVRITRAFYLGQTEVTVGQ 101

Query: 60  FQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRM--- 115
           F+ FV A+GYV E+E  G   + + P          + VR D   FEG D     R    
Sbjct: 102 FRRFVQASGYVPESEADGTGGYGYNPAYD-----PAASVRGDA--FEGRDPRYSWRNPGF 154

Query: 116 ----HHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
                HPVV+I+W+DA A   W  A      RLPTEAEWEY CR G   R        +P
Sbjct: 155 AQGDDHPVVNITWHDAHALAAWLSATEGHRYRLPTEAEWEYACRAGTRTR--------YP 206

Query: 166 EGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLT 225
            G D     R      +V   DA  Y                                  
Sbjct: 207 HGDDPAALARH----ANVFDEDAAPY---------------------------------- 228

Query: 226 PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH 285
                    W    P      DG+  TAPV  Y  N FG+++M+GNVWEW +DW+    +
Sbjct: 229 ---------WPRWRPQAAPGHDGHAFTAPVGRYAPNAFGVHDMLGNVWEWVSDWYAEEGY 279

Query: 286 P-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
             +P  +P+GP TG  +V++GGS+     Y    RC  R+ NTP +    +G R   +  
Sbjct: 280 AHSPRDDPQGPATGQVRVRRGGSWHTWPLYA---RCGFRNWNTPQTRYTLVGLRLVREID 336

Query: 345 P 345
           P
Sbjct: 337 P 337


>gi|417553728|ref|ZP_12204797.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-81]
 gi|417562667|ref|ZP_12213546.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC137]
 gi|421201406|ref|ZP_15658565.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC109]
 gi|421456418|ref|ZP_15905760.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-123]
 gi|421635404|ref|ZP_16076006.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-13]
 gi|421805025|ref|ZP_16240919.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-A-694]
 gi|395525249|gb|EJG13338.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC137]
 gi|395563438|gb|EJG25091.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC109]
 gi|400210846|gb|EJO41810.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-123]
 gi|400390145|gb|EJP57192.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-81]
 gi|408702223|gb|EKL47636.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-13]
 gi|410410075|gb|EKP61995.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-A-694]
          Length = 335

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 102/330 (30%), Positives = 138/330 (41%), Gaps = 72/330 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA- 73
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+T+A 
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124

Query: 74  -EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
            +K    F  +P                               HHP     W     Y T
Sbjct: 125 KQKQAAVFSPDP-------------------------------HHP--QQWWQLKTGY-T 150

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR            G  G L N                      + PV +V+ NDA  Y 
Sbjct: 151 WRTPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYA 180

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLS 251
            W G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +  
Sbjct: 181 VWLGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTG 239

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSY 308
            APV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+
Sbjct: 240 IAPVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSF 299

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           LC E YC R+R ++R     D +A ++GFR
Sbjct: 300 LCAENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|332663258|ref|YP_004446046.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
            1100]
 gi|332332072|gb|AEE49173.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis DSM
            1100]
          Length = 1050

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 105/354 (29%), Positives = 158/354 (44%), Gaps = 60/354 (16%)

Query: 14   DMVLLPGDTFRMGTNKPILI----------KDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
            +MV++ G +F MG +  IL+           D E     V L  F++  H V+  +F+ F
Sbjct: 734  EMVIVEGGSFAMGLDSTILLYNDGKGNKEHSDNEQSVYQVVLADFWMATHPVTLGEFRRF 793

Query: 64   VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
            V  TGY T AEK G ++ ++    E E+ +    + + K       T++    HPV+H+S
Sbjct: 794  VQETGYQTLAEKEGWSWSYDAETGEWEKIEGQNWQCNTK------GTLQINDRHPVLHLS 847

Query: 124  WNDAVAYCTW--RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
            W DA+ YC W  R A L    + +Y                L+P+   +  E+      V
Sbjct: 848  WYDALEYCNWLSRAAGLAEVYQIDY----------------LNPDA-HNLAENDDLKWTV 890

Query: 182  HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
             V+W  +       G RLPTEAEWEY  R     R   +GN     G+  A+  Q  F  
Sbjct: 891  KVNWETS-------GYRLPTEAEWEYAARE--RGRSVRFGN-----GQDIADPTQMNFDA 936

Query: 242  NNTAADGY-------LSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNPK 293
            ++     Y        +T P  ++  N  G++NM GNV EW  DW+ N ++  +P  NP 
Sbjct: 937  SSHYKIDYSIVGIYRAATVPADAFAPNALGIFNMSGNVAEWCWDWYANDYYQYSPLENPH 996

Query: 294  GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
            GP     +V +GGS+  N +YC   R + R  N P     + GFR      P +
Sbjct: 997  GPAEDAYRVLRGGSWFYNTKYC---RSSFRFSNLPAYRFSDFGFRVVLFPPPVS 1047


>gi|78188021|ref|YP_378359.1| hypothetical protein Cag_0037 [Chlorobium chlorochromatii CaD3]
 gi|78170220|gb|ABB27316.1| conserved hypothetical protein [Chlorobium chlorochromatii CaD3]
          Length = 325

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 103/341 (30%), Positives = 154/341 (45%), Gaps = 87/341 (25%)

Query: 14  DMVLLPGDTFRMGTNKPILIKD-GEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + VL+ G  F MG+ +    +D  E P R V +  FY+ ++EV+  +F+ FV  TGY T+
Sbjct: 59  NFVLIRGGEFTMGSPESESERDRDEMPHR-VKVGDFYIARYEVTTAEFRTFVQETGYRTD 117

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEK   + VF   L   +     +   + K   G ++      +HPV+ +SWNDAVAYC 
Sbjct: 118 AEKTNPSLVFWSGLWPGKAGLNWRYGTNGKERSGAEN------NHPVILVSWNDAVAYCK 171

Query: 133 WRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
           W         RLPT AEWEY CR G    +F +G  L                       
Sbjct: 172 WLSKKHGMNFRLPTSAEWEYACRAGTST-VFNYGDNLS---------------------- 208

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
                         T+A ++           +P+ N+  P+G +R N             
Sbjct: 209 -------------TTQANYDGN---------YPYSNH--PKGIYRKN------------- 231

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-----YNPKGPTTGTDK 301
                T PV S+  N +GLYNM GNV EW +DW++  ++ +        NP GP TG+++
Sbjct: 232 -----TVPVNSFTPNAWGLYNMHGNVAEWCSDWYSEPYYESSKANGTVTNPTGPATGSNR 286

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           V +GGS+  + +YC   R A  + +TP     N+GFR   +
Sbjct: 287 VMRGGSWYDDARYC---RSADINDSTPSYRYINVGFRVVLE 324


>gi|392408858|ref|YP_006445465.1| hypothetical protein Desti_0466 [Desulfomonile tiedjei DSM 6799]
 gi|390621994|gb|AFM23201.1| hypothetical protein Desti_0466 [Desulfomonile tiedjei DSM 6799]
          Length = 984

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 109/359 (30%), Positives = 153/359 (42%), Gaps = 111/359 (30%)

Query: 14  DMVLLPGDTFRMGTN--------------KPILIKDGEFPSRNVTLD-AFYLDQHEVSNT 58
           +MV +P  +F MG++              +PI +++ EFP   V +   FY+   EV+  
Sbjct: 537 NMVRIPAGSFVMGSSETDIAWAMTTLAQGQPISLEN-EFPFHKVRMSRPFYISATEVTVA 595

Query: 59  QFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDS--------T 110
           QF+ FV  TGYVT+AE             +E+  +I   R   +RFE  +         T
Sbjct: 596 QFKAFVDETGYVTDAE-------------DEKGGQIFNSR--TRRFEKKEGSSWKDPGWT 640

Query: 111 IEHRMHHPVVHISWNDAVAYCTWRGAR------LPTEAEWEYGCRGGLENRLFPWGSWLH 164
           I      PVV IS+NDA A+  W  A+      LPTEA+WEY  RGGL   +FPWG  L 
Sbjct: 641 ITR--DQPVVMISYNDAQAFVEWLAAKEKLPYKLPTEAQWEYAARGGLPMSVFPWGDQL- 697

Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL 224
           P+G       R N+                  A   T+ EW                   
Sbjct: 698 PDG------QRANY------------------ADKSTDYEWR------------------ 715

Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
                            + TA DG+   +PV SY+ N +GLY+M GN  EW  D++   +
Sbjct: 716 -----------------DRTADDGHKFVSPVGSYEPNGYGLYDMAGNALEWVRDYYGEDY 758

Query: 285 HP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +  +P  +P+GP  G  +  KGG +          RCA R    P+SS  N GFR A +
Sbjct: 759 YRFSPEIDPEGPGHGEFRTMKGGEWTFG---AVNLRCAFRGWARPESSFQNSGFRVAIE 814



 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 100/202 (49%), Gaps = 20/202 (9%)

Query: 177 NHPVVHVSWNDAVAYCTWRGAR------LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           + PV  VS  DA A+C W   R      LPTEA+WEY  RGG +   FPWGN+  P G  
Sbjct: 273 DLPVTLVSHADAEAFCEWLSKREGRPYTLPTEAQWEYAARGGKQGDRFPWGNDY-PDG-R 330

Query: 231 RANVWQ--GEFP-TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA 287
           + NV       P  + T  DGY   APV SY+ N F LY+M+GNVWE  +D+++   + +
Sbjct: 331 KLNVADRNARVPWADRTVDDGYGEIAPVGSYEPNGFWLYDMMGNVWEACSDYYDPKLYES 390

Query: 288 PS----YNPKGPTTGTDKVKKGGSYLCNEQYCYR-HRCAARSQNTPDSSAGNLGFRCAAD 342
            S    ++P GP TG  KV +GG++  +        R  A  +   D S    GFR +A 
Sbjct: 391 RSSQSVFDPTGPATGKTKVVRGGNWAFDAGIARNAFRFGADRELCTDMS----GFRVSAL 446

Query: 343 KGPTTGTDKVKKGGSYLCNEQY 364
                 +   K   + L NE +
Sbjct: 447 ASAAELSQAKKTDENLLSNENF 468


>gi|168494286|ref|ZP_02718429.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC3059-06]
 gi|172079528|ref|ZP_02708283.2| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC1873-00]
 gi|418074650|ref|ZP_12711899.1| hypothetical protein SPAR19_1791 [Streptococcus pneumoniae GA11184]
 gi|418079245|ref|ZP_12716467.1| hypothetical protein SPAR123_1693 [Streptococcus pneumoniae
           4027-06]
 gi|418081442|ref|ZP_12718652.1| hypothetical protein SPAR121_1740 [Streptococcus pneumoniae
           6735-05]
 gi|418090172|ref|ZP_12727325.1| hypothetical protein SPAR77_1762 [Streptococcus pneumoniae GA43265]
 gi|418099137|ref|ZP_12736233.1| hypothetical protein SPAR122_1719 [Streptococcus pneumoniae
           6901-05]
 gi|418105924|ref|ZP_12742980.1| hypothetical protein SPAR85_1853 [Streptococcus pneumoniae GA44500]
 gi|418115329|ref|ZP_12752315.1| hypothetical protein SPAR125_1731 [Streptococcus pneumoniae
           5787-06]
 gi|418117488|ref|ZP_12754457.1| hypothetical protein SPAR124_1704 [Streptococcus pneumoniae
           6963-05]
 gi|418135513|ref|ZP_12772367.1| hypothetical protein SPAR23_1789 [Streptococcus pneumoniae GA11426]
 gi|418162832|ref|ZP_12799513.1| hypothetical protein SPAR49_1824 [Streptococcus pneumoniae GA17328]
 gi|418174213|ref|ZP_12810824.1| hypothetical protein SPAR67_1819 [Streptococcus pneumoniae GA41277]
 gi|418176579|ref|ZP_12813170.1| hypothetical protein SPAR71_1815 [Streptococcus pneumoniae GA41437]
 gi|418217261|ref|ZP_12843940.1| hypothetical protein SPAR147_1733 [Streptococcus pneumoniae
           Netherlands15B-37]
 gi|418219515|ref|ZP_12846180.1| hypothetical protein SPAR145_1804 [Streptococcus pneumoniae NP127]
 gi|418239313|ref|ZP_12865864.1| hypothetical protein SPAR146_1771 [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|419432192|ref|ZP_13972325.1| hypothetical protein SPAR140_1732 [Streptococcus pneumoniae
           EU-NP05]
 gi|419441000|ref|ZP_13981043.1| hypothetical protein SPAR64_1744 [Streptococcus pneumoniae GA40410]
 gi|419462951|ref|ZP_14002854.1| hypothetical protein SPAR3_1848 [Streptococcus pneumoniae GA02714]
 gi|419465293|ref|ZP_14005184.1| hypothetical protein SPAR4_1809 [Streptococcus pneumoniae GA04175]
 gi|419469614|ref|ZP_14009481.1| hypothetical protein SPAR9_1688 [Streptococcus pneumoniae GA06083]
 gi|419498180|ref|ZP_14037887.1| hypothetical protein SPAR99_1742 [Streptococcus pneumoniae GA47522]
 gi|419526502|ref|ZP_14066060.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA14373]
 gi|419535281|ref|ZP_14074780.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA17457]
 gi|421281800|ref|ZP_15732596.1| non-specific serine/threonine protein kinase [Streptococcus
           pneumoniae GA04672]
 gi|421310213|ref|ZP_15760838.1| non-specific serine/threonine protein kinase [Streptococcus
           pneumoniae GA62681]
 gi|172043211|gb|EDT51257.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC1873-00]
 gi|183575814|gb|EDT96342.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC3059-06]
 gi|353746181|gb|EHD26843.1| hypothetical protein SPAR19_1791 [Streptococcus pneumoniae GA11184]
 gi|353746772|gb|EHD27432.1| hypothetical protein SPAR123_1693 [Streptococcus pneumoniae
           4027-06]
 gi|353752181|gb|EHD32812.1| hypothetical protein SPAR121_1740 [Streptococcus pneumoniae
           6735-05]
 gi|353760945|gb|EHD41520.1| hypothetical protein SPAR77_1762 [Streptococcus pneumoniae GA43265]
 gi|353768143|gb|EHD48668.1| hypothetical protein SPAR122_1719 [Streptococcus pneumoniae
           6901-05]
 gi|353776100|gb|EHD56579.1| hypothetical protein SPAR85_1853 [Streptococcus pneumoniae GA44500]
 gi|353785413|gb|EHD65832.1| hypothetical protein SPAR125_1731 [Streptococcus pneumoniae
           5787-06]
 gi|353788169|gb|EHD68567.1| hypothetical protein SPAR124_1704 [Streptococcus pneumoniae
           6963-05]
 gi|353826894|gb|EHE07051.1| hypothetical protein SPAR49_1824 [Streptococcus pneumoniae GA17328]
 gi|353837418|gb|EHE17502.1| hypothetical protein SPAR67_1819 [Streptococcus pneumoniae GA41277]
 gi|353840650|gb|EHE20714.1| hypothetical protein SPAR71_1815 [Streptococcus pneumoniae GA41437]
 gi|353870051|gb|EHE49927.1| hypothetical protein SPAR147_1733 [Streptococcus pneumoniae
           Netherlands15B-37]
 gi|353873875|gb|EHE53734.1| hypothetical protein SPAR145_1804 [Streptococcus pneumoniae NP127]
 gi|353892304|gb|EHE72053.1| hypothetical protein SPAR146_1771 [Streptococcus pneumoniae
           NorthCarolina6A-23]
 gi|353900846|gb|EHE76396.1| hypothetical protein SPAR23_1789 [Streptococcus pneumoniae GA11426]
 gi|379530383|gb|EHY95623.1| hypothetical protein SPAR3_1848 [Streptococcus pneumoniae GA02714]
 gi|379536893|gb|EHZ02079.1| hypothetical protein SPAR4_1809 [Streptococcus pneumoniae GA04175]
 gi|379543868|gb|EHZ09016.1| hypothetical protein SPAR9_1688 [Streptococcus pneumoniae GA06083]
 gi|379557142|gb|EHZ22189.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA14373]
 gi|379563442|gb|EHZ28446.1| formylglycine-generating sulfatase enzyme family protein
           [Streptococcus pneumoniae GA17457]
 gi|379577237|gb|EHZ42159.1| hypothetical protein SPAR64_1744 [Streptococcus pneumoniae GA40410]
 gi|379599013|gb|EHZ63798.1| hypothetical protein SPAR99_1742 [Streptococcus pneumoniae GA47522]
 gi|379629273|gb|EHZ93874.1| hypothetical protein SPAR140_1732 [Streptococcus pneumoniae
           EU-NP05]
 gi|395879967|gb|EJG91022.1| non-specific serine/threonine protein kinase [Streptococcus
           pneumoniae GA04672]
 gi|395909828|gb|EJH20703.1| non-specific serine/threonine protein kinase [Streptococcus
           pneumoniae GA62681]
          Length = 148

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 66/150 (44%), Positives = 88/150 (58%), Gaps = 10/150 (6%)

Query: 199 LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY 258
           LPTEA+WEY  RGG ++  + WG+ L   G + AN WQG FP  NTA DG++ TAPV  +
Sbjct: 3   LPTEAQWEYAARGGRQSE-YLWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEF 61

Query: 259 KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNE 312
             N FGLY M+GNVWEW     N  +    S+N      PK      +   +GGS+LC+ 
Sbjct: 62  LPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHC 118

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            YC R+R AAR+     S++ +LGFRC  +
Sbjct: 119 SYCNRYRVAARNGCISTSTSSHLGFRCLKE 148


>gi|407937989|ref|YP_006853630.1| hypothetical protein C380_06410 [Acidovorax sp. KKS102]
 gi|407895783|gb|AFU44992.1| hypothetical protein C380_06410 [Acidovorax sp. KKS102]
          Length = 335

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 103/350 (29%), Positives = 142/350 (40%), Gaps = 84/350 (24%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFP-----------------SRNVTLDAFYLDQHEVSN 57
            V +P   F MG   P+   +  FP                    +T D FYL QHEV+ 
Sbjct: 34  FVRVPAGAFWMGGVPPVETLEKNFPLLERKRLTDLADESPAHEVQITRD-FYLGQHEVTV 92

Query: 58  TQFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
            QFQ F+ A+GY  E+E  G   + + P        +         R+   +        
Sbjct: 93  GQFQRFLEASGYRPESEADGTGGYGYNPAYDPATTRRGDAFEGRDPRYSWRNPGFAQGPD 152

Query: 117 HPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDS 170
           HPVV+++WNDA A   W         RLPTEAEWEY CR G  +        L+P G   
Sbjct: 153 HPVVNVTWNDAQAMAQWLSRTEGHRYRLPTEAEWEYACRAGSRS--------LYPHG--- 201

Query: 171 TIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
                 ++P   VS     A  T+  A  P  + W      G                  
Sbjct: 202 ------DNPQELVS-----AGNTFDQAAAPQWSRWRQHALAG------------------ 232

Query: 231 RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW-WNVHHHPAPS 289
                           DGY  TAPV  +  N FGLY+M+GN WEWT DW  + ++  +P 
Sbjct: 233 ---------------NDGYAFTAPVGRFAPNAFGLYDMLGNAWEWTGDWHGDTYYAESPR 277

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            +P GP TG+ +V++GGS+     Y    RC  R+ N+P +    +G R 
Sbjct: 278 NDPTGPATGSVRVRRGGSWHTWALYA---RCGYRNWNSPQTRYTLVGMRL 324


>gi|338175245|ref|YP_004652055.1| serine/threonine-protein kinase pkn1 [Parachlamydia acanthamoebae
           UV-7]
 gi|336479603|emb|CCB86201.1| serine/threonine-protein kinase pkn1 [Parachlamydia acanthamoebae
           UV-7]
          Length = 667

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/169 (40%), Positives = 93/169 (55%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A+AY  W G RLPTEAEWE    GG++N LFP G+++    + +AN + 
Sbjct: 516 KHPVVGVTWYGALAYAKWVGKRLPTEAEWEVAAYGGVDNYLFPTGDDIE---KTQANFFS 572

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
            +             T  VMSY  N +GLY+M GNV+EW  DW+  +++        NPK
Sbjct: 573 AD-------------TTGVMSYAPNGYGLYDMAGNVYEWCHDWYGYNYYEISVQEPENPK 619

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RC+ R +N P +  G  GFRCAAD
Sbjct: 620 GPLQGVYRVLRGG---CWKSLKEDLRCSRRHRNNPGTVNGTYGFRCAAD 665



 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 55/148 (37%), Positives = 72/148 (48%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMV +PG  F  G+         E P   VT+D+F +D H V+N QF  F+ A G     
Sbjct: 432 DMVGIPGGVFARGSASG---NRDEMPRHQVTVDSFAIDIHPVTNEQFTRFLEAMG----G 484

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH-HPVVHISWNDAVAYCT 132
           EK            +     I ++R    +  G    IE   + HPVV ++W  A+AY  
Sbjct: 485 EK------------DNNHHDIIRLRDSRIKRSGGKVHIEFGYNKHPVVGVTWYGALAYAK 532

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLPTEAEWE    GG++N LFP G
Sbjct: 533 WVGKRLPTEAEWEVAAYGGVDNYLFPTG 560


>gi|282890625|ref|ZP_06299148.1| hypothetical protein pah_c022o234 [Parachlamydia acanthamoebae str.
           Hall's coccus]
 gi|281499622|gb|EFB41918.1| hypothetical protein pah_c022o234 [Parachlamydia acanthamoebae str.
           Hall's coccus]
          Length = 655

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 69/169 (40%), Positives = 93/169 (55%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A+AY  W G RLPTEAEWE    GG++N LFP G+++    + +AN + 
Sbjct: 504 KHPVVGVTWYGALAYAKWVGKRLPTEAEWEVAAYGGVDNYLFPTGDDIE---KTQANFFS 560

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
            +             T  VMSY  N +GLY+M GNV+EW  DW+  +++        NPK
Sbjct: 561 AD-------------TTGVMSYAPNGYGLYDMAGNVYEWCHDWYGYNYYEISVQEPENPK 607

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RC+ R +N P +  G  GFRCAAD
Sbjct: 608 GPLQGVYRVLRGG---CWKSLKEDLRCSRRHRNNPGTVNGTYGFRCAAD 653



 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 55/148 (37%), Positives = 72/148 (48%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           DMV +PG  F  G+         E P   VT+D+F +D H V+N QF  F+ A G     
Sbjct: 420 DMVGIPGGVFARGSASG---NRDEMPRHQVTVDSFAIDIHPVTNEQFTRFLEAMG----G 472

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH-HPVVHISWNDAVAYCT 132
           EK            +     I ++R    +  G    IE   + HPVV ++W  A+AY  
Sbjct: 473 EK------------DNNHHDIIRLRDSRIKRSGGKVHIEFGYNKHPVVGVTWYGALAYAK 520

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLPTEAEWE    GG++N LFP G
Sbjct: 521 WVGKRLPTEAEWEVAAYGGVDNYLFPTG 548


>gi|293608913|ref|ZP_06691216.1| conserved hypothetical protein [Acinetobacter sp. SH024]
 gi|292829486|gb|EFF87848.1| conserved hypothetical protein [Acinetobacter sp. SH024]
          Length = 334

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 101/348 (29%), Positives = 143/348 (41%), Gaps = 81/348 (23%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFP--------------SRNVTLDAFYLDQHE 54
           +E+ K+   LP    +  T   +LI DG+F                +   +  F++DQ E
Sbjct: 44  IEKCKNYTGLPQGWLKQPTAGMVLIVDGDFNFGSEKAYPDELNFGKKQREVKGFWIDQTE 103

Query: 55  VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
           V+  QF  FV ATGY                +++ E+ K + V                 
Sbjct: 104 VTVAQFASFVKATGY----------------ITDAEKQKQAAV-------------FSPD 134

Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH 174
            HHP     W    +  TW+            G  G + N                    
Sbjct: 135 PHHPQ---QWWQLKSGYTWKTPN---------GGTGAIANP------------------- 163

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RAN 233
             N PV +VS NDA  Y  W G  LPTE EWEY  +   +    P     T   +H +AN
Sbjct: 164 --NEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PLHQAPTDEHQHPQAN 220

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP 292
            WQGEFP  N   D +   APV  +  N F L++M+GNVWEWT+  +   H     +Y+ 
Sbjct: 221 YWQGEFPFQNLNQDHFKDVAPVGCFTPNNFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSD 280

Query: 293 --KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
             +     T  V KGGS+LC + YC R+R ++R     D +A ++GFR
Sbjct: 281 LRQQQIASTQYVIKGGSFLCAQNYCSRYRSSSRYPQDFDLAATHVGFR 328


>gi|194337392|ref|YP_002019186.1| hypothetical protein Ppha_2378 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194309869|gb|ACF44569.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 433

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 148/339 (43%), Gaps = 89/339 (26%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           + VL+ G  F MG+ +  +  + +     V ++ FY+ ++EV+  +F+ F++ +GY T  
Sbjct: 165 NFVLIRGSEFTMGSPETEVGHESDETQHQVKVNDFYMSKYEVTVAEFRRFIAESGYRT-- 222

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                         + E+A  S V                          WN + A  T 
Sbjct: 223 --------------DAEKAGDSYV--------------------------WNGSEAVLT- 241

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                     W +G  G +  +                     NHPVVHVSWNDAV YC 
Sbjct: 242 ------KGVNWRHGVSGKVRTK------------------SEENHPVVHVSWNDAVEYCR 277

Query: 194 WRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
           W         RLPTEAEWEY CR G     F  G N+T     +AN + G +P N     
Sbjct: 278 WLSEKTGKWYRLPTEAEWEYACRAG-SRTAFNTGGNITT---AQAN-YDGNYPYNKNRKG 332

Query: 248 GYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPT--TGT 299
            Y + T  V S++ N +GLYNM GNVWEW  D ++  ++          NP GP   TG+
Sbjct: 333 QYRARTVAVESFEPNAWGLYNMHGNVWEWCDDVYSGTYYDECKAKGLVENPGGPAPETGS 392

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           ++V +GGS+  + +YC   R A+R   TP   +  +GFR
Sbjct: 393 NRVLRGGSWNYDAEYC---RSASRRGYTPGGRSNFVGFR 428


>gi|194335022|ref|YP_002016882.1| hypothetical protein Paes_2233 [Prosthecochloris aestuarii DSM 271]
 gi|194312840|gb|ACF47235.1| protein of unknown function DUF323 [Prosthecochloris aestuarii DSM
           271]
          Length = 398

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/175 (44%), Positives = 97/175 (55%), Gaps = 20/175 (11%)

Query: 178 HPVVHVSWNDAVAYCTW----RGA--RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
           +PVV+VSWNDAVAYC W    RG   RLPTEAEWEY CRGG     F  G NLT      
Sbjct: 225 YPVVNVSWNDAVAYCRWLSEKRGELFRLPTEAEWEYACRGGTTTP-FSTGENLT---TDE 280

Query: 232 ANVWQGEFPTNNTAADGYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----- 285
           AN + G +P  N +   Y  +T PV S++ N +GLYNM GNVWEW  DW+   ++     
Sbjct: 281 AN-YDGNYPYRNNSKGKYREATVPVDSFEPNSYGLYNMHGNVWEWCGDWYGEKYYEECRK 339

Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                NP+GP  G+ +V +GG +    +YC   R A RS   P     N+GFR  
Sbjct: 340 KGVVENPQGPKEGSRRVLRGGGWTYYARYC---RSADRSSGGPGYRGNNVGFRLV 391



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/154 (30%), Positives = 64/154 (41%), Gaps = 46/154 (29%)

Query: 7   PPVERYK---DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
           PP  R +   + VL+ G  F MG+ +     + +     V +  F L ++ V+  ++ EF
Sbjct: 149 PPASRPQLPENFVLIKGGQFSMGSPEDEHGHESDETLHEVKVSDFALCRYAVTVGEYLEF 208

Query: 64  VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
                   +A   GD +                                     PVV++S
Sbjct: 209 TEEAKINYDAGTEGDRY-------------------------------------PVVNVS 231

Query: 124 WNDAVAYCTW----RGA--RLPTEAEWEYGCRGG 151
           WNDAVAYC W    RG   RLPTEAEWEY CRGG
Sbjct: 232 WNDAVAYCRWLSEKRGELFRLPTEAEWEYACRGG 265


>gi|443471534|ref|ZP_21061596.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1) [Pseudomonas pseudoalcaligenes
           KF707]
 gi|442901605|gb|ELS27425.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
           generating enzyme 1) [Pseudomonas pseudoalcaligenes
           KF707]
          Length = 333

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 108/217 (49%), Gaps = 20/217 (9%)

Query: 140 TEAEWEYGC---RGGLEN----RLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
           TEAE + G    R  LEN       PW        W HPEG D+       HPV  V+  
Sbjct: 106 TEAERQGGAALFRAPLENAADGSALPWWRYVEGADWRHPEGPDAAPAALNAHPVTLVTLA 165

Query: 187 DAVAYCTWRGARLPTEAEWEYGCR-GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
           DA AY  WRG  LP+EAEWEY  R GG   RL      L   G   AN WQG FP  +  
Sbjct: 166 DAQAYARWRGNELPSEAEWEYAARAGGDSARLGE--APLDAEGRPTANYWQGVFPVQDAG 223

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD--WWNV--HHHPAPSYNPKGPTTGTDK 301
            DG+   AP   +  N  GLY+M+GNVWEWTAD  W  +  H +  P      P     +
Sbjct: 224 EDGFAGLAPSGCFAANGNGLYDMIGNVWEWTADAQWGRLISHANGDPGQLRAVPGGSRPQ 283

Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           V KGGS+LC   YC R+R AAR +   D +  ++GFR
Sbjct: 284 VIKGGSWLCASTYCARYRAAARERQEADLATSHVGFR 320



 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 55/142 (38%), Positives = 74/142 (52%), Gaps = 8/142 (5%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           M L+ G  F  GT         E P   V +  F++D+ EV+  QF  FV+ATGYVTEAE
Sbjct: 53  MALIRGGRFVPGTEGGY---PDERPLGAVEVGDFWIDRTEVTRAQFAAFVAATGYVTEAE 109

Query: 75  KFGDTFVFE-PLLSEEERAKISQVRH----DMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           + G   +F  PL +  + + +   R+    D +  EG D+       HPV  ++  DA A
Sbjct: 110 RQGGAALFRAPLENAADGSALPWWRYVEGADWRHPEGPDAAPAALNAHPVTLVTLADAQA 169

Query: 130 YCTWRGARLPTEAEWEYGCRGG 151
           Y  WRG  LP+EAEWEY  R G
Sbjct: 170 YARWRGNELPSEAEWEYAARAG 191


>gi|417546710|ref|ZP_12197796.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC032]
 gi|421666877|ref|ZP_16106959.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC087]
 gi|421669835|ref|ZP_16109848.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC099]
 gi|400384598|gb|EJP43276.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC032]
 gi|410386349|gb|EKP38820.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC087]
 gi|410387304|gb|EKP39760.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC099]
          Length = 335

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 100/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+    
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYI---- 120

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                       S+ E+ K + V                  HHP     W     Y TW+
Sbjct: 121 ------------SDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|421625272|ref|ZP_16066125.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC098]
 gi|408698941|gb|EKL44426.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC098]
          Length = 335

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/330 (30%), Positives = 137/330 (41%), Gaps = 72/330 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA- 73
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV AT Y+T+A 
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATCYITDAE 124

Query: 74  -EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
            +K    F  +P                               HHP     W     Y T
Sbjct: 125 KQKQAAVFSPDP-------------------------------HHP--QQWWQLKTGY-T 150

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
           WR            G  G L N                      + PV +V+ NDA  Y 
Sbjct: 151 WRTPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYA 180

Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLS 251
            W G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +  
Sbjct: 181 VWLGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTG 239

Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSY 308
            APV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+
Sbjct: 240 IAPVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSF 299

Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           LC E YC R+R ++R     D +A ++GFR
Sbjct: 300 LCAENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|424060770|ref|ZP_17798261.1| hypothetical protein W9K_01884 [Acinetobacter baumannii Ab33333]
 gi|404668722|gb|EKB36631.1| hypothetical protein W9K_01884 [Acinetobacter baumannii Ab33333]
          Length = 335

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 100/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+    
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYI---- 120

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                       S+ E+ K + V                  HHP     W     Y TW+
Sbjct: 121 ------------SDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|445400935|ref|ZP_21430236.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-57]
 gi|444783062|gb|ELX06924.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-57]
          Length = 335

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY     
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNSFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|289749219|ref|ZP_06508597.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
 gi|289689806|gb|EFD57235.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
          Length = 188

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 108/230 (46%), Gaps = 45/230 (19%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           ++V LPG +FRMG+ +       E P   VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4   ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           E        +PL              D   + G+D+         +V       V    W
Sbjct: 61  E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           R         W++           P   W HP G DS I  R  HPVV V++ DAVAY  
Sbjct: 97  R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
           W G RLPTEAEWEY  RGG     + WG+   P G   AN WQG FP  N
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRN 188


>gi|351730981|ref|ZP_08948672.1| hypothetical protein AradN_14464 [Acidovorax radicis N35]
          Length = 341

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 100/330 (30%), Positives = 142/330 (43%), Gaps = 89/330 (26%)

Query: 37  EFPSRNVTL-DAFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEE 90
           E P+  V +  AFYL +HEV+  QFQ F+ A+GY  E+E      +G    ++P  +   
Sbjct: 70  EGPAHTVHIRQAFYLGRHEVTVGQFQRFIEASGYQPESEADGTGGYGYNAQYDPATTRRG 129

Query: 91  RAKISQVRHDMKRFEGLDSTIEHR-------MHHPVVHISWNDAVAYCTWRGA------R 137
            A           FEG D     R         HPVV+I+WNDA A   W         R
Sbjct: 130 DA-----------FEGRDPRYSWRNPGFAQGPDHPVVNITWNDAQALARWLSRTEGHHYR 178

Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
           LPTEAEWEY CR G   R +P G    P+G+           + + +  D  A   W   
Sbjct: 179 LPTEAEWEYACRAGTRTR-YPHGD--DPQGL-----------LKNANTFDQEAAPLW--- 221

Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
                  W+     G +   F                                 TAPV S
Sbjct: 222 -----PRWQQHALAGSDGHAF---------------------------------TAPVGS 243

Query: 258 YKENKFGLYNMVGNVWEWTADW-WNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCY 316
           Y  N FGL++M+GN WEW +DW  + ++  +P  +PKGP  GT +V++GGS+     + +
Sbjct: 244 YAPNAFGLHDMLGNAWEWVSDWHGDTYYAHSPRSDPKGPADGTVRVRRGGSW---HTWSF 300

Query: 317 RHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
             RC  R+ N+P++    +G R   +  PT
Sbjct: 301 YARCGYRNWNSPETRYTLVGMRLVREIAPT 330


>gi|119356730|ref|YP_911374.1| hypothetical protein Cpha266_0902 [Chlorobium phaeobacteroides DSM
           266]
 gi|119354079|gb|ABL64950.1| protein of unknown function DUF323 [Chlorobium phaeobacteroides DSM
           266]
          Length = 586

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 104/342 (30%), Positives = 145/342 (42%), Gaps = 87/342 (25%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R  + VL+ G  F MG+       + +     V +  FYL ++ V+  +F++F+  +GY 
Sbjct: 317 RPGNFVLIRGGEFTMGSPANESGHESDETQHQVKVSDFYLCKYAVTLAEFKKFIEDSGYQ 376

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
           T+AEK G ++                                          SW D +++
Sbjct: 377 TDAEKDGGSY------------------------------------------SW-DGISW 393

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
               G       +W YG  G +                    +   NHPV+HVSWNDAVA
Sbjct: 394 VKNAGV------DWRYGVSGSVRP------------------QSEENHPVLHVSWNDAVA 429

Query: 191 YCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP-TNN 243
           YC W         RLPTEAEWEY CR G     F  G+NLT     +AN + G +P TNN
Sbjct: 430 YCKWISKKTGDAFRLPTEAEWEYACRAGTTTP-FHTGDNLT---TGQAN-YNGNYPYTNN 484

Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTG 298
                  +T  V  +  N +GLY+M GNVWEW  D +   ++          NP GP TG
Sbjct: 485 QKGVYRENTVKVDEFAPNAWGLYHMHGNVWEWCGDRYGDKYYDECKAEGVVENPVGPETG 544

Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           + +V +GG +  N + C   R A R    PD  +   GFR A
Sbjct: 545 SLRVLRGGGWSFNARSC---RSAFRIDVAPDYRSNYAGFRLA 583


>gi|421656586|ref|ZP_16096891.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-72]
 gi|408504913|gb|EKK06643.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-72]
          Length = 335

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY     
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGVA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQKMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|184157258|ref|YP_001845597.1| hypothetical protein ACICU_00938 [Acinetobacter baumannii ACICU]
 gi|332872886|ref|ZP_08440850.1| hypothetical protein HMPREF0022_00449 [Acinetobacter baumannii
           6014059]
 gi|384142326|ref|YP_005525036.1| hypothetical protein [Acinetobacter baumannii MDR-ZJ06]
 gi|385236624|ref|YP_005797963.1| hypothetical protein ABTW07_1067 [Acinetobacter baumannii
           TCDC-AB0715]
 gi|387124838|ref|YP_006290720.1| hypothetical protein ABTJ_02834 [Acinetobacter baumannii MDR-TJ]
 gi|407931955|ref|YP_006847598.1| hypothetical protein M3Q_1275 [Acinetobacter baumannii TYTH-1]
 gi|416148149|ref|ZP_11602198.1| hypothetical protein AB210_2246 [Acinetobacter baumannii AB210]
 gi|417551254|ref|ZP_12202332.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-18]
 gi|417566146|ref|ZP_12217020.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC143]
 gi|417570359|ref|ZP_12221216.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC189]
 gi|417576078|ref|ZP_12226923.1| formylglycine-generating sulfatase enzyme [Acinetobacter baumannii
           Naval-17]
 gi|417869267|ref|ZP_12514259.1| hypothetical protein ABNIH1_03787 [Acinetobacter baumannii ABNIH1]
 gi|417872655|ref|ZP_12517550.1| hypothetical protein ABNIH2_01612 [Acinetobacter baumannii ABNIH2]
 gi|417877076|ref|ZP_12521811.1| hypothetical protein ABNIH3_03897 [Acinetobacter baumannii ABNIH3]
 gi|417880716|ref|ZP_12525185.1| hypothetical protein ABNIH4_00465 [Acinetobacter baumannii ABNIH4]
 gi|421202377|ref|ZP_15659528.1| sulfatase-modifying factor 1 [Acinetobacter baumannii AC12]
 gi|421535518|ref|ZP_15981777.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii AC30]
 gi|421631030|ref|ZP_16071719.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC180]
 gi|421651446|ref|ZP_16091815.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC0162]
 gi|421674948|ref|ZP_16114874.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC065]
 gi|421689319|ref|ZP_16129003.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-143]
 gi|421690825|ref|ZP_16130491.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-116]
 gi|421702779|ref|ZP_16142255.1| hypothetical protein B825_05946 [Acinetobacter baumannii ZWS1122]
 gi|421706529|ref|ZP_16145942.1| hypothetical protein B837_05656 [Acinetobacter baumannii ZWS1219]
 gi|421792548|ref|ZP_16228701.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-2]
 gi|424053300|ref|ZP_17790832.1| hypothetical protein W9G_01989 [Acinetobacter baumannii Ab11111]
 gi|424063038|ref|ZP_17800523.1| hypothetical protein W9M_00321 [Acinetobacter baumannii Ab44444]
 gi|425747841|ref|ZP_18865839.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-348]
 gi|425752687|ref|ZP_18870594.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-113]
 gi|445456304|ref|ZP_21445750.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC047]
 gi|445466288|ref|ZP_21450267.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC338]
 gi|445475293|ref|ZP_21453295.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-78]
 gi|183208852|gb|ACC56250.1| uncharacterized conserved protein [Acinetobacter baumannii ACICU]
 gi|323517122|gb|ADX91503.1| hypothetical protein ABTW07_1067 [Acinetobacter baumannii
           TCDC-AB0715]
 gi|332738897|gb|EGJ69760.1| hypothetical protein HMPREF0022_00449 [Acinetobacter baumannii
           6014059]
 gi|333365156|gb|EGK47170.1| hypothetical protein AB210_2246 [Acinetobacter baumannii AB210]
 gi|342231038|gb|EGT95857.1| hypothetical protein ABNIH1_03787 [Acinetobacter baumannii ABNIH1]
 gi|342233291|gb|EGT98030.1| hypothetical protein ABNIH2_01612 [Acinetobacter baumannii ABNIH2]
 gi|342236436|gb|EGU00958.1| hypothetical protein ABNIH3_03897 [Acinetobacter baumannii ABNIH3]
 gi|342239552|gb|EGU03951.1| hypothetical protein ABNIH4_00465 [Acinetobacter baumannii ABNIH4]
 gi|347592819|gb|AEP05540.1| conserve hypothetical protein [Acinetobacter baumannii MDR-ZJ06]
 gi|385879330|gb|AFI96425.1| hypothetical protein ABTJ_02834 [Acinetobacter baumannii MDR-TJ]
 gi|395550807|gb|EJG16816.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC189]
 gi|395557902|gb|EJG23903.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC143]
 gi|395569299|gb|EJG29961.1| formylglycine-generating sulfatase enzyme [Acinetobacter baumannii
           Naval-17]
 gi|398328332|gb|EJN44459.1| sulfatase-modifying factor 1 [Acinetobacter baumannii AC12]
 gi|400385709|gb|EJP48784.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-18]
 gi|404558699|gb|EKA63980.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-143]
 gi|404563722|gb|EKA68922.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii IS-116]
 gi|404669088|gb|EKB36995.1| hypothetical protein W9G_01989 [Acinetobacter baumannii Ab11111]
 gi|404675040|gb|EKB42765.1| hypothetical protein W9M_00321 [Acinetobacter baumannii Ab44444]
 gi|407193594|gb|EKE64750.1| hypothetical protein B825_05946 [Acinetobacter baumannii ZWS1122]
 gi|407193878|gb|EKE65027.1| hypothetical protein B837_05656 [Acinetobacter baumannii ZWS1219]
 gi|407900536|gb|AFU37367.1| hypothetical protein M3Q_1275 [Acinetobacter baumannii TYTH-1]
 gi|408508056|gb|EKK09743.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC0162]
 gi|408695196|gb|EKL40752.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC180]
 gi|409986360|gb|EKO42554.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii AC30]
 gi|410382963|gb|EKP35497.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC065]
 gi|410400128|gb|EKP52308.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-2]
 gi|425492880|gb|EKU59132.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-348]
 gi|425498918|gb|EKU64984.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-113]
 gi|444778099|gb|ELX02118.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii OIFC338]
 gi|444778250|gb|ELX02268.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC047]
 gi|444778957|gb|ELX02952.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii Naval-78]
          Length = 335

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY     
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|421694783|ref|ZP_16134400.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-692]
 gi|404567018|gb|EKA72146.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-692]
          Length = 335

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY     
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|239502993|ref|ZP_04662303.1| hypothetical protein AbauAB_11844 [Acinetobacter baumannii AB900]
 gi|421679867|ref|ZP_16119735.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC111]
 gi|410390686|gb|EKP43069.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC111]
          Length = 335

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY     
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|260555909|ref|ZP_05828129.1| sulfatase-modifying factor 1 [Acinetobacter baumannii ATCC 19606 =
           CIP 70.34]
 gi|260410820|gb|EEX04118.1| sulfatase-modifying factor 1 [Acinetobacter baumannii ATCC 19606 =
           CIP 70.34]
 gi|452954321|gb|EME59725.1| hypothetical protein G347_03445 [Acinetobacter baumannii MSP4-16]
          Length = 335

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY     
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|384130923|ref|YP_005513535.1| hypothetical protein [Acinetobacter baumannii 1656-2]
 gi|322507143|gb|ADX02597.1| Putative uncharacterized protein [Acinetobacter baumannii 1656-2]
          Length = 322

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY     
Sbjct: 53  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 106

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 107 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 139

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 140 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 169

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 170 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 228

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 229 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 288

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 289 AENYCSRYRNSSRYPQDFDLAATHVGFR 316


>gi|46446714|ref|YP_008079.1| hypothetical protein pc1080 [Candidatus Protochlamydia amoebophila
           UWE25]
 gi|46400355|emb|CAF23804.1| conserved hypothetical protein [Candidatus Protochlamydia
           amoebophila UWE25]
          Length = 655

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 69/169 (40%), Positives = 91/169 (53%), Gaps = 24/169 (14%)

Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
           HPVV V+W  A+AY  W G RLPTEAEWE   RGG EN L+P G  +    + +AN +  
Sbjct: 503 HPVVGVTWYGAIAYAKWIGKRLPTEAEWEIAARGGQENVLYPTGEEIE---KTQANFFSS 559

Query: 238 EFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----PSYNPK 293
           +             T  VMSY  N  GLY+M GNV+EW  DW+  +++      P + P+
Sbjct: 560 D-------------TTTVMSYAPNNCGLYDMAGNVYEWCHDWYGYNYYELSMQEPDF-PQ 605

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RC+ R +N P +  G  GFRCAAD
Sbjct: 606 GPLQGVYRVLRGG---CWKSLKEDLRCSRRHRNNPGTVNGTYGFRCAAD 651



 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 73/151 (48%), Gaps = 26/151 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDG---EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           DMV++ G  F  G+      +DG   E P   +TL +F LD H V+N QF  F+   G  
Sbjct: 418 DMVIIEGGNFYRGS------QDGNRDEMPRHQITLSSFALDVHPVTNEQFVRFLEVMG-- 469

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHD-MKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
              EK            +     I ++R   +KR  G  S       HPVV ++W  A+A
Sbjct: 470 --GEK------------DSNHNDIIRLRDSRIKRSGGKLSIESGYAKHPVVGVTWYGAIA 515

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
           Y  W G RLPTEAEWE   RGG EN L+P G
Sbjct: 516 YAKWIGKRLPTEAEWEIAARGGQENVLYPTG 546


>gi|302039207|ref|YP_003799529.1| hypothetical protein NIDE3934 [Candidatus Nitrospira defluvii]
 gi|300607271|emb|CBK43604.1| conserved exported protein of unknown function, Sulfatase-modifying
           factor-like [Candidatus Nitrospira defluvii]
          Length = 320

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 71/179 (39%), Positives = 98/179 (54%), Gaps = 13/179 (7%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           +HP++HV+W DA AYC W G RLPTEAEWE   R G++ RLFPWG+     G  RAN  +
Sbjct: 140 HHPIMHVNWYDADAYCKWAGKRLPTEAEWEKAAR-GVDGRLFPWGSEYA--GPTRANFGR 196

Query: 237 -GEFPTNNTAADGYLSTAPVMSYKE-----NKFGLYNMVGNVWEWTADWWNV-HHHPAPS 289
            G         +  L   P++S  +     + +GLY  +GNV EW +DW++  ++  AP 
Sbjct: 197 TGLSGPVRDRPERLLLYPPIISVDKYENALSPYGLYQTIGNVAEWVSDWYDQDYYKTAPG 256

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
            NPKGP TGT K  +GG ++ +       R A R+   P +    +GFRCA D     G
Sbjct: 257 RNPKGPETGTQKAFRGGGWMDSTTTM---RAAMRNGTDPKTKINWMGFRCAQDAKEDAG 312



 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 57/151 (37%), Positives = 80/151 (52%), Gaps = 31/151 (20%)

Query: 13  KDMVLLPGDTFRMGTNKPI--LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           ++MVL+P   F MG++K    L    E P R+V LDAF + ++EV+  ++ +FV AT   
Sbjct: 63  EEMVLIPAGEFLMGSDKKTDRLAYRSEIPQRSVYLDAFMIGKYEVTALEYLKFVLAT--- 119

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                              +R      R+D   F+      +   HHP++H++W DA AY
Sbjct: 120 -------------------DRLPQLDWRYDGGNFQ------DTMAHHPIMHVNWYDADAY 154

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
           C W G RLPTEAEWE   R G++ RLFPWGS
Sbjct: 155 CKWAGKRLPTEAEWEKAAR-GVDGRLFPWGS 184


>gi|421786383|ref|ZP_16222786.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-82]
 gi|445454532|ref|ZP_21445454.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-A-92]
 gi|410412861|gb|EKP64709.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii Naval-82]
 gi|444752530|gb|ELW77214.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-A-92]
          Length = 335

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/328 (29%), Positives = 137/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY     
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            + YC R+R ++R     D +A ++GFR
Sbjct: 302 AKNYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|421661394|ref|ZP_16101570.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC110]
 gi|408715806|gb|EKL60928.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC110]
          Length = 335

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/328 (29%), Positives = 136/328 (41%), Gaps = 68/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ E +  QF  FV ATGY     
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEATVAQFASFVKATGY----- 119

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                      +++ E+ K + V                  HHP     W     Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
                       G  G L N                      + PV +V+ NDA  Y  W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
            G  LPTE EWEY  +        P     T   +H +AN WQG FP  N   D +   A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241

Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
           PV  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            E YC R+R ++R     D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|262279922|ref|ZP_06057707.1| conserved hypothetical protein [Acinetobacter calcoaceticus
           RUH2202]
 gi|262260273|gb|EEY79006.1| conserved hypothetical protein [Acinetobacter calcoaceticus
           RUH2202]
          Length = 334

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 70/182 (38%), Positives = 97/182 (53%), Gaps = 6/182 (3%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W  P G +  + +  N PV +VS NDA  Y  W G  LPTE EWEY  +   +    P 
Sbjct: 149 TWKTPNGDNGAVANP-NEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKANSQTDT-PL 206

Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
               T   +H +AN WQGEFP  N   D +   APV  +  N F L++M+GNVWEWT+  
Sbjct: 207 HQAPTDEHQHPQANYWQGEFPFQNLNQDHFKDVAPVGCFASNHFKLFDMIGNVWEWTSSP 266

Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +   H     +Y+   +   T T  V KGGS+LC + YC R+R ++R     D +A ++G
Sbjct: 267 YQGAHDQHMGNYSDLRQQQITSTQYVIKGGSFLCAQNYCSRYRSSSRYPQDFDLAATHVG 326

Query: 337 FR 338
           FR
Sbjct: 327 FR 328


>gi|299771125|ref|YP_003733151.1| hypothetical protein AOLE_14455 [Acinetobacter oleivorans DR1]
 gi|298701213|gb|ADI91778.1| hypothetical protein AOLE_14455 [Acinetobacter oleivorans DR1]
          Length = 334

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 98/348 (28%), Positives = 141/348 (40%), Gaps = 81/348 (23%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFP--------------SRNVTLDAFYLDQHE 54
           +E+ K+   LP    +  T   +LI DG F                +   +  F++DQ E
Sbjct: 44  IEKCKNYTGLPQGWLKQPTAGMVLITDGHFNFGSEKAYPDELNFGKKQREVKGFWIDQTE 103

Query: 55  VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
           V+  QF  F+ ATGY                +++ E+ K + V                 
Sbjct: 104 VTVAQFASFIKATGY----------------ITDAEKQKQAAV-------------FSPD 134

Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH 174
             HP     W    +  TW+            G +G L N                    
Sbjct: 135 PQHPQ---QWWQLKSGYTWKTPN---------GSKGALANP------------------- 163

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RAN 233
             N PV +VS NDA  Y  W G  LPTE EWEY  +   +    P     T   +H ++N
Sbjct: 164 --NEPVRYVSKNDAEHYAVWLGHDLPTELEWEYAAKANSKTDT-PLHQAPTDEHQHPQSN 220

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP 292
            WQGEFP  N   D +   APV  +  N F L++ +GNVWEWT+  +   H     +Y+ 
Sbjct: 221 YWQGEFPFQNLNQDHFKDVAPVGCFAANNFKLFDTIGNVWEWTSSPYQGAHDQHMGNYSD 280

Query: 293 --KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
             +     T  V KGGS+LC + YC R+R ++R     D +A ++GFR
Sbjct: 281 LRQQQIASTQYVIKGGSFLCAQNYCSRYRNSSRYPQDFDLAATHVGFR 328


>gi|375133874|ref|YP_004994524.1| protein kinase [Acinetobacter calcoaceticus PHEA-2]
 gi|325121319|gb|ADY80842.1| protein kinase [Acinetobacter calcoaceticus PHEA-2]
          Length = 303

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/183 (39%), Positives = 96/183 (52%), Gaps = 6/183 (3%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W  P G   TI    N PV +VS NDA  Y  W G  LPTE EWEY  +   +    P 
Sbjct: 118 TWKTPNGSTGTIA-TPNEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PL 175

Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
               T   +H +AN WQGEFP  N   D +   APV  +  N F L++M+GNVWEWT+  
Sbjct: 176 HQAPTDEHQHPQANYWQGEFPFQNLNQDHFKGVAPVGCFTANHFKLFDMIGNVWEWTSSP 235

Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +   H     +Y+   +   T T  V KGGS+LC + YC R+R ++R     D +A ++G
Sbjct: 236 YQGAHDQHMGNYSDLRQQQITSTQYVIKGGSFLCAQNYCSRYRSSSRYPQDFDLAATHVG 295

Query: 337 FRC 339
           FR 
Sbjct: 296 FRT 298



 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 50/156 (32%), Positives = 72/156 (46%), Gaps = 15/156 (9%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPILIKDGEFP--------------SRNVTLDAFYLDQHE 54
           +E+ ++   LP    +  T   +LI DG+F                +   +  F++DQ E
Sbjct: 13  IEKCQNYTGLPQGWLKQTTAGMVLIADGDFNFGSEKAYPDELNFGKKQREVKGFWIDQTE 72

Query: 55  VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
           V+  QF  FV ATGY+T+AEK     VF P     ++    +  +  K   G   TI   
Sbjct: 73  VTVAQFASFVKATGYITDAEKQKQAAVFSPDPDHPQQWWQLKSGYTWKTPNGSTGTIA-T 131

Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRG 150
            + PV ++S NDA  Y  W G  LPTE EWEY  + 
Sbjct: 132 PNEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKA 167


>gi|302039449|ref|YP_003799771.1| hypothetical protein NIDE4178 [Candidatus Nitrospira defluvii]
 gi|300607513|emb|CBK43846.1| conserved exported protein of unknown function, Sulfatase-modifying
           factor-like [Candidatus Nitrospira defluvii]
          Length = 308

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 74/198 (37%), Positives = 107/198 (54%), Gaps = 15/198 (7%)

Query: 152 LENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRG 211
           L + L P   W +  G  +  E  ++HPV+HVSW DA AYC W G RLPTEAEWE   RG
Sbjct: 106 LASDLVPLIDWQYDGG--NFQETMVSHPVMHVSWFDAEAYCKWAGKRLPTEAEWEKAARG 163

Query: 212 GLENRLFPWGNNLTPRGEHRANVWQ-GEFPTNNTAADGYLSTAPVMSYKE-----NKFGL 265
             + R++PWGN +   G  RAN  + G         +  L   P++S  +     + +G+
Sbjct: 164 E-DGRIYPWGNQMA--GLSRANFGRTGLSGPVRDRPERLLLYPPIISVNKYDNAASPYGV 220

Query: 266 YNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARS 324
           + M GNV EW ADW++  ++   P  NPKGP TG+ +  +GG ++ +       R A R+
Sbjct: 221 FQMAGNVAEWVADWYDPKYYTTGPDKNPKGPETGSQRSFRGGGWIDSTPSV---RVAQRN 277

Query: 325 QNTPDSSAGNLGFRCAAD 342
              P++    +GFRCA D
Sbjct: 278 GTDPNTKMNWMGFRCARD 295



 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 55/152 (36%), Positives = 81/152 (53%), Gaps = 31/152 (20%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKD--GEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           + +M+ +P   F MG+++ +       EFP R V LDA+ +D++EV+  QF +FV A+  
Sbjct: 51  HDEMIRIPAGEFLMGSDRKVDKNSYLAEFPQRKVYLDAYEIDKYEVTTVQFLKFVLASDL 110

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           V             PL+  +         +D   F+      E  + HPV+H+SW DA A
Sbjct: 111 V-------------PLIDWQ---------YDGGNFQ------ETMVSHPVMHVSWFDAEA 142

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
           YC W G RLPTEAEWE   RG  + R++PWG+
Sbjct: 143 YCKWAGKRLPTEAEWEKAARGE-DGRIYPWGN 173


>gi|427424482|ref|ZP_18914605.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-136]
 gi|425698782|gb|EKU68415.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-136]
          Length = 334

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/182 (39%), Positives = 96/182 (52%), Gaps = 6/182 (3%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W  P G    I    N PV +VS NDA  Y  W G  LPTE EWEY  +   +    P 
Sbjct: 149 TWKTPNGSMGAIA-TPNEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PL 206

Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
               T   +H +AN WQGEFP  N   D +   APV  +  NKF L++M+GNVWEWT+  
Sbjct: 207 HQPPTDEHQHPQANYWQGEFPFQNLNQDHFKGVAPVGCFAANKFKLFDMIGNVWEWTSSP 266

Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +   H     +Y+   +   T T  V KGGS+LC + YC R+R ++R     D +A ++G
Sbjct: 267 YQGAHDQHMGNYSDLRQQQITSTQYVIKGGSFLCAQNYCSRYRSSSRYPQDFDLAATHVG 326

Query: 337 FR 338
           FR
Sbjct: 327 FR 328


>gi|194336182|ref|YP_002017976.1| hypothetical protein Ppha_1079 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194308659|gb|ACF43359.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 267

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 76/174 (43%), Positives = 99/174 (56%), Gaps = 20/174 (11%)

Query: 177 NHPVVHVSWNDAVAYC------TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           N PVV+VSWNDAVAYC      T +  RLPTEAEWEY CR G     F  G NLT     
Sbjct: 97  NDPVVNVSWNDAVAYCKALSAKTGKRCRLPTEAEWEYACRAG-SRTPFNTGQNLT---TG 152

Query: 231 RANVWQGEFPTNNTAADGY-LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---- 285
           +AN + G +P N      Y     PV ++  N++GLYNM GNVWEW +DW++  ++    
Sbjct: 153 QAN-YNGNYPYNGNPKGVYRQKIVPVNTFAPNRWGLYNMHGNVWEWCSDWYSETYYDECK 211

Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                 NP GP+T + +V +GG +  + +YC   R A RS NTPD  +  +GFR
Sbjct: 212 AQGVVTNPTGPSTRSRRVIRGGGWSDDAEYC---RSAYRSDNTPDYRSNGVGFR 262



 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 49/154 (31%), Positives = 68/154 (44%), Gaps = 46/154 (29%)

Query: 4   LPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
           LPA  +E     VL+PG TF MG+    +          V +  FY+ ++EV+  +F++F
Sbjct: 25  LPAKAME----FVLIPGGTFTMGSPAGEVGHQNNETQHQVKVSDFYIAEYEVTVAEFRKF 80

Query: 64  VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
           V+A GY                                      +   +   + PVV++S
Sbjct: 81  VAAKGYQ------------------------------------WNEGTQGEANDPVVNVS 104

Query: 124 WNDAVAYC------TWRGARLPTEAEWEYGCRGG 151
           WNDAVAYC      T +  RLPTEAEWEY CR G
Sbjct: 105 WNDAVAYCKALSAKTGKRCRLPTEAEWEYACRAG 138


>gi|403052604|ref|ZP_10907088.1| hypothetical protein AberL1_13896 [Acinetobacter bereziniae LMG
           1003]
          Length = 331

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 93/327 (28%), Positives = 138/327 (42%), Gaps = 66/327 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P  +F++G+N+     +  F ++   +++F++DQ EV+  QF  FV ATGY+T+AE
Sbjct: 62  MVRIPKGSFQLGSNQ-AYPDEINFGAKQRQVESFWIDQTEVTVAQFSSFVKATGYITDAE 120

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K     VF P     ++    +  +  K   G           PV +++ NDA  Y  W 
Sbjct: 121 KQHQAAVFSPNKQTPQQWWQLKADYTWKSPNGAKGQAPLPA-EPVRYVTQNDAEHYAVWL 179

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G  LPTE EWEY  + G                +  T E    HP  +           W
Sbjct: 180 GHDLPTEQEWEYAAKAGSTTDT----------PLHQTPEDAHQHPQANY----------W 219

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +G                     FP+ N                     T AD +   AP
Sbjct: 220 QGE--------------------FPFNN---------------------TQADHFEGVAP 238

Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNP--KGPTTGTDKVKKGGSYLCN 311
           V  Y  N F L++M+GNVWEWT+  +   H     +Y    +    GT  V KGGS+LC 
Sbjct: 239 VGCYAANAFKLFDMIGNVWEWTSSIYRGAHDQHMGNYADLRQQRIAGTQYVIKGGSFLCA 298

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFR 338
             +C R+R ++R     D +A ++GFR
Sbjct: 299 SNFCSRYRNSSRYPQEFDLAATHVGFR 325



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 71/213 (33%), Positives = 93/213 (43%), Gaps = 53/213 (24%)

Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR---ANVW 235
           PV +V+ NDA  Y  W G  LPTE EWEY  + G          + TP   H+   AN W
Sbjct: 163 PVRYVTQNDAEHYAVWLGHDLPTEQEWEYAAKAGSTTDT---PLHQTPEDAHQHPQANYW 219

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP 295
           QGEFP NNT AD +   APV  Y  N F L++M+GNVWEWT+  +   H           
Sbjct: 220 QGEFPFNNTQADHFEGVAPVGCYAANAFKLFDMIGNVWEWTSSIYRGAH----------- 268

Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKG 355
                       ++ N     + R A                          GT  V KG
Sbjct: 269 ----------DQHMGNYADLRQQRIA--------------------------GTQYVIKG 292

Query: 356 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 388
           GS+LC   +C R+R ++R     D +A ++GFR
Sbjct: 293 GSFLCASNFCSRYRNSSRYPQEFDLAATHVGFR 325


>gi|294651296|ref|ZP_06728619.1| cysteine-type sulfatase aerobic maturase domain protein, partial
           [Acinetobacter haemolyticus ATCC 19194]
 gi|292822809|gb|EFF81689.1| cysteine-type sulfatase aerobic maturase domain protein
           [Acinetobacter haemolyticus ATCC 19194]
          Length = 153

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 67/154 (43%), Positives = 83/154 (53%), Gaps = 11/154 (7%)

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTP--RGEHRANVWQGEFPTNNTAAD 247
           AY  W G  LPTE EWEY  +G      F   +++ P  +G   AN WQGEFP  NT  D
Sbjct: 1   AYAHWLGHDLPTELEWEYAAKG------FQQQSDIGPTHQGHTTANYWQGEFPYQNTQQD 54

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKK 304
           G+   APV  +K N FGLY+M+GNVWEWT   +     HH   P+     P +      K
Sbjct: 55  GFNEVAPVGCFKANPFGLYDMIGNVWEWTQTPYTGVRDHHMGDPATYRHDPNSILQYTIK 114

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           GGSYLC   YC R+R AAR     D +  ++GFR
Sbjct: 115 GGSYLCASNYCARYRAAARHPQELDLATSHVGFR 148


>gi|193076705|gb|ABO11408.2| hypothetical protein A1S_0976 [Acinetobacter baumannii ATCC 17978]
          Length = 335

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 97/327 (29%), Positives = 142/327 (43%), Gaps = 66/327 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+T+AE
Sbjct: 66  MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K     VF P     ++    +  +  K   G + T+ +  H PV +++ NDA  Y  W 
Sbjct: 125 KQKQAAVFSPDPHHPQQWWQLKTGYTWKTPNGKNGTLPNP-HEPVRYVTKNDAEHYAVWL 183

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G  LPTE EWEY  +        P  + L+    D   EH+  HP  +           W
Sbjct: 184 GRDLPTELEWEYAAKANS-----PTDTPLYQAPTD---EHQ--HPQANY----------W 223

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +GA                    FP+                      N   D +   AP
Sbjct: 224 QGA--------------------FPF---------------------ENLNQDHFTGIAP 242

Query: 255 VMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCN 311
           V  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC 
Sbjct: 243 VGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLCA 302

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFR 338
           E YC R+R ++R     D +A ++GFR
Sbjct: 303 ENYCSRYRNSSRYPQDFDLAATHVGFR 329



 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 73/233 (31%), Positives = 100/233 (42%), Gaps = 58/233 (24%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGG--LENRLF 218
           +W  P G + T+ +  + PV +V+ NDA  Y  W G  LPTE EWEY  +     +  L+
Sbjct: 150 TWKTPNGKNGTLPNP-HEPVRYVTKNDAEHYAVWLGRDLPTELEWEYAAKANSPTDTPLY 208

Query: 219 PWGNNLTPRGEHR---ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
                  P  EH+   AN WQG FP  N   D +   APV  +K N F L++M+GNVWEW
Sbjct: 209 -----QAPTDEHQHPQANYWQGAFPFENLNQDHFTGIAPVGCFKPNGFKLFDMIGNVWEW 263

Query: 276 TADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
           T+  +   H                                            D   GN 
Sbjct: 264 TSSPYQGAH--------------------------------------------DQHMGNY 279

Query: 336 GFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 388
               A  +   + T  V KGGS+LC E YC R+R ++R     D +A ++GFR
Sbjct: 280 ---SALRQQEMSSTQYVIKGGSFLCAENYCSRYRNSSRYPQDFDLAATHVGFR 329


>gi|194337011|ref|YP_002018805.1| hypothetical protein Ppha_1976 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194309488|gb|ACF44188.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 319

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 76/174 (43%), Positives = 99/174 (56%), Gaps = 20/174 (11%)

Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           NHPV+HVSWNDA AYC W         RLPTEAEWE+ CR G E + F  G+NLT     
Sbjct: 149 NHPVLHVSWNDAEAYCAWLSKKTGNTFRLPTEAEWEFACRAGSE-KPFHTGDNLTT---G 204

Query: 231 RANVWQGEFPTNNTAADGYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---- 285
           +AN + G +P NN+    +  +T PV S + N +GLYNM GNVWEW +D +  +++    
Sbjct: 205 QAN-YNGNYPFNNSVKGIFRKNTLPVDSLEPNVWGLYNMHGNVWEWCSDMYGKNYYDECR 263

Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                 NP+GPTTG+ +V +GG +     YC   R A R  N P      +GFR
Sbjct: 264 AKGTVENPEGPTTGSHRVIRGGGWNYVAVYC---RSAYRDNNAPGYRYDLVGFR 314



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 16/148 (10%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           + + + GD F MG+ +       +    +V +  + L +  V    F+ F+  +GY T+A
Sbjct: 53  NYIEVKGDVFVMGSPENEAGHREDERQHSVKIADYSLYKFAVCVGDFKRFIDDSGYKTDA 112

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH--RMHHPVVHISWNDAVAYC 131
           EK   +FV++         K  + +  +     L   +      +HPV+H+SWNDA AYC
Sbjct: 113 EKENFSFVYD--------GKSGKRKDGINWSHDLSGNVREPSAYNHPVLHVSWNDAEAYC 164

Query: 132 TWRGA------RLPTEAEWEYGCRGGLE 153
            W         RLPTEAEWE+ CR G E
Sbjct: 165 AWLSKKTGNTFRLPTEAEWEFACRAGSE 192


>gi|110598764|ref|ZP_01387025.1| Protein of unknown function DUF323 [Chlorobium ferrooxidans DSM
           13031]
 gi|110339628|gb|EAT58142.1| Protein of unknown function DUF323 [Chlorobium ferrooxidans DSM
           13031]
          Length = 332

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 102/341 (29%), Positives = 140/341 (41%), Gaps = 102/341 (29%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           ++ V +PG  F MG+      +        V L  FY+ ++ V++ +F++FV ATGY+T+
Sbjct: 78  ENFVFIPGGEFTMGSPLSEANRQINETQHQVRLSDFYISRYPVTSAEFRKFVKATGYLTD 137

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE--HRMHHPVVHISWNDAVAY 130
           AEK  D                   RH      G+  T+      +HPVV++SWNDAVAY
Sbjct: 138 AEKANDQ---------------KHWRH------GVSGTLRPNSEDNHPVVYMSWNDAVAY 176

Query: 131 CTW------RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
           C W      R  RLPTEAE EY CR G      P+ S        S    R N+      
Sbjct: 177 CNWISQHTGRKFRLPTEAEREYACRAGTAT---PFNSG------QSLTTERANY------ 221

Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
                                         N   P GN   P+G +R N           
Sbjct: 222 ------------------------------NGQNPSGNK--PKGVYREN----------- 238

Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGT 299
                  T  V S+  N +GLYNM GNVWEW +DW++   +          NP GP  G+
Sbjct: 239 -------TVAVNSFAPNAWGLYNMHGNVWEWCSDWFDPAFYEQCKAAGTVTNPDGPEDGS 291

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            +V +G S+   E++C   R A R    P +   N+GFR  
Sbjct: 292 RRVLRGASWFDFEEHC---RSAYRIGGAPLARNPNVGFRLV 329


>gi|119357294|ref|YP_911938.1| hypothetical protein Cpha266_1490 [Chlorobium phaeobacteroides DSM
           266]
 gi|119354643|gb|ABL65514.1| protein of unknown function DUF323 [Chlorobium phaeobacteroides DSM
           266]
          Length = 742

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 104/337 (30%), Positives = 140/337 (41%), Gaps = 87/337 (25%)

Query: 16  VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
           VL+ GD F MG+ K  + +        V L  +YL ++ V+   F+ F    GY      
Sbjct: 478 VLIRGDEFTMGSPKHEVDRSDNETEHQVKLSDYYLCKYAVTVADFKRFAEEIGYT----- 532

Query: 76  FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRG 135
                      ++ E A  S V      F+G                +WN  V       
Sbjct: 533 -----------TDAETAGSSSV------FDGK---------------AWNKKVG------ 554

Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW- 194
                   W +G                  E + S  E+  NHPV+HVSWNDAVAY  W 
Sbjct: 555 ------INWRHGV----------------SESLRSAGEY--NHPVLHVSWNDAVAYAGWL 590

Query: 195 -----RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
                +  RLPTEAEWEY CR G +   F  G NLT     +AN + G +P N      +
Sbjct: 591 SLQTGKSFRLPTEAEWEYACRAG-KGEPFSTGENLTT---DQAN-YDGNYPYNGNRKGVF 645

Query: 250 LSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVK 303
             T       E N  GLYNM GNVWEW +D +   ++          NP GP TG+ +V 
Sbjct: 646 RGTTIACDALEPNALGLYNMHGNVWEWCSDRYGDKYYDECKAKGIVENPVGPETGSLRVL 705

Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           +GGS+    + C   R A R  ++PD  + + GFR A
Sbjct: 706 RGGSWFNYARIC---RSAYRDGDSPDYRSDDAGFRLA 739


>gi|126641026|ref|YP_001084010.1| hypothetical protein A1S_0976 [Acinetobacter baumannii ATCC 17978]
          Length = 270

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 97/328 (29%), Positives = 142/328 (43%), Gaps = 66/328 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+T+AE
Sbjct: 1   MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 59

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K     VF P     ++    +  +  K   G + T+ +  H PV +++ NDA  Y  W 
Sbjct: 60  KQKQAAVFSPDPHHPQQWWQLKTGYTWKTPNGKNGTLPNP-HEPVRYVTKNDAEHYAVWL 118

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
           G  LPTE EWEY  +        P  + L+    D   EH+  HP  +           W
Sbjct: 119 GRDLPTELEWEYAAKANS-----PTDTPLYQAPTD---EHQ--HPQANY----------W 158

Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
           +GA                    FP+ N                        D +   AP
Sbjct: 159 QGA--------------------FPFEN---------------------LNQDHFTGIAP 177

Query: 255 VMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCN 311
           V  +K N F L++M+GNVWEWT+  +   H     +Y+   +   + T  V KGGS+LC 
Sbjct: 178 VGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLCA 237

Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           E YC R+R ++R     D +A ++GFR 
Sbjct: 238 ENYCSRYRNSSRYPQDFDLAATHVGFRT 265



 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 73/234 (31%), Positives = 100/234 (42%), Gaps = 58/234 (24%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGG--LENRLF 218
           +W  P G + T+ +  + PV +V+ NDA  Y  W G  LPTE EWEY  +     +  L+
Sbjct: 85  TWKTPNGKNGTLPNP-HEPVRYVTKNDAEHYAVWLGRDLPTELEWEYAAKANSPTDTPLY 143

Query: 219 PWGNNLTPRGEHR---ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
                  P  EH+   AN WQG FP  N   D +   APV  +K N F L++M+GNVWEW
Sbjct: 144 -----QAPTDEHQHPQANYWQGAFPFENLNQDHFTGIAPVGCFKPNGFKLFDMIGNVWEW 198

Query: 276 TADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
           T+  +   H                                            D   GN 
Sbjct: 199 TSSPYQGAH--------------------------------------------DQHMGNY 214

Query: 336 GFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 389
               A  +   + T  V KGGS+LC E YC R+R ++R     D +A ++GFR 
Sbjct: 215 ---SALRQQEMSSTQYVIKGGSFLCAENYCSRYRNSSRYPQDFDLAATHVGFRT 265


>gi|406672268|ref|ZP_11079494.1| hypothetical protein HMPREF9700_00036 [Bergeyella zoohelcum CCUG
           30536]
 gi|405587956|gb|EKB61674.1| hypothetical protein HMPREF9700_00036 [Bergeyella zoohelcum CCUG
           30536]
          Length = 741

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 80/214 (37%), Positives = 104/214 (48%), Gaps = 29/214 (13%)

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
           A+   RG ++     W  G   G E ++   GSW   +G       + NHPVVHV+W  A
Sbjct: 553 AFLNARGNQVTNGVRWYQGGSSG-EIQVVS-GSWKAKQG-------KENHPVVHVTWYGA 603

Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN--VWQGEFPTNNTAA 246
            AY  W G RLPTEAEWEY  RGG  +  F +       G + A    W G+   +N   
Sbjct: 604 KAYAEWVGGRLPTEAEWEYAARGGKYSLGFFYA------GSNNAGDVAWYGDNSGSN--- 654

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
                T PV   + N+ GLY+M GNVWEW +DW+  +   A S NP GPTTG+ +V +GG
Sbjct: 655 -----THPVGQKQPNELGLYDMSGNVWEWCSDWYGAYSGNAQS-NPTGPTTGSYRVLRGG 708

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            +     Y    R A R+   P     N GFR  
Sbjct: 709 GFYYGATYT---RVARRNYYDPSYQNYNYGFRVV 739



 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 49/137 (35%), Positives = 68/137 (49%), Gaps = 18/137 (13%)

Query: 16  VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG-YVTEAE 74
           VL+ G TF MG+       D E P   VTL  + + ++E++N Q+  F++A G  VT   
Sbjct: 508 VLVEGGTFWMGSPNNEGETD-ERPIHQVTLSNYRMSKYEITNAQYAAFLNARGNQVTNGV 566

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           ++                +I  V    K  +G       + +HPVVH++W  A AY  W 
Sbjct: 567 RW---------YQGGSSGEIQVVSGSWKAKQG-------KENHPVVHVTWYGAKAYAEWV 610

Query: 135 GARLPTEAEWEYGCRGG 151
           G RLPTEAEWEY  RGG
Sbjct: 611 GGRLPTEAEWEYAARGG 627


>gi|194336983|ref|YP_002018777.1| hypothetical protein Ppha_1944 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194309460|gb|ACF44160.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 829

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/195 (38%), Positives = 102/195 (52%), Gaps = 23/195 (11%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLE 214
           +W H          + +HPV+HVSWNDAVAYC W      +  RLPTEAEWEY CR    
Sbjct: 640 NWRHDVSGKKRPSDQYSHPVLHVSWNDAVAYCKWMTDKTGKTFRLPTEAEWEYACRAKTT 699

Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKE----NKFGLYNMVG 270
              F  G NLT     +AN + G +P NN A   Y  T   +   E    N++GL++M G
Sbjct: 700 TP-FNTGENLTT---DQAN-YDGNYPYNNNAKGRYRETTVAVGSSERFEPNQWGLFHMHG 754

Query: 271 NVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQ 325
           NVWEW +DW+  +++          NP GP TG+ +V +GG +  + + C   R A R  
Sbjct: 755 NVWEWCSDWYGGNYYEECKTKGTVENPVGPETGSGRVLRGGGWNGSARGC---RSAYRGS 811

Query: 326 NTPDSSAGNLGFRCA 340
           + PD  + + GFR A
Sbjct: 812 DAPDYRSRSAGFRLA 826



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 49/144 (34%), Positives = 76/144 (52%), Gaps = 13/144 (9%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSR-NVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + V +P   F MG+ +    + G   ++  V +  FYL ++ V+  +F+ F+  +GY T+
Sbjct: 559 NFVRIPAGEFAMGSPEGEPERYGSDETQHQVKVSEFYLCKYAVTPAEFKTFIDESGYQTD 618

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEK   + +++    E +  +    RHD+            +  HPV+H+SWNDAVAYC 
Sbjct: 619 AEKANSSGIWDG--KEWKDKEGVNWRHDVSG----KKRPSDQYSHPVLHVSWNDAVAYCK 672

Query: 133 W------RGARLPTEAEWEYGCRG 150
           W      +  RLPTEAEWEY CR 
Sbjct: 673 WMTDKTGKTFRLPTEAEWEYACRA 696


>gi|260551202|ref|ZP_05825405.1| sulfatase-modifying factor 1 [Acinetobacter sp. RUH2624]
 gi|424056424|ref|ZP_17793945.1| hypothetical protein W9I_02794 [Acinetobacter nosocomialis Ab22222]
 gi|425740965|ref|ZP_18859124.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-487]
 gi|260405807|gb|EEW99296.1| sulfatase-modifying factor 1 [Acinetobacter sp. RUH2624]
 gi|407441464|gb|EKF47970.1| hypothetical protein W9I_02794 [Acinetobacter nosocomialis Ab22222]
 gi|425493812|gb|EKU60036.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii WC-487]
          Length = 335

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 69/182 (37%), Positives = 97/182 (53%), Gaps = 6/182 (3%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W  P G +  + H  + PV +V+ NDA  Y  W G  LPTE EWEY  +   +    P 
Sbjct: 150 TWKTPNGKNGALPHP-HEPVRYVTKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PL 207

Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
               T   +H +AN WQG FP  N   D +   APV  +K N F L++M+GNVWEWT+  
Sbjct: 208 HQAPTDEHQHPQANYWQGTFPFENLNHDHFTGVAPVGCFKSNGFKLFDMIGNVWEWTSSP 267

Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +   H     +Y+   +   + T  V KGGS+LC E YC R+R ++R     D +A ++G
Sbjct: 268 YQGAHDQHMGNYSDLRQKEMSSTQYVIKGGSFLCAENYCSRYRNSSRYPQDFDLAATHVG 327

Query: 337 FR 338
           FR
Sbjct: 328 FR 329



 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 2/136 (1%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+T+AE
Sbjct: 66  MVFIPDGQFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K     VF P     ++    +  +  K   G +  + H  H PV +++ NDA  Y  W 
Sbjct: 125 KQKQAAVFTPDPRHPQQWWQLKSGYTWKTPNGKNGALPHP-HEPVRYVTKNDAEHYAVWL 183

Query: 135 GARLPTEAEWEYGCRG 150
           G  LPTE EWEY  + 
Sbjct: 184 GRDLPTELEWEYAAKA 199


>gi|445426185|ref|ZP_21437542.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           sp. WC-743]
 gi|444752735|gb|ELW77407.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           sp. WC-743]
          Length = 316

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/166 (40%), Positives = 89/166 (53%), Gaps = 9/166 (5%)

Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR---ANVW 235
           PV +V+ NDA  Y  W G  LPTE EWEY  + G          +  P   H+   AN W
Sbjct: 148 PVRYVTQNDAEHYAVWLGHDLPTEQEWEYAAKAGSTTDT---PLHQAPEDAHQHPQANYW 204

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNP-- 292
           QGEFP NNT AD +   APV  Y  N F L++M+GNVWEWT+  +   H     +Y    
Sbjct: 205 QGEFPFNNTQADHFEGVAPVGCYAANAFKLFDMIGNVWEWTSSIYRGAHDQHMGNYADLR 264

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           +    GT  V KGGS+LC   +C R+R ++R     D +A ++GFR
Sbjct: 265 QQRIAGTQYVIKGGSFLCASNFCSRYRNSSRYPQEFDLAATHVGFR 310


>gi|451979552|ref|ZP_21927968.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
 gi|451763204|emb|CCQ89159.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
          Length = 432

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/168 (40%), Positives = 96/168 (57%), Gaps = 11/168 (6%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           N PV  ++WN+A  YC  +G RLPTEAEWEY  RGG +   + WG+  T +G + AN   
Sbjct: 268 NLPVESITWNEASQYCEKQGKRLPTEAEWEYAARGGTQTHFY-WGD--TVKG-NEANFCD 323

Query: 237 GEFPTNNTAA---DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNP 292
                N   A   DG+ +TAPV  +K N +GL++M GNV EWTADW + +++  +P  NP
Sbjct: 324 KACELNIRVAAIDDGFRTTAPVGQFKPNSYGLHDMAGNVAEWTADWMDENYYRSSPKDNP 383

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           KGP    +KV +GG++     +    R + R+   PD     +GFRC 
Sbjct: 384 KGPRPLDNKVVRGGAWNTTAGFL---RSSNRAAFLPDFRNPGVGFRCV 428



 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 41/147 (27%), Positives = 63/147 (42%), Gaps = 45/147 (30%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV + G  F+MG+ +       E P   V +D+F +D++EV+  +FQ  +         
Sbjct: 210 EMVTIKGGCFQMGSKQGT---PAEQPVHEVCVDSFQMDKYEVTQARFQAVM--------- 257

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                           +R     +  ++                PV  I+WN+A  YC  
Sbjct: 258 ----------------DRNPSDHIGANL----------------PVESITWNEASQYCEK 285

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWG 160
           +G RLPTEAEWEY  RGG +   + WG
Sbjct: 286 QGKRLPTEAEWEYAARGGTQTHFY-WG 311


>gi|328953727|ref|YP_004371061.1| sulfatase-modifying factor protein [Desulfobacca acetoxidans DSM
           11109]
 gi|328454051|gb|AEB09880.1| Sulphatase-modifying factor protein [Desulfobacca acetoxidans DSM
           11109]
          Length = 292

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 69/176 (39%), Positives = 92/176 (52%), Gaps = 14/176 (7%)

Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           N PVV V+W+DA A+  W          LPTEA+WE   RGGL  + FPWG+    +G  
Sbjct: 124 NQPVVGVTWHDATAFAKWLSRITQEEYMLPTEAQWEAAARGGLVGQPFPWGSTPPEQGGR 183

Query: 231 RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPS 289
               ++  F  N    DG+  TAPV S+  N FGLY+M GNV EW  DW++   +  +  
Sbjct: 184 ----FRASFNPNPYDKDGFRYTAPVASFPPNGFGLYDMAGNVAEWCEDWYHSSSYTKSRK 239

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
            NP GP +GT +V +GGS+    +     RCAAR    P  + G +GFR      P
Sbjct: 240 ENPAGPASGTSRVLRGGSWYGRAR---ELRCAARQSFRPSQTDGFIGFRLIRPVSP 292



 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 70/158 (44%), Gaps = 45/158 (28%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEF----PSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
            + L+PG  + +G+      + G +    P   V L  FY+   E++N Q+ EF++ATG+
Sbjct: 53  QLRLIPGGPYLLGSPPE---ESGRYWDEGPQHWVNLSPFYITATEITNAQYGEFLAATGH 109

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
                        EPL                     LD  + +  + PVV ++W+DA A
Sbjct: 110 P------------EPLY-------------------WLDRNL-NAPNQPVVGVTWHDATA 137

Query: 130 YCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGS 161
           +  W          LPTEA+WE   RGGL  + FPWGS
Sbjct: 138 FAKWLSRITQEEYMLPTEAQWEAAARGGLVGQPFPWGS 175


>gi|312131280|ref|YP_003998620.1| gliding motility-associated lipoprotein gldj [Leadbetterella
           byssophila DSM 17132]
 gi|311907826|gb|ADQ18267.1| gliding motility-associated lipoprotein GldJ [Leadbetterella
           byssophila DSM 17132]
          Length = 419

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/392 (28%), Positives = 164/392 (41%), Gaps = 75/392 (19%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           PA P     +MV + G  F MGT +  ++   +   R VT+ +FY+D+ E++N  + E++
Sbjct: 59  PAGP-----NMVFIEGGRFVMGTLEEDVMYSRDNIDRTVTVSSFYMDETEIANLDYLEYL 113

Query: 65  SATGYVTEAEKFG----DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
            A    +  E +     DT V+   +S  +     Q      R+ G  S       +PVV
Sbjct: 114 YAVSRDSTREFYAAALPDTTVWAHEMSFND-----QYVDQYLRYPGFRS-------YPVV 161

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
            +SW  A  YC WR A +  E   +YG +G  +   F        E  D+T +     P+
Sbjct: 162 GVSWVQANDYCKWRSAVVNREMYAKYGGKGEKKTLSFGRKKKGAEEVADNTPQVTGRVPI 221

Query: 181 -VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGL---------ENRLFPW-GNNL----- 224
              ++  D          RLPTEAEWEY  +  +         +NR++PW G++L     
Sbjct: 222 ETGINLPDF---------RLPTEAEWEYAAKATIGTLSVDENYDNRIYPWDGSSLRRAKG 272

Query: 225 TPRGEHRANVWQGEFPTNNTAA---DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
             RGE +AN  +G       A    D Y  TAPV  Y  N FGLYNM GNV EW  D + 
Sbjct: 273 KARGEMQANYKRGRGDYAGIAGRLNDKYQITAPVYEYYPNDFGLYNMAGNVNEWVYDLYR 332

Query: 282 VHHHPAPSYNPKGPTTGTD---------------------KVKKGGSYLCNEQYCYRHRC 320
            + +    +N   P   +D                     +V KGGS+       Y    
Sbjct: 333 PNSY--RDFNDLNPIRRSDYQDEEKNYDNANYNSLINNRVRVYKGGSW---NDVAYWLSP 387

Query: 321 AARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
             R     DSS   +GFRCA     + G  K+
Sbjct: 388 GTRRFLDQDSSTATIGFRCAMISVGSAGKRKL 419


>gi|423315592|ref|ZP_17293497.1| hypothetical protein HMPREF9699_00068 [Bergeyella zoohelcum ATCC
           43767]
 gi|405585910|gb|EKB59703.1| hypothetical protein HMPREF9699_00068 [Bergeyella zoohelcum ATCC
           43767]
          Length = 869

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 69/165 (41%), Positives = 88/165 (53%), Gaps = 14/165 (8%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR-LFPWGNNLTPRGEHRANVW 235
           NHPV HVSW  A AY  W G RLPTEAEWEY  RGG  +   F  G+N         NV 
Sbjct: 716 NHPVGHVSWYGARAYAQWVGGRLPTEAEWEYAARGGKYSLGFFYAGSN---------NVG 766

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP 295
              +  +N++ +G  +T PV   + N+ GLY+M GNVWEW +DW+  +   A S NP GP
Sbjct: 767 DVAWYDDNSSVNGTRTTHPVGQKQPNELGLYDMSGNVWEWCSDWYGAYSSNAQS-NPTGP 825

Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           TTG  +V +GG +  +       R A R  N P +     GFR  
Sbjct: 826 TTGNLRVLRGGGFNDDANIT---RVAHRRSNYPSNQLNYFGFRVV 867



 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 49/136 (36%), Positives = 70/136 (51%), Gaps = 17/136 (12%)

Query: 16  VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
           VL+ G TF MG+   +   + E P   VT+  F + ++EV+N Q+ +F++  G  TE   
Sbjct: 633 VLVEGGTFLMGSPNGVGYSN-ERPQHQVTVSTFRMSKYEVTNAQYAQFLNVKGNQTEG-- 689

Query: 76  FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRG 135
            G+++  E   S             +K+  G+        +HPV H+SW  A AY  W G
Sbjct: 690 -GNSWYDEYYGS-------------IKKNGGVYLVTPGYENHPVGHVSWYGARAYAQWVG 735

Query: 136 ARLPTEAEWEYGCRGG 151
            RLPTEAEWEY  RGG
Sbjct: 736 GRLPTEAEWEYAARGG 751


>gi|402758441|ref|ZP_10860697.1| hypothetical protein ANCT7_12175 [Acinetobacter sp. NCTC 7422]
          Length = 334

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 67/168 (39%), Positives = 93/168 (55%), Gaps = 9/168 (5%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR---AN 233
           + PV +V+ NDA  Y  W G  LP+E EWEY  + G          +  P+  H+   AN
Sbjct: 163 SEPVRYVTKNDAEHYAVWLGRDLPSEQEWEYAAKAGSTTDT---PLHQAPQDTHQHPQAN 219

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNP 292
            WQGEFP NNTA D +   APV  Y  N F L++M+GNVWEWT+  ++  H     +Y  
Sbjct: 220 YWQGEFPLNNTAEDHFDGVAPVGCYPANAFRLFDMIGNVWEWTSSVYHGAHDQHMGNYAE 279

Query: 293 --KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
             +  +T T  V KGGS+LC   +C R+R ++R     D +A ++GFR
Sbjct: 280 LRQQNSTATQYVLKGGSFLCASNFCARYRNSSRYPQEFDLAATHVGFR 327



 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 51/139 (36%), Positives = 72/139 (51%), Gaps = 6/139 (4%)

Query: 15  MVLLPGDTFRMGTNK--PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV +P  TF++G+N   P  +  G+ P R V  + F++DQ EV+  QF  FV ATGYVT+
Sbjct: 64  MVWIPKGTFQLGSNLAYPDELNFGK-PQRQV--EGFWIDQTEVTVAQFASFVKATGYVTD 120

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AEK     VF P ++   +    +  +  K   G           PV +++ NDA  Y  
Sbjct: 121 AEKQKQAAVFSPDVNHPNQWWQLKSDYSWKTPNGAAGQAP-LPSEPVRYVTKNDAEHYAV 179

Query: 133 WRGARLPTEAEWEYGCRGG 151
           W G  LP+E EWEY  + G
Sbjct: 180 WLGRDLPSEQEWEYAAKAG 198


>gi|422321266|ref|ZP_16402315.1| hypothetical protein HMPREF0005_03342 [Achromobacter xylosoxidans
           C54]
 gi|317403869|gb|EFV84342.1| hypothetical protein HMPREF0005_03342 [Achromobacter xylosoxidans
           C54]
          Length = 331

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 110/367 (29%), Positives = 150/367 (40%), Gaps = 106/367 (28%)

Query: 15  MVLLPGDTFRMGTNKP--ILIKD-------------GEFPSRNVTLD-AFYLDQHEVSNT 58
            V +P   F MG+++    L +D              E P   V +   F L   EV+  
Sbjct: 35  FVAVPAGRFLMGSDEAPEALARDYPHYPRERFTQLFDEAPVHAVRITRPFQLMATEVTVG 94

Query: 59  QFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
           QF+ F+ A+GYV E+E      +G    ++P  SE   A           FEG D     
Sbjct: 95  QFRRFLEASGYVPESEADGTGGYGYNAAYDPDASERGDA-----------FEGRDRRY-- 141

Query: 114 RMHHPVVHISWNDAVAYCTWRGARLP-TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTI 172
                             +WR    P T+A                              
Sbjct: 142 ------------------SWRNPGFPQTDA------------------------------ 153

Query: 173 EHRMNHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTP 226
                HPVV+VSWNDAVA   W      R  RLPTEAEWEY  R G   R +  G++   
Sbjct: 154 -----HPVVNVSWNDAVAMARWLSQREGRVYRLPTEAEWEYAARAGGRGR-YQHGDDPAA 207

Query: 227 RGE-------HRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
             E         A +W          +DG+  TAPV SY  N FGLY+M GN WEW ADW
Sbjct: 208 LAEVANTFDADAAPLWPKWQAYAVARSDGHRYTAPVASYAPNAFGLYDMHGNAWEWVADW 267

Query: 280 WNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           +   ++  +P  +P+GP +G  +V++GGS+     Y    R A R+ NTP++    +GFR
Sbjct: 268 YGEDYYAHSPRDDPQGPASGNVRVRRGGSWHTWPLYT---RSAYRNWNTPETRYPLVGFR 324

Query: 339 CAADKGP 345
              +  P
Sbjct: 325 LLREVAP 331


>gi|159037635|ref|YP_001536888.1| hypothetical protein Sare_2027 [Salinispora arenicola CNS-205]
 gi|157916470|gb|ABV97897.1| protein of unknown function DUF323 [Salinispora arenicola CNS-205]
          Length = 314

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 117/358 (32%), Positives = 148/358 (41%), Gaps = 89/358 (24%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPIL----------IKD---GEFPSRNVTLDAFYLDQ 52
           APP      MV +PG TF  G+ +  L           +D    E P R V L  + +D+
Sbjct: 14  APPAP--ATMVHVPGGTFLQGSPERTLDWLEAEGQAFPRDWFTDETPQRPVALRDYLIDR 71

Query: 53  HEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE 112
           + V+  QF +FV+ TGYVT AE+ G + V+       E A          R  G  S I 
Sbjct: 72  YPVTVAQFADFVACTGYVTSAERAGGSMVYSDYWEIREGAC-------WHRPAGYGSGIR 124

Query: 113 HRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTI 172
            R  HPVVHIS+ DA AY TW G RLPTE+EWE     G   RL+PWG            
Sbjct: 125 SREDHPVVHISYPDAEAYATWAGKRLPTESEWERAAT-GRAYRLWPWGD----------- 172

Query: 173 EHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRA 232
                               TW   R  T AEW  G   GL+     W       G H A
Sbjct: 173 --------------------TWDPTRANT-AEWTGGSLRGLDEWRQWWS------GIHNA 205

Query: 233 NVWQGEFPTNNTAADGYLSTAPVMSYK---ENKFGLYNMVGNVWEWTADWWNVHHHPAPS 289
              QG  P           T PV ++    ++  G  +M GNV+EWT    +   HP   
Sbjct: 206 ---QGPVP----------QTTPVRAFSPRGDSPDGCADMTGNVYEWT----STTAHPYAP 248

Query: 290 YNPKGPTT----GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS-SAGNLGFRCAAD 342
                PT     G  +  +GGS++    + Y+ RCA R    P   S   LGFRCA D
Sbjct: 249 ETRCDPTIHLVMGRFRAIRGGSWM---NFRYQVRCAERLYGDPTGWSNFALGFRCAGD 303


>gi|451980952|ref|ZP_21929334.1| putative Sulfatase-modifying factor 1 [Nitrospina gracilis 3/211]
 gi|451761874|emb|CCQ90579.1| putative Sulfatase-modifying factor 1 [Nitrospina gracilis 3/211]
          Length = 320

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 68/174 (39%), Positives = 91/174 (52%), Gaps = 14/174 (8%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           N PV  VSW+DA  YC WRG RLPTEAEWE   RG + N L+PWGN+L P        W+
Sbjct: 150 NQPVTQVSWHDAKEYCEWRGKRLPTEAEWEKAARGPMGN-LYPWGNDLPPGKATYRRKWR 208

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-NPKGP 295
           G     +  +  Y +   V       +G+++M GNVWEW +DW++  ++      NP+GP
Sbjct: 209 GIHTMTDVGS--YPNGVSV-------YGVHDMAGNVWEWVSDWYDRKYYKKRVRNNPQGP 259

Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
             G  KV +GGS++  E   +    A R    P     + GFRCA D  P   T
Sbjct: 260 KRGVFKVMRGGSWVNYEDTLH---SAFRRWGRPYVRFNDTGFRCAKDPIPIPST 310



 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/175 (36%), Positives = 86/175 (49%), Gaps = 37/175 (21%)

Query: 9   VERYKDMVLLPGDTFRMGTNKPIL------------------IKDGEFPSRNVTLDAFYL 50
           +E Y DMV +P  TFRMG     +                   KD EFP   V LD +++
Sbjct: 43  LEMYSDMVEIPASTFRMGLGFHKITDMLEMCWQVDDRCNRWWFKD-EFPDHMVYLDKYWI 101

Query: 51  DQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDST 110
           D +EV+N ++ EFV ATG+              P L +    K      ++ + E     
Sbjct: 102 DIYEVTNQRYLEFVKATGH-------------RPALDDTCETKACW-EGNLWKGESFPKE 147

Query: 111 IEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
           I+++   PV  +SW+DA  YC WRG RLPTEAEWE   RG + N L+PWG+ L P
Sbjct: 148 IKNQ---PVTQVSWHDAKEYCEWRGKRLPTEAEWEKAARGPMGN-LYPWGNDLPP 198


>gi|386811281|ref|ZP_10098507.1| conserved hypothetical protein [planctomycete KSU-1]
 gi|386406005|dbj|GAB61388.1| conserved hypothetical protein [planctomycete KSU-1]
          Length = 263

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 65/167 (38%), Positives = 93/167 (55%), Gaps = 20/167 (11%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
             P+V ++W DAVAY TW G RLPTEAEWE   R G + R++PWGN      + + NVW+
Sbjct: 109 KQPIVGITWEDAVAYATWAGKRLPTEAEWEKAAR-GTDKRVWPWGNKFD---KEKCNVWE 164

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNV-HHHPAPSYNPK 293
                         +T PV  Y+  K  +G Y+M GN WEW AD+++  +++ +   NP 
Sbjct: 165 SVIK----------ATTPVGKYENGKSPYGCYDMAGNAWEWCADYYDQNYYYVSLQKNPA 214

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP +G  +V +GGS+L    Y    RCAAR +    + +  +GFRC 
Sbjct: 215 GPDSGLQRVVRGGSFLYFGHYA---RCAARYRVPWYAESPQIGFRCV 258



 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/160 (36%), Positives = 83/160 (51%), Gaps = 43/160 (26%)

Query: 10  ERYKDMVLLPGDTFRMGTNK--------PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
           E+Y DMVL+P   F MG +            I DG  P   V LDAFY+D++EV+N Q++
Sbjct: 29  EKYPDMVLVPAGEFIMGDSTYYDWTFLLAYNIYDG--PEHRVYLDAFYIDKYEVTNEQYK 86

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           +FV  TG+                       ++ +  +D+ RF        ++   P+V 
Sbjct: 87  KFVEETGH-----------------------RMPRCWNDV-RF--------NQPKQPIVG 114

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
           I+W DAVAY TW G RLPTEAEWE   R G + R++PWG+
Sbjct: 115 ITWEDAVAYATWAGKRLPTEAEWEKAAR-GTDKRVWPWGN 153


>gi|162452078|ref|YP_001614446.1| hypothetical protein sce3806 [Sorangium cellulosum So ce56]
 gi|161162660|emb|CAN93965.1| hypothetical protein sce3806 [Sorangium cellulosum So ce56]
          Length = 428

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 69/170 (40%), Positives = 94/170 (55%), Gaps = 10/170 (5%)

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
           R   PVV VSW+DA  YC W G RLPTEAEWE   R G+  R +PWG+   P     AN 
Sbjct: 266 RPEVPVVLVSWSDAQRYCVWAGGRLPTEAEWERAAR-GVAGRRYPWGHVYNP---FHANH 321

Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKF--GLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
            +  F   +  +DG+L  APV S    +   G+ ++ GNV EW AD+++  +    + NP
Sbjct: 322 GRLSFDELDD-SDGFLELAPVGSLVAGRTPDGIDDLAGNVEEWVADYYHPEYPSGSAKNP 380

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +GP  G ++V +GGSY+    +    R AAR  ++P S     GFRCA D
Sbjct: 381 RGPDMGEERVLRGGSYILGRPWL---RGAARGHDSPTSRRAWRGFRCARD 427



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 1/52 (1%)

Query: 114 RMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
           R   PVV +SW+DA  YC W G RLPTEAEWE   R G+  R +PWG   +P
Sbjct: 266 RPEVPVVLVSWSDAQRYCVWAGGRLPTEAEWERAAR-GVAGRRYPWGHVYNP 316


>gi|399017730|ref|ZP_10719919.1| hypothetical protein PMI16_00829 [Herbaspirillum sp. CF444]
 gi|398102497|gb|EJL92677.1| hypothetical protein PMI16_00829 [Herbaspirillum sp. CF444]
          Length = 299

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 77/183 (42%), Positives = 101/183 (55%), Gaps = 24/183 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           NHPV++V+WNDA A   W         RLPTEAEWEY CR G   R +P G++  P G  
Sbjct: 119 NHPVLNVTWNDAQALAAWLSKKEQVHYRLPTEAEWEYSCRAGSTTR-YPNGDD--PEGLS 175

Query: 231 R-ANVWQGE----FPTNNTAA----DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW- 280
           R AN +       +P   + A    DG+  T+PV S+  N FGLY+MVGN WEWTADW+ 
Sbjct: 176 RIANTFDANATPYWPRFASHALKNHDGFAFTSPVGSFPPNGFGLYDMVGNAWEWTADWYG 235

Query: 281 -NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            + + H +P  +P GP  G  KV++GGS+     Y    RCA R+ NT  +    +G R 
Sbjct: 236 EDTYAH-SPLADPHGPADGEVKVRRGGSWHTWSLYA---RCAFRNWNTTSTRYTLVGMRL 291

Query: 340 AAD 342
             D
Sbjct: 292 VRD 294



 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/181 (32%), Positives = 87/181 (48%), Gaps = 34/181 (18%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPS----RNVTLD------------AFYLDQHEVSNT 58
           MVL+P   F MGT++ +    G +P     R + LD             FY+ ++ V+  
Sbjct: 1   MVLIPAGEFLMGTDESLASLKGAYPDYEERRLLDLDDERPRHKVLITHPFYVGKYLVTVG 60

Query: 59  QFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
           QF +FV+ +GY  E+ +     +G    ++P   E ER    + R+   R+   +     
Sbjct: 61  QFAKFVAESGYTPESIRDQTGGYGYNPDYDP--QETERKDAFEGRN--PRYSWKNPGFPQ 116

Query: 114 RMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
             +HPV++++WNDA A   W         RLPTEAEWEY CR G   R +P G    PEG
Sbjct: 117 ADNHPVLNVTWNDAQALAAWLSKKEQVHYRLPTEAEWEYSCRAGSTTR-YPNGD--DPEG 173

Query: 168 I 168
           +
Sbjct: 174 L 174


>gi|258648993|ref|ZP_05736462.1| serine/threonine kinase [Prevotella tannerae ATCC 51259]
 gi|260850608|gb|EEX70477.1| serine/threonine kinase [Prevotella tannerae ATCC 51259]
          Length = 291

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 72/166 (43%), Positives = 91/166 (54%), Gaps = 11/166 (6%)

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
           R N PV+ VSW DAVA+C W G RLPTEAEWEY CR G     F  G+NLT    +    
Sbjct: 130 RENRPVIDVSWEDAVAFCEWTGCRLPTEAEWEYACRAG-STTPFNTGDNLTTDDAN---- 184

Query: 235 WQGEFPTNNTAADGY-LSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
           + G +P NN     Y   T PV S  + N +GLY+M GNV EW ADW+   +  A   +P
Sbjct: 185 YDGTYPYNNHEQGLYRQQTVPVGSVGQPNAWGLYDMHGNVQEWCADWYE-EYSKAEQTDP 243

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           KGP  GT ++ +GGS+      C   R A R  +  +S   + GFR
Sbjct: 244 KGPANGTYRILRGGSWTKRALQC---RSAYRGGSEYNSRFDDAGFR 286



 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 43/140 (30%), Positives = 59/140 (42%), Gaps = 38/140 (27%)

Query: 14  DMVLLPGDTFRMG--TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           + V +   +F MG  T +P   + GE   +      FY+ ++ V+  QF +F  ATG   
Sbjct: 64  EFVKIQAGSFVMGSPTTEPQR-EAGEIQHKVTLTKDFYMSKYPVTFEQFDKFCEATGR-- 120

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
                                            +  D     R + PV+ +SW DAVA+C
Sbjct: 121 ---------------------------------QKPDDYGWGRENRPVIDVSWEDAVAFC 147

Query: 132 TWRGARLPTEAEWEYGCRGG 151
            W G RLPTEAEWEY CR G
Sbjct: 148 EWTGCRLPTEAEWEYACRAG 167


>gi|390344998|ref|XP_001186776.2| PREDICTED: sulfatase-modifying factor 2-like [Strongylocentrotus
           purpuratus]
          Length = 198

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 66/162 (40%), Positives = 98/162 (60%), Gaps = 13/162 (8%)

Query: 4   LPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           +PA PV ++Y++MV LPG  F MG++K    KDGE   R VT+  F ++++ ++   F++
Sbjct: 32  VPADPVFDKYEEMVALPGGKFMMGSSKSDG-KDGESVQRKVTVKPFKINKYPITAASFRK 90

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ----------VRHDM-KRFEGLDSTI 111
           FV A  Y T+AEK+  +F F+  + E  +  I Q          V++   ++ EG  S I
Sbjct: 91  FVRAKKYKTDAEKYSWSFAFDAFVPEAIKNTIEQRLPGAPWWIPVQNAYWRQPEGKGSGI 150

Query: 112 EHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 153
           E+R+++P VHIS  DA A+C W G RLPTE EWE+  RG LE
Sbjct: 151 ENRLNYPAVHISNVDARAFCQWHGWRLPTEPEWEFAARGSLE 192


>gi|194336982|ref|YP_002018776.1| hypothetical protein Ppha_1942 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194309459|gb|ACF44159.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 517

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 74/176 (42%), Positives = 94/176 (53%), Gaps = 20/176 (11%)

Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           NHPV+HVSWNDAVAYC W  A      RLPTEAEWE  CR G     F  G NL+     
Sbjct: 347 NHPVLHVSWNDAVAYCKWLSATTGEAFRLPTEAEWECACRAGTTTP-FNTGENLS---TG 402

Query: 231 RANVWQGEFPTNNTAADGYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---- 285
           +AN + G +P  +     Y  +T  V S+  N +GLYNM GNV EW  DW+   ++    
Sbjct: 403 QAN-YDGNYPYKSNQKGVYRDNTVAVDSFSSNAWGLYNMHGNVLEWCGDWYGDKYYEECK 461

Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                 NP GP TG+ +V +GG ++   +YC   R A RS   PD  +  +GFR  
Sbjct: 462 TKGTVENPSGPATGSCRVLRGGRWVSYARYC---RSAFRSLGPPDYRSYFVGFRLV 514



 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/155 (36%), Positives = 85/155 (54%), Gaps = 18/155 (11%)

Query: 7   PPVE--RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
           PP++   + D VL+ G  F MG+    + +  +     V +  FYL ++ V+   F+EF+
Sbjct: 242 PPIQPKVFSDFVLIRGGIFPMGSPTGEVDRFDDETQHQVKVSDFYLCRYTVTVAGFREFI 301

Query: 65  SATGYVTEAEKFGDTFVFEPLLSEEERAKISQV--RHDMKRFEGLDSTIEHRMHHPVVHI 122
            A+ Y T+AEK   ++V+    +  E  K++ +  R+ +   E  D+      +HPV+H+
Sbjct: 302 EASRYRTDAEKGDGSYVW----NGNEWNKMAGINWRYGISGSERNDA----EENHPVLHV 353

Query: 123 SWNDAVAYCTWRGA------RLPTEAEWEYGCRGG 151
           SWNDAVAYC W  A      RLPTEAEWE  CR G
Sbjct: 354 SWNDAVAYCKWLSATTGEAFRLPTEAEWECACRAG 388


>gi|451980968|ref|ZP_21929349.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
 gi|451761800|emb|CCQ90594.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
          Length = 265

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 69/170 (40%), Positives = 92/170 (54%), Gaps = 14/170 (8%)

Query: 174 HRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
           +R NHPVV VSW DA+AY  W+G RLPTEAEWE   RG  + RL+PWG+         A 
Sbjct: 103 NRPNHPVVGVSWFDAMAYARWKGRRLPTEAEWEKAARGN-DTRLYPWGDKW-------AK 154

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWW-NVHHHPAPSY 290
            +   F       D Y  TAPV +Y    + FG++NM GNVWEW  DW+ + ++  +P  
Sbjct: 155 GFVLYFVNVFGVEDKYRHTAPVDAYPAGISPFGVFNMAGNVWEWCLDWYEDDYYRKSPEL 214

Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           NP+GP     KV +GGS++         R   R +N P++     GFR  
Sbjct: 215 NPEGPEPTKMKVVRGGSWINTIDGA---RLVRRGRNFPNTKNMIYGFRTV 261



 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/153 (36%), Positives = 79/153 (51%), Gaps = 35/153 (22%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P E  + MV +P   F MG N     ++G  P  +V   A+++D++EVSN ++ +FV  T
Sbjct: 32  PPEAPEGMVYIPEGYFAMGVNSGNASEEG--PQHHVYTSAYFIDRYEVSNAEYMKFVKDT 89

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G+             +PL  E+ER                     +R +HPVV +SW DA
Sbjct: 90  GHP------------KPLFWEDERF--------------------NRPNHPVVGVSWFDA 117

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
           +AY  W+G RLPTEAEWE   RG  + RL+PWG
Sbjct: 118 MAYARWKGRRLPTEAEWEKAARGN-DTRLYPWG 149


>gi|363581291|ref|ZP_09314101.1| gliding motility-associated lipoprotein GldK [Flavobacteriaceae
           bacterium HQM9]
          Length = 454

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 115/427 (26%), Positives = 167/427 (39%), Gaps = 94/427 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
           M L+PG  F MG +          P + VT+ +FY+D+ E++N+++++F       V  T
Sbjct: 36  MTLIPGGAFIMGKSDDDKAGLLNAPPKTVTVRSFYMDETEITNSEYRQFTEWVKDSVVRT 95

Query: 68  GYVTEAEKFGDT---------FVFEPLLSEEERAKISQVRHDMKRFEGLDST-------- 110
                A++ G T         F F     EEE +   +   D   +EGL  T        
Sbjct: 96  QLAITADEVGQTGSDEDGIGQFAFRNTEDEEELSVYEKYVRD--NYEGLGETGYEGKALN 153

Query: 111 -----IEHRMHHPVVHISWNDAVAYC----------TWRGARLPTEAEWEYGCRGGLENR 155
                IE    +P  + +      Y           T++  +L  + +W Y       +R
Sbjct: 154 KDVDLIEDLNEYPDEYYAEVMDSLYIRAEESHNGVRTFKTEKLVYDYKW-YDIEAMARSR 212

Query: 156 LFPWGSWLHPEGI----DSTI------------EHRM--------NHPVVHVSWNDAVAY 191
                 +L  E +    D+T+             H M        ++PVV V+W  A A+
Sbjct: 213 SKDSKKFLKREALKIYPDTTVWIRDFSYSYNEPMHDMYFTHTSYDDYPVVGVTWKQAKAF 272

Query: 192 CTWRGA------------------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
           C WR                    RLPTEAEWEY  RGGLE+  FPWG    P  ++   
Sbjct: 273 CDWRTKNHNDALRKKKNKDQVSRYRLPTEAEWEYAARGGLESGTFPWGG---PYAKNDRG 329

Query: 234 VWQGEFP--TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSY 290
            +   F     + AAD  L T    SY  N +GLYNM GNV EW    ++   +    S 
Sbjct: 330 CFLANFKPMRGDYAADQALYTVEANSYDPNDYGLYNMAGNVSEWVGSTYDPASYEYTSSM 389

Query: 291 NPKGPTT-GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
           NP         KV +GGS+   +   Y  + + R     DS+   +GFR   D   T  T
Sbjct: 390 NPNSSDDENKRKVVRGGSW---KDVAYFLQVSTRDYEYADSARSYIGFRAVQDYMGTDVT 446

Query: 350 DKVKKGG 356
              K  G
Sbjct: 447 GNTKTTG 453


>gi|149919635|ref|ZP_01908114.1| hypothetical protein PPSIR1_07178 [Plesiocystis pacifica SIR-1]
 gi|149819578|gb|EDM79006.1| hypothetical protein PPSIR1_07178 [Plesiocystis pacifica SIR-1]
          Length = 289

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/330 (27%), Positives = 128/330 (38%), Gaps = 94/330 (28%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +++ + G +F MG        D + P R VT+  F +++ EV+ T + E V A G     
Sbjct: 52  NLIAITGGSFEMGCPAGAEACDADNPLRTVTVSDFAIERTEVTMTAYTECVDAGGCSATT 111

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                     P+L                            + HP+  +SW  A  YC W
Sbjct: 112 NTDTSCTFGSPVL----------------------------LEHPINCVSWQQAADYCAW 143

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
           +G RLPTEAEWEY  RG  E R FPWG                          +  A CT
Sbjct: 144 KGRRLPTEAEWEYAARGA-EGRTFPWG--------------------------ETPASCT 176

Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
                +      +YGC  G+                  ANV  G +P             
Sbjct: 177 LAHMFMMMGDGGDYGCLTGI-----------------TANV--GTYP------------- 204

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNE 312
                 E    L +M GNV EW ADW+   ++   PS +P+GP TGT +V +GG +L  +
Sbjct: 205 ----MGETPETLADMAGNVDEWVADWYAADYYTTGPSADPQGPDTGTQRVHRGGDFL--D 258

Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
                 R   R ++ PD+   + GFRCAAD
Sbjct: 259 ASPNNLRSFERWRSNPDNDGPSRGFRCAAD 288



 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 73/223 (32%), Positives = 94/223 (42%), Gaps = 65/223 (29%)

Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
           + HP+  VSW  A  YC W+G RLPTEAEWEY  RG  E R FPWG   TP     A++ 
Sbjct: 125 LEHPINCVSWQQAADYCAWKGRRLPTEAEWEYAARGA-EGRTFPWGE--TPASCTLAHM- 180

Query: 236 QGEFPTNNTAAD-GYLS--TAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVHHHP-APS 289
              F       D G L+  TA V +Y   E    L +M GNV EW ADW+   ++   PS
Sbjct: 181 ---FMMMGDGGDYGCLTGITANVGTYPMGETPETLADMAGNVDEWVADWYAADYYTTGPS 237

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
            +P+                                                  GP TGT
Sbjct: 238 ADPQ--------------------------------------------------GPDTGT 247

Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
            +V +GG +L  +      R   R ++ PD+   + GFRCAAD
Sbjct: 248 QRVHRGGDFL--DASPNNLRSFERWRSNPDNDGPSRGFRCAAD 288


>gi|387133017|ref|YP_006298989.1| formylglycine-generating sulfatase enzyme [Prevotella intermedia
           17]
 gi|386375865|gb|AFJ08724.1| formylglycine-generating sulfatase enzyme [Prevotella intermedia
           17]
          Length = 293

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 71/172 (41%), Positives = 93/172 (54%), Gaps = 11/172 (6%)

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
           R N PV+ VSW+DAVA+C W G RLPTEAEWEY C+ G     F  G+NLT      AN 
Sbjct: 130 RENRPVIDVSWDDAVAFCEWIGCRLPTEAEWEYACKSGTTTP-FNTGDNLT---TDEAN- 184

Query: 235 WQGEFPTNNTAADGYL-STAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-N 291
           + G +P NN     Y   T PV +  K N +GL++M GN+ EW +DW+  +        +
Sbjct: 185 YDGNYPYNNNEKGTYRKKTVPVGTLGKPNAWGLFDMHGNIQEWCSDWYAEYSAADEVVTD 244

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           PKGP +GT ++ +GGS+      C   R A R    P S   + GFR    K
Sbjct: 245 PKGPDSGTYRILRGGSWTTRAMQC---RSAYRGGIEPSSRLDDAGFRIVLVK 293



 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 38/146 (26%)

Query: 8   PVERYKDMVLLPGDTFRMGT--NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           P E   + V +P  +F MG+   +P   + GE   R V    FY+ ++ ++  Q+ +F  
Sbjct: 58  PKELGIEFVKIPAGSFLMGSPLEEPQR-ESGEIQHRVVLSKDFYMSKYPITFEQYDKFCE 116

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
           ATG                                    +  D     R + PV+ +SW+
Sbjct: 117 ATGR-----------------------------------QKPDDYTWGRENRPVIDVSWD 141

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGG 151
           DAVA+C W G RLPTEAEWEY C+ G
Sbjct: 142 DAVAFCEWIGCRLPTEAEWEYACKSG 167


>gi|445439319|ref|ZP_21441656.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC021]
 gi|444752273|gb|ELW76962.1| cysteine-type sulfatase aerobic maturase family protein
           [Acinetobacter baumannii OIFC021]
          Length = 335

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 68/182 (37%), Positives = 96/182 (52%), Gaps = 6/182 (3%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W  P G +  +    + PV +V+ NDA  Y  W G  LPTE EWEY  +   +    P 
Sbjct: 150 TWKTPNGKNGALP-PPHEPVRYVTKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PL 207

Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
               T   +H +AN WQG FP  N   D +   APV  +K N F L++M+GNVWEWT+  
Sbjct: 208 HQAPTDEHQHPQANYWQGTFPFENLNHDHFTGVAPVGCFKSNGFKLFDMIGNVWEWTSSP 267

Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
           +   H     +Y+   +   + T  V KGGS+LC E YC R+R ++R     D +A ++G
Sbjct: 268 YQGAHDQHMGNYSDLRQKEMSSTQYVIKGGSFLCAENYCSRYRNSSRYPQDFDLAATHVG 327

Query: 337 FR 338
           FR
Sbjct: 328 FR 329



 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 46/136 (33%), Positives = 66/136 (48%), Gaps = 2/136 (1%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F  G+ K     +  F  +   +  F++DQ EV+  QF  FV ATGY+T+AE
Sbjct: 66  MVFIPDGQFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           K     VF P     ++    +  +  K   G +  +    H PV +++ NDA  Y  W 
Sbjct: 125 KQKQAAVFTPDPRHPQQWWQLKSGYTWKTPNGKNGALP-PPHEPVRYVTKNDAEHYAVWL 183

Query: 135 GARLPTEAEWEYGCRG 150
           G  LPTE EWEY  + 
Sbjct: 184 GRDLPTELEWEYAAKA 199


>gi|302036551|ref|YP_003796873.1| hypothetical protein NIDE1190 [Candidatus Nitrospira defluvii]
 gi|300604615|emb|CBK40947.1| conserved exported protein of unknown function, DUF323 [Candidatus
           Nitrospira defluvii]
          Length = 282

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 72/178 (40%), Positives = 94/178 (52%), Gaps = 17/178 (9%)

Query: 166 EGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLT 225
           E +D    H  + PV+  SW DA AYC WRGARLPTEAEWE   RG  + RLFPWG+  T
Sbjct: 110 ESVDPAQHH--DRPVIGASWFDAEAYCRWRGARLPTEAEWEKAARGS-DGRLFPWGHR-T 165

Query: 226 PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVH 283
           P  E         F  +          APV  Y   +  +GL+ M GNV EW ADW+  +
Sbjct: 166 PTDEEANFGLGARFSYSQV-------LAPVQGYPSGRSPYGLHQMAGNVGEWVADWYGAN 218

Query: 284 HH-PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           ++  + S NP GP  GT +V +GGS+    +Y   +    RS+  P++     GFRCA
Sbjct: 219 YYETSASKNPAGPERGTFRVVRGGSWSDLPKYLLTY---GRSKLPPETRNSYTGFRCA 273



 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 53/148 (35%), Positives = 69/148 (46%), Gaps = 37/148 (25%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MG N    ++D E P   V LD + +D +EV+  Q+  F+         E
Sbjct: 50  MVSIPEGDFLMGANGMDALED-EKPQHRVWLDWYQIDTYEVTTGQYAAFLVQ-------E 101

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH--PVVHISWNDAVAYCT 132
           K    + +E                          +++   HH  PV+  SW DA AYC 
Sbjct: 102 KVSPPWQWE--------------------------SVDPAQHHDRPVIGASWFDAEAYCR 135

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           WRGARLPTEAEWE   RG  + RLFPWG
Sbjct: 136 WRGARLPTEAEWEKAARGS-DGRLFPWG 162


>gi|254495108|ref|ZP_05108032.1| gliding motility protein GldK [Polaribacter sp. MED152]
 gi|85819458|gb|EAQ40615.1| gliding motility protein GldK [Polaribacter sp. MED152]
          Length = 453

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 118/429 (27%), Positives = 170/429 (39%), Gaps = 115/429 (26%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
           MVL+PG +F MG     +I     P++ VT+  +++D+ E++N +++EF       V  T
Sbjct: 42  MVLIPGGSFTMGKQDEDIIGTMNSPTKTVTVRPYFMDETEITNNEYKEFVFWVRDSVVRT 101

Query: 68  GYVTEAE-----------------------KFGDTF------------------VFEPLL 86
               +AE                       KF DT                     +PL 
Sbjct: 102 KLAYQAEFAALGSEPDPTGNNNRATGIQLYKFKDTVENSTPYQRYMYENYYQFDTIQPLN 161

Query: 87  SEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV-AYCTWRGARLPTEAEW- 144
            EE+     +   DM   E +DS          ++IS  DAV    T+    L  +  W 
Sbjct: 162 WEEDLIWKKEEYPDMDYVEVMDS----------LYISREDAVDGVRTFNTKFLKYKYSWF 211

Query: 145 --EYGCRGGLENRLFPWGSWL--HPEGIDSTI--------------------EHRMNHPV 180
             +   R G + + F     L  +P   D+T+                    +   ++PV
Sbjct: 212 DRDNAARKGGDRKDFVQTEILNIYP---DTTVWVKDFNYSYNDPMHQDYFSHQSYGDYPV 268

Query: 181 VHVSWNDAVAYCTWR------------------GARLPTEAEWEYGCRGGLENRLFPWGN 222
           V V+W  A A+C WR                    RLPTEAEWEY  RGGLE   +PWG 
Sbjct: 269 VGVTWGQANAFCNWRTKKKNDYLKNKKNAAQVPAFRLPTEAEWEYAARGGLEFATYPWGT 328

Query: 223 NLTP--RGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
             T   RG   AN    +    N A DG L T    SY  N +GLYNM GNV EWT   +
Sbjct: 329 GSTTSDRGCFLANF---KPVRGNYAVDGALYTMEAESYNANDYGLYNMAGNVSEWTNTAY 385

Query: 281 NV-HHHPAPSYNPK-GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           N+  ++ A + NP         K+ +GGS+   +   Y     +R     D++   +GFR
Sbjct: 386 NLSSYYMASTMNPNVEDRKNKRKIVRGGSW---KDVAYYLEVGSRDYEYADTARSYIGFR 442

Query: 339 CAADKGPTT 347
              +   TT
Sbjct: 443 TVQNYIGTT 451


>gi|21637141|gb|AAM70357.1|AF505622_29 CalU17 [Micromonospora echinospora]
          Length = 316

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 110/347 (31%), Positives = 148/347 (42%), Gaps = 80/347 (23%)

Query: 14  DMVLLPGDTFRMGTNKPILI---KDG----------EFPSRNVTLDAFYLDQHEVSNTQF 60
           DMV +PG TF  G+ +  L    ++G          E P   VTL  + +D+H+V+  QF
Sbjct: 20  DMVRIPGGTFLQGSPERTLDWLDREGQAFPRDWFTDETPQIPVTLPDYLIDRHQVTVAQF 79

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
             FVS TGYVT AE+ G + V+     E              R  G  S I  R  HPVV
Sbjct: 80  AAFVSRTGYVTSAERAGGSMVYGEQYWEIREGAC------WHRPAGYGSGIRGRDDHPVV 133

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
           HIS+ DA AY  W G RLPTE+EWE     G   RL+PWG                    
Sbjct: 134 HISFADAEAYARWAGRRLPTESEWERAAT-GPSYRLWPWGD------------------- 173

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
              +W+          +R    AE   G  G L+     WG           +  QG  P
Sbjct: 174 ---TWD----------SRNANTAEHTAGALGDLDAWRTWWGA---------IHAVQGPMP 211

Query: 241 TNNTAADGYLSTAPVMSYK---ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK-GPT 296
                      T PV ++    ++  G  +M GNV+EWT+   ++ + PA   +P     
Sbjct: 212 ----------QTTPVGAFSPRGDSVDGCADMTGNVYEWTSTLAHL-YSPATRCDPTIHLV 260

Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS-SAGNLGFRCAAD 342
            G  +V +GGS++    + Y+ RCA R    P   S   LGFRCA D
Sbjct: 261 MGRSRVIRGGSWM---NFRYQVRCAERLYGDPTGWSNFALGFRCARD 304


>gi|340349800|ref|ZP_08672803.1| serine/threonine kinase [Prevotella nigrescens ATCC 33563]
 gi|339610003|gb|EGQ14864.1| serine/threonine kinase [Prevotella nigrescens ATCC 33563]
          Length = 291

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 70/171 (40%), Positives = 92/171 (53%), Gaps = 11/171 (6%)

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
           R N PV+ VSW+DA+A+C W G RLPTEAEWEY C+ G     F  GNNLT    +    
Sbjct: 130 RENRPVIDVSWDDAIAFCEWIGCRLPTEAEWEYACKAGTTTP-FNTGNNLTTNDAN---- 184

Query: 235 WQGEFPTNNTAADGYL-STAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
           + G +P NN     Y   T  V +  E N +GLY+M GNV EW ADW++ +     + +P
Sbjct: 185 YDGNYPYNNNEKGDYRKKTVAVGTIGEPNAWGLYDMHGNVQEWCADWYDDYPQTEQT-DP 243

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           KGP  GT ++ +GGS+      C   R A R  +   S   + GFR    K
Sbjct: 244 KGPENGTYRILRGGSWTSRAMQC---RSAYRGGSEHSSRLDDAGFRVVLMK 291



 Score = 69.7 bits (169), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 43/148 (29%), Positives = 64/148 (43%), Gaps = 36/148 (24%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
           P  P +   + V +P  +F MG+ N     + GE   +      FY+ ++ ++  Q+ +F
Sbjct: 55  PVTPKDLGIEFVKVPAGSFLMGSPNNEPQRESGEIQHKVTITKDFYMSKYPITFEQYDKF 114

Query: 64  VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
             ATG                      R K S                  R + PV+ +S
Sbjct: 115 CEATG----------------------RQKPSDYTW-------------GRENRPVIDVS 139

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGG 151
           W+DA+A+C W G RLPTEAEWEY C+ G
Sbjct: 140 WDDAIAFCEWIGCRLPTEAEWEYACKAG 167


>gi|421854856|ref|ZP_16287241.1| hypothetical protein ACRAD_02_02150 [Acinetobacter radioresistens
           DSM 6976 = NBRC 102413]
 gi|403189871|dbj|GAB73442.1| hypothetical protein ACRAD_02_02150 [Acinetobacter radioresistens
           DSM 6976 = NBRC 102413]
          Length = 270

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 71/185 (38%), Positives = 99/185 (53%), Gaps = 10/185 (5%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W  PEG +     R N  V +++ NDA  Y  W G  LP+E EWEY  +G  E  + P 
Sbjct: 85  NWRLPEGPEGPPA-RANEAVRYITKNDAEHYALWLGHDLPSEVEWEYAAKGNDETDV-PL 142

Query: 221 GNNLTPRG-EHR--ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
            +   PR  +HR  AN WQG+FP  NT  D +   APV  +  N FGLY+M+GNVWEWT+
Sbjct: 143 KHE--PRNHQHRPDANYWQGDFPFQNTQEDHFKGIAPVGCFSPNAFGLYDMIGNVWEWTS 200

Query: 278 DWWN-VHHHPAPSYNPKGPTTGTDK--VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
             +   H     +Y+    +    +  V KGGS+LC  QYC R+R ++R     + +  +
Sbjct: 201 SVYTGAHDEHMGNYSALRTSQHIPQSFVIKGGSFLCASQYCARYRKSSRHPQEFNLATTH 260

Query: 335 LGFRC 339
            GFR 
Sbjct: 261 TGFRT 265


>gi|332663535|ref|YP_004446323.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332332349|gb|AEE49450.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 505

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 94/340 (27%), Positives = 149/340 (43%), Gaps = 85/340 (25%)

Query: 15  MVLLPGDTFRMGTNKPILIKDG-----EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           +V + G  +RMG    I  +DG     E P   V++ +FYL ++EV+N +F +F++A   
Sbjct: 234 LVKIDGGKYRMGC---IEARDGSCYSNETPDHEVSISSFYLSKYEVTNAEFVQFLNAISA 290

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
               +  GD   ++                  K FE                        
Sbjct: 291 QISIDANGDGVNYQ----------------GKKIFE-----------------------L 311

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
           +C           + + GC G  E     + +   P G  + +     HP V V+   A 
Sbjct: 312 FC----------GDKKGGCSGFKEQ--IEYSAADRPGGRFTVVSGFEKHPAVMVTKYGAE 359

Query: 190 AYCTW------RGARLPTEAEWEYGCRGGLENRLFPWG--NNLTPRGEHRANVWQGEFPT 241
           AY  W      +  RLPTEAEWE+  RGG++++ + +   N+L   G      W   + +
Sbjct: 360 AYSAWLSSKTGKSYRLPTEAEWEFVARGGIQSQRYKYAGSNDLDAVG------W---YSS 410

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTD 300
           N+++A       PV     N+ GLY++ GNVWEW +DW+   ++  +P+ NPKGP++GT+
Sbjct: 411 NSSSA-----AHPVGLKNANELGLYDLSGNVWEWCSDWYEKEYYKNSPASNPKGPSSGTN 465

Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            V +GGS+   +  C   R A R  N P      +GFR A
Sbjct: 466 SVLRGGSWYVIDINC---RLAIRGINDPGYCGNVIGFRVA 502


>gi|78189652|ref|YP_379990.1| hypothetical protein Cag_1696 [Chlorobium chlorochromatii CaD3]
 gi|78171851|gb|ABB28947.1| conserved hypothetical protein [Chlorobium chlorochromatii CaD3]
          Length = 751

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 78/206 (37%), Positives = 108/206 (52%), Gaps = 30/206 (14%)

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
            +R+ T  EW+Y  + G+        +W    G    +++  NHPV+HVSWNDA AYC W
Sbjct: 538 SSRIWTGKEWKY--KAGV--------NWRCGVGGQLRLQNEENHPVIHVSWNDAKAYCDW 587

Query: 195 ------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
                 +  RLPTEAEWEY CR G     F  G+NLT     +AN + G +P N  A   
Sbjct: 588 LSKKTGKKYRLPTEAEWEYACRAGTTTP-FNTGDNLTT---AQAN-YDGNYPYNGNAKGK 642

Query: 249 Y-LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYNPKGPTTGTDKV 302
           Y  +T PV S+  N +GLYNM GNVWEW +DW+N  ++          NP+     + +V
Sbjct: 643 YRQTTVPVDSFAPNAWGLYNMHGNVWEWCSDWYNDKYYEVCKAKGVVENPECTEEQSYRV 702

Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTP 328
            +GGS+  + + C   R A R+   P
Sbjct: 703 LRGGSWGNDARSC---RSAIRNLLRP 725



 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 48/144 (33%), Positives = 75/144 (52%), Gaps = 12/144 (8%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D V + G  F MG+    + +        V L  FY+ ++ V+  +F++F+   GY T+A
Sbjct: 474 DYVPIKGGEFTMGSPDGEVDRSSTETQHQVKLHNFYIAKYVVTVAEFRKFIEECGYKTDA 533

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
           EK   + ++     +E + K      + +   G    +++  +HPV+H+SWNDA AYC W
Sbjct: 534 EKANSSRIWT---GKEWKYKAGV---NWRCGVGGQLRLQNEENHPVIHVSWNDAKAYCDW 587

Query: 134 ------RGARLPTEAEWEYGCRGG 151
                 +  RLPTEAEWEY CR G
Sbjct: 588 LSKKTGKKYRLPTEAEWEYACRAG 611


>gi|293607291|ref|ZP_06689632.1| DNA replication and repair protein RecF [Achromobacter piechaudii
           ATCC 43553]
 gi|292814383|gb|EFF73523.1| DNA replication and repair protein RecF [Achromobacter piechaudii
           ATCC 43553]
          Length = 355

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 110/374 (29%), Positives = 150/374 (40%), Gaps = 110/374 (29%)

Query: 5   PAPPVERYKDM--VLLPGDTFRMGTNKPI--LIKD-------------GEFPSRNVTLD- 46
           PA  VE    M  V +P  TF+MG+ +    L +D              E P   V +  
Sbjct: 48  PAALVENSLGMAFVAVPAGTFQMGSEESADSLGRDYPQYPMERFAQLFDEAPVHTVRITR 107

Query: 47  AFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDM 101
            FY+ + EV+  QF+ F+ A+GYV E+E      +G    ++P  SE   A         
Sbjct: 108 PFYMGRTEVTVGQFRRFIEASGYVPESEADGTGGYGYNAAYDPDKSERGDA--------- 158

Query: 102 KRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
             FEG D                                                    S
Sbjct: 159 --FEGRDRRY-------------------------------------------------S 167

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLEN 215
           W +P G   T EH    PVV++SWNDA A   W         RLPTEAEWEY CR G   
Sbjct: 168 WRNP-GFAQTDEH----PVVNISWNDATALARWLTQQEGHVYRLPTEAEWEYACRAGTRT 222

Query: 216 RLFPWGNNLTPRGEHRAN---------VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLY 266
           R   + N   P G              +W           DG   TAPV SY  N FGLY
Sbjct: 223 R---YQNGDAPDGMAEVGNGFDADAEPLWPKWHAFATPRHDGNTFTAPVASYAVNAFGLY 279

Query: 267 NMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQ 325
           +M GN WEW +DW+   ++  +P+ +P+GP +G  +V++GGS+     Y    R + R+ 
Sbjct: 280 DMHGNAWEWVSDWYGEDYYAHSPTDDPEGPASGNVRVRRGGSWHTWALYT---RSSYRNW 336

Query: 326 NTPDSSAGNLGFRC 339
           NT ++    +G R 
Sbjct: 337 NTQETRYPLVGLRL 350


>gi|134094839|ref|YP_001099914.1| hypothetical protein HEAR1630 [Herminiimonas arsenicoxydans]
 gi|133738742|emb|CAL61789.1| conserved hypothetical protein; putative C-type lectin domain;
           putative exported protein [Herminiimonas arsenicoxydans]
          Length = 328

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 65/178 (36%), Positives = 98/178 (55%), Gaps = 20/178 (11%)

Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           +HPVV+V+WNDA+A C W      +  RLPTEAEWEY  R G   + +   +  +  G  
Sbjct: 151 DHPVVNVTWNDALAMCKWLSQVEGKTYRLPTEAEWEYAARAGTSTQYYTGDDPQSLLGA- 209

Query: 231 RANVWQGEFPTN--------NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
            AN++  +   N            D +  T+PV S+  N FGLY++ GN WEWTADW + 
Sbjct: 210 -ANIFDADAAKNWGRWSSYALAGHDNFAFTSPVGSFAPNAFGLYDIHGNAWEWTADWHDD 268

Query: 283 HHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           H++  +P  +P+GP TG  +V++GGS+     + +  R + R+ N PD+    +G R 
Sbjct: 269 HYYADSPLNDPQGPATGNVRVRRGGSW---HTWSFYARSSYRNWNAPDTRYTLVGMRL 323



 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 75/166 (45%), Gaps = 23/166 (13%)

Query: 15  MVLLPGDTFRMGTNKP---------------ILIKDGEFPSRNVTLD-AFYLDQHEVSNT 58
            VL+P   F MG++ P                L    E P   V +  AFYL QHEV+  
Sbjct: 33  FVLIPAGEFMMGSDVPADKFAQIYPQYERKRFLELGDEAPVHKVRITRAFYLGQHEVTVG 92

Query: 59  QFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
           QF+ F+ A+GY  E+E  G   + + P       A+         ++   +        H
Sbjct: 93  QFRRFLLASGYQPESEADGSGGYGYNPAYDPALSARGDAFEGRNPKYSWRNPGFAQDDDH 152

Query: 118 PVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLF 157
           PVV+++WNDA+A C W      +  RLPTEAEWEY  R G   + +
Sbjct: 153 PVVNVTWNDALAMCKWLSQVEGKTYRLPTEAEWEYAARAGTSTQYY 198


>gi|189501384|ref|YP_001960854.1| hypothetical protein Cphamn1_2479 [Chlorobium phaeobacteroides BS1]
 gi|189496825|gb|ACE05373.1| protein of unknown function DUF323 [Chlorobium phaeobacteroides
           BS1]
          Length = 398

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 74/175 (42%), Positives = 94/175 (53%), Gaps = 20/175 (11%)

Query: 178 HPVVHVSWNDAVAYCTW----RGA--RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
           +P+V+VSWNDAVAYC W    RG   RLPTEAEWEY CR G     F  G NLT    + 
Sbjct: 225 YPLVNVSWNDAVAYCRWLSEKRGELFRLPTEAEWEYACRAGTRTP-FSTGENLT---TYE 280

Query: 232 ANVWQGEFPTNNTAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----- 285
           AN + G +P  N     Y   T PV S+K N +GLYNM GNVWEW  D +   ++     
Sbjct: 281 AN-YDGNYPYRNNPKGMYRKKTVPVDSFKPNSYGLYNMHGNVWEWCGDRYGAGYYEECRK 339

Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                NP+G   G++ V +GG +L   Q+C   R A R  + P   +  +GFR  
Sbjct: 340 QGIMENPQGAEKGSNSVLRGGGWLTYAQHC---RSAYRHFDGPVYRSNYVGFRLV 391



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/41 (65%), Positives = 31/41 (75%), Gaps = 6/41 (14%)

Query: 117 HPVVHISWNDAVAYCTW----RGA--RLPTEAEWEYGCRGG 151
           +P+V++SWNDAVAYC W    RG   RLPTEAEWEY CR G
Sbjct: 225 YPLVNVSWNDAVAYCRWLSEKRGELFRLPTEAEWEYACRAG 265


>gi|347754869|ref|YP_004862433.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
           B]
 gi|347587387|gb|AEP11917.1| Uncharacterized conserved protein [Candidatus Chloracidobacterium
           thermophilum B]
          Length = 271

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 69/169 (40%), Positives = 86/169 (50%), Gaps = 6/169 (3%)

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
           R  HPVV V+WNDAVAY  W G RLPTEAEWE+  R G     +      T      AN 
Sbjct: 97  REQHPVVAVTWNDAVAYTRWAGKRLPTEAEWEFAARAGRPTAQYGTAGTGTVLAGA-ANF 155

Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPK 293
                        G + T PV ++  N FGLY++ GN  EW  DW++  ++   PS+NP 
Sbjct: 156 GHIAVNPKRMPLRG-IQTTPVATFAPNPFGLYDLSGNAAEWCQDWFDADYYAFGPSHNPP 214

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GGS+   E Y    R   R    P + A  LGFRCAAD
Sbjct: 215 GPDFGRGRVVRGGSWNDREDYLQLTR---RYGLPPGTIAFTLGFRCAAD 260



 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 48/137 (35%), Positives = 62/137 (45%), Gaps = 39/137 (28%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+PG  F        + +D         +  F+LD+ EV+N +F+ FV ATGY +E  
Sbjct: 37  MVLVPGGAF--------VRRDDAGRGHLTVVAPFFLDRTEVTNAEFRRFVMATGYQSEG- 87

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
                                  R  +    G       R  HPVV ++WNDAVAY  W 
Sbjct: 88  -----------------------RWALYATPG-------REQHPVVAVTWNDAVAYTRWA 117

Query: 135 GARLPTEAEWEYGCRGG 151
           G RLPTEAEWE+  R G
Sbjct: 118 GKRLPTEAEWEFAARAG 134


>gi|152981513|ref|YP_001353598.1| hypothetical protein mma_1908 [Janthinobacterium sp. Marseille]
 gi|151281590|gb|ABR90000.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
          Length = 326

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 68/176 (38%), Positives = 93/176 (52%), Gaps = 16/176 (9%)

Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWG------NNL 224
           +HPVV+V+WNDAVA C W      R  RLPTEAEWEY  R G   R +         N  
Sbjct: 149 SHPVVNVTWNDAVAMCKWLSRTEGRTYRLPTEAEWEYAGRAGTRTRYYSGDDPQSLLNAA 208

Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
                  A  WQ          DG+  TAPV S+  N FGLY++ GN WEWTADW +  +
Sbjct: 209 NVFDADAAKNWQQWKKFALDGPDGFAFTAPVGSFAPNAFGLYDIHGNAWEWTADWHDERY 268

Query: 285 HP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +  +P  +P+GP  GT +V++GGS+     + +  R + R+ N+ D+    +G R 
Sbjct: 269 YARSPVDDPQGPADGTVRVRRGGSW---HTWSFYARSSYRNWNSQDTRYTLVGMRL 321



 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 76/166 (45%), Gaps = 23/166 (13%)

Query: 15  MVLLPGDTFRMGTN---------------KPILIKDGEFPSRNVTLD-AFYLDQHEVSNT 58
            VL+P   F MG++               K  +    E P+  V +   FY  Q+EV+  
Sbjct: 31  FVLIPAGEFVMGSDESSEAMLRHYPQYDHKRFVELSDEGPAHKVRITHPFYFGQYEVTVG 90

Query: 59  QFQEFVSATGYVTEAEKFGDT-FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
           +F+ FV A+GY+ EAE  G   + + P     + A+         ++   +        H
Sbjct: 91  EFRRFVEASGYLPEAEADGTGGYGYNPDYDPAKSARGDAFEGRNPKYSWRNPGFPQDDSH 150

Query: 118 PVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLF 157
           PVV+++WNDAVA C W      R  RLPTEAEWEY  R G   R +
Sbjct: 151 PVVNVTWNDAVAMCKWLSRTEGRTYRLPTEAEWEYAGRAGTRTRYY 196


>gi|406592493|ref|YP_006739673.1| hypothetical protein B711_0714 [Chlamydia psittaci CP3]
 gi|405788365|gb|AFS27108.1| hypothetical protein B711_0714 [Chlamydia psittaci CP3]
          Length = 618

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 65/167 (38%), Positives = 86/167 (51%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE G  GG+  + +P G  +    +  AN   
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVGAYGGVAQQRYPCGEEI---DKSLANF-- 521

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
             F ++ TA         VMSY  N +GLY+M GNV+EW  DW++   +      S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 57/190 (30%), Positives = 88/190 (46%), Gaps = 23/190 (12%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + +MV + G TF  G+ +    +  E P   + L +F+LD H V+N QF  ++  +
Sbjct: 377 PQPLFTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G  +E +K+ +  +         R K S++    +R  G          HPVV ++W  A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             Y  W G RLPTEAEWE G  GG+  + +P G     E ID ++ +  +     V    
Sbjct: 479 SGYAAWVGKRLPTEAEWEVGAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533

Query: 188 AVAYCTWRGA 197
           A AY  +  A
Sbjct: 534 ANAYGLYDMA 543


>gi|421465527|ref|ZP_15914214.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           radioresistens WC-A-157]
 gi|400203794|gb|EJO34779.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           radioresistens WC-A-157]
          Length = 327

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 72/190 (37%), Positives = 103/190 (54%), Gaps = 11/190 (5%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W  PEG +     R N  V +++ NDA  Y  W G  LP+E EWEY  +G  E  + P 
Sbjct: 142 NWRLPEGPEGPPA-RANEAVRYITKNDAEHYALWLGHDLPSEVEWEYAAKGNDETDV-PL 199

Query: 221 GNNLTPRG-EHR--ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
            +   PR  +HR  AN WQG+FP +N+  D +   APV  +  N FGLY+M+GNVWEWT+
Sbjct: 200 KHE--PRNHQHRPDANYWQGDFPFHNSQEDHFKGIAPVGCFSPNAFGLYDMIGNVWEWTS 257

Query: 278 DWWN-VHHHPAPSYNPKGPTTGTDK--VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
             +   H     +Y+    +    +  V KGGS+LC  QYC R+R ++R     + +  +
Sbjct: 258 SVYTGAHDEHMGNYSALRTSQHIPQSFVIKGGSFLCASQYCARYRKSSRHPQEFNLATTH 317

Query: 335 LGFRCAADKG 344
            GFR   D+G
Sbjct: 318 TGFR-TIDRG 326



 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 45/151 (29%), Positives = 79/151 (52%), Gaps = 3/151 (1%)

Query: 4   LPAPPVER-YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           LP+  +E+ +  M+ +P    ++G+ +     +  F ++   +  F++DQ EV+  QF  
Sbjct: 46  LPSGWLEQAHAGMLYIPEGEVKLGS-ELAYPDEINFGAQKRKVKGFWIDQTEVTVAQFAS 104

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           F++AT Y+T+AEK  +  VFE      ++    +  ++ +  EG +     R +  V +I
Sbjct: 105 FIAATHYITDAEKQNEAAVFEADEQSPQQWWKLKSGYNWRLPEGPEGPPA-RANEAVRYI 163

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 153
           + NDA  Y  W G  LP+E EWEY  +G  E
Sbjct: 164 TKNDAEHYALWLGHDLPSEVEWEYAAKGNDE 194


>gi|359800113|ref|ZP_09302664.1| hypothetical protein KYC_24182 [Achromobacter arsenitoxydans SY8]
 gi|359361911|gb|EHK63657.1| hypothetical protein KYC_24182 [Achromobacter arsenitoxydans SY8]
          Length = 330

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 70/182 (38%), Positives = 94/182 (51%), Gaps = 22/182 (12%)

Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR---------LFPWG 221
           +HPVV++SWNDA A   W      R  RLPTEAEWEY CR G   R         +   G
Sbjct: 153 DHPVVNISWNDATALARWLTQKEGRVYRLPTEAEWEYACRAGTRTRYQNGDDPAAMPDVG 212

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
           N      +     WQ   P  +   DG   TAPV SY  N FGLY+M GN WEW ADW+ 
Sbjct: 213 NGFDADAQPLWPKWQ---PYASVRHDGNTFTAPVASYSPNAFGLYDMHGNAWEWVADWYG 269

Query: 282 VHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
             ++  +P  +P+GP TG  +V++GGS+     Y    R + R+ NT ++    +G R  
Sbjct: 270 EDYYARSPVDDPQGPATGNVRVRRGGSWHTWALYT---RSSYRNWNTQETRYPLVGLRLL 326

Query: 341 AD 342
           ++
Sbjct: 327 SE 328



 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 84/180 (46%), Gaps = 35/180 (19%)

Query: 7   PPVERYKDM----VLLPGDTFRMGTNK---------PILIKD------GEFPSRNVTLD- 46
           PPV+    +    V +P  TF+MG+++         P    D       E P   V +  
Sbjct: 23  PPVQFENSLGMTFVRVPAGTFQMGSDEAPDTLAAAYPQYPADRFAQLFDEAPVHTVRITR 82

Query: 47  AFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDM 101
            FYL++ EV+  QF+ F+ A+GYV E+E      +G    ++P  S+   A   + R   
Sbjct: 83  PFYLERTEVTVGQFRRFIEASGYVPESEADGTGGYGYNAAYDPEKSQSGDAFEGRDR--- 139

Query: 102 KRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR 155
            R+   +        HPVV+ISWNDA A   W      R  RLPTEAEWEY CR G   R
Sbjct: 140 -RYSWRNPGFAQGDDHPVVNISWNDATALARWLTQKEGRVYRLPTEAEWEYACRAGTRTR 198


>gi|255319125|ref|ZP_05360343.1| sulfatase-modifying factor 1 [Acinetobacter radioresistens SK82]
 gi|262379257|ref|ZP_06072413.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
 gi|255303771|gb|EET82970.1| sulfatase-modifying factor 1 [Acinetobacter radioresistens SK82]
 gi|262298714|gb|EEY86627.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
          Length = 327

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 72/190 (37%), Positives = 103/190 (54%), Gaps = 11/190 (5%)

Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
           +W  PEG +     R N  V +++ NDA  Y  W G  LP+E EWEY  +G  E  + P 
Sbjct: 142 NWRLPEGPEGPPA-RANEAVRYITKNDAEHYALWLGHDLPSEVEWEYAAKGNDETDV-PL 199

Query: 221 GNNLTPRG-EHR--ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
            +   PR  +HR  AN WQG+FP +N+  D +   APV  +  N FGLY+M+GNVWEWT+
Sbjct: 200 KHE--PRNHQHRPDANYWQGDFPFHNSQEDHFKGIAPVGCFSPNAFGLYDMIGNVWEWTS 257

Query: 278 DWWN-VHHHPAPSYNPKGPTTGTDK--VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
             +   H     +Y+    +    +  V KGGS+LC  QYC R+R ++R     + +  +
Sbjct: 258 SVYTGAHDEHMGNYSALRTSQHIPQSFVIKGGSFLCASQYCARYRKSSRHPQEFNLATTH 317

Query: 335 LGFRCAADKG 344
            GFR   D+G
Sbjct: 318 TGFR-TIDRG 326



 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 44/151 (29%), Positives = 79/151 (52%), Gaps = 3/151 (1%)

Query: 4   LPAPPVER-YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
           LP+  +E+ +  M+ +P    ++G+ +     +  F ++   +  F++DQ EV+  QF  
Sbjct: 46  LPSGWLEQAHAGMLYIPKGEVKLGS-ELAYPDEINFGAQKRKVKGFWIDQTEVTVAQFAS 104

Query: 63  FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
           F++AT Y+T+AEK  +  +FE      ++    +  ++ +  EG +     R +  V +I
Sbjct: 105 FIAATHYITDAEKQNEAAIFEADEQSPQQWWKLKSGYNWRLPEGPEGPPA-RANEAVRYI 163

Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 153
           + NDA  Y  W G  LP+E EWEY  +G  E
Sbjct: 164 TKNDAEHYALWLGHDLPSEVEWEYAAKGNDE 194


>gi|445114478|ref|ZP_21377940.1| hypothetical protein HMPREF0662_00992 [Prevotella nigrescens F0103]
 gi|444840707|gb|ELX67733.1| hypothetical protein HMPREF0662_00992 [Prevotella nigrescens F0103]
          Length = 291

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 69/171 (40%), Positives = 92/171 (53%), Gaps = 11/171 (6%)

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
           R N PV+ V+W+DA+A+C W G RLPTEAEWEY C+ G     F  GNNLT    +    
Sbjct: 130 RENRPVIDVNWDDAIAFCEWIGCRLPTEAEWEYACKAGTTTP-FNTGNNLTTNDAN---- 184

Query: 235 WQGEFPTNNTAADGYL-STAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
           + G +P NN     Y   T  V +  E N +GLY+M GNV EW ADW++ +     + +P
Sbjct: 185 YDGNYPYNNNEKGDYRKKTVAVGTIGEPNAWGLYDMHGNVQEWCADWYDDYPQTEQT-DP 243

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           KGP  GT ++ +GGS+      C   R A R  +   S   + GFR    K
Sbjct: 244 KGPENGTYRILRGGSWTSRAMQC---RSAYRGGSEHSSRLDDAGFRVVLMK 291



 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 42/148 (28%), Positives = 64/148 (43%), Gaps = 36/148 (24%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
           P  P +   + V +P  +F MG+ N     + GE   +      FY+ ++ ++  Q+ +F
Sbjct: 55  PVTPKDLGIEFVKVPAGSFLMGSPNDEPQRESGEIQHKVTITKDFYMSKYPITFEQYDKF 114

Query: 64  VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
             ATG                      R K S                  R + PV+ ++
Sbjct: 115 CEATG----------------------RQKPSDYTW-------------GRENRPVIDVN 139

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGG 151
           W+DA+A+C W G RLPTEAEWEY C+ G
Sbjct: 140 WDDAIAFCEWIGCRLPTEAEWEYACKAG 167


>gi|406594806|ref|YP_006741779.1| hypothetical protein B599_0664 [Chlamydia psittaci MN]
 gi|410858565|ref|YP_006974505.1| putative serine/threonine protein kinase [Chlamydia psittaci
           01DC12]
 gi|405783238|gb|AFS21986.1| hypothetical protein B599_0664 [Chlamydia psittaci MN]
 gi|410811460|emb|CCO02111.1| putative serine/threonine protein kinase [Chlamydia psittaci
           01DC12]
          Length = 618

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 65/167 (38%), Positives = 86/167 (51%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE G  GG+  + +P G  +    +  AN   
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVGAYGGVAQQRYPCGEEI---DKSLANF-- 521

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
             F ++ TA         VMSY  N +GLY+M GNV+EW  DW++   +      S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 57/190 (30%), Positives = 88/190 (46%), Gaps = 23/190 (12%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + +MV + G TF  G+ +    +  E P   + L +F+LD H V+N QF  ++  +
Sbjct: 377 PQPLFTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G  +E +K+ +  +         R K S++    +R  G          HPVV ++W  A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             Y  W G RLPTEAEWE G  GG+  + +P G     E ID ++ +  +     V    
Sbjct: 479 SGYAAWVGKRLPTEAEWEVGAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533

Query: 188 AVAYCTWRGA 197
           A AY  +  A
Sbjct: 534 ANAYGLYDMA 543


>gi|340353143|ref|ZP_08675971.1| serine/threonine kinase [Prevotella pallens ATCC 700821]
 gi|339611339|gb|EGQ16169.1| serine/threonine kinase [Prevotella pallens ATCC 700821]
          Length = 291

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 68/171 (39%), Positives = 91/171 (53%), Gaps = 11/171 (6%)

Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
           R N PV+ VSW+DA A+C W G RLPTEAEWEY C+ G     +  GNN+T    +    
Sbjct: 130 RENRPVIDVSWDDATAFCQWIGCRLPTEAEWEYACKAGTTTPFYT-GNNITTNDAN---- 184

Query: 235 WQGEFPTNNTAADGY-LSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
           + G +P NN     Y   T  V +  E N +GLY+M GNV EW ADW++  +      +P
Sbjct: 185 YDGNYPYNNNDQGEYRKKTVAVGTVGEPNSWGLYDMHGNVQEWCADWYD-DYPQGEQTDP 243

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
           KGP  GT ++ +GGS+      C   R A R  +  +S   + GFR    K
Sbjct: 244 KGPDNGTYRILRGGSWTSRAMQC---RSAYRGGSEHNSRLDDAGFRIVLIK 291



 Score = 68.6 bits (166), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 64/154 (41%), Gaps = 36/154 (23%)

Query: 5   PAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
           P  P +   + V +P  +F MG+ N     + GE   +      FY+ ++ ++  Q+ +F
Sbjct: 55  PVTPKDLGIEFVKIPAGSFLMGSPNDEPQRESGEIQHKVTITKDFYMSKYPITFEQYDKF 114

Query: 64  VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
             ATG                      R K S                  R + PV+ +S
Sbjct: 115 CEATG----------------------RQKPSDYTW-------------GRENRPVIDVS 139

Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLF 157
           W+DA A+C W G RLPTEAEWEY C+ G     +
Sbjct: 140 WDDATAFCQWIGCRLPTEAEWEYACKAGTTTPFY 173


>gi|29840376|ref|NP_829482.1| protein kinase [Chlamydophila caviae GPIC]
 gi|55583926|sp|Q822R1.1|PKN1_CHLCV RecName: Full=Serine/threonine-protein kinase pkn1
 gi|29834725|gb|AAP05360.1| protein kinase, putative [Chlamydophila caviae GPIC]
          Length = 618

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 63/167 (37%), Positives = 84/167 (50%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y +W G RLPTEAEWE    GG+    +P G  +    + +AN + 
Sbjct: 467 KHPVVGVTWYGASGYASWVGKRLPTEAEWEIASCGGVTQLRYPCGEEI---DKSQANFFS 523

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP---SYNPK 293
            +             T PVMSY  N +GLY+M GNV+EW  DW+    +      S+ P+
Sbjct: 524 SD-------------TTPVMSYPANPYGLYDMAGNVYEWCEDWYGYDFYEISAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 47/153 (30%), Positives = 73/153 (47%), Gaps = 18/153 (11%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + +MV + G  F  G+ +    +  E P   + L +F+LD H V+N QF  ++  +
Sbjct: 377 PQPLFTEMVFIEGGKFLRGSREG---QRDEHPVHEIFLHSFFLDIHPVTNEQFVRYLECS 433

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G  +E +K+ +  +         R K S++    +R  G          HPVV ++W  A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
             Y +W G RLPTEAEWE    GG+    +P G
Sbjct: 479 SGYASWVGKRLPTEAEWEIASCGGVTQLRYPCG 511


>gi|297621236|ref|YP_003709373.1| serine/threonine-protein kinase pkn1 [Waddlia chondrophila WSU
           86-1044]
 gi|297376537|gb|ADI38367.1| putative serine/threonine-protein kinase pkn1 [Waddlia chondrophila
           WSU 86-1044]
 gi|337293465|emb|CCB91454.1| Serine/threonine-protein kinase pkn1 [Waddlia chondrophila 2032/99]
          Length = 599

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/170 (37%), Positives = 88/170 (51%), Gaps = 22/170 (12%)

Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
           + HPVV V+W  AVAY  W G RLPTEAEWE   +   ++  +P GN++    + +AN +
Sbjct: 447 LKHPVVGVTWYGAVAYAKWLGKRLPTEAEWEVASQSHKDHAEYPTGNDIE---KTQANFF 503

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS---YNP 292
             +             T  VMSY  N  GLY+M GNV+EW  DW+  +++       +NP
Sbjct: 504 SSD-------------TTSVMSYPSNDNGLYDMAGNVYEWCQDWYGYNYYEISVQEPFNP 550

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            GP  G  +V +GG   C +      R A R +N P +     GFRCAAD
Sbjct: 551 TGPLQGVYRVLRGG---CWKGLKEDLRSAKRHRNNPGTGNSTYGFRCAAD 597



 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 46/150 (30%), Positives = 74/150 (49%), Gaps = 18/150 (12%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           + +MV++ G TF  G+++       E P   +++D+F +D H V+N QF  F+ A     
Sbjct: 362 HTEMVVIEGGTFTRGSSQG---NRDEMPKHEISIDSFAIDIHPVTNEQFVRFLEAM---- 414

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
             E   D+   + +  +E R         + R  G  +     + HPVV ++W  AVAY 
Sbjct: 415 --EGIKDSNHNDLINLKESR---------IHRSRGKCTIESGYLKHPVVGVTWYGAVAYA 463

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
            W G RLPTEAEWE   +   ++  +P G+
Sbjct: 464 KWLGKRLPTEAEWEVASQSHKDHAEYPTGN 493


>gi|90023627|ref|YP_529454.1| RecF protein [Saccharophagus degradans 2-40]
 gi|89953227|gb|ABD83242.1| protein of unknown function DUF323 [Saccharophagus degradans 2-40]
          Length = 338

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 104/347 (29%), Positives = 146/347 (42%), Gaps = 79/347 (22%)

Query: 14  DMVLLPGDTFRMGTNKPILIK-DGEFPSRNVTL-DAFYLDQHEVSNTQFQEFVSATGYVT 71
           +M+++P  +F MG+      + + E P   VTL   F + +HEV+  QF+ FV ATGY+T
Sbjct: 51  EMIVVPAGSFLMGSPASEEGRSEAEGPQLKVTLAKVFAMGKHEVTRGQFRAFVDATGYLT 110

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           +AE   D       + ++         +   R   L    +    HPV+ +S  DA+AY 
Sbjct: 111 QAEVEADRGCLTLDVRDDGNGNWGWSANQNWRNATLYGLEQGSDAHPVLCVSQQDALAYI 170

Query: 132 TW----------RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
           TW          R  RLPTEAEWEY  R G         S  +  G D T          
Sbjct: 171 TWFNETHLPPDERAYRLPTEAEWEYAARAG--------SSTTYSHGTDVT---------- 212

Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
                   A C              YG  G              P+G          +P 
Sbjct: 213 --------AMC-------------RYGNIG---------DQTALPKGH--------AWPE 234

Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----SYNPKGPTT 297
             T  DGY+ TAPV S++ N FGL++M GNVWEWT D W+ ++  AP    ++  +    
Sbjct: 235 WVTCDDGYVFTAPVGSFQANAFGLHDMHGNVWEWTEDCWHDNYIDAPVNGEAWEEENGGD 294

Query: 298 GTDKVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAAD 342
             ++V +GGS+     Y +R   R A R  N P      +GFR A D
Sbjct: 295 CANRVARGGSW-----YFHRGDLRVAFRFINNPAMRYSLMGFRLAQD 336


>gi|338733048|ref|YP_004671521.1| serine/threonine-protein kinase pkn1 [Simkania negevensis Z]
 gi|336482431|emb|CCB89030.1| serine/threonine-protein kinase pkn1 [Simkania negevensis Z]
          Length = 672

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/168 (36%), Positives = 87/168 (51%), Gaps = 22/168 (13%)

Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
           HPV+ V+W  A+AY  W G RLPTEAEWE     G E  ++P G  +      +AN +  
Sbjct: 520 HPVIGVTWYGAIAYAKWIGKRLPTEAEWEIAASCGNEENIYPTGATIE---RTQANFFSS 576

Query: 238 EFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKG 294
           +             T  V SY  N++GLY+M GN++EW  DW++ +++        NP+G
Sbjct: 577 D-------------TVTVKSYPPNEYGLYDMAGNIYEWCQDWYDYNYYEVSQQEPENPQG 623

Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           P  G  +V +GG   C +      RC+ R++N P       GFRCAAD
Sbjct: 624 PLQGVFRVLRGG---CWKSLKEDLRCSHRNRNNPGIVNRTYGFRCAAD 668



 Score = 73.6 bits (179), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 44/150 (29%), Positives = 71/150 (47%), Gaps = 18/150 (12%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV++ G  F  G+N+       E P   + + +F +D H V+N QF  F+   G   +A
Sbjct: 435 EMVIIKGGKFERGSNQG---GRDERPRHKIEISSFAMDIHAVTNEQFVRFLEVMGGEKDA 491

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                        + ++  ++ + R  +KR  G          HPV+ ++W  A+AY  W
Sbjct: 492 -------------NNQDIIQLKESR--IKRHSGKLVIESGYAKHPVIGVTWYGAIAYAKW 536

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWL 163
            G RLPTEAEWE     G E  ++P G+ +
Sbjct: 537 IGKRLPTEAEWEIAASCGNEENIYPTGATI 566


>gi|421481579|ref|ZP_15929162.1| hypothetical protein QWC_03223 [Achromobacter piechaudii HLE]
 gi|400199894|gb|EJO32847.1| hypothetical protein QWC_03223 [Achromobacter piechaudii HLE]
          Length = 335

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 69/179 (38%), Positives = 94/179 (52%), Gaps = 22/179 (12%)

Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           +HPVV++SWNDA A   W      R  RLPTEAEWEY CR G   R   + N   P G  
Sbjct: 156 DHPVVNISWNDAQALARWLTQKEGRVYRLPTEAEWEYACRAGTRTR---YQNGDDPAGMP 212

Query: 231 RAN---------VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
            A          +W       +   DGY  TAPV +Y  N FGLY+M GN WEW ADW+ 
Sbjct: 213 DAGNGFDADAQPLWPKWQAFASARHDGYTFTAPVATYAPNAFGLYDMHGNAWEWVADWYG 272

Query: 282 VHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
             ++  +P+ +P+GP +G  +V++GGS+     Y    R + R+ NT ++    +G R 
Sbjct: 273 EDYYAHSPTDDPQGPDSGNVRVRRGGSWHTWALYT---RSSYRNWNTQETRYPLVGLRL 328



 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 62/168 (36%), Positives = 82/168 (48%), Gaps = 31/168 (18%)

Query: 15  MVLLPGDTFRMGTNK---------PILIKD------GEFPSRNVTLD-AFYLDQHEVSNT 58
            V +P  TF+MG+++         P   KD       E P  +V +   FYL++ EV+  
Sbjct: 38  FVAIPAGTFQMGSDETPEALARDYPQYPKDRFAQLFDEAPVHSVRITRPFYLERTEVTVG 97

Query: 59  QFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
           QF+ F+ A+GYV E+E      +G    ++P  SE   A   + R    R  G   T +H
Sbjct: 98  QFRRFIEASGYVPESEADGTGGYGYNAAYDPDKSERGDAFEGRDRRYSWRNPGFAQTDDH 157

Query: 114 RMHHPVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR 155
               PVV+ISWNDA A   W      R  RLPTEAEWEY CR G   R
Sbjct: 158 ----PVVNISWNDAQALARWLTQKEGRVYRLPTEAEWEYACRAGTRTR 201


>gi|89898193|ref|YP_515303.1| serine/threonine protein kinase [Chlamydophila felis Fe/C-56]
 gi|89331565|dbj|BAE81158.1| serine/threonine protein kinase [Chlamydophila felis Fe/C-56]
          Length = 618

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/167 (37%), Positives = 84/167 (50%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y +W G RLPTEAEWE    GG+  + +P G  +           Q
Sbjct: 467 KHPVVGVTWYGASGYASWVGKRLPTEAEWEVAAYGGIAQQRYPCGEEIDKS--------Q 518

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP---SYNPK 293
             F +++T A        VMSY  N +GLY+M GNV+EW  DW+    +      S+ P+
Sbjct: 519 ANFFSSDTTA--------VMSYPANPYGLYDMAGNVYEWCEDWYGYDFYEISAQESHCPR 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 15/124 (12%)

Query: 37  EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
           E P   + L++F+LD H V+N QF  ++   G  +E +K+ +  +         R K S+
Sbjct: 403 EHPVHEIFLESFFLDIHPVTNEQFVRYLECAG--SEQDKYYNELI---------RLKDSR 451

Query: 97  VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
           +    +R  G          HPVV ++W  A  Y +W G RLPTEAEWE    GG+  + 
Sbjct: 452 I----QRRSGKLVIEPGYAKHPVVGVTWYGASGYASWVGKRLPTEAEWEVAAYGGIAQQR 507

Query: 157 FPWG 160
           +P G
Sbjct: 508 YPCG 511


>gi|395007433|ref|ZP_10391177.1| hypothetical protein PMI14_03869 [Acidovorax sp. CF316]
 gi|394314580|gb|EJE51467.1| hypothetical protein PMI14_03869 [Acidovorax sp. CF316]
          Length = 360

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 67/177 (37%), Positives = 96/177 (54%), Gaps = 18/177 (10%)

Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGN-------N 223
           +HPVV+V+WNDA A   W      +  RLPTEAEWEY CR G   R +P G+       N
Sbjct: 160 DHPVVNVTWNDAQALAQWLSRTEGQRYRLPTEAEWEYACRAGTRTR-YPHGDDPEGLLQN 218

Query: 224 LTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW-WNV 282
                +  A  W+         +DG+  TAPV SY  N FGL +M+GN WEW ADW  + 
Sbjct: 219 ANTFDQEAAPHWERWRQHALRGSDGHAFTAPVGSYAPNAFGLQDMLGNAWEWVADWHGDS 278

Query: 283 HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           ++  +P+ +P GP  G+ +V++GGS+     + +  RC  R+ N+P +    +G R 
Sbjct: 279 YYAESPAVDPGGPADGSVRVRRGGSW---HTWSFYARCGYRNWNSPQTRYTLVGMRL 332



 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 50/135 (37%), Positives = 66/135 (48%), Gaps = 10/135 (7%)

Query: 41  RNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRH 99
           R V   AFYL QHEV+  QF+ F+ A+GY  E+E  G   + + P        +      
Sbjct: 84  RVVIRQAFYLGQHEVTVGQFRRFLQASGYQPESEADGTGGYGYNPRYDPATTRRGDAFEG 143

Query: 100 DMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLE 153
              R+   +        HPVV+++WNDA A   W      +  RLPTEAEWEY CR G  
Sbjct: 144 RDPRYSWRNPGFAQGEDHPVVNVTWNDAQALAQWLSRTEGQRYRLPTEAEWEYACRAGTR 203

Query: 154 NRLFPWGSWLHPEGI 168
            R +P G    PEG+
Sbjct: 204 TR-YPHGD--DPEGL 215


>gi|209733864|gb|ACI67801.1| Sulfatase-modifying factor 1 precursor [Salmo salar]
          Length = 209

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 51/82 (62%), Positives = 65/82 (79%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+PG  F MGT+ P + +DGE P R V LD FY+D HEVSN  FQ F++ATGY+TEAE
Sbjct: 98  MVLIPGGEFLMGTDNPGIPQDGEGPQRRVHLDHFYMDAHEVSNRHFQSFINATGYITEAE 157

Query: 75  KFGDTFVFEPLLSEEERAKISQ 96
           +FGD+FVFE +LSEE +++ISQ
Sbjct: 158 RFGDSFVFEGVLSEEVKSQISQ 179


>gi|402495015|ref|ZP_10841749.1| gliding motility-associated lipoprotein GldK [Aquimarina
           agarilytica ZC1]
          Length = 453

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 110/413 (26%), Positives = 162/413 (39%), Gaps = 94/413 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
           M L+PG  F MG +          P + VT+ +FY+D+ E++N+++++F       V  T
Sbjct: 36  MTLIPGGAFIMGKSDDDKAGLLNAPPKTVTVRSFYMDETEITNSEYRQFTEWVKDSVVRT 95

Query: 68  GYVTEAEKFGDT---------FVFEPLLSEEERAKISQVRHDMKRFEGLDST-------- 110
                A++ G T         F F     EEE +   +   D   +EGL  T        
Sbjct: 96  QLAITADEVGQTGSDEDGIGQFAFRNTEDEEELSVYEKYVRD--NYEGLGETGYEGKALN 153

Query: 111 -----IEHRMHHPVVHISWNDAVAYC----------TWRGARLPTEAEWEYGCRGGLENR 155
                I     +P  + +      Y           T++  +L  + +W Y       +R
Sbjct: 154 KDVDLITDLNEYPDEYYAEVMDSLYVRAEEAHNGVRTFKTEKLVYDYKW-YDVEAMARSR 212

Query: 156 LFPWGSWLHPEGI----DSTI------------EHRM--------NHPVVHVSWNDAVAY 191
                 +L  E +    D+T+             H M        ++PVV V+W  A A+
Sbjct: 213 SKDSKKFLKREALKIYPDTTVWIRDFSYSYNEPMHDMYFTHTSYDDYPVVGVTWKQAKAF 272

Query: 192 CTWRGA------------------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
           C WR                    RLPTEAEWEY  RGGLE+  FPWG    P  ++   
Sbjct: 273 CDWRTKNHNDALRKKKNKDLVSRYRLPTEAEWEYAARGGLESGTFPWGG---PYAKNDRG 329

Query: 234 VWQGEFP--TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSY 290
            +   F     + AAD  L T    SY  N +GLYNM GNV EW    ++   +    S 
Sbjct: 330 CFLANFKPMRGDYAADQALYTVEANSYDPNDYGLYNMAGNVSEWVGSTYDPASYEYTSSM 389

Query: 291 NPKGPTT-GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           NP         KV +GGS+   +   Y  + + R     DS+   +GFR   D
Sbjct: 390 NPNSSDDENKRKVVRGGSW---KDVAYFLQVSTRDYEYADSARSYIGFRAVQD 439


>gi|425744693|ref|ZP_18862748.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii WC-323]
 gi|425490289|gb|EKU56589.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
           baumannii WC-323]
          Length = 334

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/164 (39%), Positives = 88/164 (53%), Gaps = 5/164 (3%)

Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQG 237
           PV +V+ ND   Y  W G  LPTE EWEY  + G +    P         +H +AN WQG
Sbjct: 165 PVRYVTKNDTEHYAVWLGHDLPTELEWEYAAKAGSKTDT-PLHQGPQDAHQHPQANYWQG 223

Query: 238 EFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNP--KG 294
           +FP NNT  D +   APV  Y  N F L++M+GNVWEWT+  ++  H     +Y    + 
Sbjct: 224 DFPGNNTVEDHFEGVAPVGCYPANAFKLFDMIGNVWEWTSSVYHGAHDQHMGNYTDLRQH 283

Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
               T  V KGGS+LC   +C R+R ++R     D +A ++GFR
Sbjct: 284 NNATTQYVLKGGSFLCASNFCARYRNSSRYPQEFDLAATHVGFR 327


>gi|162454345|ref|YP_001616712.1| protein kinase [Sorangium cellulosum So ce56]
 gi|161164927|emb|CAN96232.1| Protein kinase [Sorangium cellulosum So ce56]
          Length = 733

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 104/350 (29%), Positives = 138/350 (39%), Gaps = 108/350 (30%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEF----PSRNVTLDAFYLDQHEVSNTQFQ--------- 61
           MVL+PG  F MG+++P      EF    P+  VT+D F +D HEV+   ++         
Sbjct: 467 MVLVPGGKFFMGSDEP------EFKLWQPAHKVTIDTFCIDVHEVTAGDYKACSDKGECK 520

Query: 62  ------EFVSATGYVTEAEKFGDTFVFEP-LLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
                 +F + +G    A +   T + E     + E AK                     
Sbjct: 521 RPPAAPDFPNTSGAPAAAHQKTLTALAEQCTFGKPELAK--------------------- 559

Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH 174
             HP+  + W+ A AYC   G RLPTEAEWEY  RG  + R FPWG              
Sbjct: 560 --HPINCLPWSLANAYCEVHGKRLPTEAEWEYAARGS-DGRKFPWG-------------- 602

Query: 175 RMNHPVVHVSWNDAVAYCT-WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
             + P      N     CT W  AR  T +E  Y                          
Sbjct: 603 --DKPGDQTFMNACGTECTAWEKARGLTPSERMY-------------------------- 634

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN 291
                      A DG+  TAPV S+     +FG  ++VGNVWEWTADW+  +       N
Sbjct: 635 ----------EADDGFPGTAPVGSFPNGRTRFGADDVVGNVWEWTADWFETYKD-EEQVN 683

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
           PKG   G  K  +GG+Y  N  +      A R      +SA  +GFRCAA
Sbjct: 684 PKGAPAGDRKAIRGGAY--NGGFALWLNPAFRYHQLATASAPGIGFRCAA 731


>gi|406039676|ref|ZP_11047031.1| hypothetical protein AursD1_07627 [Acinetobacter ursingii DSM 16037
           = CIP 107286]
          Length = 332

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 65/166 (39%), Positives = 86/166 (51%), Gaps = 5/166 (3%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVW 235
           N PV  V+ NDA  Y  W G  LP+E EWEY  + G + +  P         +H +AN W
Sbjct: 163 NEPVRFVTKNDAEHYAVWLGRDLPSEIEWEYAAKAG-QTQDTPLHQAPMDHHQHPQANYW 221

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKG 294
           QGEFP +NT  D +   APV  +  N F LY+ +GNVWEWT+  +   H      Y    
Sbjct: 222 QGEFPFHNTKQDQFEGVAPVGCFAANNFQLYDTIGNVWEWTSSVYQGAHDEHMGDYQHLR 281

Query: 295 PTT--GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
            T     + V KGGSYLC + YC R R ++R     D +  ++GFR
Sbjct: 282 QTNIPSQNFVIKGGSYLCADNYCARFRNSSRHPQEFDLATSHVGFR 327


>gi|407455448|ref|YP_006734339.1| hypothetical protein B598_0661 [Chlamydia psittaci GR9]
 gi|405781991|gb|AFS20740.1| hypothetical protein B598_0661 [Chlamydia psittaci GR9]
          Length = 618

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+  + +P G  +    +  AN   
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCGEEI---DKSLANF-- 521

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
             F ++ TA         VMSY  N +GLY+M GNV+EW  DW++   +      S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 92/210 (43%), Gaps = 37/210 (17%)

Query: 2   VLLPAPPVERYK--------------DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDA 47
           +L+  P V RY               +MV + G TF  G+ +    +  E P   + L +
Sbjct: 357 LLIREPVVSRYVEEEKEEVKPQPLLTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQS 413

Query: 48  FYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGL 107
           F+LD H V+N QF  ++  +G  +E +K+ +  +         R K S++    +R  G 
Sbjct: 414 FFLDIHPVTNEQFVRYLECSG--SEQDKYYNELI---------RLKDSRI----QRRSGK 458

Query: 108 DSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
                    HPVV ++W  A  Y  W G RLPTEAEWE    GG+  + +P G     E 
Sbjct: 459 LVIEPGYAKHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCG-----EE 513

Query: 168 IDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
           ID ++ +  +     V    A AY  +  A
Sbjct: 514 IDKSLANFFSSDTTAVMSYPANAYGLYDMA 543


>gi|332661785|ref|YP_004451255.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
            1100]
 gi|332337282|gb|AEE54382.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis DSM
            1100]
          Length = 2048

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 71/191 (37%), Positives = 98/191 (51%), Gaps = 42/191 (21%)

Query: 175  RMNHPVVHVSWNDAVAYCTWR------------------------GARLPTEAEWEYGCR 210
            R   PV++VSW DA+ YC WR                        G RLPTEAEWE+  R
Sbjct: 965  RGTRPVINVSWFDAIDYCNWRSQEEGLQQVYQVNKEQVTPNWSANGYRLPTEAEWEFAAR 1024

Query: 211  GGLENRLFPW-GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMV 269
            GG E++ F + G+N         NV +  + + N+ A     T PV   K N+ G+Y+M 
Sbjct: 1025 GGSESQGFTYAGSN---------NVDEVAWFSENSGA----KTQPVGQKKANELGIYDMS 1071

Query: 270  GNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
            GNVWEW  DW++ +   A + +PKGP TG+ +V + GS+  +   C   R A RS N P+
Sbjct: 1072 GNVWEWCWDWYSAYSSSA-TKDPKGPDTGSIRVLRCGSWFYDADGC---RVAFRSDNDPN 1127

Query: 330  SSAGNLGFRCA 340
             +  N GFR A
Sbjct: 1128 VANSNCGFRIA 1138



 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 74/259 (28%), Positives = 104/259 (40%), Gaps = 72/259 (27%)

Query: 121  HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
             +++ +  A+CT  G  LP ++ W  G R                             PV
Sbjct: 1234 ELTFEEYDAFCTATGRELPDDSGWGRGKR-----------------------------PV 1264

Query: 181  VHVSWNDAVAYCTWR---------------------------------------GARLPT 201
            ++V W DA+ YC WR                                       G RLPT
Sbjct: 1265 INVDWYDAIEYCNWRSSQEGLQTVYSINKNTKDSNNSSPSDTKQWQVTTSRSANGYRLPT 1324

Query: 202  EAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKEN 261
            EAEWEY  R G +   F  G ++    E   N   G     +   +    T PV S+  N
Sbjct: 1325 EAEWEYAARQGGQKVRFGNGKDIADPQEINFNALSGYKKRYSVVGEYRQKTLPVGSFSPN 1384

Query: 262  KFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCA 321
               L++M GNVWEW  DW+  +   A S +P+G T G+D+V +GGS+  N  +    R A
Sbjct: 1385 TLDLFDMSGNVWEWCGDWYGPYPSSA-SNDPQGATGGSDRVFRGGSWRSNPIFT---RVA 1440

Query: 322  ARSQNTPDSSAGNLGFRCA 340
             R + +P   +GN GFR A
Sbjct: 1441 LRYRISPGDRSGNKGFRLA 1459



 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 66/192 (34%), Positives = 90/192 (46%), Gaps = 44/192 (22%)

Query: 175  RMNHPVVHVSWNDAVAYCTWR------------------------GARLPTEAEWEYGCR 210
            R   PV++V+W DA+ YC WR                        G RLPTEAEWEY  +
Sbjct: 1572 RGTRPVINVNWFDAIDYCNWRSQQENLKPVYNGGKDGFTADWSANGYRLPTEAEWEYAAK 1631

Query: 211  GGLENRLFPWG--NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNM 268
            GGL +R F +   NN+          W G    N         T PV   K N+ GL++M
Sbjct: 1632 GGLSSRGFKYAGSNNVDEVA------WYGSNSGN--------KTQPVGQKKANELGLFDM 1677

Query: 269  VGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTP 328
             GNV EW  DW+  + + A + +P+GP  G  +V + GS+  +  YC   R A R +N P
Sbjct: 1678 SGNVQEWCWDWYEAYSNRAFN-DPRGPDFGMYRVLRSGSWFNSSDYC---RVAHRDRNVP 1733

Query: 329  DSSAGNLGFRCA 340
              +  + GFR A
Sbjct: 1734 RRAGDDYGFRLA 1745



 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 90/193 (46%), Gaps = 46/193 (23%)

Query: 175  RMNHPVVHVSWNDAVAYCTWR------------------------GARLPTEAEWEYGCR 210
            R + P ++V+W DAV YC WR                        G RLPTEAEWEY  R
Sbjct: 1859 RGDRPAINVNWFDAVDYCNWRSQEEGLQQVYHVNMEQVTPNWSANGYRLPTEAEWEYAAR 1918

Query: 211  GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVG 270
            GG E++ F +  +          V +  + + N+    +L    V   K N+ GL++M G
Sbjct: 1919 GGSESQGFTYAGS--------RKVLEVAWFSENSGGSTHL----VGQKKTNELGLFDMSG 1966

Query: 271  NVWEWTADWWNVHHHPAPSY---NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNT 327
            NVWEW  DW    H   P+    +P+GP+ G+ +V +GGS+L   + C+    A R    
Sbjct: 1967 NVWEWCWDW----HAAYPNSFINDPRGPSAGSRRVLRGGSWLNEAELCH---VATRLNYN 2019

Query: 328  PDSSAGNLGFRCA 340
            P  +    GFR A
Sbjct: 2020 PLYADNRFGFRLA 2032



 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 47/171 (27%), Positives = 68/171 (39%), Gaps = 70/171 (40%)

Query: 15   MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY-VTEA 73
            MVL+ G TF MG          E  + +VTL  FY+ +H+++  ++  F  ATG  +T+ 
Sbjct: 910  MVLVKGGTFEMGE---------ENNTHSVTLSDFYIGKHQLTFAEYDAFCRATGRELTKD 960

Query: 74   EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
              +G                                    R   PV+++SW DA+ YC W
Sbjct: 961  RGWG------------------------------------RGTRPVINVSWFDAIDYCNW 984

Query: 134  R------------------------GARLPTEAEWEYGCRGGLENRLFPWG 160
            R                        G RLPTEAEWE+  RGG E++ F + 
Sbjct: 985  RSQEEGLQQVYQVNKEQVTPNWSANGYRLPTEAEWEFAARGGSESQGFTYA 1035



 Score = 54.7 bits (130), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 48/182 (26%), Positives = 68/182 (37%), Gaps = 69/182 (37%)

Query: 4    LPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
            L A   E+   MVL+ G TF+MG  K          S  VTL  F + + +++  +F  F
Sbjct: 1792 LIAIIQEKQDSMVLVKGGTFQMGEVKK--------NSHQVTLSDFLIAKRQLTFDEFDAF 1843

Query: 64   VSATGYVTEAEK-FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
              AT     +++ +G                                    R   P +++
Sbjct: 1844 CKATSRELPSDRNWG------------------------------------RGDRPAINV 1867

Query: 123  SWNDAVAYCTWR------------------------GARLPTEAEWEYGCRGGLENRLFP 158
            +W DAV YC WR                        G RLPTEAEWEY  RGG E++ F 
Sbjct: 1868 NWFDAVDYCNWRSQEEGLQQVYHVNMEQVTPNWSANGYRLPTEAEWEYAARGGSESQGFT 1927

Query: 159  WG 160
            + 
Sbjct: 1928 YA 1929


>gi|329942958|ref|ZP_08291737.1| hypothetical protein G5Q_0636 [Chlamydophila psittaci Cal10]
 gi|332287549|ref|YP_004422450.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           6BC]
 gi|384450705|ref|YP_005663305.1| s/t protein kinase [Chlamydophila psittaci 6BC]
 gi|384451703|ref|YP_005664301.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           01DC11]
 gi|384452677|ref|YP_005665274.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           08DC60]
 gi|384453653|ref|YP_005666249.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           C19/98]
 gi|384454632|ref|YP_005667227.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           02DC15]
 gi|392376781|ref|YP_004064559.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           RD1]
 gi|407454166|ref|YP_006733274.1| hypothetical protein B595_0715 [Chlamydia psittaci 84/55]
 gi|407456852|ref|YP_006735425.1| hypothetical protein B600_0709 [Chlamydia psittaci VS225]
 gi|407458186|ref|YP_006736491.1| hypothetical protein B601_0665 [Chlamydia psittaci WS/RT/E30]
 gi|407459433|ref|YP_006737536.1| hypothetical protein B602_0666 [Chlamydia psittaci M56]
 gi|407460805|ref|YP_006738580.1| hypothetical protein B603_0670 [Chlamydia psittaci WC]
 gi|313848124|emb|CBY17125.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           RD1]
 gi|325506747|gb|ADZ18385.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           6BC]
 gi|328815218|gb|EGF85207.1| hypothetical protein G5Q_0636 [Chlamydophila psittaci Cal10]
 gi|328914799|gb|AEB55632.1| s/t protein kinase [Chlamydophila psittaci 6BC]
 gi|334692434|gb|AEG85653.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           C19/98]
 gi|334693413|gb|AEG86631.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           01DC11]
 gi|334694389|gb|AEG87606.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           02DC15]
 gi|334695366|gb|AEG88582.1| putative serine/threonine protein kinase [Chlamydophila psittaci
           08DC60]
 gi|405780925|gb|AFS19675.1| hypothetical protein B595_0715 [Chlamydia psittaci 84/55]
 gi|405784113|gb|AFS22860.1| hypothetical protein B600_0709 [Chlamydia psittaci VS225]
 gi|405785555|gb|AFS24301.1| hypothetical protein B601_0665 [Chlamydia psittaci WS/RT/E30]
 gi|405785667|gb|AFS24412.1| hypothetical protein B602_0666 [Chlamydia psittaci M56]
 gi|405787550|gb|AFS26294.1| hypothetical protein B603_0670 [Chlamydia psittaci WC]
          Length = 618

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+  + +P G  +    +  AN   
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCGEEI---DKSLANF-- 521

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
             F ++ TA         VMSY  N +GLY+M GNV+EW  DW++   +      S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 87/190 (45%), Gaps = 23/190 (12%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + +MV + G TF  G+ +    +  E P   + L +F+LD H V+N QF  ++  +
Sbjct: 377 PQPLFTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G  +E +K+ +  +         R K S++    +R  G          HPVV ++W  A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             Y  W G RLPTEAEWE    GG+  + +P G     E ID ++ +  +     V    
Sbjct: 479 SGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533

Query: 188 AVAYCTWRGA 197
           A AY  +  A
Sbjct: 534 ANAYGLYDMA 543


>gi|424825250|ref|ZP_18250237.1| putative serine/threonine protein kinase [Chlamydophila abortus
           LLG]
 gi|333410349|gb|EGK69336.1| putative serine/threonine protein kinase [Chlamydophila abortus
           LLG]
          Length = 618

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+  + +P G  +    +  AN   
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEIAAYGGVAQQRYPCGEEI---DKSLANF-- 521

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
             F ++ TA         VMSY  N +GLY+M GNV+EW  DW++   +      S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 55/190 (28%), Positives = 86/190 (45%), Gaps = 23/190 (12%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + +MV + G  F  G+ +    +  E P   + L +F+LD H V+N QF  ++  +
Sbjct: 377 PQPLFTEMVFIEGGAFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G  +E +K+ +  +         R K S++    +R  G          HPVV ++W  A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             Y  W G RLPTEAEWE    GG+  + +P G     E ID ++ +  +     V    
Sbjct: 479 SGYAAWVGKRLPTEAEWEIAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533

Query: 188 AVAYCTWRGA 197
           A AY  +  A
Sbjct: 534 ANAYGLYDMA 543


>gi|62185206|ref|YP_219991.1| serine/threonine protein kinase [Chlamydophila abortus S26/3]
 gi|62148273|emb|CAH64038.1| putative serine/threonine protein kinase [Chlamydophila abortus
           S26/3]
          Length = 618

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+  + +P G  +    +  AN   
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEIAAYGGVAQQRYPCGEEI---DKSLANF-- 521

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
             F ++ TA         VMSY  N +GLY+M GNV+EW  DW++   +      S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 55/190 (28%), Positives = 86/190 (45%), Gaps = 23/190 (12%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + +MV + G  F  G+ +    +  E P   + L +F+LD H V+N QF  ++  +
Sbjct: 377 PQPLFTEMVFIEGGAFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G  +E +K+ +  +         R K S++    +R  G          HPVV ++W  A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             Y  W G RLPTEAEWE    GG+  + +P G     E ID ++ +  +     V    
Sbjct: 479 SGYAAWVGKRLPTEAEWEIAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533

Query: 188 AVAYCTWRGA 197
           A AY  +  A
Sbjct: 534 ANAYGLYDMA 543


>gi|448734250|ref|ZP_21716476.1| sulfatase-modifying factor [Halococcus salifodinae DSM 8989]
 gi|445800298|gb|EMA50653.1| sulfatase-modifying factor [Halococcus salifodinae DSM 8989]
          Length = 133

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 58/128 (45%), Positives = 74/128 (57%), Gaps = 18/128 (14%)

Query: 232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH------- 284
           AN WQGEFP  N A DGY  T+PV S+  N+FGLY+M GNVW+WT DW++          
Sbjct: 2   ANTWQGEFPHENKALDGYERTSPVGSFPANEFGLYDMAGNVWDWTQDWYSADPPTSEPEA 61

Query: 285 ---HPA--------PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
               PA         S +P+ P+    KV KGGS+LC   YC+R+R AAR     D+S  
Sbjct: 62  SCCTPANLRTVTEEQSVDPRDPSEIPRKVLKGGSHLCAPNYCFRYRPAARYPEPIDTSTS 121

Query: 334 NLGFRCAA 341
           ++GFRC A
Sbjct: 122 HVGFRCVA 129


>gi|406593560|ref|YP_006740739.1| hypothetical protein B712_0664 [Chlamydia psittaci NJ1]
 gi|405789432|gb|AFS28174.1| hypothetical protein B712_0664 [Chlamydia psittaci NJ1]
          Length = 618

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+  + +P G  +    +  AN   
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCGEEI---DKSLANF-- 521

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
             F ++ TA         VMSY  N +GLY+M GNV+EW  DW++   +      S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 55/190 (28%), Positives = 86/190 (45%), Gaps = 23/190 (12%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + +MV + G  F  G+ +    +  E P   + L +F+LD H V+N QF  ++  +
Sbjct: 377 PQPLFTEMVFIEGGAFTRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G  +E +K+ +  +         R K S++    +R  G          HPVV ++W  A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             Y  W G RLPTEAEWE    GG+  + +P G     E ID ++ +  +     V    
Sbjct: 479 SGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533

Query: 188 AVAYCTWRGA 197
           A AY  +  A
Sbjct: 534 ANAYGLYDMA 543


>gi|449071261|ref|YP_007438341.1| serine/threonine protein kinase [Chlamydophila psittaci Mat116]
 gi|449039769|gb|AGE75193.1| serine/threonine protein kinase [Chlamydophila psittaci Mat116]
          Length = 618

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+  + +P G  +    +  AN   
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCGEEI---DKSLANF-- 521

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
             F ++ TA         VMSY  N +GLY+M GNV+EW  DW++   +      S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614



 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 87/190 (45%), Gaps = 23/190 (12%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P   + +MV + G TF  G+ +    +  E P   + L +F+LD H V+N QF  ++  +
Sbjct: 377 PQPLFTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G  +E +K+ +  +         R K S++    +R  G          HPVV ++W  A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
             Y  W G RLPTEAEWE    GG+  + +P G     E ID ++ +  +     V    
Sbjct: 479 SGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533

Query: 188 AVAYCTWRGA 197
           A AY  +  A
Sbjct: 534 ANAYGLYDMA 543


>gi|373456878|ref|ZP_09548645.1| Sulphatase-modifying factor protein [Caldithrix abyssi DSM 13497]
 gi|371718542|gb|EHO40313.1| Sulphatase-modifying factor protein [Caldithrix abyssi DSM 13497]
          Length = 298

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 69/183 (37%), Positives = 98/183 (53%), Gaps = 10/183 (5%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGG---LENRLFPWGNNLTPRGE-HRA 232
           NHPVV+V+W DA AY  W G RLPTEAEWEY  + G   L   L    + +T  G     
Sbjct: 111 NHPVVNVNWYDAQAYAKWAGKRLPTEAEWEYVAKAGGNSLNYTLTAERSYITSLGNVADF 170

Query: 233 NVWQGEF--PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APS 289
           ++ Q ++     N   DG+  T+PV  +  N FG+Y++ GNV EW ADW++  ++  +P+
Sbjct: 171 SLRQKDYHRIVMNNYEDGFPYTSPVGYFPPNPFGIYDLEGNVLEWCADWYDPQYYAQSPA 230

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
            NP GP  G  +V +GGS+  +  Y    R   R+   P+ +   LGFRCA D       
Sbjct: 231 ENPTGPAQGKYRVIRGGSWNRSGDYL---RTTYRTWYPPECTFEFLGFRCAMDAEAALPA 287

Query: 350 DKV 352
            K+
Sbjct: 288 KKI 290


>gi|15835040|ref|NP_296799.1| hypothetical protein TC0422 [Chlamydia muridarum Nigg]
 gi|270285205|ref|ZP_06194599.1| serine/threonine protein kinase [Chlamydia muridarum Nigg]
 gi|270289224|ref|ZP_06195526.1| serine/threonine protein kinase [Chlamydia muridarum Weiss]
 gi|301336601|ref|ZP_07224803.1| serine-threonine-protein kinase [Chlamydia muridarum MopnTet14]
 gi|55667874|sp|Q9PKP3.1|PKN1_CHLMU RecName: Full=Serine/threonine-protein kinase pkn1
 gi|7190465|gb|AAF39278.1| conserved hypothetical protein [Chlamydia muridarum Nigg]
          Length = 614

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 86/169 (50%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y +W G RLP+EAEWE    GG     +P G ++    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYASWIGKRLPSEAEWEVAAAGGKLGMRYPTGEDV---DKSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
            +             T PVMSY  N  GLY+M GN++EW  DW++   +   +    +P+
Sbjct: 520 SD-------------TTPVMSYPANILGLYDMAGNIYEWCQDWYSYDFYENSALEPDSPQ 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKEDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 67.0 bits (162), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 49/151 (32%), Positives = 69/151 (45%), Gaps = 26/151 (17%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDG---EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           +MV + G  F  G+      +DG   E P  N+TL  F LD H V+N  F  F+   G  
Sbjct: 379 EMVFIEGGDFSRGS------RDGQRDELPVHNITLPGFLLDIHPVTNEHFVRFLEFIG-- 430

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVA 129
            E ++  +  +         R K S+++    R       IE     HPVV ++W  A +
Sbjct: 431 GEQDEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASS 476

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
           Y +W G RLP+EAEWE    GG     +P G
Sbjct: 477 YASWIGKRLPSEAEWEVAAAGGKLGMRYPTG 507


>gi|302527058|ref|ZP_07279400.1| CalU17 [Streptomyces sp. AA4]
 gi|302435953|gb|EFL07769.1| CalU17 [Streptomyces sp. AA4]
          Length = 287

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 102/344 (29%), Positives = 151/344 (43%), Gaps = 79/344 (22%)

Query: 14  DMVLLPGDTFRMGT-----------NKPILIK--DGEFPSRNVTLDAFYLDQHEVSNTQF 60
           +MV +P  +F  G+           ++P+  +    E P  +  L  + +D+  V+  +F
Sbjct: 3   EMVSIPAGSFLQGSPPWVLRWLDESDQPLPPEWFGDETPQVSRRLPPYRIDRRPVTVAEF 62

Query: 61  QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
            EFV ATGY T+AE  G   +++   +E E         D +   G  S    R +HPVV
Sbjct: 63  AEFVRATGYRTDAESRGYGMLYDRGWTEREGV-------DWRAPAGPGSGTAGRENHPVV 115

Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
           ++SW+DA AY  W G RLPTE EWEY  RG    R++PWG+  +P G+ +T E+      
Sbjct: 116 NVSWDDASAYAAWTGKRLPTEPEWEYAARGA-GFRIWPWGNRWNP-GLANTAEY------ 167

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
                          G ++ T AEW              W   +   GE RA++      
Sbjct: 168 --------------YGEKMTTLAEWRK------------WWATV---GESRADL------ 192

Query: 241 TNNTAADGYLSTAPVMSYK---ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTT 297
                      T PV ++    ++ FG  +M GNV+EWTA     +   A       P  
Sbjct: 193 ---------PLTTPVGAFSPAGDSVFGCADMAGNVYEWTATVSEPYADTARFDPAIRPAM 243

Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS-SAGNLGFRCA 340
           G  +V +GGS++    + Y+ RC  R    P   S   LGFRCA
Sbjct: 244 GNYRVIRGGSWM---NFRYQVRCCERLHGDPLGWSNFALGFRCA 284


>gi|302038590|ref|YP_003798912.1| hypothetical protein NIDE3299 [Candidatus Nitrospira defluvii]
 gi|300606654|emb|CBK42987.1| conserved exported protein of unknown function DUF323 [Candidatus
           Nitrospira defluvii]
          Length = 317

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 71/179 (39%), Positives = 93/179 (51%), Gaps = 21/179 (11%)

Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
           +N P V+VSW DA AYC WRG RLP+EAEWE   R G + RL+PWGN   P   + A V 
Sbjct: 133 VNQPAVYVSWEDAKAYCEWRGRRLPSEAEWEKAMR-GPDGRLWPWGNVEVPNAANWARV- 190

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHHPAPS-YNP 292
                      DG+   APV   K +   +G+ +  GNV EW  DW+    + A S  NP
Sbjct: 191 ----------DDGFDVAAPVGRVKSDASPYGVMDGAGNVMEWVNDWYLEGAYAAASERNP 240

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA---ADKGPTTG 348
           + P  GT KV +G +Y  +       R  ARS+   D     +GFRCA   A+ G ++G
Sbjct: 241 ESPEYGTYKVLRGAAYTSSGSDL---RITARSKMMQDFRDETIGFRCAQSSAENGRSSG 296



 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 51/144 (35%), Positives = 71/144 (49%), Gaps = 31/144 (21%)

Query: 18  LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
           +P   F  GTN+       E P R + LD F +D++EV+N Q+  FV ATG+        
Sbjct: 66  IPAGPFVFGTNRGGF---DEQPERTIYLDQFLIDRYEVTNAQYAAFVKATGH-------- 114

Query: 78  DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
                       +    S+   +  R  G+        + P V++SW DA AYC WRG R
Sbjct: 115 -----------RKSGPPSRYAKNTLRMRGV--------NQPAVYVSWEDAKAYCEWRGRR 155

Query: 138 LPTEAEWEYGCRGGLENRLFPWGS 161
           LP+EAEWE   R G + RL+PWG+
Sbjct: 156 LPSEAEWEKAMR-GPDGRLWPWGN 178


>gi|319792296|ref|YP_004153936.1| hypothetical protein Varpa_1615 [Variovorax paradoxus EPS]
 gi|315594759|gb|ADU35825.1| protein of unknown function DUF323 [Variovorax paradoxus EPS]
          Length = 323

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 69/182 (37%), Positives = 92/182 (50%), Gaps = 22/182 (12%)

Query: 178 HPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR---------LFPWGN 222
           HPVV+V+WNDA A   W      +  RLPTEAEWEY CR G   R         L    N
Sbjct: 148 HPVVNVTWNDAQALAAWLSKSEGKRYRLPTEAEWEYACRAGTRTRYSAGDDPQSLLRTAN 207

Query: 223 NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
               R    A  W    P      DG+  TAPV S+  N +GLY+M GN WEW ADW   
Sbjct: 208 VFDAR---TARYWPRWAPMALKGDDGFAFTAPVGSFAPNAWGLYDMHGNAWEWVADWHGE 264

Query: 283 HHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            ++  +P  +P+GP  GT +V++GGS+     + +  R A R+ N PD+    +G R   
Sbjct: 265 EYYARSPVDDPQGPAEGTVRVRRGGSW---HTWPFYARAAYRNWNAPDTRYTLVGTRLVR 321

Query: 342 DK 343
           ++
Sbjct: 322 EQ 323



 Score = 72.8 bits (177), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 58/175 (33%), Positives = 78/175 (44%), Gaps = 45/175 (25%)

Query: 15  MVLLPGDTFRMGTNK---------PILIKD------GEFPSRNVTLD-AFYLDQHEVSNT 58
            V LP   F MG+++         P L K+       E P   V +  +F++ QHEV+  
Sbjct: 29  FVKLPAGEFMMGSDETPEALAAAYPELPKERFTKLGDEGPVHRVRITRSFWMGQHEVTVG 88

Query: 59  QFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
           QF+ FV A+GYV E+       +G    ++P  +    A           FEG D     
Sbjct: 89  QFRRFVEASGYVPESVADGTGGYGWRADYDPATTRRGDA-----------FEGRDVRYSW 137

Query: 114 RM-------HHPVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR 155
           R         HPVV+++WNDA A   W      +  RLPTEAEWEY CR G   R
Sbjct: 138 RNPGFAQSDEHPVVNVTWNDAQALAAWLSKSEGKRYRLPTEAEWEYACRAGTRTR 192


>gi|284114704|ref|ZP_06386680.1| protein of unknown function DUF323 [Candidatus Poribacteria sp.
           WGA-A3]
 gi|283829573|gb|EFC33916.1| protein of unknown function DUF323 [Candidatus Poribacteria sp.
           WGA-A3]
          Length = 433

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 66/176 (37%), Positives = 88/176 (50%), Gaps = 23/176 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           NHPVV+VSW  A+AY  W G RLPTEAEWE   RGGL ++ +PWGN++ P   + +    
Sbjct: 270 NHPVVYVSWYAAMAYADWAGKRLPTEAEWEKAARGGLVSQAYPWGNSIDPSKANYSGKIG 329

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGP 295
           G              T PV +Y EN +GLY+MVGN WEW  D +    +   S  NP   
Sbjct: 330 G--------------TTPVGNYPENAYGLYDMVGNAWEWCLDIYKTDFYSGSSRRNPIAG 375

Query: 296 TTGTDKVKK----GGSYLCNEQYCYRH----RCAARSQNTPDSSAGNLGFRCAADK 343
            + T+  +K      S +    Y Y      R A R + +PDS+     FRC   +
Sbjct: 376 ESVTELTEKYLTVKDSRVVRSGYWYNQSVYVRVADRYKLSPDSTHRGGSFRCVMSR 431



 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 57/158 (36%), Positives = 85/158 (53%), Gaps = 24/158 (15%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           ++ +M  +    F+MG+N  I     E P   V LD FY+D +EV+N Q+++FV A    
Sbjct: 183 QFTEMATISAGDFQMGSND-IEADHDEQPIHTVYLDTFYMDVYEVTNAQYKQFVEAN--- 238

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHD---MKRFEGLDSTIEHRMHHPVVHISWNDA 127
                            E ++ +I    HD   +K +EG D   + + +HPVV++SW  A
Sbjct: 239 ----------------PEWQKDRIPSAYHDGNYLKYWEGNDYP-QGQENHPVVYVSWYAA 281

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
           +AY  W G RLPTEAEWE   RGGL ++ +PWG+ + P
Sbjct: 282 MAYADWAGKRLPTEAEWEKAARGGLVSQAYPWGNSIDP 319


>gi|302039208|ref|YP_003799530.1| hypothetical protein NIDE3935 [Candidatus Nitrospira defluvii]
 gi|300607272|emb|CBK43605.1| conserved exported protein of unknown function, Sulfatase-modifying
           factor-like [Candidatus Nitrospira defluvii]
          Length = 310

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 97/334 (29%), Positives = 135/334 (40%), Gaps = 112/334 (33%)

Query: 15  MVLLPGDTFRMGTNKPILIKDG--EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG---- 68
           MV +PG  F MG++       G  E P   V +DAF +D++EVSN  +  FV A G    
Sbjct: 59  MVPVPGGPFLMGSDPKFDRAAGPQELPQHQVYVDAFSIDRYEVSNVNYLRFVLAAGAAWP 118

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
           +   A+ F D                                      HPV+ +SW +A 
Sbjct: 119 HYWRAQPFPDKMA----------------------------------KHPVIGVSWREAD 144

Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG-IDSTIEHRMNHPVVHVSWND 187
           AYC WRGARLPTEAEWE   R G + R+FPWG+   P G I S I H  +          
Sbjct: 145 AYCRWRGARLPTEAEWEKAAR-GEDGRMFPWGN--EPAGWIKSNIAHPGSK--------- 192

Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
                  RGA+ P  A  +             + N ++P G H+                
Sbjct: 193 -------RGAKYPPLANVDR------------YDNGVSPYGVHQ---------------- 217

Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGG 306
                               + GNV EW +DW++  ++    + NP+GP +G DKV +GG
Sbjct: 218 --------------------LAGNVSEWVSDWFDPEYYRRQENRNPQGPASGQDKVFRGG 257

Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           S+  + +     R A R+    D  +   GFRCA
Sbjct: 258 SWNEDPEVA---RSAGRNSGGLDHWSYLTGFRCA 288


>gi|389579892|ref|ZP_10169919.1| hypothetical protein DespoDRAFT_01834 [Desulfobacter postgatei
           2ac9]
 gi|389401527|gb|EIM63749.1| hypothetical protein DespoDRAFT_01834 [Desulfobacter postgatei
           2ac9]
          Length = 581

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 89/335 (26%), Positives = 138/335 (41%), Gaps = 84/335 (25%)

Query: 10  ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           +++   + +P  T+ +GT K +         +   +   Y+ ++ V+N  F+ F+  TGY
Sbjct: 326 KKFNAYITIPAGTYTIGTEKTL---KSSLALQPFDMPRVYIGKYPVTNALFEIFIEQTGY 382

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           VT AE+ G                I  V H   + +G               + W+    
Sbjct: 383 VTTAERKG----------------IGTVYHSRYKKQG-------------EKVIWSKQAG 413

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
               +GA           C             W  P G  ST+  + NHPVV +S +DA 
Sbjct: 414 SSIVKGA-----------C-------------WHQPLGPGSTLYGKRNHPVVQISVDDAF 449

Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
           AY +W G RLPTEAEWE   R  +  + +PWGN   P                N  + G 
Sbjct: 450 AYASWIGRRLPTEAEWEAAARTDMALK-YPWGNQFDPDAL-------------NIESSGL 495

Query: 250 LSTAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
             T+PV +Y    N FG+ +M+GNV EWTAD     + P P  + K        V KGG+
Sbjct: 496 ADTSPVDNYDHAANVFGMADMLGNVMEWTAD-----NQPPPFASSKAKFFN---VAKGGA 547

Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           +   +        ++R     +++A  +GFRC ++
Sbjct: 548 WNAKDTVT----ISSRGLFPFETTANIIGFRCISE 578


>gi|451980739|ref|ZP_21929125.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
 gi|451762075|emb|CCQ90364.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
          Length = 249

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 69/168 (41%), Positives = 91/168 (54%), Gaps = 20/168 (11%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           N P   V+W +A AYC   G RLPTEAEWE   R G +   + WG+      EH      
Sbjct: 95  NLPADQVTWLEASAYCKQVGKRLPTEAEWEKAARAGTQTAYY-WGD------EH-----S 142

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGP 295
           GE+      +D   +T PV   K N FGLY+M GNVWEW +DW++  ++  +P  NP+GP
Sbjct: 143 GEYAWFEDNSD--ETTHPVGQKKPNGFGLYDMSGNVWEWVSDWYDKTYYERSPPANPQGP 200

Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAA-RSQNTPDSSAGNLGFRCAAD 342
            TGT+K  +GGS+        RH+ +A R  + P     N GFRCA D
Sbjct: 201 ETGTEKTLRGGSWYSGA----RHQMSATRFWSEPHLRNSNFGFRCAMD 244



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 41/153 (26%), Positives = 63/153 (41%), Gaps = 47/153 (30%)

Query: 8   PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
           P++    MV +P  T+ MG          ++P + V +D F+LD+ EV+    +E+V+  
Sbjct: 33  PLQPPAGMVYIPPGTYTMGLEG----ASNKYPHK-VFVDGFFLDKFEVTQ---KEYVAIR 84

Query: 68  GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
           G                               +  +FEG +         P   ++W +A
Sbjct: 85  G------------------------------ANPSKFEGENL--------PADQVTWLEA 106

Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
            AYC   G RLPTEAEWE   R G +   + WG
Sbjct: 107 SAYCKQVGKRLPTEAEWEKAARAGTQTAYY-WG 138


>gi|308273234|emb|CBX29837.1| hypothetical protein N47_F15320 [uncultured Desulfobacterium sp.]
          Length = 665

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 96/332 (28%), Positives = 135/332 (40%), Gaps = 87/332 (26%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +   +L+PG  + +G+ +P   K  E   RN+ L AFY+ ++ V+N  F+ F   TGY T
Sbjct: 418 FNQYILIPGGNYIIGSRQP---KRNECLERNINLPAFYMGKYPVTNALFEVFAEKTGYKT 474

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            AEK G   V+                    R+E    T++ +    +  + +N    Y 
Sbjct: 475 TAEKRGYGKVY------------------FGRYE---KTLDGQT--GMGKLIFNATTTYK 511

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
           T  GA           C             W  P G  STI  + NHPVV VS+ DA+A+
Sbjct: 512 TIEGA-----------C-------------WYQPCGPKSTIHRKRNHPVVQVSFEDAMAF 547

Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
             W G RLPTE EWE   R   +   FPWG +         NV +               
Sbjct: 548 AAWTGKRLPTENEWEAASRTD-KGYSFPWGESFRKEA---CNVEESSI----------CG 593

Query: 252 TAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
           T+PV  Y+  EN FG+ + +GNV+EWT                         V KGG ++
Sbjct: 594 TSPVDLYREFENGFGIADTLGNVFEWTTSRLEKFM-----------------VVKGGCWI 636

Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
                    +   R Q  PD+ +  LGFRC A
Sbjct: 637 SGNDI----KLFTRFQIEPDNHSNILGFRCIA 664


>gi|311108477|ref|YP_003981330.1| hypothetical protein AXYL_05315 [Achromobacter xylosoxidans A8]
 gi|310763166|gb|ADP18615.1| hypothetical protein AXYL_05315 [Achromobacter xylosoxidans A8]
          Length = 330

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR---------LFPWG 221
           +HPVV++SWNDA A   W      +  RLPTEAEWEY CR G   R         +   G
Sbjct: 153 DHPVVNISWNDATALARWLTQTEGKVYRLPTEAEWEYACRAGTRTRYQNGDDPAAMPDIG 212

Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
           N            WQ      +  ADG+  TAPV SY  N FGLY+M GN WEW +DW+ 
Sbjct: 213 NGFDADAAPLWPKWQAH---ASARADGHAFTAPVASYAPNAFGLYDMHGNAWEWVSDWYG 269

Query: 282 VHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
             ++  +P  +P+GP +G  +V++GGS+     Y    R + R+ NT ++    +G R 
Sbjct: 270 EDYYAHSPVDDPQGPASGNVRVRRGGSWHTWALYT---RASYRNWNTQETRYPLVGLRL 325



 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 54/164 (32%), Positives = 79/164 (48%), Gaps = 23/164 (14%)

Query: 15  MVLLPGDTFRMGTNK---------PILIKD------GEFPSRNVTLD-AFYLDQHEVSNT 58
            V +P  TF+MG+++         P+   D       E P   V +  AFY+++ EV+  
Sbjct: 35  FVRVPAGTFQMGSDEAPEALAQAYPLYPADRYAQLSDEAPVHAVRITRAFYMERTEVTVG 94

Query: 59  QFQEFVSATGYVTEAEKFGDT-FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
           QF+ F+ A+GYV E+E  G   + + P    ++  +        +R+   +        H
Sbjct: 95  QFRRFIEASGYVPESEADGTGGYGYNPQYDADKSERGDAFEGRDRRYSWRNPGFAQGDDH 154

Query: 118 PVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR 155
           PVV+ISWNDA A   W      +  RLPTEAEWEY CR G   R
Sbjct: 155 PVVNISWNDATALARWLTQTEGKVYRLPTEAEWEYACRAGTRTR 198


>gi|330444630|ref|YP_004377616.1| s/t protein kinase [Chlamydophila pecorum E58]
 gi|328807740|gb|AEB41913.1| s/t protein kinase [Chlamydophila pecorum E58]
          Length = 617

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/169 (37%), Positives = 82/169 (48%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLP EAEWE   RGG     +P G  +        +  Q
Sbjct: 466 KHPVVGVTWYGAAGYAAWVGKRLPKEAEWEIAARGGGVRHRYPCGEEI--------DKTQ 517

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
             F +++T A        VMSY  N +GLY+M GNV+EW  DW+    +   +     PK
Sbjct: 518 ANFFSSDTTA--------VMSYPPNGYGLYDMAGNVYEWCQDWYAYDFYEIAAQEIETPK 569

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA +
Sbjct: 570 GPVQGVYRVLRGG---CWKSLKNDLRCAHRHRNNPGAVNSTYGFRCAQE 615



 Score = 68.2 bits (165), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 53/166 (31%), Positives = 72/166 (43%), Gaps = 24/166 (14%)

Query: 6   APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           A P     +MV + G  F  G+ +    +  E P   V L +F+LD H V+N QF  ++ 
Sbjct: 375 ARPQPLLTEMVFIEGGVFSRGSLEG---QRDEHPVHQVFLQSFFLDIHPVTNEQFIRYLD 431

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
             G   E +K  +  +         R K S++    +R  G          HPVV ++W 
Sbjct: 432 CCG---EQDKHYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWY 475

Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST 171
            A  Y  W G RLP EAEWE   RGG     +P G     E ID T
Sbjct: 476 GAAGYAAWVGKRLPKEAEWEIAARGGGVRHRYPCG-----EEIDKT 516


>gi|410028720|ref|ZP_11278556.1| hypothetical protein MaAK2_05930 [Marinilabilia sp. AK2]
          Length = 166

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 51/102 (50%), Positives = 70/102 (68%)

Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
           +HV++ DA AY  W G RLPTEAEWE+  +GG  ++ + WG    P G + AN +QG+FP
Sbjct: 1   MHVAYEDAEAYAAWAGKRLPTEAEWEFAAKGGRAHQRYAWGEEFRPNGVYMANTFQGKFP 60

Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
             N + DG+  TAPV S+  N FGLY+M+GNVWE T+DW++ 
Sbjct: 61  HQNESTDGFSGTAPVKSFSPNDFGLYDMIGNVWELTSDWYDA 102



 Score = 45.1 bits (105), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 22/49 (44%), Positives = 32/49 (65%)

Query: 120 VHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGI 168
           +H+++ DA AY  W G RLPTEAEWE+  +GG  ++ + WG    P G+
Sbjct: 1   MHVAYEDAEAYAAWAGKRLPTEAEWEFAAKGGRAHQRYAWGEEFRPNGV 49


>gi|451982162|ref|ZP_21930490.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
 gi|451760631|emb|CCQ91770.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
          Length = 296

 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 68/174 (39%), Positives = 91/174 (52%), Gaps = 22/174 (12%)

Query: 173 EHRM-NHP---VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRG 228
           EH +  HP    V VSW DAV YCTW G RL TEAEWE   R G    ++PWGN + PR 
Sbjct: 132 EHELLKHPNVAQVGVSWYDAVNYCTWAGKRLLTEAEWEKAAR-GTHGLVYPWGNEMLPR- 189

Query: 229 EHRANVWQGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWN-VHHH 285
             RAN+        +  AD Y   AP+ S+   +  +G+Y+M GNV EW  DW++  ++ 
Sbjct: 190 --RANI--------HGTADDYQYLAPIGSFPMGRSVYGVYDMAGNVSEWVEDWYDQFYYQ 239

Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
            AP  NP G     ++V +GGS+   +      R + R   TP      +GFRC
Sbjct: 240 EAPMMNPTGAENKKNRVFRGGSWDSTKVDV---RASKRYAATPGRKDSVVGFRC 290



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 74/156 (47%), Gaps = 38/156 (24%)

Query: 13  KDMVLLPGDTFRMGTNKPILIKDGEF---PSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
           ++MVL+P   F  G+       +G+F   P + V LDAFY+D++EVS   + +F  A  Y
Sbjct: 68  REMVLVPEGVFSRGS------LEGDFDEKPPQEVYLDAFYVDKYEVSVEAYNKFRKAANY 121

Query: 70  VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
           V  +  F                           FEG    ++H  +   V +SW DAV 
Sbjct: 122 VEPSVPF---------------------------FEGEHELLKHP-NVAQVGVSWYDAVN 153

Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
           YCTW G RL TEAEWE   R G    ++PWG+ + P
Sbjct: 154 YCTWAGKRLLTEAEWEKAAR-GTHGLVYPWGNEMLP 188


>gi|15618072|ref|NP_224356.1| S/T protein kinase [Chlamydophila pneumoniae CWL029]
 gi|15835683|ref|NP_300207.1| S/T protein kinase [Chlamydophila pneumoniae J138]
 gi|55667873|sp|Q7AJA5.1|PKN1_CHLPN RecName: Full=Serine/threonine-protein kinase pkn1
 gi|4376414|gb|AAD18301.1| S/T Protein Kinase [Chlamydophila pneumoniae CWL029]
 gi|8978521|dbj|BAA98358.1| S/T protein kinase [Chlamydophila pneumoniae J138]
          Length = 619

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+    +P G  +    + RAN + 
Sbjct: 468 KHPVVGVTWYGASGYAEWIGKRLPTEAEWEIAASGGVAALRYPCGEEIE---KSRANFFT 524

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
            +             T  VMSY  N +GLY+M GNV+EW  DW+    +   +    +P+
Sbjct: 525 AD-------------TTTVMSYPPNPYGLYDMAGNVYEWCQDWYGYDFYEISAQEPESPQ 571

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA +
Sbjct: 572 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCAKN 617



 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MVL+ G  F  G+ +    +  E P   V L +F+LD H V+N QF  ++   G  +E 
Sbjct: 384 EMVLIEGGEFSRGSVEG---QRDELPVHKVILHSFFLDVHPVTNEQFIRYLECCG--SEQ 438

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           +K+ +  +         R + S+++    R       IE     HPVV ++W  A  Y  
Sbjct: 439 DKYYNELI---------RLRDSRIQRRSGRL-----VIEPGYAKHPVVGVTWYGASGYAE 484

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLPTEAEWE    GG+    +P G
Sbjct: 485 WIGKRLPTEAEWEIAASGGVAALRYPCG 512


>gi|16752896|ref|NP_445167.1| hypothetical protein CP0625 [Chlamydophila pneumoniae AR39]
 gi|7189538|gb|AAF38440.1| conserved hypothetical protein [Chlamydophila pneumoniae AR39]
          Length = 619

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+    +P G  +    + RAN + 
Sbjct: 468 KHPVVGVTWYGASGYAEWIGKRLPTEAEWEIAASGGVAALRYPCGEEIE---KSRANFFT 524

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
            +             T  VMSY  N +GLY+M GNV+EW  DW+    +   +    +P+
Sbjct: 525 AD-------------TTTVMSYPPNPYGLYDMAGNVYEWCQDWYGYDFYEISAQEPESPQ 571

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA +
Sbjct: 572 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCAKN 617



 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MVL+ G  F  G+ +    +  E P   V L +F+LD H V+N QF  ++   G  +E 
Sbjct: 384 EMVLIEGGEFSRGSVEG---QRDELPVHKVILHSFFLDVHPVTNEQFIRYLECCG--SEQ 438

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           +K+ +  +         R + S+++    R       IE     HPVV ++W  A  Y  
Sbjct: 439 DKYYNELI---------RLRDSRIQRRSGRL-----VIEPGYAKHPVVGVTWYGASGYAE 484

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLPTEAEWE    GG+    +P G
Sbjct: 485 WIGKRLPTEAEWEIAASGGVAALRYPCG 512


>gi|295135973|ref|YP_003586649.1| gliding motility protein GldK [Zunongwangia profunda SM-A87]
 gi|294983988|gb|ADF54453.1| gliding motility protein GldK [Zunongwangia profunda SM-A87]
          Length = 456

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 110/412 (26%), Positives = 165/412 (40%), Gaps = 95/412 (23%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG------ 68
           MVL+PG +F MG +   +      P + VT+++FY+D+ E++N ++++FV+         
Sbjct: 41  MVLVPGGSFIMGKSDDDIAGQLNAPPKTVTVNSFYMDETEITNAEYRQFVNYVKDSVVRV 100

Query: 69  -YVTEAEKFGDT----FVFEPLLSEEERAKISQVRHDMKRFEGLDSTI---EHRMHHPVV 120
               EAE  G T     + E   ++ + + ++  +  M    G  S +   E+R  +  V
Sbjct: 101 MLAMEAELEGATPGASGIGEYAFADADESNLTPYQEYMLETYGTGSEMDAYENRKLNKDV 160

Query: 121 HISWNDA----VAYC-TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHR 175
            I W+      VAY        +P E  +       ++N  F + SW+  E        R
Sbjct: 161 DIIWDTEDYPDVAYARVMDQLYIPQEDAYNGTRTIDVKNLKFKY-SWMDVEAAVKGKGER 219

Query: 176 MNH----------------------------------------PVVHVSWNDAVAYCTWR 195
            N+                                        PVV V+W  A A+  WR
Sbjct: 220 RNYMKTEVVEVYPDTAVWIKDFNYSYNEPMHNDYFWHSAFDDYPVVGVTWKQARAFAQWR 279

Query: 196 GA-----------------RLPTEAEWEYGCRGGLENRLFPWGNNLTP--RGEHRANV-- 234
                              RLPTEAEWEY  RGGLE   +PWG   T   RG   AN   
Sbjct: 280 TKYNNDFRKSRGDSNVPHYRLPTEAEWEYAARGGLEGGTYPWGGPYTKNDRGCFMANFKP 339

Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKG 294
            +G++     AAD  L T    S+  N +GLYNM GNV EW    ++   +   S     
Sbjct: 340 LRGDY-----AADQALYTVEAKSFDPNGYGLYNMAGNVAEWVNSSYDAASYDYMS--SMN 392

Query: 295 PTTGT----DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           PT        KV +GGS+   +   Y  + ++R     DS+   +GFR   D
Sbjct: 393 PTVNNYEDPRKVVRGGSW---KDVAYFLQVSSRDYEYADSARSYIGFRTVQD 441


>gi|332661980|ref|YP_004451449.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332337477|gb|AEE54576.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 655

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 93/191 (48%), Gaps = 42/191 (21%)

Query: 175 RMNHPVVHVSWNDAVAYCTWR------------------------GARLPTEAEWEYGCR 210
           R   P ++V+W DAV YC WR                        G RLPTEAEWEY  R
Sbjct: 476 RGKRPAIYVNWFDAVDYCNWRSQQEGLSQVYQVNKQQVNPNWQANGYRLPTEAEWEYAAR 535

Query: 211 GGLENRLFPW-GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMV 269
           GGL ++ F + G+N         NV +  +   N+       T PV   K N+ G+Y++ 
Sbjct: 536 GGLSSQGFTYAGSN---------NVDEVAWYDKNSGN----KTQPVGQKKANELGIYDLS 582

Query: 270 GNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
           GNVWEW  DW+  +   A + +P GP TG+D+V +G S   +       R A RS NTPD
Sbjct: 583 GNVWEWCWDWYGAYPSSATN-DPNGPNTGSDRVIRGSSRYSDPAVV---RVAIRSYNTPD 638

Query: 330 SSAGNLGFRCA 340
               ++GFR A
Sbjct: 639 GRGDDIGFRLA 649


>gi|384449585|ref|YP_005662187.1| hypothetical protein CPK_ORF00662 [Chlamydophila pneumoniae LPCoLN]
 gi|269303031|gb|ACZ33131.1| conserved hypothetical protein [Chlamydophila pneumoniae LPCoLN]
          Length = 619

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+    +P G  +    + RAN + 
Sbjct: 468 KHPVVGVTWYGASGYAEWIGKRLPTEAEWEIAASGGVAALRYPCGEEIE---KSRANFFT 524

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
            +             T  VMSY  N +GLY+M GNV+EW  DW+    +   +    +P+
Sbjct: 525 AD-------------TTTVMSYPPNPYGLYDMAGNVYEWCQDWYGYDFYEISAQEPESPQ 571

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA +
Sbjct: 572 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCAKN 617



 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MVL+ G  F  G+ +    +  E P   V L +F+LD H V+N QF  ++   G  +E 
Sbjct: 384 EMVLIEGGEFSRGSVEG---QRDELPVHKVILHSFFLDVHPVTNEQFIRYLECCG--SEQ 438

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           +K+ +  +         R + S+++    R       IE     HPVV ++W  A  Y  
Sbjct: 439 DKYYNELI---------RLRDSRIQRRSGRL-----VIEPGYAKHPVVGVTWYGASGYAE 484

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLPTEAEWE    GG+    +P G
Sbjct: 485 WIGKRLPTEAEWEIAASGGVAALRYPCG 512


>gi|91204532|emb|CAJ70760.1| similar to sulfatase modifying factor 1 (
           C-alpha-formylglycine-generating enzyme) [Candidatus
           Kuenenia stuttgartiensis]
          Length = 312

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 81/290 (27%), Positives = 131/290 (45%), Gaps = 58/290 (20%)

Query: 82  FEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTE 141
            E LL +++  +  ++ +++ + E   S   +   +PV +  ++  +A   ++     +E
Sbjct: 46  IEKLLEQDKSIETYRLNNEIPQREIFLSAY-YIDKYPVTNAQFSQFIAANGYKKKLYWSE 104

Query: 142 AEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPT 201
           A W+Y           P GS      ID+ ++   + P V++SW  A A+  W G RLPT
Sbjct: 105 AGWQYVLD------FNPSGS----NDIDAILQGDKDCPAVNISWYAAEAFANWAGKRLPT 154

Query: 202 EAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKEN 261
           EAEWE   RG  + R++PWGN+     + R N  + +             + PV  Y + 
Sbjct: 155 EAEWEKAARGA-DGRIYPWGNDFD---KTRLNCAEAKIE----------KSTPVTQYPQG 200

Query: 262 K--FGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGP----------------------- 295
           +  +G ++M GNVWEW  DW++  ++H AP  NP+GP                       
Sbjct: 201 QSVYGCFDMAGNVWEWIKDWYDSQYYHNAPDKNPQGPDVPEEKPYSGRPEEVGVSIYELK 260

Query: 296 -----TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                T  T KV +GGS+  N       RCA R  + P   +  +GFRCA
Sbjct: 261 PSLSKTLTTCKVIRGGSW--NGSGIVHIRCANRDYDEPSFKSDIIGFRCA 308



 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 55/167 (32%), Positives = 85/167 (50%), Gaps = 35/167 (20%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIK-------------DGEFPSRNVTLDAFYLDQHEVSNT 58
           ++DM+L+P  TF MG+ K  + K             + E P R + L A+Y+D++ V+N 
Sbjct: 26  FEDMILIPEGTFLMGSTKEDIEKLLEQDKSIETYRLNNEIPQREIFLSAYYIDKYPVTNA 85

Query: 59  QFQEFVSATGYVTEA--EKFGDTFV--FEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
           QF +F++A GY  +    + G  +V  F P  S +                 +D+ ++  
Sbjct: 86  QFSQFIAANGYKKKLYWSEAGWQYVLDFNPSGSND-----------------IDAILQGD 128

Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
              P V+ISW  A A+  W G RLPTEAEWE   RG  + R++PWG+
Sbjct: 129 KDCPAVNISWYAAEAFANWAGKRLPTEAEWEKAARGA-DGRIYPWGN 174


>gi|33241484|ref|NP_876425.1| protein kinase [Chlamydophila pneumoniae TW-183]
 gi|33235992|gb|AAP98082.1| protein kinase [Chlamydophila pneumoniae TW-183]
          Length = 619

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A  Y  W G RLPTEAEWE    GG+    +P G  +    + RAN + 
Sbjct: 468 KHPVVGVTWYGASGYAEWIGKRLPTEAEWEIAASGGVAALRYPCGEEVE---KSRANFFT 524

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
            +             T  VMSY  N +GLY+M GNV+EW  DW+    +   +    +P+
Sbjct: 525 AD-------------TTTVMSYPPNPYGLYDMAGNVYEWCQDWYGYDFYEISAQEPESPQ 571

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA +
Sbjct: 572 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCAKN 617



 Score = 72.4 bits (176), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MVL+ G  F  G+ +    +  E P   V L +F+LD H V+N QF  ++   G  +E 
Sbjct: 384 EMVLIEGGEFSRGSVEG---QRDELPVHKVILHSFFLDVHPVTNEQFIRYLECCG--SEQ 438

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           +K+ +  +         R + S+++    R       IE     HPVV ++W  A  Y  
Sbjct: 439 DKYYNELI---------RLRDSRIQRRSGRL-----VIEPGYAKHPVVGVTWYGASGYAE 484

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLPTEAEWE    GG+    +P G
Sbjct: 485 WIGKRLPTEAEWEIAASGGVAALRYPCG 512


>gi|194336145|ref|YP_002017939.1| hypothetical protein Ppha_1040 [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194308622|gb|ACF43322.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
           BU-1]
          Length = 517

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 94/337 (27%), Positives = 140/337 (41%), Gaps = 103/337 (30%)

Query: 14  DMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           + V + G TF MG+        D E P   V L AFY+ ++EV+ ++F++F+ AT Y+T+
Sbjct: 275 NFVPIRGGTFFMGSPGSEDEAYDDEGPQHLVRLSAFYMSRYEVTVSEFRKFIEATEYLTD 334

Query: 73  AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
           AE+                    + + D K          +  +HPV+++SWNDA++YC 
Sbjct: 335 AEQV-------------------RSKRDWKCGVLGSRRPANEDNHPVIYVSWNDAISYCK 375

Query: 133 W------RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
           W      +  RLP E EWEY CR G                       R + P  +  +N
Sbjct: 376 WLSNKTGKTFRLPKEEEWEYACRAG----------------------SRTSTP-FNTGYN 412

Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
              +   + G                     FP+ +N   +GE R N             
Sbjct: 413 LTTSQANYNGH--------------------FPYDHN--KKGEFRKN------------- 437

Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT-DKVKKG 305
                T PV S+  N +GLYNM GNVWEW  +W          Y P G +  T  +V +G
Sbjct: 438 -----TVPVNSFAPNNWGLYNMHGNVWEWCENW----------YGPYGSSDDTFQRVIRG 482

Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           G   C +Q+  R R A R    P++S  ++GFR   +
Sbjct: 483 G---CWDQHAGRCRSANRGYYPPENSLNHVGFRVVCE 516


>gi|406987727|gb|EKE07977.1| hypothetical protein ACD_17C00415G0002 [uncultured bacterium]
          Length = 594

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/170 (38%), Positives = 88/170 (51%), Gaps = 25/170 (14%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  AVAY  W G RLPTE EWE  C  G E  ++P G+++           Q
Sbjct: 444 KHPVVGVTWYGAVAYAKWIGKRLPTEVEWEAACSVGKEG-IYPTGSDIERS--------Q 494

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----PSYNP 292
             F +++T A        VMSY  N  GLY+MVGNV+EW  DW+  +++      P+Y P
Sbjct: 495 ANFFSSDTTA--------VMSYPPNLLGLYDMVGNVYEWCQDWYAYNYYDTSVVEPNY-P 545

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            GP  G  +V +GG   C +      R + R +N P +     GFRCA D
Sbjct: 546 MGPPQGVYRVLRGG---CWKSLKEDLRSSHRHRNNPGAVNSTYGFRCATD 592



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/149 (36%), Positives = 72/149 (48%), Gaps = 21/149 (14%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +MV++PG T+  G+ +       E P  +V L +F LD H V+N QF  F+ A G     
Sbjct: 360 EMVVVPGGTYLRGSAEG---ARDEMPRHSVKLRSFALDIHPVTNEQFVRFLEAMG----G 412

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH-RMHHPVVHISWNDAVAYCT 132
           EK            +     I ++R    +  G   TIE     HPVV ++W  AVAY  
Sbjct: 413 EK------------DHNNNDIIRLRDSRIKRSGGKLTIESGYAKHPVVGVTWYGAVAYAK 460

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGS 161
           W G RLPTE EWE  C  G E  ++P GS
Sbjct: 461 WIGKRLPTEVEWEAACSVGKEG-IYPTGS 488


>gi|116624323|ref|YP_826479.1| hypothetical protein Acid_5241 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116227485|gb|ABJ86194.1| protein of unknown function DUF323 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 226

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/172 (38%), Positives = 89/172 (51%), Gaps = 28/172 (16%)

Query: 178 HPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
            PVV V+W DA A+C W+G       RLPTEAEWE+  RGG   RL+PWGN L       
Sbjct: 73  QPVVAVNWFDAAAFCHWQGEQWNLHLRLPTEAEWEFAARGGFAQRLYPWGNELP------ 126

Query: 232 ANVWQGEFPTNNTAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APS 289
                        AAD + +   PV     N FGL+++  NV EW ADW++  ++  +P+
Sbjct: 127 -----------VIAADRWRNGPEPVGLGDPNGFGLFDLCQNVHEWCADWYDPGYYAVSPT 175

Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
            NP GP  G  +  +GG++     +    RCAARS   P+    + GFR AA
Sbjct: 176 ENPHGPEHGKRRASRGGAW---RHHIKVSRCAARSSIPPEFRYSDYGFRVAA 224



 Score = 71.6 bits (174), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 49/156 (31%), Positives = 71/156 (45%), Gaps = 41/156 (26%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D++ +PG  F MG       +D E P   V L  F L +++V+N  +  F  AT  + + 
Sbjct: 5   DLLPVPGGEFLMGQAD---GRDEERPPHRVALTPFRLCRYQVTNADYDHFRKAT--LRDK 59

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
            ++ D     P  S+  +                          PVV ++W DA A+C W
Sbjct: 60  SEYRD----RPEFSDPAQ--------------------------PVVAVNWFDAAAFCHW 89

Query: 134 RGA------RLPTEAEWEYGCRGGLENRLFPWGSWL 163
           +G       RLPTEAEWE+  RGG   RL+PWG+ L
Sbjct: 90  QGEQWNLHLRLPTEAEWEFAARGGFAQRLYPWGNEL 125


>gi|384097854|ref|ZP_09998974.1| gliding motility membrane lipoprotein [Imtechella halotolerans K1]
 gi|383836736|gb|EID76143.1| gliding motility membrane lipoprotein [Imtechella halotolerans K1]
          Length = 460

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 111/417 (26%), Positives = 170/417 (40%), Gaps = 91/417 (21%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ---FQEFVSATGYVT 71
           M L+PG +F MG     + +    P++  T+ +FY+D+ E++N+Q   F E+V  +   T
Sbjct: 41  MTLVPGGSFIMGKPDDDVAQLYNAPTKTATVRSFYMDETEITNSQYRQFVEWVKDSVVRT 100

Query: 72  EAEKFGDTFVFEP---------LLSEEERAKISQVRHDMKRFEGLDST-IEHRMHHPVVH 121
                 D     P          L  +        ++    + GL  T  E R  +  + 
Sbjct: 101 RLAILADEMQLTPDDGGIGEYAFLDADTTNMTEYEKYMFNNYVGLGPTGYEGRKLNKKIK 160

Query: 122 ISWNDA------------VAYC----TWRGAR------LPTEAEWEYGCRGGLENRLFPW 159
           I W+ +              Y     T+ G R      L  + +W       ++++    
Sbjct: 161 IEWSTSKYPDQYYAEVMDSMYIPVDQTYNGRRTIDVNKLVFKYQWR-DINAAVQSKTRKD 219

Query: 160 GSWLHPEGI----DSTI-----EHRMN---------------HPVVHVSWNDAVAYCTWR 195
            ++L  E +    D+T+     E+  N               +PVV V+WN A A+C WR
Sbjct: 220 KNFLRDEELKIYPDTTVWIRDFEYSYNEPMHNDYFWHQAYGEYPVVGVTWNQARAFCEWR 279

Query: 196 -----------------GARLPTEAEWEYGCRGGLENRLFPWGN--NLTPRGEHRANV-- 234
                              RLPTEAEWEY  RGGLE+  +PWG    L  RG   AN   
Sbjct: 280 TLLKNGYQKSRGLQPVNNFRLPTEAEWEYAARGGLESATYPWGGPYILNDRGCFMANFKP 339

Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPK 293
            +G++     AAD  L T    SY  N + LYNM GNV EWT   ++ + +    + NP 
Sbjct: 340 LRGDY-----AADEALYTVEAKSYDPNDYNLYNMAGNVAEWTNSSYDPNSYEYVSTMNPN 394

Query: 294 G-PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
              T    KV +GGS+   +   Y  + + R+    D++   +GFR   D   T  T
Sbjct: 395 AYDTDNKRKVIRGGSW---KDVAYFLQVSTRAFEYQDTARSYIGFRTVQDYMGTEAT 448


>gi|332666261|ref|YP_004449049.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332335075|gb|AEE52176.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 958

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/188 (35%), Positives = 89/188 (47%), Gaps = 40/188 (21%)

Query: 177 NHPVVHVSWNDAVAYCTW------------------------RGARLPTEAEWEYGCRGG 212
           N+P V++SW DA  YC W                        +G RLPTEAEWEY  RGG
Sbjct: 784 NNPAVYLSWYDATDYCNWVSTQMGYPNVYARQGDEVSINWNAKGFRLPTEAEWEYAARGG 843

Query: 213 LENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNV 272
            + + F +  +     E     W  E    N+ +     T PV   K N  GL++M GNV
Sbjct: 844 AQQQNFEYSGS----NEIEEVAWYAE----NSGS----RTRPVGIKKANSLGLFDMSGNV 891

Query: 273 WEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 332
           WEW  DW+   + PA   +P+GP  G+ +V++GGS+     Y +  R A R   TPD   
Sbjct: 892 WEWCWDWYG-DYDPAAKADPRGPNKGSLRVRRGGSWFY---YPFSARVADRYDVTPDRRD 947

Query: 333 GNLGFRCA 340
             +GFR A
Sbjct: 948 DGIGFRLA 955



 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 24/68 (35%)

Query: 116 HHPVVHISWNDAVAYCTW------------------------RGARLPTEAEWEYGCRGG 151
           ++P V++SW DA  YC W                        +G RLPTEAEWEY  RGG
Sbjct: 784 NNPAVYLSWYDATDYCNWVSTQMGYPNVYARQGDEVSINWNAKGFRLPTEAEWEYAARGG 843

Query: 152 LENRLFPW 159
            + + F +
Sbjct: 844 AQQQNFEY 851


>gi|386815297|ref|ZP_10102515.1| Sulphatase-modifying factor protein [Thiothrix nivea DSM 5205]
 gi|386419873|gb|EIJ33708.1| Sulphatase-modifying factor protein [Thiothrix nivea DSM 5205]
          Length = 367

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/174 (39%), Positives = 88/174 (50%), Gaps = 21/174 (12%)

Query: 175 RMNHPVVHVSWNDAV-AYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR 227
           R N PV++VSW D    Y  W      R  RLPTEAEWEY  RGG  +  +PWGN ++  
Sbjct: 199 RGNRPVINVSWEDITQKYIPWLNQQTGRHYRLPTEAEWEYAARGG-SDTAYPWGNTISC- 256

Query: 228 GEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHP 286
                N   G +          + T PV SY  N +GLY+  GNVWEW  DW+   ++  
Sbjct: 257 ----GNANYGSYSNQCET----VRTKPVGSYTANGYGLYDTAGNVWEWCQDWYAGDYYKS 308

Query: 287 APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           +P   P GP  G  +V +GGS+  + QY    R A RS +TP      LGFR A
Sbjct: 309 SPVSAPHGPGAGAARVLRGGSWYNDAQYV---RSADRSHDTPVPRINLLGFRLA 359



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 45/160 (28%), Positives = 66/160 (41%), Gaps = 48/160 (30%)

Query: 14  DMVLLPGDTFRMG-----TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
           +M L+P   F MG      +     +D E P+  V+++AFY+ ++EV+  Q+        
Sbjct: 129 EMQLIPAGKFMMGCVEGRDDVEGGCEDDEKPAHEVSINAFYMGKYEVTFDQWDAC----- 183

Query: 69  YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
                                E+AK      D     G         + PV+++SW D  
Sbjct: 184 ---------------------EQAKACPHAGDSGWGRG---------NRPVINVSWEDIT 213

Query: 129 -AYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGS 161
             Y  W      R  RLPTEAEWEY  RGG  +  +PWG+
Sbjct: 214 QKYIPWLNQQTGRHYRLPTEAEWEYAARGG-SDTAYPWGN 252


>gi|291278721|ref|YP_003495556.1| hypothetical protein DEFDS_0296 [Deferribacter desulfuricans SSM1]
 gi|290753423|dbj|BAI79800.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1]
          Length = 280

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 72/185 (38%), Positives = 99/185 (53%), Gaps = 24/185 (12%)

Query: 178 HPVVHVSWNDAVAYCTWRGAR------LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
            PVV++SW DA  +  W   R      LPTEAEWEY  R G   + F WG+N++   E+ 
Sbjct: 100 QPVVNISWEDAKNFTKWLSKRNDIIFRLPTEAEWEYTARAG-SRKDFYWGDNVSVICEY- 157

Query: 232 ANV--------WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NV 282
           ANV        ++     N    DG+  TAPV S+K NKFGLY+M+GNVWEWT D + + 
Sbjct: 158 ANVHDTTSKKTFKNFTWDNFQCNDGFAVTAPVGSFKPNKFGLYDMLGNVWEWTEDVYAHD 217

Query: 283 HHHPAPSYNPKGPTTGTDK----VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
           ++  +P  NPKGP    +K    V +GG + C        R A+R+ +  DS +  +GFR
Sbjct: 218 YYLKSPLRNPKGPKPELNKENKRVIRGGGWPCKPSLV---RLASRNWDYQDSISVRIGFR 274

Query: 339 CAADK 343
               K
Sbjct: 275 IVMIK 279


>gi|385239656|ref|YP_005807498.1| serine-threonine-protein kinase [Chlamydia trachomatis G/9768]
 gi|385240576|ref|YP_005808417.1| serine-threonine-protein kinase [Chlamydia trachomatis G/11222]
 gi|385242432|ref|YP_005810271.1| serine-threonine-protein kinase [Chlamydia trachomatis G/9301]
 gi|385246042|ref|YP_005814864.1| serine-threonine-protein kinase [Chlamydia trachomatis G/11074]
 gi|296435661|gb|ADH17835.1| serine-threonine-protein kinase [Chlamydia trachomatis G/9768]
 gi|296436584|gb|ADH18754.1| serine-threonine-protein kinase [Chlamydia trachomatis G/11222]
 gi|296437521|gb|ADH19682.1| serine-threonine-protein kinase [Chlamydia trachomatis G/11074]
 gi|297140020|gb|ADH96778.1| serine-threonine-protein kinase [Chlamydia trachomatis G/9301]
          Length = 614

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPIGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPIG 507


>gi|237802574|ref|YP_002887768.1| serine-threonine-protein kinase [Chlamydia trachomatis B/Jali20/OT]
 gi|231273808|emb|CAX10592.1| serine-threonine-protein kinase [Chlamydia trachomatis B/Jali20/OT]
          Length = 614

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPIGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.9 bits (172), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPIG 507


>gi|302039307|ref|YP_003799629.1| hypothetical protein NIDE4035 [Candidatus Nitrospira defluvii]
 gi|300607371|emb|CBK43704.1| conserved exported protein of unknown function, Sulfatase-modifying
           factor-like [Candidatus Nitrospira defluvii]
          Length = 306

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 65/173 (37%), Positives = 95/173 (54%), Gaps = 17/173 (9%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
             PVV V+W DA  YC W G RLPTEAEWE   R G E R++PWG+    RG   A    
Sbjct: 144 KRPVVGVNWYDARDYCRWAGKRLPTEAEWEIAAR-GTEGRIYPWGSAHPTRGHANA---- 198

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKF--GLYNMVGNVWEWTADWWN-VHHHPAPSYNPK 293
                  T   GY + + V  ++  K   G+Y++ GN+WEW ADW++  ++  +   NPK
Sbjct: 199 -----GETRWRGYDTLSNVGRFELGKTPEGVYDLSGNLWEWVADWYDPTYYQFSVRDNPK 253

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
           GP+ G  +  +GG++  + +     R + R+   PD+   ++GFRCA D GP+
Sbjct: 254 GPSAGPLRALRGGAWNNDSKAI---RSSNRAGYAPDARRNDVGFRCAQD-GPS 302



 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 53/155 (34%), Positives = 73/155 (47%), Gaps = 36/155 (23%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           R   MVL+P   F MG+++    ++ E P   V++ AFYLD +E +   + EF       
Sbjct: 73  RGTPMVLIPAGEFAMGSDR---GQEDEQPVHRVSVKAFYLDVYETTVAHYAEF------- 122

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                         L S++  A           F+  ++T       PVV ++W DA  Y
Sbjct: 123 --------------LASQKPDAP----------FKWSEATAGTHDKRPVVGVNWYDARDY 158

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
           C W G RLPTEAEWE   R G E R++PWGS  HP
Sbjct: 159 CRWAGKRLPTEAEWEIAAR-GTEGRIYPWGS-AHP 191


>gi|27465032|gb|AAN71625.1| serine/threonine kinase [Chlamydia trachomatis]
          Length = 614

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507


>gi|440525059|emb|CCP50310.1| putative ATPase [Chlamydia trachomatis K/SotonK1]
          Length = 614

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507


>gi|76788860|ref|YP_327946.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
           A/HAR-13]
 gi|255348507|ref|ZP_05380514.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis 70]
 gi|255503048|ref|ZP_05381438.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis 70s]
 gi|255506720|ref|ZP_05382359.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
           D(s)2923]
 gi|385241509|ref|YP_005809349.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
           E/11023]
 gi|385245118|ref|YP_005813941.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis E/150]
 gi|389857840|ref|YP_006360082.1| serine-threonine-protein kinase [Chlamydia trachomatis F/SW4]
 gi|389858714|ref|YP_006360955.1| serine-threonine-protein kinase [Chlamydia trachomatis E/SW3]
 gi|389859591|ref|YP_006361831.1| serine-threonine-protein kinase [Chlamydia trachomatis F/SW5]
 gi|76167390|gb|AAX50398.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
           A/HAR-13]
 gi|296434734|gb|ADH16912.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis E/150]
 gi|296438452|gb|ADH20605.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
           E/11023]
 gi|380248911|emb|CCE14199.1| serine-threonine-protein kinase [Chlamydia trachomatis F/SW5]
 gi|380249787|emb|CCE13310.1| serine-threonine-protein kinase [Chlamydia trachomatis F/SW4]
 gi|380250663|emb|CCE12421.1| serine-threonine-protein kinase [Chlamydia trachomatis E/SW3]
 gi|440526841|emb|CCP52325.1| putative ATPase [Chlamydia trachomatis D/SotonD1]
 gi|440529517|emb|CCP55001.1| putative ATPase [Chlamydia trachomatis E/SotonE4]
 gi|440530410|emb|CCP55894.1| putative ATPase [Chlamydia trachomatis E/SotonE8]
          Length = 614

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507


>gi|237804492|ref|YP_002888646.1| serine-threonine-protein kinase [Chlamydia trachomatis
           B/TZ1A828/OT]
 gi|376282151|ref|YP_005155977.1| serine-threonine-protein kinase [Chlamydia trachomatis A2497]
 gi|385269804|ref|YP_005812964.1| NIRV family protein [Chlamydia trachomatis A2497]
 gi|231272792|emb|CAX09698.1| serine-threonine-protein kinase [Chlamydia trachomatis
           B/TZ1A828/OT]
 gi|347974944|gb|AEP34965.1| Serine/threonine protein kinase [Chlamydia trachomatis A2497]
 gi|371908181|emb|CAX08804.1| serine-threonine-protein kinase [Chlamydia trachomatis A2497]
 gi|438690063|emb|CCP49320.1| putative ATPase [Chlamydia trachomatis A/7249]
 gi|438691147|emb|CCP48421.1| putative ATPase [Chlamydia trachomatis A/5291]
 gi|438692520|emb|CCP47522.1| putative ATPase [Chlamydia trachomatis A/363]
 gi|440534882|emb|CCP60392.1| putative ATPase [Chlamydia trachomatis E/Bour]
          Length = 614

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507


>gi|255310945|ref|ZP_05353515.1| serine-threonine-protein kinase [Chlamydia trachomatis 6276]
 gi|255317246|ref|ZP_05358492.1| serine-threonine-protein kinase [Chlamydia trachomatis 6276s]
 gi|440533091|emb|CCP58601.1| putative ATPase [Chlamydia trachomatis Ia/SotonIa1]
 gi|440533985|emb|CCP59495.1| putative ATPase [Chlamydia trachomatis Ia/SotonIa3]
          Length = 614

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLEYVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507


>gi|166154366|ref|YP_001654484.1| serine-threonine-protein kinase [Chlamydia trachomatis 434/Bu]
 gi|166155241|ref|YP_001653496.1| serine-threonine-protein kinase [Chlamydia trachomatis
           L2b/UCH-1/proctitis]
 gi|301335619|ref|ZP_07223863.1| serine-threonine-protein kinase [Chlamydia trachomatis L2tet1]
 gi|339625799|ref|YP_004717278.1| protein tyrosine kinase family protein [Chlamydia trachomatis L2c]
 gi|165930354|emb|CAP03840.1| serine-threonine-protein kinase [Chlamydia trachomatis 434/Bu]
 gi|165931229|emb|CAP06794.1| serine-threonine-protein kinase [Chlamydia trachomatis
           L2b/UCH-1/proctitis]
 gi|339460555|gb|AEJ77058.1| protein tyrosine kinase family protein [Chlamydia trachomatis L2c]
 gi|440525951|emb|CCP51435.1| putative ATPase [Chlamydia trachomatis L2b/8200/07]
 gi|440535775|emb|CCP61288.1| putative ATPase [Chlamydia trachomatis L2b/795]
 gi|440536665|emb|CCP62179.1| putative ATPase [Chlamydia trachomatis L1/440/LN]
 gi|440537557|emb|CCP63071.1| putative ATPase [Chlamydia trachomatis L1/1322/p2]
 gi|440538447|emb|CCP63961.1| putative ATPase [Chlamydia trachomatis L1/115]
 gi|440539336|emb|CCP64850.1| putative ATPase [Chlamydia trachomatis L1/224]
 gi|440540226|emb|CCP65740.1| putative ATPase [Chlamydia trachomatis L2/25667R]
 gi|440541116|emb|CCP66630.1| putative ATPase [Chlamydia trachomatis L3/404/LN]
 gi|440542004|emb|CCP67518.1| putative ATPase [Chlamydia trachomatis L2b/UCH-2]
 gi|440542894|emb|CCP68408.1| putative ATPase [Chlamydia trachomatis L2b/Canada2]
 gi|440543786|emb|CCP69300.1| putative ATPase [Chlamydia trachomatis L2b/LST]
 gi|440544676|emb|CCP70190.1| putative ATPase [Chlamydia trachomatis L2b/Ams1]
 gi|440545566|emb|CCP71080.1| putative ATPase [Chlamydia trachomatis L2b/CV204]
 gi|440913828|emb|CCP90245.1| putative ATPase [Chlamydia trachomatis L2b/Ams2]
 gi|440914718|emb|CCP91135.1| putative ATPase [Chlamydia trachomatis L2b/Ams3]
 gi|440915609|emb|CCP92026.1| putative ATPase [Chlamydia trachomatis L2b/Canada1]
 gi|440916503|emb|CCP92920.1| putative ATPase [Chlamydia trachomatis L2b/Ams4]
 gi|440917393|emb|CCP93810.1| putative ATPase [Chlamydia trachomatis L2b/Ams5]
          Length = 614

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507


>gi|15604864|ref|NP_219648.1| serine/threonine protein kinase [Chlamydia trachomatis D/UW-3/CX]
 gi|385243352|ref|YP_005811198.1| NIRV family protein [Chlamydia trachomatis D-EC]
 gi|385244232|ref|YP_005812076.1| NIRV family protein [Chlamydia trachomatis D-LC]
 gi|386262498|ref|YP_005815777.1| serine-threonine-protein kinase [Chlamydia trachomatis Sweden2]
 gi|55667872|sp|O84147.1|PKN1_CHLTR RecName: Full=Serine/threonine-protein kinase pkn1
 gi|3328546|gb|AAC67736.1| Serine/threonine Protein Kinase [Chlamydia trachomatis D/UW-3/CX]
 gi|289525186|emb|CBJ14661.1| serine-threonine-protein kinase [Chlamydia trachomatis Sweden2]
 gi|297748275|gb|ADI50821.1| Serine/threonine protein kinase [Chlamydia trachomatis D-EC]
 gi|297749155|gb|ADI51833.1| Serine/threonine protein kinase [Chlamydia trachomatis D-LC]
 gi|440527735|emb|CCP53219.1| putative ATPase [Chlamydia trachomatis D/SotonD5]
 gi|440528626|emb|CCP54110.1| putative ATPase [Chlamydia trachomatis D/SotonD6]
 gi|440531307|emb|CCP56817.1| putative ATPase [Chlamydia trachomatis F/SotonF3]
 gi|440532198|emb|CCP57708.1| putative ATPase [Chlamydia trachomatis G/SotonG1]
          Length = 614

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
            HPVV V+W  A +Y  W G RLP+EAEWE    GG     +P G  +    + +AN + 
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
            +             T PVMSY  +  GLY+M GNV+EW  DW++   + + +  P    
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           GP  G  +V +GG   C +      RCA R +N P +     GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612



 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +M+ + G  F  G+      +  E P  N+TL  F+LD H V+N QF  F+   G  +E 
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
           ++  +  +         R K S+++    R       IE     HPVV ++W  A +Y  
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479

Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
           W G RLP+EAEWE    GG     +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507


>gi|260831884|ref|XP_002610888.1| hypothetical protein BRAFLDRAFT_91482 [Branchiostoma floridae]
 gi|229296257|gb|EEN66898.1| hypothetical protein BRAFLDRAFT_91482 [Branchiostoma floridae]
          Length = 297

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 68/200 (34%), Positives = 94/200 (47%), Gaps = 29/200 (14%)

Query: 164 HPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNN 223
            P G  S I  ++ HPVVHV W DA AYC W G RLPTE EWE   +GGL+ R  PWG  
Sbjct: 56  QPAGPRSNIMDKLQHPVVHVRWTDAAAYCYWAGKRLPTEEEWEVAAKGGLDGRKLPWGGR 115

Query: 224 LTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF-------GLY-NMVGNVWEW 275
            +P    R N+WQ             L     + ++  +F       G Y   +GNVWEW
Sbjct: 116 YSP---ERLNIWQ---------TTTLLKMVTSLRHQFCRFQSRTITVGTYVRPLGNVWEW 163

Query: 276 TADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHR--CAARSQNTPDSSAG 333
           T+  +       P    +      + V +GGS+L ++   + +R     R     D    
Sbjct: 164 TSSQF------IPPGMRREEVEDKEYVVRGGSWLDSKDGSFNNRVHVTTRKGQAADVGCD 217

Query: 334 NLGFRCAADKGPTTGTDKVK 353
           ++GFRCA    PT+ +DKV+
Sbjct: 218 HVGFRCAQSVVPTS-SDKVR 236



 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 48/121 (39%), Positives = 71/121 (58%), Gaps = 4/121 (3%)

Query: 51  DQHEVSNTQFQEFVSATG--YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLD 108
           D   VS  + ++F+ + G  + T++EK G + VF+ L+SEE +   +    ++ +  G  
Sbjct: 4   DPAVVSRFRIRKFIRSKGKRFQTDSEKIGWSHVFQDLVSEEVKQNFTV--ENVTQPAGPR 61

Query: 109 STIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGI 168
           S I  ++ HPVVH+ W DA AYC W G RLPTE EWE   +GGL+ R  PWG    PE +
Sbjct: 62  SNIMDKLQHPVVHVRWTDAAAYCYWAGKRLPTEEEWEVAAKGGLDGRKLPWGGRYSPERL 121

Query: 169 D 169
           +
Sbjct: 122 N 122


>gi|325105235|ref|YP_004274889.1| protein involved in gliding motility GldK [Pedobacter saltans DSM
           12145]
 gi|324974083|gb|ADY53067.1| protein involved in gliding motility GldK [Pedobacter saltans DSM
           12145]
          Length = 451

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 108/411 (26%), Positives = 169/411 (41%), Gaps = 91/411 (22%)

Query: 15  MVLLPGDTFRMG-TNKPILIKDGEFP-SRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
           MV +P  TF MG  ++ + I   +FP ++ VT+ AFY+D+ E++N+++++FV        
Sbjct: 47  MVYIPAGTFVMGQVDQDVTI--SQFPQNKQVTISAFYMDETEITNSEYKQFVHWVRDSIA 104

Query: 73  AEKFG---DTFVFEPLLSEE--------------------ERAKISQVRHDM-----KRF 104
            ++ G   D ++ +P   +                     ++A + ++  +M     +RF
Sbjct: 105 VKRLGNGADKYMLKPKGKDANTGATFIDWKKVQSGKGIWGDKALVEELNKNMYYQDNERF 164

Query: 105 EGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLH 164
            G       ++ +      W D       RGAR P   +   G     +  +     +++
Sbjct: 165 FGKKELNVEQLKY---DYEWIDYRKAADMRGARDPNRNKNGLGVT--RDQFIVKEEVYVY 219

Query: 165 PEGI----DSTIEHR-------------MNHPVVHVSWNDAVAYCTWRGA---------- 197
           P+ +    D T                  N+PVV VSW  A A+  WR            
Sbjct: 220 PDTLVWISDFTYAQNDPMTKGYFSHPSFANYPVVGVSWKQAKAFTVWRSRLNEAYKTSRN 279

Query: 198 -------RLPTEAEWEYGCRGGLENRLFPWGNNLT--PRGEHRANVWQGEFPTNNTAADG 248
                   LPTEA++EY  RGG     +PWG       +G   AN   G     N   DG
Sbjct: 280 LPSRLDYDLPTEAQFEYAARGGRVGTTYPWGGPYIRNAKGCLLANFKPGR---GNYIDDG 336

Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-----VHHHPAPSYNPK----GPTTGT 299
              T    SY  N +GLYNM GNV EWT+  +N       +   PSY+       P    
Sbjct: 337 GAFTVYAKSYFPNDYGLYNMAGNVAEWTSSIYNPTATAYINTLNPSYDKAVKEGDPDYDK 396

Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTD 350
            +V KGGS+   +   Y  + +A+S   PDS+  ++GFRC     P  G D
Sbjct: 397 RRVVKGGSW---KDIGYFLQNSAKSYEYPDSARSSIGFRCVT---PFVGRD 441


>gi|294661128|ref|YP_003573003.1| hypothetical protein Aasi_1513 [Candidatus Amoebophilus asiaticus
           5a2]
 gi|227336278|gb|ACP20875.1| hypothetical protein Aasi_1513 [Candidatus Amoebophilus asiaticus
           5a2]
          Length = 369

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 99/357 (27%), Positives = 145/357 (40%), Gaps = 84/357 (23%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MVL+P   F MG  +  + K    P+R V++ +F++D+ EV+N +++ F++        E
Sbjct: 58  MVLIPTGAFMMGGVEEDIFKRAN-PNRRVSVSSFFMDETEVTNNEYRYFLTKV-----KE 111

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +F D F   P + E     I     D +  + LD  +       V+              
Sbjct: 112 RF-DYFKNNPSIGENTGTTID----DGQTTDPLDQKLTDEFIKDVLT------------- 153

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
               P    W       + +R+     +    G D       N+PVV VSW  A  +  W
Sbjct: 154 ----PNMDVWRTDFTNHMADRILE--GYFELRGYD-------NYPVVGVSWEAASYFAAW 200

Query: 195 R-----------------GARLPTEAEWEYGCRGGLENRLFPWGNNLT--PRGEHRANVW 235
           R                    LP+ A+WEY  RGG E   +PWG        G  RAN  
Sbjct: 201 RTKYFNENKEKKGLEKYPSFSLPSAAQWEYAARGGKELAKYPWGGPYIRDAEGNLRANFK 260

Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-----VHHHPAPSY 290
            G+    N +  GY  T+PV ++  N +GLY+M GNV EWT D +N     +     P Y
Sbjct: 261 SGQ---GNYSECGYTYTSPVTAFSPNDYGLYDMAGNVAEWTLDAYNPAAVALTWDSDPLY 317

Query: 291 -NPKGPTTGTDKVKKGGS------YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            + + P     K+ KGGS      +L    Y Y H+  ARS          +GFRC 
Sbjct: 318 LDDEQPM----KIIKGGSWKDISRFLQTGAYDYEHKDTARSY---------IGFRCV 361


>gi|254444401|ref|ZP_05057877.1| conserved domain protein [Verrucomicrobiae bacterium DG1235]
 gi|198258709|gb|EDY83017.1| conserved domain protein [Verrucomicrobiae bacterium DG1235]
          Length = 336

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 70/174 (40%), Positives = 90/174 (51%), Gaps = 16/174 (9%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLT-----PRGEHR 231
           N PVV V W DA AY  W G RLPTEAEWE   RG  E RL+PWGN+ +       G+  
Sbjct: 170 NMPVVGVDWFDAYAYAKWAGKRLPTEAEWELAARGDDE-RLWPWGNSWSWGLCNIGGDKS 228

Query: 232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHHPA-P 288
               + + P      DGY+  APV S+ ++   FG  +M GNV EW ADW++   +   P
Sbjct: 229 GKDARDKSPDR----DGYIYPAPVGSFADSASPFGCMDMAGNVAEWCADWYSEDAYKVFP 284

Query: 289 SYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             NP G      +  +GGS   ++      RC+ARS   P+     LGFRCA D
Sbjct: 285 DRNPIGTIDSGYRSIRGGS---SQSVPSGVRCSARSFEEPEFRKFTLGFRCAKD 335



 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 62/188 (32%), Positives = 85/188 (45%), Gaps = 50/188 (26%)

Query: 14  DMVLLPGDTFRMGTN----------------KP--ILIK--------DGEFPSRNVTLDA 47
           +MV++PG  F MG +                +P  +LI         + E P R V +D 
Sbjct: 40  EMVIIPGSIFAMGNDLTHPDFANQAVEGKALRPYHVLIARAKKAWQMEAERPVRKVRVDR 99

Query: 48  FYLDQHEVSNTQFQEF---VSATGYVT---EAEKFGDTFV------FEPLLSEEERAKIS 95
           F +D+HEV+N Q++ F   V   G  +   E +  G          + PLL +E  A ++
Sbjct: 100 FAIDRHEVTNRQYRLFLDWVQLNGDDSVRHEDQPVGKDHTPRYWTSYNPLLQDEATAALA 159

Query: 96  QVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 155
                        S    + + PVV + W DA AY  W G RLPTEAEWE   RG  E R
Sbjct: 160 P----------FSSETFLKDNMPVVGVDWFDAYAYAKWAGKRLPTEAEWELAARGDDE-R 208

Query: 156 LFPWG-SW 162
           L+PWG SW
Sbjct: 209 LWPWGNSW 216


>gi|120613095|ref|YP_972773.1| hypothetical protein Aave_4459 [Acidovorax citrulli AAC00-1]
 gi|120591559|gb|ABM34999.1| protein of unknown function DUF323 [Acidovorax citrulli AAC00-1]
          Length = 390

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 69/182 (37%), Positives = 92/182 (50%), Gaps = 28/182 (15%)

Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           +HPVV+++W+DA A   W  A      RLPTEAEWEY CR G   R +  GN+      H
Sbjct: 197 DHPVVNITWHDAHAMAAWLSATEGHRYRLPTEAEWEYACRAGKAAR-YGHGNDPAALPRH 255

Query: 231 RANVWQGEFP-------TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH 283
                Q   P             DG+  TAPV  Y  N FGL++M GNVWEW +DW    
Sbjct: 256 ANTFDQAAAPYWPRWQAQAVPGHDGHAFTAPVAGYPANAFGLHDMSGNVWEWVSDW---- 311

Query: 284 HHPAPSY------NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 337
            H   +Y      +P+GP TGT +V++GGS+     Y    RC  R+ N+P++    +G 
Sbjct: 312 -HADDAYARSARDDPQGPATGTVRVRRGGSWHTWPLYA---RCTYRNWNSPETRYTLVGM 367

Query: 338 RC 339
           R 
Sbjct: 368 RL 369



 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 55/164 (33%), Positives = 77/164 (46%), Gaps = 23/164 (14%)

Query: 15  MVLLPGDTFRMGTNKPI--LIKD-------------GEFPSRNVTLD-AFYLDQHEVSNT 58
            V +P  +FRMG+++ +  L +D              E P   V +  AFYL Q EV+  
Sbjct: 79  FVRVPAGSFRMGSDESVKHLARDYPGMEAGRLSALADEAPVHTVRITRAFYLGQTEVTVG 138

Query: 59  QFQEFVSATGYVTEAEKFGDT-FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
           QF+ FV A+GY  E++  G   + + P    ++ A+         R+            H
Sbjct: 139 QFRRFVEASGYRAESDADGTGGYGYNPAYDPDKSARGDAFEGRSPRYSWRYPGFAQGDDH 198

Query: 118 PVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENR 155
           PVV+I+W+DA A   W  A      RLPTEAEWEY CR G   R
Sbjct: 199 PVVNITWHDAHAMAAWLSATEGHRYRLPTEAEWEYACRAGKAAR 242


>gi|451981317|ref|ZP_21929679.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
 gi|451761462|emb|CCQ90935.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
          Length = 280

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 69/179 (38%), Positives = 92/179 (51%), Gaps = 22/179 (12%)

Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL 224
           P+ +D+   ++   PVV V+W +A  YC W G RLPTEAEWE   R G + R +PWGN+ 
Sbjct: 120 PKYMDNEKFNQPTQPVVGVTWQEAYDYCEWAGKRLPTEAEWEKAAR-GPDGRFYPWGNDP 178

Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWWNV 282
            P    RAN             DG+   APV ++ +  + +G  +M GNVWEWTADW   
Sbjct: 179 DPT---RANA--------RGMKDGHRYPAPVGTFPKGASPYGALDMAGNVWEWTADW--- 224

Query: 283 HHHPAPSYNPKGPTTG-TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            + P P    K    G T +V +GGS+  N       R   R + TPD     +GFRCA
Sbjct: 225 -YLPYPENLVKNDLYGNTFRVMRGGSWFSNMDLA---RSTVRGKLTPDQRQNYIGFRCA 279



 Score = 72.0 bits (175), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 49/160 (30%), Positives = 72/160 (45%), Gaps = 46/160 (28%)

Query: 15  MVLLPGDTFRMGTN--------KPILIKDGEF-----PSRNVTLDAFYLDQHEVSNTQFQ 61
           MV +P     MG++        K    +  E+     PS  V ++AF +DQHEV+N ++ 
Sbjct: 50  MVFVPAGFLTMGSSEEDIQWAAKTFFSESLEYYRDETPSHKVQVEAFEIDQHEVTNGEYG 109

Query: 62  EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
           +++ ATG    A K+ D   F                              ++   PVV 
Sbjct: 110 KYLKATG--RPAPKYMDNEKF------------------------------NQPTQPVVG 137

Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
           ++W +A  YC W G RLPTEAEWE   R G + R +PWG+
Sbjct: 138 VTWQEAYDYCEWAGKRLPTEAEWEKAAR-GPDGRFYPWGN 176


>gi|268325661|emb|CBH39249.1| hypothetical protein, calcineurin-like phosphoesterase, NACHT domain,
            and DUF323 family [uncultured archaeon]
          Length = 1107

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 65/174 (37%), Positives = 92/174 (52%), Gaps = 30/174 (17%)

Query: 177  NHPVVHVSWNDAVAYCTWRGAR------LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
            N PVV + W +A AY  W   R      LPTEAE E   RG +    +PWG+        
Sbjct: 949  NFPVVGIIWYEAEAYANWLSKRTEHPYRLPTEAECEKAARG-IGGYKYPWGD-------- 999

Query: 231  RANVWQGEFPTN--NTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHH- 285
                   EF  N  N++  G   T+PV  + + K  +G ++M GNVWEW +DW+N  ++ 
Sbjct: 1000 -------EFDKNLCNSSESGLYHTSPVGIFPKGKSPYGCFDMAGNVWEWCSDWYNDKYYG 1052

Query: 286  PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
             +P  NPKGP++G D+V +GGS+L   +YC   R A R  + P   + +LGFR 
Sbjct: 1053 NSPDRNPKGPSSGADRVFRGGSWLHGARYC---RSAYRDLSVPRDRSSDLGFRL 1103



 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 51/163 (31%), Positives = 74/163 (45%), Gaps = 33/163 (20%)

Query: 11  RYKDMVLLPGDTFRMGTNKPILIKDG-----EFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
           +  +MVL+ G  F  G+++     D      E P R + +D F + ++ V+N +F+EFV 
Sbjct: 856 KVDNMVLVKGGKFMRGSSE----DDADAYPEEKPQREIYIDDFMIGKYPVTNEEFKEFVD 911

Query: 66  ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVR--HDMKRFEGLDSTIEHRMHHPVVHIS 123
             GY  E       F         E+ KI + R  HD +++ G         + PVV I 
Sbjct: 912 DGGYDREE------FWTREGWRWREKNKIFKPRYWHD-RKWNG--------SNFPVVGII 956

Query: 124 WNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWG 160
           W +A AY  W         RLPTEAE E   R G+    +PWG
Sbjct: 957 WYEAEAYANWLSKRTEHPYRLPTEAECEKAAR-GIGGYKYPWG 998


>gi|340617143|ref|YP_004735596.1| gliding motility membrane lipoprotein [Zobellia galactanivorans]
 gi|339731940|emb|CAZ95208.1| Gliding motility membrane lipoprotein [Zobellia galactanivorans]
          Length = 451

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 108/410 (26%), Positives = 168/410 (40%), Gaps = 91/410 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
           M L+P  +F MG ++    K    P++ VT+ +FY+D  E++N+++++F       ++ T
Sbjct: 42  MELIPRGSFIMGKSEEDQAKVLNAPTKTVTVRSFYMDDTEITNSEYRQFVEWVQDSITRT 101

Query: 68  GYVTEAEKFG----DTFVFEPLLSEEERAKISQV-RHDMKRFEGLDST-IEHRMHHPVVH 121
                A++ G    D  + E    + +  K+S   ++ +  + G+  T  E R  +    
Sbjct: 102 RLAILADELGLGPEDGGIGEYAFKDADTTKLSAYDKYMLDNYAGMGETGFEGRALNKEED 161

Query: 122 ISWN-----DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGI-------- 168
           + W+     D           +P E  +  G R     +L    SW+  E          
Sbjct: 162 LIWDISEYPDEYYTEVMDSLYIPEEESYN-GQRTIDVTKLKYKYSWMDIEAAARSRTGSR 220

Query: 169 -------------DSTI-----EHRMN---------------HPVVHVSWNDAVAYCTWR 195
                        D+T+     E+  N               +PVV VSW  A A+C WR
Sbjct: 221 KDFIRQEELEIYPDTTVWIRDFEYSYNEPMHNDYFWHDAYSEYPVVGVSWQQARAFCNWR 280

Query: 196 GA-----------------RLPTEAEWEYGCRGGLENRLFPWGN--NLTPRGEHRANV-- 234
                              RLPTEAEWEY  RGG+E   +PWG    ++  G   AN   
Sbjct: 281 TKFKNDDQKSKGKQFVNQFRLPTEAEWEYAARGGIEGGTYPWGGPYVISDTGCFMANFKP 340

Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPK 293
            +G++     AAD  L T    SY+ N F LYNM GNV EWT   ++   +    + NP 
Sbjct: 341 QRGDY-----AADAALYTVEAKSYEPNDFNLYNMAGNVSEWTNSSYDPSSYEFVSTMNPN 395

Query: 294 -GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
            G      KV +GGS+   +   Y  + + R     DS+   +GFR   D
Sbjct: 396 GGDGANARKVIRGGSW---KDVAYFLQVSTRDYEYADSARSYIGFRTVQD 442


>gi|326319127|ref|YP_004236799.1| sulfatase-modifying factor protein [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323375963|gb|ADX48232.1| Sulphatase-modifying factor protein [Acidovorax avenae subsp.
           avenae ATCC 19860]
          Length = 339

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 70/182 (38%), Positives = 92/182 (50%), Gaps = 28/182 (15%)

Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           +HPVV+V+W+DA A   W  A      RLPTEAEWEY C  G + R +  GN+      H
Sbjct: 155 DHPVVNVTWHDAHALAAWLSATEGHRYRLPTEAEWEYACLAGRQAR-YGHGNDPAALARH 213

Query: 231 RANVWQGEFP-------TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH 283
                Q   P             DG+  TAPV SY  N FGL++M GN WEW +DW    
Sbjct: 214 ANTFDQDAAPYWPRWQAQAVPGHDGHAFTAPVASYPPNAFGLHDMNGNAWEWVSDW---- 269

Query: 284 HHPAPSY------NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 337
            H   +Y      +P+GP TGT +V++GGS+     Y    RC  R+ NTP++    +G 
Sbjct: 270 -HADDAYARSVREDPQGPATGTVRVRRGGSWHTWPLYA---RCTYRNWNTPETRYTLVGM 325

Query: 338 RC 339
           R 
Sbjct: 326 RL 327



 Score = 82.4 bits (202), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 53/164 (32%), Positives = 75/164 (45%), Gaps = 23/164 (14%)

Query: 15  MVLLPGDTFRMGTNKPI---------------LIKDGEFPSRNVTLD-AFYLDQHEVSNT 58
            V +P  +FRMG+++P+                + D E P   V +  AFY+ Q EV+  
Sbjct: 37  FVRVPAGSFRMGSDEPVERLARDYPGMEARRLAVLDDEAPVHRVRITRAFYMGQTEVTVG 96

Query: 59  QFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
           QF+ FV A+GY  E+E  G   + + P        +         R+   D        H
Sbjct: 97  QFRRFVLASGYRAESEADGTGGYGYNPAYDPATTVRGDAFEGRSPRYSWRDPGFAQGDDH 156

Query: 118 PVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENR 155
           PVV+++W+DA A   W  A      RLPTEAEWEY C  G + R
Sbjct: 157 PVVNVTWHDAHALAAWLSATEGHRYRLPTEAEWEYACLAGRQAR 200


>gi|387792441|ref|YP_006257506.1| hypothetical protein Solca_3323 [Solitalea canadensis DSM 3403]
 gi|379655274|gb|AFD08330.1| hypothetical protein Solca_3323 [Solitalea canadensis DSM 3403]
          Length = 422

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 152/379 (40%), Gaps = 63/379 (16%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P  TF MG     +       +R VT+  FY+D+ E+SN ++++FV        A+
Sbjct: 45  MVYIPAGTFIMGQGDQDVTASRVSQNRQVTIAPFYMDETEISNNEYRQFVLWVRDSIAAK 104

Query: 75  KFGDTFVF------EPLLSEEERAKISQVRHDMKRFEGL----DSTIEHRMHHPVVHISW 124
             G  FV       E +  +++    S+   + +  +G+    D  I  R    V  + +
Sbjct: 105 TMGGNFVVKAEDGTEFINPKQKVTYTSKNNPNAETLKGMYYQGDDKITGRNDFDVRQLVY 164

Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENR----LFP----WGSWLHPEGIDSTIEHRM 176
                  TW   R     +     R     R    ++P    W +       +  +E   
Sbjct: 165 Q-----YTWFDIRSAALKQNRTKPRSSFIKREKIAIYPDTLVWLADFSYAQNEPMVEQYF 219

Query: 177 NHP------VVHVSWNDAVAYCTWRG-----------------ARLPTEAEWEYGCRGGL 213
           +HP      VV V+W  A A+C WR                    LPTEA++EY  RGG 
Sbjct: 220 SHPAFDDYPVVGVTWKQANAFCAWRTEFNNTYRDKMKKPRRGVVSLPTEAQFEYAARGGR 279

Query: 214 ENRLFPWGNNLT--PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGN 271
               +PWG       +G   AN   G     N A DG   T  V SY  N +GLYNM GN
Sbjct: 280 LGADYPWGGPYIRNSKGCILANFKPGR---GNYADDGGFYTVHVKSYFPNDYGLYNMAGN 336

Query: 272 VWEWTADWWN-----VHHHPAPSY----NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAA 322
           V EWT+  ++       H   P+Y    N   P     KV +GGS+   +   +  +   
Sbjct: 337 VSEWTSSTYDESSYTFVHDMNPTYTSEVNETTPMAQRRKVVRGGSW---KDIGFFLQNGT 393

Query: 323 RSQNTPDSSAGNLGFRCAA 341
           R+    D++   +GFRC A
Sbjct: 394 RAYEYLDTAKSYIGFRCVA 412


>gi|302039446|ref|YP_003799768.1| hypothetical protein NIDE4175 [Candidatus Nitrospira defluvii]
 gi|300607510|emb|CBK43843.1| conserved exported protein of unknown function, Sulfatase-modifying
           factor-like [Candidatus Nitrospira defluvii]
          Length = 279

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 70/180 (38%), Positives = 93/180 (51%), Gaps = 22/180 (12%)

Query: 167 GIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTP 226
           GI +++  R   PV++V W DA AYC W G RLP EAEWE   +G  + R FPWGN    
Sbjct: 116 GIPASLADR---PVINVDWADADAYCKWAGRRLPREAEWEKAAKGNNDWR-FPWGNVEPT 171

Query: 227 RGEHRAN-VWQGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNV- 282
            G    N  W GE            +  PV SY++ K  +G+Y+M GNVWEW  DW++  
Sbjct: 172 DGHLNFNQKWVGE-----------KTLMPVGSYEKGKSPYGVYDMAGNVWEWVNDWYDSR 220

Query: 283 HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
           ++  +P  NP GP +G  KV +G  +  NE      R   R  + P     + GFRCA D
Sbjct: 221 YYEKSPDKNPPGPDSGEKKVIRGAGW-QNETPTV--RIFTRVDSDPTMRNESTGFRCAMD 277



 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 50/151 (33%), Positives = 79/151 (52%), Gaps = 30/151 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDG---EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY-V 70
           +VL+P  ++ MG   P   +DG   E+P   + +  FY+D++EV+N ++ EFV AT + V
Sbjct: 43  LVLIPSGSYPMGV--PTGDRDGGRDEYPRHVIEITDFYIDKYEVTNARYLEFVKATNHRV 100

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
            +  K     ++E +                    G+ +++  R   PV+++ W DA AY
Sbjct: 101 PQNPKNPTRNLWEGI--------------------GIPASLADR---PVINVDWADADAY 137

Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
           C W G RLP EAEWE   +G  + R FPWG+
Sbjct: 138 CKWAGRRLPREAEWEKAAKGNNDWR-FPWGN 167


>gi|124008148|ref|ZP_01692846.1| GldK [Microscilla marina ATCC 23134]
 gi|123986396|gb|EAY26209.1| GldK [Microscilla marina ATCC 23134]
          Length = 398

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 147/359 (40%), Gaps = 73/359 (20%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV++P  TF MG     +       ++ +T+ AF++DQ EV+N Q++ F           
Sbjct: 64  MVVVPNGTFHMGQADEDVPATQINLNKQITISAFFMDQTEVTNNQYRLFT---------- 113

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
              D  +     SE  +A+  ++  + +  EG+   +  +   PV      D++      
Sbjct: 114 ---DVLL---AYSEGRQAQGEELPQEYQ--EGVMDKLFLKNGTPVRP----DSIKVMN-- 159

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
               P    W       + + L  +  ++HP   D        +PVV V W  A  +C W
Sbjct: 160 -EMYPDTTRWSTEYAHHMGDPLMEY-YFMHPAFDD--------YPVVGVDWQAANFFCQW 209

Query: 195 RGA-------------------RLPTEAEWEYGCRGGLENRLFPWGNNL--TPRGEHRAN 233
           R                     RLPTEAEWEY  RGG +   +PWG+       G   AN
Sbjct: 210 RTMHLNYYRHTVEGDPYPAPRFRLPTEAEWEYASRGGRDMAKYPWGSPYLRNTLGCMLAN 269

Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----P 288
              G     N   DG+  T PV  Y  N +GLY+M GNV EW  D +     P      P
Sbjct: 270 FKPGR---GNYYDDGFAYTGPVAQYFPNDYGLYDMAGNVAEWVLDAYFESSVPVVWDLNP 326

Query: 289 SYNPKGPTTGTD-------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
            YNP     G +       +V +GGS+     YC      AR+    DSS+ ++GFRCA
Sbjct: 327 VYNPPITKAGKNDVNINNRRVIRGGSWKDIAYYC---ETGARTFQHWDSSSTSIGFRCA 382


>gi|332661979|ref|YP_004451448.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332337476|gb|AEE54575.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 591

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 93/324 (28%), Positives = 136/324 (41%), Gaps = 83/324 (25%)

Query: 54  EVSNTQFQEFVSATGYVTEAEKFGDTFV--FEPLLSEEERAKISQVRHD-MKRFEGLDST 110
           E +  +F    SA+ YV EA++  D      +P        K S +  D +   +G    
Sbjct: 311 EAALLRFTRENSASPYVAEAQRLIDQLQSNIKPPAESLVPPKPSIIVPDHLVLVKGGTFQ 370

Query: 111 IEHRMHHPVV---------HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
           +     HPV           +++++  A+C     +LP + +W  G R            
Sbjct: 371 LGEDKTHPVTLSDFLIAKNQLTFDEYDAFCKATDRKLPDDRKWGRGKR------------ 418

Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR------------------------GA 197
                            PV++V+W DA+ YC WR                        G 
Sbjct: 419 -----------------PVIYVNWFDAIDYCNWRSQQEGLSQVYQVNKPQVNPNWQANGY 461

Query: 198 RLPTEAEWEYGCRGGLENRLFPW-GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
           RLPTEAEWEY  RGG+ ++ F + G+N         NV +  +   N+ +     T PV 
Sbjct: 462 RLPTEAEWEYAARGGVSSQGFTYAGSN---------NVGEVAWHDTNSGS----KTQPVG 508

Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCY 316
             K N+ GLY++ GNVWEW  D W V +    S +PKGP TG+ +V +GGS+  +  YC 
Sbjct: 509 QKKTNELGLYDLSGNVWEWCWD-WRVAYPSNASNDPKGPDTGSYRVLRGGSWSYSAGYC- 566

Query: 317 RHRCAARSQNTPDSSAGNLGFRCA 340
             R A R  N P  +    GFR A
Sbjct: 567 --RIALRFSNDPYDAYNYCGFRLA 588


>gi|196228313|ref|ZP_03127180.1| protein of unknown function DUF323 [Chthoniobacter flavus Ellin428]
 gi|196227716|gb|EDY22219.1| protein of unknown function DUF323 [Chthoniobacter flavus Ellin428]
          Length = 235

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 64/172 (37%), Positives = 92/172 (53%), Gaps = 14/172 (8%)

Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
           NHPVV+V+  +A  +C W      R  RLPTE EWEY  R G++ R FPWG  L      
Sbjct: 63  NHPVVYVNSREAEKFCQWLSTKEGRKYRLPTEPEWEYAAR-GMDGRTFPWGERLDAGNLA 121

Query: 231 RANVWQGEFP-TNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWWNVHHHPA 287
                Q  F   +    DG+  +APV S+    + FG+ +M GNV+EW  D++ V+    
Sbjct: 122 NFADKQTSFAWRDPNIDDGFPESAPVGSFPRGASPFGIEDMAGNVFEWCLDYFEVYRG-K 180

Query: 288 PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
              NP+GP+ GT ++ +GGS+   +      R  AR+ N PD S+ ++GFR 
Sbjct: 181 DRVNPRGPSNGTKRIYRGGSW---KSRATNLRTTARNHNLPDYSSNDVGFRV 229



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 46/154 (29%), Positives = 63/154 (40%), Gaps = 52/154 (33%)

Query: 16  VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
           VLL G T R  +         E P   VTL  F+L +  ++N Q+++F  A         
Sbjct: 8   VLLMGSTARDASPH-------EQPITQVTLSCFFLSRFPITNAQYEKFDPA--------- 51

Query: 76  FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW-- 133
                                  H MKR       +    +HPVV+++  +A  +C W  
Sbjct: 52  -----------------------HKMKR------ALNAGDNHPVVYVNSREAEKFCQWLS 82

Query: 134 ----RGARLPTEAEWEYGCRGGLENRLFPWGSWL 163
               R  RLPTE EWEY  R G++ R FPWG  L
Sbjct: 83  TKEGRKYRLPTEPEWEYAAR-GMDGRTFPWGERL 115


>gi|408370220|ref|ZP_11167998.1| gliding motility protein gldk [Galbibacter sp. ck-I2-15]
 gi|407744298|gb|EKF55867.1| gliding motility protein gldk [Galbibacter sp. ck-I2-15]
          Length = 451

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 106/419 (25%), Positives = 175/419 (41%), Gaps = 94/419 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
           M L+PG  F MG +   +      P++ VT+ +FY+D+ E++N+++++F       +  T
Sbjct: 41  MSLVPGGAFIMGRSDEDMADVLNAPTKTVTVRSFYMDETEITNSEYRQFVEWVKDSIVRT 100

Query: 68  GYVTEAEKFGDT----FVFEPLLSEEERAKISQV-RHDMKRFEGLDST-IEHRMHHPVVH 121
                A++ G+      + E    + + + +S+  ++ +  + G+  T  E R  +  V 
Sbjct: 101 KLAVLADELGEVPGTGGIGEFAFKDADTSNLSEYDKYMLFNYAGMGETGYEGRSLNKDVD 160

Query: 122 ISWN-------------DAVAYCT---WRGARLPTEAEWEYG-----------CRGG--- 151
           + W+             D+V   T   + G R     E +Y             +GG   
Sbjct: 161 LIWDVTEYPDEYYAEVMDSVYLSTEESYNGQRTIDVKELKYKYSWMDIQAAARAKGGGSR 220

Query: 152 ------------------LENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
                             +++  + +   +H +   S  E   ++PVV V+W  A A+C 
Sbjct: 221 LDFIKHEELNIYPDTTVWIKDFAYSYNEPMHNDYFWS--EAYSDYPVVGVTWKQAKAFCA 278

Query: 194 WRGA-----------------RLPTEAEWEYGCRGGLENRLFPWGN--NLTPRGEHRANV 234
           WR                   RLPTEAEWEY  RGG+++  +PWG    L  RG   AN 
Sbjct: 279 WRTKYKNDYQKENGKQFVNHFRLPTEAEWEYAARGGIQSGTYPWGGPYILNDRGCFMANF 338

Query: 235 --WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
              +G++     AAD  L T    SY  N + LYNM GNV EWT   ++ + +   S   
Sbjct: 339 KPLRGDY-----AADQALYTVEAKSYDPNDYNLYNMAGNVSEWTNSSYDPNSYEYVSTMN 393

Query: 293 KGPTTGTD--KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
                G +  KV +GGS+   +   Y  + + R     DS+   +GFR   D   T  T
Sbjct: 394 PNTYDGENKRKVIRGGSW---KDVAYFLQVSTRDYEYQDSARSYIGFRTVQDYMGTEST 449


>gi|91202283|emb|CAJ75343.1| hypothetical protein kuste4581 [Candidatus Kuenenia
           stuttgartiensis]
          Length = 476

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 63/167 (37%), Positives = 84/167 (50%), Gaps = 20/167 (11%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           ++PV  V W DA AY  W+G RLPTEAEWE   R GL+ R FPWGN        R N+  
Sbjct: 324 DYPVARVDWYDAYAYAAWKGKRLPTEAEWEKAAR-GLDVRAFPWGNEW---DHTRCNLSG 379

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNPKGP 295
              P       G + +   M      +G Y+M G+V+EW  DW+ + ++  +PS NP GP
Sbjct: 380 EPMPV------GSVESGKSM------YGCYDMSGSVFEWCNDWFSSTYYQYSPSINPTGP 427

Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
             G  K+ +GGS        ++ R   R    PD     +GFRCA D
Sbjct: 428 EKGERKLIRGGSRFSRP---FQVRVTERKSERPDLFNMAIGFRCAKD 471



 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/152 (37%), Positives = 78/152 (51%), Gaps = 21/152 (13%)

Query: 14  DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           D VL+P   F MGTN+    +D   P   V LDA+ +D++E++N Q+ EFV    Y+ E 
Sbjct: 241 DTVLIPAGKFSMGTNEA---RDVSQPEHVVYLDAYEIDRYEITNAQYWEFVK---YIEET 294

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                 F  EP+  +          H  + +E     +     +PV  + W DA AY  W
Sbjct: 295 NDHSKCFKDEPIGKD----------HKPRYWEDEYYNVPD---YPVARVDWYDAYAYAAW 341

Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGS-WLH 164
           +G RLPTEAEWE   R GL+ R FPWG+ W H
Sbjct: 342 KGKRLPTEAEWEKAAR-GLDVRAFPWGNEWDH 372


>gi|332668391|ref|YP_004451179.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332337205|gb|AEE54306.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 686

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/174 (39%), Positives = 92/174 (52%), Gaps = 20/174 (11%)

Query: 177 NHPVVHVSWNDAVAYCTWRG------ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
            HPV+HVSWNDA AYC W         RLP EAEWEY CR G     F  G NLT     
Sbjct: 516 QHPVIHVSWNDARAYCDWLEQRLSLPIRLPREAEWEYACRAGTTTP-FNTGANLTTV--- 571

Query: 231 RANVWQGEFPTNNTAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---- 285
           +AN + G +P  +     YL  T+P+  Y  N +GLY+M GNV EW  DW++  ++    
Sbjct: 572 QAN-YDGNYPYADFPKGTYLGRTSPIGLYPPNAWGLYDMHGNVLEWCEDWYSETYYNECK 630

Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
                 +P+G + G+ +V +GGS+L   ++C   R A R  N     +  +GFR
Sbjct: 631 KQGVVADPEGSSAGSARVLRGGSWLSFARHC---RTAYRFWNGSVYRSAGVGFR 681



 Score = 84.3 bits (207), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/146 (35%), Positives = 74/146 (50%), Gaps = 15/146 (10%)

Query: 13  KDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +D VL+ G  F MG  ++    +  E P   V L  FYL ++  +  QF+EF+ ATGY T
Sbjct: 420 QDFVLIKGGEFMMGAPDEEADARADEKPQHQVRLTDFYLSKYPTTLAQFEEFILATGYQT 479

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           +A++ G         S+    K  +++  +     +    +    HPV+H+SWNDA AYC
Sbjct: 480 DADQSGG--------SDFWNGKNWELQAGVNWTYDVHGAQQKDKQHPVIHVSWNDARAYC 531

Query: 132 TWRG------ARLPTEAEWEYGCRGG 151
            W         RLP EAEWEY CR G
Sbjct: 532 DWLEQRLSLPIRLPREAEWEYACRAG 557


>gi|451980491|ref|ZP_21928880.1| conserved exported hypothetical protein, DUF323 [Nitrospina
           gracilis 3/211]
 gi|451762227|emb|CCQ90112.1| conserved exported hypothetical protein, DUF323 [Nitrospina
           gracilis 3/211]
          Length = 287

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 67/167 (40%), Positives = 89/167 (53%), Gaps = 17/167 (10%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
             PVV ++W DAV YC W G RLPTEAEWE   R G + R+FPWG +  PR ++ AN   
Sbjct: 122 KRPVVQITWYDAVDYCLWAGKRLPTEAEWEKAAR-GTDGRMFPWGGD--PRLQNAANF-A 177

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHHP-APSYNPK 293
             +   NT         PV S  +N   +G+ +M GNV EW ADW++  ++   P  NP+
Sbjct: 178 HNWEDRNT-------LWPVGSQPQNASPYGVLDMAGNVREWVADWYSPDYYRWGPVRNPQ 230

Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           GP  G  KV +GGS+     +    R AAR +         +GFRCA
Sbjct: 231 GPGQGILKVIRGGSW---HSFAADIRTAARGKGGFALKTDGIGFRCA 274



 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 53/146 (36%), Positives = 69/146 (47%), Gaps = 29/146 (19%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +    F MG N     +D E P R V +D FY+D  EV+NT++  ++ +        
Sbjct: 49  MVWVQEGPFLMG-NVEGSGRDDERPQREVFVDGFYIDSVEVTNTRYLAYIKSAHRKEPPN 107

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
            +GD     PL  E++  K                        PVV I+W DAV YC W 
Sbjct: 108 PYGDG----PLSGEKDIGK-----------------------RPVVQITWYDAVDYCLWA 140

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWG 160
           G RLPTEAEWE   R G + R+FPWG
Sbjct: 141 GKRLPTEAEWEKAAR-GTDGRMFPWG 165


>gi|302035580|ref|YP_003795902.1| hypothetical protein NIDE0192 [Candidatus Nitrospira defluvii]
 gi|300603644|emb|CBK39975.1| conserved exported protein of unknown function, DUF323 [Candidatus
           Nitrospira defluvii]
          Length = 320

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 71/190 (37%), Positives = 101/190 (53%), Gaps = 29/190 (15%)

Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
            PVV VSW DA A+C W G RLPTEAEWE   +G   +  +PWG++L P+   +AN  Q 
Sbjct: 100 QPVVGVSWTDANAFCKWDGKRLPTEAEWEKAAKGPEGDNHYPWGHHLDPK---KANYGQ- 155

Query: 238 EFPTNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKG 294
                         T PV SY E  + +G+YNM GNV+EW  DW++  ++  + + NP+G
Sbjct: 156 ----------NVGRTTPVDSYPEGVSGYGVYNMAGNVFEWVDDWYDPKYYRTSNALNPRG 205

Query: 295 PTTGTD-------KVKKGGSYLCNEQYCY-RHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
              G +       KV +GGS+L  E   +  HR   + +N  +S    LGFRCA  K  +
Sbjct: 206 ADKGYNFANQGPVKVLRGGSWLAPETSLHTSHRFWNQPEN--NSYGVGLGFRCA--KSAS 261

Query: 347 TGTDKVKKGG 356
             +D+  + G
Sbjct: 262 VVSDEAAQAG 271



 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/155 (34%), Positives = 77/155 (49%), Gaps = 37/155 (23%)

Query: 12  YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
           +KDMVL+P   F MG+N+       +     V LDA+ +D+ E SN +++EF+ ATG+  
Sbjct: 32  HKDMVLIPSGEFTMGSNE-----HSDEVRHQVVLDAYLIDKFEASNARYKEFMRATGHPA 86

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
            A      +  +P LS+ E+                          PVV +SW DA A+C
Sbjct: 87  PA------YWDDPRLSKPEQ--------------------------PVVGVSWTDANAFC 114

Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPE 166
            W G RLPTEAEWE   +G   +  +PWG  L P+
Sbjct: 115 KWDGKRLPTEAEWEKAAKGPEGDNHYPWGHHLDPK 149


>gi|257093415|ref|YP_003167056.1| peptidase C14 caspase catalytic subunit p20 [Candidatus
           Accumulibacter phosphatis clade IIA str. UW-1]
 gi|257045939|gb|ACV35127.1| peptidase C14 caspase catalytic subunit p20 [Candidatus
           Accumulibacter phosphatis clade IIA str. UW-1]
          Length = 776

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 97/312 (31%), Positives = 123/312 (39%), Gaps = 62/312 (19%)

Query: 39  PSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVR 98
           P  NV +  F L  +EV+  QF++FV AT Y TEAE+ G +       S  E A      
Sbjct: 513 PQHNVKVANFELAYYEVTVGQFRKFVEATAYQTEAERVGRSPGCHVARSGGEFAVERSAT 572

Query: 99  HDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW-----RGARLPTEAEWEYGCRGGLE 153
                F   DS       HPVV +SWND  AY  W      G RLP+EAEWEY  R G  
Sbjct: 573 WRAPGFAQTDS-------HPVVCVSWNDTQAYVAWLNGAAAGYRLPSEAEWEYAARAGT- 624

Query: 154 NRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA---VAYCTWRGARLPTEAEWEYGCR 210
                                     V    W+D     A     G     E      CR
Sbjct: 625 --------------------------VTPRPWSDTGNFFARMFQSGKGTDPEQPPSRTCR 658

Query: 211 GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVG 270
                       N+      +A  W    P  +  +DG    +    +  N F LY+M+G
Sbjct: 659 NA----------NVADESLKKALAW----PVTHNCSDGQAYASSGGYFGRNGFSLYDMIG 704

Query: 271 NVWEWTADWWNVHHHPAPSYNPKGPTTG--TDKVKKGGSYLCNEQYCYRHRCAARSQNTP 328
           NV EWT D WN  H  AP  + K   +G  + +V +GGS+     Y    R AAR +N  
Sbjct: 705 NVSEWTQDCWNPSHAGAP-LDGKARLSGDCSQRVIRGGSWSSGPVYA---RSAARQKNPA 760

Query: 329 DSSAGNLGFRCA 340
              A +LGFR A
Sbjct: 761 SYRASDLGFRLA 772


>gi|423013771|ref|ZP_17004492.1| hypothetical protein AXXA_04948 [Achromobacter xylosoxidans AXX-A]
 gi|338783265|gb|EGP47633.1| hypothetical protein AXXA_04948 [Achromobacter xylosoxidans AXX-A]
          Length = 328

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 68/176 (38%), Positives = 91/176 (51%), Gaps = 18/176 (10%)

Query: 178 HPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGE-- 229
           HPVV+VSWNDA A   W         RLPTEAEWEY  R G   R +  G++     E  
Sbjct: 152 HPVVNVSWNDATAMARWLSREEGHVYRLPTEAEWEYAARAGSRGR-YQNGDDPAALPEVA 210

Query: 230 -----HRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
                  A +W           DG+  TAPV SY  N FGL++M GN WEW +DW+   +
Sbjct: 211 NTFDADAAPLWPKWQAFAAARRDGHAFTAPVASYAPNAFGLHDMHGNAWEWVSDWYGEDY 270

Query: 285 HP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           +  +P  +P+GP +G  KV++GGS+     Y    R A R+ NT ++    +GFR 
Sbjct: 271 YAHSPQDDPQGPASGNVKVRRGGSWHTWALYV---RSAYRNWNTVETRYPLVGFRL 323



 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 61/184 (33%), Positives = 82/184 (44%), Gaps = 33/184 (17%)

Query: 1   MVLLPAPPVERYKDMVL--LPGDTFRMGTNK--PILIKD-------------GEFPSRNV 43
           M   P  P+E    MV   LP  +F+MG+++    L +D              E P   V
Sbjct: 17  MAAEPPAPIENSLGMVFVPLPAGSFQMGSDETPEALARDYPQYPKERFAQLFDEAPVHTV 76

Query: 44  TLD-AFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQV 97
            +   F + + EV+  QF+ F+ A+GYV E+E      +G    ++P  SE   A   + 
Sbjct: 77  RITRPFLMARTEVTVGQFRRFLEASGYVPESEADGTGGYGYNAAYDPDASERGDAFEGRD 136

Query: 98  RHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGG 151
           R    R  G   T  H    PVV++SWNDA A   W         RLPTEAEWEY  R G
Sbjct: 137 RRYSWRNPGFAQTDAH----PVVNVSWNDATAMARWLSREEGHVYRLPTEAEWEYAARAG 192

Query: 152 LENR 155
              R
Sbjct: 193 SRGR 196


>gi|332666591|ref|YP_004449379.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332335405|gb|AEE52506.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
           DSM 1100]
          Length = 625

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/178 (39%), Positives = 90/178 (50%), Gaps = 34/178 (19%)

Query: 179 PVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGG--------LENRLFPWGNNL 224
           PVV VSW DA AYC W         RLPTEAEWEY  RGG         +   +   N++
Sbjct: 463 PVVGVSWEDAKAYCEWLSRKTGLKYRLPTEAEWEYAARGGGLGITQTNFQVTQYAGSNSI 522

Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
                H +N   G  P             PV   K N+ GLY+M GNVWEW ADW++  +
Sbjct: 523 DEVAWHSSN--SGSKP------------HPVGQKKPNRLGLYDMSGNVWEWCADWYDSSY 568

Query: 285 HP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRH-RCAARSQNTPDSSAGNLGFRCA 340
           +  +PS NP G  TG+D+V +GGS+     Y  R  R A R + TP +   ++GFR A
Sbjct: 569 YKNSPSTNPPGAITGSDRVCRGGSW----NYVPRDARMAGRGRYTPVNRIYDIGFRVA 622



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 53/181 (29%), Positives = 77/181 (42%), Gaps = 29/181 (16%)

Query: 15  MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
           +VL+ G +F MG T++     + E P+  V L  FYL ++E++  ++ +F   T      
Sbjct: 373 LVLVKGGSFTMGCTSEQQNCGEDEKPTHQVNLGDFYLSKNELTIGEYLKFADET------ 426

Query: 74  EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
                   +   L +  +  +     D  + +GL     +    PVV +SW DA AYC W
Sbjct: 427 -----RSHYPEWLEQGSKYNLETGSDDHYKNKGLS---RNATDLPVVGVSWEDAKAYCEW 478

Query: 134 RGA------RLPTEAEWEYGCRG---GLENRLFPWGSWLHPEGIDSTIEHRMN-----HP 179
                    RLPTEAEWEY  RG   G+    F    +     ID    H  N     HP
Sbjct: 479 LSRKTGLKYRLPTEAEWEYAARGGGLGITQTNFQVTQYAGSNSIDEVAWHSSNSGSKPHP 538

Query: 180 V 180
           V
Sbjct: 539 V 539


>gi|163753939|ref|ZP_02161062.1| lipoprotein, putative [Kordia algicida OT-1]
 gi|161326153|gb|EDP97479.1| lipoprotein, putative [Kordia algicida OT-1]
          Length = 454

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 114/424 (26%), Positives = 180/424 (42%), Gaps = 95/424 (22%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV-------SAT 67
           M L+PG +F MG +   + +    P++ VT+ +FY+D+ E++N++++EFV         T
Sbjct: 41  MTLVPGGSFIMGKSDDDIAQTKNAPTKTVTVPSFYMDETEITNSEYREFVYWVRDSIIRT 100

Query: 68  GYVTEAEKFGDTF-----VFEPLLSEEERAKISQ-VRHDMKRFEGL-DSTIEHRMHHPVV 120
                A++ G T      + +    + + A ++   +++++ + GL D+  E R  +   
Sbjct: 101 KLAILADEEGKTDPAEDGIAKYAFKDADTADLTAWEKYNLENYVGLGDTGYEGRRLNRDE 160

Query: 121 HISWNDAVA----YCTWRGARLPTEAEWEYGCR-----------------GGLENRLFPW 159
            + W+ +      Y     + L  EAE   G R                    +N+    
Sbjct: 161 ELIWDYSEIPDEFYAEVMDSMLIPEAESYNGQRTIDWTKVRFQYTWMDIQAAAKNKGMKR 220

Query: 160 GSWLHPEGI----DSTI-----EHRMNHPVV-----HVSWND----------AVAYCTWR 195
             ++  E +    D+T+     E+  N P+      H ++ND          A A+C WR
Sbjct: 221 TEFIKRENLEIYPDTTVWIRDFEYSYNEPMHNDYFWHDAYNDYPVVGVTWKQAKAFCEWR 280

Query: 196 GA-----------------RLPTEAEWEYGCRGGLENRLFPWGN--NLTPRGEHRANVWQ 236
                              RLP+EAEWEY  RGGLE   +PWG    +  RG   AN   
Sbjct: 281 TLKKNIYRKERKRHMINKFRLPSEAEWEYAARGGLEGATYPWGGPYTMNDRGCFLANF-- 338

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYN 291
            +    N AAD  L T    SY+ N F LYNM GNV EWTA  ++   +      +P+ N
Sbjct: 339 -KPLRGNYAADQALYTVEAESYEPNDFNLYNMAGNVAEWTASSYDPSSYEYMSTMSPNVN 397

Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDK 351
               T    KV +GGS+   E +    + + R     DS+   +GFR         GTD 
Sbjct: 398 ---DTLNKRKVIRGGSWKDVEYFL---QVSTRDFEYRDSARSYIGFRTVQ---SYMGTDV 448

Query: 352 VKKG 355
           V KG
Sbjct: 449 VGKG 452


>gi|218781314|ref|YP_002432632.1| hypothetical protein Dalk_3476 [Desulfatibacillum alkenivorans
           AK-01]
 gi|218762698|gb|ACL05164.1| protein of unknown function DUF323 [Desulfatibacillum alkenivorans
           AK-01]
          Length = 307

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 25/188 (13%)

Query: 164 HPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGAR--------LPTEAEWEYGCRGGLEN 215
           HP  +D+        PVV+VSW +A A+  W  A+        LPTEAEWE+ CR G E 
Sbjct: 127 HPLNMDA-------QPVVNVSWKNARAFAIWLSAQNQGQYSFWLPTEAEWEFACRAGTET 179

Query: 216 RLFPWGNNLTPRGEHR--ANVWQGEF-PTNNT--AADGYLSTAPVMSYKENKFGLYNMVG 270
             + WG++++   ++   A+   G++ P+ N     DG+  +APV S+  N FGLY+M+G
Sbjct: 180 SRY-WGDDVSQTCQYANVADQTAGKYWPSWNVHPCNDGFAVSAPVGSFSPNAFGLYDMMG 238

Query: 271 NVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS 330
           NVWE   D W   + P    NP GP TG  ++ +GGS+   +      R A RS  TP  
Sbjct: 239 NVWEACQD-WKAPYRPGEQVNPAGPQTGKRRMVRGGSW---DNESRGIRSANRSYATPSF 294

Query: 331 SAGNLGFR 338
              N GFR
Sbjct: 295 KRYNNGFR 302



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 46/158 (29%), Positives = 67/158 (42%), Gaps = 48/158 (30%)

Query: 14  DMVLLPGDTFRMGTN---KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
           + V +PG  F MG+     P   +D E P   V LD F++ ++EV+N Q+  F       
Sbjct: 64  EFVFIPGQCFMMGSTGKTDPDRYQD-EGPIHKVCLDGFWMGRYEVTNAQYHFFRPG---- 118

Query: 71  TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
                                       HD   +EG    ++ +   PVV++SW +A A+
Sbjct: 119 ----------------------------HDSGEYEGHPLNMDAQ---PVVNVSWKNARAF 147

Query: 131 CTWRGAR--------LPTEAEWEYGCRGGLENRLFPWG 160
             W  A+        LPTEAEWE+ CR G E   + WG
Sbjct: 148 AIWLSAQNQGQYSFWLPTEAEWEFACRAGTETSRY-WG 184


>gi|389795065|ref|ZP_10198203.1| sulfatase-modifying factor protein [Rhodanobacter fulvus Jip2]
 gi|388431516|gb|EIL88587.1| sulfatase-modifying factor protein [Rhodanobacter fulvus Jip2]
          Length = 645

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 99/335 (29%), Positives = 146/335 (43%), Gaps = 76/335 (22%)

Query: 15  MVLLPGDTFRMGT--NKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVT 71
           MV++P  +F+MG+  ++P    D E P   VT+   F + +  ++  QF+EFV A+GY  
Sbjct: 373 MVVIPTGSFQMGSPASEPGH-NDAETPQHAVTIAMGFAMARSAITVAQFREFVRASGYRP 431

Query: 72  EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
           ++EK G   V++      ER+   +   D   +E   +  +     PVV++SWNDA AY 
Sbjct: 432 DSEKLGGASVYD------ERSGTMRDDADAS-WEDDYAGRKAADDLPVVNVSWNDASAYA 484

Query: 132 TWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
            W G       RL +EAE+EY  RGG   R + WG        D T +  + +       
Sbjct: 485 AWLGERTGKIYRLASEAEFEYALRGGTTTRYW-WG--------DGTPQKTVEN------- 528

Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
                  T  G   PT   W +  RG                  +R              
Sbjct: 529 ------LTGSGDHSPTGRRWSHAFRG------------------YR-------------- 550

Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTG-TDKVKK 304
            DGY   APVM +  N FGLY++ GNV EW AD W+ +++ AP         G T +V +
Sbjct: 551 -DGYWGPAPVMKFAPNPFGLYDINGNVSEWVADCWHDNYNRAPVNGGAWINPGCTARVVR 609

Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
           GGS+  +     + R A R        +G +GFR 
Sbjct: 610 GGSWGSSPD---QERSAYRQGVDATVRSGRVGFRV 641


>gi|359461177|ref|ZP_09249740.1| serine/threonine protein kinase [Acaryochloris sp. CCMEE 5410]
          Length = 627

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 67/172 (38%), Positives = 93/172 (54%), Gaps = 17/172 (9%)

Query: 178 HPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
           +PV  +SW +A+ +C        R  RLP+EAEWEY CR G     F +G  L+     +
Sbjct: 461 YPVEQISWREAIEFCARLSQKAGREYRLPSEAEWEYACRAGTATP-FSYGPTLS---SQQ 516

Query: 232 ANVWQGEFPTNNTAADGYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY 290
           AN +QG+F   N     Y  ST PV  +  N FGLY+M GNV+EW  D W+ ++  AP  
Sbjct: 517 AN-YQGQFVYGNGPKGVYRESTTPVGRFSPNNFGLYDMHGNVYEWCLDCWHKNYDGAPVD 575

Query: 291 NPKGPTT--GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
                TT  G  +V +GGS+    +YC   R A RS+ TP++   ++GFR A
Sbjct: 576 GSPWMTTREGDPRVLRGGSWYNKPRYC---RAANRSRYTPNNRVNDIGFRVA 624


>gi|153873484|ref|ZP_02002053.1| protein of unknown function DUF323 [Beggiatoa sp. PS]
 gi|152070051|gb|EDN67948.1| protein of unknown function DUF323 [Beggiatoa sp. PS]
          Length = 442

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 95/312 (30%), Positives = 142/312 (45%), Gaps = 60/312 (19%)

Query: 50  LDQHEVSNTQFQEF--VSATGYVTEAEKFGDTFVFEPLLSE----EERAKISQVRHDMKR 103
            +Q+ +  +Q  E   +  T ++ +++K     VF  LL++     E   I   +  M  
Sbjct: 166 FNQNTLPESQLSESHPLDITTFLPKSQKTIGK-VFSDLLTDGTEGPEMVMIPAGKFTMGD 224

Query: 104 FEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP-TEAEWEYGCRGGLENRLFP---- 158
            +G     E  +H           V+  ++   R P T AE++Y C+    NR  P    
Sbjct: 225 IQGTGEASEKPVHE----------VSIKSFAIGRYPVTFAEYDYFCKA--TNREKPDNRG 272

Query: 159 WGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGG 212
           WG              R   PV+++SW DAVAY  W         RLP+EA+WEY  R G
Sbjct: 273 WG--------------RYKRPVMNISWKDAVAYTDWLSKQTGNTYRLPSEAQWEYAARAG 318

Query: 213 LENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNV 272
            E   + WGN++   G ++AN     + + +  +D  + T+PV S+K N FGLY+ VGNV
Sbjct: 319 TETDYW-WGNDI---GINQANC----YLSGSQWSD--IQTSPVGSFKPNPFGLYDTVGNV 368

Query: 273 WEWTADWWNVHHHPAPSYNP---KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
           WEW  D W+ ++  AP+          T   KV +GGS+  N       R A R ++ PD
Sbjct: 369 WEWCVDSWHENYQNAPTDGSVWNNNKKTNEYKVLRGGSWSNNP---INARAANRYRHKPD 425

Query: 330 SSAGNLGFRCAA 341
                LGFR  A
Sbjct: 426 IRLNFLGFRVVA 437


>gi|451979721|ref|ZP_21928134.1| exported hypothetical protein [Nitrospina gracilis 3/211]
 gi|451763090|emb|CCQ89331.1| exported hypothetical protein [Nitrospina gracilis 3/211]
          Length = 835

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/168 (38%), Positives = 89/168 (52%), Gaps = 15/168 (8%)

Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
           N+P  +VSW  A AYC W+  RLPTEAEWE   RG   N LFPWGN+     + R N+  
Sbjct: 642 NYPANNVSWFGAAAYCKWKDQRLPTEAEWEKAARGPNGN-LFPWGNDPISPQKSRYNLQW 700

Query: 237 GEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHHPAPSY--NP 292
            E   +           PV S  E K  +G+++M+GNV EW  DW++  ++   ++  NP
Sbjct: 701 NENIRHRV-------MVPVDSMPEGKSFYGVFHMLGNVKEWVDDWYDREYYKEENHKTNP 753

Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
           KG   G  KV KGGS+     + Y    + R+ + P +   + GFRCA
Sbjct: 754 KGQIGGEYKVLKGGSWRDLRSFVY---ASFRNNSYPAAMVDDYGFRCA 798



 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 53/147 (36%), Positives = 74/147 (50%), Gaps = 23/147 (15%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   F MG +      D E P   V LD +Y+D+HEVS  +F EF++A   + E +
Sbjct: 563 MVKIPQGFFIMGDDHH---GDDEAPKHKVYLDTYYIDKHEVSAAEFAEFLNA---MPETK 616

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
           +F   F      + E+R      R  ++             ++P  ++SW  A AYC W+
Sbjct: 617 RF---FGINKFGTIEQRGGNFIPRRGLE-------------NYPANNVSWFGAAAYCKWK 660

Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGS 161
             RLPTEAEWE   RG   N LFPWG+
Sbjct: 661 DQRLPTEAEWEKAARGPNGN-LFPWGN 686


>gi|153871211|ref|ZP_02000440.1| Protein of unknown function DUF323 [Beggiatoa sp. PS]
 gi|152072325|gb|EDN69562.1| Protein of unknown function DUF323 [Beggiatoa sp. PS]
          Length = 323

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 73/210 (34%), Positives = 98/210 (46%), Gaps = 27/210 (12%)

Query: 140 TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW----- 194
           TEAE E GC     N    W    H    +       +HPVV +SWNDA+ Y  W     
Sbjct: 130 TEAEQENGCYIWEGN---DWKKHKHANWRNPYFSQNDDHPVVCISWNDAMVYVNWLSQQT 186

Query: 195 -RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
            +  RLP+EAEWEY  R G E + + WGN+           W       +   DG+  TA
Sbjct: 187 GKNYRLPSEAEWEYAARAGTETQYW-WGNH-----------WDKTKANCSVKEDGFKYTA 234

Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK---GPTTGTDKVKKGGSYLC 310
           PV ++K N FGL +M+GNVWEW AD W+ ++  AP+ +     GP     +V +GG++  
Sbjct: 235 PVNAFKSNAFGLDDMLGNVWEWMADPWHENYQNAPTDSRVWNFGPKKNKYRVLRGGAWPY 294

Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
              +C   R A R            GFR A
Sbjct: 295 APNFC---RTANRDWGNQTYRFYTFGFRVA 321



 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 81/153 (52%), Gaps = 19/153 (12%)

Query: 15  MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
           MV +P   FRMG  + I   D E P   V++  F + ++ V+  +F++FV ATGY TEAE
Sbjct: 75  MVWIPAGRFRMGDIQGIG-SDNEKPVHFVSVKRFAMGKNGVTVGEFRQFVKATGYQTEAE 133

Query: 75  KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW- 133
           +    +++E   ++ ++ K +  R         +        HPVV ISWNDA+ Y  W 
Sbjct: 134 QENGCYIWEG--NDWKKHKHANWR---------NPYFSQNDDHPVVCISWNDAMVYVNWL 182

Query: 134 -----RGARLPTEAEWEYGCRGGLENRLFPWGS 161
                +  RLP+EAEWEY  R G E + + WG+
Sbjct: 183 SQQTGKNYRLPSEAEWEYAARAGTETQYW-WGN 214


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.135    0.450 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,258,641,178
Number of Sequences: 23463169
Number of extensions: 332813634
Number of successful extensions: 635517
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3620
Number of HSP's successfully gapped in prelim test: 1640
Number of HSP's that attempted gapping in prelim test: 610393
Number of HSP's gapped (non-prelim): 15146
length of query: 394
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 250
effective length of database: 8,980,499,031
effective search space: 2245124757750
effective search space used: 2245124757750
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)