BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy8678
(394 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449474288|ref|XP_004174968.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 1
[Taeniopygia guttata]
Length = 284
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 196/331 (59%), Positives = 229/331 (69%), Gaps = 50/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+PG F MGT +P + +DGE P+R V L +F++DQ+EVSN FQ FV+ATGYVTEAE
Sbjct: 1 MVLIPGGVFTMGTQEPEIQQDGEGPARRVHLSSFHMDQYEVSNQDFQSFVNATGYVTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE +A I Q AVA W
Sbjct: 61 KFGDSFVFEGMLSEAVKADIHQ------------------------------AVAAAPWW 90
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + SW HPEG DS+I RM+HPV+HVSWNDAVA+CTW
Sbjct: 91 ---LPVKG-----------------ASWRHPEGPDSSISSRMDHPVLHVSWNDAVAFCTW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGLENRLFPWGN L P+G+H AN+WQGEFPTNNTA DGY TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLENRLFPWGNKLQPKGQHYANIWQGEFPTNNTAEDGYKGTAP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYNMVGN WEWT+DWW VHH P +NPKGP++GTD+VKKGGSY+C++ Y
Sbjct: 191 VSAFPPNAYGLYNMVGNAWEWTSDWWAVHHSPQELHNPKGPSSGTDRVKKGGSYMCHKSY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
CYR+RCAARSQNTPDSSA NLGFRCAAD P
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADSAP 281
>gi|355722650|gb|AES07642.1| sulfatase modifying factor 1 [Mustela putorius furo]
Length = 374
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 187/334 (55%), Positives = 229/334 (68%), Gaps = 50/334 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91 MVLIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 151 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 180
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DST+ HR++HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVLHRLDHPVLHVSWNDAVAYCTW 220
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 280
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEKTINPKGPPSGKDRVKKGGSYMCHKSY 340
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
CYR+RCAARSQNTPDSSA NLGFRCAAD+ PTTG
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADRQPTTG 374
>gi|73985046|ref|XP_541796.2| PREDICTED: sulfatase-modifying factor 1 [Canis lupus familiaris]
Length = 374
Score = 388 bits (996), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/334 (56%), Positives = 229/334 (68%), Gaps = 50/334 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MGT+ P + +DGE P+R V++DAFY+D +EVSN F++FV++TGY+TEAE
Sbjct: 91 MVLIPAGVFTMGTDDPQIKQDGEAPARRVSIDAFYMDAYEVSNADFEKFVNSTGYLTEAE 150
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 151 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 180
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DST HR +HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTTLHRPDHPVLHVSWNDAVAYCTW 220
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 280
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGNVWEWT+DWW VHH ++NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEKTHNPKGPPSGKDRVKKGGSYMCHKSY 340
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
CYR+RCAARSQNTPDSSA NLGFRCAAD+ PTTG
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADRQPTTG 374
>gi|449270327|gb|EMC81015.1| Sulfatase-modifying factor 1 [Columba livia]
Length = 284
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 190/331 (57%), Positives = 231/331 (69%), Gaps = 50/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV++PG F MGT++P + +DGE+P R V +++FY+D++EVSN F+ FV++TGYVTEAE
Sbjct: 1 MVVIPGGVFTMGTDEPAIQQDGEWPVRKVHVNSFYMDRYEVSNEDFERFVNSTGYVTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSEE +A+I Q AVA W
Sbjct: 61 KFGDSFVFEGMLSEEVKAEIHQ------------------------------AVAAAPWW 90
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS I +RM+HPV+HVSWNDAVA+CTW
Sbjct: 91 ---LPVKG-----------------ANWKHPEGPDSNISNRMDHPVLHVSWNDAVAFCTW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGLENRLFPWGN L P+G+H AN+WQG FPTNNTA DGY TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLENRLFPWGNKLQPKGQHYANIWQGVFPTNNTAEDGYKGTAP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWTADWW VHH +NPKGP++GTD+VKKGGSY+C++ Y
Sbjct: 191 VTAFPPNGYGLYNIVGNAWEWTADWWAVHHSTEEVHNPKGPSSGTDRVKKGGSYMCHKSY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
CYR+RCAARSQNTPDSSA NLGFRCAAD P
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADASP 281
>gi|115497162|ref|NP_001069544.1| sulfatase-modifying factor 1 precursor [Bos taurus]
gi|122144266|sp|Q0P5L5.1|SUMF1_BOVIN RecName: Full=Sulfatase-modifying factor 1; AltName:
Full=C-alpha-formylglycine-generating enzyme 1; Flags:
Precursor
gi|112361989|gb|AAI19886.1| Sulfatase modifying factor 1 [Bos taurus]
gi|296474978|tpg|DAA17093.1| TPA: sulfatase modifying factor 1 precursor [Bos taurus]
Length = 374
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 189/344 (54%), Positives = 232/344 (67%), Gaps = 54/344 (15%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P+PP + MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV
Sbjct: 85 PSPPTK----MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFV 140
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
++TGY+TEAEKFGD+FVFE +LSE+ ++ I Q
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKSDIQQ---------------------------- 172
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
AVA W LP + +W HPEG DST+ HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVS 210
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
WNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKK
Sbjct: 271 GEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSAEETINPKGPPSGKDRVKK 330
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
GGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD PTTG
Sbjct: 331 GGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAADHLPTTG 374
>gi|344276480|ref|XP_003410036.1| PREDICTED: sulfatase-modifying factor 1 [Loxodonta africana]
Length = 374
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/334 (55%), Positives = 228/334 (68%), Gaps = 50/334 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91 MVRIPAGVFTMGTDDPQIKQDGEAPARRVMIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSEE + I Q AVA W
Sbjct: 151 KFGDSFVFEGMLSEEVKTDIQQ------------------------------AVAAAPWW 180
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCTW 220
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGLENRLFPWGN L P+G+H AN+WQGEFP +NT DG+ TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLENRLFPWGNKLQPKGQHYANIWQGEFPVSNTGEDGFRGTAP 280
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGNVWEWT+DWW +HH + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNVWEWTSDWWTIHHSAEETLNPKGPPSGKDRVKKGGSYMCHKSY 340
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
CYR+RCAARSQNTPDSSA NLGFRCAAD+ PT G
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADRLPTAG 374
>gi|426339251|ref|XP_004033571.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 1
[Gorilla gorilla gorilla]
Length = 383
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 192/346 (55%), Positives = 232/346 (67%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 86 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 145
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 146 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 181
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR++HPV
Sbjct: 182 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRLDHPV 215
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 216 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 275
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 276 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWAVHHSAEETLNPKGPPSGKD 335
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 336 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 381
>gi|363738801|ref|XP_003642070.1| PREDICTED: sulfatase-modifying factor 1-like [Gallus gallus]
Length = 353
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/331 (56%), Positives = 231/331 (69%), Gaps = 50/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MGT++P + +DGE+P+R V +++FY+DQ+EVSN +F+ FV++TGY+TEAE
Sbjct: 70 MVAIPGGVFTMGTDEPEIQQDGEWPARRVHVNSFYMDQYEVSNQEFERFVNSTGYLTEAE 129
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSEE +A+I Q AVA W
Sbjct: 130 KFGDSFVFEGMLSEEVKAEIHQ------------------------------AVAAAPWW 159
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W PEG S+I RM+HPV+HVSWNDAVA+CTW
Sbjct: 160 ---LPVKG-----------------ANWRQPEGPGSSILSRMDHPVLHVSWNDAVAFCTW 199
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEYGCRGGLE RLFPWGN L P+G+H AN+WQG FPTNNTA DGY TAP
Sbjct: 200 AGKRLPTEAEWEYGCRGGLEKRLFPWGNKLQPKGQHYANIWQGVFPTNNTAEDGYKGTAP 259
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW VHH ++NPKGP++GTD+VKKGGSY+C++ Y
Sbjct: 260 VTAFPPNGYGLYNIVGNAWEWTSDWWAVHHSADEAHNPKGPSSGTDRVKKGGSYMCHKSY 319
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
CYR+RCAARSQNTPDSSA NLGFRCAAD P
Sbjct: 320 CYRYRCAARSQNTPDSSASNLGFRCAADALP 350
>gi|297670799|ref|XP_002813542.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Pongo abelii]
Length = 374
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR++HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIRHRLDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372
>gi|157821111|ref|NP_001102109.1| sulfatase-modifying factor 1 precursor [Rattus norvegicus]
gi|149036859|gb|EDL91477.1| sulfatase modifying factor 1 (predicted) [Rattus norvegicus]
Length = 372
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 190/345 (55%), Positives = 226/345 (65%), Gaps = 50/345 (14%)
Query: 3 LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
L P MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN F++
Sbjct: 77 LTSGPRSLALTKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTVDAFYMDAYEVSNADFEK 136
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV++TGY+TEAEKFGD+FVFE +LSE +A+I Q
Sbjct: 137 FVNSTGYLTEAEKFGDSFVFEGMLSEPVKAQIHQ-------------------------- 170
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
AVA W LP + W HPEG DSTI HR NHPV+H
Sbjct: 171 ----AVAAAPWW---LPVKG-----------------ADWRHPEGPDSTILHRSNHPVLH 206
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
VSWNDAVAYC W G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 VSWNDAVAYCAWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVT 266
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKV 302
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGPT+G D+V
Sbjct: 267 NTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSAEETLNPKGPTSGKDRV 326
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
KKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 327 KKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 371
>gi|354473114|ref|XP_003498781.1| PREDICTED: sulfatase-modifying factor 1-like [Cricetulus griseus]
Length = 353
Score = 383 bits (983), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/333 (56%), Positives = 226/333 (67%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN F++FV++TGY+TEAE
Sbjct: 70 MVPIPAGVFTMGTDDPQIKQDGEAPARRVTVDAFYMDAYEVSNADFEKFVNSTGYLTEAE 129
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ +A I Q AVA W
Sbjct: 130 KFGDSFVFEGMLSEQVKAHIHQ------------------------------AVAAAPWW 159
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS+I HR+NHPV+HVSWNDAVAYC W
Sbjct: 160 ---LPVKG-----------------ANWRHPEGPDSSILHRLNHPVLHVSWNDAVAYCAW 199
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 200 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAP 259
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW VHH + NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 260 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPTSGKDRVKKGGSYMCHKSY 319
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 320 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 352
>gi|38202250|ref|NP_877437.2| sulfatase-modifying factor 1 isoform 1 precursor [Homo sapiens]
gi|62298562|sp|Q8NBK3.3|SUMF1_HUMAN RecName: Full=Sulfatase-modifying factor 1; AltName:
Full=C-alpha-formylglycine-generating enzyme 1; Flags:
Precursor
gi|30840149|gb|AAO34683.1| C-alpha-formyglycine-generating enzyme [Homo sapiens]
gi|32527651|gb|AAP86217.1| sulfatase modifying factor 1 [Homo sapiens]
gi|84569991|gb|AAI10863.1| Sulfatase modifying factor 1 [Homo sapiens]
gi|111309238|gb|AAI21123.1| Sulfatase modifying factor 1 [Homo sapiens]
gi|119584311|gb|EAW63907.1| sulfatase modifying factor 1, isoform CRA_b [Homo sapiens]
gi|220983392|dbj|BAH11168.1| sulfatase modifying factor 1 [Homo sapiens]
Length = 374
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 192/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372
>gi|426249246|ref|XP_004018361.1| PREDICTED: sulfatase-modifying factor 1 [Ovis aries]
Length = 347
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/344 (54%), Positives = 231/344 (67%), Gaps = 54/344 (15%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P+PP + MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV
Sbjct: 58 PSPPTK----MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFV 113
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
++TGY+TEAEKFGD+FVFE +LSE+ ++ I Q
Sbjct: 114 NSTGYLTEAEKFGDSFVFEGMLSEQVKSDIQQ---------------------------- 145
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
AVA W LP + +W HPEG DST+ HR +HPV+HVS
Sbjct: 146 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVS 183
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
WNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT
Sbjct: 184 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 243
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKK
Sbjct: 244 GEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETINPKGPPSGKDRVKK 303
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
GGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD PT G
Sbjct: 304 GGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAADHLPTMG 347
>gi|349732234|ref|NP_001231861.1| sulfatase-modifying factor precursor [Sus scrofa]
Length = 374
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/334 (55%), Positives = 227/334 (67%), Gaps = 50/334 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91 MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LS++ ++ I Q AVA W
Sbjct: 151 KFGDSFVFEGILSDQVKSDIQQ------------------------------AVAAAPWW 180
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVVHRPDHPVLHVSWNDAVAYCTW 220
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 280
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW +HH + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNAWEWTSDWWTIHHAAEETINPKGPPSGKDRVKKGGSYMCHKSY 340
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
CYR+RCAARSQNTPDSSA NLGFRCAAD PTTG
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADHQPTTG 374
>gi|74207135|dbj|BAE30762.1| unnamed protein product [Mus musculus]
Length = 381
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/333 (55%), Positives = 227/333 (68%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R VT+D FY+D +EVSN F++FV++TGY+TEAE
Sbjct: 98 MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 157
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 158 KFGDSFVFEGMLSEQVKTHIHQ------------------------------AVAAAPWW 187
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW
Sbjct: 188 ---LPVKG-----------------ANWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTW 227
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQG+FP +NT DG+ TAP
Sbjct: 228 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAP 287
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGNVWEWT+DWW VHH ++NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 288 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSY 347
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 348 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 380
>gi|74192741|dbj|BAE34887.1| unnamed protein product [Mus musculus]
Length = 372
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/333 (55%), Positives = 227/333 (68%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R VT+D FY+D +EVSN F++FV++TGY+TEAE
Sbjct: 89 MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 148
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 149 KFGDSFVFEGMLSEQVKTHIHQ------------------------------AVAAAPWW 178
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW
Sbjct: 179 ---LPVKG-----------------ANWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTW 218
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQG+FP +NT DG+ TAP
Sbjct: 219 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAP 278
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGNVWEWT+DWW VHH ++NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 279 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSY 338
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 339 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 371
>gi|144094256|ref|NP_666049.2| sulfatase-modifying factor 1 precursor [Mus musculus]
gi|33301732|sp|Q8R0F3.2|SUMF1_MOUSE RecName: Full=Sulfatase-modifying factor 1; AltName:
Full=C-alpha-formylglycine-generating enzyme 1; Flags:
Precursor
gi|74144496|dbj|BAE36091.1| unnamed protein product [Mus musculus]
gi|74148140|dbj|BAE36238.1| unnamed protein product [Mus musculus]
gi|148666991|gb|EDK99407.1| sulfatase modifying factor 1 [Mus musculus]
Length = 372
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/333 (55%), Positives = 227/333 (68%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R VT+D FY+D +EVSN F++FV++TGY+TEAE
Sbjct: 89 MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 148
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 149 KFGDSFVFEGMLSEQVKTHIHQ------------------------------AVAAAPWW 178
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW
Sbjct: 179 ---LPVKG-----------------ANWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTW 218
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQG+FP +NT DG+ TAP
Sbjct: 219 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAP 278
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGNVWEWT+DWW VHH ++NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 279 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSY 338
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 339 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 371
>gi|344235930|gb|EGV92033.1| Sulfatase-modifying factor 1 [Cricetulus griseus]
Length = 284
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 187/333 (56%), Positives = 226/333 (67%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN F++FV++TGY+TEAE
Sbjct: 1 MVPIPAGVFTMGTDDPQIKQDGEAPARRVTVDAFYMDAYEVSNADFEKFVNSTGYLTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ +A I Q AVA W
Sbjct: 61 KFGDSFVFEGMLSEQVKAHIHQ------------------------------AVAAAPWW 90
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS+I HR+NHPV+HVSWNDAVAYC W
Sbjct: 91 ---LPVKG-----------------ANWRHPEGPDSSILHRLNHPVLHVSWNDAVAYCAW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW VHH + NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 191 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPTSGKDRVKKGGSYMCHKSY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADHLPTA 283
>gi|380811964|gb|AFE77857.1| sulfatase-modifying factor 1 isoform 1 precursor [Macaca mulatta]
Length = 374
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/342 (55%), Positives = 228/342 (66%), Gaps = 53/342 (15%)
Query: 8 PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81 PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ---------------------------- 172
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
AVA W LP + +W HPEG DSTI HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIRHRPDHPVLHVS 210
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
WNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKK
Sbjct: 271 GEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKK 330
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
GGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 331 GGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372
>gi|149728333|ref|XP_001496626.1| PREDICTED: sulfatase-modifying factor 1 [Equus caballus]
Length = 374
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/334 (55%), Positives = 227/334 (67%), Gaps = 50/334 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91 MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ ++ I Q AVA W
Sbjct: 151 KFGDSFVFEGMLSEQVKSDIQQ------------------------------AVAAAPWW 180
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DST+ HR +HPV+HVSWNDAVAYC W
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCAW 220
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 280
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCHKSY 340
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
CYR+RCAARSQNTPDSSA NLGFRCAAD+ PT+G
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAADRLPTSG 374
>gi|67463712|pdb|1Y1E|X Chain X, Human Formylglycine Generating Enzyme
gi|67463715|pdb|1Y1H|X Chain X, Human Formylglycine Generating Enzyme, Oxidised Cys
Refined As Hydroperoxide
gi|67463716|pdb|1Y1I|X Chain X, Hyuman Formylglycine Generating Enzyme, Reduced Form
Length = 311
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 192/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 5 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 64
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 65 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 100
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 101 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 134
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 135 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 194
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 195 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 254
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 255 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 300
>gi|383417677|gb|AFH32052.1| sulfatase-modifying factor 1 isoform 1 precursor [Macaca mulatta]
Length = 374
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/342 (54%), Positives = 228/342 (66%), Gaps = 53/342 (15%)
Query: 8 PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81 PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ---------------------------- 172
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
AVA W LP + +W HPEG DST+ HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVRHRPDHPVLHVS 210
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
WNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKK
Sbjct: 271 GEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKK 330
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
GGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 331 GGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372
>gi|16553976|dbj|BAB71625.1| unnamed protein product [Homo sapiens]
Length = 284
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 187/332 (56%), Positives = 225/332 (67%), Gaps = 50/332 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F++FV++TGY+TEAE
Sbjct: 1 MVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 61 KFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAAPWW 90
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DSTI HR +HPV+HVSWNDAVAYCTW
Sbjct: 91 ---LPVKG-----------------ANWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGSY+C+ Y
Sbjct: 191 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
CYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADRLPT 282
>gi|20073363|gb|AAH26981.1| Sulfatase modifying factor 1 [Mus musculus]
Length = 284
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/332 (56%), Positives = 227/332 (68%), Gaps = 50/332 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R VT+D FY+D +EVSN F++FV++TGY+TEAE
Sbjct: 1 MVPIPAGVFTMGTDDPQIRQDGEAPARRVTVDGFYMDAYEVSNADFEKFVNSTGYLTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 61 KFGDSFVFEGMLSEQVKTHIHQ------------------------------AVAAAPWW 90
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS+I HR NHPV+HVSWNDAVAYCTW
Sbjct: 91 ---LPVKG-----------------ANWRHPEGPDSSILHRSNHPVLHVSWNDAVAYCTW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQG+FP +NT DG+ TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGKFPVSNTGEDGFQGTAP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGNVWEWT+DWW VHH ++NPKGPT+G D+VKKGGSY+C++ Y
Sbjct: 191 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEETFNPKGPTSGKDRVKKGGSYMCHKSY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
CYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADHLPT 282
>gi|397522432|ref|XP_003831271.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Pan paniscus]
Length = 374
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 191/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAA++ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAANRLPT 372
>gi|332815962|ref|XP_526122.3| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Pan troglodytes]
gi|410218334|gb|JAA06386.1| sulfatase modifying factor 1 [Pan troglodytes]
gi|410259882|gb|JAA17907.1| sulfatase modifying factor 1 [Pan troglodytes]
gi|410308068|gb|JAA32634.1| sulfatase modifying factor 1 [Pan troglodytes]
gi|410336755|gb|JAA37324.1| sulfatase modifying factor 1 [Pan troglodytes]
Length = 374
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 191/346 (55%), Positives = 231/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAA++ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAANRLPT 372
>gi|403270412|ref|XP_003927176.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Saimiri
boliviensis boliviensis]
Length = 376
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/346 (54%), Positives = 230/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F
Sbjct: 79 PGPVPEERPLAHSKMVFIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDTYEVSNAEF 138
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 139 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 174
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DST+ HR +HPV
Sbjct: 175 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPV 208
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 209 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLNNRLFPWGNKLQPKGQHYANIWQGEFP 268
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW V+H + NPKGP +G D
Sbjct: 269 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVYHSVEETLNPKGPPSGKD 328
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 329 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRPPT 374
>gi|62858375|ref|NP_001016416.1| sulfatase modifying factor 1 precursor [Xenopus (Silurana)
tropicalis]
gi|89273863|emb|CAJ82122.1| sulfatase modifying factor 1 [Xenopus (Silurana) tropicalis]
Length = 367
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/329 (56%), Positives = 223/329 (67%), Gaps = 50/329 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ MVL+P F MGT++P + +DGE P+R V LD+FY+D +EVSN +F++FV ATG+VTE
Sbjct: 86 EQMVLIPSGVFTMGTDEPGIPQDGESPARKVRLDSFYMDTYEVSNLEFEKFVKATGHVTE 145
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEKFGD+FVFE L+SE+ ++ I Q AVA
Sbjct: 146 AEKFGDSFVFEGLISEDVKSTIDQ------------------------------AVAAAP 175
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W LP + W HPEG DS I +RM+HPV+HVSWNDA A+C
Sbjct: 176 WW---LPVKG-----------------ADWRHPEGPDSNILNRMDHPVLHVSWNDASAFC 215
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
TW G RLPTEAEWEY RGGLENRLFPWGN L P+GEH ANVWQGEFPT NT DGY+ T
Sbjct: 216 TWAGKRLPTEAEWEYASRGGLENRLFPWGNKLEPKGEHYANVWQGEFPTQNTGEDGYVQT 275
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
+PV ++ N + LYNMVGN WEWT+DWW+ HH +YNPKGP+TG DKVKKGGSY+C+E
Sbjct: 276 SPVTAFPANGYSLYNMVGNAWEWTSDWWDTHHTADETYNPKGPSTGKDKVKKGGSYMCHE 335
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
YCYR+RCAARSQNTPDSSA NLGFRC A
Sbjct: 336 SYCYRYRCAARSQNTPDSSASNLGFRCVA 364
>gi|296225740|ref|XP_002807643.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 1
[Callithrix jacchus]
Length = 586
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/346 (54%), Positives = 229/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MVL+P F MGT+ P + +DGE P+R VT+DAFY+D +EV+N +F
Sbjct: 289 PGPVPGERPLAHSKMVLIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVNNAEF 348
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 349 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 384
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG STI HR +HPV
Sbjct: 385 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPGSTIRHRPDHPV 418
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYC W G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 419 LHVSWNDAVAYCMWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 478
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 479 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 538
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 539 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 584
>gi|326928166|ref|XP_003210252.1| PREDICTED: sulfatase-modifying factor 1-like [Meleagris gallopavo]
Length = 291
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/333 (55%), Positives = 231/333 (69%), Gaps = 50/333 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K MV +PG F MGT++P + +DGE+P+R V +++FY+DQ+EVSN +F+ FV++TGY+TE
Sbjct: 6 KLMVAIPGGVFTMGTDEPEIQQDGEWPARRVHVNSFYMDQYEVSNQEFERFVNSTGYLTE 65
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEKFGD+FVFE +LSEE +A+I Q AVA
Sbjct: 66 AEKFGDSFVFEGMLSEEVKAEIHQ------------------------------AVAAAP 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W LP + +W PEG S+I RM+HPV+HVSWNDAVA+C
Sbjct: 96 WW---LPVKG-----------------ANWRQPEGPGSSILSRMDHPVLHVSWNDAVAFC 135
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
TW G RLPTEAEWEY CRGGLE RLFPWGN L P+G+H AN+WQG FP+NNTA DGY T
Sbjct: 136 TWAGKRLPTEAEWEYSCRGGLEKRLFPWGNKLQPKGQHYANIWQGVFPSNNTAEDGYKGT 195
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
APV ++ N +GLYN+VGN WEWT+DWW VHH ++NPKGP++GTD+VKKGGSY+C++
Sbjct: 196 APVTAFPPNGYGLYNIVGNAWEWTSDWWAVHHSADEAHNPKGPSSGTDRVKKGGSYMCHK 255
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
YCYR+RCAARSQNTPDSSA NLGFRCAAD P
Sbjct: 256 SYCYRYRCAARSQNTPDSSASNLGFRCAADALP 288
>gi|395824557|ref|XP_003785529.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Otolemur
garnettii]
Length = 374
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/346 (54%), Positives = 229/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVERY---KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P PVER MVL+P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F
Sbjct: 77 PGPIPVERPLAPTKMVLIPAGVFTMGTDDPQIKQDGEAPARRVAVDAFYMDAYEVSNAEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DS + HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSNVLHRQDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 267 VTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSIEETVNPKGPPSGKD 326
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSS+ NLGFRCAAD PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSSSNLGFRCAADHLPT 372
>gi|402859535|ref|XP_003894210.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Papio anubis]
Length = 374
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 189/342 (55%), Positives = 228/342 (66%), Gaps = 53/342 (15%)
Query: 8 PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81 PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ---------------------------- 172
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
AVA W LP + +W HPEG DSTI HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIWHRPDHPVLHVS 210
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
WNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKK
Sbjct: 271 GEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKK 330
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
GGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 331 GGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372
>gi|410951604|ref|XP_003982484.1| PREDICTED: sulfatase-modifying factor 1 [Felis catus]
Length = 376
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/334 (55%), Positives = 226/334 (67%), Gaps = 50/334 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 93 MVLIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 152
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 153 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 182
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DST+ HR +HPV+HVSWNDAVAYCTW
Sbjct: 183 ---LPVKG-----------------ANWRHPEGPDSTVRHRPDHPVLHVSWNDAVAYCTW 222
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 223 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 282
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW VHH + NP+ +G D+VKKGGSY+C++ Y
Sbjct: 283 VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEKTLNPEPAPSGKDRVKKGGSYMCHKSY 342
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
CYR+RCAARSQNTPDSSA NLGFRCAAD+ PTTG
Sbjct: 343 CYRYRCAARSQNTPDSSASNLGFRCAADRQPTTG 376
>gi|22761561|dbj|BAC11634.1| unnamed protein product [Homo sapiens]
Length = 374
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/346 (54%), Positives = 230/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DA Y+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDALYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQG+FP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGDFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 327 RVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 372
>gi|85544097|pdb|2AII|X Chain X, Wild-Type Formylglycine Generating Enzyme Reacted With
Iodoacetamide
Length = 286
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/334 (55%), Positives = 225/334 (67%), Gaps = 50/334 (14%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F++FV++TGY+T
Sbjct: 3 HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLT 62
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAEKFGD+FVFE +LSE+ + I Q AVA
Sbjct: 63 EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 92
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W LP + +W HPEG DSTI HR +HPV+HVSWNDAVAY
Sbjct: 93 PWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPVLHVSWNDAVAY 132
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT DG+
Sbjct: 133 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 192
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGSY+C+
Sbjct: 193 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCH 252
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
YCYR+RCAARSQNTPDSSA NLGFRCAAD+ P
Sbjct: 253 RSYCYRYRCAARSQNTPDSSASNLGFRCAADRLP 286
>gi|291408728|ref|XP_002720667.1| PREDICTED: CG7049-like [Oryctolagus cuniculus]
Length = 380
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/333 (55%), Positives = 222/333 (66%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN F++FV++TGY+TEAE
Sbjct: 97 MVLIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNADFEKFVNSTGYLTEAE 156
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+FVFE +LSE + I Q AVA W
Sbjct: 157 TFGDSFVFEGMLSERVKTDIQQ------------------------------AVAAAPWW 186
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + W HPEG DSTI HR +HPV+HVSWNDAVAYCTW
Sbjct: 187 ---LPVKG-----------------ADWRHPEGPDSTIRHRPHHPVLHVSWNDAVAYCTW 226
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L PRG+H AN+WQG+FP NT DG+ TAP
Sbjct: 227 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPRGQHYANLWQGDFPVTNTGEDGFRGTAP 286
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYNMVGN WEWT+DWW HH + NPKGP G D+VKKGGSY+C++ Y
Sbjct: 287 VDAFPPNGYGLYNMVGNAWEWTSDWWTRHHSVEETLNPKGPPLGKDRVKKGGSYMCHKSY 346
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAAD+ P+T
Sbjct: 347 CYRYRCAARSQNTPDSSASNLGFRCAADRLPST 379
>gi|67463713|pdb|1Y1F|X Chain X, Human Formylglycine Generating Enzyme With Cysteine
Sulfenic Acid
gi|67463717|pdb|1Y1J|X Chain X, Human Formylglycine Generating Enzyme, Sulfonic
AcidDESULFURATED FORM
Length = 311
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 191/346 (55%), Positives = 230/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 5 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 64
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 65 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 100
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 101 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 134
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 135 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 194
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 195 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 254
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+ + YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 255 RVKKGGSYMXHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 300
>gi|85544095|pdb|2AFT|X Chain X, Formylglycine Generating Enzyme C336s Mutant
gi|85544098|pdb|2AIJ|X Chain X, Formylglycine Generating Enzyme C336s Mutant Covalently
Bound To Substrate Peptide Ctpsr
gi|85544100|pdb|2AIK|X Chain X, Formylglycine Generating Enzyme C336s Mutant Covalently
Bound To Substrate Peptide Lctpsra
gi|146386507|pdb|2HI8|X Chain X, Human Formylglycine Generating Enzyme, C336s Mutant,
Bromide Co- Crystallization
gi|146386509|pdb|2HIB|X Chain X, Human Formylglycine Generating Enzyme, C336s Mutant,
Iodide Co- Crystallization
Length = 286
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 185/334 (55%), Positives = 224/334 (67%), Gaps = 50/334 (14%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F++FV++TGY+T
Sbjct: 3 HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLT 62
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAEKFGD+FVFE +LSE+ + I Q AVA
Sbjct: 63 EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 92
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W LP + +W HPEG DSTI HR +HPV+HVSWNDAVAY
Sbjct: 93 PWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPVLHVSWNDAVAY 132
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT DG+
Sbjct: 133 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 192
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGSY+ +
Sbjct: 193 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMSH 252
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
YCYR+RCAARSQNTPDSSA NLGFRCAAD+ P
Sbjct: 253 RSYCYRYRCAARSQNTPDSSASNLGFRCAADRLP 286
>gi|351703202|gb|EHB06121.1| Sulfatase-modifying factor 1 [Heterocephalus glaber]
Length = 374
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 182/333 (54%), Positives = 226/333 (67%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 91 MVLIPAGVFTMGTDDPQIKQDGEAPARRVAVDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 150
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 151 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 180
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DST++HR +HPV+HVSWNDAVAYCTW
Sbjct: 181 ---LPVKG-----------------ANWRHPEGPDSTVQHRPDHPVLHVSWNDAVAYCTW 220
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+ +H AN+WQGEFP NT DG+ TAP
Sbjct: 221 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKDQHYANLWQGEFPMTNTGEDGFQGTAP 280
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGNVWEWT+DWW ++H + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 281 VDAFPPNGYGLYNIVGNVWEWTSDWWTIYHSVEETLNPKGPPSGKDRVKKGGSYMCHKSY 340
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAA+ PT
Sbjct: 341 CYRYRCAARSQNTPDSSASNLGFRCAANHLPTA 373
>gi|126336367|ref|XP_001374411.1| PREDICTED: sulfatase-modifying factor 1-like [Monodelphis
domestica]
Length = 396
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 181/333 (54%), Positives = 227/333 (68%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +P F MGT++P + +DGE P+R V +++FY+D +EVSN +F+ FV++TGYVTEAE
Sbjct: 114 MAHIPAGVFTMGTDEPQIKQDGEGPARRVRINSFYMDLYEVSNAEFERFVNSTGYVTEAE 173
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVF+ +LS++ ++ I Q AVA W
Sbjct: 174 KFGDSFVFDSMLSDQVKSDIHQ------------------------------AVAAAPWW 203
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS+I HR +HPV+HVSWNDAVAYCTW
Sbjct: 204 ---LPVKG-----------------ANWRHPEGPDSSILHRRDHPVLHVSWNDAVAYCTW 243
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGLENRLFPWGN L P+G+H AN+WQGEFP +NT DGY TAP
Sbjct: 244 AGKRLPTEAEWEYSCRGGLENRLFPWGNKLQPKGQHYANIWQGEFPVSNTGEDGYQGTAP 303
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWT+DWW VHH + +PKGP +G+D+VKKGGSY+C++ Y
Sbjct: 304 VTAFPPNGYGLYNIVGNAWEWTSDWWTVHHSADETLDPKGPPSGSDRVKKGGSYMCHKSY 363
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAAD+ P T
Sbjct: 364 CYRYRCAARSQNTPDSSASNLGFRCAADRLPDT 396
>gi|116063450|gb|AAI23111.1| LOC432261 protein [Xenopus laevis]
Length = 357
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 181/329 (55%), Positives = 224/329 (68%), Gaps = 50/329 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ MVL+P F MGT++P + +DGE P+R V LD+FY+D +EVSN++F++FV ATG+VTE
Sbjct: 76 EQMVLIPSGVFTMGTDEPGIPQDGESPARKVHLDSFYMDMYEVSNSEFEKFVKATGHVTE 135
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE FGD+FVFE L++EE ++ I Q AVA
Sbjct: 136 AETFGDSFVFEGLITEEVKSTIHQ------------------------------AVAAAP 165
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W LP + W HPEG DS I +RM+HP++HVSWNDA A+C
Sbjct: 166 WW---LPVKG-----------------ADWRHPEGPDSNILNRMDHPILHVSWNDASAFC 205
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
TW G RLPTEAEWEY CRGGLENR+FPWGN L P+G+H ANVWQG+FPT NT DGY+ T
Sbjct: 206 TWAGKRLPTEAEWEYACRGGLENRIFPWGNKLEPKGQHYANVWQGDFPTQNTGEDGYVQT 265
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
+PV ++ N +GLYNMVGN WEWT+DWW+ HH +NPKGP+TG D+VKKGGSY+C+E
Sbjct: 266 SPVTAFPANGYGLYNMVGNAWEWTSDWWDTHHTADEVHNPKGPSTGNDRVKKGGSYMCHE 325
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
YCYR+RCAARSQNTPDSSA NLGFRC A
Sbjct: 326 SYCYRYRCAARSQNTPDSSASNLGFRCVA 354
>gi|432092475|gb|ELK25090.1| Sulfatase-modifying factor 1 [Myotis davidii]
Length = 284
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 187/333 (56%), Positives = 221/333 (66%), Gaps = 50/333 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN F+ FV+ATGY+TEAE
Sbjct: 1 MVSIPAGVFTMGTDDPQIRQDGEAPARRVTVDAFYMDAYEVSNADFERFVNATGYLTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE LLSE+ +A I Q AVA W
Sbjct: 61 KFGDSFVFEGLLSEQVKAGIQQ------------------------------AVAAAPWW 90
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + SW HPEG DST+ HR +HPV+HVSWNDAVAYC W
Sbjct: 91 ---LPVKG-----------------ASWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCAW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL +RLFPWGN L PRG+H AN+WQGEFP +T DG+ TAP
Sbjct: 131 AGKRLPTEAEWEYSCRGGLHDRLFPWGNKLQPRGQHYANLWQGEFPVTDTGEDGFRGTAP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GLYN+VGN WEWTADWW V H + NPKGP +G D+VKKGGSY+C++ Y
Sbjct: 191 VDAFPPNGYGLYNIVGNAWEWTADWWAVSHSAEEALNPKGPPSGKDRVKKGGSYMCHKSY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
CYR+RCAARSQNTPDSSA NLGFRCAAD P T
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAADHLPIT 283
>gi|67463714|pdb|1Y1G|X Chain X, Human Formylglycine Generating Enzyme, Double Sulfonic
Acid Form
gi|73535823|pdb|1Z70|X Chain X, 1.15a Resolution Structure Of The Formylglycine Generating
Enzyme Fge
Length = 311
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 190/346 (54%), Positives = 229/346 (66%), Gaps = 54/346 (15%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 5 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 64
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 65 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 100
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 101 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 134
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 135 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 194
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 195 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 254
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+VKKGGSY+ + Y YR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 255 RVKKGGSYMXHRSYXYRYRCAARSQNTPDSSASNLGFRCAADRLPT 300
>gi|47937785|gb|AAH72353.1| LOC432261 protein, partial [Xenopus laevis]
Length = 340
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 179/329 (54%), Positives = 223/329 (67%), Gaps = 50/329 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ M+L+P F MGT++P + +DGE P+R V LD+FY+D +EVSN++F++FV ATG+VTE
Sbjct: 59 EQMILIPSGVFTMGTDEPGIPQDGESPARKVHLDSFYMDMYEVSNSEFEKFVKATGHVTE 118
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE FGD+FVFE L++EE ++ I Q AVA
Sbjct: 119 AETFGDSFVFEGLITEEVKSTIHQ------------------------------AVAAAP 148
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W LP + W HPEG DS I +RM+HP++HVSWNDA A+C
Sbjct: 149 WW---LPVKG-----------------ADWRHPEGPDSNILNRMDHPILHVSWNDASAFC 188
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
TW G RLPTEAEWEY CRGGLENR+FPWGN L P+G+H ANVWQG+FPT NT DGY+
Sbjct: 189 TWAGKRLPTEAEWEYACRGGLENRIFPWGNKLEPKGQHYANVWQGDFPTQNTGEDGYVQI 248
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
+PV ++ N +GLYNMVGN WEWT+DWW+ HH +NPKGP+TG D+VKKGGSY+C+E
Sbjct: 249 SPVTAFPANGYGLYNMVGNAWEWTSDWWDTHHTADEVHNPKGPSTGNDRVKKGGSYMCHE 308
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
YCYR+RCAARSQNTPDSSA NLGFRC A
Sbjct: 309 SYCYRYRCAARSQNTPDSSASNLGFRCVA 337
>gi|85544096|pdb|2AFY|X Chain X, Formylglycine Generating Enzyme C341s Mutant
Length = 286
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/334 (55%), Positives = 223/334 (66%), Gaps = 50/334 (14%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F++FV++TGY+T
Sbjct: 3 HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEFEKFVNSTGYLT 62
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAEKFGD+FVFE +LSE+ + I Q AVA
Sbjct: 63 EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 92
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W LP + +W HPEG DSTI HR +HPV+HVSWNDAVAY
Sbjct: 93 PWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPVLHVSWNDAVAY 132
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT DG+
Sbjct: 133 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 192
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGSY+ +
Sbjct: 193 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMXH 252
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
Y YR+RCAARSQNTPDSSA NLGFRCAAD+ P
Sbjct: 253 RSYSYRYRCAARSQNTPDSSASNLGFRCAADRLP 286
>gi|225714898|gb|ACO13295.1| Sulfatase-modifying factor 1 precursor [Esox lucius]
Length = 378
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 187/332 (56%), Positives = 221/332 (66%), Gaps = 50/332 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV +PG F MGT+ P + DGE P R V L+ FY+D HEVSN QFQ F +ATGYVTEA
Sbjct: 97 EMVFVPGGEFLMGTDDPGIPPDGESPQRKVHLNHFYMDVHEVSNHQFQSFTNATGYVTEA 156
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
EKFGD+FVFE LLSEE +++ISQ AVA
Sbjct: 157 EKFGDSFVFEGLLSEEVKSQISQ------------------------------AVA---- 182
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
A W RG W HPEG DS+I R++HPV+HVSW DAVAYCT
Sbjct: 183 -------AAPWWLPVRGA---------DWRHPEGPDSSIADRLDHPVLHVSWGDAVAYCT 226
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWE+ CRGGL++RL+PWGN L P+G+H AN+WQGEFPT+NT DGY+ T+
Sbjct: 227 WAHKRLPTEAEWEFACRGGLQDRLYPWGNKLNPKGQHYANLWQGEFPTHNTGEDGYIKTS 286
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV S+ N +GLYNMVGN WEWTADWW VHH NP GP +GTDKVKKGGSY+C++
Sbjct: 287 PVRSFPPNGYGLYNMVGNAWEWTADWWTVHHTTDRQPNPAGPPSGTDKVKKGGSYMCHKS 346
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
YCYR+RCAARSQNTPDSSA NLGFRC + + P
Sbjct: 347 YCYRYRCAARSQNTPDSSAPNLGFRCVSPEQP 378
>gi|63101761|gb|AAH95053.1| Zgc:136465 protein [Danio rerio]
Length = 388
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/331 (54%), Positives = 219/331 (66%), Gaps = 50/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+VLL G F MGT+ P + +DGE P R V L AFY+++HEV+N QFQ F + TGY+TEAE
Sbjct: 108 LVLLQGGWFLMGTDDPGIPQDGEGPQRKVKLGAFYIEEHEVTNQQFQHFTNQTGYITEAE 167
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+FVFE LLSEE ++ +S H V W V WR
Sbjct: 168 RFGDSFVFEGLLSEEVKSTLS---------------------HAVAAAPWWSPVKGADWR 206
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
HPEG DSTI +RMNHP +HVSW+DA AYC W
Sbjct: 207 -----------------------------HPEGPDSTIHNRMNHPALHVSWDDARAYCQW 237
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RLPTEAEWE CRGGL++R++PWGN L PRG+H AN+WQG+FP +NTA DGY +T+P
Sbjct: 238 AKRRLPTEAEWELACRGGLQDRMYPWGNKLMPRGQHYANLWQGDFPNHNTAEDGYANTSP 297
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
VMS+ N FGLY+MVGN WEWTADWW VHH +NPKGP +GTD+VKKGGSY+C++ Y
Sbjct: 298 VMSFPANGFGLYDMVGNAWEWTADWWTVHHSAEDKFNPKGPESGTDRVKKGGSYMCHKSY 357
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
CYR+RCAARSQNTPDSSA NLGFRCA+D P
Sbjct: 358 CYRYRCAARSQNTPDSSASNLGFRCASDVDP 388
>gi|332231567|ref|XP_003264966.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Nomascus
leucogenys]
Length = 374
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 183/335 (54%), Positives = 221/335 (65%), Gaps = 50/335 (14%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+T
Sbjct: 88 HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLT 147
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAEKFGD+FVFE +LSE+ + I Q AVA
Sbjct: 148 EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 177
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W LP + +W HPE DSTI HR +HPV+HVSWNDA AY
Sbjct: 178 PWW---LPVKG-----------------ANWRHPEWPDSTIRHRPDHPVLHVSWNDAFAY 217
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT DG+
Sbjct: 218 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 277
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGSY+C+
Sbjct: 278 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCH 337
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 338 RSYCYRYRCAARSQNTPDSSASNLGFRCAADCLPT 372
>gi|213511532|ref|NP_001133607.1| sulfatase-modifying factor 1 precursor [Salmo salar]
gi|209154660|gb|ACI33562.1| Sulfatase-modifying factor 1 precursor [Salmo salar]
Length = 378
Score = 370 bits (949), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/331 (54%), Positives = 222/331 (67%), Gaps = 50/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+PG F MGT+ P + +DGE P R V LD FY+D HEVSN FQ F++ATGY+TEAE
Sbjct: 98 MVLIPGGEFLMGTDNPGIPQDGEGPQRRVHLDHFYMDAHEVSNRHFQSFINATGYITEAE 157
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+FVFE +LSEE +++ISQ AVA
Sbjct: 158 RFGDSFVFEGVLSEEVKSQISQ------------------------------AVA----- 182
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W RG W HPEG DS+I R++HPV+HVSW DAVAYC+W
Sbjct: 183 ------AAPWWLPVRGA---------DWRHPEGPDSSITGRLDHPVLHVSWQDAVAYCSW 227
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RLPTEAEWEY CRGGL++RL+PWGN L P+G+H AN+WQG+FPT+N+ DGY T+P
Sbjct: 228 AHKRLPTEAEWEYACRGGLQDRLYPWGNKLKPKGQHYANLWQGKFPTHNSEEDGYTKTSP 287
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V S+ N +GLYNMVGN WEWT+DWW VHH +NP GP +GTD+VKKGGSY+C++ Y
Sbjct: 288 VKSFPANGYGLYNMVGNAWEWTSDWWTVHHTTDERHNPAGPPSGTDRVKKGGSYMCHKSY 347
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
CYR+RCAARSQNTPDSSA NLGFRC + + P
Sbjct: 348 CYRYRCAARSQNTPDSSASNLGFRCVSQEQP 378
>gi|327266132|ref|XP_003217860.1| PREDICTED: sulfatase-modifying factor 1-like [Anolis carolinensis]
Length = 369
Score = 369 bits (947), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 184/350 (52%), Positives = 231/350 (66%), Gaps = 56/350 (16%)
Query: 2 VLLPAPPVERYKD------MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEV 55
V LP P KD M +P F MGT++P + +DGE+P+R + ++ F++D +EV
Sbjct: 66 VHLPQQPGNGPKDKDEGRQMASIPAGVFTMGTDEPAIQQDGEWPARKIHINHFFMDLYEV 125
Query: 56 SNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRM 115
SN +F++FV+ATGY+TEAE+FGD+FVFE +LSE+ +++I Q
Sbjct: 126 SNREFEKFVNATGYLTEAERFGDSFVFEGMLSEQVKSEIHQ------------------- 166
Query: 116 HHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHR 175
AVA W LP + W HP+G DS I R
Sbjct: 167 -----------AVAAAPWW---LPVKG-----------------ADWKHPDGPDSNIVDR 195
Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
M+HPV+HVSWNDAVA+CTW G RLPTEAEWEY CRGGLENRLFPWGN L P G+H AN+W
Sbjct: 196 MDHPVLHVSWNDAVAFCTWAGKRLPTEAEWEYSCRGGLENRLFPWGNKLHPNGQHYANIW 255
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP 295
QG+FPT+NT DGY TAPV ++ N +GLYN+VGN WEWT+DWW++ H + NPKGP
Sbjct: 256 QGDFPTSNTREDGYKGTAPVTAFPPNSYGLYNIVGNAWEWTSDWWSISHSADTAQNPKGP 315
Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
TGTD+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFRCAAD P
Sbjct: 316 LTGTDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFRCAADTLP 365
>gi|242014044|ref|XP_002427708.1| Sulfatase-modifying factor 1 precursor, putative [Pediculus humanus
corporis]
gi|212512143|gb|EEB14970.1| Sulfatase-modifying factor 1 precursor, putative [Pediculus humanus
corporis]
Length = 365
Score = 367 bits (941), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 180/341 (52%), Positives = 222/341 (65%), Gaps = 50/341 (14%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R MV L G F MGT+KPI + DGE P R+V + FYLD++EVSN +F +FVS TGY
Sbjct: 75 RTNQMVYLKGGKFEMGTDKPIFVADGEGPKRSVLIKPFYLDKYEVSNAEFSKFVSDTGYK 134
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TE E FGD+FVFE LLS+ + ++Q AVA
Sbjct: 135 TEVEGFGDSFVFENLLSDAVKDTVTQ------------------------------AVAR 164
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W LP + +W+HPEG DS+I +M+HPVVHVSWNDA A
Sbjct: 165 APWW---LPVKG-----------------ANWMHPEGPDSSISEKMDHPVVHVSWNDAKA 204
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W RLPTEAEWEY CR G ++RLFPWGNN P+GE+RAN+WQG FP+++ DGY+
Sbjct: 205 YCDWAQKRLPTEAEWEYACRAGKKDRLFPWGNNFQPKGEYRANIWQGAFPSHDEGLDGYV 264
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
T+PV ++ EN FGL NM+GNVWEWT+D+W++ H+ P NP GPT GTDKVKKGGS+LC
Sbjct: 265 GTSPVTAFHENDFGLKNMIGNVWEWTSDFWDIKHNKEPVENPTGPTKGTDKVKKGGSFLC 324
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDK 351
++ YCYR+RCAARSQNTPDSSA NLGFRCAADK P D+
Sbjct: 325 HKSYCYRYRCAARSQNTPDSSASNLGFRCAADKLPDYLIDE 365
>gi|432859552|ref|XP_004069151.1| PREDICTED: sulfatase-modifying factor 1-like [Oryzias latipes]
Length = 368
Score = 366 bits (940), Expect = 9e-99, Method: Compositional matrix adjust.
Identities = 179/329 (54%), Positives = 222/329 (67%), Gaps = 50/329 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M + G F MGT++P + DGE P R V +D+F++D HEV+N QFQ FV ATGY+TEAE
Sbjct: 88 MAWISGGEFLMGTDQPGIAADGEGPQRPVHVDSFHMDIHEVTNRQFQSFVRATGYITEAE 147
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE ++SE+ +++ISQ AVA W
Sbjct: 148 KFGDSFVFEGIVSEDVKSQISQ------------------------------AVAAAPWW 177
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS I RM+HPVV VSWNDA+AYC+W
Sbjct: 178 ---LPVKG-----------------ANWKHPEGPDSNITDRMDHPVVQVSWNDALAYCSW 217
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RLPTEAEWE CRGGL++RL+PWGN L P+G+H AN+WQGEFP +N+A DGY T+P
Sbjct: 218 VNKRLPTEAEWEVACRGGLKDRLYPWGNKLNPKGQHYANLWQGEFPAHNSAEDGYTKTSP 277
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
VMS+ N FGLY+MVGN WEWT+DWW+VHH +NP GP +GTDKVKKGGSY+C++ Y
Sbjct: 278 VMSFPGNAFGLYDMVGNAWEWTSDWWSVHHTTDHQHNPTGPPSGTDKVKKGGSYMCHKSY 337
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADK 343
CYR+RCAARSQNTPDSSA NLGFRCAA++
Sbjct: 338 CYRYRCAARSQNTPDSSASNLGFRCAAEQ 366
>gi|209730894|gb|ACI66316.1| Sulfatase-modifying factor 1 precursor [Salmo salar]
Length = 379
Score = 366 bits (939), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 181/332 (54%), Positives = 222/332 (66%), Gaps = 51/332 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+PG F MGT+ P + +DGE P R V LD FY+D HEVSN FQ F++ATGY+TEAE
Sbjct: 98 MVLIPGGEFLMGTDNPGIPQDGEGPQRRVHLDHFYMDAHEVSNRHFQSFINATGYITEAE 157
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+FVFE +LSEE +++ISQ AVA
Sbjct: 158 RFGDSFVFEGVLSEEVKSQISQ------------------------------AVA----- 182
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIE-HRMNHPVVHVSWNDAVAYCT 193
A W RG W HPEG DS+I R++HPV+HVSW DAVAYC+
Sbjct: 183 ------AAPWWLPVRGA---------DWRHPEGPDSSITGSRLDHPVLHVSWQDAVAYCS 227
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWEY CRGGL++RL+PWGN L P+G+H AN+WQG+FPT+N+ DGY T+
Sbjct: 228 WAHKRLPTEAEWEYACRGGLQDRLYPWGNKLKPKGQHYANLWQGKFPTHNSEEDGYTKTS 287
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV S+ N +GLYNMVGN WEWT+DWW VHH +NP GP +GTD+VKKGGSY+C++
Sbjct: 288 PVKSFPANGYGLYNMVGNAWEWTSDWWTVHHTTDERHNPAGPPSGTDRVKKGGSYMCHKS 347
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
YCYR+RCAARSQNTPDSSA NLGFRC + + P
Sbjct: 348 YCYRYRCAARSQNTPDSSASNLGFRCVSQEQP 379
>gi|225707352|gb|ACO09522.1| Sulfatase-modifying factor 1 precursor [Osmerus mordax]
Length = 368
Score = 365 bits (938), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 176/331 (53%), Positives = 224/331 (67%), Gaps = 50/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+ G F MGT+ P + DGE P R+V L+ F++D HEV+N QF+ F++ATGYVTEAE
Sbjct: 88 MVLISGGEFLMGTDNPCIPPDGEGPQRSVNLEPFFMDVHEVTNHQFKSFINATGYVTEAE 147
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+FVF+ +LSE +++ISQ AVA W
Sbjct: 148 TFGDSFVFDGVLSEATKSQISQ------------------------------AVAAAPWW 177
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + W HPEG DS+I R +HPV+HVSW DAVAYC+W
Sbjct: 178 ---LPVKG-----------------ADWKHPEGPDSSITDRTDHPVLHVSWRDAVAYCSW 217
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RLPTEAEWE+ CRGGL++RL+PWGN LTP+G+H +N+WQG+FPT+N+ DGY+ T+P
Sbjct: 218 AQKRLPTEAEWEHACRGGLKDRLYPWGNKLTPKGQHYSNLWQGDFPTHNSGEDGYIKTSP 277
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
VMS+ N +GLY+MVGN WEWT+DWW+VHH +NPKGP +GTDKVKKGGSY+C++ Y
Sbjct: 278 VMSFPANGYGLYDMVGNAWEWTSDWWSVHHTADRQHNPKGPPSGTDKVKKGGSYMCHKSY 337
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
CYR+RCAARSQNTPDSSA NLGFRC + + P
Sbjct: 338 CYRYRCAARSQNTPDSSASNLGFRCVSQEPP 368
>gi|410919449|ref|XP_003973197.1| PREDICTED: sulfatase-modifying factor 1-like [Takifugu rubripes]
Length = 362
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 178/327 (54%), Positives = 220/327 (67%), Gaps = 50/327 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+PG F MGT++P + DGE P R V +D+FY+D HEV+N QFQ FV+ TGYVTEAE
Sbjct: 82 MVLIPGGEFLMGTDEPSIPADGEGPQRPVHVDSFYMDIHEVTNQQFQTFVADTGYVTEAE 141
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LS+ + +I+Q AVA W
Sbjct: 142 KFGDSFVFEGILSDSVKKEITQ------------------------------AVAAAPWW 171
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DS I R++HPV+HVSWNDA+ YC+W
Sbjct: 172 ---LPVKG-----------------ANWKHPEGQDSNITDRLDHPVLHVSWNDALTYCSW 211
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RLPTEAEWE CRGGL+NRL+PWGN L P+G+H AN+WQG+FP +NTA DGY+ T+P
Sbjct: 212 LHKRLPTEAEWECACRGGLQNRLYPWGNKLNPKGQHYANLWQGDFPNHNTAEDGYVKTSP 271
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
VMS+ N FGLY++VGNVWEWT+DWW VHH NP GP +G+DKVKKGGSY+C++ Y
Sbjct: 272 VMSFPPNGFGLYDIVGNVWEWTSDWWMVHHTAEKLQNPTGPQSGSDKVKKGGSYMCHKSY 331
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAA 341
CYR+RCAARSQNTPDSSA NLGFRC +
Sbjct: 332 CYRYRCAARSQNTPDSSASNLGFRCVS 358
>gi|427782167|gb|JAA56535.1| Putative sulfatase-modifying factor 1-like protein [Rhipicephalus
pulchellus]
Length = 376
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 179/335 (53%), Positives = 215/335 (64%), Gaps = 50/335 (14%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R MVL+ G F MGT+KP+ + DGE P+R V + FY+D HEVSN +F+ FV ATG+V
Sbjct: 88 RTNQMVLVRGGQFEMGTDKPVFVADGENPARPVRVSDFYIDVHEVSNAEFERFVKATGHV 147
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE FGD+FV + +SEE + I+Q AVA
Sbjct: 148 TEAETFGDSFVLDSAISEETKKGITQ------------------------------AVAA 177
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W LP + W HPEG DS I RM+HPVVHVSWNDA A
Sbjct: 178 APWW---LPVKG-----------------ADWRHPEGPDSHIHDRMDHPVVHVSWNDAAA 217
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
+C W G RL TEAEWEY CR GL+NRLFPWGNN TP+GE RAN+W+G+FP NTA DGY+
Sbjct: 218 FCKWSGKRLLTEAEWEYACRAGLKNRLFPWGNNWTPKGETRANIWEGKFPVTNTAEDGYV 277
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
TAPV ++ KFGL N++GNVWEWT DWWN H P NPKGP +GTDKVKKGGS++C
Sbjct: 278 GTAPVTAFPATKFGLKNIIGNVWEWTNDWWNTRHTSEPQDNPKGPPSGTDKVKKGGSFMC 337
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
++ YCYR+RCAARSQNTPD+SA N+G RC ADK P
Sbjct: 338 HKAYCYRYRCAARSQNTPDTSAYNVGIRCGADKLP 372
>gi|348507653|ref|XP_003441370.1| PREDICTED: sulfatase-modifying factor 1-like [Oreochromis
niloticus]
Length = 387
Score = 363 bits (932), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 181/334 (54%), Positives = 217/334 (64%), Gaps = 50/334 (14%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E MV + G F MGT+ P + DGE P R V +D+F +D HEV+N QFQ FV ATGY
Sbjct: 102 ETQSPMVQISGGEFLMGTDNPGIPPDGEGPQRLVYVDSFNMDIHEVTNQQFQSFVLATGY 161
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VTEAEKFGD+FVFE LLSE +++I+Q AVA
Sbjct: 162 VTEAEKFGDSFVFEGLLSEPVKSQITQ------------------------------AVA 191
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W LP + SW HPEG DS I R++HPV+HVSW DAV
Sbjct: 192 SAPWW---LPVKG-----------------ASWRHPEGPDSNITDRLHHPVLHVSWTDAV 231
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AYC+W RLPTEAEWEY CRGGL++RL+PWGN L P+G+H AN+WQG+FPT+N+ DGY
Sbjct: 232 AYCSWANKRLPTEAEWEYACRGGLKDRLYPWGNKLNPKGQHFANLWQGDFPTHNSGEDGY 291
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
+ T+PVMS+ N FGLY+MVGN WEWT DWW VHH P NP GP +G DKVKKGGSY+
Sbjct: 292 IKTSPVMSFPANAFGLYDMVGNAWEWTTDWWTVHHTTDPQRNPTGPPSGKDKVKKGGSYM 351
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
C+ YCYR+RCAARSQNTPDSSA NLGFRC + +
Sbjct: 352 CHRSYCYRYRCAARSQNTPDSSASNLGFRCVSQE 385
>gi|308322299|gb|ADO28287.1| sulfatase-modifying factor 1 [Ictalurus furcatus]
Length = 281
Score = 358 bits (920), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 179/331 (54%), Positives = 215/331 (64%), Gaps = 50/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVLL G F MGT+ P + +DGE R V LD FY+D+HEV+N QFQ F + T +VTEAE
Sbjct: 1 MVLLQGGWFLMGTDDPGIPQDGEGSQRRVWLDPFYIDEHEVTNLQFQLFTNTTHHVTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+FVFE +LSEE ++ + H V W V WR
Sbjct: 61 RFGDSFVFEGMLSEE---------------------VKSTLKHAVAAAPWWSPVKGADWR 99
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
HPEG DS+I HRM+HPV+HVSW DA YC W
Sbjct: 100 -----------------------------HPEGPDSSITHRMDHPVLHVSWADAQVYCEW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE+ CRGGL++RL+PWGN L PR +H AN+WQGEFP +NTA DG+ T+P
Sbjct: 131 AGRRLPTEAEWEFSCRGGLKDRLYPWGNKLMPREQHYANLWQGEFPNHNTAEDGFAQTSP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V + N FGLY+MVGNVWEWTADWW+VHH +YNPKGP TG+D+VKKGGSY+C++ Y
Sbjct: 191 VKKFPANAFGLYDMVGNVWEWTADWWSVHHTRDHAYNPKGPETGSDRVKKGGSYMCHKSY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
CYR+RCAARSQNTPDSSA NLGFRCAA P
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCAAAATP 281
>gi|340367863|ref|XP_003382472.1| PREDICTED: sulfatase-modifying factor 1-like [Amphimedon
queenslandica]
Length = 360
Score = 357 bits (916), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 177/327 (54%), Positives = 217/327 (66%), Gaps = 50/327 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++VL+PG F+MG +KP++ +DGE PSR+VT+ +FY+D++EVSN ++ +FVS TGYVTE
Sbjct: 81 ELVLIPGGVFKMGLDKPVIPEDGETPSRSVTITSFYMDKYEVSNDEYGKFVSDTGYVTET 140
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG++FV E +S+E + I+Q AVA W
Sbjct: 141 ESFGNSFVVEYFISKEVQEGITQ------------------------------AVANAPW 170
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP + SW PEGIDS I+ RM+HPVVHVSWNDA AYC
Sbjct: 171 W---LPVDN-----------------ASWRQPEGIDSDIKGRMSHPVVHVSWNDADAYCK 210
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY CR GL++RLFPWGNN TP GEH N+WQG+FP NNT DGY
Sbjct: 211 WAGKRLPTEAEWEYACRAGLKDRLFPWGNNPTPHGEHWMNIWQGKFPINNTLDDGYAGPG 270
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV + NKFGLYNMVGNVWEW +DWW HH +PKGP TG++KVKKGGSY+C+++
Sbjct: 271 PVDIFPPNKFGLYNMVGNVWEWVSDWWTTHHSKKDQKDPKGPPTGSEKVKKGGSYMCHKE 330
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCA 340
YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 331 YCYRYRCAARSQNTPDSSAHNLGFRCA 357
>gi|47229531|emb|CAG06727.1| unnamed protein product [Tetraodon nigroviridis]
Length = 359
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 175/332 (52%), Positives = 218/332 (65%), Gaps = 50/332 (15%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E MVL+PG F MGT+ P + DGE P R V +D+FY+D HEV+N QFQ FV+ATGY
Sbjct: 77 EARSPMVLVPGGAFLMGTDDPGIPADGESPQRLVYVDSFYMDTHEVTNRQFQMFVNATGY 136
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VTEAEKFGD+FVFE +LS++ + +I+Q AVA
Sbjct: 137 VTEAEKFGDSFVFEGILSDKVKKEITQ------------------------------AVA 166
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W LP + +W HPEG+ S+I R++HPV+HVSW DA
Sbjct: 167 AAPWW---LPVKG-----------------ANWKHPEGLGSSITDRLDHPVLHVSWTDAR 206
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
YC+W RLPTEAEWE CRGGL++RL+PWGN L P+G+H AN+WQG+FP NT DG+
Sbjct: 207 TYCSWLDKRLPTEAEWECACRGGLKDRLYPWGNKLNPKGKHYANLWQGDFPNYNTGEDGF 266
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
+ T+PVMS+ N FGLY++VGNVWEWT+DWW VHH NP GP +G DKVKKGGSY+
Sbjct: 267 VKTSPVMSFPPNGFGLYDIVGNVWEWTSDWWTVHHSAENQSNPTGPQSGNDKVKKGGSYM 326
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
C++ YCYR+RCAARSQNTPDSSA NLGFRC +
Sbjct: 327 CHKSYCYRYRCAARSQNTPDSSASNLGFRCVS 358
>gi|260809998|ref|XP_002599791.1| hypothetical protein BRAFLDRAFT_57613 [Branchiostoma floridae]
gi|229285073|gb|EEN55803.1| hypothetical protein BRAFLDRAFT_57613 [Branchiostoma floridae]
Length = 372
Score = 352 bits (904), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 183/337 (54%), Positives = 213/337 (63%), Gaps = 52/337 (15%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R MV + G F MGT+ +DGE P+R V +D+FY D +EVSN +F FV TG+V
Sbjct: 75 RTNQMVFVRGGDFTMGTDDVRFPQDGEGPARPVRVDSFYADVYEVSNAEFDIFVKETGFV 134
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAEKFGD+FV E L+SEE +++I+Q AVA
Sbjct: 135 TEAEKFGDSFVLENLISEEVKSEITQ------------------------------AVAG 164
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W LP + W HPEG+DS I +RM+HPVVHVSWNDAVA
Sbjct: 165 APWW---LPVKG-----------------ADWRHPEGVDSDIINRMDHPVVHVSWNDAVA 204
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YCTW RLPTEAEWEY RGGLENRL+PWGNN P+ EHR N+WQG+FP NT DGY
Sbjct: 205 YCTWAKKRLPTEAEWEYASRGGLENRLYPWGNNEMPKKEHRMNIWQGKFPEENTEEDGYY 264
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP--TTGTDKVKKGGSY 308
STAPV SYK NKFGLYN GNVWEW +DW +V H NPKGP + DKVKKGGSY
Sbjct: 265 STAPVTSYKPNKFGLYNTAGNVWEWVSDWLSVRHTTELQVNPKGPAKSATNDKVKKGGSY 324
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+C++ YCYR+R AARSQNTPDSSA NLGFRC ADK P
Sbjct: 325 MCHKDYCYRYRNAARSQNTPDSSASNLGFRCFADKLP 361
>gi|307188144|gb|EFN72976.1| Sulfatase-modifying factor 1 [Camponotus floridanus]
Length = 279
Score = 351 bits (901), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 169/326 (51%), Positives = 210/326 (64%), Gaps = 50/326 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ + T+ +GTN P+++ DGE P R V LD+FY+D+ EVSN +F FV TGYVTEAE
Sbjct: 1 MIKINAGTYFIGTNNPVIVADGEGPKREVILDSFYIDKFEVSNEEFATFVKNTGYVTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+FVFE LL+++ + KI++ VA W
Sbjct: 61 NFGDSFVFEDLLTQDMKNKITKT------------------------------VAQAPWW 90
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+P + SW HPEG DS I +RM+HPV+HVSWNDA+AYC W
Sbjct: 91 ---MPVKQ-----------------ASWQHPEGPDSNITYRMDHPVIHVSWNDAIAYCNW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RLPTEAEWE CRGGL +RL+PWGN P +H+AN+WQG FPT NT DGY T+P
Sbjct: 131 MKKRLPTEAEWEVACRGGLTDRLYPWGNKFMPNNQHKANIWQGNFPTKNTEKDGYKGTSP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V + +NK+GL+N+VGNVWEWTADWW H NP GP++G DKVKKGGSYLC++ Y
Sbjct: 191 VTMFPQNKYGLHNIVGNVWEWTADWWITRHSSDQQTNPVGPSSGNDKVKKGGSYLCHKSY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCA 340
CYR+RCAARSQNTPD+SAGNLGFRCA
Sbjct: 251 CYRYRCAARSQNTPDTSAGNLGFRCA 276
>gi|72006588|ref|XP_782973.1| PREDICTED: sulfatase-modifying factor 1 [Strongylocentrotus
purpuratus]
Length = 417
Score = 349 bits (895), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 178/335 (53%), Positives = 206/335 (61%), Gaps = 50/335 (14%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R M + G TF+MGT+K + DGE PSR VTLD +Y D +EVSN++F+ FV+ T YV
Sbjct: 120 RTNQMNFIEGGTFQMGTDKAKIYLDGESPSRLVTLDPYYFDVYEVSNSEFELFVNTTSYV 179
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAEKFGD+FV E +SEE + ISQV VA
Sbjct: 180 TEAEKFGDSFVLEARISEEVKKDISQV------------------------------VAA 209
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W LP + W HPEG DS+I RM+HPV H+SWNDA A
Sbjct: 210 APWW---LPVKG-----------------AEWRHPEGPDSSISSRMDHPVTHMSWNDATA 249
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLPTEAEWE RGGL+NRLFPWGN L P+ HR N+WQGEFP NTA DGY
Sbjct: 250 YCQWAGKRLPTEAEWENAARGGLKNRLFPWGNKLMPKDHHRVNIWQGEFPKVNTAEDGYE 309
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
T PV +++ N +GLYN VGN WEW ADWWN H P NP GP GTDKVKKGGSY+C
Sbjct: 310 GTCPVTAFEPNGYGLYNTVGNAWEWVADWWNTVHSPESQNNPVGPDEGTDKVKKGGSYMC 369
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+ YCYR+RC ARSQN+PDSSA NLGFRCAA P
Sbjct: 370 HISYCYRYRCEARSQNSPDSSACNLGFRCAATNLP 404
>gi|307194482|gb|EFN76774.1| Sulfatase-modifying factor 1 [Harpegnathos saltator]
Length = 344
Score = 348 bits (892), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 171/331 (51%), Positives = 214/331 (64%), Gaps = 50/331 (15%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E+ MV + T+ +GTN PI + DGE P R V LD+FY+D++EVSN +F FVS+TGY
Sbjct: 61 EQTGKMVEIIAGTYFVGTNNPIFVVDGEGPKREVLLDSFYIDKYEVSNEEFAAFVSSTGY 120
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
TEAE+FGD+FVFE LLS+ + I Q AVA
Sbjct: 121 TTEAERFGDSFVFEGLLSQNTKNTIKQ------------------------------AVA 150
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W LP + +W HPEG DS I RM+HPVVHVSWNDA
Sbjct: 151 EAPWW---LPVKK-----------------ATWQHPEGPDSNITSRMDHPVVHVSWNDAT 190
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AYC+W G RLPTEAEWE CRG + +R +PWGN + P+ +H+ N+WQGEFP NT DGY
Sbjct: 191 AYCSWLGKRLPTEAEWEAACRGNISDRHYPWGNKIMPKDQHKVNIWQGEFPLRNTMEDGY 250
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
ST+PV ++ +NK+GL+N+VGNVWEWT+DWW +H NP GP +GT+KVKKGGSYL
Sbjct: 251 KSTSPVTTFPQNKYGLHNIVGNVWEWTSDWWMTNHSHDQQSNPTGPPSGTNKVKKGGSYL 310
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
C++ YC+R+RCAARSQNTPD+SAGNLGFRCA
Sbjct: 311 CHKNYCHRYRCAARSQNTPDTSAGNLGFRCA 341
>gi|325974319|dbj|BAJ83907.1| sulfatase modifying factor 1 [Hemicentrotus pulcherrimus]
Length = 423
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 177/335 (52%), Positives = 204/335 (60%), Gaps = 50/335 (14%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R M + G TFRMGT+K + DGE PSR VTLD +Y D +EVSN++F+ FV+ T Y+
Sbjct: 126 RTNQMNFIEGGTFRMGTDKAKIYLDGESPSRLVTLDPYYFDVYEVSNSEFELFVNTTSYI 185
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAEKFGD+FV E +SEE + ISQV VA
Sbjct: 186 TEAEKFGDSFVLEARISEEVKKDISQV------------------------------VAA 215
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W LP + W HPEG DS+I RM+HPV H+SWNDA A
Sbjct: 216 APWW---LPVKG-----------------AEWRHPEGPDSSISSRMDHPVTHISWNDATA 255
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLPTEAEWE RGGL NRLFPWGN L P+ HR N+WQGEFP NTA DGY
Sbjct: 256 YCQWAGKRLPTEAEWENAARGGLNNRLFPWGNKLMPKDHHRVNIWQGEFPKVNTAEDGYE 315
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
T PV +++ N +GLYN VGN WEW ADWW H P NP GP GTDKVKKGGSY+C
Sbjct: 316 GTCPVTAFEPNGYGLYNTVGNAWEWVADWWTTVHSPESQNNPVGPDEGTDKVKKGGSYMC 375
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+ YCYR+RC ARSQN+PDSSA NLGFRCAA P
Sbjct: 376 HISYCYRYRCEARSQNSPDSSACNLGFRCAATNLP 410
>gi|345485300|ref|XP_001600441.2| PREDICTED: sulfatase-modifying factor 1-like [Nasonia vitripennis]
Length = 416
Score = 347 bits (891), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 172/340 (50%), Positives = 214/340 (62%), Gaps = 50/340 (14%)
Query: 2 VLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
++ P + +MV + G ++ +GTN P+ + DGE P R V L +FY+D++EVSN FQ
Sbjct: 125 IIKPIYKKDLTSNMVRIKGGSYFIGTNDPVFVSDGEGPRREVELGSFYIDKYEVSNADFQ 184
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV+A+ Y TEAEKFGD+FVFE LLS++ + +I +
Sbjct: 185 EFVTASSYKTEAEKFGDSFVFEGLLSKDIKNEIKE------------------------- 219
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
AVA +A W +G + W PEG DS I+ RM+HPVV
Sbjct: 220 -----AVA-----------QAPWWLPVKGAV---------WSQPEGRDSNIKDRMDHPVV 254
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
HVSWNDAVAYC W G RLPTEAEWE CRGGL +RLFPWGN L P EHR N+WQGEFP
Sbjct: 255 HVSWNDAVAYCKWIGKRLPTEAEWEVTCRGGLNDRLFPWGNKLMPNNEHRTNIWQGEFPL 314
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK 301
NNT DG+ T PV + N +G++N++GNVWEWT+DWW H NP G G+DK
Sbjct: 315 NNTVEDGFDRTNPVSYFPPNAYGVHNIIGNVWEWTSDWWKTKHTDEKQTNPTGAPNGSDK 374
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
VKKGGSYLC++ YCYR+RCAARSQNTPD+SAGNLGFRCA
Sbjct: 375 VKKGGSYLCHKSYCYRYRCAARSQNTPDTSAGNLGFRCAV 414
>gi|297670801|ref|XP_002813543.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Pongo abelii]
Length = 354
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 178/346 (51%), Positives = 214/346 (61%), Gaps = 74/346 (21%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR++HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIRHRLDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NP
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 352
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348
>gi|257470977|ref|NP_001158147.1| sulfatase-modifying factor 1 isoform 3 precursor [Homo sapiens]
Length = 354
Score = 341 bits (875), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 179/346 (51%), Positives = 214/346 (61%), Gaps = 74/346 (21%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NP
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 352
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348
>gi|383865019|ref|XP_003707974.1| PREDICTED: sulfatase-modifying factor 1-like [Megachile rotundata]
Length = 355
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 172/329 (52%), Positives = 204/329 (62%), Gaps = 51/329 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+M + +F +GT+ PI + DGE P R V+LD+FY+D+ EVSN F FV AT Y TE
Sbjct: 74 KNMAKIEEGSFTVGTDDPIFVADGEGPKRRVSLDSFYMDELEVSNRDFAMFVDATAYRTE 133
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEKFGD+FVFE LL +E I + V W +
Sbjct: 134 AEKFGDSFVFEGLLEQE---------------------IRANISSVVARAPWWVQIKNAN 172
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W+ HPEG S I HRM+HPVVHVSWNDA+AYC
Sbjct: 173 WQ-----------------------------HPEGPGSNITHRMDHPVVHVSWNDALAYC 203
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWE CRGGL +RL+PWGN LTP G++RAN WQGEFP+ N DGY T
Sbjct: 204 KWLGKRLPTEAEWEVACRGGLSDRLYPWGNKLTPNGQYRANTWQGEFPSINLKEDGYEGT 263
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
+PV + NK+GL NMVGNVWEWT+DWW V S NP GP+ GTDKVKKGGSY+C+
Sbjct: 264 SPVSKFPPNKYGLRNMVGNVWEWTSDWWTVEVAKRGGSSNPNGPSDGTDKVKKGGSYMCH 323
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
E YC+R+RCAARSQNTPD+SAGNLGFRCA
Sbjct: 324 ESYCFRYRCAARSQNTPDTSAGNLGFRCA 352
>gi|66533071|ref|XP_392816.2| PREDICTED: sulfatase-modifying factor 1-like isoform 2 [Apis
mellifera]
Length = 367
Score = 340 bits (873), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 163/333 (48%), Positives = 210/333 (63%), Gaps = 51/333 (15%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+ K+M+ + F +GTN PI I DGE P + + L++FY+D+ EVSN F +FV AT
Sbjct: 82 INHLKNMIKIKAGIFGIGTNDPIFIADGEGPKQQIYLNSFYIDKTEVSNHDFSKFVDATN 141
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y TEAE+FGD+FVF LL ++++ I+ V
Sbjct: 142 YKTEAERFGDSFVFRGLLKQKDQKNITLV------------------------------- 170
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
+A W + +WLHPEG +S I++RM+HPV+HVSWNDA
Sbjct: 171 ----------VAQAPWWLQVKNA---------NWLHPEGPESDIKNRMDHPVIHVSWNDA 211
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
+AYC W G RLPTEAEWE CRGGL +RL+PWGN L P G++RAN WQG+FP NNT DG
Sbjct: 212 IAYCNWLGKRLPTEAEWEVACRGGLSDRLYPWGNKLIPNGQYRANTWQGDFPNNNTKEDG 271
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-NPKGPTTGTDKVKKGGS 307
Y T+PV + NK+GLYNM+GNVWEWT DWW + + NP GP+ G +KVKKGGS
Sbjct: 272 YEGTSPVREFPPNKYGLYNMIGNVWEWTFDWWTIEITTRGGFVNPTGPSQGINKVKKGGS 331
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
YLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 332 YLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 364
>gi|193659919|ref|XP_001952215.1| PREDICTED: sulfatase-modifying factor 1-like [Acyrthosiphon pisum]
Length = 332
Score = 340 bits (871), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 165/329 (50%), Positives = 211/329 (64%), Gaps = 50/329 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+L+ G F +GT+KP ++ DGE PSR V LD +++D++EVSN +F++FV Y TEAE
Sbjct: 54 MILIKGQNFTIGTDKPEIVSDGESPSRTVYLDDYFIDKYEVSNLEFKKFVEIMEYQTEAE 113
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+FVF+ LS+ I+Q D
Sbjct: 114 VFGDSFVFQLFLSKTTLKSITQAVKD---------------------------------- 139
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W +G +W +PEG DS I R HPVVHVSWNDA A+C W
Sbjct: 140 -------APWWVPVKGA---------NWKNPEGKDSDISDRELHPVVHVSWNDATAFCNW 183
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+EAEWE CRGGL++RL+PWGN L P +HR N+WQGEFP NTA DG+ +TAP
Sbjct: 184 AGKRLPSEAEWEAACRGGLKDRLYPWGNKLLPSNKHRTNIWQGEFPFKNTAEDGWATTAP 243
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ +N +GL+NMVGNVWEWT+D+W+V H + + NP GPT G ++VKKGGSYLC+++Y
Sbjct: 244 VNTFLKNGYGLHNMVGNVWEWTSDFWSVDHDQSFTINPSGPTEGKERVKKGGSYLCHKEY 303
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADK 343
CYR+RCAARSQNTPDSSA NLGFRCA DK
Sbjct: 304 CYRYRCAARSQNTPDSSASNLGFRCAKDK 332
>gi|111306476|gb|AAI21124.1| SUMF1 protein [Homo sapiens]
Length = 354
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 178/346 (51%), Positives = 213/346 (61%), Gaps = 74/346 (21%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R T+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRATIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NP
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 352
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348
>gi|397522434|ref|XP_003831272.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Pan paniscus]
Length = 354
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 178/346 (51%), Positives = 214/346 (61%), Gaps = 74/346 (21%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NP
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAA++ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAANRLPT 352
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/32 (84%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSSA NLGFRCAA+
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAN 348
>gi|332815964|ref|XP_003309635.1| PREDICTED: sulfatase-modifying factor 1 isoform 1 [Pan troglodytes]
Length = 354
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 178/346 (51%), Positives = 214/346 (61%), Gaps = 74/346 (21%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NP
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------- 319
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAA++ PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSASNLGFRCAANRLPT 352
Score = 61.2 bits (147), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/32 (84%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSSA NLGFRCAA+
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAN 348
>gi|403270414|ref|XP_003927177.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 356
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/346 (50%), Positives = 213/346 (61%), Gaps = 74/346 (21%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F
Sbjct: 79 PGPVPEERPLAHSKMVFIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDTYEVSNAEF 138
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 139 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 174
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DST+ HR +HPV
Sbjct: 175 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPV 208
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 209 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLNNRLFPWGNKLQPKGQHYANIWQGEFP 268
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW V+H + NP
Sbjct: 269 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVYHSVEETLNPS------- 321
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 322 -------------YCYRYRCAARSQNTPDSSASNLGFRCAADRPPT 354
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 319 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 350
>gi|402859537|ref|XP_003894211.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Papio anubis]
Length = 354
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/342 (51%), Positives = 211/342 (61%), Gaps = 73/342 (21%)
Query: 8 PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81 PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ---------------------------- 172
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
AVA W LP + +W HPEG DSTI HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTIWHRPDHPVLHVS 210
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
WNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NP
Sbjct: 271 GEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS----------- 319
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 320 ---------YCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 352
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348
>gi|395824561|ref|XP_003785531.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Otolemur
garnettii]
Length = 354
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/346 (50%), Positives = 212/346 (61%), Gaps = 74/346 (21%)
Query: 5 PAP-PVERY---KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P PVER MVL+P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F
Sbjct: 77 PGPIPVERPLAPTKMVLIPAGVFTMGTDDPQIKQDGEAPARRVAVDAFYMDAYEVSNAEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTDIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DS + HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSNVLHRQDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NP
Sbjct: 267 VTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSIEETVNPS------- 319
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSS+ NLGFRCAAD PT
Sbjct: 320 -------------YCYRYRCAARSQNTPDSSSSNLGFRCAADHLPT 352
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 27/32 (84%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSS+ NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSSSNLGFRCAAD 348
>gi|347965878|ref|XP_003435826.1| AGAP013122-PA [Anopheles gambiae str. PEST]
gi|333470313|gb|EGK97587.1| AGAP013122-PA [Anopheles gambiae str. PEST]
Length = 359
Score = 337 bits (865), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 174/334 (52%), Positives = 212/334 (63%), Gaps = 53/334 (15%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+R++DM L+PG + +GTN+PI +KD E P+R T+ FYLDQ+EVSN QF+ FV TGY
Sbjct: 76 KRFEDMSLIPGGEYVIGTNEPIFVKDRESPARPATIRDFYLDQYEVSNAQFKAFVDQTGY 135
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VTEAEKFGD+FVF+ LLSE R +
Sbjct: 136 VTEAEKFGDSFVFQQLLSEPVRQQ------------------------------------ 159
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG-IDSTIEHRMNHPVVHVSWNDA 188
Y +R A P W Y RG SW HPEG + I R++HPVVHVSWNDA
Sbjct: 160 YEDFRVAAAP----WWYKVRGA---------SWQHPEGDVSRDISDRLDHPVVHVSWNDA 206
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
VAYC W+G RLPTEAEWE CRGG + +LFPWGN L P+ +H N+WQGEFP +N DG
Sbjct: 207 VAYCAWKGKRLPTEAEWEAACRGGRKQKLFPWGNKLMPKEQHMMNIWQGEFPDSNLKEDG 266
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
Y +T PV S+++N F LYN+VGNVWEWTAD W+ + K + ++VKKGGSY
Sbjct: 267 YETTCPVTSFRQNPFELYNIVGNVWEWTADLWDAKD---AAIERKPGSDPPNRVKKGGSY 323
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
LC+E YCYR+RCAARSQNT DSSAGNLGFRCAAD
Sbjct: 324 LCHESYCYRYRCAARSQNTEDSSAGNLGFRCAAD 357
>gi|350420524|ref|XP_003492537.1| PREDICTED: sulfatase-modifying factor 1-like [Bombus impatiens]
Length = 369
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 170/334 (50%), Positives = 210/334 (62%), Gaps = 53/334 (15%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
++R K+M + TF +GTN+P+ + DGE P + + L +FY+D+ EVSN F +FVS T
Sbjct: 84 IDRLKNMAKIEAGTFTIGTNEPVFVADGEGPKQQIYLHSFYIDKMEVSNRDFAKFVSVTN 143
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y TEAEKFGD+FVFE LL E+ R
Sbjct: 144 YRTEAEKFGDSFVFEGLLDEKVR------------------------------------- 166
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
A T AR P W + +W PEG S I++RM+HPVVHVSWNDA
Sbjct: 167 ANVTLVVARAP----WWLQVKDA---------NWQQPEGPTSDIKNRMDHPVVHVSWNDA 213
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
+AYC W G RLPTEAEWE CRGGL ++L+PWGNNL P G +RAN WQG+FP+NNT D
Sbjct: 214 IAYCEWLGKRLPTEAEWEVACRGGLSDKLYPWGNNLIPNGRYRANTWQGDFPSNNTKEDK 273
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP--APSYNPKGPTTGTDKVKKGG 306
Y ST+PV + NK+GL NM+GNVWEWT DWW P+ NP GP+ GTDKVKKGG
Sbjct: 274 YESTSPVTEFPPNKYGLRNMIGNVWEWTFDWWTTETRTRGGPN-NPSGPSRGTDKVKKGG 332
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
SYLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 333 SYLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 366
>gi|340709728|ref|XP_003393454.1| PREDICTED: sulfatase-modifying factor 1-like isoform 2 [Bombus
terrestris]
Length = 358
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 164/332 (49%), Positives = 210/332 (63%), Gaps = 60/332 (18%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
++R K+M + T+ +GTN+P+ + DGE P + + L +FY+D+ EVSN F +FV+ T
Sbjct: 84 IDRLKNMAKIEAGTYTIGTNEPVFVADGEGPKQQIYLHSFYIDKMEVSNRDFAKFVNVTN 143
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y TEAEKFGD+FVFE LL E+ RA ++ V
Sbjct: 144 YRTEAEKFGDSFVFEGLLDEKVRANVTLV------------------------------- 172
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
+A W + +W HPEG S I++RM+HPVVHVSWNDA
Sbjct: 173 ----------VAQAPWWLQIKNA---------NWQHPEGPTSDIKNRMDHPVVHVSWNDA 213
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
+AYC W RLPTEAEWE CRGGL +RL+PWGNNL P G +RAN WQG+FP+NNT D
Sbjct: 214 IAYCEWLEKRLPTEAEWEVACRGGLSDRLYPWGNNLIPNGRYRANTWQGDFPSNNTKEDK 273
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
Y ST+PV + NK+GL+NM+GNVWEWT DWW + GP++GTDKVKKGGSY
Sbjct: 274 YESTSPVTEFPPNKYGLHNMIGNVWEWTFDWWTTN----------GPSSGTDKVKKGGSY 323
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
LC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 324 LCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 355
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 38/47 (80%), Positives = 46/47 (97%)
Query: 344 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
GP++GTDKVKKGGSYLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 309 GPSSGTDKVKKGGSYLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 355
>gi|340709726|ref|XP_003393453.1| PREDICTED: sulfatase-modifying factor 1-like isoform 1 [Bombus
terrestris]
Length = 369
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 166/334 (49%), Positives = 213/334 (63%), Gaps = 53/334 (15%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
++R K+M + T+ +GTN+P+ + DGE P + + L +FY+D+ EVSN F +FV+ T
Sbjct: 84 IDRLKNMAKIEAGTYTIGTNEPVFVADGEGPKQQIYLHSFYIDKMEVSNRDFAKFVNVTN 143
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y TEAEKFGD+FVFE LL E +VR ++
Sbjct: 144 YRTEAEKFGDSFVFEGLLDE-------KVRANV--------------------------- 169
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
+ +A W + +W HPEG S I++RM+HPVVHVSWNDA
Sbjct: 170 -------TLVVAQAPWWLQIKNA---------NWQHPEGPTSDIKNRMDHPVVHVSWNDA 213
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
+AYC W RLPTEAEWE CRGGL +RL+PWGNNL P G +RAN WQG+FP+NNT D
Sbjct: 214 IAYCEWLEKRLPTEAEWEVACRGGLSDRLYPWGNNLIPNGRYRANTWQGDFPSNNTKEDK 273
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP--APSYNPKGPTTGTDKVKKGG 306
Y ST+PV + NK+GL+NM+GNVWEWT DWW P+ NP GP++GTDKVKKGG
Sbjct: 274 YESTSPVTEFPPNKYGLHNMIGNVWEWTFDWWTTETRKRGGPN-NPNGPSSGTDKVKKGG 332
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
SYLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 333 SYLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 366
>gi|380028488|ref|XP_003697932.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 1-like
[Apis florea]
Length = 280
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 164/327 (50%), Positives = 208/327 (63%), Gaps = 51/327 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ + F +GTN PI I DGE P + + L++FY+D+ EVSN F +FV+ T Y TEAE
Sbjct: 1 MIKIKTGIFAIGTNNPIFIADGEGPKQQIYLNSFYIDKTEVSNRDFAKFVNVTKYKTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+FVF LL ++ + ++ V V + W W
Sbjct: 61 RFGDSFVFGSLLKQKSQENVTLV---------------------VAQVPW--------WL 91
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+ +WLHPEG +S I++RM+HPVVHVSWNDA+AYC W
Sbjct: 92 QVKN---------------------ANWLHPEGPESDIKNRMDHPVVHVSWNDAIAYCNW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE CRGGL +RL+PWGN L P G++RAN WQG+FP NNT DGY T+P
Sbjct: 131 LGKRLPTEAEWEVACRGGLSDRLYPWGNXLIPNGQYRANTWQGDFPNNNTKEDGYEGTSP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQ 313
V + NK+GLYNM+GNVWEWT DWW + S NP GP+ GT+KVKKGGSYLC++
Sbjct: 191 VREFPPNKYGLYNMIGNVWEWTFDWWTIEITTRGGSVNPTGPSHGTNKVKKGGSYLCHKS 250
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCA 340
YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 251 YCFRYRCAARSQNTPDTTAGNLGFRCA 277
>gi|198427283|ref|XP_002131627.1| PREDICTED: similar to sulfatase modifying factor 1 [Ciona
intestinalis]
Length = 377
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 176/339 (51%), Positives = 213/339 (62%), Gaps = 51/339 (15%)
Query: 7 PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
P V+R+ M L+P F MG+N L KDGE P R V LD FY+D HE SN +F FV A
Sbjct: 78 PNVKRFNQMKLIPKGEFTMGSNAK-LTKDGEDPERRVFLDDFYMDVHETSNIEFAHFVKA 136
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
T YVTEAEKF ++FV E L+S+E +KI+Q
Sbjct: 137 TNYVTEAEKFKNSFVLEMLISKETLSKITQ------------------------------ 166
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
VA W LP + WLHPEG DS I R+NHPV+HVSWN
Sbjct: 167 QVASSPWW---LPVDG-----------------ADWLHPEGPDSNISKRLNHPVIHVSWN 206
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA +YC W G RLPTEAEWE RGGLENRLFPWGN L P G++ N+WQGEFP NTA
Sbjct: 207 DAKSYCKWAGKRLPTEAEWEKAARGGLENRLFPWGNKLNPNGKYMINIWQGEFPKGNTAE 266
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
DGY +TAPV S+ NKFGL+N VGNVWEW DW++ +H ++NPKGP +G KVKKGG
Sbjct: 267 DGYAATAPVDSFPPNKFGLFNTVGNVWEWVEDWYSTYHELERAHNPKGPKSGKVKVKKGG 326
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
SY+C++++CYR+RC+AR +NTPDSS+ NLGFRCAAD P
Sbjct: 327 SYMCHKEFCYRYRCSARGENTPDSSSSNLGFRCAADTLP 365
>gi|328793659|ref|XP_003251911.1| PREDICTED: sulfatase-modifying factor 1-like isoform 1 [Apis
mellifera]
Length = 356
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 206/332 (62%), Gaps = 60/332 (18%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+ K+M+ + F +GTN PI I DGE P + + L++FY+D+ EVSN F +FV AT
Sbjct: 82 INHLKNMIKIKAGIFGIGTNDPIFIADGEGPKQQIYLNSFYIDKTEVSNHDFSKFVDATN 141
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y TEAE+FGD+FVF LL ++++ I+ V
Sbjct: 142 YKTEAERFGDSFVFRGLLKQKDQKNITLV------------------------------- 170
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
+A W + +WLHPEG +S I++RM+HPV+HVSWNDA
Sbjct: 171 ----------VAQAPWWLQVKNA---------NWLHPEGPESDIKNRMDHPVIHVSWNDA 211
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
+AYC W G RLPTEAEWE CRGGL +RL+PWGN L P G++RAN WQG+FP NNT DG
Sbjct: 212 IAYCNWLGKRLPTEAEWEVACRGGLSDRLYPWGNKLIPNGQYRANTWQGDFPNNNTKEDG 271
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
Y T+PV + NK+GLYNM+GNVWEWT DWW + + T G +KVKKGGSY
Sbjct: 272 YEGTSPVREFPPNKYGLYNMIGNVWEWTFDWWTI----------EITTRGINKVKKGGSY 321
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
LC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 322 LCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 353
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 35/45 (77%), Positives = 42/45 (93%)
Query: 346 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
T G +KVKKGGSYLC++ YC+R+RCAARSQNTPD++AGNLGFRCA
Sbjct: 309 TRGINKVKKGGSYLCHKSYCFRYRCAARSQNTPDTTAGNLGFRCA 353
>gi|332231569|ref|XP_003264967.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Nomascus
leucogenys]
Length = 354
Score = 330 bits (846), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 170/335 (50%), Positives = 204/335 (60%), Gaps = 70/335 (20%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+T
Sbjct: 88 HSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFVNSTGYLT 147
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAEKFGD+FVFE +LSE+ + I Q AVA
Sbjct: 148 EAEKFGDSFVFEGMLSEQVKTNIQQ------------------------------AVAAA 177
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W LP + +W HPE DSTI HR +HPV+HVSWNDA AY
Sbjct: 178 PWW---LPVKG-----------------ANWRHPEWPDSTIRHRPDHPVLHVSWNDAFAY 217
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
CTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP NT DG+
Sbjct: 218 CTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQG 277
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NP
Sbjct: 278 TAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPS------------------ 319
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YCYR+RCAARSQNTPDSSA NLGFRCAAD PT
Sbjct: 320 --YCYRYRCAARSQNTPDSSASNLGFRCAADCLPT 352
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 28/32 (87%), Positives = 29/32 (90%)
Query: 361 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
N YCYR+RCAARSQNTPDSSA NLGFRCAAD
Sbjct: 317 NPSYCYRYRCAARSQNTPDSSASNLGFRCAAD 348
>gi|156408243|ref|XP_001641766.1| predicted protein [Nematostella vectensis]
gi|156228906|gb|EDO49703.1| predicted protein [Nematostella vectensis]
Length = 290
Score = 327 bits (838), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 167/340 (49%), Positives = 210/340 (61%), Gaps = 51/340 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G F MGTNKP ++ DGE P+R V L FY+D +EVSN +F+ FV+ATG+ TEAE
Sbjct: 1 MVRIEGGEFTMGTNKPYIVVDGEGPARKVRLRPFYMDIYEVSNREFELFVNATGHKTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+FV +S+E IE +H V W W
Sbjct: 61 VFGDSFVLGSRVSKE---------------------IEKDIHQAVAAAPW--------W- 90
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + + W PEG D+ I RMNHPV+H+SWNDAVA+C W
Sbjct: 91 ---LPVKGAY-----------------WRRPEGPDTDIRERMNHPVLHISWNDAVAFCKW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAE+EY +GG ++ L+PWGN L G+H AN+WQG FP NTA DGY T P
Sbjct: 131 SGKRLPTEAEFEYALKGGSDDTLYPWGNELLVNGKHMANLWQGRFPVENTAEDGYEGTCP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ +N +GLYN++GN WEWT+DWWNV H NP GPTTG DKVK+GGSY+C+E Y
Sbjct: 191 VTAFPQNGYGLYNIIGNAWEWTSDWWNVRHTSDFQDNPTGPTTGKDKVKRGGSYMCHESY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKK 354
CYR+R AARSQN+PD++A NLGFRCA+D P G +KK
Sbjct: 251 CYRYRSAARSQNSPDTTASNLGFRCASDS-PPPGVTIIKK 289
>gi|291222785|ref|XP_002731393.1| PREDICTED: sulfatase modifying factor 1-like isoform 1
[Saccoglossus kowalevskii]
Length = 293
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 166/328 (50%), Positives = 199/328 (60%), Gaps = 50/328 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M + G TF MGT+ P ++ DGE P+R V LD F+ D++EVSN +FQ FV GYVTEAE
Sbjct: 1 MSYIKGGTFTMGTDTPGVLFDGEGPARKVKLDPFFADKYEVSNREFQLFVDDKGYVTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+FV E +SE+ + +++V V W V W+
Sbjct: 61 NFGDSFVLEARVSEKTLSTVTEV---------------------VAQTPWWLKVIGADWK 99
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
HPEG D+ I RM+HPV+HVSWNDAV +C W
Sbjct: 100 -----------------------------HPEGPDTNITDRMDHPVLHVSWNDAVEFCKW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAE+EY RGG ENR +PWGN L P G+ NVWQG FPT N+A DG+ T+P
Sbjct: 131 IGKRLPTEAEFEYASRGGKENRTYPWGNRLRPNGKWLMNVWQGVFPTENSAEDGFAGTSP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
VM+Y N FGL+NMVGN WEWT DWW H NPKGP +G DKVKKGGSY+C+ Y
Sbjct: 191 VMAYPPNGFGLHNMVGNAWEWTNDWWGTSHTADLYVNPKGPASGIDKVKKGGSYMCHASY 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
CYR+RCAARSQNTPDSSA NLGFRC AD
Sbjct: 251 CYRYRCAARSQNTPDSSASNLGFRCFAD 278
>gi|405962419|gb|EKC28098.1| Sulfatase-modifying factor 1 [Crassostrea gigas]
Length = 539
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 164/333 (49%), Positives = 205/333 (61%), Gaps = 51/333 (15%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+ R +M+ + TF MGT++P+ DGE P+R VT+ FYLD+HEVSN +FQ FV+ T
Sbjct: 68 IPRTNEMLFIERGTFTMGTDEPVFPVDGEGPARKVTVYGFYLDKHEVSNAEFQRFVAETK 127
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
YVTEAE FG++F E +SEE ++Q +V
Sbjct: 128 YVTEAESFGNSFCLENYVSEEVLKNVTQ------------------------------SV 157
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
A W +P W HP G DS+I+ M+HPVVHVSWNDA
Sbjct: 158 AAAPWW---VPING-----------------ADWRHPNGPDSSIKDIMDHPVVHVSWNDA 197
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
V YC W G RLPTEAE+E C+G L++RLFPWGN P+G+H N+W G+FP+ NTA DG
Sbjct: 198 VKYCEWAGKRLPTEAEFERACKGNLKSRLFPWGNKEEPKGKHWMNIWHGKFPSENTANDG 257
Query: 249 YLSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
Y +TAPV + E N FG+ N++GNVWEWT DWW P P NPKGP GTDKVKKGGS
Sbjct: 258 YHTTAPVTEFPEQNIFGVKNIIGNVWEWTQDWWETKFTPTPKKNPKGPRFGTDKVKKGGS 317
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
++C++ YCYR+RC ARSQNTPDSSA NLGFRCA
Sbjct: 318 FMCHKNYCYRYRCDARSQNTPDSSAVNLGFRCA 350
>gi|443707651|gb|ELU03164.1| hypothetical protein CAPTEDRAFT_155214 [Capitella teleta]
Length = 384
Score = 323 bits (828), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 162/337 (48%), Positives = 203/337 (60%), Gaps = 51/337 (15%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
ER MV LP +F MGTN PI+ DGE P+R V + F+ D +E SN +F+ FV++TGY
Sbjct: 75 ERTNQMVYLPTGSFMMGTNNPIIPPDGEGPARRVHITGFWYDVYETSNAEFELFVNSTGY 134
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
TEAE FGD+FV E +SE+ + ISQ AVA
Sbjct: 135 KTEAEGFGDSFVIEAFISEKIKEDISQ------------------------------AVA 164
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W LP + W HP G D+ ++ M+HPV+HVSWNDAV
Sbjct: 165 SAPWW---LPVKG-----------------ADWRHPFGPDTDLKGIMDHPVLHVSWNDAV 204
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AYC W RLPTEAE+EY CR G ++RLFPWG+N+ P+ +HR N+WQG+FP NTA DG+
Sbjct: 205 AYCQWAQKRLPTEAEFEYACRDGKDDRLFPWGSNMLPKNQHRMNIWQGDFPNTNTAEDGF 264
Query: 250 LSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
T PV +K + K G N++GN WEW DWW HH + NP GP GTDK KKGGSY
Sbjct: 265 ERTCPVTEFKPQTKLGQKNLIGNAWEWVQDWWQTEHHTGTTKNPVGPKQGTDKTKKGGSY 324
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+C + +CYR+RC ARSQNTPDSSA NLGFRCA++K P
Sbjct: 325 MCIKTHCYRYRCGARSQNTPDSSASNLGFRCASNKLP 361
>gi|37181290|gb|AAQ88459.1| AAPA3037 [Homo sapiens]
Length = 426
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 164/316 (51%), Positives = 201/316 (63%), Gaps = 54/316 (17%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q
Sbjct: 137 EKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTNIQQ------------------------ 172
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
AVA W LP + +W HPEG DSTI HR +HPV
Sbjct: 173 ------AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTILHRPDHPV 206
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPWGN L P+G+H AN+WQGEFP
Sbjct: 207 LHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFP 266
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD 300
NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D
Sbjct: 267 VTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKD 326
Query: 301 KVKKGGSYLCNEQYCY 316
+VKKGGSY+C+ Y
Sbjct: 327 RVKKGGSYMCHRSQEY 342
>gi|440905735|gb|ELR56080.1| Sulfatase-modifying factor 1, partial [Bos grunniens mutus]
Length = 338
Score = 322 bits (824), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 158/308 (51%), Positives = 200/308 (64%), Gaps = 54/308 (17%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P+PP + MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV
Sbjct: 85 PSPPTK----MVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAEFEKFV 140
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
++TGY+TEAEKFGD+FVFE +LSE+ ++ I Q
Sbjct: 141 NSTGYLTEAEKFGDSFVFEGMLSEQVKSDIQQ---------------------------- 172
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
AVA W LP + +W HPEG DST+ HR +HPV+HVS
Sbjct: 173 --AVAAAPWW---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVS 210
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
WNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT
Sbjct: 211 WNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNT 270
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ TAPV ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKK
Sbjct: 271 GEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSAEETINPKGPPSGKDRVKK 330
Query: 305 GGSYLCNE 312
GGSY+C++
Sbjct: 331 GGSYMCHK 338
>gi|189236927|ref|XP_969670.2| PREDICTED: similar to sulfatase-modifying factor 1 [Tribolium
castaneum]
Length = 267
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 167/328 (50%), Positives = 199/328 (60%), Gaps = 63/328 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+ TF MGT+KP+ D E P+RNV++D+FYLD+ EVSN +F EFV TGY TEAE
Sbjct: 1 MVLIEAATFEMGTDKPVFESDHEGPARNVSVDSFYLDKFEVSNGKFSEFVLKTGYKTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+F+FE L+ E ER K R
Sbjct: 61 EFGDSFIFEMLVPESEREKYKDFR------------------------------------ 84
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+A W +G +W HPEG STI R++HPV+HVSW DA YC +
Sbjct: 85 ----AVQAPWWIKMKGV---------TWKHPEGEGSTINDRLDHPVIHVSWTDAKKYCEF 131
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE CR GL +L+PWGN LTP+GEH AN+WQG+FP NTA DGYLSTAP
Sbjct: 132 VGKRLPTEAEWEMACRAGLRQKLYPWGNKLTPKGEHWANIWQGDFPLTNTAEDGYLSTAP 191
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V S+ NKFGL++M GNVWEWT D W +P KVKKGGSYLC+E Y
Sbjct: 192 VDSFPANKFGLHHMAGNVWEWTQDNW--------LNDPDA------KVKKGGSYLCHESY 237
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+R+RCAARS NT DSSAGNLGFRCA D
Sbjct: 238 CWRYRCAARSFNTKDSSAGNLGFRCAGD 265
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 36/43 (83%), Positives = 39/43 (90%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
KVKKGGSYLC+E YC+R+RCAARS NT DSSAGNLGFRCA DV
Sbjct: 224 KVKKGGSYLCHESYCWRYRCAARSFNTKDSSAGNLGFRCAGDV 266
>gi|312385017|gb|EFR29611.1| hypothetical protein AND_01279 [Anopheles darlingi]
Length = 350
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 164/335 (48%), Positives = 205/335 (61%), Gaps = 52/335 (15%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+RY+ M L+P + +GT++PI D E P+R VT+ AF+LDQ+EVSN QF+EFV TGY
Sbjct: 64 KRYEHMSLIPAGEYLIGTDQPIFPADKESPARPVTMAAFFLDQYEVSNKQFKEFVDQTGY 123
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
+TEAEKFGD+FVF+ L E R +
Sbjct: 124 ITEAEKFGDSFVFQRFLPAEVRKQ------------------------------------ 147
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGID--STIEHRMNHPVVHVSWND 187
Y +R A P W Y +G SW PE S + R++HPVVHVSWND
Sbjct: 148 YEDFRVAAAP----WWYKVQGA---------SWRFPEADKERSVEQDRLDHPVVHVSWND 194
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
AVAYC W+G RLPTEAEWE CRGG + +LFPWGN L P+ +H+ N+WQG FP ++ A D
Sbjct: 195 AVAYCHWKGKRLPTEAEWEVACRGGRKQKLFPWGNKLLPKDQHQMNIWQGAFPDSDLAED 254
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
G T PV +++N + LYN+VGNVWEWTAD W+ P+ K ++VKKGGS
Sbjct: 255 GCDGTCPVGKFRQNPYDLYNIVGNVWEWTADLWDAAEG-KPTAERKPGADPPNRVKKGGS 313
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YLC+E YCYR+RCAARSQNT DSSAGNLGFRCAAD
Sbjct: 314 YLCHESYCYRYRCAARSQNTEDSSAGNLGFRCAAD 348
>gi|321479101|gb|EFX90057.1| hypothetical protein DAPPUDRAFT_94253 [Daphnia pulex]
Length = 286
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 160/331 (48%), Positives = 209/331 (63%), Gaps = 50/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV++ G + +GT++P+ I DGE P+R+V LD +++D HEVSN F +FV + +VTEAE
Sbjct: 1 MVMVRGGIYEIGTDEPVFIADGESPARSVYLDDYFIDTHEVSNQDFADFVEISNFVTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFG +FVFE +LS+E +++IS+V + PVV+ SW
Sbjct: 61 KFGSSFVFEGILSDETKSEISKVVASAPWW------------MPVVNASWK--------- 99
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
PEG +S+I +MNHP +H+SWNDAV++C +
Sbjct: 100 -----------------------------FPEGPNSSISDKMNHPAIHISWNDAVSFCDF 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+EAEWE C+GGL+ RL+PWGN L P G+H N+WQG+FP NTA DGY+ TAP
Sbjct: 131 YGKRLPSEAEWEVACQGGLKQRLYPWGNKLNPYGKHWTNIWQGKFPGENTAEDGYVGTAP 190
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ NK+GLY MVGNVWEWT DWW V H NP GP G DKVKKGGSY+C++
Sbjct: 191 VDAFPPNKYGLYQMVGNVWEWTMDWWQVDHSSELLRNPMGPKKGKDKVKKGGSYMCHQST 250
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
C+R+RCAARSQNTPDSS+ NLGFRCA P
Sbjct: 251 CFRYRCAARSQNTPDSSSSNLGFRCAMSSSP 281
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 26/38 (68%), Positives = 32/38 (84%)
Query: 357 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADVS 394
SY+C++ C+R+RCAARSQNTPDSS+ NLGFRCA S
Sbjct: 243 SYMCHQSTCFRYRCAARSQNTPDSSSSNLGFRCAMSSS 280
>gi|332375104|gb|AEE62693.1| unknown [Dendroctonus ponderosae]
Length = 332
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 162/329 (49%), Positives = 194/329 (58%), Gaps = 63/329 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEA 73
M L+ F MGTNKP+ D E P+RNVT++ +FYLD +EVSN QF +FV T Y TEA
Sbjct: 64 MALIGKGIFEMGTNKPVFPSDFEGPARNVTIENSFYLDIYEVSNQQFYDFVRTTNYKTEA 123
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FGD+FVFE L E +R + +R
Sbjct: 124 EQFGDSFVFEMSLPENQRNEHQDIR----------------------------------- 148
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+A W P W HPEG STIE RMNHPV HVSWNDAVAYC
Sbjct: 149 -----AAQAPWWIK---------LPDAYWKHPEGPKSTIEDRMNHPVAHVSWNDAVAYCE 194
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
+ G RLPTEAEWE CRGGL +++PWGN L P+G+H AN+WQGEFP NTA DGY+ T
Sbjct: 195 YVGKRLPTEAEWEMACRGGLRQKMYPWGNKLQPKGQHWANIWQGEFPKENTAEDGYIFTC 254
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV + N+FGLYNM GNVWEW D W +P+ +VKKGGS+LC++
Sbjct: 255 PVDKFPPNQFGLYNMAGNVWEWVQDDWQT--------DPQ-----NSRVKKGGSFLCHQS 301
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC+R+RCAARS NT DSSA NLGFRCAAD
Sbjct: 302 YCWRYRCAARSFNTKDSSAANLGFRCAAD 330
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 32/43 (74%), Positives = 38/43 (88%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+VKKGGS+LC++ YC+R+RCAARS NT DSSA NLGFRCAAD
Sbjct: 289 RVKKGGSFLCHQSYCWRYRCAARSFNTKDSSAANLGFRCAADA 331
>gi|334134870|ref|ZP_08508371.1| hypothetical protein HMPREF9413_2437 [Paenibacillus sp. HGF7]
gi|333607372|gb|EGL18685.1| hypothetical protein HMPREF9413_2437 [Paenibacillus sp. HGF7]
Length = 326
Score = 304 bits (778), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 158/330 (47%), Positives = 199/330 (60%), Gaps = 52/330 (15%)
Query: 15 MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+LLPG +F MGT + DGE P V L FY+D H VSN QF+EF TGYVTEA
Sbjct: 46 MILLPGGSFDMGTEDGEGFPADGEGPVHEVHLKPFYVDPHAVSNAQFREFARQTGYVTEA 105
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF +S E ++QV
Sbjct: 106 ERFGWSFVFHSFVSPETARNVTQV------------------------------------ 129
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P++ W Y RG SW HPEG DS + RM+HPV+HVSWNDA AYC
Sbjct: 130 -----PSQTPWWYVVRGA---------SWDHPEGPDSDLRDRMDHPVLHVSWNDAAAYCK 175
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RL TEAEWEY RGGL + +PWG+ L P G HR N+WQG+FP + +DGY TA
Sbjct: 176 WAGKRLLTEAEWEYAARGGLVRKRYPWGDELKPGGGHRCNIWQGKFPLKDNGSDGYRGTA 235
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
PV +Y N +GLYN+ GNVWEW +DW++ ++ +P+ NP+GP +G +V +GGSYLC++
Sbjct: 236 PVHAYPPNGYGLYNVSGNVWEWCSDWFSPAYYACSPADNPQGPASGDARVMRGGSYLCHK 295
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R AARS+NTPDSSAGN+GFRCAAD
Sbjct: 296 SYCNRYRTAARSKNTPDSSAGNIGFRCAAD 325
>gi|195170615|ref|XP_002026107.1| GL16088 [Drosophila persimilis]
gi|194110987|gb|EDW33030.1| GL16088 [Drosophila persimilis]
Length = 339
Score = 303 bits (776), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 193/328 (58%), Gaps = 59/328 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+PG + +GT+ P D E P R V L FYLD+HEVSN +F+EFVSAT Y TEAE
Sbjct: 70 MSLIPGGSVHIGTDIPHFQADRESPERLVKLKDFYLDKHEVSNAKFEEFVSATKYTTEAE 129
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+F+F+ LLS E+ ++ R
Sbjct: 130 RFGDSFLFKTLLSAAEQKRLEDYR------------------------------------ 153
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W Y G W +P G+DS++ HPVVHVSW DAVAYCTW
Sbjct: 154 ----VASAPWWYKVSGV---------GWRNPNGVDSSLNGLEQHPVVHVSWRDAVAYCTW 200
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+EAEWE CRGG + +LFPWGN L P+ +H N+WQG+FP NTA DGY T P
Sbjct: 201 AGKRLPSEAEWEAACRGGKQRKLFPWGNKLMPKDQHWLNIWQGDFPDGNTADDGYEGTCP 260
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V Y++N + LYNMVGNVWEWTAD W+ + ++VKKGGSYLC++ Y
Sbjct: 261 VDRYRQNVYDLYNMVGNVWEWTADLWDT----------SDVSESPNRVKKGGSYLCHKSY 310
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+R+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 311 CFRYRCAARSQNTEDSSAGNLGFRCAKD 338
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YC+R+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 296 NRVKKGGSYLCHKSYCFRYRCAARSQNTEDSSAGNLGFRCAKDA 339
>gi|68051697|gb|AAY85112.1| GH26754p [Drosophila melanogaster]
Length = 337
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 161/329 (48%), Positives = 196/329 (59%), Gaps = 59/329 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DM LLPG T MGT+KP D E P R V L+ FY+D++EVSN F +FV T Y TEA
Sbjct: 67 DMSLLPGGTVYMGTDKPHFPADREAPERQVKLNDFYIDKYEVSNEAFAKFVLHTNYTTEA 126
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E++GD+F+F+ LLS E+ + R V W VA W
Sbjct: 127 ERYGDSFLFKSLLSPLEQKNLEDFR--------------------VASAVWWYKVAGVNW 166
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R HP G+DS I+H HPVVHVSW DAV YC
Sbjct: 167 R-----------------------------HPNGVDSDIDHLGRHPVVHVSWRDAVEYCK 197
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLP+EAEWE CRGG E +LFPWGN L PR EH N+WQG+FP N A DG+ T+
Sbjct: 198 WAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFEYTS 257
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV ++++N + L+NMVGNVWEWTAD W+V+ S NP ++VKKGGSYLC++
Sbjct: 258 PVDAFRQNIYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLCHKS 307
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 308 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 336
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 294 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 337
>gi|20130397|ref|NP_612003.1| CG7049 [Drosophila melanogaster]
gi|7291920|gb|AAF47337.1| CG7049 [Drosophila melanogaster]
Length = 336
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 161/329 (48%), Positives = 196/329 (59%), Gaps = 59/329 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DM LLPG T MGT+KP D E P R V L+ FY+D++EVSN F +FV T Y TEA
Sbjct: 66 DMSLLPGGTVYMGTDKPHFPADREAPERQVKLNDFYIDKYEVSNEAFAKFVLHTNYTTEA 125
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E++GD+F+F+ LLS E+ + R V W VA W
Sbjct: 126 ERYGDSFLFKSLLSPLEQKNLEDFR--------------------VASAVWWYKVAGVNW 165
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R HP G+DS I+H HPVVHVSW DAV YC
Sbjct: 166 R-----------------------------HPNGVDSDIDHLGRHPVVHVSWRDAVEYCK 196
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLP+EAEWE CRGG E +LFPWGN L PR EH N+WQG+FP N A DG+ T+
Sbjct: 197 WAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFEYTS 256
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV ++++N + L+NMVGNVWEWTAD W+V+ S NP ++VKKGGSYLC++
Sbjct: 257 PVDAFRQNIYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLCHKS 306
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 307 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 335
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 293 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 336
>gi|17862416|gb|AAL39685.1| LD26422p [Drosophila melanogaster]
Length = 303
Score = 302 bits (774), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 161/329 (48%), Positives = 196/329 (59%), Gaps = 59/329 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DM LLPG T MGT+KP D E P R V L+ FY+D++EVSN F +FV T Y TEA
Sbjct: 33 DMSLLPGGTVYMGTDKPHFPADREAPERQVKLNDFYIDKYEVSNEAFAKFVLHTNYTTEA 92
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E++GD+F+F+ LLS E+ + R V W VA W
Sbjct: 93 ERYGDSFLFKSLLSPLEQKNLEDFR--------------------VASAVWWYKVAGVNW 132
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R HP G+DS I+H HPVVHVSW DAV YC
Sbjct: 133 R-----------------------------HPNGVDSDIDHLGRHPVVHVSWRDAVEYCK 163
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLP+EAEWE CRGG E +LFPWGN L PR EH N+WQG+FP N A DG+ T+
Sbjct: 164 WAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFEYTS 223
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV ++++N + L+NMVGNVWEWTAD W+V+ S NP ++VKKGGSYLC++
Sbjct: 224 PVDAFRQNIYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLCHKS 273
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 274 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 302
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 260 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 303
>gi|261406171|ref|YP_003242412.1| Non-specific serine/threonine protein kinase [Paenibacillus sp.
Y412MC10]
gi|261282634|gb|ACX64605.1| Non-specific serine/threonine protein kinase [Paenibacillus sp.
Y412MC10]
Length = 328
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 162/336 (48%), Positives = 199/336 (59%), Gaps = 54/336 (16%)
Query: 12 YKDMVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ DM+ LPG TF MGTN +DGE P+R+VT+ F + H V+N ++Q FV ATGYV
Sbjct: 43 HTDMIKLPGGTFTMGTNSNEGFPRDGEGPARSVTVSGFEISPHAVTNGEYQRFVEATGYV 102
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE+FG +FVFE L SEE +A ++QV
Sbjct: 103 TEAERFGWSFVFELLASEETKAAVAQV--------------------------------- 129
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
P E W G W PEG DS+IE RM+HPVVH+SWNDA A
Sbjct: 130 --------PQEVPWWLVVEGAY---------WAAPEGADSSIEDRMDHPVVHISWNDAEA 172
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLPTEAEWEY RGGLE + +PWG+ L GEH+ N+WQG+FP N A+DGY+
Sbjct: 173 YCQWAGVRLPTEAEWEYAARGGLEGKTYPWGDLLKQDGEHQCNIWQGKFPVKNNASDGYI 232
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNP-KGPTTGTDKVKKGGSY 308
TAPV++YK N +GLYNM GNVWEW DW++ +H + NP G TG + +GGSY
Sbjct: 233 GTAPVIAYKPNGYGLYNMSGNVWEWCGDWFSPSYHQQTTAINPFYGEPTGR-RSMRGGSY 291
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
LC+ YC R+R AARS NTPDSS GN GFR D G
Sbjct: 292 LCHRSYCNRYRVAARSGNTPDSSTGNCGFRVVRDGG 327
>gi|157116189|ref|XP_001652787.1| hypothetical protein AaeL_AAEL007497 [Aedes aegypti]
gi|108876575|gb|EAT40800.1| AAEL007497-PA [Aedes aegypti]
Length = 343
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 158/329 (48%), Positives = 193/329 (58%), Gaps = 60/329 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+ G + +GTN+P + D E P R V L+AFYLD++EVSN +F EF+ T Y+TEAE
Sbjct: 72 MSLIHGGKYLIGTNEPFFVGDHEGPEREVDLEAFYLDRYEVSNREFAEFIEKTAYITEAE 131
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+FVF+ L + R K R VA W
Sbjct: 132 RFGDSFVFQEFLDPKVREKYKDYR-----------------------------VAAAVW- 161
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST-IEHRMNHPVVHVSWNDAVAYCT 193
W P SW +PEG IE RM+HPVVHVSWNDAVAYC
Sbjct: 162 ---------W----------FKIPGASWRYPEGDKQRGIESRMDHPVVHVSWNDAVAYCK 202
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLP+EAEWE CRGG + +LFPWGN + P+ EH N+WQGEFP N DG T
Sbjct: 203 WLDKRLPSEAEWEAACRGGRKGKLFPWGNKMMPKDEHYMNIWQGEFPEGNRGEDGCEGTC 262
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV +++NK+ LYNMVGNVWEWTAD W+ + P ++VKKGGSYLC++
Sbjct: 263 PVDRFRQNKYDLYNMVGNVWEWTADLWDAKENVKPP----------NRVKKGGSYLCHKS 312
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YCYR+RCAARSQNT DS+AGNLGFRCAAD
Sbjct: 313 YCYRYRCAARSQNTEDSTAGNLGFRCAAD 341
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 36/45 (80%), Positives = 42/45 (93%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADVS 394
++VKKGGSYLC++ YCYR+RCAARSQNT DS+AGNLGFRCAADV
Sbjct: 299 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSTAGNLGFRCAADVD 343
>gi|170055860|ref|XP_001863770.1| sulfatase-modifying factor 1 [Culex quinquefasciatus]
gi|167875738|gb|EDS39121.1| sulfatase-modifying factor 1 [Culex quinquefasciatus]
Length = 348
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 162/334 (48%), Positives = 198/334 (59%), Gaps = 61/334 (18%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E++ M L+PG + +GT +PI +D E R V ++AFY+D++EVSN F+EFV AT Y
Sbjct: 73 EKFNAMSLIPGGKYVIGTREPIFPEDREH-ERKVAINAFYMDKYEVSNGDFREFVEATAY 131
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
+EAE+FGD+FVF L EE R K HD +
Sbjct: 132 ESEAERFGDSFVFHGFLDEEVRKKF----HDFRV-------------------------- 161
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDS-TIEHRMNHPVVHVSWNDA 188
A W Y G SW HPEG S +E R++HPVVHVSWNDA
Sbjct: 162 ----------AAAPWWYKVTGA---------SWRHPEGDPSRNVEDRLDHPVVHVSWNDA 202
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
VAYC W G RLPTEAEWE CRGG + +LFPWGN + P +H N+WQGEFP + A DG
Sbjct: 203 VAYCDWAGKRLPTEAEWEVACRGGRKQKLFPWGNKMMPNDKHYMNIWQGEFPEADLAEDG 262
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
T PV + +N FGL++MVGNVWEWTAD W+ +P ++VKKGGSY
Sbjct: 263 CKGTCPVDRFSQNTFGLHSMVGNVWEWTADRWDEEEKVSPP----------NRVKKGGSY 312
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
LC+E YCYR+RCAARSQNT DSSAGNLGFRCAAD
Sbjct: 313 LCHESYCYRYRCAARSQNTEDSSAGNLGFRCAAD 346
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/43 (86%), Positives = 41/43 (95%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
++VKKGGSYLC+E YCYR+RCAARSQNT DSSAGNLGFRCAAD
Sbjct: 304 NRVKKGGSYLCHESYCYRYRCAARSQNTEDSSAGNLGFRCAAD 346
>gi|198462565|ref|XP_002135327.1| GA28486 [Drosophila pseudoobscura pseudoobscura]
gi|198150880|gb|EDY73954.1| GA28486 [Drosophila pseudoobscura pseudoobscura]
Length = 339
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 192/328 (58%), Gaps = 59/328 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+PG + +GT+ P D E P R V L FYLD+HEVSN +F EFVSAT Y TEAE
Sbjct: 70 MSLIPGGSVHIGTDIPHFQADRESPERLVKLKDFYLDKHEVSNAKFGEFVSATKYTTEAE 129
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+F+F+ LLS E+ ++ R
Sbjct: 130 RFGDSFLFKTLLSAAEQKRLEDYR------------------------------------ 153
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W Y G W +P G+DS++ HPVVHVSW DAVAYCTW
Sbjct: 154 ----VASAPWWYKVSGV---------GWRNPNGVDSSLNGLEQHPVVHVSWRDAVAYCTW 200
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+EAEWE CRGG + +LFPWGN L P+ +H N+WQG+FP NTA DGY T P
Sbjct: 201 AGKRLPSEAEWEAACRGGKQRKLFPWGNKLMPKDQHWLNIWQGDFPDGNTADDGYEGTCP 260
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V Y++N + LYNMVGNVWEWTAD W+ + ++VKKGGSYLC++ Y
Sbjct: 261 VDRYRQNVYDLYNMVGNVWEWTADLWDT----------SDVSESPNRVKKGGSYLCHKSY 310
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+R+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 311 CFRYRCAARSQNTEDSSAGNLGFRCAKD 338
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YC+R+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 296 NRVKKGGSYLCHKSYCFRYRCAARSQNTEDSSAGNLGFRCAKDA 339
>gi|195586666|ref|XP_002083094.1| GD13538 [Drosophila simulans]
gi|194195103|gb|EDX08679.1| GD13538 [Drosophila simulans]
Length = 343
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 160/332 (48%), Positives = 196/332 (59%), Gaps = 59/332 (17%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ DM LLPG T MGT+KP D E P R V L FY+D++EVSN F +FV T Y
Sbjct: 70 KIADMSLLPGGTVYMGTDKPHFPADREGPERQVKLKDFYIDKYEVSNEAFAKFVLHTNYT 129
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE++GD+F+F+ LLS E+ + R V W VA
Sbjct: 130 TEAERYGDSFLFKSLLSPSEQKNLEDFR--------------------VASAVWWYKVAG 169
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
WR P G+DS I+H HPVVHVSW DAV
Sbjct: 170 VNWR-----------------------------RPNGVDSNIDHLGRHPVVHVSWRDAVE 200
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLP+EAEWE CRGG E++LFPWGN L PR EH N+WQG+FP N A DG+
Sbjct: 201 YCKWAGKRLPSEAEWEAACRGGKEHKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFE 260
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
T+PV ++++N + L+NMVGNVWEWTAD W+V+ S NP ++VKKGGSYLC
Sbjct: 261 YTSPVDAFRQNMYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLC 310
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 311 HKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 342
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 300 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 343
>gi|390452773|ref|ZP_10238301.1| Non-specific serine/threonine protein kinase [Paenibacillus peoriae
KCTC 3763]
Length = 335
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 161/338 (47%), Positives = 205/338 (60%), Gaps = 53/338 (15%)
Query: 7 PPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
P +ER M+ L G TFRMGT+ + +DGE P+R VTL FY+D++ V+N QF +FV
Sbjct: 48 PTLER-TGMIRLEGGTFRMGTDSREGFEQDGEGPAREVTLSPFYIDRYAVTNAQFTDFVK 106
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
T YVTEAE+FG++FVF LL E+R Q R M V+ W
Sbjct: 107 DTNYVTEAERFGNSFVFHLLLPSEQR----QNRATM-----------------VMQTPWW 145
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
V W+ HPEG S ++ RM+HPVVHVSW
Sbjct: 146 HVVEGADWK-----------------------------HPEGKSSDVKERMDHPVVHVSW 176
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
NDA+AYC W G RLPTEAEWEY RGGL+ +PWG+ LTP G+H N+WQG+FP +N A
Sbjct: 177 NDALAYCKWTGKRLPTEAEWEYAARGGLDQARYPWGDELTPEGKHMCNIWQGKFPISNDA 236
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGPTTGTDKVKK 304
DGY+ TAPV +Y+ N FGLYN+ GNVWEW AD +N ++H S NP G + G + +
Sbjct: 237 TDGYVGTAPVDAYEPNGFGLYNVAGNVWEWCADEFNPYYHLQSSMINPLGVSRGNVRSMR 296
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGSYLC++ YC R+R AAR++NTPDSS+GN GFRC AD
Sbjct: 297 GGSYLCHDSYCNRYRVAARNKNTPDSSSGNTGFRCVAD 334
>gi|313226833|emb|CBY21978.1| unnamed protein product [Oikopleura dioica]
Length = 779
Score = 300 bits (769), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 156/336 (46%), Positives = 198/336 (58%), Gaps = 52/336 (15%)
Query: 7 PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDA-FYLDQHEVSNTQFQEFVS 65
P +RY +P TF MG N+ ++ DGE P R VT+ + F++D++EV+N++F FV
Sbjct: 482 PRHQRYNQEARIPAGTFTMGDNEELMPGDGEAPERRVTISSDFFIDKYEVTNSEFYRFVH 541
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
TGYVTE+EKFGD+F F + E+ + ++ V W
Sbjct: 542 ETGYVTESEKFGDSFCFIKYIPEKTK---------------------EKIELEVKETPW- 579
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
W LP + SWLHP G ++ + +HP VH+SW
Sbjct: 580 -------W----LPVQG-----------------ASWLHPVGPENDLTGIWDHPAVHISW 611
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
NDA YC W RLPTEAEWEY RGGLE RLF WGN P+G+H N+WQG+FP+ NT
Sbjct: 612 NDATEYCKWADKRLPTEAEWEYAARGGLEGRLFSWGNKDKPKGQHYMNIWQGDFPSVNTE 671
Query: 246 ADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ STAPV Y +NK+GL+NM+GNVWEWTAD W H +PKGP TGTD+VKK
Sbjct: 672 EDGHGSTAPVDYYPSQNKYGLFNMLGNVWEWTADNWGTRHLGDAVTDPKGPPTGTDRVKK 731
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GGS+LC QYCYR+R AAR NTPDS+A NLGFRCA
Sbjct: 732 GGSFLCTTQYCYRYRNAARHYNTPDSAAQNLGFRCA 767
>gi|313240480|emb|CBY32814.1| unnamed protein product [Oikopleura dioica]
Length = 779
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 156/336 (46%), Positives = 197/336 (58%), Gaps = 52/336 (15%)
Query: 7 PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDA-FYLDQHEVSNTQFQEFVS 65
P +RY +P TF MG N+ ++ DGE P R VT+ + F++D++EV+N++F FV
Sbjct: 482 PRHQRYNQEARIPAGTFTMGDNEELMPGDGEAPERRVTISSDFFIDKYEVTNSEFYRFVH 541
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
TGYVTE+EKFGD+F F + E+ + ++ V W
Sbjct: 542 ETGYVTESEKFGDSFCFIKYIPEKTK---------------------EKIELEVKETPW- 579
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
W LP + SWLHP G ++ + +HP VH+SW
Sbjct: 580 -------W----LPVQG-----------------ASWLHPVGPENDLTGIWDHPAVHISW 611
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
NDA YC W RLPTEAEWEY RGGLE RLF WGN P+G+H N+WQG+FPT NT
Sbjct: 612 NDATEYCKWADKRLPTEAEWEYAARGGLEGRLFSWGNKDKPKGQHYMNIWQGDFPTVNTE 671
Query: 246 ADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
DG+ STAPV Y +N +GL+NM+GNVWEWTAD W H +PKGP TGTD+VKK
Sbjct: 672 EDGHGSTAPVDYYPSQNIYGLFNMLGNVWEWTADNWGTRHSGDAVTDPKGPPTGTDRVKK 731
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GGS+LC QYCYR+R AAR NTPDS+A NLGFRCA
Sbjct: 732 GGSFLCTTQYCYRYRNAARHYNTPDSAAQNLGFRCA 767
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 43/69 (62%), Positives = 49/69 (71%), Gaps = 5/69 (7%)
Query: 331 SAGNLGFRCAAD-----KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 385
+A N G R + D KGP TGTD+VKKGGS+LC QYCYR+R AAR NTPDS+A NL
Sbjct: 703 TADNWGTRHSGDAVTDPKGPPTGTDRVKKGGSFLCTTQYCYRYRNAARHYNTPDSAAQNL 762
Query: 386 GFRCAADVS 394
GFRCA S
Sbjct: 763 GFRCARTAS 771
>gi|297670803|ref|XP_002813544.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Pongo abelii]
Length = 349
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 133/186 (71%), Positives = 153/186 (82%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPEG DSTI HR++HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTIRHRLDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341
Query: 341 ADKGPT 346
AD+ PT
Sbjct: 342 ADRLPT 347
>gi|195336229|ref|XP_002034744.1| GM14301 [Drosophila sechellia]
gi|194127837|gb|EDW49880.1| GM14301 [Drosophila sechellia]
Length = 344
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 159/332 (47%), Positives = 195/332 (58%), Gaps = 59/332 (17%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ DM LLPG MGT+KP D E P R V L FY+D++EVSN F +FV T Y
Sbjct: 71 KIADMSLLPGGRVYMGTDKPHFPADREGPERQVKLKDFYIDKYEVSNEAFAKFVLHTNYT 130
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE++GD+F+F+ LLS E+ + R V W VA
Sbjct: 131 TEAERYGDSFLFKSLLSPSEQKNLEDFR--------------------VASAVWWYKVAG 170
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
WR P G+DS I+H HPVVHVSW DAV
Sbjct: 171 VNWR-----------------------------RPNGVDSNIDHLGRHPVVHVSWRDAVE 201
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLP+EAEWE CRGG E++LFPWGN L PR EH N+WQG+FP N A DG+
Sbjct: 202 YCKWAGKRLPSEAEWEAACRGGKEHKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFE 261
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
T+PV ++++N + L+NMVGNVWEWTAD W+V+ S NP ++VKKGGSYLC
Sbjct: 262 YTSPVDAFRQNMYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLC 311
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 312 HKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 343
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 34/43 (79%), Positives = 40/43 (93%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 301 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 343
>gi|194746826|ref|XP_001955855.1| GF24897 [Drosophila ananassae]
gi|190623137|gb|EDV38661.1| GF24897 [Drosophila ananassae]
Length = 342
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 189/328 (57%), Gaps = 59/328 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+PG F +GTNKP D E P R L+ FYLD++EVSN F FV AT Y TEAE
Sbjct: 73 MSLIPGGIFTVGTNKPHFHADREGPERQEKLNDFYLDKYEVSNDAFSIFVEATNYTTEAE 132
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+F+F+ LL +E+ + +R
Sbjct: 133 HFGDSFMFKTLLGADEQKAMEHLR------------------------------------ 156
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W Y G SW HP G D +E HPVVHV+W DAVAYCTW
Sbjct: 157 ----VASAPWWYKVSGV---------SWRHPNGKDKGLEGLGQHPVVHVTWRDAVAYCTW 203
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+EAEWE CRGG E +LFPWGN L PR +H N+WQG+FP N A DG+ T P
Sbjct: 204 IGKRLPSEAEWEVACRGGKERKLFPWGNKLMPREQHWLNIWQGDFPEGNLAEDGFEFTCP 263
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++++N F LYNMVGNVWEWTAD W+ + + ++VKKGGSYLC++ Y
Sbjct: 264 VDAFRQNAFDLYNMVGNVWEWTADLWDTN----------DVSESPNRVKKGGSYLCHKSY 313
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
CYR+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 314 CYRYRCAARSQNTEDSSAGNLGFRCAKD 341
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 35/44 (79%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA D
Sbjct: 299 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKDA 342
>gi|269128996|ref|YP_003302366.1| Non-specific serine/threonine protein kinase [Thermomonospora
curvata DSM 43183]
gi|268313954|gb|ACZ00329.1| Non-specific serine/threonine protein kinase [Thermomonospora
curvata DSM 43183]
Length = 303
Score = 298 bits (764), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 197/340 (57%), Gaps = 56/340 (16%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
P E K MV +PG TFRMG P +DGE P R V L F +D++ VSN QF FV A
Sbjct: 13 PQEIAKGMVAIPGGTFRMGGEDPDAFPEDGEGPVRTVRLSPFLIDRYAVSNRQFAAFVKA 72
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TGYVT+AE++G +FVF ++ PV+ D
Sbjct: 73 TGYVTDAERYGWSFVFHAHVAPGT---------------------------PVM-----D 100
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
AV EA W G W PEG S+I R NHPVVHVSWN
Sbjct: 101 AVV----------PEAPWWVAVPGAY---------WKAPEGPGSSITDRPNHPVVHVSWN 141
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DAVAY TW G RLPTEAEWE RGGL+ +PWGN LTPRG HR N+WQG FP ++T
Sbjct: 142 DAVAYATWAGKRLPTEAEWEMAARGGLDQARYPWGNELTPRGRHRCNIWQGTFPVHDTGE 201
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGTDKV 302
DGY TAPV ++ N +GLYN+ GNVWEW ADWW+ H PA +P+GP TGT +V
Sbjct: 202 DGYTGTAPVNAFAPNGYGLYNVAGNVWEWCADWWSADWHATESPATRIDPRGPETGTARV 261
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGS+LC+E YC R+R AAR+ NTPDSSA + GFRCAAD
Sbjct: 262 TKGGSFLCHESYCNRYRVAARTCNTPDSSAAHTGFRCAAD 301
>gi|329924928|ref|ZP_08279875.1| hypothetical protein HMPREF9412_1633 [Paenibacillus sp. HGF5]
gi|328940312|gb|EGG36641.1| hypothetical protein HMPREF9412_1633 [Paenibacillus sp. HGF5]
Length = 300
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 197/336 (58%), Gaps = 54/336 (16%)
Query: 12 YKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ DM++LPG TF MGT + +DGE P+R+VT+ F + H V+N ++Q FV AT YV
Sbjct: 15 HTDMIMLPGGTFTMGTYSNEGFPRDGEGPARSVTVSGFEISPHAVTNGEYQRFVEATDYV 74
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE+FG +FVFE L SE+ +A ++QV
Sbjct: 75 TEAERFGWSFVFELLASEKTKAAVAQV--------------------------------- 101
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
P E W G W PEG DS+IE RM+HPVVH+SWNDA A
Sbjct: 102 --------PQEVPWWLVVEGAY---------WAAPEGADSSIEGRMDHPVVHISWNDAAA 144
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLPTEAEWEY RGGLE R +PWG+ L GEH+ N+WQG+FP N A+DGY+
Sbjct: 145 YCQWAGVRLPTEAEWEYAARGGLEGRTYPWGDLLKQDGEHQCNIWQGKFPVKNNASDGYI 204
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--GPTTGTDKVKKGGSY 308
TAPV +YK N +GLYNM GNVWEW DW++ +H S N G TG + +GGSY
Sbjct: 205 GTAPVDAYKPNGYGLYNMSGNVWEWCGDWFSPSYHQQTSANNPFYGEPTGR-RSMRGGSY 263
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
LC+ YC R+R AARS NTPDSS GN GFR D G
Sbjct: 264 LCHRSYCNRYRVAARSGNTPDSSTGNCGFRVVRDDG 299
>gi|444707484|gb|ELW48757.1| Sulfatase-modifying factor 1 [Tupaia chinensis]
Length = 415
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 165/368 (44%), Positives = 215/368 (58%), Gaps = 55/368 (14%)
Query: 4 LPAP-PVERYKD---MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
+P P P ER + MV +P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +
Sbjct: 76 VPGPVPGERQPEATKMVPIPAGVFTMGTDDPQIKQDGEAPARRVAIDAFYMDAYEVSNAE 135
Query: 60 FQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV 119
F++FV++TGY+TEAEKFGD+FVFE +LSE+ + I Q D + F G+ S + P
Sbjct: 136 FEKFVNSTGYLTEAEKFGDSFVFEGMLSEQVKTGIQQAMED-EDFSGV-SAARTESNVPP 193
Query: 120 VHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHP 179
+IS W L ++L P G L + + I+ P
Sbjct: 194 ANISL-------FWVA----------------LRSQLLPGGYLL--KALTGDIQK---GP 225
Query: 180 VVHVSWNDAVAYCTWRGA-RLPTEAEWEYGCRGGLENR--------------------LF 218
++ + + T G RLPT E GC L LF
Sbjct: 226 ILLSYTGQIIQFSTSPGTMRLPTARGRESGCPQKLSGNTAVEEAYKIVQNDADVPSILLF 285
Query: 219 PWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
PWGN L PRG+H AN+WQGEFP NTA DG+ TAPV ++ N +GLYN+VGN WEWT+D
Sbjct: 286 PWGNKLQPRGQHYANIWQGEFPVTNTAEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSD 345
Query: 279 WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
WW V+H + NPKGP +G D+VKKGGSY+C++ YCYR+RCAARSQNTPDSSA NLGFR
Sbjct: 346 WWTVYHSVEETLNPKGPPSGKDRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSASNLGFR 405
Query: 339 CAADKGPT 346
CAAD+ PT
Sbjct: 406 CAADRLPT 413
>gi|195489995|ref|XP_002092959.1| GE21047 [Drosophila yakuba]
gi|194179060|gb|EDW92671.1| GE21047 [Drosophila yakuba]
Length = 336
Score = 298 bits (762), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 157/332 (47%), Positives = 197/332 (59%), Gaps = 59/332 (17%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ DM L+PG T +GT++P D E P R V L+ FY+D++EVSN F +FV T Y
Sbjct: 63 KIADMSLVPGGTVYVGTDQPHFPADREAPERQVKLNDFYIDKYEVSNDAFAKFVLQTNYT 122
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE+FGD+F+F+ LLS E+ + R V W VA
Sbjct: 123 TEAERFGDSFLFKSLLSPSEQKDLEDFR--------------------VASAVWWYKVAG 162
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
WR HP G+DS +++ HPVVHVSW DAV
Sbjct: 163 VNWR-----------------------------HPNGVDSNLDNLGRHPVVHVSWRDAVE 193
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLP+EAEWE CRGG E +LFPWGN L PR EH N+WQG+FP N A DG+
Sbjct: 194 YCEWAGKRLPSEAEWEAACRGGKERKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFE 253
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
T+PV ++++N + L+NMVGNVWEWTAD W+V+ S NP ++VKKGGSYLC
Sbjct: 254 YTSPVDAFRQNLYDLHNMVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLC 303
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 304 HKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 335
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 293 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 336
>gi|251794995|ref|YP_003009726.1| non-specific serine/threonine protein kinase [Paenibacillus sp.
JDR-2]
gi|247542621|gb|ACS99639.1| Non-specific serine/threonine protein kinase [Paenibacillus sp.
JDR-2]
Length = 332
Score = 296 bits (759), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 154/326 (47%), Positives = 195/326 (59%), Gaps = 52/326 (15%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ +PG TF MGT+ K DGE P +VT+++FY+D+ V+N QF F++ATGYVTEA
Sbjct: 50 MLFIPGGTFLMGTDDKEGFAADGEGPVHSVTVNSFYMDETAVTNEQFGRFIAATGYVTEA 109
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
++FG ++VF +SE E AK+ V
Sbjct: 110 QRFGWSYVFHLFVSEAEAAKVKTV------------------------------------ 133
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P A W Y G W HPEG DS IE R +HPV+HVSWNDA+AYC
Sbjct: 134 -----PQTAPWWYPVEGA---------DWNHPEGPDSHIEGREDHPVIHVSWNDAMAYCE 179
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGGL + +PWG+ L GEH N+WQG+FP N A+DGYL TA
Sbjct: 180 WAGRRLPTEAEWEYAGRGGLAQKRYPWGDELKQGGEHNCNIWQGKFPVKNHASDGYLGTA 239
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
P S++ N +GLYNM GNVWEW +DW++ ++ + NP GP TG +V +GGSYLC+
Sbjct: 240 PAKSFRPNGYGLYNMAGNVWEWCSDWFSPSYYQSGENDNPLGPRTGEGRVMRGGSYLCHS 299
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFR 338
YC R+R AARS+NTPDSS GN GFR
Sbjct: 300 SYCNRYRVAARSKNTPDSSTGNCGFR 325
>gi|94536908|ref|NP_001035430.1| sulfatase-modifying factor 1 precursor [Danio rerio]
gi|92096356|gb|AAI15143.1| Zgc:136465 [Danio rerio]
gi|182890128|gb|AAI64368.1| Zgc:136465 protein [Danio rerio]
Length = 335
Score = 296 bits (759), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 186/328 (56%), Gaps = 85/328 (25%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+VLL G F MGT+ P + +DGE P R V LDAFY+++HEV+N QFQ F + TGY+TEAE
Sbjct: 90 LVLLQGGWFLMGTDDPGIPQDGEGPQRKVKLDAFYIEEHEVTNQQFQHFTNQTGYITEAE 149
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+FVFE LLSEE ++ +S H V W V WR
Sbjct: 150 RFGDSFVFEGLLSEEVKSTLS---------------------HAVAAAPWWSPVKGADWR 188
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
HPEG DSTI +RMNHP +HVSW+DA AYC W
Sbjct: 189 -----------------------------HPEGPDSTIHNRMNHPALHVSWDDARAYCQW 219
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RLPTEAEWE CRGGL++R
Sbjct: 220 AKRRLPTEAEWELACRGGLQDRCIS----------------------------------- 244
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
VMS+ N FGLY+MVGN WEWTADWW VHH NPKGP +GTD+VKKGGSY+C++ Y
Sbjct: 245 VMSFPANGFGLYDMVGNAWEWTADWWTVHHSAEDKLNPKGPESGTDRVKKGGSYMCHKSY 304
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
CYR+RCAARSQNTPDSSA NLGFRCA+D
Sbjct: 305 CYRYRCAARSQNTPDSSASNLGFRCASD 332
>gi|194378504|dbj|BAG63417.1| unnamed protein product [Homo sapiens]
Length = 349
Score = 296 bits (759), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 133/186 (71%), Positives = 152/186 (81%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341
Query: 341 ADKGPT 346
AD+ PT
Sbjct: 342 ADRLPT 347
>gi|257470975|ref|NP_001158146.1| sulfatase-modifying factor 1 isoform 2 precursor [Homo sapiens]
gi|119584310|gb|EAW63906.1| sulfatase modifying factor 1, isoform CRA_a [Homo sapiens]
Length = 349
Score = 296 bits (759), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 133/186 (71%), Positives = 152/186 (81%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341
Query: 341 ADKGPT 346
AD+ PT
Sbjct: 342 ADRLPT 347
>gi|449685727|ref|XP_002155473.2| PREDICTED: sulfatase-modifying factor 1-like [Hydra magnipapillata]
Length = 401
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 150/326 (46%), Positives = 193/326 (59%), Gaps = 50/326 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+V G F +GT+KP+ DGE P+R V++ F +D +EVSN +F+EFV TGY TEAE
Sbjct: 117 LVEFQGGDFIIGTDKPVFPDDGESPAREVSIQPFAIDMYEVSNGEFREFVKETGYKTEAE 176
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+G++FV E LSEE I++ AV W
Sbjct: 177 TYGNSFVMEYFLSEEVNKGITE------------------------------AVQGAPWW 206
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W +PEG S + +R++HPVVHVSWNDA+AYC W
Sbjct: 207 ---LPVKG-----------------ANWKNPEGPGSNLINRLSHPVVHVSWNDALAYCRW 246
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G RLPTEAEWE+ C G ++ LFPWGN P G +AN+W G+FP NTA DGY TAP
Sbjct: 247 KGKRLPTEAEWEFACSSGEDDMLFPWGNEFRPHGTFKANIWTGKFPNENTAEDGYNGTAP 306
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V + +N L NMVGNVWEW ADWW+ H +P GP +G KVKKGGS++C++ Y
Sbjct: 307 VNMFVQNNKKLCNMVGNVWEWVADWWHTDHSKEHQKDPYGPPSGDKKVKKGGSFMCSKNY 366
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCA 340
CYR+RCA+RS NTPDS + NLGFRCA
Sbjct: 367 CYRYRCASRSYNTPDSGSSNLGFRCA 392
>gi|402859539|ref|XP_003894212.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Papio anubis]
Length = 349
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 133/186 (71%), Positives = 152/186 (81%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTIWHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341
Query: 341 ADKGPT 346
AD+ PT
Sbjct: 342 ADRLPT 347
>gi|271964559|ref|YP_003338755.1| non-specific serine/threonine protein kinase [Streptosporangium
roseum DSM 43021]
gi|270507734|gb|ACZ86012.1| Non-specific serine/threonine protein kinase [Streptosporangium
roseum DSM 43021]
Length = 308
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 159/337 (47%), Positives = 192/337 (56%), Gaps = 52/337 (15%)
Query: 7 PPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
P R +V L G TFRMGT+ + DGE P R VTL F L V+N +F FV
Sbjct: 22 PARRRASGLVRLEGGTFRMGTDDRDGFPSDGEGPVRAVTLRPFRLASTAVTNARFAAFVK 81
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
ATGY T+AE+FG ++VF L+ E RA
Sbjct: 82 ATGYETDAERFGWSYVFHLLVPEAMRAS-------------------------------- 109
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
A P W G G +W PEG ST+ R +HPVVHVSW
Sbjct: 110 ----------APSPAATPWWLGIEGA---------AWHAPEGPGSTVADRQDHPVVHVSW 150
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
DA AYC W G RLPTEAEWEY RGGL+ +PWG+ LTPRG HR N+WQG FP +NT
Sbjct: 151 QDAQAYCAWSGGRLPTEAEWEYAARGGLDQARYPWGDELTPRGGHRCNIWQGRFPVHNTE 210
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKG 305
DG+L+TAPV +Y+ N +GL+NMVGNVWEW ADW+ V H P +P+GP G +V +G
Sbjct: 211 EDGHLATAPVKAYRPNGYGLHNMVGNVWEWCADWFAVDHLARPLEDPRGPDDGESRVLRG 270
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GSYLC++ YC R+R AARS NTPDSS+GN GFR AAD
Sbjct: 271 GSYLCHDSYCNRYRVAARSSNTPDSSSGNTGFRLAAD 307
>gi|379722178|ref|YP_005314309.1| Non-specific serine/threonine protein kinase [Paenibacillus
mucilaginosus 3016]
gi|378570850|gb|AFC31160.1| Non-specific serine/threonine protein kinase [Paenibacillus
mucilaginosus 3016]
Length = 404
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 161/333 (48%), Positives = 190/333 (57%), Gaps = 52/333 (15%)
Query: 12 YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ M+ LPG TFRMGT+ + +DGE PSR VTLD+FYLD V+N QF FV TGY
Sbjct: 121 HAGMIRLPGGTFRMGTDDREGFPQDGEGPSRLVTLDSFYLDACAVTNEQFLAFVRDTGYK 180
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE +G +FVF + E R+ + DAV
Sbjct: 181 TEAELYGWSFVFHLFVPERLRSSV-------------------------------DAVV- 208
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
A W + G SW PEG + I RM+HPVVHVSWNDA A
Sbjct: 209 ---------QSAPWWWKVDGA---------SWHAPEGPGTGIGERMDHPVVHVSWNDAQA 250
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G RLPTEAEWEY RGGLE R + WG+ L P G+HR N+WQG FP N A DGY
Sbjct: 251 YARWAGKRLPTEAEWEYAARGGLEGRRYAWGDTLLPDGQHRCNIWQGTFPQVNEALDGYA 310
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
TAP S+ N FG Y+M GNVWEW ADW++ +H PS NP GP G K KGGSYL
Sbjct: 311 GTAPARSFPPNGFGFYHMAGNVWEWCADWFSPAYHRTTPSLNPAGPPQGQTKTLKGGSYL 370
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+ YC R+R AAR+ NTPDSS G++GFRCAAD
Sbjct: 371 CHSSYCNRYRVAARTSNTPDSSTGHMGFRCAAD 403
>gi|386724920|ref|YP_006191246.1| Non-specific serine/threonine protein kinase [Paenibacillus
mucilaginosus K02]
gi|384092045|gb|AFH63481.1| Non-specific serine/threonine protein kinase [Paenibacillus
mucilaginosus K02]
Length = 397
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 161/333 (48%), Positives = 190/333 (57%), Gaps = 52/333 (15%)
Query: 12 YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ M+ LPG TFRMGT+ + +DGE PSR VTLD+FYLD V+N QF FV TGY
Sbjct: 114 HAGMIRLPGGTFRMGTDDREGFPQDGEGPSRLVTLDSFYLDACAVTNEQFLAFVRDTGYK 173
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE +G +FVF + E R+ + DAV
Sbjct: 174 TEAELYGWSFVFHLFVPERLRSSV-------------------------------DAVV- 201
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
A W + G SW PEG + I RM+HPVVHVSWNDA A
Sbjct: 202 ---------QSAPWWWKVDGA---------SWHAPEGPGTGIGERMDHPVVHVSWNDAQA 243
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G RLPTEAEWEY RGGLE R + WG+ L P G+HR N+WQG FP N A DGY
Sbjct: 244 YARWAGKRLPTEAEWEYAARGGLEGRRYAWGDTLLPDGQHRCNIWQGTFPQVNEALDGYA 303
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
TAP S+ N FG Y+M GNVWEW ADW++ +H PS NP GP G K KGGSYL
Sbjct: 304 GTAPARSFPPNGFGFYHMAGNVWEWCADWFSPAYHRTTPSLNPAGPPQGQTKTLKGGSYL 363
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+ YC R+R AAR+ NTPDSS G++GFRCAAD
Sbjct: 364 CHSSYCNRYRVAARTSNTPDSSTGHMGFRCAAD 396
>gi|337749226|ref|YP_004643388.1| Non-specific serine/threonine protein kinase [Paenibacillus
mucilaginosus KNP414]
gi|336300415|gb|AEI43518.1| Non-specific serine/threonine protein kinase [Paenibacillus
mucilaginosus KNP414]
Length = 404
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 161/333 (48%), Positives = 190/333 (57%), Gaps = 52/333 (15%)
Query: 12 YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ M+ LPG TFRMGT+ + +DGE PSR VTLD+FYLD V+N QF FV TGY
Sbjct: 121 HAGMIRLPGGTFRMGTDDREGFPQDGEGPSRLVTLDSFYLDACAVTNEQFLAFVRDTGYK 180
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE +G +FVF + E R+ + DAV
Sbjct: 181 TEAELYGWSFVFHLFVPERLRSSV-------------------------------DAVV- 208
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
A W + G SW PEG + I RM+HPVVHVSWNDA A
Sbjct: 209 ---------QSAPWWWKVDGA---------SWHAPEGPGTGIGERMDHPVVHVSWNDAQA 250
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G RLPTEAEWEY RGGLE R + WG+ L P G+HR N+WQG FP N A DGY
Sbjct: 251 YARWAGKRLPTEAEWEYAARGGLEGRRYAWGDTLLPDGQHRCNIWQGTFPQVNEALDGYA 310
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
TAP S+ N FG Y+M GNVWEW ADW++ +H PS NP GP G K KGGSYL
Sbjct: 311 GTAPARSFPPNGFGFYHMAGNVWEWCADWFSPAYHRTTPSLNPAGPPQGQTKTLKGGSYL 370
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+ YC R+R AAR+ NTPDSS G++GFRCAAD
Sbjct: 371 CHSSYCNRYRVAARTSNTPDSSTGHMGFRCAAD 403
>gi|403270416|ref|XP_003927178.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Saimiri
boliviensis boliviensis]
Length = 351
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 131/186 (70%), Positives = 152/186 (81%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPEG DST+ HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 164 NWRHPEGPDSTVLHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLNNRLFPW 223
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 224 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 283
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
V+H + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 284 TVYHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 343
Query: 341 ADKGPT 346
AD+ PT
Sbjct: 344 ADRPPT 349
>gi|397522436|ref|XP_003831273.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Pan paniscus]
Length = 349
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 132/186 (70%), Positives = 152/186 (81%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341
Query: 341 ADKGPT 346
A++ PT
Sbjct: 342 ANRLPT 347
>gi|354581053|ref|ZP_08999957.1| protein of unknown function DUF323 [Paenibacillus lactis 154]
gi|353201381|gb|EHB66834.1| protein of unknown function DUF323 [Paenibacillus lactis 154]
Length = 329
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 158/336 (47%), Positives = 195/336 (58%), Gaps = 52/336 (15%)
Query: 12 YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ DM+ +P TF MGTN K DGE P+R+VT+ F + + V+N ++Q FV TGYV
Sbjct: 43 HTDMIAIPEGTFMMGTNSKEGFPSDGEGPARSVTVSGFEISPYAVTNREYQRFVEDTGYV 102
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAEKFG +FVFE L S+E +AK++QV
Sbjct: 103 TEAEKFGWSFVFELLASDETKAKVTQV--------------------------------- 129
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
P EA W G W PEG ST+E RM+HPVVHVSWNDA+A
Sbjct: 130 --------PQEAPWWLVVEGAY---------WAAPEGAASTLEGRMDHPVVHVSWNDAMA 172
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLPTEAEWEY RGGLE+R +PWG+ L GEH N+WQG+FP N A+DGY+
Sbjct: 173 YCEWAGVRLPTEAEWEYAARGGLEDRTYPWGDLLKQDGEHECNIWQGKFPIKNNASDGYI 232
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
TAPV +YK N +GLYN+ GNVWEW DW++ +H + NP + +GGSYL
Sbjct: 233 GTAPVDAYKPNGYGLYNVSGNVWEWCGDWFSPSYHQQTSASNPFYAEPTGRRSMRGGSYL 292
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
C+ YC R+R AARS NTPDSS GN GFR D P
Sbjct: 293 CHRSYCNRYRVAARSGNTPDSSTGNCGFRVVRDSLP 328
>gi|332815966|ref|XP_003309636.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Pan troglodytes]
gi|410218332|gb|JAA06385.1| sulfatase modifying factor 1 [Pan troglodytes]
gi|410259884|gb|JAA17908.1| sulfatase modifying factor 1 [Pan troglodytes]
gi|410308066|gb|JAA32633.1| sulfatase modifying factor 1 [Pan troglodytes]
gi|410336753|gb|JAA37323.1| sulfatase modifying factor 1 [Pan troglodytes]
Length = 349
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 132/186 (70%), Positives = 152/186 (81%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPEG DSTI HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEGPDSTILHRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341
Query: 341 ADKGPT 346
A++ PT
Sbjct: 342 ANRLPT 347
>gi|395824559|ref|XP_003785530.1| PREDICTED: sulfatase-modifying factor 1 isoform 2 [Otolemur
garnettii]
Length = 349
Score = 295 bits (754), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 130/186 (69%), Positives = 151/186 (81%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPEG DS + HR +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGL+NRLFPW
Sbjct: 162 NWRHPEGPDSNVLHRQDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLQNRLFPW 221
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSS+ NLGFRCA
Sbjct: 282 TVHHSIEETVNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSSSNLGFRCA 341
Query: 341 ADKGPT 346
AD PT
Sbjct: 342 ADHLPT 347
>gi|403722089|ref|ZP_10944830.1| hypothetical protein GORHZ_053_00140 [Gordonia rhizosphera NBRC
16068]
gi|403206805|dbj|GAB89161.1| hypothetical protein GORHZ_053_00140 [Gordonia rhizosphera NBRC
16068]
Length = 318
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 156/328 (47%), Positives = 186/328 (56%), Gaps = 52/328 (15%)
Query: 16 VLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
V LPG F MGT+ P+ DGE P R V +DAF + + V+N +F F+ ATGYVT+AE
Sbjct: 40 VDLPGGEFLMGTDDPVGFPDDGEGPVRPVFVDAFRMAECPVTNDEFAAFIEATGYVTDAE 99
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
++G +FVF L + R R
Sbjct: 100 RYGWSFVFASFLPAQLR------------------------------------------R 117
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
GA P +A W G G + W PEG ST+E R HPVVHVSWNDA AYC W
Sbjct: 118 GAPRPEQAPWWCGVSGAM---------WSAPEGPGSTVEDRGRHPVVHVSWNDAQAYCAW 168
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
GARLPTEAEWEY RGGL + + WG+ L P G H N+WQG FPT NTA DGY T P
Sbjct: 169 AGARLPTEAEWEYAARGGLIQKRYVWGDELVPDGVHMCNIWQGSFPTKNTAEDGYRGTCP 228
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N FGLYNM GNVW+W ADWW V H P NP GP G +V +GGSYLC++ Y
Sbjct: 229 VDAFPPNGFGLYNMAGNVWQWCADWWGVDHSPGRCVNPHGPKGGPARVIRGGSYLCHDSY 288
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C R+R AAR+ N PDS+ GN GFRCAAD
Sbjct: 289 CNRYRVAARTANEPDSTTGNTGFRCAAD 316
>gi|344998181|ref|YP_004801035.1| hypothetical protein SACTE_0555 [Streptomyces sp. SirexAA-E]
gi|344313807|gb|AEN08495.1| protein of unknown function DUF323 [Streptomyces sp. SirexAA-E]
Length = 321
Score = 294 bits (752), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 190/340 (55%), Gaps = 56/340 (16%)
Query: 4 LPAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
L PPV M LLPG TF MG ++ DGE P R V L AF ++ H VSN +F
Sbjct: 36 LAVPPVT----MPLLPGGTFLMGAQDQDGFASDGEGPVREVRLSAFRVEAHAVSNERFAR 91
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV+ATG+VTEAE+FG ++VF L R
Sbjct: 92 FVAATGHVTEAERFGWSYVFAGFLPAALR------------------------------- 120
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
RGA P W G G W PEG S ++ R +HPVVH
Sbjct: 121 -----------RGAPRPDGTPWWCGVEGA---------RWNAPEGPGSGLDGREDHPVVH 160
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
VSWNDA AYC W G RLPTEAEWEY RGGLE R +PWG+ LTP GEHR N+WQG FP
Sbjct: 161 VSWNDARAYCRWAGTRLPTEAEWEYAARGGLEQRRYPWGDELTPGGEHRCNIWQGRFPVK 220
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKV 302
NT DGY TAPV +Y N FGL+NM GNVWEW AD W H +P+GP +G +V
Sbjct: 221 NTVEDGYAGTAPVDAYPPNGFGLHNMAGNVWEWCADRWGTDHGDRRRTDPRGPASGGQRV 280
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+GGSYLC+ YC R+R AAR++NTPDS+AGNLGFRC D
Sbjct: 281 MRGGSYLCHHSYCNRYRVAARTRNTPDSTAGNLGFRCVRD 320
>gi|196006177|ref|XP_002112955.1| hypothetical protein TRIADDRAFT_26011 [Trichoplax adhaerens]
gi|190584996|gb|EDV25065.1| hypothetical protein TRIADDRAFT_26011 [Trichoplax adhaerens]
Length = 294
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 153/334 (45%), Positives = 200/334 (59%), Gaps = 55/334 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G +GTNK ++ D E PS L FY+D++EVSN +F++F+ ATG+ T+AE
Sbjct: 1 MVFLLGGLTTVGTNKIAILSDSEGPSYRAYLSPFYIDKYEVSNRKFRQFIHATGHRTQAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ G +F+ + + + S+ E VV W W+
Sbjct: 61 ELGSSFILTAI----------------PKLKSTSSSQEKV----VVDSPW--------WK 92
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A+ +W HPEG++S+I ++HPVVH+SWNDA AYC W
Sbjct: 93 LAKN---------------------ANWNHPEGLNSSINGILDHPVVHMSWNDANAYCNW 131
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY RGGL +RLFPWGNN P+G+HR N+WQG+FP NTA DGY+ TAP
Sbjct: 132 AGKRLPTEAEWEYAARGGLNDRLFPWGNNPIPKGQHRMNIWQGKFPRENTADDGYIGTAP 191
Query: 255 VM-SYKENKFGLYNMVGNVWEWTADWWN-----VHHHPAPSYNPKGPTTGTDKVKKGGSY 308
V+ SY N +G+YN +GNVWEW DWW+ + S NPKGP +G KV+KGGSY
Sbjct: 192 VVNSYLPNAYGIYNTIGNVWEWVYDWWSRRDGQLRSRQDISNNPKGPNSGVKKVQKGGSY 251
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
LC++ YCYR+RCAAR+ PD+SA NLGFRCA D
Sbjct: 252 LCHKSYCYRYRCAARNAAPPDTSAINLGFRCAMD 285
>gi|195375301|ref|XP_002046440.1| GJ12899 [Drosophila virilis]
gi|194153598|gb|EDW68782.1| GJ12899 [Drosophila virilis]
Length = 342
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 154/329 (46%), Positives = 191/329 (58%), Gaps = 61/329 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+PG + +GT++P D E P R V + FY+D+ EVSN F +FV AT Y TEAE
Sbjct: 73 MSLIPGGSVNIGTDEPHFEADREAPERVVKIKDFYVDKFEVSNANFAKFVDATNYTTEAE 132
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+F+F+ +LS ++A++ R
Sbjct: 133 RFGDSFLFKTMLSPAQQAELEDYR------------------------------------ 156
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W Y G SW P GI S ++ HPVVHVSW DAVAYC W
Sbjct: 157 ----VANALWWYKVSGV---------SWRKPNGISSNLQGLEQHPVVHVSWRDAVAYCAW 203
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+EAEWE CRGG + +LFPWGN L P+ +H N+WQGEFP NTA DGYL T P
Sbjct: 204 AGKRLPSEAEWEVACRGGKQRKLFPWGNKLMPQDKHWLNIWQGEFPDGNTAQDGYLYTCP 263
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT-DKVKKGGSYLCNEQ 313
V +++N + LYNMVGNVWEWTAD W G T+ + ++VKKGGSYLC++
Sbjct: 264 VDEFRQNVYDLYNMVGNVWEWTADLWQA-----------GDTSESPNRVKKGGSYLCHKS 312
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YCYR+RCAARSQNT DSSA NLGFRCA D
Sbjct: 313 YCYRYRCAARSQNTEDSSASNLGFRCAKD 341
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 39/44 (88%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YCYR+RCAARSQNT DSSA NLGFRCA D
Sbjct: 299 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSASNLGFRCAKDA 342
>gi|281337593|gb|EFB13177.1| hypothetical protein PANDA_008710 [Ailuropoda melanoleuca]
Length = 312
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 143/278 (51%), Positives = 178/278 (64%), Gaps = 50/278 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MGT+ P + +DGE P+R V +DAFY+D +EVSN +F++FV++TGY+TEAE
Sbjct: 85 MVLIPAGVFTMGTDDPKIKQDGEAPARKVAIDAFYMDAYEVSNAEFEKFVNSTGYLTEAE 144
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVFE +LSE+ + I Q AVA W
Sbjct: 145 KFGDSFVFEGMLSEQVKTDIQQ------------------------------AVAAAPWW 174
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + +W HPEG DST+ HR +HPV+HVSWNDA+AYCTW
Sbjct: 175 ---LPVKG-----------------ANWRHPEGPDSTVLHRPDHPVLHVSWNDAIAYCTW 214
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGL+NRLFPWGN L P+G+H AN+WQGEFP NT DG+ TAP
Sbjct: 215 AGKRLPTEAEWEYSCRGGLQNRLFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFRGTAP 274
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
V ++ N +GLYN+VGNVWEWT+DWW VHH + NP
Sbjct: 275 VDAFPPNGYGLYNIVGNVWEWTSDWWTVHHSVEKTLNP 312
>gi|195135258|ref|XP_002012051.1| GI16647 [Drosophila mojavensis]
gi|193918315|gb|EDW17182.1| GI16647 [Drosophila mojavensis]
Length = 341
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 153/327 (46%), Positives = 191/327 (58%), Gaps = 61/327 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+PG + +GT++P I D E P R V + FYLD++EVSN F +FV AT Y TEAE
Sbjct: 72 MSLIPGGSQYIGTDEPHFIADHESPERLVKFNDFYLDKYEVSNANFAKFVEATNYTTEAE 131
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+F+F+ LL E++ ++ R
Sbjct: 132 RFGDSFIFKTLLKPEQQEELKDYR------------------------------------ 155
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W Y G SW P GIDS +E +HPVVHVSW DAVAYC W
Sbjct: 156 ----VANAVWWYKVSGV---------SWRRPNGIDSNLEGLEHHPVVHVSWRDAVAYCEW 202
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+EAEWE CRGG + +LFPWGN L P+ +H N+WQG+FP NT DGY T P
Sbjct: 203 AGKRLPSEAEWEVACRGGKQRKLFPWGNKLMPKDKHWLNIWQGDFPDGNTEEDGYPFTCP 262
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
V +++N + LYN+VGNVWEWTAD W P+P ++VKKGGSYLC++
Sbjct: 263 VDQFRQNNYDLYNIVGNVWEWTADLWQAGDTSPSP-----------NRVKKGGSYLCHKS 311
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCA 340
YCYR+RCAARSQNT DSSAG+LGFRCA
Sbjct: 312 YCYRYRCAARSQNTEDSSAGSLGFRCA 338
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 37/54 (68%), Positives = 44/54 (81%), Gaps = 2/54 (3%)
Query: 340 AADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
A D P+ ++VKKGGSYLC++ YCYR+RCAARSQNT DSSAG+LGFRCA V
Sbjct: 290 AGDTSPSP--NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGSLGFRCAKSV 341
>gi|195012382|ref|XP_001983615.1| GH15994 [Drosophila grimshawi]
gi|193897097|gb|EDV95963.1| GH15994 [Drosophila grimshawi]
Length = 341
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 193/329 (58%), Gaps = 61/329 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+PG + +GTN+P D E P R V + FYLD+HEVSN F +F Y TEAE
Sbjct: 72 MSLIPGGSLHIGTNEPHFEADHESPERVVKVQDFYLDKHEVSNANFAKFTEQINYKTEAE 131
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+F+F+ LL+ E++ ++ R VV+
Sbjct: 132 RFGDSFLFKTLLTPEQQTELEDYR--------------------VVN------------- 158
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W Y +G SW P GI+S ++ HPVVHVSW DAVAYCTW
Sbjct: 159 -------ALWWYKVKGV---------SWRKPNGINSNLQGLEQHPVVHVSWRDAVAYCTW 202
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+E+EWE CRGG + +L+PWGN L P+ +H N+WQGEFP NT DGY T P
Sbjct: 203 AGKRLPSESEWEVACRGGKQRKLYPWGNKLMPKDQHWLNIWQGEFPDGNTQQDGYQFTCP 262
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT-DKVKKGGSYLCNEQ 313
V +++N + LYNMVGNVWEWTAD W G T+ + +VKKGGSYLC++
Sbjct: 263 VNEFRQNNYDLYNMVGNVWEWTADLWQA-----------GDTSESPARVKKGGSYLCHKS 311
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 312 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 340
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/43 (79%), Positives = 39/43 (90%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 299 RVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 341
>gi|288917453|ref|ZP_06411819.1| Non-specific serine/threonine protein kinase [Frankia sp. EUN1f]
gi|288351156|gb|EFC85367.1| Non-specific serine/threonine protein kinase [Frankia sp. EUN1f]
Length = 309
Score = 292 bits (748), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 160/333 (48%), Positives = 192/333 (57%), Gaps = 53/333 (15%)
Query: 11 RYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
R M+ LPG F MGT ++ DGE P R + L F ++ V+N QF FV ATGY
Sbjct: 28 RTTRMIKLPGGRFLMGTEDEEGYPADGEGPVREIDLSPFAIEATTVTNAQFATFVKATGY 87
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
T+AE+ +FVFE LSEE +S+ + PVVH
Sbjct: 88 ATDAERHDFSFVFEGFLSEE----VSR-------------------NSPVVH-------- 116
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
E W + G W HPEG S+I R NHPVVHVSWNDA
Sbjct: 117 -----------EVPWWHAVAGAF---------WKHPEGPGSSIASRQNHPVVHVSWNDAQ 156
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AYC W G RLP+EAEWEY RGGLE R +PWG+ L P+G H+ N+WQG FPT NT ADGY
Sbjct: 157 AYCAWAGTRLPSEAEWEYAARGGLEQRRYPWGDELMPKGRHQCNIWQGVFPTVNTGADGY 216
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSY 308
L TAP +Y+ N GLYN+VGNVWEW ADW++ H P NP GP G KV +GGS+
Sbjct: 217 LGTAPAKAYRPNGHGLYNVVGNVWEWCADWFSADFHTTGPRTNPTGPPGGLAKVMRGGSH 276
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+C+ YC R+R AARS NTPDSSAGN+GFR AA
Sbjct: 277 MCHSSYCNRYRVAARSSNTPDSSAGNIGFRVAA 309
>gi|374605458|ref|ZP_09678386.1| non-specific serine/threonine protein kinase [Paenibacillus
dendritiformis C454]
gi|374388957|gb|EHQ60351.1| non-specific serine/threonine protein kinase [Paenibacillus
dendritiformis C454]
Length = 371
Score = 291 bits (744), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 154/330 (46%), Positives = 197/330 (59%), Gaps = 52/330 (15%)
Query: 15 MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+LL G TF MG+ + DGE P R VT+ F + + V+N +F FV+ATGYVTEA
Sbjct: 90 FILLEGGTFAMGSEDAESFPADGEGPIREVTVGPFRIAPYAVTNEEFARFVAATGYVTEA 149
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG ++VF L + E +H + R +G W W
Sbjct: 150 ERFGWSYVFHLLAPDLEE------KHVIGRPQG---------------TPW--------W 180
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
G R W PEG S++ RMNHPV+HVSW+DA AYC
Sbjct: 181 LGVRQ---------------------ACWHRPEGPGSSVAERMNHPVIHVSWHDAQAYCE 219
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTE+EWE+ RGGLE R +PWG+ L P GEHR N+WQG+FP N A+DGY+ TA
Sbjct: 220 WAGTRLPTESEWEFAARGGLERRRYPWGDLLKPDGEHRCNIWQGKFPVKNNASDGYIGTA 279
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
PV +++ N +GLYN+ GNVWEW ADW+ + P+ P+GP TGT++V KGGSYLC++
Sbjct: 280 PVDAFEPNGYGLYNVAGNVWEWCADWFTANPDERGPADQPRGPQTGTERVMKGGSYLCHK 339
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R ARS+NTPDSS GNLGFRCAAD
Sbjct: 340 SYCNRYRVGARSKNTPDSSTGNLGFRCAAD 369
>gi|336178162|ref|YP_004583537.1| sulfatase-modifying factor protein [Frankia symbiont of Datisca
glomerata]
gi|334859142|gb|AEH09616.1| Sulphatase-modifying factor protein [Frankia symbiont of Datisca
glomerata]
Length = 278
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 157/329 (47%), Positives = 191/329 (58%), Gaps = 53/329 (16%)
Query: 15 MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV LPG F MGT + DGE P R + L+ F ++ V+N QF FV+ATGYVT+A
Sbjct: 1 MVQLPGGRFLMGTEDNEGYRADGEGPVREIELNPFSIEPTTVTNAQFATFVNATGYVTDA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E++ +FVFE LSEE +R+ V W AV+ W
Sbjct: 61 ERYDFSFVFEGFLSEEL----------------------NRVSSAVHTTPWWRAVSGAGW 98
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+ HPEG S++ R NHPVVHVSWNDA +YC
Sbjct: 99 K-----------------------------HPEGPGSSVASRQNHPVVHVSWNDAQSYCA 129
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGGLE R +PWG+ LTP G HR N+WQG+FP NT DGY+ TA
Sbjct: 130 WAGVRLPTEAEWEYAARGGLEQRRYPWGDELTPGGRHRCNIWQGDFPNLNTGEDGYIGTA 189
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNE 312
PV SY+ N +G+YN VGNVWEW ADW++ H P NP GP G KV +GGS++C+
Sbjct: 190 PVKSYRPNGYGIYNAVGNVWEWCADWFSATFHVTGPRTNPTGPPDGHAKVMRGGSHMCHA 249
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
YC R+R AARS NTPDSSAGN+GFR AA
Sbjct: 250 SYCNRYRVAARSSNTPDSSAGNIGFRTAA 278
>gi|194864509|ref|XP_001970974.1| GG14686 [Drosophila erecta]
gi|190652757|gb|EDV50000.1| GG14686 [Drosophila erecta]
Length = 303
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 154/332 (46%), Positives = 195/332 (58%), Gaps = 59/332 (17%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ DM L+PG T +GT+K D E P R V L+ FY+D++EVSN F +FV T Y
Sbjct: 30 KIADMSLIPGGTVYVGTDKAHFPADREAPERQVKLNDFYIDKYEVSNDAFAKFVLHTNYT 89
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE+FGD+F+F+ LLS E+ + R V W VA
Sbjct: 90 TEAERFGDSFLFKTLLSPSEQKDLEDFR--------------------VASAVWWYKVAG 129
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
WR HP G+ S +++ HPVVHVSW DAV
Sbjct: 130 VNWR-----------------------------HPNGVGSNLDNLGRHPVVHVSWRDAVE 160
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLP+EAEWE CRGG + +LFPWGN L PR EH N+WQG+FP N A DG+
Sbjct: 161 YCKWAGKRLPSEAEWETACRGGRDRKLFPWGNKLMPRNEHWLNIWQGDFPDGNLAEDGFE 220
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
T+PV ++++N + L+N+VGNVWEWTAD W+V+ S NP ++VKKGGSYLC
Sbjct: 221 YTSPVDAFRQNVYDLHNIVGNVWEWTADLWDVND---VSDNP-------NRVKKGGSYLC 270
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 271 HKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 302
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/44 (77%), Positives = 40/44 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 260 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 303
>gi|332231571|ref|XP_003264968.1| PREDICTED: sulfatase-modifying factor 1 isoform 3 [Nomascus
leucogenys]
Length = 349
Score = 290 bits (741), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 131/186 (70%), Positives = 149/186 (80%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HPE DSTI HR +HPV+HVSWNDA AYCTW G RLPTEAEWEY CRGGL NRLFPW
Sbjct: 162 NWRHPEWPDSTIRHRPDHPVLHVSWNDAFAYCTWAGKRLPTEAEWEYSCRGGLHNRLFPW 221
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT+DWW
Sbjct: 222 GNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWTSDWW 281
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRCA
Sbjct: 282 TVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLGFRCA 341
Query: 341 ADKGPT 346
AD PT
Sbjct: 342 ADCLPT 347
>gi|409730291|ref|ZP_11271870.1| Non-specific serine/threonine protein kinase [Halococcus
hamelinensis 100A6]
gi|448723625|ref|ZP_21706141.1| Non-specific serine/threonine protein kinase [Halococcus
hamelinensis 100A6]
gi|445787164|gb|EMA37912.1| Non-specific serine/threonine protein kinase [Halococcus
hamelinensis 100A6]
Length = 332
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 154/338 (45%), Positives = 193/338 (57%), Gaps = 56/338 (16%)
Query: 11 RYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
R MV L G F MGT+ + +DGE P+R V LD FY+D+ V+N +F FV TGY
Sbjct: 43 RTTRMVRLDGGRFTMGTDDGVGFPEDGEGPAREVALDPFYVDRFAVTNAEFLRFVRETGY 102
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
TEAE+FG +FVFE ++ + + + Q V W AV
Sbjct: 103 TTEAERFGWSFVFEDFVAPADESHVMQS---------------------VAAAPWWVAVE 141
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTI--EHRMNHPVVHVSWND 187
TW L PEG S++ + R+ PV HVSWND
Sbjct: 142 GATW-----------------------------LRPEGPSSSVVGDDRLKQPVAHVSWND 172
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G RLPTEAEWEY RGGLE + +PWG++L P GEHR N+WQG+FP +NT D
Sbjct: 173 AQAYADWAGKRLPTEAEWEYAARGGLEGKRYPWGDDLRPDGEHRCNIWQGDFPEHNTGDD 232
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKGPTTGTDKVKK 304
GYL APV +Y+ N FGLYN+ GNVWEW ADW++ +H +Y NP GP G +V +
Sbjct: 233 GYLGPAPVTAYEPNGFGLYNVSGNVWEWCADWFSSEYHTTEAYSHTNPTGPPDGDSRVMR 292
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGSYLC+ +C R+R AARSQNTPDSS GN+GFRC D
Sbjct: 293 GGSYLCHRSWCNRYRVAARSQNTPDSSTGNIGFRCVVD 330
>gi|195089898|ref|XP_001997488.1| GH11358 [Drosophila grimshawi]
gi|193905884|gb|EDW04751.1| GH11358 [Drosophila grimshawi]
Length = 270
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 152/329 (46%), Positives = 192/329 (58%), Gaps = 61/329 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+P + +GTN+P D E P R V + FYLD+HEVSN F +F Y TEAE
Sbjct: 1 MSLIPCGSLHIGTNEPHFEADHESPERVVKVQDFYLDEHEVSNANFAKFTEQINYKTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD+F+F+ LL+ E++ ++ R VV+
Sbjct: 61 RFGDSFLFKTLLTPEQQTELEVYR--------------------VVN------------- 87
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W Y +G SW P GI+S ++ HPVVHVSW DAVAYCTW
Sbjct: 88 -------ALWWYKVKGV---------SWRKPNGINSNLQGLEQHPVVHVSWRDAVAYCTW 131
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+E+EWE CRGG + +L+PWGN L P+ +H N+WQGEFP NT DGY T P
Sbjct: 132 AGKRLPSESEWEVACRGGKQRKLYPWGNKLMPKDQHWLNIWQGEFPDGNTQQDGYQFTCP 191
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT-DKVKKGGSYLCNEQ 313
V +++N + LYNMVGNVWEWTAD W G T+ + +VKKGGSYLC++
Sbjct: 192 VNEFRQNNYDLYNMVGNVWEWTADLWQA-----------GDTSESPARVKKGGSYLCHKS 240
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 241 YCYRYRCAARSQNTEDSSAGNLGFRCAKN 269
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 34/43 (79%), Positives = 39/43 (90%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 228 RVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 270
>gi|345012061|ref|YP_004814415.1| sulfatase-modifying factor protein [Streptomyces violaceusniger Tu
4113]
gi|344038410|gb|AEM84135.1| Sulphatase-modifying factor protein [Streptomyces violaceusniger Tu
4113]
Length = 316
Score = 283 bits (725), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 150/329 (45%), Positives = 186/329 (56%), Gaps = 52/329 (15%)
Query: 15 MVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV + G F MG + +DGE P R V + AF++D + VSN +F EFV ATG+VTEA
Sbjct: 38 MVRVAGGEFLMGADDVEGFPEDGEGPVRTVRVSAFHIDAYAVSNEKFAEFVDATGHVTEA 97
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ G ++ F L R
Sbjct: 98 ERIGWSYAFAGFLPAALR------------------------------------------ 115
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
RGA P W G G W HPEG ST+ R NHPVVH+SWNDA AYC
Sbjct: 116 RGAPRPARIPWWCGVTGA---------RWDHPEGPGSTLAGRENHPVVHISWNDAAAYCA 166
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGGLE + +PWG++L G +R N+W+G FPT NT ADGY TA
Sbjct: 167 WAGKRLPTEAEWEYAARGGLEQKRYPWGDDLDLGGTYRCNIWRGAFPTRNTVADGYRGTA 226
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV +++ N FGLYN GNVWEW ADWW V H + +PKGP G +KV +GGS+LC++
Sbjct: 227 PVDAFEPNGFGLYNTSGNVWEWCADWWTVRHQGDTAIDPKGPAIGQEKVIRGGSHLCHKS 286
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R AAR+ NTPD ++G+ GFRCA D
Sbjct: 287 YCNRYRVAARTANTPDGASGHTGFRCARD 315
>gi|253576938|ref|ZP_04854262.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251843669|gb|EES71693.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 329
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 154/333 (46%), Positives = 186/333 (55%), Gaps = 52/333 (15%)
Query: 12 YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ DMV +P TF MGTN KDGE P+R V + F + + V+N QF+ FV ATGY
Sbjct: 46 HSDMVTIPDGTFLMGTNTNEGFPKDGEGPAREVRVRGFQMSPYPVTNEQFKMFVDATGYR 105
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE FG +FVF L SEE + ++ V
Sbjct: 106 TEAESFGWSFVFHLLASEETKRRVKDV--------------------------------- 132
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
P W G W PEG DS+IE RM+HPVVHVSWNDA A
Sbjct: 133 --------PQGVPWWLVVEGAY---------WAAPEGPDSSIEDRMDHPVVHVSWNDAEA 175
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
YC W G RLPTEAEWEY RGGLE + +PWG+ L GEH+ N+WQG+FP N A+DGY+
Sbjct: 176 YCRWAGVRLPTEAEWEYAARGGLEGKTYPWGDLLKLDGEHQCNIWQGKFPVKNNASDGYV 235
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYL 309
TAPV +YK N +GLY M GNVWEW ADW++ +H P+ NP + +GGSYL
Sbjct: 236 GTAPVQTYKPNGYGLYQMSGNVWEWCADWFSPDYHRTMPADNPVYTAPTGSRSMRGGSYL 295
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+ YC R+R AARS NTPDSS GN GFR D
Sbjct: 296 CHRSYCNRYRVAARSSNTPDSSTGNCGFRVVRD 328
>gi|167526896|ref|XP_001747781.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773885|gb|EDQ87521.1| predicted protein [Monosiga brevicollis MX1]
Length = 412
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 162/362 (44%), Positives = 194/362 (53%), Gaps = 77/362 (21%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E ++ L G F MG++ DGE P+R V + F + HEVSN +F FV+ATGY
Sbjct: 102 EEQAALLHLSGGVFTMGSDDGFFPDDGEGPARKVQVSPFLIGAHEVSNARFAAFVAATGY 161
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
TEAE+FG++FV E +S A+IS AVA
Sbjct: 162 RTEAERFGNSFVVEQFISPRISAQISS------------------------------AVA 191
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W LP + W HPEG D+ I R +HP VH+SWNDA
Sbjct: 192 AAPWW---LPVDG-----------------ADWAHPEGPDTNISGRADHPAVHISWNDAN 231
Query: 190 AYCTW---RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE--FPTN-- 242
A+C W RG RLPTEAEWE+ RGGLE+RLFPWGN + P+GE R N WQ E PT+
Sbjct: 232 AFCRWSHPRG-RLPTEAEWEFAARGGLEHRLFPWGNKMQPKGEFRMNTWQSEMKIPTDAN 290
Query: 243 -------------------NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH 283
NTA DG+ TAPV SYK NKFGLYN VGNVWEWT DW
Sbjct: 291 VFKHSFLPIHDGHAYYSAKNTAEDGFQLTAPVDSYKPNKFGLYNTVGNVWEWTNDWHTRV 350
Query: 284 HHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
H P +P+GP G KVKKGGS++CN CYR+R +AR TPDSSA N+GFRCAAD
Sbjct: 351 HSPDFVVDPRGPPAGDKKVKKGGSFMCNIFTCYRYRNSARMPLTPDSSAANVGFRCAADV 410
Query: 344 GP 345
P
Sbjct: 411 SP 412
>gi|195436010|ref|XP_002065971.1| GK12600 [Drosophila willistoni]
gi|194162056|gb|EDW76957.1| GK12600 [Drosophila willistoni]
Length = 393
Score = 281 bits (718), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 190/324 (58%), Gaps = 59/324 (18%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+ Y DM L+PG + +GT++P D E P V L FYLD++EVSN +F++FV AT
Sbjct: 78 TDDYPDMSLIPGGSVLIGTDQPHFQADREAPEVRVDLKNFYLDKYEVSNRKFKDFVLATE 137
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
YVTEAE+FGD+F+F+ LLS E+ + R
Sbjct: 138 YVTEAERFGDSFLFKTLLSPAEQKNLEDFR------------------------------ 167
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
A W Y +G SW P G++S ++ +HPVVHVSW DA
Sbjct: 168 ----------VASAIWWYKVKGV---------SWRTPNGVNSNLDGLEDHPVVHVSWRDA 208
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
VA+CTW G RLPTEAEWE CRGG + +LFPWGN L PR EH N+WQG+FP NTA DG
Sbjct: 209 VAFCTWAGKRLPTEAEWEAACRGGKKRKLFPWGNKLMPRDEHWLNIWQGDFPDGNTADDG 268
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
Y T PV +++N++ LYN+VGNVWEWT+D W + S P ++VKKGGSY
Sbjct: 269 YEYTCPVNEFRQNEYDLYNIVGNVWEWTSDLWQTND---VSETP-------NRVKKGGSY 318
Query: 309 LCNEQYCYRHRCAARSQNTPDSSA 332
LC++ YCYR+RCAARSQNT DSSA
Sbjct: 319 LCHKSYCYRYRCAARSQNTEDSSA 342
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/33 (78%), Positives = 31/33 (93%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 382
++VKKGGSYLC++ YCYR+RCAARSQNT DSSA
Sbjct: 310 NRVKKGGSYLCHKSYCYRYRCAARSQNTEDSSA 342
>gi|318058229|ref|ZP_07976952.1| non-specific serine/threonine protein kinase [Streptomyces sp.
SA3_actG]
gi|318081153|ref|ZP_07988485.1| non-specific serine/threonine protein kinase [Streptomyces sp.
SA3_actF]
Length = 306
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 156/341 (45%), Positives = 185/341 (54%), Gaps = 53/341 (15%)
Query: 4 LPAPPVERYKD-MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
+PAPP + + V L G TF MG + DGE P R V L F + V+N +F
Sbjct: 16 VPAPPEDPGPEPSVALDGGTFLMGAEDADGFAADGEGPVREVRLSPFRIGTLAVTNARFA 75
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
FV+ATG+VTEAE+FG +FVF L R
Sbjct: 76 AFVAATGHVTEAERFGWSFVFAAFLPGALR------------------------------ 105
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
R + P W G G +W PEG S + R +HPVV
Sbjct: 106 ------------RVSPRPDATPWWCGVSGA---------TWRAPEGPGSGLAERWDHPVV 144
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
HV W DA AYC W G RLPTEAEWEY RGGL R +PWG+ LTP GEHR N+WQG FPT
Sbjct: 145 HVGWADAAAYCRWAGGRLPTEAEWEYAARGGLAGRRYPWGDELTPDGEHRCNIWQGRFPT 204
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK 301
NTA DGY TAP ++ N GL+N+ GNVWEW AD W H P+ +PKGP GT +
Sbjct: 205 TNTAEDGYRGTAPAHAFPPNGHGLHNVAGNVWEWCADRWGTDHSARPASDPKGPRRGTSR 264
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V +GGSYLC+ YC R+R AAR+ NTPDSSAGNLGFRCA D
Sbjct: 265 VMRGGSYLCHHSYCNRYRVAARTSNTPDSSAGNLGFRCAWD 305
>gi|357614993|gb|EHJ69415.1| sulfatase-modifying factor 1 [Danaus plexippus]
Length = 267
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 149/326 (45%), Positives = 190/326 (58%), Gaps = 62/326 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+L+P T+++GT++ ++ D E P + V L +FYLD++EVSN F F +T Y TEAE
Sbjct: 1 MILIPSGTYQVGTDEILIDTDNEGPKKFVQLKSFYLDKYEVSNRDFTIFTESTDYKTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+FVF L+ + ++ HD + +
Sbjct: 61 TFGDSFVFTLFLNNTFKKQL----HDFRVMQ----------------------------- 87
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W Y G +W HP G DS + M+HPV HVSWNDA AYC W
Sbjct: 88 -------APWWYKVYGT---------NWKHPLGPDSDVADLMDHPVTHVSWNDAKAYCAW 131
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RGARLPTEAEWE GCRG N +PWG+ L P +H AN+WQG FP +N+A DGY+ T P
Sbjct: 132 RGARLPTEAEWEAGCRGDHYNITYPWGDKLFPDRQHMANIWQGTFPNHNSAKDGYVGTNP 191
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V +++NK GL+NM GNVWEWT D W+ NPK +KVKKGGSYLC+ Y
Sbjct: 192 VHLFEQNKLGLHNMAGNVWEWTEDSWSED-------NPK------EKVKKGGSYLCHRSY 238
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCA 340
CYR+RC++RS NT DSSAGNLGFRCA
Sbjct: 239 CYRYRCSSRSHNTDDSSAGNLGFRCA 264
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/41 (78%), Positives = 37/41 (90%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
+KVKKGGSYLC+ YCYR+RC++RS NT DSSAGNLGFRCA
Sbjct: 224 EKVKKGGSYLCHRSYCYRYRCSSRSHNTDDSSAGNLGFRCA 264
>gi|385677476|ref|ZP_10051404.1| hypothetical protein AATC3_16239 [Amycolatopsis sp. ATCC 39116]
Length = 308
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 158/331 (47%), Positives = 186/331 (56%), Gaps = 53/331 (16%)
Query: 15 MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV LPG FRMGT+ DGE P R V L F +D++ VSN F FV+ TGYVTEA
Sbjct: 30 MVALPGGEFRMGTDDAEGFPADGEGPVRTVELSPFLIDRYAVSNDDFAAFVADTGYVTEA 89
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ G ++VF L E R +IS R D+ W AV W
Sbjct: 90 ERIGWSYVFVRFLDRELR-RISP-RPDV--------------------TPWWAAVTGADW 127
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R PEG S + R +HPVVHVSW+DA AY
Sbjct: 128 R-----------------------------RPEGPGSDLSGRGDHPVVHVSWHDAGAYAA 158
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + R +PWG+ L P G H AN+WQG FP +TA DGY TA
Sbjct: 159 WAGKRLPTEAEWEYAARGGHDQRRYPWGDELNPGGRHLANIWQGRFPITDTAEDGYAGTA 218
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV ++ N FGL+NM GNVWEW AD W H P + +P GP+TGTD+V +GGSYLC+
Sbjct: 219 PVDAFPPNDFGLHNMAGNVWEWCADRWGTDHPPR-ARDPIGPSTGTDRVTRGGSYLCHVS 277
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
YC R+R AARS NTPDSS GN GFRCA D G
Sbjct: 278 YCNRYRVAARSHNTPDSSGGNTGFRCARDPG 308
>gi|333023180|ref|ZP_08451244.1| putative Non-specific serine/threonine protein kinase [Streptomyces
sp. Tu6071]
gi|332743032|gb|EGJ73473.1| putative Non-specific serine/threonine protein kinase [Streptomyces
sp. Tu6071]
Length = 387
Score = 278 bits (711), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 154/341 (45%), Positives = 184/341 (53%), Gaps = 53/341 (15%)
Query: 4 LPAPPVERYKD-MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
+PAPP + + V L G TF MG + DGE P R V L F + V+N +F
Sbjct: 97 VPAPPEDPGPEPSVALDGGTFLMGAEDADGFAADGEGPVREVRLSPFRIGTLAVTNARFA 156
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
FV+ATG+VTEAE+FG +FVF L R
Sbjct: 157 AFVAATGHVTEAERFGWSFVFAAFLPGALR------------------------------ 186
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
R + P W G G +W PEG S + R +HP V
Sbjct: 187 ------------RVSPRPDATPWWCGVSGA---------TWRAPEGPGSGLAERWDHPAV 225
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
HV W DA AYC W G RLPTEAEWEY RGGL R +PWG+ LTP GEHR N+WQG FPT
Sbjct: 226 HVGWADAAAYCRWAGGRLPTEAEWEYAARGGLAGRRYPWGDELTPDGEHRCNIWQGRFPT 285
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK 301
NTA DGY TAP ++ N GL+N+ GNVWEW AD W H P+ +P+GP GT +
Sbjct: 286 TNTAEDGYRGTAPAHAFPPNGHGLHNVAGNVWEWCADRWGTDHSARPASDPRGPRRGTSR 345
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V +GGSYLC+ YC R+R AAR+ NTPDSSAGNLGFRCA D
Sbjct: 346 VMRGGSYLCHHSYCNRYRVAARTSNTPDSSAGNLGFRCAWD 386
>gi|429199588|ref|ZP_19191336.1| sulfatase-modifying factor 1 family protein [Streptomyces ipomoeae
91-03]
gi|428664672|gb|EKX63947.1| sulfatase-modifying factor 1 family protein [Streptomyces ipomoeae
91-03]
Length = 294
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 155/338 (45%), Positives = 192/338 (56%), Gaps = 53/338 (15%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
+P + + M+ LPG F MGT+ + DGE P R V L F + V+N QF EFV
Sbjct: 6 SPEAQHPRRMLDLPGGRFLMGTDDEVGYPADGEGPVREVELSPFRIAATTVTNEQFAEFV 65
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
ATG+VTEAE F +FVFE L RA V W
Sbjct: 66 DATGHVTEAEHFDFSFVFEGFLPAHLRAVSPHV----------------------PGTPW 103
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
WRG R T W HPEG S+++ ++HPVVHVS
Sbjct: 104 --------WRGVRGAT---------------------WRHPEGPGSSVDAILDHPVVHVS 134
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
WNDA AYC W G RLPTEAEWEY RGGLE + +PWG++L P G++R N+W+G FPT NT
Sbjct: 135 WNDARAYCAWSGTRLPTEAEWEYAARGGLEQQRYPWGDDLDPDGQYRCNIWRGTFPTLNT 194
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVK 303
AADG+ STAP +++ N FGLYN VGNVWEW AD ++ H P +P GP G +V
Sbjct: 195 AADGHRSTAPADAFEPNGFGLYNTVGNVWEWCADPFSPDFHLTGPRTDPTGPPDGPIRVM 254
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+GGS++C+ YC R+R AARS NTPDSSAGN+GFR AA
Sbjct: 255 RGGSHMCHASYCNRYRVAARSSNTPDSSAGNIGFRVAA 292
>gi|345012332|ref|YP_004814686.1| sulfatase-modifying factor protein [Streptomyces violaceusniger Tu
4113]
gi|344038681|gb|AEM84406.1| Sulphatase-modifying factor protein [Streptomyces violaceusniger Tu
4113]
Length = 326
Score = 275 bits (702), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 152/339 (44%), Positives = 185/339 (54%), Gaps = 54/339 (15%)
Query: 13 KDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ M+ +P T+ MG + +DGE P R V L F +D V+N QF FV ++GY T
Sbjct: 36 RAMLRIPAGTYLMGGDDADAFPEDGEGPVRAVRLSPFLIDATAVTNRQFATFVRSSGYRT 95
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+AE++G +FVF L+ + R H+ + VA
Sbjct: 96 DAERYGWSFVFYALVHPDAR-----------------------------HLVRDGTVA-- 124
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
+A W G W P G S+ NHPVVHVSW DA AY
Sbjct: 125 ---------QAPWWLAVDGAC---------WRAPYGPGSSWTDLSNHPVVHVSWRDASAY 166
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
TW G RLPTEAEWE RGGLE FPWGN L PRG+HR N+WQGEFP NT DGYL
Sbjct: 167 ATWAGKRLPTEAEWEMAARGGLERARFPWGNELLPRGQHRCNIWQGEFPQVNTGEDGYLG 226
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGTDKVKKGGS 307
TAPV SY+ N FGLYN GNVWEW +DWW+ H P +P GP G KV +GGS
Sbjct: 227 TAPVKSYRANNFGLYNTSGNVWEWCSDWWSTTWHAADRPETRQDPAGPPAGEAKVIRGGS 286
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
YLC+ YC R+R AAR+ NTPDSS G++GFRCAAD P+
Sbjct: 287 YLCHASYCNRYRVAARTSNTPDSSTGHMGFRCAADLPPS 325
>gi|320106475|ref|YP_004182065.1| hypothetical protein AciPR4_1246 [Terriglobus saanensis SP1PR4]
gi|319924996|gb|ADV82071.1| protein of unknown function DUF323 [Terriglobus saanensis SP1PR4]
Length = 318
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/333 (44%), Positives = 191/333 (57%), Gaps = 56/333 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
++MVLL G F MGT+ DGE P R VT+ F++D++ V N +F++F T Y T
Sbjct: 38 EEMVLLNGGNFLMGTDYAAGFPADGEGPVRRVTVSPFWIDRYTVRNREFEKFTETTNYQT 97
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDM-KRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
E+E+FG +FVFE L++E + + + +R EG D
Sbjct: 98 ESERFGWSFVFEGDLADELKVSQQVIGAEWWRRIEGAD---------------------- 135
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W HPEG S I+ R ++PVVHVSWNDA A
Sbjct: 136 -------------------------------WRHPEGPQSHIDSRADYPVVHVSWNDASA 164
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G RLPTEAEWE+ RGGLE L+PWG++LTP GE+R N WQG FP+ N+A DG+
Sbjct: 165 YAVWAGKRLPTEAEWEFAARGGLEQVLYPWGDDLTPEGEYRCNTWQGIFPSYNSAEDGFA 224
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYL 309
ST P SY+ N +GLY M GNVWEW ADW++ +H A +P GP G+ +V KGGSYL
Sbjct: 225 STCPSDSYQPNGYGLYCMTGNVWEWCADWFDPEYHVHATHIDPLGPPLGSARVLKGGSYL 284
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C++ YC R+R AARS NTPDS+ N+GFRC D
Sbjct: 285 CHKSYCNRYRVAARSSNTPDSATTNMGFRCVRD 317
>gi|386722552|ref|YP_006188878.1| Non-specific serine/threonine protein kinase [Paenibacillus
mucilaginosus K02]
gi|384089677|gb|AFH61113.1| Non-specific serine/threonine protein kinase [Paenibacillus
mucilaginosus K02]
Length = 307
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 185/339 (54%), Gaps = 54/339 (15%)
Query: 8 PVERYK--DMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P R++ + +LL G F MGTN P DGE P R V ++ F + + V+N QF FV
Sbjct: 15 PSTRFQQNEFILLEGGDFLMGTNTPEGFPADGEGPVRKVRVNPFRISPYAVTNKQFSTFV 74
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
ATGYVTEAE+FG +FVF L SE+ R ++
Sbjct: 75 EATGYVTEAEQFGWSFVFHLLASEQIRQRV------------------------------ 104
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
P + W G W PEG DS+I RM HPV HVS
Sbjct: 105 -----------VNAPQQVPWWLVVEGAY---------WAKPEGPDSSITDRMEHPVTHVS 144
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
W+DA AYC W G RLPTEAEWE+ RGGL + +PWG+ L G+H N+WQG+FP N
Sbjct: 145 WHDAAAYCAWSGTRLPTEAEWEFAARGGLTGKTYPWGDQLKVDGKHMCNIWQGKFPIKNN 204
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-NPKGPTTGTDKVK 303
A+DGY+ TAPV ++K N +GLYN+ GNVWEW ADW++ +H NP +
Sbjct: 205 ASDGYIGTAPVDAFKPNGYGLYNLSGNVWEWCADWFSPKYHSRTDIDNPLYSEETAKRSI 264
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+GGSYLC+ YC R+R AARS NTPDSS GN+GFR AD
Sbjct: 265 RGGSYLCHRSYCNRYRIAARSSNTPDSSTGNIGFRVVAD 303
>gi|414344129|ref|YP_006985650.1| hypothetical protein B932_3174 [Gluconobacter oxydans H24]
gi|411029464|gb|AFW02719.1| hypothetical protein B932_3174 [Gluconobacter oxydans H24]
Length = 381
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 194/335 (57%), Gaps = 54/335 (16%)
Query: 12 YKDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
Y+DM L+ +R+G + I I DGE P R+V +DAF++D++ V+N +F FV+ATGYV
Sbjct: 90 YEDMALIGAGAYRVGADDSEIHISDGEGPVRSVEVDAFWIDKYAVTNQKFAAFVAATGYV 149
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE G +FVF A ++ +R G
Sbjct: 150 TEAETEGWSFVF---------AGVATATGKKRRLRG------------------------ 176
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
+P + W RG W HPEG S I R +HPVVHVSW+DA A
Sbjct: 177 ------SVP-QTPWWLPVRGA---------QWRHPEGPGSHISSRQDHPVVHVSWHDAAA 220
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G RLPTEAEWE RG +EN +PWGN L P GEH+ N+WQG FP NT DGY+
Sbjct: 221 YAAWVGKRLPTEAEWEIAARGRMENFRYPWGNELLPDGEHQCNIWQGRFPEENTKEDGYV 280
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS----YNPKGPTTGTDKVKKGG 306
+TAPV ++ N +GLYNMVGN WEW +D+W+ H A + +NP GP GTD+V +GG
Sbjct: 281 TTAPVTAFPPNGYGLYNMVGNTWEWCSDFWSPVWHQAENALTRHNPAGPPEGTDRVIRGG 340
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
S++C+ YC R+R +AR+ TPDSS G++GFRCA+
Sbjct: 341 SHMCHHSYCNRYRNSARTHTTPDSSLGHIGFRCAS 375
>gi|302547019|ref|ZP_07299361.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptomyces hygroscopicus ATCC 53653]
gi|302464637|gb|EFL27730.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptomyces himastatinicus ATCC 53653]
Length = 327
Score = 269 bits (688), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 182/328 (55%), Gaps = 54/328 (16%)
Query: 16 VLLPGDTFRMGTNKPIL-IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+ +PG TF MG++ P DGE P R VT+ F +D H V+N +F FV ATGY TEAE
Sbjct: 44 IRIPGGTFLMGSDDPDANPDDGEGPVRAVTVAPFAIDAHCVTNERFAAFVEATGYRTEAE 103
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FG ++VF S E R R
Sbjct: 104 RFGWSYVFATFASGELR------------------------------------------R 121
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+ P W G +G W PEG S + R +HPVVHVSW DA A+C W
Sbjct: 122 ISARPKSTPWWCGVQGAY---------WAAPEGPGSDLSGRSDHPVVHVSWTDASAFCQW 172
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G+RLPTE EWEY RGGLE + FPWG+ LTP GEHR N+WQG FP++NTA DGYL TAP
Sbjct: 173 EGSRLPTEPEWEYAARGGLEGKRFPWGDELTPGGEHRCNIWQGRFPSHNTAEDGYLGTAP 232
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS--YNPKGPTTGTDKVKKGGSYLCNE 312
+Y+ N +GLYN+ GNVWEW AD W A P P G ++V +GGSYLC++
Sbjct: 233 ATAYQPNGYGLYNVAGNVWEWCADRWATTPSRAEGSCTTPADPRQGAERVLRGGSYLCHD 292
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
YC R+R AAR+ N+PDSS+GN GFR A
Sbjct: 293 SYCNRYRVAARTSNSPDSSSGNTGFRVA 320
>gi|21225812|ref|NP_631591.1| hypothetical protein SCO7548 [Streptomyces coelicolor A3(2)]
gi|11228445|emb|CAC16428.1| conserved hypothetical protein [Streptomyces coelicolor A3(2)]
Length = 314
Score = 268 bits (685), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 186/339 (54%), Gaps = 53/339 (15%)
Query: 13 KDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ V LPG F MG DGE P V L F++D+ V+N +F FV ATG+VT
Sbjct: 24 RGQVRLPGGEFAMGDAFGEGYPADGETPVHTVRLRPFHIDETAVTNARFAAFVKATGHVT 83
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+AE+FG + VF +++ + +
Sbjct: 84 DAERFGSSAVFHLVVAAPDADVLGSA---------------------------------- 109
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
A W RG W PEG S I R NHPVVHVSWNDA AY
Sbjct: 110 --------AGAPWWINVRGA---------HWRRPEGARSDITGRPNHPVVHVSWNDATAY 152
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLPTEAEWEY RGGL R + WG+ LTP G R N+WQG FP NTA DG+LS
Sbjct: 153 ARWAGKRLPTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLS 212
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
TAPV SY+ N GL+N GNVWEW +DW++ ++ +P+ +P GP TG +V +GGSYLC
Sbjct: 213 TAPVKSYRPNGHGLWNTAGNVWEWCSDWFSPTYYAESPTVDPHGPGTGAARVLRGGSYLC 272
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
++ YC R+R AARS NTPDSS+GNLGFRCA D T+G+
Sbjct: 273 HDSYCNRYRVAARSSNTPDSSSGNLGFRCANDADLTSGS 311
>gi|448406821|ref|ZP_21573253.1| Non-specific serine/threonine protein kinase [Halosimplex
carlsbadense 2-9-1]
gi|445676627|gb|ELZ29144.1| Non-specific serine/threonine protein kinase [Halosimplex
carlsbadense 2-9-1]
Length = 335
Score = 268 bits (684), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 145/338 (42%), Positives = 189/338 (55%), Gaps = 56/338 (16%)
Query: 11 RYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
R MV + FRMGT+ + +DGE P+R V DA+Y+D++ V+N +F EFV TG+
Sbjct: 46 RTDRMVRVDAGRFRMGTDSDVGFPEDGEGPAREVATDAYYVDRYAVTNAEFLEFVRETGH 105
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
T+AE+FG +FVFE L+E +R +
Sbjct: 106 TTDAERFGWSFVFEDFLAESDREHVRD--------------------------------- 132
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTI--EHRMNHPVVHVSWND 187
R P A+W G W HP G S++ + + HPV HVS D
Sbjct: 133 -------RAPG-ADWWVAVAGA---------DWFHPRGPSSSVVADDLLTHPVTHVSHRD 175
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G RLPTEAEWE RGG + FPWG++L P GEHR NVWQGEFP NTAAD
Sbjct: 176 AAAYADWAGKRLPTEAEWERAARGGRKGTRFPWGDHLEPDGEHRCNVWQGEFPERNTAAD 235
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKGPTTGTDKVKK 304
GY +TAPV ++ N GL+N+ GNVWEW DW++ +H + NP GP G ++V +
Sbjct: 236 GYRATAPVDAFDPNDSGLFNVCGNVWEWCRDWFDAEYHTTSASDRENPTGPGDGDERVMR 295
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGS+LC+E +C R+R AARS+N PDSS GN+GFRC D
Sbjct: 296 GGSHLCHESWCNRYRLAARSKNGPDSSTGNIGFRCVVD 333
>gi|322435825|ref|YP_004218037.1| hypothetical protein AciX9_2214 [Granulicella tundricola MP5ACTX9]
gi|321163552|gb|ADW69257.1| protein of unknown function DUF323 [Granulicella tundricola
MP5ACTX9]
Length = 324
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 149/336 (44%), Positives = 184/336 (54%), Gaps = 51/336 (15%)
Query: 7 PPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
PP ++ LPG F MGT+ + DGE P R V L F +D + V+N F FV
Sbjct: 35 PPSPDASRIISLPGGEFLMGTDYEEAFAADGERPVRRVHLKPFQIDIYPVTNADFAAFVQ 94
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
AT Y TEAE +G +FVF + + +R +
Sbjct: 95 ATHYKTEAELYGWSFVFMNHIPKLQRGSLV-----------------------------T 125
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
D VA W W C+ P W PEG S + R +HPVVHV+W
Sbjct: 126 DTVAGAEW----------W---CQ-------VPGAVWNAPEGPGSDVARRSDHPVVHVTW 165
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
NDA+AYC W G RLPTEAEWEY RGGL +L+PWGN L P GEHR N+WQG FP +TA
Sbjct: 166 NDALAYCDWSGNRLPTEAEWEYAARGGLVQKLYPWGNQLRPNGEHRCNIWQGTFPHEDTA 225
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKK 304
DGY T+PV +Y N FGLY++ GN WEW DW+ + + +P GP TGT +V K
Sbjct: 226 EDGYAGTSPVKAYPPNGFGLYSITGNAWEWCMDWFGIPDSVDTVTEDPVGPPTGTVRVMK 285
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GGS+LC++ YC R+R AARS NTPDSS NLGFRCA
Sbjct: 286 GGSFLCHKSYCNRYRVAARSSNTPDSSTSNLGFRCA 321
>gi|171848817|pdb|2Q17|A Chain A, Formylglycine Generating Enzyme From Streptomyces
Coelicolor
gi|171848818|pdb|2Q17|B Chain B, Formylglycine Generating Enzyme From Streptomyces
Coelicolor
gi|171848819|pdb|2Q17|C Chain C, Formylglycine Generating Enzyme From Streptomyces
Coelicolor
gi|171848820|pdb|2Q17|D Chain D, Formylglycine Generating Enzyme From Streptomyces
Coelicolor
gi|171848821|pdb|2Q17|E Chain E, Formylglycine Generating Enzyme From Streptomyces
Coelicolor
Length = 346
Score = 267 bits (683), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 186/339 (54%), Gaps = 53/339 (15%)
Query: 13 KDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ V LPG F MG DGE P V L F++D+ V+N +F FV ATG+VT
Sbjct: 56 RGQVRLPGGEFAMGDAFGEGYPADGETPVHTVRLRPFHIDETAVTNARFAAFVKATGHVT 115
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+AE+FG + VF +++ + +
Sbjct: 116 DAERFGSSAVFHLVVAAPDADVLGSA---------------------------------- 141
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
A W RG W PEG S I R NHPVVHVSWNDA AY
Sbjct: 142 --------AGAPWWINVRGA---------HWRRPEGARSDITGRPNHPVVHVSWNDATAY 184
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLPTEAEWEY RGGL R + WG+ LTP G R N+WQG FP NTA DG+LS
Sbjct: 185 ARWAGKRLPTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLS 244
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
TAPV SY+ N GL+N GNVWEW +DW++ ++ +P+ +P GP TG +V +GGSYLC
Sbjct: 245 TAPVKSYRPNGHGLWNTAGNVWEWCSDWFSPTYYAESPTVDPHGPGTGAARVLRGGSYLC 304
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
++ YC R+R AARS NTPDSS+GNLGFRCA D T+G+
Sbjct: 305 HDSYCNRYRVAARSSNTPDSSSGNLGFRCANDADLTSGS 343
>gi|289767033|ref|ZP_06526411.1| conserved hypothetical protein [Streptomyces lividans TK24]
gi|289697232|gb|EFD64661.1| conserved hypothetical protein [Streptomyces lividans TK24]
Length = 329
Score = 267 bits (683), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 186/339 (54%), Gaps = 53/339 (15%)
Query: 13 KDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ V LPG F MG DGE P V L F++D+ V+N +F FV ATG+VT
Sbjct: 39 RGQVRLPGGEFAMGDAFGEGYPADGETPVHTVRLRPFHIDETAVTNARFAAFVKATGHVT 98
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+AE+FG + VF +++ + +
Sbjct: 99 DAERFGSSAVFHLVVAAPDADVLGNA---------------------------------- 124
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
A W RG W PEG S I R NHPVVHVSWNDA A+
Sbjct: 125 --------AGAPWWINVRGA---------HWRRPEGARSDITGRPNHPVVHVSWNDATAF 167
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLPTEAEWEY RGGL R + WG+ LTP G R N+WQG FP NTA DG+LS
Sbjct: 168 ARWAGKRLPTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLS 227
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
TAPV SY+ N GL+N GNVWEW +DW++ ++ +P+ +P GP TG +V +GGSYLC
Sbjct: 228 TAPVKSYRPNGHGLWNTAGNVWEWCSDWFSPTYYAESPTVDPHGPGTGAARVLRGGSYLC 287
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
++ YC R+R AARS NTPDSS+GNLGFRCA D G T G+
Sbjct: 288 HDSYCNRYRVAARSSNTPDSSSGNLGFRCANDAGLTYGS 326
>gi|403069090|ref|ZP_10910422.1| non-specific serine/threonine protein kinase [Oceanobacillus sp.
Ndiop]
Length = 308
Score = 267 bits (682), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 147/334 (44%), Positives = 182/334 (54%), Gaps = 66/334 (19%)
Query: 11 RYKD-MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
RY+D MV LPG MGT +K D E P RNV ++ FY+D H VSN +F+ FV T
Sbjct: 38 RYQDKMVHLPGGDLLMGTEDKEGFPSDREGPIRNVKVEPFYIDNHAVSNREFKRFVDETN 97
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y T AEKFG +FVF L+ E+ Q++
Sbjct: 98 YQTVAEKFGWSFVFYEFLTLEQLNDEEQLQ------------------------------ 127
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
+ W Y + W PEG STIE R++HPVVHVSW DA
Sbjct: 128 ------------DVPWWYAVKDAF---------WYQPEGKGSTIERRLDHPVVHVSWYDA 166
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
A+C W G RLPTE EWE+ RGGLE + +PWG+ LTP G+H N+WQG+FP N+ DG
Sbjct: 167 TAFCDWAGKRLPTEREWEFAARGGLEQKKYPWGDELTPDGKHFCNIWQGKFPQYNSKEDG 226
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
YL TAP S+ N +GLYNM GNVWEW +DW+N + T K +GGSY
Sbjct: 227 YLGTAPAKSFPPNNYGLYNMSGNVWEWCSDWFNQDNQ-------------TQKSMRGGSY 273
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
LC++ YC R+R AARS NT DSSAGN+GFRC D
Sbjct: 274 LCHQSYCNRYRVAARSSNTMDSSAGNIGFRCVRD 307
>gi|297191055|ref|ZP_06908453.1| sulfatase modifying factor 1 [Streptomyces pristinaespiralis ATCC
25486]
gi|197723248|gb|EDY67156.1| sulfatase modifying factor 1 [Streptomyces pristinaespiralis ATCC
25486]
Length = 320
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 153/343 (44%), Positives = 188/343 (54%), Gaps = 55/343 (16%)
Query: 4 LPAPPVER--YKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
+P P R + V LPG F MG DGE P V L+ F++D+ V+N QF
Sbjct: 24 VPGPAAGRRSLRGQVRLPGGEFAMGDAFGEGYPADGETPVHPVRLEPFHIDETAVTNAQF 83
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
FV TG++T+AE++G + VF H VV
Sbjct: 84 ATFVKDTGHITDAERYGSSAVF----------------------------------HLVV 109
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
+ D + A W RG W PEG S I R NHPV
Sbjct: 110 AAANADILGNAA--------GAPWWINVRGA---------HWRRPEGARSDITDRQNHPV 152
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
VHVSWNDAVAY W G RLPTEAEWEY RGGL R + WG+ LTP G R N+WQG FP
Sbjct: 153 VHVSWNDAVAYARWAGKRLPTEAEWEYAARGGLAGRRYAWGDELTPDGRWRCNIWQGRFP 212
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGT 299
NTA DG+L+TAPV +Y+ N FGL+N GNVWEW DW++ ++ AP +P GP TGT
Sbjct: 213 HTNTAEDGHLTTAPVKAYRPNGFGLWNTAGNVWEWCTDWFSPTYYAEAPPDDPCGPQTGT 272
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+V +GGSYLC++ YC R+R AARS NTP+SS+GNLGFRCA D
Sbjct: 273 ARVMRGGSYLCHDSYCNRYRVAARSSNTPESSSGNLGFRCAND 315
>gi|357394061|ref|YP_004908902.1| putative sulfatase-modifying factor [Kitasatospora setae KM-6054]
gi|311900538|dbj|BAJ32946.1| putative sulfatase-modifying factor [Kitasatospora setae KM-6054]
Length = 344
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 150/333 (45%), Positives = 180/333 (54%), Gaps = 58/333 (17%)
Query: 13 KDMVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ ++ LPG FRMG P I DGE P R V L F + VSN +F FV TGYVT
Sbjct: 64 RGLLELPGGAFRMGGQDPDGIAGDGEGPVREVLLGPFAIAATTVSNAEFASFVRETGYVT 123
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+AE+FG +FVFE L E RA+ +V W AV
Sbjct: 124 DAERFGSSFVFEGFLPERLRARSPRV----------------------AATPWWRAVEGA 161
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
WR PEG S R HPVVHVSWNDA AY
Sbjct: 162 AWR-----------------------------TPEGPGSGFAARQQHPVVHVSWNDAQAY 192
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
C W G RLPTEA+WEY RGG E +PWG+ L P G N+WQG+FPT NT + S
Sbjct: 193 CAWSGTRLPTEAQWEYAARGGSEGTRYPWGDELAPGGRELLNIWQGDFPTRNTGR--HRS 250
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGTDKVKKGGS 307
TAPV SY+ N GLYN++GNVWEW AD ++ HH PA +P GP TGT +V +GGS
Sbjct: 251 TAPVRSYRPNGHGLYNVLGNVWEWCADHFSPDHHRPDTPATRQDPLGPPTGTARVMRGGS 310
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
++C+ YC R+R AARS NTPDSS+GN+GFR A
Sbjct: 311 HMCHASYCNRYRLAARSSNTPDSSSGNIGFRVA 343
>gi|209551961|ref|YP_002283877.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209539554|gb|ACI59485.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length = 319
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 151/327 (46%), Positives = 179/327 (54%), Gaps = 54/327 (16%)
Query: 14 DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
D V LPG +F MG T L DGE P VTL F +D V+N F FV+ATGYVTE
Sbjct: 41 DEVPLPGGSFEMGDTFGEGLESDGEGPVHAVTLSPFRIDTTCVTNDHFARFVAATGYVTE 100
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D+ VF P ++ +
Sbjct: 101 AERLSDSAVFAPFVAAD------------------------------------------- 117
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
P + YG RG L R W HP G S IE HPVVH+S DA AYC
Sbjct: 118 ------PADITGGYGARGWLAVRG---ADWRHPFGRYSAIEDSGLHPVVHISHEDAQAYC 168
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWEY R G R FPWGN L P G H ANVWQG FPT NT ADG+L+T
Sbjct: 169 HWAGRSLPTEAEWEYAARAGHAGRRFPWGNELEPEGIHMANVWQGHFPTRNTGADGWLAT 228
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
APV +Y N FGLY+M+GNVWEW ADW+N ++ +P+++P G TG +V +GGSYLC+
Sbjct: 229 APVGTYFPNDFGLYDMIGNVWEWCADWYNRDYYRNSPAHDPAGAATGDRRVTRGGSYLCH 288
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFR 338
YC R+R AAR NTPDS+A NLGFR
Sbjct: 289 RSYCARYRVAARFSNTPDSTACNLGFR 315
>gi|323456776|gb|EGB12642.1| hypothetical protein AURANDRAFT_18589 [Aureococcus anophagefferens]
Length = 352
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 143/333 (42%), Positives = 188/333 (56%), Gaps = 54/333 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVTEA 73
+++ PG+ FRMGT++P + DGE PSR V L AF +D +E +N F FV ATGYVT++
Sbjct: 66 VLIQPGE-FRMGTDRPRIFVDGEAPSRRVALSRAFLMDVYEATNDDFAAFVDATGYVTDS 124
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF LSE RA I + V + W V W
Sbjct: 125 ERFGWSFVFHLELSEARRAAIDKA---------------------VDGVEWWLPVNGSDW 163
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R + P G D R HPVV VSW+DA AYC
Sbjct: 164 RSPQGP---------------------------GSDVFAAKRGAHPVVQVSWHDAAAYCA 196
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTEAEWEY R G ++PWGN LTP G +RAN+WQG+FP +NTA DG+
Sbjct: 197 WRGGRLPTEAEWEYAARDGRSQTMYPWGNELTPGGGYRANLWQGDFPRSNTADDGFSFAG 256
Query: 254 PVMSYK-ENKFGLYNMVGNVWEWTADWW---NVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
PV SY + G+++++GNVWEW +DWW H A +P GP+ G +K+KKGGS+L
Sbjct: 257 PVGSYAPQTATGIHDLIGNVWEWVSDWWYEPEASAHDAVLRDPAGPSAGGEKLKKGGSFL 316
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C++ YC+R+R AAR +N+PDS++ N G RC D
Sbjct: 317 CHKSYCFRYRSAARHKNSPDSASSNNGMRCVRD 349
>gi|418466952|ref|ZP_13037853.1| hypothetical protein SMCF_743 [Streptomyces coelicoflavus ZG0656]
gi|371552421|gb|EHN79668.1| hypothetical protein SMCF_743 [Streptomyces coelicoflavus ZG0656]
Length = 278
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 183/325 (56%), Gaps = 57/325 (17%)
Query: 20 GDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDT 79
GD F G DGE P V L F++D+ V+N +F FV ATG+VT+AE+FG +
Sbjct: 2 GDAFGEG-----YPADGETPVHAVRLKPFHIDETAVTNARFAAFVKATGHVTDAERFGSS 56
Query: 80 FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP 139
VF +++ P + N A
Sbjct: 57 AVFHLVVAA-----------------------------PGADVLGNAA------------ 75
Query: 140 TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARL 199
A W RG W PEG S I R NHPVVHVSWNDA AY W G RL
Sbjct: 76 -GAPWWINVRGA---------HWRRPEGARSDITGRQNHPVVHVSWNDATAYTRWAGKRL 125
Query: 200 PTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK 259
PTEAEWEY RGGL R + WG+ LTP G R N+WQG FP NTA DG+L+TAPV SY+
Sbjct: 126 PTEAEWEYAARGGLAGRRYAWGDELTPGGRWRCNIWQGRFPHVNTAEDGHLTTAPVKSYR 185
Query: 260 ENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRH 318
N GL+N GNVWEW ADW++ ++ +P+ +P+GP TGT +V +GGSYLC++ YC R+
Sbjct: 186 PNGHGLWNTAGNVWEWCADWFSPTYYADSPTTDPRGPATGTTRVLRGGSYLCHDSYCNRY 245
Query: 319 RCAARSQNTPDSSAGNLGFRCAADK 343
R AARS NTPDSS+GNLGFRCA D+
Sbjct: 246 RVAARSSNTPDSSSGNLGFRCANDE 270
>gi|291222787|ref|XP_002731394.1| PREDICTED: sulfatase modifying factor 1-like isoform 2
[Saccoglossus kowalevskii]
Length = 258
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 121/181 (66%), Positives = 137/181 (75%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W HPEG D+ I RM+HPV+HVSWNDAV +C W G RLPTEAE+EY RGG ENR +PWG
Sbjct: 63 WKHPEGPDTNITDRMDHPVLHVSWNDAVEFCKWIGKRLPTEAEFEYASRGGKENRTYPWG 122
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
N L P G+ NVWQG FPT N+A DG+ T+PVM+Y N FGL+NMVGN WEWT DWW
Sbjct: 123 NRLRPNGKWLMNVWQGVFPTENSAEDGFAGTSPVMAYPPNGFGLHNMVGNAWEWTNDWWG 182
Query: 282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
H NPKGP +G DKVKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLGFRC A
Sbjct: 183 TSHTADLYVNPKGPASGIDKVKKGGSYMCHASYCYRYRCAARSQNTPDSSASNLGFRCFA 242
Query: 342 D 342
D
Sbjct: 243 D 243
>gi|390958068|ref|YP_006421825.1| hypothetical protein Terro_2226 [Terriglobus roseus DSM 18391]
gi|390958409|ref|YP_006422166.1| hypothetical protein Terro_2590 [Terriglobus roseus DSM 18391]
gi|390412986|gb|AFL88490.1| hypothetical protein Terro_2226 [Terriglobus roseus DSM 18391]
gi|390413327|gb|AFL88831.1| hypothetical protein Terro_2590 [Terriglobus roseus DSM 18391]
Length = 321
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 151/333 (45%), Positives = 180/333 (54%), Gaps = 52/333 (15%)
Query: 13 KDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+DMVLLPG F MG+ DGE P R VT+DAF++D+ V N F FV AT YVT
Sbjct: 38 EDMVLLPGGRFLMGSESADSFPDDGEGPVRPVTVDAFWMDRFVVRNRDFAPFVDATQYVT 97
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAE+ G +FVF L S H
Sbjct: 98 EAERIGWSFVFAGDLPPNADGSTSVQVHG------------------------------- 126
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
AEW G G +W HPEG S I R HPVVHVSWNDA AY
Sbjct: 127 ----------AEWWQGIEGA---------TWSHPEGPSSDISTRAEHPVVHVSWNDAAAY 167
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLPTEAEWE+ RGGLE +PWG++LTP G+H N+WQG+FPT+NTA DG+ S
Sbjct: 168 AAWAGKRLPTEAEWEFAARGGLEQAAYPWGDDLTPGGKHLCNIWQGDFPTSNTAEDGFAS 227
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
T P ++ N +GL+ M GN WEW ADW + H A +NP GP G +V KGGSYLC
Sbjct: 228 TCPADAFPPNGYGLFGMTGNTWEWIADWSHPTWHQHATRHNPLGPPEGVARVLKGGSYLC 287
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+ YC R+R ARS NTPDS+ N+ FRC DK
Sbjct: 288 HRSYCNRYRVGARSSNTPDSATTNISFRCVRDK 320
>gi|403529625|ref|YP_006664364.1| formylglycine-generating enzyme [Arthrobacter sp. Rue61a]
gi|403231905|gb|AFR31326.1| putative formylglycine-generating enzyme [Arthrobacter sp. Rue61a]
Length = 337
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 147/323 (45%), Positives = 179/323 (55%), Gaps = 57/323 (17%)
Query: 20 GDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDT 79
GD F G +DGE P VTLDAF +D V+N F FV ATGY TEAE++G +
Sbjct: 58 GDAFGEG-----YPEDGESPVHLVTLDAFRIDATAVTNNMFGTFVQATGYRTEAEQYGSS 112
Query: 80 FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP 139
VF L S HD D +
Sbjct: 113 AVFHLL---------STANHD-------------------------DVLGTAA------- 131
Query: 140 TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARL 199
A W RG W HP G S ++HPVVHVS+NDA+AYC+W G RL
Sbjct: 132 -GAPWWLNVRGA---------DWAHPTGPASDFREFLDHPVVHVSYNDALAYCSWAGRRL 181
Query: 200 PTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK 259
P+EAEWEY RGGLE++ + WGN L P GEHR N+WQG FPT NT DG+L T+PV S+
Sbjct: 182 PSEAEWEYAARGGLESKRYAWGNELMPGGEHRCNIWQGTFPTANTLKDGHLGTSPVRSFP 241
Query: 260 ENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRH 318
N +GLY M GNVWEW ADW+ ++ +P NPKGPT G +V +GGSYLC++ YC R+
Sbjct: 242 CNGYGLYEMAGNVWEWCADWFLPKYYRNSPVNNPKGPTIGAGRVMRGGSYLCHDSYCNRY 301
Query: 319 RCAARSQNTPDSSAGNLGFRCAA 341
R AAR+ NTPDSS+GN GFR A
Sbjct: 302 RVAARTSNTPDSSSGNCGFRTVA 324
>gi|374313129|ref|YP_005059559.1| sulfatase-modifying factor protein [Granulicella mallensis
MP5ACTX8]
gi|358755139|gb|AEU38529.1| Sulphatase-modifying factor protein [Granulicella mallensis
MP5ACTX8]
Length = 322
Score = 259 bits (662), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 194/344 (56%), Gaps = 52/344 (15%)
Query: 3 LLPAPPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
L+ P + + + LPG TF MGTN + DGE P R V+L AF +D + V+N F
Sbjct: 27 LVYTPGLAPNRATIPLPGGTFLMGTNYERGFPADGEGPVRPVSLSAFDIDTYPVTNADFA 86
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
F++AT Y TEAE FG +FVF + EE RFE +E
Sbjct: 87 AFIAATNYRTEAEVFGWSFVFWSHIPEE-------------RFE---EVVE--------- 121
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
D VA W W C+ P +W HPEG S ++ R NHPVV
Sbjct: 122 ----DTVAMTPW----------W---CK-------VPGATWNHPEGPGSDVKDRQNHPVV 157
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
HVSWNDA AY W G LPTEA+WEY RGGLE +L+PWG+ LTP+G H N+WQG+FP
Sbjct: 158 HVSWNDAAAYAAWAGRALPTEAQWEYAARGGLEQKLYPWGDELTPKGRHLCNIWQGQFPR 217
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK 301
+TA DGY + PV S+ N +GLY++ GNVWEW ADW+ + S +P GP G K
Sbjct: 218 EDTAEDGYAGSCPVDSFPPNGYGLYSVTGNVWEWCADWFGTTF--STSQDPVGPPVGEAK 275
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
V KGGS+LC+ YC R+R AAR+ NTPDSSA N+GFRC P
Sbjct: 276 VMKGGSFLCHASYCNRYRVAARTSNTPDSSASNIGFRCVGKAHP 319
>gi|377567711|ref|ZP_09796918.1| putative sulfatase-modifying factor [Gordonia terrae NBRC 100016]
gi|377535109|dbj|GAB42083.1| putative sulfatase-modifying factor [Gordonia terrae NBRC 100016]
Length = 271
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/308 (45%), Positives = 170/308 (55%), Gaps = 52/308 (16%)
Query: 34 KDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAK 93
+DGE P V L AF L V+N QF FV TG+VT AE+FG + VF
Sbjct: 11 QDGEGPVHEVDLPAFRLAATPVTNAQFATFVKETGHVTTAEQFGVSAVFH---------- 60
Query: 94 ISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 153
F G S I HR+ + W +G
Sbjct: 61 --------LAFAGDSSAILHRLE------------------------QTPWWLAVKGA-- 86
Query: 154 NRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGL 213
W HP G + I R NHPVVHVSW DA AYC W G RLPTEAEWEY RGG
Sbjct: 87 -------DWKHPHGPGTNISDRQNHPVVHVSWADATAYCDWAGTRLPTEAEWEYAARGGT 139
Query: 214 ENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVW 273
+ R +PWG+ LTP G+ R N+WQG FP NT DG+L+TAPV SY+ N FGLY M GNVW
Sbjct: 140 DGRRYPWGDELTPHGQWRCNIWQGSFPVENTEDDGHLTTAPVKSYRSNGFGLYQMAGNVW 199
Query: 274 EWTADWW-NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 332
EW DW+ N ++ + +P GPT G +V +GGSYLC++ YC R+R +ARS NTP+S+A
Sbjct: 200 EWCQDWFDNTYYERSQRVSPTGPTDGERRVMRGGSYLCHDSYCNRYRISARSSNTPESTA 259
Query: 333 GNLGFRCA 340
N+GFRCA
Sbjct: 260 ANIGFRCA 267
>gi|374983353|ref|YP_004958848.1| Non-specific serine/threonine protein kinase [Streptomyces
bingchenggensis BCW-1]
gi|297154005|gb|ADI03717.1| Non-specific serine/threonine protein kinase [Streptomyces
bingchenggensis BCW-1]
Length = 279
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 171/321 (53%), Gaps = 54/321 (16%)
Query: 26 GTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPL 85
G + +DGE P R V L F +D V+N QF FV A+GY T+AE++G ++VF L
Sbjct: 3 GDDADAFPEDGEGPVRAVRLSPFLIDATAVTNRQFAAFVRASGYRTDAERYGWSYVFYAL 62
Query: 86 LSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWE 145
+ R H + V W AV E
Sbjct: 63 VHPGAR---------------------HLVRDTVAQAPWWLAV----------------E 85
Query: 146 YGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEW 205
C W P G S+ NHPVVHVSW DA AY W G RLPTEAEW
Sbjct: 86 GAC-------------WQAPYGPGSSWTELSNHPVVHVSWRDASAYAAWAGKRLPTEAEW 132
Query: 206 EYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGL 265
E RGGLE FPWG+ L PRG+HR N+WQG FP NT DGYL TAPV +++ N FGL
Sbjct: 133 ERAARGGLERSRFPWGDELLPRGQHRCNIWQGRFPQVNTGEDGYLGTAPVKTFRPNNFGL 192
Query: 266 YNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCA 321
YN GNVWEW +DWW+ H P +P GP G KV +GGSYLC+ YC R+R A
Sbjct: 193 YNTSGNVWEWCSDWWSTTWHAADRPETRQDPAGPPAGEAKVIRGGSYLCHASYCNRYRVA 252
Query: 322 ARSQNTPDSSAGNLGFRCAAD 342
AR+ NTPDSS G++GFRCAAD
Sbjct: 253 ARTSNTPDSSTGHMGFRCAAD 273
>gi|225709012|gb|ACO10352.1| Sulfatase-modifying factor 1 precursor [Caligus rogercresseyi]
Length = 313
Score = 258 bits (660), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 183/331 (55%), Gaps = 68/331 (20%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ V + G TF GTN P+ + D E PS ++ F +D +V+N +F+ FV TGYVT
Sbjct: 45 FTSPVFIAGGTFNKGTNTPVFMADLEGPSTPTSVSDFLMDPTQVTNGEFEAFVRDTGYVT 104
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAE +G++FV LS S+V+ + + L
Sbjct: 105 EAEGYGNSFVMGYFLS-------SEVKSSVSQSAQL------------------------ 133
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
A W G SW PEG S + + +HPVVHVSWNDA Y
Sbjct: 134 ----------APWWLKVPGA---------SWREPEGPGS--DFKGDHPVVHVSWNDANHY 172
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
C W G RLP+EAEWEY CR G E+RLFPWGN +P+G+ AN+W G+FP+ N A DGY S
Sbjct: 173 CDWAGKRLPSEAEWEYACRSGKEDRLFPWGNKWSPKGQALANIWTGDFPSQNDALDGYAS 232
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
T PV Y N +GLY+M+GNVWEWT+D W G +VKKGGS++C+
Sbjct: 233 TNPVGFYPPNAYGLYDMIGNVWEWTSDVWR----------------GKSRVKKGGSFMCH 276
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+ +CYR+RCAARSQNT D+SA NLGFRC +D
Sbjct: 277 KSHCYRYRCAARSQNTEDTSAHNLGFRCVSD 307
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 31/46 (67%), Positives = 40/46 (86%)
Query: 348 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
G +VKKGGS++C++ +CYR+RCAARSQNT D+SA NLGFRC +DV
Sbjct: 263 GKSRVKKGGSFMCHKSHCYRYRCAARSQNTEDTSAHNLGFRCVSDV 308
>gi|408682275|ref|YP_006882102.1| Sulfatase modifying factor 1 precursor [Streptomyces venezuelae
ATCC 10712]
gi|328886604|emb|CCA59843.1| Sulfatase modifying factor 1 precursor [Streptomyces venezuelae
ATCC 10712]
Length = 299
Score = 258 bits (658), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 149/328 (45%), Positives = 181/328 (55%), Gaps = 53/328 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV LPG F MGT+ + DGE P R V L F +D V+N F ++V+ TGYVTEA
Sbjct: 20 MVCLPGGEFLMGTDDAEGFRGDGEGPVRTVRLRPFLIDTCAVTNDAFAQYVADTGYVTEA 79
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG ++VF L R
Sbjct: 80 ERFGWSYVFAGFLPAALR------------------------------------------ 97
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
RGA P W G G NR PEG ST++ R +HPVVHVSWNDA AY
Sbjct: 98 RGAYRPERTPWWCGVGGAAWNR---------PEGPGSTLDGRGDHPVVHVSWNDAAAYAA 148
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGGLE + +PWG+ L P G H N+W+G FP NTA DG+ TA
Sbjct: 149 WAGKRLPTEAEWEYAARGGLEQKRYPWGDELDPDGAHHCNIWRGTFPAKNTATDGFRGTA 208
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNE 312
PV +++ N +GLYN GNVWEW ADWW H P +P GP TGT KV +GGS+LC+
Sbjct: 209 PVDAFEPNGYGLYNTSGNVWEWCADWWTTDHPTHRPLADPTGPATGTAKVIRGGSHLCHR 268
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
YC R+R AAR+ NT DS++G+ GFRCA
Sbjct: 269 SYCNRYRVAARTANTSDSTSGHAGFRCA 296
>gi|270008071|gb|EFA04519.1| hypothetical protein TcasGA2_TC016314 [Tribolium castaneum]
Length = 292
Score = 257 bits (657), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 132/267 (49%), Positives = 160/267 (59%), Gaps = 49/267 (18%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMVL+ TF MGT+KP+ D E P+RNV++D+FYLD+ EVSN +F EFV TGY TEA
Sbjct: 63 DMVLIEAATFEMGTDKPVFESDHEGPARNVSVDSFYLDKFEVSNGKFSEFVLKTGYKTEA 122
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FGD+F+FE L+ E ER K R
Sbjct: 123 EEFGDSFIFEMLVPESEREKYKDFR----------------------------------- 147
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+A W +G +W HPEG STI R++HPV+HVSW DA YC
Sbjct: 148 -----AVQAPWWIKMKGV---------TWKHPEGEGSTINDRLDHPVIHVSWTDAKKYCE 193
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
+ G RLPTEAEWE CR GL +L+PWGN LTP+GEH AN+WQG+FP NTA DGYLSTA
Sbjct: 194 FVGKRLPTEAEWEMACRAGLRQKLYPWGNKLTPKGEHWANIWQGDFPLTNTAEDGYLSTA 253
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWW 280
PV S+ NKFGL++M GNVWEWT D W
Sbjct: 254 PVDSFPANKFGLHHMAGNVWEWTQDNW 280
>gi|441506956|ref|ZP_20988884.1| putative sulfatase-modifying factor [Gordonia aichiensis NBRC
108223]
gi|441449021|dbj|GAC46845.1| putative sulfatase-modifying factor [Gordonia aichiensis NBRC
108223]
Length = 327
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 142/327 (43%), Positives = 180/327 (55%), Gaps = 57/327 (17%)
Query: 15 MVLLPGDTFRMGTNKPIL-IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ LP F MG + P DGE P R+V +DAF + H V+N +F EFV+++ YVT+A
Sbjct: 56 MISLPTTDFWMGDDGPSSHAADGEGPVRSVRVDAFAISAHAVTNAEFDEFVTSSEYVTDA 115
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF L + +R R EG W AV TW
Sbjct: 116 ERFGWSFVFAGFLP-------AALRRASPRPEG---------------TPWWCAVTDATW 153
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R PEG S + R +HPVVHVSWNDA A+C
Sbjct: 154 RA-----------------------------PEGPGSDVSERADHPVVHVSWNDARAFCR 184
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGGL +PWG+ L R N+W+G FP NTAADGY STA
Sbjct: 185 WAGVRLPTEAEWEYAARGGLSRARYPWGDELDEDESFRCNIWRGSFPARNTAADGYRSTA 244
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV +++ N FGLYNMVGNVWEWTAD W V H P P+ + K +GGS++C++
Sbjct: 245 PVEAFEPNGFGLYNMVGNVWEWTADAWRVDHSGTPHEAPE-----SRKAMRGGSHMCHDS 299
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCA 340
YC R+R +AR+ N+ D S+GN+GFR A
Sbjct: 300 YCNRYRVSARTSNSADGSSGNIGFRIA 326
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 20/40 (50%), Positives = 31/40 (77%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
K +GGS++C++ YC R+R +AR+ N+ D S+GN+GFR A
Sbjct: 287 KAMRGGSHMCHDSYCNRYRVSARTSNSADGSSGNIGFRIA 326
>gi|25028125|ref|NP_738179.1| hypothetical protein CE1569 [Corynebacterium efficiens YS-314]
gi|23493409|dbj|BAC18379.1| conserved hypothetical protein [Corynebacterium efficiens YS-314]
Length = 334
Score = 256 bits (655), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/336 (43%), Positives = 189/336 (56%), Gaps = 53/336 (15%)
Query: 10 ERYKDMVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
E+ +D V LPG F MG + DGE P V L F ++ V+N +F F+ ATG
Sbjct: 49 EQSRDAVTLPGGAFHMGDHHGEGYPADGEGPVHEVHLAPFGINVTTVTNAEFGRFIEATG 108
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y T AE++G + VF + +RA I +++ G + W AV
Sbjct: 109 YTTTAERYGVSAVFYAAF-QGQRADI------LRQVPG---------------VPWWLAV 146
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
+GA +W P G ST++ +HPVVHVSW+DA
Sbjct: 147 -----KGA------------------------NWQRPNGPGSTLDGLEDHPVVHVSWDDA 177
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
VAYCTW G RLPTEAEWEY RGGL+ + WG+NL G N+WQG FP NTAADG
Sbjct: 178 VAYCTWAGGRLPTEAEWEYAARGGLQGARYAWGDNLALDGRWNCNIWQGGFPMENTAADG 237
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGS 307
YL+TAPV +Y N +GL+ M GNVWEW DW++ ++ A S NP+GP TG +V +GGS
Sbjct: 238 YLTTAPVKTYTPNGYGLWQMAGNVWEWCQDWFDAEYYSRASSINPRGPDTGARRVMRGGS 297
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
YLC++ YC R+R AAR+ NTPDS++GN GFRC D
Sbjct: 298 YLCHDSYCNRYRVAARNSNTPDSTSGNTGFRCVFDS 333
>gi|326334164|ref|ZP_08200388.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Nocardioidaceae bacterium Broad-1]
gi|325948050|gb|EGD40166.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Nocardioidaceae bacterium Broad-1]
Length = 326
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 184/341 (53%), Gaps = 57/341 (16%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
PA P +R + +V LPG T+ G + DGE P V L F LD V+N +F F
Sbjct: 24 PALPRKRRRTLVDLPGGTYAAGDSFGEGYQADGEGPVHQVELPGFRLDATAVTNEEFARF 83
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
V ATG+VT AE+ G + VF + R +
Sbjct: 84 VKATGHVTTAEREGFSAVFHLAFEGDPRDVL----------------------------- 114
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHV 183
R P EA W G RG SW PEG S + R NHPVVHV
Sbjct: 115 ------------GRSP-EAPWWLGVRGA---------SWRTPEGPGSNVGRRGNHPVVHV 152
Query: 184 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
S +DA+AYC W G RLPTEAEWEY RGG FPWG++L G H ANVWQG FP N
Sbjct: 153 SHDDALAYCDWAGTRLPTEAEWEYAARGGRAGSRFPWGDDLEQDGTHHANVWQGVFPDTN 212
Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTG 298
TA DG+++TAPV +Y N FGL+ M+GNVWEW ADW++ ++ +P+GP G
Sbjct: 213 TADDGFVTTAPVRTYDPNGFGLFQMIGNVWEWCADWFSPTYYNDLVEAGQPSDPRGPAEG 272
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGSYLC++ YC+R+R +ARS + PDS+AGNLGFRC
Sbjct: 273 LTRVMRGGSYLCHDSYCHRYRLSARSSSPPDSTAGNLGFRC 313
>gi|373850102|ref|ZP_09592903.1| Sulphatase-modifying factor protein [Opitutaceae bacterium TAV5]
gi|372476267|gb|EHP36276.1| Sulphatase-modifying factor protein [Opitutaceae bacterium TAV5]
Length = 324
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 150/332 (45%), Positives = 183/332 (55%), Gaps = 54/332 (16%)
Query: 15 MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M LLPG F MG + DGE P V LD F++D V+N QF +FV ATGY TEA
Sbjct: 41 MRLLPGGVFLMGNERDYGFPSDGEGPVHEVKLDPFWIDATTVTNRQFCDFVDATGYRTEA 100
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF LS ++ A ++
Sbjct: 101 ERFGWSFVFYGHLSGKQLASALRL------------------------------------ 124
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
T A E+ CR +E +W HPEG S I R HPVV VSWNDAVAY +
Sbjct: 125 ------TVAGSEWWCR--VEG-----ATWRHPEGPGSHIRKRWQHPVVQVSWNDAVAYAS 171
Query: 194 WRGARLPTEAEWEYGCRGGL--ENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLPTEAEWEY R GL NR FPWG+ L P G H NVWQG+FP NT ADGY
Sbjct: 172 WAGKRLPTEAEWEYAARSGLPHANR-FPWGDELEPGGRHMMNVWQGDFPVRNTEADGYYG 230
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
AP S++ + +GLYNM GNVWEW D ++ + +P+ NP GPT+G + +GGSYLC
Sbjct: 231 VAPARSFRASAWGLYNMTGNVWEWCHDRFSGTWYARSPAENPAGPTSGERRSMRGGSYLC 290
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+ YC R+R AR+ NTPDS+ NLGFRC D
Sbjct: 291 HASYCNRYRVDARNSNTPDSATTNLGFRCVRD 322
>gi|431913504|gb|ELK15179.1| Sulfatase-modifying factor 1 [Pteropus alecto]
Length = 355
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 128/278 (46%), Positives = 163/278 (58%), Gaps = 51/278 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MGT+ P + DGE P+R VT+DAFY+D +EVSN +F++FV++TGY+TE
Sbjct: 82 MVLIPAGVFTMGTDDPQIKPDGEAPARRVTMDAFYMDAYEVSNAEFEKFVNSTGYLTEVT 141
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
G + G +++ P +S
Sbjct: 142 GPG------------------------RGRTGAAASVHEPRTAPATCVS----------- 166
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
A W +G W HPEG DST+ HR +HPV+HVSWNDA AYC W
Sbjct: 167 -------APWWLPVKGA---------DWRHPEGPDSTVVHRPDHPVLHVSWNDAAAYCAW 210
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEYGCRGGL NRLFPWGN L P+G+H AN+WQGEFP ++ DG+ TAP
Sbjct: 211 AGKRLPTEAEWEYGCRGGLHNRLFPWGNKLQPKGQHYANIWQGEFPVTDSGEDGFRGTAP 270
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
V ++ N +GLYN+VGNVWEWT+DWW VHH ++NP
Sbjct: 271 VDAFPPNAYGLYNVVGNVWEWTSDWWTVHHTVEETFNP 308
>gi|391227612|ref|ZP_10263819.1| hypothetical protein OpiT1DRAFT_00096 [Opitutaceae bacterium TAV1]
gi|391223105|gb|EIQ01525.1| hypothetical protein OpiT1DRAFT_00096 [Opitutaceae bacterium TAV1]
Length = 324
Score = 254 bits (648), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 181/335 (54%), Gaps = 60/335 (17%)
Query: 15 MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M LLPG F MG + DGE P V LD F++D V+N QF +FV ATGY TEA
Sbjct: 41 MRLLPGGVFLMGNERDYGFPSDGEGPVHEVKLDPFWIDATTVTNRQFCDFVDATGYRTEA 100
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF LS ++ A ++
Sbjct: 101 ERFGWSFVFYGHLSGKQLASALRL------------------------------------ 124
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
T A E+ CR +E +W HPEG S I R HPVV VSWNDAVAY +
Sbjct: 125 ------TVAGSEWWCR--VEG-----ATWRHPEGPGSHIRKRWQHPVVQVSWNDAVAYAS 171
Query: 194 WRGARLPTEAEWEYGCRGGL--ENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLPTEAEWEY R GL NR FPWG+ L P G H NVWQG+FP NT ADGY
Sbjct: 172 WAGKRLPTEAEWEYAARSGLPHANR-FPWGDELEPGGRHMMNVWQGDFPVRNTEADGYYG 230
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTAD----WWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
AP S++ + +GLYNM GNVWEW D W H +P+ NP GP +G + +GGS
Sbjct: 231 VAPARSFRASAWGLYNMTGNVWEWCHDRFSGTWYAH---SPAENPAGPASGERRSMRGGS 287
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YLC+ YC R+R AR+ NTPDS+ NLGFRC D
Sbjct: 288 YLCHASYCNRYRVDARNSNTPDSATTNLGFRCVRD 322
>gi|408682476|ref|YP_006882303.1| Sulfatase modifying factor 1 precursor [Streptomyces venezuelae
ATCC 10712]
gi|328886805|emb|CCA60044.1| Sulfatase modifying factor 1 precursor [Streptomyces venezuelae
ATCC 10712]
Length = 395
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 185/349 (53%), Gaps = 73/349 (20%)
Query: 1 MVLLPAP------PVERYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQH 53
+ L PAP P + ++ LPG F MGT+ P DGE P R + F +
Sbjct: 107 LALAPAPRGHVADPERAARRLIDLPGGRFLMGTDDPEGFPADGEGPVREAEVGPFRIAPT 166
Query: 54 EVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
V+N QF FV ATG+VTEAE FG +FVF LS+E A V
Sbjct: 167 TVTNAQFAAFVEATGHVTEAEHFGFSFVFGGFLSDEVAAASPSV---------------- 210
Query: 114 RMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIE 173
+ W AV TWR HPEG S++
Sbjct: 211 ------AAVPWWRAVTGATWR-----------------------------HPEGAGSSLA 235
Query: 174 HRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
R +HPVVHVSWNDA AYC W G RLPTEAEWEY RGGLE R +PWG++L P G H N
Sbjct: 236 TRQDHPVVHVSWNDAQAYCAWSGTRLPTEAEWEYAARGGLEQRRYPWGDDLAPDGRHMCN 295
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK 293
+W+G+FPT NTA DG+ TAP S++ N FGLYN VGNVWEW AD +
Sbjct: 296 IWRGDFPTLNTAEDGWAGTAPAKSFRPNGFGLYNAVGNVWEWCADRF------------- 342
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
T +V +GGSYLC++ YC R+R AARS NTPDSS GN+GFR AAD
Sbjct: 343 --ARDTGRVMRGGSYLCHDSYCNRYRVAARSSNTPDSSTGNIGFRVAAD 389
>gi|371778242|ref|ZP_09484564.1| sulfatase-modifying factor protein [Anaerophaga sp. HS1]
Length = 372
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 184/342 (53%), Gaps = 48/342 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMVL+PG F MG + K EFP V +D+F +D+H V+N QF+EFV ATGYVT A
Sbjct: 64 DMVLIPGGEFEMGAKENQFAKPDEFPVHKVYVDSFLMDRHPVTNAQFREFVEATGYVTTA 123
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + K K + L P +S +D +A+ W
Sbjct: 124 ERRPDW----------QEIKKQLPPGTPKPPDSLLVPASLVFKSPDHEVSLSDYIAWWEW 173
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P +W HP G S+IE +HPVVHVSW DA+AY T
Sbjct: 174 -----------------------VPGANWRHPLGPGSSIEGLDDHPVVHVSWYDAMAYAT 210
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + ++PWGN P RAN WQG FP N DG+ TA
Sbjct: 211 WAGKRLPTEAEWEYAARGGNNDYIYPWGNE--PVSAQRANFWQGNFPYKNVVEDGFQRTA 268
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTD-------- 300
PV S+ N +GLY+M GNVWEWTADW++ ++ + NP+GP T D
Sbjct: 269 PVGSFPANGYGLYDMAGNVWEWTADWYHPDYYAFLAQQVIALNPQGPETSFDPMEPGVPK 328
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KV +GGS+LCN+ YC +R +AR +++PDS A +LGFRC D
Sbjct: 329 KVIRGGSFLCNDSYCAGYRSSARMKSSPDSGALHLGFRCVKD 370
>gi|409357737|ref|ZP_11236107.1| hypothetical protein Dali7_07741 [Dietzia alimentaria 72]
Length = 313
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 148/348 (42%), Positives = 183/348 (52%), Gaps = 65/348 (18%)
Query: 3 LLPAPPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
++P R + L+ G +F MG + DGE P V L F +D+ V+N QF
Sbjct: 16 VVPGAGAHRRRRQSLIAGGSFVMGDHFDEGYPDDGERPLHEVRLSPFRMDETAVTNAQFA 75
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
F TGYVTEAE+ G + VF L + RA + LD+T
Sbjct: 76 TFCKDTGYVTEAEQLGVSAVFH-LAVQASRADV---------LHQLDTT----------- 114
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
W RG W HP G S+I NHPVV
Sbjct: 115 ---------------------PWWVAVRGA---------DWRHPGGPASSITDLQNHPVV 144
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
HVSWNDAVAYC W G RLPTEAEWEY RGGL FPWG+ LTPRG NVWQG+FPT
Sbjct: 145 HVSWNDAVAYCAWAGTRLPTEAEWEYAARGGLAGARFPWGDELTPRGRWNLNVWQGDFPT 204
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-----WWNVH-----HHPAPSYN 291
+NT DG+L+TAP +++ N +GL+ MVGNVWEW +D W+ V PAP
Sbjct: 205 HNTLEDGHLTTAPARAFRPNGYGLHQMVGNVWEWCSDTFDPGWYAVSGVDDPSGPAPGEG 264
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P+G G +V +GGSYL ++ YCYR+R AARS NT DS+ GN+GFRC
Sbjct: 265 PEG---GAWRVMRGGSYLGHDSYCYRYRVAARSANTADSATGNIGFRC 309
>gi|408533502|emb|CCK31676.1| Sulfatase-modifying factor 1 [Streptomyces davawensis JCM 4913]
Length = 314
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 150/334 (44%), Positives = 177/334 (52%), Gaps = 54/334 (16%)
Query: 8 PVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
P MVLLPG F MG + DGE P R V L F +D V N +F FV+
Sbjct: 30 PTAATDGMVLLPGGEFLMGAEDSEGFPDDGEGPVRAVRLAPFRIDACAVRNDEFAAFVAD 89
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TG+VT+AE+ ++VF L R
Sbjct: 90 TGHVTDAERLDWSYVFAGFLPAALR----------------------------------- 114
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
RGA P W G +W PEG S + R +HPVVHVSW
Sbjct: 115 -------RGAPRPERTPWWCAVTGA---------AWDRPEGPGSDLAGRGDHPVVHVSWY 158
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA AY W G RLPTEAEWEY RGGLE R +PWG+ L P GEHR N+W+G FPT NTAA
Sbjct: 159 DAAAYAAWAGKRLPTEAEWEYAARGGLEQRRYPWGDELDPDGEHRCNIWRGTFPTKNTAA 218
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
DGY TAPV +++ N FGLYN GNVWEW DWW H S +P GP GT KV +GG
Sbjct: 219 DGYRGTAPVDAFEPNGFGLYNTSGNVWEWCGDWWTTEHR--GSVDPTGPAEGTAKVIRGG 276
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
S++C+ YC R+R AARS NTPDSS+G+ GFRCA
Sbjct: 277 SHMCHASYCNRYRVAARSTNTPDSSSGHTGFRCA 310
>gi|331694601|ref|YP_004330840.1| sulfatase-modifying factor protein [Pseudonocardia dioxanivorans
CB1190]
gi|326949290|gb|AEA22987.1| Sulphatase-modifying factor protein [Pseudonocardia dioxanivorans
CB1190]
Length = 302
Score = 248 bits (633), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 144/328 (43%), Positives = 177/328 (53%), Gaps = 65/328 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+V L G F MG+++ +DGE P+R + F + H VSN F FV ATGY+T AE
Sbjct: 39 LVALAGGEFTMGSDEQRYPEDGEGPTRVARVAPFGIAAHAVSNDDFAAFVDATGYLTTAE 98
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ G +FVF LL + RF S VV W WR
Sbjct: 99 REGWSFVFVGLLPD--------------RFPPTRS---------VVGAPW--------WR 127
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L W PEG S +E R +HPVVHVSW DA YC W
Sbjct: 128 QVHL---------------------ADWRRPEGPRSDLEGRGDHPVVHVSWLDAREYCRW 166
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLP+E EWEY RGGL+ +PWG+ LTP G+HR NV+QG FP ++T ADGY TAP
Sbjct: 167 AGGRLPSETEWEYAARGGLDGARYPWGDELTPGGQHRMNVFQGRFPAHDTGADGYRGTAP 226
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ N +GL+NM GNVWEW AD + GP + +GGSYLC+E Y
Sbjct: 227 VHAFAPNGYGLHNMTGNVWEWCADRF-------------GPGRPGQRNMRGGSYLCHESY 273
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+R+RCAARS NTPDSSAGN+GFR AAD
Sbjct: 274 CWRYRCAARSANTPDSSAGNIGFRMAAD 301
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 37/60 (61%), Positives = 44/60 (73%)
Query: 333 GNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
GN+ CA GP + +GGSYLC+E YC+R+RCAARS NTPDSSAGN+GFR AAD
Sbjct: 242 GNVWEWCADRFGPGRPGQRNMRGGSYLCHESYCWRYRCAARSANTPDSSAGNIGFRMAAD 301
>gi|259507183|ref|ZP_05750083.1| sulfatase-modifying factor 1 [Corynebacterium efficiens YS-314]
gi|259165223|gb|EEW49777.1| sulfatase-modifying factor 1 [Corynebacterium efficiens YS-314]
Length = 271
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 174/310 (56%), Gaps = 52/310 (16%)
Query: 35 DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
DGE P V L F ++ V+N +F F+ ATGY T AE++G + VF A
Sbjct: 12 DGEGPVHEVHLAPFGINVTTVTNAEFGRFIEATGYTTTAERYGVSAVF--------YAAF 63
Query: 95 SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
R D+ R V + W AV +GA
Sbjct: 64 QGQRADILR--------------QVPGVPWWLAV-----KGA------------------ 86
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
+W P G ST++ +HPVVHVSW+DAVAYCTW G RLPTEAEWEY RGGL+
Sbjct: 87 ------NWQRPNGPGSTLDGLEDHPVVHVSWDDAVAYCTWAGGRLPTEAEWEYAARGGLQ 140
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
+ WG+NL G N+WQG FP NTAADGYL+TAPV +Y N +GL+ M GNVWE
Sbjct: 141 GARYAWGDNLALDGRWNCNIWQGGFPMENTAADGYLTTAPVKTYTPNGYGLWQMAGNVWE 200
Query: 275 WTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
W DW++ ++ A S NP+GP TG +V +GGSYLC++ YC R+R AAR+ NTPDS++G
Sbjct: 201 WCQDWFDAEYYSRASSINPRGPDTGARRVMRGGSYLCHDSYCNRYRVAARNSNTPDSTSG 260
Query: 334 NLGFRCAADK 343
N GFRC D
Sbjct: 261 NTGFRCVFDS 270
>gi|116670275|ref|YP_831208.1| hypothetical protein Arth_1714 [Arthrobacter sp. FB24]
gi|116610384|gb|ABK03108.1| protein of unknown function DUF323 [Arthrobacter sp. FB24]
Length = 301
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 139/333 (41%), Positives = 174/333 (52%), Gaps = 54/333 (16%)
Query: 16 VLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
V +P F MG DGE P V LD+F++D V+N F F++A+GY TEAE
Sbjct: 20 VPIPAGAFSMGDAYGEGYPADGEVPVHRVRLDSFHIDATAVTNEMFAAFIAASGYRTEAE 79
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+G + VF LL + +
Sbjct: 80 LYGSSAVFHLLLDAPKEDVLG--------------------------------------- 100
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L A W RG W HP G ++ E HPVVHVS NDA+AYC W
Sbjct: 101 ---LAAGAPWWVNVRGA---------DWAHPGGRNTNWESLRGHPVVHVSHNDALAYCRW 148
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLSTA 253
G RLPTEAEWEY RGGL R + WGN L G+H N+WQG+FP +NT+ DGYL TA
Sbjct: 149 AGRRLPTEAEWEYAARGGLGGRRYVWGNELLGSAGQHLCNIWQGQFPVSNTSEDGYLGTA 208
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
PV S+ N FGLY + GNVWEW +DW++ ++ +P NP GP G +V +GGSYLC+
Sbjct: 209 PVRSFPPNGFGLYEVSGNVWEWCSDWFDPAYYADSPEANPSGPEKGASRVMRGGSYLCHS 268
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
YC R+R AARS NTP+SS+GN GFR A P
Sbjct: 269 SYCSRYRVAARSSNTPESSSGNCGFRTVAVGTP 301
>gi|383778334|ref|YP_005462900.1| hypothetical protein AMIS_31640 [Actinoplanes missouriensis 431]
gi|381371566|dbj|BAL88384.1| hypothetical protein AMIS_31640 [Actinoplanes missouriensis 431]
Length = 328
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 181/342 (52%), Gaps = 62/342 (18%)
Query: 1 MVLLPAPPVERYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
+VL P P DM+ L G F MGT+ DGE P R VT+ F + V+N +
Sbjct: 25 VVLTPGRP--DTGDMIELGGGRFLMGTDDADGFPADGEGPVRAVTVSPFRIAPVAVTNAR 82
Query: 60 FQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV 119
F EFV+ TGY TEAE+ G +FVF L
Sbjct: 83 FAEFVADTGYATEAERAGWSFVFAGL---------------------------------- 108
Query: 120 VHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHP 179
LP + G R R W HPEG S + R +HP
Sbjct: 109 ------------------LPDDFPPTRGVRAAPWWRQVEGADWRHPEGPQSAVGDRADHP 150
Query: 180 VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEF 239
VVHV+ DAVAYC W G RLPTEAEWE+ RGGLE FPWG+ P+G R N+W+G F
Sbjct: 151 VVHVAHTDAVAYCRWAGVRLPTEAEWEFAARGGLEQARFPWGDE-PPQG--RCNIWEGTF 207
Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTG 298
P +T + G TAPV SY+ N +GLY+M GNVWEW +DW++ +H P +P GP TG
Sbjct: 208 PGRHTGSPG---TAPVRSYQPNGYGLYDMAGNVWEWCSDWFSPAYHRDGPRADPAGPPTG 264
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
T +V +GGSYLC+E YC R+R AAR+QNTPDSS GN GFR A
Sbjct: 265 TARVIRGGSYLCHESYCNRYRVAARNQNTPDSSTGNTGFRVA 306
>gi|325003656|ref|ZP_08124768.1| non-specific serine/threonine protein kinase [Pseudonocardia sp.
P1]
Length = 303
Score = 244 bits (624), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 139/331 (41%), Positives = 177/331 (53%), Gaps = 65/331 (19%)
Query: 13 KDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ V +PG +F MGT+ DGE P+R V + + +D V+N +F EFV+AT YVT
Sbjct: 26 ETAVEVPGGSFLMGTDSAEGYGTDGEGPARRVEVGPYLIDTVAVTNARFAEFVAATSYVT 85
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
E E+ G +FVF L R
Sbjct: 86 ETERIGWSFVFVGFLPAALR---------------------------------------- 105
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
R + P + W G G +W PEG ST+ R +HPVVH+S DA AY
Sbjct: 106 --RTSPRPRQTPWWAGVTGA---------TWSAPEGPGSTVGDRADHPVVHLSHADARAY 154
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLP+EAEWEY RGGLE +PWG+ LTP GEHR N+WQG+FP NTA DGY +
Sbjct: 155 AAWVGGRLPSEAEWEYAARGGLEGARYPWGDELTPAGEHRCNIWQGDFPVRNTAEDGYRA 214
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
TAPV ++ N +GL+N GNVWEW AD + GP +G +V +GGSYLC+
Sbjct: 215 TAPVTAFAPNGYGLHNTAGNVWEWCADHF-------------GPRSGDARVMRGGSYLCH 261
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
E YC R+R AAR+ NTPDS+AG+LG R A D
Sbjct: 262 ESYCNRYRVAARTANTPDSAAGHLGLRVAWD 292
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/65 (55%), Positives = 48/65 (73%)
Query: 330 SSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 389
++AGN+ CA GP +G +V +GGSYLC+E YC R+R AAR+ NTPDS+AG+LG R
Sbjct: 230 NTAGNVWEWCADHFGPRSGDARVMRGGSYLCHESYCNRYRVAARTANTPDSAAGHLGLRV 289
Query: 390 AADVS 394
A DV+
Sbjct: 290 AWDVA 294
>gi|359777167|ref|ZP_09280457.1| putative sulfatase-modifying factor [Arthrobacter globiformis NBRC
12137]
gi|359305499|dbj|GAB14286.1| putative sulfatase-modifying factor [Arthrobacter globiformis NBRC
12137]
Length = 329
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 142/332 (42%), Positives = 179/332 (53%), Gaps = 54/332 (16%)
Query: 18 LPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
+P TF MG DGE P VTL F +D V+N QF FV +TGY TEAE F
Sbjct: 37 IPAGTFLMGDAFDEGYPADGETPVHQVTLRGFRIDTTAVTNQQFAAFVDSTGYRTEAELF 96
Query: 77 GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
G + VF LL++ ++ + V A
Sbjct: 97 GSSAVFH-LLAQADKKDVLGV-------------------------------------AA 118
Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
P W RG W HP G S +HPVVHVS NDA+AYC W G
Sbjct: 119 GTP----WWLNVRGA---------DWAHPAGPKSGWSDSSDHPVVHVSHNDALAYCRWAG 165
Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPR-GEHRANVWQGEFPTNNTAADGYLSTAPV 255
RLPTEAEWEY RGGL + +PWGN L G+H N+WQG FP NT DG+L TAPV
Sbjct: 166 RRLPTEAEWEYAGRGGLAGQRYPWGNELLGDGGKHLCNIWQGTFPRANTRDDGHLGTAPV 225
Query: 256 MSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
S+ N +GLY + GNVWEW +DW+ ++ +P+ +P+GPT G+ +V +GGSYLC++ Y
Sbjct: 226 KSFPPNGYGLYEVGGNVWEWCSDWFLPKYYRNSPASSPQGPTIGSGRVMRGGSYLCHDSY 285
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
C R+R AARS NTP+SS+GN GFR A + PT
Sbjct: 286 CNRYRVAARSSNTPESSSGNTGFRTVALESPT 317
>gi|313227917|emb|CBY23066.1| unnamed protein product [Oikopleura dioica]
Length = 318
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 186/338 (55%), Gaps = 56/338 (16%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVS 65
P ER+ VL+P F GT+ P+L DGE P+R +T D FY+D +EV+N++F FV
Sbjct: 19 PKERWNKEVLIPKGDFVFGTDSPVLEDDGEGPAREARITRD-FYMDMYEVTNSEFWIFVQ 77
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
T Y TEAEKFG+++VF+ L +E LDS
Sbjct: 78 KTSYKTEAEKFGNSYVFDLCLPKEV----------------LDS---------------- 105
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
++ A W G W HPEG S++ R HPVVHVS
Sbjct: 106 ---------ADQMVQAAPWWVMINGA---------DWAHPEGPTSSLRGRWTHPVVHVSH 147
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGE-HRANVWQGEFPTNNT 244
DA YC W G RLPTEAEWE RG L ++ FPWG++L P RAN WQGEFP +T
Sbjct: 148 GDAERYCKWAGKRLPTEAEWERAARGKLVSKDFPWGDDLLPSDSVFRANYWQGEFPDEDT 207
Query: 245 AADGY-LSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKV 302
ADG+ S APV + +N+FGL+NM+GNVWEWT D + +H P +P G GT++V
Sbjct: 208 GADGFTYSPAPVDALGAQNEFGLFNMIGNVWEWTGDKFARYHTHEPRDDPAGSPRGTERV 267
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KKGGS+L + ++ YR R AAR N+PDSS ++GFRC
Sbjct: 268 KKGGSFLSDTEHSYRIRNAARHHNSPDSSTNDMGFRCV 305
>gi|377557773|ref|ZP_09787408.1| hypothetical protein GOOTI_012_00320 [Gordonia otitidis NBRC
100426]
gi|377525082|dbj|GAB32573.1| hypothetical protein GOOTI_012_00320 [Gordonia otitidis NBRC
100426]
Length = 262
Score = 241 bits (616), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 172/317 (54%), Gaps = 57/317 (17%)
Query: 25 MGTNKPIL-IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE 83
MG + P DGE P R+ T+DAF + H V+N +F EFV+ + YVT+AE+FG +FVF
Sbjct: 1 MGDDGPSSHAADGEGPIRSTTVDAFAISAHTVTNAEFDEFVTNSEYVTDAERFGWSFVFA 60
Query: 84 PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAE 143
L + +R R EG W AV TWR
Sbjct: 61 GFLP-------AALRRASPRPEG---------------TPWWCAVTDATWRA-------- 90
Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
PEG S + R +HPVVHVSWNDA A+C W G RLPTEA
Sbjct: 91 ---------------------PEGPGSDVSERADHPVVHVSWNDARAFCRWAGVRLPTEA 129
Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
EWEY RGGL +PWG+ L R N+W+G FP NTAADGY STAPV +++ N
Sbjct: 130 EWEYAARGGLSRARYPWGDELDEDESFRCNIWRGSFPARNTAADGYRSTAPVEAFEPNGL 189
Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 323
GLYNMVGNVWEWTAD W V H P P+ + +GGS++C++ YC R+R +AR
Sbjct: 190 GLYNMVGNVWEWTADPWRVDHSGTPHEAPE-----SRYAMRGGSHMCHDSYCNRYRVSAR 244
Query: 324 SQNTPDSSAGNLGFRCA 340
+ N+ D S+GN+GFR A
Sbjct: 245 TSNSADGSSGNIGFRIA 261
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 30/37 (81%)
Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
+GGS++C++ YC R+R +AR+ N+ D S+GN+GFR A
Sbjct: 225 RGGSHMCHDSYCNRYRVSARTSNSADGSSGNIGFRIA 261
>gi|391330173|ref|XP_003739538.1| PREDICTED: sulfatase-modifying factor 1-like [Metaseiulus
occidentalis]
Length = 345
Score = 241 bits (615), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 136/333 (40%), Positives = 178/333 (53%), Gaps = 60/333 (18%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
++ MVL+ G F MGTN+P+++ DGE P+R + +F+LD EVSN F++FV TG++T
Sbjct: 64 HEKMVLVKGGEFIMGTNRPVILADGEHPARVKNISSFWLDATEVSNEAFEKFVHETGHLT 123
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+AE GD++V ++ E +A V+
Sbjct: 124 DAETIGDSYVLGSTIT-ESKAIQGAVQ--------------------------------- 149
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
A W + G SW +PEG S I RM HPVVHVSW DA Y
Sbjct: 150 ---------AAPWWFSVFGA---------SWRNPEGPGSNIIKRMKHPVVHVSWRDAERY 191
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR-ANVWQGEFPTNNTAADGYL 250
C G RLP EAEWE C+ L + FPW + +T R AN+WQG+FP NT DGY
Sbjct: 192 CEHYGKRLPNEAEWEMACKRSLGGKRFPWNDEVTDAENQRSANIWQGQFPNTNTKKDGYE 251
Query: 251 STAPVMSYKENK-FGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
TAPV +Y L NM+GNVWEWTAD W H P T +VKKGGS++
Sbjct: 252 GTAPVDTYDPYPGLPLKNMIGNVWEWTADAWTTDHGTKPDGEE------TRRVKKGGSFM 305
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+ YC+R+RC ARS+NT D+S+ N+GFRCAAD
Sbjct: 306 CHASYCFRYRCEARSENTLDTSSVNVGFRCAAD 338
Score = 67.8 bits (164), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 28/45 (62%), Positives = 38/45 (84%)
Query: 349 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
T +VKKGGS++C+ YC+R+RC ARS+NT D+S+ N+GFRCAAD
Sbjct: 295 TRRVKKGGSFMCHASYCFRYRCEARSENTLDTSSVNVGFRCAADA 339
>gi|315645055|ref|ZP_07898181.1| Non-specific serine/threonine protein kinase [Paenibacillus vortex
V453]
gi|315279476|gb|EFU42781.1| Non-specific serine/threonine protein kinase [Paenibacillus vortex
V453]
Length = 325
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 110/191 (57%), Positives = 136/191 (71%), Gaps = 1/191 (0%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
R P W HPEG S++ HRM+HPVVHVSWNDA AYC W G RLPTEAEWEY RGGL
Sbjct: 132 RAVPGADWAHPEGPGSSLVHRMDHPVVHVSWNDADAYCKWAGVRLPTEAEWEYAARGGLV 191
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
+ +PWG+ L P GE R N+WQG FP N A+DGY TAPV +Y+ N +GLY M GNVWE
Sbjct: 192 QKRYPWGDELKPAGEFRCNIWQGRFPVKNHASDGYPGTAPVGAYRPNGYGLYQMSGNVWE 251
Query: 275 WTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
W +DW++ + + NPKGP G ++V +GGSYLC+ YC R+R AAR++NTPDSS+
Sbjct: 252 WCSDWYDRSYDVTGHTDNPKGPLQGQERVMRGGSYLCHRSYCNRYRVAARNKNTPDSSSS 311
Query: 334 NLGFRCAADKG 344
++GFRC D G
Sbjct: 312 HVGFRCVKDMG 322
>gi|325106308|ref|YP_004275962.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324975156|gb|ADY54140.1| protein of unknown function DUF323 [Pedobacter saltans DSM 12145]
Length = 365
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 186/338 (55%), Gaps = 47/338 (13%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G T+ MG + +D E+P VT+ F++D+HEV+N QF FV ATGY+T AE
Sbjct: 59 MVYISGGTYMMGADNDQASED-EYPKHKVTVSPFWMDEHEVTNAQFAAFVEATGYITTAE 117
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D EE + ++ + + +++ + V ++ Y W
Sbjct: 118 RKPDW--------EEIKKQLPPGTPKPDDSKLVPASLVFTPPNQAVDLN-----NYAQW- 163
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
WE+ P +W HP+G DS I+ + N+PVV VSW+DA+AYC W
Sbjct: 164 ---------WEW----------VPGANWKHPDGPDSDIKGKENYPVVQVSWDDAMAYCKW 204
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGL N ++PWGN +G+ +AN W+G FP NTA DG+ AP
Sbjct: 205 AGKDLPTEAEWEFAARGGLINNIYPWGNEHISKGKAKANSWEGHFPNINTAKDGFAGLAP 264
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-----NPKGPTTGTD--------K 301
V SY N +GLY+M GNVWEW +DW+ ++ + NPKGP+ D +
Sbjct: 265 VKSYAPNGYGLYDMAGNVWEWCSDWYRNDYYQTSTQANGINNPKGPSDSHDPDEPYIAKR 324
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
V +GGS++CN+ YC +R A R + +PDS N+GFRC
Sbjct: 325 VSRGGSFMCNDSYCSGYRVARRMKTSPDSGTSNMGFRC 362
>gi|319948018|ref|ZP_08022194.1| hypothetical protein ES5_01774 [Dietzia cinnamea P4]
gi|319438307|gb|EFV93251.1| hypothetical protein ES5_01774 [Dietzia cinnamea P4]
Length = 336
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 145/357 (40%), Positives = 182/357 (50%), Gaps = 77/357 (21%)
Query: 9 VERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
+R + V++P TF MG + DGE P V++ AF +D+ V+N QF F T
Sbjct: 33 ADRPRRQVVVPAGTFLMGDHFDEGYPDDGELPVHEVSVRAFRMDECAVTNAQFATFCKDT 92
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
GYVTEAE+ G + VF L E + + LD+T
Sbjct: 93 GYVTEAERVGVSPVFH-LAVEADPGDV---------LHRLDTT----------------- 125
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
W RG W HP G S+I NHPVVHV+WND
Sbjct: 126 ---------------PWWVAVRGA---------DWRHPGGRRSSITGLQNHPVVHVTWND 161
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AYC W GARL TEAEWEY RGGL FPWG+ LTPRG NVWQG FPT+NT D
Sbjct: 162 ATAYCDWAGARLSTEAEWEYAARGGLAGARFPWGDELTPRGRWNLNVWQGRFPTHNTLDD 221
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------------- 285
G+L+TAPV +Y+ N +GL+ MVGNVWEW AD ++ ++
Sbjct: 222 GHLTTAPVRAYRPNGYGLHQMVGNVWEWCADVFDPGYYAVSPREDPRGPLPGGGPDPDRA 281
Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
A + +P G T +V +GGSYLC++ YCYR+R AARS NT S+ NLGFRC D
Sbjct: 282 AATAADPDGDTW---RVMRGGSYLCHDSYCYRYRVAARSANTAGSATANLGFRCVRD 335
>gi|257067789|ref|YP_003154044.1| hypothetical protein Bfae_05880 [Brachybacterium faecium DSM 4810]
gi|256558607|gb|ACU84454.1| uncharacterized conserved protein [Brachybacterium faecium DSM
4810]
Length = 303
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 142/346 (41%), Positives = 173/346 (50%), Gaps = 63/346 (18%)
Query: 1 MVLLPAPPVE---RYKDMVLLPGDTFRMGTNKPIL-IKDGEFPSRNVTLDAFYLDQHEVS 56
+ L AP + R + VL+ FRMGT P DGE P R V + AF +D V+
Sbjct: 16 LAALAAPTAQVRARAERTVLVEAGPFRMGTEDPDRNPGDGESPVRTVEVPAFRIDATCVT 75
Query: 57 NTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
+F FV+ TGYVTEAE+FG +FVF LL +E R
Sbjct: 76 TAEFAAFVAETGYVTEAEEFGWSFVFGALLEKELR------------------------- 110
Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
R +R P W G W HPEG ST+ R
Sbjct: 111 -----------------RASRKPPGTPWWRAVEGA---------RWDHPEGPGSTLAGRE 144
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
+HPVVHVS DA A+ W G RLPTEAEWE RGGL+ + WG LTP GEHR N+WQ
Sbjct: 145 DHPVVHVSLRDAEAFARWCGMRLPTEAEWEKAARGGLDQARYAWGEELTPGGEHRCNIWQ 204
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT 296
G FP NT DG+L TAPV S+ N FGLY + GNVWEW +D W
Sbjct: 205 GAFPVRNTLEDGHLGTAPVRSFPPNGFGLYEVAGNVWEWCSDRWTT--------GSAASL 256
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
TG +V +GGSYLC++ YC R+R AARS + ++GN GFR AD
Sbjct: 257 TGDVRVMRGGSYLCHDSYCNRYRVAARSSTAAEDASGNKGFRLVAD 302
>gi|291297647|ref|YP_003508925.1| Non-specific serine/threonine protein kinase [Stackebrandtia
nassauensis DSM 44728]
gi|290566867|gb|ADD39832.1| Non-specific serine/threonine protein kinase [Stackebrandtia
nassauensis DSM 44728]
Length = 319
Score = 238 bits (606), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 137/333 (41%), Positives = 178/333 (53%), Gaps = 52/333 (15%)
Query: 8 PVERYKDMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
P + + MV + F MG + DGE P R V L F++D V+N ++ FV
Sbjct: 36 PAKVRRGMVRVDVGEFDMGGEDVDANPTDGEGPPRPVRLSPFHIDTVCVTNVEYARFVRE 95
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TG+VT+AE+ G ++VF L++ R + +D+++
Sbjct: 96 TGFVTDAEREGWSYVFHLLVAPTARGHV------------MDASVP-------------- 129
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
A W RG SW P G S++ NHPV HVSW
Sbjct: 130 --------------GAPWWLAVRGA---------SWRAPHGPGSSMSSVQNHPVTHVSWR 166
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA AY W G RLPTEA+WE RGGL +PWG+ L RG HR N+WQG FP +NTA
Sbjct: 167 DAAAYAAWAGKRLPTEAQWEAAARGGLSGARYPWGDELLVRGAHRCNIWQGSFPRHNTAE 226
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
DG+L TAPV +++ N +GLY M GNVWEW ADWW +P GP TGT KV +GG
Sbjct: 227 DGHLGTAPVKTFRPNGYGLYQMAGNVWEWCADWWTTARD--GDSDPTGPATGTSKVIRGG 284
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
SYLC++ YC R+R AAR+ NTPDSS G++GFRC
Sbjct: 285 SYLCHDSYCNRYRVAARTHNTPDSSTGHMGFRC 317
>gi|409196281|ref|ZP_11224944.1| hypothetical protein MsalJ2_04508 [Marinilabilia salmonicolor JCM
21150]
Length = 337
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 139/345 (40%), Positives = 188/345 (54%), Gaps = 48/345 (13%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ + MVL+PG F MG + K EFP V +D+F +D+H V+N QF+EFV++TGYV
Sbjct: 27 QIEKMVLIPGGVFDMGAREKQFSKPDEFPVHKVKVDSFLMDKHPVTNAQFREFVASTGYV 86
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
T AEK D + L V L +++ H P +S N+ ++
Sbjct: 87 TTAEKAPDWKELKKQLPPGTPKPPDSVL--------LPASL--VFHSPETRVSLNNHQSW 136
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W P W HP G STIE + N+PVVH+SW DA A
Sbjct: 137 WGW-----------------------VPGADWKHPLGPGSTIEGKDNYPVVHISWYDAQA 173
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G RLPTEAEWEY RGG N ++PWGN L E +AN WQG FP +N DG++
Sbjct: 174 YARWAGKRLPTEAEWEYAARGGHNNFVYPWGNELI--TEEKANFWQGNFPYSNKGDDGFI 231
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD----- 300
S++PV S+ N +GLY+M GNVWEWTADW++ ++ + NP+GP+ D
Sbjct: 232 SSSPVASFSSNGYGLYDMAGNVWEWTADWYHTAYYEVLGRKGIADNPQGPSVSLDPMEPG 291
Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
K +GGS+LCN+ YC +R +AR ++TPDS +LGFRC D
Sbjct: 292 VAKKSIRGGSFLCNDSYCAGYRASARMKSTPDSGMMHLGFRCVKD 336
>gi|290561052|gb|ADD37928.1| Sulfatase-modifying factor 1 [Lepeophtheirus salmonis]
Length = 314
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 139/332 (41%), Positives = 171/332 (51%), Gaps = 66/332 (19%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV + G F GT KP+ DGE PS + FY+D +E VT A
Sbjct: 46 NMVHITGGYFNKGTKKPVFPADGEGPSERTFVTDFYMDAYE---------------VTNA 90
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E F V ++E E S V M+ F L + ++ V W V W
Sbjct: 91 E-FESFVVANDYVTEAEIFGQSFV---MEYF--LSPEVRSKVALGVQLAPWWLQVDRADW 144
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+ HPEG DS+I++RM+HPV+HVSWNDAV YC
Sbjct: 145 K-----------------------------HPEGPDSSIQNRMDHPVLHVSWNDAVKYCE 175
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWEY CR G E+RLFPWGN P + AN+W GEFPT N ADGY T
Sbjct: 176 WSSKRLPTEAEWEYACRAGKEDRLFPWGNKWNPNDKIWANIWTGEFPTKNDEADGYAGTN 235
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV + N +GLY+M+GN WEWT D G VKKGGS++C++
Sbjct: 236 PVGKFPPNAYGLYDMIGNAWEWTGD----------------SFRGEQMVKKGGSFMCHKN 279
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+CYR+RCAARSQNT DSSA NLGFRCA+ P
Sbjct: 280 HCYRYRCAARSQNTKDSSAHNLGFRCASHTKP 311
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 31/44 (70%), Positives = 38/44 (86%)
Query: 348 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
G VKKGGS++C++ +CYR+RCAARSQNT DSSA NLGFRCA+
Sbjct: 264 GEQMVKKGGSFMCHKNHCYRYRCAARSQNTKDSSAHNLGFRCAS 307
>gi|403069843|ref|ZP_10911175.1| hypothetical protein ONdio_09646 [Oceanobacillus sp. Ndiop]
Length = 185
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 111/181 (61%), Positives = 131/181 (72%), Gaps = 2/181 (1%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W PEG STIE R++HPV+HVSWNDA+A+C W G RLPTEAEWEY RGGLE + +PWG
Sbjct: 6 WFQPEGPGSTIESRLDHPVIHVSWNDALAFCQWAGKRLPTEAEWEYAARGGLEQKRYPWG 65
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
+ LTP GEH N+WQG+FP NT DGYL TAP S+ N FGLYN+ GNVWEW ADW+
Sbjct: 66 DELTPNGEHHCNIWQGKFPQENTLDDGYLGTAPAQSFPPNGFGLYNVSGNVWEWCADWFT 125
Query: 282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+HP + T KV +GGSYLC++ YC R+R AARS NT DSS GN+GFRCAA
Sbjct: 126 --NHPDRKIDELRVNNQTPKVMRGGSYLCHKSYCNRYRVAARSSNTIDSSTGNMGFRCAA 183
Query: 342 D 342
D
Sbjct: 184 D 184
>gi|381397685|ref|ZP_09923095.1| Sulphatase-modifying factor protein [Microbacterium laevaniformans
OR221]
gi|380775253|gb|EIC08547.1| Sulphatase-modifying factor protein [Microbacterium laevaniformans
OR221]
Length = 304
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 138/328 (42%), Positives = 176/328 (53%), Gaps = 53/328 (16%)
Query: 18 LPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
+ TF MG + DGE P V L AF +D V+ F FV TGY TEAE F
Sbjct: 1 MSAGTFVMGDASGERNRLDGEVPRHRVDLAAFEIDATTVTVAAFARFVEDTGYRTEAEVF 60
Query: 77 GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
G + VF L++ + + P+ W AV RGA
Sbjct: 61 GSSAVFHSLVAAADADVSA----------------------PLGGTPWWRAV-----RGA 93
Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
W HP G S+++ +HPVVHVSWNDA AYC W G
Sbjct: 94 ------------------------DWSHPGGPLSSVDGLGDHPVVHVSWNDAQAYCAWAG 129
Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
RLPTEAEWEY RGGL+ +PWG+ G RAN+WQGEFP+ NTA DG+++TAPV
Sbjct: 130 RRLPTEAEWEYAARGGLDGATYPWGDAAVDDGGWRANIWQGEFPSVNTAEDGWVATAPVR 189
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYC 315
S+ N +GL+ VGNVWEW ADW++ + +P +P+GP G+ +V +GGSYLC+ YC
Sbjct: 190 SFAPNGYGLWQPVGNVWEWCADWFSADTYRRSPREDPRGPEQGSARVLRGGSYLCHASYC 249
Query: 316 YRHRCAARSQNTPDSSAGNLGFRCAADK 343
R+R AARS NTPDSS GN GFR + +
Sbjct: 250 NRYRNAARSSNTPDSSMGNAGFRTVSAR 277
>gi|433456296|ref|ZP_20414348.1| formylglycine-generating enzyme [Arthrobacter crystallopoietes
BAB-32]
gi|432196498|gb|ELK52949.1| formylglycine-generating enzyme [Arthrobacter crystallopoietes
BAB-32]
Length = 279
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 166/312 (53%), Gaps = 53/312 (16%)
Query: 35 DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
DGE P V LD+F +D V+N F FV ATGY TE+E + + VF
Sbjct: 12 DGEGPVHKVVLDSFRIDAAAVTNEDFSRFVEATGYRTESEHYESSAVF------------ 59
Query: 95 SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
H+ A A A P W RG
Sbjct: 60 --------------------------HLQVQAAAADVLGEAAGAP----WWVNVRGA--- 86
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
W HP G S+ HPVV VSWNDA AYC W G RLPTEAEWEY RGGLE
Sbjct: 87 ------DWAHPTGPLSSWRDIPQHPVVQVSWNDAQAYCRWAGRRLPTEAEWEYAARGGLE 140
Query: 215 NRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVW 273
+ + WGN+L P GEH N+WQGEFP +NT DGYL TAPV ++ N FGLY + GNVW
Sbjct: 141 GKRYAWGNDLAGPAGEHYCNIWQGEFPRSNTLDDGYLGTAPVRTFPPNGFGLYEVSGNVW 200
Query: 274 EWTADWW-NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 332
EW DW+ ++ +PS+NP+GPT G +V +GGS+LC++ YC R+R AARS N P+S++
Sbjct: 201 EWCNDWFLPKYYRNSPSHNPQGPTIGRGRVMRGGSFLCHDSYCNRYRVAARSSNGPESAS 260
Query: 333 GNLGFRCAADKG 344
N GFR A G
Sbjct: 261 SNCGFRTVAAPG 272
>gi|383778954|ref|YP_005463520.1| hypothetical protein AMIS_37840 [Actinoplanes missouriensis 431]
gi|381372186|dbj|BAL89004.1| hypothetical protein AMIS_37840 [Actinoplanes missouriensis 431]
Length = 305
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 180/332 (54%), Gaps = 57/332 (17%)
Query: 14 DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ V +P TF MG + DGE P +V + AF +D V+ F+ FV+ATGY T+
Sbjct: 28 EQVSVPAQTFAMGDAHGDGAPGDGEQPVHDVRVPAFTIDATSVTVADFRSFVTATGYRTD 87
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE++G + VF L++ ++ + + +G
Sbjct: 88 AEQYGWSAVFHLALTDPDQV--------IGQMQGT------------------------- 114
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W G G W HP G +HP VHVSWNDA AYC
Sbjct: 115 ----------PWWLGVEGA---------DWAHPGGPSDVAAD--DHPAVHVSWNDAQAYC 153
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLP+EAEWE R GL +R +PWG+ L P H N+WQG FP NTAADG+++T
Sbjct: 154 AWAGRRLPSEAEWECAARAGLPSRRYPWGDEL-PAAGHALNIWQGSFPAVNTAADGFVTT 212
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
APV +++ N +GL+ VGNVWEW ADW++ ++ AP +P+GP GT +V +GGSYLC+
Sbjct: 213 APVRTFQPNGYGLWQPVGNVWEWVADWFSPAYYAAAPVEDPQGPPDGTARVIRGGSYLCH 272
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+ YC R+R AARS NTPDSS GN GFR A+ K
Sbjct: 273 DSYCNRYRNAARSSNTPDSSTGNTGFRTASRK 304
>gi|290462167|gb|ADD24131.1| Sulfatase-modifying factor 1 [Lepeophtheirus salmonis]
Length = 314
Score = 235 bits (600), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 171/338 (50%), Gaps = 78/338 (23%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV + G F GT KP+ DGE PS + FY+D +E VT A
Sbjct: 46 NMVHITGGYFIKGTKKPVFPADGEGPSERTFVTDFYMDAYE---------------VTNA 90
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FE + + +++ G +E+ +
Sbjct: 91 E-------FESFVVANDYVTEAEIF-------GQSFVMEYFLSP---------------- 120
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWND 187
E + L +L PW W HPEG DS+I++RM+HPV+HVSWND
Sbjct: 121 -----------EVRSKVALGVQLAPWWLQVDRADWKHPEGPDSSIQNRMDHPVLHVSWND 169
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
AV YC W RLPTEAEWEY CR G E+RLFPWGN P + AN+W GEFPT N AD
Sbjct: 170 AVKYCEWSSKRLPTEAEWEYACRAGKEDRLFPWGNKWNPNDKIWANIWTGEFPTKNDEAD 229
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
GY T PV + N +GLY+M+GN WEWT D G VKKGGS
Sbjct: 230 GYAGTNPVGKFPPNAYGLYDMIGNAWEWTGD----------------SFRGEQMVKKGGS 273
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
++C++ +CYR+RCAARSQNT DSSA NLGFRCA+ P
Sbjct: 274 FMCHKNHCYRYRCAARSQNTKDSSAHNLGFRCASHTKP 311
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 31/44 (70%), Positives = 38/44 (86%)
Query: 348 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
G VKKGGS++C++ +CYR+RCAARSQNT DSSA NLGFRCA+
Sbjct: 264 GEQMVKKGGSFMCHKNHCYRYRCAARSQNTKDSSAHNLGFRCAS 307
>gi|257868811|ref|ZP_05648464.1| conserved hypothetical protein [Enterococcus gallinarum EG2]
gi|357049691|ref|ZP_09110908.1| hypothetical protein HMPREF9478_00891 [Enterococcus saccharolyticus
30_1]
gi|257802975|gb|EEV31797.1| conserved hypothetical protein [Enterococcus gallinarum EG2]
gi|355382923|gb|EHG30014.1| hypothetical protein HMPREF9478_00891 [Enterococcus saccharolyticus
30_1]
Length = 284
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 133/333 (39%), Positives = 184/333 (55%), Gaps = 58/333 (17%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ +P T+R+GT+ K +D E P V + +F +D+ V+N F+ F++ATGYVTEA
Sbjct: 1 MIKIPAGTYRIGTDTKAGFEEDQEGPKIQVEMASFEIDETTVTNQMFEAFIAATGYVTEA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF LSE+ + + SQ+ + ++SW AVA W
Sbjct: 61 EQFGWSFVFHYFLSEKTKQR-SQL---------------------IPNMSWWYAVARADW 98
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R HPEG DS+IE RM+HPVV VS NDA+A+C
Sbjct: 99 R-----------------------------HPEGPDSSIEQRMDHPVVQVSRNDALAFCR 129
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWE +GG EN L+PWG + ++ N+WQG FP NT ADG+ STA
Sbjct: 130 WADKRLPTEAEWEIAAKGGTENDLYPWGTDFLAGEDYHCNIWQGAFPKANTKADGFASTA 189
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVK------KGGS 307
P Y+ N +GLY ++GNVWEW ++ + S++ +V +GGS
Sbjct: 190 PARFYEPNGYGLYQVIGNVWEWCSNPARIPLEQFRSWDSHFFEQRFQQVDDAMYATRGGS 249
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+LC+E YC R+R AAR+ N+ S+A NLGFRCA
Sbjct: 250 FLCHESYCKRYRIAARNGNSGMSAANNLGFRCA 282
>gi|386847436|ref|YP_006265449.1| hypothetical protein ACPL_2486 [Actinoplanes sp. SE50/110]
gi|359834940|gb|AEV83381.1| hypothetical protein ACPL_2486 [Actinoplanes sp. SE50/110]
Length = 309
Score = 234 bits (596), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 133/327 (40%), Positives = 178/327 (54%), Gaps = 56/327 (17%)
Query: 14 DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
D +PG TF MG + DGE P VT+ F +D V+ F+ FV+ATG+ T+
Sbjct: 28 DQARVPGQTFAMGDAHGDGHPADGEQPVHPVTVPGFTIDVTAVTVADFRAFVTATGFRTD 87
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE FG + VF +++ ER + RM
Sbjct: 88 AEHFGWSAVFHSAVADPER-------------------VAGRM----------------- 111
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W G G W HP G + +HPVVHVSWNDA AYC
Sbjct: 112 -------AGTPWWLGVGGA---------DWAHPGGPSTVALD--DHPVVHVSWNDAQAYC 153
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
TW G RLP+EAEWE RGGL+ +PWG++L + N+WQG+FP NTA DG+++T
Sbjct: 154 TWAGRRLPSEAEWECAARGGLDRLRYPWGDDLPAGDDWACNIWQGDFPVRNTAEDGWVTT 213
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCN 311
APV +Y N +GL+ +VGNVWEW ADW++ ++ +P +P+GP+ G +V +GGSYLC+
Sbjct: 214 APVRAYAPNAYGLHQVVGNVWEWCADWFSPRYYAVSPGDDPRGPSRGAARVIRGGSYLCH 273
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ YC R+R AARS NTPDSS GN+GFR
Sbjct: 274 DSYCNRYRNAARSSNTPDSSTGNMGFR 300
>gi|260430125|ref|ZP_05784100.1| sulfatase-modifying factor 1 [Citreicella sp. SE45]
gi|260419048|gb|EEX12303.1| sulfatase-modifying factor 1 [Citreicella sp. SE45]
Length = 332
Score = 234 bits (596), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 131/326 (40%), Positives = 164/326 (50%), Gaps = 53/326 (16%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+PG F MG K +D + P R V + F + + V+N F FV ATGY + AE+ G
Sbjct: 57 IPGGFFDMGARKSTFAEDLDSPRRKVFVSPFRISPYTVTNADFARFVEATGYRSVAEREG 116
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
++VF L
Sbjct: 117 WSYVFHLL---------------------------------------------------- 124
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
L T A+W G R W PEG + + R +HPV HV+W DA+AYC W G
Sbjct: 125 LETPADWPVSPPGLRWWRRVDGAFWAQPEGPGTDVADRGDHPVTHVAWYDALAYCRWAGV 184
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
RLP EAEWE RGGL + FPWGN L P GE N WQG FP N+A DGY+ TAPV S
Sbjct: 185 RLPREAEWERAGRGGLAHLKFPWGNTLVPGGEFAMNTWQGHFPDENSAEDGYVGTAPVGS 244
Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCY 316
Y N +GLYNM GNVWEW DW+ P +P+GP +GT KV++GGS+LC+ YC
Sbjct: 245 YAPNGYGLYNMTGNVWEWVQDWFGPRGAPKGVPRDPRGPESGTAKVQRGGSFLCHVSYCD 304
Query: 317 RHRCAARSQNTPDSSAGNLGFRCAAD 342
R+ +R++N PDSS N GFR AAD
Sbjct: 305 RYHVHSRTRNDPDSSTSNSGFRIAAD 330
>gi|302538003|ref|ZP_07290345.1| sulfatase-modifying factor 1 [Streptomyces sp. C]
gi|302446898|gb|EFL18714.1| sulfatase-modifying factor 1 [Streptomyces sp. C]
Length = 299
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 111/182 (60%), Positives = 126/182 (69%), Gaps = 13/182 (7%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
SW HPEG S R NHPVVHVSW+DA AYC W G RLPTEAEWEY RGGLE R +PW
Sbjct: 129 SWRHPEGPGSHATDRQNHPVVHVSWHDAQAYCAWSGTRLPTEAEWEYAARGGLEQRRYPW 188
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
G+ L P G H N+WQG+FP NT ADGY+STAPV S++ N FGLYN VGNVWEW ADW+
Sbjct: 189 GDELMPDGRHMCNIWQGDFPVLNTGADGYVSTAPVKSFRPNGFGLYNTVGNVWEWCADWF 248
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P T + +GGSYLC++ YC R+R AARS NTPDSS GN+GFR A
Sbjct: 249 T-------------PAEETSRAMRGGSYLCHDSYCNRYRVAARSSNTPDSSTGNIGFRVA 295
Query: 341 AD 342
AD
Sbjct: 296 AD 297
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/166 (44%), Positives = 95/166 (57%), Gaps = 11/166 (6%)
Query: 13 KDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ ++ LPG +F MGT+ P D E P R V ++ F + V+N QF F +TGY T
Sbjct: 31 RQLLALPGGSFLMGTDDPEGFPADAEGPIREVAVEPFRIAATAVTNAQFATFAKSTGYTT 90
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDM----------KRFEGLDSTIEHRMHHPVVH 121
EAE+FG +F F L++E A + V + EG S R +HPVVH
Sbjct: 91 EAERFGFSFTFGAFLADEVAAVSAPVAAVPWWRAVEGASWRHPEGPGSHATDRQNHPVVH 150
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
+SW+DA AYC W G RLPTEAEWEY RGGLE R +PWG L P+G
Sbjct: 151 VSWHDAQAYCAWSGTRLPTEAEWEYAARGGLEQRRYPWGDELMPDG 196
>gi|379057564|ref|ZP_09848090.1| hypothetical protein SproM1_05779 [Serinicoccus profundi MCCC
1A05965]
Length = 320
Score = 233 bits (593), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 137/331 (41%), Positives = 167/331 (50%), Gaps = 58/331 (17%)
Query: 15 MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
V +PG TF G + DGE P VT+ F LD V+N F FV+ TG EA
Sbjct: 36 QVDVPGGTFARGDAHGDGHPGDGETPVHEVTVSGFSLDATTVTNDDFARFVAETGQRIEA 95
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG + VF L+ + V
Sbjct: 96 EPFGFSAVFHLALAARPEEVLRPV------------------------------------ 119
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
A W G RG W HP G S + +HPVVHVSWNDA+AYC
Sbjct: 120 ------PGAPWWLGVRGA---------DWRHPGGSRSDLTGLGDHPVVHVSWNDALAYCA 164
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY RGGL + +PWG++L P G R N+WQG+FP N ADGYL+T
Sbjct: 165 WAGRRLPTEAEWEYAARGGLAHARYPWGDDLLDPSGAWRCNIWQGDFPRRNDLADGYLTT 224
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVKKGGS 307
AP S+ N +G MVGNVWEW ADW++ + + +P GP G +V +GGS
Sbjct: 225 APATSFTPNGYGAVQMVGNVWEWCADWFDPGTYAQVARAGRAVDPTGPGAGGSRVIRGGS 284
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
YLC++ YC R+R AARS NTPDSS GN GFR
Sbjct: 285 YLCHDSYCNRYRVAARSANTPDSSTGNTGFR 315
>gi|296121197|ref|YP_003628975.1| hypothetical protein Plim_0932 [Planctomyces limnophilus DSM 3776]
gi|296013537|gb|ADG66776.1| protein of unknown function DUF323 [Planctomyces limnophilus DSM
3776]
Length = 338
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 136/344 (39%), Positives = 181/344 (52%), Gaps = 47/344 (13%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG FRMG+N P L D E P+ V+++ F+LD+HEV+N +F EFV ATGYVT AE
Sbjct: 29 MVWIPGGEFRMGSNDP-LAWDDERPTHPVSVEGFWLDEHEVTNREFAEFVKATGYVTTAE 87
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K E +L+ + + + + +H + ND + W
Sbjct: 88 K---APTVEEILANSPPGTAAPDPAVLVPASLVFTPPDHP-------VPLNDFSQWWVWT 137
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P +W HPEG S I+ RM+HPVVHVSW+DA AY W
Sbjct: 138 -----------------------PGANWRHPEGPGSNIDERMDHPVVHVSWDDASAYAQW 174
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE RGG E F WG +AN+W G+FP NTA DG+ TAP
Sbjct: 175 AGKRLPTEAEWERAARGGKEGMPFVWGTTPPSETSPQANLWNGKFPYENTAKDGFTRTAP 234
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTD--------K 301
V S+ N +GLY+M GNVWEW DW++ + A + NP GP D +
Sbjct: 235 VKSFPANDYGLYDMSGNVWEWCHDWYDRELYSKRPQLAVTKNPSGPEKSHDPTQPFQQLR 294
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
++GGS+LCN+ YC R+R +AR +PD+ ++GFRCA P
Sbjct: 295 SQRGGSFLCNDSYCSRYRPSARHGCSPDTGMSHVGFRCAKSVVP 338
>gi|395531848|ref|XP_003767985.1| PREDICTED: sulfatase-modifying factor 2-like [Sarcophilus harrisii]
Length = 306
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 141/352 (40%), Positives = 188/352 (53%), Gaps = 70/352 (19%)
Query: 1 MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
+ L P P + MV LPG TFRMGTN P KDGE P+R VT+ +F +D++ V+N F
Sbjct: 19 LGLWPEPGSGQDDSMVQLPGGTFRMGTNSPD-GKDGEGPTREVTVKSFAIDKYPVTNRDF 77
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
+EFV Y TEAE FG +FVFE L+S+E + K++Q
Sbjct: 78 REFVRQKKYQTEAESFGWSFVFEDLVSDELKKKVTQ------------------------ 113
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
+ W LP E + W P G S I R++ PV
Sbjct: 114 ------KLESAPWW---LPVEKAF-----------------WRQPAGPGSGIRERLDFPV 147
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDA A+C WRG RLPTE EWE+ RGGL+ R++PWGN +R N+WQG+FP
Sbjct: 148 LHVSWNDARAFCAWRGKRLPTEEEWEFAARGGLKGRVYPWGNQFQ---LNRTNLWQGKFP 204
Query: 241 TNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
+TA DG+ +PV ++ +N +GLY+++GNVWEWTA +Y P P G
Sbjct: 205 QADTAEDGFHGASPVDAFPPQNNYGLYDLMGNVWEWTAS----------TYEP--PIPGQ 252
Query: 300 D-KVKKGGSYLCNEQYC--YRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
D KV +G S++ + +R R R NTPDS++ NLGFRCAA G G
Sbjct: 253 DKKVLRGASWIDTAEGSANHRARVTTRMGNTPDSASDNLGFRCAASPGRPLG 304
>gi|375143553|ref|YP_005005994.1| sulfatase-modifying factor protein [Niastella koreensis GR20-10]
gi|361057599|gb|AEV96590.1| Sulphatase-modifying factor protein [Niastella koreensis GR20-10]
Length = 382
Score = 231 bits (590), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 179/343 (52%), Gaps = 53/343 (15%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDG---EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
Y MV +PG ++MG KDG E+P V +D FY+D EV+N QF EFV ATG
Sbjct: 75 YAGMVKIPGGDYQMGAAD----KDGRPDEYPQHPVHVDGFYMDATEVTNAQFMEFVKATG 130
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
YVT AE+ E K K + L P ++ ND
Sbjct: 131 YVTTAER----------TPNWEELKKQLPSGTPKPADSLLVAASLVFTPPAKPVALNDVS 180
Query: 129 AYCTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ W +GA W HPEG S+I+ + N+PVV VSW+D
Sbjct: 181 QWWQWVKGA------------------------DWKHPEGPASSIKGKENYPVVQVSWDD 216
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A+AY W G +LPTEAEWE+ R GL N+ F WG+ +G+ +AN+WQG FP +NTA D
Sbjct: 217 AMAYAKWAGKQLPTEAEWEWAARAGLTNQPFTWGSEAVEKGKPKANIWQGHFPDHNTATD 276
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS---YNPKGPTTGTD---- 300
G+ APV S+ N +GLY++ GNVWEW ADW+ ++ + NP GP+ G D
Sbjct: 277 GFARVAPVKSFAPNAYGLYDIAGNVWEWCADWYRSDYYTQATKKAVNPPGPSVGYDPDEP 336
Query: 301 ----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC+ YC +R +AR + +PD+ + GFRC
Sbjct: 337 TVPKRVVRGGSFLCHASYCASYRVSARMKTSPDTGLEHTGFRC 379
>gi|357384299|ref|YP_004899023.1| sulfatase modifying factor 1 (C-alpha-formyglycine- generating
enzyme 1) [Pelagibacterium halotolerans B2]
gi|351592936|gb|AEQ51273.1| sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1) [Pelagibacterium halotolerans B2]
Length = 305
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 146/357 (40%), Positives = 185/357 (51%), Gaps = 69/357 (19%)
Query: 1 MVLLPAPPVERYK-----------DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFY 49
M PAP E+ + D + +PG +GTN P + DGE P R VTL F
Sbjct: 1 MADTPAPSAEKLRQAAPISSGTALDPIRVPGGRSFVGTNSPEIPVDGEGPERPVTLRDFA 60
Query: 50 LDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDS 109
L+ V+N +F +FV+ATGYVTEAE++G + VF H + E LD+
Sbjct: 61 LEATTVTNARFADFVAATGYVTEAERYGWSAVF----------------HSDR--ESLDA 102
Query: 110 TIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGID 169
T G++LP W + G SW HPEG D
Sbjct: 103 ---------------------ATRAGSQLP----WWHKVDGV---------SWRHPEGPD 128
Query: 170 STIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGE 229
S++E R +HPVV VSW DA A+ W G RLPTEAEWE+ RGG E R FPWG+ +
Sbjct: 129 SSLEGREDHPVVQVSWRDANAFAQWVGGRLPTEAEWEHAARGGTERRRFPWGDEEP--ND 186
Query: 230 HRA---NVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-H 285
H A N+WQG FP NTA DG++ TAP S++ N G Y+M GNVWEWT D + V
Sbjct: 187 HDAIFCNIWQGRFPDLNTARDGFVRTAPARSFESNPLGFYSMSGNVWEWTHDAFRVRSVS 246
Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
A ++KV KGGS+LC+ YCYR+R AAR TPDS+ N GFR A D
Sbjct: 247 RAAKQRNAYALQHSEKVLKGGSFLCHVSYCYRYRIAARMAMTPDSAGSNTGFRVAYD 303
>gi|256420881|ref|YP_003121534.1| Non-specific serine/threonine protein kinase [Chitinophaga pinensis
DSM 2588]
gi|256035789|gb|ACU59333.1| Non-specific serine/threonine protein kinase [Chitinophaga pinensis
DSM 2588]
Length = 349
Score = 231 bits (589), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 179/338 (52%), Gaps = 43/338 (12%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+PG TF MG + D E P +V +D+F++D+HEV+N +F FV ATGY+T AE
Sbjct: 46 MVLIPGGTFTMGADDAEAHPD-EKPRHDVRVDSFWMDEHEVTNAEFAAFVKATGYLTTAE 104
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ P+ EE ++ + L +I + P H D V+
Sbjct: 105 R--------PITKEELMQQLPPGSPEPDSSMLLPGSI---VFTPPSHPVPLDDVS----- 148
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+W +G SW HP G + I + + PVVH+SWNDA A+ W
Sbjct: 149 --------QWWSFIQGA---------SWQHPGGPTTNINGKEHLPVVHISWNDAQAFAKW 191
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY RGGL+++ +PWG G +AN W G FP NT DGY AP
Sbjct: 192 AGKRLPTEAEWEYAARGGLKDQPYPWGPEALTTGRVKANTWNGHFPYQNTQTDGYFGAAP 251
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGPTTGTDK--------VKKG 305
V S+ N +GLY+M GNVWEW ADW++ ++ NPKGP G D +G
Sbjct: 252 VKSFAANAYGLYDMSGNVWEWCADWYDSRYYGLKQGDNPKGPAAGYDPEDPASPKHTIRG 311
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
GS++C ++YC +R AR + +P+S NLGFRC DK
Sbjct: 312 GSFMCTDEYCSGYRVTARMKTSPESGLENLGFRCVKDK 349
>gi|403526439|ref|YP_006661326.1| formylglycine-generating enzyme [Arthrobacter sp. Rue61a]
gi|403228866|gb|AFR28288.1| putative formylglycine-generating enzyme [Arthrobacter sp. Rue61a]
Length = 337
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 142/348 (40%), Positives = 179/348 (51%), Gaps = 66/348 (18%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPI---LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
A P + KD +P TF MG P DGE P V++ AF + V+N +F E
Sbjct: 43 AVPSKEAKD-AEIPAGTFSMG--DPFNEGYDGDGESPVHEVSVSAFRIGATTVTNQEFAE 99
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV ATGY TE+E +G + VF + H + + V ++
Sbjct: 100 FVDATGYRTESETYGTSAVFHLAVKANT----------------------HDILNRVNNV 137
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
W + RGA W HP G S +HPV H
Sbjct: 138 PW-----WLNIRGA------------------------DWAHPAGPLSNWADIPDHPVTH 168
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL--TPRGE--HRANVWQGE 238
VS NDA+AYC W G RLPTEAEWEY RGGL + +PWGN L GE H N+WQGE
Sbjct: 169 VSHNDALAYCQWAGRRLPTEAEWEYAARGGLSGQRYPWGNELHNDAPGETAHNCNIWQGE 228
Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW-----WNVHHHPAPSYNPK 293
FP+ NT DGYL+TAP S++ N +GLY GNVWEW +DW + H +P+
Sbjct: 229 FPSRNTIDDGYLTTAPATSFRPNGYGLYQTSGNVWEWCSDWFLPKYYKTCHAQGTVEDPQ 288
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
GPT G +V +GGSYLC++ YC R+R AARS NTPDS++GNLGFR A
Sbjct: 289 GPTIGRGRVMRGGSYLCHDSYCNRYRLAARSSNTPDSASGNLGFRTVA 336
>gi|315504000|ref|YP_004082887.1| hypothetical protein ML5_3220 [Micromonospora sp. L5]
gi|315410619|gb|ADU08736.1| protein of unknown function DUF323 [Micromonospora sp. L5]
Length = 319
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 133/328 (40%), Positives = 171/328 (52%), Gaps = 59/328 (17%)
Query: 14 DMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
D++ LPG TF MGT+ P +D E P R V + F + V+N QF FV ATGY T+
Sbjct: 49 DLIALPGGTFLMGTDDPEGFPEDLEGPVRPVAVAPFEIAATPVTNRQFARFVDATGYRTD 108
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE++G +FVF LL + LD+++
Sbjct: 109 AERYGWSFVFHLLLVPSAHPYV------------LDASVP-------------------- 136
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
+A W G W P+G S + R HPVVHVS DA AYC
Sbjct: 137 --------QAPWWLAVEGAY---------WAAPDGPGSDVRGREQHPVVHVSHRDAEAYC 179
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RL TEAEWEY RGGL + WG+ LTP G HR N+WQG FP NTA DG+ T
Sbjct: 180 RWSGTRLATEAEWEYAARGGLSQARYAWGDELTPDGRHRCNIWQGAFPAVNTAEDGFAGT 239
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
+PV ++ N +GLY++ GNVWE TA+ W H +G + + V +GGSYLC++
Sbjct: 240 SPVTTFPPNGYGLYDVAGNVWELTAEPWRDH---------RGRISPGESVIRGGSYLCHD 290
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
YC R+R AARS+ T DSS+GN GFR A
Sbjct: 291 SYCNRYRVAARSRTTVDSSSGNTGFRVA 318
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
+G + + V +GGSYLC++ YC R+R AARS+ T DSS+GN GFR A
Sbjct: 271 RGRISPGESVIRGGSYLCHDSYCNRYRVAARSRTTVDSSSGNTGFRVA 318
>gi|442589738|ref|ZP_21008545.1| hypothetical protein D505_17990 [Elizabethkingia anophelis R26]
gi|442560626|gb|ELR77854.1| hypothetical protein D505_17990 [Elizabethkingia anophelis R26]
Length = 383
Score = 231 bits (588), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 135/355 (38%), Positives = 185/355 (52%), Gaps = 60/355 (16%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
PV+ MVL+P TF MG+N ++ E P V++ +FY+D HEV+N +F++FV AT
Sbjct: 65 PVDVTNKMVLIPAGTFMMGSNGDEWSRNWESPQHKVSVKSFYMDTHEVTNAEFEKFVIAT 124
Query: 68 GYVTEAEKFGDTFVFE-------PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
GY+T AEK D + P S+E+ S V + GLD
Sbjct: 125 GYITTAEKPVDWEALKKELPPDTPKPSDEDLMPGSMVFASPESVNGLDD----------- 173
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
+ W WR W G W HP G S I+ + HPV
Sbjct: 174 YSQW--------WR---------WVKGA------------DWQHPLGPGSNIKGKEKHPV 204
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
VHVS++DAVAY W G RLPTEAEWE+ RGGL+N+++PWG+ RG +AN W G FP
Sbjct: 205 VHVSYDDAVAYAKWIGKRLPTEAEWEWAARGGLDNKIYPWGDEHISRGSQKANYWTGTFP 264
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYNPKGP 295
NT DGY TAPV SY N +GLY+M GNVWE +DW++ +++ + + NP GP
Sbjct: 265 IKNTEKDGYYYTAPVGSYAPNAYGLYDMAGNVWEICSDWFSENYYQSLQGNQITNNPAGP 324
Query: 296 TT--------GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+ +V +GGS+LCN+ YC +R +AR + D+ + GFR D
Sbjct: 325 SKPYYPSEPFANKRVVRGGSFLCNDSYCASYRVSARMPYSEDTGMIHTGFRLVKD 379
>gi|384917530|ref|ZP_10017652.1| Non-specific serine/threonine protein kinase [Citreicella sp. 357]
gi|384468573|gb|EIE52996.1| Non-specific serine/threonine protein kinase [Citreicella sp. 357]
Length = 320
Score = 230 bits (586), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 131/330 (39%), Positives = 162/330 (49%), Gaps = 54/330 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
++ L G F MGT K +D + P R V L F + +N +F F TGY T AE
Sbjct: 40 LIALDGGFFDMGTRKSRFPEDFDAPRRKVRLSPFRISATACTNAEFARFARETGYRTVAE 99
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
G ++VF LL + R S P W C
Sbjct: 100 VEGWSYVFHLLLDDPARFPTS----------------------PAGLRWWRKVDGAC--- 134
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W PEG S I R HPVVHV+W DA+AYC W
Sbjct: 135 ---------------------------WHAPEGQGSDITGRAAHPVVHVAWFDALAYCHW 167
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LP+EA+WE+ RGGL R FPWGN +TP G N WQG FP NTA DG+ TAP
Sbjct: 168 AGLALPSEAQWEFAARGGLARRKFPWGNEMTPGGAFAMNTWQGSFPDRNTAEDGFTGTAP 227
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
V ++ N FGLYN GNVWEW D + P P +P+GP GT +V++GGSYLC+E
Sbjct: 228 VEAFAPNGFGLYNTCGNVWEWVRDLYAPRRPEGPFPLRDPQGPENGTGRVQRGGSYLCHE 287
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+ +R++N PDSS GN GFR AA+
Sbjct: 288 SYCDRYHVHSRTRNDPDSSTGNTGFRVAAE 317
>gi|126307458|ref|XP_001366051.1| PREDICTED: sulfatase-modifying factor 2 [Monodelphis domestica]
Length = 369
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 182/338 (53%), Gaps = 70/338 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG TF+MGTN P KDGE P R VT+ F +D++ V+N F+EFV Y TEAE
Sbjct: 96 MVQLPGGTFQMGTNSPD-GKDGEGPIREVTVKPFAIDKYPVTNKDFREFVRQKKYQTEAE 154
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E + K++Q + W
Sbjct: 155 SFGWSFVFEDFVSDELKKKVTQ------------------------------KLESAPWW 184
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R++ PV+HVSWNDA A+C W
Sbjct: 185 ---LPVEKAF-----------------WRQPAGPGSGIRDRLDFPVLHVSWNDAQAFCAW 224
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ R++PWGN P +R N+WQG+FP +TA DG+ +P
Sbjct: 225 RGKRLPTEEEWEFAARGGLKGRVYPWGNQFQP---NRTNLWQGKFPQADTAEDGFHGASP 281
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNE 312
V ++ +N +GLY+++GNVWEWTA +Y P P G D +V +G S++
Sbjct: 282 VDAFPPQNNYGLYDLMGNVWEWTAS----------TYGP--PVPGQDRRVLRGASWIDTA 329
Query: 313 QYC--YRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCAA G +G
Sbjct: 330 EGSANHRARVTTRMGNTPDSASDNLGFRCAASPGRPSG 367
>gi|443320196|ref|ZP_21049313.1| hypothetical protein GLO73106DRAFT_00035460 [Gloeocapsa sp. PCC
73106]
gi|442790078|gb|ELR99694.1| hypothetical protein GLO73106DRAFT_00035460 [Gloeocapsa sp. PCC
73106]
Length = 335
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 140/362 (38%), Positives = 193/362 (53%), Gaps = 64/362 (17%)
Query: 3 LLPAPPVERYKD------MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVS 56
L P P+ + D MV +P TFR+G +K ++ E + NVT+D F +D++EV+
Sbjct: 18 LDPMQPLVAFSDNFCSEEMVFIPRGTFRLGDDKSNFLE--EKVAENVTVDNFCIDRYEVT 75
Query: 57 NTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
N+ F FV ATGY+T AE+ F LL +E+R+ S V E
Sbjct: 76 NSDFATFVKATGYITVAERPLSKEQFSNLL-DEQRSPGSLV---------FQPPEEEATQ 125
Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
P ++SW W W G +WLHP G DS+IE +
Sbjct: 126 VP--YLSW--------WH---------WVLGA------------NWLHPYGPDSSIEGKD 154
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
N+PVVH+++ DAVAY W G LPTEA+WEY RGGL+N++F WG+ + + +AN WQ
Sbjct: 155 NYPVVHMAYEDAVAYAQWAGKSLPTEAQWEYAARGGLKNKIFSWGDEYSAQ---KANTWQ 211
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGP 295
G FP NT DGY+ T+PV S+ N +G+Y++ GNVWEWT+DW+++ H A NP GP
Sbjct: 212 GVFPFFNTKEDGYVGTSPVGSFPANNYGIYDITGNVWEWTSDWFSLGHDSQANKVNPTGP 271
Query: 296 TT-----------GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
T G V KGGSYLC YC R+R AAR PD+ ++GFR +
Sbjct: 272 TKAESFDPKKLGEGALHVVKGGSYLCARNYCSRYRPAARESQAPDTGTTHIGFRLVKNLA 331
Query: 345 PT 346
P+
Sbjct: 332 PS 333
>gi|149276174|ref|ZP_01882318.1| hypothetical protein PBAL39_00602 [Pedobacter sp. BAL39]
gi|149232694|gb|EDM38069.1| hypothetical protein PBAL39_00602 [Pedobacter sp. BAL39]
Length = 379
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 140/342 (40%), Positives = 182/342 (53%), Gaps = 45/342 (13%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+MV + G TF MG + +D E+P NVT+D F++D EV+N QF EFV ATGYVT
Sbjct: 72 KNMVWIDGGTFSMGADNDQAAQD-EYPKHNVTVDGFWIDVTEVTNKQFAEFVKATGYVTT 130
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D EE + ++ K E L P +S ND Y
Sbjct: 131 AERKPDW--------EELKKQVPP--GTPKPDESLLVPASLVFKAPGNEVSLND---YTQ 177
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W EW G +W P G DS + ++PVVHVSW DA AYC
Sbjct: 178 WW--------EWTKG------------ANWRKPHGADSEWSAKDDYPVVHVSWEDAQAYC 217
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWE RG L+N ++PWGN G+ +AN WQG FP NT D Y
Sbjct: 218 KWAGTRLPTEAEWELAARGRLKNNIYPWGNESVNIGKPKANSWQGTFPVKNTLEDKYYLA 277
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD--------K 301
APV S+ N +GLY+M GNVWEW AD ++ +++ + NPKGP + D
Sbjct: 278 APVKSFPPNGYGLYDMAGNVWEWCADLYHSNYYQEVKSGVKNPKGPGSSFDPDEPHTEKH 337
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
V +GGS+LCN+ YC +R A R +++ DSS ++GFRC DK
Sbjct: 338 VLRGGSFLCNDSYCSGYRVARRMKSSGDSSMEHVGFRCVRDK 379
>gi|407644341|ref|YP_006808100.1| hypothetical protein O3I_015825 [Nocardia brasiliensis ATCC 700358]
gi|407307225|gb|AFU01126.1| hypothetical protein O3I_015825 [Nocardia brasiliensis ATCC 700358]
Length = 294
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 137/328 (41%), Positives = 171/328 (52%), Gaps = 58/328 (17%)
Query: 18 LPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
+PG F MG ++ DGE P V L F +D V+N F FV TGYV++AE+F
Sbjct: 13 VPGGIFEMGDAHRDGFHADGETPVHTVALSPFTIDTTTVTNAAFATFVEETGYVSDAERF 72
Query: 77 GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
G + PV H+
Sbjct: 73 GCS--------------------------------------PVFHLVLAAEPGDLLGSDP 94
Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
R P W RG W HPEG S+I R HPVV VSW DAVAYC W G
Sbjct: 95 RAP----WWIDVRGA---------DWRHPEGRHSSIADRAEHPVVQVSWFDAVAYCEWAG 141
Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
RLPTEAEWE+ RGGL +PWG+ GE RAN+WQG FPT+NT DGYL+TAPV
Sbjct: 142 RRLPTEAEWEFAARGGLPGARYPWGDEAPGAGGEWRANIWQGTFPTDNTQEDGYLTTAPV 201
Query: 256 MSYKENKFGLYNMVGNVWEWTADWW--NVHHHPAPS---YNPKGPTTGTDKVKKGGSYLC 310
SY+ N +GL+ M GNVWEW +D + N + H A + +P+G + G +V +GGSYLC
Sbjct: 202 RSYRPNGYGLWQMAGNVWEWCSDRFSPNTYGHDARTGHVRDPQGASAGQSRVLRGGSYLC 261
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
++ YC R+R AARS NTPD++ GN GFR
Sbjct: 262 HDSYCSRYRNAARSSNTPDATMGNAGFR 289
>gi|84501935|ref|ZP_01000093.1| hypothetical protein OB2597_17647 [Oceanicola batsensis HTCC2597]
gi|84389930|gb|EAQ02564.1| hypothetical protein OB2597_17647 [Oceanicola batsensis HTCC2597]
Length = 295
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 127/328 (38%), Positives = 162/328 (49%), Gaps = 54/328 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
++ + G F MG + +D + P R V + F + SN F FV TGY T AE
Sbjct: 16 LLSIKGGFFDMGARQSRFAQDYDSPRRKVFVSPFRISATACSNADFAHFVEETGYRTVAE 75
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
G ++VF L
Sbjct: 76 VEGWSYVFHML------------------------------------------------- 86
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP A W G R W PEG DS + R +HPVVHV+W DA+AYCTW
Sbjct: 87 ---LPDLAAWPISPPGLEWWRKVDLACWSAPEGPDSDLSERADHPVVHVAWYDALAYCTW 143
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEA+WE+ RGGL FPWGN P G H N WQG FP N+A DG++ TAP
Sbjct: 144 AGLILPTEAQWEFAARGGLSRARFPWGNATMPGGHHAMNTWQGAFPATNSAEDGFVGTAP 203
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH--HHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
V +Y N FGL+N GNVWEW +D++ H P P +PKGP G ++++GGSYLC++
Sbjct: 204 VTAYAPNGFGLFNTCGNVWEWASDFYAPHPARGPFPLRDPKGPADGYARIQRGGSYLCHD 263
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
YC R+ +R++N PDSS GN GFR A
Sbjct: 264 SYCDRYHVHSRTRNDPDSSTGNAGFRVA 291
>gi|383765370|ref|YP_005444351.1| putative sulfatase-modifying factor [Phycisphaera mikurensis NBRC
102666]
gi|381385638|dbj|BAM02454.1| putative sulfatase-modifying factor [Phycisphaera mikurensis NBRC
102666]
Length = 349
Score = 229 bits (583), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 145/355 (40%), Positives = 180/355 (50%), Gaps = 75/355 (21%)
Query: 15 MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ LPG F MGT++P DGE P R VTL F LD V+ F FV+ TG+VTEA
Sbjct: 42 MITLPGGPFLMGTDEPDGFPADGEGPVREVTLSPFRLDATAVTVAAFARFVAETGFVTEA 101
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF+ LS +++ R V +W AV
Sbjct: 102 ERFGWSFVFQGHLSPRFVRRVTARRQ-------------------VPGAAWWVAV----- 137
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
GA W HPEG S + HR NHP +SW+DA AY
Sbjct: 138 DGA------------------------DWAHPEGPRSGVRHRQNHPATQISWHDAAAYAA 173
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ RGG + +PWG L PRG+HR NV+QG FP +TAADG+
Sbjct: 174 WAGKRLPTEAEWEFAARGGTSQQRYPWGPTLEPRGKHRCNVFQGRFPEEDTAADGFAGPC 233
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADW----WNVHHHPAPSYNPKGPT----------TGT 299
PV +Y N +G +N++GNVWEW ADW W+ A +P GP GT
Sbjct: 234 PVDAYAPNAYGFHNLLGNVWEWCADWFCPVWHAEARDATRTDPIGPVPHDDLDRGGRPGT 293
Query: 300 D------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
D +V+KGGSYLC+ YC R+R AR+ NTPDS N GFRCA D
Sbjct: 294 DGRAGVVHGGFTHRVQKGGSYLCHASYCNRYRLGARTGNTPDSGTTNSGFRCAMD 348
>gi|428777451|ref|YP_007169238.1| hypothetical protein PCC7418_2894 [Halothece sp. PCC 7418]
gi|428691730|gb|AFZ45024.1| protein of unknown function DUF323 [Halothece sp. PCC 7418]
Length = 328
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 185/339 (54%), Gaps = 58/339 (17%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+DMV + G +F+MG N+ + E + +V++ +F +D+HEV+N +F +FV ATGYVT
Sbjct: 31 EDMVFISGGSFQMGDNQSRFWE--EQAAEDVSVSSFCIDRHEVTNAEFAKFVEATGYVTV 88
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ F P LSEEERA S V FE + VAY +
Sbjct: 89 AERPLSQENF-PDLSEEERAPGSLV------FEPPPENSQQ--------------VAYLS 127
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W + P +W HP G DS I+ + N+PVVH+++ DA AY
Sbjct: 128 W----------WHWT----------PGANWRHPFGPDSDIQGKDNYPVVHIAYEDAQAYT 167
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEA+WE+ RGGL+N+ F WG+ + + +AN WQG FP NT D YL T
Sbjct: 168 EWAGKTLPTEAQWEFAARGGLKNKKFTWGDEYSAQ---KANTWQGIFPFFNTKEDHYLGT 224
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH------------HPAPSYNPKGPTTGTD 300
APV S+ N +GLY+M GNVWEWT+DW+ V H S++PK PT G
Sbjct: 225 APVESFPANGYGLYDMTGNVWEWTSDWYGVSHADKAHRLNPTGASEEESFDPKKPTDGAM 284
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
V KGGS+LC + YC R+R AAR PD+ ++GFR
Sbjct: 285 HVIKGGSHLCAKNYCSRYRPAARESQAPDTGTTHIGFRS 323
>gi|195095886|ref|XP_001997853.1| GH23459 [Drosophila grimshawi]
gi|193906138|gb|EDW05005.1| GH23459 [Drosophila grimshawi]
Length = 201
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 108/183 (59%), Positives = 131/183 (71%), Gaps = 12/183 (6%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
SW P GI+S ++ HPVVHVSW DAVAYCTW G RLP+E+EWE CRGG + +L+PW
Sbjct: 29 SWRKPNGINSNLQGLEQHPVVHVSWRDAVAYCTWAGKRLPSESEWEVACRGGKQRKLYPW 88
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN L P+ +H N+WQGEFP +T DGY T PV +++N + LYNMVGNVWEWTAD W
Sbjct: 89 GNKLMPKDQHWLNIWQGEFPDGDTQQDGYQFTCPVNEFRQNNYDLYNMVGNVWEWTADLW 148
Query: 281 NVHHHPAPSYNPKGPTTGT-DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
G T+ + +VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRC
Sbjct: 149 QA-----------GDTSESPARVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRC 197
Query: 340 AAD 342
A +
Sbjct: 198 AKN 200
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 34/43 (79%), Positives = 39/43 (90%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 159 RVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 201
>gi|311746857|ref|ZP_07720642.1| sulfatase-modifying factor 1 [Algoriphagus sp. PR1]
gi|311302580|gb|EFQ79230.1| sulfatase-modifying factor 1 [Algoriphagus sp. PR1]
Length = 363
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 133/361 (36%), Positives = 180/361 (49%), Gaps = 49/361 (13%)
Query: 1 MVLLPAP-PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
MV+ P P ++ + M +PG F MGTN+ E P+ ++ ++ F++D HEV+N +
Sbjct: 34 MVVSRTPVPTQKIEGMSWIPGGEFVMGTNE-AEAYPAEKPAVDLKVEGFWMDTHEVTNDE 92
Query: 60 FQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV 119
F FV AT YVT AE+ E E K K E + P
Sbjct: 93 FAAFVEATDYVTVAER----------KPEWEELKKQLPPGTPKPDESVLVPGSMVFSPPP 142
Query: 120 VHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHP 179
+ D + W +W HPEG DS IE R NHP
Sbjct: 143 YKVPLQDISLWWVWVNG-----------------------ANWKHPEGPDSNIEDRGNHP 179
Query: 180 VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEF 239
VVHV++ DA AY W G RLPTE EWE+ RGG++ + F WG+ LTP G++ AN +QG F
Sbjct: 180 VVHVAYEDAQAYAEWAGKRLPTEIEWEFAARGGVDGKRFAWGDELTPNGQYLANTFQGTF 239
Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA------------ 287
P N DG++ ++PV +Y N FGLY+M+GNVWE T+DW++ +
Sbjct: 240 PNENLGNDGFVGSSPVKAYAPNNFGLYDMIGNVWELTSDWYDALKYARLAGQAPKLDKDM 299
Query: 288 -PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
P YNP P ++V KGGS+LC E YC +R +AR DS N+GFRC D P
Sbjct: 300 NPCYNPDNP-FAMERVIKGGSFLCAENYCVNYRPSARQGQAFDSGTSNVGFRCVKDPEPK 358
Query: 347 T 347
T
Sbjct: 359 T 359
>gi|386395503|ref|ZP_10080281.1| hypothetical protein Bra1253DRAFT_00938 [Bradyrhizobium sp.
WSM1253]
gi|385736129|gb|EIG56325.1| hypothetical protein Bra1253DRAFT_00938 [Bradyrhizobium sp.
WSM1253]
Length = 337
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 137/346 (39%), Positives = 180/346 (52%), Gaps = 62/346 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV +PG TFRMG++ E PS V +D F++D+ V+N QF++FV+ATG+VTEA
Sbjct: 20 EMVFIPGGTFRMGSDHHY---PEEAPSHRVAVDGFWIDRTPVTNRQFKQFVNATGHVTEA 76
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ D + L+E A S V + R L W
Sbjct: 77 QVVPDPKDYPGALAEMLYAG-SLVFSPLPRITDL--------------TDW--------- 112
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
++W RG +W HP G S I+ +HPVVHVS++DA AY
Sbjct: 113 --------SQWWTFLRGA---------NWRHPYGPGSNIKDLDDHPVVHVSFSDAAAYAH 155
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGLE + WG+ L P G+H AN+WQG FP N DGY T+
Sbjct: 156 WAGKDLPTEAEWEFAARGGLEAEEYAWGDALVPGGKHMANIWQGNFPVQNLGEDGYERTS 215
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTD---------- 300
PVM++ N +GLY+M+GNVWEWT+DWW+ H P PK P G +
Sbjct: 216 PVMAFPPNGYGLYDMIGNVWEWTSDWWSSKHAADAAKPCCIPKNPRGGGEHTSYDPRLPD 275
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KV KGGS+LC YC R+R AAR D+S G++GFRC
Sbjct: 276 IQIPRKVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTGHVGFRCVV 321
>gi|255530292|ref|YP_003090664.1| Non-specific serine/threonine protein kinase [Pedobacter heparinus
DSM 2366]
gi|255343276|gb|ACU02602.1| Non-specific serine/threonine protein kinase [Pedobacter heparinus
DSM 2366]
Length = 367
Score = 228 bits (581), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 176/339 (51%), Gaps = 45/339 (13%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +P +F MG D E+P+ V +D F++D+ EV+N QF FV ATGYVT+AE
Sbjct: 63 MKFIPAGSFAMGAADNEGRPD-EYPAHEVVMDGFWIDETEVTNAQFARFVKATGYVTQAE 121
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D E K K + L P ++ ND + WR
Sbjct: 122 RKPDW----------EELKKQLPPGTPKPADELLQPASLTFSKPTKRVNLNDVSQW--WR 169
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W G SW HP+G STI + N+PV VSW DA AY W
Sbjct: 170 ---------WTVGA------------SWKHPQGPKSTIVGKDNYPVTQVSWIDASAYAKW 208
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY RGGL+ + +PWG+ +G+ +AN WQGEFP + DG+ STAP
Sbjct: 209 AGKRLPTEAEWEYAARGGLQRKKYPWGDESLDKGKVKANTWQGEFPYTDLKTDGFSSTAP 268
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD--------KVK 303
V S+ N +GLY+M GNVWEWTADW+ ++ NPKGP D K+
Sbjct: 269 VKSFAANGYGLYDMSGNVWEWTADWYTADYYRGLKGKVNNPKGPAESYDADEPTIPKKII 328
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+GGS++C+ YC +R +++ +++ D+ N GFRC A+
Sbjct: 329 RGGSFMCHSSYCKGYRVSSKMKSSMDTGLENTGFRCVAN 367
>gi|424876935|ref|ZP_18300594.1| hypothetical protein Rleg5DRAFT_1327 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393164538|gb|EJC64591.1| hypothetical protein Rleg5DRAFT_1327 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 317
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 180/337 (53%), Gaps = 55/337 (16%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R D+V + G +GT++P+L+ DGE P R VT+ ++L+ V+ +F EFV+ATGYV
Sbjct: 28 RAGDVVAVKGGETFLGTHEPVLVVDGEGPERKVTVSDYFLEAETVTVARFAEFVAATGYV 87
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
++AEKFG VF L+ +++ + VAY
Sbjct: 88 SDAEKFGCAAVFSGLMEDKKLVA--------------------------------ENVAY 115
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W AR+ W PEG S++E+R++HPV VSWNDAVA
Sbjct: 116 TPW-WARVDGAV-------------------WNMPEGPGSSVENRLDHPVTQVSWNDAVA 155
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
+ TW G RLPTEAEWE+ RGG + +PWG E N+WQG FP NT ADG+
Sbjct: 156 FATWVGGRLPTEAEWEHAARGGNRRQRYPWGEAEPTDDEICCNIWQGRFPHQNTLADGFF 215
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVH--HHPAPSYNPKGPTTGTDKVKKGGSY 308
TAPV S++ + G Y++ GNVWEW +D + V A N + +KV KGGS+
Sbjct: 216 GTAPVRSFEPTEDGFYHLSGNVWEWASDPFRVRSLSRQAKIRNEQSKKNA-EKVLKGGSF 274
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
LC++ YCYR+R AAR +PDS++ N GFR A D P
Sbjct: 275 LCHKSYCYRYRIAARIGLSPDSASSNTGFRVAYDTWP 311
>gi|302869538|ref|YP_003838175.1| hypothetical protein Micau_5091 [Micromonospora aurantiaca ATCC
27029]
gi|302572397|gb|ADL48599.1| protein of unknown function DUF323 [Micromonospora aurantiaca ATCC
27029]
Length = 320
Score = 227 bits (579), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 132/334 (39%), Positives = 172/334 (51%), Gaps = 59/334 (17%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
P D++ LPG TF MGT+ P +D E P R V + F + V+N QF FV A
Sbjct: 44 PFRGTDDLIALPGGTFLMGTDDPEGFPEDLEGPVRPVAVAPFAIAATPVTNRQFARFVDA 103
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TGY T+AE++G +FVF LL+ + LD+++
Sbjct: 104 TGYRTDAERYGWSFVFHLLLAPSAHRYV------------LDASVP-------------- 137
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
+A W G W P+G S + R HPVVHVS
Sbjct: 138 --------------QAPWWLAVEGA---------HWAAPDGPGSDVRGRERHPVVHVSHR 174
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA AYC W G RL TEAEWEY RGGL + WG+ LTP G H N+WQG FP NTA
Sbjct: 175 DAEAYCLWSGTRLATEAEWEYAARGGLSQARYAWGDELTPDGRHLCNIWQGAFPAVNTAE 234
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
DG+ T+PV ++ N +GLY++ GNVWE TA+ W + +G + + V +GG
Sbjct: 235 DGFAGTSPVTAFPPNGYGLYDVAGNVWELTAEPWR---------DRRGRISPGESVIRGG 285
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
SYLC++ YC R+R AARS+ T DSS+GN GFR A
Sbjct: 286 SYLCHDSYCNRYRVAARSRTTVDSSSGNTGFRVA 319
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/48 (54%), Positives = 35/48 (72%)
Query: 343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
+G + + V +GGSYLC++ YC R+R AARS+ T DSS+GN GFR A
Sbjct: 272 RGRISPGESVIRGGSYLCHDSYCNRYRVAARSRTTVDSSSGNTGFRVA 319
>gi|218438193|ref|YP_002376522.1| hypothetical protein PCC7424_1204 [Cyanothece sp. PCC 7424]
gi|218170921|gb|ACK69654.1| protein of unknown function DUF323 [Cyanothece sp. PCC 7424]
Length = 330
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 133/344 (38%), Positives = 186/344 (54%), Gaps = 59/344 (17%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+MVL+ G TFR+G ++ ++ E + NVT+D F +D++EV+N F EFV +T Y+T
Sbjct: 34 KNMVLISGGTFRIGDDRSNFLE--EKAAENVTVDNFCIDRYEVTNANFAEFVKSTDYITV 91
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ F P L +E+R+ S V F+ + + V ++SW V
Sbjct: 92 AERPLSKKQF-PDLPDEQRSPGSLV------FQPPEEVTQ------VPYLSWWHWVVGAN 138
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR HP G DS+IE + N+PVVH+++ DAVAY
Sbjct: 139 WR-----------------------------HPYGPDSSIEGKNNYPVVHIAYEDAVAYA 169
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEA+WEY RGGL+N++F WG+ + +AN WQG FP NT DGYL T
Sbjct: 170 QWVGKSLPTEAQWEYAARGGLKNKIFSWGDEYS---AQKANTWQGIFPFFNTKEDGYLGT 226
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------SYNPKGPTTGTD 300
+PV S+ N +G+Y++ GNVWEWT+DW+ + H S++PK P G
Sbjct: 227 SPVGSFPPNHYGIYDITGNVWEWTSDWFRLGHDSQANKVNPTGPSKEESFDPKKPGEGAL 286
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
V KGGSYLC YC R+R AAR PD+ ++GFR + G
Sbjct: 287 HVVKGGSYLCAWNYCSRYRPAARESQAPDTGTTHIGFRLVKNLG 330
>gi|257056606|ref|YP_003134438.1| hypothetical protein Svir_26230 [Saccharomonospora viridis DSM
43017]
gi|256586478|gb|ACU97611.1| uncharacterized conserved protein [Saccharomonospora viridis DSM
43017]
Length = 312
Score = 227 bits (579), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 170/335 (50%), Gaps = 54/335 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILI-KDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ MVLLPG F MG+ P+ +DGE P R V LD +S T VS +
Sbjct: 30 EGMVLLPGGEFSMGSEDPLAYPEDGEGPVRTVR-----LDPFWISRTA----VSNRDFAE 80
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+ G +E ER S V F GL
Sbjct: 81 FVDATG-------YRTEAERWGWSFV------FAGL------------------------ 103
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
LP + G R W HPEG ST++ R +HPVVHVSW+DA Y
Sbjct: 104 ------LPDDFPPTRGVATAPWWRQVEGADWRHPEGPHSTVDGREDHPVVHVSWSDAQEY 157
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
C W G RLPTEAEWEY RGGL FPWG L P GEHR NVWQG FP +N+ ADG+
Sbjct: 158 CAWAGLRLPTEAEWEYAARGGLSGNPFPWGAELEPGGEHRMNVWQGVFPRHNSRADGWYG 217
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-SYNPKGPTTGTDKVKKGGSYLC 310
T PV + N FGL+NM GNVWEW ADW++ H NP+GP GT + +GGSYLC
Sbjct: 218 TCPVTEFPPNGFGLHNMTGNVWEWCADWFHPTFHTLDRRTNPRGPEQGTHRSARGGSYLC 277
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+E YC R+R +AR TPD++ GN+GFRC D P
Sbjct: 278 HESYCRRYRVSARQGLTPDTTTGNVGFRCVRDAAP 312
>gi|126663718|ref|ZP_01734714.1| hypothetical protein FBBAL38_11629 [Flavobacteria bacterium BAL38]
gi|126624301|gb|EAZ94993.1| hypothetical protein FBBAL38_11629 [Flavobacteria bacterium BAL38]
Length = 381
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 185/342 (54%), Gaps = 49/342 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG + +D EFP V L+ F++D EV+N QF +FV+ATGYVT AE
Sbjct: 75 MVWIPGGIFSMGGDNDQARQD-EFPKHQVKLNGFFMDITEVTNEQFAKFVAATGYVTTAE 133
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV-HISWNDAVAYCTW 133
K D +E K Q+ D + + + P +S D Y W
Sbjct: 134 KDIDW---------DELKK--QLPSDTPKPDAETLKAASLVFVPTSGEVSLQD---YSQW 179
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W +G +W HP+G DSTI+ + N+PVVH+SW+DA AYC
Sbjct: 180 WN--------WSHG------------ANWKHPQGKDSTIDGKDNYPVVHISWDDANAYCK 219
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ RGGL ++ WGN + N +QG FP N +DG++ +A
Sbjct: 220 WAGKRLPTEAEWEFAARGGLSKNVYTWGNEKVDEEKFHCNYFQGNFPYKNEVSDGFIGSA 279
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYNPKGPTTGTD-------- 300
PV S+ N +GLY+M GNVWEW AD +N ++ + + NP GPT D
Sbjct: 280 PVKSFPANDYGLYDMSGNVWEWCADKYNNLYYDSFKKVKVTINPIGPTKSYDPDEPLVEK 339
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+V +GGS+LCNE YC +R +AR +++ DSS +LGFRC ++
Sbjct: 340 RVMRGGSFLCNESYCSGYRVSARMKSSADSSMEHLGFRCVSN 381
>gi|119960512|ref|YP_947226.1| hypothetical protein AAur_1451 [Arthrobacter aurescens TC1]
gi|119947371|gb|ABM06282.1| Domain of unknown function (DUF323) protein [Arthrobacter aurescens
TC1]
Length = 277
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 135/331 (40%), Positives = 171/331 (51%), Gaps = 65/331 (19%)
Query: 20 GDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDT 79
GD F G + DGE P V++ AF + V+N +F EFV ATGY TE+E +G +
Sbjct: 2 GDPFNEGYDG-----DGESPVHEVSVSAFRIGATTVTNQEFAEFVDATGYRTESETYGTS 56
Query: 80 FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP 139
VF + +++V N+ + RGA
Sbjct: 57 AVFHLAVKANTHDILNRV---------------------------NNVPWWLNIRGA--- 86
Query: 140 TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARL 199
W HP G S +HPV HVS NDA+AYC W G RL
Sbjct: 87 ---------------------DWAHPAGPLSNWADIPDHPVTHVSHNDALAYCQWAGRRL 125
Query: 200 PTEAEWEYGCRGGLENRLFPWGNNL--TPRGE--HRANVWQGEFPTNNTAADGYLSTAPV 255
PTEAEWEY RGGL + +PWGN L GE H N+WQGEFP+ NT DGYL+TAP
Sbjct: 126 PTEAEWEYAARGGLSGQRYPWGNELHNDAPGETAHNCNIWQGEFPSRNTIDDGYLTTAPA 185
Query: 256 MSYKENKFGLYNMVGNVWEWTADW-----WNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
S++ N +GLY GNVWEW +DW + H +P+GPT G +V +GGSYLC
Sbjct: 186 TSFRPNGYGLYQTSGNVWEWCSDWFLPKYYKTCHAQGTVEDPQGPTIGRGRVMRGGSYLC 245
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
++ YC R+R AARS NTPDS++GNLGFR A
Sbjct: 246 HDSYCNRYRLAARSSNTPDSASGNLGFRTVA 276
>gi|430743633|ref|YP_007202762.1| hypothetical protein Sinac_2774 [Singulisphaera acidiphila DSM
18658]
gi|430015353|gb|AGA27067.1| hypothetical protein Sinac_2774 [Singulisphaera acidiphila DSM
18658]
Length = 344
Score = 227 bits (578), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 134/342 (39%), Positives = 183/342 (53%), Gaps = 51/342 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++MV + G F MGT+ + D E P+ V +D F++D+HEV+N +F FV+AT YVT
Sbjct: 33 REMVWIAGGEFTMGTDSELGWPD-ERPAHRVRVDGFWIDEHEVTNDEFAAFVAATRYVTT 91
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEK E +L++ S + + E + D +
Sbjct: 92 AEKAPKA---EEILAQSSPGTPSPPPEALVPGSLVFQPTEEAV----------DLRDFAR 138
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W R P SW HPEG S ++ R HPVV VSW+DA AY
Sbjct: 139 W----------W----------RWTPGASWRHPEGPGSDLQGRGAHPVVQVSWDDANAYA 178
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G +LPTEAEWE+ RGGL+ R + WG++ + +AN+WQG FP NTAADGY T
Sbjct: 179 QWAGKQLPTEAEWEFAARGGLDGRAYVWGDSPFSLAQPQANIWQGAFPYRNTAADGYDRT 238
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWN-------VHHHPAPSYNPKGPTTGTD----- 300
APV S+ N FGL++M GNVWEW ADW+ HH P+ NP GP +D
Sbjct: 239 APVKSFPPNGFGLFDMGGNVWEWCADWYQRDLYRQRAGHH--PTLNPTGPEKSSDPARPF 296
Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V++GGS+LCN+ YC R+R +AR +PD+ ++GFRC
Sbjct: 297 TPQRVQRGGSFLCNDSYCSRYRPSARHGCSPDTGMSHVGFRC 338
>gi|410456927|ref|ZP_11310775.1| non-specific serine/threonine protein kinase [Bacillus bataviensis
LMG 21833]
gi|409927014|gb|EKN64163.1| non-specific serine/threonine protein kinase [Bacillus bataviensis
LMG 21833]
Length = 318
Score = 227 bits (578), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 103/180 (57%), Positives = 129/180 (71%), Gaps = 1/180 (0%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W PEG STIE R++HPVVHVSWNDA YC W G RLPTEAEWEY RGGL + +PWG
Sbjct: 138 WNQPEGPGSTIESRLDHPVVHVSWNDANEYCNWAGKRLPTEAEWEYAARGGLVQKRYPWG 197
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
+ L P G+H N+WQG+FP N DGY+ TAP S+ N +GL+NM GNVWEW +DW++
Sbjct: 198 DRLHPDGKHVCNIWQGKFPHTNHGRDGYIGTAPSKSFPPNHYGLFNMSGNVWEWCSDWFS 257
Query: 282 VHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+H NP+GP G DK+ KGGSYLC++ YC R+R +ARS +PDSS+G++GFRC
Sbjct: 258 ASYHIEGTRNNPQGPPNGMDKIIKGGSYLCHKSYCNRYRVSARSAVSPDSSSGHMGFRCV 317
>gi|426403429|ref|YP_007022400.1| gliding motility-associated lipoprotein GldK [Bdellovibrio
bacteriovorus str. Tiberius]
gi|425860097|gb|AFY01133.1| putative gliding motility-associated lipoprotein GldK [Bdellovibrio
bacteriovorus str. Tiberius]
Length = 327
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/351 (37%), Positives = 176/351 (50%), Gaps = 61/351 (17%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + DM+ +PG F MG++ E P+ V +D F++D+H V+N F FV AT
Sbjct: 16 PSAPFPDMIWIPGGMFTMGSDNHYA---EEAPAHRVKVDGFWMDRHPVTNRDFLAFVQAT 72
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
GY T EK D ++ P SE + S V F S ++ R
Sbjct: 73 GYRTFCEKEPDLNLY-PGASESMKDPASVV------FVKPASPVDLRN------------ 113
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
Y W+ P +W P+G S+IE RM+HPVVHVSW D
Sbjct: 114 -PYLWWQ---------------------FIPGANWRSPQGPGSSIESRMDHPVVHVSWQD 151
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
AY W G +LPTEAEWE+ RGG E R F WG L P G H+AN+WQGEFP N + D
Sbjct: 152 VQAYAKWIGKKLPTEAEWEFAARGGHEGREFAWGEELEPEGRHQANIWQGEFPWQNHSGD 211
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSY 290
G+ T+PV ++ N +GL++M+GNVWEWT DW++ H A S+
Sbjct: 212 GFEGTSPVTAFPPNDYGLFDMIGNVWEWTVDWYSPRHPDEKLKACCIPKNPRGGAEADSF 271
Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+P+ KV KGGS+LC YC R+R +AR D+S +LGFRC
Sbjct: 272 DPRQALKIPRKVMKGGSHLCAPNYCKRYRPSARMAQPVDTSTSHLGFRCVV 322
>gi|186682703|ref|YP_001865899.1| hypothetical protein Npun_F2391 [Nostoc punctiforme PCC 73102]
gi|186465155|gb|ACC80956.1| protein of unknown function DUF323 [Nostoc punctiforme PCC 73102]
Length = 332
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 134/345 (38%), Positives = 184/345 (53%), Gaps = 68/345 (19%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MVL+PG F MG + ++ E ++NVT+ +F +D++EV+N QF EFV AT YVT A
Sbjct: 38 EMVLIPGGIFNMGADNSGFVE--ERTAKNVTVTSFCIDKYEVTNAQFAEFVKATKYVTVA 95
Query: 74 ------EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
E+F D L +E+R S V H K +G+
Sbjct: 96 QCPLPKEQFPD-------LPDEQRLPGSLVFHQPK--QGVKQ------------------ 128
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
V + +W W + P +W HP G DST+ + N+PVVH+++ D
Sbjct: 129 VQFLSW----------WHWT----------PGANWQHPFGSDSTVVGKENYPVVHIAYED 168
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A+AY W G LPTEA+WEY RGGL+ + WGN + E +AN WQG FP NT AD
Sbjct: 169 ALAYGIWAGKSLPTEAQWEYAARGGLDGATYTWGNQYS---ETKANTWQGIFPFFNTKAD 225
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----------PAPSYNPKGPTT 297
GY+ TA V S+ N +GLY+M GNVWEWT+DW+++ H + S++PK PT
Sbjct: 226 GYIGTAKVGSFSANGYGLYDMTGNVWEWTSDWFSLGHENKAHSVNPTGASKSFDPKKPTE 285
Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V KGGSYLC YC R+R AAR PD+ ++GFR +
Sbjct: 286 IALHVIKGGSYLCAPNYCSRYRPAARESQAPDTGTSHIGFRLVKN 330
>gi|374575294|ref|ZP_09648390.1| hypothetical protein Bra471DRAFT_03916 [Bradyrhizobium sp. WSM471]
gi|374423615|gb|EHR03148.1| hypothetical protein Bra471DRAFT_03916 [Bradyrhizobium sp. WSM471]
Length = 335
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 178/346 (51%), Gaps = 62/346 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV +PG TFRMG++ E PS V +D F++D+ V+N QF++FV ATG+VTEA
Sbjct: 20 EMVFIPGGTFRMGSDHHY---PEEAPSHRVAVDGFWMDRTPVTNRQFKQFVDATGHVTEA 76
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ D + L+E A + P+ HI+ D + W
Sbjct: 77 QVVPDPKDYPGALAEMLYAG-------------------SLVFSPLPHIT--DLTDWSQW 115
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W + RG +W HP G S I+ +HPVVHVS++DA AY
Sbjct: 116 ----------WTF-LRGA---------NWRHPYGPGSNIKDLDDHPVVHVSFSDAAAYAR 155
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE RGGLE + WG+ L P G+H AN+WQG FP N DGY T+
Sbjct: 156 WAGKDLPTEAEWESAARGGLEAEEYAWGDALMPGGKHMANIWQGNFPVQNLGEDGYERTS 215
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTD---------- 300
PVM++ N +GLY+M+GNVWEWT DWW+ H P PK P G +
Sbjct: 216 PVMAFPPNGYGLYDMIGNVWEWTVDWWSSKHAADAAKPCCIPKNPRGGGEHASYDPRLPD 275
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 276 IQIPRKVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHVGFRCVV 321
>gi|420237896|ref|ZP_14742341.1| hypothetical protein PMI07_00078 [Rhizobium sp. CF080]
gi|398089558|gb|EJL80073.1| hypothetical protein PMI07_00078 [Rhizobium sp. CF080]
Length = 317
Score = 225 bits (574), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 139/350 (39%), Positives = 181/350 (51%), Gaps = 59/350 (16%)
Query: 5 PAPPVERY---KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
PA P+ R + + + G+TF +GT+ P++ DGE P R V+L F+L+ V+ +F
Sbjct: 20 PAGPISRGVPGEAIAVKGGETF-VGTDDPVISTDGEGPERKVSLQDFHLEIETVTIRRFA 78
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
+FV ATG+VTE+E+FG + VF PLL E A V+
Sbjct: 79 QFVEATGHVTESERFGSSAVFAPLLGEARTATGG-----------------------VIE 115
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
W + TWR PEG S IE R++HPV
Sbjct: 116 TPWWTRIDDATWR-----------------------------QPEGRGSLIEGRLDHPVT 146
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
VSWNDA+A+ W G RLPTEAEWE+ RGG R +PWG N+WQG FP
Sbjct: 147 QVSWNDALAFAQWVGGRLPTEAEWEHAARGGNRRRRYPWGEEEPTDEAIFCNIWQGRFPD 206
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH--HHPAPSYNPKGPTTGT 299
NT DGY TAPV S+ N+ G YN+ GNVWEWTAD + V A + N +
Sbjct: 207 TNTLQDGYFGTAPVRSFAPNEAGFYNLAGNVWEWTADPFKVRSLSKQAKARNEIARRF-S 265
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
DKV KGGS+LC++ YCYR+R AAR +PDS++ N GFR A D P + T
Sbjct: 266 DKVLKGGSFLCHKSYCYRYRIAARMGLSPDSASSNTGFRVAYDAKPGSRT 315
>gi|430745772|ref|YP_007204901.1| hypothetical protein Sinac_5052 [Singulisphaera acidiphila DSM
18658]
gi|430017492|gb|AGA29206.1| hypothetical protein Sinac_5052 [Singulisphaera acidiphila DSM
18658]
Length = 371
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 180/341 (52%), Gaps = 51/341 (14%)
Query: 7 PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
PP + MV +P F MG++ + P V+LD F++D+ EV+N QF+ FV A
Sbjct: 66 PPGVAPEGMVWIPAGDFAMGSDDESMSD--ARPVHRVSLDGFWMDRTEVTNRQFERFVRA 123
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TGY T AE+ D F P +E S V PV +S D
Sbjct: 124 TGYTTVAEQAPDPKAF-PGAPKELLVPGSLV-----------------FSPPVGRVSLED 165
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
+A+ W Y P +W HP G DS+I+ +HPVV V ++
Sbjct: 166 HLAW-------------WRY----------VPGANWKHPSGPDSSIQGLEDHPVVQVCFD 202
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA+AY W G RLPTEAEWEY RGGL+ + + WG++L P G+ + N WQG FP N
Sbjct: 203 DALAYAKWAGKRLPTEAEWEYAARGGLDGKRYCWGDDLKPGGKWQVNNWQGLFPNENMKE 262
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD------ 300
DG+ TAP S+ N +GL +M GNVWEW ADW+ ++ P+ + NP+GP + D
Sbjct: 263 DGFDGTAPTGSFAVNGYGLSDMAGNVWEWCADWYQPNYDPSQARNPQGPDSSHDPAEPGI 322
Query: 301 --KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V++GGS+LC++ YC R+ AR + DS A ++GFRC
Sbjct: 323 PKRVQRGGSFLCSDLYCVRYLPGARGKGATDSGASHVGFRC 363
>gi|406834765|ref|ZP_11094359.1| hypothetical protein SpalD1_24084 [Schlesneria paludicola DSM
18645]
Length = 374
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 131/347 (37%), Positives = 183/347 (52%), Gaps = 47/347 (13%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
APP E + MV +PG F MG N D E P+ V +D F++D EV+N+QF FV+
Sbjct: 58 APPKETPQGMVWIPGGEFLMGNNASTAWPD-EQPAHRVAVDHFWMDSTEVTNSQFAAFVN 116
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
AT YVT AEK E +L+ + + + EH+ + N
Sbjct: 117 ATEYVTTAEK---APTAEEILANSPPGTPQPSAQLLVAGSLVFTPTEHQ-------VPLN 166
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
D + TW P +W HPEG S + +R +HPVVHVSW
Sbjct: 167 DIRQWWTWT-----------------------PGANWRHPEGPGSDLRNRDHHPVVHVSW 203
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
+DA AY W RLPTEAEWE+ RGGL+ + + WG++ + +AN+W G FP +NTA
Sbjct: 204 DDATAYARWAEKRLPTEAEWEFAARGGLQQQPYAWGSDSFSETKPQANLWTGSFPFDNTA 263
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD 300
DGY TAPV ++ N +GLY+M GNVWEW +DW++ + + NP+GP D
Sbjct: 264 LDGYARTAPVGTFAPNGYGLYDMAGNVWEWCSDWYDRDLYKDRLKSKITSNPRGPDQSRD 323
Query: 301 KVK--------KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
++ +GGS+LCN+ YC R+R +AR TPD+ ++GFRC
Sbjct: 324 PMRPFMSQRSQRGGSFLCNDSYCSRYRPSARHGCTPDTGMSHVGFRC 370
>gi|443326286|ref|ZP_21054945.1| hypothetical protein Xen7305DRAFT_00032370 [Xenococcus sp. PCC
7305]
gi|442794085|gb|ELS03513.1| hypothetical protein Xen7305DRAFT_00032370 [Xenococcus sp. PCC
7305]
Length = 338
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 130/341 (38%), Positives = 180/341 (52%), Gaps = 62/341 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+DMV +PG +F++G ++ + E + +V +D F +DQHE++N QF EFV TGY+T
Sbjct: 48 QDMVWIPGGSFKIGDDEHYI---EEHSAPDVRVDGFCMDQHEITNGQFAEFVKETGYITV 104
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ F P LSEE+RA S V +
Sbjct: 105 AERPLSATEF-PQLSEEQRAPGSLV--------------------------------FVQ 131
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
G R+ E W + G +W HPEG+DS+IE + N+PVV V++ DA AY
Sbjct: 132 PLGLRV-KELGWWHWVTGA---------NWQHPEGVDSSIEGKENYPVVQVAYEDAQAYA 181
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEA+WE+ +GGL+N +F WGN + +AN WQG+FP N DGY T
Sbjct: 182 DWAGKSLPTEAQWEFAAKGGLKNAIFTWGNTYSA---DKANTWQGKFPFKNLEKDGYYGT 238
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH------------PAPSYNPKGPTTGTD 300
APV S+ N +GLY+M GNVWEWT DW+ + H + S +P+ P
Sbjct: 239 APVESFAANGYGLYDMAGNVWEWTVDWYRLGHKGKEHQVNPTVADASKSLDPREPGV-AK 297
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
V KGGS+LC + YC R R AAR +PD+ ++GFR +
Sbjct: 298 HVIKGGSFLCAQNYCSRFRPAAREAQSPDTGTSHIGFRLVS 338
>gi|395516530|ref|XP_003762440.1| PREDICTED: sulfatase-modifying factor 1 [Sarcophilus harrisii]
Length = 396
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 121/251 (48%), Positives = 154/251 (61%), Gaps = 46/251 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +P F MGTN P + +DGE P+R V +++FY+D +EVSN +F++FV++TGYVTEAE
Sbjct: 1 MAHIPAGIFTMGTNDPQIKQDGEGPARRVQINSFYMDLYEVSNAEFEKFVNSTGYVTEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFGD+FVF+ +LS+ QV++ G+D E H C WR
Sbjct: 61 KFGDSFVFDIMLSD-------QVKN------GIDQAAESGPH--------------CNWR 93
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G R F PE + I +RM+HPV+HVSWNDAVAYCTW
Sbjct: 94 GV----------GIR-------FLNPKVEMPELKE--ITNRMDHPVLHVSWNDAVAYCTW 134
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY CRGGLENRL+PWGN L PRG+H AN+WQGEFP +NT DGY TAP
Sbjct: 135 AGKRLPTEAEWEYSCRGGLENRLYPWGNKLQPRGQHYANLWQGEFPVSNTGEDGYRGTAP 194
Query: 255 VMSYKENKFGL 265
+ +++ L
Sbjct: 195 CQAGNQSRQSL 205
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 40/58 (68%), Positives = 51/58 (87%)
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
Y PKGP +G+++VKKGGSY+C++ YCYR+RCAARSQNTPDSS+ NLGFRCA+ + P T
Sbjct: 339 YMPKGPHSGSNRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSSSNLGFRCASSRLPDT 396
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 36/49 (73%), Positives = 46/49 (93%)
Query: 343 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
KGP +G+++VKKGGSY+C++ YCYR+RCAARSQNTPDSS+ NLGFRCA+
Sbjct: 342 KGPHSGSNRVKKGGSYMCHKSYCYRYRCAARSQNTPDSSSSNLGFRCAS 390
>gi|343087302|ref|YP_004776597.1| sulfatase-modifying factor protein [Cyclobacterium marinum DSM 745]
gi|342355836|gb|AEL28366.1| Sulphatase-modifying factor protein [Cyclobacterium marinum DSM
745]
Length = 371
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 127/345 (36%), Positives = 183/345 (53%), Gaps = 48/345 (13%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
++ MV + G F G+ +D E P+ +D F++D+ EV+N QF FV+ATGYVT
Sbjct: 62 HEGMVWIEGGEFTRGSTDDRGRRD-ELPAHKAKVDGFWIDETEVTNAQFAAFVAATGYVT 120
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AEK D + L +V + +++ R P + N+A +
Sbjct: 121 IAEKKPDWEEIKKQLPPGTPKPPDEVL--------VAASLTFR--SPNQPVPLNNAAQWW 170
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W P +W P+G S+IE + N+PVVHV+W+DAVAY
Sbjct: 171 EWT-----------------------PGANWRMPQGPGSSIEGKENYPVVHVAWDDAVAY 207
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLPTEAEWEY +GGLE++ FPWGN+ + AN+WQG+FP + A DG+
Sbjct: 208 AKWAGKRLPTEAEWEYAAKGGLEDKSFPWGNDPVEKNGFMANIWQGDFPVTDKAEDGFQG 267
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-------------PAPSYNPKGPTTG 298
APV S++ N +GLY+M GNVWEWT DW+ ++ P SY+P+ PT
Sbjct: 268 LAPVKSFEPNGYGLYDMAGNVWEWTNDWYREDYYERITDEVAINPQGPKDSYDPQEPTV- 326
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
KV KGGS+LCN YC +R + + +++ D+ + GFRC + K
Sbjct: 327 PKKVVKGGSFLCNVSYCEGYRVSGKMKSSADTGLEHTGFRCVSSK 371
>gi|383772147|ref|YP_005451213.1| hypothetical protein S23_39060 [Bradyrhizobium sp. S23321]
gi|381360271|dbj|BAL77101.1| hypothetical protein S23_39060 [Bradyrhizobium sp. S23321]
Length = 337
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 138/363 (38%), Positives = 185/363 (50%), Gaps = 64/363 (17%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R +MV +PG TF MG++ E PS V++D F++D+ V+N QF++FV+ATG+V
Sbjct: 17 RTGEMVFIPGGTFLMGSDHHY---PEEAPSHRVSVDGFWIDRTPVTNRQFKQFVNATGHV 73
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEA+ D + P +E S V + R L W
Sbjct: 74 TEAQIVPDPKDY-PGAPKEMLYAGSLVFSPLPRITDL--------------TDW------ 112
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
++W RG +W HP G STI+ +HPVVHVS +DAVA
Sbjct: 113 -----------SQWWSFLRGA---------NWRHPYGPGSTIKGMDDHPVVHVSHSDAVA 152
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G LPTEAEWE+ RGGL+ F WG+ L P G+H AN+WQG FP N DG+
Sbjct: 153 YAGWAGKDLPTEAEWEFAARGGLDGEEFAWGDALMPGGKHMANIWQGNFPVQNLGEDGFE 212
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYNPK 293
T+PVM++ N +GLY+M+GNVWEWT+DWW+ H SY+P
Sbjct: 213 RTSPVMAFPPNGYGLYDMIGNVWEWTSDWWSARHTAEAAKPCCIPTNPRGGREEASYDPA 272
Query: 294 GPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
P KV KGGS+LC YC R+R AAR D+S ++GFRC TD+
Sbjct: 273 LPDIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRCVTRM--PLATDRG 330
Query: 353 KKG 355
++G
Sbjct: 331 REG 333
>gi|420262802|ref|ZP_14765443.1| sulfatase-modifying factor 1 [Enterococcus sp. C1]
gi|394770559|gb|EJF50363.1| sulfatase-modifying factor 1 [Enterococcus sp. C1]
Length = 287
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 172/338 (50%), Gaps = 64/338 (18%)
Query: 15 MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M +P TFR+GTN +D E P ++ AF +D V+N FQ FV ATGYVTEA
Sbjct: 1 MPTIPAGTFRIGTNTMDGFAEDLEGPQISLDSPAFAVDATTVTNQAFQVFVEATGYVTEA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF LSE Q +H
Sbjct: 61 ERFGWSFVFHYFLSE-------QTKH---------------------------------- 79
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R ++P A W Y G W HPEG DSTI RM+HPVV VS NDA+AYC
Sbjct: 80 RSQKVPNMAWW-YAVSGA---------DWRHPEGPDSTIAQRMDHPVVQVSRNDAIAYCQ 129
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWE +GG +PWG + N+WQGEFP +NT ADG+ +TA
Sbjct: 130 WAKKRLPTEAEWEIAAKGGTSFEKYPWGREFLAENTYHCNIWQGEFPKSNTRADGFANTA 189
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-------TGTDKV--KK 304
P Y+ N FGLY ++GNVWEW N P + + T D++ +
Sbjct: 190 PAKWYEPNGFGLYQVIGNVWEWCV---NPARIPLTQFQQQTGQTFWIRNQTQDDQMYATR 246
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGS+LC+E YC R+R AAR+ N+ S+A NLGFRC D
Sbjct: 247 GGSFLCHESYCKRYRIAARNGNSGMSAANNLGFRCVKD 284
>gi|325108698|ref|YP_004269766.1| sulfatase-modifying factor protein [Planctomyces brasiliensis DSM
5305]
gi|324968966|gb|ADY59744.1| Sulphatase-modifying factor protein [Planctomyces brasiliensis DSM
5305]
Length = 394
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 177/349 (50%), Gaps = 57/349 (16%)
Query: 15 MVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
MV +PG TF MGT+ P +K E P VT+D F++D EV+N +F +FV TG
Sbjct: 72 MVWIPGGTFEMGTDPISSPKNPDKVKQDEIPKHEVTVDGFWMDVAEVTNAEFAKFVDETG 131
Query: 69 YVTEAE---KFGDTFVFEP---LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
YVT AE K D P L+ EE S + + E +++
Sbjct: 132 YVTFAEIPPKREDFIGMVPDISLIPEENLVAGSLIFNPDFDRENFRDDVQN--------- 182
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
WEY + P W HP G DS+IE +HPVV+
Sbjct: 183 ---------------------WEYQAW-----KYQPGADWRHPTGPDSSIEGLDDHPVVN 216
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
V + D VAYC W G RLPTEAEWEY GG + + +PWGN P GE+ N WQG FPT+
Sbjct: 217 VVYRDCVAYCKWAGKRLPTEAEWEYAAHGGKDVK-YPWGNEFLPDGEYMCNYWQGTFPTD 275
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTD- 300
DG+L TAPV SY N +GLY+M GNVWEW D++ ++ +P NP+GP D
Sbjct: 276 RQNLDGFLDTAPVKSYPPNGYGLYDMAGNVWEWVNDYYRPDYYRRSPKRNPQGPDNSFDP 335
Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+V +GGS+LCN C +RCAAR ++ S + + GFRC D
Sbjct: 336 NEPGIEKRVTRGGSFLCNTNNCTGYRCAARMRSDVTSPSYHTGFRCVID 384
>gi|417089844|ref|ZP_11955736.1| hypothetical protein SSUR61_0017 [Streptococcus suis R61]
gi|353533785|gb|EHC03425.1| hypothetical protein SSUR61_0017 [Streptococcus suis R61]
Length = 288
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 134/346 (38%), Positives = 172/346 (49%), Gaps = 80/346 (23%)
Query: 15 MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ +PG +R+GT K D E PS V +++FY+D+ V+N +FQ FV AT Y+TEA
Sbjct: 1 MIKIPGGIYRIGTEQKEGFDADFETPSIEVNIESFYMDETSVTNKEFQVFVDATNYITEA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
EKFG ++VF +S+ R++ + V W AV
Sbjct: 61 EKFGWSYVFHLFVSDHLRSR-----------------------NIVKETPWWIAV----- 92
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P SW PEG S+I RM+HPVVH++ NDAVAYC
Sbjct: 93 ------------------------PGASWKEPEGPGSSISDRMDHPVVHITRNDAVAYCR 128
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWE +GG E F WG +L G HR N WQG FPT NT DGYL+T
Sbjct: 129 WSGKRLPTEAEWEVAAKGGTEFEKFYWGEDLVLSDGVHRCNTWQGIFPTKNTKEDGYLAT 188
Query: 253 APVMSYKENKFGLYNMVGNVWEWTA----------------DWWNVHHHPAPSYNPKGPT 296
APV +Y N GLY M+GNVWEW D+W + +
Sbjct: 189 APVKTYPANPLGLYEMIGNVWEWCVNPGKLDLSTFQIVSGQDFWREYQVIDDAL------ 242
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGS+LC+ YC R+R AR+ +T SS+ N+GFRC D
Sbjct: 243 ----YAIKGGSFLCHSVYCNRYRMNARNSSTAMSSSQNMGFRCVRD 284
>gi|257875092|ref|ZP_05654745.1| conserved hypothetical protein [Enterococcus casseliflavus EC20]
gi|257809258|gb|EEV38078.1| conserved hypothetical protein [Enterococcus casseliflavus EC20]
Length = 287
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 172/338 (50%), Gaps = 64/338 (18%)
Query: 15 MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M +P T+R+GTN +D E P ++ AF +D V+N FQ FV ATGYVTEA
Sbjct: 1 MPTIPAGTYRIGTNTMDGFAEDLEGPQISLDSPAFAIDATTVTNQAFQVFVEATGYVTEA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF LSE Q +H
Sbjct: 61 ERFGWSFVFHYFLSE-------QTKH---------------------------------- 79
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R ++P A W Y G W HPEG DSTI RM+HPVV VS NDA+AYC
Sbjct: 80 RSQKVPNMAWW-YAVSGA---------DWRHPEGPDSTIAQRMDHPVVQVSRNDAIAYCQ 129
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWE +GG +PWG + N+WQGEFP +NT ADG+ +TA
Sbjct: 130 WAKKRLPTEAEWEIAAKGGTSFEKYPWGREFLAENTYHCNIWQGEFPRSNTQADGFANTA 189
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-------TGTDKV--KK 304
P Y+ N FGLY ++GNVWEW N P + + T D++ +
Sbjct: 190 PAKWYEPNGFGLYQVIGNVWEWCV---NPARIPLTQFQQQTGQTFWARNQTQDDQLYATR 246
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGS+LC+E YC R+R AAR+ N+ S+A NLGFRC D
Sbjct: 247 GGSFLCHESYCKRYRIAARNGNSGMSAANNLGFRCVKD 284
>gi|406830272|ref|ZP_11089866.1| sulfatase-modifying factor protein [Schlesneria paludicola DSM
18645]
Length = 407
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 132/353 (37%), Positives = 176/353 (49%), Gaps = 50/353 (14%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + MV +P F MGT+ P + + E P+ V L+ +++D HEV+N +FQ+FV AT
Sbjct: 89 PTSAPEGMVWIPPGEFTMGTSGPSALPN-ERPAHQVHLNGYWIDAHEVTNAEFQKFVDAT 147
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
GYVT AE + + E K + K + + P + ND
Sbjct: 148 GYVTTAE----------VKPDWEELKKTVPPGTPKPPDDVLVPGSLVFTAPPTPVPTNDV 197
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ TW P W HPEG S+I++R HPVVH+SW+D
Sbjct: 198 SKWWTW-----------------------IPGACWKHPEGPKSSIDNRSEHPVVHISWDD 234
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G RLPTEAEWE RGGL + F WG+ +N+WQG FP N D
Sbjct: 235 ANAYAKWAGKRLPTEAEWECASRGGLVGKRFTWGDEPPKESSKLSNIWQGVFPNQNDQLD 294
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH--------------PAPSYNPK 293
G+ TAPV SY N +GLY+M GNVWEW +DW+ + PA S++P
Sbjct: 295 GWERTAPVKSYPPNGYGLYDMSGNVWEWCSDWYRADAYAQRMAEKVVENPPGPADSWDPG 354
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA-DKGP 345
P +V +GGS+LC+ YC +R AAR TPD+ +LGFRC DK P
Sbjct: 355 EP-DAAKRVIRGGSFLCHITYCESYRTAARRGTTPDTGMSHLGFRCVVPDKKP 406
>gi|325569224|ref|ZP_08145429.1| sulfatase-modifying factor 1 [Enterococcus casseliflavus ATCC
12755]
gi|325157473|gb|EGC69633.1| sulfatase-modifying factor 1 [Enterococcus casseliflavus ATCC
12755]
Length = 287
Score = 221 bits (564), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 172/338 (50%), Gaps = 64/338 (18%)
Query: 15 MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M +P T+R+GTN +D E P ++ AF +D V+N FQ FV ATGYVTEA
Sbjct: 1 MPTIPAGTYRIGTNTMDGFAEDLEGPQISLDSPAFSIDATTVTNQAFQVFVEATGYVTEA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+FG +FVF LSE Q +H
Sbjct: 61 ERFGWSFVFHYFLSE-------QTKH---------------------------------- 79
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R ++P A W Y G W HPEG DSTI RM+HPVV VS NDA+AYC
Sbjct: 80 RSQKIPNMAWW-YTVSGA---------DWRHPEGPDSTIAQRMDHPVVQVSRNDAIAYCQ 129
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWE +GG +PWG + N+WQGEFP +NT ADG+ +TA
Sbjct: 130 WAKKRLPTEAEWEIAAKGGTTFEKYPWGREFLAENTYHCNIWQGEFPKSNTRADGFANTA 189
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-------TGTDKV--KK 304
P Y+ N FGLY ++GNVWEW N P + + T D++ +
Sbjct: 190 PAKWYEPNGFGLYQVIGNVWEWCV---NPARIPLTQFQQQTGQTFWIRNQTQDDQLYATR 246
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGS+LC+E YC R+R AAR+ N+ S+A NLGFRC D
Sbjct: 247 GGSFLCHESYCKRYRIAARNGNSGMSAANNLGFRCVKD 284
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/40 (57%), Positives = 31/40 (77%)
Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+GGS+LC+E YC R+R AAR+ N+ S+A NLGFRC D+
Sbjct: 246 RGGSFLCHESYCKRYRIAARNGNSGMSAANNLGFRCVKDL 285
>gi|381182444|ref|ZP_09891251.1| non-specific serine/threonine protein kinase [Listeriaceae
bacterium TTU M1-001]
gi|380317681|gb|EIA20993.1| non-specific serine/threonine protein kinase [Listeriaceae
bacterium TTU M1-001]
Length = 285
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 174/335 (51%), Gaps = 58/335 (17%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ +P + +GTN K D E P V L FY+D+ V+N +FQ+FV+ TGYV++A
Sbjct: 1 MIFIPKGNYEIGTNSKEGFEFDLEGPKVQVKLPNFYIDETTVTNQEFQKFVTKTGYVSDA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
EKFG +FVF L EE +A +V
Sbjct: 61 EKFGSSFVFHYFLDEETKAISEKV------------------------------------ 84
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
++ W Y G +W HP G +S+I+ +M+HPVV VS NDA+A+
Sbjct: 85 ------NQSGWWYEVAGA---------NWKHPAGPNSSIKEKMDHPVVQVSRNDAIAFSQ 129
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE +GG + +PWG L E+ N+WQG FP N DGY+ TA
Sbjct: 130 WAGKRLPTEAEWEIAAKGGTDRERYPWGEELLENEEYHCNIWQGNFPFENDGLDGYVDTA 189
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP----KGPTTGTDKVK--KGGS 307
PV +Y+ N +G Y M+GNVWEW + ++ SY K D++ +GGS
Sbjct: 190 PVRTYEPNLYGCYQMIGNVWEWCLNPARINLSEFNSYTGKHFWKNYQMKDDQMYAIRGGS 249
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC++ YC R+R AAR+ N+ S++ NLGFRC D
Sbjct: 250 FLCHQSYCKRYRIAARNGNSGMSASNNLGFRCVQD 284
>gi|398819756|ref|ZP_10578305.1| hypothetical protein PMI42_00781 [Bradyrhizobium sp. YR681]
gi|398229690|gb|EJN15763.1| hypothetical protein PMI42_00781 [Bradyrhizobium sp. YR681]
Length = 336
Score = 221 bits (563), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 182/346 (52%), Gaps = 64/346 (18%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV +PG TFRMG++ E PS V++D F++D+ V+N QF++FV+AT +VTEA
Sbjct: 20 EMVFIPGGTFRMGSDHHY---PEEAPSHRVSVDGFWIDRTPVTNRQFKQFVNATDHVTEA 76
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
QV D K + G + + + ++ +
Sbjct: 77 ----------------------QVVPDPKDYPGA-----------LKAMLYAGSLVFAPL 103
Query: 134 RGARLPTE-AEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
PT+ ++W RG +W HP G S I+ +HPVVHVS +DA+AY
Sbjct: 104 PRITDPTDWSQWWSFMRGA---------NWRHPYGPGSNIKGLDDHPVVHVSHSDALAYA 154
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
+W G LPTEAEWE+ RGGL+ F WG+ L P G+H AN+WQG+FP N DG+ T
Sbjct: 155 SWAGKDLPTEAEWEFAARGGLDGEEFAWGDALMPGGKHMANIWQGKFPVQNLGEDGFERT 214
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD--------- 300
+PVM++ N +GLY+M+GNVWEWT+DWW+ H P P P G +
Sbjct: 215 SPVMAFPPNGYGLYDMIGNVWEWTSDWWSARHASEVAKPCCIPSNPRGGREEASYDPALP 274
Query: 301 ------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 275 DIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHVGFRCV 320
>gi|430842314|ref|ZP_19460229.1| hypothetical protein OGO_02000 [Enterococcus faecium E1007]
gi|431592164|ref|ZP_19521400.1| hypothetical protein OK5_04253 [Enterococcus faecium E1861]
gi|430493395|gb|ELA69698.1| hypothetical protein OGO_02000 [Enterococcus faecium E1007]
gi|430591789|gb|ELB29816.1| hypothetical protein OK5_04253 [Enterococcus faecium E1861]
Length = 283
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 128/333 (38%), Positives = 170/333 (51%), Gaps = 58/333 (17%)
Query: 15 MVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ +P TF +GTN+P D E P + AF +D+ V+N +F F+ TGY+TEA
Sbjct: 1 MIAIPEGTFHIGTNQPDGFPTDNEGPKITIQHPAFMIDETTVTNKEFASFIKKTGYITEA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
EKFG +FVF LSE + K
Sbjct: 61 EKFGWSFVFHYFLSEVTKKK---------------------------------------- 80
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
++L W Y G W HPEG DS+IE R +HPVV VS NDAVAYC
Sbjct: 81 --SKLVPNMNWWYAVAGA---------DWSHPEGPDSSIEARWDHPVVQVSRNDAVAYCQ 129
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE +GG E + + WG G ++ N+WQG+FP NT ADG+ +TA
Sbjct: 130 WAGKRLPTEAEWEIAAKGGTEFQKYYWGKEFLKDGTYQCNIWQGDFPRTNTLADGFANTA 189
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNV----HHHPAPSYNPKGPTTGTDKV--KKGGS 307
P Y+ N++G+Y +GNVWEW ++ + +Y DK +GGS
Sbjct: 190 PAKWYEANEYGMYQPIGNVWEWCSNPARIDLTEFKEKDSAYYWSHHQIIDDKTYATRGGS 249
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+LC+E YC R+R +AR+ NT S+A NLGFRC
Sbjct: 250 FLCHESYCKRYRLSARNGNTGMSAANNLGFRCV 282
>gi|403308842|ref|XP_003944851.1| PREDICTED: sulfatase-modifying factor 2 [Saimiri boliviensis
boliviensis]
Length = 378
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 176/337 (52%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGTN P +DGE P R VT+ F +D V+N +F++F+ Y TEAE
Sbjct: 107 MVQLPGGRFLMGTNSPD-SRDGEGPVREVTVKPFAIDIFPVTNKEFRDFIREKKYRTEAE 165
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 166 MFGWSFVFEDFVSDELRNKATQ---------------------PMKPVLW--------W- 195
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 196 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 235
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 236 RGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDRAEDGFHGVSP 292
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 293 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 339
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 340 GSANHRARVTTRMGNTPDSASDNLGFRCAADTGRPRG 376
>gi|224076257|ref|XP_002196123.1| PREDICTED: sulfatase-modifying factor 2 isoform 1 [Taeniopygia
guttata]
Length = 310
Score = 221 bits (562), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 130/335 (38%), Positives = 179/335 (53%), Gaps = 65/335 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG TF+MG+ + + E P+R VT+ F LD+H V+N +F+EFV Y TEAE
Sbjct: 27 MVQLPGGTFQMGSGS-LQRRGEEGPAREVTVKPFALDKHPVTNREFREFVREKKYKTEAE 85
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +SEE + K++ + L+S W W
Sbjct: 86 AFGWSFVFEDFVSEELKKKVT---------QKLESA------------PW--------W- 115
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S+I+ R+++PV+HVSWNDA A+C W
Sbjct: 116 ---LPIEKAF-----------------WRQPSGPGSSIKDRLDYPVLHVSWNDAQAFCAW 155
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G RLP+E EWE+ RGGLE R++PWGN P +R N+WQG+FP +TA DGY +P
Sbjct: 156 KGKRLPSEEEWEFAARGGLEQRMYPWGNKFQP---NRTNLWQGDFPRGDTAEDGYHGVSP 212
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
V ++ +N +G Y+++GN WEWTA PAP + +V +G S++
Sbjct: 213 VTAFPAQNSYGFYDLLGNTWEWTAS-----EFPAPGRASRAQKM---QVLRGASWIDTAD 264
Query: 314 YCYRHRC--AARSQNTPDSSAGNLGFRCAADKGPT 346
HR R NTPDS++ NL FRCAAD P
Sbjct: 265 GSANHRACVTTRMGNTPDSASDNLSFRCAADVPPV 299
>gi|443325559|ref|ZP_21054249.1| hypothetical protein Xen7305DRAFT_00039650 [Xenococcus sp. PCC
7305]
gi|442794840|gb|ELS04237.1| hypothetical protein Xen7305DRAFT_00039650 [Xenococcus sp. PCC
7305]
Length = 324
Score = 221 bits (562), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 133/340 (39%), Positives = 180/340 (52%), Gaps = 56/340 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++MVL+PG TF MG++ P +++ V +D+F +D HEV+N +F +FV TGY+T
Sbjct: 30 ENMVLIPGGTFHMGSDHPEFVEENAI--AKVKIDSFCIDTHEVTNAEFAKFVQKTGYLTI 87
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ F P L E R S V FE ++ +E +AY
Sbjct: 88 AERPLSVQEF-PDLPESARKPGSMV------FEPPEAGLEQ--------------IAYMG 126
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W G +W HP G +S I+ + NHPVVH+++ DA+AY
Sbjct: 127 WW--------HWVTGA------------NWQHPYGPESNIQGKENHPVVHIAYFDALAYA 166
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W +LPTEA+WEY RGGLE + + WG+ + AN WQG FP NT ADGYL T
Sbjct: 167 KWSNKQLPTEAQWEYAARGGLEGKEYTWGDEYSAT---NANTWQGLFPFFNTKADGYLGT 223
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDK---------V 302
APV S+ N +GL +M GNVWEWT D++ V P A NP+GP G D V
Sbjct: 224 APVGSFPPNGYGLSDMAGNVWEWTQDYYQVLRKPMAYKSNPQGPKIGYDPKKPFERNSHV 283
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGS+LC YC R+R AAR +PD+ ++GFR +
Sbjct: 284 IKGGSHLCAPNYCSRYRPAARESQSPDTGTTHIGFRLVKN 323
>gi|375088980|ref|ZP_09735316.1| hypothetical protein HMPREF9703_01398 [Dolosigranulum pigrum ATCC
51524]
gi|374560781|gb|EHR32134.1| hypothetical protein HMPREF9703_01398 [Dolosigranulum pigrum ATCC
51524]
Length = 290
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 130/329 (39%), Positives = 178/329 (54%), Gaps = 59/329 (17%)
Query: 18 LPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
+P +F+MGT + D E S +V L++F + V+N +F+ F++AT Y+T+AEKF
Sbjct: 8 IPSGSFKMGTICQEGFATDLETGSVDVALESFQIKTTTVTNREFKAFIAATDYITDAEKF 67
Query: 77 GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
G + VF L+ EE+R LDS M
Sbjct: 68 GWSSVFHLLIPEEDR---------------LDSLAVQGM--------------------- 91
Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
EW +G W PEG S+I+ R++HPVVHVS NDA+AYC W G
Sbjct: 92 ------EWWLIIKGA---------CWHAPEGPGSSIDGRLDHPVVHVSRNDALAYCRWEG 136
Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
+RLPTEA+WEY RGGLE + +PWG+NL G++ N+WQG+FP +NT DGYL TAP
Sbjct: 137 SRLPTEAQWEYAARGGLEMKRYPWGDNLLIDGKYPLNIWQGQFPISNTEEDGYLGTAPAN 196
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNV------HHHPAPSYNPKGPTT-GTDKVKKGGSYL 309
+Y N +GLY MVGNVWEW + + H +N ++ G +GGS+L
Sbjct: 197 NYTPNGYGLYQMVGNVWEWCLNPGKIPLNQFQKHTSEDFFNYFNRSSHGKLFAIRGGSFL 256
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
C++ YC R+R AR+ T DSSA N+GFR
Sbjct: 257 CHDSYCNRYRVGARNSVTDDSSASNIGFR 285
>gi|126659223|ref|ZP_01730361.1| hypothetical protein CY0110_04588 [Cyanothece sp. CCY0110]
gi|126619528|gb|EAZ90259.1| hypothetical protein CY0110_04588 [Cyanothece sp. CCY0110]
Length = 325
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 134/338 (39%), Positives = 182/338 (53%), Gaps = 56/338 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TFRMG++ I+ E P +VT+D+F +D +E++N+QF +FV TGY T AE
Sbjct: 33 MTYIPGGTFRMGSDNQDFIE--EKPVDDVTVDSFCIDTYEITNSQFAKFVKETGYQTIAE 90
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ F P L +E+R S V F+ + IE VAY +W
Sbjct: 91 RPLSKEQF-PNLPDEQRKPGSLV------FQPPNEGIER--------------VAYLSWW 129
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W G +W HP G +S I + N+PVVH+S++DA+AY W
Sbjct: 130 --------HWVTGA------------NWQHPYGEESDILSKENYPVVHISYDDALAYAKW 169
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+LPTEA+WEY RGGL+ + + WG+ + + AN WQG FP NT DGYL AP
Sbjct: 170 ADKQLPTEAQWEYAARGGLQGKDYTWGDQYSAK---NANTWQGIFPFFNTKEDGYLGAAP 226
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVK---------K 304
V S+ N +GLY+M GNVWEWT DW+ V + A NPKGP G D K K
Sbjct: 227 VGSFPANGYGLYDMSGNVWEWTQDWYQVERENMAHKTNPKGPKIGYDPKKPQDKLVHVIK 286
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGS+LC + YC R+R AAR +PD+ ++GFR +
Sbjct: 287 GGSHLCAKNYCSRYRPAARESQSPDTGTTHIGFRLVKN 324
>gi|380300787|ref|ZP_09850480.1| hypothetical protein BsquM_01775 [Brachybacterium squillarum M-6-3]
Length = 342
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 139/352 (39%), Positives = 179/352 (50%), Gaps = 66/352 (18%)
Query: 6 APPVE----RYKDMVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQF 60
APP + R + +PG F MG + DGE P V + A +D V+N F
Sbjct: 39 APPTDTAERRRIEQAHVPGGEFTMGDATGHRNRGDGETPLHRVAISALDVDVTTVTNADF 98
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
FV TGY TEAE+FG + VF L+ EE + VR P
Sbjct: 99 ARFVEDTGYRTEAERFGFSAVFHLALAAEE----ADVRG------------------PAA 136
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEY-GCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHP 179
W WRG R A+W + GCR S ++ +HP
Sbjct: 137 GTPW--------WRGVR---GADWRHSGCR-------------------RSDLDGLDDHP 166
Query: 180 VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEF 239
VVH+SW+DA +YC+W G RLPTEAE EY RGG+E FPWG + +R +VWQGEF
Sbjct: 167 VVHMSWHDAQSYCSWAGRRLPTEAEGEYTARGGIEGATFPWGEEPIDQPPYRVDVWQGEF 226
Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH--------PAPSYN 291
P +T ADGYL+TAPV S+ N +GL+ VGNVWEW D ++ + +
Sbjct: 227 PRRDTGADGYLTTAPVRSFATNGYGLWQSVGNVWEWCEDGFHPKSYRRGTTAAARTTITD 286
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
P GP G +V +GGSYLC+ YC R+R AARS+NTPDSS GN GFR + +
Sbjct: 287 PTGPEGGGTRVLRGGSYLCHASYCNRYRNAARSRNTPDSSMGNAGFRTVSPR 338
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 29/48 (60%), Positives = 35/48 (72%)
Query: 344 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
GP G +V +GGSYLC+ YC R+R AARS+NTPDSS GN GFR +
Sbjct: 289 GPEGGGTRVLRGGSYLCHASYCNRYRNAARSRNTPDSSMGNAGFRTVS 336
>gi|344201754|ref|YP_004786897.1| sulfatase-modifying factor protein [Muricauda ruestringensis DSM
13258]
gi|343953676|gb|AEM69475.1| Sulphatase-modifying factor protein [Muricauda ruestringensis DSM
13258]
Length = 366
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 180/348 (51%), Gaps = 48/348 (13%)
Query: 8 PVERYKDMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
P E K MV +PG F G + E P+ V +D F++D HEV+N QF +F+
Sbjct: 46 PSEAPKGMVWIPGGEFNQGAVPNDNMAMSHEKPAHKVYVDGFFMDIHEVTNAQFAKFIEE 105
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TGYVT AE+ D + L E +Q HD G S + + V ++
Sbjct: 106 TGYVTMAEREIDWEEMKKQLPEG-----TQKPHDSILQPG--SLVFKKTKSSVPNL---- 154
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
Y W EW+ G SW HP G DS I+ + +HPVVH+++
Sbjct: 155 -YDYSQWW--------EWKIGA------------SWKHPNGPDSNIDGKEDHPVVHIAFE 193
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA AYC W G RLPTEAEWE+ R + F WG+++ H AN W+GEFP NT
Sbjct: 194 DAQAYCKWAGRRLPTEAEWEFAARAQKTDSKFFWGDDIDQLSSH-ANSWEGEFPVKNTMQ 252
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA------PSYNPKGPTT--- 297
DG+ AP+MSY +N FGLY+M GNVWEWT+DW+N +++ + NPKG +
Sbjct: 253 DGFERRAPIMSYPKNDFGLYDMAGNVWEWTSDWYNTNYYQKLASNGEVAQNPKGAASAFN 312
Query: 298 -----GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
++V KGGS+LC+ YC +R +AR +PDS +LGFR
Sbjct: 313 SNNPYAKERVIKGGSFLCSASYCASYRISARMATSPDSGMEHLGFRTV 360
>gi|134099642|ref|YP_001105303.1| hypothetical protein SACE_3100 [Saccharopolyspora erythraea NRRL
2338]
gi|291005452|ref|ZP_06563425.1| hypothetical protein SeryN2_13112 [Saccharopolyspora erythraea NRRL
2338]
gi|133912265|emb|CAM02378.1| protein of unknown function DUF323 [Saccharopolyspora erythraea
NRRL 2338]
Length = 282
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/336 (40%), Positives = 165/336 (49%), Gaps = 66/336 (19%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKP-ILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
P R D+VLL G F MG+ +DGE P R V L+AF + + VSN +
Sbjct: 11 PPRRAGDLVLLDGGEFLMGSEDAHAYPEDGEGPVRAVRLEAFRISRTAVSNVE------- 63
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
FG+ +E ER S V F GL
Sbjct: 64 ---------FGEFVAATGYRTEAERWGWSFV------FGGL------------------- 89
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
LP + G R W PEG S ++ R +HPVVHVSW+
Sbjct: 90 -----------LPDDFPPTRGVAATPWWRQVEGADWRRPEGPGSDVDQRGDHPVVHVSWS 138
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA AYC W G RLPTEAEWEY RGGL R FPWG+ L P G H NVWQGEFP NT A
Sbjct: 139 DARAYCAWAGLRLPTEAEWEYAARGGLAGRAFPWGDELEPGGRHLMNVWQGEFPRRNTFA 198
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
DG+L T PV ++ N FGL+N GNVWEW ADW++ P T + +GG
Sbjct: 199 DGWLGTCPVDAFPPNGFGLHNTTGNVWEWCADWFD-------------PGDRTRRSARGG 245
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
SYLC++ YC R+R +AR TPDS+ GN GFRCA D
Sbjct: 246 SYLCHDSYCRRYRVSARQGLTPDSTTGNTGFRCALD 281
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/64 (46%), Positives = 40/64 (62%)
Query: 330 SSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 389
++ GN+ CA P T + +GGSYLC++ YC R+R +AR TPDS+ GN GFRC
Sbjct: 219 NTTGNVWEWCADWFDPGDRTRRSARGGSYLCHDSYCRRYRVSARQGLTPDSTTGNTGFRC 278
Query: 390 AADV 393
A D
Sbjct: 279 ALDA 282
>gi|402863386|ref|XP_003895998.1| PREDICTED: sulfatase-modifying factor 2 isoform 1 [Papio anubis]
Length = 320
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 173/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 49 MVQLPGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---------------------PMKPVLW--------W- 137
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 138 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 177
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG FP + A DG+ +P
Sbjct: 178 RGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGRFPKGDKAEDGFHGVSP 234
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 235 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 281
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 282 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRLPG 318
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 269 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 313
>gi|427717852|ref|YP_007065846.1| sulfatase-modifying factor protein [Calothrix sp. PCC 7507]
gi|427350288|gb|AFY33012.1| Sulphatase-modifying factor protein [Calothrix sp. PCC 7507]
Length = 357
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 131/340 (38%), Positives = 180/340 (52%), Gaps = 56/340 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ MVL+PG F+MG++ ++ E +VT+ F +DQ+EV+N QF FV ATGYVT
Sbjct: 47 QGMVLIPGGKFQMGSDNSGFVE--ERSPGDVTVTNFCIDQYEVTNPQFAAFVKATGYVTV 104
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ PL E+ D+ + L ++ M P V V++ +
Sbjct: 105 AER--------PLSPEQ--------FPDLPDEQRLPGSLVFEMAKPGVK-----EVSFLS 143
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W G +W HP G +S I ++ N+PVVH+++ DA+AY
Sbjct: 144 WW--------HWTTGA------------NWQHPFGKESAIANKSNYPVVHIAYEDALAYA 183
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEA+WEY RGGL+ + + WG+ + E +AN WQG FP NT ADGY
Sbjct: 184 KWSGKSLPTEAQWEYAARGGLDGKTYAWGDQYS---EKKANTWQGIFPFFNTKADGYTEI 240
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-------HPAP---SYNPKGPTTGTDKV 302
APV S+ N +GLY+M GNVWEWTAD + V H +P S++PK P V
Sbjct: 241 APVGSFAPNGYGLYDMTGNVWEWTADLFQVGHDYKSHQINPIALNQSFDPKKPNESALHV 300
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGSYLC YC R R AAR +PD+ ++GFR +
Sbjct: 301 IKGGSYLCAPNYCSRFRPAARESESPDTGTTHIGFRLVKN 340
>gi|223936270|ref|ZP_03628183.1| protein of unknown function DUF323 [bacterium Ellin514]
gi|223895132|gb|EEF61580.1| protein of unknown function DUF323 [bacterium Ellin514]
Length = 368
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 126/335 (37%), Positives = 178/335 (53%), Gaps = 53/335 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M + G TF MG+ + + E P VT+ F++D+ EVSN QF++FV +TGY+T AE
Sbjct: 75 MAWIQGGTFYMGSEE---GRPDEKPVHQVTVHGFWMDKTEVSNEQFEKFVQSTGYITMAE 131
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ +P + A S + F P + DA A+
Sbjct: 132 R-------KPNPKDYPDADPSLLVPGSVVFS-----------PPSESVPLTDASAWW--- 170
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
R P W HPEG +STI+ R HPVVHV+W DA+AY W
Sbjct: 171 --------------------RYVPGADWRHPEGPNSTIKGREKHPVVHVAWEDAMAYAKW 210
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY RGGL+ + + WG + G +AN+W+G FP +T ADG+ TAP
Sbjct: 211 AGKRLPTEAEWEYAARGGLDRQPYVWGKEMNTNGNWQANIWEGHFPNEDTGADGFKGTAP 270
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTD--------KVKKG 305
V ++K N +GL++M GNVWEW +DW+ ++ + NPKGP D +V++G
Sbjct: 271 VATFKPNGYGLHDMAGNVWEWCSDWYRPDYYAQSVGKNPKGPEDSFDPEEPGAKKRVQRG 330
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GSYLCN+ YC ++ ++R + TPD+ + GFRC
Sbjct: 331 GSYLCNDVYCSGYKPSSRMKCTPDTGLSHSGFRCV 365
>gi|75906337|ref|YP_320633.1| hypothetical protein Ava_0112 [Anabaena variabilis ATCC 29413]
gi|75700062|gb|ABA19738.1| Protein of unknown function DUF323 [Anabaena variabilis ATCC 29413]
Length = 331
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 136/348 (39%), Positives = 181/348 (52%), Gaps = 64/348 (18%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P PP +MV++PG TF MG++ I+ E ++ V + +F +D++EV+N+QF EFV
Sbjct: 15 PCPP-----EMVMIPGGTFTMGSDNSGYIE--ELSAQEVRVSSFCIDKYEVTNSQFAEFV 67
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
ATGYVT AE+ LS+E+ + D +R G V I
Sbjct: 68 KATGYVTVAER---------PLSKEQFPDLP----DEQRLPG----------SLVFAIVQ 104
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
+A W W+ G +W HP G +S I H+ N+PVVH++
Sbjct: 105 PEAKVLNWWH---------WQTGA------------NWRHPFGKESAIVHQDNYPVVHIA 143
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
+ DAVAY W G LPTEA+WEY RGGL+ + WG+ + E +AN WQG FP NT
Sbjct: 144 YEDAVAYAQWAGKSLPTEAQWEYAARGGLDAATYAWGDQYS---EKKANTWQGIFPLFNT 200
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDK-- 301
DGY S APV S+ N +GLY+M GNVWE T+DW+ H+H S NP GP D
Sbjct: 201 KKDGYASIAPVGSFPPNGYGLYDMTGNVWELTSDWFMPGHNHKTHSVNPTGPEQSFDPNK 260
Query: 302 -------VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V KGGSYLC YC R R AAR PD+ ++GFR +
Sbjct: 261 PSEQALHVIKGGSYLCAPNYCSRFRPAARESQAPDTGTTHIGFRLVKN 308
>gi|338712716|ref|XP_001493384.3| PREDICTED: sulfatase-modifying factor 2-like [Equus caballus]
Length = 326
Score = 219 bits (557), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 133/333 (39%), Positives = 174/333 (52%), Gaps = 70/333 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F+MGTN P +DGE P R VT+ F +D V+N F+EFV Y TEAE
Sbjct: 55 MVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEAE 113
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K++Q + + W
Sbjct: 114 TFGWSFVFEDFVSDELRNKVTQQMESV--------------------LWW---------- 143
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E R F W P G S I R+ PVVHVSWNDA AYC W
Sbjct: 144 ---LPVE-------------RAF----WRQPAGPGSGIRERLELPVVHVSWNDARAYCAW 183
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 184 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 240
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY++VGNVWEWTA +P +V +G S++ +
Sbjct: 241 VNAFPPQNNYGLYDLVGNVWEWTA-------------SPYQAADQDMRVLRGASWIDTAD 287
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+R R R NTPDS++ NLGFRCA+D G
Sbjct: 288 GSANHRARVTTRMGNTPDSASDNLGFRCASDTG 320
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 19/45 (42%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCA+D
Sbjct: 275 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCASDT 319
>gi|354553265|ref|ZP_08972572.1| Sulphatase-modifying factor protein [Cyanothece sp. ATCC 51472]
gi|353555095|gb|EHC24484.1| Sulphatase-modifying factor protein [Cyanothece sp. ATCC 51472]
Length = 325
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 181/338 (53%), Gaps = 56/338 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TFRMG+ ++ E P +VT+D+F +D +E++N QF +FV TGY T AE
Sbjct: 33 MTYIPGGTFRMGSEHSDFVE--EKPVDDVTVDSFCIDTYEITNGQFAQFVKETGYKTVAE 90
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ F P L +E+R S V F+ + I+ VAY +W
Sbjct: 91 RPLSKEQF-PNLPDEQRKPGSLV------FQPPEEGIQR--------------VAYLSWW 129
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W G +W HP G +S I + N+PV+H+S+ DA+AY W
Sbjct: 130 --------HWVTGA------------NWKHPFGEESDIIGKDNYPVIHISYEDALAYAQW 169
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G +LPTEA+WEY RGGL+ + + WG+ + + AN WQG FP NT ADGY+ AP
Sbjct: 170 DGKQLPTEAQWEYAARGGLQEKEYTWGDQYSAK---NANTWQGIFPFFNTKADGYVGAAP 226
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVK---------K 304
V S+ N +GLY+M GNVWEWT DW+ V + A NPKGP G D K K
Sbjct: 227 VGSFPPNGYGLYDMSGNVWEWTQDWYQVERDNMAHKTNPKGPERGYDPKKPQDQLVHVIK 286
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGS+LC + YC R+R AAR +PD+ ++GFR +
Sbjct: 287 GGSHLCAKNYCSRYRPAARESQSPDTGTTHIGFRLVKN 324
>gi|172036481|ref|YP_001802982.1| hypothetical protein cce_1566 [Cyanothece sp. ATCC 51142]
gi|171697935|gb|ACB50916.1| unknown [Cyanothece sp. ATCC 51142]
Length = 328
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 181/338 (53%), Gaps = 56/338 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TFRMG+ ++ E P +VT+D+F +D +E++N QF +FV TGY T AE
Sbjct: 36 MTYIPGGTFRMGSEHSDFVE--EKPVDDVTVDSFCIDTYEITNGQFAQFVKETGYKTVAE 93
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ F P L +E+R S V F+ + I+ VAY +W
Sbjct: 94 RPLSKEQF-PNLPDEQRKPGSLV------FQPPEEGIQR--------------VAYLSWW 132
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W G +W HP G +S I + N+PV+H+S+ DA+AY W
Sbjct: 133 --------HWVTGA------------NWKHPFGEESDIIGKDNYPVIHISYEDALAYAQW 172
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G +LPTEA+WEY RGGL+ + + WG+ + + AN WQG FP NT ADGY+ AP
Sbjct: 173 DGKQLPTEAQWEYAARGGLQEKEYTWGDQYSAK---NANTWQGIFPFFNTKADGYVGAAP 229
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVK---------K 304
V S+ N +GLY+M GNVWEWT DW+ V + A NPKGP G D K K
Sbjct: 230 VGSFPPNGYGLYDMSGNVWEWTQDWYQVERDNMAHKTNPKGPERGYDPKKPQDQLVHVIK 289
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGS+LC + YC R+R AAR +PD+ ++GFR +
Sbjct: 290 GGSHLCAKNYCSRYRPAARESQSPDTGTTHIGFRLVKN 327
>gi|326797828|ref|YP_004315647.1| sulfatase-modifying factor protein [Sphingobacterium sp. 21]
gi|326548592|gb|ADZ76977.1| Sulphatase-modifying factor protein [Sphingobacterium sp. 21]
Length = 364
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 181/341 (53%), Gaps = 56/341 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G F+MG+++ D + P V + F++D+HEV+N QF +FV ATGY+T AE
Sbjct: 70 MVFIKGGVFQMGSSE---FADTQ-PIHEVKVHDFWMDEHEVTNAQFAKFVEATGYITVAE 125
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + F P +S + S V K L++ H+ W
Sbjct: 126 RALNPADF-PGVSRDLLVPGSAVFTPPKEQVNLNN-----------HLQW---------- 163
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W+Y P SW P+G S I + N PVV VS+ DA A+ W
Sbjct: 164 ---------WKY----------VPGASWRQPKGPGSNINGKENEPVVQVSYEDAEAFAKW 204
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY R G + + WG+ L P+G+ AN +QG FP +N A DG+ AP
Sbjct: 205 AGKRLPTEAEWEYAARAGQVGKKYYWGDELKPQGKWVANTYQGNFPVSNAAEDGFAELAP 264
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWW--NVHHHPAPSYNPKGPTTGTD--------KVKK 304
V S+ N +GLY+M GNVWEW AD++ + + H A NPKGP+ D +V++
Sbjct: 265 VKSFPANAYGLYDMDGNVWEWCADFYRPDAYEH-AEKDNPKGPSDSYDPAEPGLVKRVQR 323
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
GGS+LCN+QYC R++ +R + S++ NLGFRC DK P
Sbjct: 324 GGSFLCNDQYCERYKAGSRGKGEVSSASNNLGFRCVQDKAP 364
>gi|414172785|ref|ZP_11427696.1| hypothetical protein HMPREF9695_01342 [Afipia broomeae ATCC 49717]
gi|410894460|gb|EKS42250.1| hypothetical protein HMPREF9695_01342 [Afipia broomeae ATCC 49717]
Length = 336
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 132/351 (37%), Positives = 177/351 (50%), Gaps = 64/351 (18%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E+ DMV +PG TFRMG++ E PS VT+D F++D+ V+N QF++FV ATG+
Sbjct: 17 EQTADMVWIPGGTFRMGSDH---HYPEEAPSHRVTVDGFWMDKTPVTNRQFKQFVKATGH 73
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT AE + D K + G + H+ + ++
Sbjct: 74 VTLAE----------------------IPPDPKDYPGA-----------LPHMLYAGSLV 100
Query: 130 YCTWRG-ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
+ R L + W +G W HP G + I NHPVVHVS++DA
Sbjct: 101 FSPPRAPVDLRDWSRWWSFMKGA---------DWRHPYGPKTNINVHDNHPVVHVSYSDA 151
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
+AY W G LPTEAEWE+ RGGL+ F WG TP G H AN WQGEFP N DG
Sbjct: 152 LAYAKWAGKDLPTEAEWEFAARGGLDGAEFAWGEEFTPGGVHMANTWQGEFPLQNLNQDG 211
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD----- 300
Y T+PV ++ N +G+++M+GNVWEWT+DWW+ H P P+ P G +
Sbjct: 212 YERTSPVRAFPANGYGIHDMIGNVWEWTSDWWSAGHEADAPKACCIPENPRGGPESASYD 271
Query: 301 ----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 272 SCQPSIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRCVV 322
>gi|332710257|ref|ZP_08430206.1| hypothetical protein LYNGBM3L_49460 [Moorea producens 3L]
gi|332350948|gb|EGJ30539.1| hypothetical protein LYNGBM3L_49460 [Moorea producens 3L]
Length = 347
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 139/345 (40%), Positives = 178/345 (51%), Gaps = 71/345 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSR----NVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MVL+PG TF +G+ D +PS +VT+D+F +D+HEV+N +Q FV ATGYV
Sbjct: 47 MVLIPGGTFMIGS-------DTYYPSERSAVDVTVDSFCIDRHEVTNADYQAFVQATGYV 99
Query: 71 TEAEK--FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
T AE+ D F P L+E +R S V F+ I+ V
Sbjct: 100 TVAERPLAKDQF---PGLTEAQRQPGSLV------FQPPAPGIQQ--------------V 136
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
AY +W +W G +W HP G DSTIE +PVVH+++ DA
Sbjct: 137 AYLSWW--------DWVVGA------------NWQHPYGPDSTIEGFDKYPVVHIAYEDA 176
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
VAY W G LPTEA+WEY RGGL+ G P H+AN WQG FP NT ADG
Sbjct: 177 VAYADWVGKTLPTEAQWEYAARGGLDTDTSIPGEQYAP---HKANTWQGFFPFFNTKADG 233
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH------------PAPSYNPKGPT 296
YL APV S+ N +GLY+M+GNVWE TADW+ V H S++P+ P
Sbjct: 234 YLGPAPVESFSPNGYGLYDMLGNVWELTADWYRVGHRGKDRSLNPLGPDQGSSFDPEKPD 293
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
G V KGGS+LC YC R+R AR PD+ ++GFR A
Sbjct: 294 QGALHVIKGGSFLCARNYCSRYRPEARESQAPDTGTNHVGFRLVA 338
>gi|384218970|ref|YP_005610136.1| hypothetical protein BJ6T_52880 [Bradyrhizobium japonicum USDA 6]
gi|354957869|dbj|BAL10548.1| hypothetical protein BJ6T_52880 [Bradyrhizobium japonicum USDA 6]
Length = 336
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 134/356 (37%), Positives = 183/356 (51%), Gaps = 63/356 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV +PG TFRMG++ E PS V++D F++D+ V+N QF++FV+ATG+VTEA
Sbjct: 20 EMVYIPGGTFRMGSDHHY---PEEAPSHRVSVDGFWIDRTPVTNRQFKQFVNATGHVTEA 76
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ + + L E A S V + R L W
Sbjct: 77 QIVPNPKDYPGALKEMLYAG-SLVFSPLPRITDL-----------------------ADW 112
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
++W RG +W HP G S I+ +HPVVHVS++DA AY
Sbjct: 113 --------SQWWSFMRGA---------NWRHPYGPGSNIKGLDDHPVVHVSYSDAAAYAR 155
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGL+ F WG+ L P G+H AN WQG FP N DG+ T+
Sbjct: 156 WAGKELPTEAEWEFAARGGLDGEEFAWGDALMPGGKHMANTWQGNFPVQNLREDGFERTS 215
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTD---------- 300
PVM++ N +GL++M+GNVWEWT+DWW+ H P P+ P G +
Sbjct: 216 PVMAFPPNGYGLHDMIGNVWEWTSDWWSSKHTADAAKPCCIPRNPRGGREEASFDPCQPD 275
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDK 351
+V KGGS+LC YC R+R AAR D+S ++GFRC +GP +K
Sbjct: 276 IRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHVGFRCVV-RGPIDTREK 330
>gi|410984718|ref|XP_003998673.1| PREDICTED: sulfatase-modifying factor 2 [Felis catus]
Length = 300
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 176/338 (52%), Gaps = 72/338 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F+MGTN P +DGE P R +T+ F +D V+N F+EFV Y TEAE
Sbjct: 29 MVQLPGGRFQMGTNSPD-GRDGEGPVREMTVKPFAIDIFPVTNKDFREFVREKKYRTEAE 87
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFG +FVFE +S+E R K++ H + + W W
Sbjct: 88 KFGWSFVFEDFVSDELRNKVA---------------------HQMESVLW--------W- 117
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E R F W P G S I R+ PVVHVSWNDA AYC W
Sbjct: 118 ---LPVE-------------RAF----WRQPAGPGSGIRERLELPVVHVSWNDARAYCAW 157
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL + +PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 158 RGKRLPTEEEWEFAARGGLTGQAYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 214
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYL--C 310
V ++ +N +GLY+++GNVWEWTA PTT D +V +G S++
Sbjct: 215 VNAFPPQNNYGLYDLMGNVWEWTAS--------------PYPTTDQDMRVLRGASWIDTA 260
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCA+ G G
Sbjct: 261 DGSANHRARVTTRMGNTPDSASDNLGFRCASSAGRLPG 298
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 22/50 (44%), Positives = 31/50 (62%), Gaps = 3/50 (6%)
Query: 345 PTTGTD-KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
PTT D +V +G S++ + +R R R NTPDS++ NLGFRCA+
Sbjct: 242 PTTDQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAS 291
>gi|375010720|ref|YP_004987708.1| hypothetical protein [Owenweeksia hongkongensis DSM 17368]
gi|359346644|gb|AEV31063.1| hypothetical protein Oweho_0039 [Owenweeksia hongkongensis DSM
17368]
Length = 363
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 131/352 (37%), Positives = 183/352 (51%), Gaps = 56/352 (15%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+E+ + MV +P +R+G+ K E PS V + FY+D+ EV+N +F+EFV ATG
Sbjct: 50 LEKVEGMVWIPSGVYRVGSEDKHANKS-EGPSHEVKVSGFYMDETEVTNDEFREFVDATG 108
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKI---SQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
Y+T AE+ P+ EE + ++ + HD G + P +
Sbjct: 109 YLTVAER--------PVDWEEIKQQLPPGTPKPHDSILQPG------SMIFQPGPTSNLY 154
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
D + W+ W HP G +S I+ + ++PVVH+++
Sbjct: 155 DISQWWAWKIG-----------------------ADWQHPHGPNSDIKKKGDYPVVHIAF 191
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
DA AY W G RLPTEAEWE RGG + F WG+ LTP GE+ AN +QG FP NTA
Sbjct: 192 EDAQAYAKWAGKRLPTEAEWEIAARGGEPAKPFAWGDKLTPMGEYLANFFQGTFPDGNTA 251
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-------YNPKGPTTG 298
DGY ++APV SY N +GLY+M+GNVWEWT+DW+ H NP GP+T
Sbjct: 252 LDGYGNSAPVKSYAPNGYGLYDMIGNVWEWTSDWYRPDTHVQNKQAGLKGCINPNGPSTS 311
Query: 299 TD--------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
D +V KGGS+LC+E+YC +R +AR DS +LGFRC +
Sbjct: 312 YDPQEPLVPKRVTKGGSFLCSEEYCSNYRPSARMATAYDSGQEHLGFRCVKN 363
>gi|373956332|ref|ZP_09616292.1| Sulphatase-modifying factor protein [Mucilaginibacter paludis DSM
18603]
gi|373892932|gb|EHQ28829.1| Sulphatase-modifying factor protein [Mucilaginibacter paludis DSM
18603]
Length = 392
Score = 218 bits (555), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 136/384 (35%), Positives = 179/384 (46%), Gaps = 81/384 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+ TF MG + D E+P VT+++F++D EV+N QF +FV ATGY+T AE
Sbjct: 77 MVLIKSGTFMMGGDNK-QAADDEYPKHKVTVNSFWIDATEVTNAQFAKFVKATGYLTTAE 135
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D E K + K E L P +S D + W+
Sbjct: 136 RKPDW----------EELKKQLPPNTPKPDESLLVPASLIFKQPGKPVSLTDYSQWWVWQ 185
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W HP G S I+ + N+PVVHVSW DA AYC W
Sbjct: 186 KG-----------------------ADWKHPHGPSSNIKGKDNYPVVHVSWYDAQAYCKW 222
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE+ RGGL N ++PWGN G +AN WQG FP NT D + AP
Sbjct: 223 AGKRLPTEAEWEWAARGGLSNNIYPWGNEPVDAGLPKANTWQGNFPNTNTVRDHFFYLAP 282
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-----NPKGPTTGTDKVKKGGSYL 309
V S+K N +GLY+M GNVWEW AD++N ++ + NP+GP SY
Sbjct: 283 VASFKPNGYGLYDMAGNVWEWCADYYNNTYYATINKANGVSNPRGP---------ANSYD 333
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHR 369
+E Y + +V +GGS+LCN+ YC +R
Sbjct: 334 PDEPYAKK---------------------------------RVIRGGSFLCNDTYCSGYR 360
Query: 370 CAARSQNTPDSSAGNLGFRCAADV 393
A R ++T DS +LGFRC D
Sbjct: 361 VARRMKSTEDSGMEHLGFRCVKDA 384
>gi|260061739|ref|YP_003194819.1| hypothetical protein RB2501_09060 [Robiginitalea biformata
HTCC2501]
gi|88785871|gb|EAR17040.1| hypothetical protein RB2501_09060 [Robiginitalea biformata
HTCC2501]
Length = 380
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 130/341 (38%), Positives = 175/341 (51%), Gaps = 47/341 (13%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P +F MG D E P R+V + F++D+ EV+N +F FV ATGY T AE
Sbjct: 72 MVYIPSGSFLMGGKSDQASPD-ELPRRSVQVAGFFMDRTEVTNREFAAFVDATGYETVAE 130
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K P+ E + ++ P + V + T R
Sbjct: 131 K--------PIDWEAMKTQLP----------------PGTPKPPDSLLQAGSLVFHATDR 166
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L ++W + P +W HPEG +S +E RM+HPVVH++ +DA+AY W
Sbjct: 167 PVDLRDLSQWWHWT---------PGANWKHPEGPESDLEGRMDHPVVHIALDDALAYARW 217
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE+ GGLE+ +PWGN +AN WQG FP +N ADGYL TAP
Sbjct: 218 AGKRLPTEAEWEWASMGGLEDPKYPWGNAPISESADKANFWQGIFPYSNEQADGYLRTAP 277
Query: 255 VMSYKENKFGLYNMVGNVWEW-----TADWWNVHHHPAPSYNPKGPTTGTDK-------- 301
V +Y N +GL +M GNVWEW +D + + P NP+GP T D
Sbjct: 278 VGTYPANGYGLVDMAGNVWEWCQDKYRSDTYQLETTPTDRMNPRGPDTSFDPSEPLAEKY 337
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V +GGSYLCN+ YC +R + R + T DS+ G+ GFRC D
Sbjct: 338 VIRGGSYLCNDSYCSGYRVSRRMRATRDSAFGHTGFRCVKD 378
>gi|301780842|ref|XP_002925845.1| PREDICTED: sulfatase-modifying factor 2-like [Ailuropoda
melanoleuca]
Length = 300
Score = 218 bits (554), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 133/338 (39%), Positives = 176/338 (52%), Gaps = 70/338 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV LPG F+MGTN P +DGE P R VT+ F +D V+N F+EFV Y TEA
Sbjct: 28 NMVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 86
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE +S+E R K++ H + + W W
Sbjct: 87 EMFGWSFVFEDFVSDELRNKVT---------------------HQMESVLW--------W 117
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I R+ PV+HVSWNDA AYC
Sbjct: 118 ----LPVE-------------RAF----WRQPAGPGSGIRERLELPVLHVSWNDARAYCA 156
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +L+PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 157 WRGKRLPTEEEWEFAARGGLKGQLYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVS 213
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
PV ++ +N +GLY+++GNVWEWTA +P +V +G S++
Sbjct: 214 PVNAFPPQNNYGLYDLMGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTA 260
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCA+ G G
Sbjct: 261 DGSANHRARVTTRMGNTPDSASDNLGFRCASSAGRLRG 298
>gi|383767542|ref|YP_005446524.1| putative sulfatase-modifying factor [Phycisphaera mikurensis NBRC
102666]
gi|381387811|dbj|BAM04627.1| putative sulfatase-modifying factor [Phycisphaera mikurensis NBRC
102666]
Length = 322
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 134/341 (39%), Positives = 175/341 (51%), Gaps = 46/341 (13%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P PP + MVL+PG TF MGT+ P + + E P+R V +DAF+LD+ V+N +F FV
Sbjct: 26 PGPP----RGMVLVPGGTFAMGTDDPRGMPN-ERPAREVAVDAFFLDRAPVTNAEFAAFV 80
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
ATGYVTEAE+ D V L + +M L T PV
Sbjct: 81 EATGYVTEAERPVDWEV----LKTQVPPGTPAPPPEMLLPGSLVFTPPPPGSGPV---DL 133
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
D + +W SW HP+G S ++ R +HPVV VS
Sbjct: 134 RDLSRWWSWTAG-----------------------ASWRHPDGPGSDLDGRGDHPVVQVS 170
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
++DA AY W G RLPTEAEWE+ RGG + R F WG P G AN + G FP NNT
Sbjct: 171 FHDAEAYAAWAGKRLPTEAEWEFAARGGSDTRFF-WGEQFRPGGRFMANTFTGSFPLNNT 229
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKK 304
AAD + +PV S+ N +GL +M GNVW+WT+ + Y G + +V K
Sbjct: 230 AADRFADRSPVGSFPANGYGLVDMAGNVWQWTSSF----------YGAGGRVDRSQRVIK 279
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
GGSYLC+ YC +R AR TPD+ +G++GFRCA D P
Sbjct: 280 GGSYLCHADYCESYRPPARRPLTPDTGSGHVGFRCAWDPPP 320
>gi|50758160|ref|XP_415788.1| PREDICTED: sulfatase-modifying factor 2 [Gallus gallus]
Length = 311
Score = 217 bits (553), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 175/334 (52%), Gaps = 70/334 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F+MG++ +D E P R VT+ F +D+ V+N F+EFV Y TEAE
Sbjct: 27 MVRLPGGRFQMGSSS-TQSRDEEGPIREVTVKPFAIDKFPVTNRDFREFVREKKYKTEAE 85
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +SEE + KI+Q L+S W W
Sbjct: 86 AFGWSFVFEDFVSEELKKKITQ---------KLESA------------PW--------W- 115
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R++HPV+HVSWNDA A+C W
Sbjct: 116 ---LPVEKAF-----------------WRQPAGPGSGIADRLDHPVLHVSWNDAQAFCRW 155
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G RLPTE EWE+ RGGLE RL+PWGN P +R N+WQG+FP +TA DGY +P
Sbjct: 156 KGKRLPTEEEWEFAARGGLEQRLYPWGNKFQP---NRTNLWQGDFPRGDTAEDGYHGVSP 212
Query: 255 VMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK---VKKGGSYLC 310
V ++ +N +GLY+++GN WEWTA Y P GP + V +G S++
Sbjct: 213 VAAFSPQNSYGLYDLLGNTWEWTAS----------QYTPPGPPRPRAEAMHVLRGASWID 262
Query: 311 NEQYCYRHRCA--ARSQNTPDSSAGNLGFRCAAD 342
HR + R NTPDS++ NL FRCAAD
Sbjct: 263 TVDGSANHRASITTRMGNTPDSASDNLSFRCAAD 296
>gi|344289891|ref|XP_003416674.1| PREDICTED: sulfatase-modifying factor 2-like [Loxodonta africana]
Length = 297
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 176/337 (52%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+V LPG F+MGT+ P +DGE P R VT+ F +D V+N F+EFV A Y TEAE
Sbjct: 26 VVQLPGGRFQMGTDSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVRANKYRTEAE 84
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K + H MK + W
Sbjct: 85 SFGWSFVFEDFVSDELRKKAT---HQMKSL-----------------LWW---------- 114
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ PV+HVSWNDA AYC W
Sbjct: 115 ---LPVEKAF-----------------WRQPAGPGSGIRERLELPVLHVSWNDARAYCAW 154
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 155 RGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDRAEDGFHGVSP 211
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P P + +V +G S++ +
Sbjct: 212 VNAFPPQNNYGLYDLMGNVWEWTA-------------SPYQPASQDMRVLRGASWIDTAD 258
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCA+ G G
Sbjct: 259 GSANHRARVTTRMGNTPDSASDNLGFRCASSAGRPRG 295
>gi|417409307|gb|JAA51164.1| Putative sulfatase-modifying factor 2, partial [Desmodus rotundus]
Length = 282
Score = 217 bits (552), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 177/341 (51%), Gaps = 76/341 (22%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV LPG F+MGTN +DGE P R VT+ F +D V+N F+EFV Y TEA
Sbjct: 10 NMVQLPGGRFQMGTNAQDG-RDGEGPVREVTVKPFSIDIFPVTNKDFREFVREKKYRTEA 68
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
EKFG +FVFE +S+E R K +Q PV+ W W
Sbjct: 69 EKFGWSFVFEDFVSDELRNKATQ------------------QMKPVL---W--------W 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I R+ PVVHVSWNDA AYC
Sbjct: 100 ----LPVE-------------RAF----WRQPAGPGSGIRERLELPVVHVSWNDARAYCA 138
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 139 WRGKRLPTEEEWEFAARGGLKGQVYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVS 195
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD---KVKKGGSYL 309
PV ++ +N +GLY++VGNVWEWTA P G D +V +G S++
Sbjct: 196 PVNAFPPQNNYGLYDLVGNVWEWTAS----------------PYQGADQDMRVLRGASWI 239
Query: 310 --CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCA+ G G
Sbjct: 240 DTSDGSANHRARVTTRMGNTPDSASDNLGFRCASSIGRLPG 280
>gi|209731560|gb|ACI66649.1| Sulfatase-modifying factor 2 precursor [Salmo salar]
Length = 302
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/335 (39%), Positives = 177/335 (52%), Gaps = 71/335 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG MGTN +DGE P++ VT+D F +D++ V+N+ F+EFV A Y TEAE
Sbjct: 34 MVQIPGGKMTMGTNSADG-RDGESPTKEVTVDPFRIDKYPVTNSDFREFVRAQKYKTEAE 92
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVF+ +SEE ++K+++ + W
Sbjct: 93 TFGWSFVFQDFVSEELKSKVTE------------------------------KIESAPWW 122
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+P E R+F W P G S I R++ PVV VSWNDA A+C+W
Sbjct: 123 ---MPVE-------------RVF----WRQPAGPGSGIRERLDSPVVQVSWNDAQAFCSW 162
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLP+E EWE+ RGGL+ R +PWGN + R+N+WQG FP +TA DGY AP
Sbjct: 163 RGRRLPSEEEWEWAARGGLQGRTYPWGNKFQSK---RSNLWQGLFPDGDTAEDGYHGIAP 219
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
V ++ +N FGLY+M+GN WEWT+ P P P V +GGS++ E
Sbjct: 220 VTAFPPQNSFGLYDMMGNAWEWTS-------TPFPGGRPM-------FVLRGGSWIDTED 265
Query: 314 YC--YRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+ R R NTPDS++ NLGFRCAA K T
Sbjct: 266 GSANRKARITTRMGNTPDSASDNLGFRCAARKDKT 300
>gi|284036352|ref|YP_003386282.1| hypothetical protein Slin_1433 [Spirosoma linguale DSM 74]
gi|283815645|gb|ADB37483.1| protein of unknown function DUF323 [Spirosoma linguale DSM 74]
Length = 358
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 125/339 (36%), Positives = 180/339 (53%), Gaps = 54/339 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMVL+PG TF MG ++ D P +VT+ F++D+HEV+N +F FV+ATGY T A
Sbjct: 65 DMVLIPGGTFLMGADE---FPDAR-PVHSVTVKPFWMDEHEVTNAEFARFVAATGYKTVA 120
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ PL + + G+ P + AV
Sbjct: 121 ER--------PL--------------NPADYPGV----------PAGDLVPGSAVFTPPA 148
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+ L +W +G SW HP G S+++ N PV+HVS+ DA+AY
Sbjct: 149 QKVSLANPLQWWEYVKGA---------SWNHPNGPKSSLKGHENEPVIHVSYEDAMAYAK 199
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ +GG NR + WG+ L P G+ AN++QG+FP+ NT DG+ A
Sbjct: 200 WAGKRLPTEAEWEFAAQGGKGNRTYYWGDELKPSGKWIANIYQGDFPSKNTREDGFAGVA 259
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDK--------VKK 304
PV S+ N +GLY+M GNVWEW D + ++ +P +P+GP D V++
Sbjct: 260 PVKSFPANPYGLYDMDGNVWEWCQDLYRPDYYMKSPKMDPQGPPDSYDPEEPGAVKHVQR 319
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
GGS+LC++QYC R++ +R + S + NLGFRC +K
Sbjct: 320 GGSFLCSDQYCVRYKAGSRGKGEVSSGSNNLGFRCVRNK 358
>gi|332265464|ref|XP_003281740.1| PREDICTED: sulfatase-modifying factor 2 isoform 1 [Nomascus
leucogenys]
Length = 302
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R VT+ F +D V+N F++FV Y TEAE
Sbjct: 31 MVQLQGGRFLMGTNSPD-SRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 89
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 90 MFGWSFVFEDFVSDELRNKATQ---------------------PMKPVLW--------W- 119
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 120 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 159
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLP E EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 160 RGKRLPMEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 216
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 217 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 263
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 264 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 300
Score = 41.2 bits (95), Expect = 0.80, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 251 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 295
>gi|13474559|ref|NP_106128.1| hypothetical protein mll5466 [Mesorhizobium loti MAFF303099]
gi|14025313|dbj|BAB51914.1| mll5466 [Mesorhizobium loti MAFF303099]
Length = 372
Score = 216 bits (550), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 134/344 (38%), Positives = 174/344 (50%), Gaps = 60/344 (17%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
Y MV +PG TF MG++ E P+ V +D F++D+ VSN F+ FV+ATG+VT
Sbjct: 62 YPGMVWIPGGTFLMGSDNHY---PEEAPAHRVRVDGFWMDKFTVSNRDFERFVAATGHVT 118
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AEK + + L + S + + PV D +
Sbjct: 119 LAEKPANPDDYPGALPD---------------LLAPSSMMFRKPAGPV------DLGNHY 157
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W W Y RG +W HP G STI+ +HPVVHV++ D VAY
Sbjct: 158 NW----------WVY-VRGA---------NWRHPRGPASTIKKVADHPVVHVAYEDVVAY 197
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWE+ RGGL+ + WGN LTP G+H AN+WQG+FP NT DGY
Sbjct: 198 ANWAGKELPTEAEWEFAARGGLDAAEYVWGNELTPAGKHMANIWQGDFPYRNTVDDGYEY 257
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH---------------HPAPSYNPKGPT 296
TAPV S+ N +GLY+M GNVW+WT DW+ H H SY+ + P
Sbjct: 258 TAPVGSFPANDYGLYDMAGNVWQWTTDWYQDHKAIDSPCCTAVNPRGGHREASYDTRLPD 317
Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D++ +LGFRC
Sbjct: 318 VKIPRKVTKGGSHLCAPNYCRRYRPAARMAQPVDTAISHLGFRC 361
>gi|194248088|ref|NP_056226.2| sulfatase-modifying factor 2 isoform b precursor [Homo sapiens]
gi|119628383|gb|EAX07978.1| sulfatase modifying factor 2, isoform CRA_d [Homo sapiens]
Length = 320
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 49 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 137
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 138 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 177
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 178 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 234
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 235 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 281
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 282 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 318
>gi|399027142|ref|ZP_10728733.1| hypothetical protein PMI10_00545 [Flavobacterium sp. CF136]
gi|398075378|gb|EJL66496.1| hypothetical protein PMI10_00545 [Flavobacterium sp. CF136]
Length = 377
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/359 (36%), Positives = 180/359 (50%), Gaps = 67/359 (18%)
Query: 7 PPVERY-KDMVLLPGDTFRMGTNKPI--------LIKDGEFPSRNVTLDAFYLDQHEVSN 57
PP+ + K MV +PG F MG+N + KD P V +D +Y+D+ EV+N
Sbjct: 60 PPINKTPKGMVWIPGGEFSMGSNVEDESLCSIKGVTKDAA-PIHRVYVDGYYMDKTEVTN 118
Query: 58 TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
QF+ FV ATGY+T AEK TIE +
Sbjct: 119 EQFEAFVKATGYITLAEK--------------------------------KPTIEEYPNV 146
Query: 118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
PV ++ AV T L +W + GG + W HPEG S+I+ +
Sbjct: 147 PVENLIAGSAVFTPTPAKVDLNNYMQW-WSYVGGAD--------WRHPEGPMSSIKGKEK 197
Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
+PVV VS+ DA AY W G RLPTEA+WE+ RGG L+ WGN L P+G+ +AN++QG
Sbjct: 198 YPVVQVSYEDAAAYAKWAGKRLPTEAQWEFAARGGKTGELYAWGNTLKPKGKFQANIYQG 257
Query: 238 EFPT--NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY----- 290
FP +T DGY+ AP + N +GLY++ GNVWEWT DW+ ++ S
Sbjct: 258 HFPMEKGDTGEDGYIGIAPTAKFAPNGYGLYDVGGNVWEWTNDWYTADYYSTLSQNGEIA 317
Query: 291 -NPKGPTTGTD--------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
NPKGP + D KV++GGS+LC +QYC R+ R + S A ++GFRC
Sbjct: 318 RNPKGPDSSYDPTEPGLPKKVQRGGSFLCTDQYCTRYMVGTRGKGDYQSPANHIGFRCV 376
>gi|426254649|ref|XP_004020989.1| PREDICTED: sulfatase-modifying factor 2 [Ovis aries]
Length = 301
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 175/338 (51%), Gaps = 70/338 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV LPG F+MGTN P +DGE P R VT+ F +D V+N F+EFV Y TEA
Sbjct: 29 NMVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 87
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE L+S+E R K +Q RM + W
Sbjct: 88 EVFGWSFVFEDLVSDELRNKATQ-----------------RMQ------------SLLWW 118
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I ++ PVVHVSWNDA AYC
Sbjct: 119 ----LPVE-------------RAF----WRQPAGPGSGIREKLEFPVVHVSWNDARAYCA 157
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 158 WRGKRLPTEEEWEFAARGGLKGQVYPWGNKFQP---NRTNLWQGKFPKGDKAEDGFHGVS 214
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
PV ++ +N +GLY+++GN+WEWTA + +V +G S++
Sbjct: 215 PVNAFPPQNDYGLYDLMGNIWEWTASQYQAADQDM-------------RVLRGASWIDTA 261
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCA+ G G
Sbjct: 262 DGSANHRARVTTRMGNTPDSASDNLGFRCASSAGRPPG 299
>gi|62460456|ref|NP_001014881.1| sulfatase-modifying factor 2 precursor [Bos taurus]
gi|73621962|sp|Q58CP2.1|SUMF2_BOVIN RecName: Full=Sulfatase-modifying factor 2; AltName:
Full=C-alpha-formylglycine-generating enzyme 2; Flags:
Precursor
gi|61555735|gb|AAX46752.1| sulfatase modifying factor 2 [Bos taurus]
gi|151556838|gb|AAI49260.1| Sulfatase modifying factor 2 [Bos taurus]
gi|296473261|tpg|DAA15376.1| TPA: sulfatase-modifying factor 2 precursor [Bos taurus]
Length = 301
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 133/338 (39%), Positives = 175/338 (51%), Gaps = 70/338 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV LPG F+MGT+ P +DGE P R VT+ F +D V+N F+EFV Y TEA
Sbjct: 29 NMVQLPGGRFQMGTDSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 87
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE L+S+E R K +Q RM + W
Sbjct: 88 EVFGWSFVFEDLVSDELRNKATQ-----------------RMQ------------SLLWW 118
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I ++ PVVHVSWNDA AYC
Sbjct: 119 ----LPVE-------------RAF----WRQPAGPGSGIREKLEFPVVHVSWNDARAYCA 157
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 158 WRGKRLPTEEEWEFAARGGLKGQVYPWGNKFQP---NRTNLWQGKFPKGDKAEDGFHGVS 214
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
PV ++ +N +GLY++VGNVWEWTA + +V +G S++
Sbjct: 215 PVNAFPPQNDYGLYDLVGNVWEWTASQYQAADQDM-------------RVLRGASWIDTA 261
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCA+ G G
Sbjct: 262 DGSANHRARVTTRMGNTPDSASDNLGFRCASGAGRPPG 299
>gi|255036936|ref|YP_003087557.1| hypothetical protein Dfer_3179 [Dyadobacter fermentans DSM 18053]
gi|254949692|gb|ACT94392.1| protein of unknown function DUF323 [Dyadobacter fermentans DSM
18053]
Length = 351
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 126/339 (37%), Positives = 176/339 (51%), Gaps = 54/339 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV L G F MG ++ D P VT++ FY+D HEV+N +F FV AT Y T A
Sbjct: 57 EMVWLSGGKFLMGADE---FPDSR-PMHEVTINGFYMDAHEVTNAEFAAFVKATNYKTVA 112
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ + + P + ++ S V LD+ ++ W
Sbjct: 113 ERPLNPADY-PGVPADKLVPGSAVFTPTPTRVSLDNPLQ--------------------W 151
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y P +W HPEG S+I+ R N+PVVHVS+ DA AY
Sbjct: 152 ----------WNY----------VPGANWAHPEGPSSSIKGRENYPVVHVSYEDAAAYAK 191
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ +GG N + WG L P G+ AN++QG FP NT DGYL+ A
Sbjct: 192 WAGKRLPTEAEWEFAAQGGKGNHTYYWGEELKPGGKWVANIYQGSFPDKNTKEDGYLTAA 251
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTD--------KVKK 304
P+ S+ N +GLY+M GNVWEW D + ++ +P NP+GP+ D +V++
Sbjct: 252 PIRSFPANPYGLYDMDGNVWEWCEDLYRPDYYQKSPGVNPRGPSDSYDPDEPGAVKRVQR 311
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
GGS+LC++ YC R++ +R + S + NLGFRC DK
Sbjct: 312 GGSFLCSDDYCIRYKAGSRGKGEVTSGSNNLGFRCVMDK 350
>gi|282897815|ref|ZP_06305812.1| Protein of unknown function DUF323 [Raphidiopsis brookii D9]
gi|281197299|gb|EFA72198.1| Protein of unknown function DUF323 [Raphidiopsis brookii D9]
Length = 341
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 126/343 (36%), Positives = 177/343 (51%), Gaps = 63/343 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ M+L+PG T +MGT+ + + E PS+NVT+D+F +D++EV+N +F +FV+ATGY+T
Sbjct: 47 QGMILIPGGTLKMGTDNSVFAE--ERPSKNVTVDSFCMDKYEVTNAEFAKFVTATGYITV 104
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE F P L+E++R+ S V +
Sbjct: 105 AELPLSKEQF-PDLTEQQRSPGSLV--------------------------------FKM 131
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
R +P + W + P +W HP G S I N+PVVH+++ DA+ +
Sbjct: 132 PRQGEIPFLSWWHW----------LPGANWQHPLGNKSEILD--NYPVVHIAYQDALVFA 179
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEA+WEY RGGLE+ + WGN + E +AN WQG FP NT DGY+
Sbjct: 180 DWLGKSLPTEAQWEYAARGGLEDATYTWGNKYS---EIKANTWQGTFPFLNTKVDGYVGL 236
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-------------PSYNPKGPTTGT 299
APV S+ N +GLY+M GNVWEWT DW+ V + S++P P
Sbjct: 237 APVGSFPANGYGLYDMAGNVWEWTVDWFTVEDKHSVNQVSEHEISKTRKSFDPNKPGETL 296
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V KGGSYLC YC R+R AAR PD+ ++G R +
Sbjct: 297 LHVIKGGSYLCAPNYCSRYRPAARESQAPDTGTNHIGLRLVKN 339
>gi|332664781|ref|YP_004447569.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332333595|gb|AEE50696.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 368
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/340 (38%), Positives = 177/340 (52%), Gaps = 54/340 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV +PG TF MG P ++D E V +D F++D EV+N QFQEFV+ATGY+T+A
Sbjct: 68 NMVWIPGGTFTMGGADPQALQD-EKHLHEVHIDGFWMDVSEVTNAQFQEFVTATGYLTQA 126
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ E +R + + +DS P V SW +
Sbjct: 127 ERN----------PEAQRMMMPGI--------AIDSL-------PKVPYSWCFQSPKNQY 161
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
RL E W P W HP+G S+I + NHPVVHVSW DA+AYC
Sbjct: 162 --GRLQPEDWWT----------AVPGADWQHPQGSGSSIVGKENHPVVHVSWYDAIAYCK 209
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RL TEAEWEY RGG + +PWG+ L + +AN WQG+FP+ DG++ TA
Sbjct: 210 WAGKRLATEAEWEYAARGGYDKNSYPWGDVLNFK---KANYWQGDFPSLFLNRDGFVKTA 266
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD-------- 300
PV + N +GLY+M GNVWEW DW++ ++ NP GP D
Sbjct: 267 PVKHFAPNPYGLYDMAGNVWEWCLDWYDFDYYVQGKRAKIVSNPLGPKISADPDAKDAPL 326
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KV +GGS+LC+E YC +R + R ++ P + + + GFRC
Sbjct: 327 KVIRGGSFLCSESYCSGYRVSTRMKSAPCTGSEHAGFRCV 366
>gi|296228504|ref|XP_002759839.1| PREDICTED: sulfatase-modifying factor 2 [Callithrix jacchus]
Length = 275
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 176/340 (51%), Gaps = 73/340 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ---EFVSATGYVT 71
MV LPG F MGTN P +DGE P R VT+ F +D V+N +F+ +F+ Y T
Sbjct: 1 MVQLPGGRFLMGTNSPD-SRDGEGPVREVTVKPFAIDIFPVTNKEFRYIRDFIREKKYRT 59
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAE FG +FVFE +S+E R K +Q P+ + W
Sbjct: 60 EAEMFGWSFVFEGFVSDELRNKATQ---------------------PMKPVLW------- 91
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W LP E + W P G S I R+ HPV+HVSWNDA AY
Sbjct: 92 -W----LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAY 129
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
C WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+
Sbjct: 130 CAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHG 186
Query: 252 TAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL- 309
+PV ++ +N +GLY+++GNVWEWTA +P +V +G S++
Sbjct: 187 VSPVNAFPAQNSYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWID 233
Query: 310 -CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCAAD G G
Sbjct: 234 TADGSANHRARVTTRMGNTPDSASDNLGFRCAADTGRPPG 273
Score = 41.2 bits (95), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 224 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADT 268
>gi|320105153|ref|YP_004180744.1| hypothetical protein Isop_3638 [Isosphaera pallida ATCC 43644]
gi|319752435|gb|ADV64195.1| protein of unknown function DUF323 [Isosphaera pallida ATCC 43644]
Length = 411
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 138/346 (39%), Positives = 176/346 (50%), Gaps = 57/346 (16%)
Query: 7 PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
PP + MV +PG F G D + P R V +D F++DQHEV+N Q+ FV+A
Sbjct: 107 PPQTTPEGMVWIPGGVFLQGDPD---FPDAQ-PRRWVGVDGFWIDQHEVTNRQYAAFVAA 162
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TGY T AE+ D F P + ++ S V LD+
Sbjct: 163 TGYQTVAERPLDPRDF-PGVPPDQLQPGSIVFTPPSGPVPLDNVTR-------------- 207
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
W W Y P W HPEG STIE R +HP VH+++
Sbjct: 208 ------W----------WIY----------TPGACWRHPEGPGSTIEGREDHPAVHLAYE 241
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTA 245
DA AY W G RLPTEAEWEY RGGL ++ F WG L P G AN+WQG FP N+
Sbjct: 242 DAEAYARWAGKRLPTEAEWEYAARGGLVSQPFVWGAELKPEGSPWLANIWQGHFPNENSQ 301
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-SYNPKGPT-------- 296
DG+ TAPV S+ N FGLY+M GNVWEW ADW++ + + NP GP
Sbjct: 302 EDGFARTAPVKSFPPNGFGLYDMAGNVWEWCADWYHPNAYQFDRRLNPSGPPRLASFDPM 361
Query: 297 -TGTDK-VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
G K V++GGS+LC +QYC R+R AR + P S+A ++GFRCA
Sbjct: 362 EPGVPKRVQRGGSFLCTDQYCTRYRPGARGKGEPGSAASHVGFRCA 407
>gi|171914379|ref|ZP_02929849.1| hypothetical protein VspiD_24410 [Verrucomicrobium spinosum DSM
4136]
Length = 290
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 136/333 (40%), Positives = 170/333 (51%), Gaps = 58/333 (17%)
Query: 25 MGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEP 84
MG N ++ E P + +D F++D+HEV+N QF+EFV ATGYVT +E+ F
Sbjct: 1 MGDN----VRREESPVHEIAVDGFWMDEHEVTNAQFEEFVKATGYVTTSERVPKVEDFP- 55
Query: 85 LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEW 144
E L I+ P + AVA L EW
Sbjct: 56 --------------------ERLRPMIKPDQLKPGAVVFTPPAVA-----PPNLDNFLEW 90
Query: 145 EYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAE 204
G W HPEG S+I+ RMNHPVV VSW DA AY W G RLPTEAE
Sbjct: 91 WRYIHGA---------DWRHPEGPSSSIKERMNHPVVCVSWKDAEAYAKWAGKRLPTEAE 141
Query: 205 WEYGCRGGL---ENRLFPWGNNLTPRGEHR---ANVWQGEFPTNNTAADGYLSTAPVMSY 258
WEY RGG+ + F WG N GE R AN W G FP + A DG+ TAPV S+
Sbjct: 142 WEYAARGGVGPTKRNTFTWGEN----GEQRVKMANTWTGSFPVEDRAVDGFKGTAPVKSF 197
Query: 259 KENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTD--------KVKKGGSYL 309
N+FGLY+M GNVWE+ ADW++ + +P NP GP+ G D KV KGGS+L
Sbjct: 198 PPNEFGLYDMAGNVWEFVADWFHPDYFRNSPKLNPPGPSVGYDPNEPDLPKKVIKGGSFL 257
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C+ YC +R +R N PDSSA + GFRC +
Sbjct: 258 CSPLYCTGYRPGSRMFNDPDSSADHTGFRCVKN 290
>gi|197102978|ref|NP_001125223.1| sulfatase-modifying factor 2 precursor [Pongo abelii]
gi|73621963|sp|Q5RCR5.1|SUMF2_PONAB RecName: Full=Sulfatase-modifying factor 2; AltName:
Full=C-alpha-formylglycine-generating enzyme 2; Flags:
Precursor
gi|22761650|dbj|BAC11643.1| unnamed protein product [Homo sapiens]
gi|32527653|gb|AAP86218.1| sulfatase modifying factor 2 [Homo sapiens]
gi|37183323|gb|AAQ89461.1| ARHG1968 [Homo sapiens]
gi|54261539|gb|AAH84539.1| Sulfatase modifying factor 2 [Homo sapiens]
gi|55727373|emb|CAH90442.1| hypothetical protein [Pongo abelii]
gi|109730303|gb|AAI11093.1| Sulfatase modifying factor 2 [Homo sapiens]
gi|119628384|gb|EAX07979.1| sulfatase modifying factor 2, isoform CRA_e [Homo sapiens]
Length = 301
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 30 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 89 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 118
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 119 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 158
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 159 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 215
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 216 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 262
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 263 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 299
Score = 41.2 bits (95), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 250 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 294
>gi|61680510|pdb|1Y4J|A Chain A, Crystal Structure Of The Paralogue Of The Human
Formylglycine Generating Enzyme
gi|61680511|pdb|1Y4J|B Chain B, Crystal Structure Of The Paralogue Of The Human
Formylglycine Generating Enzyme
Length = 284
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 4 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 62
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 63 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 92
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 93 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 132
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 133 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 189
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 190 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 236
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 237 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 273
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 224 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 268
>gi|424875872|ref|ZP_18299531.1| hypothetical protein Rleg5DRAFT_0212 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393163475|gb|EJC63528.1| hypothetical protein Rleg5DRAFT_0212 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 315
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 126/326 (38%), Positives = 172/326 (52%), Gaps = 54/326 (16%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+ G +GT++P+L +DGE P R+V ++AF L V+N +F FV ATGYVTEAE+ G
Sbjct: 37 IEGGRVSVGTSRPLLPQDGEGPVRSVRINAFELSVGAVTNEEFGRFVEATGYVTEAERLG 96
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
++VF ++ LD ++P +
Sbjct: 97 WSYVF---------------------YKFLDG------NNPTSGV--------------- 114
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
AEW G +W P G S++E N PVVH+SW DA YC W G
Sbjct: 115 --VGAEWWRKVEGA---------NWAFPSGPGSSLEKLGNFPVVHISWQDASDYCHWAGG 163
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
RLP+EAEWE+ RGGL+ ++PWG+ N+WQG FP +TAADGY AP +S
Sbjct: 164 RLPSEAEWEHAARGGLDQPVYPWGDEQPTETFTPCNIWQGRFPDLDTAADGYAGLAPALS 223
Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCY 316
++ N +GL+NMVGN WEWTAD + V A + +V KGGSYLC+ YC+
Sbjct: 224 FEPNGYGLHNMVGNCWEWTADPFRVRSLRKAANSANSASKQEWKRVLKGGSYLCHTSYCH 283
Query: 317 RHRCAARSQNTPDSSAGNLGFRCAAD 342
R+R AAR +T DS+ G++GFR A D
Sbjct: 284 RYRIAARMGSTGDSTTGHIGFRIAFD 309
>gi|332662774|ref|YP_004445562.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332331588|gb|AEE48689.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 381
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 128/341 (37%), Positives = 172/341 (50%), Gaps = 48/341 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + F+MG + D E+P V + F++D EV+N QFQ FV ATGYVT AE
Sbjct: 77 MVEIQAGQFQMGGDNAQARPD-EYPKHAVEVKGFWMDATEVTNAQFQRFVDATGYVTVAE 135
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D EE K ++ E L P +I
Sbjct: 136 RAIDL---------EELKKQVPPGTELPSAEELQPFSLVFTAPPSGNI------------ 174
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+ + A+W +G +W PEG S ++ + HPVVHV W DA+AYC W
Sbjct: 175 --QTLSPADWWQMVKGA---------NWRQPEGPGSNLKGKDKHPVVHVCWYDAIAYCKW 223
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY RGG ++PWG + P +AN +QGEFP DG+ +P
Sbjct: 224 AGKRLPTEAEWEYAGRGGQSEAIYPWG--MEPVSPQKANFFQGEFPVKQLNEDGFQRLSP 281
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH------------PAPSYNPKGPTTGTDKV 302
V S+ N +GLY+M GNVWEW ADW+ ++ PA SY+P+ P KV
Sbjct: 282 VKSFAANPYGLYDMAGNVWEWCADWYRPDYYAQNKGTLRNPQGPADSYDPEEP-NAPKKV 340
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+GGS+LCN+ YC +R AAR + +PD+ + GFRC DK
Sbjct: 341 VRGGSFLCNDSYCSGYRVAARMKTSPDTGLEHTGFRCVRDK 381
>gi|13544049|gb|AAH06159.1| SUMF2 protein, partial [Homo sapiens]
Length = 296
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 25 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 83
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 84 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 113
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 114 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 153
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 154 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 210
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 211 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 257
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 258 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 294
Score = 41.2 bits (95), Expect = 0.84, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 245 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 289
>gi|332706795|ref|ZP_08426856.1| hypothetical protein LYNGBM3L_23730 [Moorea producens 3L]
gi|332354679|gb|EGJ34158.1| hypothetical protein LYNGBM3L_23730 [Moorea producens 3L]
Length = 296
Score = 215 bits (547), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 129/337 (38%), Positives = 186/337 (55%), Gaps = 58/337 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TF+MG++ + E + +VT+++F +D++EV+N +F +FV TGY+T AE
Sbjct: 1 MTFIPGGTFKMGSD---VYYPSERSADDVTVESFCIDKYEVTNAEFAKFVKETGYITVAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + F P L+EE+RA S V F+ D+ + PV ++SW
Sbjct: 58 RPLSSEQF-PNLTEEQRAPGSLV------FQMPDNGVR-----PVGYLSW---------- 95
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W++ P W HP G S I+ + N+PVVHV++ D AY W
Sbjct: 96 ---------WKWT----------PGADWQHPFGPKSNIKGKKNYPVVHVAYEDVQAYAKW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G +PTEA+WEY GGL++ + WG + + +AN WQG FP NT ADG+ TAP
Sbjct: 137 AGKSVPTEAQWEYAAHGGLDDATYSWGEQYSAK---KANTWQGFFPFFNTKADGHTGTAP 193
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTT---------GTDK-VK 303
V S++ N +GLY++ GNVWEWT+DW+++ H+ A S NP GP G K V
Sbjct: 194 VGSFEPNGYGLYDITGNVWEWTSDWYSIGHNGKAHSLNPTGPNQKESFDPREPGVAKHVI 253
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR +PD+ ++GFR
Sbjct: 254 KGGSYLCAPNYCSRYRPAARESQSPDTGTTHIGFRLV 290
>gi|311251202|ref|XP_003124494.1| PREDICTED: sulfatase-modifying factor 2-like isoform 1 [Sus scrofa]
Length = 301
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 132/337 (39%), Positives = 175/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F+MGTN P +DGE P R VT+ F +D V+N F++FV Y TEAE
Sbjct: 30 MVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE L+ +E R+K + H + + W W
Sbjct: 89 AFGWSFVFEDLVPDELRSKAT---------------------HQMQSLLW--------W- 118
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E R F W P G S I R+ PVVHVSWNDA AYC W
Sbjct: 119 ---LPVE-------------RAF----WRQPAGPGSGIRERLEFPVVHVSWNDARAYCAW 158
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 159 RGKRLPTEEEWEFAARGGLKGQVYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 215
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 216 VNAFPPQNNYGLYDLMGNVWEWTA-------------SPYQAADQDMRVLRGASWIDTAD 262
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCA+ G G
Sbjct: 263 GSANHRARVTTRMGNTPDSASDNLGFRCASSAGRPPG 299
>gi|355722653|gb|AES07643.1| sulfatase modifying factor 2 [Mustela putorius furo]
Length = 299
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 175/338 (51%), Gaps = 70/338 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV LPG F+MGTN P +DGE P R VT+ F +D V+N F+EFV Y TEA
Sbjct: 28 NMVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREEKYRTEA 86
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE +S+E R K++ H + + W W
Sbjct: 87 EMFGWSFVFEDFVSDELRNKVT---------------------HQMESVLW--------W 117
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I ++ PVVHVSWNDA AYC
Sbjct: 118 ----LPVE-------------RAF----WRQPAGPGSGIREKLELPVVHVSWNDARAYCA 156
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 157 WRGKRLPTEEEWEFAARGGLNGQVYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVS 213
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
PV ++ +N +GLY+++GNVWEWTA +P +V +G S++
Sbjct: 214 PVNAFPPQNNYGLYDLMGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTA 260
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCA+ G G
Sbjct: 261 DGSANHRARVTTRMGNTPDSASDNLGFRCASSAGRLPG 298
>gi|291234208|ref|XP_002737041.1| PREDICTED: sulfatase-modifying factor 2-like [Saccoglossus
kowalevskii]
Length = 326
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 125/337 (37%), Positives = 180/337 (53%), Gaps = 64/337 (18%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
++++++MV L G F MGT++P +DGE PSR VT+ F + ++ V+N F++FV A
Sbjct: 21 LKKFEEMVELKGGKFIMGTDEPNA-RDGEGPSRKVTVKPFKIMRYPVTNAAFRQFVKAKR 79
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+ TEAE+F +FVF+ +SEE +A I++
Sbjct: 80 FKTEAERFQWSFVFDAFVSEEVKANITE-------------------------------- 107
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R+P W P W PEG STI ++N P VH+S NDA
Sbjct: 108 --------RIPNTPWW----------VPVPRAYWRQPEGQGSTITKKLNWPAVHISANDA 149
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
YC W+G RLPTE EWEY RGGLE L+PWG + R N+WQG+FP NT DG
Sbjct: 150 KQYCQWKGWRLPTEPEWEYAARGGLECELYPWGARYDNK---RMNIWQGKFPEENTKEDG 206
Query: 249 YLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
Y ++PV +++ +N +G+Y+MVGN+WEWT+ + P NP+ P V +GGS
Sbjct: 207 YHGSSPVDAFEPQNDYGMYDMVGNLWEWTS---TIFKSPNQDSNPENPML----VLRGGS 259
Query: 308 YLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAAD 342
Y+ ++ + H R R NT D+ + NLGFRCA++
Sbjct: 260 YIDSKDGAFNHMARVTTRMGNTADAGSDNLGFRCASN 296
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 352 VKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
V +GGSY+ ++ + H R R NT D+ + NLGFRCA++V
Sbjct: 254 VLRGGSYIDSKDGAFNHMARVTTRMGNTADAGSDNLGFRCASNV 297
>gi|343959230|dbj|BAK63472.1| sulfatase-modifying factor 2 precursor [Pan troglodytes]
Length = 301
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 133/337 (39%), Positives = 174/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 30 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 89 IFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 118
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E R F W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 119 ---LPVE-------------RAF----WRQPAGPGSGILERLEHPVLHVSWNDARAYCAW 158
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 159 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 215
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 216 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 262
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 263 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 299
Score = 41.2 bits (95), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 250 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 294
>gi|431798392|ref|YP_007225296.1| hypothetical protein Echvi_3048 [Echinicola vietnamensis DSM 17526]
gi|430789157|gb|AGA79286.1| hypothetical protein Echvi_3048 [Echinicola vietnamensis DSM 17526]
Length = 371
Score = 214 bits (546), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 182/346 (52%), Gaps = 58/346 (16%)
Query: 15 MVLLPGDTFRMGTNK----PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV +PG F MGT++ P+ E P+ ++ F++D+ EV+N F++FV TGYV
Sbjct: 57 MVWIPGGDFMMGTDEQDAYPV-----ERPAVKRHVEGFWMDETEVTNAAFKKFVDETGYV 111
Query: 71 TEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
T AEK + + + L + + S ++ F D ++ R D
Sbjct: 112 TIAEKKPEWEEMKKQLPPDTPKPADSLLQPGSLVFVAPDRPVDTR-----------DISN 160
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+ W P SW HPEG S ++ RM+HPVVH++++DAV
Sbjct: 161 WWKWT-----------------------PGASWKHPEGPGSDLQGRMDHPVVHIAYDDAV 197
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AY W G RLPTEAEWE+ RGGL + + WG+ L P+G+ AN +QG FP N DG+
Sbjct: 198 AYAKWAGKRLPTEAEWEFAARGGLIEKRYSWGDELRPQGKFMANTFQGRFPNGNQGKDGF 257
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APS--------YNPKGPT 296
+ TAPV +Y N +GLY+++GNVWE T DW++ AP YNP P
Sbjct: 258 MGTAPVKNYPSNDYGLYDIIGNVWELTDDWFDAIKFQRLTGNAPKLDANMNQCYNPTNPY 317
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
++V KGGSYLC++ YC +R +AR + DS + N+GFRC D
Sbjct: 318 A-KERVIKGGSYLCSDNYCINYRPSARQGHAYDSGSSNVGFRCVKD 362
>gi|392378971|ref|YP_004986130.1| sulfatase-modifying factor 1 [Azospirillum brasilense Sp245]
gi|356881338|emb|CCD02323.1| sulfatase-modifying factor 1 [Azospirillum brasilense Sp245]
Length = 349
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 129/335 (38%), Positives = 169/335 (50%), Gaps = 60/335 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG R+G+++ E P V++D F++D+ EV+N QF+ FV ATGYVTEAE
Sbjct: 60 MVWIPGGDTRIGSDQHYRE---EGPVTRVSVDGFWMDRTEVTNAQFRRFVEATGYVTEAE 116
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + D +R G ++ P +D TW
Sbjct: 117 RV-----------------VDGTGADARRPSGSVVFVD-----PTSQAGRSD-----TW- 148
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W RL P +W HPEG S I R NHPVVHV++NDAVAY W
Sbjct: 149 ---------W----------RLLPEANWRHPEGSGSDIGGRENHPVVHVTYNDAVAYARW 189
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEA+WE+ RGGL+ + WG TP+G AN WQG FP NT DGY +TAP
Sbjct: 190 LGRELPTEAQWEHAARGGLDGAPYAWGGEFTPQGVAMANTWQGFFPIQNTRGDGYPATAP 249
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT----------DKVKK 304
V + N +GL +M+GNVWEWT+DW+ H NP GP +V K
Sbjct: 250 VGCFTANGYGLRDMIGNVWEWTSDWYRPRHDGLGDVNPTGPRQEDSYDPRQPGVPSRVIK 309
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
GGS+LC + +C R+R A+R A +LGFR
Sbjct: 310 GGSFLCAQNFCQRYRPASRHPQEVGVGASHLGFRT 344
>gi|296452916|sp|Q8NBJ7.2|SUMF2_HUMAN RecName: Full=Sulfatase-modifying factor 2; AltName:
Full=C-alpha-formylglycine-generating enzyme 2; Flags:
Precursor
Length = 301
Score = 214 bits (545), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 130/337 (38%), Positives = 173/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DG+ P R T+ F +D V+N F++FV Y TEAE
Sbjct: 30 MVQLQGGRFLMGTNSPD-SRDGDGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 89 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 118
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 119 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 158
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 159 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 215
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 216 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 262
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 263 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 299
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 250 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 294
>gi|254776894|ref|ZP_05218410.1| hypothetical protein MaviaA2_19816 [Mycobacterium avium subsp.
avium ATCC 25291]
Length = 296
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 129/337 (38%), Positives = 173/337 (51%), Gaps = 50/337 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++V LPG +FRMG+ E P VT++ F +++H V+N QF EFV+ATGYVT
Sbjct: 3 SELVDLPGGSFRMGSTS---FYPEEAPIHTVTVEPFAIERHPVTNAQFAEFVAATGYVTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ A SQ++ F ++ R
Sbjct: 60 AEQPIDPALYP-------GADPSQLQPGAMVFRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR +W + P SW HP G DS I R +HPVV V++ DA AY
Sbjct: 96 WR--------QWWH---------WVPGASWRHPSGPDSDIADRADHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G +PWG+ T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAGATT-TYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV + N FGL +M+GNVWEWT ++ HH P P GP ++ KGGS
Sbjct: 198 SPVGVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPSVNQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+LC +YC+R+R AARS + DS+ ++GFRC A+ G
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAEVG 294
>gi|332865303|ref|XP_001160187.2| PREDICTED: sulfatase-modifying factor 2 isoform 10 [Pan
troglodytes]
gi|410224492|gb|JAA09465.1| sulfatase modifying factor 2 [Pan troglodytes]
gi|410302660|gb|JAA29930.1| sulfatase modifying factor 2 [Pan troglodytes]
Length = 320
Score = 214 bits (544), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 133/337 (39%), Positives = 172/337 (51%), Gaps = 70/337 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 49 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 108 IFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 137
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E R F W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 138 ---LPVE-------------RAF----WRQPAGPGSGILERLEHPVLHVSWNDARAYCAW 177
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 178 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 234
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
V ++ +N +GLY+++GNVWEWTA +P +V +G S++
Sbjct: 235 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 281
Query: 314 YCYRH--RCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
H R R NTPDS++ NLGFRCAAD G G
Sbjct: 282 GSANHQARVTTRMGNTPDSASDNLGFRCAADAGRPPG 318
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ H R R NTPDS++ NLGFRCAAD
Sbjct: 269 RVLRGASWIDTADGSANHQARVTTRMGNTPDSASDNLGFRCAADA 313
>gi|297288257|ref|XP_001105383.2| PREDICTED: sulfatase-modifying factor 2-like isoform 1 [Macaca
mulatta]
Length = 303
Score = 214 bits (544), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 132/340 (38%), Positives = 173/340 (50%), Gaps = 73/340 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ---EFVSATGYVT 71
MV LPG F MGTN P +DGE P R T+ F +D V+N F+ +FV Y T
Sbjct: 29 MVQLPGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRYIRDFVREKKYRT 87
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAE FG +FVFE +S+E R K +Q P+ + W
Sbjct: 88 EAEMFGWSFVFEDFVSDELRNKATQ---------------------PMKPVLW------- 119
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W LP E + W P G S I R+ HPV+HVSWNDA AY
Sbjct: 120 -W----LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAY 157
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
C WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG FP + A DG+
Sbjct: 158 CAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGRFPKGDKAEDGFHG 214
Query: 252 TAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL- 309
+PV ++ +N +GLY+++GNVWEWTA +P +V +G S++
Sbjct: 215 VSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWID 261
Query: 310 -CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCAAD G G
Sbjct: 262 TADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRLPG 301
Score = 41.2 bits (95), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 252 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 296
>gi|338534920|ref|YP_004668254.1| hypothetical protein LILAB_26420 [Myxococcus fulvus HW-1]
gi|337261016|gb|AEI67176.1| hypothetical protein LILAB_26420 [Myxococcus fulvus HW-1]
Length = 334
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 132/346 (38%), Positives = 169/346 (48%), Gaps = 63/346 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF MG++ E PS VT+ F++D V+N FQ FV ATGYVT AE
Sbjct: 31 MVFIPGGTFWMGSDHHY---PEERPSHQVTVTDFWMDSQPVTNADFQRFVEATGYVTVAE 87
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ PL ++ F G + + + V T
Sbjct: 88 R--------PLNPDD--------------FPGANPAL----------LVPGSLVFKKTLG 115
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L +W + G W HPEG DST+E R HPVVHV + DA+AY W
Sbjct: 116 RVDLRDLTQWWFYTAGAC---------WKHPEGPDSTLEGREQHPVVHVCYEDALAYARW 166
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
LPTEAEWEY RGG+E +++ WG+ P G+ AN WQGEFP N DG+ T+P
Sbjct: 167 ADKALPTEAEWEYAARGGMERKVYVWGDEFAPGGKLMANTWQGEFPWQNLNTDGFEGTSP 226
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKGPTTG---------- 298
V S+ N FGL++M GNVWEWT+DW+ H NP+GP T
Sbjct: 227 VGSFPPNAFGLFDMAGNVWEWTSDWYQERHERHAGKPCCIPVNPRGPATSERSLDPCLPK 286
Query: 299 ---TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+V KGGS+LC YC R+R AARS DS ++GFRC
Sbjct: 287 VKIPRRVLKGGSHLCAPNYCLRYRPAARSPQAVDSGTSHIGFRCVV 332
>gi|116626601|ref|YP_828757.1| hypothetical protein Acid_7564 [Candidatus Solibacter usitatus
Ellin6076]
gi|116229763|gb|ABJ88472.1| protein of unknown function DUF323 [Candidatus Solibacter usitatus
Ellin6076]
Length = 315
Score = 213 bits (543), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 135/346 (39%), Positives = 170/346 (49%), Gaps = 63/346 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ +PG TFRMG+ E P V +D F++D HEV+N ++ EFV ATGY+T AE
Sbjct: 11 MIRIPGGTFRMGSEA---FYPEESPVHEVAVDGFWIDCHEVTNQRYGEFVEATGYLTVAE 67
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + F P E S + R PV D Y W
Sbjct: 68 RPLNAADF-PGAPPENLVP--------------GSMVFRRTSGPV------DLHNYVNW- 105
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W + P SW HP+G S++E HPVVHV++ DA AY W
Sbjct: 106 ---------WAW----------VPGASWKHPQGPGSSLEGIEQHPVVHVAYEDAEAYAGW 146
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGL+ + F WG+ P G AN WQGEFP N ADGY T+P
Sbjct: 147 AGKELPTEAEWEFAARGGLDGKNFTWGDEDFPGGNAMANSWQGEFPWQNLLADGYEGTSP 206
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH------------------HPAPSYNPKGPT 296
V S+ N +GLY+M GNVWEWT+DW+ H PA SY+P P
Sbjct: 207 VGSFPPNGYGLYDMAGNVWEWTSDWFVPRHADEVVRSCCGPAVNPRISSPAKSYDPAQPQ 266
Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KV KGGS+LC YC R+R AAR D+ ++GFRC A
Sbjct: 267 FQIPRKVVKGGSHLCAPNYCLRYRPAARQPQMLDTGMTHIGFRCIA 312
>gi|374368153|ref|ZP_09626207.1| C-alpha-formyglycine- generating protein [Cupriavidus basilensis
OR16]
gi|373100317|gb|EHP41384.1| C-alpha-formyglycine- generating protein [Cupriavidus basilensis
OR16]
Length = 388
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 174/355 (49%), Gaps = 60/355 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ M +PG F MG+N L +D E P+ + F++DQH V+N +F+ FV+ATGY+T
Sbjct: 50 RGMAWVPGGEFLMGSNH-RLARDNERPAHKTRVHGFWMDQHHVTNAEFRAFVAATGYITT 108
Query: 73 AEKFGD--TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
AEK D T + + + V M F G D +
Sbjct: 109 AEKKPDWETLKVQVPAGTPQPPDSALVPGGMV-FVGTDRPVP------------------ 149
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
L + W R P +W HP+G STIE + NHPVV VS+ DA A
Sbjct: 150 -------LQYSSRWW---------RFVPGANWRHPDGPGSTIEGKDNHPVVQVSYEDAQA 193
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAA 246
Y W G RLPTEAEWE+ RGGLE + WG+ P G+ ANVWQG+ FP + A
Sbjct: 194 YARWAGMRLPTEAEWEFAVRGGLEQATYAWGDQFAPGGKQMANVWQGQQSQPFPVVSAKA 253
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYN 291
G L T+ V ++ N +GLY+M GN W+W ADW+ PA S++
Sbjct: 254 GGALGTSAVGTFPANGYGLYDMTGNAWQWGADWYRADQFKRVASIGQVEVYPLGPADSWD 313
Query: 292 PKGPTT---GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
P P +V +GGSYLCN+ YC +R +AR P +S +LGFR A D+
Sbjct: 314 PSEPGVPARAPKRVIRGGSYLCNQDYCLSYRPSARQGADPYTSMSHLGFRLAMDE 368
>gi|404447923|ref|ZP_11012917.1| hypothetical protein A33Q_01220 [Indibacter alkaliphilus LW1]
gi|403766509|gb|EJZ27381.1| hypothetical protein A33Q_01220 [Indibacter alkaliphilus LW1]
Length = 352
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 177/348 (50%), Gaps = 49/348 (14%)
Query: 9 VERYKD-MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
+ R KD MV +P F MGTN P E P+ + AF++D+ EV+N QF EFV+ T
Sbjct: 38 IVRSKDGMVFIPEGEFFMGTNDPESYP-VERPAVKRKVKAFWMDETEVTNAQFAEFVAET 96
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
GYVT AEK D +E + + +M+ + S R+ + + W
Sbjct: 97 GYVTMAEKTPD---WEEMKKQLPPGTPKPPEENMQPGSMVFSAPSSRVPLHDISLWW--- 150
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+W G SW HPEG S IE NHPVVH+++ D
Sbjct: 151 ---------------KWVLGA------------SWKHPEGPGSDIEGLENHPVVHIAYED 183
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G RLPTE EWEY + G E++ + WG+ P+ AN +QGEFP NN A D
Sbjct: 184 AEAYAKWAGKRLPTETEWEYAAKSGREHQRYAWGDTYKPQMSFMANTFQGEFPNNNLAED 243
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPS--------YNPKG 294
G+ TAPV S+ N +GLY+M+GNVWE T+DW++ AP+ YNP
Sbjct: 244 GFEGTAPVKSFPPNDYGLYDMIGNVWEMTSDWYDAIKFRRLAGNAPALDSSISQCYNPTN 303
Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
P +KV KGGS+LC + YC +R +AR +S N+GFRC D
Sbjct: 304 P-YALEKVIKGGSFLCADDYCINYRPSARQGQDYESGTSNIGFRCVRD 350
>gi|395800381|ref|ZP_10479657.1| hypothetical protein FF52_00945 [Flavobacterium sp. F52]
gi|395437554|gb|EJG03472.1| hypothetical protein FF52_00945 [Flavobacterium sp. F52]
Length = 376
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/355 (36%), Positives = 178/355 (50%), Gaps = 66/355 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPI--------LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
K MV +PG F MGTN + KD P V +D +Y+D+ EV+N +F++FV
Sbjct: 64 KGMVWIPGGEFSMGTNVEDESLCSIKGVTKDAA-PIHRVYVDGYYMDETEVTNEEFEKFV 122
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
ATGYVT AE+ F P +EE+ S V L++
Sbjct: 123 KATGYVTVAEQKPTKEEF-PTANEEDLITGSVVFTPTPSAVNLNN--------------- 166
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
+ W W Y P W HPEG STI+ + +PVVHV
Sbjct: 167 -----FLQW----------WRYE----------PGADWRHPEGPQSTIKGKEKYPVVHVV 201
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN-- 242
+ DA AY W G RLPTEAEWE+ RGG L+ WGN+L P+G+ +AN++QG FP
Sbjct: 202 YEDAAAYAKWAGKRLPTEAEWEFAARGGKTGNLYAWGNDLKPKGKFQANIYQGHFPIKDG 261
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA------PSYNPKGPT 296
+T DG+ AP Y N +GLY+M GNVWEW DW++V ++ + + NP+GP+
Sbjct: 262 DTGEDGFKGIAPTKQYAPNGYGLYDMAGNVWEWVNDWYSVDYYKSLAESGKVTKNPQGPS 321
Query: 297 TGTD--------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
D +V +GGS+LC +QYC R+ R + S+A ++GFRC K
Sbjct: 322 AYYDPNDPSEIKRVHRGGSFLCTDQYCTRYMVGTRGKGEIRSAANHVGFRCVKSK 376
>gi|390944509|ref|YP_006408270.1| hypothetical protein Belba_2996 [Belliella baltica DSM 15883]
gi|390417937|gb|AFL85515.1| hypothetical protein Belba_2996 [Belliella baltica DSM 15883]
Length = 346
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 133/346 (38%), Positives = 172/346 (49%), Gaps = 49/346 (14%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
++ MVL+ G F MGTN + E P+ +++F++D+ EV+N QF FV ATGY
Sbjct: 36 KKMDGMVLIQGGDFSMGTND-LNSYPSERPAVLRKVESFWMDETEVTNKQFAAFVEATGY 94
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT AE+ D E K + K E L P +S +D
Sbjct: 95 VTVAERKPDW----------EEMKQTLPPGTPKPDESLLVAGSLVFVAPDQAVSLDDISQ 144
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+ W P W PEG S I+ + NHPVVH++ DA
Sbjct: 145 WWQW-----------------------VPGAYWKQPEGPGSDIQGKENHPVVHIALEDAE 181
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AY TW G RLPTEAEWEY +GG N FPWG+ L P G RAN +QGEFP N DG+
Sbjct: 182 AYATWAGKRLPTEAEWEYAAKGG-NNSRFPWGDELNPSGSFRANTFQGEFPYQNEGIDGF 240
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-----HHHPAPS--------YNPKGPT 296
APV S+ N FGLY+M+GNVWE T+DW++ AP+ YNP+ P
Sbjct: 241 EGAAPVRSFAGNDFGLYDMIGNVWELTSDWFDAAAFMREKALAPALDTSISKCYNPENPY 300
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+KV KGGS+LC + YC +R +AR S N+GFRC D
Sbjct: 301 A-IEKVIKGGSFLCADDYCINYRPSARRGQDVYSGTSNVGFRCVKD 345
>gi|255292428|dbj|BAH89546.1| hypothetical protein [uncultured bacterium]
Length = 266
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 124/302 (41%), Positives = 158/302 (52%), Gaps = 57/302 (18%)
Query: 45 LDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRF 104
L F +D V+ +F+ FV TGYVTEAE+FG ++VF L+ +D F
Sbjct: 18 LKPFSIDATAVTIGRFERFVRDTGYVTEAERFGWSYVFFALM------------NDPDSF 65
Query: 105 EGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLH 164
L + W V TWR
Sbjct: 66 SALPG------------LEWWRKVDGATWR-----------------------------T 84
Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL 224
PEG +S I+ R +PV HVSWNDA A+ +W G RLPTEAEWE+ GGL+ FPWGN+
Sbjct: 85 PEGPESDIDGRAEYPVTHVSWNDATAFASWAGGRLPTEAEWEFAAAGGLQGARFPWGNDE 144
Query: 225 TPRGEHR-ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH 283
+ N+WQG FP NT ADGYL TAPV S+ N FGLYNMVGN WEWT D + +
Sbjct: 145 PDDASFQPCNIWQGSFPDRNTVADGYLGTAPVESFAPNGFGLYNMVGNTWEWTVDRFRIK 204
Query: 284 --HHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
A N +G +V KGGSYLC++ YCYR+R AAR+ +TPDS+ G++GFR
Sbjct: 205 SVSKAAERLNTLARKSGA-RVIKGGSYLCHKSYCYRYRIAARTSSTPDSTTGHMGFRLCY 263
Query: 342 DK 343
D+
Sbjct: 264 DR 265
>gi|386395696|ref|ZP_10080474.1| hypothetical protein Bra1253DRAFT_01138 [Bradyrhizobium sp.
WSM1253]
gi|385736322|gb|EIG56518.1| hypothetical protein Bra1253DRAFT_01138 [Bradyrhizobium sp.
WSM1253]
Length = 328
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 176/343 (51%), Gaps = 62/343 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ +PG TFRMG+++ E P V +DAF++D H V+N QF+ FV ATG+VT AE
Sbjct: 24 MLYVPGGTFRMGSDRHY---PEEAPVHRVAVDAFWMDCHSVTNRQFKAFVGATGHVTVAE 80
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + +L A S + HPV
Sbjct: 81 ITPDPKDYPGMLPHMVYAG---------------SLVFSPPDHPV--------------- 110
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L ++W +G W HP G S I+ +HPVVHV+++DA+AY W
Sbjct: 111 --SLRDFSQWWTFAKGA---------DWRHPYGPGSKIQGLDDHPVVHVAFSDALAYARW 159
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGL+ F WG+ TP G H AN WQGEFP NT DG+ T+P
Sbjct: 160 AGKDLPTEAEWEFAARGGLDGAEFAWGDEFTPDGRHMANTWQGEFPRQNTRDDGFERTSP 219
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY------NPKG--------PTTGT- 299
V ++ N +GL++++GNVWEWT+DW++ H + NP+G P+T
Sbjct: 220 VTAFPPNGYGLHDLIGNVWEWTSDWYSPRHEADAAKACCIPENPRGGREADSYDPSTTNV 279
Query: 300 ---DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D+S +LGFRC
Sbjct: 280 KIPRKVIKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHLGFRC 322
>gi|386402426|ref|ZP_10087204.1| hypothetical protein Bra1253DRAFT_08214 [Bradyrhizobium sp.
WSM1253]
gi|385743052|gb|EIG63248.1| hypothetical protein Bra1253DRAFT_08214 [Bradyrhizobium sp.
WSM1253]
Length = 328
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 132/350 (37%), Positives = 177/350 (50%), Gaps = 64/350 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++MV +PG TFRMG+ + E P VT+D F++D+ ++N QF++FV+ATGYVT
Sbjct: 20 REMVYIPGGTFRMGSEQHY---PEEAPVHRVTVDGFWIDRTPITNRQFRDFVNATGYVTF 76
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE D + L +A +V AV
Sbjct: 77 AELIPDAKDYPGALPRMLKAG------------------------SLVFTPPQQAVDLRN 112
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W EW F +G+ W P G S + NHPVVHV++ D AY
Sbjct: 113 W--------GEW----------WTFKFGANWRRPYGPHSNLRGLENHPVVHVAYRDVEAY 154
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWE+ RGGL+ F WG+ L P G+H AN+WQG FP N A DG+
Sbjct: 155 AKWAGKELPTEAEWEFASRGGLDGAEFAWGHELEPGGQHMANIWQGNFPHENLATDGFER 214
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAP----------------SYNPKG 294
T+PV ++ N + LY+M+GNVWEWT+DW++ +H AP SY+P+
Sbjct: 215 TSPVTAFSTNGYELYDMIGNVWEWTSDWYSPMHEADAPKACCIPQNPRGGLENASYDPRN 274
Query: 295 PTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
P KV KGGS+LC YC R+R AAR D+SA ++GFRC K
Sbjct: 275 PKIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSASHVGFRCITRK 324
>gi|118466185|ref|YP_883593.1| sulfatase-modifying factor 1 [Mycobacterium avium 104]
gi|118167472|gb|ABK68369.1| sulfatase-modifying factor 1 [Mycobacterium avium 104]
Length = 296
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 173/337 (51%), Gaps = 50/337 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++ LPG +FRMG+ E P VT++ F +++H V+N QF EFV+ATGYVT
Sbjct: 3 SELADLPGGSFRMGSTS---FYPEEAPIHTVTVEPFAIERHPVTNAQFAEFVAATGYVTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ A SQ++ F ++ R
Sbjct: 60 AEQPIDPALYP-------GADPSQLQPGAMVFRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR +W + P SW HP G DS + R +HPVV V++ DA AY
Sbjct: 96 WR--------QWWH---------WVPGASWRHPFGPDSDVADRADHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G +PWG+ T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAGATT-TYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV + N FGL +M+GNVWEWT ++ HH P P GP G ++ KGGS
Sbjct: 198 SPVGVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPGVNQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+LC +YC+R+R AARS + DS+ ++GFRC A+ G
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAEVG 294
>gi|351697166|gb|EHB00085.1| Sulfatase-modifying factor 2 [Heterocephalus glaber]
Length = 301
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/334 (40%), Positives = 173/334 (51%), Gaps = 70/334 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMV LPG F MGT+ P +DGE P R VT+ F +D V+N F++FV Y TEA
Sbjct: 29 DMVWLPGGRFLMGTDAPD-ARDGEGPVREVTVKPFGIDIFPVTNKDFRKFVREKKYRTEA 87
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE +S E + K +Q MK PV+ W
Sbjct: 88 ELFGWSFVFEEFVSSELKNKAAQ---QMK---------------PVLW-----------W 118
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I R+ PV+HVSWNDA AYC
Sbjct: 119 ----LPVE-------------RAF----WRQPTGPGSGIRDRLEQPVLHVSWNDAQAYCR 157
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 158 WRGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGVS 214
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
PV ++ +N +GL+++VGNVWEWT+ +P P V +G S++
Sbjct: 215 PVNAFPPQNDYGLFDLVGNVWEWTS-------------SPYQPAKQDMHVLRGASWIDTA 261
Query: 313 QYCYRH--RCAARSQNTPDSSAGNLGFRCAADKG 344
H R R NTPDS++ NLGFRCAA KG
Sbjct: 262 DGSANHQARVTTRMGNTPDSASDNLGFRCAASKG 295
>gi|27379853|ref|NP_771382.1| hypothetical protein bll4742 [Bradyrhizobium japonicum USDA 110]
gi|27353006|dbj|BAC50007.1| bll4742 [Bradyrhizobium japonicum USDA 110]
Length = 331
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/343 (38%), Positives = 172/343 (50%), Gaps = 62/343 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ +PG TFRMG+++ E P VT+DAF++D H V+N QF+ F ATG+VT AE
Sbjct: 24 MLYVPGGTFRMGSDRHY---PEEAPVHRVTVDAFWMDCHPVTNRQFKAFAKATGHVTVAE 80
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + +L A S + HPV
Sbjct: 81 ITPDPKDYPGMLPHMIYAG---------------SLVFSPPAHPV--------------- 110
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L ++W +G W HP G S I +HPVVHV+++DA+AY W
Sbjct: 111 --SLRDFSQWWTFAKGA---------DWRHPYGPGSDIRGLDDHPVVHVAFSDALAYARW 159
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGL+ F WG+ P G H AN WQGEFP NT DGY T+P
Sbjct: 160 AGKDLPTEAEWEFAARGGLDGAEFAWGDVFAPDGRHMANTWQGEFPRQNTRDDGYERTSP 219
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----------------APSYNPKGPTT 297
V ++ N +GL++M+GNVWEWT+DW++ H A SY+P+
Sbjct: 220 VTAFPPNGYGLHDMIGNVWEWTSDWYSPRHEADAAKACCIPENPRGGREADSYDPRTTNV 279
Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D+S +LGFRC
Sbjct: 280 KIPRKVIKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHLGFRC 322
>gi|383818630|ref|ZP_09973916.1| hypothetical protein MPHLEI_05012 [Mycobacterium phlei RIVM601174]
gi|383338486|gb|EID16850.1| hypothetical protein MPHLEI_05012 [Mycobacterium phlei RIVM601174]
Length = 291
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 130/331 (39%), Positives = 170/331 (51%), Gaps = 50/331 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ + E P VT+ F +++H V+N QF FV+ TGYVT A
Sbjct: 4 ELVELPGGSFRMGSTQ---FYPEEAPVHTVTVAPFAIERHPVTNAQFAAFVADTGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +P+ D + G D + +V VA W
Sbjct: 61 E--------QPI--------------DPALYPGADPA--DLVPGALVFTPTEGPVALRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R +W R P W HP G DS IE RM+HPVV V++ DA AY
Sbjct: 97 R--------QWW---------RWVPGACWRHPFGPDSGIEDRMDHPVVQVAYPDAAAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY R G +PWG++ TP G AN WQG+FP NT A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARAGTTT-TYPWGDDPTPDGRLMANTWQGDFPYRNTGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH----HPAPSYNPKG-PTTGTDKVKKGGSY 308
PV + N FGL +M+GNVWEWT + HH P P G P + KGGS+
Sbjct: 199 PVGVFPPNGFGLVDMIGNVWEWTTTRFTGHHRIGEQPKSCCTPSGPPNPAISQALKGGSH 258
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
LC +YC+R+R AARS + DS+ ++GFRC
Sbjct: 259 LCAPEYCHRYRAAARSPQSQDSATTHIGFRC 289
>gi|348560086|ref|XP_003465845.1| PREDICTED: sulfatase-modifying factor 2-like [Cavia porcellus]
Length = 314
Score = 212 bits (539), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 137/338 (40%), Positives = 175/338 (51%), Gaps = 70/338 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMV L G F MGT+ +DGE P+R VT+ F +D V+N F+EFV Y TEA
Sbjct: 42 DMVQLLGGQFLMGTDASD-ARDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 100
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE L+S E R SQ MK PV+ W
Sbjct: 101 ELFGWSFVFEDLVSSELRNSASQ---QMK---------------PVLW-----------W 131
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I R+ PV+HVSWNDA AYC
Sbjct: 132 ----LPVE-------------RAF----WRQPSGPGSGIRDRLELPVLHVSWNDAQAYCA 170
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 171 WRGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDRAEDGFHGVS 227
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
PV ++ +N +GLY+++GNVWEWTA SY P +V +G S++
Sbjct: 228 PVNAFPPQNDYGLYDLMGNVWEWTAS----------SYQPAEQDM---RVLRGASWIDTA 274
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCAA G G
Sbjct: 275 DGSANHRARVTTRMGNTPDSASDNLGFRCAASVGRAFG 312
>gi|338176147|ref|YP_004652957.1| sulfatase-modifying factor 1 [Parachlamydia acanthamoebae UV-7]
gi|336480505|emb|CCB87103.1| sulfatase-modifying factor 1 [Parachlamydia acanthamoebae UV-7]
Length = 390
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 126/340 (37%), Positives = 179/340 (52%), Gaps = 49/340 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG++ D E P V +D F++D V+N QF+EF+ ATGYVT AE
Sbjct: 41 MVWIPGGEFMMGSDYEDSKPD-EKPQHRVKVDGFWMDATPVTNKQFKEFIDATGYVTTAE 99
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV-VHISWNDAVAYCTW 133
+ P L+E ++QV + P V I N++ A+ W
Sbjct: 100 R-------APTLNE----IMNQVPPGTPPPSPELLVAASLVFKPSDVPIPLNNSHAWWEW 148
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+ P +W HP G +STIE + +HPVV VSW+DA AY
Sbjct: 149 K-----------------------PGANWKHPLGPESTIEEKKDHPVVQVSWDDAQAYAK 185
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ GG ++ ++ WGN+ +AN+WQG FP +T + GY T
Sbjct: 186 WAGKRLPTEAEWEFAAYGGRKDIMYVWGNDEFSEEVPQANIWQGIFPYKSTKSKGYNGTT 245
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTT--------GTD 300
PV ++K N +GLY+M GNVW+W +D +++ ++ S NP GP T T
Sbjct: 246 PVTTFKPNPYGLYDMSGNVWQWCSDLYHMTYYQEEAKKELSINPAGPKTSFDPREPYATK 305
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
V +GGS+LC++ YC +R AR + +PD+S +LGFRC
Sbjct: 306 HVHRGGSFLCHDSYCKGYRITARMKTSPDTSLNHLGFRCV 345
>gi|74209819|dbj|BAE23615.1| unnamed protein product [Mus musculus]
Length = 299
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 131/333 (39%), Positives = 172/333 (51%), Gaps = 71/333 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGT+ P +DGE P+R VT+ F +D V+N F+EFV Y TEAE
Sbjct: 29 MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 87
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S E R + E+ M P VH W V WR
Sbjct: 88 AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 125
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P G S I ++ PVVHVSWNDA AYC W
Sbjct: 126 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 156
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 157 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 213
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +Y P G +V +G S++ +
Sbjct: 214 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQPAGQDM---RVLRGASWIDTAD 260
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+R R R NTPDS++ NLGFRCA+ G
Sbjct: 261 GSANHRARVTTRMGNTPDSASDNLGFRCASSAG 293
>gi|77377292|ref|NP_080721.1| sulfatase-modifying factor 2 precursor [Mus musculus]
gi|377834086|ref|XP_003689431.1| PREDICTED: sulfatase-modifying factor 2 isoform 1 [Mus musculus]
gi|341942094|sp|Q8BPG6.2|SUMF2_MOUSE RecName: Full=Sulfatase-modifying factor 2; AltName:
Full=C-alpha-formylglycine-generating enzyme 2; Flags:
Precursor
gi|74180529|dbj|BAE34197.1| unnamed protein product [Mus musculus]
gi|148687550|gb|EDL19497.1| sulfatase modifying factor 2 [Mus musculus]
Length = 308
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 131/333 (39%), Positives = 172/333 (51%), Gaps = 71/333 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGT+ P +DGE P+R VT+ F +D V+N F+EFV Y TEAE
Sbjct: 38 MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 96
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S E R + E+ M P VH W V WR
Sbjct: 97 AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 134
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P G S I ++ PVVHVSWNDA AYC W
Sbjct: 135 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 165
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 166 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 222
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +Y P G +V +G S++ +
Sbjct: 223 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQPAGQDM---RVLRGASWIDTAD 269
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+R R R NTPDS++ NLGFRCA+ G
Sbjct: 270 GSANHRARVTTRMGNTPDSASDNLGFRCASSAG 302
>gi|26344956|dbj|BAC36127.1| unnamed protein product [Mus musculus]
Length = 308
Score = 211 bits (538), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 131/333 (39%), Positives = 172/333 (51%), Gaps = 71/333 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGT+ P +DGE P+R VT+ F +D V+N F+EFV Y TEAE
Sbjct: 38 MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 96
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S E R + E+ M P VH W V WR
Sbjct: 97 AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 134
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P G S I ++ PVVHVSWNDA AYC W
Sbjct: 135 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 165
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 166 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 222
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +Y P G +V +G S++ +
Sbjct: 223 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQPAGQDM---RVLRGASWIDTAD 269
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+R R R NTPDS++ NLGFRCA+ G
Sbjct: 270 GSANHRARVTTRMGNTPDSASDNLGFRCASSAG 302
>gi|414078471|ref|YP_006997789.1| hypothetical protein ANA_C13300 [Anabaena sp. 90]
gi|413971887|gb|AFW95976.1| hypothetical protein ANA_C13300 [Anabaena sp. 90]
Length = 336
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/337 (37%), Positives = 180/337 (53%), Gaps = 58/337 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF +G++ D E + ++T+++F +D++E++N +F +FV TGY+T AE
Sbjct: 43 MVFIPGGTFNIGSDAHY---DSERSAEDITVNSFCMDKYEITNAKFAQFVKETGYITVAE 99
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ F P LS++ R S V F+ D + V H+SW W+
Sbjct: 100 RPLPKDQF-PDLSDDSRKPGSLV------FQMADPEAKR-----VRHLSW--------WK 139
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W G +W HP G DS I + N+PVV V++ D AY W
Sbjct: 140 ---------WTLGA------------NWRHPFGPDSNINGKDNYPVVQVAYEDVKAYTEW 178
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
LPTE +WEY RGGL + ++ WG + E +AN WQG FP NT ADGY+ TAP
Sbjct: 179 AKMSLPTETQWEYAARGGLNHAIYSWGEQYS---EKKANTWQGLFPVFNTKADGYIGTAP 235
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTT---------GTDK-VK 303
V S++ N +GLY++ GNVWEWT DW+ H H + + NP+GP+ G K V
Sbjct: 236 VGSFQPNGYGLYDITGNVWEWTTDWYRPGHDHKSHTSNPQGPSKQESFDPREPGVAKHVI 295
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR +PD+ ++GFR
Sbjct: 296 KGGSYLCAPNYCSRYRPAARESQSPDTGTTHIGFRLV 332
>gi|74147446|dbj|BAE38634.1| unnamed protein product [Mus musculus]
Length = 303
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 131/333 (39%), Positives = 172/333 (51%), Gaps = 71/333 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGT+ P +DGE P+R VT+ F +D V+N F+EFV Y TEAE
Sbjct: 33 MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 91
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S E R + E+ M P VH W V WR
Sbjct: 92 AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 129
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P G S I ++ PVVHVSWNDA AYC W
Sbjct: 130 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 160
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 161 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 217
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +Y P G +V +G S++ +
Sbjct: 218 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQPAGQDM---RVLRGASWIDTAD 264
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+R R R NTPDS++ NLGFRCA+ G
Sbjct: 265 GSANHRARVTTRMGNTPDSASDNLGFRCASSAG 297
>gi|405374984|ref|ZP_11029278.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1) [Chondromyces apiculatus DSM 436]
gi|397086652|gb|EJJ17755.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1) [Myxococcus sp. (contaminant ex DSM
436)]
Length = 360
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 131/346 (37%), Positives = 164/346 (47%), Gaps = 63/346 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF MG++ E PS VT+ F+LD V+N F FV AT YVT AE
Sbjct: 57 MVYIPGGTFWMGSDH---HYPEERPSHQVTVTDFWLDSQPVTNADFARFVEATHYVTVAE 113
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + F E + + R + D T
Sbjct: 114 RPLNPADFPGAKPELLVPGSLVFKKTLARVDLRDLT------------------------ 149
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+W + G W HPEG DST E R HPVVHV + DA+AY TW
Sbjct: 150 --------QWWFYTAGAC---------WKHPEGPDSTWEGREQHPVVHVCFEDALAYATW 192
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
LPTEAEWEY RGG+E +++ WG+ P G+ AN WQGEFP N DG+ ST+P
Sbjct: 193 ADKALPTEAEWEYAARGGMERKVYVWGDEFAPGGKLMANTWQGEFPWQNLNTDGFESTSP 252
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTG---------- 298
V S+ N FGL++M GNVWEWT DW+ H NP+GP T
Sbjct: 253 VGSFPPNTFGLFDMAGNVWEWTTDWYQERHKSQGGKPCCIPVNPRGPATADRSLDPCLPQ 312
Query: 299 ---TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+V KGGS+LC YC R+R AARS DS ++GFRC
Sbjct: 313 VKIPRRVLKGGSHLCAPNYCLRYRPAARSPQAVDSGTSHIGFRCVV 358
>gi|418398948|ref|ZP_12972500.1| hypothetical protein SM0020_02405 [Sinorhizobium meliloti
CCNWSX0020]
gi|359507040|gb|EHK79550.1| hypothetical protein SM0020_02405 [Sinorhizobium meliloti
CCNWSX0020]
Length = 319
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 130/348 (37%), Positives = 174/348 (50%), Gaps = 61/348 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+E +MV +PG TF MG+N E P VT+D F++D+ V+N QF EFV+ATG
Sbjct: 11 LELAAEMVWVPGATFMMGSNDHY---PEESPVHPVTVDGFWIDETPVTNRQFLEFVNATG 67
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+VT AE+ +P + A S +R F ++
Sbjct: 68 HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 104
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
G + ++W G +W HP G S+I ++HPVVHV+++DA
Sbjct: 105 ------GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAVLDHPVVHVAYSDA 146
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTEAEWE RGGL+ F WG+ L P G H AN WQG FP N+ DG
Sbjct: 147 RAYAEWVGKELPTEAEWERAARGGLDQAEFAWGDELAPGGNHMANTWQGSFPVENSMDDG 206
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
+ T+PV SY N +GLY+M+GNVWEWT D+W+V H A S +
Sbjct: 207 FARTSPVRSYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P +V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 267 PAARVKVPRRVLKGGSHLCAPDYCRRYRPAARHAQEIDTSTSHVGFRC 314
>gi|410915208|ref|XP_003971079.1| PREDICTED: sulfatase-modifying factor 2-like [Takifugu rubripes]
Length = 277
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/342 (37%), Positives = 175/342 (51%), Gaps = 71/342 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G MGTN KDGEFP R+V + +F +DQ+ V+N F++FV A Y TEAE
Sbjct: 1 MVDIRGGRMLMGTNAADG-KDGEFPPRDVKVQSFKIDQYPVTNADFRDFVRAQKYKTEAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFG +FVF+ +S+E ++KI+Q + W
Sbjct: 60 KFGWSFVFQDFVSDELKSKITQ------------------------------RIESAPWW 89
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E R F W P G S I R++ PVV VSWNDA A+C W
Sbjct: 90 ---LPVE-------------RAF----WRQPTGPGSGIRERLDFPVVQVSWNDAQAFCQW 129
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G RLPTE EWE+ RGGL+ R +PWGN +R+N+WQG FP + A DG+ +P
Sbjct: 130 KGKRLPTEEEWEWAARGGLQGRTYPWGNKFQ---ANRSNLWQGSFPEKDNADDGFHGVSP 186
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+M+GN WEWT+ K P V +G S++ +
Sbjct: 187 VSAFPPQNSYGLYDMMGNTWEWTS--------------TKFPGAQLMYVLRGASWIDTVD 232
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVK 353
+R R R NTPDS++ NLGFRCAA++G K K
Sbjct: 233 GSANHRARVTTRMGNTPDSASDNLGFRCAANQGQKQEKKKDK 274
>gi|372223308|ref|ZP_09501729.1| sulfatase-modifying factor protein [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 370
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 132/356 (37%), Positives = 180/356 (50%), Gaps = 51/356 (14%)
Query: 1 MVLLPAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
+V+ P +E DMV + G TF G + L E P VT++ FY+D HEV+N Q
Sbjct: 41 LVVSKLPNLEPPVDMVWIKGGTFYQGALPEDSLAMQHEKPRHKVTVNGFYMDMHEVTNDQ 100
Query: 60 FQEFVSATGYVTEAEK-FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHP 118
F EFV+AT YVT AE+ + + L E + S ++ F+ S++
Sbjct: 101 FAEFVAATNYVTVAERAISWDEMKKQLPPETPKPHDSILQPGSLVFKKSKSSLP------ 154
Query: 119 VVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNH 178
+ Y W EW+ G +W HP G S+IE + +
Sbjct: 155 -------NLYDYSQWW--------EWKIGA------------NWRHPNGPGSSIEGKGEY 187
Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
PVV +++ DA+AYC W+G RLPTEAEWEY R N ++ W L H AN W+GE
Sbjct: 188 PVVQIAYEDALAYCEWKGRRLPTEAEWEYATRAEKINDIYSWPGGLEQLSSH-ANTWEGE 246
Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH------------- 285
FP N DGY APV SY N FGLY+M GNVWEWT+DW++++++
Sbjct: 247 FPVENKLHDGYERRAPVKSYPANAFGLYDMAGNVWEWTSDWYHINYYANVSEIEELINPS 306
Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P +YNP P +KV KGGS+LC+ YC +R +AR DS A +LGFR
Sbjct: 307 GPEKAYNPSNPYV-QEKVIKGGSFLCSASYCASYRISARMATGLDSGAEHLGFRTV 361
>gi|345801202|ref|XP_536829.3| PREDICTED: sulfatase-modifying factor 2 [Canis lupus familiaris]
Length = 297
Score = 211 bits (537), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 176/338 (52%), Gaps = 72/338 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F+MGTN P ++GE P R VT+ F +D V+N F+EFV Y TEAE
Sbjct: 26 MVQLPGGRFQMGTNSPD-GRNGEGPVREVTVKPFAIDVFPVTNKDFREFVREKKYRTEAE 84
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE ++ E R K++ H + + W W
Sbjct: 85 MFGWSFVFEEFVANELRNKVT---------------------HQMESVLW--------W- 114
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E R F W P G +S I+ R+ PV+HVSWNDA AYC W
Sbjct: 115 ---LPVE-------------RAF----WRQPAGPNSGIQERLELPVLHVSWNDARAYCAW 154
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 155 KGKRLPTEEEWEFAARGGLKGQVYPWGNQFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 211
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNE 312
V ++ +N +GLY+++GNVWEWTA + P+ D +V +G S++
Sbjct: 212 VNAFPPQNNYGLYDLMGNVWEWTASLY--------------PSADQDMRVLRGASWIDTA 257
Query: 313 QYCYRH--RCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
H R R NTPDS++ NLGFRCA+ G G
Sbjct: 258 DGSANHLARVTTRMGNTPDSASDNLGFRCASSIGRLPG 295
>gi|254432243|ref|ZP_05045946.1| sulfatase-modifying factor 1 [Cyanobium sp. PCC 7001]
gi|197626696|gb|EDY39255.1| sulfatase-modifying factor 1 [Cyanobium sp. PCC 7001]
Length = 341
Score = 211 bits (537), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 134/346 (38%), Positives = 181/346 (52%), Gaps = 63/346 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TFRMG++ E VT++ F +D+HEV+N +F FV ATGYVT AE
Sbjct: 47 MEWIPGGTFRMGSDDHY---PDELSVDAVTVEGFCIDRHEVTNAEFDAFVKATGYVTVAE 103
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + P L++++RA + V FE E VAY +W
Sbjct: 104 RPIPKDQY-PDLADDQRAPGALV------FESPAPGSE---------------VAYLSW- 140
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W + P +W HP+G DS +E NHPVV V++ DA AY W
Sbjct: 141 ---------WHWT----------PGANWRHPQGPDSDLEGLANHPVVQVAFRDAEAYAAW 181
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEA+WE+ RGGL++++F WG + + +AN WQGEFP +N DGY TAP
Sbjct: 182 AGKALPTEAQWEFAARGGLKDQVFSWGKTYSAK---QANTWQGEFPIHNQEKDGYFGTAP 238
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAP-------SYNPKGPTTGTDKV 302
V S+ N +GL +M GNVWEWT DW+ H H P S++P+ P V
Sbjct: 239 VGSFPPNGYGLQDMTGNVWEWTQDWYRPGHQGMADHVNPAVSEQELSFDPREPGI-AKHV 297
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
KGGS+LC + YC R+R AAR PD+ ++GFR + GPT G
Sbjct: 298 IKGGSFLCAKNYCSRYRPAAREALEPDTGTSHIGFRLVS--GPTEG 341
>gi|338740518|ref|YP_004677480.1| sulfatase-modifying factor [Hyphomicrobium sp. MC1]
gi|337761081|emb|CCB66914.1| Sulfatase-modifying factor like (SUMF1-related); putative
C-alpha-formyglycine-generating enzyme or
methyltransferase or protein kinase [Hyphomicrobium sp.
MC1]
Length = 335
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 136/356 (38%), Positives = 179/356 (50%), Gaps = 63/356 (17%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P ER MV +P TFRMG+++ E PS VT+D F++D+ V+N QF++FVS T
Sbjct: 16 PTER-PGMVFVPSGTFRMGSDRHY---SEEAPSHRVTVDGFWIDETPVTNAQFRKFVSET 71
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
Y T AE P +E+ + H +K + + H + D
Sbjct: 72 RYTTFAEI--------PPKAEDYPGALP---HMLKAGSLVFTPPRHAV----------DL 110
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ W W++ R G +W HP G S + HPVVH+++ D
Sbjct: 111 RDWSQW----------WDF--RFG--------ANWRHPYGRGSWTKGLDEHPVVHIAYAD 150
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G LPTEAEWEY RGGLE+ F WG+ TP+G H AN WQG FP N D
Sbjct: 151 AEAYAMWAGKELPTEAEWEYAARGGLEDAEFAWGDEFTPQGRHMANTWQGTFPLENLKED 210
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APS-----YNPKGPTTGTD- 300
GY T+PV ++ N FGLY+M+GNVWEWT+DW++ H P AP NP+G
Sbjct: 211 GYERTSPVRAFAANGFGLYDMIGNVWEWTSDWYSTKHAPDAPKACCVPVNPRGAAREQSF 270
Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
KV KGGS+LC YC R+R AAR D+S ++GFRC P
Sbjct: 271 DPCQPNIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRCVVRTRP 326
>gi|359779150|ref|ZP_09282391.1| hypothetical protein ARGLB_113_00760 [Arthrobacter globiformis NBRC
12137]
gi|359303599|dbj|GAB16220.1| hypothetical protein ARGLB_113_00760 [Arthrobacter globiformis NBRC
12137]
Length = 338
Score = 211 bits (536), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 126/345 (36%), Positives = 167/345 (48%), Gaps = 67/345 (19%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDG----EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+M+ +PG TF MG+ DG E P+ VT+ F++D++ V+N +F FV+ TG+
Sbjct: 38 NMLWVPGGTFLMGS-------DGHYPEEAPAHRVTVSGFWMDRNTVTNREFARFVAETGH 90
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT AE+ D + P E S V P + D
Sbjct: 91 VTLAEQVPDPADY-PYAKPELLVPASSV-----------------FRQPTQRVDLGDPYN 132
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+ W Y P +W P+G S++ R +HPVVHV+W D
Sbjct: 133 W-------------WTY----------VPAANWRQPQGPGSSLNGRQDHPVVHVAWPDVE 169
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
+Y W G LPTEAEWE+ RGGL+ + WG+ TP G AN WQGEFP NT DGY
Sbjct: 170 SYAAWAGKSLPTEAEWEFAARGGLDGATYAWGDEFTPSGRWMANTWQGEFPIQNTEQDGY 229
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA---------------PSYNPKG 294
TAPV S+ N +GL +M+GNVWEWT+DW+ H A S +P
Sbjct: 230 AGTAPVGSFPANGYGLADMIGNVWEWTSDWYQAHAETAHSCCTVKDPRGGEQNRSNDPHD 289
Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P KV KGGS+LC YC R+R AAR D++ +LGFRC
Sbjct: 290 PVRIPRKVMKGGSHLCAPNYCQRYRPAARMPQAIDTATSHLGFRC 334
>gi|27380186|ref|NP_771715.1| hypothetical protein bll5075 [Bradyrhizobium japonicum USDA 110]
gi|27353340|dbj|BAC50340.1| bll5075 [Bradyrhizobium japonicum USDA 110]
Length = 338
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 135/353 (38%), Positives = 181/353 (51%), Gaps = 66/353 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TFRMG+++ E P+ VT+D F++D+ V+N QF++FV+ATGYVT AE
Sbjct: 32 MFYVPGGTFRMGSDRHY---PEEAPAHLVTVDGFWMDRTPVTNRQFRKFVNATGYVTFAE 88
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH-PVVHISWNDAVAYCTW 133
+ D K + G + H + +V AV W
Sbjct: 89 ----------------------ITPDAKDYPG---ALPHMLKAGSLVFTPPKRAVDLHNW 123
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
+W F +G+ W P G S I +HPVVHV++ D AY
Sbjct: 124 --------GQWW----------TFKFGANWRRPYGPRSNISGLDDHPVVHVAYRDVEAYA 165
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ F WG+ L PRG H AN WQG FP N AADG+ T
Sbjct: 166 KWAGKELPTEAEWEFAARGGLDGAEFAWGDELFPRGRHMANTWQGNFPHQNLAADGFERT 225
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----------PSYNPKGPTTGT--- 299
+PV ++ N +GL++M+GNVWEWTADW++ H PS P+G + +
Sbjct: 226 SPVTAFPANGYGLHDMIGNVWEWTADWYSQKHEADALKVCCIPQNPSGGPEGASYDSCQP 285
Query: 300 -----DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
KV KGGS+LC YC R+R AAR D+SA ++GFRC A K T+
Sbjct: 286 NLRIPRKVLKGGSHLCAPSYCRRYRPAARHAEPIDTSASHVGFRCIARKRITS 338
>gi|307152366|ref|YP_003887750.1| hypothetical protein Cyan7822_2503 [Cyanothece sp. PCC 7822]
gi|306982594|gb|ADN14475.1| protein of unknown function DUF323 [Cyanothece sp. PCC 7822]
Length = 346
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 126/341 (36%), Positives = 182/341 (53%), Gaps = 58/341 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M + G TF +G+N+ E + ++T+ +F +D++EV+N +F +FV TGY+T AE
Sbjct: 43 MAFIRGGTFNIGSNE---YYPSEHAAEDITVSSFCIDKYEVTNQEFAKFVEETGYLTVAE 99
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ F P L +E+RA S V F+ D I + Y +W
Sbjct: 100 RPLSPVQF-PDLPDEQRAPGSLV------FQMQDKDIMQ--------------IGYLSW- 137
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W++ P +W HP G +S I+ + N+PVVHV++ D AY W
Sbjct: 138 ---------WKWT----------PGANWKHPFGPESDIKGKDNYPVVHVAYEDVEAYAKW 178
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEA+WEY RGGL+++ + WG+ + + +AN WQG FP N DGY TAP
Sbjct: 179 AGLSLPTEAQWEYAARGGLDDKTYSWGDQYSAK---KANTWQGIFPIFNLKVDGYKGTAP 235
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTT---------GTDK-VK 303
V S++ N +GLY++ GNVWEWT+DW+ + H + A NP GP+ G K V
Sbjct: 236 VGSFEPNGYGLYDITGNVWEWTSDWYRIGHDNKAHQVNPTGPSVEESFDPREPGIAKHVI 295
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
KGGSYLC YC R+R AAR +PD+ ++GFR + G
Sbjct: 296 KGGSYLCAPNYCSRYRPAARESQSPDTGTTHIGFRLIKNLG 336
>gi|92118474|ref|YP_578203.1| hypothetical protein Nham_2985 [Nitrobacter hamburgensis X14]
gi|91801368|gb|ABE63743.1| protein of unknown function DUF323 [Nitrobacter hamburgensis X14]
Length = 341
Score = 211 bits (536), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 142/358 (39%), Positives = 180/358 (50%), Gaps = 62/358 (17%)
Query: 1 MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
+L PA ER DMV +PG TFRMG++ E PS VT+D F++D+ V+N QF
Sbjct: 9 QLLGPAADGERTADMVWIPGGTFRMGSDH---HYPEEAPSHRVTVDGFWIDKTPVTNRQF 65
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV ATG+VT AE D + L E A S + PV
Sbjct: 66 KQFVRATGHVTVAEVPPDPKDYPGALPEMIYA---------------GSLVFAPPSGPVD 110
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
W A W +G W HP G S I + +HPV
Sbjct: 111 LREW-----------------ARWWTFMKGA---------DWRHPYGPKSNINVQDHHPV 144
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
VH+++ DA+AY W G LPTEAEWE+ RGGLE F WGN TP G H AN WQGEFP
Sbjct: 145 VHIAYADALAYADWAGKALPTEAEWEFAARGGLEETEFAWGNEFTPNGRHMANTWQGEFP 204
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-AP----------- 288
N A DGY T+PV ++ N +GL++M+GNVWEWT+DW+ H AP
Sbjct: 205 RQNLAEDGYERTSPVTAFPPNGYGLHDMIGNVWEWTSDWYTPKHDADAPKACCIPENPRG 264
Query: 289 -----SYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
SY+P P+ KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 265 GREDASYDPCQPSIKIPRKVLKGGSHLCAPNYCRRYRPAARHAEPVDTSTSHVGFRCV 322
>gi|41410272|ref|NP_963108.1| hypothetical protein MAP4174 [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|440779649|ref|ZP_20958361.1| hypothetical protein D522_23846 [Mycobacterium avium subsp.
paratuberculosis S5]
gi|41399106|gb|AAS06724.1| hypothetical protein MAP_4174 [Mycobacterium avium subsp.
paratuberculosis K-10]
gi|436719846|gb|ELP44186.1| hypothetical protein D522_23846 [Mycobacterium avium subsp.
paratuberculosis S5]
Length = 296
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 127/337 (37%), Positives = 172/337 (51%), Gaps = 50/337 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++ LPG +FRMG+ E P VT++ F +++H V+N QF EFV+ATGYVT
Sbjct: 3 SELADLPGGSFRMGSTS---FYPEEAPIHTVTVEPFAIERHPVTNAQFAEFVAATGYVTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ A SQ++ F ++ R
Sbjct: 60 AEQPIDPALYP-------GADPSQLQPGAMVFRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR +W + P SW HP G DS + R +HPVV V++ DA AY
Sbjct: 96 WR--------QWWH---------WVPGASWRHPFGPDSDVADRADHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G +PWG+ T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAGATT-TYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV + N FGL +M+GNVWEWT ++ HH P P GP ++ KGGS
Sbjct: 198 SPVGVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPSVNQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+LC +YC+R+R AARS + DS+ ++GFRC A+ G
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAELG 294
>gi|443666496|ref|ZP_21133781.1| hypothetical protein C789_4321 [Microcystis aeruginosa DIANCHI905]
gi|159030099|emb|CAO90991.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443331217|gb|ELS45887.1| hypothetical protein C789_4321 [Microcystis aeruginosa DIANCHI905]
Length = 322
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 129/352 (36%), Positives = 173/352 (49%), Gaps = 64/352 (18%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
AP +KD++ +PG TF MG+N ++G P VT+ F++D++ V+N QF++FV
Sbjct: 10 APGKPPHKDLIWIPGGTFEMGSNSRKYPEEG--PVHTVTVSGFWMDKYLVTNKQFRKFVK 67
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
TGY+T AEK P + A + F D ++ R
Sbjct: 68 ETGYITFAEK-------PPKAEDYPNADPEILVPGSAVFVKPDRPVDPR----------- 109
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
+C W+ P +W HPEG DS+I++R NHPVVH+ +
Sbjct: 110 ---TFCWWQ---------------------YMPGANWRHPEGPDSSIKNRENHPVVHIVY 145
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
DA+AY W G LPTEA+WE+ RGGLE F WG+ P G+ AN WQG+FP N
Sbjct: 146 EDALAYAKWAGKELPTEAQWEFAARGGLEGAEFAWGDEFIPNGKVMANTWQGQFPWQNLK 205
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKGPTTGTD-- 300
A V SY N +GLY+M+GNVWEWT DW H P+ PK P GT+
Sbjct: 206 PHAPGPEA-VESYPPNGYGLYDMIGNVWEWTQDWDQEQHPKNPTKACCTPKNPLGGTEED 264
Query: 301 --------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KV KGGS+LC YC R+R AAR D+S ++GFR
Sbjct: 265 SYNLKLHPSMRKPRKVLKGGSFLCAPNYCARYRPAARHPEDIDTSTNHIGFR 316
>gi|326931322|ref|XP_003211781.1| PREDICTED: sulfatase-modifying factor 2-like [Meleagris gallopavo]
Length = 285
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 125/334 (37%), Positives = 170/334 (50%), Gaps = 70/334 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F+MG++ +D E P R VT+ F +D+ V+N F+EFV Y TEAE
Sbjct: 1 MVQLPGGKFQMGSSS-TQSRDEEGPIREVTVKPFAIDKFPVTNRDFREFVRERKYKTEAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +SEE + K++Q + W
Sbjct: 60 AFGWSFVFEDFVSEELKKKVTQ------------------------------KLESAPWW 89
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+++PV+HVSWNDA A+C+W
Sbjct: 90 ---LPVEKAF-----------------WRQPAGPGSGIADRLDNPVLHVSWNDAQAFCSW 129
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G RLPTE EWE+ RGGLE RL+PWGN P +R N+WQG+FP + A DGY +P
Sbjct: 130 KGKRLPTEEEWEFAARGGLEQRLYPWGNKFQP---NRTNLWQGDFPRGDRAEDGYHGVSP 186
Query: 255 VMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT---TGTDKVKKGGSYLC 310
V ++ +N +G+Y+++GN WEWTA Y P GP + +G S++
Sbjct: 187 VAAFSPQNNYGIYDLLGNTWEWTAS----------QYKPPGPPQPHAEAMRALRGASWID 236
Query: 311 NEQYCYRHRC--AARSQNTPDSSAGNLGFRCAAD 342
HR R NTPDS++ NL FRCAA+
Sbjct: 237 TVDGSANHRACITTRMGNTPDSASDNLSFRCAAN 270
>gi|406660322|ref|ZP_11068455.1| Serine/threonine-protein kinase pkn1 [Cecembia lonarensis LW9]
gi|405555946|gb|EKB50931.1| Serine/threonine-protein kinase pkn1 [Cecembia lonarensis LW9]
Length = 359
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 126/341 (36%), Positives = 170/341 (49%), Gaps = 48/341 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MGTN+ + E P+ + F++D+ EV+N QF EF+ AT YVT AE
Sbjct: 52 MVWIPGGEFYMGTNE-MESYAVERPAVKKKVKGFWMDETEVTNAQFAEFIQATSYVTVAE 110
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D E ++ K + L P ++ ND + W
Sbjct: 111 RAIDW----------EELRLQLPPDTSKPDDELLQPGSLVFDPPSQPVALNDISQWWKWV 160
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+W +PEG S+I+ R +HPVVH+++ DAVAY TW
Sbjct: 161 TG-----------------------ANWKNPEGPGSSIKGREDHPVVHIAYEDAVAYATW 197
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTE EWEY +GG + + WG P G AN +QG+FP N A DG+ TAP
Sbjct: 198 AGKRLPTETEWEYAAKGGRVHERYAWGEAFKPGGVFMANTFQGKFPHQNVAEDGFEGTAP 257
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APS--------YNPKGPTTGTDK 301
V S+ N FGLY+M+GNVWE T+DW++ AP+ YNP P +K
Sbjct: 258 VKSFSPNDFGLYDMIGNVWELTSDWYDAISFARIAGDAPALDSSINLCYNPSNP-FALEK 316
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V KGGS+LC + YC +R +AR DS N+GFRC D
Sbjct: 317 VIKGGSFLCADDYCINYRPSARQSQAYDSGTSNIGFRCVKD 357
>gi|374611475|ref|ZP_09684261.1| protein of unknown function DUF323 [Mycobacterium tusciae JS617]
gi|373549185|gb|EHP75858.1| protein of unknown function DUF323 [Mycobacterium tusciae JS617]
Length = 292
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 122/333 (36%), Positives = 176/333 (52%), Gaps = 49/333 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +PG +FRMG+ + E P+ T+ A +++H V+N QF EF+ ATGY+T A
Sbjct: 4 ELVEIPGGSFRMGSTQ---FYPEEAPAHTATVAAIAVERHPVTNAQFAEFIDATGYLTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ P ++ E+ + V F ++ R D + TW
Sbjct: 61 EQPMDPALY-PGVAPEDLVPGALV------FRPTSGPVDLR-----------DWRQWWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P +W P G+DSTI+ R++HPVV V++ DA AY
Sbjct: 103 E-----------------------PGANWRQPFGVDSTIDDRLDHPVVQVAYPDAAAYAL 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG+ +PWG + P G AN WQG FP N A G+ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGGV-TTTYPWGEDAAPEGRLMANTWQGRFPYLNEGALGWAGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTDKVKKGGSYL 309
PV +++ N FGL +M+GNVWEWT ++ HH + P ++ KGGS+L
Sbjct: 199 PVGTFEANAFGLLDMIGNVWEWTTTEFSTHHRLDENKGCCTPPATADPAVNQTLKGGSHL 258
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
C +YC+R+R AARS + DS+ ++GFRC AD
Sbjct: 259 CAPEYCHRYRPAARSPQSQDSATTHIGFRCVAD 291
>gi|416402665|ref|ZP_11687414.1| protein of unknown function DUF323 [Crocosphaera watsonii WH 0003]
gi|357261846|gb|EHJ11066.1| protein of unknown function DUF323 [Crocosphaera watsonii WH 0003]
Length = 268
Score = 210 bits (535), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 173/312 (55%), Gaps = 56/312 (17%)
Query: 45 LDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRF 104
+D+F +D+HEV+N +F++FV ATGY T AE+ F P LSEE+R S V F
Sbjct: 1 MDSFCIDRHEVTNAEFRQFVEATGYQTIAERPLSKEQF-PTLSEEQRQAGSLV------F 53
Query: 105 EGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLH 164
+ IE VAY +W W G +W H
Sbjct: 54 QPPSEGIEQ--------------VAYLSWW--------HWTVGA------------NWQH 79
Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL 224
P G +S I+ + N+PV+H++++DAVAY W+G LPTEA+WEY RGGL+++ F WGN
Sbjct: 80 PYGPNSNIKGKDNYPVIHIAYDDAVAYANWKGKTLPTEAQWEYAGRGGLKDQDFSWGNQY 139
Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
+ + +AN WQG FP NT DG+L TAPV S+ N +GLY+M GNVWE T+DW++V H
Sbjct: 140 SAK---KANTWQGIFPFLNTKQDGHLGTAPVESFPPNGYGLYDMTGNVWELTSDWYSVGH 196
Query: 285 -------HPA-----PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 332
+P S++P P+ G+ V K GSYLC + YC R+R AAR PD+
Sbjct: 197 GGKEHSLNPTGPTKNASFDPTKPSEGSLHVIKRGSYLCAKNYCSRYRPAARESQAPDTGT 256
Query: 333 GNLGFRCAADKG 344
++GFR + G
Sbjct: 257 THVGFRLVSVDG 268
>gi|150375937|ref|YP_001312533.1| hypothetical protein Smed_3787 [Sinorhizobium medicae WSM419]
gi|150030484|gb|ABR62600.1| protein of unknown function DUF323 [Sinorhizobium medicae WSM419]
Length = 319
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/348 (37%), Positives = 176/348 (50%), Gaps = 61/348 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+E +MV +PG TF MG+N E P+ VT+D F++D+ V+N QF EFV+ATG
Sbjct: 11 LEPAAEMVWVPGATFMMGSNDHY---PEEAPAHPVTVDGFWIDEAPVTNRQFLEFVNATG 67
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+VT AEK +P + A S +R F P + ND
Sbjct: 68 HVTFAEK-------KPRAEDYPGAPPSNLRAGSLVFTP-----------PKRPLQGNDI- 108
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
++W G +W HP G S+I ++HPVVH++++DA
Sbjct: 109 -------------SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHIAYDDA 146
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTEAEWE RG L+ F WG+ L P G+H AN WQG FP N DG
Sbjct: 147 KAYAEWAGKELPTEAEWELAARGDLDEAEFAWGDELVPDGKHMANTWQGRFPIENAMEDG 206
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY------NPKG-------- 294
+ T+PV Y N +GLY+M+GNVWEWT D+W+V H A + NP+
Sbjct: 207 FARTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADVEASVD 266
Query: 295 PTTGTD---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P G +V KGGS+LC YC R+R AAR D++ ++GFRC
Sbjct: 267 PAAGVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRC 314
>gi|282891805|ref|ZP_06300286.1| hypothetical protein pah_c197o126 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|281498389|gb|EFB40727.1| hypothetical protein pah_c197o126 [Parachlamydia acanthamoebae str.
Hall's coccus]
Length = 390
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/340 (37%), Positives = 179/340 (52%), Gaps = 49/340 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG++ D E P V +D F++D V+N QF+EF+ ATGYVT AE
Sbjct: 41 MVWIPGGEFMMGSDYEDSKPD-EKPQHRVKVDGFWMDATPVTNKQFKEFIDATGYVTTAE 99
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV-VHISWNDAVAYCTW 133
+ P L+E ++QV + P V I N++ A+ W
Sbjct: 100 R-------APTLNE----IMNQVPPGTPSPSPELLVAASLVFKPSDVPIPLNNSHAWWEW 148
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+ P +W HP G +STIE + +HPVV VSW+DA AY
Sbjct: 149 K-----------------------PGANWKHPLGPESTIEEKKDHPVVQVSWDDAQAYAK 185
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ GG ++ ++ WGN+ +AN+WQG FP +T + GY T
Sbjct: 186 WAGKRLPTEAEWEFAAYGGRKDIMYVWGNDEFSEEVPQANIWQGIFPYKSTKSKGYNGTT 245
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTT--------GTD 300
PV ++K N +GLY+M GNVW+W +D +++ ++ S NP GP T T
Sbjct: 246 PVTTFKPNPYGLYDMSGNVWQWCSDLYHMTYYQEEAKKELSINPAGPKTSFDPREPYATK 305
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
V +GGS+LC++ YC +R AR + +PD+S +LGFRC
Sbjct: 306 HVHRGGSFLCHDSYCKGYRITARMKTSPDTSLNHLGFRCV 345
>gi|389696270|ref|ZP_10183912.1| hypothetical protein MicloDRAFT_00060970 [Microvirga sp. WSM3557]
gi|388585076|gb|EIM25371.1| hypothetical protein MicloDRAFT_00060970 [Microvirga sp. WSM3557]
Length = 303
Score = 210 bits (534), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 168/344 (48%), Gaps = 62/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF MG+N E P VT+D F++D V+N QF+ F+ ATG+VT AE
Sbjct: 1 MVRIPGGTFAMGSNDHY---PEEAPVHRVTVDGFWMDAAPVTNRQFKSFIQATGHVTTAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D R +M + L T HPV W+ +
Sbjct: 58 TIPDA------------KDYPGARREMLKAGSLLFTPPS---HPVDLRDWSQWWTFA--- 99
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+G +W P G +S ++ +HPVVHVS DA+AY W
Sbjct: 100 -----------FGV------------NWKRPNGPNSNLKGLEDHPVVHVSHTDALAYARW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGL+ F WG+ LTP G AN WQG+FP NT DGY T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGLDGAEFAWGDELTPDGRQMANTWQGQFPFQNTMLDGYERTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYNPKGPTT 297
V SY N +GL +M+GNVWEWT DW+ H H SY+P+ P
Sbjct: 197 VGSYPANGYGLVDMIGNVWEWTNDWYTPKHPDDAPKACCIPENPRGGHEDSSYDPRQPEI 256
Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KV KGGS+LC YC R+R AAR + D+S ++GFRC
Sbjct: 257 RIPRKVVKGGSHLCAPNYCRRYRPAARHPHPVDTSTSHIGFRCV 300
>gi|425449405|ref|ZP_18829245.1| Sulfatase-modifying factor 1 [Microcystis aeruginosa PCC 7941]
gi|425464892|ref|ZP_18844202.1| Sulfatase-modifying factor 1 [Microcystis aeruginosa PCC 9809]
gi|389763913|emb|CCI09649.1| Sulfatase-modifying factor 1 [Microcystis aeruginosa PCC 7941]
gi|389832957|emb|CCI22951.1| Sulfatase-modifying factor 1 [Microcystis aeruginosa PCC 9809]
Length = 322
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/352 (36%), Positives = 174/352 (49%), Gaps = 64/352 (18%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
AP +KD++ +PG TF MG+N ++G P VT+ F++D++ V+N QF++FV
Sbjct: 10 APGKPPHKDLIWIPGGTFEMGSNSRKYPEEG--PVHTVTVSGFWMDKYLVTNKQFRKFVK 67
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
TGY+T AEK P + A + F D ++ R + W
Sbjct: 68 ETGYITFAEK-------PPKAEDYPNADPEILVPGSAVFVKPDRPVDPRT------VCW- 113
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
W+Y P +W HPEG S+I++R NHPVVH+ +
Sbjct: 114 ------------------WQY----------MPGANWRHPEGTGSSIKNRENHPVVHIVY 145
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
DA+AY W G LPTEA+WE+ RGGLE F WG+ P G+ AN WQG+FP N
Sbjct: 146 EDALAYAKWAGKELPTEAQWEFAARGGLEGAEFAWGDEFMPNGKVMANTWQGQFPWQNLK 205
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKGPTTGTD-- 300
A V SY N +GLY+M+GNVWEWT DW+ H P+ PK P GT+
Sbjct: 206 PHAPGPEA-VESYPPNGYGLYDMIGNVWEWTQDWYQEQHPKNPTKACCTPKNPLGGTEED 264
Query: 301 --------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KV KGGS+LC YC R+R AAR D+S ++GFR
Sbjct: 265 SYNLKLHPSMRKPRKVLKGGSFLCAPNYCARYRPAARHPEDIDTSTNHIGFR 316
>gi|443478841|ref|ZP_21068539.1| Sulphatase-modifying factor protein [Pseudanabaena biceps PCC 7429]
gi|443015808|gb|ELS30618.1| Sulphatase-modifying factor protein [Pseudanabaena biceps PCC 7429]
Length = 332
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 131/338 (38%), Positives = 175/338 (51%), Gaps = 58/338 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G TFRMG + + E + NV++ +F +D HEV+N +F +FV TGY T AE
Sbjct: 34 MVFINGGTFRMGADSDRFPE--ERAADNVSVSSFCIDAHEVTNAEFAKFVKETGYKTVAE 91
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
F P L +E+R S V K D+ VA+ +W
Sbjct: 92 IPLSKEEF-PDLPDEQRLAGSLVFQPPKE----DA----------------QTVAFLSWW 130
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W G +W HP G DS+I+++ N+PVVH+S+ DA AY W
Sbjct: 131 --------TWTVGA------------NWQHPFGKDSSIKNKDNYPVVHISYKDAEAYAKW 170
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G +PTEA+WEY RGGL++ + WG + E RAN WQG FP NT ADGY AP
Sbjct: 171 IGKSIPTEAQWEYAARGGLDSATYAWGEQYS---EKRANTWQGIFPFFNTKADGYKGLAP 227
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNV-------HHHPAP-----SYNPKGPTTGTDKV 302
V S+ N +GLY+M GNVWEWT+D++ V H+P S++PK P V
Sbjct: 228 VGSFPPNGYGLYDMTGNVWEWTSDFFEVGHDRMAHQHNPIASDRHKSFDPKKPDESVLHV 287
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR +PD+ ++GFR
Sbjct: 288 IKGGSYLCAPNYCSRYRPAARESESPDTGTTHIGFRLV 325
>gi|409389219|ref|ZP_11241071.1| putative sulfatase-modifying factor [Gordonia rubripertincta NBRC
101908]
gi|403200511|dbj|GAB84305.1| putative sulfatase-modifying factor [Gordonia rubripertincta NBRC
101908]
Length = 309
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 128/331 (38%), Positives = 170/331 (51%), Gaps = 60/331 (18%)
Query: 14 DMVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ V +P F MG + DGE P +V + AF +D V+N F FV ATGYVT+
Sbjct: 32 ETVEVPAGRFAMGDALDEGYRTDGEVPVHDVEVGAFSVDVTSVTNAAFAAFVEATGYVTD 91
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE RA IS V H Y T
Sbjct: 92 AE----------------RAGISAVFH-----------------------------GYAT 106
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
G +P W P SW HP G DS + + HPVVHVS +DA+AYC
Sbjct: 107 APGEPVPETPWW----------LATPGASWRHPGGSDSDLTGKTTHPVVHVSHDDALAYC 156
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
+W G LPTEA+WEY RGGL +PWG+ R E R N+++G FP +G++ T
Sbjct: 157 SWAGRSLPTEAQWEYAARGGLHGARYPWGDRHPGREEPRCNIFRGVFPDR---PEGHVGT 213
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCN 311
V +++ N +GLY VGNVWEW AD ++ ++ + S +P GPT GT +V +GGS+LC+
Sbjct: 214 VDVRAFEPNGYGLYQCVGNVWEWCADRFSARYYRTSDSVDPTGPTRGTLRVLRGGSHLCH 273
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+ YC R+R AARS NTP+S+A N+GFR D
Sbjct: 274 DSYCNRYRVAARSSNTPNSTASNIGFRTVGD 304
>gi|383458407|ref|YP_005372396.1| hypothetical protein COCOR_06443 [Corallococcus coralloides DSM
2259]
gi|380730928|gb|AFE06930.1| hypothetical protein COCOR_06443 [Corallococcus coralloides DSM
2259]
Length = 333
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 131/347 (37%), Positives = 171/347 (49%), Gaps = 63/347 (18%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ DMV +PG T+ MG++ E P+ VT+ F++D+ V+N QF FV ATGYVT
Sbjct: 27 FPDMVWIPGGTYWMGSDHHY---PEEAPAHQVTVSGFWMDRFTVTNEQFARFVEATGYVT 83
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
A++ PL + + S + + PV D
Sbjct: 84 VAQR--------PLNPADYPGATPETLVP-------GSLVFQKPQGPV------DLGNVA 122
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W W Y P W HPEG S+++HR +HPVVH+++ DA +Y
Sbjct: 123 NW----------WGY----------VPDACWKHPEGRRSSVKHRRDHPVVHIAFEDAASY 162
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWE RGGL+ F WGN+ TP GEH AN WQG FP N DG+
Sbjct: 163 AAWAGKDLPTEAEWERAARGGLDRNEFCWGNDFTPNGEHLANTWQGYFPWQNLREDGHEG 222
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKGPTT--GTD--- 300
T PV ++ N +GL+ M GNVWEWT DW+ H NP+GP T G+
Sbjct: 223 TCPVGAFPPNGYGLHEMTGNVWEWTTDWYQARHQGNKGKACCIPVNPRGPATEHGSQDLF 282
Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AARS DS A ++GFRC
Sbjct: 283 TSAITIPRRVLKGGSHLCAPNYCRRYRPAARSPQAVDSGASHIGFRC 329
>gi|118465519|ref|YP_884152.1| sulfatase-modifying factor 1 [Mycobacterium avium 104]
gi|118166806|gb|ABK67703.1| sulfatase-modifying factor 1 [Mycobacterium avium 104]
Length = 292
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/333 (38%), Positives = 175/333 (52%), Gaps = 50/333 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+++ LPG F MG+N E P VT+ F +++H V+N QF EFV TGYVT A
Sbjct: 4 ELIDLPGGAFLMGSNS---FYPEESPVHEVTVRPFSIERHPVTNAQFAEFVDQTGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
EK D F + +E D+K + + PV +W + TW
Sbjct: 61 EKALDPKDFPGVPAE-----------DLKP----GALVFTPTAGPVDLRAWRQ---WWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P W P G S+IE R++HPVV V++ DA AY +
Sbjct: 103 -----------------------MPGACWRQPFGPGSSIEDRLDHPVVQVAYPDAAAYAS 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLP+EA+WEY R G + + WG+++ P GE AN WQG+FP N A G+ T+
Sbjct: 140 WAGRRLPSEAQWEYAARAGTSS-AYAWGDDVAPGGEIMANTWQGKFPYRNDGARGWTGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---APSYNPKGPT--TGTDKVKKGGSY 308
PV ++ N FGL +M+GNVWEWT+ ++ HHP APS P PT + KGGS+
Sbjct: 199 PVGTFAPNGFGLVDMIGNVWEWTSTRYSAGHHPGRAAPSCCPTSPTGDPAVLQTLKGGSH 258
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
LC +YC+R+R AARS + DS+ ++GFRC A
Sbjct: 259 LCAPEYCHRYRPAARSSQSQDSATTHIGFRCIA 291
>gi|384218660|ref|YP_005609826.1| hypothetical protein BJ6T_49760 [Bradyrhizobium japonicum USDA 6]
gi|354957559|dbj|BAL10238.1| hypothetical protein BJ6T_49760 [Bradyrhizobium japonicum USDA 6]
Length = 331
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 175/344 (50%), Gaps = 64/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ +PG TFRMG+++ E P VT+DAF++D H V+N QF+ F TG+VT
Sbjct: 24 MLYVPGGTFRMGSDR---HYPEEAPVHRVTVDAFWMDCHPVTNRQFKAFAKTTGHVT--- 77
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
++++ D K + G+ + + +
Sbjct: 78 -------------------VAEIPPDPKDYPGI--------------------LPHMVYA 98
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
G+ + + + R + F G+ W HP G S I +HPVVHV+++DA+AY
Sbjct: 99 GSLMFSPPAYPVDLRDFSQWWTFAKGADWRHPYGPGSDIRGLDDHPVVHVAFSDALAYAR 158
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGL+ F WG+ P G H AN WQGEFP NT DGY T+
Sbjct: 159 WAGKDLPTEAEWEFAARGGLDGAEFAWGDEFAPGGRHMANTWQGEFPRQNTGDDGYERTS 218
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----------------APSYNPKGPT 296
PV ++ N +GL++++GNVWEWT+DW++ H A SY+P+
Sbjct: 219 PVTAFPPNGYGLHDLIGNVWEWTSDWYSPRHEADAAKACCIPENPRGGREADSYDPRTIN 278
Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D+S +LGFRC
Sbjct: 279 VKIPRKVIKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHLGFRC 322
>gi|448728409|ref|ZP_21710737.1| hypothetical protein C449_01471 [Halococcus saccharolyticus DSM
5350]
gi|445796891|gb|EMA47376.1| hypothetical protein C449_01471 [Halococcus saccharolyticus DSM
5350]
Length = 330
Score = 209 bits (532), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/343 (36%), Positives = 168/343 (48%), Gaps = 62/343 (18%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+PG TFRMG+ + E P R V++D F++++ +N +F FV+ TGY T AE+
Sbjct: 29 IPGGTFRMGSEE---FYPEERPIREVSIDGFWMNRTPTTNREFAAFVADTGYTTVAER-- 83
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
+P ++ A + F D + R W
Sbjct: 84 -----DPDPTDYPDADPDALVAGSAVFRQPDGPVNRRDPS-----QW------------- 120
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
WEY P W HP G DS IE RM+HPVVHV++ DA A+ W G
Sbjct: 121 ------WEY----------VPGAEWRHPLGPDSDIEGRMDHPVVHVAYEDAAAFAEWAGK 164
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
LPTEA+WE RGGLE F WG+ P G+ AN WQGEFP N DGY T+PV S
Sbjct: 165 TLPTEAQWERAARGGLEGTRFVWGDEHVPEGQLMANTWQGEFPQENKTLDGYERTSPVGS 224
Query: 258 YKENKFGLYNMVGNVWEWTADWWNV-----HHHPA-------------PSYNPKGPTTGT 299
+ N FGLY+M GNVW+WT DW++ P+ S +P+ P+
Sbjct: 225 FPANGFGLYDMAGNVWDWTQDWYSADPSTNDAEPSCCTPTGSRTVTEEQSVDPRDPSGIP 284
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KV KGGS+LC YC+R+R AAR D+S ++GFRC +
Sbjct: 285 RKVLKGGSHLCAPNYCFRYRPAARYPEPVDTSTSHVGFRCVVE 327
>gi|351714038|gb|EHB16957.1| Sulfatase-modifying factor 1 [Heterocephalus glaber]
Length = 286
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/332 (38%), Positives = 164/332 (49%), Gaps = 109/332 (32%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MGT+ P + +DGE P+R V +DAFY D +E+SN +F++FV++TGY+TEAE
Sbjct: 62 MVLIPAGVFIMGTDGPQIKQDGEAPARRVAVDAFYTDAYEISNAEFEKFVNSTGYLTEAE 121
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFG +FVFE +L +
Sbjct: 122 KFGHSFVFEGMLRK---------------------------------------------- 135
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
RLPTEA WE+ CRGGL+NRLFPWG+ L P+ +H N W
Sbjct: 136 --RLPTEAGWEHSCRGGLQNRLFPWGNKLQPKD-----QHYAN---------------LW 173
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G P E G +G FP P G
Sbjct: 174 QG-EFPVTNTGEDGFQGTALVDAFP------PNG-------------------------- 200
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
+G YN+VGNVWEWT DWW ++H + N KGP +G D V+K GSY+C++ Y
Sbjct: 201 --------YGSYNIVGNVWEWTLDWWTIYHSVEETLNLKGPPSGKDLVRKAGSYMCHKSY 252
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
C R+RCAARSQN PDSSA NLGF CAA++ PT
Sbjct: 253 CCRYRCAARSQNAPDSSALNLGFCCAANRLPT 284
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 90/195 (46%), Positives = 110/195 (56%), Gaps = 50/195 (25%)
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
RLPTEA WE+ CRGGL+NRLFPWGN L P+ +H AN+WQGEFP NT DG+ TA V +
Sbjct: 136 RLPTEAGWEHSCRGGLQNRLFPWGNKLQPKDQHYANLWQGEFPVTNTGEDGFQGTALVDA 195
Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYR 317
+ N +G YN+VGNVWEWT DWW ++H + N
Sbjct: 196 FPPNGYGSYNIVGNVWEWTLDWWTIYHSVEETLN-------------------------- 229
Query: 318 HRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNT 377
KGP +G D V+K GSY+C++ YC R+RCAARSQN
Sbjct: 230 ------------------------LKGPPSGKDLVRKAGSYMCHKSYCCRYRCAARSQNA 265
Query: 378 PDSSAGNLGFRCAAD 392
PDSSA NLGF CAA+
Sbjct: 266 PDSSALNLGFCCAAN 280
>gi|183981065|ref|YP_001849356.1| hypothetical protein MMAR_1043 [Mycobacterium marinum M]
gi|183174391|gb|ACC39501.1| conserved protein [Mycobacterium marinum M]
Length = 293
Score = 209 bits (531), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 175/335 (52%), Gaps = 50/335 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ E P +V++ AF ++++ V+N QF EFV+ATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTS---FYPEEAPIHSVSVHAFAIERYPVTNAQFAEFVAATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ + ++ R R + D W
Sbjct: 61 EQPLDPALYPGVNPDDLRPGAMVFRPTAGPVDLRD------------------------W 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P SW HP G DS ++ R +HPVV V++ DA AY
Sbjct: 97 R-------QWWDWA----------PGASWRHPWGPDSDVDDRADHPVVQVAYPDAAAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RG + WG+ P G+ AN WQG+FP N A G+ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGATTT-TYAWGDEEKPAGQLMANTWQGKFPYRNDGALGWFGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGT-DKVKKGGSY 308
PV ++ N FGL++M+GNVWEWT ++ HH P P GP T + KGGS+
Sbjct: 199 PVGTFPANGFGLFDMIGNVWEWTTTPFSAHHRIDQPPKACCAPAGPADPTISQTLKGGSH 258
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
LC +YC+R+R AARS + D++ ++GFRC AD+
Sbjct: 259 LCAPEYCHRYRPAARSPQSQDTATTHIGFRCVADR 293
>gi|387793100|ref|YP_006258165.1| hypothetical protein Solca_3996 [Solitalea canadensis DSM 3403]
gi|379655933|gb|AFD08989.1| hypothetical protein Solca_3996 [Solitalea canadensis DSM 3403]
Length = 392
Score = 209 bits (531), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 176/350 (50%), Gaps = 61/350 (17%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDG-------EFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
K MV +PG F MG P+ +++G P V +D FY+D+ EV+N +F FV
Sbjct: 82 KGMVFIPGGEFSMGGVNPVGMENGGQQNMNDARPIHRVYVDGFYMDETEVTNAEFTAFVK 141
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
ATGY+T AE+ F P EE S V P +S N
Sbjct: 142 ATGYITIAERKPTREEF-PGAPEENLVAGSVV-----------------FTPPDHDVSLN 183
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
D + W W+Y +G W HP G S I+ + N+PVVH++W
Sbjct: 184 D---HYQW----------WQY-VKGA---------DWKHPLGPQSNIQGKDNYPVVHIAW 220
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
DA AY W G RLPTEAEWE+ RGG +++ WGN P G+ AN++QG+FP ++
Sbjct: 221 EDAQAYAKWAGKRLPTEAEWEFAARGGDFGKMYAWGNQFKPDGKWMANIFQGKFPVSDEG 280
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYNPKGPTTGTD 300
DG++ AP+ + + +GLY++ GNVWEW DW+ + H + NP+GP + D
Sbjct: 281 KDGFVGIAPIKQFSASTYGLYDIAGNVWEWCEDWYRPDYYEELAHRDVARNPQGPDSAFD 340
Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KV++GGS+LC +QYC R+ R + S++ ++G RC D
Sbjct: 341 PMEPNEKKKVQRGGSFLCTDQYCTRYMVGTRGKGEFRSASNHVGLRCVKD 390
>gi|417747829|ref|ZP_12396287.1| hypothetical protein MAPs_38570 [Mycobacterium avium subsp.
paratuberculosis S397]
gi|336460655|gb|EGO39546.1| hypothetical protein MAPs_38570 [Mycobacterium avium subsp.
paratuberculosis S397]
Length = 296
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/337 (37%), Positives = 171/337 (50%), Gaps = 50/337 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++ LPG +FRMG+ E P VT++ F +++H V+N QF EFV+ATGYVT
Sbjct: 3 SELADLPGGSFRMGSTS---FYPEEAPIHTVTVEPFAIERHPVTNAQFAEFVAATGYVTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ A SQ++ F ++ R
Sbjct: 60 AEQPIDPALYP-------GADPSQLQPGAMVFRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR +W + P SW HP G DS + R +HPVV V+ DA AY
Sbjct: 96 WR--------QWWH---------WVPGASWRHPFGPDSDVADRADHPVVQVAHPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G +PWG+ T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAGATT-TYPWGDEPTSDGRLMANTWQGRFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV + N FGL +M+GNVWEWT ++ HH P P GP ++ KGGS
Sbjct: 198 SPVGVFAPNAFGLLDMIGNVWEWTTTEFSTHHRIGAATKPCCAPSGPADPSVNQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+LC +YC+R+R AARS + DS+ ++GFRC A+ G
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVAELG 294
>gi|319954217|ref|YP_004165484.1| sulfatase-modifying factor protein [Cellulophaga algicola DSM
14237]
gi|319422877|gb|ADV49986.1| Sulphatase-modifying factor protein [Cellulophaga algicola DSM
14237]
Length = 376
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 176/343 (51%), Gaps = 50/343 (14%)
Query: 15 MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV +PG F+ G + + D E PS V +D F++D EV+N QF +FV TGY+T A
Sbjct: 60 MVWIPGGAFQQGAVPQDKMAMDHEKPSHKVQVDGFFMDISEVTNAQFAKFVKETGYITVA 119
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + L E + HD G + I + V ++ + W
Sbjct: 120 EREIDWEEMKTQLPEG-----TPKPHDSILKPG--ALIFKKTKKSVPNL-----YDFSQW 167
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
EW+ G W HP G S+I + N PVV VS+ DA+AYCT
Sbjct: 168 W--------EWKIGA------------DWKHPNGPKSSIVGKDNEPVVQVSYEDALAYCT 207
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE RG ++ WG+++ + AN W+GEFP NN+ DG+ A
Sbjct: 208 WAGRRLPTEAEWERAARGNHAETIYFWGDDVD-KLATMANTWEGEFPVNNSKLDGFERRA 266
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKGPTTG 298
VMSY N FGLY+M GNVWEWT DW++ ++ P +YNP P
Sbjct: 267 SVMSYPANDFGLYDMAGNVWEWTTDWYSTNYFKEVANDMVLAKNPQGPTKTYNPNNP-YA 325
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+KV KGGS+LC+ YC +R ++R ++PDS A ++GFR A
Sbjct: 326 IEKVIKGGSFLCSASYCASYRISSRMGSSPDSGAEHVGFRTVA 368
>gi|154248609|ref|YP_001419567.1| hypothetical protein Xaut_4691 [Xanthobacter autotrophicus Py2]
gi|154162694|gb|ABS69910.1| protein of unknown function DUF323 [Xanthobacter autotrophicus Py2]
Length = 306
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 172/343 (50%), Gaps = 62/343 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF MG++ L E P R V +D F++D+ V+N F FV+ TG+V+ AE
Sbjct: 1 MVWIPGGTFHMGSDAHYL---EEAPVREVAVDGFWMDRCPVTNRDFARFVAETGHVSLAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ P + + A + F D+ D+ + W
Sbjct: 58 QV-------PSIEDYPGADPLLLVAGSLVFVTPDAP--------------GDSADWTRW- 95
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W R +W HP+G DS+IE +HPVVHV DA+AY W
Sbjct: 96 ---------W----------RFVAGANWRHPQGPDSSIEGLEDHPVVHVCHGDALAYARW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G LP+EA+WEY RGGL+ F WG+ L P G HRAN WQG FP N A DGY T+P
Sbjct: 137 QGKELPSEAQWEYAARGGLDRAEFAWGDALVPDGVHRANTWQGHFPDENLALDGYAGTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----------------SYNPKGPTT 297
V ++ N +GL +M+GNVWEWT DW+ H A S +P+ P +
Sbjct: 197 VTAFPPNGYGLLDMIGNVWEWTDDWYRPHGASAQGKACCLPRNPRGGSAEESRDPEAPGS 256
Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR T D+++ ++GFRC
Sbjct: 257 ALPRKVLKGGSHLCAPNYCRRYRPAARQPQTIDTASVHIGFRC 299
>gi|410224498|gb|JAA09468.1| sulfatase modifying factor 2 [Pan troglodytes]
Length = 323
Score = 208 bits (530), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 133/340 (39%), Positives = 172/340 (50%), Gaps = 73/340 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ---EFVSATGYVT 71
MV L G F MGTN P +DGE P R T+ F +D V+N F+ +FV Y T
Sbjct: 49 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRYIRDFVREKKYRT 107
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAE FG +FVFE +S+E R K +Q P+ + W
Sbjct: 108 EAEIFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW------- 139
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W LP E R F W P G S I R+ HPV+HVSWNDA AY
Sbjct: 140 -W----LPVE-------------RAF----WRQPAGPGSGILERLEHPVLHVSWNDARAY 177
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
C WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+
Sbjct: 178 CAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHG 234
Query: 252 TAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
+PV ++ +N +GLY+++GNVWEWTA +P +V +G S++
Sbjct: 235 VSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWID 281
Query: 311 NEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
H R R NTPDS++ NLGFRCAAD G G
Sbjct: 282 TADGSANHQARVTTRMGNTPDSASDNLGFRCAADAGRPPG 321
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ H R R NTPDS++ NLGFRCAAD
Sbjct: 272 RVLRGASWIDTADGSANHQARVTTRMGNTPDSASDNLGFRCAADA 316
>gi|73542580|ref|YP_297100.1| hypothetical protein Reut_A2895 [Ralstonia eutropha JMP134]
gi|72119993|gb|AAZ62256.1| Protein of unknown function DUF323 [Ralstonia eutropha JMP134]
Length = 409
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/351 (36%), Positives = 170/351 (48%), Gaps = 58/351 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG++ + + E P+ V + F++DQH V+N +F++FV ATGYVT AE
Sbjct: 77 MVHVPGGEFLMGSDS-KMAQANEKPAHKVRIHGFWMDQHHVTNAEFRKFVEATGYVTTAE 135
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
K D + L R S + F G + + Y W
Sbjct: 136 KSPDWETLKVQLPPGTPRPPASAMVAGAMVFVGTSRPVPLQ--------------DYSRW 181
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y P W HP G ++IE + NHPVV VS+ DA AY
Sbjct: 182 ----------WRY----------VPGADWRHPGGPGTSIEGKDNHPVVQVSYEDAQAYAK 221
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG P G+ ANVWQG+ FP + A G
Sbjct: 222 WAGKRLPTEAEWEFAARGGLEQATYAWGEKFAPDGKQMANVWQGQQGQPFPVISPKAGGA 281
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-------------VHHHPAP--SYNPKG 294
L T+PV ++ N +GLY+M GN W+W ADW+ V P P S++P
Sbjct: 282 LGTSPVGTFPANGYGLYDMTGNAWQWVADWYRADQFRREAAQGRLVDDPPGPLASWDPSE 341
Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
P +V +GGS+LCNE YC +R +AR P +S +LGFR D
Sbjct: 342 PGVPVGAPKRVTRGGSFLCNEDYCLSYRPSARRGTDPYNSMSHLGFRLVMD 392
>gi|348518876|ref|XP_003446957.1| PREDICTED: sulfatase-modifying factor 2-like [Oreochromis
niloticus]
Length = 308
Score = 208 bits (530), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 126/335 (37%), Positives = 174/335 (51%), Gaps = 71/335 (21%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++MV +PG MGT+ P +DGE PS+ V L F +D++ V+N F++FV A Y TE
Sbjct: 30 EEMVNIPGGKMLMGTSAPDG-RDGESPSKEVELQPFKIDRYPVTNADFRDFVRAQKYKTE 88
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE FG +FVF+ +SEE ++K++Q +
Sbjct: 89 AETFGWSFVFQDFVSEELKSKVTQ------------------------------RIESAP 118
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W LP E R+F W P G S I R++ PVV VSWNDA A+C
Sbjct: 119 WW---LPIE-------------RVF----WRQPSGPGSGIRDRLDFPVVQVSWNDAQAFC 158
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
+W+ RLPTE EWE+ RGGL R +PWGN +R N+WQG FP +TA DGY
Sbjct: 159 SWKSKRLPTEEEWEWAARGGLRGRTYPWGNKFQ---ANRTNLWQGSFPDGDTAEDGYHGI 215
Query: 253 APVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL-- 309
+PV +Y +N +GLY+M+GN WEWT+ ++ P Y V +G S++
Sbjct: 216 SPVTAYPPQNSYGLYDMMGNTWEWTSTRFSAAR---PMY-----------VLRGASWIDT 261
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+ ++ R R NTPDS++ NLGFRCAA G
Sbjct: 262 VDGSANHKARITTRMGNTPDSASDNLGFRCAASDG 296
>gi|375137877|ref|YP_004998526.1| hypothetical protein [Mycobacterium rhodesiae NBB3]
gi|359818498|gb|AEV71311.1| hypothetical protein MycrhN_0676 [Mycobacterium rhodesiae NBB3]
Length = 292
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 122/332 (36%), Positives = 174/332 (52%), Gaps = 49/332 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+V LPG +FRMG + E P+ T+ A +++H V+N QF +F+ ATGY+T AE
Sbjct: 5 LVELPGGSFRMGATQ---FYPEEAPAHTATVAAIAVERHPVTNAQFADFIDATGYLTVAE 61
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D ++ P + ++ + V F ++ R WR
Sbjct: 62 QPMDPALY-PGVPPQDLVPGALV------FRPTSGPVDLR-----------------DWR 97
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W++ P +W HP G DS I+ R++HPVV V++ DA AY W
Sbjct: 98 -------QWWDWA----------PGANWRHPFGPDSEIDGRLDHPVVQVAYPDAAAYARW 140
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY R G +PWG ++ P G AN WQG+FP N+ A G++ T+P
Sbjct: 141 AGRRLPTEAEWEYAARAG-ATTTYPWGEDVAPDGRLMANTWQGDFPYRNSGALGWIGTSP 199
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD----KVKKGGSYLC 310
V ++ N FGL +M+GNVWEWT ++ HH A P D + KGGS+LC
Sbjct: 200 VGAFAPNAFGLVDMIGNVWEWTTTEFSAHHRLAGRKGCCTPGAAADPTVNQTLKGGSHLC 259
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+YC+R+R AARS + DS+ ++GFRC AD
Sbjct: 260 APEYCHRYRAAARSPQSQDSATTHIGFRCVAD 291
>gi|375147002|ref|YP_005009443.1| sulfatase-modifying factor protein [Niastella koreensis GR20-10]
gi|361061048|gb|AEW00040.1| Sulphatase-modifying factor protein [Niastella koreensis GR20-10]
Length = 326
Score = 207 bits (528), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 130/363 (35%), Positives = 175/363 (48%), Gaps = 67/363 (18%)
Query: 3 LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
+L V+ +MV +PG F MG++K E P VT+D F++D++ V+N +F++
Sbjct: 8 VLSDAKVKDQTNMVFIPGGKFMMGSDK---FYPEEKPVHAVTVDGFWIDKYAVTNEEFKK 64
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEE-ERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
F+ TGYVT AE+ PL +E+ AK + F+ ++
Sbjct: 65 FIDETGYVTVAER--------PLKAEDYPGAKPDMLAPGALVFQKAKGAVDL-------- 108
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
ND + W P SW HP+G + ++ HPVV
Sbjct: 109 ---NDYFNWWAW-----------------------VPGTSWKHPKGPKTDLKGIEKHPVV 142
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
HV++ DA AY TW G LPTEA+WEY RGGLE F WGN T + + AN WQG FP
Sbjct: 143 HVAYEDAEAYTTWAGKELPTEAQWEYAARGGLEGMDFTWGNEDTQQTKPMANTWQGIFPY 202
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH----------------- 284
N D Y T+PV S+ N +GLY+M GNVWEWT+DW+ H
Sbjct: 203 QNLLIDKYEGTSPVGSFAPNGYGLYDMAGNVWEWTSDWYVAHLDELANKVKTCCTTHVNP 262
Query: 285 ---HPAPSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P SY+P PT KV KGGS+LC YC R+R AAR D+ +LGFRC
Sbjct: 263 RVVSPEASYDPCQPTIKIPRKVVKGGSHLCAPNYCLRYRPAARQPQMIDTGMSHLGFRCI 322
Query: 341 ADK 343
+
Sbjct: 323 VKQ 325
>gi|440747279|ref|ZP_20926538.1| Sulfatase modifying factor 1 precursor [Mariniradius
saccharolyticus AK6]
gi|436484199|gb|ELP40203.1| Sulfatase modifying factor 1 precursor [Mariniradius
saccharolyticus AK6]
Length = 351
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 130/345 (37%), Positives = 173/345 (50%), Gaps = 54/345 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G +F MGT++ + E P+ + F++D+ EV+N QF +FV TGYVT AE
Sbjct: 42 MVWIEGGSFWMGTDE-LEAYAVERPAVQREVKGFWMDETEVTNAQFSKFVEETGYVTVAE 100
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLD----STIEHRMHHPVVHISWNDAVAY 130
+ + EE K Q+ D + S + +HPV S ND +
Sbjct: 101 R---------KIDWEELKK--QLPPDTPKLPDDQLQPGSLVFSPPNHPVSTQS-NDISQW 148
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W SW HPEG +S+I R NHPVVH+++ DA A
Sbjct: 149 WKWVNG-----------------------ASWKHPEGPNSSIAGRENHPVVHIAFEDAEA 185
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G RLPTEAEWEY + G E + WG P G+ AN +QG+FP N D +
Sbjct: 186 YAKWAGKRLPTEAEWEYAAKAGKEKVRYAWGTEFRPNGQFMANTFQGDFPHQNLVEDNFA 245
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APS--------YNPKGPTT 297
TAPV S++ N FGLY+M+GNVWE TADW++ AP YNP P
Sbjct: 246 GTAPVKSFQANGFGLYDMIGNVWELTADWYDAIQFSRVAGIAPPLDSAMNKCYNPTNP-Y 304
Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+KV KGGS+LC + YC +R +AR + DS N+GFRC D
Sbjct: 305 AQEKVIKGGSFLCADDYCINYRPSARQGHAYDSGTSNVGFRCVKD 349
>gi|308210762|ref|NP_001016106.2| sulfatase modifying factor 2 precursor [Xenopus (Silurana)
tropicalis]
Length = 317
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 130/346 (37%), Positives = 178/346 (51%), Gaps = 64/346 (18%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E +MV L G F MGT K KDGE P R V + F +D++ V+N F+EFV A Y
Sbjct: 31 EDADNMVQLDGGQFDMGT-KASDGKDGESPVRQVKVLPFAVDKYPVTNKDFREFVRAKKY 89
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
TEAE FG +FVFE +SEE + K++Q + W
Sbjct: 90 KTEAEAFGWSFVFEDFVSEELKKKVTQ---------------------KLESAPW----- 123
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W LP E R F W P G S I+ ++++PVV VSWNDA
Sbjct: 124 ---W----LPVE-------------RAF----WRQPAGPASGIKEKLDYPVVQVSWNDAQ 159
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AYC W G RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG FP +TA DGY
Sbjct: 160 AYCKWLGKRLPTEEEWEFAARGGLKGKVYPWGNTFKP---NRTNLWQGTFPGMDTATDGY 216
Query: 250 LSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
+PV ++ +N++GLY+M+GN WEWTA + S + V +G S+
Sbjct: 217 HGASPVAAFPAQNEYGLYDMLGNTWEWTATEYT-------SARAQKQNKENMYVLRGASW 269
Query: 309 L--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
+ + ++ R R NTPDS++ NL FRCA ++ + DK+
Sbjct: 270 IDTVDGSANHKARVTTRMGNTPDSASDNLSFRCAKNRRKSKTEDKM 315
>gi|113867611|ref|YP_726100.1| hypothetical protein H16_A1600 [Ralstonia eutropha H16]
gi|113526387|emb|CAJ92732.1| uncharacterized conserved protein [Ralstonia eutropha H16]
Length = 414
Score = 207 bits (527), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 126/353 (35%), Positives = 169/353 (47%), Gaps = 58/353 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG++ + + E P+ V + AF++DQH V+N +F++FV ATGYV+ AE
Sbjct: 75 MVRVPGGEFLMGSDS-KMAQPNEKPAHKVRVHAFWMDQHHVTNAEFRKFVEATGYVSTAE 133
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ D L R S + F G + + Y W
Sbjct: 134 RAPDWETLRVQLPPGTPRPPDSAMVAGGMVFVGTPRPVPLQ--------------DYSRW 179
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y P W HP G S+IE + NHPVV VS+ DA AY
Sbjct: 180 ----------WRY----------VPGADWRHPTGPGSSIEGKDNHPVVQVSYEDAQAYAK 219
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG P G+ ANVWQG+ FP + A G
Sbjct: 220 WAGKRLPTEAEWEFAARGGLEQATYAWGEQFAPDGKQMANVWQGQQNQPFPVVSPKAGGA 279
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH------HPAPSYNPKGPTTGTD--- 300
L T+PV ++ N +GL +M GN W+W +DW+ P NP GP D
Sbjct: 280 LGTSPVGTFPPNGYGLADMTGNAWQWVSDWYRADQFRREAGKAQPIENPVGPMASWDPAE 339
Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+V +GGS+LCNE +C +R +AR P +S +LGFR D+G
Sbjct: 340 PGVPVNAPKRVTRGGSFLCNEDFCLSYRPSARRGTDPYNSMSHLGFRLVMDEG 392
>gi|291234206|ref|XP_002737040.1| PREDICTED: sulfatase-modifying factor 2-like [Saccoglossus
kowalevskii]
Length = 340
Score = 207 bits (527), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 123/337 (36%), Positives = 176/337 (52%), Gaps = 64/337 (18%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
++++++MV L G F MGT++P +DGE PSR VT+ F + ++ V+N F++FV A
Sbjct: 45 LKKFEEMVELKGGKFIMGTDEPNA-RDGEGPSRKVTVKPFKIMRYPVTNAAFRKFVKAKK 103
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+ TEAE+F +FVF+ +SEE +A I+
Sbjct: 104 FKTEAERFQWSFVFDAFVSEEVKANITD-------------------------------- 131
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R+P W P W PEG STI R+N P +H+S NDA
Sbjct: 132 --------RIPNTPWW----------VPVPRAYWRQPEGQGSTITKRLNWPALHISANDA 173
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
YC W G RLPTE EWEY RGGLE++ +PWG + + R N WQG+FP NT DG
Sbjct: 174 KQYCEWNGWRLPTEPEWEYAARGGLESKRYPWGASFDNK---RLNTWQGKFPEENTKEDG 230
Query: 249 YLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
Y ++PV +++ +N +G+Y+MVGN+WEWT+ + + P V +GGS
Sbjct: 231 YHGSSPVDAFEPQNDYGMYDMVGNIWEWTSTKFKSQNQGQAVEIPM-------LVLRGGS 283
Query: 308 YLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAAD 342
Y+ + + H R R NT D+S+ NLGFRCA+D
Sbjct: 284 YIDTKDGEFNHIVRVTTRMGNTADASSDNLGFRCASD 320
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 28/44 (63%), Gaps = 2/44 (4%)
Query: 352 VKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
V +GGSY+ + + H R R NT D+S+ NLGFRCA+DV
Sbjct: 278 VLRGGSYIDTKDGEFNHIVRVTTRMGNTADASSDNLGFRCASDV 321
>gi|429206654|ref|ZP_19197918.1| Sulfatase modifying factor 1 precursor protein [Rhodobacter sp.
AKP1]
gi|428190240|gb|EKX58788.1| Sulfatase modifying factor 1 precursor protein [Rhodobacter sp.
AKP1]
Length = 297
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 127/334 (38%), Positives = 169/334 (50%), Gaps = 53/334 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ LPG F MG+N E P +++ F++D V+N + FVSATGYVT A
Sbjct: 1 MIWLPGAVFYMGSNS---HYPEEAPVHLASVEGFWIDVFPVTNADYARFVSATGYVTVA- 56
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
ER+ ++ D + ++ R V D + W
Sbjct: 57 ---------------ERSPGAEDYPDADPALLVPGSLVFRRPASV-----EDPGDWTRW- 95
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y W HP+G S + R++HPVVHV DA+AY W
Sbjct: 96 ---------WSYQSGA----------CWRHPDGPGSDVAGRLDHPVVHVCHEDALAYAHW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWEY RGGL+ F WG+ L P G HRAN+WQG+FP N A DG+ T+P
Sbjct: 137 AGKDLPTEAEWEYAARGGLDRAEFAWGDVLVPEGRHRANIWQGQFPVENLAEDGFEGTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPS-YNPKGPTTG------TDKVKKG 305
V ++ N +GLY+M+GNVWEWT+D + H AP P P +G +V KG
Sbjct: 197 VRAFPPNGYGLYDMIGNVWEWTSDGFRADHGGRSAPGCCAPISPASGGHVLPEARQVLKG 256
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
G +LC YC R+R AAR TPDS++G++GFRC
Sbjct: 257 GYHLCAPNYCRRYRPAARQGQTPDSASGHIGFRC 290
>gi|158424484|ref|YP_001525776.1| hypothetical protein AZC_2860 [Azorhizobium caulinodans ORS 571]
gi|158331373|dbj|BAF88858.1| hypothetical protein AZC_2860 [Azorhizobium caulinodans ORS 571]
Length = 332
Score = 207 bits (527), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 134/353 (37%), Positives = 172/353 (48%), Gaps = 63/353 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G TF MG++ E P V +D F++D+ V+N F FV TG+VT AE
Sbjct: 24 MVWLAGATFHMGSDD---HYPEEAPVHPVRVDGFWIDRFPVTNRDFARFVEDTGHVTVAE 80
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + PLL S V F ++
Sbjct: 81 QVPSPADY-PLLDPHLLVAGSLV------FSPPEAQ------------------------ 109
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P A W+ R W HP+G S++E +HPVVHV++ DA AY W
Sbjct: 110 ----PGPAAWDQWWR------FVEGADWRHPKGPGSSLEGLEDHPVVHVAFADAQAYARW 159
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
LPTEAEWEY RGGL+ F WGN L P G HRAN WQG FP N A DG+ T+P
Sbjct: 160 CHKALPTEAEWEYAARGGLDRAPFAWGNELVPGGIHRANTWQGAFPVENLAEDGFAGTSP 219
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA---PSYNPKGPTTGTD----------- 300
V ++ N FGL++M+GNVWEWT DW+ H P P P P G++
Sbjct: 220 VDAFPANGFGLHDMIGNVWEWTDDWYQPRHDPPAARPCCVPHNPRGGSEAGSRHADILGA 279
Query: 301 ----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
KV KGGS+LC YC R+R AARS T D+++G++GFRC +GP T
Sbjct: 280 ALPRKVLKGGSHLCAPNYCQRYRPAARSPQTIDTASGHIGFRC-VRRGPKQET 331
>gi|430809050|ref|ZP_19436165.1| C-alpha-formyglycine- generating protein [Cupriavidus sp. HMR-1]
gi|429498565|gb|EKZ97072.1| C-alpha-formyglycine- generating protein [Cupriavidus sp. HMR-1]
Length = 386
Score = 207 bits (526), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 127/351 (36%), Positives = 170/351 (48%), Gaps = 58/351 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG++ + + E P+ V + AF++DQH V+N +F++FV ATGY+T AE
Sbjct: 46 MVHIPGGEFLMGSDHK-MSQANERPAHKVQVKAFWMDQHHVTNAEFRKFVEATGYLTTAE 104
Query: 75 KFGDTFVFEPLL-SEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ D L + R S + F G +S + R Y W
Sbjct: 105 RKPDWETLRVQLPAGTPRPPDSAMVAGGMVFVGTNSPVPLR--------------EYWRW 150
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P W HP G S+IE + NHPVV VS+ DA AY
Sbjct: 151 ----------W----------RFVPGADWRHPTGPGSSIEGKDNHPVVQVSYEDAQAYAK 190
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG+ P G ANVWQG+ FP + A G
Sbjct: 191 WAGKRLPTEAEWEFAARGGLEQATYAWGDKFAPDGRQMANVWQGQQVQPFPVVSAKAGGA 250
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH------PAPSYNPKGPTTGTD--- 300
T+ + ++ N +GLY+M GN W+W ADW+ A NP GPT D
Sbjct: 251 AGTSAIGTFPANGYGLYDMTGNAWQWVADWYRADQFRREATVAAVLQNPTGPTDSWDPTE 310
Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+V +GGS+LCNE +C +R +AR P +S +LGFR D
Sbjct: 311 PGVPVSAPKRVTRGGSFLCNEDFCLSYRPSARRGTDPYTSMSHLGFRLVMD 361
>gi|334342830|ref|YP_004555434.1| sulfatase-modifying factor protein [Sphingobium chlorophenolicum
L-1]
gi|334103505|gb|AEG50928.1| Sulphatase-modifying factor protein [Sphingobium chlorophenolicum
L-1]
Length = 313
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 132/346 (38%), Positives = 175/346 (50%), Gaps = 67/346 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+PG F MG+++ E P R V++D+F++D+ V+N QF +FV+ATG++T AE
Sbjct: 6 MRLVPGGVFSMGSDR---FYPEEAPVREVSVDSFWIDETPVTNRQFADFVAATGHITMAE 62
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
P + + S FE ++
Sbjct: 63 T-------PPNVEDYPGMPASMAVAGSLVFEPTAGPVD---------------------- 93
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L ++W + R G + W HP G DS+ E +HPVVH++W DAVAY W
Sbjct: 94 ---LADVSQW-WHFRFGAD--------WRHPLGPDSSTEGLEDHPVVHIAWADAVAYAEW 141
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE+ RGGL F WG+ L P G AN WQG FP +N DG+ T+P
Sbjct: 142 AGKRLPTEAEWEFATRGGLHGTDFAWGDELAPAGAMLANYWQGLFPFSNQLLDGWARTSP 201
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH--HHPA------PSYNPKG------------ 294
V Y N +GLY+M+GNVWEWT+DW+ H H A PS NP+G
Sbjct: 202 VRHYPANGYGLYDMIGNVWEWTSDWFAPHKLHRKAKGSCCIPS-NPRGGLERESYDVSVP 260
Query: 295 -PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P G KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 261 TPRIGR-KVLKGGSHLCAPNYCQRYRPAARHAQPIDTSTSHVGFRC 305
>gi|343926957|ref|ZP_08766447.1| putative sulfatase-modifying factor [Gordonia alkanivorans NBRC
16433]
gi|343763116|dbj|GAA13373.1| putative sulfatase-modifying factor [Gordonia alkanivorans NBRC
16433]
Length = 267
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 123/309 (39%), Positives = 162/309 (52%), Gaps = 59/309 (19%)
Query: 35 DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
DGE P +V + AF +D V+N F FV ATGYVT+AE RA I
Sbjct: 12 DGEVPVHDVEVGAFSVDVTSVTNAAFAAFVEATGYVTDAE----------------RAGI 55
Query: 95 SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
S V H Y T G +P W
Sbjct: 56 SAVFH-----------------------------GYATAPGEPVPETPWW---------- 76
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
P SW HP G DS + + HPVVHVS +DA+AYC+W G LPTEA+WEY RGGL
Sbjct: 77 LATPGASWRHPGGPDSDLTGKTTHPVVHVSHDDALAYCSWAGRSLPTEAQWEYAARGGLH 136
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
+PWG+ R E R N+++G FP +G++ T V +++ N +GLY VGNVWE
Sbjct: 137 GARYPWGDRHPGREEPRCNIFRGVFPDR---PEGHVGTVDVRAFEPNGYGLYQCVGNVWE 193
Query: 275 WTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
W AD ++ ++ + S +P GPT GT +V +GGS+LC++ YC R+R AARS NTP+S+A
Sbjct: 194 WCADRFSARYYRTSDSVDPTGPTRGTLRVLRGGSHLCHDSYCNRYRVAARSSNTPNSTAS 253
Query: 334 NLGFRCAAD 342
N+GFR D
Sbjct: 254 NIGFRTVGD 262
>gi|376317284|emb|CCG00652.1| sulfatase-modifying factor 1 [uncultured Flavobacteriia bacterium]
Length = 339
Score = 207 bits (526), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 119/344 (34%), Positives = 178/344 (51%), Gaps = 52/344 (15%)
Query: 15 MVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV +PG + G +K + E P NV +D FY+D EV+N QF +F+ AT Y+T A
Sbjct: 22 MVWIPGGVYEQGASKNDKMALPHEKPKHNVRIDGFYMDITEVTNAQFLKFIEATNYITTA 81
Query: 74 EKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
E+ D + + L + + S ++ F+ +++ + +
Sbjct: 82 EQPIDWSLIASQLPAGTPKPHDSILQPGSLLFKKTKTSVPNLYD-------------FSQ 128
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W +W G +W P G S+I + NHPVVHVS+ DA AYC
Sbjct: 129 WW--------KWSIGT------------NWTQPNGKGSSIIGKENHPVVHVSFEDAQAYC 168
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY RGG +N+++ WG + N W+GEFP +N+ ADG+ +
Sbjct: 169 KWAGRRLPTEAEWEYAARGGKKNKIYFWG-DFVENLSSNVNSWEGEFPVSNSQADGFEKS 227
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA---------------PSYNPKGPTT 297
APV ++ N FGL+ + GNVWEWT+DW+NV+++ +YNP P
Sbjct: 228 APVKTFPPNDFGLHEISGNVWEWTSDWYNVNYYATLASNGEVSNNPKGANSAYNPNNPQI 287
Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
++K+ +GGS+LCN YC +R ++R P++S +LGFR A
Sbjct: 288 -SEKIIRGGSFLCNASYCASYRVSSRMATDPNTSLEHLGFRTVA 330
>gi|443489538|ref|YP_007367685.1| Formylglycine-generating sulfatase enzyme [Mycobacterium liflandii
128FXT]
gi|442582035|gb|AGC61178.1| Formylglycine-generating sulfatase enzyme [Mycobacterium liflandii
128FXT]
Length = 293
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 175/335 (52%), Gaps = 50/335 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ E +V++ AF ++++ V+N QF EFV+ATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTS---FYPEEASIHSVSVHAFAIERYPVTNAQFAEFVAATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ + ++ R R + D W
Sbjct: 61 EQPLDPALYPGVNPDDLRPGAMVFRPTAGPVDLRD------------------------W 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P SW HP G DS ++ R +HPVV V++ DAVAY
Sbjct: 97 R-------QWWDWA----------PGASWRHPWGPDSDVDDRADHPVVQVAYPDAVAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RG + WG+ P G+ AN WQG+FP N A G+ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGATTT-TYAWGDEEKPAGQLMANTWQGKFPYRNDGALGWFGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTGT-DKVKKGGSY 308
PV ++ N FGL++M+GNVWEWT ++ HH P P GP T + KGGS+
Sbjct: 199 PVGTFPANGFGLFDMIGNVWEWTTTPFSAHHRIDQPPKACCAPAGPADPTISQTLKGGSH 258
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
LC +YC+R+R AARS + D++ ++GFRC AD+
Sbjct: 259 LCAPEYCHRYRPAARSPQSRDTATTHIGFRCVADR 293
>gi|399576771|ref|ZP_10770526.1| hypothetical protein HSB1_25650 [Halogranum salarium B-1]
gi|399238215|gb|EJN59144.1| hypothetical protein HSB1_25650 [Halogranum salarium B-1]
Length = 323
Score = 206 bits (525), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 174/344 (50%), Gaps = 60/344 (17%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
D+V +PG F MG+++ E P+R V +D F++D+ V+ F+EFV ATGYVT
Sbjct: 18 DDVVWIPGGEFSMGSDE---HYPEERPARQVGVDGFWIDRTPVTTAAFREFVDATGYVTL 74
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ P E A + + F D ++ R +W
Sbjct: 75 AER-------PPRPEEYPDADPALLVPGSAVFVPPDGPVDLRNPS-----NW-------- 114
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W+Y P +W P G +S + ++HPVVHV++ DAVAY
Sbjct: 115 -----------WQY----------VPDANWRQPLGPESDLSEWLDHPVVHVAYEDAVAYA 153
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAE E RGGL+ F WG+ LTP GE AN WQG FP NT ADG++ T
Sbjct: 154 EWAGKRLPTEAEHERASRGGLDGATFAWGDELTPEGELMANTWQGTFPWQNTGADGHVRT 213
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNV-----------HHHP-----APSYNPKGPT 296
+PV S+ N +GL + +GNVWEWT+DW+ V +P A S +P P
Sbjct: 214 SPVGSFPANGYGLVDTIGNVWEWTSDWFAVGGGDSTPTCCTPENPRGGTEAASVDPFDPA 273
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KV KGGS+LC YC R+R AAR D+S ++GFRCA
Sbjct: 274 RTPRKVLKGGSHLCAPNYCQRYRPAARYPEPVDTSTSHVGFRCA 317
>gi|373252951|ref|ZP_09541069.1| hypothetical protein NestF_08584 [Nesterenkonia sp. F]
Length = 327
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/331 (37%), Positives = 159/331 (48%), Gaps = 70/331 (21%)
Query: 15 MVLLPGDTFRMGTNKPILI-KDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ L TFRMG+ + DGE P R V + F + V+ +F FV ATG+ T+A
Sbjct: 66 MIGLSATTFRMGSEDELAYPADGEGPVREVAVGPFAIGATSVTVAEFAAFVVATGHRTDA 125
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +GD+ VF L++ RA V
Sbjct: 126 ETYGDSMVFAGRLTDALRASTPAV------------------------------------ 149
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
EA W RG +W P+G DST E R +HPV HVS DA AY
Sbjct: 150 ------AEAPWWRQVRGA---------TWFAPDGPDSTAEGREDHPVTHVSHRDAQAYAQ 194
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W GARLPTEAEWEY RGGL+ + +PWG P G R N++ G+FP + G T
Sbjct: 195 WAGARLPTEAEWEYAARGGLDQQPYPWGAVRDPDGVRRMNIFPGDFPED---PHGEAGTM 251
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV S++ N GLYN GNVWEWTA + +P V +GGSY+C+E
Sbjct: 252 PVRSFEPNGLGLYNSTGNVWEWTAGTFGATGNP---------------VARGGSYMCHEN 296
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
YC R+R R+Q T D++ G+ GFR AAD G
Sbjct: 297 YCRRYRTTGRTQATADTTLGHTGFRLAADVG 327
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 23/44 (52%), Positives = 32/44 (72%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+ V +GGSY+C+E YC R+R R+Q T D++ G+ GFR AADV
Sbjct: 283 NPVARGGSYMCHENYCRRYRTTGRTQATADTTLGHTGFRLAADV 326
>gi|58476691|gb|AAH89708.1| MGC108306 protein [Xenopus (Silurana) tropicalis]
gi|213624563|gb|AAI71273.1| MGC108306 protein [Xenopus (Silurana) tropicalis]
Length = 282
Score = 206 bits (524), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 176/341 (51%), Gaps = 64/341 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGT K KDGE P R V + F +D++ V+N F+EFV A Y TEAE
Sbjct: 1 MVQLDGGQFDMGT-KASDGKDGESPVRQVKVLPFAVDKYPVTNKDFREFVRAKKYKTEAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +SEE + K++Q + W W
Sbjct: 60 AFGWSFVFEDFVSEELKKKVTQ---------------------KLESAPW--------W- 89
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E R F W P G S I+ ++++PVV VSWNDA AYC W
Sbjct: 90 ---LPVE-------------RAF----WRQPAGPASGIKEKLDYPVVQVSWNDAQAYCKW 129
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG FP +TA DGY +P
Sbjct: 130 LGKRLPTEEEWEFAARGGLKGKVYPWGNTFKP---NRTNLWQGTFPGMDTATDGYHGASP 186
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N++GLY+M+GN WEWTA + S + V +G S++ +
Sbjct: 187 VAAFPAQNEYGLYDMLGNTWEWTATEYT-------SARAQKQNKENMYVLRGASWIDTVD 239
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
++ R R NTPDS++ NL FRCA ++ + DK+
Sbjct: 240 GSANHKARVTTRMGNTPDSASDNLSFRCAKNRRKSKTEDKM 280
>gi|94313454|ref|YP_586663.1| C-alpha-formyglycine- generating protein [Cupriavidus metallidurans
CH34]
gi|93357306|gb|ABF11394.1| Sulfatase-modifying factor (C-alpha-formyglycine- generating
enzyme) DUF323 [Cupriavidus metallidurans CH34]
Length = 408
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/351 (35%), Positives = 170/351 (48%), Gaps = 58/351 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG++ + + E P+ V + AF++DQH V+N F++FV ATGY+T AE
Sbjct: 68 MVHVPGGEFLMGSDHK-MSQANERPAHKVQVKAFWMDQHHVTNADFRKFVEATGYLTTAE 126
Query: 75 KFGDTFVFEPLL-SEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ D L + R S + F G +S + R Y W
Sbjct: 127 RKPDWETLRVQLPAGTPRPPDSAMVAGGMVFVGTNSPVPLR--------------EYWRW 172
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P W HP G S+IE + NHPVV VS+ DA AY
Sbjct: 173 ----------W----------RFVPGADWRHPTGPGSSIEGKDNHPVVQVSYEDAQAYAK 212
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG+ P G ANVWQG+ FP + A G
Sbjct: 213 WAGKRLPTEAEWEFAARGGLEQATYAWGDKFAPDGRQMANVWQGQQVQPFPVVSAKAGGA 272
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
T+ V ++ N +GLY+M GN W+W ADW+ PA S++P
Sbjct: 273 AGTSAVGTFPGNGYGLYDMTGNAWQWVADWYRADQFRREATVAAVLQNPTGPADSWDPTE 332
Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
P + +V +GGS+LCNE +C +R +AR P +S +LGFR D
Sbjct: 333 PGVPVSAPKRVTRGGSFLCNEDFCLSYRPSARRGTDPYTSMSHLGFRLVMD 383
>gi|60552102|gb|AAH91176.1| Sumf2 protein [Rattus norvegicus]
Length = 271
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/333 (37%), Positives = 170/333 (51%), Gaps = 71/333 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGTN P +DGE P+R VT+ F +D ++N F+EFV Y TEAE
Sbjct: 1 MVHLPGGRFLMGTNAPD-GRDGEGPAREVTVKPFAIDVFPITNKDFREFVREKKYQTEAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S E R + N+ A W
Sbjct: 60 AFGWSFVFEDFVSPELRKQA------------------------------NEMPAVLWW- 88
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + + W P G S I ++ PVVHVSWNDA AYC W
Sbjct: 89 ---LPVQKAF-----------------WRQPAGPGSGIREKLELPVVHVSWNDAGAYCAW 128
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN +R N+WQG+FP + A DG+ +P
Sbjct: 129 RGKRLPTEEEWEFAARGGLKGQVYPWGNQFQ---LNRTNLWQGKFPKGDRAEDGFHGLSP 185
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +Y G +V +G S++ +
Sbjct: 186 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQAAGQDM---RVLRGASWIDTAD 232
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+R R R NTPDS++ NLGFRCA+ +G
Sbjct: 233 GSANHRARVTTRMGNTPDSASDNLGFRCASSEG 265
>gi|164607119|ref|NP_001020296.2| sulfatase-modifying factor 2 precursor [Rattus norvegicus]
gi|149063163|gb|EDM13486.1| sulfatase modifying factor 2 [Rattus norvegicus]
Length = 307
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/333 (37%), Positives = 170/333 (51%), Gaps = 71/333 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGTN P +DGE P+R VT+ F +D ++N F+EFV Y TEAE
Sbjct: 37 MVHLPGGRFLMGTNAPD-GRDGEGPAREVTVKPFAIDVFPITNKDFREFVREKKYQTEAE 95
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S E R + N+ A W
Sbjct: 96 AFGWSFVFEDFVSPELRKQA------------------------------NEMPAVLWW- 124
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP + + W P G S I ++ PVVHVSWNDA AYC W
Sbjct: 125 ---LPVQKAF-----------------WRQPAGPGSGIREKLELPVVHVSWNDAGAYCAW 164
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN +R N+WQG+FP + A DG+ +P
Sbjct: 165 RGKRLPTEEEWEFAARGGLKGQVYPWGNQFQ---LNRTNLWQGKFPKGDRAEDGFHGLSP 221
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +Y G +V +G S++ +
Sbjct: 222 VNAFPPQNNYGLYDLMGNVWEWTAS----------TYQAAGQDM---RVLRGASWIDTAD 268
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+R R R NTPDS++ NLGFRCA+ +G
Sbjct: 269 GSANHRARVTTRMGNTPDSASDNLGFRCASSEG 301
>gi|282901004|ref|ZP_06308937.1| protein of unknown function DUF323 [Cylindrospermopsis raciborskii
CS-505]
gi|281194095|gb|EFA69059.1| protein of unknown function DUF323 [Cylindrospermopsis raciborskii
CS-505]
Length = 333
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 126/342 (36%), Positives = 180/342 (52%), Gaps = 71/342 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+ G T++MG + D +F + VT+ +F +D++EV+N +F +FV TGY+T A+
Sbjct: 43 MVLINGGTYKMGAD------DSDF--QEVTVGSFCIDKYEVTNAEFAKFVELTGYITVAQ 94
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ F P LS+EER S V K+ +
Sbjct: 95 RPLSKEQF-PHLSDEERLPGSLVFQAPKQGQ----------------------------- 124
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
++P + W + P +W HP G D+T + N+PVVH+++ DA+AY W
Sbjct: 125 -KQIPFLSWWYW----------VPDANWQHPLGDDTT--SKENYPVVHIAYQDALAYAKW 171
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTE +WEY RGGL++ + WGN + E +AN WQG FP NT ADGY+ AP
Sbjct: 172 LGKSLPTEVQWEYAARGGLKSATYTWGNQYS---ETKANTWQGMFPFFNTKADGYVGLAP 228
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP--------------SYNPKGPTTGTD 300
V S+ N +G+Y+M GNVWEWT+ W+ V P P S++P+ PT
Sbjct: 229 VGSFPPNGYGIYDMAGNVWEWTSSWFKV---PQPHQNGVDYLSVKERTSFDPEKPTEMAL 285
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V KGGSYLC YC R+R AAR +PD+ A ++GFR +
Sbjct: 286 YVIKGGSYLCAPNYCSRYRPAARESQSPDTGANHIGFRLVKN 327
>gi|400536049|ref|ZP_10799585.1| hypothetical protein MCOL_V216714 [Mycobacterium colombiense CECT
3035]
gi|400331092|gb|EJO88589.1| hypothetical protein MCOL_V216714 [Mycobacterium colombiense CECT
3035]
Length = 296
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/335 (37%), Positives = 170/335 (50%), Gaps = 50/335 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++V LPG +FRMG+ E P VT+ +++H V+N QF EFV ATGYVT
Sbjct: 3 SELVDLPGGSFRMGSTS---FYPEEAPIHTVTVGPIAVERHPVTNAQFAEFVDATGYVTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ P + ++ + V F ++ R
Sbjct: 60 AEQPIDPALY-PGANADDLCPGAMV------FRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR +W R P SW HP G DS + R +HPVV V++ DA AY
Sbjct: 96 WR--------QWW---------RWAPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G +PWG+ T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYPWGDEATSGGRLMANTWQGSFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV ++ N FGL +M+GNVWEWTA + HH P P GP ++ KGGS
Sbjct: 198 SPVGTFAPNAFGLLDMIGNVWEWTATEFTAHHRLEAPPKSCCTPSGPADPSVNQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC +YC+R+R ARS + DS+ ++GFRC AD
Sbjct: 258 HLCAPEYCHRYRPPARSPQSQDSATTHIGFRCVAD 292
>gi|87301449|ref|ZP_01084289.1| hypothetical protein WH5701_02204 [Synechococcus sp. WH 5701]
gi|87283666|gb|EAQ75620.1| hypothetical protein WH5701_02204 [Synechococcus sp. WH 5701]
Length = 332
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 129/346 (37%), Positives = 180/346 (52%), Gaps = 60/346 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TFRMG++ E + VT++ F +D+HEV+N +F FV+ TGYVT AE
Sbjct: 35 MEWIPGGTFRMGSDDHY---PDELSADAVTVEGFCIDRHEVTNAEFDAFVTDTGYVTVAE 91
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + P L+E++RA + V M ++ V+Y +W
Sbjct: 92 RPIPKDQY-PDLAEDQRAPGALV---------------FAMPPAGTKVT---EVSYLSW- 131
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W + P +W HP+G +S +E NHPVV V++ DA AY W
Sbjct: 132 ---------WHWT----------PGANWRHPQGPESDLEGLANHPVVQVAFRDAEAYAAW 172
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEA+WE+ RGGL++++F WG + + +AN WQGEFP +N DGY TAP
Sbjct: 173 AGKELPTEAQWEFAARGGLKDQIFSWGKTYSAK---KANTWQGEFPIHNQEKDGYFGTAP 229
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAP-------SYNPKGPTTGTDKV 302
V S+ N +GL +M GNVWEWT DW+ H H P S++P+ P V
Sbjct: 230 VGSFPPNGYGLQDMTGNVWEWTQDWYRPGHQGMADHVNPAVTEQELSFDPREPGIAK-HV 288
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
KGGS+LC + YC R+R AAR PD+ ++GFR PT G
Sbjct: 289 IKGGSFLCAKNYCSRYRPAAREALEPDTGTSHIGFRLVTS--PTEG 332
>gi|384539530|ref|YP_005723614.1| hypothetical protein SM11_pD1281 [Sinorhizobium meliloti SM11]
gi|336038183|gb|AEH84113.1| hypothetical protein SM11_pD1281 [Sinorhizobium meliloti SM11]
Length = 318
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 170/348 (48%), Gaps = 61/348 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+E +MV +PG TF MG+N E P VT+D F++D V+N QF EFV+ATG
Sbjct: 10 LEPAAEMVWVPGATFMMGSND---HYPEEAPVHPVTVDGFWIDVTPVTNRQFLEFVNATG 66
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+VT AE+ +P + A S +R F ++
Sbjct: 67 HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 103
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
G + ++W G +W HP G S+I ++HPVVHV+++DA
Sbjct: 104 ------GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHVAYSDA 145
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTE EWE RGGL+ F WG L P G H AN WQG FP N+ DG
Sbjct: 146 KAYAEWAGKDLPTETEWELAARGGLDGAEFSWGGELAPGGNHMANTWQGSFPVENSMDDG 205
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
+ T+PV Y N +GLY+M+GNVWEWT D+W+V H A S +
Sbjct: 206 FARTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 265
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P +V KGGS+LC YC R+R AAR D++ ++GFRC
Sbjct: 266 PAASVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRC 313
>gi|433611510|ref|YP_007194971.1| hypothetical protein C770_GR4pD1269 [Sinorhizobium meliloti GR4]
gi|429556452|gb|AGA11372.1| hypothetical protein C770_GR4pD1269 [Sinorhizobium meliloti GR4]
Length = 319
Score = 205 bits (522), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/349 (36%), Positives = 170/349 (48%), Gaps = 61/349 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+E +MV +PG TF MG+N E P VT+D F++D V+N QF EFV+ATG
Sbjct: 11 LEPAAEMVWVPGTTFMMGSND---HYPEEAPVHPVTVDGFWIDVTPVTNRQFLEFVNATG 67
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+VT AE+ +P + A S +R F ++
Sbjct: 68 HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 104
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
G + ++W G +W HP G S+I ++HPVVHV+++DA
Sbjct: 105 ------GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHVAYSDA 146
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTE EWE RGGL+ F WG L P G H AN WQG FP N+ DG
Sbjct: 147 KAYAEWAGKDLPTETEWELAARGGLDGAEFSWGGELAPGGNHMANTWQGSFPVENSMDDG 206
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
+ T+PV Y N +GLY+M+GNVWEWT D+W+V H A S +
Sbjct: 207 FARTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P +V KGGS+LC YC R+R AAR D++ ++GFRC
Sbjct: 267 PAASVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRCV 315
>gi|409436150|ref|ZP_11263342.1| Sulfatase-modifying factor 1 [Rhizobium mesoamericanum STM3625]
gi|408752060|emb|CCM74491.1| Sulfatase-modifying factor 1 [Rhizobium mesoamericanum STM3625]
Length = 339
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 130/344 (37%), Positives = 171/344 (49%), Gaps = 65/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ +PG TF MG+N E P+ V +D F++D+ V+N +F EFV ATGYVT AE
Sbjct: 18 MIWIPGRTFTMGSNDHY---PEEAPAHPVKVDGFWIDEIPVTNRKFTEFVKATGYVTFAE 74
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + E RA S + K G D T W W
Sbjct: 75 TRPDPKDYPGAKPEMLRAG-SLLFFQPKTVSGADIT------------QW--------W- 112
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+++G +W P G S + +++HPVVHV+W DA AY W
Sbjct: 113 --------TFKFGA------------NWRRPLGGTSDLRGKLDHPVVHVAWCDAKAYADW 152
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE RGGLE+ F WG+ L P G AN+WQG FP +T G+ T+P
Sbjct: 153 AGLELPTEAEWELAARGGLEDAEFAWGDELMPAGTLMANIWQGTFPVTSTKPKGHERTSP 212
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
V S+ N +GLY+M+GN+WEWT D+W+ HP P SY+P+ P
Sbjct: 213 VRSFPPNGYGLYDMIGNIWEWTTDYWS-DRHPEPSAKACCIPNNPRGGGMEASYDPRQPN 271
Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S +LGFRC
Sbjct: 272 IRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHLGFRC 315
>gi|146299962|ref|YP_001194553.1| hypothetical protein Fjoh_2206 [Flavobacterium johnsoniae UW101]
gi|146154380|gb|ABQ05234.1| protein of unknown function DUF323 [Flavobacterium johnsoniae
UW101]
Length = 384
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 174/350 (49%), Gaps = 66/350 (18%)
Query: 15 MVLLPGDTFRMGTNKPI--------LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
MV +PG F MG+N + KD P V +D +Y+D+ EV+N +F++FV A
Sbjct: 74 MVWIPGGEFSMGSNVEDESLCSIKGVTKDAA-PIHRVYVDGYYMDKTEVTNEEFEKFVKA 132
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TGYVT AE+ F P +EE+ S V L++
Sbjct: 133 TGYVTVAEQKPTKEEF-PTANEEDLITGSVVFTPTPSAVNLNN----------------- 174
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
+ W W Y P W HPEG S I+ + +PVVHV +
Sbjct: 175 ---FLQW----------WRYE----------PGADWKHPEGPQSNIKGKEKYPVVHVVYE 211
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN--NT 244
DA AY W G RLPTEAEWE+ RGG L+ WGN+L P+G+ +AN++QG FP +T
Sbjct: 212 DAAAYAKWAGKRLPTEAEWEFAARGGKTGNLYAWGNDLKPKGKFQANIYQGHFPIKDGDT 271
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKGPT-- 296
DG+ AP + N +GLY+M GNVWEW DW++V ++ + + NP+GP
Sbjct: 272 GEDGFKGIAPTAQFAPNAYGLYDMAGNVWEWVNDWYSVDYYKSLAESGKTVKNPQGPNAY 331
Query: 297 ------TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
T +V +GGS+LC +QYC R+ R + S+A ++GFRC
Sbjct: 332 YDPNDPTEIKRVHRGGSFLCTDQYCTRYMVGTRGKGEIRSAANHVGFRCV 381
>gi|73538395|ref|YP_298762.1| hypothetical protein Reut_B4568 [Ralstonia eutropha JMP134]
gi|72121732|gb|AAZ63918.1| conserved hypothetical protein [Ralstonia eutropha JMP134]
Length = 390
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 168/351 (47%), Gaps = 58/351 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG++ + + E P+ V + F++D+H V+N QF++FV ATGYVT AE
Sbjct: 55 MAHIPGGEFLMGSDS-NMAQPNEKPAHKVRVHGFWMDRHHVTNAQFRDFVKATGYVTTAE 113
Query: 75 KFGDTFVFEPLL-SEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ + L + R S + F G + + Y W
Sbjct: 114 RKPEWETLRVQLPAGTPRPPDSALVPGGMVFVGTSRPVPLK--------------DYSQW 159
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y P W HP G ++I+ + +HPVV VS+ DA AY
Sbjct: 160 ----------WRY----------VPGADWRHPGGPGTSIDGKDDHPVVQVSYEDAQAYAR 199
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG P G ANVWQG+ FP + A G
Sbjct: 200 WVGKRLPTEAEWEFAARGGLEQATYAWGGTFAPEGRQMANVWQGQQAQPFPVASPKAGGA 259
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
L T+PV ++ EN +GLY+M GN W+W ADW+ P S++P
Sbjct: 260 LGTSPVGTFPENGYGLYDMTGNAWQWVADWYRADQFRQEAAAGRRVDDPKGPVSSWDPDD 319
Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
PT +V +GGS+LCNE YC +R +AR P +S +LGFR D
Sbjct: 320 AGVPTDAPKRVTRGGSFLCNEDYCLSYRPSARRGTDPYNSMSHLGFRLVMD 370
>gi|384533781|ref|YP_005716445.1| sulfatase-modifying factor protein [Sinorhizobium meliloti BL225C]
gi|333815957|gb|AEG08624.1| Sulphatase-modifying factor protein [Sinorhizobium meliloti BL225C]
Length = 319
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/349 (36%), Positives = 170/349 (48%), Gaps = 61/349 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+E +MV +PG TF MG+N E P VT+D F++D V+N QF EFV+ATG
Sbjct: 11 LEPAAEMVWVPGTTFMMGSND---HYPEEAPVHPVTVDGFWIDVTPVTNRQFLEFVNATG 67
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+VT AE+ +P + A S +R F ++
Sbjct: 68 HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 104
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
G + ++W G +W HP G S+I ++HPVVHV+++DA
Sbjct: 105 ------GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHVAYSDA 146
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTE EWE RGGL+ F WG L P G H AN WQG FP N+ DG
Sbjct: 147 KAYAEWAGKDLPTETEWELSARGGLDGAEFSWGGELAPGGNHMANTWQGSFPVENSMDDG 206
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
+ T+PV Y N +GLY+M+GNVWEWT D+W+V H A S +
Sbjct: 207 FARTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P +V KGGS+LC YC R+R AAR D++ ++GFRC
Sbjct: 267 PAASVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRCV 315
>gi|220914917|ref|YP_002490225.1| hypothetical protein [Methylobacterium nodulans ORS 2060]
gi|219952668|gb|ACL63058.1| protein of unknown function DUF323 [Methylobacterium nodulans ORS
2060]
Length = 324
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/359 (35%), Positives = 178/359 (49%), Gaps = 62/359 (17%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
PA + MV + G TFRMG+++ E P V++D F++D+ V+N QF+ FV
Sbjct: 10 PADSPDLLAGMVFVAGGTFRMGSDR---HYPEEAPVHRVSVDGFWIDRTPVTNAQFRAFV 66
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
ATG+VT AE+ +P + A ++ F+ ++ R W
Sbjct: 67 RATGHVTMAER-------KPKAEDYPGALPHMLQAGSLVFKPPKGAVDLR--------DW 111
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
+ WR + +G SW P G S+I +HPVVHV+
Sbjct: 112 S-----AWWR---------FRFGA------------SWRRPLGPGSSIAGLDDHPVVHVA 145
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
+ DA AY W G LPTEAEWEY RGGL+ F WG+ TP G H AN WQG FP N
Sbjct: 146 YADAEAYARWAGKELPTEAEWEYAARGGLDGAEFAWGDEFTPGGRHMANTWQGAFPHQNL 205
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTDK 301
A DG+ T+PV ++ N +GL++M+GNVWEWT D++ H P P+ P G +K
Sbjct: 206 AEDGFERTSPVTTFPPNGYGLHDMIGNVWEWTTDFYAPKHPTDAPKACCIPQNPRGGQEK 265
Query: 302 ---------------VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
V KGGS+LC YC R+R AAR + D+S ++GFRC +GP
Sbjct: 266 GSYDPAQPAIRIPRRVVKGGSHLCAPNYCRRYRPAARHPHPVDTSTSHIGFRCVVRRGP 324
>gi|89269079|emb|CAJ81908.1| similar to human sulfatase modifying factor 2 precursor (sumf2)
[Xenopus (Silurana) tropicalis]
Length = 307
Score = 205 bits (521), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 129/346 (37%), Positives = 177/346 (51%), Gaps = 64/346 (18%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E +MV L G F MGT K KDGE P R V + F +D++ V+N F+EFV A Y
Sbjct: 21 EDADNMVQLDGGQFDMGT-KASDGKDGESPVRQVKVLPFAVDKYPVTNKDFREFVRAKKY 79
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
TEAE FG +FVFE +SEE + K++Q + W
Sbjct: 80 KTEAEAFGWSFVFEDFVSEELKKKVTQ---------------------KLESAPW----- 113
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W LP E R F W P G S I+ ++++PVV VSWNDA
Sbjct: 114 ---W----LPVE-------------RAF----WRQPAGPASGIKEKLDYPVVQVSWNDAQ 149
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AYC W G RLP E EWE+ RGGL+ +++PWGN P +R N+WQG FP +TA DGY
Sbjct: 150 AYCKWLGKRLPIEEEWEFAARGGLKGKVYPWGNTFKP---NRTNLWQGTFPGMDTATDGY 206
Query: 250 LSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSY 308
+PV ++ +N++GLY+M+GN WEWTA + S + V +G S+
Sbjct: 207 HGASPVAAFPAQNEYGLYDMLGNTWEWTATEYT-------SARAQKQNKENMYVLRGASW 259
Query: 309 L--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
+ + ++ R R NTPDS++ NL FRCA ++ + DK+
Sbjct: 260 IDTVDGSANHKARVTTRMGNTPDSASDNLSFRCAKNRRKSKTEDKM 305
>gi|383771887|ref|YP_005450952.1| hypothetical protein S23_36410 [Bradyrhizobium sp. S23321]
gi|381360010|dbj|BAL76840.1| hypothetical protein S23_36410 [Bradyrhizobium sp. S23321]
Length = 307
Score = 204 bits (520), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 171/348 (49%), Gaps = 62/348 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G TF MG++ E PS +D F++D+ V+N QF+ FV+ATG+VT A
Sbjct: 1 MVWISGGTFGMGSDDHY---PEEAPSHRAAVDGFWIDRTPVTNRQFKAFVNATGHVTTA- 56
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
Q+ D K + G + E +V + C W
Sbjct: 57 ---------------------QIVPDPKEYPG--ARPEMIYAGSLVFSPPSRVADLCDW- 92
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
++W +G +W P G S I+ +HPVVHVS++DA+AY TW
Sbjct: 93 -------SQWWTFMKGA---------TWRRPYGPKSNIKDLDDHPVVHVSYSDAMAYATW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGG + F WGN L P G H AN WQG FP N DG+ T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGFDGEDFAWGNTLNPGGRHMANTWQGIFPLENRCEDGFERTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---APSYNPKGPTTGTD----------- 300
V ++ N +G+++M+GNVWEWTADWW+ H P P+ P G +
Sbjct: 197 VKAFPPNGYGVHDMIGNVWEWTADWWSARHEADAQKPCCIPQNPRGGPESASFDSCQPHI 256
Query: 301 ----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
KV KGGS+LC YC R+R AAR D+S ++GFRC G
Sbjct: 257 RIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRCVVRNG 304
>gi|296168570|ref|ZP_06850374.1| sulfatase-modifying factor 1 [Mycobacterium parascrofulaceum ATCC
BAA-614]
gi|295896633|gb|EFG76272.1| sulfatase-modifying factor 1 [Mycobacterium parascrofulaceum ATCC
BAA-614]
Length = 305
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 169/335 (50%), Gaps = 50/335 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +PG +FRMG E P VTL AF +++H V+N QF EFVSATGYVT A
Sbjct: 5 ELVDIPGGSFRMGCTS---FYPEEAPIHTVTLPAFAVERHPVTNAQFAEFVSATGYVTIA 61
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ P+ D + G D + +V V W
Sbjct: 62 ER--------PI--------------DPALYPGADPA--DLVPGAMVFRPTTGPVDLGDW 97
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R +W R P W HP G S + R +HPVV V++ DA AY
Sbjct: 98 R--------QWW---------RWVPGACWRHPFGPHSDLTDRADHPVVQVAYPDAAAYAR 140
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG ++ WG+ +P G AN WQG FP N A G+ T+
Sbjct: 141 WAGRRLPTEAEWEYAARGG-SAAIYAWGDEPSPGGRLMANTWQGRFPYRNDGALGWAGTS 199
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGSY 308
PV ++ N FGL +M+GNVWEWTA ++ HH P P GP + KGGS+
Sbjct: 200 PVGTFPPNGFGLLDMIGNVWEWTATEFSAHHRLDQPPKACCTPAGPADPSVSQTLKGGSH 259
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
LC +YC+R+R AARS + D++ ++GFRC A +
Sbjct: 260 LCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAQQ 294
>gi|194289632|ref|YP_002005539.1| sulfatase-modifying factor (c-alpha-formyglycine-generating enzyme)
[Cupriavidus taiwanensis LMG 19424]
gi|193223467|emb|CAQ69472.1| putative Sulfatase-modifying factor
(C-alpha-formyglycine-generating enzyme) [Cupriavidus
taiwanensis LMG 19424]
Length = 410
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 127/352 (36%), Positives = 166/352 (47%), Gaps = 58/352 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG++ + + E P+ V + AF++DQH V+N F++FV ATGYVT AE
Sbjct: 71 MVRVPGGEFLMGSDS-KMAQPNEKPAHKVRVHAFWMDQHHVTNGDFRKFVEATGYVTTAE 129
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ D L R S + F G + + Y W
Sbjct: 130 RAPDWETLRVQLPPGTPRPPDSAMVAGGMVFVGTSRPVPLQ--------------DYSRW 175
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y P W HP G S+IE + NHPVV VS+ DA AY
Sbjct: 176 ----------WRY----------VPGADWRHPNGPGSSIEGKDNHPVVQVSYEDAQAYAR 215
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG P G ANVWQG+ FP + A G
Sbjct: 216 WVGKRLPTEAEWEFAARGGLEQATYAWGEQFAPDGRQMANVWQGQQNQPFPVVSPKAGGA 275
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH------HPAPSYNPKGPTTGTD--- 300
L T+PV ++ N +GL +M GN W+W ADW+ P NP GP D
Sbjct: 276 LGTSPVGTFPPNGYGLADMTGNAWQWVADWYRADQFRREAGKAQPIENPVGPLASWDPSE 335
Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+V +GGS+LCNE +C +R +AR P +S +LGFR D+
Sbjct: 336 PGVPVNAPKRVTRGGSFLCNEDFCLSYRPSARRGTDPFNSMSHLGFRLVMDE 387
>gi|333991963|ref|YP_004524577.1| hypothetical protein JDM601_3323 [Mycobacterium sp. JDM601]
gi|333487931|gb|AEF37323.1| conserved hypothetical protein [Mycobacterium sp. JDM601]
Length = 285
Score = 204 bits (520), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 128/334 (38%), Positives = 168/334 (50%), Gaps = 57/334 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +PG FRMG+ E P V + AF +++H V+N QF EFV+ATGYVT A
Sbjct: 4 ELVGIPGGVFRMGSTS---FYPEEAPIHTVAVAAFAIERHPVTNAQFAEFVAATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D P+ + E A + F ++ R W
Sbjct: 61 EQEPD-----PVYAAEPGALV---------FTPTAGPVDLR-----------------DW 89
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P SW HP G STIEH+ +HPVV V++ DA Y
Sbjct: 90 R-------QWWQW----------VPGASWRHPLGPHSTIEHKPDHPVVQVAYPDAAGYAR 132
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLP EAEWEY R G + +PWG P GE AN WQG FP N A G+ T+
Sbjct: 133 WAGRRLPAEAEWEYAARAGTSS-TYPWGEEEKPGGELMANTWQGAFPYRNDGALGWSGTS 191
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGSY 308
V S+ N +GL +M+GNVWEWT + HH P P GP G + KGGS+
Sbjct: 192 SVGSFPPNDWGLLDMIGNVWEWTTTEFCGHHRLDAAPKSCCAPDGPADPGVSQTLKGGSH 251
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
LC YC+R+R AARS + D++ ++GFRCAAD
Sbjct: 252 LCAPAYCHRYRAAARSPQSQDTATSHIGFRCAAD 285
>gi|452951552|gb|EME56999.1| hypothetical protein G352_21926 [Rhodococcus ruber BKS 20-38]
Length = 320
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 132/344 (38%), Positives = 170/344 (49%), Gaps = 62/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M LPG TF MG+N E P VT+D F++D H V+ F++FV ATG+VT AE
Sbjct: 17 MQHLPGGTFLMGSNH---FYPEEQPVHRVTVDGFWMDIHPVTVADFRKFVDATGHVTTAE 73
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
P +E A + F+ D HPV +D + +W
Sbjct: 74 -------IAPDPAEFPEAPADLLVPGSLVFDPPD--------HPV---PLDDFTRWWSW- 114
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P +W HPEG ST + R +HPV HVS +DA+AY W
Sbjct: 115 ----------------------IPGANWRHPEGPGSTSDDRDDHPVTHVSHDDALAYAKW 152
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGG E R F WG+ P G RANVWQGEFP N DGY T+P
Sbjct: 153 AGKDLPTEAEWEFAARGGQEGRHFVWGDEDPPSGGPRANVWQGEFPWQNLEEDGYYRTSP 212
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTD------ 300
V ++ N +GL +M GNVWEWT D++ H + AP+ P+ P
Sbjct: 213 VHAFPSNPYGLRDMAGNVWEWTVDYFTGDHASGGKNVAPASACCIPRNPRITVAEQPVDA 272
Query: 301 ----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+V KGGS+LC YC R+R AAR + D+S G++GFRC
Sbjct: 273 PRPRRVIKGGSHLCAPNYCLRYRPAARQGQSEDTSTGHIGFRCV 316
>gi|217979245|ref|YP_002363392.1| hypothetical protein Msil_3121 [Methylocella silvestris BL2]
gi|217504621|gb|ACK52030.1| protein of unknown function DUF323 [Methylocella silvestris BL2]
Length = 334
Score = 204 bits (519), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 125/343 (36%), Positives = 173/343 (50%), Gaps = 62/343 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ L G TF MG+++ E P+ V +D F++D+ V+N QF+ FV ATG+ T AE
Sbjct: 21 MIYLDGGTFLMGSDRHY---PEEAPAHRVAVDGFFMDETPVTNAQFRAFVEATGHRTFAE 77
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
++ + + H +K S + +PV W+ + T+R
Sbjct: 78 -----------IAPNSKDYPDALPHMLK----PGSLVFSPPDYPVDLRDWSQ---WWTFR 119
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+G SW P G S+I+ +HPVVHV++ DA AY W
Sbjct: 120 -----------FGA------------SWRKPYGPGSSIKGLDDHPVVHVAFRDAEAYARW 156
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LP+EAEWE+ RGGLE F WG+ TP G H AN WQG+FP N A DG+ T+P
Sbjct: 157 AGKELPSEAEWEFAARGGLERAEFAWGDEFTPGGRHMANTWQGQFPLQNLAEDGFERTSP 216
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGT------------ 299
V ++ N +GLY+++GNVWEWT DWW+ H P P+ P G+
Sbjct: 217 VRTFPPNNYGLYDLIGNVWEWTTDWWSQRHAGDAPKACCVPRNPRGGSMERSYASLQAGA 276
Query: 300 ---DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 277 SIPQKVLKGGSHLCAPNYCRRYRPAARHAEAIDTSTSHVGFRC 319
>gi|352096888|ref|ZP_08957602.1| Sulphatase-modifying factor protein [Synechococcus sp. WH 8016]
gi|351675472|gb|EHA58633.1| Sulphatase-modifying factor protein [Synechococcus sp. WH 8016]
Length = 282
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 124/335 (37%), Positives = 173/335 (51%), Gaps = 55/335 (16%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+ + MV++P F +G++ E P R+V + +F +D V+N +F FVS TGY
Sbjct: 3 DTFDRMVVIPSGDFILGSDS---FYPEESPVRSVQIKSFRIDACPVTNAEFARFVSETGY 59
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT +EK D ++ P L +++ S V F T++ +SW +
Sbjct: 60 VTVSEKPPDPVLY-PNLPPDQQNPESAV------FIPPPPTVDRNQP-----MSWWALIE 107
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
WR HP+G DSTI RM+HPVVH+++ D++
Sbjct: 108 GADWR-----------------------------HPQGPDSTIADRMDHPVVHLAYEDSL 138
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AY W G RLPT EWE RGGL + + WG +TP G+ ANVWQG FP N ADG+
Sbjct: 139 AYAQWMGKRLPTADEWEVAARGGLVQQHYAWGEEMTPGGQWLANVWQGPFPWKNEQADGW 198
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
T+PV S+ N +GL +M GNVWEWT+ + V PKG ++ KGGS+L
Sbjct: 199 FWTSPVGSFPANGYGLVDMCGNVWEWTSTLFPV---------PKG--EQERRIIKGGSFL 247
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
C E YC+R R AA T D++ ++GFRCAAD G
Sbjct: 248 CAENYCHRFRPAALMGQTIDTATCHMGFRCAADSG 282
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
++ KGGS+LC E YC+R R AA T D++ ++GFRCAAD
Sbjct: 239 RIIKGGSFLCAENYCHRFRPAALMGQTIDTATCHMGFRCAAD 280
>gi|116693744|ref|YP_839277.1| hypothetical protein Bcen2424_5651 [Burkholderia cenocepacia
HI2424]
gi|116651744|gb|ABK12384.1| protein of unknown function DUF323 [Burkholderia cenocepacia
HI2424]
Length = 421
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 125/353 (35%), Positives = 173/353 (49%), Gaps = 58/353 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DM +PG TF MG++ + E P+ T+ F+LD+H+V+N +F+ FV ATGYVT A
Sbjct: 86 DMAWIPGGTFLMGSDA-KAAQPNERPAHRATVRGFWLDRHDVTNAEFRRFVDATGYVTTA 144
Query: 74 EKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
E+ D L R S + F G + +D Y
Sbjct: 145 ERKPDWSALRAQLPPGTPRPPDSALVPGALVFVGTAGPVPL-----------DD---YAR 190
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W R P +W HP+G STI + ++PVV VS++DA AY
Sbjct: 191 W----------W----------RFLPGANWRHPDGPGSTIAGKDDYPVVQVSYDDARAYA 230
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADG 248
W G RLPTEAEWE+ RGGLE + WGN L P G AN+W+G+ FP + +
Sbjct: 231 QWAGKRLPTEAEWEFAARGGLEQADYAWGNQLKPAGRQMANIWEGQQPQPFPVVDARSGK 290
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAPSYNPK 293
+ +PV +Y N +GLY+M GNVW+W ADW+ PA S++P
Sbjct: 291 PAAPSPVGTYPPNGYGLYDMAGNVWQWVADWYRADAFLLAVESGRPVADPQGPADSWDPT 350
Query: 294 GPTT---GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
P +V +GGS+LCNE YC +R +AR P++ +LGFR A D+
Sbjct: 351 EPGVPPHAPKRVIRGGSFLCNESYCQSYRPSARRGVDPEAGMSHLGFRLAMDE 403
>gi|407982456|ref|ZP_11163132.1| formylglycine-generating sulfatase enzyme family protein
[Mycobacterium hassiacum DSM 44199]
gi|407375968|gb|EKF24908.1| formylglycine-generating sulfatase enzyme family protein
[Mycobacterium hassiacum DSM 44199]
Length = 299
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 128/339 (37%), Positives = 172/339 (50%), Gaps = 55/339 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V L G T+RMG+ + E P VT+ F +++H V+N QF EFV+ATGYVT A
Sbjct: 4 ELVELRGATYRMGSTR---FYPEEAPEHTVTVAPFAIERHPVTNAQFAEFVAATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + P + ++ + V F D ++ R W
Sbjct: 61 ERALDPADY-PGANPDDLVPGALV------FTPTDGPVDLR-----------------DW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST-IEHRMNHPVVHVSWNDAVAYC 192
R +W R P W P G D + R +HPVV V++ DA+AY
Sbjct: 97 R--------QWW---------RYVPGAQWRRPFGPDGPGWQDRPDHPVVQVAYVDALAYA 139
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY RGG + +PWG L+P G AN WQG FP N A G++ T
Sbjct: 140 RWAGRRLPTEAEWEYAARGGADT-TYPWGEELSPGGRLMANTWQGAFPYRNDGAAGWVGT 198
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------PAPSYNPKGPTTGTDKVK 303
+PV ++ N +GL +M+GNVWEWT + HH P P P P +V
Sbjct: 199 SPVGTFPPNGYGLVDMIGNVWEWTTTRFEGHHRIGPAQGAGGPKPCCTPSRPDPSVSQVL 258
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGS+LC +YC R+R ARS T DS+ ++GFRCAAD
Sbjct: 259 KGGSHLCAPEYCRRYRPTARSPQTQDSATTHIGFRCAAD 297
>gi|354494816|ref|XP_003509531.1| PREDICTED: sulfatase-modifying factor 2-like [Cricetulus griseus]
Length = 302
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 128/354 (36%), Positives = 176/354 (49%), Gaps = 78/354 (22%)
Query: 1 MVLLPAPPVE----RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVS 56
M LL +P ++ + MV LPG F MGT+ P +DGE P+R VT+ +F +D V+
Sbjct: 11 MCLLSSPRLQLGNAQAPTMVQLPGGRFLMGTDAPD-GRDGEGPAREVTVKSFTIDVFPVT 69
Query: 57 NTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
N F+EFV Y TEAE FG +FVFE +S E R + +Q+
Sbjct: 70 NKDFREFVREKKYQTEAETFGWSFVFEDFVSPELRNQANQM------------------- 110
Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
+ W V WR P G S I ++
Sbjct: 111 ---PSVLWWLPVQKAFWR-----------------------------QPAGPGSGIREKL 138
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
PVVHVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQ
Sbjct: 139 ELPVVHVSWNDASAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQ 195
Query: 237 ---GEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
G+FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA +
Sbjct: 196 GRPGKFPKGDRAEDGFHGLSPVNAFPPQNNYGLYDLMGNVWEWTASIYQSASQDM----- 250
Query: 293 KGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+V +G S++ + +R R R NTPDS++ NLGFRCA+ G
Sbjct: 251 --------RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCASSTG 296
>gi|47221775|emb|CAG08829.1| unnamed protein product [Tetraodon nigroviridis]
Length = 284
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 172/332 (51%), Gaps = 71/332 (21%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV +PG MGT+ KDGEFP R + +F +D++ V+N +F++FV + Y TEA
Sbjct: 21 EMVDIPGGRMLMGTSA-ADGKDGEFPPREAKVQSFKIDKYPVTNAEFRDFVRSQKYKTEA 79
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVF+ +S+E ++KI+Q + W
Sbjct: 80 ETFGWSFVFQDFVSDELKSKITQ------------------------------RIESAPW 109
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I R++ PVVHVSWNDA A+C
Sbjct: 110 W---LPVE-------------RAF----WRQPAGAGSGIRERLDFPVVHVSWNDAQAFCQ 149
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W+G RLP E EWE+ RGGL+ R +PWGN +R+N+WQG FP +TA DG+ +
Sbjct: 150 WKGKRLPGEEEWEWAARGGLQGRTYPWGNKFQ---ANRSNLWQGPFPEGDTAEDGFHGVS 206
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
PV ++ +N GLY+M+GN WEWT+ P P P V +G S++
Sbjct: 207 PVAAFPPQNSHGLYDMMGNTWEWTS-------TPFPGAQPM-------YVLRGASWIDTV 252
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+ ++ R R NTPDS++ NLGFRCAA+
Sbjct: 253 DGSSNHKARVTTRMGNTPDSASDNLGFRCAAN 284
>gi|387877758|ref|YP_006308062.1| hypothetical protein W7S_21930 [Mycobacterium sp. MOTT36Y]
gi|386791216|gb|AFJ37335.1| hypothetical protein W7S_21930 [Mycobacterium sp. MOTT36Y]
Length = 296
Score = 204 bits (519), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 126/336 (37%), Positives = 170/336 (50%), Gaps = 50/336 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++V LPG +FRMG+ E P VT+ +++H V+N QF EFV ATGY+T
Sbjct: 3 SELVDLPGGSFRMGSTS---FYPEEAPIHTVTIGPLAVERHPVTNAQFAEFVDATGYLTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D +F P ++ + V F ++ R
Sbjct: 60 AEQPIDPALF-PGADPDDLCPGAMV------FRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR W++ P SW HP G DS + R +HPVV V++ DA AY
Sbjct: 96 WR-------QWWQW----------VPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G + WG+ +T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEVTSDGRLMANTWQGSFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV ++ N FGL +M+GNVWEWTA ++ HH P P GP + KGGS
Sbjct: 198 SPVGTFAPNAFGLVDMIGNVWEWTATEFSPHHRLHSPPKACCTPPGPADPSVSQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+LC +YC+R+R AARS + DS+ ++GFRC AD
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293
>gi|107027544|ref|YP_625055.1| hypothetical protein Bcen_5208 [Burkholderia cenocepacia AU 1054]
gi|105896918|gb|ABF80082.1| protein of unknown function DUF323 [Burkholderia cenocepacia AU
1054]
Length = 377
Score = 204 bits (518), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 124/353 (35%), Positives = 171/353 (48%), Gaps = 58/353 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DM +PG TF MG++ + E P+ T+ F+LD+H+V+N +F+ FV ATGYVT A
Sbjct: 42 DMAWIPGGTFLMGSDA-KAAQPNERPAHRATVRGFWLDRHDVTNAEFRRFVDATGYVTTA 100
Query: 74 EKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
E+ D L R S + F G +
Sbjct: 101 ERKPDWSALRAQLPPGTPRPPDSALVPGALVFVGTAGPVP-------------------- 140
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
L A W R P +W HP+G STI + ++PVV VS++DA AY
Sbjct: 141 -----LDDYARWW---------RFLPGANWRHPDGPGSTIAGKDDYPVVQVSYDDARAYA 186
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADG 248
W G RLPTEAEWE+ RGGLE + WGN L P G AN+W+G+ FP + +
Sbjct: 187 QWAGKRLPTEAEWEFAARGGLEQADYAWGNQLKPAGRQMANIWEGQQPQPFPVVDARSGK 246
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAPSYNPK 293
+ +PV +Y N +GLY+M GNVW+W ADW+ PA S++P
Sbjct: 247 PAAPSPVGTYPPNGYGLYDMAGNVWQWVADWYRADAFLLAVESGRPVADPQGPADSWDPT 306
Query: 294 GPTT---GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
P +V +GGS+LCNE YC +R +AR P++ +LGFR A D+
Sbjct: 307 EPGVPPHAPKRVIRGGSFLCNESYCQSYRPSARRGVDPEAGMSHLGFRLAMDE 359
>gi|406032646|ref|YP_006731538.1| sulfatase-modifying factor 1 [Mycobacterium indicus pranii MTCC
9506]
gi|405131193|gb|AFS16448.1| Sulfatase-modifying factor 1 [Mycobacterium indicus pranii MTCC
9506]
Length = 296
Score = 204 bits (518), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 126/336 (37%), Positives = 170/336 (50%), Gaps = 50/336 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++V LPG +FRMG+ E P VT+ +++H V+N QF EFV ATGY+T
Sbjct: 3 SELVDLPGGSFRMGSTS---FYPEEAPIHTVTVGPLAVERHPVTNAQFAEFVDATGYLTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ P ++ + V F ++ R
Sbjct: 60 AEQPIDPALY-PGADPDDLCPGAMV------FRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR +W R P SW HP G DS + R +HPVV V++ DA AY
Sbjct: 96 WR--------QWW---------RWVPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G + WG+ +T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEVTSDGRLMANTWQGSFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV ++ N FGL +M+GNVWEWTA ++ HH P P GP + KGGS
Sbjct: 198 SPVGTFAPNAFGLIDMIGNVWEWTATEFSPHHRLDSAPKACCTPPGPADPSVSQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+LC +YC+R+R AARS + DS+ ++GFRC AD
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293
>gi|108797997|ref|YP_638194.1| hypothetical protein Mmcs_1024 [Mycobacterium sp. MCS]
gi|119867093|ref|YP_937045.1| hypothetical protein Mkms_1041 [Mycobacterium sp. KMS]
gi|108768416|gb|ABG07138.1| protein of unknown function DUF323 [Mycobacterium sp. MCS]
gi|119693182|gb|ABL90255.1| protein of unknown function DUF323 [Mycobacterium sp. KMS]
Length = 290
Score = 203 bits (517), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 122/332 (36%), Positives = 170/332 (51%), Gaps = 49/332 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG TFRMG+ + E P VT+ F +++H V+N QF EFV ATGYVT A
Sbjct: 4 ELVELPGGTFRMGSTQ---FYPEEVPVHTVTVADFAIERHPVTNAQFAEFVDATGYVTIA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ P + + + V F+ ++ R D + TW
Sbjct: 61 EQALDPALY-PGVPPADLVPGALV------FQPTAGPVDLR-----------DWRQWWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P W HP G +S+ R +HPVV V++ DA AY
Sbjct: 103 S-----------------------PGACWRHPLGPESSWHDRPDHPVVQVAYPDAAAYAK 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEA+WEY G +PWG++ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAQWEYAAHAGAAT-TYPWGDDPQPGGRLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY----NPKGPTTGTDKVKKGGSYL 309
PV ++ N FGL +M+GNVWEWT ++ HH P + P P ++ KGGS+L
Sbjct: 199 PVGTFPANGFGLVDMIGNVWEWTTTRFSAHHRPDGAVESCCGPADPDPSVNQALKGGSHL 258
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
C +YC+R+R AARS + DS+ ++GFRC A
Sbjct: 259 CAPEYCHRYRAAARSPQSQDSATTHIGFRCVA 290
>gi|405380290|ref|ZP_11034130.1| hypothetical protein PMI11_04116 [Rhizobium sp. CF142]
gi|397323155|gb|EJJ27553.1| hypothetical protein PMI11_04116 [Rhizobium sp. CF142]
Length = 334
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 171/344 (49%), Gaps = 65/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ +PG TF MG+N E P+ V +D F++D+ V+N +F EFV ATGYVT AE
Sbjct: 18 MIWIPGRTFTMGSND---HYPEEAPAHPVKVDGFWIDEIPVTNRRFTEFVKATGYVTFAE 74
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K D + E RA S + K G D T W W
Sbjct: 75 KTPDPKDYPGAKPEMLRAG-SLLFFQPKTVSGPDIT------------QW--------W- 112
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+++G +W P G S + +++HPVVHV+W DA AY W
Sbjct: 113 --------TFKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAWCDAKAYADW 152
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE RGGL++ F WG+ L P G AN WQG FP +T G+ T+P
Sbjct: 153 AGLELPTEAEWELAARGGLDDTEFAWGDELMPDGVPMANTWQGTFPVKSTKPKGHERTSP 212
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
V S+ N +GLY+M+GNVWEWT D+W+ HP P SY+P+ P
Sbjct: 213 VRSFPPNGYGLYDMIGNVWEWTTDYWS-DRHPEPSAKACCIPDNPRGGGMEASYDPRQPN 271
Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S +LGFRC
Sbjct: 272 IRIPRRVLKGGSHLCAPSYCRRYRPAARHAEPEDTSTSHLGFRC 315
>gi|254820468|ref|ZP_05225469.1| hypothetical protein MintA_11096 [Mycobacterium intracellulare ATCC
13950]
gi|379756362|ref|YP_005345034.1| hypothetical protein OCO_43500 [Mycobacterium intracellulare
MOTT-02]
gi|378806578|gb|AFC50713.1| hypothetical protein OCO_43500 [Mycobacterium intracellulare
MOTT-02]
Length = 296
Score = 203 bits (517), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 126/336 (37%), Positives = 169/336 (50%), Gaps = 50/336 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++V LPG +FRMG+ E P V + F +++H V+N QF EFV ATGY+T
Sbjct: 3 SELVDLPGGSFRMGSTS---FYPEEAPIHTVAVGPFAVERHPVTNAQFAEFVDATGYLTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ P ++ + V F ++ R
Sbjct: 60 AEQPIDPALY-PGADPDDLCPGAMV------FRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR +W R P SW HP G DS + R +HPVV V++ DA AY
Sbjct: 96 WR--------QWW---------RWVPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G + WG+ T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEATSAGRLMANTWQGSFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV ++ N FGL +M+GNVWEWTA ++ HH P P GP + KGGS
Sbjct: 198 SPVGTFAPNAFGLIDMIGNVWEWTATEFSPHHRLDSAPKACCTPPGPADPSVSQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+LC +YC+R+R AARS + DS+ ++GFRC AD
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293
>gi|16264068|ref|NP_436860.1| hypothetical protein SM_b20334 [Sinorhizobium meliloti 1021]
gi|15140193|emb|CAC48720.1| HYPOTHETICAL PROTEIN SM_b20334 [Sinorhizobium meliloti 1021]
Length = 303
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 126/343 (36%), Positives = 167/343 (48%), Gaps = 61/343 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF MG+N E P VT+D F++D V+N QF EFV+ATG+VT AE
Sbjct: 1 MVWVPGATFMMGSNDHY---PEEAPVHPVTVDGFWIDVTPVTNRQFLEFVNATGHVTFAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ +P + A S +R F ++
Sbjct: 58 R-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------------- 88
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G + ++W G +W HP G S+I ++HPVVHV+++DA AY W
Sbjct: 89 GTDI---SQWWIFTLGA---------NWRHPLGRKSSIGAILDHPVVHVAYSDAKAYAEW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTE EWE RGGL+ F WG L P G H AN WQG FP N+ DG+ T+P
Sbjct: 137 AGKDLPTETEWELAARGGLDGAEFSWGGELAPGGNHMANTWQGSFPVENSMDDGFARTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYNPKGPTT 297
V Y N +GLY+M+GNVWEWT D+W+V H A S +P
Sbjct: 197 VRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASIDPAASVK 256
Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+V KGGS+LC YC R+R AAR D++ ++GFRC
Sbjct: 257 VPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGFRCV 299
>gi|404259561|ref|ZP_10962870.1| putative sulfatase-modifying factor [Gordonia namibiensis NBRC
108229]
gi|403401908|dbj|GAC01280.1| putative sulfatase-modifying factor [Gordonia namibiensis NBRC
108229]
Length = 267
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 121/309 (39%), Positives = 161/309 (52%), Gaps = 59/309 (19%)
Query: 35 DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
DGE P V L AF +D V+N F FV ATGYVT+AE RA I
Sbjct: 12 DGEVPVHGVELRAFSVDVTSVTNAAFTAFVEATGYVTDAE----------------RAGI 55
Query: 95 SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
S V H +Y T G +P W
Sbjct: 56 SAVFH-----------------------------SYATAPGEPVPETPWW---------- 76
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
P SW HP G S + + HPVVHVS +DA+AYC+W G RLPTEA+WEY RGG +
Sbjct: 77 LATPGASWRHPGGPGSDLTGKDTHPVVHVSHDDALAYCSWAGRRLPTEAQWEYAARGGRD 136
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
+PWG+ + R N+++GEFP T G++ T V +++ N +GLY VGNVWE
Sbjct: 137 GARYPWGDTRPSPEDPRCNIFRGEFPDRPT---GHVGTVDVRTFEPNGYGLYQCVGNVWE 193
Query: 275 WTADWWNV-HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
W AD ++ ++ + + +P GPT GT +V +GGS+LC++ YC R+R AARS NTP+S+
Sbjct: 194 WCADRFSARYYRTSATVDPTGPTRGTLRVLRGGSHLCHDSYCNRYRVAARSSNTPNSTTS 253
Query: 334 NLGFRCAAD 342
N+GFR D
Sbjct: 254 NIGFRTVGD 262
>gi|225012451|ref|ZP_03702887.1| protein of unknown function DUF323 [Flavobacteria bacterium
MS024-2A]
gi|225003428|gb|EEG41402.1| protein of unknown function DUF323 [Flavobacteria bacterium
MS024-2A]
Length = 345
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 127/350 (36%), Positives = 178/350 (50%), Gaps = 52/350 (14%)
Query: 7 PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
PP E MVL+P + MG D EFP V++ AFY+DQ+EV+N +F+ FV A
Sbjct: 33 PPPE---GMVLVPAGIYTMGGRSSQAYVD-EFPKHEVSISAFYMDQNEVTNQEFKLFVEA 88
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
TGY T AE+ D +E + T+ + P +
Sbjct: 89 TGYKTIAERDID-------------------------WEEMKKTVPLGVAKPHDSLLKAG 123
Query: 127 AVAYCTWRG-ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
++ + +G L ++W G +W HPEG S++E+RM HPVVH++W
Sbjct: 124 SLVFKQTKGPVNLEDYSQWWRWTTGA---------NWQHPEGSGSSLENRMEHPVVHIAW 174
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
+DA+AY TW RLPTEAEWE+ +GG ++PWGN +AN WQG FP NN
Sbjct: 175 DDALAYATWANKRLPTEAEWEWAAQGGNSGNIYPWGNTNVEAAFDQANFWQGVFPYNNRE 234
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-----YNPKGPTTGTD 300
DGY TAP+ S+ N +GL++M GNVWEW D ++ + NP+GP D
Sbjct: 235 LDGYFGTAPITSFPPNGYGLFDMAGNVWEWCQDKYDFKAYEKAQNKGVVQNPQGPRQYND 294
Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V +GGS+LCN+ YC +R + R ++ DSS + GFRC D
Sbjct: 295 PREPLSPKHVMRGGSFLCNDSYCSGYRVSRRMSSSRDSSFNHTGFRCVQD 344
>gi|16125425|ref|NP_419989.1| hypothetical protein CC_1173 [Caulobacter crescentus CB15]
gi|221234168|ref|YP_002516604.1| NIRV family protein [Caulobacter crescentus NA1000]
gi|13422495|gb|AAK23157.1| conserved hypothetical protein [Caulobacter crescentus CB15]
gi|220963340|gb|ACL94696.1| NIRV family protein [Caulobacter crescentus NA1000]
Length = 338
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 129/348 (37%), Positives = 172/348 (49%), Gaps = 59/348 (16%)
Query: 4 LPAP-PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
LP P P R MV L G F+MG ++ E P + VT+ F++DQ EV+N F
Sbjct: 30 LPVPDPQNRTAGMVRLAGGDFQMGAAP---LRPEEGPPQTVTVPPFWIDQTEVTNAAFAR 86
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV ATGY T AE+ PL + A+ + V +R L
Sbjct: 87 FVEATGYRTVAER--------PL----DPARYAHVPAAQRRPASL--------------- 119
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
+ +GAR ++W ++ P W HPEG S I R PVVH
Sbjct: 120 ------VFVGAKGARSDDPSQWW---------QVIPGADWRHPEGPGSNIRGRDAWPVVH 164
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
++W DA+AY W G LPTEAEWEY RGGL + + WG+ + RAN WQG FP
Sbjct: 165 IAWEDAMAYARWLGRDLPTEAEWEYAARGGLVGKRYTWGDQAQDPAKPRANTWQGVFPAQ 224
Query: 243 NTAADGYLST-APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----------APSY 290
+ DG+ + APV + N +GL +M GNVWEWT DW+ P A +
Sbjct: 225 DLGNDGFKAKPAPVGCFPPNGYGLRDMAGNVWEWTRDWFKPGLDPVSVLETGGPPEARAL 284
Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+P+ P T V KGGS+LC + YC+R+R AAR+ PDS A ++GFR
Sbjct: 285 DPEDPNT-PKHVVKGGSFLCADDYCFRYRPAARTPGPPDSGASHVGFR 331
>gi|357023852|ref|ZP_09086020.1| hypothetical protein MEA186_04144 [Mesorhizobium amorphae
CCNWGS0123]
gi|355544285|gb|EHH13393.1| hypothetical protein MEA186_04144 [Mesorhizobium amorphae
CCNWGS0123]
Length = 314
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 129/343 (37%), Positives = 172/343 (50%), Gaps = 60/343 (17%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+DMV +PG F MG++ E P+ V + F++D+ V+N +F+ FV A+ YVT
Sbjct: 11 EDMVWIPGGEFLMGSDHHY---PEEAPAHRVVVGGFWMDRTTVTNAEFRRFVEASCYVTL 67
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE + + P + A S + ++ G+D + H +W
Sbjct: 68 AEHPANAADY-PGAGADSLAPASALF--VRPARGIDRS---------NHYNW-------- 107
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W Y RG +W HP G ST++ HPVVH+++ DA AY
Sbjct: 108 -----------WAY-VRGA---------NWRHPRGPASTLKGLEAHPVVHIAYEDAQAYA 146
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ F WG+ LTPRG+H AN +QGEFP N DGY T
Sbjct: 147 NWAGKALPTEAEWEFAARGGLDGADFVWGDELTPRGQHMANTYQGEFPFRNDCEDGYEWT 206
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH---------------HPAPSYNPKGPTT 297
APV S+ N +GLY+M GNVW+WT DW+ H SY+P P
Sbjct: 207 APVGSFPANGYGLYDMAGNVWQWTTDWYQEHRLIESPCCTMTNPRGGEREASYDPLTPDI 266
Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D+S +LGFRC
Sbjct: 267 RIPRKVTKGGSFLCAPSYCRRYRPAARMAQPVDTSTCHLGFRC 309
>gi|148553970|ref|YP_001261552.1| hypothetical protein Swit_1048 [Sphingomonas wittichii RW1]
gi|148499160|gb|ABQ67414.1| protein of unknown function DUF323 [Sphingomonas wittichii RW1]
Length = 304
Score = 202 bits (515), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/343 (36%), Positives = 171/343 (49%), Gaps = 62/343 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG+++ E P+R V +D F++D+ V+N F FV+ATG+VT AE
Sbjct: 1 MAFVPGGVFTMGSDR---FYPEEAPARRVRVDPFFIDEAPVTNADFARFVAATGHVTCAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + +L E A S + R PV
Sbjct: 58 IAPDPKDYPGMLPELAHAG---------------SAVFTRTDRPV--------------- 87
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
R P++ W R W HP G DS+I+ +HPVVHV+ DA AY W
Sbjct: 88 DTRDPSQ--WW---------RFVLGADWRHPCGPDSSIDGLDDHPVVHVACADAEAYARW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGLE+ + WG+ L P G AN WQG FP NT DG+ T+P
Sbjct: 137 AGKALPTEAEWEFAARGGLEDADYAWGDELAPDGRMMANYWQGLFPYANTLDDGWERTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWN-------------VHHHPA-----PSYNPKGPT 296
+ S+ N +GL++M+GNVWEWT+DW+ + +P S++P+ P
Sbjct: 197 IRSFPPNGYGLHDMIGNVWEWTSDWYAPPKIERKAKGSCCIPANPRGARRHDSFDPQSPA 256
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 257 RIPRRVLKGGSHLCAPSYCQRYRPAARHPEAIDTSTSHIGFRC 299
>gi|326430702|gb|EGD76272.1| sulfatase modifying factor 2 [Salpingoeca sp. ATCC 50818]
Length = 299
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 129/336 (38%), Positives = 167/336 (49%), Gaps = 67/336 (19%)
Query: 15 MVLLPGD--TFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
M+ +PG+ TFR+GT K G P R VT+D FYLD++ VSN F+ FV T YVT+
Sbjct: 20 MIYMPGEIGTFRLGTKKADSTH-GITPQRKVTVDPFYLDRYAVSNADFRAFVRDTKYVTD 78
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
+EK+G +FVF+ + SQ + + G D W AV
Sbjct: 79 SEKYGWSFVFKTFVD-----TASQDLAETQAMPGAD---------------WWLAVPRAY 118
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR P G S I+ R+ PVV VSWNDA YC
Sbjct: 119 WR-----------------------------KPFGKGSGIKDRLQFPVVQVSWNDAQQYC 149
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY RGGL++ +PWG+ ++R N WQG+FP N DGY
Sbjct: 150 DWAGKRLPTEAEWEYAVRGGLQDSAYPWGSRYE---KNRMNAWQGKFPDTNVLGDGYHGA 206
Query: 253 APVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL-- 309
APV +Y +N +G YNM+GNVWEW AD PS PK V +GGSY+
Sbjct: 207 APVDAYGPQNDYGFYNMLGNVWEWVAD--RFEDPSPPSQEPK-------FVLRGGSYVDT 257
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+ + ++ R R NT DS++ N+GFRCA P
Sbjct: 258 VDGKTNHKVRVTTRMGNTADSASDNIGFRCARSATP 293
>gi|170744776|ref|YP_001773431.1| hypothetical protein M446_6750 [Methylobacterium sp. 4-46]
gi|168199050|gb|ACA20997.1| protein of unknown function DUF323 [Methylobacterium sp. 4-46]
Length = 307
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 126/346 (36%), Positives = 174/346 (50%), Gaps = 64/346 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TFRMG+++ E P+ V + +F++D+ V+N QF+EFV ATG+VT AE
Sbjct: 1 MRWIPGGTFRMGSDR---HYPEEAPAHRVAVRSFWMDETPVTNRQFREFVRATGHVTFAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ R D K + G + + +V + V W
Sbjct: 58 R----------------------RPDPKDYPGALPNLLYAGS--IVFTPPDHPVDLRLW- 92
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G+ L W HP G ST+ NHPVVHV++ DA+AY W
Sbjct: 93 ----------------GVWWTLLEGADWRHPYGPRSTLHGLDNHPVVHVTYADALAYARW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGLE F WG+ TP G H AN WQG+FP N A DG+ T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGLEGAEFAWGDAFTPAGRHMANTWQGQFPWQNLAQDGFARTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD---------- 300
++ N +GL++M+GNVWEWT+D++ HPA + P+ P GT+
Sbjct: 197 AKAFPPNGYGLHDMIGNVWEWTSDYF-ADRHPADAPKACCIPRDPRGGTEERSYDPRLPE 255
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+V KGGS+LC +C R+R AAR D+S ++GFRC
Sbjct: 256 IRIARRVLKGGSHLCAPNFCSRYRPAARLPQPVDTSTSHVGFRCVV 301
>gi|386821043|ref|ZP_10108259.1| hypothetical protein JoomaDRAFT_3026 [Joostella marina DSM 19592]
gi|386426149|gb|EIJ39979.1| hypothetical protein JoomaDRAFT_3026 [Joostella marina DSM 19592]
Length = 367
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 127/356 (35%), Positives = 186/356 (52%), Gaps = 59/356 (16%)
Query: 9 VERYKD------MVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
+E+ KD MV +PG ++ G + + E P+ V +D F++D EV+N ++
Sbjct: 40 IEKPKDIKTPEGMVWIPGGSYTQGARSDDKFALNHEKPAHPVEVDGFFMDVTEVTNDEYA 99
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI---SQVRHDMKRFEGLDSTIEHRMHHP 118
+FV+ATGY T AE+ + EE + ++ + HD G S I +
Sbjct: 100 KFVAATGYKTVAER--------AIEWEEIKKQVPPGTPKPHDSILQPG--SLIFKKTKTS 149
Query: 119 VVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNH 178
V ++ + W EW+ G W HP+G DS+IE + ++
Sbjct: 150 VPNL-----YDFSQWW--------EWKIGA------------DWKHPKGPDSSIEGKGDY 184
Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
PVVH+++ DA+AYC W G RLPTE+EWE GG+ N ++ WG + + + +AN W GE
Sbjct: 185 PVVHIAYEDAIAYCEWAGRRLPTESEWELAAHGGVNNSMYAWGEDKS-KLTAQANTWNGE 243
Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPK 293
FPT N ADGY + APV S+ N +GLY M GNVWEWT DW+NV+++ NP+
Sbjct: 244 FPTTNNKADGYENKAPVKSFPPNAYGLYEMSGNVWEWTQDWFNVNYYGEMLEQGLVVNPQ 303
Query: 294 GPTTG--------TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
G T +KV KGGS+LC++ YC +R +AR DS +LGFR A
Sbjct: 304 GATKAYNPNNRYVQEKVIKGGSFLCSDTYCSSYRISARMATALDSGQEHLGFRTVA 359
>gi|325673474|ref|ZP_08153165.1| sulfatase-modifying factor 1 [Rhodococcus equi ATCC 33707]
gi|325555495|gb|EGD25166.1| sulfatase-modifying factor 1 [Rhodococcus equi ATCC 33707]
Length = 324
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 127/356 (35%), Positives = 175/356 (49%), Gaps = 63/356 (17%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P K+MV +PG TF MG+ E P V +D F++D H+V+ +F+ FV AT
Sbjct: 11 PSRAPKNMVWVPGGTFWMGSED---FYPEERPVHEVGVDGFWMDTHQVTVAEFRRFVKAT 67
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G+VT AE D + P S V P +S +D
Sbjct: 68 GHVTTAETAPDPADY-PDADPALLVPGSLV-----------------FTPPAQPVSLDDY 109
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ +W P W HPEG S + R HPV HVS++D
Sbjct: 110 QQWWSW-----------------------VPGADWRHPEGPGSNVGGRERHPVTHVSYSD 146
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G LPTEAEWE+ RGGL+ + F WG+ TP G H+AN WQG+FP N A D
Sbjct: 147 AQAYAAWAGKTLPTEAEWEFAARGGLDRKTFVWGDEFTPDGRHQANTWQGQFPWQNLAED 206
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGT 299
G++ T+PV +++ N +GL ++ GNVWEWT D+ H + AP+ + P+ P
Sbjct: 207 GFVGTSPVGTFRPNGYGLVDVAGNVWEWTTDFHTADHGASGKNVAPASSCCIPRNPRVEI 266
Query: 300 DKVK----------KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+V+ KGGS+LC YC R+R AAR T ++S ++GFRC +GP
Sbjct: 267 GEVQAGEAYPRRVIKGGSHLCAPNYCLRYRPAARQGETEETSTCHIGFRCIV-RGP 321
>gi|334142809|ref|YP_004536017.1| hypothetical protein [Novosphingobium sp. PP1Y]
gi|333940841|emb|CCA94199.1| conserved hypothetical protein [Novosphingobium sp. PP1Y]
Length = 383
Score = 202 bits (514), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 127/346 (36%), Positives = 170/346 (49%), Gaps = 65/346 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ +PG TF MG+ E P R V +D F++D+ V+N +F FV ATGYVT AE
Sbjct: 76 MIHVPGGTFTMGSEN---FYPEEMPLRRVKVDPFWIDETPVTNREFARFVEATGYVTVAE 132
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + +L ++A S + H+ PV D W
Sbjct: 133 IAPDPKDYPGMLPGMDKAG---------------SLVFHKTATPV------DTSNPANW- 170
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W + +F W HP G DS ++ +HPVVH+++ DA AY W
Sbjct: 171 ---------WSF---------VFG-ADWCHPLGPDSNLDELWDHPVVHIAYADAEAYAKW 211
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAE+E+ RGGLE + + WG+ L P G AN WQG FP N DG+ T+P
Sbjct: 212 AGKALPTEAEFEFAARGGLEGKDYAWGDELAPDGAMMANYWQGLFPFANQCLDGWERTSP 271
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWN---------------VHHHP-----APSYNPKG 294
V ++ N +GLY+M+GN WEWTADWW+ V +P SY+P
Sbjct: 272 VRTFPANGYGLYDMIGNTWEWTADWWSERPQTVAKKKPSACCVMENPRGAKLKDSYDPVQ 331
Query: 295 PTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P KV KGGS+LC YC R+R AAR DS+ ++GFRC
Sbjct: 332 PGVRIGRKVLKGGSHLCAANYCQRYRPAARHPEMIDSATTHIGFRC 377
>gi|227537658|ref|ZP_03967707.1| sulfatase-modifying factor 1 [Sphingobacterium spiritivorum ATCC
33300]
gi|227242489|gb|EEI92504.1| sulfatase-modifying factor 1 [Sphingobacterium spiritivorum ATCC
33300]
Length = 363
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 121/339 (35%), Positives = 174/339 (51%), Gaps = 57/339 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G ++ MG+ +D + P V + +F++D+HEV+N QF +FV ATGY+T AE
Sbjct: 69 MVKIEGGSYDMGSAD---FEDAK-PVHKVNVKSFWMDEHEVTNAQFAKFVKATGYLTVAE 124
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D F P + E S V H K GL+ ++ W
Sbjct: 125 RPLDARDF-PGVDPEMLKPGSAVFHAPKEIRGLNDPLQ--------------------W- 162
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y +W HPEG +S I+ + N+PVVH+++ DA AY W
Sbjct: 163 ---------WSYVVGA----------NWKHPEGPESNIKGKENYPVVHIAYQDAEAYAKW 203
Query: 195 RGARLPTEAEWEYGCR-GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
G RLPTEAEWEY + N + WG T G+ AN++QGEFP +T DGY+ A
Sbjct: 204 AGKRLPTEAEWEYAAKTANYNNEKYYWGTEKTENGKWAANIYQGEFPLKDTGEDGYVGVA 263
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----------PAPSYNPKGPTTGTDKVK 303
PV SY N GLY+M GNVWEW +D++ ++ P SY+P+ P +V+
Sbjct: 264 PVKSYPANTSGLYDMEGNVWEWCSDFYRPDYYKNSAESNPTGPRDSYDPQEPGA-VKRVQ 322
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+GGS+LCN+ YC R++ +R + +S N+GFR D
Sbjct: 323 RGGSFLCNDMYCERYKAGSRGKGEVNSPTNNVGFRLVKD 361
>gi|126433661|ref|YP_001069352.1| hypothetical protein Mjls_1053 [Mycobacterium sp. JLS]
gi|126233461|gb|ABN96861.1| protein of unknown function DUF323 [Mycobacterium sp. JLS]
Length = 290
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 121/332 (36%), Positives = 170/332 (51%), Gaps = 49/332 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG TFRMG+ + E P V++ F +++H V+N QF EFV ATGYVT A
Sbjct: 4 ELVELPGGTFRMGSTQ---FYPEEVPVHTVSVADFAIERHPVTNAQFAEFVDATGYVTIA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ P + + + V F+ ++ R D + TW
Sbjct: 61 EQALDPALY-PGVPPADLVPGALV------FQPTAGPVDLR-----------DWRQWWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P W HP G +S+ R +HPVV V++ DA AY
Sbjct: 103 S-----------------------PGACWRHPFGPESSWHDRPDHPVVQVAYPDAAAYAK 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEA+WEY G +PWG++ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAQWEYAAHAGAAT-TYPWGDDPQPGGRLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY----NPKGPTTGTDKVKKGGSYL 309
PV ++ N FGL +M+GNVWEWT ++ HH P + P P ++ KGGS+L
Sbjct: 199 PVGTFPANGFGLVDMIGNVWEWTTTRFSAHHRPDGAVESCCGPADPDPSVNQALKGGSHL 258
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
C +YC+R+R AARS + DS+ ++GFRC A
Sbjct: 259 CAPEYCHRYRAAARSPQSQDSATTHIGFRCVA 290
>gi|379749045|ref|YP_005339866.1| hypothetical protein OCU_43260 [Mycobacterium intracellulare ATCC
13950]
gi|379763897|ref|YP_005350294.1| hypothetical protein OCQ_44610 [Mycobacterium intracellulare
MOTT-64]
gi|378801409|gb|AFC45545.1| hypothetical protein OCU_43260 [Mycobacterium intracellulare ATCC
13950]
gi|378811839|gb|AFC55973.1| hypothetical protein OCQ_44610 [Mycobacterium intracellulare
MOTT-64]
Length = 296
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 124/336 (36%), Positives = 167/336 (49%), Gaps = 50/336 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++V LPG +FRMG+ E P V + F +++H V+N QF EFV ATGY+T
Sbjct: 3 SELVDLPGGSFRMGSTS---FYPEEAPIHTVAVGPFAVERHPVTNAQFAEFVDATGYLTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ A + F ++ R
Sbjct: 60 AEQPIDPALYS-------GADPDDLCPGAMVFRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR W++ P SW HP G DS + R +HPVV V++ DA AY
Sbjct: 96 WR-------QWWQW----------VPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G + WG+ +T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEVTSDGRLMANTWQGSFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV ++ N FGL +M+GNVWEWTA ++ HH P P GP + KGGS
Sbjct: 198 SPVGTFAPNAFGLIDMIGNVWEWTATEFSPHHRLDSPPKACCTPPGPADPSVSQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+LC +YC+R+R AARS + DS+ ++GFRC AD
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293
>gi|345321597|ref|XP_001521381.2| PREDICTED: sulfatase-modifying factor 1-like, partial
[Ornithorhynchus anatinus]
Length = 222
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 91/133 (68%), Positives = 106/133 (79%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
+W HPEG DSTI +R +HPV+HVSWNDAVAYCTW G RLPTEAEWEY CRGGLENRLFP
Sbjct: 13 ANWRHPEGPDSTILNRPDHPVLHVSWNDAVAYCTWAGKRLPTEAEWEYSCRGGLENRLFP 72
Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
WGN L P+G+H AN+WQGEFP +NT DGY TAPV ++ N +GLYN+VGN WEWT+DW
Sbjct: 73 WGNKLQPKGQHYANIWQGEFPVSNTGEDGYKGTAPVTAFPPNGYGLYNIVGNAWEWTSDW 132
Query: 280 WNVHHHPAPSYNP 292
W V+H S +P
Sbjct: 133 WAVNHSTHESQDP 145
>gi|443307542|ref|ZP_21037329.1| hypothetical protein W7U_17885 [Mycobacterium sp. H4Y]
gi|442764910|gb|ELR82908.1| hypothetical protein W7U_17885 [Mycobacterium sp. H4Y]
Length = 296
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 124/336 (36%), Positives = 167/336 (49%), Gaps = 50/336 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++V LPG +FRMG+ E P V + F +++H V+N QF EFV ATGY+T
Sbjct: 3 SELVDLPGGSFRMGSTS---FYPEEAPIHTVAVGPFAVERHPVTNAQFAEFVDATGYLTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ A + F ++ R
Sbjct: 60 AEQPIDPALYS-------GADPDDLCPGAMVFRPTPGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR W++ P SW HP G DS + R +HPVV V++ DA AY
Sbjct: 96 WR-------QWWQW----------VPGASWRHPFGPDSDVADRGDHPVVQVAYPDAAAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY R G + WG+ +T G AN WQG FP N A G++ T
Sbjct: 139 RWAGRRLPTEAEWEYAARAG-STTTYSWGDEVTSDGRLMANTWQGSFPYRNDGALGWVGT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPT-TGTDKVKKGGS 307
+PV ++ N FGL +M+GNVWEWTA ++ HH P P GP + KGGS
Sbjct: 198 SPVGTFAPNAFGLVDMIGNVWEWTATEFSPHHRLDSPPKACCTPPGPADPSVSQTLKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+LC +YC+R+R AARS + DS+ ++GFRC AD
Sbjct: 258 HLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVADS 293
>gi|126725775|ref|ZP_01741617.1| hypothetical protein RB2150_06203 [Rhodobacterales bacterium
HTCC2150]
gi|126704979|gb|EBA04070.1| hypothetical protein RB2150_06203 [Rhodobacterales bacterium
HTCC2150]
Length = 310
Score = 202 bits (513), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/326 (39%), Positives = 168/326 (51%), Gaps = 62/326 (19%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+PG T +GT +P+L D E P R L F + Q V+N F +FV T YVTEAE+FG
Sbjct: 39 IPGGTSMLGTAQPMLQNDNEEPLRKKQLKPFKMMQAAVTNQMFADFVEDTNYVTEAEEFG 98
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
+FVF LS+ +D T V W WR
Sbjct: 99 WSFVFYQHLSD-----------------AVDET------QAVEGTQW--------WRRVE 127
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
+A W RL G S + + +HPVVHVSW DA A+ W G
Sbjct: 128 ---DANW----------RLI--------GGPKSETDFKPDHPVVHVSWADARAFAKWAGG 166
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR-ANVWQGEFPTNNTAADGYLSTAPVM 256
RLP EAEWE+ RGGL + FPWG + + N+WQG FPT+N+ ADGY +TAP +
Sbjct: 167 RLPNEAEWEHAARGGLGDVPFPWGEKEPNDTDFQPCNIWQGTFPTHNSLADGYSTTAPAI 226
Query: 257 SYKENKFGLYNMVGNVWEWTADWW----NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
S+ N +GLYNM GNVWEWT++ + N A PKG K+ KGGS+LC++
Sbjct: 227 SFDANGYGLYNMCGNVWEWTSEPFKYRSNSKKKQAMDPRPKG-----SKLLKGGSFLCHK 281
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFR 338
YC+R+R AAR+ TPD++ + GFR
Sbjct: 282 SYCFRYRIAARTGTTPDTATSHQGFR 307
>gi|75910437|ref|YP_324733.1| hypothetical protein Ava_4239 [Anabaena variabilis ATCC 29413]
gi|75704162|gb|ABA23838.1| Protein of unknown function DUF323 [Anabaena variabilis ATCC 29413]
Length = 332
Score = 201 bits (512), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 131/349 (37%), Positives = 172/349 (49%), Gaps = 69/349 (19%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
KDMV +PGDTF MG++ E P+R VT+ F++D++ V+N QFQ FV T YVT
Sbjct: 25 KDMVWIPGDTFTMGSDDHY---PEEAPTRRVTVTGFWMDKYAVTNKQFQRFVKETKYVTV 81
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ + + L E L P + N C
Sbjct: 82 AERSPNPEDYPGALPE------------------LLVPASLVFEQPPYRVDLNS----CN 119
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W Y P +W +PEG S+I+ R N+PVVH+++ DA AY
Sbjct: 120 W----------WNY----------VPGANWRNPEGPGSSIKGRENYPVVHIAYEDAEAYA 159
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEA+WE+ RGGLE+ ++ WGN + AN+WQGEFP N +
Sbjct: 160 KWAGKALPTEAQWEFAARGGLESAVYVWGNEFF-HNKLLANIWQGEFPWQNLKSHP-PRP 217
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS--------YNPKGPTTGTD---- 300
PV SY N +GLY+M GNVWEWT DW+ P PS NP+G T
Sbjct: 218 EPVGSYPPNNYGLYDMTGNVWEWTMDWY--RSQPLPSKPKSCCIPVNPRGGTLEESFDPN 275
Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KV KGGS+LC E YC+R+R AAR T D+S ++GFRC +
Sbjct: 276 FSDIRIPRKVLKGGSFLCAENYCFRYRPAARHAETIDTSTCHIGFRCVS 324
>gi|357020744|ref|ZP_09082975.1| hypothetical protein KEK_12058 [Mycobacterium thermoresistibile
ATCC 19527]
gi|356478492|gb|EHI11629.1| hypothetical protein KEK_12058 [Mycobacterium thermoresistibile
ATCC 19527]
Length = 300
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 173/340 (50%), Gaps = 57/340 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D+V +PG TFRMG+ I E P VT+ F +++H V+N QF EFV+ATGY T A
Sbjct: 6 DLVEIPGGTFRMGS---IRFYPEEAPVHTVTVADFAIERHPVTNAQFAEFVAATGYRTVA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ + + + + + + PV W + TW
Sbjct: 63 EQPLDPRLYPGVAAADLQPG---------------ALVFTPTRGPVDLRDWRQ---WWTW 104
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDS-TIEHRMNHPVVHVSWNDAVAYC 192
P W HP G + + + R +HPVV +++ DA AY
Sbjct: 105 T-----------------------PGACWRHPFGPEGPSWQDRPDHPVVQIAYPDAQAYA 141
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTE EWEY R G + ++PWG+ P G AN WQG FP N A G++ T
Sbjct: 142 RWAGRRLPTEREWEYAARAGADT-VYPWGDEARPGGRLMANTWQGRFPYRNDGAAGWIGT 200
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------PSYNPKGPTTGTDK 301
+PV ++ N FGL++M+GNVWEWT ++ HH PA PS + P +
Sbjct: 201 SPVGAFPPNDFGLFDMIGNVWEWTTTRFSGHHRPAEADEPVRSCCTPSAPVERPDPAVSQ 260
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KGGS+LC +YC+R+R +ARS + DS+A ++GFRCAA
Sbjct: 261 TLKGGSHLCAPEYCHRYRASARSPQSQDSAATHIGFRCAA 300
>gi|359398736|ref|ZP_09191752.1| hypothetical protein NSU_1438 [Novosphingobium pentaromativorans
US6-1]
gi|357599974|gb|EHJ61677.1| hypothetical protein NSU_1438 [Novosphingobium pentaromativorans
US6-1]
Length = 322
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 169/348 (48%), Gaps = 65/348 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ +PG TF MG+ E P R V +D F++D+ V+N F FV ATGYVT AE
Sbjct: 15 MIHIPGGTFTMGSET---FYPEEMPLRRVKVDPFWIDETPVTNRDFARFVEATGYVTVAE 71
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + +L ++A S + H+ PV D W
Sbjct: 72 IAPDPKDYPGMLPGMDKAG---------------SLVFHKTATPV------DTSNPANW- 109
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W + +F W HP G DS ++ +HPVVH+++ DA AY W
Sbjct: 110 ---------WSF---------VFG-ADWCHPLGPDSNLDDLWDHPVVHIAYADAEAYAKW 150
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAE+E+ RGGLE + + WG+ L P G AN WQG FP N DG+ T+P
Sbjct: 151 AGKALPTEAEFEFAARGGLEGKDYAWGDELAPDGAMMANYWQGLFPFANQCLDGWERTSP 210
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWN---------------VHHHP-----APSYNPKG 294
V ++ N +GLY+M+GN WEWTADWW+ V +P SY+P
Sbjct: 211 VRTFPANGYGLYDMIGNTWEWTADWWSERPQTVAKKKPGACCVMENPRGAKLKDSYDPVQ 270
Query: 295 PTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
P KV KGGS+LC YC R+R AAR DS+ ++GFRC
Sbjct: 271 PGVRIGRKVLKGGSHLCAANYCQRYRPAARHPEMIDSATTHIGFRCVV 318
>gi|15607852|ref|NP_215226.1| Conserved protein [Mycobacterium tuberculosis H37Rv]
gi|15840119|ref|NP_335156.1| hypothetical protein MT0739 [Mycobacterium tuberculosis CDC1551]
gi|31791898|ref|NP_854391.1| hypothetical protein Mb0733 [Mycobacterium bovis AF2122/97]
gi|121636634|ref|YP_976857.1| hypothetical protein BCG_0762 [Mycobacterium bovis BCG str. Pasteur
1173P2]
gi|148660487|ref|YP_001282010.1| hypothetical protein MRA_0720 [Mycobacterium tuberculosis H37Ra]
gi|148821917|ref|YP_001286671.1| hypothetical protein TBFG_10726 [Mycobacterium tuberculosis F11]
gi|167968267|ref|ZP_02550544.1| hypothetical protein MtubH3_09659 [Mycobacterium tuberculosis
H37Ra]
gi|224989106|ref|YP_002643793.1| hypothetical protein JTY_0732 [Mycobacterium bovis BCG str. Tokyo
172]
gi|253797655|ref|YP_003030656.1| hypothetical protein TBMG_00726 [Mycobacterium tuberculosis KZN
1435]
gi|254231032|ref|ZP_04924359.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|254363658|ref|ZP_04979704.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
Haarlem]
gi|254549673|ref|ZP_05140120.1| hypothetical protein Mtube_04290 [Mycobacterium tuberculosis
'98-R604 INH-RIF-EM']
gi|289442113|ref|ZP_06431857.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289446272|ref|ZP_06436016.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289568656|ref|ZP_06448883.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289573320|ref|ZP_06453547.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289744436|ref|ZP_06503814.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289752759|ref|ZP_06512137.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289756800|ref|ZP_06516178.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|294992990|ref|ZP_06798681.1| hypothetical protein Mtub2_00405 [Mycobacterium tuberculosis 210]
gi|297633211|ref|ZP_06950991.1| hypothetical protein MtubK4_03765 [Mycobacterium tuberculosis KZN
4207]
gi|297730191|ref|ZP_06959309.1| hypothetical protein MtubKR_03820 [Mycobacterium tuberculosis KZN
R506]
gi|298524203|ref|ZP_07011612.1| conserved hypothetical protein [Mycobacterium tuberculosis
94_M4241A]
gi|306796467|ref|ZP_07434769.1| hypothetical protein TMFG_03384 [Mycobacterium tuberculosis
SUMu006]
gi|306806536|ref|ZP_07443204.1| hypothetical protein TMGG_03733 [Mycobacterium tuberculosis
SUMu007]
gi|306970925|ref|ZP_07483586.1| hypothetical protein TMJG_02459 [Mycobacterium tuberculosis
SUMu010]
gi|307083219|ref|ZP_07492332.1| hypothetical protein TMLG_03206 [Mycobacterium tuberculosis
SUMu012]
gi|313657518|ref|ZP_07814398.1| hypothetical protein MtubKV_03810 [Mycobacterium tuberculosis KZN
V2475]
gi|339630780|ref|YP_004722422.1| hypothetical protein MAF_07210 [Mycobacterium africanum GM041182]
gi|375294931|ref|YP_005099198.1| hypothetical protein TBSG_00730 [Mycobacterium tuberculosis KZN
4207]
gi|378770468|ref|YP_005170201.1| hypothetical protein BCGMEX_0733 [Mycobacterium bovis BCG str.
Mexico]
gi|383306613|ref|YP_005359424.1| hypothetical protein MRGA327_04455 [Mycobacterium tuberculosis
RGTB327]
gi|385990185|ref|YP_005908483.1| hypothetical protein [Mycobacterium tuberculosis CCDC5180]
gi|385993785|ref|YP_005912083.1| hypothetical protein [Mycobacterium tuberculosis CCDC5079]
gi|385997491|ref|YP_005915789.1| hypothetical protein MTCTRI2_0728 [Mycobacterium tuberculosis
CTRI-2]
gi|392385432|ref|YP_005307061.1| unnamed protein product [Mycobacterium tuberculosis UT205]
gi|392431141|ref|YP_006472185.1| hypothetical protein TBXG_000719 [Mycobacterium tuberculosis KZN
605]
gi|397672521|ref|YP_006514056.1| hypothetical protein RVBD_0712 [Mycobacterium tuberculosis H37Rv]
gi|422811652|ref|ZP_16860053.1| hypothetical protein TMMG_03419 [Mycobacterium tuberculosis
CDC1551A]
gi|424805790|ref|ZP_18231221.1| hypothetical protein TBPG_02993 [Mycobacterium tuberculosis W-148]
gi|424946481|ref|ZP_18362177.1| hypothetical protein NCGM2209_1095 [Mycobacterium tuberculosis
NCGM2209]
gi|449062734|ref|YP_007429817.1| hypothetical protein K60_007590 [Mycobacterium bovis BCG str. Korea
1168P]
gi|13880269|gb|AAK44970.1| hypothetical protein MT0739 [Mycobacterium tuberculosis CDC1551]
gi|31617485|emb|CAD93595.1| CONSERVED HYPOTHETICAL PROTEIN [Mycobacterium bovis AF2122/97]
gi|121492281|emb|CAL70748.1| Conserved hypothetical protein [Mycobacterium bovis BCG str.
Pasteur 1173P2]
gi|124600091|gb|EAY59101.1| conserved hypothetical protein [Mycobacterium tuberculosis C]
gi|134149172|gb|EBA41217.1| conserved hypothetical protein [Mycobacterium tuberculosis str.
Haarlem]
gi|148504639|gb|ABQ72448.1| hypothetical protein MRA_0720 [Mycobacterium tuberculosis H37Ra]
gi|148720444|gb|ABR05069.1| conserved hypothetical protein [Mycobacterium tuberculosis F11]
gi|224772219|dbj|BAH25025.1| hypothetical protein JTY_0732 [Mycobacterium bovis BCG str. Tokyo
172]
gi|253319158|gb|ACT23761.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
1435]
gi|289415032|gb|EFD12272.1| conserved hypothetical protein [Mycobacterium tuberculosis T46]
gi|289419230|gb|EFD16431.1| conserved hypothetical protein [Mycobacterium tuberculosis CPHL_A]
gi|289537751|gb|EFD42329.1| conserved hypothetical protein [Mycobacterium tuberculosis K85]
gi|289542410|gb|EFD46058.1| conserved hypothetical protein [Mycobacterium tuberculosis T17]
gi|289684964|gb|EFD52452.1| conserved hypothetical protein [Mycobacterium tuberculosis 02_1987]
gi|289693346|gb|EFD60775.1| conserved hypothetical protein [Mycobacterium tuberculosis EAS054]
gi|289712364|gb|EFD76376.1| conserved hypothetical protein [Mycobacterium tuberculosis T85]
gi|298493997|gb|EFI29291.1| conserved hypothetical protein [Mycobacterium tuberculosis
94_M4241A]
gi|308343126|gb|EFP31977.1| hypothetical protein TMFG_03384 [Mycobacterium tuberculosis
SUMu006]
gi|308347013|gb|EFP35864.1| hypothetical protein TMGG_03733 [Mycobacterium tuberculosis
SUMu007]
gi|308359546|gb|EFP48397.1| hypothetical protein TMJG_02459 [Mycobacterium tuberculosis
SUMu010]
gi|308367089|gb|EFP55940.1| hypothetical protein TMLG_03206 [Mycobacterium tuberculosis
SUMu012]
gi|323720839|gb|EGB29906.1| hypothetical protein TMMG_03419 [Mycobacterium tuberculosis
CDC1551A]
gi|326905066|gb|EGE51999.1| hypothetical protein TBPG_02993 [Mycobacterium tuberculosis W-148]
gi|328457436|gb|AEB02859.1| conserved hypothetical protein [Mycobacterium tuberculosis KZN
4207]
gi|339293739|gb|AEJ45850.1| hypothetical protein CCDC5079_0660 [Mycobacterium tuberculosis
CCDC5079]
gi|339297378|gb|AEJ49488.1| hypothetical protein CCDC5180_0651 [Mycobacterium tuberculosis
CCDC5180]
gi|339330136|emb|CCC25793.1| conserved hypothetical protein [Mycobacterium africanum GM041182]
gi|341600650|emb|CCC63320.1| conserved hypothetical protein [Mycobacterium bovis BCG str. Moreau
RDJ]
gi|344218537|gb|AEM99167.1| hypothetical protein MTCTRI2_0728 [Mycobacterium tuberculosis
CTRI-2]
gi|356592789|gb|AET18018.1| Hypothetical protein BCGMEX_0733 [Mycobacterium bovis BCG str.
Mexico]
gi|358230996|dbj|GAA44488.1| hypothetical protein NCGM2209_1095 [Mycobacterium tuberculosis
NCGM2209]
gi|378543983|emb|CCE36255.1| unnamed protein product [Mycobacterium tuberculosis UT205]
gi|379026863|dbj|BAL64596.1| hypothetical protein ERDMAN_0784 [Mycobacterium tuberculosis str.
Erdman = ATCC 35801]
gi|380720566|gb|AFE15675.1| hypothetical protein MRGA327_04455 [Mycobacterium tuberculosis
RGTB327]
gi|392052550|gb|AFM48108.1| hypothetical protein TBXG_000719 [Mycobacterium tuberculosis KZN
605]
gi|395137426|gb|AFN48585.1| hypothetical protein RVBD_0712 [Mycobacterium tuberculosis H37Rv]
gi|440580176|emb|CCG10579.1| hypothetical protein MT7199_0730 [Mycobacterium tuberculosis
7199-99]
gi|444894202|emb|CCP43456.1| Conserved protein [Mycobacterium tuberculosis H37Rv]
gi|449031242|gb|AGE66669.1| hypothetical protein K60_007590 [Mycobacterium bovis BCG str. Korea
1168P]
Length = 299
Score = 201 bits (511), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 169/337 (50%), Gaps = 53/337 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ + E P VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +PL D + G+D+ +V V W
Sbjct: 61 E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G DS I R HPVV V++ DAVAY
Sbjct: 97 R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + WG+ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
PV + N FGL +M+GNVWEWT + HH P P T D + KG
Sbjct: 199 PVGRFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKG 258
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GS+LC +YC+R+R AARS + D++ ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295
>gi|399042617|ref|ZP_10737294.1| hypothetical protein PMI09_04863 [Rhizobium sp. CF122]
gi|398058956|gb|EJL50826.1| hypothetical protein PMI09_04863 [Rhizobium sp. CF122]
Length = 318
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 173/344 (50%), Gaps = 65/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+V +PG TF MG+N+ E P+ VT+D F++D V+N QF FV ATGYVT AE
Sbjct: 17 LVWVPGRTFTMGSNEHY---PEEAPAHPVTVDGFWIDVTPVTNRQFAAFVKATGYVTVAE 73
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D + L + M R L T + P + W+
Sbjct: 74 RTPDAADYPGALPK------------MLRAGSLVFTQPRTIVGPDISQWWS--------- 112
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+++G W HP G S I +++HPVVHV++ DA AY W
Sbjct: 113 ---------FKFGA------------DWRHPYGGRSDIRGKLDHPVVHVAYGDARAYADW 151
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LP+EAEWE RGGL+ F WG+ L P G+ AN WQG FPT ++ + G T+P
Sbjct: 152 AGKALPSEAEWELAARGGLDEVEFAWGDELVPDGKLMANTWQGTFPTQSSKSAGADRTSP 211
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
V ++ N +G+++M+GNVWEWT+D+W+ HP P SY+P+ P
Sbjct: 212 VGAFPANGYGIHDMIGNVWEWTSDFWSA-RHPEPAAKSCCIPQNPRGGDRETSYDPRQPE 270
Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 271 IRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 314
>gi|433633745|ref|YP_007267372.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
140070017]
gi|432165338|emb|CCK62813.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
140070017]
Length = 299
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 169/337 (50%), Gaps = 53/337 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ E P VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTS---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTIA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +PL D + G+D+ +V V W
Sbjct: 61 E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G DS I R HPVV V++ DAVAY
Sbjct: 97 R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + WG+ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
PV ++ N FGL +M+GNVWEWT + HH P P T D + KG
Sbjct: 199 PVGTFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKG 258
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GS+LC +YC+R+R AARS + D++ ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295
>gi|270290491|ref|ZP_06196716.1| sulfatase modifying factor 1 [Pediococcus acidilactici 7_4]
gi|270281272|gb|EFA27105.1| sulfatase modifying factor 1 [Pediococcus acidilactici 7_4]
Length = 283
Score = 201 bits (511), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 170/335 (50%), Gaps = 61/335 (18%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D + G+TF+MGTNK +D E P VT+ F + + V+N +F +FV T Y T A
Sbjct: 2 DYQKISGETFKMGTNKGGFPEDFEGPQELVTVTNFEIAKTAVTNREFGQFVKETNYKTVA 61
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E G++FVF+ + + AK SQ H
Sbjct: 62 ETKGNSFVFQAFIPQ---AKQSQYLH---------------------------------- 84
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
PT A W + P SW HP G DS IE ++HPVVHV DA+A+C
Sbjct: 85 -----PTGAPWWL---------VVPGASWRHPFGPDSDIEDILDHPVVHVGLEDAIAFCE 130
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W GA+LPTEA+WE R G +PWG+ L ++ AN WQG+FP N A DG++ TA
Sbjct: 131 WAGAKLPTEAQWECAARSGSTTE-YPWGDELVVDKKYHANTWQGKFPWENDALDGFVGTA 189
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKG-PTTGTDKVK-----KGGS 307
PV S+ N GLY M+GNVWEW N + P +N + +DK+K +GGS
Sbjct: 190 PVKSFAPNANGLYQMIGNVWEWCR---NPRYVPLLDFNQRVYELKSSDKLKGEFAIRGGS 246
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 247 FLCHCSYCNRYRVAARNGADYQSTSSHLGFRCIKE 281
>gi|118473016|ref|YP_885834.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
gi|399985837|ref|YP_006566185.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
gi|118174303|gb|ABK75199.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
gi|399230397|gb|AFP37890.1| Sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
Length = 296
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 127/337 (37%), Positives = 174/337 (51%), Gaps = 54/337 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +PG +FRMG+ E P T+ F +++H V+N QF EFV ATGYVT A
Sbjct: 4 ELVEVPGGSFRMGSTS---FYPEEAPVHTATVGDFAIERHPVTNAQFAEFVEATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ P + E + S V F ++ R W
Sbjct: 61 ERPLDPKLY-PGVPEADLVPGSLV------FRPTSGPVDLR-----------------DW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G R +HPVV V++ DAVAY T
Sbjct: 97 R-------QWWDWA----------PGACWHHPFGPRREF-CRPDHPVVQVAYPDAVAYAT 138
Query: 194 WRGARLPTEAEWEYGCRGGLENR---LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
W G RLPTEAEWEY RGG + L+ WG+ + P G+ AN WQG+FP N A G+
Sbjct: 139 WAGRRLPTEAEWEYAARGGPKGGVGFLYAWGDEVCPDGQLMANTWQGKFPYRNDGALGWK 198
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKK 304
T+PV ++ N+ GL +M+GNVWEWTA ++ HH P + P G G ++V K
Sbjct: 199 GTSPVGTFPPNRLGLVDMIGNVWEWTATKFSAHHRPGDESHVTCCPPPSGGDPGVNQVLK 258
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
GGS+LC +YC+R+R AARS + DSS ++GFRC
Sbjct: 259 GGSHLCAPEYCHRYRPAARSPQSQDSSTTHIGFRCVV 295
>gi|159038683|ref|YP_001537936.1| hypothetical protein Sare_3137 [Salinispora arenicola CNS-205]
gi|157917518|gb|ABV98945.1| protein of unknown function DUF323 [Salinispora arenicola CNS-205]
Length = 311
Score = 201 bits (510), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 126/334 (37%), Positives = 163/334 (48%), Gaps = 51/334 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+DMV +PG FRMG+N E P+R V +D F++DQ++V+ F FV TG+VT
Sbjct: 19 RDMVWIPGGRFRMGSND---FYPEEGPARWVEVDGFWIDQYQVTVADFDRFVRDTGHVTM 75
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE E A D+ S + + PV S D + +
Sbjct: 76 AE------------VPPEAADYPDADPDLLV---PGSLVFRKAPGPV---SLADYRNWWS 117
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W P +W HPEG STIE R HPV HV+++DAVA+
Sbjct: 118 WT-----------------------PGATWRHPEGPGSTIEGRDLHPVTHVAYSDAVAFA 154
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
TW G LPTE+EWE RGGL + +F WG+ P G AN WQG FP N DGY T
Sbjct: 155 TWAGKELPTESEWELAARGGLADAVFTWGDEFAPEGRMMANTWQGAFPWQNLLLDGYAGT 214
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK-------VKKG 305
+PV ++ N + LY+M GNVWEWT D + P+ P T + V KG
Sbjct: 215 SPVGAFPPNGYSLYDMAGNVWEWTRDLFTYPAGPSAKACCTTPVTIDEPGSAIPRHVIKG 274
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
GS+LC YC R+R AAR D+S +LGFRC
Sbjct: 275 GSHLCAPNYCMRYRPAARQGEAIDTSTCHLGFRC 308
>gi|241666809|ref|YP_002984893.1| hypothetical protein Rleg_6898 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240862266|gb|ACS59931.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 301
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 129/343 (37%), Positives = 169/343 (49%), Gaps = 60/343 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG+++ E P+ V +D F++D V+N F FV ATGYVT AE
Sbjct: 1 MSWIPGGDFLMGSDRHYA---EEAPAHRVRVDGFWMDTCPVTNWAFSAFVEATGYVTTAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + + P E A S V K L + H +W
Sbjct: 58 RPANPADY-PGAIMEMLAPASVVFIKPKSPVDLTN-----------HYNW---------- 95
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W+Y RG +W HP G STI+ + HPVVHV++ DA AY W
Sbjct: 96 ---------WKY-VRGA---------NWRHPRGPASTIKGLLEHPVVHVAFEDAKAYAAW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGLE + WG + P G H AN WQGEFP N+ DG+ T+P
Sbjct: 137 AGKELPTEAEWEFAARGGLEAADYVWGGEMLPGGRHMANTWQGEFPLQNSREDGFEGTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAPSYNPKGPTTGT 299
V S+ N +GL++M GNVWEWT DW+ H + S++P+ P
Sbjct: 197 VGSFPANGYGLFDMAGNVWEWTTDWYQEHGMIDSPCCTISNPVGAELSNSHDPRQPQISI 256
Query: 300 D-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KV KGGS+LC YC R+R AAR D++ +LGFRC A
Sbjct: 257 PRKVMKGGSHLCAPNYCRRYRPAARMAQPVDTATCHLGFRCIA 299
>gi|340625733|ref|YP_004744185.1| hypothetical protein MCAN_07171 [Mycobacterium canettii CIPT
140010059]
gi|340003923|emb|CCC43057.1| conserved hypothetical protein [Mycobacterium canettii CIPT
140010059]
Length = 299
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 169/337 (50%), Gaps = 53/337 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ E P VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTS---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +PL D + G+D+ +V V W
Sbjct: 61 E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G DS I R HPVV V++ DAVAY
Sbjct: 97 R-------QWWDW----------VPGACWRHPFGRDSDIAGRAGHPVVQVAYPDAVAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + WG+ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTAPYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
PV ++ N FGL +M+GNVWEWT + HH P P T D + KG
Sbjct: 199 PVGTFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKG 258
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GS+LC +YC+R+R AARS + D++ ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295
>gi|54400462|ref|NP_001005980.1| sulfatase-modifying factor 2 precursor [Danio rerio]
gi|53734676|gb|AAH83430.1| Sulfatase modifying factor 2 [Danio rerio]
gi|157423577|gb|AAI53568.1| Sulfatase modifying factor 2 [Danio rerio]
gi|182891192|gb|AAI64065.1| Sumf2 protein [Danio rerio]
Length = 299
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 126/345 (36%), Positives = 173/345 (50%), Gaps = 71/345 (20%)
Query: 1 MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
++LL +MV +PG MGT+ +DGE P+R V L F +D++ V+N+ F
Sbjct: 13 LILLCTVSCAADDEMVFIPGGKMLMGTSAAD-GRDGESPTRAVALQPFKIDKYPVTNSNF 71
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
+EFV Y TEAE FG +FVF+ +SEE ++K++Q
Sbjct: 72 REFVRLQKYKTEAETFGWSFVFQDFVSEELKSKVTQ------------------------ 107
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
+ W LP E ++F W P G S I+ R++ PV
Sbjct: 108 ------KIESAPWW---LPVE-------------KVF----WRQPAGPGSGIKDRLDCPV 141
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
V VSWNDA AYC W+ RLPTE EWE RGGLE R +PWGN +R N+WQG FP
Sbjct: 142 VQVSWNDAQAYCQWKKKRLPTEEEWEMAARGGLEGRSYPWGNKYL---LNRTNLWQGPFP 198
Query: 241 TNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
++A DGY APV +Y +N +GLY+M+GNVWEWT+ P +
Sbjct: 199 DKDSAEDGYHGVAPVTAYPPQNNYGLYDMLGNVWEWTSS--------------SFPGAQS 244
Query: 300 DKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V +G S++ + +R R R NT DS++ NLGFRCA +
Sbjct: 245 MFVLRGASWIDTADGSANHRARVTTRMGNTADSASDNLGFRCAMN 289
>gi|433625806|ref|YP_007259435.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
140060008]
gi|433629796|ref|YP_007263424.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
140070010]
gi|432153412|emb|CCK50634.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
140060008]
gi|432161389|emb|CCK58731.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
140070010]
Length = 299
Score = 201 bits (510), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 131/337 (38%), Positives = 169/337 (50%), Gaps = 53/337 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ E P VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTS---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +PL D + G+D+ +V V W
Sbjct: 61 E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G DS I R HPVV V++ DAVAY
Sbjct: 97 R-------QWWDW----------VPGACWRHPFGRDSDIAGRAGHPVVQVAYPDAVAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + WG+ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGGTTTP-YAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
PV ++ N FGL +M+GNVWEWT + HH P P T D + KG
Sbjct: 199 PVGTFPANGFGLLDMIGNVWEWTTTEFYPHHRIDPPSTACCAPVKLATAADPTISQTLKG 258
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GS+LC +YC+R+R AARS + D++ ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295
>gi|221640916|ref|YP_002527178.1| hypothetical protein RSKD131_2817 [Rhodobacter sphaeroides KD131]
gi|221161697|gb|ACM02677.1| Hypothetical Protein RSKD131_2817 [Rhodobacter sphaeroides KD131]
Length = 297
Score = 201 bits (510), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 125/334 (37%), Positives = 167/334 (50%), Gaps = 53/334 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ LPG F MG++ E P +++ F++D V+N + FVSATGYVT AE
Sbjct: 1 MIWLPGAVFYMGSDS---HYPEEAPVHLASVEGFWIDVFPVTNADYARFVSATGYVTVAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
R+ ++ D + ++ R V D + W
Sbjct: 58 ----------------RSPGAEDYPDADPALLVPGSLVFRRPASV-----EDPGDWTRW- 95
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y P W HP+G + R++HPVVHV DA AY W
Sbjct: 96 ---------WSYQ----------PGACWRHPDGPGRDVAGRLDHPVVHVCHEDAQAYAHW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWEY RGGL+ F G+ L P G HRAN+WQG+FP N A DG+ T+P
Sbjct: 137 AGKDLPTEAEWEYAARGGLDRAEFARGDVLVPEGRHRANIWQGQFPVENLAEDGFEGTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPS-YNPKGPTTG------TDKVKKG 305
V ++ N +GLY+M+GNVWEWT+D + H AP P P +G +V KG
Sbjct: 197 VRAFPPNGYGLYDMIGNVWEWTSDGFRADHGGRSAPGCCAPISPASGGHVLPEARQVLKG 256
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
G +LC YC R+R AAR TPDS++G++GFRC
Sbjct: 257 GYHLCAPNYCRRYRPAARQGQTPDSASGHIGFRC 290
>gi|375091008|ref|ZP_09737314.1| hypothetical protein HMPREF9709_00176 [Helcococcus kunzii ATCC
51366]
gi|374564799|gb|EHR36080.1| hypothetical protein HMPREF9709_00176 [Helcococcus kunzii ATCC
51366]
Length = 289
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 116/337 (34%), Positives = 166/337 (49%), Gaps = 67/337 (19%)
Query: 14 DMVLLPGDTFRMGTNK----PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+ V +P TF+MG P+ D E P +V+++ F + ++N F+EFV T Y
Sbjct: 5 NFVNIPSGTFKMGATTDEGHPL---DKETPQVSVSINEFQISDTTITNEHFKEFVDKTNY 61
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
T AE +G +FVF LL EE + +V +
Sbjct: 62 TTTAETYGGSFVFHLLLDEETKKNSLKVERTL---------------------------- 93
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W Y +G SW HP G DS I+ ++HPVVH+S+ DA+
Sbjct: 94 --------------WWYYVKGA---------SWKHPFGPDSNIDDILDHPVVHISYFDAL 130
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
A+C W RLP+EAEWEY R G E + +PWG+ ++ AN++QG+FP NT DG+
Sbjct: 131 AFCEWGNYRLPSEAEWEYAARAGSEYK-WPWGDTFLQDEKYMANIFQGDFPHTNTEKDGF 189
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTAD--------WWNVHHHPAPSYNPKGPTTGTDK 301
L T P + NKFGLYN++GNVWEW + + V + P YN + +
Sbjct: 190 LGTCPAKYFSPNKFGLYNVIGNVWEWCMNPRYLDLETFKTVDYKIYPPYNLENVNDNSQF 249
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+GGS+LC+ YC R+R AR+ N+ S+A N GFR
Sbjct: 250 ATRGGSFLCHPSYCNRYRLGARNGNSASSTASNCGFR 286
>gi|91783680|ref|YP_558886.1| hypothetical protein Bxe_A2134 [Burkholderia xenovorans LB400]
gi|91687634|gb|ABE30834.1| Conserved hypothetical protein [Burkholderia xenovorans LB400]
Length = 384
Score = 200 bits (509), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 123/352 (34%), Positives = 170/352 (48%), Gaps = 58/352 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG++ + E P+ V + F++D+H V+N +F+ FV ATGYVT AE
Sbjct: 44 MAWVPGGQFLMGSDA-KQAQPNERPAHKVKVHGFWMDRHHVTNAEFRRFVEATGYVTTAE 102
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC-TW 133
K D +E L + P A+ + T
Sbjct: 103 KKPD-------------------------WETLKVQLPPGTPRPPESAMVAGAMVFVGTS 137
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R L ++W R P +W HP G +S I + +HPVV VS+ DA AY
Sbjct: 138 RPVPLDDYSQWW---------RYVPGANWRHPAGPESNIIGKDDHPVVQVSYEDAQAYAK 188
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG+ +P G+ ANVWQG+ FP N A G
Sbjct: 189 WAGKRLPTEAEWEFAARGGLEQATYAWGDQFSPNGKQMANVWQGQQPQSFPVVNPKAGGA 248
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
L T+PV ++ N +GL +M GN W+W ADW+ P+ S++P
Sbjct: 249 LGTSPVGTFPANGYGLSDMTGNAWQWVADWYRADQFRREAVSTSAIDNPVGPSESWDPAD 308
Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
P +V +GGS+LCNE YC +R +AR P +S +LGFR D+
Sbjct: 309 QGVPVNAPKRVTRGGSFLCNEIYCLSYRPSARRGTDPYNSMSHLGFRLVMDE 360
>gi|312139220|ref|YP_004006556.1| hypothetical protein REQ_18030 [Rhodococcus equi 103S]
gi|311888559|emb|CBH47871.1| conserved hypothetical protein [Rhodococcus equi 103S]
Length = 324
Score = 200 bits (509), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 126/356 (35%), Positives = 174/356 (48%), Gaps = 63/356 (17%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P K+MV +PG TF MG+ E P V +D F++D H+V+ +F+ FV AT
Sbjct: 11 PSRAPKNMVWVPGGTFWMGSED---FYPEERPVHEVGVDGFWMDTHQVTVAEFRRFVKAT 67
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G+VT AE D + P S V P +S +D
Sbjct: 68 GHVTTAETAPDPADY-PDADPALLVPGSLV-----------------FTPPAQPVSLDDY 109
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ +W P W HPEG S + R HPV HVS++D
Sbjct: 110 QQWWSW-----------------------VPGADWRHPEGPGSNVGGRERHPVTHVSYSD 146
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G LPTEAEWE+ RGGL+ + F WG+ P G H+AN WQG+FP N A D
Sbjct: 147 AQAYAAWAGKTLPTEAEWEFAARGGLDRKTFVWGDEFAPDGRHQANTWQGQFPWQNLAED 206
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGT 299
G++ T+PV +++ N +GL ++ GNVWEWT D+ H + AP+ + P+ P
Sbjct: 207 GFVGTSPVGTFRPNGYGLVDVAGNVWEWTTDFHTADHGASGKNVAPASSCCIPRNPRVEI 266
Query: 300 DKVK----------KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+V+ KGGS+LC YC R+R AAR T ++S ++GFRC +GP
Sbjct: 267 GEVQAGEAYPRRVIKGGSHLCAPNYCLRYRPAARQGETEETSTCHIGFRCIV-RGP 321
>gi|424878448|ref|ZP_18302088.1| hypothetical protein Rleg8DRAFT_6322 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392520940|gb|EIW45669.1| hypothetical protein Rleg8DRAFT_6322 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 315
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 169/343 (49%), Gaps = 60/343 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG+++ E P+ V +D F++D V+N F FV A+GYVT AE
Sbjct: 15 MSWIPGGDFLMGSDRHYA---EEAPAHRVRVDGFWMDTCPVTNWAFSAFVEASGYVTTAE 71
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + + P E A S V K L + H +W
Sbjct: 72 RPANPADY-PGAIMEMLAPASVVFIKPKSPVDLTN-----------HYNW---------- 109
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W+Y RG +W HP G STI+ + HPVVHV++ DA AY W
Sbjct: 110 ---------WKY-VRGA---------NWRHPRGPASTIKGLLEHPVVHVAFEDAKAYAAW 150
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGLE + WG + P G H AN WQGEFP N+ DG+ T+P
Sbjct: 151 AGKELPTEAEWEFAARGGLEAADYVWGGEMLPGGRHMANTWQGEFPVRNSREDGFEGTSP 210
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAPSYNPKGPTTGT 299
V S+ N +GL++M GNVWEWT DW+ H + S++P+ P
Sbjct: 211 VGSFPANGYGLFDMAGNVWEWTTDWYQEHGMIDSPCCTISNPVGAELSNSHDPRQPQISI 270
Query: 300 D-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KV KGGS+LC YC R+R AAR D++ +LGFRC A
Sbjct: 271 PRKVMKGGSHLCAPNYCRRYRPAARMAQPVDTATCHLGFRCIA 313
>gi|359772684|ref|ZP_09276103.1| hypothetical protein GOEFS_069_00070 [Gordonia effusa NBRC 100432]
gi|359310147|dbj|GAB18881.1| hypothetical protein GOEFS_069_00070 [Gordonia effusa NBRC 100432]
Length = 294
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 168/343 (48%), Gaps = 64/343 (18%)
Query: 16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
+ +PG + MG+ + E P R V + +F +D ++N QF EFV+ATGYVT AE
Sbjct: 1 MTIPGGEYLMGSTR---HYPEERPVRPVQVSSFRIDARAITNAQFAEFVAATGYVTLAE- 56
Query: 76 FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR----MHHPVVHISWNDAVAYC 131
+PL D R+ G D + H + +D +
Sbjct: 57 -------QPL--------------DPSRYPGADPELLKPGGLVFHQTPEPVELHDLTQW- 94
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W Y P +W P G S++ +HPVV V DA AY
Sbjct: 95 ------------WAY----------VPGANWRAPHGPQSSLHGLDDHPVVQVCHADAAAY 132
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
WRG RLPTEAEWE+ RGGL ++ + WG + +R N W+GEFP NTA+DG+ S
Sbjct: 133 ARWRGVRLPTEAEWEFAARGGLASQPYTWGADADDDPPYRVNRWRGEFPWQNTASDGFES 192
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD----------- 300
TAPV S++ N +GLY+M GNVWEWT WW HHH + GP
Sbjct: 193 TAPVGSFEPNGYGLYDMAGNVWEWTDTWWT-HHHAVETSPCCGPDVAMQRSMGPGDDVGR 251
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+V KGGS+LC YC R+R AAR DS+ +LGFRCAAD+
Sbjct: 252 RVIKGGSHLCAPNYCLRYRPAARQPEAIDSATCHLGFRCAADE 294
>gi|300024443|ref|YP_003757054.1| hypothetical protein Hden_2938 [Hyphomicrobium denitrificans ATCC
51888]
gi|299526264|gb|ADJ24733.1| protein of unknown function DUF323 [Hyphomicrobium denitrificans
ATCC 51888]
Length = 333
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 128/347 (36%), Positives = 177/347 (51%), Gaps = 64/347 (18%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ MV +PG TFRMG++ E P+ V++D+F++D+ ++N +F+ FV AT YVT
Sbjct: 24 HSGMVFVPGGTFRMGSDHHY---PEEAPAHRVSVDSFWIDEAPITNAEFRRFVDATNYVT 80
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AE P S+ A +R F+ I+ R +W
Sbjct: 81 FAE-------LTPNASDYPGALPHMLRAGSLVFKPPAGEIDLR--------NWGQ----- 120
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W WE+ R G +W P G S+I+ HPVVHV++ DA AY
Sbjct: 121 -W----------WEF--RFG--------ANWRLPYGPGSSIKGLDEHPVVHVAYCDAEAY 159
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWE+ RGG+E F WG+ LTP G++ AN WQG FP N DGY
Sbjct: 160 AHWAGKELPTEAEWEFAARGGIEGAEFAWGDELTPGGKYMANTWQGIFPRENLKLDGYER 219
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-------NPKGPTTGTD---- 300
T+P+ ++ N +G+++M+GNVWEWT+DW+ HPA + NP+GP
Sbjct: 220 TSPIKAFPPNGYGIHDMIGNVWEWTSDWY-APKHPADAQKACCIPKNPRGPGQDQSLDPY 278
Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 279 QPQIRIPRKVLKGGSHLCAPNYCRRYRPAARHAEPIDTSTSHVGFRC 325
>gi|385209534|ref|ZP_10036402.1| hypothetical protein BCh11DRAFT_06664 [Burkholderia sp. Ch1-1]
gi|385181872|gb|EIF31148.1| hypothetical protein BCh11DRAFT_06664 [Burkholderia sp. Ch1-1]
Length = 416
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 167/352 (47%), Gaps = 58/352 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG++ + E P+ V + F++D+H V+N +F+ FV ATGYVT AE
Sbjct: 76 MAWVPGGQFLMGSDA-KQAQPNERPAHKVKVHGFWMDRHHVTNAEFRRFVEATGYVTTAE 134
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
K D + L R S + F G + +D Y W
Sbjct: 135 KKPDWDTLKVQLPPGTPRPPESAMVAGAMVFVGTSRPVPL-----------DD---YSQW 180
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y P W HP G DS I + +HPVV VS+ DA AY
Sbjct: 181 ----------WRY----------VPGADWRHPTGPDSNIIGKDDHPVVQVSYEDAQAYAK 220
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG+ +P G+ ANVWQG+ FP N A G
Sbjct: 221 WAGKRLPTEAEWEFAARGGLEQATYAWGDQFSPDGKQMANVWQGQQPQSFPVVNPKAGGA 280
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
L T+P ++ N +GL +M GN W+W ADW+ P+ S++P
Sbjct: 281 LGTSPAGTFPANGYGLSDMTGNAWQWVADWYRADQFRREAVSTSAIDNPVGPSDSWDPAD 340
Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
P +V +GGS+LCNE YC +R +AR P +S +LGFR D+
Sbjct: 341 QGVPVNAPKRVTRGGSFLCNEIYCLSYRPSARRGTDPYNSMSHLGFRLVMDE 392
>gi|433645849|ref|YP_007290851.1| hypothetical protein Mycsm_01020 [Mycobacterium smegmatis JS623]
gi|433295626|gb|AGB21446.1| hypothetical protein Mycsm_01020 [Mycobacterium smegmatis JS623]
Length = 300
Score = 200 bits (508), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 127/381 (33%), Positives = 175/381 (45%), Gaps = 91/381 (23%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ + E P T+ AF +++H V+N Q+ EFV+ T YVT A
Sbjct: 4 ELVELPGGSFRMGSTQ---FYPEEAPLHTATVGAFAVERHPVTNAQYAEFVAETDYVTIA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ HD+ + + PV W A+
Sbjct: 61 EQPPDPALYP-----------GAAPHDLVP----GALVFQPSQGPVDLRDWRQWWAW--- 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P SW HP G S IE R++HPVV V++ DA AY
Sbjct: 103 -----------------------LPGASWRHPFGAGSGIEDRLDHPVVQVAYPDAAAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG ++PWG++ TP G+ AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYASRGGAST-VYPWGDDATPGGQLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV ++ N FGL +M+GNVWEWT ++ HH GP
Sbjct: 199 PVGAFPPNGFGLVDMIGNVWEWTTTQFSAHH------RLDGPIE---------------- 236
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 373
C P ++ KGGS+LC +YC+R+R AAR
Sbjct: 237 ------------------------SCCGPSSPDPTVNQTLKGGSHLCAPEYCHRYRPAAR 272
Query: 374 SQNTPDSSAGNLGFRCAADVS 394
S + DS+ ++GFRC AD+S
Sbjct: 273 SPQSQDSATTHIGFRCVADLS 293
>gi|254504616|ref|ZP_05116767.1| conserved domain protein [Labrenzia alexandrii DFL-11]
gi|222440687|gb|EEE47366.1| conserved domain protein [Labrenzia alexandrii DFL-11]
Length = 307
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 126/336 (37%), Positives = 161/336 (47%), Gaps = 55/336 (16%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
L G TF MG+ + E P R VTL F + + EV+N +F FV TGYVT AEK
Sbjct: 13 LEGGTFAMGSGEHYRE---EAPVRQVTLQPFSIQKTEVTNAKFAAFVEETGYVTTAEKDL 69
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
D + P L + S V K L+ +++ W
Sbjct: 70 DQSDY-PGLPPDLLKAGSMVFAQPKDAVALND----------INLWW------------- 105
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
R P W HP G +S+IE +HPVV VS DA AY W G
Sbjct: 106 -----------------RYVPGAQWRHPRGPESSIEGLEDHPVVQVSIEDAKAYAAWAGG 148
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
RLPTEAEWE+ RGGL+ F WG + P +AN WQG FP+ +TA DG+ +APV S
Sbjct: 149 RLPTEAEWEFAARGGLDGAQFSWGEDYDPSEGWKANTWQGVFPSIDTADDGHHGSAPVGS 208
Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-----------TGTDKVKKGG 306
Y N +GLY+M GNVWE+ +DWW H NP GP+ G V KGG
Sbjct: 209 YASNGYGLYDMAGNVWEYVSDWWIPGHPTKDQTNPGGPSKTLAEQFANSAVGARHVVKGG 268
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
S+LC YC+R+R +AR + ++GFR A D
Sbjct: 269 SWLCAPSYCFRYRPSARQPAERSLGSNHIGFRIARD 304
>gi|339522051|gb|AEJ84190.1| sulfatase-modifying factor 2 [Capra hircus]
Length = 301
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 130/338 (38%), Positives = 169/338 (50%), Gaps = 70/338 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV LPG F+MGTN P +DGE P R V + F +D V+N F+EFV Y TEA
Sbjct: 29 NMVQLPGGRFQMGTNSPD-GRDGEGPVREVMVKPFAIDIFPVTNKDFREFVREKKYRTEA 87
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE L+S+E R K Q RM + W W
Sbjct: 88 EVFGWSFVFEDLVSDELRNKAPQ-----------------RMQS----LLW--------W 118
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I ++ PVVHVSWNDA A C
Sbjct: 119 ----LPME-------------RAF----WRQPAGPGSGIREKLEFPVVHVSWNDARACCA 157
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++P GN P +R N+WQG+FP + A DG+ +
Sbjct: 158 WRGKRLPTEEEWEFAARGGLKGQVYPGGNKFQP---NRTNLWQGKFPKGDKAEDGFHGVS 214
Query: 254 PVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--C 310
PV ++ +N +GLY+++GNVWEWTA +V +G S++
Sbjct: 215 PVNAFPPQNDYGLYDLMGNVWEWTASQSQAADQDM-------------RVLRGASWVDTA 261
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ R R R NTPDS++ LGFRCA+ G G
Sbjct: 262 DGSANPRARVTTRMGNTPDSASDTLGFRCASSAGRPPG 299
>gi|404442113|ref|ZP_11007294.1| sulfatase-modifying factor 1 [Mycobacterium vaccae ATCC 25954]
gi|403657384|gb|EJZ12157.1| sulfatase-modifying factor 1 [Mycobacterium vaccae ATCC 25954]
Length = 292
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 124/336 (36%), Positives = 171/336 (50%), Gaps = 51/336 (15%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ ++V LPG +FRMG+ + E P V++ F +++H V+N QF EFVS TGYVT
Sbjct: 2 FTELVELPGGSFRMGS---VDFYPEEAPVHTVSVAPFAIERHPVTNAQFAEFVSDTGYVT 58
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AE+ D F P + + + V F ++ R D +
Sbjct: 59 VAERELDPADF-PGVPPADLVPGALV------FRPTPGPVDLR-----------DWRQWW 100
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
TW P W HP G S++E R++HPVV V++ DA AY
Sbjct: 101 TW-----------------------VPGACWRHPFGPASSLEDRLDHPVVQVAYPDAAAY 137
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLP+EA+WEY R G + WG+++ P G+ AN WQG FP N A G+
Sbjct: 138 ARWAGRRLPSEAQWEYAARAGT-TAAYAWGDDVRPDGQLMANTWQGRFPYRNEGALGWSG 196
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTTGTDKVK---KG 305
T+PV ++ N FGL +M+GNVWEWT + HH PA P P G V KG
Sbjct: 197 TSPVGAFPPNGFGLLDMIGNVWEWTTTPYTRHHRLDQPAQGCCPTPPPAGDPTVHQALKG 256
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
GS+LC +YC+R+R AARS + DS+ ++GFRC A
Sbjct: 257 GSHLCAPEYCHRYRPAARSSQSQDSATTHIGFRCIA 292
>gi|296447678|ref|ZP_06889596.1| protein of unknown function DUF323 [Methylosinus trichosporium
OB3b]
gi|296254819|gb|EFH01928.1| protein of unknown function DUF323 [Methylosinus trichosporium
OB3b]
Length = 354
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/331 (37%), Positives = 165/331 (49%), Gaps = 52/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M + G F MG+ E P+ V + F++D+HEV+N QF FV+ATG+ T
Sbjct: 53 MSWVAGGAFLMGSED---FYPEERPAHRVFVGGFWIDRHEVTNAQFAAFVAATGWRT--- 106
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ D K GL + + + + N + +W
Sbjct: 107 -------------------VAETGLDPKDHPGLPAELTAPGSMVFIMPTTNPGRSPESW- 146
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y G W HP G ++I+ + PVVHV+W DA AY W
Sbjct: 147 ---------WRY-VHGA---------DWRHPGGPGTSIDGKDAEPVVHVAWRDARAYSDW 187
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWEY RGGLE R + WG+ + P G + AN WQG FP + A+DG+ AP
Sbjct: 188 LGRDLPTEAEWEYAARGGLEGRPYAWGSEVRPDGRYMANSWQGAFPLFDEASDGFHGLAP 247
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT--TGTD-----KVKKGGS 307
V Y+ N GL++M GNVWEWT DWW H S +P+GPT G D +V KGGS
Sbjct: 248 VGCYEPNGRGLFDMTGNVWEWTGDWWAPGHAAEASADPQGPTRNPGGDEALAARVIKGGS 307
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+LC+ YC R+R AAR + D A +LGFR
Sbjct: 308 WLCSSNYCGRYRPAARQPHEIDLGASHLGFR 338
>gi|222081471|ref|YP_002540834.1| hypothetical protein Arad_7818 [Agrobacterium radiobacter K84]
gi|221726150|gb|ACM29239.1| conserved hypothetical protein [Agrobacterium radiobacter K84]
Length = 322
Score = 199 bits (507), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/354 (36%), Positives = 176/354 (49%), Gaps = 65/354 (18%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+R + M +PG TF MG+N E P+ V +D F++++ V+N +F FV ATG
Sbjct: 12 ADRPEGMAWIPGGTFIMGSNSHY---PEEAPAHPVKVDGFWIEETPVTNRKFMAFVKATG 68
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
YVT AEK D + L E RA S V K G
Sbjct: 69 YVTLAEKAPDPEQYPGALPEMLRAG-SLVFTQPKSVSG---------------------- 105
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWND 187
P ++W +F +G+ W P G S + +++HPVVHV+W D
Sbjct: 106 ----------PNISQW----------WIFKFGANWRRPLGGLSDLRGKLDHPVVHVAWCD 145
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G LPTEAEWE RGGL++ + WG+ P G AN W G FPT++ A
Sbjct: 146 AKAYADWAGLELPTEAEWELAARGGLDDVEYAWGSEFQPNGVVMANTWAGTFPTHSLKAK 205
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PA----------------PSY 290
G+ T+PV ++ N++GLY+M+GNVWEWTAD+W+ H PA S+
Sbjct: 206 GHERTSPVRAFPPNRYGLYDMIGNVWEWTADYWSPRHEKPALHACCIPTNPRGGDQEASF 265
Query: 291 NPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+P+ P +V KGGS+LC YC R+R AAR D+S ++GFRC K
Sbjct: 266 DPRQPEILIHRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRCVKRK 319
>gi|254461308|ref|ZP_05074724.1| sulfatase-modifying factor 1 [Rhodobacterales bacterium HTCC2083]
gi|206677897|gb|EDZ42384.1| sulfatase-modifying factor 1 [Rhodobacteraceae bacterium HTCC2083]
Length = 313
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 167/343 (48%), Gaps = 58/343 (16%)
Query: 3 LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
++ A V+ + D V +P GT P + DGE P R+ + F V+N QF+
Sbjct: 24 IIAARTVDDF-DTVAIPAGRSVYGTATPQIPNDGEGPMRSKRVRGFTCGVTTVTNAQFKA 82
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
F+ ATGYVTEAE FG +FVF +++ E V +
Sbjct: 83 FIDATGYVTEAEDFGWSFVFYSDIAK-----------------------EIGPTQAVPGV 119
Query: 123 SWNDAVAYCTWRGARLPT--EAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
W V WR P A W+ +HPV
Sbjct: 120 EWWHRVDGANWRDINGPYTYAAAWD------------------------------PDHPV 149
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL-TPRGEHRANVWQGEF 239
V +SW DA AY W G RLPTEAEWE+ RGGL + FPWG+ +G N+WQG+F
Sbjct: 150 VQMSWRDARAYAAWVGGRLPTEAEWEHAARGGLGDVKFPWGDQEPEEKGYTPCNIWQGDF 209
Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
P NT ADGY TAP S++ N +GL+NMVGNVWEWT+D ++V G
Sbjct: 210 PNTNTCADGYQRTAPARSFEPNTYGLFNMVGNVWEWTSDAYHVQSLKKDVKRRLVQMKGF 269
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
K+ KGGS+LC++ YCYR+R AARS +PDS+ + GFR D
Sbjct: 270 -KLSKGGSFLCHKSYCYRYRIAARSGTSPDSATPHHGFRVVWD 311
>gi|332285115|ref|YP_004417026.1| hypothetical protein PT7_1862 [Pusillimonas sp. T7-7]
gi|330429068|gb|AEC20402.1| hypothetical protein PT7_1862 [Pusillimonas sp. T7-7]
Length = 381
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 170/350 (48%), Gaps = 59/350 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M +PGD F MG N L + E P+ + + F++D H+V+N +F+ FV ATGYVT A
Sbjct: 49 NMAWVPGDEFLMG-NSHKLSQPNERPAHQIRVSGFWMDVHDVTNAEFRRFVEATGYVTTA 107
Query: 74 EKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
E+ +P L R S + F G S + R Y
Sbjct: 108 EQKPLWEDLKPQLPPGTPRPDDSLLVPGAMVFTGTASEVSLR--------------DYSR 153
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W R P +W HP+G S+I+ + +HPVV VS+ DA AY
Sbjct: 154 W----------W----------RFIPGANWRHPQGPGSSIQGKDDHPVVQVSYKDAQAYA 193
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADG 248
W G RLPTEAEWEY RGGLE + WGN L+P G+ AN+W Q FP
Sbjct: 194 KWAGKRLPTEAEWEYAARGGLEQATYSWGNELSPDGKPMANIWDTQQQQPFPVVKDEKI- 252
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPK 293
+ T PV S+ N +GL++M GNVW+WTADW+ PA S++P
Sbjct: 253 QVGTTPVGSFPANDYGLFDMAGNVWQWTADWYRADAFTIQAESKNMPVDPTGPADSFDPD 312
Query: 294 G---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P + +V +GGS+LC++ YC +R +AR P +S +LGFR
Sbjct: 313 DGNVPVSAPKRVTRGGSFLCSDTYCISYRTSARRGTDPLNSMSHLGFRTV 362
>gi|408369867|ref|ZP_11167647.1| sulfatase-modifying factor protein [Galbibacter sp. ck-I2-15]
gi|407744921|gb|EKF56488.1| sulfatase-modifying factor protein [Galbibacter sp. ck-I2-15]
Length = 376
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 121/350 (34%), Positives = 179/350 (51%), Gaps = 49/350 (14%)
Query: 15 MVLLPGDTFRMGTNK-PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV + G F G + D E P+ V +D F++D EV+N QF EFV TGY+T A
Sbjct: 59 MVWIAGGHFTQGARQGDSFALDHEKPAHPVAVDGFFMDITEVTNDQFSEFVKQTGYITIA 118
Query: 74 EKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
E+ D + L E + S ++ F+ ++++ + D +
Sbjct: 119 EREVDWEELKKQLPEGTPKPHDSILQPGSLVFKKTETSVPNLY----------DFSQWWV 168
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W+ +W HP+G S+IE + N+PVVH+++ DAVAYC
Sbjct: 169 WKIG-----------------------ANWKHPKGPGSSIEGKGNYPVVHIAFPDAVAYC 205
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G +LPTEA+WE G+ N+ F WG+ L + + +AN W GEFPT N DG+ +
Sbjct: 206 NWAGRKLPTEAQWELAASAGINNKPFTWGDKLD-QLDVKANTWTGEFPTQNDQVDGFENK 264
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYNPKG------PTT--GT 299
APV S+ N +GLY+M GNVWEWT+DW+++ + H ++NP+G PT
Sbjct: 265 APVKSFPPNPYGLYDMAGNVWEWTSDWYHIDYYKQLLHEGLAHNPEGAQKAFNPTNPYAQ 324
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
+K+ KGGS+LC++ YC +R +AR DS +LGFR GT
Sbjct: 325 EKIIKGGSFLCSDTYCASYRISARMATAIDSGQEHLGFRTIVTPEMLEGT 374
>gi|115358015|ref|YP_775153.1| hypothetical protein Bamb_3264 [Burkholderia ambifaria AMMD]
gi|115283303|gb|ABI88819.1| protein of unknown function DUF323 [Burkholderia ambifaria AMMD]
Length = 330
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 123/341 (36%), Positives = 167/341 (48%), Gaps = 54/341 (15%)
Query: 3 LLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
L PA ++++ G +GT+ P++ DGE P R V L F ++++ V+N +F E
Sbjct: 39 LAPANARLPAENIIEFDGGPSWVGTDAPVIKADGEGPRRKVHLKPFGIERYAVTNDRFTE 98
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV+ATGY T+AE+FG ++VF D+ +E
Sbjct: 99 FVAATGYRTDAERFGWSYVF-------------------------DAFLE---------- 123
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
D VA P W R P W PEG S+++ R HPV+H
Sbjct: 124 --PDCVA-------EAPANVPWW---------RSVPGAYWAAPEGPLSSLDGRGMHPVIH 165
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
+SWNDA A+ W G RLPTEAEWE+ RGG + R FPWG+ N+WQG FP
Sbjct: 166 ISWNDAAAFARWCGGRLPTEAEWEHAARGGADARRFPWGDEEPVEDALCCNIWQGRFPDR 225
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDK 301
+ D T V ++ N GLYN GNVWEW D ++V A + + +
Sbjct: 226 YSGGDRAFGTVRVDAFAANPAGLYNCSGNVWEWCVDPFHVRSLSSAGKARDRQAVSEREY 285
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGSYLC+ YCYR+R AAR+ D+SAG+ GFR A D
Sbjct: 286 TLKGGSYLCHRSYCYRYRIAARTGRPTDTSAGHTGFRVAYD 326
>gi|300771046|ref|ZP_07080923.1| sulfatase-modifying factor 1 [Sphingobacterium spiritivorum ATCC
33861]
gi|300762319|gb|EFK59138.1| sulfatase-modifying factor 1 [Sphingobacterium spiritivorum ATCC
33861]
Length = 363
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 120/339 (35%), Positives = 173/339 (51%), Gaps = 57/339 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G ++ MG+ +D + V + +F++D+HEV+N QF +FV ATGY+T AE
Sbjct: 69 MVKIEGGSYEMGSAD---FEDAK-QVHKVNIKSFWMDEHEVTNAQFAKFVQATGYLTVAE 124
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D F P + E S V H K GL+ ++ W
Sbjct: 125 RPLDPRDF-PGVDPEMLKPGSAVFHAPKEVRGLNDPLQ--------------------W- 162
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y +W HPEG +S I+ + N+PVVH+++ DA AY W
Sbjct: 163 ---------WSYVVGA----------NWKHPEGPESNIKGKENYPVVHIAYQDAEAYAKW 203
Query: 195 RGARLPTEAEWEYGCR-GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
G RLPTEAEWEY + N + WG T G+ AN++QGEFP +T DGY+ A
Sbjct: 204 AGKRLPTEAEWEYAAKTANYNNEKYYWGTEKTENGKWAANIYQGEFPLKDTGEDGYVGAA 263
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----------PAPSYNPKGPTTGTDKVK 303
PV SY N GLY+M GNVWEW +D++ ++ P SY+P+ P +V+
Sbjct: 264 PVKSYPANTSGLYDMEGNVWEWCSDFYRPDYYKNSSESNPTGPIDSYDPQEPGA-VKRVQ 322
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+GGS+LCN+ YC R++ +R + +S N+GFR D
Sbjct: 323 RGGSFLCNDMYCERYKAGSRGKGEVNSPTNNVGFRLVKD 361
>gi|378764400|ref|YP_005193016.1| putative sulfatase-modifying factor [Sinorhizobium fredii HH103]
gi|365184028|emb|CCF00877.1| putative sulfatase-modifying factor [Sinorhizobium fredii HH103]
Length = 329
Score = 199 bits (506), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 130/342 (38%), Positives = 170/342 (49%), Gaps = 60/342 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+ V +PG TF MG+++ E P+ V + F++D V+N +F FV ATGYVT A
Sbjct: 23 ETVWIPGGTFLMGSDRHY---PEEAPAHKVKVTGFWMDVCTVTNREFAAFVEATGYVTLA 79
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + P + A S V F + ++ R H+ +W
Sbjct: 80 ERPVDPKDY-PGAKPDMLAPASVV------FSKPRTRVDLRNHY-----NW--------- 118
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y RG +W HP G S+IE +HPVVHV++ DA AY T
Sbjct: 119 ----------WAY-LRGA---------NWRHPHGPGSSIERLHDHPVVHVAYEDAEAYAT 158
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W LPTEAEWE+ RGGL+ + WG+ P G AN WQGEFP N DGY +TA
Sbjct: 159 WARKELPTEAEWEFAARGGLDAAEYVWGDEWAPGGRPMANTWQGEFPLQNLCEDGYETTA 218
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKGPTTG 298
PV SY N +GLY+M GNVW+WT+DW+ H A S +P P
Sbjct: 219 PVASYPANGYGLYDMAGNVWQWTSDWYQDHGKIDSPCCTISNPRGGDQADSIDPLQPHLK 278
Query: 299 TD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGSYLC YC R+R AAR D+S ++GFRC
Sbjct: 279 IPRRVMKGGSYLCAANYCRRYRPAARMAQPVDTSTCHVGFRC 320
>gi|441204861|ref|ZP_20972317.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis MKD8]
gi|440629327|gb|ELQ91117.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis MKD8]
Length = 296
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 125/335 (37%), Positives = 174/335 (51%), Gaps = 54/335 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +PG +FRMG+ E P T++ F +++H V+N QF EF+ ATGY T A
Sbjct: 4 ELVEVPGGSFRMGSTS---FYPEEAPVHTATVEDFAIERHPVTNAQFAEFIEATGYATVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ P + E + S V F ++ R W
Sbjct: 61 ERPLDPKLY-PGVPEADLVPGSLV------FRPTSGPVDLR-----------------DW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G R +HPVV V++ DAVAY T
Sbjct: 97 R-------QWWDWA----------PGACWHHPFGPRREF-CRPDHPVVQVAYPDAVAYAT 138
Query: 194 WRGARLPTEAEWEYGCRGGLENR---LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
W G RLPTEAEWEY RGG + L+ WG+ + P G+ AN WQG+FP N A G+
Sbjct: 139 WAGRRLPTEAEWEYAARGGPKGGVGFLYAWGDEVCPDGQLMANTWQGKFPYRNDDALGWK 198
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKK 304
T+PV ++ N+ GL +M+GNVWEWTA ++ HH P + P G G ++V K
Sbjct: 199 GTSPVGTFPPNRLGLVDMIGNVWEWTATKFSAHHRPGDESHVTCCPPPSGGDPGVNQVLK 258
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
GGS+LC +YC+R+R AARS + DSS ++GFRC
Sbjct: 259 GGSHLCAPEYCHRYRPAARSPQSQDSSTTHIGFRC 293
>gi|433640835|ref|YP_007286594.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
140070008]
gi|432157383|emb|CCK54661.1| Conserved protein of unknown function [Mycobacterium canettii CIPT
140070008]
Length = 299
Score = 199 bits (505), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 130/337 (38%), Positives = 168/337 (49%), Gaps = 53/337 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ + E P VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +PL D + G+D+ +V V W
Sbjct: 61 E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G DS I R HPVV V++ DAVAY
Sbjct: 97 R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + WG+ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTD----KVKKG 305
PV + N FGL +M+GNVWEWT + H P P T D + KG
Sbjct: 199 PVGRFPANGFGLLDMIGNVWEWTTTEFYPQHRIDPPSTACCAPVKLATAADPTISQTLKG 258
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GS+LC +YC+R+R AARS + D++ ++GFRC AD
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 295
>gi|108757085|ref|YP_631849.1| hypothetical protein MXAN_3662 [Myxococcus xanthus DK 1622]
gi|108460965|gb|ABF86150.1| conserved hypothetical protein [Myxococcus xanthus DK 1622]
Length = 334
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 125/346 (36%), Positives = 167/346 (48%), Gaps = 63/346 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF MG++ E P+ VT+ F++D V+N F FV ATGYVT AE
Sbjct: 31 MVYIPGGTFWMGSDHHY---PEERPTHQVTVTDFWMDSQLVTNADFARFVEATGYVTVAE 87
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ PL ++ F G + + + V T
Sbjct: 88 R--------PLNPDD--------------FPGANPAL----------LVPGSLVFKKTLA 115
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L +W + G W +PEG ST + R +PVVHV + DA+AY W
Sbjct: 116 RVDLRDLTQWWFYTAGA---------CWKYPEGAGSTWQGREQYPVVHVCFEDALAYALW 166
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWEY RGGL+ +++ WG+ P G+ AN WQGEFP N DG+ T+P
Sbjct: 167 AGKTLPTEAEWEYAARGGLDRKVYVWGDEFAPGGKLMANTWQGEFPWQNLNTDGFEGTSP 226
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTG---------- 298
V ++ N +GL++M GNVWEWT+DW+ H NP+GP T
Sbjct: 227 VGNFPPNAYGLFDMAGNVWEWTSDWYQERHEGNGGKPCCIPVNPRGPATSDRSLDPCLPK 286
Query: 299 ---TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+V KGGS+LC YC R+R AARS DS ++GFRC
Sbjct: 287 VKIPRRVLKGGSHLCAPNYCLRYRPAARSPQAVDSGTSHIGFRCVV 332
>gi|34783013|gb|AAH17005.2| SUMF1 protein, partial [Homo sapiens]
Length = 132
Score = 198 bits (504), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 88/130 (67%), Positives = 104/130 (80%)
Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
LFPWGN L P+G+H AN+WQGEFP NT DG+ TAPV ++ N +GLYN+VGN WEWT
Sbjct: 1 LFPWGNKLQPKGQHYANIWQGEFPVTNTGEDGFQGTAPVDAFPPNGYGLYNIVGNAWEWT 60
Query: 277 ADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+DWW VHH + NPKGP +G D+VKKGGSY+C+ YCYR+RCAARSQNTPDSSA NLG
Sbjct: 61 SDWWTVHHSVEETLNPKGPPSGKDRVKKGGSYMCHRSYCYRYRCAARSQNTPDSSASNLG 120
Query: 337 FRCAADKGPT 346
FRCAAD+ PT
Sbjct: 121 FRCAADRLPT 130
>gi|414164694|ref|ZP_11420941.1| hypothetical protein HMPREF9697_02842 [Afipia felis ATCC 53690]
gi|410882474|gb|EKS30314.1| hypothetical protein HMPREF9697_02842 [Afipia felis ATCC 53690]
Length = 320
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 125/345 (36%), Positives = 166/345 (48%), Gaps = 62/345 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DM+ L G + MG+++ E P V + +F +D+ V+N QF+ FV TGY+T A
Sbjct: 16 DMIRLTGGVYVMGSDRHY---PEEAPEHRVKVGSFLIDRTPVTNRQFKNFVRQTGYITFA 72
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + +L DM L T H SW
Sbjct: 73 ERMPDPKDYPGILP------------DMMFAGSLVFTPPASSSH---RHSW--------- 108
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
A+W +G +W P G S I +HPVVH+++ DA+AY
Sbjct: 109 --------AQWWTLMKGA---------NWRRPYGPKSNINTLDSHPVVHITFADALAYAQ 151
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGLEN F WGN L P G + AN WQGEFP N A DGY T+
Sbjct: 152 WAGKDLPTEAEWEFAARGGLENADFAWGNELAPGGTYLANTWQGEFPHENLALDGYARTS 211
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY------NPKGP------------ 295
PV ++ N +G+++M+GNVWEWT DW+ H NP+G
Sbjct: 212 PVTAFPPNGYGIHDMIGNVWEWTKDWYGSRHQADAQKACCIPENPRGAREEDSFDAGDRN 271
Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
T +V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 272 TRIPRRVVKGGSHLCAPNYCRRYRPAARQAQPIDTSTSHVGFRCV 316
>gi|33601704|ref|NP_889264.1| hypothetical protein BB2727 [Bordetella bronchiseptica RB50]
gi|33576141|emb|CAE33220.1| putative exported protein [Bordetella bronchiseptica RB50]
Length = 380
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 124/348 (35%), Positives = 165/348 (47%), Gaps = 59/348 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG N L + E P+ V + F++D H+V+N QF+ FV ATGY+T AE
Sbjct: 49 MAWIPGHQFLMG-NDHKLSQPNERPAHQVRVSGFWMDVHDVTNAQFRRFVQATGYLTTAE 107
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ P L R SQ+ F G S + R Y W
Sbjct: 108 RKPRWEDLRPQLPPGTPRPDDSQLAPGAMVFVGTQSEVSLR--------------DYSRW 153
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P SW HP+G S I+ + +HPVV VS+ DA AY
Sbjct: 154 ----------W----------RFVPGASWRHPQGPGSGIDGKDDHPVVQVSYEDAQAYAR 193
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG+ L P+G+ AN+W Q FP
Sbjct: 194 WAGKRLPTEAEWEFAARGGLEQATYTWGDELLPQGQAMANIWDTRQQQPFPVVKD-EKVQ 252
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
+ T PV S+ N +GLY+M GNVW+WTADW+ + P +P GP D
Sbjct: 253 VGTTPVGSFAPNGYGLYDMAGNVWQWTADWYRADAFRIQAQYREPPLDPAGPADSFDPDD 312
Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC++ YC +R +AR P + +LGFR
Sbjct: 313 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 360
>gi|325287393|ref|YP_004263183.1| sulfatase-modifying factor protein [Cellulophaga lytica DSM 7489]
gi|324322847|gb|ADY30312.1| Sulphatase-modifying factor protein [Cellulophaga lytica DSM 7489]
Length = 380
Score = 198 bits (504), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 127/358 (35%), Positives = 182/358 (50%), Gaps = 54/358 (15%)
Query: 2 VLLPAPPVERYKD-MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ 59
+L+ P + D MV +PG F G + + E P+ V +D F++D EV+N Q
Sbjct: 53 LLVKKPKDVKVPDGMVWIPGGEFIQGAVPQDKMAMQHEKPAHKVAVDGFFMDITEVTNAQ 112
Query: 60 FQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVR---HDMKRFEGLDSTIEHRMH 116
F +FV+ TGY T AE+ D EE + ++ + HD G + + +
Sbjct: 113 FAKFVNQTGYTTVAERAIDW--------EEMKKQVPEGTPKPHDSILQPGSLTFKKTKTS 164
Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
P + + W W G +W HP G S+I+
Sbjct: 165 VP-------NLFDFSQWW--------HWTIG------------ANWKHPNGPGSSIKGLD 197
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
NHPVV + + DA+AYC W RLPTEAEWEY GG +F WG++ + ++ AN W+
Sbjct: 198 NHPVVQICYEDAMAYCNWANRRLPTEAEWEYAANGGKSGTIFFWGDDRSVLSKN-ANSWE 256
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP--APSYNPKG 294
GEFP N+ DG+ APV SY+ N +GLY+M GNVWEWT+DW+NV+++ A + K
Sbjct: 257 GEFPVENSKKDGFEKRAPVKSYQPNGYGLYDMAGNVWEWTSDWYNVNYYKELAATTTKKN 316
Query: 295 PTTGTD-----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
P TD KV KGGS+LC++ YC +R +AR ++ DSSA + GFR A
Sbjct: 317 PLGATDAYNPQNLYIKEKVIKGGSFLCSDSYCASYRISARMGSSVDSSAEHTGFRTVA 374
>gi|168700488|ref|ZP_02732765.1| hypothetical protein GobsU_13232 [Gemmata obscuriglobus UQM 2246]
Length = 283
Score = 198 bits (503), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 115/316 (36%), Positives = 166/316 (52%), Gaps = 54/316 (17%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P V + AF +D EV+ QF FV ATGYVT AE+
Sbjct: 10 EQPLHEVVIAAFEMDATEVTVGQFAAFVKATGYVTTAEQ--------------------- 48
Query: 97 VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
D +R+ D + V + + + W W Y GG
Sbjct: 49 -APDPRRYPSSDPA--YLKPGSAVFVPMDVPLDIPVW----------WRY--VGG----- 88
Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
+W +PEG S+++ + N+PVV ++W+DA AY W G RLPTEAEWE+ RGGL+ +
Sbjct: 89 ---ANWRYPEGPQSSVKGKKNYPVVQITWDDATAYAKWAGKRLPTEAEWEFAARGGLDRK 145
Query: 217 LFPWGNNLT-PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
+ WG++ G+ ANV+QG+FP + +DG+ +PV S++ N +GLY+M GN WEW
Sbjct: 146 TYVWGDHKNGADGKWYANVYQGQFPARDAGSDGFAGLSPVKSFQPNGYGLYDMSGNAWEW 205
Query: 276 TADWWNVHHHP-APSYNPKGPTTG--------TDKVKKGGSYLCNEQYCYRHRCAARSQN 326
ADW++ ++ +P NP+GP TG KV++GGS+LC++ YC R+ AR +N
Sbjct: 206 CADWYDPGYYARSPKENPRGPDTGPLVGGERQPQKVRRGGSFLCDDSYCSRYVPGARDKN 265
Query: 327 TPDSSAGNLGFRCAAD 342
DSSA + GFRC D
Sbjct: 266 PTDSSANHTGFRCVRD 281
>gi|86143176|ref|ZP_01061578.1| hypothetical protein MED217_08340 [Leeuwenhoekiella blandensis
MED217]
gi|85830081|gb|EAQ48541.1| hypothetical protein MED217_08340 [Leeuwenhoekiella blandensis
MED217]
Length = 373
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 123/338 (36%), Positives = 171/338 (50%), Gaps = 58/338 (17%)
Query: 19 PGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGD 78
PGD+F + KP + V +D F++D EV+N QF+ FV ATGYVT AE+
Sbjct: 73 PGDSFALPREKP---------AHPVAVDGFFIDITEVTNAQFKAFVEATGYVTLAER--- 120
Query: 79 TFVFEPLLSEEERAKISQ-VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
P+ EE + + S I R V + ND Y W
Sbjct: 121 -----PVDWEELKKNVPPGTPKPADSVLQPGSLIFDRSVQNVRDL--ND---YTQWW--- 167
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
EW+ G +W HP+G S+IE + ++PVVH+S+ DA+AYC W
Sbjct: 168 -----EWKIGA------------NWKHPQGPGSSIEGKDDYPVVHISYEDALAYCKWAKR 210
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
RLPTE EWE G + ++ WG++ + +AN WQG FPT NT DG+ APV S
Sbjct: 211 RLPTETEWEAAAHGQMPGGVYTWGDDESAL-PTKANTWQGTFPTKNTNEDGFAYVAPVAS 269
Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHH-------------PAPSYNPKGPTTGTDKVKK 304
+ N G+Y++ GNVWE T DW++ ++ PA SYNP+ P +KV K
Sbjct: 270 FDPNSIGIYDLTGNVWELTQDWFSTRYYQEIAGNTTVNPQGPAASYNPENPYQA-EKVIK 328
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GGS+LC++ YC +R +AR PDS+ + GFR AD
Sbjct: 329 GGSFLCDDSYCASYRISARMGQAPDSATDHSGFRTVAD 366
>gi|412337871|ref|YP_006966626.1| hypothetical protein BN112_0543 [Bordetella bronchiseptica 253]
gi|408767705|emb|CCJ52461.1| putative exported protein [Bordetella bronchiseptica 253]
Length = 380
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 124/348 (35%), Positives = 165/348 (47%), Gaps = 59/348 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG N L + E P+ V + F++D H+V+N QF+ FV ATGY+T AE
Sbjct: 49 MAWIPGHQFLMG-NDHKLSQPNERPAHQVRVSGFWMDVHDVTNAQFRRFVQATGYLTTAE 107
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ P L R SQ+ F G S + R Y W
Sbjct: 108 RKPRWEDLRPQLPPGTPRPDDSQLAPGAMVFVGTRSEVSLR--------------DYSRW 153
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P SW HP+G S I+ + +HPVV VS+ DA AY
Sbjct: 154 ----------W----------RFVPGASWRHPQGPGSGIDGKDDHPVVQVSYEDAQAYAR 193
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG+ L P+G+ AN+W Q FP
Sbjct: 194 WAGKRLPTEAEWEFAARGGLEQATYTWGDELLPQGQAMANIWDTRQQQPFPVVKD-EKVQ 252
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
+ T PV S+ N +GLY+M GNVW+WTADW+ + P +P GP D
Sbjct: 253 VGTTPVGSFAPNGYGLYDMAGNVWQWTADWYRADAFRIQAQYREPPLDPAGPADSFDPDD 312
Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC++ YC +R +AR P + +LGFR
Sbjct: 313 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 360
>gi|295690082|ref|YP_003593775.1| hypothetical protein Cseg_2708 [Caulobacter segnis ATCC 21756]
gi|295431985|gb|ADG11157.1| protein of unknown function DUF323 [Caulobacter segnis ATCC 21756]
Length = 342
Score = 197 bits (502), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 165/347 (47%), Gaps = 57/347 (16%)
Query: 4 LPAP-PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
LP P P R MV + G F MG ++ E P V + F++D+ EV+N F
Sbjct: 35 LPIPDPAHRTAGMVRVAGGDFLMGGAP---LRSEEGPPETVRVAPFWIDRTEVTNAAFAR 91
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV+ATGY T AE+ D PL + + R
Sbjct: 92 FVAATGYRTVAERPLDPARDAPLSAAQRRPA----------------------------- 122
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
++ + +GAR ++W R+ P W PEG S I + PVVH
Sbjct: 123 ----SLVFVGEKGARSDDPSQWW---------RVIPGADWRRPEGPGSNIRGKEAWPVVH 169
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
++W DA+AY W G LPTEAEWEY RGGL + + WG+ + RAN WQG FP
Sbjct: 170 IAWEDAMAYARWLGRDLPTEAEWEYAARGGLVAKRYVWGDQSQTPKQPRANTWQGVFPAQ 229
Query: 243 NTAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT----- 296
+ DG+ + APV + N +GL +M GNVWEWT DW+ PA GP
Sbjct: 230 DLGDDGFKARPAPVGCFAPNGYGLSDMAGNVWEWTKDWFKPGLEPASVIEMGGPPEIRAL 289
Query: 297 ----TGTDK-VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
G K V KGGS+LC + YC+R+R AAR+ PDS A ++GFR
Sbjct: 290 DPDEPGVAKHVIKGGSFLCTDDYCFRYRPAARTPGPPDSGASHIGFR 336
>gi|33597322|ref|NP_884965.1| hypothetical protein BPP2758 [Bordetella parapertussis 12822]
gi|33573749|emb|CAE38051.1| putative exported protein [Bordetella parapertussis]
Length = 380
Score = 197 bits (502), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 124/348 (35%), Positives = 165/348 (47%), Gaps = 59/348 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG N L + E P+ V + F++D H+V+N QF+ FV ATGY+T AE
Sbjct: 49 MAWIPGHQFLMG-NDHKLSQPNERPAHQVRVSGFWMDVHDVTNAQFRRFVQATGYLTTAE 107
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ P L R SQ+ F G S + R Y W
Sbjct: 108 RKPRWEDLRPQLPPGTPRPDDSQLAPGAMVFVGTRSEVSLR--------------DYSRW 153
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P SW HP+G S I+ + +HPVV VS+ DA AY
Sbjct: 154 ----------W----------RFVPGASWRHPQGPGSGIDGKDDHPVVQVSYEDAQAYAR 193
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG+ L P+G+ AN+W Q FP
Sbjct: 194 WAGKRLPTEAEWEFAARGGLEQATYTWGDELLPQGQAMANIWDTRQQQPFPVVKD-EKVQ 252
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
+ T PV S+ N +GLY+M GNVW+WTADW+ + P +P GP D
Sbjct: 253 VGTTPVGSFAPNGYGLYDMAGNVWQWTADWYRADAFRIQAQYREPPLDPAGPADSFDPDD 312
Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC++ YC +R +AR P + +LGFR
Sbjct: 313 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 360
>gi|383827540|ref|ZP_09982634.1| hypothetical protein MXEN_21717 [Mycobacterium xenopi RIVM700367]
gi|383330240|gb|EID08772.1| hypothetical protein MXEN_21717 [Mycobacterium xenopi RIVM700367]
Length = 294
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 122/334 (36%), Positives = 169/334 (50%), Gaps = 50/334 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG FRMG+ E P + F +++H V+N QF EFV+ATGY+T A
Sbjct: 4 ELVELPGGAFRMGSTS---FYPEEAPIHTAAVAPFAVERHPVTNVQFAEFVAATGYITVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +P+ D ++ G + + + +V V W
Sbjct: 61 E--------QPI--------------DPAQYPGANP--DDLVPGALVFRPTPGPVDLQDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R +W + G W HP G S + R +HPVV V++ DA AY
Sbjct: 97 R--------QWWHWVPGAC---------WRHPFGPGSDVTDRADHPVVQVAYPDAAAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLP+EAEWEY RGG + WG+ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPSEAEWEYAARGGTTT-TYVWGDEEKPNGRLMANTWQGRFPYRNDGALGWIGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----PAPSYNPKGPTTG-TDKVKKGGSY 308
PV ++ N FGL +M+GNVWEWTA ++ HH P P P GP ++ KGGS+
Sbjct: 199 PVGTFPPNGFGLLDMIGNVWEWTATEFSSHHRLDQPPKPCCTPSGPADPRVNQTLKGGSH 258
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
LC +YC+R+R AARS + D++ ++GFRC AD
Sbjct: 259 LCAPEYCHRYRPAARSPQSQDTATTHIGFRCVAD 292
>gi|325974321|dbj|BAJ83908.1| sulfatase modifying factor 2 [Hemicentrotus pulcherrimus]
Length = 379
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 124/345 (35%), Positives = 177/345 (51%), Gaps = 62/345 (17%)
Query: 4 LPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
+PA PV ++Y++MV LPG F MG++K KDGE R VT+ F ++++ ++ F++
Sbjct: 32 VPADPVYDKYEEMVALPGGKFTMGSSKSDG-KDGESVQRKVTVKPFKINKYPITTASFRK 90
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV A Y T+AEK+ +F FE + E + +TIE R+
Sbjct: 91 FVRAKKYKTDAEKYTWSFAFEAFVPE-----------------AIKNTIEQRLPG----A 129
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
W V WR PEG S IE+R+N+P VH
Sbjct: 130 PWWIPVQNAYWR-----------------------------QPEGKGSGIENRLNYPAVH 160
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
+S DA A+C W G RLPTE EWE+ RGGLE + +PWG ++R N+WQG+FP
Sbjct: 161 ISNVDARAFCQWHGWRLPTEPEWEFAARGGLEGKEYPWGKKFE---KNRMNIWQGKFPEE 217
Query: 243 NTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNV---HHHPAPSYNPKGPTTG 298
NT ADGY PV ++ +N + LY+M+GNVWEWT+ ++ APS + P+
Sbjct: 218 NTKADGYHGVGPVDAFPPQNDYDLYDMIGNVWEWTSTKFSATPSAKDDAPSPEEENPSQ- 276
Query: 299 TDKVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAA 341
V +GGSY+ ++ + H R R N D+ + NLGFRCA
Sbjct: 277 VRFVLRGGSYIDSKDGKFNHATRVTTRMGNEADAGSDNLGFRCAV 321
>gi|410473130|ref|YP_006896411.1| hypothetical protein BN117_2528 [Bordetella parapertussis Bpp5]
gi|408443240|emb|CCJ49861.1| putative exported protein [Bordetella parapertussis Bpp5]
Length = 380
Score = 197 bits (501), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 124/348 (35%), Positives = 165/348 (47%), Gaps = 59/348 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG N L + E P+ V + F++D H+V+N QF+ FV ATGY+T AE
Sbjct: 49 MAWIPGHQFLMG-NDHKLSQPNERPAHQVRVSGFWMDVHDVTNAQFRRFVQATGYLTTAE 107
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ P L R SQ+ F G S + R Y W
Sbjct: 108 RKPRWEDLRPQLPPGTPRPDDSQLAPGAMVFVGTQSEVSLR--------------DYSRW 153
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P SW HP+G S I+ + +HPVV VS+ DA AY
Sbjct: 154 ----------W----------RFVPGTSWRHPQGPGSGIDGKDDHPVVQVSYEDAQAYAR 193
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WG+ L P+G+ AN+W Q FP
Sbjct: 194 WAGKRLPTEAEWEFAARGGLEQATYTWGDELLPQGQAMANIWDTRQQQPFPVVKD-EKVQ 252
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
+ T PV S+ N +GLY+M GNVW+WTADW+ + P +P GP D
Sbjct: 253 VGTTPVGSFAPNGYGLYDMAGNVWQWTADWYRADAFRIQAQYREPPLDPAGPADSFDPDD 312
Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC++ YC +R +AR P + +LGFR
Sbjct: 313 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 360
>gi|88809095|ref|ZP_01124604.1| hypothetical protein WH7805_05366 [Synechococcus sp. WH 7805]
gi|88787037|gb|EAR18195.1| hypothetical protein WH7805_05366 [Synechococcus sp. WH 7805]
Length = 284
Score = 197 bits (500), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 120/329 (36%), Positives = 169/329 (51%), Gaps = 55/329 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV++P T+R+G+ E P R++ L +F +D V+N +F FV+ TGY T +E
Sbjct: 10 MVVIPAGTYRVGSES---FYPEEAPIRSIQLQSFVIDVAPVTNAEFARFVADTGYQTVSE 66
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K D ++ P L E++ S V F ++++ +SW V WR
Sbjct: 67 KPPDPELY-PNLPPEQQCPESAV------FTPPPASVDRSQP-----LSWWALVEGADWR 114
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
HP+G +TI++ M HPVVH+++ DA+AY W
Sbjct: 115 -----------------------------HPQGPSTTIDNLMEHPVVHLAYEDALAYAEW 145
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPT EWE RGGL + WG+ +TP G+ ANVWQG FP N DG+ T+P
Sbjct: 146 AGKRLPTADEWEVAARGGLVGHEYAWGSEMTPEGQWLANVWQGPFPWINEQTDGWFWTSP 205
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V S+ N +GL +M GNVWEWT+ + V PKG ++ KGGS+LC E Y
Sbjct: 206 VGSFPANGYGLVDMCGNVWEWTSTLFPV---------PKG--EQERRIIKGGSFLCAENY 254
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADK 343
C+R R AA T D++ ++GFRCAAD
Sbjct: 255 CHRFRPAALMGQTTDTATCHMGFRCAADS 283
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
++ KGGS+LC E YC+R R AA T D++ ++GFRCAAD
Sbjct: 241 RIIKGGSFLCAENYCHRFRPAALMGQTTDTATCHMGFRCAAD 282
>gi|293605445|ref|ZP_06687827.1| sulfatase-modifying factor 1 [Achromobacter piechaudii ATCC 43553]
gi|292816173|gb|EFF75272.1| sulfatase-modifying factor 1 [Achromobacter piechaudii ATCC 43553]
Length = 381
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 123/348 (35%), Positives = 167/348 (47%), Gaps = 59/348 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG N+ L + E P+ V + F++D+H+V+N +F+ FV ATGYVT AE
Sbjct: 50 MAWIPGSEFLMG-NEHKLSQPNERPAHKVRVSGFWMDRHDVTNAEFRRFVQATGYVTTAE 108
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ +P L R S + F G S + R Y W
Sbjct: 109 QKPRWEDLQPQLPPGTPRPDDSVLVPGAMVFIGTASEVSLR--------------DYSRW 154
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P +W HPEG S++E + HPVV VS+ DA AY
Sbjct: 155 ----------W----------RFVPGANWRHPEGPGSSVEGKDEHPVVQVSYEDAQAYAK 194
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
W G RLPTEAEWE+ RGGL + WGN L+P+G+ AN+W Q FP
Sbjct: 195 WAGKRLPTEAEWEFAARGGLAQATYSWGNELSPQGQPMANIWDTQQQQPFPVVKDEKI-Q 253
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD--- 300
+ T PV S+ N +GLY+M GNVW+WTADW+ + P +P GP D
Sbjct: 254 VGTTPVGSFPANGYGLYDMAGNVWQWTADWYRADAFVVQAQYRQPPTDPAGPADSFDPDD 313
Query: 301 ---------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC++ YC +R +AR P + +LGFR
Sbjct: 314 GNVPAAAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNGMSHLGFRT 361
>gi|363421081|ref|ZP_09309170.1| hypothetical protein AK37_10411 [Rhodococcus pyridinivorans AK37]
gi|359734816|gb|EHK83784.1| hypothetical protein AK37_10411 [Rhodococcus pyridinivorans AK37]
Length = 311
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/349 (36%), Positives = 169/349 (48%), Gaps = 70/349 (20%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++M L+ G F MG+ E P VT+D F++D H V+ +F+ FV TGYVT
Sbjct: 7 QNMTLIAGGMFWMGSED---FYPEERPVHQVTVDGFWMDTHPVTVAEFRHFVKDTGYVTT 63
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTI----EHRMHHPVVHISWNDAV 128
AE V D + G D ++ P + +D
Sbjct: 64 AE----------------------VAPDPADYPGADPSLLAPGSLVFTPPPGPVPLDD-- 99
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
Y W W + P +W HPEG ST+ R HPV HVSW DA
Sbjct: 100 -YTRW----------WSF----------VPGANWRHPEGTGSTVGGRERHPVTHVSWFDA 138
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTEAEWE+ RGGL+ + F WG+ P+G NVWQGEFP N DG
Sbjct: 139 RAYAEWAGKELPTEAEWEFAARGGLDRKAFTWGDEHEPKGRPGGNVWQGEFPWQNLETDG 198
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-------HPAPS----YNPKGPTT 297
+ T+PV ++ N +GL++ GNVWEWTAD++ +H PA S NP+ T
Sbjct: 199 FAGTSPVGHFRPNGYGLHDTAGNVWEWTADYFTANHASSGKNVAPASSCCIPTNPRVDTA 258
Query: 298 GTD-------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
D +V KGGS+LC YC R+R AAR +T +++ ++GFRC
Sbjct: 259 EHDPREAYPRRVIKGGSHLCAPNYCLRYRPAARQGDTEETATCHIGFRC 307
>gi|334319798|ref|YP_004556427.1| sulfatase-modifying factor protein [Sinorhizobium meliloti AK83]
gi|334097537|gb|AEG55547.1| Sulphatase-modifying factor protein [Sinorhizobium meliloti AK83]
Length = 326
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 126/346 (36%), Positives = 168/346 (48%), Gaps = 61/346 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
VE +MV + G TF MG+N E P VT+D F++D V+N QF EFV+ATG
Sbjct: 11 VEPAAEMVWVRGATFMMGSND---HYPEEAPVHPVTVDGFWIDDAPVTNRQFLEFVNATG 67
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+VT AE+ +P + A S +R F ++
Sbjct: 68 HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPLQ---------------- 104
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
G + ++W G +W P G S+I ++HPVVHV+++DA
Sbjct: 105 ------GTEI---SQWWIFALGA---------NWRQPLGRRSSIGAILDHPVVHVAYSDA 146
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTEAEWE RGGLE F WG+ L P G H AN WQG FP N+ DG
Sbjct: 147 KAYAEWAGKDLPTEAEWELAARGGLEGAEFAWGDELVPGGNHVANTWQGSFPVANSIDDG 206
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
+ T+PV Y N +GLY+M+GNVWEWT D+W+V H A S +
Sbjct: 207 FGRTSPVRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 337
P +V KGGS+LC YC R+R AAR D++ ++GF
Sbjct: 267 PMARVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGF 312
>gi|407723961|ref|YP_006843622.1| sulfatase-modifying factor protein [Sinorhizobium meliloti Rm41]
gi|407324021|emb|CCM72622.1| sulfatase-modifying factor protein [Sinorhizobium meliloti Rm41]
Length = 326
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 124/346 (35%), Positives = 168/346 (48%), Gaps = 61/346 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
VE +MV +PG TF MG+N E P VT+D F++D V+N QF EFV+AT
Sbjct: 11 VEPAAEMVWVPGATFMMGSNG---HYPEEAPVHPVTVDGFWIDDAPVTNRQFLEFVNATA 67
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+VT AE+ +P + A S +R F +
Sbjct: 68 HVTFAER-------KPRAEDYPGAPPSNLRAGSLVFTPPKRPL----------------- 103
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
+G + ++W G +W P G S+I ++HPVVHV+++DA
Sbjct: 104 -----KGTDI---SQWWIFTLGA---------NWRQPLGRKSSIGTILDHPVVHVAYSDA 146
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTEAEWE RGGLE F WG+ L P G + AN WQG FP N+ DG
Sbjct: 147 KAYADWAGKDLPTEAEWELAARGGLEGAEFAWGDELAPGGNYVANTWQGSFPVANSIDDG 206
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----------------PSYN 291
+ T+P+ Y N +GLY+M+GNVWEWT D+W+V H A S +
Sbjct: 207 FARTSPIRFYPPNGYGLYDMIGNVWEWTTDYWSVRHPEAAAKPCCIPSNPRNADADASID 266
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 337
P +V KGGS+LC YC R+R AAR D++ ++GF
Sbjct: 267 PMARVKVPRRVLKGGSHLCAPNYCRRYRPAARHAQEIDTTTSHVGF 312
>gi|87300776|ref|ZP_01083618.1| hypothetical protein WH5701_04990 [Synechococcus sp. WH 5701]
gi|87284647|gb|EAQ76599.1| hypothetical protein WH5701_04990 [Synechococcus sp. WH 5701]
Length = 330
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 168/343 (48%), Gaps = 68/343 (19%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSR---NVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
DMV +P FR+G G P V L F L + EV+N +F FV ATGY
Sbjct: 34 DMVAIPAGRFRIGAA-------GRLPEEAEATVRLRPFCLGRTEVTNREFASFVEATGYT 86
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
T+AE+ T F P LS ER S V E PV +SW
Sbjct: 87 TQAEQPLSTEQF-PGLSAAERRPGSVVFRPPSPGE------------PVQELSW------ 127
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W + P W HPEG S I+ R++HPVV VS D A
Sbjct: 128 -------------WHW----------MPGADWRHPEGPGSAIDERLDHPVVQVSVADVQA 164
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W GA LP+EA+WE+ RGGL++++F WG+ P AN WQG FP NNTA DGY
Sbjct: 165 YAAWAGAALPSEAQWEFAARGGLKDQVFSWGDTWKP---GLANTWQGTFPVNNTAEDGYP 221
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH------------HPAPSYNPKGPTTG 298
TAPV + N +GL++M GNVWEWT+DW+ H PA S +P+ P
Sbjct: 222 GTAPVGHFPANGYGLFDMTGNVWEWTSDWYQSGHDPIDGQQDPHLEDPAASSDPREPGV- 280
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
V KGGS+LC YC R+R AAR +PD+ ++GFR A+
Sbjct: 281 AKHVIKGGSFLCAPNYCSRYRPAAREAESPDTGTSHIGFRLAS 323
>gi|156391933|ref|XP_001635804.1| predicted protein [Nematostella vectensis]
gi|156222901|gb|EDO43741.1| predicted protein [Nematostella vectensis]
Length = 273
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 120/330 (36%), Positives = 168/330 (50%), Gaps = 69/330 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G FRMGTN KDGE P+R+V + AF +D++ ++N F+ FV + + TEAE
Sbjct: 1 MVKLSGGIFRMGTNSEDG-KDGENPARDVRVKAFAIDRYPITNELFRAFVRSKKFKTEAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFG +F F + E R+K + V W V WR
Sbjct: 60 KFGWSFAFHTFVPENVRSK---------------------VTQSVQGAPWWIPVNKAYWR 98
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P G +TI+ ++N+PVVH+S+NDA AYC W
Sbjct: 99 -----------------------------QPGGPGTTIKDKLNYPVVHISYNDATAYCNW 129
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RL TEAEWE+ RGGL+ +++PWG E R N+WQG+FP NT DGY AP
Sbjct: 130 AGKRLATEAEWEFAARGGLQGQVYPWGKKFE---EKRMNIWQGDFPGKNTKLDGYDWLAP 186
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK-VKKGGSYLCNE 312
V +Y +N +G+Y+M+GN WEW D + +P+G G K V +GGSY+ +
Sbjct: 187 VNAYPPQNSYGMYDMIGNAWEWVGDEFK---------SPEG--RGEKKFVLRGGSYIDSR 235
Query: 313 --QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
++ ++ R R NT D+ + N+ FRCA
Sbjct: 236 DGKFNHKARVTTRMGNTADAGSDNISFRCA 265
>gi|392414840|ref|YP_006451445.1| hypothetical protein Mycch_0956 [Mycobacterium chubuense NBB4]
gi|390614616|gb|AFM15766.1| hypothetical protein Mycch_0956 [Mycobacterium chubuense NBB4]
Length = 293
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 120/331 (36%), Positives = 168/331 (50%), Gaps = 49/331 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D+ L G TFRMG+ + E P VT+ F +++H V+N QF FV ATGY T A
Sbjct: 4 DLAELDGGTFRMGSTQ---FYPEEAPVHTVTVAPFAIERHPVTNAQFAAFVDATGYRTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E D +F L ++ + + +V V W
Sbjct: 61 EIAPDPALF------------------------LGASPQDLVPGALVFRPTRGPVDLRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P +W HP G S++ + HPVV V + DA+AY
Sbjct: 97 R-------QWWQW----------TPGANWRHPFGPGSSVAGKEEHPVVQVCYADALAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ RGG ++PWG+ P G+ AN WQG FP N A G+ T+
Sbjct: 140 WAGRRLPTEAEWEFAARGG-STTVYPWGDEPAPGGQLMANTWQGAFPYRNDGALGWAGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYN---PKGPTTGTDKVKKGGSYL 309
PV ++ N +GL +M+GNVWEWT + HH P+ + P+GP ++ KGGS+L
Sbjct: 199 PVGAFPLNAYGLADMIGNVWEWTTTRFAGHHRLEGPTESCCPPQGPDPAVNQALKGGSHL 258
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
C +YC+R+R AARS + DS+ ++GFRCA
Sbjct: 259 CAPEYCHRYRAAARSPQSQDSATTHIGFRCA 289
>gi|418059687|ref|ZP_12697628.1| Sulphatase-modifying factor protein [Methylobacterium extorquens
DSM 13060]
gi|373566748|gb|EHP92736.1| Sulphatase-modifying factor protein [Methylobacterium extorquens
DSM 13060]
Length = 327
Score = 196 bits (498), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 126/344 (36%), Positives = 172/344 (50%), Gaps = 63/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ + G TF MG+++ E P VT+ +F++D V+N QF++FV TG+VT AE
Sbjct: 20 MIYIRGGTFHMGSDRHY---PEERPVHRVTVGSFWMDATPVTNHQFRDFVRTTGHVTFAE 76
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K P + A+ Q+ F G S + +HPV W
Sbjct: 77 K-------RPEPDQYPGARPDQL------FAG--SLVFTPPNHPVDLRMWG--------- 112
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN-HPVVHVSWNDAVAYCT 193
A W L +W P G H ++ HPVVHV++ DA+AY
Sbjct: 113 -------AWWS----------LLKGANWRRPNGPKGITLHGLDTHPVVHVTYADALAYAR 155
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGL+ F WG+ TPRG H AN WQG+FP NT DG+ T+
Sbjct: 156 WAGKDLPTEAEWEFAARGGLDGAEFAWGDTFTPRGRHMANTWQGQFPWQNTEEDGFARTS 215
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD---------- 300
PV ++ N +GL++M GNVWEWT+D++ H P P PK P G +
Sbjct: 216 PVKAFAPNGYGLHDMTGNVWEWTSDFFADRHSADAPKPCCIPKDPRGGREDDSYDPCLPD 275
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC +C R+R AAR D+S ++GFRC
Sbjct: 276 IRIPRRVLKGGSHLCAPNFCSRYRPAARLPQPVDTSTSHVGFRC 319
>gi|14518334|ref|NP_116817.1| MS129, hypothetical protein [Microscilla sp. PRE1]
gi|14484969|gb|AAK62851.1| MS129, hypothetical protein [Microscilla sp. PRE1]
Length = 354
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 133/343 (38%), Positives = 175/343 (51%), Gaps = 51/343 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +P + MG KD E+P NV + AFY+D EV+N +F+ FV TGYVT AE
Sbjct: 45 MAYIPAGQYMMGGKSDQAYKD-EYPRHNVKVSAFYMDLTEVTNAEFKRFVDETGYVTIAE 103
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K D +E L S+ + ++ S + + PV S D Y W
Sbjct: 104 KDID---WEELKSQVPQGTPKPPDSVLQ----AGSLVFKQTDEPV---SLQD---YSQWW 150
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
EW G +W +PEG STIE RM+HPVVHVS+ D AY W
Sbjct: 151 --------EWTIGA------------NWRNPEGPGSTIEDRMDHPVVHVSFEDVQAYADW 190
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE+ GG + +PWGN + +AN WQG FP N A DG+ TAP
Sbjct: 191 AGKRLPTEAEWEWAAMGGQNDVKYPWGNESVEQASDKANFWQGNFPHQNYALDGFERTAP 250
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT---TGTD----------- 300
V S+ N +GLY+M GNVWEW D ++V+ + SY KG T TG++
Sbjct: 251 VRSFPANGYGLYDMAGNVWEWCQDKYDVNAY--ESYKQKGLTEDPTGSEHYNDPREPYTP 308
Query: 301 -KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V +GGS+LCN+ YC +R + R ++ DS + GFRC D
Sbjct: 309 KHVIRGGSFLCNDSYCSGYRVSRRMSSSRDSGFNHTGFRCVKD 351
>gi|170739501|ref|YP_001768156.1| hypothetical protein M446_1197 [Methylobacterium sp. 4-46]
gi|168193775|gb|ACA15722.1| protein of unknown function DUF323 [Methylobacterium sp. 4-46]
Length = 325
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 138/363 (38%), Positives = 177/363 (48%), Gaps = 65/363 (17%)
Query: 2 VLLPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
LL PP + +DMV LPG TFRMG+N+ E P+ V +D F++D V+N QF
Sbjct: 5 ALLDRPPAPDAVRDMVFLPGGTFRMGSNR---HYPEEAPAHRVRIDGFWIDPTPVTNAQF 61
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
+ FV ATGYVT AE+ D + L +A S + PV
Sbjct: 62 RAFVRATGYVTMAERKPDAADYPGALPHMLQAG---------------SLVFKPPKGPVN 106
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
W WR +++G W P G S+I HPV
Sbjct: 107 LRDWG-----AWWR---------FKFGVH------------WRRPYGPGSSIAGLDEHPV 140
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
VHV++ DA AY W LPTEAEWEY RGGL+ F WG+ LTP G H AN WQG FP
Sbjct: 141 VHVAYADAEAYAAWADKELPTEAEWEYAARGGLDGAEFAWGDELTPGGRHMANTWQGAFP 200
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA------------- 287
N A DGY T+PV ++ N +GL++M GNVWEWT D++ V HPA
Sbjct: 201 HQNLAEDGYERTSPVTAFPPNGYGLFDMTGNVWEWTTDFY-VPKHPADAAKACCIPQNPR 259
Query: 288 -----PSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
S++P P +V KGGS+LC YC R+R AAR + D+S ++GFRC
Sbjct: 260 GGQADESFDPSQPGIRIPRRVVKGGSHLCAPNYCRRYRPAARHPHPVDTSTSHIGFRCIR 319
Query: 342 DKG 344
G
Sbjct: 320 RSG 322
>gi|167646635|ref|YP_001684298.1| hypothetical protein Caul_2673 [Caulobacter sp. K31]
gi|167349065|gb|ABZ71800.1| protein of unknown function DUF323 [Caulobacter sp. K31]
Length = 351
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 134/346 (38%), Positives = 175/346 (50%), Gaps = 57/346 (16%)
Query: 5 PAP-PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
P P P M L+P TF MG KPI ++G P R ++AF++D+ EV+N F F
Sbjct: 44 PVPRPAPDTGGMALIPAGTFEMGA-KPIRPEEG--PPRLTRVEAFWIDRTEVTNGDFARF 100
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
V ATGYVT AE+ D + P LS ++ A + V F G D+
Sbjct: 101 VKATGYVTLAERPLDPKAY-PGLSGDQLAPSALV------FVGSDAP------------- 140
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHV 183
RGA ++W RG +W HP+G S+I + PVV V
Sbjct: 141 ----------RGA---DPSQWWRVVRGA---------NWRHPQGPGSSIVGKDFLPVVQV 178
Query: 184 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
W DA+AY W G LPTEAEWEY +GG F WG+ + +ANVWQG FP +
Sbjct: 179 GWEDAMAYARWLGRDLPTEAEWEYAAQGGKGATRFVWGDAPLDARKPQANVWQGVFPAVD 238
Query: 244 TAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTT----- 297
T ADGY S TAPV + N +GL++M GNVWEWT DW+ A +P GP +
Sbjct: 239 TGADGYTSRTAPVGCFSANGYGLHDMAGNVWEWTRDWYRPGLDAAGHDDPGGPGSSMAFD 298
Query: 298 ----GTDK-VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
G K V KGGS+LC+ +CYR+R AAR D+ ++GFR
Sbjct: 299 PGDPGVRKHVIKGGSFLCSPDFCYRYRPAARQAGPADTGESHMGFR 344
>gi|87198560|ref|YP_495817.1| hypothetical protein Saro_0536 [Novosphingobium aromaticivorans DSM
12444]
gi|87134241|gb|ABD24983.1| protein of unknown function DUF323 [Novosphingobium aromaticivorans
DSM 12444]
Length = 315
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 126/350 (36%), Positives = 170/350 (48%), Gaps = 65/350 (18%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+R+ M L+ G TF MG+ E P R V +D+F++D+ V+N QF F+ ATGY
Sbjct: 7 QRHAGMRLIEGGTFLMGSEA---FYPEEAPVRRVRVDSFWIDETPVTNFQFAAFIEATGY 63
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT AE D + ++ +RA S + + PV + A A
Sbjct: 64 VTVAEVAPDPKDYPGMIPGMDRAG---------------SLVFQKTSGPV-----DMADA 103
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDA 188
WR F +G+ W HP G S + +HPVVHV++ DA
Sbjct: 104 SNWWR----------------------FTFGACWKHPLGPGSDLAGLEDHPVVHVAYPDA 141
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY TW G LPTEAE+E+ RGGL+ + + WG+ L P G AN WQG FP N DG
Sbjct: 142 EAYATWAGKCLPTEAEFEFAARGGLDGQDYAWGSELAPDGRMMANYWQGLFPFANQCLDG 201
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-------APSYNPKGPTTGTD- 300
+ T+PV S+ N G+Y+M+GN WEWT+DWW P P NP+G
Sbjct: 202 WERTSPVRSFPPNGHGVYDMIGNTWEWTSDWWTAKPMPPKAKRSCCPIDNPRGGRLRESF 261
Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D++ ++GFRC
Sbjct: 262 DPDHPGVRIGRKVIKGGSHLCAANYCQRYRPAARHPEMVDTATSHIGFRC 311
>gi|116252022|ref|YP_767860.1| hypothetical protein RL2266 [Rhizobium leguminosarum bv. viciae
3841]
gi|115256670|emb|CAK07758.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 317
Score = 195 bits (496), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 126/347 (36%), Positives = 174/347 (50%), Gaps = 65/347 (18%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
++ +V + G TF MG+N+ + E P+ VT+D F++D V+N QF FV ATG+VT
Sbjct: 13 FQGLVWVAGRTFTMGSNEHYV---EEAPAHPVTVDGFWIDATPVTNRQFASFVKATGHVT 69
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AE+ + L + RA S V K G D T W
Sbjct: 70 VAERAPRVEDYPGALPKMLRAG-SLVFTPPKAVVGPDIT------------QW------- 109
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W +++G W HP G + I +++HPVVH++++DA AY
Sbjct: 110 -W---------SFKFGA------------DWRHPYGGRTDIRGKLDHPVVHIAYSDAKAY 147
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWE RGGL++ F WG+ L P G AN WQG FPT +T G
Sbjct: 148 AAWAGKDLPTEAEWELAARGGLDDAEFAWGDGLNPEGMLMANTWQGIFPTLSTKPAGADR 207
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPK 293
T+PV + N +G+Y+M+GNVWEWT+D+W+ HP P SY+P+
Sbjct: 208 TSPVGYFPSNGYGIYDMIGNVWEWTSDFWST-RHPEPASHACCIPSNPRGGDADASYDPQ 266
Query: 294 GPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P +V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 267 LPDIRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 313
>gi|332284401|ref|YP_004416312.1| hypothetical protein PT7_1148 [Pusillimonas sp. T7-7]
gi|330428354|gb|AEC19688.1| hypothetical protein PT7_1148 [Pusillimonas sp. T7-7]
Length = 364
Score = 195 bits (495), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 121/357 (33%), Positives = 172/357 (48%), Gaps = 58/357 (16%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
V ++M +PG F MG++ L + E P+ V + F++D V+N QF EFV TG
Sbjct: 38 VSTPQNMAWVPGGVFLMGSDS-KLAQPNERPTHQVKVHGFWMDVTHVTNDQFAEFVEKTG 96
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y+T AE+ D +E + + + P H+ A+
Sbjct: 97 YLTTAEQKPD-------------------------WETIRVQLPPGVPQPPDHVLQPGAM 131
Query: 129 AYC-TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ T L ++W R P +W HP+G S+IE + +HPVV VS+ D
Sbjct: 132 VFVGTNAQVNLNDYSQWW---------RYVPGANWRHPQGPGSSIEGKGDHPVVQVSYVD 182
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW---QGEFPTNNT 244
AY TW G RLPTEAEWE+ RGGLE + WG+ L P GE +AN W + FP +
Sbjct: 183 VQAYATWAGKRLPTEAEWEFAARGGLEQATYVWGDELMPNGEMQANYWDTTEQPFPVVSP 242
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-------PSYNPKGPTT 297
A G +T P ++ N +GL++M GN W+W +DW+ + P NPKGP
Sbjct: 243 KAGGAANTVPAGTFPPNGYGLHDMTGNAWQWVSDWYRFDYFAMQAQAGQKPIDNPKGPGE 302
Query: 298 GTD------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
D +V +GGS+LCNE YC +R +AR + P S +LGFR D
Sbjct: 303 SFDPNEPGVPANAPKRVIRGGSFLCNESYCLSYRPSARRSSDPYSPMSHLGFRLVKD 359
>gi|390367342|ref|XP_781714.3| PREDICTED: sulfatase-modifying factor 2 [Strongylocentrotus
purpuratus]
Length = 381
Score = 195 bits (495), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 178/347 (51%), Gaps = 64/347 (18%)
Query: 4 LPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
+PA PV ++Y++MV LPG F MG++K KDGE R VT+ F ++++ ++ F++
Sbjct: 32 VPADPVFDKYEEMVALPGGKFMMGSSKSDG-KDGESVQRKVTVKPFKINKYPITAASFRK 90
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV A Y T+AEK+ +F F+ + E + +TIE R+
Sbjct: 91 FVRAKKYKTDAEKYAWSFAFDAFVPE-----------------AIKNTIEQRLPG----A 129
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
W V WR PEG S IE+R+N+P VH
Sbjct: 130 PWWIPVQNAYWR-----------------------------QPEGKGSGIENRLNYPAVH 160
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
+S DA A+C W G RLPTE EWE+ RG LE + +PWG ++R N+WQG+FP
Sbjct: 161 ISNVDARAFCQWHGWRLPTEPEWEFAARGSLEGKEYPWGKKFE---KNRMNIWQGKFPEK 217
Query: 243 NTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNV-----HHHPAPSYNPKGPT 296
NT ADGY +PV ++ +N + LY+M+GNVWEWT+ ++ P+P+ + P+
Sbjct: 218 NTKADGYHGVSPVDAFPPQNDYDLYDMIGNVWEWTSTKFSATPSAKDDAPSPAGEEENPS 277
Query: 297 TGTDKVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAA 341
V +GGSY+ ++ + H R R N D+ + NLGFRCA
Sbjct: 278 Q-VRFVLRGGSYIDSKDGKFNHATRVTTRMGNEADAGSDNLGFRCAV 323
>gi|423018521|ref|ZP_17009242.1| hypothetical protein AXXA_28950 [Achromobacter xylosoxidans AXX-A]
gi|338778406|gb|EGP42880.1| hypothetical protein AXXA_28950 [Achromobacter xylosoxidans AXX-A]
Length = 378
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 123/350 (35%), Positives = 170/350 (48%), Gaps = 59/350 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ M +P F MG N + + E P+ V L F++D H+V+N +F+ FV ATGYVT
Sbjct: 45 QGMAWVPAGEFLMG-NDHRMSQPNERPAHRVRLTGFWMDAHDVTNAEFRRFVDATGYVTT 103
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ R+E L + P+ DA
Sbjct: 104 AER-------------------------KPRWEDLAVQLPPGTPRPI------DAALVP- 131
Query: 133 WRGARLPTEAEWEYGCRGGLEN-RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
GA + T E E R R +W HP+G S+I+ + +HPVV VS+ DA+AY
Sbjct: 132 --GAMVFTGTEQEVSLRDYSRWWRFVAGANWRHPQGPASSIDGKDDHPVVQVSYEDALAY 189
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAAD 247
W G RLPTEAEWE+ RGGLE + WG+ L PRG+ AN+W + FP T
Sbjct: 190 AAWAGKRLPTEAEWEFAARGGLEQATYIWGDELNPRGQAMANIWDTQQRQPFPV-VTDEK 248
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV------HHHPAPSYNPKGPTTGTD- 300
+ T PV S+ N +GLY+M GNVW+WTADW+ + P +P GP D
Sbjct: 249 VQVGTTPVASFPANGYGLYDMAGNVWQWTADWYRADAFLAQSQYRQPPLDPAGPKDSFDP 308
Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC++ YC +R +AR N P + +LGFR
Sbjct: 309 NDGNVPQGAPKRVVRGGSFLCSDTYCISYRASARRGNDPMNGMSHLGFRT 358
>gi|209547151|ref|YP_002279069.1| hypothetical protein Rleg2_5119 [Rhizobium leguminosarum bv.
trifolii WSM2304]
gi|209538395|gb|ACI58329.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
trifolii WSM2304]
Length = 322
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 124/346 (35%), Positives = 170/346 (49%), Gaps = 65/346 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+D + + G F MG+++ E P+ V +D F+LD V+N QF EFV ATGYVT
Sbjct: 19 RDQIWIDGGGFLMGSDQHY---PEEAPAHPVKVDGFWLDVAAVTNRQFAEFVEATGYVTL 75
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEK + + P E S V KR G W
Sbjct: 76 AEKASNPEDY-PGAPPEMLKPGSVVFDPPKRVSGKGPP------------QW-------- 114
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR + +G +W P G S + +++HPVV V++ DA+AY
Sbjct: 115 WR---------FRFGA------------NWRRPYGGLSNLRGKLDHPVVQVAFADAMAYA 153
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W+ LPTEAEWE+ +GGLE + F WG+ P G AN W G FPT N DG+ T
Sbjct: 154 LWKDKDLPTEAEWEFAAKGGLEGKEFAWGDEFIPDGRFMANTWHGLFPTENLKPDGFDRT 213
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-------YNPKGPTTGTD----- 300
+PV S+ N +GLY+M+GNVWEWT+D+W HPAP+ NP+G +
Sbjct: 214 SPVGSFPPNGYGLYDMIGNVWEWTSDYWTT-RHPAPAEKSCCIPVNPRGGSAEASRDSGQ 272
Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC + YC R+R AAR +S+ ++GFRC
Sbjct: 273 PDIRIARRVLKGGSHLCAQNYCRRYRPAARHAQEENSATTHIGFRC 318
>gi|152981439|ref|YP_001354099.1| hypothetical protein mma_2409 [Janthinobacterium sp. Marseille]
gi|151281516|gb|ABR89926.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
Length = 331
Score = 194 bits (494), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 128/351 (36%), Positives = 166/351 (47%), Gaps = 64/351 (18%)
Query: 7 PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
P ++ ++M +PG F MG+N E P+ V +DAF++D++ V+N+ F++FV A
Sbjct: 15 PSLQAPRNMAWIPGGKFLMGSNDHY---PEEAPAHKVVVDAFWMDKYAVTNSDFKQFVDA 71
Query: 67 TGYVT--EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
TGYVT E E D + P E S V LD +
Sbjct: 72 TGYVTMVEREVNPDDY---PGARPELLVPSSVVFQKTATKVALDDPHQ------------ 116
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
W W Y P W HP G S+++ HPVVHV
Sbjct: 117 --------W----------WHY----------VPGADWKHPRGPQSSLQGLWKHPVVHVG 148
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
DA AY W G LPTEAEWE+ RGGL+ F WG+ TP G + N WQG+FP N
Sbjct: 149 IEDAQAYAEWAGKSLPTEAEWEFAARGGLDGAEFVWGDEFTPGGRYMTNNWQGDFPHQNA 208
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH----HHPAPSYNPKG--PTTG 298
D Y T+PV ++ N +GLY+M GNVWEWT DW+ H H NP+G P
Sbjct: 209 LLDRYEWTSPVGAFPANGYGLYDMAGNVWEWTVDWYQEHSKIQHACCTINNPRGGEPDGS 268
Query: 299 TD----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
D KV KGGS+LC YC R+R AAR D++ +LGFRC
Sbjct: 269 FDQRVKNVQIPRKVMKGGSHLCAPNYCRRYRPAARMAQPIDTATCHLGFRC 319
>gi|145595455|ref|YP_001159752.1| hypothetical protein Strop_2936 [Salinispora tropica CNB-440]
gi|145304792|gb|ABP55374.1| protein of unknown function DUF323 [Salinispora tropica CNB-440]
Length = 308
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 127/338 (37%), Positives = 167/338 (49%), Gaps = 57/338 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV +PG FRMG+ E P R V +D F++DQ+ V+ +F FV TG+VT A
Sbjct: 17 EMVWIPGGRFRMGSTD---FYPEEGPVRWVEVDGFWIDQYLVTVREFDRFVRETGHVTMA 73
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E T P E S V ++ G S ++R +W
Sbjct: 74 E-VSPTAADYPDADPELLVPGSLV---FRKASGPVSLADYR--------NW--------- 112
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W + P +W HPEG ST++ R HPV HV+++DAVA+ T
Sbjct: 113 ----------WSWT----------PGATWRHPEGPGSTVDGRELHPVTHVAYSDAVAFAT 152
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTE+EWE RGGL+ +F WGN P G+ AN WQG FP N DGY T+
Sbjct: 153 WAGKELPTESEWELAARGGLDGAVFTWGNEFAPEGKLMANTWQGAFPWQNLLLDGYAGTS 212
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---------NPKGPTTGTDK-VK 303
PV ++ N LY+M GNVWEWT D V +PA S GP++ + V
Sbjct: 213 PVGAFPPNGHHLYDMAGNVWEWTQD---VFTYPAGSLAKACCTTPVTADGPSSAIPRHVI 269
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KGGS+LC YC R+R AAR D+S +LGFRC +
Sbjct: 270 KGGSHLCAPNYCMRYRPAARQGEAIDTSTCHLGFRCVS 307
>gi|254431410|ref|ZP_05045113.1| conserved domain protein [Cyanobium sp. PCC 7001]
gi|197625863|gb|EDY38422.1| conserved domain protein [Cyanobium sp. PCC 7001]
Length = 335
Score = 194 bits (493), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 132/339 (38%), Positives = 171/339 (50%), Gaps = 61/339 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P ++R+G + E P V L F L HEV+N QF FV ATGY T AE
Sbjct: 39 MVAIPAGSYRIGAGG---QRPEEGPPALVRLSPFCLATHEVTNRQFAAFVEATGYRTVAE 95
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ F P LS ERA S V F LD
Sbjct: 96 RPLSREQF-PTLSVAERAPGSLV------FRPLDP------------------------- 123
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G + A W + P W HPEG S+IEH ++HPVV V++ DA AY W
Sbjct: 124 GEPIGPMAGWHW----------VPGADWRHPEGPGSSIEHLLDHPVVQVAYEDAEAYAAW 173
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RGA LP+EA+WE+ RGGL +++F WG+ +RAN WQG FP ++A DG+ TAP
Sbjct: 174 RGADLPSEAQWEFAARGGLRDQVFGWGDTWD---ANRANTWQGPFPYRDSAEDGHAGTAP 230
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAP-------SYNPKGPTTGTDKV 302
V SY N +GL++ GNVWEWTADW+ H P P S +P+ P V
Sbjct: 231 VGSYAPNGYGLHDTTGNVWEWTADWFAPGHQLLVDRPDPRLDDAAASSDPREPGV-AKHV 289
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KGGS+LC+ YC R+R AAR +PD+ ++G R A+
Sbjct: 290 IKGGSFLCSPSYCSRYRPAAREAESPDTGTSHIGIRLAS 328
>gi|254559464|ref|YP_003066559.1| sulfatase-modifying factor like protein [Methylobacterium
extorquens DM4]
gi|254266742|emb|CAX22534.1| Sulfatase-modifying factor like (SUMF1-related); putative
C-alpha-formyglycine-generating enzyme or
methyltransferase or protein kinase [Methylobacterium
extorquens DM4]
Length = 327
Score = 194 bits (492), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 125/344 (36%), Positives = 172/344 (50%), Gaps = 63/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ + G TFRMG+++ E P VT+ +F++D V+N QF++FV TG+VT AE
Sbjct: 20 MIYIRGGTFRMGSDRHY---SEERPVHRVTVGSFWMDATPVTNHQFRDFVRTTGHVTFAE 76
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K P + A+ Q+ + G S + +HPV W
Sbjct: 77 K-------RPEPDQYPGARSDQL------YAG--SLVFTPPNHPVDLRMWG--------- 112
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN-HPVVHVSWNDAVAYCT 193
A W L +W P G H ++ HPVVHV++ DA+AY
Sbjct: 113 -------AWWS----------LLKGANWRRPNGPKGITLHGLDTHPVVHVTYADALAYAR 155
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGL+ F WG+ TPRG H AN WQG+FP NT DG+ T+
Sbjct: 156 WAGKDLPTEAEWEFAARGGLDGAEFAWGDTFTPRGRHMANTWQGQFPWQNTEEDGFARTS 215
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD---------- 300
PV ++ N +GL++M GNVWEWT+D++ H P PK P G +
Sbjct: 216 PVKAFVPNGYGLHDMTGNVWEWTSDFFADRHSADAAKPCCIPKDPRGGREDDSYDPCLPD 275
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC +C R+R AAR D+S ++GFRC
Sbjct: 276 IRIPRRVLKGGSHLCAPNFCSRYRPAARLPQPVDTSTSHVGFRC 319
>gi|348029893|ref|YP_004872579.1| hypothetical protein GNIT_2489 [Glaciecola nitratireducens FR1064]
gi|347947236|gb|AEP30586.1| hypothetical protein GNIT_2489 [Glaciecola nitratireducens FR1064]
Length = 333
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 122/339 (35%), Positives = 166/339 (48%), Gaps = 54/339 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K M + G T MG I + E P + FY+D+HEV+N QF +FV ATGY+T
Sbjct: 37 KGMAFIKGGTLFMGAGA---IYEEETPLVKRKIQDFYMDRHEVTNAQFADFVDATGYITV 93
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
A+K + + P +S E S V F L +E A ++
Sbjct: 94 AQKLPNAEHY-PNISPELLKPGSAV------FVKLSEAVE--------------AGSFTN 132
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W + G W P G +S IE + + PV+H++++DA+AY
Sbjct: 133 W----------WHF-VEGAY---------WKKPFGPNSNIEGKEHFPVIHIAYDDALAYA 172
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W+G RLPTEAE+EY RGGL+ + GN LT G + AN WQG FP +TA DGY+
Sbjct: 173 EWKGHRLPTEAEYEYASRGGLDGAKYATGNTLTDDGRYIANTWQGLFPFTDTAEDGYVGL 232
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------PAPSYNPKGPTTGTDKVK 303
APV Y +N FG+++++GNVWEWT + H P Y+ K P V
Sbjct: 233 APVGCYPQNHFGIHDLIGNVWEWTQSVYYPRHFDPKQLPQNLPPQGYDQKQPGVAVG-VI 291
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGSYLC E +C R+R AAR ++GFR D
Sbjct: 292 KGGSYLCAEDFCMRYRPAARLAQDTGLGTSHIGFRTVKD 330
>gi|422320474|ref|ZP_16401534.1| exported protein [Achromobacter xylosoxidans C54]
gi|317404770|gb|EFV85153.1| exported protein [Achromobacter xylosoxidans C54]
Length = 382
Score = 194 bits (492), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 170/350 (48%), Gaps = 59/350 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ M +P F MG N + + E P+ V L F++D H+V+N +F+ FV ATGY+T
Sbjct: 49 QGMAWIPAGEFLMG-NDHRMSQPNERPAHRVRLTGFWMDAHDVTNAEFRRFVDATGYLTT 107
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ R+E L + P+ DA
Sbjct: 108 AER-------------------------KPRWEDLAVQLPPGTPRPI------DAALVP- 135
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
GA + T E E R F G+ W HP+G S+IE + +HPVV VS+ DA AY
Sbjct: 136 --GAMVFTGTEQEVSLRDYSRWWRFVAGANWRHPQGPASSIEGKDDHPVVQVSYEDAQAY 193
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAAD 247
W G RLPTEAEWE+ RGGLE + WG+ L PRG+ AN+W + FP T
Sbjct: 194 AAWAGKRLPTEAEWEFAARGGLEQATYIWGDELAPRGQAMANIWDTQQRQPFPV-VTDEK 252
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV------HHHPAPSYNPKGPTTGTD- 300
+ T PV S+ N +GLY+M GNVW+WTADW+ + P +P GP D
Sbjct: 253 VQVGTTPVASFPPNGYGLYDMAGNVWQWTADWYRADAFLTQSQYRRPPLDPAGPKDSFDP 312
Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC++ YC +R +AR N P + +LGFR
Sbjct: 313 DDGNVPRNAPKRVVRGGSFLCSDTYCISYRASARRGNDPMNGMSHLGFRT 362
>gi|387018858|gb|AFJ51547.1| Sulfatase modifying factor 2 [Crotalus adamanteus]
Length = 321
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 121/356 (33%), Positives = 171/356 (48%), Gaps = 75/356 (21%)
Query: 7 PPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNV-TLDAFYLDQHEVSNTQFQEFVS 65
P + +D V LPG + TN + DG+ R+ + F +D++ V+N F+EFV
Sbjct: 29 PAFRKDEDTVQLPGGKLEIRTN----VLDGKNEDRSFRIIKPFVMDKYPVTNKDFREFVK 84
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
Y TEAE FG +FVFE + EE + K++Q
Sbjct: 85 DQKYKTEAEDFGWSFVFENFVPEELKKKVTQ----------------------------- 115
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
+ W LP E + W P G S+I+ R+++PV+HVSW
Sbjct: 116 -KLESAPWW---LPIEKAF-----------------WRQPSGPGSSIKDRLDYPVLHVSW 154
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
NDA A+C W+ RLPTE EWE+ RGGLENRL+PWGN +R N+WQG FP + A
Sbjct: 155 NDAQAFCKWKRKRLPTEEEWEFAARGGLENRLYPWGNKFL---ANRTNLWQGNFPKVDIA 211
Query: 246 ADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVK- 303
DG+ +PV ++ +N +GLY+++GN WEWT+ Y P GP + K
Sbjct: 212 EDGFRGASPVAAFPPQNNYGLYDLLGNTWEWTSS----------EYRPNGPASRQLGQKM 261
Query: 304 ---KGGSYLCNEQYCYRHR--CAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKK 354
+G S++ H+ R NTPDSS+ NL FRCA + V+K
Sbjct: 262 CVLRGASWIDTVDGSANHKAQVTTRMGNTPDSSSDNLSFRCAISLVKQPAQEAVRK 317
>gi|190891643|ref|YP_001978185.1| hypothetical protein RHECIAT_CH0002047 [Rhizobium etli CIAT 652]
gi|190696922|gb|ACE91007.1| hypothetical conserved protein [Rhizobium etli CIAT 652]
Length = 339
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 135/361 (37%), Positives = 181/361 (50%), Gaps = 64/361 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TFRMG++K E P V++D F++D+ V+N QF+EFV ATGYVT AE
Sbjct: 24 MAYIPGGTFRMGSDK---YYPEEAPVHRVSVDGFWVDRTPVTNRQFREFVEATGYVTFAE 80
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
++ + R H +K S + +HPV D W
Sbjct: 81 -----------IAPDPRDYPGAPAHLLK----AGSAVFLPPNHPV------DLGDISKW- 118
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W F +G+ W HP G S+I +HPVVHV++ DA AY
Sbjct: 119 ---------W-----------TFTFGANWRHPYGKGSSIRKLDDHPVVHVAYEDAAAYAA 158
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGL+ + WG+ LTP G H AN WQG+FP N A DG+ T+
Sbjct: 159 WAGKSLPTEAEWEFAARGGLDGAEYAWGDELTPGGRHMANTWQGQFPHENCAGDGFERTS 218
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWN------------VHHHP-----APSYNPKGPT 296
PV ++ N + +Y+M+GNVWEWT DW++ + +P + SY+P P
Sbjct: 219 PVTAFPANGYDVYDMIGNVWEWTTDWYSPRNKGDVSKACCIPQNPRGASESDSYDPGMPE 278
Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKG 355
KV KGGS+LC YC R+R AAR D+S +LGFRC T + V
Sbjct: 279 IRIPRKVIKGGSHLCAPNYCRRYRPAARHPEPVDTSTSHLGFRCVVRNLETEVAEDVNSH 338
Query: 356 G 356
G
Sbjct: 339 G 339
>gi|229488576|ref|ZP_04382442.1| sulfatase-modifying factor 1 [Rhodococcus erythropolis SK121]
gi|229324080|gb|EEN89835.1| sulfatase-modifying factor 1 [Rhodococcus erythropolis SK121]
Length = 336
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 126/346 (36%), Positives = 171/346 (49%), Gaps = 63/346 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+M L+PG T+ MG+ E P VT+D F++D V+ +F+ FV ATG+VT
Sbjct: 28 KNMSLIPGGTYWMGSEN---FYPEERPVHQVTVDGFWMDTTAVTVAEFRRFVKATGHVTT 84
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE D + D R + ++ + P +S +D Y
Sbjct: 85 AEIDPDAADYP----------------DADRALLVPGSLV--FNSPPGPVSLDD---YTQ 123
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W + P W HPEG DS I R HPV HVS+ DA AY
Sbjct: 124 W----------WSFT----------PGADWRHPEGADSNIGGRERHPVTHVSYFDAQAYA 163
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ + + WG++ +P G NVWQG+FP N DGY T
Sbjct: 164 AWAGKELPTEAEWEFAARGGLDRQAYVWGDHDSPGGRPGGNVWQGQFPWENLLEDGYERT 223
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTG------ 298
APV ++ N +GL++M GNVWEWTAD + H + AP+ + P+ P
Sbjct: 224 APVGKFRPNGYGLFDMAGNVWEWTADHYTSDHAHSSKNVAPASSCCIPRNPRAEFATEAL 283
Query: 299 -----TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR + +SS ++GFRC
Sbjct: 284 VGEPYARRVIKGGSHLCAPNYCLRYRPAARQGESEESSTCHVGFRC 329
>gi|87200487|ref|YP_497744.1| hypothetical protein Saro_2474 [Novosphingobium aromaticivorans DSM
12444]
gi|87136168|gb|ABD26910.1| protein of unknown function DUF323 [Novosphingobium aromaticivorans
DSM 12444]
Length = 310
Score = 193 bits (491), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 130/351 (37%), Positives = 169/351 (48%), Gaps = 66/351 (18%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
VER M L+ G TF MG+ E P R V +D+F++D+ V+N QF FV ATG
Sbjct: 2 VER-PGMRLIEGGTFTMGSEA---FYPEEAPLRRVKVDSFWIDEAPVTNAQFAAFVEATG 57
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
YVT AE D + +L +RA S + + PV D
Sbjct: 58 YVTVAEIEPDPKDYPGMLPGMDRAG---------------SLVFQKTAGPV------DMA 96
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHRMNHPVVHVSWND 187
W W F +G+ W HP G S+I+ +HPVVHV++ D
Sbjct: 97 DASNW----------WH-----------FTFGACWKHPLGPGSSIDGIEDHPVVHVAYAD 135
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G LPTEAE+EY RGGL+ F WG+ L P G AN WQG FP N D
Sbjct: 136 AEAYAKWAGKDLPTEAEFEYAARGGLDGSEFSWGDELAPEGRMMANYWQGLFPFANQCLD 195
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-------------VHHHP-----APS 289
G+ T+PV ++ N +GLY+M+GN WEWT DWW +P S
Sbjct: 196 GWERTSPVRNFPPNGYGLYDMIGNTWEWTCDWWADKPLTPQRKSACCAISNPRGGKLKDS 255
Query: 290 YNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
++P P KV KGGS+LC YC R+R AAR D++ ++GFRC
Sbjct: 256 FDPSQPAMRIGRKVIKGGSHLCAANYCQRYRPAARHPEMVDTATTHIGFRC 306
>gi|359421667|ref|ZP_09213579.1| hypothetical protein GOARA_082_00350 [Gordonia araii NBRC 100433]
gi|358242383|dbj|GAB11648.1| hypothetical protein GOARA_082_00350 [Gordonia araii NBRC 100433]
Length = 306
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 126/337 (37%), Positives = 172/337 (51%), Gaps = 57/337 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +PG TF MG++ E P T++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 16 ELVEIPGGTFHMGSDAHY---PEERPVHERTVEPFSIERHLVTNDQFAEFVADTGYVTVA 72
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D +F G D + +V V W
Sbjct: 73 EE----------------------EVDPAQFPGADPA--DLVPGALVFTKTAGPVNLGDW 108
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R +W R P SW +PEG DS+IE R +HPVV V++ DA AY
Sbjct: 109 R--------QWW---------RWEPGASWRNPEGPDSSIEGRGDHPVVQVAYRDAAAYAE 151
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTE EWE+ RGG++ F WG++L P G+ AN W G FP +T G+ T+
Sbjct: 152 WAGRRLPTETEWEFAARGGVDGLEFAWGDDLFPDGKVMANTWLGNFPYEST---GWGGTS 208
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH------HPAPSYNPK--GPTTGTD--KVK 303
PV +Y N +GLY+M+GNVWE T+D + H H + P PT +V
Sbjct: 209 PVGAYPANGYGLYDMIGNVWERTSDAYAPRHVVPGDQHVSADGRPDLLAPTVDPQVMRVT 268
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGS++C +YC R+R AARS + DS+ +LGFRCA
Sbjct: 269 KGGSFICAPEYCRRYRPAARSAQSDDSATSHLGFRCA 305
>gi|410623515|ref|ZP_11334327.1| sulfatase-modifying factor 2 [Glaciecola pallidula DSM 14239 = ACAM
615]
gi|410156731|dbj|GAC29701.1| sulfatase-modifying factor 2 [Glaciecola pallidula DSM 14239 = ACAM
615]
Length = 337
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 120/339 (35%), Positives = 167/339 (49%), Gaps = 54/339 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K M + G TF MG + E P+ + FY+D+HEV+N QF EF+ ATGYVT
Sbjct: 43 KGMAFVEGGTFTMGAGA---VYAEEMPAIQRVVKDFYMDRHEVTNLQFAEFIEATGYVTM 99
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
A++ + + P + E S V F L +E A +
Sbjct: 100 AQRVPNADDY-PDIPAELLTPGSAV------FVKLSQAVE--------------ASTFFN 138
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W + G +W +P G +S I+ + + PV+H++++DA AY
Sbjct: 139 W----------WHF-TEGA---------NWKNPLGPNSNIQDKDDFPVIHIAYDDAQAYA 178
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W+G RLPTEAE+EY RGGL+ + G+ L G++ AN WQG FP N+TA DGY+
Sbjct: 179 KWKGHRLPTEAEYEYASRGGLDGAKYASGSTLLNNGKYIANTWQGLFPFNDTAEDGYVGL 238
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------PAPSYNPKGPTTGTDKVK 303
APV Y N FG+++++GNVWEWT + H PA Y+ K P V
Sbjct: 239 APVGCYPANHFGIHDLIGNVWEWTQSVYYPRHFDPSHLPANLPAKGYDEKQPGIAV-GVV 297
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGSYLC E +C R+R AAR ++GFR D
Sbjct: 298 KGGSYLCAEDFCMRYRPAARHAQDTGLGTSHIGFRTVKD 336
>gi|340616167|ref|YP_004734620.1| C-alpha-formylglycine-generating protein [Zobellia galactanivorans]
gi|339730964|emb|CAZ94228.1| C-alpha-formylglycine-generating protein [Zobellia galactanivorans]
Length = 381
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 170/343 (49%), Gaps = 50/343 (14%)
Query: 15 MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV +PG F G + D E P+ V +D F++D EV+N +FQ+FV TGY T A
Sbjct: 63 MVWIPGAVFSQGAVPQDKAAMDHEKPAHKVAVDGFFMDITEVTNDEFQKFVDETGYKTVA 122
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + L E + HD G + + + P ++ + W
Sbjct: 123 EREIDWEEMKKQLPEG-----TPKPHDSIMQPGSLTFKQAKSSVPNLY-------DFSQW 170
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W G +W +P G S+I+ + N+PVVH+++ DA+AYC
Sbjct: 171 WS--------WTIGA------------NWKNPNGPGSSIKGKGNYPVVHIAYEDALAYCE 210
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWE RG N ++ WG+ + +AN W+GEFP N DG+ A
Sbjct: 211 WANRRLPTEAEWERAARGNHLNTIYFWGDE-DAKLSDKANTWEGEFPVTNVETDGFGLRA 269
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS---------------YNPKGPTTG 298
PV SY N FGLY+M GNVWEWT DW+N +++ S YN + P
Sbjct: 270 PVKSYPANDFGLYDMAGNVWEWTNDWYNTNYYKELSTTFPVAKNPLGADQAYNERDP-YA 328
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+KV KGGS+LCN YC +R ++R + DSS +LGFR A
Sbjct: 329 REKVMKGGSFLCNASYCASYRVSSRMATSLDSSLEHLGFRTVA 371
>gi|402491193|ref|ZP_10837981.1| hypothetical protein RCCGE510_25726 [Rhizobium sp. CCGE 510]
gi|401809592|gb|EJT01966.1| hypothetical protein RCCGE510_25726 [Rhizobium sp. CCGE 510]
Length = 323
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/358 (35%), Positives = 175/358 (48%), Gaps = 63/358 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+ R MV +PG TF MG+++ E P+ V +D F++++ V+N F EFV ATG
Sbjct: 11 LARIDGMVWIPGGTFVMGSDRHY---PEEAPAHPVKVDGFWIEETPVTNRAFAEFVKATG 67
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
YVT AE D + E RA S + K + DS I W
Sbjct: 68 YVTVAETIPDAKNYPGARPEMLRAG-SLLFVQPKTVD--DSDISQ----------W---- 110
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
WR +++G +W P G S ++ +++HPVVHV++ DA
Sbjct: 111 ----WR---------FKFGA------------NWRRPLGGLSDLKGKLDHPVVHVAYADA 145
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTEAEWE+ RGGL + F WG+ + P AN WQG FPT +T G
Sbjct: 146 TAYARWAGLDLPTEAEWEFAARGGLGDVEFAWGSEMMPDDVIMANTWQGVFPTKSTKPKG 205
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYN 291
++ V S+ N +GLY+M+GNVWEWT+D+W+ H P SY+
Sbjct: 206 RERSSAVRSFPANGYGLYDMIGNVWEWTSDYWSTRHPEPARHSCCIPNNPRGGDPVASYD 265
Query: 292 PKGPT-TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ P T +V KGGS+LC YC R+R AAR D+S ++GFRC T G
Sbjct: 266 RRQPAITIPRRVLKGGSHLCAPNYCRRYRPAARHAEQEDTSTSHVGFRCIRRASSTAG 323
>gi|389864686|ref|YP_006366926.1| sulfatase-modifying factor [Modestobacter marinus]
gi|388486889|emb|CCH88441.1| Sulfatase-modifying factor [Modestobacter marinus]
Length = 307
Score = 192 bits (489), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 158/336 (47%), Gaps = 67/336 (19%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P V +D F++D+H V+ T F FV+ TGY+T AE+
Sbjct: 10 ERPVHRVAVDGFWMDRHPVTATAFARFVADTGYLTVAER--------------------- 48
Query: 97 VRHDMKRFEGLDSTI----EHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGL 152
R D + G+D + P + D A+ W Y
Sbjct: 49 -RPDPAAYPGVDPAVLVAGSLVFTPPARPVPLRDPRAW-------------WTY------ 88
Query: 153 ENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGG 212
P SW HPEG ST++ R +HPVVHV+ DA AY W G LPTEAEWE+ RGG
Sbjct: 89 ----VPGASWRHPEGPGSTVDGRGDHPVVHVAHEDAEAYARWAGKDLPTEAEWEFAARGG 144
Query: 213 LENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNV 272
L ++ WG+ P G AN WQGEFP + DGY T+PV ++ N +GL++ GNV
Sbjct: 145 LAGAVYTWGDEFAPGGRLMANTWQGEFPWEDLGTDGYKRTSPVGAFPPNGYGLHDTAGNV 204
Query: 273 WEWTADWWNVHHH---PAPSYNPKGP---------TTG------TDKVKKGGSYLCNEQY 314
WEWT+D + H P+ P+ P TTG +V KGGS+LC Y
Sbjct: 205 WEWTSDLYAPRHSGDAPSACCIPRNPRVHTADQTWTTGLPDAAVARRVVKGGSHLCAPNY 264
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTD 350
C R+R AAR D+S G+LGFRC T D
Sbjct: 265 CLRYRPAARQGEPVDTSTGHLGFRCVLRSSERTAPD 300
>gi|427392884|ref|ZP_18886787.1| hypothetical protein HMPREF9698_00593 [Alloiococcus otitis ATCC
51267]
gi|425731089|gb|EKU93916.1| hypothetical protein HMPREF9698_00593 [Alloiococcus otitis ATCC
51267]
Length = 281
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 120/334 (35%), Positives = 168/334 (50%), Gaps = 61/334 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+V + G FRMGT+ +D E PS V + F + ++N +F+EF+ TGYVT A
Sbjct: 3 LVSITGGEFRMGTDDNKGFPEDYEGPSTIVNVPNFEIADTPITNREFKEFIDDTGYVTTA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ G +FVF LL EEE++K +H + P
Sbjct: 63 EQKGCSFVFGKLLDEEEKSKA-----------------QHVLGTP--------------- 90
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W + L P W HP G S + + +HPVVHVS DA+AYC
Sbjct: 91 ----------WWF---------LVPGADWAHPYGPKSNLNNLWDHPVVHVSLKDALAYCD 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W LPTEAEWEY R G + FPWG++LT + AN WQG+FP N+ DGYL+TA
Sbjct: 132 WSETHLPTEAEWEYAARAGTTSE-FPWGDSLTEDNTYHANTWQGKFPEVNSLDDGYLATA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVK-----KGGSY 308
PV +Y+ N +GLY ++GNVWEW N + +N +K+ +GGS+
Sbjct: 191 PVYNYEPNNWGLYQVIGNVWEWCR---NPRYTLFEDFNSNEFLVNKNKLVGEYAIRGGSF 247
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
LC+ YC R+R A R+ +S++ +L FRC +
Sbjct: 248 LCHSCYCNRYRVAGRNGVDNESTSSHLSFRCIKE 281
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 16/39 (41%), Positives = 26/39 (66%)
Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
+GGS+LC+ YC R+R A R+ +S++ +L FRC +
Sbjct: 243 RGGSFLCHSCYCNRYRVAGRNGVDNESTSSHLSFRCIKE 281
>gi|311106407|ref|YP_003979260.1| hypothetical protein AXYL_03225 [Achromobacter xylosoxidans A8]
gi|310761096|gb|ADP16545.1| hypothetical protein AXYL_03225 [Achromobacter xylosoxidans A8]
Length = 387
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 119/349 (34%), Positives = 167/349 (47%), Gaps = 61/349 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG++ L + E P+ V++ F++D ++V+N QF+ FV ATGYVT AE
Sbjct: 56 MAWVPGREFLMGSSH-KLAQPNEMPAHKVSVSGFWMDVNDVTNAQFRRFVEATGYVTTAE 114
Query: 75 ---KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
K+ D V P + ++ + F G +S + R Y
Sbjct: 115 QKPKWEDLQVQLPPGTPRPDERL--LVPGAMVFVGTESEVSLR--------------DYS 158
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W W Y P +W HP+G S+I + +HPVV VS+ DA AY
Sbjct: 159 RW----------WRY----------VPGANWRHPQGPGSSIAGKDDHPVVQVSYQDAAAY 198
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG--- 248
W G RLPTE+EWE RGGLE + WG L P+G+ AN+W + P
Sbjct: 199 AKWAGKRLPTESEWEMAARGGLEQATYAWGEELLPQGKPMANIWDTQQPQPFPVVKDEKV 258
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH------HHPAPSYNPKGPTTGTD-- 300
+ T PV S+ N +GLY+M GNVW+WTADW+ + P NP GP D
Sbjct: 259 QVGTMPVGSFSPNGYGLYDMAGNVWQWTADWYRADAFKIQAQYRQPPANPAGPADSYDPD 318
Query: 301 ----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V +GGS+LC++ YC +R +AR P + +LGFR
Sbjct: 319 DGIVPPGAPKRVTRGGSFLCSDTYCISYRTSARRGTDPMNGMSHLGFRT 367
>gi|442321248|ref|YP_007361269.1| hypothetical protein MYSTI_04287 [Myxococcus stipitatus DSM 14675]
gi|441488890|gb|AGC45585.1| hypothetical protein MYSTI_04287 [Myxococcus stipitatus DSM 14675]
Length = 333
Score = 192 bits (488), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 163/346 (47%), Gaps = 63/346 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG T+ MG++ E P+ VT+ F++D+ V+N F FV TGYVT AE
Sbjct: 27 MEWIPGGTYWMGSDTHY---PEERPTHQVTVTGFWMDRCTVTNADFARFVQTTGYVTVAE 83
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D F P E S V H L + W
Sbjct: 84 RPLDAKDF-PGAKPELLVPGSLVFHKTPERVDLKDLSQ--------------------W- 121
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y P W P+G ST + + +HPVVHV++ DA AY TW
Sbjct: 122 ---------WTY----------TPGACWHRPQGHGSTWKGQEDHPVVHVAFEDARAYATW 162
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LP+EAEWE R GL+ +++ WG+ +P + AN+WQGEFP N DGY T+P
Sbjct: 163 AGKDLPSEAEWERAARAGLDRKVYVWGDEFSPGNQQMANIWQGEFPWQNLMTDGYEGTSP 222
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD-------- 300
V +Y N +GL++M GNVWEWT DW+ H NP+GP +
Sbjct: 223 VGTYPPNAYGLFDMAGNVWEWTTDWFQERHQGNGGKACCIPVNPRGPPSSQKSFDPCTPE 282
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+V KGGS+LC YC R+R AARS DS ++GFRC A
Sbjct: 283 VLIPRRVVKGGSHLCASNYCLRYRPAARSPQAEDSGTTHIGFRCIA 328
>gi|359798305|ref|ZP_09300879.1| hypothetical protein KYC_15197 [Achromobacter arsenitoxydans SY8]
gi|359363851|gb|EHK65574.1| hypothetical protein KYC_15197 [Achromobacter arsenitoxydans SY8]
Length = 332
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 164/348 (47%), Gaps = 59/348 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG N + + E P+ V + F++D H+V+N +F+ FV ATGYVT AE
Sbjct: 1 MAWIPGTDFLMG-NDHQMSQPNERPAHKVQVSGFWMDVHDVTNAEFRRFVEATGYVTTAE 59
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
P L R S++ F G ++ + R Y W
Sbjct: 60 MKPRWEDLRPQLPPGTPRPHDSELIAGAMVFVGTETEVSLR--------------DYSRW 105
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P +W HP+G S I + HPVV VS+ DA AY
Sbjct: 106 ----------W----------RFVPGANWRHPQGPGSDIAGKDEHPVVQVSYEDAQAYAK 145
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGY 249
W G RLPTEAEWE+ RGGLE + WGN L P+G+ AN+W Q FP
Sbjct: 146 WAGKRLPTEAEWEFAARGGLEQATYTWGNELLPQGKAMANIWDTQQQQPFPVVKDEKI-Q 204
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNPKG 294
+ T PV SY N +GLY+M GNVW+WTADW+ PA S++P
Sbjct: 205 VGTTPVGSYAPNGYGLYDMAGNVWQWTADWYRADAFQIQAQFRQPPKDPAGPADSFDPSD 264
Query: 295 ---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P + +V +GGS+LC++ YC +R +AR P + +LGFR
Sbjct: 265 GNVPVSAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNPMSHLGFRT 312
>gi|427702573|ref|YP_007045795.1| hypothetical protein Cyagr_1278 [Cyanobium gracile PCC 6307]
gi|427345741|gb|AFY28454.1| hypothetical protein Cyagr_1278 [Cyanobium gracile PCC 6307]
Length = 327
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 130/339 (38%), Positives = 170/339 (50%), Gaps = 62/339 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG +R+G L ++ E +V L F + + EV+N +F FV+ATGY T+AE
Sbjct: 32 MAPVPGGRYRIGAAG-RLPEEAE---ASVRLQPFCIARTEVTNREFAAFVAATGYRTQAE 87
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ F P LS ER S V E P+ +SW
Sbjct: 88 QPLPQEQF-PELSAAERRPGSVVFRPPSPGE------------PLAELSW---------- 124
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W++ P W HPEG S+I+ R++HPVV VS DA AY W
Sbjct: 125 ---------WQW----------VPGADWRHPEGPGSSIDDRLDHPVVQVSLADAEAYAAW 165
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LP+EA+WE+ RGGL +R F WGN P AN WQGEFP +NTA DG+ TAP
Sbjct: 166 AGTALPSEAQWEFAARGGLRDRAFSWGNTWRP---GLANTWQGEFPVSNTADDGFTGTAP 222
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH------------HPAPSYNPKGPTTGTDKV 302
V S+ N +GL +M GNVWEWTADW+ H PA S +P+ P V
Sbjct: 223 VGSFPPNGYGLQDMTGNVWEWTADWYRPGHDRLAGQDDPVLADPAASSDPREPGV-AKHV 281
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KGGS+LC YC R+R AAR +PD+ ++GFR +
Sbjct: 282 IKGGSFLCAPNYCSRYRPAAREAESPDTGTSHIGFRLVS 320
>gi|163850229|ref|YP_001638272.1| hypothetical protein Mext_0792 [Methylobacterium extorquens PA1]
gi|163661834|gb|ABY29201.1| protein of unknown function DUF323 [Methylobacterium extorquens
PA1]
Length = 327
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 171/344 (49%), Gaps = 63/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ + G TFRMG+++ E P VT+ +F++D V+N QF++FV TG+VT AE
Sbjct: 20 MIYIRGGTFRMGSDRHY---SEERPVHRVTVGSFWMDATPVTNHQFRDFVRTTGHVTFAE 76
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K P + A+ Q+ + G S + +HPV W
Sbjct: 77 K-------RPEPDQYPGARSDQL------YAG--SLVFTPPNHPVDLRMWG--------- 112
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN-HPVVHVSWNDAVAYCT 193
A W L +W P G H ++ HPVVHV++ DA+AY
Sbjct: 113 -------AWWS----------LLKGANWRRPNGPKGITLHGLDTHPVVHVTYADALAYAR 155
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGL+ F WG+ TPRG H AN WQG+FP NT DG+ T+
Sbjct: 156 WAGKDLPTEAEWEFAARGGLDGAEFAWGDTFTPRGRHMANTWQGQFPWQNTEEDGFARTS 215
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPKGPTTGTD---------- 300
PV ++ N +GL++M GNVWEWT+D++ H P PK P G +
Sbjct: 216 PVKAFVPNGYGLHDMTGNVWEWTSDFFADRHSADAAKPCCIPKDPRGGREDDSYDPCLPD 275
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC +C R+R AR D+S ++GFRC
Sbjct: 276 IRIPRRVLKGGSHLCAPNFCSRYRPEARLPQPVDTSTSHVGFRC 319
>gi|441513058|ref|ZP_20994890.1| hypothetical protein GOAMI_15_00400 [Gordonia amicalis NBRC 100051]
gi|441452039|dbj|GAC52851.1| hypothetical protein GOAMI_15_00400 [Gordonia amicalis NBRC 100051]
Length = 316
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 123/351 (35%), Positives = 169/351 (48%), Gaps = 61/351 (17%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
AP K+M +PG F MG+ + E P V + F++D H V+ +F+ FV
Sbjct: 5 APTGRAPKNMSWIPGGMFWMGSEE---FYPEERPVHQVGVGGFWMDTHPVTVAEFRRFVK 61
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
TG+VT AE P +E A + + F P + +
Sbjct: 62 DTGHVTTAE-------VAPDPAEYPDADPALLVPGSLVFT-----------PPARRVPLD 103
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
D + +W P W HPEG ST+ R HPV HVS
Sbjct: 104 DYRRWWSW-----------------------VPGADWRHPEGPGSTVGGRERHPVTHVSH 140
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
DA+AY W G LPTE+EWE+ RGGL+ F WG+ TP+G + AN WQGEFP N A
Sbjct: 141 YDALAYARWAGKELPTESEWEFAARGGLDRARFSWGDEYTPKGRYLANTWQGEFPWQNLA 200
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----AP------SYNPKG 294
DGY+ T+PV S++ N +GL ++ GNVWEWT D + H P +P +P+
Sbjct: 201 EDGYVGTSPVGSFRPNGYGLVDVTGNVWEWTTDRFTARHDPNRRGSSPVTACCMPADPRT 260
Query: 295 PTTGTD------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+GT +V KGGS+LC YC R+R AAR + ++S ++GFRC
Sbjct: 261 LASGTGAEAHPRRVIKGGSHLCAPNYCLRYRPAARQGESAETSTCHIGFRC 311
>gi|213514658|ref|NP_001135212.1| sulfatase-modifying factor 2 precursor [Salmo salar]
gi|209154232|gb|ACI33348.1| Sulfatase-modifying factor 2 precursor [Salmo salar]
Length = 281
Score = 192 bits (487), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/307 (37%), Positives = 159/307 (51%), Gaps = 69/307 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG MGTN +DGE P++ VT+D F +D++ V+N+ F+EFV A Y TEAE
Sbjct: 34 MVQIPGGKMTMGTNSADG-RDGESPTKEVTVDPFRIDKYPVTNSDFREFVRAQKYKTEAE 92
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVF+ +SEE ++K+++ + W
Sbjct: 93 TFGWSFVFQDFVSEELKSKVTE------------------------------KIESAPWW 122
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+P E R+F W P G S I R++ PVV VSWNDA A+C+W
Sbjct: 123 ---MPVE-------------RVF----WRQPAGPGSGIRERLDSPVVQVSWNDAQAFCSW 162
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLP+E EWE+ RGGL+ R +PWGN + R+N+WQG FP +TA DGY AP
Sbjct: 163 RGRRLPSEEEWEWAARGGLQGRTYPWGNKFQSK---RSNLWQGLFPDGDTAEDGYHGIAP 219
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
V ++ +N FGLY+M+GN WEWT+ P P P V +GGS++ E
Sbjct: 220 VTAFPPQNSFGLYDMMGNAWEWTS-------TPFPGGRPM-------FVLRGGSWIDTED 265
Query: 314 YCYRHRC 320
H+
Sbjct: 266 GSANHKA 272
>gi|377807502|ref|YP_004978694.1| hypothetical protein BYI23_C001100 [Burkholderia sp. YI23]
gi|357938699|gb|AET92256.1| hypothetical protein BYI23_C001100 [Burkholderia sp. YI23]
Length = 356
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 121/346 (34%), Positives = 161/346 (46%), Gaps = 58/346 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNV---TLDAFYLDQHEVSNTQFQEFVSATGYVT 71
M+ + G FRMG+ +G RNV LD F +D HEV+N QF FV ATGYVT
Sbjct: 56 MIAIAGGRFRMGST------EGYPEERNVQPVKLDGFLIDAHEVTNAQFAMFVRATGYVT 109
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
EAE+ + D + F G+ ++ V W +
Sbjct: 110 EAER----------------------QRDPRDFPGV---AREQLQPGSVVFVWPEGAQ-- 142
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
+ W R P +W HP G S I + NHPVV V++ DA AY
Sbjct: 143 -----QFEQSYRWW---------RFQPGANWRHPYGPGSGIAGKDNHPVVQVTYRDAQAY 188
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAE+EY RGG E+ +PWGN G++ AN WQG+FP +N A DG+
Sbjct: 189 AKWLGHDLPTEAEYEYAARGGRESETYPWGNTPKANGKYMANTWQGDFPAHNEAGDGHKG 248
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD--------KVK 303
TAPV Y N FGL + VGNVW+WT D + H + NP +V
Sbjct: 249 TAPVGCYPANDFGLLDPVGNVWQWTKDVYQDRHPAEEAQNPLVVRVSAPFDDGRHPARVI 308
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
KGGS+LC +C R+R ++R P ++GFR P + +
Sbjct: 309 KGGSFLCAPNFCVRYRPSSRQPQDPTLGTVHIGFRTVLRMSPDSAS 354
>gi|424870496|ref|ZP_18294158.1| hypothetical protein Rleg5DRAFT_1953 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393166197|gb|EJC66244.1| hypothetical protein Rleg5DRAFT_1953 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 317
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 174/347 (50%), Gaps = 65/347 (18%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
++ +V + G TF MG+N+ + E P+ VT+D F++D V+N QF FV ATG+VT
Sbjct: 13 FQGLVWVAGRTFTMGSNEHYV---EEAPAHPVTVDGFWIDATPVTNRQFASFVKATGHVT 69
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AE+ + L + RA S V K G D T W
Sbjct: 70 VAERAPRAEDYPGALPKMLRAG-SLVFTPPKVVVGPDIT------------QW------- 109
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W +++G W HP G + I +++HPVVH++++DA AY
Sbjct: 110 -W---------SFKFGA------------DWRHPYGGRTDIRGKLDHPVVHIAYSDAKAY 147
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W LPTEAEWE RGGL++ + WG+ L P+G+ AN WQG FP +T G
Sbjct: 148 AAWAEKDLPTEAEWELAARGGLDDAEYAWGDELNPKGKLMANTWQGTFPALSTKPAGADR 207
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPK 293
T+PV + N +G+Y+M+GNVWEWT+D+W+ HP P SY+P+
Sbjct: 208 TSPVGYFPSNGYGIYDMIGNVWEWTSDFWST-RHPEPASHSCCIPSNPRGGDADASYDPR 266
Query: 294 GPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P +V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 267 LPDIRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 313
>gi|389695482|ref|ZP_10183124.1| hypothetical protein MicloDRAFT_00052980 [Microvirga sp. WSM3557]
gi|388584288|gb|EIM24583.1| hypothetical protein MicloDRAFT_00052980 [Microvirga sp. WSM3557]
Length = 327
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 127/350 (36%), Positives = 175/350 (50%), Gaps = 62/350 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MVL+PG TFRMG+++ E P VT+D F++D+ V+N +F+ FV ATG+VT A
Sbjct: 22 NMVLIPGGTFRMGSDR---HYPEEAPVHRVTVDGFWIDRTPVTNREFRRFVGATGHVTVA 78
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + E +A S + +PV W+ ++
Sbjct: 79 ERTPDPRDYPGARPEMLKAG---------------SLVFSPPDYPVNLKDWSQWWSFT-- 121
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+G G P G S+I+ +HPVVHV++ DA AY
Sbjct: 122 ------------FGANG------------RRPYGRGSSIKSLDDHPVVHVAYADAEAYAA 157
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE+ RGGL+ F WG+ TP G H AN WQG FP NT DG+ T+
Sbjct: 158 WAGKDLPTEAEWEFAARGGLDGAEFAWGDEFTPDGRHMANTWQGAFPFQNTKLDGFERTS 217
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYNPKGPT 296
PV ++ N +GL++M+GNVWEWT+DW+ H A SY+P P
Sbjct: 218 PVTAFPPNAYGLHDMIGNVWEWTSDWYAPKHPADAPKACCIPENPRGGREADSYDPCQPE 277
Query: 297 TGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+V KGGS+LC YC R+R AAR D+S ++GFRC + P
Sbjct: 278 IRIPRRVLKGGSHLCAPSYCRRYRPAARHAQPVDTSTSHVGFRCVVRRRP 327
>gi|299533665|ref|ZP_07047039.1| hypothetical protein CTS44_22744 [Comamonas testosteroni S44]
gi|298718387|gb|EFI59370.1| hypothetical protein CTS44_22744 [Comamonas testosteroni S44]
Length = 365
Score = 191 bits (486), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 124/348 (35%), Positives = 166/348 (47%), Gaps = 52/348 (14%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+MV +P F MG + + E P V L F++D+ V+N QF EFV ATGYVT
Sbjct: 51 KNMVWIPSGHFLMGNTSSKALPN-EMPGHKVELQGFWMDKTHVTNRQFSEFVHATGYVTT 109
Query: 73 AEKFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AEK D + L R S + F G + R Y
Sbjct: 110 AEKTPDWQSLQAQLPVGTPRPDSSVLIPGASVFVGTRQPVSLR--------------DYM 155
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W W Y GG + W HP G +S+IE + +HPVV VS+ DA AY
Sbjct: 156 QW----------WRY--VGGAD--------WRHPSGPNSSIEGKDDHPVVQVSYEDAQAY 195
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL- 250
W G RLPTEAEWEY RGGL + + WG P G A++W+ + A L
Sbjct: 196 ARWIGKRLPTEAEWEYAARGGLNQQHYAWGTLAEPNGNVMASIWKSRTASFPQAQPKILP 255
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWW-----NVHHHPAPSYNPKGPTTGTD----- 300
T PV +Y+ N +GL++M GN W+W ADW+ ++ NP+GP+ D
Sbjct: 256 GTEPVANYEPNGYGLFDMAGNAWQWAADWYRYDAFSIQARMQNISNPQGPSNSFDPDGLR 315
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+V +GGS+LCNE YC +R +AR P S++ N+GFR DK
Sbjct: 316 SDAPKRVIRGGSFLCNEDYCEGYRVSARQGQDPWSASNNVGFRLVMDK 363
>gi|262200810|ref|YP_003272018.1| hypothetical protein Gbro_0811 [Gordonia bronchialis DSM 43247]
gi|262084157|gb|ACY20125.1| protein of unknown function DUF323 [Gordonia bronchialis DSM 43247]
Length = 299
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 127/336 (37%), Positives = 170/336 (50%), Gaps = 57/336 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG FRMG+++ E P+ + + AF +++H V+N F EFV+ TGYVT A
Sbjct: 9 ELVDLPGGVFRMGSDRHYA---EERPAHDRAVGAFAIERHPVTNAAFAEFVADTGYVTVA 65
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E EP+ D F G D + +V V W
Sbjct: 66 E--------EPI--------------DPADFPGADPG--DLVPGALVFTPTAGPVNLADW 101
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+ +W R P +W PEG S +E R +HPVVH+++ DA AY
Sbjct: 102 Q--------QWW---------RWQPGAAWHRPEGPGSGVEDRPDHPVVHIAYRDAAAYAG 144
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGGL F WG+ L P G AN W+G FP N +G+ +T+
Sbjct: 145 WAGRRLPTEAEWEYAARGGLAAAEFAWGDELHPDGADLANTWKGRFPYRN---EGWGTTS 201
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH------HPAPSYNPK--GPTTGTD--KVK 303
PV + N FGL++M+GNVWE T+D + H H P PTT +V
Sbjct: 202 PVGTCPPNGFGLFDMIGNVWERTSDVFTPRHVRPDDVHVDADGRPDLLAPTTSPRIMRVT 261
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KGGSY+C +YC R+R AARS + DS+ +LGFRC
Sbjct: 262 KGGSYICAPEYCRRYRPAARSAQSDDSATSHLGFRC 297
>gi|419708452|ref|ZP_14235922.1| hypothetical protein OUW_02929 [Mycobacterium abscessus M93]
gi|382944484|gb|EIC68792.1| hypothetical protein OUW_02929 [Mycobacterium abscessus M93]
Length = 297
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 168/339 (49%), Gaps = 59/339 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 10 ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 66
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 67 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 90
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 91 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 140
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG ++ WG+ + P G AN WQG+FP N A G
Sbjct: 141 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGKFPYRNDGALG 199
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY--NPK-GPTTGTDKVKKG 305
+ T+PV ++ N FGL +M+GNVWEWT+ + H AP +P P +V KG
Sbjct: 200 WHGTSPVGTFPANDFGLLDMIGNVWEWTSTRFQ-HGAGAPKSCCSPSDNPDPSVIQVLKG 258
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
GS+LC +YC+R+R AARS + DS+ ++GFRC + G
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 297
>gi|262204582|ref|YP_003275790.1| hypothetical protein Gbro_4780 [Gordonia bronchialis DSM 43247]
gi|262087929|gb|ACY23897.1| protein of unknown function DUF323 [Gordonia bronchialis DSM 43247]
Length = 337
Score = 191 bits (485), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 126/348 (36%), Positives = 167/348 (47%), Gaps = 69/348 (19%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
KDMV +PG F MG+++ E P+ V++D F++D++ V+ +F+ FV ATGYVT
Sbjct: 30 KDMVWVPGGVFWMGSDEHY---PEERPAHQVSVDGFWMDRNPVTVAEFRRFVKATGYVTT 86
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
A++ D F P + S V LD Y
Sbjct: 87 AQQQPDAADF-PDADPDLLVPGSMVFTPPDGPVPLDD--------------------YRR 125
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W Y P W HPEG S + R HPV HVS+ DA AY
Sbjct: 126 W----------WSY----------VPRADWRHPEGPKSNVGERNRHPVTHVSYFDACAYA 165
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ ++ W ++ P G NVWQG+FP N DGY T
Sbjct: 166 RWAGKSLPTEAEWEFAARGGLDRAVYVWVDDREPHGRPGGNVWQGQFPWENLETDGYYGT 225
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--GPT-------------- 296
+PV ++ N FGL++M GNVWEWT D HH P+ + K PT
Sbjct: 226 SPVGRFRPNGFGLHDMAGNVWEWTRD----HHTPSHGESGKNVAPTGACCILRNPIQESA 281
Query: 297 -----TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
T + KGGS+LC+ YC R+R A R +T D+S G++GFRC
Sbjct: 282 AHSDETYARRTVKGGSHLCSPNYCDRYRPAVRQGHTEDASTGHIGFRC 329
>gi|169631382|ref|YP_001705031.1| hypothetical protein MAB_4305 [Mycobacterium abscessus ATCC 19977]
gi|421025292|ref|ZP_15488335.1| hypothetical protein MA3A0731_4709 [Mycobacterium abscessus
3A-0731]
gi|421036856|ref|ZP_15499873.1| hypothetical protein MA3A0930S_4565 [Mycobacterium abscessus
3A-0930-S]
gi|169243349|emb|CAM64377.1| Conserved hypothetical protein [Mycobacterium abscessus]
gi|392208815|gb|EIV34387.1| hypothetical protein MA3A0731_4709 [Mycobacterium abscessus
3A-0731]
gi|392220708|gb|EIV46232.1| hypothetical protein MA3A0930S_4565 [Mycobacterium abscessus
3A-0930-S]
Length = 309
Score = 191 bits (485), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 167/339 (49%), Gaps = 59/339 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 22 ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 78
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 79 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 102
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 103 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 152
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG ++ WG+ + P G AN WQG FP N A G
Sbjct: 153 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 211
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY--NPK-GPTTGTDKVKKG 305
+ T+PV ++ N FGL +M+GNVWEWT+ + H AP +P P +V KG
Sbjct: 212 WHGTSPVGTFPANDFGLLDMIGNVWEWTSTRFQ-HGAGAPKSCCSPSDNPDPSVIQVLKG 270
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
GS+LC +YC+R+R AARS + DS+ ++GFRC + G
Sbjct: 271 GSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 309
>gi|420911955|ref|ZP_15375267.1| hypothetical protein MA6G0125R_3479 [Mycobacterium abscessus
6G-0125-R]
gi|420918414|ref|ZP_15381717.1| hypothetical protein MA6G0125S_4524 [Mycobacterium abscessus
6G-0125-S]
gi|420923577|ref|ZP_15386873.1| hypothetical protein MA6G0728S_4207 [Mycobacterium abscessus
6G-0728-S]
gi|420929238|ref|ZP_15392517.1| hypothetical protein MA6G1108_4447 [Mycobacterium abscessus
6G-1108]
gi|420968928|ref|ZP_15432131.1| hypothetical protein MM3A0810R_4690 [Mycobacterium abscessus
3A-0810-R]
gi|420979576|ref|ZP_15442753.1| hypothetical protein MA6G0212_4506 [Mycobacterium abscessus
6G-0212]
gi|420984961|ref|ZP_15448128.1| hypothetical protein MA6G0728R_4450 [Mycobacterium abscessus
6G-0728-R]
gi|421009763|ref|ZP_15472872.1| hypothetical protein MA3A0119R_4618 [Mycobacterium abscessus
3A-0119-R]
gi|421015133|ref|ZP_15478208.1| hypothetical protein MA3A0122R_4716 [Mycobacterium abscessus
3A-0122-R]
gi|421030713|ref|ZP_15493743.1| hypothetical protein MA3A0930R_4629 [Mycobacterium abscessus
3A-0930-R]
gi|392111305|gb|EIU37075.1| hypothetical protein MA6G0125S_4524 [Mycobacterium abscessus
6G-0125-S]
gi|392113949|gb|EIU39718.1| hypothetical protein MA6G0125R_3479 [Mycobacterium abscessus
6G-0125-R]
gi|392126226|gb|EIU51977.1| hypothetical protein MA6G1108_4447 [Mycobacterium abscessus
6G-1108]
gi|392128230|gb|EIU53980.1| hypothetical protein MA6G0728S_4207 [Mycobacterium abscessus
6G-0728-S]
gi|392163854|gb|EIU89543.1| hypothetical protein MA6G0212_4506 [Mycobacterium abscessus
6G-0212]
gi|392169957|gb|EIU95635.1| hypothetical protein MA6G0728R_4450 [Mycobacterium abscessus
6G-0728-R]
gi|392195369|gb|EIV20988.1| hypothetical protein MA3A0119R_4618 [Mycobacterium abscessus
3A-0119-R]
gi|392198205|gb|EIV23819.1| hypothetical protein MA3A0122R_4716 [Mycobacterium abscessus
3A-0122-R]
gi|392218595|gb|EIV44120.1| hypothetical protein MA3A0930R_4629 [Mycobacterium abscessus
3A-0930-R]
gi|392244584|gb|EIV70062.1| hypothetical protein MM3A0810R_4690 [Mycobacterium abscessus
3A-0810-R]
Length = 297
Score = 191 bits (484), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 167/339 (49%), Gaps = 59/339 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 10 ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 66
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 67 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 90
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 91 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 140
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG ++ WG+ + P G AN WQG FP N A G
Sbjct: 141 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 199
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY--NPK-GPTTGTDKVKKG 305
+ T+PV ++ N FGL +M+GNVWEWT+ + H AP +P P +V KG
Sbjct: 200 WHGTSPVGTFPANDFGLLDMIGNVWEWTSTRFQ-HGAGAPKSCCSPSDNPDPSVIQVLKG 258
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
GS+LC +YC+R+R AARS + DS+ ++GFRC + G
Sbjct: 259 GSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 297
>gi|259047396|ref|ZP_05737797.1| sulfatase-modifying factor 1 [Granulicatella adiacens ATCC 49175]
gi|259035587|gb|EEW36842.1| sulfatase-modifying factor 1 [Granulicatella adiacens ATCC 49175]
Length = 283
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 121/337 (35%), Positives = 164/337 (48%), Gaps = 63/337 (18%)
Query: 14 DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ + L TF+MG T I+D E P + AF + + V+N +F+ FV TGY T
Sbjct: 2 NYIPLKAGTFQMGDTENRGFIEDHESPCVIKEVQAFEVAETTVTNEEFKRFVDETGYQTT 61
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE G DS + H + P +++
Sbjct: 62 AELIG------------------------------DSYVFHLLVSPEKRENYSHVSGSPW 91
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W L P W HP G +S+IE+ ++HPVVHVS DA+AYC
Sbjct: 92 WL---------------------LVPGACWKHPFGPESSIENLLDHPVVHVSLKDALAYC 130
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W GA LPTE +WEY RGG + +PWG L G + AN WQGEFP N+ DG+L T
Sbjct: 131 EWAGAGLPTETQWEYAARGGTSTQ-YPWGEELEQDGIYHANTWQGEFPNQNSELDGFLGT 189
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----SYNPKGP-TTGTDKVKKGGS 307
APV S++ N +GLY M+GNVWEW ++H P N K P + +GGS
Sbjct: 190 APVKSFEPNGYGLYQMIGNVWEWC-----INHRGIPFEDIEANAKAPLNLEVEYAIRGGS 244
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+LC+ YC R+R AAR+ + +S+A +LGFRC G
Sbjct: 245 FLCHCSYCNRYRVAARNGSPWNSTASHLGFRCIRPLG 281
>gi|421020230|ref|ZP_15483286.1| hypothetical protein MA3A0122S_4471 [Mycobacterium abscessus
3A-0122-S]
gi|392205953|gb|EIV31536.1| hypothetical protein MA3A0122S_4471 [Mycobacterium abscessus
3A-0122-S]
Length = 292
Score = 190 bits (483), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 123/339 (36%), Positives = 167/339 (49%), Gaps = 59/339 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 5 ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 61
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 62 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 85
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 86 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 135
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG ++ WG+ + P G AN WQG FP N A G
Sbjct: 136 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 194
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY--NPK-GPTTGTDKVKKG 305
+ T+PV ++ N FGL +M+GNVWEWT+ + H AP +P P +V KG
Sbjct: 195 WHGTSPVGTFPANDFGLLDMIGNVWEWTSTRFQ-HGAGAPKSCCSPSDNPDPSVIQVLKG 253
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
GS+LC +YC+R+R AARS + DS+ ++GFRC + G
Sbjct: 254 GSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 292
>gi|87300701|ref|ZP_01083543.1| hypothetical protein WH5701_04615 [Synechococcus sp. WH 5701]
gi|87284572|gb|EAQ76524.1| hypothetical protein WH5701_04615 [Synechococcus sp. WH 5701]
Length = 307
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 172/347 (49%), Gaps = 66/347 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P +F MG++ E P+ V ++ F++D+ V+N QF++FV ATG+VT AE
Sbjct: 1 MVWIPAGSFLMGSDD---HYPEESPAHRVVVEGFWIDRSPVTNAQFRKFVKATGHVTLAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D + L E L + P IS D +
Sbjct: 58 RPADPAAYPDALPE------------------LLAPSSIVFVPPPGPISTGDPYRW---- 95
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W+Y L+ +W HPEG S+I+ R +HPVVHV+ DA+AY W
Sbjct: 96 ---------WQY-----LQG-----ANWRHPEGPGSSIKSRDHHPVVHVAHEDALAYAAW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LP+EAEWE RGG+E F WG L P G AN +QG+FP +N+ DG+ T+P
Sbjct: 137 AGKALPSEAEWERAARGGVEGAEFAWGEELHPGGRPMANTFQGDFPHHNSLLDGWERTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY----------NPKGPT-------- 296
V S+ N +GL +M+GNVWEWT DW+ H ++ +P+G +
Sbjct: 197 VGSFPPNGYGLLDMIGNVWEWTDDWYAGHGATVAAHKQAGGCCTIESPRGASRQESIDTA 256
Query: 297 ----TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
T KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 257 SQHGTIPRKVVKGGSFLCAPSYCRRYRPAARMAQGIDTSTCHMGFRC 303
>gi|260584577|ref|ZP_05852323.1| sulfatase-modifying factor 1 [Granulicatella elegans ATCC 700633]
gi|260157600|gb|EEW92670.1| sulfatase-modifying factor 1 [Granulicatella elegans ATCC 700633]
Length = 290
Score = 190 bits (483), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 117/336 (34%), Positives = 165/336 (49%), Gaps = 68/336 (20%)
Query: 14 DMVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ V +P F MG + + ++D E P+ + AF + + V+N +F EFV ATGY T
Sbjct: 2 EFVQIPAGKFLMGDSTRQGFVEDYEGPAVEKEVHAFEMAKTPVTNAEFLEFVQATGYQTL 61
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ GD++VF LL EE+ R ++P V
Sbjct: 62 AERLGDSYVFHLLLPEEK-----------------------RENYPHV------------ 86
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
T + W ++ P W P G S IE +HPVVH++ DA+AYC
Sbjct: 87 -------TGSPWW---------KVVPNAHWAQPFGEGSNIEGIEDHPVVHIALEDAIAYC 130
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W +LPTE +WEY R + +PWG +L P +RAN WQG+FP N A DG++ T
Sbjct: 131 RWAKVQLPTEVQWEYAARA-MSISTYPWGEDLVPESGYRANTWQGDFPNVNDALDGFVGT 189
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP-KGPTTGTDKVK-------K 304
APV S++ N FGLY M+GN+WEW + HP P D +K +
Sbjct: 190 APVYSFEPNDFGLYQMIGNIWEWCS-------HPRGIVLPLVEERISLDSIKPSGEFAIR 242
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GGS+LC+ YC R+R AAR+ S+ ++GFRC
Sbjct: 243 GGSFLCHCSYCNRYRVAARNGAFVTSTTSHMGFRCV 278
>gi|424858974|ref|ZP_18282988.1| sulfatase-modifying factor 1 [Rhodococcus opacus PD630]
gi|356661483|gb|EHI41794.1| sulfatase-modifying factor 1 [Rhodococcus opacus PD630]
Length = 312
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 163/346 (47%), Gaps = 63/346 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+M +PG TF MG+ E P VT+D F++D H+V+ +F+ FV ATG+VT
Sbjct: 7 KNMAWIPGSTFWMGSED---FYPEERPVHQVTVDGFWMDTHQVTVAEFRRFVKATGHVTT 63
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE + D ++ G D + + V T
Sbjct: 64 AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 91
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
L W P W HPEG DS + R HPV HVSW DA AY
Sbjct: 92 AGPVPLDDFTRWW---------SFTPGADWRHPEGPDSNVGGRERHPVTHVSWFDARAYA 142
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ + F WG P G NVWQG FP N DG+ T
Sbjct: 143 EWAGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGFAGT 202
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTD---- 300
+PV ++ N + L +M GNVWEWT D++ V H + AP+ + P P T
Sbjct: 203 SPVGHFRPNGYELSDMAGNVWEWTTDYFTVDHSESGKNVAPTSSCCIPTNPRVDTARAEN 262
Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R +AR +T ++S ++GFRC
Sbjct: 263 PDEPYARRVIKGGSHLCAPNYCLRYRPSARQGDTEETSTCHIGFRC 308
>gi|424888891|ref|ZP_18312494.1| hypothetical protein Rleg10DRAFT_2973 [Rhizobium leguminosarum bv.
trifolii WSM2012]
gi|393174440|gb|EJC74484.1| hypothetical protein Rleg10DRAFT_2973 [Rhizobium leguminosarum bv.
trifolii WSM2012]
Length = 324
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 166/347 (47%), Gaps = 65/347 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+D + + G F MG+++ E P V +D F++D V+N QF EFV ATGYVT
Sbjct: 19 RDQIWIDGGGFLMGSDQHY---PEEAPVHPVKVDGFWIDVAPVTNRQFAEFVEATGYVTL 75
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEK + + P E S V KR G
Sbjct: 76 AEKASNPEDY-PGAPPEMLKPGSVVFDPPKRVSG-------------------------- 108
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
+ P ++ +G +W P G S + +++HPVV V++ DA+AY
Sbjct: 109 ---SGPPQWWKFRFGA------------NWRRPYGGLSNLRGKLDHPVVQVAFADAMAYA 153
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
WRG LPTEAEWE+ +GGLE + F WG+ P G N W G FPT N D + T
Sbjct: 154 LWRGKDLPTEAEWEFAAKGGLEGKEFAWGDEFIPDGRFMGNTWHGLFPTENLKPDSFERT 213
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGT--------- 299
+PV S+ N +GLY+M+GNVWEWT+D+W HPAP+ P P G+
Sbjct: 214 SPVGSFPPNGYGLYDMIGNVWEWTSDYWTT-RHPAPAEMSCCIPVNPRAGSAEASRDPGQ 272
Query: 300 ------DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+V KGGS+LC YC R+R AAR +S+ ++GFRC
Sbjct: 273 PDIRIARRVLKGGSHLCAPNYCRRYRPAARHAQEENSATTHIGFRCV 319
>gi|419716090|ref|ZP_14243488.1| hypothetical protein S7W_16633 [Mycobacterium abscessus M94]
gi|382941296|gb|EIC65615.1| hypothetical protein S7W_16633 [Mycobacterium abscessus M94]
Length = 297
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 166/344 (48%), Gaps = 69/344 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 10 ELVELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 66
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 67 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 90
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 91 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 140
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG ++ WG+ + P G AN WQG FP N A G
Sbjct: 141 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 199
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------PSYNPKGPTTGTD 300
+ T+PV ++ N FGL +M+GNVWEWT+ H A PS NP
Sbjct: 200 WHGTSPVGTFPANDFGLLDMIGNVWEWTS---TRFQHGADTPKSCCSPSDNPD---PSVI 253
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+V KGGS+LC +YC+R+R AARS + DS+ ++GFRC + G
Sbjct: 254 QVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 297
>gi|226187115|dbj|BAH35219.1| conserved hypothetical protein [Rhodococcus erythropolis PR4]
Length = 320
Score = 190 bits (482), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 125/346 (36%), Positives = 171/346 (49%), Gaps = 63/346 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+M L+PG T+ MG+ E P VT+D F++D V+ +F+ FV ATG+VT
Sbjct: 12 KNMSLIPGGTYWMGSED---FYPEERPVHQVTVDGFWMDTTAVTVAEFRRFVKATGHVTT 68
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE P ++ A + + F + P +S +D Y
Sbjct: 69 AE-------IAPDAADYPDADPALLVPGSLVF-----------NSPPGPVSLDD---YTQ 107
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W + P W HPEG DS I R HPV HVS+ DA AY
Sbjct: 108 W----------WSF----------TPGADWRHPEGSDSNIGGRERHPVTHVSYFDAQAYA 147
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ + + WG++ +P G NVWQG+FP N DG+ T
Sbjct: 148 AWAGKELPTEAEWEFAARGGLDRQAYVWGDHDSPGGRPGGNVWQGQFPWENLLEDGHERT 207
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTG------ 298
APV ++ N +GL++M GNVWEWTAD + H + AP+ + P+ P
Sbjct: 208 APVGKFRPNGYGLFDMAGNVWEWTADHYTSDHAHSSKNVAPASSCCIPRNPRAEFATEAL 267
Query: 299 -----TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR + +SS ++GFRC
Sbjct: 268 AGEPYARRVIKGGSHLCAPNYCLRYRPAARQGESEESSTCHVGFRC 313
>gi|124022136|ref|YP_001016443.1| hypothetical protein P9303_04261 [Prochlorococcus marinus str. MIT
9303]
gi|123962422|gb|ABM77178.1| Uncharacterized conserved protein [Prochlorococcus marinus str. MIT
9303]
Length = 287
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 120/334 (35%), Positives = 168/334 (50%), Gaps = 61/334 (18%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDG-------EFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
+MV +P +R+G ++ DG E P+R V LD+F +D V+N QF+ FVSA
Sbjct: 4 EMVTIPAGLYRVGCDR--CYPDGSVRCYPEEAPAREVQLDSFQIDVGPVTNAQFRAFVSA 61
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
T ++T +E D ++ P L EER S V F+ +T++ +SW
Sbjct: 62 TQHLTVSELPPDPTLY-PDLPPEERIPESVV------FQPPPATVDRSKP-----LSWWT 109
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
VA WR HP+G +STI+ +HPVVHV++
Sbjct: 110 LVAGADWR-----------------------------HPQGPESTIDGLDDHPVVHVAYA 140
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA+AY W G RLP+ EWE RGGL + + WGN LTP AN+WQG FP +N
Sbjct: 141 DAIAYAHWAGKRLPSAEEWEVAARGGLVDAQYAWGNELTPNNRWMANIWQGPFPWHNEEL 200
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
DG+ T+PV S+ N +GL ++ GNVWEWT S P + KGG
Sbjct: 201 DGWFWTSPVGSFPANGYGLLDVCGNVWEWTN-----------SVYPVASGHQERRTIKGG 249
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
S+LC + YC R+R +A T D++ ++GFRCA
Sbjct: 250 SFLCADNYCVRYRPSALQGQTVDTATCHMGFRCA 283
Score = 44.7 bits (104), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
+ KGGS+LC + YC R+R +A T D++ ++GFRCA
Sbjct: 244 RTIKGGSFLCADNYCVRYRPSALQGQTVDTATCHMGFRCA 283
>gi|89069430|ref|ZP_01156785.1| hypothetical protein OG2516_18250 [Oceanicola granulosus HTCC2516]
gi|89045060|gb|EAR51138.1| hypothetical protein OG2516_18250 [Oceanicola granulosus HTCC2516]
Length = 314
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 124/320 (38%), Positives = 155/320 (48%), Gaps = 53/320 (16%)
Query: 25 MGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEP 84
+GT +P + +DGE P R V L F D V+N +F FV+ATGYVT A
Sbjct: 45 VGTRRPEIPEDGEGPVRPVRLRPFRCDSFPVTNARFAAFVAATGYVTVA----------- 93
Query: 85 LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEW 144
ER QV F GL + E W W
Sbjct: 94 -----ERFGWGQV------FAGLMNPAEAAAARAFPGTPW-------------------W 123
Query: 145 EYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAE 204
R P W PEG S I R +HP VHV W+DA A+ TW G RLP+EAE
Sbjct: 124 ----------RAVPGACWHAPEGPGSGITGRASHPAVHVGWDDARAFATWAGGRLPSEAE 173
Query: 205 WEYGCRGGLENRLFPWGN-NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
WE+ RGGL+ FPWG+ G N+WQG FP ++TAADG+ T+PV +Y N
Sbjct: 174 WEHAARGGLDQARFPWGDREPDDTGFLPCNIWQGRFPDHDTAADGHAGTSPVGAYPANGA 233
Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAA 322
GL++M GNVWEWTAD + V + + + +V KGGS+LC+ YCYR+R AA
Sbjct: 234 GLHDMAGNVWEWTADPFRVRSAKSAAAARNAHARQENQRVTKGGSFLCHISYCYRYRIAA 293
Query: 323 RSQNTPDSSAGNLGFRCAAD 342
RS DS A N G R D
Sbjct: 294 RSGTAADSGASNTGMRVFYD 313
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 27/42 (64%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
+V KGGS+LC+ YCYR+R AARS DS A N G R D
Sbjct: 272 RVTKGGSFLCHISYCYRYRIAARSGTAADSGASNTGMRVFYD 313
>gi|404447445|ref|ZP_11012507.1| hypothetical protein MVAC_29103 [Mycobacterium vaccae ATCC 25954]
gi|403648882|gb|EJZ04367.1| hypothetical protein MVAC_29103 [Mycobacterium vaccae ATCC 25954]
Length = 295
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 119/334 (35%), Positives = 166/334 (49%), Gaps = 56/334 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMV +P T +G+++ E P+R VT+D F++ +H+V+N QF EFVSAT Y+T A
Sbjct: 7 DMVWIPAQTAVVGSDE---HYPEEGPAREVTVDGFWMQRHQVTNAQFAEFVSATDYLTVA 63
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV--VHISWNDAVAYC 131
E+ D ++ + + S + HR PV H+S +
Sbjct: 64 ERPLDASLYPGAPPQNLQPG---------------SMVFHRTTGPVDLRHVS-----QWW 103
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
TW P W HP G S++ +R HPVVH++++DA Y
Sbjct: 104 TWT-----------------------PGACWKHPRGPRSSLRNREAHPVVHIAFDDAATY 140
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWE RGGL + + WG+ P+G AN W GEFP GY
Sbjct: 141 AQWAGYELPTEAEWEVAARGGLAHARYTWGDEPEPQGRPLANYWHGEFP--YLPDTGYGQ 198
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKG 305
+ PV S+ N +GL++ GNVWEWT+DW+ P +Y+P P KV KG
Sbjct: 199 STPVGSFPPNGYGLHDTAGNVWEWTSDWYGSTRDIQPCCAAETYDPSQPQFKVPRKVIKG 258
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
GS+LC + YC R+R AAR D+ ++GFRC
Sbjct: 259 GSFLCADTYCMRYRPAARRPQQVDTGMSHIGFRC 292
>gi|424881460|ref|ZP_18305092.1| hypothetical protein Rleg8DRAFT_3034 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392517823|gb|EIW42555.1| hypothetical protein Rleg8DRAFT_3034 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 317
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 124/347 (35%), Positives = 173/347 (49%), Gaps = 65/347 (18%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
++ +V + G TF MG+N+ E P+ V +D F++D V+N QF FV ATG+VT
Sbjct: 13 FQGLVWVAGRTFTMGSNEHY---PEEAPAHPVAVDGFWIDATPVTNRQFASFVKATGHVT 69
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AE+ + L + RA S V K G D T W
Sbjct: 70 VAERAPRAADYPGALPKMLRAG-SLVFTPPKAVVGPDIT------------QW------- 109
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W +++G W HP G + I +++HPVVHV+++DA+AY
Sbjct: 110 -W---------SFKFGA------------DWRHPYGGRTDIRGKLDHPVVHVAYSDAMAY 147
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W LPTEAEWE GGL++ F WG+ L+P G+ AN WQG FPT +T G
Sbjct: 148 AHWAEKDLPTEAEWELAACGGLDDAEFAWGDELSPEGKLMANTWQGTFPTLSTKPAGADR 207
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-------YNPKGPTTGTD---- 300
T+PV S+ N +G+Y+M+GNVWEWT+D+W+ HP P+ NP+G
Sbjct: 208 TSPVGSFPSNGYGIYDMIGNVWEWTSDFWST-RHPEPASRSCCIPSNPRGGDADASYDLR 266
Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 267 LPDIRIPRRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 313
>gi|418049943|ref|ZP_12688030.1| Sulphatase-modifying factor protein [Mycobacterium rhodesiae JS60]
gi|353190848|gb|EHB56358.1| Sulphatase-modifying factor protein [Mycobacterium rhodesiae JS60]
Length = 290
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 119/331 (35%), Positives = 167/331 (50%), Gaps = 49/331 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D+V++PG +FRMG+ E P T+ +F +++H V+N QF +FV+ATGYVT A
Sbjct: 4 DLVVVPGGSFRMGSTS---FYPEEAPIHTATVGSFAIERHPVTNAQFADFVAATGYVTIA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ P ++E + + V F + R W
Sbjct: 61 ERSLDPALY-PGVAEADLQPGALV------FRPTPGPVNLR-----------------DW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P SW P G S I + +HPVV V++ DA AY
Sbjct: 97 R-------QWWDWA----------PGASWRTPFGPGSDIADKADHPVVQVAYPDANAYAA 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W RLPTEAEWEY R G + WG+ T G AN WQG FP N A G++ T+
Sbjct: 140 WADRRLPTEAEWEYAARAG-STGPYAWGDEPTVGGALMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----PKGPTTGTDKVKKGGSYL 309
PV ++ N +GL +M+GNVWEWT ++ HH + P P + KGGS+L
Sbjct: 199 PVGTFPPNGYGLVDMIGNVWEWTTTRYSAHHRLDQQNSCCPPPTEPDPAVSQTLKGGSHL 258
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
C +YC+R+R AARS + DSS ++GFRCA
Sbjct: 259 CAPEYCHRYRPAARSPQSQDSSTTHIGFRCA 289
>gi|226363620|ref|YP_002781402.1| hypothetical protein ROP_42100 [Rhodococcus opacus B4]
gi|226242109|dbj|BAH52457.1| hypothetical protein [Rhodococcus opacus B4]
Length = 312
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 162/346 (46%), Gaps = 63/346 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+M +PG TF MG+ E P VT+D F++D H+V+ +F+ FV TG+VT
Sbjct: 7 KNMAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDSHQVTVAEFRRFVKDTGHVTT 63
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE + D ++ G D + + V T
Sbjct: 64 AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 91
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
L W P W HPEG S + R HPV HVSW DA AY
Sbjct: 92 PGPVPLDDFTRWW---------SFTPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAYA 142
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ + F WG+ P G NVWQG+FP N DG+ T
Sbjct: 143 EWAGKDLPTEAEWEFAARGGLDRKPFVWGDEHEPGGRPGGNVWQGQFPWENLLEDGFAGT 202
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-------PAPS----YNPKGPTTGTD- 300
+PV ++ N +GL +M GNVWEWT D++ H PA S NP+ T
Sbjct: 203 SPVGHFRSNGYGLGDMAGNVWEWTTDYFTADHSAGGKNVAPASSCCIPANPRIDTAAAPD 262
Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR +T ++S ++GFRC
Sbjct: 263 PHEPYARRVIKGGSHLCAPNYCLRYRPAARQGDTEETSTCHIGFRC 308
>gi|409996049|ref|YP_006750450.1| sulfatase-modifying factor 1 [Lactobacillus casei W56]
gi|418009877|ref|ZP_12649664.1| sulfatase modifying factor [Lactobacillus casei Lc-10]
gi|406357061|emb|CCK21331.1| Sulfatase-modifying factor 1 [Lactobacillus casei W56]
gi|410555004|gb|EKQ28969.1| sulfatase modifying factor [Lactobacillus casei Lc-10]
Length = 287
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/329 (37%), Positives = 164/329 (49%), Gaps = 63/329 (19%)
Query: 18 LPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
+ G +F+MGTN I +D E P+ V + +F + V+N F +F++AT Y T AE+
Sbjct: 9 IKGGSFQMGTNDRIGFGEDYEGPTTIVHVPSFSMADTPVTNADFDDFIAATAYQTVAERL 68
Query: 77 GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
G +FVFE L+ EEER T +H VA W
Sbjct: 69 GSSFVFELLIPEEERV-----------------TYQH--------------VAGAPW--- 94
Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
W L P W HP G +S+ NHPVVHV+ DA+AYC W
Sbjct: 95 -------W----------LLVPGADWQHPYGAESSNIDLDNHPVVHVALEDALAYCQWSH 137
Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
++LPTEA+WEY G +PWG +L AN WQG+FP +NTA DG++ TAPV
Sbjct: 138 SQLPTEAQWEYAAGAGTATT-YPWGESLVDEHGFHANTWQGDFPNDNTAEDGFVGTAPVK 196
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP------KGPTTGTDKVKKGGSYLC 310
SY+ N GLY M+GNVWEW N + +N K P G + +GGS+LC
Sbjct: 197 SYEPNSNGLYQMIGNVWEWCR---NPRYTLLDDFNAEQFKLGKVPAAGEYAI-RGGSFLC 252
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+ YC R+R AAR+ S++ +L FRC
Sbjct: 253 HCSYCNRYRTAARNGVDLQSTSSHLSFRC 281
>gi|330826800|ref|YP_004390103.1| sulfatase-modifying factor protein [Alicycliphilus denitrificans
K601]
gi|329312172|gb|AEB86587.1| Sulphatase-modifying factor protein [Alicycliphilus denitrificans
K601]
Length = 375
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/347 (34%), Positives = 159/347 (45%), Gaps = 56/347 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MG + + E P V + F++D+ V+N QF++FV ATGYVT AE
Sbjct: 60 MVWVPAGRFVMGNTSSKALPN-EKPGHGVRVAGFWMDRTHVTNRQFEQFVRATGYVTTAE 118
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ D + L R + F G + Y W
Sbjct: 119 RVPDWPSLQAQLPPGTPRPPAGAMVPGALVFAGTSQPV--------------SLADYMRW 164
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y P SW HP+G S + + +HPVV VS+ DA AY
Sbjct: 165 ----------WRY----------MPGASWRHPQGPQSDLRGKEDHPVVQVSYEDAQAYAR 204
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE---FPTNNTAADGYL 250
W G RLPTEAEWEY RGGL+ + F WG+ L P G A W + FP +A
Sbjct: 205 WAGKRLPTEAEWEYAARGGLDQQRFAWGHLLEPEGRPMARTWSAQPQVFPV--ASAKVMP 262
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNV-----HHHPAPSYNPKGPTTGTD----- 300
TAPV SY N +GLY+M GN W+W ADW+ NP+GP D
Sbjct: 263 GTAPVASYAPNGYGLYDMAGNAWQWVADWYRYDAFARQARKGDVANPRGPGESFDPEGLR 322
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+V +GGS+LC+E+YC +R +AR P S++ N+GFR D
Sbjct: 323 ADAPKRVIRGGSFLCSEEYCEGYRVSARQGQDPYSASANVGFRLVLD 369
>gi|191637195|ref|YP_001986361.1| Zgc:136465 protein (Fragment), partial [Lactobacillus casei BL23]
gi|385818907|ref|YP_005855294.1| Sulfatase-like protein modifying factor 1 [Lactobacillus casei
LC2W]
gi|385822077|ref|YP_005858419.1| Sulfatase-like protein modifying factor 1 [Lactobacillus casei
BD-II]
gi|190711497|emb|CAQ65503.1| Zgc:136465 protein (Fragment) [Lactobacillus casei BL23]
gi|327381234|gb|AEA52710.1| Sulfatase-like protein modifying factor 1 [Lactobacillus casei
LC2W]
gi|327384404|gb|AEA55878.1| Sulfatase-like protein modifying factor 1 [Lactobacillus casei
BD-II]
Length = 284
Score = 189 bits (481), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/329 (37%), Positives = 164/329 (49%), Gaps = 63/329 (19%)
Query: 18 LPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
+ G +F+MGTN I +D E P+ V + +F + V+N F +F++AT Y T AE+
Sbjct: 6 IKGGSFQMGTNDRIGFGEDYEGPTTIVHVPSFSMADTPVTNADFDDFIAATAYQTVAERL 65
Query: 77 GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
G +FVFE L+ EEER T +H VA W
Sbjct: 66 GSSFVFELLIPEEERV-----------------TYQH--------------VAGAPW--- 91
Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
W L P W HP G +S+ NHPVVHV+ DA+AYC W
Sbjct: 92 -------W----------LLVPGADWQHPYGAESSNIDLDNHPVVHVALEDALAYCQWSH 134
Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
++LPTEA+WEY G +PWG +L AN WQG+FP +NTA DG++ TAPV
Sbjct: 135 SQLPTEAQWEYAAGAGTATT-YPWGESLVDEHGFHANTWQGDFPNDNTAEDGFVGTAPVK 193
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP------KGPTTGTDKVKKGGSYLC 310
SY+ N GLY M+GNVWEW N + +N K P G + +GGS+LC
Sbjct: 194 SYEPNSNGLYQMIGNVWEWCR---NPRYTLLDDFNAEQFKLGKVPAAGEYAI-RGGSFLC 249
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+ YC R+R AAR+ S++ +L FRC
Sbjct: 250 HCSYCNRYRTAARNGVDLQSTSSHLSFRC 278
>gi|399061236|ref|ZP_10746002.1| hypothetical protein PMI02_02295 [Novosphingobium sp. AP12]
gi|398036048|gb|EJL29271.1| hypothetical protein PMI02_02295 [Novosphingobium sp. AP12]
Length = 325
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 133/358 (37%), Positives = 169/358 (47%), Gaps = 72/358 (20%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
A P E DMV++P TF MG+ E P R V++DAF +D+ V+N QF FV
Sbjct: 11 AKPAE-MDDMVVIPAGTFTMGSET---FYPEEAPLRRVSVDAFRMDRTPVTNRQFAAFVE 66
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
ATG+ T AE D + +L+ +RA S + + PV
Sbjct: 67 ATGHTTVAEVAPDPKDYPGMLAGMDRAG---------------SLVFGKTTAPV------ 105
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS-WLHPEGIDSTIEHR--MNHPVVH 182
D W W F +G+ W P G+ S IE HPVV
Sbjct: 106 DTTNPGNW----------WN-----------FTFGAYWRQPLGVGSDIESLGLWEHPVVQ 144
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
V++ DA AY W G LPTEAE+EY RGGLE R + WG+ L P G AN WQG FP
Sbjct: 145 VAYADAEAYAKWAGKDLPTEAEFEYAARGGLEGRDYAWGDELAPGGVMMANYWQGLFPFA 204
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS---------YNPK 293
N DG+ T+PV +Y N +GL +M+GN WEWT DWW+ PA S NP+
Sbjct: 205 NQMLDGWERTSPVGTYLANGYGLVDMIGNTWEWTRDWWS--EKPAISKPSGGCCALSNPR 262
Query: 294 GPTTGTD------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
G KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 263 GAKLKDSFDPAQPGVRIGRKVIKGGSHLCAANYCQRYRPAARHPEMIDTSTTHIGFRC 320
>gi|319764583|ref|YP_004128520.1| hypothetical protein Alide_3927 [Alicycliphilus denitrificans BC]
gi|317119144|gb|ADV01633.1| protein of unknown function DUF323 [Alicycliphilus denitrificans
BC]
Length = 375
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/347 (34%), Positives = 159/347 (45%), Gaps = 56/347 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MG + + E P V + F++D+ V+N QF++FV ATGYVT AE
Sbjct: 60 MVWVPAGRFVMGNTSSKALPN-EKPGHGVRVAGFWMDRTHVTNRQFEQFVRATGYVTTAE 118
Query: 75 KFGDTFVFEPLLSE-EERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ D + L R + F G + Y W
Sbjct: 119 RVPDWPSLQAQLPPGTPRPPAGAMVPGALVFAGTSQPV--------------SLADYMRW 164
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W Y P SW HP+G S + + +HPVV VS+ DA AY
Sbjct: 165 ----------WRY----------MPGASWRHPQGPQSDLRGKEDHPVVQVSYEDAQAYAR 204
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE---FPTNNTAADGYL 250
W G RLPTEAEWEY RGGL+ + F WG+ L P G A W + FP +A
Sbjct: 205 WAGKRLPTEAEWEYAARGGLDQQRFAWGHLLEPEGRPMARTWSAQPQVFPV--ASAKVMP 262
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNV-----HHHPAPSYNPKGPTTGTD----- 300
TAPV SY N +GLY+M GN W+W ADW+ NP+GP D
Sbjct: 263 GTAPVASYAPNGYGLYDMAGNAWQWVADWYRYDAFARQARKGVVANPRGPGESFDPEGLR 322
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+V +GGS+LC+E+YC +R +AR P S++ N+GFR D
Sbjct: 323 ADAPKRVIRGGSFLCSEEYCEGYRVSARQGQDPYSASANVGFRLVLD 369
>gi|414580726|ref|ZP_11437866.1| hypothetical protein MA5S1215_3767 [Mycobacterium abscessus
5S-1215]
gi|420885390|ref|ZP_15348750.1| hypothetical protein MA5S0421_4067 [Mycobacterium abscessus
5S-0421]
gi|420892281|ref|ZP_15355628.1| hypothetical protein MA5S0422_4986 [Mycobacterium abscessus
5S-0422]
gi|420896743|ref|ZP_15360082.1| hypothetical protein MA5S0708_3740 [Mycobacterium abscessus
5S-0708]
gi|420905749|ref|ZP_15369067.1| hypothetical protein MA5S1212_3495 [Mycobacterium abscessus
5S-1212]
gi|392079541|gb|EIU05368.1| hypothetical protein MA5S0422_4986 [Mycobacterium abscessus
5S-0422]
gi|392081153|gb|EIU06979.1| hypothetical protein MA5S0421_4067 [Mycobacterium abscessus
5S-0421]
gi|392096055|gb|EIU21850.1| hypothetical protein MA5S0708_3740 [Mycobacterium abscessus
5S-0708]
gi|392103653|gb|EIU29439.1| hypothetical protein MA5S1212_3495 [Mycobacterium abscessus
5S-1212]
gi|392115878|gb|EIU41646.1| hypothetical protein MA5S1215_3767 [Mycobacterium abscessus
5S-1215]
Length = 340
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 165/340 (48%), Gaps = 71/340 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R ++ +F +++H V+N QF EFV+ TGYVT A
Sbjct: 22 ELVELPGGSFPMGCN---VFYPEEMPERVSSVGSFAIERHPVTNAQFAEFVTQTGYVTVA 78
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 79 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 102
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 103 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 152
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG + ++ WG+ + P G AN WQG FP N A G
Sbjct: 153 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 211
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
+ T+PV ++ N FGL +M+GNVWEWT + P +PS NP
Sbjct: 212 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 264
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC +YC+R+R AARS + DS+ ++GFRC
Sbjct: 265 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 304
>gi|116073277|ref|ZP_01470539.1| hypothetical protein RS9916_32542 [Synechococcus sp. RS9916]
gi|116068582|gb|EAU74334.1| hypothetical protein RS9916_32542 [Synechococcus sp. RS9916]
Length = 282
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/331 (36%), Positives = 166/331 (50%), Gaps = 55/331 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ MV +P + +G+ E P R++ + AF LD V+N F FV+ TGYVT
Sbjct: 6 QGMVQIPAGEYCLGSES---FYPEESPVRSIQVRAFSLDVAPVTNADFARFVADTGYVTV 62
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
+EK D ++ P L+ EE+ S V F T++ +SW V
Sbjct: 63 SEKPPDPELY-PNLAPEEQCPESAV------FIPPPPTVDRSQP-----LSWWALVEGAD 110
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR HP+G + I+ +HPVVHV+++DA+AY
Sbjct: 111 WR-----------------------------HPQGPSTQIDDLPDHPVVHVAYDDALAYA 141
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPT EWE RGGL + + WG+ +TP G ANVWQG FP N DG+ T
Sbjct: 142 DWAGKRLPTADEWEVAARGGLVGQDYAWGSEMTPGGRWLANVWQGPFPWRNEQTDGWFWT 201
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
+PV S+ N +GL +M GNVWEWT+ + V PKG ++ KGGS+LC E
Sbjct: 202 SPVGSFPANGYGLVDMCGNVWEWTSTLFPV---------PKG--EQERRIIKGGSFLCAE 250
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
YC+R R AA T D++ ++GFRCA+D
Sbjct: 251 NYCHRFRPAALMGQTTDTATCHMGFRCASDS 281
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 21/42 (50%), Positives = 30/42 (71%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
++ KGGS+LC E YC+R R AA T D++ ++GFRCA+D
Sbjct: 239 RIIKGGSFLCAENYCHRFRPAALMGQTTDTATCHMGFRCASD 280
>gi|420880418|ref|ZP_15343785.1| hypothetical protein MA5S0304_3812 [Mycobacterium abscessus
5S-0304]
gi|420900706|ref|ZP_15364037.1| hypothetical protein MA5S0817_3360 [Mycobacterium abscessus
5S-0817]
gi|420974532|ref|ZP_15437723.1| hypothetical protein MA5S0921_4771 [Mycobacterium abscessus
5S-0921]
gi|392085327|gb|EIU11152.1| hypothetical protein MA5S0304_3812 [Mycobacterium abscessus
5S-0304]
gi|392098067|gb|EIU23861.1| hypothetical protein MA5S0817_3360 [Mycobacterium abscessus
5S-0817]
gi|392162415|gb|EIU88105.1| hypothetical protein MA5S0921_4771 [Mycobacterium abscessus
5S-0921]
Length = 323
Score = 189 bits (480), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 165/340 (48%), Gaps = 71/340 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R ++ +F +++H V+N QF EFV+ TGYVT A
Sbjct: 5 ELVELPGGSFPMGCN---VFYPEEMPERVSSVGSFAIERHPVTNAQFAEFVTQTGYVTVA 61
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 62 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 85
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 86 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 135
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG + ++ WG+ + P G AN WQG FP N A G
Sbjct: 136 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 194
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
+ T+PV ++ N FGL +M+GNVWEWT + P +PS NP
Sbjct: 195 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 247
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC +YC+R+R AARS + DS+ ++GFRC
Sbjct: 248 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 287
>gi|187919798|ref|YP_001888829.1| hypothetical protein Bphyt_5096 [Burkholderia phytofirmans PsJN]
gi|187718236|gb|ACD19459.1| protein of unknown function DUF323 [Burkholderia phytofirmans PsJN]
Length = 375
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 128/357 (35%), Positives = 174/357 (48%), Gaps = 63/357 (17%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
KDMV + G F MG+ P + + E P+ V + +++D+++V+N QF FV+ATGYVT
Sbjct: 47 KDMVWIAGGNFMMGSEHPRALPN-ERPAHEVKVHGYWIDRYDVTNMQFARFVAATGYVTT 105
Query: 73 AEK---FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
AE+ + D V P R + F G D+ I + ND
Sbjct: 106 AERKPRWEDLAVQLP--PGTPRPADDTLVPGGLVFTGSDAPI-----------ALND--- 149
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
Y W W++ P +W HP G S+I +HPVV VS+ DA+
Sbjct: 150 YSRW----------WKF----------VPGANWRHPGGPGSSIAGMDHHPVVQVSYEDAL 189
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW-QGEFPTNNTAADG 248
AY W RLPTEAEWEY RGGLE + WGN P G+ AN W P TA G
Sbjct: 190 AYARWAHKRLPTEAEWEYAARGGLEQADYAWGNEFAPAGKKMANTWDDAARPFPVTADRG 249
Query: 249 -----YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH---------------HHPAP 288
+ T+ V S+ N +GLY+M GNVW+W ADW+ H P
Sbjct: 250 SHEKVQVGTSVVGSFAPNGYGLYDMAGNVWQWVADWYRADAFRVEAASGHVVSDPHGPVS 309
Query: 289 SYNP-KGPTTGT-DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
SY+P GPT ++V +GGS+LC+E YC +R +AR P + +LGFR A D+
Sbjct: 310 SYDPGNGPTAAAPERVTRGGSFLCSETYCISYRTSARRGTDPMNGMSHLGFRLAMDE 366
>gi|418422471|ref|ZP_12995644.1| hypothetical protein MBOL_41900 [Mycobacterium abscessus subsp.
bolletii BD]
gi|363996387|gb|EHM17604.1| hypothetical protein MBOL_41900 [Mycobacterium abscessus subsp.
bolletii BD]
Length = 309
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 121/340 (35%), Positives = 167/340 (49%), Gaps = 61/340 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R ++ F +++H V+N QF EFV TGYVT A
Sbjct: 22 ELVELPGGSFPMGCN---VFYPEEMPERVSSVGPFAIERHPVTNAQFAEFVRQTGYVTVA 78
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ A S + F ++ +H + W
Sbjct: 79 ERAPDPDMYP-------GASPSDLVPGAMTFRATAGPVD--LHD------------WQQW 117
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W++ P W HP G S I+ HPVV V++ DA AY
Sbjct: 118 ----------WDW----------VPGAYWRHPFGPRSDIDGAAEHPVVQVAYTDAAAYAR 157
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + ++ WG+ + P G AN WQG FP N A G+ T+
Sbjct: 158 WAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALGWHGTS 216
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGTDKVKK 304
PV ++ N FGL +M+GNVWEWT + P +PS NP +V K
Sbjct: 217 PVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGAPKSCCSPSDNPD---PSVIQVLK 269
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
GGS+LC +YC+R+R AARS + DS+ ++GFRC + G
Sbjct: 270 GGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 309
>gi|33863783|ref|NP_895343.1| hypothetical protein PMT1516 [Prochlorococcus marinus str. MIT
9313]
gi|33635366|emb|CAE21691.1| conserved hypothetical protein [Prochlorococcus marinus str. MIT
9313]
Length = 291
Score = 189 bits (479), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/334 (35%), Positives = 169/334 (50%), Gaps = 61/334 (18%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDG-------EFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
+MV +P +R+G ++ DG E P+R V LD+F +D V+N QF+ FVSA
Sbjct: 8 EMVTIPAGLYRVGCDR--CYPDGSVRCYPEETPAREVQLDSFQIDVGPVTNAQFRAFVSA 65
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWND 126
T ++T +E D ++ P L+ EER S V F+ +T++ +SW
Sbjct: 66 TQHLTVSELPPDPTLY-PDLAPEERIPESVV------FQPPPATVDRSKP-----LSWWT 113
Query: 127 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
+A WR HP+G +STI+ +HPVVHV++
Sbjct: 114 LMAGADWR-----------------------------HPQGPESTIDGLDDHPVVHVAYA 144
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA+AY W G RLP+ EWE RGGL + + WGN LTP AN+WQG FP +N
Sbjct: 145 DAIAYAHWAGKRLPSAEEWEVAARGGLVDAQYAWGNELTPNNRWMANIWQGPFPWHNEEL 204
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
DG+ T+PV S+ N +GL ++ GNVWEWT S P + KGG
Sbjct: 205 DGWFWTSPVGSFPANGYGLLDVCGNVWEWTN-----------SVYPVASGHQERRTIKGG 253
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
S+LC + YC R+R +A T D++ ++GFRCA
Sbjct: 254 SFLCADNYCVRYRPSALQGQTVDTATCHMGFRCA 287
Score = 44.7 bits (104), Expect = 0.080, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 27/40 (67%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 390
+ KGGS+LC + YC R+R +A T D++ ++GFRCA
Sbjct: 248 RTIKGGSFLCADNYCVRYRPSALQGQTVDTATCHMGFRCA 287
>gi|420865847|ref|ZP_15329236.1| hypothetical protein MA4S0303_4216 [Mycobacterium abscessus
4S-0303]
gi|420870642|ref|ZP_15334024.1| hypothetical protein MA4S0726RA_4155 [Mycobacterium abscessus
4S-0726-RA]
gi|420989300|ref|ZP_15452456.1| hypothetical protein MA4S0206_4229 [Mycobacterium abscessus
4S-0206]
gi|421041520|ref|ZP_15504528.1| hypothetical protein MA4S0116R_4184 [Mycobacterium abscessus
4S-0116-R]
gi|392064563|gb|EIT90412.1| hypothetical protein MA4S0303_4216 [Mycobacterium abscessus
4S-0303]
gi|392070112|gb|EIT95959.1| hypothetical protein MA4S0726RA_4155 [Mycobacterium abscessus
4S-0726-RA]
gi|392183579|gb|EIV09230.1| hypothetical protein MA4S0206_4229 [Mycobacterium abscessus
4S-0206]
gi|392222448|gb|EIV47971.1| hypothetical protein MA4S0116R_4184 [Mycobacterium abscessus
4S-0116-R]
Length = 309
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 123/344 (35%), Positives = 165/344 (47%), Gaps = 69/344 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++ LPG +F MG N + E P R +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 22 ELAELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 78
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 79 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 102
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 103 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 152
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG ++ WG+ + P G AN WQG FP N A G
Sbjct: 153 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 211
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------PSYNPKGPTTGTD 300
+ T+PV ++ N FGL +M+GNVWEWT+ H A PS NP
Sbjct: 212 WHGTSPVGTFPANDFGLLDMIGNVWEWTS---TRFQHGADTPKSCCSPSDNPD---PSVI 265
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+V KGGS+LC +YC+R+R AARS + DS+ ++GFRC + G
Sbjct: 266 QVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 309
>gi|315445967|ref|YP_004078846.1| hypothetical protein Mspyr1_44560 [Mycobacterium gilvum Spyr1]
gi|315264270|gb|ADU01012.1| uncharacterized conserved protein [Mycobacterium gilvum Spyr1]
Length = 294
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 121/333 (36%), Positives = 167/333 (50%), Gaps = 50/333 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D+V L G +FRMG+ + E P+ VT+ F +++H V+N QF EFV TGY T A
Sbjct: 4 DLVDLEGGSFRMGSTR---FYPEEAPAHTVTVAPFAIERHPVTNAQFAEFVDDTGYRTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ P S + + V F ++ R W
Sbjct: 61 ERPPDPALY-PGASPHDLVPGALV------FRPTAGPVDLR-----------------DW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R WE+ P W HP G DS+++ R HPVV V ++DA AY
Sbjct: 97 R-------QWWEWT----------PGADWRHPFGPDSSVDDRPEHPVVQVCYSDAAAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ G ++ WG+ P G+ AN WQG FP N A G+ T+
Sbjct: 140 WAGRRLPTEAEWEFAAGAG-STAVYAWGDEPAPGGQLMANTWQGAFPYRNDGALGWAGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVKKGGSY 308
PV ++ N FGL +M+GNVWEWT + HH P P P ++ KGGS+
Sbjct: 199 PVGTFPANGFGLVDMIGNVWEWTTTRFAGHHRVNPPPPPTCCPPADPDPAVNQALKGGSH 258
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
LC +YC+R+R +ARS + DS+ ++GFRCAA
Sbjct: 259 LCAPEYCHRYRPSARSPQSQDSATTHIGFRCAA 291
>gi|424775855|ref|ZP_18202844.1| hypothetical protein C660_03890 [Alcaligenes sp. HPC1271]
gi|422888827|gb|EKU31210.1| hypothetical protein C660_03890 [Alcaligenes sp. HPC1271]
Length = 369
Score = 188 bits (478), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 129/390 (33%), Positives = 173/390 (44%), Gaps = 90/390 (23%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+ +V +P F MG+++ E P V + AF +DQHEV+N QF +FV+ATG
Sbjct: 60 IANQAGLVPIPSGRFLMGSDEGY---PEEKPVHPVEVQAFQIDQHEVTNAQFAQFVAATG 116
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQV------RHDMKRFEGLDSTIEHRMHHPVVHI 122
YVT+AE+ + F + E+ R S V HD EG H H P +
Sbjct: 117 YVTQAERVPE-FPPHVQVPEQYRQPGSAVFTPPAATHDHDHHEG------HNQHAPSGNY 169
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
+W W + P +W HP G D+ + R NHPVVH
Sbjct: 170 NW-------------------WVW----------VPGANWRHPGGPDTNLNGRDNHPVVH 200
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
+++ DA+AY W G LPTE +WE+ RGGLE +PWGN RG AN WQG FP
Sbjct: 201 IAYEDALAYARWLGRDLPTEEQWEFAARGGLEGATYPWGNTPEVRGRLMANTWQGSFPAQ 260
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKV 302
N DGY TAPV + N +GL++ VGNVWEWT
Sbjct: 261 NLLRDGYEGTAPVGCFPANNYGLWDSVGNVWEWT-------------------------- 294
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNE 362
R A + Q+ P + L A+ P TG V KGGS+LC
Sbjct: 295 ----------------RSAWQPQHRPLAPKPLLVAATDANGNPATG---VIKGGSFLCAA 335
Query: 363 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
+C R+R A+R S ++GFR +
Sbjct: 336 NFCVRYRPASRQPQETTMSTQHVGFRTVLN 365
>gi|420875085|ref|ZP_15338461.1| hypothetical protein MA4S0726RB_3743 [Mycobacterium abscessus
4S-0726-RB]
gi|421045440|ref|ZP_15508440.1| hypothetical protein MA4S0116S_3292 [Mycobacterium abscessus
4S-0116-S]
gi|392066560|gb|EIT92408.1| hypothetical protein MA4S0726RB_3743 [Mycobacterium abscessus
4S-0726-RB]
gi|392234893|gb|EIV60391.1| hypothetical protein MA4S0116S_3292 [Mycobacterium abscessus
4S-0116-S]
Length = 297
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 123/344 (35%), Positives = 165/344 (47%), Gaps = 69/344 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++ LPG +F MG N + E P R +++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 10 ELAELPGGSFPMGCN---VFYPEEMPERVSSVEPFAIERHPVTNAQFAEFVTQTGYVTVA 66
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 67 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 90
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 91 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 140
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG ++ WG+ + P G AN WQG FP N A G
Sbjct: 141 AAYARWAGRRLPTEAEWEYAARGG-SGTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 199
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------PSYNPKGPTTGTD 300
+ T+PV ++ N FGL +M+GNVWEWT+ H A PS NP
Sbjct: 200 WHGTSPVGTFPANDFGLLDMIGNVWEWTS---TRFQHGADTPKSCCSPSDNPD---PSVI 253
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+V KGGS+LC +YC+R+R AARS + DS+ ++GFRC + G
Sbjct: 254 QVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRCVENLG 297
>gi|365872316|ref|ZP_09411854.1| hypothetical protein MMAS_42560 [Mycobacterium massiliense CCUG
48898 = JCM 15300]
gi|421051427|ref|ZP_15514421.1| hypothetical protein MMCCUG48898_4440 [Mycobacterium massiliense
CCUG 48898 = JCM 15300]
gi|363993461|gb|EHM14684.1| hypothetical protein MMAS_42560 [Mycobacterium massiliense CCUG
48898 = JCM 15300]
gi|392240030|gb|EIV65523.1| hypothetical protein MMCCUG48898_4440 [Mycobacterium massiliense
CCUG 48898]
Length = 292
Score = 188 bits (478), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 164/340 (48%), Gaps = 71/340 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +F MG N + E P R ++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 5 ELVELPGGSFPMGCN---VFYPEEMPERVSSVGPFAIERHPVTNAQFAEFVTQTGYVTVA 61
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 62 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 85
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W HP G S I+ HPVV V++ DA
Sbjct: 86 RTTAGPVDLHDWQQWWDW----------VPGAYWRHPFGPGSDIDGAAEHPVVQVAYTDA 135
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG + ++ WG+ + P G AN WQG FP N A G
Sbjct: 136 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 194
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
+ T+PV ++ N FGL +M+GNVWEWT + P +PS NP
Sbjct: 195 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 247
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC +YC+R+R AARS + DS+ ++GFRC
Sbjct: 248 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 287
>gi|432113885|gb|ELK35996.1| Sulfatase-modifying factor 2, partial [Myotis davidii]
Length = 525
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 124/337 (36%), Positives = 165/337 (48%), Gaps = 85/337 (25%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F+MGTN P +DGE P R VT+ F +D V+N F+EFV Y TEAE
Sbjct: 269 MVKLPGGRFQMGTNSPDG-RDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEAE 327
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q + G S I R+ PVVH+SWNDA AYC W+
Sbjct: 328 TFGWSFVFEDFVSDELRNKATQ----QMKPAGPGSGIRERLELPVVHVSWNDARAYCAWQ 383
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+ P +R N W
Sbjct: 384 GKRLPTEEEWEFAARGGLKGQVYPWGNKFQP--------NRTN---------------LW 420
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G FP G+ A DG+ +P
Sbjct: 421 QGK--------------------FPKGDK---------------------AEDGFHGVSP 439
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY++VGNVWEWTA P NP +V +G S++ +
Sbjct: 440 VNAFPPQNNYGLYDLVGNVWEWTA-------SPYQGANPDM------RVLRGASWIDTSD 486
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCA++ G G
Sbjct: 487 GSANHRARVTTRMGNTPDSASDNLGFRCASNIGRLPG 523
Score = 154 bits (389), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 118/232 (50%), Gaps = 65/232 (28%)
Query: 163 LHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGN 222
+ P G S I R+ PVVHVSWNDA AYC W+G RLPTE EWE+ RGGL+ +++PWGN
Sbjct: 351 MKPAGPGSGIRERLELPVVHVSWNDARAYCAWQGKRLPTEEEWEFAARGGLKGQVYPWGN 410
Query: 223 NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWN 281
P +R N+WQG+FP + A DG+ +PV ++ +N +GLY++VGNVWEWTA
Sbjct: 411 KFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPPQNNYGLYDLVGNVWEWTA---- 463
Query: 282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
P NP +V +G S++ +T D SA
Sbjct: 464 ---SPYQGANPDM------RVLRGASWI----------------DTSDGSAN-------- 490
Query: 342 DKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+R R R NTPDS++ NLGFRCA+++
Sbjct: 491 ------------------------HRARVTTRMGNTPDSASDNLGFRCASNI 518
>gi|421484810|ref|ZP_15932376.1| hypothetical protein QWC_19385 [Achromobacter piechaudii HLE]
gi|400196843|gb|EJO29813.1| hypothetical protein QWC_19385 [Achromobacter piechaudii HLE]
Length = 373
Score = 188 bits (477), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 121/350 (34%), Positives = 164/350 (46%), Gaps = 63/350 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +P F MG N L + E P +V + F++D H+V+N +F+ FV ATGYVT AE
Sbjct: 42 MAWIPPAEFLMGNNH-KLSQPNELPPHSVRVSGFWMDTHDVTNAEFRRFVEATGYVTTAE 100
Query: 75 ---KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
K+ D V P R + F G +S + R Y
Sbjct: 101 QKPKWEDLKVQLP--PGTPRPDDGLLVPGAMVFVGSESEVSLR--------------DYT 144
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W W R P SW P+G S+I + +HPVV VS+ DA AY
Sbjct: 145 RW----------W----------RFVPGASWRQPQGPGSSIVGKDDHPVVQVSYEDAQAY 184
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAAD 247
W G RLPTEAEWE+ RGG E + WGN L+P G+ AN+W Q FP
Sbjct: 185 AKWAGKRLPTEAEWEFAARGGFEQATYAWGNELSPNGKPMANIWDTQQQQPFPVVKDEKI 244
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---------------PAPSYNP 292
+ T PV ++ N +GLY+M GNVW+WTADW+ PA S++P
Sbjct: 245 -QVGTTPVGTFPPNAYGLYDMAGNVWQWTADWYRADAFRIQAQYRQPPQDPVGPADSFDP 303
Query: 293 KG---PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
P + +V +GGS+LC++ YC +R +AR P + +LGFR
Sbjct: 304 DDGNVPASAPKRVTRGGSFLCSDTYCISYRASARRGTDPLNGMSHLGFRT 353
>gi|226357800|ref|YP_002787540.1| sulfatase-modifying factor 1 [Deinococcus deserti VCD115]
gi|226320043|gb|ACO48036.1| putative sulfatase-modifying factor 1 [Deinococcus deserti VCD115]
Length = 308
Score = 188 bits (477), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 166/347 (47%), Gaps = 57/347 (16%)
Query: 1 MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
M+L PA ++M+ +PG F+MG++ E P+ +V ++ F++D H V+N +F
Sbjct: 1 MLLRPA-----RENMLWIPGGDFQMGSDH---HYPEERPAHHVHVEGFWMDPHPVTNAEF 52
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
+ FV ATGYVT AE+ D F P + + S V P
Sbjct: 53 RRFVEATGYVTLAERTPDPAQF-PGVPLDVLVPGSVV-----------------FQQPAG 94
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
+ D + W Y P W PEG ST+ + +HP
Sbjct: 95 PVPLYDHAGW-------------WAY----------VPGACWHQPEGPHSTLTGQDSHPA 131
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
VHV++ DA+AY W G LP EAEWEY RGGL F WG+ P+G AN W G FP
Sbjct: 132 VHVAFEDALAYAAWAGKMLPGEAEWEYAARGGLHGATFAWGDEERPQGRVMANTWHGRFP 191
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPKGPTT 297
N G+ T+PV +Y N +GL++M GNVWEWT D + H H P
Sbjct: 192 WENLDPHGFPRTSPVGTYPANGYGLHDMTGNVWEWTLDPFQPQHVIRHVKSCCTPLNTRA 251
Query: 298 GTD-----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
GT +V KGGS+LC YC+R+R A+R DSS +LGFRC
Sbjct: 252 GTGGTRERRVIKGGSHLCAPNYCFRYRPASRQGQEVDSSTSHLGFRC 298
>gi|124267866|ref|YP_001021870.1| hypothetical protein Mpe_A2681 [Methylibium petroleiphilum PM1]
gi|124260641|gb|ABM95635.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 337
Score = 187 bits (476), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 92/195 (47%), Positives = 120/195 (61%), Gaps = 12/195 (6%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
R P W HP G DS ++ R NHPVV VS++DA+AY W G LPTEAEWEY RGGL+
Sbjct: 139 RYVPGADWRHPGGPDSDLDGRGNHPVVQVSYDDALAYARWAGRDLPTEAEWEYAARGGLK 198
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
+ + WG+ + P G HRAN WQGEFP NT DGY T+PV ++ N FGLY+M GNVW+
Sbjct: 199 GQRYAWGSEVRPEGVHRANTWQGEFPLRNTQEDGYAGTSPVGCFRPNGFGLYDMAGNVWQ 258
Query: 275 WTADWWNVHHH-----------PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 323
WT+ W+ H A S +P+ P T +V KGGS+LC+ YC R+R +AR
Sbjct: 259 WTSSWYRPGHQAGANGRLAETPAAGSGDPRQPGTPV-RVIKGGSHLCSPDYCLRYRPSAR 317
Query: 324 SQNTPDSSAGNLGFR 338
++ ++GFR
Sbjct: 318 QPQATSTATSHVGFR 332
>gi|421290358|ref|ZP_15741108.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA54354]
gi|421305750|ref|ZP_15756404.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA62331]
gi|395888043|gb|EJG99057.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA54354]
gi|395904708|gb|EJH15622.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA62331]
Length = 282
Score = 187 bits (476), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 159/335 (47%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E P V++ F + V+N +F +FV TGY T A
Sbjct: 3 MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ +FVF + E ER S HP W C
Sbjct: 63 ERQEWSFVFILFVPEAEREGYS---------------------HPAGAPWWLQVPNAC-- 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W HP G +S + +HPVVHV+ DA+A+C
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|326427798|gb|EGD73368.1| SUMF1 protein [Salpingoeca sp. ATCC 50818]
Length = 155
Score = 187 bits (475), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 90/155 (58%), Positives = 108/155 (69%), Gaps = 4/155 (2%)
Query: 190 AYCTWR--GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A+C W RLPTEAEWE+ GGL+NR FPWGN + P+ H N WQ P N+ D
Sbjct: 1 AFCKWSHPKGRLPTEAEWEFAASGGLKNRTFPWGNKMMPKNRHWMNTWQSTAP--NSMED 58
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
GY TAP +Y NKFGLYN VGNVWEWT DW+ H AP+ +P+GP++G KVKKGGS
Sbjct: 59 GYALTAPGTAYPPNKFGLYNTVGNVWEWTNDWFTNRHSAAPAVDPRGPSSGETKVKKGGS 118
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
Y+C++ CYR+R AAR TPDSSA N+GFRCAAD
Sbjct: 119 YMCHQFTCYRYRIAARMHITPDSSAANVGFRCAAD 153
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 28/40 (70%), Gaps = 2/40 (5%)
Query: 129 AYCTWR--GARLPTEAEWEYGCRGGLENRLFPWGSWLHPE 166
A+C W RLPTEAEWE+ GGL+NR FPWG+ + P+
Sbjct: 1 AFCKWSHPKGRLPTEAEWEFAASGGLKNRTFPWGNKMMPK 40
>gi|452876422|ref|ZP_21953778.1| sulfatase-modifying factor 1 [Pseudomonas aeruginosa VRFPA01]
gi|452186780|gb|EME13798.1| sulfatase-modifying factor 1 [Pseudomonas aeruginosa VRFPA01]
Length = 317
Score = 187 bits (474), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 135/358 (37%), Positives = 168/358 (46%), Gaps = 69/358 (19%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R DM LPG FRMG+ + E P R V +D F++D V+N QF FV ATGY
Sbjct: 7 RTPDMRYLPGGVFRMGSER---FYPEERPVRQVRVDPFWIDVTPVTNRQFAAFVEATGYR 63
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
T AEK E A + +M R L T PV + D
Sbjct: 64 TFAEK------------APEPADYPGMPAEMARAGSLVFT-------PVSAVV--DLADP 102
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W W + R W W P G S+I HPVVHV++ DA A
Sbjct: 103 SRW----------WSF--------RFGAW--WREPLGPGSSIAGLEEHPVVHVAYEDAAA 142
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G LP+EAEWEY RGG + F WG+ L P G+ AN WQG FP N DG+
Sbjct: 143 YAAWVGKALPSEAEWEYAARGGRDGEEFAWGDTLEPGGQILANYWQGRFPLENLLLDGWE 202
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP---------------------S 289
T+PV ++ N GL++M+GNVWEWTADW+ PA S
Sbjct: 203 RTSPVGAFPANGHGLFDMIGNVWEWTADWF---RQPAATDRQASSCCASRNPRGGREEDS 259
Query: 290 YNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
Y+P P KV KGGS+LC E YC R+R AAR T D++ G++GFRC + P
Sbjct: 260 YDPALPALRIGRKVLKGGSHLCAENYCQRYRPAARIPQTIDTTTGHVGFRCVRREPPA 317
>gi|126436917|ref|YP_001072608.1| hypothetical protein Mjls_4346 [Mycobacterium sp. JLS]
gi|126236717|gb|ABO00118.1| protein of unknown function DUF323 [Mycobacterium sp. JLS]
Length = 291
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 166/332 (50%), Gaps = 51/332 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+D+V +P + +G+++ E P+R+VT+D F++ H V+N +F FV ATGY T
Sbjct: 3 EDLVRIPAQSATIGSDR---HYPEEAPARDVTVDGFWIQAHAVTNAEFAAFVDATGYRTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ +P + A+ Q S + R PV
Sbjct: 60 AER-----PLDPANYPDAPAQNLQP----------GSMVFRRTAGPV------------- 91
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
L ++W Y G SW HP G S+I +R +HPVVHV+++DA AY
Sbjct: 92 ----DLRHLSQWWYWTPGA---------SWRHPIGPGSSIVNRADHPVVHVAYDDAEAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
+W G LPTEAEWE RGGL + + WG+ G AN W GEFP GY T
Sbjct: 139 SWAGLELPTEAEWEVAARGGLSHAGYTWGDEPEGPGHKLANYWHGEFPWRPD--RGYGRT 196
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----PSYNPKGPTTGTD-KVKKGGS 307
APV SY N +GLY+M GNVWEWT DW+ A SY+P P +V KGGS
Sbjct: 197 APVGSYPPNGYGLYDMAGNVWEWTTDWYTDTGGSACCAEDSYDPAQPQFPVPRRVVKGGS 256
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+LC + YC R+R AAR D+ ++GFRC
Sbjct: 257 FLCADSYCARYRPAARRPQPVDTGMSHIGFRC 288
>gi|367469483|ref|ZP_09469235.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycinee-
generating enzyme 1) [Patulibacter sp. I11]
gi|365815446|gb|EHN10592.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycinee-
generating enzyme 1) [Patulibacter sp. I11]
Length = 316
Score = 187 bits (474), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 93/182 (51%), Positives = 118/182 (64%), Gaps = 6/182 (3%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
SW PEG STI+ R++HPVVHV+ +DA+A+C W G RLPTE EWE+ RGGL+ +PW
Sbjct: 140 SWRAPEGPGSTIDGRLDHPVVHVTRDDALAFCAWAGLRLPTEVEWEHAARGGLDGARYPW 199
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
G+ L P GEHR NVWQG FP + DG+ T PV ++ N FGL+ + GNVWEWTAD W
Sbjct: 200 GDELEPGGEHRCNVWQGAFPGEDRGEDGFRGTCPVDAFPANGFGLHEVSGNVWEWTADPW 259
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P P G + +GGS+LC+ YC R+R AAR+ P S+ G+ GFRCA
Sbjct: 260 A---RPGAVLRPA--PDGWVAI-RGGSHLCHRSYCERYRVAARTAADPASTTGHQGFRCA 313
Query: 341 AD 342
AD
Sbjct: 314 AD 315
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 22/40 (55%), Positives = 29/40 (72%)
Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+GGS+LC+ YC R+R AAR+ P S+ G+ GFRCAAD
Sbjct: 277 RGGSHLCHRSYCERYRVAARTAADPASTTGHQGFRCAADA 316
>gi|404214088|ref|YP_006668282.1| hypothetical protein KTR9_1487 [Gordonia sp. KTR9]
gi|403644887|gb|AFR48127.1| hypothetical protein KTR9_1487 [Gordonia sp. KTR9]
Length = 274
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 122/334 (36%), Positives = 157/334 (47%), Gaps = 67/334 (20%)
Query: 16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
V GD F G DGE P V L AF +D V+N F FV+ATG+ T+AE
Sbjct: 7 VFAMGDAFDEGYRT-----DGETPVHEVELSAFSIDATTVTNAAFASFVAATGHRTDAEI 61
Query: 76 FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRG 135
FG S V H G PV W AV +WR
Sbjct: 62 FGG----------------SAVFHTYATAPG----------EPVPGTPWWLAVDGASWR- 94
Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
P G S ++ +HPVVHVS DA AYC W
Sbjct: 95 ----------------------------RPAGPGSVVDGLADHPVVHVSHRDAQAYCAWA 126
Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
G LPTEA+WEY RGGL FPWG+ + R N+++G+FP T + T PV
Sbjct: 127 GRALPTEAQWEYAARGGLHGARFPWGDEPPTADDPRCNIFRGDFPNEPT---DRVGTTPV 183
Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGPTTGTDKVKKGGSYLCNEQY 314
++ N GLY GNVWEW AD ++ ++ +P GP G+ +V +GGS+LC++ Y
Sbjct: 184 RMFEPNGHGLYQCAGNVWEWCADTFSARYYRVSDRTDPTGPGRGSARVLRGGSHLCHDSY 243
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
C+R+R AARS NTP+S+A N+GFR GP G
Sbjct: 244 CHRYRVAARSHNTPESTASNIGFRTV---GPPVG 274
>gi|149020874|ref|ZP_01835403.1| hypothetical protein CGSSp23BS72_02514 [Streptococcus pneumoniae
SP23-BS72]
gi|182684754|ref|YP_001836501.1| hypothetical protein SPCG_1784 [Streptococcus pneumoniae CGSP14]
gi|221232543|ref|YP_002511697.1| hypothetical protein SPN23F_18190 [Streptococcus pneumoniae ATCC
700669]
gi|387758004|ref|YP_006064983.1| hypothetical protein SPNOXC_15820 [Streptococcus pneumoniae OXC141]
gi|405760334|ref|YP_006700930.1| Sulfatase modifying factor 1 precursor
(C-alpha-formyglycine-generating enzyme 1)
[Streptococcus pneumoniae SPNA45]
gi|415700292|ref|ZP_11457994.1| hypothetical protein CGSSp4595_1803 [Streptococcus pneumoniae
459-5]
gi|417699171|ref|ZP_12348342.1| hypothetical protein SPAR69_1729 [Streptococcus pneumoniae GA41317]
gi|418103487|ref|ZP_12740559.1| hypothetical protein SPAR143_1767 [Streptococcus pneumoniae NP070]
gi|418124150|ref|ZP_12761080.1| hypothetical protein SPAR82_1819 [Streptococcus pneumoniae GA44378]
gi|418128689|ref|ZP_12765581.1| hypothetical protein SPAR144_1769 [Streptococcus pneumoniae NP170]
gi|418137892|ref|ZP_12774729.1| hypothetical protein SPAR24_1792 [Streptococcus pneumoniae GA11663]
gi|418149145|ref|ZP_12785907.1| hypothetical protein SPAR34_1638 [Streptococcus pneumoniae GA13856]
gi|418178921|ref|ZP_12815502.1| hypothetical protein SPAR73_1854 [Streptococcus pneumoniae GA41565]
gi|418232795|ref|ZP_12859381.1| hypothetical protein SPAR13_1706 [Streptococcus pneumoniae GA07228]
gi|418237249|ref|ZP_12863815.1| hypothetical protein SPAR59_1804 [Streptococcus pneumoniae GA19690]
gi|419453975|ref|ZP_13993945.1| hypothetical protein SPAR138_1707 [Streptococcus pneumoniae
EU-NP03]
gi|419473875|ref|ZP_14013723.1| hypothetical protein SPAR29_1747 [Streptococcus pneumoniae GA13430]
gi|419476129|ref|ZP_14015965.1| hypothetical protein SPAR36_1733 [Streptococcus pneumoniae GA14688]
gi|419482825|ref|ZP_14022612.1| hypothetical protein SPAR65_1746 [Streptococcus pneumoniae GA40563]
gi|419487274|ref|ZP_14027036.1| hypothetical protein SPAR79_1788 [Streptococcus pneumoniae GA44128]
gi|419493914|ref|ZP_14033639.1| hypothetical protein SPAR89_1741 [Streptococcus pneumoniae GA47210]
gi|419506670|ref|ZP_14046331.1| hypothetical protein SPAR111_1709 [Streptococcus pneumoniae
GA49194]
gi|421209563|ref|ZP_15666576.1| hypothetical protein AMCSP11_001752 [Streptococcus pneumoniae
2070005]
gi|421225638|ref|ZP_15682376.1| hypothetical protein AMCSP06_001777 [Streptococcus pneumoniae
2070768]
gi|421234707|ref|ZP_15691324.1| hypothetical protein AMCSP02_001710 [Streptococcus pneumoniae
2061617]
gi|421239120|ref|ZP_15695684.1| hypothetical protein AMCSP15_001689 [Streptococcus pneumoniae
2071247]
gi|421241250|ref|ZP_15697795.1| hypothetical protein AMCSP17_001655 [Streptococcus pneumoniae
2080913]
gi|421245626|ref|ZP_15702124.1| hypothetical protein AMCSP10_001602 [Streptococcus pneumoniae
2081685]
gi|421250044|ref|ZP_15706500.1| hypothetical protein AMCSP19_001672 [Streptococcus pneumoniae
2082239]
gi|147930515|gb|EDK81498.1| hypothetical protein CGSSp23BS72_02514 [Streptococcus pneumoniae
SP23-BS72]
gi|182630088|gb|ACB91036.1| hypothetical protein SPCG_1784 [Streptococcus pneumoniae CGSP14]
gi|220675005|emb|CAR69583.1| conserved hypothetical protein [Streptococcus pneumoniae ATCC
700669]
gi|301800593|emb|CBW33234.1| conserved hypothetical protein [Streptococcus pneumoniae OXC141]
gi|332199817|gb|EGJ13892.1| hypothetical protein SPAR69_1729 [Streptococcus pneumoniae GA41317]
gi|353774788|gb|EHD55275.1| hypothetical protein SPAR143_1767 [Streptococcus pneumoniae NP070]
gi|353795292|gb|EHD75642.1| hypothetical protein SPAR82_1819 [Streptococcus pneumoniae GA44378]
gi|353798685|gb|EHD79013.1| hypothetical protein SPAR144_1769 [Streptococcus pneumoniae NP170]
gi|353811481|gb|EHD91723.1| hypothetical protein SPAR34_1638 [Streptococcus pneumoniae GA13856]
gi|353841872|gb|EHE21924.1| hypothetical protein SPAR73_1854 [Streptococcus pneumoniae GA41565]
gi|353886108|gb|EHE65892.1| hypothetical protein SPAR13_1706 [Streptococcus pneumoniae GA07228]
gi|353891687|gb|EHE71441.1| hypothetical protein SPAR59_1804 [Streptococcus pneumoniae GA19690]
gi|353900200|gb|EHE75758.1| hypothetical protein SPAR24_1792 [Streptococcus pneumoniae GA11663]
gi|379550393|gb|EHZ15493.1| hypothetical protein SPAR29_1747 [Streptococcus pneumoniae GA13430]
gi|379558911|gb|EHZ23943.1| hypothetical protein SPAR36_1733 [Streptococcus pneumoniae GA14688]
gi|379579417|gb|EHZ44324.1| hypothetical protein SPAR65_1746 [Streptococcus pneumoniae GA40563]
gi|379585643|gb|EHZ50499.1| hypothetical protein SPAR79_1788 [Streptococcus pneumoniae GA44128]
gi|379592487|gb|EHZ57303.1| hypothetical protein SPAR89_1741 [Streptococcus pneumoniae GA47210]
gi|379608584|gb|EHZ73330.1| hypothetical protein SPAR111_1709 [Streptococcus pneumoniae
GA49194]
gi|379626045|gb|EHZ90671.1| hypothetical protein SPAR138_1707 [Streptococcus pneumoniae
EU-NP03]
gi|381314443|gb|EIC55212.1| hypothetical protein CGSSp4595_1803 [Streptococcus pneumoniae
459-5]
gi|395573659|gb|EJG34249.1| hypothetical protein AMCSP11_001752 [Streptococcus pneumoniae
2070005]
gi|395589125|gb|EJG49447.1| hypothetical protein AMCSP06_001777 [Streptococcus pneumoniae
2070768]
gi|395599760|gb|EJG59922.1| hypothetical protein AMCSP02_001710 [Streptococcus pneumoniae
2061617]
gi|395600763|gb|EJG60918.1| hypothetical protein AMCSP15_001689 [Streptococcus pneumoniae
2071247]
gi|395607628|gb|EJG67725.1| hypothetical protein AMCSP17_001655 [Streptococcus pneumoniae
2080913]
gi|395608153|gb|EJG68249.1| hypothetical protein AMCSP10_001602 [Streptococcus pneumoniae
2081685]
gi|395613015|gb|EJG73047.1| hypothetical protein AMCSP19_001672 [Streptococcus pneumoniae
2082239]
gi|404277223|emb|CCM07732.1| Sulfatase modifying factor 1 precursor
(C-alpha-formyglycine-generating enzyme 1)
[Streptococcus pneumoniae SPNA45]
gi|429319972|emb|CCP33294.1| conserved hypothetical protein [Streptococcus pneumoniae SPN034183]
gi|429321790|emb|CCP35268.1| conserved hypothetical protein [Streptococcus pneumoniae SPN994039]
gi|429323610|emb|CCP31309.1| conserved hypothetical protein [Streptococcus pneumoniae SPN994038]
Length = 282
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 120/335 (35%), Positives = 160/335 (47%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E P V++ F + V+N +F +FV TGY T A
Sbjct: 3 MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ +FVF + E ER EG HP W C
Sbjct: 63 ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W HP G +S + +HPVVHV+ DA+A+C
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|418014464|ref|ZP_12654063.1| sulfatase modifying factor [Lactobacillus casei Lpc-37]
gi|410553708|gb|EKQ27705.1| sulfatase modifying factor [Lactobacillus casei Lpc-37]
Length = 287
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 122/336 (36%), Positives = 167/336 (49%), Gaps = 66/336 (19%)
Query: 11 RYKDMVLLPGDTFRMGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
R+K+ + G +F+MGT+ I +D E P V + +F + V+N F +F++AT Y
Sbjct: 5 RFKE---IKGGSFQMGTDDHIGFDEDYEGPPTIVRVPSFSMADTPVTNADFDDFIAATAY 61
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
T AE+ G +FVFE L+ EEER T +H VA
Sbjct: 62 QTVAERLGSSFVFELLIPEEERV-----------------TYQH--------------VA 90
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W W L P W HP G +S+ NHPVVHV+ DA+
Sbjct: 91 GAPW----------W----------LLVPGADWQHPYGAESSNIDLDNHPVVHVALEDAL 130
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AYC W ++LPTEA+WEY G +PWG +L AN WQG+FP +NTA DG+
Sbjct: 131 AYCQWSHSQLPTEAQWEYAAGAGTATT-YPWGESLVDEHGFHANTWQGDFPNDNTAEDGF 189
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP------KGPTTGTDKVK 303
+ TAPV SY+ N GLY ++GNVWEW N + +N K P G +
Sbjct: 190 VGTAPVKSYEPNSNGLYQIIGNVWEWCR---NPRYTLLDDFNAEQFKLGKVPAAGEYAI- 245
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+GGS+LC+ YC R+R AAR+ S++ +L FRC
Sbjct: 246 RGGSFLCHCSYCNRYRTAARNGVDLQSTSSHLSFRC 281
>gi|254383810|ref|ZP_04999158.1| conserved hypothetical protein [Streptomyces sp. Mg1]
gi|194342703|gb|EDX23669.1| conserved hypothetical protein [Streptomyces sp. Mg1]
Length = 309
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 126/335 (37%), Positives = 159/335 (47%), Gaps = 54/335 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TFRMG+ E P R VT+D F++D H V+ +F+ F ATG+VT AE
Sbjct: 16 MVRVPGGTFRMGSEA---FYPEERPVRPVTVDGFWMDAHPVTVAEFRRFTKATGHVTVAE 72
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D F ER + + + PV W W
Sbjct: 73 RELDPRDF----PGAERGALVP-----------GALVFTMTPGPVDLDDWR------RW- 110
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y P W P G DST R HPV V++ DAVAY W
Sbjct: 111 ---------WSYQ----------PGACWRRPLGGDSTTHGRELHPVTQVAFEDAVAYAAW 151
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP---TNNTAADGYLS 251
G LPTEAEWE+ RGGLE +F WG+ TPRG AN W G FP +A
Sbjct: 152 AGKELPTEAEWEFAARGGLEGAVFVWGDEFTPRGRRMANTWHGAFPWEYLPASAKSPRPG 211
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSY--NPKGPTTGTD----KVKK 304
T+ V SY N +GLY+M GNVWEWT D + + PAP P G + +V K
Sbjct: 212 TSAVRSYPANGYGLYDMAGNVWEWTCDAYADRRADPAPGACCAPGDARAGGERFPRRVTK 271
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
GGS+LC YC R+R AAR+ + D++ +LGFRC
Sbjct: 272 GGSHLCAPNYCLRYRPAARTGQSEDTATCHLGFRC 306
>gi|254431213|ref|ZP_05044916.1| sulfatase-modifying factor 1 [Cyanobium sp. PCC 7001]
gi|197625666|gb|EDY38225.1| sulfatase-modifying factor 1 [Cyanobium sp. PCC 7001]
Length = 337
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 123/354 (34%), Positives = 167/354 (47%), Gaps = 63/354 (17%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P P +DMV +PG +F MG+N E P+ ++ F++D+ V+N QF++FV
Sbjct: 24 PCPGRPPARDMVWIPGGSFTMGSNHHY---PEEAPAHQRQVEGFWIDRAPVTNAQFRKFV 80
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
ATG+VT AE+ D + L E L + P +
Sbjct: 81 KATGHVTLAERAADPADYPDALPE------------------LLAPASIVFVPPSGPVGT 122
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
D + W+Y P +W HPEG S+I+ R +HPVVHV+
Sbjct: 123 GDPYRW-------------WQY----------IPGANWRHPEGPGSSIKGRDHHPVVHVA 159
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
DA+AY W G +LPTEAEWE+ GGL F WG+ L P G AN +QGEFP +N+
Sbjct: 160 HEDALAYAVWAGKQLPTEAEWEFAAWGGLGGTEFAWGHELHPGGRAMANTFQGEFPHHNS 219
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN---------PKGP 295
GY T+PV ++ N +GL +M+GNVWEWT W+ H P
Sbjct: 220 RLVGYERTSPVGAFAANGYGLVDMIGNVWEWTDTWYGEHGAAGPGGEAAPEGGCCASAAR 279
Query: 296 TTGTD----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
D KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 280 EASIDRNSQHGAIPRKVVKGGSFLCAPSYCRRYRPAARMAQGIDTSTCHMGFRC 333
>gi|381203031|ref|ZP_09910140.1| hypothetical protein SyanX_21071 [Sphingobium yanoikuyae XLDN2-5]
Length = 320
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 130/352 (36%), Positives = 172/352 (48%), Gaps = 62/352 (17%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
AP +M+ +PG F MG++ E P+ V +DAF++D V+N F+ FV
Sbjct: 8 APAATAPDNMIHIPGGRFIMGSDH---HYPEEAPAHPVEVDAFWIDATPVTNRDFRRFVE 64
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
ATG+ T AE P + A+ DM R L T R PV W+
Sbjct: 65 ATGHRTTAE-------IAPNPDDYPGAQP-----DMLRPASLVFTPTER---PVPLHDWS 109
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
W W Y W HP G DS+I+ +HPVVHV++
Sbjct: 110 ------QW----------WRYVFDA----------DWRHPLGPDSSIDDLDDHPVVHVTY 143
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
+DA AY W G LPTEAEWE+ RGG++ + F WG+ L P G+ AN WQG FP+ N
Sbjct: 144 DDAAAYAAWAGKALPTEAEWEFAARGGVDGQEFAWGDELMPDGKAMANFWQGNFPSENLL 203
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----------------- 288
DGY T+PV S+ N +GL +M+GNVWEWT D++ H P
Sbjct: 204 IDGYERTSPVGSFPANGYGLSDMIGNVWEWTEDFYAARHQADPSRPCCAPRNPLNQAREA 263
Query: 289 SYNPKGP-TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
SY+P+ P +V KGGS+LC YC R+R AAR DSS ++GFRC
Sbjct: 264 SYDPQQPHILIPRRVLKGGSHLCAPSYCRRYRPAARHAQAIDSSTSHIGFRC 315
>gi|359425710|ref|ZP_09216805.1| hypothetical protein GOAMR_51_00440 [Gordonia amarae NBRC 15530]
gi|358239024|dbj|GAB06387.1| hypothetical protein GOAMR_51_00440 [Gordonia amarae NBRC 15530]
Length = 300
Score = 186 bits (472), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 123/344 (35%), Positives = 168/344 (48%), Gaps = 64/344 (18%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG FRMG+++ E P+ + AF +++H V+N Q+ FVS TGYVT A
Sbjct: 5 ELVPLPGGVFRMGSDR---HYPEERPAHERRVGAFAIERHPVTNAQYARFVSDTGYVTVA 61
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E EP+ D F G D + + +V + V W
Sbjct: 62 E--------EPI--------------DPADFPGADP--KQLIPGALVFTPTDGPVDLGDW 97
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W R P SW P+G ST++HR +HPVV +++ DA AY
Sbjct: 98 --------TRWW---------RWVPGASWRSPQGPGSTVDHRPDHPVVQIAYRDAQAYAH 140
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ GG + WG P + AN WQG FP N G+ ST+
Sbjct: 141 WAGRRLPTEAEWEFAACGGRSGAEYAWGEEFRPGDQVMANTWQGRFPYLNA---GWGSTS 197
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD------------- 300
PV ++ N +GL +M+GNVWE +D + H + + GP D
Sbjct: 198 PVGTFPANGYGLADMIGNVWERVSDIFVPRH--SDISDADGPVVEADGRPDLLAPTTSPR 255
Query: 301 --KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+V KGGS+LC +YC R+R AARS + DS+ G+LGFRCAAD
Sbjct: 256 VMRVTKGGSFLCAPEYCRRYRPAARSAQSDDSATGHLGFRCAAD 299
>gi|145225614|ref|YP_001136292.1| hypothetical protein Mflv_5038 [Mycobacterium gilvum PYR-GCK]
gi|145218100|gb|ABP47504.1| protein of unknown function DUF323 [Mycobacterium gilvum PYR-GCK]
Length = 294
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/333 (36%), Positives = 167/333 (50%), Gaps = 50/333 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D+V L G +FRMG+ + E P+ VT+ F ++++ V+N QF EFV TGY T A
Sbjct: 4 DLVDLEGGSFRMGSTR---FYPEEAPAHTVTVAPFAIERNPVTNAQFAEFVDDTGYRTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D ++ P S + + V F ++ R W
Sbjct: 61 ERPPDPALY-PGASPHDLVPGALV------FRPTAGPVDLR-----------------DW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R WE+ P W HP G DS+++ R HPVV V ++DA AY
Sbjct: 97 R-------QWWEWT----------PGADWRHPFGPDSSVDDRPEHPVVQVCYSDAAAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ G ++ WG+ P G+ AN WQG FP N A G+ T+
Sbjct: 140 WAGRRLPTEAEWEFAAGAG-STAVYAWGDEPAPGGQLMANTWQGAFPYRNDGALGWAGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVKKGGSY 308
PV ++ N FGL +M+GNVWEWT + HH P P P ++ KGGS+
Sbjct: 199 PVGTFPANGFGLVDMIGNVWEWTTTRFAGHHRVNPPPPPTCCPPADPDPAVNQALKGGSH 258
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
LC +YC+R+R +ARS + DS+ ++GFRCAA
Sbjct: 259 LCAPEYCHRYRPSARSPQSQDSATTHIGFRCAA 291
>gi|421236881|ref|ZP_15693478.1| hypothetical protein AMCSP07_001690 [Streptococcus pneumoniae
2071004]
gi|395601644|gb|EJG61791.1| hypothetical protein AMCSP07_001690 [Streptococcus pneumoniae
2071004]
Length = 282
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/335 (35%), Positives = 160/335 (47%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E P V++ F + V+N +F +FV TGY T A
Sbjct: 3 MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFVQFVKETGYKTLA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ +FVF + E ER EG HP W C
Sbjct: 63 ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W HP G +S + +HPVVHV+ DA+A+C
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|295134836|ref|YP_003585512.1| sulfatase-modifying factor [Zunongwangia profunda SM-A87]
gi|294982851|gb|ADF53316.1| sulfatase-modifying factor [Zunongwangia profunda SM-A87]
Length = 376
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 124/350 (35%), Positives = 178/350 (50%), Gaps = 55/350 (15%)
Query: 9 VERYKDMVLLPGDTFRMGTNK--PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSA 66
+E + MV + G F MG ++ P + E P+ V +D F++DQ EV+N QF++F +
Sbjct: 60 IETPEGMVWVSGVKFTMGASEGDPYALPR-EKPAHPVAVDGFFIDQTEVTNAQFKKFTAE 118
Query: 67 TGYVTEAEKFGDTFVFEPLLSEEERAKI---SQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
TGYVT AE+ P+ E+ + ++ + D G S I + + ++
Sbjct: 119 TGYVTVAER--------PIDWEQMKKQLPPGTPKPPDSVLQPG--SLIFKKELENIANL- 167
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHV 183
ND Y W EW+ G W HP+G S IE + N+PVVH+
Sbjct: 168 -ND---YGQWW--------EWKTGA------------DWKHPQGPQSDIEGKDNYPVVHI 203
Query: 184 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
++ DA+AY +W G LPTEAEWE G L ++ WG++ + + AN WQGEFP N
Sbjct: 204 AYEDALAYASWAGRDLPTEAEWEAAAHGKLHGGIYTWGDDES-KLNKEANTWQGEFPVKN 262
Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTT- 297
DGY APV S+ N LY+M GNVWE+T D +N ++ NPKG TT
Sbjct: 263 IPEDGYKYAAPVKSFPSNSLNLYDMAGNVWEFTKDNFNTRYYQDALQQGELLNPKGSTTY 322
Query: 298 -------GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+KV KGGS+LCN+ YC +R ++R + DS +LGFR
Sbjct: 323 FNEDNPYQKEKVIKGGSFLCNKSYCASYRISSRMGTSMDSGTDHLGFRTV 372
>gi|384103458|ref|ZP_10004435.1| hypothetical protein W59_18879 [Rhodococcus imtechensis RKJ300]
gi|383839299|gb|EID78656.1| hypothetical protein W59_18879 [Rhodococcus imtechensis RKJ300]
Length = 304
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 163/347 (46%), Gaps = 69/347 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TF MG+ E P VT+D F++D H+V+ +F+ FV TG+VT AE
Sbjct: 1 MAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDTHQVTVAEFRRFVKDTGHVTTAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA---YC 131
+ D ++ G D + + +V I V Y
Sbjct: 58 ----------------------IAPDPAQYPGADPAL--LVPGSLVFIPTPGPVPLDDYT 93
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W W + P W HPEG S + R HPV HVSW DA AY
Sbjct: 94 RW----------WSF----------TPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAY 133
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWE+ RGGL+ + F WG P G NVWQG FP N DGY
Sbjct: 134 AEWAGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGYAG 193
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTD--- 300
T+PV ++ N + L +M GNVWEWT D++ V H + AP+ + P P T
Sbjct: 194 TSPVGHFRPNGYDLSDMAGNVWEWTTDYFTVDHSESGKNVAPTSSCCIPTNPRVDTARAE 253
Query: 301 --------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R +AR +T ++S ++GFRC
Sbjct: 254 NPDEPYARRVIKGGSHLCAPNYCLRYRPSARQGDTEETSTCHIGFRC 300
>gi|393725914|ref|ZP_10345841.1| hypothetical protein SPAM2_19854 [Sphingomonas sp. PAMC 26605]
Length = 321
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 132/351 (37%), Positives = 173/351 (49%), Gaps = 63/351 (17%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
V + + M + G +F+MG+ E P R V ++ F++D H V+N QF EFV ATG
Sbjct: 8 VRQLEGMRHIAGGSFKMGSEA---FYSEERPLRRVKVEDFWIDTHPVTNRQFSEFVEATG 64
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y T AE D + P ++ + S V FE D+ + +H H W
Sbjct: 65 YRTVAEIAPDPANY-PGMAPADAVAGSLV------FERTDALVP--LHD---HTRW---- 108
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
W + R G + W HP G DS I+ ++HPVVHV++ DA
Sbjct: 109 ---------------WAF--RHGAD--------WRHPTGPDSGIDDLLDHPVVHVAFADA 143
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LPTEAEWE+ RGGL+ + WG+ L P G AN WQG FP N ADG
Sbjct: 144 QAYAAWVGKVLPTEAEWEFAARGGLDGVEYAWGDELAPGGRVLANYWQGLFPFANQRADG 203
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH----HPAPSY----NPKG------ 294
T PV Y N +GLY+M+GNVWEWT D + + APS +P+G
Sbjct: 204 GYRTTPVGHYPPNGYGLYDMIGNVWEWTRDCYTLSRKAGGKAAPSCCAVPDPRGGTLRAS 263
Query: 295 -----PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P KV KGGS+LC YC R+R AAR T D + G++GFRC
Sbjct: 264 MDLAKPVRIGRKVLKGGSHLCAANYCQRYRPAARHPQTIDGATGHIGFRCV 314
>gi|325276759|ref|ZP_08142471.1| hypothetical protein G1E_24647 [Pseudomonas sp. TJI-51]
gi|324098107|gb|EGB96241.1| hypothetical protein G1E_24647 [Pseudomonas sp. TJI-51]
Length = 340
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 167/348 (47%), Gaps = 62/348 (17%)
Query: 4 LPAPPVERYKD-MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
LPAP ++D MV LPG F G+ + D E P+ + AF++D H V+N QF
Sbjct: 39 LPAPREGPWRDGMVKLPGGKFSFGSER---YYDEEGPAHLAQVSAFWIDVHPVTNAQFAR 95
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV+ATGY+T AE+ DT +P L + R + V +G D V+H
Sbjct: 96 FVAATGYLTHAERGIDT-ADDPSLPQHLRVPGAMVFK-----QGPD----------VLHP 139
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
W + P SW HP+G S ++ NHPVV
Sbjct: 140 GW------------------------------QFVPGASWRHPQGPGSDLQGLDNHPVVQ 169
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
V+ DA AY W G RLP+EA+ EY RGGL++ F WG P+G+ AN WQG+FP +
Sbjct: 170 VALEDAQAYARWAGRRLPSEAQLEYAMRGGLQDADFSWGMTELPKGKAMANTWQGQFPYH 229
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTD 300
N A DG+ T+PV + N FGL++ GNVWE T + H P +P GP+
Sbjct: 230 NAATDGFTGTSPVGCFPANGFGLFDAGGNVWELTRTGYRPGHAPQRDAGLDPPGPSLDDS 289
Query: 301 ----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC+ C R+R +AR + ++GFR
Sbjct: 290 HDPADPGVKVAVIKGGSHLCSADRCLRYRPSARQPQPVFMATSHVGFR 337
>gi|397734366|ref|ZP_10501076.1| hypothetical protein JVH1_5559 [Rhodococcus sp. JVH1]
gi|396930034|gb|EJI97233.1| hypothetical protein JVH1_5559 [Rhodococcus sp. JVH1]
Length = 312
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 122/346 (35%), Positives = 160/346 (46%), Gaps = 63/346 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+M +PG TF MG+ E P VT+D F++D H+V+ +F+ FV TG+VT
Sbjct: 7 KNMAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDAHQVTVAEFRRFVKDTGHVTT 63
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE + D ++ G D + + V T
Sbjct: 64 AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 91
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
L W P W HPEG S + R HPV HVSW DA AY
Sbjct: 92 PGPVPLDDYTRWW---------SFTPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAYA 142
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ + F WG P G NVWQG FP N DG+ T
Sbjct: 143 EWAGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGFAGT 202
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-------PAPS----YNPKGPTTGTD- 300
+PV ++ N + L +M GNVWEWT D++ H PA S NP+ T +
Sbjct: 203 SPVGHFRPNGYELSDMAGNVWEWTTDYFTADHSASGKNVAPASSCCIPTNPRVETARVEN 262
Query: 301 -------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR +T ++S ++GFRC
Sbjct: 263 PDEPYARRVIKGGSHLCAPNYCLRYRPAARQGDTEETSTCHIGFRC 308
>gi|452961455|gb|EME66755.1| hypothetical protein G352_02424 [Rhodococcus ruber BKS 20-38]
Length = 303
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 96/198 (48%), Positives = 120/198 (60%), Gaps = 16/198 (8%)
Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
P W HPEG ST+ R HPV HVS+ DA+AY W G LPTEAEWE+ RGGLE
Sbjct: 100 PGAQWRHPEGPGSTVAGRDRHPVTHVSYFDALAYAQWAGKELPTEAEWEFAARGGLERAR 159
Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
F WG+ PRG +RAN WQG FP NT DG++ T+PV SY+ N +GL ++ GNVWEWTA
Sbjct: 160 FVWGDEFAPRGRYRANTWQGRFPWENTGEDGFVGTSPVGSYRPNGYGLVDVAGNVWEWTA 219
Query: 278 DWWNVHH-----HPAPSYN---PKGPTTGTD--------KVKKGGSYLCNEQYCYRHRCA 321
D + HH AP++ P P TG +V KGGS+LC YC R+R A
Sbjct: 220 DHYTAHHGADGRDTAPAHACCIPSNPRTGAGDESGPYPRRVIKGGSHLCAPNYCLRYRPA 279
Query: 322 ARSQNTPDSSAGNLGFRC 339
AR + ++S ++GFRC
Sbjct: 280 ARQGESVETSTCHIGFRC 297
>gi|166796137|gb|AAI59002.1| Unknown (protein for IMAGE:7554678) [Xenopus (Silurana) tropicalis]
gi|166796992|gb|AAI59072.1| Unknown (protein for IMAGE:7565049) [Xenopus (Silurana) tropicalis]
Length = 273
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 109/269 (40%), Positives = 143/269 (53%), Gaps = 55/269 (20%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
E +MV L G F MGT K KDGE P R V + F +D++ V+N F+EFV A Y
Sbjct: 18 EDADNMVQLDGGQFDMGT-KASDGKDGESPVRQVKVLPFAVDKYPVTNKDFREFVRAKKY 76
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
TEAE FG +FVFE +SEE + K++Q +
Sbjct: 77 KTEAEAFGWSFVFEDFVSEELKKKVTQ------------------------------KLE 106
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
W LP E R F W P G S I+ ++++PVV VSWNDA
Sbjct: 107 SAPWW---LPVE-------------RAF----WRQPAGPASGIKEKLDYPVVQVSWNDAQ 146
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AYC W G RLP E EWE+ RGGL+ +++PWGN P +R N+WQG FP +TA DGY
Sbjct: 147 AYCKWLGKRLPIEEEWEFAARGGLKGKVYPWGNTFKP---NRTNLWQGTFPGMDTATDGY 203
Query: 250 LSTAPVMSY-KENKFGLYNMVGNVWEWTA 277
+PV ++ +N++GLY+M+GN WEWTA
Sbjct: 204 HGASPVAAFPAQNEYGLYDMLGNTWEWTA 232
>gi|111021278|ref|YP_704250.1| hypothetical protein RHA1_ro04301 [Rhodococcus jostii RHA1]
gi|110820808|gb|ABG96092.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length = 312
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 121/346 (34%), Positives = 161/346 (46%), Gaps = 63/346 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+M +PG TF MG+ E P VT+D F++D H+V+ +F+ FV TG+VT
Sbjct: 7 KNMAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDAHQVTVAEFRRFVKDTGHVTT 63
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE + D ++ G D + + V T
Sbjct: 64 AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 91
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
L W P W HPEG S + R HPV HVSW DA AY
Sbjct: 92 PGPVPLDDYTRWW---------SFTPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAYA 142
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ + F WG P G NVWQG FP N DG+ T
Sbjct: 143 EWAGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGFAGT 202
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTDKVK- 303
+PV ++ N + L +M GNVWEWT D++ H + AP+ + P P T +V+
Sbjct: 203 SPVGHFRPNSYELSDMAGNVWEWTTDYFTADHSASGKNVAPANSCCIPTNPRVETARVEN 262
Query: 304 ----------KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KGGS+LC YC R+R AAR +T ++S ++GFRC
Sbjct: 263 PDEPYARRVIKGGSHLCAPNYCLRYRPAARQGDTEETSTCHIGFRC 308
>gi|420933559|ref|ZP_15396834.1| hypothetical protein MM1S1510930_4400 [Mycobacterium massiliense
1S-151-0930]
gi|420936161|ref|ZP_15399430.1| hypothetical protein MM1S1520914_4609 [Mycobacterium massiliense
1S-152-0914]
gi|420943824|ref|ZP_15407079.1| hypothetical protein MM1S1530915_3953 [Mycobacterium massiliense
1S-153-0915]
gi|420946984|ref|ZP_15410234.1| hypothetical protein MM1S1540310_3957 [Mycobacterium massiliense
1S-154-0310]
gi|420953972|ref|ZP_15417214.1| hypothetical protein MM2B0626_4216 [Mycobacterium massiliense
2B-0626]
gi|420958146|ref|ZP_15421380.1| hypothetical protein MM2B0107_3556 [Mycobacterium massiliense
2B-0107]
gi|420962986|ref|ZP_15426210.1| hypothetical protein MM2B1231_4278 [Mycobacterium massiliense
2B-1231]
gi|420994089|ref|ZP_15457235.1| hypothetical protein MM2B0307_3516 [Mycobacterium massiliense
2B-0307]
gi|420999865|ref|ZP_15463000.1| hypothetical protein MM2B0912R_4532 [Mycobacterium massiliense
2B-0912-R]
gi|421004387|ref|ZP_15467509.1| hypothetical protein MM2B0912S_4219 [Mycobacterium massiliense
2B-0912-S]
gi|392138318|gb|EIU64055.1| hypothetical protein MM1S1510930_4400 [Mycobacterium massiliense
1S-151-0930]
gi|392141676|gb|EIU67401.1| hypothetical protein MM1S1520914_4609 [Mycobacterium massiliense
1S-152-0914]
gi|392145430|gb|EIU71154.1| hypothetical protein MM1S1530915_3953 [Mycobacterium massiliense
1S-153-0915]
gi|392152885|gb|EIU78592.1| hypothetical protein MM2B0626_4216 [Mycobacterium massiliense
2B-0626]
gi|392154014|gb|EIU79720.1| hypothetical protein MM1S1540310_3957 [Mycobacterium massiliense
1S-154-0310]
gi|392178647|gb|EIV04300.1| hypothetical protein MM2B0912R_4532 [Mycobacterium massiliense
2B-0912-R]
gi|392180191|gb|EIV05843.1| hypothetical protein MM2B0307_3516 [Mycobacterium massiliense
2B-0307]
gi|392193090|gb|EIV18714.1| hypothetical protein MM2B0912S_4219 [Mycobacterium massiliense
2B-0912-S]
gi|392245899|gb|EIV71376.1| hypothetical protein MM2B1231_4278 [Mycobacterium massiliense
2B-1231]
gi|392247872|gb|EIV73348.1| hypothetical protein MM2B0107_3556 [Mycobacterium massiliense
2B-0107]
Length = 309
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 121/340 (35%), Positives = 162/340 (47%), Gaps = 71/340 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG F MG N + E P R ++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 22 ELVELPGGPFPMGCN---VFYPEEMPERVSSVGPFAIERHPVTNAQFAEFVTQTGYVTVA 78
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 79 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 102
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W P G S I+ HPVV V++ DA
Sbjct: 103 RTTAGPVDLHDWQQWWDW----------VPGAYWRQPFGPGSDIDGAAEHPVVQVAYTDA 152
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG + ++ WG+ + P G AN WQG FP N A G
Sbjct: 153 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 211
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
+ T+PV ++ N FGL +M+GNVWEWT + P +PS NP
Sbjct: 212 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 264
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC +YC+R+R AARS + DS+ ++GFRC
Sbjct: 265 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 304
>gi|359798277|ref|ZP_09300851.1| hypothetical protein KYC_15057 [Achromobacter arsenitoxydans SY8]
gi|359363823|gb|EHK65546.1| hypothetical protein KYC_15057 [Achromobacter arsenitoxydans SY8]
Length = 322
Score = 185 bits (470), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 115/331 (34%), Positives = 156/331 (47%), Gaps = 58/331 (17%)
Query: 32 LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSE-EE 90
+ + E P+ V L F++D H+V+N +F+ FV ATGYVT AE+ P L
Sbjct: 7 MSQPNERPAHKVRLTGFWMDVHDVTNAEFRRFVDATGYVTTAEQKPRWEDLRPQLPPGTP 66
Query: 91 RAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRG 150
R S + F G ++ + R Y W W
Sbjct: 67 RPDDSALVAGAMVFVGTETEVSLR--------------DYSRW----------W------ 96
Query: 151 GLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCR 210
R P +W HP+G S I+ + HPVV VS+ DA AY W G RLPTEAEWE+ R
Sbjct: 97 ----RFVPGANWRHPQGPGSNIDGKDEHPVVQVSYEDAQAYAQWTGKRLPTEAEWEFAAR 152
Query: 211 GGLENRLFPWGNNLTPRGEHRANVW----QGEFPTNNTAADGYLSTAPVMSYKENKFGLY 266
GG E + WGN L P+G+ AN+W Q FP + T PV SY N +GLY
Sbjct: 153 GGYEQATYSWGNELQPQGQAMANIWDTRQQQPFPVVKD-EKVQVGTTPVGSYAPNGYGLY 211
Query: 267 NMVGNVWEWTADWWNVHHH---------------PAPSYNPKG---PTTGTDKVKKGGSY 308
+M GNVW+WTADW+ PA S++P P + +V +GGS+
Sbjct: 212 DMAGNVWQWTADWYRADAFMVQAQYRQPPTDPAGPADSFDPSDGNVPASAPKRVTRGGSF 271
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
LC++ YC +R +AR P + +LGFR
Sbjct: 272 LCSDTYCISYRASARRGTDPLNPMSHLGFRT 302
>gi|393774874|ref|ZP_10363207.1| hypothetical protein WSK_4216 [Novosphingobium sp. Rr 2-17]
gi|392719692|gb|EIZ77224.1| hypothetical protein WSK_4216 [Novosphingobium sp. Rr 2-17]
Length = 329
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 127/352 (36%), Positives = 171/352 (48%), Gaps = 67/352 (19%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ MV + G TF MG+++ E P+R V +D F++D V+N QF FV+ATGYVT
Sbjct: 16 QGMVWIAGGTFTMGSDRHY---PEEAPTRRVAVDGFWIDAAPVTNRQFAAFVAATGYVTV 72
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE D + + RA S + H+ PV +
Sbjct: 73 AEIAPDPKDYPGMQPGMARAG---------------SLVFHKTATPV-----DTGNPANW 112
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHR--MNHPVVHVSWNDAVA 190
WR +E+G W HP G DS IE +HPVV V++ DA A
Sbjct: 113 WR---------FEFGA------------DWRHPLGPDSDIESLDLWDHPVVQVAYADAQA 151
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G LPTEAE+E+ RGGL+ + + WG+ L P G AN WQG FP N DG+
Sbjct: 152 YADWAGKVLPTEAEFEFAARGGLDGKDYAWGDQLAPGGAMMANYWQGLFPFANQLLDGWE 211
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------------------PSY 290
T+PV S+ N +GL++M+GN WEWT+DWW+ PA SY
Sbjct: 212 RTSPVGSFPANGYGLFDMIGNTWEWTSDWWSERSMPADKKPGGSCCIPSNPRGGNLKDSY 271
Query: 291 NPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+P P KV KGGS+LC YC R+R A+R D+S ++GFRC +
Sbjct: 272 DPTQPAVRIGRKVLKGGSHLCAANYCQRYRPASRHPEMIDTSTSHIGFRCVS 323
>gi|397680590|ref|YP_006522125.1| serine/threonine-protein kinase pkn1 [Mycobacterium massiliense
str. GO 06]
gi|418251298|ref|ZP_12877495.1| hypothetical protein MAB47J26_20891 [Mycobacterium abscessus 47J26]
gi|353449123|gb|EHB97522.1| hypothetical protein MAB47J26_20891 [Mycobacterium abscessus 47J26]
gi|395458855|gb|AFN64518.1| Serine/threonine-protein kinase pkn1 [Mycobacterium massiliense
str. GO 06]
Length = 292
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 121/340 (35%), Positives = 162/340 (47%), Gaps = 71/340 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG F MG N + E P R ++ F +++H V+N QF EFV+ TGYVT A
Sbjct: 5 ELVELPGGPFPMGCN---VFYPEEMPERVSSVGPFAIERHPVTNAQFAEFVTQTGYVTVA 61
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D DM D + M T+
Sbjct: 62 ERAPDP--------------------DMYPGASPDDLVPGAM----------------TF 85
Query: 134 RGARLPTEAE-----WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
R P + W++ P W P G S I+ HPVV V++ DA
Sbjct: 86 RTTAGPVDLHDWQQWWDW----------VPGAYWRQPFGPGSDIDGAAEHPVVQVAYTDA 135
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G RLPTEAEWEY RGG + ++ WG+ + P G AN WQG FP N A G
Sbjct: 136 AAYARWAGRRLPTEAEWEYAARGG-SDTVYAWGDEVAPEGRLMANTWQGRFPYRNDGALG 194
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APSYNPKGPTTGT 299
+ T+PV ++ N FGL +M+GNVWEWT + P +PS NP
Sbjct: 195 WHGTSPVGTFPANGFGLLDMIGNVWEWT----STRFQPGAGTPRSCCSPSDNPD---PSV 247
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC +YC+R+R AARS + DS+ ++GFRC
Sbjct: 248 IQVLKGGSHLCAPEYCHRYRPAARSPQSQDSATTHIGFRC 287
>gi|377571511|ref|ZP_09800629.1| putative sulfatase-modifying factor [Gordonia terrae NBRC 100016]
gi|377531341|dbj|GAB45794.1| putative sulfatase-modifying factor [Gordonia terrae NBRC 100016]
Length = 319
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 118/332 (35%), Positives = 161/332 (48%), Gaps = 60/332 (18%)
Query: 14 DMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ V +P F MG DGE P V L+AF +D V+N F FV+ATG+ T+
Sbjct: 34 ETVSVPAGVFDMGDAFGEGYRTDGETPVHEVELNAFSIDTTAVTNAAFASFVAATGHRTD 93
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE FG + VF + R PV W AV +
Sbjct: 94 AETFGGSAVFHTYATAPGR--------------------------PVPGTPWWLAVDGAS 127
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR HP G ST++ +HPVVHVS DA AYC
Sbjct: 128 WR-----------------------------HPAGPGSTLDGLADHPVVHVSHRDAQAYC 158
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEA+WEY RGG +PWG+ + R +++G+FP T G + T
Sbjct: 159 DWAGRALPTEAQWEYAARGGRRGARYPWGDEPPTADDPRCTIFRGDFPNEPT---GPVGT 215
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGPTTGTDKVKKGGSYLCN 311
PV +++ N GLY GNVWEW AD ++ ++ +P GP G+ +V +GGS+LC+
Sbjct: 216 TPVRTFEPNGHGLYQCAGNVWEWCADRFSARYYRVSDRTDPSGPARGSARVLRGGSHLCH 275
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+ YC+R+R AARS NTP+S+A N+GFR +
Sbjct: 276 DSYCHRYRVAARSHNTPESTASNIGFRTVGPR 307
>gi|118469714|ref|YP_885618.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
gi|399985615|ref|YP_006565963.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
gi|441204504|ref|ZP_20972070.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis MKD8]
gi|118171001|gb|ABK71897.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
gi|399230175|gb|AFP37668.1| Sulfatase-modifying factor 1 [Mycobacterium smegmatis str. MC2 155]
gi|440629433|gb|ELQ91222.1| sulfatase-modifying factor 1 [Mycobacterium smegmatis MKD8]
Length = 291
Score = 185 bits (469), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 171/332 (51%), Gaps = 49/332 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V L G TFRMG+ E P V++ F +++H V+N QF EFV+ATGYVT A
Sbjct: 4 ELVALDGGTFRMGSQD---FYPEEAPVHEVSVAPFSIERHPVTNAQFAEFVAATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D F P + +E + V F+ + + R D + TW
Sbjct: 61 EQELDPAAF-PGVPADELVPGALV------FQPTEGPVNLR-----------DWRQWWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P W HP+G S+I+ +HPVV V++ DA AY
Sbjct: 103 -----------------------VPGACWKHPKGPGSSIDDVPDHPVVQVAYPDAAAYAA 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY R G ++ WG+++ P G+ AN WQG FP N A G+ T+
Sbjct: 140 WAGRRLPTEAEWEYAARAGATT-VYAWGDDVRPDGQLMANTWQGRFPYRNDGALGWTGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD----KVKKGGSYL 309
PV ++ N FGL +M+GNVWEWT+ + H P + P G D + KGGS+L
Sbjct: 199 PVGTFPPNGFGLVDMIGNVWEWTSTRYTPRHSRRPEVSGCCPAPGGDPSIHQTLKGGSHL 258
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
C +YC+R+R AARS + DS+ ++GFRC A
Sbjct: 259 CAPEYCHRYRPAARSSQSQDSATTHIGFRCVA 290
>gi|427407746|ref|ZP_18897948.1| hypothetical protein HMPREF9718_00422 [Sphingobium yanoikuyae ATCC
51230]
gi|425713709|gb|EKU76721.1| hypothetical protein HMPREF9718_00422 [Sphingobium yanoikuyae ATCC
51230]
Length = 320
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 130/353 (36%), Positives = 171/353 (48%), Gaps = 62/353 (17%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
AP +M+ +PG F MG++ E P+ V +DAF++D V+N F+ FV
Sbjct: 8 APAATAPDNMIHIPGGRFIMGSDH---HYPEEAPAHPVEVDAFWIDATPVTNRDFRRFVE 64
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
ATG+ T AE P + A+ DM R L T R PV W+
Sbjct: 65 ATGHRTTAE-------IAPNPDDYPGAQP-----DMLRPASLVFTPTER---PVPLHDWS 109
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
W W Y W HP G DS+I+ +HPVVHV++
Sbjct: 110 ------QW----------WRYVFDA----------DWRHPLGPDSSIDDLDDHPVVHVTY 143
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
+DA AY W G LPTEAEWE+ RGG++ + F WG+ L P G+ AN WQG FP+ N
Sbjct: 144 DDAAAYAAWAGKALPTEAEWEFAARGGVDGQEFAWGDELLPDGKAMANFWQGNFPSENLL 203
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----------------- 288
DGY T+PV S+ N +GL +M+GNVWEWT D++ H P
Sbjct: 204 IDGYERTSPVGSFPANGYGLSDMIGNVWEWTEDFYAARHQADPSRPCCAPRNPLNQAREA 263
Query: 289 SYNPKGP-TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
SY+P P +V KGGS+LC YC R+R AAR DSS ++GFRC
Sbjct: 264 SYDPHQPHILIPRRVLKGGSHLCAPSYCRRYRPAARHAQAIDSSTSHIGFRCV 316
>gi|148997850|ref|ZP_01825414.1| hypothetical protein CGSSp11BS70_03044 [Streptococcus pneumoniae
SP11-BS70]
gi|168575014|ref|ZP_02720977.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae MLV-016]
gi|307068416|ref|YP_003877382.1| hypothetical protein SPAP_1797 [Streptococcus pneumoniae AP200]
gi|419471684|ref|ZP_14011543.1| hypothetical protein SPAR15_1638 [Streptococcus pneumoniae GA07914]
gi|419504523|ref|ZP_14044191.1| hypothetical protein SPAR105_1607 [Streptococcus pneumoniae
GA47760]
gi|421314663|ref|ZP_15765250.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA47562]
gi|147756349|gb|EDK63391.1| hypothetical protein CGSSp11BS70_03044 [Streptococcus pneumoniae
SP11-BS70]
gi|183578750|gb|EDT99278.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae MLV-016]
gi|306409953|gb|ADM85380.1| Uncharacterized conserved protein [Streptococcus pneumoniae AP200]
gi|379546400|gb|EHZ11539.1| hypothetical protein SPAR15_1638 [Streptococcus pneumoniae GA07914]
gi|379605911|gb|EHZ70661.1| hypothetical protein SPAR105_1607 [Streptococcus pneumoniae
GA47760]
gi|395913348|gb|EJH24201.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA47562]
Length = 282
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 160/335 (47%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E P V++ F + V+N +F +FV TGY T A
Sbjct: 3 MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ +FVF + E ER EG HP W C
Sbjct: 63 ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W HP G ++ + +HPVVHV+ DA+A+C
Sbjct: 100 ----------------------------WKHPYGENNNLVGLEDHPVVHVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|152988165|ref|YP_001347985.1| sulfatase-modifying factor 1 [Pseudomonas aeruginosa PA7]
gi|150963323|gb|ABR85348.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Pseudomonas aeruginosa PA7]
Length = 317
Score = 184 bits (468), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 132/358 (36%), Positives = 170/358 (47%), Gaps = 69/358 (19%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R DM LPG FRMG+ + E P R V +D F++D V+N QF FV ATGY
Sbjct: 7 RTPDMRYLPGGVFRMGSER---FYPEERPVRQVRVDPFWIDVTPVTNRQFAAFVEATGYR 63
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
T AEK + + + +E RA S V F + + + D
Sbjct: 64 TFAEKAPEPADYPGMPAEMARAG-SLV------FTPVSAAV--------------DLADP 102
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W W + R W W P G S+I HPVVHV++ DA A
Sbjct: 103 SRW----------WSF--------RFGAW--WREPLGPGSSIAGLEEHPVVHVAYEDAAA 142
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G LP+EAEWEY RGG + F WG+ L P G+ AN WQG FP N DG+
Sbjct: 143 YAAWVGKALPSEAEWEYAARGGRDGEEFAWGDTLEPGGQILANYWQGRFPLENLLLDGWE 202
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP---------------------S 289
T+PV ++ N GL++M+GNVWEWTADW+ PA S
Sbjct: 203 RTSPVGAFPANGHGLFDMIGNVWEWTADWF---RQPAATDRQASSCCASRNPRGGREEDS 259
Query: 290 YNPKGPTTGTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
Y+P P KV KGGS+LC E YC R+R AAR + D++ G++GFRC + P
Sbjct: 260 YDPALPALRIGRKVLKGGSHLCAENYCQRYRPAARIPQSIDTTTGHVGFRCVRREPPA 317
>gi|67924262|ref|ZP_00517699.1| Protein of unknown function DUF323 [Crocosphaera watsonii WH 8501]
gi|67853877|gb|EAM49199.1| Protein of unknown function DUF323 [Crocosphaera watsonii WH 8501]
Length = 224
Score = 184 bits (468), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 92/196 (46%), Positives = 127/196 (64%), Gaps = 15/196 (7%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W HP G DS I+ + N+PV+H++++DAVAY W+G LPTEA+WEY RGGL+++ F W
Sbjct: 32 NWQHPYGPDSNIKGKDNYPVIHIAYDDAVAYANWKGKTLPTEAQWEYAGRGGLKDQDFSW 91
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GN + + +AN WQG FP NT DG+L TAPV S+ N +GLY+M GNVWE T+DW+
Sbjct: 92 GNRYSAK---KANTWQGIFPFLNTKQDGHLGTAPVESFPPNGYGLYDMTGNVWELTSDWY 148
Query: 281 NVHH-------HPA-----PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTP 328
+V H +P S++P P+ G+ V KGGSYLC + YC R+R AAR P
Sbjct: 149 SVGHGGKEHSLNPTGPTKNASFDPTKPSEGSLHVIKGGSYLCAKNYCSRYRPAARESQAP 208
Query: 329 DSSAGNLGFRCAADKG 344
D+ ++GFR + G
Sbjct: 209 DTGTTHVGFRLVSVDG 224
>gi|303254682|ref|ZP_07340784.1| hypothetical protein CGSSpBS455_04421 [Streptococcus pneumoniae
BS455]
gi|303258747|ref|ZP_07344727.1| hypothetical protein CGSSp9vBS293_06449 [Streptococcus pneumoniae
SP-BS293]
gi|303261911|ref|ZP_07347857.1| hypothetical protein CGSSp14BS292_10874 [Streptococcus pneumoniae
SP14-BS292]
gi|303263773|ref|ZP_07349695.1| hypothetical protein CGSSpBS397_08169 [Streptococcus pneumoniae
BS397]
gi|303265692|ref|ZP_07351591.1| hypothetical protein CGSSpBS457_09620 [Streptococcus pneumoniae
BS457]
gi|303268605|ref|ZP_07354397.1| hypothetical protein CGSSpBS458_09836 [Streptococcus pneumoniae
BS458]
gi|387759911|ref|YP_006066889.1| hypothetical protein SPNINV200_16260 [Streptococcus pneumoniae
INV200]
gi|418140131|ref|ZP_12776956.1| hypothetical protein SPAR28_1772 [Streptococcus pneumoniae GA13338]
gi|418181162|ref|ZP_12817731.1| hypothetical protein SPAR74_1775 [Streptococcus pneumoniae GA41688]
gi|418221825|ref|ZP_12848478.1| hypothetical protein SPAR104_1759 [Streptococcus pneumoniae
GA47751]
gi|419515310|ref|ZP_14054935.1| hypothetical protein SPAR152_1676 [Streptococcus pneumoniae
England14-9]
gi|421296637|ref|ZP_15747346.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA58581]
gi|301802500|emb|CBW35259.1| conserved hypothetical protein [Streptococcus pneumoniae INV200]
gi|302598394|gb|EFL65438.1| hypothetical protein CGSSpBS455_04421 [Streptococcus pneumoniae
BS455]
gi|302636994|gb|EFL67483.1| hypothetical protein CGSSp14BS292_10874 [Streptococcus pneumoniae
SP14-BS292]
gi|302640248|gb|EFL70703.1| hypothetical protein CGSSpBS293_06449 [Streptococcus pneumoniae
SP-BS293]
gi|302641884|gb|EFL72239.1| hypothetical protein CGSSpBS458_09836 [Streptococcus pneumoniae
BS458]
gi|302644819|gb|EFL75067.1| hypothetical protein CGSSpBS457_09620 [Streptococcus pneumoniae
BS457]
gi|302646811|gb|EFL77036.1| hypothetical protein CGSSpBS397_08169 [Streptococcus pneumoniae
BS397]
gi|353843234|gb|EHE23279.1| hypothetical protein SPAR74_1775 [Streptococcus pneumoniae GA41688]
gi|353875135|gb|EHE54989.1| hypothetical protein SPAR104_1759 [Streptococcus pneumoniae
GA47751]
gi|353904910|gb|EHE80360.1| hypothetical protein SPAR28_1772 [Streptococcus pneumoniae GA13338]
gi|379635859|gb|EIA00418.1| hypothetical protein SPAR152_1676 [Streptococcus pneumoniae
England14-9]
gi|395895510|gb|EJH06485.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA58581]
Length = 282
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 160/335 (47%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E P V++ F + V+N +F +FV TGY T A
Sbjct: 3 MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ +FVF + E ER EG HP W C
Sbjct: 63 ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W HP G +S + +HPVV+V+ DA+A+C
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVYVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|194384818|dbj|BAG59569.1| unnamed protein product [Homo sapiens]
Length = 286
Score = 184 bits (467), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 122/336 (36%), Positives = 161/336 (47%), Gaps = 83/336 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 30 MVQLQGGRFLMGTNSPD-SRDGEGPVRGATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q MK G S I R+ HPV+H+SWNDA AYC WR
Sbjct: 89 MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 144
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+W P +R N W
Sbjct: 145 GKRLPTEEEWEFAARGGLKGQVYPWGNWFQP--------NRTN---------------LW 181
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G P+G+ + + G P N A
Sbjct: 182 QGK---------------------------FPKGDKAEDGFHGVSPVNAFPA-------- 206
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
+N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 207 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 248
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 249 SANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 284
Score = 41.2 bits (95), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 235 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 279
>gi|418183336|ref|ZP_12819893.1| hypothetical protein SPAR78_1744 [Streptococcus pneumoniae GA43380]
gi|353847765|gb|EHE27785.1| hypothetical protein SPAR78_1744 [Streptococcus pneumoniae GA43380]
Length = 282
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 159/335 (47%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E P V++ F + V+N +F +FV TGY T A
Sbjct: 3 MKKIKGGTFMMGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ +FVF + E ER EG HP W C
Sbjct: 63 ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVPNAC-- 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W HP G +S + +HPVVHV+ DA+A+C
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NT DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTGLDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|194248087|ref|NP_001035934.2| sulfatase-modifying factor 2 isoform c precursor [Homo sapiens]
Length = 305
Score = 184 bits (467), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 158/336 (47%), Gaps = 83/336 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 49 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q MK G S I R+ HPV+H+SWNDA AYC WR
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 163
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+W P
Sbjct: 164 GKRLPTEEEWEFAARGGLKGQVYPWGNWFQP----------------------------- 194
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
NR W P+G+ + + G P N A
Sbjct: 195 --------------------NRTNLWQGKF-PKGDKAEDGFHGVSPVNAFPA-------- 225
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
+N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 226 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 267
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 268 SANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 303
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 85/232 (36%), Positives = 116/232 (50%), Gaps = 65/232 (28%)
Query: 163 LHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGN 222
+ P G S I R+ HPV+HVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWGN
Sbjct: 131 MKPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGN 190
Query: 223 NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWN 281
P +R N+WQG+FP + A DG+ +PV ++ +N +GLY+++GNVWE W
Sbjct: 191 WFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWE-----WT 242
Query: 282 VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+ A + + V +G S++ +T D SA
Sbjct: 243 ASPYQAAEQDMR--------VLRGASWI----------------DTADGSAN-------- 270
Query: 342 DKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+R R R NTPDS++ NLGFRCAAD
Sbjct: 271 ------------------------HRARVTTRMGNTPDSASDNLGFRCAADA 298
>gi|163849970|ref|YP_001638013.1| hypothetical protein Mext_0527 [Methylobacterium extorquens PA1]
gi|163661575|gb|ABY28942.1| protein of unknown function DUF323 [Methylobacterium extorquens
PA1]
Length = 325
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 118/352 (33%), Positives = 164/352 (46%), Gaps = 64/352 (18%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
E M + G TFRMG+++ E P V +D F++D+ V+N QF +FV AT
Sbjct: 15 AEDTSGMRWISGGTFRMGSDRHY---PEEAPIHRVAVDGFWIDEAPVTNRQFSQFVLATA 71
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y T AE + D K + G + H+ + ++
Sbjct: 72 YRTVAE----------------------IEPDPKDYPGA-----------LPHLLFAGSL 98
Query: 129 AYCTWRG-ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ G L ++W +G W P G S+I+ +HPVV V++ D
Sbjct: 99 VFTPTPGPVDLRVCSQWWRFLKGA---------DWRRPYGPGSSIKGLDDHPVVQVAYKD 149
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A AY W G LPTEAEWE+ RGGL+ F WG + P G AN+WQGEFP N +
Sbjct: 150 AEAYSDWAGKALPTEAEWEFAARGGLDGAEFAWGADFMPEGRQMANIWQGEFPHQNLSRR 209
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN---PKGPTTGTD---- 300
+ T PV ++ N +GL +M+GNVWEWT+DW+ H P+ P G +
Sbjct: 210 AFKRTTPVGAFPPNAYGLVDMIGNVWEWTSDWFAEKHQGDAKKACCIPQNPRGGREADSL 269
Query: 301 -----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
KV KGGS+LC YC R+R AAR D+S ++GFRC A
Sbjct: 270 DPRDLQSLIPRKVVKGGSHLCAPTYCRRYRPAARHPQPIDTSMSHVGFRCVA 321
>gi|419960796|ref|ZP_14476811.1| hypothetical protein WSS_A01770 [Rhodococcus opacus M213]
gi|414574017|gb|EKT84695.1| hypothetical protein WSS_A01770 [Rhodococcus opacus M213]
Length = 304
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 160/344 (46%), Gaps = 63/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TF MG+ E P VT+DAF++D H+V+ +F+ FV TG+VT AE
Sbjct: 1 MEWIPGGTFWMGSED---FYPEERPVHQVTVDAFWMDSHQVTVAEFRRFVKDTGHVTTAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D ++ G D + + V T
Sbjct: 58 ----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPTPG 85
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L W G W HPEG S + R HPV HVSW DA AY W
Sbjct: 86 PVPLDDYTRWWSFTAGA---------DWRHPEGPGSNVGGRERHPVTHVSWFDARAYAEW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGL+ + F WG P G NVWQG FP N DGY T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGYAGTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGT------- 299
V ++ N + L +M GNVWEWT D++ V H + AP+ + P P T
Sbjct: 197 VGHFRPNGYDLSDMAGNVWEWTTDYFTVDHSESGKNVAPTSSCCIPTNPRVDTARAENPD 256
Query: 300 ----DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R +AR +T ++S ++GFRC
Sbjct: 257 EPYARRVIKGGSHLCAPNYCLRYRPSARQGDTEETSTCHIGFRC 300
>gi|299115525|emb|CBN75729.1| Zgc:136465 protein [Ectocarpus siliculosus]
Length = 243
Score = 184 bits (466), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 89/177 (50%), Positives = 111/177 (62%), Gaps = 15/177 (8%)
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL--------------FPWGNNLTP 226
+ VSWNDA AYC WRG RLP+EAEWE +G +PWG++LTP
Sbjct: 20 IQVSWNDAKAYCRWRGGRLPSEAEWERAAQGDPVEAAGEGEGEERGRAMPRYPWGDDLTP 79
Query: 227 RGEHRANVWQGEFPTNNTAADGYLSTAPVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHH 285
G+HRAN+WQG FPT N A DG+ T+PV ++ +N GL + VGN WEW DWW
Sbjct: 80 EGQHRANIWQGTFPTLNLAEDGFPYTSPVDAFPPQNSLGLRDAVGNAWEWVEDWWTPDRS 139
Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
NP+GP GT+K KKGGS+LC+ YCYR+R AAR+++ DS N GFRCAAD
Sbjct: 140 KYKQNNPRGPKRGTEKTKKGGSFLCHHTYCYRYRPAARTKSDIDSGTSNQGFRCAAD 196
>gi|441510337|ref|ZP_20992244.1| hypothetical protein GOACH_22_00020 [Gordonia aichiensis NBRC
108223]
gi|441445472|dbj|GAC50205.1| hypothetical protein GOACH_22_00020 [Gordonia aichiensis NBRC
108223]
Length = 328
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 170/340 (50%), Gaps = 58/340 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V L G F MG+++ E P +DAF +++H V+N Q+ FV+ATGYVT
Sbjct: 32 ELVELAGGRFSMGSDR---FYPEERPHHERRVDAFAIERHPVTNAQYAAFVAATGYVT-- 86
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
I+++ D F G D + + +V + V W
Sbjct: 87 --------------------IAELPIDPADFPGADPAL--LVPGALVFTPTSGPVDLTDW 124
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W R P SW HP G ++ + +HPV +++ DAVAY
Sbjct: 125 R--------HWW---------RWQPGASWRHPLGPETDLVGYEDHPVTQIAYADAVAYAE 167
Query: 194 WRGARLPTEAEWEYGCRGGLEN-RLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY RGG+ + WG+ L P G AN W G+FP A+DG+ T
Sbjct: 168 WAGRRLPTEAEWEYAARGGIGGAHEYAWGDELHPDGAILANTWIGDFP---YASDGWGGT 224
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKG------PTTG--TDKV 302
+ V SY N FGL +M+GNVWE T+D + H P + +P G PT +V
Sbjct: 225 SRVASYPPNGFGLVDMIGNVWERTSDVFTPRHVPPDLATVSPDGRTNLLAPTASPHVSRV 284
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGS+LC+ YC R+R AARS + DS+ +LGFRCAAD
Sbjct: 285 TKGGSHLCSPHYCRRYRPAARSPQSDDSATSHLGFRCAAD 324
>gi|297288259|ref|XP_002803310.1| PREDICTED: sulfatase-modifying factor 2-like isoform 2 [Macaca
mulatta]
gi|402863390|ref|XP_003896000.1| PREDICTED: sulfatase-modifying factor 2 isoform 3 [Papio anubis]
Length = 231
Score = 183 bits (465), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 111/294 (37%), Positives = 148/294 (50%), Gaps = 69/294 (23%)
Query: 58 TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
Q ++FV Y TEAE FG +FVFE +S+E R K +Q
Sbjct: 2 AQARDFVREKKYRTEAEMFGWSFVFEDFVSDELRNKATQ--------------------- 40
Query: 118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
P+ + W W LP E + W P G S I R+
Sbjct: 41 PMKPVLW--------W----LPVEKAF-----------------WRQPAGPGSGIRERLE 71
Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
HPV+HVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG
Sbjct: 72 HPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQG 128
Query: 238 EFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT 296
FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA +P
Sbjct: 129 RFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAA 175
Query: 297 TGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+V +G S++ + +R R R NTPDS++ NLGFRCAAD G G
Sbjct: 176 EQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRLPG 229
>gi|357975662|ref|ZP_09139633.1| hypothetical protein SpKC8_09213 [Sphingomonas sp. KC8]
Length = 318
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 127/345 (36%), Positives = 170/345 (49%), Gaps = 64/345 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ + G FRMG+++ E P+ +V +D F +D V+N QF FV+ATG+VT AE
Sbjct: 15 MIAIKGGRFRMGSDR---HYPEEAPAHDVIVDDFAIDATPVTNAQFAAFVAATGHVTFAE 71
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ + +L E +R F IE R D A+
Sbjct: 72 TPPNPDDYPGILPE-------MIRAGSLVFTPPPGPIELR-----------DPFAW---- 109
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W + W P G STI+ +HPVVH+++ DA AY W
Sbjct: 110 ---------WSFTFDA----------DWRRPYGPGSTIDGLDDHPVVHIAYADAQAYADW 150
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGG+ + + WG+ L P G H ANVWQG FP N DG+ T+P
Sbjct: 151 AGKSLPTEAEWEFAARGGIADADYAWGDELEPGGVHHANVWQGAFPWQNLGGDGWERTSP 210
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSY-----NPKGP--TTGTD---- 300
V +Y N +GL++M+GNVWEWTADW+ H PA S NP+G T D
Sbjct: 211 VRNYPANGYGLFDMIGNVWEWTADWYADRHAAKDPAASACCVPRNPRGAVETESRDPCMP 270
Query: 301 ------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 271 DLVVGRRVLKGGSHLCAPNYCQRYRPAARYAQPIDTSTSHVGFRC 315
>gi|452751219|ref|ZP_21950965.1| Sulfatase modifying factor 1 precursor [alpha proteobacterium
JLT2015]
gi|451961369|gb|EMD83779.1| Sulfatase modifying factor 1 precursor [alpha proteobacterium
JLT2015]
Length = 342
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 133/364 (36%), Positives = 175/364 (48%), Gaps = 71/364 (19%)
Query: 3 LLPAPPVE--RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
+LPA + ++ M+ +PG TF MG+++ E P R V +D F +D+ V+N F
Sbjct: 13 ILPAAAADAPQHTGMMFVPGGTFAMGSDR---FYPEEAPVRTVRVDPFRIDETPVTNADF 69
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
FV ATGY T AE D + L +E +A S + HR PV
Sbjct: 70 ARFVDATGYRTVAEIAPDPADYPGLAAEMAKAG---------------SLVFHRTRGPV- 113
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
D W W + +F W HP G DS+++ +HPV
Sbjct: 114 -----DLHDPSGW----------WSF---------VFD-ADWRHPYGPDSSLDGLDDHPV 148
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
VHV+ DA AY W G LPTEAEWE+ RGGL+ + WG+ L P GE AN WQG FP
Sbjct: 149 VHVAHADAAAYADWAGKALPTEAEWEFAARGGLDGTDYAWGDELAPDGEMLANYWQGAFP 208
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-----------HHHPAPS 289
N DGY T+PV +Y N +GL++M+GNVWEWT DW+ + P+P
Sbjct: 209 FANQLLDGYERTSPVRTYAANGYGLFDMIGNVWEWTEDWFALPPSPKKKTAKRGSGPSPC 268
Query: 290 YNPKGPTTGT--------------DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
P P GT KV KGGS+LC YC R+R AAR DS+ ++
Sbjct: 269 CVPANPRGGTRRQSVDPQDTARIARKVLKGGSHLCAANYCQRYRPAARHAQAIDSATSHI 328
Query: 336 GFRC 339
GFRC
Sbjct: 329 GFRC 332
>gi|365856093|ref|ZP_09396118.1| cysteine-type sulfatase aerobic maturase [Acetobacteraceae
bacterium AT-5844]
gi|363718511|gb|EHM01850.1| cysteine-type sulfatase aerobic maturase [Acetobacteraceae
bacterium AT-5844]
Length = 329
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 128/343 (37%), Positives = 165/343 (48%), Gaps = 62/343 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG F MG++ E P+ V +D F++D V+N F FV+ TG++T AE
Sbjct: 19 MRWIPGGRFTMGSDHHYA---EERPAHPVAVDGFWMDATPVTNAAFARFVADTGHLTLAE 75
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D ++ + E + + RM V+ R
Sbjct: 76 RPLDPALYPGAMPELL----------------VPGALVFRMTPGPVNT-----------R 108
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
R W Y P W PEG ST++ R +HPVVHV+ DA AY W
Sbjct: 109 DVR----QWWSY----------TPGACWKRPEGPGSTLDGRDDHPVVHVAHEDAAAYAAW 154
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE RGGLE F WG LTP G H AN WQG FP + AADG+ T+P
Sbjct: 155 AGKALPTEAEWERAARGGLEAAEFAWGEELTPEGRHMANTWQGPFPWRDFAADGFAGTSP 214
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKG-PTTGT-------- 299
V S+ N +GL ++ GNVWEWT DW+ H P +NP+G P G+
Sbjct: 215 VGSFPANGYGLSDVTGNVWEWTEDWYAARHEADPGKPCCAPHNPRGAPVEGSYDPHQPAI 274
Query: 300 ---DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR DS+ +LGFRC
Sbjct: 275 RIPRKVVKGGSFLCAPSYCRRYRPAARQPQMVDSAMSHLGFRC 317
>gi|351699435|gb|EHB02354.1| Sulfatase-modifying factor 1 [Heterocephalus glaber]
Length = 265
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 78/128 (60%), Positives = 97/128 (75%)
Query: 174 HRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
HR +HPV+H+SWNDAVAYCTW G RLPTEA+WEY C+ GL+NRL PW N L P+ +H AN
Sbjct: 138 HRPDHPVLHLSWNDAVAYCTWAGKRLPTEAQWEYSCQRGLQNRLLPWVNKLQPKEQHYAN 197
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK 293
+WQGEFP NT DG+ TAP + N +GLYN+VGNVWEWT+DWW ++H + NPK
Sbjct: 198 LWQGEFPVTNTGEDGFQGTAPTDVFPPNGYGLYNIVGNVWEWTSDWWTIYHLVEETLNPK 257
Query: 294 GPTTGTDK 301
GP +G D+
Sbjct: 258 GPPSGKDR 265
>gi|168486347|ref|ZP_02710855.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC1087-00]
gi|418185547|ref|ZP_12822087.1| hypothetical protein SPAR91_1736 [Streptococcus pneumoniae GA47283]
gi|419495982|ref|ZP_14035699.1| hypothetical protein SPAR97_1623 [Streptococcus pneumoniae GA47461]
gi|419530831|ref|ZP_14070357.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA40028]
gi|421213727|ref|ZP_15670681.1| hypothetical protein AMCSP12_001614 [Streptococcus pneumoniae
2070108]
gi|421216366|ref|ZP_15673274.1| hypothetical protein AMCSP04_002107 [Streptococcus pneumoniae
2070109]
gi|421302736|ref|ZP_15753400.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA17484]
gi|183570564|gb|EDT91092.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC1087-00]
gi|353848277|gb|EHE28293.1| hypothetical protein SPAR91_1736 [Streptococcus pneumoniae GA47283]
gi|379571240|gb|EHZ36198.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA40028]
gi|379594068|gb|EHZ58879.1| hypothetical protein SPAR97_1623 [Streptococcus pneumoniae GA47461]
gi|395577898|gb|EJG38429.1| hypothetical protein AMCSP04_002107 [Streptococcus pneumoniae
2070109]
gi|395578958|gb|EJG39468.1| hypothetical protein AMCSP12_001614 [Streptococcus pneumoniae
2070108]
gi|395901358|gb|EJH12294.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA17484]
Length = 282
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 159/335 (47%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E V++ F + V+N +F +FV TGY T A
Sbjct: 3 MKKIKGGTFMMGTNSEEGFLDDFEGLQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ +FVF + E ER EG HP W C
Sbjct: 63 ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W HP G +S + +HPVVHV+ DA+A+C
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|427702661|ref|YP_007045883.1| hypothetical protein Cyagr_1371 [Cyanobium gracile PCC 6307]
gi|427345829|gb|AFY28542.1| hypothetical protein Cyagr_1371 [Cyanobium gracile PCC 6307]
Length = 320
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 124/338 (36%), Positives = 165/338 (48%), Gaps = 58/338 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMV +P +F MG++ E P+ V L F++D+ V+N QFQ+FV ATG++T A
Sbjct: 25 DMVWIPPGSFEMGSDH---HYPEEAPAHRVALGGFWIDRTPVTNAQFQKFVKATGHITLA 81
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + L E L + P I D +
Sbjct: 82 ERPADPADYPGALPE------------------LLAPASIVFVPPPGPIGTGDPYRW--- 120
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W+Y P W HPEG S+I R NHPVVHV+ DA AY
Sbjct: 121 ----------WQY----------RPGACWRHPEGPGSSIRMRENHPVVHVAHGDAAAYAA 160
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G +LP+EAEWE RGG + F WG+ L P G AN +QG+FP +N+ DG+ T+
Sbjct: 161 WAGKQLPSEAEWERAARGGRQGDEFAWGDELHPGGRPMANTFQGDFPHHNSLLDGWERTS 220
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTD------------K 301
PV ++ N +GL +M+GNVWEWT DW+ H A NP+G K
Sbjct: 221 PVGAFPPNGYGLLDMIGNVWEWTDDWYGGHG--ATVANPEGCCRQASIDPGSQHGHQPRK 278
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 279 VVKGGSFLCAPSYCRRYRPAARMAQGVDTSTCHMGFRC 316
>gi|419511049|ref|ZP_14050690.1| hypothetical protein SPAR142_1717 [Streptococcus pneumoniae NP141]
gi|379631652|gb|EHZ96229.1| hypothetical protein SPAR142_1717 [Streptococcus pneumoniae NP141]
Length = 282
Score = 183 bits (464), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 119/335 (35%), Positives = 159/335 (47%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E V++ F + V+N +F +FV TGY T A
Sbjct: 3 MKKIKGGTFMMGTNYEEGFLDDFEGLQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLA 62
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ +FVF + E ER EG HP W C
Sbjct: 63 ERQEWSFVFILFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC-- 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W HP G +S + +HPVVHV+ DA+A+C
Sbjct: 100 ----------------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|407276350|ref|ZP_11104820.1| hypothetical protein RhP14_07602 [Rhodococcus sp. P14]
Length = 293
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/198 (47%), Positives = 118/198 (59%), Gaps = 16/198 (8%)
Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
P W HPEG ST+ R HPV HVS DA+AY W G LPTEAEWE+ RGGL+
Sbjct: 90 PGAQWRHPEGPGSTVAGRDRHPVTHVSHFDALAYAQWAGKELPTEAEWEFAARGGLDRAR 149
Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
F WG+ P G +RAN WQG FP NT DG++ T+PV SY+ N +GL +M GNVWEWTA
Sbjct: 150 FVWGDEFAPHGRYRANTWQGRFPWENTGEDGFVGTSPVGSYRANGYGLVDMAGNVWEWTA 209
Query: 278 DWWNVHH-----HPAPSYN---PKGPTTGTD--------KVKKGGSYLCNEQYCYRHRCA 321
D + HH AP++ P P TG +V KGGS+LC YC R+R A
Sbjct: 210 DHYTAHHGADGRDTAPAHACCIPSNPRTGAGDESGPYPRRVIKGGSHLCAPNYCLRYRPA 269
Query: 322 ARSQNTPDSSAGNLGFRC 339
AR + ++S ++GFRC
Sbjct: 270 ARQGESVETSTCHIGFRC 287
>gi|402863388|ref|XP_003895999.1| PREDICTED: sulfatase-modifying factor 2 isoform 2 [Papio anubis]
Length = 305
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 158/336 (47%), Gaps = 83/336 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 49 MVQLPGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q MK G S I R+ HPV+H+SWNDA AYC WR
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 163
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+ P
Sbjct: 164 GKRLPTEEEWEFAARGGLKGQVYPWGNRFQP----------------------------- 194
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
NR W P+G+ + + G P N A
Sbjct: 195 --------------------NRTNLWQGRF-PKGDKAEDGFHGVSPVNAFPA-------- 225
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
+N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 226 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 267
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 268 SANHRARVTTRMGNTPDSASDNLGFRCAADAGRLPG 303
Score = 41.2 bits (95), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 254 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 298
>gi|432335937|ref|ZP_19587486.1| hypothetical protein Rwratislav_13708 [Rhodococcus wratislaviensis
IFP 2016]
gi|430777142|gb|ELB92516.1| hypothetical protein Rwratislav_13708 [Rhodococcus wratislaviensis
IFP 2016]
Length = 304
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 121/344 (35%), Positives = 159/344 (46%), Gaps = 63/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TF MG+ E P VT+D F++D H+V+ +F+ FV TG+VT AE
Sbjct: 1 MAWIPGGTFWMGSED---FYPEERPVHQVTVDGFWMDTHQVTVAEFRRFVKDTGHVTTAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D ++ G D + + V T
Sbjct: 58 ----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPTPG 85
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L W G W HPEG S + R HPV HVSW DA AY W
Sbjct: 86 PVPLDDYTRWWSFTAGA---------DWRHPEGPGSNVGGRERHPVTHVSWFDARAYAEW 136
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGGL+ + F WG P G NVWQG FP N DGY T+P
Sbjct: 137 AGKDLPTEAEWEFAARGGLDRKPFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGYAGTSP 196
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGPTTGTD------ 300
V ++ N + L +M GNVWEWT D++ V H + AP+ + P P T
Sbjct: 197 VGHFRPNGYDLSDMAGNVWEWTTDYFTVDHSESGKNVAPTSSCCIPTNPRVDTARAENSD 256
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R +AR +T ++S ++GFRC
Sbjct: 257 EPYARRVIKGGSHLCAPNYCLRYRPSARQGDTEETSTCHIGFRC 300
>gi|424876140|ref|ZP_18299799.1| hypothetical protein Rleg5DRAFT_0493 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393163743|gb|EJC63796.1| hypothetical protein Rleg5DRAFT_0493 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 324
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 167/344 (48%), Gaps = 65/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF MG++ E P+ V +D F++ Q V+N QF+EFV ATGYVT AE
Sbjct: 18 MVWIPGRTFTMGSDHHY---PEEAPAHPVKVDGFWMSQTPVTNRQFKEFVEATGYVTVAE 74
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K D + + +M R L T + P + W
Sbjct: 75 KAPDPTDY------------PGAKPEMLRAGSLVFTQPKSVSGPDISQWWT--------- 113
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+++G +W P G S + +++HPVVHV++ DA AY W
Sbjct: 114 ---------FKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAYADAAAYAEW 152
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE RGGL++ + WG+ P G AN W G FP ++ G T+P
Sbjct: 153 AGLDLPTEAEWELAARGGLDDAEYSWGDEFAPGGVPMANTWSGTFPIHSMKPKGQERTSP 212
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-------NPKGPTTGTD------- 300
V S+ N FGLY+M+GNVWEWT+D+W+ HP P+ NP+G
Sbjct: 213 VRSFPPNGFGLYDMIGNVWEWTSDYWST-RHPQPAKHSCCIPSNPRGRDAEASSDPHQPE 271
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 272 IRIARRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 315
>gi|397480350|ref|XP_003811449.1| PREDICTED: sulfatase-modifying factor 2 [Pan paniscus]
Length = 327
Score = 182 bits (463), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 113/294 (38%), Positives = 150/294 (51%), Gaps = 69/294 (23%)
Query: 58 TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
Q ++FV Y TEAE FG +FVFE +S+E R K +Q
Sbjct: 98 AQARDFVREKKYRTEAEIFGWSFVFEDFVSDELRNKATQ--------------------- 136
Query: 118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
P+ + W W LP E R F W P G S I R+
Sbjct: 137 PMKSVLW--------W----LPVE-------------RAF----WRQPAGPGSGIRERLE 167
Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
HPV+HVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG
Sbjct: 168 HPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQG 224
Query: 238 EFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT 296
+FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA +P
Sbjct: 225 KFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAA 271
Query: 297 TGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+V +G S++ + +R R R NTPDS++ NLGFRCAAD G G
Sbjct: 272 EQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 325
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 276 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 320
>gi|12652935|gb|AAH00224.1| SUMF2 protein [Homo sapiens]
Length = 242
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 111/294 (37%), Positives = 149/294 (50%), Gaps = 69/294 (23%)
Query: 58 TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
Q ++FV Y TEAE FG +FVFE +S+E R K +Q
Sbjct: 13 AQARDFVREKKYRTEAEMFGWSFVFEDFVSDELRNKATQ--------------------- 51
Query: 118 PVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMN 177
P+ + W W LP E + W P G S I R+
Sbjct: 52 PMKSVLW--------W----LPVEKAF-----------------WRQPAGPGSGIRERLE 82
Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
HPV+HVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG
Sbjct: 83 HPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQG 139
Query: 238 EFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT 296
+FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA +P
Sbjct: 140 KFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAA 186
Query: 297 TGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+V +G S++ + +R R R NTPDS++ NLGFRCAAD G G
Sbjct: 187 EQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 240
>gi|229819257|ref|YP_002880783.1| hypothetical protein Bcav_0760 [Beutenbergia cavernae DSM 12333]
gi|229565170|gb|ACQ79021.1| protein of unknown function DUF323 [Beutenbergia cavernae DSM
12333]
Length = 329
Score = 182 bits (462), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 125/369 (33%), Positives = 164/369 (44%), Gaps = 90/369 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG FRMG+ E P T+ AF +D H V+N +F FV+ +GYVT AE
Sbjct: 4 MVALPGGAFRMGSRD---FYPDEGPVHERTVGAFEIDVHPVTNAEFAAFVADSGYVTVAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D + P ++ E+ V F ++ R +W A+ WR
Sbjct: 61 RPLDAMQY-PGVAPEDLVPGGLV------FTPTPGPVDLR--------NWR---AWWDWR 102
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P +W HP G S + + HPVV VS+ DA AY W
Sbjct: 103 -----------------------PGATWRHPFGPGSDLAGKHEHPVVQVSFEDATAYADW 139
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWEY R G + + WG + P G AN WQG FP +NT ADG++ T+
Sbjct: 140 AGKALPTEAEWEYAARAGGDPATYAWGEEVRPDGVLMANTWQGRFPYDNTGADGWVGTSA 199
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----------------PAPSYN------ 291
V S+ N FGL +M+GNVWEWT + H AP +
Sbjct: 200 VGSFPANAFGLVDMIGNVWEWTTTPYRPAHEVPGGGGSPSAQRSVLPMAAPGHGTDAEHT 259
Query: 292 ---------------------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS 330
P GPT + KGGS+LC +YC R+R AARS T DS
Sbjct: 260 PSCCAPQASALAGSRTPADPPPSGPT--MTRALKGGSHLCAPEYCLRYRPAARSPQTDDS 317
Query: 331 SAGNLGFRC 339
+ ++GFRC
Sbjct: 318 ATTHIGFRC 326
>gi|453067931|ref|ZP_21971216.1| hypothetical protein G418_04863 [Rhodococcus qingshengii BKS 20-40]
gi|452766550|gb|EME24795.1| hypothetical protein G418_04863 [Rhodococcus qingshengii BKS 20-40]
Length = 297
Score = 182 bits (461), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 118/322 (36%), Positives = 159/322 (49%), Gaps = 60/322 (18%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P VT+D F++D V+ +F+ FV ATG+VT AE P ++ A +
Sbjct: 10 ERPVHQVTVDGFWMDTTAVTVAEFRRFVKATGHVTTAE-------IAPDAADYPDADPAL 62
Query: 97 VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
+ F + P +S +D Y W W +
Sbjct: 63 LVPGSLVF-----------NSPPGPVSLDD---YTQW----------WSFT--------- 89
Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
P W HPEG DS I R HPV HVS+ DA AY W G LPTEAEWE+ RGGL+ +
Sbjct: 90 -PGADWRHPEGADSNIGGRERHPVTHVSYFDAQAYAAWAGKELPTEAEWEFAARGGLDRQ 148
Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
+ WG++ +P G NVWQG+FP N DGY TAPV ++ N +GL++M GNVWEWT
Sbjct: 149 AYVWGDHDSPGGRPGGNVWQGQFPWENLLEDGYERTAPVGKFRPNGYGLFDMAGNVWEWT 208
Query: 277 ADWWNVHH-----HPAPSYN---PKGPTTG-----------TDKVKKGGSYLCNEQYCYR 317
AD + H + AP+ + P+ P +V KGGS+LC YC R
Sbjct: 209 ADHYTSDHAHSSKNVAPASSCCIPRNPRAEFATEALVGEPYARRVIKGGSHLCAPNYCLR 268
Query: 318 HRCAARSQNTPDSSAGNLGFRC 339
+R AAR + +SS ++GFRC
Sbjct: 269 YRPAARQGESEESSTCHVGFRC 290
>gi|332865306|ref|XP_001160010.2| PREDICTED: sulfatase-modifying factor 2 isoform 6 [Pan troglodytes]
gi|410224494|gb|JAA09466.1| sulfatase modifying factor 2 [Pan troglodytes]
gi|410302662|gb|JAA29931.1| sulfatase modifying factor 2 [Pan troglodytes]
Length = 305
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 122/336 (36%), Positives = 159/336 (47%), Gaps = 83/336 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 49 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q MK G S I R+ HPV+H+SWNDA AYC WR
Sbjct: 108 IFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGILERLEHPVLHVSWNDARAYCAWR 163
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+W P +R N W
Sbjct: 164 GKRLPTEEEWEFAARGGLKGQVYPWGNWFQP--------NRTN---------------LW 200
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G P+G+ + + G P N A
Sbjct: 201 QGK---------------------------FPKGDKAEDGFHGVSPVNAFPA-------- 225
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
+N +GLY+++GNVWEWTA +P +V +G S++
Sbjct: 226 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 267
Query: 315 CYRH--RCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
H R R NTPDS++ NLGFRCAAD G G
Sbjct: 268 SANHQARVTTRMGNTPDSASDNLGFRCAADAGRPPG 303
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ H R R NTPDS++ NLGFRCAAD
Sbjct: 254 RVLRGASWIDTADGSANHQARVTTRMGNTPDSASDNLGFRCAADA 298
>gi|453078020|ref|ZP_21980754.1| hypothetical protein G419_21909 [Rhodococcus triatomae BKS 15-14]
gi|452757655|gb|EME16057.1| hypothetical protein G419_21909 [Rhodococcus triatomae BKS 15-14]
Length = 297
Score = 181 bits (460), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 113/321 (35%), Positives = 153/321 (47%), Gaps = 59/321 (18%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P V +D F++D H V+ +F+ FV TG+VT AE D + P + S
Sbjct: 10 ERPVHQVHVDGFWMDTHPVTVAEFRRFVKDTGHVTVAETAPDPADY-PDADPADLVPGSL 68
Query: 97 VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
V P +S +D + +W
Sbjct: 69 V-----------------FTPPSGPVSLDDYRQWWSW----------------------- 88
Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
P W HPEG DS R HPV HV+++DA+AY W G LPTEA+WE+ RGGL+
Sbjct: 89 VPGADWRHPEGPDSNAGGRERHPVTHVAYSDALAYARWAGKELPTEAQWEFAARGGLDRA 148
Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
++ WG+ P G H+AN WQG FP N DG+ T+PV S++ N +GL +M GNVW+WT
Sbjct: 149 IYVWGDEFAPGGRHQANTWQGSFPWQNLVEDGFPGTSPVGSFRPNGYGLVDMAGNVWQWT 208
Query: 277 ADWWNVHH-------HPA-PSYNPKGP----------TTGTDKVKKGGSYLCNEQYCYRH 318
D H PA P P+ P T +V KGGS+LC YC R+
Sbjct: 209 TDHHTADHGASGRNVAPASPCCIPRNPRVEAAVRRDGETYPRRVIKGGSHLCAPNYCLRY 268
Query: 319 RCAARSQNTPDSSAGNLGFRC 339
R AAR +T ++S ++GFRC
Sbjct: 269 RPAARQGDTEETSTCHIGFRC 289
>gi|397734414|ref|ZP_10501122.1| hypothetical protein JVH1_5610 [Rhodococcus sp. JVH1]
gi|396929722|gb|EJI96923.1| hypothetical protein JVH1_5610 [Rhodococcus sp. JVH1]
Length = 317
Score = 181 bits (460), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 121/346 (34%), Positives = 156/346 (45%), Gaps = 63/346 (18%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K+M + G TF MG+ E P VT+D F++D H+V+ +F+ FV TG+VT
Sbjct: 12 KNMEWISGGTFWMGSED---FYPEERPVHQVTVDGFWMDSHQVTVAEFRRFVKDTGHVTT 68
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE + D ++ G D + + V T
Sbjct: 69 AE----------------------IAPDPAQYPGADPAL----------LVPGSLVFTPT 96
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
L W P W HPEG S + R HPV HVSW DA AY
Sbjct: 97 PGPVPLDDYTRWW---------SFTPGADWRHPEGPGSNVGGRERHPVTHVSWFDARAYA 147
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGL+ + F WG P G NVWQG FP N DG+ T
Sbjct: 148 EWAGKDLPTEAEWEFAARGGLDRKQFVWGEEHEPGGRPGGNVWQGRFPWENLLEDGFAGT 207
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYN---PKGPTTG------ 298
+PV Y+ N + L +M GNVWEWTAD + H P+++ P P
Sbjct: 208 SPVGRYQPNGYRLSDMAGNVWEWTADHYTADHSAGGKNVTPAHSCCIPANPRVDAMQIND 267
Query: 299 -----TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
T +V KGGS+LC YC R+R +AR T D+S +GFRC
Sbjct: 268 PDEPYTRRVIKGGSHLCAPNYCLRYRPSARQGETEDTSTCQIGFRC 313
>gi|398385642|ref|ZP_10543661.1| hypothetical protein PMI04_03371 [Sphingobium sp. AP49]
gi|397720168|gb|EJK80728.1| hypothetical protein PMI04_03371 [Sphingobium sp. AP49]
Length = 322
Score = 181 bits (458), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 122/343 (35%), Positives = 163/343 (47%), Gaps = 62/343 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F MG+++ E P+ V +D F++D V+N ++ FV ATG+VT A+
Sbjct: 19 MVHIPGGRFIMGSDR---HYPEEAPAHPVQVDGFWMDATPVTNRDYRRFVEATGHVTSAQ 75
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + L E R S + PV W+ W
Sbjct: 76 ISPDPADYPGALPEMLRPA---------------SLVFMPTPKPVPLNDWS------QW- 113
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y W HP G DS++ +HPVVHV++ DA+AY W
Sbjct: 114 ---------WRYVFDA----------DWQHPLGPDSSVAAIDDHPVVHVNYADAIAYAAW 154
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE+ RGG + + WG+ L P G+ AN WQG FP N DGY T+P
Sbjct: 155 AGKALPTEAEWEFAARGGFDGLEYAWGDELMPDGKALANFWQGAFPAENLLIDGYERTSP 214
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----------------SYNPKGPTT 297
V Y N +GL++M+GNVWEWT D++ H P SY+P+ P
Sbjct: 215 VGHYPANGYGLHDMIGNVWEWTEDFYAARHQADPARPCCAPRNPRNDASDGSYDPQQPQI 274
Query: 298 GTD-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 275 RIPRKVLKGGSHLCAPNYCQRYRPAARHAQPVDTSTSHIGFRC 317
>gi|120405591|ref|YP_955420.1| hypothetical protein Mvan_4639 [Mycobacterium vanbaalenii PYR-1]
gi|119958409|gb|ABM15414.1| protein of unknown function DUF323 [Mycobacterium vanbaalenii
PYR-1]
Length = 295
Score = 181 bits (458), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 120/333 (36%), Positives = 160/333 (48%), Gaps = 56/333 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG T +G++ E P+ VT+D F++ +V+N +F EFV+ATGYVT AE
Sbjct: 8 MVWIPGQTAILGSDS---HYPEEAPAHPVTVDGFWMQPTQVTNLEFAEFVAATGYVTAAE 64
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLD--STIEHRMHHPVVHISWNDAVAYCT 132
+ ++ H E L S + R PV N + T
Sbjct: 65 R-----------------PVNPADHPGAPRENLQPGSMVFTRTGGPVDLRHLN---LWWT 104
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W P SW HP G S+I+ R++HPVVHV + DA +Y
Sbjct: 105 WT-----------------------PGASWRHPVGPASSIDKRLDHPVVHVGYEDAESYA 141
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE RGGL + WG+ G AN W G+FP GY T
Sbjct: 142 AWAGLELPTEAEWETAARGGLTAATYTWGDVPEQPGRQLANYWHGDFPWRPD--RGYGRT 199
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGG 306
PV S+ N++GL++M GNVWEWT DW+ P SY+ K P +V KGG
Sbjct: 200 RPVGSFPPNRYGLFDMAGNVWEWTTDWYGDRRDYQPCCAAESYDAKQPQFEVPRRVVKGG 259
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
S+LC + YC R+R AAR D+ ++GFRC
Sbjct: 260 SFLCADSYCQRYRPAARRPQPVDTGMSHIGFRC 292
>gi|332028607|gb|EGI68643.1| Sulfatase-modifying factor 1 [Acromyrmex echinatior]
Length = 177
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 93/216 (43%), Positives = 120/216 (55%), Gaps = 50/216 (23%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + T+ +GT+ P+ + DGE P R V LD F++D+ EVSN +F FV+ATGY+TEAE
Sbjct: 1 MVKINAGTYFIGTDNPVFVADGEGPKREVVLDNFFIDKFEVSNKEFMTFVNATGYITEAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FGD+FVFE LL++ + K+++V
Sbjct: 61 NFGDSFVFEGLLTQTTKDKVTKVV------------------------------------ 84
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
++A W L SW HPEG +S I RM HPV+HVSWNDAVAYC W
Sbjct: 85 -----SQAPWWL---------LVKQTSWQHPEGPNSNITFRMEHPVIHVSWNDAVAYCNW 130
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
G RLPTEAEWE CRG L +RL+PWGN L P+ +H
Sbjct: 131 MGKRLPTEAEWEVACRGELSDRLYPWGNKLMPKNQH 166
>gi|345320582|ref|XP_001517991.2| PREDICTED: sulfatase-modifying factor 2-like [Ornithorhynchus
anatinus]
Length = 462
Score = 180 bits (457), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 122/338 (36%), Positives = 164/338 (48%), Gaps = 80/338 (23%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
A P + V L G TFRMGT+ P +DGE PSR VT+ F +D+ V+N F+EFV
Sbjct: 196 AGPGQEADGTVRLLGGTFRMGTDSPD-GRDGEGPSRTVTVRPFAVDRFPVTNRDFREFVR 254
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
Y TEAE FG +FVFE ++ E + K+ + G S I+ R+ HPV+H+SWN
Sbjct: 255 EKKYQTEAETFGWSFVFEDFVAPELKEKVKEKLEPA----GPGSGIKERLDHPVLHVSWN 310
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
DA A+C WRG RLP+E EWE+ RGGL R++PWG P +R N
Sbjct: 311 DARAFCAWRGKRLPSEEEWEFAARGGLSGRMYPWGDDFQP--------NRSN-------- 354
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
W+G FP ++ G H A+ G FP N+
Sbjct: 355 -------LWQGG--------------------FP-KTDVAEDGFHGASP-VGAFPAQNS- 384
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKG 305
FGLY+++GNVWEWTA + +P P G +V +G
Sbjct: 385 -----------------FGLYDLMGNVWEWTA-----SEYRSPRPGPAG-----MRVLRG 417
Query: 306 GSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
S++ + +R R R NTPDS++ NLGFRCAA
Sbjct: 418 ASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAA 455
>gi|378717438|ref|YP_005282327.1| putative sulfatase-modifying factor 1 [Gordonia polyisoprenivorans
VH2]
gi|375752141|gb|AFA72961.1| putative sulfatase-modifying factor 1 [Gordonia polyisoprenivorans
VH2]
Length = 313
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 127/343 (37%), Positives = 168/343 (48%), Gaps = 67/343 (19%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D+V +P F MG+++ E P+ +++F L+ H V+N QF FV+ TGY T A
Sbjct: 23 DVVAIPTADFEMGSDR---FYAEERPAHPRHVESFSLEVHPVTNAQFAAFVADTGYRTVA 79
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ PL D F G D + VV + V W
Sbjct: 80 ER--------PL--------------DPADFPGADPA--ELVPGAVVFTPTDGPVDLTDW 115
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R A W Y P SW HP G DS ++ R++HPV +++ DA AY
Sbjct: 116 R-------AWWRY----------VPGTSWRHPLGPDSGVDDRLDHPVTQIAYADADAYAR 158
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE RGGL+ + WG+ L P G AN W G FP N +G+ T+
Sbjct: 159 WAGRRLPTEAEWECAARGGLDGHDYAWGDELRPDGAVMANTWIGRFPYRN---EGWGGTS 215
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------------APSYNPKGPTTG 298
PV Y N FGL++M+GNVWE TA+ + H P APS +P
Sbjct: 216 PVGYYPPNPFGLFDMIGNVWERTAEVFTPRHVPPDMSTVTAGGRQNLLAPSTSPT----- 270
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+V KGGS+LC+ QYC R+R AARS + DS+ +LGFRC A
Sbjct: 271 VLRVTKGGSHLCSPQYCRRYRPAARSPQSDDSATSHLGFRCVA 313
>gi|417995048|ref|ZP_12635352.1| sulfatase modifying factor [Lactobacillus casei M36]
gi|417997938|ref|ZP_12638170.1| sulfatase modifying factor [Lactobacillus casei T71499]
gi|410538820|gb|EKQ13365.1| sulfatase modifying factor [Lactobacillus casei M36]
gi|410542173|gb|EKQ16631.1| sulfatase modifying factor [Lactobacillus casei T71499]
Length = 272
Score = 180 bits (456), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 118/322 (36%), Positives = 158/322 (49%), Gaps = 63/322 (19%)
Query: 25 MGTNKPI-LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE 83
MGT+ I +D E P V + +F + V+N F +F++AT Y T AE+ G +FVFE
Sbjct: 1 MGTDDHIGFDEDYEGPPTIVRVPSFSMADTPVTNADFDDFIAATAYQTVAERLGSSFVFE 60
Query: 84 PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAE 143
L+ EEER T +H VA W
Sbjct: 61 LLIPEEERV-----------------TYQH--------------VAGAPW---------- 79
Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
W L P W HP G +S+ NHPVVHV+ DA+AYC W ++LPTEA
Sbjct: 80 W----------LLVPGADWQHPYGAESSNIDLDNHPVVHVALEDALAYCQWSHSQLPTEA 129
Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
+WEY G +PWG +L AN WQG+FP +NTA DG++ TAPV SY+ N
Sbjct: 130 QWEYAAGAGTATT-YPWGESLVDEHGFHANTWQGDFPNDNTAEDGFVGTAPVKSYEPNSN 188
Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYNP------KGPTTGTDKVKKGGSYLCNEQYCYR 317
GLY ++GNVWEW N + +N K P G + +GGS+LC+ YC R
Sbjct: 189 GLYQIIGNVWEWCR---NPRYTLLDDFNAEQFKLGKVPAAGEYAI-RGGSFLCHCSYCNR 244
Query: 318 HRCAARSQNTPDSSAGNLGFRC 339
+R AAR+ S++ +L FRC
Sbjct: 245 YRTAARNGVDLQSTSSHLSFRC 266
>gi|334342823|ref|YP_004555427.1| sulfatase-modifying factor protein [Sphingobium chlorophenolicum
L-1]
gi|334103498|gb|AEG50921.1| Sulphatase-modifying factor protein [Sphingobium chlorophenolicum
L-1]
Length = 331
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 130/350 (37%), Positives = 169/350 (48%), Gaps = 62/350 (17%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ M L G TF MG++ E P+R V +D F++D+ V+N QF FV ATG+VT
Sbjct: 25 RGMRWLSGGTFLMGSDHHY---PEEGPARRVRVDGFWMDESPVTNEQFASFVKATGWVTL 81
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE + + +L E M + L T P +S ND + +
Sbjct: 82 AETAPNPSDYPGILPE------------MIQAGSLCFT------SPPGPVSLNDFTQWWS 123
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
+R +G W PEG ST+ R +HPVVHV+ DA AY
Sbjct: 124 FR-----------FG------------ACWRAPEGPGSTLAGREDHPVVHVTHADAQAYA 160
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE+ RGGLE + WG+ L P GE RAN WQG FP N A DG+ T
Sbjct: 161 AWAGKALPTEAEWEFAARGGLEGEAYAWGSELAPAGEMRANYWQGTFPWENLATDGFEGT 220
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS------YNPKGPTTGTD------ 300
+PV + N GL++M+GNVWEWT DW++ S NP+G
Sbjct: 221 SPVRIFPANGHGLFDMIGNVWEWTDDWYSDAPPGKKSGDCCIPANPRGALERESVDPCQP 280
Query: 301 ------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
KV KGGS+LC YC R+R AAR D+S ++GFRC G
Sbjct: 281 DIQIGRKVLKGGSHLCAANYCERYRPAARYAQPVDTSTCHVGFRCVKRAG 330
>gi|170721233|ref|YP_001748921.1| hypothetical protein PputW619_2052 [Pseudomonas putida W619]
gi|169759236|gb|ACA72552.1| protein of unknown function DUF323 [Pseudomonas putida W619]
Length = 336
Score = 179 bits (455), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/340 (32%), Positives = 161/340 (47%), Gaps = 61/340 (17%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R+ MV LPG F G+++ D E P + F++D+H V+N QF FV+ TGY+
Sbjct: 42 RHDGMVRLPGGVFTFGSDR---FYDEEGPPHAAKVSGFWVDRHPVTNAQFARFVAVTGYL 98
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
T+AE+ G + +P L + R + V F+ S ++
Sbjct: 99 TQAER-GISSEADPTLPDTLRVPGAMV------FQQGPSVLD------------------ 133
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W++ P SW HP+G S + NHPVV V+ DA A
Sbjct: 134 -----------PGWQF----------VPGASWRHPQGPGSDLAGLENHPVVQVTLEDAQA 172
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYL 250
Y W G LP+EA+ EY RGGL++ F WG P+G+ AN WQG+FP NN AADG+
Sbjct: 173 YARWAGRALPSEAQLEYAMRGGLDDADFSWGTTELPKGKLMANTWQGQFPYNNMAADGFT 232
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTD-------- 300
T+PV + N+FG+++ GNVWE T + H + +P GP
Sbjct: 233 GTSPVGCFPANRFGVFDAGGNVWELTRTGYRPGHDAQRDAALDPSGPALSASYDPAEPGI 292
Query: 301 --KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC+ C R+R +AR + ++GFR
Sbjct: 293 PVAVIKGGSHLCSADRCMRYRPSARQPQPVFIATSHVGFR 332
>gi|441650125|ref|XP_004090992.1| PREDICTED: sulfatase-modifying factor 2 [Nomascus leucogenys]
Length = 286
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 160/336 (47%), Gaps = 83/336 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R VT+ F +D V+N F++FV Y TEAE
Sbjct: 30 MVQLQGGRFLMGTNSPD-SRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q MK G S I R+ HPV+H+SWNDA AYC WR
Sbjct: 89 MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 144
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLP E EWE+ RGGL+ +++PWG+ P +R N W
Sbjct: 145 GKRLPMEEEWEFAARGGLKGQVYPWGNRFQP--------NRTN---------------LW 181
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G P+G+ + + G P N A
Sbjct: 182 QGK---------------------------FPKGDKAEDGFHGVSPVNAFPA-------- 206
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
+N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 207 -----QNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTADG 248
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 249 SANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 284
Score = 41.2 bits (95), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 235 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 279
>gi|148984246|ref|ZP_01817541.1| hypothetical protein CGSSp3BS71_03457 [Streptococcus pneumoniae
SP3-BS71]
gi|147923535|gb|EDK74648.1| hypothetical protein CGSSp3BS71_03457 [Streptococcus pneumoniae
SP3-BS71]
Length = 271
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 116/325 (35%), Positives = 155/325 (47%), Gaps = 62/325 (19%)
Query: 25 MGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE 83
MGTN + + D E P V++ F + V+N +F +FV TGY T AE+ +FVF
Sbjct: 2 MGTNSEEGFLDDFEGPQVAVSVKDFSIADTPVTNQEFAQFVKETGYKTLAERQEWSFVFI 61
Query: 84 PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAE 143
+ E ER EG HP W C
Sbjct: 62 LFVPEAER-------------EGYP--------HPAGAPWWLQVSNAC------------ 88
Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
W HP G +S + +HPVVHV+ DA+A+C W G LPTEA
Sbjct: 89 ------------------WKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEA 130
Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TAPV + N F
Sbjct: 131 QWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDF 189
Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYR 317
GLY M+GNVWEW N + S+N PK + +GGS+LC+ YC R
Sbjct: 190 GLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNR 246
Query: 318 HRCAARSQNTPDSSAGNLGFRCAAD 342
+R AAR+ S++ +LGFRC +
Sbjct: 247 YRVAARNGCISTSTSSHLGFRCLKE 271
>gi|405958020|gb|EKC24189.1| Sulfatase-modifying factor 2 [Crassostrea gigas]
Length = 373
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 115/333 (34%), Positives = 163/333 (48%), Gaps = 72/333 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT-GYVTEA 73
M ++ G F G N P GE+P + + AFY+D H ++N F +F A YVT A
Sbjct: 65 MKVIKGGKFLTGINDP-ESDSGEYPMQEAEVKAFYIDTHPITNAHFWKFKEAKPRYVTTA 123
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ G ++V +P LS R +
Sbjct: 124 ERQGWSWVLKPFLSPAARDQ---------------------------------------- 143
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
AR E W +G +W PEG+ ST+ R++HPVVH+S DA A+CT
Sbjct: 144 --ARNVPEGTWWVPVKGA---------AWNKPEGLGSTVRDRLDHPVVHISMEDAKAFCT 192
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTE EWE+ RGGL+ +++PWG+ ++R N+WQG+FP NT D + T+
Sbjct: 193 WTGKRLPTENEWEFAARGGLQGKIYPWGDRYQ---KNRTNLWQGKFPEGNTKVDHWEGTS 249
Query: 254 PVMSYK-ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVK---KGGSYL 309
PV Y+ +N+FGLY+M+GNVWEWT+ Y + +V KGGS++
Sbjct: 250 PVDFYRAQNQFGLYDMLGNVWEWTST----------RYYDRVVDRKLQEVMYVVKGGSFI 299
Query: 310 -CNEQYCYRH-RCAARSQNTPDSSAGNLGFRCA 340
E +C R R A R P+ +A N+GFRC
Sbjct: 300 DSREGHCNRIVRTAQRMGLQPEYTAQNVGFRCV 332
>gi|311106300|ref|YP_003979153.1| hypothetical protein AXYL_03118 [Achromobacter xylosoxidans A8]
gi|310760989|gb|ADP16438.1| hypothetical protein AXYL_03118 [Achromobacter xylosoxidans A8]
Length = 360
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 119/341 (34%), Positives = 153/341 (44%), Gaps = 51/341 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MG++ P + E P+ V L F++D+ V+N F FV+ATGY+T AE
Sbjct: 45 MVWIPAGDFLMGSDSP-RSQANERPAHPVRLAGFWIDRDHVTNRDFARFVAATGYLTTAE 103
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D L V + F G ++ ND Y W
Sbjct: 104 RTPDWDTIRVHLPAGTPRPAGLVPGALV-FVGNSKPVDL-----------ND---YARW- 147
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W R P SW HP+G S I + HPVV VS+ DA+AY W
Sbjct: 148 ---------W----------RFAPGASWRHPQGPGSDIAGKEEHPVVQVSYEDALAYAAW 188
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWEY RGGLE F WG +L P G A W + T P
Sbjct: 189 AGKRLPTEAEWEYAARGGLEQADFAWGPDLRPGGRPMARTWDAGLAFPLQSPKVMPGTEP 248
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWW-----NVHHHPAPSYNPKGPTTGTD--------- 300
SY N +G+ +M GN W+W ADW+ + NP GP D
Sbjct: 249 AGSYPANGYGVRDMAGNAWQWVADWYRPDAFRLQSQQRRVRNPGGPAAPYDPAMVRPEAP 308
Query: 301 -KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+V +GGS+LC+E YC +R +AR P SSA N+GFR A
Sbjct: 309 KRVIRGGSFLCSEDYCEGYRVSARQGQDPYSSASNVGFRLA 349
>gi|223939387|ref|ZP_03631266.1| protein of unknown function DUF323 [bacterium Ellin514]
gi|223891991|gb|EEF58473.1| protein of unknown function DUF323 [bacterium Ellin514]
Length = 425
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 124/206 (60%), Gaps = 20/206 (9%)
Query: 154 NRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEY 207
N +F W +W HPEG D+ ++ R +PVV +++ DAVAY W G RLPTEAE+E+
Sbjct: 204 NSMFQWWRYQKGANWRHPEGPDTDLKGREKYPVVQIAYPDAVAYAKWAGKRLPTEAEFEF 263
Query: 208 GCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYN 267
RGG + + WGN+L P G+ AN+++G FPT +T ADG+ APV + N +GLY+
Sbjct: 264 AARGGSAGKTYAWGNDLKPGGKWMANIYEGTFPTKDTGADGFAGIAPVAQFPPNDYGLYD 323
Query: 268 MVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------KVKKGGSYLCNEQ 313
M GNVW+W +DW+ ++ + NP+GP T D +V KGGS+LC +Q
Sbjct: 324 MAGNVWQWCSDWYRADYYTQLAATGTVARNPQGPDTPFDPAEPTEKKRVHKGGSFLCTDQ 383
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRC 339
YC R+ R + +++ +LGFRC
Sbjct: 384 YCTRYMVGTRGKGEVTTASNHLGFRC 409
>gi|377557807|ref|ZP_09787436.1| hypothetical protein GOOTI_018_00050 [Gordonia otitidis NBRC
100426]
gi|377525030|dbj|GAB32601.1| hypothetical protein GOOTI_018_00050 [Gordonia otitidis NBRC
100426]
Length = 326
Score = 179 bits (453), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 129/369 (34%), Positives = 173/369 (46%), Gaps = 69/369 (18%)
Query: 1 MVLLPAP-----PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEV 55
+V P P P K+M +P F MG++ E P V +D F++D H V
Sbjct: 2 LVDTPVPTTASGPTRARKNMAYIPAGIFWMGSDD---FYPEERPVHQVAVDGFWMDVHPV 58
Query: 56 SNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRM 115
+ +F+ FV TGYVT A + EP ++ A + +R F
Sbjct: 59 TVAEFRRFVRDTGYVTTAAR-------EPDPADYPDADPTLLRAGSMVFTP--------- 102
Query: 116 HHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHR 175
P + +D Y W W Y P W HPEG S + R
Sbjct: 103 --PPGPVPLDD---YRRW----------WSY----------VPGADWEHPEGPGSNVGAR 137
Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
HPV HVS+ DA AY W G LPTEAEWEY RGGL+ + + WG+ L P G AN W
Sbjct: 138 NRHPVTHVSYADAQAYAAWAGKELPTEAEWEYAARGGLDRKPYAWGDELHPHGRPAANTW 197
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSY 290
GEFP NT DGY T+PV ++ N +GL +++GNVWEWT D + H + AP+
Sbjct: 198 YGEFPWQNTEEDGYSRTSPVGRFRPNGYGLVDVIGNVWEWTCDHYTTDHAAGDKNVAPAS 257
Query: 291 N---PKGPT------TG------TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
P+ P TG + KGGS+LC YC R+R +AR +T D+S ++
Sbjct: 258 TCCIPRNPVQRAAVDTGHGVERFARRTIKGGSHLCAPNYCVRYRPSARQSHTEDASTSHI 317
Query: 336 GFRCAADKG 344
GFRC +G
Sbjct: 318 GFRCIVREG 326
>gi|311251204|ref|XP_003124495.1| PREDICTED: sulfatase-modifying factor 2-like isoform 2 [Sus scrofa]
Length = 286
Score = 177 bits (450), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 122/336 (36%), Positives = 161/336 (47%), Gaps = 83/336 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F+MGTN P +DGE P R VT+ F +D V+N F++FV Y TEAE
Sbjct: 30 MVQLPGGRFQMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 88
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE L+ +E R+K + H M+ G S I R+ PVVH+SWNDA AYC WR
Sbjct: 89 AFGWSFVFEDLVPDELRSKAT---HQMQP-AGPGSGIRERLEFPVVHVSWNDARAYCAWR 144
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+ P +R N W
Sbjct: 145 GKRLPTEEEWEFAARGGLKGQVYPWGNQFQP--------NRTN---------------LW 181
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G + P + E G G FP P NN
Sbjct: 182 QG-KFPKGDKAEDGFHGVSPVNAFP--------------------PQNN----------- 209
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
+GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 210 --------YGLYDLMGNVWEWTA-------------SPYQAADQDMRVLRGASWIDTADG 248
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCA+ G G
Sbjct: 249 SANHRARVTTRMGNTPDSASDNLGFRCASSAGRPPG 284
>gi|441510563|ref|ZP_20992468.1| hypothetical protein GOACH_24_00500 [Gordonia aichiensis NBRC
108223]
gi|441445345|dbj|GAC50429.1| hypothetical protein GOACH_24_00500 [Gordonia aichiensis NBRC
108223]
Length = 323
Score = 177 bits (450), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 128/361 (35%), Positives = 171/361 (47%), Gaps = 64/361 (17%)
Query: 4 LPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
+PA V K+M +P F MG++ E P V +D F++D H V+ +F+ F
Sbjct: 7 VPAAAVRARKNMAYIPSGVFWMGSDD---FYPEERPVHQVAVDGFWIDTHPVTVAEFRRF 63
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
V TGYVT A + EP + A + +R F P +
Sbjct: 64 VRDTGYVTTAAR-------EPDPDDYPDADPTLLRAGSMVFTP-----------PPGPVP 105
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHV 183
+D Y W W Y P W HPEG S + R HPV HV
Sbjct: 106 LDD---YRRW----------WSY----------VPGADWEHPEGPGSNVGARNRHPVTHV 142
Query: 184 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
S+ DA AY W G LPTEAEWEY RGGL+ + + WG+ L P G AN W GEFP N
Sbjct: 143 SYADAQAYAAWAGKELPTEAEWEYAARGGLDRKPYAWGDELHPHGRPAANTWYGEFPWQN 202
Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-----HPAPSYN---PKGP 295
T DGY T+PV ++ N +GL +++GNVWEWT D + H + AP+ P+ P
Sbjct: 203 TEEDGYSRTSPVGRFRPNGYGLVDVIGNVWEWTCDHYTTDHAAGDKNVAPASTCCIPRNP 262
Query: 296 T------TG------TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
TG + KGGS+LC YC R+R +AR +T D+S ++GFRC +
Sbjct: 263 VQRAAVDTGHGVERFARRTIKGGSHLCAPNYCVRYRPSARQSHTEDASTSHIGFRCVVRE 322
Query: 344 G 344
G
Sbjct: 323 G 323
>gi|433649663|ref|YP_007294665.1| hypothetical protein Mycsm_05050 [Mycobacterium smegmatis JS623]
gi|433299440|gb|AGB25260.1| hypothetical protein Mycsm_05050 [Mycobacterium smegmatis JS623]
Length = 292
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 166/332 (50%), Gaps = 52/332 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +P T +G+++ ++ P+R VT+D F++ H+V+N++F EF +ATGY+T A
Sbjct: 4 ELVWIPPQTAVLGSDQHYAEEE---PARPVTIDGFWIQTHQVTNSEFAEFAAATGYLTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ + ++ A ++ F ++ R H++ + TW
Sbjct: 61 ER-------ALVPADYPGAPAENLQPGSMVFTRTQGPVDLR------HLN-----LWWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P SW HP G +S+I+ R +HPVVHV++ DA AY +
Sbjct: 103 T-----------------------PGASWRHPVGPNSSIDKRADHPVVHVAYEDAEAYAS 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE RGGL + WG G+ AN W G+FP GY T
Sbjct: 140 WAGLALPTEAEWETAARGGLGGATYTWGEEPEQPGQSLANYWHGDFPWRPD--RGYGRTT 197
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD-KVKKGGS 307
PV S+ N +GL +M GNVWEWT DW+ +Y+P+ P +V KGGS
Sbjct: 198 PVGSFPPNGYGLVDMAGNVWEWTTDWYADTRARQSCCAEDTYDPRQPQFRVPRRVVKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+LC + YC R+R AAR D+ ++GFRC
Sbjct: 258 FLCADNYCLRYRPAARRPQPVDTGMSHIGFRC 289
>gi|425736218|ref|ZP_18854525.1| non-specific serine/threonine protein kinase [Brevibacterium casei
S18]
gi|425478435|gb|EKU45627.1| non-specific serine/threonine protein kinase [Brevibacterium casei
S18]
Length = 328
Score = 177 bits (449), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 121/330 (36%), Positives = 158/330 (47%), Gaps = 41/330 (12%)
Query: 15 MVLLPGDTFRMGTNKPILI-KDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV L G FRMG+ + +DGE P R V + F + V+ +F FV ATG+ T A
Sbjct: 37 MVGLTGGVFRMGSEDSLAYPEDGEGPVREVDIAPFAIAATTVTRAEFAAFVLATGHRTGA 96
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +GD+ VF LL E L+++ V W VA +W
Sbjct: 97 EAYGDSLVFAGLLDAE----------------ALETSPA------VAATPWWRQVAGASW 134
Query: 134 RGARLPTEAEWEYG-CRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
P G GG G+ + I R +HPV HV+ DA AY
Sbjct: 135 LHPEGPGRTLVGQGTAAGGEHGNGQERGADARLRADHAFIVARPDHPVTHVTRYDAEAYA 194
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W GARLPTEAEWE+ RGGL+ + +PWG+ P G R + G FP + T G T
Sbjct: 195 EWVGARLPTEAEWEFAARGGLDQQPYPWGSIREPDGVPRMKTFVGAFPDHPTEPVG---T 251
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNE 312
V +Y N FGL+NM GNVWEWT+ S+ P V +GGS+LC+
Sbjct: 252 VAVDAYPPNGFGLHNMTGNVWEWTSG----------SFGPD----DARAVIRGGSHLCHA 297
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R +ARS TP++S G+ GFR A D
Sbjct: 298 SYCRRYRTSARSAVTPETSLGHTGFRLAVD 327
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 22/42 (52%), Positives = 31/42 (73%)
Query: 352 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
V +GGS+LC+ YC R+R +ARS TP++S G+ GFR A D+
Sbjct: 287 VIRGGSHLCHASYCRRYRTSARSAVTPETSLGHTGFRLAVDL 328
>gi|298682295|gb|ADI95357.1| hypothetical protein [Pseudomonas putida DOT-T1E]
Length = 302
Score = 177 bits (449), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 111/333 (33%), Positives = 155/333 (46%), Gaps = 45/333 (13%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P ++ D V + F MG+++ E P+ + F++ V+N F FV+ T
Sbjct: 11 PAQQEADWVWIEPGRFLMGSDR---HYPEEAPAHWEEVAGFWIQSTPVTNAMFARFVART 67
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
GY+T+AE+ D + P +++ S V P +S +D
Sbjct: 68 GYITQAERPLDPRDY-PGVAQAHLRPASLV-----------------FTPPSGTVSLDDV 109
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ L P W HP G S ++ HPVVHV D
Sbjct: 110 SRWWA-----------------------LVPGADWRHPLGPGSDLQGMAQHPVVHVGLED 146
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A+AY W G LP E EWEY GG + F WG++L P G+H AN WQG FP N D
Sbjct: 147 ALAYAQWAGCDLPGEPEWEYAAWGGTQQGEFVWGHHLVPEGKHMANTWQGNFPAENLQTD 206
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVKKGG 306
GY T+PV ++ N +GLY+M+GNVWEWT + H P S + KV KGG
Sbjct: 207 GYSRTSPVGAFPANGYGLYDMIGNVWEWTTTVFQPSHASPRKSCCANARSAAQMKVLKGG 266
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
S+LC YC R+R ARS D+++ ++GFRC
Sbjct: 267 SHLCAPNYCQRYRPPARSPQATDTTSSHIGFRC 299
>gi|359767934|ref|ZP_09271714.1| hypothetical protein GOPIP_070_00260 [Gordonia polyisoprenivorans
NBRC 16320]
gi|359314511|dbj|GAB24547.1| hypothetical protein GOPIP_070_00260 [Gordonia polyisoprenivorans
NBRC 16320]
Length = 313
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 125/343 (36%), Positives = 168/343 (48%), Gaps = 67/343 (19%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +P F MG+++ E P+ +++F ++ H V+N QF FV+ TGY T A
Sbjct: 23 ELVAIPTADFEMGSDR---FYAEERPAHIRHVESFSVEVHPVTNAQFATFVADTGYRTVA 79
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ PL D F G D + VV + V W
Sbjct: 80 ER--------PL--------------DPADFPGADPA--ELVPGAVVFTPTDGPVDLTDW 115
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R A W Y P SW HP G DS ++ R++HPV +++ DA AY
Sbjct: 116 R-------AWWRY----------VPGASWRHPLGPDSGVDDRLDHPVTQIAYADADAYAR 158
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE RGGL+ + WG+ L P G AN W G FP N +G+ T+
Sbjct: 159 WAGRRLPTEAEWECAARGGLDGHDYAWGDELHPDGAVVANTWIGRFPYRN---EGWGGTS 215
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------------APSYNPKGPTTG 298
PV Y N FGL++M+GNVWE TA+ + H P APS +P
Sbjct: 216 PVGYYPSNPFGLFDMIGNVWERTAEVFTPRHVPPDMSTVTAGGRQNLLAPSTSPT----- 270
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+V KGGS+LC+ QYC R+R AARS + DS+ +LGFRC A
Sbjct: 271 VLRVTKGGSHLCSPQYCRRYRPAARSPQSDDSATSHLGFRCVA 313
>gi|424883781|ref|ZP_18307409.1| hypothetical protein Rleg8DRAFT_0544 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392515442|gb|EIW40175.1| hypothetical protein Rleg8DRAFT_0544 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 326
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/344 (35%), Positives = 166/344 (48%), Gaps = 65/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M +PG TF MG++ E P+ V +D F++ V+N QF++FV ATGYVT AE
Sbjct: 20 MAWIPGRTFTMGSDHHY---PEEAPAHPVKVDGFWISLTPVTNRQFRQFVEATGYVTVAE 76
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K D + E RA S V K G D + W
Sbjct: 77 KAPDPKDYPGAKPEMLRAG-SLVFTQPKAVNGAD-----------ISQWWT--------- 115
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+++G +W P G S + +++HPVVHV++ DA AY W
Sbjct: 116 ---------FKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAYADAAAYAEW 154
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE RGGL++ + WG+ P G AN W G FPT++ G T+P
Sbjct: 155 AGLDLPTEAEWELAARGGLDDAEYAWGDEFAPGGVPMANTWSGTFPTHSMKPKGQERTSP 214
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-------NPKGPTTGTD------- 300
V S+ N FGLY+M+GNVWEWT+D+W+ HP P+ NP+G
Sbjct: 215 VRSFPPNGFGLYDMIGNVWEWTSDYWST-RHPEPAKHSCCIPSNPRGRDAEASLDLRQPE 273
Query: 301 -----KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 274 IRIARRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 317
>gi|398959331|ref|ZP_10678088.1| hypothetical protein PMI26_05918 [Pseudomonas sp. GM33]
gi|398145384|gb|EJM34171.1| hypothetical protein PMI26_05918 [Pseudomonas sp. GM33]
Length = 302
Score = 177 bits (448), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 112/328 (34%), Positives = 157/328 (47%), Gaps = 46/328 (14%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
++ +P F MG+ E P R + F++ V+N F F+++TGYVT+A
Sbjct: 17 LIRIPEGRFLMGSQDHY---PEESPVRWEEITGFFMHATPVTNAMFAAFIASTGYVTQA- 72
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
E L+ E+ ++ D+ + L P + +D + +
Sbjct: 73 --------ERCLATEDYPGVAP---DLLQPASL------VFMPPDGPVQLDDVSRWWS-- 113
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
L P W HP G DS ++ HPVVHV DA AY W
Sbjct: 114 ---------------------LVPGADWRHPLGPDSNLQGLDQHPVVHVGLEDAHAYAQW 152
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEA+WE+ GG F WG+ L P G H AN+WQG+FP N A DGY T+P
Sbjct: 153 VGCELPTEAQWEFAAWGGRTGSEFVWGDQLVPDGRHMANIWQGDFPWRNLALDGYDRTSP 212
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTDKVKKGGSYLCNE 312
V ++ N +GL +M+GNVWEWT +N + PS KG +V KGGS+LC
Sbjct: 213 VGAFPANGYGLQDMIGNVWEWTITPFNNPGQGSGKPSCCAKGGGEKNMRVLKGGSHLCAP 272
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCA 340
YC R+R ARS PD+++ ++GFRC
Sbjct: 273 DYCRRYRAPARSPQAPDTTSSHIGFRCV 300
>gi|38638565|ref|NP_943151.1| hypothetical protein ND018 [Pseudomonas putida ND6]
gi|34335353|gb|AAP44251.1| hypothetical protein ND018 [Pseudomonas putida ND6]
Length = 302
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 115/333 (34%), Positives = 158/333 (47%), Gaps = 45/333 (13%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P ++ D V + F MG++ E P+ + F++ V+N F FV+ T
Sbjct: 11 PEQQEADWVWIEPGRFLMGSDS---HYPEEAPAHWEEVAGFWIQSTPVTNAMFARFVART 67
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
GYVT+AE+ PL D + + G+ + H +V + A
Sbjct: 68 GYVTQAER--------PL--------------DPRDYPGVAQS--HLRPASLVFTPPSGA 103
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
V+ L + W L P W HP G S ++ HPVVHV D
Sbjct: 104 VS--------LDDVSRWW---------ALVPGADWRHPLGPGSDLQGMAQHPVVHVGLED 146
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
A+AY W G LP E EWEY GG + F WG++L P G+H AN WQG FP N D
Sbjct: 147 ALAYAQWAGCDLPGEPEWEYAAWGGTQQGEFVWGHHLVPEGKHMANTWQGNFPAENLQTD 206
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH-HPAPSYNPKGPTTGTDKVKKGG 306
GY T+PV ++ N +GLY+M+GNVWEWT + H P S + KV KGG
Sbjct: 207 GYSRTSPVGAFPANGYGLYDMIGNVWEWTTTVFQPSHASPRKSCCANARSAAQMKVLKGG 266
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
S+LC YC R+R ARS D+++ ++GFRC
Sbjct: 267 SHLCAPNYCQRYRPPARSPQATDTTSSHIGFRC 299
>gi|84497348|ref|ZP_00996170.1| hypothetical protein JNB_14178 [Janibacter sp. HTCC2649]
gi|84382236|gb|EAP98118.1| hypothetical protein JNB_14178 [Janibacter sp. HTCC2649]
Length = 333
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 122/381 (32%), Positives = 186/381 (48%), Gaps = 79/381 (20%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG RMG+++ L E P+R++ + F +++H V+N QF FV+ TGYVT
Sbjct: 31 ELVDLPGGPLRMGSDEHYL---EEAPARDMVVAPFAMERHPVTNAQFAAFVADTGYVT-- 85
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+++ D + + G D ++ + +V V W
Sbjct: 86 --------------------VAEQALDPEAYPGADPSL--LVPGGLVFTPTPGPVPLDRW 123
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+W R P SW HP G S + +HPVV V+++D +AY
Sbjct: 124 D--------QWW---------RWVPGASWRHPRGPGSDVSDDDDHPVVMVAYDDTLAYAA 166
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTE E+E+ G + F WG+ L+P GE AN WQG+FP NT A+G++ T+
Sbjct: 167 WAGRRLPTEPEFEFAA-GAFASTEFAWGDELSPGGELMANTWQGQFPWRNTGANGWVGTS 225
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
PV S+ N++GL +++GNVWEWT+ ++ T D+V+ + +E
Sbjct: 226 PVGSFPANQWGLVDLIGNVWEWTSTYYV--------------TNAEDRVR---AEEIDET 268
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 373
C CA R ++ + G L R +V KGGS+LC +YC R+R AAR
Sbjct: 269 SC----CAPRIRDL-SVAPGELHPR------------RVLKGGSHLCAPEYCRRYRPAAR 311
Query: 374 SQNTPDSSAGNLGFRCAADVS 394
S + DS+ ++GFR A S
Sbjct: 312 SGQSVDSATSHVGFRLVAAAS 332
>gi|196228936|ref|ZP_03127802.1| protein of unknown function DUF323 [Chthoniobacter flavus Ellin428]
gi|196227217|gb|EDY21721.1| protein of unknown function DUF323 [Chthoniobacter flavus Ellin428]
Length = 399
Score = 176 bits (446), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 85/195 (43%), Positives = 118/195 (60%), Gaps = 9/195 (4%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
R P +W HPEG S I+ R +PVV V W+DA+A+ +W G RLPTEAEWEY RGG
Sbjct: 201 RYVPGANWRHPEGPQSNIDGREKYPVVQVCWDDAIAFASWAGKRLPTEAEWEYAARGGAM 260
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
+ + WG + P G AN+WQG FP + A DG+ APV +++ N +GL +M GN WE
Sbjct: 261 HNPYVWGKEMKPGGRWDANIWQGRFPVEDKAEDGFAGLAPVGTFRTNDYGLADMAGNAWE 320
Query: 275 WTADWWNV-HHHPAPSYNPKGPTTGTD--------KVKKGGSYLCNEQYCYRHRCAARSQ 325
W ADW++ +++ +P NP GP D +V +GGSYL +E +R +AR +
Sbjct: 321 WCADWYDATYYNRSPHANPPGPEKSDDPDEPSVPKRVARGGSYLSSENNGAGYRPSARRK 380
Query: 326 NTPDSSAGNLGFRCA 340
P ++ +LGFRCA
Sbjct: 381 VPPGYASCDLGFRCA 395
>gi|375140631|ref|YP_005001280.1| hypothetical protein [Mycobacterium rhodesiae NBB3]
gi|359821252|gb|AEV74065.1| hypothetical protein MycrhN_3546 [Mycobacterium rhodesiae NBB3]
Length = 290
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 114/332 (34%), Positives = 163/332 (49%), Gaps = 54/332 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D+V +P T +G+++ E P+R V + F + +V+N ++ FV ATGY+T A
Sbjct: 4 DLVRIPAQTATLGSDR---HYPEEGPARPVQVAGFGIQPRQVTNAEYAGFVGATGYLTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ + ++ A + ++ F ++ R +++ W TW
Sbjct: 61 ERPVNP-------ADYPNAPAANLQPGSMVFSRTRGPVDLRH----LNLWW-------TW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P SW HP G S+I+ R +HPVVH+++ DA AY T
Sbjct: 103 T-----------------------PGASWRHPVGPLSSIDKRADHPVVHIAYEDAEAYAT 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE RGGL+ + WG+ P + AN W G+FP GY T
Sbjct: 140 WAGLSLPTEAEWETAARGGLDQAEYTWGDE--PETQPMANYWHGDFPWRPD--KGYGRTT 195
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGS 307
PV SY N +GL++M GNVWEWT DW+ P SY+P+ P KV KGGS
Sbjct: 196 PVGSYPPNGYGLFDMAGNVWEWTTDWYGETRDAQPCCASDSYDPRQPQFPVPRKVVKGGS 255
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+LC + YC R+R AAR D+ ++GFRC
Sbjct: 256 FLCADVYCLRYRPAARRPQPVDTGMSHIGFRC 287
>gi|284044520|ref|YP_003394860.1| hypothetical protein Cwoe_3066 [Conexibacter woesei DSM 14684]
gi|283948741|gb|ADB51485.1| protein of unknown function DUF323 [Conexibacter woesei DSM 14684]
Length = 300
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 118/339 (34%), Positives = 152/339 (44%), Gaps = 58/339 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G +FRMG+ + E P R V + F +D V+ +F+ F TGYVT AE
Sbjct: 3 MVELDGGSFRMGSEE---FYPEERPVREVAVGGFAIDPRLVTVREFRRFADETGYVTLAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K D ++ P E S V P +S + + W
Sbjct: 60 KPPDPAMY-PGADPELLVPGSLV-----------------FRKPRERVSLDHWRRWWAW- 100
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P W HP G ++ R HPVVH++ DAVAY W
Sbjct: 101 ----------------------VPGADWRHPGGPGTSTRGRDRHPVVHIAHEDAVAYAAW 138
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWEY RGGL+ F WG++ P G AN WQGEFP N A G+ T+P
Sbjct: 139 AGKALPTEAEWEYAARGGLDGARFAWGDDERPGGRLMANHWQGEFPWQNLKAKGWEGTSP 198
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWW-----------NVHHHPAPSYNPKGPTTGTD--- 300
V ++ N + LY+M GNVWEWTAD + +G G
Sbjct: 199 VATFPPNGYDLYDMAGNVWEWTADLFTPAAPTPAAAPAAPAACCAPAPARGQEPGAHIPR 258
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 259 RVIKGGSHLCAPNYCLRYRPAARQGEAIDTSTSHIGFRC 297
>gi|322800832|gb|EFZ21706.1| hypothetical protein SINV_15331 [Solenopsis invicta]
Length = 237
Score = 176 bits (445), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 95/214 (44%), Positives = 120/214 (56%), Gaps = 54/214 (25%)
Query: 22 TFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFV 81
T+ +GTN P+ + DGE P R V LD FY+D+ EVSN +F FV+ATGY+TEAE FG++FV
Sbjct: 74 TYFIGTNNPVFVADGEGPRREVVLDNFYIDKFEVSNEEFMIFVNATGYITEAESFGESFV 133
Query: 82 FEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTE 141
FE LL + + KI++V VA W LP +
Sbjct: 134 FENLLKQNIKDKINKV------------------------------VAQAPWW---LPVK 160
Query: 142 AEWEYGCRGGLENRLFPWGSWLHPEGIDSTIE----HRMNHPVVHVSWNDAVAYCTWRGA 197
SW HPEG +S I RM+HPV+HVSW+DAVAYC+W G
Sbjct: 161 Q-----------------ASWQHPEGPNSNITCMYLVRMDHPVIHVSWHDAVAYCSWIGK 203
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
RLPTEAEWE CRGGL +RL+PWGN L P +H+
Sbjct: 204 RLPTEAEWEVACRGGLSDRLYPWGNKLMPNNQHK 237
>gi|241113473|ref|YP_002973308.1| hypothetical protein Rleg_5135 [Rhizobium leguminosarum bv.
trifolii WSM1325]
gi|240861681|gb|ACS59347.1| protein of unknown function DUF323 [Rhizobium leguminosarum bv.
trifolii WSM1325]
Length = 324
Score = 175 bits (444), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 122/345 (35%), Positives = 166/345 (48%), Gaps = 65/345 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG TF MG++ E P+ V +D F++ V+N QF++FV TGYVT AE
Sbjct: 18 MVWIPGRTFTMGSDHHY---PEEAPAHPVKVDGFWMSLTPVTNRQFKQFVETTGYVTVAE 74
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + E RA S V K G D + W
Sbjct: 75 IAPDPKDYPGAKPEMLRAG-SLVFTQPKAVNGAD-----------ISQWWT--------- 113
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+++G +W P G S + +++HPVVHV++ DA AY W
Sbjct: 114 ---------FKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAYADAAAYAEW 152
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE RGGL++ + WG+ P G AN W G FPT++ G T+P
Sbjct: 153 AGLDLPTEAEWELAARGGLDDAEYAWGDEFAPGGVPMANTWSGTFPTHSMKPKGQERTSP 212
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
V S+ N FGLY+++GNVWEWT+D+W+ HP P SY+P+ P
Sbjct: 213 VRSFPPNGFGLYDVIGNVWEWTSDYWST-RHPEPAKHSCCIPSNPRGRDAEASYDPRQPE 271
Query: 297 TG-TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 272 IRIARRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRCV 316
>gi|395843048|ref|XP_003794312.1| PREDICTED: sulfatase-modifying factor 2 [Otolemur garnettii]
Length = 401
Score = 175 bits (443), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 109/287 (37%), Positives = 146/287 (50%), Gaps = 69/287 (24%)
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
++FV Y TEAE FG +FVFE L+S E R K++Q MK
Sbjct: 175 RDFVREKKYRTEAEVFGWSFVFEDLVSSELRNKVTQ---QMK------------------ 213
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
+ W W LP E + W P G S I R+ PV
Sbjct: 214 SVLW--------W----LPVEKAF-----------------WRQPAGPGSGIRERLELPV 244
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
VHVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP
Sbjct: 245 VHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQIYPWGNRFQP---NRTNLWQGKFP 301
Query: 241 TNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
+ A DG+ +PV ++ +N +GLY+++GNVWEWTA +P
Sbjct: 302 KGDKAEDGFHGVSPVNAFPPQNNYGLYDLLGNVWEWTA-------------SPYKAAGQD 348
Query: 300 DKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+V +G S++ + +R R R NTPDS++ NLGFRCA+ G
Sbjct: 349 MRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCASSTG 395
>gi|377561797|ref|ZP_09791227.1| hypothetical protein GOOTI_217_00030 [Gordonia otitidis NBRC
100426]
gi|377520992|dbj|GAB36392.1| hypothetical protein GOOTI_217_00030 [Gordonia otitidis NBRC
100426]
Length = 305
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 125/340 (36%), Positives = 171/340 (50%), Gaps = 57/340 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V L G F MG+++ E P+ + AF +++H V+N Q+ FV+ATGYVT A
Sbjct: 9 ELVALDGGLFCMGSDR---FYPEERPAHERRIGAFSIERHPVTNAQYAAFVAATGYVTVA 65
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D F G D + +V + V W
Sbjct: 66 ER----------------------EIDPADFPGADPV--SLVPGALVFTPTSGPVDLADW 101
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R +W R P SW HP G S + +HPV +++ DAVAY
Sbjct: 102 R--------QWW---------RWEPGASWKHPLGPRSDLVGYEDHPVTQIAYADAVAYAD 144
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ RGG+ + WG+ L P GE AN W G+FP A+DG+ T+
Sbjct: 145 WIGRRLPTEAEWEFAARGGVAGHEYAWGDELHPDGEILANTWIGDFP---YASDGWAGTS 201
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKG------PTTG--TDKVK 303
+ SY N FGL +M+GNVWE T++ + H P + +P G PTT +V
Sbjct: 202 KIGSYPPNGFGLLDMIGNVWERTSNVFAPRHIPPGLSTVSPDGRQDLLAPTTSPRVRRVT 261
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
KGGS+LC+ QYC R+R AARS + DS+ +LGFRCAAD
Sbjct: 262 KGGSHLCSPQYCRRYRPAARSPQSDDSATSHLGFRCAADS 301
>gi|398990931|ref|ZP_10694094.1| hypothetical protein PMI23_04586 [Pseudomonas sp. GM24]
gi|399012075|ref|ZP_10714403.1| hypothetical protein PMI19_01181 [Pseudomonas sp. GM16]
gi|398116681|gb|EJM06440.1| hypothetical protein PMI19_01181 [Pseudomonas sp. GM16]
gi|398142219|gb|EJM31122.1| hypothetical protein PMI23_04586 [Pseudomonas sp. GM24]
Length = 305
Score = 174 bits (442), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 85/186 (45%), Positives = 110/186 (59%), Gaps = 1/186 (0%)
Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 215
L P W HP+G S + + NHPVVHV DA+AY W G LPTE EWEY GG +
Sbjct: 118 LVPGADWSHPQGPGSNLRNMDNHPVVHVGLEDAMAYAQWAGCDLPTEQEWEYAAWGGHQA 177
Query: 216 RLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
F WG+ L P G+H+AN WQG FP N ADG+ T+PV ++ N F L++M+GNVWEW
Sbjct: 178 SEFVWGHELAPNGQHQANTWQGAFPHENLEADGFAGTSPVRTFPGNGFDLFDMIGNVWEW 237
Query: 276 TAD-WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
T + H P S + KV KGGS+LC YC R+R ARS + D+++ +
Sbjct: 238 TTSVFQTTHDAPQKSCCTNAKISAGMKVLKGGSHLCAPNYCQRYRAPARSPQSVDTTSSH 297
Query: 335 LGFRCA 340
+GFRC
Sbjct: 298 IGFRCV 303
>gi|336119282|ref|YP_004574059.1| hypothetical protein MLP_36420 [Microlunatus phosphovorus NM-1]
gi|334687071|dbj|BAK36656.1| hypothetical protein MLP_36420 [Microlunatus phosphovorus NM-1]
Length = 318
Score = 174 bits (441), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 121/341 (35%), Positives = 162/341 (47%), Gaps = 63/341 (18%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+ G TF MG+++ E P+ V +D F + + V+N +F FV+ATGY+T AE+
Sbjct: 10 IAGGTFTMGSDR---HYPEEAPTHRVRVDGFRIQRSAVTNAEFAAFVTATGYLTVAERPL 66
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
D F P E S V F ++ R H+S + W
Sbjct: 67 DPADF-PGAPAENLVAGSMV------FTPTPGPVDLR------HLS-----QWWMW---- 104
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
P SW HP G S + +HPVVH+++ DA AY W GA
Sbjct: 105 -------------------VPGASWRHPLGPGSDLTELADHPVVHIAYEDAAAYANWCGA 145
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG-EFPTNNTAADGYLSTAPVM 256
LPTEA+WEY RGGL F WG+ P G AN W G +FP +T G+L TAPV
Sbjct: 146 ALPTEAQWEYAARGGLAGAAFTWGDAARPDGRLMANTWDGPDFPWRSTGESGWLRTAPVG 205
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHH-----------------HPAPSYNPKGPTTGT 299
S+ N +GL++M GNVWEWT DWW H + S++P+ P
Sbjct: 206 SFPANGYGLHDMAGNVWEWTQDWWTSRHPEDAAKPCCVPVNPRGGEISDSFDPRQPQFAI 265
Query: 300 D-KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KV KGGS+LC + YC R+R AAR D+ ++GFRC
Sbjct: 266 PRKVIKGGSHLCADSYCLRYRPAARRPQPVDTGMSHIGFRC 306
>gi|183984379|ref|YP_001852670.1| hypothetical protein MMAR_4408 [Mycobacterium marinum M]
gi|183177705|gb|ACC42815.1| conserved protein [Mycobacterium marinum M]
Length = 293
Score = 174 bits (440), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 122/332 (36%), Positives = 162/332 (48%), Gaps = 52/332 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D++ +P T +G+ E P R V ++AF + H+V+NTQF EFV ATGY+T A
Sbjct: 4 DLIRIPAQTAVLGSEA---HYPEESPLRQVDVNAFRIQAHQVTNTQFAEFVDATGYLTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + P E S V F ++ R H+S + TW
Sbjct: 61 ERPLDPADY-PGAPPENLQPGSMV------FTRTAGPVDLR------HLS-----LWWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P SW HP G S+I R +HPVVHV++ DA AY
Sbjct: 103 T-----------------------PGASWRHPAGPHSSIAKRADHPVVHVAYEDAHAYAG 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LP+EAEWE RGGL + WG G+ AN W G+FP GY T+
Sbjct: 140 WAGLALPSEAEWEVAARGGLTGAAYTWGEESERPGQRLANFWHGDFPWRPEP--GYGRTS 197
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGS 307
PV S++ N +GL++M GNVWEWT DW+ P SY+P+ P KV KGGS
Sbjct: 198 PVGSFEPNGYGLFDMAGNVWEWTVDWYTDTPSARPAAEADSYDPQQPQFRVPRKVVKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+LC + YC R+R AAR D+ ++GFRC
Sbjct: 258 FLCADSYCRRYRPAARRPQPIDTGMSHIGFRC 289
>gi|404420723|ref|ZP_11002458.1| sulfatase-modifying factor 1 [Mycobacterium fortuitum subsp.
fortuitum DSM 46621]
gi|403659762|gb|EJZ14388.1| sulfatase-modifying factor 1 [Mycobacterium fortuitum subsp.
fortuitum DSM 46621]
Length = 291
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 123/334 (36%), Positives = 167/334 (50%), Gaps = 51/334 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+++ L G TFRMG+ E P VT+ AF +++H V+ +F FV ATGYVT
Sbjct: 3 SELIELAGGTFRMGSTA---FYPEEAPVHTVTVGAFAIERHPVTTAEFGAFVEATGYVTI 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ D ++ P + E + + V F D ++ R
Sbjct: 60 AERPMDPALY-PGVDEADLVPGALV------FRPTDGPVDLR-----------------D 95
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR W++ P +W HP G D +HPVV V++ DA AY
Sbjct: 96 WR-------QWWDWA----------PGANWRHPFGPDRD-PAAPDHPVVQVAYPDAAAYA 137
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G RLPTEAEWEY RGG+E + WG+ + P G AN WQG FP N A G+ T
Sbjct: 138 AWAGRRLPTEAEWEYAARGGVEG-TYAWGDEVNPGGALMANTWQGRFPYRNDGALGWRGT 196
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTG-----TDKVKKGGS 307
+PV S+ N FGL++M+GNVWEWT + H P P ++V KGGS
Sbjct: 197 SPVGSFPPNAFGLFDMIGNVWEWTTTLFAGQHRPQADAPACCPPPADPDPMVNQVLKGGS 256
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+LC +YC+R+R AARS + DSS ++GFRC A
Sbjct: 257 HLCAPEYCHRYRPAARSPQSQDSSTTHIGFRCVA 290
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 23/43 (53%), Positives = 32/43 (74%)
Query: 349 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 391
++V KGGS+LC +YC+R+R AARS + DSS ++GFRC A
Sbjct: 248 VNQVLKGGSHLCAPEYCHRYRPAARSPQSQDSSTTHIGFRCVA 290
>gi|449265959|gb|EMC77086.1| Sulfatase-modifying factor 2, partial [Columba livia]
Length = 225
Score = 173 bits (439), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 106/285 (37%), Positives = 144/285 (50%), Gaps = 63/285 (22%)
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
+EFV Y TEAE FG +FVFE +SEE + K++ + L+S
Sbjct: 1 REFVREKKYKTEAEAFGWSFVFEDFVSEELKKKVT---------QKLES----------- 40
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
W LP E W P G S+I+ R+++PV
Sbjct: 41 ----------APWW---LPIEK-----------------AFWRQPSGPGSSIKDRLDYPV 70
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HVSWNDA +C W+G RLP E EWE+ RGGLE R++PWGN P +R N+WQGEFP
Sbjct: 71 LHVSWNDAQTFCAWKGKRLPVEEEWEFAARGGLERRVYPWGNKFQP---NRTNLWQGEFP 127
Query: 241 TNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT 299
+TA DGY +PV ++ +N +GLY+++GN WEWTA + P +
Sbjct: 128 RVDTADDGYHGVSPVAAFPPQNSYGLYDLLGNTWEWTA-----SEYTTPGLLRQ--RAQK 180
Query: 300 DKVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAAD 342
+V +G S++ H R R NTPDS++ NL FRCAAD
Sbjct: 181 MQVLRGASWIDTADGSANHKARVTTRMGNTPDSASDNLSFRCAAD 225
>gi|118616111|ref|YP_904443.1| hypothetical protein MUL_0225 [Mycobacterium ulcerans Agy99]
gi|118568221|gb|ABL02972.1| conserved protein [Mycobacterium ulcerans Agy99]
Length = 293
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 122/332 (36%), Positives = 162/332 (48%), Gaps = 52/332 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D++ +P T +G+ E P R V ++AF + H+V+NTQF EFV ATGY+T A
Sbjct: 4 DLIRIPAQTAVLGSEA---HYPEESPLRQVDVNAFRIQAHQVTNTQFAEFVDATGYLTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + P E S V F ++ R H+S + TW
Sbjct: 61 ERPLDPADY-PGAPPENLQPGSMV------FTRTAGPVDLR------HLS-----LWWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P SW HP G S+I R +HPVVHV++ DA AY
Sbjct: 103 T-----------------------PGASWRHPAGPHSSIAKRADHPVVHVAYEDAHAYAG 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LP+EAEWE RGGL + WG G+ AN W G+FP GY T+
Sbjct: 140 WAGLALPSEAEWEVAARGGLTGAAYTWGEESERPGQRLANFWHGDFPWRPEP--GYGRTS 197
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGS 307
PV S++ N +GL++M GNVWEWT DW+ P SY+P+ P KV KGGS
Sbjct: 198 PVGSFEPNGYGLFDMAGNVWEWTVDWYIDTPSARPAAEADSYDPQQPQFRVPRKVVKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+LC + YC R+R AAR D+ ++GFRC
Sbjct: 258 FLCADSYCRRYRPAARRPQPIDTGVSHIGFRC 289
>gi|260906522|ref|ZP_05914844.1| hypothetical protein BlinB_14423 [Brevibacterium linens BL2]
Length = 342
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 121/329 (36%), Positives = 159/329 (48%), Gaps = 55/329 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIK-DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M+ L G +F MG+ P+ DGE P R V+++ F + V+ QF FV+ATG+ T+A
Sbjct: 67 MIGLSGGSFLMGSEDPLAYPGDGEGPVREVSVNGFAIAATTVTVAQFAAFVAATGHRTDA 126
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E GD+ VF LL+E GL T P W V TW
Sbjct: 127 ETHGDSLVFTGLLAE-----------------GLAETTPSVQATP-----WWRPVTGATW 164
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P GL GS L +++ HPV HVS+ DA AY
Sbjct: 165 LQPSGP-----------GLNGVPGVPGSTLE----GASLNDWARHPVTHVSYADAEAYAQ 209
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W GARLPTE EWE+ RGGLE + +PWG+ P+G N + G FP T G T
Sbjct: 210 WVGARLPTETEWEFASRGGLEQQPYPWGSVREPQGLANMNTFTGVFPDAPTMPVG---TV 266
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQ 313
P ++ N FGL+N GNVWEWT+ ++ H V +GGSY+C++
Sbjct: 267 PADAFAPNGFGLHNTTGNVWEWTSSHFSDHDR--------------RPVLRGGSYMCHDS 312
Query: 314 YCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R +ARS TPD+S G+ GFR A D
Sbjct: 313 YCRRYRTSARSAATPDTSLGHTGFRLAID 341
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 23/42 (54%), Positives = 32/42 (76%)
Query: 352 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
V +GGSY+C++ YC R+R +ARS TPD+S G+ GFR A D+
Sbjct: 301 VLRGGSYMCHDSYCRRYRTSARSAATPDTSLGHTGFRLAIDL 342
>gi|291412121|ref|XP_002722334.1| PREDICTED: sulfatase modifying factor 2 [Oryctolagus cuniculus]
Length = 335
Score = 173 bits (438), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 116/336 (34%), Positives = 159/336 (47%), Gaps = 83/336 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGTN P +DGE P R VT+ F +D V+N F+EF+ Y TEAE
Sbjct: 79 MVQLPGGRFLMGTNAPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFIREKKYRTEAE 137
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K+++ G S I R+ PV+H+SWNDA AYC WR
Sbjct: 138 AFGWSFVFEDFVSKELRNKVTKQMQPA----GPGSGISERLELPVLHVSWNDARAYCAWR 193
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+ P +R N W
Sbjct: 194 GKRLPTEEEWEFAARGGLKGQVYPWGNRFQP--------NRTN---------------LW 230
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G P+G+ + + G P N
Sbjct: 231 QGQ---------------------------FPKGDTAEDGFHGVSPVNAFP--------- 254
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
+N +GLY+++GNVWEWTA +PA + +V +G S++ +
Sbjct: 255 ----PQNNYGLYDLMGNVWEWTAS-----TYPAADQD--------MRVLRGASWIDTADG 297
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NL FRCA+ G G
Sbjct: 298 SANHRARVTTRMGNTPDSASDNLSFRCASGAGRPPG 333
>gi|374609240|ref|ZP_09682036.1| protein of unknown function DUF323 [Mycobacterium tusciae JS617]
gi|373552209|gb|EHP78819.1| protein of unknown function DUF323 [Mycobacterium tusciae JS617]
Length = 292
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/333 (35%), Positives = 161/333 (48%), Gaps = 52/333 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D+V +P T +G+++ E P+R V +D F + V+NT+F EFV ATGY+T A
Sbjct: 4 DLVWIPKQTATLGSDR---HYPEEAPARPVGVDGFSMMARTVTNTEFAEFVDATGYLTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ D + +E + S + R PV N + TW
Sbjct: 61 ERPLDPAAYPDAPAENLQPG---------------SMVFTRTQGPVDLRHLN---LWWTW 102
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
P SW HP G S+I+ R HPVVH+++ DA AY
Sbjct: 103 T-----------------------PGASWRHPVGPLSSIDRRAEHPVVHIAYEDAEAYAA 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWE RGGL+ + WG+ + AN W G+FP GY TA
Sbjct: 140 WAGMSLPTEAEWETAARGGLDQAEYTWGDEPESPDQRMANYWHGDFPWR--PEKGYGRTA 197
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGS 307
PV SY+ N +GLY+M GNVWEWT DW+ P S++P+ P +V KGGS
Sbjct: 198 PVGSYQPNGYGLYDMAGNVWEWTTDWYGETRDNQPCCASDSFDPRQPQFQVPRRVVKGGS 257
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+LC + YC R+R AAR D+ ++GFRC
Sbjct: 258 FLCADIYCLRYRPAARRPQPADTGMSHVGFRCV 290
>gi|108801079|ref|YP_641276.1| hypothetical protein Mmcs_4115 [Mycobacterium sp. MCS]
gi|119870220|ref|YP_940172.1| hypothetical protein Mkms_4190 [Mycobacterium sp. KMS]
gi|108771498|gb|ABG10220.1| protein of unknown function DUF323 [Mycobacterium sp. MCS]
gi|119696309|gb|ABL93382.1| protein of unknown function DUF323 [Mycobacterium sp. KMS]
Length = 291
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 164/332 (49%), Gaps = 51/332 (15%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+D+V +P + +G+++ E P+R+VT+D F++ H V+N +F FV ATGY T
Sbjct: 3 EDLVRIPAQSATIGSDR---HYPEEAPARDVTVDGFWIQAHAVTNAEFAAFVDATGYRTV 59
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ +P + A+ Q S + R PV N
Sbjct: 60 AER-----PLDPANYPDAPAQNLQP----------GSMVFRRTAGPVDLRHLN------- 97
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
+W Y G SW HP G S+I +R +HPVVHV+++DA AY
Sbjct: 98 ----------QWWYWTPGA---------SWRHPIGPGSSIVNRADHPVVHVAYDDAEAYA 138
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE RGGL + + WG+ G AN W GEFP GY T
Sbjct: 139 AWAGLELPTEAEWEVAARGGLPHAGYTWGDEPEGPGHKLANYWHGEFPWRPD--RGYGRT 196
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----PSYNPKGPTTGTD-KVKKGGS 307
APV SY N +GLY+M GNVWEWT DW+ A SY+P P +V KGGS
Sbjct: 197 APVGSYPPNGYGLYDMAGNVWEWTTDWYTDTGGSACCAEDSYDPAQPQFPVPRRVVKGGS 256
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+LC + YC R+R AAR D+ ++GFRC
Sbjct: 257 FLCADSYCARYRPAARRPQPVDTGMSHIGFRC 288
>gi|443492512|ref|YP_007370659.1| Formylglycine-generating sulfatase enzyme [Mycobacterium liflandii
128FXT]
gi|442585009|gb|AGC64152.1| Formylglycine-generating sulfatase enzyme [Mycobacterium liflandii
128FXT]
Length = 293
Score = 172 bits (437), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 120/329 (36%), Positives = 159/329 (48%), Gaps = 53/329 (16%)
Query: 21 DTFRMGTNKPILIKDGEFPS----RNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
D R+ +L + +P R V ++AF + H+V+NTQF EFV ATGY+T AE+
Sbjct: 4 DLIRIAAQTAVLGSEAHYPEESPLRQVDVNAFRIQVHQVTNTQFAEFVDATGYLTVAERP 63
Query: 77 GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
D + P E S V F ++ R H+S + TW
Sbjct: 64 LDPADY-PGAPPENLQPGSMV------FTRTAGPVDLR------HLS-----LWWTWT-- 103
Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
P SW HP G S+I R +HPVVHV++ DA AY W G
Sbjct: 104 ---------------------PGASWRHPAGPHSSIAKRADHPVVHVAYEDAHAYAGWAG 142
Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
LP+EAEWE RGGL + WG G+ AN W G+FP GY T+PV
Sbjct: 143 LALPSEAEWEVAARGGLTGAAYTWGEESERPGQRLANFWHGDFPWRPEP--GYGRTSPVG 200
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKGGSYLC 310
S++ N +GL++M GNVWEWT DW+ P SY+P+ P KV KGGS+LC
Sbjct: 201 SFEPNGYGLFDMAGNVWEWTVDWYTDTPSARPAAEADSYDPQQPQFRVPRKVVKGGSFLC 260
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+ YC R+R AAR D+ ++GFRC
Sbjct: 261 ADSYCRRYRPAARRPQPIDTGMSHIGFRC 289
>gi|418051829|ref|ZP_12689913.1| Sulphatase-modifying factor protein [Mycobacterium rhodesiae JS60]
gi|353184521|gb|EHB50048.1| Sulphatase-modifying factor protein [Mycobacterium rhodesiae JS60]
Length = 290
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 120/328 (36%), Positives = 164/328 (50%), Gaps = 51/328 (15%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V +PG T +G+++ E P+R+VT+D F + +V+N F EFVSATGY+T A
Sbjct: 5 NLVWIPGQTAVLGSDR---HYPEEAPARSVTVDGFSIQDRQVTNAAFTEFVSATGYLTVA 61
Query: 74 EKFGDTFVFEPL-LSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
E+ PL ++ A ++ F ++ R H+S + T
Sbjct: 62 ER--------PLNPADYPGAPAVNLQPGSMVFRRTSGPVDLR------HLS-----LWWT 102
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W P SW HP G S+I+ R HPVVHV++ DA AY
Sbjct: 103 WT-----------------------PGASWRHPLGPTSSIDKRAAHPVVHVAYEDAEAYA 139
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWE RGGL+ + WG + G+ AN W GEFP GY T
Sbjct: 140 AWAGLALPTEAEWEVAARGGLDRADYTWGEHPEFPGQRLANYWHGEFPWR--PEPGYGRT 197
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPT-TGTDKVKKGGSYLCN 311
++ N +GL++M GNVWEWT DW+ A SY+P P T KV KGGS+LC
Sbjct: 198 TEAGAFPPNDYGLFDMAGNVWEWTTDWYGGTE--AASYDPAQPQFTVPRKVVKGGSFLCA 255
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+ YC R+R AAR D+ ++GFRC
Sbjct: 256 DSYCMRYRPAARRPQMIDTGMSHIGFRC 283
>gi|374371504|ref|ZP_09629460.1| hypothetical protein OR16_38974 [Cupriavidus basilensis OR16]
gi|373096914|gb|EHP38079.1| hypothetical protein OR16_38974 [Cupriavidus basilensis OR16]
Length = 305
Score = 172 bits (436), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 113/210 (53%), Gaps = 22/210 (10%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
R P W HP G DS I + +HPVV VS+ DA AY W G RLPTEAEWE+ RGGLE
Sbjct: 25 RYVPGADWRHPNGPDSNITGKDDHPVVQVSYEDAQAYAKWAGKRLPTEAEWEFAARGGLE 84
Query: 215 NRLFPWGNNLTPRGEHRANVWQGE----FPTNNTAADGYLSTAPVMSYKENKFGLYNMVG 270
+ WG+ P G ANVWQG+ FP + A G L T+PV ++ N +GL +M G
Sbjct: 85 QATYGWGDQFAPDGRQMANVWQGQQVQPFPVVSAKAGGALGTSPVGTFPANGYGLSDMTG 144
Query: 271 NVWEWTADWWNVHH---------------HPAPSYNPKG---PTTGTDKVKKGGSYLCNE 312
N W+W ADW+ P S++P PT +V +GGS+LCNE
Sbjct: 145 NAWQWVADWYRADQFKREAVSAKLIDNPADPGESWDPSEPGVPTAAPRRVTRGGSFLCNE 204
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC +R +AR P +S +LGFR D
Sbjct: 205 DYCLSYRPSARRGTDPYNSMSHLGFRLVED 234
>gi|326382909|ref|ZP_08204599.1| hypothetical protein SCNU_08218 [Gordonia neofelifaecis NRRL
B-59395]
gi|326198499|gb|EGD55683.1| hypothetical protein SCNU_08218 [Gordonia neofelifaecis NRRL
B-59395]
Length = 337
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 115/333 (34%), Positives = 150/333 (45%), Gaps = 49/333 (14%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+++MV +P F MG++ E P+ + AF++D+H V+ +F+ FV ATGYVT
Sbjct: 45 HRNMVRVPAGEFTMGSDD---FFPEERPAHAREVGAFWIDRHPVTVAEFRRFVQATGYVT 101
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AE+ D + L E RA GL V + V
Sbjct: 102 VAERDPDPARYPDALPGELRAG------------GL------------VFVRPPRQVPLD 137
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
WR W Y P W P G DS + R HPV V +DA AY
Sbjct: 138 DWR-------RWWAY----------VPGAQWRRPFGPDSDVRARDRHPVTQVGIDDARAY 180
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG--- 248
W G LPTEAEWEY G + + WG+ PRG AN W G FP+ G
Sbjct: 181 AEWAGKALPTEAEWEYAAASGRGDTTYAWGDEFAPRGRRMANTWMGAFPSEFVPGRGQSE 240
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK--VKKGG 306
T PV +Y N F L +M GNVWEWT D ++ H + P G + V KGG
Sbjct: 241 VPGTTPVGTYPPNAFDLVDMTGNVWEWTEDRYSDSHRRSTCCPPTRTGDGGEDHYVVKGG 300
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
S+LC YC R+R AAR D+S+ ++GFRC
Sbjct: 301 SFLCAPNYCLRYRPAARQSQDSDTSSCHIGFRC 333
>gi|120402330|ref|YP_952159.1| hypothetical protein Mvan_1319 [Mycobacterium vanbaalenii PYR-1]
gi|119955148|gb|ABM12153.1| protein of unknown function DUF323 [Mycobacterium vanbaalenii
PYR-1]
Length = 290
Score = 172 bits (435), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/331 (35%), Positives = 164/331 (49%), Gaps = 49/331 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V L G +FRMG+ + E P VT+ + +++H V+N QF FV+ TGY T A
Sbjct: 4 ELVALDGGSFRMGSTR---FYPEEAPVHTVTVAPYAIERHPVTNAQFASFVADTGYRTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E D ++ P + ++ + V PV W W
Sbjct: 61 ELAPDPALY-PGAAPQDLVPGALVFRPTA--------------GPVDLTDWRQ------W 99
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
WE+ P W HP G S+++ + +HPVV V + DA AY
Sbjct: 100 ----------WEW----------VPGADWRHPSGPGSSVDDKPDHPVVQVCYADAAAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWE+ R G ++ WG+ P G AN WQG FP N A G+ T+
Sbjct: 140 WAGRRLPTEAEWEFAARAG-STAVYAWGDEAVPGGALMANTWQGAFPYRNDGALGWSGTS 198
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHH---HPAPSYNPK-GPTTGTDKVKKGGSYL 309
PV ++ N +GL +M+GNVWEWT + HH P S P+ GP + KGGS+L
Sbjct: 199 PVGTFPANAYGLVDMIGNVWEWTTTRFAGHHVLDAPVNSCCPQAGPDPAVTQTLKGGSHL 258
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
C +YC+R+R +ARS + DS+ ++GFRCA
Sbjct: 259 CAPEYCHRYRPSARSPQSQDSATTHIGFRCA 289
>gi|237649822|ref|ZP_04524074.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae CCRI 1974]
gi|237820749|ref|ZP_04596594.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae CCRI 1974M2]
Length = 282
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 153/335 (45%), Gaps = 62/335 (18%)
Query: 15 MVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
M + G TF MGTN + + D E P V++ F + V+N +F +FV TGY
Sbjct: 3 MKKIKGGTFMMGTNSEEGFLDDFEDPQVAVSVKDFSIADTPVTNQEFAQFVKETGY---- 58
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
T+ R V I + V
Sbjct: 59 -----------------------------------KTLAERQEWSFVFILF---VPEAER 80
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
G P A W P W HP G +S + ++PVVHV+ DA+A+C
Sbjct: 81 EGYPHPAGAPWWLQV---------PNACWKHPYGENSNLVGLEDYPVVHVALEDALAFCN 131
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NT DG++ TA
Sbjct: 132 WSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTGLDGFIGTA 190
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGS 307
PV + N FGLY M+GNVWEW N + S+N PK + +GGS
Sbjct: 191 PVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGS 247
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 248 FLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 282
>gi|399065820|ref|ZP_10748072.1| hypothetical protein PMI02_04496, partial [Novosphingobium sp.
AP12]
gi|398029111|gb|EJL22600.1| hypothetical protein PMI02_04496, partial [Novosphingobium sp.
AP12]
Length = 266
Score = 171 bits (432), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 110/282 (39%), Positives = 139/282 (49%), Gaps = 43/282 (15%)
Query: 1 MVLLPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
M L +P + M +PG F MG+N E P R V + F++D V+N +F
Sbjct: 1 MTLATSPQHAAARTMEWIPGGRFIMGSNH---HYPEERPERPVEVAGFWIDTTPVTNAEF 57
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
FV+ATG+ T AE D + P E S V R PV
Sbjct: 58 ARFVAATGHRTVAETAPDPRDY-PGALPEMLVPASLVFQPPPR--------------PV- 101
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
R P W+Y R G + W +P G S+IE +HPV
Sbjct: 102 --------------DPRGPASQWWDY--RTGAD--------WRYPTGPGSSIEDFPDHPV 137
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
VHV+ D AY W G LPTEAEWE+ RGG E F WG++L P G H ANVWQG+FP
Sbjct: 138 VHVALADVEAYLAWSGMSLPTEAEWEFAARGGTEKTEFAWGDDLVPGGIHHANVWQGQFP 197
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
NT DG+L T+PV SY N +GL +M+GNVWEWTADWW++
Sbjct: 198 YANTLDDGWLRTSPVGSYPANGYGLLDMIGNVWEWTADWWSM 239
>gi|170690131|ref|ZP_02881298.1| protein of unknown function DUF323 [Burkholderia graminis C4D1M]
gi|170144566|gb|EDT12727.1| protein of unknown function DUF323 [Burkholderia graminis C4D1M]
Length = 357
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 158/336 (47%), Gaps = 75/336 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G TF +G+ + E P+ + + F++DQ +V+N QF FV+ATGYVT+
Sbjct: 79 MVHLHGGTFVLGSTRGY---QDERPAGDGRTRVGGFWIDQTDVTNAQFAAFVNATGYVTD 135
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEK G VF S R +M + +A+ T
Sbjct: 136 AEKQGGAAVFH-----------SPTRAEMDARD----------------------LAWWT 162
Query: 133 W-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W +GA SW HP+G S I+ MN PV V+ DA+AY
Sbjct: 163 WVKGA------------------------SWRHPQGEGSGIDRLMNQPVTLVTQADALAY 198
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAADG 248
W G LPTEAEWEY +GG E + PR G+ AN WQG FP NT DG
Sbjct: 199 ARWLGRDLPTEAEWEYAAKGGNEGADL----DTAPRNAQGKPAANYWQGVFPVLNTNDDG 254
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPSYNPKGPTT---GTDKVK 303
++ APV Y N F LY+M+GN WEWT D + H H P+T T V
Sbjct: 255 HVGLAPVGCYAANGFKLYDMIGNAWEWTKDVYTGAHQSHTNGDTAAVAPSTRRHDTPMVI 314
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 315 KGGSFLCSRDYCVRYRASSREQQEADLPASHIGFRT 350
>gi|425439454|ref|ZP_18819778.1| Sulfatase-modifying factor 1 (fragment) [Microcystis aeruginosa PCC
9717]
gi|389720332|emb|CCH95968.1| Sulfatase-modifying factor 1 (fragment) [Microcystis aeruginosa PCC
9717]
Length = 223
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 91/203 (44%), Positives = 115/203 (56%), Gaps = 20/203 (9%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
+ P +W HPEG S+I++R NHPVVH+ + DA+AY W G LPTEA+WE+ RGGLE
Sbjct: 16 QYMPGANWRHPEGTGSSIKNRENHPVVHIVYEDALAYAKWAGKELPTEAQWEFAARGGLE 75
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
F WG+ P G+ AN WQG+FP N A V SY N +GLY+M+GNVWE
Sbjct: 76 GAEFAWGDEFMPNGKVMANTWQGQFPWQNLKPHAPGPEA-VESYPPNGYGLYDMIGNVWE 134
Query: 275 WTADWWNVHHHPAPSY---NPKGPTTGTD----------------KVKKGGSYLCNEQYC 315
WT DW+ H P+ PK P GT+ KV KGGS+LC YC
Sbjct: 135 WTQDWYQEQHPKNPTKACCTPKNPLGGTEEDSYNLKLHPSMRKPRKVLKGGSFLCAPNYC 194
Query: 316 YRHRCAARSQNTPDSSAGNLGFR 338
R+R AAR D+S ++GFR
Sbjct: 195 ARYRPAARHPEDIDTSTNHIGFR 217
>gi|26990068|ref|NP_745493.1| hypothetical protein PP_3353 [Pseudomonas putida KT2440]
gi|24984995|gb|AAN68957.1|AE016527_3 conserved hypothetical protein [Pseudomonas putida KT2440]
Length = 291
Score = 170 bits (431), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 152/337 (45%), Gaps = 61/337 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG F G+++ D E P + F++D H V+N QF FV ATGYVT AE
Sbjct: 1 MVHVPGGEFSFGSSR---FYDEEGPPHPAKVSGFWIDVHPVTNAQFARFVKATGYVTHAE 57
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ G +P L + R + V H +G D V+ W
Sbjct: 58 R-GTRVEDDPALPDALRIPGAMVFH-----QGAD----------VLGPGW---------- 91
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+ P +W HP+G S++ NHPVV ++ DA AY W
Sbjct: 92 --------------------QFVPGANWRHPQGPGSSLAGLDNHPVVQIALEDAQAYARW 131
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LP+EA+ EY RGGL + F WG P+G+ AN WQG+FP N A DG+ T+P
Sbjct: 132 AGRELPSEAQLEYAMRGGLTDADFSWGTTEQPKGKLMANTWQGQFPYRNAAKDGFTGTSP 191
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTD----------KV 302
V + N FGL++ GNVWE T + H +P GP V
Sbjct: 192 VGCFPANGFGLFDAGGNVWELTRTGYRPGHDAQRDAKLDPSGPALSDSFDPADPGVPVAV 251
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KGGS+LC+ C R+R +AR + ++GFR
Sbjct: 252 IKGGSHLCSADRCMRYRPSARQPQPVFMTTSHVGFRT 288
>gi|296141015|ref|YP_003648258.1| hypothetical protein [Tsukamurella paurometabola DSM 20162]
gi|296029149|gb|ADG79919.1| protein of unknown function DUF323 [Tsukamurella paurometabola DSM
20162]
Length = 259
Score = 170 bits (431), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 84/162 (51%), Positives = 112/162 (69%), Gaps = 5/162 (3%)
Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
HPVVHVS+ DA+AYC W G LPTEA+WEY RGGLE+ +PWGN GE R +++ G
Sbjct: 98 HPVVHVSYTDALAYCGWAGRALPTEAQWEYAARGGLESARYPWGNEHPAPGE-RCHIFTG 156
Query: 238 EFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGPT 296
FP T G + T P S+ N +G+Y VGNVWEW +D ++ ++ +P+ +P GP
Sbjct: 157 SFPDAPT---GPVGTIPAQSFAPNGYGVYQAVGNVWEWCSDRYSARYYRVSPTDSPGGPD 213
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
GT +V +GGS+LC++ YCYR+R AARS+NTP SSA N+GFR
Sbjct: 214 RGTARVLRGGSHLCHDSYCYRYRVAARSRNTPSSSASNIGFR 255
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 73/131 (55%), Gaps = 6/131 (4%)
Query: 35 DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE-----PLLSEE 89
DGE P V+L F +D V+ F F +ATGY+T+AE+FG + VF+ P +
Sbjct: 12 DGEQPVHPVSLPDFEIDATTVTVAAFAAFATATGYITDAERFGASAVFQASASAPGTPVD 71
Query: 90 ERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCR 149
V R G ++ HPVVH+S+ DA+AYC W G LPTEA+WEY R
Sbjct: 72 GTPWWRTVPGADFRTPGGPGSVALD-EHPVVHVSYTDALAYCGWAGRALPTEAQWEYAAR 130
Query: 150 GGLENRLFPWG 160
GGLE+ +PWG
Sbjct: 131 GGLESARYPWG 141
>gi|377566259|ref|ZP_09795520.1| hypothetical protein GOSPT_114_00280 [Gordonia sputi NBRC 100414]
gi|377526513|dbj|GAB40685.1| hypothetical protein GOSPT_114_00280 [Gordonia sputi NBRC 100414]
Length = 304
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 129/339 (38%), Positives = 168/339 (49%), Gaps = 57/339 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V L G F MG+++ E P+ + AF +++H V+N Q+ FVS TGYVT A
Sbjct: 9 ELVELTGGVFCMGSDR---FYPEERPAHERRVGAFAIERHPVTNAQYAAFVSDTGYVTVA 65
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E D F G D + + +V V W
Sbjct: 66 ES----------------------DIDPVDFPGSDPAM--LVPGALVFTPTAGPVDLTDW 101
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R A W R P SW HP G + I +HPV +++ DAVAY
Sbjct: 102 R-------AWW----------RWQPGASWRHPLGPATDIVGHDDHPVTQIAYADAVAYAA 144
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG+ + WG+ L P GE AN W G FP A+DG+ T+
Sbjct: 145 WAGRRLPTEAEWEYAARGGVVGHEYAWGDELHPEGEILANTWIGNFP---YASDGWSGTS 201
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKG------PTTG--TDKVK 303
V SY N FGL +M+GNVWE T+D + H P + +P G PTT +V
Sbjct: 202 RVGSYPPNGFGLLDMIGNVWERTSDVFTPRHVPPDLATVSPDGRPDLLTPTTSPQVTRVT 261
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGS+LC+ QYC R+R AARS + DS+ +LGFRCAAD
Sbjct: 262 KGGSHLCSPQYCRRYRPAARSPQSDDSATSHLGFRCAAD 300
>gi|389864487|ref|YP_006366727.1| sulfatase-modifying factor 1 [Modestobacter marinus]
gi|388486690|emb|CCH88242.1| Sulfatase-modifying factor 1 [Modestobacter marinus]
Length = 310
Score = 170 bits (430), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 109/199 (54%), Gaps = 10/199 (5%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
R P W HP G ST++ HPVVHV W DA+AY W G RLPTEAEWE RGG
Sbjct: 112 RWVPGADWRHPGGPGSTLDGLERHPVVHVGWEDALAYAAWAGRRLPTEAEWERAARGGAP 171
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
++ WG PRG AN W+G FP NT+ G T+PV Y N +GL +M+GNVWE
Sbjct: 172 PTVYAWGVEARPRGRAMANTWEGRFPWENTSPAGRTRTSPVGVYPANGYGLVDMIGNVWE 231
Query: 275 WTADWWNVHHHPAPSYN----------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARS 324
WT+ W H P + P + +V KGGS+LC YC+R+R AR
Sbjct: 232 WTSSPWTEDHGADPESDQVAHACCGGGPPAVSETDRRVMKGGSHLCAPTYCHRYRPPARQ 291
Query: 325 QNTPDSSAGNLGFRCAADK 343
+ SS +LGFRCA D+
Sbjct: 292 GHDVRSSTSHLGFRCAVDR 310
>gi|302556760|ref|ZP_07309102.1| sulfatase-modifying factor 1 [Streptomyces griseoflavus Tu4000]
gi|302474378|gb|EFL37471.1| sulfatase-modifying factor 1 [Streptomyces griseoflavus Tu4000]
Length = 243
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 103/261 (39%), Positives = 125/261 (47%), Gaps = 52/261 (19%)
Query: 4 LPAPPVERYKDMVLLPGDTFRMGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
LPA + V LPG F MG DGE P V L F++D+ V+N +F
Sbjct: 24 LPAAGKRSTRGQVRLPGGEFAMGDAFGEGYRADGETPVHPVRLTPFHIDETAVTNARFAA 83
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV ATG+VT+AE+ G + VF ++ P I
Sbjct: 84 FVKATGHVTDAERHGSSAVFHLVVDA-----------------------------PAADI 114
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
N A A W RG W PEG+ S + R NHPVVH
Sbjct: 115 LGNAA-------------GAPWWINVRGA---------HWRRPEGVRSDVTGRQNHPVVH 152
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
VSWNDA AY W G RLPTEAEWEY RGGL+ R + WG+ LTP G R N+WQG FP
Sbjct: 153 VSWNDASAYARWAGKRLPTEAEWEYAARGGLDGRRYAWGDELTPDGRWRCNIWQGRFPHT 212
Query: 243 NTAADGYLSTAPVMSYKENKF 263
NTA DG+L+TAPV SY+ +
Sbjct: 213 NTADDGHLTTAPVKSYRPQRL 233
>gi|148547626|ref|YP_001267728.1| hypothetical protein Pput_2405 [Pseudomonas putida F1]
gi|148511684|gb|ABQ78544.1| protein of unknown function DUF323 [Pseudomonas putida F1]
Length = 341
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 115/348 (33%), Positives = 157/348 (45%), Gaps = 62/348 (17%)
Query: 4 LPAPPVERYKD-MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
LPA ++D MV +PG F G+++ D E P + F++D H V+N QF
Sbjct: 39 LPAQRQGPWRDGMVHVPGGEFSFGSSR---FYDEEGPPHPAKVSGFWIDVHPVTNAQFAR 95
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV ATGYVT AE+ G +P L R + V H +G D V+
Sbjct: 96 FVKATGYVTHAER-GTRVEDDPALPGALRVPGAMVFH-----QGAD----------VLSP 139
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
W + P +W HP+G S++ NHPVV
Sbjct: 140 GW------------------------------QFVPGANWRHPQGPGSSLAGLDNHPVVQ 169
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
++ DA AY W G LP+EA+ EY RGGL + F WG P+G+ AN WQG+FP
Sbjct: 170 IALEDAQAYARWAGRELPSEAQLEYAMRGGLTDADFSWGATEQPKGKLMANTWQGQFPYR 229
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--PSYNPKGPTTGTD 300
N A DG+ T+PV + N GL++ GNVWE T + H +P GPT
Sbjct: 230 NAAKDGFTGTSPVGCFPANGLGLFDAGGNVWELTRTGYRPGHDAQRDAKLDPSGPTLSDS 289
Query: 301 ----------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC+ C R+R +AR + ++GFR
Sbjct: 290 FDPADPGVPVAVIKGGSHLCSADRCMRYRPSARQPQPVFMTTSHVGFR 337
>gi|384097485|ref|ZP_09998606.1| hypothetical protein W5A_02480 [Imtechella halotolerans K1]
gi|383837453|gb|EID76853.1| hypothetical protein W5A_02480 [Imtechella halotolerans K1]
Length = 347
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 89/193 (46%), Positives = 116/193 (60%), Gaps = 16/193 (8%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
SW HP G +S IE + +HPVV VSW DA AYC W G RLPTEAE+EY RG +P
Sbjct: 159 ASWKHPYGPNSNIEGKDDHPVVQVSWYDAQAYCHWSGKRLPTEAEFEYLSRGQKTEESYP 218
Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
WGN+ + +AN +QG+FP N+ DGY TA V S++ N FGLY++ GNVWEWT D
Sbjct: 219 WGNDFS-IATQKANFFQGDFPNFNSMDDGYNKTAAVKSFESNGFGLYDIAGNVWEWTLDT 277
Query: 280 WNVHHHPAPS----------YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
+ + + +NPK +KV +GGSYLC+E YC +R AAR ++PD
Sbjct: 278 YYPNAYQLKKKHEQDFFLTYHNPK-----QEKVVRGGSYLCSESYCTGYRSAARMSSSPD 332
Query: 330 SSAGNLGFRCAAD 342
+S + GFR D
Sbjct: 333 TSLEHTGFRTVLD 345
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 88/160 (55%), Gaps = 16/160 (10%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++MV + G TF MG N D EFP T + ++D+ EV+N QF++F+ ATGY+T
Sbjct: 67 ENMVFIKGGTFVMGGNNAQARPD-EFPQHKDTANDIWVDKTEVTNAQFKKFIEATGYITT 125
Query: 73 AEKFGDT---------FVFEPLLSEE--ERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
AE+ + VF+P + + + + +H G +S IE + HPVV
Sbjct: 126 AERSFEVEGQSYPPGALVFDPNQPDFWWKFVEGASWKHPY----GPNSNIEGKDDHPVVQ 181
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
+SW DA AYC W G RLPTEAE+EY RG +PWG+
Sbjct: 182 VSWYDAQAYCHWSGKRLPTEAEFEYLSRGQKTEESYPWGN 221
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 30/43 (69%)
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
+KV +GGSYLC+E YC +R AAR ++PD+S + GFR D
Sbjct: 303 EKVVRGGSYLCSESYCTGYRSAARMSSSPDTSLEHTGFRTVLD 345
>gi|154251586|ref|YP_001412410.1| hypothetical protein Plav_1132 [Parvibaculum lavamentivorans DS-1]
gi|154155536|gb|ABS62753.1| protein of unknown function DUF323 [Parvibaculum lavamentivorans
DS-1]
Length = 318
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 114/328 (34%), Positives = 154/328 (46%), Gaps = 73/328 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G TF MG + E P R V+++AF++D+ EV+N +F EFV+ATGYVT AE
Sbjct: 55 MVRVEGGTFMMGDT----LYPEEGPLREVSVEAFWMDRTEVTNAEFAEFVAATGYVTVAE 110
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA---YC 131
+ DT R GL E + VV + ND
Sbjct: 111 RPVDT----------------------ARHPGLPP--EMQKPGAVVFVMPNDVSGTGDIS 146
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W W+Y P +W HP G D++IE R + PV +++ DA AY
Sbjct: 147 QW----------WQY----------IPGANWRHPGGPDTSIEERDHFPVTAIAYEDAQAY 186
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR-ANVWQGEFPTNNTAADGYL 250
W+G LPTEA+WE+ R + PR + R AN WQG FP N DG++
Sbjct: 187 AAWKGRVLPTEAQWEWAAREAK-------ADAPPPRAQPREANTWQGLFPVANNTEDGFV 239
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLC 310
APV ++ N GLY+M+GN+WEWT D + T +V KGGS+LC
Sbjct: 240 GIAPVGCFEPNALGLYDMIGNLWEWTVDEYE--------------TVPGARVIKGGSWLC 285
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
YC R+R AR D + +LGFR
Sbjct: 286 APNYCLRYRPGARQPQEADLATTHLGFR 313
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 30/56 (53%), Gaps = 1/56 (1%)
Query: 333 GNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 388
GNL + D+ T +V KGGS+LC YC R+R AR D + +LGFR
Sbjct: 259 GNL-WEWTVDEYETVPGARVIKGGSWLCAPNYCLRYRPGARQPQEADLATTHLGFR 313
>gi|226437577|ref|NP_001139805.1| sulfatase-modifying factor 2 isoform g [Homo sapiens]
gi|119628385|gb|EAX07980.1| sulfatase modifying factor 2, isoform CRA_f [Homo sapiens]
gi|119628387|gb|EAX07982.1| sulfatase modifying factor 2, isoform CRA_f [Homo sapiens]
gi|194390888|dbj|BAG62203.1| unnamed protein product [Homo sapiens]
Length = 213
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 86/190 (45%), Positives = 115/190 (60%), Gaps = 19/190 (10%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W P G S I R+ HPV+HVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWG
Sbjct: 38 WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWG 97
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWW 280
N P +R N+WQG+FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA
Sbjct: 98 NWFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA--- 151
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+P +V +G S++ + +R R R NTPDS++ NLGFR
Sbjct: 152 ----------SPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 201
Query: 339 CAADKGPTTG 348
CAAD G G
Sbjct: 202 CAADAGRPPG 211
>gi|444725750|gb|ELW66304.1| Sulfatase-modifying factor 2 [Tupaia chinensis]
Length = 285
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/332 (35%), Positives = 153/332 (46%), Gaps = 83/332 (25%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGTN P +DGE P R VT+ F +D V+N F++FV Y TEAE
Sbjct: 29 MVQLPGGRFLMGTNSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 87
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S E + K ++ + G S I R+ PVVH+SWNDA YC W+
Sbjct: 88 MFGWSFVFEDFVSAELKNKATE----QMKPAGPGSGIRERLELPVVHVSWNDARTYCAWQ 143
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+ P +R N W
Sbjct: 144 GKRLPTEEEWEFAARGGLQGQVYPWGNRFQP--------NRTN---------------LW 180
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G P E G G FP P NN
Sbjct: 181 QG-EFPKGDRAEDGFHGLSPVNAFP--------------------PQNN----------- 208
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNE 312
+GLY++VGNVWEWTA Y G +V +G S++ +
Sbjct: 209 --------YGLYDLVGNVWEWTAS----------PYQAAGQDM---RVLRGASWIDTADG 247
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+R R R NTPDS++ NLGFRCA+ G
Sbjct: 248 SANHRARVTTRMGNTPDSASDNLGFRCASSAG 279
>gi|116251674|ref|YP_767512.1| hypothetical protein RL1910 [Rhizobium leguminosarum bv. viciae
3841]
gi|115256322|emb|CAK07403.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 347
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 123/344 (35%), Positives = 170/344 (49%), Gaps = 65/344 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+ +PG TF +G++K E P+ VT+D F++ + V+N QF++F ATGYVT AE
Sbjct: 41 FIWIPGRTFTVGSDKHY---PEEAPAHPVTVDGFWMAETPVTNRQFKQFAEATGYVTVAE 97
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K D + E RA S V K G D + W
Sbjct: 98 KAPDPRDYPGAKPEMLRAG-SLVFTQPKTVSGAD-----------ISQWWT--------- 136
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+++G +W P G S + +++HPVVHV++ DA AY W
Sbjct: 137 ---------FKFGA------------NWRRPLGGLSDLRGKLDHPVVHVAYADAAAYAHW 175
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEAEWE RGGL++ + WG+ P G AN W G FP ++T G+ T+P
Sbjct: 176 AGLDLPTEAEWELAARGGLDDAEYAWGDEFAPGGVPMANTWSGTFPIHSTKPKGHERTSP 235
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP------------------SYNPKGPT 296
V S+ N FGLY+M+GNVWEWT+D+W+ HP P SY+P+ P
Sbjct: 236 VRSFPPNGFGLYDMIGNVWEWTSDYWST-RHPEPAKHSCCIPSNPRGRDAEASYDPRQPE 294
Query: 297 TG-TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC YC R+R AAR D+S ++GFRC
Sbjct: 295 IRIARRVLKGGSHLCAPNYCRRYRPAARHAEPEDTSTSHVGFRC 338
>gi|226363895|ref|YP_002781677.1| hypothetical protein ROP_44850 [Rhodococcus opacus B4]
gi|226242384|dbj|BAH52732.1| hypothetical protein [Rhodococcus opacus B4]
Length = 310
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 129/342 (37%), Positives = 162/342 (47%), Gaps = 71/342 (20%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+P TF MG+++ E P+ V +DAF ++ H+V+N QF FV TGYVT AE+
Sbjct: 12 IPAQTFTMGSDR---HYPEEGPAHRVAVDAFAIEAHQVTNAQFTTFVRDTGYVTVAERPL 68
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
D F P E S V F ++ R H+S W
Sbjct: 69 DPADF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 105
Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
W G SW P G STI R HPVVHV+ DA AY W
Sbjct: 106 --------WTVGA------------SWQRPTGPASTIAGRERHPVVHVAHEDAAAYAQWA 145
Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
G LPTEA+WE RGGL+ F WG+ P E AN W G+FP A GY +T V
Sbjct: 146 GRSLPTEAQWEAAARGGLDQAEFTWGD--APASEPSANYWHGDFPWR--ADPGYGTTTAV 201
Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGP--TTGTD------- 300
SY+ N +GL++M GNVWEWTADW+ H AP NP+GP T D
Sbjct: 202 GSYRPNGYGLFDMAGNVWEWTADWYADRHRDTAESCCAPR-NPRGPGMTASFDPRQPQFR 260
Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC + YC R+R AAR D+ ++GFRC
Sbjct: 261 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 302
>gi|384106318|ref|ZP_10007225.1| hypothetical protein W59_33398 [Rhodococcus imtechensis RKJ300]
gi|383833654|gb|EID73104.1| hypothetical protein W59_33398 [Rhodococcus imtechensis RKJ300]
Length = 350
Score = 168 bits (426), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 134/386 (34%), Positives = 175/386 (45%), Gaps = 76/386 (19%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+P TF MG+++ E P+ V +DAF ++ H+V+ Q+ FV TGYVT AE+
Sbjct: 13 IPAHTFTMGSDR---HYPEESPAHRVAIDAFAIEAHQVTTAQYAAFVRDTGYVTVAERPL 69
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
D F P E S V F ++ R H+S W
Sbjct: 70 DPADF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 106
Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
W G SW HP G STI R HPVVHV+ DA AY W
Sbjct: 107 --------WTVG------------ASWRHPTGPASTITGREQHPVVHVAHEDAAAYAQWA 146
Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
G LPTEA+WE RGGL++ F WG+ AN W GEFP A GY +T V
Sbjct: 147 GRSLPTEAQWEAAARGGLDHAEFTWGDAPESESPPLANYWHGEFPWR--ADPGYGTTTAV 204
Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------- 300
SY N++GL++M GNVWEWTADW++ H AP NP+GP
Sbjct: 205 GSYPPNEYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPDMSESFDPRQPQFR 263
Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGS 357
+V KGGS+LC + YC R+R AAR D+ ++GFRC T D+++ +
Sbjct: 264 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRCV-----TAYPDQIRTTTA 318
Query: 358 YLCNEQYCYRHRCAAR--SQNTPDSS 381
+ AA N+PDSS
Sbjct: 319 SSASTVTTLDVTTAANVAGANSPDSS 344
>gi|332265468|ref|XP_003281742.1| PREDICTED: sulfatase-modifying factor 2 isoform 3 [Nomascus
leucogenys]
Length = 213
Score = 168 bits (425), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 85/190 (44%), Positives = 114/190 (60%), Gaps = 19/190 (10%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W P G S I R+ HPV+HVSWNDA AYC WRG RLP E EWE+ RGGL+ +++PWG
Sbjct: 38 WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAWRGKRLPMEEEWEFAARGGLKGQVYPWG 97
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWW 280
N P +R N+WQG+FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA
Sbjct: 98 NRFQP---NRTNLWQGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA--- 151
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+P +V +G S++ + +R R R NTPDS++ NLGFR
Sbjct: 152 ----------SPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 201
Query: 339 CAADKGPTTG 348
CAAD G G
Sbjct: 202 CAADAGRPPG 211
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 162 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 206
>gi|397698240|ref|YP_006536123.1| hypothetical protein T1E_5508 [Pseudomonas putida DOT-T1E]
gi|397334970|gb|AFO51329.1| hypothetical protein T1E_5508 [Pseudomonas putida DOT-T1E]
Length = 218
Score = 167 bits (424), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 84/185 (45%), Positives = 106/185 (57%), Gaps = 1/185 (0%)
Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 215
L P W HP G S ++ HPVVHV DA+AY W G LP E EWEY GG +
Sbjct: 31 LVPGADWRHPLGPGSDLQGMAQHPVVHVGLEDALAYAQWAGCDLPGEPEWEYAAWGGTQQ 90
Query: 216 RLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
F WG++L P G+H AN WQG FP N DGY T+PV ++ N +GLY+M+GNVWEW
Sbjct: 91 GEFVWGHHLVPEGKHMANTWQGNFPAENLQTDGYSRTSPVGAFPANGYGLYDMIGNVWEW 150
Query: 276 TADWWN-VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
T + H P S + KV KGGS+LC YC R+R ARS D+++ +
Sbjct: 151 TTTVFQPSHASPRKSCCANARSAAQMKVLKGGSHLCAPNYCQRYRPPARSPQATDTTSSH 210
Query: 335 LGFRC 339
+GFRC
Sbjct: 211 IGFRC 215
>gi|397734685|ref|ZP_10501389.1| hypothetical protein JVH1_5881 [Rhodococcus sp. JVH1]
gi|396929473|gb|EJI96678.1| hypothetical protein JVH1_5881 [Rhodococcus sp. JVH1]
Length = 312
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 158/342 (46%), Gaps = 69/342 (20%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+P TF MG+++ E P+ V +DAF ++ H+V+ QF FV TGYVT AE+
Sbjct: 12 IPAQTFTMGSDR---HYPEEGPAHRVAVDAFAIETHQVTTAQFARFVGDTGYVTVAERPL 68
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
D F P E S V F ++ R H+S W
Sbjct: 69 DPAEF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 105
Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
W G SW HP G STI R HPVVHV+ DA AY W
Sbjct: 106 --------WTVGA------------SWRHPTGPGSTITGRETHPVVHVAHEDAAAYAEWA 145
Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
G LPTEA+WE RGGL++ F WG+ AN W GEFP GY +T V
Sbjct: 146 GRSLPTEAQWEAAARGGLDHTEFTWGDTAESESPPLANYWHGEFPWRPDP--GYGTTTAV 203
Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------- 300
SY N +GL++M GNVWEWTADW++ H AP NP+GP
Sbjct: 204 GSYPPNDYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPGMSESFDARQPQFR 262
Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC + YC R+R AAR D+ ++GFRC
Sbjct: 263 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 304
>gi|335037887|ref|ZP_08531187.1| hypothetical protein AGRO_5199 [Agrobacterium sp. ATCC 31749]
gi|333790728|gb|EGL62125.1| hypothetical protein AGRO_5199 [Agrobacterium sp. ATCC 31749]
Length = 326
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/183 (49%), Positives = 113/183 (61%), Gaps = 2/183 (1%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W PEG S + R HPV HVSWNDA A+ TW G RLP+E EWE RGG + +PWG
Sbjct: 124 WDVPEGPGSGLSGRGRHPVTHVSWNDAQAFATWAGGRLPSEIEWETAARGGDADAQYPWG 183
Query: 222 NNLTPRGEHR-ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
N + N+WQGEFP +NTA DG+ STAPV + N FGL+NMVGNVWEWTA+ +
Sbjct: 184 NEEPDDLDFNPCNIWQGEFPNSNTAVDGFASTAPVDQFAANAFGLHNMVGNVWEWTAEPF 243
Query: 281 NVHH-HPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
VH + DK+ KGGS+LC++ YC+R+R AARS D+S ++GFR
Sbjct: 244 KVHSLRREAKIRNAAASREKDKLMKGGSFLCHKSYCWRYRIAARSSAAADTSIAHVGFRV 303
Query: 340 AAD 342
D
Sbjct: 304 VFD 306
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 99/177 (55%), Gaps = 10/177 (5%)
Query: 2 VLLPAPPVERYKDM----VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSN 57
V LP P R + M V + GTN+P+ +DGE R + A+Y+D V+N
Sbjct: 16 VALPVPHWTRDRMMPERFVRIEAQRTFSGTNRPVFAQDGEAIVRQAQIQAYYIDPFAVTN 75
Query: 58 TQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ--VRHDMKRF---EGLDSTIE 112
F++FV TG+VT+AE++G +FVF + RA ++ R D R+ EG S +
Sbjct: 76 DWFRDFVEETGHVTDAERYGWSFVFLGVGKAHIRAAVAPWWGRIDGARWDVPEGPGSGLS 135
Query: 113 HRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGID 169
R HPV H+SWNDA A+ TW G RLP+E EWE RGG + +PWG+ P+ +D
Sbjct: 136 GRGRHPVTHVSWNDAQAFATWAGGRLPSEIEWETAARGGDADAQYPWGN-EEPDDLD 191
>gi|452955888|gb|EME61283.1| hypothetical protein G352_19031 [Rhodococcus ruber BKS 20-38]
Length = 301
Score = 167 bits (422), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 92/206 (44%), Positives = 117/206 (56%), Gaps = 20/206 (9%)
Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 215
P +W HPEG DS + R HPV VSW DA AY W G LPTEAEWE+ RGGL+
Sbjct: 88 FVPGANWRHPEGPDSNVGGRERHPVTQVSWFDARAYAQWAGKELPTEAEWEFAARGGLDR 147
Query: 216 RLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
+ F WG P G NVWQGEFP N DG+ T+PV ++ N + L +M GNVWEW
Sbjct: 148 KPFVWGEEHEPGGRPGGNVWQGEFPWQNLLEDGFAGTSPVGRFRPNGYHLSDMAGNVWEW 207
Query: 276 TADWWNVHH-----HPAPSY------NPKGPT-TGTD--------KVKKGGSYLCNEQYC 315
TAD++ H + AP++ NP+ T TG +V KGGS+LC YC
Sbjct: 208 TADYFTGDHSTRGKNMAPTHSCCIPANPRAETATGIQYPHEPYPRRVIKGGSHLCAPNYC 267
Query: 316 YRHRCAARSQNTPDSSAGNLGFRCAA 341
R+R AAR T ++S ++GFRC +
Sbjct: 268 LRYRPAARQGETEETSTCHIGFRCIS 293
>gi|111021544|ref|YP_704516.1| hypothetical protein RHA1_ro04572 [Rhodococcus jostii RHA1]
gi|110821074|gb|ABG96358.1| conserved hypothetical protein [Rhodococcus jostii RHA1]
Length = 312
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 158/342 (46%), Gaps = 69/342 (20%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+P TF MG+++ E P+ V +DAF ++ H+V+ QF FV TGYVT AE+
Sbjct: 12 IPAQTFTMGSDR---HYPEEGPAHRVAVDAFAIETHQVTTAQFARFVGDTGYVTVAERPL 68
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
D F P E S V F ++ R H+S W
Sbjct: 69 DPAEF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 105
Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
W G SW HP G STI R HPVVHV+ DA AY W
Sbjct: 106 --------WTVGA------------SWRHPTGPGSTITGRETHPVVHVAHEDAAAYAEWA 145
Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
G LPTEA+WE RGGL++ F WG+ AN W GEFP GY +T V
Sbjct: 146 GRSLPTEAQWEAAARGGLDHTEFTWGDAAESESPPLANYWHGEFPWRPDP--GYGTTTAV 203
Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------- 300
SY N +GL++M GNVWEWTADW++ H AP NP+GP
Sbjct: 204 GSYPPNDYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPGMSESFDARQPQFR 262
Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC + YC R+R AAR D+ ++GFRC
Sbjct: 263 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 304
>gi|419961808|ref|ZP_14477811.1| hypothetical protein WSS_A06824 [Rhodococcus opacus M213]
gi|414572832|gb|EKT83522.1| hypothetical protein WSS_A06824 [Rhodococcus opacus M213]
Length = 319
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/340 (36%), Positives = 160/340 (47%), Gaps = 65/340 (19%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+P TF MG+++ E P+ V +DAF ++ H+V+ Q+ FV TGYVT AE+
Sbjct: 13 IPAHTFTMGSDR---HYPEESPAHRVAIDAFAIEAHQVTTAQYAAFVRDTGYVTVAERPL 69
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
D+ F P E S V F ++ R H+S + TW
Sbjct: 70 DSADF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLS-----QWWTWTVG- 110
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
SW HP G STI R HPVVHV+ DA AY W G
Sbjct: 111 ----------------------ASWRHPTGPASTITGREQHPVVHVAHEDAAAYAQWAGR 148
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
LPTEA+WE RGGL+ F WG+ AN W GEFP A GY +T V S
Sbjct: 149 SLPTEAQWEAAARGGLDYAEFTWGDAPESESPPLANYWHGEFPWR--ADPGYGTTTAVGS 206
Query: 258 YKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD----------- 300
Y N++GL++M GNVWEWTADW++ H AP NP+GP
Sbjct: 207 YPPNEYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPGMSESFDPRQPQFRVP 265
Query: 301 -KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC + YC R+R AAR D+ ++GFRC
Sbjct: 266 RRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 305
>gi|432342940|ref|ZP_19592161.1| hypothetical protein Rwratislav_37752 [Rhodococcus wratislaviensis
IFP 2016]
gi|430772042|gb|ELB87849.1| hypothetical protein Rwratislav_37752 [Rhodococcus wratislaviensis
IFP 2016]
Length = 319
Score = 166 bits (421), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 160/342 (46%), Gaps = 69/342 (20%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+P TF MG+++ E P+ V +DAF ++ H+V+ Q+ FV TGYVT AE+
Sbjct: 13 IPAHTFTMGSDR---HYPEESPAHRVAIDAFAIEAHQVTTAQYAAFVRDTGYVTVAERPL 69
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS--WNDAVAYCTWRG 135
D F P E S V F ++ R H+S W
Sbjct: 70 DPADF-PGAPAENLQPGSMV------FTPTRGPVDLR------HLSQWWT---------- 106
Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
W G SW HP G STI R HPVVHV+ DA AY W
Sbjct: 107 --------WTVGA------------SWRHPTGPASTITGREQHPVVHVAHEDAAAYAQWA 146
Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
G LPTEA+WE RGGL++ F WG+ AN W GEFP A GY +T V
Sbjct: 147 GRSLPTEAQWEAAARGGLDHAEFTWGDAPESESPPLANYWHGEFPWR--ADPGYGTTTAV 204
Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHP------APSYNPKGPTTGTD--------- 300
SY N++GL++M GNVWEWTADW++ H AP NP+GP
Sbjct: 205 GSYPPNEYGLFDMAGNVWEWTADWYSDRHADTAESCCAPR-NPRGPGMSESFDPCQPQFR 263
Query: 301 ---KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+V KGGS+LC + YC R+R AAR D+ ++GFRC
Sbjct: 264 VPRRVIKGGSFLCADSYCRRYRPAARRPQMVDTGMSHIGFRC 305
>gi|315646548|ref|ZP_07899666.1| Non-specific serine/threonine protein kinase [Paenibacillus vortex
V453]
gi|315278191|gb|EFU41511.1| Non-specific serine/threonine protein kinase [Paenibacillus vortex
V453]
Length = 170
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 114/220 (51%), Gaps = 51/220 (23%)
Query: 25 MGTN-KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFE 83
MGT+ K +DGE P+R V++ F + V+N +FQ FV TGYVT+AEK+G ++VFE
Sbjct: 1 MGTDTKEGFPRDGEGPARGVSVSGFEISPFTVTNAEFQRFVETTGYVTDAEKYGWSYVFE 60
Query: 84 PLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAE 143
L SEE +A + QV P
Sbjct: 61 LLASEETKASVVQV-----------------------------------------PQGVP 79
Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
W G W PEG DS+IE RM+HPVVHVSWNDA YC W G RLPTEA
Sbjct: 80 WWLVVEGAY---------WAAPEGKDSSIEGRMDHPVVHVSWNDADVYCQWAGVRLPTEA 130
Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
EWEY RGGLE R +PWG+ L GEH+ N+WQG+FP
Sbjct: 131 EWEYAARGGLEGRTYPWGDLLKQDGEHQCNIWQGKFPVQK 170
>gi|148995069|ref|ZP_01823997.1| hypothetical protein CGSSp9BS68_00262 [Streptococcus pneumoniae
SP9-BS68]
gi|168488488|ref|ZP_02712687.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae SP195]
gi|417679760|ref|ZP_12329156.1| hypothetical protein SPAR50_1797 [Streptococcus pneumoniae GA17570]
gi|418126447|ref|ZP_12763352.1| hypothetical protein SPAR86_1818 [Streptococcus pneumoniae GA44511]
gi|418192233|ref|ZP_12828735.1| hypothetical protein SPAR96_1767 [Streptococcus pneumoniae GA47388]
gi|418234994|ref|ZP_12861570.1| hypothetical protein SPAR17_1778 [Streptococcus pneumoniae GA08780]
gi|419508889|ref|ZP_14048540.1| hypothetical protein SPAR114_1782 [Streptococcus pneumoniae
GA49542]
gi|421220875|ref|ZP_15677714.1| hypothetical protein AMCSP05_001612 [Streptococcus pneumoniae
2070425]
gi|421223130|ref|ZP_15679912.1| hypothetical protein AMCSP14_001555 [Streptococcus pneumoniae
2070531]
gi|421279570|ref|ZP_15730376.1| hypothetical protein SPAR44_1768 [Streptococcus pneumoniae GA17301]
gi|421294634|ref|ZP_15745355.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA56113]
gi|147926881|gb|EDK77931.1| hypothetical protein CGSSp9BS68_00262 [Streptococcus pneumoniae
SP9-BS68]
gi|183572735|gb|EDT93263.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae SP195]
gi|332072625|gb|EGI83108.1| hypothetical protein SPAR50_1797 [Streptococcus pneumoniae GA17570]
gi|353795511|gb|EHD75859.1| hypothetical protein SPAR86_1818 [Streptococcus pneumoniae GA44511]
gi|353855319|gb|EHE35289.1| hypothetical protein SPAR96_1767 [Streptococcus pneumoniae GA47388]
gi|353886616|gb|EHE66398.1| hypothetical protein SPAR17_1778 [Streptococcus pneumoniae GA08780]
gi|379610843|gb|EHZ75573.1| hypothetical protein SPAR114_1782 [Streptococcus pneumoniae
GA49542]
gi|395586785|gb|EJG47152.1| hypothetical protein AMCSP05_001612 [Streptococcus pneumoniae
2070425]
gi|395587110|gb|EJG47472.1| hypothetical protein AMCSP14_001555 [Streptococcus pneumoniae
2070531]
gi|395879063|gb|EJG90125.1| hypothetical protein SPAR44_1768 [Streptococcus pneumoniae GA17301]
gi|395893203|gb|EJH04190.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA56113]
Length = 282
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/191 (44%), Positives = 113/191 (59%), Gaps = 10/191 (5%)
Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
P W HP G +S + +HPVVHV+ DA+A+C W G LPTEA+WEY RGG ++
Sbjct: 96 PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154
Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+PWG+ L G + AN WQG FP NTA DG++ TAPV + N FGLY M+GNVWEW
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214
Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
N + S+N PK + +GGS+LC+ YC R+R AAR+ S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271
Query: 332 AGNLGFRCAAD 342
+ +LGFRC +
Sbjct: 272 SSHLGFRCLKE 282
>gi|149013585|ref|ZP_01834174.1| hypothetical protein CGSSp19BS75_12168 [Streptococcus pneumoniae
SP19-BS75]
gi|169833111|ref|YP_001695178.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae Hungary19A-6]
gi|225859569|ref|YP_002741079.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae 70585]
gi|410477158|ref|YP_006743917.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae gamPNI0373]
gi|417687268|ref|ZP_12336542.1| hypothetical protein SPAR68_1832 [Streptococcus pneumoniae GA41301]
gi|417696957|ref|ZP_12346135.1| hypothetical protein SPAR93_1833 [Streptococcus pneumoniae GA47368]
gi|418087480|ref|ZP_12724649.1| hypothetical protein SPAR87_1266 [Streptococcus pneumoniae GA47033]
gi|418092398|ref|ZP_12729538.1| hypothetical protein SPAR84_1773 [Streptococcus pneumoniae GA44452]
gi|418108230|ref|ZP_12745267.1| hypothetical protein SPAR70_1778 [Streptococcus pneumoniae GA41410]
gi|418110763|ref|ZP_12747782.1| hypothetical protein SPAR113_1837 [Streptococcus pneumoniae
GA49447]
gi|418130957|ref|ZP_12767840.1| hypothetical protein SPAR14_1759 [Streptococcus pneumoniae GA07643]
gi|418169893|ref|ZP_12806534.1| hypothetical protein SPAR56_1989 [Streptococcus pneumoniae GA19077]
gi|418187788|ref|ZP_12824311.1| hypothetical protein SPAR92_1763 [Streptococcus pneumoniae GA47360]
gi|418190031|ref|ZP_12826543.1| hypothetical protein SPAR94_1796 [Streptococcus pneumoniae GA47373]
gi|418200866|ref|ZP_12837308.1| hypothetical protein SPAR109_1776 [Streptococcus pneumoniae
GA47976]
gi|418203023|ref|ZP_12839450.1| hypothetical protein SPAR115_1743 [Streptococcus pneumoniae
GA52306]
gi|418230585|ref|ZP_12857184.1| hypothetical protein SPAR136_1835 [Streptococcus pneumoniae
EU-NP01]
gi|419423445|ref|ZP_13963658.1| hypothetical protein SPAR76_1818 [Streptococcus pneumoniae GA43264]
gi|419456157|ref|ZP_13996113.1| hypothetical protein SPAR139_1854 [Streptococcus pneumoniae
EU-NP04]
gi|419460599|ref|ZP_14000527.1| hypothetical protein SPAR2_1776 [Streptococcus pneumoniae GA02270]
gi|419478446|ref|ZP_14018269.1| hypothetical protein SPAR53_1781 [Streptococcus pneumoniae GA18068]
gi|419489505|ref|ZP_14029254.1| hypothetical protein SPAR83_1782 [Streptococcus pneumoniae GA44386]
gi|419521748|ref|ZP_14061343.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA05245]
gi|419524247|ref|ZP_14063820.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA13723]
gi|421207226|ref|ZP_15664277.1| hypothetical protein AMCSP20_001794 [Streptococcus pneumoniae
2090008]
gi|421227986|ref|ZP_15684688.1| hypothetical protein AMCSP08_001806 [Streptococcus pneumoniae
2072047]
gi|421230402|ref|ZP_15687065.1| hypothetical protein AMCSP01_001773 [Streptococcus pneumoniae
2061376]
gi|421271227|ref|ZP_15722080.1| hypothetical protein SPAR48_1783 [Streptococcus pneumoniae SPAR48]
gi|421273449|ref|ZP_15724289.1| hypothetical protein SPAR55_1748 [Streptococcus pneumoniae SPAR55]
gi|421286076|ref|ZP_15736851.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA60190]
gi|421292663|ref|ZP_15743397.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA56348]
gi|421312602|ref|ZP_15763204.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA58981]
gi|444387636|ref|ZP_21185655.1| hypothetical protein PCS125219_01058 [Streptococcus pneumoniae
PCS125219]
gi|444389110|ref|ZP_21187027.1| hypothetical protein PCS70012_00124 [Streptococcus pneumoniae
PCS70012]
gi|444392650|ref|ZP_21190352.1| hypothetical protein PCS81218_01160 [Streptococcus pneumoniae
PCS81218]
gi|444394787|ref|ZP_21192337.1| hypothetical protein PNI0002_00795 [Streptococcus pneumoniae
PNI0002]
gi|444398242|ref|ZP_21195725.1| hypothetical protein PNI0006_01827 [Streptococcus pneumoniae
PNI0006]
gi|444401187|ref|ZP_21198473.1| hypothetical protein PNI0007_02299 [Streptococcus pneumoniae
PNI0007]
gi|444402213|ref|ZP_21199384.1| hypothetical protein PNI0008_00830 [Streptococcus pneumoniae
PNI0008]
gi|444406043|ref|ZP_21202867.1| hypothetical protein PNI0009_01995 [Streptococcus pneumoniae
PNI0009]
gi|444407861|ref|ZP_21204528.1| hypothetical protein PNI0010_01284 [Streptococcus pneumoniae
PNI0010]
gi|444417203|ref|ZP_21213256.1| hypothetical protein PNI0360_00642 [Streptococcus pneumoniae
PNI0360]
gi|444419195|ref|ZP_21215074.1| hypothetical protein PNI0427_00093 [Streptococcus pneumoniae
PNI0427]
gi|147762817|gb|EDK69771.1| hypothetical protein CGSSp19BS75_12168 [Streptococcus pneumoniae
SP19-BS75]
gi|168995613|gb|ACA36225.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae Hungary19A-6]
gi|225720247|gb|ACO16101.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae 70585]
gi|332074158|gb|EGI84636.1| hypothetical protein SPAR68_1832 [Streptococcus pneumoniae GA41301]
gi|332200355|gb|EGJ14428.1| hypothetical protein SPAR93_1833 [Streptococcus pneumoniae GA47368]
gi|353758496|gb|EHD39088.1| hypothetical protein SPAR87_1266 [Streptococcus pneumoniae GA47033]
gi|353763098|gb|EHD43655.1| hypothetical protein SPAR84_1773 [Streptococcus pneumoniae GA44452]
gi|353778507|gb|EHD58975.1| hypothetical protein SPAR70_1778 [Streptococcus pneumoniae GA41410]
gi|353781384|gb|EHD61829.1| hypothetical protein SPAR113_1837 [Streptococcus pneumoniae
GA49447]
gi|353802281|gb|EHD82581.1| hypothetical protein SPAR14_1759 [Streptococcus pneumoniae GA07643]
gi|353833872|gb|EHE13980.1| hypothetical protein SPAR56_1989 [Streptococcus pneumoniae GA19077]
gi|353849773|gb|EHE29778.1| hypothetical protein SPAR92_1763 [Streptococcus pneumoniae GA47360]
gi|353853758|gb|EHE33739.1| hypothetical protein SPAR94_1796 [Streptococcus pneumoniae GA47373]
gi|353863814|gb|EHE43734.1| hypothetical protein SPAR109_1776 [Streptococcus pneumoniae
GA47976]
gi|353866514|gb|EHE46414.1| hypothetical protein SPAR115_1743 [Streptococcus pneumoniae
GA52306]
gi|353885466|gb|EHE65255.1| hypothetical protein SPAR136_1835 [Streptococcus pneumoniae
EU-NP01]
gi|379530735|gb|EHY95974.1| hypothetical protein SPAR2_1776 [Streptococcus pneumoniae GA02270]
gi|379539048|gb|EHZ04228.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA05245]
gi|379555960|gb|EHZ21021.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA13723]
gi|379564958|gb|EHZ29953.1| hypothetical protein SPAR53_1781 [Streptococcus pneumoniae GA18068]
gi|379586017|gb|EHZ50871.1| hypothetical protein SPAR76_1818 [Streptococcus pneumoniae GA43264]
gi|379587047|gb|EHZ51897.1| hypothetical protein SPAR83_1782 [Streptococcus pneumoniae GA44386]
gi|379627597|gb|EHZ92208.1| hypothetical protein SPAR139_1854 [Streptococcus pneumoniae
EU-NP04]
gi|395573948|gb|EJG34533.1| hypothetical protein AMCSP20_001794 [Streptococcus pneumoniae
2090008]
gi|395593414|gb|EJG53662.1| hypothetical protein AMCSP01_001773 [Streptococcus pneumoniae
2061376]
gi|395594270|gb|EJG54510.1| hypothetical protein AMCSP08_001806 [Streptococcus pneumoniae
2072047]
gi|395866423|gb|EJG77552.1| hypothetical protein SPAR48_1783 [Streptococcus pneumoniae SPAR48]
gi|395873880|gb|EJG84970.1| hypothetical protein SPAR55_1748 [Streptococcus pneumoniae SPAR55]
gi|395885384|gb|EJG96408.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA60190]
gi|395891970|gb|EJH02964.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA56348]
gi|395909456|gb|EJH20332.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA58981]
gi|406370103|gb|AFS43793.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae gamPNI0373]
gi|444252514|gb|ELU58977.1| hypothetical protein PCS125219_01058 [Streptococcus pneumoniae
PCS125219]
gi|444257710|gb|ELU64043.1| hypothetical protein PCS70012_00124 [Streptococcus pneumoniae
PCS70012]
gi|444259405|gb|ELU65719.1| hypothetical protein PNI0002_00795 [Streptococcus pneumoniae
PNI0002]
gi|444260899|gb|ELU67207.1| hypothetical protein PNI0006_01827 [Streptococcus pneumoniae
PNI0006]
gi|444263040|gb|ELU69275.1| hypothetical protein PCS81218_01160 [Streptococcus pneumoniae
PCS81218]
gi|444263733|gb|ELU69880.1| hypothetical protein PNI0007_02299 [Streptococcus pneumoniae
PNI0007]
gi|444266835|gb|ELU72766.1| hypothetical protein PNI0008_00830 [Streptococcus pneumoniae
PNI0008]
gi|444270417|gb|ELU76190.1| hypothetical protein PNI0009_01995 [Streptococcus pneumoniae
PNI0009]
gi|444271457|gb|ELU77208.1| hypothetical protein PNI0010_01284 [Streptococcus pneumoniae
PNI0010]
gi|444284369|gb|ELU89517.1| hypothetical protein PNI0360_00642 [Streptococcus pneumoniae
PNI0360]
gi|444287356|gb|ELU92287.1| hypothetical protein PNI0427_00093 [Streptococcus pneumoniae
PNI0427]
Length = 282
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/191 (44%), Positives = 113/191 (59%), Gaps = 10/191 (5%)
Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
P W HP G +S + +HPVVHV+ DA+A+C W G LPTEA+WEY RGG ++
Sbjct: 96 PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154
Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+PWG+ L G + AN WQG FP NTA DG++ TAPV + N FGLY M+GNVWEW
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214
Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
N + S+N PK + +GGS+LC+ YC R+R AAR+ S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271
Query: 332 AGNLGFRCAAD 342
+ +LGFRC +
Sbjct: 272 SSHLGFRCLKE 282
>gi|187919188|ref|YP_001888219.1| hypothetical protein Bphyt_4473 [Burkholderia phytofirmans PsJN]
gi|187717626|gb|ACD18849.1| protein of unknown function DUF323 [Burkholderia phytofirmans PsJN]
Length = 361
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 115/336 (34%), Positives = 152/336 (45%), Gaps = 75/336 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G F G+ L + E P+ + + F++DQ +V+N QF FV ATGYV++
Sbjct: 84 MVHLHGGDFVFGSK---LGYEDERPAGDGKTRVAGFWIDQTDVTNAQFAAFVKATGYVSD 140
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
+E+ G VF +E A+ +A+ T
Sbjct: 141 SERQGGAVVFHTPTRDEMNAR---------------------------------DLAWWT 167
Query: 133 W-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W +GA +W HP G ST+E R+N PV V+ DA+AY
Sbjct: 168 WVKGA------------------------AWNHPTGPGSTLEGRLNQPVTMVTQADALAY 203
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAADG 248
W G LPTEAEWEY + G E + PR G+ AN WQG FP NT DG
Sbjct: 204 ARWLGRDLPTEAEWEYAGKAGREGADL----DTAPRDGHGKPSANYWQGVFPVLNTNEDG 259
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKVK 303
++ APV Y N F LY+M+GN WEWT D + H P T V
Sbjct: 260 HIGLAPVGCYAANDFKLYDMIGNAWEWTKDVYTGPHQSHTNGDTAEVAPLSRKHDTPMVI 319
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
KGGS+LC+ YC R+R A+R Q D A ++GFR
Sbjct: 320 KGGSFLCSRDYCVRYRAASREQQEADLPASHIGFRT 355
>gi|418167413|ref|ZP_12804067.1| hypothetical protein SPAR52_1838 [Streptococcus pneumoniae GA17971]
gi|353828579|gb|EHE08717.1| hypothetical protein SPAR52_1838 [Streptococcus pneumoniae GA17971]
Length = 282
Score = 166 bits (420), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 85/191 (44%), Positives = 113/191 (59%), Gaps = 10/191 (5%)
Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
P W HP G +S + +HPVVHV+ DA+A+C W G LPTEA+WEY RGG ++
Sbjct: 96 PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154
Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+PWG+ L G + AN WQG FP NTA DG++ TAPV + N FGLY M+GNVWEW
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214
Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
N + S+N PK + +GGS+LC+ YC R+R AAR+ S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271
Query: 332 AGNLGFRCAAD 342
+ +LGFRC +
Sbjct: 272 SSHLGFRCLKE 282
>gi|440913314|gb|ELR62778.1| Sulfatase-modifying factor 2, partial [Bos grunniens mutus]
Length = 308
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 118/340 (34%), Positives = 155/340 (45%), Gaps = 75/340 (22%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV LPG F+MGT+ P +DGE P R VT+ F +D V+N F+EFV Y TEA
Sbjct: 37 NMVQLPGGRFQMGTDSPD-GRDGEGPVREVTVKPFAIDIFPVTNKDFREFVREKKYRTEA 95
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE L+S+E R K +Q RM + W W
Sbjct: 96 EVFGWSFVFEDLVSDELRNKATQ-----------------RMQS----LLW--------W 126
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I ++ PVVHVSWNDA AYC
Sbjct: 127 ----LPVE-------------RAF----WRQPAGPGSGIREKLEFPVVHVSWNDARAYCA 165
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT---AADGYL 250
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG G +
Sbjct: 166 WRGKRLPTEEEWEFAARGGLKGQVYPWGNKFQP---NRTNLWQGRGRCKGALVPVGRGGV 222
Query: 251 STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL- 309
+ + + G + A S + +V +G S++
Sbjct: 223 QSGETVPFLRRHVGFMQL---------------REVAESGESRAADQDM-RVLRGASWID 266
Query: 310 -CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+ +R R R NTPDS++ NLGFRCA+ G G
Sbjct: 267 TADGSANHRARVTTRMGNTPDSASDNLGFRCASGAGRPPG 306
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 23/65 (35%), Positives = 35/65 (53%), Gaps = 5/65 (7%)
Query: 332 AGNLGFRCAADKGPTTGTD---KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLG 386
G + R A+ G + D +V +G S++ + +R R R NTPDS++ NLG
Sbjct: 235 VGFMQLREVAESGESRAADQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLG 294
Query: 387 FRCAA 391
FRCA+
Sbjct: 295 FRCAS 299
>gi|402819323|ref|ZP_10868891.1| hypothetical protein IMCC14465_01250 [alpha proteobacterium
IMCC14465]
gi|402511470|gb|EJW21731.1| hypothetical protein IMCC14465_01250 [alpha proteobacterium
IMCC14465]
Length = 338
Score = 166 bits (419), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 120/337 (35%), Positives = 161/337 (47%), Gaps = 62/337 (18%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNV----TLDAFYLDQHEVSNTQFQEFVSATGY 69
+MV + G F +G + +P V TL FY+D++EV+N QF EFV+ATGY
Sbjct: 45 EMVFIKGGKFIIGDSS-------GYPEERVRQQHTLADFYIDKYEVTNAQFAEFVTATGY 97
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT AE+ G + P L + S V +F L+
Sbjct: 98 VTLAER-GLSKEAYPDLDDALLKPGSAVFSPPDKFYTLN--------------------- 135
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+ W W++ P +W HPEG S+IE R ++PVVH+++ DA
Sbjct: 136 FMNW----------WQF----------VPGANWRHPEGPTSSIETRPHYPVVHIAYADAQ 175
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AY W+G LP+EAEWEY + G + R + WG+ P + AN WQG FP T DGY
Sbjct: 176 AYADWKGHDLPSEAEWEYAAKAGSDAR-YAWGDEAYPDDIYMANTWQGRFPMLGTGEDGY 234
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH----HPAPSYNPKGPTTGTDK---- 301
APV + EN GL +M+GNVWEWT D + H P + TT +
Sbjct: 235 HLIAPVGCFPENNNGLADMIGNVWEWTRDSYKPERETITHSGPDGKLRKNTTQQNMDEVY 294
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGSYLC + YC R+R AAR +LGFR
Sbjct: 295 VIKGGSYLCAQNYCLRYRPAARQPQETGLGTNHLGFR 331
>gi|194248090|ref|NP_001123541.1| sulfatase-modifying factor 2 isoform e precursor [Homo sapiens]
gi|119628380|gb|EAX07975.1| sulfatase modifying factor 2, isoform CRA_a [Homo sapiens]
Length = 358
Score = 165 bits (417), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 98/252 (38%), Positives = 128/252 (50%), Gaps = 55/252 (21%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEA
Sbjct: 48 SMVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEA 106
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 107 EMFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W 137
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E + W P G S I R+ HPV+HVSWNDA AYC
Sbjct: 138 ----LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCA 176
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 177 WRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVS 233
Query: 254 PVMSY-KENKFG 264
PV ++ +N +G
Sbjct: 234 PVNAFPAQNNYG 245
>gi|225855238|ref|YP_002736750.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae JJA]
gi|225723961|gb|ACO19814.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae JJA]
Length = 282
Score = 164 bits (416), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 84/191 (43%), Positives = 112/191 (58%), Gaps = 10/191 (5%)
Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
P W HP G +S + +HPVVHV+ DA+A+C W G LPTEA+WEY RGG ++
Sbjct: 96 PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154
Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+PWG+ L G + AN WQG FP NT DG++ TAPV + N FGLY M+GNVWEW
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTGLDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214
Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
N + S+N PK + +GGS+LC+ YC R+R AAR+ S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271
Query: 332 AGNLGFRCAAD 342
+ +LGFRC +
Sbjct: 272 SSHLGFRCLKE 282
>gi|377834088|ref|XP_003689432.1| PREDICTED: sulfatase-modifying factor 2 isoform 2 [Mus musculus]
Length = 190
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 84/186 (45%), Positives = 114/186 (61%), Gaps = 19/186 (10%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W P G S I ++ PVVHVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWG
Sbjct: 15 WRQPAGPGSGIREKLELPVVHVSWNDAGAYCAWRGRRLPTEEEWEFAARGGLKGQVYPWG 74
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWW 280
N P +R N+WQG+FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA
Sbjct: 75 NRFQP---NRTNLWQGKFPKGDKAEDGFHGLSPVNAFPPQNNYGLYDLMGNVWEWTAS-- 129
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+Y P G +V +G S++ + +R R R NTPDS++ NLGFR
Sbjct: 130 --------TYQPAGQDM---RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 178
Query: 339 CAADKG 344
CA+ G
Sbjct: 179 CASSAG 184
>gi|40226435|gb|AAH15600.2| SUMF2 protein, partial [Homo sapiens]
Length = 334
Score = 164 bits (415), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 98/252 (38%), Positives = 128/252 (50%), Gaps = 55/252 (21%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEA
Sbjct: 24 SMVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEA 82
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 83 EMFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W 113
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E + W P G S I R+ HPV+HVSWNDA AYC
Sbjct: 114 ----LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCA 152
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 153 WRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVS 209
Query: 254 PVMSY-KENKFG 264
PV ++ +N +G
Sbjct: 210 PVNAFPAQNNYG 221
>gi|114613483|ref|XP_001160140.1| PREDICTED: sulfatase-modifying factor 2 isoform 9 [Pan troglodytes]
gi|410302666|gb|JAA29933.1| sulfatase modifying factor 2 [Pan troglodytes]
Length = 358
Score = 164 bits (414), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 100/252 (39%), Positives = 129/252 (51%), Gaps = 55/252 (21%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEA
Sbjct: 48 SMVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEA 106
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 107 EIFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W 137
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
LP E R F W P G S I R+ HPV+HVSWNDA AYC
Sbjct: 138 ----LPVE-------------RAF----WRQPAGPGSGILERLEHPVLHVSWNDARAYCA 176
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +
Sbjct: 177 WRGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVS 233
Query: 254 PVMSY-KENKFG 264
PV ++ +N +G
Sbjct: 234 PVNAFPAQNNYG 245
>gi|194396886|ref|YP_002038398.1| hypothetical protein SPG_1694 [Streptococcus pneumoniae G54]
gi|194356553|gb|ACF55001.1| conserved domain protein [Streptococcus pneumoniae G54]
Length = 208
Score = 164 bits (414), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 85/191 (44%), Positives = 114/191 (59%), Gaps = 10/191 (5%)
Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
P W HP G +S + +HPVVHV+ DA+A+C W G LPTEA+WEY RGG ++
Sbjct: 22 PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 80
Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+PWG+ L G + AN WQG FP NTA+DG++ TAPV + N FGLY M+GNVWEW
Sbjct: 81 YPWGDTLLEGGYYHANTWQGRFPYENTASDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 140
Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
N + S+N PK + +GGS+LC+ YC R+R AAR+ S+
Sbjct: 141 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 197
Query: 332 AGNLGFRCAAD 342
+ +LGFRC +
Sbjct: 198 SSHLGFRCLKE 208
>gi|387626987|ref|YP_006063163.1| hypothetical protein INV104_15460 [Streptococcus pneumoniae INV104]
gi|417694666|ref|ZP_12343853.1| hypothetical protein SPAR120_1722 [Streptococcus pneumoniae
GA47901]
gi|421268982|ref|ZP_15719850.1| hypothetical protein SPAR95_1746 [Streptococcus pneumoniae SPAR95]
gi|421308008|ref|ZP_15758649.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA60132]
gi|444383708|ref|ZP_21181892.1| hypothetical protein PCS8106_02106 [Streptococcus pneumoniae
PCS8106]
gi|444386357|ref|ZP_21184416.1| hypothetical protein PCS8203_02222 [Streptococcus pneumoniae
PCS8203]
gi|301794773|emb|CBW37227.1| conserved hypothetical protein [Streptococcus pneumoniae INV104]
gi|332201215|gb|EGJ15286.1| hypothetical protein SPAR120_1722 [Streptococcus pneumoniae
GA47901]
gi|395868030|gb|EJG79149.1| hypothetical protein SPAR95_1746 [Streptococcus pneumoniae SPAR95]
gi|395906908|gb|EJH17805.1| sulfatase-like protein modifying factor 1 [Streptococcus pneumoniae
GA60132]
gi|444247784|gb|ELU54314.1| hypothetical protein PCS8203_02222 [Streptococcus pneumoniae
PCS8203]
gi|444248589|gb|ELU55093.1| hypothetical protein PCS8106_02106 [Streptococcus pneumoniae
PCS8106]
Length = 282
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 84/191 (43%), Positives = 112/191 (58%), Gaps = 10/191 (5%)
Query: 158 PWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 217
P W HP G +S + +HPVVHV+ DA+A+C W G LPTEA+WEY RGG ++
Sbjct: 96 PNACWKHPYGENSNLVGLEDHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE- 154
Query: 218 FPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+PWG+ L G + AN WQG FP NTA D ++ TAPV + N FGLY M+GNVWEW
Sbjct: 155 YPWGDTLLEGGYYHANTWQGRFPYENTALDRFIGTAPVYEFLPNDFGLYQMIGNVWEWCR 214
Query: 278 DWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSS 331
N + S+N PK + +GGS+LC+ YC R+R AAR+ S+
Sbjct: 215 ---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTST 271
Query: 332 AGNLGFRCAAD 342
+ +LGFRC +
Sbjct: 272 SSHLGFRCLKE 282
>gi|424854519|ref|ZP_18278877.1| sulfatase-modifying factor 1 [Rhodococcus opacus PD630]
gi|356664566|gb|EHI44659.1| sulfatase-modifying factor 1 [Rhodococcus opacus PD630]
Length = 300
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/321 (37%), Positives = 152/321 (47%), Gaps = 62/321 (19%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P+ V +DAF ++ H+V+ Q+ FV TGYVT AE+ D F P E S
Sbjct: 10 EGPAHRVAIDAFAIEAHQVTTAQYAAFVRDTGYVTVAERPLDAADF-PGAPAENLQPGSM 68
Query: 97 VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
V F ++ R H+S + TW
Sbjct: 69 V------FTPTRGPVDLR------HLS-----QWWTWTVG-------------------- 91
Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
SW HP G STI R HPVVHV+ DA AY W G LPTEA+WE RGGL++
Sbjct: 92 ---ASWRHPTGPASTITGREQHPVVHVAHEDAAAYAQWAGRSLPTEAQWEAAARGGLDHA 148
Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
F WG+ AN W GEFP A GY +T V S+ N++GL++M GNVWEWT
Sbjct: 149 EFTWGDAPESESPPLANYWHGEFPWR--ADPGYGTTTAVGSFPPNEYGLFDMAGNVWEWT 206
Query: 277 ADWWNVHHHP------APSYNPKGPTTGTD------------KVKKGGSYLCNEQYCYRH 318
ADW++V H AP NP+GP +V KGGS+LC + YC R+
Sbjct: 207 ADWFSVRHADTAESCCAPR-NPRGPGMSESFDPRQPQFRVPRRVIKGGSFLCADSYCRRY 265
Query: 319 RCAARSQNTPDSSAGNLGFRC 339
R AAR D+ ++GFRC
Sbjct: 266 RPAARRPQMVDTGMSHIGFRC 286
>gi|111601485|gb|AAI19481.1| Sumf2 protein [Mus musculus]
Length = 190
Score = 163 bits (412), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/186 (44%), Positives = 114/186 (61%), Gaps = 19/186 (10%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W P G S I ++ PVVH+SWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWG
Sbjct: 15 WRQPAGPGSGIREKLELPVVHMSWNDAGAYCAWRGRRLPTEEEWEFAARGGLKGQVYPWG 74
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTADWW 280
N P +R N+WQG+FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA
Sbjct: 75 NRFQP---NRTNLWQGKFPKGDKAEDGFHGLSPVNAFPPQNNYGLYDLMGNVWEWTAS-- 129
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+Y P G +V +G S++ + +R R R NTPDS++ NLGFR
Sbjct: 130 --------TYQPAGQDM---RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFR 178
Query: 339 CAADKG 344
CA+ G
Sbjct: 179 CASSAG 184
>gi|41472461|gb|AAS07452.1| unknown [Homo sapiens]
Length = 310
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/251 (38%), Positives = 128/251 (50%), Gaps = 55/251 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DG+ P R T+ F +D V+N F++FV Y TEAE
Sbjct: 1 MVQLQGGRFLMGTNSPD-SRDGDGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 60 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 89
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVSWNDA AYC W
Sbjct: 90 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVSWNDARAYCAW 129
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 130 RGKRLPTEEEWEFAARGGLKGQVYPWGNWFQP---NRTNLWQGKFPKGDKAEDGFHGVSP 186
Query: 255 VMSY-KENKFG 264
V ++ +N +G
Sbjct: 187 VNAFPAQNNYG 197
>gi|421787630|ref|ZP_16223975.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-82]
gi|410406523|gb|EKP58527.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-82]
Length = 322
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 114/342 (33%), Positives = 155/342 (45%), Gaps = 83/342 (24%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 48 PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQPTSVP-------AFLIDATEVTNAQFQ 100
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F ++ D HPV
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMF--------------IQPD----------------HPVEE 130
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W W+ +E G SW HP G++++ + + PV
Sbjct: 131 LQW--------WK---------FEKGA------------SWKHPWGLNNSKQPAPHEPVR 161
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTP--RGEHRANVWQGEF 239
V+WNDA AY W G LPTE EWEY +G +N ++++P G AN WQGEF
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQN------SDISPTHEGHIVANFWQGEF 215
Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPT 296
P N DG+ +PV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 216 PYKNIQEDGFKDVSPVGCFSKNPFGLYDLIGNVWEWTQTPYTGSRDHHMGDPSKYRQSHE 275
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 276 QIMQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 317
>gi|392417851|ref|YP_006454456.1| hypothetical protein Mycch_4055 [Mycobacterium chubuense NBB4]
gi|390617627|gb|AFM18777.1| hypothetical protein Mycch_4055 [Mycobacterium chubuense NBB4]
Length = 292
Score = 162 bits (410), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 116/331 (35%), Positives = 158/331 (47%), Gaps = 52/331 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+V +P T +G+++ E P R V + F++ V+N QF EFVSATGYVT AE
Sbjct: 5 LVHVPAQTAVLGSDE---HYPEEGPQRAVEVAEFWIQATPVTNAQFAEFVSATGYVTVAE 61
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D +E A ++ F ++ R H+S + TW
Sbjct: 62 RPLDA-------AEHPGAPKENLQPGSMVFTRTAGPVDLR------HLS-----QWWTWT 103
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P W HP G S+IE R +HPVVHV++ DA AY +W
Sbjct: 104 -----------------------PGACWRHPVGPLSSIEKRADHPVVHVAYEDAEAYASW 140
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G LPTEA+WE RGGL + WG+ P G AN W G+FP GY T P
Sbjct: 141 AGLGLPTEAQWETAARGGLTAATYTWGDEPEPPGRPLANYWHGDFPWRPD--RGYGRTTP 198
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTD-KVKKGGSY 308
V S+ N +GL++ GNVWEWT DW+ + S +P+ P +V KGGS+
Sbjct: 199 VGSFPPNAYGLFDTAGNVWEWTTDWYADRRDARDCCASASRDPRQPQFPVPRRVIKGGSF 258
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
LC + YC R+R AAR D+ ++GFRC
Sbjct: 259 LCADSYCRRYRPAARRPQPVDTGMSHIGFRC 289
>gi|421806414|ref|ZP_16242278.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC035]
gi|410417758|gb|EKP69526.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC035]
Length = 322
Score = 161 bits (408), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 112/340 (32%), Positives = 150/340 (44%), Gaps = 79/340 (23%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 48 PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F + HPV
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEE 130
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W W++G +G +W HP G+++ + + PV
Sbjct: 131 LQW-------------------WKFG-KGA---------NWKHPWGLNNPKQPAPHEPVR 161
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+WNDA AY +W G LP+E EWEY +G +N T G AN WQGEFP
Sbjct: 162 MVTWNDAYAYASWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQI 277
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 317
>gi|421624110|ref|ZP_16064986.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC098]
gi|408702141|gb|EKL47555.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC098]
Length = 322
Score = 161 bits (407), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 116/342 (33%), Positives = 149/342 (43%), Gaps = 79/342 (23%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 48 PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F V+ D HPV
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMF--------------VQPD----------------HPVAE 130
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W WR +E G +W HP G ++ + + PV
Sbjct: 131 LQW--------WR---------FEKGA------------NWKHPWGPNNPKQPAPHEPVR 161
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+WNDA AY W G LPTE EWEY +G +N T G AN WQGEFP
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGEPSKYRQSHEQI 277
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319
>gi|417551536|ref|ZP_12202614.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-18]
gi|417565904|ref|ZP_12216778.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC143]
gi|395557660|gb|EJG23661.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC143]
gi|400385991|gb|EJP49066.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-18]
Length = 322
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 113/341 (33%), Positives = 149/341 (43%), Gaps = 77/341 (22%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDA-----FYLDQHEVSNTQFQE 62
P + M+ LP +GT + G R + L A F +D EV+N QFQE
Sbjct: 48 PKNKEAGMIKLPQGKIVLGTTQ------GYEDERPLNLQATSVPDFLIDATEVTNAQFQE 101
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
FV TGYVT+AEK G +F + HPV +
Sbjct: 102 FVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEEL 131
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVH 182
W W++G +G +W HP G++++ + + PV
Sbjct: 132 QW-------------------WKFG-KGA---------NWKHPWGLNNSKQPAPHEPVRM 162
Query: 183 VSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTN 242
V+WNDA AY W G LPTE EWEY +G +N T G AN WQGEFP
Sbjct: 163 VTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYK 218
Query: 243 NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGT 299
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 219 NVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQIL 278
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 279 QYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319
>gi|375134016|ref|YP_004994666.1| protein kinase [Acinetobacter calcoaceticus PHEA-2]
gi|325121461|gb|ADY80984.1| protein kinase [Acinetobacter calcoaceticus PHEA-2]
Length = 317
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 115/340 (33%), Positives = 151/340 (44%), Gaps = 79/340 (23%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 43 PNNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQPTLVP-------AFLIDATEVTNAQFQ 95
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F ++ D HPV
Sbjct: 96 EFVKQTGYVTDAEKQGGAAMF--------------IQPD----------------HPVEE 125
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W W+ +E G SW HP G++++ + + PV
Sbjct: 126 LQW--------WK---------FEKGA------------SWKHPWGLNNSKQPAPHEPVR 156
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+WNDA AY W G LPTE EWEY +G +N T G AN WQGEFP
Sbjct: 157 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 212
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 213 KNIQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGSRDHHMGDPSKYRQSHEQI 272
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 273 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 312
>gi|239503462|ref|ZP_04662772.1| hypothetical protein AbauAB_14222 [Acinetobacter baumannii AB900]
Length = 340
Score = 161 bits (407), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 112/342 (32%), Positives = 149/342 (43%), Gaps = 79/342 (23%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 66 PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 118
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F + HPV
Sbjct: 119 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEE 148
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W W++G +G +W HP G+++ + + PV
Sbjct: 149 LQW-------------------WKFG-KGA---------NWKHPWGLNNPKQPAPHEPVR 179
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+WNDA AY W G LP+E EWEY +G +N T G AN WQGEFP
Sbjct: 180 MVTWNDAYAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 235
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 236 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQI 295
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 296 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 337
>gi|402495314|ref|ZP_10842044.1| hypothetical protein AagaZ_13304 [Aquimarina agarilytica ZC1]
Length = 352
Score = 160 bits (406), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 82/189 (43%), Positives = 112/189 (59%), Gaps = 8/189 (4%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
+W HP+G +STIE + ++PVV VSW DA+AYC W G RLPTE E+EY R + +
Sbjct: 163 ANWKHPKGPESTIEGKDDYPVVQVSWYDAMAYCHWAGKRLPTEEEFEYVARNKGKRIKYS 222
Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
WGN+ R N QG FP+ N D + APV S+K+N+ G+Y++ GNVWEWT +
Sbjct: 223 WGNDFR-RAVEFVNFHQGSFPSQNLKEDNFEGLAPVKSFKQNELGVYDIGGNVWEWTLNS 281
Query: 280 WNVHHHPAPSYNPKGPTTGTD------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
+ + N + P D KV +GGS+LCNE YC +R AAR ++PD+
Sbjct: 282 YFSDGY-TRKLNFEKPLLANDQSIHQKKVIRGGSFLCNESYCSGYRVAARMNSSPDTGLE 340
Query: 334 NLGFRCAAD 342
+LGFRC D
Sbjct: 341 HLGFRCVMD 349
>gi|404423688|ref|ZP_11005320.1| hypothetical protein MFORT_24422 [Mycobacterium fortuitum subsp.
fortuitum DSM 46621]
gi|403653443|gb|EJZ08424.1| hypothetical protein MFORT_24422 [Mycobacterium fortuitum subsp.
fortuitum DSM 46621]
Length = 294
Score = 160 bits (406), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 115/335 (34%), Positives = 154/335 (45%), Gaps = 56/335 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY--VT 71
D+V +P +G++ E P+R V+ F++ H+V+N + EFVSATGY V
Sbjct: 5 DLVWIPAQVTTLGSDA---HYPEEAPAREVSTAGFWIQPHQVTNAAYTEFVSATGYVTVA 61
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
E D F P E S V +R G + H + +W
Sbjct: 62 ERPVNPDEF---PGAPPENLVPGSMV---FRRTAG-----PVDLRHLNLWWAWT------ 104
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
P W HP G S+++ R HPVVH+++ DA AY
Sbjct: 105 --------------------------PGACWNHPRGPKSSLKGREQHPVVHIAFEDAQAY 138
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWE RGG+ + WG+ G AN W GEFP GY +
Sbjct: 139 ADWAGLALPTEAEWETAARGGIAGAAYTWGDEPEKPGRQLANYWHGEFP--YLPDTGYGT 196
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP-----SYNPKGPTTGTD-KVKKG 305
T PV S++ N +GL++M GNVWEWT DW+ P SY+P P KV KG
Sbjct: 197 TKPVGSFEPNGYGLFDMAGNVWEWTTDWYGEDRATTPCCAADSYDPNQPQFQIGRKVIKG 256
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GS+LC + YC R+R AAR D+ ++GFRC
Sbjct: 257 GSFLCADSYCMRYRPAARRPQMVDTGMSHIGFRCV 291
>gi|445457906|ref|ZP_21446821.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC047]
gi|444776086|gb|ELX00138.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC047]
Length = 322
Score = 160 bits (405), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 116/338 (34%), Positives = 150/338 (44%), Gaps = 71/338 (21%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPS--RNVTLDAFYLDQHEVSNTQFQEFVS 65
P + MV LP +GT + + E P + ++ AF +D EV+N QFQEFV
Sbjct: 48 PKNKEAGMVKLPQGKIVLGTTQGY---EDEHPLNLQATSVPAFLIDATEVTNAQFQEFVK 104
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
TGYVT+AEK G +F V+ D HPV + W
Sbjct: 105 QTGYVTDAEKQGGAAMF--------------VQPD----------------HPVEELQW- 133
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
WR +E G +W HP G ++ + + PV V+W
Sbjct: 134 -------WR---------FEKGA------------NWKHPWGPNNPKQPAPHEPVRMVTW 165
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
NDA AY W G LPTE EWEY +G +N T G AN WQGEFP N
Sbjct: 166 NDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYKNVQ 221
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKV 302
DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 222 EDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGEPSKYRQSHEQILQYT 281
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 282 IKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319
>gi|421652338|ref|ZP_16092697.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC0162]
gi|408505464|gb|EKK07185.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC0162]
Length = 322
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/342 (33%), Positives = 149/342 (43%), Gaps = 79/342 (23%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 48 PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F V+ D HPV
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMF--------------VQPD----------------HPVEE 130
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W WR +E G +W HP G ++ + + PV
Sbjct: 131 LQW--------WR---------FEKGA------------NWKHPWGPNNPKQPAPHEPVR 161
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+WNDA AY W G LPTE EWEY +G +N T G AN WQGEFP
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGEPSKYRQSHEQI 277
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319
>gi|229820583|ref|YP_002882109.1| hypothetical protein Bcav_2094 [Beutenbergia cavernae DSM 12333]
gi|229566496|gb|ACQ80347.1| protein of unknown function DUF323 [Beutenbergia cavernae DSM
12333]
Length = 306
Score = 160 bits (405), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 122/339 (35%), Positives = 161/339 (47%), Gaps = 59/339 (17%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
+V + G F MG++ + E P+ V +D F + +V+N +F FV T Y T AE
Sbjct: 13 LVRIEGGEFTMGSD---VHYPEEAPAHRVAVDGFEISPTQVTNREFTRFVEDTRYRTVAE 69
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ D F P E A S V F ++ R H+S + W
Sbjct: 70 RPLDPAAF-PGAPAENLAPGSMV------FTPTAGPVDLR------HLS-----QWWAWT 111
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P SW P G S++ R++HPVVHV++ DA AY W
Sbjct: 112 -----------------------PGASWRRPLGPGSSVRKRLDHPVVHVAYEDAEAYAAW 148
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTEAEWE RGGL+ F W + GE RAN W G+FP A GY +TAP
Sbjct: 149 AGLRLPTEAEWELAARGGLDGAEFTWSDEPEAAGERRANYWHGDFPWR--AEPGYGATAP 206
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHH-------------HPAPSYNPKGPTTGTDK 301
V SY N +GL++M GNVWEWT+DW+ H A S + P T +
Sbjct: 207 VGSYPANAYGLHDMAGNVWEWTSDWYRAHDAGGCCIPRNPRGGDAASSVDAAQPFTVPRR 266
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
V KGGS+LC + YC R+R AAR D+ ++GFRCA
Sbjct: 267 VVKGGSFLCADSYCRRYRPAARRPQQVDTGMSHVGFRCA 305
>gi|407261319|ref|XP_003946222.1| PREDICTED: sulfatase-modifying factor 2 [Mus musculus]
Length = 349
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 99/251 (39%), Positives = 127/251 (50%), Gaps = 56/251 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV LPG F MGT+ P +DGE P+R VT+ F +D V+N F+EFV Y TEAE
Sbjct: 38 MVHLPGGRFLMGTDAPD-GRDGEGPAREVTVKPFAIDIFPVTNKDFREFVREKKYQTEAE 96
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S E R + E+ M P VH W V WR
Sbjct: 97 AFGWSFVFEDFVSPELRKQ------------------ENLM--PAVH--WWQPVPKAFWR 134
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P G S I ++ PVVHVSWNDA AYC W
Sbjct: 135 -----------------------------QPAGPGSGIREKLELPVVHVSWNDAGAYCAW 165
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
RG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQG+FP + A DG+ +P
Sbjct: 166 RGRRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQGKFPKGDKAEDGFHGLSP 222
Query: 255 VMSY-KENKFG 264
V ++ +N +G
Sbjct: 223 VNAFPPQNNYG 233
>gi|46205251|ref|ZP_00209771.1| COG1262: Uncharacterized conserved protein [Magnetospirillum
magnetotacticum MS-1]
Length = 271
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/274 (39%), Positives = 137/274 (50%), Gaps = 45/274 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMV LPG TFRMG+++ E P V +D F++D V+N QF+ FV ATGYVT A
Sbjct: 19 DMVFLPGGTFRMGSDR---HYPEEAPVHRVRVDGFWIDLTPVTNAQFRAFVRATGYVTMA 75
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E+ R + R S + PV W+ W
Sbjct: 76 ER---------------RPDPKDYPGALPRMLQAGSLMFKPPKGPVDLRDWS-----AWW 115
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R +++G W P G S+I HPVVHV++ DA AY
Sbjct: 116 R---------FKFGVH------------WRRPYGSGSSIAGLDEHPVVHVAFADAEAYAA 154
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G LPTEAEWEY RGGL+ F WG+ LTP G H AN WQG FP N A DG+ T+
Sbjct: 155 WAGKELPTEAEWEYAARGGLDGAEFAWGDELTPGGRHMANTWQGAFPHQNLAEDGFERTS 214
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA 287
PV ++ N +GL++M GNVWEWT D++ V HPA
Sbjct: 215 PVTAFPPNGYGLFDMTGNVWEWTTDFY-VPKHPA 247
>gi|424060549|ref|ZP_17798040.1| hypothetical protein W9K_01663 [Acinetobacter baumannii Ab33333]
gi|404668501|gb|EKB36410.1| hypothetical protein W9K_01663 [Acinetobacter baumannii Ab33333]
Length = 322
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 113/344 (32%), Positives = 151/344 (43%), Gaps = 83/344 (24%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 48 PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F + HPV
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEE 130
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W WR +E G +W HP G+++ + + PV
Sbjct: 131 LQW--------WR---------FEKGA------------NWKHPWGLNNPKQPAPHEPVR 161
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTP--RGEHRANVWQGEF 239
V+WNDA AY W G LP+E EWEY +G +N +++ P G AN WQGEF
Sbjct: 162 MVTWNDAYAYANWLGHDLPSELEWEYAAKGFQQN------SDIGPTYEGHIVANFWQGEF 215
Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPT 296
P N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 216 PYKNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHE 275
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 276 QILQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319
>gi|392384161|ref|YP_005033357.1| sulfatase-modifying factor 1 (fragment), partial [Azospirillum
brasilense Sp245]
gi|356880876|emb|CCD01844.1| sulfatase-modifying factor 1 (fragment) [Azospirillum brasilense
Sp245]
Length = 211
Score = 159 bits (403), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/184 (46%), Positives = 107/184 (58%), Gaps = 22/184 (11%)
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
HV++ DA AY W G LPTEAEWE+ RGGL+ ++ WG+ P G H AN WQG+FP
Sbjct: 27 FHVAYEDAEAYARWAGKTLPTEAEWEFAARGGLDGTIYCWGDEFMPDGRHMANTWQGQFP 86
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA--------PSYNP 292
N +DG+ TAPV SY N +GL++M GNVWEWT DW++ HPA PS NP
Sbjct: 87 WQNQESDGFERTAPVGSYPANGYGLFDMAGNVWEWTCDWYSA-RHPADAGKPCCIPS-NP 144
Query: 293 KGPTTGTD------------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+G + KV KGGS+LC YC R+R AAR DS+ ++GFRC
Sbjct: 145 RGAPRESSLDPAQPNIRIPRKVVKGGSFLCAPSYCERYRPAARQPQMIDSAMSHIGFRCI 204
Query: 341 ADKG 344
KG
Sbjct: 205 IRKG 208
>gi|421654353|ref|ZP_16094683.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-72]
gi|421679130|ref|ZP_16119009.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC111]
gi|408511120|gb|EKK12774.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-72]
gi|410391622|gb|EKP43989.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC111]
Length = 268
Score = 159 bits (401), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 111/335 (33%), Positives = 147/335 (43%), Gaps = 79/335 (23%)
Query: 15 MVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
MV LP +GT +P+ ++ P AF +D EV+N QFQEFV TG
Sbjct: 1 MVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQEFVKQTG 53
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
YVT+AEK G +F + HPV + W
Sbjct: 54 YVTDAEKQGGAAMFV------------------------------QPEHPVEELQW---- 79
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
W++G +G +W HP G+++ + + PV V+WNDA
Sbjct: 80 ---------------WKFG-KGA---------NWKHPWGLNNPKQPAPHEPVRMVTWNDA 114
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LP+E EWEY +G +N T G AN WQGEFP N DG
Sbjct: 115 YAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYKNVQEDG 170
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKKG 305
+ APV + +N FGLY+++GNVWEWT + HH PS + KG
Sbjct: 171 FKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQILQYTIKG 230
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GSYLC YC R+R AAR D + ++GFR
Sbjct: 231 GSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 265
>gi|403673645|ref|ZP_10935936.1| hypothetical protein ANCT1_03131 [Acinetobacter sp. NCTC 10304]
Length = 322
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/340 (32%), Positives = 148/340 (43%), Gaps = 79/340 (23%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 48 PNNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F + HPV
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPEHPVEE 130
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W W+ +E G +W HP G+++ + + PV
Sbjct: 131 LQW--------WK---------FEKGA------------NWKHPWGLNNPKQPAPHEPVR 161
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+WNDA AY W G LP+E EWEY +G +N T G AN WQGEFP
Sbjct: 162 MVTWNDAYAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTDEGHIVANFWQGEFPY 217
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQI 277
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 317
>gi|445486947|ref|ZP_21457568.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii AA-014]
gi|444769174|gb|ELW93371.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii AA-014]
Length = 268
Score = 157 bits (398), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 112/335 (33%), Positives = 146/335 (43%), Gaps = 79/335 (23%)
Query: 15 MVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
MV LP +GT +P+ ++ P AF +D EV+N QFQEFV TG
Sbjct: 1 MVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQEFVKQTG 53
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
YVT+AEK G +F + HPV + W
Sbjct: 54 YVTDAEKQGGAAMFV------------------------------QPEHPVEELKW---- 79
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
WR +E G +W HP G+++ + + PV V+WNDA
Sbjct: 80 ----WR---------FEKGA------------NWKHPWGLNNPKQPAPHEPVRMVTWNDA 114
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LP+E EWEY +G +N T G AN WQGEFP N DG
Sbjct: 115 YAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYKNVQEDG 170
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKKG 305
+ APV + +N FGLY+++GNVWEWT + HH PS + KG
Sbjct: 171 FKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQILQYTIKG 230
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GSYLC YC R+R AAR D + ++GFR
Sbjct: 231 GSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 265
>gi|417546502|ref|ZP_12197588.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC032]
gi|421667451|ref|ZP_16107521.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC087]
gi|421669663|ref|ZP_16109682.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC099]
gi|400384390|gb|EJP43068.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC032]
gi|410384726|gb|EKP37232.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC087]
gi|410388048|gb|EKP40488.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC099]
Length = 268
Score = 157 bits (397), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 110/335 (32%), Positives = 147/335 (43%), Gaps = 79/335 (23%)
Query: 15 MVLLPGDTFRMGT------NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
MV LP +GT +P+ ++ P AF +D EV+N QFQEFV TG
Sbjct: 1 MVKLPQGKIVLGTPQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQEFVKQTG 53
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
Y+T+AEK G +F + HPV + W
Sbjct: 54 YITDAEKQGGAAMFV------------------------------QPEHPVEELQW---- 79
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
W+ +E G +W HP G++++ + + PV V+WNDA
Sbjct: 80 ----WK---------FEKGA------------NWKHPWGLNNSKQPAPHEPVRMVTWNDA 114
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
AY W G LP+E EWEY +G +N T G AN WQGEFP N DG
Sbjct: 115 YAYANWLGHDLPSELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPYKNVQEDG 170
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKKG 305
+ APV + +N FGLY+++GNVWEWT + HH PS + KG
Sbjct: 171 FKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQILQYTIKG 230
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GSYLC YC R+R AAR D + ++GFR
Sbjct: 231 GSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 265
>gi|340785240|ref|YP_004750705.1| sulfatase modifying factor 1 (C-alpha-formyglycine- generating
enzyme 1) [Collimonas fungivorans Ter331]
gi|340550507|gb|AEK59882.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1) [Collimonas fungivorans Ter331]
Length = 344
Score = 157 bits (397), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 102/299 (34%), Positives = 135/299 (45%), Gaps = 68/299 (22%)
Query: 48 FYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGL 107
F++DQ +V+N QF FV+ATGYV++AE+ G VF EE A+
Sbjct: 100 FWIDQTDVTNAQFSSFVAATGYVSDAERQGGGAVFHKPTLEELNAR-------------- 145
Query: 108 DSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
++W VA +WR HP G
Sbjct: 146 -------------DLAWWSWVAGASWR-----------------------------HPAG 163
Query: 168 IDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR 227
S++E N PV V+ DA+AY W G LP+EAEWEY + G + + PR
Sbjct: 164 PGSSLEGIGNQPVTMVTQADALAYARWLGRDLPSEAEWEYAGKAGHDGAEL----DTAPR 219
Query: 228 GEH---RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
H AN WQG FP N DG++ APV Y N F LY+MVGN WEWT D ++ H
Sbjct: 220 DHHGRPTANYWQGVFPLVNNNEDGHIGLAPVGCYAANDFKLYDMVGNAWEWTKDIYSGSH 279
Query: 285 H-----PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ P T V KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 280 QVHSNGDTAAVAPASRRHDTPMVIKGGSFLCSRDYCVRYRASSREQQEADLPASHIGFR 338
>gi|424741299|ref|ZP_18169658.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii WC-141]
gi|422944924|gb|EKU39897.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii WC-141]
Length = 322
Score = 157 bits (396), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 113/340 (33%), Positives = 147/340 (43%), Gaps = 79/340 (23%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 48 PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F + +HPV
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMFV------------------------------QPNHPVEE 130
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W W+ +E G +W HP G ++ + + PV
Sbjct: 131 LQW--------WK---------FEKGA------------NWKHPWGPNNPKQPAPHEPVR 161
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+WNDA AY W G LPTE EWEY +G +N T G AN WQGEFP
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 218 KNIKEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGDPSKYRQSHEQI 277
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGSYLC YC R+R AAR D + + GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHAGFR 317
>gi|323456570|gb|EGB12437.1| hypothetical protein AURANDRAFT_3711, partial [Aureococcus
anophagefferens]
Length = 210
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 92/248 (37%), Positives = 120/248 (48%), Gaps = 54/248 (21%)
Query: 32 LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEER 91
L +DG + + + AF +D VSN QF+ FV T Y+TEAE F +FV E L +
Sbjct: 13 LSRDGAKAASHRRVPAFRIDPTAVSNDQFKRFVRDTQYLTEAENFRWSFVLE-LTASNAT 71
Query: 92 AKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGG 151
S ++ + R + + W G G
Sbjct: 72 INQSDAKNGLGRVQA-----------------------------------SPWWMGVFGA 96
Query: 152 LENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRG 211
W PEG DS++ R +HP H+SWNDA AYC W G RLPTE EWE RG
Sbjct: 97 Y---------WRKPEGPDSSLRGRGDHPATHISWNDASAYCKWAGRRLPTEVEWEMAARG 147
Query: 212 GLENRLFPWGNNLTPRGEH----RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYN 267
GLE+ FPWG+ +H R N W+GEFP N+A DG++ PV +Y N FG+YN
Sbjct: 148 GLEDEPFPWGD-----ADHDPWTRLNAWEGEFPDENSARDGFVGPGPVDAYAPNAFGIYN 202
Query: 268 MVGNVWEW 275
+GNVWEW
Sbjct: 203 ALGNVWEW 210
>gi|425747363|ref|ZP_18865371.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii WC-348]
gi|425493937|gb|EKU60159.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii WC-348]
Length = 322
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 115/342 (33%), Positives = 148/342 (43%), Gaps = 79/342 (23%)
Query: 8 PVERYKDMVLLPGDTFRMGTN------KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
P + MV LP +GT +P+ ++ P AF +D EV+N QFQ
Sbjct: 48 PKNKEAGMVKLPQGKIVLGTTQGYEDERPLNLQATSVP-------AFLIDATEVTNAQFQ 100
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
EFV TGYVT+AEK G +F V+ D HPV
Sbjct: 101 EFVKQTGYVTDAEKQGGAAMF--------------VQPD----------------HPVEE 130
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
+ W WR +E G +W HP G ++ + + V
Sbjct: 131 LQW--------WR---------FEKGA------------NWKHPWGPNNPKQPAPHELVR 161
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+WNDA AY W G LPTE EWEY +G +N T G AN WQGEFP
Sbjct: 162 MVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI----GPTHEGHIVANFWQGEFPY 217
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTG 298
N DG+ APV + +N FGLY+++GNVWEWT + HH PS +
Sbjct: 218 KNVQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTGPRDHHMGEPSKYRQSHEQI 277
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 278 LQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFRTV 319
>gi|167527321|ref|XP_001747993.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163773742|gb|EDQ87380.1| predicted protein [Monosiga brevicollis MX1]
Length = 360
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/314 (35%), Positives = 150/314 (47%), Gaps = 68/314 (21%)
Query: 34 KDGEFPSRNVTLDAFY---LDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEE 90
+DGE P R ++ L + V+N+ F+ FV AT Y TEAE++G +FVF L S+
Sbjct: 98 RDGEAPPRKASVRPVSCHELSTNTVNNSAFRAFVRATKYTTEAEQYGWSFVFHLLASDSV 157
Query: 91 RAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRG 150
RA +V +K E W W LP
Sbjct: 158 RA---EVTETIKGSE------------------W--------W----LPV---------- 174
Query: 151 GLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCR 210
P W P G S+I+ R ++PV +SWNDA A+C WRG RLPTEAEWE
Sbjct: 175 -------PRAYWRMPFGKGSSIKGRNDYPVTQISWNDARAFCKWRGMRLPTEAEWELAAG 227
Query: 211 GGLENRLFPWGNNLTPR-GEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNM 268
G +++ +PWG PR G NVWQG+FP NT ADG+L APV +Y + G YN+
Sbjct: 228 GSKQSKRYPWG----PRAGLDHMNVWQGKFPDVNTKADGHLGLAPVDAYGPQTDTGFYNL 283
Query: 269 VGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQN 326
+GN WEW + PS + V KGGSY+ + Q + R R N
Sbjct: 284 LGNGWEWVDQVYEGQPGEPPSQEKR-------VVLKGGSYVDSVDGQTNHPVRVTTRMGN 336
Query: 327 TPDSSAGNLGFRCA 340
T DS++ N+ FRCA
Sbjct: 337 TVDSASDNMSFRCA 350
>gi|444410026|ref|ZP_21206582.1| cysteine-type sulfatase aerobic maturase domain protein
[Streptococcus pneumoniae PNI0076]
gi|444412934|ref|ZP_21209253.1| cysteine-type sulfatase aerobic maturase domain protein
[Streptococcus pneumoniae PNI0153]
gi|444421844|ref|ZP_21217512.1| cysteine-type sulfatase aerobic maturase domain protein
[Streptococcus pneumoniae PNI0446]
gi|444274173|gb|ELU79828.1| cysteine-type sulfatase aerobic maturase domain protein
[Streptococcus pneumoniae PNI0153]
gi|444278414|gb|ELU83866.1| cysteine-type sulfatase aerobic maturase domain protein
[Streptococcus pneumoniae PNI0076]
gi|444288796|gb|ELU93685.1| cysteine-type sulfatase aerobic maturase domain protein
[Streptococcus pneumoniae PNI0446]
Length = 172
Score = 156 bits (394), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 78/172 (45%), Positives = 105/172 (61%), Gaps = 10/172 (5%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
+HPVVHV+ DA+A+C W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQ
Sbjct: 5 DHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQ 63
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----- 291
G FP NT +DG++ TAPV + N FGLY M+GNVWEW N + S+N
Sbjct: 64 GRFPYENTVSDGFIGTAPVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYE 120
Query: 292 -PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
PK + +GGS+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 121 LPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 172
>gi|119628382|gb|EAX07977.1| sulfatase modifying factor 2, isoform CRA_c [Homo sapiens]
Length = 197
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 99/263 (37%), Positives = 129/263 (49%), Gaps = 68/263 (25%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 1 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q MK G S I R+ HPV+H+SWNDA AYC WR
Sbjct: 60 MFGWSFVFEDFVSDELRNKATQ---PMKP-AGPGSGIRERLEHPVLHVSWNDARAYCAWR 115
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G RLPTE EWE+ RGGL+ +++PWG+W P +R N W
Sbjct: 116 GKRLPTEEEWEFAARGGLKGQVYPWGNWFQP--------NRTN---------------LW 152
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G P+G+ + + G P N A
Sbjct: 153 QGK---------------------------FPKGDKAEDGFHGVSPVNAFPA-------- 177
Query: 255 VMSYKENKFGLYNMVGNVWEWTA 277
+N +GLY+++GNVWEWTA
Sbjct: 178 -----QNNYGLYDLLGNVWEWTA 195
>gi|218678281|ref|ZP_03526178.1| hypothetical protein RetlC8_05157 [Rhizobium etli CIAT 894]
Length = 249
Score = 155 bits (392), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/240 (37%), Positives = 122/240 (50%), Gaps = 41/240 (17%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P+ +D F++D V+N F FVSATGYVT AEK D + P E A S
Sbjct: 8 EAPAHRERVDGFWMDGFTVTNLDFGRFVSATGYVTLAEKPADPANY-PGAKPEMLAPSSI 66
Query: 97 VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
+ F+ + ++ R H+ W RG
Sbjct: 67 M------FKKPSAPVDMRDHY-------------------------NWWVYVRGA----- 90
Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
+W HP G S+I +HPVVHV+++D AY W G +LP+EAEWE+ RGGL+
Sbjct: 91 ----NWRHPRGSASSIRKLGDHPVVHVAYDDVAAYAKWAGKQLPSEAEWEFAARGGLDGA 146
Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
+ WG+ P G+H AN WQG+FP N+ ADG+ TAPV S+ N +GLY+M NVW+WT
Sbjct: 147 EYVWGDEFNPGGKHMANTWQGDFPYRNSLADGFEYTAPVGSFPANGYGLYDMARNVWQWT 206
>gi|299770976|ref|YP_003733002.1| hypothetical protein AOLE_13710 [Acinetobacter oleivorans DR1]
gi|298701064|gb|ADI91629.1| hypothetical protein AOLE_13710 [Acinetobacter oleivorans DR1]
Length = 317
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 104/299 (34%), Positives = 134/299 (44%), Gaps = 66/299 (22%)
Query: 43 VTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMK 102
++ AF +D EV+N QFQEFV TGYVT+AEK G +F
Sbjct: 77 TSVPAFLIDATEVTNAQFQEFVKQTGYVTDAEKQGGAAMFA------------------- 117
Query: 103 RFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSW 162
+ HPV + W W+ +E G +W
Sbjct: 118 -----------QPEHPVEELQW--------WK---------FEKGA------------NW 137
Query: 163 LHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGN 222
HP G ++ + + PV V+WNDA AY W G LPTE EWEY +G +N
Sbjct: 138 KHPWGPNNPKQPAPHEPVRMVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI---- 193
Query: 223 NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN- 281
T G AN WQGEFP N DG+ APV + +N FGLY+++GNVWEWT +
Sbjct: 194 GPTHEGHIVANFWQGEFPYKNIQEDGFKDVAPVGCFSKNPFGLYDLIGNVWEWTQTPYTG 253
Query: 282 --VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
HH PS + KGGSYLC YC R+R AAR D + ++GFR
Sbjct: 254 PRDHHMGDPSKYRQSHEQILQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVGFR 312
>gi|170697260|ref|ZP_02888354.1| protein of unknown function DUF323 [Burkholderia ambifaria
IOP40-10]
gi|170137880|gb|EDT06114.1| protein of unknown function DUF323 [Burkholderia ambifaria
IOP40-10]
Length = 349
Score = 154 bits (390), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 112/336 (33%), Positives = 146/336 (43%), Gaps = 77/336 (22%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG + F++DQ +V+ QF FV ATGYV
Sbjct: 72 MVHLSGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVKATGYV 126
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
TEAE+ G VF +E A HD+ WN
Sbjct: 127 TEAEQQGGAAVFHVPARDELNA------HDLAW--------------------WN----- 155
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W G SW P G S+++ N PV V+ DA+A
Sbjct: 156 -------------WVKGA------------SWRQPRGPGSSVDGLGNLPVTLVTQRDALA 190
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
Y W G LPTEAEWEY + G ++ + PR G+ AN WQG FP +TA D
Sbjct: 191 YAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDTAED 246
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
G+ APV Y N F LY+M+GN WEWT D + H + P T V
Sbjct: 247 GHAGLAPVGCYAANGFHLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|408791375|ref|ZP_11202985.1| cysteine-type sulfatase aerobic maturase family protein [Leptospira
meyeri serovar Hardjo str. Went 5]
gi|408462785|gb|EKJ86510.1| cysteine-type sulfatase aerobic maturase family protein [Leptospira
meyeri serovar Hardjo str. Went 5]
Length = 327
Score = 154 bits (390), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 106/326 (32%), Positives = 150/326 (46%), Gaps = 74/326 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F G+N + E P N T+ F++D+ EV+N +F +FV TGY TEAE
Sbjct: 51 MVLVPSGNFEKGSN----VYPEESPIYNTTVSEFWMDETEVTNDEFAKFVWETGYQTEAE 106
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
L E + + F+ + + +++ H+ W
Sbjct: 107 T--------NLKPNSEEVNLEHYQPGAVVFQ--KPSNQGKINS---HLEW---------- 143
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
W Y P +W HP+G S+IE + ++PVV VS+ DA +Y W
Sbjct: 144 ---------WSY----------VPGTNWRHPDGPYSSIEGKGSYPVVAVSYQDANSYAKW 184
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G LPTE EWE+ G AN WQGEFP + DG+ AP
Sbjct: 185 KGRTLPTEVEWEWAAANG---------------SREEANTWQGEFPFLDEGKDGFTGIAP 229
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK--VKKGGSYLCNE 312
V + +NKFGLY+M+GNVWE+TAD W P T K + KGGSYLC+
Sbjct: 230 VGCFAKNKFGLYDMIGNVWEYTADPW-----------PVDQTLDKSKYHIIKGGSYLCSP 278
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFR 338
YC R+R A+ ++ ++GFR
Sbjct: 279 NYCKRYRAQAKQPQEDRLASNHIGFR 304
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 6/74 (8%)
Query: 320 CAARSQNTPDSSAGNLGFRCAADKGPTTGT-DKVK----KGGSYLCNEQYCYRHRCAARS 374
C A+++ GN+ + AD P T DK K KGGSYLC+ YC R+R A+
Sbjct: 232 CFAKNKFGLYDMIGNV-WEYTADPWPVDQTLDKSKYHIIKGGSYLCSPNYCKRYRAQAKQ 290
Query: 375 QNTPDSSAGNLGFR 388
++ ++GFR
Sbjct: 291 PQEDRLASNHIGFR 304
>gi|363581085|ref|ZP_09313895.1| hypothetical protein FbacHQ_06220 [Flavobacteriaceae bacterium
HQM9]
Length = 353
Score = 154 bits (388), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 80/187 (42%), Positives = 107/187 (57%), Gaps = 8/187 (4%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
+W HP+G STI + ++PVV VSW DA+AYC W RLPTE E+EY R + +
Sbjct: 164 ANWRHPQGPKSTIIGKDSYPVVQVSWYDAIAYCHWANKRLPTEEEFEYAARNQGKQIKYS 223
Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
WGN+ E+ AN QG FP N D + APV S+K N G+Y++ GNVWEWT +
Sbjct: 224 WGNDFNKAVEY-ANFHQGNFPLGNLKIDNFEGLAPVKSFKNNALGIYDIGGNVWEWTLNS 282
Query: 280 WNVHHHPAPSYNPKGPTTGTD------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
+ + + N + P D KV +GGS+LCNE YC +R AAR +PD+
Sbjct: 283 YFSDAY-SKKINFEQPILANDNTLNQKKVIRGGSFLCNESYCSGYRVAARMNASPDTGLE 341
Query: 334 NLGFRCA 340
+LGFRC
Sbjct: 342 HLGFRCV 348
>gi|444414754|ref|ZP_21211005.1| cysteine-type sulfatase aerobic maturase domain protein
[Streptococcus pneumoniae PNI0199]
gi|444281206|gb|ELU86535.1| cysteine-type sulfatase aerobic maturase domain protein
[Streptococcus pneumoniae PNI0199]
Length = 172
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 104/172 (60%), Gaps = 10/172 (5%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
+HPVVHV+ DA+A+C W G LPTEA+WEY RGG ++ +PWG+ L G + AN WQ
Sbjct: 5 DHPVVHVALEDALAFCNWSGMSLPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQ 63
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN----- 291
G FP NT +DG++ TAPV + N FGLY M+GN WEW N + S+N
Sbjct: 64 GRFPYENTVSDGFIGTAPVYEFLPNDFGLYQMIGNDWEWCR---NPRYTLLASFNEDDYE 120
Query: 292 -PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
PK + +GGS+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 121 LPKYGIQDEEYAIRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 172
>gi|167590166|ref|ZP_02382554.1| hypothetical protein BuboB_32820 [Burkholderia ubonensis Bu]
Length = 347
Score = 152 bits (384), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 114/339 (33%), Positives = 152/339 (44%), Gaps = 69/339 (20%)
Query: 15 MVLLPGDTFRMGTNK--PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G F G+ + P G+ +R + F++DQ +V+ QF FV ATGYVT+
Sbjct: 70 MVHLRGGAFVFGSTRGYPDERPAGDGRTR---VGGFWIDQTDVTIAQFAAFVQATGYVTD 126
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
A ER + V H R E ++ D +A+ +
Sbjct: 127 A----------------ERQGGAAVFHVPTRDE----------------MNARD-LAWWS 153
Query: 133 W-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W RGA SW HP G S+++ R N PV V+ DA+AY
Sbjct: 154 WVRGA------------------------SWWHPAGPGSSVDGRGNLPVTLVTQRDALAY 189
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G LPTEAEWEY + G ++ RG+ AN WQG FP +TA DG+
Sbjct: 190 ARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPAANYWQGVFPVLDTAEDGHAG 248
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKVKKGG 306
APV Y N F LY+M+GN WEWT D + H P T V KGG
Sbjct: 249 LAPVGCYAANGFRLYDMIGNAWEWTKDVYTGPHQSHTNGDTAVVAPPARRHDTPMVIKGG 308
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
S+LC+ YC R+R ++R Q D +A ++GFR P
Sbjct: 309 SFLCSRDYCVRYRASSREQQEADLAASHIGFRTILRDAP 347
>gi|408674864|ref|YP_006874612.1| Sulphatase-modifying factor protein [Emticicia oligotrophica DSM
17448]
gi|387856488|gb|AFK04585.1| Sulphatase-modifying factor protein [Emticicia oligotrophica DSM
17448]
Length = 313
Score = 152 bits (383), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 109/182 (59%), Gaps = 5/182 (2%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR-LF 218
+W +P+G D + NHPV VSW DAVAY W G RLP+E EWE+ R G +R L+
Sbjct: 135 ANWEYPQGKDQP-KAPDNHPVTQVSWYDAVAYAKWAGKRLPSEIEWEHAARNGTNDRTLY 193
Query: 219 PWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
P+GN++ G+ AN W G FP N DG+ T+PV + + K GL +M GNVW+W D
Sbjct: 194 PFGNDIEKNGKALANTWNGVFPDYNKNTDGFFYTSPVGYFGKTKLGLTDMTGNVWQW-CD 252
Query: 279 WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
W + + K PT +KV+KGGS+LC +C+ +R + RS +P++S ++GFR
Sbjct: 253 NWKISYEDIN--QGKMPTDTLEKVEKGGSFLCEPGWCHGYRVSGRSFTSPETSLMHVGFR 310
Query: 339 CA 340
C
Sbjct: 311 CV 312
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 56/154 (36%), Positives = 82/154 (53%), Gaps = 9/154 (5%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +PG ++G + E P V + F++D V+ QF+EFV AT YVT AE
Sbjct: 58 MVYVPGGYTQIGVTDGMSF---EKPMFWVEVKPFFMDISPVTVAQFREFVKATKYVTYAE 114
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+FGD F+ E + + + + +G D + +HPV +SW DAVAY W
Sbjct: 115 RFGDGGFFD----VEAKGWTLKKGANWEYPQGKDQP-KAPDNHPVTQVSWYDAVAYAKWA 169
Query: 135 GARLPTEAEWEYGCRGGLENR-LFPWGSWLHPEG 167
G RLP+E EWE+ R G +R L+P+G+ + G
Sbjct: 170 GKRLPSEIEWEHAARNGTNDRTLYPFGNDIEKNG 203
>gi|120436433|ref|YP_862119.1| hypothetical protein GFO_2087 [Gramella forsetii KT0803]
gi|117578583|emb|CAL67052.1| protein containing DUF323 [Gramella forsetii KT0803]
Length = 344
Score = 152 bits (383), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 77/188 (40%), Positives = 105/188 (55%), Gaps = 6/188 (3%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
+W +P G DSTI+ + +PVV VSW DA+AY W G RLPTEAE+EY R G + ++
Sbjct: 157 ANWKNPTGPDSTIDGKDTYPVVQVSWYDAMAYAKWAGKRLPTEAEFEYFNRAGNDTLIYH 216
Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
WGN+ E N +QG+FP N+ D + A V S+ N FGLY GNVWEW D
Sbjct: 217 WGNDFEKASEM-VNFFQGDFPRQNSTEDEFEKKASVKSFPANDFGLYETSGNVWEWCLDT 275
Query: 280 -----WNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
+++ Y + KV +GGS+LC+E YC +R +AR + PD+ +
Sbjct: 276 YYPDAYSIRDQQTNGYFKEFVNMQQQKVIRGGSFLCSESYCTGYRNSARMSSAPDTGLEH 335
Query: 335 LGFRCAAD 342
GFRC D
Sbjct: 336 TGFRCVKD 343
>gi|413961059|ref|ZP_11400288.1| hypothetical protein BURK_014113 [Burkholderia sp. SJ98]
gi|413931773|gb|EKS71059.1| hypothetical protein BURK_014113 [Burkholderia sp. SJ98]
Length = 333
Score = 151 bits (382), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 113/336 (33%), Positives = 145/336 (43%), Gaps = 66/336 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G F G+ L + E P + + AF++DQ +V+N QF FV ATGYVT
Sbjct: 58 MVHLRGGEFVFGST---LGYEDERPMGDGKTKVGAFWIDQTDVTNAQFAAFVDATGYVT- 113
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
E ER + V H + R ++W
Sbjct: 114 ---------------EAERQGGAVVFH-----------VPTREEMNARDLAW-------- 139
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W W G SW HP G ST + R N PV V+ DA+AY
Sbjct: 140 W---------SWVKGA------------SWKHPTGSSSTWKGRENRPVTMVTLADALAYA 178
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWEY + G E RG+ AN WQG FP ++A DG+
Sbjct: 179 HWLGRDLPTEAEWEYAGKAGREGAQLD-AAPRDARGKPSANYWQGVFPVLDSAEDGHAGL 237
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTG----TDKVKKGGSY 308
APV Y N F LY+M+GN WEWT D + P + + T V KGGS+
Sbjct: 238 APVGCYAANDFRLYDMIGNAWEWTKDPYTGPRQPHSNGDTAAVIPARKHDTAMVIKGGSF 297
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
LC + YC R+R ++R Q D A ++GFR G
Sbjct: 298 LCAQDYCVRYRASSREQQEADLPASHIGFRTVLRDG 333
>gi|295700051|ref|YP_003607944.1| hypothetical protein [Burkholderia sp. CCGE1002]
gi|295439264|gb|ADG18433.1| protein of unknown function DUF323 [Burkholderia sp. CCGE1002]
Length = 363
Score = 151 bits (381), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 113/343 (32%), Positives = 147/343 (42%), Gaps = 85/343 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G F G+ L + E P+ + + F++DQ +V+N QF FV ATGYV
Sbjct: 86 MVHLHGGDFVFGSK---LGYEDERPAGDGKTHVAGFWIDQTDVTNAQFAAFVRATGYV-- 140
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
S+ ER + V H
Sbjct: 141 --------------SDAERQGGAVVFH--------------------------------- 153
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWL------HPEGIDSTIEHRMNHPVVHVSWN 186
PT E + R W SW+ HP G S ++ RMN PV V+
Sbjct: 154 -----TPTPEE--------MNARDLAWWSWVKGAAWNHPGGPGSNLDGRMNQPVTMVTQA 200
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH---RANVWQGEFPTNN 243
DA+AY W G LPTEAEWEY + G E + PR H AN WQG FP N
Sbjct: 201 DALAYAHWLGRDLPTEAEWEYAGKAGHEGADL----DTAPRDAHGRPSANYWQGVFPVLN 256
Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTG 298
T+ DG++ +PV Y N F LY+M+GN WEWT D ++ H + P
Sbjct: 257 TSEDGHVGLSPVGCYAANDFRLYDMIGNAWEWTKDVYSGPHQSHTNGDTAAVAPLSRRHD 316
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
T V KGGS+LC YC R+R A+R Q D A ++GFR +
Sbjct: 317 TPMVIKGGSFLCARDYCVRYRAASREQQEADLPASHIGFRTVS 359
>gi|149927426|ref|ZP_01915681.1| hypothetical protein LMED105_01032 [Limnobacter sp. MED105]
gi|149823918|gb|EDM83143.1| hypothetical protein LMED105_01032 [Limnobacter sp. MED105]
Length = 273
Score = 149 bits (377), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/333 (33%), Positives = 144/333 (43%), Gaps = 65/333 (19%)
Query: 17 LLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKF 76
+L G FRMG+N+ E P R ++ F +D V N+ F EFV AT Y+T A
Sbjct: 1 MLSGGEFRMGSNRHY---PEERPERTESVAEFAIDVFPVRNSDFAEFVEATQYMTAA--- 54
Query: 77 GDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA 136
ER S V + L++ + W AV W
Sbjct: 55 -------------ERTGHSHVFQMTQGPVPLNNPDQ-----------WWKAVQGACWYNP 90
Query: 137 RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRG 196
R TE ++ HPVVHV+ DA AY W G
Sbjct: 91 RPGTELPGDFS-----------------------------EHPVVHVALEDAKAYAAWAG 121
Query: 197 ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
RLPTEAEWE+ RGG F WG+ P G+ A VWQG FP A G T V
Sbjct: 122 GRLPTEAEWEFAARGGPHYAQFAWGDEFAPNGQRMAQVWQGAFPW-YYAPGGLPGTVAVG 180
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----SYNPKGPTTGTDKVKKGGSYLCNE 312
+ N +GLY+M+GNVWEWT + P S +P+ + T KGGS+LC
Sbjct: 181 QFPANGYGLYDMIGNVWEWTQSTFAAPALDKPAGCCSCSPQ-QDSNTLFTLKGGSHLCAA 239
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+YC R+R AAR SS +LGFRCA + P
Sbjct: 240 EYCLRYRPAARIGIAGASSTSHLGFRCAYNTSP 272
>gi|347527447|ref|YP_004834194.1| hypothetical protein SLG_10620 [Sphingobium sp. SYK-6]
gi|345136128|dbj|BAK65737.1| conserved hypothetical protein [Sphingobium sp. SYK-6]
Length = 317
Score = 148 bits (374), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 82/197 (41%), Positives = 106/197 (53%), Gaps = 15/197 (7%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
R+ W HP+G S I+ R++ PVVHVS DA AY WRG RLP+E E+E +G L
Sbjct: 124 RIVQGADWRHPQGPGSRIDDRLDEPVVHVSQADARAYAHWRGHRLPSEEEYERAAQGRL- 182
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
G AN WQG FP + DG APV YK N +G ++++GNVWE
Sbjct: 183 ------GTLRDQPAPDSANSWQGPFPHEDLKLDGAAGIAPVGCYKANAYGAHDLIGNVWE 236
Query: 275 WTADWWNVHHHPAPS-----YNPKGPTTGTDK---VKKGGSYLCNEQYCYRHRCAARSQN 326
WT+ W+ H P PS NP DK V KGGS+LC YC R+R AAR
Sbjct: 237 WTSSWYTPGHGPEPSPLGAGSNPSADPRQPDKPVRVIKGGSFLCAPNYCARYRPAARHAQ 296
Query: 327 TPDSSAGNLGFRCAADK 343
D +A ++GFR A+++
Sbjct: 297 AEDETASHIGFRTASNR 313
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 58/144 (40%), Positives = 82/144 (56%), Gaps = 10/144 (6%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G R+G ++ E PS + F+LD+ EV+N QF FV+ATG+VT AE
Sbjct: 43 MVWVAGGETRIGDDEGYA---DERPSYTAPVKGFWLDRTEVTNRQFAAFVAATGHVTTAE 99
Query: 75 KFGDTFVFEP------LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+ GD+FVF P + ++ +I Q D + +G S I+ R+ PVVH+S DA
Sbjct: 100 RRGDSFVFVPPAAGTIATAPDQWWRIVQ-GADWRHPQGPGSRIDDRLDEPVVHVSQADAR 158
Query: 129 AYCTWRGARLPTEAEWEYGCRGGL 152
AY WRG RLP+E E+E +G L
Sbjct: 159 AYAHWRGHRLPSEEEYERAAQGRL 182
>gi|338209668|ref|YP_004653715.1| sulfatase-modifying factor protein [Runella slithyformis DSM 19594]
gi|336303481|gb|AEI46583.1| Sulphatase-modifying factor protein [Runella slithyformis DSM
19594]
Length = 366
Score = 147 bits (371), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 70/184 (38%), Positives = 107/184 (58%), Gaps = 3/184 (1%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR-LF 218
+W HP G D +HPV VSWNDA AY W G RLP E EWE+ R +R ++
Sbjct: 183 ANWHHPMGPDFPAAAD-DHPVTQVSWNDANAYAQWAGKRLPCELEWEHAARNARNSRSIY 241
Query: 219 PWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
PWGN++ +G++ AN+WQG+FP N DG+ T+PV + + GL +M GNVWEW ++
Sbjct: 242 PWGNDIQTKGQYNANIWQGKFPVKNANEDGFAYTSPVGKFGKTPLGLTDMSGNVWEWQSN 301
Query: 279 WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
++ + ++ ++GGS+LC +C+ +R + RS +T ++S ++GFR
Sbjct: 302 -LKFNYEALFRTGTRDGDLSAERAQRGGSFLCEPGWCHGYRVSGRSSSTSETSLFHVGFR 360
Query: 339 CAAD 342
C D
Sbjct: 361 CVKD 364
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 84/154 (54%), Gaps = 8/154 (5%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ + G ++G+++ + + E PS +++F++D H V+ +F++FV AT + TEAE
Sbjct: 105 MIYIQGGVVQIGSDEGL---EQERPSFWAKVNSFWMDTHPVTVGEFRKFVQATHFQTEAE 161
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
KFG+ + + +R I + + G D HPV +SWNDA AY W
Sbjct: 162 KFGNAGIID---ESTDRQWILKDGANWHHPMGPDFPAAAD-DHPVTQVSWNDANAYAQWA 217
Query: 135 GARLPTEAEWEYGCRGGLENR-LFPWGSWLHPEG 167
G RLP E EWE+ R +R ++PWG+ + +G
Sbjct: 218 GKRLPCELEWEHAARNARNSRSIYPWGNDIQTKG 251
>gi|255318498|ref|ZP_05359731.1| sulfatase-modifying factor 1 [Acinetobacter radioresistens SK82]
gi|262378727|ref|ZP_06071884.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
gi|255304490|gb|EET83674.1| sulfatase-modifying factor 1 [Acinetobacter radioresistens SK82]
gi|262300012|gb|EEY87924.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
Length = 327
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 108/338 (31%), Positives = 145/338 (42%), Gaps = 89/338 (26%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + G TF +G N+ + + S D F++D EV+N QFQ FV AT YVTEAE
Sbjct: 59 MVKIQGGTFNIGNNQSYREEKALYKSERTVTD-FWIDATEVTNAQFQSFVDATAYVTEAE 117
Query: 75 KFGDTFVF-EPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
K G+ VF EP PV ++W +
Sbjct: 118 KQGEAAVFTEP-------------------------------QKPVKELAW-----WSLM 141
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+GA +W P G +S + N PV ++ DA+AY
Sbjct: 142 KGA------------------------NWKQPWGPNSHRKIEPNQPVRMITLKDAMAYAD 177
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLST 252
W L TE +WEY +G F +++ +H ANVWQGEFP +N DGY
Sbjct: 178 WLDRDLATEEQWEYAAKG------FSHERDVSADLKHVDANVWQGEFPYHNENKDGYRDV 231
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWN------------VHHHPAPSYNPKGPTTGTD 300
APV + N FGLY+M+GNVWE+T + HH P++N
Sbjct: 232 APVGCFAANGFGLYDMIGNVWEYTKSPFTGTHDDHMGMNQLEHHQDNPAFNF-------- 283
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC YC R+R AAR D ++GFR
Sbjct: 284 YTIKGGSFLCASNYCMRYRAAARHSQEIDLGISHVGFR 321
>gi|372209873|ref|ZP_09497675.1| sulfatase-modifying factor protein [Flavobacteriaceae bacterium
S85]
Length = 306
Score = 147 bits (370), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 116/206 (56%), Gaps = 13/206 (6%)
Query: 140 TEAEWEYGCRGGLENRLFPWG-----SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
TEAE +G G + R W +W +P G + + + NHPV HVSWNDA AY TW
Sbjct: 105 TEAE-NFGDSGIFDLRGIRWNLVKGTTWEYPFGPEGD-KAQNNHPVTHVSWNDANAYATW 162
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
G RLPTE EWEY + G E + +PWGN++ G + AN W G+ + DGYL T+P
Sbjct: 163 AGKRLPTEIEWEYAAKNGKELK-YPWGNSVKMNGNYMANTWDGQTIQDQNVKDGYLFTSP 221
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQY 314
V ++ ++K G+ +MVGNVW+WTA + + P + V +GGS++ ++
Sbjct: 222 VGAFPKSKSGINDMVGNVWQWTATVFEPYDKTIP-FQKNAQVV----VTRGGSFMFDQAL 276
Query: 315 CYRHRCAARSQNTPDSSAGNLGFRCA 340
+ R+QNT +SS N GFRCA
Sbjct: 277 ELSYTTTFRAQNTVESSLFNTGFRCA 302
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 88/164 (53%), Gaps = 27/164 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+DMV G +G++ I E PS + FYLD + V+ F++FV AT Y+TE
Sbjct: 50 QDMVFFKGGEITIGSDDRTEI---EAPSFTTIVAPFYLDINLVTVADFRKFVEATNYITE 106
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH---------RMHHPVVHIS 123
AE FGD+ +F+ + +R ++ + +T E+ + +HPV H+S
Sbjct: 107 AENFGDSGIFD----------LRGIRWNLVK----GTTWEYPFGPEGDKAQNNHPVTHVS 152
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
WNDA AY TW G RLPTE EWEY + G E + +PWG+ + G
Sbjct: 153 WNDANAYATWAGKRLPTEIEWEYAAKNGKELK-YPWGNSVKMNG 195
>gi|171319832|ref|ZP_02908915.1| protein of unknown function DUF323 [Burkholderia ambifaria MEX-5]
gi|171094927|gb|EDT39956.1| protein of unknown function DUF323 [Burkholderia ambifaria MEX-5]
Length = 349
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/337 (32%), Positives = 148/337 (43%), Gaps = 79/337 (23%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG + F++DQ +V+ QF FV ATGYV
Sbjct: 72 MVHLSGGAFVFGSTRGYADERPVGDGR-----TRIGGFWIDQTDVTIAQFAAFVKATGYV 126
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
T E ER + V H R E ++ D +A+
Sbjct: 127 T----------------EAERQGGAAVFHVPARDE----------------LNARD-LAW 153
Query: 131 CTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+W +GA SW P G S+++ N PV V+ DA+
Sbjct: 154 WSWVKGA------------------------SWRQPRGPGSSVDGLDNLPVTLVTQRDAL 189
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAA 246
AY W G LPTEAEWEY + G ++ + PR G+ AN WQG FP +TA
Sbjct: 190 AYAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDTAE 245
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
DG+ APV Y N F LY+M+GN WEWT D + H + P T
Sbjct: 246 DGHAGLAPVGCYAANGFHLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPM 305
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 306 VIKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|254254946|ref|ZP_04948263.1| hypothetical protein BDAG_04268 [Burkholderia dolosa AUO158]
gi|124899591|gb|EAY71434.1| hypothetical protein BDAG_04268 [Burkholderia dolosa AUO158]
Length = 349
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/334 (32%), Positives = 148/334 (44%), Gaps = 73/334 (21%)
Query: 15 MVLLPGDTFRMGTNK--PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G F G+ + P G+ +R + F++DQ +V+ QF FV ATGYVT+
Sbjct: 72 MVHLRGGAFVFGSTRGYPDERPAGDGRTR---VGGFWIDQTDVTIAQFAAFVHATGYVTD 128
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
A ER + V H R E ++ D +A+ T
Sbjct: 129 A----------------ERQGGAAVFHVPTRDE----------------LNLRD-LAWWT 155
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
W SW HP G S+++ N PV V+ DA+AY
Sbjct: 156 WVDG-----------------------ASWRHPRGRGSSVDGLGNLPVTLVTQRDALAYA 192
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH---RANVWQGEFPTNNTAADGY 249
W G LPTEAEWEY + G ++ + PR H AN WQG FP +TA DG+
Sbjct: 193 RWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAHGKPAANYWQGAFPVLDTAEDGH 248
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKVKK 304
APV Y N F LY+M+GN WEWT D + H + P T V K
Sbjct: 249 AGLAPVGCYAANGFRLYDMIGNAWEWTKDPYTGPHQSHTNGDTAAVAPPTHAHDTPMVIK 308
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
GGS+LC+ YC R+R ++R + D A ++GFR
Sbjct: 309 GGSFLCSRDYCVRYRASSRERQEADLGASHIGFR 342
>gi|330820618|ref|YP_004349480.1| hypothetical protein bgla_2g15220 [Burkholderia gladioli BSR3]
gi|327372613|gb|AEA63968.1| hypothetical protein bgla_2g15220 [Burkholderia gladioli BSR3]
Length = 349
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 101/302 (33%), Positives = 136/302 (45%), Gaps = 68/302 (22%)
Query: 45 LDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRF 104
+ F++DQ +V+ QF FV ATGY+TEA ER + V H
Sbjct: 101 VGGFWIDQTDVTVAQFDSFVRATGYLTEA----------------ERQGGAAVFH----- 139
Query: 105 EGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLH 164
+ R ++W W W G SW H
Sbjct: 140 ------VPSREELAARDLAW--------W---------SWVKGA------------SWRH 164
Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLF---PWG 221
P G S+++ R N PV V+ +DA+AY W G LPTEAEWEY + G + P G
Sbjct: 165 PNGPGSSVDGRGNLPVTLVTLHDALAYARWLGRDLPTEAEWEYAGKAGRDGADLDAAPVG 224
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
++ G+ AN WQG FP +TA DG+ APV Y N F LY+M+GN WEWT D ++
Sbjct: 225 SD----GKPAANYWQGVFPVIDTAEDGHAGLAPVGCYAANDFRLYDMIGNAWEWTRDAYS 280
Query: 282 VHHHP-----APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
H + P T V KGGS+LC YC R+R ++R D +A ++G
Sbjct: 281 GPHQSHTNGDTAAVAPASRPHDTPMVIKGGSFLCARDYCVRYRASSRESQEADLAASHIG 340
Query: 337 FR 338
FR
Sbjct: 341 FR 342
>gi|297285361|ref|XP_002802766.1| PREDICTED: sulfatase-modifying factor 1-like [Macaca mulatta]
Length = 242
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 66/99 (66%), Positives = 79/99 (79%), Gaps = 2/99 (2%)
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
GYL+ V ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGS
Sbjct: 144 GYLTE--VDAFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGS 201
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
Y+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 202 YMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 240
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 36/72 (50%), Positives = 52/72 (72%), Gaps = 3/72 (4%)
Query: 8 PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSN +F++FV
Sbjct: 81 PGERPLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNAEFEKFV 140
Query: 65 SATGYVTEAEKF 76
++TGY+TE + F
Sbjct: 141 NSTGYLTEVDAF 152
>gi|298708177|emb|CBJ30517.1| Sulfatase-modifying factor 2 precursor [Ectocarpus siliculosus]
Length = 297
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 137/321 (42%), Gaps = 57/321 (17%)
Query: 35 DGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKI 94
DG P V + F +D+H V+N QF FV +GYVTEAEKFG +F E +L E I
Sbjct: 17 DGGLPHTQVAVGQFNIDEHCVTNEQFAIFVKDSGYVTEAEKFGWSFALEYMLDAE---TI 73
Query: 95 SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
+ + R +G + + G+E
Sbjct: 74 KEADEGIGRVKGSEHWV---------------------------------------GVEG 94
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
+W PEG S+++ R N PV VSW DA AYC+W G RLPTEAEWEY RGG
Sbjct: 95 -----ANWRQPEGPRSSLKSRGNVPVAQVSWVDADAYCSWAGRRLPTEAEWEYAARGGQT 149
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
G+ T + N WQG FP N+ DGY AP ++ N +Y+M+GN WE
Sbjct: 150 LASEHVGDKST-KMWLVGNSWQGHFPNTNSVEDGYAGLAPATAFAPNALEVYDMLGNAWE 208
Query: 275 WTADWWNVHHHPAPSYNPKGPTTGTDKV----KKGGSYLCNEQYCYRHRCAARSQNTPDS 330
W A P G D V G S + + + R + T DS
Sbjct: 209 WVAG-----GDPTRRTLRGGSFVDFDPVAAEKNNGSSGIPRQPANHAITPGTRMETTQDS 263
Query: 331 SAGNLGFRCAADKGPTTGTDK 351
+ N FRCA + GT +
Sbjct: 264 GSANTSFRCAQSPLKSGGTSR 284
>gi|416911903|ref|ZP_11931660.1| hypothetical protein B1M_06815 [Burkholderia sp. TJI49]
gi|325528173|gb|EGD05359.1| hypothetical protein B1M_06815 [Burkholderia sp. TJI49]
Length = 347
Score = 145 bits (367), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 150/335 (44%), Gaps = 75/335 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G F G+ + E PS + + F++DQ +V+ QF FV ATGY
Sbjct: 70 MVHLAGGAFVFGSTRGYA---DERPSGDGRTRVGGFWIDQTDVTVAQFAAFVQATGY--- 123
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
++E E+ + V H R E ++ D +A+ +
Sbjct: 124 -------------VTEAEQQGGAAVFHVPTRDE----------------LNARD-LAWWS 153
Query: 133 W-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
W +GA SW HP G STI+ N PV V+ DA+AY
Sbjct: 154 WVKGA------------------------SWRHPRGPGSTIDGHGNLPVTLVTQRDALAY 189
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAADG 248
W G LPTEAEWEY + G ++ + PR G AN WQG FP +TA DG
Sbjct: 190 AHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGRPAANYWQGPFPVLDTAEDG 245
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPSYNPKGPTT---GTDKVK 303
+ APV Y N F LY+M+GN WEWT D + H H P T T V
Sbjct: 246 HAGLAPVGCYAANGFRLYDMIGNAWEWTRDVYTGPHQSHTNGDTAAVAPATRRHDTPMVI 305
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 306 KGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 340
>gi|194379762|dbj|BAG58233.1| unnamed protein product [Homo sapiens]
Length = 242
Score = 145 bits (366), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 66/99 (66%), Positives = 79/99 (79%), Gaps = 2/99 (2%)
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
GYL+ A ++ N +GLYN+VGN WEWT+DWW VHH + NPKGP +G D+VKKGGS
Sbjct: 144 GYLTEAD--AFPPNGYGLYNIVGNAWEWTSDWWTVHHSVEETLNPKGPPSGKDRVKKGGS 201
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
Y+C+ YCYR+RCAARSQNTPDSSA NLGFRCAAD+ PT
Sbjct: 202 YMCHRSYCYRYRCAARSQNTPDSSASNLGFRCAADRLPT 240
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 40/76 (52%), Positives = 56/76 (73%), Gaps = 4/76 (5%)
Query: 5 PAP-PVER---YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQF 60
P P P ER + MV +P F MGT+ P + +DGE P+R VT+DAFY+D +EVSNT+F
Sbjct: 77 PGPVPGERQLAHSKMVPIPAGVFTMGTDDPQIKQDGEAPARRVTIDAFYMDAYEVSNTEF 136
Query: 61 QEFVSATGYVTEAEKF 76
++FV++TGY+TEA+ F
Sbjct: 137 EKFVNSTGYLTEADAF 152
>gi|392964375|ref|ZP_10329796.1| sulfatase modifying factor 1 [Fibrisoma limi BUZ 3]
gi|387847270|emb|CCH51840.1| sulfatase modifying factor 1 [Fibrisoma limi BUZ 3]
Length = 320
Score = 145 bits (366), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 86/234 (36%), Positives = 123/234 (52%), Gaps = 17/234 (7%)
Query: 116 HHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW-----GSWLHPEGIDS 170
HPV + V ++ TEAE ++G L+ W W HP+G +
Sbjct: 96 EHPVTVAEFRRFVTATKYK-----TEAE-KFGDGAVLDETTRQWTLRKGAKWSHPQGPEQ 149
Query: 171 TIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL-FPWGNNLT-PRG 228
NHPV VSWNDA AY W G RLP E EWE+ R +R +P+GN L G
Sbjct: 150 PPAPD-NHPVTQVSWNDAQAYARWAGKRLPYEFEWEHAARNATNSRTRYPFGNELINADG 208
Query: 229 EHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP 288
++ AN W G FP ++ DG+ T+PV + + GL +M GNVW+W DW + A
Sbjct: 209 KYLANTWNGTFPESDAVLDGFHLTSPVGQFGKTPLGLTDMTGNVWQWCEDW---RQNYAD 265
Query: 289 SYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+ PT T+KV++GGS+LC +C+ +R + RS +P++S ++GFRC D
Sbjct: 266 VMAGQPPTNPTEKVQRGGSFLCEPGWCHGYRVSGRSFTSPETSLMHVGFRCVKD 319
>gi|170736356|ref|YP_001777616.1| hypothetical protein Bcenmc03_3975 [Burkholderia cenocepacia MC0-3]
gi|169818544|gb|ACA93126.1| protein of unknown function DUF323 [Burkholderia cenocepacia MC0-3]
Length = 349
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 109/336 (32%), Positives = 145/336 (43%), Gaps = 77/336 (22%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG+ + F++DQ +V+ QF FV ATGYV
Sbjct: 72 MVHLRGGAFVFGSTRGYADERPVGDGK-----TRVGGFWIDQTDVTIAQFAAFVQATGYV 126
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
+E E+ + V H + R ++W
Sbjct: 127 ----------------TEAEQQGGAAVFH-----------VPTRDEMNARDLAW------ 153
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W W G SW HP G S +E N PV V+ DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSDVEGLGNLPVTLVTQRDALA 190
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
Y W G LPTEAEWEY + G ++ + PR G+ AN WQG FP ++A D
Sbjct: 191 YARWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDSAED 246
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH--HPAPSYNPKGPTT---GTDKV 302
G+ APV Y N F LY+M+GN WEWT D + H H P T T V
Sbjct: 247 GHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPLTRRHDTPMV 306
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|254249091|ref|ZP_04942411.1| hypothetical protein BCPG_03951 [Burkholderia cenocepacia PC184]
gi|124875592|gb|EAY65582.1| hypothetical protein BCPG_03951 [Burkholderia cenocepacia PC184]
Length = 349
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 107/336 (31%), Positives = 144/336 (42%), Gaps = 77/336 (22%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG+ + F++DQ +V+ QF FV ATGYV
Sbjct: 72 MVHLRGGAFVFGSTRGYADERPVGDGK-----TRVGGFWIDQTDVTIAQFAAFVQATGYV 126
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
+E E+ + V H + R ++W
Sbjct: 127 ----------------TEAEQQGGAAVFH-----------VPTRDEMNARDLAW------ 153
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W W G SW HP G S +E N PV V+ DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSDVEGLGNLPVTLVTQRDALA 190
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
Y W G LPTEAEWEY + G ++ + PR G+ AN WQG FP ++A D
Sbjct: 191 YARWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDSAED 246
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
G+ APV Y N F LY+M+GN WEWT D + H + P T V
Sbjct: 247 GHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|304321886|ref|YP_003855529.1| hypothetical protein PB2503_11714 [Parvularcula bermudensis
HTCC2503]
gi|303300788|gb|ADM10387.1| hypothetical protein PB2503_11714 [Parvularcula bermudensis
HTCC2503]
Length = 308
Score = 144 bits (364), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 86/189 (45%), Positives = 105/189 (55%), Gaps = 14/189 (7%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
RL P SW P G ST++ HPVVH+S DA AY W G RLPTE EWEY GL
Sbjct: 121 RLDPEASWRTPGGEGSTLDGLALHPVVHISLADAHAYAEWVGGRLPTEVEWEYAASLGLF 180
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
+ P P GE RANVW G FP N+A DG++ TAPV Y+ + G Y+M+GNVWE
Sbjct: 181 DPNDPDSGMRGPNGEARANVWTGIFPLINSAEDGFIGTAPVGCYRPGRIGAYDMIGNVWE 240
Query: 275 WTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
WT P + P+ KGGS+LC++ YC R+R AAR PD S +
Sbjct: 241 WT-------DSPFAAGTPR-------FTIKGGSFLCSKNYCRRYRAAAREAFEPDFSTAH 286
Query: 335 LGFRCAADK 343
LGFR D+
Sbjct: 287 LGFRIVRDR 295
>gi|344256595|gb|EGW12699.1| Sulfatase-modifying factor 2 [Cricetulus griseus]
Length = 227
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 90/241 (37%), Positives = 117/241 (48%), Gaps = 59/241 (24%)
Query: 1 MVLLPAPPVE----RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVS 56
M LL +P ++ + MV LPG F MGT+ P +DGE P+R VT+ +F +D V+
Sbjct: 11 MCLLSSPRLQLGNAQAPTMVQLPGGRFLMGTDAPD-GRDGEGPAREVTVKSFTIDVFPVT 69
Query: 57 NTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
N F+EFV Y TEAE FG +FVFE +S E R + +Q+
Sbjct: 70 NKDFREFVREKKYQTEAETFGWSFVFEDFVSPELRNQANQMPS----------------- 112
Query: 117 HPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRM 176
+ W V WR P G S I ++
Sbjct: 113 -----VLWWLPVQKAFWR-----------------------------QPAGPGSGIREKL 138
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
PVVHVSWNDA AYC WRG RLPTE EWE+ RGGL+ +++PWGN P +R N+WQ
Sbjct: 139 ELPVVHVSWNDASAYCAWRGKRLPTEEEWEFAARGGLKGQVYPWGNRFQP---NRTNLWQ 195
Query: 237 G 237
G
Sbjct: 196 G 196
>gi|134100962|ref|YP_001106623.1| serine/threonine protein kinase [Saccharopolyspora erythraea NRRL
2338]
gi|291003423|ref|ZP_06561396.1| putative serine/threonine protein kinase [Saccharopolyspora
erythraea NRRL 2338]
gi|133913585|emb|CAM03698.1| putative serine/threonine protein kinase [Saccharopolyspora
erythraea NRRL 2338]
Length = 223
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 65/103 (63%), Positives = 72/103 (69%)
Query: 180 VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEF 239
+V VSWNDA AYC W G RLPTE EWEY RGG R +PWG+ L P G HR N+WQG F
Sbjct: 1 MVQVSWNDAAAYCDWAGLRLPTEVEWEYAARGGPVRRRYPWGDELAPGGGHRCNIWQGHF 60
Query: 240 PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
PT NTA DG+ TAP +Y+ N FGLYNM GN WEW AD W V
Sbjct: 61 PTRNTAEDGFRGTAPADAYRPNGFGLYNMAGNFWEWCADRWTV 103
>gi|78063430|ref|YP_373338.1| hypothetical protein Bcep18194_B2583 [Burkholderia sp. 383]
gi|77971315|gb|ABB12694.1| protein of unknown function DUF323 [Burkholderia sp. 383]
Length = 349
Score = 143 bits (361), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 107/336 (31%), Positives = 143/336 (42%), Gaps = 77/336 (22%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG + F++DQ +V+ QF FV ATGYV
Sbjct: 72 MVHLRGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVQATGYV 126
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
+E E+ + V H + R ++W
Sbjct: 127 ----------------TEAEQQDGAAVFH-----------VPTRDEMNARDLAW------ 153
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W W G SW HP G S ++ N PV V+ DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSGVDGLGNLPVTLVTQRDALA 190
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
Y W G LPTEAEWEY + G ++ + PR G+ AN WQG FP +TA D
Sbjct: 191 YARWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGVFPVLDTAED 246
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
G+ APV Y N F LY+M+GN WEWT D + H + P T V
Sbjct: 247 GHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|296137978|ref|YP_003645221.1| hypothetical protein [Tsukamurella paurometabola DSM 20162]
gi|296026112|gb|ADG76882.1| protein of unknown function DUF323 [Tsukamurella paurometabola DSM
20162]
Length = 351
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/193 (41%), Positives = 110/193 (56%), Gaps = 16/193 (8%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 221
W +P G ST E+R +HPVVH+S+ DA+AYC W LPTEA+WE RG + +PWG
Sbjct: 160 WRNPFGAGSTAENRPDHPVVHISYFDALAYCAWANRTLPTEAQWECAARGTILGETYPWG 219
Query: 222 NNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
GE + N+++G FP T G T PV S N G+Y+ VGNVWEWTAD +
Sbjct: 220 GQPPIDGEPLKVNIFRGAFPDQPTTPVG---TMPVTSLPPNDIGIYHSVGNVWEWTADAY 276
Query: 281 ---------NVHHHPA-PSYNPKGP--TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTP 328
PA + +P+ P T D + +GGSYLC++ YC R+R +AR+ TP
Sbjct: 277 VPGLYAQRAATTLGPAGTAVDPRSPGDTDRDDFMLRGGSYLCHDSYCNRYRNSARTHATP 336
Query: 329 DSSAGNLGFRCAA 341
+S ++GFR A
Sbjct: 337 HTSTSHIGFRTTA 349
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 72/141 (51%), Gaps = 14/141 (9%)
Query: 32 LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEER 91
+ D E P+ V + A+ +D V+N QF FV ATGY T A++ G + VF ++
Sbjct: 81 IAADEEGPAHPVGVSAYSIDATTVTNAQFARFVDATGYRTTAQQRGSSTVFYSQVAPS-- 138
Query: 92 AKISQVRHDM------------KRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP 139
A + V + G ST E+R HPVVHIS+ DA+AYC W LP
Sbjct: 139 ATVLDVPDPQAWWWRTVEDAYWRNPFGAGSTAENRPDHPVVHISYFDALAYCAWANRTLP 198
Query: 140 TEAEWEYGCRGGLENRLFPWG 160
TEA+WE RG + +PWG
Sbjct: 199 TEAQWECAARGTILGETYPWG 219
>gi|116691643|ref|YP_837176.1| hypothetical protein Bcen2424_3544 [Burkholderia cenocepacia
HI2424]
gi|116649643|gb|ABK10283.1| protein of unknown function DUF323 [Burkholderia cenocepacia
HI2424]
Length = 349
Score = 142 bits (359), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 108/337 (32%), Positives = 149/337 (44%), Gaps = 79/337 (23%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG+ + F++DQ +V+ QF FV ATGY
Sbjct: 72 MVHLRGGAFVFGSTRGYADERPVGDGK-----TRVGGFWIDQTDVTIAQFAAFVQATGY- 125
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
++E E+ + V H R E ++ D +A+
Sbjct: 126 ---------------VTEAEQQGGAAVFHVPTRDE----------------LNARD-LAW 153
Query: 131 CTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+W +GA SW HP G S +E N PV V+ DA+
Sbjct: 154 WSWVKGA------------------------SWRHPHGPGSDVEGLGNLPVTLVTQRDAL 189
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAA 246
AY W G LPTEAEWEY + G ++ + PR G+ AN WQG FP ++A
Sbjct: 190 AYARWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDSAE 245
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
DG+ APV Y N F LY+M+GN WEWT D + H + P T
Sbjct: 246 DGHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPM 305
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 306 VIKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|441500083|ref|ZP_20982253.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1) [Fulvivirga imtechensis AK7]
gi|441436174|gb|ELR69548.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1) [Fulvivirga imtechensis AK7]
Length = 290
Score = 142 bits (358), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 83/218 (38%), Positives = 120/218 (55%), Gaps = 23/218 (10%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE-NRLFP 219
+W P G + + NHP+ HVSWNDA+AY +W G RLP E EWE+ R G N +
Sbjct: 79 TWQQPLGPNGA-PAKDNHPITHVSWNDAMAYASWAGKRLPHEYEWEFAARNGKNTNDKYS 137
Query: 220 WGNNLTP-RGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD 278
WGN+LT G +ANVWQG FP NT AD + T+PV ++ G+ +M GNVWEW ++
Sbjct: 138 WGNSLTDNNGVFKANVWQGHFPDTNTVADDFAHTSPVGAFGITSCGMSDMGGNVWEWCSN 197
Query: 279 WWNVH---HHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
+ ++ H+P K P KV +GGS++C+ C+ +R +AR + ++S ++
Sbjct: 198 IFELYPGNHYPY----QKDPNV---KVIRGGSFMCDSLVCHGYRVSARQFTSRETSNFHM 250
Query: 336 GFRCAAD----------KGPTTGTDKVKKGGSYLCNEQ 363
GFRCA D + P T SYL N++
Sbjct: 251 GFRCAMDVDRRTSNPKHQTPNQNTTIKTTYKSYLSNDK 288
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 53/152 (34%), Positives = 78/152 (51%), Gaps = 13/152 (8%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV G +G+ + E P+ + F++D+H V+ QF+ FV TGY T+AE
Sbjct: 1 MVYFAGGDITIGSEHGLT---NEAPTFEAEIAPFFIDKHPVTVAQFRAFVQTTGYKTDAE 57
Query: 75 KFGDT--FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
KFG++ F ++ L+ E + Q D+ HP+ H+SWNDA+AY +
Sbjct: 58 KFGNSALFNYKKLIYELVDSVTWQQPLGPNGAPAKDN-------HPITHVSWNDAMAYAS 110
Query: 133 WRGARLPTEAEWEYGCRGGLE-NRLFPWGSWL 163
W G RLP E EWE+ R G N + WG+ L
Sbjct: 111 WAGKRLPHEYEWEFAARNGKNTNDKYSWGNSL 142
Score = 44.7 bits (104), Expect = 0.082, Method: Compositional matrix adjust.
Identities = 18/43 (41%), Positives = 30/43 (69%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
KV +GGS++C+ C+ +R +AR + ++S ++GFRCA DV
Sbjct: 216 KVIRGGSFMCDSLVCHGYRVSARQFTSRETSNFHMGFRCAMDV 258
>gi|161521664|ref|YP_001585091.1| hypothetical protein Bmul_5129 [Burkholderia multivorans ATCC
17616]
gi|189352169|ref|YP_001947796.1| hypothetical protein BMULJ_03387 [Burkholderia multivorans ATCC
17616]
gi|160345714|gb|ABX18799.1| protein of unknown function DUF323 [Burkholderia multivorans ATCC
17616]
gi|189336191|dbj|BAG45260.1| conserved hypothetical protein [Burkholderia multivorans ATCC
17616]
Length = 347
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/217 (39%), Positives = 109/217 (50%), Gaps = 19/217 (8%)
Query: 140 TEAEWEYGC-------RGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
TEAE + G R L R W SW HP G S +E R N PV V+
Sbjct: 125 TEAEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVEGRGNLPVTLVTQR 184
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA+AY W G LPTEAEWEY + G ++ RG+ AN WQG FP +TA
Sbjct: 185 DALAYARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPAANYWQGPFPVLDTAE 243
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
DG+ APV Y N F LY+M+GN W+WT D + H + P T
Sbjct: 244 DGHAGLAPVGCYAANGFRLYDMIGNAWQWTKDVYTGPHQSHTNGDTAAVAPAARRHDTPM 303
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC+ YC R+R ++R + D +A ++GFR
Sbjct: 304 VIKGGSFLCSRDYCVRYRASSRERQEADLAASHIGFR 340
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 77/147 (52%), Gaps = 10/147 (6%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G TF G+ + E P+ + + F++DQ +V+ QF FV ATGYVTE
Sbjct: 70 MVHLRGGTFVFGSTRGY---QDERPAADGRTRVGGFWIDQTDVTVAQFAAFVQATGYVTE 126
Query: 73 AEKFGDTFVFEPLLSEEERAKI----SQVRHDMKRF-EGLDSTIEHRMHHPVVHISWNDA 127
AE+ G VF +E A+ S V+ R G S +E R + PV ++ DA
Sbjct: 127 AEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVEGRGNLPVTLVTQRDA 186
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLEN 154
+AY W G LPTEAEWEY + G ++
Sbjct: 187 LAYARWLGRDLPTEAEWEYAGKAGRDD 213
>gi|221197358|ref|ZP_03570405.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
gi|221204031|ref|ZP_03577049.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
gi|421471366|ref|ZP_15919662.1| formylglycine-generating sulfatase enzyme [Burkholderia multivorans
ATCC BAA-247]
gi|221176197|gb|EEE08626.1| conserved hypothetical protein [Burkholderia multivorans CGD2]
gi|221183912|gb|EEE16312.1| conserved hypothetical protein [Burkholderia multivorans CGD2M]
gi|400225650|gb|EJO55794.1| formylglycine-generating sulfatase enzyme [Burkholderia multivorans
ATCC BAA-247]
Length = 347
Score = 141 bits (356), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 85/218 (38%), Positives = 109/218 (50%), Gaps = 19/218 (8%)
Query: 140 TEAEWEYGC-------RGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
TEAE + G R L R W SW HP G S +E R N PV V+
Sbjct: 125 TEAEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVEGRGNLPVTLVTQR 184
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA+AY W G LPTEAEWEY + G ++ RG+ AN WQG FP +TA
Sbjct: 185 DALAYARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPAANYWQGPFPVLDTAE 243
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
DG+ APV Y N F LY+M+GN W+WT D + H + P T
Sbjct: 244 DGHAGLAPVGCYAANGFRLYDMIGNAWQWTKDAYTGPHQSHTNGDTAAVAPAARRHDTPM 303
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
V KGGS+LC+ YC R+R ++R + D +A ++GFR
Sbjct: 304 VIKGGSFLCSRDYCVRYRASSRERQEADLAASHIGFRT 341
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 77/147 (52%), Gaps = 10/147 (6%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G TF G+ + E P+ + + F++DQ +V+ QF FV ATGYVTE
Sbjct: 70 MVHLRGGTFVFGSTRGY---QDERPAGDGRTRVGGFWIDQTDVTVAQFAAFVQATGYVTE 126
Query: 73 AEKFGDTFVFEPLLSEEERAKI----SQVRHDMKRF-EGLDSTIEHRMHHPVVHISWNDA 127
AE+ G VF +E A+ S V+ R G S +E R + PV ++ DA
Sbjct: 127 AEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVEGRGNLPVTLVTQRDA 186
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLEN 154
+AY W G LPTEAEWEY + G ++
Sbjct: 187 LAYARWLGRDLPTEAEWEYAGKAGRDD 213
>gi|172062487|ref|YP_001810138.1| hypothetical protein BamMC406_3455 [Burkholderia ambifaria MC40-6]
gi|171995004|gb|ACB65922.1| protein of unknown function DUF323 [Burkholderia ambifaria MC40-6]
Length = 349
Score = 141 bits (356), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 148/337 (43%), Gaps = 79/337 (23%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG + F++DQ +V+ QF FV ATGY
Sbjct: 72 MVHLSGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVKATGY- 125
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
++E E+ + V H R E ++ D +A+
Sbjct: 126 ---------------VTEAEQQGGAAVFHVPTRDE----------------LNARD-LAW 153
Query: 131 CTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+W +GA SW P G S+++ N PV V+ DA+
Sbjct: 154 WSWVKGA------------------------SWRQPRGPGSSVDGHGNLPVTLVTQRDAL 189
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAA 246
AY W G LPTEAEWEY + G ++ + PR G+ AN WQG FP +TA
Sbjct: 190 AYAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDTAE 245
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
DG+ APV Y N F LY+M+GN WEWT D + H + P T
Sbjct: 246 DGHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPM 305
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 306 VIKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|103485795|ref|YP_615356.1| hypothetical protein Sala_0300 [Sphingopyxis alaskensis RB2256]
gi|98975872|gb|ABF52023.1| protein of unknown function DUF323 [Sphingopyxis alaskensis RB2256]
Length = 309
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 100/335 (29%), Positives = 145/335 (43%), Gaps = 73/335 (21%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
P P +D +PG TF +G + + E P R VT+ F++ HEV+N +F EFV
Sbjct: 33 PECPAMPAQDYAWIPGATFTLGADAQL---PEEGPPREVTVAGFWIATHEVTNAEFAEFV 89
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISW 124
ATGY T L+E++ + DM G +P +W
Sbjct: 90 RATGYKT--------------LAEQDPPPLPGAPPDM-LIPGSAVFTAPTDGNP----NW 130
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
WR W G W HP G + I+ R PVV +
Sbjct: 131 --------WR---------WVVGAE------------WRHPAGPATNIDGRGRDPVVQIG 161
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
+ DA+A+ W+G LP+E +WE G +R P G + G+ RAN +QG FP +
Sbjct: 162 YEDALAFAKWKGKTLPSEEQWELAAASGGADRNVPVGKD----GKPRANFYQGSFPLRDL 217
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDK-VK 303
DG+ APV + + G+++++GNVWEWT G+++ V
Sbjct: 218 GTDGFTGRAPVACFPADAHGVHDLIGNVWEWTTS-----------------AAGSERNVI 260
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC YC R+R AAR ++GFR
Sbjct: 261 KGGSFLCAANYCARYRPAARQFQERSLGTDHIGFR 295
>gi|393720429|ref|ZP_10340356.1| hypothetical protein SechA1_11798 [Sphingomonas echinoides ATCC
14820]
Length = 300
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 101/307 (32%), Positives = 132/307 (42%), Gaps = 66/307 (21%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P R VT+ F++D HE++N +F FV ATGY T AE+ P E R S
Sbjct: 55 EGPPRTVTVPGFWIDTHELTNAEFAAFVKATGYRTMAERKPPVL---PGAPPEMRVPGSA 111
Query: 97 VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
V G D W WR W G +
Sbjct: 112 V---FAAPSGDDP-------------RW--------WR---------WVVGAQ------- 131
Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
W HP G + +I + PVV +++ DA AY W G LP+EA+WEY RGG
Sbjct: 132 -----WRHPSGSNESIIGKDTTPVVQIAYEDAEAYARWAGKALPSEAQWEYAARGGQPAL 186
Query: 217 LFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT 276
P + TP AN +QG FP + DGY APV + N +GLY+M+GNVWEWT
Sbjct: 187 PEPIDAHGTPT----ANYYQGVFPARDLGIDGYRGRAPVGCFPANGYGLYDMIGNVWEWT 242
Query: 277 ADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+ + G +V KGGSYLC YC R+R AAR ++G
Sbjct: 243 SG--------------VAGSAGDRRVIKGGSYLCAANYCARYRPAARQFQELGLGTDHIG 288
Query: 337 FRCAADK 343
R ++
Sbjct: 289 VRLVDNR 295
>gi|115360020|ref|YP_777158.1| hypothetical protein Bamb_5277 [Burkholderia ambifaria AMMD]
gi|115285308|gb|ABI90824.1| protein of unknown function DUF323 [Burkholderia ambifaria AMMD]
Length = 349
Score = 141 bits (355), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 107/337 (31%), Positives = 148/337 (43%), Gaps = 79/337 (23%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG + F++DQ +V+ QF FV ATGY
Sbjct: 72 MVHLSGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVKATGY- 125
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
++E E+ + V H R E ++ D +A+
Sbjct: 126 ---------------VTEAEQQGGAAVFHVPTRDE----------------LNARD-LAW 153
Query: 131 CTW-RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+W +GA SW P G S+++ N PV V+ DA+
Sbjct: 154 WSWVKGA------------------------SWRQPRGPGSSVDGLGNLPVTLVTQRDAL 189
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAA 246
AY W G LPTEAEWEY + G ++ + PR G+ AN WQG FP +TA
Sbjct: 190 AYAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDTAE 245
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
DG+ APV Y N F LY+M+GN WEWT D + H + P T
Sbjct: 246 DGHAGLAPVGCYAANGFHLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPM 305
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 306 VIKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|385677485|ref|ZP_10051413.1| non-specific serine/threonine protein kinase [Amycolatopsis sp.
ATCC 39116]
Length = 136
Score = 140 bits (354), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 67/133 (50%), Positives = 86/133 (64%), Gaps = 14/133 (10%)
Query: 210 RGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMV 269
R GL+ +PWG+ LTP GEHR N+WQG FP +NT DG + TAPV ++ N FGLYN+
Sbjct: 17 RRGLDQARYPWGDELTPGGEHRCNIWQGRFPVHNTEEDGQVGTAPVDAFPPNGFGLYNVS 76
Query: 270 GNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
GNVWEW ADW++ H ++ +GGSYLC++ YC R+R AAR+ N PD
Sbjct: 77 GNVWEWCADWFDDAH--------------ANRAMRGGSYLCHDSYCNRYRVAARTANMPD 122
Query: 330 SSAGNLGFRCAAD 342
S+GNLGFR D
Sbjct: 123 GSSGNLGFRTVTD 135
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 24/45 (53%), Positives = 33/45 (73%)
Query: 349 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
++ +GGSYLC++ YC R+R AAR+ N PD S+GNLGFR D+
Sbjct: 92 ANRAMRGGSYLCHDSYCNRYRVAARTANMPDGSSGNLGFRTVTDI 136
>gi|221210763|ref|ZP_03583743.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
gi|221169719|gb|EEE02186.1| conserved hypothetical protein [Burkholderia multivorans CGD1]
Length = 347
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 84/218 (38%), Positives = 109/218 (50%), Gaps = 19/218 (8%)
Query: 140 TEAEWEYGC-------RGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
TEAE + G R L R W SW HP G S ++ R N PV V+
Sbjct: 125 TEAEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVDGRDNLPVTLVTQR 184
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA+AY W G LPTEAEWEY + G ++ RG+ AN WQG FP +TA
Sbjct: 185 DALAYARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPTANYWQGPFPVLDTAE 243
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
DG+ APV Y N F LY+M+GN W+WT D + H + P T
Sbjct: 244 DGHAGLAPVGCYAANGFRLYDMIGNAWQWTKDVYTGPHQSHTNGDTAAVAPAARRHDTPM 303
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
V KGGS+LC+ YC R+R ++R + D +A ++GFR
Sbjct: 304 VIKGGSFLCSRDYCVRYRASSRERQEADLAASHIGFRT 341
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 77/147 (52%), Gaps = 10/147 (6%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G TF G+ + E P+ + + F++DQ +V+ QF FV ATGYVTE
Sbjct: 70 MVHLRGGTFVFGSTRGY---QDERPAGDGRTRVGGFWIDQTDVTVAQFAAFVQATGYVTE 126
Query: 73 AEKFGDTFVFEPLLSEEERAKI----SQVRHDMKRF-EGLDSTIEHRMHHPVVHISWNDA 127
AE+ G VF +E A+ S V+ R G S ++ R + PV ++ DA
Sbjct: 127 AEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVDGRDNLPVTLVTQRDA 186
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLEN 154
+AY W G LPTEAEWEY + G ++
Sbjct: 187 LAYARWLGRDLPTEAEWEYAGKAGRDD 213
>gi|195103144|ref|XP_001998099.1| GH11729 [Drosophila grimshawi]
gi|193891453|gb|EDV90319.1| GH11729 [Drosophila grimshawi]
Length = 109
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 69/118 (58%), Positives = 83/118 (70%), Gaps = 12/118 (10%)
Query: 226 PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH 285
P+ +H N+WQGEFP NT DGY T PV +++N + LYNMVGNVWEWTAD W
Sbjct: 2 PKDQHWLNIWQGEFPDGNTQQDGYQFTCPVNEFRQNNYDLYNMVGNVWEWTADLWQA--- 58
Query: 286 PAPSYNPKGPTTGT-DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
G T+ + +VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 59 --------GDTSESPARVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKN 108
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/43 (79%), Positives = 39/43 (90%)
Query: 351 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+VKKGGSYLC++ YCYR+RCAARSQNT DSSAGNLGFRCA +
Sbjct: 67 RVKKGGSYLCHKSYCYRYRCAARSQNTEDSSAGNLGFRCAKNA 109
>gi|444429732|ref|ZP_21224914.1| hypothetical protein GS4_03_00830 [Gordonia soli NBRC 108243]
gi|443889393|dbj|GAC66635.1| hypothetical protein GS4_03_00830 [Gordonia soli NBRC 108243]
Length = 287
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 105/323 (32%), Positives = 140/323 (43%), Gaps = 62/323 (19%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P R V ++ F++D V+ +F FV TG+VT ++
Sbjct: 10 EQPVRVVEVEPFWIDHAPVTVGEFARFVHETGHVT----------------------TAE 47
Query: 97 VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
+ D F D + + V T L W R
Sbjct: 48 IAPDPAEFPDADPDL----------LVAGSLVFTPTSGPVPLDDHRRWW---------RF 88
Query: 157 FPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 216
W P G S +E R +HPV HVS +DA+AY W G RLP EAEWE+ RGG +
Sbjct: 89 VAGACWSRPSGPGSDVEGRADHPVTHVSHSDALAYAAWAGKRLPGEAEWEFAARGGCADA 148
Query: 217 LFPWGNNLTPRGEHR-------------ANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
G+ LT E A V+ G+FP +T G +TA + + ++
Sbjct: 149 Q---GDELTADAEFTWDIDGIDDLDDVPATVFVGDFPWRHTGRRGPGTTA-IYTNPPDRR 204
Query: 264 GLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP----TTGTDKVKKGGSYLCNEQYCYRHR 319
GL +M+GNVWEWTAD H P + P T + KGGS+LC+ QYC R+R
Sbjct: 205 GLVDMIGNVWEWTADAHTRDHRPPAGGSCCAPARPATLARHRTIKGGSHLCSPQYCLRYR 264
Query: 320 CAARSQNTPDSSAGNLGFRCAAD 342
AAR DSS G+LGFRC AD
Sbjct: 265 PAARQGQDEDSSTGHLGFRCVAD 287
>gi|399065902|ref|ZP_10748107.1| hypothetical protein PMI02_04531 [Novosphingobium sp. AP12]
gi|398029080|gb|EJL22576.1| hypothetical protein PMI02_04531 [Novosphingobium sp. AP12]
Length = 247
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 81/204 (39%), Positives = 106/204 (51%), Gaps = 28/204 (13%)
Query: 162 WLHPEGIDSTIEHR--MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
W HP G I+ +HPVV +S++DA AY W LPTEAE+E+ RGGLE +
Sbjct: 33 WRHPLGARDDIDSLGLWDHPVVQISYSDAEAYARWVNKDLPTEAEFEFAARGGLEGADYA 92
Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
WG+ L P AN QG FP N DG+ T+PV +Y N +GL++++GN WEWT DW
Sbjct: 93 WGDELAPGDAMMANYRQGLFPFANQMLDGWERTSPVGTYPANGYGLFDIIGNTWEWTRDW 152
Query: 280 WNVHHHPAP-----------------------SYNPKGPTTGTD-KVKKGGSYLCNEQYC 315
W+ PA S++P P KV K GS+LC YC
Sbjct: 153 WS--EKPAADRGGRKSRSCCSISNPRGGKLKESFDPAQPGVEIGRKVLKVGSHLCAANYC 210
Query: 316 YRHRCAARSQNTPDSSAGNLGFRC 339
R+R AAR D+S ++GFRC
Sbjct: 211 QRYRPAARHPEMIDTSTTHIGFRC 234
>gi|302522996|ref|ZP_07275338.1| sulfatase modifying factor 1 [Streptomyces sp. SPB78]
gi|302431891|gb|EFL03707.1| sulfatase modifying factor 1 [Streptomyces sp. SPB78]
Length = 109
Score = 140 bits (352), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 65/108 (60%), Positives = 77/108 (71%)
Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKG 294
WQG FPT NTA DGY TAP ++ N GL+N+ GNVWEW AD W H P+ +P+G
Sbjct: 1 WQGRFPTTNTAEDGYRGTAPAHAFPPNGHGLHNVAGNVWEWCADRWGTDHSARPASDPRG 60
Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
P GT +V +GGSYLC+ YC R+R AAR+ NTPDSSAGNLGFRCA D
Sbjct: 61 PRRGTSRVMRGGSYLCHHSYCNRYRVAARTSNTPDSSAGNLGFRCAWD 108
>gi|421865904|ref|ZP_16297578.1| Sulfatase modifying factor 1 precursor
(C-alpha-formyglycine-generating enzyme 1) [Burkholderia
cenocepacia H111]
gi|358074045|emb|CCE48456.1| Sulfatase modifying factor 1 precursor
(C-alpha-formyglycine-generating enzyme 1) [Burkholderia
cenocepacia H111]
Length = 349
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 142/336 (42%), Gaps = 77/336 (22%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG + F++DQ +V+ QF FV ATGY
Sbjct: 72 MVHLRGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVQATGY- 125
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
++E E+ + V H + R ++W
Sbjct: 126 ---------------VTEAEQQGGAAVFH-----------VPTRDEMNARDLAW------ 153
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W W G SW HP G S + N PV V+ DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSDVTGLGNLPVTLVTQRDALA 190
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
Y W G LPTEAEWEY + G ++ + PR G+ AN WQG FP ++A D
Sbjct: 191 YAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDAQGKPAANYWQGAFPVLDSAED 246
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
G+ APV Y N F LY+M+GN WEWT D + H + P T V
Sbjct: 247 GHAGLAPVGCYAANGFRLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|421479554|ref|ZP_15927240.1| formylglycine-generating sulfatase enzyme [Burkholderia multivorans
CF2]
gi|400222768|gb|EJO53124.1| formylglycine-generating sulfatase enzyme [Burkholderia multivorans
CF2]
Length = 347
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/218 (38%), Positives = 109/218 (50%), Gaps = 19/218 (8%)
Query: 140 TEAEWEYGC-------RGGLENRLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
TEAE + G R L R W SW HP G S ++ R N PV V+
Sbjct: 125 TEAEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVDGRGNLPVTLVTQR 184
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
DA+AY W G LPTEAEWEY + G ++ RG+ AN WQG FP +TA
Sbjct: 185 DALAYARWLGRDLPTEAEWEYAGKAGRDDASLD-AAPRDARGKPAANYWQGPFPVLDTAE 243
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDK 301
DG+ APV Y N F LY+M+GN W+WT D + H + P T
Sbjct: 244 DGHAGLAPVGCYAANGFRLYDMIGNAWQWTKDVYTGPHQSHTNGDTAAVAPAERRHDTPM 303
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
V KGGS+LC+ YC R+R ++R + D +A ++GFR
Sbjct: 304 VIKGGSFLCSRDYCVRYRASSRERQEADLAASHIGFRT 341
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 77/147 (52%), Gaps = 10/147 (6%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRN--VTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV + G TF G+ + E P+ + + F++DQ +V+ QF FV ATGYVTE
Sbjct: 70 MVHVRGGTFVFGSTRGY---QDERPAGDGRTRVGGFWIDQTDVTVAQFAAFVQATGYVTE 126
Query: 73 AEKFGDTFVFEPLLSEEERAKI----SQVRHDMKRF-EGLDSTIEHRMHHPVVHISWNDA 127
AE+ G VF +E A+ S V+ R G S ++ R + PV ++ DA
Sbjct: 127 AEQQGGAAVFHVPTRDELNARDLAWWSWVKGASWRHPRGPGSDVDGRGNLPVTLVTQRDA 186
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLEN 154
+AY W G LPTEAEWEY + G ++
Sbjct: 187 LAYARWLGRDLPTEAEWEYAGKAGRDD 213
>gi|206562431|ref|YP_002233194.1| hypothetical protein BCAM0570 [Burkholderia cenocepacia J2315]
gi|444359023|ref|ZP_21160361.1| formylglycine-generating sulfatase enzyme [Burkholderia cenocepacia
BC7]
gi|444369974|ref|ZP_21169673.1| formylglycine-generating sulfatase enzyme [Burkholderia cenocepacia
K56-2Valvano]
gi|198038471|emb|CAR54429.1| putative exported protein [Burkholderia cenocepacia J2315]
gi|443598221|gb|ELT66596.1| formylglycine-generating sulfatase enzyme [Burkholderia cenocepacia
K56-2Valvano]
gi|443602803|gb|ELT70857.1| formylglycine-generating sulfatase enzyme [Burkholderia cenocepacia
BC7]
Length = 349
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 105/336 (31%), Positives = 142/336 (42%), Gaps = 77/336 (22%)
Query: 15 MVLLPGDTFRMGTNKPIL----IKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
MV L G F G+ + + DG + F++DQ +V+ QF FV ATGY
Sbjct: 72 MVHLRGGAFVFGSTRGYADERPVGDGR-----TRVGGFWIDQTDVTIAQFAAFVQATGY- 125
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
++E E+ + V H + R ++W
Sbjct: 126 ---------------VTEAEQQGGAAVFH-----------VPTRDEMNARDLAW------ 153
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
W W G SW HP G S + N PV V+ DA+A
Sbjct: 154 --W---------SWVKGA------------SWRHPRGPGSDVTGLGNLPVTLVTQRDALA 190
Query: 191 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR---GEHRANVWQGEFPTNNTAAD 247
Y W G LPTEAEWEY + G ++ + PR G+ AN WQG FP ++A D
Sbjct: 191 YAHWLGRDLPTEAEWEYAGKAGRDDASL----DAAPRDANGKPAANYWQGAFPVLDSAED 246
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYNPKGPTTGTDKV 302
G+ APV Y N F LY+M+GN WEWT D + H + P T V
Sbjct: 247 GHAGLAPVGCYAANGFHLYDMIGNAWEWTKDAYTGPHQSHTNGDTAAVAPPTRRHDTPMV 306
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGGS+LC+ YC R+R ++R Q D A ++GFR
Sbjct: 307 IKGGSFLCSRDYCVRYRASSREQQEADLGASHIGFR 342
>gi|430804889|ref|ZP_19432004.1| Sulfatase-modifying factor 1 [Cupriavidus sp. HMR-1]
gi|429503016|gb|ELA01319.1| Sulfatase-modifying factor 1 [Cupriavidus sp. HMR-1]
Length = 352
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 108/355 (30%), Positives = 141/355 (39%), Gaps = 88/355 (24%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
R M + G F GT L E P+ + +F++D+ EV+ QF FV ATGY
Sbjct: 66 SRTAGMARVTGGEFVPGTT---LGYPDERPAGKARVGSFWIDRTEVTVAQFAAFVQATGY 122
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT+A ER + V H
Sbjct: 123 VTDA----------------ERQGAAVVFHK----------------------------- 137
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPW------GSWLHPEGIDSTIEH-----RMNH 178
PT+AE L R + W +W HPEG D+ H R N
Sbjct: 138 ---------PTDAE--------LGQRPYAWWTMVTGANWRHPEGPDAANPHGYDHRRDNQ 180
Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
PV V+ DA AY W G LPTE EWE+ + G + T G AN WQG
Sbjct: 181 PVTLVTQADARAYANWLGHDLPTEDEWEFAAKAGRSDAGLETAPQ-TADGTPTANYWQGV 239
Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APS 289
FP NT+ DG+ APV Y N GL++M+ N WEWT D + A +
Sbjct: 240 FPVLNTSRDGFAGLAPVGCYTANALGLFDMIANAWEWTGDAYTGPRQSHANGDTAVVAAA 299
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
P+ P + V KGGS+LC +C R+R +AR D ++GFR G
Sbjct: 300 SRPRKPAAAS--VIKGGSFLCAPDFCVRYRASARESAESDLPTSHIGFRTVLRDG 352
>gi|436838039|ref|YP_007323255.1| hypothetical protein FAES_4663 [Fibrella aestuarina BUZ 2]
gi|384069452|emb|CCH02662.1| hypothetical protein FAES_4663 [Fibrella aestuarina BUZ 2]
Length = 324
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 71/174 (40%), Positives = 101/174 (58%), Gaps = 16/174 (9%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR-LFPWGNNLTPRGEHRANVW 235
N PV VSWNDA AY W G RLP E EWE+ R ++ L+P+GNNLT G+ AN W
Sbjct: 158 NMPVTQVSWNDAQAYANWAGKRLPHEIEWEHAARNATNSQTLYPFGNNLTQDGKWLANTW 217
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWT-------ADWWNVHHHPAP 288
G+FP ++ +DG+ APV ++ + GL +M GNVWEW DW N
Sbjct: 218 NGKFPDHDAVSDGFHRAAPVGTFGPSPIGLADMSGNVWEWCENPKVAYTDWLN------- 270
Query: 289 SYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
P T T++V++GGS+LC +C+ +R + RS +T +++ ++GFRC D
Sbjct: 271 -QTPPRITEATERVQRGGSFLCEPGWCHGYRVSGRSGSTAETALMHVGFRCVKD 323
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 18/48 (37%), Positives = 34/48 (70%)
Query: 346 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
T T++V++GGS+LC +C+ +R + RS +T +++ ++GFRC D+
Sbjct: 277 TEATERVQRGGSFLCEPGWCHGYRVSGRSGSTAETALMHVGFRCVKDL 324
>gi|294010120|ref|YP_003543580.1| hypothetical protein SJA_C1-01340 [Sphingobium japonicum UT26S]
gi|292673450|dbj|BAI94968.1| conserved hypothetical protein [Sphingobium japonicum UT26S]
Length = 293
Score = 137 bits (346), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 141/326 (43%), Gaps = 79/326 (24%)
Query: 16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
VL+P T +G + DG P + V + AF++D+HEV+N QF FV+ATGY T AE+
Sbjct: 40 VLVPAGTVLLGEDG-----DGR-PGQAVAVPAFWIDRHEVTNRQFAAFVAATGYRTAAEQ 93
Query: 76 FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRG 135
G + VF P + N+A +
Sbjct: 94 EGGSAVFV---------------------------------QPTGPVDLNNAGNW----- 115
Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR 195
W Y R W HP+G S I R + PVVHV++ DA AY W
Sbjct: 116 --------WRYRQRA----------DWRHPDGPGSDIAGREDWPVVHVNYQDAAAYARWA 157
Query: 196 GARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPV 255
GA LP EA+WE RG P G AN WQG FP +T DG++ APV
Sbjct: 158 GAALPDEAQWERAARGDQSGPRDPRSWAYDGEGHPTANSWQGVFPVRDTGQDGHVGLAPV 217
Query: 256 MSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYC 315
+ N FGL++MVGNVWEWT++ G + + KGGSYLC YC
Sbjct: 218 GCFDGNGFGLHDMVGNVWEWTSE-----------------RRGGNGLIKGGSYLCAMNYC 260
Query: 316 YRHRCAARSQNTPDSSAGNLGFRCAA 341
R AA D ++GFR A
Sbjct: 261 ANFRPAAFQAQELDLGTSHIGFRTIA 286
>gi|383645574|ref|ZP_09957980.1| hypothetical protein SeloA3_16162 [Sphingomonas elodea ATCC 31461]
Length = 342
Score = 137 bits (345), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 104/205 (50%), Gaps = 17/205 (8%)
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 214
R P +W HPEG S+I PVV V++ DA AY W G LP+E ++E R GL
Sbjct: 126 RFVPGANWRHPEGPGSSIAGHEEEPVVAVTFVDAQAYARWAGRALPSEEQFEAAARVGLT 185
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
+ + P + +AN WQG FP N DGY AP +K N G Y+++GNVWE
Sbjct: 186 D------PDAEPNPD-QANTWQGSFPNMNAKRDGYTGRAPAGCFKPNGAGAYDLIGNVWE 238
Query: 275 WTADWWNVHHHP---------APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQ 325
WT W+ H P PS++P+ P +V KGGSYLC YC R+R AAR
Sbjct: 239 WTESWYLPGHAPFAVGDGSPGNPSFDPRQPEAKA-RVIKGGSYLCANNYCARYRPAARHA 297
Query: 326 NTPDSSAGNLGFRCAADKGPTTGTD 350
+A +LGFR + TD
Sbjct: 298 QEETLAASHLGFRTVSRSAFDLATD 322
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 55/152 (36%), Positives = 74/152 (48%), Gaps = 23/152 (15%)
Query: 16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
V +P F MG+++ E P + AF +D HEV+N QF FV+ATGY TEAEK
Sbjct: 45 VRIPASRFVMGSSR---YYPEEGPPTETEVPAFDIDAHEVTNRQFATFVAATGYRTEAEK 101
Query: 76 -FGDTFVFEPLLSEEERAKISQVRHDMKRF------------EGLDSTIEHRMHHPVVHI 122
G FVF P K + D +R+ EG S+I PVV +
Sbjct: 102 PGGGAFVFVP-------PKTAIDEPDPRRWWRFVPGANWRHPEGPGSSIAGHEEEPVVAV 154
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
++ DA AY W G LP+E ++E R GL +
Sbjct: 155 TFVDAQAYARWAGRALPSEEQFEAAARVGLTD 186
>gi|87308793|ref|ZP_01090932.1| hypothetical protein DSM3645_11157 [Blastopirellula marina DSM
3645]
gi|87288504|gb|EAQ80399.1| hypothetical protein DSM3645_11157 [Blastopirellula marina DSM
3645]
Length = 331
Score = 137 bits (345), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/199 (40%), Positives = 104/199 (52%), Gaps = 7/199 (3%)
Query: 144 WEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEA 203
W + + G E + W HP G STI R ++PV+ VSW DA AY W G RLPTEA
Sbjct: 133 WVFDAKTG-EASIVDGAQWRHPTGPGSTIIGREDYPVLQVSWYDARAYAEWAGKRLPTEA 191
Query: 204 EWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF 263
EWE+ RGG + +P + G NVWQG FP + A+D Y APV + N F
Sbjct: 192 EWEFAARGGQTSGEYPAQDAAGNIG----NVWQGRFPFADAASDDYAEVAPVGKFAANGF 247
Query: 264 GLYNMVGNVWEWTAD-WWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCA- 321
G Y+M GN EW +D + + A +P GP G ++V +GGS+L +EQ
Sbjct: 248 GFYDMAGNAAEWCSDHYAEDSYQLADRVDPVGPVEGEERVIRGGSWLSSEQDGVSEAMVW 307
Query: 322 ARSQNTPDSSAGNLGFRCA 340
RS + P S GFRCA
Sbjct: 308 YRSHSGPRSPNNFTGFRCA 326
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 58/144 (40%), Positives = 79/144 (54%), Gaps = 7/144 (4%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M+ L G F +G KP D + P V + +D+HE +N QF F+ AT Y+T+AE
Sbjct: 71 MISLAGGLFLLG--KPDGPLDAQ-PVVKVNISPLRMDRHETTNAQFAAFIEATQYLTDAE 127
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+ G ++VF+ E +Q RH G STI R +PV+ +SW DA AY W
Sbjct: 128 RRGRSWVFDAKTGEASIVDGAQWRHPT----GPGSTIIGREDYPVLQVSWYDARAYAEWA 183
Query: 135 GARLPTEAEWEYGCRGGLENRLFP 158
G RLPTEAEWE+ RGG + +P
Sbjct: 184 GKRLPTEAEWEFAARGGQTSGEYP 207
>gi|426356328|ref|XP_004065371.1| PREDICTED: LOW QUALITY PROTEIN: sulfatase-modifying factor 2
[Gorilla gorilla gorilla]
Length = 244
Score = 136 bits (343), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 80/193 (41%), Positives = 105/193 (54%), Gaps = 24/193 (12%)
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR---LF 218
W G S I R+ HPV+HVSWNDA AYC WRG RLPTE EWE+ RGGL+
Sbjct: 68 WSLHAGPGSGIRERLEHPVLHVSWNDARAYCVWRGKRLPTEEEWEFAARGGLKGMRTCFG 127
Query: 219 PWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY-KENKFGLYNMVGNVWEWTA 277
G + T H QG+FP + A DG+ +PV ++ +N +GLY+++GNVWEWTA
Sbjct: 128 TGGKSFTLPFSH-----QGKFPKGDKAEDGFHGVSPVNAFPAQNNYGLYDLLGNVWEWTA 182
Query: 278 DWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNL 335
+P +V +G S++ + +R R R NTPDS++ NL
Sbjct: 183 -------------SPYQAAEQDMRVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNL 229
Query: 336 GFRCAADKGPTTG 348
GFRCAAD G G
Sbjct: 230 GFRCAADAGRPPG 242
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 31/49 (63%), Positives = 36/49 (73%)
Query: 106 GLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
G S I R+ HPV+H+SWNDA AYC WRG RLPTE EWE+ RGGL+
Sbjct: 73 GPGSGIRERLEHPVLHVSWNDARAYCVWRGKRLPTEEEWEFAARGGLKG 121
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 193 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 237
>gi|427401679|ref|ZP_18892751.1| hypothetical protein HMPREF9710_02347 [Massilia timonae CCUG 45783]
gi|425719391|gb|EKU82324.1| hypothetical protein HMPREF9710_02347 [Massilia timonae CCUG 45783]
Length = 334
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 104/329 (31%), Positives = 139/329 (42%), Gaps = 68/329 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNV--TLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV L G F G+ + E P+ +D F++DQ EV+N QF+ FV ATGYV
Sbjct: 63 MVRLAGGEFVFGS---LAGYPDERPASTARHKVDGFWIDQTEVTNAQFERFVQATGYV-- 117
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
+E ER + V F+ D+ R +
Sbjct: 118 --------------TEAEREGAAVV------FDTPDAAELRRRPY--------------- 142
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
A W Y RG SW P + N PV V+ DA+AY
Sbjct: 143 ---------AWWRY-VRGA---------SWKDPHSAQAPAP---NAPVTLVTQADALAYA 180
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLST 252
W G LPTEAEWEY + G ++ G + G AN WQG FP ++ DG+
Sbjct: 181 RWLGRDLPTEAEWEYAAKAGRDDAGLD-GAPVDAHGRPTANYWQGPFPLADSGQDGHAGL 239
Query: 253 APVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---APSYNPKGPTTGTDKVKKGGSYL 309
APV Y N FGL++M+GN WEWT D ++ P + V KGGS+L
Sbjct: 240 APVGCYAANAFGLHDMIGNAWEWTKDGYSGARQPHANGDTAVAAARLPAAPVVIKGGSFL 299
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
C YC R+R +AR + A ++GFR
Sbjct: 300 CAPTYCLRYRASARESQEANLGASHIGFR 328
Score = 40.4 bits (93), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 17/37 (45%), Positives = 23/37 (62%)
Query: 352 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 388
V KGGS+LC YC R+R +AR + A ++GFR
Sbjct: 292 VIKGGSFLCAPTYCLRYRASARESQEANLGASHIGFR 328
>gi|424740677|ref|ZP_18169056.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-141]
gi|422945468|gb|EKU40420.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-141]
Length = 334
Score = 136 bits (342), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 108/348 (31%), Positives = 145/348 (41%), Gaps = 81/348 (23%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFP--------------SRNVTLDAFYLDQHE 54
+E+ K+ LP + T +LI DG F + + F++DQ E
Sbjct: 44 IEKCKNYTGLPQGWLKKPTAGMVLIADGHFNFGSEKAYPDELNFGKKQREVKGFWIDQTE 103
Query: 55 VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
V+ QF FV ATGY+T+AEK VF P DS
Sbjct: 104 VTVAQFASFVKATGYITDAEKQKQAAVFSP-----------------------DS----- 135
Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH 174
HHP W + TW+ G G L N
Sbjct: 136 -HHPQ---QWWQLKSGYTWKTPN---------GSTGALANP------------------- 163
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RAN 233
N PV +VS NDA Y W G LPTE EWEY + + P T +H +AN
Sbjct: 164 --NEPVRYVSKNDAEHYAVWLGHDLPTELEWEYAAKANSKTDT-PLHQAPTDERQHPQAN 220
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP 292
WQGEFP N D + APV + N F L++ +GNVWEWT+ + H +Y+
Sbjct: 221 YWQGEFPFQNLNQDHFKGVAPVGCFAPNNFKLFDTIGNVWEWTSSPYQGAHDQHMGNYSD 280
Query: 293 --KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ T T V KGGS+LC + YC R+R ++R D +A ++GFR
Sbjct: 281 LRQQQITSTQYVIKGGSFLCAQNYCSRYRNSSRYPQDFDLAATHVGFR 328
>gi|421219252|ref|ZP_15676118.1| hypothetical protein AMCSP13_002919 [Streptococcus pneumoniae
2070335]
gi|395578724|gb|EJG39236.1| hypothetical protein AMCSP13_002919 [Streptococcus pneumoniae
2070335]
Length = 156
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 70/160 (43%), Positives = 94/160 (58%), Gaps = 10/160 (6%)
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
+A+C W G LPTEA+WEY RGG ++ + WG+ L G + AN WQG FP NTA DG
Sbjct: 1 MAFCNWSGMSLPTEAQWEYAARGGRQSE-YLWGDTLLEGGYYHANTWQGRFPYENTALDG 59
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKV 302
++ TAPV + N FGLY M+GNVWEW N + S+N PK +
Sbjct: 60 FIGTAPVYEFLPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYA 116
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+GGS+LC+ YC R+R AAR+ S++ +LGFRC +
Sbjct: 117 IRGGSFLCHCSYCNRYRVAARNGCISTSTSSHLGFRCLKE 156
>gi|293608773|ref|ZP_06691076.1| predicted protein [Acinetobacter sp. SH024]
gi|292829346|gb|EFF87708.1| predicted protein [Acinetobacter sp. SH024]
Length = 327
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/184 (40%), Positives = 96/184 (52%), Gaps = 7/184 (3%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
+W HP G+++ + + PV V+WNDA AY W G LPTE EWEY +G +N
Sbjct: 145 ANWKHPWGLNNPKQPAPHEPVRMVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI- 203
Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
T G AN WQGEFP N DG+ APV + +N FGLY+++GNVWEWT
Sbjct: 204 ---GPTHEGHIVANFWQGEFPYKNIQEDGFKDIAPVGCFSKNPFGLYDLIGNVWEWTQTP 260
Query: 280 WN---VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+ HH PS + KGGSYLC YC R+R AAR D + ++G
Sbjct: 261 YTGSRDHHMGDPSKYRQSHEQILQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVG 320
Query: 337 FRCA 340
FR
Sbjct: 321 FRTV 324
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 65/151 (43%), Gaps = 22/151 (14%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + MV LP +GT + G R + L A T F+
Sbjct: 68 PKNKEAGMVKLPQGKIVLGTTQ------GYEDERPLNLQA----------TSVPAFLIDA 111
Query: 68 GYVTEAEKFGDTFVF----EPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
VT+AEK G +F P+ EE + + + K GL++ + H PV ++
Sbjct: 112 TEVTDAEKQGGAAMFVQPDHPV--EELQWWKFEKGANWKHPWGLNNPKQPAPHEPVRMVT 169
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
WNDA AY W G LPTE EWEY +G +N
Sbjct: 170 WNDAYAYANWLGHDLPTELEWEYAAKGFQQN 200
>gi|119356728|ref|YP_911372.1| hypothetical protein Cpha266_0900 [Chlorobium phaeobacteroides DSM
266]
gi|119354077|gb|ABL64948.1| protein of unknown function DUF323 [Chlorobium phaeobacteroides DSM
266]
Length = 826
Score = 135 bits (341), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 110/339 (32%), Positives = 149/339 (43%), Gaps = 87/339 (25%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+ V +P F MG+ + + G+ V + FYL ++ V+ F+EF+ A+GY T+A
Sbjct: 560 NFVRIPEGEFTMGSPASEIDRSGDETQHQVKVSDFYLCKYAVTLAGFKEFIEASGYRTDA 619
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
EK ++V++ S E+RA I+
Sbjct: 620 EKGDGSYVWDGK-SWEKRAGIN-------------------------------------- 640
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W +G GG+ + NHPVVHVSWNDAVAYC
Sbjct: 641 ----------WRFGISGGVRP------------------QSEENHPVVHVSWNDAVAYCK 672
Query: 194 W------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
W + RLPTEAEWEY CR G F G+NLT GE N G +P N
Sbjct: 673 WISEKTGQQFRLPTEAEWEYACRAGTTTP-FHTGDNLTT-GEANYN---GNYPYNKNPKG 727
Query: 248 GYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDK 301
Y +T PV S N +GLY+M GNVWEW D + ++ NP GP TG+ +
Sbjct: 728 EYRENTVPVDSLAPNAWGLYHMHGNVWEWCGDRYGDKYYDECKAKGVVENPVGPETGSIR 787
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
V +GGS+ YC R A R+ +TP + GFR A
Sbjct: 788 VLRGGSWGNYAGYC---RSANRNSDTPGDRGSDAGFRLA 823
>gi|427423321|ref|ZP_18913479.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii WC-136]
gi|425699874|gb|EKU69473.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii WC-136]
Length = 302
Score = 135 bits (340), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 75/182 (41%), Positives = 96/182 (52%), Gaps = 7/182 (3%)
Query: 160 GSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFP 219
+W HP G+++ + + PV V+WNDA AY W G LPTE EWEY +G +N
Sbjct: 120 ANWKHPWGLNNPKQPAPHEPVRMVTWNDAYAYANWLGHDLPTELEWEYAAKGFQQNSDI- 178
Query: 220 WGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
T G AN WQGEFP N DG+ APV + +N FGLY+++GNVWEWT
Sbjct: 179 ---GPTHEGHIVANFWQGEFPYKNIQEDGFKDIAPVGCFSKNPFGLYDLIGNVWEWTQTP 235
Query: 280 WN---VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+ HH PS + KGGSYLC YC R+R AAR D + ++G
Sbjct: 236 YTGSRDHHMGDPSKYRQSHEQILQYTIKGGSYLCATNYCARYRAAARHPQELDLATSHVG 295
Query: 337 FR 338
FR
Sbjct: 296 FR 297
Score = 55.1 bits (131), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 65/151 (43%), Gaps = 22/151 (14%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + MV LP +GT + G R + L A T F+
Sbjct: 43 PKNKEAGMVKLPQGKIVLGTTQ------GYEDERPLNLQA----------TSVPAFLIDA 86
Query: 68 GYVTEAEKFGDTFVF----EPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
VT+AEK G +F P+ EE + + + K GL++ + H PV ++
Sbjct: 87 TEVTDAEKQGGAAMFVQPDHPV--EELQWWKFEKGANWKHPWGLNNPKQPAPHEPVRMVT 144
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
WNDA AY W G LPTE EWEY +G +N
Sbjct: 145 WNDAYAYANWLGHDLPTELEWEYAAKGFQQN 175
>gi|223994935|ref|XP_002287151.1| hypothetical protein THAPSDRAFT_261211 [Thalassiosira pseudonana
CCMP1335]
gi|220976267|gb|EED94594.1| hypothetical protein THAPSDRAFT_261211 [Thalassiosira pseudonana
CCMP1335]
Length = 250
Score = 135 bits (339), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 100/312 (32%), Positives = 141/312 (45%), Gaps = 77/312 (24%)
Query: 36 GEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKIS 95
G P R V ++ F++D V+N ++ +FV AT Y TEAEKFG +FV L +E + +
Sbjct: 1 GATPPRVVYIEPFFIDIAPVTNKEYGKFVRATYYETEAEKFGWSFVLSSFL--QELGESA 58
Query: 96 QVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 155
+V + + + +D
Sbjct: 59 EVDPEAEDWVAVDRAY-------------------------------------------- 74
Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGL-- 213
W +PEG ++ ++R +HPVVHVS DA YC W G RLP E EWE R G
Sbjct: 75 ------WRNPEGPGTSYKYREHHPVVHVSHRDAAEYCKWIGKRLPGEREWEAAARAGHYG 128
Query: 214 -ENR-LFPWGNNLTPR-GEHRANVW-QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMV 269
NR LF WG T + AN+W G FP N A DG+ +T+PV ++K N GLY+M
Sbjct: 129 PNNRTLFIWGEEQTVDVAKEYANLWGAGAFPWENLAEDGWRATSPVKTFKPNALGLYDMT 188
Query: 270 GNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRC--AARSQNT 327
GNVWEW H A ++ +GGSY+ + + H AR+
Sbjct: 189 GNVWEWMRG----GKHKA-------------RIVRGGSYVDSLDGNFNHAATLGARATLH 231
Query: 328 PDSSAGNLGFRC 339
++ GN+GFRC
Sbjct: 232 GTTTTGNVGFRC 243
>gi|421807665|ref|ZP_16243525.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC035]
gi|410416646|gb|EKP68418.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC035]
Length = 335
Score = 134 bits (337), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 104/328 (31%), Positives = 138/328 (42%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+T+AE
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K VF P HHP W Y TWR
Sbjct: 125 KQKQAAVFSP-----------------------------APHHP--QQWWQLKTGY-TWR 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|288919751|ref|ZP_06414077.1| protein of unknown function DUF323 [Frankia sp. EUN1f]
gi|288348851|gb|EFC83102.1| protein of unknown function DUF323 [Frankia sp. EUN1f]
Length = 281
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 101/283 (35%), Positives = 132/283 (46%), Gaps = 48/283 (16%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
K MV +PG TF MG+++ E P V +D F++D+H V+ +F+ FV ATG+VT
Sbjct: 27 KGMVRIPGGTFLMGSDE---FYAEERPVHPVAVDGFWMDEHPVTVARFRAFVKATGHVTV 83
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ QV D + G D + +V + V
Sbjct: 84 AER--------------------QV--DPADYPGADPA--GLVPGALVFRMTSGPVDLDD 119
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR W Y P W P G ST+ R HPV V++ DA AY
Sbjct: 120 WR-------RWWSYE----------PGACWHRPLGTGSTVHGRERHPVTQVAFADAAAYA 162
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP---TNNTAADGY 249
W G LPTEAEWE+ RGGL+ +F WG+ TP+G AN W G FP +A
Sbjct: 163 AWAGKDLPTEAEWEFAARGGLDGAVFAWGDEFTPKGRRMANTWHGAFPWQYVPASAGQPM 222
Query: 250 LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
T PV S+ N +GLY+M GN WEWTAD + HP P+ P
Sbjct: 223 PGTTPVGSFPPNGYGLYDMAGNTWEWTADHYT-DRHPDPAPGP 264
>gi|289760839|ref|ZP_06520217.1| hypothetical protein TBIG_03921 [Mycobacterium tuberculosis GM
1503]
gi|289708345|gb|EFD72361.1| hypothetical protein TBIG_03921 [Mycobacterium tuberculosis GM
1503]
Length = 381
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 93/252 (36%), Positives = 119/252 (47%), Gaps = 45/252 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ + E P VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +PL D + G+D+ +V V W
Sbjct: 61 E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G DS I R HPVV V++ DAVAY
Sbjct: 97 R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
W G RLPTEAEWEY RGG + WG+ P G AN WQG FP N A G++ T+
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRNDGALGWVGTS 198
Query: 254 PVMSYKENKFGL 265
PV + + L
Sbjct: 199 PVGRFPGQRVWL 210
>gi|159898226|ref|YP_001544473.1| hypothetical protein Haur_1702 [Herpetosiphon aurantiacus DSM 785]
gi|159891265|gb|ABX04345.1| protein of unknown function DUF323 [Herpetosiphon aurantiacus DSM
785]
Length = 510
Score = 133 bits (335), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 102/319 (31%), Positives = 152/319 (47%), Gaps = 84/319 (26%)
Query: 35 DGEF----PSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLS--E 88
D EF +R V L+++ +D++EV+N QF +FV T Y+T AE+ G++ + E + +
Sbjct: 263 DDEFDQNGQARQVFLNSYRIDKYEVTNAQFAQFVEQTQYITYAERQGESMILEVIETAGS 322
Query: 89 EERAKISQVRHDM-KRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYG 147
+E S ++ K+ G +S+I+ + +PV+HI + DAVAYCT + RLPTEAEWE
Sbjct: 323 KETLNFSAIKGAFWKQPYGPNSSIDDKADYPVIHIHYEDAVAYCTAKHKRLPTEAEWEKA 382
Query: 148 CRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEY 207
R G+E FPWG+ EW
Sbjct: 383 AR-GVEGWRFPWGN------------------------------------------EW-- 397
Query: 208 GCRGGLENRLFPWGNN-LTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLY 266
+ GL N P ++ L RG G+ P S +P +G++
Sbjct: 398 --KSGLSNHAIPLRSHILQVRGLQAI----GQSPQ---------SISP--------YGVH 434
Query: 267 NMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAAR 323
+++GNV EWTADW+ ++PA S NP+GP G VK+GGS+ Y + +
Sbjct: 435 DLLGNVSEWTADWYQPSYYQNNPA-SQNPQGPELGNSHVKRGGSWATPPGYLHN----SW 489
Query: 324 SQNTPDSSAGNLGFRCAAD 342
TPD + LGFRCAAD
Sbjct: 490 RIGTPDQTTDRLGFRCAAD 508
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 79/268 (29%), Positives = 114/268 (42%), Gaps = 60/268 (22%)
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
+ Y +G + E G + L W P G +S+I+ + ++PV+H+ + D
Sbjct: 302 ITYAERQGESMILEVIETAGSKETLNFSAIKGAFWKQPYGPNSSIDDKADYPVIHIHYED 361
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
AVAYCT + RLPTEAEWE RG +E FPWGN E ++ + P +
Sbjct: 362 AVAYCTAKHKRLPTEAEWEKAARG-VEGWRFPWGN------EWKSGLSNHAIPLRSHILQ 414
Query: 248 GYLSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
A S + + +G+++++GNV EWTADW+ PSY P
Sbjct: 415 VRGLQAIGQSPQSISPYGVHDLLGNVSEWTADWYQ------PSYYQNNPA---------- 458
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKGGSYLCNEQYCY 366
SQN +GP G VK+GGS+ Y +
Sbjct: 459 -----------------SQNP---------------QGPELGNSHVKRGGSWATPPGYLH 486
Query: 367 RHRCAARSQNTPDSSAGNLGFRCAADVS 394
+ TPD + LGFRCAADV+
Sbjct: 487 N----SWRIGTPDQTTDRLGFRCAADVN 510
>gi|94314341|ref|YP_587550.1| Sulfatase-modifying factor 1 [Cupriavidus metallidurans CH34]
gi|93358193|gb|ABF12281.1| Sulfatase-modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1); putative exported protein
[Cupriavidus metallidurans CH34]
Length = 352
Score = 133 bits (334), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 106/355 (29%), Positives = 138/355 (38%), Gaps = 88/355 (24%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
R M + G F GT L E P+ + +F++D+ EV+ QF FV ATGY
Sbjct: 66 SRTAGMARVTGGEFVPGTT---LGYPDERPAGKTRVGSFWIDRTEVTVAQFAAFVQATGY 122
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT+A ER + V H
Sbjct: 123 VTDA----------------ERQGAAVVFHK----------------------------- 137
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPW------GSWLHPEGIDSTIEH-----RMNH 178
PT+AE L R + W +W HPEG + H R N
Sbjct: 138 ---------PTDAE--------LGQRPYAWWTMVTGANWRHPEGPAAANSHGYDHRRDNQ 180
Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGE 238
PV V+ DA AY W G LPTE EWE+ + G + T G AN WQG
Sbjct: 181 PVTLVTQADARAYANWLGHDLPTEDEWEFAAKAGRSDAGLETAPQ-TAEGTPTANYWQGV 239
Query: 239 FPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP---------APS 289
FP NT+ DG+ APV Y N GL++M+ N WEWT D + A +
Sbjct: 240 FPVLNTSRDGFAGLAPVGCYTANALGLFDMIANAWEWTGDAYTGPRQSHANGDTAVVAAA 299
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+ P V KGGS+LC +C R+R +AR D ++GFR G
Sbjct: 300 SRSRKPA--ATSVIKGGSFLCAPDFCVRYRASARESAESDLPTSHIGFRTVLRDG 352
>gi|406833741|ref|ZP_11093335.1| hypothetical protein SpalD1_18927 [Schlesneria paludicola DSM
18645]
Length = 289
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 111/353 (31%), Positives = 158/353 (44%), Gaps = 88/353 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIK------------DGEFPSRNVTL-DAFYLDQHEVSNTQFQ 61
M+ +P F MG + ++ DGE P V + AF + Q+EV+ QF
Sbjct: 1 MIEIPAGEFLMGAEEERIVTLRKFPNSQPKLLDGELPRHKVRITKAFDMGQYEVTLAQFM 60
Query: 62 EFVSATGYVTEAEKFGD-TFVFEP---LLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
F AT Y EAE+ G +F +P LL +E + + ++ H
Sbjct: 61 TFRDATKYEIEAERDGQPSFGIDPEGELLKSQEFRPWNPI------------GWKNDTDH 108
Query: 118 PVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST 171
PV+++SWNDAVA+C W R RLPTEAEWEY CR G E R F +G E +D
Sbjct: 109 PVIYVSWNDAVAFCEWLSQQEGRSYRLPTEAEWEYACRAGSETR-FSFGD----EPVDL- 162
Query: 172 IEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
+ + NDA R A+ L PR + R
Sbjct: 163 --------IRFANSNDAT-----RRAK-------------------------LFPRLDQR 184
Query: 232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSY 290
+ G F + +D Y+ T+PV + N FGLY+M GNVWEW +D+++ ++ +P
Sbjct: 185 K--FGGHFLSK---SDNYVWTSPVGKFAPNSFGLYDMHGNVWEWCSDFFDKDYYKDSPID 239
Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
+PKGP G +V +GGSY E R A R+ S N GFR ++
Sbjct: 240 DPKGPGNGESRVLRGGSY---ENQLMSLRSAIRAGAPASSRGSNTGFRVVRER 289
>gi|169796826|ref|YP_001714619.1| hypothetical protein ABAYE2816 [Acinetobacter baumannii AYE]
gi|213156782|ref|YP_002318443.1| hypothetical protein AB57_1055 [Acinetobacter baumannii AB0057]
gi|215484304|ref|YP_002326531.1| hypothetical protein ABBFA_002633 [Acinetobacter baumannii
AB307-0294]
gi|301346709|ref|ZP_07227450.1| hypothetical protein AbauAB0_10696 [Acinetobacter baumannii AB056]
gi|301511822|ref|ZP_07237059.1| hypothetical protein AbauAB05_09590 [Acinetobacter baumannii AB058]
gi|301594414|ref|ZP_07239422.1| hypothetical protein AbauAB059_01345 [Acinetobacter baumannii
AB059]
gi|332853700|ref|ZP_08434930.1| hypothetical protein HMPREF0021_02513 [Acinetobacter baumannii
6013150]
gi|332870879|ref|ZP_08439524.1| hypothetical protein HMPREF0020_03177 [Acinetobacter baumannii
6013113]
gi|403675978|ref|ZP_10938061.1| hypothetical protein ANCT1_15121 [Acinetobacter sp. NCTC 10304]
gi|417574664|ref|ZP_12225518.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Canada BC-5]
gi|421620540|ref|ZP_16061472.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC074]
gi|421643641|ref|ZP_16084135.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-235]
gi|421646374|ref|ZP_16086826.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-251]
gi|421658785|ref|ZP_16099016.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-83]
gi|421700071|ref|ZP_16139588.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-58]
gi|421796085|ref|ZP_16232154.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-21]
gi|421801852|ref|ZP_16237809.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Canada BC1]
gi|169149753|emb|CAM87644.1| conserved hypothetical protein; putative exported protein
[Acinetobacter baumannii AYE]
gi|213055942|gb|ACJ40844.1| hypothetical protein AB57_1055 [Acinetobacter baumannii AB0057]
gi|213988595|gb|ACJ58894.1| conserved hypothetical protein [Acinetobacter baumannii AB307-0294]
gi|332728524|gb|EGJ59898.1| hypothetical protein HMPREF0021_02513 [Acinetobacter baumannii
6013150]
gi|332731980|gb|EGJ63258.1| hypothetical protein HMPREF0020_03177 [Acinetobacter baumannii
6013113]
gi|400210232|gb|EJO41202.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Canada BC-5]
gi|404570453|gb|EKA75526.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-58]
gi|408508324|gb|EKK10010.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-235]
gi|408517761|gb|EKK19299.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-251]
gi|408700227|gb|EKL45690.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC074]
gi|408709481|gb|EKL54727.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-83]
gi|410400281|gb|EKP52460.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-21]
gi|410405109|gb|EKP57162.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Canada BC1]
Length = 335
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 103/328 (31%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV TGY+T+AE
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKETGYITDAE 124
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K VF P HHP W Y TWR
Sbjct: 125 KQKQAAVFSP-----------------------------DPHHP--QQWWQLKTGY-TWR 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|365088279|ref|ZP_09327876.1| hypothetical protein KYG_03856 [Acidovorax sp. NO-1]
gi|363417259|gb|EHL24344.1| hypothetical protein KYG_03856 [Acidovorax sp. NO-1]
Length = 289
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 107/325 (32%), Positives = 146/325 (44%), Gaps = 91/325 (28%)
Query: 37 EFPSRNVTL-DAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG-DTFVFEPLLSEEERAKI 94
E P+ VT+ AFYL QHEV+ QF+ FV A+GY E+E G + + P
Sbjct: 28 EGPAHRVTIGQAFYLGQHEVTVGQFRRFVQASGYRAESEADGTGGYGYNPRYD-----PA 82
Query: 95 SQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 154
+ VR D FEG D+ +WR
Sbjct: 83 TTVRGDA--FEGRDTRY--------------------SWR-------------------- 100
Query: 155 RLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYG 208
D +HPVV+V+WNDA+A W RLPTEAEWEY
Sbjct: 101 --------------DPGFAQGDDHPVVNVTWNDAMALADWLSRTEGHHYRLPTEAEWEYA 146
Query: 209 CRGGLENRLFPWGNNLTPRGE-HRANV-----------WQGEFPTNNTAADGYLSTAPVM 256
CR G R +P G++ P+G H AN WQ N DG+ TAPV
Sbjct: 147 CRAGTRTR-YPHGDD--PQGLLHTANTFDQAAAPLWPRWQQHALAGN---DGHAFTAPVG 200
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYC 315
SY N FGL++M+GN WEW ADW +++ +P +P+GP GT +V++GGS+ +
Sbjct: 201 SYGANAFGLHDMLGNAWEWVADWHGDNYYAHSPRNDPRGPAHGTVRVRRGGSW---HTWA 257
Query: 316 YRHRCAARSQNTPDSSAGNLGFRCA 340
+ RC R+ N+P + +G R A
Sbjct: 258 FYARCGYRNWNSPQTRYTLVGMRLA 282
>gi|194292611|ref|YP_002008518.1| sulfatase-modifying factor 1 (c-alpha-formyglycine-generating
enzyme 1); exported protein [Cupriavidus taiwanensis LMG
19424]
gi|193226515|emb|CAQ72466.1| Sulfatase-modifying factor 1 precursor
(C-alpha-formyglycine-generating enzyme 1); putative
exported protein [Cupriavidus taiwanensis LMG 19424]
Length = 346
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 142/359 (39%), Gaps = 97/359 (27%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
ER MV + G F GT E P+ V + F++DQ EV+ QF FV ATGY
Sbjct: 61 ERTAGMVRVAGGEFLPGTTHGY---PDERPAGKVRVRPFWIDQTEVTVAQFAAFVDATGY 117
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
+ T R VV
Sbjct: 118 I---------------------------------------TDAERQGTAVVF-------- 130
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPW------GSWLHPEG--IDSTIEH-----RM 176
R P++AE L R + W G W HP+G + S + R
Sbjct: 131 -------RQPSDAE--------LRARPYAWWSMVQGGDWRHPQGRAVGSDADRDGWQARA 175
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLF---PWGNNLTPRGEHRAN 233
+ PV V+ DA+AY W G LP+E EWE+ + G + P + TP AN
Sbjct: 176 SQPVTLVTQADALAYARWLGRELPSEDEWEFAAKAGRSDAALEAAPQSADGTPT----AN 231
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK 293
WQG FP N++ DG+ APV Y N F LY+M+ N WEWTAD + P S+
Sbjct: 232 YWQGVFPVLNSSRDGFAGLAPVGCYPANGFALYDMIANAWEWTAD---AYTGPRQSHA-N 287
Query: 294 GPTTGT--------DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
G T V KGGS+LC +C R+R +AR D ++GFR G
Sbjct: 288 GDTAAVAGRQRAVRSAVIKGGSFLCAPDFCVRYRASAREPAENDLPTSHIGFRTVLRDG 346
>gi|417551131|ref|ZP_12202209.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-18]
gi|417565391|ref|ZP_12216265.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC143]
gi|395557147|gb|EJG23148.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC143]
gi|400385586|gb|EJP48661.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-18]
Length = 326
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 99/340 (29%), Positives = 145/340 (42%), Gaps = 76/340 (22%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+ Y MV +P TF++G N + + S + T+ F++D EV+N QFQ FV ATGY
Sbjct: 54 DAYAGMVKIPSGTFKIGNNNSYPEEKALYKS-DRTIKEFWIDATEVTNAQFQSFVKATGY 112
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKIS----QVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
+TEAE G+ VF P +E+ A+++ + + G D+T + + PV I+
Sbjct: 113 LTEAESQGEAAVFTP--PKEQVAELAWWSLMKGANWRHPWGPDATRQILPNEPVRMITLK 170
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
DA+AY W G LPTE +WEY +G + R I + +EH
Sbjct: 171 DAMAYADWLGRDLPTEEQWEYAAKGFSKER-----------DISANLEH----------- 208
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
+ W G FP+ N
Sbjct: 209 ---IDANVWEGQ--------------------FPY---------------------QNEN 224
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVH--HHPAPSYNPKGPTTGTDKV 302
DG+ APV + N FGLY+M+GNVWE+T+ + H H + P
Sbjct: 225 KDGFEDVAPVGCFPSNGFGLYDMIGNVWEYTSSPFTGTHDDHMGEQQLHTGHPQVFNSYT 284
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
KGGS+LC YC R+R +AR D ++GFR +
Sbjct: 285 IKGGSFLCASNYCMRYRASARHSQEADLGISHVGFRTVKN 324
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 75/239 (31%), Positives = 105/239 (43%), Gaps = 54/239 (22%)
Query: 156 LFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLEN 215
L +W HP G D+T + N PV ++ DA+AY W G LPTE +WEY +G +
Sbjct: 140 LMKGANWRHPWGPDATRQILPNEPVRMITLKDAMAYADWLGRDLPTEEQWEYAAKGFSKE 199
Query: 216 RLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWE 274
R +++ EH ANVW+G+FP N DG+ APV + N FGLY+M+GNVWE
Sbjct: 200 R------DISANLEHIDANVWEGQFPYQNENKDGFEDVAPVGCFPSNGFGLYDMIGNVWE 253
Query: 275 WTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
+T+ P TGT ++ +Q H
Sbjct: 254 YTSS----------------PFTGTHD-----DHMGEQQLHTGH---------------- 276
Query: 335 LGFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
P KGGS+LC YC R+R +AR D ++GFR ++
Sbjct: 277 ----------PQVFNSYTIKGGSFLCASNYCMRYRASARHSQEADLGISHVGFRTVKNI 325
>gi|91202992|emb|CAJ72631.1| conserved hypothetical protein [Candidatus Kuenenia
stuttgartiensis]
Length = 263
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 74/167 (44%), Positives = 98/167 (58%), Gaps = 19/167 (11%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
+ PVV V W DAVAY W G RLPTEAEWE RG + RL+PWGN + R NVW+
Sbjct: 109 DKPVVGVRWEDAVAYAEWAGKRLPTEAEWEKAARGS-DRRLWPWGNEIF--DAERCNVWE 165
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNV-HHHPAPSYNPK 293
+ + T+PV SYK+ K +G Y+M GNVWEW DW++ +++ P NPK
Sbjct: 166 TK----------KMQTSPVGSYKQGKSPYGCYDMSGNVWEWCTDWYDQNYYYSFPRENPK 215
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP +G KV +GGS+ + + RCAAR + P + +GFRCA
Sbjct: 216 GPESGQQKVIRGGSW---HYFGHFARCAARYRVLPHAETPQIGFRCA 259
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 60/167 (35%), Positives = 79/167 (47%), Gaps = 43/167 (25%)
Query: 3 LLPAPPVERYKDMVLLPGDTFRMGTNK--------PILIKDGEFPSRNVTLDAFYLDQHE 54
LL A E+Y MV +P F MG +K I DG P V LD++Y+D++E
Sbjct: 22 LLAAEEGEQYPGMVYVPAGEFYMGEDKYYDWTFMLAFNIYDG--PEHIVYLDSYYIDKYE 79
Query: 55 VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
V+N Q+ +F+ TG+ +I D +RF D
Sbjct: 80 VTNEQYAKFIEETGH-----------------------RIPYCWFD-ERFNTPDK----- 110
Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
PVV + W DAVAY W G RLPTEAEWE RG + RL+PWG+
Sbjct: 111 ---PVVGVRWEDAVAYAEWAGKRLPTEAEWEKAARGS-DRRLWPWGN 153
>gi|302037191|ref|YP_003797513.1| hypothetical protein NIDE1860 [Candidatus Nitrospira defluvii]
gi|300605255|emb|CBK41588.1| conserved exported protein of unknown function, similar to
sulfatase modifying factor [Candidatus Nitrospira
defluvii]
Length = 296
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 103/339 (30%), Positives = 139/339 (41%), Gaps = 79/339 (23%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R DMV +P FRMG + E P R + L AF +D+ EV+N +QEFV ATGY
Sbjct: 28 RVDDMVFVPAGEFRMGATEESGGLPDERPLRLIYLSAFRIDRQEVTNAAYQEFVEATGYQ 87
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
A P L+ E ++ + HPVV++SW DAVA+
Sbjct: 88 APANS-------APALTLWEH----------------NAPLPGIEQHPVVNVSWLDAVAF 124
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN---- 186
C W RLPTEAEWE R G + R +PWG+ WN
Sbjct: 125 CRWANKRLPTEAEWEKAAR-GTDGRTYPWGN----------------------EWNFENG 161
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
++ +Y R + EWE G+ G +
Sbjct: 162 NSASYWARRTVQFADSTEWEAFWVKGV-----------------------GAAISKEKGL 198
Query: 247 DGYLSTAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVH-HHPAPSYNPKGPTTGTDKVK 303
G + T PV S+ + +G+ +M GN EW DW+N + + A NP+GP G K
Sbjct: 199 KGEILTLPVGSFPGGASPYGVLDMAGNAAEWVQDWYNPNDYRTAALTNPQGPERGAIKGM 258
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+GGS+L + R R T DS GFRCA D
Sbjct: 259 RGGSWL---KPAISLRTTDRDWGTMDSRPSGTGFRCARD 294
>gi|418121846|ref|ZP_12758789.1| hypothetical protein SPAR80_1718 [Streptococcus pneumoniae GA44194]
gi|419491687|ref|ZP_14031425.1| hypothetical protein SPAR88_1682 [Streptococcus pneumoniae GA47179]
gi|419532986|ref|ZP_14072501.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA47794]
gi|421275499|ref|ZP_15726328.1| hypothetical protein SPAR117_1508 [Streptococcus pneumoniae
GA52612]
gi|353792682|gb|EHD73054.1| hypothetical protein SPAR80_1718 [Streptococcus pneumoniae GA44194]
gi|379593049|gb|EHZ57864.1| hypothetical protein SPAR88_1682 [Streptococcus pneumoniae GA47179]
gi|379605506|gb|EHZ70257.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA47794]
gi|395873463|gb|EJG84555.1| hypothetical protein SPAR117_1508 [Streptococcus pneumoniae
GA52612]
Length = 148
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 67/150 (44%), Positives = 90/150 (60%), Gaps = 10/150 (6%)
Query: 199 LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY 258
LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA+DG++ TAPV +
Sbjct: 3 LPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTASDGFIGTAPVYEF 61
Query: 259 KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNE 312
N FGLY M+GNVWEW N + S+N PK + +GGS+LC+
Sbjct: 62 LPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHC 118
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R AAR+ S++ +LGFRC +
Sbjct: 119 SYCNRYRVAARNGCISTSTSSHLGFRCLKE 148
>gi|392409905|ref|YP_006446512.1| hypothetical protein Desti_1536 [Desulfomonile tiedjei DSM 6799]
gi|390623041|gb|AFM24248.1| hypothetical protein Desti_1536 [Desulfomonile tiedjei DSM 6799]
Length = 671
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 76/184 (41%), Positives = 101/184 (54%), Gaps = 15/184 (8%)
Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
+ P V +SWNDA A+C W R RLPTEAEWE GCRGG FPWG+ L G+
Sbjct: 270 DQPAVFISWNDAQAFCRWLSRKEKRTYRLPTEAEWEMGCRGGSAWVRFPWGDRLP--GDR 327
Query: 231 RANVWQG--EFPTNNTAA-DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP- 286
N G + P + T DGY +PV S+ N GLY+M GNV EW D + +++
Sbjct: 328 DLNFGDGNPKLPESMTTVNDGYEYVSPVGSFPPNGLGLYDMEGNVMEWVEDRYERNYYET 387
Query: 287 APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
+P +PKGP +GT ++ KGG++ + C RCA R + P+ S NLGFR +
Sbjct: 388 SPIEDPKGPASGTSRINKGGNWYASPADC---RCAFRGFSGPEMSFFNLGFRVVLEDENQ 444
Query: 347 TGTD 350
TD
Sbjct: 445 LKTD 448
>gi|418160529|ref|ZP_12797228.1| hypothetical protein SPAR43_1871 [Streptococcus pneumoniae GA17227]
gi|353822262|gb|EHE02438.1| hypothetical protein SPAR43_1871 [Streptococcus pneumoniae GA17227]
Length = 148
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 67/150 (44%), Positives = 89/150 (59%), Gaps = 10/150 (6%)
Query: 199 LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY 258
LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NTA DG++ TAPV +
Sbjct: 3 LPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEF 61
Query: 259 KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNE 312
N FGLY M+GNVWEW N + S+N PK + +GGS+LC+
Sbjct: 62 LPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHC 118
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R AAR+ S++ +LGFRC +
Sbjct: 119 SYCNRYRVAARNGCISTSTSSHLGFRCLKE 148
>gi|119628390|gb|EAX07985.1| sulfatase modifying factor 2, isoform CRA_j [Homo sapiens]
Length = 273
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 101/337 (29%), Positives = 136/337 (40%), Gaps = 117/337 (34%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV L G F MGTN P +DGE P R T+ F +D V+N F++FV Y TEAE
Sbjct: 49 MVQLQGGRFLMGTNSPD-SRDGEGPVREATVKPFAIDIFPVTNKDFRDFVREKKYRTEAE 107
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
FG +FVFE +S+E R K +Q P+ + W W
Sbjct: 108 MFGWSFVFEDFVSDELRNKATQ---------------------PMKSVLW--------W- 137
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
LP E + W P G S I R+ HPV+HVS
Sbjct: 138 ---LPVEKAF-----------------WRQPAGPGSGIRERLEHPVLHVS---------- 167
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
W G+FP + A DG+ +P
Sbjct: 168 ----------------------------------------WNGKFPKGDKAEDGFHGVSP 187
Query: 255 VMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL--CN 311
V ++ +N +GLY+++GNVWEWTA +P +V +G S++ +
Sbjct: 188 VNAFPAQNNYGLYDLLGNVWEWTA-------------SPYQAAEQDMRVLRGASWIDTAD 234
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
+R R R NTPDS++ NLGFRCAAD G G
Sbjct: 235 GSANHRARVTTRMGNTPDSASDNLGFRCAADAGRPPG 271
Score = 41.2 bits (95), Expect = 0.96, Method: Compositional matrix adjust.
Identities = 20/45 (44%), Positives = 28/45 (62%), Gaps = 2/45 (4%)
Query: 351 KVKKGGSYL--CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
+V +G S++ + +R R R NTPDS++ NLGFRCAAD
Sbjct: 222 RVLRGASWIDTADGSANHRARVTTRMGNTPDSASDNLGFRCAADA 266
>gi|297181914|gb|ADI18091.1| uncharacterized conserved protein [uncultured Acidobacteriales
bacterium HF0200_23L05]
Length = 272
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 69/167 (41%), Positives = 94/167 (56%), Gaps = 7/167 (4%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
+HPVVH++W+DA + CTW G RLPTEAEWEY RGG + +F WGN LT AN
Sbjct: 104 DHPVVHLTWDDAASVCTWVGGRLPTEAEWEYAARGGSDGTIFWWGNQLT---RDYANFGH 160
Query: 237 GEFPTNNT-AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP 295
E T AD +++TAPV S+ N FG Y++ GN WEW W + + +P+ P
Sbjct: 161 TECCAGTTGGADVWVNTAPVGSFPANGFGFYDISGNAWEWVDGWIDDQYPNESVIDPQPP 220
Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
TG +V +G S+L Y R + R + ++ N+G RCA D
Sbjct: 221 ATGQLRVMRGASWL---NYPEVWRLSVRLAFSAEAHTSNIGVRCARD 264
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 79/168 (47%), Gaps = 41/168 (24%)
Query: 1 MVLLPAPPVERYKDM--VLLPGDTFRMGT--NKPILIKDGEFPSRNVTLDAFYLDQHEVS 56
+VL A V + D+ V + G TF+MG + P ++ E P VTL AF L + E++
Sbjct: 20 IVLGKAELVAQEIDLSWVRVEGGTFQMGCVPDDPFCLET-EQPRHEVTLSAFELMETELT 78
Query: 57 NTQFQEFVSATGYVTE-AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRM 115
+Q+ FV ATG A F T
Sbjct: 79 VSQYGVFVDATGRTPPPAPDFLQT-----------------------------------G 103
Query: 116 HHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWL 163
HPVVH++W+DA + CTW G RLPTEAEWEY RGG + +F WG+ L
Sbjct: 104 DHPVVHLTWDDAASVCTWVGGRLPTEAEWEYAARGGSDGTIFWWGNQL 151
>gi|225857398|ref|YP_002738909.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae P1031]
gi|225724825|gb|ACO20677.1| sulfatase-modifying factor 1 (C-alpha-formyglycine-generating
enzyme 1) [Streptococcus pneumoniae P1031]
Length = 148
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 66/150 (44%), Positives = 89/150 (59%), Gaps = 10/150 (6%)
Query: 199 LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY 258
LPTEA+WEY RGG ++ +PWG+ L G + AN WQG FP NT +DG++ TAPV +
Sbjct: 3 LPTEAQWEYAARGGRQSE-YPWGDTLLEGGYYHANTWQGRFPYENTVSDGFIGTAPVYEF 61
Query: 259 KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNE 312
N FGLY M+GNVWEW N + S+N PK + +GGS+LC+
Sbjct: 62 LPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHC 118
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R AAR+ S++ +LGFRC +
Sbjct: 119 SYCNRYRVAARNGCISTSTSSHLGFRCLKE 148
>gi|114799422|ref|YP_760374.1| hypothetical protein HNE_1666 [Hyphomonas neptunium ATCC 15444]
gi|114739596|gb|ABI77721.1| conserved hypothetical protein [Hyphomonas neptunium ATCC 15444]
Length = 256
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 77/182 (42%), Positives = 97/182 (53%), Gaps = 16/182 (8%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
SW PEG S+IE + N PV+HVS DA AY W G RLP+E EWE+ R GL +
Sbjct: 87 SWKTPEGAGSSIEGKGNWPVMHVSLADAEAYAAWAGGRLPSEEEWEHAARLGLPDPDRET 146
Query: 221 GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
G+ RAN WQG FP N DG+ APV + ++ GLY+M+GNVWEWT
Sbjct: 147 SGAFEDDGKPRANTWQGIFPVANAGEDGFAGAAPVGCFPADQLGLYDMIGNVWEWTD--- 203
Query: 281 NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
P P T KGGSYLC + +C R+R AAR D S+ ++GFR
Sbjct: 204 ----------TPFAPGNNT---IKGGSYLCADNFCQRYRPAARHPQEIDFSSNHIGFRIV 250
Query: 341 AD 342
+
Sbjct: 251 KE 252
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 72/143 (50%), Gaps = 12/143 (8%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
V +PG + G I E P + +D F + HEV+N QF EFV+ATGYVT+AE
Sbjct: 4 FVEVPGGVLQKGRGA---IYPEERPEVTLHVDGFRIQAHEVTNDQFAEFVTATGYVTDAE 60
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRF-----EGLDSTIEHRMHHPVVHISWNDAVA 129
+ V E H M+ EG S+IE + + PV+H+S DA A
Sbjct: 61 RG----VMEDRPGAGSAVFQGARWHLMREASWKTPEGAGSSIEGKGNWPVMHVSLADAEA 116
Query: 130 YCTWRGARLPTEAEWEYGCRGGL 152
Y W G RLP+E EWE+ R GL
Sbjct: 117 YAAWAGGRLPSEEEWEHAARLGL 139
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 26/40 (65%)
Query: 354 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADV 393
KGGSYLC + +C R+R AAR D S+ ++GFR ++
Sbjct: 214 KGGSYLCADNFCQRYRPAARHPQEIDFSSNHIGFRIVKEL 253
>gi|121596142|ref|YP_988038.1| hypothetical protein Ajs_3855 [Acidovorax sp. JS42]
gi|120608222|gb|ABM43962.1| protein of unknown function DUF323 [Acidovorax sp. JS42]
Length = 348
Score = 127 bits (319), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 105/353 (29%), Positives = 147/353 (41%), Gaps = 83/353 (23%)
Query: 17 LLPGDTFRMGTNKPI--LIKD-------------GEFPSRNVTLD-AFYLDQHEVSNTQF 60
+ PG +FRMG+++P+ L +D E P+ V + AFYL Q EV+ QF
Sbjct: 44 IAPG-SFRMGSSEPVDALARDYPTLERWRFEQLADEAPAHTVRITRAFYLGQTEVTVGQF 102
Query: 61 QEFVSATGYVTEAEKFGDT-FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPV 119
+ FV +GYV E+E G + + P A+ R+ + HPV
Sbjct: 103 RRFVQTSGYVPESEADGTGGYGYNPAYDPAASARGDAFEGRDPRYSWRNPGFAQGDDHPV 162
Query: 120 VHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIE 173
V+I+W+DA A W A RLPTEAEWEY CR G R +P G D
Sbjct: 163 VNITWHDAHALAAWLSATEGHRYRLPTEAEWEYACRAGTRTR--------YPHGDDPAAL 214
Query: 174 HRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
R +V DA Y
Sbjct: 215 ARH----ANVFDEDAAPY------------------------------------------ 228
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNP 292
W P DG+ TAPV Y N FG+++M+GNVWEW +DW+ + +P +P
Sbjct: 229 -WPRWRPQAAPGHDGHAFTAPVGRYAPNAFGVHDMLGNVWEWVSDWYAEEGYAHSPRDDP 287
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
+GP G +V++GGS+ Y RC R+ NTP + +G R + P
Sbjct: 288 QGPAAGQVRVRRGGSWHTWPLYA---RCGFRNWNTPQTRYTLVGLRLVREIDP 337
>gi|241766627|ref|ZP_04764477.1| protein of unknown function DUF323 [Acidovorax delafieldii 2AN]
gi|241363096|gb|EER58719.1| protein of unknown function DUF323 [Acidovorax delafieldii 2AN]
Length = 344
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 105/364 (28%), Positives = 151/364 (41%), Gaps = 82/364 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPS---RNVTL-------------DAFYLDQHEVSNT 58
V +P +F MG+++ FP + TL AFYL QHEV+
Sbjct: 21 FVRVPAGSFWMGSDESAQTLAQAFPRLEPKRFTLLGDEAPVHRVHIRRAFYLGQHEVTVG 80
Query: 59 QFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
QF+ F+ A+GY E+E G + + P A+ R+ + H
Sbjct: 81 QFRRFIEASGYRPESEADGTGGYGYNPDYDPATTARGDAFEGRNPRYSWRNPGFVQGDDH 140
Query: 118 PVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST 171
PVV+++WNDA A W RLPTEAEWEY CR G R +P G P G+
Sbjct: 141 PVVNVTWNDAQALAAWLSRTEGHRYRLPTEAEWEYACRAGTRTR-YPHGD--APAGL--- 194
Query: 172 IEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
V + D A W PR +
Sbjct: 195 --------VSEANVFDQAAAPYW-------------------------------PRWQQH 215
Query: 232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSY 290
A P N DGY TAPV + N FGLY+M+GN WEW +DW + ++ +P
Sbjct: 216 A------LPGN----DGYAFTAPVGRFAPNAFGLYDMLGNAWEWVSDWHSDTYYAQSPPS 265
Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTD 350
+P+GP G+ +V++GGS+ + + RCA R+ N+P + +G R + G
Sbjct: 266 DPQGPAEGSVRVRRGGSW---HTWAFYARCAFRNWNSPQTRYTLVGMRLVRETDEEDGIR 322
Query: 351 KVKK 354
V +
Sbjct: 323 DVPR 326
>gi|445491299|ref|ZP_21459614.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii AA-014]
gi|444764433|gb|ELW88746.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii AA-014]
Length = 335
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 138/330 (41%), Gaps = 72/330 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA- 73
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+T+A
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124
Query: 74 -EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
+K F +P HHP W Y T
Sbjct: 125 KQKQAAVFSPDP-------------------------------HHP--QQWWQLKTGY-T 150
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR G G L N + PV +V+ NDA Y
Sbjct: 151 WRTPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYA 180
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLS 251
W G LPTE EWEY + P T +H +AN WQG FP N D +
Sbjct: 181 VWLGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTG 239
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWT-ADWWNVHHHPAPSYNP--KGPTTGTDKVKKGGSY 308
APV +K N F L++M+GNVWEWT + + H +Y+ + + T V KGGS+
Sbjct: 240 IAPVGCFKPNGFKLFDMIGNVWEWTSSSYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSF 299
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
LC E YC R+R ++R D +A ++GFR
Sbjct: 300 LCAENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|222112297|ref|YP_002554561.1| hypothetical protein Dtpsy_3128 [Acidovorax ebreus TPSY]
gi|221731741|gb|ACM34561.1| protein of unknown function DUF323 [Acidovorax ebreus TPSY]
Length = 348
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 112/361 (31%), Positives = 152/361 (42%), Gaps = 99/361 (27%)
Query: 17 LLPGDTFRMGTNKPI----------------LIKDGEFPSRNVTLD-AFYLDQHEVSNTQ 59
+ PG +FRMG+++P+ + D E P+ V + AFYL Q EV+ Q
Sbjct: 44 IAPG-SFRMGSSEPVDALARAYPTLERWRFEQLAD-EAPAHTVRITRAFYLGQTEVTVGQ 101
Query: 60 FQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRM--- 115
F+ FV A+GYV E+E G + + P + VR D FEG D R
Sbjct: 102 FRRFVQASGYVPESEADGTGGYGYNPAYD-----PAASVRGDA--FEGRDPRYSWRNPGF 154
Query: 116 ----HHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
HPVV+I+W+DA A W A RLPTEAEWEY CR G R +P
Sbjct: 155 AQGDDHPVVNITWHDAHALAAWLSATEGHRYRLPTEAEWEYACRAGTRTR--------YP 206
Query: 166 EGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLT 225
G D R +V DA Y
Sbjct: 207 HGDDPAALARH----ANVFDEDAAPY---------------------------------- 228
Query: 226 PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH 285
W P DG+ TAPV Y N FG+++M+GNVWEW +DW+ +
Sbjct: 229 ---------WPRWRPQAAPGHDGHAFTAPVGRYAPNAFGVHDMLGNVWEWVSDWYAEEGY 279
Query: 286 P-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKG 344
+P +P+GP TG +V++GGS+ Y RC R+ NTP + +G R +
Sbjct: 280 AHSPRDDPQGPATGQVRVRRGGSWHTWPLYA---RCGFRNWNTPQTRYTLVGLRLVREID 336
Query: 345 P 345
P
Sbjct: 337 P 337
>gi|417553728|ref|ZP_12204797.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-81]
gi|417562667|ref|ZP_12213546.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC137]
gi|421201406|ref|ZP_15658565.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC109]
gi|421456418|ref|ZP_15905760.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-123]
gi|421635404|ref|ZP_16076006.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-13]
gi|421805025|ref|ZP_16240919.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-A-694]
gi|395525249|gb|EJG13338.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC137]
gi|395563438|gb|EJG25091.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC109]
gi|400210846|gb|EJO41810.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-123]
gi|400390145|gb|EJP57192.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-81]
gi|408702223|gb|EKL47636.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-13]
gi|410410075|gb|EKP61995.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-A-694]
Length = 335
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 102/330 (30%), Positives = 138/330 (41%), Gaps = 72/330 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA- 73
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+T+A
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124
Query: 74 -EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
+K F +P HHP W Y T
Sbjct: 125 KQKQAAVFSPDP-------------------------------HHP--QQWWQLKTGY-T 150
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR G G L N + PV +V+ NDA Y
Sbjct: 151 WRTPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYA 180
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLS 251
W G LPTE EWEY + P T +H +AN WQG FP N D +
Sbjct: 181 VWLGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTG 239
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSY 308
APV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+
Sbjct: 240 IAPVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSF 299
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
LC E YC R+R ++R D +A ++GFR
Sbjct: 300 LCAENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|332663258|ref|YP_004446046.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332332072|gb|AEE49173.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
Length = 1050
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/354 (29%), Positives = 158/354 (44%), Gaps = 60/354 (16%)
Query: 14 DMVLLPGDTFRMGTNKPILI----------KDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
+MV++ G +F MG + IL+ D E V L F++ H V+ +F+ F
Sbjct: 734 EMVIVEGGSFAMGLDSTILLYNDGKGNKEHSDNEQSVYQVVLADFWMATHPVTLGEFRRF 793
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
V TGY T AEK G ++ ++ E E+ + + + K T++ HPV+H+S
Sbjct: 794 VQETGYQTLAEKEGWSWSYDAETGEWEKIEGQNWQCNTK------GTLQINDRHPVLHLS 847
Query: 124 WNDAVAYCTW--RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
W DA+ YC W R A L + +Y L+P+ + E+ V
Sbjct: 848 WYDALEYCNWLSRAAGLAEVYQIDY----------------LNPDA-HNLAENDDLKWTV 890
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
V+W + G RLPTEAEWEY R R +GN G+ A+ Q F
Sbjct: 891 KVNWETS-------GYRLPTEAEWEYAARE--RGRSVRFGN-----GQDIADPTQMNFDA 936
Query: 242 NNTAADGY-------LSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNPK 293
++ Y +T P ++ N G++NM GNV EW DW+ N ++ +P NP
Sbjct: 937 SSHYKIDYSIVGIYRAATVPADAFAPNALGIFNMSGNVAEWCWDWYANDYYQYSPLENPH 996
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTT 347
GP +V +GGS+ N +YC R + R N P + GFR P +
Sbjct: 997 GPAEDAYRVLRGGSWFYNTKYC---RSSFRFSNLPAYRFSDFGFRVVLFPPPVS 1047
>gi|78188021|ref|YP_378359.1| hypothetical protein Cag_0037 [Chlorobium chlorochromatii CaD3]
gi|78170220|gb|ABB27316.1| conserved hypothetical protein [Chlorobium chlorochromatii CaD3]
Length = 325
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 103/341 (30%), Positives = 154/341 (45%), Gaps = 87/341 (25%)
Query: 14 DMVLLPGDTFRMGTNKPILIKD-GEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ VL+ G F MG+ + +D E P R V + FY+ ++EV+ +F+ FV TGY T+
Sbjct: 59 NFVLIRGGEFTMGSPESESERDRDEMPHR-VKVGDFYIARYEVTTAEFRTFVQETGYRTD 117
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEK + VF L + + + K G ++ +HPV+ +SWNDAVAYC
Sbjct: 118 AEKTNPSLVFWSGLWPGKAGLNWRYGTNGKERSGAEN------NHPVILVSWNDAVAYCK 171
Query: 133 WRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
W RLPT AEWEY CR G +F +G L
Sbjct: 172 WLSKKHGMNFRLPTSAEWEYACRAGTST-VFNYGDNLS---------------------- 208
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
T+A ++ +P+ N+ P+G +R N
Sbjct: 209 -------------TTQANYDGN---------YPYSNH--PKGIYRKN------------- 231
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-----YNPKGPTTGTDK 301
T PV S+ N +GLYNM GNV EW +DW++ ++ + NP GP TG+++
Sbjct: 232 -----TVPVNSFTPNAWGLYNMHGNVAEWCSDWYSEPYYESSKANGTVTNPTGPATGSNR 286
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
V +GGS+ + +YC R A + +TP N+GFR +
Sbjct: 287 VMRGGSWYDDARYC---RSADINDSTPSYRYINVGFRVVLE 324
>gi|392408858|ref|YP_006445465.1| hypothetical protein Desti_0466 [Desulfomonile tiedjei DSM 6799]
gi|390621994|gb|AFM23201.1| hypothetical protein Desti_0466 [Desulfomonile tiedjei DSM 6799]
Length = 984
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 109/359 (30%), Positives = 153/359 (42%), Gaps = 111/359 (30%)
Query: 14 DMVLLPGDTFRMGTN--------------KPILIKDGEFPSRNVTLD-AFYLDQHEVSNT 58
+MV +P +F MG++ +PI +++ EFP V + FY+ EV+
Sbjct: 537 NMVRIPAGSFVMGSSETDIAWAMTTLAQGQPISLEN-EFPFHKVRMSRPFYISATEVTVA 595
Query: 59 QFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDS--------T 110
QF+ FV TGYVT+AE +E+ +I R +RFE + T
Sbjct: 596 QFKAFVDETGYVTDAE-------------DEKGGQIFNSR--TRRFEKKEGSSWKDPGWT 640
Query: 111 IEHRMHHPVVHISWNDAVAYCTWRGAR------LPTEAEWEYGCRGGLENRLFPWGSWLH 164
I PVV IS+NDA A+ W A+ LPTEA+WEY RGGL +FPWG L
Sbjct: 641 ITR--DQPVVMISYNDAQAFVEWLAAKEKLPYKLPTEAQWEYAARGGLPMSVFPWGDQL- 697
Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL 224
P+G R N+ A T+ EW
Sbjct: 698 PDG------QRANY------------------ADKSTDYEWR------------------ 715
Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
+ TA DG+ +PV SY+ N +GLY+M GN EW D++ +
Sbjct: 716 -----------------DRTADDGHKFVSPVGSYEPNGYGLYDMAGNALEWVRDYYGEDY 758
Query: 285 HP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+ +P +P+GP G + KGG + RCA R P+SS N GFR A +
Sbjct: 759 YRFSPEIDPEGPGHGEFRTMKGGEWTFG---AVNLRCAFRGWARPESSFQNSGFRVAIE 814
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 100/202 (49%), Gaps = 20/202 (9%)
Query: 177 NHPVVHVSWNDAVAYCTWRGAR------LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
+ PV VS DA A+C W R LPTEA+WEY RGG + FPWGN+ P G
Sbjct: 273 DLPVTLVSHADAEAFCEWLSKREGRPYTLPTEAQWEYAARGGKQGDRFPWGNDY-PDG-R 330
Query: 231 RANVWQ--GEFP-TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA 287
+ NV P + T DGY APV SY+ N F LY+M+GNVWE +D+++ + +
Sbjct: 331 KLNVADRNARVPWADRTVDDGYGEIAPVGSYEPNGFWLYDMMGNVWEACSDYYDPKLYES 390
Query: 288 PS----YNPKGPTTGTDKVKKGGSYLCNEQYCYR-HRCAARSQNTPDSSAGNLGFRCAAD 342
S ++P GP TG KV +GG++ + R A + D S GFR +A
Sbjct: 391 RSSQSVFDPTGPATGKTKVVRGGNWAFDAGIARNAFRFGADRELCTDMS----GFRVSAL 446
Query: 343 KGPTTGTDKVKKGGSYLCNEQY 364
+ K + L NE +
Sbjct: 447 ASAAELSQAKKTDENLLSNENF 468
>gi|168494286|ref|ZP_02718429.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC3059-06]
gi|172079528|ref|ZP_02708283.2| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC1873-00]
gi|418074650|ref|ZP_12711899.1| hypothetical protein SPAR19_1791 [Streptococcus pneumoniae GA11184]
gi|418079245|ref|ZP_12716467.1| hypothetical protein SPAR123_1693 [Streptococcus pneumoniae
4027-06]
gi|418081442|ref|ZP_12718652.1| hypothetical protein SPAR121_1740 [Streptococcus pneumoniae
6735-05]
gi|418090172|ref|ZP_12727325.1| hypothetical protein SPAR77_1762 [Streptococcus pneumoniae GA43265]
gi|418099137|ref|ZP_12736233.1| hypothetical protein SPAR122_1719 [Streptococcus pneumoniae
6901-05]
gi|418105924|ref|ZP_12742980.1| hypothetical protein SPAR85_1853 [Streptococcus pneumoniae GA44500]
gi|418115329|ref|ZP_12752315.1| hypothetical protein SPAR125_1731 [Streptococcus pneumoniae
5787-06]
gi|418117488|ref|ZP_12754457.1| hypothetical protein SPAR124_1704 [Streptococcus pneumoniae
6963-05]
gi|418135513|ref|ZP_12772367.1| hypothetical protein SPAR23_1789 [Streptococcus pneumoniae GA11426]
gi|418162832|ref|ZP_12799513.1| hypothetical protein SPAR49_1824 [Streptococcus pneumoniae GA17328]
gi|418174213|ref|ZP_12810824.1| hypothetical protein SPAR67_1819 [Streptococcus pneumoniae GA41277]
gi|418176579|ref|ZP_12813170.1| hypothetical protein SPAR71_1815 [Streptococcus pneumoniae GA41437]
gi|418217261|ref|ZP_12843940.1| hypothetical protein SPAR147_1733 [Streptococcus pneumoniae
Netherlands15B-37]
gi|418219515|ref|ZP_12846180.1| hypothetical protein SPAR145_1804 [Streptococcus pneumoniae NP127]
gi|418239313|ref|ZP_12865864.1| hypothetical protein SPAR146_1771 [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|419432192|ref|ZP_13972325.1| hypothetical protein SPAR140_1732 [Streptococcus pneumoniae
EU-NP05]
gi|419441000|ref|ZP_13981043.1| hypothetical protein SPAR64_1744 [Streptococcus pneumoniae GA40410]
gi|419462951|ref|ZP_14002854.1| hypothetical protein SPAR3_1848 [Streptococcus pneumoniae GA02714]
gi|419465293|ref|ZP_14005184.1| hypothetical protein SPAR4_1809 [Streptococcus pneumoniae GA04175]
gi|419469614|ref|ZP_14009481.1| hypothetical protein SPAR9_1688 [Streptococcus pneumoniae GA06083]
gi|419498180|ref|ZP_14037887.1| hypothetical protein SPAR99_1742 [Streptococcus pneumoniae GA47522]
gi|419526502|ref|ZP_14066060.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA14373]
gi|419535281|ref|ZP_14074780.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA17457]
gi|421281800|ref|ZP_15732596.1| non-specific serine/threonine protein kinase [Streptococcus
pneumoniae GA04672]
gi|421310213|ref|ZP_15760838.1| non-specific serine/threonine protein kinase [Streptococcus
pneumoniae GA62681]
gi|172043211|gb|EDT51257.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC1873-00]
gi|183575814|gb|EDT96342.1| sulfatase-modifying factor 1 [Streptococcus pneumoniae CDC3059-06]
gi|353746181|gb|EHD26843.1| hypothetical protein SPAR19_1791 [Streptococcus pneumoniae GA11184]
gi|353746772|gb|EHD27432.1| hypothetical protein SPAR123_1693 [Streptococcus pneumoniae
4027-06]
gi|353752181|gb|EHD32812.1| hypothetical protein SPAR121_1740 [Streptococcus pneumoniae
6735-05]
gi|353760945|gb|EHD41520.1| hypothetical protein SPAR77_1762 [Streptococcus pneumoniae GA43265]
gi|353768143|gb|EHD48668.1| hypothetical protein SPAR122_1719 [Streptococcus pneumoniae
6901-05]
gi|353776100|gb|EHD56579.1| hypothetical protein SPAR85_1853 [Streptococcus pneumoniae GA44500]
gi|353785413|gb|EHD65832.1| hypothetical protein SPAR125_1731 [Streptococcus pneumoniae
5787-06]
gi|353788169|gb|EHD68567.1| hypothetical protein SPAR124_1704 [Streptococcus pneumoniae
6963-05]
gi|353826894|gb|EHE07051.1| hypothetical protein SPAR49_1824 [Streptococcus pneumoniae GA17328]
gi|353837418|gb|EHE17502.1| hypothetical protein SPAR67_1819 [Streptococcus pneumoniae GA41277]
gi|353840650|gb|EHE20714.1| hypothetical protein SPAR71_1815 [Streptococcus pneumoniae GA41437]
gi|353870051|gb|EHE49927.1| hypothetical protein SPAR147_1733 [Streptococcus pneumoniae
Netherlands15B-37]
gi|353873875|gb|EHE53734.1| hypothetical protein SPAR145_1804 [Streptococcus pneumoniae NP127]
gi|353892304|gb|EHE72053.1| hypothetical protein SPAR146_1771 [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|353900846|gb|EHE76396.1| hypothetical protein SPAR23_1789 [Streptococcus pneumoniae GA11426]
gi|379530383|gb|EHY95623.1| hypothetical protein SPAR3_1848 [Streptococcus pneumoniae GA02714]
gi|379536893|gb|EHZ02079.1| hypothetical protein SPAR4_1809 [Streptococcus pneumoniae GA04175]
gi|379543868|gb|EHZ09016.1| hypothetical protein SPAR9_1688 [Streptococcus pneumoniae GA06083]
gi|379557142|gb|EHZ22189.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA14373]
gi|379563442|gb|EHZ28446.1| formylglycine-generating sulfatase enzyme family protein
[Streptococcus pneumoniae GA17457]
gi|379577237|gb|EHZ42159.1| hypothetical protein SPAR64_1744 [Streptococcus pneumoniae GA40410]
gi|379599013|gb|EHZ63798.1| hypothetical protein SPAR99_1742 [Streptococcus pneumoniae GA47522]
gi|379629273|gb|EHZ93874.1| hypothetical protein SPAR140_1732 [Streptococcus pneumoniae
EU-NP05]
gi|395879967|gb|EJG91022.1| non-specific serine/threonine protein kinase [Streptococcus
pneumoniae GA04672]
gi|395909828|gb|EJH20703.1| non-specific serine/threonine protein kinase [Streptococcus
pneumoniae GA62681]
Length = 148
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 66/150 (44%), Positives = 88/150 (58%), Gaps = 10/150 (6%)
Query: 199 LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSY 258
LPTEA+WEY RGG ++ + WG+ L G + AN WQG FP NTA DG++ TAPV +
Sbjct: 3 LPTEAQWEYAARGGRQSE-YLWGDTLLEGGYYHANTWQGRFPYENTALDGFIGTAPVYEF 61
Query: 259 KENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN------PKGPTTGTDKVKKGGSYLCNE 312
N FGLY M+GNVWEW N + S+N PK + +GGS+LC+
Sbjct: 62 LPNDFGLYQMIGNVWEWCR---NPRYTLLASFNEDDYELPKYGIQDEEYAIRGGSFLCHC 118
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
YC R+R AAR+ S++ +LGFRC +
Sbjct: 119 SYCNRYRVAARNGCISTSTSSHLGFRCLKE 148
>gi|407937989|ref|YP_006853630.1| hypothetical protein C380_06410 [Acidovorax sp. KKS102]
gi|407895783|gb|AFU44992.1| hypothetical protein C380_06410 [Acidovorax sp. KKS102]
Length = 335
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 103/350 (29%), Positives = 142/350 (40%), Gaps = 84/350 (24%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFP-----------------SRNVTLDAFYLDQHEVSN 57
V +P F MG P+ + FP +T D FYL QHEV+
Sbjct: 34 FVRVPAGAFWMGGVPPVETLEKNFPLLERKRLTDLADESPAHEVQITRD-FYLGQHEVTV 92
Query: 58 TQFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH 116
QFQ F+ A+GY E+E G + + P + R+ +
Sbjct: 93 GQFQRFLEASGYRPESEADGTGGYGYNPAYDPATTRRGDAFEGRDPRYSWRNPGFAQGPD 152
Query: 117 HPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDS 170
HPVV+++WNDA A W RLPTEAEWEY CR G + L+P G
Sbjct: 153 HPVVNVTWNDAQAMAQWLSRTEGHRYRLPTEAEWEYACRAGSRS--------LYPHG--- 201
Query: 171 TIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
++P VS A T+ A P + W G
Sbjct: 202 ------DNPQELVS-----AGNTFDQAAAPQWSRWRQHALAG------------------ 232
Query: 231 RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW-WNVHHHPAPS 289
DGY TAPV + N FGLY+M+GN WEWT DW + ++ +P
Sbjct: 233 ---------------NDGYAFTAPVGRFAPNAFGLYDMLGNAWEWTGDWHGDTYYAESPR 277
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+P GP TG+ +V++GGS+ Y RC R+ N+P + +G R
Sbjct: 278 NDPTGPATGSVRVRRGGSWHTWALYA---RCGYRNWNSPQTRYTLVGMRL 324
>gi|338175245|ref|YP_004652055.1| serine/threonine-protein kinase pkn1 [Parachlamydia acanthamoebae
UV-7]
gi|336479603|emb|CCB86201.1| serine/threonine-protein kinase pkn1 [Parachlamydia acanthamoebae
UV-7]
Length = 667
Score = 125 bits (313), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 93/169 (55%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A+AY W G RLPTEAEWE GG++N LFP G+++ + +AN +
Sbjct: 516 KHPVVGVTWYGALAYAKWVGKRLPTEAEWEVAAYGGVDNYLFPTGDDIE---KTQANFFS 572
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
+ T VMSY N +GLY+M GNV+EW DW+ +++ NPK
Sbjct: 573 AD-------------TTGVMSYAPNGYGLYDMAGNVYEWCHDWYGYNYYEISVQEPENPK 619
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RC+ R +N P + G GFRCAAD
Sbjct: 620 GPLQGVYRVLRGG---CWKSLKEDLRCSRRHRNNPGTVNGTYGFRCAAD 665
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 55/148 (37%), Positives = 72/148 (48%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMV +PG F G+ E P VT+D+F +D H V+N QF F+ A G
Sbjct: 432 DMVGIPGGVFARGSASG---NRDEMPRHQVTVDSFAIDIHPVTNEQFTRFLEAMG----G 484
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH-HPVVHISWNDAVAYCT 132
EK + I ++R + G IE + HPVV ++W A+AY
Sbjct: 485 EK------------DNNHHDIIRLRDSRIKRSGGKVHIEFGYNKHPVVGVTWYGALAYAK 532
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLPTEAEWE GG++N LFP G
Sbjct: 533 WVGKRLPTEAEWEVAAYGGVDNYLFPTG 560
>gi|282890625|ref|ZP_06299148.1| hypothetical protein pah_c022o234 [Parachlamydia acanthamoebae str.
Hall's coccus]
gi|281499622|gb|EFB41918.1| hypothetical protein pah_c022o234 [Parachlamydia acanthamoebae str.
Hall's coccus]
Length = 655
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 93/169 (55%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A+AY W G RLPTEAEWE GG++N LFP G+++ + +AN +
Sbjct: 504 KHPVVGVTWYGALAYAKWVGKRLPTEAEWEVAAYGGVDNYLFPTGDDIE---KTQANFFS 560
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
+ T VMSY N +GLY+M GNV+EW DW+ +++ NPK
Sbjct: 561 AD-------------TTGVMSYAPNGYGLYDMAGNVYEWCHDWYGYNYYEISVQEPENPK 607
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RC+ R +N P + G GFRCAAD
Sbjct: 608 GPLQGVYRVLRGG---CWKSLKEDLRCSRRHRNNPGTVNGTYGFRCAAD 653
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 55/148 (37%), Positives = 72/148 (48%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
DMV +PG F G+ E P VT+D+F +D H V+N QF F+ A G
Sbjct: 420 DMVGIPGGVFARGSASG---NRDEMPRHQVTVDSFAIDIHPVTNEQFTRFLEAMG----G 472
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMH-HPVVHISWNDAVAYCT 132
EK + I ++R + G IE + HPVV ++W A+AY
Sbjct: 473 EK------------DNNHHDIIRLRDSRIKRSGGKVHIEFGYNKHPVVGVTWYGALAYAK 520
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLPTEAEWE GG++N LFP G
Sbjct: 521 WVGKRLPTEAEWEVAAYGGVDNYLFPTG 548
>gi|293608913|ref|ZP_06691216.1| conserved hypothetical protein [Acinetobacter sp. SH024]
gi|292829486|gb|EFF87848.1| conserved hypothetical protein [Acinetobacter sp. SH024]
Length = 334
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 143/348 (41%), Gaps = 81/348 (23%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFP--------------SRNVTLDAFYLDQHE 54
+E+ K+ LP + T +LI DG+F + + F++DQ E
Sbjct: 44 IEKCKNYTGLPQGWLKQPTAGMVLIVDGDFNFGSEKAYPDELNFGKKQREVKGFWIDQTE 103
Query: 55 VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
V+ QF FV ATGY +++ E+ K + V
Sbjct: 104 VTVAQFASFVKATGY----------------ITDAEKQKQAAV-------------FSPD 134
Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH 174
HHP W + TW+ G G + N
Sbjct: 135 PHHPQ---QWWQLKSGYTWKTPN---------GGTGAIANP------------------- 163
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RAN 233
N PV +VS NDA Y W G LPTE EWEY + + P T +H +AN
Sbjct: 164 --NEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PLHQAPTDEHQHPQAN 220
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP 292
WQGEFP N D + APV + N F L++M+GNVWEWT+ + H +Y+
Sbjct: 221 YWQGEFPFQNLNQDHFKDVAPVGCFTPNNFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSD 280
Query: 293 --KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ T V KGGS+LC + YC R+R ++R D +A ++GFR
Sbjct: 281 LRQQQIASTQYVIKGGSFLCAQNYCSRYRSSSRYPQDFDLAATHVGFR 328
>gi|194337392|ref|YP_002019186.1| hypothetical protein Ppha_2378 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194309869|gb|ACF44569.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 433
Score = 124 bits (310), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 102/339 (30%), Positives = 148/339 (43%), Gaps = 89/339 (26%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+ VL+ G F MG+ + + + + V ++ FY+ ++EV+ +F+ F++ +GY T
Sbjct: 165 NFVLIRGSEFTMGSPETEVGHESDETQHQVKVNDFYMSKYEVTVAEFRRFIAESGYRT-- 222
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ E+A S V WN + A T
Sbjct: 223 --------------DAEKAGDSYV--------------------------WNGSEAVLT- 241
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
W +G G + + NHPVVHVSWNDAV YC
Sbjct: 242 ------KGVNWRHGVSGKVRTK------------------SEENHPVVHVSWNDAVEYCR 277
Query: 194 WRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
W RLPTEAEWEY CR G F G N+T +AN + G +P N
Sbjct: 278 WLSEKTGKWYRLPTEAEWEYACRAG-SRTAFNTGGNITT---AQAN-YDGNYPYNKNRKG 332
Query: 248 GYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPT--TGT 299
Y + T V S++ N +GLYNM GNVWEW D ++ ++ NP GP TG+
Sbjct: 333 QYRARTVAVESFEPNAWGLYNMHGNVWEWCDDVYSGTYYDECKAKGLVENPGGPAPETGS 392
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
++V +GGS+ + +YC R A+R TP + +GFR
Sbjct: 393 NRVLRGGSWNYDAEYC---RSASRRGYTPGGRSNFVGFR 428
>gi|194335022|ref|YP_002016882.1| hypothetical protein Paes_2233 [Prosthecochloris aestuarii DSM 271]
gi|194312840|gb|ACF47235.1| protein of unknown function DUF323 [Prosthecochloris aestuarii DSM
271]
Length = 398
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 78/175 (44%), Positives = 97/175 (55%), Gaps = 20/175 (11%)
Query: 178 HPVVHVSWNDAVAYCTW----RGA--RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
+PVV+VSWNDAVAYC W RG RLPTEAEWEY CRGG F G NLT
Sbjct: 225 YPVVNVSWNDAVAYCRWLSEKRGELFRLPTEAEWEYACRGGTTTP-FSTGENLT---TDE 280
Query: 232 ANVWQGEFPTNNTAADGYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----- 285
AN + G +P N + Y +T PV S++ N +GLYNM GNVWEW DW+ ++
Sbjct: 281 AN-YDGNYPYRNNSKGKYREATVPVDSFEPNSYGLYNMHGNVWEWCGDWYGEKYYEECRK 339
Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
NP+GP G+ +V +GG + +YC R A RS P N+GFR
Sbjct: 340 KGVVENPQGPKEGSRRVLRGGGWTYYARYC---RSADRSSGGPGYRGNNVGFRLV 391
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 64/154 (41%), Gaps = 46/154 (29%)
Query: 7 PPVERYK---DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
PP R + + VL+ G F MG+ + + + V + F L ++ V+ ++ EF
Sbjct: 149 PPASRPQLPENFVLIKGGQFSMGSPEDEHGHESDETLHEVKVSDFALCRYAVTVGEYLEF 208
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
+A GD + PVV++S
Sbjct: 209 TEEAKINYDAGTEGDRY-------------------------------------PVVNVS 231
Query: 124 WNDAVAYCTW----RGA--RLPTEAEWEYGCRGG 151
WNDAVAYC W RG RLPTEAEWEY CRGG
Sbjct: 232 WNDAVAYCRWLSEKRGELFRLPTEAEWEYACRGG 265
>gi|443471534|ref|ZP_21061596.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1) [Pseudomonas pseudoalcaligenes
KF707]
gi|442901605|gb|ELS27425.1| Sulfatase modifying factor 1 precursor (C-alpha-formyglycine-
generating enzyme 1) [Pseudomonas pseudoalcaligenes
KF707]
Length = 333
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 90/217 (41%), Positives = 108/217 (49%), Gaps = 20/217 (9%)
Query: 140 TEAEWEYGC---RGGLEN----RLFPW------GSWLHPEGIDSTIEHRMNHPVVHVSWN 186
TEAE + G R LEN PW W HPEG D+ HPV V+
Sbjct: 106 TEAERQGGAALFRAPLENAADGSALPWWRYVEGADWRHPEGPDAAPAALNAHPVTLVTLA 165
Query: 187 DAVAYCTWRGARLPTEAEWEYGCR-GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
DA AY WRG LP+EAEWEY R GG RL L G AN WQG FP +
Sbjct: 166 DAQAYARWRGNELPSEAEWEYAARAGGDSARLGE--APLDAEGRPTANYWQGVFPVQDAG 223
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD--WWNV--HHHPAPSYNPKGPTTGTDK 301
DG+ AP + N GLY+M+GNVWEWTAD W + H + P P +
Sbjct: 224 EDGFAGLAPSGCFAANGNGLYDMIGNVWEWTADAQWGRLISHANGDPGQLRAVPGGSRPQ 283
Query: 302 VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
V KGGS+LC YC R+R AAR + D + ++GFR
Sbjct: 284 VIKGGSWLCASTYCARYRAAARERQEADLATSHVGFR 320
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 74/142 (52%), Gaps = 8/142 (5%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
M L+ G F GT E P V + F++D+ EV+ QF FV+ATGYVTEAE
Sbjct: 53 MALIRGGRFVPGTEGGY---PDERPLGAVEVGDFWIDRTEVTRAQFAAFVAATGYVTEAE 109
Query: 75 KFGDTFVFE-PLLSEEERAKISQVRH----DMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
+ G +F PL + + + + R+ D + EG D+ HPV ++ DA A
Sbjct: 110 RQGGAALFRAPLENAADGSALPWWRYVEGADWRHPEGPDAAPAALNAHPVTLVTLADAQA 169
Query: 130 YCTWRGARLPTEAEWEYGCRGG 151
Y WRG LP+EAEWEY R G
Sbjct: 170 YARWRGNELPSEAEWEYAARAG 191
>gi|417546710|ref|ZP_12197796.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC032]
gi|421666877|ref|ZP_16106959.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC087]
gi|421669835|ref|ZP_16109848.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC099]
gi|400384598|gb|EJP43276.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC032]
gi|410386349|gb|EKP38820.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC087]
gi|410387304|gb|EKP39760.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC099]
Length = 335
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYI---- 120
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
S+ E+ K + V HHP W Y TW+
Sbjct: 121 ------------SDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|421625272|ref|ZP_16066125.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC098]
gi|408698941|gb|EKL44426.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC098]
Length = 335
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 101/330 (30%), Positives = 137/330 (41%), Gaps = 72/330 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA- 73
MV +P F G+ K + F + + F++DQ EV+ QF FV AT Y+T+A
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATCYITDAE 124
Query: 74 -EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
+K F +P HHP W Y T
Sbjct: 125 KQKQAAVFSPDP-------------------------------HHP--QQWWQLKTGY-T 150
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYC 192
WR G G L N + PV +V+ NDA Y
Sbjct: 151 WRTPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYA 180
Query: 193 TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLS 251
W G LPTE EWEY + P T +H +AN WQG FP N D +
Sbjct: 181 VWLGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTG 239
Query: 252 TAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSY 308
APV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+
Sbjct: 240 IAPVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSF 299
Query: 309 LCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
LC E YC R+R ++R D +A ++GFR
Sbjct: 300 LCAENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|424060770|ref|ZP_17798261.1| hypothetical protein W9K_01884 [Acinetobacter baumannii Ab33333]
gi|404668722|gb|EKB36631.1| hypothetical protein W9K_01884 [Acinetobacter baumannii Ab33333]
Length = 335
Score = 123 bits (309), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 100/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYI---- 120
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
S+ E+ K + V HHP W Y TW+
Sbjct: 121 ------------SDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|445400935|ref|ZP_21430236.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-57]
gi|444783062|gb|ELX06924.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-57]
Length = 335
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNSFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|289749219|ref|ZP_06508597.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
gi|289689806|gb|EFD57235.1| conserved hypothetical protein [Mycobacterium tuberculosis T92]
Length = 188
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 108/230 (46%), Gaps = 45/230 (19%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
++V LPG +FRMG+ + E P VT+ AF +++H V+N QF EFVSATGYVT A
Sbjct: 4 ELVDLPGGSFRMGSTR---FYPEEAPIHTVTVRAFAVERHPVTNAQFAEFVSATGYVTVA 60
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
E +PL D + G+D+ +V V W
Sbjct: 61 E--------QPL--------------DPGLYPGVDAA--DLCPGAMVFCPTAGPVDLRDW 96
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
R W++ P W HP G DS I R HPVV V++ DAVAY
Sbjct: 97 R-------QWWDW----------VPGACWRHPFGRDSDIADRAGHPVVQVAYPDAVAYAR 139
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNN 243
W G RLPTEAEWEY RGG + WG+ P G AN WQG FP N
Sbjct: 140 WAGRRLPTEAEWEYAARGG-TTATYAWGDQEKPGGMLMANTWQGRFPYRN 188
>gi|351730981|ref|ZP_08948672.1| hypothetical protein AradN_14464 [Acidovorax radicis N35]
Length = 341
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 100/330 (30%), Positives = 142/330 (43%), Gaps = 89/330 (26%)
Query: 37 EFPSRNVTL-DAFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEE 90
E P+ V + AFYL +HEV+ QFQ F+ A+GY E+E +G ++P +
Sbjct: 70 EGPAHTVHIRQAFYLGRHEVTVGQFQRFIEASGYQPESEADGTGGYGYNAQYDPATTRRG 129
Query: 91 RAKISQVRHDMKRFEGLDSTIEHR-------MHHPVVHISWNDAVAYCTWRGA------R 137
A FEG D R HPVV+I+WNDA A W R
Sbjct: 130 DA-----------FEGRDPRYSWRNPGFAQGPDHPVVNITWNDAQALARWLSRTEGHHYR 178
Query: 138 LPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
LPTEAEWEY CR G R +P G P+G+ + + + D A W
Sbjct: 179 LPTEAEWEYACRAGTRTR-YPHGD--DPQGL-----------LKNANTFDQEAAPLW--- 221
Query: 198 RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMS 257
W+ G + F TAPV S
Sbjct: 222 -----PRWQQHALAGSDGHAF---------------------------------TAPVGS 243
Query: 258 YKENKFGLYNMVGNVWEWTADW-WNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCY 316
Y N FGL++M+GN WEW +DW + ++ +P +PKGP GT +V++GGS+ + +
Sbjct: 244 YAPNAFGLHDMLGNAWEWVSDWHGDTYYAHSPRSDPKGPADGTVRVRRGGSW---HTWSF 300
Query: 317 RHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
RC R+ N+P++ +G R + PT
Sbjct: 301 YARCGYRNWNSPETRYTLVGMRLVREIAPT 330
>gi|119356730|ref|YP_911374.1| hypothetical protein Cpha266_0902 [Chlorobium phaeobacteroides DSM
266]
gi|119354079|gb|ABL64950.1| protein of unknown function DUF323 [Chlorobium phaeobacteroides DSM
266]
Length = 586
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 104/342 (30%), Positives = 145/342 (42%), Gaps = 87/342 (25%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R + VL+ G F MG+ + + V + FYL ++ V+ +F++F+ +GY
Sbjct: 317 RPGNFVLIRGGEFTMGSPANESGHESDETQHQVKVSDFYLCKYAVTLAEFKKFIEDSGYQ 376
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
T+AEK G ++ SW D +++
Sbjct: 377 TDAEKDGGSY------------------------------------------SW-DGISW 393
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVA 190
G +W YG G + + NHPV+HVSWNDAVA
Sbjct: 394 VKNAGV------DWRYGVSGSVRP------------------QSEENHPVLHVSWNDAVA 429
Query: 191 YCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP-TNN 243
YC W RLPTEAEWEY CR G F G+NLT +AN + G +P TNN
Sbjct: 430 YCKWISKKTGDAFRLPTEAEWEYACRAGTTTP-FHTGDNLT---TGQAN-YNGNYPYTNN 484
Query: 244 TAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTG 298
+T V + N +GLY+M GNVWEW D + ++ NP GP TG
Sbjct: 485 QKGVYRENTVKVDEFAPNAWGLYHMHGNVWEWCGDRYGDKYYDECKAEGVVENPVGPETG 544
Query: 299 TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+ +V +GG + N + C R A R PD + GFR A
Sbjct: 545 SLRVLRGGGWSFNARSC---RSAFRIDVAPDYRSNYAGFRLA 583
>gi|421656586|ref|ZP_16096891.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-72]
gi|408504913|gb|EKK06643.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-72]
Length = 335
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGVA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQKMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|184157258|ref|YP_001845597.1| hypothetical protein ACICU_00938 [Acinetobacter baumannii ACICU]
gi|332872886|ref|ZP_08440850.1| hypothetical protein HMPREF0022_00449 [Acinetobacter baumannii
6014059]
gi|384142326|ref|YP_005525036.1| hypothetical protein [Acinetobacter baumannii MDR-ZJ06]
gi|385236624|ref|YP_005797963.1| hypothetical protein ABTW07_1067 [Acinetobacter baumannii
TCDC-AB0715]
gi|387124838|ref|YP_006290720.1| hypothetical protein ABTJ_02834 [Acinetobacter baumannii MDR-TJ]
gi|407931955|ref|YP_006847598.1| hypothetical protein M3Q_1275 [Acinetobacter baumannii TYTH-1]
gi|416148149|ref|ZP_11602198.1| hypothetical protein AB210_2246 [Acinetobacter baumannii AB210]
gi|417551254|ref|ZP_12202332.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-18]
gi|417566146|ref|ZP_12217020.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC143]
gi|417570359|ref|ZP_12221216.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC189]
gi|417576078|ref|ZP_12226923.1| formylglycine-generating sulfatase enzyme [Acinetobacter baumannii
Naval-17]
gi|417869267|ref|ZP_12514259.1| hypothetical protein ABNIH1_03787 [Acinetobacter baumannii ABNIH1]
gi|417872655|ref|ZP_12517550.1| hypothetical protein ABNIH2_01612 [Acinetobacter baumannii ABNIH2]
gi|417877076|ref|ZP_12521811.1| hypothetical protein ABNIH3_03897 [Acinetobacter baumannii ABNIH3]
gi|417880716|ref|ZP_12525185.1| hypothetical protein ABNIH4_00465 [Acinetobacter baumannii ABNIH4]
gi|421202377|ref|ZP_15659528.1| sulfatase-modifying factor 1 [Acinetobacter baumannii AC12]
gi|421535518|ref|ZP_15981777.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii AC30]
gi|421631030|ref|ZP_16071719.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC180]
gi|421651446|ref|ZP_16091815.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC0162]
gi|421674948|ref|ZP_16114874.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC065]
gi|421689319|ref|ZP_16129003.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-143]
gi|421690825|ref|ZP_16130491.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-116]
gi|421702779|ref|ZP_16142255.1| hypothetical protein B825_05946 [Acinetobacter baumannii ZWS1122]
gi|421706529|ref|ZP_16145942.1| hypothetical protein B837_05656 [Acinetobacter baumannii ZWS1219]
gi|421792548|ref|ZP_16228701.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-2]
gi|424053300|ref|ZP_17790832.1| hypothetical protein W9G_01989 [Acinetobacter baumannii Ab11111]
gi|424063038|ref|ZP_17800523.1| hypothetical protein W9M_00321 [Acinetobacter baumannii Ab44444]
gi|425747841|ref|ZP_18865839.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-348]
gi|425752687|ref|ZP_18870594.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-113]
gi|445456304|ref|ZP_21445750.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC047]
gi|445466288|ref|ZP_21450267.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC338]
gi|445475293|ref|ZP_21453295.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-78]
gi|183208852|gb|ACC56250.1| uncharacterized conserved protein [Acinetobacter baumannii ACICU]
gi|323517122|gb|ADX91503.1| hypothetical protein ABTW07_1067 [Acinetobacter baumannii
TCDC-AB0715]
gi|332738897|gb|EGJ69760.1| hypothetical protein HMPREF0022_00449 [Acinetobacter baumannii
6014059]
gi|333365156|gb|EGK47170.1| hypothetical protein AB210_2246 [Acinetobacter baumannii AB210]
gi|342231038|gb|EGT95857.1| hypothetical protein ABNIH1_03787 [Acinetobacter baumannii ABNIH1]
gi|342233291|gb|EGT98030.1| hypothetical protein ABNIH2_01612 [Acinetobacter baumannii ABNIH2]
gi|342236436|gb|EGU00958.1| hypothetical protein ABNIH3_03897 [Acinetobacter baumannii ABNIH3]
gi|342239552|gb|EGU03951.1| hypothetical protein ABNIH4_00465 [Acinetobacter baumannii ABNIH4]
gi|347592819|gb|AEP05540.1| conserve hypothetical protein [Acinetobacter baumannii MDR-ZJ06]
gi|385879330|gb|AFI96425.1| hypothetical protein ABTJ_02834 [Acinetobacter baumannii MDR-TJ]
gi|395550807|gb|EJG16816.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC189]
gi|395557902|gb|EJG23903.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC143]
gi|395569299|gb|EJG29961.1| formylglycine-generating sulfatase enzyme [Acinetobacter baumannii
Naval-17]
gi|398328332|gb|EJN44459.1| sulfatase-modifying factor 1 [Acinetobacter baumannii AC12]
gi|400385709|gb|EJP48784.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-18]
gi|404558699|gb|EKA63980.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-143]
gi|404563722|gb|EKA68922.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii IS-116]
gi|404669088|gb|EKB36995.1| hypothetical protein W9G_01989 [Acinetobacter baumannii Ab11111]
gi|404675040|gb|EKB42765.1| hypothetical protein W9M_00321 [Acinetobacter baumannii Ab44444]
gi|407193594|gb|EKE64750.1| hypothetical protein B825_05946 [Acinetobacter baumannii ZWS1122]
gi|407193878|gb|EKE65027.1| hypothetical protein B837_05656 [Acinetobacter baumannii ZWS1219]
gi|407900536|gb|AFU37367.1| hypothetical protein M3Q_1275 [Acinetobacter baumannii TYTH-1]
gi|408508056|gb|EKK09743.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC0162]
gi|408695196|gb|EKL40752.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC180]
gi|409986360|gb|EKO42554.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii AC30]
gi|410382963|gb|EKP35497.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC065]
gi|410400128|gb|EKP52308.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-2]
gi|425492880|gb|EKU59132.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-348]
gi|425498918|gb|EKU64984.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-113]
gi|444778099|gb|ELX02118.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii OIFC338]
gi|444778250|gb|ELX02268.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC047]
gi|444778957|gb|ELX02952.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii Naval-78]
Length = 335
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|421694783|ref|ZP_16134400.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-692]
gi|404567018|gb|EKA72146.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-692]
Length = 335
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|239502993|ref|ZP_04662303.1| hypothetical protein AbauAB_11844 [Acinetobacter baumannii AB900]
gi|421679867|ref|ZP_16119735.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC111]
gi|410390686|gb|EKP43069.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC111]
Length = 335
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|260555909|ref|ZP_05828129.1| sulfatase-modifying factor 1 [Acinetobacter baumannii ATCC 19606 =
CIP 70.34]
gi|260410820|gb|EEX04118.1| sulfatase-modifying factor 1 [Acinetobacter baumannii ATCC 19606 =
CIP 70.34]
gi|452954321|gb|EME59725.1| hypothetical protein G347_03445 [Acinetobacter baumannii MSP4-16]
Length = 335
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|384130923|ref|YP_005513535.1| hypothetical protein [Acinetobacter baumannii 1656-2]
gi|322507143|gb|ADX02597.1| Putative uncharacterized protein [Acinetobacter baumannii 1656-2]
Length = 322
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 99/328 (30%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY
Sbjct: 53 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 106
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 107 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 139
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 140 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 169
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 170 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 228
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 229 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 288
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 289 AENYCSRYRNSSRYPQDFDLAATHVGFR 316
>gi|46446714|ref|YP_008079.1| hypothetical protein pc1080 [Candidatus Protochlamydia amoebophila
UWE25]
gi|46400355|emb|CAF23804.1| conserved hypothetical protein [Candidatus Protochlamydia
amoebophila UWE25]
Length = 655
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 91/169 (53%), Gaps = 24/169 (14%)
Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
HPVV V+W A+AY W G RLPTEAEWE RGG EN L+P G + + +AN +
Sbjct: 503 HPVVGVTWYGAIAYAKWIGKRLPTEAEWEIAARGGQENVLYPTGEEIE---KTQANFFSS 559
Query: 238 EFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----PSYNPK 293
+ T VMSY N GLY+M GNV+EW DW+ +++ P + P+
Sbjct: 560 D-------------TTTVMSYAPNNCGLYDMAGNVYEWCHDWYGYNYYELSMQEPDF-PQ 605
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RC+ R +N P + G GFRCAAD
Sbjct: 606 GPLQGVYRVLRGG---CWKSLKEDLRCSRRHRNNPGTVNGTYGFRCAAD 651
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 73/151 (48%), Gaps = 26/151 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDG---EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
DMV++ G F G+ +DG E P +TL +F LD H V+N QF F+ G
Sbjct: 418 DMVIIEGGNFYRGS------QDGNRDEMPRHQITLSSFALDVHPVTNEQFVRFLEVMG-- 469
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHD-MKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
EK + I ++R +KR G S HPVV ++W A+A
Sbjct: 470 --GEK------------DSNHNDIIRLRDSRIKRSGGKLSIESGYAKHPVVGVTWYGAIA 515
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
Y W G RLPTEAEWE RGG EN L+P G
Sbjct: 516 YAKWIGKRLPTEAEWEIAARGGQENVLYPTG 546
>gi|302039207|ref|YP_003799529.1| hypothetical protein NIDE3934 [Candidatus Nitrospira defluvii]
gi|300607271|emb|CBK43604.1| conserved exported protein of unknown function, Sulfatase-modifying
factor-like [Candidatus Nitrospira defluvii]
Length = 320
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 71/179 (39%), Positives = 98/179 (54%), Gaps = 13/179 (7%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
+HP++HV+W DA AYC W G RLPTEAEWE R G++ RLFPWG+ G RAN +
Sbjct: 140 HHPIMHVNWYDADAYCKWAGKRLPTEAEWEKAAR-GVDGRLFPWGSEYA--GPTRANFGR 196
Query: 237 -GEFPTNNTAADGYLSTAPVMSYKE-----NKFGLYNMVGNVWEWTADWWNV-HHHPAPS 289
G + L P++S + + +GLY +GNV EW +DW++ ++ AP
Sbjct: 197 TGLSGPVRDRPERLLLYPPIISVDKYENALSPYGLYQTIGNVAEWVSDWYDQDYYKTAPG 256
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTG 348
NPKGP TGT K +GG ++ + R A R+ P + +GFRCA D G
Sbjct: 257 RNPKGPETGTQKAFRGGGWMDSTTTM---RAAMRNGTDPKTKINWMGFRCAQDAKEDAG 312
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 57/151 (37%), Positives = 80/151 (52%), Gaps = 31/151 (20%)
Query: 13 KDMVLLPGDTFRMGTNKPI--LIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
++MVL+P F MG++K L E P R+V LDAF + ++EV+ ++ +FV AT
Sbjct: 63 EEMVLIPAGEFLMGSDKKTDRLAYRSEIPQRSVYLDAFMIGKYEVTALEYLKFVLAT--- 119
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
+R R+D F+ + HHP++H++W DA AY
Sbjct: 120 -------------------DRLPQLDWRYDGGNFQ------DTMAHHPIMHVNWYDADAY 154
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
C W G RLPTEAEWE R G++ RLFPWGS
Sbjct: 155 CKWAGKRLPTEAEWEKAAR-GVDGRLFPWGS 184
>gi|421786383|ref|ZP_16222786.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-82]
gi|445454532|ref|ZP_21445454.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-A-92]
gi|410412861|gb|EKP64709.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii Naval-82]
gi|444752530|gb|ELW77214.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-A-92]
Length = 335
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/328 (29%), Positives = 137/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGY----- 119
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ YC R+R ++R D +A ++GFR
Sbjct: 302 AKNYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|421661394|ref|ZP_16101570.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC110]
gi|408715806|gb|EKL60928.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC110]
Length = 335
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/328 (29%), Positives = 136/328 (41%), Gaps = 68/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ E + QF FV ATGY
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEATVAQFASFVKATGY----- 119
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+++ E+ K + V HHP W Y TW+
Sbjct: 120 -----------ITDAEKQKQAAV-------------FSPDPHHP--QQWWQLKTGY-TWK 152
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G G L N + PV +V+ NDA Y W
Sbjct: 153 TPN---------GKNGALPNP---------------------HEPVRYVTKNDAEHYAVW 182
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTA 253
G LPTE EWEY + P T +H +AN WQG FP N D + A
Sbjct: 183 LGRDLPTELEWEYAAKANSPTDT-PLHQAPTDEHQHPQANYWQGAFPFENLNQDHFTGIA 241
Query: 254 PVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLC 310
PV +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 242 PVGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLC 301
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 302 AENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|262279922|ref|ZP_06057707.1| conserved hypothetical protein [Acinetobacter calcoaceticus
RUH2202]
gi|262260273|gb|EEY79006.1| conserved hypothetical protein [Acinetobacter calcoaceticus
RUH2202]
Length = 334
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/182 (38%), Positives = 97/182 (53%), Gaps = 6/182 (3%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W P G + + + N PV +VS NDA Y W G LPTE EWEY + + P
Sbjct: 149 TWKTPNGDNGAVANP-NEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKANSQTDT-PL 206
Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
T +H +AN WQGEFP N D + APV + N F L++M+GNVWEWT+
Sbjct: 207 HQAPTDEHQHPQANYWQGEFPFQNLNQDHFKDVAPVGCFASNHFKLFDMIGNVWEWTSSP 266
Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+ H +Y+ + T T V KGGS+LC + YC R+R ++R D +A ++G
Sbjct: 267 YQGAHDQHMGNYSDLRQQQITSTQYVIKGGSFLCAQNYCSRYRSSSRYPQDFDLAATHVG 326
Query: 337 FR 338
FR
Sbjct: 327 FR 328
>gi|299771125|ref|YP_003733151.1| hypothetical protein AOLE_14455 [Acinetobacter oleivorans DR1]
gi|298701213|gb|ADI91778.1| hypothetical protein AOLE_14455 [Acinetobacter oleivorans DR1]
Length = 334
Score = 120 bits (300), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 98/348 (28%), Positives = 141/348 (40%), Gaps = 81/348 (23%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFP--------------SRNVTLDAFYLDQHE 54
+E+ K+ LP + T +LI DG F + + F++DQ E
Sbjct: 44 IEKCKNYTGLPQGWLKQPTAGMVLITDGHFNFGSEKAYPDELNFGKKQREVKGFWIDQTE 103
Query: 55 VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
V+ QF F+ ATGY +++ E+ K + V
Sbjct: 104 VTVAQFASFIKATGY----------------ITDAEKQKQAAV-------------FSPD 134
Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH 174
HP W + TW+ G +G L N
Sbjct: 135 PQHPQ---QWWQLKSGYTWKTPN---------GSKGALANP------------------- 163
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RAN 233
N PV +VS NDA Y W G LPTE EWEY + + P T +H ++N
Sbjct: 164 --NEPVRYVSKNDAEHYAVWLGHDLPTELEWEYAAKANSKTDT-PLHQAPTDEHQHPQSN 220
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP 292
WQGEFP N D + APV + N F L++ +GNVWEWT+ + H +Y+
Sbjct: 221 YWQGEFPFQNLNQDHFKDVAPVGCFAANNFKLFDTIGNVWEWTSSPYQGAHDQHMGNYSD 280
Query: 293 --KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ T V KGGS+LC + YC R+R ++R D +A ++GFR
Sbjct: 281 LRQQQIASTQYVIKGGSFLCAQNYCSRYRNSSRYPQDFDLAATHVGFR 328
>gi|375133874|ref|YP_004994524.1| protein kinase [Acinetobacter calcoaceticus PHEA-2]
gi|325121319|gb|ADY80842.1| protein kinase [Acinetobacter calcoaceticus PHEA-2]
Length = 303
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/183 (39%), Positives = 96/183 (52%), Gaps = 6/183 (3%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W P G TI N PV +VS NDA Y W G LPTE EWEY + + P
Sbjct: 118 TWKTPNGSTGTIA-TPNEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PL 175
Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
T +H +AN WQGEFP N D + APV + N F L++M+GNVWEWT+
Sbjct: 176 HQAPTDEHQHPQANYWQGEFPFQNLNQDHFKGVAPVGCFTANHFKLFDMIGNVWEWTSSP 235
Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+ H +Y+ + T T V KGGS+LC + YC R+R ++R D +A ++G
Sbjct: 236 YQGAHDQHMGNYSDLRQQQITSTQYVIKGGSFLCAQNYCSRYRSSSRYPQDFDLAATHVG 295
Query: 337 FRC 339
FR
Sbjct: 296 FRT 298
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/156 (32%), Positives = 72/156 (46%), Gaps = 15/156 (9%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPILIKDGEFP--------------SRNVTLDAFYLDQHE 54
+E+ ++ LP + T +LI DG+F + + F++DQ E
Sbjct: 13 IEKCQNYTGLPQGWLKQTTAGMVLIADGDFNFGSEKAYPDELNFGKKQREVKGFWIDQTE 72
Query: 55 VSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
V+ QF FV ATGY+T+AEK VF P ++ + + K G TI
Sbjct: 73 VTVAQFASFVKATGYITDAEKQKQAAVFSPDPDHPQQWWQLKSGYTWKTPNGSTGTIA-T 131
Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRG 150
+ PV ++S NDA Y W G LPTE EWEY +
Sbjct: 132 PNEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKA 167
>gi|302039449|ref|YP_003799771.1| hypothetical protein NIDE4178 [Candidatus Nitrospira defluvii]
gi|300607513|emb|CBK43846.1| conserved exported protein of unknown function, Sulfatase-modifying
factor-like [Candidatus Nitrospira defluvii]
Length = 308
Score = 120 bits (300), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 74/198 (37%), Positives = 107/198 (54%), Gaps = 15/198 (7%)
Query: 152 LENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRG 211
L + L P W + G + E ++HPV+HVSW DA AYC W G RLPTEAEWE RG
Sbjct: 106 LASDLVPLIDWQYDGG--NFQETMVSHPVMHVSWFDAEAYCKWAGKRLPTEAEWEKAARG 163
Query: 212 GLENRLFPWGNNLTPRGEHRANVWQ-GEFPTNNTAADGYLSTAPVMSYKE-----NKFGL 265
+ R++PWGN + G RAN + G + L P++S + + +G+
Sbjct: 164 E-DGRIYPWGNQMA--GLSRANFGRTGLSGPVRDRPERLLLYPPIISVNKYDNAASPYGV 220
Query: 266 YNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARS 324
+ M GNV EW ADW++ ++ P NPKGP TG+ + +GG ++ + R A R+
Sbjct: 221 FQMAGNVAEWVADWYDPKYYTTGPDKNPKGPETGSQRSFRGGGWIDSTPSV---RVAQRN 277
Query: 325 QNTPDSSAGNLGFRCAAD 342
P++ +GFRCA D
Sbjct: 278 GTDPNTKMNWMGFRCARD 295
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/152 (36%), Positives = 81/152 (53%), Gaps = 31/152 (20%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKD--GEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+ +M+ +P F MG+++ + EFP R V LDA+ +D++EV+ QF +FV A+
Sbjct: 51 HDEMIRIPAGEFLMGSDRKVDKNSYLAEFPQRKVYLDAYEIDKYEVTTVQFLKFVLASDL 110
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
V PL+ + +D F+ E + HPV+H+SW DA A
Sbjct: 111 V-------------PLIDWQ---------YDGGNFQ------ETMVSHPVMHVSWFDAEA 142
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
YC W G RLPTEAEWE RG + R++PWG+
Sbjct: 143 YCKWAGKRLPTEAEWEKAARGE-DGRIYPWGN 173
>gi|427424482|ref|ZP_18914605.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-136]
gi|425698782|gb|EKU68415.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-136]
Length = 334
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/182 (39%), Positives = 96/182 (52%), Gaps = 6/182 (3%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W P G I N PV +VS NDA Y W G LPTE EWEY + + P
Sbjct: 149 TWKTPNGSMGAIA-TPNEPVRYVSKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PL 206
Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
T +H +AN WQGEFP N D + APV + NKF L++M+GNVWEWT+
Sbjct: 207 HQPPTDEHQHPQANYWQGEFPFQNLNQDHFKGVAPVGCFAANKFKLFDMIGNVWEWTSSP 266
Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+ H +Y+ + T T V KGGS+LC + YC R+R ++R D +A ++G
Sbjct: 267 YQGAHDQHMGNYSDLRQQQITSTQYVIKGGSFLCAQNYCSRYRSSSRYPQDFDLAATHVG 326
Query: 337 FR 338
FR
Sbjct: 327 FR 328
>gi|194336182|ref|YP_002017976.1| hypothetical protein Ppha_1079 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194308659|gb|ACF43359.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 267
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 76/174 (43%), Positives = 99/174 (56%), Gaps = 20/174 (11%)
Query: 177 NHPVVHVSWNDAVAYC------TWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
N PVV+VSWNDAVAYC T + RLPTEAEWEY CR G F G NLT
Sbjct: 97 NDPVVNVSWNDAVAYCKALSAKTGKRCRLPTEAEWEYACRAG-SRTPFNTGQNLT---TG 152
Query: 231 RANVWQGEFPTNNTAADGY-LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---- 285
+AN + G +P N Y PV ++ N++GLYNM GNVWEW +DW++ ++
Sbjct: 153 QAN-YNGNYPYNGNPKGVYRQKIVPVNTFAPNRWGLYNMHGNVWEWCSDWYSETYYDECK 211
Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
NP GP+T + +V +GG + + +YC R A RS NTPD + +GFR
Sbjct: 212 AQGVVTNPTGPSTRSRRVIRGGGWSDDAEYC---RSAYRSDNTPDYRSNGVGFR 262
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 68/154 (44%), Gaps = 46/154 (29%)
Query: 4 LPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
LPA +E VL+PG TF MG+ + V + FY+ ++EV+ +F++F
Sbjct: 25 LPAKAME----FVLIPGGTFTMGSPAGEVGHQNNETQHQVKVSDFYIAEYEVTVAEFRKF 80
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
V+A GY + + + PVV++S
Sbjct: 81 VAAKGYQ------------------------------------WNEGTQGEANDPVVNVS 104
Query: 124 WNDAVAYC------TWRGARLPTEAEWEYGCRGG 151
WNDAVAYC T + RLPTEAEWEY CR G
Sbjct: 105 WNDAVAYCKALSAKTGKRCRLPTEAEWEYACRAG 138
>gi|403052604|ref|ZP_10907088.1| hypothetical protein AberL1_13896 [Acinetobacter bereziniae LMG
1003]
Length = 331
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 93/327 (28%), Positives = 138/327 (42%), Gaps = 66/327 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P +F++G+N+ + F ++ +++F++DQ EV+ QF FV ATGY+T+AE
Sbjct: 62 MVRIPKGSFQLGSNQ-AYPDEINFGAKQRQVESFWIDQTEVTVAQFSSFVKATGYITDAE 120
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K VF P ++ + + K G PV +++ NDA Y W
Sbjct: 121 KQHQAAVFSPNKQTPQQWWQLKADYTWKSPNGAKGQAPLPA-EPVRYVTQNDAEHYAVWL 179
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G LPTE EWEY + G + T E HP + W
Sbjct: 180 GHDLPTEQEWEYAAKAGSTTDT----------PLHQTPEDAHQHPQANY----------W 219
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+G FP+ N T AD + AP
Sbjct: 220 QGE--------------------FPFNN---------------------TQADHFEGVAP 238
Query: 255 VMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNP--KGPTTGTDKVKKGGSYLCN 311
V Y N F L++M+GNVWEWT+ + H +Y + GT V KGGS+LC
Sbjct: 239 VGCYAANAFKLFDMIGNVWEWTSSIYRGAHDQHMGNYADLRQQRIAGTQYVIKGGSFLCA 298
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFR 338
+C R+R ++R D +A ++GFR
Sbjct: 299 SNFCSRYRNSSRYPQEFDLAATHVGFR 325
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/213 (33%), Positives = 93/213 (43%), Gaps = 53/213 (24%)
Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR---ANVW 235
PV +V+ NDA Y W G LPTE EWEY + G + TP H+ AN W
Sbjct: 163 PVRYVTQNDAEHYAVWLGHDLPTEQEWEYAAKAGSTTDT---PLHQTPEDAHQHPQANYW 219
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP 295
QGEFP NNT AD + APV Y N F L++M+GNVWEWT+ + H
Sbjct: 220 QGEFPFNNTQADHFEGVAPVGCYAANAFKLFDMIGNVWEWTSSIYRGAH----------- 268
Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDKVKKG 355
++ N + R A GT V KG
Sbjct: 269 ----------DQHMGNYADLRQQRIA--------------------------GTQYVIKG 292
Query: 356 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 388
GS+LC +C R+R ++R D +A ++GFR
Sbjct: 293 GSFLCASNFCSRYRNSSRYPQEFDLAATHVGFR 325
>gi|294651296|ref|ZP_06728619.1| cysteine-type sulfatase aerobic maturase domain protein, partial
[Acinetobacter haemolyticus ATCC 19194]
gi|292822809|gb|EFF81689.1| cysteine-type sulfatase aerobic maturase domain protein
[Acinetobacter haemolyticus ATCC 19194]
Length = 153
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 67/154 (43%), Positives = 83/154 (53%), Gaps = 11/154 (7%)
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTP--RGEHRANVWQGEFPTNNTAAD 247
AY W G LPTE EWEY +G F +++ P +G AN WQGEFP NT D
Sbjct: 1 AYAHWLGHDLPTELEWEYAAKG------FQQQSDIGPTHQGHTTANYWQGEFPYQNTQQD 54
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN---VHHHPAPSYNPKGPTTGTDKVKK 304
G+ APV +K N FGLY+M+GNVWEWT + HH P+ P + K
Sbjct: 55 GFNEVAPVGCFKANPFGLYDMIGNVWEWTQTPYTGVRDHHMGDPATYRHDPNSILQYTIK 114
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
GGSYLC YC R+R AAR D + ++GFR
Sbjct: 115 GGSYLCASNYCARYRAAARHPQELDLATSHVGFR 148
>gi|193076705|gb|ABO11408.2| hypothetical protein A1S_0976 [Acinetobacter baumannii ATCC 17978]
Length = 335
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 97/327 (29%), Positives = 142/327 (43%), Gaps = 66/327 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+T+AE
Sbjct: 66 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K VF P ++ + + K G + T+ + H PV +++ NDA Y W
Sbjct: 125 KQKQAAVFSPDPHHPQQWWQLKTGYTWKTPNGKNGTLPNP-HEPVRYVTKNDAEHYAVWL 183
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G LPTE EWEY + P + L+ D EH+ HP + W
Sbjct: 184 GRDLPTELEWEYAAKANS-----PTDTPLYQAPTD---EHQ--HPQANY----------W 223
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+GA FP+ N D + AP
Sbjct: 224 QGA--------------------FPF---------------------ENLNQDHFTGIAP 242
Query: 255 VMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCN 311
V +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 243 VGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLCA 302
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFR 338
E YC R+R ++R D +A ++GFR
Sbjct: 303 ENYCSRYRNSSRYPQDFDLAATHVGFR 329
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 73/233 (31%), Positives = 100/233 (42%), Gaps = 58/233 (24%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGG--LENRLF 218
+W P G + T+ + + PV +V+ NDA Y W G LPTE EWEY + + L+
Sbjct: 150 TWKTPNGKNGTLPNP-HEPVRYVTKNDAEHYAVWLGRDLPTELEWEYAAKANSPTDTPLY 208
Query: 219 PWGNNLTPRGEHR---ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
P EH+ AN WQG FP N D + APV +K N F L++M+GNVWEW
Sbjct: 209 -----QAPTDEHQHPQANYWQGAFPFENLNQDHFTGIAPVGCFKPNGFKLFDMIGNVWEW 263
Query: 276 TADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
T+ + H D GN
Sbjct: 264 TSSPYQGAH--------------------------------------------DQHMGNY 279
Query: 336 GFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 388
A + + T V KGGS+LC E YC R+R ++R D +A ++GFR
Sbjct: 280 ---SALRQQEMSSTQYVIKGGSFLCAENYCSRYRNSSRYPQDFDLAATHVGFR 329
>gi|194337011|ref|YP_002018805.1| hypothetical protein Ppha_1976 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194309488|gb|ACF44188.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 319
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 76/174 (43%), Positives = 99/174 (56%), Gaps = 20/174 (11%)
Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
NHPV+HVSWNDA AYC W RLPTEAEWE+ CR G E + F G+NLT
Sbjct: 149 NHPVLHVSWNDAEAYCAWLSKKTGNTFRLPTEAEWEFACRAGSE-KPFHTGDNLTT---G 204
Query: 231 RANVWQGEFPTNNTAADGYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---- 285
+AN + G +P NN+ + +T PV S + N +GLYNM GNVWEW +D + +++
Sbjct: 205 QAN-YNGNYPFNNSVKGIFRKNTLPVDSLEPNVWGLYNMHGNVWEWCSDMYGKNYYDECR 263
Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
NP+GPTTG+ +V +GG + YC R A R N P +GFR
Sbjct: 264 AKGTVENPEGPTTGSHRVIRGGGWNYVAVYC---RSAYRDNNAPGYRYDLVGFR 314
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/148 (31%), Positives = 71/148 (47%), Gaps = 16/148 (10%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+ + + GD F MG+ + + +V + + L + V F+ F+ +GY T+A
Sbjct: 53 NYIEVKGDVFVMGSPENEAGHREDERQHSVKIADYSLYKFAVCVGDFKRFIDDSGYKTDA 112
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH--RMHHPVVHISWNDAVAYC 131
EK +FV++ K + + + L + +HPV+H+SWNDA AYC
Sbjct: 113 EKENFSFVYD--------GKSGKRKDGINWSHDLSGNVREPSAYNHPVLHVSWNDAEAYC 164
Query: 132 TWRGA------RLPTEAEWEYGCRGGLE 153
W RLPTEAEWE+ CR G E
Sbjct: 165 AWLSKKTGNTFRLPTEAEWEFACRAGSE 192
>gi|110598764|ref|ZP_01387025.1| Protein of unknown function DUF323 [Chlorobium ferrooxidans DSM
13031]
gi|110339628|gb|EAT58142.1| Protein of unknown function DUF323 [Chlorobium ferrooxidans DSM
13031]
Length = 332
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 102/341 (29%), Positives = 140/341 (41%), Gaps = 102/341 (29%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
++ V +PG F MG+ + V L FY+ ++ V++ +F++FV ATGY+T+
Sbjct: 78 ENFVFIPGGEFTMGSPLSEANRQINETQHQVRLSDFYISRYPVTSAEFRKFVKATGYLTD 137
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE--HRMHHPVVHISWNDAVAY 130
AEK D RH G+ T+ +HPVV++SWNDAVAY
Sbjct: 138 AEKANDQ---------------KHWRH------GVSGTLRPNSEDNHPVVYMSWNDAVAY 176
Query: 131 CTW------RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVS 184
C W R RLPTEAE EY CR G P+ S S R N+
Sbjct: 177 CNWISQHTGRKFRLPTEAEREYACRAGTAT---PFNSG------QSLTTERANY------ 221
Query: 185 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNT 244
N P GN P+G +R N
Sbjct: 222 ------------------------------NGQNPSGNK--PKGVYREN----------- 238
Query: 245 AADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGT 299
T V S+ N +GLYNM GNVWEW +DW++ + NP GP G+
Sbjct: 239 -------TVAVNSFAPNAWGLYNMHGNVWEWCSDWFDPAFYEQCKAAGTVTNPDGPEDGS 291
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+V +G S+ E++C R A R P + N+GFR
Sbjct: 292 RRVLRGASWFDFEEHC---RSAYRIGGAPLARNPNVGFRLV 329
>gi|119357294|ref|YP_911938.1| hypothetical protein Cpha266_1490 [Chlorobium phaeobacteroides DSM
266]
gi|119354643|gb|ABL65514.1| protein of unknown function DUF323 [Chlorobium phaeobacteroides DSM
266]
Length = 742
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 104/337 (30%), Positives = 140/337 (41%), Gaps = 87/337 (25%)
Query: 16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
VL+ GD F MG+ K + + V L +YL ++ V+ F+ F GY
Sbjct: 478 VLIRGDEFTMGSPKHEVDRSDNETEHQVKLSDYYLCKYAVTVADFKRFAEEIGYT----- 532
Query: 76 FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRG 135
++ E A S V F+G +WN V
Sbjct: 533 -----------TDAETAGSSSV------FDGK---------------AWNKKVG------ 554
Query: 136 ARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW- 194
W +G E + S E+ NHPV+HVSWNDAVAY W
Sbjct: 555 ------INWRHGV----------------SESLRSAGEY--NHPVLHVSWNDAVAYAGWL 590
Query: 195 -----RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
+ RLPTEAEWEY CR G + F G NLT +AN + G +P N +
Sbjct: 591 SLQTGKSFRLPTEAEWEYACRAG-KGEPFSTGENLTT---DQAN-YDGNYPYNGNRKGVF 645
Query: 250 LSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVK 303
T E N GLYNM GNVWEW +D + ++ NP GP TG+ +V
Sbjct: 646 RGTTIACDALEPNALGLYNMHGNVWEWCSDRYGDKYYDECKAKGIVENPVGPETGSLRVL 705
Query: 304 KGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+GGS+ + C R A R ++PD + + GFR A
Sbjct: 706 RGGSWFNYARIC---RSAYRDGDSPDYRSDDAGFRLA 739
>gi|126641026|ref|YP_001084010.1| hypothetical protein A1S_0976 [Acinetobacter baumannii ATCC 17978]
Length = 270
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 97/328 (29%), Positives = 142/328 (43%), Gaps = 66/328 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+T+AE
Sbjct: 1 MVFIPDGHFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 59
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K VF P ++ + + K G + T+ + H PV +++ NDA Y W
Sbjct: 60 KQKQAAVFSPDPHHPQQWWQLKTGYTWKTPNGKNGTLPNP-HEPVRYVTKNDAEHYAVWL 118
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
G LPTE EWEY + P + L+ D EH+ HP + W
Sbjct: 119 GRDLPTELEWEYAAKANS-----PTDTPLYQAPTD---EHQ--HPQANY----------W 158
Query: 195 RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAP 254
+GA FP+ N D + AP
Sbjct: 159 QGA--------------------FPFEN---------------------LNQDHFTGIAP 177
Query: 255 VMSYKENKFGLYNMVGNVWEWTAD-WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCN 311
V +K N F L++M+GNVWEWT+ + H +Y+ + + T V KGGS+LC
Sbjct: 178 VGCFKPNGFKLFDMIGNVWEWTSSPYQGAHDQHMGNYSALRQQEMSSTQYVIKGGSFLCA 237
Query: 312 EQYCYRHRCAARSQNTPDSSAGNLGFRC 339
E YC R+R ++R D +A ++GFR
Sbjct: 238 ENYCSRYRNSSRYPQDFDLAATHVGFRT 265
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 73/234 (31%), Positives = 100/234 (42%), Gaps = 58/234 (24%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGG--LENRLF 218
+W P G + T+ + + PV +V+ NDA Y W G LPTE EWEY + + L+
Sbjct: 85 TWKTPNGKNGTLPNP-HEPVRYVTKNDAEHYAVWLGRDLPTELEWEYAAKANSPTDTPLY 143
Query: 219 PWGNNLTPRGEHR---ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEW 275
P EH+ AN WQG FP N D + APV +K N F L++M+GNVWEW
Sbjct: 144 -----QAPTDEHQHPQANYWQGAFPFENLNQDHFTGIAPVGCFKPNGFKLFDMIGNVWEW 198
Query: 276 TADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNL 335
T+ + H D GN
Sbjct: 199 TSSPYQGAH--------------------------------------------DQHMGNY 214
Query: 336 GFRCAADKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 389
A + + T V KGGS+LC E YC R+R ++R D +A ++GFR
Sbjct: 215 ---SALRQQEMSSTQYVIKGGSFLCAENYCSRYRNSSRYPQDFDLAATHVGFRT 265
>gi|406672268|ref|ZP_11079494.1| hypothetical protein HMPREF9700_00036 [Bergeyella zoohelcum CCUG
30536]
gi|405587956|gb|EKB61674.1| hypothetical protein HMPREF9700_00036 [Bergeyella zoohelcum CCUG
30536]
Length = 741
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 80/214 (37%), Positives = 104/214 (48%), Gaps = 29/214 (13%)
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA 188
A+ RG ++ W G G E ++ GSW +G + NHPVVHV+W A
Sbjct: 553 AFLNARGNQVTNGVRWYQGGSSG-EIQVVS-GSWKAKQG-------KENHPVVHVTWYGA 603
Query: 189 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN--VWQGEFPTNNTAA 246
AY W G RLPTEAEWEY RGG + F + G + A W G+ +N
Sbjct: 604 KAYAEWVGGRLPTEAEWEYAARGGKYSLGFFYA------GSNNAGDVAWYGDNSGSN--- 654
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGG 306
T PV + N+ GLY+M GNVWEW +DW+ + A S NP GPTTG+ +V +GG
Sbjct: 655 -----THPVGQKQPNELGLYDMSGNVWEWCSDWYGAYSGNAQS-NPTGPTTGSYRVLRGG 708
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+ Y R A R+ P N GFR
Sbjct: 709 GFYYGATYT---RVARRNYYDPSYQNYNYGFRVV 739
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/137 (35%), Positives = 68/137 (49%), Gaps = 18/137 (13%)
Query: 16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG-YVTEAE 74
VL+ G TF MG+ D E P VTL + + ++E++N Q+ F++A G VT
Sbjct: 508 VLVEGGTFWMGSPNNEGETD-ERPIHQVTLSNYRMSKYEITNAQYAAFLNARGNQVTNGV 566
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
++ +I V K +G + +HPVVH++W A AY W
Sbjct: 567 RW---------YQGGSSGEIQVVSGSWKAKQG-------KENHPVVHVTWYGAKAYAEWV 610
Query: 135 GARLPTEAEWEYGCRGG 151
G RLPTEAEWEY RGG
Sbjct: 611 GGRLPTEAEWEYAARGG 627
>gi|194336983|ref|YP_002018777.1| hypothetical protein Ppha_1944 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194309460|gb|ACF44160.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 829
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 75/195 (38%), Positives = 102/195 (52%), Gaps = 23/195 (11%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLE 214
+W H + +HPV+HVSWNDAVAYC W + RLPTEAEWEY CR
Sbjct: 640 NWRHDVSGKKRPSDQYSHPVLHVSWNDAVAYCKWMTDKTGKTFRLPTEAEWEYACRAKTT 699
Query: 215 NRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKE----NKFGLYNMVG 270
F G NLT +AN + G +P NN A Y T + E N++GL++M G
Sbjct: 700 TP-FNTGENLTT---DQAN-YDGNYPYNNNAKGRYRETTVAVGSSERFEPNQWGLFHMHG 754
Query: 271 NVWEWTADWWNVHHH-----PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQ 325
NVWEW +DW+ +++ NP GP TG+ +V +GG + + + C R A R
Sbjct: 755 NVWEWCSDWYGGNYYEECKTKGTVENPVGPETGSGRVLRGGGWNGSARGC---RSAYRGS 811
Query: 326 NTPDSSAGNLGFRCA 340
+ PD + + GFR A
Sbjct: 812 DAPDYRSRSAGFRLA 826
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 76/144 (52%), Gaps = 13/144 (9%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSR-NVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ V +P F MG+ + + G ++ V + FYL ++ V+ +F+ F+ +GY T+
Sbjct: 559 NFVRIPAGEFAMGSPEGEPERYGSDETQHQVKVSEFYLCKYAVTPAEFKTFIDESGYQTD 618
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEK + +++ E + + RHD+ + HPV+H+SWNDAVAYC
Sbjct: 619 AEKANSSGIWDG--KEWKDKEGVNWRHDVSG----KKRPSDQYSHPVLHVSWNDAVAYCK 672
Query: 133 W------RGARLPTEAEWEYGCRG 150
W + RLPTEAEWEY CR
Sbjct: 673 WMTDKTGKTFRLPTEAEWEYACRA 696
>gi|260551202|ref|ZP_05825405.1| sulfatase-modifying factor 1 [Acinetobacter sp. RUH2624]
gi|424056424|ref|ZP_17793945.1| hypothetical protein W9I_02794 [Acinetobacter nosocomialis Ab22222]
gi|425740965|ref|ZP_18859124.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-487]
gi|260405807|gb|EEW99296.1| sulfatase-modifying factor 1 [Acinetobacter sp. RUH2624]
gi|407441464|gb|EKF47970.1| hypothetical protein W9I_02794 [Acinetobacter nosocomialis Ab22222]
gi|425493812|gb|EKU60036.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii WC-487]
Length = 335
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/182 (37%), Positives = 97/182 (53%), Gaps = 6/182 (3%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W P G + + H + PV +V+ NDA Y W G LPTE EWEY + + P
Sbjct: 150 TWKTPNGKNGALPHP-HEPVRYVTKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PL 207
Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
T +H +AN WQG FP N D + APV +K N F L++M+GNVWEWT+
Sbjct: 208 HQAPTDEHQHPQANYWQGTFPFENLNHDHFTGVAPVGCFKSNGFKLFDMIGNVWEWTSSP 267
Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+ H +Y+ + + T V KGGS+LC E YC R+R ++R D +A ++G
Sbjct: 268 YQGAHDQHMGNYSDLRQKEMSSTQYVIKGGSFLCAENYCSRYRNSSRYPQDFDLAATHVG 327
Query: 337 FR 338
FR
Sbjct: 328 FR 329
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 2/136 (1%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+T+AE
Sbjct: 66 MVFIPDGQFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K VF P ++ + + K G + + H H PV +++ NDA Y W
Sbjct: 125 KQKQAAVFTPDPRHPQQWWQLKSGYTWKTPNGKNGALPHP-HEPVRYVTKNDAEHYAVWL 183
Query: 135 GARLPTEAEWEYGCRG 150
G LPTE EWEY +
Sbjct: 184 GRDLPTELEWEYAAKA 199
>gi|445426185|ref|ZP_21437542.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
sp. WC-743]
gi|444752735|gb|ELW77407.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
sp. WC-743]
Length = 316
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/166 (40%), Positives = 89/166 (53%), Gaps = 9/166 (5%)
Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR---ANVW 235
PV +V+ NDA Y W G LPTE EWEY + G + P H+ AN W
Sbjct: 148 PVRYVTQNDAEHYAVWLGHDLPTEQEWEYAAKAGSTTDT---PLHQAPEDAHQHPQANYW 204
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNP-- 292
QGEFP NNT AD + APV Y N F L++M+GNVWEWT+ + H +Y
Sbjct: 205 QGEFPFNNTQADHFEGVAPVGCYAANAFKLFDMIGNVWEWTSSIYRGAHDQHMGNYADLR 264
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ GT V KGGS+LC +C R+R ++R D +A ++GFR
Sbjct: 265 QQRIAGTQYVIKGGSFLCASNFCSRYRNSSRYPQEFDLAATHVGFR 310
>gi|451979552|ref|ZP_21927968.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
gi|451763204|emb|CCQ89159.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
Length = 432
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/168 (40%), Positives = 96/168 (57%), Gaps = 11/168 (6%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
N PV ++WN+A YC +G RLPTEAEWEY RGG + + WG+ T +G + AN
Sbjct: 268 NLPVESITWNEASQYCEKQGKRLPTEAEWEYAARGGTQTHFY-WGD--TVKG-NEANFCD 323
Query: 237 GEFPTNNTAA---DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNP 292
N A DG+ +TAPV +K N +GL++M GNV EWTADW + +++ +P NP
Sbjct: 324 KACELNIRVAAIDDGFRTTAPVGQFKPNSYGLHDMAGNVAEWTADWMDENYYRSSPKDNP 383
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KGP +KV +GG++ + R + R+ PD +GFRC
Sbjct: 384 KGPRPLDNKVVRGGAWNTTAGFL---RSSNRAAFLPDFRNPGVGFRCV 428
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 41/147 (27%), Positives = 63/147 (42%), Gaps = 45/147 (30%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV + G F+MG+ + E P V +D+F +D++EV+ +FQ +
Sbjct: 210 EMVTIKGGCFQMGSKQGT---PAEQPVHEVCVDSFQMDKYEVTQARFQAVM--------- 257
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+R + ++ PV I+WN+A YC
Sbjct: 258 ----------------DRNPSDHIGANL----------------PVESITWNEASQYCEK 285
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWG 160
+G RLPTEAEWEY RGG + + WG
Sbjct: 286 QGKRLPTEAEWEYAARGGTQTHFY-WG 311
>gi|328953727|ref|YP_004371061.1| sulfatase-modifying factor protein [Desulfobacca acetoxidans DSM
11109]
gi|328454051|gb|AEB09880.1| Sulphatase-modifying factor protein [Desulfobacca acetoxidans DSM
11109]
Length = 292
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/176 (39%), Positives = 92/176 (52%), Gaps = 14/176 (7%)
Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
N PVV V+W+DA A+ W LPTEA+WE RGGL + FPWG+ +G
Sbjct: 124 NQPVVGVTWHDATAFAKWLSRITQEEYMLPTEAQWEAAARGGLVGQPFPWGSTPPEQGGR 183
Query: 231 RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPS 289
++ F N DG+ TAPV S+ N FGLY+M GNV EW DW++ + +
Sbjct: 184 ----FRASFNPNPYDKDGFRYTAPVASFPPNGFGLYDMAGNVAEWCEDWYHSSSYTKSRK 239
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGP 345
NP GP +GT +V +GGS+ + RCAAR P + G +GFR P
Sbjct: 240 ENPAGPASGTSRVLRGGSWYGRAR---ELRCAARQSFRPSQTDGFIGFRLIRPVSP 292
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 47/158 (29%), Positives = 70/158 (44%), Gaps = 45/158 (28%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEF----PSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+ L+PG + +G+ + G + P V L FY+ E++N Q+ EF++ATG+
Sbjct: 53 QLRLIPGGPYLLGSPPE---ESGRYWDEGPQHWVNLSPFYITATEITNAQYGEFLAATGH 109
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
EPL LD + + + PVV ++W+DA A
Sbjct: 110 P------------EPLY-------------------WLDRNL-NAPNQPVVGVTWHDATA 137
Query: 130 YCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGS 161
+ W LPTEA+WE RGGL + FPWGS
Sbjct: 138 FAKWLSRITQEEYMLPTEAQWEAAARGGLVGQPFPWGS 175
>gi|312131280|ref|YP_003998620.1| gliding motility-associated lipoprotein gldj [Leadbetterella
byssophila DSM 17132]
gi|311907826|gb|ADQ18267.1| gliding motility-associated lipoprotein GldJ [Leadbetterella
byssophila DSM 17132]
Length = 419
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 113/392 (28%), Positives = 164/392 (41%), Gaps = 75/392 (19%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
PA P +MV + G F MGT + ++ + R VT+ +FY+D+ E++N + E++
Sbjct: 59 PAGP-----NMVFIEGGRFVMGTLEEDVMYSRDNIDRTVTVSSFYMDETEIANLDYLEYL 113
Query: 65 SATGYVTEAEKFG----DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
A + E + DT V+ +S + Q R+ G S +PVV
Sbjct: 114 YAVSRDSTREFYAAALPDTTVWAHEMSFND-----QYVDQYLRYPGFRS-------YPVV 161
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
+SW A YC WR A + E +YG +G + F E D+T + P+
Sbjct: 162 GVSWVQANDYCKWRSAVVNREMYAKYGGKGEKKTLSFGRKKKGAEEVADNTPQVTGRVPI 221
Query: 181 -VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGL---------ENRLFPW-GNNL----- 224
++ D RLPTEAEWEY + + +NR++PW G++L
Sbjct: 222 ETGINLPDF---------RLPTEAEWEYAAKATIGTLSVDENYDNRIYPWDGSSLRRAKG 272
Query: 225 TPRGEHRANVWQGEFPTNNTAA---DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
RGE +AN +G A D Y TAPV Y N FGLYNM GNV EW D +
Sbjct: 273 KARGEMQANYKRGRGDYAGIAGRLNDKYQITAPVYEYYPNDFGLYNMAGNVNEWVYDLYR 332
Query: 282 VHHHPAPSYNPKGPTTGTD---------------------KVKKGGSYLCNEQYCYRHRC 320
+ + +N P +D +V KGGS+ Y
Sbjct: 333 PNSY--RDFNDLNPIRRSDYQDEEKNYDNANYNSLINNRVRVYKGGSW---NDVAYWLSP 387
Query: 321 AARSQNTPDSSAGNLGFRCAADKGPTTGTDKV 352
R DSS +GFRCA + G K+
Sbjct: 388 GTRRFLDQDSSTATIGFRCAMISVGSAGKRKL 419
>gi|423315592|ref|ZP_17293497.1| hypothetical protein HMPREF9699_00068 [Bergeyella zoohelcum ATCC
43767]
gi|405585910|gb|EKB59703.1| hypothetical protein HMPREF9699_00068 [Bergeyella zoohelcum ATCC
43767]
Length = 869
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/165 (41%), Positives = 88/165 (53%), Gaps = 14/165 (8%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR-LFPWGNNLTPRGEHRANVW 235
NHPV HVSW A AY W G RLPTEAEWEY RGG + F G+N NV
Sbjct: 716 NHPVGHVSWYGARAYAQWVGGRLPTEAEWEYAARGGKYSLGFFYAGSN---------NVG 766
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGP 295
+ +N++ +G +T PV + N+ GLY+M GNVWEW +DW+ + A S NP GP
Sbjct: 767 DVAWYDDNSSVNGTRTTHPVGQKQPNELGLYDMSGNVWEWCSDWYGAYSSNAQS-NPTGP 825
Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
TTG +V +GG + + R A R N P + GFR
Sbjct: 826 TTGNLRVLRGGGFNDDANIT---RVAHRRSNYPSNQLNYFGFRVV 867
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 70/136 (51%), Gaps = 17/136 (12%)
Query: 16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
VL+ G TF MG+ + + E P VT+ F + ++EV+N Q+ +F++ G TE
Sbjct: 633 VLVEGGTFLMGSPNGVGYSN-ERPQHQVTVSTFRMSKYEVTNAQYAQFLNVKGNQTEG-- 689
Query: 76 FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRG 135
G+++ E S +K+ G+ +HPV H+SW A AY W G
Sbjct: 690 -GNSWYDEYYGS-------------IKKNGGVYLVTPGYENHPVGHVSWYGARAYAQWVG 735
Query: 136 ARLPTEAEWEYGCRGG 151
RLPTEAEWEY RGG
Sbjct: 736 GRLPTEAEWEYAARGG 751
>gi|402758441|ref|ZP_10860697.1| hypothetical protein ANCT7_12175 [Acinetobacter sp. NCTC 7422]
Length = 334
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 67/168 (39%), Positives = 93/168 (55%), Gaps = 9/168 (5%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR---AN 233
+ PV +V+ NDA Y W G LP+E EWEY + G + P+ H+ AN
Sbjct: 163 SEPVRYVTKNDAEHYAVWLGRDLPSEQEWEYAAKAGSTTDT---PLHQAPQDTHQHPQAN 219
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNP 292
WQGEFP NNTA D + APV Y N F L++M+GNVWEWT+ ++ H +Y
Sbjct: 220 YWQGEFPLNNTAEDHFDGVAPVGCYPANAFRLFDMIGNVWEWTSSVYHGAHDQHMGNYAE 279
Query: 293 --KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ +T T V KGGS+LC +C R+R ++R D +A ++GFR
Sbjct: 280 LRQQNSTATQYVLKGGSFLCASNFCARYRNSSRYPQEFDLAATHVGFR 327
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 51/139 (36%), Positives = 72/139 (51%), Gaps = 6/139 (4%)
Query: 15 MVLLPGDTFRMGTNK--PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV +P TF++G+N P + G+ P R V + F++DQ EV+ QF FV ATGYVT+
Sbjct: 64 MVWIPKGTFQLGSNLAYPDELNFGK-PQRQV--EGFWIDQTEVTVAQFASFVKATGYVTD 120
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AEK VF P ++ + + + K G PV +++ NDA Y
Sbjct: 121 AEKQKQAAVFSPDVNHPNQWWQLKSDYSWKTPNGAAGQAP-LPSEPVRYVTKNDAEHYAV 179
Query: 133 WRGARLPTEAEWEYGCRGG 151
W G LP+E EWEY + G
Sbjct: 180 WLGRDLPSEQEWEYAAKAG 198
>gi|422321266|ref|ZP_16402315.1| hypothetical protein HMPREF0005_03342 [Achromobacter xylosoxidans
C54]
gi|317403869|gb|EFV84342.1| hypothetical protein HMPREF0005_03342 [Achromobacter xylosoxidans
C54]
Length = 331
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 110/367 (29%), Positives = 150/367 (40%), Gaps = 106/367 (28%)
Query: 15 MVLLPGDTFRMGTNKP--ILIKD-------------GEFPSRNVTLD-AFYLDQHEVSNT 58
V +P F MG+++ L +D E P V + F L EV+
Sbjct: 35 FVAVPAGRFLMGSDEAPEALARDYPHYPRERFTQLFDEAPVHAVRITRPFQLMATEVTVG 94
Query: 59 QFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
QF+ F+ A+GYV E+E +G ++P SE A FEG D
Sbjct: 95 QFRRFLEASGYVPESEADGTGGYGYNAAYDPDASERGDA-----------FEGRDRRY-- 141
Query: 114 RMHHPVVHISWNDAVAYCTWRGARLP-TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTI 172
+WR P T+A
Sbjct: 142 ------------------SWRNPGFPQTDA------------------------------ 153
Query: 173 EHRMNHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTP 226
HPVV+VSWNDAVA W R RLPTEAEWEY R G R + G++
Sbjct: 154 -----HPVVNVSWNDAVAMARWLSQREGRVYRLPTEAEWEYAARAGGRGR-YQHGDDPAA 207
Query: 227 RGE-------HRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW 279
E A +W +DG+ TAPV SY N FGLY+M GN WEW ADW
Sbjct: 208 LAEVANTFDADAAPLWPKWQAYAVARSDGHRYTAPVASYAPNAFGLYDMHGNAWEWVADW 267
Query: 280 WNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+ ++ +P +P+GP +G +V++GGS+ Y R A R+ NTP++ +GFR
Sbjct: 268 YGEDYYAHSPRDDPQGPASGNVRVRRGGSWHTWPLYT---RSAYRNWNTPETRYPLVGFR 324
Query: 339 CAADKGP 345
+ P
Sbjct: 325 LLREVAP 331
>gi|159037635|ref|YP_001536888.1| hypothetical protein Sare_2027 [Salinispora arenicola CNS-205]
gi|157916470|gb|ABV97897.1| protein of unknown function DUF323 [Salinispora arenicola CNS-205]
Length = 314
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 117/358 (32%), Positives = 148/358 (41%), Gaps = 89/358 (24%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPIL----------IKD---GEFPSRNVTLDAFYLDQ 52
APP MV +PG TF G+ + L +D E P R V L + +D+
Sbjct: 14 APPAP--ATMVHVPGGTFLQGSPERTLDWLEAEGQAFPRDWFTDETPQRPVALRDYLIDR 71
Query: 53 HEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE 112
+ V+ QF +FV+ TGYVT AE+ G + V+ E A R G S I
Sbjct: 72 YPVTVAQFADFVACTGYVTSAERAGGSMVYSDYWEIREGAC-------WHRPAGYGSGIR 124
Query: 113 HRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTI 172
R HPVVHIS+ DA AY TW G RLPTE+EWE G RL+PWG
Sbjct: 125 SREDHPVVHISYPDAEAYATWAGKRLPTESEWERAAT-GRAYRLWPWGD----------- 172
Query: 173 EHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRA 232
TW R T AEW G GL+ W G H A
Sbjct: 173 --------------------TWDPTRANT-AEWTGGSLRGLDEWRQWWS------GIHNA 205
Query: 233 NVWQGEFPTNNTAADGYLSTAPVMSYK---ENKFGLYNMVGNVWEWTADWWNVHHHPAPS 289
QG P T PV ++ ++ G +M GNV+EWT + HP
Sbjct: 206 ---QGPVP----------QTTPVRAFSPRGDSPDGCADMTGNVYEWT----STTAHPYAP 248
Query: 290 YNPKGPTT----GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS-SAGNLGFRCAAD 342
PT G + +GGS++ + Y+ RCA R P S LGFRCA D
Sbjct: 249 ETRCDPTIHLVMGRFRAIRGGSWM---NFRYQVRCAERLYGDPTGWSNFALGFRCAGD 303
>gi|451980952|ref|ZP_21929334.1| putative Sulfatase-modifying factor 1 [Nitrospina gracilis 3/211]
gi|451761874|emb|CCQ90579.1| putative Sulfatase-modifying factor 1 [Nitrospina gracilis 3/211]
Length = 320
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 91/174 (52%), Gaps = 14/174 (8%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
N PV VSW+DA YC WRG RLPTEAEWE RG + N L+PWGN+L P W+
Sbjct: 150 NQPVTQVSWHDAKEYCEWRGKRLPTEAEWEKAARGPMGN-LYPWGNDLPPGKATYRRKWR 208
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-NPKGP 295
G + + Y + V +G+++M GNVWEW +DW++ ++ NP+GP
Sbjct: 209 GIHTMTDVGS--YPNGVSV-------YGVHDMAGNVWEWVSDWYDRKYYKKRVRNNPQGP 259
Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
G KV +GGS++ E + A R P + GFRCA D P T
Sbjct: 260 KRGVFKVMRGGSWVNYEDTLH---SAFRRWGRPYVRFNDTGFRCAKDPIPIPST 310
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 86/175 (49%), Gaps = 37/175 (21%)
Query: 9 VERYKDMVLLPGDTFRMGTNKPIL------------------IKDGEFPSRNVTLDAFYL 50
+E Y DMV +P TFRMG + KD EFP V LD +++
Sbjct: 43 LEMYSDMVEIPASTFRMGLGFHKITDMLEMCWQVDDRCNRWWFKD-EFPDHMVYLDKYWI 101
Query: 51 DQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDST 110
D +EV+N ++ EFV ATG+ P L + K ++ + E
Sbjct: 102 DIYEVTNQRYLEFVKATGH-------------RPALDDTCETKACW-EGNLWKGESFPKE 147
Query: 111 IEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
I+++ PV +SW+DA YC WRG RLPTEAEWE RG + N L+PWG+ L P
Sbjct: 148 IKNQ---PVTQVSWHDAKEYCEWRGKRLPTEAEWEKAARGPMGN-LYPWGNDLPP 198
>gi|386811281|ref|ZP_10098507.1| conserved hypothetical protein [planctomycete KSU-1]
gi|386406005|dbj|GAB61388.1| conserved hypothetical protein [planctomycete KSU-1]
Length = 263
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 93/167 (55%), Gaps = 20/167 (11%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
P+V ++W DAVAY TW G RLPTEAEWE R G + R++PWGN + + NVW+
Sbjct: 109 KQPIVGITWEDAVAYATWAGKRLPTEAEWEKAAR-GTDKRVWPWGNKFD---KEKCNVWE 164
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNV-HHHPAPSYNPK 293
+T PV Y+ K +G Y+M GN WEW AD+++ +++ + NP
Sbjct: 165 SVIK----------ATTPVGKYENGKSPYGCYDMAGNAWEWCADYYDQNYYYVSLQKNPA 214
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP +G +V +GGS+L Y RCAAR + + + +GFRC
Sbjct: 215 GPDSGLQRVVRGGSFLYFGHYA---RCAARYRVPWYAESPQIGFRCV 258
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/160 (36%), Positives = 83/160 (51%), Gaps = 43/160 (26%)
Query: 10 ERYKDMVLLPGDTFRMGTNK--------PILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQ 61
E+Y DMVL+P F MG + I DG P V LDAFY+D++EV+N Q++
Sbjct: 29 EKYPDMVLVPAGEFIMGDSTYYDWTFLLAYNIYDG--PEHRVYLDAFYIDKYEVTNEQYK 86
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
+FV TG+ ++ + +D+ RF ++ P+V
Sbjct: 87 KFVEETGH-----------------------RMPRCWNDV-RF--------NQPKQPIVG 114
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
I+W DAVAY TW G RLPTEAEWE R G + R++PWG+
Sbjct: 115 ITWEDAVAYATWAGKRLPTEAEWEKAAR-GTDKRVWPWGN 153
>gi|162452078|ref|YP_001614446.1| hypothetical protein sce3806 [Sorangium cellulosum So ce56]
gi|161162660|emb|CAN93965.1| hypothetical protein sce3806 [Sorangium cellulosum So ce56]
Length = 428
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 69/170 (40%), Positives = 94/170 (55%), Gaps = 10/170 (5%)
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
R PVV VSW+DA YC W G RLPTEAEWE R G+ R +PWG+ P AN
Sbjct: 266 RPEVPVVLVSWSDAQRYCVWAGGRLPTEAEWERAAR-GVAGRRYPWGHVYNP---FHANH 321
Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKF--GLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
+ F + +DG+L APV S + G+ ++ GNV EW AD+++ + + NP
Sbjct: 322 GRLSFDELDD-SDGFLELAPVGSLVAGRTPDGIDDLAGNVEEWVADYYHPEYPSGSAKNP 380
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+GP G ++V +GGSY+ + R AAR ++P S GFRCA D
Sbjct: 381 RGPDMGEERVLRGGSYILGRPWL---RGAARGHDSPTSRRAWRGFRCARD 427
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/52 (53%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 114 RMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
R PVV +SW+DA YC W G RLPTEAEWE R G+ R +PWG +P
Sbjct: 266 RPEVPVVLVSWSDAQRYCVWAGGRLPTEAEWERAAR-GVAGRRYPWGHVYNP 316
>gi|399017730|ref|ZP_10719919.1| hypothetical protein PMI16_00829 [Herbaspirillum sp. CF444]
gi|398102497|gb|EJL92677.1| hypothetical protein PMI16_00829 [Herbaspirillum sp. CF444]
Length = 299
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 77/183 (42%), Positives = 101/183 (55%), Gaps = 24/183 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
NHPV++V+WNDA A W RLPTEAEWEY CR G R +P G++ P G
Sbjct: 119 NHPVLNVTWNDAQALAAWLSKKEQVHYRLPTEAEWEYSCRAGSTTR-YPNGDD--PEGLS 175
Query: 231 R-ANVWQGE----FPTNNTAA----DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW- 280
R AN + +P + A DG+ T+PV S+ N FGLY+MVGN WEWTADW+
Sbjct: 176 RIANTFDANATPYWPRFASHALKNHDGFAFTSPVGSFPPNGFGLYDMVGNAWEWTADWYG 235
Query: 281 -NVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+ + H +P +P GP G KV++GGS+ Y RCA R+ NT + +G R
Sbjct: 236 EDTYAH-SPLADPHGPADGEVKVRRGGSWHTWSLYA---RCAFRNWNTTSTRYTLVGMRL 291
Query: 340 AAD 342
D
Sbjct: 292 VRD 294
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/181 (32%), Positives = 87/181 (48%), Gaps = 34/181 (18%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPS----RNVTLD------------AFYLDQHEVSNT 58
MVL+P F MGT++ + G +P R + LD FY+ ++ V+
Sbjct: 1 MVLIPAGEFLMGTDESLASLKGAYPDYEERRLLDLDDERPRHKVLITHPFYVGKYLVTVG 60
Query: 59 QFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
QF +FV+ +GY E+ + +G ++P E ER + R+ R+ +
Sbjct: 61 QFAKFVAESGYTPESIRDQTGGYGYNPDYDP--QETERKDAFEGRN--PRYSWKNPGFPQ 116
Query: 114 RMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
+HPV++++WNDA A W RLPTEAEWEY CR G R +P G PEG
Sbjct: 117 ADNHPVLNVTWNDAQALAAWLSKKEQVHYRLPTEAEWEYSCRAGSTTR-YPNGD--DPEG 173
Query: 168 I 168
+
Sbjct: 174 L 174
>gi|258648993|ref|ZP_05736462.1| serine/threonine kinase [Prevotella tannerae ATCC 51259]
gi|260850608|gb|EEX70477.1| serine/threonine kinase [Prevotella tannerae ATCC 51259]
Length = 291
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 72/166 (43%), Positives = 91/166 (54%), Gaps = 11/166 (6%)
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
R N PV+ VSW DAVA+C W G RLPTEAEWEY CR G F G+NLT +
Sbjct: 130 RENRPVIDVSWEDAVAFCEWTGCRLPTEAEWEYACRAG-STTPFNTGDNLTTDDAN---- 184
Query: 235 WQGEFPTNNTAADGY-LSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
+ G +P NN Y T PV S + N +GLY+M GNV EW ADW+ + A +P
Sbjct: 185 YDGTYPYNNHEQGLYRQQTVPVGSVGQPNAWGLYDMHGNVQEWCADWYE-EYSKAEQTDP 243
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
KGP GT ++ +GGS+ C R A R + +S + GFR
Sbjct: 244 KGPANGTYRILRGGSWTKRALQC---RSAYRGGSEYNSRFDDAGFR 286
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 59/140 (42%), Gaps = 38/140 (27%)
Query: 14 DMVLLPGDTFRMG--TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ V + +F MG T +P + GE + FY+ ++ V+ QF +F ATG
Sbjct: 64 EFVKIQAGSFVMGSPTTEPQR-EAGEIQHKVTLTKDFYMSKYPVTFEQFDKFCEATGR-- 120
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+ D R + PV+ +SW DAVA+C
Sbjct: 121 ---------------------------------QKPDDYGWGRENRPVIDVSWEDAVAFC 147
Query: 132 TWRGARLPTEAEWEYGCRGG 151
W G RLPTEAEWEY CR G
Sbjct: 148 EWTGCRLPTEAEWEYACRAG 167
>gi|390344998|ref|XP_001186776.2| PREDICTED: sulfatase-modifying factor 2-like [Strongylocentrotus
purpuratus]
Length = 198
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 66/162 (40%), Positives = 98/162 (60%), Gaps = 13/162 (8%)
Query: 4 LPAPPV-ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
+PA PV ++Y++MV LPG F MG++K KDGE R VT+ F ++++ ++ F++
Sbjct: 32 VPADPVFDKYEEMVALPGGKFMMGSSKSDG-KDGESVQRKVTVKPFKINKYPITAASFRK 90
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ----------VRHDM-KRFEGLDSTI 111
FV A Y T+AEK+ +F F+ + E + I Q V++ ++ EG S I
Sbjct: 91 FVRAKKYKTDAEKYSWSFAFDAFVPEAIKNTIEQRLPGAPWWIPVQNAYWRQPEGKGSGI 150
Query: 112 EHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 153
E+R+++P VHIS DA A+C W G RLPTE EWE+ RG LE
Sbjct: 151 ENRLNYPAVHISNVDARAFCQWHGWRLPTEPEWEFAARGSLE 192
>gi|194336982|ref|YP_002018776.1| hypothetical protein Ppha_1942 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194309459|gb|ACF44159.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 517
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 74/176 (42%), Positives = 94/176 (53%), Gaps = 20/176 (11%)
Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
NHPV+HVSWNDAVAYC W A RLPTEAEWE CR G F G NL+
Sbjct: 347 NHPVLHVSWNDAVAYCKWLSATTGEAFRLPTEAEWECACRAGTTTP-FNTGENLS---TG 402
Query: 231 RANVWQGEFPTNNTAADGYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---- 285
+AN + G +P + Y +T V S+ N +GLYNM GNV EW DW+ ++
Sbjct: 403 QAN-YDGNYPYKSNQKGVYRDNTVAVDSFSSNAWGLYNMHGNVLEWCGDWYGDKYYEECK 461
Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
NP GP TG+ +V +GG ++ +YC R A RS PD + +GFR
Sbjct: 462 TKGTVENPSGPATGSCRVLRGGRWVSYARYC---RSAFRSLGPPDYRSYFVGFRLV 514
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 85/155 (54%), Gaps = 18/155 (11%)
Query: 7 PPVE--RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV 64
PP++ + D VL+ G F MG+ + + + V + FYL ++ V+ F+EF+
Sbjct: 242 PPIQPKVFSDFVLIRGGIFPMGSPTGEVDRFDDETQHQVKVSDFYLCRYTVTVAGFREFI 301
Query: 65 SATGYVTEAEKFGDTFVFEPLLSEEERAKISQV--RHDMKRFEGLDSTIEHRMHHPVVHI 122
A+ Y T+AEK ++V+ + E K++ + R+ + E D+ +HPV+H+
Sbjct: 302 EASRYRTDAEKGDGSYVW----NGNEWNKMAGINWRYGISGSERNDA----EENHPVLHV 353
Query: 123 SWNDAVAYCTWRGA------RLPTEAEWEYGCRGG 151
SWNDAVAYC W A RLPTEAEWE CR G
Sbjct: 354 SWNDAVAYCKWLSATTGEAFRLPTEAEWECACRAG 388
>gi|451980968|ref|ZP_21929349.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
gi|451761800|emb|CCQ90594.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
Length = 265
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 69/170 (40%), Positives = 92/170 (54%), Gaps = 14/170 (8%)
Query: 174 HRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
+R NHPVV VSW DA+AY W+G RLPTEAEWE RG + RL+PWG+ A
Sbjct: 103 NRPNHPVVGVSWFDAMAYARWKGRRLPTEAEWEKAARGN-DTRLYPWGDKW-------AK 154
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWW-NVHHHPAPSY 290
+ F D Y TAPV +Y + FG++NM GNVWEW DW+ + ++ +P
Sbjct: 155 GFVLYFVNVFGVEDKYRHTAPVDAYPAGISPFGVFNMAGNVWEWCLDWYEDDYYRKSPEL 214
Query: 291 NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
NP+GP KV +GGS++ R R +N P++ GFR
Sbjct: 215 NPEGPEPTKMKVVRGGSWINTIDGA---RLVRRGRNFPNTKNMIYGFRTV 261
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 79/153 (51%), Gaps = 35/153 (22%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P E + MV +P F MG N ++G P +V A+++D++EVSN ++ +FV T
Sbjct: 32 PPEAPEGMVYIPEGYFAMGVNSGNASEEG--PQHHVYTSAYFIDRYEVSNAEYMKFVKDT 89
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G+ +PL E+ER +R +HPVV +SW DA
Sbjct: 90 GHP------------KPLFWEDERF--------------------NRPNHPVVGVSWFDA 117
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
+AY W+G RLPTEAEWE RG + RL+PWG
Sbjct: 118 MAYARWKGRRLPTEAEWEKAARGN-DTRLYPWG 149
>gi|363581291|ref|ZP_09314101.1| gliding motility-associated lipoprotein GldK [Flavobacteriaceae
bacterium HQM9]
Length = 454
Score = 113 bits (283), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 115/427 (26%), Positives = 167/427 (39%), Gaps = 94/427 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
M L+PG F MG + P + VT+ +FY+D+ E++N+++++F V T
Sbjct: 36 MTLIPGGAFIMGKSDDDKAGLLNAPPKTVTVRSFYMDETEITNSEYRQFTEWVKDSVVRT 95
Query: 68 GYVTEAEKFGDT---------FVFEPLLSEEERAKISQVRHDMKRFEGLDST-------- 110
A++ G T F F EEE + + D +EGL T
Sbjct: 96 QLAITADEVGQTGSDEDGIGQFAFRNTEDEEELSVYEKYVRD--NYEGLGETGYEGKALN 153
Query: 111 -----IEHRMHHPVVHISWNDAVAYC----------TWRGARLPTEAEWEYGCRGGLENR 155
IE +P + + Y T++ +L + +W Y +R
Sbjct: 154 KDVDLIEDLNEYPDEYYAEVMDSLYIRAEESHNGVRTFKTEKLVYDYKW-YDIEAMARSR 212
Query: 156 LFPWGSWLHPEGI----DSTI------------EHRM--------NHPVVHVSWNDAVAY 191
+L E + D+T+ H M ++PVV V+W A A+
Sbjct: 213 SKDSKKFLKREALKIYPDTTVWIRDFSYSYNEPMHDMYFTHTSYDDYPVVGVTWKQAKAF 272
Query: 192 CTWRGA------------------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
C WR RLPTEAEWEY RGGLE+ FPWG P ++
Sbjct: 273 CDWRTKNHNDALRKKKNKDQVSRYRLPTEAEWEYAARGGLESGTFPWGG---PYAKNDRG 329
Query: 234 VWQGEFP--TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSY 290
+ F + AAD L T SY N +GLYNM GNV EW ++ + S
Sbjct: 330 CFLANFKPMRGDYAADQALYTVEANSYDPNDYGLYNMAGNVSEWVGSTYDPASYEYTSSM 389
Query: 291 NPKGPTT-GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
NP KV +GGS+ + Y + + R DS+ +GFR D T T
Sbjct: 390 NPNSSDDENKRKVVRGGSW---KDVAYFLQVSTRDYEYADSARSYIGFRAVQDYMGTDVT 446
Query: 350 DKVKKGG 356
K G
Sbjct: 447 GNTKTTG 453
>gi|149919635|ref|ZP_01908114.1| hypothetical protein PPSIR1_07178 [Plesiocystis pacifica SIR-1]
gi|149819578|gb|EDM79006.1| hypothetical protein PPSIR1_07178 [Plesiocystis pacifica SIR-1]
Length = 289
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 92/330 (27%), Positives = 128/330 (38%), Gaps = 94/330 (28%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+++ + G +F MG D + P R VT+ F +++ EV+ T + E V A G
Sbjct: 52 NLIAITGGSFEMGCPAGAEACDADNPLRTVTVSDFAIERTEVTMTAYTECVDAGGCSATT 111
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
P+L + HP+ +SW A YC W
Sbjct: 112 NTDTSCTFGSPVL----------------------------LEHPINCVSWQQAADYCAW 143
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+G RLPTEAEWEY RG E R FPWG + A CT
Sbjct: 144 KGRRLPTEAEWEYAARGA-EGRTFPWG--------------------------ETPASCT 176
Query: 194 WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
+ +YGC G+ ANV G +P
Sbjct: 177 LAHMFMMMGDGGDYGCLTGI-----------------TANV--GTYP------------- 204
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGGSYLCNE 312
E L +M GNV EW ADW+ ++ PS +P+GP TGT +V +GG +L +
Sbjct: 205 ----MGETPETLADMAGNVDEWVADWYAADYYTTGPSADPQGPDTGTQRVHRGGDFL--D 258
Query: 313 QYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
R R ++ PD+ + GFRCAAD
Sbjct: 259 ASPNNLRSFERWRSNPDNDGPSRGFRCAAD 288
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 73/223 (32%), Positives = 94/223 (42%), Gaps = 65/223 (29%)
Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
+ HP+ VSW A YC W+G RLPTEAEWEY RG E R FPWG TP A++
Sbjct: 125 LEHPINCVSWQQAADYCAWKGRRLPTEAEWEYAARGA-EGRTFPWGE--TPASCTLAHM- 180
Query: 236 QGEFPTNNTAAD-GYLS--TAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVHHHP-APS 289
F D G L+ TA V +Y E L +M GNV EW ADW+ ++ PS
Sbjct: 181 ---FMMMGDGGDYGCLTGITANVGTYPMGETPETLADMAGNVDEWVADWYAADYYTTGPS 237
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
+P+ GP TGT
Sbjct: 238 ADPQ--------------------------------------------------GPDTGT 247
Query: 350 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 392
+V +GG +L + R R ++ PD+ + GFRCAAD
Sbjct: 248 QRVHRGGDFL--DASPNNLRSFERWRSNPDNDGPSRGFRCAAD 288
>gi|387133017|ref|YP_006298989.1| formylglycine-generating sulfatase enzyme [Prevotella intermedia
17]
gi|386375865|gb|AFJ08724.1| formylglycine-generating sulfatase enzyme [Prevotella intermedia
17]
Length = 293
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 71/172 (41%), Positives = 93/172 (54%), Gaps = 11/172 (6%)
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
R N PV+ VSW+DAVA+C W G RLPTEAEWEY C+ G F G+NLT AN
Sbjct: 130 RENRPVIDVSWDDAVAFCEWIGCRLPTEAEWEYACKSGTTTP-FNTGDNLT---TDEAN- 184
Query: 235 WQGEFPTNNTAADGYL-STAPVMSY-KENKFGLYNMVGNVWEWTADWWNVHHHPAPSY-N 291
+ G +P NN Y T PV + K N +GL++M GN+ EW +DW+ + +
Sbjct: 185 YDGNYPYNNNEKGTYRKKTVPVGTLGKPNAWGLFDMHGNIQEWCSDWYAEYSAADEVVTD 244
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
PKGP +GT ++ +GGS+ C R A R P S + GFR K
Sbjct: 245 PKGPDSGTYRILRGGSWTTRAMQC---RSAYRGGIEPSSRLDDAGFRIVLVK 293
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 38/146 (26%)
Query: 8 PVERYKDMVLLPGDTFRMGT--NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
P E + V +P +F MG+ +P + GE R V FY+ ++ ++ Q+ +F
Sbjct: 58 PKELGIEFVKIPAGSFLMGSPLEEPQR-ESGEIQHRVVLSKDFYMSKYPITFEQYDKFCE 116
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
ATG + D R + PV+ +SW+
Sbjct: 117 ATGR-----------------------------------QKPDDYTWGRENRPVIDVSWD 141
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGG 151
DAVA+C W G RLPTEAEWEY C+ G
Sbjct: 142 DAVAFCEWIGCRLPTEAEWEYACKSG 167
>gi|445439319|ref|ZP_21441656.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC021]
gi|444752273|gb|ELW76962.1| cysteine-type sulfatase aerobic maturase family protein
[Acinetobacter baumannii OIFC021]
Length = 335
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 68/182 (37%), Positives = 96/182 (52%), Gaps = 6/182 (3%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W P G + + + PV +V+ NDA Y W G LPTE EWEY + + P
Sbjct: 150 TWKTPNGKNGALP-PPHEPVRYVTKNDAEHYAVWLGRDLPTELEWEYAAKANSKTDT-PL 207
Query: 221 GNNLTPRGEH-RANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTAD- 278
T +H +AN WQG FP N D + APV +K N F L++M+GNVWEWT+
Sbjct: 208 HQAPTDEHQHPQANYWQGTFPFENLNHDHFTGVAPVGCFKSNGFKLFDMIGNVWEWTSSP 267
Query: 279 WWNVHHHPAPSYNP--KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLG 336
+ H +Y+ + + T V KGGS+LC E YC R+R ++R D +A ++G
Sbjct: 268 YQGAHDQHMGNYSDLRQKEMSSTQYVIKGGSFLCAENYCSRYRNSSRYPQDFDLAATHVG 327
Query: 337 FR 338
FR
Sbjct: 328 FR 329
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 66/136 (48%), Gaps = 2/136 (1%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F G+ K + F + + F++DQ EV+ QF FV ATGY+T+AE
Sbjct: 66 MVFIPDGQFNFGSEK-AYPDELNFGKKQREVKGFWIDQTEVTVAQFASFVKATGYITDAE 124
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
K VF P ++ + + K G + + H PV +++ NDA Y W
Sbjct: 125 KQKQAAVFTPDPRHPQQWWQLKSGYTWKTPNGKNGALP-PPHEPVRYVTKNDAEHYAVWL 183
Query: 135 GARLPTEAEWEYGCRG 150
G LPTE EWEY +
Sbjct: 184 GRDLPTELEWEYAAKA 199
>gi|302036551|ref|YP_003796873.1| hypothetical protein NIDE1190 [Candidatus Nitrospira defluvii]
gi|300604615|emb|CBK40947.1| conserved exported protein of unknown function, DUF323 [Candidatus
Nitrospira defluvii]
Length = 282
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 72/178 (40%), Positives = 94/178 (52%), Gaps = 17/178 (9%)
Query: 166 EGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLT 225
E +D H + PV+ SW DA AYC WRGARLPTEAEWE RG + RLFPWG+ T
Sbjct: 110 ESVDPAQHH--DRPVIGASWFDAEAYCRWRGARLPTEAEWEKAARGS-DGRLFPWGHR-T 165
Query: 226 PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVH 283
P E F + APV Y + +GL+ M GNV EW ADW+ +
Sbjct: 166 PTDEEANFGLGARFSYSQV-------LAPVQGYPSGRSPYGLHQMAGNVGEWVADWYGAN 218
Query: 284 HH-PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
++ + S NP GP GT +V +GGS+ +Y + RS+ P++ GFRCA
Sbjct: 219 YYETSASKNPAGPERGTFRVVRGGSWSDLPKYLLTY---GRSKLPPETRNSYTGFRCA 273
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 53/148 (35%), Positives = 69/148 (46%), Gaps = 37/148 (25%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MG N ++D E P V LD + +D +EV+ Q+ F+ E
Sbjct: 50 MVSIPEGDFLMGANGMDALED-EKPQHRVWLDWYQIDTYEVTTGQYAAFLVQ-------E 101
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH--PVVHISWNDAVAYCT 132
K + +E +++ HH PV+ SW DA AYC
Sbjct: 102 KVSPPWQWE--------------------------SVDPAQHHDRPVIGASWFDAEAYCR 135
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
WRGARLPTEAEWE RG + RLFPWG
Sbjct: 136 WRGARLPTEAEWEKAARGS-DGRLFPWG 162
>gi|254495108|ref|ZP_05108032.1| gliding motility protein GldK [Polaribacter sp. MED152]
gi|85819458|gb|EAQ40615.1| gliding motility protein GldK [Polaribacter sp. MED152]
Length = 453
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 118/429 (27%), Positives = 170/429 (39%), Gaps = 115/429 (26%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
MVL+PG +F MG +I P++ VT+ +++D+ E++N +++EF V T
Sbjct: 42 MVLIPGGSFTMGKQDEDIIGTMNSPTKTVTVRPYFMDETEITNNEYKEFVFWVRDSVVRT 101
Query: 68 GYVTEAE-----------------------KFGDTF------------------VFEPLL 86
+AE KF DT +PL
Sbjct: 102 KLAYQAEFAALGSEPDPTGNNNRATGIQLYKFKDTVENSTPYQRYMYENYYQFDTIQPLN 161
Query: 87 SEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV-AYCTWRGARLPTEAEW- 144
EE+ + DM E +DS ++IS DAV T+ L + W
Sbjct: 162 WEEDLIWKKEEYPDMDYVEVMDS----------LYISREDAVDGVRTFNTKFLKYKYSWF 211
Query: 145 --EYGCRGGLENRLFPWGSWL--HPEGIDSTI--------------------EHRMNHPV 180
+ R G + + F L +P D+T+ + ++PV
Sbjct: 212 DRDNAARKGGDRKDFVQTEILNIYP---DTTVWVKDFNYSYNDPMHQDYFSHQSYGDYPV 268
Query: 181 VHVSWNDAVAYCTWR------------------GARLPTEAEWEYGCRGGLENRLFPWGN 222
V V+W A A+C WR RLPTEAEWEY RGGLE +PWG
Sbjct: 269 VGVTWGQANAFCNWRTKKKNDYLKNKKNAAQVPAFRLPTEAEWEYAARGGLEFATYPWGT 328
Query: 223 NLTP--RGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW 280
T RG AN + N A DG L T SY N +GLYNM GNV EWT +
Sbjct: 329 GSTTSDRGCFLANF---KPVRGNYAVDGALYTMEAESYNANDYGLYNMAGNVSEWTNTAY 385
Query: 281 NV-HHHPAPSYNPK-GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
N+ ++ A + NP K+ +GGS+ + Y +R D++ +GFR
Sbjct: 386 NLSSYYMASTMNPNVEDRKNKRKIVRGGSW---KDVAYYLEVGSRDYEYADTARSYIGFR 442
Query: 339 CAADKGPTT 347
+ TT
Sbjct: 443 TVQNYIGTT 451
>gi|21637141|gb|AAM70357.1|AF505622_29 CalU17 [Micromonospora echinospora]
Length = 316
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 110/347 (31%), Positives = 148/347 (42%), Gaps = 80/347 (23%)
Query: 14 DMVLLPGDTFRMGTNKPILI---KDG----------EFPSRNVTLDAFYLDQHEVSNTQF 60
DMV +PG TF G+ + L ++G E P VTL + +D+H+V+ QF
Sbjct: 20 DMVRIPGGTFLQGSPERTLDWLDREGQAFPRDWFTDETPQIPVTLPDYLIDRHQVTVAQF 79
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
FVS TGYVT AE+ G + V+ E R G S I R HPVV
Sbjct: 80 AAFVSRTGYVTSAERAGGSMVYGEQYWEIREGAC------WHRPAGYGSGIRGRDDHPVV 133
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
HIS+ DA AY W G RLPTE+EWE G RL+PWG
Sbjct: 134 HISFADAEAYARWAGRRLPTESEWERAAT-GPSYRLWPWGD------------------- 173
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+W+ +R AE G G L+ WG + QG P
Sbjct: 174 ---TWD----------SRNANTAEHTAGALGDLDAWRTWWGA---------IHAVQGPMP 211
Query: 241 TNNTAADGYLSTAPVMSYK---ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK-GPT 296
T PV ++ ++ G +M GNV+EWT+ ++ + PA +P
Sbjct: 212 ----------QTTPVGAFSPRGDSVDGCADMTGNVYEWTSTLAHL-YSPATRCDPTIHLV 260
Query: 297 TGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS-SAGNLGFRCAAD 342
G +V +GGS++ + Y+ RCA R P S LGFRCA D
Sbjct: 261 MGRSRVIRGGSWM---NFRYQVRCAERLYGDPTGWSNFALGFRCARD 304
>gi|340349800|ref|ZP_08672803.1| serine/threonine kinase [Prevotella nigrescens ATCC 33563]
gi|339610003|gb|EGQ14864.1| serine/threonine kinase [Prevotella nigrescens ATCC 33563]
Length = 291
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 70/171 (40%), Positives = 92/171 (53%), Gaps = 11/171 (6%)
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
R N PV+ VSW+DA+A+C W G RLPTEAEWEY C+ G F GNNLT +
Sbjct: 130 RENRPVIDVSWDDAIAFCEWIGCRLPTEAEWEYACKAGTTTP-FNTGNNLTTNDAN---- 184
Query: 235 WQGEFPTNNTAADGYL-STAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
+ G +P NN Y T V + E N +GLY+M GNV EW ADW++ + + +P
Sbjct: 185 YDGNYPYNNNEKGDYRKKTVAVGTIGEPNAWGLYDMHGNVQEWCADWYDDYPQTEQT-DP 243
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
KGP GT ++ +GGS+ C R A R + S + GFR K
Sbjct: 244 KGPENGTYRILRGGSWTSRAMQC---RSAYRGGSEHSSRLDDAGFRVVLMK 291
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 43/148 (29%), Positives = 64/148 (43%), Gaps = 36/148 (24%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
P P + + V +P +F MG+ N + GE + FY+ ++ ++ Q+ +F
Sbjct: 55 PVTPKDLGIEFVKVPAGSFLMGSPNNEPQRESGEIQHKVTITKDFYMSKYPITFEQYDKF 114
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
ATG R K S R + PV+ +S
Sbjct: 115 CEATG----------------------RQKPSDYTW-------------GRENRPVIDVS 139
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGG 151
W+DA+A+C W G RLPTEAEWEY C+ G
Sbjct: 140 WDDAIAFCEWIGCRLPTEAEWEYACKAG 167
>gi|421854856|ref|ZP_16287241.1| hypothetical protein ACRAD_02_02150 [Acinetobacter radioresistens
DSM 6976 = NBRC 102413]
gi|403189871|dbj|GAB73442.1| hypothetical protein ACRAD_02_02150 [Acinetobacter radioresistens
DSM 6976 = NBRC 102413]
Length = 270
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 71/185 (38%), Positives = 99/185 (53%), Gaps = 10/185 (5%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W PEG + R N V +++ NDA Y W G LP+E EWEY +G E + P
Sbjct: 85 NWRLPEGPEGPPA-RANEAVRYITKNDAEHYALWLGHDLPSEVEWEYAAKGNDETDV-PL 142
Query: 221 GNNLTPRG-EHR--ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+ PR +HR AN WQG+FP NT D + APV + N FGLY+M+GNVWEWT+
Sbjct: 143 KHE--PRNHQHRPDANYWQGDFPFQNTQEDHFKGIAPVGCFSPNAFGLYDMIGNVWEWTS 200
Query: 278 DWWN-VHHHPAPSYNPKGPTTGTDK--VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
+ H +Y+ + + V KGGS+LC QYC R+R ++R + + +
Sbjct: 201 SVYTGAHDEHMGNYSALRTSQHIPQSFVIKGGSFLCASQYCARYRKSSRHPQEFNLATTH 260
Query: 335 LGFRC 339
GFR
Sbjct: 261 TGFRT 265
>gi|332663535|ref|YP_004446323.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332332349|gb|AEE49450.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 505
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 94/340 (27%), Positives = 149/340 (43%), Gaps = 85/340 (25%)
Query: 15 MVLLPGDTFRMGTNKPILIKDG-----EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+V + G +RMG I +DG E P V++ +FYL ++EV+N +F +F++A
Sbjct: 234 LVKIDGGKYRMGC---IEARDGSCYSNETPDHEVSISSFYLSKYEVTNAEFVQFLNAISA 290
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
+ GD ++ K FE
Sbjct: 291 QISIDANGDGVNYQ----------------GKKIFE-----------------------L 311
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+C + + GC G E + + P G + + HP V V+ A
Sbjct: 312 FC----------GDKKGGCSGFKEQ--IEYSAADRPGGRFTVVSGFEKHPAVMVTKYGAE 359
Query: 190 AYCTW------RGARLPTEAEWEYGCRGGLENRLFPWG--NNLTPRGEHRANVWQGEFPT 241
AY W + RLPTEAEWE+ RGG++++ + + N+L G W + +
Sbjct: 360 AYSAWLSSKTGKSYRLPTEAEWEFVARGGIQSQRYKYAGSNDLDAVG------W---YSS 410
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTD 300
N+++A PV N+ GLY++ GNVWEW +DW+ ++ +P+ NPKGP++GT+
Sbjct: 411 NSSSA-----AHPVGLKNANELGLYDLSGNVWEWCSDWYEKEYYKNSPASNPKGPSSGTN 465
Query: 301 KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
V +GGS+ + C R A R N P +GFR A
Sbjct: 466 SVLRGGSWYVIDINC---RLAIRGINDPGYCGNVIGFRVA 502
>gi|78189652|ref|YP_379990.1| hypothetical protein Cag_1696 [Chlorobium chlorochromatii CaD3]
gi|78171851|gb|ABB28947.1| conserved hypothetical protein [Chlorobium chlorochromatii CaD3]
Length = 751
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 78/206 (37%), Positives = 108/206 (52%), Gaps = 30/206 (14%)
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
+R+ T EW+Y + G+ +W G +++ NHPV+HVSWNDA AYC W
Sbjct: 538 SSRIWTGKEWKY--KAGV--------NWRCGVGGQLRLQNEENHPVIHVSWNDAKAYCDW 587
Query: 195 ------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADG 248
+ RLPTEAEWEY CR G F G+NLT +AN + G +P N A
Sbjct: 588 LSKKTGKKYRLPTEAEWEYACRAGTTTP-FNTGDNLTT---AQAN-YDGNYPYNGNAKGK 642
Query: 249 Y-LSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----PSYNPKGPTTGTDKV 302
Y +T PV S+ N +GLYNM GNVWEW +DW+N ++ NP+ + +V
Sbjct: 643 YRQTTVPVDSFAPNAWGLYNMHGNVWEWCSDWYNDKYYEVCKAKGVVENPECTEEQSYRV 702
Query: 303 KKGGSYLCNEQYCYRHRCAARSQNTP 328
+GGS+ + + C R A R+ P
Sbjct: 703 LRGGSWGNDARSC---RSAIRNLLRP 725
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 75/144 (52%), Gaps = 12/144 (8%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D V + G F MG+ + + V L FY+ ++ V+ +F++F+ GY T+A
Sbjct: 474 DYVPIKGGEFTMGSPDGEVDRSSTETQHQVKLHNFYIAKYVVTVAEFRKFIEECGYKTDA 533
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
EK + ++ +E + K + + G +++ +HPV+H+SWNDA AYC W
Sbjct: 534 EKANSSRIWT---GKEWKYKAGV---NWRCGVGGQLRLQNEENHPVIHVSWNDAKAYCDW 587
Query: 134 ------RGARLPTEAEWEYGCRGG 151
+ RLPTEAEWEY CR G
Sbjct: 588 LSKKTGKKYRLPTEAEWEYACRAG 611
>gi|293607291|ref|ZP_06689632.1| DNA replication and repair protein RecF [Achromobacter piechaudii
ATCC 43553]
gi|292814383|gb|EFF73523.1| DNA replication and repair protein RecF [Achromobacter piechaudii
ATCC 43553]
Length = 355
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 110/374 (29%), Positives = 150/374 (40%), Gaps = 110/374 (29%)
Query: 5 PAPPVERYKDM--VLLPGDTFRMGTNKPI--LIKD-------------GEFPSRNVTLD- 46
PA VE M V +P TF+MG+ + L +D E P V +
Sbjct: 48 PAALVENSLGMAFVAVPAGTFQMGSEESADSLGRDYPQYPMERFAQLFDEAPVHTVRITR 107
Query: 47 AFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDM 101
FY+ + EV+ QF+ F+ A+GYV E+E +G ++P SE A
Sbjct: 108 PFYMGRTEVTVGQFRRFIEASGYVPESEADGTGGYGYNAAYDPDKSERGDA--------- 158
Query: 102 KRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
FEG D S
Sbjct: 159 --FEGRDRRY-------------------------------------------------S 167
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLEN 215
W +P G T EH PVV++SWNDA A W RLPTEAEWEY CR G
Sbjct: 168 WRNP-GFAQTDEH----PVVNISWNDATALARWLTQQEGHVYRLPTEAEWEYACRAGTRT 222
Query: 216 RLFPWGNNLTPRGEHRAN---------VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLY 266
R + N P G +W DG TAPV SY N FGLY
Sbjct: 223 R---YQNGDAPDGMAEVGNGFDADAEPLWPKWHAFATPRHDGNTFTAPVASYAVNAFGLY 279
Query: 267 NMVGNVWEWTADWWNVHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQ 325
+M GN WEW +DW+ ++ +P+ +P+GP +G +V++GGS+ Y R + R+
Sbjct: 280 DMHGNAWEWVSDWYGEDYYAHSPTDDPEGPASGNVRVRRGGSWHTWALYT---RSSYRNW 336
Query: 326 NTPDSSAGNLGFRC 339
NT ++ +G R
Sbjct: 337 NTQETRYPLVGLRL 350
>gi|134094839|ref|YP_001099914.1| hypothetical protein HEAR1630 [Herminiimonas arsenicoxydans]
gi|133738742|emb|CAL61789.1| conserved hypothetical protein; putative C-type lectin domain;
putative exported protein [Herminiimonas arsenicoxydans]
Length = 328
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/178 (36%), Positives = 98/178 (55%), Gaps = 20/178 (11%)
Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
+HPVV+V+WNDA+A C W + RLPTEAEWEY R G + + + + G
Sbjct: 151 DHPVVNVTWNDALAMCKWLSQVEGKTYRLPTEAEWEYAARAGTSTQYYTGDDPQSLLGA- 209
Query: 231 RANVWQGEFPTN--------NTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
AN++ + N D + T+PV S+ N FGLY++ GN WEWTADW +
Sbjct: 210 -ANIFDADAAKNWGRWSSYALAGHDNFAFTSPVGSFAPNAFGLYDIHGNAWEWTADWHDD 268
Query: 283 HHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
H++ +P +P+GP TG +V++GGS+ + + R + R+ N PD+ +G R
Sbjct: 269 HYYADSPLNDPQGPATGNVRVRRGGSW---HTWSFYARSSYRNWNAPDTRYTLVGMRL 323
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 75/166 (45%), Gaps = 23/166 (13%)
Query: 15 MVLLPGDTFRMGTNKP---------------ILIKDGEFPSRNVTLD-AFYLDQHEVSNT 58
VL+P F MG++ P L E P V + AFYL QHEV+
Sbjct: 33 FVLIPAGEFMMGSDVPADKFAQIYPQYERKRFLELGDEAPVHKVRITRAFYLGQHEVTVG 92
Query: 59 QFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
QF+ F+ A+GY E+E G + + P A+ ++ + H
Sbjct: 93 QFRRFLLASGYQPESEADGSGGYGYNPAYDPALSARGDAFEGRNPKYSWRNPGFAQDDDH 152
Query: 118 PVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLF 157
PVV+++WNDA+A C W + RLPTEAEWEY R G + +
Sbjct: 153 PVVNVTWNDALAMCKWLSQVEGKTYRLPTEAEWEYAARAGTSTQYY 198
>gi|189501384|ref|YP_001960854.1| hypothetical protein Cphamn1_2479 [Chlorobium phaeobacteroides BS1]
gi|189496825|gb|ACE05373.1| protein of unknown function DUF323 [Chlorobium phaeobacteroides
BS1]
Length = 398
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 74/175 (42%), Positives = 94/175 (53%), Gaps = 20/175 (11%)
Query: 178 HPVVHVSWNDAVAYCTW----RGA--RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
+P+V+VSWNDAVAYC W RG RLPTEAEWEY CR G F G NLT +
Sbjct: 225 YPLVNVSWNDAVAYCRWLSEKRGELFRLPTEAEWEYACRAGTRTP-FSTGENLT---TYE 280
Query: 232 ANVWQGEFPTNNTAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH----- 285
AN + G +P N Y T PV S+K N +GLYNM GNVWEW D + ++
Sbjct: 281 AN-YDGNYPYRNNPKGMYRKKTVPVDSFKPNSYGLYNMHGNVWEWCGDRYGAGYYEECRK 339
Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
NP+G G++ V +GG +L Q+C R A R + P + +GFR
Sbjct: 340 QGIMENPQGAEKGSNSVLRGGGWLTYAQHC---RSAYRHFDGPVYRSNYVGFRLV 391
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/41 (65%), Positives = 31/41 (75%), Gaps = 6/41 (14%)
Query: 117 HPVVHISWNDAVAYCTW----RGA--RLPTEAEWEYGCRGG 151
+P+V++SWNDAVAYC W RG RLPTEAEWEY CR G
Sbjct: 225 YPLVNVSWNDAVAYCRWLSEKRGELFRLPTEAEWEYACRAG 265
>gi|347754869|ref|YP_004862433.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
B]
gi|347587387|gb|AEP11917.1| Uncharacterized conserved protein [Candidatus Chloracidobacterium
thermophilum B]
Length = 271
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 69/169 (40%), Positives = 86/169 (50%), Gaps = 6/169 (3%)
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
R HPVV V+WNDAVAY W G RLPTEAEWE+ R G + T AN
Sbjct: 97 REQHPVVAVTWNDAVAYTRWAGKRLPTEAEWEFAARAGRPTAQYGTAGTGTVLAGA-ANF 155
Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPK 293
G + T PV ++ N FGLY++ GN EW DW++ ++ PS+NP
Sbjct: 156 GHIAVNPKRMPLRG-IQTTPVATFAPNPFGLYDLSGNAAEWCQDWFDADYYAFGPSHNPP 214
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GGS+ E Y R R P + A LGFRCAAD
Sbjct: 215 GPDFGRGRVVRGGSWNDREDYLQLTR---RYGLPPGTIAFTLGFRCAAD 260
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 62/137 (45%), Gaps = 39/137 (28%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+PG F + +D + F+LD+ EV+N +F+ FV ATGY +E
Sbjct: 37 MVLVPGGAF--------VRRDDAGRGHLTVVAPFFLDRTEVTNAEFRRFVMATGYQSEG- 87
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
R + G R HPVV ++WNDAVAY W
Sbjct: 88 -----------------------RWALYATPG-------REQHPVVAVTWNDAVAYTRWA 117
Query: 135 GARLPTEAEWEYGCRGG 151
G RLPTEAEWE+ R G
Sbjct: 118 GKRLPTEAEWEFAARAG 134
>gi|152981513|ref|YP_001353598.1| hypothetical protein mma_1908 [Janthinobacterium sp. Marseille]
gi|151281590|gb|ABR90000.1| Uncharacterized conserved protein [Janthinobacterium sp. Marseille]
Length = 326
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 93/176 (52%), Gaps = 16/176 (9%)
Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWG------NNL 224
+HPVV+V+WNDAVA C W R RLPTEAEWEY R G R + N
Sbjct: 149 SHPVVNVTWNDAVAMCKWLSRTEGRTYRLPTEAEWEYAGRAGTRTRYYSGDDPQSLLNAA 208
Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
A WQ DG+ TAPV S+ N FGLY++ GN WEWTADW + +
Sbjct: 209 NVFDADAAKNWQQWKKFALDGPDGFAFTAPVGSFAPNAFGLYDIHGNAWEWTADWHDERY 268
Query: 285 HP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+ +P +P+GP GT +V++GGS+ + + R + R+ N+ D+ +G R
Sbjct: 269 YARSPVDDPQGPADGTVRVRRGGSW---HTWSFYARSSYRNWNSQDTRYTLVGMRL 321
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 76/166 (45%), Gaps = 23/166 (13%)
Query: 15 MVLLPGDTFRMGTN---------------KPILIKDGEFPSRNVTLD-AFYLDQHEVSNT 58
VL+P F MG++ K + E P+ V + FY Q+EV+
Sbjct: 31 FVLIPAGEFVMGSDESSEAMLRHYPQYDHKRFVELSDEGPAHKVRITHPFYFGQYEVTVG 90
Query: 59 QFQEFVSATGYVTEAEKFGDT-FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
+F+ FV A+GY+ EAE G + + P + A+ ++ + H
Sbjct: 91 EFRRFVEASGYLPEAEADGTGGYGYNPDYDPAKSARGDAFEGRNPKYSWRNPGFPQDDSH 150
Query: 118 PVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLF 157
PVV+++WNDAVA C W R RLPTEAEWEY R G R +
Sbjct: 151 PVVNVTWNDAVAMCKWLSRTEGRTYRLPTEAEWEYAGRAGTRTRYY 196
>gi|406592493|ref|YP_006739673.1| hypothetical protein B711_0714 [Chlamydia psittaci CP3]
gi|405788365|gb|AFS27108.1| hypothetical protein B711_0714 [Chlamydia psittaci CP3]
Length = 618
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 86/167 (51%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE G GG+ + +P G + + AN
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVGAYGGVAQQRYPCGEEI---DKSLANF-- 521
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
F ++ TA VMSY N +GLY+M GNV+EW DW++ + S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 88/190 (46%), Gaps = 23/190 (12%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + +MV + G TF G+ + + E P + L +F+LD H V+N QF ++ +
Sbjct: 377 PQPLFTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G +E +K+ + + R K S++ +R G HPVV ++W A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
Y W G RLPTEAEWE G GG+ + +P G E ID ++ + + V
Sbjct: 479 SGYAAWVGKRLPTEAEWEVGAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533
Query: 188 AVAYCTWRGA 197
A AY + A
Sbjct: 534 ANAYGLYDMA 543
>gi|421465527|ref|ZP_15914214.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
radioresistens WC-A-157]
gi|400203794|gb|EJO34779.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
radioresistens WC-A-157]
Length = 327
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 72/190 (37%), Positives = 103/190 (54%), Gaps = 11/190 (5%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W PEG + R N V +++ NDA Y W G LP+E EWEY +G E + P
Sbjct: 142 NWRLPEGPEGPPA-RANEAVRYITKNDAEHYALWLGHDLPSEVEWEYAAKGNDETDV-PL 199
Query: 221 GNNLTPRG-EHR--ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+ PR +HR AN WQG+FP +N+ D + APV + N FGLY+M+GNVWEWT+
Sbjct: 200 KHE--PRNHQHRPDANYWQGDFPFHNSQEDHFKGIAPVGCFSPNAFGLYDMIGNVWEWTS 257
Query: 278 DWWN-VHHHPAPSYNPKGPTTGTDK--VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
+ H +Y+ + + V KGGS+LC QYC R+R ++R + + +
Sbjct: 258 SVYTGAHDEHMGNYSALRTSQHIPQSFVIKGGSFLCASQYCARYRKSSRHPQEFNLATTH 317
Query: 335 LGFRCAADKG 344
GFR D+G
Sbjct: 318 TGFR-TIDRG 326
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 79/151 (52%), Gaps = 3/151 (1%)
Query: 4 LPAPPVER-YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
LP+ +E+ + M+ +P ++G+ + + F ++ + F++DQ EV+ QF
Sbjct: 46 LPSGWLEQAHAGMLYIPEGEVKLGS-ELAYPDEINFGAQKRKVKGFWIDQTEVTVAQFAS 104
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
F++AT Y+T+AEK + VFE ++ + ++ + EG + R + V +I
Sbjct: 105 FIAATHYITDAEKQNEAAVFEADEQSPQQWWKLKSGYNWRLPEGPEGPPA-RANEAVRYI 163
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 153
+ NDA Y W G LP+E EWEY +G E
Sbjct: 164 TKNDAEHYALWLGHDLPSEVEWEYAAKGNDE 194
>gi|359800113|ref|ZP_09302664.1| hypothetical protein KYC_24182 [Achromobacter arsenitoxydans SY8]
gi|359361911|gb|EHK63657.1| hypothetical protein KYC_24182 [Achromobacter arsenitoxydans SY8]
Length = 330
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 70/182 (38%), Positives = 94/182 (51%), Gaps = 22/182 (12%)
Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR---------LFPWG 221
+HPVV++SWNDA A W R RLPTEAEWEY CR G R + G
Sbjct: 153 DHPVVNISWNDATALARWLTQKEGRVYRLPTEAEWEYACRAGTRTRYQNGDDPAAMPDVG 212
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
N + WQ P + DG TAPV SY N FGLY+M GN WEW ADW+
Sbjct: 213 NGFDADAQPLWPKWQ---PYASVRHDGNTFTAPVASYSPNAFGLYDMHGNAWEWVADWYG 269
Query: 282 VHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
++ +P +P+GP TG +V++GGS+ Y R + R+ NT ++ +G R
Sbjct: 270 EDYYARSPVDDPQGPATGNVRVRRGGSWHTWALYT---RSSYRNWNTQETRYPLVGLRLL 326
Query: 341 AD 342
++
Sbjct: 327 SE 328
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 84/180 (46%), Gaps = 35/180 (19%)
Query: 7 PPVERYKDM----VLLPGDTFRMGTNK---------PILIKD------GEFPSRNVTLD- 46
PPV+ + V +P TF+MG+++ P D E P V +
Sbjct: 23 PPVQFENSLGMTFVRVPAGTFQMGSDEAPDTLAAAYPQYPADRFAQLFDEAPVHTVRITR 82
Query: 47 AFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDM 101
FYL++ EV+ QF+ F+ A+GYV E+E +G ++P S+ A + R
Sbjct: 83 PFYLERTEVTVGQFRRFIEASGYVPESEADGTGGYGYNAAYDPEKSQSGDAFEGRDR--- 139
Query: 102 KRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR 155
R+ + HPVV+ISWNDA A W R RLPTEAEWEY CR G R
Sbjct: 140 -RYSWRNPGFAQGDDHPVVNISWNDATALARWLTQKEGRVYRLPTEAEWEYACRAGTRTR 198
>gi|255319125|ref|ZP_05360343.1| sulfatase-modifying factor 1 [Acinetobacter radioresistens SK82]
gi|262379257|ref|ZP_06072413.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
gi|255303771|gb|EET82970.1| sulfatase-modifying factor 1 [Acinetobacter radioresistens SK82]
gi|262298714|gb|EEY86627.1| conserved hypothetical protein [Acinetobacter radioresistens SH164]
Length = 327
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 72/190 (37%), Positives = 103/190 (54%), Gaps = 11/190 (5%)
Query: 161 SWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPW 220
+W PEG + R N V +++ NDA Y W G LP+E EWEY +G E + P
Sbjct: 142 NWRLPEGPEGPPA-RANEAVRYITKNDAEHYALWLGHDLPSEVEWEYAAKGNDETDV-PL 199
Query: 221 GNNLTPRG-EHR--ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTA 277
+ PR +HR AN WQG+FP +N+ D + APV + N FGLY+M+GNVWEWT+
Sbjct: 200 KHE--PRNHQHRPDANYWQGDFPFHNSQEDHFKGIAPVGCFSPNAFGLYDMIGNVWEWTS 257
Query: 278 DWWN-VHHHPAPSYNPKGPTTGTDK--VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGN 334
+ H +Y+ + + V KGGS+LC QYC R+R ++R + + +
Sbjct: 258 SVYTGAHDEHMGNYSALRTSQHIPQSFVIKGGSFLCASQYCARYRKSSRHPQEFNLATTH 317
Query: 335 LGFRCAADKG 344
GFR D+G
Sbjct: 318 TGFR-TIDRG 326
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 79/151 (52%), Gaps = 3/151 (1%)
Query: 4 LPAPPVER-YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQE 62
LP+ +E+ + M+ +P ++G+ + + F ++ + F++DQ EV+ QF
Sbjct: 46 LPSGWLEQAHAGMLYIPKGEVKLGS-ELAYPDEINFGAQKRKVKGFWIDQTEVTVAQFAS 104
Query: 63 FVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
F++AT Y+T+AEK + +FE ++ + ++ + EG + R + V +I
Sbjct: 105 FIAATHYITDAEKQNEAAIFEADEQSPQQWWKLKSGYNWRLPEGPEGPPA-RANEAVRYI 163
Query: 123 SWNDAVAYCTWRGARLPTEAEWEYGCRGGLE 153
+ NDA Y W G LP+E EWEY +G E
Sbjct: 164 TKNDAEHYALWLGHDLPSEVEWEYAAKGNDE 194
>gi|445114478|ref|ZP_21377940.1| hypothetical protein HMPREF0662_00992 [Prevotella nigrescens F0103]
gi|444840707|gb|ELX67733.1| hypothetical protein HMPREF0662_00992 [Prevotella nigrescens F0103]
Length = 291
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 69/171 (40%), Positives = 92/171 (53%), Gaps = 11/171 (6%)
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
R N PV+ V+W+DA+A+C W G RLPTEAEWEY C+ G F GNNLT +
Sbjct: 130 RENRPVIDVNWDDAIAFCEWIGCRLPTEAEWEYACKAGTTTP-FNTGNNLTTNDAN---- 184
Query: 235 WQGEFPTNNTAADGYL-STAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
+ G +P NN Y T V + E N +GLY+M GNV EW ADW++ + + +P
Sbjct: 185 YDGNYPYNNNEKGDYRKKTVAVGTIGEPNAWGLYDMHGNVQEWCADWYDDYPQTEQT-DP 243
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
KGP GT ++ +GGS+ C R A R + S + GFR K
Sbjct: 244 KGPENGTYRILRGGSWTSRAMQC---RSAYRGGSEHSSRLDDAGFRVVLMK 291
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 64/148 (43%), Gaps = 36/148 (24%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
P P + + V +P +F MG+ N + GE + FY+ ++ ++ Q+ +F
Sbjct: 55 PVTPKDLGIEFVKVPAGSFLMGSPNDEPQRESGEIQHKVTITKDFYMSKYPITFEQYDKF 114
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
ATG R K S R + PV+ ++
Sbjct: 115 CEATG----------------------RQKPSDYTW-------------GRENRPVIDVN 139
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGG 151
W+DA+A+C W G RLPTEAEWEY C+ G
Sbjct: 140 WDDAIAFCEWIGCRLPTEAEWEYACKAG 167
>gi|406594806|ref|YP_006741779.1| hypothetical protein B599_0664 [Chlamydia psittaci MN]
gi|410858565|ref|YP_006974505.1| putative serine/threonine protein kinase [Chlamydia psittaci
01DC12]
gi|405783238|gb|AFS21986.1| hypothetical protein B599_0664 [Chlamydia psittaci MN]
gi|410811460|emb|CCO02111.1| putative serine/threonine protein kinase [Chlamydia psittaci
01DC12]
Length = 618
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 65/167 (38%), Positives = 86/167 (51%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE G GG+ + +P G + + AN
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVGAYGGVAQQRYPCGEEI---DKSLANF-- 521
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
F ++ TA VMSY N +GLY+M GNV+EW DW++ + S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 88/190 (46%), Gaps = 23/190 (12%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + +MV + G TF G+ + + E P + L +F+LD H V+N QF ++ +
Sbjct: 377 PQPLFTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G +E +K+ + + R K S++ +R G HPVV ++W A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
Y W G RLPTEAEWE G GG+ + +P G E ID ++ + + V
Sbjct: 479 SGYAAWVGKRLPTEAEWEVGAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533
Query: 188 AVAYCTWRGA 197
A AY + A
Sbjct: 534 ANAYGLYDMA 543
>gi|340353143|ref|ZP_08675971.1| serine/threonine kinase [Prevotella pallens ATCC 700821]
gi|339611339|gb|EGQ16169.1| serine/threonine kinase [Prevotella pallens ATCC 700821]
Length = 291
Score = 111 bits (277), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 68/171 (39%), Positives = 91/171 (53%), Gaps = 11/171 (6%)
Query: 175 RMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANV 234
R N PV+ VSW+DA A+C W G RLPTEAEWEY C+ G + GNN+T +
Sbjct: 130 RENRPVIDVSWDDATAFCQWIGCRLPTEAEWEYACKAGTTTPFYT-GNNITTNDAN---- 184
Query: 235 WQGEFPTNNTAADGY-LSTAPVMSYKE-NKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
+ G +P NN Y T V + E N +GLY+M GNV EW ADW++ + +P
Sbjct: 185 YDGNYPYNNNDQGEYRKKTVAVGTVGEPNSWGLYDMHGNVQEWCADWYD-DYPQGEQTDP 243
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADK 343
KGP GT ++ +GGS+ C R A R + +S + GFR K
Sbjct: 244 KGPDNGTYRILRGGSWTSRAMQC---RSAYRGGSEHNSRLDDAGFRIVLIK 291
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 64/154 (41%), Gaps = 36/154 (23%)
Query: 5 PAPPVERYKDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
P P + + V +P +F MG+ N + GE + FY+ ++ ++ Q+ +F
Sbjct: 55 PVTPKDLGIEFVKIPAGSFLMGSPNDEPQRESGEIQHKVTITKDFYMSKYPITFEQYDKF 114
Query: 64 VSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHIS 123
ATG R K S R + PV+ +S
Sbjct: 115 CEATG----------------------RQKPSDYTW-------------GRENRPVIDVS 139
Query: 124 WNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLF 157
W+DA A+C W G RLPTEAEWEY C+ G +
Sbjct: 140 WDDATAFCQWIGCRLPTEAEWEYACKAGTTTPFY 173
>gi|29840376|ref|NP_829482.1| protein kinase [Chlamydophila caviae GPIC]
gi|55583926|sp|Q822R1.1|PKN1_CHLCV RecName: Full=Serine/threonine-protein kinase pkn1
gi|29834725|gb|AAP05360.1| protein kinase, putative [Chlamydophila caviae GPIC]
Length = 618
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 84/167 (50%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y +W G RLPTEAEWE GG+ +P G + + +AN +
Sbjct: 467 KHPVVGVTWYGASGYASWVGKRLPTEAEWEIASCGGVTQLRYPCGEEI---DKSQANFFS 523
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP---SYNPK 293
+ T PVMSY N +GLY+M GNV+EW DW+ + S+ P+
Sbjct: 524 SD-------------TTPVMSYPANPYGLYDMAGNVYEWCEDWYGYDFYEISAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 73/153 (47%), Gaps = 18/153 (11%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + +MV + G F G+ + + E P + L +F+LD H V+N QF ++ +
Sbjct: 377 PQPLFTEMVFIEGGKFLRGSREG---QRDEHPVHEIFLHSFFLDIHPVTNEQFVRYLECS 433
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G +E +K+ + + R K S++ +R G HPVV ++W A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
Y +W G RLPTEAEWE GG+ +P G
Sbjct: 479 SGYASWVGKRLPTEAEWEIASCGGVTQLRYPCG 511
>gi|297621236|ref|YP_003709373.1| serine/threonine-protein kinase pkn1 [Waddlia chondrophila WSU
86-1044]
gi|297376537|gb|ADI38367.1| putative serine/threonine-protein kinase pkn1 [Waddlia chondrophila
WSU 86-1044]
gi|337293465|emb|CCB91454.1| Serine/threonine-protein kinase pkn1 [Waddlia chondrophila 2032/99]
Length = 599
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 88/170 (51%), Gaps = 22/170 (12%)
Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
+ HPVV V+W AVAY W G RLPTEAEWE + ++ +P GN++ + +AN +
Sbjct: 447 LKHPVVGVTWYGAVAYAKWLGKRLPTEAEWEVASQSHKDHAEYPTGNDIE---KTQANFF 503
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS---YNP 292
+ T VMSY N GLY+M GNV+EW DW+ +++ +NP
Sbjct: 504 SSD-------------TTSVMSYPSNDNGLYDMAGNVYEWCQDWYGYNYYEISVQEPFNP 550
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + R A R +N P + GFRCAAD
Sbjct: 551 TGPLQGVYRVLRGG---CWKGLKEDLRSAKRHRNNPGTGNSTYGFRCAAD 597
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 74/150 (49%), Gaps = 18/150 (12%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ +MV++ G TF G+++ E P +++D+F +D H V+N QF F+ A
Sbjct: 362 HTEMVVIEGGTFTRGSSQG---NRDEMPKHEISIDSFAIDIHPVTNEQFVRFLEAM---- 414
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
E D+ + + +E R + R G + + HPVV ++W AVAY
Sbjct: 415 --EGIKDSNHNDLINLKESR---------IHRSRGKCTIESGYLKHPVVGVTWYGAVAYA 463
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
W G RLPTEAEWE + ++ +P G+
Sbjct: 464 KWLGKRLPTEAEWEVASQSHKDHAEYPTGN 493
>gi|90023627|ref|YP_529454.1| RecF protein [Saccharophagus degradans 2-40]
gi|89953227|gb|ABD83242.1| protein of unknown function DUF323 [Saccharophagus degradans 2-40]
Length = 338
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 146/347 (42%), Gaps = 79/347 (22%)
Query: 14 DMVLLPGDTFRMGTNKPILIK-DGEFPSRNVTL-DAFYLDQHEVSNTQFQEFVSATGYVT 71
+M+++P +F MG+ + + E P VTL F + +HEV+ QF+ FV ATGY+T
Sbjct: 51 EMIVVPAGSFLMGSPASEEGRSEAEGPQLKVTLAKVFAMGKHEVTRGQFRAFVDATGYLT 110
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+AE D + ++ + R L + HPV+ +S DA+AY
Sbjct: 111 QAEVEADRGCLTLDVRDDGNGNWGWSANQNWRNATLYGLEQGSDAHPVLCVSQQDALAYI 170
Query: 132 TW----------RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVV 181
TW R RLPTEAEWEY R G S + G D T
Sbjct: 171 TWFNETHLPPDERAYRLPTEAEWEYAARAG--------SSTTYSHGTDVT---------- 212
Query: 182 HVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPT 241
A C YG G P+G +P
Sbjct: 213 --------AMC-------------RYGNIG---------DQTALPKGH--------AWPE 234
Query: 242 NNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP----SYNPKGPTT 297
T DGY+ TAPV S++ N FGL++M GNVWEWT D W+ ++ AP ++ +
Sbjct: 235 WVTCDDGYVFTAPVGSFQANAFGLHDMHGNVWEWTEDCWHDNYIDAPVNGEAWEEENGGD 294
Query: 298 GTDKVKKGGSYLCNEQYCYRH--RCAARSQNTPDSSAGNLGFRCAAD 342
++V +GGS+ Y +R R A R N P +GFR A D
Sbjct: 295 CANRVARGGSW-----YFHRGDLRVAFRFINNPAMRYSLMGFRLAQD 336
>gi|338733048|ref|YP_004671521.1| serine/threonine-protein kinase pkn1 [Simkania negevensis Z]
gi|336482431|emb|CCB89030.1| serine/threonine-protein kinase pkn1 [Simkania negevensis Z]
Length = 672
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/168 (36%), Positives = 87/168 (51%), Gaps = 22/168 (13%)
Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
HPV+ V+W A+AY W G RLPTEAEWE G E ++P G + +AN +
Sbjct: 520 HPVIGVTWYGAIAYAKWIGKRLPTEAEWEIAASCGNEENIYPTGATIE---RTQANFFSS 576
Query: 238 EFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPKG 294
+ T V SY N++GLY+M GN++EW DW++ +++ NP+G
Sbjct: 577 D-------------TVTVKSYPPNEYGLYDMAGNIYEWCQDWYDYNYYEVSQQEPENPQG 623
Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
P G +V +GG C + RC+ R++N P GFRCAAD
Sbjct: 624 PLQGVFRVLRGG---CWKSLKEDLRCSHRNRNNPGIVNRTYGFRCAAD 668
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 71/150 (47%), Gaps = 18/150 (12%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV++ G F G+N+ E P + + +F +D H V+N QF F+ G +A
Sbjct: 435 EMVIIKGGKFERGSNQG---GRDERPRHKIEISSFAMDIHAVTNEQFVRFLEVMGGEKDA 491
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ ++ ++ + R +KR G HPV+ ++W A+AY W
Sbjct: 492 -------------NNQDIIQLKESR--IKRHSGKLVIESGYAKHPVIGVTWYGAIAYAKW 536
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGSWL 163
G RLPTEAEWE G E ++P G+ +
Sbjct: 537 IGKRLPTEAEWEIAASCGNEENIYPTGATI 566
>gi|421481579|ref|ZP_15929162.1| hypothetical protein QWC_03223 [Achromobacter piechaudii HLE]
gi|400199894|gb|EJO32847.1| hypothetical protein QWC_03223 [Achromobacter piechaudii HLE]
Length = 335
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
+HPVV++SWNDA A W R RLPTEAEWEY CR G R + N P G
Sbjct: 156 DHPVVNISWNDAQALARWLTQKEGRVYRLPTEAEWEYACRAGTRTR---YQNGDDPAGMP 212
Query: 231 RAN---------VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
A +W + DGY TAPV +Y N FGLY+M GN WEW ADW+
Sbjct: 213 DAGNGFDADAQPLWPKWQAFASARHDGYTFTAPVATYAPNAFGLYDMHGNAWEWVADWYG 272
Query: 282 VHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
++ +P+ +P+GP +G +V++GGS+ Y R + R+ NT ++ +G R
Sbjct: 273 EDYYAHSPTDDPQGPDSGNVRVRRGGSWHTWALYT---RSSYRNWNTQETRYPLVGLRL 328
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 62/168 (36%), Positives = 82/168 (48%), Gaps = 31/168 (18%)
Query: 15 MVLLPGDTFRMGTNK---------PILIKD------GEFPSRNVTLD-AFYLDQHEVSNT 58
V +P TF+MG+++ P KD E P +V + FYL++ EV+
Sbjct: 38 FVAIPAGTFQMGSDETPEALARDYPQYPKDRFAQLFDEAPVHSVRITRPFYLERTEVTVG 97
Query: 59 QFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
QF+ F+ A+GYV E+E +G ++P SE A + R R G T +H
Sbjct: 98 QFRRFIEASGYVPESEADGTGGYGYNAAYDPDKSERGDAFEGRDRRYSWRNPGFAQTDDH 157
Query: 114 RMHHPVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR 155
PVV+ISWNDA A W R RLPTEAEWEY CR G R
Sbjct: 158 ----PVVNISWNDAQALARWLTQKEGRVYRLPTEAEWEYACRAGTRTR 201
>gi|89898193|ref|YP_515303.1| serine/threonine protein kinase [Chlamydophila felis Fe/C-56]
gi|89331565|dbj|BAE81158.1| serine/threonine protein kinase [Chlamydophila felis Fe/C-56]
Length = 618
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 84/167 (50%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y +W G RLPTEAEWE GG+ + +P G + Q
Sbjct: 467 KHPVVGVTWYGASGYASWVGKRLPTEAEWEVAAYGGIAQQRYPCGEEIDKS--------Q 518
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAP---SYNPK 293
F +++T A VMSY N +GLY+M GNV+EW DW+ + S+ P+
Sbjct: 519 ANFFSSDTTA--------VMSYPANPYGLYDMAGNVYEWCEDWYGYDFYEISAQESHCPR 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/124 (33%), Positives = 62/124 (50%), Gaps = 15/124 (12%)
Query: 37 EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQ 96
E P + L++F+LD H V+N QF ++ G +E +K+ + + R K S+
Sbjct: 403 EHPVHEIFLESFFLDIHPVTNEQFVRYLECAG--SEQDKYYNELI---------RLKDSR 451
Query: 97 VRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRL 156
+ +R G HPVV ++W A Y +W G RLPTEAEWE GG+ +
Sbjct: 452 I----QRRSGKLVIEPGYAKHPVVGVTWYGASGYASWVGKRLPTEAEWEVAAYGGIAQQR 507
Query: 157 FPWG 160
+P G
Sbjct: 508 YPCG 511
>gi|395007433|ref|ZP_10391177.1| hypothetical protein PMI14_03869 [Acidovorax sp. CF316]
gi|394314580|gb|EJE51467.1| hypothetical protein PMI14_03869 [Acidovorax sp. CF316]
Length = 360
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 96/177 (54%), Gaps = 18/177 (10%)
Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGN-------N 223
+HPVV+V+WNDA A W + RLPTEAEWEY CR G R +P G+ N
Sbjct: 160 DHPVVNVTWNDAQALAQWLSRTEGQRYRLPTEAEWEYACRAGTRTR-YPHGDDPEGLLQN 218
Query: 224 LTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADW-WNV 282
+ A W+ +DG+ TAPV SY N FGL +M+GN WEW ADW +
Sbjct: 219 ANTFDQEAAPHWERWRQHALRGSDGHAFTAPVGSYAPNAFGLQDMLGNAWEWVADWHGDS 278
Query: 283 HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
++ +P+ +P GP G+ +V++GGS+ + + RC R+ N+P + +G R
Sbjct: 279 YYAESPAVDPGGPADGSVRVRRGGSW---HTWSFYARCGYRNWNSPQTRYTLVGMRL 332
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 50/135 (37%), Positives = 66/135 (48%), Gaps = 10/135 (7%)
Query: 41 RNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRH 99
R V AFYL QHEV+ QF+ F+ A+GY E+E G + + P +
Sbjct: 84 RVVIRQAFYLGQHEVTVGQFRRFLQASGYQPESEADGTGGYGYNPRYDPATTRRGDAFEG 143
Query: 100 DMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLE 153
R+ + HPVV+++WNDA A W + RLPTEAEWEY CR G
Sbjct: 144 RDPRYSWRNPGFAQGEDHPVVNVTWNDAQALAQWLSRTEGQRYRLPTEAEWEYACRAGTR 203
Query: 154 NRLFPWGSWLHPEGI 168
R +P G PEG+
Sbjct: 204 TR-YPHGD--DPEGL 215
>gi|209733864|gb|ACI67801.1| Sulfatase-modifying factor 1 precursor [Salmo salar]
Length = 209
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 51/82 (62%), Positives = 65/82 (79%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+PG F MGT+ P + +DGE P R V LD FY+D HEVSN FQ F++ATGY+TEAE
Sbjct: 98 MVLIPGGEFLMGTDNPGIPQDGEGPQRRVHLDHFYMDAHEVSNRHFQSFINATGYITEAE 157
Query: 75 KFGDTFVFEPLLSEEERAKISQ 96
+FGD+FVFE +LSEE +++ISQ
Sbjct: 158 RFGDSFVFEGVLSEEVKSQISQ 179
>gi|402495015|ref|ZP_10841749.1| gliding motility-associated lipoprotein GldK [Aquimarina
agarilytica ZC1]
Length = 453
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/413 (26%), Positives = 162/413 (39%), Gaps = 94/413 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
M L+PG F MG + P + VT+ +FY+D+ E++N+++++F V T
Sbjct: 36 MTLIPGGAFIMGKSDDDKAGLLNAPPKTVTVRSFYMDETEITNSEYRQFTEWVKDSVVRT 95
Query: 68 GYVTEAEKFGDT---------FVFEPLLSEEERAKISQVRHDMKRFEGLDST-------- 110
A++ G T F F EEE + + D +EGL T
Sbjct: 96 QLAITADEVGQTGSDEDGIGQFAFRNTEDEEELSVYEKYVRD--NYEGLGETGYEGKALN 153
Query: 111 -----IEHRMHHPVVHISWNDAVAYC----------TWRGARLPTEAEWEYGCRGGLENR 155
I +P + + Y T++ +L + +W Y +R
Sbjct: 154 KDVDLITDLNEYPDEYYAEVMDSLYVRAEEAHNGVRTFKTEKLVYDYKW-YDVEAMARSR 212
Query: 156 LFPWGSWLHPEGI----DSTI------------EHRM--------NHPVVHVSWNDAVAY 191
+L E + D+T+ H M ++PVV V+W A A+
Sbjct: 213 SKDSKKFLKREALKIYPDTTVWIRDFSYSYNEPMHDMYFTHTSYDDYPVVGVTWKQAKAF 272
Query: 192 CTWRGA------------------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
C WR RLPTEAEWEY RGGLE+ FPWG P ++
Sbjct: 273 CDWRTKNHNDALRKKKNKDLVSRYRLPTEAEWEYAARGGLESGTFPWGG---PYAKNDRG 329
Query: 234 VWQGEFP--TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSY 290
+ F + AAD L T SY N +GLYNM GNV EW ++ + S
Sbjct: 330 CFLANFKPMRGDYAADQALYTVEANSYDPNDYGLYNMAGNVSEWVGSTYDPASYEYTSSM 389
Query: 291 NPKGPTT-GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
NP KV +GGS+ + Y + + R DS+ +GFR D
Sbjct: 390 NPNSSDDENKRKVVRGGSW---KDVAYFLQVSTRDYEYADSARSYIGFRAVQD 439
>gi|425744693|ref|ZP_18862748.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii WC-323]
gi|425490289|gb|EKU56589.1| putative cysteine-type sulfatase aerobic maturase [Acinetobacter
baumannii WC-323]
Length = 334
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/164 (39%), Positives = 88/164 (53%), Gaps = 5/164 (3%)
Query: 179 PVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVWQG 237
PV +V+ ND Y W G LPTE EWEY + G + P +H +AN WQG
Sbjct: 165 PVRYVTKNDTEHYAVWLGHDLPTELEWEYAAKAGSKTDT-PLHQGPQDAHQHPQANYWQG 223
Query: 238 EFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNP--KG 294
+FP NNT D + APV Y N F L++M+GNVWEWT+ ++ H +Y +
Sbjct: 224 DFPGNNTVEDHFEGVAPVGCYPANAFKLFDMIGNVWEWTSSVYHGAHDQHMGNYTDLRQH 283
Query: 295 PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
T V KGGS+LC +C R+R ++R D +A ++GFR
Sbjct: 284 NNATTQYVLKGGSFLCASNFCARYRNSSRYPQEFDLAATHVGFR 327
>gi|162454345|ref|YP_001616712.1| protein kinase [Sorangium cellulosum So ce56]
gi|161164927|emb|CAN96232.1| Protein kinase [Sorangium cellulosum So ce56]
Length = 733
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 104/350 (29%), Positives = 138/350 (39%), Gaps = 108/350 (30%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEF----PSRNVTLDAFYLDQHEVSNTQFQ--------- 61
MVL+PG F MG+++P EF P+ VT+D F +D HEV+ ++
Sbjct: 467 MVLVPGGKFFMGSDEP------EFKLWQPAHKVTIDTFCIDVHEVTAGDYKACSDKGECK 520
Query: 62 ------EFVSATGYVTEAEKFGDTFVFEP-LLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
+F + +G A + T + E + E AK
Sbjct: 521 RPPAAPDFPNTSGAPAAAHQKTLTALAEQCTFGKPELAK--------------------- 559
Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEH 174
HP+ + W+ A AYC G RLPTEAEWEY RG + R FPWG
Sbjct: 560 --HPINCLPWSLANAYCEVHGKRLPTEAEWEYAARGS-DGRKFPWG-------------- 602
Query: 175 RMNHPVVHVSWNDAVAYCT-WRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRAN 233
+ P N CT W AR T +E Y
Sbjct: 603 --DKPGDQTFMNACGTECTAWEKARGLTPSERMY-------------------------- 634
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYN 291
A DG+ TAPV S+ +FG ++VGNVWEWTADW+ + N
Sbjct: 635 ----------EADDGFPGTAPVGSFPNGRTRFGADDVVGNVWEWTADWFETYKD-EEQVN 683
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
PKG G K +GG+Y N + A R +SA +GFRCAA
Sbjct: 684 PKGAPAGDRKAIRGGAY--NGGFALWLNPAFRYHQLATASAPGIGFRCAA 731
>gi|406039676|ref|ZP_11047031.1| hypothetical protein AursD1_07627 [Acinetobacter ursingii DSM 16037
= CIP 107286]
Length = 332
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 86/166 (51%), Gaps = 5/166 (3%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH-RANVW 235
N PV V+ NDA Y W G LP+E EWEY + G + + P +H +AN W
Sbjct: 163 NEPVRFVTKNDAEHYAVWLGRDLPSEIEWEYAAKAG-QTQDTPLHQAPMDHHQHPQANYW 221
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKG 294
QGEFP +NT D + APV + N F LY+ +GNVWEWT+ + H Y
Sbjct: 222 QGEFPFHNTKQDQFEGVAPVGCFAANNFQLYDTIGNVWEWTSSVYQGAHDEHMGDYQHLR 281
Query: 295 PTT--GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
T + V KGGSYLC + YC R R ++R D + ++GFR
Sbjct: 282 QTNIPSQNFVIKGGSYLCADNYCARFRNSSRHPQEFDLATSHVGFR 327
>gi|407455448|ref|YP_006734339.1| hypothetical protein B598_0661 [Chlamydia psittaci GR9]
gi|405781991|gb|AFS20740.1| hypothetical protein B598_0661 [Chlamydia psittaci GR9]
Length = 618
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ + +P G + + AN
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCGEEI---DKSLANF-- 521
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
F ++ TA VMSY N +GLY+M GNV+EW DW++ + S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 92/210 (43%), Gaps = 37/210 (17%)
Query: 2 VLLPAPPVERYK--------------DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDA 47
+L+ P V RY +MV + G TF G+ + + E P + L +
Sbjct: 357 LLIREPVVSRYVEEEKEEVKPQPLLTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQS 413
Query: 48 FYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGL 107
F+LD H V+N QF ++ +G +E +K+ + + R K S++ +R G
Sbjct: 414 FFLDIHPVTNEQFVRYLECSG--SEQDKYYNELI---------RLKDSRI----QRRSGK 458
Query: 108 DSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG 167
HPVV ++W A Y W G RLPTEAEWE GG+ + +P G E
Sbjct: 459 LVIEPGYAKHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCG-----EE 513
Query: 168 IDSTIEHRMNHPVVHVSWNDAVAYCTWRGA 197
ID ++ + + V A AY + A
Sbjct: 514 IDKSLANFFSSDTTAVMSYPANAYGLYDMA 543
>gi|332661785|ref|YP_004451255.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332337282|gb|AEE54382.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
Length = 2048
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/191 (37%), Positives = 98/191 (51%), Gaps = 42/191 (21%)
Query: 175 RMNHPVVHVSWNDAVAYCTWR------------------------GARLPTEAEWEYGCR 210
R PV++VSW DA+ YC WR G RLPTEAEWE+ R
Sbjct: 965 RGTRPVINVSWFDAIDYCNWRSQEEGLQQVYQVNKEQVTPNWSANGYRLPTEAEWEFAAR 1024
Query: 211 GGLENRLFPW-GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMV 269
GG E++ F + G+N NV + + + N+ A T PV K N+ G+Y+M
Sbjct: 1025 GGSESQGFTYAGSN---------NVDEVAWFSENSGA----KTQPVGQKKANELGIYDMS 1071
Query: 270 GNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
GNVWEW DW++ + A + +PKGP TG+ +V + GS+ + C R A RS N P+
Sbjct: 1072 GNVWEWCWDWYSAYSSSA-TKDPKGPDTGSIRVLRCGSWFYDADGC---RVAFRSDNDPN 1127
Query: 330 SSAGNLGFRCA 340
+ N GFR A
Sbjct: 1128 VANSNCGFRIA 1138
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 74/259 (28%), Positives = 104/259 (40%), Gaps = 72/259 (27%)
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
+++ + A+CT G LP ++ W G R PV
Sbjct: 1234 ELTFEEYDAFCTATGRELPDDSGWGRGKR-----------------------------PV 1264
Query: 181 VHVSWNDAVAYCTWR---------------------------------------GARLPT 201
++V W DA+ YC WR G RLPT
Sbjct: 1265 INVDWYDAIEYCNWRSSQEGLQTVYSINKNTKDSNNSSPSDTKQWQVTTSRSANGYRLPT 1324
Query: 202 EAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKEN 261
EAEWEY R G + F G ++ E N G + + T PV S+ N
Sbjct: 1325 EAEWEYAARQGGQKVRFGNGKDIADPQEINFNALSGYKKRYSVVGEYRQKTLPVGSFSPN 1384
Query: 262 KFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCA 321
L++M GNVWEW DW+ + A S +P+G T G+D+V +GGS+ N + R A
Sbjct: 1385 TLDLFDMSGNVWEWCGDWYGPYPSSA-SNDPQGATGGSDRVFRGGSWRSNPIFT---RVA 1440
Query: 322 ARSQNTPDSSAGNLGFRCA 340
R + +P +GN GFR A
Sbjct: 1441 LRYRISPGDRSGNKGFRLA 1459
Score = 100 bits (248), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/192 (34%), Positives = 90/192 (46%), Gaps = 44/192 (22%)
Query: 175 RMNHPVVHVSWNDAVAYCTWR------------------------GARLPTEAEWEYGCR 210
R PV++V+W DA+ YC WR G RLPTEAEWEY +
Sbjct: 1572 RGTRPVINVNWFDAIDYCNWRSQQENLKPVYNGGKDGFTADWSANGYRLPTEAEWEYAAK 1631
Query: 211 GGLENRLFPWG--NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNM 268
GGL +R F + NN+ W G N T PV K N+ GL++M
Sbjct: 1632 GGLSSRGFKYAGSNNVDEVA------WYGSNSGN--------KTQPVGQKKANELGLFDM 1677
Query: 269 VGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTP 328
GNV EW DW+ + + A + +P+GP G +V + GS+ + YC R A R +N P
Sbjct: 1678 SGNVQEWCWDWYEAYSNRAFN-DPRGPDFGMYRVLRSGSWFNSSDYC---RVAHRDRNVP 1733
Query: 329 DSSAGNLGFRCA 340
+ + GFR A
Sbjct: 1734 RRAGDDYGFRLA 1745
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 90/193 (46%), Gaps = 46/193 (23%)
Query: 175 RMNHPVVHVSWNDAVAYCTWR------------------------GARLPTEAEWEYGCR 210
R + P ++V+W DAV YC WR G RLPTEAEWEY R
Sbjct: 1859 RGDRPAINVNWFDAVDYCNWRSQEEGLQQVYHVNMEQVTPNWSANGYRLPTEAEWEYAAR 1918
Query: 211 GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVG 270
GG E++ F + + V + + + N+ +L V K N+ GL++M G
Sbjct: 1919 GGSESQGFTYAGS--------RKVLEVAWFSENSGGSTHL----VGQKKTNELGLFDMSG 1966
Query: 271 NVWEWTADWWNVHHHPAPSY---NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNT 327
NVWEW DW H P+ +P+GP+ G+ +V +GGS+L + C+ A R
Sbjct: 1967 NVWEWCWDW----HAAYPNSFINDPRGPSAGSRRVLRGGSWLNEAELCH---VATRLNYN 2019
Query: 328 PDSSAGNLGFRCA 340
P + GFR A
Sbjct: 2020 PLYADNRFGFRLA 2032
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 68/171 (39%), Gaps = 70/171 (40%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY-VTEA 73
MVL+ G TF MG E + +VTL FY+ +H+++ ++ F ATG +T+
Sbjct: 910 MVLVKGGTFEMGE---------ENNTHSVTLSDFYIGKHQLTFAEYDAFCRATGRELTKD 960
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+G R PV+++SW DA+ YC W
Sbjct: 961 RGWG------------------------------------RGTRPVINVSWFDAIDYCNW 984
Query: 134 R------------------------GARLPTEAEWEYGCRGGLENRLFPWG 160
R G RLPTEAEWE+ RGG E++ F +
Sbjct: 985 RSQEEGLQQVYQVNKEQVTPNWSANGYRLPTEAEWEFAARGGSESQGFTYA 1035
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 48/182 (26%), Positives = 68/182 (37%), Gaps = 69/182 (37%)
Query: 4 LPAPPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF 63
L A E+ MVL+ G TF+MG K S VTL F + + +++ +F F
Sbjct: 1792 LIAIIQEKQDSMVLVKGGTFQMGEVKK--------NSHQVTLSDFLIAKRQLTFDEFDAF 1843
Query: 64 VSATGYVTEAEK-FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHI 122
AT +++ +G R P +++
Sbjct: 1844 CKATSRELPSDRNWG------------------------------------RGDRPAINV 1867
Query: 123 SWNDAVAYCTWR------------------------GARLPTEAEWEYGCRGGLENRLFP 158
+W DAV YC WR G RLPTEAEWEY RGG E++ F
Sbjct: 1868 NWFDAVDYCNWRSQEEGLQQVYHVNMEQVTPNWSANGYRLPTEAEWEYAARGGSESQGFT 1927
Query: 159 WG 160
+
Sbjct: 1928 YA 1929
>gi|329942958|ref|ZP_08291737.1| hypothetical protein G5Q_0636 [Chlamydophila psittaci Cal10]
gi|332287549|ref|YP_004422450.1| putative serine/threonine protein kinase [Chlamydophila psittaci
6BC]
gi|384450705|ref|YP_005663305.1| s/t protein kinase [Chlamydophila psittaci 6BC]
gi|384451703|ref|YP_005664301.1| putative serine/threonine protein kinase [Chlamydophila psittaci
01DC11]
gi|384452677|ref|YP_005665274.1| putative serine/threonine protein kinase [Chlamydophila psittaci
08DC60]
gi|384453653|ref|YP_005666249.1| putative serine/threonine protein kinase [Chlamydophila psittaci
C19/98]
gi|384454632|ref|YP_005667227.1| putative serine/threonine protein kinase [Chlamydophila psittaci
02DC15]
gi|392376781|ref|YP_004064559.1| putative serine/threonine protein kinase [Chlamydophila psittaci
RD1]
gi|407454166|ref|YP_006733274.1| hypothetical protein B595_0715 [Chlamydia psittaci 84/55]
gi|407456852|ref|YP_006735425.1| hypothetical protein B600_0709 [Chlamydia psittaci VS225]
gi|407458186|ref|YP_006736491.1| hypothetical protein B601_0665 [Chlamydia psittaci WS/RT/E30]
gi|407459433|ref|YP_006737536.1| hypothetical protein B602_0666 [Chlamydia psittaci M56]
gi|407460805|ref|YP_006738580.1| hypothetical protein B603_0670 [Chlamydia psittaci WC]
gi|313848124|emb|CBY17125.1| putative serine/threonine protein kinase [Chlamydophila psittaci
RD1]
gi|325506747|gb|ADZ18385.1| putative serine/threonine protein kinase [Chlamydophila psittaci
6BC]
gi|328815218|gb|EGF85207.1| hypothetical protein G5Q_0636 [Chlamydophila psittaci Cal10]
gi|328914799|gb|AEB55632.1| s/t protein kinase [Chlamydophila psittaci 6BC]
gi|334692434|gb|AEG85653.1| putative serine/threonine protein kinase [Chlamydophila psittaci
C19/98]
gi|334693413|gb|AEG86631.1| putative serine/threonine protein kinase [Chlamydophila psittaci
01DC11]
gi|334694389|gb|AEG87606.1| putative serine/threonine protein kinase [Chlamydophila psittaci
02DC15]
gi|334695366|gb|AEG88582.1| putative serine/threonine protein kinase [Chlamydophila psittaci
08DC60]
gi|405780925|gb|AFS19675.1| hypothetical protein B595_0715 [Chlamydia psittaci 84/55]
gi|405784113|gb|AFS22860.1| hypothetical protein B600_0709 [Chlamydia psittaci VS225]
gi|405785555|gb|AFS24301.1| hypothetical protein B601_0665 [Chlamydia psittaci WS/RT/E30]
gi|405785667|gb|AFS24412.1| hypothetical protein B602_0666 [Chlamydia psittaci M56]
gi|405787550|gb|AFS26294.1| hypothetical protein B603_0670 [Chlamydia psittaci WC]
Length = 618
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ + +P G + + AN
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCGEEI---DKSLANF-- 521
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
F ++ TA VMSY N +GLY+M GNV+EW DW++ + S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 87/190 (45%), Gaps = 23/190 (12%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + +MV + G TF G+ + + E P + L +F+LD H V+N QF ++ +
Sbjct: 377 PQPLFTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G +E +K+ + + R K S++ +R G HPVV ++W A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
Y W G RLPTEAEWE GG+ + +P G E ID ++ + + V
Sbjct: 479 SGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533
Query: 188 AVAYCTWRGA 197
A AY + A
Sbjct: 534 ANAYGLYDMA 543
>gi|424825250|ref|ZP_18250237.1| putative serine/threonine protein kinase [Chlamydophila abortus
LLG]
gi|333410349|gb|EGK69336.1| putative serine/threonine protein kinase [Chlamydophila abortus
LLG]
Length = 618
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ + +P G + + AN
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEIAAYGGVAQQRYPCGEEI---DKSLANF-- 521
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
F ++ TA VMSY N +GLY+M GNV+EW DW++ + S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 86/190 (45%), Gaps = 23/190 (12%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + +MV + G F G+ + + E P + L +F+LD H V+N QF ++ +
Sbjct: 377 PQPLFTEMVFIEGGAFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G +E +K+ + + R K S++ +R G HPVV ++W A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
Y W G RLPTEAEWE GG+ + +P G E ID ++ + + V
Sbjct: 479 SGYAAWVGKRLPTEAEWEIAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533
Query: 188 AVAYCTWRGA 197
A AY + A
Sbjct: 534 ANAYGLYDMA 543
>gi|62185206|ref|YP_219991.1| serine/threonine protein kinase [Chlamydophila abortus S26/3]
gi|62148273|emb|CAH64038.1| putative serine/threonine protein kinase [Chlamydophila abortus
S26/3]
Length = 618
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ + +P G + + AN
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEIAAYGGVAQQRYPCGEEI---DKSLANF-- 521
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
F ++ TA VMSY N +GLY+M GNV+EW DW++ + S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 86/190 (45%), Gaps = 23/190 (12%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + +MV + G F G+ + + E P + L +F+LD H V+N QF ++ +
Sbjct: 377 PQPLFTEMVFIEGGAFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G +E +K+ + + R K S++ +R G HPVV ++W A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
Y W G RLPTEAEWE GG+ + +P G E ID ++ + + V
Sbjct: 479 SGYAAWVGKRLPTEAEWEIAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533
Query: 188 AVAYCTWRGA 197
A AY + A
Sbjct: 534 ANAYGLYDMA 543
>gi|448734250|ref|ZP_21716476.1| sulfatase-modifying factor [Halococcus salifodinae DSM 8989]
gi|445800298|gb|EMA50653.1| sulfatase-modifying factor [Halococcus salifodinae DSM 8989]
Length = 133
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 58/128 (45%), Positives = 74/128 (57%), Gaps = 18/128 (14%)
Query: 232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH------- 284
AN WQGEFP N A DGY T+PV S+ N+FGLY+M GNVW+WT DW++
Sbjct: 2 ANTWQGEFPHENKALDGYERTSPVGSFPANEFGLYDMAGNVWDWTQDWYSADPPTSEPEA 61
Query: 285 ---HPA--------PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAG 333
PA S +P+ P+ KV KGGS+LC YC+R+R AAR D+S
Sbjct: 62 SCCTPANLRTVTEEQSVDPRDPSEIPRKVLKGGSHLCAPNYCFRYRPAARYPEPIDTSTS 121
Query: 334 NLGFRCAA 341
++GFRC A
Sbjct: 122 HVGFRCVA 129
>gi|406593560|ref|YP_006740739.1| hypothetical protein B712_0664 [Chlamydia psittaci NJ1]
gi|405789432|gb|AFS28174.1| hypothetical protein B712_0664 [Chlamydia psittaci NJ1]
Length = 618
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ + +P G + + AN
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCGEEI---DKSLANF-- 521
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
F ++ TA VMSY N +GLY+M GNV+EW DW++ + S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 86/190 (45%), Gaps = 23/190 (12%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + +MV + G F G+ + + E P + L +F+LD H V+N QF ++ +
Sbjct: 377 PQPLFTEMVFIEGGAFTRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G +E +K+ + + R K S++ +R G HPVV ++W A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
Y W G RLPTEAEWE GG+ + +P G E ID ++ + + V
Sbjct: 479 SGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533
Query: 188 AVAYCTWRGA 197
A AY + A
Sbjct: 534 ANAYGLYDMA 543
>gi|449071261|ref|YP_007438341.1| serine/threonine protein kinase [Chlamydophila psittaci Mat116]
gi|449039769|gb|AGE75193.1| serine/threonine protein kinase [Chlamydophila psittaci Mat116]
Length = 618
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/167 (38%), Positives = 85/167 (50%), Gaps = 22/167 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ + +P G + + AN
Sbjct: 467 KHPVVGVTWYGASGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCGEEI---DKSLANF-- 521
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---PAPSYNPK 293
F ++ TA VMSY N +GLY+M GNV+EW DW++ + S+ P+
Sbjct: 522 --FSSDTTA---------VMSYPANAYGLYDMAGNVYEWCEDWYSYDFYELSAQESHAPQ 570
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G +V +GG C + RCA R +N P + GFRCA
Sbjct: 571 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCA 614
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 87/190 (45%), Gaps = 23/190 (12%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P + +MV + G TF G+ + + E P + L +F+LD H V+N QF ++ +
Sbjct: 377 PQPLFTEMVFIEGGTFIRGSREG---QRDEHPVHEIFLQSFFLDIHPVTNEQFVRYLECS 433
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G +E +K+ + + R K S++ +R G HPVV ++W A
Sbjct: 434 G--SEQDKYYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWYGA 478
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWND 187
Y W G RLPTEAEWE GG+ + +P G E ID ++ + + V
Sbjct: 479 SGYAAWVGKRLPTEAEWEVAAYGGVAQQRYPCG-----EEIDKSLANFFSSDTTAVMSYP 533
Query: 188 AVAYCTWRGA 197
A AY + A
Sbjct: 534 ANAYGLYDMA 543
>gi|373456878|ref|ZP_09548645.1| Sulphatase-modifying factor protein [Caldithrix abyssi DSM 13497]
gi|371718542|gb|EHO40313.1| Sulphatase-modifying factor protein [Caldithrix abyssi DSM 13497]
Length = 298
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 69/183 (37%), Positives = 98/183 (53%), Gaps = 10/183 (5%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGG---LENRLFPWGNNLTPRGE-HRA 232
NHPVV+V+W DA AY W G RLPTEAEWEY + G L L + +T G
Sbjct: 111 NHPVVNVNWYDAQAYAKWAGKRLPTEAEWEYVAKAGGNSLNYTLTAERSYITSLGNVADF 170
Query: 233 NVWQGEF--PTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APS 289
++ Q ++ N DG+ T+PV + N FG+Y++ GNV EW ADW++ ++ +P+
Sbjct: 171 SLRQKDYHRIVMNNYEDGFPYTSPVGYFPPNPFGIYDLEGNVLEWCADWYDPQYYAQSPA 230
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
NP GP G +V +GGS+ + Y R R+ P+ + LGFRCA D
Sbjct: 231 ENPTGPAQGKYRVIRGGSWNRSGDYL---RTTYRTWYPPECTFEFLGFRCAMDAEAALPA 287
Query: 350 DKV 352
K+
Sbjct: 288 KKI 290
>gi|15835040|ref|NP_296799.1| hypothetical protein TC0422 [Chlamydia muridarum Nigg]
gi|270285205|ref|ZP_06194599.1| serine/threonine protein kinase [Chlamydia muridarum Nigg]
gi|270289224|ref|ZP_06195526.1| serine/threonine protein kinase [Chlamydia muridarum Weiss]
gi|301336601|ref|ZP_07224803.1| serine-threonine-protein kinase [Chlamydia muridarum MopnTet14]
gi|55667874|sp|Q9PKP3.1|PKN1_CHLMU RecName: Full=Serine/threonine-protein kinase pkn1
gi|7190465|gb|AAF39278.1| conserved hypothetical protein [Chlamydia muridarum Nigg]
Length = 614
Score = 108 bits (271), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 86/169 (50%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y +W G RLP+EAEWE GG +P G ++ + +AN +
Sbjct: 463 KHPVVGVTWYGASSYASWIGKRLPSEAEWEVAAAGGKLGMRYPTGEDV---DKSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
+ T PVMSY N GLY+M GN++EW DW++ + + +P+
Sbjct: 520 SD-------------TTPVMSYPANILGLYDMAGNIYEWCQDWYSYDFYENSALEPDSPQ 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKEDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 69/151 (45%), Gaps = 26/151 (17%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDG---EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+MV + G F G+ +DG E P N+TL F LD H V+N F F+ G
Sbjct: 379 EMVFIEGGDFSRGS------RDGQRDELPVHNITLPGFLLDIHPVTNEHFVRFLEFIG-- 430
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVA 129
E ++ + + R K S+++ R IE HPVV ++W A +
Sbjct: 431 GEQDEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASS 476
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
Y +W G RLP+EAEWE GG +P G
Sbjct: 477 YASWIGKRLPSEAEWEVAAAGGKLGMRYPTG 507
>gi|302527058|ref|ZP_07279400.1| CalU17 [Streptomyces sp. AA4]
gi|302435953|gb|EFL07769.1| CalU17 [Streptomyces sp. AA4]
Length = 287
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 102/344 (29%), Positives = 151/344 (43%), Gaps = 79/344 (22%)
Query: 14 DMVLLPGDTFRMGT-----------NKPILIK--DGEFPSRNVTLDAFYLDQHEVSNTQF 60
+MV +P +F G+ ++P+ + E P + L + +D+ V+ +F
Sbjct: 3 EMVSIPAGSFLQGSPPWVLRWLDESDQPLPPEWFGDETPQVSRRLPPYRIDRRPVTVAEF 62
Query: 61 QEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVV 120
EFV ATGY T+AE G +++ +E E D + G S R +HPVV
Sbjct: 63 AEFVRATGYRTDAESRGYGMLYDRGWTEREGV-------DWRAPAGPGSGTAGRENHPVV 115
Query: 121 HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPV 180
++SW+DA AY W G RLPTE EWEY RG R++PWG+ +P G+ +T E+
Sbjct: 116 NVSWDDASAYAAWTGKRLPTEPEWEYAARGA-GFRIWPWGNRWNP-GLANTAEY------ 167
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
G ++ T AEW W + GE RA++
Sbjct: 168 --------------YGEKMTTLAEWRK------------WWATV---GESRADL------ 192
Query: 241 TNNTAADGYLSTAPVMSYK---ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTT 297
T PV ++ ++ FG +M GNV+EWTA + A P
Sbjct: 193 ---------PLTTPVGAFSPAGDSVFGCADMAGNVYEWTATVSEPYADTARFDPAIRPAM 243
Query: 298 GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS-SAGNLGFRCA 340
G +V +GGS++ + Y+ RC R P S LGFRCA
Sbjct: 244 GNYRVIRGGSWM---NFRYQVRCCERLHGDPLGWSNFALGFRCA 284
>gi|302038590|ref|YP_003798912.1| hypothetical protein NIDE3299 [Candidatus Nitrospira defluvii]
gi|300606654|emb|CBK42987.1| conserved exported protein of unknown function DUF323 [Candidatus
Nitrospira defluvii]
Length = 317
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 71/179 (39%), Positives = 93/179 (51%), Gaps = 21/179 (11%)
Query: 176 MNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVW 235
+N P V+VSW DA AYC WRG RLP+EAEWE R G + RL+PWGN P + A V
Sbjct: 133 VNQPAVYVSWEDAKAYCEWRGRRLPSEAEWEKAMR-GPDGRLWPWGNVEVPNAANWARV- 190
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHHPAPS-YNP 292
DG+ APV K + +G+ + GNV EW DW+ + A S NP
Sbjct: 191 ----------DDGFDVAAPVGRVKSDASPYGVMDGAGNVMEWVNDWYLEGAYAAASERNP 240
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA---ADKGPTTG 348
+ P GT KV +G +Y + R ARS+ D +GFRCA A+ G ++G
Sbjct: 241 ESPEYGTYKVLRGAAYTSSGSDL---RITARSKMMQDFRDETIGFRCAQSSAENGRSSG 296
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 71/144 (49%), Gaps = 31/144 (21%)
Query: 18 LPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFG 77
+P F GTN+ E P R + LD F +D++EV+N Q+ FV ATG+
Sbjct: 66 IPAGPFVFGTNRGGF---DEQPERTIYLDQFLIDRYEVTNAQYAAFVKATGH-------- 114
Query: 78 DTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGAR 137
+ S+ + R G+ + P V++SW DA AYC WRG R
Sbjct: 115 -----------RKSGPPSRYAKNTLRMRGV--------NQPAVYVSWEDAKAYCEWRGRR 155
Query: 138 LPTEAEWEYGCRGGLENRLFPWGS 161
LP+EAEWE R G + RL+PWG+
Sbjct: 156 LPSEAEWEKAMR-GPDGRLWPWGN 178
>gi|319792296|ref|YP_004153936.1| hypothetical protein Varpa_1615 [Variovorax paradoxus EPS]
gi|315594759|gb|ADU35825.1| protein of unknown function DUF323 [Variovorax paradoxus EPS]
Length = 323
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 69/182 (37%), Positives = 92/182 (50%), Gaps = 22/182 (12%)
Query: 178 HPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR---------LFPWGN 222
HPVV+V+WNDA A W + RLPTEAEWEY CR G R L N
Sbjct: 148 HPVVNVTWNDAQALAAWLSKSEGKRYRLPTEAEWEYACRAGTRTRYSAGDDPQSLLRTAN 207
Query: 223 NLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
R A W P DG+ TAPV S+ N +GLY+M GN WEW ADW
Sbjct: 208 VFDAR---TARYWPRWAPMALKGDDGFAFTAPVGSFAPNAWGLYDMHGNAWEWVADWHGE 264
Query: 283 HHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
++ +P +P+GP GT +V++GGS+ + + R A R+ N PD+ +G R
Sbjct: 265 EYYARSPVDDPQGPAEGTVRVRRGGSW---HTWPFYARAAYRNWNAPDTRYTLVGTRLVR 321
Query: 342 DK 343
++
Sbjct: 322 EQ 323
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 78/175 (44%), Gaps = 45/175 (25%)
Query: 15 MVLLPGDTFRMGTNK---------PILIKD------GEFPSRNVTLD-AFYLDQHEVSNT 58
V LP F MG+++ P L K+ E P V + +F++ QHEV+
Sbjct: 29 FVKLPAGEFMMGSDETPEALAAAYPELPKERFTKLGDEGPVHRVRITRSFWMGQHEVTVG 88
Query: 59 QFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH 113
QF+ FV A+GYV E+ +G ++P + A FEG D
Sbjct: 89 QFRRFVEASGYVPESVADGTGGYGWRADYDPATTRRGDA-----------FEGRDVRYSW 137
Query: 114 RM-------HHPVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR 155
R HPVV+++WNDA A W + RLPTEAEWEY CR G R
Sbjct: 138 RNPGFAQSDEHPVVNVTWNDAQALAAWLSKSEGKRYRLPTEAEWEYACRAGTRTR 192
>gi|284114704|ref|ZP_06386680.1| protein of unknown function DUF323 [Candidatus Poribacteria sp.
WGA-A3]
gi|283829573|gb|EFC33916.1| protein of unknown function DUF323 [Candidatus Poribacteria sp.
WGA-A3]
Length = 433
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 88/176 (50%), Gaps = 23/176 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
NHPVV+VSW A+AY W G RLPTEAEWE RGGL ++ +PWGN++ P + +
Sbjct: 270 NHPVVYVSWYAAMAYADWAGKRLPTEAEWEKAARGGLVSQAYPWGNSIDPSKANYSGKIG 329
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPS-YNPKGP 295
G T PV +Y EN +GLY+MVGN WEW D + + S NP
Sbjct: 330 G--------------TTPVGNYPENAYGLYDMVGNAWEWCLDIYKTDFYSGSSRRNPIAG 375
Query: 296 TTGTDKVKK----GGSYLCNEQYCYRH----RCAARSQNTPDSSAGNLGFRCAADK 343
+ T+ +K S + Y Y R A R + +PDS+ FRC +
Sbjct: 376 ESVTELTEKYLTVKDSRVVRSGYWYNQSVYVRVADRYKLSPDSTHRGGSFRCVMSR 431
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 85/158 (53%), Gaps = 24/158 (15%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
++ +M + F+MG+N I E P V LD FY+D +EV+N Q+++FV A
Sbjct: 183 QFTEMATISAGDFQMGSND-IEADHDEQPIHTVYLDTFYMDVYEVTNAQYKQFVEAN--- 238
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHD---MKRFEGLDSTIEHRMHHPVVHISWNDA 127
E ++ +I HD +K +EG D + + +HPVV++SW A
Sbjct: 239 ----------------PEWQKDRIPSAYHDGNYLKYWEGNDYP-QGQENHPVVYVSWYAA 281
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
+AY W G RLPTEAEWE RGGL ++ +PWG+ + P
Sbjct: 282 MAYADWAGKRLPTEAEWEKAARGGLVSQAYPWGNSIDP 319
>gi|302039208|ref|YP_003799530.1| hypothetical protein NIDE3935 [Candidatus Nitrospira defluvii]
gi|300607272|emb|CBK43605.1| conserved exported protein of unknown function, Sulfatase-modifying
factor-like [Candidatus Nitrospira defluvii]
Length = 310
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 97/334 (29%), Positives = 135/334 (40%), Gaps = 112/334 (33%)
Query: 15 MVLLPGDTFRMGTNKPILIKDG--EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG---- 68
MV +PG F MG++ G E P V +DAF +D++EVSN + FV A G
Sbjct: 59 MVPVPGGPFLMGSDPKFDRAAGPQELPQHQVYVDAFSIDRYEVSNVNYLRFVLAAGAAWP 118
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
+ A+ F D HPV+ +SW +A
Sbjct: 119 HYWRAQPFPDKMA----------------------------------KHPVIGVSWREAD 144
Query: 129 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEG-IDSTIEHRMNHPVVHVSWND 187
AYC WRGARLPTEAEWE R G + R+FPWG+ P G I S I H +
Sbjct: 145 AYCRWRGARLPTEAEWEKAAR-GEDGRMFPWGN--EPAGWIKSNIAHPGSK--------- 192
Query: 188 AVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAAD 247
RGA+ P A + + N ++P G H+
Sbjct: 193 -------RGAKYPPLANVDR------------YDNGVSPYGVHQ---------------- 217
Query: 248 GYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH-PAPSYNPKGPTTGTDKVKKGG 306
+ GNV EW +DW++ ++ + NP+GP +G DKV +GG
Sbjct: 218 --------------------LAGNVSEWVSDWFDPEYYRRQENRNPQGPASGQDKVFRGG 257
Query: 307 SYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
S+ + + R A R+ D + GFRCA
Sbjct: 258 SWNEDPEVA---RSAGRNSGGLDHWSYLTGFRCA 288
>gi|389579892|ref|ZP_10169919.1| hypothetical protein DespoDRAFT_01834 [Desulfobacter postgatei
2ac9]
gi|389401527|gb|EIM63749.1| hypothetical protein DespoDRAFT_01834 [Desulfobacter postgatei
2ac9]
Length = 581
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 138/335 (41%), Gaps = 84/335 (25%)
Query: 10 ERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
+++ + +P T+ +GT K + + + Y+ ++ V+N F+ F+ TGY
Sbjct: 326 KKFNAYITIPAGTYTIGTEKTL---KSSLALQPFDMPRVYIGKYPVTNALFEIFIEQTGY 382
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
VT AE+ G I V H + +G + W+
Sbjct: 383 VTTAERKG----------------IGTVYHSRYKKQG-------------EKVIWSKQAG 413
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAV 189
+GA C W P G ST+ + NHPVV +S +DA
Sbjct: 414 SSIVKGA-----------C-------------WHQPLGPGSTLYGKRNHPVVQISVDDAF 449
Query: 190 AYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGY 249
AY +W G RLPTEAEWE R + + +PWGN P N + G
Sbjct: 450 AYASWIGRRLPTEAEWEAAARTDMALK-YPWGNQFDPDAL-------------NIESSGL 495
Query: 250 LSTAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGS 307
T+PV +Y N FG+ +M+GNV EWTAD + P P + K V KGG+
Sbjct: 496 ADTSPVDNYDHAANVFGMADMLGNVMEWTAD-----NQPPPFASSKAKFFN---VAKGGA 547
Query: 308 YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
+ + ++R +++A +GFRC ++
Sbjct: 548 WNAKDTVT----ISSRGLFPFETTANIIGFRCISE 578
>gi|451980739|ref|ZP_21929125.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
gi|451762075|emb|CCQ90364.1| conserved exported hypothetical protein [Nitrospina gracilis 3/211]
Length = 249
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 69/168 (41%), Positives = 91/168 (54%), Gaps = 20/168 (11%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
N P V+W +A AYC G RLPTEAEWE R G + + WG+ EH
Sbjct: 95 NLPADQVTWLEASAYCKQVGKRLPTEAEWEKAARAGTQTAYY-WGD------EH-----S 142
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKGP 295
GE+ +D +T PV K N FGLY+M GNVWEW +DW++ ++ +P NP+GP
Sbjct: 143 GEYAWFEDNSD--ETTHPVGQKKPNGFGLYDMSGNVWEWVSDWYDKTYYERSPPANPQGP 200
Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAA-RSQNTPDSSAGNLGFRCAAD 342
TGT+K +GGS+ RH+ +A R + P N GFRCA D
Sbjct: 201 ETGTEKTLRGGSWYSGA----RHQMSATRFWSEPHLRNSNFGFRCAMD 244
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 41/153 (26%), Positives = 63/153 (41%), Gaps = 47/153 (30%)
Query: 8 PVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSAT 67
P++ MV +P T+ MG ++P + V +D F+LD+ EV+ +E+V+
Sbjct: 33 PLQPPAGMVYIPPGTYTMGLEG----ASNKYPHK-VFVDGFFLDKFEVTQ---KEYVAIR 84
Query: 68 GYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDA 127
G + +FEG + P ++W +A
Sbjct: 85 G------------------------------ANPSKFEGENL--------PADQVTWLEA 106
Query: 128 VAYCTWRGARLPTEAEWEYGCRGGLENRLFPWG 160
AYC G RLPTEAEWE R G + + WG
Sbjct: 107 SAYCKQVGKRLPTEAEWEKAARAGTQTAYY-WG 138
>gi|308273234|emb|CBX29837.1| hypothetical protein N47_F15320 [uncultured Desulfobacterium sp.]
Length = 665
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 135/332 (40%), Gaps = 87/332 (26%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+ +L+PG + +G+ +P K E RN+ L AFY+ ++ V+N F+ F TGY T
Sbjct: 418 FNQYILIPGGNYIIGSRQP---KRNECLERNINLPAFYMGKYPVTNALFEVFAEKTGYKT 474
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
AEK G V+ R+E T++ + + + +N Y
Sbjct: 475 TAEKRGYGKVY------------------FGRYE---KTLDGQT--GMGKLIFNATTTYK 511
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAY 191
T GA C W P G STI + NHPVV VS+ DA+A+
Sbjct: 512 TIEGA-----------C-------------WYQPCGPKSTIHRKRNHPVVQVSFEDAMAF 547
Query: 192 CTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLS 251
W G RLPTE EWE R + FPWG + NV +
Sbjct: 548 AAWTGKRLPTENEWEAASRTD-KGYSFPWGESFRKEA---CNVEESSI----------CG 593
Query: 252 TAPVMSYK--ENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYL 309
T+PV Y+ EN FG+ + +GNV+EWT V KGG ++
Sbjct: 594 TSPVDLYREFENGFGIADTLGNVFEWTTSRLEKFM-----------------VVKGGCWI 636
Query: 310 CNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
+ R Q PD+ + LGFRC A
Sbjct: 637 SGNDI----KLFTRFQIEPDNHSNILGFRCIA 664
>gi|311108477|ref|YP_003981330.1| hypothetical protein AXYL_05315 [Achromobacter xylosoxidans A8]
gi|310763166|gb|ADP18615.1| hypothetical protein AXYL_05315 [Achromobacter xylosoxidans A8]
Length = 330
Score = 108 bits (270), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR---------LFPWG 221
+HPVV++SWNDA A W + RLPTEAEWEY CR G R + G
Sbjct: 153 DHPVVNISWNDATALARWLTQTEGKVYRLPTEAEWEYACRAGTRTRYQNGDDPAAMPDIG 212
Query: 222 NNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN 281
N WQ + ADG+ TAPV SY N FGLY+M GN WEW +DW+
Sbjct: 213 NGFDADAAPLWPKWQAH---ASARADGHAFTAPVASYAPNAFGLYDMHGNAWEWVSDWYG 269
Query: 282 VHHHP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
++ +P +P+GP +G +V++GGS+ Y R + R+ NT ++ +G R
Sbjct: 270 EDYYAHSPVDDPQGPASGNVRVRRGGSWHTWALYT---RASYRNWNTQETRYPLVGLRL 325
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 79/164 (48%), Gaps = 23/164 (14%)
Query: 15 MVLLPGDTFRMGTNK---------PILIKD------GEFPSRNVTLD-AFYLDQHEVSNT 58
V +P TF+MG+++ P+ D E P V + AFY+++ EV+
Sbjct: 35 FVRVPAGTFQMGSDEAPEALAQAYPLYPADRYAQLSDEAPVHAVRITRAFYMERTEVTVG 94
Query: 59 QFQEFVSATGYVTEAEKFGDT-FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
QF+ F+ A+GYV E+E G + + P ++ + +R+ + H
Sbjct: 95 QFRRFIEASGYVPESEADGTGGYGYNPQYDADKSERGDAFEGRDRRYSWRNPGFAQGDDH 154
Query: 118 PVVHISWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENR 155
PVV+ISWNDA A W + RLPTEAEWEY CR G R
Sbjct: 155 PVVNISWNDATALARWLTQTEGKVYRLPTEAEWEYACRAGTRTR 198
>gi|330444630|ref|YP_004377616.1| s/t protein kinase [Chlamydophila pecorum E58]
gi|328807740|gb|AEB41913.1| s/t protein kinase [Chlamydophila pecorum E58]
Length = 617
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/169 (37%), Positives = 82/169 (48%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLP EAEWE RGG +P G + + Q
Sbjct: 466 KHPVVGVTWYGAAGYAAWVGKRLPKEAEWEIAARGGGVRHRYPCGEEI--------DKTQ 517
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
F +++T A VMSY N +GLY+M GNV+EW DW+ + + PK
Sbjct: 518 ANFFSSDTTA--------VMSYPPNGYGLYDMAGNVYEWCQDWYAYDFYEIAAQEIETPK 569
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA +
Sbjct: 570 GPVQGVYRVLRGG---CWKSLKNDLRCAHRHRNNPGAVNSTYGFRCAQE 615
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 72/166 (43%), Gaps = 24/166 (14%)
Query: 6 APPVERYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
A P +MV + G F G+ + + E P V L +F+LD H V+N QF ++
Sbjct: 375 ARPQPLLTEMVFIEGGVFSRGSLEG---QRDEHPVHQVFLQSFFLDIHPVTNEQFIRYLD 431
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWN 125
G E +K + + R K S++ +R G HPVV ++W
Sbjct: 432 CCG---EQDKHYNELI---------RLKDSRI----QRRSGKLVIEPGYAKHPVVGVTWY 475
Query: 126 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDST 171
A Y W G RLP EAEWE RGG +P G E ID T
Sbjct: 476 GAAGYAAWVGKRLPKEAEWEIAARGGGVRHRYPCG-----EEIDKT 516
>gi|410028720|ref|ZP_11278556.1| hypothetical protein MaAK2_05930 [Marinilabilia sp. AK2]
Length = 166
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 51/102 (50%), Positives = 70/102 (68%)
Query: 181 VHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFP 240
+HV++ DA AY W G RLPTEAEWE+ +GG ++ + WG P G + AN +QG+FP
Sbjct: 1 MHVAYEDAEAYAAWAGKRLPTEAEWEFAAKGGRAHQRYAWGEEFRPNGVYMANTFQGKFP 60
Query: 241 TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNV 282
N + DG+ TAPV S+ N FGLY+M+GNVWE T+DW++
Sbjct: 61 HQNESTDGFSGTAPVKSFSPNDFGLYDMIGNVWELTSDWYDA 102
Score = 45.1 bits (105), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 22/49 (44%), Positives = 32/49 (65%)
Query: 120 VHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGI 168
+H+++ DA AY W G RLPTEAEWE+ +GG ++ + WG P G+
Sbjct: 1 MHVAYEDAEAYAAWAGKRLPTEAEWEFAAKGGRAHQRYAWGEEFRPNGV 49
>gi|451982162|ref|ZP_21930490.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
gi|451760631|emb|CCQ91770.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
Length = 296
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 91/174 (52%), Gaps = 22/174 (12%)
Query: 173 EHRM-NHP---VVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRG 228
EH + HP V VSW DAV YCTW G RL TEAEWE R G ++PWGN + PR
Sbjct: 132 EHELLKHPNVAQVGVSWYDAVNYCTWAGKRLLTEAEWEKAAR-GTHGLVYPWGNEMLPR- 189
Query: 229 EHRANVWQGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWN-VHHH 285
RAN+ + AD Y AP+ S+ + +G+Y+M GNV EW DW++ ++
Sbjct: 190 --RANI--------HGTADDYQYLAPIGSFPMGRSVYGVYDMAGNVSEWVEDWYDQFYYQ 239
Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
AP NP G ++V +GGS+ + R + R TP +GFRC
Sbjct: 240 EAPMMNPTGAENKKNRVFRGGSWDSTKVDV---RASKRYAATPGRKDSVVGFRC 290
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/156 (33%), Positives = 74/156 (47%), Gaps = 38/156 (24%)
Query: 13 KDMVLLPGDTFRMGTNKPILIKDGEF---PSRNVTLDAFYLDQHEVSNTQFQEFVSATGY 69
++MVL+P F G+ +G+F P + V LDAFY+D++EVS + +F A Y
Sbjct: 68 REMVLVPEGVFSRGS------LEGDFDEKPPQEVYLDAFYVDKYEVSVEAYNKFRKAANY 121
Query: 70 VTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVA 129
V + F FEG ++H + V +SW DAV
Sbjct: 122 VEPSVPF---------------------------FEGEHELLKHP-NVAQVGVSWYDAVN 153
Query: 130 YCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
YCTW G RL TEAEWE R G ++PWG+ + P
Sbjct: 154 YCTWAGKRLLTEAEWEKAAR-GTHGLVYPWGNEMLP 188
>gi|15618072|ref|NP_224356.1| S/T protein kinase [Chlamydophila pneumoniae CWL029]
gi|15835683|ref|NP_300207.1| S/T protein kinase [Chlamydophila pneumoniae J138]
gi|55667873|sp|Q7AJA5.1|PKN1_CHLPN RecName: Full=Serine/threonine-protein kinase pkn1
gi|4376414|gb|AAD18301.1| S/T Protein Kinase [Chlamydophila pneumoniae CWL029]
gi|8978521|dbj|BAA98358.1| S/T protein kinase [Chlamydophila pneumoniae J138]
Length = 619
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ +P G + + RAN +
Sbjct: 468 KHPVVGVTWYGASGYAEWIGKRLPTEAEWEIAASGGVAALRYPCGEEIE---KSRANFFT 524
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
+ T VMSY N +GLY+M GNV+EW DW+ + + +P+
Sbjct: 525 AD-------------TTTVMSYPPNPYGLYDMAGNVYEWCQDWYGYDFYEISAQEPESPQ 571
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA +
Sbjct: 572 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCAKN 617
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MVL+ G F G+ + + E P V L +F+LD H V+N QF ++ G +E
Sbjct: 384 EMVLIEGGEFSRGSVEG---QRDELPVHKVILHSFFLDVHPVTNEQFIRYLECCG--SEQ 438
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
+K+ + + R + S+++ R IE HPVV ++W A Y
Sbjct: 439 DKYYNELI---------RLRDSRIQRRSGRL-----VIEPGYAKHPVVGVTWYGASGYAE 484
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLPTEAEWE GG+ +P G
Sbjct: 485 WIGKRLPTEAEWEIAASGGVAALRYPCG 512
>gi|16752896|ref|NP_445167.1| hypothetical protein CP0625 [Chlamydophila pneumoniae AR39]
gi|7189538|gb|AAF38440.1| conserved hypothetical protein [Chlamydophila pneumoniae AR39]
Length = 619
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ +P G + + RAN +
Sbjct: 468 KHPVVGVTWYGASGYAEWIGKRLPTEAEWEIAASGGVAALRYPCGEEIE---KSRANFFT 524
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
+ T VMSY N +GLY+M GNV+EW DW+ + + +P+
Sbjct: 525 AD-------------TTTVMSYPPNPYGLYDMAGNVYEWCQDWYGYDFYEISAQEPESPQ 571
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA +
Sbjct: 572 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCAKN 617
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MVL+ G F G+ + + E P V L +F+LD H V+N QF ++ G +E
Sbjct: 384 EMVLIEGGEFSRGSVEG---QRDELPVHKVILHSFFLDVHPVTNEQFIRYLECCG--SEQ 438
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
+K+ + + R + S+++ R IE HPVV ++W A Y
Sbjct: 439 DKYYNELI---------RLRDSRIQRRSGRL-----VIEPGYAKHPVVGVTWYGASGYAE 484
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLPTEAEWE GG+ +P G
Sbjct: 485 WIGKRLPTEAEWEIAASGGVAALRYPCG 512
>gi|295135973|ref|YP_003586649.1| gliding motility protein GldK [Zunongwangia profunda SM-A87]
gi|294983988|gb|ADF54453.1| gliding motility protein GldK [Zunongwangia profunda SM-A87]
Length = 456
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 110/412 (26%), Positives = 165/412 (40%), Gaps = 95/412 (23%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG------ 68
MVL+PG +F MG + + P + VT+++FY+D+ E++N ++++FV+
Sbjct: 41 MVLVPGGSFIMGKSDDDIAGQLNAPPKTVTVNSFYMDETEITNAEYRQFVNYVKDSVVRV 100
Query: 69 -YVTEAEKFGDT----FVFEPLLSEEERAKISQVRHDMKRFEGLDSTI---EHRMHHPVV 120
EAE G T + E ++ + + ++ + M G S + E+R + V
Sbjct: 101 MLAMEAELEGATPGASGIGEYAFADADESNLTPYQEYMLETYGTGSEMDAYENRKLNKDV 160
Query: 121 HISWNDA----VAYC-TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHR 175
I W+ VAY +P E + ++N F + SW+ E R
Sbjct: 161 DIIWDTEDYPDVAYARVMDQLYIPQEDAYNGTRTIDVKNLKFKY-SWMDVEAAVKGKGER 219
Query: 176 MNH----------------------------------------PVVHVSWNDAVAYCTWR 195
N+ PVV V+W A A+ WR
Sbjct: 220 RNYMKTEVVEVYPDTAVWIKDFNYSYNEPMHNDYFWHSAFDDYPVVGVTWKQARAFAQWR 279
Query: 196 GA-----------------RLPTEAEWEYGCRGGLENRLFPWGNNLTP--RGEHRANV-- 234
RLPTEAEWEY RGGLE +PWG T RG AN
Sbjct: 280 TKYNNDFRKSRGDSNVPHYRLPTEAEWEYAARGGLEGGTYPWGGPYTKNDRGCFMANFKP 339
Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKG 294
+G++ AAD L T S+ N +GLYNM GNV EW ++ + S
Sbjct: 340 LRGDY-----AADQALYTVEAKSFDPNGYGLYNMAGNVAEWVNSSYDAASYDYMS--SMN 392
Query: 295 PTTGT----DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
PT KV +GGS+ + Y + ++R DS+ +GFR D
Sbjct: 393 PTVNNYEDPRKVVRGGSW---KDVAYFLQVSSRDYEYADSARSYIGFRTVQD 441
>gi|332661980|ref|YP_004451449.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332337477|gb|AEE54576.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 655
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 93/191 (48%), Gaps = 42/191 (21%)
Query: 175 RMNHPVVHVSWNDAVAYCTWR------------------------GARLPTEAEWEYGCR 210
R P ++V+W DAV YC WR G RLPTEAEWEY R
Sbjct: 476 RGKRPAIYVNWFDAVDYCNWRSQQEGLSQVYQVNKQQVNPNWQANGYRLPTEAEWEYAAR 535
Query: 211 GGLENRLFPW-GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMV 269
GGL ++ F + G+N NV + + N+ T PV K N+ G+Y++
Sbjct: 536 GGLSSQGFTYAGSN---------NVDEVAWYDKNSGN----KTQPVGQKKANELGIYDLS 582
Query: 270 GNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
GNVWEW DW+ + A + +P GP TG+D+V +G S + R A RS NTPD
Sbjct: 583 GNVWEWCWDWYGAYPSSATN-DPNGPNTGSDRVIRGSSRYSDPAVV---RVAIRSYNTPD 638
Query: 330 SSAGNLGFRCA 340
++GFR A
Sbjct: 639 GRGDDIGFRLA 649
>gi|384449585|ref|YP_005662187.1| hypothetical protein CPK_ORF00662 [Chlamydophila pneumoniae LPCoLN]
gi|269303031|gb|ACZ33131.1| conserved hypothetical protein [Chlamydophila pneumoniae LPCoLN]
Length = 619
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ +P G + + RAN +
Sbjct: 468 KHPVVGVTWYGASGYAEWIGKRLPTEAEWEIAASGGVAALRYPCGEEIE---KSRANFFT 524
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
+ T VMSY N +GLY+M GNV+EW DW+ + + +P+
Sbjct: 525 AD-------------TTTVMSYPPNPYGLYDMAGNVYEWCQDWYGYDFYEISAQEPESPQ 571
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA +
Sbjct: 572 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCAKN 617
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MVL+ G F G+ + + E P V L +F+LD H V+N QF ++ G +E
Sbjct: 384 EMVLIEGGEFSRGSVEG---QRDELPVHKVILHSFFLDVHPVTNEQFIRYLECCG--SEQ 438
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
+K+ + + R + S+++ R IE HPVV ++W A Y
Sbjct: 439 DKYYNELI---------RLRDSRIQRRSGRL-----VIEPGYAKHPVVGVTWYGASGYAE 484
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLPTEAEWE GG+ +P G
Sbjct: 485 WIGKRLPTEAEWEIAASGGVAALRYPCG 512
>gi|91204532|emb|CAJ70760.1| similar to sulfatase modifying factor 1 (
C-alpha-formylglycine-generating enzyme) [Candidatus
Kuenenia stuttgartiensis]
Length = 312
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/290 (27%), Positives = 131/290 (45%), Gaps = 58/290 (20%)
Query: 82 FEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTE 141
E LL +++ + ++ +++ + E S + +PV + ++ +A ++ +E
Sbjct: 46 IEKLLEQDKSIETYRLNNEIPQREIFLSAY-YIDKYPVTNAQFSQFIAANGYKKKLYWSE 104
Query: 142 AEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPT 201
A W+Y P GS ID+ ++ + P V++SW A A+ W G RLPT
Sbjct: 105 AGWQYVLD------FNPSGS----NDIDAILQGDKDCPAVNISWYAAEAFANWAGKRLPT 154
Query: 202 EAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKEN 261
EAEWE RG + R++PWGN+ + R N + + + PV Y +
Sbjct: 155 EAEWEKAARGA-DGRIYPWGNDFD---KTRLNCAEAKIE----------KSTPVTQYPQG 200
Query: 262 K--FGLYNMVGNVWEWTADWWNV-HHHPAPSYNPKGP----------------------- 295
+ +G ++M GNVWEW DW++ ++H AP NP+GP
Sbjct: 201 QSVYGCFDMAGNVWEWIKDWYDSQYYHNAPDKNPQGPDVPEEKPYSGRPEEVGVSIYELK 260
Query: 296 -----TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
T T KV +GGS+ N RCA R + P + +GFRCA
Sbjct: 261 PSLSKTLTTCKVIRGGSW--NGSGIVHIRCANRDYDEPSFKSDIIGFRCA 308
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 55/167 (32%), Positives = 85/167 (50%), Gaps = 35/167 (20%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIK-------------DGEFPSRNVTLDAFYLDQHEVSNT 58
++DM+L+P TF MG+ K + K + E P R + L A+Y+D++ V+N
Sbjct: 26 FEDMILIPEGTFLMGSTKEDIEKLLEQDKSIETYRLNNEIPQREIFLSAYYIDKYPVTNA 85
Query: 59 QFQEFVSATGYVTEA--EKFGDTFV--FEPLLSEEERAKISQVRHDMKRFEGLDSTIEHR 114
QF +F++A GY + + G +V F P S + +D+ ++
Sbjct: 86 QFSQFIAANGYKKKLYWSEAGWQYVLDFNPSGSND-----------------IDAILQGD 128
Query: 115 MHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
P V+ISW A A+ W G RLPTEAEWE RG + R++PWG+
Sbjct: 129 KDCPAVNISWYAAEAFANWAGKRLPTEAEWEKAARGA-DGRIYPWGN 174
>gi|33241484|ref|NP_876425.1| protein kinase [Chlamydophila pneumoniae TW-183]
gi|33235992|gb|AAP98082.1| protein kinase [Chlamydophila pneumoniae TW-183]
Length = 619
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A Y W G RLPTEAEWE GG+ +P G + + RAN +
Sbjct: 468 KHPVVGVTWYGASGYAEWIGKRLPTEAEWEIAASGGVAALRYPCGEEVE---KSRANFFT 524
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY---NPK 293
+ T VMSY N +GLY+M GNV+EW DW+ + + +P+
Sbjct: 525 AD-------------TTTVMSYPPNPYGLYDMAGNVYEWCQDWYGYDFYEISAQEPESPQ 571
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA +
Sbjct: 572 GPAQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAVNSTYGFRCAKN 617
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MVL+ G F G+ + + E P V L +F+LD H V+N QF ++ G +E
Sbjct: 384 EMVLIEGGEFSRGSVEG---QRDELPVHKVILHSFFLDVHPVTNEQFIRYLECCG--SEQ 438
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
+K+ + + R + S+++ R IE HPVV ++W A Y
Sbjct: 439 DKYYNELI---------RLRDSRIQRRSGRL-----VIEPGYAKHPVVGVTWYGASGYAE 484
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLPTEAEWE GG+ +P G
Sbjct: 485 WIGKRLPTEAEWEIAASGGVAALRYPCG 512
>gi|194336145|ref|YP_002017939.1| hypothetical protein Ppha_1040 [Pelodictyon phaeoclathratiforme
BU-1]
gi|194308622|gb|ACF43322.1| protein of unknown function DUF323 [Pelodictyon phaeoclathratiforme
BU-1]
Length = 517
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 94/337 (27%), Positives = 140/337 (41%), Gaps = 103/337 (30%)
Query: 14 DMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
+ V + G TF MG+ D E P V L AFY+ ++EV+ ++F++F+ AT Y+T+
Sbjct: 275 NFVPIRGGTFFMGSPGSEDEAYDDEGPQHLVRLSAFYMSRYEVTVSEFRKFIEATEYLTD 334
Query: 73 AEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCT 132
AE+ + + D K + +HPV+++SWNDA++YC
Sbjct: 335 AEQV-------------------RSKRDWKCGVLGSRRPANEDNHPVIYVSWNDAISYCK 375
Query: 133 W------RGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWN 186
W + RLP E EWEY CR G R + P + +N
Sbjct: 376 WLSNKTGKTFRLPKEEEWEYACRAG----------------------SRTSTP-FNTGYN 412
Query: 187 DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAA 246
+ + G FP+ +N +GE R N
Sbjct: 413 LTTSQANYNGH--------------------FPYDHN--KKGEFRKN------------- 437
Query: 247 DGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGT-DKVKKG 305
T PV S+ N +GLYNM GNVWEW +W Y P G + T +V +G
Sbjct: 438 -----TVPVNSFAPNNWGLYNMHGNVWEWCENW----------YGPYGSSDDTFQRVIRG 482
Query: 306 GSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
G C +Q+ R R A R P++S ++GFR +
Sbjct: 483 G---CWDQHAGRCRSANRGYYPPENSLNHVGFRVVCE 516
>gi|406987727|gb|EKE07977.1| hypothetical protein ACD_17C00415G0002 [uncultured bacterium]
Length = 594
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 88/170 (51%), Gaps = 25/170 (14%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W AVAY W G RLPTE EWE C G E ++P G+++ Q
Sbjct: 444 KHPVVGVTWYGAVAYAKWIGKRLPTEVEWEAACSVGKEG-IYPTGSDIERS--------Q 494
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA----PSYNP 292
F +++T A VMSY N GLY+MVGNV+EW DW+ +++ P+Y P
Sbjct: 495 ANFFSSDTTA--------VMSYPPNLLGLYDMVGNVYEWCQDWYAYNYYDTSVVEPNY-P 545
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + R + R +N P + GFRCA D
Sbjct: 546 MGPPQGVYRVLRGG---CWKSLKEDLRSSHRHRNNPGAVNSTYGFRCATD 592
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/149 (36%), Positives = 72/149 (48%), Gaps = 21/149 (14%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+MV++PG T+ G+ + E P +V L +F LD H V+N QF F+ A G
Sbjct: 360 EMVVVPGGTYLRGSAEG---ARDEMPRHSVKLRSFALDIHPVTNEQFVRFLEAMG----G 412
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEH-RMHHPVVHISWNDAVAYCT 132
EK + I ++R + G TIE HPVV ++W AVAY
Sbjct: 413 EK------------DHNNNDIIRLRDSRIKRSGGKLTIESGYAKHPVVGVTWYGAVAYAK 460
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWGS 161
W G RLPTE EWE C G E ++P GS
Sbjct: 461 WIGKRLPTEVEWEAACSVGKEG-IYPTGS 488
>gi|116624323|ref|YP_826479.1| hypothetical protein Acid_5241 [Candidatus Solibacter usitatus
Ellin6076]
gi|116227485|gb|ABJ86194.1| protein of unknown function DUF323 [Candidatus Solibacter usitatus
Ellin6076]
Length = 226
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/172 (38%), Positives = 89/172 (51%), Gaps = 28/172 (16%)
Query: 178 HPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
PVV V+W DA A+C W+G RLPTEAEWE+ RGG RL+PWGN L
Sbjct: 73 QPVVAVNWFDAAAFCHWQGEQWNLHLRLPTEAEWEFAARGGFAQRLYPWGNELP------ 126
Query: 232 ANVWQGEFPTNNTAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APS 289
AAD + + PV N FGL+++ NV EW ADW++ ++ +P+
Sbjct: 127 -----------VIAADRWRNGPEPVGLGDPNGFGLFDLCQNVHEWCADWYDPGYYAVSPT 175
Query: 290 YNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAA 341
NP GP G + +GG++ + RCAARS P+ + GFR AA
Sbjct: 176 ENPHGPEHGKRRASRGGAW---RHHIKVSRCAARSSIPPEFRYSDYGFRVAA 224
Score = 71.6 bits (174), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/156 (31%), Positives = 71/156 (45%), Gaps = 41/156 (26%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D++ +PG F MG +D E P V L F L +++V+N + F AT + +
Sbjct: 5 DLLPVPGGEFLMGQAD---GRDEERPPHRVALTPFRLCRYQVTNADYDHFRKAT--LRDK 59
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
++ D P S+ + PVV ++W DA A+C W
Sbjct: 60 SEYRD----RPEFSDPAQ--------------------------PVVAVNWFDAAAFCHW 89
Query: 134 RGA------RLPTEAEWEYGCRGGLENRLFPWGSWL 163
+G RLPTEAEWE+ RGG RL+PWG+ L
Sbjct: 90 QGEQWNLHLRLPTEAEWEFAARGGFAQRLYPWGNEL 125
>gi|384097854|ref|ZP_09998974.1| gliding motility membrane lipoprotein [Imtechella halotolerans K1]
gi|383836736|gb|EID76143.1| gliding motility membrane lipoprotein [Imtechella halotolerans K1]
Length = 460
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 111/417 (26%), Positives = 170/417 (40%), Gaps = 91/417 (21%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQ---FQEFVSATGYVT 71
M L+PG +F MG + + P++ T+ +FY+D+ E++N+Q F E+V + T
Sbjct: 41 MTLVPGGSFIMGKPDDDVAQLYNAPTKTATVRSFYMDETEITNSQYRQFVEWVKDSVVRT 100
Query: 72 EAEKFGDTFVFEP---------LLSEEERAKISQVRHDMKRFEGLDST-IEHRMHHPVVH 121
D P L + ++ + GL T E R + +
Sbjct: 101 RLAILADEMQLTPDDGGIGEYAFLDADTTNMTEYEKYMFNNYVGLGPTGYEGRKLNKKIK 160
Query: 122 ISWNDA------------VAYC----TWRGAR------LPTEAEWEYGCRGGLENRLFPW 159
I W+ + Y T+ G R L + +W ++++
Sbjct: 161 IEWSTSKYPDQYYAEVMDSMYIPVDQTYNGRRTIDVNKLVFKYQWR-DINAAVQSKTRKD 219
Query: 160 GSWLHPEGI----DSTI-----EHRMN---------------HPVVHVSWNDAVAYCTWR 195
++L E + D+T+ E+ N +PVV V+WN A A+C WR
Sbjct: 220 KNFLRDEELKIYPDTTVWIRDFEYSYNEPMHNDYFWHQAYGEYPVVGVTWNQARAFCEWR 279
Query: 196 -----------------GARLPTEAEWEYGCRGGLENRLFPWGN--NLTPRGEHRANV-- 234
RLPTEAEWEY RGGLE+ +PWG L RG AN
Sbjct: 280 TLLKNGYQKSRGLQPVNNFRLPTEAEWEYAARGGLESATYPWGGPYILNDRGCFMANFKP 339
Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-APSYNPK 293
+G++ AAD L T SY N + LYNM GNV EWT ++ + + + NP
Sbjct: 340 LRGDY-----AADEALYTVEAKSYDPNDYNLYNMAGNVAEWTNSSYDPNSYEYVSTMNPN 394
Query: 294 G-PTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
T KV +GGS+ + Y + + R+ D++ +GFR D T T
Sbjct: 395 AYDTDNKRKVIRGGSW---KDVAYFLQVSTRAFEYQDTARSYIGFRTVQDYMGTEAT 448
>gi|332666261|ref|YP_004449049.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332335075|gb|AEE52176.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 958
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 89/188 (47%), Gaps = 40/188 (21%)
Query: 177 NHPVVHVSWNDAVAYCTW------------------------RGARLPTEAEWEYGCRGG 212
N+P V++SW DA YC W +G RLPTEAEWEY RGG
Sbjct: 784 NNPAVYLSWYDATDYCNWVSTQMGYPNVYARQGDEVSINWNAKGFRLPTEAEWEYAARGG 843
Query: 213 LENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNV 272
+ + F + + E W E N+ + T PV K N GL++M GNV
Sbjct: 844 AQQQNFEYSGS----NEIEEVAWYAE----NSGS----RTRPVGIKKANSLGLFDMSGNV 891
Query: 273 WEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSA 332
WEW DW+ + PA +P+GP G+ +V++GGS+ Y + R A R TPD
Sbjct: 892 WEWCWDWYG-DYDPAAKADPRGPNKGSLRVRRGGSWFY---YPFSARVADRYDVTPDRRD 947
Query: 333 GNLGFRCA 340
+GFR A
Sbjct: 948 DGIGFRLA 955
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 32/68 (47%), Gaps = 24/68 (35%)
Query: 116 HHPVVHISWNDAVAYCTW------------------------RGARLPTEAEWEYGCRGG 151
++P V++SW DA YC W +G RLPTEAEWEY RGG
Sbjct: 784 NNPAVYLSWYDATDYCNWVSTQMGYPNVYARQGDEVSINWNAKGFRLPTEAEWEYAARGG 843
Query: 152 LENRLFPW 159
+ + F +
Sbjct: 844 AQQQNFEY 851
>gi|386815297|ref|ZP_10102515.1| Sulphatase-modifying factor protein [Thiothrix nivea DSM 5205]
gi|386419873|gb|EIJ33708.1| Sulphatase-modifying factor protein [Thiothrix nivea DSM 5205]
Length = 367
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/174 (39%), Positives = 88/174 (50%), Gaps = 21/174 (12%)
Query: 175 RMNHPVVHVSWNDAV-AYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPR 227
R N PV++VSW D Y W R RLPTEAEWEY RGG + +PWGN ++
Sbjct: 199 RGNRPVINVSWEDITQKYIPWLNQQTGRHYRLPTEAEWEYAARGG-SDTAYPWGNTISC- 256
Query: 228 GEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHP 286
N G + + T PV SY N +GLY+ GNVWEW DW+ ++
Sbjct: 257 ----GNANYGSYSNQCET----VRTKPVGSYTANGYGLYDTAGNVWEWCQDWYAGDYYKS 308
Query: 287 APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+P P GP G +V +GGS+ + QY R A RS +TP LGFR A
Sbjct: 309 SPVSAPHGPGAGAARVLRGGSWYNDAQYV---RSADRSHDTPVPRINLLGFRLA 359
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 66/160 (41%), Gaps = 48/160 (30%)
Query: 14 DMVLLPGDTFRMG-----TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATG 68
+M L+P F MG + +D E P+ V+++AFY+ ++EV+ Q+
Sbjct: 129 EMQLIPAGKFMMGCVEGRDDVEGGCEDDEKPAHEVSINAFYMGKYEVTFDQWDAC----- 183
Query: 69 YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAV 128
E+AK D G + PV+++SW D
Sbjct: 184 ---------------------EQAKACPHAGDSGWGRG---------NRPVINVSWEDIT 213
Query: 129 -AYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGS 161
Y W R RLPTEAEWEY RGG + +PWG+
Sbjct: 214 QKYIPWLNQQTGRHYRLPTEAEWEYAARGG-SDTAYPWGN 252
>gi|291278721|ref|YP_003495556.1| hypothetical protein DEFDS_0296 [Deferribacter desulfuricans SSM1]
gi|290753423|dbj|BAI79800.1| conserved hypothetical protein [Deferribacter desulfuricans SSM1]
Length = 280
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 72/185 (38%), Positives = 99/185 (53%), Gaps = 24/185 (12%)
Query: 178 HPVVHVSWNDAVAYCTWRGAR------LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
PVV++SW DA + W R LPTEAEWEY R G + F WG+N++ E+
Sbjct: 100 QPVVNISWEDAKNFTKWLSKRNDIIFRLPTEAEWEYTARAG-SRKDFYWGDNVSVICEY- 157
Query: 232 ANV--------WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NV 282
ANV ++ N DG+ TAPV S+K NKFGLY+M+GNVWEWT D + +
Sbjct: 158 ANVHDTTSKKTFKNFTWDNFQCNDGFAVTAPVGSFKPNKFGLYDMLGNVWEWTEDVYAHD 217
Query: 283 HHHPAPSYNPKGPTTGTDK----VKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
++ +P NPKGP +K V +GG + C R A+R+ + DS + +GFR
Sbjct: 218 YYLKSPLRNPKGPKPELNKENKRVIRGGGWPCKPSLV---RLASRNWDYQDSISVRIGFR 274
Query: 339 CAADK 343
K
Sbjct: 275 IVMIK 279
>gi|385239656|ref|YP_005807498.1| serine-threonine-protein kinase [Chlamydia trachomatis G/9768]
gi|385240576|ref|YP_005808417.1| serine-threonine-protein kinase [Chlamydia trachomatis G/11222]
gi|385242432|ref|YP_005810271.1| serine-threonine-protein kinase [Chlamydia trachomatis G/9301]
gi|385246042|ref|YP_005814864.1| serine-threonine-protein kinase [Chlamydia trachomatis G/11074]
gi|296435661|gb|ADH17835.1| serine-threonine-protein kinase [Chlamydia trachomatis G/9768]
gi|296436584|gb|ADH18754.1| serine-threonine-protein kinase [Chlamydia trachomatis G/11222]
gi|296437521|gb|ADH19682.1| serine-threonine-protein kinase [Chlamydia trachomatis G/11074]
gi|297140020|gb|ADH96778.1| serine-threonine-protein kinase [Chlamydia trachomatis G/9301]
Length = 614
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPIGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPIG 507
>gi|237802574|ref|YP_002887768.1| serine-threonine-protein kinase [Chlamydia trachomatis B/Jali20/OT]
gi|231273808|emb|CAX10592.1| serine-threonine-protein kinase [Chlamydia trachomatis B/Jali20/OT]
Length = 614
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPIGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPIG 507
>gi|302039307|ref|YP_003799629.1| hypothetical protein NIDE4035 [Candidatus Nitrospira defluvii]
gi|300607371|emb|CBK43704.1| conserved exported protein of unknown function, Sulfatase-modifying
factor-like [Candidatus Nitrospira defluvii]
Length = 306
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 65/173 (37%), Positives = 95/173 (54%), Gaps = 17/173 (9%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
PVV V+W DA YC W G RLPTEAEWE R G E R++PWG+ RG A
Sbjct: 144 KRPVVGVNWYDARDYCRWAGKRLPTEAEWEIAAR-GTEGRIYPWGSAHPTRGHANA---- 198
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKF--GLYNMVGNVWEWTADWWN-VHHHPAPSYNPK 293
T GY + + V ++ K G+Y++ GN+WEW ADW++ ++ + NPK
Sbjct: 199 -----GETRWRGYDTLSNVGRFELGKTPEGVYDLSGNLWEWVADWYDPTYYQFSVRDNPK 253
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
GP+ G + +GG++ + + R + R+ PD+ ++GFRCA D GP+
Sbjct: 254 GPSAGPLRALRGGAWNNDSKAI---RSSNRAGYAPDARRNDVGFRCAQD-GPS 302
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 73/155 (47%), Gaps = 36/155 (23%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
R MVL+P F MG+++ ++ E P V++ AFYLD +E + + EF
Sbjct: 73 RGTPMVLIPAGEFAMGSDR---GQEDEQPVHRVSVKAFYLDVYETTVAHYAEF------- 122
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
L S++ A F+ ++T PVV ++W DA Y
Sbjct: 123 --------------LASQKPDAP----------FKWSEATAGTHDKRPVVGVNWYDARDY 158
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHP 165
C W G RLPTEAEWE R G E R++PWGS HP
Sbjct: 159 CRWAGKRLPTEAEWEIAAR-GTEGRIYPWGS-AHP 191
>gi|27465032|gb|AAN71625.1| serine/threonine kinase [Chlamydia trachomatis]
Length = 614
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507
>gi|440525059|emb|CCP50310.1| putative ATPase [Chlamydia trachomatis K/SotonK1]
Length = 614
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507
>gi|76788860|ref|YP_327946.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
A/HAR-13]
gi|255348507|ref|ZP_05380514.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis 70]
gi|255503048|ref|ZP_05381438.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis 70s]
gi|255506720|ref|ZP_05382359.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
D(s)2923]
gi|385241509|ref|YP_005809349.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
E/11023]
gi|385245118|ref|YP_005813941.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis E/150]
gi|389857840|ref|YP_006360082.1| serine-threonine-protein kinase [Chlamydia trachomatis F/SW4]
gi|389858714|ref|YP_006360955.1| serine-threonine-protein kinase [Chlamydia trachomatis E/SW3]
gi|389859591|ref|YP_006361831.1| serine-threonine-protein kinase [Chlamydia trachomatis F/SW5]
gi|76167390|gb|AAX50398.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
A/HAR-13]
gi|296434734|gb|ADH16912.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis E/150]
gi|296438452|gb|ADH20605.1| serine/threonine-protein kinase PKN1 [Chlamydia trachomatis
E/11023]
gi|380248911|emb|CCE14199.1| serine-threonine-protein kinase [Chlamydia trachomatis F/SW5]
gi|380249787|emb|CCE13310.1| serine-threonine-protein kinase [Chlamydia trachomatis F/SW4]
gi|380250663|emb|CCE12421.1| serine-threonine-protein kinase [Chlamydia trachomatis E/SW3]
gi|440526841|emb|CCP52325.1| putative ATPase [Chlamydia trachomatis D/SotonD1]
gi|440529517|emb|CCP55001.1| putative ATPase [Chlamydia trachomatis E/SotonE4]
gi|440530410|emb|CCP55894.1| putative ATPase [Chlamydia trachomatis E/SotonE8]
Length = 614
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507
>gi|237804492|ref|YP_002888646.1| serine-threonine-protein kinase [Chlamydia trachomatis
B/TZ1A828/OT]
gi|376282151|ref|YP_005155977.1| serine-threonine-protein kinase [Chlamydia trachomatis A2497]
gi|385269804|ref|YP_005812964.1| NIRV family protein [Chlamydia trachomatis A2497]
gi|231272792|emb|CAX09698.1| serine-threonine-protein kinase [Chlamydia trachomatis
B/TZ1A828/OT]
gi|347974944|gb|AEP34965.1| Serine/threonine protein kinase [Chlamydia trachomatis A2497]
gi|371908181|emb|CAX08804.1| serine-threonine-protein kinase [Chlamydia trachomatis A2497]
gi|438690063|emb|CCP49320.1| putative ATPase [Chlamydia trachomatis A/7249]
gi|438691147|emb|CCP48421.1| putative ATPase [Chlamydia trachomatis A/5291]
gi|438692520|emb|CCP47522.1| putative ATPase [Chlamydia trachomatis A/363]
gi|440534882|emb|CCP60392.1| putative ATPase [Chlamydia trachomatis E/Bour]
Length = 614
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507
>gi|255310945|ref|ZP_05353515.1| serine-threonine-protein kinase [Chlamydia trachomatis 6276]
gi|255317246|ref|ZP_05358492.1| serine-threonine-protein kinase [Chlamydia trachomatis 6276s]
gi|440533091|emb|CCP58601.1| putative ATPase [Chlamydia trachomatis Ia/SotonIa1]
gi|440533985|emb|CCP59495.1| putative ATPase [Chlamydia trachomatis Ia/SotonIa3]
Length = 614
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLEYVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507
>gi|166154366|ref|YP_001654484.1| serine-threonine-protein kinase [Chlamydia trachomatis 434/Bu]
gi|166155241|ref|YP_001653496.1| serine-threonine-protein kinase [Chlamydia trachomatis
L2b/UCH-1/proctitis]
gi|301335619|ref|ZP_07223863.1| serine-threonine-protein kinase [Chlamydia trachomatis L2tet1]
gi|339625799|ref|YP_004717278.1| protein tyrosine kinase family protein [Chlamydia trachomatis L2c]
gi|165930354|emb|CAP03840.1| serine-threonine-protein kinase [Chlamydia trachomatis 434/Bu]
gi|165931229|emb|CAP06794.1| serine-threonine-protein kinase [Chlamydia trachomatis
L2b/UCH-1/proctitis]
gi|339460555|gb|AEJ77058.1| protein tyrosine kinase family protein [Chlamydia trachomatis L2c]
gi|440525951|emb|CCP51435.1| putative ATPase [Chlamydia trachomatis L2b/8200/07]
gi|440535775|emb|CCP61288.1| putative ATPase [Chlamydia trachomatis L2b/795]
gi|440536665|emb|CCP62179.1| putative ATPase [Chlamydia trachomatis L1/440/LN]
gi|440537557|emb|CCP63071.1| putative ATPase [Chlamydia trachomatis L1/1322/p2]
gi|440538447|emb|CCP63961.1| putative ATPase [Chlamydia trachomatis L1/115]
gi|440539336|emb|CCP64850.1| putative ATPase [Chlamydia trachomatis L1/224]
gi|440540226|emb|CCP65740.1| putative ATPase [Chlamydia trachomatis L2/25667R]
gi|440541116|emb|CCP66630.1| putative ATPase [Chlamydia trachomatis L3/404/LN]
gi|440542004|emb|CCP67518.1| putative ATPase [Chlamydia trachomatis L2b/UCH-2]
gi|440542894|emb|CCP68408.1| putative ATPase [Chlamydia trachomatis L2b/Canada2]
gi|440543786|emb|CCP69300.1| putative ATPase [Chlamydia trachomatis L2b/LST]
gi|440544676|emb|CCP70190.1| putative ATPase [Chlamydia trachomatis L2b/Ams1]
gi|440545566|emb|CCP71080.1| putative ATPase [Chlamydia trachomatis L2b/CV204]
gi|440913828|emb|CCP90245.1| putative ATPase [Chlamydia trachomatis L2b/Ams2]
gi|440914718|emb|CCP91135.1| putative ATPase [Chlamydia trachomatis L2b/Ams3]
gi|440915609|emb|CCP92026.1| putative ATPase [Chlamydia trachomatis L2b/Canada1]
gi|440916503|emb|CCP92920.1| putative ATPase [Chlamydia trachomatis L2b/Ams4]
gi|440917393|emb|CCP93810.1| putative ATPase [Chlamydia trachomatis L2b/Ams5]
Length = 614
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507
>gi|15604864|ref|NP_219648.1| serine/threonine protein kinase [Chlamydia trachomatis D/UW-3/CX]
gi|385243352|ref|YP_005811198.1| NIRV family protein [Chlamydia trachomatis D-EC]
gi|385244232|ref|YP_005812076.1| NIRV family protein [Chlamydia trachomatis D-LC]
gi|386262498|ref|YP_005815777.1| serine-threonine-protein kinase [Chlamydia trachomatis Sweden2]
gi|55667872|sp|O84147.1|PKN1_CHLTR RecName: Full=Serine/threonine-protein kinase pkn1
gi|3328546|gb|AAC67736.1| Serine/threonine Protein Kinase [Chlamydia trachomatis D/UW-3/CX]
gi|289525186|emb|CBJ14661.1| serine-threonine-protein kinase [Chlamydia trachomatis Sweden2]
gi|297748275|gb|ADI50821.1| Serine/threonine protein kinase [Chlamydia trachomatis D-EC]
gi|297749155|gb|ADI51833.1| Serine/threonine protein kinase [Chlamydia trachomatis D-LC]
gi|440527735|emb|CCP53219.1| putative ATPase [Chlamydia trachomatis D/SotonD5]
gi|440528626|emb|CCP54110.1| putative ATPase [Chlamydia trachomatis D/SotonD6]
gi|440531307|emb|CCP56817.1| putative ATPase [Chlamydia trachomatis F/SotonF3]
gi|440532198|emb|CCP57708.1| putative ATPase [Chlamydia trachomatis G/SotonG1]
Length = 614
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 83/169 (49%), Gaps = 22/169 (13%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
HPVV V+W A +Y W G RLP+EAEWE GG +P G + + +AN +
Sbjct: 463 KHPVVGVTWYGASSYACWIGKRLPSEAEWEVAASGGKLGLRYPTGEEID---KSKANFFS 519
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK--- 293
+ T PVMSY + GLY+M GNV+EW DW++ + + + P
Sbjct: 520 SD-------------TTPVMSYPSSILGLYDMAGNVYEWCQDWYSYDFYESSALEPDAPL 566
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
GP G +V +GG C + RCA R +N P + GFRCA D
Sbjct: 567 GPPQGVYRVLRGG---CWKSLKDDLRCAHRHRNNPGAINSTYGFRCAKD 612
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 69/148 (46%), Gaps = 20/148 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+M+ + G F G+ + E P N+TL F+LD H V+N QF F+ G +E
Sbjct: 379 EMIFIEGGEFSRGSGDG---QRDELPVHNITLPGFFLDIHPVTNEQFVRFLECVG--SEQ 433
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIE-HRMHHPVVHISWNDAVAYCT 132
++ + + R K S+++ R IE HPVV ++W A +Y
Sbjct: 434 DEHYNELI---------RLKDSRIQRRSGRL-----IIEPGYAKHPVVGVTWYGASSYAC 479
Query: 133 WRGARLPTEAEWEYGCRGGLENRLFPWG 160
W G RLP+EAEWE GG +P G
Sbjct: 480 WIGKRLPSEAEWEVAASGGKLGLRYPTG 507
>gi|260831884|ref|XP_002610888.1| hypothetical protein BRAFLDRAFT_91482 [Branchiostoma floridae]
gi|229296257|gb|EEN66898.1| hypothetical protein BRAFLDRAFT_91482 [Branchiostoma floridae]
Length = 297
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 94/200 (47%), Gaps = 29/200 (14%)
Query: 164 HPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNN 223
P G S I ++ HPVVHV W DA AYC W G RLPTE EWE +GGL+ R PWG
Sbjct: 56 QPAGPRSNIMDKLQHPVVHVRWTDAAAYCYWAGKRLPTEEEWEVAAKGGLDGRKLPWGGR 115
Query: 224 LTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKF-------GLY-NMVGNVWEW 275
+P R N+WQ L + ++ +F G Y +GNVWEW
Sbjct: 116 YSP---ERLNIWQ---------TTTLLKMVTSLRHQFCRFQSRTITVGTYVRPLGNVWEW 163
Query: 276 TADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHR--CAARSQNTPDSSAG 333
T+ + P + + V +GGS+L ++ + +R R D
Sbjct: 164 TSSQF------IPPGMRREEVEDKEYVVRGGSWLDSKDGSFNNRVHVTTRKGQAADVGCD 217
Query: 334 NLGFRCAADKGPTTGTDKVK 353
++GFRCA PT+ +DKV+
Sbjct: 218 HVGFRCAQSVVPTS-SDKVR 236
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 48/121 (39%), Positives = 71/121 (58%), Gaps = 4/121 (3%)
Query: 51 DQHEVSNTQFQEFVSATG--YVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLD 108
D VS + ++F+ + G + T++EK G + VF+ L+SEE + + ++ + G
Sbjct: 4 DPAVVSRFRIRKFIRSKGKRFQTDSEKIGWSHVFQDLVSEEVKQNFTV--ENVTQPAGPR 61
Query: 109 STIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGI 168
S I ++ HPVVH+ W DA AYC W G RLPTE EWE +GGL+ R PWG PE +
Sbjct: 62 SNIMDKLQHPVVHVRWTDAAAYCYWAGKRLPTEEEWEVAAKGGLDGRKLPWGGRYSPERL 121
Query: 169 D 169
+
Sbjct: 122 N 122
>gi|325105235|ref|YP_004274889.1| protein involved in gliding motility GldK [Pedobacter saltans DSM
12145]
gi|324974083|gb|ADY53067.1| protein involved in gliding motility GldK [Pedobacter saltans DSM
12145]
Length = 451
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 108/411 (26%), Positives = 169/411 (41%), Gaps = 91/411 (22%)
Query: 15 MVLLPGDTFRMG-TNKPILIKDGEFP-SRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTE 72
MV +P TF MG ++ + I +FP ++ VT+ AFY+D+ E++N+++++FV
Sbjct: 47 MVYIPAGTFVMGQVDQDVTI--SQFPQNKQVTISAFYMDETEITNSEYKQFVHWVRDSIA 104
Query: 73 AEKFG---DTFVFEPLLSEE--------------------ERAKISQVRHDM-----KRF 104
++ G D ++ +P + ++A + ++ +M +RF
Sbjct: 105 VKRLGNGADKYMLKPKGKDANTGATFIDWKKVQSGKGIWGDKALVEELNKNMYYQDNERF 164
Query: 105 EGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLH 164
G ++ + W D RGAR P + G + + +++
Sbjct: 165 FGKKELNVEQLKY---DYEWIDYRKAADMRGARDPNRNKNGLGVT--RDQFIVKEEVYVY 219
Query: 165 PEGI----DSTIEHR-------------MNHPVVHVSWNDAVAYCTWRGA---------- 197
P+ + D T N+PVV VSW A A+ WR
Sbjct: 220 PDTLVWISDFTYAQNDPMTKGYFSHPSFANYPVVGVSWKQAKAFTVWRSRLNEAYKTSRN 279
Query: 198 -------RLPTEAEWEYGCRGGLENRLFPWGNNLT--PRGEHRANVWQGEFPTNNTAADG 248
LPTEA++EY RGG +PWG +G AN G N DG
Sbjct: 280 LPSRLDYDLPTEAQFEYAARGGRVGTTYPWGGPYIRNAKGCLLANFKPGR---GNYIDDG 336
Query: 249 YLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-----VHHHPAPSYNPK----GPTTGT 299
T SY N +GLYNM GNV EWT+ +N + PSY+ P
Sbjct: 337 GAFTVYAKSYFPNDYGLYNMAGNVAEWTSSIYNPTATAYINTLNPSYDKAVKEGDPDYDK 396
Query: 300 DKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTD 350
+V KGGS+ + Y + +A+S PDS+ ++GFRC P G D
Sbjct: 397 RRVVKGGSW---KDIGYFLQNSAKSYEYPDSARSSIGFRCVT---PFVGRD 441
>gi|294661128|ref|YP_003573003.1| hypothetical protein Aasi_1513 [Candidatus Amoebophilus asiaticus
5a2]
gi|227336278|gb|ACP20875.1| hypothetical protein Aasi_1513 [Candidatus Amoebophilus asiaticus
5a2]
Length = 369
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 99/357 (27%), Positives = 145/357 (40%), Gaps = 84/357 (23%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MVL+P F MG + + K P+R V++ +F++D+ EV+N +++ F++ E
Sbjct: 58 MVLIPTGAFMMGGVEEDIFKRAN-PNRRVSVSSFFMDETEVTNNEYRYFLTKV-----KE 111
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+F D F P + E I D + + LD + V+
Sbjct: 112 RF-DYFKNNPSIGENTGTTID----DGQTTDPLDQKLTDEFIKDVLT------------- 153
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P W + +R+ + G D N+PVV VSW A + W
Sbjct: 154 ----PNMDVWRTDFTNHMADRILE--GYFELRGYD-------NYPVVGVSWEAASYFAAW 200
Query: 195 R-----------------GARLPTEAEWEYGCRGGLENRLFPWGNNLT--PRGEHRANVW 235
R LP+ A+WEY RGG E +PWG G RAN
Sbjct: 201 RTKYFNENKEKKGLEKYPSFSLPSAAQWEYAARGGKELAKYPWGGPYIRDAEGNLRANFK 260
Query: 236 QGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-----VHHHPAPSY 290
G+ N + GY T+PV ++ N +GLY+M GNV EWT D +N + P Y
Sbjct: 261 SGQ---GNYSECGYTYTSPVTAFSPNDYGLYDMAGNVAEWTLDAYNPAAVALTWDSDPLY 317
Query: 291 -NPKGPTTGTDKVKKGGS------YLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+ + P K+ KGGS +L Y Y H+ ARS +GFRC
Sbjct: 318 LDDEQPM----KIIKGGSWKDISRFLQTGAYDYEHKDTARSY---------IGFRCV 361
>gi|254444401|ref|ZP_05057877.1| conserved domain protein [Verrucomicrobiae bacterium DG1235]
gi|198258709|gb|EDY83017.1| conserved domain protein [Verrucomicrobiae bacterium DG1235]
Length = 336
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 70/174 (40%), Positives = 90/174 (51%), Gaps = 16/174 (9%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLT-----PRGEHR 231
N PVV V W DA AY W G RLPTEAEWE RG E RL+PWGN+ + G+
Sbjct: 170 NMPVVGVDWFDAYAYAKWAGKRLPTEAEWELAARGDDE-RLWPWGNSWSWGLCNIGGDKS 228
Query: 232 ANVWQGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHHPA-P 288
+ + P DGY+ APV S+ ++ FG +M GNV EW ADW++ + P
Sbjct: 229 GKDARDKSPDR----DGYIYPAPVGSFADSASPFGCMDMAGNVAEWCADWYSEDAYKVFP 284
Query: 289 SYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
NP G + +GGS ++ RC+ARS P+ LGFRCA D
Sbjct: 285 DRNPIGTIDSGYRSIRGGS---SQSVPSGVRCSARSFEEPEFRKFTLGFRCAKD 335
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 85/188 (45%), Gaps = 50/188 (26%)
Query: 14 DMVLLPGDTFRMGTN----------------KP--ILIK--------DGEFPSRNVTLDA 47
+MV++PG F MG + +P +LI + E P R V +D
Sbjct: 40 EMVIIPGSIFAMGNDLTHPDFANQAVEGKALRPYHVLIARAKKAWQMEAERPVRKVRVDR 99
Query: 48 FYLDQHEVSNTQFQEF---VSATGYVT---EAEKFGDTFV------FEPLLSEEERAKIS 95
F +D+HEV+N Q++ F V G + E + G + PLL +E A ++
Sbjct: 100 FAIDRHEVTNRQYRLFLDWVQLNGDDSVRHEDQPVGKDHTPRYWTSYNPLLQDEATAALA 159
Query: 96 QVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENR 155
S + + PVV + W DA AY W G RLPTEAEWE RG E R
Sbjct: 160 P----------FSSETFLKDNMPVVGVDWFDAYAYAKWAGKRLPTEAEWELAARGDDE-R 208
Query: 156 LFPWG-SW 162
L+PWG SW
Sbjct: 209 LWPWGNSW 216
>gi|120613095|ref|YP_972773.1| hypothetical protein Aave_4459 [Acidovorax citrulli AAC00-1]
gi|120591559|gb|ABM34999.1| protein of unknown function DUF323 [Acidovorax citrulli AAC00-1]
Length = 390
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/182 (37%), Positives = 92/182 (50%), Gaps = 28/182 (15%)
Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
+HPVV+++W+DA A W A RLPTEAEWEY CR G R + GN+ H
Sbjct: 197 DHPVVNITWHDAHAMAAWLSATEGHRYRLPTEAEWEYACRAGKAAR-YGHGNDPAALPRH 255
Query: 231 RANVWQGEFP-------TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH 283
Q P DG+ TAPV Y N FGL++M GNVWEW +DW
Sbjct: 256 ANTFDQAAAPYWPRWQAQAVPGHDGHAFTAPVAGYPANAFGLHDMSGNVWEWVSDW---- 311
Query: 284 HHPAPSY------NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 337
H +Y +P+GP TGT +V++GGS+ Y RC R+ N+P++ +G
Sbjct: 312 -HADDAYARSARDDPQGPATGTVRVRRGGSWHTWPLYA---RCTYRNWNSPETRYTLVGM 367
Query: 338 RC 339
R
Sbjct: 368 RL 369
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/164 (33%), Positives = 77/164 (46%), Gaps = 23/164 (14%)
Query: 15 MVLLPGDTFRMGTNKPI--LIKD-------------GEFPSRNVTLD-AFYLDQHEVSNT 58
V +P +FRMG+++ + L +D E P V + AFYL Q EV+
Sbjct: 79 FVRVPAGSFRMGSDESVKHLARDYPGMEAGRLSALADEAPVHTVRITRAFYLGQTEVTVG 138
Query: 59 QFQEFVSATGYVTEAEKFGDT-FVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
QF+ FV A+GY E++ G + + P ++ A+ R+ H
Sbjct: 139 QFRRFVEASGYRAESDADGTGGYGYNPAYDPDKSARGDAFEGRSPRYSWRYPGFAQGDDH 198
Query: 118 PVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENR 155
PVV+I+W+DA A W A RLPTEAEWEY CR G R
Sbjct: 199 PVVNITWHDAHAMAAWLSATEGHRYRLPTEAEWEYACRAGKAAR 242
>gi|451981317|ref|ZP_21929679.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
gi|451761462|emb|CCQ90935.1| conserved hypothetical protein [Nitrospina gracilis 3/211]
Length = 280
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 165 PEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNL 224
P+ +D+ ++ PVV V+W +A YC W G RLPTEAEWE R G + R +PWGN+
Sbjct: 120 PKYMDNEKFNQPTQPVVGVTWQEAYDYCEWAGKRLPTEAEWEKAAR-GPDGRFYPWGNDP 178
Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWWNV 282
P RAN DG+ APV ++ + + +G +M GNVWEWTADW
Sbjct: 179 DPT---RANA--------RGMKDGHRYPAPVGTFPKGASPYGALDMAGNVWEWTADW--- 224
Query: 283 HHHPAPSYNPKGPTTG-TDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+ P P K G T +V +GGS+ N R R + TPD +GFRCA
Sbjct: 225 -YLPYPENLVKNDLYGNTFRVMRGGSWFSNMDLA---RSTVRGKLTPDQRQNYIGFRCA 279
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 72/160 (45%), Gaps = 46/160 (28%)
Query: 15 MVLLPGDTFRMGTN--------KPILIKDGEF-----PSRNVTLDAFYLDQHEVSNTQFQ 61
MV +P MG++ K + E+ PS V ++AF +DQHEV+N ++
Sbjct: 50 MVFVPAGFLTMGSSEEDIQWAAKTFFSESLEYYRDETPSHKVQVEAFEIDQHEVTNGEYG 109
Query: 62 EFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVH 121
+++ ATG A K+ D F ++ PVV
Sbjct: 110 KYLKATG--RPAPKYMDNEKF------------------------------NQPTQPVVG 137
Query: 122 ISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
++W +A YC W G RLPTEAEWE R G + R +PWG+
Sbjct: 138 VTWQEAYDYCEWAGKRLPTEAEWEKAAR-GPDGRFYPWGN 176
>gi|268325661|emb|CBH39249.1| hypothetical protein, calcineurin-like phosphoesterase, NACHT domain,
and DUF323 family [uncultured archaeon]
Length = 1107
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 92/174 (52%), Gaps = 30/174 (17%)
Query: 177 NHPVVHVSWNDAVAYCTWRGAR------LPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
N PVV + W +A AY W R LPTEAE E RG + +PWG+
Sbjct: 949 NFPVVGIIWYEAEAYANWLSKRTEHPYRLPTEAECEKAARG-IGGYKYPWGD-------- 999
Query: 231 RANVWQGEFPTN--NTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHH- 285
EF N N++ G T+PV + + K +G ++M GNVWEW +DW+N ++
Sbjct: 1000 -------EFDKNLCNSSESGLYHTSPVGIFPKGKSPYGCFDMAGNVWEWCSDWYNDKYYG 1052
Query: 286 PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+P NPKGP++G D+V +GGS+L +YC R A R + P + +LGFR
Sbjct: 1053 NSPDRNPKGPSSGADRVFRGGSWLHGARYC---RSAYRDLSVPRDRSSDLGFRL 1103
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 51/163 (31%), Positives = 74/163 (45%), Gaps = 33/163 (20%)
Query: 11 RYKDMVLLPGDTFRMGTNKPILIKDG-----EFPSRNVTLDAFYLDQHEVSNTQFQEFVS 65
+ +MVL+ G F G+++ D E P R + +D F + ++ V+N +F+EFV
Sbjct: 856 KVDNMVLVKGGKFMRGSSE----DDADAYPEEKPQREIYIDDFMIGKYPVTNEEFKEFVD 911
Query: 66 ATGYVTEAEKFGDTFVFEPLLSEEERAKISQVR--HDMKRFEGLDSTIEHRMHHPVVHIS 123
GY E F E+ KI + R HD +++ G + PVV I
Sbjct: 912 DGGYDREE------FWTREGWRWREKNKIFKPRYWHD-RKWNG--------SNFPVVGII 956
Query: 124 WNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWG 160
W +A AY W RLPTEAE E R G+ +PWG
Sbjct: 957 WYEAEAYANWLSKRTEHPYRLPTEAECEKAAR-GIGGYKYPWG 998
>gi|340617143|ref|YP_004735596.1| gliding motility membrane lipoprotein [Zobellia galactanivorans]
gi|339731940|emb|CAZ95208.1| Gliding motility membrane lipoprotein [Zobellia galactanivorans]
Length = 451
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 108/410 (26%), Positives = 168/410 (40%), Gaps = 91/410 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
M L+P +F MG ++ K P++ VT+ +FY+D E++N+++++F ++ T
Sbjct: 42 MELIPRGSFIMGKSEEDQAKVLNAPTKTVTVRSFYMDDTEITNSEYRQFVEWVQDSITRT 101
Query: 68 GYVTEAEKFG----DTFVFEPLLSEEERAKISQV-RHDMKRFEGLDST-IEHRMHHPVVH 121
A++ G D + E + + K+S ++ + + G+ T E R +
Sbjct: 102 RLAILADELGLGPEDGGIGEYAFKDADTTKLSAYDKYMLDNYAGMGETGFEGRALNKEED 161
Query: 122 ISWN-----DAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGI-------- 168
+ W+ D +P E + G R +L SW+ E
Sbjct: 162 LIWDISEYPDEYYTEVMDSLYIPEEESYN-GQRTIDVTKLKYKYSWMDIEAAARSRTGSR 220
Query: 169 -------------DSTI-----EHRMN---------------HPVVHVSWNDAVAYCTWR 195
D+T+ E+ N +PVV VSW A A+C WR
Sbjct: 221 KDFIRQEELEIYPDTTVWIRDFEYSYNEPMHNDYFWHDAYSEYPVVGVSWQQARAFCNWR 280
Query: 196 GA-----------------RLPTEAEWEYGCRGGLENRLFPWGN--NLTPRGEHRANV-- 234
RLPTEAEWEY RGG+E +PWG ++ G AN
Sbjct: 281 TKFKNDDQKSKGKQFVNQFRLPTEAEWEYAARGGIEGGTYPWGGPYVISDTGCFMANFKP 340
Query: 235 WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPK 293
+G++ AAD L T SY+ N F LYNM GNV EWT ++ + + NP
Sbjct: 341 QRGDY-----AADAALYTVEAKSYEPNDFNLYNMAGNVSEWTNSSYDPSSYEFVSTMNPN 395
Query: 294 -GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
G KV +GGS+ + Y + + R DS+ +GFR D
Sbjct: 396 GGDGANARKVIRGGSW---KDVAYFLQVSTRDYEYADSARSYIGFRTVQD 442
>gi|326319127|ref|YP_004236799.1| sulfatase-modifying factor protein [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323375963|gb|ADX48232.1| Sulphatase-modifying factor protein [Acidovorax avenae subsp.
avenae ATCC 19860]
Length = 339
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 70/182 (38%), Positives = 92/182 (50%), Gaps = 28/182 (15%)
Query: 177 NHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
+HPVV+V+W+DA A W A RLPTEAEWEY C G + R + GN+ H
Sbjct: 155 DHPVVNVTWHDAHALAAWLSATEGHRYRLPTEAEWEYACLAGRQAR-YGHGNDPAALARH 213
Query: 231 RANVWQGEFP-------TNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVH 283
Q P DG+ TAPV SY N FGL++M GN WEW +DW
Sbjct: 214 ANTFDQDAAPYWPRWQAQAVPGHDGHAFTAPVASYPPNAFGLHDMNGNAWEWVSDW---- 269
Query: 284 HHPAPSY------NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGF 337
H +Y +P+GP TGT +V++GGS+ Y RC R+ NTP++ +G
Sbjct: 270 -HADDAYARSVREDPQGPATGTVRVRRGGSWHTWPLYA---RCTYRNWNTPETRYTLVGM 325
Query: 338 RC 339
R
Sbjct: 326 RL 327
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 53/164 (32%), Positives = 75/164 (45%), Gaps = 23/164 (14%)
Query: 15 MVLLPGDTFRMGTNKPI---------------LIKDGEFPSRNVTLD-AFYLDQHEVSNT 58
V +P +FRMG+++P+ + D E P V + AFY+ Q EV+
Sbjct: 37 FVRVPAGSFRMGSDEPVERLARDYPGMEARRLAVLDDEAPVHRVRITRAFYMGQTEVTVG 96
Query: 59 QFQEFVSATGYVTEAEKFGD-TFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHH 117
QF+ FV A+GY E+E G + + P + R+ D H
Sbjct: 97 QFRRFVLASGYRAESEADGTGGYGYNPAYDPATTVRGDAFEGRSPRYSWRDPGFAQGDDH 156
Query: 118 PVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENR 155
PVV+++W+DA A W A RLPTEAEWEY C G + R
Sbjct: 157 PVVNVTWHDAHALAAWLSATEGHRYRLPTEAEWEYACLAGRQAR 200
>gi|387792441|ref|YP_006257506.1| hypothetical protein Solca_3323 [Solitalea canadensis DSM 3403]
gi|379655274|gb|AFD08330.1| hypothetical protein Solca_3323 [Solitalea canadensis DSM 3403]
Length = 422
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 102/379 (26%), Positives = 152/379 (40%), Gaps = 63/379 (16%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P TF MG + +R VT+ FY+D+ E+SN ++++FV A+
Sbjct: 45 MVYIPAGTFIMGQGDQDVTASRVSQNRQVTIAPFYMDETEISNNEYRQFVLWVRDSIAAK 104
Query: 75 KFGDTFVF------EPLLSEEERAKISQVRHDMKRFEGL----DSTIEHRMHHPVVHISW 124
G FV E + +++ S+ + + +G+ D I R V + +
Sbjct: 105 TMGGNFVVKAEDGTEFINPKQKVTYTSKNNPNAETLKGMYYQGDDKITGRNDFDVRQLVY 164
Query: 125 NDAVAYCTWRGARLPTEAEWEYGCRGGLENR----LFP----WGSWLHPEGIDSTIEHRM 176
TW R + R R ++P W + + +E
Sbjct: 165 Q-----YTWFDIRSAALKQNRTKPRSSFIKREKIAIYPDTLVWLADFSYAQNEPMVEQYF 219
Query: 177 NHP------VVHVSWNDAVAYCTWRG-----------------ARLPTEAEWEYGCRGGL 213
+HP VV V+W A A+C WR LPTEA++EY RGG
Sbjct: 220 SHPAFDDYPVVGVTWKQANAFCAWRTEFNNTYRDKMKKPRRGVVSLPTEAQFEYAARGGR 279
Query: 214 ENRLFPWGNNLT--PRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGN 271
+PWG +G AN G N A DG T V SY N +GLYNM GN
Sbjct: 280 LGADYPWGGPYIRNSKGCILANFKPGR---GNYADDGGFYTVHVKSYFPNDYGLYNMAGN 336
Query: 272 VWEWTADWWN-----VHHHPAPSY----NPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAA 322
V EWT+ ++ H P+Y N P KV +GGS+ + + +
Sbjct: 337 VSEWTSSTYDESSYTFVHDMNPTYTSEVNETTPMAQRRKVVRGGSW---KDIGFFLQNGT 393
Query: 323 RSQNTPDSSAGNLGFRCAA 341
R+ D++ +GFRC A
Sbjct: 394 RAYEYLDTAKSYIGFRCVA 412
>gi|302039446|ref|YP_003799768.1| hypothetical protein NIDE4175 [Candidatus Nitrospira defluvii]
gi|300607510|emb|CBK43843.1| conserved exported protein of unknown function, Sulfatase-modifying
factor-like [Candidatus Nitrospira defluvii]
Length = 279
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 70/180 (38%), Positives = 93/180 (51%), Gaps = 22/180 (12%)
Query: 167 GIDSTIEHRMNHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTP 226
GI +++ R PV++V W DA AYC W G RLP EAEWE +G + R FPWGN
Sbjct: 116 GIPASLADR---PVINVDWADADAYCKWAGRRLPREAEWEKAAKGNNDWR-FPWGNVEPT 171
Query: 227 RGEHRAN-VWQGEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNV- 282
G N W GE + PV SY++ K +G+Y+M GNVWEW DW++
Sbjct: 172 DGHLNFNQKWVGE-----------KTLMPVGSYEKGKSPYGVYDMAGNVWEWVNDWYDSR 220
Query: 283 HHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
++ +P NP GP +G KV +G + NE R R + P + GFRCA D
Sbjct: 221 YYEKSPDKNPPGPDSGEKKVIRGAGW-QNETPTV--RIFTRVDSDPTMRNESTGFRCAMD 277
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 79/151 (52%), Gaps = 30/151 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDG---EFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGY-V 70
+VL+P ++ MG P +DG E+P + + FY+D++EV+N ++ EFV AT + V
Sbjct: 43 LVLIPSGSYPMGV--PTGDRDGGRDEYPRHVIEITDFYIDKYEVTNARYLEFVKATNHRV 100
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
+ K ++E + G+ +++ R PV+++ W DA AY
Sbjct: 101 PQNPKNPTRNLWEGI--------------------GIPASLADR---PVINVDWADADAY 137
Query: 131 CTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
C W G RLP EAEWE +G + R FPWG+
Sbjct: 138 CKWAGRRLPREAEWEKAAKGNNDWR-FPWGN 167
>gi|124008148|ref|ZP_01692846.1| GldK [Microscilla marina ATCC 23134]
gi|123986396|gb|EAY26209.1| GldK [Microscilla marina ATCC 23134]
Length = 398
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 147/359 (40%), Gaps = 73/359 (20%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV++P TF MG + ++ +T+ AF++DQ EV+N Q++ F
Sbjct: 64 MVVVPNGTFHMGQADEDVPATQINLNKQITISAFFMDQTEVTNNQYRLFT---------- 113
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
D + SE +A+ ++ + + EG+ + + PV D++
Sbjct: 114 ---DVLL---AYSEGRQAQGEELPQEYQ--EGVMDKLFLKNGTPVRP----DSIKVMN-- 159
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW 194
P W + + L + ++HP D +PVV V W A +C W
Sbjct: 160 -EMYPDTTRWSTEYAHHMGDPLMEY-YFMHPAFDD--------YPVVGVDWQAANFFCQW 209
Query: 195 RGA-------------------RLPTEAEWEYGCRGGLENRLFPWGNNL--TPRGEHRAN 233
R RLPTEAEWEY RGG + +PWG+ G AN
Sbjct: 210 RTMHLNYYRHTVEGDPYPAPRFRLPTEAEWEYASRGGRDMAKYPWGSPYLRNTLGCMLAN 269
Query: 234 VWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPA-----P 288
G N DG+ T PV Y N +GLY+M GNV EW D + P P
Sbjct: 270 FKPGR---GNYYDDGFAYTGPVAQYFPNDYGLYDMAGNVAEWVLDAYFESSVPVVWDLNP 326
Query: 289 SYNPKGPTTGTD-------KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
YNP G + +V +GGS+ YC AR+ DSS+ ++GFRCA
Sbjct: 327 VYNPPITKAGKNDVNINNRRVIRGGSWKDIAYYC---ETGARTFQHWDSSSTSIGFRCA 382
>gi|332661979|ref|YP_004451448.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332337476|gb|AEE54575.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 591
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 136/324 (41%), Gaps = 83/324 (25%)
Query: 54 EVSNTQFQEFVSATGYVTEAEKFGDTFV--FEPLLSEEERAKISQVRHD-MKRFEGLDST 110
E + +F SA+ YV EA++ D +P K S + D + +G
Sbjct: 311 EAALLRFTRENSASPYVAEAQRLIDQLQSNIKPPAESLVPPKPSIIVPDHLVLVKGGTFQ 370
Query: 111 IEHRMHHPVV---------HISWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGS 161
+ HPV +++++ A+C +LP + +W G R
Sbjct: 371 LGEDKTHPVTLSDFLIAKNQLTFDEYDAFCKATDRKLPDDRKWGRGKR------------ 418
Query: 162 WLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWR------------------------GA 197
PV++V+W DA+ YC WR G
Sbjct: 419 -----------------PVIYVNWFDAIDYCNWRSQQEGLSQVYQVNKPQVNPNWQANGY 461
Query: 198 RLPTEAEWEYGCRGGLENRLFPW-GNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVM 256
RLPTEAEWEY RGG+ ++ F + G+N NV + + N+ + T PV
Sbjct: 462 RLPTEAEWEYAARGGVSSQGFTYAGSN---------NVGEVAWHDTNSGS----KTQPVG 508
Query: 257 SYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCY 316
K N+ GLY++ GNVWEW D W V + S +PKGP TG+ +V +GGS+ + YC
Sbjct: 509 QKKTNELGLYDLSGNVWEWCWD-WRVAYPSNASNDPKGPDTGSYRVLRGGSWSYSAGYC- 566
Query: 317 RHRCAARSQNTPDSSAGNLGFRCA 340
R A R N P + GFR A
Sbjct: 567 --RIALRFSNDPYDAYNYCGFRLA 588
>gi|196228313|ref|ZP_03127180.1| protein of unknown function DUF323 [Chthoniobacter flavus Ellin428]
gi|196227716|gb|EDY22219.1| protein of unknown function DUF323 [Chthoniobacter flavus Ellin428]
Length = 235
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 64/172 (37%), Positives = 92/172 (53%), Gaps = 14/172 (8%)
Query: 177 NHPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
NHPVV+V+ +A +C W R RLPTE EWEY R G++ R FPWG L
Sbjct: 63 NHPVVYVNSREAEKFCQWLSTKEGRKYRLPTEPEWEYAAR-GMDGRTFPWGERLDAGNLA 121
Query: 231 RANVWQGEFP-TNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWWNVHHHPA 287
Q F + DG+ +APV S+ + FG+ +M GNV+EW D++ V+
Sbjct: 122 NFADKQTSFAWRDPNIDDGFPESAPVGSFPRGASPFGIEDMAGNVFEWCLDYFEVYRG-K 180
Query: 288 PSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
NP+GP+ GT ++ +GGS+ + R AR+ N PD S+ ++GFR
Sbjct: 181 DRVNPRGPSNGTKRIYRGGSW---KSRATNLRTTARNHNLPDYSSNDVGFRV 229
Score = 58.5 bits (140), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 63/154 (40%), Gaps = 52/154 (33%)
Query: 16 VLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEK 75
VLL G T R + E P VTL F+L + ++N Q+++F A
Sbjct: 8 VLLMGSTARDASPH-------EQPITQVTLSCFFLSRFPITNAQYEKFDPA--------- 51
Query: 76 FGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW-- 133
H MKR + +HPVV+++ +A +C W
Sbjct: 52 -----------------------HKMKR------ALNAGDNHPVVYVNSREAEKFCQWLS 82
Query: 134 ----RGARLPTEAEWEYGCRGGLENRLFPWGSWL 163
R RLPTE EWEY R G++ R FPWG L
Sbjct: 83 TKEGRKYRLPTEPEWEYAAR-GMDGRTFPWGERL 115
>gi|408370220|ref|ZP_11167998.1| gliding motility protein gldk [Galbibacter sp. ck-I2-15]
gi|407744298|gb|EKF55867.1| gliding motility protein gldk [Galbibacter sp. ck-I2-15]
Length = 451
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 106/419 (25%), Positives = 175/419 (41%), Gaps = 94/419 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEF-------VSAT 67
M L+PG F MG + + P++ VT+ +FY+D+ E++N+++++F + T
Sbjct: 41 MSLVPGGAFIMGRSDEDMADVLNAPTKTVTVRSFYMDETEITNSEYRQFVEWVKDSIVRT 100
Query: 68 GYVTEAEKFGDT----FVFEPLLSEEERAKISQV-RHDMKRFEGLDST-IEHRMHHPVVH 121
A++ G+ + E + + + +S+ ++ + + G+ T E R + V
Sbjct: 101 KLAVLADELGEVPGTGGIGEFAFKDADTSNLSEYDKYMLFNYAGMGETGYEGRSLNKDVD 160
Query: 122 ISWN-------------DAVAYCT---WRGARLPTEAEWEYG-----------CRGG--- 151
+ W+ D+V T + G R E +Y +GG
Sbjct: 161 LIWDVTEYPDEYYAEVMDSVYLSTEESYNGQRTIDVKELKYKYSWMDIQAAARAKGGGSR 220
Query: 152 ------------------LENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCT 193
+++ + + +H + S E ++PVV V+W A A+C
Sbjct: 221 LDFIKHEELNIYPDTTVWIKDFAYSYNEPMHNDYFWS--EAYSDYPVVGVTWKQAKAFCA 278
Query: 194 WRGA-----------------RLPTEAEWEYGCRGGLENRLFPWGN--NLTPRGEHRANV 234
WR RLPTEAEWEY RGG+++ +PWG L RG AN
Sbjct: 279 WRTKYKNDYQKENGKQFVNHFRLPTEAEWEYAARGGIQSGTYPWGGPYILNDRGCFMANF 338
Query: 235 --WQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNP 292
+G++ AAD L T SY N + LYNM GNV EWT ++ + + S
Sbjct: 339 KPLRGDY-----AADQALYTVEAKSYDPNDYNLYNMAGNVSEWTNSSYDPNSYEYVSTMN 393
Query: 293 KGPTTGTD--KVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGT 349
G + KV +GGS+ + Y + + R DS+ +GFR D T T
Sbjct: 394 PNTYDGENKRKVIRGGSW---KDVAYFLQVSTRDYEYQDSARSYIGFRTVQDYMGTEST 449
>gi|91202283|emb|CAJ75343.1| hypothetical protein kuste4581 [Candidatus Kuenenia
stuttgartiensis]
Length = 476
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/167 (37%), Positives = 84/167 (50%), Gaps = 20/167 (11%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
++PV V W DA AY W+G RLPTEAEWE R GL+ R FPWGN R N+
Sbjct: 324 DYPVARVDWYDAYAYAAWKGKRLPTEAEWEKAAR-GLDVRAFPWGNEW---DHTRCNLSG 379
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWW-NVHHHPAPSYNPKGP 295
P G + + M +G Y+M G+V+EW DW+ + ++ +PS NP GP
Sbjct: 380 EPMPV------GSVESGKSM------YGCYDMSGSVFEWCNDWFSSTYYQYSPSINPTGP 427
Query: 296 TTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAAD 342
G K+ +GGS ++ R R PD +GFRCA D
Sbjct: 428 EKGERKLIRGGSRFSRP---FQVRVTERKSERPDLFNMAIGFRCAKD 471
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 78/152 (51%), Gaps = 21/152 (13%)
Query: 14 DMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
D VL+P F MGTN+ +D P V LDA+ +D++E++N Q+ EFV Y+ E
Sbjct: 241 DTVLIPAGKFSMGTNEA---RDVSQPEHVVYLDAYEIDRYEITNAQYWEFVK---YIEET 294
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
F EP+ + H + +E + +PV + W DA AY W
Sbjct: 295 NDHSKCFKDEPIGKD----------HKPRYWEDEYYNVPD---YPVARVDWYDAYAYAAW 341
Query: 134 RGARLPTEAEWEYGCRGGLENRLFPWGS-WLH 164
+G RLPTEAEWE R GL+ R FPWG+ W H
Sbjct: 342 KGKRLPTEAEWEKAAR-GLDVRAFPWGNEWDH 372
>gi|332668391|ref|YP_004451179.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332337205|gb|AEE54306.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 686
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/174 (39%), Positives = 92/174 (52%), Gaps = 20/174 (11%)
Query: 177 NHPVVHVSWNDAVAYCTWRG------ARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEH 230
HPV+HVSWNDA AYC W RLP EAEWEY CR G F G NLT
Sbjct: 516 QHPVIHVSWNDARAYCDWLEQRLSLPIRLPREAEWEYACRAGTTTP-FNTGANLTTV--- 571
Query: 231 RANVWQGEFPTNNTAADGYLS-TAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHH---- 285
+AN + G +P + YL T+P+ Y N +GLY+M GNV EW DW++ ++
Sbjct: 572 QAN-YDGNYPYADFPKGTYLGRTSPIGLYPPNAWGLYDMHGNVLEWCEDWYSETYYNECK 630
Query: 286 -PAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFR 338
+P+G + G+ +V +GGS+L ++C R A R N + +GFR
Sbjct: 631 KQGVVADPEGSSAGSARVLRGGSWLSFARHC---RTAYRFWNGSVYRSAGVGFR 681
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 74/146 (50%), Gaps = 15/146 (10%)
Query: 13 KDMVLLPGDTFRMGT-NKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+D VL+ G F MG ++ + E P V L FYL ++ + QF+EF+ ATGY T
Sbjct: 420 QDFVLIKGGEFMMGAPDEEADARADEKPQHQVRLTDFYLSKYPTTLAQFEEFILATGYQT 479
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
+A++ G S+ K +++ + + + HPV+H+SWNDA AYC
Sbjct: 480 DADQSGG--------SDFWNGKNWELQAGVNWTYDVHGAQQKDKQHPVIHVSWNDARAYC 531
Query: 132 TWRG------ARLPTEAEWEYGCRGG 151
W RLP EAEWEY CR G
Sbjct: 532 DWLEQRLSLPIRLPREAEWEYACRAG 557
>gi|451980491|ref|ZP_21928880.1| conserved exported hypothetical protein, DUF323 [Nitrospina
gracilis 3/211]
gi|451762227|emb|CCQ90112.1| conserved exported hypothetical protein, DUF323 [Nitrospina
gracilis 3/211]
Length = 287
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 67/167 (40%), Positives = 89/167 (53%), Gaps = 17/167 (10%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
PVV ++W DAV YC W G RLPTEAEWE R G + R+FPWG + PR ++ AN
Sbjct: 122 KRPVVQITWYDAVDYCLWAGKRLPTEAEWEKAAR-GTDGRMFPWGGD--PRLQNAANF-A 177
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHHP-APSYNPK 293
+ NT PV S +N +G+ +M GNV EW ADW++ ++ P NP+
Sbjct: 178 HNWEDRNT-------LWPVGSQPQNASPYGVLDMAGNVREWVADWYSPDYYRWGPVRNPQ 230
Query: 294 GPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
GP G KV +GGS+ + R AAR + +GFRCA
Sbjct: 231 GPGQGILKVIRGGSW---HSFAADIRTAARGKGGFALKTDGIGFRCA 274
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 69/146 (47%), Gaps = 29/146 (19%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV + F MG N +D E P R V +D FY+D EV+NT++ ++ +
Sbjct: 49 MVWVQEGPFLMG-NVEGSGRDDERPQREVFVDGFYIDSVEVTNTRYLAYIKSAHRKEPPN 107
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+GD PL E++ K PVV I+W DAV YC W
Sbjct: 108 PYGDG----PLSGEKDIGK-----------------------RPVVQITWYDAVDYCLWA 140
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWG 160
G RLPTEAEWE R G + R+FPWG
Sbjct: 141 GKRLPTEAEWEKAAR-GTDGRMFPWG 165
>gi|302035580|ref|YP_003795902.1| hypothetical protein NIDE0192 [Candidatus Nitrospira defluvii]
gi|300603644|emb|CBK39975.1| conserved exported protein of unknown function, DUF323 [Candidatus
Nitrospira defluvii]
Length = 320
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 71/190 (37%), Positives = 101/190 (53%), Gaps = 29/190 (15%)
Query: 178 HPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQG 237
PVV VSW DA A+C W G RLPTEAEWE +G + +PWG++L P+ +AN Q
Sbjct: 100 QPVVGVSWTDANAFCKWDGKRLPTEAEWEKAAKGPEGDNHYPWGHHLDPK---KANYGQ- 155
Query: 238 EFPTNNTAADGYLSTAPVMSYKE--NKFGLYNMVGNVWEWTADWWN-VHHHPAPSYNPKG 294
T PV SY E + +G+YNM GNV+EW DW++ ++ + + NP+G
Sbjct: 156 ----------NVGRTTPVDSYPEGVSGYGVYNMAGNVFEWVDDWYDPKYYRTSNALNPRG 205
Query: 295 PTTGTD-------KVKKGGSYLCNEQYCY-RHRCAARSQNTPDSSAGNLGFRCAADKGPT 346
G + KV +GGS+L E + HR + +N +S LGFRCA K +
Sbjct: 206 ADKGYNFANQGPVKVLRGGSWLAPETSLHTSHRFWNQPEN--NSYGVGLGFRCA--KSAS 261
Query: 347 TGTDKVKKGG 356
+D+ + G
Sbjct: 262 VVSDEAAQAG 271
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 77/155 (49%), Gaps = 37/155 (23%)
Query: 12 YKDMVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVT 71
+KDMVL+P F MG+N+ + V LDA+ +D+ E SN +++EF+ ATG+
Sbjct: 32 HKDMVLIPSGEFTMGSNE-----HSDEVRHQVVLDAYLIDKFEASNARYKEFMRATGHPA 86
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
A + +P LS+ E+ PVV +SW DA A+C
Sbjct: 87 PA------YWDDPRLSKPEQ--------------------------PVVGVSWTDANAFC 114
Query: 132 TWRGARLPTEAEWEYGCRGGLENRLFPWGSWLHPE 166
W G RLPTEAEWE +G + +PWG L P+
Sbjct: 115 KWDGKRLPTEAEWEKAAKGPEGDNHYPWGHHLDPK 149
>gi|257093415|ref|YP_003167056.1| peptidase C14 caspase catalytic subunit p20 [Candidatus
Accumulibacter phosphatis clade IIA str. UW-1]
gi|257045939|gb|ACV35127.1| peptidase C14 caspase catalytic subunit p20 [Candidatus
Accumulibacter phosphatis clade IIA str. UW-1]
Length = 776
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/312 (31%), Positives = 123/312 (39%), Gaps = 62/312 (19%)
Query: 39 PSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAEKFGDTFVFEPLLSEEERAKISQVR 98
P NV + F L +EV+ QF++FV AT Y TEAE+ G + S E A
Sbjct: 513 PQHNVKVANFELAYYEVTVGQFRKFVEATAYQTEAERVGRSPGCHVARSGGEFAVERSAT 572
Query: 99 HDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW-----RGARLPTEAEWEYGCRGGLE 153
F DS HPVV +SWND AY W G RLP+EAEWEY R G
Sbjct: 573 WRAPGFAQTDS-------HPVVCVSWNDTQAYVAWLNGAAAGYRLPSEAEWEYAARAGT- 624
Query: 154 NRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDA---VAYCTWRGARLPTEAEWEYGCR 210
V W+D A G E CR
Sbjct: 625 --------------------------VTPRPWSDTGNFFARMFQSGKGTDPEQPPSRTCR 658
Query: 211 GGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVG 270
N+ +A W P + +DG + + N F LY+M+G
Sbjct: 659 NA----------NVADESLKKALAW----PVTHNCSDGQAYASSGGYFGRNGFSLYDMIG 704
Query: 271 NVWEWTADWWNVHHHPAPSYNPKGPTTG--TDKVKKGGSYLCNEQYCYRHRCAARSQNTP 328
NV EWT D WN H AP + K +G + +V +GGS+ Y R AAR +N
Sbjct: 705 NVSEWTQDCWNPSHAGAP-LDGKARLSGDCSQRVIRGGSWSSGPVYA---RSAARQKNPA 760
Query: 329 DSSAGNLGFRCA 340
A +LGFR A
Sbjct: 761 SYRASDLGFRLA 772
>gi|423013771|ref|ZP_17004492.1| hypothetical protein AXXA_04948 [Achromobacter xylosoxidans AXX-A]
gi|338783265|gb|EGP47633.1| hypothetical protein AXXA_04948 [Achromobacter xylosoxidans AXX-A]
Length = 328
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 91/176 (51%), Gaps = 18/176 (10%)
Query: 178 HPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGGLENRLFPWGNNLTPRGE-- 229
HPVV+VSWNDA A W RLPTEAEWEY R G R + G++ E
Sbjct: 152 HPVVNVSWNDATAMARWLSREEGHVYRLPTEAEWEYAARAGSRGR-YQNGDDPAALPEVA 210
Query: 230 -----HRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
A +W DG+ TAPV SY N FGL++M GN WEW +DW+ +
Sbjct: 211 NTFDADAAPLWPKWQAFAAARRDGHAFTAPVASYAPNAFGLHDMHGNAWEWVSDWYGEDY 270
Query: 285 HP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
+ +P +P+GP +G KV++GGS+ Y R A R+ NT ++ +GFR
Sbjct: 271 YAHSPQDDPQGPASGNVKVRRGGSWHTWALYV---RSAYRNWNTVETRYPLVGFRL 323
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 61/184 (33%), Positives = 82/184 (44%), Gaps = 33/184 (17%)
Query: 1 MVLLPAPPVERYKDMVL--LPGDTFRMGTNK--PILIKD-------------GEFPSRNV 43
M P P+E MV LP +F+MG+++ L +D E P V
Sbjct: 17 MAAEPPAPIENSLGMVFVPLPAGSFQMGSDETPEALARDYPQYPKERFAQLFDEAPVHTV 76
Query: 44 TLD-AFYLDQHEVSNTQFQEFVSATGYVTEAEK-----FGDTFVFEPLLSEEERAKISQV 97
+ F + + EV+ QF+ F+ A+GYV E+E +G ++P SE A +
Sbjct: 77 RITRPFLMARTEVTVGQFRRFLEASGYVPESEADGTGGYGYNAAYDPDASERGDAFEGRD 136
Query: 98 RHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGA------RLPTEAEWEYGCRGG 151
R R G T H PVV++SWNDA A W RLPTEAEWEY R G
Sbjct: 137 RRYSWRNPGFAQTDAH----PVVNVSWNDATAMARWLSREEGHVYRLPTEAEWEYAARAG 192
Query: 152 LENR 155
R
Sbjct: 193 SRGR 196
>gi|332666591|ref|YP_004449379.1| sulfatase-modifying factor protein [Haliscomenobacter hydrossis DSM
1100]
gi|332335405|gb|AEE52506.1| Sulphatase-modifying factor protein [Haliscomenobacter hydrossis
DSM 1100]
Length = 625
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/178 (39%), Positives = 90/178 (50%), Gaps = 34/178 (19%)
Query: 179 PVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGG--------LENRLFPWGNNL 224
PVV VSW DA AYC W RLPTEAEWEY RGG + + N++
Sbjct: 463 PVVGVSWEDAKAYCEWLSRKTGLKYRLPTEAEWEYAARGGGLGITQTNFQVTQYAGSNSI 522
Query: 225 TPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHH 284
H +N G P PV K N+ GLY+M GNVWEW ADW++ +
Sbjct: 523 DEVAWHSSN--SGSKP------------HPVGQKKPNRLGLYDMSGNVWEWCADWYDSSY 568
Query: 285 HP-APSYNPKGPTTGTDKVKKGGSYLCNEQYCYRH-RCAARSQNTPDSSAGNLGFRCA 340
+ +PS NP G TG+D+V +GGS+ Y R R A R + TP + ++GFR A
Sbjct: 569 YKNSPSTNPPGAITGSDRVCRGGSW----NYVPRDARMAGRGRYTPVNRIYDIGFRVA 622
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 53/181 (29%), Positives = 77/181 (42%), Gaps = 29/181 (16%)
Query: 15 MVLLPGDTFRMG-TNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEA 73
+VL+ G +F MG T++ + E P+ V L FYL ++E++ ++ +F T
Sbjct: 373 LVLVKGGSFTMGCTSEQQNCGEDEKPTHQVNLGDFYLSKNELTIGEYLKFADET------ 426
Query: 74 EKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW 133
+ L + + + D + +GL + PVV +SW DA AYC W
Sbjct: 427 -----RSHYPEWLEQGSKYNLETGSDDHYKNKGLS---RNATDLPVVGVSWEDAKAYCEW 478
Query: 134 RGA------RLPTEAEWEYGCRG---GLENRLFPWGSWLHPEGIDSTIEHRMN-----HP 179
RLPTEAEWEY RG G+ F + ID H N HP
Sbjct: 479 LSRKTGLKYRLPTEAEWEYAARGGGLGITQTNFQVTQYAGSNSIDEVAWHSSNSGSKPHP 538
Query: 180 V 180
V
Sbjct: 539 V 539
>gi|163753939|ref|ZP_02161062.1| lipoprotein, putative [Kordia algicida OT-1]
gi|161326153|gb|EDP97479.1| lipoprotein, putative [Kordia algicida OT-1]
Length = 454
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 114/424 (26%), Positives = 180/424 (42%), Gaps = 95/424 (22%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFV-------SAT 67
M L+PG +F MG + + + P++ VT+ +FY+D+ E++N++++EFV T
Sbjct: 41 MTLVPGGSFIMGKSDDDIAQTKNAPTKTVTVPSFYMDETEITNSEYREFVYWVRDSIIRT 100
Query: 68 GYVTEAEKFGDTF-----VFEPLLSEEERAKISQ-VRHDMKRFEGL-DSTIEHRMHHPVV 120
A++ G T + + + + A ++ +++++ + GL D+ E R +
Sbjct: 101 KLAILADEEGKTDPAEDGIAKYAFKDADTADLTAWEKYNLENYVGLGDTGYEGRRLNRDE 160
Query: 121 HISWNDAVA----YCTWRGARLPTEAEWEYGCR-----------------GGLENRLFPW 159
+ W+ + Y + L EAE G R +N+
Sbjct: 161 ELIWDYSEIPDEFYAEVMDSMLIPEAESYNGQRTIDWTKVRFQYTWMDIQAAAKNKGMKR 220
Query: 160 GSWLHPEGI----DSTI-----EHRMNHPVV-----HVSWND----------AVAYCTWR 195
++ E + D+T+ E+ N P+ H ++ND A A+C WR
Sbjct: 221 TEFIKRENLEIYPDTTVWIRDFEYSYNEPMHNDYFWHDAYNDYPVVGVTWKQAKAFCEWR 280
Query: 196 GA-----------------RLPTEAEWEYGCRGGLENRLFPWGN--NLTPRGEHRANVWQ 236
RLP+EAEWEY RGGLE +PWG + RG AN
Sbjct: 281 TLKKNIYRKERKRHMINKFRLPSEAEWEYAARGGLEGATYPWGGPYTMNDRGCFLANF-- 338
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHP-----APSYN 291
+ N AAD L T SY+ N F LYNM GNV EWTA ++ + +P+ N
Sbjct: 339 -KPLRGNYAADQALYTVEAESYEPNDFNLYNMAGNVAEWTASSYDPSSYEYMSTMSPNVN 397
Query: 292 PKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCAADKGPTTGTDK 351
T KV +GGS+ E + + + R DS+ +GFR GTD
Sbjct: 398 ---DTLNKRKVIRGGSWKDVEYFL---QVSTRDFEYRDSARSYIGFRTVQ---SYMGTDV 448
Query: 352 VKKG 355
V KG
Sbjct: 449 VGKG 452
>gi|218781314|ref|YP_002432632.1| hypothetical protein Dalk_3476 [Desulfatibacillum alkenivorans
AK-01]
gi|218762698|gb|ACL05164.1| protein of unknown function DUF323 [Desulfatibacillum alkenivorans
AK-01]
Length = 307
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 99/188 (52%), Gaps = 25/188 (13%)
Query: 164 HPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGAR--------LPTEAEWEYGCRGGLEN 215
HP +D+ PVV+VSW +A A+ W A+ LPTEAEWE+ CR G E
Sbjct: 127 HPLNMDA-------QPVVNVSWKNARAFAIWLSAQNQGQYSFWLPTEAEWEFACRAGTET 179
Query: 216 RLFPWGNNLTPRGEHR--ANVWQGEF-PTNNT--AADGYLSTAPVMSYKENKFGLYNMVG 270
+ WG++++ ++ A+ G++ P+ N DG+ +APV S+ N FGLY+M+G
Sbjct: 180 SRY-WGDDVSQTCQYANVADQTAGKYWPSWNVHPCNDGFAVSAPVGSFSPNAFGLYDMMG 238
Query: 271 NVWEWTADWWNVHHHPAPSYNPKGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDS 330
NVWE D W + P NP GP TG ++ +GGS+ + R A RS TP
Sbjct: 239 NVWEACQD-WKAPYRPGEQVNPAGPQTGKRRMVRGGSW---DNESRGIRSANRSYATPSF 294
Query: 331 SAGNLGFR 338
N GFR
Sbjct: 295 KRYNNGFR 302
Score = 62.0 bits (149), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 46/158 (29%), Positives = 67/158 (42%), Gaps = 48/158 (30%)
Query: 14 DMVLLPGDTFRMGTN---KPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYV 70
+ V +PG F MG+ P +D E P V LD F++ ++EV+N Q+ F
Sbjct: 64 EFVFIPGQCFMMGSTGKTDPDRYQD-EGPIHKVCLDGFWMGRYEVTNAQYHFFRPG---- 118
Query: 71 TEAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAY 130
HD +EG ++ + PVV++SW +A A+
Sbjct: 119 ----------------------------HDSGEYEGHPLNMDAQ---PVVNVSWKNARAF 147
Query: 131 CTWRGAR--------LPTEAEWEYGCRGGLENRLFPWG 160
W A+ LPTEAEWE+ CR G E + WG
Sbjct: 148 AIWLSAQNQGQYSFWLPTEAEWEFACRAGTETSRY-WG 184
>gi|389795065|ref|ZP_10198203.1| sulfatase-modifying factor protein [Rhodanobacter fulvus Jip2]
gi|388431516|gb|EIL88587.1| sulfatase-modifying factor protein [Rhodanobacter fulvus Jip2]
Length = 645
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 99/335 (29%), Positives = 146/335 (43%), Gaps = 76/335 (22%)
Query: 15 MVLLPGDTFRMGT--NKPILIKDGEFPSRNVTLD-AFYLDQHEVSNTQFQEFVSATGYVT 71
MV++P +F+MG+ ++P D E P VT+ F + + ++ QF+EFV A+GY
Sbjct: 373 MVVIPTGSFQMGSPASEPGH-NDAETPQHAVTIAMGFAMARSAITVAQFREFVRASGYRP 431
Query: 72 EAEKFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYC 131
++EK G V++ ER+ + D +E + + PVV++SWNDA AY
Sbjct: 432 DSEKLGGASVYD------ERSGTMRDDADAS-WEDDYAGRKAADDLPVVNVSWNDASAYA 484
Query: 132 TWRGA------RLPTEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSW 185
W G RL +EAE+EY RGG R + WG D T + + +
Sbjct: 485 AWLGERTGKIYRLASEAEFEYALRGGTTTRYW-WG--------DGTPQKTVEN------- 528
Query: 186 NDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTA 245
T G PT W + RG +R
Sbjct: 529 ------LTGSGDHSPTGRRWSHAFRG------------------YR-------------- 550
Query: 246 ADGYLSTAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPKGPTTG-TDKVKK 304
DGY APVM + N FGLY++ GNV EW AD W+ +++ AP G T +V +
Sbjct: 551 -DGYWGPAPVMKFAPNPFGLYDINGNVSEWVADCWHDNYNRAPVNGGAWINPGCTARVVR 609
Query: 305 GGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRC 339
GGS+ + + R A R +G +GFR
Sbjct: 610 GGSWGSSPD---QERSAYRQGVDATVRSGRVGFRV 641
>gi|359461177|ref|ZP_09249740.1| serine/threonine protein kinase [Acaryochloris sp. CCMEE 5410]
Length = 627
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 93/172 (54%), Gaps = 17/172 (9%)
Query: 178 HPVVHVSWNDAVAYCTW------RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHR 231
+PV +SW +A+ +C R RLP+EAEWEY CR G F +G L+ +
Sbjct: 461 YPVEQISWREAIEFCARLSQKAGREYRLPSEAEWEYACRAGTATP-FSYGPTLS---SQQ 516
Query: 232 ANVWQGEFPTNNTAADGYL-STAPVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSY 290
AN +QG+F N Y ST PV + N FGLY+M GNV+EW D W+ ++ AP
Sbjct: 517 AN-YQGQFVYGNGPKGVYRESTTPVGRFSPNNFGLYDMHGNVYEWCLDCWHKNYDGAPVD 575
Query: 291 NPKGPTT--GTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
TT G +V +GGS+ +YC R A RS+ TP++ ++GFR A
Sbjct: 576 GSPWMTTREGDPRVLRGGSWYNKPRYC---RAANRSRYTPNNRVNDIGFRVA 624
>gi|153873484|ref|ZP_02002053.1| protein of unknown function DUF323 [Beggiatoa sp. PS]
gi|152070051|gb|EDN67948.1| protein of unknown function DUF323 [Beggiatoa sp. PS]
Length = 442
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/312 (30%), Positives = 142/312 (45%), Gaps = 60/312 (19%)
Query: 50 LDQHEVSNTQFQEF--VSATGYVTEAEKFGDTFVFEPLLSE----EERAKISQVRHDMKR 103
+Q+ + +Q E + T ++ +++K VF LL++ E I + M
Sbjct: 166 FNQNTLPESQLSESHPLDITTFLPKSQKTIGK-VFSDLLTDGTEGPEMVMIPAGKFTMGD 224
Query: 104 FEGLDSTIEHRMHHPVVHISWNDAVAYCTWRGARLP-TEAEWEYGCRGGLENRLFP---- 158
+G E +H V+ ++ R P T AE++Y C+ NR P
Sbjct: 225 IQGTGEASEKPVHE----------VSIKSFAIGRYPVTFAEYDYFCKA--TNREKPDNRG 272
Query: 159 WGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTWRGA------RLPTEAEWEYGCRGG 212
WG R PV+++SW DAVAY W RLP+EA+WEY R G
Sbjct: 273 WG--------------RYKRPVMNISWKDAVAYTDWLSKQTGNTYRLPSEAQWEYAARAG 318
Query: 213 LENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTAPVMSYKENKFGLYNMVGNV 272
E + WGN++ G ++AN + + + +D + T+PV S+K N FGLY+ VGNV
Sbjct: 319 TETDYW-WGNDI---GINQANC----YLSGSQWSD--IQTSPVGSFKPNPFGLYDTVGNV 368
Query: 273 WEWTADWWNVHHHPAPSYNP---KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPD 329
WEW D W+ ++ AP+ T KV +GGS+ N R A R ++ PD
Sbjct: 369 WEWCVDSWHENYQNAPTDGSVWNNNKKTNEYKVLRGGSWSNNP---INARAANRYRHKPD 425
Query: 330 SSAGNLGFRCAA 341
LGFR A
Sbjct: 426 IRLNFLGFRVVA 437
>gi|451979721|ref|ZP_21928134.1| exported hypothetical protein [Nitrospina gracilis 3/211]
gi|451763090|emb|CCQ89331.1| exported hypothetical protein [Nitrospina gracilis 3/211]
Length = 835
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/168 (38%), Positives = 89/168 (52%), Gaps = 15/168 (8%)
Query: 177 NHPVVHVSWNDAVAYCTWRGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQ 236
N+P +VSW A AYC W+ RLPTEAEWE RG N LFPWGN+ + R N+
Sbjct: 642 NYPANNVSWFGAAAYCKWKDQRLPTEAEWEKAARGPNGN-LFPWGNDPISPQKSRYNLQW 700
Query: 237 GEFPTNNTAADGYLSTAPVMSYKENK--FGLYNMVGNVWEWTADWWNVHHHPAPSY--NP 292
E + PV S E K +G+++M+GNV EW DW++ ++ ++ NP
Sbjct: 701 NENIRHRV-------MVPVDSMPEGKSFYGVFHMLGNVKEWVDDWYDREYYKEENHKTNP 753
Query: 293 KGPTTGTDKVKKGGSYLCNEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
KG G KV KGGS+ + Y + R+ + P + + GFRCA
Sbjct: 754 KGQIGGEYKVLKGGSWRDLRSFVY---ASFRNNSYPAAMVDDYGFRCA 798
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 74/147 (50%), Gaps = 23/147 (15%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P F MG + D E P V LD +Y+D+HEVS +F EF++A + E +
Sbjct: 563 MVKIPQGFFIMGDDHH---GDDEAPKHKVYLDTYYIDKHEVSAAEFAEFLNA---MPETK 616
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTWR 134
+F F + E+R R ++ ++P ++SW A AYC W+
Sbjct: 617 RF---FGINKFGTIEQRGGNFIPRRGLE-------------NYPANNVSWFGAAAYCKWK 660
Query: 135 GARLPTEAEWEYGCRGGLENRLFPWGS 161
RLPTEAEWE RG N LFPWG+
Sbjct: 661 DQRLPTEAEWEKAARGPNGN-LFPWGN 686
>gi|153871211|ref|ZP_02000440.1| Protein of unknown function DUF323 [Beggiatoa sp. PS]
gi|152072325|gb|EDN69562.1| Protein of unknown function DUF323 [Beggiatoa sp. PS]
Length = 323
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 73/210 (34%), Positives = 98/210 (46%), Gaps = 27/210 (12%)
Query: 140 TEAEWEYGCRGGLENRLFPWGSWLHPEGIDSTIEHRMNHPVVHVSWNDAVAYCTW----- 194
TEAE E GC N W H + +HPVV +SWNDA+ Y W
Sbjct: 130 TEAEQENGCYIWEGN---DWKKHKHANWRNPYFSQNDDHPVVCISWNDAMVYVNWLSQQT 186
Query: 195 -RGARLPTEAEWEYGCRGGLENRLFPWGNNLTPRGEHRANVWQGEFPTNNTAADGYLSTA 253
+ RLP+EAEWEY R G E + + WGN+ W + DG+ TA
Sbjct: 187 GKNYRLPSEAEWEYAARAGTETQYW-WGNH-----------WDKTKANCSVKEDGFKYTA 234
Query: 254 PVMSYKENKFGLYNMVGNVWEWTADWWNVHHHPAPSYNPK---GPTTGTDKVKKGGSYLC 310
PV ++K N FGL +M+GNVWEW AD W+ ++ AP+ + GP +V +GG++
Sbjct: 235 PVNAFKSNAFGLDDMLGNVWEWMADPWHENYQNAPTDSRVWNFGPKKNKYRVLRGGAWPY 294
Query: 311 NEQYCYRHRCAARSQNTPDSSAGNLGFRCA 340
+C R A R GFR A
Sbjct: 295 APNFC---RTANRDWGNQTYRFYTFGFRVA 321
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 81/153 (52%), Gaps = 19/153 (12%)
Query: 15 MVLLPGDTFRMGTNKPILIKDGEFPSRNVTLDAFYLDQHEVSNTQFQEFVSATGYVTEAE 74
MV +P FRMG + I D E P V++ F + ++ V+ +F++FV ATGY TEAE
Sbjct: 75 MVWIPAGRFRMGDIQGIG-SDNEKPVHFVSVKRFAMGKNGVTVGEFRQFVKATGYQTEAE 133
Query: 75 KFGDTFVFEPLLSEEERAKISQVRHDMKRFEGLDSTIEHRMHHPVVHISWNDAVAYCTW- 133
+ +++E ++ ++ K + R + HPVV ISWNDA+ Y W
Sbjct: 134 QENGCYIWEG--NDWKKHKHANWR---------NPYFSQNDDHPVVCISWNDAMVYVNWL 182
Query: 134 -----RGARLPTEAEWEYGCRGGLENRLFPWGS 161
+ RLP+EAEWEY R G E + + WG+
Sbjct: 183 SQQTGKNYRLPSEAEWEYAARAGTETQYW-WGN 214
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.450
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,258,641,178
Number of Sequences: 23463169
Number of extensions: 332813634
Number of successful extensions: 635517
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 3620
Number of HSP's successfully gapped in prelim test: 1640
Number of HSP's that attempted gapping in prelim test: 610393
Number of HSP's gapped (non-prelim): 15146
length of query: 394
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 250
effective length of database: 8,980,499,031
effective search space: 2245124757750
effective search space used: 2245124757750
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)