BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023516
(281 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|449469369|ref|XP_004152393.1| PREDICTED: uncharacterized protein LOC101222761 [Cucumis sativus]
Length = 281
Score = 511 bits (1316), Expect = e-142, Method: Compositional matrix adjust.
Identities = 241/281 (85%), Positives = 270/281 (96%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTP QPYVYQSPQDA+IPFQILGGEAQV QIMLKPQEKV+ARPGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPIQPYVYQSPQDAMIPFQILGGEAQVLQIMLKPQEKVIARPGSMCFMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN+++PENEVG+W WLFGK++TS+VL+N G SDG+VGIAAPSLARILPID+AMFGGELL
Sbjct: 61 MENVFLPENEVGVWHWLFGKSVTSIVLQNSGTSDGYVGIAAPSLARILPIDMAMFGGELL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQPDAFLCSVNDVKV+NT+D RVRNV+ G+EGFLRQKL+GQGLAFIL GGSVVQKNLEVG
Sbjct: 121 CQPDAFLCSVNDVKVNNTIDHRVRNVVPGVEGFLRQKLSGQGLAFILGGGSVVQKNLEVG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EVI VD+SCIVA+T+S++VQIKYNGP+RRAVFGG+NLVTA +TGPGIVFIQSLPF RLSQ
Sbjct: 181 EVIAVDISCIVAMTTSIDVQIKYNGPVRRAVFGGENLVTATLTGPGIVFIQSLPFQRLSQ 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMRENPKFFVQ+A+FFFLAYVVIVSS+ILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFVQLALFFFLAYVVIVSSIILTDV 281
>gi|363808166|ref|NP_001242226.1| uncharacterized protein LOC100799063 [Glycine max]
gi|255639877|gb|ACU20231.1| unknown [Glycine max]
gi|255640131|gb|ACU20356.1| unknown [Glycine max]
Length = 281
Score = 506 bits (1303), Expect = e-141, Method: Compositional matrix adjust.
Identities = 242/281 (86%), Positives = 262/281 (93%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQSPQDA+IPFQILGGEAQV QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSPQDAIIPFQILGGEAQVVQIMLKPQEKIIAKPGSMCFMSGSVE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y+PENEVG+WQWLFGKTITS+VLRN GPSDGFVGIAAP ARILPIDLAMF GE+L
Sbjct: 61 MENAYLPENEVGIWQWLFGKTITSIVLRNSGPSDGFVGIAAPYFARILPIDLAMFNGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQPDAFLCSVNDVKVSN +DQR RN + EGFLRQKL+GQGLAFILAGGSVVQKNLE+G
Sbjct: 121 CQPDAFLCSVNDVKVSNIVDQRGRNAVASAEGFLRQKLSGQGLAFILAGGSVVQKNLEIG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EV+ VDVS IVAVTS+VN+QIKYNGP RR +FGGDN VTAV+TGPGIVFIQSLPFHR SQ
Sbjct: 181 EVLAVDVSSIVAVTSAVNIQIKYNGPARRTMFGGDNAVTAVLTGPGIVFIQSLPFHRFSQ 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMRENPKFF+QIA+FFFLA VVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFIQIAVFFFLACVVIVSSLILTDV 281
>gi|363808252|ref|NP_001241981.1| uncharacterized protein LOC100809243 [Glycine max]
gi|255636372|gb|ACU18525.1| unknown [Glycine max]
Length = 281
Score = 506 bits (1302), Expect = e-141, Method: Compositional matrix adjust.
Identities = 242/281 (86%), Positives = 262/281 (93%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQSPQDA+IPFQILGGEAQV QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSPQDAIIPFQILGGEAQVVQIMLKPQEKIIAKPGSMCFMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y+PENEVG+WQWLFGKTITS+VL N GPSDGFVGIAAP ARILPIDLAMF GE+L
Sbjct: 61 MENAYLPENEVGIWQWLFGKTITSIVLCNSGPSDGFVGIAAPYFARILPIDLAMFNGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQPDAFLCSVNDVKVSN +DQR RN + EGFLRQKL+GQGLAFILAGGSVVQKNLE+G
Sbjct: 121 CQPDAFLCSVNDVKVSNIVDQRGRNAVASAEGFLRQKLSGQGLAFILAGGSVVQKNLEIG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EV+ VDVS IVAVTS+VNVQIKYNGP RR +FGGDN VTA++TGP IVFIQSLPFHRLSQ
Sbjct: 181 EVLAVDVSSIVAVTSTVNVQIKYNGPARRTMFGGDNAVTALLTGPSIVFIQSLPFHRLSQ 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMRENPKFF+QIA+FFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFIQIAVFFFLAYVVIVSSLILTDV 281
>gi|224138636|ref|XP_002326652.1| predicted protein [Populus trichocarpa]
gi|118487466|gb|ABK95560.1| unknown [Populus trichocarpa]
gi|222833974|gb|EEE72451.1| predicted protein [Populus trichocarpa]
Length = 281
Score = 498 bits (1282), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/281 (85%), Positives = 260/281 (92%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQ+ QDAVIPFQI GGEAQV QIMLKPQEKV+A+PGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQNHQDAVIPFQIFGGEAQVVQIMLKPQEKVIAKPGSMCFMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN++I ENEVG+WQW FGK +TSVV NPGPSDGFVGI+APSL RILPIDLA FGGE+L
Sbjct: 61 MENVFIHENEVGVWQWFFGKAVTSVVFHNPGPSDGFVGISAPSLGRILPIDLAKFGGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQPDAFLCSV+DVKVSNT+DQR RNV+ IEGFLRQKLTGQGLAFILAGGSV QK LEVG
Sbjct: 121 CQPDAFLCSVSDVKVSNTVDQRARNVMPSIEGFLRQKLTGQGLAFILAGGSVAQKVLEVG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EV+ VDVSCI A+ ++VNVQIKYNGP+RRAVFGGDNLVTA +TGPGIVFIQSLPF R SQ
Sbjct: 181 EVLAVDVSCIAALNTTVNVQIKYNGPVRRAVFGGDNLVTATLTGPGIVFIQSLPFRRFSQ 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMR+NPKFFVQIAIFFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRDNPKFFVQIAIFFFLAYVVIVSSLILTDV 281
>gi|224074179|ref|XP_002304288.1| predicted protein [Populus trichocarpa]
gi|222841720|gb|EEE79267.1| predicted protein [Populus trichocarpa]
Length = 281
Score = 497 bits (1279), Expect = e-138, Method: Compositional matrix adjust.
Identities = 240/281 (85%), Positives = 258/281 (91%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS Q AVIPFQILGGEAQ+ QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSHQGAVIPFQILGGEAQMVQIMLKPQEKIIAKPGSMCFMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN+ IPENEVG+WQW FGK +TSV+ RNPGPSDGFVGIA PSLARILPIDLA FGGE+L
Sbjct: 61 MENVLIPENEVGVWQWFFGKAVTSVIFRNPGPSDGFVGIATPSLARILPIDLAKFGGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
C PDAFLCS+NDVKVSN LD R RNV+ IEGFLRQKLTGQGLAFILAGGSV QK LEVG
Sbjct: 121 CLPDAFLCSINDVKVSNALDHRARNVMPNIEGFLRQKLTGQGLAFILAGGSVAQKVLEVG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EV+ VDVSCIVA+ ++VNVQIKYNGP+RRAVFGGDNLVTA +TGPGIVFIQSLPF SQ
Sbjct: 181 EVLAVDVSCIVALNTTVNVQIKYNGPVRRAVFGGDNLVTATLTGPGIVFIQSLPFQLFSQ 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
>gi|357476813|ref|XP_003608692.1| hypothetical protein MTR_4g100740 [Medicago truncatula]
gi|355509747|gb|AES90889.1| hypothetical protein MTR_4g100740 [Medicago truncatula]
Length = 281
Score = 492 bits (1267), Expect = e-137, Method: Compositional matrix adjust.
Identities = 236/281 (83%), Positives = 260/281 (92%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS QDA+IPFQILGGE+QV QIMLKP EK++A+PGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y+PENEVG+WQWLFGKT+T++V+RN GPSDGFVGIAAP ARILPIDLA F GE+L
Sbjct: 61 MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQPDAFLCSVNDVKVSNT+DQR RNV+ G E FLRQKL+GQGLAFIL GGSVVQK LEVG
Sbjct: 121 CQPDAFLCSVNDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EV+ VDVSCIVAVTS+V++QIKYNGP RR +FGGDN VTAV+TGPGIVFIQSLPF R SQ
Sbjct: 181 EVLAVDVSCIVAVTSTVDIQIKYNGPARRTMFGGDNAVTAVLTGPGIVFIQSLPFPRFSQ 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMRENPKFF+QIA+FFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFIQIALFFFLAYVVIVSSLILTDV 281
>gi|388508380|gb|AFK42256.1| unknown [Medicago truncatula]
Length = 281
Score = 489 bits (1258), Expect = e-136, Method: Compositional matrix adjust.
Identities = 234/281 (83%), Positives = 260/281 (92%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS QDA+IPFQILGGE+QV QIMLKP EK++A+PGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y+PENEVG+WQWLFGKT+T++V+RN GPSDGFVGIAAP ARILPIDLA F GE+L
Sbjct: 61 MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQPDAFLCSVNDVKVS+T+DQR RNV+ G E FLRQKL+GQGLAFIL GGSVVQK LEVG
Sbjct: 121 CQPDAFLCSVNDVKVSDTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EV+ VDVSCIVAVTS+V++QIKYNGP RR +FGGDN VTAV+TGPGIVFIQSLPF R S+
Sbjct: 181 EVLAVDVSCIVAVTSTVDIQIKYNGPARRTMFGGDNAVTAVLTGPGIVFIQSLPFPRFSR 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMRENPKFF+QIA+FFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFIQIALFFFLAYVVIVSSLILTDV 281
>gi|388508964|gb|AFK42548.1| unknown [Lotus japonicus]
Length = 281
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 233/281 (82%), Positives = 260/281 (92%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQSPQD +IPFQILGGE+QV QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSPQDEIIPFQILGGESQVVQIMLKPQEKIIAKPGSMCFMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y+PENEVG QWLFGKTITS+VL N GPSDGFVGIAAP ARILPIDLAMF G +L
Sbjct: 61 MENAYLPENEVGFRQWLFGKTITSIVLCNSGPSDGFVGIAAPYFARILPIDLAMFNGGIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQPDAFLCSVNDVKVSNT DQR +NV+ G E FLRQKL+GQG+AFILAGGSVVQKNLE+G
Sbjct: 121 CQPDAFLCSVNDVKVSNTFDQRGQNVVAGAERFLRQKLSGQGVAFILAGGSVVQKNLEIG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EV+ VDVSCIVAVTS+VNVQIK NGP RR +FGGD++VTA++TGPGIVFIQSLPFHRLSQ
Sbjct: 181 EVLAVDVSCIVAVTSTVNVQIKNNGPARRTMFGGDHVVTALLTGPGIVFIQSLPFHRLSQ 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMRENPKF++QIA+FF +AYVVI+SSL+LTDV
Sbjct: 241 RIARAVTSPNMRENPKFYIQIAVFFLMAYVVILSSLLLTDV 281
>gi|388499702|gb|AFK37917.1| unknown [Lotus japonicus]
Length = 281
Score = 486 bits (1252), Expect = e-135, Method: Compositional matrix adjust.
Identities = 232/281 (82%), Positives = 259/281 (92%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQSPQD +IPFQILGGE+QV QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSPQDEIIPFQILGGESQVVQIMLKPQEKIIAKPGSMCFMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y+PENEVG QWLFGKTITS+VL N GPSDGFVGI AP ARILPIDLAMF G +L
Sbjct: 61 MENAYLPENEVGFRQWLFGKTITSIVLCNSGPSDGFVGIVAPYFARILPIDLAMFNGGIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQPDAFLCSVNDVKVSNT DQR +NV+ G E FLRQKL+GQG+AFILAGGSVVQKNLE+G
Sbjct: 121 CQPDAFLCSVNDVKVSNTFDQRGQNVVAGAERFLRQKLSGQGVAFILAGGSVVQKNLEIG 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EV+ VDVSCIVAVTS+VNVQIK NGP RR +FGGD++VTA++TGPGIVFIQSLPFHRLSQ
Sbjct: 181 EVLAVDVSCIVAVTSTVNVQIKNNGPARRTMFGGDHVVTALLTGPGIVFIQSLPFHRLSQ 240
Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIARAVTSPNMRENPKF++QIA+FF +AYVVI+SSL+LTDV
Sbjct: 241 RIARAVTSPNMRENPKFYIQIAVFFLMAYVVILSSLLLTDV 281
>gi|297794473|ref|XP_002865121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297310956|gb|EFH41380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 1054
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 215/282 (76%), Positives = 251/282 (89%), Gaps = 1/282 (0%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS QD + PFQILGGE+QV QIMLK +EKV+A+P SMC+MSGS+E
Sbjct: 773 MAAPFFSTPFQPYVYQSQQDTITPFQILGGESQVVQIMLKSEEKVIAKPASMCYMSGSIE 832
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y PE EVG+ QW+ GK+++SV+LRN G +DGFVGIAAP LARILPIDLAMFGGE+L
Sbjct: 833 MENTYTPEQEVGVLQWILGKSVSSVLLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEIL 892
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
CQPDAFLCSV+DVK+ N++DQR RN++ G EGFLRQ+L+GQGLAFILAGGSV+QK LEV
Sbjct: 893 CQPDAFLCSVHDVKIVNSVDQRARNIVAAGAEGFLRQRLSGQGLAFILAGGSVLQKVLEV 952
Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
GEV ++DVSCI A+T S++VQIK N P+RRAVFGGDN+V A +TGPGIVFIQSLPFHRLS
Sbjct: 953 GEVFSIDVSCIAALTPSIDVQIKNNAPLRRAVFGGDNVVMATLTGPGIVFIQSLPFHRLS 1012
Query: 240 QRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
QRIAR+VTSPNMRENPK +QIAIF FLAY VI+SSLILT+V
Sbjct: 1013 QRIARSVTSPNMRENPKLLIQIAIFAFLAYAVILSSLILTEV 1054
>gi|18414878|ref|NP_567527.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
thaliana]
gi|28393695|gb|AAO42260.1| unknown protein [Arabidopsis thaliana]
gi|28973435|gb|AAO64042.1| unknown protein [Arabidopsis thaliana]
gi|332658487|gb|AEE83887.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
thaliana]
Length = 285
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 219/285 (76%), Positives = 254/285 (89%), Gaps = 4/285 (1%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61 MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNV-IGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
CQPDAFLCSV+DVKV NT+ Q R RN+ G EG LRQ+L+GQGLAFI+AGGSVVQKNL
Sbjct: 121 CQPDAFLCSVHDVKVVNTVYQRHRARNIAAAGAEGVLRQRLSGQGLAFIIAGGSVVQKNL 180
Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
EVGEV+T+DVSCI A+T S+N QIKYN P+RRAVFGGDN+VTA +TGPGIVFIQSLPFH
Sbjct: 181 EVGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFH 240
Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RLSQRIAR+VTSPNMRENP+ VQI +F FLAYVVIVSSLILT++
Sbjct: 241 RLSQRIARSVTSPNMRENPRLLVQIGLFIFLAYVVIVSSLILTEM 285
>gi|22327636|ref|NP_199553.2| tryptophan RNA-binding attenuator-like protein [Arabidopsis
thaliana]
gi|19424009|gb|AAL87286.1| unknown protein [Arabidopsis thaliana]
gi|21280831|gb|AAM45111.1| unknown protein [Arabidopsis thaliana]
gi|332008124|gb|AED95507.1| tryptophan RNA-binding attenuator-like protein [Arabidopsis
thaliana]
Length = 282
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 216/282 (76%), Positives = 250/282 (88%), Gaps = 1/282 (0%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS QD + PFQILGGE+QV QIMLK +EKV+A+P SMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQQDTITPFQILGGESQVVQIMLKSEEKVIAKPASMCYMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y PE EVG+ QW+ GK+++SVVLRN G +DGFVGIAAP LARILPIDLAMFGGE+L
Sbjct: 61 MENTYTPEQEVGVLQWILGKSVSSVVLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
CQPDAFLCSV+DVKV N++DQR RN++ G EGFLRQ+L+GQGLAFILAGGSVVQK LEV
Sbjct: 121 CQPDAFLCSVHDVKVVNSVDQRARNIVAAGAEGFLRQRLSGQGLAFILAGGSVVQKVLEV 180
Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
GEV ++DVSCI A+T S++V+IK N P RRA+FGGDN+V A +TGPGIVFIQSLPFHRLS
Sbjct: 181 GEVFSIDVSCIAALTPSIDVRIKNNAPFRRALFGGDNVVMATLTGPGIVFIQSLPFHRLS 240
Query: 240 QRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
QRIAR+VTSPNMRENPK +QIAIF FLAY VI+SSLILT+V
Sbjct: 241 QRIARSVTSPNMRENPKLLIQIAIFAFLAYAVILSSLILTEV 282
>gi|297800372|ref|XP_002868070.1| hypothetical protein ARALYDRAFT_493136 [Arabidopsis lyrata subsp.
lyrata]
gi|297313906|gb|EFH44329.1| hypothetical protein ARALYDRAFT_493136 [Arabidopsis lyrata subsp.
lyrata]
Length = 285
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 219/285 (76%), Positives = 254/285 (89%), Gaps = 4/285 (1%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS++
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61 MDNNYTPEQEVGVVQWILGKSVSSIVLRNNGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
CQPDAFLCSV+DVKV N+L Q R RN+ G EGFLRQ+L+GQGLAFI+AGGSVVQKNL
Sbjct: 121 CQPDAFLCSVHDVKVVNSLYQRHRARNIAQAGAEGFLRQRLSGQGLAFIIAGGSVVQKNL 180
Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
EVGEV+T DVSCI A+T S+N QIKYN P+RRAVFGG+NLVTA +TGPGIVFIQSLPFH
Sbjct: 181 EVGEVLTTDVSCIAALTPSINFQIKYNAAPVRRAVFGGENLVTATLTGPGIVFIQSLPFH 240
Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RLSQRIAR+VTSPNMRENP+ VQI +F FLAYVVIVSSLILT++
Sbjct: 241 RLSQRIARSVTSPNMRENPRLLVQIGLFIFLAYVVIVSSLILTEM 285
>gi|21554803|gb|AAM63696.1| unknown [Arabidopsis thaliana]
Length = 285
Score = 452 bits (1163), Expect = e-125, Method: Compositional matrix adjust.
Identities = 217/285 (76%), Positives = 253/285 (88%), Gaps = 4/285 (1%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61 MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNV-IGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
CQPDAFLCSV+DVKV N++ Q R RN+ G EG LRQ+L+GQGLAFI+AGGSVVQKNL
Sbjct: 121 CQPDAFLCSVHDVKVVNSVYQRHRARNIAAAGAEGVLRQRLSGQGLAFIIAGGSVVQKNL 180
Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
EVGE +T+DVSCI A+T S+N QIKYN P+RRAVFGGDN+VTA +TGPGIVFIQSLPFH
Sbjct: 181 EVGEFLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFH 240
Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RLSQRIAR+VTSPNMRENP+ VQI +F FLAYVVIVSSLILT++
Sbjct: 241 RLSQRIARSVTSPNMRENPRLLVQIGLFIFLAYVVIVSSLILTEM 285
>gi|89257604|gb|ABD65092.1| hypothetical protein 31.t00054 [Brassica oleracea]
Length = 285
Score = 447 bits (1150), Expect = e-123, Method: Compositional matrix adjust.
Identities = 216/285 (75%), Positives = 251/285 (88%), Gaps = 4/285 (1%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS +D V PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTVTPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y PE EVG+ QW+ GK+++SV+LRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61 MENTYTPEQEVGVVQWVLGKSVSSVLLRNTGQTDGFVGIAAPSLARILPMDLAMFGGDIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNV-IGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
CQPDAFLCSV+DVKV N++ Q R RN+ G E FLRQ+L+GQGLAFI AGGSVVQKNL
Sbjct: 121 CQPDAFLCSVHDVKVVNSVYQRHRARNIAAAGAEVFLRQRLSGQGLAFITAGGSVVQKNL 180
Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
EVGEV+T+DVSCI A+T +N +I Y+ P+RRAVFGGDN+VTA +TGPGIVFIQSLPFH
Sbjct: 181 EVGEVLTIDVSCIAALTPPINFRINYHAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFH 240
Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RLSQRIAR+VTSPNMRENP+F VQI +F FLAYVVI SSLILT++
Sbjct: 241 RLSQRIARSVTSPNMRENPRFLVQIGLFLFLAYVVIASSLILTEM 285
>gi|359496492|ref|XP_002269493.2| PREDICTED: uncharacterized protein LOC100257929 [Vitis vinifera]
Length = 293
Score = 444 bits (1142), Expect = e-122, Method: Compositional matrix adjust.
Identities = 221/266 (83%), Positives = 249/266 (93%)
Query: 16 QSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ 75
+SPQ+ V PFQILGGEAQ+ QIMLKPQEK++A+PGSMC+MSGS++MEN+YIPENEVGMWQ
Sbjct: 28 RSPQENVTPFQILGGEAQIVQIMLKPQEKIIAKPGSMCYMSGSIQMENVYIPENEVGMWQ 87
Query: 76 WLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKV 135
WLFGK +TS+VL N G S+GFVGIAAPSLARILPIDLAMFGGE+LCQPDAFLCS+NDVKV
Sbjct: 88 WLFGKAVTSIVLLNQGSSNGFVGIAAPSLARILPIDLAMFGGEILCQPDAFLCSLNDVKV 147
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+NT DQR RN++ G EGFLRQKLTGQGLAFI+AGGSVVQKNLEVGE + VDVSCI A+++
Sbjct: 148 NNTTDQRARNIVVGAEGFLRQKLTGQGLAFIVAGGSVVQKNLEVGEELAVDVSCIAALSA 207
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENP 255
+V+VQIKYNGP+RR VFGGDNLVTAV+TGPGIVFIQSLPF R SQRIARAVTSPNMRENP
Sbjct: 208 TVSVQIKYNGPMRRVVFGGDNLVTAVLTGPGIVFIQSLPFQRFSQRIARAVTSPNMRENP 267
Query: 256 KFFVQIAIFFFLAYVVIVSSLILTDV 281
KFF+QIAIFFFLAYVVIVSSLILTDV
Sbjct: 268 KFFIQIAIFFFLAYVVIVSSLILTDV 293
>gi|296083496|emb|CBI23465.3| unnamed protein product [Vitis vinifera]
Length = 281
Score = 443 bits (1139), Expect = e-122, Method: Compositional matrix adjust.
Identities = 221/265 (83%), Positives = 248/265 (93%)
Query: 17 SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
SPQ+ V PFQILGGEAQ+ QIMLKPQEK++A+PGSMC+MSGS++MEN+YIPENEVGMWQW
Sbjct: 17 SPQENVTPFQILGGEAQIVQIMLKPQEKIIAKPGSMCYMSGSIQMENVYIPENEVGMWQW 76
Query: 77 LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
LFGK +TS+VL N G S+GFVGIAAPSLARILPIDLAMFGGE+LCQPDAFLCS+NDVKV+
Sbjct: 77 LFGKAVTSIVLLNQGSSNGFVGIAAPSLARILPIDLAMFGGEILCQPDAFLCSLNDVKVN 136
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
NT DQR RN++ G EGFLRQKLTGQGLAFI+AGGSVVQKNLEVGE + VDVSCI A++++
Sbjct: 137 NTTDQRARNIVVGAEGFLRQKLTGQGLAFIVAGGSVVQKNLEVGEELAVDVSCIAALSAT 196
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPK 256
V+VQIKYNGP+RR VFGGDNLVTAV+TGPGIVFIQSLPF R SQRIARAVTSPNMRENPK
Sbjct: 197 VSVQIKYNGPMRRVVFGGDNLVTAVLTGPGIVFIQSLPFQRFSQRIARAVTSPNMRENPK 256
Query: 257 FFVQIAIFFFLAYVVIVSSLILTDV 281
FF+QIAIFFFLAYVVIVSSLILTDV
Sbjct: 257 FFIQIAIFFFLAYVVIVSSLILTDV 281
>gi|255579647|ref|XP_002530664.1| conserved hypothetical protein [Ricinus communis]
gi|223529797|gb|EEF31733.1| conserved hypothetical protein [Ricinus communis]
Length = 290
Score = 414 bits (1063), Expect = e-113, Method: Compositional matrix adjust.
Identities = 194/238 (81%), Positives = 219/238 (92%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
S+ F + +S QDAVIPFQILGGEAQV QIMLKPQEKVVA+PGSMCFMSGS+EMEN ++
Sbjct: 47 SSKFSEVLKKSQQDAVIPFQILGGEAQVVQIMLKPQEKVVAKPGSMCFMSGSIEMENTFL 106
Query: 67 PENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
PENEVGMWQWLFGKT+TS+VLRN GPSDGF+GIAAPSLARILPIDLAMFGGE+LCQPDAF
Sbjct: 107 PENEVGMWQWLFGKTVTSIVLRNSGPSDGFIGIAAPSLARILPIDLAMFGGEILCQPDAF 166
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LCSVNDVKVS+T+D R RNV+ EGFLRQKL+GQGLAF++ GGSV+QKNLEVGEV++VD
Sbjct: 167 LCSVNDVKVSSTVDYRARNVMASAEGFLRQKLSGQGLAFVVGGGSVIQKNLEVGEVLSVD 226
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
VSCIVAV ++VNVQ+KYNGP+RRAVFGGDNLVTA +TGPGIVFIQSLPF R SQRIAR
Sbjct: 227 VSCIVAVNTTVNVQVKYNGPMRRAVFGGDNLVTATLTGPGIVFIQSLPFQRFSQRIAR 284
>gi|89257554|gb|ABD65044.1| hypothetical protein 26.t00103 [Brassica oleracea]
Length = 274
Score = 413 bits (1061), Expect = e-113, Method: Compositional matrix adjust.
Identities = 206/282 (73%), Positives = 240/282 (85%), Gaps = 12/282 (4%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARP-GSMCFMSGSM 59
M APFFSTPFQPYVYQS QD V PFQILGGEAQV Q +E+V+A+P GSMC+MSGS+
Sbjct: 1 MPAPFFSTPFQPYVYQSQQDTVTPFQILGGEAQVVQ----SEERVIAKPVGSMCYMSGSV 56
Query: 60 EMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGEL 119
EMEN Y + +VG QW+FGK++TSVVLRN P+DGFVG+AA ARILPIDLAMFGGE+
Sbjct: 57 EMENTY--KQQVGGLQWIFGKSVTSVVLRNTAPNDGFVGVAAHYFARILPIDLAMFGGEI 114
Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIG-GIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
LCQPDAFLCSV+DVKV N+ D R RN++ G E FLRQ+L+GQGLAFILAGGSV LE
Sbjct: 115 LCQPDAFLCSVSDVKVVNSFDPRARNIVAAGAEVFLRQRLSGQGLAFILAGGSV----LE 170
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
VGEV+T+DVSCI A+T SV+VQIKYN P+RRAVFGGDN+V A +TGPGIVFIQSLPFHRL
Sbjct: 171 VGEVLTIDVSCIAALTPSVDVQIKYNTPLRRAVFGGDNVVMATLTGPGIVFIQSLPFHRL 230
Query: 239 SQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTD 280
SQRIAR+VTSPNMRENP+ +QIA+F FLAYVVI+SSL+LT+
Sbjct: 231 SQRIARSVTSPNMRENPRLMIQIALFAFLAYVVILSSLLLTE 272
>gi|238480800|ref|NP_001154247.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
thaliana]
gi|332658489|gb|AEE83889.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
thaliana]
Length = 273
Score = 400 bits (1028), Expect = e-109, Method: Compositional matrix adjust.
Identities = 192/261 (73%), Positives = 223/261 (85%), Gaps = 12/261 (4%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61 MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
CQPDAFLCSV+DVKV NT+ Q R RN+ +GLAFI+AGGSVVQKNLE
Sbjct: 121 CQPDAFLCSVHDVKVVNTVYQRHRARNIAAA---------GAEGLAFIIAGGSVVQKNLE 171
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
VGEV+T+DVSCI A+T S+N QIKYN P+RRAVFGGDN+VTA +TGPGIVFIQSLPFHR
Sbjct: 172 VGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFHR 231
Query: 238 LSQRIARAVTSPNMRENPKFF 258
LSQRIAR+VTSPNMRENP+
Sbjct: 232 LSQRIARSVTSPNMRENPRLL 252
>gi|357115799|ref|XP_003559673.1| PREDICTED: uncharacterized protein LOC100838340 [Brachypodium
distachyon]
Length = 282
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 187/282 (66%), Positives = 232/282 (82%), Gaps = 1/282 (0%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS Q +V FQI GG+ QV Q+M+K EK+ +PG+MC+MSG+++
Sbjct: 1 MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVVQVMVKSHEKLTVKPGTMCYMSGNIQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y+ EN+ GMWQW+FGK+ITS V NPGP DG+VGI+AP RILPIDLA FGGELL
Sbjct: 61 MDNNYLHENDGGMWQWIFGKSITSTVFFNPGPDDGYVGISAPFPGRILPIDLANFGGELL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
Q DAFLCSVNDV V++T DQR RN G E L+QKL GQG+AFI+ GGSV+QK L
Sbjct: 121 SQADAFLCSVNDVSVTSTADQRPRNNEIGAEMILKQKLRGQGMAFIVGGGSVMQKILAPR 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EVITVD +CIVA+T+++N Q+K + +RRAVFGGDN +TA +TGPG+VFIQSLPF RLSQ
Sbjct: 181 EVITVDAACIVAMTTTINFQLKNHTQLRRAVFGGDNQLTASLTGPGVVFIQSLPFTRLSQ 240
Query: 241 RIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIA R++ +P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 RIASRSMAAPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 282
>gi|212724012|ref|NP_001132812.1| uncharacterized protein LOC100194302 [Zea mays]
gi|194695462|gb|ACF81815.1| unknown [Zea mays]
gi|414872762|tpg|DAA51319.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
Length = 282
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/282 (65%), Positives = 231/282 (81%), Gaps = 1/282 (0%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS Q +V FQI GG+ QV Q+MLK QEK+ A+PG+MC+MSG+M+
Sbjct: 1 MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVLQVMLKSQEKLTAKPGAMCYMSGNMQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y+PEN+ G+WQW+FGK ++S + N G DG+VGIAAP RILP+DL F GELL
Sbjct: 61 MDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFPGRILPVDLTNFSGELL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQ DAFLCSVNDV VS+T++ R RN+ G E L+QKL GQG+AF++ GGSV+QK L
Sbjct: 121 CQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQKLRGQGMAFLVGGGSVMQKILAPR 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EVITVD +CIVA+++++N Q+K +RRAVFGGDN +TA +TGPG+VFIQSLPFHRLSQ
Sbjct: 181 EVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQLTASLTGPGVVFIQSLPFHRLSQ 240
Query: 241 RIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIA R+V P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 RIASRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 282
>gi|414872761|tpg|DAA51318.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
Length = 283
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/283 (65%), Positives = 231/283 (81%), Gaps = 2/283 (0%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS Q +V FQI GG+ QV Q+MLK QEK+ A+PG+MC+MSG+M+
Sbjct: 1 MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVLQVMLKSQEKLTAKPGAMCYMSGNMQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y+PEN+ G+WQW+FGK ++S + N G DG+VGIAAP RILP+DL F GELL
Sbjct: 61 MDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFPGRILPVDLTNFSGELL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQ DAFLCSVNDV VS+T++ R RN+ G E L+QKL GQG+AF++ GGSV+QK L
Sbjct: 121 CQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQKLRGQGMAFLVGGGSVMQKILAPR 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EVITVD +CIVA+++++N Q+K +RRAVFGGDN +TA +TGPG+VFIQSLPFHRLSQ
Sbjct: 181 EVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQLTASLTGPGVVFIQSLPFHRLSQ 240
Query: 241 RIA--RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIA R+V P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 RIASSRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 283
>gi|89257470|gb|ABD64961.1| hypothetical protein 25.t00068 [Brassica oleracea]
Length = 1161
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 187/258 (72%), Positives = 217/258 (84%), Gaps = 16/258 (6%)
Query: 13 YVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG 72
+V QS QD + PFQILGGEAQV QIMLK +E+V+A+PGSMC+MSGS+EMEN Y PE EVG
Sbjct: 843 HVMQSQQDTITPFQILGGEAQVVQIMLKSEERVIAKPGSMCYMSGSVEMENTYTPEQEVG 902
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ QW+ GK+++SVVLRN G +DGFVGIAAP LARILPIDLAMFGGE+LCQPDAFLCSVND
Sbjct: 903 VLQWILGKSVSSVVLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEILCQPDAFLCSVND 962
Query: 133 VKVSNTLDQRVRN-VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
VKV N++DQR RN V+ G EGFLRQ+L+GQGLAFILAGGSV LEVGEV+ +DVSCI
Sbjct: 963 VKVVNSVDQRARNIVVAGSEGFLRQRLSGQGLAFILAGGSV----LEVGEVLNIDVSCIA 1018
Query: 192 AVTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA------- 243
A+T S++VQIKY N P+RRAVFGGDN+V A +TGPGIVFIQSLPFHRLSQRIA
Sbjct: 1019 ALTPSIDVQIKYNNAPLRRAVFGGDNVVMATLTGPGIVFIQSLPFHRLSQRIASFVVTKA 1078
Query: 244 ---RAVTSPNMRENPKFF 258
R+VTSPNMRENP+ F
Sbjct: 1079 FGDRSVTSPNMRENPRCF 1096
>gi|115455313|ref|NP_001051257.1| Os03g0747300 [Oryza sativa Japonica Group]
gi|14626285|gb|AAK71553.1|AC087852_13 unknown protein [Oryza sativa Japonica Group]
gi|108711062|gb|ABF98857.1| HTH DNA-binding protein, putative, expressed [Oryza sativa Japonica
Group]
gi|113549728|dbj|BAF13171.1| Os03g0747300 [Oryza sativa Japonica Group]
gi|125545710|gb|EAY91849.1| hypothetical protein OsI_13494 [Oryza sativa Indica Group]
Length = 282
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/282 (64%), Positives = 231/282 (81%), Gaps = 1/282 (0%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS + +V FQI GG+ QV Q+M+K QEK+ +PG+MC+MSG+++
Sbjct: 1 MAAPFFSTPFQPYVYQSQEGSVTAFQISGGDVQVLQVMVKSQEKLTVKPGTMCYMSGNIQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
+N Y+PEN+ G+WQW+FGK+I+S V NPG DG+VGI+AP RILP+DLA FGGELL
Sbjct: 61 TDNNYLPENDGGVWQWIFGKSISSSVFFNPGSDDGYVGISAPFPGRILPMDLANFGGELL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQ DAFLCSVNDV V++T++QR RN+ G E L+QKL GQG+AF++ GGSV+QK L
Sbjct: 121 CQADAFLCSVNDVSVTSTVEQRPRNIEIGAEVILKQKLRGQGMAFLVGGGSVMQKILAPR 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
EVITVD +CIVA+T+++N Q+K RR VF G N +TA +TGPG+VFIQSLPFHRLSQ
Sbjct: 181 EVITVDAACIVAMTTTINFQLKTPNQPRRVVFSGGNQLTASLTGPGVVFIQSLPFHRLSQ 240
Query: 241 RIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
RIA R+V +P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 RIASRSVAAPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 282
>gi|238480798|ref|NP_001154246.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
thaliana]
gi|332658488|gb|AEE83888.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
thaliana]
Length = 267
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/247 (73%), Positives = 211/247 (85%), Gaps = 12/247 (4%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61 MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
CQPDAFLCSV+DVKV NT+ Q R RN+ +GLAFI+AGGSVVQKNLE
Sbjct: 121 CQPDAFLCSVHDVKVVNTVYQRHRARNIAAA---------GAEGLAFIIAGGSVVQKNLE 171
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
VGEV+T+DVSCI A+T S+N QIKYN P+RRAVFGGDN+VTA +TGPGIVFIQSLPFHR
Sbjct: 172 VGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFHR 231
Query: 238 LSQRIAR 244
LSQRIAR
Sbjct: 232 LSQRIAR 238
>gi|326507602|dbj|BAK03194.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326509737|dbj|BAJ87084.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 284
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 184/284 (64%), Positives = 233/284 (82%), Gaps = 3/284 (1%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS Q +V FQI GG+ QV Q+M+K Q+K+ A+PG+MC+MSG+++
Sbjct: 1 MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVLQVMVKSQDKLTAKPGTMCYMSGNIQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y+PEN+ G+WQW+FGK+ITS V NPGP DG+VGI+AP RILPIDLA FGGELL
Sbjct: 61 MDNNYLPENDGGVWQWIFGKSITSTVFFNPGPDDGYVGISAPFPGRILPIDLANFGGELL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQ D+FLCSVNDV V++T+DQR RN+ G E L+QKL GQG+AF++ GGSV+QK L
Sbjct: 121 CQADSFLCSVNDVSVTSTVDQRPRNIEIGAEMILKQKLRGQGMAFLVGGGSVMQKILAPR 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRA--VFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
EVITVD CIVA+TS+++ Q++ RRA VFGG N + A +TGPG+VFIQSLPF RL
Sbjct: 181 EVITVDCGCIVAMTSTISFQLQNPTQHRRAVLVFGGVNQLKASLTGPGVVFIQSLPFSRL 240
Query: 239 SQRIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
SQRIA R+V +P++R+NP+FF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 SQRIASRSVAAPSLRDNPRFFIQIIMFFFLAYVMIVSSIILTDV 284
>gi|125587909|gb|EAZ28573.1| hypothetical protein OsJ_12558 [Oryza sativa Japonica Group]
Length = 283
Score = 347 bits (891), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 169/283 (59%), Positives = 222/283 (78%), Gaps = 2/283 (0%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS + +V FQI GG+ QV Q+M+K QEK+ +PG+MC+MSG+++
Sbjct: 1 MAAPFFSTPFQPYVYQSQEGSVTAFQISGGDVQVLQVMVKSQEKLTVKPGTMCYMSGNIQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
+N Y+PEN+ G+WQW+FGK+I+S V NPG DG+VGI+AP RILP+DLA FGGELL
Sbjct: 61 TDNNYLPENDGGVWQWIFGKSISSSVFFNPGSDDGYVGISAPFPGRILPMDLANFGGELL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQ DAFLCSVNDV V++T++QR RN+ G E L+QKL GQG+AF++ GGSV+QK L
Sbjct: 121 CQADAFLCSVNDVSVTSTVEQRPRNIEIGAEVILKQKLRGQGMAFLVGGGSVMQKILAPR 180
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVF-IQSLPFHRLS 239
EVITVD +CIVA+T+++N Q+K RR VF G N +TA +TGPG+VF + PF
Sbjct: 181 EVITVDAACIVAMTTTINFQLKTPNQPRRVVFSGGNELTASLTGPGVVFHTEPNPFIGFQ 240
Query: 240 QRI-ARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
+ +R+V +P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 NALPSRSVAAPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 283
>gi|8809625|dbj|BAA97176.1| unnamed protein product [Arabidopsis thaliana]
Length = 268
Score = 343 bits (879), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 160/214 (74%), Positives = 188/214 (87%), Gaps = 1/214 (0%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS QD + PFQILGGE+QV QIMLK +EKV+A+P SMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQQDTITPFQILGGESQVVQIMLKSEEKVIAKPASMCYMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN Y PE EVG+ QW+ GK+++SVVLRN G +DGFVGIAAP LARILPIDLAMFGGE+L
Sbjct: 61 MENTYTPEQEVGVLQWILGKSVSSVVLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
CQPDAFLCSV+DVKV N++DQR RN++ G EGFLRQ+L+GQGLAFILAGGSVVQK LEV
Sbjct: 121 CQPDAFLCSVHDVKVVNSVDQRARNIVAAGAEGFLRQRLSGQGLAFILAGGSVVQKVLEV 180
Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
GEV ++DVSCI A+T S++V+IK N P RRA+FG
Sbjct: 181 GEVFSIDVSCIAALTPSIDVRIKNNAPFRRALFG 214
>gi|449533339|ref|XP_004173633.1| PREDICTED: uncharacterized protein LOC101231218, partial [Cucumis
sativus]
Length = 207
Score = 337 bits (863), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 168/207 (81%), Positives = 189/207 (91%)
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
WLFGK++TS+VL+N G SDG+VGIAAPSLARILPID+AMFGGELLCQPDAFLCSVNDVK
Sbjct: 1 HWLFGKSVTSIVLQNSGTSDGYVGIAAPSLARILPIDMAMFGGELLCQPDAFLCSVNDVK 60
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
V+NT+D RVRNV+ G+EGFLRQKL+GQGLAFIL GGSVVQKNLEVGEVI VD+SCIVA+T
Sbjct: 61 VNNTIDHRVRNVVPGVEGFLRQKLSGQGLAFILGGGSVVQKNLEVGEVIAVDISCIVAMT 120
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
+S++VQIKYNGP+RRAVFGG+NLVTA +TGPGIVFIQSLPF RLSQRIARAVTSPNMRE
Sbjct: 121 TSIDVQIKYNGPVRRAVFGGENLVTATLTGPGIVFIQSLPFQRLSQRIARAVTSPNMREI 180
Query: 255 PKFFVQIAIFFFLAYVVIVSSLILTDV 281
F + FFLAYVVIVSS+ILTDV
Sbjct: 181 QSFSFSLHFSFFLAYVVIVSSIILTDV 207
>gi|195645802|gb|ACG42369.1| hypothetical protein [Zea mays]
gi|414872765|tpg|DAA51322.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
Length = 245
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 157/245 (64%), Positives = 201/245 (82%), Gaps = 1/245 (0%)
Query: 38 MLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFV 97
MLK QEK+ A+PG+MC+MSG+M+M+N Y+PEN+ G+WQW+FGK ++S + N G DG+V
Sbjct: 1 MLKSQEKLTAKPGAMCYMSGNMQMDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYV 60
Query: 98 GIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQK 157
GIAAP RILP+DL F GELLCQ DAFLCSVNDV VS+T++ R RN+ G E L+QK
Sbjct: 61 GIAAPFPGRILPVDLTNFSGELLCQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQK 120
Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
L GQG+AF++ GGSV+QK L EVITVD +CIVA+++++N Q+K +RRAVFGGDN
Sbjct: 121 LRGQGMAFLVGGGSVMQKILAPREVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQ 180
Query: 218 VTAVVTGPGIVFIQSLPFHRLSQRIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSL 276
+TA +TGPG+VFIQSLPFHRLSQRIA R+V P++R+NPKFF+QI +FFFLAYV+IVSS+
Sbjct: 181 LTASLTGPGVVFIQSLPFHRLSQRIASRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSSI 240
Query: 277 ILTDV 281
ILTDV
Sbjct: 241 ILTDV 245
>gi|414872764|tpg|DAA51321.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
Length = 246
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 157/246 (63%), Positives = 201/246 (81%), Gaps = 2/246 (0%)
Query: 38 MLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFV 97
MLK QEK+ A+PG+MC+MSG+M+M+N Y+PEN+ G+WQW+FGK ++S + N G DG+V
Sbjct: 1 MLKSQEKLTAKPGAMCYMSGNMQMDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYV 60
Query: 98 GIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQK 157
GIAAP RILP+DL F GELLCQ DAFLCSVNDV VS+T++ R RN+ G E L+QK
Sbjct: 61 GIAAPFPGRILPVDLTNFSGELLCQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQK 120
Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
L GQG+AF++ GGSV+QK L EVITVD +CIVA+++++N Q+K +RRAVFGGDN
Sbjct: 121 LRGQGMAFLVGGGSVMQKILAPREVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQ 180
Query: 218 VTAVVTGPGIVFIQSLPFHRLSQRIA--RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSS 275
+TA +TGPG+VFIQSLPFHRLSQRIA R+V P++R+NPKFF+QI +FFFLAYV+IVSS
Sbjct: 181 LTASLTGPGVVFIQSLPFHRLSQRIASSRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSS 240
Query: 276 LILTDV 281
+ILTDV
Sbjct: 241 IILTDV 246
>gi|215700953|dbj|BAG92377.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 245
Score = 325 bits (833), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 155/245 (63%), Positives = 201/245 (82%), Gaps = 1/245 (0%)
Query: 38 MLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFV 97
M+K QEK+ +PG+MC+MSG+++ +N Y+PEN+ G+WQW+FGK+I+S V NPG DG+V
Sbjct: 1 MVKSQEKLTVKPGTMCYMSGNIQTDNNYLPENDGGVWQWIFGKSISSSVFFNPGSDDGYV 60
Query: 98 GIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQK 157
GI+AP RILP+DLA FGGELLCQ DAFLCSVNDV V++T++QR RN+ G E L+QK
Sbjct: 61 GISAPFPGRILPMDLANFGGELLCQADAFLCSVNDVSVTSTVEQRPRNIEIGAEVILKQK 120
Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
L GQG+AF++ GGSV+QK L EVITVD +CIVA+T+++N Q+K RR VF G N
Sbjct: 121 LRGQGMAFLVGGGSVMQKILAPREVITVDAACIVAMTTTINFQLKTPNQPRRVVFSGGNQ 180
Query: 218 VTAVVTGPGIVFIQSLPFHRLSQRIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSL 276
+TA +TGPG+VFIQSLPFHRLSQRIA R+V +P++R+NPKFF+QI +FFFLAYV+IVSS+
Sbjct: 181 LTASLTGPGVVFIQSLPFHRLSQRIASRSVAAPSLRDNPKFFIQIVMFFFLAYVMIVSSI 240
Query: 277 ILTDV 281
ILTDV
Sbjct: 241 ILTDV 245
>gi|302761006|ref|XP_002963925.1| hypothetical protein SELMODRAFT_230257 [Selaginella moellendorffii]
gi|300167654|gb|EFJ34258.1| hypothetical protein SELMODRAFT_230257 [Selaginella moellendorffii]
Length = 284
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 158/284 (55%), Positives = 205/284 (72%), Gaps = 5/284 (1%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAV-IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSM 59
MAAP STPFQPYVYQS Q A FQI G EAQ+ QIMLKPQEK+ A+ GSMC+ SGS+
Sbjct: 1 MAAPASSTPFQPYVYQSEQSAAATAFQIFGFEAQILQIMLKPQEKIQAKSGSMCYTSGSI 60
Query: 60 EMENIYIPENE--VGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG 117
+ E + +P+ VG+ QW+ G+ TS N G DG++G+ APSLARILPIDLA+FGG
Sbjct: 61 QAE-VNVPQENAGVGVLQWMMGRPDTSTTFTNAGSEDGYIGLGAPSLARILPIDLAVFGG 119
Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
E++CQ +AFL SVNDV V + +RVR G +G L QKL GQGLAFI AGGSV+QK L
Sbjct: 120 EIICQREAFLGSVNDVTVKHLTPRRVRVGYFGADGLLTQKLVGQGLAFITAGGSVLQKKL 179
Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYNGP-IRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
GE++ VD+ C+VA+TS+V +IK P RRA FGG+ A +TGPG+VFIQSLP
Sbjct: 180 TRGELLVVDIGCLVAMTSAVECEIKQASPAARRAFFGGECPFQAHLTGPGVVFIQSLPSQ 239
Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTD 280
RL+ +IAR +++P+ R+NP+FF+ +F FLAY +I +SL+L +
Sbjct: 240 RLAHKIARTISAPHFRDNPRFFISAGVFLFLAYAMIFASLLLAE 283
>gi|302769153|ref|XP_002967996.1| hypothetical protein SELMODRAFT_440175 [Selaginella moellendorffii]
gi|300164734|gb|EFJ31343.1| hypothetical protein SELMODRAFT_440175 [Selaginella moellendorffii]
Length = 286
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 159/286 (55%), Positives = 205/286 (71%), Gaps = 7/286 (2%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAV-IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSM 59
MAAP STPFQPYVYQS Q A FQI G EAQV QIMLKPQEK+ A+ GSMC+ SGS+
Sbjct: 1 MAAPASSTPFQPYVYQSEQSAAATAFQIFGFEAQVLQIMLKPQEKIQAKSGSMCYTSGSI 60
Query: 60 EMENIYIPENE--VGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG 117
+ E + +P+ VG+ QW+ G+ TS N G DG++G+ APSLARILPIDLA+FGG
Sbjct: 61 QAE-VNVPQENAGVGVLQWMMGRPDTSTTFTNAGSEDGYIGLGAPSLARILPIDLAVFGG 119
Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE--GFLRQKLTGQGLAFILAGGSVVQK 175
E++CQ +AFL SVNDV V + +RVR G + G L QKL GQGLAFI AGGSV+QK
Sbjct: 120 EIICQREAFLGSVNDVTVKHLTPRRVRVGYFGADVGGLLTQKLVGQGLAFITAGGSVLQK 179
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGP-IRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
L GE++ VD+ C+VA+TS+V +IK P RRA FGG+ A +TGPG+VFIQSLP
Sbjct: 180 KLSRGELLVVDIGCLVAMTSAVECEIKQASPAARRAFFGGECPFQAHLTGPGVVFIQSLP 239
Query: 235 FHRLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTD 280
RL+ +IAR +++P+ R+NP+FF+ +F FLAY +I +SL+L +
Sbjct: 240 SQRLAHKIARTISAPHFRDNPRFFISAGVFLFLAYAMIFASLLLAE 285
>gi|414872766|tpg|DAA51323.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
Length = 232
Score = 310 bits (794), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 148/232 (63%), Positives = 189/232 (81%), Gaps = 2/232 (0%)
Query: 52 MCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPID 111
MC+MSG+M+M+N Y+PEN+ G+WQW+FGK ++S + N G DG+VGIAAP RILP+D
Sbjct: 1 MCYMSGNMQMDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFPGRILPVD 60
Query: 112 LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGS 171
L F GELLCQ DAFLCSVNDV VS+T++ R RN+ G E L+QKL GQG+AF++ GGS
Sbjct: 61 LTNFSGELLCQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQKLRGQGMAFLVGGGS 120
Query: 172 VVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQ 231
V+QK L EVITVD +CIVA+++++N Q+K +RRAVFGGDN +TA +TGPG+VFIQ
Sbjct: 121 VMQKILAPREVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQLTASLTGPGVVFIQ 180
Query: 232 SLPFHRLSQRIA--RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
SLPFHRLSQRIA R+V P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 181 SLPFHRLSQRIASSRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 232
>gi|2245101|emb|CAB10523.1| hypothetical protein [Arabidopsis thaliana]
gi|7268494|emb|CAB78745.1| hypothetical protein [Arabidopsis thaliana]
Length = 270
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/241 (59%), Positives = 170/241 (70%), Gaps = 35/241 (14%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1 MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP
Sbjct: 61 MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILP----------- 109
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRN------------VIGGIEG---------FLRQKLT 159
PDAFLCSV+DVKV NT+ QR R V+GG E + R +L
Sbjct: 110 --PDAFLCSVHDVKVVNTVYQRHRARNIAAAGAERVVVVGGSETKAFWSGSCFYYRWRLW 167
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLV 218
L + VVQKNLEVGEV+T+DVSCI A+T S+N QIKYN P+RRAVFG D ++
Sbjct: 168 PFKLILLHLLHKVVQKNLEVGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGLDPMM 227
Query: 219 T 219
T
Sbjct: 228 T 228
>gi|195648342|gb|ACG43639.1| hypothetical protein [Zea mays]
Length = 215
Score = 244 bits (622), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 114/189 (60%), Positives = 150/189 (79%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+MC+MSG+M+M+N Y+PEN+ G+WQW+FGK ++S + N G DG+VGIAAP
Sbjct: 17 LTAKPGTMCYMSGNMQMDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFP 76
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
RILP+DL F GELLCQ DAFLCSVNDV VS+T++ R RN+ G E L+QKL GQG+A
Sbjct: 77 GRILPVDLTNFSGELLCQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQKLRGQGMA 136
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
F++ GGSV+QK L EVITVD +CIVA+++++N Q+K +RRAVFGGDN +TA +TG
Sbjct: 137 FLVGGGSVMQKILAPREVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQLTASLTG 196
Query: 225 PGIVFIQSL 233
PG+VFIQSL
Sbjct: 197 PGVVFIQSL 205
>gi|147839862|emb|CAN63757.1| hypothetical protein VITISV_006491 [Vitis vinifera]
Length = 289
Score = 243 bits (621), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 114/161 (70%), Positives = 135/161 (83%), Gaps = 13/161 (8%)
Query: 11 QPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE 70
+ ++ +SPQ+ V PFQILGGEAQ+ QIMLKPQEK++A+PGSMC+MSGS++MEN+YIPENE
Sbjct: 131 EKHLVKSPQENVTPFQILGGEAQIVQIMLKPQEKIIAKPGSMCYMSGSIQMENVYIPENE 190
Query: 71 VGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
VGMWQWLFGK +TS+VL N G S+GFVGIAAPSLARILP PDAFLCS+
Sbjct: 191 VGMWQWLFGKAVTSIVLLNQGSSNGFVGIAAPSLARILP-------------PDAFLCSL 237
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGS 171
NDVKV+NT DQR RN++ G EGFLRQKLTGQGLAFI+AGGS
Sbjct: 238 NDVKVNNTTDQRARNIVVGAEGFLRQKLTGQGLAFIVAGGS 278
>gi|225468541|ref|XP_002270085.1| PREDICTED: uncharacterized protein LOC100251350 [Vitis vinifera]
Length = 161
Score = 242 bits (617), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 112/145 (77%), Positives = 125/145 (86%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVY+SPQ+ V PFQILGG AQ+ QIMLKPQEK++A+ GS+C+MSGS +
Sbjct: 1 MAAPFFSTPFQPYVYRSPQENVTPFQILGGAAQMVQIMLKPQEKIIAKLGSICYMSGSTQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
MEN+YIPEN+VGMWQW FGK TSVVL N G DGFVGI APSLARI PID AMF GE+L
Sbjct: 61 MENVYIPENQVGMWQWAFGKAATSVVLLNQGSGDGFVGIVAPSLARIFPIDSAMFDGEIL 120
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRN 145
CQPDAFLCS+NDVKVSNT DQ RN
Sbjct: 121 CQPDAFLCSLNDVKVSNTTDQGARN 145
>gi|406886961|gb|EKD33877.1| hypothetical protein ACD_75C02564G0002 [uncultured bacterium]
Length = 332
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 142/239 (59%), Gaps = 11/239 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
I FQI+G E Q +I L QE VA GSM +MS +++M ++ +E +G
Sbjct: 73 IDFQIIGSEMQYVEIELDSQESAVAEAGSMMYMSDNIQMHTVFGDGSESAGAGFLDRMLG 132
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+ + S + GP G V AAP +I+ +DL +GG+L+CQ DAFLC+
Sbjct: 133 AGKRLITGEGLFSTIFTYQGPGKGKVAFAAPYPGKIIALDLKEYGGKLVCQKDAFLCAAK 192
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ QKL G G F+ AGG+V+QK+L+ GE + VD C+V
Sbjct: 193 GVAIGVEFQKKIGTALFGGEGFIMQKLEGDGRVFVHAGGTVIQKDLQAGETMRVDTGCLV 252
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
A+T ++N I++ G ++ A+FGG+ A +TGPG ++QSLPF RL+ RI RA+ N
Sbjct: 253 AMTQNINYDIEFVGGVKSAIFGGEGFFFATITGPGHAWLQSLPFSRLAGRIHRAMPQNN 311
>gi|406888983|gb|EKD35298.1| hypothetical protein ACD_75C01963G0002 [uncultured bacterium]
Length = 332
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/246 (39%), Positives = 143/246 (58%), Gaps = 12/246 (4%)
Query: 16 QSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----- 70
QS D I +QI G E Q +I L PQE VA GSM +MS S++M I+ +E
Sbjct: 67 QSGADE-IDYQIFGNEMQYVEIELDPQESAVAEAGSMMYMSDSIQMHTIFGDGSESSGAG 125
Query: 71 -----VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPD 124
+G + L G+ + S V G G V AAP +I+ +DL +GG+L+CQ D
Sbjct: 126 FLDKMLGAGKRLITGEGLFSTVFTYQGNGKGKVAFAAPYPGKIIALDLKDYGGKLVCQKD 185
Query: 125 AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVIT 184
AFLC+ V + +++ + G EGF+ QKL G G F+ AGG++VQK+L+ GE +
Sbjct: 186 AFLCAAKGVSIGVEFQKKIGTALFGGEGFIMQKLEGNGRIFVHAGGTIVQKDLQPGETMR 245
Query: 185 VDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+D C+VA+T ++N I++ G ++ A+FGG+ A +TGPG ++QSLPF RL+ RI R
Sbjct: 246 IDTGCLVAMTQNINYDIEFVGGVKSAIFGGEGFFFATITGPGHAWLQSLPFSRLAGRIHR 305
Query: 245 AVTSPN 250
A N
Sbjct: 306 AAPQNN 311
>gi|410672294|ref|YP_006924665.1| protein of unknown function DUF124 [Methanolobus psychrophilus R15]
gi|409171422|gb|AFV25297.1| protein of unknown function DUF124 [Methanolobus psychrophilus R15]
Length = 250
Score = 179 bits (455), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 94/234 (40%), Positives = 142/234 (60%), Gaps = 4/234 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT- 81
I +QI+G + Q+ +I L P E V A G+M +M ++ME G+ + L G++
Sbjct: 5 IDYQIIGNDMQLVEIELDPGESVRAEAGAMMYMGPGIKMETSTGGGLLKGLKRTLTGESF 64
Query: 82 -ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
ITS V + G G+V AP +I+P+DL GG LCQ D+FLC+ +++ L
Sbjct: 65 FITSYV--HEGTGKGYVAFGAPYPGKIIPLDLKEVGGRFLCQKDSFLCAAKGIEIEVALT 122
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ + G EGF+ Q+LTG GLAF+ AGG++++K+L+ GEVI VD CIVA + V+
Sbjct: 123 KKIGAGLFGGEGFILQRLTGDGLAFVHAGGTIIKKDLQAGEVIRVDTGCIVAFSEKVDYN 182
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
I+ G + A+FGG+ LV A + GPG V++QSLPF RL+ RI A + E+
Sbjct: 183 IQLVGGFKNALFGGEGLVLATLKGPGTVYLQSLPFSRLADRIMAAASYQRRDES 236
>gi|366162605|ref|ZP_09462360.1| hypothetical protein AcelC_02941 [Acetivibrio cellulolyticus CD2]
Length = 264
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 95/244 (38%), Positives = 141/244 (57%), Gaps = 12/244 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I ++I+G E Q+ +I L PQE V+A GSM +M ++ME I+ +E L GK +
Sbjct: 8 IDYEIVGSEMQLVEIELDPQESVIAEAGSMMYMESGIQMETIFGDGSEKSNSGGLMGKFL 67
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
T+ + N G + +AP +I+P+DL F EL+CQ D+FLC+
Sbjct: 68 TAGKRLLTGESLFMTIFTNCGSTKQKAAFSAPYPGKIIPMDLKQFNEELICQKDSFLCAA 127
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V ++++ + G EGF+ QKL G GLAF+ AGG++V+K L GE + +D C+
Sbjct: 128 KGVSVGIAFNKKIGAGLFGGEGFIMQKLEGDGLAFVHAGGTIVRKELRPGETLRLDTGCL 187
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
VA+T +VN I+ I+ A+FGG+ L A +TGPG V++QSLPF RL+ RI A S +
Sbjct: 188 VALTRTVNYDIQLAKGIKSAIFGGEGLFLATLTGPGTVWLQSLPFSRLAGRIISASPSKS 247
Query: 251 MREN 254
E
Sbjct: 248 RDEG 251
>gi|373459159|ref|ZP_09550926.1| protein of unknown function DUF124 [Caldithrix abyssi DSM 13497]
gi|371720823|gb|EHO42594.1| protein of unknown function DUF124 [Caldithrix abyssi DSM 13497]
Length = 252
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 140/227 (61%), Gaps = 4/227 (1%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI ++ILG E Q+ +I L P E V A G+M +M +EM+ G + + G++
Sbjct: 4 VIDYRILGDEMQIVEIELDPGEGVRAEAGAMLYMEDGIEMQTGTGGGIFKGFKRMITGES 63
Query: 82 --ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
IT+ + R G G V AAP +I+P+DL FGG +LCQ DAFLC+ +++
Sbjct: 64 FFITTFLYRGQGK--GLVSFAAPYPGKIIPVDLTQFGGRILCQKDAFLCAAAGIEIEVAF 121
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+R+ + G EGF+ Q+L G G+AFI AGG+V++KNL GE + VD C+VA+ +V+
Sbjct: 122 TKRLGAGLFGGEGFILQELRGDGMAFIHAGGTVIEKNLAAGERLRVDTGCLVALAPTVDY 181
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
I++ G + A+FGG+ L AV+ GPG V++QSLPF RL+ RI+ A
Sbjct: 182 DIQFIGGFKNALFGGEGLFLAVLRGPGTVYLQSLPFSRLADRISAAA 228
>gi|332527627|ref|ZP_08403674.1| hypothetical protein RBXJA2T_16832 [Rubrivivax benzoatilyticus JA2]
gi|332112030|gb|EGJ12007.1| hypothetical protein RBXJA2T_16832 [Rubrivivax benzoatilyticus JA2]
Length = 269
Score = 178 bits (451), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/233 (40%), Positives = 134/233 (57%), Gaps = 13/233 (5%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
V+ F+I G E Q +I L P E V GSM FM + M+ ++ + LFGK
Sbjct: 3 VVDFEIKGSEMQFVEIELDPGEAAVGEAGSMMFMDAGITMDTVF-GDGRQNAGGGLFGKL 61
Query: 82 IT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
++ + V N G V AAP +ILP+DL+ GG L+CQ DAFLC+
Sbjct: 62 LSAGKRLVTGESLFTTVYTNQGAGKQRVAFAAPYPGKILPMDLSKMGGTLICQKDAFLCA 121
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + L Q++ G EGF+ Q+L G GLAF+ AGG+VV++ L+ G+ + VD C
Sbjct: 122 ARGVSLGIALQQKLSTGFFGGEGFIMQRLDGDGLAFVHAGGTVVRRELQAGQTLLVDTGC 181
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+VA T V+ +I+Y G I+ A+FGG+ L A +TGPG V++QSLPF RL+ RI
Sbjct: 182 VVAYTPGVDFEIQYVGKIKTALFGGEGLFLAKMTGPGTVWLQSLPFSRLASRI 234
>gi|383757289|ref|YP_005436274.1| hypothetical protein RGE_14340 [Rubrivivax gelatinosus IL144]
gi|381377958|dbj|BAL94775.1| hypothetical protein RGE_14340 [Rubrivivax gelatinosus IL144]
Length = 269
Score = 177 bits (449), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 99/246 (40%), Positives = 138/246 (56%), Gaps = 14/246 (5%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
V+ F+I G E Q +I L P E V GSM FM + M+ ++ + LFGK
Sbjct: 3 VVDFEIKGSEMQFVEIELDPGEAAVGEAGSMMFMDAGITMDTVF-GDGRQNAGGGLFGKL 61
Query: 82 IT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
++ + V N G V AAP +ILP+DL+ GG L+CQ DAFLC+
Sbjct: 62 LSAGKRLVTGESLFTTVYTNQGAGKLRVAFAAPYPGKILPMDLSKMGGTLICQKDAFLCA 121
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + L Q++ G EGF+ Q+L G GLAF+ AGG+VV++ L+ G+ + VD C
Sbjct: 122 ARGVSLGIALQQKLSTGFFGGEGFIMQRLDGDGLAFVHAGGTVVRRELQAGQTLLVDTGC 181
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTS 248
+VA T V+ +I+Y G I+ A+FGG+ L A +TGPG V++QSLPF RL+ RI A A
Sbjct: 182 VVAYTPGVDFEIQYVGKIKTALFGGEGLFLAKMTGPGTVWLQSLPFSRLASRIFAAAPQR 241
Query: 249 PNMREN 254
RE
Sbjct: 242 GGSREE 247
>gi|91772707|ref|YP_565399.1| hypothetical protein Mbur_0683 [Methanococcoides burtonii DSM 6242]
gi|91711722|gb|ABE51649.1| protein of unknown function DUF124 [Methanococcoides burtonii DSM
6242]
Length = 259
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 134/234 (57%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I ++I G E Q+ +I L E V A G+M FM ++M+ G+ + G++
Sbjct: 5 IDYKIHGNEMQLVEIELDQGETVRAEAGAMMFMGDGIKMDTSTGGGLLKGLKRAFSGESF 64
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
G G V AP +I+PIDLA GG LCQ DAFLC+ N V ++ ++
Sbjct: 65 FITSFTQNGKGKGTVAFGAPYPGKIIPIDLASEGGSFLCQKDAFLCAANGVDITVAFSKK 124
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + G EGF+ Q+L G GLAF+ AGG++++K L GE + +D C+VA + SVN I+
Sbjct: 125 IGAGLFGGEGFILQRLKGNGLAFVHAGGTIIKKELAPGETLRIDTGCLVAFSESVNYDIE 184
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPK 256
+ G + A+FGG+ +V A V+GPG V++QSLPF RL+ RIA A+ N K
Sbjct: 185 FVGGFKNALFGGEGIVLATVSGPGTVYLQSLPFSRLADRIAAAIGPNNSSRGEK 238
>gi|294495704|ref|YP_003542197.1| hypothetical protein Mmah_1035 [Methanohalophilus mahii DSM 5219]
gi|292666703|gb|ADE36552.1| protein of unknown function DUF124 [Methanohalophilus mahii DSM
5219]
Length = 252
Score = 175 bits (443), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/224 (39%), Positives = 131/224 (58%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I + I+G + Q+ +I L P E V A G+M +M ++M+ G+ + + G++
Sbjct: 5 IDYDIIGNDMQLVEIELDPSESVRAEAGAMMYMGQDIKMQTSTGGGLFKGLKRMVTGESF 64
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
N G V AP +I+PIDL GG LCQ DAFLC+ ++V ++
Sbjct: 65 FITSFTNEGDGKERVAFGAPYPGKIIPIDLTEVGGSFLCQKDAFLCAAKGIEVGIAFSKK 124
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + G EGF+ Q+L G GLAF+ AGG++++K L+ GE + VD C+VA + SVN IK
Sbjct: 125 IGAGLFGGEGFILQRLEGDGLAFVHAGGTIIKKELDAGETMRVDTGCLVAFSESVNYDIK 184
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
G R A+FGG+ +V A V+GPG V++QSLPF RL+ RIA A
Sbjct: 185 LIGGFRNALFGGEGVVLATVSGPGTVYLQSLPFSRLADRIAAAT 228
>gi|193215179|ref|YP_001996378.1| hypothetical protein Ctha_1472 [Chloroherpeton thalassium ATCC
35110]
gi|193088656|gb|ACF13931.1| protein of unknown function DUF124 [Chloroherpeton thalassium ATCC
35110]
Length = 254
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 87/227 (38%), Positives = 135/227 (59%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+I +++ G + Q+ ++ L PQE V A G+M +M ++M+ G + L G++
Sbjct: 18 IIDYKLFGDDMQIVEVELDPQEGVRAEVGAMLYMEDGIQMQTDTGGGLFSGFKRMLTGES 77
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G V AAP +I+P+DL GGE LCQ D FLC N V ++ +
Sbjct: 78 FFITSFVNQGHGKQRVAFAAPYPGKIVPLDLGQLGGEFLCQKDGFLCCANGVDINVAFTR 137
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
R+ + + G EGF+ Q+LTG GLAFI AGG+++++ L+ GE + VD C+VA+ SV+ I
Sbjct: 138 RIGSGLFGGEGFILQRLTGDGLAFIHAGGTIIKRELQAGESLRVDTGCLVALAPSVDYDI 197
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+ G + A+FGG+ L A +TGPG V++QSLPF R++ RI ++ S
Sbjct: 198 NFVGGFKNALFGGEGLFLAKLTGPGTVYLQSLPFSRIADRILASLPS 244
>gi|407473686|ref|YP_006788086.1| hypothetical protein Curi_c12280 [Clostridium acidurici 9a]
gi|407050194|gb|AFS78239.1| hypothetical protein DUF124 [Clostridium acidurici 9a]
Length = 250
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/231 (40%), Positives = 138/231 (59%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENE--------VG 72
I ++I G + Q +I L PQE V+A G+M +M ++ME I+ NE +G
Sbjct: 7 IDYKIYGDDMQFVEIELDPQESVIAEAGAMMYMDDKIQMETIFGDGSRNESRGLMGKLMG 66
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ V N V AAP +I+PIDLA GGEL+CQ DAFLCS
Sbjct: 67 AGKRLITGESLFMTVFSNVAQGKAHVAFAAPYPGKIIPIDLAQLGGELICQKDAFLCSAK 126
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + ++++ G EGF+ QKL G G AF+ AGG++++K+L VGEV+ +D C+V
Sbjct: 127 GVSIGIHFNKKIGTGFFGGEGFIMQKLEGDGFAFLHAGGTIIRKDLNVGEVLRLDTGCLV 186
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A+TS+V I + G I+ +FGG+ L A + GPG V++QSLPF R++ RI
Sbjct: 187 AMTSTVQYDISFTGDIKSGLFGGEGLFLATLKGPGSVWLQSLPFSRMADRI 237
>gi|387791094|ref|YP_006256159.1| hypothetical protein Solca_1923 [Solitalea canadensis DSM 3403]
gi|379653927|gb|AFD06983.1| TIGR00266 family protein [Solitalea canadensis DSM 3403]
Length = 261
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 97/242 (40%), Positives = 134/242 (55%), Gaps = 10/242 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I F+I G E Q +I L PQE VVA GS M ++ME I+ + G LF
Sbjct: 7 IDFRIFGEELQCVEIELDPQETVVAESGSFMMMDQEIQMETIFGDGSQQNSGFMGKLFSA 66
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G S V AAP RI+P+DL G+L+CQ DAFLC+
Sbjct: 67 GKRLLTGESLFMTAFTNQGQSKKHVTFAAPYPGRIIPMDLHRLDGKLICQKDAFLCAAKG 126
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ +KL G G+AF+ AGG VV++ L+ GEV+ VD CIVA
Sbjct: 127 VSIGIEFQRKLGTGLFGGEGFIMEKLEGDGMAFVHAGGMVVERELQPGEVLKVDTGCIVA 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
T V+ I++ G I+ ++FGG+ L A + GPG V+IQSLP RL+ RI T P
Sbjct: 187 FTHGVDYDIQFVGGIKNSLFGGEGLFFATLRGPGKVWIQSLPISRLAGRILSLATGPRKE 246
Query: 253 EN 254
E
Sbjct: 247 EG 248
>gi|435851682|ref|YP_007313268.1| TIGR00266 family protein [Methanomethylovorans hollandica DSM
15978]
gi|433662312|gb|AGB49738.1| TIGR00266 family protein [Methanomethylovorans hollandica DSM
15978]
Length = 257
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/234 (35%), Positives = 138/234 (58%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+I ++I+G + Q+ +I L P+E V A G+M +M ++ME G+ + + G++
Sbjct: 4 IIDYKIIGNDMQLVEIELDPRESVRAEAGAMMYMGPGIKMETSTGGGLLKGLKRAVTGES 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ G G++ AP +I+ +DLA GG +CQ D+FLC+ +++ T +
Sbjct: 64 FFITNFTHEGEGKGYIAFGAPYPGKIICLDLAQLGGGFICQKDSFLCAAKGIEIEITFTK 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ + G EGF+ QKL G GLAF+ AGG++++++L+ GEVI VD CI A S++ I
Sbjct: 124 KLGAGLFGGEGFILQKLKGDGLAFVHAGGTIIERDLKAGEVIRVDTGCICAFAESIDYNI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENP 255
+ G + A+FGG+ +V A + GPG VF+QSLPF RL+ RI A + E+
Sbjct: 184 GFVGGFKNALFGGEGVVLATLKGPGKVFLQSLPFSRLADRIMAAARYQSHDESS 237
>gi|255526734|ref|ZP_05393636.1| protein of unknown function DUF124 [Clostridium carboxidivorans P7]
gi|296187024|ref|ZP_06855424.1| hypothetical protein CLCAR_2499 [Clostridium carboxidivorans P7]
gi|255509569|gb|EET85907.1| protein of unknown function DUF124 [Clostridium carboxidivorans P7]
gi|296048462|gb|EFG87896.1| hypothetical protein CLCAR_2499 [Clostridium carboxidivorans P7]
Length = 263
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 92/246 (37%), Positives = 146/246 (59%), Gaps = 15/246 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
I ++I G + Q +I L P+E V+A GSM M +++ME I+ +++ G+ LFG
Sbjct: 6 IDYKIYGDDMQFVEIELDPKETVIAEAGSMMMMDTNIQMETIFGDGSSQSKGGLMDKLFG 65
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+++ N G V AAP + +ILP+DL+ GG+L+CQ D+FLCS
Sbjct: 66 AGKRVLTGESLFMTAYTNIGGGKQKVSFAAPYIGKILPMDLSALGGKLICQKDSFLCSAK 125
Query: 132 DVKVSNTLDQRVRNVIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V V +D R + +G G EGF+ QKL G GLAF+ + G++++K+L GE++ +D C
Sbjct: 126 GVSVG--IDFRKKLSVGFFGGEGFILQKLEGDGLAFVHSCGALIKKDLAYGEILRIDTGC 183
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
+VA+T ++ I++ G I+ FGG+ + A V+GPG V+IQSLPF+RL++R+ A
Sbjct: 184 LVAMTKDIHYDIEFVGGIKNTFFGGEGVFFATVSGPGTVWIQSLPFNRLAERVFAAAPRG 243
Query: 250 NMRENP 255
R+
Sbjct: 244 GGRKEE 249
>gi|83815096|ref|YP_444449.1| hypothetical protein SRU_0303 [Salinibacter ruber DSM 13855]
gi|83756490|gb|ABC44603.1| conserved hypothetical protein TIGR00266 [Salinibacter ruber DSM
13855]
Length = 255
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 88/234 (37%), Positives = 135/234 (57%), Gaps = 1/234 (0%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I + I+G + Q+ ++ L PQE V A G+M +M + M+ G + + G++
Sbjct: 9 IDYDIIGDDMQLVEVELDPQEGVRAEAGTMIYMDDGVRMQTDTGGGLFKGFKRMVSGESF 68
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ G V AAP +I+P+ L FGGE LCQ DAFLC+ +V++ +R
Sbjct: 69 FITTFVHEGQGKSHVAFAAPYPGKIIPVQLDDFGGEFLCQKDAFLCANANVEIEVAFTKR 128
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + G EGF+ Q+LTGQG FI AGG+VV++ LE GE + +D C+ A +SS++ I+
Sbjct: 129 MGAGLFGGEGFILQRLTGQGWTFIHAGGTVVERTLEAGETLRIDTGCLAAFSSSIDYDIE 188
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSPNMRENP 255
+ G + A+FGG+ L +TGPG V++QSLPF RL+ RI A + + E P
Sbjct: 189 FIGGFKNALFGGEGLFLTTLTGPGTVYLQSLPFSRLADRIVAGELANRGESEGP 242
>gi|381187037|ref|ZP_09894602.1| DUF124 domain-containing protein [Flavobacterium frigoris PS1]
gi|379650647|gb|EIA09217.1| DUF124 domain-containing protein [Flavobacterium frigoris PS1]
Length = 266
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 90/230 (39%), Positives = 132/230 (57%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I +QI G E Q +I L PQE V+A GS M +++ME I+ E + G++ L
Sbjct: 6 IDYQIFGEEMQYVEIELDPQEIVIAEAGSFMMMDNNIQMETIFGDGSEQKSGIFGQLLNA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N S G V A+P +ILPIDL F G+ +CQ +FLC+
Sbjct: 66 GKRVLTGESLFMTAFINQASSKGKVSFASPYPGKILPIDLTEFQGKFICQKSSFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ + G EGF+ QK+ G G+AF+ +GG++ +K L GEV+ VD CIV
Sbjct: 126 VSVGIEFSKKMGRGLFGGEGFIMQKIEGDGMAFVHSGGTLAKKELRAGEVLKVDTGCIVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T V+ I++ G I+ +VFGG+ L A + GPG V++QSLPF RL+ RI
Sbjct: 186 FTKDVDYDIEFIGGIKNSVFGGEGLFYATLRGPGTVYVQSLPFSRLADRI 235
>gi|294506194|ref|YP_003570252.1| hypothetical protein SRM_00379 [Salinibacter ruber M8]
gi|294342522|emb|CBH23300.1| Conserved hypothetical protein [Salinibacter ruber M8]
Length = 255
Score = 170 bits (431), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 92/236 (38%), Positives = 138/236 (58%), Gaps = 5/236 (2%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT- 81
I + I+G + Q+ ++ L PQE V A G+M +M + M+ G + + G++
Sbjct: 9 IDYDIIGDDMQLVEVELDPQEGVRAEAGTMIYMDDGVRMQTDTGGGLFKGFKRMVSGESF 68
Query: 82 -ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
IT+ V G S V AAP +I+P+ L FGGE LCQ DAFLC+ V++
Sbjct: 69 FITTFVHEGQGKS--HVAFAAPYPGKIIPVQLDDFGGEFLCQKDAFLCANASVEIEVAFT 126
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+R+ + G EGF+ Q+LTGQG FI AGG+VV++ LE GE + +D C+ A +SS++
Sbjct: 127 KRMGAGLFGGEGFILQRLTGQGWTFIHAGGTVVERTLEAGETLRIDTGCLAAFSSSIDYD 186
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSPNMRENP 255
I++ G + A+FGG+ L +TGPG V++QSLPF RL+ RI A + + E P
Sbjct: 187 IEFIGGFKNALFGGEGLFLTTLTGPGTVYLQSLPFSRLADRIVAGELANRGESEGP 242
>gi|20093431|ref|NP_619506.1| hypothetical protein MA4652 [Methanosarcina acetivorans C2A]
gi|19918805|gb|AAM07986.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
Length = 263
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 89/237 (37%), Positives = 133/237 (56%), Gaps = 7/237 (2%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------Q 75
I ++I+G + Q+ +I L P E V A G+M +M ++M+ E + +
Sbjct: 5 IDYEIIGNDMQIVEIELDPGEAVQAEAGAMAYMGPGIQMQTSMGSEGGGLLGGLKKGLKR 64
Query: 76 WLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKV 135
L G++ + G G V AAP +I+P+DL+ FGG +LCQ DAFLC+ V V
Sbjct: 65 ALTGESFFITNFIHKGSGKGHVAFAAPYPGKIIPLDLSKFGGSILCQKDAFLCAARGVDV 124
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+R+ + G EGF+ Q+L G G AF+ GG+VV+K+L GE VD C+ A T
Sbjct: 125 EVAFTRRIGTGLFGGEGFILQRLRGNGFAFVHIGGTVVRKDLAPGETYHVDTGCVAAFTE 184
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+VN I ++ + A+FGG+ +V A +TGPG +++QSLPF RL+ RI A S R
Sbjct: 185 TVNYDITWSKDFKNALFGGEGIVLATLTGPGTIYMQSLPFSRLADRIYAASASHQNR 241
>gi|374339471|ref|YP_005096207.1| hypothetical protein Marpi_0466 [Marinitoga piezophila KA3]
gi|372101005|gb|AEX84909.1| TIGR00266 family protein [Marinitoga piezophila KA3]
Length = 257
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 85/233 (36%), Positives = 133/233 (57%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI ++I G + Q+ +I L P E V A G+M +M +EM+ G+ + L G++
Sbjct: 4 VIDYKIYGDDMQIVEIELDPGEGVRAEAGAMMYMEPGIEMQTKATGGIFGGLKRMLVGES 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G S V AP +++PIDL FGGE +CQ D++LCS + + T +
Sbjct: 64 FFITNFLNTGHSKSHVAFGAPYPGKVIPIDLTKFGGEFICQKDSYLCSAAGIDIEITFTK 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ + G EGF+ ++L G GLAF+ AGG+++++ L GE + VD CIV +++V+ I
Sbjct: 124 KIGAGLFGGEGFILERLVGDGLAFVHAGGTIIERQLMPGETLRVDTGCIVGFSNTVDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
++ G A+FGG+ + +TGPG V++QSLPF RL+ RI A N E
Sbjct: 184 QFVGGFVNALFGGEGIFLTHLTGPGTVYLQSLPFSRLADRIYSAANFSNRGET 236
>gi|395802295|ref|ZP_10481548.1| hypothetical protein FF52_10473 [Flavobacterium sp. F52]
gi|395435536|gb|EJG01477.1| hypothetical protein FF52_10473 [Flavobacterium sp. F52]
Length = 267
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 90/236 (38%), Positives = 131/236 (55%), Gaps = 13/236 (5%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +QI G E Q +I L PQE V+A GS M +++ME I+ + LF
Sbjct: 2 QAHEIDYQIFGEEMQYVEIELDPQEIVIAEAGSFMMMENNIQMETIF-GDGSQQQGSGLF 60
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK +++ N G + V A+P +ILPIDL F G+ +CQ +F
Sbjct: 61 GKLLSAGKRVLTGESLFMTAFLNQGHTKSKVSFASPYPGKILPIDLTEFQGKFICQKSSF 120
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V V Q++ + G EGF+ QK+ G G+AF+ +GG++ +K L GEV+ VD
Sbjct: 121 LCAAKGVSVGIEFSQKLGRGLFGGEGFIMQKIEGDGMAFVHSGGTMAKKELAHGEVLKVD 180
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
CI+ T V+ I++ G I+ ++FGG+ L A + GPG V+IQSLPF RL+ RI
Sbjct: 181 TGCIIGFTKDVDYDIEFIGGIKNSIFGGEGLFYATLKGPGTVYIQSLPFSRLADRI 236
>gi|171060011|ref|YP_001792360.1| hypothetical protein Lcho_3337 [Leptothrix cholodnii SP-6]
gi|170777456|gb|ACB35595.1| protein of unknown function DUF124 [Leptothrix cholodnii SP-6]
Length = 272
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 136/231 (58%), Gaps = 10/231 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG 79
VI ++I G E Q ++ L P E + GSM FM + M+ ++ + G++ L G
Sbjct: 5 VIDYEIKGSEMQFVEVELDPGEAAIGEAGSMMFMDAGIGMDTVFGDGRAQQGGLFGKLLG 64
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+++ + V N G + AAP +ILP+DL+ GG L+CQ DAFLC+
Sbjct: 65 AGKRLVTGESLFTTVYTNNGSGKQRIAFAAPYPGKILPMDLSKLGGTLICQKDAFLCAAR 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + + Q++ G EGF+ QKL G GLAF+ AGG+VV++ L+ G+ + VD C+V
Sbjct: 125 GVSLGIAIQQKLSVGFFGGEGFIMQKLDGDGLAFVHAGGTVVRRELQPGQTLLVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A T+ V+ +I+Y G I+ A+FGG+ L A ++GPG +++QSLP RL+ RI
Sbjct: 185 AYTAGVSFEIQYVGKIKTALFGGEGLFFAKLSGPGTIWLQSLPLSRLASRI 235
>gi|375104787|ref|ZP_09751048.1| TIGR00266 family protein [Burkholderiales bacterium JOSHI_001]
gi|374665518|gb|EHR70303.1| TIGR00266 family protein [Burkholderiales bacterium JOSHI_001]
Length = 272
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 134/232 (57%), Gaps = 11/232 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------- 74
V+ ++I G E Q ++ L P E V GSM FM + M+ ++ ++ G
Sbjct: 5 VVDYEIKGSEMQFVEVELDPGEAAVGEAGSMMFMDAGIGMDTVFGDGSQGGGGGLFGKLL 64
Query: 75 ----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ + G+++ + V N + V AAP +ILP+DL GG LLCQ DAFLC+
Sbjct: 65 GAGKRLITGESLFTTVYTNNASAKQRVAFAAPYPGKILPMDLRQLGGTLLCQKDAFLCAA 124
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + Q++ G EGF+ QKL G GLAF+ AGG+V+++ L G+ + VD C+
Sbjct: 125 RGVSLGIAFQQKMSVGFFGGEGFIMQKLEGNGLAFVHAGGTVLKRELAPGQTLLVDTGCV 184
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA T SVN +I+Y G ++ A+FGG+ L A +TGPG +++QSLPF RL+ R+
Sbjct: 185 VAYTPSVNFEIQYVGKVKTALFGGEGLFFAKLTGPGTLWLQSLPFSRLASRV 236
>gi|209524295|ref|ZP_03272845.1| protein of unknown function DUF124 [Arthrospira maxima CS-328]
gi|423066799|ref|ZP_17055589.1| hypothetical protein SPLC1_S511370 [Arthrospira platensis C1]
gi|209495386|gb|EDZ95691.1| protein of unknown function DUF124 [Arthrospira maxima CS-328]
gi|406711824|gb|EKD07023.1| hypothetical protein SPLC1_S511370 [Arthrospira platensis C1]
Length = 238
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 96/229 (41%), Positives = 135/229 (58%), Gaps = 1/229 (0%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK 80
VI ++I G + Q+ +I L P E V A G+M +M ++M+ G + L G+
Sbjct: 4 VIDYKIYGNDLQLVEIELDPGEGVQAEVGTMTYMEDGIQMQTGMGKGGLFSGFKRMLTGE 63
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
N G VG AA S +I+P+DLA GGE LCQ DAFLC+ N +++
Sbjct: 64 GFFITTFLNNGKQRSRVGFAATSPGQIIPVDLAQIGGEFLCQKDAFLCAANGTEINVAFT 123
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ G EGF+ Q+L G GLAFI AGG+++QKNL+ GE + VD C+VA + SV
Sbjct: 124 KKLGAGFFGGEGFILQRLKGDGLAFIQAGGAMIQKNLQPGETLRVDTGCLVAFSPSVRYD 183
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
I++ G R A+FGG+ L A +TGPGIV +QSLPF RL+ RI +T P
Sbjct: 184 IQFVGGFRNALFGGEGLFLAKMTGPGIVIVQSLPFSRLADRINSMITLP 232
>gi|376003487|ref|ZP_09781297.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
gi|375328144|emb|CCE17050.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
Length = 238
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 95/229 (41%), Positives = 136/229 (59%), Gaps = 1/229 (0%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK 80
VI ++I G + Q+ +I L P E V A G+M +M ++M+ G + + G+
Sbjct: 4 VIDYKIYGNDLQLVEIELDPGEGVQAEVGTMTYMEDGIQMQTGMGKGGLFSGFKRMITGE 63
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+ N G VG AA S +I+P+DLA GGE LCQ DAFLC+ N +++
Sbjct: 64 SFFITTFLNNGKQRSRVGFAATSPGQIIPVDLAQIGGEFLCQKDAFLCAANGTEINVAFT 123
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ G EGF+ Q+L G GLAFI AGG+++QKNL+ GE + VD C+VA + SV
Sbjct: 124 KKLGAGFFGGEGFILQRLKGDGLAFIQAGGAMIQKNLQPGETLRVDTGCLVAFSPSVRYD 183
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
I++ G R A+FGG+ L A +TGPGIV +QSLPF RL+ RI +T P
Sbjct: 184 IQFVGGFRNALFGGEGLFLAKMTGPGIVIVQSLPFSRLADRINSMITLP 232
>gi|124267405|ref|YP_001021409.1| hypothetical protein Mpe_A2218 [Methylibium petroleiphilum PM1]
gi|124260180|gb|ABM95174.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 270
Score = 167 bits (424), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 93/244 (38%), Positives = 138/244 (56%), Gaps = 11/244 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY--IPENEVGMWQWLFG 79
VI ++I G E Q ++ L P E + GSM FM + M+ ++ + G + L G
Sbjct: 5 VIDYEIKGAEMQFVEVELDPGEAAIGEAGSMMFMDAGIGMDTVFGDASAQQGGFFGKLLG 64
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+++ + V N S V AAP +ILP+DL GG L+CQ DAFLC+
Sbjct: 65 AGKRLVTGESLFTTVYTNNVGSKQRVAFAAPYPGKILPMDLRTLGGTLICQKDAFLCAAR 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ G EGF+ QKL G GLAF+ AGG+V+++ L+ G+ + VD C+V
Sbjct: 125 GVSLGIAFQRKLSVGFFGGEGFIMQKLDGDGLAFVHAGGTVLKRELQPGQTLLVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSPN 250
A T SV+ +I+Y G ++ A+FGG+ L A +TGPG V++QSLPF RL+ R+ A A
Sbjct: 185 AYTQSVDFEIQYVGKVKTALFGGEGLFFAKLTGPGTVWLQSLPFSRLASRVFAAAPQRGG 244
Query: 251 MREN 254
RE
Sbjct: 245 SREE 248
>gi|310658335|ref|YP_003936056.1| conserved protein of unknown function [[Clostridium] sticklandii]
gi|308825113|emb|CBH21151.1| conserved protein of unknown function [[Clostridium] sticklandii]
Length = 269
Score = 167 bits (423), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 136/235 (57%), Gaps = 12/235 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ F++ G + Q +I L E ++A G+M +M ++ ME I+ + + L GK +
Sbjct: 7 VDFKVHGDDMQFVEIQLDQNETIIAEAGAMMYMDQNIRMETIFGDGSGSDEGKGLMGKVL 66
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ N P V AAP RI+P+DLA + GE++CQ DAFLC+
Sbjct: 67 GAGKRVLTGESLFMTSFTNTAPDRKSVAFAAPYPGRIVPLDLAEYSGEIICQRDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
++V ++V + G EGF+ Q+L G GL+F+ AGG+++QK L GEV+ +D C+
Sbjct: 127 KGIRVEIAFQKKVGAGLFGGEGFIMQRLLGDGLSFLHAGGTIIQKELAPGEVLKLDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
VA+T+++N ++ G ++ A+FGG+ L A + GPG V++QSLPF RL+ R+ A
Sbjct: 187 VAMTNTINYDVEMAGNVKSALFGGEGLFLATLRGPGHVWLQSLPFSRLANRMYAA 241
>gi|409122567|ref|ZP_11221962.1| hypothetical protein GCBA3_02899 [Gillisia sp. CBA3202]
Length = 266
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 88/245 (35%), Positives = 140/245 (57%), Gaps = 12/245 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENE------VGM 73
+ +QI G E Q ++ L P E V+A G+ M ++M I+ +NE +G
Sbjct: 6 VDYQIFGEEMQYVELELDPNEAVIAEAGNFMMMDSGIKMNTIFGDGSKQNEGFLGKVLGA 65
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + N V A+P +ILP+DL +GG+ +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTIFSNEVSGKKKVSFASPYPGKILPLDLIKYGGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRV-RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ R +GG EGF+ QK+ G G+AF+ AGG++ +K L+VGE I VD CI+
Sbjct: 126 VSIGIEFSRKLGRGFLGG-EGFIMQKVEGDGMAFVHAGGTMARKELKVGEKICVDTGCII 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
T VN I++ G I+ +FGG+ L A +TGPG+V++QSLPF RL+ R+ +A
Sbjct: 185 GFTQDVNYDIEFVGGIKNTIFGGEGLFFATLTGPGVVYVQSLPFSRLANRVLQAAPQAGG 244
Query: 252 RENPK 256
++ +
Sbjct: 245 KDKGE 249
>gi|451946761|ref|YP_007467356.1| TIGR00266 family protein [Desulfocapsa sulfexigens DSM 10523]
gi|451906109|gb|AGF77703.1| TIGR00266 family protein [Desulfocapsa sulfexigens DSM 10523]
Length = 331
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 81/236 (34%), Positives = 133/236 (56%), Gaps = 12/236 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I ++I G E Q +I L P+E VV+ G+M +MS + M+ ++ +E F K +
Sbjct: 73 IDYEIFGHEMQFVEIELDPRESVVSEAGAMMYMSNGITMDTVFGDGSEESQSGGFFDKML 132
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ + + G V A+P +I+P+DL + G ++CQ DAFLC+
Sbjct: 133 GAGKRLISGEGLFVTMFTHTGQGKAKVAFASPYPGKIIPLDLKQYNGRIICQKDAFLCAA 192
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ + G EGF+ Q+L G G F+ AGG++++K L GE + VD C+
Sbjct: 193 KGVSIGIAFQKKIGTALFGGEGFIMQQLDGDGFCFVHAGGTIIEKELAAGETLRVDTGCL 252
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
VA+T +VN I++ G ++ A+FGG++ A + GPG V++QSLPF RL+ RI +A
Sbjct: 253 VALTQTVNYDIEFVGNVKSAIFGGESFFFATLQGPGHVWLQSLPFSRLAGRIHQAA 308
>gi|337280412|ref|YP_004619884.1| hypothetical protein Rta_27620 [Ramlibacter tataouinensis TTB310]
gi|334731489|gb|AEG93865.1| Conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 270
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 89/231 (38%), Positives = 132/231 (57%), Gaps = 10/231 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG 79
VI + I G E Q ++ L P E + GSM FM ++ M+ ++ + G++ L G
Sbjct: 5 VIDYDIRGAEMQFVELELDPGEAAIGEAGSMMFMDAAIAMDTVFGDGRAQQGGLFGKLMG 64
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+++ + V N G V AAP +ILP++L GG L+CQ D+FLC+
Sbjct: 65 AGKRLVTGESLFTTVYTNHGAGKQRVAFAAPYPGKILPMNLRELGGSLVCQKDSFLCAAR 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + L +R+ G EGF+ QKL G GLAF+ AGG+VV++ L+ G+ + D C+V
Sbjct: 125 GVSLGIALQRRLSAGFFGGEGFIMQKLEGDGLAFVHAGGTVVRRELQPGQTLLADTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A T V I+Y G +R A+FGG+ L A +TGPG V++QSLPF RL+ R+
Sbjct: 185 AYTPGVGFDIQYVGKVRTALFGGEGLFFARLTGPGTVWLQSLPFSRLASRV 235
>gi|384098568|ref|ZP_09999682.1| hypothetical protein W5A_07917 [Imtechella halotolerans K1]
gi|383835263|gb|EID74690.1| hypothetical protein W5A_07917 [Imtechella halotolerans K1]
Length = 266
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 90/244 (36%), Positives = 134/244 (54%), Gaps = 10/244 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF-- 78
+ + I G E Q +I L PQE VVA G+ M + ME I+ + + G+ LF
Sbjct: 6 VDYHIYGEEMQYVEIELDPQEAVVAEAGNFMMMDSGIRMETIFGDGSGQDQGIMGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ V N V AAP +I+PIDL + G+ +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTVFLNEFQGKRKVSFAAPYPGKIIPIDLTAYQGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ + G EGF+ QKL G G+AF+ AGG++ +K L GEV+ VD C+V
Sbjct: 126 VSVGIEFSKKIGRGLFGGEGFIMQKLEGDGMAFVHAGGTLARKELAAGEVLKVDTGCVVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
T +V+ I++ G I+ VFGG+ L A + GPG+V+IQSLPF RL+ R+ A +
Sbjct: 186 FTQTVDYDIEFIGGIKNTVFGGEGLFFATLRGPGVVYIQSLPFSRLAGRVWSAAPQAGGK 245
Query: 253 ENPK 256
+ +
Sbjct: 246 QKGE 249
>gi|146298455|ref|YP_001193046.1| hypothetical protein Fjoh_0692 [Flavobacterium johnsoniae UW101]
gi|146152873|gb|ABQ03727.1| protein of unknown function DUF124 [Flavobacterium johnsoniae
UW101]
Length = 267
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/236 (38%), Positives = 131/236 (55%), Gaps = 13/236 (5%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +QI G E Q +I L PQE V+A GS M +++ME I+ + LF
Sbjct: 2 QAHEIDYQIFGEEMQYVEIELDPQEIVIAEAGSFMMMENNIQMETIF-GDGSQQQGSGLF 60
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK +++ N G + V A+P +ILPIDL F G+ +CQ +F
Sbjct: 61 GKLLSAGKRVLTGESLFMTAFLNQGNTKSKVSFASPYPGKILPIDLTEFQGKFICQKSSF 120
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V V Q++ + G EGF+ QK+ G G+AF+ +GG++ +K L GEV+ VD
Sbjct: 121 LCAAKGVSVGIEFSQKLGRGLFGGEGFIMQKIEGDGIAFVHSGGTMAKKVLGHGEVLKVD 180
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
CI+ T V+ I++ G I+ ++FGG+ L A + GPG V+IQSLPF RL+ RI
Sbjct: 181 TGCIIGFTKDVDYDIEFIGGIKNSIFGGEGLFYATLKGPGTVYIQSLPFSRLADRI 236
>gi|330752099|emb|CBL80609.1| protein containing DUF124 [uncultured Leeuwenhoekiella sp.]
Length = 266
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 131/230 (56%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
I +QI G E Q +I L PQE V+A G+ M ++ME I+ +N+ G+ +FG
Sbjct: 6 IDYQIFGEEMQYVEIELDPQEGVIAEAGTFMMMQDGIQMETIFGDGSKNDTGLLNKIFGA 65
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ V N V A+P +I+PIDL+ GG +CQ DAFLC+
Sbjct: 66 GKRLLTGESMFMTVFYNNLIGKRKVSFASPYPGKIIPIDLSKNGGRFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ Q+L G G+AF+ AGG+ K L+ GE + VD CI+
Sbjct: 126 VSIGIEFSKKIGRGLFGGEGFIMQRLEGDGMAFVHAGGTTFTKELQAGEKLKVDTGCIIG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+VN I++ G I+ +FGG+ + A +TGPG V+IQSLPF RL++R+
Sbjct: 186 FDHTVNYDIEFVGGIKNTLFGGEGVFFATLTGPGTVYIQSLPFSRLAKRV 235
>gi|291571684|dbj|BAI93956.1| hypothetical protein [Arthrospira platensis NIES-39]
Length = 238
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 94/229 (41%), Positives = 135/229 (58%), Gaps = 1/229 (0%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK 80
VI ++I G + Q+ +I L P E V A G+M +M ++M+ G + + G+
Sbjct: 4 VIDYKIYGNDLQLVEIELDPGEGVQAEVGTMTYMEDGIQMQTGMGKGGLFSGFKRMITGE 63
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+ N G VG AA S +I+P+DLA GGE LCQ DAFLC+ N +++
Sbjct: 64 SFFITTFLNNGKQRSRVGFAATSPGQIIPVDLAQIGGEFLCQKDAFLCAANGTEINVAFT 123
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ G EGF+ Q+L G GLAFI AGG+++QK L+ GE + VD C+VA + SV
Sbjct: 124 KKLGAGFFGGEGFILQRLKGDGLAFIQAGGAMIQKTLQPGETLRVDTGCLVAFSPSVRYD 183
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
I++ G R A+FGG+ L A +TGPGIV +QSLPF RL+ RI +T P
Sbjct: 184 IQFVGGFRNALFGGEGLFLAKMTGPGIVIVQSLPFSRLADRINSMITLP 232
>gi|451819306|ref|YP_007455507.1| TIGR00266 family protein [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
gi|451785285|gb|AGF56253.1| TIGR00266 family protein [Clostridium saccharoperbutylacetonicum
N1-4(HMT)]
Length = 265
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 138/239 (57%), Gaps = 11/239 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLF- 78
I ++I G + Q +I L P E V+A GSM + +++ME I+ + G LF
Sbjct: 6 IDYKIYGDDMQYVEIELDPAETVIAEAGSMMMIDPNIQMETIFGDGSKQQSNGFMDKLFS 65
Query: 79 -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ N G AAP +ILP+DL++ GG+L+CQ DAFLC+
Sbjct: 66 AGKRVLTGESLFMTAFTNAGQLKEKAYFAAPYPGKILPMDLSLLGGKLICQKDAFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ +KL G G+AFI A G++V+++L +GE++ +D C+V
Sbjct: 126 GVSVGIDFRKKLSVGFFGGEGFILEKLEGDGMAFIHACGAIVERDLALGEILKIDTGCLV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
A+T V+ I++ G I+ +FGG+ + A VTGPG V+IQSLPF RL++R+ A S
Sbjct: 186 AMTQGVHYDIQFIGGIKNTLFGGEGVFFATVTGPGKVWIQSLPFSRLAERVFSAAPSAG 244
>gi|428316299|ref|YP_007114181.1| protein of unknown function DUF124 [Oscillatoria nigro-viridis PCC
7112]
gi|428239979|gb|AFZ05765.1| protein of unknown function DUF124 [Oscillatoria nigro-viridis PCC
7112]
Length = 240
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/224 (39%), Positives = 133/224 (59%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+I ++I G + Q+ +I L P+E V A G+M +M G ++M+ G + L G
Sbjct: 4 IIDYKIYGDDLQLIEIELDPKEGVRAEAGTMTYMEGDIQMQTSTGGGLFQGFKRMLTGAG 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G V AAP +++P+DL GG+ LCQ D+FLC+ N +++ +
Sbjct: 64 FFITTFVNAGNRKARVAFAAPYPGKVIPLDLGQLGGKFLCQKDSFLCAANGIEIEVAFTK 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
R+ G EGF+ QKL G GLAF+ AGG++V+KNL VGEV+ VD C+VA +V+ I
Sbjct: 124 RLGAGFFGGEGFILQKLQGDGLAFVHAGGTIVEKNLGVGEVLRVDTGCLVAFAPTVDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
++ G + A+FGG+ L A +TGPG V++QSLP +L++RI A
Sbjct: 184 QFVGGFKNALFGGEGLFLAKMTGPGKVYVQSLPLSKLAERILAA 227
>gi|406672919|ref|ZP_11080144.1| TIGR00266 family protein [Bergeyella zoohelcum CCUG 30536]
gi|405587463|gb|EKB61191.1| TIGR00266 family protein [Bergeyella zoohelcum CCUG 30536]
Length = 265
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 87/250 (34%), Positives = 132/250 (52%), Gaps = 12/250 (4%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
++ I ++I G E Q I L PQE +A GS M ++M+ I+ +E L
Sbjct: 2 KKNHEIDYEIHGEEMQYVDIELDPQETAIAEAGSFMMMEDGIQMQTIFGDGSEKNQSTGL 61
Query: 78 FGKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDA 125
FGK +++ N G V A+P +I+P+DL G+++CQ DA
Sbjct: 62 FGKILSAGKRMLTGESLFMTAYTNMGQGKKKVAFASPYPGKIIPLDLEQLNGKVICQKDA 121
Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
FLC+ V V +++ + G EGF+ QKL G G+AF+ AGG V +K L+VGE I +
Sbjct: 122 FLCAAKGVSVGIEFQKKLGTGLFGGEGFIMQKLEGDGMAFVHAGGHVTEKELQVGEKIKI 181
Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
D CIV T +V+ +++ G I+ +FGG+ + A +TGPG ++IQ+LP RL+ RI
Sbjct: 182 DTGCIVGFTQTVHYDVEFVGGIKNTLFGGEGVFFATLTGPGKIWIQTLPVSRLASRILSY 241
Query: 246 VTSPNMRENP 255
T E
Sbjct: 242 ATYNKKEEGS 251
>gi|423316198|ref|ZP_17294103.1| TIGR00266 family protein [Bergeyella zoohelcum ATCC 43767]
gi|405584268|gb|EKB58184.1| TIGR00266 family protein [Bergeyella zoohelcum ATCC 43767]
Length = 265
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/250 (35%), Positives = 131/250 (52%), Gaps = 12/250 (4%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
++ I ++I G E Q I L PQE +A GS M ++M+ I+ +E L
Sbjct: 2 KKNHEIDYEIHGEEMQYVDIELDPQETAIAEAGSFMMMEDGIQMQTIFGDGSEKNQSTGL 61
Query: 78 FGKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDA 125
FGK +++ N G V A+P +I+P+DL G+++CQ DA
Sbjct: 62 FGKILSAGKRMLTGESLFMTAYTNMGQGKKKVAFASPYPGKIIPLDLEQLNGKVICQKDA 121
Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
FLC+ V V +++ + G EGF+ QKL G G+AF+ AGG V +K L+ GE I +
Sbjct: 122 FLCAAKGVSVGIEFQKKLGTGLFGGEGFIMQKLEGDGMAFVHAGGHVTEKELQAGEKIKI 181
Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
D CIV T +V+ I++ G I+ +FGG+ + A +TGPG V+IQ+LP RL+ RI
Sbjct: 182 DTGCIVGFTQTVHYDIEFVGGIKNTLFGGEGVFFATLTGPGKVWIQTLPVSRLASRILSY 241
Query: 246 VTSPNMRENP 255
T E
Sbjct: 242 ATYNKKEEGS 251
>gi|330752718|emb|CBL88182.1| protein containing DUF124 [uncultured Leeuwenhoekiella sp.]
Length = 266
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 131/230 (56%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG- 79
I +QI G E Q +I L PQE V+A G+ M ++ME I+ +N+ G+ +FG
Sbjct: 6 IDYQIFGEEMQYVEIELDPQEGVIAEAGTFMMMQDGIQMETIFGDGSKNDTGLLNKIFGA 65
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ V N V A+P +I+PIDL GG +CQ DAFLC+
Sbjct: 66 GKRLLTGESMFMTVFYNNLIGKRKVSFASPYPGKIIPIDLNKNGGRFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ Q+L G G++F+ AGG+ K L+VGE + VD CI+
Sbjct: 126 VSIGIEFSKKIGRGLFGGEGFIMQRLDGDGMSFVHAGGTTFTKELQVGEKLKVDTGCIIG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+VN I++ G I+ +FGG+ + A +TGPG V+IQSLPF RL++R+
Sbjct: 186 FDHTVNYDIEFVGGIKNTLFGGEGVFFATLTGPGTVYIQSLPFSRLAKRV 235
>gi|86144232|ref|ZP_01062566.1| hypothetical protein MED217_06549 [Leeuwenhoekiella blandensis
MED217]
gi|85829310|gb|EAQ47778.1| hypothetical protein MED217_06549 [Leeuwenhoekiella blandensis
MED217]
Length = 266
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 86/230 (37%), Positives = 130/230 (56%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
I +QI G E Q +I L PQE V+A G+ M + ME I+ +N+ G+ +FG
Sbjct: 6 IDYQIYGEEMQYVEIELDPQEGVIAEAGTFMMMEDGIRMETIFGDGSKNDSGLLNKVFGA 65
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ V N V A+P +I+PIDL GG +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTVFYNDIVGKKKVSFASPYPGKIIPIDLTKNGGRFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G+AF+ +GG+ +K L +GE + VD CI+
Sbjct: 126 VSIGIEFSRKLGRGLFGGEGFIMQKLEGDGMAFVHSGGTTFKKELGLGEKLKVDTGCIIG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+VN I++ G I+ +FGG+ + A +TGPG V+IQSLPF RL++R+
Sbjct: 186 FDHTVNYDIEFVGGIKNTIFGGEGVFFATLTGPGTVYIQSLPFSRLAKRV 235
>gi|407773538|ref|ZP_11120839.1| hypothetical protein TH2_06548 [Thalassospira profundimaris WP0211]
gi|407284002|gb|EKF09530.1| hypothetical protein TH2_06548 [Thalassospira profundimaris WP0211]
Length = 235
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/227 (39%), Positives = 137/227 (60%), Gaps = 4/227 (1%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI + + G E Q+ +I L P E V A G+M +MS + M+ G + L G+
Sbjct: 4 VIDYTVHGEEMQLVEIELDPGEGVRAEAGAMMYMSDGIAMQTGTGGGLFSGFKRMLTGEG 63
Query: 82 --ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
ITS V + G G V AAP +++P+DL GGE++CQ DAFLC+ + + +
Sbjct: 64 FFITSFV--HEGSGKGHVAFAAPYPGKVVPLDLGEHGGEIICQKDAFLCAASGIDIDIAF 121
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ + G EGF+ Q+L+G GLAF+ AGG++++K+L+ G+ + VD +VA+T SV+
Sbjct: 122 SKKLGAGLFGGEGFILQRLSGDGLAFMHAGGTIIRKDLKPGQTLRVDTGALVAMTPSVDY 181
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
IK+ G A+FGG+ L + GPG V++QSLPF RL+ RIA A+
Sbjct: 182 SIKFVGGFTNALFGGEGLFVTTLEGPGTVWLQSLPFARLADRIAAAL 228
>gi|409993428|ref|ZP_11276569.1| hypothetical protein APPUASWS_20002 [Arthrospira platensis str.
Paraca]
gi|409935697|gb|EKN77220.1| hypothetical protein APPUASWS_20002 [Arthrospira platensis str.
Paraca]
Length = 238
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 135/229 (58%), Gaps = 1/229 (0%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK 80
VI ++I G + Q+ +I L P E V A G+M +M ++M+ G + + G+
Sbjct: 4 VIDYKIYGNDLQLVEIELDPGEGVQAEVGTMTYMEDGIQMQTGMGKGGLFSGFKRMITGE 63
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+ N G VG AA S +I+P+DL+ GGE LCQ DAFLC+ N +++
Sbjct: 64 SFFITTFLNNGKQRSRVGFAATSPGQIIPVDLSQIGGEFLCQKDAFLCAANGTEINVAFT 123
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ G EGF+ Q+L G GLAFI AGG+++QK L+ GE + VD C+VA + SV
Sbjct: 124 KKLGAGFFGGEGFILQRLKGDGLAFIQAGGAMIQKTLQPGETLRVDTGCLVAFSPSVRYD 183
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
I++ G R A+FGG+ L A +TGPGIV +QSLPF RL+ RI +T P
Sbjct: 184 IQFVGGFRNALFGGEGLFLAKMTGPGIVIVQSLPFSRLADRINSMITLP 232
>gi|340616149|ref|YP_004734602.1| hypothetical protein zobellia_137 [Zobellia galactanivorans]
gi|339730946|emb|CAZ94210.1| Conserved hypothetical protein [Zobellia galactanivorans]
Length = 265
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 89/234 (38%), Positives = 132/234 (56%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I +QI G E Q +I L PQE VVA GS M ++M+ I+ + G+ +F
Sbjct: 6 IDYQIYGEEMQYVEIELDPQEAVVAEAGSFMMMDSDIKMDTIFGDGSNQDTGVLGKIFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G V A+P +ILPIDL+ G+ +CQ DAFLC+
Sbjct: 66 GKRILTGESLFMTAFLNIGQGKRQVSFASPYPGKILPIDLSEKQGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ + G EGF+ QKL G GLAF+ AGG++ +K L GE++ VD CIV
Sbjct: 126 VSVGIEFSKKLGRGLFGGEGFIMQKLEGDGLAFVHAGGTMAKKELAAGEILKVDTGCIVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+ +V+ I++ G I+ +FGG+ L A + GPG V+IQSLPF RL+ R+ ++
Sbjct: 186 FSQTVDYDIEFVGGIKNTIFGGEGLFFATLRGPGTVYIQSLPFSRLAGRVLASI 239
>gi|126662554|ref|ZP_01733553.1| hypothetical protein FBBAL38_04345 [Flavobacteria bacterium BAL38]
gi|126625933|gb|EAZ96622.1| hypothetical protein FBBAL38_04345 [Flavobacteria bacterium BAL38]
Length = 266
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 132/232 (56%), Gaps = 14/232 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I + I G E Q ++ L PQE VVA GS M ++ME I+ ++ Q +FGK +
Sbjct: 6 IDYHIYGEEMQYVEVELDPQEVVVAEAGSFMMMENGIKMETIFGDGSQ--EQQGIFGKLL 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
++ +N V A+P +I+PIDL FGG+ +CQ D+FLC+
Sbjct: 64 SAGKRVLTGESLFMTAYQNQDIGKRKVSFASPYPGKIVPIDLQQFGGKFICQKDSFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V +++ + G EGF+ QKL G G+AF+ +GG++ +K L+ GE++ VD C+
Sbjct: 124 KGVTVGIEFTKKLGTGLFGGEGFIMQKLEGDGMAFVHSGGTLARKELQSGELLRVDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
V T V+ I++ G I+ +FGG+ + A + GPGIV+IQSLPF RL+ RI
Sbjct: 184 VGFTKDVDYDIEFVGGIKNTIFGGEGVFFATLRGPGIVYIQSLPFTRLADRI 235
>gi|297528959|ref|YP_003670234.1| hypothetical protein GC56T3_0608 [Geobacillus sp. C56-T3]
gi|297252211|gb|ADI25657.1| protein of unknown function DUF124 [Geobacillus sp. C56-T3]
Length = 264
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/240 (38%), Positives = 135/240 (56%), Gaps = 10/240 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +++ G + Q +I L P E VVA G M M + ME ++ + G
Sbjct: 6 IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V N G V AAP +I+P+DL+ GG+L+CQ D+FLC+
Sbjct: 66 GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSELGGKLICQKDSFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAF+ AGG++ +++L+ GE + +D C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T VN I+Y G I+ A FGG+ L A VTGPG V++QSLPF RL+ R+ A S R
Sbjct: 186 MTKDVNYDIEYVGSIKTAFFGGEGLFLATVTGPGAVWVQSLPFSRLADRVLAAAPSAGER 245
>gi|27357088|gb|AAN86542.1| unknown [Eubacterium acidaminophilum]
Length = 268
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 84/235 (35%), Positives = 132/235 (56%), Gaps = 12/235 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I + + G + Q+ +I L P E V+A G+M +M ++ME + + G L GK +
Sbjct: 7 IDYTLHGDDLQLVEIELDPGESVIAEAGAMLYMENGIQMEAVLGDASGKGEGTGLMGKLL 66
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ + N G V AAP +I+P DL +GG ++CQ DAFLC+
Sbjct: 67 GAGKRVIMGESLFMTLFTNKGSQKQKVAFAAPYPGKIVPFDLNAYGGRIICQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
+ V +++ + G EGF+ ++L G G AF+ AGG++V+K L E++ VD C+
Sbjct: 127 KGISVQMEFQKKIGVGLFGGEGFIMERLEGDGFAFLHAGGAIVEKELSQAELLKVDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
VA TS V+ I++ G I+ A+FGG+ +V A + GPG V++QSLPF RL+ R+ A
Sbjct: 187 VAFTSGVDYDIQFMGDIKSAIFGGEGIVLATLKGPGKVWLQSLPFSRLADRMISA 241
>gi|15894815|ref|NP_348164.1| hypothetical protein CA_C1537 [Clostridium acetobutylicum ATCC 824]
gi|337736757|ref|YP_004636204.1| hypothetical protein SMB_G1562 [Clostridium acetobutylicum DSM
1731]
gi|384458264|ref|YP_005670684.1| hypothetical protein CEA_G1552 [Clostridium acetobutylicum EA 2018]
gi|15024487|gb|AAK79504.1|AE007664_1 Uncharacterized conserved protein [Clostridium acetobutylicum ATCC
824]
gi|325508953|gb|ADZ20589.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018]
gi|336290727|gb|AEI31861.1| hypothetical protein SMB_G1562 [Clostridium acetobutylicum DSM
1731]
Length = 260
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/255 (36%), Positives = 138/255 (54%), Gaps = 14/255 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
I ++I+G E Q+ ++ L P E VVA G+M +M S+EME I+ ++ G L
Sbjct: 5 IDYKIIGSEMQIVEVELDPYESVVAEAGAMMYMDSSIEMETIFGDGSDKGSSGGLVSKLM 64
Query: 79 --------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G+++ + N G V AAP +I+P+DL+ F ++CQ D FLC+
Sbjct: 65 GAGKRLVTGESLFMTIFTNRGVGKQKVAFAAPYPGKIIPMDLSSFNNYMICQKDCFLCAA 124
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ + G EGF+ QKL G GL F+ +GG++VQ+ L EVI VD C+
Sbjct: 125 KGVSIGVEFTRKIGVGLFGGEGFMLQKLEGDGLTFVHSGGTIVQRELLPKEVIKVDTGCL 184
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
VA T VN I+ I+ A+FGG+ L A + GPG V++QS+PF RL+ RI T+
Sbjct: 185 VAFTRDVNYDIEMVKGIKSAIFGGEGLFLATLKGPGTVWLQSIPFSRLAGRINS--TAAT 242
Query: 251 MRENPKFFVQIAIFF 265
+E I FF
Sbjct: 243 GKEEGSILGNIGNFF 257
>gi|448239158|ref|YP_007403216.1| DUF124 family protein [Geobacillus sp. GHH01]
gi|445208000|gb|AGE23465.1| DUF124 family protein [Geobacillus sp. GHH01]
Length = 264
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 92/240 (38%), Positives = 135/240 (56%), Gaps = 10/240 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +++ G + Q +I L P E VVA G M M + ME ++ + G
Sbjct: 6 IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V N G V AAP +I+P+DL+ GG+L+CQ D+FLC+
Sbjct: 66 GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSKLGGKLICQKDSFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAF+ AGG++ +++L+ GE + +D C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T VN I+Y G I+ A FGG+ L A VTGPG V++QSLPF RL+ R+ A S R
Sbjct: 186 MTKDVNYDIEYVGSIKTAFFGGEGLFLATVTGPGAVWVQSLPFSRLADRVLAAAPSAGGR 245
>gi|163787928|ref|ZP_02182374.1| hypothetical protein FBALC1_06103 [Flavobacteriales bacterium
ALC-1]
gi|159876248|gb|EDP70306.1| hypothetical protein FBALC1_06103 [Flavobacteriales bacterium
ALC-1]
Length = 274
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 132/242 (54%), Gaps = 10/242 (4%)
Query: 15 YQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVG 72
Y I ++I G E Q +I L P E V+A GS M ++M+ I+ + + G
Sbjct: 7 YNDRNSHEIDYRIYGEEMQYVEIELDPDEAVIAESGSFMMMDDGIKMDTIFGDGSQKDTG 66
Query: 73 MW--------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPD 124
+ L G+++ N V A+P +I+PIDL FGG+ +CQ D
Sbjct: 67 FLGKILGAGKRILTGESLFMTAFYNDLVGKRNVSFASPYPGKIIPIDLNEFGGKFICQKD 126
Query: 125 AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVIT 184
AFLC+ V V +++ + G EGF+ QKL G G+AF+ AGG++ +K LE GE +
Sbjct: 127 AFLCAAKGVSVGIEFSKKLGRGLFGGEGFIMQKLEGDGMAFVHAGGTMAEKTLEAGETLR 186
Query: 185 VDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
VD CIV T ++ I++ G I+ +VFGG+ L A + GPG V+IQSLPF RL+ R+
Sbjct: 187 VDTGCIVGFTQGIDYDIEFVGGIKNSVFGGEGLFFAKLQGPGKVYIQSLPFSRLAGRVLA 246
Query: 245 AV 246
+V
Sbjct: 247 SV 248
>gi|85819367|gb|EAQ40526.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
Length = 266
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 129/230 (56%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
I ++I G E Q ++ L PQE V+A GS M + ME I+ + G+ +FG
Sbjct: 6 IDYEIFGEEMQYVEVELDPQEGVIAEAGSFMMMEEGIRMETIFGDGSAKDTGVMGKIFGA 65
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ N V A+P +I+PIDL GG+ +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTAFYNDVVGKKKVSFASPYPGKIIPIDLGAIGGKFVCQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ + G EGF+ Q+L G G+AF+ AGG++++K L GE + VD CI+
Sbjct: 126 VSVGIEFSRKLGRGLFGGEGFIMQRLEGDGIAFVHAGGTMLKKELAPGERLKVDTGCIIG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
SSVN I++ G I+ +FGG+ L A +TGPG V+IQSLPF RL+ R+
Sbjct: 186 FDSSVNYDIEFVGGIKNTIFGGEGLFFATLTGPGTVYIQSLPFSRLANRV 235
>gi|305667437|ref|YP_003863724.1| hypothetical protein FB2170_14358 [Maribacter sp. HTCC2170]
gi|88709485|gb|EAR01718.1| hypothetical protein FB2170_14358 [Maribacter sp. HTCC2170]
Length = 265
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 90/234 (38%), Positives = 133/234 (56%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP----ENEV------G 72
I ++I G E Q +I L PQE VVA GS M ++M+ I+ +N V
Sbjct: 6 IDYEIFGEEMQYVEIELDPQEAVVAEAGSFMMMDTDIKMDTIFGDGSNQDNSVLGKIFSA 65
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ N G S V AAP +I+PIDL+ G+ +CQ DAFLC+
Sbjct: 66 GKRILTGESLFMTAFLNIGQSKRQVSFAAPYPGKIVPIDLSDKNGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ + G EGF+ QKL G G+AF+ AGG++ +K L GEV+ VD CIV
Sbjct: 126 VSVGIEFSKKLGRGLFGGEGFIMQKLEGDGMAFVHAGGTLAKKELAAGEVMKVDTGCIVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+ +V+ I++ G I+ VFGG+ L A + GPG V++QSLPF RL+ R+ ++
Sbjct: 186 FSQNVDYDIEFVGGIKNTVFGGEGLFFATLKGPGTVYVQSLPFSRLAGRVLASI 239
>gi|261418043|ref|YP_003251725.1| hypothetical protein GYMC61_0564 [Geobacillus sp. Y412MC61]
gi|319767998|ref|YP_004133499.1| hypothetical protein [Geobacillus sp. Y412MC52]
gi|261374500|gb|ACX77243.1| protein of unknown function DUF124 [Geobacillus sp. Y412MC61]
gi|317112864|gb|ADU95356.1| protein of unknown function DUF124 [Geobacillus sp. Y412MC52]
Length = 264
Score = 163 bits (412), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 92/240 (38%), Positives = 135/240 (56%), Gaps = 10/240 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +++ G + Q +I L P E VVA G M M + ME ++ + G
Sbjct: 6 IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V N G V AAP +I+P+DL+ GG+L+CQ D+FLC+
Sbjct: 66 GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSELGGKLICQKDSFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAF+ AGG++ +++L+ GE + +D C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T VN I+Y G I+ A FGG+ L A VTGPG V++QSLPF RL+ R+ A S R
Sbjct: 186 MTKDVNYDIEYVGNIKTAFFGGEGLFLATVTGPGAVWVQSLPFSRLADRVLAAAPSAGGR 245
>gi|56421414|ref|YP_148732.1| hypothetical protein GK2879 [Geobacillus kaustophilus HTA426]
gi|56381256|dbj|BAD77164.1| hypothetical conserved protein [Geobacillus kaustophilus HTA426]
Length = 264
Score = 163 bits (412), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 92/240 (38%), Positives = 135/240 (56%), Gaps = 10/240 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +++ G + Q +I L P E VVA G M M + ME ++ + G
Sbjct: 6 IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V N G V AAP +I+P+DL+ GG+L+CQ D+FLC+
Sbjct: 66 GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSELGGKLICQKDSFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAF+ AGG++ +++L+ GE + +D C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T VN I+Y G I+ A FGG+ L A VTGPG V++QSLPF RL+ R+ A S R
Sbjct: 186 MTKDVNYDIEYVGNIKTAFFGGEGLFLATVTGPGAVWVQSLPFSRLADRVLAAAPSAGGR 245
>gi|365960391|ref|YP_004941958.1| hypothetical protein FCOL_06720 [Flavobacterium columnare ATCC
49512]
gi|365737072|gb|AEW86165.1| hypothetical protein FCOL_06720 [Flavobacterium columnare ATCC
49512]
Length = 267
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 90/250 (36%), Positives = 139/250 (55%), Gaps = 15/250 (6%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN---EVGMWQ 75
Q I + I G E Q +I L PQE VVA GS M ++ME I+ + + G+
Sbjct: 2 QAHEIDYHIYGEEMQYVEIELDPQEIVVAEAGSFMMMENGIKMETIFGDGSNYQDSGILG 61
Query: 76 WLF--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
L G+++ +N S + A+P +I+ IDL +GG+ +CQ D+FL
Sbjct: 62 KLLSAGKRVLTGESLFMTAYQNQDFSKRKISFASPYPGKIIAIDLMKYGGKFICQKDSFL 121
Query: 128 CSVNDVKVSNTLDQRV-RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
C+ V V Q++ R + GG EGF+ QK+ G G+AF+ +GG++ K L+VGE++ +D
Sbjct: 122 CAAKGVSVGIEFSQKLGRGLFGG-EGFIMQKIEGDGMAFVHSGGTLACKELQVGEILKID 180
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI--AR 244
CIV T ++ I++ G I+ +FGG+ + A + GPG+V+IQSLPF RL+ RI A
Sbjct: 181 TGCIVGFTKDIDYDIEFIGGIKNTLFGGEGMFYATLKGPGVVYIQSLPFSRLAGRIIAAS 240
Query: 245 AVTSPNMREN 254
+ N +E
Sbjct: 241 TIGGQNNKEE 250
>gi|149372109|ref|ZP_01891379.1| hypothetical protein SCB49_00275 [unidentified eubacterium SCB49]
gi|149354876|gb|EDM43438.1| hypothetical protein SCB49_00275 [unidentified eubacterium SCB49]
Length = 266
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/242 (35%), Positives = 129/242 (53%), Gaps = 10/242 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
I + I G E Q ++ L P E V+A GS M ++ME I+ + + GM +FG
Sbjct: 6 IDYTIFGEEMQYVEVELDPNEGVIAEAGSFMMMDDGIKMETIFGDGSKKDSGMMGKIFGA 65
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ N V A+P +I+P+DL+ FG + +CQ AFLC+
Sbjct: 66 GKRLLTGESLFMTAFYNVQSGKKHVSFASPYPGKIIPLDLSKFGNKFICQKSAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ + G EGF+ QKL G G+AF+ AGG+ K L GE + VD CI+
Sbjct: 126 VAVGVEFSRKLGRGLFGGEGFIMQKLEGDGMAFVHAGGTTATKELAPGEKLKVDTGCIIG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
SSVN I++ G I+ VFGG+ L A +TGPG V++QSLPF RL+ R+ A +
Sbjct: 186 FDSSVNYDIEFVGGIKNTVFGGEGLFFATLTGPGTVYVQSLPFSRLANRVWAAAPKSGGK 245
Query: 253 EN 254
+
Sbjct: 246 DK 247
>gi|406833583|ref|ZP_11093177.1| hypothetical protein SpalD1_18131 [Schlesneria paludicola DSM
18645]
Length = 334
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 89/257 (34%), Positives = 143/257 (55%), Gaps = 11/257 (4%)
Query: 9 PFQPYVYQSPQDA-VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP 67
P P +P+ A I ++I G E Q + L PQE V+A PG+M FM+ ++ME +
Sbjct: 62 PHSPPRVSAPRSADEIDYEIFGEEMQYVVVELDPQEMVIAEPGAMMFMTAGIKMETRFGD 121
Query: 68 ENE--VGMWQWLF--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG 117
++ G+W L G+++ G G V A+P +IL +DLA GG
Sbjct: 122 PSKPSEGLWGKLVTAGRRAITGESLFVTTFTQAGNGKGQVAFASPYPGKILAVDLAQLGG 181
Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
EL+CQ D+F+C+ V+++ +++ + G EGF+ Q+L G G+ I AGG ++++ L
Sbjct: 182 ELICQKDSFVCAARGVQLAVAFQKKIGVGLFGGEGFVMQRLLGDGVVMIHAGGMLMRRTL 241
Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
+ GE + +D C+VA+ SV I++ G + +FGG+ L A VTGPG V++QSLPF R
Sbjct: 242 KAGETLRLDTGCLVALQPSVTYDIQFVGGFKNTLFGGEGLFMATVTGPGEVWLQSLPFSR 301
Query: 238 LSQRIARAVTSPNMREN 254
L+ R+ A ++
Sbjct: 302 LAGRVLAAGVGATRKDE 318
>gi|295134252|ref|YP_003584928.1| tryptophan RNA-binding attenuator like protein [Zunongwangia
profunda SM-A87]
gi|294982267|gb|ADF52732.1| tryptophan RNA-binding attenuator like protein [Zunongwangia
profunda SM-A87]
Length = 266
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/234 (37%), Positives = 135/234 (57%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENE------VGM 73
I ++I G E Q ++ L PQE +A G+ M ++M+ I+ +NE +G
Sbjct: 6 IDYEIFGEEMQYVELELDPQEAAIAEAGNFMMMDNGIKMDTIFGDGSKQNEGFLGKVLGA 65
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + N G + A+P +I+P+DL FGG+ +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTIFSNIGQGKKKISFASPYPGKIIPVDLTQFGGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ G EGF+ Q L G G+AF+ AGG++ +K L VGEV+ VD CI+
Sbjct: 126 VSIGIEFSRKLGRGFFGGEGFIMQSLEGDGMAFVHAGGTMAKKELAVGEVLKVDTGCIIG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T +VN I++ G IR FGG+ L A +TGPG+V++QSLPF RL+ R+ +A
Sbjct: 186 FTQTVNYDIEFIGGIRNTFFGGEGLFFATLTGPGVVYVQSLPFSRLANRVLQAA 239
>gi|283780460|ref|YP_003371215.1| hypothetical protein Psta_2686 [Pirellula staleyi DSM 6068]
gi|283438913|gb|ADB17355.1| protein of unknown function DUF124 [Pirellula staleyi DSM 6068]
Length = 288
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 86/260 (33%), Positives = 141/260 (54%), Gaps = 17/260 (6%)
Query: 10 FQPYVY---QSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
FQP V +S I ++I G E Q ++ L P E V+A G M +M+ ++ME ++
Sbjct: 12 FQPEVIGPGKSRNADEIDYKIFGEEMQYVEVTLDPSEMVIAEAGGMMYMTSGIKMETVFG 71
Query: 67 PENEVGMWQWLFGKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAM 114
+ Q FGK +++ N V A+P RI+P+ L
Sbjct: 72 DPGQ--KQQGFFGKLMSAGKRMVTGESLFMTTFSNASTQREVVAFASPYPGRIIPMHLDQ 129
Query: 115 FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQ 174
GGEL+CQ D+FLC +++ +++ + G EGF+ Q+LTG G+ + AGG++++
Sbjct: 130 LGGELICQKDSFLCGARGIQIGIAFQKKIGVALFGGEGFIMQRLTGDGIVLMHAGGTLME 189
Query: 175 KNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
+ L GE + VD C+VA+ +VN I + G I+ ++FGG+ L A +TGPG +++QSLP
Sbjct: 190 RTLSAGETLRVDTGCLVALQPTVNYDIGFVGGIKNSLFGGEGLFLATLTGPGKIWLQSLP 249
Query: 235 FHRLSQRIARAVTSPNMREN 254
F RL+ RI A + + ++
Sbjct: 250 FSRLAGRIHAAGSGTSGKDE 269
>gi|157273311|gb|ABV27210.1| conserved hypothetical protein [Candidatus Chloracidobacterium
thermophilum]
Length = 252
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 95/241 (39%), Positives = 136/241 (56%), Gaps = 8/241 (3%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
PQ VI ++I+G + Q I L P E+V+A G M +M+ +EM+ ++ G + L
Sbjct: 2 PQSDVIDYRIVGDDLQAVIITLDPNEQVIAEAGVMLYMTAGIEMQTTMTTDSSKGFFGNL 61
Query: 78 ---FGKTITS-----VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
G+ + N G + V AAP +I+PIDLA FGGE+LCQ D+FLC+
Sbjct: 62 AKAVGRIFSGAGFFITTFTNYGYARADVAFAAPYPGKIIPIDLAKFGGEVLCQRDSFLCA 121
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
+S QR+ G EGF+ Q+L G GLAFI AGG+ + NL GE + VD C
Sbjct: 122 ARGTDLSVGFQQRLGAGFFGGEGFILQRLRGDGLAFIHAGGATMTYNLRPGETLRVDTGC 181
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
+VA SV I++ G + A+FGG+ + A +TGPG V +Q+LPF RL RI+ A+ P
Sbjct: 182 VVAFQPSVQFNIQFVGGFKNALFGGEGVFLASLTGPGQVILQTLPFSRLVGRISAALPRP 241
Query: 250 N 250
+
Sbjct: 242 S 242
>gi|407769274|ref|ZP_11116650.1| hypothetical protein TH3_07322 [Thalassospira xiamenensis M-5 = DSM
17429]
gi|407287601|gb|EKF13081.1| hypothetical protein TH3_07322 [Thalassospira xiamenensis M-5 = DSM
17429]
Length = 235
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/227 (38%), Positives = 137/227 (60%), Gaps = 4/227 (1%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI ++I G E Q+ +I L P E V A G+M +MS + M+ G + L G+
Sbjct: 4 VIDYKIHGEEMQLVEIELDPGEGVRAEAGAMMYMSDGIAMQTGTGGGIFSGFKRMLTGEG 63
Query: 82 --ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
ITS V + G G V AAP +++P++L GGE++CQ DAFLC+ + + +
Sbjct: 64 FFITSFV--HEGSGKGHVAFAAPYPGKVVPMNLPELGGEIICQKDAFLCAASGIDIDIAF 121
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ + G EGF+ Q+L G GLAF+ AGG++++K+L+ G+ + VD +VA+T S++
Sbjct: 122 SKKLGAGLFGGEGFILQRLRGDGLAFVHAGGTIIRKDLQAGQTLRVDTGALVAMTPSIDY 181
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
IK+ G A+FGG+ L + GPG V++Q+LPF RL+ RIA A+
Sbjct: 182 SIKFIGGFTNALFGGEGLFVTTLEGPGTVWLQTLPFARLADRIAAAL 228
>gi|260063289|ref|YP_003196369.1| hypothetical protein RB2501_00726 [Robiginitalea biformata
HTCC2501]
gi|88783383|gb|EAR14555.1| hypothetical protein RB2501_00726 [Robiginitalea biformata
HTCC2501]
Length = 271
Score = 162 bits (410), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 130/237 (54%), Gaps = 10/237 (4%)
Query: 20 DAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWL 77
+ V+ ++I G E Q + L P E V+A GS M + ME I+ + + G+ +
Sbjct: 8 NHVVDYEIFGEEMQYVSVELDPNEGVIAEAGSFMMMDPDIRMETIFGDGSKKDTGVLGKI 67
Query: 78 F--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
F G+ + N G V A+P +I+PIDL GG +CQ DAFLC+
Sbjct: 68 FSAGKRVLTGEGLFMTAFYNGGQGKKKVSFASPYPGKIVPIDLKEKGGRFICQKDAFLCA 127
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V V +R+ + G EGF+ QKL G G+AF+ AGG++ ++ L+ GEV+ VD C
Sbjct: 128 AKGVSVGIEFSRRLGRGLFGGEGFIMQKLEGDGMAFVHAGGTLARRTLQAGEVLRVDTGC 187
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
IV + +V+ I++ G I+ +FGG+ L A + GPG V+IQSLPF RL+ R+ A
Sbjct: 188 IVGFSQTVDYDIEFVGGIKNTLFGGEGLFFATLRGPGTVYIQSLPFSRLAGRVLAAA 244
>gi|298674841|ref|YP_003726591.1| hypothetical protein Metev_0899 [Methanohalobium evestigatum
Z-7303]
gi|298287829|gb|ADI73795.1| protein of unknown function DUF124 [Methanohalobium evestigatum
Z-7303]
Length = 233
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/228 (39%), Positives = 137/228 (60%), Gaps = 4/228 (1%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+I + I+G + Q+ ++ L P+E V A G+M +M ++M+ G+ + L G++
Sbjct: 1 MIDYNIIGDDMQIVEVELDPEESVRAEVGAMAYMGPGIKMQTSTGGGLFSGLKRMLTGES 60
Query: 82 --ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
ITS V G GFV AP +I+PIDLA F G +LCQ DAFLC+ + + L
Sbjct: 61 FFITSFVYE--GKDKGFVAFGAPYPGKIIPIDLAKFDGNILCQKDAFLCAAKGIDIDVAL 118
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+R+ + G EGF+ QKL+G G+AF+ AGG++++K L GE + VD C+ A T S++
Sbjct: 119 TKRLGSGFFGGEGFILQKLSGDGMAFVHAGGTIIKKELADGEHLRVDTGCLAAFTDSIDY 178
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
I++ G + A+FGG+ + A + GPG V++QSLP RL+ RI A T
Sbjct: 179 NIQFIGGFKNALFGGEGVFLASLKGPGTVYLQSLPLSRLADRILAAST 226
>gi|320354387|ref|YP_004195726.1| hypothetical protein Despr_2291 [Desulfobulbus propionicus DSM
2032]
gi|320122889|gb|ADW18435.1| protein of unknown function DUF124 [Desulfobulbus propionicus DSM
2032]
Length = 331
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/244 (34%), Positives = 131/244 (53%), Gaps = 12/244 (4%)
Query: 15 YQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW 74
+ + I + I G E Q +I L QE V++ G+M +M + ME ++ +
Sbjct: 65 FSGKEAHEIDYTIFGNEMQFVEIELDSQESVISEAGAMMYMHDQVVMETVFGDGSRNSGS 124
Query: 75 QWLFGKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQ 122
F K + + + G G V AAP +I+P+DL + G ++CQ
Sbjct: 125 GTFFDKMLGAGKRLITGEGLFITMFTYTGAGKGKVAFAAPYPGKIIPLDLTKYNGRIICQ 184
Query: 123 PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEV 182
DAFLC+ V V +++ + G EGF+ Q+L G GL F+ AGG++V+K L GE
Sbjct: 185 KDAFLCAAKGVSVGIAFQRKIGVALFGGEGFIMQQLDGDGLVFVHAGGTIVEKELAAGET 244
Query: 183 ITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ VD C+VA+T+SVN I++ G ++ A+FGG+ A + GPG V++QSLPF RL+ RI
Sbjct: 245 LRVDTGCLVALTASVNYDIEFVGNVKSAIFGGEGFFFANLRGPGHVWLQSLPFSRLAGRI 304
Query: 243 ARAV 246
A
Sbjct: 305 WEAA 308
>gi|334119417|ref|ZP_08493503.1| protein of unknown function DUF124 [Microcoleus vaginatus FGP-2]
gi|333458205|gb|EGK86824.1| protein of unknown function DUF124 [Microcoleus vaginatus FGP-2]
Length = 240
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 130/221 (58%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+I ++I G + Q+ +I L P+E V A G+M +M G ++M+ G + L G
Sbjct: 4 IIDYKIYGDDLQLIEIELDPKEGVRAEAGTMTYMEGDIQMQTSTGGGLFQGFKRMLTGAG 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G V AAP +++P+DL GG+ LCQ D+FLC+ N +++ +
Sbjct: 64 FFITTFVNAGNRKARVAFAAPYPGKVIPLDLGQLGGKFLCQKDSFLCAANGIEIEVAFTK 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
R+ G EGF+ QKL G GLAF+ AGG++V+KNL VGEV+ D C+VA +V+ I
Sbjct: 124 RLGAGFFGGEGFILQKLQGDGLAFVHAGGTIVEKNLGVGEVLRADTGCLVAFAPTVDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++ G + A+FGG+ L +TGPG V++QSLP +L++RI
Sbjct: 184 QFVGGFKNALFGGEGLFLVKLTGPGKVYLQSLPLSKLAERI 224
>gi|347753828|ref|YP_004861392.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
B]
gi|347586346|gb|AEP10876.1| conserved hypothetical protein TIGR00266 [Candidatus
Chloracidobacterium thermophilum B]
Length = 254
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 132/234 (56%), Gaps = 8/234 (3%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
PQ VI ++I+G + Q I L P E+V+A G M +M+ +EM+ ++ G + L
Sbjct: 2 PQSDVIDYRIVGDDLQGVIITLDPNEQVLAEAGVMLYMTAGIEMQTTMATDSSKGFFGNL 61
Query: 78 ---FGKTITS-----VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
G+ + N G + V AAP +I+PIDLA FGGE+LCQ D+FLC+
Sbjct: 62 AKAVGRVFSGAGFFITTFTNYGHARADVAFAAPYPGKIIPIDLAKFGGEVLCQKDSFLCA 121
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
+S QR+ G EGF+ Q+L G GLAFI AGG+ + NL GE + VD C
Sbjct: 122 ARGTDLSVGFQQRLGAGFFGGEGFILQRLRGDGLAFIHAGGATMTYNLRPGETLRVDTGC 181
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
+VA SV I++ G + A+FGG+ L A +TGPG V +Q+LPF RL RIA
Sbjct: 182 VVAFQPSVQFNIQFVGGFKNALFGGEGLFLASLTGPGQVILQTLPFSRLVGRIA 235
>gi|325287403|ref|YP_004263193.1| hypothetical protein Celly_2505 [Cellulophaga lytica DSM 7489]
gi|324322857|gb|ADY30322.1| protein of unknown function DUF124 [Cellulophaga lytica DSM 7489]
Length = 265
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/234 (37%), Positives = 130/234 (55%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM--------- 73
I + + G E Q +I L PQE VVA GS M ++M+ I+ +
Sbjct: 6 IDYHVYGEEMQYVEIELDPQEAVVAEAGSFMMMDTDIKMDTIFGDGSNQDTSVLGKIFSA 65
Query: 74 -WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ N G + V A+P +ILPIDL+ G+ +CQ DAFLC+
Sbjct: 66 GKRMLTGESLFMTAFLNIGANKRKVSFASPYPGKILPIDLSEKNGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ + G EGF+ QKL G G+AF+ AGG++ +K L GEV+ VD CIV
Sbjct: 126 VSVGIEFSRKLGRGLFGGEGFIMQKLEGDGMAFVHAGGTMAKKVLAAGEVLKVDTGCIVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+ ++N +++ G I+ VFGG+ L A + GPG V+IQSLPF RL+ R+ +V
Sbjct: 186 FSQTINYDVEFVGGIKNTVFGGEGLFFATLRGPGTVYIQSLPFSRLAGRVLASV 239
>gi|376262370|ref|YP_005149090.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373946364|gb|AEY67285.1| TIGR00266 family protein [Clostridium sp. BNL1100]
Length = 256
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 87/224 (38%), Positives = 130/224 (58%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI ++I G + Q+ +I L P E V A G+M +M + M+ G + L G++
Sbjct: 4 VIDYKIYGDDMQIVEIELDPSEGVRAEAGAMMYMEDGIMMQTSTDGGAFKGFKRMLTGES 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G S V +AP +I+P+DL GG LCQ D+FLC+ + V + +
Sbjct: 64 FFITTFYNAGASKQHVAFSAPYPGKIIPLDLTKLGGNFLCQKDSFLCAASGVDIEIAFTR 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ G EGF+ Q+L G G+AF+ +GG+++QKNL GE + VD C+VA + SV+ I
Sbjct: 124 KLGAGFFGGEGFILQRLIGDGMAFVHSGGTIIQKNLLPGETLRVDTGCLVAFSPSVDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+ G + A+FGG+ L A +TGPGIVF+QSLP RLS R++ A
Sbjct: 184 QSVGGFKNALFGGEGLFLAKLTGPGIVFLQSLPLARLSDRLSSA 227
>gi|359462008|ref|ZP_09250571.1| hypothetical protein ACCM5_24999 [Acaryochloris sp. CCMEE 5410]
Length = 337
Score = 160 bits (406), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 136/237 (57%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY-----------IPENEV 71
I +QI G + Q ++ L P E VA G+M + ++ME I+ + V
Sbjct: 74 IDYQIFGEDMQFVEVELDPGESAVAEAGAMMYKDAVVQMETIFGDGSAQASQGGFMDKLV 133
Query: 72 GMWQWLF-GKTITSVVLRNPGP-SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
G + L G+++ + V + GP V AAP I+ L+ +GG L+CQ D+FLC+
Sbjct: 134 GAGKRLVTGESLFTTVFTHGGPQGKAHVAFAAPYPGTIIAAKLSDYGGRLICQKDSFLCA 193
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V++ +++ + G EGF+ QKL G GLAF+ AGG+VVQ++L+ GE + VD C
Sbjct: 194 AKGVQIGIHFQKKILTGLFGGEGFIMQKLDGTGLAFLHAGGTVVQRDLKAGETLHVDTGC 253
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA+T SV+ I+ G ++ +FGG+ L A +TGPG V++QSLPF R++ R+ A
Sbjct: 254 LVALTDSVHYDIQQAGNVKSIIFGGEGLFFATLTGPGHVWLQSLPFSRMAGRMLAAA 310
>gi|365158290|ref|ZP_09354519.1| TIGR00266 family protein [Bacillus smithii 7_3_47FAA]
gi|363620953|gb|EHL72197.1| TIGR00266 family protein [Bacillus smithii 7_3_47FAA]
Length = 264
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 135/230 (58%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMW------ 74
I +++ G + Q +I L PQE VVA G M M +EME ++ E + G++
Sbjct: 6 IDYELHGDDMQCVEIELDPQESVVAEAGGMMMMEDGIEMETVFGDGTEQKSGIFGKLVGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V N V AAP +I+P+DL F G+++CQ DAFLC+
Sbjct: 66 GRRLLTGESLFMTVFTNQSSDKRRVSFAAPYPGKIIPLDLTQFNGKVICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ G EGF+ QKL G GLAF+ AGG++++K+L GE++ VD C+VA
Sbjct: 126 VSIGIEFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIIKKDLSPGELLRVDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+T V+ I+Y G ++ A FGG+ L A + GPG V++Q+LPF RL+ R+
Sbjct: 186 LTQDVDYNIEYVGKVKTAFFGGEGLFFATLRGPGTVWVQTLPFSRLADRV 235
>gi|414872763|tpg|DAA51320.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
Length = 149
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 70/114 (61%), Positives = 91/114 (79%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
MAAPFFSTPFQPYVYQS Q +V FQI GG+ QV Q+MLK QEK+ A+PG+MC+MSG+M+
Sbjct: 1 MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVLQVMLKSQEKLTAKPGAMCYMSGNMQ 60
Query: 61 MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAM 114
M+N Y+PEN+ G+WQW+FGK ++S + N G DG+VGIAAP RILP+ + +
Sbjct: 61 MDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFPGRILPVSILL 114
>gi|158334252|ref|YP_001515424.1| hypothetical protein AM1_1071 [Acaryochloris marina MBIC11017]
gi|158304493|gb|ABW26110.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 337
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 87/237 (36%), Positives = 136/237 (57%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY-----------IPENEV 71
I +QI G + Q ++ L P E VA G+M + ++ME I+ + V
Sbjct: 74 IDYQIYGEDMQFVEVELDPGESAVAEAGAMMYKDAVVQMETIFGDGSAQASQGGFMDKLV 133
Query: 72 GMWQWLF-GKTITSVVLRNPGP-SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
G + L G+++ + V + GP V AAP I+ L+ +GG L+CQ D+FLC+
Sbjct: 134 GAGKRLVTGESLFTTVFTHGGPQGKAHVAFAAPYPGTIIAAKLSDYGGRLICQKDSFLCA 193
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V++ +++ + G EGF+ QKL G GLAF+ AGG+VVQ++L+ GE + VD C
Sbjct: 194 AKGVQIGIHFQKKILTGLFGGEGFIMQKLDGTGLAFLHAGGTVVQRDLKAGETLHVDTGC 253
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA+T SV+ I+ G ++ +FGG+ L A +TGPG V++QSLPF R++ R+ A
Sbjct: 254 LVALTDSVHYDIQQAGNVKSIIFGGEGLFFATLTGPGHVWLQSLPFSRMAGRMLAAA 310
>gi|150016382|ref|YP_001308636.1| hypothetical protein Cbei_1504 [Clostridium beijerinckii NCIMB
8052]
gi|149902847|gb|ABR33680.1| protein of unknown function DUF124 [Clostridium beijerinckii NCIMB
8052]
Length = 265
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 92/240 (38%), Positives = 142/240 (59%), Gaps = 16/240 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW------ 76
I ++I G + Q +I L P E V+A G+M M +++ME I+ ++ +
Sbjct: 6 IDYKIKGDDIQYVEIELDPTETVIAEAGAMMMMDPNIQMETIFGDGSKQSGSSFMNKIFS 65
Query: 77 -----LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
L G+++ N G V AAP +I+P+DL++ GG+L+CQ DAFLC+
Sbjct: 66 AGKRVLTGESLFMTAFTNSGTLKEKVYFAAPYPGKIIPMDLSLLGGKLICQKDAFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V V +D R + +G G EGF+ +KL G GLAFI A G++V++NL GE + +D C
Sbjct: 126 GVSVG--IDFRKKLSVGFFGGEGFILEKLEGDGLAFIHACGAIVERNLLPGETLKIDTGC 183
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTS 248
+VA+T V+ I++ G I+ +FGG+ + A VTGPG +++QSLPF RL++R+ A A TS
Sbjct: 184 LVAMTKDVHYDIQFIGGIKNTLFGGEGVFFATVTGPGKIWVQSLPFSRLAERVFAAAPTS 243
>gi|359412227|ref|ZP_09204692.1| protein of unknown function DUF124 [Clostridium sp. DL-VIII]
gi|357171111|gb|EHI99285.1| protein of unknown function DUF124 [Clostridium sp. DL-VIII]
Length = 265
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 92/237 (38%), Positives = 139/237 (58%), Gaps = 15/237 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW------ 76
I ++I G + Q +I L P E V+A G+M M +++ME I+ ++ G +
Sbjct: 6 IDYKIYGDDMQYVEIELDPMETVIAEAGAMMMMDPNIQMETIFGDGSKQGNNGFMDKIFS 65
Query: 77 -----LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
L G+++ N G V AAP +ILP+DL+ G+L+CQ DAFLC+
Sbjct: 66 AGKRVLTGESLFMTAFTNSGALKEKVFFAAPYPGKILPMDLSRLEGKLICQKDAFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V V +D R + +G G EGF+ +KL G GLAFI A G++V++NL GE + +D C
Sbjct: 126 GVSVG--IDFRKKLSVGFFGGEGFILEKLEGDGLAFIHACGAIVERNLMPGETLKIDTGC 183
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA+T +V+ I++ G I+ +FGG+ + A VTGPG V+IQSLPF RL++R+ A
Sbjct: 184 LVAMTRNVHYDIQFIGGIKNTLFGGEGVFFATVTGPGKVWIQSLPFSRLAERVFAAA 240
>gi|336173044|ref|YP_004580182.1| hypothetical protein [Lacinutrix sp. 5H-3-7-4]
gi|334727616|gb|AEH01754.1| protein of unknown function DUF124 [Lacinutrix sp. 5H-3-7-4]
Length = 265
Score = 160 bits (405), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 130/230 (56%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW------ 74
I ++I G E Q +I L PQE V+A GS M ++ME I+ + + G
Sbjct: 6 IDYRIYGEEMQYVEIELDPQEAVIAESGSFMMMDDGIKMETIFGDGSQKDKGFLNKVLGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ N V A+P +I+PI+LA GG+ +CQ DAFLC+
Sbjct: 66 GRRILTGESLFITAFYNTLSGKRNVSFASPYPGKIIPINLAECGGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ +KL G G+AF+ AGG+ +K L+ GE + VD CI+
Sbjct: 126 VSIGIEFSKKLGRGLFGGEGFIMEKLEGDGMAFVHAGGTTAKKELKAGETLRVDTGCIIG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T +VN I++ G I+ ++FGG+ L A + GPG V+IQSLPF RL+ R+
Sbjct: 186 FTHNVNYDIEFIGGIKNSIFGGEGLFFAKLEGPGTVYIQSLPFSRLANRV 235
>gi|312109828|ref|YP_003988144.1| hypothetical protein GY4MC1_0716 [Geobacillus sp. Y4.1MC1]
gi|336234179|ref|YP_004586795.1| hypothetical protein Geoth_0715 [Geobacillus thermoglucosidasius
C56-YS93]
gi|423718877|ref|ZP_17693059.1| hypothetical protein GT20_0625 [Geobacillus thermoglucosidans
TNO-09.020]
gi|311214929|gb|ADP73533.1| protein of unknown function DUF124 [Geobacillus sp. Y4.1MC1]
gi|335361034|gb|AEH46714.1| protein of unknown function DUF124 [Geobacillus thermoglucosidasius
C56-YS93]
gi|383367780|gb|EID45055.1| hypothetical protein GT20_0625 [Geobacillus thermoglucosidans
TNO-09.020]
Length = 260
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 138/242 (57%), Gaps = 12/242 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VGM 73
I +++ G + Q +I L PQE V+A G M M + ME I+ ++ VG
Sbjct: 6 IDYKLYGDDMQFVEIELDPQESVIAEAGGMMMMEDGITMETIFGDGSDSRKGFFNKLVGA 65
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V N V AAP +I+P+DL+ GG+++CQ D+FLC+
Sbjct: 66 GKRLLTGESLFMTVFTNNSAEKRRVSFAAPYPGKIIPVDLSELGGKVICQKDSFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAF+ AGG++ Q+ L+ GE + +D C+VA
Sbjct: 126 VSVGIDFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIYQRELQPGEKLRIDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T V+ I+Y G I+ A F G+ L A +TGPG V++QSLPF RL+ RI A +P+ R
Sbjct: 186 MTKEVDYDIEYVGKIKTAFFSGEGLFFATLTGPGTVWVQSLPFSRLADRIIAA--APSSR 243
Query: 253 EN 254
E
Sbjct: 244 EE 245
>gi|326200864|ref|ZP_08190736.1| protein of unknown function DUF124 [Clostridium papyrosolvens DSM
2782]
gi|325988432|gb|EGD49256.1| protein of unknown function DUF124 [Clostridium papyrosolvens DSM
2782]
Length = 256
Score = 159 bits (403), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 86/224 (38%), Positives = 130/224 (58%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI ++I G + Q+ +I L P E V A G+M +M + M+ G + L G++
Sbjct: 4 VIDYKIYGDDMQIVEIELDPSEGVRAEAGAMMYMEDGIMMQTSTDGGAFKGFKRMLTGES 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G S V +AP +I+P+DL GG LCQ D+FLC+ + V + +
Sbjct: 64 FFITTFYNAGTSKQHVAFSAPYPGKIVPLDLTKLGGNFLCQKDSFLCAASGVDIEVAFTR 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ G EGF+ Q+L G G+AF+ +GG+++QKNL GE + VD C+VA + SV+ I
Sbjct: 124 KLGAGFFGGEGFILQRLVGDGMAFVHSGGTIIQKNLLPGETLRVDTGCLVAFSPSVDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+ G + A+FGG+ L A +TGPGIVF+QSLP RLS R++ +
Sbjct: 184 QSVGGFKNALFGGEGLFLAKMTGPGIVFLQSLPLARLSDRLSSS 227
>gi|332292924|ref|YP_004431533.1| hypothetical protein Krodi_2287 [Krokinobacter sp. 4H-3-7-5]
gi|332171010|gb|AEE20265.1| protein of unknown function DUF124 [Krokinobacter sp. 4H-3-7-5]
Length = 266
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
I ++I G E Q ++ L PQE V+A GS M + ME I+ + G+ +FG
Sbjct: 6 IDYEIFGEEMQYVEVELDPQEGVIAEAGSFMMMEDGIRMETIFGDGSAKDTGLMGKIFGA 65
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ N V A+P +I+PIDL GG+ +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTAFYNDVIGKKKVSFASPYPGKIIPIDLDKIGGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ + G EGF+ Q+L G G+AF+ AGG+ ++K L GE + VD CI+
Sbjct: 126 VSVGIEFSRKLGRGLFGGEGFIMQRLEGDGIAFVHAGGTTLKKELAPGERLKVDTGCIIG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
S+V I++ G I+ +FGG+ L A ++GPG V+IQSLPF RL+ R+
Sbjct: 186 FDSTVTYDIEFVGGIKNTIFGGEGLFFATLSGPGTVYIQSLPFSRLANRV 235
>gi|138896414|ref|YP_001126867.1| hypothetical protein GTNG_2777 [Geobacillus thermodenitrificans
NG80-2]
gi|134267927|gb|ABO68122.1| Conserved hypothetical protein [Geobacillus thermodenitrificans
NG80-2]
Length = 265
Score = 159 bits (401), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 90/240 (37%), Positives = 134/240 (55%), Gaps = 10/240 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +Q+ G + Q +I L P+E V+A G M M + ME ++ + G
Sbjct: 7 IDYQLYGDDMQFVEIELDPRESVIAEAGGMMMMEDGIVMETVFGDGSSSGKGLLGRLVGA 66
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V N G V AAP +I+P++L GG+L+CQ D+FLC+
Sbjct: 67 GKRLLTGESLFMTVFTNQGTGKRRVAFAAPYPGKIIPVNLQELGGKLICQKDSFLCAAKG 126
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAF+ AGG + ++ L+ GE + +D C+VA
Sbjct: 127 VSVGIEFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGMIHRRELQPGEKLRIDTGCLVA 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T VN I+Y G I+ A FGG+ L A +TGPG+V++QSLPF RL+ R+ A S R
Sbjct: 187 MTKEVNYDIEYVGNIKTAFFGGEGLFFATLTGPGVVWVQSLPFSRLADRVIAAAPSSGGR 246
>gi|220930001|ref|YP_002506910.1| hypothetical protein Ccel_2602 [Clostridium cellulolyticum H10]
gi|220000329|gb|ACL76930.1| protein of unknown function DUF124 [Clostridium cellulolyticum H10]
Length = 256
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 87/224 (38%), Positives = 129/224 (57%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI ++I G + Q+ +I L P E V A G+M +M + M+ G + L G++
Sbjct: 4 VIDYKIYGDDMQIVEIELDPSEGVRAEAGAMMYMEDDIMMQTSTDGGAFKGFKRMLTGES 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G S V +AP +I+P+DL+ GG LCQ D+FLC+ + V + +
Sbjct: 64 FFITTFYNAGASKQHVAFSAPYPGKIIPLDLSRLGGNFLCQKDSFLCAASGVDIEIAFTR 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ G EGF+ Q+L G GLAF+ +GG+++QKNL GE + VD C+VA + SV+ I
Sbjct: 124 KLGAGFFGGEGFILQRLVGDGLAFVHSGGTIIQKNLLPGETLRVDTGCLVAFSPSVDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+ + A+FGG+ L A +TGPGIVF+QSLP RLS R+ A
Sbjct: 184 ESVRGFKNALFGGEGLFLAKLTGPGIVFLQSLPLARLSDRLTSA 227
>gi|255535414|ref|YP_003095785.1| hypothetical protein FIC_01275 [Flavobacteriaceae bacterium
3519-10]
gi|255341610|gb|ACU07723.1| hypothetical protein FIC_01275 [Flavobacteriaceae bacterium
3519-10]
Length = 260
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/239 (35%), Positives = 132/239 (55%), Gaps = 8/239 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
I ++I G E Q +I L P E V++ PGS M+ ++M+ ++ NE G L
Sbjct: 6 IDYRIHGEEMQCVEIELDPNESVISEPGSFMMMTDGIQMQTMFGDGNEKGFMGKLLSAGK 65
Query: 79 ----GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
G+ + N V AAP +I+P+DL+ G+++CQ D+FLC+ V
Sbjct: 66 RVLTGENLFMTAFTNISNQKRQVSFAAPYSGKIIPLDLSELDGKVICQKDSFLCAAKGVA 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
V +++ + G EGF+ QKL G G+ F+ +GG V++K L+ GE++ +D CIVA T
Sbjct: 126 VDVEFQRKLGTGLFGGEGFIMQKLEGDGMTFVHSGGHVIEKQLQPGEILKIDTGCIVAFT 185
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
V+ I++ G I+ +FGG+ L A + GPG V+IQ+LP RL+ RI + T+ E
Sbjct: 186 KDVDYDIQFVGGIKNTIFGGEGLFFAQLRGPGKVWIQTLPIARLAARILQFGTTKKGEE 244
>gi|403071089|ref|ZP_10912421.1| hypothetical protein ONdio_16036 [Oceanobacillus sp. Ndiop]
Length = 264
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 93/230 (40%), Positives = 136/230 (59%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG- 79
I F++ G + Q ++ L PQE V+A GS+ M + M+ I+ +N+ G+ L G
Sbjct: 6 IDFKLYGDDMQFVEVELDPQETVIAEAGSLMMMDDQIRMQTIFGDGTDNQSGLMGKLIGA 65
Query: 80 --KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ IT L N G V A+P +I+P+DL+ G+L+CQ DAFL +
Sbjct: 66 GKRVITGESLFMTTFTNEGSGKKHVSFASPYPGKIIPMDLSEINGKLICQKDAFLAAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G G+AFI AGG++ +K LE GEV+ VD C+VA
Sbjct: 126 VAVGIEFQKKIGTGFFGGEGFIMQKLEGDGMAFIHAGGTMQKKVLEPGEVLKVDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+T+SV+ I+Y G ++ A+FGG+ L A + GPG V+IQSLPF RL+ RI
Sbjct: 186 MTASVDYNIEYIGGVKTALFGGEGLFFATLRGPGTVWIQSLPFSRLASRI 235
>gi|310779547|ref|YP_003967880.1| hypothetical protein [Ilyobacter polytropus DSM 2926]
gi|309748870|gb|ADO83532.1| protein of unknown function DUF124 [Ilyobacter polytropus DSM 2926]
Length = 258
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 82/231 (35%), Positives = 132/231 (57%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI ++I G + Q+ +I L P E V A G+M +M +EM+ G + + G++
Sbjct: 4 VIDYKIYGHDMQLVEIELDPGEGVRAEAGAMTYMESQIEMQTGAGGGLFGGFKRMITGES 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+N G + VG +AP +I+P+DL F GE +CQ D+FLC+ N + + +
Sbjct: 64 FFITTFKNNGKAKKSVGFSAPYPGQIIPLDLNKFNGEFICQKDSFLCAANGIDIDVAFTK 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ G EGF+ Q+L G G+AF+ AGGS+++K L+ GE + VD CIV SV+ I
Sbjct: 124 KLGAGFFGGEGFILQRLKGDGMAFVHAGGSIIEKELKAGETLRVDTGCIVGFERSVDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
++ G + A+FG + L A + GPG V+IQS+P +L++R+ + S R
Sbjct: 184 RFVGGFKNALFGKEGLFLATLEGPGRVYIQSMPISKLAERLYANMGSSRER 234
>gi|163756287|ref|ZP_02163401.1| hypothetical protein KAOT1_01524 [Kordia algicida OT-1]
gi|161323639|gb|EDP94974.1| hypothetical protein KAOT1_01524 [Kordia algicida OT-1]
Length = 266
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 131/235 (55%), Gaps = 11/235 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I ++I G E Q +I L PQE VVA GS M + M I+ E G+ LF
Sbjct: 6 IDYEIFGAEMQYVEIELDPQEAVVAEAGSFMMMDDGITMNTIFGDGSNQEKGILGKLFSA 65
Query: 79 GKTI---TSVVLRNPGPSDGF----VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
GK I S+ + + F V AAP +I+PIDL + G+ +CQ DAFLC+
Sbjct: 66 GKRILTGESLFMTSFLNDSSFGVRKVSFAAPYPGKIVPIDLTDYRGKFICQKDAFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ + G EGF+ QKL G G+AF+ AGG++ +K L GE + VD CIV
Sbjct: 126 GVTVGIEFSKKLGRGLFGGEGFIMQKLEGDGMAFVNAGGTLARKQLAPGETLRVDTGCIV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T V+ I++ G I+ +FGG+ + A + GPGIV+IQSLPF RL+ RI ++
Sbjct: 186 GFTQDVDYDIEFVGGIKNTLFGGEGMFFATLKGPGIVYIQSLPFSRLAGRIVASI 240
>gi|421074999|ref|ZP_15536017.1| protein of unknown function DUF124 [Pelosinus fermentans JBW45]
gi|392526891|gb|EIW49999.1| protein of unknown function DUF124 [Pelosinus fermentans JBW45]
Length = 347
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 85/247 (34%), Positives = 139/247 (56%), Gaps = 12/247 (4%)
Query: 12 PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY----IP 67
P ++ + ++I G E Q ++ L P E V+A G+M + ++EM+ I+
Sbjct: 66 PTTIRTRTSDEVDYKIFGTEMQFVEVELDPGESVIAEAGAMMYKDSAVEMDTIFGDGSAK 125
Query: 68 ENEVGMW--------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGEL 119
++ G++ + L G+++ V + G + V AP I+P+ LA GG +
Sbjct: 126 SSQGGLFDKLVGAGKRLLTGESLFMTVFTHRGYNKAKVAFGAPYPGNIIPMHLAQLGGSI 185
Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
+CQ D+FLC+ V V +++ + G EGF+ QKL G GL+F+ AGG++V++ L+
Sbjct: 186 ICQKDSFLCAAKGVSVGIYFQRKIMTGLFGGEGFIMQKLEGDGLSFMHAGGTIVERQLQS 245
Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
GEV+ VD CIVA SV I+ G I+ ++FGG+ L AV+ GPG +++QSLPF RL+
Sbjct: 246 GEVLHVDTGCIVAFEPSVQFDIQQAGNIKSSLFGGEGLFFAVLQGPGKIWLQSLPFSRLA 305
Query: 240 QRIARAV 246
R+ A
Sbjct: 306 GRMLAAA 312
>gi|149180844|ref|ZP_01859346.1| hypothetical protein BSG1_00395 [Bacillus sp. SG-1]
gi|148851363|gb|EDL65511.1| hypothetical protein BSG1_00395 [Bacillus sp. SG-1]
Length = 265
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 131/231 (56%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW------ 76
I +Q+ G + Q ++ L P E V+A G+M M +EME I+ + G +
Sbjct: 6 IEYQLYGDDMQFVEVELDPSESVIAEAGAMMMMEDGIEMETIFGDGSASGKSGFFGKLVG 65
Query: 77 -----LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
L G+++ V N G V AAP +I+ +DL+ GG+++CQ D+FLC+
Sbjct: 66 AGKRVLTGESLFMTVFTNQGMGKKHVSFAAPYPGKIISVDLSEMGGKVICQKDSFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ QKL G GLAF+ AGG++ +K L+ GE + VD C+V
Sbjct: 126 GVSVGIDFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIHRKELQPGEKLRVDTGCLV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A+T V+ I+Y G I+ A FGG+ + A V GPG V+IQSLPF RL+ R+
Sbjct: 186 AMTREVDYNIEYVGKIKSAFFGGEGMFFATVQGPGTVWIQSLPFSRLADRV 236
>gi|410461812|ref|ZP_11315452.1| hypothetical protein BAZO_21083 [Bacillus azotoformans LMG 9581]
gi|409925200|gb|EKN62423.1| hypothetical protein BAZO_21083 [Bacillus azotoformans LMG 9581]
Length = 265
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 88/230 (38%), Positives = 133/230 (57%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG- 79
I +++ G + Q +I L P E +A G+M +EME I+ ++ G++ L G
Sbjct: 6 IEYKLYGDDMQFVEIELDPGESTIAEAGAMMMQEDGIEMETIFGDGASSDKGLFGKLLGA 65
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ V N PS V AAP +I+P+DL+ GG+++CQ DAFLC+
Sbjct: 66 GKRILTGESLFMTVFTNQSPSIKHVSFAAPYPGKIIPMDLSELGGKVICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +R+ G EGF+ QKL G GLAF+ AGG++ ++ L+ GE + +D C+VA
Sbjct: 126 VAVGIEFQRRLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIYRRELQPGERLKIDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+T ++ I+Y G I+ A FGG+ L A V GPG V++QSLPF RL+ RI
Sbjct: 186 MTRDIDYNIEYVGKIKTAFFGGEGLFFATVQGPGTVWVQSLPFSRLADRI 235
>gi|410657403|ref|YP_006909774.1| DUF124 domain-containing protein [Dehalobacter sp. DCA]
gi|410660439|ref|YP_006912810.1| DUF124 domain-containing protein [Dehalobacter sp. CF]
gi|409019758|gb|AFV01789.1| DUF124 domain-containing protein [Dehalobacter sp. DCA]
gi|409022795|gb|AFV04825.1| DUF124 domain-containing protein [Dehalobacter sp. CF]
Length = 267
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 132/232 (56%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK-- 80
I +++ G + Q ++ L P E V+A G++ FM ++ME I+ + G + L GK
Sbjct: 7 IEYKVYGDDMQFVEVELDPAESVIAEAGALMFMEPGIQMETIFGDGSGQGENRGLTGKLF 66
Query: 81 ----------TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
++ + N G AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 67 GAGKRLLTGESLFMTLFTNSGAGKAHASFAAPYPGKIIPVDLTAYNGKVICQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + ++R+ G EGF+ QKL G GLAF+ AGG++V+K L GE + VD C+
Sbjct: 127 KGVAIGIEFNKRLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIVEKELRPGETLRVDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
V +T SV I++ G I+ A FGG+ L A V+GPG V++QSLPF RL+ RI
Sbjct: 187 VGMTGSVCYDIQFVGGIKTAFFGGEGLFFATVSGPGKVWLQSLPFSRLADRI 238
>gi|337747242|ref|YP_004641404.1| hypothetical protein KNP414_02976 [Paenibacillus mucilaginosus
KNP414]
gi|386723772|ref|YP_006190098.1| hypothetical protein B2K_16665 [Paenibacillus mucilaginosus K02]
gi|336298431|gb|AEI41534.1| protein of unknown function DUF124 [Paenibacillus mucilaginosus
KNP414]
gi|384090897|gb|AFH62333.1| hypothetical protein B2K_16665 [Paenibacillus mucilaginosus K02]
Length = 262
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 135/242 (55%), Gaps = 10/242 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I + I G E Q +I L P E V+A GS+ M S+ ME I+ ++ G
Sbjct: 6 IDYAIHGSEMQYVEIELDPGESVIAEAGSLMMMDQSIHMETIFGDGSDQGKGLMGKLFGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + N G A+P RILP+DL+M GG+++CQ D+FLC+
Sbjct: 66 GKRLLTGESLFMTIFTNRGHGKERASFASPYPGRILPMDLSMLGGKIICQKDSFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QK+ G GLAF+ AGG++ ++ L GE+I VD C+VA
Sbjct: 126 VSVGIDFQRKLGAGFFGGEGFIMQKIEGDGLAFVHAGGAICERELRPGEMIRVDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T V+ I++ ++ A+FGG+ L A + GPG V+IQSLPF RL+ R+ A +
Sbjct: 186 MTQGVDYDIEFVKGVKTALFGGEGLFFATLRGPGRVWIQSLPFSRLADRVLAAAGPGGRK 245
Query: 253 EN 254
E
Sbjct: 246 EE 247
>gi|256420192|ref|YP_003120845.1| hypothetical protein Cpin_1146 [Chitinophaga pinensis DSM 2588]
gi|256035100|gb|ACU58644.1| protein of unknown function DUF124 [Chitinophaga pinensis DSM 2588]
Length = 261
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 85/244 (34%), Positives = 134/244 (54%), Gaps = 14/244 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I ++I G E Q+ +I L PQE +A GS M ++M+ ++ ++ Q +FGK +
Sbjct: 7 IDYRIYGEEIQIVEIELDPQETAIAESGSFMMMDQDIQMQTMFGDGSQAN--QGIFGKLV 64
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
++ N G V AAP +I+P+DL GG+++CQ DAFLC+
Sbjct: 65 SAGKRMLTGESLFMTAFTNMGQGKKKVSFAAPYPGKIIPMDLQRMGGKVICQKDAFLCAA 124
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ I G EGF+ +KL G G+AF+ AGG V++++L G+V+ +D C+
Sbjct: 125 KGVSIGIEWQRKLGTGIFGGEGFIMEKLEGDGMAFVHAGGMVIERDLLPGQVLKIDTGCV 184
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
VA TS V+ +++ IR VFGG+ L A + GPG V+IQSLP R++ R+ T
Sbjct: 185 VAYTSGVHFDVEFVKGIRNMVFGGEGLFFATLRGPGKVWIQSLPISRMAARLVSYGTGKR 244
Query: 251 MREN 254
E
Sbjct: 245 KEEG 248
>gi|182413474|ref|YP_001818540.1| hypothetical protein Oter_1656 [Opitutus terrae PB90-1]
gi|177840688|gb|ACB74940.1| protein of unknown function DUF124 [Opitutus terrae PB90-1]
Length = 266
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 92/244 (37%), Positives = 134/244 (54%), Gaps = 12/244 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VGM 73
+ +QI G + Q +I L P E VVA G M FM + ME ++ + +G
Sbjct: 7 VDYQIHGDDMQFVEIELDPAEAVVAEAGGMMFMEDGIAMETVFGDGSTRNSGVMGALLGA 66
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V +N V AP +ILPI LA GGEL+ Q D+FLC+
Sbjct: 67 GKRLLTGESLFMTVFQNQAAGKKRVSFGAPYPGKILPIKLADVGGELIAQKDSFLCAAKG 126
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +R+ + G EGF+ Q+LTG G AF+ AGG++ ++ L GE++ VD CIVA
Sbjct: 127 VSIGIAFQRRLGAGLFGGEGFIMQRLTGDGWAFVHAGGTLCERTLAPGELLRVDTGCIVA 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV--TSPN 250
SV+ I++ G I+ A+FGG+ L A + GPG +++QSLPF RL+ RI A T
Sbjct: 187 FQPSVDYDIQWVGGIKSALFGGEGLFFATLRGPGKLWLQSLPFSRLAGRIIAAAPQTGRG 246
Query: 251 MREN 254
RE
Sbjct: 247 GREE 250
>gi|302875052|ref|YP_003843685.1| hypothetical protein Clocel_2178 [Clostridium cellulovorans 743B]
gi|307690329|ref|ZP_07632775.1| hypothetical protein Ccel74_19401 [Clostridium cellulovorans 743B]
gi|302577909|gb|ADL51921.1| protein of unknown function DUF124 [Clostridium cellulovorans 743B]
Length = 266
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 91/248 (36%), Positives = 146/248 (58%), Gaps = 18/248 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQ---- 75
I ++I G + Q +I L P+E VVA G+M M S++ME I+ +N ++
Sbjct: 6 IDYKITGDDIQYVEIELDPKETVVAEAGAMMMMDPSIQMETIFGDGSNQNPSSTFKDKLF 65
Query: 76 -----WLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
L G+++ N G V AAP +I+P+DL++ GG ++CQ DAFLC+
Sbjct: 66 YAGKRVLTGESLFMTAFTNLGNIKEKVYFAAPYPGKIIPMDLSLLGGRIICQKDAFLCAA 125
Query: 131 NDVKVSNTLDQRVRNVIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
V + +D R + +G G EGF+ +KL G G+AFI A G++++K+L GE + VD
Sbjct: 126 KGVSIG--IDFRKKLSVGFFGGEGFILEKLEGDGVAFIHACGAIIKKDLMPGETLRVDTG 183
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
C+VA+T V+ I++ G ++ +FGG+ + A +TGPG V+IQSLPF RL++R+ A +
Sbjct: 184 CLVAMTQEVHYDIQFVGGVKNTLFGGEGVFFATLTGPGTVWIQSLPFSRLAERVFAA--A 241
Query: 249 PNMRENPK 256
P++ + K
Sbjct: 242 PHLGGHRK 249
>gi|392953242|ref|ZP_10318796.1| hypothetical protein WQQ_28680 [Hydrocarboniphaga effusa AP103]
gi|391858757|gb|EIT69286.1| hypothetical protein WQQ_28680 [Hydrocarboniphaga effusa AP103]
Length = 271
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 86/245 (35%), Positives = 134/245 (54%), Gaps = 10/245 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWL-- 77
+I ++I G + Q ++ L P E V G+M FM + M+ ++ E G W L
Sbjct: 5 IIDYKIYGHDMQFVEVELDPGEAAVGEAGAMMFMEDGIVMDTVFGDGSAQEKGFWGKLAG 64
Query: 78 ------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ + V N G V AAP I+P+ L GG L+CQ DAFLC+
Sbjct: 65 AGKRLVTGESLFTTVFSNQGQGKRKVSFAAPYPGTIVPVLLPDVGGTLICQKDAFLCAAK 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ Q+L G GLAF+ AGG++ +++L GE + VD CIV
Sbjct: 125 GVSLGIAFQRKLGVGLFGGEGFIMQRLDGDGLAFVHAGGTLYKRDLRPGETLRVDTGCIV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A+T +V+ I+Y G ++ A+F G+ L AV+ GPG V++QSLP R++ RI A +
Sbjct: 185 ALTQTVSYDIQYVGKLKSALFSGEGLFYAVLQGPGTVWLQSLPLSRMANRIVMAAPAMGG 244
Query: 252 RENPK 256
R +
Sbjct: 245 RSRDE 249
>gi|73668506|ref|YP_304521.1| hypothetical protein Mbar_A0969 [Methanosarcina barkeri str.
Fusaro]
gi|72395668|gb|AAZ69941.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
Length = 262
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 91/241 (37%), Positives = 137/241 (56%), Gaps = 11/241 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------Q 75
I ++I+G + Q+ +I L P+E V A G+M +M ++M+ + +
Sbjct: 5 IDYEIIGNDMQIVEIELDPEEAVQAEAGAMAYMGPGIQMQTSMSNDGGGLFGGLKKGLKR 64
Query: 76 WLFGKT--ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
L G++ ITS V + G G V AAP +I+P+DL FGG LLCQ D+FLC+ V
Sbjct: 65 ALTGESFFITSFVQKGSGK--GHVAFAAPYPGKIIPLDLTKFGGGLLCQKDSFLCAARGV 122
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
++ +++ G EGF+ Q+L G GLAF+ GG+V++K+L VGE VD C+ A
Sbjct: 123 EIEVAFTRKLGVGFFGGEGFILQRLKGDGLAFLHIGGTVIRKDLAVGETYRVDTGCVAAF 182
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
T +V I ++ + A+FGG+ +V A +TGPG V++QSLPF RL+ RI A N E
Sbjct: 183 TETVTYDITWSRNFKNALFGGEGVVLATLTGPGTVYMQSLPFSRLADRIFAASAFGNREE 242
Query: 254 N 254
Sbjct: 243 Q 243
>gi|21227408|ref|NP_633330.1| HTH DNA-binding protein [Methanosarcina mazei Go1]
gi|452209891|ref|YP_007490005.1| Hypothetical protein MmTuc01_1354 [Methanosarcina mazei Tuc01]
gi|20905772|gb|AAM31002.1| HTH DNA-binding protein [Methanosarcina mazei Go1]
gi|452099793|gb|AGF96733.1| Hypothetical protein MmTuc01_1354 [Methanosarcina mazei Tuc01]
Length = 250
Score = 157 bits (396), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 86/229 (37%), Positives = 125/229 (54%), Gaps = 7/229 (3%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------QWLFGKTITSV 85
Q+ +I L P E V A G+M +M + M+ E + L G++
Sbjct: 2 QIVEIELDPGEAVQAEAGAMAYMGPGILMQTGMGNEGGGLFGGLKKGLKRALTGESFFIT 61
Query: 86 VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
+ G G V AAP +ILP+DL+ FGG +LCQ DAFLC+ ++V +++
Sbjct: 62 SFIHKGSGKGHVAFAAPYPGKILPLDLSKFGGSILCQKDAFLCAAKGIEVELAFTRKLGA 121
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
+ G EGF+ Q+L G GLAFI GG+V++K+L GE VD C+ A T +V I ++
Sbjct: 122 GLFGGEGFILQRLRGDGLAFIHIGGTVIRKDLAPGETYKVDTGCVAAFTENVTYDITWSR 181
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
+ A+FGG+ +V A +TGPG V++QSLPF RL+ RI A RE
Sbjct: 182 DFKNALFGGEGVVLATLTGPGTVYMQSLPFSRLADRIFAASAYGQNREE 230
>gi|452995794|emb|CCQ92476.1| conserved hypothetical protein [Clostridium ultunense Esp]
Length = 261
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 88/229 (38%), Positives = 132/229 (57%), Gaps = 9/229 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I G E Q ++ L P E V+A G++ M ++ME I+ G +
Sbjct: 6 IDYRIYGEEMQFVEVELDPGESVIAEAGALMMMDDEIQMETIFGDGTHQGGFMSKLFAAG 65
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ V N G V A+P RI+P+DL+ G+++CQ DAFLC+ V
Sbjct: 66 KRVLTGESLFMTVFTNEGRGKKRVSFASPYPGRIIPLDLSELDGKVICQKDAFLCAAKGV 125
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
V +++ G EGF+ QKL G GLAF+ AGG++V++ L GE + VD C+VA+
Sbjct: 126 SVGIDFQRKLGTGFFGGEGFIMQKLEGDGLAFVHAGGTIVERTLAPGERLKVDTGCLVAL 185
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T V+ I++ G I+ +FGG+ L A +TGPG V+IQSLPF RL+ RI
Sbjct: 186 TQDVDYDIQFVGGIKNTLFGGEGLFFATLTGPGKVWIQSLPFARLADRI 234
>gi|118581514|ref|YP_902764.1| hypothetical protein Ppro_3113 [Pelobacter propionicus DSM 2379]
gi|118504224|gb|ABL00707.1| protein of unknown function DUF124 [Pelobacter propionicus DSM
2379]
Length = 327
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 91/247 (36%), Positives = 135/247 (54%), Gaps = 12/247 (4%)
Query: 9 PFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPE 68
P P ++ D I ++I+G E Q +I L P E V+A G+M + S+ ME I+
Sbjct: 58 PAPPSGRRASDD--IDYRIVGSEMQFVEIELDPGESVIAEAGAMMYKDASITMEAIFGDG 115
Query: 69 NEVGMW----------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
+ G + L G+++ + G AAP I+P LA GG
Sbjct: 116 SSQGGGFTDKLFSAGKRILTGESLFMTLFTQSGQGKARAAFAAPYPGNIIPATLASLGGT 175
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
L+CQ D FLC+ V V +R+ + G EGF+ QKL G GLAF+ AGG++V++ L
Sbjct: 176 LICQKDCFLCAARGVSVGIFFQRRIMTGLFGGEGFIMQKLEGDGLAFLHAGGTIVERVLS 235
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
GE + VD C+VA+ +V+ I+ G I+ A+FGG+ L A +TGPG V++QSLPF RL
Sbjct: 236 AGETLHVDTGCLVAMEPTVSFDIQQAGNIKTALFGGEGLFFARLTGPGRVWLQSLPFSRL 295
Query: 239 SQRIARA 245
+ R+ +A
Sbjct: 296 AGRMLQA 302
>gi|408370142|ref|ZP_11167921.1| hypothetical protein I215_04540 [Galbibacter sp. ck-I2-15]
gi|407744617|gb|EKF56185.1| hypothetical protein I215_04540 [Galbibacter sp. ck-I2-15]
Length = 266
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 129/234 (55%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I +QI G E Q +I L PQE VVA GS M + M+ I+ + G+ LF
Sbjct: 6 IDYQIYGEEMQFVEIELDPQEAVVAEAGSFMMMEDGLNMDTIFGDGSNQDRGVLGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N V AAP +I+PIDL + G+ +CQ DAFLC+
Sbjct: 66 GKRMLTGESLFMTAYLNTTGLKKKVSFAAPYPGKIVPIDLRKYQGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +R+ G EGF+ QKL G G+AF+ AGG++ +K L GE++ VD CIV
Sbjct: 126 VSVGIEFSKRLGRGFFGGEGFIMQKLQGDGMAFVHAGGTLARKELAEGELLKVDTGCIVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T ++ I++ G I+ VFGG+ L A + GPG+V+IQSLPF RL+ R+ A+
Sbjct: 186 FTKDIDYDIQFVGGIKNTVFGGEGLFYATLRGPGVVYIQSLPFSRLASRVFAAI 239
>gi|260221740|emb|CBA30605.1| hypothetical protein Csp_C24390 [Curvibacter putative symbiont of
Hydra magnipapillata]
Length = 293
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 131/230 (56%), Gaps = 9/230 (3%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------- 74
V+ ++I G E Q ++ L P E + G M +M + M+ I+ ++ G +
Sbjct: 30 VVDYEIFGSEMQYVEVELDPGEAAIGEAGVMMYMQEGITMDTIFGDGSQQGGFFGKLMGA 89
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+ + + V N G V AAP +I+P+DL+ GG ++CQ D+FLC+
Sbjct: 90 GKRLLTGEGLFTTVFENSGSGKQRVAFAAPYPGKIVPMDLSKLGGTIICQKDSFLCAAKG 149
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G+AF+ AGG+++++ L GE + VD C+VA
Sbjct: 150 VALGIAFQKKLGVGLFGGEGFIMQKLDGDGMAFVHAGGTLMERTLAPGETLRVDTGCVVA 209
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+V+ I++ G ++ A+FGG+ L A + GPG V++QSLP RL+ RI
Sbjct: 210 FQPTVDFDIQFVGKVKSAIFGGEGLFFATLRGPGKVWLQSLPLSRLANRI 259
>gi|95931485|ref|ZP_01314184.1| protein of unknown function DUF124 [Desulfuromonas acetoxidans DSM
684]
gi|95132462|gb|EAT14162.1| protein of unknown function DUF124 [Desulfuromonas acetoxidans DSM
684]
Length = 333
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/234 (36%), Positives = 131/234 (55%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI---PENEVGMWQWL-- 77
I F I G E Q +I L P E VA G+M + + ++ ME ++ P+ M + L
Sbjct: 75 IDFTIHGTEMQFVEIELDPGESAVAEAGAMMYKASTISMETVFGDGGPQTGGFMGKLLGA 134
Query: 78 -----FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ + V + G V AP I+P+ L GG L+CQ DAFLC+
Sbjct: 135 GKRLVTGESLFTTVFTHQGHGKAHVAFGAPYPGNIIPVALDAMGGSLICQKDAFLCAARG 194
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + L +R+ + G EGF+ QKL G G+AF+ AGGS+V++ L+ GE + VD C+VA
Sbjct: 195 VSIGLHLQKRILTGLFGGEGFIMQKLEGDGMAFLHAGGSIVERELKPGEELHVDTGCVVA 254
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
V+ I+ G I+ ++FGG+ L A + GPG +++QSLPF RL+ R+ +A
Sbjct: 255 YEPKVSFDIQQAGGIKTSLFGGEGLFFAKLQGPGKIWLQSLPFSRLAGRMLQAA 308
>gi|169826442|ref|YP_001696600.1| hypothetical protein Bsph_0854 [Lysinibacillus sphaericus C3-41]
gi|168990930|gb|ACA38470.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41]
Length = 266
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/235 (36%), Positives = 134/235 (57%), Gaps = 12/235 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q Q+ L PQE VVA G++ M S+ ME I+ ++ G L GK +
Sbjct: 6 IDYKLFGDDMQYVQVELDPQETVVAEAGALMMMDDSIHMETIFGDGSKGGSQSGLMGKLL 65
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ N G V A+P +I+P+DL+ G+++CQ DAFL +
Sbjct: 66 GAGKRLVTGESLFMTTFTNVGSGKKHVYFASPYPGKIIPMDLSQLNGKIICQKDAFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V +++ G EGF+ QKL G G+AF+ AGG++ +K L+ GEV+ VD C+
Sbjct: 126 KGVSVGVEFQKKIGVGFFGGEGFIMQKLEGDGMAFVHAGGAIHEKTLQPGEVLRVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
VA+TS+V+ I+ G ++ A+FGG+ + A + GPG V++QSLPF RL+ R+ A
Sbjct: 186 VAMTSNVDYDIELVGGVKTALFGGEGIFFATLRGPGTVWVQSLPFSRLASRVFAA 240
>gi|189423881|ref|YP_001951058.1| hypothetical protein Glov_0812 [Geobacter lovleyi SZ]
gi|189420140|gb|ACD94538.1| protein of unknown function DUF124 [Geobacter lovleyi SZ]
Length = 267
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 82/241 (34%), Positives = 133/241 (55%), Gaps = 10/241 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VG 72
VI ++I G E Q ++ L P E + G M +M ++M+ ++ ++ +G
Sbjct: 5 VIDYEIFGSEMQFVEVELDPGEAAIGEAGVMMYMQDGIQMDTVFGDGSQQTSGFMGKLLG 64
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V N V AAP +I+P+ L GG L+CQ D+FLC+
Sbjct: 65 AGKRLLTGESLFTTVFHNESSGKRRVAFAAPYPGKIVPVQLTDIGGTLICQKDSFLCAAK 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ QKL G G+AFI AGG++ ++ L GE + VD C+V
Sbjct: 125 GVSLGIAFQKKIGTGLFGGEGFIMQKLDGDGMAFIHAGGTLHERTLAPGETLRVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A SV+ I++ G I+ AVFGG+ L A ++GPG +++QSLP RL+ RI + +
Sbjct: 185 AFQPSVDFDIQFVGKIKSAVFGGEGLFFATLSGPGKIWLQSLPLSRLANRIVMSAPATGG 244
Query: 252 R 252
R
Sbjct: 245 R 245
>gi|126651021|ref|ZP_01723232.1| hypothetical protein BB14905_20240 [Bacillus sp. B14905]
gi|126592222|gb|EAZ86271.1| hypothetical protein BB14905_20240 [Bacillus sp. B14905]
Length = 266
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/235 (36%), Positives = 134/235 (57%), Gaps = 12/235 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q Q+ L PQE VVA G++ M S+ ME I+ ++ G L GK +
Sbjct: 6 IDYKLFGDDMQYVQVELDPQETVVAEAGALMMMDDSIHMETIFGDGSKGGSQSGLMGKLL 65
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ N G V A+P +I+P+DL+ G+++CQ DAFL +
Sbjct: 66 GAGKRLVTGESLFMTTFTNVGSGKKHVYFASPYPGKIIPMDLSQLNGKIICQKDAFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V +++ G EGF+ QKL G G+AF+ AGG++ +K L+ GEV+ VD C+
Sbjct: 126 KGVSVGVEFQKKIGVGFFGGEGFIMQKLEGDGMAFVHAGGAIHEKTLQPGEVLRVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
VA+TS+V+ I+ G ++ A+FGG+ + A + GPG V++QSLPF RL+ R+ A
Sbjct: 186 VAMTSNVDYDIELVGGVKTALFGGEGIFFATLRGPGTVWVQSLPFSRLASRVFAA 240
>gi|383451724|ref|YP_005358445.1| hypothetical protein KQS_12415 [Flavobacterium indicum GPTSA100-9]
gi|380503346|emb|CCG54388.1| Protein of unknown function [Flavobacterium indicum GPTSA100-9]
Length = 264
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 126/232 (54%), Gaps = 14/232 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I + I G E Q +I L PQE V+A G M ++ME I+ ++ Q GK +
Sbjct: 6 IDYHIYGEEMQYVEIELDPQEVVIAEAGGFMMMEDGIKMETIFGDGSQ--QQQGFMGKLL 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
++ +N G V A+P +I+ IDL + G+ +CQ DAFLC+
Sbjct: 64 SAGKRVLTGESLFMTAFQNQGLGKKKVSFASPYPGKIIAIDLREYNGKFICQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V +++ + G EGF+ QK+ G G+AF+ +GG++ +K L GEV+ +D C+
Sbjct: 124 KGVSVGIEFSKKIGRALFGGEGFIMQKIEGDGMAFVHSGGTLAKKVLAPGEVMRIDTGCV 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
V T V+ I++ G I+ +FGG+ + A + GPG V+IQSLPF RL+ RI
Sbjct: 184 VGYTQDVDFDIEFVGGIKNTIFGGEGMFYATLRGPGTVYIQSLPFSRLAGRI 235
>gi|389701311|ref|ZP_10185260.1| TIGR00266 family protein [Leptothrix ochracea L12]
gi|388591131|gb|EIM31397.1| TIGR00266 family protein [Leptothrix ochracea L12]
Length = 271
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 96/252 (38%), Positives = 139/252 (55%), Gaps = 11/252 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW----- 74
VI + I G E Q +I L P E V GSM FM + M+ ++ + G +
Sbjct: 5 VIDYDIRGSEMQFVEIELDPGEAAVGEAGSMMFMDTGITMDAVFGDGSAQQGGFFGKLLG 64
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V N + V AAP +I+P+DL GG L+CQ D+FLC+
Sbjct: 65 AGKRLLTGESLFTTVYTNQAGAKQRVAFAAPFPGKIMPMDLRQLGGMLICQKDSFLCAAR 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + Q++ G EGF+ QKL G GLAFI AGG+VV++ L+ + + VD C+V
Sbjct: 125 GVSIGIAFQQKMGVGFFGGEGFIMQKLEGDGLAFIHAGGTVVRRELQAEQSLLVDTGCLV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSPN 250
A T SVN +I+Y G I+ A+FGG+ L A ++GPG V++QSLPF RL+ R+ A A
Sbjct: 185 AHTPSVNFEIQYVGKIKTALFGGEGLFLARLSGPGQVWLQSLPFSRLASRVFAAAPQRGG 244
Query: 251 MRENPKFFVQIA 262
RE +A
Sbjct: 245 SREEGSLLGGLA 256
>gi|403237899|ref|ZP_10916485.1| hypothetical protein B1040_19211 [Bacillus sp. 10403023]
Length = 264
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 86/231 (37%), Positives = 129/231 (55%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I + + G + Q +I L P E V+A G+M M + ME I+ +
Sbjct: 6 IEYNLYGDDMQFVEIELDPNESVIAEAGAMMMMEDGISMETIFGDGSGTSGGGLLGKLFG 65
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ V N V AAP +I+PIDL++ GG+++CQ D+FLC+
Sbjct: 66 AGKRVLTGESLFMTVFTNSQMGKKHVSFAAPYPGKIIPIDLSLLGGKVVCQKDSFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ QKL G GLAF+ AGG++ +K L+ GE++ +D C+V
Sbjct: 126 GVSVGIDFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIHKKELQPGEILRIDTGCLV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+T V+ I+Y G I+ A FGG+ L A V GPG V++QSLPF RL+ RI
Sbjct: 186 GMTKEVDYNIEYVGKIKTAFFGGEGLFYATVRGPGTVWVQSLPFSRLADRI 236
>gi|325289821|ref|YP_004266002.1| hypothetical protein Sgly_1704 [Syntrophobotulus glycolicus DSM
8271]
gi|324965222|gb|ADY56001.1| protein of unknown function DUF124 [Syntrophobotulus glycolicus DSM
8271]
Length = 267
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 131/232 (56%), Gaps = 13/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ +QI G + Q ++ L P E VVA G++ +M + ME I+ + L GK +
Sbjct: 7 VDYQIFGDDMQFVEVELDPSESVVAEAGALMYMEPGIRMETIF-GDGSSNQGSGLMGKLL 65
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G G V AAP +I+P+DL + G+L+CQ DAFLC+
Sbjct: 66 GAGKRVLTGESLFMTVFTNSGSGKGHVSFAAPFPGKIIPVDLTNYNGKLICQKDAFLCAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ Q+L G G++F+ AGG++V+K L+ GE + VD C+
Sbjct: 126 KGVSIGIEFTKKLGAGFFGGEGFIMQRLEGDGMSFLHAGGTIVEKVLQPGETLRVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T V I++ G I+ A FGG+ L A V+GPG V++QSLPF RL+ RI
Sbjct: 186 VAMTGQVAYDIQFVGGIKTAFFGGEGLFFATVSGPGRVWLQSLPFSRLADRI 237
>gi|229135710|ref|ZP_04264486.1| Protein containing DUF124 [Bacillus cereus BDRD-ST196]
gi|228647747|gb|EEL03806.1| Protein containing DUF124 [Bacillus cereus BDRD-ST196]
Length = 260
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDIQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|404369660|ref|ZP_10974993.1| TIGR00266 family protein [Clostridium sp. 7_2_43FAA]
gi|226914338|gb|EEH99539.1| TIGR00266 family protein [Clostridium sp. 7_2_43FAA]
Length = 262
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 91/235 (38%), Positives = 131/235 (55%), Gaps = 11/235 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL----- 77
+ ++ILG E Q +I L P+E VVA G+M M S+ ME I ++ +L
Sbjct: 5 VDYEILGNEMQYVEIELDPRESVVAEAGAMMMMDNSITMETILGDGSQKSQGGFLGKLGG 64
Query: 78 ------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ N G V AAP +I+ +DL GG L+CQ D+FLC
Sbjct: 65 AAKRVITGESLFMTEFTNSGVLKQKVAFAAPYPGKIIAMDLRKLGGRLICQKDSFLCCAK 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
+ V +++ G EGF+ QKL G GLAFI AGG++V+K L G+ I +D C+V
Sbjct: 125 GISVGIAFTKKLGTGFFGGEGFILQKLEGDGLAFIHAGGTIVKKTLMPGDTIKIDTGCLV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
A+T +VN I++ G + AVFGG+ L A +TGPG V+IQSLPF +L+ R+ A
Sbjct: 185 AMTQNVNYDIQFVGGFKNAVFGGEGLFFATLTGPGDVWIQSLPFSKLASRVYSAA 239
>gi|228993600|ref|ZP_04153508.1| Protein containing DUF124 [Bacillus pseudomycoides DSM 12442]
gi|228999636|ref|ZP_04159213.1| Protein containing DUF124 [Bacillus mycoides Rock3-17]
gi|228760162|gb|EEM09131.1| Protein containing DUF124 [Bacillus mycoides Rock3-17]
gi|228766191|gb|EEM14837.1| Protein containing DUF124 [Bacillus pseudomycoides DSM 12442]
Length = 260
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 135/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGAGKSSGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ + G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTRDVNYDIQFVGNVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|299536964|ref|ZP_07050269.1| hypothetical protein BFZC1_13108 [Lysinibacillus fusiformis ZC1]
gi|424738144|ref|ZP_18166587.1| hypothetical protein C518_2699 [Lysinibacillus fusiformis ZB2]
gi|298727544|gb|EFI68114.1| hypothetical protein BFZC1_13108 [Lysinibacillus fusiformis ZC1]
gi|422947960|gb|EKU42348.1| hypothetical protein C518_2699 [Lysinibacillus fusiformis ZB2]
Length = 266
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 86/235 (36%), Positives = 133/235 (56%), Gaps = 12/235 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q Q+ L PQE VVA G++ M S+ ME I+ + G L GK +
Sbjct: 6 IDYKLFGDDMQYVQVELDPQETVVAEAGALMMMDDSIHMETIFGDGSRGGSQSGLMGKLL 65
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ N G V A+P +I+P+DL+ G+++CQ DAFL +
Sbjct: 66 GAGKRLITGESLFMTTFTNVGSGKKHVYFASPYPGKIIPMDLSQLNGKIICQKDAFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V +++ G EGF+ QKL G G+AF+ AGG++ +K L+ GEV+ VD C+
Sbjct: 126 KGVSVGVEFQKKIGVGFFGGEGFIMQKLEGDGMAFVHAGGAIHEKTLQPGEVLRVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
VA+TS+V+ I+ G ++ A+FGG+ + A + GPG V++QSLPF RL+ R+ A
Sbjct: 186 VAMTSNVDYDIEMVGGVKTALFGGEGIFFATLRGPGTVWVQSLPFSRLASRVFAA 240
>gi|229175580|ref|ZP_04303090.1| Protein containing DUF124 [Bacillus cereus MM3]
gi|423456874|ref|ZP_17433671.1| TIGR00266 family protein [Bacillus cereus BAG5X2-1]
gi|228607976|gb|EEK65288.1| Protein containing DUF124 [Bacillus cereus MM3]
gi|401149314|gb|EJQ56788.1| TIGR00266 family protein [Bacillus cereus BAG5X2-1]
Length = 260
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTQYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|423400263|ref|ZP_17377436.1| TIGR00266 family protein [Bacillus cereus BAG2X1-2]
gi|423479034|ref|ZP_17455749.1| TIGR00266 family protein [Bacillus cereus BAG6X1-1]
gi|401655620|gb|EJS73149.1| TIGR00266 family protein [Bacillus cereus BAG2X1-2]
gi|402426344|gb|EJV58472.1| TIGR00266 family protein [Bacillus cereus BAG6X1-1]
Length = 260
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTQYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDIQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|423512976|ref|ZP_17489507.1| TIGR00266 family protein [Bacillus cereus HuA2-1]
gi|402447269|gb|EJV79126.1| TIGR00266 family protein [Bacillus cereus HuA2-1]
Length = 260
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/248 (36%), Positives = 137/248 (55%), Gaps = 18/248 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSNGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSPNMREN 254
TSP + +
Sbjct: 236 TSPTAQNS 243
>gi|196250042|ref|ZP_03148737.1| protein of unknown function DUF124 [Geobacillus sp. G11MC16]
gi|196210556|gb|EDY05320.1| protein of unknown function DUF124 [Geobacillus sp. G11MC16]
Length = 264
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/240 (37%), Positives = 133/240 (55%), Gaps = 10/240 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +Q+ G + Q +I L P+E V+A G M M + ME ++ + G
Sbjct: 6 IDYQLYGDDMQFVEIELDPRESVIAEAGGMMMMEDGIVMETVFGDCSSSGKGLLGRLVGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ V N G V AAP +I+P++L GG+L+CQ D+FLC+
Sbjct: 66 GKRLLTGESLFMTVFTNQGTGKRRVAFAAPYPGKIIPVNLQELGGKLICQKDSFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAF+ AGG + ++ L+ GE + +D C+VA
Sbjct: 126 VSVGIEFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGMIHRRELQPGEKLRIDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T VN I+Y G I+ A FGG+ L A +T PG+V++QSLPF RL+ R+ A S R
Sbjct: 186 MTKEVNYDIEYVGNIKTAFFGGEGLFFATLTEPGVVWVQSLPFSRLADRVIAAAPSSGGR 245
>gi|229007195|ref|ZP_04164801.1| Protein containing DUF124 [Bacillus mycoides Rock1-4]
gi|228754054|gb|EEM03486.1| Protein containing DUF124 [Bacillus mycoides Rock1-4]
Length = 260
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 135/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGAGKSSGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ + G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTRDVNYDIQFVGNVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|386821223|ref|ZP_10108439.1| TIGR00266 family protein [Joostella marina DSM 19592]
gi|386426329|gb|EIJ40159.1| TIGR00266 family protein [Joostella marina DSM 19592]
Length = 266
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 88/230 (38%), Positives = 128/230 (55%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLF-- 78
I +QI G E Q +I L PQE VVA GS M ++M+ I+ + G+ LF
Sbjct: 6 IDYQIYGEEMQYVEIELDPQEAVVAEAGSFMMMDSGLQMDTIFGDGSNQDRGVLGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N V A+P +I+PIDL + G+ +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTAFLNTDLGKRKVSFASPYPGKIIPIDLTKYRGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G G+AF+ AGG++ ++ L GEV+ VD CIV
Sbjct: 126 VSVGIEFSRKLGRGFFGGEGFIMQKLEGDGMAFVHAGGTLARRELAAGEVLKVDTGCIVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I++ G I+ VFGG+ L A + GPG+V+IQSLPF RL+ R+
Sbjct: 186 FTKDIDYDIEFIGGIKNTVFGGEGLFFASLRGPGVVYIQSLPFSRLASRV 235
>gi|423471068|ref|ZP_17447812.1| TIGR00266 family protein [Bacillus cereus BAG6O-2]
gi|402433529|gb|EJV65580.1| TIGR00266 family protein [Bacillus cereus BAG6O-2]
Length = 260
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDASGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|423670445|ref|ZP_17645474.1| TIGR00266 family protein [Bacillus cereus VDM034]
gi|423673349|ref|ZP_17648288.1| TIGR00266 family protein [Bacillus cereus VDM062]
gi|401296539|gb|EJS02157.1| TIGR00266 family protein [Bacillus cereus VDM034]
gi|401310766|gb|EJS16077.1| TIGR00266 family protein [Bacillus cereus VDM062]
Length = 260
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|163942599|ref|YP_001647483.1| hypothetical protein BcerKBAB4_4702 [Bacillus weihenstephanensis
KBAB4]
gi|229169606|ref|ZP_04297309.1| Protein containing DUF124 [Bacillus cereus AH621]
gi|423368885|ref|ZP_17346317.1| TIGR00266 family protein [Bacillus cereus VD142]
gi|423490040|ref|ZP_17466722.1| TIGR00266 family protein [Bacillus cereus BtB2-4]
gi|423495764|ref|ZP_17472408.1| TIGR00266 family protein [Bacillus cereus CER057]
gi|423497442|ref|ZP_17474059.1| TIGR00266 family protein [Bacillus cereus CER074]
gi|423519565|ref|ZP_17496046.1| TIGR00266 family protein [Bacillus cereus HuA2-4]
gi|423591146|ref|ZP_17567177.1| TIGR00266 family protein [Bacillus cereus VD048]
gi|423597830|ref|ZP_17573830.1| TIGR00266 family protein [Bacillus cereus VD078]
gi|423660277|ref|ZP_17635446.1| TIGR00266 family protein [Bacillus cereus VDM022]
gi|163864796|gb|ABY45855.1| protein of unknown function DUF124 [Bacillus weihenstephanensis
KBAB4]
gi|228613881|gb|EEK71003.1| Protein containing DUF124 [Bacillus cereus AH621]
gi|401079142|gb|EJP87444.1| TIGR00266 family protein [Bacillus cereus VD142]
gi|401150093|gb|EJQ57558.1| TIGR00266 family protein [Bacillus cereus CER057]
gi|401158584|gb|EJQ65975.1| TIGR00266 family protein [Bacillus cereus HuA2-4]
gi|401162373|gb|EJQ69729.1| TIGR00266 family protein [Bacillus cereus CER074]
gi|401233293|gb|EJR39786.1| TIGR00266 family protein [Bacillus cereus VD048]
gi|401238550|gb|EJR44989.1| TIGR00266 family protein [Bacillus cereus VD078]
gi|401302947|gb|EJS08514.1| TIGR00266 family protein [Bacillus cereus VDM022]
gi|402430359|gb|EJV62437.1| TIGR00266 family protein [Bacillus cereus BtB2-4]
Length = 260
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSNGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|229014071|ref|ZP_04171194.1| Protein containing DUF124 [Bacillus mycoides DSM 2048]
gi|228747217|gb|EEL97097.1| Protein containing DUF124 [Bacillus mycoides DSM 2048]
Length = 260
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSNGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTDESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|423394880|ref|ZP_17372081.1| TIGR00266 family protein [Bacillus cereus BAG2X1-1]
gi|423405740|ref|ZP_17382889.1| TIGR00266 family protein [Bacillus cereus BAG2X1-3]
gi|401656014|gb|EJS73538.1| TIGR00266 family protein [Bacillus cereus BAG2X1-1]
gi|401660787|gb|EJS78261.1| TIGR00266 family protein [Bacillus cereus BAG2X1-3]
Length = 260
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 135/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSGGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ + G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDVNYDIQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|75760312|ref|ZP_00740361.1| Hypothetical protein RBTH_05472 [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|218900032|ref|YP_002448443.1| hypothetical protein BCG9842_B0247 [Bacillus cereus G9842]
gi|228903379|ref|ZP_04067510.1| Protein containing DUF124 [Bacillus thuringiensis IBL 4222]
gi|423358034|ref|ZP_17335537.1| TIGR00266 family protein [Bacillus cereus VD022]
gi|434378029|ref|YP_006612673.1| hypothetical protein BTF1_22930 [Bacillus thuringiensis HD-789]
gi|74492192|gb|EAO55359.1| Hypothetical protein RBTH_05472 [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|218541395|gb|ACK93789.1| conserved hypothetical protein [Bacillus cereus G9842]
gi|228856340|gb|EEN00869.1| Protein containing DUF124 [Bacillus thuringiensis IBL 4222]
gi|401086527|gb|EJP94749.1| TIGR00266 family protein [Bacillus cereus VD022]
gi|401876586|gb|AFQ28753.1| hypothetical protein BTF1_22930 [Bacillus thuringiensis HD-789]
Length = 260
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPKESVVAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKIVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T +N I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDINYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|344201479|ref|YP_004786622.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343953401|gb|AEM69200.1| protein of unknown function DUF124 [Muricauda ruestringensis DSM
13258]
Length = 266
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 91/234 (38%), Positives = 129/234 (55%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLF-- 78
I + I G E Q +I L PQE VVA GS M ++M+ I+ + G+ LF
Sbjct: 6 IDYHIYGEEMQYVEIELDPQEAVVAEAGSFMMMDTDIKMDTIFGDGSNQDTGVLGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G V A+P +I+PIDL+ GG+ +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTAFLNIGQGKKLVSFASPYPGKIVPIDLSEKGGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +R+ G EGF+ QKL G G+AF+ AGG++ +K L GE + VD CIV
Sbjct: 126 VSVGIEFSKRLGRGFFGGEGFIMQKLEGDGMAFVHAGGTMAKKVLGPGETLKVDTGCIVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+ +V+ I++ G I+ VFGG+ L A + GPG V+IQSLPF RL+ R+ A
Sbjct: 186 FSQTVDYDIEFVGGIKNTVFGGEGLFFATLRGPGTVYIQSLPFSRLASRVWAAA 239
>gi|423560610|ref|ZP_17536886.1| TIGR00266 family protein [Bacillus cereus MSX-A1]
gi|401203147|gb|EJR09987.1| TIGR00266 family protein [Bacillus cereus MSX-A1]
Length = 260
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPKESVVAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGDGKRHVSFAAPYPGKIIPVDLTEYQGKIVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T +N I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDINYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|229026345|ref|ZP_04182703.1| Protein containing DUF124 [Bacillus cereus AH1272]
gi|228734945|gb|EEL85582.1| Protein containing DUF124 [Bacillus cereus AH1272]
Length = 270
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 16 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDASGPSGGLFGKLM 73
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 74 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 133
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 134 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 193
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 194 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 248
>gi|423451822|ref|ZP_17428675.1| TIGR00266 family protein [Bacillus cereus BAG5X1-1]
gi|423557548|ref|ZP_17533850.1| TIGR00266 family protein [Bacillus cereus MC67]
gi|401142628|gb|EJQ50168.1| TIGR00266 family protein [Bacillus cereus BAG5X1-1]
gi|401192792|gb|EJQ99801.1| TIGR00266 family protein [Bacillus cereus MC67]
Length = 260
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|423521243|ref|ZP_17497716.1| TIGR00266 family protein [Bacillus cereus HuA4-10]
gi|401178602|gb|EJQ85776.1| TIGR00266 family protein [Bacillus cereus HuA4-10]
Length = 260
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|229062555|ref|ZP_04199866.1| Protein containing DUF124 [Bacillus cereus AH603]
gi|228716729|gb|EEL68423.1| Protein containing DUF124 [Bacillus cereus AH603]
Length = 260
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|229099342|ref|ZP_04230273.1| Protein containing DUF124 [Bacillus cereus Rock3-29]
gi|229105501|ref|ZP_04236142.1| Protein containing DUF124 [Bacillus cereus Rock3-28]
gi|229118353|ref|ZP_04247709.1| Protein containing DUF124 [Bacillus cereus Rock1-3]
gi|228665183|gb|EEL20669.1| Protein containing DUF124 [Bacillus cereus Rock1-3]
gi|228677926|gb|EEL32162.1| Protein containing DUF124 [Bacillus cereus Rock3-28]
gi|228684160|gb|EEL38107.1| Protein containing DUF124 [Bacillus cereus Rock3-29]
Length = 263
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 9 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSNGLFGKLM 66
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 67 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKIVCQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 187 VAMTKDVNYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241
>gi|442317456|ref|YP_007357477.1| hypothetical protein MYSTI_00442 [Myxococcus stipitatus DSM 14675]
gi|441485098|gb|AGC41793.1| hypothetical protein MYSTI_00442 [Myxococcus stipitatus DSM 14675]
Length = 264
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 87/242 (35%), Positives = 134/242 (55%), Gaps = 10/242 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VGM 73
+ F+I G + Q ++ L PQE VA G++ +M +EME I+ +E +G
Sbjct: 7 VDFKIHGDDLQFVEVELDPQEAAVAEAGTLMYMDDGIEMETIFGDGSEKKSGFLGSLLGA 66
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + V N G V AAP +I+ ++L GGEL+ Q D+FL +
Sbjct: 67 GKRLLTGESLFTTVFLNRGSGKRKVAFAAPYPGKIIAVNLQELGGELIAQKDSFLAAAKG 126
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ Q+L G GLAFI AGG+++++ L GE++ VD CIVA
Sbjct: 127 VSLGIAFQKKLGTGLFGGEGFIMQRLQGDGLAFIHAGGTLLERTLAPGELLRVDTGCIVA 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
SV+ I+ I+ A FGG+ L A + GPG V++QSLPF RL+ RI A R
Sbjct: 187 FQPSVDYDIQMVSGIKTAFFGGEGLFFATLRGPGKVWLQSLPFSRLAGRILSAAGPGGSR 246
Query: 253 EN 254
+
Sbjct: 247 DE 248
>gi|403668071|ref|ZP_10933368.1| hypothetical protein KJC8E_04832 [Kurthia sp. JC8E]
Length = 267
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/232 (36%), Positives = 135/232 (58%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMW---- 74
I F+I G + Q ++ L P E +A GSM M + ME I+ ++ G +
Sbjct: 6 IDFKIHGDDMQFVEVELDPGETAIAEAGSMMMMDDGIVMETIFGDGSAGQHGSGFFDKLK 65
Query: 75 ----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ L G+++ N G V AAP RI+P+DL+ + G+++CQ DAFL +
Sbjct: 66 GASKRLLTGESLFVTTFTNNGSGKKHVSFAAPYPGRIIPMDLSRYNGKIICQKDAFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V +++ G EGF+ QKL G GLAF+ AGG++++K+L+VGE + +D C+
Sbjct: 126 RGVSVGVEFQRKLGTGFFGGEGFIMQKLEGDGLAFVHAGGTIIRKDLQVGERLRLDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T +V+ I+ G I+ ++FGG+ L A +TGPG V++QS+PF RL+ R+
Sbjct: 186 VAMTENVDYNIEAAGGIKTSLFGGEGLFLATLTGPGSVWVQSMPFSRLASRV 237
>gi|407707386|ref|YP_006830971.1| major facilitator superfamily MFS_1 [Bacillus thuringiensis MC28]
gi|423377285|ref|ZP_17354569.1| TIGR00266 family protein [Bacillus cereus BAG1O-2]
gi|423440397|ref|ZP_17417303.1| TIGR00266 family protein [Bacillus cereus BAG4X2-1]
gi|423449454|ref|ZP_17426333.1| TIGR00266 family protein [Bacillus cereus BAG5O-1]
gi|423463460|ref|ZP_17440228.1| TIGR00266 family protein [Bacillus cereus BAG6O-1]
gi|423532813|ref|ZP_17509231.1| TIGR00266 family protein [Bacillus cereus HuB2-9]
gi|423541923|ref|ZP_17518314.1| TIGR00266 family protein [Bacillus cereus HuB4-10]
gi|423548159|ref|ZP_17524517.1| TIGR00266 family protein [Bacillus cereus HuB5-5]
gi|423614689|ref|ZP_17590523.1| TIGR00266 family protein [Bacillus cereus VD115]
gi|423622049|ref|ZP_17597827.1| TIGR00266 family protein [Bacillus cereus VD148]
gi|401128127|gb|EJQ35829.1| TIGR00266 family protein [Bacillus cereus BAG5O-1]
gi|401169743|gb|EJQ76987.1| TIGR00266 family protein [Bacillus cereus HuB4-10]
gi|401176826|gb|EJQ84020.1| TIGR00266 family protein [Bacillus cereus HuB5-5]
gi|401262147|gb|EJR68291.1| TIGR00266 family protein [Bacillus cereus VD148]
gi|401262345|gb|EJR68487.1| TIGR00266 family protein [Bacillus cereus VD115]
gi|401639094|gb|EJS56834.1| TIGR00266 family protein [Bacillus cereus BAG1O-2]
gi|402419540|gb|EJV51819.1| TIGR00266 family protein [Bacillus cereus BAG4X2-1]
gi|402421661|gb|EJV53909.1| TIGR00266 family protein [Bacillus cereus BAG6O-1]
gi|402464535|gb|EJV96225.1| TIGR00266 family protein [Bacillus cereus HuB2-9]
gi|407385071|gb|AFU15572.1| Protein containing DUF124 [Bacillus thuringiensis MC28]
Length = 260
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 89/239 (37%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSNGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKIVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDVNYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|49479346|ref|YP_038905.1| hypothetical protein BT9727_4593 [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|165869724|ref|ZP_02214382.1| conserved hypothetical protein [Bacillus anthracis str. A0488]
gi|167633901|ref|ZP_02392224.1| conserved hypothetical protein [Bacillus anthracis str. A0442]
gi|167638080|ref|ZP_02396358.1| conserved hypothetical protein [Bacillus anthracis str. A0193]
gi|170685759|ref|ZP_02876982.1| conserved hypothetical protein [Bacillus anthracis str. A0465]
gi|177651305|ref|ZP_02934136.1| conserved hypothetical protein [Bacillus anthracis str. A0174]
gi|190567250|ref|ZP_03020165.1| conserved hypothetical protein [Bacillus anthracis str.
Tsiankovskii-I]
gi|196033164|ref|ZP_03100577.1| conserved hypothetical protein [Bacillus cereus W]
gi|196043959|ref|ZP_03111196.1| conserved hypothetical protein [Bacillus cereus 03BB108]
gi|218906085|ref|YP_002453919.1| hypothetical protein BCAH820_4973 [Bacillus cereus AH820]
gi|227817656|ref|YP_002817665.1| hypothetical protein BAMEG_5150 [Bacillus anthracis str. CDC 684]
gi|228917512|ref|ZP_04081061.1| Protein containing DUF124 [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228929910|ref|ZP_04092925.1| Protein containing DUF124 [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228936171|ref|ZP_04098974.1| Protein containing DUF124 [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229124426|ref|ZP_04253614.1| Protein containing DUF124 [Bacillus cereus 95/8201]
gi|229187124|ref|ZP_04314272.1| Protein containing DUF124 [Bacillus cereus BGSC 6E1]
gi|229601942|ref|YP_002869130.1| hypothetical protein BAA_5128 [Bacillus anthracis str. A0248]
gi|254687667|ref|ZP_05151523.1| hypothetical protein BantC_27985 [Bacillus anthracis str.
CNEVA-9066]
gi|254725231|ref|ZP_05187014.1| hypothetical protein BantA1_22639 [Bacillus anthracis str. A1055]
gi|254736973|ref|ZP_05194679.1| hypothetical protein BantWNA_17531 [Bacillus anthracis str. Western
North America USA6153]
gi|254742007|ref|ZP_05199694.1| hypothetical protein BantKB_13453 [Bacillus anthracis str. Kruger
B]
gi|254754394|ref|ZP_05206429.1| hypothetical protein BantV_18087 [Bacillus anthracis str. Vollum]
gi|254757226|ref|ZP_05209253.1| hypothetical protein BantA9_02876 [Bacillus anthracis str.
Australia 94]
gi|270000527|ref|NP_847302.2| hypothetical protein BA_5116 [Bacillus anthracis str. Ames]
gi|301056374|ref|YP_003794585.1| hypothetical protein BACI_c48660 [Bacillus cereus biovar anthracis
str. CI]
gi|376268794|ref|YP_005121506.1| hypothetical protein bcf_24370 [Bacillus cereus F837/76]
gi|386738759|ref|YP_006211940.1| hypothetical protein [Bacillus anthracis str. H9401]
gi|421508720|ref|ZP_15955632.1| hypothetical protein B353_13259 [Bacillus anthracis str. UR-1]
gi|421639948|ref|ZP_16080537.1| hypothetical protein BABF1_22911 [Bacillus anthracis str. BF1]
gi|49330902|gb|AAT61548.1| conserved hypothetical protein [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|164714553|gb|EDR20072.1| conserved hypothetical protein [Bacillus anthracis str. A0488]
gi|167513897|gb|EDR89265.1| conserved hypothetical protein [Bacillus anthracis str. A0193]
gi|167530702|gb|EDR93404.1| conserved hypothetical protein [Bacillus anthracis str. A0442]
gi|170670223|gb|EDT20963.1| conserved hypothetical protein [Bacillus anthracis str. A0465]
gi|172083131|gb|EDT68193.1| conserved hypothetical protein [Bacillus anthracis str. A0174]
gi|190561754|gb|EDV15724.1| conserved hypothetical protein [Bacillus anthracis str.
Tsiankovskii-I]
gi|195994593|gb|EDX58548.1| conserved hypothetical protein [Bacillus cereus W]
gi|196025295|gb|EDX63965.1| conserved hypothetical protein [Bacillus cereus 03BB108]
gi|218537815|gb|ACK90213.1| conserved hypothetical protein [Bacillus cereus AH820]
gi|227005995|gb|ACP15738.1| conserved hypothetical protein [Bacillus anthracis str. CDC 684]
gi|228596365|gb|EEK54037.1| Protein containing DUF124 [Bacillus cereus BGSC 6E1]
gi|228659078|gb|EEL14730.1| Protein containing DUF124 [Bacillus cereus 95/8201]
gi|228823516|gb|EEM69345.1| Protein containing DUF124 [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228829826|gb|EEM75448.1| Protein containing DUF124 [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228842184|gb|EEM87283.1| Protein containing DUF124 [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|229266350|gb|ACQ47987.1| conserved hypothetical protein [Bacillus anthracis str. A0248]
gi|269850231|gb|AAP28788.2| conserved hypothetical protein [Bacillus anthracis str. Ames]
gi|300378543|gb|ADK07447.1| conserved hypothetical protein [Bacillus cereus biovar anthracis
str. CI]
gi|364514594|gb|AEW57993.1| Hypothetical protein bcf_24370 [Bacillus cereus F837/76]
gi|384388611|gb|AFH86272.1| Hypothetical Protein H9401_4886 [Bacillus anthracis str. H9401]
gi|401821343|gb|EJT20501.1| hypothetical protein B353_13259 [Bacillus anthracis str. UR-1]
gi|403393036|gb|EJY90283.1| hypothetical protein BABF1_22911 [Bacillus anthracis str. BF1]
Length = 260
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RLS R+
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|52140639|ref|YP_086190.1| hypothetical protein BCZK4615 [Bacillus cereus E33L]
gi|51974108|gb|AAU15658.1| conserved hypothetical protein [Bacillus cereus E33L]
Length = 263
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 5 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 62
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 63 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 122
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 123 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 182
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RLS R+
Sbjct: 183 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 238
Query: 247 TSP 249
TSP
Sbjct: 239 TSP 241
>gi|228910716|ref|ZP_04074527.1| Protein containing DUF124 [Bacillus thuringiensis IBL 200]
gi|228848984|gb|EEM93827.1| Protein containing DUF124 [Bacillus thuringiensis IBL 200]
Length = 260
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPKESVVAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKIVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T +N I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDINYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|423388819|ref|ZP_17366045.1| TIGR00266 family protein [Bacillus cereus BAG1X1-3]
gi|423417197|ref|ZP_17394286.1| TIGR00266 family protein [Bacillus cereus BAG3X2-1]
gi|401108615|gb|EJQ16546.1| TIGR00266 family protein [Bacillus cereus BAG3X2-1]
gi|401642894|gb|EJS60600.1| TIGR00266 family protein [Bacillus cereus BAG1X1-3]
Length = 260
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDASGPSGGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|118479980|ref|YP_897131.1| hypothetical protein BALH_4426 [Bacillus thuringiensis str. Al
Hakam]
gi|118419205|gb|ABK87624.1| conserved hypothetical protein [Bacillus thuringiensis str. Al
Hakam]
Length = 263
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 5 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 62
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 63 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 122
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 123 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 182
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RLS R+
Sbjct: 183 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 238
Query: 247 TSP 249
TSP
Sbjct: 239 TSP 241
>gi|345866906|ref|ZP_08818927.1| hypothetical protein BZARG_802 [Bizionia argentinensis JUB59]
gi|344048826|gb|EGV44429.1| hypothetical protein BZARG_802 [Bizionia argentinensis JUB59]
Length = 266
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 85/230 (36%), Positives = 125/230 (54%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
I + I G E Q +I L PQE VVA GS M ++ME I+ G LFG
Sbjct: 6 IDYAIYGEEMQYVEIELDPQEGVVAEAGSFMMMDDGIKMETIFGDGSSQNKGFMGSLFGA 65
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ V N + A+P +IL IDL G+ +CQ DAFLC+
Sbjct: 66 GKRILTGESLFMTVFYNNLVGKRRISFASPYPGKILAIDLTECDGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +R+ + G EGF+ QKL G G+AF+ AGG++ ++ L+ GE + VD CIV
Sbjct: 126 VTIGIEFSKRLGRGLFGGEGFIMQKLEGDGMAFVHAGGTLAKRILQAGETLRVDTGCIVG 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ +++ G I+ +FGG+ L A + GPG V+IQSLPF RL++R+
Sbjct: 186 FTQGIDYDVEFVGGIKNTIFGGEGLFFAKLQGPGTVYIQSLPFSRLARRV 235
>gi|302343437|ref|YP_003807966.1| hypothetical protein Deba_2007 [Desulfarculus baarsii DSM 2075]
gi|301640050|gb|ADK85372.1| protein of unknown function DUF124 [Desulfarculus baarsii DSM 2075]
Length = 338
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 84/234 (35%), Positives = 134/234 (57%), Gaps = 14/234 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY------------IPENE 70
+ ++I G E Q +I L E VVA G M +MS + M+ ++ + +
Sbjct: 76 VDYRIEGNEMQYVEIELDQGESVVAEAGGMLYMSADIAMDTVFGDGRRDNAKSSGLMGSL 135
Query: 71 VGMWQWLF-GKTITSVVLRNPGPS-DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
+G + L G+++ + + G S V AAP +I+P+ L GGEL+CQ DAFLC
Sbjct: 136 LGAGKRLLTGESLFMTIFTHKGASPKAHVAFAAPYPGKIMPMHLGELGGELICQKDAFLC 195
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
+ V + +++ + G EGF+ Q+L G GLAF+ AGG+V +K+L+ GE + VD
Sbjct: 196 AARGVALEIAFQKKIGAGLFGGEGFIMQRLLGDGLAFVHAGGTVFEKDLKPGETLRVDTG 255
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
C+VA+ SVN +++ G ++ A+FGG+ L A + GPG V++QSLP R++ RI
Sbjct: 256 CLVALMPSVNYDVEFVGGVKTALFGGEGLFLATLGGPGRVWLQSLPLSRVADRI 309
>gi|443320263|ref|ZP_21049376.1| TIGR00266 family protein [Gloeocapsa sp. PCC 73106]
gi|442790027|gb|ELR99647.1| TIGR00266 family protein [Gloeocapsa sp. PCC 73106]
Length = 236
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 133/232 (57%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI +QI G + Q+ +I L P E V A G+M +M +EM+ G + L G++
Sbjct: 4 VIDYQIFGSDLQLVEIGLDPGEGVSAEIGTMTYMEQGIEMQTTAKGGLFGGFKRILTGES 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G VG AAP R++PIDLA GG LC+ +FLC+ + ++ +
Sbjct: 64 FFISNFVNQGRQKSRVGFAAPYPGRVVPIDLAQIGGTFLCEKGSFLCAAKGIDINVAFTK 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
R+ G EGF+ Q+L+G GLAFI AGG+++ K L+ G+ + VD CIV +SSV+ I
Sbjct: 124 RLGAGFFGGEGFILQRLSGDGLAFIHAGGTLIPKQLQSGQELRVDTGCIVGFSSSVDYSI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
++ G R A+FG + L A + GPG+V++QSLP +L +RI ++ P + +
Sbjct: 184 EFVGGFRNALFGSEGLFLAKLRGPGLVYLQSLPLSKLVERIQASLVFPRVSD 235
>gi|228942045|ref|ZP_04104588.1| Protein containing DUF124 [Bacillus thuringiensis serovar berliner
ATCC 10792]
gi|228974976|ref|ZP_04135537.1| Protein containing DUF124 [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228981569|ref|ZP_04141866.1| Protein containing DUF124 [Bacillus thuringiensis Bt407]
gi|384188938|ref|YP_005574834.1| hypothetical protein CT43_CH4885 [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410677266|ref|YP_006929637.1| protein of unknown function DUF124 [Bacillus thuringiensis Bt407]
gi|423386389|ref|ZP_17363645.1| TIGR00266 family protein [Bacillus cereus BAG1X1-2]
gi|423527281|ref|ZP_17503726.1| TIGR00266 family protein [Bacillus cereus HuB1-1]
gi|452201345|ref|YP_007481426.1| hypothetical protein H175_ch4968 [Bacillus thuringiensis serovar
thuringiensis str. IS5056]
gi|228778248|gb|EEM26518.1| Protein containing DUF124 [Bacillus thuringiensis Bt407]
gi|228784829|gb|EEM32847.1| Protein containing DUF124 [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228817714|gb|EEM63796.1| Protein containing DUF124 [Bacillus thuringiensis serovar berliner
ATCC 10792]
gi|326942647|gb|AEA18543.1| hypothetical protein CT43_CH4885 [Bacillus thuringiensis serovar
chinensis CT-43]
gi|401633344|gb|EJS51125.1| TIGR00266 family protein [Bacillus cereus BAG1X1-2]
gi|402453334|gb|EJV85135.1| TIGR00266 family protein [Bacillus cereus HuB1-1]
gi|409176395|gb|AFV20700.1| protein of unknown function DUF124 [Bacillus thuringiensis Bt407]
gi|452106738|gb|AGG03678.1| hypothetical protein H175_ch4968 [Bacillus thuringiensis serovar
thuringiensis str. IS5056]
Length = 260
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPKESVIAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T +N I++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDINYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|229032525|ref|ZP_04188491.1| Protein containing DUF124 [Bacillus cereus AH1271]
gi|228728783|gb|EEL79793.1| Protein containing DUF124 [Bacillus cereus AH1271]
Length = 260
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|423549380|ref|ZP_17525707.1| TIGR00266 family protein [Bacillus cereus ISP3191]
gi|401191133|gb|EJQ98156.1| TIGR00266 family protein [Bacillus cereus ISP3191]
Length = 260
Score = 153 bits (387), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RLS R+
Sbjct: 180 TGCLVAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|229163863|ref|ZP_04291804.1| Protein containing DUF124 [Bacillus cereus R309803]
gi|228619605|gb|EEK76490.1| Protein containing DUF124 [Bacillus cereus R309803]
Length = 260
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 88/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPKESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|225866860|ref|YP_002752238.1| hypothetical protein BCA_4996 [Bacillus cereus 03BB102]
gi|225787587|gb|ACO27804.1| conserved hypothetical protein [Bacillus cereus 03BB102]
Length = 260
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|229199033|ref|ZP_04325718.1| Protein containing DUF124 [Bacillus cereus m1293]
gi|228584446|gb|EEK42579.1| Protein containing DUF124 [Bacillus cereus m1293]
Length = 263
Score = 153 bits (387), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 9 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 67 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 187 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241
>gi|169344060|ref|ZP_02865048.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens C
str. JGS1495]
gi|169297795|gb|EDS79892.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens C
str. JGS1495]
Length = 266
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 132/232 (56%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
I F++ G + Q +I L P+E VVA G+M M S+EME I+ +
Sbjct: 7 IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 66
Query: 72 -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ L G+++ N G V A+P +I+P+DL +GG+L+CQ DAFLC+
Sbjct: 67 GAAKRVLTGESLFMTAFTNMGAGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G G+AFI AGG++V+K L G+ + VD C+
Sbjct: 127 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T VN I+Y I+ AVFGG+ + A + GPG V+IQSLPF RL+ ++
Sbjct: 187 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 238
>gi|423613040|ref|ZP_17588901.1| TIGR00266 family protein [Bacillus cereus VD107]
gi|401243511|gb|EJR49881.1| TIGR00266 family protein [Bacillus cereus VD107]
Length = 260
Score = 153 bits (386), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 88/243 (36%), Positives = 134/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGQSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G G AF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGFAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|229141601|ref|ZP_04270133.1| Protein containing DUF124 [Bacillus cereus BDRD-ST26]
gi|228641881|gb|EEK98180.1| Protein containing DUF124 [Bacillus cereus BDRD-ST26]
Length = 263
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 9 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 67 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 187 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241
>gi|42784066|ref|NP_981313.1| hypothetical protein BCE_5020 [Bacillus cereus ATCC 10987]
gi|217962351|ref|YP_002340923.1| hypothetical protein BCAH187_A5002 [Bacillus cereus AH187]
gi|375286872|ref|YP_005107311.1| hypothetical protein BCN_4778 [Bacillus cereus NC7401]
gi|402555001|ref|YP_006596272.1| hypothetical protein BCK_10835 [Bacillus cereus FRI-35]
gi|423355348|ref|ZP_17332973.1| TIGR00266 family protein [Bacillus cereus IS075]
gi|423571089|ref|ZP_17547334.1| TIGR00266 family protein [Bacillus cereus MSX-A12]
gi|42739997|gb|AAS43921.1| conserved hypothetical protein [Bacillus cereus ATCC 10987]
gi|217067731|gb|ACJ81981.1| conserved hypothetical protein [Bacillus cereus AH187]
gi|358355399|dbj|BAL20571.1| conserved hypothetical protein [Bacillus cereus NC7401]
gi|401083970|gb|EJP92221.1| TIGR00266 family protein [Bacillus cereus IS075]
gi|401202546|gb|EJR09399.1| TIGR00266 family protein [Bacillus cereus MSX-A12]
gi|401796211|gb|AFQ10070.1| hypothetical protein BCK_10835 [Bacillus cereus FRI-35]
Length = 260
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|423484455|ref|ZP_17461145.1| TIGR00266 family protein [Bacillus cereus BAG6X1-2]
gi|401138615|gb|EJQ46183.1| TIGR00266 family protein [Bacillus cereus BAG6X1-2]
Length = 260
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 136/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ + G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|423573437|ref|ZP_17549556.1| TIGR00266 family protein [Bacillus cereus MSX-D12]
gi|401214984|gb|EJR21705.1| TIGR00266 family protein [Bacillus cereus MSX-D12]
Length = 260
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|196041505|ref|ZP_03108798.1| conserved hypothetical protein [Bacillus cereus NVH0597-99]
gi|228948607|ref|ZP_04110886.1| Protein containing DUF124 [Bacillus thuringiensis serovar monterrey
BGSC 4AJ1]
gi|229093979|ref|ZP_04225070.1| Protein containing DUF124 [Bacillus cereus Rock3-42]
gi|196027753|gb|EDX66367.1| conserved hypothetical protein [Bacillus cereus NVH0597-99]
gi|228689461|gb|EEL43275.1| Protein containing DUF124 [Bacillus cereus Rock3-42]
gi|228811106|gb|EEM57448.1| Protein containing DUF124 [Bacillus thuringiensis serovar monterrey
BGSC 4AJ1]
Length = 260
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|168204486|ref|ZP_02630491.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens E
str. JGS1987]
gi|170663922|gb|EDT16605.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens E
str. JGS1987]
Length = 265
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/231 (38%), Positives = 132/231 (57%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
I F++ G + Q +I L P+E VVA G+M M S+EME I+ +
Sbjct: 7 IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSGSGGFFGKLGG 66
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ N G V A+P +I+P+DL +GG+L+CQ DAFLC+
Sbjct: 67 AAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAAK 126
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ G EGF+ QKL G G+AFI AGG++V+K L G+ + VD C+V
Sbjct: 127 GVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCLV 186
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A+T VN I+Y I+ AVFGG+ + A + GPG V+IQSLPF RL+ ++
Sbjct: 187 AMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 237
>gi|206976957|ref|ZP_03237858.1| conserved hypothetical protein [Bacillus cereus H3081.97]
gi|384182680|ref|YP_005568442.1| hypothetical protein YBT020_23980 [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|423373182|ref|ZP_17350521.1| TIGR00266 family protein [Bacillus cereus AND1407]
gi|206744762|gb|EDZ56168.1| conserved hypothetical protein [Bacillus cereus H3081.97]
gi|324328764|gb|ADY24024.1| hypothetical protein YBT020_23980 [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|401096886|gb|EJQ04922.1| TIGR00266 family protein [Bacillus cereus AND1407]
Length = 260
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|373855886|ref|ZP_09598632.1| protein of unknown function DUF124 [Bacillus sp. 1NLA3E]
gi|372454955|gb|EHP28420.1| protein of unknown function DUF124 [Bacillus sp. 1NLA3E]
Length = 264
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 85/232 (36%), Positives = 130/232 (56%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I ++I G + Q ++ L PQE V+A GS M G ++ME I+ ++ + GK +
Sbjct: 6 IDYKIYGDDMQFVEVELDPQETVIAEAGSFMMMDGDIQMETIFGDGSQGSETGGIMGKLL 65
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
++ N + V AAP ++P+DL+ G+++CQ DAFL +
Sbjct: 66 SAGKRVITGESLFMTAFTNHANTKKHVSFAAPYPGSVIPMDLSKLNGKIICQKDAFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V ++V G EGF+ QKL G GLAF+ AGG++ +K L G++I VD C+
Sbjct: 126 KGVSVGIEFQRKVGAGFFGGEGFIMQKLEGDGLAFVHAGGTIHKKELASGDMIRVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+TS VN I++ ++ A+FGG+ L A + GPG V+IQSLPF RL+ R+
Sbjct: 186 VAMTSGVNYDIEFVKGVKTALFGGEGLFFATLRGPGTVWIQSLPFSRLASRV 237
>gi|334134590|ref|ZP_08508094.1| TIGR00266 family protein [Paenibacillus sp. HGF7]
gi|333607745|gb|EGL19055.1| TIGR00266 family protein [Paenibacillus sp. HGF7]
Length = 260
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 91/231 (39%), Positives = 132/231 (57%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
I + I G E Q +I L PQE V+A GSM M G++ ME I+ + +G
Sbjct: 6 IDYIIHGNEMQYVEIELDPQESVIAEAGSMMMMEGNIGMETIFGDGSGGGGRGLMGKLMG 65
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ V N G V AAP +I+P+DL GG+L+CQ D+FLC+
Sbjct: 66 AGKRLLTGESLFMTVYTNNGRGKEHVTFAAPYPGKIIPMDLHALGGKLICQKDSFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + L +++ G EGF+ QKL G GL F+ A GS++++ L+ GEV+ VD C V
Sbjct: 126 GVSIGIELQRKLGAGFFGGEGFIMQKLEGDGLGFVHACGSILERELQPGEVLRVDTGCFV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A+T V I++ I+ AVFGG+ L V GPG V++QSLPF RL++R+
Sbjct: 186 AMTPDVQYDIEFVKGIKTAVFGGEGLFFVTVRGPGKVWLQSLPFSRLAERV 236
>gi|228923617|ref|ZP_04086897.1| Protein containing DUF124 [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|423583077|ref|ZP_17559188.1| TIGR00266 family protein [Bacillus cereus VD014]
gi|423634248|ref|ZP_17609901.1| TIGR00266 family protein [Bacillus cereus VD156]
gi|228836086|gb|EEM81447.1| Protein containing DUF124 [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|401210386|gb|EJR17138.1| TIGR00266 family protein [Bacillus cereus VD014]
gi|401281034|gb|EJR86948.1| TIGR00266 family protein [Bacillus cereus VD156]
Length = 260
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|228955143|ref|ZP_04117157.1| Protein containing DUF124 [Bacillus thuringiensis serovar kurstaki
str. T03a001]
gi|228804554|gb|EEM51159.1| Protein containing DUF124 [Bacillus thuringiensis serovar kurstaki
str. T03a001]
Length = 263
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 5 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 62
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 63 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 122
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 123 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 182
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 183 TGCLVAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 238
Query: 247 TSP 249
TSP
Sbjct: 239 TSP 241
>gi|168212760|ref|ZP_02638385.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
CPE str. F4969]
gi|170715718|gb|EDT27900.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
CPE str. F4969]
Length = 265
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 133/232 (57%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
I F++ G + Q +I L P+E VVA G+M M S+EME+I+ +
Sbjct: 6 IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMESIFGDGSNSSGSGGFFGKLG 65
Query: 72 -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ L G+++ N G V A+P +I+P+DL +GG+L+CQ DAFLC+
Sbjct: 66 GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G G+AFI AGG++V+K L G+ + VD C+
Sbjct: 126 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T VN I+Y I+ AVFGG+ + A + GPG V+IQSLPF RL+ ++
Sbjct: 186 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 237
>gi|404329598|ref|ZP_10970046.1| hypothetical protein SvinD2_05844 [Sporolactobacillus vineae DSM
21990 = SL153]
Length = 263
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---------IPENEVGM 73
I ++I G E Q + L P E V+A GS M + ME ++ + + G
Sbjct: 6 IDYRIEGQEMQFVTVELDPGETVIAEAGSFMMMDEGIGMETLFGDGSSSSGGLMDKLFGA 65
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ N G V A+P +I+P+DL+ GG+++CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTAFTNNGSGKKHVSFASPYPGKIIPLDLSALGGQMICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAF+ AGG + + +L GE + VD C+VA
Sbjct: 126 VSVGIAFQKKLSTGFFGGEGFIMQKLEGDGLAFVHAGGMIKEIDLAAGETLRVDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+T + I++ ++ A+FGG+ L A + GPG V +QSLPF RL+ R+ A+
Sbjct: 186 LTQETDYDIEFVRGVKTALFGGEGLFFATLRGPGKVLVQSLPFSRLASRVFSAM 239
>gi|120435350|ref|YP_861036.1| hypothetical protein GFO_0995 [Gramella forsetii KT0803]
gi|117577500|emb|CAL65969.1| protein containing DUF124 [Gramella forsetii KT0803]
Length = 277
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 84/234 (35%), Positives = 133/234 (56%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENE------VGM 73
I +QI G E Q ++ L P+E V+A G+ M ++M+ I+ +NE +G
Sbjct: 17 IDYQIYGEEMQYVELELDPREAVIAEAGNFMMMDDGIKMDTIFGDGSKQNEGFLGKVLGA 76
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + N + A+P +++PIDL F G+ +CQ DAFLC+
Sbjct: 77 GKRLLTGESLFMTIFSNEIQGKKKISFASPYPGKVIPIDLTRFNGKFICQKDAFLCAAKG 136
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ G EGF+ QK+ G G+AF+ +GG++ +K L+ GE + VD CI+
Sbjct: 137 VSIGIEFSKKLGRGFFGGEGFIMQKVEGDGMAFVHSGGTMAKKELQSGEKLKVDTGCIIG 196
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T V+ I++ G IR +FGG+ L A +TGPG V+IQSLPF RL+ RI +A
Sbjct: 197 FTQGVDYDIEFVGGIRNTLFGGEGLFFATLTGPGTVYIQSLPFSRLASRIHQAA 250
>gi|406878089|gb|EKD27110.1| hypothetical protein ACD_79C00869G0003 [uncultured bacterium]
Length = 261
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/230 (37%), Positives = 132/230 (57%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPEN-EVGMWQWLF-- 78
I ++I G + Q +I L EKVVA G+M +M S+ E + N E G+ LF
Sbjct: 6 IDYKIFGDDMQFVEIELDSMEKVVAEAGAMMYMDDSISFEAKMGDGSNPEAGLMDKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G S V AAP +I+P++LA + E++CQ DAFLC+
Sbjct: 66 GKRAIMGESLFMTHFTNNGKSKKRVAFAAPYPGKIIPLNLASYNEEIICQKDAFLCAALG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+++ T +R + G EGF+ +KL+G G+ F+ AGG+V++K L E I VD CIVA
Sbjct: 126 TQINITFTKRFGAGLFGGEGFILEKLSGDGMVFVHAGGTVIEKELN-NEKILVDTGCIVA 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
S+++ I+ G ++ VFGG+ L A ++G G +++QSLPF RLS RI
Sbjct: 185 FESNISYDIERAGNLKSMVFGGEGLFLATLSGKGKIWLQSLPFSRLSDRI 234
>gi|423427001|ref|ZP_17404032.1| TIGR00266 family protein [Bacillus cereus BAG3X2-2]
gi|423502446|ref|ZP_17479038.1| TIGR00266 family protein [Bacillus cereus HD73]
gi|449091835|ref|YP_007424276.1| Protein containing DUF124 [Bacillus thuringiensis serovar kurstaki
str. HD73]
gi|401109916|gb|EJQ17834.1| TIGR00266 family protein [Bacillus cereus BAG3X2-2]
gi|402460287|gb|EJV92009.1| TIGR00266 family protein [Bacillus cereus HD73]
gi|449025592|gb|AGE80755.1| Protein containing DUF124 [Bacillus thuringiensis serovar kurstaki
str. HD73]
Length = 260
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 180 TGCLVAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|228961134|ref|ZP_04122760.1| Protein containing DUF124 [Bacillus thuringiensis serovar pakistani
str. T13001]
gi|229048574|ref|ZP_04194134.1| Protein containing DUF124 [Bacillus cereus AH676]
gi|229112328|ref|ZP_04241867.1| Protein containing DUF124 [Bacillus cereus Rock1-15]
gi|229130143|ref|ZP_04259104.1| Protein containing DUF124 [Bacillus cereus BDRD-Cer4]
gi|229147435|ref|ZP_04275784.1| Protein containing DUF124 [Bacillus cereus BDRD-ST24]
gi|229153071|ref|ZP_04281252.1| Protein containing DUF124 [Bacillus cereus m1550]
gi|229158482|ref|ZP_04286543.1| Protein containing DUF124 [Bacillus cereus ATCC 4342]
gi|229181181|ref|ZP_04308513.1| Protein containing DUF124 [Bacillus cereus 172560W]
gi|229193147|ref|ZP_04320102.1| Protein containing DUF124 [Bacillus cereus ATCC 10876]
gi|228590411|gb|EEK48275.1| Protein containing DUF124 [Bacillus cereus ATCC 10876]
gi|228602372|gb|EEK59861.1| Protein containing DUF124 [Bacillus cereus 172560W]
gi|228625001|gb|EEK81767.1| Protein containing DUF124 [Bacillus cereus ATCC 4342]
gi|228630491|gb|EEK87139.1| Protein containing DUF124 [Bacillus cereus m1550]
gi|228636117|gb|EEK92598.1| Protein containing DUF124 [Bacillus cereus BDRD-ST24]
gi|228653358|gb|EEL09235.1| Protein containing DUF124 [Bacillus cereus BDRD-Cer4]
gi|228671168|gb|EEL26473.1| Protein containing DUF124 [Bacillus cereus Rock1-15]
gi|228722776|gb|EEL74161.1| Protein containing DUF124 [Bacillus cereus AH676]
gi|228798577|gb|EEM45564.1| Protein containing DUF124 [Bacillus thuringiensis serovar pakistani
str. T13001]
Length = 263
Score = 152 bits (385), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 9 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 67 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 187 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241
>gi|168216590|ref|ZP_02642215.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
NCTC 8239]
gi|182624521|ref|ZP_02952304.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens D
str. JGS1721]
gi|422347228|ref|ZP_16428141.1| TIGR00266 family protein [Clostridium perfringens WAL-14572]
gi|422872907|ref|ZP_16919392.1| hypothetical protein HA1_01642 [Clostridium perfringens F262]
gi|177910329|gb|EDT72710.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens D
str. JGS1721]
gi|182381439|gb|EDT78918.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
NCTC 8239]
gi|373225140|gb|EHP47475.1| TIGR00266 family protein [Clostridium perfringens WAL-14572]
gi|380306165|gb|EIA18439.1| hypothetical protein HA1_01642 [Clostridium perfringens F262]
Length = 266
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 132/232 (56%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
I F++ G + Q +I L P+E VVA G+M M S+EME I+ +
Sbjct: 7 IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 66
Query: 72 -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ L G+++ N G V A+P +I+P+DL +GG+L+CQ DAFLC+
Sbjct: 67 GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G G+AFI AGG++V+K L G+ + VD C+
Sbjct: 127 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T VN I+Y I+ AVFGG+ + A + GPG V+IQSLPF RL+ ++
Sbjct: 187 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 238
>gi|110803643|ref|YP_697641.1| hypothetical protein CPR_0312 [Clostridium perfringens SM101]
gi|110684144|gb|ABG87514.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
SM101]
Length = 265
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 89/232 (38%), Positives = 132/232 (56%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
I F++ G + Q +I L P+E VVA G+M M S+EME I+ +
Sbjct: 6 IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 65
Query: 72 -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ L G+++ N G V A+P +I+P+DL +GG+L+CQ DAFLC+
Sbjct: 66 GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G G+AFI AGG++V+K L G+ + VD C+
Sbjct: 126 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T VN I+Y I+ AVFGG+ + A + GPG V+IQSLPF RL+ ++
Sbjct: 186 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 237
>gi|206969526|ref|ZP_03230480.1| conserved hypothetical protein [Bacillus cereus AH1134]
gi|218234045|ref|YP_002369675.1| hypothetical protein BCB4264_A4991 [Bacillus cereus B4264]
gi|229072369|ref|ZP_04205573.1| Protein containing DUF124 [Bacillus cereus F65185]
gi|296505327|ref|YP_003667027.1| hypothetical protein BMB171_C4499 [Bacillus thuringiensis BMB171]
gi|365158352|ref|ZP_09354548.1| TIGR00266 family protein [Bacillus sp. 7_6_55CFAA_CT2]
gi|423411345|ref|ZP_17388465.1| TIGR00266 family protein [Bacillus cereus BAG3O-2]
gi|423432869|ref|ZP_17409873.1| TIGR00266 family protein [Bacillus cereus BAG4O-1]
gi|423438300|ref|ZP_17415281.1| TIGR00266 family protein [Bacillus cereus BAG4X12-1]
gi|423631593|ref|ZP_17607340.1| TIGR00266 family protein [Bacillus cereus VD154]
gi|423640067|ref|ZP_17615685.1| TIGR00266 family protein [Bacillus cereus VD166]
gi|423650762|ref|ZP_17626332.1| TIGR00266 family protein [Bacillus cereus VD169]
gi|423657816|ref|ZP_17633115.1| TIGR00266 family protein [Bacillus cereus VD200]
gi|206735214|gb|EDZ52382.1| conserved hypothetical protein [Bacillus cereus AH1134]
gi|218162002|gb|ACK61994.1| conserved hypothetical protein [Bacillus cereus B4264]
gi|228710794|gb|EEL62765.1| Protein containing DUF124 [Bacillus cereus F65185]
gi|296326379|gb|ADH09307.1| hypothetical protein BMB171_C4499 [Bacillus thuringiensis BMB171]
gi|363626887|gb|EHL77850.1| TIGR00266 family protein [Bacillus sp. 7_6_55CFAA_CT2]
gi|401107400|gb|EJQ15347.1| TIGR00266 family protein [Bacillus cereus BAG3O-2]
gi|401114325|gb|EJQ22187.1| TIGR00266 family protein [Bacillus cereus BAG4O-1]
gi|401118680|gb|EJQ26510.1| TIGR00266 family protein [Bacillus cereus BAG4X12-1]
gi|401263730|gb|EJR69852.1| TIGR00266 family protein [Bacillus cereus VD154]
gi|401281433|gb|EJR87345.1| TIGR00266 family protein [Bacillus cereus VD169]
gi|401282091|gb|EJR87995.1| TIGR00266 family protein [Bacillus cereus VD166]
gi|401288827|gb|EJR94566.1| TIGR00266 family protein [Bacillus cereus VD200]
Length = 260
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|229082124|ref|ZP_04214596.1| Protein containing DUF124 [Bacillus cereus Rock4-2]
gi|228701181|gb|EEL53695.1| Protein containing DUF124 [Bacillus cereus Rock4-2]
Length = 263
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 9 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 67 GAGKRLVTGESMFMTVFTNTGRGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 187 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241
>gi|114568603|ref|YP_755283.1| hypothetical protein Mmar10_0049 [Maricaulis maris MCS10]
gi|114339065|gb|ABI64345.1| protein of unknown function DUF124 [Maricaulis maris MCS10]
Length = 263
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 131/234 (55%), Gaps = 10/234 (4%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW-- 74
Q I ++I G + Q +I L P E VV+ G+M F + S+ ME ++ + G+W
Sbjct: 3 QTDEIDYEIGGEDIQYVEIELDPGESVVSEAGAMMFKAPSITMEAVFGDGSQKNQGVWGA 62
Query: 75 ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
+ L G+++ V + G V AAP I+P+ L GGEL+CQ D+FL
Sbjct: 63 LAGAGKRLLTGESLFMTVFTHGGQGKAKVAFAAPYPGTIIPVHLPDLGGELICQKDSFLA 122
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
+ V V L +R+ + G EGF+ Q+L G G F+ AGG+++++ L GE + VD
Sbjct: 123 AAKGVSVGIALQKRIMTGLFGGEGFIMQRLEGDGWVFVHAGGTLIERELAAGEELHVDTG 182
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
C+VA T V+ +I+ G I+ +FGG+ + A + GPG V+IQS+PF RL+ RI
Sbjct: 183 CVVAYTPDVDFEIERAGNIKSMIFGGEGVFFARLRGPGKVWIQSMPFSRLAGRI 236
>gi|168210434|ref|ZP_02636059.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens B
str. ATCC 3626]
gi|170711528|gb|EDT23710.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens B
str. ATCC 3626]
Length = 266
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/232 (37%), Positives = 132/232 (56%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
I F++ G + Q +I L P+E VVA G+M M S+EME I+ +
Sbjct: 7 IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 66
Query: 72 -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ L G+++ N G V A+P +I+P+DL +GG+L+CQ DAFLC+
Sbjct: 67 GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 126
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G G+AFI AGG++++K L G+ + VD C+
Sbjct: 127 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIIRKRLLPGQKLKVDTGCL 186
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T VN I+Y I+ AVFGG+ + A + GPG V+IQSLPF RL+ ++
Sbjct: 187 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 238
>gi|110799859|ref|YP_694778.1| hypothetical protein CPF_0318 [Clostridium perfringens ATCC 13124]
gi|110674506|gb|ABG83493.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
ATCC 13124]
Length = 265
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 88/232 (37%), Positives = 132/232 (56%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
I F++ G + Q +I L P+E VVA G+M M S+EME I+ +
Sbjct: 6 IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 65
Query: 72 -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ L G+++ N G V A+P +I+P+DL +GG+L+CQ DAFLC+
Sbjct: 66 GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G G+AFI AGG++++K L G+ + VD C+
Sbjct: 126 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIIRKRLLPGQKLKVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T VN I+Y I+ AVFGG+ + A + GPG V+IQSLPF RL+ ++
Sbjct: 186 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 237
>gi|357009687|ref|ZP_09074686.1| hypothetical protein PelgB_09421 [Paenibacillus elgii B69]
Length = 262
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 90/242 (37%), Positives = 136/242 (56%), Gaps = 10/242 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----VGMW--- 74
+ ++I+G E Q +I L P E V A G+M M +++ME I+ ++ VG
Sbjct: 6 VDYRIIGSEMQFVEIELDPGETVFAEAGAMMMMDQNIQMETIFGDGSDQNKGFVGKLFGA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ N G V A+P RILP+DL G+L+CQ DAFLC+
Sbjct: 66 GKRLLSGESLFMTAFTNRGSGKECVSFASPYPGRILPMDLVELNGKLICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QK+ G GLAF+ AGG++ ++ L GE+I VD C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKIEGDGLAFVHAGGAIHERVLGPGEMIRVDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
+T VN I++ ++ A+FGG+ L A + GPG V+IQSLPF RL+ R+ A + +
Sbjct: 186 MTQEVNYDIEFVKGVKTAIFGGEGLFFATLRGPGKVWIQSLPFSRLADRVLSASRTGGRK 245
Query: 253 EN 254
E
Sbjct: 246 EE 247
>gi|30022928|ref|NP_834559.1| hypothetical protein BC4860 [Bacillus cereus ATCC 14579]
gi|29898487|gb|AAP11760.1| hypothetical protein BC_4860 [Bacillus cereus ATCC 14579]
Length = 260
Score = 151 bits (382), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSLGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
VA+T +N +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238
>gi|108759112|ref|YP_628683.1| hypothetical protein MXAN_0412 [Myxococcus xanthus DK 1622]
gi|108462992|gb|ABF88177.1| conserved hypothetical protein TIGR00266 [Myxococcus xanthus DK
1622]
Length = 265
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 87/243 (35%), Positives = 129/243 (53%), Gaps = 11/243 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F I G + Q ++ L P E VA G++ +M +EME I+ E G + L G
Sbjct: 7 VDFHIYGEDLQFVEVELDPGEAAVAEAGTLMYMEDGIEMETIFGDGSEKTSGFFGSLLGA 66
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
T+V L G V AAP +I+P++L+ GGEL+ Q D+FL +
Sbjct: 67 GKRLLTGESLFTTVFLNKSGSGKRKVSFAAPYPGKIIPVNLSELGGELIAQKDSFLAAAK 126
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ Q+L G GLAFI AGG++ ++ L GE++ VD CIV
Sbjct: 127 GVSLGIAFQKKLGTGLFGGEGFIMQRLQGDGLAFIHAGGTLHERTLGPGEMLRVDTGCIV 186
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A +V+ I+ I+ A FGG+ L A + GPG V++QSLPF RL+ RI A
Sbjct: 187 AFQPTVDYDIQMVSGIKTAFFGGEGLFFATLRGPGKVWLQSLPFSRLAGRILSASRPGGA 246
Query: 252 REN 254
R+
Sbjct: 247 RDE 249
>gi|170705443|ref|ZP_02895907.1| conserved hypothetical protein [Bacillus anthracis str. A0389]
gi|170129568|gb|EDS98431.1| conserved hypothetical protein [Bacillus anthracis str. A0389]
Length = 260
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 89/243 (36%), Positives = 134/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +E E I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIETETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN +++ G ++ A+FGG+ L A + GPG V+IQSL RLS R+
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 235
Query: 247 TSP 249
TSP
Sbjct: 236 TSP 238
>gi|332665477|ref|YP_004448265.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332334291|gb|AEE51392.1| protein of unknown function DUF124 [Haliscomenobacter hydrossis DSM
1100]
Length = 264
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/234 (35%), Positives = 129/234 (55%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLF- 78
I + I G E Q +I L P E V+A GS M +EM ++ + G+W L
Sbjct: 6 IDYHIHGEEMQFVEIELDPMETVIAEAGSFMMMDDGIEMATLFGDGSQSDNGGLWGKLLS 65
Query: 79 -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ + V A+P +I+P+DL+ G+++CQ DAFLC+
Sbjct: 66 AGKRLITGESLFMTAYTHASDGKKRVSFASPYPGKIIPLDLSKLDGKIICQKDAFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V Q++ G EGF+ QKL G G+AF+ AGGS+++K LE+GEV+ +D C+V
Sbjct: 126 GVSVGIEFSQKLGRGFFGGEGFIMQKLEGDGMAFVHAGGSIIEKKLELGEVLKIDTGCLV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
A T V+ I+ +R +FGG+ + A + GPG V+IQSLPF R++ R+ +A
Sbjct: 186 AFTRDVDYDIEMVRGVRNMLFGGEGIFLARLEGPGTVWIQSLPFSRMADRVLQA 239
>gi|298208358|ref|YP_003716537.1| hypothetical protein CA2559_08951 [Croceibacter atlanticus
HTCC2559]
gi|83848279|gb|EAP86149.1| hypothetical protein CA2559_08951 [Croceibacter atlanticus
HTCC2559]
Length = 267
Score = 151 bits (382), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 88/235 (37%), Positives = 127/235 (54%), Gaps = 11/235 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
I + I G E Q +I L PQE V+A GS M ++M+ I+ E G + + G
Sbjct: 6 IDYHIYGEEMQYVEIELDPQEGVIAEAGSFMMMDSGIKMDTIFGDGSEQNKGFLKSILGA 65
Query: 80 --KTITSVVL------RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ +T L + A+P +ILPIDL+ FG + +CQ DAFLC+
Sbjct: 66 GKRLLTGESLFMTAFYHAETAGKKRLSFASPYPGKILPIDLSRFGNKFICQKDAFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ QKL G G+AF+ AGG+ QK L GE + VD CI+
Sbjct: 126 GVSVGVEFSKKLGRGFFGGEGFIMQKLEGVGMAFVHAGGTTAQKELAPGETLKVDTGCII 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T VN +++ G I+ ++FGG+ L A +TGPG V++QSLPF RL+ R+ A
Sbjct: 186 GFTKDVNYDVEFVGGIKNSLFGGEGLFFATLTGPGTVYVQSLPFSRLASRVWAAA 240
>gi|405356330|ref|ZP_11025350.1| DUF124 domain-containing protein [Chondromyces apiculatus DSM 436]
gi|397090926|gb|EJJ21767.1| DUF124 domain-containing protein [Myxococcus sp. (contaminant ex
DSM 436)]
Length = 265
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 86/243 (35%), Positives = 129/243 (53%), Gaps = 11/243 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F I G + Q ++ L P E VA G++ +M +EME I+ + G + L G
Sbjct: 7 VDFHIYGEDLQFVEVELDPGEAAVAEAGTLMYMEDGIEMETIFGDGSQQSSGFFGSLLGA 66
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
T+V G V AAP +I+P++L GGEL+ Q D+FL +
Sbjct: 67 GKRLLTGESLFTTVFFNKSGRGKRKVSFAAPYPGKIIPVNLGELGGELIAQKDSFLAAAK 126
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ Q+L G GLAFI AGG++ ++ L GE++ VD CIV
Sbjct: 127 GVSLGIAFQKKLGTGLFGGEGFIMQRLQGDGLAFIHAGGTLHERTLGPGELLRVDTGCIV 186
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A +V+ I+ I+ A+FGG+ L A + GPG V++QSLPF RL+ RI A S
Sbjct: 187 AFQPTVDYDIQLVSGIKTALFGGEGLFFATLRGPGKVWLQSLPFSRLAGRILSASRSGGA 246
Query: 252 REN 254
R+
Sbjct: 247 RDE 249
>gi|421616380|ref|ZP_16057394.1| hypothetical protein B597_05767 [Pseudomonas stutzeri KOS6]
gi|409781763|gb|EKN61340.1| hypothetical protein B597_05767 [Pseudomonas stutzeri KOS6]
Length = 248
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 88/232 (37%), Positives = 133/232 (57%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
+ ++ILG AQ +I+L P E V+A G+M +M+G + E + G+ L+G
Sbjct: 6 LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTGGVRFEARMGDGSSSGVLGKLWGMGK 65
Query: 80 KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ IT L N G V AAP ++PIDLA FGG+L+CQ DAFLC+ +
Sbjct: 66 RMITGESLFLTHFSNQGDGPARVAFAAPYPGTVVPIDLAAFGGQLICQKDAFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ Q+L G GLAF+ AGG+V++K L E + +D C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLEGDGLAFVHAGGTVIRKELN-NETLRLDSGCLVAFT 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ I+ G +R +FGG+ L+ + G G V+IQSLPF RL++R+ A
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGSGSVWIQSLPFSRLAERVYEAT 236
>gi|228988126|ref|ZP_04148225.1| Protein containing DUF124 [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228771624|gb|EEM20091.1| Protein containing DUF124 [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 263
Score = 149 bits (377), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 87/243 (35%), Positives = 134/243 (55%), Gaps = 18/243 (7%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 5 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDFIEMETIF--GDGSGPSGGLF 62
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 63 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 122
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ V + +++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 123 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 182
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T + +++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+
Sbjct: 183 TGCLVAMTKDIKYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 238
Query: 247 TSP 249
TSP
Sbjct: 239 TSP 241
>gi|338530988|ref|YP_004664322.1| hypothetical protein LILAB_06635 [Myxococcus fulvus HW-1]
gi|337257084|gb|AEI63244.1| hypothetical protein LILAB_06635 [Myxococcus fulvus HW-1]
Length = 262
Score = 149 bits (377), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 86/234 (36%), Positives = 128/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLFG- 79
+ F I G + Q ++ L P E VA G++ +M +EME I+ +E G + L G
Sbjct: 4 VDFHIYGEDLQFVEVELDPGEAAVAEAGTLMYMEDGIEMETIFGDGSEKTGGFFGSLLGA 63
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
T+V L G V AAP +I+P++L+ GGEL+ Q D+FL +
Sbjct: 64 GKRLLTGESLFTTVFLNKSGRGKRKVSFAAPYPGKIIPVNLSELGGELIAQKDSFLAAAK 123
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ Q+L G GLAFI AGG++ ++ L GE++ VD CIV
Sbjct: 124 GVSLGIAFQKKLGTGLFGGEGFIMQRLQGDGLAFIHAGGTLHERTLGPGELLRVDTGCIV 183
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
A +V+ I+ I+ A FGG+ L A + GPG V++QSLPF RL+ RI A
Sbjct: 184 AFQPTVDYDIQMVSGIKTAFFGGEGLFFATLRGPGKVWLQSLPFSRLAGRILSA 237
>gi|381161030|ref|ZP_09870261.1| hypothetical protein Thi970DRAFT_04876 [Thiorhodovibrio sp. 970]
gi|380877266|gb|EIC19359.1| hypothetical protein Thi970DRAFT_04876 [Thiorhodovibrio sp. 970]
Length = 334
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 82/236 (34%), Positives = 126/236 (53%), Gaps = 12/236 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MWQWLF 78
+ ++I G E Q ++ L P E VA G+M + +++M+ ++ + G LF
Sbjct: 73 VDYEIFGHEMQFVEVELDPGESAVAEAGAMMYKHPAIQMDTVFGDGSHTGQGGGFMDRLF 132
Query: 79 G--------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G +++ + V + G V AP I+P+ L GG L+CQ D+FLC
Sbjct: 133 GAGKRLVTGESLFTTVFTHTGQGKARVAFGAPYPGSIIPVHLTDLGGTLICQKDSFLCGA 192
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ + G EGF+ Q+L G GL FI AGG V ++ LE GE + VD CI
Sbjct: 193 KGVAMGIFFQRKILTGLFGGEGFIMQRLDGDGLVFIHAGGCVTERVLEAGETLHVDTGCI 252
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
VA V ++ G I+ A+FGG+ + AV+TGPG V++QSLPF RL+ R+ A
Sbjct: 253 VAFEQKVGFDVQQVGGIKSALFGGEGVFFAVLTGPGRVWLQSLPFSRLAGRMLAAA 308
>gi|399521686|ref|ZP_10762426.1| Uncharacterized protein M6_Spy0233 [Pseudomonas pseudoalcaligenes
CECT 5344]
gi|399110924|emb|CCH38986.1| Uncharacterized protein M6_Spy0233 [Pseudomonas pseudoalcaligenes
CECT 5344]
Length = 256
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 131/232 (56%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W---- 74
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G+ W
Sbjct: 14 LEYEILGASAQSVEIILDPGETVIAEAGAMNYMTDGVRFETRMGDGSSSGLLGKLWSAGK 73
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ L G+++ N G S V AAP ++PIDLA GG L+CQ DAFLC+ +
Sbjct: 74 RMLTGESLFMTHFSNAGKSQARVAFAAPYPGTVVPIDLAKIGGRLICQKDAFLCAAHGTS 133
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ QKL G GLAF+ AGG+V++K L E + +D C+VA +
Sbjct: 134 IGISFAKRLGAGFFGGEGFILQKLEGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFS 192
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
S ++ I G ++ +FGG+ ++ + G G V+IQSLPF RL++R+ A
Sbjct: 193 SGIDYDIALAGGLKSMLFGGEGILLTTLKGTGTVWIQSLPFSRLAERVYAAT 244
>gi|229087392|ref|ZP_04219531.1| Protein containing DUF124 [Bacillus cereus Rock3-44]
gi|228695921|gb|EEL48767.1| Protein containing DUF124 [Bacillus cereus Rock3-44]
Length = 260
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 132/232 (56%), Gaps = 14/232 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I +++ G + Q +I L P+E V+A G+M M ++ME I+ + G LFGK +
Sbjct: 6 IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIDMETIF--GDGAGKSGGLFGKLM 63
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ V N G V AAP +I+P+DL + G+++CQ DAFLC+
Sbjct: 64 GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLREYQGKVVCQKDAFLCAA 123
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ + G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+
Sbjct: 124 KGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRIDTGCL 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T V+ I++ G ++ A+FGG+ L A + GPG V++QSL RL+ R+
Sbjct: 184 VAMTRDVDYDIQFVGNVKTALFGGEGLFFATLEGPGTVWLQSLTLSRLAARL 235
>gi|312879522|ref|ZP_07739322.1| protein of unknown function DUF124 [Aminomonas paucivorans DSM
12260]
gi|310782813|gb|EFQ23211.1| protein of unknown function DUF124 [Aminomonas paucivorans DSM
12260]
Length = 334
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 80/235 (34%), Positives = 132/235 (56%), Gaps = 11/235 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY----------IPENEVG 72
+ F++ G E Q ++ L P E VVA G+M + S+ M++++ + + +G
Sbjct: 74 VDFRVFGQEMQYVELELDPGESVVAEAGAMMYKDPSVHMDSVFGDGSASSGGTLMDKVLG 133
Query: 73 MWQW-LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + G G V AP ILP+ L GG L+CQ D+FLC+
Sbjct: 134 AGKRVLTGESLFMTLFSQSGAGKGRVAFGAPYPGCILPVSLPDLGGTLICQKDSFLCAAR 193
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ ++L G G+AF+ AGG+V++++L GE + VD C+V
Sbjct: 194 GVSIGIYFQKKILTGLFGGEGFIMERLEGDGMAFVHAGGTVLERDLAAGETVHVDTGCVV 253
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
SV+ I+ G I+ A+FGG+ L A +TGPG +++QSLPF RL+ R+ A
Sbjct: 254 GFVPSVSFDIQQVGNIKTALFGGEGLFFASLTGPGRIWLQSLPFSRLAGRMLAAA 308
>gi|222150709|ref|YP_002559862.1| hypothetical protein MCCL_0459 [Macrococcus caseolyticus JCSC5402]
gi|222119831|dbj|BAH17166.1| conserved hypothetical protein [Macrococcus caseolyticus JCSC5402]
Length = 261
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/235 (35%), Positives = 133/235 (56%), Gaps = 11/235 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM--------- 73
I ++I G + Q +I L P E V++ GSM ++ ++ ME I+ + G
Sbjct: 6 IDYKIYGDDMQYVEIELDPNETVISEAGSMMMINPNIHMETIFGDGSTSGGNGFMDKLFA 65
Query: 74 --WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ N G V AAP +I+PIDL+ FGGE +CQ DAFL +
Sbjct: 66 AGRRTLTGESLFMTTFTNRGTGKDHVYFAAPYPGKIIPIDLSQFGGEFICQKDAFLAAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++ L +R+ G EGF+ Q+L+G G+AFI AGG++++++L+ GE + VD C+V
Sbjct: 126 GVQIGIALQKRLGVGFFGGEGFIMQRLSGDGMAFIHAGGAIMKRDLQPGETLLVDTGCLV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+T V +I+ ++ +FGG+ + A + GPG V++QSLPF R + RI A
Sbjct: 186 GMTKDVAYEIETIPGLKTKLFGGEGIFLARLRGPGTVYVQSLPFSRFASRIFAAA 240
>gi|146308385|ref|YP_001188850.1| hypothetical protein Pmen_3366 [Pseudomonas mendocina ymp]
gi|421505032|ref|ZP_15951972.1| hypothetical protein A471_17213 [Pseudomonas mendocina DLHK]
gi|145576586|gb|ABP86118.1| protein of unknown function DUF124 [Pseudomonas mendocina ymp]
gi|400344255|gb|EJO92625.1| hypothetical protein A471_17213 [Pseudomonas mendocina DLHK]
Length = 248
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 131/232 (56%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W---- 74
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G+ W
Sbjct: 6 LDYEILGASAQSVEIVLDPGETVIAEAGAMNYMTDGVRFETRMGDGSSSGLLGKLWSAGK 65
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ L G+++ N G + V AAP ++PIDLA GG L+CQ DAFLC+ +
Sbjct: 66 RMLTGESLFMTHFSNGGKTQARVAFAAPYPGTVVPIDLAQVGGRLICQKDAFLCAAHGTS 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ QKL G GLAF+ AGG+V++K L E + +D C+VA +
Sbjct: 126 IGISFAKRIGAGFFGGEGFILQKLEGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFS 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
S ++ I G ++ +FGG+ L+ + G G V+IQSLPF RL++R+ A
Sbjct: 185 SGIDYDIALAGGLKSMLFGGEGLLLTTLKGTGTVWIQSLPFSRLAERVYAAT 236
>gi|345864473|ref|ZP_08816674.1| HTH DNA-binding protein [endosymbiont of Tevnia jerichonana (vent
Tica)]
gi|345879308|ref|ZP_08830971.1| hypothetical protein Rifp1Sym_eb00090 [endosymbiont of Riftia
pachyptila (vent Ph05)]
gi|344223680|gb|EGV50120.1| hypothetical protein Rifp1Sym_eb00090 [endosymbiont of Riftia
pachyptila (vent Ph05)]
gi|345124494|gb|EGW54373.1| HTH DNA-binding protein [endosymbiont of Tevnia jerichonana (vent
Tica)]
Length = 329
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/243 (34%), Positives = 131/243 (53%), Gaps = 11/243 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP----------ENEVG 72
I +QI G E+Q ++ L P E VA G+M + S+ M ++ + +G
Sbjct: 70 IDYQIHGHESQFVEVELDPGESAVAEAGAMMYKDASIGMTTLFGDGSGSDGGGFMDKLLG 129
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V + G V AAP I+PI LA GG L+CQ D+FLC+
Sbjct: 130 AGKRLLTGESLFTTVFTHNGSGKAKVAFAAPFPGTIIPISLADMGGRLICQKDSFLCAAK 189
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ QKL G GL F+ AGG+++++ L GE + VD C+V
Sbjct: 190 GVSIGIHFQKKILTGLFGGEGFIMQKLEGDGLVFVQAGGTLLERELAPGEQLHVDTGCLV 249
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
+ SV ++ G ++ A+FGG+ L A + GPG V++QSLPF RL+ R+ A
Sbjct: 250 GMQDSVQFDLQQAGGVKTALFGGEGLFFANLQGPGKVWLQSLPFSRLAGRMLEAAPQGGG 309
Query: 252 REN 254
R+
Sbjct: 310 RDQ 312
>gi|347750822|ref|YP_004858387.1| hypothetical protein Bcoa_0383 [Bacillus coagulans 36D1]
gi|347583340|gb|AEO99606.1| protein of unknown function DUF124 [Bacillus coagulans 36D1]
Length = 262
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/234 (36%), Positives = 135/234 (57%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I F+++G + Q ++ L PQE V+A GS+ M + ME I+ ++ G+ LF
Sbjct: 6 IDFKLVGDDMQFVEVELDPQETVIAEAGSLMMMEDGIRMETIFGDGSGSQGGIMGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G V AAP +I+P+DL+ GG+++CQ DAFL +
Sbjct: 66 GKRLITGESLFMTAFTNEGRDKKHVAFAAPYPGKIVPMDLSEQGGKIICQKDAFLAAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +++ G EGF+ QKL G GLAFI AGG++ ++ L GE + VD C+VA
Sbjct: 126 VSVGVEFQRKLGTGFFGGEGFIMQKLEGDGLAFIHAGGTIQKRELAPGETLKVDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ +++ I++ I+ A+FGG+ L A + GPG V++QSLPF RL+ R+ A+
Sbjct: 186 MSRTIDYDIEFVKGIKTALFGGEGLFFATLRGPGSVWVQSLPFSRLASRVFAAM 239
>gi|397687729|ref|YP_006525048.1| hypothetical protein PSJM300_13140 [Pseudomonas stutzeri DSM 10701]
gi|395809285|gb|AFN78690.1| hypothetical protein PSJM300_13140 [Pseudomonas stutzeri DSM 10701]
Length = 248
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 132/232 (56%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W---- 74
+ ++ILGG AQ +I+L P E V+A G+M +M+ + E + GM W
Sbjct: 6 LEYEILGGSAQSVEIILDPGETVIAEAGAMNYMTEGVRFEARMGDGSSNGMLSKLWGMGK 65
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ L G+++ N G V AAP ++P+DLA GG L+CQ D+FLC+ +
Sbjct: 66 RMLTGESLFLTHFSNQGKGKARVAFAAPYPGTVVPVDLAAVGGRLICQKDSFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
V + +R+ G EGF+ Q+L G GLAF+ AGG+V++K L E + +D C+VA T
Sbjct: 126 VGISFSKRLGAGFFGGEGFILQRLEGDGLAFVHAGGTVIRKELNE-ETLRLDTGCLVAFT 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
S ++ I+ G ++ +FGG+ ++ + G G V+IQSLPF RL++R+ A
Sbjct: 185 SGIDYDIQLAGGLKSMLFGGEGILLTTLKGTGSVWIQSLPFSRLAERVYEAT 236
>gi|84688037|ref|ZP_01015898.1| hypothetical protein 1099457000243_RB2654_01510 [Maritimibacter
alkaliphilus HTCC2654]
gi|84663943|gb|EAQ10446.1| hypothetical protein RB2654_01510 [Rhodobacterales bacterium
HTCC2654]
Length = 263
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 85/235 (36%), Positives = 129/235 (54%), Gaps = 12/235 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-----------NIYIPENEV 71
+ ++I G + Q+ ++ L P E V+A G+M +M + E I V
Sbjct: 6 VDYEIFGDDMQIVEVELDPAETVIAEAGAMNYMDDGIGFETRMGDGARPSGGIMDSLLNV 65
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G + L G++I N G V AAP +I+P+D+A GGEL+CQ DAFLC+
Sbjct: 66 G-KRVLTGESIFMTHFTNNGQGKKRVAFAAPYPGKIIPVDMAEMGGELICQKDAFLCAAF 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
+++ G EGF+ Q+L G G+AFI GG+V+++ L GEV+ VD C+
Sbjct: 125 GTTTDIAFQRKLGAGFFGGEGFILQRLNGDGMAFIHVGGTVIKRELAPGEVLRVDTGCLA 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
A+T SVN I+ G ++ +FGG+ + A + GPG V++QSLPF RL+ R+ RA
Sbjct: 185 AMTGSVNYDIERAGNLKSMIFGGEGIFLATLRGPGTVYLQSLPFSRLADRVLRAA 239
>gi|253576664|ref|ZP_04853991.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251843874|gb|EES71895.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 261
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 132/233 (56%), Gaps = 10/233 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
I ++I+G E Q ++ L E VVA GS M + ME I+ +
Sbjct: 6 IDYRIIGSEMQCVEVHLDQGESVVAEAGSFMMMDPDIRMETIFGDGSNQGGGLMGKLMGA 65
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+ + V N G V A+P +I+P+DL M GG+++CQ DAFLC+
Sbjct: 66 GKRLLTGEGLFMTVFTNAGYQRQAVTFASPYPGKIVPLDLQMLGGKVICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +R+ G EGF+ QK+ G GLAF+ +GG V++++L GEV+ +D C+VA
Sbjct: 126 VSVGIEFQRRLGTGFFGGEGFIMQKIEGDGLAFVHSGGLVIERDLAPGEVLRLDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T+SV+ I++ ++ A+FGG+ L A + GPG V++QSLPF RL+ R+ A
Sbjct: 186 MTASVDYNIEFVKGVKTALFGGEGLFFATLRGPGKVWVQSLPFSRLADRVLSA 238
>gi|392957757|ref|ZP_10323277.1| hypothetical protein A374_13480 [Bacillus macauensis ZFHKF-1]
gi|391876106|gb|EIT84706.1| hypothetical protein A374_13480 [Bacillus macauensis ZFHKF-1]
Length = 259
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 134/234 (57%), Gaps = 10/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I ++I G + Q ++ L P E V+A G++ + ++ ME I+ + G LF
Sbjct: 6 IDYKIYGSDMQFVEVELDPSETVIAEAGAVMMLEENIVMETIFGDGSAQKSGFMGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G V AAP +I+P+DL+++ G+++CQ DAFL +
Sbjct: 66 GKRLLTGESLFMTTFTNEGSGKKHVSFAAPYPGKIIPMDLSVYNGKIICQKDAFLAAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + L +++ G EGF+ QKL G G+AF+ AGG++ +K L+ GEV+ VD C+VA
Sbjct: 126 VSIGIELQRKIGAGFFGGEGFIMQKLEGDGMAFVHAGGTLHEKKLQPGEVLRVDTGCLVA 185
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+T V I++ ++ A+FGG+ L A + GPG V++QSLPF RL+ R+ A+
Sbjct: 186 MTQDVEYNIEFVKGVKTALFGGEGLFFATLRGPGTVWVQSLPFSRLASRVFAAI 239
>gi|339494813|ref|YP_004715106.1| hypothetical protein PSTAB_2736 [Pseudomonas stutzeri ATCC 17588 =
LMG 11199]
gi|386021469|ref|YP_005939493.1| hypothetical protein PSTAA_2872 [Pseudomonas stutzeri DSM 4166]
gi|327481441|gb|AEA84751.1| conserved hypothetical protein [Pseudomonas stutzeri DSM 4166]
gi|338802185|gb|AEJ06017.1| hypothetical protein PSTAB_2736 [Pseudomonas stutzeri ATCC 17588 =
LMG 11199]
Length = 248
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 87/232 (37%), Positives = 131/232 (56%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G+ L+G
Sbjct: 6 LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLWGMGK 65
Query: 80 KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ IT L N G V AAP ++P+DLA GGEL+CQ DAFLC+ +
Sbjct: 66 RMITGESLFLTHFTNQGKGRARVAFAAPYPGTVVPVDLARLGGELICQKDAFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ Q+L G GLAFI AGG+V++K L E + +D C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLQGDGLAFIHAGGTVIRKELN-NETLRLDTGCLVAFT 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ I+ G +R +FGG+ L+ + G G V+IQSLPF RL++R+ A
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEAT 236
>gi|418294343|ref|ZP_12906239.1| hypothetical protein PstZobell_13606 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
gi|379065722|gb|EHY78465.1| hypothetical protein PstZobell_13606 [Pseudomonas stutzeri ATCC
14405 = CCUG 16156]
Length = 248
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 131/232 (56%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G+ L G
Sbjct: 6 LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLLGMGK 65
Query: 80 KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ IT L N G S V AAP ++P+DLA GG+L+CQ DAFLC+ +
Sbjct: 66 RMITGESLFLTHFTNQGKSKARVAFAAPYPGTVVPVDLAALGGQLICQKDAFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ Q+L G GLAF+ AGG+V++K L E + +D C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLDGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFT 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ I+ G +R +FGG+ L+ + G G V+IQSLPF RL++R+ A
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEAT 236
>gi|452748294|ref|ZP_21948075.1| hypothetical protein B381_11078 [Pseudomonas stutzeri NF13]
gi|452007875|gb|EME00127.1| hypothetical protein B381_11078 [Pseudomonas stutzeri NF13]
Length = 248
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 131/233 (56%), Gaps = 9/233 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G+ L+G
Sbjct: 6 LEYEILGQSAQSVEIILDPDETVIAEAGAMNYMTNGVRFEARMGDGSASGVLGKLWGVGK 65
Query: 80 KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ IT L N G V AAP ++PIDLA GG+L CQ DAFLC+ +
Sbjct: 66 RMITGESLFLTHFTNEGKGKACVAFAAPYPGTVVPIDLAAAGGKLFCQKDAFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ Q+L G GLAF+ AGG+V++K L E + +D C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLEGNGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFT 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
++ I+ G +R +FGG+ L+ + G G V+IQSLPF RL++R+ A +
Sbjct: 185 QGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEATS 237
>gi|386716499|ref|YP_006182825.1| hypothetical protein SMD_0032 [Stenotrophomonas maltophilia D457]
gi|384076061|emb|CCH10635.1| DUF124 domain-containing protein [Stenotrophomonas maltophilia
D457]
Length = 339
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 79/230 (34%), Positives = 125/230 (54%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
I F+I+G E Q ++ L P E +A G++ F +++M+ ++ N
Sbjct: 79 IEFRIVGHEMQFVELELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGFMGKMMAA 138
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + V G G V AAP +L + L GG L+CQ D+FL
Sbjct: 139 GRRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGARG 198
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V++ +R+ + G EGF+ QKL G G FI AGG V+++ L GE + VD C+VA
Sbjct: 199 VQIGVQFQRRIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVIERELAAGERLDVDTGCVVA 258
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
SSV++ ++ I+ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 259 YHSSVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 308
>gi|330504607|ref|YP_004381476.1| hypothetical protein [Pseudomonas mendocina NK-01]
gi|328918893|gb|AEB59724.1| hypothetical protein MDS_3693 [Pseudomonas mendocina NK-01]
Length = 248
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 83/232 (35%), Positives = 130/232 (56%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W---- 74
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G+ W
Sbjct: 6 LEYEILGASAQSVEIILDPGETVIAEAGAMNYMTDGVRFETRMGDGSSSGLLGKLWSVGK 65
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ L G+++ N G V AAP ++PIDLA GG L+CQ DAFLC+ +
Sbjct: 66 RMLTGESLFMTHFSNAGKRQARVAFAAPYPGTVVPIDLAKIGGRLVCQKDAFLCAAHGTS 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ QKL G GLAF+ AGG+V++K L E + +D C+VA +
Sbjct: 126 IGISFAKRLGAGFFGGEGFILQKLEGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFS 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
S ++ I G ++ +FGG+ ++ + G G V+IQSLPF RL++R+ A
Sbjct: 185 SGIDYDIALAGGLKSMLFGGEGILLTTLKGTGTVWIQSLPFSRLAERVYAAT 236
>gi|146283089|ref|YP_001173242.1| hypothetical protein PST_2750 [Pseudomonas stutzeri A1501]
gi|145571294|gb|ABP80400.1| conserved hypothetical protein [Pseudomonas stutzeri A1501]
Length = 248
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 86/232 (37%), Positives = 131/232 (56%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G+ L+G
Sbjct: 6 LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLWGMGK 65
Query: 80 KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ IT L N G V AAP ++P+DLA GGEL+CQ DAFLC+ +
Sbjct: 66 RMITGESLFLTHFTNQGKGRARVAFAAPYPGTVVPVDLARLGGELICQKDAFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ Q+L G GLAFI AGG+V++K L E + +D C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLQGDGLAFIHAGGTVIRKELN-NETLRLDTGCLVAFT 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ I+ G +R +FGG+ L+ + G G ++IQSLPF RL++R+ A
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSIWIQSLPFSRLAERVYEAT 236
>gi|403383765|ref|ZP_10925822.1| hypothetical protein KJC30_03660 [Kurthia sp. JC30]
Length = 266
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 84/232 (36%), Positives = 129/232 (55%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMW---- 74
I F+I G + Q ++ L P E +A G+M M + ME I+ N G
Sbjct: 6 IDFKIHGDDMQFVEVELDPGETAIAEAGAMMMMEDGIVMETIFGDGSQSSNASGFMGKLM 65
Query: 75 ----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ L G+++ N G V AAP RI+P+DL+ + G+++CQ DAFL +
Sbjct: 66 GAGKRLLTGESLFVTTFTNQGQGKQHVSFAAPYPGRIIPLDLSQYNGKIICQKDAFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V V +++ G EGF+ QKL G GLAF+ AGG++++K+L GE + +D C+
Sbjct: 126 RGVSVGIEFQRKLGTGFFGGEGFIMQKLEGDGLAFVHAGGTIIRKDLRPGERLRIDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T V+ I+ G I+ ++FGG+ L A + GPG V+IQS+PF RL+ R+
Sbjct: 186 VAMTDRVDYNIEAAGGIKTSLFGGEGLFLATLQGPGSVWIQSMPFSRLASRV 237
>gi|393773438|ref|ZP_10361836.1| hypothetical protein WSK_2833 [Novosphingobium sp. Rr 2-17]
gi|392721318|gb|EIZ78785.1| hypothetical protein WSK_2833 [Novosphingobium sp. Rr 2-17]
Length = 276
Score = 146 bits (369), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 89/253 (35%), Positives = 131/253 (51%), Gaps = 12/253 (4%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
++P+Q S D V F+I G E Q +I L P E VA G+M + S++M ++
Sbjct: 3 NSPWQHNRSSSIADDV-DFEIKGQELQFVEIELDPGESAVAEAGAMVWKDASIDMTTVFG 61
Query: 67 PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
+ +G + L G+++ + V + G V AAP ILPI L
Sbjct: 62 DGSGGQGGGFMGKLLGAGKRLVTGESLFTTVFTHQGRGKARVAFAAPVPGAILPIKLDHI 121
Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
GG L+CQ DAFL + V + QR+ + G EGF+ Q+L G G F+ GG+VV++
Sbjct: 122 GGRLICQKDAFLAAARGVSIGVHFQQRIMTGMFGGEGFIMQRLDGDGWVFVQMGGTVVER 181
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
L VGE I VD C+ A T + + G ++ +FGG+ + A + GPG V+IQSLPF
Sbjct: 182 ELAVGEEIHVDTGCVAAFTPGIEFDVIRAGSVKSMIFGGEGVFFARLRGPGKVWIQSLPF 241
Query: 236 HRLSQRIARAVTS 248
RL+ R+ A S
Sbjct: 242 SRLAGRMLAAAGS 254
>gi|402826312|ref|ZP_10875524.1| hypothetical protein LH128_24617 [Sphingomonas sp. LH128]
gi|402260151|gb|EJU10302.1| hypothetical protein LH128_24617 [Sphingomonas sp. LH128]
Length = 276
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 88/255 (34%), Positives = 130/255 (50%), Gaps = 12/255 (4%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
++P+ S D V F+I G E Q +I L P E VA G+M + S++M ++
Sbjct: 3 NSPWSHTRSSSLSDDV-DFEIKGQELQFVEIELDPGESAVAEAGAMVWKDASIDMTTVFG 61
Query: 67 PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
+ +G + L G+++ + V + G V AAP ILPI L
Sbjct: 62 DGSADQGGGFMGKLLGAGKRLVTGESLFTTVFTHRGSGKARVSFAAPIPGSILPIKLDEV 121
Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
GG L+CQ DAFLC+ V + QR+ + G EGF+ QKL G G F+ GG+V+++
Sbjct: 122 GGRLICQKDAFLCAARGVSIGIHFQQRIMTGLFGGEGFIMQKLEGDGWVFVQMGGTVIER 181
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
L GE + VD C+ A SV + G ++ +FGG+ + A + GPG V+IQSLPF
Sbjct: 182 ELRAGEELHVDTGCVAAFQPSVEFDVVRAGSVKSMIFGGEGVFFARLRGPGKVWIQSLPF 241
Query: 236 HRLSQRIARAVTSPN 250
RL+ R+ A S
Sbjct: 242 SRLAGRMLAAAGSTG 256
>gi|431926698|ref|YP_007239732.1| hypothetical protein Psest_1552 [Pseudomonas stutzeri RCH2]
gi|431824985|gb|AGA86102.1| TIGR00266 family protein [Pseudomonas stutzeri RCH2]
Length = 248
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/232 (36%), Positives = 131/232 (56%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G+ L+G
Sbjct: 6 LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLWGMGK 65
Query: 80 KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ IT L N G V AAP ++P+DLA GG+L+CQ DAFLC+ +
Sbjct: 66 RMITGESLFLTHFTNQGKGQARVAFAAPYPGTVVPVDLAALGGQLICQKDAFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ Q+L G GLAF+ AGG+V++K L E + +D C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLDGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFT 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ I+ G +R +FGG+ L+ + G G V+IQSLPF RL++R+ A
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEAT 236
>gi|344205455|ref|YP_004790596.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
gi|343776817|gb|AEM49370.1| protein of unknown function DUF124 [Stenotrophomonas maltophilia
JV3]
Length = 338
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 79/231 (34%), Positives = 125/231 (54%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
I F+I+G E Q +I L P E +A G++ F +++M+ ++ N
Sbjct: 77 IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGGFMGKVMA 136
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V G G V AAP +L + L GG L+CQ D+FL
Sbjct: 137 AGKRVLTGESLFATVYTQTGHGKGKVAFAAPYPGTVLAMRLDQHGGRLICQKDSFLAGAR 196
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++ +++ + G EGF+ QKL G G FI AGG V+++ L GE + VD C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVIERELAAGERLDVDTGCVV 256
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A SSV++ ++ I+ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHSSVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307
>gi|28868167|ref|NP_790786.1| hypothetical protein PSPTO_0947 [Pseudomonas syringae pv. tomato
str. DC3000]
gi|28851404|gb|AAO54481.1| conserved protein of unknown function [Pseudomonas syringae pv.
tomato str. DC3000]
Length = 244
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 1 MDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 60
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 61 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 120
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 121 VSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 180
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 181 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 234
>gi|257484023|ref|ZP_05638064.1| hypothetical protein PsyrptA_12263, partial [Pseudomonas syringae
pv. tabaci str. ATCC 11528]
Length = 282
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 39 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 98
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 99 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 158
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 159 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 218
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 219 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 272
>gi|213970164|ref|ZP_03398295.1| hypothetical protein PSPTOT1_5474 [Pseudomonas syringae pv. tomato
T1]
gi|213925045|gb|EEB58609.1| hypothetical protein PSPTOT1_5474 [Pseudomonas syringae pv. tomato
T1]
Length = 289
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 46 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 105
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 106 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 165
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 166 VSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 225
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 226 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 279
>gi|71734043|ref|YP_273121.1| hypothetical protein PSPPH_0840 [Pseudomonas syringae pv.
phaseolicola 1448A]
gi|71554596|gb|AAZ33807.1| conserved hypothetical protein TIGR00266 [Pseudomonas syringae pv.
phaseolicola 1448A]
Length = 244
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 1 MDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 60
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 61 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 120
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 121 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 180
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 181 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 234
>gi|302185871|ref|ZP_07262544.1| hypothetical protein Psyrps6_05980 [Pseudomonas syringae pv.
syringae 642]
Length = 358
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/238 (35%), Positives = 129/238 (54%), Gaps = 11/238 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG 79
V+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 114 VVDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFG 173
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 174 AGKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAK 233
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+
Sbjct: 234 GVSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLA 293
Query: 192 AVTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
A+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 294 AMTQTVDYDIRTVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351
>gi|389817739|ref|ZP_10208332.1| hypothetical protein A1A1_09841 [Planococcus antarcticus DSM 14505]
gi|388464507|gb|EIM06838.1| hypothetical protein A1A1_09841 [Planococcus antarcticus DSM 14505]
Length = 266
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/259 (34%), Positives = 141/259 (54%), Gaps = 13/259 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
I F++ G + Q ++ L P E VVA G++ M + ME I+ ++ +
Sbjct: 6 IDFKLHGDDMQFVEVELDPSETVVAEAGALMMMEDGITMETIFGDGSKSSGGSGLMGKLM 65
Query: 72 GMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G + + G+++ N G V AAP +I+P+DL+M G+++CQ D+FL +
Sbjct: 66 GAGKRIITGESLFMTTFTNNGTGKRHVSFAAPYPGKIIPMDLSMMNGKIICQKDSFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG++ +KNL+ GEV+ VD C+
Sbjct: 126 KGVSIGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFVHAGGTIYRKNLQKGEVLRVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSP 249
VA+T V+ I+ I+ A+FGG+ L A + GPG V+IQS+PF RL+ R+ A A +P
Sbjct: 186 VAMTGDVDYNIETVPGIKTALFGGEGLFLATLRGPGSVWIQSMPFSRLASRVFAAAPQNP 245
Query: 250 NMRENPKFFVQIAIFFFLA 268
R + +F L
Sbjct: 246 AGRSKGEGSAAGGLFDLLG 264
>gi|347541722|ref|YP_004849149.1| hypothetical protein NH8B_3998 [Pseudogulbenkiania sp. NH8B]
gi|345644902|dbj|BAK78735.1| protein of unknown function [Pseudogulbenkiania sp. NH8B]
Length = 264
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/245 (32%), Positives = 129/245 (52%), Gaps = 10/245 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMW----- 74
VI ++I G + Q ++ L P E V G+M +M + M+ ++ + G+
Sbjct: 5 VIDYEIFGDDMQYVEVELDPGEAAVGEAGTMYYMEDGISMDTVFGDGSAAQGGLLGKLMG 64
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V N P V AA +I+P+ L GG L Q D+FL +
Sbjct: 65 AGKRLLTGESLFTTVFVNQAPGKKRVAFAASYPGKIVPVHLLELGGTLYAQKDSFLAAAK 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + L +++ + G EGF+ Q+L G G AF+ AGG++ ++ L GE + VD C+V
Sbjct: 125 GVSLGIALQKKIGTGLFGGEGFIMQRLDGDGYAFLHAGGTLTERQLAPGETLRVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A SV+ I+Y G ++ A+F G+ L A +TGPG V++QSLP RL+ R+ A
Sbjct: 185 AYQPSVDFDIQYVGKLKSALFAGEGLFLATLTGPGRVWLQSLPLSRLADRLIAASPKAGG 244
Query: 252 RENPK 256
R + +
Sbjct: 245 RSSEQ 249
>gi|224826743|ref|ZP_03699843.1| protein of unknown function DUF124 [Pseudogulbenkiania ferrooxidans
2002]
gi|224600963|gb|EEG07146.1| protein of unknown function DUF124 [Pseudogulbenkiania ferrooxidans
2002]
Length = 264
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 79/245 (32%), Positives = 129/245 (52%), Gaps = 10/245 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMW----- 74
VI ++I G + Q ++ L P E V G+M +M + M+ ++ + G+
Sbjct: 5 VIDYEIFGDDMQYVEVELDPGEAAVGEAGTMYYMEDGISMDTVFGDGSAAQGGLLGKLMG 64
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V N P V AA +I+P+ L GG L Q D+FL +
Sbjct: 65 AGKRLLTGESLFTTVFVNQAPGKKRVAFAASYPGKIVPVHLLELGGTLYAQKDSFLAAAK 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + L +++ + G EGF+ Q+L G G AF+ AGG++ ++ L GE + VD C+V
Sbjct: 125 GVSLGIALQKKIGTGLFGGEGFIMQRLDGDGYAFLHAGGTLTERQLAPGETLRVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A SV+ I+Y G ++ A+F G+ L A +TGPG V++QSLP RL+ R+ A
Sbjct: 185 AYQPSVDFDIQYVGKLKSALFAGEGLFLATLTGPGRVWLQSLPLSRLADRLIAASPKAGG 244
Query: 252 RENPK 256
R + +
Sbjct: 245 RSSEQ 249
>gi|87199726|ref|YP_496983.1| hypothetical protein Saro_1709 [Novosphingobium aromaticivorans DSM
12444]
gi|87135407|gb|ABD26149.1| protein of unknown function DUF124 [Novosphingobium aromaticivorans
DSM 12444]
Length = 277
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 88/255 (34%), Positives = 135/255 (52%), Gaps = 12/255 (4%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
++P+Q + S D I F+I G E Q +I L P E VA G++ + G++ M ++
Sbjct: 3 NSPWQHHRSTSIADD-IDFEIKGQELQFLEIELDPGESAVAEAGALVWKDGAVGMTTVFG 61
Query: 67 PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
+ +G + L G+++ + V + G V A+P+ ILPI L+
Sbjct: 62 DGSGGSEGGFMGKLLGAGKRLVTGESLFTTVFTHNGHGKARVAFASPTPGAILPIKLSDV 121
Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
GG L+CQ D+FL + V + +RV + G EGF+ QKL G G F+ GG++V+K
Sbjct: 122 GGTLICQKDSFLAAARGVSIGIAFQRRVMTGLFGGEGFIMQKLEGDGWVFVQMGGTLVEK 181
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
L GE + VD C+ A T S++ + G +R +FGG+ L A + GPG V+IQSLPF
Sbjct: 182 ELRSGEELHVDTGCLAAYTPSIDFDLVTAGGVRSVLFGGEGLFFARLRGPGKVWIQSLPF 241
Query: 236 HRLSQRIARAVTSPN 250
RL+ R+ A S
Sbjct: 242 SRLAGRMLAAAGSSG 256
>gi|66044069|ref|YP_233910.1| hypothetical protein Psyr_0814 [Pseudomonas syringae pv. syringae
B728a]
gi|63254776|gb|AAY35872.1| Protein of unknown function DUF124 [Pseudomonas syringae pv.
syringae B728a]
Length = 291
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 48 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 107
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 108 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 167
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 168 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 227
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 228 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 281
>gi|375148997|ref|YP_005011438.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361063043|gb|AEW02035.1| protein of unknown function DUF124 [Niastella koreensis GR20-10]
Length = 264
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 78/244 (31%), Positives = 130/244 (53%), Gaps = 14/244 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I ++I G E Q +I L P E +A G+ M ++M ++ ++ Q L GK +
Sbjct: 7 IDYKIYGEEMQYVEIELDPNETAIAESGAFMMMDDGIQMATMFGDGSK--QQQGLLGKLM 64
Query: 83 TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
++ N G V A+P +I+ +DL GG+++ Q DAFLC+
Sbjct: 65 SAGKRMLTGESLFMTAFTNMGQGKKRVSFASPYPGKIIALDLQELGGKIVAQKDAFLCAA 124
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ + G EGF+ +KL G G+AF+ AGG V ++ L+ GEV+ +D C+
Sbjct: 125 KGVSIGIEFQRKLGTGLFGGEGFIMEKLEGDGMAFVHAGGHVFERTLQPGEVLRIDTGCL 184
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
VA T +++ I++ G I+ +FGG+ L A + GPG V+IQ+LP RL+ RI ++
Sbjct: 185 VAYTQTIDYDIQFVGGIKNTLFGGEGLFFATLRGPGKVWIQTLPISRLASRILTYGSAGG 244
Query: 251 MREN 254
+E
Sbjct: 245 RKEE 248
>gi|422654987|ref|ZP_16717708.1| hypothetical protein PSYAC_26095 [Pseudomonas syringae pv.
actinidiae str. M302091]
gi|330968051|gb|EGH68311.1| hypothetical protein PSYAC_26095 [Pseudomonas syringae pv.
actinidiae str. M302091]
Length = 356
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 113 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 172
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 173 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 232
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 233 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 292
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 293 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 346
>gi|422607037|ref|ZP_16679041.1| hypothetical protein PSYMO_18538 [Pseudomonas syringae pv. mori
str. 301020]
gi|330890683|gb|EGH23344.1| hypothetical protein PSYMO_18538 [Pseudomonas syringae pv. mori
str. 301020]
Length = 358
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 348
>gi|302062223|ref|ZP_07253764.1| hypothetical protein PsyrptK_19726 [Pseudomonas syringae pv. tomato
K40]
gi|302134180|ref|ZP_07260170.1| hypothetical protein PsyrptN_22461 [Pseudomonas syringae pv. tomato
NCPPB 1108]
gi|422660507|ref|ZP_16722918.1| hypothetical protein PLA106_23988 [Pseudomonas syringae pv.
lachrymans str. M302278]
gi|331019111|gb|EGH99167.1| hypothetical protein PLA106_23988 [Pseudomonas syringae pv.
lachrymans str. M302278]
Length = 356
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 113 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 172
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 173 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 232
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 233 VSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 292
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 293 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 346
>gi|237802055|ref|ZP_04590516.1| hypothetical protein POR16_24750 [Pseudomonas syringae pv. oryzae
str. 1_6]
gi|331024911|gb|EGI04967.1| hypothetical protein POR16_24750 [Pseudomonas syringae pv. oryzae
str. 1_6]
Length = 364
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 121 VDFKLYGSETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 180
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 181 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 240
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 241 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 300
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 301 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 354
>gi|374710374|ref|ZP_09714808.1| hypothetical protein SinuC_09125 [Sporolactobacillus inulinus CASD]
Length = 263
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 86/233 (36%), Positives = 125/233 (53%), Gaps = 9/233 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM--------- 73
I ++I G E Q +I L E V+A GS M + ME I+ E G
Sbjct: 6 IDYKIEGQEMQFVEIELDSSETVIAEAGSFMMMEDGITMETIFGDGAEQGGVMGKLFSAG 65
Query: 74 WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V A+P +I+P+DL+++ G ++CQ DAFLC+ V
Sbjct: 66 KRMLTGESLFMTAFTNEGGGKRHVSFASPYPGKIIPLDLSLYSGRMICQKDAFLCAAKGV 125
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
V +R+ G EGF+ QKL G G+AF+ AGG V + L GE + VD C+VA+
Sbjct: 126 SVGIAFQKRLGAGFFGGEGFIMQKLEGDGMAFVHAGGMVKELTLAPGETLKVDTGCLVAM 185
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T V+ I + I+ A+FGG+ L A + GPG V +QSLPF RL+ R+ A+
Sbjct: 186 TRDVDYDITFVRGIKTALFGGEGLFFASLRGPGKVLVQSLPFSRLASRVFSAM 238
>gi|422296928|ref|ZP_16384577.1| Protein of unknown function DUF124 [Pseudomonas avellanae BPIC 631]
gi|407991798|gb|EKG33569.1| Protein of unknown function DUF124 [Pseudomonas avellanae BPIC 631]
Length = 356
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 113 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 172
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 173 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 232
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 233 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 292
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 293 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 346
>gi|307718072|ref|YP_003873604.1| HTH DNA-binding protein [Spirochaeta thermophila DSM 6192]
gi|306531797|gb|ADN01331.1| HTH DNA-binding protein [Spirochaeta thermophila DSM 6192]
Length = 246
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 123/220 (55%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I + I G E Q+ +++L P E V A G+M +M + M+ G + L G++
Sbjct: 5 IDYTIHGDEMQLVEVVLDPGEGVRAEAGAMLYMEEGIAMDTRMEGGLLGGFKRILAGESF 64
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
N G V AAP +I+P+DL GG +CQ DAFLC+ +++ +R
Sbjct: 65 FITNFVNTGSRRAAVAFAAPYPGKIVPLDLGALGGRFVCQKDAFLCAAAGIEIDVVFTRR 124
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ Q+L+G G AFI AGG++V+K L G+ + VD C V + +V+ ++
Sbjct: 125 LGAGFFGGEGFILQELSGDGWAFIHAGGTIVEKRLAPGQTLRVDTGCFVGCSPTVDYDVR 184
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ G R A+FGG+ L +TGPG+V++QSLP RL+ RI
Sbjct: 185 FVGGFRNALFGGEGLFLTELTGPGVVYLQSLPLARLADRI 224
>gi|226945977|ref|YP_002801050.1| hypothetical protein Avin_39350 [Azotobacter vinelandii DJ]
gi|226720904|gb|ACO80075.1| conserved hypothetical protein [Azotobacter vinelandii DJ]
Length = 248
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 131/228 (57%), Gaps = 9/228 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MW---- 74
+ ++ILG AQ +I+L P E V+A G+M +M+ + E +E G +W
Sbjct: 6 LDYEILGAHAQSVEIILDPGETVIAEAGAMNYMTEGVRFETRMGDGSESGVLGKLWGMGK 65
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ L G+++ N G V AAP ++P+DLA GG L+CQ D+FLC+ +
Sbjct: 66 RMLTGESLFMTHFSNHGKRQARVAFAAPYPGTVVPVDLAEIGGTLICQKDSFLCAARGTE 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ Q+L+G GLAF+ AGG+V++K L E + +D C+V +
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLSGDGLAFLHAGGAVIRKELR-DETLRLDTGCLVGFS 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++ I+ G ++ +FGG+ ++ A + G G V++QSLPF RL++R+
Sbjct: 185 RGIDYDIQLAGGLKSMLFGGEGILLATLKGTGTVWLQSLPFSRLAERV 232
>gi|301385947|ref|ZP_07234365.1| hypothetical protein PsyrptM_25068 [Pseudomonas syringae pv. tomato
Max13]
Length = 350
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 107 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 166
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 167 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 226
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 227 VSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 286
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 287 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 340
>gi|289628836|ref|ZP_06461790.1| hypothetical protein PsyrpaN_27475 [Pseudomonas syringae pv.
aesculi str. NCPPB 3681]
gi|416024917|ref|ZP_11568827.1| hypothetical protein PsgRace4_09542 [Pseudomonas syringae pv.
glycinea str. race 4]
gi|422582537|ref|ZP_16657672.1| hypothetical protein PSYAE_09004 [Pseudomonas syringae pv. aesculi
str. 0893_23]
gi|422683710|ref|ZP_16741968.1| hypothetical protein PSYTB_25931 [Pseudomonas syringae pv. tabaci
str. ATCC 11528]
gi|320330238|gb|EFW86223.1| hypothetical protein PsgRace4_09542 [Pseudomonas syringae pv.
glycinea str. race 4]
gi|330867379|gb|EGH02088.1| hypothetical protein PSYAE_09004 [Pseudomonas syringae pv. aesculi
str. 0893_23]
gi|331013042|gb|EGH93098.1| hypothetical protein PSYTB_25931 [Pseudomonas syringae pv. tabaci
str. ATCC 11528]
Length = 358
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 348
>gi|336113473|ref|YP_004568240.1| hypothetical protein BCO26_0795 [Bacillus coagulans 2-6]
gi|335366903|gb|AEH52854.1| protein of unknown function DUF124 [Bacillus coagulans 2-6]
Length = 263
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/235 (34%), Positives = 130/235 (55%), Gaps = 11/235 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I F+++G + Q ++ L PQE V+A GS+ M + ME I+ +
Sbjct: 6 IDFKLVGDDMQFVEVELDPQETVIAEAGSLMMMEDGIRMETIFGDGSGGQGGGIMGKLFS 65
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ + G+++ N AAP +I+P+DL+ GG+++CQ DAFL +
Sbjct: 66 AGKRLITGESLFMTAFTNEARDKKHAAFAAPYPGKIVPMDLSELGGKIICQKDAFLAAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ QKL G G AFI AGG++ ++ L GE++ VD C+V
Sbjct: 126 GVSVGIEFQRKLGTGFFGGEGFIMQKLEGDGFAFIHAGGTIQKRELAPGEILRVDTGCLV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
A++ +V+ I++ I+ A+FGG+ L A + GPG V++QSLPF RL+ R+ A+
Sbjct: 186 AMSKTVDYDIEFVKGIKTALFGGEGLFFATLRGPGSVWVQSLPFSRLASRVFAAM 240
>gi|326387594|ref|ZP_08209200.1| hypothetical protein Y88_0506 [Novosphingobium nitrogenifigens DSM
19370]
gi|326207640|gb|EGD58451.1| hypothetical protein Y88_0506 [Novosphingobium nitrogenifigens DSM
19370]
Length = 277
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 80/235 (34%), Positives = 125/235 (53%), Gaps = 11/235 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
+ F I G + Q +I L P E V G+M + S+EM ++ +E +G
Sbjct: 19 VDFAIKGHDLQFVEIELDPGESAVGEAGAMVWKDASVEMTTVFGDGSEAADNGFMGKLLG 78
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G T+ + V + G V AP +LP+ LA GG L+CQ D+FLC+
Sbjct: 79 AGKRLLTGATLFTTVFTHRGTGKARVAFGAPIPGTVLPLRLADLGGTLICQKDSFLCAAR 138
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +R+ + G +GF+ Q+L G G F+ GG+++++ L GE I VD C+
Sbjct: 139 GVSIGIAFQKRIMTGLFGGDGFVMQRLDGDGWVFVQMGGTLIERELAPGEQIHVDTGCLA 198
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
A T SV+ ++ G ++ +FGG+ + A +TGPG V+IQSLPF RL+ R+ A
Sbjct: 199 AFTPSVDFDVQAAGGVKTMLFGGEGVFLARLTGPGKVWIQSLPFSRLAGRMLAAA 253
>gi|298159927|gb|EFI00966.1| HTH DNA-binding protein [Pseudomonas savastanoi pv. savastanoi
NCPPB 3335]
Length = 358
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGRNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 348
>gi|429765885|ref|ZP_19298165.1| TIGR00266 family protein [Clostridium celatum DSM 1785]
gi|429185738|gb|EKY26712.1| TIGR00266 family protein [Clostridium celatum DSM 1785]
Length = 262
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 91/245 (37%), Positives = 139/245 (56%), Gaps = 13/245 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMW----- 74
I ++I G E Q +I L P E V+A G+M M S+ ME + +N G++
Sbjct: 5 IDYKIHGSEMQFVEIELDPNETVMAEAGAMMMMDNSIVMETVLGDGSEKNSRGLFGKIGG 64
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G++I N G V +AP +ILP+DL + GGEL+CQ DAFLC
Sbjct: 65 AAKRALTGESIFMTAFTNMGYGRESVAFSAPYPGKILPMDLTVLGGELICQKDAFLCCAK 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
+ + +++ G EGF+ Q+L G GLAFI GG++V+K L G+++ +D C+V
Sbjct: 125 GISLDIAFTKKIGAGFFGGEGFILQRLQGNGLAFIHGGGTIVRKTLMPGQILKIDTGCLV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A+T +V+ I++ G ++ ++FGG+ L A + GPG V+IQSLPF R + RI A +P +
Sbjct: 185 AMTKNVSYDIQFVGGLKNSIFGGEGLFFATLKGPGDVWIQSLPFSRFADRIISA--APQL 242
Query: 252 RENPK 256
N K
Sbjct: 243 TNNTK 247
>gi|440721263|ref|ZP_20901663.1| hypothetical protein A979_10584 [Pseudomonas syringae BRIP34876]
gi|440728601|ref|ZP_20908814.1| hypothetical protein A987_21097 [Pseudomonas syringae BRIP34881]
gi|440361390|gb|ELP98621.1| hypothetical protein A987_21097 [Pseudomonas syringae BRIP34881]
gi|440363885|gb|ELQ01039.1| hypothetical protein A979_10584 [Pseudomonas syringae BRIP34876]
Length = 358
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351
>gi|422661660|ref|ZP_16723892.1| hypothetical protein PSYAP_30263, partial [Pseudomonas syringae pv.
aptata str. DSM 50252]
gi|330982769|gb|EGH80872.1| hypothetical protein PSYAP_30263 [Pseudomonas syringae pv. aptata
str. DSM 50252]
Length = 329
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 86 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 145
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 146 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 205
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 206 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 265
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 266 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 322
>gi|422620056|ref|ZP_16688740.1| hypothetical protein PSYJA_23843 [Pseudomonas syringae pv. japonica
str. M301072]
gi|330900420|gb|EGH31839.1| hypothetical protein PSYJA_23843 [Pseudomonas syringae pv. japonica
str. M301072]
Length = 361
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 118 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 177
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 178 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 237
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 238 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 297
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 298 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 354
>gi|289651012|ref|ZP_06482355.1| hypothetical protein Psyrpa2_25244 [Pseudomonas syringae pv.
aesculi str. 2250]
Length = 352
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 109 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 168
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 169 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 228
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 229 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 288
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 289 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 342
>gi|456011513|gb|EMF45259.1| DUF124 domain-containing protein [Planococcus halocryophilus Or1]
Length = 266
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 88/259 (33%), Positives = 140/259 (54%), Gaps = 13/259 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
I +++ G + Q ++ L P E V+A G++ M + ME I+ ++ +
Sbjct: 6 IDYKLHGDDMQFVEVELDPSETVIAEAGALMMMEDGIAMETIFGDGSQSSGGSGLMGKLM 65
Query: 72 GMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G + + G+++ N G V AAP +I+P+DL+ G+++CQ DAFL +
Sbjct: 66 GAGKRIITGESLFMTTFTNNGTGKRHVSFAAPYPGKIIPMDLSTLNGKIICQKDAFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG++ +KNL+ GEVI VD C+
Sbjct: 126 KGVSIGIEFQRKIGAGFFGGEGFIMQKLEGDGLAFVHAGGTIYRKNLQKGEVIRVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSP 249
VA+T V+ I+ I+ A+FGG+ L A + GPG V+IQS+PF RL+ R+ A A +P
Sbjct: 186 VAMTGDVDYNIEAVPGIKTALFGGEGLFFATLRGPGSVWIQSMPFSRLASRVFAAAPQNP 245
Query: 250 NMRENPKFFVQIAIFFFLA 268
R + +F L
Sbjct: 246 AGRSKGEGSAAGGLFDLLG 264
>gi|422597437|ref|ZP_16671710.1| hypothetical protein PLA107_22068 [Pseudomonas syringae pv.
lachrymans str. M301315]
gi|330987727|gb|EGH85830.1| hypothetical protein PLA107_22068 [Pseudomonas syringae pv.
lachrymans str. M301315]
Length = 358
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 81/231 (35%), Positives = 126/231 (54%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRM 345
>gi|422586855|ref|ZP_16661526.1| hypothetical protein PSYMP_00190 [Pseudomonas syringae pv.
morsprunorum str. M302280]
gi|330872497|gb|EGH06646.1| hypothetical protein PSYMP_00190 [Pseudomonas syringae pv.
morsprunorum str. M302280]
Length = 356
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 113 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 172
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 173 GKRMLTGESLFTSVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 232
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 233 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 292
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 293 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 346
>gi|443641520|ref|ZP_21125370.1| AIM24 domain protein [Pseudomonas syringae pv. syringae B64]
gi|443281537|gb|ELS40542.1| AIM24 domain protein [Pseudomonas syringae pv. syringae B64]
Length = 358
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351
>gi|422647248|ref|ZP_16710378.1| hypothetical protein PMA4326_19830 [Pseudomonas syringae pv.
maculicola str. ES4326]
gi|330960792|gb|EGH61052.1| hypothetical protein PMA4326_19830 [Pseudomonas syringae pv.
maculicola str. ES4326]
Length = 359
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 116 VDFKLYGSETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 175
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 176 GKRMLTGESLFTTVFSQQGAGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 235
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 236 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEGLDVDTGCLAA 295
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 296 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 349
>gi|94498230|ref|ZP_01304791.1| hypothetical protein SKA58_14132 [Sphingomonas sp. SKA58]
gi|94422360|gb|EAT07400.1| hypothetical protein SKA58_14132 [Sphingomonas sp. SKA58]
Length = 279
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 83/253 (32%), Positives = 134/253 (52%), Gaps = 12/253 (4%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
++P+ + S D I F+I G E Q +I L P E VA G+M + G++ M ++
Sbjct: 5 NSPWSHHRSASLSDD-IDFEIKGQELQFLEIELNPGESAVAEAGAMVWKDGAISMTTVFG 63
Query: 67 PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
+ +G + L G+++ + V + G V A+P+ ILP+ L +
Sbjct: 64 DGSGGADGGFMGKLLGAGKRLVTGESLFTTVFTHQGHGKARVAFASPTPGSILPLKLDQY 123
Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
GG L+CQ D+FL + V + +R+ + G EGF+ Q+L+G G F+ GG++V+K
Sbjct: 124 GGALICQKDSFLAAAKGVAIGVHFQKRIMTGLFGGEGFIMQRLSGDGWVFVQMGGTLVEK 183
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
L GE + VD C+ A+T V+ + G ++ +FGG+ A + GPG V+IQSLPF
Sbjct: 184 ELAPGEELHVDTGCLAAMTPGVDFDLVAAGGVKSMIFGGEGAFFARLRGPGHVWIQSLPF 243
Query: 236 HRLSQRIARAVTS 248
RL+ R+ A S
Sbjct: 244 SRLAGRMLAAAGS 256
>gi|254523742|ref|ZP_05135797.1| hypothetical protein SSKA14_2876 [Stenotrophomonas sp. SKA14]
gi|219721333|gb|EED39858.1| hypothetical protein SSKA14_2876 [Stenotrophomonas sp. SKA14]
Length = 337
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 127/230 (55%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN-EVGMW------- 74
I F+I+G E Q +I L P E +A G++ F +++M+ ++ N + G +
Sbjct: 77 IDFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGFMGKVMAA 136
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + V G G V AAP +L + L GG L+CQ D+FL
Sbjct: 137 GKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGARG 196
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V++ +R+ + G EGF+ QKL G G FI AGG V+++ L GE + VD C+VA
Sbjct: 197 VQIGVQFQRRIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVIERELAAGERLDVDTGCVVA 256
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+V++ ++ ++ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 257 YHPTVDMDVRRVTGVKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 306
>gi|251796470|ref|YP_003011201.1| hypothetical protein Pjdr2_2460 [Paenibacillus sp. JDR-2]
gi|247544096|gb|ACT01115.1| protein of unknown function DUF124 [Paenibacillus sp. JDR-2]
Length = 266
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 127/231 (54%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMW----- 74
I + I G + Q +I L P E V+A GS+ M + ME I+ ++ G+
Sbjct: 7 IDYYIHGDDMQFVEIELDPAETVIAEAGSLMMMDDDIHMEAIFGDGSAQSSGGLMGKLMS 66
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ N G AAP +I+PIDL GG+++CQ DAFL +
Sbjct: 67 AGKRILTGESLFMTAFTNEGMGKKKASFAAPYPGKIIPIDLHAAGGKMICQKDAFLAAAK 126
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ QKL G GLAF+ AGG++++K+L GE + VD C+V
Sbjct: 127 GVSVGIEFQRKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTIIRKDLNPGEKLRVDTGCLV 186
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A T SV+ I+ ++ +FGG+ L A + GPG V+IQSLPF RL+ R+
Sbjct: 187 AFTPSVHYSIEMVKGVKTFLFGGEGLFFATLEGPGTVYIQSLPFSRLASRV 237
>gi|424066025|ref|ZP_17803498.1| Protein of unknown function DUF124 [Pseudomonas syringae pv.
avellanae str. ISPaVe013]
gi|408002727|gb|EKG42963.1| Protein of unknown function DUF124 [Pseudomonas syringae pv.
avellanae str. ISPaVe013]
Length = 358
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSPGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351
>gi|424070674|ref|ZP_17808107.1| DUF124 domain-containing protein [Pseudomonas syringae pv.
avellanae str. ISPaVe037]
gi|408000091|gb|EKG40456.1| DUF124 domain-containing protein [Pseudomonas syringae pv.
avellanae str. ISPaVe037]
Length = 358
Score = 143 bits (361), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSPGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351
>gi|440730209|ref|ZP_20910303.1| hypothetical protein A989_02770 [Xanthomonas translucens DAR61454]
gi|440379300|gb|ELQ15897.1| hypothetical protein A989_02770 [Xanthomonas translucens DAR61454]
Length = 333
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 84/246 (34%), Positives = 130/246 (52%), Gaps = 12/246 (4%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
S P P S D I F+I+G E Q +I L P E VA G++ F +++M+ ++
Sbjct: 59 SLPPPPGGRGSADD--IDFRIVGHEMQFVEIALDPGESAVAEAGALMFKDAAVQMDTVFG 116
Query: 67 PENEVG--MWQWLF--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFG 116
+ G + LF G+++ + V + G V AAP +L + L G
Sbjct: 117 DGSSQGGGLMGKLFSAGKRLVTGESLFTTVFTHQGQGKATVAFAAPYPGTVLAMKLDQHG 176
Query: 117 GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKN 176
G L+CQ D+FL V + +++ + G EGF+ QKL G G F+ AGG VV++
Sbjct: 177 GRLICQKDSFLAGARGVSLGIHFQRKIMTGLFGGEGFIMQKLEGDGWVFVHAGGCVVERE 236
Query: 177 LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
L GE + VD C+VA S+V++ ++ I+ FGG+ + A +TGPG V++QSLPF
Sbjct: 237 LAAGERLDVDTGCVVAFHSTVDMDVRAVSGIKSMFFGGEGMFLATLTGPGKVWLQSLPFS 296
Query: 237 RLSQRI 242
RL+ R+
Sbjct: 297 RLAGRM 302
>gi|440741756|ref|ZP_20921089.1| hypothetical protein A988_00190 [Pseudomonas syringae BRIP39023]
gi|440378614|gb|ELQ15232.1| hypothetical protein A988_00190 [Pseudomonas syringae BRIP39023]
Length = 358
Score = 143 bits (361), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 83/237 (35%), Positives = 127/237 (53%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGRLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 295 MTQTVDYDIRTVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351
>gi|190572163|ref|YP_001970008.1| hypothetical protein Smlt0082 [Stenotrophomonas maltophilia K279a]
gi|424666447|ref|ZP_18103474.1| TIGR00266 family protein [Stenotrophomonas maltophilia Ab55555]
gi|190010085|emb|CAQ43693.1| conserved hypothetical protein [Stenotrophomonas maltophilia K279a]
gi|401072302|gb|EJP80809.1| TIGR00266 family protein [Stenotrophomonas maltophilia Ab55555]
Length = 338
Score = 143 bits (360), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 124/231 (53%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
I F+I+G E Q +I L P E +A G++ F +++M+ ++ N
Sbjct: 77 IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGGFMGKVMA 136
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V G G V AAP +L + L GG L+CQ D+FL
Sbjct: 137 AGKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGAR 196
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++ +++ + G EGF+ QKL G G FI AGG VV++ L GE + VD C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERLDVDTGCVV 256
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A +V++ ++ I+ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHPTVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307
>gi|386346080|ref|YP_006044329.1| hypothetical protein [Spirochaeta thermophila DSM 6578]
gi|339411047|gb|AEJ60612.1| protein of unknown function DUF124 [Spirochaeta thermophila DSM
6578]
Length = 246
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 80/220 (36%), Positives = 121/220 (55%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
I + I G + Q+ +++L P E V A G+M +M + M+ G + L G++
Sbjct: 5 IDYTIHGDDMQLVEVILDPGEGVRAEAGAMLYMEEGITMDTRMEGGLLGGFKRILAGESF 64
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
N G V AAP +I+P+DL GG +CQ DAFLC+ + +S +R
Sbjct: 65 FITNFVNTGSRRAAVAFAAPYPGKIVPLDLGTLGGRFICQKDAFLCAAVGIDISIVFTRR 124
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ ++L G G AFI AGG++V+K L GE + VD C V + V+ ++
Sbjct: 125 LGAGFFGGEGFILEELRGDGWAFIHAGGTIVEKRLAAGETLRVDTGCFVGCSPPVDYDVR 184
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ G R A+FGG+ L +TGPG+V++QSLP RL+ RI
Sbjct: 185 FVGGFRNALFGGEGLFLTELTGPGVVYLQSLPLARLADRI 224
>gi|150401531|ref|YP_001325297.1| hypothetical protein Maeo_1108 [Methanococcus aeolicus Nankai-3]
gi|150014234|gb|ABR56685.1| protein of unknown function DUF124 [Methanococcus aeolicus
Nankai-3]
Length = 262
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 125/232 (53%), Gaps = 11/232 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLFG- 79
+ ++I G + QV ++ L P E V+A G+M +M + E ++ G+ LFG
Sbjct: 6 VDYEIFGDDMQVVEVELDPNESVIAEAGAMNWMEDGINFETKMGDGSDANEGLVGKLFGA 65
Query: 80 --KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ IT L N G V AAP I+PIDL+ GE CQ D+FLC+
Sbjct: 66 AKRVITGESLFLTHFTNVGRGKKRVAFAAPYPGSIIPIDLSKLDGEFFCQKDSFLCAALG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
KV + ++++ + G EGF+ + L G G+ F+ AGG++V+K L G+ I VD CIV
Sbjct: 126 TKVGISFNKKIGTGLFGGEGFIMEHLEGDGMIFVHAGGTIVKKELN-GDKIKVDTGCIVG 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
T ++ I+ G ++ +FG + L A + G G V++QSLPF RL+ RI +
Sbjct: 185 FTKGIDFSIERAGGLKSMIFGSEGLFLATLQGHGTVYLQSLPFSRLADRILK 236
>gi|78358149|ref|YP_389598.1| hypothetical protein [Desulfovibrio alaskensis G20]
gi|78220554|gb|ABB39903.1| protein of unknown function DUF124 [Desulfovibrio alaskensis G20]
Length = 242
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 130/230 (56%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENE-----VGM 73
+ ++I GG+ QV ++ L P E V+A G+MC+M G +E M + + +G
Sbjct: 6 VEYRITGGDLQVVEVELDPGETVIAEAGAMCWMDGDIEFAARMGDGSAADGGFFGKLLGA 65
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ N G + V A ++P+DLA GGEL+CQ DAFLC+
Sbjct: 66 GKRLVTGESLFMTHFTNQGQAKASVAFAGQVPGHVVPVDLAEIGGELICQRDAFLCAARG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
++ +R+ G EGF+ Q+L G GLAF+ AGG+VV+K L G + VD C+VA
Sbjct: 126 TRIDVAFSKRLGAGFFGGEGFVLQRLRGDGLAFVHAGGAVVRKELH-GGTLRVDTGCLVA 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T V+ I +G ++ +FGG+ A ++G G VF+QS+PF+RL+ RI
Sbjct: 185 FTPGVSYDIGLSGGLKSMMFGGEGAFLAALSGHGTVFLQSMPFNRLADRI 234
>gi|359783147|ref|ZP_09286364.1| hypothetical protein PPL19_18852 [Pseudomonas psychrotolerans L19]
gi|359369035|gb|EHK69609.1| hypothetical protein PPL19_18852 [Pseudomonas psychrotolerans L19]
Length = 247
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 82/233 (35%), Positives = 130/233 (55%), Gaps = 10/233 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W--- 74
++ +ILG Q +I L P E V+A G+M +M ++ + + G+ W
Sbjct: 5 ILDHRILGESLQAVEIGLDPGETVIAEAGAMTYMEADIDF-TARMGDGSAGLLGKLWGAG 63
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V AAP +++P+DLA GG L CQ D+FLC+
Sbjct: 64 KRMLGGESLFMTHFTNEGREKRHVVFAAPYPGQVVPLDLAALGGTLFCQRDSFLCAAKGT 123
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
++ +R+ G EGF+ Q+L G G+AF+ AGG+V++K L+ GE + +D C+V
Sbjct: 124 RIGVAFTRRLGAGFFGGEGFILQRLEGDGMAFVHAGGTVIRKELK-GETLRLDTGCLVGF 182
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
TS ++ IK G +R +FGG+ LV A ++G G V+IQ+LPF RL+ R+ A
Sbjct: 183 TSGIDYNIKLAGGLRSMLFGGEGLVLATLSGHGTVWIQTLPFSRLAGRVQAAA 235
>gi|456737361|gb|EMF62056.1| DUF124 domain-containing protein [Stenotrophomonas maltophilia
EPM1]
Length = 338
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 124/231 (53%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
I F+I+G E Q +I L P E +A G++ F +++M+ ++ N
Sbjct: 77 IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGGFMGKVMA 136
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V G G V AAP +L + L GG L+CQ D+FL
Sbjct: 137 AGKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLVCQKDSFLAGAR 196
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++ +++ + G EGF+ QKL G G FI AGG VV++ L GE + VD C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERLDVDTGCVV 256
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A +V++ ++ I+ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHPTVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307
>gi|408825445|ref|ZP_11210335.1| hypothetical protein PgenN_20122 [Pseudomonas geniculata N1]
Length = 338
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 124/231 (53%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
I F+I+G E Q +I L P E +A G++ F +++M+ ++ N
Sbjct: 77 IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGGFMGKVMA 136
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V G G V AAP +L + L GG L+CQ D+FL
Sbjct: 137 AGKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGAR 196
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++ +++ + G EGF+ QKL G G FI AGG VV++ L GE + VD C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERLDVDTGCVV 256
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A +V++ ++ I+ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHPTVDMDVRRVTGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307
>gi|422631722|ref|ZP_16696904.1| hypothetical protein PSYPI_19011 [Pseudomonas syringae pv. pisi
str. 1704B]
gi|330941577|gb|EGH44361.1| hypothetical protein PSYPI_19011 [Pseudomonas syringae pv. pisi
str. 1704B]
Length = 358
Score = 142 bits (358), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 81/237 (34%), Positives = 127/237 (53%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ +
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLLGA 174
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351
>gi|194363814|ref|YP_002026424.1| hypothetical protein Smal_0036 [Stenotrophomonas maltophilia
R551-3]
gi|194346618|gb|ACF49741.1| protein of unknown function DUF124 [Stenotrophomonas maltophilia
R551-3]
Length = 338
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 126/231 (54%), Gaps = 11/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMW----- 74
I F+I+G E Q +I L P E +A G++ F +++M+ ++ + G+
Sbjct: 77 IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAATGDQGGGLMGKVMA 136
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V G G V AAP +L + L GG L+CQ D+FL
Sbjct: 137 AGKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGAR 196
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++ +++ + G EGF+ QKL G G FI AGG VV++ L GE + VD C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERLDVDTGCVV 256
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A +V++ ++ I+ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHPTVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307
>gi|344942790|ref|ZP_08782077.1| protein of unknown function DUF124 [Methylobacter tundripaludum
SV96]
gi|344260077|gb|EGW20349.1| protein of unknown function DUF124 [Methylobacter tundripaludum
SV96]
Length = 368
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 83/245 (33%), Positives = 128/245 (52%), Gaps = 11/245 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
I F+I+G E Q +I L P E VA G+M + S ++ME ++ + G+ L G
Sbjct: 110 IDFKIVGHEMQFVEIELDPGESAVAEAGAMMYKSADVKMETVFGDGSASQQAGIMGLLLG 169
Query: 80 --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+++ V G V AAP IL ++L + +L+CQ D+FL +
Sbjct: 170 AGKRLLTNESLFITVFTQTAQCKGKVAFAAPFPGTILALNLKDYDRKLICQKDSFLAAAK 229
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++ +R+ + G EGF+ QKL G G F+ AGG + + L GE + VD C+
Sbjct: 230 GVQIGIHFQRRILTGLFGGEGFIMQKLEGDGWVFVHAGGCLQRIELSAGETLHVDTGCLA 289
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A+TSSV+ I+ I+ +FGG+ L A +TGPG V++QSLPF RL+ R+ A
Sbjct: 290 AMTSSVDFDIQQASGIKTMLFGGEGLFFAKLTGPGTVWLQSLPFSRLAGRMFSAAPQGGG 349
Query: 252 RENPK 256
R+ +
Sbjct: 350 RQQDE 354
>gi|389799295|ref|ZP_10202293.1| hypothetical protein UUC_16100 [Rhodanobacter sp. 116-2]
gi|388443286|gb|EIL99443.1| hypothetical protein UUC_16100 [Rhodanobacter sp. 116-2]
Length = 337
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/237 (32%), Positives = 130/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I+G + Q +I L P E +A G++ + +++M+ ++ + G
Sbjct: 74 IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKEAAVQMDTVFGDGSSGGQSGGGLMDKL 133
Query: 75 -----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
+ + G+++ + V + G V AAP ++ + L+ GG L+CQ DAFL
Sbjct: 134 LSAGKRVITGESLFTTVFTHAGQGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAG 193
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V++ +++ + G EGF+ QKL G G F+ AGG+VVQ+ L+ GE + VD C
Sbjct: 194 ARGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVQRELQAGERLDVDTGC 253
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA +VN+ +K G I+ +FGG+ + A +TGPG V++QSLPF R++ R+ A
Sbjct: 254 VVAFHDTVNMDVKPVGGIKSMLFGGEGVFLATLTGPGTVWLQSLPFSRMAGRMLAAA 310
>gi|389783910|ref|ZP_10195145.1| hypothetical protein UU7_14525 [Rhodanobacter spathiphylli B39]
gi|388434114|gb|EIL91067.1| hypothetical protein UU7_14525 [Rhodanobacter spathiphylli B39]
Length = 336
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/236 (32%), Positives = 128/236 (54%), Gaps = 12/236 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
I ++I+G + Q +I L P E +A G++ + ++ M+ ++ + G L
Sbjct: 74 IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKEAAVRMDTVFGDGSSGGQSGGLMDKLL 133
Query: 79 --------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G+++ + V + G V AAP ++ + L+ GG L+CQ DAFL
Sbjct: 134 SAGKRVITGESLFTTVFTHTGQGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAGA 193
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V++ +++ + G EGF+ QKL G G F+ AGG+VVQ+ L+ GE + VD C+
Sbjct: 194 RGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVQRELKAGERLDVDTGCV 253
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
VA +VN+ ++ G I+ +FGG+ + A +TGPG V++QSLPF R++ R+ A
Sbjct: 254 VAFHDTVNMDVRPVGGIKSMLFGGEGVFLATLTGPGTVWLQSLPFSRMAGRMLAAA 309
>gi|409397854|ref|ZP_11248712.1| hypothetical protein C211_20088 [Pseudomonas sp. Chol1]
gi|409117593|gb|EKM94020.1| hypothetical protein C211_20088 [Pseudomonas sp. Chol1]
Length = 248
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 129/232 (55%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MW---- 74
+ + ILG AQ +I+L P E V+A G+M +M+ + E + G +W
Sbjct: 6 LEYDILGQSAQSVEIILDPGETVIAEAGAMNYMTEGVRFETRMGDGSASGVLGKLWGMGK 65
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ L G+++ N G V AAP ++P+DLA GG L CQ D+FLC+ +
Sbjct: 66 RMLTGESLFLTHFTNNGKGKARVAFAAPYPGTVVPVDLAALGGRLTCQKDSFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
V L +R+ G EGF+ Q+L G GLAF+ AGG+V++K L+ E + +D C+VA +
Sbjct: 126 VGIALSKRIGAGFFGGEGFILQRLEGDGLAFLHAGGTVIRKELK-DETLRLDTGCLVAFS 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ I+ G ++ +FGG+ L+ + G G V+IQSLPF RL++R+ A
Sbjct: 185 DGIDYDIQLAGGLKSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYAAT 236
>gi|357415830|ref|YP_004928850.1| hypothetical protein DSC_00780 [Pseudoxanthomonas spadix BD-a59]
gi|355333408|gb|AER54809.1| hypothetical protein DSC_00780 [Pseudoxanthomonas spadix BD-a59]
Length = 335
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/236 (33%), Positives = 124/236 (52%), Gaps = 12/236 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
I F+I+G + Q ++ L P E VA GS+ F S++M+ ++ G
Sbjct: 74 IEFRIVGHDMQFVEVELDPGESAVAEAGSLMFKDASVQMDTVFGDGRTSGTGGGFMDKLL 133
Query: 79 --------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G+++ + V + G V AAP ++ + L+ GG LLCQ D+FL
Sbjct: 134 AAGKRVVTGESLFTTVYTHVGQGKAKVAFAAPYPGTVIALKLSEHGGRLLCQKDSFLAGA 193
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V++ +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C+
Sbjct: 194 RGVQLGIAFQRKIMTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELAAGERIDVDTGCV 253
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
VA V++ I+ I+ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 254 VAYHGGVDMDIRRVAGIKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 309
>gi|352080291|ref|ZP_08951360.1| protein of unknown function DUF124 [Rhodanobacter sp. 2APBS1]
gi|351685000|gb|EHA68069.1| protein of unknown function DUF124 [Rhodanobacter sp. 2APBS1]
Length = 337
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 76/237 (32%), Positives = 130/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I+G + Q +I L P E +A G++ + +++M+ ++ + G
Sbjct: 74 IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKEAAVQMDTVFGDGSSGGQSSGGLMDKL 133
Query: 75 -----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
+ + G+++ + V + G V AAP ++ + L+ GG L+CQ DAFL
Sbjct: 134 LSAGKRVITGESLFTTVFTHAGQGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAG 193
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V++ +++ + G EGF+ QKL G G F+ AGG+VVQ+ L+ GE + VD C
Sbjct: 194 ARGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVQRELQAGERLDVDTGC 253
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA +VN+ +K G I+ +FGG+ + A +TGPG V++QSLPF R++ R+ A
Sbjct: 254 VVAFHDTVNMDVKPVGGIKSMLFGGEGVFLATLTGPGTVWLQSLPFSRMAGRMLAAA 310
>gi|334141548|ref|YP_004534754.1| hypothetical protein [Novosphingobium sp. PP1Y]
gi|333939578|emb|CCA92936.1| conserved hypothetical protein [Novosphingobium sp. PP1Y]
Length = 276
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/237 (34%), Positives = 124/237 (52%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
+ F+I G E Q +I L P E VA G+M + +EM ++ + +G
Sbjct: 18 VDFEIKGQELQFVEIELDPGESAVAEAGAMVWKDAPIEMSTVFGDGSGGEGSGFMGKLLG 77
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V + G V AAP ILPI L GG L+CQ DAFL +
Sbjct: 78 AGKRLVTGESLFTTVFTHHGKGKARVAFAAPVPGAILPIKLDDVGGRLICQKDAFLAAAR 137
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + QR+ + G EGF+ QKL G G F+ GG+V+++ L GE + +D C+
Sbjct: 138 GVSIGVHFQQRIMTGLFGGEGFIMQKLEGDGWVFVQMGGTVIERELAAGEELHIDTGCVA 197
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
A TS ++ + G ++ +FGG+ + A + GPG V++QSLPF RL+ R+ A S
Sbjct: 198 AFTSGIDFDVIRAGSVKSMIFGGEGVFFARLRGPGKVWVQSLPFSRLAGRMLAAAGS 254
>gi|433677717|ref|ZP_20509665.1| putative protein M6_Spy0233 [Xanthomonas translucens pv.
translucens DSM 18974]
gi|430817166|emb|CCP40086.1| putative protein M6_Spy0233 [Xanthomonas translucens pv.
translucens DSM 18974]
Length = 337
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/230 (33%), Positives = 124/230 (53%), Gaps = 10/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I F+I+G E Q +I L P E VA G++ F +++M+ ++ + G
Sbjct: 77 IDFRIVGHEMQFVEIALDPGESAVAEAGALMFKDAAVQMDTVFGDGSGQGGGLMGKLFSA 136
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ + G+++ + V + G V AAP +L + L GG L+CQ D+FL
Sbjct: 137 GKRLVTGESLFTTVFTHQGQGKATVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGARG 196
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ AGG VV++ L GE + VD C+VA
Sbjct: 197 VSLGIHFQRKIMTGLFGGEGFIMQKLEGDGWVFVHAGGCVVERELAAGERLDVDTGCVVA 256
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
S+V++ ++ I+ FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 257 FHSTVDMDVRAVSGIKSMFFGGEGMFLATLTGPGKVWLQSLPFSRLAGRM 306
>gi|389810514|ref|ZP_10205843.1| hypothetical protein UUA_15723 [Rhodanobacter thiooxydans LCS2]
gi|388440808|gb|EIL97141.1| hypothetical protein UUA_15723 [Rhodanobacter thiooxydans LCS2]
Length = 336
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 76/236 (32%), Positives = 128/236 (54%), Gaps = 12/236 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
I ++I+G + Q +I L P E +A G++ + +++M+ ++ + G L
Sbjct: 74 IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKESAVQMDTVFGDGSSSGQGGGLMDKLL 133
Query: 79 --------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G+++ + V + G V AAP ++ + L+ GG L+CQ DAFL
Sbjct: 134 SAGKRVITGESLFTTVFTHAGSGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAGA 193
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V++ +++ + G EGF+ QKL G G F+ AGG+VVQ+ L GE + VD C+
Sbjct: 194 RGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVQRELRAGERLDVDTGCV 253
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
VA +V++ +K G I+ +FGG+ + A +TGPG V++QSLPF R++ R+ A
Sbjct: 254 VAFHDTVSMDVKPVGGIKSMLFGGEGVFLATLTGPGTVWLQSLPFSRMAGRMLAAA 309
>gi|323489343|ref|ZP_08094573.1| hypothetical protein GPDM_08325 [Planococcus donghaensis MPA1U2]
gi|323396984|gb|EGA89800.1| hypothetical protein GPDM_08325 [Planococcus donghaensis MPA1U2]
Length = 266
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 131/232 (56%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
I +++ G + Q ++ L P E V+A G++ M + ME I+ ++ +
Sbjct: 6 IDYKLHGDDMQFVEVELDPAETVIAEAGALMMMEDGIVMETIFGDGSQSSGGSGLMGKLM 65
Query: 72 GMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G + + G+++ N G V AAP +I+P+DL+ G+++CQ DAFL +
Sbjct: 66 GAGKRIITGESLFMTTFTNNGTGKRHVSFAAPYPGKIIPMDLSTLNGKIICQKDAFLAAA 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +++ G EGF+ QKL G GLAF+ AGG++ +KNL+ GEV+ VD C+
Sbjct: 126 KGVSIGIEFQRKIGAGFFGGEGFIMQKLEGDGLAFVHAGGTIYRKNLQKGEVLRVDTGCL 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VA+T V+ I+ I+ A+FGG+ L A + GPG V+IQS+PF RL+ R+
Sbjct: 186 VAMTGDVDYNIEAVPGIKTALFGGEGLFFATLRGPGSVWIQSMPFSRLASRV 237
>gi|416014288|ref|ZP_11562147.1| hypothetical protein PsgB076_03534 [Pseudomonas syringae pv.
glycinea str. B076]
gi|320326072|gb|EFW82129.1| hypothetical protein PsgB076_03534 [Pseudomonas syringae pv.
glycinea str. B076]
Length = 352
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 81/234 (34%), Positives = 126/234 (53%), Gaps = 11/234 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 109 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 168
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 169 GKRMLTGESLFTTVFSQQDSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 228
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 229 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 288
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 289 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 342
>gi|197106783|ref|YP_002132160.1| hypothetical protein PHZ_c3322 [Phenylobacterium zucineum HLK1]
gi|196480203|gb|ACG79731.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1]
Length = 311
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 85/254 (33%), Positives = 133/254 (52%), Gaps = 12/254 (4%)
Query: 4 PFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN 63
P +P+ + + D V F+I G E Q +I L P E VA G++ + S+EM
Sbjct: 34 PMSQSPWSHHRNPNVADDV-DFEIKGQELQFVEIELDPGESAVAEAGALVWKDASVEMTT 92
Query: 64 IYIPENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDL 112
++ + +G + L G+++ + V + G V A+P+ ILP++L
Sbjct: 93 VFGDGSGGQGGGFMGKLLGAGKRLITGESLFTTVFTHQGQGKARVAFASPTPGAILPLNL 152
Query: 113 AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSV 172
A GG L+CQ D+FL + V + +RV + G EGF+ Q+L G G F+ GG+V
Sbjct: 153 AEVGGTLICQKDSFLAAARGVSIGVQFQRRVMTGLFGGEGFIMQRLEGDGWVFVQMGGTV 212
Query: 173 VQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
V++ L G+ I VD C+ A T SV+ + G ++ FGG+ L A + GPG V++QS
Sbjct: 213 VERELAAGQEIHVDTGCLAAYTPSVDFDLVMAGGVKSVFFGGEGLFFARLRGPGKVWVQS 272
Query: 233 LPFHRLSQRIARAV 246
LPF RL+ R+ A
Sbjct: 273 LPFSRLAGRMMAAA 286
>gi|419954779|ref|ZP_14470914.1| hypothetical protein YO5_07918 [Pseudomonas stutzeri TS44]
gi|387968392|gb|EIK52682.1| hypothetical protein YO5_07918 [Pseudomonas stutzeri TS44]
Length = 248
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 82/232 (35%), Positives = 128/232 (55%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MW---- 74
+ + ILG AQ +I+L P E V+A G+M +M+ + E + G +W
Sbjct: 6 LEYDILGQSAQSVEIILDPDETVIAEAGAMNYMTEGVRFETRMGDGSSSGVLGKLWGMGR 65
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ L G+++ N G V AAP ++P+DLA GG L CQ D+FLC+ +
Sbjct: 66 RMLTGESLFLTHFTNSGKGKARVAFAAPYPGTVVPVDLAALGGRLTCQKDSFLCAAYGTR 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
V L +R+ G EGF+ Q L G GLAF+ AGG+V++K L+ E + +D C+VA +
Sbjct: 126 VGIALSKRIGAGFFGGEGFILQCLEGDGLAFLHAGGTVIRKELK-DETLRLDTGCLVAFS 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ I+ G ++ +FGG+ L+ + G G V+IQSLPF RL++R+ A
Sbjct: 185 DGIDYDIQLAGGLKSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYAAT 236
>gi|281209445|gb|EFA83613.1| hypothetical protein PPL_02679 [Polysphondylium pallidum PN500]
Length = 352
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 77/220 (35%), Positives = 123/220 (55%), Gaps = 1/220 (0%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I G E+QV Q+ L P + A G++ MS +MEM+ M + + G+ I
Sbjct: 100 WEIFGHESQVIQLHLNPGTSITAETGALLEMSATMEMDTTARGGFMSSMKRMITGQGIFL 159
Query: 85 VVLRNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
+N ++ + ++P +++IL I ++ GGEL+C+ +AFLC ND+++ L R
Sbjct: 160 TKFKNSSDTETARITFSSPYISKILAIQMSEIGGELICKKNAFLCGDNDIEIGPRLTNRF 219
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
G +GF+ QKLTG GLAFI GS++ + L+ E I V CIVA ++ I++
Sbjct: 220 SIGFFGGQGFILQKLTGHGLAFIHGSGSIMYRMLKPMETIKVSTGCIVAFEPTIQYDIEW 279
Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
+ FGG+ L A ++GPG+V +QSLPF +L IA
Sbjct: 280 VKGAKNIFFGGEGLFLATLSGPGLVILQSLPFEKLVSAIA 319
>gi|285019906|ref|YP_003377617.1| hypothetical protein XALc_3144 [Xanthomonas albilineans GPE PC73]
gi|283475124|emb|CBA17623.1| hypothetical protein XALC_3144 [Xanthomonas albilineans GPE PC73]
Length = 335
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 125/231 (54%), Gaps = 10/231 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF- 78
I ++I+G E Q +I L P E VA G++ F +++M+ ++ N+ G+ L
Sbjct: 74 AIDYRIVGHEMQFVEIELDPGESAVAEAGALMFKDTAVQMDTVFGDGSANQGGLMGKLLS 133
Query: 79 -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G T+ + V + G V AAP +L + L GG L+CQ D+FL
Sbjct: 134 AGKRLVTGATLFTTVFTHQGQGKAKVAFAAPYPGTVLAMKLDQHGGCLICQKDSFLAGAR 193
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +++ + G EGF+ QKL G G F+ AGG VV++ L GE + VD C+V
Sbjct: 194 GVSLGIHFQRKIMTGLFGGEGFVMQKLEGDGWVFVHAGGCVVERELAAGERVEVDTGCVV 253
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A ++V++ ++ I+ +FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 254 AFHATVDMDVRTVSGIKSMLFGGEGMFLATLTGPGKVWLQSLPFSRLAARM 304
>gi|359397666|ref|ZP_09190692.1| hypothetical protein NSU_0378 [Novosphingobium pentaromativorans
US6-1]
gi|357600857|gb|EHJ62550.1| hypothetical protein NSU_0378 [Novosphingobium pentaromativorans
US6-1]
Length = 348
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 80/237 (33%), Positives = 124/237 (52%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
+ F+I G E Q +I L P E VA G+M + +EM ++ + +G
Sbjct: 90 VDFEIKGQELQFVEIELDPGESAVAEAGAMVWKDAPIEMSTVFGDGSGGEGSGFMGKLLG 149
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V + G V AAP ILPI L GG L+CQ DAFL +
Sbjct: 150 AGKRLVTGESLFTTVFTHHGKGKARVAFAAPVPGAILPIKLDDVGGRLICQKDAFLAAAR 209
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + QR+ + G EGF+ QKL G G F+ GG+V+++ L GE + +D C+
Sbjct: 210 GVSIGVHFQQRIMTGLFGGEGFIMQKLEGDGWVFVQMGGTVIERELAAGEELHIDTGCVA 269
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
A T+ ++ + G ++ +FGG+ + A + GPG V++QSLPF RL+ R+ A S
Sbjct: 270 AFTAGIDFDVIRAGSVKSMIFGGEGVFFARLRGPGKVWVQSLPFSRLAGRMLAAAGS 326
>gi|345864493|ref|ZP_08816693.1| hypothetical protein TevJSym_ao00180 [endosymbiont of Tevnia
jerichonana (vent Tica)]
gi|345124352|gb|EGW54232.1| hypothetical protein TevJSym_ao00180 [endosymbiont of Tevnia
jerichonana (vent Tica)]
Length = 250
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 78/231 (33%), Positives = 132/231 (57%), Gaps = 12/231 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF-- 78
I +QILG + Q+ ++ L P E V+A G+M ++ + E + + G++ LF
Sbjct: 6 IDYQILGHDLQLVEVELDPGETVIAEAGAMTYLEQEISFETKMGDGSNPDQGIFGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ + + G + V +AP ++P++LA GG+++CQ DAFLC+
Sbjct: 66 GKRMITGESLFTTHFTHSGSAGKRKVAFSAPFPGSVIPLNLAALGGKVICQKDAFLCAAL 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
K+ +++ + + G EGF+ Q L G G AFI AGG+VV+K L GE + VD C+V
Sbjct: 126 GTKLGIAFTKKLGSGLFGGEGFILQSLEGDGTAFIQAGGTVVEKQLN-GETLRVDTGCLV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ +G ++ +FGG+ + A ++G G V+IQSLPF R++ RI
Sbjct: 185 GFTEGIDYSIEMSGGLKSMLFGGEGIFLATLSGSGTVWIQSLPFSRMADRI 235
>gi|380512373|ref|ZP_09855780.1| hypothetical protein XsacN4_14202 [Xanthomonas sacchari NCPPB 4393]
Length = 337
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 79/235 (33%), Positives = 125/235 (53%), Gaps = 11/235 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP---ENEVGMWQWLF- 78
I ++I+G E Q +I L P E VA G++ F ++ M+ ++ ++ G+ L
Sbjct: 76 IDYRIVGHEMQFVEIELDPGESAVAEAGALMFKDAAVRMDTVFGDGSNADQGGLMGKLLS 135
Query: 79 -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ + V + G V AAP +L + L GG L+CQ D+FL
Sbjct: 136 AGKRLVTGESLFTTVFTHQGHGKAKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGAR 195
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + ++V + G EGF+ QKL G G F+ AGG VV++ L GE + VD C+V
Sbjct: 196 GVSLGIHFQRKVMTGLFGGEGFIMQKLEGDGWVFVHAGGCVVERELAAGERLDVDTGCVV 255
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
A +SV++ ++ I+ FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 256 AFHASVDMDVRPVSGIKSMFFGGEGMFLATLTGPGKVWLQSLPFSRLAGRMLAAA 310
>gi|422641956|ref|ZP_16705377.1| hypothetical protein PSYCIT7_23825 [Pseudomonas syringae Cit 7]
gi|330954341|gb|EGH54601.1| hypothetical protein PSYCIT7_23825 [Pseudomonas syringae Cit 7]
Length = 358
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 81/237 (34%), Positives = 126/237 (53%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + + G G V A P ILP++L FGG L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTMFSQQGSGKGRVAFAPPYPGTILPLNLRDFGGRLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A S
Sbjct: 295 MTQTVDYDIRTVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351
>gi|167647234|ref|YP_001684897.1| hypothetical protein Caul_3272 [Caulobacter sp. K31]
gi|167349664|gb|ABZ72399.1| protein of unknown function DUF124 [Caulobacter sp. K31]
Length = 275
Score = 140 bits (352), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 82/248 (33%), Positives = 128/248 (51%), Gaps = 13/248 (5%)
Query: 12 PYVYQSPQDAV--IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN 69
P+ +Q I F+I G E Q +I L E VA GS + S+ M ++ +
Sbjct: 5 PWSHQRSASVADDIDFEIKGQELQFLEIELDAGESAVAEAGSFVWKDASVGMTTVFGDGS 64
Query: 70 E----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
+G + L G+++ + V + G V A+P+ ILP++L GG
Sbjct: 65 GDNGGGFMGKLLGAGKRLVTGESLFTTVFTHNGAGKARVAFASPTPGAILPLNLGQLGGS 124
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
L+CQ D+FL + V + +RV + G EGF+ Q+L G G F+ GG++V++ L+
Sbjct: 125 LICQKDSFLAAARGVSIGVHFQKRVMTGLFGGEGFIMQRLDGDGWVFVQMGGTLVERELK 184
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
GE + VD C+ A T V+ + G +R +FGG+ L A + GPG V+IQSLPF RL
Sbjct: 185 AGEELHVDTGCLAAYTPGVDFDLVMAGGVRSVLFGGEGLFFARLRGPGKVWIQSLPFSRL 244
Query: 239 SQRIARAV 246
+ R+ +A
Sbjct: 245 AGRMLQAA 252
>gi|218438880|ref|YP_002377209.1| hypothetical protein PCC7424_1909 [Cyanothece sp. PCC 7424]
gi|218171608|gb|ACK70341.1| protein of unknown function DUF124 [Cyanothece sp. PCC 7424]
Length = 234
Score = 140 bits (352), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 125/229 (54%), Gaps = 1/229 (0%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
VI +QI G + Q+ +I L E V A G+M ++ +EM+ G + L G+
Sbjct: 4 VIDYQIYGDDLQLVEITLDLGEGVRAEAGAMAYLEQGIEMQTSTGGGFLAGFKRALAGEG 63
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G V AAP +I+P+ L GG +CQ DAF+C+ +++ +
Sbjct: 64 FFITNFANRGTRRAKVAFAAPYPGKIIPLQLGQLGGTFICQKDAFICAAKGIEIDVEFTK 123
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
R+ G EGF+ Q+L G GLAFI AGG++V+K L GE I VD C+VA + +V +I
Sbjct: 124 RLGTGFFGGEGFILQRLQGDGLAFIHAGGALVEKRLMSGEDIRVDTGCVVAFSPTVRYEI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+ + +FGG+ L A +TGPG +++QSLP +L+ R+ R SP+
Sbjct: 184 EMVKGFKNVLFGGEGLFLAKLTGPGTIYLQSLPLSKLAARL-RQYISPS 231
>gi|392421934|ref|YP_006458538.1| hypothetical protein A458_14430 [Pseudomonas stutzeri CCUG 29243]
gi|390984122|gb|AFM34115.1| hypothetical protein A458_14430 [Pseudomonas stutzeri CCUG 29243]
Length = 248
Score = 140 bits (352), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 81/233 (34%), Positives = 128/233 (54%), Gaps = 9/233 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MW---- 74
+ ++ILG AQ +I+L P E V+A G+M +M+ + E + G +W
Sbjct: 6 LEYEILGQSAQSVEIILDPDETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLWGAGK 65
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ G+++ N + V AAP ++ IDLA GG+L CQ DAFLC+
Sbjct: 66 RMFTGESLFLTHFTNESKAKARVAFAAPYPGTVVAIDLAAVGGKLFCQKDAFLCAAYGTC 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + +R+ G EGF+ Q+L G GLAF+ AGG+V++K L E + +D C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLEGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFT 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
++ I+ G +R +FGG+ L+ + G G V+IQSLPF RL++R+ A +
Sbjct: 185 QGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEATS 237
>gi|116753572|ref|YP_842690.1| hypothetical protein Mthe_0253 [Methanosaeta thermophila PT]
gi|116665023|gb|ABK14050.1| protein of unknown function DUF124 [Methanosaeta thermophila PT]
Length = 233
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 133/230 (57%), Gaps = 3/230 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ + ILG + QV ++ + P E + + G+M FM ++M + G+ + + G+ +
Sbjct: 3 LRYTILGDDLQVLEVQMAPGESIRSEAGAMLFMDDEIDM-SASTGGIIKGLKRAVAGEGL 61
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
N + +AP +ILP++L FGG+LLCQ AFLC + V++ +R
Sbjct: 62 FITSFTNRSDDMKRMAFSAPFPGKILPLNLDEFGGQLLCQRGAFLCGESGVEIELAFTKR 121
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ Q+L G+G+ FI AGG+++ ++L+ EVI VD C+VA + V+ I+
Sbjct: 122 LGAGFFGGEGFVLQRLRGRGMVFIHAGGALITRDLQEEEVINVDTGCLVAFSRDVDYDIR 181
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI--ARAVTSPN 250
+ G IR A+FGG+ + A + GPG V++QSLP RL+ RI R V+S N
Sbjct: 182 FVGGIRNALFGGEGIFLARLKGPGRVYLQSLPLSRLADRILQGRVVSSNN 231
>gi|209965162|ref|YP_002298077.1| hypothetical protein RC1_1868 [Rhodospirillum centenum SW]
gi|209958628|gb|ACI99264.1| conserved hypothetical protein [Rhodospirillum centenum SW]
Length = 335
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 83/236 (35%), Positives = 123/236 (52%), Gaps = 11/236 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI----PENEVGMWQWL- 77
I F+I G E Q ++ L P E VA G+M + +EM I+ PE M + +
Sbjct: 78 IDFRIYGEEMQFVEVELDPGESAVAEAGAMVYKDSRVEMTTIFGDGSGPEGGSFMDKLIG 137
Query: 78 ------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ + V + G G V AP IL + L+ GG L+CQ D+FL +
Sbjct: 138 AGKRVITGESLFTTVFTHQGTGKGKVAFGAPYPGTILALKLSDVGGRLICQKDSFLAAAK 197
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +R+ + G EGF+ Q L G G F+ GG+VV++ L GE + VD C+
Sbjct: 198 GVSIGIHFQRRILTGLFGGEGFIMQNLAGDGWVFVHVGGTVVERQLAAGEELHVDTGCLA 257
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
A+T V+ + G I+ VFGG+ + A + GPG V+IQSLPF RL+ R+ A T
Sbjct: 258 AMTPEVDFDLVQVGGIKSMVFGGEGVFFASLRGPGRVWIQSLPFSRLAGRMLAAAT 313
>gi|295688666|ref|YP_003592359.1| hypothetical protein Cseg_1241 [Caulobacter segnis ATCC 21756]
gi|295430569|gb|ADG09741.1| protein of unknown function DUF124 [Caulobacter segnis ATCC 21756]
Length = 275
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/252 (31%), Positives = 132/252 (52%), Gaps = 13/252 (5%)
Query: 9 PFQPYVYQSPQDAV--IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
P P+ +Q I F+I G + Q +I L P E VA G+ + +++M ++
Sbjct: 2 PSSPWSHQRNASVADDIDFEIKGDDLQFVEIELDPGESAVAEAGAFVWKDATVQMTTVFG 61
Query: 67 PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
+ +G + L G+++ + V + G V A+P+ ILP++LA
Sbjct: 62 DGSADQGGGFMGKLLGAGKRLVTGESLFTTVFTHTGSGKARVAFASPTPGSILPLNLAEL 121
Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
GG L+CQ D+FL + V + +R+ + G EGF+ Q+L G G F+ GG++V++
Sbjct: 122 GGTLICQKDSFLAAARGVSIGLHFQKRMMTGLFGGEGFIMQRLEGDGWVFVQMGGALVER 181
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
L GE + +D C+ A T SV+ + G ++ +FGG+ L A + GPG V+IQSLPF
Sbjct: 182 ELAPGEELHIDTGCVAAYTPSVDFDLVMAGGVKSVLFGGEGLFFARLRGPGKVWIQSLPF 241
Query: 236 HRLSQRIARAVT 247
RL+ R+ +A
Sbjct: 242 ARLAGRMLQAAN 253
>gi|399063909|ref|ZP_10747019.1| hypothetical protein PMI02_03433 [Novosphingobium sp. AP12]
gi|398031371|gb|EJL24758.1| hypothetical protein PMI02_03433 [Novosphingobium sp. AP12]
Length = 276
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/237 (33%), Positives = 125/237 (52%), Gaps = 11/237 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
+ F+I G E Q +I L P E VA G+M + +++M ++ + +G
Sbjct: 18 VDFEIKGQELQFVEIELDPGESAVAEAGAMVWKDAAIDMTTVFGDGSGDQGGGFMGKLLG 77
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V + G V AAP ILPI L G+L+CQ DAFL +
Sbjct: 78 AGKRLVTGESLFTTVFTHRGSGKARVSFAAPVPGAILPIKLDDVDGQLICQKDAFLAAAR 137
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + Q++ + G EGF+ QKL G G F+ GG+V+++ L GE + VD C+
Sbjct: 138 GVSIGIHFQQKIMTGLFGGEGFIMQKLEGDGWVFVQMGGTVIERELRAGEELHVDTGCVA 197
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
A+T+ V + G ++ +FGG+ + A + GPG V+IQSLPF RL+ R+ A S
Sbjct: 198 AMTAGVGFDVIRAGSVKSMIFGGEGVFFARLRGPGKVWIQSLPFSRLAGRMLAAAGS 254
>gi|329851756|ref|ZP_08266437.1| hypothetical protein ABI_45250 [Asticcacaulis biprosthecum C19]
gi|328839605|gb|EGF89178.1| hypothetical protein ABI_45250 [Asticcacaulis biprosthecum C19]
Length = 279
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 87/261 (33%), Positives = 131/261 (50%), Gaps = 13/261 (4%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
S P + S I F+I G E Q +I L P E +A G+M + S+ M ++
Sbjct: 5 SVPTTHHHRSSGVADDIDFEIKGQELQFVEIELDPGESAIAEAGAMVWKDASVGMTTVFG 64
Query: 67 PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
+ +G + L G+++ + V + G V +AP I+P+ L+
Sbjct: 65 DASGQQGGGFMGALLGAGKRLITGESLFTTVFTHNGHGKARVAFSAPVPGAIVPLKLSDV 124
Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
GG L+CQ D+FL + V + +R+ + G EGF+ QKL G G F+ GG VV++
Sbjct: 125 GGRLICQKDSFLAAAKGVSLGIAFQKRIMTGLFGGEGFIMQKLEGDGWVFVQFGGMVVER 184
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
L+ GE + VD CI A T +V+ + G I+ +F G+ + A +TGPG V+IQSLPF
Sbjct: 185 ELKAGEQLHVDTGCIAAFTDTVDFDLIQAGGIKSMLFAGEGVFFAQLTGPGKVWIQSLPF 244
Query: 236 HRLSQRIARAV--TSPNMREN 254
RL+ RIA PN E
Sbjct: 245 SRLAGRIAAVAGGAGPNRGEG 265
>gi|16125367|ref|NP_419931.1| hypothetical protein CC_1115 [Caulobacter crescentus CB15]
gi|221234109|ref|YP_002516545.1| hypothetical protein CCNA_01172 [Caulobacter crescentus NA1000]
gi|13422427|gb|AAK23099.1| conserved hypothetical protein [Caulobacter crescentus CB15]
gi|220963281|gb|ACL94637.1| hypothetical protein CCNA_01172 [Caulobacter crescentus NA1000]
Length = 280
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/255 (31%), Positives = 131/255 (51%), Gaps = 13/255 (5%)
Query: 7 STPFQPYVYQSPQDAV--IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENI 64
S P P+ +Q + I F+I G + Q +I L P E VA G+ + +++M +
Sbjct: 5 SMPTSPWSHQRNANIADDIDFEIKGEDLQFVEIELDPGESAVAEAGAFVWKDATVQMTTV 64
Query: 65 YIPENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA 113
+ +G + L G+++ + V + G V A+P+ ILP++L
Sbjct: 65 FGDGAADQGGGFMGKLLGAGKRLITGESLFTTVFTHHGTGKARVAFASPTPGTILPLNLG 124
Query: 114 MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVV 173
GG L+CQ D+FL + V + +R+ + G EGF+ Q+L G G F+ GG++V
Sbjct: 125 QLGGTLICQKDSFLAAARGVSIGLHFQKRMMTGLFGGEGFIMQRLEGDGWVFVQMGGTLV 184
Query: 174 QKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
++ L GE + +D C+ A T V+ + G ++ +FGG+ L A V GPG V+IQSL
Sbjct: 185 ERTLAPGEELHIDTGCLAAYTPDVDFDLVMAGGVKSVLFGGEGLFFARVRGPGKVWIQSL 244
Query: 234 PFHRLSQRIARAVTS 248
PF RL+ R+ +A
Sbjct: 245 PFARLAGRMLQAANH 259
>gi|410093948|ref|ZP_11290411.1| hypothetical protein AAI_24394 [Pseudomonas viridiflava UASWS0038]
gi|409758661|gb|EKN43938.1| hypothetical protein AAI_24394 [Pseudomonas viridiflava UASWS0038]
Length = 362
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 84/244 (34%), Positives = 129/244 (52%), Gaps = 12/244 (4%)
Query: 13 YVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENE 70
Y S D V F++ G E Q ++ L E VA G++ + + ++ME I+
Sbjct: 110 YGSGSATDGV-DFKLYGTETQFVELELDRGESAVAEAGALMYKTCDVQMETIFGDGSNQS 168
Query: 71 VGMWQWLFG--------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQ 122
G+ LFG +++ + V G G V AAP ILP++L FGG+L+CQ
Sbjct: 169 SGLLGSLFGAGKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLGDFGGKLICQ 228
Query: 123 PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEV 182
D+FL V + +++ + G EGF+ QKL G G F+ GG+V + L GE
Sbjct: 229 KDSFLAGAKGVSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEG 288
Query: 183 ITVDVSCIVAVTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQR 241
+ VD C+ A+T +V+ I+ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R
Sbjct: 289 LDVDTGCLAAMTQTVDYDIRMVGGGIKSMLFGGEGMFFARLTGPGTVWLQSLPFSRLAGR 348
Query: 242 IARA 245
+ A
Sbjct: 349 MLAA 352
>gi|393202307|ref|YP_006464149.1| hypothetical protein SSIL_3580 [Solibacillus silvestris StLB046]
gi|406666586|ref|ZP_11074352.1| hypothetical protein B857_02159 [Bacillus isronensis B3W22]
gi|327441638|dbj|BAK18003.1| uncharacterized conserved protein [Solibacillus silvestris StLB046]
gi|405385604|gb|EKB45037.1| hypothetical protein B857_02159 [Bacillus isronensis B3W22]
Length = 265
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/240 (33%), Positives = 130/240 (54%), Gaps = 13/240 (5%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
++ I + + G + Q ++ L PQE VVA GS+ M ++ME ++ + L
Sbjct: 2 KNHEIDYVLHGDDMQFVEVELDPQETVVAEAGSLMMMEDQIQMETVF-GDGSANQGSGLM 60
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + N G V A+P +I+P++L+ G+++CQ DAF
Sbjct: 61 GKLLGAGKRLITGESLFMTTFTNNGMGKRKVYFASPYPGKIIPMNLSELDGKIICQKDAF 120
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
L + V V +++ G EGF+ QKL G G+AF+ AGG++ ++ L+ GE + +D
Sbjct: 121 LAAAKGVSVGVEFQKKLGTGFFGGEGFIMQKLEGDGMAFVHAGGTIYERKLQPGETLRID 180
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA+T VN I+ I+ A+FGG+ L A ++GPG V++QSLPF RL+ R+ A
Sbjct: 181 TGCLVAMTQDVNYNIEMVSGIKTALFGGEGLFFATLSGPGTVWVQSLPFSRLASRVFAAA 240
>gi|167948526|ref|ZP_02535600.1| hypothetical protein Epers_19156 [Endoriftia persephone
'Hot96_1+Hot96_2']
Length = 289
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/231 (33%), Positives = 132/231 (57%), Gaps = 12/231 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF-- 78
I ++ILG + Q+ ++ L P E V+A G+M ++ + E + + G++ LF
Sbjct: 6 IDYKILGHDLQLVEVELDPGETVIAEAGAMTYLEQEISFETKMGDGSNPDQGIFGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ + + G + V +AP ++P++LA GG+++CQ DAFLC+
Sbjct: 66 GKRMITGESLFTTHFTHSGSAGKRKVAFSAPFPGSVIPLNLAALGGKVICQKDAFLCAAL 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
K+ +++ + + G EGF+ Q L G G AFI AGG+VV+K L GE + VD C+V
Sbjct: 126 GTKLGIAFTKKLGSGLFGGEGFILQSLEGDGTAFIQAGGTVVEKQLN-GETLRVDTGCLV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ +G ++ +FGG+ + A ++G G V+IQSLPF R++ RI
Sbjct: 185 GFTEGIDYSIEMSGGLKSMLFGGEGIFLATLSGSGTVWIQSLPFSRMADRI 235
>gi|114047855|ref|YP_738405.1| hypothetical protein Shewmr7_2361 [Shewanella sp. MR-7]
gi|113889297|gb|ABI43348.1| protein of unknown function DUF124 [Shewanella sp. MR-7]
Length = 264
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 128/230 (55%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMWQWLFG- 79
+ ++I+G Q+ ++ L P E V+A G+M ++ S E + E E G + L G
Sbjct: 8 VDYEIIGHSMQLVEVELDPHETVIAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
++I N G V AAP IL +DLA +GGEL+CQ D+FL +
Sbjct: 68 GKRVLTGESIFMTHFTNLGHQKRKVAFAAPYPGTILALDLAQYGGELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+VS ++R+ G EGF+ Q L G G+AFI AGG++++K L+ GE + VD C+V
Sbjct: 128 TRVSMKFNRRLGTGFFGGEGFILQSLQGDGMAFIHAGGTLIKKELK-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A +TG G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLTGHGTVWLQSLPFSRMADRI 236
>gi|383640052|ref|ZP_09952458.1| hypothetical protein SeloA3_00235 [Sphingomonas elodea ATCC 31461]
gi|32170807|gb|AAP57683.1| conserved hypothetical protein [Sphingomonas elodea ATCC 31461]
Length = 276
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 82/252 (32%), Positives = 132/252 (52%), Gaps = 12/252 (4%)
Query: 9 PFQPYVYQSPQDAV--IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
P P+ + + I F+I G E Q +I L P E VA G+M + + M+ ++
Sbjct: 3 PMSPWSHTRREGYADDIDFEIKGQELQFVEIELDPGESAVAEAGAMVWKDAPIAMQTVFG 62
Query: 67 P-ENEVGMW---------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFG 116
N+ G + + + G+ + + V + G V A+P I+P+ LA G
Sbjct: 63 DGSNQGGGFMDKLLGAGKRLVTGEGLFTTVFTHQGSGKARVAFASPVPGAIVPLRLADIG 122
Query: 117 GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKN 176
G L+CQ DAFL + V++ +R+ + G EGF+ Q+L G G F+ GG++V++
Sbjct: 123 GTLICQKDAFLAAAKGVQMGIAFQRRIMTGLFGGEGFIMQRLDGDGWVFVQMGGAIVERE 182
Query: 177 LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
L GE + VD C+ A+T SV+ + G +R +FGG+ + A + GPG V+IQSLPF
Sbjct: 183 LAPGEELHVDTGCLAALTPSVDFDLVSVGGVRSMLFGGEGVFFARLRGPGKVWIQSLPFA 242
Query: 237 RLSQRIARAVTS 248
RL+ R+ A +
Sbjct: 243 RLAGRMMAAASG 254
>gi|372272302|ref|ZP_09508350.1| hypothetical protein MstaS_14538 [Marinobacterium stanieri S30]
Length = 263
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 129/230 (56%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF-- 78
+ ++++G E Q+ ++ L P E V+A G+M +M ++ E + + G+ LF
Sbjct: 6 LDYELIGSEMQLVEVELDPNETVIAEAGAMTYMEEDIQFETRMGDGSNPDEGIMGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G++I + N G V AAP I+P+++A G ++CQ DAFLC+
Sbjct: 66 GKRVFTGESIFTTHFTNQGIGKRRVAFAAPYPGNIVPLNMAELGERIVCQKDAFLCAALG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+VS T ++R+ G EGF+ Q L G G+AFI AGG+V++K L+ GE + VD C+V
Sbjct: 126 TRVSITFNRRIGAGFFGGEGFILQTLEGDGMAFIQAGGTVIKKELK-GETLRVDTGCLVG 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T V I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 185 FTEGVEYDIQRAGGLKSMVFGGEGLFLATLKGTGTVWLQSLPFSRMADRI 234
>gi|304392201|ref|ZP_07374143.1| conserved hypothetical protein [Ahrensia sp. R2A130]
gi|303296430|gb|EFL90788.1| conserved hypothetical protein [Ahrensia sp. R2A130]
Length = 282
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 80/225 (35%), Positives = 116/225 (51%), Gaps = 11/225 (4%)
Query: 29 GGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY------IPENEVGMW-----QWL 77
G E Q +I L P E VA GSM F + S+EM + P +G + +
Sbjct: 30 GSEMQFVEIELDPGESAVAEAGSMMFKAPSIEMTTAFGDGSYKAPAGLLGKLAGAGKRLI 89
Query: 78 FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
G+++ + V + G V A+P I + L GG L+CQ DAFLC+ V +
Sbjct: 90 TGESLFTTVFTHNGDGKARVAFASPYPGHIAAVKLDEVGGRLICQKDAFLCAARGVTIGI 149
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
++V + G EGF+ QKL G G AF+ GG+ ++L GE I VD C+ A T +
Sbjct: 150 EFQRKVMTGLFGGEGFIMQKLEGDGWAFVHVGGTTTVRDLAPGEEIHVDTGCVAAYTGDI 209
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ G I+ ++FGG+ + A + GPG V+IQSLPF RL+ RI
Sbjct: 210 DFDFVRAGSIKSSIFGGEGMFFAKLRGPGRVWIQSLPFSRLAGRI 254
>gi|21111117|gb|AAM39479.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66571846|gb|AAY47256.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 287
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 129/239 (53%), Gaps = 15/239 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + + G L G
Sbjct: 22 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHDGQGGSSGGLMG 81
Query: 80 KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
K ++ + V + G V AAP +L + L+ GG L+CQ D+FL
Sbjct: 82 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGRLICQKDSFL 141
Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
V + +++ + G EGF+ QKL G G F+ AGG+VV+++L GE I VD
Sbjct: 142 AGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERDLAPGERIDVDT 201
Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA +SV++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 202 GCVVAYHASVDMDVRRIAGLKSMFFGGEGVFLATLTGPGKVWMQSLPFSRMAGRMLQAA 260
>gi|77747736|ref|NP_635555.2| hypothetical protein XCC0160 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|77761073|ref|YP_241276.2| hypothetical protein XC_0169 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 338
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 129/239 (53%), Gaps = 15/239 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + + G L G
Sbjct: 73 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHDGQGGSSGGLMG 132
Query: 80 KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
K ++ + V + G V AAP +L + L+ GG L+CQ D+FL
Sbjct: 133 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGRLICQKDSFL 192
Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
V + +++ + G EGF+ QKL G G F+ AGG+VV+++L GE I VD
Sbjct: 193 AGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERDLAPGERIDVDT 252
Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA +SV++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 253 GCVVAYHASVDMDVRRIAGLKSMFFGGEGVFLATLTGPGKVWMQSLPFSRMAGRMLQAA 311
>gi|34495639|ref|NP_899854.1| hypothetical protein CV_0184 [Chromobacterium violaceum ATCC 12472]
gi|34101494|gb|AAQ57863.1| conserved hypothetical protein [Chromobacterium violaceum ATCC
12472]
Length = 261
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 77/235 (32%), Positives = 124/235 (52%), Gaps = 10/235 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---------IPENEVG 72
VI +++ G + Q ++ L P E V G++ +M + M+ ++ + + +G
Sbjct: 5 VIDYRVFGDDMQYVEVELDPGEAAVGEAGALYYMQDGIAMDTVFGDGSGRDGGVLGSLLG 64
Query: 73 MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ + V N V AA + +I+P+ L GG L Q D+FL
Sbjct: 65 AGKRLLTGESVFTTVFANQSAERRKVAFAAATPGKIVPVHLLELGGTLYAQKDSFLAGAK 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V + +R+ + G EGF+ QKL G G F+ AGG++ + L GE + VD C+V
Sbjct: 125 GVSLGLAWQKRIGTGLFGGEGFIMQKLEGDGYVFLHAGGALTELQLRPGETVRVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
A SV+ I+Y G ++ A+FGG+ L A +TGPG V++QSLP RL+ RI A
Sbjct: 185 AYQPSVDFDIEYVGKLKSALFGGEGLFFARLTGPGRVWLQSLPLSRLADRIVAAA 239
>gi|21106235|gb|AAM35070.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 287
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G M +
Sbjct: 24 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 83
Query: 77 L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
L G+++ + V + G V AAP +L + L+ GG+L+CQ D+FL
Sbjct: 84 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 143
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C
Sbjct: 144 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 203
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 204 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 260
>gi|319785664|ref|YP_004145139.1| hypothetical protein Psesu_0045 [Pseudoxanthomonas suwonensis 11-1]
gi|317464176|gb|ADV25908.1| protein of unknown function DUF124 [Pseudoxanthomonas suwonensis
11-1]
Length = 341
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 76/236 (32%), Positives = 125/236 (52%), Gaps = 12/236 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I F+I G E Q +I L P E +A G++ F +++M+ ++ + G
Sbjct: 79 IEFRIHGQEMQFVEIELDPGESAIAEAGALMFKDSAVQMDTVFGDGSHQGQGGGFMDKLL 138
Query: 75 ----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ + G+++ + + + G V AAP +L + L GG ++CQ D+FL
Sbjct: 139 SAGKRVITGESLFTTMFTHGGSGKARVAFAAPYPGTVLAMKLDEHGGRIICQKDSFLAGA 198
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V++ +++ + G EGF+ QKL G G FI AGG VV++ L GE I VD C+
Sbjct: 199 RGVQIGVHFQRKILTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERIDVDTGCV 258
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
V + V++ ++ G I+ +FGG+ + A +TGPG V++QSLPF RL+ R+ A
Sbjct: 259 VGYHAGVSMDVRPVGGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRMLAAA 314
>gi|390992655|ref|ZP_10262879.1| putative uncharacterized domain protein [Xanthomonas axonopodis pv.
punicae str. LMG 859]
gi|372552611|emb|CCF69854.1| putative uncharacterized domain protein [Xanthomonas axonopodis pv.
punicae str. LMG 859]
Length = 335
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G M +
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131
Query: 77 L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
L G+++ + V + G V AAP +L + L+ GG+L+CQ D+FL
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308
>gi|77748515|ref|NP_640534.2| hypothetical protein XAC0178 [Xanthomonas axonopodis pv. citri str.
306]
Length = 335
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G M +
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131
Query: 77 L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
L G+++ + V + G V AAP +L + L+ GG+L+CQ D+FL
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308
>gi|384429987|ref|YP_005639348.1| hypothetical protein XCR_4374 [Xanthomonas campestris pv. raphani
756C]
gi|341939091|gb|AEL09230.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
756C]
Length = 338
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 129/239 (53%), Gaps = 15/239 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + + G L G
Sbjct: 73 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDSAVQMDTVFGDGSHDGQGGSSGGLMG 132
Query: 80 KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
K ++ + V + G V AAP +L + L+ GG L+CQ D+FL
Sbjct: 133 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGRLICQKDSFL 192
Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
V + +++ + G EGF+ QKL G G F+ AGG+VV+++L GE I VD
Sbjct: 193 AGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERDLAPGERIDVDT 252
Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA +SV++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 253 GCVVAYHASVDMDVRRIAGLKSMFFGGEGVFLATLTGPGKVWMQSLPFSRMAGRMLQAA 311
>gi|418521253|ref|ZP_13087298.1| hypothetical protein WS7_09543 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702802|gb|EKQ61302.1| hypothetical protein WS7_09543 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 335
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G M +
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131
Query: 77 L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
L G+++ + V + G V AAP +L + L+ GG+L+CQ D+FL
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308
>gi|188989575|ref|YP_001901585.1| hypothetical protein xccb100_0179 [Xanthomonas campestris pv.
campestris str. B100]
gi|167731335|emb|CAP49510.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris]
Length = 338
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 74/239 (30%), Positives = 129/239 (53%), Gaps = 15/239 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + + G L G
Sbjct: 73 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHDGQGGSSGGLMG 132
Query: 80 KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
K ++ + V + G V AAP +L + L+ GG L+CQ D+FL
Sbjct: 133 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGRLICQKDSFL 192
Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
V + +++ + G EGF+ QKL G G F+ AGG+VV+++L GE I VD
Sbjct: 193 AGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERDLAPGERIDVDT 252
Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA +SV++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 253 GCVVAYHASVDMDVRRIAGLKSMFFGGEGVFLATLTGPGKVWMQSLPFSRMAGRMLQAA 311
>gi|78045718|ref|YP_361893.1| hypothetical protein XCV0162 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|325927462|ref|ZP_08188706.1| hypothetical protein TIGR00266 [Xanthomonas perforans 91-118]
gi|78034148|emb|CAJ21793.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|325542163|gb|EGD13661.1| hypothetical protein TIGR00266 [Xanthomonas perforans 91-118]
Length = 335
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 132/248 (53%), Gaps = 13/248 (5%)
Query: 12 PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV 71
P S + I ++I+G + Q ++ L P E +A G++ + +++M+ ++ +
Sbjct: 61 PLTGMSGRADEIDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHA 120
Query: 72 G------MWQWL-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
G M + L G+++ + V + G V AAP +L + L+ GG+
Sbjct: 121 GQSGGGLMGKLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQ 180
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
L+CQ D+FL V + +++ + G EGF+ QKL G G F+ AGG+VV++ L
Sbjct: 181 LICQKDSFLAGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELG 240
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
GE I VD C+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R+
Sbjct: 241 PGERIDVDTGCVVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRM 300
Query: 239 SQRIARAV 246
+ R+ +A
Sbjct: 301 AGRMLQAA 308
>gi|327398767|ref|YP_004339636.1| hypothetical protein Hipma_0607 [Hippea maritima DSM 10411]
gi|327181396|gb|AEA33577.1| protein of unknown function DUF124 [Hippea maritima DSM 10411]
Length = 264
Score = 137 bits (345), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 123/230 (53%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLF-- 78
+ ++ILG + Q+ ++ L P E V+A G+M +M + E + + G+ LF
Sbjct: 6 VDYEILGDDMQIVEVELDPNETVIAEAGAMNYMEEGITFEAKLGDGSDADSGLLGKLFKA 65
Query: 79 GKTITS------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
GK + + N V +AP +I+P+D+ G+LLCQ DAFLC+
Sbjct: 66 GKRMLAQESLFLTHFTNRDSKKRKVAFSAPYPGKIIPVDMGKLQGDLLCQKDAFLCAAYG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+ +R + G EGF+ ++L+G G+ FI AGG+V++K L GE + VD CIVA
Sbjct: 126 TSIDIAFTKRFGTGLFGGEGFILERLSGDGMVFIHAGGTVIKKQLN-GEKLRVDTGCIVA 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
S++ I+ ++ FGG+ L A + G G V++QSLPF RL+ RI
Sbjct: 185 FDESIDYSIEMVKGLKSMFFGGEGLFLATLQGYGTVYLQSLPFSRLADRI 234
>gi|383316796|ref|YP_005377638.1| hypothetical protein [Frateuria aurantia DSM 6220]
gi|379043900|gb|AFC85956.1| TIGR00266 family protein [Frateuria aurantia DSM 6220]
Length = 332
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 77/236 (32%), Positives = 126/236 (53%), Gaps = 12/236 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY-----IPENEVGMWQWL 77
I F+I+G + Q ++ L P E +A G++ + ++ ME I+ +N M + L
Sbjct: 70 IDFRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVAMETIFGDGSTASQNSGFMGKLL 129
Query: 78 F-------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
G+++ + V + G V A+P ++ L+ GG L+CQ D+FL
Sbjct: 130 SAGKRVISGESMFTTVFTHTGSGKAKVAFASPFPGTVIAARLSEHGGRLICQKDSFLAGA 189
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V++ +++ + G EGF+ Q+L G G F AGG++V++ L GE I VD C+
Sbjct: 190 KGVQLGIFFQRKILTGLFGGEGFVMQQLDGDGWVFFHAGGTLVERELAAGESIDVDTGCV 249
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
VA + VN+ I+ G I+ +F G+ L A +TGPG V++QSLPF RL+ R+ A
Sbjct: 250 VAFHAGVNMDIRPVGGIKSMLFAGEGLFLATLTGPGKVWLQSLPFSRLAGRMLAAA 305
>gi|346723102|ref|YP_004849771.1| hypothetical protein XACM_0163 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346647849|gb|AEO40473.1| hypothetical protein XACM_0163 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 335
Score = 137 bits (345), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 75/248 (30%), Positives = 132/248 (53%), Gaps = 13/248 (5%)
Query: 12 PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV 71
P S + I ++I+G + Q ++ L P E +A G++ + +++M+ ++ +
Sbjct: 61 PLTGMSGRADEIDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHA 120
Query: 72 G------MWQWL-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
G M + L G+++ + V + G V AAP +L + L+ GG+
Sbjct: 121 GQSGGGLMGKLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQ 180
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
L+CQ D+FL V + +++ + G EGF+ QKL G G F+ AGG+VV++ L
Sbjct: 181 LICQKDSFLAGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELG 240
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
GE I VD C+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R+
Sbjct: 241 PGERIDVDTGCVVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRM 300
Query: 239 SQRIARAV 246
+ R+ +A
Sbjct: 301 AGRMLQAA 308
>gi|294627382|ref|ZP_06705967.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598337|gb|EFF42489.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 335
Score = 137 bits (344), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G M +
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131
Query: 77 L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
L G+++ + V + G V AAP +L + L+ GG+L+CQ D+FL
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308
>gi|255038010|ref|YP_003088631.1| hypothetical protein Dfer_4264 [Dyadobacter fermentans DSM 18053]
gi|254950766|gb|ACT95466.1| protein of unknown function DUF124 [Dyadobacter fermentans DSM
18053]
Length = 259
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 81/244 (33%), Positives = 130/244 (53%), Gaps = 14/244 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM--------- 73
I ++I+G + QV +I L P E V+A G+M FM ++ E +E
Sbjct: 6 IDYKIIGDDIQVVEIELDPNETVIAEAGAMLFMEDGIQFETKMGDGSEANQSIMGKIFQA 65
Query: 74 -WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
+ L G+++ N G V +AP ++PIDL+ ++G EL+ Q D FLC+
Sbjct: 66 GTRLLTGESLFMTHFTNRGVGKKKVAFSAPYPGTVMPIDLSKIYGNELIVQKDGFLCAAM 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
+ +QR + + G EGF+ QKL G G+AF+ AGG V++K L E + +D C+V
Sbjct: 126 GTSMKIHFNQRFGSGLFGGEGFILQKLKGDGMAFVHAGGVVMEKQLN-NETLRIDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
A S++ I+ +G ++ VFGG+ + A + G G +IQS+P +L QR+ +V PN
Sbjct: 185 AFEQSLSFDIQRSGGLKSMVFGGEGMFLATLRGTGRCWIQSMPISKLIQRL--SVGGPNS 242
Query: 252 RENP 255
R+
Sbjct: 243 RKES 246
>gi|83644424|ref|YP_432859.1| hypothetical protein HCH_01577 [Hahella chejuensis KCTC 2396]
gi|83632467|gb|ABC28434.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
Length = 250
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 131/230 (56%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
+ ++I+G + Q+ ++ L P E V+A G+M +M+G +E E + E G + LF
Sbjct: 6 VDYKIIGHDMQMVEVELDPGETVIAEAGAMNYMTGGIEFETKMGDGSDPEQGFFSKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G V AAP I+P+DLA GG ++CQ D+FL +
Sbjct: 66 GKRMMTGESLFMTHFTNSGHGKQQVSFAAPYPGSIIPVDLANQGGTIICQKDSFLAAAMG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
++S ++R+ + G EGF+ QKL G G+AFI A G+VV+K L+ E + +D C+VA
Sbjct: 126 TRISVEFNRRLGSGFFGGEGFVLQKLEGDGMAFIHACGTVVEKRLD-NETLRLDTGCLVA 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T S++ +I+ IR FGG+ + A + G G V+IQSLPF RL+ RI
Sbjct: 185 FTPSISYEIEMVKGIRSMFFGGEGMFLATLRGTGTVWIQSLPFSRLADRI 234
>gi|117920922|ref|YP_870114.1| hypothetical protein Shewana3_2479 [Shewanella sp. ANA-3]
gi|117613254|gb|ABK48708.1| Protein of unknown function DUF124 [Shewanella sp. ANA-3]
Length = 264
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMWQWLFG- 79
+ ++I+G Q+ ++ L P E V+A G+M ++ S E + E E G + L G
Sbjct: 8 VDYEIIGHSMQLVEVELDPHETVIAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
++I N G V AAP IL IDLA +GGEL+CQ D+FL +
Sbjct: 68 GKRVLTGESIFMTHFTNLGHQKRKVAFAAPYPGTILAIDLAQYGGELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+VS ++R+ G EGF+ Q L G G+AFI AGG++++K L+ GE + VD C+V
Sbjct: 128 TRVSMKFNRRLGTGFFGGEGFILQSLQGDGMAFIHAGGTLIKKELK-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|315498037|ref|YP_004086841.1| hypothetical protein Astex_1009 [Asticcacaulis excentricus CB 48]
gi|315416049|gb|ADU12690.1| protein of unknown function DUF124 [Asticcacaulis excentricus CB
48]
Length = 278
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 80/244 (32%), Positives = 128/244 (52%), Gaps = 12/244 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VGM 73
I F+I G E Q +I L P E +A G M + S+ M ++ + +G
Sbjct: 21 IDFEIKGEELQFVEIELDPGESAIAEAGGMVWKDSSVGMTTVFGDGSGQQKGFMGALLGA 80
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ + V + G V +AP I+P+ L+ GG L+CQ D+FL +
Sbjct: 81 GKRLISGESLFTTVFTHNGQGKARVAFSAPVPGSIIPLKLSDVGGRLICQKDSFLAAAKG 140
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + ++V + G EGF+ QKL G G F+ GG+V+++ L GE + +D C+ A
Sbjct: 141 VSLGIAFQRKVMTGLFGGEGFIMQKLEGDGWVFVQFGGAVIERTLAPGEQLHIDTGCVAA 200
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT--SPN 250
T +V+ + G ++ +FGG+ + A +TGPG V+IQSLPF RL+ R+ A + PN
Sbjct: 201 FTDTVDFDLVQAGGVKSMLFGGEGVFFAQLTGPGKVWIQSLPFSRLAGRVLAAASGAGPN 260
Query: 251 MREN 254
E
Sbjct: 261 RGEG 264
>gi|381173682|ref|ZP_09882759.1| putative uncharacterized domain protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380685874|emb|CCG39246.1| putative uncharacterized domain protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 335
Score = 136 bits (343), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 72/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G M +
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131
Query: 77 L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
L G+++ + V + G + AAP +L + L+ GG+L+CQ D+FL
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKIAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308
>gi|113970624|ref|YP_734417.1| hypothetical protein Shewmr4_2289 [Shewanella sp. MR-4]
gi|113885308|gb|ABI39360.1| protein of unknown function DUF124 [Shewanella sp. MR-4]
Length = 264
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMWQWLFG- 79
+ ++I+G Q+ ++ L P E V+A G+M ++ S E + E E G + L G
Sbjct: 8 VDYEIIGHSMQLVEVELDPHETVIAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
++I N G V AAP IL +DLA +GGEL+CQ D+FL +
Sbjct: 68 GKRVLTGESIFMTHFTNLGHQKRKVAFAAPYPGTILALDLAQYGGELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+VS ++R+ G EGF+ Q L G G+AFI AGG++++K L+ GE + VD C+V
Sbjct: 128 TRVSMKFNRRLGTGFFGGEGFILQSLQGDGMAFIHAGGTLIKKELK-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|418515938|ref|ZP_13082115.1| hypothetical protein MOU_03879 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410707272|gb|EKQ65725.1| hypothetical protein MOU_03879 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 335
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 128/237 (54%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G M +
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131
Query: 77 L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
L G+++ + V + G V AAP L + L+ GG+L+CQ D+FL
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTALALRLSEHGGQLICQKDSFLAG 191
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308
>gi|384044528|ref|YP_005492545.1| hypothetical protein BMWSH_0352 [Bacillus megaterium WSH-002]
gi|345442219|gb|AEN87236.1| hypothetical protein BMWSH_0352 [Bacillus megaterium WSH-002]
Length = 266
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 11/238 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +++ G + Q +I L P E V+A G M M +EME I+ +
Sbjct: 7 IDYKLYGDDMQFVEIELDPSESVIAEAGGMMMMEDDIEMETIFGDGSSSNSSGLFNKLKG 66
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ V N G V A+P +I+P+DL +++CQ +FLC+
Sbjct: 67 AGKRVLTGESLFMTVYTNNGYHKRKVSFASPFPGKIIPVDLEQLDHKIICQKSSFLCAAK 126
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ QKL G GLAF+ AGG++ ++ L+ GE + VD C+V
Sbjct: 127 GVSVGIDFQKKLGTGFFGGEGFIMQKLEGNGLAFLHAGGTIHKRQLQPGERLRVDTGCLV 186
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
A+T VN I+Y G I+ A+FGG+ + A ++GPG V+IQSLPF RL+ R+ + SP
Sbjct: 187 AMTRDVNYNIEYVGKIKTALFGGEGMFFATLSGPGTVWIQSLPFSRLADRVLSSAVSP 244
>gi|295706978|ref|YP_003600053.1| hypothetical protein BMD_4880 [Bacillus megaterium DSM 319]
gi|294804637|gb|ADF41703.1| protein of unknown function (DUF124) [Bacillus megaterium DSM 319]
Length = 265
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 11/238 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +++ G + Q +I L P E V+A G M M +EME I+ +
Sbjct: 6 IDYKLYGDDMQFVEIELDPSESVIAEAGGMMMMEDDIEMETIFGDGSSSTSSGLFNKLKG 65
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ V N G V A+P +I+P+DL +++CQ +FLC+
Sbjct: 66 AGKRVLTGESLFMTVYTNNGYHKRKVSFASPFPGKIIPVDLEQLDHKIICQKSSFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ QKL G GLAF+ AGG++ ++ L+ GE + VD C+V
Sbjct: 126 GVSVGIDFQKKLGTGFFGGEGFIMQKLEGNGLAFLHAGGTIHKRQLQPGERLRVDTGCLV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
A+T VN I+Y G I+ A+FGG+ + A ++GPG V+IQSLPF RL+ R+ + SP
Sbjct: 186 AMTRDVNYNIEYVGKIKTALFGGEGMFFATLSGPGTVWIQSLPFSRLADRVLSSAVSP 243
>gi|294501630|ref|YP_003565330.1| hypothetical protein BMQ_4894 [Bacillus megaterium QM B1551]
gi|294351567|gb|ADE71896.1| protein of unknown function (DUF124) [Bacillus megaterium QM B1551]
Length = 265
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 11/238 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +++ G + Q +I L P E V+A G M M +EME I+ +
Sbjct: 6 IDYKLYGDDMQFVEIELDPSESVIAEAGGMMMMEDDIEMETIFGDGSSSNSSGLFNKLKG 65
Query: 75 ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
+ L G+++ V N G V A+P +I+P+DL +++CQ +FLC+
Sbjct: 66 AGKRVLTGESLFMTVYTNNGYHKRKVSFASPFPGKIIPVDLEQLDHKIICQKSSFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V +++ G EGF+ QKL G GLAF+ AGG++ ++ L+ GE + VD C+V
Sbjct: 126 GVSVGIDFQKKLGTGFFGGEGFIMQKLEGNGLAFLHAGGTIHKRQLQPGERLRVDTGCLV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
A+T VN I+Y G I+ A+FGG+ + A ++GPG V+IQSLPF RL+ R+ + SP
Sbjct: 186 AMTRDVNYNIEYVGKIKTALFGGEGMFFATLSGPGTVWIQSLPFSRLADRVLSSAVSP 243
>gi|149177722|ref|ZP_01856322.1| hypothetical protein PM8797T_08654 [Planctomyces maris DSM 8797]
gi|148843372|gb|EDL57735.1| hypothetical protein PM8797T_08654 [Planctomyces maris DSM 8797]
Length = 262
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/230 (34%), Positives = 128/230 (55%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLF-- 78
+ ++I G + Q+ +I+L P E VVA GSM +M + E +++ G + +F
Sbjct: 6 VDYEIFGSDLQIVEIILDPGESVVAEAGSMNYMEEGIRFEARMGDGSQLNEGFFGKIFKA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G + V AAP +I+ ID+ GG + CQ D+FLC+
Sbjct: 66 GKRMLSGESLFMTHFTNEGSREQRVAFAAPYPGKIIAIDMDKVGGAITCQKDSFLCAALG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+V+ ++R+ + G EGF+ Q+L G G+AF+ AGG+V++K L GE + VD CIVA
Sbjct: 126 TEVTMAFNKRLGSGFFGGEGFILQQLRGDGMAFVHAGGTVIKKKLN-GETLRVDTGCIVA 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ V GG+ L A + G G V +QSLPF RL+ R+
Sbjct: 185 FTGDIDYSIEKAGNLKSMVLGGEGLFLATLRGHGTVLLQSLPFSRLADRV 234
>gi|325916159|ref|ZP_08178443.1| hypothetical protein TIGR00266 [Xanthomonas vesicatoria ATCC 35937]
gi|325537579|gb|EGD09291.1| hypothetical protein TIGR00266 [Xanthomonas vesicatoria ATCC 35937]
Length = 339
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/239 (30%), Positives = 127/239 (53%), Gaps = 15/239 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + + G L G
Sbjct: 74 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDSAVQMDTVFGDGSHDGQGGSSGGLMG 133
Query: 80 KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
K ++ + V + G V AAP +L + L+ GG L+CQ D+FL
Sbjct: 134 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSAHGGRLICQKDSFL 193
Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD
Sbjct: 194 AGARGVSLGIAFQKKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELAPGERIDVDT 253
Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 254 GCVVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 312
>gi|389792837|ref|ZP_10196019.1| hypothetical protein UU9_01604 [Rhodanobacter fulvus Jip2]
gi|388435701|gb|EIL92598.1| hypothetical protein UU9_01604 [Rhodanobacter fulvus Jip2]
Length = 338
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/237 (30%), Positives = 124/237 (52%), Gaps = 13/237 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
I ++I+G + Q +I L P E +A G++ + +++M+ ++ G
Sbjct: 75 IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKESAVQMDTVFGDGASSGQSGGGLMDKL 134
Query: 79 ---------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
G+++ + V + G V AAP ++ + L+ GG L+CQ DAFL
Sbjct: 135 LSAGKRVVTGESLFTTVFTHRGQGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAG 194
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V++ +++ + G EGF+ QKL G G F+ AGG+V+Q+ L GE + VD C
Sbjct: 195 ARGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVMQRELAAGERLDVDTGC 254
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+VA V++ + G I+ +FGG+ + A +TGPG V++QSLPF R++ R+ A
Sbjct: 255 VVAFHDGVSMDVHPVGGIKSMLFGGEGMFLATLTGPGTVWLQSLPFSRMAGRMLAAA 311
>gi|58584140|ref|YP_203156.1| hypothetical protein XOO4517 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188574531|ref|YP_001911460.1| hypothetical protein PXO_03595 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|58428734|gb|AAW77771.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|188518983|gb|ACD56928.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
PXO99A]
Length = 336
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 71/238 (29%), Positives = 129/238 (54%), Gaps = 14/238 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGGLMGK 131
Query: 75 ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
+ + G+++ + + + G V AAP +L + L+ GG+L+CQ D+FL
Sbjct: 132 LLSAGKRVVTGESMFTTIFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLA 191
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD
Sbjct: 192 GARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGECIDVDTG 251
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA +SV++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 CVVAYHASVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 309
>gi|406981828|gb|EKE03225.1| hypothetical protein ACD_20C00234G0047 [uncultured bacterium]
Length = 239
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/237 (32%), Positives = 137/237 (57%), Gaps = 12/237 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I+G + Q +I L ++ V+A PG++ +++ +++M+ + E ++ + +
Sbjct: 7 YEIIGDDLQAVKINLSKEKAVLADPGNLVYITNNIKMD---VTLGEFNYFEKVLSAFKRA 63
Query: 85 VVLRN------PGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
+V N SD G + ++P +I+P+++ G L+C+ ++LCS +++K+
Sbjct: 64 IVGENFFLTRFSSDSDIGEIVFSSPVPGKIIPVEIPPEG--LICEKQSYLCSDSNIKIEI 121
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+R+ + G EGF+ QKLTG+G AFI A G++++K L+ E I +D C+VA SV
Sbjct: 122 AFARRIGAGLFGREGFILQKLTGEGTAFIQASGTIIKKTLQPEESIRLDPGCVVAFEPSV 181
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
+ IKY G IR A+FGG + + GPG V+IQ+LP ++LSQ+I P + N
Sbjct: 182 DYDIKYVGDIRTALFGGVGIFFVTLIGPGDVYIQTLPLNKLSQKILSKSKFPKAKSN 238
>gi|126174826|ref|YP_001050975.1| hypothetical protein Sbal_2613 [Shewanella baltica OS155]
gi|386341579|ref|YP_006037945.1| hypothetical protein [Shewanella baltica OS117]
gi|125998031|gb|ABN62106.1| protein of unknown function DUF124 [Shewanella baltica OS155]
gi|334863980|gb|AEH14451.1| protein of unknown function DUF124 [Shewanella baltica OS117]
Length = 264
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 125/230 (54%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCF----------MSGSMEMENIYIPENEVG 72
+ ++I+G Q+ ++ L P E VVA G+M + M E E+ + + V
Sbjct: 8 VDYEIIGHSMQLVELELDPNETVVAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMVA 67
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G++I N G V AAP IL IDLA +GGEL+CQ D+FL +
Sbjct: 68 GKRALTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+V+ ++R+ G EGF+ Q L G G+ FI AGG++V+K L GE + VD C+V
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELR-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|384417311|ref|YP_005626671.1| hypothetical protein XOC_0265 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353460225|gb|AEQ94504.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 336
Score = 135 bits (339), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 71/238 (29%), Positives = 129/238 (54%), Gaps = 14/238 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGGLMGK 131
Query: 75 ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
+ + G+++ + + + G V AAP +L + L+ GG+L+CQ D+FL
Sbjct: 132 LLSAGKRVVSGESMFTTIFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLA 191
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD
Sbjct: 192 GARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTG 251
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA +SV++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 CVVAYHASVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 309
>gi|374702753|ref|ZP_09709623.1| hypothetical protein PseS9_05019 [Pseudomonas sp. S9]
Length = 248
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 78/232 (33%), Positives = 124/232 (53%), Gaps = 9/232 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ--WLFGK 80
+ ++ILG AQ +++L P E V+A G + +M+ + E + G+ W GK
Sbjct: 6 LEYEILGNSAQSVEVILDPGETVIAEAGVLNYMTEDVRFETRMGDGSSSGLLGKLWSAGK 65
Query: 81 TITS------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
+ + N G V AAP ++P+ LA GG L+CQ D+FLC+
Sbjct: 66 RMVTGESMFMTHFSNTGDKQARVAFAAPYPGSVVPVSLAQVGGSLVCQRDSFLCAARGTS 125
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ + ++R+ G EGF+ QKL G G F+ AGG+V++K L E + +D C+V +
Sbjct: 126 IGISFNKRLGAGFFGGEGFILQKLEGDGPVFLHAGGTVIRKELN-NETLRLDTGCLVGFS 184
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
S ++ IK G ++ +FGG+ + A + G G V+IQSLPF RL++RI A
Sbjct: 185 SGIDYDIKLAGGLKSMLFGGEGIFLATLKGTGTVWIQSLPFSRLAERIYEAT 236
>gi|153001168|ref|YP_001366849.1| hypothetical protein Shew185_2651 [Shewanella baltica OS185]
gi|373949985|ref|ZP_09609946.1| protein of unknown function DUF124 [Shewanella baltica OS183]
gi|386324182|ref|YP_006020299.1| hypothetical protein [Shewanella baltica BA175]
gi|151365786|gb|ABS08786.1| protein of unknown function DUF124 [Shewanella baltica OS185]
gi|333818327|gb|AEG10993.1| protein of unknown function DUF124 [Shewanella baltica BA175]
gi|373886585|gb|EHQ15477.1| protein of unknown function DUF124 [Shewanella baltica OS183]
Length = 264
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 126/230 (54%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMW------ 74
+ ++I+G Q+ ++ L P E VVA G+M ++ S E + E E G +
Sbjct: 8 VDYEIIGHSMQLVEVELDPNETVVAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLVGA 67
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G++I N G V AAP IL IDLA +GGEL+CQ D+FL +
Sbjct: 68 GKRALTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+V+ ++R+ G EGF+ Q L G G+ FI AGG++V+K L GE + VD C+V
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELR-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|152988055|ref|YP_001346826.1| hypothetical protein PSPA7_1442 [Pseudomonas aeruginosa PA7]
gi|452878803|ref|ZP_21955976.1| hypothetical protein G039_19623 [Pseudomonas aeruginosa VRFPA01]
gi|150963213|gb|ABR85238.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7]
gi|452184571|gb|EME11589.1| hypothetical protein G039_19623 [Pseudomonas aeruginosa VRFPA01]
Length = 248
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
+ ++ILG Q +I L P E V+A G+M +M+G + + + G +W
Sbjct: 6 LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V AAP +LP+DL GG L CQ D+FLC+
Sbjct: 65 KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVLPVDLDDVGGRLFCQKDSFLCAAYGT 124
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+V +R+ G EGF+ QKL G GL F+ AGG+++++ L E + VD C+VA
Sbjct: 125 RVGIAFAKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLH-DETLRVDTGCLVAF 183
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T ++ ++ G ++ +FGG+ L+ + G G V++QSLPF RL+ RI A
Sbjct: 184 TDGIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236
>gi|160875838|ref|YP_001555154.1| hypothetical protein Sbal195_2726 [Shewanella baltica OS195]
gi|378709040|ref|YP_005273934.1| hypothetical protein [Shewanella baltica OS678]
gi|418024588|ref|ZP_12663570.1| protein of unknown function DUF124 [Shewanella baltica OS625]
gi|160861360|gb|ABX49894.1| protein of unknown function DUF124 [Shewanella baltica OS195]
gi|315268029|gb|ADT94882.1| protein of unknown function DUF124 [Shewanella baltica OS678]
gi|353535874|gb|EHC05434.1| protein of unknown function DUF124 [Shewanella baltica OS625]
Length = 264
Score = 134 bits (337), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 126/230 (54%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMW------ 74
+ ++I+G Q+ ++ L P E VVA G+M ++ S E + E E G +
Sbjct: 8 VDYEIIGHSMQLVEVELDPNETVVAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G++I N G V AAP IL IDLA +GGEL+CQ D+FL +
Sbjct: 68 GKRALTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+V+ ++R+ G EGF+ Q L G G+ FI AGG++V+K L GE + VD C+V
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELR-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|289665045|ref|ZP_06486626.1| hypothetical protein XcampvN_18745 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
gi|289669829|ref|ZP_06490904.1| hypothetical protein XcampmN_15372 [Xanthomonas campestris pv.
musacearum NCPPB 4381]
Length = 336
Score = 134 bits (336), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 71/238 (29%), Positives = 128/238 (53%), Gaps = 14/238 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I+G + Q ++ L P E +A G++ + +++M+ ++ + G
Sbjct: 72 IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGGLMGK 131
Query: 75 ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
+ + G+++ + V + G V AAP +L + L+ GG+L+CQ D+FL
Sbjct: 132 LLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLA 191
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
V + +++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD
Sbjct: 192 GARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTG 251
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
C+VA + V++ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 252 CVVAYHAGVDMDVRRVAGLKSMFFGGEGMFLATLTGPGKVWLQSLPFSRMAGRMLQAA 309
>gi|355639904|ref|ZP_09051449.1| hypothetical protein HMPREF1030_00535 [Pseudomonas sp. 2_1_26]
gi|354831610|gb|EHF15620.1| hypothetical protein HMPREF1030_00535 [Pseudomonas sp. 2_1_26]
Length = 248
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 79/233 (33%), Positives = 125/233 (53%), Gaps = 11/233 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
+ ++ILG Q +I L P E V+A G+M +M+G + + + G +W
Sbjct: 6 LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V AAP ++ +DL GG L CQ D+FLC+
Sbjct: 65 KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+V +R+ G EGF+ QKL G GL F+ AGG+++++ L GE + VD C+VA
Sbjct: 125 RVGIAFAKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T S++ ++ G ++ +FGG+ L+ + G G V++QSLPF RL+ RI A
Sbjct: 184 TDSIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236
>gi|24373530|ref|NP_717573.1| protein of unknown function DUF124 [Shewanella oneidensis MR-1]
gi|24347839|gb|AAN55017.1| protein of unknown function DUF124 [Shewanella oneidensis MR-1]
Length = 264
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMW------ 74
+ ++I+G Q+ ++ L P E V+A G+M ++ S E + E + G +
Sbjct: 8 VDYEIIGHSMQLVEVELDPNETVIAEAGAMNYLEQDISFEAKMGDGSEPDSGFFGKLMGA 67
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G++I N G V AAP IL IDLA +GGEL+CQ D+FL +
Sbjct: 68 GKRALTGESIFMTHFTNLGHQKCKVAFAAPYPGTILAIDLAQYGGELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+VS ++R+ G EGF+ Q L G G+AFI AGG++++K L+ GE + VD C+V
Sbjct: 128 TRVSMKFNRRLGTGFFGGEGFILQSLQGDGMAFIHAGGTLIKKELK-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T V+ I+ G ++ FGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGVDYDIERAGSLKSMFFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|120599156|ref|YP_963730.1| hypothetical protein Sputw3181_2352 [Shewanella sp. W3-18-1]
gi|146292773|ref|YP_001183197.1| hypothetical protein Sputcn32_1673 [Shewanella putrefaciens CN-32]
gi|386313452|ref|YP_006009617.1| hypothetical protein [Shewanella putrefaciens 200]
gi|120559249|gb|ABM25176.1| protein of unknown function DUF124 [Shewanella sp. W3-18-1]
gi|145564463|gb|ABP75398.1| protein of unknown function DUF124 [Shewanella putrefaciens CN-32]
gi|319426077|gb|ADV54151.1| protein of unknown function DUF124 [Shewanella putrefaciens 200]
Length = 264
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 82/230 (35%), Positives = 125/230 (54%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ ++I+G Q+ ++ L P E V+A G+M ++ + E E E G + L G
Sbjct: 8 VDYEIIGHSMQLVEVELDPNETVIAEAGAMNYLEQDIRFEAKMGDGSEPESGFFGKLMGA 67
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
++I N G V AAP IL IDLA +GGEL+CQ D+FL +
Sbjct: 68 GKRVLTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+V+ ++R+ G EGF+ Q L G G+ FI AGG++V+K L+ GE + VD C+V
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELK-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|152993590|ref|YP_001359311.1| hypothetical protein SUN_2012 [Sulfurovum sp. NBC37-1]
gi|151425451|dbj|BAF72954.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
Length = 344
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 88/276 (31%), Positives = 140/276 (50%), Gaps = 13/276 (4%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
M A F+ P + + I + I G + Q +I L P E VV+ G+M + + +
Sbjct: 58 MVASSFTPPSRARGSRLNSADEIDYTIYGEDIQYVEIELDPGESVVSEAGAMMYKNSHIV 117
Query: 61 MENIY-----IPENEVGMWQWLFG--------KTITSVVLRNPGPSDGFVGIAAPSLARI 107
M+ ++ E+ G++ L G +++ + V + G V AP RI
Sbjct: 118 MDTVFGDGSGSKEDTGGLFDKLVGAGKRLVTGESLFTTVFTHRGNGKATVAFGAPYPGRI 177
Query: 108 LPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFIL 167
+P+DL GG L+CQ D FLC+ V + ++V + G EGF+ QKL G GLAFI
Sbjct: 178 IPVDLDEMGGTLICQKDCFLCAAKGVSIGIYFQKKVLTGLFGGEGFIMQKLEGDGLAFIH 237
Query: 168 AGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGI 227
AGG + + L+ GE + +D C+VA+ V+ ++ G I+ +FGG+ A + GPG
Sbjct: 238 AGGMLKEIELKDGEELHLDTGCLVAMQPHVHFDLEQAGGIKTGLFGGEGFFFAKLRGPGK 297
Query: 228 VFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAI 263
++IQSLPF RL+ R+ A ++ + V AI
Sbjct: 298 IWIQSLPFSRLAGRMLSAAPQGGGKDTGEGSVLGAI 333
>gi|296447765|ref|ZP_06889680.1| protein of unknown function DUF124 [Methylosinus trichosporium
OB3b]
gi|296254742|gb|EFH01854.1| protein of unknown function DUF124 [Methylosinus trichosporium
OB3b]
Length = 304
Score = 133 bits (335), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 88/247 (35%), Positives = 132/247 (53%), Gaps = 11/247 (4%)
Query: 12 PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PEN 69
P +SP +A + F+I G E Q +I L P VA G+M F ++ ME I+ +
Sbjct: 26 PGANRSPAEA-LRFEIKGHEMQFVEIELDPGRACVAEAGAMMFKEAAVGMETIFGDGAQE 84
Query: 70 EVGM----WQW----LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLC 121
+ G+ W+ L G T+ N G S AAP +ILP+ L + GGELLC
Sbjct: 85 DSGLLGKLWRGAKRALSGSTLFMTRFTNNGYSPARAAFAAPYPGKILPLRLDLLGGELLC 144
Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
Q D FL V + +++ + G EGF+ Q+L+G G+ F+ AGG+VV++ L GE
Sbjct: 145 QRDTFLAGAPGVAIDIAFQKKIMTGLFGGEGFIMQRLSGNGIVFVHAGGAVVERELAPGE 204
Query: 182 VITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQR 241
+ VD C+VA T++V+ + G ++ FGG+ A + GPG V++QS+PF RL
Sbjct: 205 ELHVDSGCLVAQTATVSFDVVPVGGVKSMFFGGEGFFFARLVGPGHVWLQSMPFSRLVGH 264
Query: 242 IARAVTS 248
IA + S
Sbjct: 265 IASRMPS 271
>gi|217972911|ref|YP_002357662.1| hypothetical protein Sbal223_1734 [Shewanella baltica OS223]
gi|217498046|gb|ACK46239.1| protein of unknown function DUF124 [Shewanella baltica OS223]
Length = 264
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 125/230 (54%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMW------ 74
+ ++I+G Q+ ++ L P E VVA G+M ++ S E + E E G +
Sbjct: 8 VDYEIIGHSMQLVEVELDPNETVVAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G++I N G V AAP IL IDLA +GGEL+CQ D FL +
Sbjct: 68 GKRALTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDCFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+V+ ++R+ G EGF+ Q L G G+ FI AGG++V+K L GE + VD C+V
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELR-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|325109149|ref|YP_004270217.1| hypothetical protein Plabr_2595 [Planctomyces brasiliensis DSM
5305]
gi|324969417|gb|ADY60195.1| protein of unknown function DUF124 [Planctomyces brasiliensis DSM
5305]
Length = 261
Score = 133 bits (334), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 81/231 (35%), Positives = 120/231 (51%), Gaps = 13/231 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFM-----------SGSMEMENIYIPENEV 71
I ++I G + Q+ +I L P E VVA G M +M GS + +
Sbjct: 6 IDYEIFGNDMQIVEIELDPYETVVAEAGMMNYMEDGIRYEAKMGDGSQASQGFFSKMMSA 65
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G + L G++I N P + AAP RI+ +DL GG ++CQ DAFLC+
Sbjct: 66 G-ARALTGESIFMTHFTNESPQKRRIAFAAPYPGRIVALDLEKIGGTVICQKDAFLCAAK 124
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
++S +R+ + G EGF+ QKL G G AF+ AGG+V+++ L E + VD C+V
Sbjct: 125 GTRLSIAFQKRLGSGFFGGEGFILQKLEGDGKAFVHAGGTVIKRKLN-DETLRVDTGCLV 183
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A T ++ I G ++ FGG+ L A + G G V +QSLPF RL+ R+
Sbjct: 184 AFTPGIDYDIARAGSLKSMFFGGEGLFLATLRGTGTVLLQSLPFSRLADRV 234
>gi|338210593|ref|YP_004654642.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336304408|gb|AEI47510.1| protein of unknown function DUF124 [Runella slithyformis DSM 19594]
Length = 259
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 77/232 (33%), Positives = 125/232 (53%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I+G + Q+ +I L P E V+A GSM FM ++ E +
Sbjct: 6 IDYKIIGEDIQIVEIELDPNETVIAEAGSMLFMEDGIQFETKMGDGSNATQSLMGKIFQA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
+ L G+++ N G V AAP ++PIDL+ ++G L+ Q D FLC+
Sbjct: 66 GSRLLTGESLFMTHFTNRGYGKRKVAFAAPYPGTVMPIDLSRIYGNSLIVQKDGFLCAAM 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
K++ +QR+ + + G EGF+ QKL G GLAF+ AGG V+++ L E + VD C+V
Sbjct: 126 GTKLNIHFNQRLGSGLFGGEGFILQKLQGDGLAFVHAGGVVIERQLN-NETLRVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
A SVN ++ +G ++ +FGG+ + A + G G +IQS+P +L R++
Sbjct: 185 AFEPSVNFSVQRSGSLKSMIFGGEGIFLATLQGTGRCWIQSMPISKLVDRLS 236
>gi|49082264|gb|AAT50532.1| PA3696, partial [synthetic construct]
Length = 249
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
+ ++ILGG Q +I L P E V+A G+M +M+G + + + G +W
Sbjct: 6 LDYRILGGSMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V AAP ++ +DL GG L CQ D+FLC+
Sbjct: 65 KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+V +R+ G EGF+ QKL G GL F+ AGG+++++ L GE + VD C+VA
Sbjct: 125 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T ++ ++ G ++ +FG + L+ + G G V++QSLPF RL+ RI A
Sbjct: 184 TDGIDYDVQLAGGLKSMLFGSEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236
>gi|171914117|ref|ZP_02929587.1| hypothetical protein VspiD_23100 [Verrucomicrobium spinosum DSM
4136]
Length = 340
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 85/250 (34%), Positives = 137/250 (54%), Gaps = 14/250 (5%)
Query: 5 FFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-- 62
F P P + + I ++I+G + Q+ +I L P E V+A G M +M ++ E
Sbjct: 65 FGCPPPIPQGFNQQRSHEIEYEIIGDDLQIVEIELDPGETVIAEAGGMNYMEDDIQFETR 124
Query: 63 --NIYIPE-------NEVGMWQWLFGKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDL 112
+ P+ +G + L G+++ + G S V AAP +I+P+ L
Sbjct: 125 MGDGSQPDGGFMGALKNIGK-RMLTGESLFMTHFTHRGSSGKRRVAFAAPYPGKIIPMRL 183
Query: 113 AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSV 172
+ GGE++C+ DAFLC+ +V ++++ G EGF+ Q+L G GL F+ AGG++
Sbjct: 184 SELGGEIICEKDAFLCAALGTEVGIAFNRKLGTGFFGGEGFILQRLRGDGLVFVHAGGTI 243
Query: 173 VQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
V+K L G + VD C+VA+TS +N I+ G ++ VFGG+ L A ++G G V++QS
Sbjct: 244 VKKELR-GGTLRVDTGCLVAMTSGINYDIQRAGNLKSMVFGGEGLFLATLSGHGTVWLQS 302
Query: 233 LPFHRLSQRI 242
LPF RL+ RI
Sbjct: 303 LPFSRLADRI 312
>gi|330814446|ref|XP_003291402.1| hypothetical protein DICPUDRAFT_155999 [Dictyostelium purpureum]
gi|325078427|gb|EGC32079.1| hypothetical protein DICPUDRAFT_155999 [Dictyostelium purpureum]
Length = 346
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 84/243 (34%), Positives = 130/243 (53%), Gaps = 11/243 (4%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN--- 63
ST FQ + ++I G E+QV Q+ L P + A G++ M +EM+
Sbjct: 80 STLFQNSQSLKGKATSFDWEIFGLESQVIQMNLNPGSSITAETGALLEMESEIEMDTSAR 139
Query: 64 --IYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELL 120
I + Q LF +T V N ++ + ++P +++IL I L+ GGEL+
Sbjct: 140 GGILSSFKRMFTGQGLF---LTKFV--NKSETEKMRITFSSPYISKILAIKLSDVGGELI 194
Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
CQ +AFLC + V++ L + G +GF+ QKL G G+ F+ AGGSV+ + L G
Sbjct: 195 CQKNAFLCGEDTVEIEPRLTKNFSTGFFGGQGFILQKLRGNGMVFVHAGGSVIYRVLSPG 254
Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
E I V IVA S+V+ QI++ ++ +F G+ L A +TGPG+V +QS+PF RL +
Sbjct: 255 EKIRVTTGSIVAFESTVDFQIEFVKGVKNILFSGEGLTFAAITGPGMVVLQSMPFERLVR 314
Query: 241 RIA 243
IA
Sbjct: 315 AIA 317
>gi|416860159|ref|ZP_11914168.1| hypothetical protein PA13_19985 [Pseudomonas aeruginosa 138244]
gi|334837751|gb|EGM16500.1| hypothetical protein PA13_19985 [Pseudomonas aeruginosa 138244]
gi|453047297|gb|EME95011.1| hypothetical protein H123_04106 [Pseudomonas aeruginosa PA21_ST175]
Length = 248
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
+ ++ILG Q +I L P E V+A G+M +M+G + + + G +W
Sbjct: 6 LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V AAP ++ +DL GG L CQ D+FLC+
Sbjct: 65 KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+V +R+ G EGF+ QKL G GL F+ AGG+++++ L GE + VD C+VA
Sbjct: 125 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T ++ ++ G ++ +FGG+ L+ + G G V++QSLPF RL+ RI A
Sbjct: 184 TDDIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236
>gi|424792016|ref|ZP_18218294.1| hypothetical protein XTG29_01349 [Xanthomonas translucens pv.
graminis ART-Xtg29]
gi|422797287|gb|EKU25645.1| hypothetical protein XTG29_01349 [Xanthomonas translucens pv.
graminis ART-Xtg29]
Length = 252
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/220 (32%), Positives = 117/220 (53%), Gaps = 10/220 (4%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW----------QWLFGKTI 82
Q +I L P E VA G++ F +++M+ ++ + G + + G+++
Sbjct: 2 QFVEIELDPGESAVAEAGALMFKDAAVQMDTVFGDGSGQGGGLMGKLFSAGKRLVTGESL 61
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ V + G V AAP +L + L GG L+CQ D+FL V + ++
Sbjct: 62 FTTVFTHQGQGKATVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGARGVSLGIHFQRK 121
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + G EGF+ QKL G G F+ AGG VV++ L GE + VD C+VA S+V++ ++
Sbjct: 122 IMTGLFGGEGFIMQKLEGDGWVFVHAGGCVVERELAAGERLDVDTGCVVAFHSTVDMDVR 181
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I+ FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 182 AVSGIKSMFFGGEGMFLATLTGPGKVWLQSLPFSRLAGRM 221
>gi|297738535|emb|CBI27780.3| unnamed protein product [Vitis vinifera]
Length = 142
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 59/85 (69%), Positives = 70/85 (82%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
V P++ILGGEAQ+ QIMLKPQEK++A+PGS C+MSGS +MEN+YI +NEVGMWQW FGK
Sbjct: 39 VTPYRILGGEAQMVQIMLKPQEKIIAKPGSTCYMSGSTQMENVYISKNEVGMWQWAFGKA 98
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLAR 106
SVVL N G DGFVG APSLA+
Sbjct: 99 AASVVLLNQGSGDGFVGTVAPSLAK 123
>gi|15598892|ref|NP_252386.1| hypothetical protein PA3696 [Pseudomonas aeruginosa PAO1]
gi|107103210|ref|ZP_01367128.1| hypothetical protein PaerPA_01004279 [Pseudomonas aeruginosa PACS2]
gi|218890027|ref|YP_002438891.1| hypothetical protein PLES_12881 [Pseudomonas aeruginosa LESB58]
gi|254236606|ref|ZP_04929929.1| conserved hypothetical protein [Pseudomonas aeruginosa C3719]
gi|254242391|ref|ZP_04935713.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
gi|386057317|ref|YP_005973839.1| hypothetical protein PAM18_1250 [Pseudomonas aeruginosa M18]
gi|392982580|ref|YP_006481167.1| hypothetical protein PADK2_05860 [Pseudomonas aeruginosa DK2]
gi|418586069|ref|ZP_13150115.1| hypothetical protein O1O_15358 [Pseudomonas aeruginosa MPAO1/P1]
gi|418589503|ref|ZP_13153425.1| hypothetical protein O1Q_02893 [Pseudomonas aeruginosa MPAO1/P2]
gi|419754713|ref|ZP_14281071.1| hypothetical protein CF510_16988 [Pseudomonas aeruginosa
PADK2_CF510]
gi|420138105|ref|ZP_14646046.1| hypothetical protein PACIG1_1544 [Pseudomonas aeruginosa CIG1]
gi|421152495|ref|ZP_15612075.1| hypothetical protein PABE171_1421 [Pseudomonas aeruginosa ATCC
14886]
gi|421158510|ref|ZP_15617759.1| hypothetical protein PABE173_1370 [Pseudomonas aeruginosa ATCC
25324]
gi|421166068|ref|ZP_15624338.1| hypothetical protein PABE177_1160 [Pseudomonas aeruginosa ATCC
700888]
gi|421179140|ref|ZP_15636736.1| hypothetical protein PAE2_1185 [Pseudomonas aeruginosa E2]
gi|421518239|ref|ZP_15964913.1| hypothetical protein A161_18210 [Pseudomonas aeruginosa PAO579]
gi|424939445|ref|ZP_18355208.1| conserved hypothetical protein [Pseudomonas aeruginosa NCMG1179]
gi|451987723|ref|ZP_21935875.1| DUF124 domain-containing protein [Pseudomonas aeruginosa 18A]
gi|9949861|gb|AAG07084.1|AE004789_4 conserved hypothetical protein [Pseudomonas aeruginosa PAO1]
gi|126168537|gb|EAZ54048.1| conserved hypothetical protein [Pseudomonas aeruginosa C3719]
gi|126195769|gb|EAZ59832.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
gi|218770250|emb|CAW26015.1| conserved hypothetical protein [Pseudomonas aeruginosa LESB58]
gi|346055891|dbj|GAA15774.1| conserved hypothetical protein [Pseudomonas aeruginosa NCMG1179]
gi|347303623|gb|AEO73737.1| hypothetical protein PAM18_1250 [Pseudomonas aeruginosa M18]
gi|375043743|gb|EHS36359.1| hypothetical protein O1O_15358 [Pseudomonas aeruginosa MPAO1/P1]
gi|375051737|gb|EHS44203.1| hypothetical protein O1Q_02893 [Pseudomonas aeruginosa MPAO1/P2]
gi|384398531|gb|EIE44936.1| hypothetical protein CF510_16988 [Pseudomonas aeruginosa
PADK2_CF510]
gi|392318085|gb|AFM63465.1| hypothetical protein PADK2_05860 [Pseudomonas aeruginosa DK2]
gi|403249088|gb|EJY62603.1| hypothetical protein PACIG1_1544 [Pseudomonas aeruginosa CIG1]
gi|404347721|gb|EJZ74070.1| hypothetical protein A161_18210 [Pseudomonas aeruginosa PAO579]
gi|404525255|gb|EKA35531.1| hypothetical protein PABE171_1421 [Pseudomonas aeruginosa ATCC
14886]
gi|404539215|gb|EKA48712.1| hypothetical protein PABE177_1160 [Pseudomonas aeruginosa ATCC
700888]
gi|404547383|gb|EKA56381.1| hypothetical protein PAE2_1185 [Pseudomonas aeruginosa E2]
gi|404549518|gb|EKA58375.1| hypothetical protein PABE173_1370 [Pseudomonas aeruginosa ATCC
25324]
gi|451754482|emb|CCQ88398.1| DUF124 domain-containing protein [Pseudomonas aeruginosa 18A]
Length = 248
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
+ ++ILG Q +I L P E V+A G+M +M+G + + + G +W
Sbjct: 6 LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V AAP ++ +DL GG L CQ D+FLC+
Sbjct: 65 KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+V +R+ G EGF+ QKL G GL F+ AGG+++++ L GE + VD C+VA
Sbjct: 125 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T ++ ++ G ++ +FGG+ L+ + G G V++QSLPF RL+ RI A
Sbjct: 184 TDGIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236
>gi|389736882|ref|ZP_10190389.1| hypothetical protein UU5_11043 [Rhodanobacter sp. 115]
gi|388438772|gb|EIL95500.1| hypothetical protein UU5_11043 [Rhodanobacter sp. 115]
Length = 343
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/234 (29%), Positives = 127/234 (54%), Gaps = 14/234 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I+G + Q +I L P E VA G++ + +++M+ ++ +
Sbjct: 79 IDYRIVGHDMQFVEIELDPGESAVAEAGALMYKESAIQMDTVFGDGSSSSGQSGSGLMDK 138
Query: 75 ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
+ + G+++ + V + G V AAP ++ + L+ GG L+CQ D+FL
Sbjct: 139 LLSAGKRVITGESLFTTVFTHTGSGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDSFLA 198
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
V++ +++ + G EGF+ QKL G G F+ AGG++V++ L GE + VD
Sbjct: 199 GARGVRLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTMVERELAPGERLDVDTG 258
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
C+VA +V++ ++ G I+ +FGG+ + A ++GPG V++QSLPF RL+ R+
Sbjct: 259 CVVAFHDTVSMDVRPVGGIKSMLFGGEGVFLATLSGPGKVWLQSLPFSRLAGRM 312
>gi|116051693|ref|YP_789468.1| hypothetical protein PA14_16590 [Pseudomonas aeruginosa UCBPP-PA14]
gi|296387797|ref|ZP_06877272.1| hypothetical protein PaerPAb_06574 [Pseudomonas aeruginosa PAb1]
gi|313109102|ref|ZP_07795074.1| hypothetical protein PA39016_001670058 [Pseudomonas aeruginosa
39016]
gi|386067731|ref|YP_005983035.1| hypothetical protein NCGM2_4827 [Pseudomonas aeruginosa NCGM2.S1]
gi|416878067|ref|ZP_11920194.1| hypothetical protein PA15_18983 [Pseudomonas aeruginosa 152504]
gi|421173065|ref|ZP_15630820.1| hypothetical protein PACI27_1306 [Pseudomonas aeruginosa CI27]
gi|115586914|gb|ABJ12929.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14]
gi|310881576|gb|EFQ40170.1| hypothetical protein PA39016_001670058 [Pseudomonas aeruginosa
39016]
gi|334838752|gb|EGM17460.1| hypothetical protein PA15_18983 [Pseudomonas aeruginosa 152504]
gi|348036290|dbj|BAK91650.1| hypothetical protein NCGM2_4827 [Pseudomonas aeruginosa NCGM2.S1]
gi|404536736|gb|EKA46372.1| hypothetical protein PACI27_1306 [Pseudomonas aeruginosa CI27]
Length = 248
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
+ ++ILG Q +I L P E V+A G+M +M+G + + + G +W
Sbjct: 6 LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V AAP ++ +DL GG L CQ D+FLC+
Sbjct: 65 KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+V +R+ G EGF+ QKL G GL F+ AGG+++++ L GE + VD C+VA
Sbjct: 125 RVGIAFAKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T ++ ++ G ++ +FGG+ L+ + G G V++QSLPF RL+ RI A
Sbjct: 184 TDGIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236
>gi|158520989|ref|YP_001528859.1| hypothetical protein Dole_0972 [Desulfococcus oleovorans Hxd3]
gi|158509815|gb|ABW66782.1| protein of unknown function DUF124 [Desulfococcus oleovorans Hxd3]
Length = 251
Score = 131 bits (329), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 79/236 (33%), Positives = 125/236 (52%), Gaps = 11/236 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY-----IPENEVG----M 73
+ + ++G Q +I L P E V+A G+M +M+ ++ E PE +G
Sbjct: 6 VDYTLIGDVMQAVEIGLDPGETVIADAGNMNYMNDAIRFETRMGDGAPAPEGLLGSLVAA 65
Query: 74 WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ LF G+ + N S V + P R++ +D+A G E+ CQ +AFLC+
Sbjct: 66 GRRLFAGENLFLTHFTNTAGSKRHVTFSGPCPGRVIVLDMAALGQEVFCQKEAFLCAARG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+VS +R+ G +GF+ +L G G+AF+ AGG VV+K L+ E I VD CIVA
Sbjct: 126 TQVSIAFTRRMGPGFFGRDGFVLMRLVGDGMAFVHAGGMVVEKTLK-NETICVDTGCIVA 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
TS ++ + ++ FGG+NL A ++G G VF+Q+LP R + RI R ++S
Sbjct: 185 FTSGIHYSARQADNLKSMFFGGENLFLATLSGTGTVFLQTLPLSRPADRILRQLSS 240
>gi|47530420|ref|YP_021769.1| hypothetical protein GBAA_5116 [Bacillus anthracis str. 'Ames
Ancestor']
gi|49187746|ref|YP_030999.1| hypothetical protein BAS4755 [Bacillus anthracis str. Sterne]
gi|47505568|gb|AAT34244.1| conserved hypothetical protein [Bacillus anthracis str. 'Ames
Ancestor']
gi|49181673|gb|AAT57049.1| conserved hypothetical protein [Bacillus anthracis str. Sterne]
Length = 226
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 79/210 (37%), Positives = 116/210 (55%), Gaps = 18/210 (8%)
Query: 52 MCFMSGSMEMENIYIPENEVGMWQWLFGKTITS------------VVLRNPGPSDGFVGI 99
M M +EME I+ + G LFGK + + V N G V
Sbjct: 1 MMMMEDYIEMETIF--GDGSGPSGGLFGKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSF 58
Query: 100 AAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
AAP +I+P+DL + G+++CQ DAFLC+ V + +++ G EGF+ QKL
Sbjct: 59 AAPYPGKIIPVDLTEYQGKVVCQKDAFLCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLE 118
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
G GLAF+ AGG+V ++ L+ GE + +D C+VA+T VN +++ G ++ A+FGG+ L
Sbjct: 119 GDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFF 178
Query: 220 AVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
A + GPG V+IQSL RLS R+ TSP
Sbjct: 179 ATLEGPGTVWIQSLTLSRLSARL----TSP 204
>gi|436834358|ref|YP_007319574.1| protein of unknown function DUF124 [Fibrella aestuarina BUZ 2]
gi|384065771|emb|CCG98981.1| protein of unknown function DUF124 [Fibrella aestuarina BUZ 2]
Length = 259
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 83/254 (32%), Positives = 132/254 (51%), Gaps = 13/254 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I ++I+G + QV +I L P E V+A GSM FM + E ++
Sbjct: 6 IDYKIIGEDIQVVEIELDPNETVIAEAGSMLFMEEGISFETRMGDGSQTSGGFLGKLMQA 65
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
+ + G+++ N G V AAP ++PI+LA ++ L+ Q DAFLC+
Sbjct: 66 GSRMIMGESLFMTHFTNRGVGKRKVAFAAPYPGTVMPINLANIYQNTLIVQKDAFLCAAM 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
++S +QR+ + G EGF+ +K+ G GLAFI AGG V+++ L EV+ VD C+V
Sbjct: 126 GTRLSIHFNQRLGAGLFGGEGFILEKVQGDGLAFIHAGGVVMERTLN-NEVLRVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
+N I+ G +R +FGG+ L A + G G V+IQS+P +L QR+A ++ +
Sbjct: 185 GFEPQINFSIEQAGGLRSMIFGGEGLFLATLRGTGKVWIQSMPISKLVQRLA-PYSAQSH 243
Query: 252 RENPKFFVQIAIFF 265
+E Q+ F
Sbjct: 244 KEGGSVLGQLGNLF 257
>gi|313676285|ref|YP_004054281.1| hypothetical protein Ftrac_2191 [Marivirga tractuosa DSM 4126]
gi|312942983|gb|ADR22173.1| protein of unknown function DUF124 [Marivirga tractuosa DSM 4126]
Length = 258
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 79/237 (33%), Positives = 131/237 (55%), Gaps = 14/237 (5%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSM----EMENIYIPENEVGMW 74
Q + ++ILG Q+ +I L PQE V+A G+M +M S+ +M + P NE +
Sbjct: 4 QSHEVDYKILGDGVQLVEIELDPQETVIAEAGAMVYMDDSILFETKMGDGSNP-NEGFLG 62
Query: 75 QWL-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAF 126
+ + G+++ + G V + P ++ +DL + E++ Q D F
Sbjct: 63 KLMSAGSRIFTGESLFMTHFTHQGQGKAHVAFSGPYPGTVIALDLKDHYNNEVIVQKDGF 122
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
LC+ KVS TL++++ + + G EGF+ QKL G G AF+ AGG+V++K L E + VD
Sbjct: 123 LCAAMGTKVSITLNKKIGSGLLGGEGFIMQKLQGNGRAFVHAGGTVIEKQLN-NETLKVD 181
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
C+VA SS++ I+ +G I+ +FGG+ + A + G G V++QS+P +L Q IA
Sbjct: 182 TGCVVAYESSLDFDIQTSGGIKSMLFGGEGIFLATLRGTGKVWLQSMPIRKLIQAIA 238
>gi|91793514|ref|YP_563165.1| hypothetical protein Sden_2160 [Shewanella denitrificans OS217]
gi|91715516|gb|ABE55442.1| protein of unknown function DUF124 [Shewanella denitrificans OS217]
Length = 264
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 127/230 (55%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW------ 74
+ ++I+G Q+ ++ L P E VVA G+M ++ + E E + G +
Sbjct: 8 VDYEIIGESMQLVEVELDPNETVVAEAGAMNYLEQDINFEAKMGDGSEPDSGFFGKMLGA 67
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G++I N G V AAP IL +DL+++G EL+CQ D+FL +
Sbjct: 68 GKRALSGESIFMTHFTNQGSQKRKVAFAAPYPGTILALDLSLYGEELICQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+V+ ++R+ G EGF+ Q L G G+ F+ AGG++++K L+ GE + VD CIV
Sbjct: 128 TQVTMKFNRRLGTGFFGGEGFILQNLKGDGMVFVHAGGTLIKKELK-GETLRVDTGCIVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T+ ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 187 FTAGIDYDIQRAGSLKSMVFGGEGLFLATLQGHGTVWLQSLPFSRMADRI 236
>gi|66803082|ref|XP_635384.1| hypothetical protein DDB_G0291181 [Dictyostelium discoideum AX4]
gi|60463692|gb|EAL61874.1| hypothetical protein DDB_G0291181 [Dictyostelium discoideum AX4]
Length = 359
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 77/238 (32%), Positives = 126/238 (52%), Gaps = 2/238 (0%)
Query: 7 STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
ST FQ + ++I G E+QV Q+ L P + A GS+ M +EM+
Sbjct: 91 STLFQNNQSLRGKSTSFEWEIYGLESQVVQMNLNPGTSITAETGSLLEMGSDIEMDTTAR 150
Query: 67 PENEVGMWQWLFGKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDA 125
+ G+ + N ++ + ++P +++I+PI L+ FGGEL+CQ +A
Sbjct: 151 GGFLSSFKRMFTGQGLFLTRFINKSETEKMRITFSSPYISKIVPIKLSEFGGELICQKNA 210
Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
FLC + V++ + + G +GF+ Q+L G G+ F+ AGGS++ + L GE I V
Sbjct: 211 FLCGEDSVEIEPRFTKNFKTGFFGGQGFILQRLVGNGMVFVHAGGSIIYRVLLPGEKIKV 270
Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
IVA +SV +++ I+ +F G+ L A +TGPG++ +QSLPF RL + IA
Sbjct: 271 TTGSIVAFETSVEYNVEFVKGIKNILF-GEGLSLATITGPGLIILQSLPFERLVRAIA 327
>gi|222098336|ref|YP_002532393.1| hypothetical protein BCQ_4678 [Bacillus cereus Q1]
gi|221242394|gb|ACM15104.1| conserved hypothetical protein [Bacillus cereus Q1]
Length = 226
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 78/210 (37%), Positives = 116/210 (55%), Gaps = 18/210 (8%)
Query: 52 MCFMSGSMEMENIYIPENEVGMWQWLFGKTITS------------VVLRNPGPSDGFVGI 99
M M +EME I+ + G LFGK + + V N G V
Sbjct: 1 MMMMEDYIEMETIF--GDGSGPSGGLFGKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSF 58
Query: 100 AAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
AAP +I+P+DL + G+++CQ DAFLC+ V + +++ G EGF+ QKL
Sbjct: 59 AAPYPGKIIPVDLTEYQGKVVCQKDAFLCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLE 118
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
G GLAF+ AGG+V ++ L+ GE + +D C+VA+T VN +++ G ++ A+FGG+ L
Sbjct: 119 GDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFF 178
Query: 220 AVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
A + GPG V+IQSL RL+ R+ TSP
Sbjct: 179 ATLEGPGTVWIQSLTLSRLAARL----TSP 204
>gi|114563495|ref|YP_751008.1| hypothetical protein Sfri_2324 [Shewanella frigidimarina NCIMB 400]
gi|114334788|gb|ABI72170.1| protein of unknown function DUF124 [Shewanella frigidimarina NCIMB
400]
Length = 262
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 82/236 (34%), Positives = 124/236 (52%), Gaps = 11/236 (4%)
Query: 17 SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW 74
S + + ++I+G Q+ ++ L P E V+A G+M ++ + E E E G +
Sbjct: 2 SMKSHEVDYEIIGSSMQLVEVELDPNESVIAEAGAMTYLEEDIMFEARMGDGSEPESGFF 61
Query: 75 QWLFG--------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
L G + I N G V AAP IL IDLA GEL+CQ D+F
Sbjct: 62 GKLLGAGKRAISGEGIFMTHFTNQGQQKRKVAFAAPFPGTILAIDLAQHQGELICQKDSF 121
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
L + +VS ++R+ G EGF+ Q L G G+AF+ AGG++++K L GE + VD
Sbjct: 122 LTAALGTQVSMKFNKRLGTGFFGGEGFILQSLKGDGMAFVHAGGTLIKKELH-GETLRVD 180
Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
C+V T ++ I+ G ++ VFGG+ L A + G G V++QSLPF R++ RI
Sbjct: 181 TGCLVGFTPGIDYDIQRAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236
>gi|441499671|ref|ZP_20981848.1| DUF124 domain-containing protein [Fulvivirga imtechensis AK7]
gi|441436595|gb|ELR69962.1| DUF124 domain-containing protein [Fulvivirga imtechensis AK7]
Length = 258
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 82/240 (34%), Positives = 126/240 (52%), Gaps = 20/240 (8%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I ++I G Q+ ++ L P E V+A G+M +M + E +E Q LF
Sbjct: 4 QSHEIDYEIKGESIQIVEVELDPNETVIAEAGAMLYMEDGIVFETKMGDGSEPN--QGLF 61
Query: 79 GKTI--------------TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF-GGELLCQP 123
GK + T R G + V +AP ++PIDLA G E++ Q
Sbjct: 62 GKLLSAGTRMLTGESMFMTHFTYRGVGKAK--VAFSAPYPGTVIPIDLAQIPGREVIVQK 119
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
D FLC+ KV+ T ++R+ + + G EGF+ QKL G G AF+ AGG+V++K L EV+
Sbjct: 120 DGFLCAALGTKVAITFNKRLGSGLVGGEGFILQKLQGDGKAFVHAGGTVIEKQLN-NEVL 178
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
VD C+VA ++ ++ G +R +FGG+ L A + G G V++QS+P +L Q +A
Sbjct: 179 RVDTGCVVAFEPGIDFNVESAGGLRSMIFGGEGLFLATLRGTGRVWLQSMPIRKLVQALA 238
>gi|226227458|ref|YP_002761564.1| hypothetical protein GAU_2052 [Gemmatimonas aurantiaca T-27]
gi|226090649|dbj|BAH39094.1| hypothetical protein GAU_2052 [Gemmatimonas aurantiaca T-27]
Length = 263
Score = 130 bits (326), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 119/229 (51%), Gaps = 9/229 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
I +QI+G + Q + L P E V A G+M FM + M P G
Sbjct: 6 IDYQIIGDDLQAVIVTLDPGESVFAEAGAMMFMREGITMATTLDPNARSGSLFDKLVGAG 65
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G + + N G V AAP +I+P++L +GG +L Q D+FLC+ +
Sbjct: 66 KRVLAGDSFFVTLFGNAGARRSDVAFAAPYPGKIVPLNLRDWGGTVLAQKDSFLCAARGI 125
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+S +R+ G EGF+ QKL G GLAF+ A G++ +L GE + VD C+VA
Sbjct: 126 DISVAFTRRIGAGFFGGEGFILQKLQGDGLAFLHASGTLHAIDLAPGEQLRVDTGCLVAF 185
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+VN I+ I+ A+FGG+ L +TGPG V +Q+LPF RL+ RI
Sbjct: 186 QPTVNYDIQRVPGIKTALFGGEGLFFVSLTGPGRVILQTLPFSRLADRI 234
>gi|328869835|gb|EGG18210.1| hypothetical protein DFA_03697 [Dictyostelium fasciculatum]
Length = 354
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/227 (31%), Positives = 126/227 (55%), Gaps = 1/227 (0%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
+++ ++I G E+QV Q+ L P + A G++ MS S+EM+ +
Sbjct: 96 KESTFEWEISGHESQVVQLKLDPGSSITAETGALLEMSSSVEMDTSARGGFLTSFKRMFT 155
Query: 79 GKTITSVVLRNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
G+ + +N ++ G V ++P +++IL + ++ GGEL+C+ ++FLC N +++
Sbjct: 156 GQGLFLTKFKNVSDNEIGRVTFSSPYISKILAVKMSELGGELICKKNSFLCGDNAIEIEP 215
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
L +R G EGF+ Q+L G G+AF+ GS++ ++L+ GE + V V IVA +SV
Sbjct: 216 QLTKRFSIGFFGGEGFILQRLKGSGMAFLHGCGSIIFRSLQPGEKVRVSVGSIVAFQTSV 275
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
I++ F G+ L ++GPG+V +QSLPF +L +I+R
Sbjct: 276 EYDIEWVKGATNIFFSGEGLFLTTLSGPGLVILQSLPFEKLVNQISR 322
>gi|284039740|ref|YP_003389670.1| hypothetical protein Slin_4893 [Spirosoma linguale DSM 74]
gi|283819033|gb|ADB40871.1| protein of unknown function DUF124 [Spirosoma linguale DSM 74]
Length = 259
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 79/254 (31%), Positives = 134/254 (52%), Gaps = 13/254 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENI----------YIPENEVG 72
I ++I+G + Q+ +I L P E V+A GSM +M + E ++ +
Sbjct: 6 IDYKIIGEDIQIVEIELDPNETVIAEAGSMLYMEDGITFETKMGDGSQPDQGFLGKLLQA 65
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
+ + G+++ N G V +AP I+PI++A ++G L+ Q DAFLC+
Sbjct: 66 GSRMVMGESLFMTHFTNRGVGKRKVAFSAPYPGTIMPINMANVYGNTLIVQKDAFLCAAL 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
K+S +Q++ + G EGF+ +K+ G+AFI AGG V+++ L E + VD C+V
Sbjct: 126 GTKLSIHFNQKIGAGLFGGEGFILEKVQSDGMAFIHAGGVVIERTLN-NETLRVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
S+N I+ G +R VFGG+ L A + G G V+IQS+P +L QR++ ++ +
Sbjct: 185 GFEPSINFDIQRAGGLRSMVFGGEGLFLATLRGSGKVWIQSMPISKLVQRLSTG-SAQSH 243
Query: 252 RENPKFFVQIAIFF 265
+E+ Q+ F
Sbjct: 244 KESGSVLGQLGNLF 257
>gi|47568039|ref|ZP_00238745.1| uncharacterized conserved protein CAC1537 [Bacillus cereus G9241]
gi|47555342|gb|EAL13687.1| uncharacterized conserved protein CAC1537 [Bacillus cereus G9241]
Length = 226
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 77/210 (36%), Positives = 116/210 (55%), Gaps = 18/210 (8%)
Query: 52 MCFMSGSMEMENIYIPENEVGMWQWLFGKTITS------------VVLRNPGPSDGFVGI 99
M M +EME I+ + G LFGK + + V N G V
Sbjct: 1 MMMMEDYIEMETIF--GDGSGPSGGLFGKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSF 58
Query: 100 AAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
AAP +I+P+DL + G+++CQ DAFLC+ V + +++ G EGF+ QKL
Sbjct: 59 AAPYPGKIIPVDLTEYQGKVVCQKDAFLCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLE 118
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
G GLAF+ AGG+V ++ L+ GE + +D C+VA+T +N +++ G ++ A+FGG+ L
Sbjct: 119 GDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDINYDVEFVGKVKTALFGGEGLFF 178
Query: 220 AVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
A + GPG V+IQSL RL+ R+ TSP
Sbjct: 179 ATLEGPGTVWIQSLTLSRLAARL----TSP 204
>gi|84625912|ref|YP_453284.1| hypothetical protein XOO_4255 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|84369852|dbj|BAE71010.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
Length = 256
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 68/228 (29%), Positives = 122/228 (53%), Gaps = 14/228 (6%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW--------------QWLF 78
Q ++ L P E +A G++ + +++M+ ++ + G + +
Sbjct: 2 QFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGGLMGKLLSAGKRVVT 61
Query: 79 GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
G+++ + + + G V AAP +L + L+ GG+L+CQ D+FL V +
Sbjct: 62 GESMFTTIFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAGARGVSLGIA 121
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+++ + G EGF+ QKL G G F+ AGG+VV++ L GE I VD C+VA +SV+
Sbjct: 122 FQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGECIDVDTGCVVAYHASVD 181
Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+ ++ ++ FGG+ + A +TGPG V++QSLPF R++ R+ +A
Sbjct: 182 MDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 229
>gi|345876582|ref|ZP_08828349.1| protein of unknown function DUF124 [endosymbiont of Riftia
pachyptila (vent Ph05)]
gi|344226418|gb|EGV52754.1| protein of unknown function DUF124 [endosymbiont of Riftia
pachyptila (vent Ph05)]
Length = 233
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 72/218 (33%), Positives = 123/218 (56%), Gaps = 12/218 (5%)
Query: 36 QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF--------GKTITSV 85
++ L P E V+A G+M ++ + E + + G++ LF G+++ +
Sbjct: 2 EVELDPGETVIAEAGAMTYLEQEISFETKMGDGSNPDQGIFGKLFSAGKRMITGESLFTT 61
Query: 86 VLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ G + V +AP ++P++LA GG+++CQ DAFLC+ K+ +++
Sbjct: 62 HFTHSGSAGKRKVAFSAPFPGSVIPLNLAALGGKVICQKDAFLCAALGTKLGIAFTKKLG 121
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ + G EGF+ Q L G G AFI AGG+VV+K L GE + VD C+V T ++ I+ +
Sbjct: 122 SGLFGGEGFILQSLEGDGTAFIQAGGTVVEKQLN-GETLRVDTGCLVGFTEGIDYSIEMS 180
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G ++ +FGG+ + A ++G G V+IQSLPF R++ RI
Sbjct: 181 GGLKSMLFGGEGIFLATLSGSGTVWIQSLPFSRMADRI 218
>gi|119774708|ref|YP_927448.1| hypothetical protein Sama_1571 [Shewanella amazonensis SB2B]
gi|119767208|gb|ABL99778.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
Length = 264
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/230 (33%), Positives = 128/230 (55%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMW------ 74
+ ++I G Q+ ++ L P E V+A G+M +M ++ E + E G +
Sbjct: 8 VDYEIRGESMQLVEVELDPGETVIAEAGAMTYMEEGIQFEARMGDGSQPESGFFGALMGA 67
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+ L G+++ N G V +AP +L +D+A GGEL+ Q D+FL +
Sbjct: 68 GKRMLTGESLFMTHFTNQGHGKRRVAFSAPYPGTLLALDMAELGGELILQKDSFLAAALG 127
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
+++ T ++R+ G EGF+ QKL G G+AF+ AGG++++K L+ GE + VD C+V
Sbjct: 128 TQITMTFNKRLGTGFFGGEGFIMQKLKGDGMAFVHAGGTLIKKELK-GETLRVDTGCLVG 186
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ VFGG+ L A ++G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIQRAGSLKSMVFGGEGLFLATLSGHGTVWLQSLPFSRMADRI 236
>gi|385800164|ref|YP_005836568.1| hypothetical protein [Halanaerobium praevalens DSM 2228]
gi|309389528|gb|ADO77408.1| protein of unknown function DUF124 [Halanaerobium praevalens DSM
2228]
Length = 265
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 79/235 (33%), Positives = 128/235 (54%), Gaps = 12/235 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSM----------EMENIYIPENEVG 72
I +QI+G + QV ++ L P+E V+A G+M +M + E+ N + +
Sbjct: 6 IDYQIIGNDMQVVEVDLDPEETVIAEAGAMNWMDNGISFEAKMGDGTELNNSILAKVFGA 65
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
+ + G+++ N + AAP I+ +DLA + + CQ DAFLC+
Sbjct: 66 GKRAITGESVFMTHFTNTDYEKKKIAFAAPYPGNIIALDLAELEANQFTCQKDAFLCAAL 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
KV T +R + + G EGF+ QK++G G AFI AGG+VV+K L+ E + VD C+V
Sbjct: 126 GTKVDITFSKRFGSGLFGGEGFILQKISGDGKAFIHAGGTVVKKELKN-EKLKVDTGCLV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T+ ++ I+ G ++ +FGG+ L A + G G V++QSLPF RL+ RI ++
Sbjct: 185 GFTAGIDYNIEKAGKLKSMLFGGEGLFLATLAGTGTVYLQSLPFSRLADRIIKSA 239
>gi|89094062|ref|ZP_01167006.1| hypothetical protein MED92_02156 [Neptuniibacter caesariensis]
gi|89081736|gb|EAR60964.1| hypothetical protein MED92_02156 [Oceanospirillum sp. MED92]
Length = 262
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 81/230 (35%), Positives = 123/230 (53%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
+ ++I+G QV ++ L P EKV+A G M +M + E E + G+ LF
Sbjct: 6 VDYEIIGESLQVVEVELDPGEKVIAEAGMMNYMEEGISFETKMGDGSEPDQGLMGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G V AAP I+P++LA FG +++CQ DAFLC+
Sbjct: 66 GKRMVMGESVFMTHFFNEGQGKQRVAFAAPYPGSIVPLNLADFGEKVICQKDAFLCAALG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
KV +++ G EGF+ QKL G G+AF+ AGG+VV+K L E + VD C+V
Sbjct: 126 TKVEIAFSKKIGAGFFGGEGFILQKLEGDGMAFMQAGGTVVRKELN-NETLRVDTGCLVG 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T + I+ ++ +FGG+ L A + G G V++QSLPF RL+ R+
Sbjct: 185 FTEGIEYDIEMVKGLKSMIFGGEGLWLATLKGTGSVWVQSLPFSRLADRL 234
>gi|423584601|ref|ZP_17560688.1| TIGR00266 family protein [Bacillus cereus VD045]
gi|401235827|gb|EJR42294.1| TIGR00266 family protein [Bacillus cereus VD045]
Length = 217
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 104/171 (60%), Gaps = 4/171 (2%)
Query: 79 GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
G+++ V N G V AAP +I+P+DL + G+++CQ DAFLC+ V +
Sbjct: 29 GESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAAKGVSIGIE 88
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+++ G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D C+VA+T +N
Sbjct: 89 FTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDIN 148
Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
+++ G ++ A+FGG+ L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 149 YDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 195
>gi|452077091|gb|AGF93061.1| protein containing DUF124 [uncultured organism]
Length = 225
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 70/227 (30%), Positives = 123/227 (54%), Gaps = 2/227 (0%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G Q+ I L PQ++V GSM +MSG++EME GM + L G++
Sbjct: 1 MKYKIDGNNLQMVTIELGPQDEVYGEAGSMKYMSGNVEMETKGKGGFMKGMKRKLSGESY 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ +G +G A + +I +DL+ G +CQ DAFLC+ V + ++
Sbjct: 61 FLTHFKTK-SGNGLIGFAGNTPGKIKTLDLSS-GNNYVCQKDAFLCAEEGVDLDIAFQKK 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + G EGF+ QKL+G G+AF+ G +++K+L+ G+V+ V + A SSVN I+
Sbjct: 119 LGAGLFGGEGFILQKLSGNGMAFVHVMGDLMEKDLDQGQVLRVSTGNVAAFESSVNYDIQ 178
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
G ++ ++ G+ + + GPG V++QS+ L+ ++R V+
Sbjct: 179 RTGGVKSSLLSGEGIFMTTLHGPGKVWLQSMTLRDLASELSRYVSGD 225
>gi|422675984|ref|ZP_16735321.1| hypothetical protein PSYAR_24722, partial [Pseudomonas syringae pv.
aceris str. M302273]
gi|330973695|gb|EGH73761.1| hypothetical protein PSYAR_24722, partial [Pseudomonas syringae pv.
aceris str. M302273]
Length = 332
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 73/218 (33%), Positives = 115/218 (52%), Gaps = 11/218 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294
Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVF 229
+T +V+ I+ G I+ +FGG+ + A +TGPG V+
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVW 332
>gi|392969555|ref|ZP_10334970.1| protein of unknown function DUF124 [Fibrisoma limi BUZ 3]
gi|387841749|emb|CCH57028.1| protein of unknown function DUF124 [Fibrisoma limi BUZ 3]
Length = 259
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 81/231 (35%), Positives = 128/231 (55%), Gaps = 12/231 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWL-- 77
I ++I+G + Q+ +I L P E V+A G+M FM ++ E N+ M + L
Sbjct: 6 IDYKIIGEDIQIVEIELDPNETVIAEAGAMLFMEDGIQFETKMGDGSQPNQGFMGKLLQA 65
Query: 78 -----FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
G+++ N G V AAP +LP++LA ++G L+ Q DAFLC+
Sbjct: 66 GSRLITGESLFMTHFTNRGVGKRKVAFAAPYPGTVLPVNLATIYGNSLIVQKDAFLCAAL 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
++S +QR+ G EGF+ QKL G GL+FI AGG V+++ L E++ VD C+V
Sbjct: 126 GTRLSIHFNQRLGAGFFGGEGFILQKLQGDGLSFIHAGGVVIERQLN-NELLRVDTGCVV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
A +N I+ +G ++ +FGG+ L A ++G G V+IQS+P +L QR+
Sbjct: 185 AFEPQINFDIQRSGGLKSMIFGGEGLFVATLSGTGKVWIQSMPISKLVQRL 235
>gi|18309316|ref|NP_561250.1| hypothetical protein CPE0334 [Clostridium perfringens str. 13]
gi|18143992|dbj|BAB80040.1| conserved hypothetical protein [Clostridium perfringens str. 13]
Length = 187
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 68/158 (43%), Positives = 101/158 (63%), Gaps = 1/158 (0%)
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
V A+P +I+P+DL +GG+L+CQ DAF+C+ V + +++ G EGF+ Q
Sbjct: 14 VAFASPYPGKIIPVDLRNYGGKLICQKDAFICAAKGVSIGIDFRRKLGTGFFGGEGFILQ 73
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
KL G G+AFI AGG++V+K L G+ + VD C+VA+T VN I+Y I+ AVFGG+
Sbjct: 74 KLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCLVAMTKDVNYDIEYVKGIKNAVFGGEG 133
Query: 217 LVTAVVTGPGIVFIQSLPFHRLSQR-IARAVTSPNMRE 253
+ A + GPG V+IQSLPF RL+ + ++ A P R+
Sbjct: 134 IFFASLVGPGEVWIQSLPFSRLASKVVSSAPQVPGSRD 171
>gi|255599941|ref|XP_002537348.1| conserved hypothetical protein [Ricinus communis]
gi|223516663|gb|EEF25035.1| conserved hypothetical protein [Ricinus communis]
Length = 203
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 69/172 (40%), Positives = 95/172 (55%)
Query: 79 GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
G+++ + V + G V AAP I+P+ L GG L+CQ DAFLC+ V +
Sbjct: 14 GESLFTTVFTHNGSGKARVAFAAPVPGAIVPLKLDSVGGTLICQKDAFLCAAKGVSMGIA 73
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+RV + G EGF+ QKL G G F+ GG VV++ L GE + VD C+ A T SV+
Sbjct: 74 FQRRVMTGLFGGEGFIMQKLDGDGWVFVQFGGMVVERELAPGEELHVDTGCVAAFTPSVD 133
Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+ G +R FGG+ L A + GPG V+IQSLPF RL+ R+ A S
Sbjct: 134 FDLVTAGGVRSVFFGGEGLFFARLRGPGKVWIQSLPFSRLAGRMMAAAGSHG 185
>gi|395218495|ref|ZP_10402141.1| hypothetical protein O71_17321 [Pontibacter sp. BAB1700]
gi|394454374|gb|EJF09046.1| hypothetical protein O71_17321 [Pontibacter sp. BAB1700]
Length = 255
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 71/227 (31%), Positives = 122/227 (53%), Gaps = 12/227 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENI----------YIPENEVG 72
I F+I G + QV +I L P E V+A G+M +M ++ + ++ +
Sbjct: 7 IDFRIFGNDIQVLEIELDPNETVIAEAGAMVYMEEGIDFQTKMGDGSNPSQGFLGKLVSA 66
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVN 131
+ + G+++ + G V +AP ILP+DL F G L+ Q DAFL +
Sbjct: 67 GSRLITGESLFMTHFTHRGYGKSRVAFSAPYPGTILPLDLQQFNGNTLIAQKDAFLAAAL 126
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
K++ +Q++ + + G EGF+ QK++G GLAF+ AGG++V+K L E + VD C+V
Sbjct: 127 GTKLAMHFNQKLGSGLFGGEGFILQKISGDGLAFLHAGGTIVEKELN-NETLRVDTGCVV 185
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
A ++ I+ G ++ +FGG+ + A + G G V++QS+P +L
Sbjct: 186 AFEQGIDFSIQRAGGLKSMIFGGEGIFLATLRGTGRVWLQSMPVKKL 232
>gi|395646536|ref|ZP_10434396.1| protein of unknown function DUF124 [Methanofollis liminatans DSM
4140]
gi|395443276|gb|EJG08033.1| protein of unknown function DUF124 [Methanofollis liminatans DSM
4140]
Length = 270
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/225 (31%), Positives = 121/225 (53%), Gaps = 3/225 (1%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
P++ + ++I+G Q+ ++ L E+V A G+M MSG+M+M++ G+ + L
Sbjct: 38 PRELTMKYEIIGDNLQMVKLTLAQGERVNAEAGAMVNMSGNMQMDSHLKGGLLGGLKRAL 97
Query: 78 FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
+++ G S GFV A RI P+D+ G E L Q DA+LCS V +
Sbjct: 98 TNESLFLTEFTPQGES-GFVSFAGNVPGRIFPVDVT--GREFLAQKDAYLCSEQGVHLDI 154
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++R+ G EGF+ Q+L+G G AF+ G ++ +L GEV+ V+ +V ++V
Sbjct: 155 AFTKKIRSGFFGGEGFILQRLSGSGTAFLHCCGDTIEMDLAAGEVVKVETGLVVGFDATV 214
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ I+ G ++ FGG+ L +TGPG V +QS+ +L+ +
Sbjct: 215 DYSIQMAGGVKTVFFGGEGLFLTTLTGPGKVVLQSMDIAKLASSL 259
>gi|312129436|ref|YP_003996776.1| hypothetical protein Lbys_0661 [Leadbetterella byssophila DSM
17132]
gi|311905982|gb|ADQ16423.1| protein of unknown function DUF124 [Leadbetterella byssophila DSM
17132]
Length = 259
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 81/244 (33%), Positives = 131/244 (53%), Gaps = 16/244 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME----NIYIPEN-------EV 71
I FQI G + Q+ +I L PQE V+A G+M FM ++ E + PE +
Sbjct: 6 IDFQIHGEDIQIVEIELDPQETVIAEAGAMLFMEDGIQFETKLGDGTAPEQGFLGKVFQA 65
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSV 130
G + L G+++ N G V AAP I P+DLA ++ L+ Q DAFLC+
Sbjct: 66 GS-RLLTGESLFMTHFTNRGYGKRKVAFAAPYPGTIKPVDLAQIYQNTLIVQKDAFLCAA 124
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
++ ++R+ G EGF+ Q++ G G+AFI +GG V+++ L E + VD CI
Sbjct: 125 LGTSINIHFNKRLGAGFFGGEGFILQRIQGDGMAFIHSGGVVIERQLS-NETLRVDTGCI 183
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
VA ++ I+ G ++ +FGG+ + A ++G G V++QSLP +L QR++ + PN
Sbjct: 184 VAFEPQIHFDIQAAGNLKSMIFGGEGMFLATLSGTGRVWLQSLPISKLIQRLSS--SGPN 241
Query: 251 MREN 254
++
Sbjct: 242 TKKE 245
>gi|298712641|emb|CBJ48666.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 403
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 72/223 (32%), Positives = 119/223 (53%), Gaps = 11/223 (4%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV 85
++ G E+Q+ L+P + + A G++ FM +EM+ N G +F + IT
Sbjct: 4 KVEGQESQMVTFELEPGQVIRAEAGNLVFMEDGIEMDT-----NTGGGASSMFRRVITGQ 58
Query: 86 VLR-----NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+ N G + G VG+ ++I+ + LA +GGE++CQ AFL + + +
Sbjct: 59 NMFVTDYVNRGDASGKVGLGTDVPSKIVRLPLAAYGGEVICQRGAFLAGSHTIDIQTEFT 118
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+ + GG EGF+ Q+LTG+G AF+ A G ++Q+ L GE + V +VA ++SV
Sbjct: 119 KMMAGFFGG-EGFVLQRLTGKGDAFVKASGPLIQRELSEGETLRVSSGSVVAFSTSVQYD 177
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
++ + VFGG+ L + GPG VF+Q LPF R+ +IA
Sbjct: 178 VQTIPGFKNVVFGGEGLFITKLAGPGTVFLQGLPFDRMVDQIA 220
>gi|117925123|ref|YP_865740.1| hypothetical protein Mmc1_1826 [Magnetococcus marinus MC-1]
gi|117608879|gb|ABK44334.1| protein of unknown function DUF124 [Magnetococcus marinus MC-1]
Length = 253
Score = 120 bits (300), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 77/231 (33%), Positives = 123/231 (53%), Gaps = 12/231 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV------GMW-- 74
+ F+I G + Q+ ++ L P E V+A G+MC+M ++ E ++ G+
Sbjct: 16 VDFEIHGSDLQLVEVELDPGETVIAEAGTMCYMEENISFETKMGDGSQADEGFLGGLLSI 75
Query: 75 --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVN 131
+ L G++I N G V AAP +I+P+D+ + L CQ DAFLC+
Sbjct: 76 GKRALTGESIFMTHFSNRGSVKRRVAFAAPFPGKIIPLDMRQLPEQTLYCQRDAFLCAAL 135
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
+VS +R+ G EGF+ QKL G G+AFI AGG +V+K LE + + VD C+V
Sbjct: 136 GTQVSIAFSRRLGAGFFGGEGFILQKLVGDGMAFIHAGGVIVEKKLE-NQRLRVDTGCLV 194
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T ++ I+ G ++ +FGG+ + + G G V++QSLPF L+ R+
Sbjct: 195 GFTPGIDYNIEAAGGMKSMIFGGEGIFLTTLAGTGTVWLQSLPFANLADRV 245
>gi|372223454|ref|ZP_09501875.1| hypothetical protein MzeaS_14126, partial [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 190
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/185 (37%), Positives = 99/185 (53%), Gaps = 10/185 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I ++I G E Q +I L PQE VVA GS M ++M+ I+ E G+ +F
Sbjct: 6 IDYKIYGEEMQYVEIELDPQEAVVAEAGSFMMMDSGIKMDTIFGDGSNQESGVLGKIFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N V A+P +ILPIDL+ GG+ +CQ DAFLC+
Sbjct: 66 GKRILTGESLFMTAFLNVEHGKKQVSFASPYPGKILPIDLSEKGGKFICQKDAFLCAAKG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V V +R+ + G EGF+ QKL G G+AF+ AGG++ +K L GEV+ VD CIV
Sbjct: 126 VTVGIEFSKRLGRGLFGGEGFIMQKLEGDGMAFVHAGGTMAKKELAPGEVLKVDTGCIVG 185
Query: 193 VTSSV 197
+ +V
Sbjct: 186 FSQTV 190
>gi|87307766|ref|ZP_01089909.1| hypothetical protein DSM3645_22806 [Blastopirellula marina DSM
3645]
gi|87289380|gb|EAQ81271.1| hypothetical protein DSM3645_22806 [Blastopirellula marina DSM
3645]
Length = 249
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 80/247 (32%), Positives = 128/247 (51%), Gaps = 18/247 (7%)
Query: 12 PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIP 67
P + Q D I + I+G + Q+ ++ L P E V+A G+M +M S +M + P
Sbjct: 4 PVLQQHSHD--IDYHIIGDDMQLVEVELDPGETVIAEAGAMNYMQEGITFSTKMGDGSDP 61
Query: 68 ENEVGMWQWLF--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGEL 119
+ G+ LF G ++ N + V AP +ILPI+L+ GG L
Sbjct: 62 DQ--GLVGKLFSAGKRMVSGNSLFMGHFTNESSTKRHVAFGAPYPGKILPIELSEIGGTL 119
Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
CQ D+FLC+ ++ +R+ G EGF+ Q LTG G AF+ A G+++ + L+
Sbjct: 120 TCQKDSFLCAAMGTRLDIAFQKRLGAGFFGGEGFILQHLTGDGNAFLHACGAIITRELK- 178
Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
GE + V+ +VA T ++ I+ G ++ + GG+ L A ++G G V +QSLPF R++
Sbjct: 179 GETLLVEAGSLVAFTQGIDYDIERAGNLKSMLLGGEGLYLAKLSGHGTVMMQSLPFSRMA 238
Query: 240 QRI-ARA 245
I ARA
Sbjct: 239 NHILARA 245
>gi|442610563|ref|ZP_21025274.1| DUF124 domain-containing protein [Pseudoalteromonas luteoviolacea B
= ATCC 29581]
gi|441747780|emb|CCQ11336.1| DUF124 domain-containing protein [Pseudoalteromonas luteoviolacea B
= ATCC 29581]
Length = 262
Score = 117 bits (293), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 74/230 (32%), Positives = 118/230 (51%), Gaps = 11/230 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLF-- 78
I ++I+G Q+ ++ L P E V+A G+M ++ ++ E +EV G LF
Sbjct: 6 IDYKIIGESMQMVEVELDPGETVIAEAGAMNYLEDGIQFEAKMGDGSEVDQGFMGKLFSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
G+++ N G V AAP I+P+D++ G + Q D+FLC+
Sbjct: 66 GKRMISGESLFMTYFTNQGLGKKHVAFAAPFPGTIIPMDMSTLGQSVYLQKDSFLCAALG 125
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
KV +++ G EGF+ + L G G+AF AGG+V++K L GE + VD C+V
Sbjct: 126 TKVDIAFQRKLGAGFFGGEGFILEHLQGDGMAFAHAGGTVIKKELR-GETLKVDTGCVVG 184
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
T + I+ ++ FGG+ L A + G G V+IQSLPF R++ R+
Sbjct: 185 FTGGIEFDIERIKGLKSMFFGGEGLFLATLRGHGTVWIQSLPFSRMADRV 234
>gi|126179804|ref|YP_001047769.1| hypothetical protein Memar_1861 [Methanoculleus marisnigri JR1]
gi|125862598|gb|ABN57787.1| protein of unknown function DUF124 [Methanoculleus marisnigri JR1]
Length = 228
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 115/220 (52%), Gaps = 3/220 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G Q+ ++ L P EK A G+M MSG+M+M G+ + + G+ +
Sbjct: 1 MQYEITGDNLQMVKLRLAPGEKACAEAGAMVNMSGNMQMTTNMKGGLFKGLKRMMTGEGL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
P +GFV A +I +DL G E + Q DAFLCS + + ++
Sbjct: 61 FMTEF-TPQGGEGFVSFAGNVPGKIFTLDLN--GNEFIAQKDAFLCSEQGIDLDIAFTKK 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+R+ + G EGF+ Q+L+G G AF+ G +++ L GEV+ V+ +V S+V+ I+
Sbjct: 118 LRSGVFGGEGFILQRLSGSGKAFLHCCGDIMEMTLAPGEVVKVETGLVVGFESTVDYSIQ 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G ++ FGG+ L +TGPG V +QS+ +L+ +
Sbjct: 178 VAGGVKTVFFGGEGLFLTTLTGPGKVVMQSMDIAKLASSL 217
>gi|392396335|ref|YP_006432936.1| hypothetical protein Fleli_0680 [Flexibacter litoralis DSM 6794]
gi|390527413|gb|AFM03143.1| TIGR00266 family protein [Flexibacter litoralis DSM 6794]
Length = 255
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/232 (32%), Positives = 122/232 (52%), Gaps = 12/232 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
I ++I+G + Q+ ++ L P E V+A G+M +M ++ E E E G++ L
Sbjct: 6 IDYKIIGNDIQIVEVELDPNETVIAEAGAMLYMEENINFETKMGDGSEPEKGIFGKLMSA 65
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVN 131
G+++ G G V AAP + P+ LA L+ Q DAFLC+
Sbjct: 66 GARMITGESLFMTHFTQRGGEKGKVAFAAPYPGTVTPVQLASCNNNTLIVQKDAFLCAAL 125
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
K+S L++R + G EGF+ QKL G G+AF AGG++++K L GE + VD CIV
Sbjct: 126 GTKISIHLNRRFGSGFFGGEGFIMQKLQGDGMAFFHAGGTIIEKELN-GESLKVDTGCIV 184
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
A + + I+ G ++ GG+ L A ++G G V++QS+P +L Q ++
Sbjct: 185 AFEEGITMDIQKAGNLKSMFLGGEGLFLATLSGTGKVWLQSMPLSKLVQALS 236
>gi|224006938|ref|XP_002292429.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220972071|gb|EED90404.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 340
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/233 (29%), Positives = 115/233 (49%), Gaps = 2/233 (0%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
P D I I G E+Q+ I L+P + + A G+M +M+ + M G ++L
Sbjct: 74 PIDFDIASSIEGKESQIVTIRLEPNQVLRAESGAMMYMTSGVSMNTTTGGGMSAGFQRFL 133
Query: 78 FGKT--ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKV 135
G+ I+ GFV + + ++I+ + L +GG+++CQ A LC+ + V +
Sbjct: 134 TGQNVFISDYTYDGSNGPFGFVALGSDFPSKIVRLSLDEYGGKVICQKGALLCASHTVDI 193
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
++ G EGF+ Q LTG G + AGG++++K+L+ GE + V C+V
Sbjct: 194 QVEFTKKFSTGFFGGEGFVLQGLTGTGDVLVKAGGTLIRKDLKEGESLRVSSGCLVGFQQ 253
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
V + + +FGG+ L + GPG V++Q +P R+ IAR V S
Sbjct: 254 GVEYDVAMVPGFKNVLFGGEGLFLTTLKGPGTVWLQGMPPQRMISEIARRVPS 306
>gi|408673168|ref|YP_006872916.1| protein of unknown function DUF124 [Emticicia oligotrophica DSM
17448]
gi|387854792|gb|AFK02889.1| protein of unknown function DUF124 [Emticicia oligotrophica DSM
17448]
Length = 304
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 77/233 (33%), Positives = 122/233 (52%), Gaps = 14/233 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFM-----------SGSMEMENIYIPENEV 71
I ++I G + Q+ +I L P E V+A GSM FM GS ++++ +
Sbjct: 51 IDYKIHGEDIQIVEIELDPNETVIAEAGSMLFMEDGIMFETKMGDGSQPQQSLFGKLLQA 110
Query: 72 GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSV 130
G + L G++I N V AAP I P++LA + G L+ Q DAFLC+
Sbjct: 111 G-SRALMGESIFMTHFTNRSGIKRKVAFAAPYPGTIKPVNLANIMGNTLIVQKDAFLCAA 169
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
+S +QR+ G EGF+ +K+ G G+AFI +GG V+++ L E + +D C+
Sbjct: 170 MGTSISIHFNQRLGAGFFGGEGFILEKIQGDGMAFIHSGGVVIERQLN-NETLRIDTGCV 228
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
V + I+ +G ++ VFGG+ + A + G G V+IQSLP +L QR++
Sbjct: 229 VGFEPQLQFDIERSGSLKSMVFGGEGIFLATLRGTGKVWIQSLPISKLIQRLS 281
>gi|66361112|pdb|1YOX|A Chain A, Structure Of The Conserved Protein Of Unknown Function
Pa3696 From Pseudomonas Aeruginosa
gi|66361113|pdb|1YOX|B Chain B, Structure Of The Conserved Protein Of Unknown Function
Pa3696 From Pseudomonas Aeruginosa
gi|66361114|pdb|1YOX|C Chain C, Structure Of The Conserved Protein Of Unknown Function
Pa3696 From Pseudomonas Aeruginosa
gi|66361115|pdb|1YOX|D Chain D, Structure Of The Conserved Protein Of Unknown Function
Pa3696 From Pseudomonas Aeruginosa
gi|66361116|pdb|1YOX|E Chain E, Structure Of The Conserved Protein Of Unknown Function
Pa3696 From Pseudomonas Aeruginosa
gi|66361117|pdb|1YOX|F Chain F, Structure Of The Conserved Protein Of Unknown Function
Pa3696 From Pseudomonas Aeruginosa
Length = 250
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 76/233 (32%), Positives = 121/233 (51%), Gaps = 11/233 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
+ ++ILG Q +I L P E V+A G+ + +G + + G +W
Sbjct: 8 LDYRILGESXQTVEIELDPGETVIAEAGAXNYXTGDIRF-TARXGDGSDGSLLGKLWSAG 66
Query: 75 -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L G+++ N G V AAP ++ +DL GG L CQ D+FLC+
Sbjct: 67 KRKLGGESVFXTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 126
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+V +R+ G EGF+ QKL G GL F+ AGG+++++ L GE + VD C+VA
Sbjct: 127 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 185
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
T ++ ++ G ++ +FGG+ L+ + G G V++QSLPF RL+ RI A
Sbjct: 186 TDGIDYDVQLAGGLKSXLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 238
>gi|167628246|ref|YP_001678745.1| hypothetical protein HM1_0109 [Heliobacterium modesticaldum Ice1]
gi|167590986|gb|ABZ82734.1| conserved hypothetical protein [Heliobacterium modesticaldum Ice1]
Length = 240
Score = 113 bits (283), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 133/244 (54%), Gaps = 7/244 (2%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ +Q+LG Q I L+P E++ G+M +MS ++ M+ + + + L G+++
Sbjct: 1 MRYQVLGTTMQALNIDLEPGERIYTESGAMTWMSSNVRMDTNFKGGLLKSLGRALSGESL 60
Query: 83 TSVVLRN-PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
T GP++ +G + ++LP+++ G EL+CQ D+FLC+ V++S +
Sbjct: 61 TYTFFEAVGGPAN--IGFTPSAPGKVLPVEVER-GYELICQRDSFLCAQESVEMSIFFQR 117
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ G EGF+ QKL+G+G+AF+ G V++K L GE + VD + + S+V++ I
Sbjct: 118 KLGAGFFGGEGFIMQKLSGRGMAFVEIDGEVIEKKLAPGERLMVDTGHVAMIESTVSMDI 177
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
+ ++ A+FGG+ L +TGPG V+IQ+L L+ RI + RE+ +
Sbjct: 178 QMVKGMKNAIFGGEGLFLTSLTGPGRVWIQTLTLANLAGRI---LAHAGKRESSGGIGSL 234
Query: 262 AIFF 265
FF
Sbjct: 235 TDFF 238
>gi|374855197|dbj|BAL58060.1| hypothetical conserved protein [uncultured Chloroflexi bacterium]
Length = 250
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 84/248 (33%), Positives = 130/248 (52%), Gaps = 18/248 (7%)
Query: 7 STPFQP---YVYQSPQDAV-IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME 62
+TP P + QS + + ++I+G Q + L P E V + G M +MSG++ M+
Sbjct: 3 ATPLNPPDLEIAQSGRGVTGMEYKIIGSILQAVILELDPGETVYSESGGMAWMSGNIRMQ 62
Query: 63 NIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQ 122
+ L G++I V + G S GFV A+ +I+P+ L G E++CQ
Sbjct: 63 TATRGGLGGVFKRALSGESIFLVEFTSQG-SKGFVAFASDFPGKIIPLHLTE-GQEIICQ 120
Query: 123 PDAFLCSVNDVKVSNTLDQRVRNVIG----GIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
AFLC+ V +LD R +G G EGF+ QKL+G G+AF+ G +++ NL
Sbjct: 121 KQAFLCAEKTV----SLDIHFRRKLGVGLFGGEGFILQKLSGPGVAFVCLDGEIIEYNLG 176
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
GEV+ VD I SV+ I+ + +FGG+ L AV+ GPG V++QS+P
Sbjct: 177 PGEVLKVDTGHIGMYEPSVSFDIEMVKGVTNLLFGGEGLFLAVLRGPGRVWLQSMP---- 232
Query: 239 SQRIARAV 246
+Q +ARA+
Sbjct: 233 TQNLARAI 240
>gi|355677720|ref|ZP_09060487.1| hypothetical protein HMPREF9469_03524 [Clostridium citroniae
WAL-17108]
gi|354812806|gb|EHE97420.1| hypothetical protein HMPREF9469_03524 [Clostridium citroniae
WAL-17108]
Length = 225
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 120/228 (52%), Gaps = 3/228 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++ILG ++ L+ E + + G M +MS + +++ G+ + G+++
Sbjct: 1 MDYRILGETLPAVEVRLQTGEAMYTQSGGMAWMSDGLALDSNIKGGLMKGIGRMFTGESL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
P D F+ A+ +ILP+D+ L+CQ AFLC+ V+VS L ++
Sbjct: 61 FMATYTASRP-DCFIAFASTVPGKILPVDMGKT--SLICQKGAFLCAQPTVEVSTVLTKK 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ Q+L G G+AF+ G V+++ L GEVI VD + A +V+ +I+
Sbjct: 118 LSAGFFGGEGFILQQLKGSGMAFLEVDGDVIERTLAPGEVIKVDTGNVFAFEPTVSYEIE 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
++ +FGG+ L +TGPG +++Q++ + RIA+ + S N
Sbjct: 178 TIKGVKNILFGGEGLFLTKLTGPGKIYMQTMNIAEFTGRIAKGLPSSN 225
>gi|432330883|ref|YP_007249026.1| TIGR00266 family protein [Methanoregula formicicum SMSP]
gi|432137592|gb|AGB02519.1| TIGR00266 family protein [Methanoregula formicicum SMSP]
Length = 232
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 121/230 (52%), Gaps = 2/230 (0%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV 85
+I+G Q+ +I L P E + A G+M M+GS ME+ G+ + + G+++ +
Sbjct: 4 EIIGSNLQMVKIDLLPGEGIFAEAGAMVNMTGSFTMESQLKGGILSGIKRAVVGESVF-L 62
Query: 86 VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
+PG GFV A +I P+ ++ E + + D+FLC V++ ++++
Sbjct: 63 TRFSPGAGAGFVSFAGTMPGKIFPVSVSP-DREFISRKDSFLCCHEGVELDIAFTRKIQA 121
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
+ G EGF+ Q++TGQG F+ G +++ +L+ GEV+ V +V +V+ I G
Sbjct: 122 GLFGGEGFVLQRMTGQGTVFLHCCGDIIELDLKPGEVVKVQTGLVVGFDHTVSYDIALAG 181
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENP 255
I A+FGG+ L +TGPG V +QS+ +++ + + P ++ P
Sbjct: 182 GITTALFGGEGLFVTTLTGPGKVILQSMNLEKIAASLIPYLPKPEVKVKP 231
>gi|219129648|ref|XP_002184995.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217403490|gb|EEC43442.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 225
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 118/225 (52%), Gaps = 2/225 (0%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--IT 83
++ G E+Q+ + L+P + + A G+M +M+ ++M G + L G+ I+
Sbjct: 1 KVEGNESQILTVALEPGQVLRAESGAMMYMTDGVQMNTTSGGGLSSGFQRMLTGQNFFIS 60
Query: 84 SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
+ G V + ++I+ ++L +GG+++CQ A LC+ + V + + +
Sbjct: 61 DYSFEGEPGTVGHVALGTDFPSKIVRLNLEEYGGKIVCQKGALLCASHTVDIQMEYSKNM 120
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
G EGF+ Q L+G+G F+ AGG++++++L+ GE + + C+VA T+ V+ ++
Sbjct: 121 STGFFGGEGFVLQGLSGEGDVFVKAGGTLIRRDLKEGEQLRISSGCLVAFTNGVDYDVQM 180
Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
GG+ L +TGPG+V++Q P R+ IAR V S
Sbjct: 181 VKGFANVFAGGEGLFMTTLTGPGVVWLQGQPAQRMISEIARRVPS 225
>gi|289674960|ref|ZP_06495850.1| hypothetical protein PsyrpsF_16953, partial [Pseudomonas syringae
pv. syringae FF5]
Length = 267
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 99/193 (51%), Gaps = 10/193 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 70 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 129
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 130 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 189
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 190 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 249
Query: 193 VTSSVNVQIKYNG 205
+T +V+ I+ G
Sbjct: 250 MTQTVDYDIRMVG 262
>gi|397626028|gb|EJK68002.1| hypothetical protein THAOC_10872 [Thalassiosira oceanica]
Length = 392
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 69/233 (29%), Positives = 117/233 (50%), Gaps = 2/233 (0%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
P D + G E+Q+ I L+P + + A G+M +MS + M G+ + L
Sbjct: 129 PIDFDTSTTVEGRESQIATIRLEPGQVLRAESGAMMYMSQGVNMNTTTGGGLGAGLKRAL 188
Query: 78 FGKT--ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKV 135
G+ I+ G G V + ++I+ ++L ++GG+++CQ A LC+ + V +
Sbjct: 189 TGQNLFISDYTYDGSGGPFGIVALGTDFPSKIVRLNLQVYGGKIVCQQGALLCASHTVDI 248
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+++ G EGF+ Q LTG G + AGG++++K+L GE + + C+V
Sbjct: 249 DIEFTKKLSTGFFGGEGFVLQGLTGTGDVMVKAGGTLIRKDLREGEELRISSGCLVGYQD 308
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
V IK + +FGG+ L +V+ GPG V++Q P R+ IAR V S
Sbjct: 309 GVEFDIKMLDGYKNVLFGGEGLFVSVLRGPGTVWLQGQPPQRMVSEIARRVPS 361
>gi|348676356|gb|EGZ16174.1| hypothetical protein PHYSODRAFT_509459 [Phytophthora sojae]
Length = 322
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 68/221 (30%), Positives = 116/221 (52%), Gaps = 1/221 (0%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
I G ++QV I L+P++ + A G+M +M+ +EME GM + + G+
Sbjct: 3 IEGHDSQVVHIELEPEQCLRAETGAMIYMTEGVEMETTTAGGFGEGMKRMMTGENFFVSR 62
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRN 145
G G V + ++I+ + L+ F GE ++CQ AFLC + V + ++
Sbjct: 63 FTYHGSEKGKVALGTSFPSKIVHLRLSDFLGESIICQKGAFLCGSDTVNIEMEFAKKFGV 122
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
G EGF+ Q+LTG G + A G+++++ L+ GEV+ + C+VA SV+ I
Sbjct: 123 GFFGGEGFILQRLTGSGDVLVRASGTLIERELQPGEVLRISSGCLVAFEPSVHYDITMLK 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+ +FGG+ L +TGPG +++QSLPF R+ +A +
Sbjct: 183 GAKNVLFGGEGLFVTTLTGPGKIYLQSLPFDRVVGEMASRI 223
>gi|422405471|ref|ZP_16482514.1| hypothetical protein Pgy4_16829, partial [Pseudomonas syringae pv.
glycinea str. race 4]
gi|330880405|gb|EGH14554.1| hypothetical protein Pgy4_16829 [Pseudomonas syringae pv. glycinea
str. race 4]
Length = 307
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/193 (32%), Positives = 99/193 (51%), Gaps = 10/193 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LFG
Sbjct: 109 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 168
Query: 80 -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
+++ + V G G V AAP ILP++L FGG+L+CQ D+FL
Sbjct: 169 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 228
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V + +++ + G EGF+ QKL G G F+ GG+V + L GE + VD C+ A
Sbjct: 229 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 288
Query: 193 VTSSVNVQIKYNG 205
+T +V+ I+ G
Sbjct: 289 MTQTVDYDIRMVG 301
>gi|163792500|ref|ZP_02186477.1| hypothetical protein BAL199_16673 [alpha proteobacterium BAL199]
gi|159182205|gb|EDP66714.1| hypothetical protein BAL199_16673 [alpha proteobacterium BAL199]
Length = 236
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 111/213 (52%), Gaps = 2/213 (0%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-MWQWLFGKT 81
I + I G + + I L P E V A G+M M +EM + +P G + + + G+T
Sbjct: 6 IDYTIEGDFSPMLTITLDPGEAVQAEAGTMVMMEPDIEM-STEMPGGFFGSILRKVSGET 64
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
N G + V A+P +I P+DLA GG CQ A+L S + +S L +
Sbjct: 65 FFMTFFTNTGRTRRSVSFASPMPGQIRPLDLAAEGGAFYCQRRAYLASAKGIGISIALTK 124
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
R+ + + G EG + QKL G G AF+ AGG++V++ L GE I+V +V +++ + I
Sbjct: 125 RLSSGLFGGEGLILQKLEGDGWAFLAAGGTLVERQLAKGEKISVSTGSLVGFSATCDYDI 184
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
I+ +FGG+ L VTGPG V +Q+ P
Sbjct: 185 SIQRGIKNMLFGGEGLFVTHVTGPGKVVVQTQP 217
>gi|303271207|ref|XP_003054965.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462939|gb|EEH60217.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 247
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 70/224 (31%), Positives = 122/224 (54%), Gaps = 16/224 (7%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENEVGMWQWLFGKT 81
Q++G +AQ+ Q+ L P E + A PG+MC+MS ++ +E ++ G + L G+
Sbjct: 15 QVMGTDAQLVQVELSPGESLSAEPGAMCYMSANVRSITRLEGGFL----AGCSRLLAGEP 70
Query: 82 ITSVVLRNPGP--SDGFVGIAAPSLA-RILPIDLAMFGGELLCQPDAFLCSVNDVKV--S 136
+N G SDG++ +A +I+ +D+ G E LC D++LCS V++ +
Sbjct: 71 FFLNNFQNVGKDGSDGYIALAGRREGDKIIVLDMDQLGNEFLCARDSYLCSKGTVEIGAA 130
Query: 137 NTLDQR---VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+TL Q +R + QKL+G G+A I G+V+++ L+ G+ + VD +V
Sbjct: 131 STLMQGQMGLRLFLSNQNTVFMQKLSGSGVACISGNGTVIRQELKPGQEMVVDARAVVGF 190
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
+S+ Q++ A+FGG+ L A ++GPG+ ++QSLP R
Sbjct: 191 QNSIGYQLRMMSSPLAALFGGEGLFFARLSGPGVFYLQSLPAAR 234
>gi|294667625|ref|ZP_06732839.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292602620|gb|EFF46057.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 167
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 52/140 (37%), Positives = 85/140 (60%)
Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
+L + L+ GG+L+CQ D+FL V + +++ + G EGF+ QKL G G F+
Sbjct: 1 MLALRLSEHGGQLICQKDSFLAGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFV 60
Query: 167 LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPG 226
AGG+VV++ L GE I VD C+VA + V++ ++ ++ FG + + A +TGPG
Sbjct: 61 HAGGTVVERELGPGERIDVDTGCVVAYHAGVDMDVRRVAGLKSMFFGSEGVFLATLTGPG 120
Query: 227 IVFIQSLPFHRLSQRIARAV 246
V++QSLPF R++ R+ +A
Sbjct: 121 KVWLQSLPFSRMAGRMLQAA 140
>gi|239623794|ref|ZP_04666825.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
gi|239521825|gb|EEQ61691.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
Length = 225
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 119/228 (52%), Gaps = 3/228 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++ILG V ++ L E + + G M +MS + +++ G+ + G+++
Sbjct: 1 MDYRILGETLPVVEVRLNAGEAMYTQSGGMAWMSEGLTLDSNIKGGLMKGIGRMFTGESL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
P D + A+ RILP+D++ L+CQ AFLC+ V+VS ++
Sbjct: 61 FMATYTAVRP-DCIIAFASTVPGRILPVDMSRT--SLICQKGAFLCAQPTVEVSTVFTKK 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ Q+L G G+AF+ G VV+K L GEV+ VD + A +V+ +I+
Sbjct: 118 LSAGFFGGEGFILQQLKGSGMAFLEVDGDVVEKTLAPGEVMKVDTGNVFAFEPTVSYEIE 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
++ +FGG+ L +TGPG +++Q++ + RIA+ + + N
Sbjct: 178 TIKGVKNMLFGGEGLFLTRLTGPGKIYMQTMNIAEFTGRIAQGLPTSN 225
>gi|116624027|ref|YP_826183.1| hypothetical protein Acid_4939 [Candidatus Solibacter usitatus
Ellin6076]
gi|116227189|gb|ABJ85898.1| protein of unknown function DUF124 [Candidatus Solibacter usitatus
Ellin6076]
Length = 259
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 83/241 (34%), Positives = 128/241 (53%), Gaps = 10/241 (4%)
Query: 17 SPQDAVIP--FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENE 70
+P AV P + I G QV ++ LK ++V A G M + + +++ M I E
Sbjct: 6 APAPAVTPLDYTIQGDNLQVARVRLKAGQEVYAEAGKMVYKTANVQWDTRMSGATIGEKL 65
Query: 71 VG-MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
+G + + + G+++ R G DG VG A RI +LA G L+ Q DAFL +
Sbjct: 66 MGALRRTVTGESLFLTYFRANG--DGEVGFAGNYPGRIQVFELAA-GRSLIAQRDAFLFA 122
Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
V+ + L +++ G EGF+ +K TG G FI AGG V+ +L G+VI VD C
Sbjct: 123 QPTVQFNIALVKKLGAGFFGGEGFILEKFTGPGAVFIHAGGDFVEFDLAPGQVIQVDTGC 182
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
IVA +V+ I+ G I+ A+FGG+ L A +TGPG V IQS+ +++ + +A T
Sbjct: 183 IVAFDETVDYDIQLAGGIKTALFGGEGLFLATLTGPGKVVIQSMTLNKMRRELAPGRTGG 242
Query: 250 N 250
+
Sbjct: 243 D 243
>gi|255080806|ref|XP_002503976.1| predicted protein [Micromonas sp. RCC299]
gi|226519243|gb|ACO65234.1| predicted protein [Micromonas sp. RCC299]
Length = 540
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 68/220 (30%), Positives = 115/220 (52%), Gaps = 8/220 (3%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV 85
+ILG +AQ+ Q+ LK E++ A PG+MC+MS ++ G+ + L G+
Sbjct: 308 RILGTDAQMVQVDLKAGEEMSAEPGAMCYMSSNVRSITSLQGGVFAGISRVLAGEPFFLN 367
Query: 86 VLRNPGPS--DGFVGIAAPSLA-RILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL--- 139
+N DG++ +A +I+ +DL FGGE+LC D++LCS +V +
Sbjct: 368 SFKNIANDGRDGYIALAGRREGDKIIVLDLDAFGGEILCARDSYLCSKGEVSIDAATTFT 427
Query: 140 --DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+R + QKL+G GLA I G+V++++L G+ + VD + A + ++
Sbjct: 428 RGQMGLRLFLSNQNTIFMQKLSGSGLACISGNGTVIRQDLLEGQEMVVDARAVCAFSKTI 487
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
Q++ A+FGG+ L A ++GPG ++QSLP R
Sbjct: 488 GYQLRLMSSPLAAIFGGEGLFFARLSGPGTFYLQSLPAAR 527
>gi|301122111|ref|XP_002908782.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099544|gb|EEY57596.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 322
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 70/224 (31%), Positives = 116/224 (51%), Gaps = 5/224 (2%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
I G ++QV I L+P++ + A G+M +M+ +EME GM + + G+
Sbjct: 3 IEGHDSQVVHIELEPEQCLRAETGAMIYMTDGVEMETTTAGGLGEGMKRMMTGENFFVSR 62
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRN 145
G G V + ++I+ + L F GE ++CQ AFLC + V + ++
Sbjct: 63 FTYHGSEKGKVALGTSFPSKIVHLRLNDFLGESIICQKGAFLCGSDTVNIEMEFAKKFGV 122
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
G EGF+ Q+LTG G + A G+++++ L+ GEV+ + C+VA SV+ I
Sbjct: 123 GFFGGEGFILQRLTGSGDVLVRATGTLIERELQPGEVLRISSGCLVAFEPSVHYDITMMK 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR----LSQRIARA 245
+ +FGG+ L +TGPG +++QSLPF R ++ RI R
Sbjct: 183 GAKNVLFGGEGLFVTTLTGPGKIYLQSLPFERVVGEMAARIPRG 226
>gi|219850466|ref|YP_002464899.1| hypothetical protein Cagg_3626 [Chloroflexus aggregans DSM 9485]
gi|219544725|gb|ACL26463.1| protein of unknown function DUF124 [Chloroflexus aggregans DSM
9485]
Length = 316
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 123/236 (52%), Gaps = 2/236 (0%)
Query: 9 PFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPE 68
P P V + + F+I+G Q + + P + V + G M +MS +++M+ +
Sbjct: 60 PEPPVVGSGVGASGLRFKIIGTTMQAVVLEVPPGQTVFSERGGMSWMSANVQMQT-NMEG 118
Query: 69 NEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
G ++ +F +V P G +G +A +I+P++LA G ++CQ DAF+C
Sbjct: 119 GLGGAFKRMFSGESIFMVNFTPQGGPGIIGFSAEFPGKIVPLNLAP-GQVMICQKDAFMC 177
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
+ V + +R+ + G EGF+ QKLTG GLAF+ G +++ LE +++ VD
Sbjct: 178 AERSVSLDIHFRRRLGAGLFGGEGFIMQKLTGPGLAFVELDGEIIEYTLEANQMLKVDTG 237
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ +V + I+ + +FGG+ L + GPG V++Q++P L+++IA+
Sbjct: 238 HVAMYEPTVQLDIEMVRGFKNILFGGEGLFLTTLRGPGRVWLQTMPAMNLAKKIAQ 293
>gi|374628526|ref|ZP_09700911.1| protein of unknown function DUF124 [Methanoplanus limicola DSM
2279]
gi|373906639|gb|EHQ34743.1| protein of unknown function DUF124 [Methanoplanus limicola DSM
2279]
Length = 238
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 65/221 (29%), Positives = 115/221 (52%), Gaps = 5/221 (2%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT- 81
+ ++I+G Q + + P E + A G+M +MSG++ ME+ G+ + L G++
Sbjct: 1 MDYKIIGDNLQFVNLAINPGELIYAEAGAMVYMSGNINMESKLQGGLFTGIKRKLAGESF 60
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ S GP + G P +I+ D+ G+L+ Q DAFLC+ VK +
Sbjct: 61 MVSHFTSTGGPGNVAFGGNCP--GKIITRDMG--DGDLIVQKDAFLCAEESVKWEIAFQK 116
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ + G EG + + L+G GL F A G +V+ +L+ G+V V + +V +V I
Sbjct: 117 KLGSTFFGGEGLILEHLSGSGLVFCHAAGDIVEIDLKAGQVYKVSTAHVVGWEPTVQYDI 176
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ G ++ A+FGG+ +TGPG V+IQS+ +L+ +
Sbjct: 177 QAAGGVKTALFGGEGFFVTTLTGPGRVYIQSMTLPQLANAL 217
>gi|452077090|gb|AGF93060.1| protein containing DUF124 [uncultured organism]
Length = 253
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/224 (29%), Positives = 116/224 (51%), Gaps = 2/224 (0%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G Q+ + L PQ++V GSM +MSG++EME G+ + L G++
Sbjct: 10 MKYKIDGDNLQMVTLQLGPQDEVYGEAGSMKYMSGNVEMETKGKGGFMKGVKRKLSGESY 69
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ DG VG A + +I +DL+ G + Q DAFLC+ V + ++
Sbjct: 70 FLTHFKTK-SGDGLVGFAGNTPGKIKTLDLSS-GKNYIAQQDAFLCAEESVDLDIAFQKK 127
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ QKL+G G AFI G ++ KNL+ G+V+ V + A +V+ I+
Sbjct: 128 LGAGFFGGEGFILQKLSGNGKAFIHITGDMMTKNLKEGQVLKVSTGNVAAFEDTVDYDIQ 187
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
G ++ ++ G+ + + GPG V++QS+ L+ + + V
Sbjct: 188 RTGGVKSSLLSGEGIFMTTLRGPGKVWLQSMTLRDLAAELGKYV 231
>gi|397779696|ref|YP_006544169.1| hypothetical protein BN140_0530 [Methanoculleus bourgensis MS2]
gi|396938198|emb|CCJ35453.1| putative protein M6_Spy0233 [Methanoculleus bourgensis MS2]
Length = 229
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 113/217 (52%), Gaps = 3/217 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G Q+ + L P E A G+M MSG+M+M G+ + + G+++
Sbjct: 1 MQYEITGDNLQMVTLRLAPGEAACAEAGAMVNMSGNMQMSTNMKGGLFGGLKRMVTGESL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
P +GFV A +I +DL GE + Q DAFLCS + + +R
Sbjct: 61 FMTEF-TPKGGEGFVSFAGNVPGKIFTLDLTE--GEFIAQKDAFLCSEQGIDLDVAFAKR 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+R+ + G EGF+ Q+L+G+G AF+ G +++ L GE++ V+ +V S+V+ I
Sbjct: 118 LRSGVFGGEGFILQRLSGRGTAFLHCCGDIMEMTLAPGEMVRVETGLVVGFESTVDYNIA 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
G ++ FGG+ L +TGPG V +QS+ +L+
Sbjct: 178 LAGGVKTVFFGGEGLFLTTLTGPGRVVLQSMNIAKLA 214
>gi|219117842|ref|XP_002179708.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408761|gb|EEC48694.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 298
Score = 104 bits (260), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 74/233 (31%), Positives = 116/233 (49%), Gaps = 13/233 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME---NIYIPEN---EVGMWQWLF 78
+ I G + QV + P E+VV GS FM GS++M+ + + G+ + L
Sbjct: 55 WMISGHDMQVLTTTVPPGEQVVTEIGS--FMYGSVDMKTSVELTLCSRVGFAGGLSRILG 112
Query: 79 GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF---GGELLCQPDAFLCSVNDVKV 135
G++ V+L N +G+VGI A+I+PI L+ Q A++ + DV V
Sbjct: 113 GESCVKVMLTNASNEEGYVGITPNFPAKIVPIKFGHHISPDHALISQGGAYMSHLGDVNV 172
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTG--QGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
LD +R G RQK+TG + +AF+ AGG+++ + L E ITVD +VA+
Sbjct: 173 GCDLDASLRTCCCAGFGCFRQKITGSDESIAFLAAGGTLIYRYLSENETITVDSRSVVAM 232
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+V + I NG I GG+ + +TGPG VF+QS+ F + + V
Sbjct: 233 EETVTLGIAPNGRICMCCLGGEGCFSTTLTGPGKVFMQSMNFQKFRDAVQTTV 285
>gi|326434560|gb|EGD80130.1| hypothetical protein PTSG_10404 [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/216 (29%), Positives = 116/216 (53%), Gaps = 1/216 (0%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTIT 83
+QI+G +Q+ +++L P + V A PG+M MS ++ + ++ + G++
Sbjct: 150 WQIVGDGSQILEVVLNPNDAVTAEPGTMVHMSSGLKPDVHLGGGCGQACTRSCCAGESFF 209
Query: 84 SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
V +N + +G+ A+I+P++LA FGG + + AF+C+ N + +
Sbjct: 210 RVKYQNNTETTQHIGLTPNFPAKIVPVNLAEFGGRITIKSRAFMCAHNCEPTFSYRFAGI 269
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
G +GF+ L+G G+ F+ A G+V+ + LE GE + D S +VA S+VN ++
Sbjct: 270 GAGCCGGQGFILNDLSGDGMVFLNASGTVLMRKLEAGEELICDQSSVVAFQSTVNYAVRR 329
Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
G + GG+ L +V+TGPG V +QS+P +++
Sbjct: 330 AGGLLMCCCGGEGLFNSVLTGPGTVVLQSMPIEKIA 365
>gi|429761339|ref|ZP_19293767.1| TIGR00266 family protein [Anaerostipes hadrus DSM 3319]
gi|429184164|gb|EKY25195.1| TIGR00266 family protein [Anaerostipes hadrus DSM 3319]
Length = 243
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 68/228 (29%), Positives = 119/228 (52%), Gaps = 12/228 (5%)
Query: 17 SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
S ++ ++ ++ILG V L+ EK+V G M +MS +M+ME G
Sbjct: 11 SKEENIMRYEILGETLPVVICHLESGEKMVTEGGGMSWMSPNMKMET-----TSNGGIGK 65
Query: 77 LFGKTIT-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVN 131
+FG+ + + +N + G G+ A A P + F G E++ Q A+L N
Sbjct: 66 IFGRAFSGEKMFQNVYTAQGGEGMIA--FASCFPGSIKAFEIAPGQEMILQKSAYLAGEN 123
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++S ++++ + G EGF+ QK +G G+ F G VV+ LE GE I VD +
Sbjct: 124 GVELSAFFNKKIGAGLFGGEGFIMQKASGHGIMFAEFDGHVVEYELEAGEQIVVDTGHLA 183
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
A+T+S ++IK ++ +FGG+ + ++TGPG V++Q++P ++
Sbjct: 184 AMTASCTMEIKSVPGVKNMLFGGEGVFNTILTGPGKVWLQTMPISNVA 231
>gi|307352226|ref|YP_003893277.1| hypothetical protein Mpet_0058 [Methanoplanus petrolearius DSM
11571]
gi|307155459|gb|ADN34839.1| protein of unknown function DUF124 [Methanoplanus petrolearius DSM
11571]
Length = 230
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 65/225 (28%), Positives = 117/225 (52%), Gaps = 13/225 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW----LF 78
+ + I G Q + ++ E V A G+M +MS +MEM+ E E G+++ L
Sbjct: 1 MKYSITGDNLQFVTMEIENAETVWAEAGAMVYMSANMEMK----AELEGGLFKGIKRKLA 56
Query: 79 GKT-ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
G++ + + + GP G V + +I +D+A G E + Q DAFLC+ V
Sbjct: 57 GESFMVTNFTSHGGP--GVVAFGGNAPGKIYEMDIA--GKEFIAQKDAFLCAEKSVNWEI 112
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ + G EG + +K++G G+ F A G +V+ +L+ G+ + + +V SSV
Sbjct: 113 AFQKKLGSTFFGGEGLILEKISGSGMVFFHACGDLVEIDLKPGQTYKIATAHVVGWESSV 172
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
N I+ G I+ ++FGG+ +TGPG + IQS+ +L++ +
Sbjct: 173 NYDIQAAGGIKTSMFGGEGFFLTTLTGPGKIVIQSMTLEQLAESL 217
>gi|167766978|ref|ZP_02439031.1| hypothetical protein CLOSS21_01495 [Clostridium sp. SS2/1]
gi|167710953|gb|EDS21532.1| TIGR00266 family protein [Clostridium sp. SS2/1]
Length = 243
Score = 102 bits (254), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 68/228 (29%), Positives = 119/228 (52%), Gaps = 12/228 (5%)
Query: 17 SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
S ++ ++ ++ILG V L+ EK+V G M +MS +M+ME G
Sbjct: 11 SKEENIMRYEILGETLPVVICYLESGEKMVTEGGGMSWMSPNMKMET-----TSNGGIGK 65
Query: 77 LFGKTIT-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVN 131
+FG+ + + +N + G G+ A A P + F G E++ Q A+L N
Sbjct: 66 IFGRVFSGEKMFQNVYTAQGGKGMIA--FASCFPGSIKAFEIAPGQEMILQKSAYLAGEN 123
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V++S ++++ + G EGF+ QK +G G+ F G VV+ LE GE I VD +
Sbjct: 124 GVELSAFFNKKIGAGLFGGEGFIMQKASGHGIMFAEFDGHVVEYELEAGEQIVVDTGHLA 183
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
A+T+S ++IK ++ +FGG+ + ++TGPG V++Q++P ++
Sbjct: 184 AMTASCTMEIKSVPGVKNMLFGGEGVFNTILTGPGKVWLQTMPISNVA 231
>gi|339009165|ref|ZP_08641737.1| hypothetical protein BRLA_c29840 [Brevibacillus laterosporus LMG
15441]
gi|421873566|ref|ZP_16305178.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
gi|338773643|gb|EGP33174.1| hypothetical protein BRLA_c29840 [Brevibacillus laterosporus LMG
15441]
gi|372457353|emb|CCF14727.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
Length = 240
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 65/224 (29%), Positives = 119/224 (53%), Gaps = 8/224 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ + I+G Q Q L+P E++ GSM +MS +++M+ + + + G+++
Sbjct: 1 MEYHIMGSTMQALQFNLQPGERLFTESGSMIWMSENIKMDTSFKGGMLKSLGRAFSGESL 60
Query: 83 TSV---VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
T L PG + A + +I+P+ + G E++ Q AFL ++ VS
Sbjct: 61 TFTFFEALHTPG----MIAFAPAAPGKIIPVTIHP-GNEIIAQKHAFLVGTENIDVSIHF 115
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ G EGF+ Q+L+G+G+ F+ G V + NL+ GEVI VD + + A SSV++
Sbjct: 116 QKKLGTGFFGGEGFIMQRLSGRGMVFLEIDGEVAEMNLQHGEVIKVDTAHVAAYESSVDM 175
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
I+ I+ +FGG+ L + GPG +++Q++ L+ +IA
Sbjct: 176 SIERVKGIKNIIFGGEGLFLTTLRGPGKIWLQTMTIAGLAGKIA 219
>gi|403389619|ref|ZP_10931676.1| hypothetical protein CJC12_17781 [Clostridium sp. JC122]
Length = 228
Score = 101 bits (251), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 72/221 (32%), Positives = 113/221 (51%), Gaps = 8/221 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT- 83
++I GG V L E VV+ G+M +MS ENI + N G G+ ++
Sbjct: 3 YEIKGGNMPVAICTLDQGETVVSESGAMGWMS-----ENIVMDTNMKGGLFGGIGRALSG 57
Query: 84 -SVVLRNPGPSDGFVGIAAP-SLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
S+ L ++G IA P S+ L + G L+CQ AFL + V+ L +
Sbjct: 58 DSLFLNTFTCTNGQGTIAFPCSVPGKLLVKNLYQGQTLICQKGAFLAAETSVEFKIHLKK 117
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
R+ + + G EGF+ Q++TG G+ F+ G V + +L GE I VD + A S+VN I
Sbjct: 118 RISSGLFGGEGFILQRMTGPGIVFLEFDGHVEEMDLSPGETIKVDTGHVAAFESTVNFDI 177
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ ++ +FGG+ L +TGPG V++Q++P L+ RI
Sbjct: 178 EMVKGVKNMLFGGEGLFLTTLTGPGRVYLQTMPISNLASRI 218
>gi|153813867|ref|ZP_01966535.1| hypothetical protein RUMTOR_00073 [Ruminococcus torques ATCC 27756]
gi|317502002|ref|ZP_07960186.1| hypothetical protein HMPREF1026_02130 [Lachnospiraceae bacterium
8_1_57FAA]
gi|331088180|ref|ZP_08337100.1| hypothetical protein HMPREF1025_00683 [Lachnospiraceae bacterium
3_1_46FAA]
gi|336440015|ref|ZP_08619614.1| hypothetical protein HMPREF0990_02008 [Lachnospiraceae bacterium
1_1_57FAA]
gi|145848263|gb|EDK25181.1| TIGR00266 family protein [Ruminococcus torques ATCC 27756]
gi|316896682|gb|EFV18769.1| hypothetical protein HMPREF1026_02130 [Lachnospiraceae bacterium
8_1_57FAA]
gi|330408836|gb|EGG88299.1| hypothetical protein HMPREF1025_00683 [Lachnospiraceae bacterium
3_1_46FAA]
gi|336014364|gb|EGN44213.1| hypothetical protein HMPREF0990_02008 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 225
Score = 100 bits (249), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 120/223 (53%), Gaps = 4/223 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I GGE V + E ++ GSM +MS +M+ME + + G+ I
Sbjct: 1 MKYEIKGGELPVVVCFMDAGETMITEKGSMSWMSPNMKMETTSNGGVGKALGRMFAGEAI 60
Query: 83 -TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ GP G + A+ +I P ++A G E++ Q FL + V++S +
Sbjct: 61 FQNRYTAEGGP--GMIAFASSFPGQIRPWEIAP-GKEIVVQKAGFLAAQESVELSIFFQK 117
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ + + G EGF+ QK++G GLAF+ GSV++ L+ G+ I VD + A+ ++ N++I
Sbjct: 118 KLGSGLFGGEGFIMQKISGHGLAFLEFDGSVIEYVLQPGQQIVVDTGYLAAMEATCNMEI 177
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ I+ +FGG+ L V+TGPG V++Q++P ++ I +
Sbjct: 178 RTVPGIKNMMFGGEGLFNTVITGPGRVWLQTMPISNVAGAIMK 220
>gi|424513771|emb|CCO66393.1| Protein containing DUF124 [Bathycoccus prasinos]
Length = 558
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 117/239 (48%), Gaps = 23/239 (9%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGS-MEMENIYIPENEVGMWQWLFGKTITSV 85
+ G +AQ+ Q+ LKP E V A PG+MC+MS + + + ++ + + L G+
Sbjct: 313 LFGTDAQIVQMKLKPGESVNAEPGAMCYMSSNVLSITSLGPGGLSKAIARVLAGEPFFIN 372
Query: 86 VLRNPGPSDGFVGIAAPSLA-RILPIDLAMFGGELLCQPDAFLCSVNDVKVS-------- 136
RN G SDG++ + + +I ++L +LLC D++LCS+ D+ VS
Sbjct: 373 TFRNLGSSDGYIALGSLRQGDKIAVVNLKEREADLLCARDSYLCSMGDISVSAAVSFSQE 432
Query: 137 --------NTLDQRVRNVI-----GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
L QR R + G F + L G G A I G+V+++ L+ GE +
Sbjct: 433 ARGGRTLSQVLKQRFRTGVRLILSKGNNIFNGEWLVGDGTACITGKGTVMRRTLKEGEEM 492
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
VD ++A+ ++ ++ AVFGG+ L + GPG ++QSLP + +++
Sbjct: 493 VVDARAVLALERTIEYDLELVSSPVAAVFGGEGLFHVRLKGPGGFYLQSLPIDKAKKKL 551
>gi|357053067|ref|ZP_09114170.1| hypothetical protein HMPREF9467_01142 [Clostridium clostridioforme
2_1_49FAA]
gi|355386046|gb|EHG33087.1| hypothetical protein HMPREF9467_01142 [Clostridium clostridioforme
2_1_49FAA]
Length = 225
Score = 100 bits (249), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 114/224 (50%), Gaps = 3/224 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ +++LG ++ L+ E + + G M +MS +++ G+ + G+++
Sbjct: 1 MDYRMLGDILPAVEVRLQAGEAMYTQSGGMAWMSDGFTLDSNVKGGLMKGLGRMFSGESL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
P D + A+ +IL ID A L+CQ AFLC+ V+++ L ++
Sbjct: 61 FMATYTASRP-DSVIAFASTVPGKILAIDTAKT--SLICQKGAFLCAQTTVEINTVLTKK 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
G EGF+ Q++ G G+AF+ G VV++ L GEVI VD + A SV+ +I+
Sbjct: 118 FTAGFFGGEGFILQQIQGSGMAFLEVDGDVVERVLAPGEVIKVDTGNVFAFEPSVSYEIE 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ +FGG+ L +TGPG V++Q++ + RIA+ +
Sbjct: 178 TMKGVKNILFGGEGLFLTKLTGPGKVYMQTMNIAEFTGRIAQGL 221
>gi|78044630|ref|YP_360947.1| hypothetical protein CHY_2133 [Carboxydothermus hydrogenoformans
Z-2901]
gi|77996745|gb|ABB15644.1| conserved hypothetical protein TIGR00266 [Carboxydothermus
hydrogenoformans Z-2901]
Length = 228
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 67/221 (30%), Positives = 113/221 (51%), Gaps = 8/221 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI-- 82
F+I+G V + L EKV + G M +MS EM+ G+ + L G+T+
Sbjct: 3 FKIVGETLPVVILTLGKGEKVFTQAGGMAWMSEGFEMQTNMEGGLLKGIARKLAGETLFM 62
Query: 83 -TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
T R+ G + A+ +I+P++L G ++CQ AFLC + VK+ +
Sbjct: 63 TTYTCTRDSGE----IAFASSFPGKIIPLELKA-GQSIICQKKAFLCGESSVKLEIFFRK 117
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ + + G EGF+ +K+TG GL F+ G+VV+ L +++ VD + +V +I
Sbjct: 118 KIGSGLFGGEGFILEKVTGPGLCFVEIDGAVVEYELGSNDILKVDQGHVAMFEPTVTFEI 177
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ +FGG+ L A + GPG V++QSLP L+ R+
Sbjct: 178 EMVKGFTNVLFGGEGLFLATLKGPGKVWLQSLPIENLAARL 218
>gi|291562839|emb|CBL41655.1| conserved hypothetical protein TIGR00266 [butyrate-producing
bacterium SS3/4]
Length = 228
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/223 (29%), Positives = 111/223 (49%), Gaps = 3/223 (1%)
Query: 23 IPFQILGGEAQVCQIMLK-PQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ ++I+G +I P E + + G M +MS + M G+ + G++
Sbjct: 1 MKYRIIGTTIPAVEIKFDAPGETMYTQSGGMSWMSEGISMSTNTRGGFMKGLGRMFAGES 60
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ R P + A+ ++LPID A GG ++CQ AFLC+ + V + T +
Sbjct: 61 MFMATYRADAPG-ATIAFASTVPGQVLPIDAAENGG-MICQKGAFLCAQDSVNLDVTFTK 118
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + + G EGF+ + ++G G+AF+ G VV+K L GEVI VD IVA +V ++
Sbjct: 119 KFTSGLFGGEGFILESISGSGMAFLEIDGDVVEKQLAPGEVIKVDTGNIVAFEKTVKYEV 178
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ + FGG+ L +TGPG V +Q+ + RIA+
Sbjct: 179 ETIKGLGNIFFGGEGLFLTKLTGPGRVILQTQNIADFAGRIAQ 221
>gi|219852667|ref|YP_002467099.1| hypothetical protein Mpal_2078 [Methanosphaerula palustris E1-9c]
gi|219546926|gb|ACL17376.1| protein of unknown function DUF124 [Methanosphaerula palustris
E1-9c]
Length = 229
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/222 (31%), Positives = 110/222 (49%), Gaps = 7/222 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ + I G Q+ + L E++ A G+M MSG+M+M + G+ + + G+
Sbjct: 1 MQYTITGDNLQMVTLSLVEGEEIAAEAGAMVNMSGNMKMTSAVTGGLFKGLKRMVTGE-- 58
Query: 83 TSVVLRNPGPSDGF--VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
S L + PS G V A +I+P+ G E + Q DA+LC+ V +
Sbjct: 59 -SFFLSHFTPSGGGGQVAFAGNVPGKIVPVSFN--GNEFIAQRDAYLCAEMGVDLDIAFT 115
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+R+R I G EGF+ QKL+G G AF+ G VV+ L +V+ V +V S+V
Sbjct: 116 KRIRAGIFGGEGFVMQKLSGHGTAFLHCCGDVVEMTLAPDQVMKVQTGLVVGFDSTVRYD 175
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I G + +FGG+ L + GPG V +QS+ +L+Q +
Sbjct: 176 IARAGGVTTILFGGEGLFLTTLQGPGKVVLQSMDVAKLAQAL 217
>gi|291559678|emb|CBL38478.1| conserved hypothetical protein TIGR00266 [butyrate-producing
bacterium SSC/2]
Length = 227
Score = 99.4 bits (246), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/220 (30%), Positives = 114/220 (51%), Gaps = 12/220 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++ILG V L+ EK+V G M +MS +M+ME G +FG+ +
Sbjct: 3 YEILGETLPVVICYLESGEKMVTEGGGMSWMSPNMKMET-----TSNGGIGKIFGRVFSG 57
Query: 85 -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
+ +N + G G+ A A P + F G E++ Q A+L N V++S
Sbjct: 58 EKMFQNVYTAQGGKGMIA--FASCFPGSIKAFEIAPGQEMILQKSAYLAGENGVELSAFF 115
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
++++ + G EGF+ QK +G G+ F G VV+ LE GE I VD + A+T+S +
Sbjct: 116 NKKIGAGLFGGEGFIMQKASGHGIMFAEFDGHVVEYELEAGEQIVVDTGHLAAMTASCTM 175
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
+IK ++ +FGG+ + ++TGPG V++Q++P ++
Sbjct: 176 EIKSVPGVKNMLFGGEGVFNTILTGPGKVWLQTMPISNVA 215
>gi|219109733|ref|XP_002176620.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411155|gb|EEC51083.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
Length = 218
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/219 (30%), Positives = 108/219 (49%), Gaps = 2/219 (0%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV 85
+I G E+ + + L P E + A G+M +M+ + ME + + L G+ +
Sbjct: 1 KIEGEESHIATVQLSPGEVLRAESGAMLYMTHGVVMET-SLQGASSAFQRMLTGQNVFLT 59
Query: 86 VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ G S G V + ++IL + L + L+CQ A+L S V + + +
Sbjct: 60 DFKYEGQSSGTVCLGTDFPSKILRLRLDDYSNSTLICQRGAYLASNPTVDIQMEATKSLT 119
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G +GF+ QKLTG G + GG+VV K L GE + V IVA SS++ ++
Sbjct: 120 AGFFGGQGFILQKLTGHGDVLVKGGGTVVNKTLSEGETLRVTSGSIVAFESSISYDVQMM 179
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
I+ A+FGG+ L A + GPG +++Q +P R+ IA
Sbjct: 180 PGIKNAMFGGEGLFVATLQGPGQIWLQGMPPDRMIAEIA 218
>gi|67925051|ref|ZP_00518431.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 8501]
gi|67853098|gb|EAM48477.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 8501]
Length = 226
Score = 99.0 bits (245), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/228 (28%), Positives = 114/228 (50%), Gaps = 2/228 (0%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G Q+ + L E + G M +M G +EM+ G+ + L G+++
Sbjct: 1 MQYKIHGTVMQILETHLMMGESIYTESGGMAWMKGDIEMKTNTRGGLMAGIGRALAGESL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ N G V IL L M G L+CQ DAF+C+ V++ ++
Sbjct: 61 F-MTSYNCNSGQGMVAFTMEVPGTILDFQL-MPGQSLICQRDAFMCAEETVEIELHFRKK 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + G EGF+ QK++G G AF+ G + +L+ E + VD I +VN +I+
Sbjct: 119 LGAGLFGGEGFILQKISGPGKAFLEIPGEMQSYDLKPQETLLVDPGHIALFEPTVNFEIE 178
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
I+ +F G+ L A +TGPG V++QS+PF L+ ++A+ + + N
Sbjct: 179 SVKGIKNVLFSGEGLFLAKLTGPGKVWLQSMPFANLAAKLAKYIPTKN 226
>gi|416407190|ref|ZP_11688251.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 0003]
gi|357260911|gb|EHJ10240.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 0003]
Length = 226
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/226 (29%), Positives = 113/226 (50%), Gaps = 2/226 (0%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I G Q+ + L E + G M +M G +EM+ G+ + L G+++
Sbjct: 3 YKIHGTVMQILETHLMMGESIYTESGGMAWMKGDIEMKTNTRGGLMAGIGRALAGESLF- 61
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ N G V IL L M G L+CQ DAF+C+ V++ +++
Sbjct: 62 MTSYNCNSGQGMVAFTMEVPGTILDFQL-MPGQSLICQRDAFICAEETVEIELHFRKKLG 120
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ G EGF+ QK++G G AF+ G + +L+ E + VD I +VN +I+
Sbjct: 121 AGLFGGEGFILQKISGPGKAFLEIPGEMQSYDLKPQETLLVDPGHIALFEPTVNFEIESV 180
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
I+ +F G+ L A +TGPG V++QS+PF L+ ++A+ + + N
Sbjct: 181 KGIKNVLFSGEGLFLAKLTGPGKVWLQSMPFANLAAKLAKYIPTKN 226
>gi|160939087|ref|ZP_02086438.1| hypothetical protein CLOBOL_03981 [Clostridium bolteae ATCC
BAA-613]
gi|158438050|gb|EDP15810.1| hypothetical protein CLOBOL_03981 [Clostridium bolteae ATCC
BAA-613]
Length = 225
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 62/224 (27%), Positives = 113/224 (50%), Gaps = 3/224 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ +++LG +I L+ E + + G M +MS +++ G+ + G+++
Sbjct: 1 MDYRMLGDILPAVEIRLQAGEAMYTQSGGMAWMSDGFTLDSNVKGGLMKGLGRMFSGESL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
P D + A+ +IL ID A L+CQ AFLC+ V+++ L ++
Sbjct: 61 FMATYTASRP-DSTIAFASTVPGKILAIDTAKT--SLICQKGAFLCAQTTVEINTVLTKK 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
G EGF+ Q++ G G+AF+ G VV++ L GEVI VD + A ++ +I+
Sbjct: 118 FTAGFFGGEGFILQQIQGSGMAFLEVDGDVVERVLAPGEVIKVDTGNVFAFEPGISYEIE 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ +FGG+ L +TGPG V++Q++ + RIA+ +
Sbjct: 178 TVKGVKNILFGGEGLFLTKLTGPGKVYMQTMNIAEFTGRIAQGL 221
>gi|428225765|ref|YP_007109862.1| hypothetical protein GEI7407_2335 [Geitlerinema sp. PCC 7407]
gi|427985666|gb|AFY66810.1| protein of unknown function DUF124 [Geitlerinema sp. PCC 7407]
Length = 229
Score = 98.2 bits (243), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 72/224 (32%), Positives = 115/224 (51%), Gaps = 6/224 (2%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI-- 82
++I G Q I L+ E V G M +M G +EM+ G+ + L G+++
Sbjct: 3 YEIHGTVMQTVDIYLQAGESVYTESGGMAWMRGDLEMKTNTKGGLMAGLGRALAGESLFM 62
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
T+ V R+ S G V A + +I+ IDLA G L+CQ DAF+C+ + VK+ ++
Sbjct: 63 TTYVCRS---SAGMVVFAPEAPGKIMAIDLAA-GQSLICQKDAFMCAESSVKLEMHFRRK 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ QK+TG G F+ G V + +L GE + VD + +V+ I
Sbjct: 119 LGAGFFGGEGFILQKVTGPGTVFLEIPGEVREYDLRPGESMQVDPGHLAVFEPTVSYDIT 178
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+R G+ L A +TGPG +++QSLP L+ +IA+ +
Sbjct: 179 MVKGLRNIFLSGEGLFLAKMTGPGKIWLQSLPLTNLAAKIAQHL 222
>gi|322434398|ref|YP_004216610.1| hypothetical protein AciX9_0761 [Granulicella tundricola MP5ACTX9]
gi|321162125|gb|ADW67830.1| protein of unknown function DUF124 [Granulicella tundricola
MP5ACTX9]
Length = 254
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 66/224 (29%), Positives = 112/224 (50%), Gaps = 4/224 (1%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMEN---IYIPENEVGMWQWLFGKTITSVVLRNP 90
V +++L+P E V++ G + +MS S++M + G + + G ++
Sbjct: 12 VLEVLLQPGEAVISEAGELSWMSQSIQMTTHTQMGGGGGFFGAIKRVAGGGTLFMIEYTA 71
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
G + A ILPI++ G E++ FLC+ V++ Q + I G
Sbjct: 72 AGIPGEIAFATKIPGHILPIEVGQ-GNEIMVHRHGFLCATQQVQLGVGFQQSLGAGIFGG 130
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
+GFL QK++G G A++ G V+ K+L GE++ V + A T ++ QI I+
Sbjct: 131 DGFLLQKISGYGTAWLELSGEVIVKDLRPGEMLRVHPGHVGAFTGGMSFQITRVPGIKNM 190
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
+FGGD L A +TGPG V++Q+LP RL+ +I S +E+
Sbjct: 191 IFGGDGLFLAALTGPGRVWLQTLPLARLAHQILEFAPSERRQES 234
>gi|295115123|emb|CBL35970.1| conserved hypothetical protein TIGR00266 [butyrate-producing
bacterium SM4/1]
Length = 251
Score = 97.8 bits (242), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 66/229 (28%), Positives = 120/229 (52%), Gaps = 17/229 (7%)
Query: 13 YVYQSPQDAVIPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE 70
Y+Y ++ + ++I G A VC+ L+ E ++ G+M +MS +M+ME
Sbjct: 16 YIYMRREEQDMKYEIHGETLPAVVCE--LEKGEAMITEKGAMAWMSPNMQMET------N 67
Query: 71 VGMWQWLFGKTIT-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDA 125
VG FG+ + + RN + G G+ A A P + F G E + Q A
Sbjct: 68 VGGIGKAFGRLFSGESMFRNTYTAKGGPGMIA--FASSFPGSIRAFEVGPGQEYIFQKSA 125
Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
FL S V +S +++ + + G EGF+ QKL+G+G+ F G +++ +LE G+ + +
Sbjct: 126 FLASEAGVDLSVYFHKKLGSGLFGGEGFIMQKLSGRGIVFAEFDGHIMEYDLEAGQQMVI 185
Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
D + A+T+ ++I+ ++ +FGG+ L +VTGPG +++Q++P
Sbjct: 186 DTGYLAAMTAGCQMEIQSVPGVKNMLFGGEGLFNTIVTGPGHIWLQTMP 234
>gi|317499708|ref|ZP_07957966.1| hypothetical protein HMPREF0996_02950 [Lachnospiraceae bacterium
5_1_63FAA]
gi|316893014|gb|EFV15238.1| hypothetical protein HMPREF0996_02950 [Lachnospiraceae bacterium
5_1_63FAA]
Length = 227
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/220 (30%), Positives = 114/220 (51%), Gaps = 12/220 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++ILG V L+ EK+V G M +MS +M+ME G +FG+ +
Sbjct: 3 YEILGETLPVVICYLESGEKMVTEGGGMSWMSPNMKMET-----TSNGGIGKIFGRAFSG 57
Query: 85 -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
+ +N + G G+ A A P + F G E++ Q A+L N V++S
Sbjct: 58 EKMFQNVYTAQGGEGMIA--FASCFPGSIKAFEIAPGQEMILQKSAYLAGENGVELSAFF 115
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
++++ + G EGF+ QK +G G+ F G VV+ LE G+ I VD + A+T+S +
Sbjct: 116 NKKIGAGLFGGEGFIMQKASGHGIMFAEFDGHVVEYELEAGKQIVVDTGHLAAMTASCTM 175
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
+IK ++ +FGG+ + ++TGPG V++Q++P ++
Sbjct: 176 EIKSVPGVKNMLFGGEGVFNTILTGPGKVWLQTMPISNVA 215
>gi|283797697|ref|ZP_06346850.1| HTH DNA-binding protein [Clostridium sp. M62/1]
gi|291074591|gb|EFE11955.1| TIGR00266 family protein [Clostridium sp. M62/1]
gi|295092075|emb|CBK78182.1| conserved hypothetical protein TIGR00266 [Clostridium cf.
saccharolyticum K10]
Length = 251
Score = 97.8 bits (242), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 66/229 (28%), Positives = 122/229 (53%), Gaps = 17/229 (7%)
Query: 13 YVYQSPQDAVIPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE 70
Y+Y ++ + ++I G A VC+ L+ E ++ G+M +MS +M+ME N
Sbjct: 16 YIYMRREEQDMKYEIHGETLPAVVCE--LEKGEAMITEKGAMAWMSPNMQMET-----NA 68
Query: 71 VGMWQWLFGKTIT-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDA 125
G+ + FG+ + + RN + G G+ A A P + F G E + Q A
Sbjct: 69 GGIGK-AFGRLFSGESMFRNTYTAKGGPGMIA--FASSFPGSIRAFEVGPGQEYIFQKSA 125
Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
FL S V +S +++ + + G EGF+ QKL+G+G+ F G +++ +LE G+ + +
Sbjct: 126 FLASEAGVDLSVYFHKKLGSGLFGGEGFIMQKLSGRGIVFAEFDGHIMEYDLEAGQQMVI 185
Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
D + A+T+ ++I+ ++ +FGG+ L +VTGPG +++Q++P
Sbjct: 186 DTGYLAAMTAGCQMEIQSVPGVKNMLFGGEGLFNTIVTGPGHIWLQTMP 234
>gi|383761740|ref|YP_005440722.1| hypothetical protein CLDAP_07850 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
gi|381382008|dbj|BAL98824.1| hypothetical protein CLDAP_07850 [Caldilinea aerophila DSM 14535 =
NBRC 104270]
Length = 251
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 76/236 (32%), Positives = 121/236 (51%), Gaps = 11/236 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG--MWQWLFGK 80
+ ++I+G Q + L P E + + G M +MSG+++M+ +G + + L G+
Sbjct: 23 MEYKIIGSTLQTVILELDPGETIYSESGGMAWMSGNIQMKTT-TRGGGIGDVLKRALSGE 81
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+ V G V AA +IL + L G EL+CQ AFLC+ VK+
Sbjct: 82 SFFLVEF-TAESQKGVVAFAADFPGKILAVSLEQ-GQELICQRQAFLCAEKTVKLDIHFQ 139
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+R G EGF+ QKLTG G+AF+ G +V+ L GEV+ VD + SV+
Sbjct: 140 RRPGAGFFGGEGFILQKLTGPGVAFVSLDGEIVEYTLSPGEVLKVDTGHVGMYEPSVSFD 199
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS--PNMREN 254
I+ R +FGG+ L A + GPG V++Q++P +Q +A+A+ S P +R +
Sbjct: 200 IEMVPGFRNILFGGEGLFLATLRGPGRVWLQTMP----TQNLAKAILSYFPRLRAD 251
>gi|326433927|gb|EGD79497.1| hypothetical protein PTSG_12989 [Salpingoeca sp. ATCC 50818]
Length = 282
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 69/230 (30%), Positives = 122/230 (53%), Gaps = 4/230 (1%)
Query: 17 SPQ-DAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGM 73
+PQ + + +QI+G E+Q+ +++L + V PG+M MS + +I++ +
Sbjct: 42 TPQFNGQVQWQIVGDESQILEVVLNQGDAVTTEPGTMVHMSSGLS-PDIHLSGGCGQACT 100
Query: 74 WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
G++ V N + +G+ A+I+PI+L FGG + + AF+C++N
Sbjct: 101 RSCCAGESFFRVKYENSTNTPQHIGLTPNFPAKIVPINLPEFGGRITIKSRAFMCALNCQ 160
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+ + G +GF+ L+G G+AF+ A G+V+ +NL GE + D S +VA
Sbjct: 161 PTFSYRFAGLGAGCCGGQGFVLNDLSGDGMAFLNASGTVLMRNLGAGEELICDQSSVVAF 220
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
S+VN I+ G GG+ L +V+TGPGIV +QS+P +++ +A
Sbjct: 221 QSTVNFTIRRAGGCLMCCCGGEGLFNSVLTGPGIVVLQSMPIEKIAAALA 270
>gi|309789698|ref|ZP_07684278.1| protein of unknown function DUF124 [Oscillochloris trichoides DG-6]
gi|308228184|gb|EFO81832.1| protein of unknown function DUF124 [Oscillochloris trichoides DG6]
Length = 314
Score = 97.1 bits (240), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 67/236 (28%), Positives = 126/236 (53%), Gaps = 5/236 (2%)
Query: 11 QPYVYQSPQDAV-IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN 69
P + S Q A + ++I+G Q I L P + + + G M +MS ++ M+ +
Sbjct: 58 DPAIVGSGQGASGLKYKIIGTTMQAVVIELPPNQTIFSERGGMSWMSANINMQT-NMEGG 116
Query: 70 EVGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
G ++ +F G++I V + G G +G +A +I+P++L G +++CQ D+F+C
Sbjct: 117 LGGAFKRMFSGESIFMVSFTSQG-GMGIIGFSAEFPGKIVPLNLGA-GQQMICQKDSFMC 174
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
+ V + +++ G EGF+ QK+TG GLAF+ G +V+ LE +++ VD
Sbjct: 175 AERSVTLDLHFRRKLGAGFFGGEGFIMQKITGPGLAFVELDGEIVEYTLEANQMLKVDTG 234
Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ +V ++ + +FGG+ L A + GPG V++Q++P L+++IA+
Sbjct: 235 HVAMYEPTVQFDVEMVRGFKNILFGGEGLFLATLRGPGRVWLQTMPAMNLAKKIAQ 290
>gi|159900866|ref|YP_001547113.1| hypothetical protein Haur_4353 [Herpetosiphon aurantiacus DSM 785]
gi|159893905|gb|ABX06985.1| protein of unknown function DUF124 [Herpetosiphon aurantiacus DSM
785]
Length = 340
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/224 (28%), Positives = 114/224 (50%), Gaps = 10/224 (4%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MWQWLFGK 80
++I+G Q I L P + + + G M +MSG+++M N G M++ F
Sbjct: 100 YKIIGTVLQAAIIELDPGQMIFSESGGMSWMSGNVQMNT-----NSGGGIGKMFKRAFSG 154
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+V G +G A+ +I+P+ LA G E++ Q D+F+C+ V++
Sbjct: 155 ESLFIVDFTVAGGTGLIGFASEMPGKIVPLHLAP-GQEIIMQKDSFMCAEKSVQLDIHFR 213
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ + G EGF+ Q+ TG G+ F G VV+ L+ G+V+ VD + SV
Sbjct: 214 RKLGAGLFGGEGFIMQRATGPGVVFAELDGEVVEYYLQPGQVMKVDTGHVAMYEPSVQFD 273
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
I+ + +FGG+ L A + GPG V++Q++P +++++A
Sbjct: 274 IEMVRGFKNILFGGEGLFLATLRGPGKVWLQTMPVMNVAKKLAE 317
>gi|390956112|ref|YP_006419869.1| hypothetical protein Terro_0183 [Terriglobus roseus DSM 18391]
gi|390411030|gb|AFL86534.1| TIGR00266 family protein [Terriglobus roseus DSM 18391]
Length = 254
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 108/232 (46%), Gaps = 4/232 (1%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---VGMWQWLFGKTI 82
+I G + +L+P E V++ G + +MS S+ M +G + + G
Sbjct: 4 RIHGTTMPAIEFLLQPGETVISEAGELSWMSASIGMHTHTQAGGGGGFLGAIKRMAGGGT 63
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ G V A I+P++L G E + FLC+ +V + Q
Sbjct: 64 LFMTEYTAQGYPGEVAFATKVPGHIVPVELHGNGDEYMVHRHGFLCATPNVGIGLGFQQS 123
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ I G EGFL QK+ G G A++ G V+ ++L+ GE + V + A + V+ QI
Sbjct: 124 LGAGIFGGEGFLLQKIYGHGTAWLELSGEVIVRDLQPGETLRVHPGHVGAFHAGVSFQIT 183
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
IR FGGD + A +TGPG +++Q+LP L+ ++ + P+ RE
Sbjct: 184 RVPGIRNMFFGGDGIFLAALTGPGRIWLQTLPISNLAHKLQEYLPGPD-RET 234
>gi|219118253|ref|XP_002179905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408958|gb|EEC48891.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 305
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 68/239 (28%), Positives = 125/239 (52%), Gaps = 14/239 (5%)
Query: 9 PFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPE 68
P +P VY D + ++ILG +AQ+ Q+ L+ + +V+ G+M +MS M+ME
Sbjct: 48 PPKPNVYA---DGLPVYEILGTDAQIVQLPLRARRQVMCFSGAMAYMSDGMKMEAKLAGL 104
Query: 69 NEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF-GGELLCQPDAFL 127
+ + G ++ + N DG++ + I+PI++A G+++ D+FL
Sbjct: 105 GKT-FGRLAGGGSLFQLTYTNETDQDGYIAMTPDYPGVIVPINMASCPAGKIVTMRDSFL 163
Query: 128 CS---VNDVKVSNTLDQRVRNVIG-----GIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
C+ V+++ + IG GI+ F+ Q ++ AF++A G+V+QKNL+
Sbjct: 164 CATVGVDNLTTEVGAGFNPASSIGSFLCSGID-FIVQTVSNGEWAFLMAMGTVIQKNLQP 222
Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
GE + VD I+ SSV + I++ G + GG+ L +TGPG +++QS+ ++
Sbjct: 223 GEKLLVDTESILCFESSVTIDIQWVGNVAAICCGGEGLFNTTMTGPGKIWMQSMSIDKM 281
>gi|332798969|ref|YP_004460468.1| hypothetical protein TepRe1_1013 [Tepidanaerobacter acetatoxydans
Re1]
gi|438002061|ref|YP_007271804.1| DUF124 domain-containing protein [Tepidanaerobacter acetatoxydans
Re1]
gi|332696704|gb|AEE91161.1| protein of unknown function DUF124 [Tepidanaerobacter acetatoxydans
Re1]
gi|432178855|emb|CCP25828.1| DUF124 domain-containing protein [Tepidanaerobacter acetatoxydans
Re1]
Length = 227
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/222 (29%), Positives = 108/222 (48%), Gaps = 6/222 (2%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ +++ G V + L E + G M +MS EM+ G+ + L G+++
Sbjct: 1 MKYELRGDTLPVVLLTLDEGESIFTESGGMSWMSDGFEMKTNMEGGLFGGIARKLAGESL 60
Query: 83 --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
T+ L S G + ++ +I+PI L G L+CQ +FLC+ VK+ L
Sbjct: 61 FMTTYTLNK---SSGTIAFSSSFPGKIVPIYLDT-GESLICQKTSFLCAERSVKLEIHLK 116
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ I G EGF+ Q++TG G F+ G V+ L+ GE + VD + SVN
Sbjct: 117 KKLGAGIFGGEGFILQRVTGPGWVFLEIDGEAVEYQLDPGEKMKVDTGHVAIFEPSVNFD 176
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I+ VFGG+ L + GPG +++QS+P L+ RI
Sbjct: 177 IETVKGFTNVVFGGEGLFLTTLKGPGKIWLQSMPIGNLASRI 218
>gi|442804464|ref|YP_007372613.1| hypothetical protein Cst_c10050 [Clostridium stercorarium subsp.
stercorarium DSM 8532]
gi|442740314|gb|AGC68003.1| hypothetical protein Cst_c10050 [Clostridium stercorarium subsp.
stercorarium DSM 8532]
Length = 270
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 65/226 (28%), Positives = 109/226 (48%), Gaps = 3/226 (1%)
Query: 17 SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
S + ++I GG I L E + + G M +M + ME G +
Sbjct: 40 SANPGKLNYKIFGGSFPAVSIRLNTNESIYTQAGGMTWMDAGITMETNMQGGLMKGFGRM 99
Query: 77 LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
L G+++ V + P+ V IA+ IL ID+ ++ Q AFLC+ V +S
Sbjct: 100 LSGESLFMVTYTSHAPNQEIV-IASSFPGCILDIDVGRC--PIIAQKSAFLCAQTTVNLS 156
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+ + ++ + G EGF+ Q+L+G G+AFI GS+V++ L GE I VD + A +
Sbjct: 157 IYVARGLKAGLFGGEGFVLQRLSGSGIAFIEIDGSLVERTLAPGETIKVDTGNVAAFEET 216
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
V Q + + +FGG+ L +TGPG V++Q++ ++ I
Sbjct: 217 VQYQAEMVKGFKNILFGGEGLFLTTLTGPGKVWLQTMSMPEFAKSI 262
>gi|393719536|ref|ZP_10339463.1| hypothetical protein SechA1_07285 [Sphingomonas echinoides ATCC
14820]
Length = 259
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 110/229 (48%), Gaps = 3/229 (1%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I G Q I L P E++V++ +M +MS + M+ G+ + L G +I
Sbjct: 3 YEISGTVMQTVSIDLSPGERIVSQTHAMAWMSDGITMDTHTGGGLFAGLKRALSGGSI-- 60
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ G V A I+ +L G L+C+ + FLC+ V + QR
Sbjct: 61 FITEYGAERPGHVAFAPRFPGTIIARELRA-GESLICRKETFLCAEYSVSLEIAFQQRFG 119
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ EGF+ Q++TG G F+ G V++K L GE + V I +V+ I+
Sbjct: 120 AGVFAGEGFILQRVTGPGTVFLDLSGEVIEKTLSPGERLRVHAGHIGMQEPTVSTDIQMV 179
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
R +FGG+ L A +TGPG +++QS+P L++ IAR + S R+
Sbjct: 180 RGFRNILFGGEGLFLATLTGPGKIWLQSMPILNLAEEIARHLPSNEARD 228
>gi|52550378|gb|AAU84227.1| HTH DNA-binding protein [uncultured archaeon GZfos3D4]
Length = 230
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 99/200 (49%), Gaps = 4/200 (2%)
Query: 43 EKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAP 102
E V G+M +MS ++EM G+ + + G++ + G DGFV A
Sbjct: 10 EMVYGEAGAMIYMSENVEMNAKAKGGFLKGIKRKVSGESFFLTEFTSNG--DGFVAFAGN 67
Query: 103 SLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
+ RI I+LA G E + Q DAFLC+ V + +++ + G EGF+ ++L G G
Sbjct: 68 APGRIKAIELA--GNEFIAQKDAFLCAQEGVDLDIAFTKKLGAGLFGGEGFIMERLGGNG 125
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
AFI G + +L GEV+ VD +V SSV+ I G ++ +F G+ + +
Sbjct: 126 TAFIHGCGDFKEMDLAPGEVVKVDTGSVVGFDSSVSYDIMKVGNVKSMLFSGEGIFLTTL 185
Query: 223 TGPGIVFIQSLPFHRLSQRI 242
GPG V +QS+ L+ +
Sbjct: 186 KGPGHVILQSMTIENLAAAL 205
>gi|347531876|ref|YP_004838639.1| hypothetical protein RHOM_07970 [Roseburia hominis A2-183]
gi|345502024|gb|AEN96707.1| hypothetical protein RHOM_07970 [Roseburia hominis A2-183]
Length = 226
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 62/208 (29%), Positives = 110/208 (52%), Gaps = 11/208 (5%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT--SVVLRNPGP--SD 94
L+ E ++ G+M +MS +M+ME G +FG+ ++ S+ L +
Sbjct: 17 LEQGEVMITESGAMSWMSPNMKMETT------GGGIGKMFGRALSGESIFLNRYTAMNGN 70
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G + A+ I D+ G +L+ Q AFL +V+ V +S QR+ + + G EGF+
Sbjct: 71 GMIAFASSFPGSIRAFDIVP-GQDLIVQKSAFLAAVSSVDLSVFFQQRIGSGLFGGEGFI 129
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
QKL+G G AF+ G + + L G+ I VD + A+T S +++K ++ +FGG
Sbjct: 130 MQKLSGSGTAFLEFDGYIKEYELASGQQIVVDTGYLAAMTGSCKIEVKTVPGVKNMLFGG 189
Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ + V+TGPG V++QS+P +++ +
Sbjct: 190 EGIFNTVITGPGRVWLQSMPIAQMANTL 217
>gi|317473273|ref|ZP_07932569.1| hypothetical protein HMPREF1011_02919 [Anaerostipes sp. 3_2_56FAA]
gi|316899267|gb|EFV21285.1| hypothetical protein HMPREF1011_02919 [Anaerostipes sp. 3_2_56FAA]
Length = 226
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 112/231 (48%), Gaps = 12/231 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMS----GSMEMENIYIPENEVGMWQWLF 78
+ +I+G ++ L E + + G+M +M G +ME ++ G+ +
Sbjct: 1 MKHEIIGQTVPAVEVELDKGEAMFTQSGAMAWMDPSIKGESKMEGGFLK----GIGRKFA 56
Query: 79 GKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
G+++ V + DG + A+ ILP+ GG ++CQ AFLC+ V +
Sbjct: 57 GESLFMVTYSSD--KDGAKIAFASTVPGTILPLQFQGTGG-MICQKKAFLCAQRSVSLET 113
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ G EGF+ Q+L+G G+AF+ G V K L GE I VD +VA +
Sbjct: 114 IFTKKLSAGAFGGEGFILQRLSGDGMAFLEVDGDAVTKELAPGETILVDTGNVVAFEDGI 173
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+ +I+ ++ FGG+ L +TGPG + +Q+ F+ + RIA + S
Sbjct: 174 SYEIERIKGVKNIFFGGEGLFLTKLTGPGKIILQTQNFNDFAGRIASLIPS 224
>gi|167746172|ref|ZP_02418299.1| hypothetical protein ANACAC_00868 [Anaerostipes caccae DSM 14662]
gi|167654687|gb|EDR98816.1| TIGR00266 family protein [Anaerostipes caccae DSM 14662]
Length = 226
Score = 94.0 bits (232), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 112/231 (48%), Gaps = 12/231 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMS----GSMEMENIYIPENEVGMWQWLF 78
+ +I+G ++ L E + + G+M +M G +ME ++ G+ +
Sbjct: 1 MKHEIIGQTVPAVEVELDKGEAMFTQSGAMAWMDPSIKGESKMEGGFLK----GIGRKFA 56
Query: 79 GKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
G+++ V + DG + A+ ILP+ GG ++CQ AFLC+ V +
Sbjct: 57 GESLFMVTYSSE--KDGAKIAFASTVPGTILPLQFQGTGG-MICQKKAFLCAQRSVSLET 113
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ G EGF+ Q+L+G G+AF+ G V K L GE I VD +VA +
Sbjct: 114 IFTKKLSAGALGGEGFILQRLSGDGMAFLEVDGDAVTKELAPGETILVDTGNVVAFEDGI 173
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+ +I+ ++ FGG+ L +TGPG + +Q+ F+ + RIA + S
Sbjct: 174 SYEIERIKGVKNIFFGGEGLFLTKLTGPGKIILQTQNFNDFAGRIASLIPS 224
>gi|323449198|gb|EGB05088.1| hypothetical protein AURANDRAFT_14032 [Aureococcus anophagefferens]
Length = 201
Score = 93.6 bits (231), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 58/208 (27%), Positives = 102/208 (49%), Gaps = 7/208 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++++G + Q+ +I ++P +KVVA PG+M +M ++ ++ M + G
Sbjct: 1 YEVVGHDMQLLRIQMEPGDKVVAEPGAMVYMHSDVQAGC----DSSDCMGRCCSGSPCIM 56
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
P + G++G+ + A+++P++L G L + A+ S+ DV V D
Sbjct: 57 GTFETP-DTGGYLGLTPVNPAKVIPLELG--GRSFLGKDGAYFASIGDVAVGYDCDCNPA 113
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G +G +RQK++G G+AF+ A G + K L GE VD + +VA +V IK
Sbjct: 114 TCCCGGQGLVRQKISGNGMAFLGAMGVLTTKTLAEGETFVVDSNSLVAWEETVKFSIKRT 173
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
G G+ + + GPG +F QS
Sbjct: 174 GGCLTCCCSGEGMFNTTLEGPGTIFTQS 201
>gi|413933084|gb|AFW67635.1| hypothetical protein ZEAMMB73_642216 [Zea mays]
Length = 90
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 44/68 (64%), Positives = 56/68 (82%), Gaps = 1/68 (1%)
Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIV 273
DN +T +TGPG V IQSLPFHRLSQ IA R+V P++R+NPKFF+Q+ +FFF AYV+IV
Sbjct: 6 DNQLTTSLTGPGFVLIQSLPFHRLSQWIASRSVVGPSLRDNPKFFIQVVMFFFSAYVMIV 65
Query: 274 SSLILTDV 281
SS+IL D+
Sbjct: 66 SSIILADI 73
>gi|331092539|ref|ZP_08341361.1| hypothetical protein HMPREF9477_02004 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330400760|gb|EGG80363.1| hypothetical protein HMPREF9477_02004 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 226
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 109/217 (50%), Gaps = 12/217 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ +QI G V L+ EK++ GSM +MS +M ME G FG+
Sbjct: 1 MQYQIKGETLPVVICHLEAGEKMITEKGSMSWMSPNMLMET-----GTNGGLGKAFGRMF 55
Query: 83 TS-VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSN 137
+ + +N S G G A A P + F G E++ Q AFL + V++S
Sbjct: 56 SGESMFQNTYTSQGGNGTIA--FASSFPGSIKAFEISTGNEMIFQKSAFLAAEAGVQLSV 113
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ + + G EGF+ Q+++GQG F G V++ L+ G+ I VD + A+T S
Sbjct: 114 HFQKKLGSGLFGGEGFILQRVSGQGTMFAEFDGHVIEYELQPGQQIVVDTGHLAAMTPSC 173
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
+ IK ++ VFGG+ L ++TGPG V++Q++P
Sbjct: 174 QMDIKTIKGVKNIVFGGEGLFNTIITGPGRVWLQTMP 210
>gi|219117839|ref|XP_002179707.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217408760|gb|EEC48693.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 259
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 69/221 (31%), Positives = 109/221 (49%), Gaps = 9/221 (4%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
P+ F + G E+QV + + E G+M ++S M Y E +
Sbjct: 26 PKPDAAKFAVTGHESQVVTLQIGVGEACQGEAGTMMYLSPGMRQSVTY----EGCCQRCC 81
Query: 78 FGKTITSVVLRNPGPSDGFVGIA-APSL--ARILPIDLAM--FGGELLCQPDAFLCSVND 132
G++ V N G + +A P+ A+++P+DL+ GG+L+ Q A++ S D
Sbjct: 82 SGESCFVVNFTNSGSTGNHEFVALTPNFPTAKVVPVDLSSPDVGGKLVAQQGAYMASYGD 141
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
V+V +LD G +RQKL G G F+ G++VQK L GE I VD +CI+A
Sbjct: 142 VRVGISLDTNCMRCCCAGLGLVRQKLEGSGTVFLGGTGTIVQKVLADGETILVDTNCIMA 201
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
S+ + +K G I V GG+ + +TGPG+V +QS+
Sbjct: 202 FASTCKLDLKRAGGILGMVGGGEGIFNTTLTGPGLVVVQSM 242
>gi|163845873|ref|YP_001633917.1| hypothetical protein Caur_0275 [Chloroflexus aurantiacus J-10-fl]
gi|222523587|ref|YP_002568057.1| hypothetical protein Chy400_0293 [Chloroflexus sp. Y-400-fl]
gi|163667162|gb|ABY33528.1| protein of unknown function DUF124 [Chloroflexus aurantiacus
J-10-fl]
gi|222447466|gb|ACM51732.1| protein of unknown function DUF124 [Chloroflexus sp. Y-400-fl]
Length = 315
Score = 92.0 bits (227), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 116/220 (52%), Gaps = 2/220 (0%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I+G Q + + + + + G M +MS +++M+ + G ++ +F
Sbjct: 75 YKIIGTTMQAVVLEVPAGQTIFSERGGMSWMSANVQMQT-NMEGGLGGAFKRMFSGESIF 133
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+V P G +G +A +I+P++LA G ++CQ D+F+C+ V + +++
Sbjct: 134 MVNFTPQGGPGIIGFSAELPGKIVPLNLAP-GQTMICQKDSFMCAERSVSLDIHFRRKLG 192
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G EGF+ QKLTG GLAF+ G VV+ LE +++ VD + ++N ++
Sbjct: 193 AGFFGGEGFIMQKLTGPGLAFVELDGEVVEYTLEANQMLKVDTGHVAMYEPTINFDVEMV 252
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ +FGG+ L + GPG V++Q++P L++++A+
Sbjct: 253 RGFKNILFGGEGLFLTTLRGPGRVWLQTMPAMNLAKKLAQ 292
>gi|379721089|ref|YP_005313220.1| hypothetical protein PM3016_3216, partial [Paenibacillus
mucilaginosus 3016]
gi|378569761|gb|AFC30071.1| hypothetical protein PM3016_3216 [Paenibacillus mucilaginosus 3016]
Length = 115
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 44/99 (44%), Positives = 65/99 (65%), Gaps = 1/99 (1%)
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
QK+ G GLAF+ AGG++ ++ L GE+I VD C+VA+T V+ I++ ++ A+FGG+
Sbjct: 2 QKIEGDGLAFVHAGGAICERELRPGEMIRVDTGCLVAMTQGVDYDIEFVKGVKTALFGGE 61
Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
L A + GPG V+IQSLPF RL+ R+ A P R+
Sbjct: 62 GLFFATLRGPGRVWIQSLPFSRLADRVL-AAAGPGGRKE 99
>gi|65322230|ref|ZP_00395189.1| COG2013: Uncharacterized conserved protein [Bacillus anthracis str.
A2012]
Length = 113
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 44/94 (46%), Positives = 63/94 (67%), Gaps = 4/94 (4%)
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
QKL G GLAF+ AGG+V ++ L+ GE + +D C+VA+T VN +++ G ++ A+FGG+
Sbjct: 2 QKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDVNYDVEFVGKVKTALFGGE 61
Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
L A + GPG V+IQSL RLS R+ TSP
Sbjct: 62 GLFFATLEGPGTVWIQSLTLSRLSARL----TSP 91
>gi|148658157|ref|YP_001278362.1| hypothetical protein RoseRS_4068 [Roseiflexus sp. RS-1]
gi|148570267|gb|ABQ92412.1| protein of unknown function DUF124 [Roseiflexus sp. RS-1]
Length = 253
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 69/227 (30%), Positives = 112/227 (49%), Gaps = 13/227 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------QW 76
+ +QI+G Q + L P E V + G+M +MSG+++M GM+ +
Sbjct: 26 MEYQIIGTTMQAVILELDPGETVYSESGAMSWMSGNIQMATNTRGGGLGGMFKRAISGES 85
Query: 77 LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
LF TSV G V A+ +I+P+ LA G ++ Q AFLC+ V +
Sbjct: 86 LFLNEFTSV------GGKGIVAFASDFPGKIVPVALAE-GQMMIVQKQAFLCAEKTVGLD 138
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+R+ G EGF+ QKLTG G+AF+ G +V+ L+ G+V+ VD + +
Sbjct: 139 IHFRKRLGAGFFGGEGFIMQKLTGPGIAFVCLDGEIVEYTLDAGQVLKVDTGHVAMYEPT 198
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
V I+ + + GG+ L + GPG V++Q++P L++ IA
Sbjct: 199 VEFDIEMVKGFKNILLGGEGLFLTTLRGPGRVWLQTMPTANLAKAIA 245
>gi|346308928|ref|ZP_08851032.1| hypothetical protein HMPREF9457_02741 [Dorea formicigenerans
4_6_53AFAA]
gi|345901477|gb|EGX71276.1| hypothetical protein HMPREF9457_02741 [Dorea formicigenerans
4_6_53AFAA]
Length = 226
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 112/217 (51%), Gaps = 6/217 (2%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI G V L+ E+++ G M +MS +M+ME + + G+ +
Sbjct: 3 YQIQGDTLPVVICQLEAGERMITEGGGMSWMSPNMKMETTTNGGIGKAVGRMFSGEKMFQ 62
Query: 85 VVLRNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ G +G + A+ P R I G E++ Q AFL S V++S ++
Sbjct: 63 NIYTAQG-GNGMIAFASCFPGSIRAFNIRP---GQEMIFQKSAFLASEAGVQLSVHFHKK 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
V + + G EGF+ QK++GQG+AF G VV+ L+ G+ I +D + A+T+S + I+
Sbjct: 119 VASGLFGGEGFVLQKVSGQGVAFAEFDGHVVEYELQPGQQIVIDTGHLAAMTASCQMDIQ 178
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
++ +FGG+ L V+TGPG V++Q++P ++
Sbjct: 179 TVPGVKNMLFGGEGLFNTVITGPGRVWLQTMPISNVA 215
>gi|166030603|ref|ZP_02233432.1| hypothetical protein DORFOR_00269 [Dorea formicigenerans ATCC
27755]
gi|166029605|gb|EDR48362.1| TIGR00266 family protein [Dorea formicigenerans ATCC 27755]
Length = 229
Score = 91.3 bits (225), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 112/217 (51%), Gaps = 6/217 (2%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI G V L+ E+++ G M +MS +M+ME + + G+ +
Sbjct: 6 YQIQGDTLPVVICQLEAGERMITEGGGMSWMSPNMKMETTTNGGIGKAVGRMFSGEKMFQ 65
Query: 85 VVLRNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ G +G + A+ P R I G E++ Q AFL S V++S ++
Sbjct: 66 NIYTAQG-GNGMIAFASCFPGSIRAFNIRP---GQEMIFQKSAFLASEAGVQLSVHFHKK 121
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
V + + G EGF+ QK++GQG+AF G VV+ L+ G+ I +D + A+T+S + I+
Sbjct: 122 VASGLFGGEGFVLQKVSGQGVAFAEFDGHVVEYELQPGQQIVIDTGHLAAMTASCQMDIQ 181
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
++ +FGG+ L V+TGPG V++Q++P ++
Sbjct: 182 TVPGVKNMLFGGEGLFNTVITGPGRVWLQTMPISNVA 218
>gi|228967985|ref|ZP_04128994.1| Protein containing DUF124 [Bacillus thuringiensis serovar sotto
str. T04001]
gi|228791709|gb|EEM39302.1| Protein containing DUF124 [Bacillus thuringiensis serovar sotto
str. T04001]
Length = 113
Score = 90.5 bits (223), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 63/94 (67%), Gaps = 4/94 (4%)
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
QKL G GLAF+ AGG+V ++ L+ GE + +D C+VA+T +N I++ G ++ A+FGG+
Sbjct: 2 QKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDINYDIEFVGKVKTALFGGE 61
Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 62 GLFFATLEGPGTVWIQSLTLSRLAARL----TSP 91
>gi|153852988|ref|ZP_01994425.1| hypothetical protein DORLON_00410 [Dorea longicatena DSM 13814]
gi|149754630|gb|EDM64561.1| TIGR00266 family protein [Dorea longicatena DSM 13814]
Length = 227
Score = 90.5 bits (223), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 66/229 (28%), Positives = 115/229 (50%), Gaps = 12/229 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I G V L+ EK++ G M +MS +M+ME +G FG+ +
Sbjct: 3 YEIKGDTLPVVICYLEGGEKMITEGGGMSWMSPNMKMETT--TNGGIGK---AFGRMFSG 57
Query: 85 -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
+ +N + G G+ A A P + F G E++ Q AFL S V++S
Sbjct: 58 EHIFQNIYTAQGGKGMIA--FASCFPGSIKAFEIGPGREMVFQKSAFLASEAGVELSVFF 115
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ + + G EGF+ QK++GQG F G VV+ L G+ I VD + A+ +S +
Sbjct: 116 NKKFSSGLFGGEGFIMQKVSGQGTVFAEFDGHVVEYELNAGQEIVVDTGHLAAMDASCQI 175
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+IK ++ +FGG+ + VV GPG V++Q++P ++ + +T+
Sbjct: 176 EIKSVPGVKNMLFGGEGIFNTVVHGPGRVWLQTMPISNVAGALGPYITT 224
>gi|423603430|ref|ZP_17579323.1| hypothetical protein IIK_00011 [Bacillus cereus VD102]
gi|401247409|gb|EJR53745.1| hypothetical protein IIK_00011 [Bacillus cereus VD102]
Length = 113
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 43/94 (45%), Positives = 63/94 (67%), Gaps = 4/94 (4%)
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
QKL G GLAF+ AGG+V ++ L+ GE + +D C+VA+T VN +++ G ++ A+FGG+
Sbjct: 2 QKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDVNYDVEFVGKVKTALFGGE 61
Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
L A + GPG V+IQSL RL+ R+ TSP
Sbjct: 62 GLFFATLEGPGTVWIQSLTLSRLAARL----TSP 91
>gi|210617031|ref|ZP_03291366.1| hypothetical protein CLONEX_03588 [Clostridium nexile DSM 1787]
gi|210149554|gb|EEA80563.1| hypothetical protein CLONEX_03588 [Clostridium nexile DSM 1787]
Length = 242
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 66/215 (30%), Positives = 109/215 (50%), Gaps = 12/215 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I G V L+ EK++ GSM +MS +MEM+ G +FG+ +
Sbjct: 19 YEIKGETLPVVICYLESGEKMITERGSMSWMSPNMEMQT-----TSNGGIGKVFGRAFSG 73
Query: 85 VVL-RNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
L +N + G G+ A A P + F G E++ Q AFL V++S
Sbjct: 74 EALFQNTYTAQGGNGMIA--FASSFPGSIKAFQVSPGNEMIFQKSAFLAGEAGVELSIHF 131
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
++V + + G EGF+ Q+++G G F G VV+ L+ G+ I +D + A+T S +
Sbjct: 132 HKKVASGLFGGEGFILQRVSGIGTVFAEFDGHVVEYELQPGQQIVIDSGHLAAMTPSCQM 191
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
I+ ++ VFGG+ L V+TGPG V++Q++P
Sbjct: 192 DIQTIRGLKNVVFGGEGLFNTVITGPGRVWLQTMP 226
>gi|309776314|ref|ZP_07671302.1| HTH DNA-binding protein [Erysipelotrichaceae bacterium 3_1_53]
gi|308915910|gb|EFP61662.1| HTH DNA-binding protein [Erysipelotrichaceae bacterium 3_1_53]
Length = 225
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 66/222 (29%), Positives = 110/222 (49%), Gaps = 6/222 (2%)
Query: 23 IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+ ++I+GG+ A VC+ L EKV G MC+M M++ G+ + + G+
Sbjct: 1 MKYEIIGGQLPAVVCK--LDRGEKVFTESGGMCWMEEGFSMDSNTRGGLLKGLGRAMSGE 58
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+I + D + + +ILP+ L G L+ Q AFL + + V +
Sbjct: 59 SIF-LTTYTSSQDDAEIAFGSSFPGKILPVALEP-GKTLIVQKHAFLAAEDGVNLQAHFR 116
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ G EGF+ QKLTG G+AF+ G +++K L GE + VD I SV
Sbjct: 117 KKLGTGFFGGEGFVLQKLTGNGMAFLEIDGDIIEKTLAPGETLQVDQGYIAGFEESVQFD 176
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I ++ F G+ L A ++GPG +++Q++PF L+ RI
Sbjct: 177 ITTVKGLKNKFFSGEGLFLATLSGPGKIWLQTMPFSVLADRI 218
>gi|226325602|ref|ZP_03801120.1| hypothetical protein COPCOM_03415 [Coprococcus comes ATCC 27758]
gi|225205726|gb|EEG88080.1| TIGR00266 family protein [Coprococcus comes ATCC 27758]
Length = 227
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/223 (27%), Positives = 110/223 (49%), Gaps = 14/223 (6%)
Query: 24 PFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT 83
++ILGG V L E ++ GSM +MS +M+ME I G + +FG+ ++
Sbjct: 3 KYEILGGNLPVVVCELSAGESMITESGSMSWMSPNMKMETI-----SGGGMKKMFGRLMS 57
Query: 84 -SVVLRNPGPSDGFVGIAA-----PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
+N ++G G+ A P + L I G ++ Q AFL S V++S
Sbjct: 58 GDSAFQNRYTAEGADGMIAFASSFPGAIKALEIS---NGHSMIVQKSAFLASEEGVELSM 114
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ I G EGF+ Q+L+G G AF+ G V+ +L G+ I + + A+ +
Sbjct: 115 HFQKKLGKGIFGGEGFIMQRLSGNGTAFVEIDGHAVEYDLSAGQEILISTGYLAAMEETC 174
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
+ + ++ + GG+ + VV GPG V +Q++P ++++
Sbjct: 175 TMDVVAVKGVKNMLIGGEGIFNTVVKGPGKVILQTMPISKVAE 217
>gi|260437558|ref|ZP_05791374.1| HTH DNA-binding protein [Butyrivibrio crossotus DSM 2876]
gi|292810190|gb|EFF69395.1| HTH DNA-binding protein [Butyrivibrio crossotus DSM 2876]
Length = 223
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 67/219 (30%), Positives = 111/219 (50%), Gaps = 12/219 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+ I GG + L+ E ++ G+M +MS +M ME G + + G+ +
Sbjct: 3 YTIEGGTLPTLILNLEAGETIMTEGGAMSWMSPNMLMET-----TTNGGFGKMMGRMFSG 57
Query: 85 VVL-RNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
L N + G G+ A +A LP + F G EL+CQ +FL +++S
Sbjct: 58 EKLFMNRYTAQGGPGMIA--MASSLPGSIKPFEIVPGRELICQKSSFLSCTPGIELSVAF 115
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ G EGF+ QKL+GQGLAFI G + L G+ I +D + A+ ++ +
Sbjct: 116 QKKLGAGFFGGEGFIMQKLSGQGLAFIEIDGYCKEYELAPGQQIVLDTGYLAAMEATCKM 175
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
IK G ++ A+FGG+ +VTGPG V++QS+P ++
Sbjct: 176 DIKSTGGVKNALFGGEGFFNTIVTGPGKVYVQSMPMSKM 214
>gi|293376638|ref|ZP_06622866.1| conserved hypothetical protein TIGR00266 [Turicibacter sanguinis
PC909]
gi|325839400|ref|ZP_08166839.1| TIGR00266 family protein [Turicibacter sp. HGF1]
gi|292644864|gb|EFF62946.1| conserved hypothetical protein TIGR00266 [Turicibacter sanguinis
PC909]
gi|325490520|gb|EGC92836.1| TIGR00266 family protein [Turicibacter sp. HGF1]
Length = 227
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 62/216 (28%), Positives = 103/216 (47%), Gaps = 2/216 (0%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
I G + I+L E V + G M +M + M G+ + G+++
Sbjct: 5 IFGNDLPGVSIVLDKGESVYTQSGGMAWMDNGITMNTNIKGGLMKGIGRMFSGESLFMAT 64
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
+ P+ V A+ +I+ +DL G + +CQ +FL V +S +RV
Sbjct: 65 YTSSLPNQEIV-FASTFPGKIIELDLVP-GQDWICQKSSFLVGEPGVNLSVEWTRRVSAG 122
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
+ G EGF+ Q+L+G G F+ GSVV++ L+ GEVI VD +V V +I+
Sbjct: 123 LFGGEGFILQRLSGHGKVFLEVAGSVVKRELKPGEVIKVDTGNVVGFDRQVKYEIETVKG 182
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ FGG+ L +TGPG V++Q+L ++ R+
Sbjct: 183 FKNIFFGGEGLFLTKLTGPGTVYLQTLTMQNVASRL 218
>gi|154483410|ref|ZP_02025858.1| hypothetical protein EUBVEN_01113 [Eubacterium ventriosum ATCC
27560]
gi|149735662|gb|EDM51548.1| TIGR00266 family protein [Eubacterium ventriosum ATCC 27560]
Length = 226
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/210 (28%), Positives = 108/210 (51%), Gaps = 13/210 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL----RN 89
+C+ L EK++ G M +MS +M+ME G +FG+ ++ L
Sbjct: 14 ICE--LDAGEKIITEGGGMAWMSPNMKMETTG------GGIGKMFGRALSGDTLFQNIYT 65
Query: 90 PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
++G + IA+ I ++ G E++ Q AFL S V++S +++ + + G
Sbjct: 66 AERTNGMISIASSFPGSIKAFEITP-GNEIILQKSAFLASEAGVELSMHFRKKLGSGLFG 124
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EGF+ QKL+G G+AF G V++ L+ G+ I +D + A++ + + I+ I+
Sbjct: 125 GEGFILQKLSGNGIAFAEFDGHVIEYELQPGQQIVIDTGHLAAMSVTCQMDIQTVPGIKN 184
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
GG+ L VVTGPG V++Q++P ++
Sbjct: 185 MFLGGEGLFNTVVTGPGHVWLQTMPISNMA 214
>gi|145352533|ref|XP_001420596.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580831|gb|ABO98889.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 243
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 63/220 (28%), Positives = 110/220 (50%), Gaps = 10/220 (4%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGS-MEMENIYIPENEVGMWQWLFGKTITSV 85
++G + Q+ QI LKP E+V A PG+MC+MS + M + ++ + + L G+
Sbjct: 7 VVGTDTQIAQIKLKPGEEVAASPGAMCYMSSNVMSLTSLGPGGLPRAIMRALAGEPFFIN 66
Query: 86 VLRNPGPSDGFVGIAA--PSLARILPIDLAM-FGGELLCQPDAFLCSVNDVKVS-----N 137
RN G +G++ + PS +I I+L ++LC D+F+CS +++++
Sbjct: 67 TFRNLGSVEGYIAVGGLEPS-DKIAVIELKDDTKHDILCARDSFMCSKGNIRITAAAALR 125
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
T + + + F + L G G I G+VV++ L+ E + VD ++A+ SV
Sbjct: 126 TTENDFASFADMLNIFNGEWLIGNGTVCINGKGTVVRRTLKKDEEMVVDARAVMALERSV 185
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
++ AVFGG L + GPG ++QSLP +
Sbjct: 186 GYDLEIQSSPLAAVFGGAGLFFVRLHGPGTFYLQSLPVQK 225
>gi|336433991|ref|ZP_08613797.1| hypothetical protein HMPREF0991_02916 [Lachnospiraceae bacterium
2_1_58FAA]
gi|336014730|gb|EGN44564.1| hypothetical protein HMPREF0991_02916 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 226
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 115/219 (52%), Gaps = 2/219 (0%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I+G L+ E ++ G+M +MS +M+ME + G++I
Sbjct: 3 YEIVGETLPAVICTLEEGETMITEKGAMSWMSPNMKMETSTNGGIGKAFGRMFSGESIFQ 62
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
G G + A+ +I+ ++A G E++ Q FL + V +S +R+
Sbjct: 63 NTYTAKG-GHGMIAFASSFPGKIVAHEVAP-GREIVVQKSGFLAAEAGVDLSIFFQKRIG 120
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ I G EGF+ QKL+G+G+AF+ G VV+ L+ G+ I VD + A+ ++ +++I+
Sbjct: 121 SGIFGGEGFIMQKLSGRGMAFLEFDGHVVEYELQPGQQIVVDTGYLAAMDATCSMEIQSV 180
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
++ +FGG+ + V++GPG +++Q++P ++ +A
Sbjct: 181 PGVKNMLFGGEGVFNTVISGPGHIWLQTMPISNVAGVLA 219
>gi|226325805|ref|ZP_03801323.1| hypothetical protein COPCOM_03618 [Coprococcus comes ATCC 27758]
gi|225205929|gb|EEG88283.1| TIGR00266 family protein [Coprococcus comes ATCC 27758]
Length = 227
Score = 88.2 bits (217), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 63/225 (28%), Positives = 113/225 (50%), Gaps = 12/225 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G V L+ E+++ GSM +MS +M+ME + G + FG+
Sbjct: 1 MKYEIKGDTLPVVICHLEAGERMITERGSMAWMSPNMKMET-----STNGGFGKAFGRMF 55
Query: 83 T-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSN 137
+ + +N + G G+ A A P + F G E + Q AFL V +S
Sbjct: 56 SGEAMFQNIYTAQGGNGLIA--FASSFPGSIKAFEIGPGQEYIFQKKAFLAGEPGVNISV 113
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
++V + + G EGF+ QK++G G+ F G VV+ L+ G+ I +D + A++ +
Sbjct: 114 HFHKKVASGLFGGEGFILQKVSGNGIVFAEFDGHVVEYELQPGQQIVIDSGYLAAMSVTC 173
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ I+ ++ VFGG+ L V+TGPG V++Q++P ++ +
Sbjct: 174 QMDIQTVPGLKNIVFGGEGLFNTVITGPGHVWLQTMPISSVADSL 218
>gi|154505420|ref|ZP_02042158.1| hypothetical protein RUMGNA_02942 [Ruminococcus gnavus ATCC 29149]
gi|153794263|gb|EDN76683.1| TIGR00266 family protein [Ruminococcus gnavus ATCC 29149]
Length = 228
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 58/219 (26%), Positives = 115/219 (52%), Gaps = 2/219 (0%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I+G L+ E ++ G+M +MS +M+ME + G++I
Sbjct: 5 YEIVGETLPAVICTLEEGETMITEKGAMSWMSPNMKMETSTNGGIGKAFGRMFSGESIFQ 64
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
G G + A+ +I+ ++A G E++ Q FL + V +S +R+
Sbjct: 65 NTYTAKG-GHGMIAFASSFPGKIVAHEVAP-GREIVVQKSGFLAAEAGVDLSIFFQKRIG 122
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ I G EGF+ QKL+G+G+AF+ G VV+ L+ G+ I VD + A+ ++ +++I+
Sbjct: 123 SGIFGGEGFIMQKLSGRGMAFLEFDGHVVEYELQPGQQIVVDTGYLAAMDATCSMEIQSV 182
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
++ +FGG+ + V++GPG +++Q++P ++ +A
Sbjct: 183 PGVKNMLFGGEGVFNTVISGPGHIWLQTMPISNVAGVLA 221
>gi|156741188|ref|YP_001431317.1| hypothetical protein Rcas_1202 [Roseiflexus castenholzii DSM 13941]
gi|156232516|gb|ABU57299.1| protein of unknown function DUF124 [Roseiflexus castenholzii DSM
13941]
Length = 253
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 68/227 (29%), Positives = 110/227 (48%), Gaps = 13/227 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------QW 76
+ +QI+G Q + L P E V + G+M +MSG+++M GM+ +
Sbjct: 26 MEYQIIGTTMQAVILELDPGETVYSESGAMSWMSGNIQMATNTRGGGLGGMFKRAISGES 85
Query: 77 LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
LF TSV G V A+ +I+P L G ++ Q AFLC+ V +
Sbjct: 86 LFLNEFTSV------GGKGIVAFASDFPGKIVPAPLRE-GQTMIVQKQAFLCAEKTVGLD 138
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+R+ G EGF+ QKLTG G+AF+ G +V+ L+ G+V+ VD + +
Sbjct: 139 IHFRKRMGAGFFGGEGFIMQKLTGPGVAFVCLDGEIVEYTLDAGQVLKVDTGHVAMYEPT 198
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
V I+ + + GG+ L + GPG V++Q++P L++ IA
Sbjct: 199 VEFDIEMVKGFKNILLGGEGLFLTTLRGPGRVWLQTMPTTNLAKAIA 245
>gi|323454064|gb|EGB09935.1| hypothetical protein AURANDRAFT_14036, partial [Aureococcus
anophagefferens]
Length = 235
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 64/233 (27%), Positives = 108/233 (46%), Gaps = 8/233 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+ +LG + L P E + PG+M FMS + M + + Q + G+ +
Sbjct: 1 YDVLGDTYGIVAKTLHPGETFHSEPGAMVFMSNDVRMSAKFGRGFRGAVSQAVSGEALAK 60
Query: 85 VVLRNPGPSDGFVGIAAPS-LARILPIDLAMF-GGELLCQPDAFLCSVNDVKVSNTLDQR 142
V N G G+VG+ A + ++P+ LA GG + + A++ DV S
Sbjct: 61 VEYTNAGAGPGYVGMTANQPFSTVIPVQLAALPGGSINVKRGAYMAGTPDVTASMKF-LP 119
Query: 143 VRNVIGGIEGFLR---QKLTG--QGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
R++ G L Q ++G G AF+ A G++V K+L+ GE + VD + IV + V
Sbjct: 120 ARDLAACCCGGLPPVIQSVSGAPHGTAFLSAAGTIVAKSLQPGEAVVVDSTSIVGFENDV 179
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+K G GG+ V+TGPG V++QS+ +L ++ +S +
Sbjct: 180 QFDVKRVGDCSTCCLGGEGCYNTVLTGPGNVYLQSISIDKLMAQLVTVQSSDD 232
>gi|308809826|ref|XP_003082222.1| unnamed protein product [Ostreococcus tauri]
gi|116060690|emb|CAL57168.1| unnamed protein product, partial [Ostreococcus tauri]
Length = 475
Score = 87.4 bits (215), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 68/245 (27%), Positives = 118/245 (48%), Gaps = 30/245 (12%)
Query: 18 PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGS-MEMENIYIPENEVGMWQW 76
PQD I+G + Q+ QI L P E+V A PG+MC+MS + M + ++ + +
Sbjct: 218 PQDV----NIVGTDTQIAQIKLSPGEEVAASPGAMCYMSSNVMSVTSLGPGGLPRAIMRA 273
Query: 77 LFGKTITSVVLRNPGPSDGFVGIAA--PSLARILPIDLA-MFGGELLCQPDAFLCSVNDV 133
L G+ +N G ++G++ + PS +I I+L ++LC D+++CS +V
Sbjct: 274 LAGEPFFLNTFKNLGVTEGYIAVGGLEPS-DKIAVIELKDEIKHDILCARDSYMCSKGNV 332
Query: 134 KVS------------NTLDQRVRNVIG---------GIEGFLRQKLTGQGLAFILAGGSV 172
+++ ++ ++ IG G F + LTG G I G+V
Sbjct: 333 RITAAAALRSGVDDFDSFSDMLKYRIGLAARLVLSKGNSIFNGEWLTGSGTVCITGKGTV 392
Query: 173 VQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
V++ L+ E + VD ++A+ SV +++ AVFGG L + GPG ++QS
Sbjct: 393 VRRTLKKDEELVVDARSVMALERSVGYELEVQNSPLAAVFGGAGLFFVRLQGPGTFYLQS 452
Query: 233 LPFHR 237
LP +
Sbjct: 453 LPVQK 457
>gi|148657327|ref|YP_001277532.1| hypothetical protein RoseRS_3221 [Roseiflexus sp. RS-1]
gi|148569437|gb|ABQ91582.1| protein of unknown function DUF124 [Roseiflexus sp. RS-1]
Length = 269
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 110/225 (48%), Gaps = 9/225 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ +QI+G Q + L P E V + G+M +MSG+++M G +F + I
Sbjct: 22 MEYQIVGTTLQAVILELDPGETVYSESGAMSWMSGNIQMAT----NTRGGGLGGMFKRAI 77
Query: 83 TSVVLR----NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
+ L G V A+ +I+P+ L + G ++ Q AFLC+ V +
Sbjct: 78 SGESLFLNEFTSAGGKGIVAFASDFPGKIVPVPL-LEGQSMIVQKQAFLCAEKTVGLDIH 136
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+R+ G EGF+ QKLTG G+AF+ G +V+ L+ G+V+ VD + +V
Sbjct: 137 FRKRLGAGFFGGEGFIMQKLTGPGVAFVCLDGEIVEYTLDAGQVLKVDTGHVAMYEPTVE 196
Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
I+ + + GG+ L + GPG V++Q++P L++ +A
Sbjct: 197 FDIEMVKGFKNILLGGEGLFLTTLRGPGRVWLQTMPTANLAKAVA 241
>gi|313897924|ref|ZP_07831465.1| TIGR00266 family protein [Clostridium sp. HGF2]
gi|346317754|ref|ZP_08859229.1| hypothetical protein HMPREF9022_04886 [Erysipelotrichaceae
bacterium 2_2_44A]
gi|422329865|ref|ZP_16410889.1| TIGR00266 family protein [Erysipelotrichaceae bacterium 6_1_45]
gi|312957459|gb|EFR39086.1| TIGR00266 family protein [Clostridium sp. HGF2]
gi|345898897|gb|EGX68756.1| hypothetical protein HMPREF9022_04886 [Erysipelotrichaceae
bacterium 2_2_44A]
gi|371655570|gb|EHO20916.1| TIGR00266 family protein [Erysipelotrichaceae bacterium 6_1_45]
Length = 225
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 68/226 (30%), Positives = 112/226 (49%), Gaps = 14/226 (6%)
Query: 23 IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+ ++I+GG+ A VC+ L EK+ G MC+M M++ N G G+
Sbjct: 1 MKYEIIGGQLPAVVCK--LDRGEKMFTESGGMCWMEEGFSMDS-----NTRGGLLKGLGR 53
Query: 81 TIT--SVVLRNPGPSDGFVGIAAPSL--ARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
++ S+ L S IA S +ILP+ L G L+ Q +AFL + + V +
Sbjct: 54 AMSGESIFLTTYTSSHDHAEIAFGSSFPGKILPVTLHD-GKTLIVQKNAFLAAEDGVNLQ 112
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+++ G EGF+ QKLTG G AF+ G +++K L GE + VD I S
Sbjct: 113 AHFRKKLGTGFFGGEGFVLQKLTGNGTAFLEIDGDIIEKTLAPGETLQVDQGYIAGFEES 172
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
V+ I ++ F G+ + A ++GPG +++Q++PF L+ R+
Sbjct: 173 VHFDITTVKGLKNKFFSGEGMFLATLSGPGKIWLQTMPFSVLADRV 218
>gi|373125276|ref|ZP_09539110.1| TIGR00266 family protein [Erysipelotrichaceae bacterium 21_3]
gi|371657477|gb|EHO22775.1| TIGR00266 family protein [Erysipelotrichaceae bacterium 21_3]
Length = 225
Score = 87.0 bits (214), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 68/226 (30%), Positives = 112/226 (49%), Gaps = 14/226 (6%)
Query: 23 IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+ ++I+GG+ A VC+ L EK+ G MC+M M++ N G G+
Sbjct: 1 MKYEIIGGQLPAVVCK--LDRGEKMFTESGGMCWMEEGFSMDS-----NTRGGLLKGLGR 53
Query: 81 TIT--SVVLRNPGPSDGFVGIAAPSL--ARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
++ S+ L S IA S +ILP+ L G L+ Q +AFL + + V +
Sbjct: 54 AMSGESIFLTTYTSSHDHAEIAFGSSFPGKILPVTLHD-GKTLIVQKNAFLAAEDGVDLQ 112
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+++ G EGF+ QKLTG G AF+ G +++K L GE + VD I S
Sbjct: 113 AHFRKKLGTGFFGGEGFVLQKLTGNGTAFLEIDGDIIEKTLAPGETLQVDQGYIAGFEES 172
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
V+ I ++ F G+ + A ++GPG +++Q++PF L+ R+
Sbjct: 173 VHFDITTVKGLKNKFFSGEGMFLATLSGPGKIWLQTMPFSVLADRV 218
>gi|167770183|ref|ZP_02442236.1| hypothetical protein ANACOL_01526 [Anaerotruncus colihominis DSM
17241]
gi|167667505|gb|EDS11635.1| TIGR00266 family protein [Anaerotruncus colihominis DSM 17241]
Length = 226
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/206 (29%), Positives = 111/206 (53%), Gaps = 4/206 (1%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
VC+ L+P EK++ GSM +M+ +M+ME + + G+ + + G S
Sbjct: 14 VCE--LQPGEKMITERGSMSWMTSNMQMETTSNGGVGKALGRMFSGEALFQNIYTAVG-S 70
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G + A+ I ++++ EL+ Q FL S + V++S +++ + G EGF
Sbjct: 71 SGIIAFASSFPGSIRAVEISP-NNELIVQKSGFLASESGVELSIFFQKKLGSGFFGGEGF 129
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ Q+L+G+G+AF+ G V+ +L+ GE + VD + A++ S ++ I+ I+ +FG
Sbjct: 130 IMQRLSGKGMAFVEIDGYAVEYDLDPGEKLVVDTGYLAAMSGSCSIDIQTVPGIKNKLFG 189
Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLS 239
G+ VVTGPG V +Q++P ++
Sbjct: 190 GEGFFNTVVTGPGKVILQTMPISTVA 215
>gi|224012403|ref|XP_002294854.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220969293|gb|EED87634.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 556
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 69/274 (25%), Positives = 123/274 (44%), Gaps = 38/274 (13%)
Query: 13 YVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMS---------------- 56
+ + P D QI G E+Q+ ++ L P + + A G+M +MS
Sbjct: 97 HAHLQPIDFTTASQISGEESQILEVQLAPNQMLRAESGAMLYMSYLVRILCDVVFLCVAF 156
Query: 57 ---GSMEME-NIYIPEN----EVGMWQWLFGKTITSVVLRNPGPSD-------GFVGIAA 101
++ME ++ + N + G+ + + G+ + + PS G V +
Sbjct: 157 NRTEGVQMETSMGVGSNRTNFQTGLTRMITGQNLMVSDFQYVPPSGSPASKQYGTVALGT 216
Query: 102 PSLARILPIDLAMF-GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTG 160
A+IL L+ + L+CQ AFL + + + + + G EGF+ Q L G
Sbjct: 217 DFPAKILKFQLSDYPDDRLICQKGAFLAGSHSINMEMAFTKNFTSGFFGGEGFVLQSLQG 276
Query: 161 QG------LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
F+ A G+VV+K L+ E + V +VA+TS+++ + + +FGG
Sbjct: 277 SDSTNRGETVFLKAYGTVVKKELKENETLRVSSGSLVAMTSTIDFDVTTLPGFKNVLFGG 336
Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+ L +TGPG+V++Q +P R+ IAR + S
Sbjct: 337 EGLFVTTLTGPGVVWLQGMPIDRMISEIARKIPS 370
>gi|298705724|emb|CBJ49032.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 273
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/249 (30%), Positives = 116/249 (46%), Gaps = 18/249 (7%)
Query: 1 MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
M+ PF + P DA F LG ++V Q+ L P + ++A + + ++
Sbjct: 3 MSEPFANPDASP-------DASPRFLSLGEGSEVLQVALCPGDAILANEKAFVYRGLGLQ 55
Query: 61 MENIYIPENEVG-MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGEL 119
E P +G W+ K V N G S ++G+ A +++P+ LA G L
Sbjct: 56 QE--LQPTGPMGRALGWIGVKNDRVVKFTNEGNSPTYLGLTASLPGKVIPVSLAK-SGTL 112
Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
L +P FLCSV + +S + + +K+ G G+ +I GG+ VQK L
Sbjct: 113 LAKPSVFLCSVVENDISTAKIKLQSEGQSSQQSVEMRKIAGAGMCYIHGGGAAVQKALGR 172
Query: 180 GEVITVDVSCIVAVTSSVN-VQIKYNGPIRRA--VFGGDNLVTAVVTGPGIVFIQSLPFH 236
GE + V+ C+ AV+ + V + NGP+ R F G +TGPG VFIQS
Sbjct: 173 GESMVVEFGCVAAVSDACKIVPEERNGPLIREPPYFQG---FQVKITGPGSVFIQSTHTG 229
Query: 237 RLSQRIARA 245
RLS R +RA
Sbjct: 230 RLS-RASRA 237
>gi|342215534|ref|ZP_08708181.1| putative TIGR00266 family protein [Peptoniphilus sp. oral taxon 375
str. F0436]
gi|341586424|gb|EGS29824.1| putative TIGR00266 family protein [Peptoniphilus sp. oral taxon 375
str. F0436]
Length = 225
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 60/221 (27%), Positives = 108/221 (48%), Gaps = 3/221 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ + I G V + L+ E ++++ G +M G ++ E I + + G+++
Sbjct: 1 MKYYIEGDNLPVVRFELEEGEGLISQAGGRAWMKGDIKTETISGGGVGKMLGRMFSGESL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ PS+ + A+ I DLA G ++ Q AFLC+ V +S ++
Sbjct: 61 FLSSYKAKSPSE--IVFASDFPGSIYAFDLAE-GQSIIAQKKAFLCADQGVDLSIYFRKK 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + G EGF+ QK+TG G F+ G + L VGE + +D + + S+ + I+
Sbjct: 118 LGTGLFGGEGFIMQKITGPGKVFLEIDGHCQEYILNVGEKLVMDTGSLAMMDSTCEMTIE 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
++ +FGG+ L VV GPG V +QS+P +L++ IA
Sbjct: 178 QVEGMKNILFGGEGLFNTVVEGPGKVLVQSMPIQKLARNIA 218
>gi|358067878|ref|ZP_09154350.1| hypothetical protein HMPREF9333_01231 [Johnsonella ignava ATCC
51276]
gi|356693847|gb|EHI55516.1| hypothetical protein HMPREF9333_01231 [Johnsonella ignava ATCC
51276]
Length = 229
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 64/214 (29%), Positives = 106/214 (49%), Gaps = 3/214 (1%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI GG I L P EK+V+ G + G + E + + G+++
Sbjct: 3 YQIEGGNLPAAVIQLDPGEKIVSELGGRTWAKGPILTETKAPGGVGKSLGRMFSGESLFM 62
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
GP++ +G A+ I+ +LA G L+CQ AFLC+ +V++S +++
Sbjct: 63 SFYTAQGPAE--IGFASSFAGSIVVRELAQ-GESLICQKSAFLCASAEVELSIYFQKKLG 119
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G EGF+ QK+TG G+ F+ G V + NL GE I D + + +S + I+
Sbjct: 120 AGFFGGEGFIMQKVTGPGIVFLELDGYVKEYNLMPGEEIVCDTGLVALMDASCTMDIRVV 179
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
++ +FGG+ L+ VV GPG +IQ++ RL
Sbjct: 180 KGLKNKLFGGEGLIDTVVIGPGRAYIQTMSLPRL 213
>gi|225873709|ref|YP_002755168.1| hypothetical protein ACP_2115 [Acidobacterium capsulatum ATCC
51196]
gi|225791728|gb|ACO31818.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
51196]
Length = 252
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 65/236 (27%), Positives = 110/236 (46%), Gaps = 14/236 (5%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK----- 80
+ILG + +L+P E +++ G + +M+ ++M ++G +FG
Sbjct: 4 RILGTTMPAIEFLLEPNEVIISESGELSWMTAPIQMRT----HTQMGGGGGIFGAIKRMA 59
Query: 81 ---TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
++ R G + G V A I+P+++ G E FLC V +
Sbjct: 60 GGGSLFMTEYRAIG-APGEVAFATKVPGHIVPVEVGQ-GREFYIHRHGFLCGTEQVTLGV 117
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
Q + I G +GFL QK+ G G+A++ G VV K+L GE + V + A +SV
Sbjct: 118 GFQQSLGAGIFGGDGFLLQKVGGNGVAWLELSGEVVVKDLRPGETLRVHPGHVGAFENSV 177
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
+ QI I+ +FGGD + A +TGPG V++Q+LP L+ + + R+
Sbjct: 178 SFQITRVPGIKNMIFGGDGIFLAALTGPGRVWLQTLPLANLAHALMEYLPQGENRQ 233
>gi|225571037|ref|ZP_03780054.1| hypothetical protein CLOHYLEM_07135 [Clostridium hylemonae DSM
15053]
gi|225160118|gb|EEG72737.1| hypothetical protein CLOHYLEM_07135 [Clostridium hylemonae DSM
15053]
Length = 238
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 114/226 (50%), Gaps = 12/226 (5%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
+DA + ++I G V L+ EK++ G M +MS +M ME +G F
Sbjct: 8 EDANMKYEIQGETLPVVICQLEAGEKMITEGGGMAWMSPNMLMETT--TNGGIGK---AF 62
Query: 79 GKTITSV-VLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDV 133
G+ + + +N + G G+ A A P + F G E++ Q AFL V
Sbjct: 63 GRMFSGERMFQNIYTAQGGEGMIA--FASSFPGSVEAFQVGPGQEMILQKSAFLAGEASV 120
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
++S +++ + + G EGF+ QK G G F G V++ L+ G+ I VD + A+
Sbjct: 121 ELSVFFNKKFSSGLFGGEGFIMQKAGGYGTVFAEFDGHVIKYELQPGQQIVVDTGHLAAM 180
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
+++ +++IK ++ +FGG+ + ++TGPG V++Q++P ++
Sbjct: 181 SATCSMEIKTVPGVKNMLFGGEGIFNTIITGPGKVWLQTMPISNVA 226
>gi|291524805|emb|CBK90392.1| conserved hypothetical protein TIGR00266 [Eubacterium rectale DSM
17629]
Length = 226
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 109/209 (52%), Gaps = 12/209 (5%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT-SVVLRNPGPSDGFV 97
L+ E ++ G+M +M+ +M+ME G +FG+ + + +N S G
Sbjct: 17 LEANETMITEKGAMSWMTPNMKMET-----TSNGGIGKMFGRAFSGESMFQNRYTSMGGP 71
Query: 98 GIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G+ A A P + F G E++ Q AFL S + V++S +R+ G EGF
Sbjct: 72 GMIA--FASCFPGCIRPFQIAPGQEIIAQKSAFLASTSGVELSVFFQKRIGAGFFGGEGF 129
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ Q+L+G GLAF+ G + + L G+ + +D + A+T S ++I+ I+ VFG
Sbjct: 130 IMQRLSGNGLAFLEFDGHIKEYELAPGQQLVIDTGYLAAMTGSCQLEIQTVPGIKNVVFG 189
Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G+ L VVTGPG V++QS+P +L+ I
Sbjct: 190 GEGLFNTVVTGPGRVWLQSMPVSQLAGAI 218
>gi|238924070|ref|YP_002937586.1| hypothetical protein EUBREC_1708 [Eubacterium rectale ATCC 33656]
gi|238875745|gb|ACR75452.1| Hypothetical protein EUBREC_1708 [Eubacterium rectale ATCC 33656]
gi|291528874|emb|CBK94460.1| conserved hypothetical protein TIGR00266 [Eubacterium rectale
M104/1]
Length = 226
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 110/209 (52%), Gaps = 12/209 (5%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT-SVVLRNPGPSDGFV 97
L+ E ++ G+M +M+ +M+ME G +FG+ + + +N S G
Sbjct: 17 LEANETMITEKGAMSWMTPNMKMET-----TSNGGIGKMFGRAFSGESMFQNRYTSMGGP 71
Query: 98 GIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G+ A A P + F G E++ Q AFL S + V++S +R+ + G EGF
Sbjct: 72 GMIA--FASCFPGCIRPFQIAPGQEIIAQKSAFLASTSGVELSVFFQKRIGSGFFGGEGF 129
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ Q+L+G GLAF+ G + + L G+ + +D + A+T S ++I+ I+ VFG
Sbjct: 130 IMQRLSGNGLAFLEFDGHIKEYELAPGQQLVIDTGYLAAMTGSCQLEIQTVPGIKNVVFG 189
Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G+ L VVTGPG V++QS+P +L+ I
Sbjct: 190 GEGLFNTVVTGPGRVWLQSMPVSQLAGAI 218
>gi|393722397|ref|ZP_10342324.1| hypothetical protein SPAM2_02025 [Sphingomonas sp. PAMC 26605]
Length = 259
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 63/230 (27%), Positives = 109/230 (47%), Gaps = 7/230 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI-- 82
+ I G Q I L E++V++ +M +MS + M+ G+ + + G +I
Sbjct: 3 YDIAGTVMQTVSIDLDRGERIVSQTHAMAWMSDGIRMDTHTGGGLFAGLKRAMSGGSIFI 62
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
T + P V A +I+ L G L+C+ + FLC+ V + QR
Sbjct: 63 TEYIADAPA----HVAFAPRFPGKIIARTLRA-GESLICRKETFLCAEYSVSLDIAFQQR 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + EGF+ Q++TG G ++ G V++K+L GE + V I +V+ ++
Sbjct: 118 LGAGVFAGEGFILQRVTGPGTVWLDLSGEVIEKDLAPGERLRVHAGHIGIQEPTVSTDVQ 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
R +FGG+ L A +TGPG V++QS+P L++ +AR + R
Sbjct: 178 MVRGFRNILFGGEGLFLATLTGPGRVWLQSMPIMNLAEEVARHLPGNEER 227
>gi|156743638|ref|YP_001433767.1| hypothetical protein Rcas_3709 [Roseiflexus castenholzii DSM 13941]
gi|156234966|gb|ABU59749.1| protein of unknown function DUF124 [Roseiflexus castenholzii DSM
13941]
Length = 319
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 67/227 (29%), Positives = 110/227 (48%), Gaps = 13/227 (5%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------QW 76
+ +QI+G Q + L P E V + G+M +MSG+++M GM+ +
Sbjct: 71 MEYQIVGTTLQAVILELDPGETVYSESGAMSWMSGNIQMATNTRGGGLGGMFKRAISGES 130
Query: 77 LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
LF TSV G V A+ +I+P L G ++ Q AFLC+ V +
Sbjct: 131 LFLNEFTSV------GGKGIVAFASDFPGKIVPAPLRE-GQTMIVQKQAFLCAEKTVGLD 183
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+R+ G EGF+ QKLTG G+AF+ G +V+ L+ G+V+ VD + +
Sbjct: 184 IHFRKRMGAGFFGGEGFIMQKLTGPGVAFVCLDGEIVEYTLDAGQVLKVDTGHVAMYEPT 243
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
V I+ + + GG+ L + GPG V++Q++P L++ +A
Sbjct: 244 VEFDIEMVKGFKNILLGGEGLFLTTLRGPGRVWLQTMPTTNLAKAVA 290
>gi|325663733|ref|ZP_08152137.1| hypothetical protein HMPREF0490_02878 [Lachnospiraceae bacterium
4_1_37FAA]
gi|325470226|gb|EGC73459.1| hypothetical protein HMPREF0490_02878 [Lachnospiraceae bacterium
4_1_37FAA]
Length = 226
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 58/215 (26%), Positives = 106/215 (49%), Gaps = 12/215 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QILG V L E ++ GSM +MS +M M+ + G FG+ +
Sbjct: 3 YQILGDTLPVVICRLDGGESMITEKGSMAWMSPNMRMDT-----STNGGLGKAFGRMFSG 57
Query: 85 -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
+ +N + G G + A P + F G E++ Q FL + +++S
Sbjct: 58 EAIFQNIYTAQG--GAGEIAFASSFPGSIRAFHIRPGEEIVVQKGGFLAAEAGIQLSVHF 115
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ + G EGF+ Q+++G+G+ F G V++ L+ G+ I +D + A+T+S +
Sbjct: 116 HKKLASGFFGGEGFILQRISGEGVMFAEFDGHVIEYELQPGQQIVIDTGHLAAMTASCQM 175
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
I+ ++ FGG+ V+TGPG V++Q++P
Sbjct: 176 DIQTVPGLKNKFFGGEGFFNTVITGPGHVWLQTMP 210
>gi|289450222|ref|YP_003475428.1| hypothetical protein HMPREF0868_1144 [Clostridiales genomosp. BVAB3
str. UPII9-5]
gi|289184769|gb|ADC91194.1| TIGR00266 family protein [Clostridiales genomosp. BVAB3 str.
UPII9-5]
Length = 224
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/228 (26%), Positives = 108/228 (47%), Gaps = 9/228 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+ I GG V I L P E++++ G +M G + E E+ G+ +FG+ ++
Sbjct: 3 YSIEGGSLPVVIINLNPGERLISEAGGRRWMRGDIRTET----ESHGGIG-GMFGRMLSG 57
Query: 85 VVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
L S+ + A+ +I+ +L G ++CQ AFLC+ + V + +
Sbjct: 58 ESLFLSSYTAFSESEIAFASSFPGKIIARELRN-GESIVCQKSAFLCATDQVSLRVFFQK 116
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + G EGF+ QK+TG G+ F+ G + + +L E I D + + + + +
Sbjct: 117 NIGSGYFGGEGFIMQKITGPGMVFLELDGYIKEYDLAPNESIVCDTGAVAMIDETCQIDV 176
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
+ ++ VFGG+ L VVTGPG V +QS S+ I+ + +
Sbjct: 177 RVVSGLKNIVFGGEGLFDTVVTGPGKVALQSASITDFSKLISSCIKTK 224
>gi|423075594|ref|ZP_17064311.1| hypothetical protein HMPREF0322_03744 [Desulfitobacterium hafniense
DP7]
gi|361853368|gb|EHL05524.1| hypothetical protein HMPREF0322_03744 [Desulfitobacterium hafniense
DP7]
Length = 111
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 37/75 (49%), Positives = 54/75 (72%)
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
AF+ AGG+++QK L+ GE I +D C+VA+T +VN I++ G I+ A+FGG+ L +T
Sbjct: 17 AFMHAGGTIIQKELQPGETIQLDTGCLVAMTKTVNHDIRFAGDIKNALFGGEGLALGTLT 76
Query: 224 GPGIVFIQSLPFHRL 238
GPG V++QSLPF L
Sbjct: 77 GPGTVWLQSLPFCSL 91
>gi|167761381|ref|ZP_02433508.1| hypothetical protein CLOSCI_03789 [Clostridium scindens ATCC 35704]
gi|167661047|gb|EDS05177.1| TIGR00266 family protein [Clostridium scindens ATCC 35704]
Length = 226
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 62/215 (28%), Positives = 110/215 (51%), Gaps = 12/215 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI G V L+ EK++ G M +MS +M ME +G FG+ +
Sbjct: 3 YQIQGETLPVVICELEGGEKMITEGGGMAWMSPNMLMETT--TNGGIGK---AFGRMFSG 57
Query: 85 -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
+ +N + G G+ A A P + F G E++ Q AFL +++S
Sbjct: 58 EHMFQNIYTAQGGPGMIA--FASSFPGSVRAFQIAPGQEMILQKSAFLAGEMGIELSVFF 115
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ + + G EGF+ QK++G G+ F G V++ L G+ I VD + A+T++ ++
Sbjct: 116 NKKFSSGLFGGEGFIMQKVSGHGIVFTEFDGHVIEYELHPGQQIVVDTGHLAAMTATCSM 175
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
+IK ++ +FGG+ + V+TGPG V++Q++P
Sbjct: 176 EIKGVPGVKNMLFGGEGIFNTVITGPGRVWLQTMP 210
>gi|197302962|ref|ZP_03168012.1| hypothetical protein RUMLAC_01689 [Ruminococcus lactaris ATCC
29176]
gi|197297957|gb|EDY32507.1| hypothetical protein RUMLAC_01689 [Ruminococcus lactaris ATCC
29176]
Length = 154
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 46/133 (34%), Positives = 78/133 (58%)
Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
G EL+ Q FL S V +S +++ I G EGF+ Q+L+G G+AF+ G VVQ
Sbjct: 20 GQELIVQKTGFLASEAGVSLSVYFQKKLGAGIFGGEGFIMQRLSGSGMAFLEFDGHVVQY 79
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
+L+ G+ I VD + A+ S +++++ ++ VFGG+ + VVTGPG V++Q++P
Sbjct: 80 DLQPGQQIVVDTGYLAAMEGSCSMEVQTVPGLKNMVFGGEGVFNTVVTGPGRVWLQTMPI 139
Query: 236 HRLSQRIARAVTS 248
++ I + + S
Sbjct: 140 SNVAGEILKFMPS 152
>gi|310829668|ref|YP_003962025.1| hypothetical protein [Eubacterium limosum KIST612]
gi|308741402|gb|ADO39062.1| hypothetical protein ELI_4120 [Eubacterium limosum KIST612]
Length = 228
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 99/220 (45%), Gaps = 2/220 (0%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I+G V ++ L E++ G M +M +ME + + G ++
Sbjct: 1 MKYEIIGDTLPVVELELDRGEQIYTESGGMSWMDPCFDMETSTRGGALKAIKRSFSGNSL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
G G + A+ I + L G ++C AFL + V S +
Sbjct: 61 FLTTYTCTG-DKGKIAFASSFPGNIRAVHLEA-GQSIICAKTAFLAAEESVDFSIFFKKS 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
V+ + G GF+ QKLTG GL F+ GS ++ NLE G+ I VD I V+ +
Sbjct: 119 VKTGVFGGAGFILQKLTGPGLVFLELNGSTIEYNLEEGQTINVDQGHIAVFQEKVHFDVT 178
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ +F G+ L A +TGPG V +QS+P +L++ +
Sbjct: 179 QVKGAKNILFSGEGLFFATLTGPGRVVLQSMPVDKLAKAL 218
>gi|169335942|ref|ZP_02863135.1| hypothetical protein ANASTE_02377 [Anaerofustis stercorihominis DSM
17244]
gi|169258680|gb|EDS72646.1| TIGR00266 family protein [Anaerofustis stercorihominis DSM 17244]
Length = 225
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 111/225 (49%), Gaps = 4/225 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++++G I L EK+V + G M +MSG++EM G+ + L G++I
Sbjct: 1 MDYKVIGDTFPAVVITLNQGEKIVTQAGGMSWMSGNIEMTTSTRGGLGKGVKRALAGESI 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQ 141
++ + +A +I IDL + G E + Q AFL V + +
Sbjct: 61 FQATYTAKQDNE-HITCSATMPGKI--IDLYLDGEREYIAQKGAFLAGEESVNMDMFTVK 117
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ + + G EG KL+G+G AF+ GG + + NL V EVI V+ + SSVN ++
Sbjct: 118 KISSALFGGEGIFLLKLSGKGTAFLEIGGHLEEYNLGVDEVIKVNTGNVAFFESSVNYEV 177
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+ + +FGG+ L +TGPG V++Q++ L+ ++ +
Sbjct: 178 ETVKGAKNILFGGEGLFLTKLTGPGKVYLQTMNMGELAGALSSYI 222
>gi|374313431|ref|YP_005059861.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358755441|gb|AEU38831.1| protein of unknown function DUF124 [Granulicella mallensis
MP5ACTX8]
Length = 253
Score = 84.0 bits (206), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 62/226 (27%), Positives = 108/226 (47%), Gaps = 4/226 (1%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN---IYIPENEVGMWQWLFGKTI 82
+I+G V + +L+P E +++ G + +M+ S+ M + G + + G
Sbjct: 4 RIIGTTMPVLEALLQPGESLISEAGELSWMTQSIAMSTHTQMAGGGGFFGAIKRMAGGGT 63
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ G V A I+PI++ G E + FLCS +++ Q
Sbjct: 64 LFMSEYTAQGYPGEVAFATKIPGHIVPIEVGP-GHEFMIHRHGFLCSTPQIQLGVGFQQS 122
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ I G +GFL QK+ G G A++ G VV ++L+ GE + V + A +SV+ QI
Sbjct: 123 LGAGIFGGDGFLLQKVAGYGTAWLELSGEVVIRDLQPGETLRVHPGHVGAFQASVSFQIT 182
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
IR FGGD + A +TGPG +++Q+LP ++ + ++S
Sbjct: 183 RIQGIRNMFFGGDGIFLAALTGPGRIWLQTLPLSNMAHALQPYLSS 228
>gi|166030398|ref|ZP_02233227.1| hypothetical protein DORFOR_00059 [Dorea formicigenerans ATCC
27755]
gi|166029756|gb|EDR48513.1| TIGR00266 family protein [Dorea formicigenerans ATCC 27755]
Length = 226
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 107/219 (48%), Gaps = 11/219 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I GG V L+ E ++ GSM +MS +M+ME + G + + G+ +
Sbjct: 3 YEITGGNLPVVICYLQSGEGMITESGSMSWMSPNMKMETV------GGGMKKMLGRAFSG 56
Query: 85 V-VLRN---PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+N DG + A+ + +++ G ++ Q AFL + V++S +
Sbjct: 57 ENAFQNRYSAQGGDGMIAFASSFPGSVNALEIGP-GKSMIVQKKAFLAAEEGVELSVYMQ 115
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ + G EGF+ QKLTGQG AFI G V+ NL G+ + VD + A+ + +
Sbjct: 116 KKLGKGLFGGEGFIMQKLTGQGTAFIEIDGYGVEYNLAPGQEMIVDTGYLAAMEETCTMD 175
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
I ++ GG+ L VV GPG + +Q++P ++
Sbjct: 176 IVKVEGMKNMFLGGEGLFNTVVRGPGKIILQTMPIANVA 214
>gi|154496883|ref|ZP_02035579.1| hypothetical protein BACCAP_01176 [Bacteroides capillosus ATCC
29799]
gi|150273841|gb|EDN00954.1| TIGR00266 family protein [Pseudoflavonifractor capillosus ATCC
29799]
Length = 238
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 112/213 (52%), Gaps = 10/213 (4%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I GG V L+ EK++ GSM +MS +M+ME G +F + +
Sbjct: 16 YEIAGGTFPVVMCYLENGEKMITESGSMVWMSPNMQMET------SGGGLGKMFSRAFSG 69
Query: 85 V-VLRN--PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ +N DG + + +I+PI++ G E++ Q AFL + V++S ++
Sbjct: 70 ENIFQNIYTARGDGMITFGSSFPGKIIPIEIGP-GQEMILQKSAFLAAEAGVELSIHFNR 128
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ G EGF+ Q+L+G+G+AF G +V+ +L G+ + VD + +V++ I
Sbjct: 129 KLGAGFFGGEGFIMQRLSGRGMAFAEIDGDLVEYDLAPGQQLVVDTGNVAGFAPTVSMDI 188
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
+ ++ FGG+ + ++TGPG V++Q++P
Sbjct: 189 QQVPGLKNMFFGGEGIFNTLLTGPGKVWLQTMP 221
>gi|42526357|ref|NP_971455.1| hypothetical protein TDE0845 [Treponema denticola ATCC 35405]
gi|422342684|ref|ZP_16423623.1| hypothetical protein HMPREF9353_02290 [Treponema denticola F0402]
gi|449112516|ref|ZP_21749070.1| TIGR00266 family protein [Treponema denticola ATCC 33521]
gi|449115262|ref|ZP_21751727.1| TIGR00266 family protein [Treponema denticola ATCC 35404]
gi|41816469|gb|AAS11336.1| conserved hypothetical protein TIGR00266 [Treponema denticola ATCC
35405]
gi|325473300|gb|EGC76495.1| hypothetical protein HMPREF9353_02290 [Treponema denticola F0402]
gi|448953614|gb|EMB34404.1| TIGR00266 family protein [Treponema denticola ATCC 35404]
gi|448955978|gb|EMB36742.1| TIGR00266 family protein [Treponema denticola ATCC 33521]
Length = 224
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 62/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)
Query: 23 IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+ ++I+G A VC+ L+ EK++ + G M +MS + ME G G+
Sbjct: 1 MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52
Query: 81 TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
+ L + +DG + A+ I ++ G L+ Q AFL S V +S
Sbjct: 53 MFSGESLFLNKYTANTDGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ G EGF+ QKL+G G F+ GS V+ L GE + +D + A+ +
Sbjct: 112 YFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAVEYTLGAGETMLLDTGYLAAMEETC 171
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++++ ++ +FGG+ L VTGPG + IQ++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIIQTMPIHAVAQSL 216
>gi|449110017|ref|ZP_21746649.1| TIGR00266 family protein [Treponema denticola ATCC 33520]
gi|449120778|ref|ZP_21757161.1| TIGR00266 family protein [Treponema denticola H1-T]
gi|449123183|ref|ZP_21759513.1| TIGR00266 family protein [Treponema denticola MYR-T]
gi|448946703|gb|EMB27557.1| TIGR00266 family protein [Treponema denticola MYR-T]
gi|448947349|gb|EMB28196.1| TIGR00266 family protein [Treponema denticola H1-T]
gi|448957725|gb|EMB38465.1| TIGR00266 family protein [Treponema denticola ATCC 33520]
Length = 224
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)
Query: 23 IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+ ++I+G A VC+ L+ EK++ + G M +MS + ME G G+
Sbjct: 1 MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52
Query: 81 TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
+ L + +DG + A+ I ++ G L+ Q AFL S V +S
Sbjct: 53 MFSGESLFLNKYTASTDGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ G EGF+ QKL+G G F+ GS ++ L GE + +D + A+ +
Sbjct: 112 YFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAIEYTLGAGETMLLDTGYLAAMEETC 171
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++++ ++ +FGG+ L VTGPG + IQ++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIIQTMPIHAVAQSL 216
>gi|449103210|ref|ZP_21739956.1| TIGR00266 family protein [Treponema denticola AL-2]
gi|449125604|ref|ZP_21761906.1| TIGR00266 family protein [Treponema denticola OTK]
gi|449130631|ref|ZP_21766851.1| TIGR00266 family protein [Treponema denticola SP37]
gi|448939573|gb|EMB20490.1| TIGR00266 family protein [Treponema denticola OTK]
gi|448942352|gb|EMB23247.1| TIGR00266 family protein [Treponema denticola SP37]
gi|448965062|gb|EMB45727.1| TIGR00266 family protein [Treponema denticola AL-2]
Length = 224
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)
Query: 23 IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+ ++I+G A VC+ L+ EK++ + G M +MS + ME G G+
Sbjct: 1 MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52
Query: 81 TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
+ L + +DG + A+ I ++ G L+ Q AFL S V +S
Sbjct: 53 MFSGESLFLNKYTANTDGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ G EGF+ QKL+G G F+ GS ++ L GE + +D + A+ +
Sbjct: 112 YFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAIEYTLGAGETMLLDTGYLAAMEETC 171
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++++ ++ +FGG+ L VTGPG + IQ++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIIQTMPIHAVAQSL 216
>gi|449117826|ref|ZP_21754241.1| TIGR00266 family protein [Treponema denticola H-22]
gi|448949717|gb|EMB30541.1| TIGR00266 family protein [Treponema denticola H-22]
Length = 224
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)
Query: 23 IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+ ++I+G A VC+ L+ EK++ + G M +MS + ME G G+
Sbjct: 1 MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52
Query: 81 TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
+ L + +DG + A+ I ++ G L+ Q AFL S V +S
Sbjct: 53 MFSGESLFLNKYTANADGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ G EGF+ QKL+G G F+ GS ++ L GE + +D + A+ +
Sbjct: 112 YFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAIEYTLGAGETMLLDTGYLAAMEETC 171
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++++ ++ +FGG+ L VTGPG + IQ++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIIQTMPIHAVAQSL 216
>gi|288560063|ref|YP_003423549.1| hypothetical protein mru_0806 [Methanobrevibacter ruminantium M1]
gi|288542773|gb|ADC46657.1| hypothetical protein mru_0806 [Methanobrevibacter ruminantium M1]
Length = 226
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 62/222 (27%), Positives = 110/222 (49%), Gaps = 7/222 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I GG + L+ E + G+M FM+ M+M+ G+ + L G TI
Sbjct: 1 MEYEIQGGTFPIVVCTLQEGETMKNETGAMAFMTSGMKMDTNTGGGLLKGLGRALSGDTI 60
Query: 83 --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+ G VG +A S RI+PI L ++ Q +AFL + + V +
Sbjct: 61 FLNYFTAQRDGEQ---VGFSACSPGRIMPIRLDGTN-TIIGQKNAFLAAEDSVDIDIHFR 116
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+ + + G EGF+ QK +G G+AF+ G +++K L GE + +D + A+ SV
Sbjct: 117 KSLGAGLFGGEGFVLQKFSGTGVAFLEIDGEIIKKELAPGEKLLIDPGHLAAMEESVGFD 176
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I+ + +F G+ L + +TGPG V++Q++P L++ +
Sbjct: 177 IERVKGAKNILF-GEGLFFSRLTGPGTVWLQTMPISSLARAL 217
>gi|449106766|ref|ZP_21743428.1| TIGR00266 family protein [Treponema denticola ASLM]
gi|449127490|ref|ZP_21763763.1| TIGR00266 family protein [Treponema denticola SP33]
gi|451968466|ref|ZP_21921695.1| TIGR00266 family protein [Treponema denticola US-Trep]
gi|448944223|gb|EMB25104.1| TIGR00266 family protein [Treponema denticola SP33]
gi|448964116|gb|EMB44789.1| TIGR00266 family protein [Treponema denticola ASLM]
gi|451702479|gb|EMD56881.1| TIGR00266 family protein [Treponema denticola US-Trep]
Length = 224
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)
Query: 23 IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+ ++I+G A VC+ L+ EK++ + G M +MS + ME G G+
Sbjct: 1 MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52
Query: 81 TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
+ L + +DG + A+ I ++ G L+ Q AFL S V +S
Sbjct: 53 MFSGESLFLNKYTANTDGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+++ G EGF+ QKL+G G F+ GS V+ L GE + +D + A+ +
Sbjct: 112 HFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAVEYTLGAGETMLLDTGYLAAMEETC 171
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++++ ++ +FGG+ L VTGPG + +Q++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIVQTMPIHAVAQSL 216
>gi|410726976|ref|ZP_11365206.1| hypothetical protein A370_03314 [Clostridium sp. Maddingley
MBC34-26]
gi|410599725|gb|EKQ54266.1| hypothetical protein A370_03314 [Clostridium sp. Maddingley
MBC34-26]
Length = 109
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 38/94 (40%), Positives = 62/94 (65%)
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
+AFI A G++V+++L GE++ +D C+VA+T +V I++ G I+ +FGG+ + A V
Sbjct: 1 MAFIHACGAIVKRDLMPGEMLKIDTGCLVAMTRNVQYDIQFVGGIKNTLFGGEGVFFATV 60
Query: 223 TGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPK 256
+GPG V+IQSLPF RL++R+ A R +
Sbjct: 61 SGPGTVWIQSLPFSRLAERVFAAAPGAGGRNKEE 94
>gi|169334218|ref|ZP_02861411.1| hypothetical protein ANASTE_00616 [Anaerofustis stercorihominis DSM
17244]
gi|169258935|gb|EDS72901.1| TIGR00266 family protein [Anaerofustis stercorihominis DSM 17244]
Length = 252
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 59/217 (27%), Positives = 105/217 (48%), Gaps = 8/217 (3%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
ILGG L P E ++ GSM +MS ++ ME GM + L +
Sbjct: 31 ILGGSLPTVVCNLAPGETMITESGSMSWMSPNIRMETT----TGGGMKKMLGRLASGESM 86
Query: 87 LRNP---GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
+N ++G + ++ I+P ++A G ++ Q AFL + V++S +++
Sbjct: 87 FQNKYTAQNTEGKIAFSSSFPGAIIPFEIAP-GKNMIVQKSAFLAAEESVELSMYFQKKL 145
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
+ G EG + QKL+GQG AFI G V+ +L GE + V + A+ + ++ ++
Sbjct: 146 GKGLFGGEGLIMQKLSGQGTAFIEIDGGSVEYDLAPGEELIVSTGYLAAMEETCSMDVQM 205
Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
+ + GG+ L VV GPG V +Q++P +++
Sbjct: 206 VKGAKNILLGGEGLFNTVVKGPGKVILQTMPISNVAE 242
>gi|293400841|ref|ZP_06644986.1| HTH DNA-binding protein [Erysipelotrichaceae bacterium 5_2_54FAA]
gi|291305867|gb|EFE47111.1| HTH DNA-binding protein [Erysipelotrichaceae bacterium 5_2_54FAA]
Length = 233
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 63/221 (28%), Positives = 104/221 (47%), Gaps = 4/221 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I GG+ V L+ EKV G MC+M EM G+ + + G++I
Sbjct: 9 MEYEIKGGQLPVVVCHLQKDEKVFTESGGMCWMDDGFEMYANTRGGLLKGLGRAMAGESI 68
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQ 141
+ G + + + +ILPI L GE ++ Q AFL + + V + +
Sbjct: 69 F-LTTYTAGNDNSEIAFGSSFPGKILPITLG--DGECIIIQKTAFLAAEDSVTLQPFFRK 125
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
R+ G EGF+ QKL G G AF+ G +V+K L+ G+++ +D I V+ I
Sbjct: 126 RLGAGFFGGEGFILQKLIGPGTAFLEIDGDIVEKELQPGQLLKIDQGYIAMFEEHVDFDI 185
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++ G+ A GPG V++Q++PF L+ RI
Sbjct: 186 TTVKGLKNKFLSGEGFFLATCKGPGKVWLQTMPFSVLADRI 226
>gi|146303673|ref|YP_001190989.1| hypothetical protein Msed_0893 [Metallosphaera sedula DSM 5348]
gi|145701923|gb|ABP95065.1| protein of unknown function DUF124 [Metallosphaera sedula DSM 5348]
Length = 230
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 103/218 (47%), Gaps = 7/218 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI+G + Q ++ L PQE A G M + S+ M+ G+ + L G +
Sbjct: 4 YQIMGDDLQYLKVQLAPQEGFYADAGHMIMKTASVNMQTRMRGGFLSGLKRVLTGGSF-- 61
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
V GP D + P +I+PI L GG +L + +FL + ++ T+ +
Sbjct: 62 FVTEFYGPGDVILSGIFP--GKIVPIQLQ--GGSILVEAHSFLGAETSIEYDATMARLTA 117
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+GG EG + G G F+ + G ++ ++L GE I V+ S ++A +N ++
Sbjct: 118 GWLGG-EGLFLARFRGVGNVFLHSYGGLIVRDLAPGETIQVEASHLMAFQEGMNYSVQLV 176
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G +R F + L +TGPG V++ +L +L+ +
Sbjct: 177 GGLRSVFFAHEGLFFVTITGPGRVWLHTLTAEQLASAL 214
>gi|163815175|ref|ZP_02206556.1| hypothetical protein COPEUT_01333 [Coprococcus eutactus ATCC 27759]
gi|158449584|gb|EDP26579.1| TIGR00266 family protein [Coprococcus eutactus ATCC 27759]
Length = 234
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 66/232 (28%), Positives = 116/232 (50%), Gaps = 18/232 (7%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT- 83
++I GG V +I+L+ E + G+M +MS +M+M+ G +F + I+
Sbjct: 12 YEIKGGMMPVVEIILEAGESINCEKGAMVWMSPNMQMQT------SGGGVGKMFTRAISG 65
Query: 84 SVVLRNP-----GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
+ RN GP G + + + I+ +++ G E++CQ A+L S + +
Sbjct: 66 ESMFRNTYTAQGGP--GLIAFGSSFVGNIMAVEIGP-GKEIICQKSAYLASTPGINLEIA 122
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS--CIVAVTSS 196
+++ G EGF+ Q++TGQG+ FI GSVV L G+ + +D ++ T S
Sbjct: 123 FQKKIAGGFFGGEGFIMQRITGQGIVFIEIDGSVVDYMLAPGQQMVIDTGYLAMMDATCS 182
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+N + G + FGG+ VVTGPG + +Q++P +L+ IA + S
Sbjct: 183 LNAETVKGGAMNM-FFGGEGFFNTVVTGPGRITLQTMPKSQLAAEIASMIPS 233
>gi|373451763|ref|ZP_09543682.1| TIGR00266 family protein [Eubacterium sp. 3_1_31]
gi|371967984|gb|EHO85451.1| TIGR00266 family protein [Eubacterium sp. 3_1_31]
Length = 225
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 103/220 (46%), Gaps = 2/220 (0%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I GG+ V L+ EKV G MC+M EM G+ + + G++I
Sbjct: 1 MEYEIKGGQLPVVVCHLQKDEKVFTESGGMCWMDDGFEMYANTRGGLLKGLGRAMAGESI 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ G + + + +ILPI L G ++ Q AFL + + V + +R
Sbjct: 61 F-LTTYTAGNDNSEIAFGSSFPGKILPITLCD-GECIIIQKTAFLAAEDSVTLQPFFRKR 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ QKL G G AF+ G +V+K L+ G+++ +D I V+ I
Sbjct: 119 LGAGFFGGEGFILQKLIGPGTAFLEIDGDIVEKELQPGQLLKIDQGYIAMFEEHVDFDIT 178
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++ G+ A GPG V++Q++PF L+ RI
Sbjct: 179 TVKGLKNKFLSGEGFFLATCKGPGKVWLQTMPFSVLADRI 218
>gi|325264387|ref|ZP_08131118.1| HTH DNA-binding protein [Clostridium sp. D5]
gi|324030458|gb|EGB91742.1| HTH DNA-binding protein [Clostridium sp. D5]
Length = 229
Score = 81.3 bits (199), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 64/231 (27%), Positives = 114/231 (49%), Gaps = 10/231 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I GG V L+ +E+++ GSM +MS +M+M+ G +F K
Sbjct: 5 MKYEIKGGAFPVVVCELEDREQMITEKGSMVWMSPNMQMDTAG------GGLGKMFSKAF 58
Query: 83 T-SVVLRN--PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
+ + +N G + + RILP+ + G E++ Q AFL S +++S
Sbjct: 59 SGESMFQNIYTAQGAGMIAFGSSFPGRILPLTIEP-GKEMVVQKSAFLASEAGIELSIHF 117
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
++++ G EGF+ Q+L G G+AFI G +V+ L G+ I VD + V +
Sbjct: 118 NKKMGAGFFGGEGFIMQRLAGTGMAFIEIDGELVEYQLGPGQQIVVDTGNVAGFEVGVQI 177
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
I+ I+ + GG+ L V+TGPG +++Q++P ++ I + S N
Sbjct: 178 DIQQVPGIKNKLLGGEGLFNTVLTGPGKIWLQTMPISSVAASIRPFIPSGN 228
>gi|291549051|emb|CBL25313.1| conserved hypothetical protein TIGR00266 [Ruminococcus torques
L2-14]
Length = 226
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 64/224 (28%), Positives = 111/224 (49%), Gaps = 10/224 (4%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I G V L+ E+++ GSM +MS +M+ME G FG+ +
Sbjct: 3 YEIKGDTLPVVVCYLENGERMITERGSMSWMSPNMKMET-----TSNGGIGKAFGRMFSG 57
Query: 85 VVL----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
L +G + A+ +I+ +A G EL+ Q FL S V +S
Sbjct: 58 EALFQNRYTATGGNGMIAFASSFPGQIVARQIAP-GQELIVQKSGFLASEEGVDLSIFFQ 116
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
++ G EGF+ Q+L+G G+AF+ G VV+ L G+ I VD + A+ ++ +++
Sbjct: 117 KKFGAGFFGGEGFIMQRLSGNGIAFLEFDGHVVEYELRPGQQIVVDTGYLAAIDATCSME 176
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
I+ ++ VFGG+ + VVTGPG V++Q++P ++ + R
Sbjct: 177 IQTVPGLKNMVFGGEGVFNTVVTGPGHVWLQTMPISSVAGELLR 220
>gi|331694270|ref|YP_004330509.1| hypothetical protein Psed_0384 [Pseudonocardia dioxanivorans
CB1190]
gi|326948959|gb|AEA22656.1| protein of unknown function DUF124 [Pseudonocardia dioxanivorans
CB1190]
Length = 237
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 64/217 (29%), Positives = 105/217 (48%), Gaps = 14/217 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMEN------IYIPENEVGMWQWLFGKTITSVVL 87
V +IML P E+V A G + + + M++E G+ + L G + +
Sbjct: 12 VLEIMLGPGERVTAEGGEVSWYTPGMQLETSTRFGSGGRGGFMSGLKRALGGGQL--FLT 69
Query: 88 RNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
P+ G V AA P + R L ID A E + Q +++ S DV+VS L +R+
Sbjct: 70 EYTAPTGGMVAFAARMPGVIRELTIDPA---DEFMVQSGSYIASTTDVEVSVALQKRLGA 126
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
I G G + QKL G G A++ G +V+ +L G+ + V + T+ +++Q
Sbjct: 127 GIFGGAGLVFQKLAGHGRAWVELSGEIVEYDLAQGQSMIVHPGHLALFTAGMDLQFATVK 186
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I+ F GD L+ A + GPG V++QS+ +L+ I
Sbjct: 187 GIKNKFF-GDALMMAEIHGPGHVWLQSMTAAKLAAAI 222
>gi|150391921|ref|YP_001321970.1| hypothetical protein Amet_4231 [Alkaliphilus metalliredigens QYMF]
gi|149951783|gb|ABR50311.1| protein of unknown function DUF124 [Alkaliphilus metalliredigens
QYMF]
Length = 227
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 44/148 (29%), Positives = 82/148 (55%), Gaps = 2/148 (1%)
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
I+PID+ ++CQ +FLCS +V++ +R+ + G EGF+ Q+L G+G+A
Sbjct: 82 GHIIPIDVT--NRSIICQKRSFLCSTEEVELEIAFQKRLGSGFFGGEGFIMQELRGKGMA 139
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
F+ G V+ L+ GE I V+ + S++ + ++ +FGG+ L +TG
Sbjct: 140 FVEIDGESVEIQLKAGESIKVETGSVGMYESTIQMSVEMVKGFSNMLFGGEGLFLTTLTG 199
Query: 225 PGIVFIQSLPFHRLSQRIARAVTSPNMR 252
PG+V+IQ++P ++ + + +P+ R
Sbjct: 200 PGLVWIQTMPIQSMAGEMYPYLPTPSNR 227
>gi|389575957|ref|ZP_10165985.1| TIGR00266 family protein [Eubacterium cellulosolvens 6]
gi|389311442|gb|EIM56375.1| TIGR00266 family protein [Eubacterium cellulosolvens 6]
Length = 225
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/227 (25%), Positives = 108/227 (47%), Gaps = 14/227 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I GG V + +L+ E + GSM +M +EM E G +FG+
Sbjct: 1 MQYKIDGGNLPVLKCLLEAGESMQCEGGSMAWMDDGIEMST------EGGSIGKVFGRMF 54
Query: 83 TSVVL---RNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
T+ L R G + ++ P R + I ++ Q +FL + + VS
Sbjct: 55 TNEKLFMNRYTARQTGEIAFSSSFPGSIRCIEITPDR---PVIAQKGSFLAATEGIDVSV 111
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
++V N+ G EG + ++ +G+GL F+ GS + +L G+ +D +VA+ ++
Sbjct: 112 FFQKKVGNMFFGGEGLIMEQFSGKGLVFLEVDGSAHEYDLAPGQKKILDTGYLVAMDNTC 171
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
++ + ++ +FGG+ L V+TGPG + +QS+P + + R
Sbjct: 172 SMDVVTIKGVKNVLFGGEGLFNTVITGPGHITVQSMPIATTAAVLYR 218
>gi|323454065|gb|EGB09936.1| hypothetical protein AURANDRAFT_14012, partial [Aureococcus
anophagefferens]
Length = 209
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 102/213 (47%), Gaps = 5/213 (2%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+ ++G + + L P E + PGSM FM S++ ++ ++ + + L G+
Sbjct: 2 WDLIGDSMHILNVGLAPGETMATEPGSMMFMGSSID---AHVDCSDC-LSRCLGGEACAM 57
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ N G + +V + + A I+ +D+ +GG++ + A++ SV D +D
Sbjct: 58 LNYTNRGAAPSYVALTSAKPADIVALDMDRYGGKISARSGAYMSSVGDAVPGCEVDCNPL 117
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G +RQ +TG G AF+ A G+V +K L GE+ +D + +VA + + ++
Sbjct: 118 TCCFAGFGCVRQTITGSGTAFVEASGTVERKELAPGELFIIDSNSLVAWHGAT-LGVRTA 176
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
G G+ + GPG ++QS+P+ +
Sbjct: 177 GSCLACCCNGEGCCNTTLLGPGTAWVQSMPWEQ 209
>gi|323482977|ref|ZP_08088373.1| hypothetical protein HMPREF9474_00122 [Clostridium symbiosum
WAL-14163]
gi|323691588|ref|ZP_08105852.1| hypothetical protein HMPREF9475_00714 [Clostridium symbiosum
WAL-14673]
gi|355625943|ref|ZP_09048474.1| hypothetical protein HMPREF1020_02553 [Clostridium sp. 7_3_54FAA]
gi|323403683|gb|EGA95985.1| hypothetical protein HMPREF9474_00122 [Clostridium symbiosum
WAL-14163]
gi|323504381|gb|EGB20179.1| hypothetical protein HMPREF9475_00714 [Clostridium symbiosum
WAL-14673]
gi|354821076|gb|EHF05473.1| hypothetical protein HMPREF1020_02553 [Clostridium sp. 7_3_54FAA]
Length = 226
Score = 77.4 bits (189), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 61/233 (26%), Positives = 111/233 (47%), Gaps = 17/233 (7%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I GG V L E V G+MC+MS +MEM G +FG+ +
Sbjct: 3 YEIKGGNLPVVICSLDNGESVFNESGAMCWMSPNMEMTT------SGGGIGKMFGRMFSG 56
Query: 85 VVL-------RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
L RN G + A+ I P+++ G L+ Q FL + V++S
Sbjct: 57 ENLFQNIYTARN---DSGMIAFASSFPGSIRPVEIRP-GDGLVVQKSGFLAAEAGVELSA 112
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
++ G EGF+ QKL+G+G AF+ G+ V+ +L G+ + VD + A++ +
Sbjct: 113 FFQKKAGAGFFGGEGFIMQKLSGRGTAFLEIDGAAVEYDLAPGQKMIVDTGYLAAMSETC 172
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
++ I+ + +FGG+ ++TGPG + +Q++P + + + + + P+
Sbjct: 173 SIDIQAVKGAKNMLFGGEGFFNTIITGPGHIILQTMPVSKTAMCMYQYMPHPS 225
>gi|320159924|ref|YP_004173148.1| hypothetical protein ANT_05140 [Anaerolinea thermophila UNI-1]
gi|319993777|dbj|BAJ62548.1| hypothetical protein ANT_05140 [Anaerolinea thermophila UNI-1]
Length = 269
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 106/222 (47%), Gaps = 5/222 (2%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTIT 83
+QI+G Q + +KP GS+ +MS + M+ +G +F G+++
Sbjct: 31 YQIIGTVMQALTLRMKPGMGFYTEVGSLSWMSQGVRMDTNLGKGGLMGALGRMFTGESL- 89
Query: 84 SVVLRNPGPSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
V+ DG + + +I+P++LA G ++ Q D L + V +S +R
Sbjct: 90 -FVVNYTAEIDGAMATFSCDFPGKIVPVNLAQ-GQAMIAQKDTLLVAEQSVSLSIAFQKR 147
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ + G EGF+ QK G G F G +V+ L+ G+ + VD + +V ++
Sbjct: 148 LGAGLFGGEGFILQKFEGPGTFFASFDGEIVEYTLQPGQKMLVDTGHLAMFEPTVTYDVQ 207
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
++ +FGG+ L +TGPG +++Q++P +L+ I +
Sbjct: 208 MVKGVKNLLFGGEGLFLVSLTGPGRIWLQTMPMSKLAGAIHK 249
>gi|218288578|ref|ZP_03492855.1| protein of unknown function DUF124 [Alicyclobacillus acidocaldarius
LAA1]
gi|218241235|gb|EED08410.1| protein of unknown function DUF124 [Alicyclobacillus acidocaldarius
LAA1]
Length = 225
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/228 (24%), Positives = 97/228 (42%), Gaps = 3/228 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G Q ++ L+P E V + G + M+ ME P GM++
Sbjct: 1 MEYRIHGTTMQTLEVRLEPGESVFSEAGCLLAMTPGTYMET-RAPGGLAGMFRRALSGNS 59
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ + V IL + L +G ++ + +FLC+ + V +
Sbjct: 60 VFLTYYRAARGESVVQFTTRMPGHILALPLRQYG-RVIAERHSFLCAESTVDYGVEVTLN 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ +GG G L G GLAF+ G +V+++L GE + V I A T ++ +++
Sbjct: 119 IGRFLGG-NGLALTHLAGDGLAFLSIDGEIVERDLARGESLLVHPGHIAAFTDGIDYRVQ 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+ +F GD L +TGPG V++ S+ H L I SP
Sbjct: 178 LMQGVANMLFSGDGLFLVELTGPGRVWLHSVTVHNLVHIIQEYWGSPR 225
>gi|323341905|ref|ZP_08082138.1| protein of hypothetical function DUF124 [Erysipelothrix
rhusiopathiae ATCC 19414]
gi|336066016|ref|YP_004560874.1| hypothetical protein ERH_0778 [Erysipelothrix rhusiopathiae str.
Fujisawa]
gi|322464330|gb|EFY09523.1| protein of hypothetical function DUF124 [Erysipelothrix
rhusiopathiae ATCC 19414]
gi|334295962|dbj|BAK31833.1| conserved hypothetical protein [Erysipelothrix rhusiopathiae str.
Fujisawa]
Length = 225
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 57/223 (25%), Positives = 103/223 (46%), Gaps = 14/223 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL----RN 89
V + L E ++ G+M +M+ +M + + G ++ FG+ +
Sbjct: 11 VLRCFLDEGESLITSAGNMSWMTANMSYK-----VHSGGGFKKAFGRAFSGEGFFQNTYT 65
Query: 90 PGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
G S+ + A P + + + +D + + Q +AFL S V+ SN +++
Sbjct: 66 AGDSNQEIAFAMSMPGVIKHIEMDGSK---TFIAQKNAFLASEPGVEFSNEFTKKLSAGF 122
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
G EGF+ QK +G G F+ A GS++ +L+ GE + VD + S V+ I+ +
Sbjct: 123 FGGEGFILQKFSGVGNLFLEADGSLITYDLKEGESLLVDQGNLFMFESGVSYAIETVKGL 182
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
FGG+ +TGPG V +Q+LP L+ + R + + N
Sbjct: 183 SNKFFGGEGFFLVKLTGPGQVVLQTLPISNLAGEVNRVLPTSN 225
>gi|383790957|ref|YP_005475531.1| hypothetical protein [Spirochaeta africana DSM 8902]
gi|383107491|gb|AFG37824.1| TIGR00266 family protein [Spirochaeta africana DSM 8902]
Length = 228
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 65/232 (28%), Positives = 108/232 (46%), Gaps = 27/232 (11%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ + ILG + + L EKV+++PG+M +M + M+ G F +T+
Sbjct: 1 MQYTILGSSMPLLECTLNAGEKVLSQPGAMKYMDYGVVMKT-----GIQGGISGFFKRTL 55
Query: 83 TSVVLRNPGPSDGFVGI--AAPSLAR----------ILPIDLAMFGGELLCQPDAFLCSV 130
+ GF+ AA S R I+PID+A +++CQ AFL
Sbjct: 56 SG--------ESGFMSTFEAARSGQRIAFGHTYPGSIIPIDVAH--SDIICQKRAFLACT 105
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
V + +R+ + G EGF+ Q+L G G AF G V+ LE G+ I ++ +
Sbjct: 106 EGVTFDIAIQRRLGSGFFGGEGFIMQRLGGTGTAFAEIDGETVEMQLEPGQKIQIETGAV 165
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++V++ I+ I FGG+ L +TGPG V++Q++ RL+ I
Sbjct: 166 AMFEATVDMNIEMVKGISNVFFGGEGLFLTTLTGPGKVWLQTMSVQRLASEI 217
>gi|167756077|ref|ZP_02428204.1| hypothetical protein CLORAM_01597 [Clostridium ramosum DSM 1402]
gi|237734062|ref|ZP_04564543.1| conserved hypothetical protein [Mollicutes bacterium D7]
gi|374624270|ref|ZP_09696687.1| TIGR00266 family protein [Coprobacillus sp. 8_2_54BFAA]
gi|167704069|gb|EDS18648.1| TIGR00266 family protein [Clostridium ramosum DSM 1402]
gi|229382888|gb|EEO32979.1| conserved hypothetical protein [Coprobacillus sp. D7]
gi|373915553|gb|EHQ47301.1| TIGR00266 family protein [Coprobacillus sp. 8_2_54BFAA]
Length = 224
Score = 77.0 bits (188), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 101/215 (46%), Gaps = 10/215 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G V L+ E ++ G+M +M MEME G + FG+
Sbjct: 1 MKYEIKGDNLPVVICHLEKGESMITDSGAMSWMDPVMEMETTS------GGFGKAFGRMF 54
Query: 83 TSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
+ + R DG + A+ I +++ E++ Q +FL + + +S
Sbjct: 55 SGETMFQNRYHAKEDGMIAFASSFPGSIEAVEINP-NHEIIVQKSSFLAGTSGIDMSVFF 113
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
++++ I G EGF+ K++G G F+ GS + +L G+ + +D + + ++ +
Sbjct: 114 NKKMATGIFGGEGFIMNKISGTGTVFLEIDGSAISYDLLPGQQMVIDTGYLAMMDATCKM 173
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
+I+ ++ + GG+ VVTGPG V +QS+P
Sbjct: 174 EIQSVKGVKNKLLGGEGFFNTVVTGPGKVVLQSMP 208
>gi|224542111|ref|ZP_03682650.1| hypothetical protein CATMIT_01286 [Catenibacterium mitsuokai DSM
15897]
gi|224524948|gb|EEF94053.1| TIGR00266 family protein [Catenibacterium mitsuokai DSM 15897]
Length = 223
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/224 (23%), Positives = 105/224 (46%), Gaps = 11/224 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G V + L E ++ G+M +M ++ M+ + G + LFG+
Sbjct: 1 MKYEIKGKPLPVAILHLDQGESILTDSGAMVWMKPALTMDTV------SGGMKSLFGRAF 54
Query: 83 TSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
+ L + + + + IL D++ G+++ Q ++L S VK +
Sbjct: 55 SGETLFLNKYTATQPSMIALGSSFPGDILVYDVSQ--GDIIVQKSSYLASEETVKREISF 112
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
++ G EGF+ + G GL FI G+ ++ +L GE + V+ +VA++SS ++
Sbjct: 113 SKKGTIGFFGGEGFIMNRYFGSGLLFIEIDGTAIEYDLNPGEELIVNTGNLVAMSSSCHI 172
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
+ I+ GG+ VV GPG V +Q++P L++ ++
Sbjct: 173 DVVSVKGIKNKFLGGEGFFNTVVKGPGHVIVQTMPISTLAESLS 216
>gi|291460386|ref|ZP_06599776.1| HTH DNA-binding protein [Oribacterium sp. oral taxon 078 str.
F0262]
gi|291416953|gb|EFE90672.1| HTH DNA-binding protein [Oribacterium sp. oral taxon 078 str.
F0262]
Length = 241
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 106/228 (46%), Gaps = 3/228 (1%)
Query: 11 QPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE 70
+ + Q + + +QI GG I L E +++ + G +E E
Sbjct: 5 EKHEKQKIGGSSMRYQIEGGSLPAVVIQLDAGETIMSELAGRTWAKGPIETETRGTGGIG 64
Query: 71 VGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
+ + G+++ GP++ +G A+ I+ +L G L+CQ AFLCS
Sbjct: 65 KSIGRMFSGESLFMSHYTAKGPAE--IGFASSFAGSIIARELGP-GESLICQKSAFLCST 121
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
+ V +S +++ G EGF+ QK+TG G+ F+ G V + +L GE + D +
Sbjct: 122 SGVDLSIFFQKKLGAGFFGGEGFIMQKVTGPGVVFLELDGYVKEYDLAAGEELVCDTGVL 181
Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
+ +S + I+ ++ +FGG+ LV V+ GPG +IQ++ L
Sbjct: 182 ALMDASCKMDIRMVKGLKNKLFGGEGLVDTVIIGPGKAYIQTMSLAHL 229
>gi|384135763|ref|YP_005518477.1| hypothetical protein TC41_2047 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
gi|339289848|gb|AEJ43958.1| protein of unknown function DUF124 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius Tc-4-1]
Length = 267
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 96/228 (42%), Gaps = 3/228 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ + I G Q ++ L+P E V + G + M+ ME P GM++
Sbjct: 43 MDYHIHGSTMQTLEVRLEPGESVFSEAGCLLSMTPGTYMET-RAPGGLAGMFRRALSGNS 101
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ + V IL + L +G ++ + +FLC+ + V +
Sbjct: 102 VFLTYYRAARGESVVQFTTRMPGHILALPLHQYG-RVIAERHSFLCAESTVDYGVEVTLN 160
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ +GG G L G G+AF+ G +V+++L GE + V I A T ++ +++
Sbjct: 161 IGRFLGG-NGLALTHLAGDGMAFLSIDGEIVERDLARGESLLVHPGHIAAFTDGIDYRVQ 219
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+ +F GD L +TGPG V++ S+ H L I SP
Sbjct: 220 LMQGVANMLFSGDGLFLVELTGPGRVWLHSVTVHNLVHVIQAYWGSPR 267
>gi|256372358|ref|YP_003110182.1| hypothetical protein Afer_1587 [Acidimicrobium ferrooxidans DSM
10331]
gi|256008942|gb|ACU54509.1| protein of unknown function DUF124 [Acidimicrobium ferrooxidans DSM
10331]
Length = 254
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/219 (27%), Positives = 95/219 (43%), Gaps = 17/219 (7%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENI---------YIPENEVGMWQWLFGKTITS 84
V +I+L P + +V+ G + +MS ++ M + G G + +
Sbjct: 12 VLEIVLDPGDHLVSDRGELSWMSANVRMTQTTGAAGGGLMGALKRVAG-----GGSLLLT 66
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
GP GFV A I+P+++A G L D ++C DV S Q +
Sbjct: 67 HFEATGGP--GFVAFGAKLPGHIVPLEIAQ-GQAYLVHRDGWVCGTGDVVASMAFQQSLG 123
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ G EGF+ Q+L G G A+I GG +V L G+ + V + A +V Q+
Sbjct: 124 GALFGKEGFILQRLEGHGTAWIELGGEMVNYQLAEGQQLLVHPGHVGAFAETVQFQVTRV 183
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
I GGD + GPG V++QS+P L+Q +
Sbjct: 184 PGIVNRWAGGDGHWLVQLVGPGQVWLQSMPLAVLAQALG 222
>gi|70607532|ref|YP_256402.1| hypothetical protein Saci_1803 [Sulfolobus acidocaldarius DSM 639]
gi|449067783|ref|YP_007434865.1| hypothetical protein SacN8_08785 [Sulfolobus acidocaldarius N8]
gi|449070057|ref|YP_007437138.1| hypothetical protein SacRon12I_08795 [Sulfolobus acidocaldarius
Ron12/I]
gi|68568180|gb|AAY81109.1| conserved Prokaryal protein [Sulfolobus acidocaldarius DSM 639]
gi|449036291|gb|AGE71717.1| hypothetical protein SacN8_08785 [Sulfolobus acidocaldarius N8]
gi|449038565|gb|AGE73990.1| hypothetical protein SacRon12I_08795 [Sulfolobus acidocaldarius
Ron12/I]
Length = 236
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 98/218 (44%), Gaps = 8/218 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QILG + Q ++ L E A G M S ++ ++ GM + L G T
Sbjct: 4 YQILGDDLQYLKVELAQGEGFYADAGHMVMKSSTVNLQTKMRGGILSGMKRALTGGTFFV 63
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
PG V ++ +I+ + + G +L + FL + N V+ T+ +
Sbjct: 64 TEFYGPGE----VMLSGIFPGKIVQVQVQ---GSILAEAHTFLGAENTVEYDATMARLTT 116
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
++GG EG + G G F+ A G + K+L GE I V+ S ++A +N ++
Sbjct: 117 GILGG-EGLFLARFKGYGNLFLHAYGGLYVKDLAPGETIQVEASHLMAFQEGMNYSVQLV 175
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G +R F + L +TGPG V++ +L +L+ +
Sbjct: 176 GGLRSIFFAHEGLFFVTITGPGRVWLHTLTVQQLAHAL 213
>gi|149918517|ref|ZP_01907006.1| hypothetical protein PPSIR1_20299 [Plesiocystis pacifica SIR-1]
gi|149820593|gb|EDM80005.1| hypothetical protein PPSIR1_20299 [Plesiocystis pacifica SIR-1]
Length = 217
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/209 (29%), Positives = 101/209 (48%), Gaps = 6/209 (2%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+Q+ G AQ ++ L + + A G++ S+E ++ IP N + G T+T
Sbjct: 10 YQLHGNIAQTIELSLDEGQSMWASKGALICYDDSIEW-SLEIPGNSLSRMLSGEGLTMTR 68
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
V + PG V + A +I DL+ G + C P AFL +V V++ T +R
Sbjct: 69 VTAKRPGAK---VVLGANDTGKIAAWDLSW--GPVTCTPGAFLAAVGGVQIDVTAARRAG 123
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G G L QK++G+GL FI G +Q++L GE + V + A S V+ +
Sbjct: 124 AAFFGGAGLLLQKISGRGLVFIHGKGDFIQRDLAAGERLLVSTGNLAAYGSEVDYGVVGV 183
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
G + + +F + + +TGPG V +Q+L
Sbjct: 184 GGLFKMLFAREGVFMTQLTGPGRVLLQTL 212
>gi|358067922|ref|ZP_09154394.1| hypothetical protein HMPREF9333_01275 [Johnsonella ignava ATCC
51276]
gi|356693891|gb|EHI55560.1| hypothetical protein HMPREF9333_01275 [Johnsonella ignava ATCC
51276]
Length = 225
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/226 (27%), Positives = 110/226 (48%), Gaps = 14/226 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I GG V +P EK++ G+M +MS +M M G FG+
Sbjct: 1 MKYEIQGGNFPVAICSCEPGEKMICEGGAMSWMSPNMVMNT-----TTNGGLGKAFGRMF 55
Query: 83 T-SVVLRNPGPSDGFVGIAA-----PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
+ + N + G G+ A P R + I+ G +++ Q AFL V++S
Sbjct: 56 SGESMFMNTYEAQGGPGMIAFASSFPGEIRAINIEP---GRDVVAQKRAFLACEPGVELS 112
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+++ G EGF+ QKL+GQG+AFI GS V+ L G+ I VD + + ++
Sbjct: 113 IFFQKKIGAGFFGGEGFIMQKLSGQGIAFIEIDGSAVEYELAAGQTIVVDTGYVALMDAT 172
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++ +K ++ + GG+ L VTGPG V +Q++P +++ +
Sbjct: 173 CSMDVKTVPGVKNILLGGEGLFNTHVTGPGRVIVQTMPLSKMAASL 218
>gi|48477824|ref|YP_023530.1| HTH DNA-binding protein [Picrophilus torridus DSM 9790]
gi|48430472|gb|AAT43337.1| HTH DNA-binding protein [Picrophilus torridus DSM 9790]
Length = 219
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 108/229 (47%), Gaps = 17/229 (7%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
++ ++I+GG+ Q + +L E + PG + + S S ++ G + F
Sbjct: 1 MVNYEIVGGDVQYLKSILNEGESMYVEPGHLIYKSPSARLDV------RAGGLRGAFSHM 54
Query: 82 ITS---VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
+ +L+ GP + V A ++ I+L ++ + +AFLC + V S
Sbjct: 55 LAGSDVFLLKVDGPGE--VASAGFLPGKVFKIELN--NNSIIAEFNAFLCMDSTVNYSTK 110
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+ V GG EG +K +G G + G V+++ L+ GE I V++S ++A +V
Sbjct: 111 FAGLWQGVFGG-EGLFLEKFSGTGSVMMHGHGQVIERELKPGEEIQVELSHVLAFEETVG 169
Query: 199 VQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ G ++ V G G+ L A +TGPG V++ S+ LS ++AR
Sbjct: 170 YNVVRIGGLKTMVLGGLEGEGLFFANMTGPGKVWLHSISLFELSAKLAR 218
>gi|258511908|ref|YP_003185342.1| hypothetical protein Aaci_1941 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius DSM 446]
gi|257478634|gb|ACV58953.1| protein of unknown function DUF124 [Alicyclobacillus acidocaldarius
subsp. acidocaldarius DSM 446]
Length = 225
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/228 (24%), Positives = 97/228 (42%), Gaps = 3/228 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G Q ++ L+P E V + G + M+ ME P GM++
Sbjct: 1 MDYRIHGTTMQTLEVRLEPGESVFSEAGCLLAMTPGTYMET-RAPGGLAGMFRRALSGNS 59
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ + V IL + L +G ++ + +FLC+ + V +
Sbjct: 60 VFLTYYRAARGESVVQFTTRMPGHILALPLRQYG-RVIAERHSFLCAESTVDYGVEVTLN 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ +GG G L G G+AF+ G +V+++L GE + V I A T ++ +++
Sbjct: 119 IGRFLGG-NGLALTHLAGDGMAFLSIDGEIVERDLARGESLLVHPGHIAAFTDGIDYRVQ 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+ +F GD L +TGPG V++ S+ H L I SP
Sbjct: 178 LMQGVANMLFSGDGLFLVELTGPGRVWLHSVTVHNLIHIIQEYWGSPR 225
>gi|410723384|ref|ZP_11362623.1| hypothetical protein A370_00664 [Clostridium sp. Maddingley
MBC34-26]
gi|410603236|gb|EKQ57676.1| hypothetical protein A370_00664 [Clostridium sp. Maddingley
MBC34-26]
Length = 146
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/137 (34%), Positives = 70/137 (51%), Gaps = 13/137 (9%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLF- 78
I + I G + Q +I L P E VVA G++ M+ S+EME I+ +N G +F
Sbjct: 6 IDYSIYGDDMQYVEIELDPMETVVAEAGALMMMAPSIEMETIFGDGSKQNTGGFMDKIFS 65
Query: 79 -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
G+++ N G V AAP +ILP+DL +GG+L+CQ DAFLC+
Sbjct: 66 AGKRVLTGESLFMTAFTNAGQLREKVYFAAPYPGKILPMDLRAYGGQLICQKDAFLCAAK 125
Query: 132 DVKVSNTLDQRVRNVIG 148
V + +D R + +G
Sbjct: 126 GVSIG--IDFRKKLGVG 140
>gi|229582748|ref|YP_002841147.1| hypothetical protein YN1551_2234 [Sulfolobus islandicus Y.N.15.51]
gi|228013464|gb|ACP49225.1| protein of unknown function DUF124 [Sulfolobus islandicus
Y.N.15.51]
Length = 232
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++ILG + Q ++ML E + A G M S+ M+ M FG
Sbjct: 4 YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55
Query: 85 VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ + ++ F+G L+ + P + + + G +L + +FLC+ N+V TL++
Sbjct: 56 ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILAESHSFLCAENNVNYDATLNK 114
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+GG EG + G G F+ A G +++K L GE + V+ S ++A + I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLEVEASHLMAFEEGMRYGI 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
+ G IR +F G+ L + GPG +++ +L +L + R + P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPSFNIRI 232
>gi|227829995|ref|YP_002831774.1| hypothetical protein LS215_1106 [Sulfolobus islandicus L.S.2.15]
gi|284996932|ref|YP_003418699.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
gi|227456442|gb|ACP35129.1| protein of unknown function DUF124 [Sulfolobus islandicus L.S.2.15]
gi|284444827|gb|ADB86329.1| protein of unknown function DUF124 [Sulfolobus islandicus L.D.8.5]
Length = 232
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++ILG + Q ++ML E + A G M S+ M+ M FG
Sbjct: 4 YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55
Query: 85 VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ + ++ F+G L+ + P + + + G +L + +FLC+ N+V TL++
Sbjct: 56 ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILAESHSFLCAENNVNYDATLNK 114
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+GG EG + G G F+ A G +++K L GE + V+ S ++A + I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLQVEASHLMAFEEGMRYGI 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
+ G IR +F G+ L + GPG +++ +L +L + R + P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPSFNIRI 232
>gi|317056444|ref|YP_004104911.1| hypothetical protein Rumal_1779 [Ruminococcus albus 7]
gi|315448713|gb|ADU22277.1| protein of unknown function DUF124 [Ruminococcus albus 7]
Length = 225
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 79/149 (53%), Gaps = 2/149 (1%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
DG + +A+ I P+D++ ++ Q AFL S V + +R G EGF
Sbjct: 71 DGMIAMASNFPGSIKPMDVSQM--PIVAQKSAFLASEMGVNMEIFFQKRFGAGFFGGEGF 128
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ QK +GQG F+ GS+++ L G+ + VD C+ A+ +S ++ I+ ++ +FG
Sbjct: 129 IMQKFSGQGTVFLELDGSIIEYQLAQGQSMLVDTGCLAAMEASCSIDIEQVSGLKNKLFG 188
Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G+ +TGPG V++Q++P L+ I
Sbjct: 189 GEGFFNTRITGPGHVWLQTMPVSNLAGAI 217
>gi|406872158|gb|EKD22793.1| hypothetical protein ACD_83C00267G0005 [uncultured bacterium]
Length = 228
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 96/210 (45%), Gaps = 4/210 (1%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+ +LG Q I LK E + A PGS+ F + +++ G + + G+++
Sbjct: 3 YDVLGEVMQRLIITLKKGELIYAEPGSVYFANEAIQPATKLKGGILEGAKRVMTGESLF- 61
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ L G + I + +IL DL + L+CQ AFLC V + +
Sbjct: 62 IPLYASSSDGGEIHIKTKFIGKILTFDL-LPKQVLICQKGAFLCGQESVDIGLHIRHLTA 120
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
GG EGF+ Q++ G G F+ G V + NL E I + + SV+ I++
Sbjct: 121 GAFGG-EGFIMQRIVGPGWVFLAIDGEVEEHNLSASETIRAEPGHVAFHEPSVHCGIEFA 179
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
G + FGG L A + GPG V++Q++P
Sbjct: 180 G-FKNVFFGGFPLFWATLVGPGKVWLQTMP 208
>gi|383787060|ref|YP_005471629.1| hypothetical protein [Fervidobacterium pennivorans DSM 9078]
gi|383109907|gb|AFG35510.1| TIGR00266 family protein [Fervidobacterium pennivorans DSM 9078]
Length = 224
Score = 74.7 bits (182), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 101/211 (47%), Gaps = 18/211 (8%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL-----FGKTITS 84
G + ++ L P E+V PG+M +M G + +E + G+W+ L G++
Sbjct: 9 GSYALLKVYLGPGERVKVEPGAMVYMKGPVNVET-----STGGVWKALKRAVLGGESFFM 63
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ G DG +GIA P L I+ G L Q +FL V + + +
Sbjct: 64 NTYYSSG--DGEIGIA-PELPG--DIETVHVNGTLFVQSTSFLACDAQVDLDVSFGG-FK 117
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ G EG KL G G A I + G++ L+VGE IT+D +VA SVN ++
Sbjct: 118 SFFAG-EGIFLLKLQGYGDAAISSFGAIKILELQVGEKITIDTGHVVAFDGSVNYSVRTF 176
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
G I+ +FGG+ LV TGPG +++Q+ +
Sbjct: 177 GGIKSTLFGGEGLVCD-FTGPGRIYVQTRNY 206
>gi|331003350|ref|ZP_08326853.1| hypothetical protein HMPREF0491_01715 [Lachnospiraceae oral taxon
107 str. F0167]
gi|330412620|gb|EGG92005.1| hypothetical protein HMPREF0491_01715 [Lachnospiraceae oral taxon
107 str. F0167]
Length = 226
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 57/219 (26%), Positives = 102/219 (46%), Gaps = 3/219 (1%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI GG I L+ E +++ G + G +E + + + G++
Sbjct: 3 YQIEGGNLPAVVIELETGEVIISEVGGRTWAKGPIETDTKATGGIGKAIGRMFSGESFFM 62
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
G ++ +G ++ I+ +L G ++CQ AFLC+ ++V +S +++
Sbjct: 63 SHYTAKGHAE--IGFSSSFAGSIIAKELKA-GESIICQKSAFLCATSEVSLSIHFQKKIG 119
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G EGF+ QK+TG G F+ G V + +L GE + D I + S + +
Sbjct: 120 AGFFGGEGFIMQKVTGPGTVFLELDGYVKEYDLNPGEQVVCDTGLIALMDESCTMDVVVV 179
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
I+ FG + L+ VVTGPG V+IQ++ RL + +
Sbjct: 180 KGIKNKFFGSEGLIDTVVTGPGKVYIQTMSLPRLGEMLT 218
>gi|306821076|ref|ZP_07454694.1| protein of hypothetical function DUF124 [Eubacterium yurii subsp.
margaretiae ATCC 43715]
gi|304550912|gb|EFM38885.1| protein of hypothetical function DUF124 [Eubacterium yurii subsp.
margaretiae ATCC 43715]
Length = 226
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 103/214 (48%), Gaps = 3/214 (1%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI GG I L+ E +V+ G + G + E + + G+ +
Sbjct: 3 YQIEGGSLPAAVIQLEKGETLVSELGGRTWAKGPITTETKAPGGIGKSIGRMFSGENLFM 62
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
GP + +G A+ I+ +LA G ++CQ AFLC+ + V++S +++
Sbjct: 63 SYYTAEGPVE--IGFASSFAGSIVVKELAA-GQSIICQKSAFLCATSQVELSIFFQKKLG 119
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G EGF+ QK+TG G F+ G V + +L GE + D + + + ++ ++
Sbjct: 120 AGFFGGEGFIMQKVTGPGTVFLELDGYVKEYDLVQGEELVCDTGVVALMDETCSMDVRMV 179
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
++ +FGG+ L+ V+ GPG V+IQ++ RL
Sbjct: 180 KGLKNKLFGGEGLIDTVIIGPGKVYIQTMSLPRL 213
>gi|229578344|ref|YP_002836742.1| hypothetical protein YG5714_0526 [Sulfolobus islandicus Y.G.57.14]
gi|228009058|gb|ACP44820.1| protein of unknown function DUF124 [Sulfolobus islandicus
Y.G.57.14]
Length = 232
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++ILG + Q ++ML E + A G M S+ M+ M FG
Sbjct: 4 YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55
Query: 85 VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ + ++ F+G L+ + P + + + G +L + +FLC+ N+V TL++
Sbjct: 56 ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILTESHSFLCAENNVNYDATLNK 114
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+GG EG + G G F+ A G +++K L GE + V+ S ++A + I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLQVEASHLMAFEEGMRYGI 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
+ G IR +F G+ L + GPG +++ +L +L + R + P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPSFNIRI 232
>gi|229584086|ref|YP_002842587.1| hypothetical protein M1627_0570 [Sulfolobus islandicus M.16.27]
gi|385772809|ref|YP_005645375.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
gi|385775278|ref|YP_005647846.1| hypothetical protein [Sulfolobus islandicus REY15A]
gi|228019135|gb|ACP54542.1| protein of unknown function DUF124 [Sulfolobus islandicus M.16.27]
gi|323474026|gb|ADX84632.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
gi|323476923|gb|ADX82161.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
Length = 232
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++ILG + Q ++ML E + A G M S+ M+ M FG
Sbjct: 4 YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55
Query: 85 VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ + ++ F+G L+ + P + + + G +L + +FLC+ N+V TL++
Sbjct: 56 ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILAESHSFLCAENNVNYDATLNK 114
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+GG EG + G G F+ A G +++K L GE + V+ S ++A + I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLQVEASHLMAFEEGMRYGI 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
+ G IR +F G+ L + GPG +++ +L +L + R + P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPGFNIRI 232
>gi|373118928|ref|ZP_09533043.1| TIGR00266 family protein [Lachnospiraceae bacterium 7_1_58FAA]
gi|371665851|gb|EHO31009.1| TIGR00266 family protein [Lachnospiraceae bacterium 7_1_58FAA]
Length = 226
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 110/227 (48%), Gaps = 15/227 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I+G V L+ E ++ GSM +MS +M+ME G FG+
Sbjct: 1 MKYEIIGEPMPVVVCDLEANEAMITERGSMVWMSPNMKMET------GAGGIGKAFGRMF 54
Query: 83 T-SVVLRNPGPSDGFVGIAA-----PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
+ + +N + G G+ A P + + ID A ++ Q FL S V++S
Sbjct: 55 SGESMFQNTYTAQGGAGMIAFASSFPGSIKAVEIDPAH---PVVVQKSGFLASEQGVELS 111
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
++ G EGF+ QKL+G+G AF+ G V+ L+ G+ + VD + + ++
Sbjct: 112 IFFQKKAGAGFFGGEGFIMQKLSGRGTAFLEVDGYAVEYELQAGQSMVVDSGNLAMMDAT 171
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
+ I+ ++ +FGG+ L VVTGPG + +Q++P ++ +A
Sbjct: 172 CTIDIQMVKGVKNVLFGGEGLFNTVVTGPGKILLQTMPVSGVAAALA 218
>gi|227826910|ref|YP_002828689.1| hypothetical protein M1425_0548 [Sulfolobus islandicus M.14.25]
gi|238619045|ref|YP_002913870.1| hypothetical protein M164_0580 [Sulfolobus islandicus M.16.4]
gi|227458705|gb|ACP37391.1| protein of unknown function DUF124 [Sulfolobus islandicus M.14.25]
gi|238380114|gb|ACR41202.1| protein of unknown function DUF124 [Sulfolobus islandicus M.16.4]
Length = 232
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++ILG + Q ++ML E + A G M S+ M+ M FG
Sbjct: 4 YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55
Query: 85 VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ + ++ F+G L+ + P + + + G +L + +FLC+ N+V TL++
Sbjct: 56 ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILAESHSFLCAENNVNYDATLNR 114
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+GG EG + G G F+ A G +++K L GE + V+ S ++A + I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLQVEASHLMAFEEGMRYGI 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
+ G IR +F G+ L + GPG +++ +L +L + R + P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPGFNIRI 232
>gi|428183260|gb|EKX52118.1| hypothetical protein GUITHDRAFT_150692 [Guillardia theta CCMP2712]
Length = 271
Score = 74.3 bits (181), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 104/225 (46%), Gaps = 17/225 (7%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGK 80
+ ++G QV L +++V+ PG M S SM M +++ + G+
Sbjct: 44 WSVVGESTQVLMAKLLSEQEVLTEPGGMMIHSSGIKPSMNMGGLWLGIRRA----FCAGE 99
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN-----DVKV 135
+ V +N + I ++++P++L F G + + A++ S+N D+K+
Sbjct: 100 SCCRVQWKNHESQPKIMTIHPSFPSKVVPVNLDEFQGSMYMKRAAWMASLNTQLEFDIKM 159
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+ + +G + G+GL F+ GG+V++K L GE + +D S +VA +
Sbjct: 160 APSCTA----AWCAGQGCCLTLVKGRGLTFLNVGGTVLKKTLRDGEKVFIDTSALVAWAT 215
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
+ + + G GG L +TGPG+V+IQS+ + +Q
Sbjct: 216 TATISAERAGNCCTICCGGMGLANTTITGPGVVYIQSMSREKAAQ 260
>gi|375084053|ref|ZP_09731063.1| hypothetical protein OCC_09481 [Thermococcus litoralis DSM 5473]
gi|374741219|gb|EHR77647.1| hypothetical protein OCC_09481 [Thermococcus litoralis DSM 5473]
Length = 222
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 56/199 (28%), Positives = 96/199 (48%), Gaps = 8/199 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ ++ L+ E + A PG+M MS ++E+E + L G++I R G
Sbjct: 13 LVEVELEDGEAIKAEPGAMVHMSPNVEIETKTGGVFKALKRSMLGGESIFINTFRARG-G 71
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G +G+A P + I ++L G L Q AFL S ++++ EG
Sbjct: 72 KGNIGLAPPYMGDIEALELR---GTLYAQSGAFLASSENIEIDTKFGGA--KTFFSREGL 126
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
KLTG+G F+ + G++ +K L E +D +VA T ++ +++ G I+ +FG
Sbjct: 127 FLLKLTGEGTVFLSSFGAIYKKELR-DERFIIDTGHLVAFTEGLDFKVRRVGGIKSTIFG 185
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV A G G ++IQ+
Sbjct: 186 GEGLV-AEFYGTGTLYIQT 203
>gi|365843251|ref|ZP_09384197.1| TIGR00266 family protein [Flavonifractor plautii ATCC 29863]
gi|364572827|gb|EHM50357.1| TIGR00266 family protein [Flavonifractor plautii ATCC 29863]
Length = 226
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 61/227 (26%), Positives = 110/227 (48%), Gaps = 15/227 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I+G V L+ E ++ GSM +MS +M+ME G FG+
Sbjct: 1 MKYEIIGEPMPVVVCDLEANEAMITERGSMVWMSPNMKMET------GAGGIGKAFGRMF 54
Query: 83 T-SVVLRNPGPSDGFVGIAA-----PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
+ + +N + G G+ A P + + ID A ++ Q FL S V++S
Sbjct: 55 SGESMFQNTYTAQGGAGMIAFASSFPGSIKAVEIDPAH---PVVVQKSGFLASEQGVELS 111
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
++ G EGF+ QKL+G+G AF+ G V+ L+ G+ + VD + + ++
Sbjct: 112 IFFQKKAGAGFFGGEGFIMQKLSGRGTAFLEVDGYAVEYELQAGQSMVVDSGNLAMMDAT 171
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
+ I+ ++ +FGG+ L VVTGPG + +Q++P ++ +A
Sbjct: 172 CTIDIQIVKGVKNVLFGGEGLFNTVVTGPGKILLQTMPVSGVAAALA 218
>gi|332652858|ref|ZP_08418603.1| conserved hypothetical protein [Ruminococcaceae bacterium D16]
gi|332518004|gb|EGJ47607.1| conserved hypothetical protein [Ruminococcaceae bacterium D16]
Length = 241
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 59/221 (26%), Positives = 111/221 (50%), Gaps = 10/221 (4%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT- 83
+++ GG V L E+++ GSM +MS +M+M+ G +F K +
Sbjct: 19 YEVKGGSFPVVVCNLDSGEQMITEKGSMVWMSPNMKMDT------RGGGLGKMFSKAFSG 72
Query: 84 SVVLRNP--GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ +N DG + + +I + +A G E++ Q AFL + + V++S +
Sbjct: 73 ESMFQNTYTAQGDGMIAFGSSFPGQIKVVTIAP-GQEMIVQKSAFLAAESGVELSIHFSK 131
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ G EGF+ Q+L+G+G+AF+ G +V+ L+ G+ I VD + V + I
Sbjct: 132 KLGAGFFGGEGFIMQRLSGRGIAFVEIDGELVEYELKPGQQIIVDTGNVAGFEPGVQMDI 191
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ ++ + GG+ L V+TGPG V++Q++P ++ I
Sbjct: 192 QQVPGLKNKLLGGEGLFNTVLTGPGRVWLQTMPICNVAASI 232
>gi|228967984|ref|ZP_04128993.1| Protein containing DUF124 [Bacillus thuringiensis serovar sotto
str. T04001]
gi|228791708|gb|EEM39301.1| Protein containing DUF124 [Bacillus thuringiensis serovar sotto
str. T04001]
Length = 135
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 64/129 (49%), Gaps = 14/129 (10%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E VVA G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPKESVVAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKIVCQKDAF 119
Query: 127 LCSVNDVKV 135
LC+ V +
Sbjct: 120 LCAAKGVSI 128
>gi|339442957|ref|YP_004708962.1| hypothetical protein CXIVA_18930 [Clostridium sp. SY8519]
gi|338902358|dbj|BAK47860.1| hypothetical protein CXIVA_18930 [Clostridium sp. SY8519]
Length = 224
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 102/204 (50%), Gaps = 11/204 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT--SVVLRNP- 90
V + L E + + G+M +M+ + M+ + G +F K I S N
Sbjct: 12 VLECSLSAGESMKCQKGAMAWMTRGITMQT------KAGGIGKMFKKAIAGESAFWNNYM 65
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
+DG + + +I+P+D+ ++ Q AFL S V+++ L +++ G
Sbjct: 66 AEADGQIAFSTTFPGKIIPVDVGTM--PIVAQKSAFLASEAGVEMNIYLQEKIGAGFFGG 123
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
EGF+ ++ +G+G F+ G V+K LE GE I VD + A+ S +++I+ I
Sbjct: 124 EGFIMEQFSGRGYVFLEIDGGFVEKQLEAGEEIVVDSGYLAAMDQSCSMRIERIKGIGNV 183
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLP 234
+ GG+ L ++TGPG V++Q++P
Sbjct: 184 MLGGEGLFNTIITGPGKVWLQTMP 207
>gi|332796149|ref|YP_004457649.1| hypothetical protein Ahos_0463 [Acidianus hospitalis W1]
gi|332693884|gb|AEE93351.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 227
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 60/222 (27%), Positives = 103/222 (46%), Gaps = 11/222 (4%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI+G + Q ++ L P EK+ A G + S S+ ++ + + L G T
Sbjct: 3 YQIIGDDIQYVKVFLNPGEKIYADAGHIMSKSSSVTIQAKMRGGILSAIKRELTGATFFV 62
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
L PG D IA +I+PI L G +L + +FL + + + L
Sbjct: 63 TELMGPGEVD----IAGVFPGKIVPIQLN--GKGILAESHSFLFAEDSINYDARLAPLAP 116
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
++GG EG +G G F+ A G + + NL G+ I ++ S ++A+ + +
Sbjct: 117 AILGG-EGLFLATFSGVGQVFLHAYGGLYEINLMPGQSIDIEASHLLALEEGMQFTVTRV 175
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
G + +FGG+ L V GPG V+I S+ +Q++A A+
Sbjct: 176 GGFKTMLFGGEGLYFVRVVGPGKVWIHSIT----AQQMASAL 213
>gi|423603451|ref|ZP_17579344.1| hypothetical protein IIK_00032 [Bacillus cereus VD102]
gi|401247430|gb|EJR53766.1| hypothetical protein IIK_00032 [Bacillus cereus VD102]
Length = 160
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 67/137 (48%), Gaps = 14/137 (10%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
GK + + V N G V AAP +I+P+DL + G+++CQ DAF
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119
Query: 127 LCSVNDVKVSNTLDQRV 143
LC+ V + +++
Sbjct: 120 LCAAKGVSIGIEFTKKI 136
>gi|325680701|ref|ZP_08160239.1| TIGR00266 family protein [Ruminococcus albus 8]
gi|324107481|gb|EGC01759.1| TIGR00266 family protein [Ruminococcus albus 8]
Length = 225
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 58/218 (26%), Positives = 102/218 (46%), Gaps = 3/218 (1%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I G V L E + + G+M +MS +M M + + G++I
Sbjct: 3 YEIQGEPLPVVICYLANGEAIKCQQGAMSWMSPNMNMSTNAGGGIGKALSRAFSGESIFQ 62
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
V DG + +A+ I P+D++ ++ Q AFL S V + +R
Sbjct: 63 NVYTAMN-GDGMIAMASNFPGSIKPMDVSQM--PIVAQKSAFLASEMGVNMEIFFQKRFG 119
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G EGF+ QK +G G F+ GS+++ L G+ + VD C+ A+ +S ++ I+
Sbjct: 120 AGFFGGEGFIMQKFSGNGTVFLELDGSIIEYQLAAGQSMLVDTGCLAAMEASCSIDIEQV 179
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
++ FGG+ +TGPG V++Q++P L+ I
Sbjct: 180 SGLKNMFFGGEGFFNTRITGPGHVWLQTMPVSNLAGAI 217
>gi|196233638|ref|ZP_03132479.1| protein of unknown function DUF124 [Chthoniobacter flavus Ellin428]
gi|196222308|gb|EDY16837.1| protein of unknown function DUF124 [Chthoniobacter flavus Ellin428]
Length = 230
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 61/224 (27%), Positives = 99/224 (44%), Gaps = 10/224 (4%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENI----YIPENEVGMWQWLFGK 80
+I+G V I L E VVA G + +M+ S++M + + + G
Sbjct: 3 HEIIGTTMPVLSISLAKGESVVAVSGELSWMTSSIQMNTTTNAAGGGGFFGAIKRAVGGG 62
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNT 138
++ R G G V A I+PI+L+ G+ + +LC V++
Sbjct: 63 SLFMTEYRAEG-DHGVVAFATKLPGHIVPIELS---GDKTYMIHRHGYLCGTPGVELGVG 118
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
Q + + G GF+ QK+ G G A+I GG +V +L GE + V + +V
Sbjct: 119 FQQSLGAGVFGGSGFVMQKMAGTGSAWIELGGEIVTYDLAAGETLRVHPGHVGMFEGNVK 178
Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+I I +FGGD + A +TGPG V++Q+L L+ I
Sbjct: 179 FEITRIKGIANMLFGGDGVFLAALTGPGRVWLQTLTVSNLAHAI 222
>gi|325968543|ref|YP_004244735.1| HTH DNA-binding protein [Vulcanisaeta moutnovskia 768-28]
gi|323707746|gb|ADY01233.1| HTH DNA-binding protein [Vulcanisaeta moutnovskia 768-28]
Length = 225
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 58/212 (27%), Positives = 99/212 (46%), Gaps = 7/212 (3%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+I F+I+G + Q + L P +++ A G + + S S+ ++ G+ + + G +
Sbjct: 1 MINFKIMGNDIQHLYVELGPSDRIYAEGGHLIWKSSSVNIKATTGSGFMAGLKRAITGAS 60
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+ L PG D IA + +I+ IDL G ++ + AFL V L
Sbjct: 61 FFVLELEGPGAID----IAGFAPGKIVEIDLG--GNGVMVEHRAFLAMEPTVNYDVKLVG 114
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+GG EG L +L G G F+ A G + L GE I V+ ++A + V I
Sbjct: 115 LGFGWLGG-EGLLMARLQGNGRVFLHAIGDALMLELRPGESIDVEAGHVLAFDEGMRVSI 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
+ G +R +FG + L A + GPG ++++L
Sbjct: 174 RRVGGLRTMLFGEEGLWLAHIEGPGKAWLRTL 205
>gi|402309879|ref|ZP_10828850.1| TIGR00266 family protein [Eubacterium sp. AS15]
gi|400370260|gb|EJP23250.1| TIGR00266 family protein [Eubacterium sp. AS15]
Length = 226
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 56/214 (26%), Positives = 102/214 (47%), Gaps = 3/214 (1%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI GG I L+ E +V+ G + G + E + + G+ +
Sbjct: 3 YQIEGGSLPAAVIQLEKGETLVSELGGRTWAKGPITTETKAPGGIGKSIGRMFSGENLFM 62
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
GP + +G A+ I+ +LA G ++CQ AFLC+ + V++S +++
Sbjct: 63 SYYTAEGPVE--IGFASSFAGSIVVKELAA-GQSIICQKSAFLCATSQVELSIFFQKKLG 119
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G EGF+ QK+TG F+ G V + +L GE + D + + + ++ ++
Sbjct: 120 AGFFGGEGFIMQKVTGPATVFLELDGYVKEYDLVQGEELVCDTGVVALMDETCSMDVRMV 179
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
++ +FGG+ L+ V+ GPG V+IQ++ RL
Sbjct: 180 KGLKNKLFGGEGLIDTVIIGPGKVYIQTMSLPRL 213
>gi|217077739|ref|YP_002335457.1| hypothetical protein THA_1679 [Thermosipho africanus TCF52B]
gi|419760339|ref|ZP_14286618.1| hypothetical protein H17ap60334_05724 [Thermosipho africanus
H17ap60334]
gi|217037594|gb|ACJ76116.1| conserved hypothetical protein [Thermosipho africanus TCF52B]
gi|407514442|gb|EKF49257.1| hypothetical protein H17ap60334_05724 [Thermosipho africanus
H17ap60334]
Length = 224
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 61/211 (28%), Positives = 93/211 (44%), Gaps = 24/211 (11%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN----IY--IPENEVGMWQWLFGKTIT 83
G + + L+ E +VA PG+M M G +E+ I+ + +G + K I+
Sbjct: 9 GSYAILFVELEIGETIVAEPGAMVSMEGDIEVNTSTGGIFKALKRAFLGGEHFFLNKYIS 68
Query: 84 SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFG--GELLCQPDAFLCSVNDVKVSNTLDQ 141
+ S A +LP D+ + Q ++L S + +
Sbjct: 69 HSK-------------SKISFAPLLPGDIEVIDIKNGFFLQSTSYLASQETISIDTKFGG 115
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ G EGF KL+G G A I A G + K LE GE ITVD IV +V +
Sbjct: 116 -FKTFFSG-EGFFLLKLSGYGKAAISAFGGIYHKELEPGEAITVDTGHIVGFDETVKYSV 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ G I+ +FGG+ LV +V GPG V+IQ+
Sbjct: 174 RTFGGIKSTLFGGEGLVVDLV-GPGRVYIQT 203
>gi|347757146|ref|YP_004864708.1| hypothetical protein MICA_361 [Micavibrio aeruginosavorus ARL-13]
gi|347589664|gb|AEP08706.1| conserved hypothetical protein [Micavibrio aeruginosavorus ARL-13]
Length = 251
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 60/247 (24%), Positives = 107/247 (43%), Gaps = 25/247 (10%)
Query: 4 PFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN 63
P ST F + Q F+I G ++ ++P + V G++ S S++M++
Sbjct: 7 PKASTAFNG----TAQTHGFEFRINGAHTPYLELHMQPGQAFVGEAGTLMGQSDSVKMKS 62
Query: 64 IYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVG---------IAAPSLARILP----- 109
+ G LF V+R G + F+ + SLA LP
Sbjct: 63 TLGDGSNSGFLSSLF-----RSVMRKMGGENFFINQYVNMASDKVGKLSLAAALPGEIVA 117
Query: 110 IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAG 169
ID+A G ++ Q +FL + ++ + + + ++ + Q++ G+ F+ A
Sbjct: 118 IDIAEHGKRIVAQRGSFLAAEQGAQLGSKMKLSLFGMLS--NSVVMQEIKGEAWTFLNAP 175
Query: 170 GSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVF 229
G+V L+ GEV D C+VA + SV + + +G + V GG+ A GPG ++
Sbjct: 176 GNVQCIQLDAGEVYKTDARCLVAASDSVKLGMALSGGLLTMVTGGEGAFVAKAEGPGTIW 235
Query: 230 IQSLPFH 236
+ S PF
Sbjct: 236 VGSKPFE 242
>gi|319935941|ref|ZP_08010365.1| hypothetical protein HMPREF9488_01196 [Coprobacillus sp. 29_1]
gi|319809055|gb|EFW05551.1| hypothetical protein HMPREF9488_01196 [Coprobacillus sp. 29_1]
Length = 225
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 54/225 (24%), Positives = 105/225 (46%), Gaps = 9/225 (4%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G V LK + +++ G+M +M MEM G FG+
Sbjct: 1 MEYEIKGVPLPVVICYLKKGDVMLSDSGAMAWMDPCMEMGT-----TSNGGIGKAFGRMF 55
Query: 83 TSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
+ + +DG + ++ I D+ G E++ Q ++L S V+ +
Sbjct: 56 SGETMFLNSYTANADGKIAFSSSFPGEIRKYDIEP-GKEIVIQKSSYLASEPTVERTIFF 114
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ I G EGF+ KL+G+G F+ G+ V+ +L G+ I VD + + ++ +
Sbjct: 115 NKKAMTGIFGGEGFIMNKLSGKGTVFVEIDGATVEYDLAPGQKIVVDTGYVAVMDATCTM 174
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ ++ +FGG+ VVTGPG V +Q++P ++ +A+
Sbjct: 175 TTQSVPGLKNKMFGGEGFFNTVVTGPGHVVLQTMPISAIAGSLAQ 219
>gi|451343926|ref|ZP_21912992.1| TIGR00266 family protein [Eggerthia catenaformis OT 569 = DSM
20559]
gi|449337501|gb|EMD16663.1| TIGR00266 family protein [Eggerthia catenaformis OT 569 = DSM
20559]
Length = 225
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 103/224 (45%), Gaps = 3/224 (1%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G + L E +++ G+M +M+ ++ME + + G+T+
Sbjct: 1 MKYEIKGSPFPAVLLNLNQGESMLSDSGAMAWMTPGIQMETTSNGGVGKALGRAFSGETL 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ + +DG + ++ I + L G ++ Q AFL S V ++
Sbjct: 61 --FINKYTAMADGQIAFSSSFPGEIRAVQLQP-GQGIVLQKSAFLASTTGVSREVFFSRK 117
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ G EGF+ + G GL F+ G+ V+ +L G+ + +D + A++ + + I+
Sbjct: 118 FSTGLFGGEGFIMSRYIGPGLLFLEIDGTAVEYDLAAGQQMIIDSGYLAAMSDTCKMDIQ 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ GG+ L VVTGPG + IQ++P L+ I R +
Sbjct: 178 RIKGMKNIFLGGEGLFNTVVTGPGHIIIQTMPISSLAGTIGRYI 221
>gi|15898554|ref|NP_343159.1| hypothetical protein SSO1755 [Sulfolobus solfataricus P2]
gi|284174124|ref|ZP_06388093.1| hypothetical protein Ssol98_05607 [Sulfolobus solfataricus 98/2]
gi|384434925|ref|YP_005644283.1| hypothetical protein [Sulfolobus solfataricus 98/2]
gi|13814997|gb|AAK41949.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
gi|261603079|gb|ACX92682.1| protein of unknown function DUF124 [Sulfolobus solfataricus 98/2]
Length = 232
Score = 71.6 bits (174), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 106/240 (44%), Gaps = 14/240 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++ILG + Q ++ML E + G M S+ M+ + + + G +
Sbjct: 4 YEILGKDLQYLKVMLGEGEAIYVDAGHMIAKQSSVTMKTTMRGGFFGALKREITGGSFFV 63
Query: 85 VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
L PG L+ + P + + + G +L + +FLC+ N V TL++
Sbjct: 64 TELYGPGEV---------YLSGVFPGPIVQIQLQGNGILAESHSFLCAENGVNYDATLNK 114
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+GG EG K G G F+ A G ++++ L+ GE + V+ + ++A + I
Sbjct: 115 LSVGWLGG-EGIFLAKFRGTGNVFLHAYGGLIERYLDPGERLQVEAAHLMAFEEGMRYGI 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
+ G I+ +F G+ L + GPG V++ +L +L+ + R + P F + I
Sbjct: 174 QRVGGIKSILFSGEGLFFVDIEGPGKVWLHTLTVEQLAASL-RPYLPQGQQGGPSFNIGI 232
>gi|15920866|ref|NP_376535.1| hypothetical protein ST0645 [Sulfolobus tokodaii str. 7]
gi|15621650|dbj|BAB65644.1| hypothetical protein STK_06450 [Sulfolobus tokodaii str. 7]
Length = 230
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 96/221 (43%), Gaps = 13/221 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+QI G + Q +++L P E + A G M S+ ++ G+ + L G +
Sbjct: 4 YQIFGNDLQYLKVILAPGEGIYADAGHMIAKQASVTLQTRLRGGILSGLKRVLTGGSFFV 63
Query: 85 VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
PG L+ I P I + + G +L + FL + N V L +
Sbjct: 64 TEFYGPGEL---------YLSGIFPGKIIQIPLEGRGILAEAHTFLAAENSVNYDTQLAK 114
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+GG EG K G G F+ + G V+ ++L GE++ V+ S ++A +N +
Sbjct: 115 LTAGWLGG-EGLFLAKFRGFGNVFLHSYGDVIVRDLAPGEILQVEASHLMAFQEGMNYDV 173
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ G +R F + L + GPG V++ +L +L+ +
Sbjct: 174 QLVGGLRSIFFAHEGLFFVTIQGPGRVWLHTLTAEQLASAL 214
>gi|150021217|ref|YP_001306571.1| hypothetical protein Tmel_1337 [Thermosipho melanesiensis BI429]
gi|149793738|gb|ABR31186.1| protein of unknown function DUF124 [Thermosipho melanesiensis
BI429]
Length = 224
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 59/205 (28%), Positives = 95/205 (46%), Gaps = 12/205 (5%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
G + + L+P E ++ PG+M MSG +E+E + +L G+ I +
Sbjct: 9 GSYSILFVDLEPGEIIIGEPGAMVTMSGDLEIETSTGGVFKALKRAFLGGEHIFLNKYKA 68
Query: 90 PGPSDGFVGIAAPSLARILP--IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
S S A LP ID+ G + Q +FL S + + +T ++
Sbjct: 69 KNKS-------RISFAPKLPGDIDIIDLNGTVYLQSTSFLASEEGINL-DTKFTGFKSFF 120
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
G EGF KL+G G I + G++ L+ E IT+D +VA SVN ++ G
Sbjct: 121 SG-EGFFLLKLSGYGKLAISSFGAIFSLELKEDEKITIDTGHVVAFDESVNYSVRTFGGF 179
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
+ +FGG+ LV + GPG ++IQ+
Sbjct: 180 KSTIFGGEGLVVDFI-GPGKIYIQT 203
>gi|428174779|gb|EKX43673.1| hypothetical protein GUITHDRAFT_140416 [Guillardia theta CCMP2712]
Length = 234
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 96/218 (44%), Gaps = 42/218 (19%)
Query: 24 PFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT 83
P+QI+G ++Q+ L P ++V+ PG C + ++ + +N +G +L + I
Sbjct: 29 PWQIIGTDSQLIVFRLAPGQEVMMEPG--CMVHADDGVKPVTSLQNPLG--AFLADEDIF 84
Query: 84 SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
+ +N +D + AA ++I+P+DL+++ G + Q +FL +
Sbjct: 85 RLKWKNESSTDKTLSAAAMVPSKIIPLDLSVYSG-ITIQKGSFLAT-------------- 129
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
L ++ KNL GE D C++AV+S+V+V I+
Sbjct: 130 -----------------------LGRWQIMTKNLAPGEKYIFDTDCMLAVSSTVDVSIRR 166
Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQR 241
G FGG L TGPG++ Q+LP + R
Sbjct: 167 AGGCAMMCFGGKGLFNTEYTGPGLIIFQTLPAAKSGAR 204
>gi|297526400|ref|YP_003668424.1| hypothetical protein Shell_0393 [Staphylothermus hellenicus DSM
12710]
gi|297255316|gb|ADI31525.1| protein of unknown function DUF124 [Staphylothermus hellenicus DSM
12710]
Length = 224
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 66/227 (29%), Positives = 110/227 (48%), Gaps = 31/227 (13%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP-- 90
+ Q++L+P E V A PG+M +M+G +I + + G+ GK I +L
Sbjct: 12 SLLQVVLEPNENVTAEPGAMMYMAG-----DINVKTHTGGL-----GKAIARKLLGGESI 61
Query: 91 -------GPSDGFVGIAAPSLARILPIDL---AMFGGE-LLCQPDAFLCSVNDVKVSNTL 139
GP G V A PSL P D+ A+ G L+ Q ++L D+K+S
Sbjct: 62 FMNTFIAGPRGGEVWFA-PSL----PGDIKYIALNGSRNLIIQDTSYLAHHGDIKLSVAW 116
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+ +R ++ E F + +G G +I + G +++K L VGE I +D VA+ +
Sbjct: 117 -RGLRGLLAEGEMFW-LRASGVGGVWINSYGGIIEKELGVGEKIIIDNFHFVAMDDGMKW 174
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
I+ G ++ +FGG+ LV V GPG +++Q+ Q IA+ +
Sbjct: 175 NIRRFGGLKSFLFGGEGLVIE-VEGPGRIYVQTRSLPPFMQLIAKYI 220
>gi|257076770|ref|ZP_05571131.1| HTH DNA-binding protein [Ferroplasma acidarmanus fer1]
Length = 219
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/229 (24%), Positives = 101/229 (44%), Gaps = 17/229 (7%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
++ ++I GG+ Q L+ E PG + + + ++ G + F
Sbjct: 1 MVKYEIRGGDVQYLHATLEAGESAYIEPGHLIYKTPGARLDT------GTGGLRGAFSHM 54
Query: 82 ITS---VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
+ +L+ GP + G A ++ I L + + +AFLC + +K S
Sbjct: 55 LAGSAVFLLKVDGPGN--FGSAGFLPGKVFEIKLN--NNSIFAEFNAFLCMDSTIKYSTK 110
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+ + GG EG ++ +G G + G V++ L+ GE I ++S ++A SSV+
Sbjct: 111 FAGFWQAIFGG-EGIFLEEFSGTGSVMLHGHGQVIEYELKDGEEIQAELSHVLAFESSVS 169
Query: 199 VQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ G I+ V G G+ A + GPG V++ S+ LS ++AR
Sbjct: 170 YNVSRIGGIKTMVLGGMEGEGFFFANMRGPGKVWLHSISLLELSAKLAR 218
>gi|393796012|ref|ZP_10379376.1| hypothetical protein CNitlB_06707 [Candidatus Nitrosoarchaeum
limnia BG20]
Length = 236
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/225 (24%), Positives = 104/225 (46%), Gaps = 22/225 (9%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ +QIL + ++ LK E + A G++ FM G ++++ + G+ KT+
Sbjct: 1 MEYQILKNPMSLLEVNLKKDESITAEAGALVFMQGDIQID----TKTRSGIL-----KTL 51
Query: 83 TSVVLRNP--------GPSDG-FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+L N DG +G+ P + I+ I + G + Q +++ S DV
Sbjct: 52 KVSILGNESFFVNKYTAKHDGCILGLTGPPVGDIIDIPINSNAG-FIVQSGSYIASTPDV 110
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+ + + G E F+ K TG G F A G +++K + GE +D +VA+
Sbjct: 111 DIDTKWQGFTKGIFGS-ELFML-KATGDGTVFCNAYGGIIKKQILSGETFVLDNYHLVAL 168
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
+ + + ++ G ++ + GG+ LVT +TGP V++Q+ L
Sbjct: 169 SINSDYEVTKFGGLKNTILGGEGLVTK-ITGPATVYLQTKNLKEL 212
>gi|315923941|ref|ZP_07920169.1| protein of hypothetical function DUF124 [Pseudoramibacter
alactolyticus ATCC 23263]
gi|315622781|gb|EFV02734.1| protein of hypothetical function DUF124 [Pseudoramibacter
alactolyticus ATCC 23263]
Length = 224
Score = 69.3 bits (168), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 107/224 (47%), Gaps = 8/224 (3%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I GG V L E + G M +M ++EM+ ++ M + + G+++
Sbjct: 1 MQYKISGGNLPVAICKLAAGESIKCESGGMSWMDNTIEMQTKGGGFGKM-MGRMVSGESM 59
Query: 83 --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+ R S+G + A+ I I ++ G L+ Q AFL V++
Sbjct: 60 FFNHYIAR----SEGEIAFASSFPGEIKAIHISP-GNSLIAQKGAFLACQESVEMKIFFQ 114
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+++ + G EGF+ Q+ G+GL FI GS + +L GE VD +VA+ SV +
Sbjct: 115 RKLGSGFFGGEGFIMQRYIGEGLVFIEIDGSAITYDLAAGETKVVDTGYLVAMEESVKMD 174
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
I ++ + GG+ L +TGPG + +Q++P ++ + I R
Sbjct: 175 IVRVKGVKNVLLGGEGLFNTTLTGPGKIVLQTMPINKTAAVIHR 218
>gi|352681258|ref|YP_004891782.1| hypothetical protein TTX_0024 [Thermoproteus tenax Kra 1]
gi|350274057|emb|CCC80702.1| conserved hypothetical protein [Thermoproteus tenax Kra 1]
Length = 225
Score = 68.9 bits (167), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 53/209 (25%), Positives = 98/209 (46%), Gaps = 7/209 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
F+ILG + Q + L P E++ A G + + S + + G+ + L G +
Sbjct: 4 FRILGSDIQHLLVTLGPGERIYAEGGHLIWKSSTARLNATVRGGLLAGLKRALTGASFFV 63
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ + PG D +A + +I+ I+L G +L + +FL + + V+ +L
Sbjct: 64 LEVEGPGQVD----LAGFAPGKIVEIEL--HGSSILAEHRSFLAAEDIVEYDASLVGLGF 117
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+GG EG L + G G F+ A G ++ L GE I + ++A V ++
Sbjct: 118 GWLGG-EGLLMARFRGNGRVFLHAVGDALELRLGSGEAIDAEAGHVLAFEEGVRATVRRV 176
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
G ++ +FG + L A + GPG V++++L
Sbjct: 177 GGLKTMLFGEEGLWFAHLEGPGRVWLRTL 205
>gi|429219791|ref|YP_007181435.1| hypothetical protein Deipe_2183 [Deinococcus peraridilitoris DSM
19664]
gi|429130654|gb|AFZ67669.1| hypothetical protein Deipe_2183 [Deinococcus peraridilitoris DSM
19664]
Length = 228
Score = 68.9 bits (167), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 70/136 (51%), Gaps = 3/136 (2%)
Query: 100 AAP-SLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKL 158
AAP +L I DL+ G LL AFL + +V +S T+ +R + G G Q L
Sbjct: 87 AAPGTLGHIRAWDLSQ--GPLLTTRGAFLAAWGEVDISVTVARRAGAALFGGAGLFLQHL 144
Query: 159 TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLV 218
+GQG I G +++ L GE V + A ++SV+ I+ G +R+ +FGG+ L
Sbjct: 145 SGQGTVLIQGSGDLLEHKLHGGESQLVSTGHLAAFSASVDYGIEAVGGVRKLLFGGEGLF 204
Query: 219 TAVVTGPGIVFIQSLP 234
+ GPG ++QSL
Sbjct: 205 MTRLCGPGTAYLQSLK 220
>gi|453328726|dbj|GAC89022.1| hypothetical protein NBRC3255_2683 [Gluconobacter thailandicus NBRC
3255]
Length = 221
Score = 68.6 bits (166), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 93/217 (42%), Gaps = 13/217 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV------VL 87
V +I+L+P+E ++A G + S + M+ + + G FG ++ +
Sbjct: 3 VLRILLEPRESILAETGEFAWKSSHVVMQTVTGGAGQSG----FFGVVTRALAGGGLFMT 58
Query: 88 RNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R PS G V A I L G L F+ DV++ + +
Sbjct: 59 RFTAPSQPGLVAFTAKIPGHITEHTLQ--GQAYLVHRHGFIAGTEDVRMELAFQRNLGAG 116
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
+ G +GF Q+L+G G + GG V+ +L GE I V + SV I
Sbjct: 117 VFGGDGFRLQRLSGSGRFWCSLGGEVITHDLAPGEQIDVHPGHVGMFEESVQFDITMIPG 176
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
+R +FGGD A +TGPG +++Q++ L+ +A
Sbjct: 177 LRNKLFGGDGFFMARLTGPGRIWLQTMTPAGLAHSLA 213
>gi|154249582|ref|YP_001410407.1| hypothetical protein Fnod_0898 [Fervidobacterium nodosum Rt17-B1]
gi|154153518|gb|ABS60750.1| protein of unknown function DUF124 [Fervidobacterium nodosum
Rt17-B1]
Length = 224
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 59/210 (28%), Positives = 99/210 (47%), Gaps = 22/210 (10%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL-----FGKTITS 84
G + +I + P E PG+M +M G +E++ + G+W+ L G++
Sbjct: 9 GSYALLKIFIGPGESAKVEPGAMVYMKGPVEVQT-----STGGVWKALKRAILGGESFFM 63
Query: 85 VVLRNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ G +G VG+A P ++P++ +F Q +FL S V + +
Sbjct: 64 NTYVSHG--NGEVGVAPELPGDIDVVPLNQTLF-----VQSTSFLASDTSVDIDVSFGG- 115
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
++ G EG KL G G I + G + L+ GE +T+D +VA SVN ++
Sbjct: 116 FKSFFSG-EGIFLLKLQGYGDVAISSFGGIKLIELQPGEQMTIDTGHVVAFDGSVNYSVR 174
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
G ++ +FGG+ LV TGPG V+IQ+
Sbjct: 175 TFGGLKSTIFGGEGLV-CTFTGPGRVYIQT 203
>gi|158422039|ref|YP_001523331.1| hypothetical protein AZC_0415 [Azorhizobium caulinodans ORS 571]
gi|158328928|dbj|BAF86413.1| protein of unknown function [Azorhizobium caulinodans ORS 571]
Length = 232
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 63/123 (51%)
Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
+ FL ++ ++ Q + + + G GF+ Q+LTGQG AFI GG +V L+
Sbjct: 101 MIHRHGFLAGSAELTLAIGFQQNLGSGVFGGNGFVLQRLTGQGTAFIELGGEIVSYELQP 160
Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
G+ + V + SV I IR +FGGD L A +TGPG V++Q+L +L+
Sbjct: 161 GQQLLVHPGHVGMFEESVGFDITMMRGIRNVLFGGDGLFLAQLTGPGRVWLQTLTPSKLA 220
Query: 240 QRI 242
+
Sbjct: 221 HAL 223
>gi|414343168|ref|YP_006984689.1| hypothetical protein B932_2196 [Gluconobacter oxydans H24]
gi|411028503|gb|AFW01758.1| hypothetical protein B932_2196 [Gluconobacter oxydans H24]
Length = 221
Score = 68.2 bits (165), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 93/217 (42%), Gaps = 13/217 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV------VL 87
V +I+L+P+E ++A G + S + M+ + + G FG ++ +
Sbjct: 3 VLRILLEPRESILAETGEFAWKSSHVVMQTVTGGAGQSG----FFGVVTRALAGGGLFMT 58
Query: 88 RNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R PS G V A I L G L F+ DV++ + +
Sbjct: 59 RFTAPSQPGLVAFTAKIPGHITEHTLQ--GQAYLVHRHGFIAGTEDVRMELAFQRNLGAG 116
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
+ G +GF Q+L+G G + GG V+ +L GE I V + SV I
Sbjct: 117 VFGGDGFRLQRLSGSGRFWCSLGGEVITHDLAPGEQIDVHPGHVGMFEESVQFDITMILG 176
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
+R +FGGD A +TGPG +++Q++ L+ +A
Sbjct: 177 LRNKLFGGDGFFMARLTGPGRIWLQTMTPAGLAHSLA 213
>gi|89894010|ref|YP_517497.1| hypothetical protein DSY1264 [Desulfitobacterium hafniense Y51]
gi|89333458|dbj|BAE83053.1| hypothetical protein [Desulfitobacterium hafniense Y51]
Length = 82
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 43/62 (69%)
Query: 177 LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
++ GE I +D C+VA+T +VN I++ G I+ A+FGG+ L +TGPG V++QSLPF
Sbjct: 1 MQPGETIQLDTGCLVAMTKTVNHDIRFAGDIKNALFGGEGLALGTLTGPGTVWLQSLPFC 60
Query: 237 RL 238
L
Sbjct: 61 SL 62
>gi|407464224|ref|YP_006775106.1| hypothetical protein NSED_01760 [Candidatus Nitrosopumilus sp. AR2]
gi|407047412|gb|AFS82164.1| hypothetical protein NSED_01760 [Candidatus Nitrosopumilus sp. AR2]
Length = 223
Score = 67.0 bits (162), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 57/221 (25%), Positives = 99/221 (44%), Gaps = 27/221 (12%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I+ + + L EK+ A +M +M G++ E M + F K++
Sbjct: 1 MEYEIVKNPMGLIEFSLNKNEKITAEAAAMVYMKGNI--------VTETRMRKGGFFKSL 52
Query: 83 TSVVLRNPG---------PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ L D +G+ L I ID+ E + Q +F+ S DV
Sbjct: 53 KAAALGGESFFVNEFIAEEDDCKLGLTGNMLGDIEVIDVDE---EFIVQSGSFVGSTTDV 109
Query: 134 KVSNTLDQRVRNVIGGIEG--FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
TLD + GI G K G G F+ G +++K L GE + +D +V
Sbjct: 110 ----TLDTAWQGFTKGIFGSNLFMLKTVGAGKMFVNGWGGILKKQLGSGEKMILDNYQLV 165
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
A++S+ ++ +G ++ +FGGD LV +V GPG V++Q+
Sbjct: 166 ALSSTAGYRVTKHGSMKTTLFGGDALVIEIV-GPGTVYLQT 205
>gi|379010233|ref|YP_005268045.1| hypothetical protein Awo_c03550 [Acetobacterium woodii DSM 1030]
gi|375301022|gb|AFA47156.1| hypothetical protein Awo_c03550 [Acetobacterium woodii DSM 1030]
Length = 226
Score = 66.6 bits (161), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 53/226 (23%), Positives = 102/226 (45%), Gaps = 6/226 (2%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++I G V + L+ E + G M +MS ++ME G+ + L G ++
Sbjct: 1 MKYEIKGDNLPVVIVTLENGEAIFTESGGMSWMSEHIKMETNTKGGLIKGLRRSLSGDSL 60
Query: 83 --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
T+ F + P + + + G ++CQ +FL + V
Sbjct: 61 FMTTYTAERENSQIAFCS-SFPGEIKAITLQA---GQSIICQKRSFLAGAATLDVDMYFK 116
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+ + I G EGF+ Q+++G G F+ G++++ +L G+V+ VD I SV+
Sbjct: 117 KNIGVGIFGGEGFILQRISGPGTVFVEMDGALMEYDLAPGQVMKVDQGHIGMFEESVSFD 176
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
I+ ++ G+ L A + GPG ++Q++P L Q I+ ++
Sbjct: 177 IQAVKGMKNIFLSGEGLFLATLKGPGKAWLQTMPLSNLVQLISASL 222
>gi|340344480|ref|ZP_08667612.1| DUF124 domain containing protein [Candidatus Nitrosoarchaeum
koreensis MY1]
gi|339519621|gb|EGP93344.1| DUF124 domain containing protein [Candidatus Nitrosoarchaeum
koreensis MY1]
Length = 230
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 85/167 (50%), Gaps = 5/167 (2%)
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
+G++ P + I+ I + G + Q ++ S + V + +T Q I G E F+
Sbjct: 59 LGLSGPPVGDIIEIPITAEQG-FIIQSGTYVASTSSVDL-DTKWQGFSKGIFGTELFML- 115
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
K TG+G F+ A G +++K LE GE +T+D +VA++++ + + G I+ +F G+
Sbjct: 116 KATGKGSVFVNAYGGIIKKELESGETMTLDNYHLVALSNNSHYVVTKFGGIKSTLFSGEG 175
Query: 217 LVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAI 263
LVT +TGP IV+ Q+ L I+ T N N Q A+
Sbjct: 176 LVTE-ITGPAIVYFQTKNLKELMDLISGEHTKDN-SSNTHTMAQAAM 220
>gi|255513447|gb|EET89713.1| protein of unknown function DUF124 [Candidatus Micrarchaeum
acidiphilum ARMAN-2]
Length = 230
Score = 65.5 bits (158), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 99/222 (44%), Gaps = 11/222 (4%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
++I+GG Q +I L+P EK+ + G + S +++M P G+ + K +
Sbjct: 3 YEIIGGSMQALRITLEPGEKIYSDSGKLISKSKTIKM----TPRLAGGLVGAIERKMTGA 58
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLD 140
+ +DG G A S++ ++P + M G E + + AF+ + VK S
Sbjct: 59 TAMVTEFEADGAEGSA--SMSGVIPGKIYMINLNEGEEFVAEHYAFVAAQESVKFSIETV 116
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
GG G + QK G G FI G +++ +L+ + +D I SS+ +
Sbjct: 117 GIGAAFFGGA-GLVLQKFVGPGAVFIHVTGDIIEYDLDGNNPLEIDPGHIAGFDSSLKYK 175
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I + IR A+FGG L A G G V S+ +L+ +
Sbjct: 176 ITFVDNIRTAMFGGVGLFLAKFEGRGKVVAHSVSRFKLASEL 217
>gi|124027471|ref|YP_001012791.1| hypothetical protein Hbut_0586 [Hyperthermus butylicus DSM 5456]
gi|123978165|gb|ABM80446.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
Length = 220
Score = 65.1 bits (157), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 102/220 (46%), Gaps = 29/220 (13%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V ++ L+P E+V + PG++ + G +E+ ITS +LR S
Sbjct: 13 VLKVELEPGEEVWSEPGALMLLRGDVEVRTQ--------------SGGITSGLLRKLAGS 58
Query: 94 DGFV--GIAAPSLARI-----LP--IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
+ F A S A + LP ID GE + Q ++L D+KV+ + + +R
Sbjct: 59 ESFFLNRYRARSRAEVWFAPSLPGDIDTIEVNGEWVIQDASYLAHYGDLKVTASF-RGLR 117
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+I E F K+ G G+ ++ + G + + + GE + +D VA+ + ++
Sbjct: 118 GLIAEGELFW-LKVEGHGIVWVNSYGGIERVEVGPGERLVIDNYHFVAMPADTRYTVRKF 176
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSL---PFHRLSQR 241
G ++ +FGG+ V VTGP V++Q+ PF RL R
Sbjct: 177 GGLKTFLFGGEGFVIE-VTGPATVYVQTRILPPFARLLAR 215
>gi|14325670|dbj|BAB60573.1| hypothetical protein [Thermoplasma volcanium GSS1]
Length = 222
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 105/229 (45%), Gaps = 15/229 (6%)
Query: 21 AVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
+++ ++I G E Q + L P+E V PG + + S+ + L G
Sbjct: 3 SMVDYEIAGQEVQYLKAKLSPEEYVYIEPGHLIYKDASVRL-GASAGGLRGAFSHLLSGS 61
Query: 81 TITSVVLRNPGP--SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
+ + + PG S GF+ P +++ IDL G ++ + +AFL + V S
Sbjct: 62 AVFLLKVEGPGEIVSAGFL----P--GKVVRIDLN--GNSIIAEFNAFLAMDSTVSYSTK 113
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+ + GG EG ++ +G G F+ G + K+L+ GE I ++S ++A +
Sbjct: 114 FAGITQAIFGG-EGVFLERFSGNGSVFLHGHGLPIVKDLKPGEEIQAEMSHLLAFEDGME 172
Query: 199 VQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
I+ G ++ + G+ L A + GPG V++ ++ +L+ ++ R
Sbjct: 173 YTIERIGGLKTMLLAGAEGEGLFFARIRGPGRVWLHTISLFQLAAKLMR 221
>gi|242398201|ref|YP_002993625.1| hypothetical protein TSIB_0209 [Thermococcus sibiricus MM 739]
gi|242264594|gb|ACS89276.1| hypothetical protein TSIB_0209 [Thermococcus sibiricus MM 739]
Length = 210
Score = 64.7 bits (156), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/199 (26%), Positives = 95/199 (47%), Gaps = 8/199 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ ++ L+ E + A PG+M MS ++++E + L G++I R
Sbjct: 1 MVEVELEEGEAIKAEPGAMVHMSPTIKIETKTGGVFKALKRSMLGGESIFINTFRAE-EG 59
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G +G+A + I +++ G L Q AFL S +++ NT + EG
Sbjct: 60 KGNIGLAPAYMGDIEALEIR---GTLYAQSGAFLASSETIEI-NTKFGGAKTFFS-REGL 114
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
KL+G+G F+ + G + +K L E +D +VA T ++ ++K G ++ +FG
Sbjct: 115 FLLKLSGEGTVFLSSFGGIYKKELR-NERFIIDTGHLVAFTEGLDFRVKRVGGLKSTIFG 173
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV A G G ++IQ+
Sbjct: 174 GEGLV-AEFYGSGTLYIQT 191
>gi|403743391|ref|ZP_10953005.1| hypothetical protein URH17368_0291 [Alicyclobacillus hesperidum
URH17-3-68]
gi|403122914|gb|EJY57106.1| hypothetical protein URH17368_0291 [Alicyclobacillus hesperidum
URH17-3-68]
Length = 132
Score = 64.3 bits (155), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 1/120 (0%)
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
++ Q AFLC+ + + ++GG G + L G G F+ G VV+++L
Sbjct: 1 MIAQRHAFLCAEEHIDFGIEATLNIGRLLGG-NGLVFTYLDGDGTGFLSIDGEVVERDLA 59
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
GE + V I A TSS++ +++ + +FGGD + +TGPG V++ SL H L
Sbjct: 60 RGESLLVHPGHIAAFTSSIDYRVQVMQGVSNMLFGGDGVFLVQLTGPGRVWMHSLTVHNL 119
>gi|18976524|ref|NP_577881.1| hypothetical protein PF0152 [Pyrococcus furiosus DSM 3638]
gi|397652135|ref|YP_006492716.1| hypothetical protein PFC_07440 [Pyrococcus furiosus COM1]
gi|18892075|gb|AAL80276.1| hypothetical protein PF0152 [Pyrococcus furiosus DSM 3638]
gi|393189726|gb|AFN04424.1| hypothetical protein PFC_07440 [Pyrococcus furiosus COM1]
Length = 219
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 102/205 (49%), Gaps = 20/205 (9%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVL 87
+ ++ L E + A G+M +M ++E+ + + + G+ L G++ +
Sbjct: 12 SLLEVELDAGESIQAEAGAMVYMDPTVEI----VTKAKGGLLGALKRSILGGESFFVNIF 67
Query: 88 RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
+ PG VG+A I+ +++ G+L Q AF+ + D+ + L ++ VI
Sbjct: 68 KGPGK----VGLAPGYPGDIIGLEVN---GKLYAQSGAFIAASEDIDIDVKLGG-IKTVI 119
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
G EG +L G+G+AF+ + G + + +L GE + VD S +VA T ++ IK G +
Sbjct: 120 G-REGLFLLELRGRGIAFLSSYGGIQKISLR-GESLIVDTSHLVAFTEGIDFTIKTIGGL 177
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
+ +FGG+ V G G V+IQ+
Sbjct: 178 KSTLFGGEGFVFE-FRGTGDVYIQT 201
>gi|374985687|ref|YP_004961182.1| hypothetical protein SBI_02930 [Streptomyces bingchenggensis BCW-1]
gi|297156339|gb|ADI06051.1| hypothetical protein SBI_02930 [Streptomyces bingchenggensis BCW-1]
Length = 276
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/162 (32%), Positives = 75/162 (46%), Gaps = 14/162 (8%)
Query: 93 SDGFVGIAA--PSLARILPIDLAMFGGEL-LCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
+G VG A P R L +D GG + DAF+ + + V+ Q R G
Sbjct: 102 GEGTVGFAGVLPGEMRALELD----GGRAWFAEKDAFVAAESTVEFGIAF-QGGRTGRSG 156
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
EGF+ +K TG+G I G+ + N + G I VD CIVA + ++ G +
Sbjct: 157 GEGFILEKFTGRGTVIICGAGNFIDLNPADFGGRIEVDTGCIVAFEEGIQYGVQRIGGLN 216
Query: 209 R-----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
R AVFGG+ L A + G G V +QSL L+ + +A
Sbjct: 217 RQGLMNAVFGGEGLSLATLEGNGRVILQSLTMEGLANALKKA 258
>gi|13542236|ref|NP_111924.1| hypothetical protein TVN1405 [Thermoplasma volcanium GSS1]
Length = 219
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 54/228 (23%), Positives = 104/228 (45%), Gaps = 15/228 (6%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
++ ++I G E Q + L P+E V PG + + S+ + L G
Sbjct: 1 MVDYEIAGQEVQYLKAKLSPEEYVYIEPGHLIYKDASVRL-GASAGGLRGAFSHLLSGSA 59
Query: 82 ITSVVLRNPGP--SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
+ + + PG S GF+ P +++ IDL G ++ + +AFL + V S
Sbjct: 60 VFLLKVEGPGEIVSAGFL----P--GKVVRIDLN--GNSIIAEFNAFLAMDSTVSYSTKF 111
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+ + GG EG ++ +G G F+ G + K+L+ GE I ++S ++A +
Sbjct: 112 AGITQAIFGG-EGVFLERFSGNGSVFLHGHGLPIVKDLKPGEEIQAEMSHLLAFEDGMEY 170
Query: 200 QIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
I+ G ++ + G+ L A + GPG V++ ++ +L+ ++ R
Sbjct: 171 TIERIGGLKTMLLAGAEGEGLFFARIRGPGRVWLHTISLFQLAAKLMR 218
>gi|88602530|ref|YP_502708.1| hypothetical protein Mhun_1244 [Methanospirillum hungatei JF-1]
gi|88187992|gb|ABD40989.1| protein of unknown function DUF124 [Methanospirillum hungatei JF-1]
Length = 231
Score = 63.9 bits (154), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 57/210 (27%), Positives = 96/210 (45%), Gaps = 5/210 (2%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-MWQWLFGKTITSVVLRNPGP 92
+ ++ L P E V A GSM +M ++E+ + +G + L G V
Sbjct: 13 MLEVTLAPGESVTAEAGSMSWMQPTIEVHTRMRKKGLLGSIGMALIGGQSFFVNDFTATN 72
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
+ G G +A L + + +A G + Q A+L S DV + + + G +G
Sbjct: 73 ASGKAGFSAAPLGDVDTMQVAP-GKGYIIQKAAYLASEKDVNLDVEWQGFTKGLFG--QG 129
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
K++GQGL FI G++ + L GE + VD +VA + + Q+ G + +
Sbjct: 130 LFMIKVSGQGLLFINTFGAIDKHVLAPGERLIVDNFHLVAFSDTCQYQVTKFGGWKETLL 189
Query: 213 GGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G+ LVT + TGPG V+IQ+ ++ I
Sbjct: 190 SGEFLVTDI-TGPGEVYIQTKNIREMADWI 218
>gi|345010079|ref|YP_004812433.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
gi|344036428|gb|AEM82153.1| protein of unknown function DUF124 [Streptomyces violaceusniger Tu
4113]
Length = 279
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 50/161 (31%), Positives = 72/161 (44%), Gaps = 12/161 (7%)
Query: 93 SDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
+G VG A P R L +D A + DAF+ + + V+ GG
Sbjct: 105 GEGTVGFAGVLPGEMRALELDGAR---AWFAEKDAFVAAESTVEFGIAFQGGKTGRSGG- 160
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EGF+ +K TG G I G+ + N + G I VD CIVA + ++ G + R
Sbjct: 161 EGFILEKFTGHGTVIICGAGNFIDLNPADFGGRIEVDTGCIVAFEEGIRYGVERIGGLNR 220
Query: 210 -----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
AVFGG+ L A + G G V +QSL L+ + +A
Sbjct: 221 QGLMNAVFGGEGLSLATLEGNGRVILQSLTIEGLANALKKA 261
>gi|158319748|ref|YP_001512255.1| hypothetical protein Clos_0700 [Alkaliphilus oremlandii OhILAs]
gi|158139947|gb|ABW18259.1| protein of unknown function DUF124 [Alkaliphilus oremlandii OhILAs]
Length = 226
Score = 62.4 bits (150), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 61/215 (28%), Positives = 106/215 (49%), Gaps = 12/215 (5%)
Query: 23 IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-K 80
+ ++IL GEA + +I L+ +++ A +M MS ++++E G+ + L G K
Sbjct: 1 MKYEILYGEAFPIVKINLQKGDRIKAESDAMVSMSPTIDLEGKLEGGLLKGLGRMLAGEK 60
Query: 81 TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTL 139
+ N GP + + + P +D+ + G LC Q D FL + + + V +
Sbjct: 61 FFFQTLTANRGPGEVLLASSVPGGT----VDVELDGSYGLCVQKDGFLAASDSISVDTQM 116
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV-- 197
R + G EGF K+ G+G F+ + G++ NLE GE + +D S +VA +
Sbjct: 117 QNLTRGLFSG-EGFFILKIGGRGTVFLNSYGAIHPINLEAGEEVVIDNSHLVAWPDYMQY 175
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
N++ +G I +V G++LV GPG V IQ+
Sbjct: 176 NIEKASSGWI-SSVTSGESLVCR-FRGPGTVLIQT 208
>gi|428174781|gb|EKX43675.1| hypothetical protein GUITHDRAFT_110474 [Guillardia theta CCMP2712]
Length = 199
Score = 62.0 bits (149), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 46/79 (58%)
Query: 170 GSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVF 229
G ++ KNL GE +D C++AV+S+V+V I+ G FGG L TGPG++
Sbjct: 78 GQIMTKNLAPGEKYILDTDCMLAVSSTVDVSIRRAGGCAMMCFGGKGLFNTEYTGPGLII 137
Query: 230 IQSLPFHRLSQRIARAVTS 248
Q+LP +++ R+A+ S
Sbjct: 138 FQTLPAAKIAARLAQVTAS 156
>gi|57640117|ref|YP_182595.1| hypothetical protein TK0182 [Thermococcus kodakarensis KOD1]
gi|57158441|dbj|BAD84371.1| hypothetical protein, conserved, DUF124 family [Thermococcus
kodakarensis KOD1]
Length = 222
Score = 61.6 bits (148), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 96/206 (46%), Gaps = 19/206 (9%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVL 87
+ +I L+ E V A G+M MS ++++E + + G++ L G++
Sbjct: 12 SLLEIELERGEAVQAEAGAMVHMSPTIKLET----KAKGGVFGALKRSMLGGESFFVNTF 67
Query: 88 R-NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R GP G +G+A + I +L G L Q AFL S D+ +
Sbjct: 68 RAEDGP--GSIGLAPAYMGDIEAFELD---GTLYAQSGAFLASYGDIDIDTKWGGA--KT 120
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
G EG K++G+G+ F+ + G++ +K L E +D +VA + ++ +K G
Sbjct: 121 FFGREGLFLLKMSGRGIVFLSSFGAIYKKELH-NERFIIDTGHMVAFSEGLDFNVKRVGG 179
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ +F G+ LV A G G ++IQ+
Sbjct: 180 LKSTLFSGEGLV-AEFYGTGTLYIQT 204
>gi|392529832|ref|ZP_10276969.1| hypothetical protein CmalA3_03933 [Carnobacterium maltaromaticum
ATCC 35586]
gi|414082972|ref|YP_006991678.1| hypothetical protein BN424_899 [Carnobacterium maltaromaticum
LMA28]
gi|412996554|emb|CCO10363.1| conserved hypothetical protein [Carnobacterium maltaromaticum
LMA28]
Length = 224
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 90/212 (42%), Gaps = 11/212 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL---RNP 90
+C+ L+ E +V+ G + G + E G + + G+ + L R
Sbjct: 14 ICE--LEKGEAMVSENGGRSWALGEITTET-----TSGGGMKKMLGRAFSGESLFLSRYE 66
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
S+ + A+ I+ +L G ++ Q AFL + V++S ++ G
Sbjct: 67 AQSNATIAFASSFPGCIIAKELQA-GESVIAQKTAFLAATEGVELSTFFQKKGTAGFFGG 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
EGF+ QKL G G F GS +L GE + D + + + ++I ++
Sbjct: 126 EGFIMQKLQGPGTVFFEIDGSTKTYDLAAGERLVCDTGLVALMDETCKLEIVAVKGLKNK 185
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ GG+ VVTGPG V +Q++ +Q I
Sbjct: 186 LLGGEGFFDTVVTGPGKVTVQTMTLGGFAQSI 217
>gi|429216548|ref|YP_007174538.1| hypothetical protein Calag_0310 [Caldisphaera lagunensis DSM 15908]
gi|429133077|gb|AFZ70089.1| TIGR00266 family protein [Caldisphaera lagunensis DSM 15908]
Length = 220
Score = 61.6 bits (148), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 99/213 (46%), Gaps = 14/213 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ ++ +G E Q + L+P E++ + + S + + VG + + I
Sbjct: 1 MKYETIGSEFQYLLVDLQPGEEIYVDGSHVIYKDKS-----VTLNAKMVGGFLSALKREI 55
Query: 83 TSV---VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
T +L GP G + +++P +++ IDL G +L + +FL S N VK + L
Sbjct: 56 TGTTFFLLEAIGP--GKIALSSPLAGKVVKIDLGNRG--ILVEHTSFLASENTVKYESKL 111
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+ GG EG KL G+G F+ A G V + L E + V+ + ++A +N
Sbjct: 112 GSLTAGLFGG-EGIFLAKLEGKGSVFLHAVGQVNELYLNNQE-LQVEAAHLLAFEEGLNY 169
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
I G ++ +FG + L ++G G V+++S
Sbjct: 170 GITRVGNLKTMLFGDEGLFFVNLSGTGKVWVKS 202
>gi|347523482|ref|YP_004781052.1| hypothetical protein Pyrfu_0931 [Pyrolobus fumarii 1A]
gi|343460364|gb|AEM38800.1| protein of unknown function DUF124 [Pyrolobus fumarii 1A]
Length = 236
Score = 61.2 bits (147), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 105/214 (49%), Gaps = 14/214 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN---P 90
V ++ L+ E + A PG+M + G + +E + + LFG S L
Sbjct: 25 VLKVRLEAGESIWAEPGAMMLLRGPVRVETRAYGGVLTALKRALFGG--ESFFLNRYVAE 82
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GP++ ++ P I ID+ GG + Q ++L D+ V + + +R +I
Sbjct: 83 GPAEVWLVPGTP--GDIATIDM-QGGGSWIVQDTSYLAHSGDINV-DVAWRGLRGLIAEG 138
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
E + KL G+GLA+I + G++ + ++ GE + +D VA+++ ++ G ++
Sbjct: 139 E-IVWLKLEGRGLAWISSYGAIDKIDVPPGEKVVIDNFHFVAMSADAKWSVRKFGDLKSF 197
Query: 211 VFGGDNLVTAVVTGPGIVFIQS--LP-FHRLSQR 241
+FGG+ V + GP V++QS LP F RL Q+
Sbjct: 198 IFGGEGFVIE-IEGPATVYVQSRMLPLFARLLQK 230
>gi|386843380|ref|YP_006248438.1| hypothetical protein SHJG_7298 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|374103681|gb|AEY92565.1| hypothetical protein SHJG_7298 [Streptomyces hygroscopicus subsp.
jinggangensis 5008]
gi|451796671|gb|AGF66720.1| hypothetical protein SHJGH_7058 [Streptomyces hygroscopicus subsp.
jinggangensis TL01]
Length = 260
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 71/161 (44%), Gaps = 12/161 (7%)
Query: 93 SDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
+G VG A P R L +D + DAF+ + + V + GG
Sbjct: 86 GEGTVGFAGVLPGEMRALELDGTR---AWFAEKDAFVAAESTVDFGIAFAGGRTGMSGG- 141
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EGF+ +K TG G I G+ + N + G I VD C+VA + ++ G + R
Sbjct: 142 EGFVLEKFTGHGTVIIAGAGNFIDLNPADFGGRIEVDTGCVVAFEEGIEYGVQRVGGLNR 201
Query: 210 -----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
AVFGG+ L A + G G V +QSL L+ + +A
Sbjct: 202 QGLMNAVFGGEGLSLATLEGNGRVILQSLTIESLANALKKA 242
>gi|331087255|ref|ZP_08336325.1| hypothetical protein HMPREF0987_02628 [Lachnospiraceae bacterium
9_1_43BFAA]
gi|330408941|gb|EGG88402.1| hypothetical protein HMPREF0987_02628 [Lachnospiraceae bacterium
9_1_43BFAA]
Length = 194
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 92/198 (46%), Gaps = 12/198 (6%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT- 83
+QILG V L E ++ GSM +MS NI + + G FG+ +
Sbjct: 3 YQILGDTLPVVICRLDVGESMITEKGSMAWMS-----PNIRMDTSTNGGLGKAFGRMFSG 57
Query: 84 SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
+ +N + G G + A P + F G E++ Q FL + +++S
Sbjct: 58 EAIFQNIYTAQG--GAGEIAFASSFPGSIRAFHIRPGEEIVVQKGGFLAAEAGIQLSVHF 115
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ + + G EGF+ Q+++G+G+ F G V++ L+ G+ I +D + A+T+S +
Sbjct: 116 HKKLASGLFGGEGFILQRISGEGVMFAEFDGHVIEYELQPGQQIVIDTGHLAAMTASCQM 175
Query: 200 QIKYNGPIRRAVFGGDNL 217
I+ ++ F G L
Sbjct: 176 DIQTVPGLKNKFFRGRRL 193
>gi|218281715|ref|ZP_03488097.1| hypothetical protein EUBIFOR_00664 [Eubacterium biforme DSM 3989]
gi|218217222|gb|EEC90760.1| hypothetical protein EUBIFOR_00664 [Eubacterium biforme DSM 3989]
Length = 250
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 65/241 (26%), Positives = 109/241 (45%), Gaps = 19/241 (7%)
Query: 14 VYQSPQDAVIPFQILG-GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG 72
+ + D + ++I G + + +I L+ E + GSM +MS ++ +E + N+ G
Sbjct: 10 IEERKGDHTMDYKIKGDSDCPIVEINLQNNETIKIERGSMAYMS-NVTIEG-KMNSNKKG 67
Query: 73 MWQWL--FGKTITS----VVLRNPGPS-DGFVGIAAPSLARILPIDLAMFGGELLCQPDA 125
+ L G+++TS + G S DG +GIA +I+ +++ L A
Sbjct: 68 LGGVLGAIGRSLTSGESMFITEATGTSHDGLLGIAPSIPGKIVCLEVNSQNHYYLNNG-A 126
Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
FL N V + + GG GF GQG I A G +++ N++ IT+
Sbjct: 127 FLACDNSVSYEMKTQSVGKALFGGTGGFFVMHTFGQGDLLINAYGDIMEINVDDAHPITI 186
Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
D ++A S++ +IK I FG G+ LV V G G V+IQS H L+ +
Sbjct: 187 DNEHVIAWDDSLDYEIK----IASGTFGFTTGEGLVN-VFHGNGKVYIQSRNLHSLADAL 241
Query: 243 A 243
Sbjct: 242 T 242
>gi|329940620|ref|ZP_08289901.1| hypothetical protein SGM_5393 [Streptomyces griseoaurantiacus M045]
gi|329300681|gb|EGG44578.1| hypothetical protein SGM_5393 [Streptomyces griseoaurantiacus M045]
Length = 272
Score = 61.2 bits (147), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 49/161 (30%), Positives = 74/161 (45%), Gaps = 12/161 (7%)
Query: 93 SDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
++G VG A P R L +D L + DAF+ + + V Q R + G
Sbjct: 98 AEGTVGFAGVLPGEMRALELDGNR---AWLAEKDAFVAAESTVDFGIAF-QGGRTGMSGG 153
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EGF+ +K TG+G I G+ + N + G I VD C+VA + ++ G + R
Sbjct: 154 EGFVLEKFTGRGTVIIAGAGNFIDLNPADFGGRIEVDTGCVVAFEEGIQYGVQRIGGLDR 213
Query: 210 -----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
AV GG+ L A + G G V +QSL L+ + +A
Sbjct: 214 QGLMNAVLGGEGLSLATLEGNGRVILQSLTIEGLANALKKA 254
>gi|330835083|ref|YP_004409811.1| hypothetical protein Mcup_1222 [Metallosphaera cuprina Ar-4]
gi|329567222|gb|AEB95327.1| conserved hypothetical protein [Metallosphaera cuprina Ar-4]
Length = 230
Score = 60.8 bits (146), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 57/214 (26%), Positives = 101/214 (47%), Gaps = 7/214 (3%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+ ILG + Q ++ L E+ A G + ++ M+ G+ + L G T
Sbjct: 4 YNILGDDLQYLRVNLAQGEQFYADAGHTVMKNVTVSMQTRMRGGLLGGLKRALTGGTF-- 61
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
V GP + + P +++PI L G +L + +FL + + V+ +TL R+
Sbjct: 62 FVTEFYGPGELILSGIFP--GKVVPISLE--GRPILAEAHSFLGAESTVQYDSTL-ARLT 116
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ G EG K +G G F+ A G +++K L GE I V+ S ++A +N ++
Sbjct: 117 AGLLGGEGLFLAKFSGVGNVFLHAYGGLIEKYLSPGETIQVEASHLMAFEQGMNYSVQLV 176
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
G +R +F + L +TGPG V++ +L +L
Sbjct: 177 GGLRSILFAHEGLFFVTITGPGKVWLHTLTAEQL 210
>gi|15643135|ref|NP_228178.1| hypothetical protein TM0367 [Thermotoga maritima MSB8]
gi|418046301|ref|ZP_12684395.1| protein of unknown function DUF124 [Thermotoga maritima MSB8]
gi|4980870|gb|AAD35454.1|AE001716_17 conserved hypothetical protein [Thermotoga maritima MSB8]
gi|351675854|gb|EHA59014.1| protein of unknown function DUF124 [Thermotoga maritima MSB8]
Length = 225
Score = 60.5 bits (145), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 61/228 (26%), Positives = 100/228 (43%), Gaps = 23/228 (10%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
+L G + ++ L E VV PG+M +M G +E++ G+W K + +
Sbjct: 6 VLKGSYALLKVFLSTGESVVVEPGAMVYMKGPIEVKT----SATGGVW-----KALKRAI 56
Query: 87 LRNPGPSDGFV------GIAAPSLARILP--IDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
L G + F+ G + LA LP ID+ L Q +FL V++ +
Sbjct: 57 L---GGENFFMNTYISRGESEIGLAPQLPGDIDVIPLKDILYVQSTSFLACDPSVEMDVS 113
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+++ G EG K G G + + G + L+ GE TVD +VA +V
Sbjct: 114 FGG-LKSFFSG-EGIFLLKFIGHGDVAVSSFGGIKSVELQPGEEFTVDTGHVVAFDGTVK 171
Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ G ++ +FGG+ LV TGPG V+IQ+ + + I V
Sbjct: 172 WNVRTFGGLKSTLFGGEGLV-CTFTGPGRVYIQTRNYPAFVEWIKSLV 218
>gi|255514015|gb|EET90279.1| protein of unknown function DUF124 [Candidatus Micrarchaeum
acidiphilum ARMAN-2]
Length = 257
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/229 (21%), Positives = 100/229 (43%), Gaps = 15/229 (6%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
+ + I+GG Q+ I LK +K+ G + S +M + G +L G
Sbjct: 1 MDYSIIGGSMQILNINLKTGDKIYVDGGKVASRSEDAQMNAKW-----AGGAGFLKGIEM 55
Query: 80 --KTITSVVLRNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
T+ +L G SD V ++ ++ I+L G + + AFL + + K+
Sbjct: 56 AFAGTTTFLLEVTGQSDTSTVSVSGDIPGKVKAIELKE-GESIYVEQMAFLATNDPSKID 114
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+ +R + G G +K G + F+ G +++ +L+ G + +D + + +
Sbjct: 115 --VKASLRGAMAGA-GLFLEKFNGPCMVFVHVAGDIIEYDLQDGYTMLIDPGHLATFSGN 171
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
+ + G + +FG +NL+ TGPG + + S+ RL+ + +A
Sbjct: 172 MQIHFAPVGDFKAEIFGHENLIMGKFTGPGKIIMHSVSRQRLAASLGKA 220
>gi|240102519|ref|YP_002958828.1| hypothetical protein TGAM_0462 [Thermococcus gammatolerans EJ3]
gi|239910073|gb|ACS32964.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
Length = 222
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 53/205 (25%), Positives = 94/205 (45%), Gaps = 17/205 (8%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVL 87
V ++ L E + A G+M +MS ++++E + + G++ L G++
Sbjct: 12 SVLEVELNTGESIQAESGAMVYMSPTIKLET----KAKGGIFGALKRSMLGGESFFINTF 67
Query: 88 RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
R G G VG+A P I +L G L Q AFL S ++ +
Sbjct: 68 RAEG-GPGTVGLAPPYPGDIEAFELD---GTLYAQSGAFLASSGEIDIDTKWGGA--KTF 121
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
G EG K++G+G F+ + G++V+K L E +D +VA + ++ +K G +
Sbjct: 122 FGREGLFLLKMSGRGTVFLSSFGAIVRKELH-NERFIIDTGHLVAFSEGLDFTVKRVGGL 180
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
+ + G+ LV A G G ++IQ+
Sbjct: 181 KSTLLSGEGLV-AEFYGTGTLYIQT 204
>gi|16081813|ref|NP_394204.1| hypothetical protein Ta0740 [Thermoplasma acidophilum DSM 1728]
gi|10640022|emb|CAC11874.1| conserved hypothetical protein [Thermoplasma acidophilum]
Length = 220
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 55/224 (24%), Positives = 107/224 (47%), Gaps = 11/224 (4%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
++ ++I G + Q + +L E V PG + + ++ GM + G +
Sbjct: 1 MVDYEIQGTDVQYLRAVLGSGESVYVEPGHLIYKDRQSRLD--VRAGGLKGMISHMLGGS 58
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+L+ GP + V A +I IDL GG ++ + +AFLC + ++ S
Sbjct: 59 AV-FLLKVDGPGE--VASAGFLPGKIHRIDLR--GGGIIAEFNAFLCMDSGIQYSTKF-A 112
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + G EG + +G G F+ G V++++L G+ I+V++S ++A +V +
Sbjct: 113 GIWQGLLGGEGLFLEHFSGTGSIFLHGHGQVIERDLGAGDEISVELSHVLAFDDTVQYNV 172
Query: 202 KYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ G +R + G G+ L A + GPG V+I S+ +L+ ++
Sbjct: 173 ERIGGLRTMILGGLEGEGLFFAHMRGPGRVWIHSISLFQLAAKL 216
>gi|345562047|gb|EGX45119.1| hypothetical protein AOL_s00173g220 [Arthrobotrys oligospora ATCC
24927]
Length = 359
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 90/188 (47%), Gaps = 11/188 (5%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M M+ ++ ++ +V + + + G + PG V +A SL
Sbjct: 156 ITAKPGAMVAMAATVTLKGSV----KVSLKKLISGGELAHSTFTGPGE----VILAPASL 207
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
I+PI L + DA+L + V V + Q + + + EG K+TGQG+
Sbjct: 208 GDIVPIRLDG-NQQWTMGKDAWLANTQGV-VRDYKSQGIGSAMFSGEGLFVYKITGQGVL 265
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
F+ + G+++QKNL E VD +VA ++ +G I A+ + LV TG
Sbjct: 266 FVTSLGAIIQKNLAPNEQYIVDNGHLVAWNCKYIIERIASGGIISALSSNEGLVCR-FTG 324
Query: 225 PGIVFIQS 232
PG VFIQ+
Sbjct: 325 PGTVFIQT 332
>gi|406864015|gb|EKD17061.1| DUF124 domain-containing protein [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 378
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 91/190 (47%), Gaps = 15/190 (7%)
Query: 45 VVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPS 103
++ARPG+M M+ ++ ++ N+ + M + + G +T PG + +A PS
Sbjct: 185 LIARPGAMISMTPTITLKGNV-----KFSMKKLITGGDMTHSTFTGPGE----LLLAPPS 235
Query: 104 LARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
L I I LA G E DAFL V V Q + + EG K++G G
Sbjct: 236 LGDITNIHLA--GSEQWSVGRDAFLACTQGV-VKEYKRQGIGKAMFSGEGLFVYKISGNG 292
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
L +I + G++++K+L GE VD +VA + ++ +G I G+ LV
Sbjct: 293 LLWITSLGAIIRKDLADGEKYIVDNGHLVAWNTKYILERVASGGIISGFSSGEGLVCK-F 351
Query: 223 TGPGIVFIQS 232
TGPG VF+Q+
Sbjct: 352 TGPGTVFMQT 361
>gi|170724824|ref|YP_001758850.1| hypothetical protein Swoo_0457 [Shewanella woodyi ATCC 51908]
gi|169810171|gb|ACA84755.1| protein of unknown function DUF124 [Shewanella woodyi ATCC 51908]
Length = 220
Score = 59.7 bits (143), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 67/241 (27%), Positives = 110/241 (45%), Gaps = 34/241 (14%)
Query: 26 QILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+I+G +A I L+P E V+A +M M +++ + G+ + FG
Sbjct: 4 KIMGSQAFSYIDIDLEPGETVIAESDAMSSMDADLDLTATFNGGFFKGLLKKFFGGETLF 63
Query: 85 VVLRNPGPSDGFVGIAAP---SLARILPIDL---AMFGGELLCQPDAFLCSVNDVKVSNT 138
+ + S AP +L + +P +L + G E QP A+L S VK
Sbjct: 64 ISRYSNNTS-------APKRVTLVQPVPGELRCHELNGDEFNLQPGAYLASTEGVK---- 112
Query: 139 LDQRVRNVIGGI--EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
L R + I EG + +TG+G + A G++++K ++ GE I VD S +VA
Sbjct: 113 LGLRWAGFVSFIAREGLFKLTVTGEGKVWYGAYGALLEKEID-GEYI-VDTSHLVAYEPG 170
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPK 256
+ ++++ G I + FGG+ LVT V G G + +QS R+V+ NPK
Sbjct: 171 IKLKLQLAGGIFSSFFGGEGLVTR-VEGKGKIIVQS-----------RSVSGLAGWLNPK 218
Query: 257 F 257
F
Sbjct: 219 F 219
>gi|296415383|ref|XP_002837369.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295633234|emb|CAZ81560.1| unnamed protein product [Tuber melanosporum]
Length = 358
Score = 59.3 bits (142), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 54/198 (27%), Positives = 91/198 (45%), Gaps = 14/198 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ML V A+PG+M M+ ++ ++ + + + L G + PG
Sbjct: 155 LMLAHGCPVTAKPGAMVAMTPTVTLKGSV----KFSLKKILAGGEMAQSTYTGPGE---- 206
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQ--PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
+ +A +L I+PI L G+ DAFL V V + Q + + EG
Sbjct: 207 LLLAPAALGDIVPIRLD---GQQTWSVGKDAFLACTQGV-VKDYKSQGLGKAMFSGEGLF 262
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
K++GQG+ F+ + G+++QKN+ GE VD +VA ++ +G I ++
Sbjct: 263 VYKISGQGVLFVTSLGAIIQKNMTQGEQFIVDNDHLVAWNCKYTIERVASGGIISSMSAA 322
Query: 215 DNLVTAVVTGPGIVFIQS 232
+ LV GPG VFIQ+
Sbjct: 323 EGLVCRFTVGPGTVFIQT 340
>gi|385803598|ref|YP_005839998.1| hypothetical protein Hqrw_2456 [Haloquadratum walsbyi C23]
gi|339729090|emb|CCC40311.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
Length = 240
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 94/206 (45%), Gaps = 10/206 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL-RNPGP 92
V ++ L+ E V + G+M S + +E E+E GM+ L + L RN
Sbjct: 13 VVEVTLERGETVTSEAGAMVSHSEEIRLETGIGDEDE-GMFSSLKDSVLGDESLFRNRFT 71
Query: 93 SDGFVGIAAPSLARILPID---LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
S+G G LA P D L + EL Q A++ DV + + L + G
Sbjct: 72 SEG--GQGTVELAPSTPGDVTALTLDHDELYLQGGAYIGCGTDVSIDSELGGLDTLLGGE 129
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
+ K G G+ FI + G + +K ++ E +TVD IVA + +N + + G ++
Sbjct: 130 GISLI--KAHGSGVLFIGSFGGIDRKEIQRDETLTVDSGHIVAWNADMNYETRRVGGLKE 187
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPF 235
V G+ LV TGPG+VF Q+ +
Sbjct: 188 TVLSGEGLVMD-FTGPGVVFTQTRNY 212
>gi|110668161|ref|YP_657972.1| hypothetical protein HQ2223A [Haloquadratum walsbyi DSM 16790]
gi|109625908|emb|CAJ52349.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
Length = 240
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 59/206 (28%), Positives = 94/206 (45%), Gaps = 10/206 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL-RNPGP 92
V ++ L+ E V + G+M S + +E E+E GM+ L + L RN
Sbjct: 13 VVEVTLERGETVTSEAGAMVSHSEEIRLETGIGDEDE-GMFSSLKDSVLGDESLFRNRFT 71
Query: 93 SDGFVGIAAPSLARILPID---LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
S+G G LA P D L + EL Q A++ DV + + L + G
Sbjct: 72 SEG--GQGTVELAPSTPGDVTALTLDHDELYLQGGAYIGCGTDVSIDSELGGLDTLLGGE 129
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
+ K G G+ FI + G + +K ++ E +TVD IVA + +N + + G ++
Sbjct: 130 GISLI--KAHGSGVLFIGSFGGIDRKEIQRDETLTVDSGHIVAWNADMNYETRRVGGLKE 187
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPF 235
V G+ LV TGPG+VF Q+ +
Sbjct: 188 TVLSGEGLVMD-FTGPGVVFTQTRNY 212
>gi|448410693|ref|ZP_21575398.1| hypothetical protein C475_13742 [Halosimplex carlsbadense 2-9-1]
gi|445671729|gb|ELZ24316.1| hypothetical protein C475_13742 [Halosimplex carlsbadense 2-9-1]
Length = 240
Score = 58.9 bits (141), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 56/221 (25%), Positives = 101/221 (45%), Gaps = 10/221 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL-RNPGP 92
V + L E++ A G+M + ++ ME I +++ G + + + L RN
Sbjct: 13 VASVELSSGEELTAEAGAMVSHTETIRME-TGIGDSDEGFLESVKDSVLGDESLFRNRFT 71
Query: 93 SDGFVGIAAPSLARILPIDLA---MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
++G GI +A P D+ + G E+ Q A+ + + V++ D + + G
Sbjct: 72 AEGGDGIV--QIAATTPGDMTARELDGDEVYVQSGAYEAAGSGVELDT--DVGGLDTLFG 127
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EG K G G F+ A G + ++ ++ GEV TVD +VA +S+ + G ++
Sbjct: 128 GEGLFLLKAAGTGPLFLSAFGGIQEREVDAGEVFTVDSGHVVAWDASMEYNTERVGGMKE 187
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+F + LV TGPG V +Q+ + +A + S N
Sbjct: 188 TLFSDEGLVMR-FTGPGTVLLQTRNYGEFVSDLASRLPSGN 227
>gi|170288364|ref|YP_001738602.1| hypothetical protein TRQ2_0565 [Thermotoga sp. RQ2]
gi|403252814|ref|ZP_10919119.1| hypothetical protein EMP_03520 [Thermotoga sp. EMP]
gi|170175867|gb|ACB08919.1| protein of unknown function DUF124 [Thermotoga sp. RQ2]
gi|402811576|gb|EJX26060.1| hypothetical protein EMP_03520 [Thermotoga sp. EMP]
Length = 225
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 99/228 (43%), Gaps = 23/228 (10%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
+L G + ++ L E VV PG+M +M G +E+ G+W K + +
Sbjct: 6 VLKGSYALLKVFLSIGESVVVEPGAMVYMKGPIEVNT----SATGGVW-----KALKRAI 56
Query: 87 LRNPGPSDGFV------GIAAPSLARILP--IDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
L G + F+ G + LA LP ID+ L Q +FL V++ +
Sbjct: 57 L---GGENFFMNTYISRGESEVGLAPQLPGDIDVIPLKDILYVQSTSFLACDPSVEMDVS 113
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+++ G EG K G G + + G + L+ GE TVD +VA +V
Sbjct: 114 FGG-LKSFFSG-EGIFLLKFVGHGDVAVSSFGGIKSIELQSGEEFTVDTGHVVAFDGTVK 171
Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ G ++ +FGG+ LV TGPG +++Q+ + + I V
Sbjct: 172 WNVRTFGGLKSTLFGGEGLV-CTFTGPGRIYVQTRNYPAFVEWIKSLV 218
>gi|428183259|gb|EKX52117.1| hypothetical protein GUITHDRAFT_161312 [Guillardia theta CCMP2712]
Length = 256
Score = 58.5 bits (140), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 53/227 (23%), Positives = 97/227 (42%), Gaps = 13/227 (5%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQWLF 78
+Q++G + Q +L+ ++V+ +PG+M +G++E PE G W
Sbjct: 35 WQVVGDDIQALLAVLQQGQEVMCQPGAMVSTTGTVE------PELTTGGCFNGCKRCWCA 88
Query: 79 GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV-NDVKVSN 137
G+ +N V ++ P ++I+P+ + L ++L SV D +
Sbjct: 89 GELCCRTHYKNVNGRLEHVTVSPPYPSKIIPVSMNEHTSGLYITRQSWLASVGRDADFNV 148
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
+ G L G G +F+ GG+V++K L GE + V + ++A V
Sbjct: 149 SCMPSFSQCCCLGLGCCIPSLVGSGYSFLAGGGTVMKKTLREGESVVVQTTALLAWDKKV 208
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
++ G GG TA TGPG +++QS+ F + + R
Sbjct: 209 KLEGACAGGPGVVCCGGMGFCTAKFTGPGTIYVQSMSFSKWAMSYMR 255
>gi|340923625|gb|EGS18528.1| hypothetical protein CTHT_0051310 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 475
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 51/188 (27%), Positives = 88/188 (46%), Gaps = 10/188 (5%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS S++++ Y ++ + ++L G + + PG V A L
Sbjct: 281 INAKPGAMLAMSPSVQLKGHY----KITVKKFLAGGDMGTSTFTGPGE----VLFAPHML 332
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
I + L DA+L S + V + Q + I EG K+TG GL
Sbjct: 333 GDITSLRLNGGNEAWSVSSDAYLASTQYI-VKDYKRQGLSKAIFSGEGLWVYKITGTGLV 391
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
++ + G++++K+L GE VD +VA + ++ +G + + G+ LV G
Sbjct: 392 WLTSFGAIIRKDLAEGEQYIVDNGHLVAWNTKYTIERVASGGLISGMAAGEGLVCK-FHG 450
Query: 225 PGIVFIQS 232
PG VFIQ+
Sbjct: 451 PGTVFIQT 458
>gi|306820186|ref|ZP_07453830.1| conserved hypothetical protein [Eubacterium yurii subsp.
margaretiae ATCC 43715]
gi|304551787|gb|EFM39734.1| conserved hypothetical protein [Eubacterium yurii subsp.
margaretiae ATCC 43715]
Length = 226
Score = 58.2 bits (139), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 62/218 (28%), Positives = 103/218 (47%), Gaps = 16/218 (7%)
Query: 36 QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDG 95
+ L E + A G+M M +++++ G+ + L G+ I +N S G
Sbjct: 15 ECYLSSGESIKAESGAMVGMDNTLDIDGKLEKGLMGGLTRMLAGEKI---FFQNIKASRG 71
Query: 96 F--VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
V +A +L +L L +L+ Q D +L S + V VS + + G EGF
Sbjct: 72 AGKVLLAPTTLGSVLDYQLDGMN-DLIVQKDGYLASEHSVDVSTKTQNLAQGLFSG-EGF 129
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY---NGPIRRA 210
K++G+G F+ + GS+ + +L+ G+ VD +VA +SS + +I+ +G I +
Sbjct: 130 FILKVSGRGTVFLNSYGSIHEIDLQAGQEYIVDNQHLVAWSSSTDYKIEKASSSGWI-SS 188
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+ G+ LV GPG VFIQ+ R AR V S
Sbjct: 189 LTSGEGLVCR-FRGPGKVFIQT----RNPSSFARWVNS 221
>gi|258514331|ref|YP_003190553.1| hypothetical protein Dtox_1044 [Desulfotomaculum acetoxidans DSM
771]
gi|257778036|gb|ACV61930.1| protein of unknown function DUF124 [Desulfotomaculum acetoxidans
DSM 771]
Length = 228
Score = 57.8 bits (138), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 93/203 (45%), Gaps = 11/203 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNP 90
+ ++ L+ E + A +M MS ++++E G+ + + G++ T R P
Sbjct: 13 LLEVELQHGESIKAESDAMVSMSDTLDVEGKMDGGIMGGLARMVAGESFFFQTLTAKRGP 72
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
G + + APS + ID+ + G L+ Q D F D++VS + + + G
Sbjct: 73 GKA-----LLAPSAPGDI-IDVELDGSYSLVVQKDGFFAGSRDLQVSAKMQNLAQGIFSG 126
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EGF K++GQG+ F+ + G + NLE GE + VD +VA ++ ++
Sbjct: 127 -EGFFVLKVSGQGIVFLSSYGGIHVINLEKGEEVVVDNGHLVAWPEYMSYTVEKASKGWI 185
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+ F + GPG V IQ+
Sbjct: 186 STFTSGEVAVCRFRGPGPVIIQT 208
>gi|326789156|ref|YP_004306977.1| hypothetical protein Clole_0017 [Clostridium lentocellum DSM 5427]
gi|326539920|gb|ADZ81779.1| protein of unknown function DUF124 [Clostridium lentocellum DSM
5427]
Length = 230
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 68/141 (48%), Gaps = 5/141 (3%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
DG V P I+ ++L G L Q D FL S V+VS VR + G EGF
Sbjct: 72 DGEVLFGHPLPGDIIDVELDGSYG-LRVQKDGFLASTEGVEVSTKAQNLVRGMFSG-EGF 129
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV--NVQIKYNGPIRRAV 211
++G+G F+ + GS+ NLE GE VD +VA + N++ N R++
Sbjct: 130 FILNVSGRGTVFLSSYGSIHAINLEPGEEYIVDNGHLVAWADYMQYNIEKASNNGWLRSI 189
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ LV GPG+V IQ+
Sbjct: 190 ASGECLVCR-FRGPGVVLIQT 209
>gi|449546595|gb|EMD37564.1| hypothetical protein CERSUDRAFT_94566 [Ceriporiopsis subvermispora
B]
Length = 315
Score = 57.4 bits (137), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 91/220 (41%), Gaps = 28/220 (12%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V I L+P + ARPG+M M S++++ M + G + PG
Sbjct: 110 VLYIRLQPGYEFKARPGAMVAMDPSVQIQGNLT----FSMKKLFTGGQLAQSRFTGPGE- 164
Query: 94 DGFVGIAAPSLARILPIDLA-----MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
V +A I+PI L FG A+L S D+ TLD + +N +G
Sbjct: 165 ---VLLAPQVWGDIIPIPLDGRTRWHFGKH------AWLASTRDI----TLDTKAQNSVG 211
Query: 149 GI----EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
G +G G+ F+ A GS++ + L GE V+ +VA T+ N++
Sbjct: 212 KALFSGRGLFVMTASGAGMIFVQAMGSIITRTLADGEQWIVENDHLVAWTAKYNIEFIQT 271
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G +V D TGPG V+IQ+ S+ +A
Sbjct: 272 GSFLSSV-KTDQGFVCRFTGPGTVYIQTRNPEAFSKWVAE 310
>gi|428183258|gb|EKX52116.1| hypothetical protein GUITHDRAFT_102018 [Guillardia theta CCMP2712]
Length = 247
Score = 57.4 bits (137), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 49/221 (22%), Positives = 92/221 (41%), Gaps = 33/221 (14%)
Query: 24 PFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT 83
P+QI+G ++Q+ L+P ++V+ +PGSM ++ N G ++ + I
Sbjct: 41 PWQIVGSDSQLIVFKLEPGQEVMMQPGSMVHADEGVKPSTSL--RNPFG--AFMASEDIF 96
Query: 84 SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
+ +N + F+ + ++++PIDL+++ G + Q +FL + T R
Sbjct: 97 RLKWKNESSTVKFLSATSMFPSKVIPIDLSVYSG-ITIQKGSFLATTGSELRFETFFPRS 155
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
G L+ KL G G+ F+ G+++ K L GE +D
Sbjct: 156 LTECLGQGNLLQTKLFGNGICFLCGSGTIITKTLAAGEKYILDT---------------- 199
Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
L TGPG++ Q+LP ++ A+
Sbjct: 200 ------------GLFNTEYTGPGLIVFQTLPACKIGALFAQ 228
>gi|448605699|ref|ZP_21658325.1| hypothetical protein C441_10061 [Haloferax sulfurifontis ATCC
BAA-897]
gi|448622567|ref|ZP_21669261.1| hypothetical protein C438_09512 [Haloferax denitrificans ATCC
35960]
gi|445741725|gb|ELZ93224.1| hypothetical protein C441_10061 [Haloferax sulfurifontis ATCC
BAA-897]
gi|445754649|gb|EMA06054.1| hypothetical protein C438_09512 [Haloferax denitrificans ATCC
35960]
Length = 225
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 64/133 (48%), Gaps = 6/133 (4%)
Query: 103 SLARILPIDLAMFG---GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
SLA LP D+ G L Q ++L S + + + G EG KLT
Sbjct: 76 SLAPPLPGDIVHHGLADETLYVQSGSYLASDPALDLDTSFGGA--KTFFGSEGLFLLKLT 133
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
G G +F+ + G++ + L+ GE TVD IVA + + ++ G ++ +F G+ LV
Sbjct: 134 GTGDSFLSSYGAIHEVELDDGERYTVDTGHIVAFDETTDFSVERVGGLKSTLFSGEGLV- 192
Query: 220 AVVTGPGIVFIQS 232
TGPG V+ QS
Sbjct: 193 CTFTGPGTVWTQS 205
>gi|153004312|ref|YP_001378637.1| hypothetical protein Anae109_1446 [Anaeromyxobacter sp. Fw109-5]
gi|152027885|gb|ABS25653.1| protein of unknown function DUF124 [Anaeromyxobacter sp. Fw109-5]
Length = 246
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 60/233 (25%), Positives = 111/233 (47%), Gaps = 31/233 (13%)
Query: 21 AVIPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMEN-----IYIPENEVGMW 74
+ + +Q++GG A ++ L P + + A G+M MSG++++++ + V
Sbjct: 9 SALRYQLVGGSAFAAARVELAPDQAIRAEAGAMVSMSGNVDLQSQLQGGLLGALKRVVSR 68
Query: 75 QWLFGKTITS------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
+ LF T T+ V+L P P D ++ ++LA G LL Q ++L
Sbjct: 69 ESLFVSTFTAQGGAGEVILAPPVPGD------------LVGVELA--GRALLVQSSSWLA 114
Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
S +V++ +T R + G EG + +GQG + + G++V++ L G VD
Sbjct: 115 SEPEVQL-DTEFAGFRGLFAG-EGLFFIRASGQGTVLLSSFGAIVRRPLPAGARYVVDTG 172
Query: 189 CIVAVTSSVNVQIKYNG--PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
+VA + + Q++ R++ G+ LV A TGPG V++Q+ L+
Sbjct: 173 HVVAFDARMPFQVRKASRRGWMRSLLSGEALV-AEFTGPGEVYLQTRNLRSLA 224
>gi|148269688|ref|YP_001244148.1| hypothetical protein Tpet_0551 [Thermotoga petrophila RKU-1]
gi|147735232|gb|ABQ46572.1| protein of unknown function DUF124 [Thermotoga petrophila RKU-1]
Length = 225
Score = 57.0 bits (136), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 99/228 (43%), Gaps = 23/228 (10%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
+L G + ++ L E VV PG+M +M G +E+ G+W K + +
Sbjct: 6 VLKGSYALLKVFLSIGESVVVEPGAMVYMKGPIEVNT----SATGGVW-----KALKRAI 56
Query: 87 LRNPGPSDGFV------GIAAPSLARILP--IDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
L G + F+ G + LA LP ID+ L Q +FL V++ +
Sbjct: 57 L---GGENFFMNTYISRGESEVGLAPQLPGDIDVIPLKDVLYVQSTSFLACDPSVEMDVS 113
Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+++ G EG K G G + + G + L+ GE TVD +VA +V
Sbjct: 114 FGG-LKSFFSG-EGIFLLKFFGYGDVAVSSFGGIKSIELKPGEEFTVDTGHVVAFDGTVK 171
Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ G ++ +FGG+ LV TGPG +++Q+ + + I V
Sbjct: 172 WNVRTFGGLKSTLFGGEGLV-CTFTGPGRIYVQTRNYPAFVEWIKSLV 218
>gi|389748369|gb|EIM89546.1| DUF124-domain-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 258
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 92/197 (46%), Gaps = 8/197 (4%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ L+P+ ++ +PG+M M +M+++ ++ G+ + + PG
Sbjct: 49 VTLQPEAQIKVKPGAMLAMDPTMQIQGKVPHMQHHSFSKFFSGEEASESIFTGPGE---- 104
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
V IA I+PI L + + AFL S + + V Q + EGF
Sbjct: 105 VLIAPEVWGDIVPITLDGQAPWSVSK-YAFLASTSGL-VRGVRSQGFAKGLFSGEGFFVS 162
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ-IKYNGPIRRAVFGGD 215
+++GQG ++ A G++VQ++L GE VD +VA ++ V+ IK G + + +
Sbjct: 163 EISGQGTLWVQAIGAIVQRHLRPGEEWIVDNGHLVAWSAQYKVETIKAKGGLLSKLHTDE 222
Query: 216 NLVTAVVTGPGIVFIQS 232
V TGPG V+IQ+
Sbjct: 223 GKV-CRFTGPGTVYIQT 238
>gi|332797638|ref|YP_004459138.1| hypothetical protein Ahos_1967 [Acidianus hospitalis W1]
gi|332695373|gb|AEE94840.1| conserved hypothetical protein [Acidianus hospitalis W1]
Length = 242
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 110/244 (45%), Gaps = 17/244 (6%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLFGKTI 82
++ILG + Q ++ L E + G + S S+++ Y + + + L G
Sbjct: 2 YRILGYDMQHVRVYLNEGEMIYGDGGHLVAKSPSVKIN--YAAQGGIIKSFERELTGSRF 59
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ + PG ++ ++ RI+ IDL G + + ++FL + V+ S TLD+
Sbjct: 60 FVMQVYGPGVAE----FSSFLPGRIVKIDLN--GNGIRVEHNSFLFAEQGVQYSATLDKI 113
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+GG EG L +GQG F+ A G V L+ GE I V+ ++A S+ I
Sbjct: 114 SVGWLGG-EGLLMAHFSGQGSVFVHALGGVSSFVLQQGEEIEVEEGHLLAFDDSMQYSIT 172
Query: 203 YNGPIRRAVFGGDNLVTAV---VTGPGIVFIQSLPFHRLSQRIARAV---TSPNMRENPK 256
G I+ +FG A +TGPG V++ ++ +L ++ V P ++P
Sbjct: 173 RAGGIKTMLFGHIEKEGAFFIKLTGPGRVWLHNVSLEQLIAKLNLMVGGNGGPAQPQDPS 232
Query: 257 FFVQ 260
+Q
Sbjct: 233 IDIQ 236
>gi|355571049|ref|ZP_09042319.1| protein of unknown function DUF124 [Methanolinea tarda NOBI-1]
gi|354826331|gb|EHF10547.1| protein of unknown function DUF124 [Methanolinea tarda NOBI-1]
Length = 231
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 53/213 (24%), Positives = 92/213 (43%), Gaps = 17/213 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSM-------EMENIYIPENEVGMWQWLFGKTITSVVLRN 89
+ L P E + A GSM +M ++ E + + Q F +
Sbjct: 16 VTLSPGESIAAESGSMTYMQPTLTVRTQKREKSLLGSLGLALAGGQSFFVNEFCATT--- 72
Query: 90 PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
+ G G +A L I + ++ G +L Q A++ S V + + + + G
Sbjct: 73 ---AAGEAGFSAAPLGDIGTLRVSPGKGCIL-QKSAYIASQGTVDLDVKWEGFSKGLFG- 127
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
+G + TG+G FI G++ Q L+ GE + VD +VA + + Q++ G I+
Sbjct: 128 -QGLFMIRATGEGTLFINTFGAIDQHTLKEGESLIVDNFHLVAFSDTCTYQVRKFGGIKE 186
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ G+ VT + GPG VFIQ+ L++ I
Sbjct: 187 TLLSGEGFVTE-IRGPGDVFIQTKNIRELAEWI 218
>gi|164425315|ref|XP_959188.2| hypothetical protein NCU04645 [Neurospora crassa OR74A]
gi|157070879|gb|EAA29952.2| predicted protein [Neurospora crassa OR74A]
Length = 445
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 51/187 (27%), Positives = 88/187 (47%), Gaps = 13/187 (6%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
A+PGSM MS SM ++ Y + G+ + + G + S PG + AP++
Sbjct: 254 AKPGSMIAMSPSMSLKGAY----KFGLKKLVAGGEMGSSTYTGPGEL-----LLAPAMLG 304
Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
+ L + G + D +L S +V + Q + + EG K++G GL +
Sbjct: 305 DI-TSLRLDGSQTWSVSHDGYLASTQNV-TKDYKRQGLGKAMFSGEGLWVYKISGTGLLW 362
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
+ + G++V+K+L GE VD +VA + ++ +G + G+ LV TGP
Sbjct: 363 LTSFGAIVRKDLADGEKYIVDNGHLVAWNTKYVLERVASGGLLSGFASGEGLV-CKFTGP 421
Query: 226 GIVFIQS 232
G +FIQ+
Sbjct: 422 GTIFIQT 428
>gi|448678514|ref|ZP_21689521.1| hypothetical protein C443_07778 [Haloarcula argentinensis DSM
12282]
gi|445772501|gb|EMA23546.1| hypothetical protein C443_07778 [Haloarcula argentinensis DSM
12282]
Length = 237
Score = 56.6 bits (135), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/227 (26%), Positives = 98/227 (43%), Gaps = 23/227 (10%)
Query: 36 QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSD 94
++ L E ++A PG+M S ++E+E + + + + G +++ + G S
Sbjct: 15 RVALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTAEGGS- 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G V +A P+ + +L GE L D AFL + D+ + + GGI+
Sbjct: 74 GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIKSM 122
Query: 154 LRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
L L G G I A G + L+ GE T+D +VA SVN G +
Sbjct: 123 LAGASITPLALKGSGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEDSVNFDAHRVGGL 182
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
+ + G+ LV TGPG V+ Q+ + IA A+ P +N
Sbjct: 183 KSTLLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDN 226
>gi|448330451|ref|ZP_21519733.1| hypothetical protein C489_14920 [Natrinema versiforme JCM 10478]
gi|445611832|gb|ELY65578.1| hypothetical protein C489_14920 [Natrinema versiforme JCM 10478]
Length = 224
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/221 (25%), Positives = 91/221 (41%), Gaps = 19/221 (8%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ L+P E V+A PG+M S ++ M+ + + + L G G
Sbjct: 16 VTLEPGESVIAEPGAMVGHSANVSMDTGTSRDGLLSSAKSLLGGESAFANEFTAEDEPGT 75
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
V A PS ++P +L E L D AFL + + + + IGG++ L
Sbjct: 76 VTFAPPSPGDVMPHELR---DETLYSTDGAFLAATAGIDIDSE--------IGGLKSMLG 124
Query: 156 QK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
+ L G G FI A G + + L GE +D ++A V+ I+ G ++
Sbjct: 125 EASVTPLALKGTGTVFIDAYGGLEKLELAAGESYVLDNEHLIAWDDQVDYSIRRVGNLKS 184
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+ G+ LV TGPG + Q+ L +A + S N
Sbjct: 185 TLLSGEGLVFE-FTGPGTAWYQTRDMDALVSVLAPRLPSDN 224
>gi|449546594|gb|EMD37563.1| hypothetical protein CERSUDRAFT_114200 [Ceriporiopsis subvermispora
B]
Length = 300
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/222 (26%), Positives = 98/222 (44%), Gaps = 21/222 (9%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V + L+P ++ A+PG+M M S++++ +G ++ G + PG
Sbjct: 95 VLYMRLQPGYEIKAKPGAMVAMDASVQIQGNLT----LGFKKFFTGGQMAESRFTGPGE- 149
Query: 94 DGFVGIAAPSLARILPIDLA-----MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
V +A I+PI L FG + +L S +V + NT Q + +
Sbjct: 150 ---VLLAPEVWGDIVPIHLDGRTQWHFGKHV------WLASTREVTL-NTKAQSLGKTLF 199
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G +G G+ F+ A GS++ + L GE VD +VA T++ +V+ G +
Sbjct: 200 SGRGLFVMTASGAGMLFVQALGSIITRTLANGEQWVVDNDHLVAWTANYSVERIQAGSLL 259
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+ + LV TGPG V+IQS L + IA + + N
Sbjct: 260 SSAKTDEGLV-CRFTGPGTVYIQSRNPEALCKWIAERIPTDN 300
>gi|392570466|gb|EIW63639.1| DUF124-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 213
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 58/216 (26%), Positives = 99/216 (45%), Gaps = 16/216 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ L+P ++ A+PGSM M +++++ + + M + G +T + PG
Sbjct: 4 LRLQPGYEIKAKPGSMVAMDATVQIKG----KLKFSMKKMFTGSDMTESLFTGPGE---- 55
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQ--PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
V +A I+PI L G+ AFL S V + + + G EG
Sbjct: 56 VLLAPEIWGDIVPIHLD---GQTPWNVGKHAFLASTPGVSRTAKSQGFGKALFSG-EGLF 111
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
Q++TGQG ++ + G++ Q+ L+ GE VD +VA T+ V+ G +
Sbjct: 112 VQQVTGQGTMWVTSLGAIFQRTLQPGEQWIVDNGHLVAWTAGYKVERINAGGLFSMSHTD 171
Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+ LV TGPG+++IQ+ L + I RA P+
Sbjct: 172 EGLVCR-FTGPGVIYIQTRNPETLGEWI-RAQVPPS 205
>gi|220916631|ref|YP_002491935.1| hypothetical protein A2cp1_1525 [Anaeromyxobacter dehalogenans
2CP-1]
gi|219954485|gb|ACL64869.1| protein of unknown function DUF124 [Anaeromyxobacter dehalogenans
2CP-1]
Length = 247
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 57/229 (24%), Positives = 118/229 (51%), Gaps = 21/229 (9%)
Query: 20 DAVIPFQILGGEAQVC-QIMLKPQEKVVARPGSMCFMSGSMEMEN-----IYIPENEVGM 73
+ + +Q++GG A C ++ L + ++A G+M MSG++++++ + +
Sbjct: 8 SSTLRYQLVGGNAFACARVELAAGQSILAEAGAMVSMSGNVDLQSRMQGGVMGALRRMVT 67
Query: 74 WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
+ +F T T++ GP++ V +A P ++ ++LA G LL Q ++L S +
Sbjct: 68 RESVFVSTFTAM----GGPAE--VLLAPPVPGDVIGLELA--GRTLLVQSSSWLASDPET 119
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
++ +T R + G EG +L+G+G A + + G++V++ + G VD +VA
Sbjct: 120 RI-DTEFAGFRGLFAG-EGLFFIRLSGRGTALLSSYGAIVRRPIPAGGRYVVDTGHVVAF 177
Query: 194 TSSVNVQIK---YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
+++ Q++ G + R++ G+ LV A GPG V++Q+ L+
Sbjct: 178 DAAMPYQVRKASRRGWL-RSIVSGEALV-AEFAGPGEVWLQTRNLQALA 224
>gi|448573289|ref|ZP_21640873.1| hypothetical protein C456_16242 [Haloferax lucentense DSM 14919]
gi|448597542|ref|ZP_21654467.1| hypothetical protein C452_08881 [Haloferax alexandrinus JCM 10717]
gi|445719054|gb|ELZ70737.1| hypothetical protein C456_16242 [Haloferax lucentense DSM 14919]
gi|445739003|gb|ELZ90512.1| hypothetical protein C452_08881 [Haloferax alexandrinus JCM 10717]
Length = 225
Score = 56.2 bits (134), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 64/133 (48%), Gaps = 6/133 (4%)
Query: 103 SLARILPIDLAMFGGE---LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
S A LP D+ G E L Q +++ S + + + G EG KLT
Sbjct: 76 SFAPPLPGDIVHHGLEDETLFVQSGSYIASDPALDLDTSFGGA--KTFFGSEGLFLLKLT 133
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
G G +F+ + G++ + L GE TVD IVA + + ++ G ++ +F G+ LV
Sbjct: 134 GTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLKSTLFSGEGLV- 192
Query: 220 AVVTGPGIVFIQS 232
TGPG V+IQS
Sbjct: 193 CTFTGPGTVWIQS 205
>gi|333984102|ref|YP_004513312.1| hypothetical protein [Methylomonas methanica MC09]
gi|333808143|gb|AEG00813.1| protein of unknown function DUF124 [Methylomonas methanica MC09]
Length = 226
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 95/206 (46%), Gaps = 22/206 (10%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW----------QWLFGKTITSVV 86
+ L+ E + A G+M MS ++++++ + E G+ + LF +T+T+V
Sbjct: 16 VHLERGEAIKAESGAMITMSETLDVDS----KMERGLLGGLSRKLLTGESLFFQTLTAV- 70
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
GP + + A P +L +D + E + Q D FL + V+VS + +
Sbjct: 71 ---RGPGEVLMASAYPGEIAVLELDGSE---EYILQKDGFLAAEQSVQVSTKAQNLTKGL 124
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
G EGF +++G+G I G + + L GE +D S +VA +++ N +I+
Sbjct: 125 FSG-EGFFVMRISGKGKLAISTYGGIQKLTLAPGETRIIDNSHMVAWSANANYKIEKAAK 183
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ F + GPG V+IQ+
Sbjct: 184 GWISSFTSGEGLVCRFHGPGDVYIQT 209
>gi|448579214|ref|ZP_21644491.1| hypothetical protein C455_16088 [Haloferax larsenii JCM 13917]
gi|445723893|gb|ELZ75529.1| hypothetical protein C455_16088 [Haloferax larsenii JCM 13917]
Length = 226
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 57/114 (50%), Gaps = 3/114 (2%)
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
L Q ++L S + + + R G EG KL G G AF+ + G++ + L+
Sbjct: 95 LYVQSGSYLASETTLDLDTSFGG-ARTFFGS-EGLFLLKLEGTGDAFLSSYGAIHEVELD 152
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
G+ TVD IVA + N ++ G ++ +F G+ LV TGPG V++QS
Sbjct: 153 EGQKYTVDTGHIVAFDGTTNFDVERVGGLKSTLFSGEGLV-CTFTGPGSVWLQS 205
>gi|448610962|ref|ZP_21661596.1| hypothetical protein C440_07362 [Haloferax mucosum ATCC BAA-1512]
gi|445743394|gb|ELZ94875.1| hypothetical protein C440_07362 [Haloferax mucosum ATCC BAA-1512]
Length = 226
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/114 (31%), Positives = 57/114 (50%), Gaps = 3/114 (2%)
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
L Q ++L S DV++ G EG KL G G +F+ + G++ + L+
Sbjct: 95 LFVQSGSYLAS--DVELDLDTSFGGAKTFFGSEGLFLLKLEGTGDSFLSSYGAIHEVELD 152
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
GE TVD IVA + + ++ G ++ +F G+ LV TGPG V++QS
Sbjct: 153 AGERYTVDTGHIVAFDGTTDFSVERVGGLKSTLFSGEGLV-CTFTGPGTVWLQS 205
>gi|448582557|ref|ZP_21646061.1| hypothetical protein C454_05742 [Haloferax gibbonsii ATCC 33959]
gi|445732205|gb|ELZ83788.1| hypothetical protein C454_05742 [Haloferax gibbonsii ATCC 33959]
Length = 225
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G EG KLTG G +F+ + G++ + L GE TVD IVA + + ++ G ++
Sbjct: 123 GSEGLFLLKLTGTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLK 182
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
+F G+ LV TGPG V+IQS
Sbjct: 183 STLFSGEGLV-CTFTGPGTVWIQS 205
>gi|292654579|ref|YP_003534476.1| hypothetical protein HVO_0402 [Haloferax volcanii DS2]
gi|448293417|ref|ZP_21483523.1| hypothetical protein C498_16688 [Haloferax volcanii DS2]
gi|291371176|gb|ADE03403.1| conserved hypothetical protein TIGR00266 [Haloferax volcanii DS2]
gi|445570471|gb|ELY25031.1| hypothetical protein C498_16688 [Haloferax volcanii DS2]
Length = 225
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G EG KLTG G +F+ + G++ + L GE TVD IVA + + ++ G ++
Sbjct: 123 GSEGLFLLKLTGTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLK 182
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
+F G+ LV TGPG V+IQS
Sbjct: 183 STLFSGEGLV-CTFTGPGTVWIQS 205
>gi|448543631|ref|ZP_21625185.1| hypothetical protein C460_10643 [Haloferax sp. ATCC BAA-646]
gi|448550723|ref|ZP_21629026.1| hypothetical protein C459_12020 [Haloferax sp. ATCC BAA-645]
gi|448558976|ref|ZP_21633297.1| hypothetical protein C458_15651 [Haloferax sp. ATCC BAA-644]
gi|445706354|gb|ELZ58237.1| hypothetical protein C460_10643 [Haloferax sp. ATCC BAA-646]
gi|445711228|gb|ELZ63022.1| hypothetical protein C459_12020 [Haloferax sp. ATCC BAA-645]
gi|445711817|gb|ELZ63605.1| hypothetical protein C458_15651 [Haloferax sp. ATCC BAA-644]
Length = 225
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/84 (38%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G EG KLTG G +F+ + G++ + L GE TVD IVA + + ++ G ++
Sbjct: 123 GSEGLFLLKLTGTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLK 182
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
+F G+ LV TGPG V+IQS
Sbjct: 183 STLFSGEGLV-CTFTGPGTVWIQS 205
>gi|323453474|gb|EGB09345.1| hypothetical protein AURANDRAFT_13803, partial [Aureococcus
anophagefferens]
Length = 107
Score = 55.8 bits (133), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 4/96 (4%)
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G +G +RQ + G G+AF+ A G + K L GE + VD + +VA SV ++ G
Sbjct: 1 GGQGLIRQVIKGDGMAFMGAMGVITTKTLAAGETLQVDTTSVVAWEDSVKFAVRRTGSFF 60
Query: 209 RAVFGGDN----LVTAVVTGPGIVFIQSLPFHRLSQ 240
G+ + + GPG+VF+QS + Q
Sbjct: 61 TCCCAGEGTFQRMFNTTLEGPGLVFLQSYSHGKFKQ 96
>gi|197121839|ref|YP_002133790.1| hypothetical protein AnaeK_1430 [Anaeromyxobacter sp. K]
gi|196171688|gb|ACG72661.1| protein of unknown function DUF124 [Anaeromyxobacter sp. K]
Length = 247
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 111/230 (48%), Gaps = 23/230 (10%)
Query: 20 DAVIPFQILGGEAQVC-QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
+ + +Q++GG A C ++ L + ++A G+M MSG++++++ M +
Sbjct: 8 SSTLRYQLVGGNAFACARVELAAGQSILAEAGAMVSMSGNVDLQSR--------MQGGVM 59
Query: 79 GKTITSVVLRNPGPSDGFVGIAAPSLARILP------IDLAMFGGELLCQPDAFLCSVND 132
G + +V R F + P+ + P I L + G LL Q ++L S +
Sbjct: 60 G-ALRRMVTRESVFVSTFTAMGGPAEVLLAPPVPGDVIGLELEGRTLLVQSSSWLASDPE 118
Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
++ +T R + G EG +L+G+G A + + G++V++ + G VD +VA
Sbjct: 119 TRI-DTEFAGFRGLFAG-EGLFFIRLSGRGTALLSSYGAIVRRPIPAGGRYVVDTGHVVA 176
Query: 193 VTSSVNVQIK---YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
+++ Q++ G + R++ G+ LV A GPG V++Q+ L+
Sbjct: 177 FDAAMPYQVRKASRRGWL-RSIVSGEALV-AEFAGPGEVWLQTRNLQALA 224
>gi|448589550|ref|ZP_21649709.1| hypothetical protein C453_04209 [Haloferax elongans ATCC BAA-1513]
gi|445735978|gb|ELZ87526.1| hypothetical protein C453_04209 [Haloferax elongans ATCC BAA-1513]
Length = 226
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G EG KL G G AF+ + G++ + L+ G+ TVD IVA + N ++ G ++
Sbjct: 123 GSEGLFLLKLEGTGDAFLSSYGAIHEVELDEGQKYTVDTGHIVAFDGTTNFDVERVGGLK 182
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
+F G+ LV TGPG V++QS
Sbjct: 183 STLFSGEGLV-CTFTGPGSVWLQS 205
>gi|159041055|ref|YP_001540307.1| hypothetical protein Cmaq_0472 [Caldivirga maquilingensis IC-167]
gi|157919890|gb|ABW01317.1| protein of unknown function DUF124 [Caldivirga maquilingensis
IC-167]
Length = 238
Score = 55.5 bits (132), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 45/211 (21%), Positives = 93/211 (44%), Gaps = 6/211 (2%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+ + ILG + Q+ ++++ E + + + + ++ + G+ + L G +
Sbjct: 1 MEYSILGNDLQLVRVVMGEGESIYGEGSHLLYKTPAVGLTTKASGGILAGLKRTLTGASF 60
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+ L PG + A + +I + L G ++ + +FL + N VK +L +
Sbjct: 61 FVLELTGPGEAA----FAGSTPGKITQVTLNE-GESIMAEHGSFLFAENSVKYDASLTRL 115
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
+ GG EG K TG G F+ G V L E + ++ ++A + + +
Sbjct: 116 SVGLFGG-EGLFLAKFTGLGNVFLHGTGHVHMIQLNESEELNIEAGHLLAFDAGMQYTVG 174
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
G +R + GG+ L + GPG V+++S+
Sbjct: 175 RVGGLRTMLLGGEGLFFVNIKGPGRVWVRSI 205
>gi|116179870|ref|XP_001219784.1| hypothetical protein CHGG_00563 [Chaetomium globosum CBS 148.51]
gi|88184860|gb|EAQ92328.1| hypothetical protein CHGG_00563 [Chaetomium globosum CBS 148.51]
Length = 425
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 89/189 (47%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
V ARPGSM MS S+ ++ + M + + G + S PG + +A P L
Sbjct: 232 VEARPGSMIAMSPSIVLKG----HIKFSMKKLVAGGDMNSSTFTGPGE----LLLAPPML 283
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I + L+ G E D FL +V + + Q + + EG K++G GL
Sbjct: 284 GDITSLRLS--GQEAWSVSHDGFLACTQNV-IKDYKRQGLGKAMFSGEGLWVYKISGTGL 340
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
++ + G++++K+L GE VD +VA + ++ +G + + G+ LV T
Sbjct: 341 LWLTSFGAIIRKDLMDGEKYIVDNGHLVAWNTKYVLERVASGGLMSGMASGEGLVCK-FT 399
Query: 224 GPGIVFIQS 232
GPG VFIQ+
Sbjct: 400 GPGTVFIQT 408
>gi|86158855|ref|YP_465640.1| hypothetical protein Adeh_2433 [Anaeromyxobacter dehalogenans
2CP-C]
gi|85775366|gb|ABC82203.1| protein of unknown function DUF124 [Anaeromyxobacter dehalogenans
2CP-C]
Length = 247
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 111/229 (48%), Gaps = 23/229 (10%)
Query: 21 AVIPFQILGGEAQVC-QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG 79
+ + +Q++GG A C ++ L + ++A G+M MSG++++++ M + G
Sbjct: 9 STLRYQLVGGNAFACARVELAAGQSILAEAGAMVSMSGNVDLQSR--------MQGGVMG 60
Query: 80 KTITSVVLRNPGPSDGFVGIAAPSLARILP------IDLAMFGGELLCQPDAFLCSVNDV 133
+ +V R F + P+ + P I L + G LL Q ++L S +
Sbjct: 61 -ALRRMVTRESVFVSTFTAMGGPAEVLLAPPVPGDVIGLELEGRTLLVQSSSWLASDPET 119
Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
++ +T R + G EG +L+G+G A + + G++V++ + G VD +VA
Sbjct: 120 RI-DTEFAGFRGLFAG-EGLFFIRLSGRGTALLSSYGAIVRRPIPAGGRYVVDTGHVVAF 177
Query: 194 TSSVNVQIK---YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
+++ Q++ G + R++ G+ LV A GPG V++Q+ L+
Sbjct: 178 DAAMPYQVRKASRRGWL-RSIVSGEALV-AEFAGPGEVWLQTRNLQALA 224
>gi|171691685|ref|XP_001910767.1| hypothetical protein [Podospora anserina S mat+]
gi|170945791|emb|CAP72591.1| unnamed protein product [Podospora anserina S mat+]
Length = 399
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 87/187 (46%), Gaps = 13/187 (6%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
ARPG+M MS S++++ Y + M + + G ++ PG + +A P L
Sbjct: 207 ARPGAMIAMSPSIQLKGSY----KFSMKKLVAGGEMSQSHYTGPGE----LLLAPPMLGD 258
Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
I L + G E D +L S V + + Q + + EG K++G GL +
Sbjct: 259 I--TSLRLTGNESWSVSHDGYLASTQHV-IKDYKRQGLGKAMFSGEGLWVYKISGTGLLW 315
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
+ + G++++K+L GE VD +VA ++ +G I G+ LV TGP
Sbjct: 316 LTSFGAIIRKDLAEGEKYIVDNGHLVAWNVKYIMERVTSGGIISGFASGEGLVCK-FTGP 374
Query: 226 GIVFIQS 232
G VFIQ+
Sbjct: 375 GTVFIQT 381
>gi|395333196|gb|EJF65574.1| hypothetical protein DICSQDRAFT_79838 [Dichomitus squalens LYAD-421
SS1]
Length = 208
Score = 55.5 bits (132), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/214 (27%), Positives = 97/214 (45%), Gaps = 11/214 (5%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ L+P ++ A+PGSM M S++++ + +V + + G ++ + PG
Sbjct: 4 LRLQPGYQIKAKPGSMVAMDASVQIKG----KLKVSLKKIFTGGEMSESIFTGPGE---- 55
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
V +A ++PI L + + DAFL V S T Q + + EG
Sbjct: 56 VLLAPEIWGDVVPIRLDGSQNWRVGR-DAFLACTMGVTRS-TKSQGLGKALFSGEGLFVY 113
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
+ GQG+ ++ + G++ + L+ GE VD +VA T+ V+ G A +
Sbjct: 114 NIAGQGVMWVQSLGAITSRTLQPGEQWIVDNGHLVAWTAQYKVERIDAGGFFSASHTDEG 173
Query: 217 LVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
LV TGPG+V+IQS L + I V S N
Sbjct: 174 LVCR-FTGPGVVYIQSRNPETLGEWIREQVPSVN 206
>gi|402311312|ref|ZP_10830260.1| TIGR00266 family protein [Eubacterium sp. AS15]
gi|400365458|gb|EJP18510.1| TIGR00266 family protein [Eubacterium sp. AS15]
Length = 226
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 58/200 (29%), Positives = 98/200 (49%), Gaps = 14/200 (7%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF-- 96
L E + A G+M M ++++E G+ + L G+ I +N S G
Sbjct: 18 LAAGESIKAESGAMVGMDNTLDIEGKLEKGLMGGLTRMLAGEKI---FFQNITASRGAGK 74
Query: 97 VGIAAPSLARILPIDLAMFG-GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
V +A +L + ID + G +L+ Q D +L S + V VS + + G EGF
Sbjct: 75 VLLAPTTLGSV--IDYKLDGMNDLIVQKDGYLASEHGVDVSTKTQNLAQGLFSG-EGFFI 131
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY---NGPIRRAVF 212
K++G+G F+ + GS+ + +L+ G+ VD +VA TS+ + +I+ +G I +
Sbjct: 132 LKVSGRGTVFLNSYGSIHEIDLQAGQEYIVDNQHLVAWTSNTDYKIEKASSSGWI-SSFT 190
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
G+ LV GPG VF+Q+
Sbjct: 191 SGEGLVCR-FRGPGKVFMQT 209
>gi|448560743|ref|ZP_21634191.1| hypothetical protein C457_02205 [Haloferax prahovense DSM 18310]
gi|445722393|gb|ELZ74056.1| hypothetical protein C457_02205 [Haloferax prahovense DSM 18310]
Length = 225
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 1/84 (1%)
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G EG KLTG G +F+ + G++ + L GE TVD IVA + + ++ G ++
Sbjct: 123 GSEGLFLLKLTGTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLK 182
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
+F G+ LV TGPG V++QS
Sbjct: 183 STLFSGEGLV-CTFTGPGTVWMQS 205
>gi|365832830|ref|ZP_09374357.1| hypothetical protein HMPREF1021_03121 [Coprobacillus sp. 3_3_56FAA]
gi|365259960|gb|EHM89935.1| hypothetical protein HMPREF1021_03121 [Coprobacillus sp. 3_3_56FAA]
Length = 118
Score = 55.1 bits (131), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 55/102 (53%), Gaps = 1/102 (0%)
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
++ + + GG EGF+ K++G G F+ GS + +L G+ + +D + + ++
Sbjct: 7 SIKKMATGIFGG-EGFIMNKISGTGTVFLEIDGSAISYDLLPGQQMVIDTGYLAMMDATC 65
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
++I+ ++ + GG+ VVTGPG V +QS+P ++
Sbjct: 66 KMEIQSVKGVKNKLLGGEGFFNTVVTGPGKVVLQSMPISAVA 107
>gi|403722528|ref|ZP_10945084.1| hypothetical protein GORHZ_060_00470 [Gordonia rhizosphera NBRC
16068]
gi|403206628|dbj|GAB89415.1| hypothetical protein GORHZ_060_00470 [Gordonia rhizosphera NBRC
16068]
Length = 225
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 56/208 (26%), Positives = 92/208 (44%), Gaps = 10/208 (4%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
I + P V G+M G +++E G+ + L G++ V S G
Sbjct: 16 ITVPPGGSVRVEAGAMAMTRGDIQIETSTRGGFFKGLRRSLGGESF--FVNDFHSGSGGQ 73
Query: 97 VGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
VG+AA P L +D M LL Q +++ S V V + G EG +
Sbjct: 74 VGVAATLPGDMTSLSLDARM---PLLVQSGSWIASDPSVDVDSKWGGHKSFFSG--EGLI 128
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
+ TG G + + G++V L GE +T+D +VA +V +++ G + + GG
Sbjct: 129 LLRCTGAGGLLLSSYGAIVATELAAGETLTLDTGHVVAFDEAVQYKVRKAGSWKSTLLGG 188
Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ LVT GPG V++Q+ + L + I
Sbjct: 189 EGLVTD-FHGPGRVWLQTRSSNDLIEWI 215
>gi|83644432|ref|YP_432867.1| hypothetical protein HCH_01585 [Hahella chejuensis KCTC 2396]
gi|83632475|gb|ABC28442.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
Length = 220
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 64/237 (27%), Positives = 108/237 (45%), Gaps = 24/237 (10%)
Query: 26 QILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
+I GG A + L+P E++ A +M MS ++ME + + + G T
Sbjct: 4 EIKGGAAFGYVDVELEPGEQITAESDAMSSMSADLDMETRTNGSFFIAILRKFLGGE-TF 62
Query: 85 VVLRNPGPSDGFVGIAAPSLARILP--IDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQ 141
+ R ++G + +L + P + A E LC QP AFL + V++
Sbjct: 63 FINRFSNKTNG---VRKLTLVQPTPGEVRCAELNNETLCFQPGAFLAATEGVRLGVRW-A 118
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
++ I EG + ++G G + A G++++K ++ GE I VD S +VA +++ I
Sbjct: 119 GFKSFIA-KEGLFKLVVSGTGKVWYGAYGALLEKEID-GEYI-VDTSHLVAYDPGLSLHI 175
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFF 258
+ G + + FGG+ LVT V G G + IQ+ L+ I NPK F
Sbjct: 176 QLAGGLFSSFFGGEGLVTR-VEGKGKIIIQTRSISGLTDWI-----------NPKLF 220
>gi|315231067|ref|YP_004071503.1| hypothetical protein TERMP_01305 [Thermococcus barophilus MP]
gi|315184095|gb|ADT84280.1| hypothetical protein TERMP_01305 [Thermococcus barophilus MP]
Length = 222
Score = 55.1 bits (131), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 88/201 (43%), Gaps = 11/201 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ ++ L+ E V A G+M FMS S+++E + + L + +
Sbjct: 13 LVEVELERGEAVQAEAGAMVFMSPSIKIETKAKGGIFGALKRSLLAGESFFINIFKAERD 72
Query: 94 DGFVGIAAPSLARILPIDLAMF--GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
G VG A P P D+ F G L Q AFL S D+K+ E
Sbjct: 73 GGKVGFAPP-----YPGDIEAFEIDGTLYAQSGAFLASSADIKIDTKWGG--AKTFFARE 125
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G K++G+G+ F+ + G++ + L E +D +VA + ++ ++K G ++
Sbjct: 126 GLFLLKMSGRGVVFLSSYGAIQKVELN-NEHFIIDTGHLVAFSEGLDFRVKRVGGLKSTF 184
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ LV A G G ++IQ+
Sbjct: 185 LSGEGLV-AEFYGTGTLYIQT 204
>gi|344211296|ref|YP_004795616.1| hypothetical protein HAH_1010 [Haloarcula hispanica ATCC 33960]
gi|343782651|gb|AEM56628.1| conserved hypothetical protein [Haloarcula hispanica ATCC 33960]
Length = 237
Score = 54.7 bits (130), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 59/227 (25%), Positives = 98/227 (43%), Gaps = 23/227 (10%)
Query: 36 QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSD 94
++ L E ++A PG+M S ++E+E + + + + G +++ + G S
Sbjct: 15 RVALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTAQGGS- 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G + +A P+ + +L GE L D AFL + D+ + + GGI+
Sbjct: 74 GTLTLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIKSM 122
Query: 154 LRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
L L G G I A G + L+ GE T+D +VA SVN G +
Sbjct: 123 LAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVGGL 182
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
+ + G+ LV TGPG V+ Q+ + IA A+ P +N
Sbjct: 183 KSTLLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDN 226
>gi|448667307|ref|ZP_21685849.1| hypothetical protein C442_10226 [Haloarcula amylolytica JCM 13557]
gi|445770342|gb|EMA21406.1| hypothetical protein C442_10226 [Haloarcula amylolytica JCM 13557]
Length = 237
Score = 54.7 bits (130), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 59/229 (25%), Positives = 96/229 (41%), Gaps = 25/229 (10%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGP 92
++ L E ++A PG+M S ++E+E + + + + G + + GP
Sbjct: 14 VRVALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTAKGGP 73
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIE 151
G V +A P+ + +L GE L D AFL + D+ + + GGI+
Sbjct: 74 --GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIK 120
Query: 152 GFLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
L L G G I A G + L+ GE T+D +VA SVN G
Sbjct: 121 SMLAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVG 180
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
++ + G+ +V TGPG V+ Q+ + IA A+ P +N
Sbjct: 181 GLKSTLLSGEGIVMD-FTGPGTVWYQTRGLGSFTSAIADAL--PGTGDN 226
>gi|350635083|gb|EHA23445.1| hypothetical protein ASPNIDRAFT_173126 [Aspergillus niger ATCC
1015]
Length = 268
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 78/188 (41%), Gaps = 11/188 (5%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ ARPG M MS SM + G + L G I PG + APS+
Sbjct: 75 IQARPGVMIAMSPSMSLRGSL----SFGWMKALAGGNIARSTYTGPGEL-----LIAPSV 125
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
+ I E + DAFL S VK ++ V G EGF + G GL
Sbjct: 126 MGDITIIRLDGSNEWIVGRDAFLASTTGVKTEYKTQGLMKGVFSG-EGFFVWRFYGAGLV 184
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
++ + G++V+K L E V+ +VA ++ +G I G+ L TG
Sbjct: 185 WMQSFGAIVKKELADDETYYVNNGHLVAWNCRYKIERVASGGIISNWSAGEGLA-CRFTG 243
Query: 225 PGIVFIQS 232
PG V++Q+
Sbjct: 244 PGTVYMQT 251
>gi|353243514|emb|CCA75046.1| hypothetical protein PIIN_09031 [Piriformospora indica DSM 11827]
Length = 292
Score = 54.3 bits (129), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 56/218 (25%), Positives = 102/218 (46%), Gaps = 19/218 (8%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ QI L+ ++ +RPG+M MS S++++ ++ + + + G ++ PG
Sbjct: 85 ILQITLQQGYQIKSRPGAMVAMSASVQIKG----SMKISLSKMITGGEMSESKYTGPGEV 140
Query: 94 DGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
I AP + I+PI + DA+L +V S ++ ++ G EG
Sbjct: 141 -----ILAPEVWGDIVPIQI-QPNTSWSVGKDAYLACTANVTRSTKSQGFMKGLMSG-EG 193
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ-IKYNGPIRRAV 211
F ++ GQG+ F+ + G++V+++L GE VD +VA ++S ++ I+ G
Sbjct: 194 FFVARVEGQGVLFVQSLGAIVRRDLRPGEEWIVDNGHLVAWSASYTMERIQTTGG---GF 250
Query: 212 FGGDNLVTAVV---TGPGIVFIQSLPFHRLSQRIARAV 246
G + V GPG V++Q+ L Q IA V
Sbjct: 251 LSGSHTGEGAVCRFRGPGTVYLQTRNPQSLGQWIAAQV 288
>gi|254169050|ref|ZP_04875888.1| conserved hypothetical protein TIGR00266 [Aciduliprofundum boonei
T469]
gi|197622020|gb|EDY34597.1| conserved hypothetical protein TIGR00266 [Aciduliprofundum boonei
T469]
Length = 222
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 58/206 (28%), Positives = 97/206 (47%), Gaps = 23/206 (11%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL---RNPG 91
++ L P E++VA G+M +MS ++ +E + G + L K ++ L
Sbjct: 14 LKVYLSPGERIVAEAGAMVYMSPTIGVETT----TKGGFMKGLLRKMLSGQTLFMNTYYA 69
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV-IGG 149
S+G + APSL I+ I++ QP S + S L+ V+ G
Sbjct: 70 QSEGGYVVFAPSLPGDIIEIEV--------TQP--IYISDTNYLASTELEFGVKFTGFKG 119
Query: 150 I---EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
I G K++G G A+I A G + L GE + +D + A +S+N ++ G
Sbjct: 120 IFTPGGMFWFKVSGIGKAWIAAYGGIDMLQLGPGERLLIDNIHLAAFDASMNFTLRKFGK 179
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ +FGG+ ++T V GPG V+IQS
Sbjct: 180 LKSFLFGGEYILTEFV-GPGRVWIQS 204
>gi|152967256|ref|YP_001363040.1| hypothetical protein Krad_3312 [Kineococcus radiotolerans SRS30216]
gi|151361773|gb|ABS04776.1| protein of unknown function DUF124 [Kineococcus radiotolerans
SRS30216]
Length = 263
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 56/252 (22%), Positives = 97/252 (38%), Gaps = 25/252 (9%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGK 80
+ Q++G Q+ + L+P + + G F SG + ME + P G +
Sbjct: 2 TLDVQLVGSTTQMAVVTLRPGQTLYCEAGKFLFSSGDVVMETKLTAPGGGEGGGLGGLLR 61
Query: 81 TITSVVLRN-PGPSDGFVGIAAPSLA-------------RILPIDLAMFGGELLCQPDAF 126
R G S F P R L +D + + DAF
Sbjct: 62 GAAQAGKRVLAGESFAFQHFHTPGGDGLLGLAGVLPGEMRHLELDGST---TWFAEKDAF 118
Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITV 185
+ + V + ++GG EGF+ ++ TG+G I G + N + G + V
Sbjct: 119 VAAEAGVHFDIAFSGVGQGLMGG-EGFILERFTGRGSLLIAGAGDFIDINPADYGGTLRV 177
Query: 186 DVSCIVAVTSSVNVQIKYNGPIRR-----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
D C+VA + ++ G + R AV GG+ L A + G G V +QS+ ++
Sbjct: 178 DTGCVVAWDDRITYGVETVGRLNRQGIVSAVLGGEGLTLATLRGDGRVILQSVTIEAFAK 237
Query: 241 RIARAVTSPNMR 252
+ + P+ +
Sbjct: 238 ALVKNSAKPDQQ 249
>gi|358367770|dbj|GAA84388.1| DUF124 domain protein [Aspergillus kawachii IFO 4308]
Length = 359
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 53/188 (28%), Positives = 78/188 (41%), Gaps = 11/188 (5%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ ARPG M MS SM + G + L G I PG + APS+
Sbjct: 165 IQARPGVMIAMSPSMSLRGSL----SFGWMKALAGGNIARSTYTGPGEL-----LIAPSI 215
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
+ I E + DAFL S VK ++ V G EGF + G GL
Sbjct: 216 MGDITIIRLDGSNEWIVGRDAFLASTTGVKTEYKTQGLMKGVFSG-EGFFVWRFYGAGLV 274
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
++ + G++V+K L E V+ +VA ++ +G I G+ L TG
Sbjct: 275 WMQSFGAIVKKELADDETYYVNNGHLVAWNCRYKIERVASGGIISNWSAGEGLA-CRFTG 333
Query: 225 PGIVFIQS 232
PG V++Q+
Sbjct: 334 PGTVYMQT 341
>gi|218244961|ref|YP_002370332.1| hypothetical protein PCC8801_0070 [Cyanothece sp. PCC 8801]
gi|257057986|ref|YP_003135874.1| hypothetical protein Cyan8802_0068 [Cyanothece sp. PCC 8802]
gi|218165439|gb|ACK64176.1| protein of unknown function DUF124 [Cyanothece sp. PCC 8801]
gi|256588152|gb|ACU99038.1| protein of unknown function DUF124 [Cyanothece sp. PCC 8802]
Length = 225
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 97/207 (46%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
++ + ++ L E++VA+ GSM MSG S + G + L G+++ V
Sbjct: 10 DSAIARVTLNAGEELVAQAGSMVAMSGFINASTTLRQGKGGGIMGGFKRMLGGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P P +G V +A + IL + M G EL+ Q ++L S ++V + Q ++V
Sbjct: 70 FRSPIP-NGQVFLAPKLMGDILVYE--MEGKELIVQSGSYLASASNVDIDLGF-QGFKSV 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
G E +TG G + + G + + +EV VD IVA S++ I K N
Sbjct: 126 FSG-EAIFWLSITGYGSVILTSFGGIYE--VEVNGDYIVDTGHIVAFERSLDFSITKVNS 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
A FGG+ LV G G V+ Q+
Sbjct: 183 SWWGAFFGGEGLVCH-FHGYGKVYCQT 208
>gi|339500593|ref|YP_004698628.1| hypothetical protein Spica_1992 [Spirochaeta caldaria DSM 7334]
gi|338834942|gb|AEJ20120.1| protein of unknown function DUF124 [Spirochaeta caldaria DSM 7334]
Length = 226
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 52/213 (24%), Positives = 100/213 (46%), Gaps = 13/213 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V +++ + E+V A G+M MSG+ME++ + G + G G S
Sbjct: 13 VLRLIFEAGEQVRAEAGAMISMSGTMELKAKTSGKGVFGTLGAMMGGESLFASEYTAGTS 72
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
+ I APS + + L + + Q A+L +++S+ ++ ++ G EG
Sbjct: 73 GDEL-ILAPSYPGDI-LHLPLRDETIFAQSGAYLAGSTHLQLSS--QGSLKALVSG-EGL 127
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF- 212
QK++G G ++ + G++++K L G+ VD +VA +S++ I+ R +F
Sbjct: 128 FLQKISGSGDLWLASYGAIIEKQLNAGDEYVVDTGNMVAFDASLSYTIQ---TASRGLFS 184
Query: 213 ---GGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
G+ LV GPG +++Q+ L+Q +
Sbjct: 185 SMASGEGLV-CRFRGPGRLWLQTRSISALAQTL 216
>gi|345519252|ref|ZP_08798679.1| hypothetical protein BSFG_01154 [Bacteroides sp. 4_3_47FAA]
gi|254834698|gb|EET15007.1| hypothetical protein BSFG_01154 [Bacteroides sp. 4_3_47FAA]
Length = 218
Score = 54.3 bits (129), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 104/217 (47%), Gaps = 16/217 (7%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG--------MWQWL 77
+I+G E + ++ L E A GS+ + +++ + + + G + L
Sbjct: 4 KIIGYEFKTLEVTLASGESFYAERGSIVYTDEALQRDGEFNNDGGSGLGSILGGVVKSAL 63
Query: 78 FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
G++I + NP +D + + + S ++PI L G L+C+ ++ S + KV
Sbjct: 64 SGESILIIRFSNPTNTDKKL-VLSGSCCALVPIKLQ--GENLICRRGHYVASSD--KVHL 118
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
L+ ++ + GG+ F QK+ G F+ + GS ++K L GE I VD + IVA+
Sbjct: 119 NLNLNLQGLFGGVGLF--QKVEGNATIFLDSLGSPIEKILNHGESIEVDENHIVALHGFS 176
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
+ QI+ + V G+ L +TGPG +++ LP
Sbjct: 177 SSQIQAGWSVGN-VLRGEGLSLMKITGPGKLYLSPLP 212
>gi|392963280|ref|ZP_10328706.1| protein of unknown function DUF124 [Pelosinus fermentans DSM 17108]
gi|421056838|ref|ZP_15519755.1| protein of unknown function DUF124 [Pelosinus fermentans B4]
gi|421059742|ref|ZP_15522304.1| protein of unknown function DUF124 [Pelosinus fermentans B3]
gi|421065078|ref|ZP_15526881.1| protein of unknown function DUF124 [Pelosinus fermentans A12]
gi|421069341|ref|ZP_15530513.1| protein of unknown function DUF124 [Pelosinus fermentans A11]
gi|392438018|gb|EIW15880.1| protein of unknown function DUF124 [Pelosinus fermentans B4]
gi|392450361|gb|EIW27414.1| protein of unknown function DUF124 [Pelosinus fermentans A11]
gi|392451104|gb|EIW28098.1| protein of unknown function DUF124 [Pelosinus fermentans DSM 17108]
gi|392458387|gb|EIW34924.1| protein of unknown function DUF124 [Pelosinus fermentans B3]
gi|392459765|gb|EIW36144.1| protein of unknown function DUF124 [Pelosinus fermentans A12]
Length = 225
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 68/133 (51%), Gaps = 3/133 (2%)
Query: 101 APSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
APS+ I+ +DL L+ Q D FL ++VS+ + ++ + G EGF K++
Sbjct: 78 APSIPGDIIDVDLDG-SYSLVVQKDGFLAGSTGIEVSSKMQNLMQGMFSG-EGFFTLKVS 135
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
G+G F+ + G++ NLE GE +D + +VA ++ +I+ + F ++
Sbjct: 136 GKGTVFLNSYGAIHAINLEAGEERIIDNNHLVAWPDYMDYKIEKATSGWISSFTSGEMLV 195
Query: 220 AVVTGPGIVFIQS 232
GPG+V IQ+
Sbjct: 196 CRFKGPGVVLIQT 208
>gi|325092436|gb|EGC45746.1| DUF124 domain-containing protein [Ajellomyces capsulatus H88]
Length = 450
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 91/189 (48%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + M + + G ++S PG + AP++
Sbjct: 254 LTAKPGAMIAMSPTMTLKGSI----KFSMKKLIAGGEMSSSTYTGPGEL-----LLAPTM 304
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ + L + G E DAFL + V V + +Q + EG K++G GL
Sbjct: 305 LGDISV-LRLNGHETWSVGKDAFLAATQGV-VKDYKNQGLSKAFFSGEGLFVYKISGMGL 362
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +VA + ++ +G + + G+ LV T
Sbjct: 363 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWSCKYVMERVASGGLISGMTSGEGLVCK-FT 421
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 422 GPGSVYLQT 430
>gi|448651425|ref|ZP_21680494.1| hypothetical protein C435_05268 [Haloarcula californiae ATCC 33799]
gi|445770952|gb|EMA22010.1| hypothetical protein C435_05268 [Haloarcula californiae ATCC 33799]
Length = 237
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 62/239 (25%), Positives = 97/239 (40%), Gaps = 25/239 (10%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGP 92
++ L E ++A PG+M S ++E+E + + + + G + + GP
Sbjct: 14 VRVDLDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTARGGP 73
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIE 151
G V +A P+ + +L GE L D AFL S + + + GGI+
Sbjct: 74 --GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLASDPGIDIDSEF--------GGIK 120
Query: 152 GFLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
L L G G I A G + L+ GE T+D +VA SVN G
Sbjct: 121 SLLAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVG 180
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAIF 264
++ + G+ LV TGPG V+ Q+ + IA A+ P +N I F
Sbjct: 181 GLKSTLLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDNDGNSTGIDDF 236
>gi|113474372|ref|YP_720433.1| hypothetical protein Tery_0505 [Trichodesmium erythraeum IMS101]
gi|110165420|gb|ABG49960.1| protein of unknown function DUF124 [Trichodesmium erythraeum
IMS101]
Length = 233
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 97/215 (45%), Gaps = 28/215 (13%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ L P + ++A PG+M M G + M+ L G ++++ + G F
Sbjct: 29 VTLYPGDSIIAEPGAMTSMDGRLSMKTK------------LSGGFFSALLRKFLGGESLF 76
Query: 97 VGIAAPSLARILPIDLA-----------MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
V + ++ L + L + G E+ QP A++ S +K+
Sbjct: 77 VNVFENKTSQPLNVTLTQSIIGDIDCLELTGNEICFQPGAYIASSPKIKLGVRWAGFASW 136
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
+G EG + K++GQG F A G + ++++ GE I VD S +VA + ++ + I +G
Sbjct: 137 FMG--EGLFKLKVSGQGQVFFGAYGGLSKQDI-AGEFI-VDNSHLVAYSPNIKMSIGLSG 192
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
+ ++ G+ V + G G++++QS L Q
Sbjct: 193 GLISSLTSGEGFVNR-LKGNGVIYLQSRSIEGLVQ 226
>gi|302039464|ref|YP_003799786.1| hypothetical protein NIDE4194 [Candidatus Nitrospira defluvii]
gi|300607528|emb|CBK43861.1| conserved protein of unknown function DUF124 [Candidatus Nitrospira
defluvii]
Length = 227
Score = 53.9 bits (128), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 62/117 (52%), Gaps = 5/117 (4%)
Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
E + Q D FL VK+ + + R ++GG EGF K+ G G + + G++ + L
Sbjct: 96 EYMVQKDGFLAGAEGVKIESKMQSLTRGLLGG-EGFFILKIGGTGQLILNSFGAIHKIEL 154
Query: 178 EVGEVITVDVSCIVAVTSSV--NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ E VD S +VA TS+ N++ +G I ++ G+ LV GPG+V+IQS
Sbjct: 155 KPNEEYIVDNSHLVAWTSTTTYNIEKASSGWI-ASLTSGEGLVCR-FRGPGVVYIQS 209
>gi|381157051|ref|ZP_09866285.1| TIGR00266 family protein [Thiorhodovibrio sp. 970]
gi|380880914|gb|EIC23004.1| TIGR00266 family protein [Thiorhodovibrio sp. 970]
Length = 241
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 54/221 (24%), Positives = 104/221 (47%), Gaps = 12/221 (5%)
Query: 16 QSPQDAVIPFQIL-GGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW 74
Q+ QD + F++L G + ++ L+ + V A+ +M M +++++ G+
Sbjct: 9 QTLQD--LTFRVLYRGAFAMLEVHLRASQSVKAQSDAMVAMDATVDVDGKMEGGLLGGLG 66
Query: 75 QWLFGKTITSVVLR-NPGPSDGFVGIAAPSLARILPIDLAMFGGEL--LCQPDAFLCSVN 131
+ L G++ LR + GP + A P +LP++L G + + Q D FL +
Sbjct: 67 RMLSGESFFFQTLRASRGPGVAHLAPAQP--GDLLPVEL---DGSVPYVIQKDGFLAASE 121
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V +S T + + G EGF K G+G+ F+ + G+V + ++ GE + +D +V
Sbjct: 122 SVDISTTAQNLTKGLFSG-EGFFVLKARGRGMLFVESYGAVHEIDIPSGEEMIIDNGHLV 180
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
A ++ +++ + F + GPG V+IQS
Sbjct: 181 AWPETMQYKLEKASSGWISSFTSGEGLVCRFKGPGKVYIQS 221
>gi|421077167|ref|ZP_15538138.1| protein of unknown function DUF124 [Pelosinus fermentans JBW45]
gi|392524555|gb|EIW47710.1| protein of unknown function DUF124 [Pelosinus fermentans JBW45]
Length = 225
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 68/133 (51%), Gaps = 3/133 (2%)
Query: 101 APSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
APS+ I+ +DL L+ Q D FL ++VS+ + ++ + G EGF K++
Sbjct: 78 APSIPGDIIDVDLDG-SYSLVVQKDGFLAGSTGIEVSSKMQNLMQGMFSG-EGFFTLKVS 135
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
G+G F+ + G++ NLE GE +D + +VA ++ +I+ + F ++
Sbjct: 136 GKGTVFLNSYGAIHAINLEAGEERIIDNNHLVAWPDYMDYKIEKATSGWISSFTSGEMLV 195
Query: 220 AVVTGPGIVFIQS 232
GPG+V IQ+
Sbjct: 196 CRFKGPGVVLIQT 208
>gi|113474373|ref|YP_720434.1| hypothetical protein Tery_0506 [Trichodesmium erythraeum IMS101]
gi|110165421|gb|ABG49961.1| protein of unknown function DUF124 [Trichodesmium erythraeum
IMS101]
Length = 226
Score = 53.9 bits (128), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 60/207 (28%), Positives = 101/207 (48%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
+ + ++ L E++VA+ G M MSG S + G+ + L G+++ V
Sbjct: 10 DNTIAKVSLNRGEEIVAQAGCMVAMSGFINTSTTLRQGKGGGIMGGIKRMLGGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P P DG V +A + IL + M G EL+ Q ++L S N+V + Q ++++
Sbjct: 70 FRSPSP-DGQVFLAPKLMGDILVYE--MKGQELVVQASSYLASDNNVDLDLGF-QGIKSL 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
G E +TG G + + G + + +++ G+ I VD IVA SS++ +I K N
Sbjct: 126 FSG-ESIFWLSITGYGPVILTSFGGIYEIDVK-GDYI-VDTGHIVAFESSLDFRIDKANS 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
A FGG+ V G G V+ Q+
Sbjct: 183 SWLGAFFGGEGFVCR-FQGRGKVYCQT 208
>gi|320591779|gb|EFX04218.1| duf124 domain containing protein [Grosmannia clavigera kw1407]
Length = 411
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 86/188 (45%), Gaps = 11/188 (5%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS S+ ++ Y + M + + G + S P G + +A P L
Sbjct: 217 LSAKPGAMIAMSPSVTLKGAY----KFSMKKLVAGAEMGSSTYTGP----GELLLAPPML 268
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
I I L G + D FL V + Q + + EG K+ G GL
Sbjct: 269 GDITSIRLNGNNGWSVGH-DGFLACTQGV-TKDYKRQGLGKAMFSGEGLWVYKIGGIGLL 326
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
+I + G++++K+L GE VD +VA + N++ +G I + + L+ TG
Sbjct: 327 WISSFGAIIKKDLADGEKYIVDNGHLVAWNTKYNLERVASGGIISGLASNEGLICK-FTG 385
Query: 225 PGIVFIQS 232
PG V+IQ+
Sbjct: 386 PGTVYIQT 393
>gi|134298221|ref|YP_001111717.1| hypothetical protein Dred_0344 [Desulfotomaculum reducens MI-1]
gi|134050921|gb|ABO48892.1| protein of unknown function DUF124 [Desulfotomaculum reducens MI-1]
Length = 227
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 55/203 (27%), Positives = 91/203 (44%), Gaps = 11/203 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNP 90
+ Q+ L+ E V A +M MS ++++E G+ + + G++ T R
Sbjct: 13 LVQLNLEQGESVKAESDAMVAMSSTIDVEGKMDGGLLGGLARMVAGESFFFQTLTARRGA 72
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
G + AP++ + ID+ + G L+ Q D FL ++VS + + + G
Sbjct: 73 GEV-----LLAPAVPGDI-IDVELDGSYSLMVQKDGFLAGSAGIQVSTKMQNLSQGLFSG 126
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EGF K++GQG+ FI G + NLE G+ VD S +VA +N I+
Sbjct: 127 -EGFFVVKVSGQGIVFINTYGGIHVINLEPGQEFVVDNSHLVAWPEYMNFSIEKASSGWI 185
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+ F + GPG V IQ+
Sbjct: 186 STFTSGEVAVCKFRGPGPVIIQT 208
>gi|154279368|ref|XP_001540497.1| predicted protein [Ajellomyces capsulatus NAm1]
gi|150412440|gb|EDN07827.1| predicted protein [Ajellomyces capsulatus NAm1]
Length = 312
Score = 53.5 bits (127), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 92/189 (48%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + M + + G ++S PG + AP++
Sbjct: 116 LTAKPGAMIAMSPTMTLKGSI----KFSMKKLIAGGEMSSSTYTGPGEL-----LLAPTM 166
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ + L + G E DAFL + V V + +Q + + EG K++G GL
Sbjct: 167 LGDISV-LRLNGHETWSVGKDAFLAATQGV-VKDYKNQGLSKALFSGEGLFVYKISGMGL 224
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +VA + ++ +G + + G+ LV T
Sbjct: 225 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWSCKYVMERVASGGLISGMTSGEGLVCK-FT 283
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 284 GPGSVYLQT 292
>gi|126656109|ref|ZP_01727493.1| hypothetical protein CY0110_03464 [Cyanothece sp. CCY0110]
gi|126622389|gb|EAZ93095.1| hypothetical protein CY0110_03464 [Cyanothece sp. CCY0110]
Length = 225
Score = 53.5 bits (127), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 57/207 (27%), Positives = 99/207 (47%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
++ + +++L E++VA+ G+M MSG S + G + L G+++ V
Sbjct: 10 DSAIAKVILNAGEEIVAQAGAMVAMSGFINTSTTLRQGKGGGIMGGFKRMLGGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P ++G V +A + IL + M G EL+ Q ++L S NDV + Q ++++
Sbjct: 70 FRSP-TAEGEVFLAPRLMGDILVYE--MQGQELVVQAASYLASANDVDLDLGF-QGMKSL 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
G E +TG G + + G + + +EV VD IVA S++ +I K N
Sbjct: 126 FSG-ESIFWLSVTGYGPVILTSFGGIYE--IEVDGDYVVDTGHIVAFERSLDFRIDKANS 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
A FGG+ + G G V+ Q+
Sbjct: 183 SWLGAFFGGEGFICR-FHGKGKVYCQT 208
>gi|409044766|gb|EKM54247.1| hypothetical protein PHACADRAFT_257964 [Phanerochaete carnosa
HHB-10118-sp]
Length = 264
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 55/201 (27%), Positives = 93/201 (46%), Gaps = 21/201 (10%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP---GPS 93
+ L P ++ A+PG M M +++++ G + F + ITS L GP
Sbjct: 62 VRLPPGMELKAKPGCMVAMDATVKIK---------GKMKVSFKRLITSSDLSESTFTGP- 111
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQ--PDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
G V +A + I+PI L G + AFLC V + + G E
Sbjct: 112 -GEVLMAPETWGDIVPIHLD---GSTVWNFSKHAFLCCTQGVTRTYKTQSLGKTFFSG-E 166
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G ++ +G G+ F+ + G++VQ++L+ E VD +VA T+ +V+ G + +
Sbjct: 167 GLFVEQCSGAGVIFVSSIGAIVQRHLQPNEQWIVDNGHLVAWTAKYSVERIQAGSLLSSA 226
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
+ LV TGPGIV++Q+
Sbjct: 227 KTDEGLVCR-FTGPGIVYLQT 246
>gi|407916932|gb|EKG10260.1| hypothetical protein MPH_12642 [Macrophomina phaseolina MS6]
Length = 387
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 92/200 (46%), Gaps = 13/200 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ ARPG+M MS ++ ++ + + +++ G + + PG + +A PS+
Sbjct: 194 LTARPGAMFAMSPTITLKGTL----KFSLKKFVAGGELATSTYTGPGE----LLLAPPSM 245
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I I L+ G E+ D FL + V Q + + EG K++G G+
Sbjct: 246 GDITNIRLS--GQEMWSVSKDGFLACTQGI-VKEYKSQSLSKAMFSGEGLFVYKISGSGI 302
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
++ + G++++K+L+ E VD +VA ++ +G I + G+ LV T
Sbjct: 303 LWVTSLGAILRKDLQENEKYIVDNGHLVAWNCKYALERVASGGIISNMSSGEGLVCK-FT 361
Query: 224 GPGIVFIQSLPFHRLSQRIA 243
GPG +FIQ+ SQ I
Sbjct: 362 GPGTIFIQTRNPTAFSQWIG 381
Score = 38.1 bits (87), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 66/155 (42%), Gaps = 15/155 (9%)
Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
IL + LAM G L +P A + + TL ++ + G E TG G +
Sbjct: 183 ILTVQLAM-GCPLTARPGAMFAMSPTITLKGTLKFSLKKFVAGGE-LATSTYTGPGELLL 240
Query: 167 L--AGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
+ G + L E+ +V +A T + + K + +A+F G+ L ++G
Sbjct: 241 APPSMGDITNIRLSGQEMWSVSKDGFLACTQGIVKEYKSQS-LSKAMFSGEGLFVYKISG 299
Query: 225 PGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFV 259
GI+++ SL A+ +++EN K+ V
Sbjct: 300 SGILWVTSL----------GAILRKDLQENEKYIV 324
>gi|83766193|dbj|BAE56336.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 330
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 88/189 (46%), Gaps = 14/189 (7%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MSG + + + G+ + + G +TS + PG + +A P L
Sbjct: 142 IEAKPGAMVAMSGDISLRGTV----KFGLAKMVAGG-MTSSIYTGPGE----ILLAPPFL 192
Query: 105 ARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I I L M G E DAFL + V+ + V G EGF+ ++G GL
Sbjct: 193 GDI--IVLRMDGSERWKVGKDAFLAKTSGVEKDYKSQGMTKAVFSG-EGFIIYHMSGVGL 249
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
++ + G++++K++ G+ VD IVA ++ +G + A F +
Sbjct: 250 VWLQSFGAIIRKDIPEGKTYLVDNGYIVAWNCKYKIERAASGGLLSA-FSSSEGLACKFE 308
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 309 GPGTVYLQT 317
>gi|297619153|ref|YP_003707258.1| hypothetical protein Mvol_0626 [Methanococcus voltae A3]
gi|297378130|gb|ADI36285.1| protein of unknown function DUF124 [Methanococcus voltae A3]
Length = 220
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 91/191 (47%), Gaps = 11/191 (5%)
Query: 42 QEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAA 101
E ++ GSM +M S+E++ + + G++ L + + N G + +A
Sbjct: 22 NEGIITETGSMVYMEPSVEIKT----QAKGGLFGALKRAVVGESIFLNTFSGSGKLALA- 76
Query: 102 PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
PSL+ I G L Q A+L S ++++ +T ++ GG +G KLTGQ
Sbjct: 77 PSLSG--DIQYHPLNGTLYVQHGAYLASSPNIEI-DTKFGGAKSFFGG-KGLFLMKLTGQ 132
Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
G F+ + G++ L E + VD ++ T + ++ G ++ + GG+ LV
Sbjct: 133 GDLFLSSYGAIETIELN-NESLVVDNGNLIGFTDGLQFELTKVGGLKSTLLGGEGLVYR- 190
Query: 222 VTGPGIVFIQS 232
++G G V+IQ+
Sbjct: 191 LSGSGTVYIQT 201
>gi|358395742|gb|EHK45129.1| hypothetical protein TRIATDRAFT_299843 [Trichoderma atroviride IMI
206040]
Length = 340
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 93/200 (46%), Gaps = 13/200 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ I L + A+PG+M MS S+ ++ + + + + G ++S PG
Sbjct: 135 ILTIQLANGAPIHAKPGAMVAMSPSITLKG----HIKFSVKKIIAGGELSSSTFTGPGE- 189
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
+ +A SL + I L+ G E DA+L + + Q + I EG
Sbjct: 190 ---LILAPSSLGDVTSIRLS--GNETWSVGKDAYLAYTQGIN-KDYKRQGLGKAIFSGEG 243
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
K+TG GL ++ + G++++K+L GE VD +VA +S ++ +G + +
Sbjct: 244 LFVYKITGTGLLWVSSFGAIIRKDLAQGEKYIVDNGHLVAWNTSYVLERVASGGLISGLS 303
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
G+ LV TGPG VF+Q+
Sbjct: 304 SGEGLVCK-FTGPGTVFMQT 322
>gi|337283582|ref|YP_004623056.1| hypothetical protein PYCH_00710 [Pyrococcus yayanosii CH1]
gi|334899516|gb|AEH23784.1| hypothetical protein PYCH_00710 [Pyrococcus yayanosii CH1]
Length = 221
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 25/207 (12%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW--LFGKTITSVVLR-NPG 91
++ L+P E V A G+M +MS ++ +E +G + L G++ V R G
Sbjct: 14 LEVELQPGEAVQAEAGAMVYMSPTVRVET-KARGGLLGALKRSVLGGESFFINVFRAEDG 72
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
P G VG A I +++ G + Q AFL D+++ GG +
Sbjct: 73 P--GVVGFAPGYPGDIEALEI---NGTIYAQSGAFLAGSEDIQIDIKF--------GGAK 119
Query: 152 GFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
FL + +L+G+G+ F+ + G++ +L+ GE VD +VA T ++ +K G
Sbjct: 120 TFLGRESLFLLELSGRGIVFLASYGAIRPVHLK-GERFVVDTGHMVAFTEGLDFTVKSIG 178
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ +F G+ LV G G V+IQ+
Sbjct: 179 GLKSVLFSGEGLVME-FRGTGTVYIQT 204
>gi|317140662|ref|XP_001818338.2| hypothetical protein AOR_1_2448174 [Aspergillus oryzae RIB40]
Length = 326
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 88/189 (46%), Gaps = 14/189 (7%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MSG + + + G+ + + G +TS + PG + +A P L
Sbjct: 138 IEAKPGAMVAMSGDISLRGTV----KFGLAKMVAGG-MTSSIYTGPGE----ILLAPPFL 188
Query: 105 ARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I I L M G E DAFL + V+ + V G EGF+ ++G GL
Sbjct: 189 GDI--IVLRMDGSERWKVGKDAFLAKTSGVEKDYKSQGMTKAVFSG-EGFIIYHMSGVGL 245
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
++ + G++++K++ G+ VD IVA ++ +G + A F +
Sbjct: 246 VWLQSFGAIIRKDIPEGKTYLVDNGYIVAWNCKYKIERAASGGLLSA-FSSSEGLACKFE 304
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 305 GPGTVYLQT 313
>gi|225562500|gb|EEH10779.1| DUF124 domain-containing protein [Ajellomyces capsulatus G186AR]
Length = 327
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 91/189 (48%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + M + + G ++S PG + AP++
Sbjct: 131 LTAKPGAMIAMSPTMTLKGSI----KFSMKKLIAGGEMSSSTYTGPGEL-----LLAPTM 181
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ + L + G E DAFL + V V + +Q + EG K++G GL
Sbjct: 182 LGDISV-LRLNGHETWSVGKDAFLAATQGV-VKDYKNQGLSKAFFSGEGLFVYKISGMGL 239
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +VA + ++ +G + + G+ LV T
Sbjct: 240 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWSCKYVMERVASGGLISGMTSGEGLVCK-FT 298
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 299 GPGSVYLQT 307
>gi|11498234|ref|NP_069460.1| hypothetical protein AF0626 [Archaeoglobus fulgidus DSM 4304]
gi|2649993|gb|AAB90613.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
Length = 218
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 96/200 (48%), Gaps = 10/200 (5%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
+ ++ L+P E+V+A G+M +M +E++ G+ + L G + V R
Sbjct: 12 SMLELKLRPGEEVLAETGAMVYMK-DVELKTEMKGGLLGGLKRSLLGGE-SFFVNRFVSR 69
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
+G +G+A I+ I L G +L Q AFL S + ++V +T R G EG
Sbjct: 70 GEGVLGLAPTYQGDIIHIPLN---GTILAQSGAFLASSDGIEV-DTKWGGGRTFFAG-EG 124
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
K+ G G F+ + GS+ + L+V + VD IVA +++ ++ G ++ V
Sbjct: 125 LFLLKIEGTGDLFLSSFGSI--EELQVDGHLRVDTGHIVAFEPTLDYKVSKVGGLKATVL 182
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
G+ LV G G V+IQ+
Sbjct: 183 SGEGLVVD-FKGKGKVWIQT 201
>gi|170077981|ref|YP_001734619.1| hypothetical protein SYNPCC7002_A1367 [Synechococcus sp. PCC 7002]
gi|169885650|gb|ACA99363.1| conserved hypothetical protein TIGR00266 [Synechococcus sp. PCC
7002]
Length = 220
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 52/197 (26%), Positives = 92/197 (46%), Gaps = 8/197 (4%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTITSVVLRNPGPSDG 95
+ L+P E+V A G+M M G + M+ + + + LF G+++ V RN
Sbjct: 16 VTLEPGEQVTAESGAMVSMDGGILMKTEFSGGFFPAILRRLFGGESLFVNVYRNTSQRPQ 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
V + + I IDL+ G + QP A++ K+ G EG +
Sbjct: 76 TVILTQSMVGDIHRIDLSQ--GPICFQPGAYIAHTPGAKMGIRWAGFASWFAG--EGLFK 131
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
+ TG G F G +++K+++ GE I VD S +VA + + I+ +G + ++ G+
Sbjct: 132 LQFTGSGRVFYGCYGGIIEKHIK-GEFI-VDNSHLVAYDPGITMNIRMSGGLFGSLTSGE 189
Query: 216 NLVTAVVTGPGIVFIQS 232
LV + G G +++QS
Sbjct: 190 GLVNK-LKGRGRIYLQS 205
>gi|448630295|ref|ZP_21672950.1| hypothetical protein C437_09113 [Haloarcula vallismortis ATCC
29715]
gi|445756218|gb|EMA07593.1| hypothetical protein C437_09113 [Haloarcula vallismortis ATCC
29715]
Length = 237
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 96/224 (42%), Gaps = 23/224 (10%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSDGFV 97
L E ++A PG+M S ++E+E + + + + G +++ + G S G +
Sbjct: 18 LDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTARGGS-GTL 76
Query: 98 GIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
+A P+ + +L GE L D AFL + D+ + + GGI+ L
Sbjct: 77 TLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIKSLLAG 125
Query: 157 K------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
L G G I A G + L+ GE T+D +VA SVN G ++
Sbjct: 126 ASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVGGLKST 185
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
+ G+ LV TGPG V+ Q+ + IA A+ P +N
Sbjct: 186 LLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDN 226
>gi|319641092|ref|ZP_07995796.1| hypothetical protein HMPREF9011_01393 [Bacteroides sp. 3_1_40A]
gi|317387336|gb|EFV68211.1| hypothetical protein HMPREF9011_01393 [Bacteroides sp. 3_1_40A]
Length = 218
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 56/217 (25%), Positives = 104/217 (47%), Gaps = 16/217 (7%)
Query: 26 QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG--------MWQWL 77
+I+G E + ++ L E A GS+ + +++ + + + G + L
Sbjct: 4 KIIGYEFKTLEVTLASGESFYAERGSIVYTDEALQRDVEFNNDGGSGLGSILGGVVKSAL 63
Query: 78 FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
G++I + NP +D + + + S ++PI L G L+C+ ++ S + KV
Sbjct: 64 SGESILIIRFSNPTNTDKKL-VLSGSCCALVPIKLQ--GENLICRRGHYVASSD--KVHL 118
Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
L+ ++ + GG+ F QK+ G F+ + GS ++K L GE I VD + IVA+
Sbjct: 119 NLNLNLQGLFGGVGLF--QKVEGNATIFLDSLGSPIEKILNHGESIEVDENHIVALHGFS 176
Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
+ QI+ + V G+ L +TGPG +++ LP
Sbjct: 177 SSQIQAGWSVGN-VLRGEGLSLMKITGPGKLYLSPLP 212
>gi|389851747|ref|YP_006353981.1| hypothetical protein Py04_0304 [Pyrococcus sp. ST04]
gi|388249053|gb|AFK21906.1| hypothetical protein Py04_0304 [Pyrococcus sp. ST04]
Length = 218
Score = 53.1 bits (126), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/202 (25%), Positives = 95/202 (47%), Gaps = 14/202 (6%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW--LFGKTITSVVLRNP 90
+ +I L+ E + A G+M +M ++++E +G + L G++ + R P
Sbjct: 12 SLLEINLRDGEAIQAEAGAMVYMDPTIKLET-KARGGILGALKRSVLGGESFFVNIFRGP 70
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
G VG A I+ +++ G+L Q AF+ + ++ + + IG
Sbjct: 71 GR----VGFAPGYPGDIVAMEVN---GKLYAQSGAFIAASENIDIDVKFGG-AKTFIG-R 121
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
EG ++ G G F+ + G++ + +L E + VD +VA T +N IK G ++
Sbjct: 122 EGIFLLEMKGHGTVFLSSYGAIQKISLN-NESLIVDTGHMVAFTEGINFTIKRVGGLKST 180
Query: 211 VFGGDNLVTAVVTGPGIVFIQS 232
+F G+ LV +G G V+IQ+
Sbjct: 181 LFSGEGLVFE-FSGTGDVYIQT 201
>gi|448688943|ref|ZP_21694680.1| hypothetical protein C444_13187 [Haloarcula japonica DSM 6131]
gi|445778813|gb|EMA29755.1| hypothetical protein C444_13187 [Haloarcula japonica DSM 6131]
Length = 237
Score = 52.8 bits (125), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 59/228 (25%), Positives = 95/228 (41%), Gaps = 25/228 (10%)
Query: 36 QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGPS 93
+ L E ++A PG+M S ++E+E + + + + G + + GP
Sbjct: 15 RAALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTARGGP- 73
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
G V +A P+ + +L GE L D AFL + D+ + + GGI+
Sbjct: 74 -GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIKS 121
Query: 153 FLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
L L G G I A G + L+ GE T+D +VA SV+ G
Sbjct: 122 MLAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWDESVSFDAHRVGG 181
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
++ + G+ LV TGPG V+ Q+ + IA A+ P +N
Sbjct: 182 LKSTLLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDN 226
>gi|289596749|ref|YP_003483445.1| protein of unknown function DUF124 [Aciduliprofundum boonei T469]
gi|289534536|gb|ADD08883.1| protein of unknown function DUF124 [Aciduliprofundum boonei T469]
Length = 223
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 97/206 (47%), Gaps = 23/206 (11%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL---RNPG 91
++ L P E++VA G+M +MS ++ +E + G + + K ++ L
Sbjct: 15 LKVYLSPGERIVAEAGAMVYMSPTIGVETT----TKGGFMKGVLRKMLSGQTLFMNTYYA 70
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR-NVIGG 149
S+G + APSL I+ I++ QP S + S L+ V+ G
Sbjct: 71 QSEGGYVVFAPSLPGDIIKIEV--------TQP--IYISDTNYLASTELEFGVKFTGFKG 120
Query: 150 I---EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
I G K++G G A+I A G + L GE + +D + A +S+N ++ G
Sbjct: 121 IFTPGGMFWFKVSGIGKAWIAAYGGIDMLQLGPGERLLIDNIHLAAFDASMNFTLRKFGK 180
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ +FGG+ ++T V GPG V+IQS
Sbjct: 181 LKSFLFGGEYILTEFV-GPGRVWIQS 205
>gi|428214331|ref|YP_007087475.1| hypothetical protein Oscil6304_4017 [Oscillatoria acuminata PCC
6304]
gi|428002712|gb|AFY83555.1| TIGR00266 family protein [Oscillatoria acuminata PCC 6304]
Length = 220
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 90/205 (43%), Gaps = 8/205 (3%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGPSDG 95
+ L P E + A PG+ M + M+ + G+ + W G+++ V RNP
Sbjct: 16 VTLSPGESITAEPGAAISMDSRIAMKTQFSGGFFSGLLKKWFGGESLLVNVFRNPTTHPL 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
V ++ ++ + I+L + QP A++ +K+ TL EG +
Sbjct: 76 EVVLSQATIGDMACIELTQ--SSICLQPGAYIAHTPGIKI--TLGWAGFASWFAREGLFK 131
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
KLTG G F A G + K + GE I VD +VA ++ + I +G + ++ +
Sbjct: 132 LKLTGTGRVFFGAYGGLTPKRIN-GEFI-VDNGHLVAYEPTIKMGISMSGGLFSSISSRE 189
Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQ 240
V + G GI+++QS L Q
Sbjct: 190 GFVNR-LKGQGIIYLQSRSVEGLVQ 213
>gi|254169226|ref|ZP_04876060.1| conserved hypothetical protein TIGR00266 [Aciduliprofundum boonei
T469]
gi|197621822|gb|EDY34403.1| conserved hypothetical protein TIGR00266 [Aciduliprofundum boonei
T469]
Length = 222
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/206 (27%), Positives = 97/206 (47%), Gaps = 23/206 (11%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL---RNPG 91
++ L P E++VA G+M +MS ++ +E + G + + K ++ L
Sbjct: 14 LKVYLSPGERIVAEAGAMVYMSPTIGVETT----TKGGFMKGVLRKMLSGQTLFMNTYYA 69
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR-NVIGG 149
S+G + APSL I+ I++ QP S + S L+ V+ G
Sbjct: 70 QSEGGYVVFAPSLPGDIIKIEV--------TQP--IYISDTNYLASTELEFGVKFTGFKG 119
Query: 150 I---EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
I G K++G G A+I A G + L GE + +D + A +S+N ++ G
Sbjct: 120 IFTPGGMFWFKVSGIGKAWIAAYGGIDMLQLGPGERLLIDNIHLAAFDASMNFTLRKFGK 179
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ +FGG+ ++T V GPG V+IQS
Sbjct: 180 LKSFLFGGEYILTEFV-GPGRVWIQS 204
>gi|358388687|gb|EHK26280.1| hypothetical protein TRIVIDRAFT_166887 [Trichoderma virens Gv29-8]
Length = 436
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 94/201 (46%), Gaps = 15/201 (7%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
+ I L + A+PG+M MS ++ ++ N+ ++ + G + S PG
Sbjct: 231 ILTIQLAMGAPLHAKPGAMVAMSPTITLKGNVKFSVKKI-----IAGGELASSTYTGPGE 285
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
+ +A SL I I L G E DA+L S V V + Q + I E
Sbjct: 286 ----LILAPASLGDITSIRLT--GKEQWSVGHDAYLASTQAV-VKDYKRQGLGKAIFSGE 338
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G K++G GL ++ + G++++K+L GE VD +VA +S ++ +G I +
Sbjct: 339 GLFVYKISGTGLLWVSSFGAIIRKDLAQGEKYIVDNGHLVAWNTSYVLERVASGGIISGL 398
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ LV TGPG VF+Q+
Sbjct: 399 SSGEGLVCK-FTGPGTVFLQT 418
>gi|340517568|gb|EGR47812.1| predicted protein [Trichoderma reesei QM6a]
Length = 254
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 56/189 (29%), Positives = 91/189 (48%), Gaps = 17/189 (8%)
Query: 47 ARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPS-L 104
A+PG+M MS ++ ++ N+ ++ + G ++S PG I APS L
Sbjct: 62 AKPGAMVAMSPTITLKGNVKFSVKKI-----IAGGELSSSTYTGPGEL-----ILAPSAL 111
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I I L G E DA+L S V V + Q + I EG K++G GL
Sbjct: 112 GDITSIRLT--GQEQWSVGHDAYLASTQAV-VKDYKRQGLGKAIFSGEGLFVYKISGTGL 168
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
++ + G++++K+L GE VD +VA +S ++ +G + + G+ LV T
Sbjct: 169 LWVSSFGAIIRKDLAAGEKYIVDNGHLVAWNTSYVLERVASGGLISGLSSGEGLVCK-FT 227
Query: 224 GPGIVFIQS 232
GPG VF+Q+
Sbjct: 228 GPGTVFLQT 236
>gi|443324895|ref|ZP_21053617.1| TIGR00266 family protein [Xenococcus sp. PCC 7305]
gi|442795497|gb|ELS04862.1| TIGR00266 family protein [Xenococcus sp. PCC 7305]
Length = 245
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 58/233 (24%), Positives = 105/233 (45%), Gaps = 12/233 (5%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTITSVVLRNPGPSDG 95
+ L+P E + A G+M M + M+ + + + LF G+++ V N
Sbjct: 16 VTLEPGESITAEAGAMVSMDSMLTMKTEFSGGFLPALGKKLFGGESLFVNVFANKTSRPL 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
V ++ + I I+L G EL QP A++ VK++ ++G EG +
Sbjct: 76 QVVLSQSVIGDIAEIELQ--GRELYFQPGAYIAHTAGVKMTVGWAGFTSWLMG--EGLFK 131
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
K+ G G F A G + +K + G+ I VD S +VA + + IK +G + ++ G+
Sbjct: 132 LKIGGIGKVFFGAYGGITKKTI-AGQFI-VDNSHLVAYEPGIKMGIKLSGGLLGSLTSGE 189
Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQ----RIARAVTSPNMRENPKFFVQIAIF 264
LV ++G G +++QS F L + +I + R + +Q +F
Sbjct: 190 GLVNR-LSGHGDIYLQSRSFDGLLRFLRPQIGKIANQARGRRKVRGMIQDRVF 241
>gi|239611489|gb|EEQ88476.1| DUF124 domain-containing protein [Ajellomyces dermatitidis ER-3]
Length = 334
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 91/189 (48%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + M + + G +++ PG + AP+L
Sbjct: 139 LSAKPGAMIAMSPTMTLKG----SVKFSMKKLIAGGEMSTSTYTGPGEL-----LLAPTL 189
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ + L + G + DAFL + V V + +Q + + EG K++G GL
Sbjct: 190 LGDITV-LRLNGHDTWSVGKDAFLAATQGV-VKDYKNQGLSKALFSGEGLFVYKMSGNGL 247
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +VA ++ +G I + G+ LV T
Sbjct: 248 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWNCKYVMERIASGGIISGMSAGEGLVCK-FT 306
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 307 GPGTVYLQT 315
>gi|390938192|ref|YP_006401930.1| hypothetical protein Desfe_0450 [Desulfurococcus fermentans DSM
16532]
gi|390191299|gb|AFL66355.1| TIGR00266 family protein [Desulfurococcus fermentans DSM 16532]
Length = 220
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 95/205 (46%), Gaps = 18/205 (8%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMEN-----IYIPENEVGMWQWLFGKTITSVVL 87
+ ++ L P E + GS G++++E + V + LF IT++
Sbjct: 12 SILKVSLDPGESITIEAGSYMLHKGNVKIETTTGGVLSAIARAVAGGESLFLDKITALT- 70
Query: 88 RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
PS+ ++ P I+ ID+ GEL+ Q +L + + +R ++
Sbjct: 71 ----PSEVWIAPNVP--GDIVAIDVD---GELIVQDTGYLAHAGGFDIGIAW-RGLRGLL 120
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
E F +LTG+G F+ + G++ + L+ GE T+D +VA+ SV IK +
Sbjct: 121 AEGELFWL-RLTGKGTVFVNSYGAIEELVLKPGEKATIDNFHLVAMEPSVKWSIKTFSGL 179
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
+ + GG+ LV V GPG +++Q+
Sbjct: 180 KSTILGGEGLVVE-VEGPGKLWVQT 203
>gi|315641638|ref|ZP_07896702.1| conserved hypothetical protein [Enterococcus italicus DSM 15952]
gi|315482598|gb|EFU73129.1| conserved hypothetical protein [Enterococcus italicus DSM 15952]
Length = 269
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 97/229 (42%), Gaps = 15/229 (6%)
Query: 23 IPFQILGG-EAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ +++LGG ++ + QI LK E + G+M + SG + ++ +G G++
Sbjct: 37 MNYKLLGGTDSPILQIQLKTGESIQLERGAMAYHSGVELTGKLNASKSGLGGLMHAIGRS 96
Query: 82 ITS----VVLRNPG-PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
+TS + G D F+GIA +I P+ ++ + AFL + V
Sbjct: 97 MTSGESLFISSATGLTEDSFIGIAPGKPGKIHPLQVSN-AKQYCLNTGAFLACDSQVTYK 155
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
GG G + TG G F+ A G +++ + + +T+D +VA
Sbjct: 156 MARQNISGAFFGGTGGLFVMETTGDGTIFVSAFGDLLELTVSPNQALTIDNEHVVA---- 211
Query: 197 VNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ + Y+ + FG G+ LV V G G V IQ+ L + +
Sbjct: 212 WDTNLTYDIQVASGTFGFKTGEGLVN-VFQGSGKVIIQTRNLRNLGEAL 259
>gi|55377179|ref|YP_135029.1| hypothetical protein rrnAC0266 [Haloarcula marismortui ATCC 43049]
gi|448640547|ref|ZP_21677450.1| hypothetical protein C436_11803 [Haloarcula sinaiiensis ATCC 33800]
gi|55229904|gb|AAV45323.1| unknown [Haloarcula marismortui ATCC 43049]
gi|445761857|gb|EMA13096.1| hypothetical protein C436_11803 [Haloarcula sinaiiensis ATCC 33800]
Length = 237
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 62/238 (26%), Positives = 96/238 (40%), Gaps = 25/238 (10%)
Query: 36 QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGPS 93
+ L E ++A PG+M S ++E+E + + + + G + + GP
Sbjct: 15 RAALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTARGGP- 73
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
G V +A P+ + +L GE L D AFL S + + + GGI+
Sbjct: 74 -GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLASDPGIDIDSEF--------GGIKS 121
Query: 153 FLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
L L G G I A G + L+ GE T+D +VA SVN G
Sbjct: 122 LLAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVGG 181
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAIF 264
++ + G+ LV TGPG V+ Q+ + IA A+ P +N I F
Sbjct: 182 LKSTLLSGEGLVMD-FTGPGTVWYQTRGLASFTSAIADAL--PGTGDNDGNSTGIDDF 236
>gi|212546527|ref|XP_002153417.1| DUF124 domain protein [Talaromyces marneffei ATCC 18224]
gi|210064937|gb|EEA19032.1| DUF124 domain protein [Talaromyces marneffei ATCC 18224]
Length = 379
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 87/202 (43%), Gaps = 10/202 (4%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
ARPG M MS ++ ++ + M + L G + PG V +A L
Sbjct: 187 ARPGVMIAMSPTVTLKGSF----HFSMKKLLIGGEMAHSTYTGPGE----VLLAPSVLGD 238
Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
I + L G DA+L + VK + + G EG+ +TG GL ++
Sbjct: 239 ITILRLTDDGETWKVGRDAYLANTTSVKKKYKAQNLSKTLFSG-EGWFVYNITGPGLVWM 297
Query: 167 LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPG 226
+ G+V++K+L GE VD +VA ++ +G + ++ G+ L GPG
Sbjct: 298 QSFGAVIRKDLVEGESYFVDNGHLVAWNCEYGIERVASGGVISSLSSGEGLA-CRFKGPG 356
Query: 227 IVFIQSLPFHRLSQRIARAVTS 248
V++Q+ + + +I + S
Sbjct: 357 TVYLQTRNLNAFAMQIGASTAS 378
>gi|357038024|ref|ZP_09099823.1| protein of unknown function DUF124 [Desulfotomaculum gibsoniae DSM
7213]
gi|355360580|gb|EHG08338.1| protein of unknown function DUF124 [Desulfotomaculum gibsoniae DSM
7213]
Length = 227
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 91/203 (44%), Gaps = 11/203 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNP 90
+ Q+ L+ E + A +M MS ++++E G+ + L G++ T R
Sbjct: 13 LLQVNLQKGEVIKAESDAMVAMSTNIDVEGKMDGGLLGGLSRMLTGESFFFQTLSARRGA 72
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
G + APS+ + +D+ + G LL Q D FL +++VS + + + G
Sbjct: 73 GEV-----LLAPSIPGDI-VDVELDGSYSLLVQKDGFLAGAAELQVSTKMQNLAQGIFSG 126
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EGF K++G+G+ FI G + NLE G+ VD S +VA + I+
Sbjct: 127 -EGFFVVKISGKGMVFINTYGGIHIINLEPGQEYIVDNSHLVAWPDYMQYTIEKASSGWI 185
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+ F + GPG V IQ+
Sbjct: 186 STFTSGEVAVCRFRGPGPVIIQT 208
>gi|295657406|ref|XP_002789272.1| DUF124 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
gi|226283988|gb|EEH39554.1| DUF124 domain-containing protein [Paracoccidioides sp. 'lutzii'
Pb01]
Length = 338
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + M + + G +++ PG + AP++
Sbjct: 142 LTAKPGAMIAMSPTMTLKGSV----KFSMKKLIAGGEMSTSTYTGPGEL-----LLAPTM 192
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ + L + G E DA+L S V + + +Q + EG K++G GL
Sbjct: 193 LGDISV-LRLSGHETWSVGKDAYLSSTQGV-IKDYKNQGLSKAFFSGEGLFVYKISGNGL 250
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE +D +VA + ++ +G I + G+ LV T
Sbjct: 251 LFLQSFGAIIKKDLADGEKYNIDNGHLVAWSCKYVMERVASGGIISGMTSGEGLVCK-FT 309
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 310 GPGSVYMQT 318
>gi|427420415|ref|ZP_18910598.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
gi|425763128|gb|EKV03981.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
Length = 220
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 88/199 (44%), Gaps = 12/199 (6%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT---ITSVVLRNPGPS 93
+ L+P E++ A G+M M G + M + G+ + FG + R P
Sbjct: 16 VTLQPGERIFAEAGAMASMDGQISMGTGFFGGLIPGLLKAFFGGESLFLNHFANRTNHPQ 75
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
+ + P++ I ++L G EL QP A++ + VKV+ G EG
Sbjct: 76 T--LVLTQPTIGDIARMELN--GNELCLQPGAYIAHSSGVKVALQWAGLASWFAG--EGL 129
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ KL G+GL F A G + + + V + VD +VA + + I +G + ++
Sbjct: 130 FKLKLGGRGLVFFGAYGGLSE--VTVKDEFIVDNGHLVAYEPGIKMNIGLSGGMVGSITS 187
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ + + G G +++QS
Sbjct: 188 GEGFINR-LQGRGKIYLQS 205
>gi|172035646|ref|YP_001802147.1| hypothetical protein cce_0730 [Cyanothece sp. ATCC 51142]
gi|354554887|ref|ZP_08974190.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
gi|171697100|gb|ACB50081.1| unknown [Cyanothece sp. ATCC 51142]
gi|353553041|gb|EHC22434.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
Length = 225
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 102/207 (49%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
++ + +++L E++VA+ GSM MSG S + G + L G+++ V
Sbjct: 10 DSAIAKVILNAGEEIVAQAGSMVAMSGFINTSTTLRQGKGGGIMGGFKRILGGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P ++G V +A + IL + M G EL+ Q ++L S NDV + Q ++++
Sbjct: 70 FRSP-TAEGEVFLAPRLMGDILVYE--MQGQELVVQAASYLASANDVDLDLGF-QGMKSL 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
G E +TG G + + G + + +++ G+ + VD IVA +++ +I K N
Sbjct: 126 FSG-ESIFWLSVTGYGPVILTSFGGIYEIDVD-GDYV-VDTGHIVAFEKTLDFRIDKANS 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
A FGG+ + G G V+ Q+
Sbjct: 183 SWLGAFFGGEGFICR-FHGKGKVYCQT 208
>gi|261205112|ref|XP_002627293.1| DUF124 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
gi|239592352|gb|EEQ74933.1| DUF124 domain-containing protein [Ajellomyces dermatitidis
SLH14081]
Length = 312
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 91/189 (48%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + M + + G +++ PG + AP+L
Sbjct: 117 LSAKPGAMIAMSPTMTLKGSV----KFSMKKLIAGGEMSTSTYTGPGEL-----LLAPTL 167
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ + L + G + DAFL + V V + +Q + + EG K++G GL
Sbjct: 168 LGDITV-LRLNGHDTWSVGKDAFLAATQGV-VKDYKNQGLSKALFSGEGLFVYKMSGNGL 225
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +VA ++ +G I + G+ LV T
Sbjct: 226 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWNCKYVMERIASGGIISGMSAGEGLVCK-FT 284
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 285 GPGTVYLQT 293
>gi|417765794|ref|ZP_12413750.1| TIGR00266 family protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
gi|400351733|gb|EJP03946.1| TIGR00266 family protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
Length = 221
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 89/199 (44%), Gaps = 8/199 (4%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
++ L P + + A G+M +M+ +++E +++ FG + +
Sbjct: 14 LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAIYRRFFGGESFFFNVFKASVTG 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G +G+A ++ IDL G L+ + A+L S + + +R+ GG EG
Sbjct: 74 GEIGLAPELPGDVVGIDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFFGG-EGIF 129
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
+ G G F+ A G ++ ++V T+D IVA S+ +I K G + FG
Sbjct: 130 LLEAFGNGKLFLNAYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFG 187
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205
>gi|254428814|ref|ZP_05042521.1| conserved hypothetical protein TIGR00266 [Alcanivorax sp. DG881]
gi|196194983|gb|EDX89942.1| conserved hypothetical protein TIGR00266 [Alcanivorax sp. DG881]
Length = 220
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 58/211 (27%), Positives = 98/211 (46%), Gaps = 13/211 (6%)
Query: 26 QILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-QWLFGKTIT 83
+I GG A + L+P E V+ +M M ++++ I+ G+ ++L G+T+
Sbjct: 4 EIKGGAAFSYIDMDLEPGESVITESDAMSSMDARLDLKAIFNGGLLRGLLRKFLGGETLF 63
Query: 84 SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
N S + + P+ I +L G L QP AFL V +L R
Sbjct: 64 VSRFSNHSDSPARLTVVQPTPGEIRCHELK--DGSLFLQPGAFLACEETV----SLGLRW 117
Query: 144 RNVIGGI--EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
I I EG R ++ G G + A G+++ + ++ GE I VD S +VA ++ +++
Sbjct: 118 AGFISWIAREGLFRLEVKGTGKVWYGAYGALLDREVD-GEFI-VDTSHLVAYDPTIKLKL 175
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ + + F G+ LVT V G G V IQ+
Sbjct: 176 QLAAGLFSSFFSGEGLVTR-VQGKGRVVIQT 205
>gi|392570467|gb|EIW63640.1| DUF124-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 299
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 95/213 (44%), Gaps = 17/213 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ L+P +V A+PGSM M +++++ + + + + L G ++ + PG
Sbjct: 97 LRLQPGAEVKAKPGSMVAMDATVKIKG----KLKFSVKKLLTGGEMSESIFTGPGE---- 148
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
V +A + I+PI L + DA+L V + T Q V + EG
Sbjct: 149 VMVAPETWGDIVPIHLDGR-TQWSVGRDAYLACTMGVSRT-TKSQGVGKALFSGEGLFVH 206
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
+ G G+ ++ + G++ + L+ GE VD +VA T+ V+ G F G +
Sbjct: 207 NVVGGGMMWVTSLGAIFSQTLQPGEQWIVDNGHLVAWTAKYKVERIDAG----GFFSGSH 262
Query: 217 LVTAVV---TGPGIVFIQSLPFHRLSQRIARAV 246
++ TGPGIV+IQ+ L + I V
Sbjct: 263 TDEGLICRFTGPGIVYIQTRNPETLGEWIRDQV 295
>gi|325000442|ref|ZP_08121554.1| hypothetical protein PseP1_16822 [Pseudonocardia sp. P1]
Length = 238
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 58/226 (25%), Positives = 93/226 (41%), Gaps = 39/226 (17%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ + L+ V A G+M MS +++ E E G+W S + R+ G
Sbjct: 28 LATLWLEAGRSVRAEAGAMVAMSREIDL----TTELEGGLW---------SAMKRSAGGR 74
Query: 94 DGFVGI--AAPSLARIL-----PIDL---AMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
V A R+L P DL A+ G A+L + DV V
Sbjct: 75 SASVSTFTAQDGRGRLLLAPPAPGDLVAAALSGNSYDIAGHAYLAATEDVAVDTQW---- 130
Query: 144 RNVIGGIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
GG + F + +G G+ F+ A G+++ + L GE + VD +VA SS+
Sbjct: 131 ----GGAKAFFASDNALVVRASGDGVVFLSAFGALLPQELAAGETLIVDTGHLVAWDSSM 186
Query: 198 NVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
Q+ K +++ G+ LV A TGPG + +QS L+ +
Sbjct: 187 QYQVRKAAKSAWKSMTSGEGLV-AEFTGPGNLLLQSRNLEALADAL 231
>gi|406836321|ref|ZP_11095915.1| hypothetical protein SpalD1_31979 [Schlesneria paludicola DSM
18645]
Length = 229
Score = 52.0 bits (123), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 93/211 (44%), Gaps = 23/211 (10%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVLR 88
V + L E+VVA+P SM + + + G W L G++ V R
Sbjct: 13 VVEFQLNESEEVVAQPNSMISFTPGIHISAAIGQTQGEGSWFAGFKSLLGGESFFRAVFR 72
Query: 89 NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
+ + +A + ILPI+L L + ++L + D + LD R G
Sbjct: 73 AKRDAQTLL-LAPDNNGDILPIELNASETWYLSR-GSYLAHIGDCR----LDARY----G 122
Query: 149 GIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ-I 201
G++G L + ++G G F G+++++ L GE +D +VA + +V + +
Sbjct: 123 GVKGVLSKTGLFLLHVSGVGTLFCQTYGAIMERELGEGEQFLIDNRYVVAFSGTVEYELV 182
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
K + +R + G+ +V TGPG V+ Q+
Sbjct: 183 KASRSLRHSFMSGEGMVNR-YTGPGRVYYQT 212
>gi|315043859|ref|XP_003171305.1| DUF124 domain-containing protein [Arthroderma gypseum CBS 118893]
gi|311343648|gb|EFR02851.1| DUF124 domain-containing protein [Arthroderma gypseum CBS 118893]
Length = 330
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS S+ ++ + M + L G ++S PG + APS+
Sbjct: 133 LTAKPGAMIAMSPSVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 183
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ I L + G E DAFL V V + +Q + EG K++G G+
Sbjct: 184 LGDISI-LKLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 241
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +V+ ++ +G + + G+ LV T
Sbjct: 242 LFVQSFGAIIRKDLADGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 300
Query: 224 GPGIVFIQS 232
GPG +++Q+
Sbjct: 301 GPGTIYLQT 309
>gi|423232305|ref|ZP_17218706.1| hypothetical protein HMPREF1063_04526 [Bacteroides dorei
CL02T00C15]
gi|423242816|ref|ZP_17223892.1| hypothetical protein HMPREF1064_00098 [Bacteroides dorei
CL02T12C06]
gi|392624606|gb|EIY18686.1| hypothetical protein HMPREF1063_04526 [Bacteroides dorei
CL02T00C15]
gi|392646809|gb|EIY40516.1| hypothetical protein HMPREF1064_00098 [Bacteroides dorei
CL02T12C06]
Length = 219
Score = 51.6 bits (122), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 4/112 (3%)
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
L+C ++ S N V+VS + + ++GG+ G L QK++G F+ G + K L
Sbjct: 100 LICNRGVYVASNNLVQVSAKIS--IAGIVGGMGGLL-QKISGTSTVFLDCKGQPITKELC 156
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFI 230
GE I VD I+A+ Q+ ++ VFGG+ L +TGPG V +
Sbjct: 157 AGETIEVDEDHIIAMQGISEHQLSSAWSLKN-VFGGEGLSMLRITGPGKVHL 207
>gi|327348500|gb|EGE77357.1| hypothetical protein BDDG_00294 [Ajellomyces dermatitidis ATCC
18188]
Length = 334
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 50/189 (26%), Positives = 90/189 (47%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + M + + G +++ PG + AP+L
Sbjct: 139 LSAKPGAMIAMSPTMTLKGSV----KFSMKKLIAGGEMSTSTYTGPGEL-----LLAPTL 189
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ + L + G + DAFL + V V + +Q + + EG K++G GL
Sbjct: 190 LGDITV-LRLNGHDTWSVGKDAFLAATQGV-VKDYKNQGLSKALFSGEGLFVYKMSGNGL 247
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ G++++K+L GE VD +VA ++ +G I + G+ LV T
Sbjct: 248 LFMQNFGAIIKKDLADGEKYNVDNGHLVAWNCKYVMERIASGGIISGMSAGEGLVCK-FT 306
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 307 GPGTVYLQT 315
>gi|336473276|gb|EGO61436.1| hypothetical protein NEUTE1DRAFT_120406 [Neurospora tetrasperma
FGSC 2508]
gi|350293451|gb|EGZ74536.1| DUF124-domain-containing protein [Neurospora tetrasperma FGSC 2509]
Length = 446
Score = 51.6 bits (122), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 49/187 (26%), Positives = 87/187 (46%), Gaps = 13/187 (6%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
A+PGSM MS S+ ++ Y + M + + G + + PG + AP++
Sbjct: 255 AKPGSMIAMSPSVSLKGAY----KFSMKKLVAGGEMGTSTYTGPGEL-----LLAPAMLG 305
Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
+ L + G + D +L S +V + Q + + EG K++G GL +
Sbjct: 306 DI-TSLRLDGSQTWSVSHDGYLASTQNV-TKDYKRQGLGKAMFSGEGLWVYKISGTGLLW 363
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
+ + G++V+K+L GE VD +VA + ++ +G + G+ LV TGP
Sbjct: 364 LTSFGAIVRKDLADGEKYIVDNGHLVAWNTKYILERVASGGLLSGFASGEGLV-CKFTGP 422
Query: 226 GIVFIQS 232
G +FIQ+
Sbjct: 423 GTIFIQT 429
>gi|126656110|ref|ZP_01727494.1| hypothetical protein CY0110_03469 [Cyanothece sp. CCY0110]
gi|126622390|gb|EAZ93096.1| hypothetical protein CY0110_03469 [Cyanothece sp. CCY0110]
Length = 220
Score = 51.2 bits (121), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 51/197 (25%), Positives = 91/197 (46%), Gaps = 8/197 (4%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-MWQWLFGKTITSVVLRNPGPSDG 95
+ L P E ++A G+M M G + M+ + ++L G+++ V +N
Sbjct: 16 VTLAPGESIIAEAGAMASMDGRLSMKTKLSGGFFAAILRKFLGGESLFVNVFKNQTSQPL 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
+ + + I IDL G E+ QP A++ S + + G EG +
Sbjct: 76 TLVLTQSIIGDIECIDLR--GNEICFQPGAYIASSPKINLGVQWAGFASWFAG--EGLFK 131
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
K++G G F A G + ++ + GE I VD S +VA + + + I +G + +V G+
Sbjct: 132 LKVSGHGTVFFGAYGGISKQTIS-GEFI-VDNSHLVAYSPEIKMSIGLSGGLISSVTSGE 189
Query: 216 NLVTAVVTGPGIVFIQS 232
LV + G GI+++QS
Sbjct: 190 GLVNR-LKGNGIIYLQS 205
>gi|451997933|gb|EMD90398.1| hypothetical protein COCHEDRAFT_1139761 [Cochliobolus
heterostrophus C5]
Length = 250
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 92/201 (45%), Gaps = 11/201 (5%)
Query: 48 RPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARI 107
+PGSM MS +M ++ + + + L G ++ + PG + +A P++ I
Sbjct: 61 KPGSMIAMSPTMTLKG----NLKFSLKKALIGGEMSKSIYTGPGE----LLLAPPAIGDI 112
Query: 108 LPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFIL 167
I L L + DAFL + + + + + G EGF K++G G+ +
Sbjct: 113 TIIKLGGNDTWSLGR-DAFLACTQGIVMEHKSQGISKGMFSG-EGFFVYKISGTGILWCT 170
Query: 168 AGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGI 227
+ G++++K+L E +D +VA ++ +G I + G+ LV TGPG
Sbjct: 171 SFGAIIRKDLAANEKYIIDNGHLVAWNCKYVLERVASGGIVSTMAAGEGLVCK-FTGPGT 229
Query: 228 VFIQSLPFHRLSQRIARAVTS 248
VF+Q+ +Q +A T+
Sbjct: 230 VFMQTRNATAFAQFMASHATA 250
>gi|336055002|ref|YP_004563289.1| hypothetical protein WANG_1492 [Lactobacillus kefiranofaciens ZW3]
gi|333958379|gb|AEG41187.1| Protein of hypothetical function DUF124 [Lactobacillus
kefiranofaciens ZW3]
Length = 250
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 51/215 (23%), Positives = 93/215 (43%), Gaps = 10/215 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITS-----VVL 87
+ I L+P EKV + GSM + S S+++ + + +G G++I+S +
Sbjct: 32 LIDIELEPSEKVYIQRGSMVYHSPSIKLNTKVNASGSGIGKLVRAMGRSISSGESFWITE 91
Query: 88 RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
G + IA +LP++L + FL S R ++
Sbjct: 92 AQAHKELGHLAIAPGLPGEVLPLNLGE--KQYRINDGKFLAMDGSASYSMKKQTLGRALL 149
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
G GF +G G+ A GS+ + +L E IT+D + +VA ++ + I +
Sbjct: 150 SGTGGFFVMTTSGDGVVLCNAYGSIKKIDLNDDE-ITIDNNHVVAWSTDLKYDIHFESGF 208
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+++ G+ +V G G V+IQSL +++I
Sbjct: 209 VQSIGTGEGVVN-TFKGTGEVYIQSLNLETFAKKI 242
>gi|218244960|ref|YP_002370331.1| hypothetical protein PCC8801_0069 [Cyanothece sp. PCC 8801]
gi|257057985|ref|YP_003135873.1| hypothetical protein Cyan8802_0067 [Cyanothece sp. PCC 8802]
gi|218165438|gb|ACK64175.1| protein of unknown function DUF124 [Cyanothece sp. PCC 8801]
gi|256588151|gb|ACU99037.1| protein of unknown function DUF124 [Cyanothece sp. PCC 8802]
Length = 220
Score = 51.2 bits (121), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 94/200 (47%), Gaps = 14/200 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMWQWLFGKTITSVVLRNPGP 92
+ L+P E ++A G+M M G++ M+ + IP + ++ G+++ V +N
Sbjct: 16 LTLEPGETMIAEAGAMASMDGALSMKTEFSGGLIPAL---LKKFFGGESLFVNVFKNTTG 72
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
V + + I ++L G E+ QP A++ S + VK+ G EG
Sbjct: 73 EPLTVVLTQSVIGDIDRLELN--GNEMYLQPGAYIASTSKVKLGVGWAGFASWFAG--EG 128
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
+ K++G G A G +++ + GE I VD S +VA + + + I +G ++
Sbjct: 129 LFKLKVSGTGRVLFGAYGGIIKHRIS-GEFI-VDNSHLVAYSPKIKMSIGLSGNFISSIT 186
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
G+ + + G G++++QS
Sbjct: 187 SGEGFINR-LKGDGVIYLQS 205
>gi|448377165|ref|ZP_21560008.1| hypothetical protein C479_12204 [Halovivax asiaticus JCM 14624]
gi|445656046|gb|ELZ08887.1| hypothetical protein C479_12204 [Halovivax asiaticus JCM 14624]
Length = 225
Score = 51.2 bits (121), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 92/199 (46%), Gaps = 11/199 (5%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--KTITSVVLRNPGPSD 94
+ L E ++A PG+M S ++E+ + + + + G + + GP
Sbjct: 16 VELAAGETILAEPGAMVGHSPNVEISTTSSRDGLLSSAKSMLGGESLVVNEFTAEGGP-- 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G V +A P+ + +L E L D AFL S D+ + + +++++GG
Sbjct: 74 GEVRLAPPTPGDVKEHELQ---DETLYTTDGAFLASSPDIDIDSEF-GGLKSMLGGAS-L 128
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
L G G A+I A G + + +L+ GE T+D ++A SV + + G ++ +
Sbjct: 129 TPLALKGTGTAYIDAYGGLERLDLDPGESYTLDNEHLIAWDDSVEFETRRVGGLKSTLLS 188
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV TGPG V+ Q+
Sbjct: 189 GEGLVFD-FTGPGSVWYQT 206
>gi|346318745|gb|EGX88347.1| DUF124 domain-containing protein [Cordyceps militaris CM01]
Length = 475
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 49/191 (25%), Positives = 86/191 (45%), Gaps = 21/191 (10%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVG-----IAA 101
A+PGSM MS +M + G +++ K I L + ++G +A
Sbjct: 283 AKPGSMIAMSPTMTLR---------GEFKFSVKKMIAGAELGH----SNYIGPGELLLAP 329
Query: 102 PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
P L I I L + C DA+L S V+ + + + G EG K++G
Sbjct: 330 PMLGDITTIRLDG-AAQWTCGHDAYLASTQGVQKDHKRQGLGKAMFSG-EGLFVYKMSGT 387
Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
G+ ++ + G++++K+L GE +D +VA + ++ +G I + LV
Sbjct: 388 GILWVSSFGAIIRKDLADGEKYIIDNGHLVAWNTKYILERVASGGIISNFASAEGLVCK- 446
Query: 222 VTGPGIVFIQS 232
TGPG V++Q+
Sbjct: 447 FTGPGTVYMQT 457
>gi|380482182|emb|CCF41393.1| hypothetical protein CH063_11691 [Colletotrichum higginsianum]
Length = 199
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 86/187 (45%), Gaps = 13/187 (6%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
A+PG M MS S+ + + + M + + G + S PG + +A P L
Sbjct: 7 AKPGVMVAMSHSITLRG----QLKFSMKKLVAGAEMASSTFVGPGE----LLLAPPMLGD 58
Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
I + L G E DA+L S V + + Q + I EG K++G GL +
Sbjct: 59 ITSLRLT--GQETWSVGQDAYLASTQGV-IKDYKRQGLSKAIFSGEGLYVYKISGTGLLW 115
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
+ + G++++K+L GE VD +VA ++ +G I + G+ LV TGP
Sbjct: 116 LTSFGAIIRKDLMDGEKYVVDNGHLVAWNVKYIMERVASGGIISGLSSGEGLVCK-FTGP 174
Query: 226 GIVFIQS 232
G VF+Q+
Sbjct: 175 GTVFMQT 181
>gi|402834605|ref|ZP_10883204.1| TIGR00266 family protein [Selenomonas sp. CM52]
gi|402277553|gb|EJU26627.1| TIGR00266 family protein [Selenomonas sp. CM52]
Length = 229
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 73/144 (50%), Gaps = 15/144 (10%)
Query: 110 IDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI---EGFLRQKLTGQGLAF 165
+D+A+ G ++ Q FL + V + D +++ ++ G EGF KLTG+G+AF
Sbjct: 88 MDVALDGSYGMIVQKGGFLAAEETVNI----DSKMQGLMQGFFSQEGFFLLKLTGKGIAF 143
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY--NGPIRRAVFGGDNLVTAVVT 223
+ + G + NLE GE + VD +VA +N +I+ NG I +V G+ LV
Sbjct: 144 LSSYGVIHVLNLEAGEEVIVDNGHLVAWPDYMNYKIEKASNGWI-SSVMSGECLVCR-FR 201
Query: 224 GPGIVFIQS---LPFHRLSQRIAR 244
GPG V IQ+ F Q I R
Sbjct: 202 GPGPVLIQTRNPAGFETWIQEIVR 225
>gi|422600397|ref|ZP_16674232.1| hypothetical protein PSYMO_31452, partial [Pseudomonas syringae pv.
mori str. 301020]
gi|330893030|gb|EGH25691.1| hypothetical protein PSYMO_31452 [Pseudomonas syringae pv. mori
str. 301020]
Length = 220
Score = 50.8 bits (120), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 41/145 (28%), Positives = 66/145 (45%), Gaps = 26/145 (17%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY--IPENEVGMWQWLF-- 78
+ F++ G E Q ++ L P E VA G+M + + ++ME I+ G+ LF
Sbjct: 34 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 93
Query: 79 ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG---------ELLCQP 123
G+++ + V G G V AAP ILP++L FGG + QP
Sbjct: 94 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGYRNWSWKEPAVQYQP 153
Query: 124 DAFLCSVNDVKVSNTL-DQRVRNVI 147
D +D ++ + L +QR+R I
Sbjct: 154 D------DDPRLKSDLTEQRIRQSI 172
>gi|410584105|ref|ZP_11321210.1| TIGR00266 family protein [Thermaerobacter subterraneus DSM 13965]
gi|410504967|gb|EKP94477.1| TIGR00266 family protein [Thermaerobacter subterraneus DSM 13965]
Length = 226
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 93/207 (44%), Gaps = 26/207 (12%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVLRNPG 91
+ L+P E++ A PG++ MSG++ ++ E G+ L G++ + R G
Sbjct: 16 VELEPGEELQAEPGALVSMSGNVSLQ----AEARGGLLGALSRSLLGGESFFTSRYRADG 71
Query: 92 PSDGFVGIAAPSLARILPID---LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
+ LA LP D L + G + + A+L + T+D R G
Sbjct: 72 ARGEVM------LAPALPGDITVLELAGEVVYLKSGAYLAGATTL----TVDARWGGARG 121
Query: 149 --GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
G G ++ G G FI + G++ +K+L G+ VD +VA T + Q++ G
Sbjct: 122 FFGSGGLFLLRVAGHGALFINSYGALHRKDLAPGQRYVVDTGHVVAFTDGMAFQVRTAGS 181
Query: 207 -IRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ ++ G+ L TGPG V+IQ+
Sbjct: 182 GLFSSIASGEGLACE-FTGPGTVYIQT 207
>gi|260887588|ref|ZP_05898851.1| conserved hypothetical protein [Selenomonas sputigena ATCC 35185]
gi|330837957|ref|YP_004412537.1| hypothetical protein Selsp_0098 [Selenomonas sputigena ATCC 35185]
gi|260862604|gb|EEX77104.1| conserved hypothetical protein [Selenomonas sputigena ATCC 35185]
gi|329745721|gb|AEB99077.1| protein of unknown function DUF124 [Selenomonas sputigena ATCC
35185]
Length = 229
Score = 50.8 bits (120), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 69/129 (53%), Gaps = 12/129 (9%)
Query: 110 IDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI---EGFLRQKLTGQGLAF 165
+D+A+ G ++ Q FL + V + D +++ ++ G EGF KLTG+G+AF
Sbjct: 88 MDVALDGSYGMIVQKGGFLAAEETVNI----DSKMQGLMQGFFSQEGFFLLKLTGKGIAF 143
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY--NGPIRRAVFGGDNLVTAVVT 223
+ + G + NLE GE + VD +VA +N +I+ NG I +V G+ LV
Sbjct: 144 LSSYGVIHVLNLEAGEEVIVDNGHLVAWPDYMNYKIEKASNGWI-SSVMSGECLVCR-FR 201
Query: 224 GPGIVFIQS 232
GPG V IQ+
Sbjct: 202 GPGPVLIQT 210
>gi|374725017|gb|EHR77097.1| putative protein of unknown function DUF124 [uncultured marine
group II euryarchaeote]
Length = 216
Score = 50.4 bits (119), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 59/213 (27%), Positives = 98/213 (46%), Gaps = 19/213 (8%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEME------NIYIPENEVGMW-QWLFGKTITSVVLRNPG 91
L P E V PG+M M G +EM+ I+ ++ + + F T T+ G
Sbjct: 5 LSPGESVNVEPGAMVAMGG-VEMQTRKSGGGIFKSLKKMALGGESFFLNTFTA------G 57
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
+ G+V +A S I +L G L Q ++L S +++ ++T Q + + G
Sbjct: 58 AAGGWVSVAPGSPGDIQGFNLVP-GQALFIQGGSYLASTLNIQ-TDTKFQGAKGFLSGES 115
Query: 152 GFLRQKL--TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
F Q GQG + + G++ Q +E G+ ITVD +VA + V I G ++
Sbjct: 116 MFFVQATCAEGQGRVWYNSFGAIKQIPVEPGQTITVDTGHVVAFDAGVQYTISKVGGMKS 175
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ GG+ LV TG G V+IQ+ L+ ++
Sbjct: 176 FLVGGEGLVMN-FTGQGNVWIQTRNVGSLAAKL 207
>gi|427724444|ref|YP_007071721.1| hypothetical protein Lepto7376_2616 [Leptolyngbya sp. PCC 7376]
gi|427356164|gb|AFY38887.1| protein of unknown function DUF124 [Leptolyngbya sp. PCC 7376]
Length = 221
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 50/197 (25%), Positives = 88/197 (44%), Gaps = 8/197 (4%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTITSVVLRNPGPSDG 95
+ L P E+V A G+M M G + M+ + + + F G+++ + RN
Sbjct: 16 VTLDPGEQVTAEAGAMVSMDGGITMKTEFSGGFVQAILRSFFGGESLFVNIYRNTSNKKQ 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
V + + I IDL+ G + QP A++ + G EG +
Sbjct: 76 TVILTQSMIGDIHRIDLSQ--GPICFQPGAYIAHTPGANMGVKWAGFASWFAG--EGLFK 131
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
+ TG+G F G +++K + GE I VD S +VA + + I+ +G + ++ G+
Sbjct: 132 LEFTGKGRVFYGCYGGIMEKQI-AGEYI-VDNSHLVAYEPGITMNIRMSGGLLGSLTSGE 189
Query: 216 NLVTAVVTGPGIVFIQS 232
LV + G G V++QS
Sbjct: 190 GLVNK-LKGRGKVYLQS 205
>gi|182417418|ref|ZP_02948746.1| conserved hypothetical protein [Clostridium butyricum 5521]
gi|237667008|ref|ZP_04526992.1| conserved hypothetical protein [Clostridium butyricum E4 str. BoNT
E BL5262]
gi|182378732|gb|EDT76258.1| conserved hypothetical protein [Clostridium butyricum 5521]
gi|237655356|gb|EEP52912.1| conserved hypothetical protein [Clostridium butyricum E4 str. BoNT
E BL5262]
Length = 226
Score = 50.4 bits (119), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 72/152 (47%), Gaps = 13/152 (8%)
Query: 99 IAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI---EGFL 154
I A SL I+PI L G L Q D FL S DV L+ +V+N+ G+ EGF
Sbjct: 76 IFAHSLPGDIVPITLNGTG--LKIQKDGFLASTIDVD----LETKVQNLTKGLFSGEGFF 129
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV--NVQIKYNGPIRRAVF 212
GQG F+ + G++ + NL E I +D S +VA ++ N++ +
Sbjct: 130 VLGARGQGTVFVSSYGAIHEINLNSNEEIIIDNSHLVAWDETLEYNIEKASKKGWISSFT 189
Query: 213 GGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G+ LV GPG +FIQ+ + IA+
Sbjct: 190 SGEGLVCR-FKGPGKIFIQTRNPDGMKNWIAK 220
>gi|119715736|ref|YP_922701.1| hypothetical protein Noca_1500 [Nocardioides sp. JS614]
gi|119536397|gb|ABL81014.1| protein of unknown function DUF124 [Nocardioides sp. JS614]
Length = 231
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 89/207 (42%), Gaps = 22/207 (10%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG------KTITSVVL 87
++ L E V A G+M MS + ME G+ + + G T T+ V
Sbjct: 13 TAKLTLAAGEAVRAESGAMLAMSPGIAMETSTQGGVLKGLKRSVLGGESFFMNTFTAQV- 71
Query: 88 RNPGPSDGFVGIAAPSLARILPIDLAMF--GGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
DG AP+L P D+ + G L Q +FL + ++++V
Sbjct: 72 ------DGAELYVAPAL----PGDVVAWPLNGVLYVQSGSFLAASSNIQVDTKWGGG--K 119
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
EG ++ GQG + + G++ +L+ G+ TVD +VA T V ++ G
Sbjct: 120 TFFSSEGLFMLRIEGQGELVLSSYGAIHAIDLQAGQTYTVDTGHMVAWTDGVQYNVRKVG 179
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ F G+ LV +TGPG +++Q+
Sbjct: 180 NWKSTFFSGEGLVCD-LTGPGRIYLQT 205
>gi|417783112|ref|ZP_12430835.1| TIGR00266 family protein [Leptospira interrogans str. C10069]
gi|418726164|ref|ZP_13284775.1| TIGR00266 family protein [Leptospira interrogans str. UI 12621]
gi|409953813|gb|EKO08309.1| TIGR00266 family protein [Leptospira interrogans str. C10069]
gi|409960074|gb|EKO23828.1| TIGR00266 family protein [Leptospira interrogans str. UI 12621]
Length = 221
Score = 50.4 bits (119), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 8/199 (4%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
++ L P + + A G+M +M+ +++E + + FG + +
Sbjct: 14 LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKASVTG 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G +G+A I+ IDL G L+ + A+L S + + +R+ GG EG
Sbjct: 74 GEIGLAPELPGDIVGIDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFFGG-EGIF 129
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
+ G G F+ A G ++ ++V T+D IVA S+ +I K G + FG
Sbjct: 130 LLEAFGNGKLFLNAYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFG 187
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205
>gi|363891399|ref|ZP_09318578.1| TIGR00266 family protein [Eubacteriaceae bacterium CM2]
gi|402837607|ref|ZP_10886124.1| TIGR00266 family protein [Eubacteriaceae bacterium OBRC8]
gi|361965456|gb|EHL18438.1| TIGR00266 family protein [Eubacteriaceae bacterium CM2]
gi|402274627|gb|EJU23804.1| TIGR00266 family protein [Eubacteriaceae bacterium OBRC8]
Length = 225
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 71/143 (49%), Gaps = 10/143 (6%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI-- 150
DG V + APSL + ID+ + G L Q D FL ++V D +V+N+ G+
Sbjct: 72 DGEV-LLAPSLLGDV-IDVELDGTYSLRVQKDGFLAGTQGIEV----DTKVQNLSKGLFS 125
Query: 151 -EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EGF ++G+G F+ + GS+ NLE G+ I +D + +VA + +I+
Sbjct: 126 GEGFFILNVSGKGTVFLSSYGSIHSINLEAGQEIVIDNNHLVAWPDHMEYKIEKAASGWI 185
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+ F ++ GPG + IQ+
Sbjct: 186 SSFTSGEMLVCRFRGPGNILIQT 208
>gi|398412410|ref|XP_003857529.1| hypothetical protein MYCGRDRAFT_65617 [Zymoseptoria tritici IPO323]
gi|339477414|gb|EGP92505.1| hypothetical protein MYCGRDRAFT_65617 [Zymoseptoria tritici IPO323]
Length = 362
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/120 (30%), Positives = 59/120 (49%), Gaps = 2/120 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DAFL S V + Q + EGF K+ G G+ +I + G++++K+L GE
Sbjct: 240 DAFLASTQGV-TKTYVSQNFSKAMFSGEGFFIYKIAGVGICWIQSMGAIIRKDLRDGEKY 298
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
+D +VA ++ +G I + G+ LV TGPG VF+Q+ + +Q +A
Sbjct: 299 IIDNGHLVAWNCDYVLERVASGGIISNLSAGEGLVCKF-TGPGTVFMQTRNPQQFAQYMA 357
>gi|429191128|ref|YP_007176806.1| hypothetical protein Natgr_1130 [Natronobacterium gregoryi SP2]
gi|448326682|ref|ZP_21516029.1| hypothetical protein C490_14755 [Natronobacterium gregoryi SP2]
gi|429135346|gb|AFZ72357.1| TIGR00266 family protein [Natronobacterium gregoryi SP2]
gi|445610487|gb|ELY64258.1| hypothetical protein C490_14755 [Natronobacterium gregoryi SP2]
Length = 229
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 90/210 (42%), Gaps = 24/210 (11%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIYIP-------ENEVGMWQWLFGKTITSVVLRNPG 91
L+ E + GSM S +EME ++ + + LF T T+ R
Sbjct: 23 LENGEAITVESGSMVSYSEHVEMETHSSSGGLLSSVKDSMLSGEQLFRNTFTATANRQT- 81
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS---VNDVKVSNTLDQRVRNVIG 148
V A + ++LA G + Q +++ + + VS D +++G
Sbjct: 82 -----VQFAHTQPGDMRALELA--GESVYVQSGSYVANGPGIETDSVSGGFD----SLLG 130
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G +G + +G G F+ + G +V++ LE GE +T+D VA SV G ++
Sbjct: 131 G-KGLFVLEASGTGDLFVGSYGGIVERELEQGEQVTIDAGHSVAWDESVEFSTHRVGGLK 189
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
+ + G+ V +GPG VF+Q+ + L
Sbjct: 190 KTMLSGEGFV-MTFSGPGRVFLQTRDYDSL 218
>gi|290559433|gb|EFD92765.1| protein of unknown function DUF124 [Candidatus Parvarchaeum
acidophilus ARMAN-5]
Length = 220
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 57/231 (24%), Positives = 103/231 (44%), Gaps = 21/231 (9%)
Query: 22 VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
++ F +G E Q Q L E ++ G M F S ++ME G++ L
Sbjct: 1 MVNFTEVGKEIQYIQATLNEGESIMCEGGHMIFKSPDIKMET-----QSGGIFSALSRAM 55
Query: 82 ITSVV--LRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVS 136
S V L+ GP G+ A + LP I++ + G ++ + ++FLC + V S
Sbjct: 56 AGSSVFLLKLDGP-----GVVA--FSGFLPGEIINIPLQGKGIIAEFNSFLCMDSTVTYS 108
Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
+ G EG ++ +G G F+ G ++ +L+ GE I V+ S ++A +
Sbjct: 109 AKF-AGTWAGLLGGEGLFLERFSGNGNVFLHCHGQPIEFSLKEGESIDVEASHVLAFEDT 167
Query: 197 VNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
IK G ++ + G+ A +TGPG V++ S+ +L+ ++
Sbjct: 168 AQYSIKRVGGVKTMLLAGMEGEGFFFATITGPGKVWLHSISLAQLAMKMGN 218
>gi|431930111|ref|YP_007243157.1| hypothetical protein Thimo_0688 [Thioflavicoccus mobilis 8321]
gi|431828414|gb|AGA89527.1| TIGR00266 family protein [Thioflavicoccus mobilis 8321]
Length = 240
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 92/205 (44%), Gaps = 7/205 (3%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
G + ++ L+PQ+ + A+ +M M ++++E G+ + G++ LR
Sbjct: 22 GAFALLEVHLQPQQTIKAQSDAMVAMDPTIDVEGKLEGGLLGGLGRMFSGESFFFQTLRA 81
Query: 90 PGPSDGFVGIAAPSL-ARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
S V + AP+ +LP+ L G + + Q D FL + +V++S R +
Sbjct: 82 ARGSG--VALLAPAQPGDLLPVHLD--GTQPYIIQKDGFLAASEEVQISTAAQNLARGIF 137
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
G EGF + G+G+ F + G++ + GE + VD +VA ++ ++
Sbjct: 138 SG-EGFFVLRAQGRGMLFAESYGAIHALEVPAGEEMVVDNGHLVAWPEGMHYSLEKASSG 196
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
+ F + GPG V+IQS
Sbjct: 197 WISSFTSGEGLVCRFRGPGTVYIQS 221
>gi|375308935|ref|ZP_09774217.1| hypothetical protein WG8_2742 [Paenibacillus sp. Aloe-11]
gi|390453566|ref|ZP_10239094.1| hypothetical protein PpeoK3_05980 [Paenibacillus peoriae KCTC 3763]
gi|375079147|gb|EHS57373.1| hypothetical protein WG8_2742 [Paenibacillus sp. Aloe-11]
Length = 226
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 96/205 (46%), Gaps = 7/205 (3%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
G + ++ L P E + A G+M MS S++++ G+ + L G+ LR
Sbjct: 9 GAFAMLKVQLSPGETIKAEMGAMVSMSSSVDIKGTVDGGLLRGLGRMLSGEKFFFQELRA 68
Query: 90 P-GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
GP++ + A+ + + +D + LL Q D FL ++VS + ++ +
Sbjct: 69 ARGPAEVLLAPASIGDVQAVELDGSY---RLLVQKDGFLACTEGIEVSTKMQNLMKGLFS 125
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPI 207
G EGF +++G+G F+ + G++ + GE +D + +VA ++ +I K +
Sbjct: 126 G-EGFFIVEISGRGTVFLSSYGAIYPLYVAPGEERIIDNAHLVAWPDYMDYKIEKASKGW 184
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
+V G+ LV G G V IQS
Sbjct: 185 LSSVTSGEALVCR-FRGEGTVLIQS 208
>gi|327400089|ref|YP_004340928.1| hypothetical protein Arcve_0174 [Archaeoglobus veneficus SNP6]
gi|327315597|gb|AEA46213.1| protein of unknown function DUF124 [Archaeoglobus veneficus SNP6]
Length = 223
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 93/200 (46%), Gaps = 10/200 (5%)
Query: 33 QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
+ ++ L+ E++VA G+M +M ++E++ G+ + L G + V R
Sbjct: 12 SLLEVELEDGEEIVAETGAMVYMR-NVELKTEVKGGLLAGLKRSLLGGE-SFFVNRFISK 69
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
G +G+A P ++ I L G + Q AFL S D+ + +T R G EG
Sbjct: 70 GKGLLGLAPPYQGDVIHIPLN---GRIYAQSGAFLASSPDISI-DTKWGGARTFFAG-EG 124
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
KL G G + + G + + ++V + VD IVA ++ +++ G ++ +
Sbjct: 125 LFLLKLDGLGDVLLSSFGGI--ERIDVNGSLIVDTGHIVAFEDTLAFKVRKAGGLKATLL 182
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
G+ L+ A G G V+IQ+
Sbjct: 183 SGEGLI-AEFEGEGSVWIQT 201
>gi|304405138|ref|ZP_07386798.1| protein of unknown function DUF124 [Paenibacillus curdlanolyticus
YK9]
gi|304346017|gb|EFM11851.1| protein of unknown function DUF124 [Paenibacillus curdlanolyticus
YK9]
Length = 225
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 56/203 (27%), Positives = 94/203 (46%), Gaps = 11/203 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNP 90
+ ++ L+ E V A G+M MS S+++ G+ + L G+ V R
Sbjct: 13 MLKVKLERGESVKAEMGAMVSMSPSIDLTGTVDGGIMRGLGRMLSGEKFFFQELVAARGG 72
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
G V +A S+ I ++L +LL Q D FL + ++V+ + R ++ G
Sbjct: 73 GE----VLLAPTSMGDIQAVELDG-SYKLLVQKDGFLAGTSGIEVNTKMQNLSRGLLSG- 126
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EGF +++G+G FI + GS+ NL GE + VD +VA V I K +
Sbjct: 127 EGFFIVEISGRGTVFISSYGSIHAINLAPGEEVIVDNGHLVAWPDYVQYNIEKASKGWLN 186
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
++ G+ +V G G+V IQ+
Sbjct: 187 SIKSGEGVVCR-FRGEGVVLIQT 208
>gi|336265228|ref|XP_003347387.1| hypothetical protein SMAC_08357 [Sordaria macrospora k-hell]
gi|380093212|emb|CCC08870.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 453
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 85/186 (45%), Gaps = 11/186 (5%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
A+PG+M MS S+ ++ Y + M + + G + + PG + +G A
Sbjct: 262 AKPGTMIAMSPSVSLKGAY----KFSMKKLVAGGEMGTSTFTGPG--ELLLGPAMLGDIT 315
Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
L +D + D +L S V + + Q + + EG K++G GL ++
Sbjct: 316 SLRLDGSQ---TWSVSHDGYLASTQHV-IKDYKRQGLGKAMFSGEGLWVYKISGTGLLWL 371
Query: 167 LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPG 226
+ G++++K+L GE VD +VA + ++ +G + G+ LV TGPG
Sbjct: 372 TSFGAIIRKDLTDGEKYIVDNGHLVAWNTKYILERVASGGMLSGFASGEGLVCK-FTGPG 430
Query: 227 IVFIQS 232
VFIQ+
Sbjct: 431 TVFIQT 436
>gi|326483972|gb|EGE07982.1| DUF124 domain-containing protein [Trichophyton equinum CBS 127.97]
Length = 335
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 90/189 (47%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
++A+PG+M MS ++ ++ + M + L G ++S PG + APS+
Sbjct: 138 LIAKPGAMIAMSPTVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 188
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ I L + G E DAFL V V + +Q + EG K++G G+
Sbjct: 189 LGDISI-LRLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 246
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +V+ ++ +G + + G+ LV T
Sbjct: 247 LFVQSFGAIIRKDLVDGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 305
Query: 224 GPGIVFIQS 232
GPG +++Q+
Sbjct: 306 GPGTIYLQT 314
>gi|226292711|gb|EEH48131.1| DUF124 domain-containing protein [Paracoccidioides brasiliensis
Pb18]
Length = 328
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 95/203 (46%), Gaps = 18/203 (8%)
Query: 32 AQVCQIMLKPQEKVVARP-GSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP 90
A C + KP+ A P G+M MS +M ++ + M + + G +++ P
Sbjct: 122 AMGCPLTAKPE----AHPTGAMIAMSPTMTLKGSV----KFSMKKLIAGGEMSTSTYTGP 173
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
G + AP++ + + L + G E DA+L S V + + +Q +
Sbjct: 174 GEL-----LLAPTMLGDISV-LRLSGHETWSVGKDAYLSSTQGV-IKDYKNQGLSKAFFS 226
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
EG K++G GL F+ + G++++K+L GE +D +VA + ++ +G I
Sbjct: 227 GEGLFVYKISGNGLLFLQSFGAIIKKDLADGEKYNIDNGHLVAWSCKYVMERVASGGIIS 286
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+ G+ LV TGPG V++Q+
Sbjct: 287 GMTSGEGLVCK-FTGPGSVYMQT 308
>gi|327296604|ref|XP_003232996.1| hypothetical protein TERG_05993 [Trichophyton rubrum CBS 118892]
gi|326464302|gb|EGD89755.1| hypothetical protein TERG_05993 [Trichophyton rubrum CBS 118892]
Length = 331
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 89/190 (46%), Gaps = 15/190 (7%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
++A+PG+M MS ++ ++ + M + L G ++S PG + APS+
Sbjct: 134 LIAKPGAMIAMSPTVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 184
Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
+ I L G+ DAFL V V + +Q + EG K++G G
Sbjct: 185 --LGDISLLRLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTG 241
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
+ F+ + G++++K+L GE VD +V+ ++ +G + + G+ LV
Sbjct: 242 ILFVQSFGAIIRKDLVDGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-F 300
Query: 223 TGPGIVFIQS 232
TGPG +++Q+
Sbjct: 301 TGPGTIYLQT 310
>gi|363890242|ref|ZP_09317581.1| hypothetical protein HMPREF9628_02023 [Eubacteriaceae bacterium
CM5]
gi|361965849|gb|EHL18817.1| hypothetical protein HMPREF9628_02023 [Eubacteriaceae bacterium
CM5]
Length = 225
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 67/140 (47%), Gaps = 4/140 (2%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
DG V + APSL + ID+ + G L Q D FL ++V + + + G EG
Sbjct: 72 DGEV-LLAPSLLGDV-IDVELDGTYSLRVQKDGFLAGTQGIEVDTKMQNLSKGLFSG-EG 128
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
F ++G+G F+ + GS+ NLE G+ I +D + +VA + +I+ + F
Sbjct: 129 FFILNVSGKGTVFLSSYGSIHSINLEAGQEIVIDNNHLVAWPDHMEYKIEKAASGWISSF 188
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
++ GPG + IQ+
Sbjct: 189 TSGEMLVCRFRGPGNILIQT 208
>gi|297617354|ref|YP_003702513.1| hypothetical protein Slip_1175 [Syntrophothermus lipocalidus DSM
12680]
gi|297145191|gb|ADI01948.1| protein of unknown function DUF124 [Syntrophothermus lipocalidus
DSM 12680]
Length = 226
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 55/202 (27%), Positives = 90/202 (44%), Gaps = 16/202 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI--TSVVLRNPGPSD 94
+ L P E + A PG+M MS ++ ++ E + G L K + S + N ++
Sbjct: 16 VKLNPGEGIQAEPGAMVSMSPTISLKT----EMKGGFMGALKRKVLGGESFFVSNY-VAE 70
Query: 95 GFVGIAAPSLARILPID---LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
G G +LA +P D L + Q ++L +++ G E
Sbjct: 71 G--GSGEVTLAPAVPGDVMGLVLNNETYFVQSGSYLAGDTSLQIDTKFGGAKTFFAG--E 126
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP-IRRA 210
G K++G G+ + + G++++K L GE VD S IVA S+VN IK I
Sbjct: 127 GLFLLKISGSGMLILTSYGAIMEKKLVPGERYVVDTSHIVAFQSTVNYSIKKAAAGIMST 186
Query: 211 VFGGDNLVTAVVTGPGIVFIQS 232
+ G+ LV GPG +F+Q+
Sbjct: 187 ITSGEGLVCE-YEGPGTIFLQT 207
>gi|363894754|ref|ZP_09321822.1| hypothetical protein HMPREF9629_00289 [Eubacteriaceae bacterium
ACC19a]
gi|361961549|gb|EHL14740.1| hypothetical protein HMPREF9629_00289 [Eubacteriaceae bacterium
ACC19a]
Length = 225
Score = 50.1 bits (118), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 67/140 (47%), Gaps = 4/140 (2%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
DG V + APSL + ID+ + G L Q D FL ++V + + + G EG
Sbjct: 72 DGEV-LLAPSLLGDV-IDVELDGTYSLRVQKDGFLAGTQGIEVDTKMQNLSKGLFSG-EG 128
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
F ++G+G F+ + GS+ NLE G+ I +D + +VA + +I+ + F
Sbjct: 129 FFILNVSGKGTVFLSSYGSIHSINLEAGQEIVIDNNHLVAWPDHMEYKIEKAASGWISSF 188
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
++ GPG + IQ+
Sbjct: 189 TSGEMLVCRFRGPGNILIQT 208
>gi|392867521|gb|EAS29247.2| TIGR00266 family protein [Coccidioides immitis RS]
Length = 345
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + + + L G ++S PG + APS
Sbjct: 150 LTAKPGAMIAMSPTMTLKG----SVKFSVKKMLAGGEMSSSTYTGPGEL-----LLAPS- 199
Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
+ I L G+ DAFL + V + Q + EG ++G G
Sbjct: 200 -ALGDISLLRLNGQEQWFVGKDAFLACTQGI-VKDYKRQGLGKAFFSGEGLYIYTVSGTG 257
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
+ FI + G++++K+L GE VD +VA T + ++ +G I + G+ LV
Sbjct: 258 ILFIQSFGAIIKKDLVAGEKYIVDNGHLVAWTCNYVMERVASGGIMSGLASGEGLVCK-F 316
Query: 223 TGPGIVFIQSLPFHRLSQRIARAV 246
TGPG +++Q+ R +Q A V
Sbjct: 317 TGPGTIYMQT----RNAQAFAHWV 336
>gi|119177498|ref|XP_001240515.1| hypothetical protein CIMG_07678 [Coccidioides immitis RS]
Length = 415
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + + + L G ++S PG + APS
Sbjct: 150 LTAKPGAMIAMSPTMTLKG----SVKFSVKKMLAGGEMSSSTYTGPGEL-----LLAPSA 200
Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
+ I L G+ DAFL + V + Q + EG ++G G
Sbjct: 201 --LGDISLLRLNGQEQWFVGKDAFLACTQGI-VKDYKRQGLGKAFFSGEGLYIYTVSGTG 257
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
+ FI + G++++K+L GE VD +VA T + ++ +G I + G+ LV
Sbjct: 258 ILFIQSFGAIIKKDLVAGEKYIVDNGHLVAWTCNYVMERVASGGIMSGLASGEGLVCK-F 316
Query: 223 TGPGIVFIQSLPFHRLSQRIARAV 246
TGPG +++Q+ R +Q A V
Sbjct: 317 TGPGTIYMQT----RNAQAFAHWV 336
>gi|398339048|ref|ZP_10523751.1| hypothetical protein LkirsB1_05502 [Leptospira kirschneri serovar
Bim str. 1051]
gi|418676821|ref|ZP_13238099.1| TIGR00266 family protein [Leptospira kirschneri serovar
Grippotyphosa str. RM52]
gi|418688414|ref|ZP_13249570.1| TIGR00266 family protein [Leptospira kirschneri serovar
Grippotyphosa str. Moskva]
gi|418739685|ref|ZP_13296066.1| TIGR00266 family protein [Leptospira kirschneri serovar Valbuzzi
str. 200702274]
gi|400322721|gb|EJO70577.1| TIGR00266 family protein [Leptospira kirschneri serovar
Grippotyphosa str. RM52]
gi|410737271|gb|EKQ82013.1| TIGR00266 family protein [Leptospira kirschneri serovar
Grippotyphosa str. Moskva]
gi|410752807|gb|EKR09779.1| TIGR00266 family protein [Leptospira kirschneri serovar Valbuzzi
str. 200702274]
Length = 221
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 53/200 (26%), Positives = 95/200 (47%), Gaps = 10/200 (5%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPS 93
++ L P + + A G+M +M+ +++E + + FG ++ V + P +
Sbjct: 14 LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKAP-VA 72
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G +G+A ++ +DL G L+ + A+L S + + +R+ +GG E F
Sbjct: 73 GGEIGLAPELPGDVVGVDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFLGGEEVF 129
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVF 212
L + + G G F+ A G +V ++V T+D IVA S+ +I K G + F
Sbjct: 130 LLEAI-GVGKLFLNAYGGIVP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFF 186
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
GG+ LV +G G V IQ+
Sbjct: 187 GGEGLVME-FSGHGRVLIQT 205
>gi|58338207|ref|YP_194792.1| hypothetical protein LBA1968 [Lactobacillus acidophilus NCFM]
gi|227902606|ref|ZP_04020411.1| protein of hypothetical function DUF124 [Lactobacillus acidophilus
ATCC 4796]
gi|58255524|gb|AAV43761.1| hypothetical protein LBA1968 [Lactobacillus acidophilus NCFM]
gi|227869695|gb|EEJ77116.1| protein of hypothetical function DUF124 [Lactobacillus acidophilus
ATCC 4796]
Length = 237
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/208 (25%), Positives = 96/208 (46%), Gaps = 14/208 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS-----VV 86
+ I L+P EKV + GSM + S S+++ N + G+ + + G++ITS +
Sbjct: 18 LVDIELEPGEKVFIQRGSMVYHSTSIKL-NTKVNGTGSGIGKLIRAVGRSITSGESFWIT 76
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV-RN 145
+ G + IA +LP+ L G E D +++ + Q V R
Sbjct: 77 EAQASRTTGKLAIAPALPGEVLPLKL---GKEQYRINDGKFLAMDGSASYSMKKQSVGRA 133
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
+ G GF +G+G A GS+ + L E IT+D + +VA ++ ++ I ++
Sbjct: 134 IFSGTGGFFVMTTSGEGTVLCNAYGSIKKITLNNDE-ITIDNNHVVAWSTGLDYDIHFDN 192
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
+++ G+ +V G G V+IQSL
Sbjct: 193 GFIQSIGTGEGIVN-TFKGSGEVYIQSL 219
>gi|320032100|gb|EFW14056.1| hypothetical protein CPSG_09423 [Coccidioides posadasii str.
Silveira]
Length = 345
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + + + L G ++S PG + APS
Sbjct: 150 LTAKPGAMIAMSPTMTLKG----SVKFSVKKMLAGGEMSSSTYTGPGEL-----LLAPS- 199
Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
+ I L G+ DAFL + V + Q + EG ++G G
Sbjct: 200 -ALGDISLLRLNGQEQWFVGKDAFLACTQGI-VKDYKRQGLGKAFFSGEGLFIYTVSGTG 257
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
+ FI + G++++K+L GE VD +VA T + ++ +G I + G+ LV
Sbjct: 258 ILFIQSFGAIIKKDLVAGEKYIVDNGHLVAWTCNYVMERVASGGIMSGLASGEGLVCK-F 316
Query: 223 TGPGIVFIQSLPFHRLSQRIARAV 246
TGPG +++Q+ R +Q A V
Sbjct: 317 TGPGTIYMQT----RNAQAFAHWV 336
>gi|24215002|ref|NP_712483.1| hypothetical protein LA_2302 [Leptospira interrogans serovar Lai
str. 56601]
gi|45657507|ref|YP_001593.1| hypothetical protein LIC11635 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|386074341|ref|YP_005988658.1| hypothetical protein LIF_A1880 [Leptospira interrogans serovar Lai
str. IPAV]
gi|417762665|ref|ZP_12410653.1| TIGR00266 family protein [Leptospira interrogans str. 2002000624]
gi|417772368|ref|ZP_12420257.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|417777699|ref|ZP_12425516.1| TIGR00266 family protein [Leptospira interrogans str. 2002000621]
gi|418667879|ref|ZP_13229284.1| TIGR00266 family protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|418671690|ref|ZP_13233039.1| TIGR00266 family protein [Leptospira interrogans str. 2002000623]
gi|418680947|ref|ZP_13242184.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|418698166|ref|ZP_13259145.1| TIGR00266 family protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|418704174|ref|ZP_13265053.1| TIGR00266 family protein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|418711824|ref|ZP_13272576.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|418717138|ref|ZP_13276975.1| TIGR00266 family protein [Leptospira interrogans str. UI 08452]
gi|418730607|ref|ZP_13289101.1| TIGR00266 family protein [Leptospira interrogans str. UI 12758]
gi|421085922|ref|ZP_15546773.1| TIGR00266 family protein [Leptospira santarosai str. HAI1594]
gi|421102770|ref|ZP_15563374.1| TIGR00266 family protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|421117563|ref|ZP_15577923.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|421120876|ref|ZP_15581181.1| TIGR00266 family protein [Leptospira interrogans str. Brem 329]
gi|421123904|ref|ZP_15584174.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|421134974|ref|ZP_15595104.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|24196047|gb|AAN49501.1|AE011400_6 conserved hypothetical protein [Leptospira interrogans serovar Lai
str. 56601]
gi|45600746|gb|AAS70230.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|353458130|gb|AER02675.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
str. IPAV]
gi|400327385|gb|EJO79637.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|409941410|gb|EKN87039.1| TIGR00266 family protein [Leptospira interrogans str. 2002000624]
gi|409945739|gb|EKN95754.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|410010897|gb|EKO69028.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|410020857|gb|EKO87652.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|410346214|gb|EKO97224.1| TIGR00266 family protein [Leptospira interrogans str. Brem 329]
gi|410367884|gb|EKP23268.1| TIGR00266 family protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410431487|gb|EKP75847.1| TIGR00266 family protein [Leptospira santarosai str. HAI1594]
gi|410438391|gb|EKP87477.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|410572686|gb|EKQ35751.1| TIGR00266 family protein [Leptospira interrogans str. 2002000621]
gi|410581388|gb|EKQ49200.1| TIGR00266 family protein [Leptospira interrogans str. 2002000623]
gi|410756324|gb|EKR17949.1| TIGR00266 family protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|410762870|gb|EKR29029.1| TIGR00266 family protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|410766154|gb|EKR36842.1| TIGR00266 family protein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|410767790|gb|EKR43051.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|410774816|gb|EKR54820.1| TIGR00266 family protein [Leptospira interrogans str. UI 12758]
gi|410787205|gb|EKR80939.1| TIGR00266 family protein [Leptospira interrogans str. UI 08452]
gi|455667356|gb|EMF32677.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Fox 32256]
gi|455792658|gb|EMF44398.1| TIGR00266 family protein [Leptospira interrogans serovar Lora str.
TE 1992]
gi|456821914|gb|EMF70420.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
str. LT1962]
gi|456969815|gb|EMG10731.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. LT2186]
Length = 221
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 88/199 (44%), Gaps = 8/199 (4%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
++ L P + + A G+M +M+ +++E + + FG + +
Sbjct: 14 LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKASVTG 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G +G+A ++ IDL G L+ + A+L S + + +R+ GG EG
Sbjct: 74 GEIGLAPELPGDVVGIDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFFGG-EGIF 129
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
+ G G F+ A G ++ ++V T+D IVA S+ +I K G + FG
Sbjct: 130 LLEAFGNGKLFLNAYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFG 187
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205
>gi|448731397|ref|ZP_21713697.1| hypothetical protein C449_16508 [Halococcus saccharolyticus DSM
5350]
gi|445792150|gb|EMA42762.1| hypothetical protein C449_16508 [Halococcus saccharolyticus DSM
5350]
Length = 233
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/232 (25%), Positives = 98/232 (42%), Gaps = 23/232 (9%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD-G 95
+ L E V+A PG+M S S+ +E + V + + G ++ R S+ G
Sbjct: 16 VELAAGETVMAEPGAMVTHSPSVSIETESSRDGLVSSAKSMLGGE-SAFANRFTAESEPG 74
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
V ++ P+ + +L GE L D A+L S D+ + + GG+E L
Sbjct: 75 TVTLSPPTPGDVHHHELD---GETLYAVDGAYLASDPDIDIDSEF--------GGLESML 123
Query: 155 RQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
L G G FI A G + L+ GE VD +VA SV + G ++
Sbjct: 124 SGAGLTPLALKGTGDVFIEAFGGLETIALDHGESHVVDNDHVVAWEGSVEFDARRVGGLK 183
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQ 260
+ G+ +V TGPG V+ Q+ ++ I A+ P ++ F V+
Sbjct: 184 STLLSGEGIVMD-FTGPGTVWYQTRGLDTFTEEIMEAM--PGTGDDGGFEVE 232
>gi|254416762|ref|ZP_05030512.1| conserved hypothetical protein TIGR00266 [Coleofasciculus
chthonoplastes PCC 7420]
gi|196176502|gb|EDX71516.1| conserved hypothetical protein TIGR00266 [Coleofasciculus
chthonoplastes PCC 7420]
Length = 220
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 94/201 (46%), Gaps = 16/201 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSM----EMENIYIPENEVGMWQWLF-GKTITSVVLRNPG 91
+ L P +++ A G+M M G + E +IP G+ + F G+++ N
Sbjct: 16 LTLNPGDRITAEAGAMTSMDGKLTVKTEFSGGFIP----GLLKKFFGGESLFVNTFHNQT 71
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
+ + ++ I+ +DL G L QP A++C + VK+ +++ G E
Sbjct: 72 QQPLTLVLTQSTIGDIVGVDLR--GNSLCFQPGAYICHTSGVKLGVRW-AGLKSWFSG-E 127
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G + ++ GQG F A G + +K + G+ + VD +VA ++ + + G + ++
Sbjct: 128 GLFKLQVKGQGKVFFGAYGGLTKKRVN-GDFV-VDTGHLVAYEPTIKMNVGLAGGLLGSM 185
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ + +TG G +++QS
Sbjct: 186 TSGEGFINR-LTGTGEIYLQS 205
>gi|302511745|ref|XP_003017824.1| hypothetical protein ARB_04708 [Arthroderma benhamiae CBS 112371]
gi|291181395|gb|EFE37179.1| hypothetical protein ARB_04708 [Arthroderma benhamiae CBS 112371]
Length = 333
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 89/189 (47%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS ++ ++ + M + L G ++S PG + APS+
Sbjct: 136 LTAKPGAMIAMSPTVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 186
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ I L + G E DAFL V V + +Q + EG K++G G+
Sbjct: 187 LGDISI-LRLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 244
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +V+ ++ +G + + G+ LV T
Sbjct: 245 LFVQSFGAIIRKDLVDGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 303
Query: 224 GPGIVFIQS 232
GPG +++Q+
Sbjct: 304 GPGTIYLQT 312
>gi|159488167|ref|XP_001702091.1| hypothetical protein CHLREDRAFT_181241 [Chlamydomonas reinhardtii]
gi|158269539|gb|EDO95908.1| predicted protein [Chlamydomonas reinhardtii]
Length = 240
Score = 49.7 bits (117), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 57/123 (46%), Gaps = 3/123 (2%)
Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
QP +L + V + + + GG EGF + G+G I + G++ + +L GE
Sbjct: 115 QPGGYLACDHSVNIGVKMLDIAQGCCGG-EGFFVMEANGRGRLLICSYGAITRYDLAPGE 173
Query: 182 VITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
+D VA TS + I K + V G+ LV A TGPG VF+Q+ L+
Sbjct: 174 KRKIDNGYCVAWTSGMEWNIAKAAKGLWNTVISGEGLV-ANFTGPGTVFVQTRSMQNLAN 232
Query: 241 RIA 243
+A
Sbjct: 233 ALA 235
>gi|303315955|ref|XP_003067982.1| hypothetical protein CPC735_042810 [Coccidioides posadasii C735
delta SOWgp]
gi|240107658|gb|EER25837.1| hypothetical protein CPC735_042810 [Coccidioides posadasii C735
delta SOWgp]
Length = 338
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + + + L G ++S PG + APS
Sbjct: 143 LTAKPGAMIAMSPTMTLKG----SVKFSVKKMLAGGEMSSSTYTGPGEL-----LLAPS- 192
Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
+ I L G+ DAFL + V + Q + EG ++G G
Sbjct: 193 -ALGDISLLRLNGQEQWFVGKDAFLACTQGI-VKDYKRQGLGKAFFSGEGLFIYTVSGTG 250
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
+ FI + G++++K+L GE VD +VA T + ++ +G I + G+ LV
Sbjct: 251 ILFIQSFGAIIKKDLVAGEKYIVDNGHLVAWTCNYVMERVASGGIMSGLASGEGLVCK-F 309
Query: 223 TGPGIVFIQSLPFHRLSQRIARAV 246
TGPG +++Q+ R +Q A V
Sbjct: 310 TGPGTIYMQT----RNAQAFAHWV 329
>gi|427420414|ref|ZP_18910597.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
gi|425763127|gb|EKV03980.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
Length = 225
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/207 (26%), Positives = 92/207 (44%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
++ + +I + PQE+++A G+M MSG S + G+ + L G+++ V
Sbjct: 10 DSAIAKIFMDPQEELIAEAGAMVAMSGFVNVSTTLRKGKGGGVFGGLKRMLAGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+ G S + +A + ILP L EL+ Q ++ S DV + +++
Sbjct: 70 FRS-GNSSAELYLAPKLIGDILPYTLKE--TELVVQSTGYMASTPDVDIDLGF-TGFKSL 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
G E +G GL + A G + + ++V VD IVA +++ I K N
Sbjct: 126 FSG-ESLFWLTASGSGLLLLSAFGGIYE--IDVDSEYVVDTGHIVAFERTLDYSISKANK 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
A FGG+ V G G VF Q+
Sbjct: 183 SWAGAFFGGEGFVCQ-FRGRGKVFCQT 208
>gi|242823093|ref|XP_002488022.1| DUF124 domain protein [Talaromyces stipitatus ATCC 10500]
gi|218712943|gb|EED12368.1| DUF124 domain protein [Talaromyces stipitatus ATCC 10500]
Length = 370
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 81/187 (43%), Gaps = 12/187 (6%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPS-LA 105
ARPG M MS ++ ++ + M + L G + PG + APS L
Sbjct: 178 ARPGVMIAMSPTVTLKGNF----HFSMKKLLIGGEMNHSTYTGPGEI-----LLAPSVLG 228
Query: 106 RILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
I + L G DAFL + VK + + G EG+ +TG GL +
Sbjct: 229 DITVLRLTDDGQTWKVGRDAFLANTASVKKKYKAQNLSKTLFSG-EGWFVYNITGPGLLW 287
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
+ + G+V++K L GE VD +VA ++ +G + ++ G+ L GP
Sbjct: 288 LQSFGAVIKKELVEGESYYVDNGHLVAWNCEYIIERVASGGMISSLSSGEGLA-CRFKGP 346
Query: 226 GIVFIQS 232
G V++Q+
Sbjct: 347 GTVYLQT 353
>gi|367036511|ref|XP_003648636.1| hypothetical protein THITE_2106335 [Thielavia terrestris NRRL 8126]
gi|346995897|gb|AEO62300.1| hypothetical protein THITE_2106335 [Thielavia terrestris NRRL 8126]
Length = 381
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 55/109 (50%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
D +L S +V L + + G EG K+TG GL ++ + G++++K+L GE
Sbjct: 258 DGYLASTENVVKDYKLQGLSKAMFSG-EGLWVYKITGTGLVWLTSFGAIIRKDLAEGEKY 316
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD +VA + ++ +G I + + LV TGPG VFIQ+
Sbjct: 317 IVDNGHLVAWNTKYVLERVASGGIISGLASAEGLVCK-FTGPGTVFIQT 364
>gi|354617112|ref|ZP_09034609.1| protein of unknown function DUF124 [Saccharomonospora
paurometabolica YIM 90007]
gi|353218537|gb|EHB83279.1| protein of unknown function DUF124 [Saccharomonospora
paurometabolica YIM 90007]
Length = 221
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 92/216 (42%), Gaps = 30/216 (13%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V ++ L PQE V G M S + M++ G+ + + G V P+
Sbjct: 9 VARLALAPQEPVQVEAGGMVATSYGVHMQSQSQGGLMKGLGRAVLGGESLFVSTFTAPPN 68
Query: 94 DGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN----TLDQR---VR 144
G+V +A P RI+ +D A C N ++N TLD +
Sbjct: 69 GGWVDVAPGLPGDVRIVEMD-----------GRAGWCVTNGSWLANSHGITLDTKWGGFG 117
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
N+ GG GFL TGQG + G++ L+ GE+ITVD +VA +V Q
Sbjct: 118 NLFGGEGGFLSHA-TGQGQLVVACYGAMDLVTLQQGEMITVDAGHVVAYADTVQHQ---- 172
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
RR G +V ++ +G G+VF + P L+Q
Sbjct: 173 --TRRISSG---MVQSLKSGEGLVFDFAGPGQVLTQ 203
>gi|453085936|gb|EMF13978.1| DUF124-domain-containing protein [Mycosphaerella populorum SO2202]
Length = 358
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 89/199 (44%), Gaps = 10/199 (5%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ +PG+M MS ++ ++ + M + L ++ PG + +A +L
Sbjct: 164 ITVKPGAMIAMSPTITVKGSV----KFSMKKLLARGEMSHTTFTGPGE----LLLAPFAL 215
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
I I L+ G DAFL V + + Q + I EG KL G G+
Sbjct: 216 GDITNIRLSGKGETWNIGKDAFLACTQGV-TKDYVSQALSKAIFSGEGLFVYKLGGAGIC 274
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
+I + G++++K+L GE +D +VA + ++ +G I + G+ LV +G
Sbjct: 275 WIQSMGAIIRKDLREGEKYIIDNGHLVAWNCNYVLERVASGGIISNMSAGEGLVCK-FSG 333
Query: 225 PGIVFIQSLPFHRLSQRIA 243
PG VF+Q+ SQ +A
Sbjct: 334 PGTVFMQTRNPAAFSQYMA 352
>gi|310642467|ref|YP_003947225.1| hypothetical protein [Paenibacillus polymyxa SC2]
gi|386041537|ref|YP_005960491.1| hypothetical protein PPM_2847 [Paenibacillus polymyxa M1]
gi|309247417|gb|ADO56984.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2]
gi|343097575|emb|CCC85784.1| conserved hypothetical protein [Paenibacillus polymyxa M1]
Length = 226
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 9/206 (4%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
G + ++ L P E + A G+M MS S++++ G+ + L G+ LR
Sbjct: 9 GAFAMLKVQLSPGETIKAEMGAMVSMSSSVDIKGTVDGGLLRGLGRMLSGEKFFFQELRA 68
Query: 90 P-GPSDGFVGIAAPSLARILPIDL-AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
GP++ V +A S+ + ++L + LL Q D FL ++VS + ++ +
Sbjct: 69 ARGPAE--VLLAPASIGDVQAVELDGTY--RLLVQKDGFLACTEGIEVSTKMQNLMKGLF 124
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGP 206
G EGF +++G+G F+ + G++ + GE +D + +VA ++ +I K +
Sbjct: 125 SG-EGFFIVEISGRGTVFLSSYGAIHPIYIAPGEERIIDNAHLVAWPDYMDYRIEKASKG 183
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
+V G+ LV G G V IQS
Sbjct: 184 WLSSVTSGEALVCR-FRGEGTVLIQS 208
>gi|410940911|ref|ZP_11372710.1| TIGR00266 family protein [Leptospira noguchii str. 2006001870]
gi|410783470|gb|EKR72462.1| TIGR00266 family protein [Leptospira noguchii str. 2006001870]
Length = 221
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 90/199 (45%), Gaps = 8/199 (4%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
++ L P + + A G+M +M+ +++E + + FG + +
Sbjct: 14 LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKATATG 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G +G+A ++ +DL G L+ + A+L S + + +R+ +GG EG
Sbjct: 74 GEIGLAPELPGDLVGVDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFLGG-EGIF 129
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
+ G G F+ + G ++ ++V T+D IVA S+ +I K G + +FG
Sbjct: 130 LLEAIGNGKLFLNSYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTLFG 187
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205
>gi|145233977|ref|XP_001400361.1| hypothetical protein ANI_1_3122024 [Aspergillus niger CBS 513.88]
gi|134057301|emb|CAK44500.1| unnamed protein product [Aspergillus niger]
Length = 361
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 79/188 (42%), Gaps = 11/188 (5%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ ARPG M MS SM + G + L G + PG + APS+
Sbjct: 168 IQARPGIMIAMSPSMSLRGSL----SFGWMKALAGGQLARSTYTGPGEL-----LLAPSI 218
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
+ I E DAFL V ++ ++ V G EGF+ K TG GL
Sbjct: 219 LGDVTIIRLDGHTEWTVGKDAFLACTVGVNINFKTQGLMKGVFSG-EGFIVFKFTGSGLV 277
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
++ + G++V+K L E V+ +VA ++ +G I G+ L TG
Sbjct: 278 WMQSFGAIVKKELADDETYYVNNGHLVAWNCRYKIERVASGGIISNWSAGEGLA-CRFTG 336
Query: 225 PGIVFIQS 232
PG V++Q+
Sbjct: 337 PGTVYMQT 344
>gi|418692288|ref|ZP_13253366.1| TIGR00266 family protein [Leptospira interrogans str. FPW2026]
gi|400357521|gb|EJP13641.1| TIGR00266 family protein [Leptospira interrogans str. FPW2026]
Length = 221
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 87/199 (43%), Gaps = 8/199 (4%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
++ L P + + A G+M +M+ +++E + + FG + +
Sbjct: 14 LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKASVTG 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G +G A ++ IDL G L+ + A+L S + + +R+ GG EG
Sbjct: 74 GEIGFAPELPGDVVGIDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFFGG-EGIF 129
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
+ G G F+ A G ++ ++V T+D IVA S+ +I K G + FG
Sbjct: 130 LLEAFGNGKLFLNAYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFG 187
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205
>gi|322710039|gb|EFZ01614.1| DUF124 domain-containing protein [Metarhizium anisopliae ARSEF 23]
Length = 413
Score = 49.3 bits (116), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/186 (26%), Positives = 89/186 (47%), Gaps = 11/186 (5%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
A+PG+M MS S+ ++ E + + + + G +T+ PG + +A P L
Sbjct: 222 AKPGAMIAMSPSITLKG----ELKFSVKKMIAGADMTTSRYVGPG----ELLLAPPMLGD 273
Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
I I L ++ DAFL + + + + Q + I EG +++G G+ ++
Sbjct: 274 ITSIRLTGQESWVVGH-DAFLAATQGI-IKDHKRQGLGKAIFSGEGLFVYRISGTGIMWL 331
Query: 167 LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPG 226
+ G++++K+L GE VD +VA + ++ +G I + + LV TGPG
Sbjct: 332 TSFGAIIRKDLVDGEKYIVDNGHLVAWNTKYVLERVASGGIISNMASAEGLVCK-FTGPG 390
Query: 227 IVFIQS 232
VFIQ+
Sbjct: 391 TVFIQT 396
>gi|433775565|ref|YP_007306032.1| hypothetical protein Mesau_04318 [Mesorhizobium australicum
WSM2073]
gi|433667580|gb|AGB46656.1| hypothetical protein Mesau_04318 [Mesorhizobium australicum
WSM2073]
Length = 232
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 38/146 (26%), Positives = 69/146 (47%), Gaps = 6/146 (4%)
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G + + I+PI L G E+ + FL + ++V + + + ++ G GF
Sbjct: 84 GIIAFSRDGPGHIVPIHLRR-GEEIQVREHQFLAATSNVDYTFERVRGLATMLFGQSGFF 142
Query: 155 RQKL---TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
+ TG G+ ++ G+V +K L GE+I ++ + +SV ++ K + +
Sbjct: 143 IDRFRGDTGDGIVWLHGYGNVFEKTLAPGEIIDIEPGGWLFKDASVRMETKID-RLSSGF 201
Query: 212 FGGD-NLVTAVVTGPGIVFIQSLPFH 236
FG + N + TGPG V IQS+ H
Sbjct: 202 FGANLNFIVNRFTGPGRVGIQSMYLH 227
>gi|372222877|ref|ZP_09501298.1| hypothetical protein MzeaS_11221, partial [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 76
Score = 48.9 bits (115), Expect = 0.002, Method: Composition-based stats.
Identities = 22/42 (52%), Positives = 30/42 (71%)
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I++ G I+ VFGG+ L A + GPG V+IQSLPF RL+ R+
Sbjct: 4 IEFVGGIKNTVFGGEGLFFATLRGPGTVYIQSLPFSRLAGRV 45
>gi|418721405|ref|ZP_13280582.1| TIGR00266 family protein [Leptospira borgpetersenii str. UI 09149]
gi|410742097|gb|EKQ90847.1| TIGR00266 family protein [Leptospira borgpetersenii str. UI 09149]
Length = 221
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
++ L P + + A G+M +M+ +++E + + FG ++ + + P G
Sbjct: 14 LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRKFFGGESFFFNIFKAPAQG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ +DL G L+ + A+L S V + + +R+ GG
Sbjct: 74 AEIGF----APELPGDVVGVDLTNTG--LIVEAGAYLASDETVTMKSKFGG-LRSFFGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG ++ G G F+ A G ++ ++V T+D +VA S+ +I K G +
Sbjct: 126 EGIFLLEILGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ LV TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205
>gi|326476653|gb|EGE00663.1| hypothetical protein TESG_07960 [Trichophyton tonsurans CBS 112818]
Length = 335
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 89/189 (47%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS ++ ++ + M + L G ++S PG + APS+
Sbjct: 138 LTAKPGAMIAMSPTVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 188
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ I L + G E DAFL V V + +Q + EG K++G G+
Sbjct: 189 LGDISI-LRLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 246
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +V+ ++ +G + + G+ LV T
Sbjct: 247 LFVQSFGAIIRKDLVDGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 305
Query: 224 GPGIVFIQS 232
GPG +++Q+
Sbjct: 306 GPGTIYLQT 314
>gi|159483657|ref|XP_001699877.1| hypothetical protein CHLREDRAFT_97437 [Chlamydomonas reinhardtii]
gi|158281819|gb|EDP07573.1| predicted protein [Chlamydomonas reinhardtii]
Length = 238
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 58/127 (45%), Gaps = 3/127 (2%)
Query: 117 GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKN 176
G + QP +L + V + + + GG EGF + G+G I + G++ + +
Sbjct: 92 GSWVVQPGGYLACDHSVSIGVKMLDLAQGCCGG-EGFFVMEANGRGRLLICSYGAITRYD 150
Query: 177 LEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
L GE +D VA TS + I K + V G+ LV A TGPG VF+Q+
Sbjct: 151 LAPGEKRKIDNGYCVAWTSGMQWNIAKAARGLWNTVISGEGLV-ANFTGPGTVFVQTRSM 209
Query: 236 HRLSQRI 242
L+ +
Sbjct: 210 QNLANAL 216
>gi|428207316|ref|YP_007091669.1| hypothetical protein Chro_2308 [Chroococcidiopsis thermalis PCC
7203]
gi|428009237|gb|AFY87800.1| protein of unknown function DUF124 [Chroococcidiopsis thermalis PCC
7203]
Length = 220
Score = 48.9 bits (115), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 93/200 (46%), Gaps = 14/200 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMWQWLFGKTITSVVLRNPGP 92
+ L P E + A G+M M + ++ + IP + ++L G+++ N
Sbjct: 16 VTLDPGESITAEAGAMTSMDAQLSIQTKFSGGLIPAL---LKKFLGGESLFVNTFTNRTQ 72
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
+ + ++ I +DL+ G++ QP AF+ VK+ +++ + G EG
Sbjct: 73 QSLRLVLTQSTIGDIGRLDLSQ--GDICLQPGAFVAHSAGVKMGLQW-AGIKSWLAG-EG 128
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
+ KL+G G AF G + +K ++ GE I VD +VA + + I G + +V
Sbjct: 129 LFKLKLSGSGRAFFGCYGGITKKQIQ-GEFI-VDSGHLVAYEPKIKMSIGLAGGLVGSVT 186
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
G+ LV + G G +++QS
Sbjct: 187 SGEGLVNRLA-GTGDIYLQS 205
>gi|172035645|ref|YP_001802146.1| hypothetical protein cce_0729 [Cyanothece sp. ATCC 51142]
gi|171697099|gb|ACB50080.1| hypothetical protein cce_0729 [Cyanothece sp. ATCC 51142]
Length = 227
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 96/201 (47%), Gaps = 16/201 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVLRNPG 91
+ L P E ++A G+M M G + M+ + G++ ++L G+++ V +N
Sbjct: 23 VTLAPGESIIAEAGAMASMDGRLSMKT----KLSGGLFSALLRKFLGGESLFVNVFKNQT 78
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
+ + + I ++L+ G E+ QP A++ S + + G E
Sbjct: 79 SQALTLVLTQSIIGDIECLELS--GNEMCFQPGAYIASSPKIDLGVRWAGLASWFAG--E 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G + K++G G F A G + ++ + GE I VD S +VA + +++ I +G + ++
Sbjct: 135 GLFKLKVSGHGKVFFGAYGGLSKQTI-TGEFI-VDNSHLVAYSPDISMSIGLSGGLISSM 192
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ LV + G G++++QS
Sbjct: 193 TSGEGLVNR-LKGHGVIYLQS 212
>gi|396493741|ref|XP_003844129.1| hypothetical protein LEMA_P017800.1 [Leptosphaeria maculans JN3]
gi|312220709|emb|CBY00650.1| hypothetical protein LEMA_P017800.1 [Leptosphaeria maculans JN3]
Length = 407
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 90/199 (45%), Gaps = 19/199 (9%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP---GPSDGFVGIAA 101
+ A+PGSM M+ ++ ++ G ++ K T + GP + + AA
Sbjct: 214 LTAKPGSMIAMTPTITLK---------GTMKFSLKKAFTGGEMSKSQYIGPGELLLAPAA 264
Query: 102 PSLARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTG 160
+ I I L+ G E C DAFL + V+ Q + + EG K++G
Sbjct: 265 --IGDITIIKLS--GKETWSCGHDAFLACTQGI-VTEYKSQGLSKAMFSGEGLFVYKISG 319
Query: 161 QGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTA 220
QG+ + + G++++K+L+ E +D +VA ++ +G I + G+ LV
Sbjct: 320 QGVLWCTSFGAIIRKDLQKDEKYIIDNGHLVAWNCKYVMERVASGGIMSNMMAGEGLVCK 379
Query: 221 VVTGPGIVFIQSLPFHRLS 239
TGPG +F+Q+ + S
Sbjct: 380 -FTGPGTIFMQTRNAQQFS 397
>gi|428304804|ref|YP_007141629.1| hypothetical protein Cri9333_1218 [Crinalium epipsammum PCC 9333]
gi|428246339|gb|AFZ12119.1| protein of unknown function DUF124 [Crinalium epipsammum PCC 9333]
Length = 221
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 89/198 (44%), Gaps = 9/198 (4%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSDG 95
+ L P E + A G+M M+G + M + + + FG +++ V N
Sbjct: 16 LSLSPGESITAEAGAMTSMNGQISMRTEFSGGLLSALLKKFFGGESLFVNVFTNNTQLPL 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
V + ++ I IDL GG +C QP A++ + + G EG
Sbjct: 76 EVVLTQSTIGDIACIDLR--GGREICFQPGAYIAHTPGINIGVQWAGFSSFFAG--EGLF 131
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
+ KL+GQG F A G + QK + GE I VD +VA + +++ G + ++ G
Sbjct: 132 KLKLSGQGRVFFGAYGGISQKQIN-GEFI-VDSGHLVAYDPGIKMKLGLAGGLIGSMTSG 189
Query: 215 DNLVTAVVTGPGIVFIQS 232
+ LV ++G G +++QS
Sbjct: 190 EGLVNR-LSGNGEIYLQS 206
>gi|395244439|ref|ZP_10421406.1| Putative uncharacterized protein [Lactobacillus hominis CRBIP
24.179]
gi|394483329|emb|CCI82414.1| Putative uncharacterized protein [Lactobacillus hominis CRBIP
24.179]
Length = 236
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 54/218 (24%), Positives = 94/218 (43%), Gaps = 12/218 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITS-----VVL 87
+ I L+ EKV GSM + S S+ + + + +G G++I+S +
Sbjct: 18 LVDIELEDGEKVYIERGSMVYHSTSINLNTKVNAAGSGIGKLMRAVGRSISSGESFWITE 77
Query: 88 RNPGPSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
G + IA PSL IL ++L + FL S R +
Sbjct: 78 AQAKGETGHLAIA-PSLPGEILHLELG--DKQYRINDGKFLAMDGSASYSMKKQTLGRAL 134
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
+ G GF +GQG A GS+ + L+ E IT+D + +VA ++ ++ I ++
Sbjct: 135 LSGTGGFFVMTTSGQGTVLCNAYGSIKKIVLDDDE-ITIDNNHVVAWSTDLSYDIHFDNG 193
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+++ G+ +V G G V+IQSL + R+ +
Sbjct: 194 FIQSIGTGEGVVN-TFKGSGEVYIQSLNIETFASRLDK 230
>gi|452983776|gb|EME83534.1| hypothetical protein MYCFIDRAFT_188507 [Pseudocercospora fijiensis
CIRAD86]
Length = 335
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 55/109 (50%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DAFL V + + Q + I EG KL G G+ +I + G++++K+L GE
Sbjct: 212 DAFLACTQGV-TRDYVSQNLSKAIFSGEGLFVYKLGGTGICWIQSMGAILRKDLREGEKY 270
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+D +VA + V+ +G I + G+ LV +GPG VF+Q+
Sbjct: 271 IIDNGHLVAWNCNYVVERVASGGIISNISAGEGLVCK-FSGPGTVFMQT 318
>gi|116328217|ref|YP_797937.1| hypothetical protein LBL_1541 [Leptospira borgpetersenii serovar
Hardjo-bovis str. L550]
gi|418736821|ref|ZP_13293220.1| TIGR00266 family protein [Leptospira borgpetersenii serovar
Castellonis str. 200801910]
gi|421096067|ref|ZP_15556774.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200801926]
gi|116120961|gb|ABJ79004.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
Hardjo-bovis str. L550]
gi|410360968|gb|EKP12014.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200801926]
gi|410747707|gb|EKR00612.1| TIGR00266 family protein [Leptospira borgpetersenii serovar
Castellonis str. 200801910]
gi|456888184|gb|EMF99177.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200701203]
Length = 221
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
++ L P + + A G+M +M+ +++E + + FG ++ + + P G
Sbjct: 14 LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRKFFGGESFFFNIFKAPAQG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ +DL G L+ + A+L S + + + +R+ GG
Sbjct: 74 AEIGF----APELPGDVVGVDLTNTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG ++ G G F+ A G ++ ++V T+D +VA S+ +I K G +
Sbjct: 126 EGIFLLEILGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ LV TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205
>gi|389641153|ref|XP_003718209.1| hypothetical protein MGG_11494 [Magnaporthe oryzae 70-15]
gi|351640762|gb|EHA48625.1| hypothetical protein MGG_11494 [Magnaporthe oryzae 70-15]
Length = 478
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/109 (31%), Positives = 56/109 (51%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
D +L S V V Q + + EG K++G+GL ++ + G++++K+L GE
Sbjct: 354 DGYLASTQGV-VKEYKRQGLGKAMFSGEGLWIYKISGKGLLWLTSFGAIIRKDLADGERY 412
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD +VA + ++ +G I V G+ LV TGPG VF+Q+
Sbjct: 413 VVDNGHLVAWNTKYILERVASGGIVSGVASGEGLV-CRFTGPGTVFLQT 460
>gi|399575836|ref|ZP_10769593.1| hypothetical protein HSB1_16320 [Halogranum salarium B-1]
gi|399238547|gb|EJN59474.1| hypothetical protein HSB1_16320 [Halogranum salarium B-1]
Length = 228
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 1/76 (1%)
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
KLTG G F+ + G+V + L GE VD IVA S +++ G ++ +F G+
Sbjct: 131 KLTGTGPTFLSSYGAVHEIELTEGEQYVVDTGHIVAFEESTTFEVRRVGGMKSTLFSGEG 190
Query: 217 LVTAVVTGPGIVFIQS 232
LV TGPG +++QS
Sbjct: 191 LVCE-FTGPGKLWLQS 205
>gi|354554886|ref|ZP_08974189.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
gi|353553040|gb|EHC22433.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
Length = 220
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/201 (23%), Positives = 96/201 (47%), Gaps = 16/201 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVLRNPG 91
+ L P E ++A G+M M G + M+ + G++ ++L G+++ V +N
Sbjct: 16 VTLAPGESIIAEAGAMASMDGRLSMKT----KLSGGLFSALLRKFLGGESLFVNVFKNQT 71
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
+ + + I ++L+ G E+ QP A++ S + + G E
Sbjct: 72 SQALTLVLTQSIIGDIECLELS--GNEMCFQPGAYIASSPKIDLGVRWAGLASWFAG--E 127
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G + K++G G F A G + ++ + GE I VD S +VA + +++ I +G + ++
Sbjct: 128 GLFKLKVSGHGKVFFGAYGGLSKQTI-TGEFI-VDNSHLVAYSPDISMSIGLSGGLISSM 185
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ LV + G G++++QS
Sbjct: 186 TSGEGLVNR-LKGHGVIYLQS 205
>gi|296811802|ref|XP_002846239.1| DUF124 domain-containing protein [Arthroderma otae CBS 113480]
gi|238843627|gb|EEQ33289.1| DUF124 domain-containing protein [Arthroderma otae CBS 113480]
Length = 319
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 48/189 (25%), Positives = 88/189 (46%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ +PG+M MS S+ ++ + M + L G ++S PG + APS+
Sbjct: 123 LTVKPGAMIAMSPSVSLKG----SVKFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 173
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+ +L + G E DAFL + V + +Q + EG K++G G+
Sbjct: 174 LGDI-SNLKLNGQEQWSVGKDAFLACTQGI-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 231
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
F+ + G++++K+L GE VD +VA ++ +G + + G+ LV T
Sbjct: 232 LFVQSFGAIIRKDLVEGEKYIVDNGHLVAWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 290
Query: 224 GPGIVFIQS 232
GPG +++Q+
Sbjct: 291 GPGTIYLQT 299
>gi|116330941|ref|YP_800659.1| hypothetical protein LBJ_1316 [Leptospira borgpetersenii serovar
Hardjo-bovis str. JB197]
gi|116124630|gb|ABJ75901.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
Hardjo-bovis str. JB197]
Length = 221
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
++ L P + + A G+M +M+ +++E + + FG ++ + + P G
Sbjct: 14 LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRKFFGGESFFFNIFKAPAQG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ +DL G L+ + A+L S + + + +R+ GG
Sbjct: 74 AEIGF----APELPGDVVGVDLTNTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG ++ G G F+ A G ++ ++V T+D +VA S+ +I K G +
Sbjct: 126 EGIFLLEILGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ LV TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205
>gi|448420452|ref|ZP_21581199.1| hypothetical protein C474_20721 [Halosarcina pallida JCM 14848]
gi|445673603|gb|ELZ26163.1| hypothetical protein C474_20721 [Halosarcina pallida JCM 14848]
Length = 226
Score = 48.5 bits (114), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 81/199 (40%), Gaps = 7/199 (3%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ + L E + A G+M ++EME + + G++ R P
Sbjct: 13 LLTVSLAESESLRAESGAMVSHDTAIEMETNATGGFLKSIRRAFGGESFFQNTFRATAPG 72
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
D V A P I ++L G L Q ++L + V R GG EG
Sbjct: 73 D--VQFAPPLPGDISHVELT--GETLYVQSGSYLAGDAGLDVDTEFGG-ARTFFGG-EGL 126
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
K+TG G F+ + G++ + +L+ E VD +VA + ++ G +R +
Sbjct: 127 FLLKVTGSGPLFLSSYGAIEEISLDDRESFVVDTGHVVAFEDTAEFTVRRVGGLRSTLTS 186
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV G G V++QS
Sbjct: 187 GEGLVCE-FGGSGTVWVQS 204
>gi|335438291|ref|ZP_08561039.1| hypothetical protein HLRTI_14150 [Halorhabdus tiamatea SARL4B]
gi|334892485|gb|EGM30718.1| hypothetical protein HLRTI_14150 [Halorhabdus tiamatea SARL4B]
Length = 235
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 1/76 (1%)
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
+L+G G F+ + G++ + +L+ GE TVD IVA S N +K G ++ +F G+
Sbjct: 131 ELSGTGRTFLSSYGAIERVDLDPGETYTVDTGHIVAFEGSTNFDVKKVGGLKSTLFSGEG 190
Query: 217 LVTAVVTGPGIVFIQS 232
LV +G G V++Q+
Sbjct: 191 LVCE-FSGEGSVWLQT 205
>gi|337269156|ref|YP_004613211.1| hypothetical protein Mesop_4696 [Mesorhizobium opportunistum
WSM2075]
gi|336029466|gb|AEH89117.1| protein of unknown function DUF124 [Mesorhizobium opportunistum
WSM2075]
Length = 232
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 67/146 (45%), Gaps = 6/146 (4%)
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G + + I+PI L +G E+ + FL + +V + + + ++ G GF
Sbjct: 84 GIIAFSRDGPGHIVPIHLR-YGEEIQVREHQFLAATGNVDYTFERVRGLATMLFGQSGFF 142
Query: 155 RQKL---TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
+ G G+ ++ G+V +K L GE I ++ + +SV ++ K + +
Sbjct: 143 IDRFRGDAGDGIVWLHGYGNVFEKTLAPGETIDIEPGGWLFKDTSVKMETKID-RLSSGF 201
Query: 212 FGGD-NLVTAVVTGPGIVFIQSLPFH 236
FG + N + TGPG V IQS+ H
Sbjct: 202 FGANMNFIVNRFTGPGRVGIQSMYLH 227
>gi|451847140|gb|EMD60448.1| hypothetical protein COCSADRAFT_99184 [Cochliobolus sativus ND90Pr]
Length = 250
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 50/198 (25%), Positives = 89/198 (44%), Gaps = 15/198 (7%)
Query: 48 RPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARI 107
+PG+M MS +M ++ + + + L G +T PG + +A P++ I
Sbjct: 61 KPGAMIAMSPTMTLKG----NVKFSLKKALIGGDMTKSTYTGPGE----LLLAPPAIGDI 112
Query: 108 LPIDLAMFGGELLCQ--PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
I L GG DAFL + V+ Q + + EGF K++G G+ +
Sbjct: 113 TIIKL---GGNDTWSVGRDAFLACTQGI-VTEYKSQGIGKAMFSGEGFFVYKISGTGILW 168
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
+ G++++K+L E +D +VA ++ +G I + G+ +V TGP
Sbjct: 169 CTSFGAIIRKDLAANEKYIIDNGHLVAWNCKYVLERVASGGIVSNLASGEGMVCK-FTGP 227
Query: 226 GIVFIQSLPFHRLSQRIA 243
G VF+Q+ +Q +A
Sbjct: 228 GTVFMQTRNATAFAQFMA 245
>gi|367023727|ref|XP_003661148.1| hypothetical protein MYCTH_2300213 [Myceliophthora thermophila ATCC
42464]
gi|347008416|gb|AEO55903.1| hypothetical protein MYCTH_2300213 [Myceliophthora thermophila ATCC
42464]
Length = 425
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 87/189 (46%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ ARPG+M MS S+ ++ + + + + G + S PG + + P L
Sbjct: 232 IEARPGAMIAMSPSIVLKG----SVKFSVKKLVAGGDMGSSTYTGPGE----LLLGPPML 283
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I L + G E D FL +V V + Q + + EG K++G GL
Sbjct: 284 GDI--TSLRLTGQESWSVSHDGFLACTQNV-VKDYKRQGLGKAMFSGEGLWVYKISGTGL 340
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
++ + G++++K+L GE VD +VA + ++ +G I + G+ LV T
Sbjct: 341 LWLTSFGAIIRKDLVEGEKYIVDNGHLVAWNTKYVLERVASGGIISGLASGEGLVCK-FT 399
Query: 224 GPGIVFIQS 232
GPG VFIQ+
Sbjct: 400 GPGTVFIQT 408
>gi|159483201|ref|XP_001699649.1| hypothetical protein CHLREDRAFT_97504 [Chlamydomonas reinhardtii]
gi|158281591|gb|EDP07345.1| predicted protein [Chlamydomonas reinhardtii]
Length = 232
Score = 48.1 bits (113), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 56/122 (45%), Gaps = 3/122 (2%)
Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
QP +L + V + + + GG EGF + G+G I + G++ + +L GE
Sbjct: 94 QPGGYLACDHSVNIGVKMLDIAQGCCGG-EGFFVMEANGRGRLLICSYGAITRYDLAPGE 152
Query: 182 VITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
+D VA TS + I K + V G+ LV A TGPG VF+Q+ L+
Sbjct: 153 KRKIDNGYCVAWTSGMEWNIAKAAKGLWNTVISGEGLV-ANFTGPGTVFVQTRSMQNLAN 211
Query: 241 RI 242
+
Sbjct: 212 AL 213
>gi|15790569|ref|NP_280393.1| hypothetical protein VNG1609C [Halobacterium sp. NRC-1]
gi|169236306|ref|YP_001689506.1| hypothetical protein OE3284R [Halobacterium salinarum R1]
gi|10581083|gb|AAG19873.1| conserved hypothetical protein [Halobacterium sp. NRC-1]
gi|167727372|emb|CAP14160.1| conserved hypothetical protein [Halobacterium salinarum R1]
Length = 235
Score = 48.1 bits (113), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 60/237 (25%), Positives = 97/237 (40%), Gaps = 27/237 (11%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD-- 94
+ L P E ++A PG+M S + ++ + + + + G N +D
Sbjct: 16 VDLDPGESILAEPGAMVSHSAGVSIDTSTTRDGLLSSAKSMLGG---ESAFANEFAADHE 72
Query: 95 -GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
G V +A P I DL GE L D AFL S + V++ GG++
Sbjct: 73 AGRVTLAPPKPGDIAQHDLD---GETLYAVDGAFLASDPSLDVASEF--------GGLKS 121
Query: 153 FLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
L L+G G F+ A G + + +L+ GE VD +VA SV+ + G
Sbjct: 122 LLAGASITPLALSGTGTVFLEAFGGIERVDLDPGESYVVDNEHVVAWEESVDFDARRVGG 181
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAI 263
++ + G+ LV TG G V+ Q+ + IA A+ P + + I I
Sbjct: 182 LKSTLLSGEGLVME-FTGGGSVWYQTRGLDAFTSIIAGAL--PMTEHDSDGGIDIDI 235
>gi|456862794|gb|EMF81306.1| TIGR00266 family protein [Leptospira weilii serovar Topaz str.
LT2116]
Length = 221
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 96/203 (47%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTITSVVLRNP--G 91
++ L P + + A G+M +M+ +++E + + F G++ + + P G
Sbjct: 14 LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRRFFGGESFFFNIFKAPSGG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ +DL G L+ + A+L S + + + +R+ GG
Sbjct: 74 AELGF----APELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG ++ G G F+ A G ++ ++V T+D +VA S+ +I K +G +
Sbjct: 126 EGIFLLEVLGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLRYKITKASGSWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ LV TG G V IQ+
Sbjct: 184 TLFGGEGLVME-FTGHGKVLIQT 205
>gi|308069406|ref|YP_003871011.1| hypothetical protein PPE_02645 [Paenibacillus polymyxa E681]
gi|305858685|gb|ADM70473.1| DUF124 domain-containing protein [Paenibacillus polymyxa E681]
Length = 226
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 9/206 (4%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
G + ++ L P E + A G+M MS S++++ G+ + L G+ LR
Sbjct: 9 GAFAMLKVQLNPGEMIKAEMGAMVSMSSSVDIKGTVDGGLLRGLGRMLSGEKFFFQELRA 68
Query: 90 P-GPSDGFVGIAAPSLARILPIDL-AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
GP++ V +A S+ + ++L + L Q D FL ++VS + ++ +
Sbjct: 69 ARGPAE--VLLAPASIGDVQAVELDGTY--RLFVQKDGFLACTEGIEVSTKMQNLMKGLF 124
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGP 206
G EGF +++G+G F+ + G++ + GE +D + +VA ++ +I K +
Sbjct: 125 SG-EGFFIVEISGRGTVFLSSYGAIHPIYIAPGEERIIDNAHLVAWPDYMDYKIEKASKG 183
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
+V G+ LV G G V IQS
Sbjct: 184 WLSSVTSGEALVCR-FRGEGTVLIQS 208
>gi|433639418|ref|YP_007285178.1| TIGR00266 family protein [Halovivax ruber XH-70]
gi|433291222|gb|AGB17045.1| TIGR00266 family protein [Halovivax ruber XH-70]
Length = 225
Score = 47.8 bits (112), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 51/199 (25%), Positives = 90/199 (45%), Gaps = 11/199 (5%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--KTITSVVLRNPGPSD 94
+ L E ++A PG+M S ++E+ + + + + G + + GP
Sbjct: 16 VELAAGETILAEPGAMVGHSPNVEISTASSRDGLLSSAKSMLGGESLVVNEFTAEGGP-- 73
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G V +A P+ + +L E L D AFL S + + + +++++GG
Sbjct: 74 GEVRLAPPTPGDVKEHELQ---DETLYTTDGAFLASSPGIDIDSEF-GGLKSMLGGAS-L 128
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
L G G AFI A G + + +L+ GE T+D ++A SV + G ++ +
Sbjct: 129 TPLALKGTGTAFIDAYGGLERLDLDPGESYTLDNEHLIAWDDSVEFDTRRVGGLKSTLLS 188
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV TGPG V+ Q+
Sbjct: 189 GEGLVFD-FTGPGSVWYQT 206
>gi|418696575|ref|ZP_13257584.1| TIGR00266 family protein [Leptospira kirschneri str. H1]
gi|421089051|ref|ZP_15549866.1| TIGR00266 family protein [Leptospira kirschneri str. 200802841]
gi|421109955|ref|ZP_15570462.1| TIGR00266 family protein [Leptospira kirschneri str. H2]
gi|421130944|ref|ZP_15591135.1| TIGR00266 family protein [Leptospira kirschneri str. 2008720114]
gi|409956104|gb|EKO15036.1| TIGR00266 family protein [Leptospira kirschneri str. H1]
gi|410002172|gb|EKO52694.1| TIGR00266 family protein [Leptospira kirschneri str. 200802841]
gi|410004982|gb|EKO58786.1| TIGR00266 family protein [Leptospira kirschneri str. H2]
gi|410357748|gb|EKP04968.1| TIGR00266 family protein [Leptospira kirschneri str. 2008720114]
Length = 221
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 52/200 (26%), Positives = 93/200 (46%), Gaps = 10/200 (5%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPS 93
++ L P + + A G+M +M+ +++E + + FG ++ V + P +
Sbjct: 14 LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKAP-VA 72
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G +G+A ++ +DL G L+ + A+L S + + +R+ +GG EG
Sbjct: 73 GGEIGLAPELPGDVVGVDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFLGG-EGV 128
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVF 212
+ G G F+ A G +V ++V T+D IVA S+ +I K G + F
Sbjct: 129 FLLEAIGVGKLFLNAYGGIVP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFF 186
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
GG+ LV +G G V IQ+
Sbjct: 187 GGEGLVME-FSGHGRVLIQT 205
>gi|223997894|ref|XP_002288620.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975728|gb|EED94056.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 302
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 44/191 (23%), Positives = 79/191 (41%), Gaps = 8/191 (4%)
Query: 85 VVLRNPGPS---DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
+++N PS D + + P ++PI+ EL+ +FL N V V+ + +
Sbjct: 64 TIVKNTSPSRMGDVLLAPSEPGGICLVPIN--GMRDELMLTKGSFLACDNSVDVTTIVQR 121
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
VRN + GF K +G+G + + GS+ + L E +D +VA T+S+
Sbjct: 122 GVRNSLSSGTGFFLMKASGRGTIAMASYGSIHKYTLSSDETRMIDNGHLVAWTASMRYST 181
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
+ GPG+V++QS H+ +Q A F
Sbjct: 182 TLASSSVWGSISSGEGLMCRFEGPGVVYLQS---HKPNQSDEDAKRRGRNGGGSPFAACF 238
Query: 262 AIFFFLAYVVI 272
A+ F + +++I
Sbjct: 239 ALLFVIVWILI 249
>gi|146323366|ref|XP_754754.2| DUF124 domain protein [Aspergillus fumigatus Af293]
gi|129558326|gb|EAL92716.2| DUF124 domain protein [Aspergillus fumigatus Af293]
gi|159127762|gb|EDP52877.1| DUF124 domain protein [Aspergillus fumigatus A1163]
Length = 333
Score = 47.8 bits (112), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DAFL S + + + V G EG KL+G GL ++ + G++++K++ GE
Sbjct: 210 DAFLASTSGISKDYQAQGLSKGVFSG-EGLFVYKLSGVGLVWLQSFGAIIRKDIADGESY 268
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD +VA ++ +G I + G+ L TGPG V++Q+
Sbjct: 269 YVDNGHLVAWNCKYKMERVASGGIISNISSGEGLA-CRFTGPGTVYLQT 316
>gi|359726625|ref|ZP_09265321.1| hypothetical protein Lwei2_06447 [Leptospira weilii str.
2006001855]
gi|417777919|ref|ZP_12425731.1| TIGR00266 family protein [Leptospira weilii str. 2006001853]
gi|410781889|gb|EKR66456.1| TIGR00266 family protein [Leptospira weilii str. 2006001853]
Length = 221
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
++ L P + + A G+M +M+ +++E + + FG ++ + + P G
Sbjct: 14 LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRRFFGGESFFFNIFKAPSGG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ +DL G L+ + A+L S + + + +R+ GG
Sbjct: 74 AELGF----APELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG ++ G G F+ A G ++ ++V T+D +VA S+ +I K G +
Sbjct: 126 EGIFLLEVLGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ LV TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205
>gi|65322229|ref|ZP_00395188.1| COG2013: Uncharacterized conserved protein [Bacillus anthracis str.
A2012]
Length = 104
Score = 47.4 bits (111), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 48/105 (45%), Gaps = 14/105 (13%)
Query: 19 QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
Q I +++ G + Q +I L P+E V+A G+M M +EME I+ + G LF
Sbjct: 2 QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59
Query: 79 GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPID 111
GK + + V N G V AAP +I+P+D
Sbjct: 60 GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPRKIIPVD 104
>gi|159469370|ref|XP_001692840.1| predicted protein [Chlamydomonas reinhardtii]
gi|158268629|gb|EDO95688.1| predicted protein [Chlamydomonas reinhardtii]
Length = 152
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 54/122 (44%), Gaps = 2/122 (1%)
Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
QP +L + ++ L G EGF + G+G I + G++ + +L GE
Sbjct: 27 QPGGYLACDHSHCIARILAHAPSQGCCGGEGFFVMEANGRGRLLICSYGAITRYDLAPGE 86
Query: 182 VITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
+D VA TS + I K + V G+ LV A TGPG VF+Q+ L+
Sbjct: 87 KRKIDNGYCVAWTSGMEWNIAKAAKGLWNTVISGEGLV-ANFTGPGTVFVQTRSMQNLAN 145
Query: 241 RI 242
+
Sbjct: 146 AL 147
>gi|257051409|ref|YP_003129242.1| hypothetical protein Huta_0322 [Halorhabdus utahensis DSM 12940]
gi|256690172|gb|ACV10509.1| protein of unknown function DUF124 [Halorhabdus utahensis DSM
12940]
Length = 235
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 44/76 (57%), Gaps = 1/76 (1%)
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
+L+G G F+ + G++ + +L+ GE TVD IVA + N +K G ++ +F G+
Sbjct: 131 ELSGTGRTFLSSYGAIERVDLDPGETYTVDTGHIVAFEGTTNFDVKKVGGLKSTLFSGEG 190
Query: 217 LVTAVVTGPGIVFIQS 232
LV +G G V++Q+
Sbjct: 191 LVCE-FSGEGSVWLQT 205
>gi|440638033|gb|ELR07952.1| hypothetical protein GMDG_02811 [Geomyces destructans 20631-21]
Length = 365
Score = 47.4 bits (111), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 57/109 (52%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DA L S V V + Q + + EG K++G G+ +I + G++++K+L+ GE
Sbjct: 241 DALLASTQGV-VKDFKRQGLGKAMFSGEGLFVFKISGTGIMWITSFGAIIRKDLQEGERY 299
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD ++A + ++ +G I + G+ LV TGPG +F+Q+
Sbjct: 300 VVDNGHLIAWNTKYVLERIASGGIISGIASGEGLVCK-FTGPGTIFLQT 347
>gi|281411601|ref|YP_003345680.1| hypothetical protein Tnap_0161 [Thermotoga naphthophila RKU-10]
gi|281372704|gb|ADA66266.1| protein of unknown function DUF124 [Thermotoga naphthophila RKU-10]
Length = 224
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 48/198 (24%), Positives = 87/198 (43%), Gaps = 20/198 (10%)
Query: 27 ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-----GKT 81
+L G + ++ L E VV PG+M +M G +E+ G+W+ L G++
Sbjct: 6 VLKGSYALLKVFLSIGESVVVEPGAMVYMKGPIEVNT----SATGGVWKALKRAILGGES 61
Query: 82 ITSVVLRNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
+ G S+ VG+A P ++P+ ++ Q +FL V++ +
Sbjct: 62 FFMNTYISRGESE--VGLAPQLPGDIDVIPLKDILY-----VQSTSFLACDPSVEMDVSF 114
Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
+++ G EG K G G + + G + L+ GE TVD +VA +V
Sbjct: 115 GG-LKSFFSG-EGIFLLKFVGHGDVAVSSFGGIKSIELKPGEEFTVDTGHVVAFDGTVKW 172
Query: 200 QIKYNGPIRRAVFGGDNL 217
++ G ++ +FGG+ L
Sbjct: 173 NVRTFGGLKSTLFGGEGL 190
>gi|300790364|ref|YP_003770655.1| hypothetical protein AMED_8558 [Amycolatopsis mediterranei U32]
gi|384153893|ref|YP_005536709.1| hypothetical protein RAM_43925 [Amycolatopsis mediterranei S699]
gi|399542242|ref|YP_006554904.1| hypothetical protein AMES_8427 [Amycolatopsis mediterranei S699]
gi|299799878|gb|ADJ50253.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
gi|340532047|gb|AEK47252.1| hypothetical protein RAM_43925 [Amycolatopsis mediterranei S699]
gi|398323012|gb|AFO81959.1| hypothetical protein AMES_8427 [Amycolatopsis mediterranei S699]
Length = 225
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 7/201 (3%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGP 92
V ++ML P E G+M S +++++ G+ + +L G++ P P
Sbjct: 13 VARLMLAPGEPCQVESGAMMATSYGVQVQSQAQGGIMKGLGRAFLSGESFFISTFTAP-P 71
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
+ G+V +AA I I L G + + +L S + V+ + T ++N++GG G
Sbjct: 72 NGGWVDVAANLPGDIQVITLDGRTGWAVTR-GCWLASSHGVQ-TETKWGGMKNLMGGEGG 129
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAV 211
FL TGQG + G+V L+ GE++T+D +VA +V QI K I +++
Sbjct: 130 FLTHA-TGQGQLLVSCYGAVETITLQQGEMVTIDTGHVVAYADTVQYQIRKVAQGIIQSM 188
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ LV V GPG + Q+
Sbjct: 189 KSGEGLVFDFV-GPGQIMTQT 208
>gi|116514068|ref|YP_812974.1| hypothetical protein LBUL_0969 [Lactobacillus delbrueckii subsp.
bulgaricus ATCC BAA-365]
gi|116514121|ref|YP_813027.1| hypothetical protein LBUL_1039 [Lactobacillus delbrueckii subsp.
bulgaricus ATCC BAA-365]
gi|116093383|gb|ABJ58536.1| hypothetical protein LBUL_0969 [Lactobacillus delbrueckii subsp.
bulgaricus ATCC BAA-365]
gi|116093436|gb|ABJ58589.1| hypothetical protein LBUL_1039 [Lactobacillus delbrueckii subsp.
bulgaricus ATCC BAA-365]
Length = 222
Score = 47.4 bits (111), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 51/230 (22%), Positives = 101/230 (43%), Gaps = 18/230 (7%)
Query: 23 IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ +++ GG + + L E + G+M +M + + +G G++
Sbjct: 1 MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGSLLGAIGRS 60
Query: 82 ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
+TS + + G +DG +G+A +I + + G + C AFL S + V
Sbjct: 61 LTSGESMFITQATGDADGGQIGVAPAIPGKIEKLSV---GKQQYCLNTGAFLASDDSVSY 117
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+ + +GG GF + G+G + A G +V+ + + I++D ++A +
Sbjct: 118 KMRSQKLSKAALGGTGGFFVMETEGEGDMLVNAFGDLVELTVTSDKPISIDNEHVIAWDA 177
Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+++ IK + +FG G+ LV TG G V IQ+ H L++ +
Sbjct: 178 NLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVLIQTRNLHSLAEAL 222
>gi|378731982|gb|EHY58441.1| hypothetical protein HMPREF1120_06451 [Exophiala dermatitidis
NIH/UT8656]
Length = 395
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 51/189 (26%), Positives = 85/189 (44%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS S+ ++ + M + L G +T PG + +A SL
Sbjct: 201 LTAKPGAMIAMSPSITLKGAV----KFSMKKLLIGGEMTHSTFTGPGE----LLLAPSSL 252
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I + L G E DAFL V V Q + + EG K++G+G+
Sbjct: 253 GDISLLRLD--GNETWSVGRDAFLACTQGV-VKEYKSQGIGKAMFSGEGLFVYKMSGRGI 309
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
++ G++++K+L GE VD +VA ++ +G I G+ LV +
Sbjct: 310 LWMSTFGAILKKDLVDGERYIVDNGHLVAWNCKYVLERVASGGIISGAASGEGLVCN-FS 368
Query: 224 GPGIVFIQS 232
GPG V++Q+
Sbjct: 369 GPGTVYLQT 377
>gi|381163294|ref|ZP_09872524.1| TIGR00266 family protein [Saccharomonospora azurea NA-128]
gi|418461391|ref|ZP_13032466.1| hypothetical protein SZMC14600_10578 [Saccharomonospora azurea SZMC
14600]
gi|359738494|gb|EHK87379.1| hypothetical protein SZMC14600_10578 [Saccharomonospora azurea SZMC
14600]
gi|379255199|gb|EHY89125.1| TIGR00266 family protein [Saccharomonospora azurea NA-128]
Length = 225
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 89/203 (43%), Gaps = 11/203 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V +++L P E G+M S M M+ G+ + G V P+
Sbjct: 13 VARLLLAPHEPAQVEAGAMLATSYGMHMQASTQGGVMKGLGRAFLGGESLFVSTYTAPPN 72
Query: 94 DGFVGIAA--PSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
G+V +AA P R++ +D G + C ++L + + +++ N+ GG
Sbjct: 73 GGWVDVAAGLPGDVRVIEMD----GRQGWCVTRGSWLANSHGIQLETKWG-GFSNLFGGE 127
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
GFL GQG + G++ NL+ GE +T+D +VA +V Q+ K + + +
Sbjct: 128 GGFLTHA-QGQGQLVVSCYGAIDVVNLQPGEYVTIDSGHVVAYADTVQSQLRKVSQGVIQ 186
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
++ G+ V GPG V Q+
Sbjct: 187 SLKSGEGFVFDFA-GPGQVLTQT 208
>gi|257057406|ref|YP_003135238.1| hypothetical protein Svir_34470 [Saccharomonospora viridis DSM
43017]
gi|256587278|gb|ACU98411.1| conserved hypothetical protein TIGR00266 [Saccharomonospora viridis
DSM 43017]
Length = 225
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 53/210 (25%), Positives = 92/210 (43%), Gaps = 18/210 (8%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V +++L P E G+M S M +E G+ + L G V P+
Sbjct: 13 VARLLLAPHEPAHVEAGAMLATSYGMHVEAQAQGGVMKGLGRALLGGESLFVSTYTAPPN 72
Query: 94 DGFVGIAA--PSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
G+V IA+ P R++ +D G C ++L + + +++ N+ GG
Sbjct: 73 GGWVDIASSLPGDIRLIEMD----GRVGWCVTRGSWLANSHGIQLETRWGG-FSNLFGGE 127
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
GFL GQG + G++ NL+ GE +T+D +VA +V Q++ + R
Sbjct: 128 GGFLTHA-QGQGQLIVACYGAIDVINLQPGEYVTIDSGHVVAYADTVQSQLR---KVSRG 183
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
V + ++ +G G+VF + P L+Q
Sbjct: 184 V------IQSLKSGEGLVFDFAGPGQILTQ 207
>gi|448408461|ref|ZP_21574256.1| hypothetical protein C475_07996 [Halosimplex carlsbadense 2-9-1]
gi|445674316|gb|ELZ26860.1| hypothetical protein C475_07996 [Halosimplex carlsbadense 2-9-1]
Length = 229
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 3/114 (2%)
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
L Q +FL S V+V +T + GG EG +L G G AF+ + G++ + +
Sbjct: 95 LYVQSSSFLASDPTVEV-DTEFGGAKTFFGG-EGLFLLRLEGTGPAFLSSYGAIDEHEVT 152
Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
E VD IVA S++ ++ G ++ +F G+ LV +G G V+ Q+
Sbjct: 153 SDESFVVDTGHIVAFEESLSFDVRKVGGLKSTLFSGEGLVCE-FSGEGTVWTQT 205
>gi|432329095|ref|YP_007247239.1| TIGR00266 family protein [Aciduliprofundum sp. MAR08-339]
gi|432135804|gb|AGB05073.1| TIGR00266 family protein [Aciduliprofundum sp. MAR08-339]
Length = 223
Score = 47.0 bits (110), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
++ L P E V A G+M +MS ++ ++ G+ + + K + S+
Sbjct: 14 LKVHLSPGESVTAEAGAMVYMSSNIGVKTTTGGGFLKGLARKILTKQTLFMNTYYAEGSE 73
Query: 95 GFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVN---DVKVSNTLDQRVRNVI--G 148
G+V + AP L I+ ID++ L +L S VK + R + G
Sbjct: 74 GYV-VFAPGLPGDIMEIDVSK---PLYVSDTNYLASTGLQFGVKFTG-----FRGIFTPG 124
Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
G+ F KL G G ++ G V L+ GE + +D + A + + +++ G ++
Sbjct: 125 GMFWF---KLEGIGKVWLATFGGVDVIELKPGEHLLIDNIHLAAFDAGMQFRLRKFGKLK 181
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ L+T GPG VFIQS
Sbjct: 182 SFLFGGEYLLTE-FEGPGRVFIQS 204
>gi|51891979|ref|YP_074670.1| hypothetical protein STH841 [Symbiobacterium thermophilum IAM
14863]
gi|51855668|dbj|BAD39826.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM
14863]
Length = 226
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 58/216 (26%), Positives = 91/216 (42%), Gaps = 44/216 (20%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGKTI-------TSVVLR 88
+ L P E+V A PG+M MS ++E+ E EV G + G+++ TS
Sbjct: 16 VYLGPGEEVQAEPGAMVAMSSTVEL------ETEVKGGFLGALGRSVLGGESFFTSRYRA 69
Query: 89 NPGPSDGFVGIAAPSLARILPIDLA---MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
GP + +LA LP D+ + G + A+L + ++ V D R
Sbjct: 70 RGGPGE-------LALAPTLPGDIGYVELRGETFYLKSGAYLAADPELAV----DSRWGG 118
Query: 146 VIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
G G G ++ G G A G++ +K L GE VD +V + +N
Sbjct: 119 ARGFFGSGGLFLLRVQGTGGLLFTAYGALHEKELAPGERYRVDTGHVVGFSEGLNF---- 174
Query: 204 NGPIRRAVFG-------GDNLVTAVVTGPGIVFIQS 232
+R+A G G+ LV TGPG V++Q+
Sbjct: 175 --AVRKAARGWFSTLASGEGLVCE-FTGPGRVYLQT 207
>gi|411118304|ref|ZP_11390685.1| TIGR00266 family protein [Oscillatoriales cyanobacterium JSC-12]
gi|410712028|gb|EKQ69534.1| TIGR00266 family protein [Oscillatoriales cyanobacterium JSC-12]
Length = 231
Score = 46.6 bits (109), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 55/205 (26%), Positives = 95/205 (46%), Gaps = 17/205 (8%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGP 92
+ ++ L E+V SM MS + +E + + L G +++ V P P
Sbjct: 13 ISRVRLAGGEQVRVEAASMVGMSAGITLETTTTGGFMQSLKRSLLGGESLFQNVYTAP-P 71
Query: 93 SDGFVGIAAPSLARILPIDLAMFG--GELLCQPDAFLCSVNDVKVSNTLDQR---VRNVI 147
G V +A PSL P DL + +L Q A++ S DV ++ LD + ++
Sbjct: 72 QGGEVWVA-PSL----PGDLNVLTITEPMLIQSGAYVAS--DVTIA--LDTKWGGSKSFF 122
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
G G + GQG + + G++ + L GE T+D +VA++ S+ + + G I
Sbjct: 123 GTGGGLFMLRAEGQGQVVVSSYGAIHEMTLAPGESFTLDTGHLVALSESMQFKTRSIGGI 182
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
+ +F G+ V A +TGPG +Q+
Sbjct: 183 KTFMFSGEGFV-ADLTGPGRFLMQT 206
>gi|334127084|ref|ZP_08501014.1| hypothetical protein HMPREF9081_0601 [Centipeda periodontii DSM
2778]
gi|333390046|gb|EGK61198.1| hypothetical protein HMPREF9081_0601 [Centipeda periodontii DSM
2778]
Length = 227
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 64/119 (53%), Gaps = 11/119 (9%)
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI---EGFLRQKLTGQGLAFILAGGSVVQK 175
++ Q FL + + V D +++ ++ G+ EGF KLTG+G+AF+ + G +
Sbjct: 97 MIVQKGGFLAAEESINV----DSKMQGLMQGLFSQEGFFLLKLTGRGVAFLSSYGVIHVL 152
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKY--NGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
NLE GE + VD +VA ++ +++ NG + ++ G+ LV GPG V IQ+
Sbjct: 153 NLEQGEEVIVDNGHLVAWPDYMDYKVEKASNGWV-SSIMSGECLVCR-FRGPGPVLIQT 209
>gi|119492280|ref|XP_001263579.1| hypothetical protein NFIA_068520 [Neosartorya fischeri NRRL 181]
gi|119411739|gb|EAW21682.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 334
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 32/109 (29%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DAFL S + + + V G EG KL+G GL ++ + G++++K++ GE
Sbjct: 211 DAFLASTSGISKDYQAQGLSKGVFSG-EGLFVYKLSGVGLVWLQSFGAIIKKDIADGESY 269
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD +VA ++ +G I + G+ L TGPG V++Q+
Sbjct: 270 YVDNGHLVAWNCKYKMERVASGGIISNMSSGEGLA-CRFTGPGTVYLQT 317
>gi|374324403|ref|YP_005077532.1| hypothetical protein HPL003_22935 [Paenibacillus terrae HPL-003]
gi|357203412|gb|AET61309.1| hypothetical protein HPL003_22935 [Paenibacillus terrae HPL-003]
Length = 226
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 99/206 (48%), Gaps = 9/206 (4%)
Query: 30 GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLR- 88
G + ++ + P E + A G+M MS S++++ G+ + L G+ LR
Sbjct: 9 GAFAMLKVQMNPGETIKAEMGAMVSMSSSVDIKGTVDGGLLRGLGRMLSGEKFFFQELRA 68
Query: 89 NPGPSDGFVGIAAPSLARILPIDL-AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
+ GP++ V +A S+ + ++L + LL Q D FL ++VS + ++ +
Sbjct: 69 SRGPAE--VLLAPASIGDVQAVELDGTY--RLLVQKDGFLACTEGIEVSTKMQNLMKGLF 124
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGP 206
G EGF +++G+G F+ + G++ + GE +D + +VA ++ +I K +
Sbjct: 125 SG-EGFFIVEISGRGTVFLSSYGAIHPIYVAPGEERIIDNAHLVAWPDYMDYKIEKASKG 183
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
+V G+ LV + G V IQS
Sbjct: 184 WLSSVTSGEALVCRFRS-EGTVLIQS 208
>gi|302913033|ref|XP_003050829.1| hypothetical protein NECHADRAFT_61207 [Nectria haematococca mpVI
77-13-4]
gi|256731767|gb|EEU45116.1| hypothetical protein NECHADRAFT_61207 [Nectria haematococca mpVI
77-13-4]
Length = 455
Score = 46.6 bits (109), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 88/187 (47%), Gaps = 13/187 (6%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
A+PG+M MS S+ ++ + + + + + G ++S PG V +A SL
Sbjct: 264 AKPGAMIAMSPSVTLKG----QIKFSVKKMITGGEMSSSTFVGPG----EVLLAPHSLGD 315
Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
+ L + G E DA+L S V + + Q + + EG K++G GL +
Sbjct: 316 V--TTLRLTGQEKWSVGHDAYLASTQGV-IKDFKRQSISKAMFSGEGLWVYKISGTGLMW 372
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
I + G++++K L GE VD ++A + ++ +G I + + LV TGP
Sbjct: 373 ISSFGAIIKKTLVDGEKYIVDNGHLIAWNTKYVMERVASGGIISGMASAEGLVCK-FTGP 431
Query: 226 GIVFIQS 232
G +++Q+
Sbjct: 432 GTIYMQT 438
>gi|410450302|ref|ZP_11304343.1| TIGR00266 family protein [Leptospira sp. Fiocruz LV3954]
gi|410015815|gb|EKO77906.1| TIGR00266 family protein [Leptospira sp. Fiocruz LV3954]
Length = 221
Score = 46.6 bits (109), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
++ L P + + A G+M +M+ + +E + + FG ++ + + P G
Sbjct: 14 LKVKLLPGQTIKAEAGAMVYMTPGISVETKMGSGFLSAVSRRFFGGESFFFNIFKAPTGG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ IDL G L+ + A+L S V + + +R+ +GG
Sbjct: 74 AEIGF----APELPGDVVGIDLTDTG--LIVEAGAYLASDETVTMKSMFGG-LRSFLGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG + G G F+ + G ++ ++V T+D +VA S++ +I K G +
Sbjct: 126 EGIFLLEAIGNGKLFLNSYGGIIP--IDVQGSYTIDTGHVVAFDKSLSYKIAKAGGSWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ LV TG G V IQ+
Sbjct: 184 TLFGGEGLVMQF-TGHGKVLIQT 205
>gi|170290249|ref|YP_001737065.1| hypothetical protein Kcr_0629 [Candidatus Korarchaeum cryptofilum
OPF8]
gi|170174329|gb|ACB07382.1| protein of unknown function DUF124 [Candidatus Korarchaeum
cryptofilum OPF8]
Length = 231
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 50/202 (24%), Positives = 93/202 (46%), Gaps = 17/202 (8%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMW--QWLFGKTITSVVLRNP 90
+ L E + A G+M +M+ +++M+ + ++G+ Q F T+V
Sbjct: 16 VKLNQGETITAEAGAMTYMTPNIQMKTRARAGVLDTIKLGVLGGQSFFVNDYTAV----G 71
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GP G V +A+ L I + L G + Q +++ S V++ + + G
Sbjct: 72 GP--GEVALASAPLGDIEKLTLDGRG--YIIQRSSYVASDPTVELDIKWQGFTKGIFG-- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
+G K +G+G FI G++ + L GE + VD +VA + + + +++ G ++
Sbjct: 126 QGLFMIKTSGRGNLFINTFGAIDRHELSAGESLIVDNFHLVAFSDTCSYEVRRMGGLKEL 185
Query: 211 VFGGDNLVTAVVTGPGIVFIQS 232
G+ LV V GPG V IQ+
Sbjct: 186 ALSGEGLVVE-VRGPGEVLIQT 206
>gi|398336773|ref|ZP_10521478.1| hypothetical protein LkmesMB_14976 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 221
Score = 46.2 bits (108), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 48/200 (24%), Positives = 94/200 (47%), Gaps = 10/200 (5%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPS 93
++ L P + + A G+M +M+ +++E + + FG ++ + + P +
Sbjct: 14 LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRRFFGGESFFFNIFKAP-SA 72
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
+G+A ++ +DL G L+ + A+L S + + +R+ GG EG
Sbjct: 73 GAEIGLAPELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKPKFGG-IRSFFGG-EGV 128
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVF 212
++ G+G F+ A G ++ ++V T+D +VA ++ +I K G + +F
Sbjct: 129 FLLEILGKGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKTLQYRIAKAGGSWKSTLF 186
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
GG+ LV TG G V IQ+
Sbjct: 187 GGEGLVMEF-TGHGKVLIQT 205
>gi|13470475|ref|NP_102044.1| hypothetical protein mlr0194 [Mesorhizobium loti MAFF303099]
gi|14021217|dbj|BAB47830.1| mlr0194 [Mesorhizobium loti MAFF303099]
Length = 221
Score = 46.2 bits (108), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 37/146 (25%), Positives = 66/146 (45%), Gaps = 6/146 (4%)
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G + + I+PI L G E+ + FL + +V + + + ++ G GF
Sbjct: 73 GIIAFSRDGPGHIVPIHLRR-GEEIQVREHQFLAATGNVDYTFERVRGLATMLFGQSGFF 131
Query: 155 RQKLTGQG---LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
+ G G + ++ G+V +K L GE I ++ + +SV ++ K + +
Sbjct: 132 IDRFRGDGGDGIVWLHGYGNVFEKTLAAGETIDIEPGGWLFKDASVRMETKID-RLSSGF 190
Query: 212 FGGD-NLVTAVVTGPGIVFIQSLPFH 236
FG + N + TGPG V IQS+ H
Sbjct: 191 FGANMNFIVNRFTGPGRVGIQSMYLH 216
>gi|365960832|ref|YP_004942399.1| hypothetical protein FCOL_08985 [Flavobacterium columnare ATCC
49512]
gi|365737513|gb|AEW86606.1| hypothetical protein FCOL_08985 [Flavobacterium columnare ATCC
49512]
Length = 220
Score = 46.2 bits (108), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 58/225 (25%), Positives = 102/225 (45%), Gaps = 35/225 (15%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--------------KTI 82
+ L+P E +VA +M MS ++++ + VG+ + + G KT+
Sbjct: 16 VELEPGETIVAESDAMSSMSAELDLDAKFNGGFFVGLAKKILGGESLFINHFTNNTNKTL 75
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQ 141
S+ L P D I +D G + C Q A++ S VK+S
Sbjct: 76 -SLHLTQSTPGD----------IEIKELD----GTQSYCIQRGAYIASEKGVKLSVKW-A 119
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ ++IGG EG + ++G G A G +++K ++ GE I VD +VA ++ ++
Sbjct: 120 GLGSLIGG-EGLFKLVVSGTGKVIFGAYGGLLEKEID-GEYI-VDTGHLVAYEPNMKLKP 176
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
+ +G I + FGG+ VT + G G ++IQ+ L I R +
Sbjct: 177 QLSGGIFSSFFGGEGFVTR-IEGKGKIYIQTRNLSGLVSWINRQI 220
>gi|359685935|ref|ZP_09255936.1| hypothetical protein Lsan2_15273 [Leptospira santarosai str.
2000030832]
gi|418746675|ref|ZP_13302995.1| TIGR00266 family protein [Leptospira santarosai str. CBC379]
gi|418755449|ref|ZP_13311654.1| TIGR00266 family protein [Leptospira santarosai str. MOR084]
gi|421113424|ref|ZP_15573868.1| TIGR00266 family protein [Leptospira santarosai str. JET]
gi|422004334|ref|ZP_16351554.1| hypothetical protein LSS_12579 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|409964244|gb|EKO32136.1| TIGR00266 family protein [Leptospira santarosai str. MOR084]
gi|410792384|gb|EKR90319.1| TIGR00266 family protein [Leptospira santarosai str. CBC379]
gi|410801198|gb|EKS07372.1| TIGR00266 family protein [Leptospira santarosai str. JET]
gi|417257003|gb|EKT86411.1| hypothetical protein LSS_12579 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|456875109|gb|EMF90340.1| TIGR00266 family protein [Leptospira santarosai str. ST188]
Length = 221
Score = 45.8 bits (107), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 54/203 (26%), Positives = 94/203 (46%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
++ L P + + A G+M +M+ + +E + + FG ++ + + P G
Sbjct: 14 LKVKLLPGQTIKAEAGAMVYMTPGISVETKMGSGFLSAVSRRFFGGESFFFNIFKAPTGG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ IDL G L+ + A+L S V + + +R+ GG
Sbjct: 74 AEIGF----APELPGDVVGIDLTDTG--LIVEAGAYLASDETVTMKSMFGG-LRSFFGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG + G G F+ + G ++ ++V T+D +VA S++ +I K G +
Sbjct: 126 EGIFLLEAIGNGKLFLNSYGGIIP--IDVQGSYTIDTGHVVAFDKSLSYKIAKAGGSWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ LV TG G V IQ+
Sbjct: 184 TLFGGEGLVMQF-TGHGKVLIQT 205
>gi|381157052|ref|ZP_09866286.1| TIGR00266 family protein [Thiorhodovibrio sp. 970]
gi|380880915|gb|EIC23005.1| TIGR00266 family protein [Thiorhodovibrio sp. 970]
Length = 464
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 68/133 (51%), Gaps = 3/133 (2%)
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
++ Q D FL +V+V+ L + + G EGF G+GL F+ + G++ + NL
Sbjct: 333 MIIQKDGFLACTEEVQVNTQLQNLAQGLFSG-EGFFVVGAEGRGLLFLESYGAIHEFNLG 391
Query: 179 VGEVITVDVSCIVAVTSSVNVQIK-YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
GE VD +VA + ++ +++ N I A+ G+ +V TGPG + +Q+ +
Sbjct: 392 PGEQKIVDNGYLVAWSKNMGYRLEAANIGIVAALTSGEAIVCR-FTGPGKILVQTRMARQ 450
Query: 238 LSQRIARAVTSPN 250
I++ + +P+
Sbjct: 451 FGLWISKFLPTPS 463
>gi|258564182|ref|XP_002582836.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237908343|gb|EEP82744.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 342
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DAFL + V + Q + I EG ++G G+ FI + G++++K+L GE
Sbjct: 217 DAFLAGTQGI-VKDYKRQGLSKAIFSGEGLYTYMVSGTGILFIQSFGAIIRKDLANGEKY 275
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD +VA ++ +G I + G+ LV TGPG V++Q+
Sbjct: 276 IVDNGHLVAWNCGYVMERIASGGIISGLSSGEGLVCK-FTGPGTVYLQT 323
>gi|319783243|ref|YP_004142719.1| hypothetical protein [Mesorhizobium ciceri biovar biserrulae
WSM1271]
gi|317169131|gb|ADV12669.1| protein of unknown function DUF124 [Mesorhizobium ciceri biovar
biserrulae WSM1271]
Length = 232
Score = 45.8 bits (107), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 66/146 (45%), Gaps = 6/146 (4%)
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G + + I+PI L G E+ + F + +V + + + ++ G GF
Sbjct: 84 GIIAFSRDGPGHIVPIHLR-HGEEIQVREHQFRAATGNVDYTFERVRGLATMLFGQSGFF 142
Query: 155 RQKL---TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
+ +G G+ ++ G+V +K L GE I ++ + +SV ++ K + +
Sbjct: 143 IDRFRGDSGDGIVWLHGYGNVFEKTLAAGETIDIEPGGWLFKDASVRMETKID-RLSSGF 201
Query: 212 FGGD-NLVTAVVTGPGIVFIQSLPFH 236
FG + N + TGPG V IQS+ H
Sbjct: 202 FGANMNFIVNRFTGPGRVGIQSMYLH 227
>gi|443311726|ref|ZP_21041351.1| TIGR00266 family protein [Synechocystis sp. PCC 7509]
gi|442778299|gb|ELR88567.1| TIGR00266 family protein [Synechocystis sp. PCC 7509]
Length = 225
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 56/207 (27%), Positives = 96/207 (46%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
++ + + L+ E++VA G M MSG S + G+ + + G+++ V
Sbjct: 10 DSAIAHVTLESGEELVAEAGCMIAMSGYINASTTLRQGKGGGILGGLKRLVAGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P S G V +A + IL + G L+ Q ++L S ++V + Q ++V
Sbjct: 70 FRSP-TSGGEVFLAPKFMGDILLYKVVGMG--LVVQSTSYLASESNVDIELGF-QGFKSV 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
G E +TG G + + G++ + N++ GE I VD IVA S++ +I G
Sbjct: 126 FSG-ESVFWLNITGNGYVILSSFGAIYEINVD-GEYI-VDTGHIVAFEKSLSFEITKAGS 182
Query: 207 IRRAVF-GGDNLVTAVVTGPGIVFIQS 232
F GG+ LV G G ++ QS
Sbjct: 183 SWMGAFLGGEGLVCKF-KGQGKIYCQS 208
>gi|255944277|ref|XP_002562906.1| Pc20g03550 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211587641|emb|CAP85684.1| Pc20g03550 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 371
Score = 45.8 bits (107), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 50/190 (26%), Positives = 83/190 (43%), Gaps = 15/190 (7%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ RPG+M MS ++ + + + G ++T+ PG V +A P L
Sbjct: 178 LTVRPGAMIAMSPTITLRG----HVSFSFKKLIIGGSMTTSQYTGPGE----VLLAPPML 229
Query: 105 ARI--LPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
I LPID G D+FL + V + G EGF + TG G
Sbjct: 230 GDIVVLPIDT---GEAWNAGRDSFLAHTSGVHHLYQSQGLSKGFFSG-EGFFVYEFTGHG 285
Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
L ++ + G++V+K+L GE VD +VA + ++ +G + + G+ L
Sbjct: 286 LLWLQSFGAIVKKDLREGESYFVDNGHLVAWKCAYEIERPASGGLLSSFSSGEGLA-CRF 344
Query: 223 TGPGIVFIQS 232
GPG V++Q+
Sbjct: 345 KGPGTVYLQT 354
>gi|452949131|gb|EME54602.1| hypothetical protein H074_28018 [Amycolatopsis decaplanina DSM
44594]
Length = 225
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 57/211 (27%), Positives = 101/211 (47%), Gaps = 20/211 (9%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGP 92
V ++ML P E G+M S +++++ G+ + +L G++ P
Sbjct: 13 VARLMLAPGEPAQVESGAMMATSYGVQVQSQAQGGIMKGLGRAFLSGESFFISTYTAP-Q 71
Query: 93 SDGFVGIAA--PSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
+ G+V +AA P +I+ +D G C ++L S + V+ + T ++N++GG
Sbjct: 72 NGGWVDVAANLPGDMQIINLD----GRTGWCVTRGSWLASSHGVQ-TETKWGGLKNLVGG 126
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
GFL TGQG + G++ L+ GEVIT+D +VA +V Q IR+
Sbjct: 127 EGGFLTHA-TGQGPLVVACYGALETVTLQQGEVITIDTGHVVAFADTVQYQ------IRK 179
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
G ++ ++ +G G+VF + P L+Q
Sbjct: 180 VATG---VIQSMKSGEGLVFDFAGPGQLLTQ 207
>gi|410941358|ref|ZP_11373157.1| TIGR00266 family protein [Leptospira noguchii str. 2006001870]
gi|410783917|gb|EKR72909.1| TIGR00266 family protein [Leptospira noguchii str. 2006001870]
Length = 224
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 67/135 (49%), Gaps = 15/135 (11%)
Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG----FLRQK 157
LA LP D+ G + Q +FL S +++ +D + + + G I G FLR
Sbjct: 80 LAPTLPGDVERIELSGTVYVQSSSFLASSPNIE----MDTKFQGLKGFISGESLFFLR-- 133
Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
L+G GL I + G + N++ GE+I VD IVA +N ++ G + GG+ L
Sbjct: 134 LSGNGLLLISSYGGIDLLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGL 191
Query: 218 VTAVVTGPGIVFIQS 232
V A G G V+IQS
Sbjct: 192 V-ARFKGKGKVWIQS 205
>gi|284044779|ref|YP_003395119.1| hypothetical protein Cwoe_3326 [Conexibacter woesei DSM 14684]
gi|283949000|gb|ADB51744.1| protein of unknown function DUF124 [Conexibacter woesei DSM 14684]
Length = 225
Score = 45.8 bits (107), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 62/220 (28%), Positives = 89/220 (40%), Gaps = 24/220 (10%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V + L P E + A G+M SG + +E G+ + + G V
Sbjct: 13 VARCTLAPGETMRAESGAMMATSGGVGVEAKMQGGLMKGLKRSMLGGESLFVTTFTAPSE 72
Query: 94 DGFVGIAAPSLARILPIDLAMF--GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
G+V AA LP D+A+ L A+LCS V++ +T +N++GG
Sbjct: 73 GGWVDCAAN-----LPGDVAVLEIADALNISRGAWLCSSAGVEL-DTKWGGFKNLMGGEG 126
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRA 210
GFL TG G A + G++ L GE + +D +VA SV K +
Sbjct: 127 GFLIHA-TGSGHAVVACYGALDTVELSAGEALVIDSGHVVAFDPSVQFTTRKVTKGMMAT 185
Query: 211 VFGGDNLVTAVVTGPGIVFIQS-------------LPFHR 237
+ G+ LV TGPG V QS LPF R
Sbjct: 186 LKSGEGLVME-FTGPGRVLTQSRNPGALISWLTTVLPFSR 224
>gi|421097931|ref|ZP_15558608.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200901122]
gi|410799003|gb|EKS01086.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200901122]
Length = 221
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 53/203 (26%), Positives = 94/203 (46%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
++ L P + + A G+M +M+ + +E + + FG ++ + + P G
Sbjct: 14 LKVKLLPGQTIKAEAGAMVYMTPGVNVETKMGSGFLSAISRRFFGGESFFFNIFKAPSGG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ +DL G L+ + A+L S + + + +R+ GG
Sbjct: 74 AELGF----APELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKSMFGG-LRSFFGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG ++ G G F+ A G ++ ++V T+D +VA S+ +I K G +
Sbjct: 126 EGIFLLEILGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGNWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ LV TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205
>gi|451338897|ref|ZP_21909424.1| hypothetical protein C791_6527 [Amycolatopsis azurea DSM 43854]
gi|449418388|gb|EMD23976.1| hypothetical protein C791_6527 [Amycolatopsis azurea DSM 43854]
Length = 225
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 55/208 (26%), Positives = 100/208 (48%), Gaps = 14/208 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGP 92
V ++ML P E G+M S +++++ G+ + +L G++ P
Sbjct: 13 VARLMLAPGEPAQVESGAMMATSYGVQVQSQAQGGIMKGLGRAFLSGESFFISTYTAP-Q 71
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
+ G+V +AA + I+L G + + ++L S + V+ + T ++N++GG G
Sbjct: 72 NGGWVDVAANLPGDMQVINLDGRTGWCVTR-GSWLASSHSVQ-TETKWGGLKNLVGGEGG 129
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
FL TGQG + G++ L+ GEVIT+D +VA +V Q IR+
Sbjct: 130 FLTHA-TGQGPLVVACYGALETVTLQQGEVITIDTGHVVAFADTVQYQ------IRKVAT 182
Query: 213 GGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
G ++ ++ +G G+VF + P L+Q
Sbjct: 183 G---VIQSMKSGEGLVFDFAGPGQLLTQ 207
>gi|312865613|ref|ZP_07725838.1| TIGR00266 family protein [Streptococcus downei F0415]
gi|311098881|gb|EFQ57100.1| TIGR00266 family protein [Streptococcus downei F0415]
Length = 248
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 51/223 (22%), Positives = 96/223 (43%), Gaps = 10/223 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMEN-IYIPENEVGMWQWLFGKTITS----VVLR 88
+ +I LK + + GSM F + S+ + + N +G G+++TS + +
Sbjct: 18 LVEIFLKAGQSAYIQRGSMVFHTPSVTLNTQLNAKGNGLGKLFRAVGRSMTSGESAFITQ 77
Query: 89 NPGPSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
+DG APS+ +I+ ++L + AFL + + R
Sbjct: 78 ALSNADGGRLAIAPSMPGQIIALELG--SKQYRLNDGAFLALDGSTQYTMERQSLGRAFF 135
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
GG G GQG + + GS+ + L E +T+D + +VA + ++ QI
Sbjct: 136 GGQGGLFVMSTQGQGTLLVNSFGSIQKIELNQQE-MTIDNAHVVAWSKDLDYQIHLENGF 194
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+++ G+ +V G G V++Q+L +Q I + SP+
Sbjct: 195 LQSIGNGEGIVN-TFRGTGEVYVQTLNIETFAQVIGGHLASPS 236
>gi|330922098|ref|XP_003299695.1| hypothetical protein PTT_10746 [Pyrenophora teres f. teres 0-1]
gi|311326520|gb|EFQ92207.1| hypothetical protein PTT_10746 [Pyrenophora teres f. teres 0-1]
Length = 395
Score = 45.4 bits (106), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 49/189 (25%), Positives = 86/189 (45%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + + + L G ++ V PG + +A +
Sbjct: 203 LTAKPGAMIAMSPTMTLKGAV----KFSLKKALTGGDMSKSVYTGPGE----ILLAPACI 254
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I I LA G E DAFL + V+ Q + + EG K++G G+
Sbjct: 255 GDITIIKLA--GKETWSVGRDAFLACTQGI-VTEYKSQGIGKGMFSGEGLFVYKISGTGV 311
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
+ + G++++K+L E +D +VA ++ +G I + G+ LV T
Sbjct: 312 LWCTSFGAIIRKDLAANEKYIIDNGHLVAWNCKYVLERVASGGIISNLTAGEGLVCK-FT 370
Query: 224 GPGIVFIQS 232
GPG VF+Q+
Sbjct: 371 GPGTVFMQT 379
>gi|121705236|ref|XP_001270881.1| DUF124 domain protein [Aspergillus clavatus NRRL 1]
gi|119399027|gb|EAW09455.1| DUF124 domain protein [Aspergillus clavatus NRRL 1]
Length = 336
Score = 45.4 bits (106), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 86/188 (45%), Gaps = 11/188 (5%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+VA+PG+M MS S+ + + + + G ++ PG + +A +L
Sbjct: 143 IVAKPGAMIAMSHSVTLRGSI----KFSLKKMFAGGEMSLSTFTGPGE----LLLAPSTL 194
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
I+ + L+ + + DAFL + + + + V G EG K+TG GL
Sbjct: 195 GDIIVMRLSGSDSWKVGK-DAFLAATSGISKDLQAQGLTKGVFSG-EGLFIYKMTGVGLL 252
Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
++ + G++++K + E VD +VA ++ +G I + G+ L TG
Sbjct: 253 WLQSFGAIIKKEIAEDESYFVDNGHLVAWNCKYRMERVASGGIISNIGSGEGLA-CRFTG 311
Query: 225 PGIVFIQS 232
PGIV++Q+
Sbjct: 312 PGIVYMQT 319
>gi|257057405|ref|YP_003135237.1| hypothetical protein Svir_34460 [Saccharomonospora viridis DSM
43017]
gi|256587277|gb|ACU98410.1| uncharacterized conserved protein [Saccharomonospora viridis DSM
43017]
Length = 174
Score = 45.1 bits (105), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 5/139 (3%)
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G++ +A + P++L+ G + DA L V+ + +++ GG GFL
Sbjct: 36 GWIDLAPKHPGDVYPLELSGRLGWSVAH-DAVLARPATVRHDHPWPAH-QSLFGGDSGFL 93
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR-RAVFG 213
+ +G G + + G V L+ GEV+TV+ ++A SV V+++ P+ +++
Sbjct: 94 KH-FSGTGPLVLASRGPVDSLTLKAGEVLTVNPLFVLAYPDSVQVRLRALDPVEPQSIRT 152
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G L V GPG +F+Q+
Sbjct: 153 GAGLALDVA-GPGAIFVQT 170
>gi|402085536|gb|EJT80434.1| hypothetical protein GGTG_00433 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 446
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
D +L S V + Q + + EG K++G GL ++ + G++++K+L GE
Sbjct: 322 DGYLASTQGV-IKEYKRQSLGKAMFSGEGLWIYKISGNGLLWLTSFGAIIRKDLADGEKY 380
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD +VA + ++ +G + G+ LV TGPG VF+Q+
Sbjct: 381 VVDNGHLVAWDTKYILERVSSGGLVSGYASGEGLV-CKFTGPGTVFLQT 428
>gi|398333660|ref|ZP_10518365.1| hypothetical protein LalesM3_21091 [Leptospira alexanderi serovar
Manhao 3 str. L 60]
Length = 221
Score = 45.1 bits (105), Expect = 0.032, Method: Compositional matrix adjust.
Identities = 51/203 (25%), Positives = 94/203 (46%), Gaps = 16/203 (7%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
++ L P + + A G+M +M+ +++E + + FG ++ + + P G
Sbjct: 14 LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRRFFGGESFFFNIFKAPSGG 73
Query: 92 PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
GF AP L ++ +DL G L+ + A+L S + + + +R+ GG
Sbjct: 74 AELGF----APELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
EG ++ G G ++ A G ++ ++V T+D +VA S+ +I K G +
Sbjct: 126 EGIFLLEVLGNGKLYLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+FGG+ L TG G V IQ+
Sbjct: 184 TLFGGEGLAMEF-TGHGKVLIQT 205
>gi|385815721|ref|YP_005852112.1| hypothetical protein [Lactobacillus delbrueckii subsp. bulgaricus
2038]
gi|385815782|ref|YP_005852173.1| hypothetical protein [Lactobacillus delbrueckii subsp. bulgaricus
2038]
gi|325125758|gb|ADY85088.1| Hypothetical conserved protein [Lactobacillus delbrueckii subsp.
bulgaricus 2038]
gi|325125819|gb|ADY85149.1| Hypothetical conserved protein [Lactobacillus delbrueckii subsp.
bulgaricus 2038]
Length = 233
Score = 45.1 bits (105), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 51/230 (22%), Positives = 100/230 (43%), Gaps = 18/230 (7%)
Query: 23 IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ +++ GG + + L E + G+M +M + + +G G++
Sbjct: 1 MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60
Query: 82 ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
+TS + + G +DG +G+A I+ + + G + C AFL S + V
Sbjct: 61 LTSGESMFITQATGDADGGQIGVAPAIPGEIVKLSV---GKQQYCLNTGAFLASDDSVSY 117
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+ + GG GF + G+G + A G +V+ + + I++D ++A +
Sbjct: 118 KMRSQKLSKAAFGGTGGFFVMETEGEGDMLVNAFGDLVELTVTSDKPISIDNEHVIAWDA 177
Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+++ IK + +FG G+ LV TG G V IQ+ H L++ +
Sbjct: 178 NLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVLIQTRNLHSLAEAL 222
>gi|15895995|ref|NP_349344.1| hypothetical protein CA_C2738 [Clostridium acetobutylicum ATCC 824]
gi|337737948|ref|YP_004637395.1| hypothetical protein SMB_G2773 [Clostridium acetobutylicum DSM
1731]
gi|384459459|ref|YP_005671879.1| hypothetical protein CEA_G2747 [Clostridium acetobutylicum EA 2018]
gi|15025774|gb|AAK80684.1|AE007771_4 Uncharacterized conserved protein [Clostridium acetobutylicum ATCC
824]
gi|325510148|gb|ADZ21784.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018]
gi|336291390|gb|AEI32524.1| hypothetical protein SMB_G2773 [Clostridium acetobutylicum DSM
1731]
Length = 218
Score = 45.1 bits (105), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 58/108 (53%), Gaps = 3/108 (2%)
Query: 125 AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVIT 184
+FL S + + + N + + G EGF++ + G G+ F+ A G++ + L GE
Sbjct: 102 SFLASTDKINL-NIKSGGGKGFLSG-EGFIQIEAEGSGILFLSAYGAIHEITLSQGEKYI 159
Query: 185 VDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD + +V SS+N +++ + ++ GG+ LV V GPG + IQS
Sbjct: 160 VDTNHLVLWESSMNYKVELLNGLFGSITGGEGLV-CVFEGPGKMLIQS 206
>gi|189188070|ref|XP_001930374.1| hypothetical protein PTRG_00041 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187971980|gb|EDU39479.1| hypothetical protein PTRG_00041 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 392
Score = 45.1 bits (105), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 91/206 (44%), Gaps = 13/206 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG+M MS +M ++ + + + L G ++ V PG + +A S+
Sbjct: 199 LTAKPGAMIAMSPTMTLKGAV----KFSLKKVLTGGDMSKSVYTGPG----EILLAPSSI 250
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I I LA G E D FL + V+ Q + + EG K++G G+
Sbjct: 251 GDITIIKLA--GKETWSVGRDGFLACTQGI-VTEYKSQGIGKGMFSGEGLFVYKISGTGV 307
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
+ + G++++K+L E +D +VA ++ +G I + G+ LV T
Sbjct: 308 LWCTSFGAIIRKDLAANEKYIIDNGHLVAWNCKYVLERIASGGIISNLTAGEGLVCK-FT 366
Query: 224 GPGIVFIQSLPFHRLSQRIARAVTSP 249
GPG VF+Q+ Q +A +P
Sbjct: 367 GPGTVFMQTRNAVAFGQWLAAHSAAP 392
>gi|104774038|ref|YP_619018.1| hypothetical protein Ldb1059 [Lactobacillus delbrueckii subsp.
bulgaricus ATCC 11842]
gi|103423119|emb|CAI97861.1| Conserved hypothetical protein [Lactobacillus delbrueckii subsp.
bulgaricus ATCC 11842]
Length = 233
Score = 45.1 bits (105), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 50/230 (21%), Positives = 100/230 (43%), Gaps = 18/230 (7%)
Query: 23 IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ +++ GG + + L E + G+M +M + + +G G++
Sbjct: 1 MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60
Query: 82 ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
+TS + + G +DG +G+A I+ + + G + C AFL S + V
Sbjct: 61 LTSGESMFITQATGDADGGQIGVAPAIPGEIVKLSV---GKQQYCLNTGAFLASDDSVSY 117
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+ + GG GF + G+G + A G +V+ + + +++D ++A +
Sbjct: 118 KMRSQKLSKAAFGGTGGFFVMETEGEGDMLVNAFGDLVELTVTSDKPLSIDNEHVIAWDA 177
Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+++ IK + +FG G+ LV TG G V IQ+ H L++ +
Sbjct: 178 NLDYDIK----VASGIFGFTSGEGLVNH-FTGDGKVLIQTRNLHSLAEAL 222
>gi|418030065|ref|ZP_12668581.1| hypothetical protein LDBUL1632_01375 [Lactobacillus delbrueckii
subsp. bulgaricus CNCM I-1632]
gi|354688283|gb|EHE88325.1| hypothetical protein LDBUL1632_01375 [Lactobacillus delbrueckii
subsp. bulgaricus CNCM I-1632]
Length = 233
Score = 45.1 bits (105), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 51/230 (22%), Positives = 100/230 (43%), Gaps = 18/230 (7%)
Query: 23 IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ +++ GG + + L E + G+M +M + + +G G++
Sbjct: 1 MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60
Query: 82 ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
+TS + + G +DG +G+A I+ + + G + C AFL S + V
Sbjct: 61 LTSGESMFITQATGDADGGQIGVAPAIPGEIVKLSV---GKQQYCLNTGAFLASDDSVSY 117
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+ + GG GF + G+G + A G +V+ + + I++D ++A +
Sbjct: 118 KMRSQKLSKAAFGGTGGFYVMQTEGEGDMLVNAFGDLVELTVTSDKPISIDNEHVIAWDA 177
Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+++ IK + +FG G+ LV TG G V IQ+ H L++ +
Sbjct: 178 NLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVLIQTRNLHSLAEAL 222
>gi|383830129|ref|ZP_09985218.1| TIGR00266 family protein [Saccharomonospora xinjiangensis XJ-54]
gi|383462782|gb|EID54872.1| TIGR00266 family protein [Saccharomonospora xinjiangensis XJ-54]
Length = 225
Score = 44.7 bits (104), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 54/215 (25%), Positives = 96/215 (44%), Gaps = 10/215 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V +++L P E G+M S M M+ G+ + + G V P+
Sbjct: 13 VARLLLAPHEPAQVESGAMVATSYGMHMQASTQGGVMKGLGRAILGGESLFVSTYTAPPN 72
Query: 94 DGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
G+V +A P RI+ +D + ++L + + +++ +T N+ GG
Sbjct: 73 GGWVDVAPGLPGDVRIIEMDGRV---GWCVTRGSWLANSHGIQL-DTKWGGFGNLFGGEG 128
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRA 210
GFL + GQG + G++ NL+ GE +T+D +VA +V Q+ K + + ++
Sbjct: 129 GFLTHAV-GQGQLVVACYGAIDVVNLQPGEYVTIDSGHVVAYADTVQSQLRKVSQGVIQS 187
Query: 211 VFGGDNLVTAVVTGPGIVFIQSL-PFHRLSQRIAR 244
+ G+ V GPG + Q+ P L+ IAR
Sbjct: 188 LKSGEGFVFD-FAGPGQILTQTRNPDALLAWLIAR 221
>gi|443311725|ref|ZP_21041350.1| TIGR00266 family protein [Synechocystis sp. PCC 7509]
gi|442778298|gb|ELR88566.1| TIGR00266 family protein [Synechocystis sp. PCC 7509]
Length = 230
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 45/197 (22%), Positives = 83/197 (42%), Gaps = 8/197 (4%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSDG 95
+ L P E + A G+M M + M + G+ + FG +++ V N
Sbjct: 26 LTLNPGETITAEAGAMTSMDSQLSMRTEFSGGLIPGLLKKFFGGESLFVNVFTNQTKDPL 85
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
+ + ++ I IDL E+ QP A++ + + G EG +
Sbjct: 86 KLTLTQSTIGDIDRIDLN--NSEICFQPGAYIAHTPGINMGVQWAGFASFFAG--EGLFK 141
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
+L+G+G F A G + +K + VD +VA + + IK G + +V G+
Sbjct: 142 LRLSGKGRVFFGAYGGITKK--RISNEFIVDSGHLVAYEPQIKMSIKLAGGLWGSVTSGE 199
Query: 216 NLVTAVVTGPGIVFIQS 232
+V ++G G +++QS
Sbjct: 200 GIVNR-LSGQGEIYLQS 215
>gi|431930110|ref|YP_007243156.1| hypothetical protein Thimo_0687 [Thioflavicoccus mobilis 8321]
gi|431828413|gb|AGA89526.1| TIGR00266 family protein [Thioflavicoccus mobilis 8321]
Length = 447
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 70/157 (44%), Gaps = 15/157 (9%)
Query: 98 GIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI--- 150
G A LA P D+A G L+ Q FL V+V +V+N+ G+
Sbjct: 294 GPGAVYLAPASPGDIAAVEVRPGDGLVIQRGGFLACTEGVEVGT----QVQNIAEGLFSG 349
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
EGF K G+GL + + G++ + NL VGE VD +VA + +++ ++ A
Sbjct: 350 EGFFVLKAHGEGLVLLESFGAIHELNLAVGEQKIVDNGHLVAWSQAMHYDLELGSRGLVA 409
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
F + GPG + +Q+ R Q+ R ++
Sbjct: 410 AFTSGEKIVCRFHGPGRILVQT----RQPQQFGRWIS 442
>gi|422843752|ref|ZP_16890462.1| hypothetical protein HMPREF5505_0127 [Lactobacillus delbrueckii
subsp. lactis DSM 20072]
gi|325686164|gb|EGD28214.1| hypothetical protein HMPREF5505_0127 [Lactobacillus delbrueckii
subsp. lactis DSM 20072]
Length = 246
Score = 44.7 bits (104), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 52/235 (22%), Positives = 101/235 (42%), Gaps = 18/235 (7%)
Query: 18 PQDAVIPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
Q ++ +++ GG + + L E + G+M +M + + +G
Sbjct: 9 KQREIMKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLG 68
Query: 77 LFGKTITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSV 130
G+++TS + + G +DG +G+A +IL + + G + C AFL S
Sbjct: 69 AIGRSLTSGESMFITQATGDADGGQIGVAPAIPGKILKLSV---GKQQYCLNTGAFLASD 125
Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
+ V + GG GF + G+G + A G +V+ + + I++D +
Sbjct: 126 DSVSYKMKSQSFSKAAFGGTGGFYVMQTEGKGDMLVNAFGDLVELTVTSNKPISIDNEHV 185
Query: 191 VAVTSSVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+A ++++ IK + +FG G+ LV TG G V IQ+ L++ +
Sbjct: 186 IAWDANLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVIIQTRNLRSLAEAL 235
>gi|332707812|ref|ZP_08427839.1| conserved hypothetical protein TIGR00266 [Moorea producens 3L]
gi|332353515|gb|EGJ33028.1| conserved hypothetical protein TIGR00266 [Moorea producens 3L]
Length = 219
Score = 44.7 bits (104), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 49/198 (24%), Positives = 85/198 (42%), Gaps = 10/198 (5%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSDG 95
+ L P E + A G+M M + M+ + G+ + FG +++ N
Sbjct: 16 VTLDPGESITAEAGAMTSMDAQLTMKTQFSGGFFSGLLKKFFGGESLLVNTFSNQTQQPL 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
+ + + I ++L G LC QP A++C VK+ G EG
Sbjct: 76 NLVLTQSIIGDIAGVEL---GERSLCFQPGAYICHTTGVKLGVRWAGFKSWFSG--EGLF 130
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
+ K++G+G F A G + +K + GE I VD +VA + + I G + +V G
Sbjct: 131 KLKVSGKGTVFFGAYGGLTRKQIN-GEFI-VDTGHLVAYEPGIMMNIGLAGGLLGSVTSG 188
Query: 215 DNLVTAVVTGPGIVFIQS 232
+ V + G G +++QS
Sbjct: 189 EGFVNR-LRGQGEIYLQS 205
>gi|443324897|ref|ZP_21053619.1| TIGR00266 family protein [Xenococcus sp. PCC 7305]
gi|442795499|gb|ELS04864.1| TIGR00266 family protein [Xenococcus sp. PCC 7305]
Length = 232
Score = 44.3 bits (103), Expect = 0.053, Method: Compositional matrix adjust.
Identities = 39/134 (29%), Positives = 65/134 (48%), Gaps = 14/134 (10%)
Query: 108 LPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
+P D+A + G L+ Q F+ + +++ NT Q + G E K TGQG
Sbjct: 85 VPGDIAHYYVEPGHNLMVQSSGFVACASTIEI-NTQFQGFKGFFSG-ESLFLLKATGQGD 142
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR-----RAVFGGDNLV 218
+ + G +++ +EV E VD S IVA S+N ++ G + +FGG+ LV
Sbjct: 143 FWFSSYGGILE--IEVSENYIVDTSYIVAFEDSLNYNVELIGGLSFNNLLTGIFGGEGLV 200
Query: 219 TAVVTGPGIVFIQS 232
+G G ++IQ+
Sbjct: 201 CR-FSGKGRLWIQT 213
>gi|322692627|gb|EFY84525.1| DUF124 domain protein [Metarhizium acridum CQMa 102]
Length = 325
Score = 44.3 bits (103), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 49/200 (24%), Positives = 93/200 (46%), Gaps = 14/200 (7%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
+ + L+P V ++ G+M MSG+++++ NI + M + G ++ PG
Sbjct: 123 ILNVNLQPGGTVRSKSGAMIHMSGTIQLQGNI-----KFSMKKLFTGGRMSESTYTGPG- 176
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
+ + + I+ + + G + + D FL DV V T Q V + E
Sbjct: 177 ---RLALGPTLMGDIITLHVDGRTGWTVGK-DTFLACTPDV-VMETKAQGVSKSMFSGED 231
Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
+++GQGL ++ + G+V + +L GE VD +VA + ++ K G ++
Sbjct: 232 LFVYRVSGQGLLWLTSFGAVDRLDLPPGEQHIVDNGHLVAWSCDYKIE-KAGGGAMSSLK 290
Query: 213 GGDNLVTAVVTGPGIVFIQS 232
G+ LV TGPG +++Q+
Sbjct: 291 TGEGLV-CRFTGPGAIYVQT 309
>gi|257388303|ref|YP_003178076.1| hypothetical protein Hmuk_2257 [Halomicrobium mukohataei DSM 12286]
gi|257170610|gb|ACV48369.1| protein of unknown function DUF124 [Halomicrobium mukohataei DSM
12286]
Length = 240
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 49/214 (22%), Positives = 87/214 (40%), Gaps = 17/214 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ L E ++A PG+M S ++ M+ E + + + G N ++G
Sbjct: 16 VELGSGETILAEPGAMTTHSSNISMDTTTSNEGLLSSAKSMLGG---ESFFANEFTAEG- 71
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
G +LA P D+ + EL D + +V+ +++ + GG++ L
Sbjct: 72 -GTGTVTLAPPTPGDVMQY--EL---SDETIYAVDGAWLASEPSISIETEFGGLKSMLAG 125
Query: 157 ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
L G G F+ A G + L GE VD +VA SV+ + ++
Sbjct: 126 ASITPLTLEGTGTVFVEAFGGLESIELGAGETYHVDNENVVAWDGSVDFDARRPDGVKST 185
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+ G+ LV TGPG V+ Q+ + + +A
Sbjct: 186 LLSGEGLVFE-FTGPGTVWYQTRGLNAFASAVAE 218
>gi|46137297|ref|XP_390340.1| hypothetical protein FG10164.1 [Gibberella zeae PH-1]
Length = 477
Score = 44.3 bits (103), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 55/109 (50%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DAFL S V V + Q + + EG K++G GL +I + G++++K L E
Sbjct: 354 DAFLASTQGV-VKDHKRQGLGKAMFSGEGLWVYKVSGTGLLWISSFGAIIKKTLVENEKY 412
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD ++A + ++ +G I + G+ LV TGPG V++Q+
Sbjct: 413 IVDNGHLIAWDTKYVMERVASGGIISGLASGEGLV-CKFTGPGTVYMQT 460
>gi|145493977|ref|XP_001432983.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124400099|emb|CAK65586.1| unnamed protein product [Paramecium tetraurelia]
Length = 231
Score = 44.3 bits (103), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 58/271 (21%), Positives = 114/271 (42%), Gaps = 57/271 (21%)
Query: 16 QSPQDAV---IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME---NIYIPEN 69
+ PQ+ + I F I+G V Q++L P E++ + + + S ++ ++ + +I EN
Sbjct: 2 EYPQNLLTQGIKFSIIGDN--VVQVVLLPGEQINTQNSYVQYYSDNISIQTKKSYFITEN 59
Query: 70 EVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
+V VL + +VGI+A + +++ +D A+F L+ + + +
Sbjct: 60 QV-------------TVLNTSKDNIAYVGISAAN-GKVMILDAAIFNNYLIKESS--IIA 103
Query: 130 VNDV-------KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEV 182
ND +T + V + LT Q L FI G+V+ K+L GE
Sbjct: 104 ANDFTEVGQFFHFQSTYQKIV----------FTKDLTKQQLIFIKTNGTVIDKDLGDGES 153
Query: 183 ITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
I V + I+ + V ++ + + G PG + F +Q +
Sbjct: 154 IIVPNNSILGIQDFVRYEVNLKDTTKIKLIG-----------PG-----RILFEINNQTL 197
Query: 243 ARAVTSPNMRENPKFFVQIAIFFFLAYVVIV 273
+ + KFF+ +++ FFL +V ++
Sbjct: 198 ENNLGLVGRKYLAKFFMIMSMVFFLIFVELI 228
>gi|119485102|ref|ZP_01619487.1| hypothetical protein L8106_06629 [Lyngbya sp. PCC 8106]
gi|119457330|gb|EAW38455.1| hypothetical protein L8106_06629 [Lyngbya sp. PCC 8106]
Length = 221
Score = 43.9 bits (102), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 94/201 (46%), Gaps = 19/201 (9%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIY-------IPENEVGMWQWLFGKTITSVVLRNPG 91
L+P + ++A G+M M G + M+ + +N +G + LF T T+ +
Sbjct: 18 LQPGDSIIAEAGAMVGMEGKISMKTSFSGGFFSAFIKNLLG-GESLFVNTFTN---KTNQ 73
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
P + V ++ ++ I+ ++L G EL QP A++ S VK+ G E
Sbjct: 74 PLE--VVLSQATIGDIVAVELQR-GQELCFQPGAYIASSKPVKLGVGWAGFASWFAG--E 128
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G + K+TG GL F A G + QK ++ + VD +VA + + + G + ++
Sbjct: 129 GLFKLKVTGPGLVFFGAYGGITQK--QITQDFIVDNGHLVAYEPGIKMGVGLAGGLLGSL 186
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ + + G G++++QS
Sbjct: 187 TSGEGFINK-LRGNGVIYLQS 206
>gi|167042544|gb|ABZ07268.1| putative protein of unknown function DUF124 [uncultured marine
microorganism HF4000_ANIW133F6]
gi|167043072|gb|ABZ07783.1| putative protein of unknown function DUF124 [uncultured marine
microorganism HF4000_ANIW141C7]
Length = 229
Score = 43.9 bits (102), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 43/165 (26%), Positives = 71/165 (43%), Gaps = 4/165 (2%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVG 98
L P E++ PG+M S ++M++ G+ + G + GP+ G+V
Sbjct: 18 LDPNEQMKVEPGAMVAQSPGVDMKSGMSGGFFKGLKKMALGGESFILNTYTAGPNGGWVS 77
Query: 99 IAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKL 158
IA S I DLA G L Q AF+ S +V+ ++T Q + + G F +
Sbjct: 78 IAPGSPGDIGSFDLAP-GENLFMQGGAFIASTANVE-TDTKFQGAKGLFSGESAFFLKAS 135
Query: 159 T--GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
T G G F A G++ + + VD +VA + V ++
Sbjct: 136 TTGGAGTVFYSAYGAIKEIEITPATPAIVDTGHVVAFSDGVGYKL 180
>gi|391873276|gb|EIT82329.1| hypothetical protein Ao3042_00482 [Aspergillus oryzae 3.042]
Length = 326
Score = 43.9 bits (102), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 56/229 (24%), Positives = 96/229 (41%), Gaps = 24/229 (10%)
Query: 13 YVYQSPQDAVIPFQILGGEAQVCQ--------IMLKPQEKVVARPGSMCFMSGSMEMENI 64
YV S QD V F GG ++ + L + A+PG M MSG++ +
Sbjct: 100 YVPASNQDEVGTFN--GGSFRISHRDTNSVLTLQLAMGCPIEAKPGVMIAMSGNISLR-- 155
Query: 65 YIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQP 123
G ++ F K + + + G + +A P L I I L + G E
Sbjct: 156 -------GGAKFSFMKMLAGSMTFSTYTGPGELLLAPPFLGDI--IVLRLNGNESWKVGK 206
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
D FL + + ++ + V G EGF+ ++G GL ++ + G+V++K++ GE
Sbjct: 207 DGFLAATSGIEKDYKSQGLTKAVFSG-EGFIIYHMSGVGLLWLQSFGAVIRKDIAEGETY 265
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD +VA + +G + + L GPG V++Q+
Sbjct: 266 LVDNGYLVAWNCKYKMTRAASGGMWSTYSSAEGLACK-FEGPGTVYLQT 313
>gi|428207315|ref|YP_007091668.1| hypothetical protein Chro_2307 [Chroococcidiopsis thermalis PCC
7203]
gi|428009236|gb|AFY87799.1| protein of unknown function DUF124 [Chroococcidiopsis thermalis PCC
7203]
Length = 225
Score = 43.9 bits (102), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 92/207 (44%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MWQWLFGKTITSVV 86
++ + I + E++VA GSM MS + G + + + G+++ +
Sbjct: 10 DSAIAHIRMSAGEELVAEAGSMIAMSSYINASTTLRQGKSGGILGGLKRMMAGESLFLSI 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P DG V +A + IL ++ FG L+ Q ++L S +V + N Q R+
Sbjct: 70 FRSP-THDGEVFLAPKLMGDILIYQMSSFG--LVVQASSYLASEANVDI-NVGFQGFRSF 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
G E + G G + + G++ + ++ GE I VD IVA S+N I K
Sbjct: 126 FSG-ESIFWLDIGGSGQVILTSFGAIYEILVD-GEYI-VDTGHIVAFEKSLNFSITKVGS 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ A GG+ L G G +F Q+
Sbjct: 183 SLIGAFLGGEGLACR-FQGKGRLFCQT 208
>gi|448739945|ref|ZP_21721938.1| hypothetical protein C451_20492 [Halococcus thailandensis JCM
13552]
gi|445797968|gb|EMA48404.1| hypothetical protein C451_20492 [Halococcus thailandensis JCM
13552]
Length = 205
Score = 43.5 bits (101), Expect = 0.089, Method: Compositional matrix adjust.
Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 7/119 (5%)
Query: 148 GGIEGFLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
GG+E L L G G FI G + +L+ GE VD +VA SSV
Sbjct: 87 GGLESMLSGASLTPLALNGTGSVFIETFGGIETIDLDHGESYVVDNDHVVAWESSVEFDA 146
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQ 260
+ G + V G+ LV TGPG V+ Q+ ++ I A+ + F ++
Sbjct: 147 RRVGGLASTVLSGEGLVME-FTGPGTVWYQTRGLDTFTESIVEAMPGTGEDDGGGFNIE 204
>gi|302653267|ref|XP_003018462.1| hypothetical protein TRV_07527 [Trichophyton verrucosum HKI 0517]
gi|291182110|gb|EFE37817.1| hypothetical protein TRV_07527 [Trichophyton verrucosum HKI 0517]
Length = 191
Score = 43.5 bits (101), Expect = 0.090, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 75/161 (46%), Gaps = 9/161 (5%)
Query: 73 MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVN 131
M + L G ++S PG + APS+ + I L + G E DAFL
Sbjct: 18 MKKMLAGGEMSSSTFTGPGEL-----LLAPSMLGDISI-LRLNGQEQWSVGKDAFLACTQ 71
Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
V V + +Q + EG K++G G+ F+ + G++++K+L GE VD +V
Sbjct: 72 GV-VKDYKNQGLSKAFFSGEGLFVYKISGTGILFVQSFGAIIRKDLVDGEKYIVDNGHLV 130
Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ ++ +G + + G+ LV TGPG +++Q+
Sbjct: 131 SWNCKYVMERVASGGVLSNLSSGEGLVCK-FTGPGTIYLQT 170
>gi|238484645|ref|XP_002373561.1| DUF124 domain protein [Aspergillus flavus NRRL3357]
gi|220701611|gb|EED57949.1| DUF124 domain protein [Aspergillus flavus NRRL3357]
Length = 131
Score = 43.5 bits (101), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 29/110 (26%), Positives = 53/110 (48%), Gaps = 2/110 (1%)
Query: 123 PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEV 182
DAFL + V+ + V G EGF+ ++G GL ++ + G++++K++ G+
Sbjct: 11 KDAFLAKTSGVEKDYKSQGMTKAVFSG-EGFIIYHMSGVGLVWLQSFGAIIRKDIPEGKT 69
Query: 183 ITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD IVA ++ +G + A F + GPG V++Q+
Sbjct: 70 YLVDNGYIVAWNCKYKIERAASGGLLSA-FSSSEGLACKFEGPGTVYLQT 118
>gi|390629838|ref|ZP_10257830.1| TIGR00266 family protein [Weissella confusa LBAE C39-2]
gi|390485010|emb|CCF30178.1| TIGR00266 family protein [Weissella confusa LBAE C39-2]
Length = 237
Score = 43.5 bits (101), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 47/204 (23%), Positives = 79/204 (38%), Gaps = 12/204 (5%)
Query: 39 LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS-----VVLRNPG 91
L P E V + G+M + +G E++ G+ + + G+++ S + G
Sbjct: 18 LLPHETVRIQSGAMVYHTGDTELKGRLNANGGSGIGKLIRAAGRSMVSGESVFITEVTAG 77
Query: 92 PSDGFVGIA--APSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
G + +A P + L + AFL V+ S R GG
Sbjct: 78 SQGGELALAPNVPGTIQALEVTADK---NYYLNDSAFLAMDGTVQYSMERQSLGRAFFGG 134
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
G K +G+G + A GS+ + +L+ T+D + +VA +S+N I G
Sbjct: 135 QGGLFVMKTSGEGTLLVAAFGSIKEIDLQDAHDFTIDNAHVVAWETSLNYDIHLEGGGVI 194
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSL 233
G + G G + IQSL
Sbjct: 195 GSIGTGEGIVNTFNGTGKILIQSL 218
>gi|359689649|ref|ZP_09259650.1| hypothetical protein LlicsVM_14727 [Leptospira licerasiae serovar
Varillal str. MMD0835]
gi|418749868|ref|ZP_13306156.1| TIGR00266 family protein [Leptospira licerasiae str. MMD4847]
gi|418759263|ref|ZP_13315443.1| TIGR00266 family protein [Leptospira licerasiae serovar Varillal
str. VAR 010]
gi|384113754|gb|EIE00019.1| TIGR00266 family protein [Leptospira licerasiae serovar Varillal
str. VAR 010]
gi|404274753|gb|EJZ42071.1| TIGR00266 family protein [Leptospira licerasiae str. MMD4847]
Length = 253
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 53/206 (25%), Positives = 94/206 (45%), Gaps = 20/206 (9%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ ++ L P + + + G+M +MS M +E + + +FG S
Sbjct: 45 LLKLRLGPGQSIKSEAGAMVYMSSRMGVETKMGSGFLSALSRKIFGGESFFFNTYTAPDS 104
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G +G+A P L + IDL + G + Q ++L S ++V + GGI
Sbjct: 105 GGEIGLA-PDLPGDI-IDLDLVGKSIFVQSGSYLASDPGIQVVSKF--------GGIRSL 154
Query: 154 LRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGP 206
L + ++G G F+ + G++V ++V TVD IVA +S+ ++ K G
Sbjct: 155 LGGEGLFLLEISGTGKVFLSSYGAIVP--IQVQGNYTVDTGHIVAFENSLQFKVGKAGGN 212
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ + GG+ LV A +G G ++IQS
Sbjct: 213 WKSTLLGGEGLV-ANFSGNGTLWIQS 237
>gi|119485103|ref|ZP_01619488.1| hypothetical protein L8106_06634 [Lyngbya sp. PCC 8106]
gi|119457331|gb|EAW38456.1| hypothetical protein L8106_06634 [Lyngbya sp. PCC 8106]
Length = 227
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 95/207 (45%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMS----GSMEMENIYIPENEVGMWQWLFGKTITSVV 86
+ + ++ L +E+++A G+M MS S + G+ + + G+++ V
Sbjct: 10 DCAIARVTLDTREELIAEAGAMVAMSEYINTSTTLRQGKGGGILGGLKRMVGGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P + G V +A L IL ++ G L+ Q ++L S +DV + Q +++
Sbjct: 70 FRSP-IAGGEVFLAPKLLGDILHYQVSEIG--LVVQATSYLASHSDVNIDLGF-QGFKSL 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
G E LTGQG I + G + + ++E + VD IVA ++N +I G
Sbjct: 126 FSG-ENIFWLDLTGQGDVLINSFGGIYEVDVE--DEYIVDTGHIVAFEKTLNFEITKPGS 182
Query: 207 IRRAVF-GGDNLVTAVVTGPGIVFIQS 232
F GG+ LV G G ++ Q+
Sbjct: 183 SWLGAFLGGEGLVCR-FRGQGKLYCQT 208
>gi|429858192|gb|ELA33021.1| duf124 domain-containing protein [Colletotrichum gloeosporioides
Nara gc5]
Length = 442
Score = 43.5 bits (101), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 52/189 (27%), Positives = 87/189 (46%), Gaps = 13/189 (6%)
Query: 45 VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
+ A+PG M MS S+ + + + M + + G + S PG + +A P L
Sbjct: 248 LTAKPGVMVAMSHSITLRG----QIKFSMKKLVAGAELASSTFVGPG----ELLLAPPML 299
Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I L + G E DA++ S V + + Q + I EG K++G GL
Sbjct: 300 GDI--TSLRLTGKETWSVGQDAYVASTQGV-IKDYKRQGLSKAIFSGEGLYVYKISGTGL 356
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
++ + G++++K+L GE VD +VA ++ +G I + G+ LV T
Sbjct: 357 LWLTSFGAIIRKDLMEGEKYVVDNGHLVAWNVKYIMERVASGGIISGISSGEGLVCK-FT 415
Query: 224 GPGIVFIQS 232
GPG VF+Q+
Sbjct: 416 GPGTVFMQT 424
>gi|392570465|gb|EIW63638.1| DUF124-domain-containing protein [Trametes versicolor FP-101664
SS1]
Length = 238
Score = 43.1 bits (100), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 52/215 (24%), Positives = 95/215 (44%), Gaps = 11/215 (5%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
+ L+P +V A+ GSM M +++++ + + + + L G ++ PG
Sbjct: 23 LQLEPGAEVKAKSGSMVAMDATVKIKG----KLKFSVAKLLSGDKMSESTFTGPGE---- 74
Query: 97 VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
V IA + I+PI L + DA+L V N + ++ G E
Sbjct: 75 VMIAPSTWGDIVPIYLDGHT-QWSVGKDAYLACTIGVTHKNKSQGVGKALLSG-EFLSAY 132
Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
+ G G+ +I + G++ + L+ GE VD +VA T++ + G + ++ ++
Sbjct: 133 NVIGAGIMWITSLGAIYSRTLQPGEQWIVDKGHLVAWTATCKDEDIDAGGLFSSMNTDED 192
Query: 217 LVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
V TGPGIV+IQ+ L I V P++
Sbjct: 193 SVCRF-TGPGIVYIQTRNPETLGGWINDRVNPPDV 226
>gi|332638675|ref|ZP_08417538.1| hypothetical protein WcibK1_08272 [Weissella cibaria KACC 11862]
Length = 220
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 46/209 (22%), Positives = 83/209 (39%), Gaps = 8/209 (3%)
Query: 41 PQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS-----VVLRNPGPS 93
P E V + G+M + +G +++ G+ + L G+ + S + + G
Sbjct: 3 PNETVRIQSGAMVYHTGDTQLKGRLNANGGSGIGKLLRAAGRAMVSGESVFITEVSAGAK 62
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
G + +A I +++ G AFL V+ S R ++GG G
Sbjct: 63 GGDLALAPAVPGTIQALEVTP-GRNYYLNDSAFLAMDGTVQYSMERQSVGRAILGGQGGL 121
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
K +G+G + A GS+ + +L+ T+D + +VA +S+ I G + G
Sbjct: 122 FVMKTSGEGTLLVAAFGSIKEIDLQDAHDFTIDNAHVVAWETSLTYDIHLEGGGLISSIG 181
Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
V G G V +QSL + +
Sbjct: 182 TGEGVVNTFNGTGKVLVQSLNLENFANML 210
>gi|172036227|ref|YP_001802728.1| hypothetical protein cce_1312 [Cyanothece sp. ATCC 51142]
gi|354553022|ref|ZP_08972329.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
gi|171697681|gb|ACB50662.1| hypothetical protein cce_1312 [Cyanothece sp. ATCC 51142]
gi|353554852|gb|EHC24241.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
Length = 219
Score = 43.1 bits (100), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 5/142 (3%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV-VLRNPGPSDG 95
+ L P EK+ + GS+ M G + + + + + FG T V L N
Sbjct: 16 VTLNPGEKITVKMGSIISMDGEVTVHTGFCGSWLSAIVRKCFGGTDIFVDYLTNETEEPI 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
VG++ ++ I IDL+ G + QP L VK+ N ++G +G +
Sbjct: 76 TVGLSQTTIGDIERIDLSQ--GPICLQPRVLLAYTKGVKIENHWGGFGSWLVG--DGLFK 131
Query: 156 QKLTGQGLAFILAGGSVVQKNL 177
K G+G FI A G +++K +
Sbjct: 132 TKCKGKGRVFIGAYGGIIKKTI 153
>gi|385830079|ref|YP_005867892.1| hypothetical protein CVCAS_0496 [Lactococcus lactis subsp. lactis
CV56]
gi|418037761|ref|ZP_12676129.1| hypothetical protein LLCRE1631_00936 [Lactococcus lactis subsp.
cremoris CNCM I-1631]
gi|326406087|gb|ADZ63158.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis
CV56]
gi|354694191|gb|EHE93879.1| hypothetical protein LLCRE1631_00936 [Lactococcus lactis subsp.
cremoris CNCM I-1631]
Length = 232
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 92/222 (41%), Gaps = 19/222 (8%)
Query: 32 AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS----- 84
A + +I L E++ G+M + + ++++E + G+ L G++ITS
Sbjct: 11 APLVEISLDNNEEIQIESGAMVYHNAAVKLEGKMNSNGKGGLGDALKALGRSITSGESFF 70
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRV 143
+ + ++G + +A SL I I + G E AFL S V +
Sbjct: 71 ITKASAIGANGKISLAPASLGSIKEI---LVGKEQWNLNTGAFLVSEGGVHYIMERQKLD 127
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
+ GG G K G G I G +V+ L+ E VD ++A T S++ I+
Sbjct: 128 GALFGGTGGLFVMKTQGSGKMLISGYGDIVEIALDGTEDFVVDNQHVLAWTESLSYSIE- 186
Query: 204 NGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ FG G+ LV G G + IQS L+Q I
Sbjct: 187 ---VASGTFGFKTGEGLVNK-FRGKGKILIQSRNVEALAQSI 224
>gi|297617355|ref|YP_003702514.1| hypothetical protein Slip_1176 [Syntrophothermus lipocalidus DSM
12680]
gi|297145192|gb|ADI01949.1| protein of unknown function DUF124 [Syntrophothermus lipocalidus
DSM 12680]
Length = 230
Score = 43.1 bits (100), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 54/210 (25%), Positives = 86/210 (40%), Gaps = 21/210 (10%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK----TITSV 85
E V +I L P E V+A PG+M M+ ++E++ + + M + FGK T T
Sbjct: 10 EYAVAKIKLLPGESVIAEPGAMIAMTPNLEVKAEFARGGVLKSMARSFFGKESFFTTTFT 69
Query: 86 VLRNPGPSDGFVGIAAPSLARILPIDL---AMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
L G A L R P D+ ++ + Q AF+ V V N
Sbjct: 70 AL----------GEAELVLGRGQPGDMWCVSLNNETIYAQSGAFVACTPGVHV-NAGWGG 118
Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
R G FL + +TG GL ++ G++ + L +GE +D + + +
Sbjct: 119 ARGFFAGNMLFLLE-ITGTGLVWLATFGALRETTLGLGEDYVIDTGHVAGFHGNTMYTVT 177
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
G + + G+ V GPG V+ QS
Sbjct: 178 SAGGFKSFLTSGEGFVCR-FRGPGKVWTQS 206
>gi|449296420|gb|EMC92440.1| hypothetical protein BAUCODRAFT_27721 [Baudoinia compniacensis UAMH
10762]
Length = 373
Score = 42.7 bits (99), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 29/109 (26%), Positives = 53/109 (48%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DAF+ V + + G EG K+ G G+ ++ + G++++K+L GE
Sbjct: 252 DAFMACTQGVTKDYKAQSLSKAMFSG-EGLFVYKIGGVGIVWVQSMGAIIRKDLREGEKY 310
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+D +VA + ++ +G + + G+ LV TGPG VF+Q+
Sbjct: 311 IIDNGHLVAWNCNYVLERIASGGLVSTLSAGEGLVCKF-TGPGTVFLQT 358
>gi|398339049|ref|ZP_10523752.1| hypothetical protein LkirsB1_05507 [Leptospira kirschneri serovar
Bim str. 1051]
gi|418676896|ref|ZP_13238174.1| TIGR00266 family protein [Leptospira kirschneri serovar
Grippotyphosa str. RM52]
gi|418688238|ref|ZP_13249394.1| TIGR00266 family protein [Leptospira kirschneri serovar
Grippotyphosa str. Moskva]
gi|418696512|ref|ZP_13257521.1| TIGR00266 family protein [Leptospira kirschneri str. H1]
gi|418739949|ref|ZP_13296330.1| TIGR00266 family protein [Leptospira kirschneri serovar Valbuzzi
str. 200702274]
gi|421089058|ref|ZP_15549873.1| TIGR00266 family protein [Leptospira kirschneri str. 200802841]
gi|421109871|ref|ZP_15570378.1| TIGR00266 family protein [Leptospira kirschneri str. H2]
gi|421130951|ref|ZP_15591142.1| TIGR00266 family protein [Leptospira kirschneri str. 2008720114]
gi|400322796|gb|EJO70652.1| TIGR00266 family protein [Leptospira kirschneri serovar
Grippotyphosa str. RM52]
gi|409956041|gb|EKO14973.1| TIGR00266 family protein [Leptospira kirschneri str. H1]
gi|410002179|gb|EKO52701.1| TIGR00266 family protein [Leptospira kirschneri str. 200802841]
gi|410004898|gb|EKO58702.1| TIGR00266 family protein [Leptospira kirschneri str. H2]
gi|410357755|gb|EKP04975.1| TIGR00266 family protein [Leptospira kirschneri str. 2008720114]
gi|410737095|gb|EKQ81837.1| TIGR00266 family protein [Leptospira kirschneri serovar
Grippotyphosa str. Moskva]
gi|410753071|gb|EKR10043.1| TIGR00266 family protein [Leptospira kirschneri serovar Valbuzzi
str. 200702274]
Length = 224
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 65/135 (48%), Gaps = 15/135 (11%)
Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG----FLRQK 157
LA LP D+ G + Q +FL S +++ +D + + + G I G FLR
Sbjct: 80 LAPTLPGDVERIELSGTVYVQSSSFLASSPNIE----MDTKFQGLKGFISGESLFFLR-- 133
Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
L+G G I + G + N++ GE+I VD IVA +N ++ G + GG+
Sbjct: 134 LSGNGFLLISSYGGIDVLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGF 191
Query: 218 VTAVVTGPGIVFIQS 232
V A G G V+IQS
Sbjct: 192 V-ARFKGKGKVWIQS 205
>gi|222099271|ref|YP_002533839.1| hypothetical protein CTN_0297 [Thermotoga neapolitana DSM 4359]
gi|221571661|gb|ACM22473.1| Putative uncharacterized protein [Thermotoga neapolitana DSM 4359]
Length = 211
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 49/173 (28%), Positives = 76/173 (43%), Gaps = 20/173 (11%)
Query: 28 LGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI----- 82
L G + ++ L E VVA PG+M +M G +E++ + G+W+ L +
Sbjct: 7 LKGSYALLRVFLSGGESVVAEPGAMVYMRGPIEVQTSVMG----GVWKALKRAVLGGENF 62
Query: 83 --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
+ V R G GF AP L ID+ G L Q ++L S V++ +
Sbjct: 63 FMNTYVSRGEG-EIGF----APQLPG--DIDVIPMNGTLYVQSTSYLASDPSVEMDVSFG 115
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+ G EG KL G+G + + G + L+ GE ITVD +VA+
Sbjct: 116 G-FKTFFAG-EGIFLLKLQGEGDVAVSSFGGIRSVTLQPGEEITVDTGHVVAL 166
>gi|375102749|ref|ZP_09749012.1| TIGR00266 family protein [Saccharomonospora cyanea NA-134]
gi|374663481|gb|EHR63359.1| TIGR00266 family protein [Saccharomonospora cyanea NA-134]
Length = 226
Score = 42.7 bits (99), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 53/216 (24%), Positives = 96/216 (44%), Gaps = 11/216 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGP 92
V +++L P E G+M S M M+ G+ + +L G+++ P P
Sbjct: 13 VARLLLAPNEPAQVESGAMVATSYGMHMQANTQGGVMKGLGRAFLGGESLFVSTYTAPPP 72
Query: 93 SDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
+ G+V +A P R++ +D + ++L + + + + N+ GG
Sbjct: 73 NGGWVDVAGGLPGDIRVIEMDGRV---GWCVTRGSWLANSHGIHLETKWGG-FGNLFGGE 128
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
GFL GQG + G++ NL+ GE +T+D +VA +V Q+ K + + +
Sbjct: 129 GGFLTHA-QGQGQLVVSCYGAIDVVNLQPGEYVTIDSGHVVAYADTVQSQLRKVSQGVIQ 187
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSL-PFHRLSQRIAR 244
++ G+ V GPG + Q+ P L+ IAR
Sbjct: 188 SLKSGEGFVFDFA-GPGQILTQTRNPDALLAWLIAR 222
>gi|126657121|ref|ZP_01728292.1| hypothetical protein CY0110_28484 [Cyanothece sp. CCY0110]
gi|126621664|gb|EAZ92374.1| hypothetical protein CY0110_28484 [Cyanothece sp. CCY0110]
Length = 219
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/151 (24%), Positives = 68/151 (45%), Gaps = 9/151 (5%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT---ITSVVLRNPGPS 93
+ L P EK+ + GS+ M G + ++ + + + + FG T + S++ P
Sbjct: 16 VTLNPGEKITVKTGSLVSMDGEITVKTGFCGAWQSALLRKCFGGTDIFVDSLINETAEPI 75
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
V ++ + I IDL+ G + +P FL VK+ N +G +G
Sbjct: 76 T--VVLSQTTTGDIERIDLSQ--GSICLRPGVFLAYTKGVKIENRWAGFGSWWVG--DGL 129
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVIT 184
+ +L G+G FI A G +++K + V+T
Sbjct: 130 FQTQLKGKGRVFIAAYGGIIKKTVYQDLVMT 160
>gi|67922246|ref|ZP_00515760.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 8501]
gi|416389341|ref|ZP_11685340.1| hypothetical protein CWATWH0003_2162 [Crocosphaera watsonii WH
0003]
gi|67855949|gb|EAM51194.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 8501]
gi|357264250|gb|EHJ13163.1| hypothetical protein CWATWH0003_2162 [Crocosphaera watsonii WH
0003]
Length = 219
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 74/165 (44%), Gaps = 7/165 (4%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV-VLRNPGPSDG 95
+ LKP+E + + S+ M G + ++ + + + FG T V L N
Sbjct: 16 VTLKPEETITVKTDSIISMDGKVTVKTKFFGFWLFALLRKCFGGTDVFVDYLINEKDYPI 75
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
V ++ ++ I IDL+ G + QP FL VK+ N + G EG +
Sbjct: 76 TVVLSQTTIGDIERIDLSE--GSICLQPGVFLAYTKGVKIKNHWAGFGSWLAG--EGLFK 131
Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
KL G+G FI A G ++++ V + + + ++A TS +++
Sbjct: 132 TKLKGKGRVFIAAYGGIIKQT--VYQDLEMSQGHLLAYTSKTSLK 174
>gi|24215001|ref|NP_712482.1| hypothetical protein LA_2301 [Leptospira interrogans serovar Lai
str. 56601]
gi|45657508|ref|YP_001594.1| hypothetical protein LIC11636 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|386074340|ref|YP_005988657.1| hypothetical protein LIF_A1879 [Leptospira interrogans serovar Lai
str. IPAV]
gi|417762747|ref|ZP_12410735.1| TIGR00266 family protein [Leptospira interrogans str. 2002000624]
gi|417772197|ref|ZP_12420086.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|417777661|ref|ZP_12425478.1| TIGR00266 family protein [Leptospira interrogans str. 2002000621]
gi|417783259|ref|ZP_12430982.1| TIGR00266 family protein [Leptospira interrogans str. C10069]
gi|418667806|ref|ZP_13229211.1| TIGR00266 family protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|418671665|ref|ZP_13233014.1| TIGR00266 family protein [Leptospira interrogans str. 2002000623]
gi|418680995|ref|ZP_13242232.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|418692429|ref|ZP_13253507.1| TIGR00266 family protein [Leptospira interrogans str. FPW2026]
gi|418698038|ref|ZP_13259017.1| TIGR00266 family protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|418704255|ref|ZP_13265134.1| TIGR00266 family protein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|418711851|ref|ZP_13272603.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|418717170|ref|ZP_13277007.1| TIGR00266 family protein [Leptospira interrogans str. UI 08452]
gi|418726178|ref|ZP_13284789.1| TIGR00266 family protein [Leptospira interrogans str. UI 12621]
gi|418730486|ref|ZP_13288980.1| TIGR00266 family protein [Leptospira interrogans str. UI 12758]
gi|421085985|ref|ZP_15546836.1| TIGR00266 family protein [Leptospira santarosai str. HAI1594]
gi|421102588|ref|ZP_15563192.1| TIGR00266 family protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|421117573|ref|ZP_15577933.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|421120883|ref|ZP_15581188.1| TIGR00266 family protein [Leptospira interrogans str. Brem 329]
gi|421124191|ref|ZP_15584461.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|421135007|ref|ZP_15595137.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|24196046|gb|AAN49500.1|AE011400_5 conserved hypothetical protein [Leptospira interrogans serovar Lai
str. 56601]
gi|45600747|gb|AAS70231.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|353458129|gb|AER02674.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
str. IPAV]
gi|400327433|gb|EJO79685.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|400357662|gb|EJP13782.1| TIGR00266 family protein [Leptospira interrogans str. FPW2026]
gi|409941492|gb|EKN87121.1| TIGR00266 family protein [Leptospira interrogans str. 2002000624]
gi|409945568|gb|EKN95583.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|409953960|gb|EKO08456.1| TIGR00266 family protein [Leptospira interrogans str. C10069]
gi|409960088|gb|EKO23842.1| TIGR00266 family protein [Leptospira interrogans str. UI 12621]
gi|410010907|gb|EKO69038.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|410020890|gb|EKO87685.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|410346221|gb|EKO97231.1| TIGR00266 family protein [Leptospira interrogans str. Brem 329]
gi|410367702|gb|EKP23086.1| TIGR00266 family protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410431550|gb|EKP75910.1| TIGR00266 family protein [Leptospira santarosai str. HAI1594]
gi|410438678|gb|EKP87764.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|410572648|gb|EKQ35713.1| TIGR00266 family protein [Leptospira interrogans str. 2002000621]
gi|410581363|gb|EKQ49175.1| TIGR00266 family protein [Leptospira interrogans str. 2002000623]
gi|410756251|gb|EKR17876.1| TIGR00266 family protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|410762742|gb|EKR28901.1| TIGR00266 family protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|410766235|gb|EKR36923.1| TIGR00266 family protein [Leptospira interrogans serovar Hebdomadis
str. R499]
gi|410767817|gb|EKR43078.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|410774695|gb|EKR54699.1| TIGR00266 family protein [Leptospira interrogans str. UI 12758]
gi|410787237|gb|EKR80971.1| TIGR00266 family protein [Leptospira interrogans str. UI 08452]
gi|455667415|gb|EMF32736.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
str. Fox 32256]
gi|455792745|gb|EMF44485.1| TIGR00266 family protein [Leptospira interrogans serovar Lora str.
TE 1992]
gi|456821475|gb|EMF69981.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
str. LT1962]
gi|456986315|gb|EMG21911.1| TIGR00266 family protein [Leptospira interrogans serovar
Copenhageni str. LT2050]
Length = 224
Score = 42.7 bits (99), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 7/131 (5%)
Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
LA LP D+ G + Q +FL S ++++ +T Q ++ I G E +L+G
Sbjct: 80 LAPTLPGDVERIELSGTVYVQSSSFLASSPNIEM-DTKFQGLKGFISG-ESLFFLRLSGN 137
Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
G I + G + N++ GE+I VD IVA +N ++ G + GG+ V A
Sbjct: 138 GFLLISSYGGIDVLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGFV-AR 194
Query: 222 VTGPGIVFIQS 232
G G V+IQS
Sbjct: 195 FKGKGKVWIQS 205
>gi|310793723|gb|EFQ29184.1| hypothetical protein GLRG_04328 [Glomerella graminicola M1.001]
Length = 456
Score = 42.7 bits (99), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 86/187 (45%), Gaps = 13/187 (6%)
Query: 47 ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
A+PG M MS S+ + + + M + + G + S PG + +A P L
Sbjct: 264 AKPGVMVAMSHSVTLRG----QIKFSMKKLVAGAEMASSTFVGPG----ELLLAPPMLGD 315
Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
+ L + G E DA+L S V + + Q + I EG K++G GL +
Sbjct: 316 V--TSLRLTGKETWSVGHDAYLASTQGV-IKDHKRQGLSKAIFSGEGLFVYKISGTGLLW 372
Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
+ + G++++K+L GE VD +VA ++ +G I + G+ LV TGP
Sbjct: 373 LTSFGAIIRKDLMDGEKYIVDNGHLVAWNVKYIMERVASGGIISGISSGEGLV-CKFTGP 431
Query: 226 GIVFIQS 232
G VF+Q+
Sbjct: 432 GTVFMQT 438
>gi|325848476|ref|ZP_08170136.1| hypothetical protein HMPREF9246_1944 [Anaerococcus hydrogenalis
ACS-025-V-Sch4]
gi|325480704|gb|EGC83761.1| hypothetical protein HMPREF9246_1944 [Anaerococcus hydrogenalis
ACS-025-V-Sch4]
Length = 256
Score = 42.4 bits (98), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 45/202 (22%), Positives = 88/202 (43%), Gaps = 26/202 (12%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
C + + P V ++ G+M +M G +E+ + +G + F K+I V +
Sbjct: 55 ACDLSISP---VTSQTGAMQWMVGDVELTSGI---KGIGDF---FSKSIKGSVTKESAVK 105
Query: 94 DGFVG----IAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
+ G I P+ I+ IDLA + ++ + FL +++K + + + G
Sbjct: 106 PEYRGSGRVILEPTYKHIILIDLAEWNNSIVLEDGYFLACTSNLKQKIKTRTNISSSLMG 165
Query: 150 IEGFLRQKLTGQGLAFI---LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
EG KL G G+ + +A +++ LE + I +D S +A + S++ +K
Sbjct: 166 NEGMFNLKLEGDGILALESPIAKDDLIEIELE-NDQIKIDGSYAIAWSDSLDFSVKK--- 221
Query: 207 IRRAVFGGDNLVTAVVTGPGIV 228
+LV++ +T G V
Sbjct: 222 ------ASKSLVSSAMTEEGFV 237
>gi|124003941|ref|ZP_01688788.1| protein of unknown function [Microscilla marina ATCC 23134]
gi|123990520|gb|EAY30000.1| protein of unknown function [Microscilla marina ATCC 23134]
Length = 482
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 26/97 (26%), Positives = 57/97 (58%), Gaps = 5/97 (5%)
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
EG + +KL G G+ F+ A G VV++ LE + I V+++ +VA S+ + + + +
Sbjct: 285 EGLVIEKLEGDGMVFLKAKGDVVERILE-DDAIRVNLASVVAFEPSIELDMDSIQSL-ES 342
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
+ D ++ A+ +G G++++QS ++++ I + +T
Sbjct: 343 LQDADPVILALFSGSGMLWLQS---NQMTPHIIQHIT 376
>gi|448562034|ref|ZP_21635167.1| hypothetical protein C457_07151 [Haloferax prahovense DSM 18310]
gi|445720130|gb|ELZ71807.1| hypothetical protein C457_07151 [Haloferax prahovense DSM 18310]
Length = 220
Score = 42.4 bits (98), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 69/157 (43%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV APS L +D+A+ G ++ + + + D+ R
Sbjct: 4 DEFVSTHAPSEGGEAFELENSKLLDVAL-DGSVMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG++G L++K TG+G + A G+ V L+ GE ++V+ + ++A S
Sbjct: 56 AGGLKGMLKKKATGEGDVMMQATGTGHLYLADQAKEVQILELDAGEELSVNGNDVLAFES 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK I A GG L + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGASSGG--LFNVFLEGPGHVAITT 150
>gi|421113342|ref|ZP_15573786.1| TIGR00266 family protein [Leptospira santarosai str. JET]
gi|410801116|gb|EKS07290.1| TIGR00266 family protein [Leptospira santarosai str. JET]
gi|456875150|gb|EMF90381.1| TIGR00266 family protein [Leptospira santarosai str. ST188]
Length = 227
Score = 42.4 bits (98), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 52/211 (24%), Positives = 91/211 (43%), Gaps = 31/211 (14%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-----WLFGKTITSVVLR 88
+ Q+ L+ E + A G+M MS S++M + + E G+W L G++ +
Sbjct: 13 IVQVQLEDGESIRAESGAMVAMSPSVKM----VTKAEGGIWASAKRALLSGESF----FQ 64
Query: 89 NPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
N ++ G+ + A I+ GE L+ A++ + + +
Sbjct: 65 NTFKAENGRGMIFLTSATQGDIEYRKMNGEDLILSKGAYVAGSESLLIDSKW-------- 116
Query: 148 GGIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
GG +GF K++G G F + G++ +EV VD IV S+N I
Sbjct: 117 GGFKGFFSGEGLFFLKVSGVGDLFFSSFGAI--HTIEVNGQYVVDTGHIVGFEGSLNYTI 174
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ G ++ G+ LV AV +G G ++IQS
Sbjct: 175 QKVGGLKSLFLSGEGLV-AVFSGSGKLYIQS 204
>gi|456969821|gb|EMG10737.1| TIGR00266 family protein [Leptospira interrogans serovar
Grippotyphosa str. LT2186]
Length = 159
Score = 42.4 bits (98), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 7/131 (5%)
Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
LA LP D+ G + Q +FL S ++++ +T Q ++ I G E +L+G
Sbjct: 15 LAPTLPGDVERIELSGTVYVQSSSFLASSPNIEM-DTKFQGLKGFISG-ESLFFLRLSGN 72
Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
G I + G + N++ GE+I VD IVA +N ++ G + GG+ V A
Sbjct: 73 GFLLISSYGGIDVLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGFV-AR 129
Query: 222 VTGPGIVFIQS 232
G G V+IQS
Sbjct: 130 FKGKGKVWIQS 140
>gi|428769066|ref|YP_007160856.1| hypothetical protein Cyan10605_0674 [Cyanobacterium aponinum PCC
10605]
gi|428683345|gb|AFZ52812.1| protein of unknown function DUF124 [Cyanobacterium aponinum PCC
10605]
Length = 235
Score = 42.4 bits (98), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 50/213 (23%), Positives = 92/213 (43%), Gaps = 24/213 (11%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL------RNP 90
+ L+P + + G M M S++ME+ G + FG+ + L
Sbjct: 22 LELEPNQTAIVEAGGMAAMDSSIKMES-----KMKGGFGQSFGRMLGGESLFLNEFTAQA 76
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
+ ++ P + +D + L+ Q F+ S +V++++ Q + G
Sbjct: 77 SQGELYISPGVPGDIQYYHLDGSK---GLMIQSSGFVASSKNVEINSNF-QGFKGFFSGE 132
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG----- 205
FL K TG+G + + G++V+ N+E V VD IVA ++N ++ G
Sbjct: 133 SIFLL-KATGKGDIWFSSYGAIVEINVENNYV--VDTGYIVAFEDTLNYNVEMIGGLSFR 189
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
+R + GG+ LV +G G ++IQS + L
Sbjct: 190 NLRTGILGGEGLVCR-FSGSGKLWIQSRGLYPL 221
>gi|55376854|ref|YP_134705.1| hypothetical protein pNG7303 [Haloarcula marismortui ATCC 43049]
gi|55229579|gb|AAV44999.1| unknown [Haloarcula marismortui ATCC 43049]
Length = 223
Score = 42.0 bits (97), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 41/157 (26%), Positives = 76/157 (48%), Gaps = 22/157 (14%)
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
+R+ P+DG G + R+L I L G ++ + + + +V + +V
Sbjct: 7 VRSNAPADGGDGFQKEN-NRLLDIPLD---GTVMVKAGSMVAYTGEVTFTGK-----SSV 57
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS---VVQKN--------LEVGEVITVDVSCIVAVTS 195
GGI GF+++ ++G+G + A GS V +N L+ GE I+V+ + ++A S
Sbjct: 58 EGGITGFVKEAVSGEGTPVMEAEGSGHLYVAENGKKVQVLALDDGESISVNGTDVLAFES 117
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+++ +I G + GG L +TGPG V + +
Sbjct: 118 TIDYEINTVGSLSGMAAGG--LTNVYLTGPGEVALTT 152
>gi|313125220|ref|YP_004035484.1| hypothetical protein Hbor_04410 [Halogeometricum borinquense DSM
11551]
gi|448287182|ref|ZP_21478398.1| hypothetical protein C499_10354 [Halogeometricum borinquense DSM
11551]
gi|312291585|gb|ADQ66045.1| conserved hypothetical protein TIGR00266 [Halogeometricum
borinquense DSM 11551]
gi|445572928|gb|ELY27458.1| hypothetical protein C499_10354 [Halogeometricum borinquense DSM
11551]
Length = 220
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 51/212 (24%), Positives = 89/212 (41%), Gaps = 6/212 (2%)
Query: 23 IPFQILGGEAQVC-QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ F+I G + +I E+V PGSM S ++ +E + GM +
Sbjct: 1 MQFEIADGHSYATLEIDFDEGERVGIEPGSMVTRSENVRVETTSGDDGIGGMLKRAVSDE 60
Query: 82 ITSVVLRNPGPSDGFVGIAAPS-LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
I + SDG + AP L I ++L G + Q L V+ T
Sbjct: 61 IDVMTTYFIAESDGAHALLAPDYLGDIARVELDGTEG-VKVQSGGLLAWSEGVE-RGTAR 118
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
N E KL G+G AFI A G+V ++ + + + VD ++A T ++
Sbjct: 119 NEASNFFSSGE-LTVLKLDGKGSAFISAFGAVRKETVTAEDPLIVDEDHLLAWTDGLSAS 177
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ + + ++ GG+ VT ++G G ++Q+
Sbjct: 178 RQKDSNLTSSLLGGEGFVTK-LSGEGSAWVQT 208
>gi|15672568|ref|NP_266742.1| hypothetical protein L178908 [Lactococcus lactis subsp. lactis
Il1403]
gi|12723478|gb|AAK04684.1|AE006292_4 conserved hypothetical protein [Lactococcus lactis subsp. lactis
Il1403]
Length = 232
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 56/222 (25%), Positives = 92/222 (41%), Gaps = 19/222 (8%)
Query: 32 AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS----- 84
A + +I L E++ G+M + + ++++E + G+ L G++ITS
Sbjct: 11 APLVEISLDNNEEIQIESGAMVYHNAAVKLEGKMNSNGKGGLRGALKALGRSITSGESFF 70
Query: 85 VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRV 143
+ + ++G + +A SL I I + G E AFL S V +
Sbjct: 71 ITKASAIGANGKISLAPASLGSIKEI---LVGKEQWNLNTGAFLVSEGGVHYIMERQKLD 127
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
+ GG G K G G I G +V+ L+ E VD ++A T S++ I+
Sbjct: 128 GALFGGTGGLFVMKTQGSGKMLISGYGDIVEIALDGTEDFVVDNQHVLAWTESLSYSIE- 186
Query: 204 NGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+ FG G+ LV G G + IQS L+Q I
Sbjct: 187 ---VASGTFGFKTGEGLVNK-FRGKGKILIQSRNVEALAQSI 224
>gi|448390578|ref|ZP_21566201.1| hypothetical protein C477_08238 [Haloterrigena salina JCM 13891]
gi|445666992|gb|ELZ19644.1| hypothetical protein C477_08238 [Haloterrigena salina JCM 13891]
Length = 221
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 36/139 (25%), Positives = 63/139 (45%), Gaps = 21/139 (15%)
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
++L +DL G ++ + + + D+ Q + GGI GFL++K TG+G
Sbjct: 24 GKLLDVDLD---GTVIAKAGSMIAYDGDISF-----QGKSSAEGGITGFLKEKTTGEGTP 75
Query: 165 FILAGG-----------SVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ A G + L+ E I+V+ + ++A SS++ +I+ I A F
Sbjct: 76 VMEASGRGHLYLADQEKKIQLLELDADEEISVNGNDVLAFESSIDYEIRTMNSI--AGFS 133
Query: 214 GDNLVTAVVTGPGIVFIQS 232
L +TGPG V I +
Sbjct: 134 AGGLTNVSLTGPGSVAITT 152
>gi|389846904|ref|YP_006349143.1| hypothetical protein HFX_1448 [Haloferax mediterranei ATCC 33500]
gi|448615255|ref|ZP_21664180.1| hypothetical protein C439_03253 [Haloferax mediterranei ATCC 33500]
gi|388244210|gb|AFK19156.1| hypothetical protein HFX_1448 [Haloferax mediterranei ATCC 33500]
gi|445752519|gb|EMA03942.1| hypothetical protein C439_03253 [Haloferax mediterranei ATCC 33500]
Length = 220
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 63/139 (45%), Gaps = 23/139 (16%)
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
+++L I L G ++ + + + D+ R GG++G L++K TG+G
Sbjct: 24 SKLLDITLD---GSIMAKAGSMVSYTGDISFE-------RKSTGGLKGMLKKKATGEGEV 73
Query: 165 FILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ A GS V L+ GE I+V+ + ++A +S++ IK I A G
Sbjct: 74 MMEASGSGHLYLADQGKEVQILELDTGEEISVNGNDVLAFENSIDWDIKMMKSIAGASSG 133
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G L + GPG V I +
Sbjct: 134 G--LFNVFLEGPGHVAITT 150
>gi|448541097|ref|ZP_21623928.1| hypothetical protein C460_04280 [Haloferax sp. ATCC BAA-646]
gi|448549482|ref|ZP_21628087.1| hypothetical protein C459_07270 [Haloferax sp. ATCC BAA-645]
gi|448555405|ref|ZP_21631445.1| hypothetical protein C458_06224 [Haloferax sp. ATCC BAA-644]
gi|445708259|gb|ELZ60099.1| hypothetical protein C460_04280 [Haloferax sp. ATCC BAA-646]
gi|445712530|gb|ELZ64311.1| hypothetical protein C459_07270 [Haloferax sp. ATCC BAA-645]
gi|445718150|gb|ELZ69853.1| hypothetical protein C458_06224 [Haloferax sp. ATCC BAA-644]
Length = 220
Score = 42.0 bits (97), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 68/157 (43%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV APS L +D+A+ G ++ + + + D+ R
Sbjct: 4 DEFVSAHAPSEGGDSFELESSKLLDVAL-DGHVMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG++G L+QK TG+G + A G+ V L+ GE ++V+ + ++A S
Sbjct: 56 AGGLKGMLKQKATGEGDVMMQATGTGHLYLADQGKEVQILELDAGEELSVNGNDVLAFES 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK I GG L + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150
>gi|427420413|ref|ZP_18910596.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
gi|425763126|gb|EKV03979.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
Length = 233
Score = 42.0 bits (97), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 97/210 (46%), Gaps = 16/210 (7%)
Query: 36 QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS-D 94
++ L+P + V G+M M +EM++ + G+ + L G+++ G S D
Sbjct: 19 KLSLQPNQTVSVEAGAMAAMDSCIEMKSKAKGGWKKGLGRMLGGESLFISEFTAQGSSGD 78
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
++ P + +D + L+ Q F+ + V + + Q + G FL
Sbjct: 79 LYISPGVPGDIQHYILDGSQ---SLMVQSSGFVAAGPGVSIDSKF-QGFKGFFSGESLFL 134
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK------YNGPIR 208
++TGQG + + G++++ +E V VD IVA S++ ++ + G +R
Sbjct: 135 L-RVTGQGDFWFSSFGAILEIPVEGNYV--VDTGYIVAFEDSLSYNVEMISGLSFRG-LR 190
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
+FGG+ LV A +G G ++IQS + L
Sbjct: 191 TGIFGGEGLV-ARFSGSGRLWIQSRNVYSL 219
>gi|313123760|ref|YP_004034019.1| hypothetical protein LDBND_1002 [Lactobacillus delbrueckii subsp.
bulgaricus ND02]
gi|312280323|gb|ADQ61042.1| Hypothetical protein LDBND_1002 [Lactobacillus delbrueckii subsp.
bulgaricus ND02]
Length = 233
Score = 41.6 bits (96), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 52/238 (21%), Positives = 100/238 (42%), Gaps = 18/238 (7%)
Query: 23 IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ +++ GG + + L E + G+M +M + + +G G++
Sbjct: 1 MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60
Query: 82 ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
+TS + + G +DG +G+A +IL + + G + C AFL S + V
Sbjct: 61 LTSGESMFITQATGDADGGQIGVAPAIPGKILKLSV---GKQQYCLNTGAFLASDDSVSY 117
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+ GG GF + G+G + A G +V+ + + I++D ++A +
Sbjct: 118 KMKSQSFSKAAFGGTGGFYVMQTEGEGDMLVNAFGDLVELTVTSNKPISIDNEHVIAWDA 177
Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
++ IK + +FG G+ LV TG G V IQ+ L++ + + N
Sbjct: 178 NLGYDIK----VASGMFGFTSGEGLVNH-FTGDGKVIIQTRNLRSLAEALQPLIVEKN 230
>gi|448603235|ref|ZP_21657056.1| hypothetical protein C441_03642 [Haloferax sulfurifontis ATCC
BAA-897]
gi|445746431|gb|ELZ97893.1| hypothetical protein C441_03642 [Haloferax sulfurifontis ATCC
BAA-897]
Length = 220
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 68/157 (43%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV APS L +D+A+ GE++ + + + D+ R
Sbjct: 4 DEFVSAHAPSEGGDSFELENSKLLDVAL-DGEVMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG+ G L++K TG+G + A G+ V L+ GE ++V+ + ++A S
Sbjct: 56 AGGLTGMLKKKATGEGDVMMQATGTGHLYLADQGKEVQILELDAGEELSVNGNDVLAFES 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK I GG L + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150
>gi|427724445|ref|YP_007071722.1| hypothetical protein Lepto7376_2617 [Leptolyngbya sp. PCC 7376]
gi|427356165|gb|AFY38888.1| protein of unknown function DUF124 [Leptolyngbya sp. PCC 7376]
Length = 225
Score = 41.6 bits (96), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 95/204 (46%), Gaps = 13/204 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENEVGMWQWLFGKTITSVVLRN 89
+ I + E++VA+ G+M MSGS+ + G+ + L G+++ V R
Sbjct: 13 IAHIKMDAGEEIVAQAGAMIAMSGSINTSTTLRKGKGGGIMGGLKRMLAGESLFLSVFRA 72
Query: 90 PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
P P D + A + +L M G E + Q ++L S ++V + Q ++ G
Sbjct: 73 PVP-DSEIWFAPKLMGDLLLYQ--MQGDEFVVQASSYLASGSNVDIDLGW-QGFKSFFSG 128
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIR 208
E ++GQG + + G++ + +++ GE I VD IVA +++ ++ K N
Sbjct: 129 -ESIFWLSISGQGPLLVTSFGAIYEIDVD-GEYI-VDTGHIVAFEKTLSFKVGKANPSWM 185
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
A GG+ LV G G ++ Q+
Sbjct: 186 GAFLGGEGLVCR-FQGKGKLYCQT 208
>gi|422004336|ref|ZP_16351556.1| hypothetical protein LSS_12589 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|417257005|gb|EKT86413.1| hypothetical protein LSS_12589 [Leptospira santarosai serovar
Shermani str. LT 821]
Length = 227
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 51/211 (24%), Positives = 91/211 (43%), Gaps = 31/211 (14%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-----WLFGKTITSVVLR 88
+ Q+ L+ E + A G+M MS +++M + + E G+W L G++ +
Sbjct: 13 IVQVQLEDGESIRAESGAMVAMSPTIKM----VTKAEGGIWASAKRALLSGESF----FQ 64
Query: 89 NPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
N ++ G+ + A I+ GE L+ A++ + + +
Sbjct: 65 NTFKAENGRGMIFLTSATQGDIEYRKMNGEDLILSKGAYVAGSESLLIDSKW-------- 116
Query: 148 GGIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
GG +GF K++G G F + G++ +EV VD IV S+N I
Sbjct: 117 GGFKGFFSGEGLFFLKVSGVGDLFFSSFGAI--HTIEVNGQYVVDTGHIVGFEGSLNYTI 174
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ G ++ G+ LV AV +G G ++IQS
Sbjct: 175 QKVGGLKSLFLSGEGLV-AVFSGSGKLYIQS 204
>gi|359685937|ref|ZP_09255938.1| hypothetical protein Lsan2_15283 [Leptospira santarosai str.
2000030832]
gi|410450315|ref|ZP_11304356.1| TIGR00266 family protein [Leptospira sp. Fiocruz LV3954]
gi|418746767|ref|ZP_13303087.1| TIGR00266 family protein [Leptospira santarosai str. CBC379]
gi|410015828|gb|EKO77919.1| TIGR00266 family protein [Leptospira sp. Fiocruz LV3954]
gi|410792476|gb|EKR90411.1| TIGR00266 family protein [Leptospira santarosai str. CBC379]
Length = 227
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 51/211 (24%), Positives = 91/211 (43%), Gaps = 31/211 (14%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-----WLFGKTITSVVLR 88
+ Q+ L+ E + A G+M MS +++M + + E G+W L G++ +
Sbjct: 13 IVQVQLEDGESIRAESGAMVAMSPTIKM----VTKAEGGIWASAKRALLSGESF----FQ 64
Query: 89 NPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
N ++ G+ + A I+ GE L+ A++ + + +
Sbjct: 65 NTFKAENGRGMIFLTSATQGDIEYRKMNGEDLILSKGAYVAGSESLLIDSKW-------- 116
Query: 148 GGIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
GG +GF K++G G F + G++ +EV VD IV S+N I
Sbjct: 117 GGFKGFFSGEGLFFLKVSGVGDLFFSSFGAI--HTIEVNGQYVVDTGHIVGFEGSLNYTI 174
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ G ++ G+ LV AV +G G ++IQS
Sbjct: 175 QKVGGLKSLFLSGEGLV-AVFSGSGKLYIQS 204
>gi|228478056|ref|ZP_04062667.1| conserved hypothetical protein [Streptococcus salivarius SK126]
gi|418018712|ref|ZP_12658267.1| hypothetical protein SSALIVM18_09357 [Streptococcus salivarius M18]
gi|228250236|gb|EEK09489.1| conserved hypothetical protein [Streptococcus salivarius SK126]
gi|345526154|gb|EGX29466.1| hypothetical protein SSALIVM18_09357 [Streptococcus salivarius M18]
Length = 231
Score = 41.6 bits (96), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 72/163 (44%), Gaps = 7/163 (4%)
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
IT VV + ++G++ +A S +++P+ L + AFL + L
Sbjct: 70 ITQVVSQ---SNNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQS 124
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + GG GF GQG A GS+ + L EV T+D + +VA + +++ I
Sbjct: 125 VGKAIFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+++ G+ +V G G +++QSL + + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225
>gi|448585889|ref|ZP_21648061.1| hypothetical protein C454_15881 [Haloferax gibbonsii ATCC 33959]
gi|445725507|gb|ELZ77130.1| hypothetical protein C454_15881 [Haloferax gibbonsii ATCC 33959]
Length = 220
Score = 41.6 bits (96), Expect = 0.44, Method: Compositional matrix adjust.
Identities = 44/157 (28%), Positives = 68/157 (43%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV APS L +D+A+ G ++ + + + D+ R
Sbjct: 4 DEFVSTHAPSEGGESFELENSKLLDVAL-DGSVMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG+ G L++K TG+G + A G+ V L+ GE ++V+ + ++A S
Sbjct: 56 AGGLTGMLKKKATGEGDVMMQATGTGHLYLADQAKEVQILELDAGEELSVNGNDVLAFES 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK I A GG L + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGASSGG--LFNVFLEGPGHVAITT 150
>gi|422824507|ref|ZP_16872694.1| hypothetical protein HMPREF9390_1542 [Streptococcus sanguinis
SK405]
gi|422825413|ref|ZP_16873592.1| hypothetical protein HMPREF9392_0347 [Streptococcus sanguinis
SK678]
gi|422856609|ref|ZP_16903265.1| hypothetical protein HMPREF9378_1536 [Streptococcus sanguinis SK1]
gi|422863528|ref|ZP_16910159.1| hypothetical protein HMPREF9391_2053 [Streptococcus sanguinis
SK408]
gi|422866440|ref|ZP_16913065.1| hypothetical protein HMPREF9395_2151 [Streptococcus sanguinis
SK1058]
gi|422877404|ref|ZP_16923874.1| hypothetical protein HMPREF9393_1638 [Streptococcus sanguinis
SK1056]
gi|324992556|gb|EGC24477.1| hypothetical protein HMPREF9390_1542 [Streptococcus sanguinis
SK405]
gi|324995915|gb|EGC27826.1| hypothetical protein HMPREF9392_0347 [Streptococcus sanguinis
SK678]
gi|327459968|gb|EGF06307.1| hypothetical protein HMPREF9378_1536 [Streptococcus sanguinis SK1]
gi|327472502|gb|EGF17933.1| hypothetical protein HMPREF9391_2053 [Streptococcus sanguinis
SK408]
gi|327488549|gb|EGF20349.1| hypothetical protein HMPREF9395_2151 [Streptococcus sanguinis
SK1058]
gi|332360043|gb|EGJ37857.1| hypothetical protein HMPREF9393_1638 [Streptococcus sanguinis
SK1056]
Length = 231
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 48/220 (21%), Positives = 102/220 (46%), Gaps = 16/220 (7%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS----VVL 87
+ ++ L E V + GSM + + ++ + N + + G+ +++ G+++ S +
Sbjct: 13 LVELSLNQGETVFIQRGSMVYHTPNVTL-NTQLNASGSGLGRFVKAVGRSMVSGESTFIT 71
Query: 88 RNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--R 144
+ SD G++ +A + +++P+ L + AFL D T++++ +
Sbjct: 72 QAVAQSDNGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGK 127
Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
+ GG GF GQG A GS+ + L EV T+D + +VA + S+N +
Sbjct: 128 ALFGGQGGFFVMTTQGQGTLLANAFGSIKKLELHNQEV-TIDNAHVVAWSQSLNYHVHLE 186
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+++ G+ +V G G V++QSL + +++
Sbjct: 187 NGFWQSIGTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225
>gi|307152758|ref|YP_003888142.1| hypothetical protein Cyan7822_2910 [Cyanothece sp. PCC 7822]
gi|306982986|gb|ADN14867.1| protein of unknown function DUF124 [Cyanothece sp. PCC 7822]
Length = 227
Score = 41.2 bits (95), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 51/206 (24%), Positives = 92/206 (44%), Gaps = 19/206 (9%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN--PG 91
+ ++ L E+V GSM MS + +E + + L G +N
Sbjct: 13 IGRLRLVGGEQVRVESGSMVGMSAGVTLETTTTGGFMQSLRRSLLGG---ESFFQNVFTA 69
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQR---VRNV 146
P+ G + APSL P DL + +L Q ++ S D+ ++ LD + ++
Sbjct: 70 PAQGGEVLVAPSL----PGDLKVLDMSEPMLLQSGCYVAS--DMGIA--LDSKWGGSKSF 121
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
G G + GQG + + G++ + L+ GE TVD +VA++ S+ + + G
Sbjct: 122 FGTGGGLFMLRAEGQGQIVVSSYGAIHEITLDRGESYTVDTGHLVALSESMPFKTRSIGG 181
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ +F G+ V +TGPG +Q+
Sbjct: 182 MKTFMFSGEGFVID-LTGPGKFLVQT 206
>gi|387760488|ref|YP_006067465.1| hypothetical protein Ssal_00185 [Streptococcus salivarius 57.I]
gi|339291255|gb|AEJ52602.1| conserved hypothetical protein [Streptococcus salivarius 57.I]
Length = 231
Score = 41.2 bits (95), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 7/163 (4%)
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
IT VV ++ ++G++ +A S +++P+ L + AFL + L
Sbjct: 70 ITQVVSQS---NNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQS 124
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + GG GF GQG A GS+ + L EV T+D + +VA + +++ I
Sbjct: 125 VGKAIFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+++ G+ +V G G +++QSL + + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225
>gi|384567619|ref|ZP_10014723.1| TIGR00266 family protein [Saccharomonospora glauca K62]
gi|384523473|gb|EIF00669.1| TIGR00266 family protein [Saccharomonospora glauca K62]
Length = 225
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 49/209 (23%), Positives = 87/209 (41%), Gaps = 16/209 (7%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
V +++L P E G+M S M M+ G+ + +FG V P+
Sbjct: 13 VARLLLAPNEPAQVEAGAMVATSYGMHMQASTQGGVMKGLGRAIFGGESLFVSTYTAPPN 72
Query: 94 DGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
G+V +A P R++ +D + ++L + + + N+ GG
Sbjct: 73 GGWVDVAGGLPGDIRVIEMDGRV---GWCVTRGSWLANSYGIHLETKWGG-FGNLFGGEG 128
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
GFL GQG + G++ L+ GE +T+D +VA +V Q +R+
Sbjct: 129 GFLTHA-QGQGQLLVSCYGAIDVVTLQPGEYVTIDSGHVVAYADTVQSQ------LRKVA 181
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
G ++ ++ +G G VF + P L+Q
Sbjct: 182 QG---VIQSLKSGEGFVFDFAGPGQILTQ 207
>gi|300812994|ref|ZP_07093380.1| conserved hypothetical protein TIGR00266 [Lactobacillus delbrueckii
subsp. bulgaricus PB2003/044-T3-4]
gi|300496028|gb|EFK31164.1| conserved hypothetical protein TIGR00266 [Lactobacillus delbrueckii
subsp. bulgaricus PB2003/044-T3-4]
Length = 233
Score = 41.2 bits (95), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 50/230 (21%), Positives = 99/230 (43%), Gaps = 18/230 (7%)
Query: 23 IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
+ +++ GG + + L E + G+M +M + + +G G++
Sbjct: 1 MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60
Query: 82 ITS----VVLRNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
+TS + + G +D G +G+A +I+ + + G + C AFL S + V
Sbjct: 61 LTSGESMFITQATGDADSGQIGVAPAIPGKIVKLSV---GKQQYCLNTGAFLASDDSVSY 117
Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
+ GG GF + G+G + A G +V+ + + I++D ++A +
Sbjct: 118 KMKSQSFSKAAFGGTGGFYVMQTEGEGDMLVNAFGDLVELTVTSNKPISIDNEHVIAWDA 177
Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+++ IK + +FG G+ LV TG G V IQ+ L++ +
Sbjct: 178 NLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVIIQTRNLRSLAEAL 222
>gi|332672253|ref|YP_004455261.1| hypothetical protein Celf_3768 [Cellulomonas fimi ATCC 484]
gi|332341291|gb|AEE47874.1| protein of unknown function DUF124 [Cellulomonas fimi ATCC 484]
Length = 213
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 54/220 (24%), Positives = 94/220 (42%), Gaps = 22/220 (10%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
+PF+ + ++V +I + PQ V+AR G+M SG + I + VG
Sbjct: 1 MPFEKV--NSKVVRIPVTPQNAVLARRGAMLGYSGQVAFRPISGQGHGVGGMVGAAMSGE 58
Query: 83 TSVVLRNPGPSD---GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS--- 136
++ ++ G GF G+ + IDL G L + D L ++ S
Sbjct: 59 SNPMMATEGNGSVLYGFRGL------HVTVIDLT--GDSLTVEADRVLAHDAHLQTSVEF 110
Query: 137 ---NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+ VR + G +G +++GQG +L+ G L G+ + VD V
Sbjct: 111 IGQGGVRSAVRGAMTG-QGLFTTRISGQGSVAVLSHGGTFPLQLN-GDTVGVDPQAYVGH 168
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAV-VTGPGIVFIQS 232
T ++NV +K + +R V G + V+G G V++Q+
Sbjct: 169 TGALNVDLKASVGLRDLVGRGSGEAFQLHVSGHGTVYVQA 208
>gi|322710681|gb|EFZ02255.1| DUF124 domain protein [Metarhizium anisopliae ARSEF 23]
Length = 270
Score = 41.2 bits (95), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 91/201 (45%), Gaps = 16/201 (7%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
+ + L+P V ++ G+M MSG+++++ NI + M + G ++ PG
Sbjct: 68 ILNVNLQPGGTVRSKSGAMIHMSGTIQLQGNI-----KFSMKKLFTGGRMSESTYTGPGR 122
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
P+L + I L + G D FL DV + T Q + + E
Sbjct: 123 L-----ALGPTLMGDI-ITLHVDGRTPWTVGKDTFLACTPDVAM-ETKSQGISKSMFSGE 175
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
++ GQGL ++ + G+V + +L+ GE VD +VA + ++ K G ++
Sbjct: 176 DLFVYRVGGQGLLWLTSFGAVDRLDLQPGEQHIVDNGHLVAWSCDYRIE-KAGGGAMSSL 234
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ LV TGPG V++Q+
Sbjct: 235 KTGEGLV-CRFTGPGAVYVQT 254
>gi|398336774|ref|ZP_10521479.1| hypothetical protein LkmesMB_14981 [Leptospira kmetyi serovar
Malaysia str. Bejo-Iso9]
Length = 224
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 43/135 (31%), Positives = 68/135 (50%), Gaps = 9/135 (6%)
Query: 99 IAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQK 157
+ AP+L + IDL+ G + Q +FL S ++++ +T Q ++ I G E K
Sbjct: 79 LLAPTLPGDVDRIDLS---GTVYVQSSSFLASSPNIEM-DTKFQGLKGFISG-ESLFFLK 133
Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
L+G G I + G + N++ GE+I VD IVA +N ++ G + GG+
Sbjct: 134 LSGNGPLLISSYGGIDVLNVD-GEMI-VDTGHIVAFDEGLNYEMTKFGGWKSFFLGGEGF 191
Query: 218 VTAVVTGPGIVFIQS 232
V A G G V+IQS
Sbjct: 192 V-ARFKGRGRVWIQS 205
>gi|186681073|ref|YP_001864269.1| hypothetical protein Npun_R0563 [Nostoc punctiforme PCC 73102]
gi|186463525|gb|ACC79326.1| protein of unknown function DUF124 [Nostoc punctiforme PCC 73102]
Length = 222
Score = 41.2 bits (95), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 70/140 (50%), Gaps = 10/140 (7%)
Query: 102 PSLA--RILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
PSLA IL +D G E + A+ S ++V ++ V +IGG EG + K+
Sbjct: 75 PSLAGYHILELD----GSEWILDTGAYWASDGSIEVGIERNKFVSGLIGG-EGLFQTKVK 129
Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLV 218
G+G + A G V NL+ + + VD + +A T+++N ++ K + ++ G+ LV
Sbjct: 130 GRGKVVMAAQGPVEVINLQ-NDRLVVDGNFAIARTNTLNYRVEKATKSLLGSMTSGEFLV 188
Query: 219 TAVVTGPGIVFIQSLPFHRL 238
G G V + +P+ ++
Sbjct: 189 N-TFEGTGTVLLAPIPYWKV 207
>gi|292655523|ref|YP_003535420.1| hypothetical protein HVO_1369 [Haloferax volcanii DS2]
gi|448291905|ref|ZP_21482579.1| hypothetical protein C498_11898 [Haloferax volcanii DS2]
gi|291371101|gb|ADE03328.1| hypothetical protein HVO_1369 [Haloferax volcanii DS2]
gi|445573424|gb|ELY27945.1| hypothetical protein C498_11898 [Haloferax volcanii DS2]
Length = 220
Score = 41.2 bits (95), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 68/157 (43%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV APS L +D+A+ G ++ + + + D+ R
Sbjct: 4 DEFVSAHAPSEGGDSFELENSKLLDVAL-DGNVMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG++G L+QK TG+G + A G+ V L+ GE ++V+ + ++A S
Sbjct: 56 AGGLKGMLKQKATGEGDVMMQATGTGHLYLADQGKEVQILELDAGEELSVNGNDVLAFES 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK + GG L + GPG V I +
Sbjct: 116 SVNWDIKMLTSVAGTSSGG--LFNVFLEGPGHVAITT 150
>gi|417765826|ref|ZP_12413782.1| TIGR00266 family protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
gi|400351765|gb|EJP03978.1| TIGR00266 family protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
Length = 224
Score = 41.2 bits (95), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 40/131 (30%), Positives = 63/131 (48%), Gaps = 7/131 (5%)
Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
LA LP D+ G + Q +FL S ++++ +T Q ++ I G E +L+G
Sbjct: 80 LAPTLPGDVERIELSGTVYVQSSSFLASSPNIEM-DTKFQGLKGFISG-ESLFFLRLSGN 137
Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
G I + G + N++ GE+I VD IVA +N ++ G + GG+ V
Sbjct: 138 GFLLISSYGGIDVLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGFVVR- 194
Query: 222 VTGPGIVFIQS 232
G G V+IQS
Sbjct: 195 FKGKGKVWIQS 205
>gi|421453270|ref|ZP_15902626.1| hypothetical protein RSSL_01892 [Streptococcus salivarius K12]
gi|400181579|gb|EJO15846.1| hypothetical protein RSSL_01892 [Streptococcus salivarius K12]
Length = 253
Score = 41.2 bits (95), Expect = 0.56, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 7/163 (4%)
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
IT VV ++ ++G++ +A S +++P+ L + AFL + L
Sbjct: 92 ITQVVSQS---NNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQS 146
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + GG GF GQG A GS+ + L EV T+D + +VA + +++ I
Sbjct: 147 VGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNI 205
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+++ G+ +V G G +++QSL + + R
Sbjct: 206 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 247
>gi|344210191|ref|YP_004786367.1| hypothetical protein HAH_5250 [Haloarcula hispanica ATCC 33960]
gi|343785408|gb|AEM59383.1| conserved hypothetical protein [Haloarcula hispanica ATCC 33960]
Length = 223
Score = 40.8 bits (94), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 75/157 (47%), Gaps = 22/157 (14%)
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
+R+ P+DG G + R+L I L G ++ + + + +V + +
Sbjct: 7 VRSNAPADGGDGFQKEN-NRLLDIPLD---GTVMVKAGSMVAYTGEVTFTGK-----SSA 57
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS---VVQKN--------LEVGEVITVDVSCIVAVTS 195
GGI GF+++ ++G+G + A GS V +N L+ GE I+V+ + ++A S
Sbjct: 58 EGGITGFVKEAVSGEGTPVMEAEGSGHLYVAENGKKVQVLALDDGESISVNGTDVLAFES 117
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+++ +I G + GG L +TGPG V + +
Sbjct: 118 TIDYEINTVGSLSGMAAGG--LTNVYLTGPGEVALTT 152
>gi|398345152|ref|ZP_10529855.1| hypothetical protein LinasL1_19337 [Leptospira inadai serovar Lyme
str. 10]
Length = 191
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 46/139 (33%), Positives = 68/139 (48%), Gaps = 21/139 (15%)
Query: 104 LARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKL-- 158
LA LP ++L M G ++L Q A+L S + V++ + GG+ L +
Sbjct: 48 LAPELPGDIVELDMNGKKILVQAGAYLASDDAVQIVSKF--------GGLRSLLGGEGLF 99
Query: 159 ----TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
+G G AF+ A GS+ +EV TVD IVA SS+ + K G + +F
Sbjct: 100 LLELSGIGKAFLSAYGSIFP--IEVQGGYTVDTGHIVAFDSSLQFTVGKAGGSWKSTLFS 157
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ LV A TG G ++IQS
Sbjct: 158 GEGLV-ANFTGNGTLWIQS 175
>gi|422405472|ref|ZP_16482515.1| hypothetical protein Pgy4_16834, partial [Pseudomonas syringae pv.
glycinea str. race 4]
gi|330880406|gb|EGH14555.1| hypothetical protein Pgy4_16834 [Pseudomonas syringae pv. glycinea
str. race 4]
Length = 48
Score = 40.8 bits (94), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 25/32 (78%)
Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
+FGG+ + A +TGPG V++QSLPF RL+ R+
Sbjct: 4 LFGGEGVFFARLTGPGTVWLQSLPFSRLAGRM 35
>gi|425781278|gb|EKV19254.1| hypothetical protein PDIG_04140 [Penicillium digitatum PHI26]
gi|425783360|gb|EKV21214.1| hypothetical protein PDIP_08810 [Penicillium digitatum Pd1]
Length = 184
Score = 40.8 bits (94), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 40/158 (25%), Positives = 74/158 (46%), Gaps = 7/158 (4%)
Query: 75 QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
++L G ++T+ PG + +A P L I+ ++ G + D+FL + V
Sbjct: 17 KFLIGGSMTASHYTGPGE----LLLAPPILGDIIVHEINP-GDKWNVGRDSFLAHTSGVC 71
Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+ G EGF K TGQGL ++ + G++++K+L GE +D +VA
Sbjct: 72 HEYKRQDLSKGFFSG-EGFFVYKFTGQGLFWMQSFGAIIKKDLVEGEAYFIDNGHLVAWN 130
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+++ +G + + G+ L GPG V++Q+
Sbjct: 131 CKYDIERVASGGLLSSFSSGEGLACK-FKGPGTVYLQT 167
>gi|448642708|ref|ZP_21678667.1| hypothetical protein C436_17970 [Haloarcula sinaiiensis ATCC 33800]
gi|448660409|ref|ZP_21683469.1| hypothetical protein C435_20388 [Haloarcula californiae ATCC 33799]
gi|445759198|gb|EMA10484.1| hypothetical protein C435_20388 [Haloarcula californiae ATCC 33799]
gi|445759508|gb|EMA10786.1| hypothetical protein C436_17970 [Haloarcula sinaiiensis ATCC 33800]
Length = 223
Score = 40.8 bits (94), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 75/157 (47%), Gaps = 22/157 (14%)
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
+R+ P+DG G + R+L I L G ++ + + + +V + +
Sbjct: 7 VRSNAPADGGDGFQKEN-NRLLDIPLD---GTVMVKAGSMVAYTGEVTFTGK-----SSA 57
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS---VVQKN--------LEVGEVITVDVSCIVAVTS 195
GGI GF+++ ++G+G + A GS V +N L+ GE I+V+ + ++A S
Sbjct: 58 EGGITGFVKEAVSGEGTPVMEAEGSGHLYVAENGKKVQVLALDDGESISVNGTDVLAFES 117
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+++ +I G + GG L +TGPG V + +
Sbjct: 118 TIDYEINTVGSLSGMAAGG--LTNVYLTGPGEVALTT 152
>gi|254416733|ref|ZP_05030483.1| conserved hypothetical protein TIGR00266 [Coleofasciculus
chthonoplastes PCC 7420]
gi|196176473|gb|EDX71487.1| conserved hypothetical protein TIGR00266 [Coleofasciculus
chthonoplastes PCC 7420]
Length = 225
Score = 40.8 bits (94), Expect = 0.73, Method: Compositional matrix adjust.
Identities = 53/207 (25%), Positives = 99/207 (47%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENEVGMWQWLFGKTITSVV 86
++ + +++L+ E++VA+ G M M+GS+ + G+ + + G+++ V
Sbjct: 10 DSAIARVVLEANEELVAQAGCMIAMNGSINASTTLRKGKGGGILGGLKRMVAGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P + G V +A + +L + M G EL+ Q ++L S + V + Q +++
Sbjct: 70 FRSP-IAGGEVFLAPKLMGDLLVYE--MKGKELVVQATSYLASESGVDIDLGF-QGFKSL 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
G E L+G G+ + + G + + + GE + VD IVA S+N I G
Sbjct: 126 FSG-ESIFWLSLSGYGVVLLTSFGGIYKVPVN-GEYV-VDTGHIVAFEKSLNFDITKAGS 182
Query: 207 IRRAVF-GGDNLVTAVVTGPGIVFIQS 232
+ F GG+ LV G G V+ Q+
Sbjct: 183 SWLSAFLGGEGLVCR-FKGQGNVYCQT 208
>gi|340399707|ref|YP_004728732.1| hypothetical protein SALIVB_1961 [Streptococcus salivarius CCHSS3]
gi|338743700|emb|CCB94210.1| uncharacterized protein M6_Spy0233 [Streptococcus salivarius
CCHSS3]
Length = 231
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 7/163 (4%)
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
IT VV ++ ++G++ +A S +++P+ L + AFL + L
Sbjct: 70 ITQVVSQS---NNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQS 124
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + GG GF GQG A GS+ + L EV T+D + +VA + +++ I
Sbjct: 125 VGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYDI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+++ G+ +V G G +++QSL + + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225
>gi|322515959|ref|ZP_08068900.1| hypothetical protein HMPREF9425_0177 [Streptococcus vestibularis
ATCC 49124]
gi|322125633|gb|EFX96963.1| hypothetical protein HMPREF9425_0177 [Streptococcus vestibularis
ATCC 49124]
Length = 231
Score = 40.8 bits (94), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 35/153 (22%), Positives = 67/153 (43%), Gaps = 4/153 (2%)
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
++G++ +A S +++P+ L + AFL + L + + GG
Sbjct: 77 SNNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQSVGKALFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
GF GQG A GS+ + L EV T+D + +VA + +++ I +++
Sbjct: 135 GFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G+ +V G G +++QSL + + R
Sbjct: 194 GTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225
>gi|125718940|ref|YP_001036073.1| hypothetical protein SSA_2155 [Streptococcus sanguinis SK36]
gi|422850676|ref|ZP_16897346.1| hypothetical protein HMPREF9383_0355 [Streptococcus sanguinis
SK150]
gi|422853341|ref|ZP_16900005.1| hypothetical protein HMPREF9384_0724 [Streptococcus sanguinis
SK160]
gi|125498857|gb|ABN45523.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
gi|325695424|gb|EGD37324.1| hypothetical protein HMPREF9383_0355 [Streptococcus sanguinis
SK150]
gi|325697353|gb|EGD39239.1| hypothetical protein HMPREF9384_0724 [Streptococcus sanguinis
SK160]
Length = 231
Score = 40.4 bits (93), Expect = 0.78, Method: Compositional matrix adjust.
Identities = 37/153 (24%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
+G++ +A S +++P+ L + AFL D T++++ + + GG
Sbjct: 79 NGYLALAPDSPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKAIFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G GQG A GS+ + L EV T+D + +VA + S+N I +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G+ +V G G V++QSL + +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225
>gi|312862475|ref|ZP_07722718.1| TIGR00266 family protein [Streptococcus vestibularis F0396]
gi|311102118|gb|EFQ60318.1| TIGR00266 family protein [Streptococcus vestibularis F0396]
Length = 231
Score = 40.4 bits (93), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 35/153 (22%), Positives = 67/153 (43%), Gaps = 4/153 (2%)
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
++G++ +A S +++P+ L + AFL + L + + GG
Sbjct: 77 SNNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQSVGKALFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
GF GQG A GS+ + L EV T+D + +VA + +++ I +++
Sbjct: 135 GFFVMTTRGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G+ +V G G +++QSL + + R
Sbjct: 194 GTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225
>gi|284166382|ref|YP_003404661.1| hypothetical protein Htur_3123 [Haloterrigena turkmenica DSM 5511]
gi|284016037|gb|ADB61988.1| protein of unknown function DUF124 [Haloterrigena turkmenica DSM
5511]
Length = 221
Score = 40.4 bits (93), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 65/139 (46%), Gaps = 21/139 (15%)
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
+++L +DL G ++ + + + D+ + GG++GFL++K T +G +
Sbjct: 24 SKLLDVDLD---GAVIAKAGSMIAYSGDISFKGK-----SSAEGGLKGFLKEKATSEGTS 75
Query: 165 FILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ A G+ + L+ GE I+V+ + ++A SSVN +I+ I A F
Sbjct: 76 VMEATGNGHLYLADDEKKIQILELDAGEEISVNGNDVLAFESSVNYEIRTIKSI--AGFS 133
Query: 214 GDNLVTAVVTGPGIVFIQS 232
L + GPG V I +
Sbjct: 134 AGGLTNVSLVGPGSVAITT 152
>gi|119485104|ref|ZP_01619489.1| hypothetical protein L8106_06639 [Lyngbya sp. PCC 8106]
gi|119457332|gb|EAW38457.1| hypothetical protein L8106_06639 [Lyngbya sp. PCC 8106]
Length = 261
Score = 40.4 bits (93), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 50/207 (24%), Positives = 98/207 (47%), Gaps = 23/207 (11%)
Query: 36 QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGPS 93
++ L+P + V+ G+M M ++M++ G+ + L G++ ++ +N P
Sbjct: 19 RLDLQPNQTVIVESGAMAAMDSCIQMQSKLKGGLMKGVGRMLSGESLFLSEFTAKNK-PG 77
Query: 94 DGFVGIAAPSLARILPIDLAMF---GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
+ F L+ +P D+ + G L+ Q F+ S N+V + Q + G
Sbjct: 78 ELF-------LSPGIPGDIQHYYLNGKGLMIQSSGFVASGNEVNLDTNF-QGFKGFFSGE 129
Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG----- 205
FL + TGQG ++ + G +++ +E G+ + VD IVA +++ ++ G
Sbjct: 130 SLFLI-RATGQGDIWLSSYGGILEIPVE-GDYV-VDTGYIVAFEETLDYNVEVIGGLSFK 186
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ +FGG+ LV G G ++IQS
Sbjct: 187 NLKTGIFGGEGLVCR-FRGQGKLWIQS 212
>gi|149922842|ref|ZP_01911265.1| hypothetical protein PPSIR1_34772 [Plesiocystis pacifica SIR-1]
gi|149816309|gb|EDM75813.1| hypothetical protein PPSIR1_34772 [Plesiocystis pacifica SIR-1]
Length = 235
Score = 40.4 bits (93), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 20/124 (16%)
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVR-----NVIGGIEGFLRQKLTGQGLAFILAGGSVV 173
L Q A+L S + T+D ++R ++IGG EG + Q+ TGQG +I + G +
Sbjct: 107 LFIQSSAYLAS------TGTVDTKLRFGGFKSMIGG-EGMVLQECTGQGDVWINSYGGIT 159
Query: 174 QKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG-----PIRRAVFGGDNLVTAVVTGPGIV 228
+ + V VD IVA ++ ++K G ++ +F G+ LV +G G V
Sbjct: 160 E--VPVNGTFIVDTGHIVAFDGNMTFKVKAAGSSGGFSLKSFLFSGEGLVCE-FSGTGRV 216
Query: 229 FIQS 232
+IQS
Sbjct: 217 WIQS 220
>gi|448571304|ref|ZP_21639649.1| hypothetical protein C456_10074 [Haloferax lucentense DSM 14919]
gi|448596140|ref|ZP_21653480.1| hypothetical protein C452_03878 [Haloferax alexandrinus JCM 10717]
gi|445722516|gb|ELZ74174.1| hypothetical protein C456_10074 [Haloferax lucentense DSM 14919]
gi|445741828|gb|ELZ93326.1| hypothetical protein C452_03878 [Haloferax alexandrinus JCM 10717]
Length = 220
Score = 40.4 bits (93), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 68/157 (43%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV APS L +D+A+ G ++ + + + D+ R
Sbjct: 4 DEFVSAHAPSEGGDSFELENSKLLDVAL-DGNVMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG++G L++K TG+G + A G+ V L+ GE ++V+ + ++A S
Sbjct: 56 AGGLKGMLKKKATGEGDVMMQATGTGHLYLADQAKEVQILELDAGEELSVNGNDVLAFES 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK I GG L + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150
>gi|433417405|ref|ZP_20404716.1| hypothetical protein D320_00733 [Haloferax sp. BAB2207]
gi|432200061|gb|ELK56178.1| hypothetical protein D320_00733 [Haloferax sp. BAB2207]
Length = 220
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 68/157 (43%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV APS L +D+A+ G ++ + + + D+ R
Sbjct: 4 DEFVSAHAPSEGGDSFELENSKLLDVAL-DGNVMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG++G L++K TG+G + A G+ V L+ GE ++V+ + ++A S
Sbjct: 56 AGGLKGMLKKKATGEGDVMMQATGTGHLYLADQAKEVQILELDAGEELSVNGNDVLAFES 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK I GG L + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150
>gi|422847658|ref|ZP_16894341.1| hypothetical protein HMPREF9381_2000 [Streptococcus sanguinis SK72]
gi|325686656|gb|EGD28682.1| hypothetical protein HMPREF9381_2000 [Streptococcus sanguinis SK72]
Length = 231
Score = 40.0 bits (92), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 71/155 (45%), Gaps = 8/155 (5%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
+G++ +A S +++P+ L + AFL D T++++ + + GG
Sbjct: 79 NGYLALAPDSPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G GQG A GS+ + L EV T+D + +VA + S+N I +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
G+ +V G G V++QSL + +++ +
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSKYI 227
>gi|422822558|ref|ZP_16870751.1| hypothetical protein HMPREF9388_2019 [Streptococcus sanguinis
SK353]
gi|422857502|ref|ZP_16904152.1| hypothetical protein HMPREF9394_0339 [Streptococcus sanguinis
SK1057]
gi|324989828|gb|EGC21771.1| hypothetical protein HMPREF9388_2019 [Streptococcus sanguinis
SK353]
gi|327463553|gb|EGF09872.1| hypothetical protein HMPREF9394_0339 [Streptococcus sanguinis
SK1057]
Length = 231
Score = 40.0 bits (92), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 71/155 (45%), Gaps = 8/155 (5%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
+G++ +A S +++P+ L + AFL D T++++ + + GG
Sbjct: 79 NGYLALAPDSPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G GQG A GS+ + L EV T+D + +VA + S+N I +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
G+ +V G G V++QSL + +++ +
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSKYI 227
>gi|401682713|ref|ZP_10814603.1| TIGR00266 family protein [Streptococcus sp. AS14]
gi|422872397|ref|ZP_16918890.1| hypothetical protein HMPREF9397_2136 [Streptococcus sanguinis
SK1087]
gi|328944647|gb|EGG38808.1| hypothetical protein HMPREF9397_2136 [Streptococcus sanguinis
SK1087]
gi|400183953|gb|EJO18200.1| TIGR00266 family protein [Streptococcus sp. AS14]
Length = 231
Score = 40.0 bits (92), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 37/153 (24%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
+G++ +A S +++P+ L + AFL D T++++ + + GG
Sbjct: 79 NGYLALAPDSPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G GQG A GS+ + L EV T+D + +VA + S+N I +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G+ +V G G V++QSL + +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225
>gi|322374139|ref|ZP_08048673.1| hypothetical protein HMPREF0848_01845 [Streptococcus sp. C150]
gi|321277105|gb|EFX54176.1| hypothetical protein HMPREF0848_01845 [Streptococcus sp. C150]
Length = 231
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 39/163 (23%), Positives = 71/163 (43%), Gaps = 7/163 (4%)
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
IT VV + ++G++ +A S +++P+ L + AFL +
Sbjct: 70 ITQVVAQ---SNNGYLALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMEYQS 124
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + GG GF GQG A GS+ + L EV T+D + +VA + S++ I
Sbjct: 125 VGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQSLDYNI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+++ G+ +V G G +++QSL + + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225
>gi|448477879|ref|ZP_21603763.1| hypothetical protein C462_00162 [Halorubrum arcis JCM 13916]
gi|445822992|gb|EMA72735.1| hypothetical protein C462_00162 [Halorubrum arcis JCM 13916]
Length = 219
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 70/153 (45%), Gaps = 25/153 (16%)
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
G SDGF + R+L I F G ++ + + + DV + + GGI
Sbjct: 14 GSSDGFRKVNK----RLLEIP---FEGTVMVKAGSMIAYTGDVTFTGK-----SSAEGGI 61
Query: 151 EGFLRQKLTGQGLAFILAGGS----VVQK-------NLEVGEVITVDVSCIVAVTSSVNV 199
GF++ ++G+G + A GS V ++ L+ E I+V+ + ++A ++
Sbjct: 62 AGFVKDAVSGEGTPIMEAEGSGTLYVAEQGKKVQILTLDADESISVNGTDVLAFEKKIDY 121
Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+I G + +A GG L +TGPG V + +
Sbjct: 122 EISTVGGLSQAAAGG--LTNVFLTGPGEVAVTT 152
>gi|428773943|ref|YP_007165731.1| hypothetical protein Cyast_2132 [Cyanobacterium stanieri PCC 7202]
gi|428688222|gb|AFZ48082.1| protein of unknown function DUF124 [Cyanobacterium stanieri PCC
7202]
Length = 225
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 100/207 (48%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMC----FMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
++ + +I L QE+++A GSM F++ S + G+ + + G+++ V
Sbjct: 10 DSAIAKITLDAQEELLAEAGSMIAMSDFINASTTLRQGKGGGILGGLKRMMAGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R P +G + +A + I+ + M G EL+ Q ++L + V++ Q +++
Sbjct: 70 FRCYQP-NGEIYLAPHLMGDIVVYE--MTGNELVVQSGSYLACASGVEIDLGF-QGLKSF 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
G E +++G G + + G + + +++ GE + VD IVA +++ +I K +
Sbjct: 126 FSG-EAIFWLQISGYGAVILTSFGGIYEIDVD-GEYV-VDTGHIVAFEKTLDFKITKASS 182
Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+ + GG+ LV G G ++ Q+
Sbjct: 183 SLLGSFLGGEGLVCRFY-GKGKLYCQT 208
>gi|156937119|ref|YP_001434915.1| hypothetical protein Igni_0325 [Ignicoccus hospitalis KIN4/I]
gi|156566103|gb|ABU81508.1| protein of unknown function DUF124 [Ignicoccus hospitalis KIN4/I]
Length = 234
Score = 39.7 bits (91), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 27/234 (11%)
Query: 28 LGGEAQ-VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----------WQW 76
L GEA V ++ L+ EK+ A G+M M G ++++ + I E G +
Sbjct: 5 LEGEAYPVIRVELEEGEKLYAEAGAMMMMKGDVKVDTVLIDLEEEGNVLRALAEALGRKL 64
Query: 77 LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKV 135
L G+T+ V PG V +PSL ++ GE D +++ D+K
Sbjct: 65 LAGETVFHNVFEGPG-----VVWLSPSLPG--GVEYIEVKGECWTVQDYSYVAHFGDLK- 116
Query: 136 SNTLDQRVRNVIGGIEGFL-RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
LD + G G L + G+G ++ + G +++ +EV + + +D VA+
Sbjct: 117 ---LDLAWKGPKGLFLGDLVWLRACGEGGLWVSSYGDIMK--VEVEDEMVIDNMHFVALP 171
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
+++ G ++ + GG+ V V GP V++Q+ L+ IAR + S
Sbjct: 172 DDAEWRVEKFGGLKSFITGGEGYVIR-VKGPTTVYVQTRILPPLAAAIARFMPS 224
>gi|322368155|ref|ZP_08042724.1| hypothetical protein ZOD2009_01695 [Haladaptatus paucihalophilus
DX253]
gi|320552171|gb|EFW93816.1| hypothetical protein ZOD2009_01695 [Haladaptatus paucihalophilus
DX253]
Length = 215
Score = 39.7 bits (91), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 47/199 (23%), Positives = 84/199 (42%), Gaps = 20/199 (10%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNPGPS 93
+ L E++ GS+ S +E+ P++E + + + G+ T+ +PG
Sbjct: 16 LTLDADEEIRVEAGSLVSHSAGVEVP----PDSEALLERSVLGEETPFSTTFAADHPGT- 70
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
+ +A P I +L L Q AFL + + +D + G
Sbjct: 71 ---IRLAPPFPGDIFHYELR--DETLYVQSCAFLAA------ESGIDFEEKGSFVECAGD 119
Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+L+G GLAF+ + G+V +L+ GE +D +VA SV+ ++ + G
Sbjct: 120 FLLELSGSGLAFLSSYGAVSIVSLDPGERYILDTGHVVAFEGSVDFEVTRVDTVHSHAGG 179
Query: 214 GDNLVTAVVTGPGIVFIQS 232
LV + GPG ++ QS
Sbjct: 180 RGGLVCE-LEGPGTIWTQS 197
>gi|448621380|ref|ZP_21668355.1| hypothetical protein C438_04932 [Haloferax denitrificans ATCC
35960]
gi|445755873|gb|EMA07255.1| hypothetical protein C438_04932 [Haloferax denitrificans ATCC
35960]
Length = 220
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 43/157 (27%), Positives = 67/157 (42%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV APS L +D+A+ G ++ + + + D+ R
Sbjct: 4 DEFVSAHAPSEGGDSFELENSKLLDVAL-DGNVMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG+ G L++K TG+G + A G+ V L+ GE ++V+ + ++A S
Sbjct: 56 AGGLTGMLKKKATGEGDVMMQATGTGHLYLADQGKEVQILELDAGEELSVNGNDVLAFES 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK I GG L + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150
>gi|422885126|ref|ZP_16931574.1| hypothetical protein HMPREF9380_2225 [Streptococcus sanguinis SK49]
gi|332358097|gb|EGJ35929.1| hypothetical protein HMPREF9380_2225 [Streptococcus sanguinis SK49]
Length = 231
Score = 39.7 bits (91), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 36/157 (22%), Positives = 72/157 (45%), Gaps = 8/157 (5%)
Query: 92 PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGG 149
++G++ +A + +++P+ L + AFL D T++++ + + GG
Sbjct: 77 SNNGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKAIFGG 132
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
G GQG A GS+ + L EV T+D + +VA + S+N I +
Sbjct: 133 QGGLFVMTTQGQGTLLANAFGSIKRIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQ 191
Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
++ G+ +V G G V++QSL + +++ +
Sbjct: 192 SIGTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSKYI 227
>gi|448398830|ref|ZP_21570193.1| hypothetical protein C476_05362 [Haloterrigena limicola JCM 13563]
gi|445670377|gb|ELZ22978.1| hypothetical protein C476_05362 [Haloterrigena limicola JCM 13563]
Length = 221
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 64/139 (46%), Gaps = 21/139 (15%)
Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
+ +L IDL G ++ + + + D+ + + GGI GF+++K TG+G
Sbjct: 24 STLLDIDLD---GTVMAKAGSMIAYDGDISFTGK-----SSAEGGITGFIKEKATGEGTP 75
Query: 165 FILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
+ A G+ + L+ G+ I+V+ + ++A SSVN +I+ I A F
Sbjct: 76 VMEATGTGHLYLADQEKKIQLLELDTGQSISVNGNDVLAFESSVNYEIRTIKSI--AGFS 133
Query: 214 GDNLVTAVVTGPGIVFIQS 232
+ + GPG V I +
Sbjct: 134 AGGMTNVFLEGPGSVAITT 152
>gi|428769068|ref|YP_007160858.1| hypothetical protein Cyan10605_0676 [Cyanobacterium aponinum PCC
10605]
gi|428683347|gb|AFZ52814.1| protein of unknown function DUF124 [Cyanobacterium aponinum PCC
10605]
Length = 220
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 44/201 (21%), Positives = 87/201 (43%), Gaps = 16/201 (7%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMWQWLFGKTITSVVLRNPGP 92
+ L+P+E++ A G+M M G + M+ + IP + ++L G+++ V NP
Sbjct: 16 VTLEPRERITAEAGAMVSMDGDIVMKTEFSGGLIPAL---IRKFLGGESMFVNVFYNPTN 72
Query: 93 SDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
+ + + I I L + +C QP A++ K+ G E
Sbjct: 73 RPLNLVLTQSMIGDIEAIKLE---NKSICFQPGAYIAHTPGAKMGVKWAGFASWFAG--E 127
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G + +G+G G + ++ E+ VD + +VA + + I +G + ++
Sbjct: 128 GLFKLHFSGKGRVLFGCYGGITRR--EINGDFIVDNNHLVAYDDGIKMNITLSGNLLSSL 185
Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
G+ V +TG G +++QS
Sbjct: 186 TSGEGFVNK-LTGRGAIYLQS 205
>gi|389580754|ref|ZP_10170781.1| hypothetical protein DespoDRAFT_02788 [Desulfobacter postgatei
2ac9]
gi|389402389|gb|EIM64611.1| hypothetical protein DespoDRAFT_02788 [Desulfobacter postgatei
2ac9]
Length = 139
Score = 39.3 bits (90), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 47/103 (45%), Gaps = 10/103 (9%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY--IPENEVGMWQWLFG- 79
+ +QI G + Q ++ L P E V+A G+M +M + E E + G+ LFG
Sbjct: 6 VDYQIFGDDMQAVEVGLDPGETVIAEAGAMNWMEQGIAFEAKMGDGSEADKGLMGKLFGA 65
Query: 80 --KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMF 115
+ +T L N V +AP +I+P+D+A F
Sbjct: 66 DKRALTGESLFLTHFTNQAHDKKRVAFSAPYPGKIIPVDMAAF 108
>gi|381163293|ref|ZP_09872523.1| hypothetical protein SacazDRAFT_02213 [Saccharomonospora azurea
NA-128]
gi|418461390|ref|ZP_13032465.1| hypothetical protein SZMC14600_10573 [Saccharomonospora azurea SZMC
14600]
gi|359738493|gb|EHK87378.1| hypothetical protein SZMC14600_10573 [Saccharomonospora azurea SZMC
14600]
gi|379255198|gb|EHY89124.1| hypothetical protein SacazDRAFT_02213 [Saccharomonospora azurea
NA-128]
Length = 197
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 3/92 (3%)
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
+ + GG GFL+ +G G + G V L+ GEV+TV+ ++A +V V+++
Sbjct: 106 QTLFGGDSGFLKH-YSGTGPLVFASRGPVDALTLKAGEVVTVNPLFVLAYPDTVQVRLRA 164
Query: 204 NGPIR-RAVFGGDNLVTAVVTGPGIVFIQSLP 234
P +++ G+ L V GPG V +Q+ P
Sbjct: 165 LDPAEPQSIKTGEGLALDVA-GPGTVLVQTRP 195
>gi|427724446|ref|YP_007071723.1| hypothetical protein Lepto7376_2618 [Leptolyngbya sp. PCC 7376]
gi|427356166|gb|AFY38889.1| protein of unknown function DUF124 [Leptolyngbya sp. PCC 7376]
Length = 231
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 52/219 (23%), Positives = 95/219 (43%), Gaps = 48/219 (21%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK----------TITSVV 86
+ L P + ++ G+M M ++EM++ G+ + L G+ T T+ +
Sbjct: 19 VKLPPNQTLLVEAGAMAAMDANIEMKSKMRGGLMKGLGRMLSGESLFISEFTAQTTTAEI 78
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQR 142
+PG P DLA + L+ Q F+ S V+ LD +
Sbjct: 79 YLSPGA----------------PGDLAHYELDGSKSLMVQSSGFVASSPSVE----LDTK 118
Query: 143 VRNVIGGIEG----FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
+ + G G FLR +GQG + + G++++ +E G+ + VD IVA ++N
Sbjct: 119 FQGLKGFFSGESIFFLRA--SGQGDFWFSSYGAIIEIPVE-GDYV-VDTGYIVAFEDTLN 174
Query: 199 VQIKYNG-----PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ G +R + GG+ LV +G G +++QS
Sbjct: 175 YNVEMIGGLSFRSLRTGILGGEGLVCR-FSGKGKLWVQS 212
>gi|448575666|ref|ZP_21641946.1| hypothetical protein C455_03259 [Haloferax larsenii JCM 13917]
gi|445730607|gb|ELZ82195.1| hypothetical protein C455_03259 [Haloferax larsenii JCM 13917]
Length = 220
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/100 (32%), Positives = 47/100 (47%), Gaps = 13/100 (13%)
Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVA 192
R GG+ G L++K TG+G + A GS V L+ GE ++V+ + ++A
Sbjct: 53 RKSAGGLTGMLKKKATGEGSVMMEASGSGQLYLADQGKEVQILELDAGEELSVNGNDVLA 112
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SV+ IK I A GG L + GPG V I +
Sbjct: 113 FEDSVSWDIKMLSSIAGAKTGG--LFNVFLEGPGHVAITT 150
>gi|390630390|ref|ZP_10258374.1| Putative uncharacterized protein [Weissella confusa LBAE C39-2]
gi|390484398|emb|CCF30722.1| Putative uncharacterized protein [Weissella confusa LBAE C39-2]
Length = 265
Score = 39.3 bits (90), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 51/235 (21%), Positives = 98/235 (41%), Gaps = 26/235 (11%)
Query: 3 APFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQE-KVVARPGSMCFMSGSMEM 61
F FQ + SP DA + + + + Q+++ Q V + G+M +M G ++
Sbjct: 20 GSFQVVEFQRDLSVSPFDAQVQYFMTQMNVKKKQLLINLQNADVTIQAGAMQWMLGDVKA 79
Query: 62 ENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIA----APSLARILPIDLAMFGG 117
+ GK I V + G P+ IL +D++ +GG
Sbjct: 80 TTGIKGAGD------FLGKMIKGSVTGEAAIKPEYKGTGMLALEPTYKHILLLDVSEWGG 133
Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
+++ +L S + +K + + G EG L G G+A + + V Q+ L
Sbjct: 134 QVVIDDGFYLASEDGLKQKLVARSNFSSAVAGGEGLFNLSLVGNGVAALES--PVPQEEL 191
Query: 178 -EV---GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIV 228
EV +V+ +D + +A +SS+N ++ +G +L+ + +G G+V
Sbjct: 192 IEVELKDDVLKIDGNLAIAWSSSLNFTVERSGK---------SLIGSAASGEGLV 237
>gi|408388668|gb|EKJ68347.1| hypothetical protein FPSE_11355 [Fusarium pseudograminearum CS3096]
Length = 242
Score = 39.3 bits (90), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 52/203 (25%), Positives = 92/203 (45%), Gaps = 20/203 (9%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP--- 90
V I L+ V ++ G+M MS ++E+ G ++ FGK T +
Sbjct: 39 VLNIDLQQGTTVRSKSGAMIHMSSTVEI---------TGKSKFSFGKLFTGGNMYESLYA 89
Query: 91 GPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
GP G V + I I L + G + D+FL S ++V + + G
Sbjct: 90 GP--GRVALGPTLFGDI--ISLHVDGNQSWTIGKDSFLASTSEVTKKRETQGLGKALFSG 145
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
+ F+ ++ GQG+ ++ + G+V + +L+ GE VD +VA + + ++ K G
Sbjct: 146 EDLFVF-RVEGQGIMWLTSFGAVDKLDLQAGEQHIVDNGHLVAWSCNYAIE-KAGGGAMS 203
Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
+ G+ LV TGPG V++Q+
Sbjct: 204 GMKTGEGLV-CRFTGPGSVYVQT 225
>gi|422861607|ref|ZP_16908247.1| hypothetical protein HMPREF9386_1803 [Streptococcus sanguinis
SK330]
gi|327467840|gb|EGF13330.1| hypothetical protein HMPREF9386_1803 [Streptococcus sanguinis
SK330]
Length = 231
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
+G++ +A + +++P+ L + AFL D T++++ + + GG
Sbjct: 79 NGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKAIFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G GQG A GS+ + L EV T+D + +VA + S+N I +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G+ +V G G V++QSL + +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225
>gi|291302980|ref|YP_003514258.1| hypothetical protein Snas_5534 [Stackebrandtia nassauensis DSM
44728]
gi|290572200|gb|ADD45165.1| protein of unknown function DUF124 [Stackebrandtia nassauensis DSM
44728]
Length = 228
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/136 (27%), Positives = 63/136 (46%), Gaps = 9/136 (6%)
Query: 103 SLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKL 158
LA LP D+A+ L+ +++ S +V++ ++N+IGG EG +
Sbjct: 77 DLAHSLPGDVAVVPVRPEAALMIAKGSWIASNGEVEIQTKWGG-MKNLIGG-EGAFLVRA 134
Query: 159 TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY--NGPIRRAVFGGDN 216
GQG + G++ L+ GE I +D +VA +V ++ G ++ G+
Sbjct: 135 QGQGQLVLGCYGAIDSYALDPGETIVIDNGHVVAYEETVGHSLRRAVEGKTMQSRKSGEG 194
Query: 217 LVTAVVTGPGIVFIQS 232
LV TGPG V QS
Sbjct: 195 LVYE-FTGPGRVLTQS 209
>gi|383830130|ref|ZP_09985219.1| hypothetical protein SacxiDRAFT_2653 [Saccharomonospora
xinjiangensis XJ-54]
gi|383462783|gb|EID54873.1| hypothetical protein SacxiDRAFT_2653 [Saccharomonospora
xinjiangensis XJ-54]
Length = 197
Score = 39.3 bits (90), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 37/139 (26%), Positives = 67/139 (48%), Gaps = 5/139 (3%)
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G++ +A + P++L+ G + DA L V+ + +++ GG GFL
Sbjct: 59 GWIDLAPRQPGDVYPLELSGQLGWSVAS-DAVLARPATVRHDHPWPAH-QSLFGGDSGFL 116
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR-RAVFG 213
+ +G G + G V L+ GEV+TV+ ++A +V V+++ P ++V
Sbjct: 117 KH-FSGVGSLVFASRGPVDALTLKAGEVVTVNPLFVLAFPDTVQVRLRALDPAEPQSVRT 175
Query: 214 GDNLVTAVVTGPGIVFIQS 232
G+ L V GPG V +Q+
Sbjct: 176 GEGLALDVA-GPGTVLVQT 193
>gi|403415219|emb|CCM01919.1| predicted protein [Fibroporia radiculosa]
Length = 268
Score = 39.3 bits (90), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 52/208 (25%), Positives = 90/208 (43%), Gaps = 29/208 (13%)
Query: 37 IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLR-NPGPSDG 95
+ ++P +++A+PGSM M S++++ G + F K +T + G S
Sbjct: 60 MRIQPGYEILAKPGSMVTMDPSVQIK---------GKMNFSFKKLLTGGEASIDLGLS-- 108
Query: 96 FVGIAAPSLARILPIDLAMFGGELLCQPD----------AFLCSVNDVKVSNTLDQRVRN 145
F+ P + P M+G Q D AFL V+++ T Q
Sbjct: 109 FMHFWGPGEVVLTP---EMWGDITRIQLDGNSPWHFGKHAFLACTPGVQMT-TKSQSFGK 164
Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ-IKYN 204
++ G + TG G+ F+ G+++ ++L GE V+ + +VA N++ I+
Sbjct: 165 MLFSGHGLFVSQATGVGMLFLHGLGTIISRDLRPGEQWIVNNNHLVAWNCQYNMEKIQAG 224
Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
G + + D TGPG VFIQS
Sbjct: 225 GLL--STLQTDEGAVCRFTGPGTVFIQS 250
>gi|205373536|ref|ZP_03226339.1| hypothetical protein Bcoam_09740 [Bacillus coahuilensis m4-4]
Length = 61
Score = 39.3 bits (90), Expect = 2.2, Method: Composition-based stats.
Identities = 17/43 (39%), Positives = 26/43 (60%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY 65
I + I G + Q ++ L P+E V+A GS+ M S+EME I+
Sbjct: 6 IDYVIHGNDMQFVEVELDPKETVIAEAGSLMMMEDSIEMETIF 48
>gi|357636516|ref|ZP_09134391.1| TIGR00266 family protein [Streptococcus macacae NCTC 11558]
gi|357584970|gb|EHJ52173.1| TIGR00266 family protein [Streptococcus macacae NCTC 11558]
Length = 237
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 47/223 (21%), Positives = 94/223 (42%), Gaps = 10/223 (4%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITS-----VVL 87
+ +I+L+ E + GSM + + S+ + + + VG G+++TS +
Sbjct: 18 LVEIVLETGESAYIQRGSMVYHTPSVTLNTKVNGRGSGVGKLVGAIGRSVTSGESFFITQ 77
Query: 88 RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
DG + +A +++ + L + AFL + R +
Sbjct: 78 AVSNADDGKLALAPSMPGQVIALKLGE--KQYRLNDGAFLALDGSARYQMKSQSIGRALF 135
Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
GG G GQG + + GS+ + L+ E IT+D + +VA + +N I +
Sbjct: 136 GGQGGLFVMTTEGQGTLLVNSFGSIKKIELQNQE-ITIDNAHVVAWSRDLNYDIHLENGL 194
Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+++ G+ +V G G V++QSL + + + +T+ N
Sbjct: 195 MQSIGTGEGVVN-TFRGTGEVYVQSLNLRQFAGVLQNFLTNAN 236
>gi|322372055|ref|ZP_08046597.1| hypothetical protein ZOD2009_21187 [Haladaptatus paucihalophilus
DX253]
gi|320548477|gb|EFW90149.1| hypothetical protein ZOD2009_21187 [Haladaptatus paucihalophilus
DX253]
Length = 220
Score = 38.9 bits (89), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 13/94 (13%)
Query: 148 GGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTSS 196
GG++G L+QK TG+G + A G+ V L+ + I+V+ + ++A S
Sbjct: 57 GGLKGMLKQKATGEGAVMMKATGTGHLYLADQGKEVQLLELDADDEISVNGNDVLAFEDS 116
Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFI 230
V+ IK I A GG L ++GPG + I
Sbjct: 117 VSWDIKMMSSIAGATSGG--LFNVYLSGPGYIAI 148
>gi|419707153|ref|ZP_14234650.1| Hypothetical protein PS4_83945 [Streptococcus salivarius PS4]
gi|383283113|gb|EIC81080.1| Hypothetical protein PS4_83945 [Streptococcus salivarius PS4]
Length = 231
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 38/163 (23%), Positives = 71/163 (43%), Gaps = 7/163 (4%)
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
IT VV + ++G++ +A S +++P+ L + AFL +
Sbjct: 70 ITQVVAQ---SNNGYLALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMEFQS 124
Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
+ + GG GF GQG A GS+ + L EV T+D + +VA + +++ I
Sbjct: 125 VGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNI 183
Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
+++ G+ +V G G +++QSL + + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225
>gi|323350601|ref|ZP_08086263.1| hypothetical protein HMPREF9398_0311 [Streptococcus sanguinis
VMC66]
gi|322123283|gb|EFX94968.1| hypothetical protein HMPREF9398_0311 [Streptococcus sanguinis
VMC66]
Length = 231
Score = 38.9 bits (89), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/155 (23%), Positives = 71/155 (45%), Gaps = 8/155 (5%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
+G++ +A + +++P+ L + AFL D T++++ + + GG
Sbjct: 79 NGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G GQG A GS+ + L EV T+D + +VA + S+N I +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
G+ +V G G V++QSL + +++ +
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSKYI 227
>gi|375102748|ref|ZP_09749011.1| hypothetical protein SaccyDRAFT_4550 [Saccharomonospora cyanea
NA-134]
gi|374663480|gb|EHR63358.1| hypothetical protein SaccyDRAFT_4550 [Saccharomonospora cyanea
NA-134]
Length = 197
Score = 38.9 bits (89), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 37/141 (26%), Positives = 65/141 (46%), Gaps = 5/141 (3%)
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G++ +A P++L+ G + DA L V+ + +++ GG GFL
Sbjct: 59 GWIDLAPRHPGDTYPLELSGQLGWSVAS-DAVLARPATVRHDHPWPAH-QSLFGGDSGFL 116
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR-RAVFG 213
+ +G G G V L+ GEV+TV+ ++A +V V+++ P +++
Sbjct: 117 KH-YSGTGPLVFACRGPVDALTLKAGEVVTVNPLFVLAFPDTVQVRLRAIDPAEPQSIRT 175
Query: 214 GDNLVTAVVTGPGIVFIQSLP 234
G L V GPG V +Q+ P
Sbjct: 176 GKGLALDVA-GPGTVLVQTRP 195
>gi|422850001|ref|ZP_16896677.1| hypothetical protein HMPREF9382_1981 [Streptococcus sanguinis
SK115]
gi|422880913|ref|ZP_16927369.1| hypothetical protein HMPREF9389_0338 [Streptococcus sanguinis
SK355]
gi|325688889|gb|EGD30897.1| hypothetical protein HMPREF9382_1981 [Streptococcus sanguinis
SK115]
gi|332365613|gb|EGJ43372.1| hypothetical protein HMPREF9389_0338 [Streptococcus sanguinis
SK355]
Length = 231
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 36/153 (23%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
+G++ +A + +++P+ L + AFL D T++++ + + GG
Sbjct: 79 NGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G GQG A GS+ + L EV T+D + +VA + S+N I +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G+ +V G G V++QSL + +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225
>gi|50085091|ref|YP_046601.1| hypothetical protein ACIAD1953 [Acinetobacter sp. ADP1]
gi|49531067|emb|CAG68779.1| conserved hypothetical protein [Acinetobacter sp. ADP1]
Length = 239
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 45/216 (20%), Positives = 87/216 (40%), Gaps = 25/216 (11%)
Query: 25 FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL------- 77
F ++G + ++ +K+ +M M +E++ + G+ Q L
Sbjct: 4 FSLVGSPEPFLHVSMRRGDKIYCESNAMVMMESPLELKGRL----QGGLMQSLLRRFAND 59
Query: 78 ---FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
F + I +V G D + A +IL + + L AF+ + + V+
Sbjct: 60 ESLFQQHIEAV----RGDGDCLLSAALDGDMQILDVGATQY----LLSDGAFVAAHDSVQ 111
Query: 135 VSNTLDQRVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
V L+ + G GFL + +GQG + G++ + +E + +T+D +V
Sbjct: 112 VKARLNSSLGGAFFGNTGGFLVMETSGQGQVVVSGCGTLFELEVEPNKDVTIDNGHVVCW 171
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVF 229
S +N + + G NLV +V +G G+V
Sbjct: 172 DSRLNYNLSISTSQNSGFLG--NLVNSVTSGEGMVL 205
>gi|449887791|ref|ZP_21787043.1| hypothetical protein SMU104_06916 [Streptococcus mutans SA41]
gi|449252179|gb|EMC50166.1| hypothetical protein SMU104_06916 [Streptococcus mutans SA41]
Length = 237
Score = 38.5 bits (88), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 47/224 (20%), Positives = 95/224 (42%), Gaps = 12/224 (5%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS-----VV 86
+ +I L+ E + GSM + + S+ + N + G+ + + G+++TS +
Sbjct: 18 LVEITLETGEFAYIQRGSMVYHTPSVTL-NTKVNGRGSGLGKLVGAIGRSVTSGESFFIT 76
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
SDG + +A +++ ++L + AFL + R +
Sbjct: 77 QAVSNASDGKLALAPSMPGQVIALELGE--KQYRLNDGAFLALDGSAQYQMKAQSVGRAL 134
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
GG G GQG + GS+ + L+ E IT+D + +VA + +N I
Sbjct: 135 FGGQGGLFVMTTEGQGTLLANSFGSIKKIELQNQE-ITIDNAHVVAWSKDLNYDIHLENG 193
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
+++ G+ +V G G +++QSL + + + +T+ N
Sbjct: 194 FMQSIGTGEGVVN-TFRGTGEIYVQSLNLQQFAGVLQGFITNTN 236
>gi|169617449|ref|XP_001802139.1| hypothetical protein SNOG_11904 [Phaeosphaeria nodorum SN15]
gi|160703411|gb|EAT80948.2| hypothetical protein SNOG_11904 [Phaeosphaeria nodorum SN15]
Length = 271
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 43/181 (23%), Positives = 78/181 (43%), Gaps = 14/181 (7%)
Query: 46 VARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLA 105
A+PG+M MS ++ ++ + + + + L G +T PG + AP A
Sbjct: 87 TAKPGAMIAMSPTITLKG----QIKFSLKKALMGGDMTKSSYTGPGEL-----LLAP--A 135
Query: 106 RILPIDLAMFGG--ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
I I + GG + DAFL + VS Q + + EG K++GQG+
Sbjct: 136 AIGDITIIKLGGHEQWSVGKDAFLACTQGI-VSEYKSQGIGKAMFSGEGLFVYKISGQGI 194
Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
+ + G++++K+L+ E +D +VA ++ +G I + + LV
Sbjct: 195 LWCTSFGAIIRKDLQKDEKYIIDNGHLVAWNCKYVLERVASGGILSNIHAAEGLVYEKCA 254
Query: 224 G 224
G
Sbjct: 255 G 255
>gi|398347134|ref|ZP_10531837.1| hypothetical protein Lbro5_07894 [Leptospira broomii str. 5399]
Length = 223
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 49/98 (50%), Gaps = 9/98 (9%)
Query: 139 LDQRVRNVIGGIEG----FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
+D + + + G G FLR L+G+G I + G + + LEV VD IVA
Sbjct: 112 MDTKFQGMKGFFSGESLFFLR--LSGKGTLLIASYGGI--ELLEVDGDFIVDTGHIVAFE 167
Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
+N +I G + FGG+ LV A +G G ++IQ+
Sbjct: 168 EGLNYKITKFGGWKSFFFGGEGLV-ARFSGKGKLWIQT 204
>gi|448733514|ref|ZP_21715758.1| hypothetical protein C450_09578 [Halococcus salifodinae DSM 8989]
gi|445802751|gb|EMA53054.1| hypothetical protein C450_09578 [Halococcus salifodinae DSM 8989]
Length = 220
Score = 38.5 bits (88), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 13/100 (13%)
Query: 144 RNVIGGIEGFLRQKLTGQG-----------LAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
R GG++G L++K++G+G L AG V L+ + I+V+ + ++A
Sbjct: 53 RKSSGGLKGMLKKKVSGEGGVMMRAEGNGHLYLADAGKEVQILELDADDEISVNGNDVLA 112
Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
SVN IK I A GG L + GPG + I +
Sbjct: 113 FEESVNWDIKMMNSIAGASSGG--LFNVYLKGPGHIAITT 150
>gi|422879786|ref|ZP_16926251.1| hypothetical protein HMPREF9396_1638 [Streptococcus sanguinis
SK1059]
gi|422929630|ref|ZP_16962571.1| hypothetical protein HMPREF8573_1626 [Streptococcus sanguinis ATCC
29667]
gi|422932598|ref|ZP_16965529.1| hypothetical protein HMPREF9387_2177 [Streptococcus sanguinis
SK340]
gi|332365197|gb|EGJ42960.1| hypothetical protein HMPREF9396_1638 [Streptococcus sanguinis
SK1059]
gi|339614532|gb|EGQ19227.1| hypothetical protein HMPREF8573_1626 [Streptococcus sanguinis ATCC
29667]
gi|339618349|gb|EGQ22947.1| hypothetical protein HMPREF9387_2177 [Streptococcus sanguinis
SK340]
Length = 231
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 35/153 (22%), Positives = 70/153 (45%), Gaps = 8/153 (5%)
Query: 94 DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
+G++ +A + +++P+ L + AFL D T++++ + + GG
Sbjct: 79 NGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134
Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
G GQG A GS+ + L EV T+D + +VA + S+N + +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYHVHLENGFWQSI 193
Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
G+ +V G G V++QSL + +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225
>gi|448612787|ref|ZP_21662667.1| hypothetical protein C440_12789 [Haloferax mucosum ATCC BAA-1512]
gi|445739684|gb|ELZ91190.1| hypothetical protein C440_12789 [Haloferax mucosum ATCC BAA-1512]
Length = 220
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 40/157 (25%), Positives = 68/157 (43%), Gaps = 28/157 (17%)
Query: 94 DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
D FV AP+ L +D+A+ G ++ + + + D+ R
Sbjct: 4 DEFVDTHAPNEGGDAFELEHSKLLDVAL-DGTIMAKAGSMVGYTGDISFE-------RKS 55
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
GG++G L++K TG+G + A GS V L+ GE I+V+ + ++A +
Sbjct: 56 AGGLKGMLKKKATGEGTVMMEATGSGHLYLADQGKEVQILELDAGEEISVNGNDVLAFEN 115
Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ I+ I A GG L + GPG V I +
Sbjct: 116 TIEWDIRMMKSIAGASTGG--LFNVFLEGPGHVAITT 150
>gi|302421758|ref|XP_003008709.1| DUF124 domain-containing protein [Verticillium albo-atrum VaMs.102]
gi|261351855|gb|EEY14283.1| DUF124 domain-containing protein [Verticillium albo-atrum VaMs.102]
Length = 469
Score = 38.5 bits (88), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 33/109 (30%), Positives = 57/109 (52%), Gaps = 2/109 (1%)
Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
DA++ S V + + Q + I EG K++G GL +I + G++++K+L GE
Sbjct: 346 DAYVASTQGV-ICDYKRQGLGKAIFSGEGLFVYKISGTGLLWITSFGAIIRKDLIPGEKY 404
Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VD +VA + ++ +G I + G+ LV TGPG V++Q+
Sbjct: 405 IVDNGHLVAWNTKYIIERVASGGIISGISSGEGLVCK-FTGPGTVYMQT 452
>gi|398345153|ref|ZP_10529856.1| hypothetical protein LinasL1_19342 [Leptospira inadai serovar Lyme
str. 10]
Length = 223
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 9/90 (10%)
Query: 149 GIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
G++GF KL+G+G I + G + + LEV VD IVA +N +I
Sbjct: 118 GMKGFFSGESLFFLKLSGKGTLLIASYGGI--ELLEVDGDFIVDTGHIVAFEEGLNYKIA 175
Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
G + FGG+ LV A +G G ++IQ+
Sbjct: 176 KFGGWKSFFFGGEGLV-ARFSGKGKLWIQT 204
>gi|384567618|ref|ZP_10014722.1| hypothetical protein SacglDRAFT_03821 [Saccharomonospora glauca
K62]
gi|384523472|gb|EIF00668.1| hypothetical protein SacglDRAFT_03821 [Saccharomonospora glauca
K62]
Length = 197
Score = 38.5 bits (88), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 61/140 (43%), Gaps = 3/140 (2%)
Query: 95 GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
G++ +A P++L+ G + DA L V+ + + + GG GFL
Sbjct: 59 GWIDLAPRHPGDAYPLELSGQLGWSVAS-DAVLARPATVRHDHPWPAH-QTLFGGDSGFL 116
Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
+ +G G G V +L+ GEV+TV+ ++A +V V+++ P
Sbjct: 117 KH-YSGTGPLVFACRGPVDALSLKAGEVVTVNPLFVLAFPDTVQVRLRALDPAEPQSIKT 175
Query: 215 DNLVTAVVTGPGIVFIQSLP 234
+ V GPG V +Q+ P
Sbjct: 176 GTGLALDVAGPGTVLVQTRP 195
>gi|422847659|ref|ZP_16894342.1| hypothetical protein HMPREF9381_2001 [Streptococcus sanguinis SK72]
gi|325686657|gb|EGD28683.1| hypothetical protein HMPREF9381_2001 [Streptococcus sanguinis SK72]
Length = 238
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 54/213 (25%), Positives = 93/213 (43%), Gaps = 23/213 (10%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
+ +I L E V + GSM + + N+ + Q +FGK I +V R+
Sbjct: 18 LAEIALDASESVRIQTGSMIYHT-----PNVVLNAKVNAESQSIFGKAIQAVG-RSMASG 71
Query: 94 DG-FVGIAAP-------SLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
+G F+ A +LA +P + L + + AFL D S TL+++
Sbjct: 72 EGVFITEATAQSNNGRLALAPNVPGQIVALELGEKQYRLNDGAFLAM--DGSASYTLERQ 129
Query: 143 V--RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
R + GG GF GQG + + GS+ + L + IT+D + +VA + +
Sbjct: 130 SVGRALFGGQGGFYVMSTQGQGTLLVNSFGSIQKVELH-NDRITIDNAHVVAWSRELEYD 188
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
I + +++ G+ +V G G ++IQSL
Sbjct: 189 IHLENGLWQSMGTGEGVVNTFY-GTGEIYIQSL 220
>gi|387784941|ref|YP_006071024.1| hypothetical protein SALIVA_1894 [Streptococcus salivarius JIM8777]
gi|338745823|emb|CCB96189.1| uncharacterized conserved protein [Streptococcus salivarius
JIM8777]
Length = 231
Score = 38.5 bits (88), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 38/164 (23%), Positives = 74/164 (45%), Gaps = 9/164 (5%)
Query: 82 ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
IT VV + ++G++ +A S +++P+ L G + D +++ Q
Sbjct: 70 ITQVVSQ---SNNGYIALAPDSPGQVIPLYL---GEKQYRLNDGAFLALDGTAYYTMESQ 123
Query: 142 RV-RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+ + + GG GF GQG A GS+ + L EV T+D + +VA + +++
Sbjct: 124 SIGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYD 182
Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
I +++ G+ +V G G +++QSL + + R
Sbjct: 183 IHLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225
>gi|336320650|ref|YP_004600618.1| hypothetical protein Celgi_1531 [[Cellvibrio] gilvus ATCC 13127]
gi|336104231|gb|AEI12050.1| protein of unknown function DUF124 [[Cellvibrio] gilvus ATCC 13127]
Length = 213
Score = 38.5 bits (88), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 52/220 (23%), Positives = 96/220 (43%), Gaps = 22/220 (10%)
Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG--MWQWLFGK 80
+PF+ + ++V ++ + Q V+AR G+M +G + I+ VG + + G+
Sbjct: 1 MPFEKV--NSKVVRVPVTAQSPVLARRGAMLGYTGQVAFRPIHGQGQGVGGMIGTAMAGE 58
Query: 81 TITSVVLRNPGPS-DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS--- 136
+ + G GF G+ + I+L G L + D L +++ S
Sbjct: 59 SNPMMATEGTGSVLYGFRGL------HVTVIELT--GDSLTVEADRLLAHDANLQTSVEF 110
Query: 137 ---NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
+ VR + G +G +++GQG +L+ G L GE + VD V
Sbjct: 111 IGQGGIRSAVRGAMTG-QGLFTTRISGQGSVALLSHGGTFPLQL-TGETVGVDPQAYVGH 168
Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAV-VTGPGIVFIQS 232
T ++NV +K + +R V G + V+G G V++Q+
Sbjct: 169 TGALNVDLKASVGLRDIVGRGSGEAFQLHVSGHGTVYVQA 208
>gi|397645845|gb|EJK77010.1| hypothetical protein THAOC_01190 [Thalassiosira oceanica]
Length = 439
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 45/159 (28%), Positives = 70/159 (44%), Gaps = 18/159 (11%)
Query: 83 TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
T+ V P D V APS + + + G EL A+L V T++
Sbjct: 219 TTEVRSTAAPGDVLV---APSDPGGVALHRMVRGEELTLTSGAYLAGDGGV----TVNSD 271
Query: 143 VRNVIGGIEGFLRQ-----KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
++N G F + +GQG + A GS+++ L+ GE VD +VA ++++
Sbjct: 272 MQNPFSGFTNFSGTGVFLLRASGQGNLALSAYGSIIKYTLQPGEKRAVDNGHLVAWSATM 331
Query: 198 NVQIKYNGPIRRAVFG----GDNLVTAVVTGPGIVFIQS 232
Q+ + VFG G+ L GPGIV+IQS
Sbjct: 332 RTQMVL-ASRKAGVFGSLTSGEGLHVE-FQGPGIVYIQS 368
>gi|365960834|ref|YP_004942401.1| hypothetical protein FCOL_08995 [Flavobacterium columnare ATCC
49512]
gi|365737515|gb|AEW86608.1| hypothetical protein FCOL_08995 [Flavobacterium columnare ATCC
49512]
Length = 227
Score = 38.5 bits (88), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 49/111 (44%), Gaps = 4/111 (3%)
Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
Q AFL S ++ + + + G E K +G+G + + G ++ N+E G
Sbjct: 101 QNSAFLASSENITIDSKFQGLTKGFFSG-ENLFLIKCSGEGDLWFNSYGGIIPINVEDGY 159
Query: 182 VITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
VI D IVA T + + G + F G+ LV G G ++IQ+
Sbjct: 160 VI--DTGHIVAFTEGLTYDVTRVGGYKSLFFSGEGLV-CRFQGKGRIWIQT 207
>gi|262281812|ref|ZP_06059581.1| conserved hypothetical protein [Streptococcus sp. 2_1_36FAA]
gi|262262266|gb|EEY80963.1| conserved hypothetical protein [Streptococcus sp. 2_1_36FAA]
Length = 236
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 47/207 (22%), Positives = 95/207 (45%), Gaps = 14/207 (6%)
Query: 35 CQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITS----VVLRN 89
+I+L EK + GSM + S S+++ + + +G G+++ S +
Sbjct: 19 VEIVLDKGEKAYIQRGSMVYHSSSVKLRTKLNAKGSGLGKLVKAVGRSMVSGESTFITEA 78
Query: 90 PGPSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNV 146
+DG + AP++ +++P+ L + AFL D T++++ R +
Sbjct: 79 LSDADGGLLALAPNVPGQVMPLYLGEH--QYRLNDGAFLAL--DGSAFYTMERQSIGRAL 134
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
GG GF G+G + A G++ + L+ E +T+D + +VA + ++ I
Sbjct: 135 FGGQGGFFVMTTQGEGTLLVNAFGAIKKIELKDQE-MTIDNAHVVAWSKDLHYDIHLENG 193
Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSL 233
++V G+ +V G G +++QSL
Sbjct: 194 FLQSVGTGEGVVN-TFRGTGEIYVQSL 219
>gi|170077980|ref|YP_001734618.1| hypothetical protein SYNPCC7002_A1366 [Synechococcus sp. PCC 7002]
gi|169885649|gb|ACA99362.1| conserved hypothetical protein TIGR00266 [Synechococcus sp. PCC
7002]
Length = 225
Score = 38.1 bits (87), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 52/204 (25%), Positives = 92/204 (45%), Gaps = 13/204 (6%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
+ + L E+++A+ G+M MSG S + G+ + L G+++ V R
Sbjct: 13 IAHVTLDAGEEIIAQAGAMVAMSGNINASTTLRKGKGGGIMGGLKRMLAGESLFLSVFRA 72
Query: 90 PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
P P+ AP L L + M G E + Q ++L S +V + Q ++ G
Sbjct: 73 PLPNSEI--WFAPKLMGDLLL-YEMRGDEFVVQASSYLASGPNVDIDLGW-QGFKSFFSG 128
Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIR 208
E ++GQG + + G++ + +++ GE + VD IVA S++ + K N
Sbjct: 129 -ESIFWLSISGQGPLLVTSFGAIYEVDVD-GEYV-VDTGHIVAFEKSLSFTVGKANPSWI 185
Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
A GG+ LV G G ++ Q+
Sbjct: 186 GAFLGGEGLVCR-FRGKGKLYCQT 208
>gi|456986310|gb|EMG21906.1| TIGR00266-like family protein [Leptospira interrogans serovar
Copenhageni str. LT2050]
Length = 124
Score = 38.1 bits (87), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 6/112 (5%)
Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
+ A+L S + + +R+ GG EG + G G F+ A G ++ ++V
Sbjct: 2 ESGAYLASDETISIKPMFGG-IRSFFGG-EGIFLLEAFGNGKLFLNAYGGIIP--IDVQG 57
Query: 182 VITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
T+D IVA S+ +I K G + FGG+ LV TG G V IQ+
Sbjct: 58 SYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFGGEGLVMEF-TGHGRVLIQT 108
>gi|332707811|ref|ZP_08427838.1| conserved hypothetical protein TIGR00266 [Moorea producens 3L]
gi|332353514|gb|EGJ33027.1| conserved hypothetical protein TIGR00266 [Moorea producens 3L]
Length = 225
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 54/207 (26%), Positives = 94/207 (45%), Gaps = 13/207 (6%)
Query: 31 EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
++ + +++L ++ VA G M MSG S + G+ + + G+++ V
Sbjct: 10 DSAIARVVLDENQEFVAEAGCMIAMSGYINASTTLRQGKGGGIFGGLKRLVGGESLFLSV 69
Query: 87 LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
R+P G + +A + IL M G EL+ Q ++L S + V + Q +++
Sbjct: 70 FRSP-VGGGEIFLAPKFIGDILVYQ--MNGQELVVQATSYLASESGVDIDLGF-QGFKSL 125
Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
G E ++G G + + G++ + +E GE I VD IVA S+N +I G
Sbjct: 126 FSG-ESIFWLTISGYGSVLLNSFGAIYEVPVE-GEYI-VDTGHIVAFEKSLNFEITKPGS 182
Query: 207 IRRAVF-GGDNLVTAVVTGPGIVFIQS 232
F GG+ LV G G V+ Q+
Sbjct: 183 SWIGAFLGGEGLVCR-FKGQGKVYCQT 208
>gi|296141808|ref|YP_003649051.1| hypothetical protein [Tsukamurella paurometabola DSM 20162]
gi|296029942|gb|ADG80712.1| protein of unknown function DUF124 [Tsukamurella paurometabola DSM
20162]
Length = 226
Score = 38.1 bits (87), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 50/214 (23%), Positives = 91/214 (42%), Gaps = 32/214 (14%)
Query: 34 VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-----WLFGKTITSVVLR 88
+ + L P E + G+M S M +++ + + G+ Q L G++
Sbjct: 13 MARCHLAPNEPMTVESGAMVAHSAGMNLQS----QAQGGIMQGLKRSLLSGESFFVSTFT 68
Query: 89 NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPD--------AFLCSVNDVKVSNTLD 140
P S G+V +A LP D+A+ + +PD +L S + V+ S
Sbjct: 69 AP-ASGGWVDVAG-----TLPGDIAL----VQIRPDRPFYITRGGWLASAHGVQTSAQFG 118
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
+ GG EG + +GQG + G++ +L+ GE + +D +VA S+
Sbjct: 119 G-AKTFFGG-EGAFGLQASGQGEVLVACYGAIDVIDLQPGEQVVIDTGHVVAYDLSIRFS 176
Query: 201 IKY--NGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
++ G ++ G+ +V TGPG V +Q+
Sbjct: 177 MRRAVQGKWMQSAKSGEGMVFE-FTGPGRVLLQT 209
>gi|433589647|ref|YP_007279143.1| hypothetical protein Natpe_0305 [Natrinema pellirubrum DSM 15624]
gi|448332759|ref|ZP_21521986.1| hypothetical protein C488_05282 [Natrinema pellirubrum DSM 15624]
gi|448379084|ref|ZP_21561048.1| hypothetical protein C478_01650 [Haloterrigena thermotolerans DSM
11522]
gi|433304427|gb|AGB30239.1| hypothetical protein Natpe_0305 [Natrinema pellirubrum DSM 15624]
gi|445625372|gb|ELY78733.1| hypothetical protein C488_05282 [Natrinema pellirubrum DSM 15624]
gi|445665646|gb|ELZ18322.1| hypothetical protein C478_01650 [Haloterrigena thermotolerans DSM
11522]
Length = 221
Score = 38.1 bits (87), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 13/103 (12%)
Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSC 189
+ + + GGI GFL++K TG+G + A G+ + L+ G+ I+V+ +
Sbjct: 52 EGISSAEGGITGFLKEKATGEGTPVMEATGTGHLYLADQEKKIQLLELDAGQSISVNGND 111
Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
++A SV+ +I+ I A F L + GPG V I +
Sbjct: 112 VLAFEPSVDYEIRTVESI--AGFSAGGLTNVFLEGPGTVAITT 152
>gi|325681067|ref|ZP_08160599.1| hypothetical protein CUS_5721 [Ruminococcus albus 8]
gi|324107296|gb|EGC01580.1| hypothetical protein CUS_5721 [Ruminococcus albus 8]
Length = 251
Score = 37.7 bits (86), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 58/125 (46%), Gaps = 8/125 (6%)
Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQR---VRNVIGGI---EGFLRQKLTGQGLAFIL-A 168
G ++ Q A DV+V+ + + +IGG E ++ + TGQG ++
Sbjct: 60 GNGIILQAGAMQVMTGDVQVATNVKGAGDFFKKMIGGSVTGESAIKPRYTGQGFVYLEPT 119
Query: 169 GGSVVQKNLEV-GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGI 227
++ ++L+ G +T+D +A SVNV + + A GG+ L + G GI
Sbjct: 120 YKYIILEDLDTWGGAVTIDDGMFLACDDSVNVGVVARSNLSSAALGGEGLFNTSLNGKGI 179
Query: 228 VFIQS 232
V ++S
Sbjct: 180 VALES 184
>gi|317498198|ref|ZP_07956499.1| hypothetical protein HMPREF0996_01480 [Lachnospiraceae bacterium
5_1_63FAA]
gi|316894553|gb|EFV16734.1| hypothetical protein HMPREF0996_01480 [Lachnospiraceae bacterium
5_1_63FAA]
Length = 696
Score = 37.7 bits (86), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 52/233 (22%), Positives = 100/233 (42%), Gaps = 25/233 (10%)
Query: 7 STPFQPYVYQS-----PQDAVIPFQILGGEAQVCQIMLK---PQEKVVARPGSMCFMSGS 58
S PF+ YQ P+DA + F + V + LK + V + G+M +M+G
Sbjct: 441 SGPFEVIEYQRDLSVMPEDAQLAF--FCSQMNVRKRQLKCELSRGNVTIQSGAMQWMAGD 498
Query: 59 MEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
+ + + + + + GK V++ DG + + P+ I+ +DL +G
Sbjct: 499 VSVTTGIKGAGDF-LSKTIRGKVTGESVIKPEYTGDGTL-VLEPTYKHIILLDLDEWGNS 556
Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAG---GSVVQK 175
++ FL + +K + + G EGF L G G+ I + +V+
Sbjct: 557 IVLDDGLFLACESTLKQKAVRRKTFSSAFAGGEGFFNLGLKGSGVVCIESDCPREELVEI 616
Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIV 228
LE +V+ +D + +A + S++ + RA G +L+ + +G G+V
Sbjct: 617 TLE-DDVVKIDGNLAIAWSGSLDF------TVERA---GKSLLGSAASGEGLV 659
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.325 0.140 0.417
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,317,950,768
Number of Sequences: 23463169
Number of extensions: 175345379
Number of successful extensions: 490460
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 682
Number of HSP's successfully gapped in prelim test: 476
Number of HSP's that attempted gapping in prelim test: 488090
Number of HSP's gapped (non-prelim): 1256
length of query: 281
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 140
effective length of database: 9,050,888,538
effective search space: 1267124395320
effective search space used: 1267124395320
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 76 (33.9 bits)