BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023516
         (281 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|449469369|ref|XP_004152393.1| PREDICTED: uncharacterized protein LOC101222761 [Cucumis sativus]
          Length = 281

 Score =  511 bits (1316), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 241/281 (85%), Positives = 270/281 (96%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTP QPYVYQSPQDA+IPFQILGGEAQV QIMLKPQEKV+ARPGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPIQPYVYQSPQDAMIPFQILGGEAQVLQIMLKPQEKVIARPGSMCFMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN+++PENEVG+W WLFGK++TS+VL+N G SDG+VGIAAPSLARILPID+AMFGGELL
Sbjct: 61  MENVFLPENEVGVWHWLFGKSVTSIVLQNSGTSDGYVGIAAPSLARILPIDMAMFGGELL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQPDAFLCSVNDVKV+NT+D RVRNV+ G+EGFLRQKL+GQGLAFIL GGSVVQKNLEVG
Sbjct: 121 CQPDAFLCSVNDVKVNNTIDHRVRNVVPGVEGFLRQKLSGQGLAFILGGGSVVQKNLEVG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EVI VD+SCIVA+T+S++VQIKYNGP+RRAVFGG+NLVTA +TGPGIVFIQSLPF RLSQ
Sbjct: 181 EVIAVDISCIVAMTTSIDVQIKYNGPVRRAVFGGENLVTATLTGPGIVFIQSLPFQRLSQ 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMRENPKFFVQ+A+FFFLAYVVIVSS+ILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFVQLALFFFLAYVVIVSSIILTDV 281


>gi|363808166|ref|NP_001242226.1| uncharacterized protein LOC100799063 [Glycine max]
 gi|255639877|gb|ACU20231.1| unknown [Glycine max]
 gi|255640131|gb|ACU20356.1| unknown [Glycine max]
          Length = 281

 Score =  506 bits (1303), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 242/281 (86%), Positives = 262/281 (93%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQSPQDA+IPFQILGGEAQV QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSPQDAIIPFQILGGEAQVVQIMLKPQEKIIAKPGSMCFMSGSVE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y+PENEVG+WQWLFGKTITS+VLRN GPSDGFVGIAAP  ARILPIDLAMF GE+L
Sbjct: 61  MENAYLPENEVGIWQWLFGKTITSIVLRNSGPSDGFVGIAAPYFARILPIDLAMFNGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQPDAFLCSVNDVKVSN +DQR RN +   EGFLRQKL+GQGLAFILAGGSVVQKNLE+G
Sbjct: 121 CQPDAFLCSVNDVKVSNIVDQRGRNAVASAEGFLRQKLSGQGLAFILAGGSVVQKNLEIG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EV+ VDVS IVAVTS+VN+QIKYNGP RR +FGGDN VTAV+TGPGIVFIQSLPFHR SQ
Sbjct: 181 EVLAVDVSSIVAVTSAVNIQIKYNGPARRTMFGGDNAVTAVLTGPGIVFIQSLPFHRFSQ 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMRENPKFF+QIA+FFFLA VVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFIQIAVFFFLACVVIVSSLILTDV 281


>gi|363808252|ref|NP_001241981.1| uncharacterized protein LOC100809243 [Glycine max]
 gi|255636372|gb|ACU18525.1| unknown [Glycine max]
          Length = 281

 Score =  506 bits (1302), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 242/281 (86%), Positives = 262/281 (93%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQSPQDA+IPFQILGGEAQV QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSPQDAIIPFQILGGEAQVVQIMLKPQEKIIAKPGSMCFMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y+PENEVG+WQWLFGKTITS+VL N GPSDGFVGIAAP  ARILPIDLAMF GE+L
Sbjct: 61  MENAYLPENEVGIWQWLFGKTITSIVLCNSGPSDGFVGIAAPYFARILPIDLAMFNGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQPDAFLCSVNDVKVSN +DQR RN +   EGFLRQKL+GQGLAFILAGGSVVQKNLE+G
Sbjct: 121 CQPDAFLCSVNDVKVSNIVDQRGRNAVASAEGFLRQKLSGQGLAFILAGGSVVQKNLEIG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EV+ VDVS IVAVTS+VNVQIKYNGP RR +FGGDN VTA++TGP IVFIQSLPFHRLSQ
Sbjct: 181 EVLAVDVSSIVAVTSTVNVQIKYNGPARRTMFGGDNAVTALLTGPSIVFIQSLPFHRLSQ 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMRENPKFF+QIA+FFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFIQIAVFFFLAYVVIVSSLILTDV 281


>gi|224138636|ref|XP_002326652.1| predicted protein [Populus trichocarpa]
 gi|118487466|gb|ABK95560.1| unknown [Populus trichocarpa]
 gi|222833974|gb|EEE72451.1| predicted protein [Populus trichocarpa]
          Length = 281

 Score =  498 bits (1282), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 240/281 (85%), Positives = 260/281 (92%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQ+ QDAVIPFQI GGEAQV QIMLKPQEKV+A+PGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQNHQDAVIPFQIFGGEAQVVQIMLKPQEKVIAKPGSMCFMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN++I ENEVG+WQW FGK +TSVV  NPGPSDGFVGI+APSL RILPIDLA FGGE+L
Sbjct: 61  MENVFIHENEVGVWQWFFGKAVTSVVFHNPGPSDGFVGISAPSLGRILPIDLAKFGGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQPDAFLCSV+DVKVSNT+DQR RNV+  IEGFLRQKLTGQGLAFILAGGSV QK LEVG
Sbjct: 121 CQPDAFLCSVSDVKVSNTVDQRARNVMPSIEGFLRQKLTGQGLAFILAGGSVAQKVLEVG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EV+ VDVSCI A+ ++VNVQIKYNGP+RRAVFGGDNLVTA +TGPGIVFIQSLPF R SQ
Sbjct: 181 EVLAVDVSCIAALNTTVNVQIKYNGPVRRAVFGGDNLVTATLTGPGIVFIQSLPFRRFSQ 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMR+NPKFFVQIAIFFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRDNPKFFVQIAIFFFLAYVVIVSSLILTDV 281


>gi|224074179|ref|XP_002304288.1| predicted protein [Populus trichocarpa]
 gi|222841720|gb|EEE79267.1| predicted protein [Populus trichocarpa]
          Length = 281

 Score =  497 bits (1279), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 240/281 (85%), Positives = 258/281 (91%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS Q AVIPFQILGGEAQ+ QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSHQGAVIPFQILGGEAQMVQIMLKPQEKIIAKPGSMCFMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN+ IPENEVG+WQW FGK +TSV+ RNPGPSDGFVGIA PSLARILPIDLA FGGE+L
Sbjct: 61  MENVLIPENEVGVWQWFFGKAVTSVIFRNPGPSDGFVGIATPSLARILPIDLAKFGGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           C PDAFLCS+NDVKVSN LD R RNV+  IEGFLRQKLTGQGLAFILAGGSV QK LEVG
Sbjct: 121 CLPDAFLCSINDVKVSNALDHRARNVMPNIEGFLRQKLTGQGLAFILAGGSVAQKVLEVG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EV+ VDVSCIVA+ ++VNVQIKYNGP+RRAVFGGDNLVTA +TGPGIVFIQSLPF   SQ
Sbjct: 181 EVLAVDVSCIVALNTTVNVQIKYNGPVRRAVFGGDNLVTATLTGPGIVFIQSLPFQLFSQ 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281


>gi|357476813|ref|XP_003608692.1| hypothetical protein MTR_4g100740 [Medicago truncatula]
 gi|355509747|gb|AES90889.1| hypothetical protein MTR_4g100740 [Medicago truncatula]
          Length = 281

 Score =  492 bits (1267), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 236/281 (83%), Positives = 260/281 (92%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS QDA+IPFQILGGE+QV QIMLKP EK++A+PGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y+PENEVG+WQWLFGKT+T++V+RN GPSDGFVGIAAP  ARILPIDLA F GE+L
Sbjct: 61  MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQPDAFLCSVNDVKVSNT+DQR RNV+ G E FLRQKL+GQGLAFIL GGSVVQK LEVG
Sbjct: 121 CQPDAFLCSVNDVKVSNTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EV+ VDVSCIVAVTS+V++QIKYNGP RR +FGGDN VTAV+TGPGIVFIQSLPF R SQ
Sbjct: 181 EVLAVDVSCIVAVTSTVDIQIKYNGPARRTMFGGDNAVTAVLTGPGIVFIQSLPFPRFSQ 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMRENPKFF+QIA+FFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFIQIALFFFLAYVVIVSSLILTDV 281


>gi|388508380|gb|AFK42256.1| unknown [Medicago truncatula]
          Length = 281

 Score =  489 bits (1258), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 234/281 (83%), Positives = 260/281 (92%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS QDA+IPFQILGGE+QV QIMLKP EK++A+PGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQQDAIIPFQILGGESQVVQIMLKPHEKIIAKPGSMCFMSGSVE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y+PENEVG+WQWLFGKT+T++V+RN GPSDGFVGIAAP  ARILPIDLA F GE+L
Sbjct: 61  MENAYLPENEVGIWQWLFGKTVTNIVVRNSGPSDGFVGIAAPYFARILPIDLATFNGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQPDAFLCSVNDVKVS+T+DQR RNV+ G E FLRQKL+GQGLAFIL GGSVVQK LEVG
Sbjct: 121 CQPDAFLCSVNDVKVSDTVDQRGRNVVAGAEVFLRQKLSGQGLAFILGGGSVVQKILEVG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EV+ VDVSCIVAVTS+V++QIKYNGP RR +FGGDN VTAV+TGPGIVFIQSLPF R S+
Sbjct: 181 EVLAVDVSCIVAVTSTVDIQIKYNGPARRTMFGGDNAVTAVLTGPGIVFIQSLPFPRFSR 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMRENPKFF+QIA+FFFLAYVVIVSSLILTDV
Sbjct: 241 RIARAVTSPNMRENPKFFIQIALFFFLAYVVIVSSLILTDV 281


>gi|388508964|gb|AFK42548.1| unknown [Lotus japonicus]
          Length = 281

 Score =  488 bits (1257), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 233/281 (82%), Positives = 260/281 (92%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQSPQD +IPFQILGGE+QV QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSPQDEIIPFQILGGESQVVQIMLKPQEKIIAKPGSMCFMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y+PENEVG  QWLFGKTITS+VL N GPSDGFVGIAAP  ARILPIDLAMF G +L
Sbjct: 61  MENAYLPENEVGFRQWLFGKTITSIVLCNSGPSDGFVGIAAPYFARILPIDLAMFNGGIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQPDAFLCSVNDVKVSNT DQR +NV+ G E FLRQKL+GQG+AFILAGGSVVQKNLE+G
Sbjct: 121 CQPDAFLCSVNDVKVSNTFDQRGQNVVAGAERFLRQKLSGQGVAFILAGGSVVQKNLEIG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EV+ VDVSCIVAVTS+VNVQIK NGP RR +FGGD++VTA++TGPGIVFIQSLPFHRLSQ
Sbjct: 181 EVLAVDVSCIVAVTSTVNVQIKNNGPARRTMFGGDHVVTALLTGPGIVFIQSLPFHRLSQ 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMRENPKF++QIA+FF +AYVVI+SSL+LTDV
Sbjct: 241 RIARAVTSPNMRENPKFYIQIAVFFLMAYVVILSSLLLTDV 281


>gi|388499702|gb|AFK37917.1| unknown [Lotus japonicus]
          Length = 281

 Score =  486 bits (1252), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 232/281 (82%), Positives = 259/281 (92%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQSPQD +IPFQILGGE+QV QIMLKPQEK++A+PGSMCFMSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSPQDEIIPFQILGGESQVVQIMLKPQEKIIAKPGSMCFMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y+PENEVG  QWLFGKTITS+VL N GPSDGFVGI AP  ARILPIDLAMF G +L
Sbjct: 61  MENAYLPENEVGFRQWLFGKTITSIVLCNSGPSDGFVGIVAPYFARILPIDLAMFNGGIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQPDAFLCSVNDVKVSNT DQR +NV+ G E FLRQKL+GQG+AFILAGGSVVQKNLE+G
Sbjct: 121 CQPDAFLCSVNDVKVSNTFDQRGQNVVAGAERFLRQKLSGQGVAFILAGGSVVQKNLEIG 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EV+ VDVSCIVAVTS+VNVQIK NGP RR +FGGD++VTA++TGPGIVFIQSLPFHRLSQ
Sbjct: 181 EVLAVDVSCIVAVTSTVNVQIKNNGPARRTMFGGDHVVTALLTGPGIVFIQSLPFHRLSQ 240

Query: 241 RIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIARAVTSPNMRENPKF++QIA+FF +AYVVI+SSL+LTDV
Sbjct: 241 RIARAVTSPNMRENPKFYIQIAVFFLMAYVVILSSLLLTDV 281


>gi|297794473|ref|XP_002865121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297310956|gb|EFH41380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1054

 Score =  457 bits (1176), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 215/282 (76%), Positives = 251/282 (89%), Gaps = 1/282 (0%)

Query: 1    MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
            MAAPFFSTPFQPYVYQS QD + PFQILGGE+QV QIMLK +EKV+A+P SMC+MSGS+E
Sbjct: 773  MAAPFFSTPFQPYVYQSQQDTITPFQILGGESQVVQIMLKSEEKVIAKPASMCYMSGSIE 832

Query: 61   MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
            MEN Y PE EVG+ QW+ GK+++SV+LRN G +DGFVGIAAP LARILPIDLAMFGGE+L
Sbjct: 833  MENTYTPEQEVGVLQWILGKSVSSVLLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEIL 892

Query: 121  CQPDAFLCSVNDVKVSNTLDQRVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
            CQPDAFLCSV+DVK+ N++DQR RN++  G EGFLRQ+L+GQGLAFILAGGSV+QK LEV
Sbjct: 893  CQPDAFLCSVHDVKIVNSVDQRARNIVAAGAEGFLRQRLSGQGLAFILAGGSVLQKVLEV 952

Query: 180  GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
            GEV ++DVSCI A+T S++VQIK N P+RRAVFGGDN+V A +TGPGIVFIQSLPFHRLS
Sbjct: 953  GEVFSIDVSCIAALTPSIDVQIKNNAPLRRAVFGGDNVVMATLTGPGIVFIQSLPFHRLS 1012

Query: 240  QRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
            QRIAR+VTSPNMRENPK  +QIAIF FLAY VI+SSLILT+V
Sbjct: 1013 QRIARSVTSPNMRENPKLLIQIAIFAFLAYAVILSSLILTEV 1054


>gi|18414878|ref|NP_567527.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
           thaliana]
 gi|28393695|gb|AAO42260.1| unknown protein [Arabidopsis thaliana]
 gi|28973435|gb|AAO64042.1| unknown protein [Arabidopsis thaliana]
 gi|332658487|gb|AEE83887.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
           thaliana]
          Length = 285

 Score =  456 bits (1172), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 219/285 (76%), Positives = 254/285 (89%), Gaps = 4/285 (1%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61  MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNV-IGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
           CQPDAFLCSV+DVKV NT+ Q  R RN+   G EG LRQ+L+GQGLAFI+AGGSVVQKNL
Sbjct: 121 CQPDAFLCSVHDVKVVNTVYQRHRARNIAAAGAEGVLRQRLSGQGLAFIIAGGSVVQKNL 180

Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
           EVGEV+T+DVSCI A+T S+N QIKYN  P+RRAVFGGDN+VTA +TGPGIVFIQSLPFH
Sbjct: 181 EVGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFH 240

Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RLSQRIAR+VTSPNMRENP+  VQI +F FLAYVVIVSSLILT++
Sbjct: 241 RLSQRIARSVTSPNMRENPRLLVQIGLFIFLAYVVIVSSLILTEM 285


>gi|22327636|ref|NP_199553.2| tryptophan RNA-binding attenuator-like protein [Arabidopsis
           thaliana]
 gi|19424009|gb|AAL87286.1| unknown protein [Arabidopsis thaliana]
 gi|21280831|gb|AAM45111.1| unknown protein [Arabidopsis thaliana]
 gi|332008124|gb|AED95507.1| tryptophan RNA-binding attenuator-like protein [Arabidopsis
           thaliana]
          Length = 282

 Score =  456 bits (1172), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 216/282 (76%), Positives = 250/282 (88%), Gaps = 1/282 (0%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS QD + PFQILGGE+QV QIMLK +EKV+A+P SMC+MSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQQDTITPFQILGGESQVVQIMLKSEEKVIAKPASMCYMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y PE EVG+ QW+ GK+++SVVLRN G +DGFVGIAAP LARILPIDLAMFGGE+L
Sbjct: 61  MENTYTPEQEVGVLQWILGKSVSSVVLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
           CQPDAFLCSV+DVKV N++DQR RN++  G EGFLRQ+L+GQGLAFILAGGSVVQK LEV
Sbjct: 121 CQPDAFLCSVHDVKVVNSVDQRARNIVAAGAEGFLRQRLSGQGLAFILAGGSVVQKVLEV 180

Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           GEV ++DVSCI A+T S++V+IK N P RRA+FGGDN+V A +TGPGIVFIQSLPFHRLS
Sbjct: 181 GEVFSIDVSCIAALTPSIDVRIKNNAPFRRALFGGDNVVMATLTGPGIVFIQSLPFHRLS 240

Query: 240 QRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           QRIAR+VTSPNMRENPK  +QIAIF FLAY VI+SSLILT+V
Sbjct: 241 QRIARSVTSPNMRENPKLLIQIAIFAFLAYAVILSSLILTEV 282


>gi|297800372|ref|XP_002868070.1| hypothetical protein ARALYDRAFT_493136 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313906|gb|EFH44329.1| hypothetical protein ARALYDRAFT_493136 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 285

 Score =  454 bits (1168), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 219/285 (76%), Positives = 254/285 (89%), Gaps = 4/285 (1%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS++
Sbjct: 1   MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61  MDNNYTPEQEVGVVQWILGKSVSSIVLRNNGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
           CQPDAFLCSV+DVKV N+L Q  R RN+   G EGFLRQ+L+GQGLAFI+AGGSVVQKNL
Sbjct: 121 CQPDAFLCSVHDVKVVNSLYQRHRARNIAQAGAEGFLRQRLSGQGLAFIIAGGSVVQKNL 180

Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
           EVGEV+T DVSCI A+T S+N QIKYN  P+RRAVFGG+NLVTA +TGPGIVFIQSLPFH
Sbjct: 181 EVGEVLTTDVSCIAALTPSINFQIKYNAAPVRRAVFGGENLVTATLTGPGIVFIQSLPFH 240

Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RLSQRIAR+VTSPNMRENP+  VQI +F FLAYVVIVSSLILT++
Sbjct: 241 RLSQRIARSVTSPNMRENPRLLVQIGLFIFLAYVVIVSSLILTEM 285


>gi|21554803|gb|AAM63696.1| unknown [Arabidopsis thaliana]
          Length = 285

 Score =  452 bits (1163), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 217/285 (76%), Positives = 253/285 (88%), Gaps = 4/285 (1%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61  MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNV-IGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
           CQPDAFLCSV+DVKV N++ Q  R RN+   G EG LRQ+L+GQGLAFI+AGGSVVQKNL
Sbjct: 121 CQPDAFLCSVHDVKVVNSVYQRHRARNIAAAGAEGVLRQRLSGQGLAFIIAGGSVVQKNL 180

Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
           EVGE +T+DVSCI A+T S+N QIKYN  P+RRAVFGGDN+VTA +TGPGIVFIQSLPFH
Sbjct: 181 EVGEFLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFH 240

Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RLSQRIAR+VTSPNMRENP+  VQI +F FLAYVVIVSSLILT++
Sbjct: 241 RLSQRIARSVTSPNMRENPRLLVQIGLFIFLAYVVIVSSLILTEM 285


>gi|89257604|gb|ABD65092.1| hypothetical protein 31.t00054 [Brassica oleracea]
          Length = 285

 Score =  447 bits (1150), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 216/285 (75%), Positives = 251/285 (88%), Gaps = 4/285 (1%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS +D V PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQEDTVTPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y PE EVG+ QW+ GK+++SV+LRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61  MENTYTPEQEVGVVQWVLGKSVSSVLLRNTGQTDGFVGIAAPSLARILPMDLAMFGGDIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNV-IGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
           CQPDAFLCSV+DVKV N++ Q  R RN+   G E FLRQ+L+GQGLAFI AGGSVVQKNL
Sbjct: 121 CQPDAFLCSVHDVKVVNSVYQRHRARNIAAAGAEVFLRQRLSGQGLAFITAGGSVVQKNL 180

Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
           EVGEV+T+DVSCI A+T  +N +I Y+  P+RRAVFGGDN+VTA +TGPGIVFIQSLPFH
Sbjct: 181 EVGEVLTIDVSCIAALTPPINFRINYHAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFH 240

Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RLSQRIAR+VTSPNMRENP+F VQI +F FLAYVVI SSLILT++
Sbjct: 241 RLSQRIARSVTSPNMRENPRFLVQIGLFLFLAYVVIASSLILTEM 285


>gi|359496492|ref|XP_002269493.2| PREDICTED: uncharacterized protein LOC100257929 [Vitis vinifera]
          Length = 293

 Score =  444 bits (1142), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 221/266 (83%), Positives = 249/266 (93%)

Query: 16  QSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ 75
           +SPQ+ V PFQILGGEAQ+ QIMLKPQEK++A+PGSMC+MSGS++MEN+YIPENEVGMWQ
Sbjct: 28  RSPQENVTPFQILGGEAQIVQIMLKPQEKIIAKPGSMCYMSGSIQMENVYIPENEVGMWQ 87

Query: 76  WLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKV 135
           WLFGK +TS+VL N G S+GFVGIAAPSLARILPIDLAMFGGE+LCQPDAFLCS+NDVKV
Sbjct: 88  WLFGKAVTSIVLLNQGSSNGFVGIAAPSLARILPIDLAMFGGEILCQPDAFLCSLNDVKV 147

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
           +NT DQR RN++ G EGFLRQKLTGQGLAFI+AGGSVVQKNLEVGE + VDVSCI A+++
Sbjct: 148 NNTTDQRARNIVVGAEGFLRQKLTGQGLAFIVAGGSVVQKNLEVGEELAVDVSCIAALSA 207

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENP 255
           +V+VQIKYNGP+RR VFGGDNLVTAV+TGPGIVFIQSLPF R SQRIARAVTSPNMRENP
Sbjct: 208 TVSVQIKYNGPMRRVVFGGDNLVTAVLTGPGIVFIQSLPFQRFSQRIARAVTSPNMRENP 267

Query: 256 KFFVQIAIFFFLAYVVIVSSLILTDV 281
           KFF+QIAIFFFLAYVVIVSSLILTDV
Sbjct: 268 KFFIQIAIFFFLAYVVIVSSLILTDV 293


>gi|296083496|emb|CBI23465.3| unnamed protein product [Vitis vinifera]
          Length = 281

 Score =  443 bits (1139), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 221/265 (83%), Positives = 248/265 (93%)

Query: 17  SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
           SPQ+ V PFQILGGEAQ+ QIMLKPQEK++A+PGSMC+MSGS++MEN+YIPENEVGMWQW
Sbjct: 17  SPQENVTPFQILGGEAQIVQIMLKPQEKIIAKPGSMCYMSGSIQMENVYIPENEVGMWQW 76

Query: 77  LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           LFGK +TS+VL N G S+GFVGIAAPSLARILPIDLAMFGGE+LCQPDAFLCS+NDVKV+
Sbjct: 77  LFGKAVTSIVLLNQGSSNGFVGIAAPSLARILPIDLAMFGGEILCQPDAFLCSLNDVKVN 136

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
           NT DQR RN++ G EGFLRQKLTGQGLAFI+AGGSVVQKNLEVGE + VDVSCI A++++
Sbjct: 137 NTTDQRARNIVVGAEGFLRQKLTGQGLAFIVAGGSVVQKNLEVGEELAVDVSCIAALSAT 196

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPK 256
           V+VQIKYNGP+RR VFGGDNLVTAV+TGPGIVFIQSLPF R SQRIARAVTSPNMRENPK
Sbjct: 197 VSVQIKYNGPMRRVVFGGDNLVTAVLTGPGIVFIQSLPFQRFSQRIARAVTSPNMRENPK 256

Query: 257 FFVQIAIFFFLAYVVIVSSLILTDV 281
           FF+QIAIFFFLAYVVIVSSLILTDV
Sbjct: 257 FFIQIAIFFFLAYVVIVSSLILTDV 281


>gi|255579647|ref|XP_002530664.1| conserved hypothetical protein [Ricinus communis]
 gi|223529797|gb|EEF31733.1| conserved hypothetical protein [Ricinus communis]
          Length = 290

 Score =  414 bits (1063), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 194/238 (81%), Positives = 219/238 (92%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           S+ F   + +S QDAVIPFQILGGEAQV QIMLKPQEKVVA+PGSMCFMSGS+EMEN ++
Sbjct: 47  SSKFSEVLKKSQQDAVIPFQILGGEAQVVQIMLKPQEKVVAKPGSMCFMSGSIEMENTFL 106

Query: 67  PENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           PENEVGMWQWLFGKT+TS+VLRN GPSDGF+GIAAPSLARILPIDLAMFGGE+LCQPDAF
Sbjct: 107 PENEVGMWQWLFGKTVTSIVLRNSGPSDGFIGIAAPSLARILPIDLAMFGGEILCQPDAF 166

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LCSVNDVKVS+T+D R RNV+   EGFLRQKL+GQGLAF++ GGSV+QKNLEVGEV++VD
Sbjct: 167 LCSVNDVKVSSTVDYRARNVMASAEGFLRQKLSGQGLAFVVGGGSVIQKNLEVGEVLSVD 226

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           VSCIVAV ++VNVQ+KYNGP+RRAVFGGDNLVTA +TGPGIVFIQSLPF R SQRIAR
Sbjct: 227 VSCIVAVNTTVNVQVKYNGPMRRAVFGGDNLVTATLTGPGIVFIQSLPFQRFSQRIAR 284


>gi|89257554|gb|ABD65044.1| hypothetical protein 26.t00103 [Brassica oleracea]
          Length = 274

 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 206/282 (73%), Positives = 240/282 (85%), Gaps = 12/282 (4%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARP-GSMCFMSGSM 59
           M APFFSTPFQPYVYQS QD V PFQILGGEAQV Q     +E+V+A+P GSMC+MSGS+
Sbjct: 1   MPAPFFSTPFQPYVYQSQQDTVTPFQILGGEAQVVQ----SEERVIAKPVGSMCYMSGSV 56

Query: 60  EMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGEL 119
           EMEN Y  + +VG  QW+FGK++TSVVLRN  P+DGFVG+AA   ARILPIDLAMFGGE+
Sbjct: 57  EMENTY--KQQVGGLQWIFGKSVTSVVLRNTAPNDGFVGVAAHYFARILPIDLAMFGGEI 114

Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIG-GIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           LCQPDAFLCSV+DVKV N+ D R RN++  G E FLRQ+L+GQGLAFILAGGSV    LE
Sbjct: 115 LCQPDAFLCSVSDVKVVNSFDPRARNIVAAGAEVFLRQRLSGQGLAFILAGGSV----LE 170

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
           VGEV+T+DVSCI A+T SV+VQIKYN P+RRAVFGGDN+V A +TGPGIVFIQSLPFHRL
Sbjct: 171 VGEVLTIDVSCIAALTPSVDVQIKYNTPLRRAVFGGDNVVMATLTGPGIVFIQSLPFHRL 230

Query: 239 SQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTD 280
           SQRIAR+VTSPNMRENP+  +QIA+F FLAYVVI+SSL+LT+
Sbjct: 231 SQRIARSVTSPNMRENPRLMIQIALFAFLAYVVILSSLLLTE 272


>gi|238480800|ref|NP_001154247.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
           thaliana]
 gi|332658489|gb|AEE83889.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
           thaliana]
          Length = 273

 Score =  400 bits (1028), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 192/261 (73%), Positives = 223/261 (85%), Gaps = 12/261 (4%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61  MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           CQPDAFLCSV+DVKV NT+ Q  R RN+              +GLAFI+AGGSVVQKNLE
Sbjct: 121 CQPDAFLCSVHDVKVVNTVYQRHRARNIAAA---------GAEGLAFIIAGGSVVQKNLE 171

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
           VGEV+T+DVSCI A+T S+N QIKYN  P+RRAVFGGDN+VTA +TGPGIVFIQSLPFHR
Sbjct: 172 VGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFHR 231

Query: 238 LSQRIARAVTSPNMRENPKFF 258
           LSQRIAR+VTSPNMRENP+  
Sbjct: 232 LSQRIARSVTSPNMRENPRLL 252


>gi|357115799|ref|XP_003559673.1| PREDICTED: uncharacterized protein LOC100838340 [Brachypodium
           distachyon]
          Length = 282

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 187/282 (66%), Positives = 232/282 (82%), Gaps = 1/282 (0%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS Q +V  FQI GG+ QV Q+M+K  EK+  +PG+MC+MSG+++
Sbjct: 1   MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVVQVMVKSHEKLTVKPGTMCYMSGNIQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y+ EN+ GMWQW+FGK+ITS V  NPGP DG+VGI+AP   RILPIDLA FGGELL
Sbjct: 61  MDNNYLHENDGGMWQWIFGKSITSTVFFNPGPDDGYVGISAPFPGRILPIDLANFGGELL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
            Q DAFLCSVNDV V++T DQR RN   G E  L+QKL GQG+AFI+ GGSV+QK L   
Sbjct: 121 SQADAFLCSVNDVSVTSTADQRPRNNEIGAEMILKQKLRGQGMAFIVGGGSVMQKILAPR 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EVITVD +CIVA+T+++N Q+K +  +RRAVFGGDN +TA +TGPG+VFIQSLPF RLSQ
Sbjct: 181 EVITVDAACIVAMTTTINFQLKNHTQLRRAVFGGDNQLTASLTGPGVVFIQSLPFTRLSQ 240

Query: 241 RIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIA R++ +P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 RIASRSMAAPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 282


>gi|212724012|ref|NP_001132812.1| uncharacterized protein LOC100194302 [Zea mays]
 gi|194695462|gb|ACF81815.1| unknown [Zea mays]
 gi|414872762|tpg|DAA51319.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
          Length = 282

 Score =  386 bits (992), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 184/282 (65%), Positives = 231/282 (81%), Gaps = 1/282 (0%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS Q +V  FQI GG+ QV Q+MLK QEK+ A+PG+MC+MSG+M+
Sbjct: 1   MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVLQVMLKSQEKLTAKPGAMCYMSGNMQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y+PEN+ G+WQW+FGK ++S +  N G  DG+VGIAAP   RILP+DL  F GELL
Sbjct: 61  MDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFPGRILPVDLTNFSGELL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQ DAFLCSVNDV VS+T++ R RN+  G E  L+QKL GQG+AF++ GGSV+QK L   
Sbjct: 121 CQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQKLRGQGMAFLVGGGSVMQKILAPR 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EVITVD +CIVA+++++N Q+K    +RRAVFGGDN +TA +TGPG+VFIQSLPFHRLSQ
Sbjct: 181 EVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQLTASLTGPGVVFIQSLPFHRLSQ 240

Query: 241 RIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIA R+V  P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 RIASRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 282


>gi|414872761|tpg|DAA51318.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
          Length = 283

 Score =  386 bits (991), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 184/283 (65%), Positives = 231/283 (81%), Gaps = 2/283 (0%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS Q +V  FQI GG+ QV Q+MLK QEK+ A+PG+MC+MSG+M+
Sbjct: 1   MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVLQVMLKSQEKLTAKPGAMCYMSGNMQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y+PEN+ G+WQW+FGK ++S +  N G  DG+VGIAAP   RILP+DL  F GELL
Sbjct: 61  MDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFPGRILPVDLTNFSGELL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQ DAFLCSVNDV VS+T++ R RN+  G E  L+QKL GQG+AF++ GGSV+QK L   
Sbjct: 121 CQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQKLRGQGMAFLVGGGSVMQKILAPR 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EVITVD +CIVA+++++N Q+K    +RRAVFGGDN +TA +TGPG+VFIQSLPFHRLSQ
Sbjct: 181 EVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQLTASLTGPGVVFIQSLPFHRLSQ 240

Query: 241 RIA--RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIA  R+V  P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 RIASSRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 283


>gi|89257470|gb|ABD64961.1| hypothetical protein 25.t00068 [Brassica oleracea]
          Length = 1161

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 187/258 (72%), Positives = 217/258 (84%), Gaps = 16/258 (6%)

Query: 13   YVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG 72
            +V QS QD + PFQILGGEAQV QIMLK +E+V+A+PGSMC+MSGS+EMEN Y PE EVG
Sbjct: 843  HVMQSQQDTITPFQILGGEAQVVQIMLKSEERVIAKPGSMCYMSGSVEMENTYTPEQEVG 902

Query: 73   MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + QW+ GK+++SVVLRN G +DGFVGIAAP LARILPIDLAMFGGE+LCQPDAFLCSVND
Sbjct: 903  VLQWILGKSVSSVVLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEILCQPDAFLCSVND 962

Query: 133  VKVSNTLDQRVRN-VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            VKV N++DQR RN V+ G EGFLRQ+L+GQGLAFILAGGSV    LEVGEV+ +DVSCI 
Sbjct: 963  VKVVNSVDQRARNIVVAGSEGFLRQRLSGQGLAFILAGGSV----LEVGEVLNIDVSCIA 1018

Query: 192  AVTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA------- 243
            A+T S++VQIKY N P+RRAVFGGDN+V A +TGPGIVFIQSLPFHRLSQRIA       
Sbjct: 1019 ALTPSIDVQIKYNNAPLRRAVFGGDNVVMATLTGPGIVFIQSLPFHRLSQRIASFVVTKA 1078

Query: 244  ---RAVTSPNMRENPKFF 258
               R+VTSPNMRENP+ F
Sbjct: 1079 FGDRSVTSPNMRENPRCF 1096


>gi|115455313|ref|NP_001051257.1| Os03g0747300 [Oryza sativa Japonica Group]
 gi|14626285|gb|AAK71553.1|AC087852_13 unknown protein [Oryza sativa Japonica Group]
 gi|108711062|gb|ABF98857.1| HTH DNA-binding protein, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113549728|dbj|BAF13171.1| Os03g0747300 [Oryza sativa Japonica Group]
 gi|125545710|gb|EAY91849.1| hypothetical protein OsI_13494 [Oryza sativa Indica Group]
          Length = 282

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 181/282 (64%), Positives = 231/282 (81%), Gaps = 1/282 (0%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS + +V  FQI GG+ QV Q+M+K QEK+  +PG+MC+MSG+++
Sbjct: 1   MAAPFFSTPFQPYVYQSQEGSVTAFQISGGDVQVLQVMVKSQEKLTVKPGTMCYMSGNIQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
            +N Y+PEN+ G+WQW+FGK+I+S V  NPG  DG+VGI+AP   RILP+DLA FGGELL
Sbjct: 61  TDNNYLPENDGGVWQWIFGKSISSSVFFNPGSDDGYVGISAPFPGRILPMDLANFGGELL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQ DAFLCSVNDV V++T++QR RN+  G E  L+QKL GQG+AF++ GGSV+QK L   
Sbjct: 121 CQADAFLCSVNDVSVTSTVEQRPRNIEIGAEVILKQKLRGQGMAFLVGGGSVMQKILAPR 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           EVITVD +CIVA+T+++N Q+K     RR VF G N +TA +TGPG+VFIQSLPFHRLSQ
Sbjct: 181 EVITVDAACIVAMTTTINFQLKTPNQPRRVVFSGGNQLTASLTGPGVVFIQSLPFHRLSQ 240

Query: 241 RIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           RIA R+V +P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 RIASRSVAAPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 282


>gi|238480798|ref|NP_001154246.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
           thaliana]
 gi|332658488|gb|AEE83888.1| Tryptophan RNA-binding attenuator protein-like protein [Arabidopsis
           thaliana]
          Length = 267

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 182/247 (73%), Positives = 211/247 (85%), Gaps = 12/247 (4%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP+DLAMFGG++L
Sbjct: 61  MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILPLDLAMFGGDIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQ--RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           CQPDAFLCSV+DVKV NT+ Q  R RN+              +GLAFI+AGGSVVQKNLE
Sbjct: 121 CQPDAFLCSVHDVKVVNTVYQRHRARNIAAA---------GAEGLAFIIAGGSVVQKNLE 171

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
           VGEV+T+DVSCI A+T S+N QIKYN  P+RRAVFGGDN+VTA +TGPGIVFIQSLPFHR
Sbjct: 172 VGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGGDNVVTATLTGPGIVFIQSLPFHR 231

Query: 238 LSQRIAR 244
           LSQRIAR
Sbjct: 232 LSQRIAR 238


>gi|326507602|dbj|BAK03194.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326509737|dbj|BAJ87084.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 284

 Score =  357 bits (916), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 184/284 (64%), Positives = 233/284 (82%), Gaps = 3/284 (1%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS Q +V  FQI GG+ QV Q+M+K Q+K+ A+PG+MC+MSG+++
Sbjct: 1   MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVLQVMVKSQDKLTAKPGTMCYMSGNIQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y+PEN+ G+WQW+FGK+ITS V  NPGP DG+VGI+AP   RILPIDLA FGGELL
Sbjct: 61  MDNNYLPENDGGVWQWIFGKSITSTVFFNPGPDDGYVGISAPFPGRILPIDLANFGGELL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQ D+FLCSVNDV V++T+DQR RN+  G E  L+QKL GQG+AF++ GGSV+QK L   
Sbjct: 121 CQADSFLCSVNDVSVTSTVDQRPRNIEIGAEMILKQKLRGQGMAFLVGGGSVMQKILAPR 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRA--VFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
           EVITVD  CIVA+TS+++ Q++     RRA  VFGG N + A +TGPG+VFIQSLPF RL
Sbjct: 181 EVITVDCGCIVAMTSTISFQLQNPTQHRRAVLVFGGVNQLKASLTGPGVVFIQSLPFSRL 240

Query: 239 SQRIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           SQRIA R+V +P++R+NP+FF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 SQRIASRSVAAPSLRDNPRFFIQIIMFFFLAYVMIVSSIILTDV 284


>gi|125587909|gb|EAZ28573.1| hypothetical protein OsJ_12558 [Oryza sativa Japonica Group]
          Length = 283

 Score =  347 bits (891), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 169/283 (59%), Positives = 222/283 (78%), Gaps = 2/283 (0%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS + +V  FQI GG+ QV Q+M+K QEK+  +PG+MC+MSG+++
Sbjct: 1   MAAPFFSTPFQPYVYQSQEGSVTAFQISGGDVQVLQVMVKSQEKLTVKPGTMCYMSGNIQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
            +N Y+PEN+ G+WQW+FGK+I+S V  NPG  DG+VGI+AP   RILP+DLA FGGELL
Sbjct: 61  TDNNYLPENDGGVWQWIFGKSISSSVFFNPGSDDGYVGISAPFPGRILPMDLANFGGELL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQ DAFLCSVNDV V++T++QR RN+  G E  L+QKL GQG+AF++ GGSV+QK L   
Sbjct: 121 CQADAFLCSVNDVSVTSTVEQRPRNIEIGAEVILKQKLRGQGMAFLVGGGSVMQKILAPR 180

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVF-IQSLPFHRLS 239
           EVITVD +CIVA+T+++N Q+K     RR VF G N +TA +TGPG+VF  +  PF    
Sbjct: 181 EVITVDAACIVAMTTTINFQLKTPNQPRRVVFSGGNELTASLTGPGVVFHTEPNPFIGFQ 240

Query: 240 QRI-ARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
             + +R+V +P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 241 NALPSRSVAAPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 283


>gi|8809625|dbj|BAA97176.1| unnamed protein product [Arabidopsis thaliana]
          Length = 268

 Score =  343 bits (879), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 160/214 (74%), Positives = 188/214 (87%), Gaps = 1/214 (0%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS QD + PFQILGGE+QV QIMLK +EKV+A+P SMC+MSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQQDTITPFQILGGESQVVQIMLKSEEKVIAKPASMCYMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN Y PE EVG+ QW+ GK+++SVVLRN G +DGFVGIAAP LARILPIDLAMFGGE+L
Sbjct: 61  MENTYTPEQEVGVLQWILGKSVSSVVLRNTGQNDGFVGIAAPYLARILPIDLAMFGGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
           CQPDAFLCSV+DVKV N++DQR RN++  G EGFLRQ+L+GQGLAFILAGGSVVQK LEV
Sbjct: 121 CQPDAFLCSVHDVKVVNSVDQRARNIVAAGAEGFLRQRLSGQGLAFILAGGSVVQKVLEV 180

Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
           GEV ++DVSCI A+T S++V+IK N P RRA+FG
Sbjct: 181 GEVFSIDVSCIAALTPSIDVRIKNNAPFRRALFG 214


>gi|449533339|ref|XP_004173633.1| PREDICTED: uncharacterized protein LOC101231218, partial [Cucumis
           sativus]
          Length = 207

 Score =  337 bits (863), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 168/207 (81%), Positives = 189/207 (91%)

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
            WLFGK++TS+VL+N G SDG+VGIAAPSLARILPID+AMFGGELLCQPDAFLCSVNDVK
Sbjct: 1   HWLFGKSVTSIVLQNSGTSDGYVGIAAPSLARILPIDMAMFGGELLCQPDAFLCSVNDVK 60

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           V+NT+D RVRNV+ G+EGFLRQKL+GQGLAFIL GGSVVQKNLEVGEVI VD+SCIVA+T
Sbjct: 61  VNNTIDHRVRNVVPGVEGFLRQKLSGQGLAFILGGGSVVQKNLEVGEVIAVDISCIVAMT 120

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           +S++VQIKYNGP+RRAVFGG+NLVTA +TGPGIVFIQSLPF RLSQRIARAVTSPNMRE 
Sbjct: 121 TSIDVQIKYNGPVRRAVFGGENLVTATLTGPGIVFIQSLPFQRLSQRIARAVTSPNMREI 180

Query: 255 PKFFVQIAIFFFLAYVVIVSSLILTDV 281
             F   +   FFLAYVVIVSS+ILTDV
Sbjct: 181 QSFSFSLHFSFFLAYVVIVSSIILTDV 207


>gi|195645802|gb|ACG42369.1| hypothetical protein [Zea mays]
 gi|414872765|tpg|DAA51322.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
          Length = 245

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 157/245 (64%), Positives = 201/245 (82%), Gaps = 1/245 (0%)

Query: 38  MLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFV 97
           MLK QEK+ A+PG+MC+MSG+M+M+N Y+PEN+ G+WQW+FGK ++S +  N G  DG+V
Sbjct: 1   MLKSQEKLTAKPGAMCYMSGNMQMDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYV 60

Query: 98  GIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQK 157
           GIAAP   RILP+DL  F GELLCQ DAFLCSVNDV VS+T++ R RN+  G E  L+QK
Sbjct: 61  GIAAPFPGRILPVDLTNFSGELLCQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQK 120

Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
           L GQG+AF++ GGSV+QK L   EVITVD +CIVA+++++N Q+K    +RRAVFGGDN 
Sbjct: 121 LRGQGMAFLVGGGSVMQKILAPREVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQ 180

Query: 218 VTAVVTGPGIVFIQSLPFHRLSQRIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSL 276
           +TA +TGPG+VFIQSLPFHRLSQRIA R+V  P++R+NPKFF+QI +FFFLAYV+IVSS+
Sbjct: 181 LTASLTGPGVVFIQSLPFHRLSQRIASRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSSI 240

Query: 277 ILTDV 281
           ILTDV
Sbjct: 241 ILTDV 245


>gi|414872764|tpg|DAA51321.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
          Length = 246

 Score =  331 bits (849), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 157/246 (63%), Positives = 201/246 (81%), Gaps = 2/246 (0%)

Query: 38  MLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFV 97
           MLK QEK+ A+PG+MC+MSG+M+M+N Y+PEN+ G+WQW+FGK ++S +  N G  DG+V
Sbjct: 1   MLKSQEKLTAKPGAMCYMSGNMQMDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYV 60

Query: 98  GIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQK 157
           GIAAP   RILP+DL  F GELLCQ DAFLCSVNDV VS+T++ R RN+  G E  L+QK
Sbjct: 61  GIAAPFPGRILPVDLTNFSGELLCQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQK 120

Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
           L GQG+AF++ GGSV+QK L   EVITVD +CIVA+++++N Q+K    +RRAVFGGDN 
Sbjct: 121 LRGQGMAFLVGGGSVMQKILAPREVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQ 180

Query: 218 VTAVVTGPGIVFIQSLPFHRLSQRIA--RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSS 275
           +TA +TGPG+VFIQSLPFHRLSQRIA  R+V  P++R+NPKFF+QI +FFFLAYV+IVSS
Sbjct: 181 LTASLTGPGVVFIQSLPFHRLSQRIASSRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSS 240

Query: 276 LILTDV 281
           +ILTDV
Sbjct: 241 IILTDV 246


>gi|215700953|dbj|BAG92377.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 245

 Score =  325 bits (833), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 155/245 (63%), Positives = 201/245 (82%), Gaps = 1/245 (0%)

Query: 38  MLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFV 97
           M+K QEK+  +PG+MC+MSG+++ +N Y+PEN+ G+WQW+FGK+I+S V  NPG  DG+V
Sbjct: 1   MVKSQEKLTVKPGTMCYMSGNIQTDNNYLPENDGGVWQWIFGKSISSSVFFNPGSDDGYV 60

Query: 98  GIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQK 157
           GI+AP   RILP+DLA FGGELLCQ DAFLCSVNDV V++T++QR RN+  G E  L+QK
Sbjct: 61  GISAPFPGRILPMDLANFGGELLCQADAFLCSVNDVSVTSTVEQRPRNIEIGAEVILKQK 120

Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
           L GQG+AF++ GGSV+QK L   EVITVD +CIVA+T+++N Q+K     RR VF G N 
Sbjct: 121 LRGQGMAFLVGGGSVMQKILAPREVITVDAACIVAMTTTINFQLKTPNQPRRVVFSGGNQ 180

Query: 218 VTAVVTGPGIVFIQSLPFHRLSQRIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSL 276
           +TA +TGPG+VFIQSLPFHRLSQRIA R+V +P++R+NPKFF+QI +FFFLAYV+IVSS+
Sbjct: 181 LTASLTGPGVVFIQSLPFHRLSQRIASRSVAAPSLRDNPKFFIQIVMFFFLAYVMIVSSI 240

Query: 277 ILTDV 281
           ILTDV
Sbjct: 241 ILTDV 245


>gi|302761006|ref|XP_002963925.1| hypothetical protein SELMODRAFT_230257 [Selaginella moellendorffii]
 gi|300167654|gb|EFJ34258.1| hypothetical protein SELMODRAFT_230257 [Selaginella moellendorffii]
          Length = 284

 Score =  315 bits (806), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 158/284 (55%), Positives = 205/284 (72%), Gaps = 5/284 (1%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAV-IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSM 59
           MAAP  STPFQPYVYQS Q A    FQI G EAQ+ QIMLKPQEK+ A+ GSMC+ SGS+
Sbjct: 1   MAAPASSTPFQPYVYQSEQSAAATAFQIFGFEAQILQIMLKPQEKIQAKSGSMCYTSGSI 60

Query: 60  EMENIYIPENE--VGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG 117
           + E + +P+    VG+ QW+ G+  TS    N G  DG++G+ APSLARILPIDLA+FGG
Sbjct: 61  QAE-VNVPQENAGVGVLQWMMGRPDTSTTFTNAGSEDGYIGLGAPSLARILPIDLAVFGG 119

Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
           E++CQ +AFL SVNDV V +   +RVR    G +G L QKL GQGLAFI AGGSV+QK L
Sbjct: 120 EIICQREAFLGSVNDVTVKHLTPRRVRVGYFGADGLLTQKLVGQGLAFITAGGSVLQKKL 179

Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYNGP-IRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
             GE++ VD+ C+VA+TS+V  +IK   P  RRA FGG+    A +TGPG+VFIQSLP  
Sbjct: 180 TRGELLVVDIGCLVAMTSAVECEIKQASPAARRAFFGGECPFQAHLTGPGVVFIQSLPSQ 239

Query: 237 RLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTD 280
           RL+ +IAR +++P+ R+NP+FF+   +F FLAY +I +SL+L +
Sbjct: 240 RLAHKIARTISAPHFRDNPRFFISAGVFLFLAYAMIFASLLLAE 283


>gi|302769153|ref|XP_002967996.1| hypothetical protein SELMODRAFT_440175 [Selaginella moellendorffii]
 gi|300164734|gb|EFJ31343.1| hypothetical protein SELMODRAFT_440175 [Selaginella moellendorffii]
          Length = 286

 Score =  311 bits (796), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 159/286 (55%), Positives = 205/286 (71%), Gaps = 7/286 (2%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAV-IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSM 59
           MAAP  STPFQPYVYQS Q A    FQI G EAQV QIMLKPQEK+ A+ GSMC+ SGS+
Sbjct: 1   MAAPASSTPFQPYVYQSEQSAAATAFQIFGFEAQVLQIMLKPQEKIQAKSGSMCYTSGSI 60

Query: 60  EMENIYIPENE--VGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG 117
           + E + +P+    VG+ QW+ G+  TS    N G  DG++G+ APSLARILPIDLA+FGG
Sbjct: 61  QAE-VNVPQENAGVGVLQWMMGRPDTSTTFTNAGSEDGYIGLGAPSLARILPIDLAVFGG 119

Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE--GFLRQKLTGQGLAFILAGGSVVQK 175
           E++CQ +AFL SVNDV V +   +RVR    G +  G L QKL GQGLAFI AGGSV+QK
Sbjct: 120 EIICQREAFLGSVNDVTVKHLTPRRVRVGYFGADVGGLLTQKLVGQGLAFITAGGSVLQK 179

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGP-IRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
            L  GE++ VD+ C+VA+TS+V  +IK   P  RRA FGG+    A +TGPG+VFIQSLP
Sbjct: 180 KLSRGELLVVDIGCLVAMTSAVECEIKQASPAARRAFFGGECPFQAHLTGPGVVFIQSLP 239

Query: 235 FHRLSQRIARAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTD 280
             RL+ +IAR +++P+ R+NP+FF+   +F FLAY +I +SL+L +
Sbjct: 240 SQRLAHKIARTISAPHFRDNPRFFISAGVFLFLAYAMIFASLLLAE 285


>gi|414872766|tpg|DAA51323.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
          Length = 232

 Score =  310 bits (794), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 148/232 (63%), Positives = 189/232 (81%), Gaps = 2/232 (0%)

Query: 52  MCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPID 111
           MC+MSG+M+M+N Y+PEN+ G+WQW+FGK ++S +  N G  DG+VGIAAP   RILP+D
Sbjct: 1   MCYMSGNMQMDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFPGRILPVD 60

Query: 112 LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGS 171
           L  F GELLCQ DAFLCSVNDV VS+T++ R RN+  G E  L+QKL GQG+AF++ GGS
Sbjct: 61  LTNFSGELLCQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQKLRGQGMAFLVGGGS 120

Query: 172 VVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQ 231
           V+QK L   EVITVD +CIVA+++++N Q+K    +RRAVFGGDN +TA +TGPG+VFIQ
Sbjct: 121 VMQKILAPREVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQLTASLTGPGVVFIQ 180

Query: 232 SLPFHRLSQRIA--RAVTSPNMRENPKFFVQIAIFFFLAYVVIVSSLILTDV 281
           SLPFHRLSQRIA  R+V  P++R+NPKFF+QI +FFFLAYV+IVSS+ILTDV
Sbjct: 181 SLPFHRLSQRIASSRSVAGPSLRDNPKFFIQIVMFFFLAYVMIVSSIILTDV 232


>gi|2245101|emb|CAB10523.1| hypothetical protein [Arabidopsis thaliana]
 gi|7268494|emb|CAB78745.1| hypothetical protein [Arabidopsis thaliana]
          Length = 270

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 143/241 (59%), Positives = 170/241 (70%), Gaps = 35/241 (14%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS +D + PFQILGGEAQV QIMLKPQEKV+A+PGSMC+MSGS+E
Sbjct: 1   MAAPFFSTPFQPYVYQSQEDTITPFQILGGEAQVVQIMLKPQEKVIAKPGSMCYMSGSIE 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           M+N Y PE EVG+ QW+ GK+++S+VLRN G +DGFVGIAAPSLARILP           
Sbjct: 61  MDNTYTPEQEVGVVQWILGKSVSSIVLRNTGQNDGFVGIAAPSLARILP----------- 109

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRN------------VIGGIEG---------FLRQKLT 159
             PDAFLCSV+DVKV NT+ QR R             V+GG E          + R +L 
Sbjct: 110 --PDAFLCSVHDVKVVNTVYQRHRARNIAAAGAERVVVVGGSETKAFWSGSCFYYRWRLW 167

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN-GPIRRAVFGGDNLV 218
              L  +     VVQKNLEVGEV+T+DVSCI A+T S+N QIKYN  P+RRAVFG D ++
Sbjct: 168 PFKLILLHLLHKVVQKNLEVGEVLTIDVSCIAALTPSINFQIKYNAAPVRRAVFGLDPMM 227

Query: 219 T 219
           T
Sbjct: 228 T 228


>gi|195648342|gb|ACG43639.1| hypothetical protein [Zea mays]
          Length = 215

 Score =  244 bits (622), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 114/189 (60%), Positives = 150/189 (79%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+MC+MSG+M+M+N Y+PEN+ G+WQW+FGK ++S +  N G  DG+VGIAAP  
Sbjct: 17  LTAKPGTMCYMSGNMQMDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFP 76

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
            RILP+DL  F GELLCQ DAFLCSVNDV VS+T++ R RN+  G E  L+QKL GQG+A
Sbjct: 77  GRILPVDLTNFSGELLCQADAFLCSVNDVSVSSTVEPRPRNIEIGAEMILKQKLRGQGMA 136

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           F++ GGSV+QK L   EVITVD +CIVA+++++N Q+K    +RRAVFGGDN +TA +TG
Sbjct: 137 FLVGGGSVMQKILAPREVITVDAACIVAMSATINFQLKSPNQLRRAVFGGDNQLTASLTG 196

Query: 225 PGIVFIQSL 233
           PG+VFIQSL
Sbjct: 197 PGVVFIQSL 205


>gi|147839862|emb|CAN63757.1| hypothetical protein VITISV_006491 [Vitis vinifera]
          Length = 289

 Score =  243 bits (621), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 114/161 (70%), Positives = 135/161 (83%), Gaps = 13/161 (8%)

Query: 11  QPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE 70
           + ++ +SPQ+ V PFQILGGEAQ+ QIMLKPQEK++A+PGSMC+MSGS++MEN+YIPENE
Sbjct: 131 EKHLVKSPQENVTPFQILGGEAQIVQIMLKPQEKIIAKPGSMCYMSGSIQMENVYIPENE 190

Query: 71  VGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           VGMWQWLFGK +TS+VL N G S+GFVGIAAPSLARILP             PDAFLCS+
Sbjct: 191 VGMWQWLFGKAVTSIVLLNQGSSNGFVGIAAPSLARILP-------------PDAFLCSL 237

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGS 171
           NDVKV+NT DQR RN++ G EGFLRQKLTGQGLAFI+AGGS
Sbjct: 238 NDVKVNNTTDQRARNIVVGAEGFLRQKLTGQGLAFIVAGGS 278


>gi|225468541|ref|XP_002270085.1| PREDICTED: uncharacterized protein LOC100251350 [Vitis vinifera]
          Length = 161

 Score =  242 bits (617), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 112/145 (77%), Positives = 125/145 (86%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVY+SPQ+ V PFQILGG AQ+ QIMLKPQEK++A+ GS+C+MSGS +
Sbjct: 1   MAAPFFSTPFQPYVYRSPQENVTPFQILGGAAQMVQIMLKPQEKIIAKLGSICYMSGSTQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL 120
           MEN+YIPEN+VGMWQW FGK  TSVVL N G  DGFVGI APSLARI PID AMF GE+L
Sbjct: 61  MENVYIPENQVGMWQWAFGKAATSVVLLNQGSGDGFVGIVAPSLARIFPIDSAMFDGEIL 120

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRN 145
           CQPDAFLCS+NDVKVSNT DQ  RN
Sbjct: 121 CQPDAFLCSLNDVKVSNTTDQGARN 145


>gi|406886961|gb|EKD33877.1| hypothetical protein ACD_75C02564G0002 [uncultured bacterium]
          Length = 332

 Score =  184 bits (467), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 94/239 (39%), Positives = 142/239 (59%), Gaps = 11/239 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           I FQI+G E Q  +I L  QE  VA  GSM +MS +++M  ++   +E          +G
Sbjct: 73  IDFQIIGSEMQYVEIELDSQESAVAEAGSMMYMSDNIQMHTVFGDGSESAGAGFLDRMLG 132

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+ + S +    GP  G V  AAP   +I+ +DL  +GG+L+CQ DAFLC+  
Sbjct: 133 AGKRLITGEGLFSTIFTYQGPGKGKVAFAAPYPGKIIALDLKEYGGKLVCQKDAFLCAAK 192

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ QKL G G  F+ AGG+V+QK+L+ GE + VD  C+V
Sbjct: 193 GVAIGVEFQKKIGTALFGGEGFIMQKLEGDGRVFVHAGGTVIQKDLQAGETMRVDTGCLV 252

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           A+T ++N  I++ G ++ A+FGG+    A +TGPG  ++QSLPF RL+ RI RA+   N
Sbjct: 253 AMTQNINYDIEFVGGVKSAIFGGEGFFFATITGPGHAWLQSLPFSRLAGRIHRAMPQNN 311


>gi|406888983|gb|EKD35298.1| hypothetical protein ACD_75C01963G0002 [uncultured bacterium]
          Length = 332

 Score =  182 bits (462), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 98/246 (39%), Positives = 143/246 (58%), Gaps = 12/246 (4%)

Query: 16  QSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----- 70
           QS  D  I +QI G E Q  +I L PQE  VA  GSM +MS S++M  I+   +E     
Sbjct: 67  QSGADE-IDYQIFGNEMQYVEIELDPQESAVAEAGSMMYMSDSIQMHTIFGDGSESSGAG 125

Query: 71  -----VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPD 124
                +G  + L  G+ + S V    G   G V  AAP   +I+ +DL  +GG+L+CQ D
Sbjct: 126 FLDKMLGAGKRLITGEGLFSTVFTYQGNGKGKVAFAAPYPGKIIALDLKDYGGKLVCQKD 185

Query: 125 AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVIT 184
           AFLC+   V +     +++   + G EGF+ QKL G G  F+ AGG++VQK+L+ GE + 
Sbjct: 186 AFLCAAKGVSIGVEFQKKIGTALFGGEGFIMQKLEGNGRIFVHAGGTIVQKDLQPGETMR 245

Query: 185 VDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           +D  C+VA+T ++N  I++ G ++ A+FGG+    A +TGPG  ++QSLPF RL+ RI R
Sbjct: 246 IDTGCLVAMTQNINYDIEFVGGVKSAIFGGEGFFFATITGPGHAWLQSLPFSRLAGRIHR 305

Query: 245 AVTSPN 250
           A    N
Sbjct: 306 AAPQNN 311


>gi|410672294|ref|YP_006924665.1| protein of unknown function DUF124 [Methanolobus psychrophilus R15]
 gi|409171422|gb|AFV25297.1| protein of unknown function DUF124 [Methanolobus psychrophilus R15]
          Length = 250

 Score =  179 bits (455), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 94/234 (40%), Positives = 142/234 (60%), Gaps = 4/234 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT- 81
           I +QI+G + Q+ +I L P E V A  G+M +M   ++ME         G+ + L G++ 
Sbjct: 5   IDYQIIGNDMQLVEIELDPGESVRAEAGAMMYMGPGIKMETSTGGGLLKGLKRTLTGESF 64

Query: 82  -ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
            ITS V  + G   G+V   AP   +I+P+DL   GG  LCQ D+FLC+   +++   L 
Sbjct: 65  FITSYV--HEGTGKGYVAFGAPYPGKIIPLDLKEVGGRFLCQKDSFLCAAKGIEIEVALT 122

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++   + G EGF+ Q+LTG GLAF+ AGG++++K+L+ GEVI VD  CIVA +  V+  
Sbjct: 123 KKIGAGLFGGEGFILQRLTGDGLAFVHAGGTIIKKDLQAGEVIRVDTGCIVAFSEKVDYN 182

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           I+  G  + A+FGG+ LV A + GPG V++QSLPF RL+ RI  A +     E+
Sbjct: 183 IQLVGGFKNALFGGEGLVLATLKGPGTVYLQSLPFSRLADRIMAAASYQRRDES 236


>gi|366162605|ref|ZP_09462360.1| hypothetical protein AcelC_02941 [Acetivibrio cellulolyticus CD2]
          Length = 264

 Score =  179 bits (455), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 95/244 (38%), Positives = 141/244 (57%), Gaps = 12/244 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I ++I+G E Q+ +I L PQE V+A  GSM +M   ++ME I+   +E      L GK +
Sbjct: 8   IDYEIVGSEMQLVEIELDPQESVIAEAGSMMYMESGIQMETIFGDGSEKSNSGGLMGKFL 67

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           T+             +  N G +      +AP   +I+P+DL  F  EL+CQ D+FLC+ 
Sbjct: 68  TAGKRLLTGESLFMTIFTNCGSTKQKAAFSAPYPGKIIPMDLKQFNEELICQKDSFLCAA 127

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V    ++++   + G EGF+ QKL G GLAF+ AGG++V+K L  GE + +D  C+
Sbjct: 128 KGVSVGIAFNKKIGAGLFGGEGFIMQKLEGDGLAFVHAGGTIVRKELRPGETLRLDTGCL 187

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           VA+T +VN  I+    I+ A+FGG+ L  A +TGPG V++QSLPF RL+ RI  A  S +
Sbjct: 188 VALTRTVNYDIQLAKGIKSAIFGGEGLFLATLTGPGTVWLQSLPFSRLAGRIISASPSKS 247

Query: 251 MREN 254
             E 
Sbjct: 248 RDEG 251


>gi|373459159|ref|ZP_09550926.1| protein of unknown function DUF124 [Caldithrix abyssi DSM 13497]
 gi|371720823|gb|EHO42594.1| protein of unknown function DUF124 [Caldithrix abyssi DSM 13497]
          Length = 252

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 94/227 (41%), Positives = 140/227 (61%), Gaps = 4/227 (1%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI ++ILG E Q+ +I L P E V A  G+M +M   +EM+         G  + + G++
Sbjct: 4   VIDYRILGDEMQIVEIELDPGEGVRAEAGAMLYMEDGIEMQTGTGGGIFKGFKRMITGES 63

Query: 82  --ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
             IT+ + R  G   G V  AAP   +I+P+DL  FGG +LCQ DAFLC+   +++    
Sbjct: 64  FFITTFLYRGQGK--GLVSFAAPYPGKIIPVDLTQFGGRILCQKDAFLCAAAGIEIEVAF 121

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            +R+   + G EGF+ Q+L G G+AFI AGG+V++KNL  GE + VD  C+VA+  +V+ 
Sbjct: 122 TKRLGAGLFGGEGFILQELRGDGMAFIHAGGTVIEKNLAAGERLRVDTGCLVALAPTVDY 181

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            I++ G  + A+FGG+ L  AV+ GPG V++QSLPF RL+ RI+ A 
Sbjct: 182 DIQFIGGFKNALFGGEGLFLAVLRGPGTVYLQSLPFSRLADRISAAA 228


>gi|332527627|ref|ZP_08403674.1| hypothetical protein RBXJA2T_16832 [Rubrivivax benzoatilyticus JA2]
 gi|332112030|gb|EGJ12007.1| hypothetical protein RBXJA2T_16832 [Rubrivivax benzoatilyticus JA2]
          Length = 269

 Score =  178 bits (451), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 95/233 (40%), Positives = 134/233 (57%), Gaps = 13/233 (5%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           V+ F+I G E Q  +I L P E  V   GSM FM   + M+ ++  +        LFGK 
Sbjct: 3   VVDFEIKGSEMQFVEIELDPGEAAVGEAGSMMFMDAGITMDTVF-GDGRQNAGGGLFGKL 61

Query: 82  IT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           ++            + V  N G     V  AAP   +ILP+DL+  GG L+CQ DAFLC+
Sbjct: 62  LSAGKRLVTGESLFTTVYTNQGAGKQRVAFAAPYPGKILPMDLSKMGGTLICQKDAFLCA 121

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +   L Q++     G EGF+ Q+L G GLAF+ AGG+VV++ L+ G+ + VD  C
Sbjct: 122 ARGVSLGIALQQKLSTGFFGGEGFIMQRLDGDGLAFVHAGGTVVRRELQAGQTLLVDTGC 181

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +VA T  V+ +I+Y G I+ A+FGG+ L  A +TGPG V++QSLPF RL+ RI
Sbjct: 182 VVAYTPGVDFEIQYVGKIKTALFGGEGLFLAKMTGPGTVWLQSLPFSRLASRI 234


>gi|383757289|ref|YP_005436274.1| hypothetical protein RGE_14340 [Rubrivivax gelatinosus IL144]
 gi|381377958|dbj|BAL94775.1| hypothetical protein RGE_14340 [Rubrivivax gelatinosus IL144]
          Length = 269

 Score =  177 bits (449), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 99/246 (40%), Positives = 138/246 (56%), Gaps = 14/246 (5%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           V+ F+I G E Q  +I L P E  V   GSM FM   + M+ ++  +        LFGK 
Sbjct: 3   VVDFEIKGSEMQFVEIELDPGEAAVGEAGSMMFMDAGITMDTVF-GDGRQNAGGGLFGKL 61

Query: 82  IT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           ++            + V  N G     V  AAP   +ILP+DL+  GG L+CQ DAFLC+
Sbjct: 62  LSAGKRLVTGESLFTTVYTNQGAGKLRVAFAAPYPGKILPMDLSKMGGTLICQKDAFLCA 121

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +   L Q++     G EGF+ Q+L G GLAF+ AGG+VV++ L+ G+ + VD  C
Sbjct: 122 ARGVSLGIALQQKLSTGFFGGEGFIMQRLDGDGLAFVHAGGTVVRRELQAGQTLLVDTGC 181

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTS 248
           +VA T  V+ +I+Y G I+ A+FGG+ L  A +TGPG V++QSLPF RL+ RI A A   
Sbjct: 182 VVAYTPGVDFEIQYVGKIKTALFGGEGLFLAKMTGPGTVWLQSLPFSRLASRIFAAAPQR 241

Query: 249 PNMREN 254
              RE 
Sbjct: 242 GGSREE 247


>gi|91772707|ref|YP_565399.1| hypothetical protein Mbur_0683 [Methanococcoides burtonii DSM 6242]
 gi|91711722|gb|ABE51649.1| protein of unknown function DUF124 [Methanococcoides burtonii DSM
           6242]
          Length = 259

 Score =  176 bits (447), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 134/234 (57%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I ++I G E Q+ +I L   E V A  G+M FM   ++M+         G+ +   G++ 
Sbjct: 5   IDYKIHGNEMQLVEIELDQGETVRAEAGAMMFMGDGIKMDTSTGGGLLKGLKRAFSGESF 64

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                   G   G V   AP   +I+PIDLA  GG  LCQ DAFLC+ N V ++    ++
Sbjct: 65  FITSFTQNGKGKGTVAFGAPYPGKIIPIDLASEGGSFLCQKDAFLCAANGVDITVAFSKK 124

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   + G EGF+ Q+L G GLAF+ AGG++++K L  GE + +D  C+VA + SVN  I+
Sbjct: 125 IGAGLFGGEGFILQRLKGNGLAFVHAGGTIIKKELAPGETLRIDTGCLVAFSESVNYDIE 184

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPK 256
           + G  + A+FGG+ +V A V+GPG V++QSLPF RL+ RIA A+   N     K
Sbjct: 185 FVGGFKNALFGGEGIVLATVSGPGTVYLQSLPFSRLADRIAAAIGPNNSSRGEK 238


>gi|294495704|ref|YP_003542197.1| hypothetical protein Mmah_1035 [Methanohalophilus mahii DSM 5219]
 gi|292666703|gb|ADE36552.1| protein of unknown function DUF124 [Methanohalophilus mahii DSM
           5219]
          Length = 252

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 89/224 (39%), Positives = 131/224 (58%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I + I+G + Q+ +I L P E V A  G+M +M   ++M+         G+ + + G++ 
Sbjct: 5   IDYDIIGNDMQLVEIELDPSESVRAEAGAMMYMGQDIKMQTSTGGGLFKGLKRMVTGESF 64

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                 N G     V   AP   +I+PIDL   GG  LCQ DAFLC+   ++V     ++
Sbjct: 65  FITSFTNEGDGKERVAFGAPYPGKIIPIDLTEVGGSFLCQKDAFLCAAKGIEVGIAFSKK 124

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   + G EGF+ Q+L G GLAF+ AGG++++K L+ GE + VD  C+VA + SVN  IK
Sbjct: 125 IGAGLFGGEGFILQRLEGDGLAFVHAGGTIIKKELDAGETMRVDTGCLVAFSESVNYDIK 184

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             G  R A+FGG+ +V A V+GPG V++QSLPF RL+ RIA A 
Sbjct: 185 LIGGFRNALFGGEGVVLATVSGPGTVYLQSLPFSRLADRIAAAT 228


>gi|193215179|ref|YP_001996378.1| hypothetical protein Ctha_1472 [Chloroherpeton thalassium ATCC
           35110]
 gi|193088656|gb|ACF13931.1| protein of unknown function DUF124 [Chloroherpeton thalassium ATCC
           35110]
          Length = 254

 Score =  174 bits (441), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 87/227 (38%), Positives = 135/227 (59%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           +I +++ G + Q+ ++ L PQE V A  G+M +M   ++M+         G  + L G++
Sbjct: 18  IIDYKLFGDDMQIVEVELDPQEGVRAEVGAMLYMEDGIQMQTDTGGGLFSGFKRMLTGES 77

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G     V  AAP   +I+P+DL   GGE LCQ D FLC  N V ++    +
Sbjct: 78  FFITSFVNQGHGKQRVAFAAPYPGKIVPLDLGQLGGEFLCQKDGFLCCANGVDINVAFTR 137

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           R+ + + G EGF+ Q+LTG GLAFI AGG+++++ L+ GE + VD  C+VA+  SV+  I
Sbjct: 138 RIGSGLFGGEGFILQRLTGDGLAFIHAGGTIIKRELQAGESLRVDTGCLVALAPSVDYDI 197

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
            + G  + A+FGG+ L  A +TGPG V++QSLPF R++ RI  ++ S
Sbjct: 198 NFVGGFKNALFGGEGLFLAKLTGPGTVYLQSLPFSRIADRILASLPS 244


>gi|407473686|ref|YP_006788086.1| hypothetical protein Curi_c12280 [Clostridium acidurici 9a]
 gi|407050194|gb|AFS78239.1| hypothetical protein DUF124 [Clostridium acidurici 9a]
          Length = 250

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 94/231 (40%), Positives = 138/231 (59%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENE--------VG 72
           I ++I G + Q  +I L PQE V+A  G+M +M   ++ME I+     NE        +G
Sbjct: 7   IDYKIYGDDMQFVEIELDPQESVIAEAGAMMYMDDKIQMETIFGDGSRNESRGLMGKLMG 66

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+++   V  N       V  AAP   +I+PIDLA  GGEL+CQ DAFLCS  
Sbjct: 67  AGKRLITGESLFMTVFSNVAQGKAHVAFAAPYPGKIIPIDLAQLGGELICQKDAFLCSAK 126

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +    ++++     G EGF+ QKL G G AF+ AGG++++K+L VGEV+ +D  C+V
Sbjct: 127 GVSIGIHFNKKIGTGFFGGEGFIMQKLEGDGFAFLHAGGTIIRKDLNVGEVLRLDTGCLV 186

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A+TS+V   I + G I+  +FGG+ L  A + GPG V++QSLPF R++ RI
Sbjct: 187 AMTSTVQYDISFTGDIKSGLFGGEGLFLATLKGPGSVWLQSLPFSRMADRI 237


>gi|387791094|ref|YP_006256159.1| hypothetical protein Solca_1923 [Solitalea canadensis DSM 3403]
 gi|379653927|gb|AFD06983.1| TIGR00266 family protein [Solitalea canadensis DSM 3403]
          Length = 261

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 97/242 (40%), Positives = 134/242 (55%), Gaps = 10/242 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I F+I G E Q  +I L PQE VVA  GS   M   ++ME I+    +   G    LF  
Sbjct: 7   IDFRIFGEELQCVEIELDPQETVVAESGSFMMMDQEIQMETIFGDGSQQNSGFMGKLFSA 66

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G S   V  AAP   RI+P+DL    G+L+CQ DAFLC+   
Sbjct: 67  GKRLLTGESLFMTAFTNQGQSKKHVTFAAPYPGRIIPMDLHRLDGKLICQKDAFLCAAKG 126

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ +KL G G+AF+ AGG VV++ L+ GEV+ VD  CIVA
Sbjct: 127 VSIGIEFQRKLGTGLFGGEGFIMEKLEGDGMAFVHAGGMVVERELQPGEVLKVDTGCIVA 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
            T  V+  I++ G I+ ++FGG+ L  A + GPG V+IQSLP  RL+ RI    T P   
Sbjct: 187 FTHGVDYDIQFVGGIKNSLFGGEGLFFATLRGPGKVWIQSLPISRLAGRILSLATGPRKE 246

Query: 253 EN 254
           E 
Sbjct: 247 EG 248


>gi|435851682|ref|YP_007313268.1| TIGR00266 family protein [Methanomethylovorans hollandica DSM
           15978]
 gi|433662312|gb|AGB49738.1| TIGR00266 family protein [Methanomethylovorans hollandica DSM
           15978]
          Length = 257

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 84/234 (35%), Positives = 138/234 (58%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           +I ++I+G + Q+ +I L P+E V A  G+M +M   ++ME         G+ + + G++
Sbjct: 4   IIDYKIIGNDMQLVEIELDPRESVRAEAGAMMYMGPGIKMETSTGGGLLKGLKRAVTGES 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  + G   G++   AP   +I+ +DLA  GG  +CQ D+FLC+   +++  T  +
Sbjct: 64  FFITNFTHEGEGKGYIAFGAPYPGKIICLDLAQLGGGFICQKDSFLCAAKGIEIEITFTK 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++   + G EGF+ QKL G GLAF+ AGG++++++L+ GEVI VD  CI A   S++  I
Sbjct: 124 KLGAGLFGGEGFILQKLKGDGLAFVHAGGTIIERDLKAGEVIRVDTGCICAFAESIDYNI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENP 255
            + G  + A+FGG+ +V A + GPG VF+QSLPF RL+ RI  A    +  E+ 
Sbjct: 184 GFVGGFKNALFGGEGVVLATLKGPGKVFLQSLPFSRLADRIMAAARYQSHDESS 237


>gi|255526734|ref|ZP_05393636.1| protein of unknown function DUF124 [Clostridium carboxidivorans P7]
 gi|296187024|ref|ZP_06855424.1| hypothetical protein CLCAR_2499 [Clostridium carboxidivorans P7]
 gi|255509569|gb|EET85907.1| protein of unknown function DUF124 [Clostridium carboxidivorans P7]
 gi|296048462|gb|EFG87896.1| hypothetical protein CLCAR_2499 [Clostridium carboxidivorans P7]
          Length = 263

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 92/246 (37%), Positives = 146/246 (59%), Gaps = 15/246 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
           I ++I G + Q  +I L P+E V+A  GSM  M  +++ME I+     +++ G+   LFG
Sbjct: 6   IDYKIYGDDMQFVEIELDPKETVIAEAGSMMMMDTNIQMETIFGDGSSQSKGGLMDKLFG 65

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                   +++      N G     V  AAP + +ILP+DL+  GG+L+CQ D+FLCS  
Sbjct: 66  AGKRVLTGESLFMTAYTNIGGGKQKVSFAAPYIGKILPMDLSALGGKLICQKDSFLCSAK 125

Query: 132 DVKVSNTLDQRVRNVIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
            V V   +D R +  +G  G EGF+ QKL G GLAF+ + G++++K+L  GE++ +D  C
Sbjct: 126 GVSVG--IDFRKKLSVGFFGGEGFILQKLEGDGLAFVHSCGALIKKDLAYGEILRIDTGC 183

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           +VA+T  ++  I++ G I+   FGG+ +  A V+GPG V+IQSLPF+RL++R+  A    
Sbjct: 184 LVAMTKDIHYDIEFVGGIKNTFFGGEGVFFATVSGPGTVWIQSLPFNRLAERVFAAAPRG 243

Query: 250 NMRENP 255
             R+  
Sbjct: 244 GGRKEE 249


>gi|83815096|ref|YP_444449.1| hypothetical protein SRU_0303 [Salinibacter ruber DSM 13855]
 gi|83756490|gb|ABC44603.1| conserved hypothetical protein TIGR00266 [Salinibacter ruber DSM
           13855]
          Length = 255

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 88/234 (37%), Positives = 135/234 (57%), Gaps = 1/234 (0%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I + I+G + Q+ ++ L PQE V A  G+M +M   + M+         G  + + G++ 
Sbjct: 9   IDYDIIGDDMQLVEVELDPQEGVRAEAGTMIYMDDGVRMQTDTGGGLFKGFKRMVSGESF 68

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                 + G     V  AAP   +I+P+ L  FGGE LCQ DAFLC+  +V++     +R
Sbjct: 69  FITTFVHEGQGKSHVAFAAPYPGKIIPVQLDDFGGEFLCQKDAFLCANANVEIEVAFTKR 128

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   + G EGF+ Q+LTGQG  FI AGG+VV++ LE GE + +D  C+ A +SS++  I+
Sbjct: 129 MGAGLFGGEGFILQRLTGQGWTFIHAGGTVVERTLEAGETLRIDTGCLAAFSSSIDYDIE 188

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSPNMRENP 255
           + G  + A+FGG+ L    +TGPG V++QSLPF RL+ RI A  + +    E P
Sbjct: 189 FIGGFKNALFGGEGLFLTTLTGPGTVYLQSLPFSRLADRIVAGELANRGESEGP 242


>gi|381187037|ref|ZP_09894602.1| DUF124 domain-containing protein [Flavobacterium frigoris PS1]
 gi|379650647|gb|EIA09217.1| DUF124 domain-containing protein [Flavobacterium frigoris PS1]
          Length = 266

 Score =  171 bits (432), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 90/230 (39%), Positives = 132/230 (57%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I +QI G E Q  +I L PQE V+A  GS   M  +++ME I+    E + G++  L   
Sbjct: 6   IDYQIFGEEMQYVEIELDPQEIVIAEAGSFMMMDNNIQMETIFGDGSEQKSGIFGQLLNA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N   S G V  A+P   +ILPIDL  F G+ +CQ  +FLC+   
Sbjct: 66  GKRVLTGESLFMTAFINQASSKGKVSFASPYPGKILPIDLTEFQGKFICQKSSFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++   + G EGF+ QK+ G G+AF+ +GG++ +K L  GEV+ VD  CIV 
Sbjct: 126 VSVGIEFSKKMGRGLFGGEGFIMQKIEGDGMAFVHSGGTLAKKELRAGEVLKVDTGCIVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  V+  I++ G I+ +VFGG+ L  A + GPG V++QSLPF RL+ RI
Sbjct: 186 FTKDVDYDIEFIGGIKNSVFGGEGLFYATLRGPGTVYVQSLPFSRLADRI 235


>gi|294506194|ref|YP_003570252.1| hypothetical protein SRM_00379 [Salinibacter ruber M8]
 gi|294342522|emb|CBH23300.1| Conserved hypothetical protein [Salinibacter ruber M8]
          Length = 255

 Score =  170 bits (431), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 92/236 (38%), Positives = 138/236 (58%), Gaps = 5/236 (2%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT- 81
           I + I+G + Q+ ++ L PQE V A  G+M +M   + M+         G  + + G++ 
Sbjct: 9   IDYDIIGDDMQLVEVELDPQEGVRAEAGTMIYMDDGVRMQTDTGGGLFKGFKRMVSGESF 68

Query: 82  -ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
            IT+ V    G S   V  AAP   +I+P+ L  FGGE LCQ DAFLC+   V++     
Sbjct: 69  FITTFVHEGQGKS--HVAFAAPYPGKIIPVQLDDFGGEFLCQKDAFLCANASVEIEVAFT 126

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +R+   + G EGF+ Q+LTGQG  FI AGG+VV++ LE GE + +D  C+ A +SS++  
Sbjct: 127 KRMGAGLFGGEGFILQRLTGQGWTFIHAGGTVVERTLEAGETLRIDTGCLAAFSSSIDYD 186

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSPNMRENP 255
           I++ G  + A+FGG+ L    +TGPG V++QSLPF RL+ RI A  + +    E P
Sbjct: 187 IEFIGGFKNALFGGEGLFLTTLTGPGTVYLQSLPFSRLADRIVAGELANRGESEGP 242


>gi|20093431|ref|NP_619506.1| hypothetical protein MA4652 [Methanosarcina acetivorans C2A]
 gi|19918805|gb|AAM07986.1| conserved hypothetical protein [Methanosarcina acetivorans C2A]
          Length = 263

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 89/237 (37%), Positives = 133/237 (56%), Gaps = 7/237 (2%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------Q 75
           I ++I+G + Q+ +I L P E V A  G+M +M   ++M+     E    +        +
Sbjct: 5   IDYEIIGNDMQIVEIELDPGEAVQAEAGAMAYMGPGIQMQTSMGSEGGGLLGGLKKGLKR 64

Query: 76  WLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKV 135
            L G++       + G   G V  AAP   +I+P+DL+ FGG +LCQ DAFLC+   V V
Sbjct: 65  ALTGESFFITNFIHKGSGKGHVAFAAPYPGKIIPLDLSKFGGSILCQKDAFLCAARGVDV 124

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                +R+   + G EGF+ Q+L G G AF+  GG+VV+K+L  GE   VD  C+ A T 
Sbjct: 125 EVAFTRRIGTGLFGGEGFILQRLRGNGFAFVHIGGTVVRKDLAPGETYHVDTGCVAAFTE 184

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +VN  I ++   + A+FGG+ +V A +TGPG +++QSLPF RL+ RI  A  S   R
Sbjct: 185 TVNYDITWSKDFKNALFGGEGIVLATLTGPGTIYMQSLPFSRLADRIYAASASHQNR 241


>gi|374339471|ref|YP_005096207.1| hypothetical protein Marpi_0466 [Marinitoga piezophila KA3]
 gi|372101005|gb|AEX84909.1| TIGR00266 family protein [Marinitoga piezophila KA3]
          Length = 257

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 85/233 (36%), Positives = 133/233 (57%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI ++I G + Q+ +I L P E V A  G+M +M   +EM+         G+ + L G++
Sbjct: 4   VIDYKIYGDDMQIVEIELDPGEGVRAEAGAMMYMEPGIEMQTKATGGIFGGLKRMLVGES 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G S   V   AP   +++PIDL  FGGE +CQ D++LCS   + +  T  +
Sbjct: 64  FFITNFLNTGHSKSHVAFGAPYPGKVIPIDLTKFGGEFICQKDSYLCSAAGIDIEITFTK 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++   + G EGF+ ++L G GLAF+ AGG+++++ L  GE + VD  CIV  +++V+  I
Sbjct: 124 KIGAGLFGGEGFILERLVGDGLAFVHAGGTIIERQLMPGETLRVDTGCIVGFSNTVDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           ++ G    A+FGG+ +    +TGPG V++QSLPF RL+ RI  A    N  E 
Sbjct: 184 QFVGGFVNALFGGEGIFLTHLTGPGTVYLQSLPFSRLADRIYSAANFSNRGET 236


>gi|395802295|ref|ZP_10481548.1| hypothetical protein FF52_10473 [Flavobacterium sp. F52]
 gi|395435536|gb|EJG01477.1| hypothetical protein FF52_10473 [Flavobacterium sp. F52]
          Length = 267

 Score =  169 bits (428), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 90/236 (38%), Positives = 131/236 (55%), Gaps = 13/236 (5%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +QI G E Q  +I L PQE V+A  GS   M  +++ME I+  +        LF
Sbjct: 2   QAHEIDYQIFGEEMQYVEIELDPQEIVIAEAGSFMMMENNIQMETIF-GDGSQQQGSGLF 60

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK +++                N G +   V  A+P   +ILPIDL  F G+ +CQ  +F
Sbjct: 61  GKLLSAGKRVLTGESLFMTAFLNQGHTKSKVSFASPYPGKILPIDLTEFQGKFICQKSSF 120

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V V     Q++   + G EGF+ QK+ G G+AF+ +GG++ +K L  GEV+ VD
Sbjct: 121 LCAAKGVSVGIEFSQKLGRGLFGGEGFIMQKIEGDGMAFVHSGGTMAKKELAHGEVLKVD 180

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             CI+  T  V+  I++ G I+ ++FGG+ L  A + GPG V+IQSLPF RL+ RI
Sbjct: 181 TGCIIGFTKDVDYDIEFIGGIKNSIFGGEGLFYATLKGPGTVYIQSLPFSRLADRI 236


>gi|171060011|ref|YP_001792360.1| hypothetical protein Lcho_3337 [Leptothrix cholodnii SP-6]
 gi|170777456|gb|ACB35595.1| protein of unknown function DUF124 [Leptothrix cholodnii SP-6]
          Length = 272

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 88/231 (38%), Positives = 136/231 (58%), Gaps = 10/231 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG 79
           VI ++I G E Q  ++ L P E  +   GSM FM   + M+ ++      + G++  L G
Sbjct: 5   VIDYEIKGSEMQFVEVELDPGEAAIGEAGSMMFMDAGIGMDTVFGDGRAQQGGLFGKLLG 64

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                   +++ + V  N G     +  AAP   +ILP+DL+  GG L+CQ DAFLC+  
Sbjct: 65  AGKRLVTGESLFTTVYTNNGSGKQRIAFAAPYPGKILPMDLSKLGGTLICQKDAFLCAAR 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +   + Q++     G EGF+ QKL G GLAF+ AGG+VV++ L+ G+ + VD  C+V
Sbjct: 125 GVSLGIAIQQKLSVGFFGGEGFIMQKLDGDGLAFVHAGGTVVRRELQPGQTLLVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A T+ V+ +I+Y G I+ A+FGG+ L  A ++GPG +++QSLP  RL+ RI
Sbjct: 185 AYTAGVSFEIQYVGKIKTALFGGEGLFFAKLSGPGTIWLQSLPLSRLASRI 235


>gi|375104787|ref|ZP_09751048.1| TIGR00266 family protein [Burkholderiales bacterium JOSHI_001]
 gi|374665518|gb|EHR70303.1| TIGR00266 family protein [Burkholderiales bacterium JOSHI_001]
          Length = 272

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 89/232 (38%), Positives = 134/232 (57%), Gaps = 11/232 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------- 74
           V+ ++I G E Q  ++ L P E  V   GSM FM   + M+ ++   ++ G         
Sbjct: 5   VVDYEIKGSEMQFVEVELDPGEAAVGEAGSMMFMDAGIGMDTVFGDGSQGGGGGLFGKLL 64

Query: 75  ----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + + G+++ + V  N   +   V  AAP   +ILP+DL   GG LLCQ DAFLC+ 
Sbjct: 65  GAGKRLITGESLFTTVYTNNASAKQRVAFAAPYPGKILPMDLRQLGGTLLCQKDAFLCAA 124

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     Q++     G EGF+ QKL G GLAF+ AGG+V+++ L  G+ + VD  C+
Sbjct: 125 RGVSLGIAFQQKMSVGFFGGEGFIMQKLEGNGLAFVHAGGTVLKRELAPGQTLLVDTGCV 184

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA T SVN +I+Y G ++ A+FGG+ L  A +TGPG +++QSLPF RL+ R+
Sbjct: 185 VAYTPSVNFEIQYVGKVKTALFGGEGLFFAKLTGPGTLWLQSLPFSRLASRV 236


>gi|209524295|ref|ZP_03272845.1| protein of unknown function DUF124 [Arthrospira maxima CS-328]
 gi|423066799|ref|ZP_17055589.1| hypothetical protein SPLC1_S511370 [Arthrospira platensis C1]
 gi|209495386|gb|EDZ95691.1| protein of unknown function DUF124 [Arthrospira maxima CS-328]
 gi|406711824|gb|EKD07023.1| hypothetical protein SPLC1_S511370 [Arthrospira platensis C1]
          Length = 238

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 96/229 (41%), Positives = 135/229 (58%), Gaps = 1/229 (0%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK 80
           VI ++I G + Q+ +I L P E V A  G+M +M   ++M+          G  + L G+
Sbjct: 4   VIDYKIYGNDLQLVEIELDPGEGVQAEVGTMTYMEDGIQMQTGMGKGGLFSGFKRMLTGE 63

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
                   N G     VG AA S  +I+P+DLA  GGE LCQ DAFLC+ N  +++    
Sbjct: 64  GFFITTFLNNGKQRSRVGFAATSPGQIIPVDLAQIGGEFLCQKDAFLCAANGTEINVAFT 123

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++     G EGF+ Q+L G GLAFI AGG+++QKNL+ GE + VD  C+VA + SV   
Sbjct: 124 KKLGAGFFGGEGFILQRLKGDGLAFIQAGGAMIQKNLQPGETLRVDTGCLVAFSPSVRYD 183

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           I++ G  R A+FGG+ L  A +TGPGIV +QSLPF RL+ RI   +T P
Sbjct: 184 IQFVGGFRNALFGGEGLFLAKMTGPGIVIVQSLPFSRLADRINSMITLP 232


>gi|376003487|ref|ZP_09781297.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
 gi|375328144|emb|CCE17050.1| conserved hypothetical protein [Arthrospira sp. PCC 8005]
          Length = 238

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 95/229 (41%), Positives = 136/229 (59%), Gaps = 1/229 (0%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK 80
           VI ++I G + Q+ +I L P E V A  G+M +M   ++M+          G  + + G+
Sbjct: 4   VIDYKIYGNDLQLVEIELDPGEGVQAEVGTMTYMEDGIQMQTGMGKGGLFSGFKRMITGE 63

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
           +       N G     VG AA S  +I+P+DLA  GGE LCQ DAFLC+ N  +++    
Sbjct: 64  SFFITTFLNNGKQRSRVGFAATSPGQIIPVDLAQIGGEFLCQKDAFLCAANGTEINVAFT 123

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++     G EGF+ Q+L G GLAFI AGG+++QKNL+ GE + VD  C+VA + SV   
Sbjct: 124 KKLGAGFFGGEGFILQRLKGDGLAFIQAGGAMIQKNLQPGETLRVDTGCLVAFSPSVRYD 183

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           I++ G  R A+FGG+ L  A +TGPGIV +QSLPF RL+ RI   +T P
Sbjct: 184 IQFVGGFRNALFGGEGLFLAKMTGPGIVIVQSLPFSRLADRINSMITLP 232


>gi|124267405|ref|YP_001021409.1| hypothetical protein Mpe_A2218 [Methylibium petroleiphilum PM1]
 gi|124260180|gb|ABM95174.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 270

 Score =  167 bits (424), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 93/244 (38%), Positives = 138/244 (56%), Gaps = 11/244 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY--IPENEVGMWQWLFG 79
           VI ++I G E Q  ++ L P E  +   GSM FM   + M+ ++      + G +  L G
Sbjct: 5   VIDYEIKGAEMQFVEVELDPGEAAIGEAGSMMFMDAGIGMDTVFGDASAQQGGFFGKLLG 64

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                   +++ + V  N   S   V  AAP   +ILP+DL   GG L+CQ DAFLC+  
Sbjct: 65  AGKRLVTGESLFTTVYTNNVGSKQRVAFAAPYPGKILPMDLRTLGGTLICQKDAFLCAAR 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++     G EGF+ QKL G GLAF+ AGG+V+++ L+ G+ + VD  C+V
Sbjct: 125 GVSLGIAFQRKLSVGFFGGEGFIMQKLDGDGLAFVHAGGTVLKRELQPGQTLLVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSPN 250
           A T SV+ +I+Y G ++ A+FGG+ L  A +TGPG V++QSLPF RL+ R+ A A     
Sbjct: 185 AYTQSVDFEIQYVGKVKTALFGGEGLFFAKLTGPGTVWLQSLPFSRLASRVFAAAPQRGG 244

Query: 251 MREN 254
            RE 
Sbjct: 245 SREE 248


>gi|310658335|ref|YP_003936056.1| conserved protein of unknown function [[Clostridium] sticklandii]
 gi|308825113|emb|CBH21151.1| conserved protein of unknown function [[Clostridium] sticklandii]
          Length = 269

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 84/235 (35%), Positives = 136/235 (57%), Gaps = 12/235 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + F++ G + Q  +I L   E ++A  G+M +M  ++ ME I+   +     + L GK +
Sbjct: 7   VDFKVHGDDMQFVEIQLDQNETIIAEAGAMMYMDQNIRMETIFGDGSGSDEGKGLMGKVL 66

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +                N  P    V  AAP   RI+P+DLA + GE++CQ DAFLC+ 
Sbjct: 67  GAGKRVLTGESLFMTSFTNTAPDRKSVAFAAPYPGRIVPLDLAEYSGEIICQRDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             ++V     ++V   + G EGF+ Q+L G GL+F+ AGG+++QK L  GEV+ +D  C+
Sbjct: 127 KGIRVEIAFQKKVGAGLFGGEGFIMQRLLGDGLSFLHAGGTIIQKELAPGEVLKLDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           VA+T+++N  ++  G ++ A+FGG+ L  A + GPG V++QSLPF RL+ R+  A
Sbjct: 187 VAMTNTINYDVEMAGNVKSALFGGEGLFLATLRGPGHVWLQSLPFSRLANRMYAA 241


>gi|409122567|ref|ZP_11221962.1| hypothetical protein GCBA3_02899 [Gillisia sp. CBA3202]
          Length = 266

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 88/245 (35%), Positives = 140/245 (57%), Gaps = 12/245 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENE------VGM 73
           + +QI G E Q  ++ L P E V+A  G+   M   ++M  I+     +NE      +G 
Sbjct: 6   VDYQIFGEEMQYVELELDPNEAVIAEAGNFMMMDSGIKMNTIFGDGSKQNEGFLGKVLGA 65

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++   +  N       V  A+P   +ILP+DL  +GG+ +CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTIFSNEVSGKKKVSFASPYPGKILPLDLIKYGGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRV-RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
           V +     +++ R  +GG EGF+ QK+ G G+AF+ AGG++ +K L+VGE I VD  CI+
Sbjct: 126 VSIGIEFSRKLGRGFLGG-EGFIMQKVEGDGMAFVHAGGTMARKELKVGEKICVDTGCII 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
             T  VN  I++ G I+  +FGG+ L  A +TGPG+V++QSLPF RL+ R+ +A      
Sbjct: 185 GFTQDVNYDIEFVGGIKNTIFGGEGLFFATLTGPGVVYVQSLPFSRLANRVLQAAPQAGG 244

Query: 252 RENPK 256
           ++  +
Sbjct: 245 KDKGE 249


>gi|451946761|ref|YP_007467356.1| TIGR00266 family protein [Desulfocapsa sulfexigens DSM 10523]
 gi|451906109|gb|AGF77703.1| TIGR00266 family protein [Desulfocapsa sulfexigens DSM 10523]
          Length = 331

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 81/236 (34%), Positives = 133/236 (56%), Gaps = 12/236 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I ++I G E Q  +I L P+E VV+  G+M +MS  + M+ ++   +E       F K +
Sbjct: 73  IDYEIFGHEMQFVEIELDPRESVVSEAGAMMYMSNGITMDTVFGDGSEESQSGGFFDKML 132

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             +  + G     V  A+P   +I+P+DL  + G ++CQ DAFLC+ 
Sbjct: 133 GAGKRLISGEGLFVTMFTHTGQGKAKVAFASPYPGKIIPLDLKQYNGRIICQKDAFLCAA 192

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++   + G EGF+ Q+L G G  F+ AGG++++K L  GE + VD  C+
Sbjct: 193 KGVSIGIAFQKKIGTALFGGEGFIMQQLDGDGFCFVHAGGTIIEKELAAGETLRVDTGCL 252

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           VA+T +VN  I++ G ++ A+FGG++   A + GPG V++QSLPF RL+ RI +A 
Sbjct: 253 VALTQTVNYDIEFVGNVKSAIFGGESFFFATLQGPGHVWLQSLPFSRLAGRIHQAA 308


>gi|337280412|ref|YP_004619884.1| hypothetical protein Rta_27620 [Ramlibacter tataouinensis TTB310]
 gi|334731489|gb|AEG93865.1| Conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 270

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 89/231 (38%), Positives = 132/231 (57%), Gaps = 10/231 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG 79
           VI + I G E Q  ++ L P E  +   GSM FM  ++ M+ ++      + G++  L G
Sbjct: 5   VIDYDIRGAEMQFVELELDPGEAAIGEAGSMMFMDAAIAMDTVFGDGRAQQGGLFGKLMG 64

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                   +++ + V  N G     V  AAP   +ILP++L   GG L+CQ D+FLC+  
Sbjct: 65  AGKRLVTGESLFTTVYTNHGAGKQRVAFAAPYPGKILPMNLRELGGSLVCQKDSFLCAAR 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +   L +R+     G EGF+ QKL G GLAF+ AGG+VV++ L+ G+ +  D  C+V
Sbjct: 125 GVSLGIALQRRLSAGFFGGEGFIMQKLEGDGLAFVHAGGTVVRRELQPGQTLLADTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A T  V   I+Y G +R A+FGG+ L  A +TGPG V++QSLPF RL+ R+
Sbjct: 185 AYTPGVGFDIQYVGKVRTALFGGEGLFFARLTGPGTVWLQSLPFSRLASRV 235


>gi|384098568|ref|ZP_09999682.1| hypothetical protein W5A_07917 [Imtechella halotolerans K1]
 gi|383835263|gb|EID74690.1| hypothetical protein W5A_07917 [Imtechella halotolerans K1]
          Length = 266

 Score =  166 bits (421), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 90/244 (36%), Positives = 134/244 (54%), Gaps = 10/244 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF-- 78
           + + I G E Q  +I L PQE VVA  G+   M   + ME I+   +  + G+   LF  
Sbjct: 6   VDYHIYGEEMQYVEIELDPQEAVVAEAGNFMMMDSGIRMETIFGDGSGQDQGIMGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++   V  N       V  AAP   +I+PIDL  + G+ +CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTVFLNEFQGKRKVSFAAPYPGKIIPIDLTAYQGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++   + G EGF+ QKL G G+AF+ AGG++ +K L  GEV+ VD  C+V 
Sbjct: 126 VSVGIEFSKKIGRGLFGGEGFIMQKLEGDGMAFVHAGGTLARKELAAGEVLKVDTGCVVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
            T +V+  I++ G I+  VFGG+ L  A + GPG+V+IQSLPF RL+ R+  A      +
Sbjct: 186 FTQTVDYDIEFIGGIKNTVFGGEGLFFATLRGPGVVYIQSLPFSRLAGRVWSAAPQAGGK 245

Query: 253 ENPK 256
           +  +
Sbjct: 246 QKGE 249


>gi|146298455|ref|YP_001193046.1| hypothetical protein Fjoh_0692 [Flavobacterium johnsoniae UW101]
 gi|146152873|gb|ABQ03727.1| protein of unknown function DUF124 [Flavobacterium johnsoniae
           UW101]
          Length = 267

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/236 (38%), Positives = 131/236 (55%), Gaps = 13/236 (5%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +QI G E Q  +I L PQE V+A  GS   M  +++ME I+  +        LF
Sbjct: 2   QAHEIDYQIFGEEMQYVEIELDPQEIVIAEAGSFMMMENNIQMETIF-GDGSQQQGSGLF 60

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK +++                N G +   V  A+P   +ILPIDL  F G+ +CQ  +F
Sbjct: 61  GKLLSAGKRVLTGESLFMTAFLNQGNTKSKVSFASPYPGKILPIDLTEFQGKFICQKSSF 120

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V V     Q++   + G EGF+ QK+ G G+AF+ +GG++ +K L  GEV+ VD
Sbjct: 121 LCAAKGVSVGIEFSQKLGRGLFGGEGFIMQKIEGDGIAFVHSGGTMAKKVLGHGEVLKVD 180

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             CI+  T  V+  I++ G I+ ++FGG+ L  A + GPG V+IQSLPF RL+ RI
Sbjct: 181 TGCIIGFTKDVDYDIEFIGGIKNSIFGGEGLFYATLKGPGTVYIQSLPFSRLADRI 236


>gi|330752099|emb|CBL80609.1| protein containing DUF124 [uncultured Leeuwenhoekiella sp.]
          Length = 266

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 131/230 (56%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           I +QI G E Q  +I L PQE V+A  G+   M   ++ME I+    +N+ G+   +FG 
Sbjct: 6   IDYQIFGEEMQYVEIELDPQEGVIAEAGTFMMMQDGIQMETIFGDGSKNDTGLLNKIFGA 65

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++   V  N       V  A+P   +I+PIDL+  GG  +CQ DAFLC+   
Sbjct: 66  GKRLLTGESMFMTVFYNNLIGKRKVSFASPYPGKIIPIDLSKNGGRFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ Q+L G G+AF+ AGG+   K L+ GE + VD  CI+ 
Sbjct: 126 VSIGIEFSKKIGRGLFGGEGFIMQRLEGDGMAFVHAGGTTFTKELQAGEKLKVDTGCIIG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
              +VN  I++ G I+  +FGG+ +  A +TGPG V+IQSLPF RL++R+
Sbjct: 186 FDHTVNYDIEFVGGIKNTLFGGEGVFFATLTGPGTVYIQSLPFSRLAKRV 235


>gi|291571684|dbj|BAI93956.1| hypothetical protein [Arthrospira platensis NIES-39]
          Length = 238

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 94/229 (41%), Positives = 135/229 (58%), Gaps = 1/229 (0%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK 80
           VI ++I G + Q+ +I L P E V A  G+M +M   ++M+          G  + + G+
Sbjct: 4   VIDYKIYGNDLQLVEIELDPGEGVQAEVGTMTYMEDGIQMQTGMGKGGLFSGFKRMITGE 63

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
           +       N G     VG AA S  +I+P+DLA  GGE LCQ DAFLC+ N  +++    
Sbjct: 64  SFFITTFLNNGKQRSRVGFAATSPGQIIPVDLAQIGGEFLCQKDAFLCAANGTEINVAFT 123

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++     G EGF+ Q+L G GLAFI AGG+++QK L+ GE + VD  C+VA + SV   
Sbjct: 124 KKLGAGFFGGEGFILQRLKGDGLAFIQAGGAMIQKTLQPGETLRVDTGCLVAFSPSVRYD 183

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           I++ G  R A+FGG+ L  A +TGPGIV +QSLPF RL+ RI   +T P
Sbjct: 184 IQFVGGFRNALFGGEGLFLAKMTGPGIVIVQSLPFSRLADRINSMITLP 232


>gi|451819306|ref|YP_007455507.1| TIGR00266 family protein [Clostridium saccharoperbutylacetonicum
           N1-4(HMT)]
 gi|451785285|gb|AGF56253.1| TIGR00266 family protein [Clostridium saccharoperbutylacetonicum
           N1-4(HMT)]
          Length = 265

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 89/239 (37%), Positives = 138/239 (57%), Gaps = 11/239 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLF- 78
           I ++I G + Q  +I L P E V+A  GSM  +  +++ME I+     +   G    LF 
Sbjct: 6   IDYKIYGDDMQYVEIELDPAETVIAEAGSMMMIDPNIQMETIFGDGSKQQSNGFMDKLFS 65

Query: 79  -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                  G+++      N G        AAP   +ILP+DL++ GG+L+CQ DAFLC+  
Sbjct: 66  AGKRVLTGESLFMTAFTNAGQLKEKAYFAAPYPGKILPMDLSLLGGKLICQKDAFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ +KL G G+AFI A G++V+++L +GE++ +D  C+V
Sbjct: 126 GVSVGIDFRKKLSVGFFGGEGFILEKLEGDGMAFIHACGAIVERDLALGEILKIDTGCLV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           A+T  V+  I++ G I+  +FGG+ +  A VTGPG V+IQSLPF RL++R+  A  S  
Sbjct: 186 AMTQGVHYDIQFIGGIKNTLFGGEGVFFATVTGPGKVWIQSLPFSRLAERVFSAAPSAG 244


>gi|428316299|ref|YP_007114181.1| protein of unknown function DUF124 [Oscillatoria nigro-viridis PCC
           7112]
 gi|428239979|gb|AFZ05765.1| protein of unknown function DUF124 [Oscillatoria nigro-viridis PCC
           7112]
          Length = 240

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 88/224 (39%), Positives = 133/224 (59%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           +I ++I G + Q+ +I L P+E V A  G+M +M G ++M+         G  + L G  
Sbjct: 4   IIDYKIYGDDLQLIEIELDPKEGVRAEAGTMTYMEGDIQMQTSTGGGLFQGFKRMLTGAG 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G     V  AAP   +++P+DL   GG+ LCQ D+FLC+ N +++     +
Sbjct: 64  FFITTFVNAGNRKARVAFAAPYPGKVIPLDLGQLGGKFLCQKDSFLCAANGIEIEVAFTK 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           R+     G EGF+ QKL G GLAF+ AGG++V+KNL VGEV+ VD  C+VA   +V+  I
Sbjct: 124 RLGAGFFGGEGFILQKLQGDGLAFVHAGGTIVEKNLGVGEVLRVDTGCLVAFAPTVDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           ++ G  + A+FGG+ L  A +TGPG V++QSLP  +L++RI  A
Sbjct: 184 QFVGGFKNALFGGEGLFLAKMTGPGKVYVQSLPLSKLAERILAA 227


>gi|406672919|ref|ZP_11080144.1| TIGR00266 family protein [Bergeyella zoohelcum CCUG 30536]
 gi|405587463|gb|EKB61191.1| TIGR00266 family protein [Bergeyella zoohelcum CCUG 30536]
          Length = 265

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 87/250 (34%), Positives = 132/250 (52%), Gaps = 12/250 (4%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
            ++  I ++I G E Q   I L PQE  +A  GS   M   ++M+ I+   +E      L
Sbjct: 2   KKNHEIDYEIHGEEMQYVDIELDPQETAIAEAGSFMMMEDGIQMQTIFGDGSEKNQSTGL 61

Query: 78  FGKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDA 125
           FGK +++                N G     V  A+P   +I+P+DL    G+++CQ DA
Sbjct: 62  FGKILSAGKRMLTGESLFMTAYTNMGQGKKKVAFASPYPGKIIPLDLEQLNGKVICQKDA 121

Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
           FLC+   V V     +++   + G EGF+ QKL G G+AF+ AGG V +K L+VGE I +
Sbjct: 122 FLCAAKGVSVGIEFQKKLGTGLFGGEGFIMQKLEGDGMAFVHAGGHVTEKELQVGEKIKI 181

Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           D  CIV  T +V+  +++ G I+  +FGG+ +  A +TGPG ++IQ+LP  RL+ RI   
Sbjct: 182 DTGCIVGFTQTVHYDVEFVGGIKNTLFGGEGVFFATLTGPGKIWIQTLPVSRLASRILSY 241

Query: 246 VTSPNMRENP 255
            T     E  
Sbjct: 242 ATYNKKEEGS 251


>gi|423316198|ref|ZP_17294103.1| TIGR00266 family protein [Bergeyella zoohelcum ATCC 43767]
 gi|405584268|gb|EKB58184.1| TIGR00266 family protein [Bergeyella zoohelcum ATCC 43767]
          Length = 265

 Score =  165 bits (418), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 88/250 (35%), Positives = 131/250 (52%), Gaps = 12/250 (4%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
            ++  I ++I G E Q   I L PQE  +A  GS   M   ++M+ I+   +E      L
Sbjct: 2   KKNHEIDYEIHGEEMQYVDIELDPQETAIAEAGSFMMMEDGIQMQTIFGDGSEKNQSTGL 61

Query: 78  FGKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDA 125
           FGK +++                N G     V  A+P   +I+P+DL    G+++CQ DA
Sbjct: 62  FGKILSAGKRMLTGESLFMTAYTNMGQGKKKVAFASPYPGKIIPLDLEQLNGKVICQKDA 121

Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
           FLC+   V V     +++   + G EGF+ QKL G G+AF+ AGG V +K L+ GE I +
Sbjct: 122 FLCAAKGVSVGIEFQKKLGTGLFGGEGFIMQKLEGDGMAFVHAGGHVTEKELQAGEKIKI 181

Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           D  CIV  T +V+  I++ G I+  +FGG+ +  A +TGPG V+IQ+LP  RL+ RI   
Sbjct: 182 DTGCIVGFTQTVHYDIEFVGGIKNTLFGGEGVFFATLTGPGKVWIQTLPVSRLASRILSY 241

Query: 246 VTSPNMRENP 255
            T     E  
Sbjct: 242 ATYNKKEEGS 251


>gi|330752718|emb|CBL88182.1| protein containing DUF124 [uncultured Leeuwenhoekiella sp.]
          Length = 266

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 131/230 (56%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG- 79
           I +QI G E Q  +I L PQE V+A  G+   M   ++ME I+    +N+ G+   +FG 
Sbjct: 6   IDYQIFGEEMQYVEIELDPQEGVIAEAGTFMMMQDGIQMETIFGDGSKNDTGLLNKIFGA 65

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++   V  N       V  A+P   +I+PIDL   GG  +CQ DAFLC+   
Sbjct: 66  GKRLLTGESMFMTVFYNNLIGKRKVSFASPYPGKIIPIDLNKNGGRFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ Q+L G G++F+ AGG+   K L+VGE + VD  CI+ 
Sbjct: 126 VSIGIEFSKKIGRGLFGGEGFIMQRLDGDGMSFVHAGGTTFTKELQVGEKLKVDTGCIIG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
              +VN  I++ G I+  +FGG+ +  A +TGPG V+IQSLPF RL++R+
Sbjct: 186 FDHTVNYDIEFVGGIKNTLFGGEGVFFATLTGPGTVYIQSLPFSRLAKRV 235


>gi|86144232|ref|ZP_01062566.1| hypothetical protein MED217_06549 [Leeuwenhoekiella blandensis
           MED217]
 gi|85829310|gb|EAQ47778.1| hypothetical protein MED217_06549 [Leeuwenhoekiella blandensis
           MED217]
          Length = 266

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 130/230 (56%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           I +QI G E Q  +I L PQE V+A  G+   M   + ME I+    +N+ G+   +FG 
Sbjct: 6   IDYQIYGEEMQYVEIELDPQEGVIAEAGTFMMMEDGIRMETIFGDGSKNDSGLLNKVFGA 65

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++   V  N       V  A+P   +I+PIDL   GG  +CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTVFYNDIVGKKKVSFASPYPGKIIPIDLTKNGGRFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G+AF+ +GG+  +K L +GE + VD  CI+ 
Sbjct: 126 VSIGIEFSRKLGRGLFGGEGFIMQKLEGDGMAFVHSGGTTFKKELGLGEKLKVDTGCIIG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
              +VN  I++ G I+  +FGG+ +  A +TGPG V+IQSLPF RL++R+
Sbjct: 186 FDHTVNYDIEFVGGIKNTIFGGEGVFFATLTGPGTVYIQSLPFSRLAKRV 235


>gi|407773538|ref|ZP_11120839.1| hypothetical protein TH2_06548 [Thalassospira profundimaris WP0211]
 gi|407284002|gb|EKF09530.1| hypothetical protein TH2_06548 [Thalassospira profundimaris WP0211]
          Length = 235

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/227 (39%), Positives = 137/227 (60%), Gaps = 4/227 (1%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI + + G E Q+ +I L P E V A  G+M +MS  + M+         G  + L G+ 
Sbjct: 4   VIDYTVHGEEMQLVEIELDPGEGVRAEAGAMMYMSDGIAMQTGTGGGLFSGFKRMLTGEG 63

Query: 82  --ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
             ITS V  + G   G V  AAP   +++P+DL   GGE++CQ DAFLC+ + + +    
Sbjct: 64  FFITSFV--HEGSGKGHVAFAAPYPGKVVPLDLGEHGGEIICQKDAFLCAASGIDIDIAF 121

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            +++   + G EGF+ Q+L+G GLAF+ AGG++++K+L+ G+ + VD   +VA+T SV+ 
Sbjct: 122 SKKLGAGLFGGEGFILQRLSGDGLAFMHAGGTIIRKDLKPGQTLRVDTGALVAMTPSVDY 181

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            IK+ G    A+FGG+ L    + GPG V++QSLPF RL+ RIA A+
Sbjct: 182 SIKFVGGFTNALFGGEGLFVTTLEGPGTVWLQSLPFARLADRIAAAL 228


>gi|409993428|ref|ZP_11276569.1| hypothetical protein APPUASWS_20002 [Arthrospira platensis str.
           Paraca]
 gi|409935697|gb|EKN77220.1| hypothetical protein APPUASWS_20002 [Arthrospira platensis str.
           Paraca]
          Length = 238

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 135/229 (58%), Gaps = 1/229 (0%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK 80
           VI ++I G + Q+ +I L P E V A  G+M +M   ++M+          G  + + G+
Sbjct: 4   VIDYKIYGNDLQLVEIELDPGEGVQAEVGTMTYMEDGIQMQTGMGKGGLFSGFKRMITGE 63

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
           +       N G     VG AA S  +I+P+DL+  GGE LCQ DAFLC+ N  +++    
Sbjct: 64  SFFITTFLNNGKQRSRVGFAATSPGQIIPVDLSQIGGEFLCQKDAFLCAANGTEINVAFT 123

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++     G EGF+ Q+L G GLAFI AGG+++QK L+ GE + VD  C+VA + SV   
Sbjct: 124 KKLGAGFFGGEGFILQRLKGDGLAFIQAGGAMIQKTLQPGETLRVDTGCLVAFSPSVRYD 183

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           I++ G  R A+FGG+ L  A +TGPGIV +QSLPF RL+ RI   +T P
Sbjct: 184 IQFVGGFRNALFGGEGLFLAKMTGPGIVIVQSLPFSRLADRINSMITLP 232


>gi|340616149|ref|YP_004734602.1| hypothetical protein zobellia_137 [Zobellia galactanivorans]
 gi|339730946|emb|CAZ94210.1| Conserved hypothetical protein [Zobellia galactanivorans]
          Length = 265

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/234 (38%), Positives = 132/234 (56%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I +QI G E Q  +I L PQE VVA  GS   M   ++M+ I+      + G+   +F  
Sbjct: 6   IDYQIYGEEMQYVEIELDPQEAVVAEAGSFMMMDSDIKMDTIFGDGSNQDTGVLGKIFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G     V  A+P   +ILPIDL+   G+ +CQ DAFLC+   
Sbjct: 66  GKRILTGESLFMTAFLNIGQGKRQVSFASPYPGKILPIDLSEKQGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++   + G EGF+ QKL G GLAF+ AGG++ +K L  GE++ VD  CIV 
Sbjct: 126 VSVGIEFSKKLGRGLFGGEGFIMQKLEGDGLAFVHAGGTMAKKELAAGEILKVDTGCIVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            + +V+  I++ G I+  +FGG+ L  A + GPG V+IQSLPF RL+ R+  ++
Sbjct: 186 FSQTVDYDIEFVGGIKNTIFGGEGLFFATLRGPGTVYIQSLPFSRLAGRVLASI 239


>gi|126662554|ref|ZP_01733553.1| hypothetical protein FBBAL38_04345 [Flavobacteria bacterium BAL38]
 gi|126625933|gb|EAZ96622.1| hypothetical protein FBBAL38_04345 [Flavobacteria bacterium BAL38]
          Length = 266

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 132/232 (56%), Gaps = 14/232 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I + I G E Q  ++ L PQE VVA  GS   M   ++ME I+   ++    Q +FGK +
Sbjct: 6   IDYHIYGEEMQYVEVELDPQEVVVAEAGSFMMMENGIKMETIFGDGSQ--EQQGIFGKLL 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           ++               +N       V  A+P   +I+PIDL  FGG+ +CQ D+FLC+ 
Sbjct: 64  SAGKRVLTGESLFMTAYQNQDIGKRKVSFASPYPGKIVPIDLQQFGGKFICQKDSFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V     +++   + G EGF+ QKL G G+AF+ +GG++ +K L+ GE++ VD  C+
Sbjct: 124 KGVTVGIEFTKKLGTGLFGGEGFIMQKLEGDGMAFVHSGGTLARKELQSGELLRVDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           V  T  V+  I++ G I+  +FGG+ +  A + GPGIV+IQSLPF RL+ RI
Sbjct: 184 VGFTKDVDYDIEFVGGIKNTIFGGEGVFFATLRGPGIVYIQSLPFTRLADRI 235


>gi|297528959|ref|YP_003670234.1| hypothetical protein GC56T3_0608 [Geobacillus sp. C56-T3]
 gi|297252211|gb|ADI25657.1| protein of unknown function DUF124 [Geobacillus sp. C56-T3]
          Length = 264

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 92/240 (38%), Positives = 135/240 (56%), Gaps = 10/240 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +++ G + Q  +I L P E VVA  G M  M   + ME ++   +  G          
Sbjct: 6   IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++   V  N G     V  AAP   +I+P+DL+  GG+L+CQ D+FLC+   
Sbjct: 66  GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSELGGKLICQKDSFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAF+ AGG++ +++L+ GE + +D  C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  VN  I+Y G I+ A FGG+ L  A VTGPG V++QSLPF RL+ R+  A  S   R
Sbjct: 186 MTKDVNYDIEYVGSIKTAFFGGEGLFLATVTGPGAVWVQSLPFSRLADRVLAAAPSAGER 245


>gi|27357088|gb|AAN86542.1| unknown [Eubacterium acidaminophilum]
          Length = 268

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 84/235 (35%), Positives = 132/235 (56%), Gaps = 12/235 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I + + G + Q+ +I L P E V+A  G+M +M   ++ME +    +  G    L GK +
Sbjct: 7   IDYTLHGDDLQLVEIELDPGESVIAEAGAMLYMENGIQMEAVLGDASGKGEGTGLMGKLL 66

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             +  N G     V  AAP   +I+P DL  +GG ++CQ DAFLC+ 
Sbjct: 67  GAGKRVIMGESLFMTLFTNKGSQKQKVAFAAPYPGKIVPFDLNAYGGRIICQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             + V     +++   + G EGF+ ++L G G AF+ AGG++V+K L   E++ VD  C+
Sbjct: 127 KGISVQMEFQKKIGVGLFGGEGFIMERLEGDGFAFLHAGGAIVEKELSQAELLKVDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           VA TS V+  I++ G I+ A+FGG+ +V A + GPG V++QSLPF RL+ R+  A
Sbjct: 187 VAFTSGVDYDIQFMGDIKSAIFGGEGIVLATLKGPGKVWLQSLPFSRLADRMISA 241


>gi|15894815|ref|NP_348164.1| hypothetical protein CA_C1537 [Clostridium acetobutylicum ATCC 824]
 gi|337736757|ref|YP_004636204.1| hypothetical protein SMB_G1562 [Clostridium acetobutylicum DSM
           1731]
 gi|384458264|ref|YP_005670684.1| hypothetical protein CEA_G1552 [Clostridium acetobutylicum EA 2018]
 gi|15024487|gb|AAK79504.1|AE007664_1 Uncharacterized conserved protein [Clostridium acetobutylicum ATCC
           824]
 gi|325508953|gb|ADZ20589.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018]
 gi|336290727|gb|AEI31861.1| hypothetical protein SMB_G1562 [Clostridium acetobutylicum DSM
           1731]
          Length = 260

 Score =  164 bits (415), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 92/255 (36%), Positives = 138/255 (54%), Gaps = 14/255 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
           I ++I+G E Q+ ++ L P E VVA  G+M +M  S+EME I+   ++ G    L     
Sbjct: 5   IDYKIIGSEMQIVEVELDPYESVVAEAGAMMYMDSSIEMETIFGDGSDKGSSGGLVSKLM 64

Query: 79  --------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
                   G+++   +  N G     V  AAP   +I+P+DL+ F   ++CQ D FLC+ 
Sbjct: 65  GAGKRLVTGESLFMTIFTNRGVGKQKVAFAAPYPGKIIPMDLSSFNNYMICQKDCFLCAA 124

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++   + G EGF+ QKL G GL F+ +GG++VQ+ L   EVI VD  C+
Sbjct: 125 KGVSIGVEFTRKIGVGLFGGEGFMLQKLEGDGLTFVHSGGTIVQRELLPKEVIKVDTGCL 184

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           VA T  VN  I+    I+ A+FGG+ L  A + GPG V++QS+PF RL+ RI    T+  
Sbjct: 185 VAFTRDVNYDIEMVKGIKSAIFGGEGLFLATLKGPGTVWLQSIPFSRLAGRINS--TAAT 242

Query: 251 MRENPKFFVQIAIFF 265
            +E       I  FF
Sbjct: 243 GKEEGSILGNIGNFF 257


>gi|448239158|ref|YP_007403216.1| DUF124 family protein [Geobacillus sp. GHH01]
 gi|445208000|gb|AGE23465.1| DUF124 family protein [Geobacillus sp. GHH01]
          Length = 264

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 92/240 (38%), Positives = 135/240 (56%), Gaps = 10/240 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +++ G + Q  +I L P E VVA  G M  M   + ME ++   +  G          
Sbjct: 6   IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++   V  N G     V  AAP   +I+P+DL+  GG+L+CQ D+FLC+   
Sbjct: 66  GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSKLGGKLICQKDSFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAF+ AGG++ +++L+ GE + +D  C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  VN  I+Y G I+ A FGG+ L  A VTGPG V++QSLPF RL+ R+  A  S   R
Sbjct: 186 MTKDVNYDIEYVGSIKTAFFGGEGLFLATVTGPGAVWVQSLPFSRLADRVLAAAPSAGGR 245


>gi|163787928|ref|ZP_02182374.1| hypothetical protein FBALC1_06103 [Flavobacteriales bacterium
           ALC-1]
 gi|159876248|gb|EDP70306.1| hypothetical protein FBALC1_06103 [Flavobacteriales bacterium
           ALC-1]
          Length = 274

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 89/242 (36%), Positives = 132/242 (54%), Gaps = 10/242 (4%)

Query: 15  YQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVG 72
           Y       I ++I G E Q  +I L P E V+A  GS   M   ++M+ I+    + + G
Sbjct: 7   YNDRNSHEIDYRIYGEEMQYVEIELDPDEAVIAESGSFMMMDDGIKMDTIFGDGSQKDTG 66

Query: 73  MW--------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPD 124
                     + L G+++      N       V  A+P   +I+PIDL  FGG+ +CQ D
Sbjct: 67  FLGKILGAGKRILTGESLFMTAFYNDLVGKRNVSFASPYPGKIIPIDLNEFGGKFICQKD 126

Query: 125 AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVIT 184
           AFLC+   V V     +++   + G EGF+ QKL G G+AF+ AGG++ +K LE GE + 
Sbjct: 127 AFLCAAKGVSVGIEFSKKLGRGLFGGEGFIMQKLEGDGMAFVHAGGTMAEKTLEAGETLR 186

Query: 185 VDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           VD  CIV  T  ++  I++ G I+ +VFGG+ L  A + GPG V+IQSLPF RL+ R+  
Sbjct: 187 VDTGCIVGFTQGIDYDIEFVGGIKNSVFGGEGLFFAKLQGPGKVYIQSLPFSRLAGRVLA 246

Query: 245 AV 246
           +V
Sbjct: 247 SV 248


>gi|85819367|gb|EAQ40526.1| conserved hypothetical protein [Dokdonia donghaensis MED134]
          Length = 266

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 129/230 (56%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           I ++I G E Q  ++ L PQE V+A  GS   M   + ME I+      + G+   +FG 
Sbjct: 6   IDYEIFGEEMQYVEVELDPQEGVIAEAGSFMMMEEGIRMETIFGDGSAKDTGVMGKIFGA 65

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++      N       V  A+P   +I+PIDL   GG+ +CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTAFYNDVVGKKKVSFASPYPGKIIPIDLGAIGGKFVCQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++   + G EGF+ Q+L G G+AF+ AGG++++K L  GE + VD  CI+ 
Sbjct: 126 VSVGIEFSRKLGRGLFGGEGFIMQRLEGDGIAFVHAGGTMLKKELAPGERLKVDTGCIIG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             SSVN  I++ G I+  +FGG+ L  A +TGPG V+IQSLPF RL+ R+
Sbjct: 186 FDSSVNYDIEFVGGIKNTIFGGEGLFFATLTGPGTVYIQSLPFSRLANRV 235


>gi|305667437|ref|YP_003863724.1| hypothetical protein FB2170_14358 [Maribacter sp. HTCC2170]
 gi|88709485|gb|EAR01718.1| hypothetical protein FB2170_14358 [Maribacter sp. HTCC2170]
          Length = 265

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 90/234 (38%), Positives = 133/234 (56%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP----ENEV------G 72
           I ++I G E Q  +I L PQE VVA  GS   M   ++M+ I+      +N V       
Sbjct: 6   IDYEIFGEEMQYVEIELDPQEAVVAEAGSFMMMDTDIKMDTIFGDGSNQDNSVLGKIFSA 65

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++      N G S   V  AAP   +I+PIDL+   G+ +CQ DAFLC+   
Sbjct: 66  GKRILTGESLFMTAFLNIGQSKRQVSFAAPYPGKIVPIDLSDKNGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++   + G EGF+ QKL G G+AF+ AGG++ +K L  GEV+ VD  CIV 
Sbjct: 126 VSVGIEFSKKLGRGLFGGEGFIMQKLEGDGMAFVHAGGTLAKKELAAGEVMKVDTGCIVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            + +V+  I++ G I+  VFGG+ L  A + GPG V++QSLPF RL+ R+  ++
Sbjct: 186 FSQNVDYDIEFVGGIKNTVFGGEGLFFATLKGPGTVYVQSLPFSRLAGRVLASI 239


>gi|261418043|ref|YP_003251725.1| hypothetical protein GYMC61_0564 [Geobacillus sp. Y412MC61]
 gi|319767998|ref|YP_004133499.1| hypothetical protein [Geobacillus sp. Y412MC52]
 gi|261374500|gb|ACX77243.1| protein of unknown function DUF124 [Geobacillus sp. Y412MC61]
 gi|317112864|gb|ADU95356.1| protein of unknown function DUF124 [Geobacillus sp. Y412MC52]
          Length = 264

 Score =  163 bits (412), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 92/240 (38%), Positives = 135/240 (56%), Gaps = 10/240 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +++ G + Q  +I L P E VVA  G M  M   + ME ++   +  G          
Sbjct: 6   IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++   V  N G     V  AAP   +I+P+DL+  GG+L+CQ D+FLC+   
Sbjct: 66  GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSELGGKLICQKDSFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAF+ AGG++ +++L+ GE + +D  C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  VN  I+Y G I+ A FGG+ L  A VTGPG V++QSLPF RL+ R+  A  S   R
Sbjct: 186 MTKDVNYDIEYVGNIKTAFFGGEGLFLATVTGPGAVWVQSLPFSRLADRVLAAAPSAGGR 245


>gi|56421414|ref|YP_148732.1| hypothetical protein GK2879 [Geobacillus kaustophilus HTA426]
 gi|56381256|dbj|BAD77164.1| hypothetical conserved protein [Geobacillus kaustophilus HTA426]
          Length = 264

 Score =  163 bits (412), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 92/240 (38%), Positives = 135/240 (56%), Gaps = 10/240 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +++ G + Q  +I L P E VVA  G M  M   + ME ++   +  G          
Sbjct: 6   IDYRLYGDDMQFVEIELDPYESVVAEAGGMMMMDDGIVMETVFGDGSSSGKGLLGRLVGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++   V  N G     V  AAP   +I+P+DL+  GG+L+CQ D+FLC+   
Sbjct: 66  GKRLLTGESLFMTVFTNQGSGKRRVAFAAPYPGKIIPVDLSELGGKLICQKDSFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAF+ AGG++ +++L+ GE + +D  C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIHRRDLQPGETLRIDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  VN  I+Y G I+ A FGG+ L  A VTGPG V++QSLPF RL+ R+  A  S   R
Sbjct: 186 MTKDVNYDIEYVGNIKTAFFGGEGLFLATVTGPGAVWVQSLPFSRLADRVLAAAPSAGGR 245


>gi|365960391|ref|YP_004941958.1| hypothetical protein FCOL_06720 [Flavobacterium columnare ATCC
           49512]
 gi|365737072|gb|AEW86165.1| hypothetical protein FCOL_06720 [Flavobacterium columnare ATCC
           49512]
          Length = 267

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 90/250 (36%), Positives = 139/250 (55%), Gaps = 15/250 (6%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN---EVGMWQ 75
           Q   I + I G E Q  +I L PQE VVA  GS   M   ++ME I+   +   + G+  
Sbjct: 2   QAHEIDYHIYGEEMQYVEIELDPQEIVVAEAGSFMMMENGIKMETIFGDGSNYQDSGILG 61

Query: 76  WLF--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
            L         G+++     +N   S   +  A+P   +I+ IDL  +GG+ +CQ D+FL
Sbjct: 62  KLLSAGKRVLTGESLFMTAYQNQDFSKRKISFASPYPGKIIAIDLMKYGGKFICQKDSFL 121

Query: 128 CSVNDVKVSNTLDQRV-RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           C+   V V     Q++ R + GG EGF+ QK+ G G+AF+ +GG++  K L+VGE++ +D
Sbjct: 122 CAAKGVSVGIEFSQKLGRGLFGG-EGFIMQKIEGDGMAFVHSGGTLACKELQVGEILKID 180

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI--AR 244
             CIV  T  ++  I++ G I+  +FGG+ +  A + GPG+V+IQSLPF RL+ RI  A 
Sbjct: 181 TGCIVGFTKDIDYDIEFIGGIKNTLFGGEGMFYATLKGPGVVYIQSLPFSRLAGRIIAAS 240

Query: 245 AVTSPNMREN 254
            +   N +E 
Sbjct: 241 TIGGQNNKEE 250


>gi|149372109|ref|ZP_01891379.1| hypothetical protein SCB49_00275 [unidentified eubacterium SCB49]
 gi|149354876|gb|EDM43438.1| hypothetical protein SCB49_00275 [unidentified eubacterium SCB49]
          Length = 266

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/242 (35%), Positives = 129/242 (53%), Gaps = 10/242 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           I + I G E Q  ++ L P E V+A  GS   M   ++ME I+    + + GM   +FG 
Sbjct: 6   IDYTIFGEEMQYVEVELDPNEGVIAEAGSFMMMDDGIKMETIFGDGSKKDSGMMGKIFGA 65

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++      N       V  A+P   +I+P+DL+ FG + +CQ  AFLC+   
Sbjct: 66  GKRLLTGESLFMTAFYNVQSGKKHVSFASPYPGKIIPLDLSKFGNKFICQKSAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++   + G EGF+ QKL G G+AF+ AGG+   K L  GE + VD  CI+ 
Sbjct: 126 VAVGVEFSRKLGRGLFGGEGFIMQKLEGDGMAFVHAGGTTATKELAPGEKLKVDTGCIIG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
             SSVN  I++ G I+  VFGG+ L  A +TGPG V++QSLPF RL+ R+  A      +
Sbjct: 186 FDSSVNYDIEFVGGIKNTVFGGEGLFFATLTGPGTVYVQSLPFSRLANRVWAAAPKSGGK 245

Query: 253 EN 254
           + 
Sbjct: 246 DK 247


>gi|406833583|ref|ZP_11093177.1| hypothetical protein SpalD1_18131 [Schlesneria paludicola DSM
           18645]
          Length = 334

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 89/257 (34%), Positives = 143/257 (55%), Gaps = 11/257 (4%)

Query: 9   PFQPYVYQSPQDA-VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP 67
           P  P    +P+ A  I ++I G E Q   + L PQE V+A PG+M FM+  ++ME  +  
Sbjct: 62  PHSPPRVSAPRSADEIDYEIFGEEMQYVVVELDPQEMVIAEPGAMMFMTAGIKMETRFGD 121

Query: 68  ENE--VGMWQWLF--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG 117
            ++   G+W  L         G+++        G   G V  A+P   +IL +DLA  GG
Sbjct: 122 PSKPSEGLWGKLVTAGRRAITGESLFVTTFTQAGNGKGQVAFASPYPGKILAVDLAQLGG 181

Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
           EL+CQ D+F+C+   V+++    +++   + G EGF+ Q+L G G+  I AGG ++++ L
Sbjct: 182 ELICQKDSFVCAARGVQLAVAFQKKIGVGLFGGEGFVMQRLLGDGVVMIHAGGMLMRRTL 241

Query: 178 EVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
           + GE + +D  C+VA+  SV   I++ G  +  +FGG+ L  A VTGPG V++QSLPF R
Sbjct: 242 KAGETLRLDTGCLVALQPSVTYDIQFVGGFKNTLFGGEGLFMATVTGPGEVWLQSLPFSR 301

Query: 238 LSQRIARAVTSPNMREN 254
           L+ R+  A      ++ 
Sbjct: 302 LAGRVLAAGVGATRKDE 318


>gi|295134252|ref|YP_003584928.1| tryptophan RNA-binding attenuator like protein [Zunongwangia
           profunda SM-A87]
 gi|294982267|gb|ADF52732.1| tryptophan RNA-binding attenuator like protein [Zunongwangia
           profunda SM-A87]
          Length = 266

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/234 (37%), Positives = 135/234 (57%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENE------VGM 73
           I ++I G E Q  ++ L PQE  +A  G+   M   ++M+ I+     +NE      +G 
Sbjct: 6   IDYEIFGEEMQYVELELDPQEAAIAEAGNFMMMDNGIKMDTIFGDGSKQNEGFLGKVLGA 65

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++   +  N G     +  A+P   +I+P+DL  FGG+ +CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTIFSNIGQGKKKISFASPYPGKIIPVDLTQFGGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++     G EGF+ Q L G G+AF+ AGG++ +K L VGEV+ VD  CI+ 
Sbjct: 126 VSIGIEFSRKLGRGFFGGEGFIMQSLEGDGMAFVHAGGTMAKKELAVGEVLKVDTGCIIG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            T +VN  I++ G IR   FGG+ L  A +TGPG+V++QSLPF RL+ R+ +A 
Sbjct: 186 FTQTVNYDIEFIGGIRNTFFGGEGLFFATLTGPGVVYVQSLPFSRLANRVLQAA 239


>gi|283780460|ref|YP_003371215.1| hypothetical protein Psta_2686 [Pirellula staleyi DSM 6068]
 gi|283438913|gb|ADB17355.1| protein of unknown function DUF124 [Pirellula staleyi DSM 6068]
          Length = 288

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 86/260 (33%), Positives = 141/260 (54%), Gaps = 17/260 (6%)

Query: 10  FQPYVY---QSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           FQP V    +S     I ++I G E Q  ++ L P E V+A  G M +M+  ++ME ++ 
Sbjct: 12  FQPEVIGPGKSRNADEIDYKIFGEEMQYVEVTLDPSEMVIAEAGGMMYMTSGIKMETVFG 71

Query: 67  PENEVGMWQWLFGKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAM 114
              +    Q  FGK +++                N       V  A+P   RI+P+ L  
Sbjct: 72  DPGQ--KQQGFFGKLMSAGKRMVTGESLFMTTFSNASTQREVVAFASPYPGRIIPMHLDQ 129

Query: 115 FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQ 174
            GGEL+CQ D+FLC    +++     +++   + G EGF+ Q+LTG G+  + AGG++++
Sbjct: 130 LGGELICQKDSFLCGARGIQIGIAFQKKIGVALFGGEGFIMQRLTGDGIVLMHAGGTLME 189

Query: 175 KNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           + L  GE + VD  C+VA+  +VN  I + G I+ ++FGG+ L  A +TGPG +++QSLP
Sbjct: 190 RTLSAGETLRVDTGCLVALQPTVNYDIGFVGGIKNSLFGGEGLFLATLTGPGKIWLQSLP 249

Query: 235 FHRLSQRIARAVTSPNMREN 254
           F RL+ RI  A +  + ++ 
Sbjct: 250 FSRLAGRIHAAGSGTSGKDE 269


>gi|157273311|gb|ABV27210.1| conserved hypothetical protein [Candidatus Chloracidobacterium
           thermophilum]
          Length = 252

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 95/241 (39%), Positives = 136/241 (56%), Gaps = 8/241 (3%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
           PQ  VI ++I+G + Q   I L P E+V+A  G M +M+  +EM+     ++  G +  L
Sbjct: 2   PQSDVIDYRIVGDDLQAVIITLDPNEQVIAEAGVMLYMTAGIEMQTTMTTDSSKGFFGNL 61

Query: 78  ---FGKTITS-----VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
               G+  +          N G +   V  AAP   +I+PIDLA FGGE+LCQ D+FLC+
Sbjct: 62  AKAVGRIFSGAGFFITTFTNYGYARADVAFAAPYPGKIIPIDLAKFGGEVLCQRDSFLCA 121

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
                +S    QR+     G EGF+ Q+L G GLAFI AGG+ +  NL  GE + VD  C
Sbjct: 122 ARGTDLSVGFQQRLGAGFFGGEGFILQRLRGDGLAFIHAGGATMTYNLRPGETLRVDTGC 181

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           +VA   SV   I++ G  + A+FGG+ +  A +TGPG V +Q+LPF RL  RI+ A+  P
Sbjct: 182 VVAFQPSVQFNIQFVGGFKNALFGGEGVFLASLTGPGQVILQTLPFSRLVGRISAALPRP 241

Query: 250 N 250
           +
Sbjct: 242 S 242


>gi|407769274|ref|ZP_11116650.1| hypothetical protein TH3_07322 [Thalassospira xiamenensis M-5 = DSM
           17429]
 gi|407287601|gb|EKF13081.1| hypothetical protein TH3_07322 [Thalassospira xiamenensis M-5 = DSM
           17429]
          Length = 235

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/227 (38%), Positives = 137/227 (60%), Gaps = 4/227 (1%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI ++I G E Q+ +I L P E V A  G+M +MS  + M+         G  + L G+ 
Sbjct: 4   VIDYKIHGEEMQLVEIELDPGEGVRAEAGAMMYMSDGIAMQTGTGGGIFSGFKRMLTGEG 63

Query: 82  --ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
             ITS V  + G   G V  AAP   +++P++L   GGE++CQ DAFLC+ + + +    
Sbjct: 64  FFITSFV--HEGSGKGHVAFAAPYPGKVVPMNLPELGGEIICQKDAFLCAASGIDIDIAF 121

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            +++   + G EGF+ Q+L G GLAF+ AGG++++K+L+ G+ + VD   +VA+T S++ 
Sbjct: 122 SKKLGAGLFGGEGFILQRLRGDGLAFVHAGGTIIRKDLQAGQTLRVDTGALVAMTPSIDY 181

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            IK+ G    A+FGG+ L    + GPG V++Q+LPF RL+ RIA A+
Sbjct: 182 SIKFIGGFTNALFGGEGLFVTTLEGPGTVWLQTLPFARLADRIAAAL 228


>gi|260063289|ref|YP_003196369.1| hypothetical protein RB2501_00726 [Robiginitalea biformata
           HTCC2501]
 gi|88783383|gb|EAR14555.1| hypothetical protein RB2501_00726 [Robiginitalea biformata
           HTCC2501]
          Length = 271

 Score =  162 bits (410), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 130/237 (54%), Gaps = 10/237 (4%)

Query: 20  DAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWL 77
           + V+ ++I G E Q   + L P E V+A  GS   M   + ME I+    + + G+   +
Sbjct: 8   NHVVDYEIFGEEMQYVSVELDPNEGVIAEAGSFMMMDPDIRMETIFGDGSKKDTGVLGKI 67

Query: 78  F--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           F        G+ +      N G     V  A+P   +I+PIDL   GG  +CQ DAFLC+
Sbjct: 68  FSAGKRVLTGEGLFMTAFYNGGQGKKKVSFASPYPGKIVPIDLKEKGGRFICQKDAFLCA 127

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V V     +R+   + G EGF+ QKL G G+AF+ AGG++ ++ L+ GEV+ VD  C
Sbjct: 128 AKGVSVGIEFSRRLGRGLFGGEGFIMQKLEGDGMAFVHAGGTLARRTLQAGEVLRVDTGC 187

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           IV  + +V+  I++ G I+  +FGG+ L  A + GPG V+IQSLPF RL+ R+  A 
Sbjct: 188 IVGFSQTVDYDIEFVGGIKNTLFGGEGLFFATLRGPGTVYIQSLPFSRLAGRVLAAA 244


>gi|298674841|ref|YP_003726591.1| hypothetical protein Metev_0899 [Methanohalobium evestigatum
           Z-7303]
 gi|298287829|gb|ADI73795.1| protein of unknown function DUF124 [Methanohalobium evestigatum
           Z-7303]
          Length = 233

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 89/228 (39%), Positives = 137/228 (60%), Gaps = 4/228 (1%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           +I + I+G + Q+ ++ L P+E V A  G+M +M   ++M+         G+ + L G++
Sbjct: 1   MIDYNIIGDDMQIVEVELDPEESVRAEVGAMAYMGPGIKMQTSTGGGLFSGLKRMLTGES 60

Query: 82  --ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
             ITS V    G   GFV   AP   +I+PIDLA F G +LCQ DAFLC+   + +   L
Sbjct: 61  FFITSFVYE--GKDKGFVAFGAPYPGKIIPIDLAKFDGNILCQKDAFLCAAKGIDIDVAL 118

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            +R+ +   G EGF+ QKL+G G+AF+ AGG++++K L  GE + VD  C+ A T S++ 
Sbjct: 119 TKRLGSGFFGGEGFILQKLSGDGMAFVHAGGTIIKKELADGEHLRVDTGCLAAFTDSIDY 178

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
            I++ G  + A+FGG+ +  A + GPG V++QSLP  RL+ RI  A T
Sbjct: 179 NIQFIGGFKNALFGGEGVFLASLKGPGTVYLQSLPLSRLADRILAAST 226


>gi|320354387|ref|YP_004195726.1| hypothetical protein Despr_2291 [Desulfobulbus propionicus DSM
           2032]
 gi|320122889|gb|ADW18435.1| protein of unknown function DUF124 [Desulfobulbus propionicus DSM
           2032]
          Length = 331

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 85/244 (34%), Positives = 131/244 (53%), Gaps = 12/244 (4%)

Query: 15  YQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW 74
           +   +   I + I G E Q  +I L  QE V++  G+M +M   + ME ++   +     
Sbjct: 65  FSGKEAHEIDYTIFGNEMQFVEIELDSQESVISEAGAMMYMHDQVVMETVFGDGSRNSGS 124

Query: 75  QWLFGKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQ 122
              F K + +             +    G   G V  AAP   +I+P+DL  + G ++CQ
Sbjct: 125 GTFFDKMLGAGKRLITGEGLFITMFTYTGAGKGKVAFAAPYPGKIIPLDLTKYNGRIICQ 184

Query: 123 PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEV 182
            DAFLC+   V V     +++   + G EGF+ Q+L G GL F+ AGG++V+K L  GE 
Sbjct: 185 KDAFLCAAKGVSVGIAFQRKIGVALFGGEGFIMQQLDGDGLVFVHAGGTIVEKELAAGET 244

Query: 183 ITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           + VD  C+VA+T+SVN  I++ G ++ A+FGG+    A + GPG V++QSLPF RL+ RI
Sbjct: 245 LRVDTGCLVALTASVNYDIEFVGNVKSAIFGGEGFFFANLRGPGHVWLQSLPFSRLAGRI 304

Query: 243 ARAV 246
             A 
Sbjct: 305 WEAA 308


>gi|334119417|ref|ZP_08493503.1| protein of unknown function DUF124 [Microcoleus vaginatus FGP-2]
 gi|333458205|gb|EGK86824.1| protein of unknown function DUF124 [Microcoleus vaginatus FGP-2]
          Length = 240

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 85/221 (38%), Positives = 130/221 (58%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           +I ++I G + Q+ +I L P+E V A  G+M +M G ++M+         G  + L G  
Sbjct: 4   IIDYKIYGDDLQLIEIELDPKEGVRAEAGTMTYMEGDIQMQTSTGGGLFQGFKRMLTGAG 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G     V  AAP   +++P+DL   GG+ LCQ D+FLC+ N +++     +
Sbjct: 64  FFITTFVNAGNRKARVAFAAPYPGKVIPLDLGQLGGKFLCQKDSFLCAANGIEIEVAFTK 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           R+     G EGF+ QKL G GLAF+ AGG++V+KNL VGEV+  D  C+VA   +V+  I
Sbjct: 124 RLGAGFFGGEGFILQKLQGDGLAFVHAGGTIVEKNLGVGEVLRADTGCLVAFAPTVDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           ++ G  + A+FGG+ L    +TGPG V++QSLP  +L++RI
Sbjct: 184 QFVGGFKNALFGGEGLFLVKLTGPGKVYLQSLPLSKLAERI 224


>gi|347753828|ref|YP_004861392.1| hypothetical protein [Candidatus Chloracidobacterium thermophilum
           B]
 gi|347586346|gb|AEP10876.1| conserved hypothetical protein TIGR00266 [Candidatus
           Chloracidobacterium thermophilum B]
          Length = 254

 Score =  161 bits (408), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 95/234 (40%), Positives = 132/234 (56%), Gaps = 8/234 (3%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
           PQ  VI ++I+G + Q   I L P E+V+A  G M +M+  +EM+     ++  G +  L
Sbjct: 2   PQSDVIDYRIVGDDLQGVIITLDPNEQVLAEAGVMLYMTAGIEMQTTMATDSSKGFFGNL 61

Query: 78  ---FGKTITS-----VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
               G+  +          N G +   V  AAP   +I+PIDLA FGGE+LCQ D+FLC+
Sbjct: 62  AKAVGRVFSGAGFFITTFTNYGHARADVAFAAPYPGKIIPIDLAKFGGEVLCQKDSFLCA 121

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
                +S    QR+     G EGF+ Q+L G GLAFI AGG+ +  NL  GE + VD  C
Sbjct: 122 ARGTDLSVGFQQRLGAGFFGGEGFILQRLRGDGLAFIHAGGATMTYNLRPGETLRVDTGC 181

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           +VA   SV   I++ G  + A+FGG+ L  A +TGPG V +Q+LPF RL  RIA
Sbjct: 182 VVAFQPSVQFNIQFVGGFKNALFGGEGLFLASLTGPGQVILQTLPFSRLVGRIA 235


>gi|325287403|ref|YP_004263193.1| hypothetical protein Celly_2505 [Cellulophaga lytica DSM 7489]
 gi|324322857|gb|ADY30322.1| protein of unknown function DUF124 [Cellulophaga lytica DSM 7489]
          Length = 265

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 87/234 (37%), Positives = 130/234 (55%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM--------- 73
           I + + G E Q  +I L PQE VVA  GS   M   ++M+ I+   +             
Sbjct: 6   IDYHVYGEEMQYVEIELDPQEAVVAEAGSFMMMDTDIKMDTIFGDGSNQDTSVLGKIFSA 65

Query: 74  -WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++      N G +   V  A+P   +ILPIDL+   G+ +CQ DAFLC+   
Sbjct: 66  GKRMLTGESLFMTAFLNIGANKRKVSFASPYPGKILPIDLSEKNGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++   + G EGF+ QKL G G+AF+ AGG++ +K L  GEV+ VD  CIV 
Sbjct: 126 VSVGIEFSRKLGRGLFGGEGFIMQKLEGDGMAFVHAGGTMAKKVLAAGEVLKVDTGCIVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            + ++N  +++ G I+  VFGG+ L  A + GPG V+IQSLPF RL+ R+  +V
Sbjct: 186 FSQTINYDVEFVGGIKNTVFGGEGLFFATLRGPGTVYIQSLPFSRLAGRVLASV 239


>gi|376262370|ref|YP_005149090.1| hypothetical protein [Clostridium sp. BNL1100]
 gi|373946364|gb|AEY67285.1| TIGR00266 family protein [Clostridium sp. BNL1100]
          Length = 256

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 87/224 (38%), Positives = 130/224 (58%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI ++I G + Q+ +I L P E V A  G+M +M   + M+         G  + L G++
Sbjct: 4   VIDYKIYGDDMQIVEIELDPSEGVRAEAGAMMYMEDGIMMQTSTDGGAFKGFKRMLTGES 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G S   V  +AP   +I+P+DL   GG  LCQ D+FLC+ + V +     +
Sbjct: 64  FFITTFYNAGASKQHVAFSAPYPGKIIPLDLTKLGGNFLCQKDSFLCAASGVDIEIAFTR 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++     G EGF+ Q+L G G+AF+ +GG+++QKNL  GE + VD  C+VA + SV+  I
Sbjct: 124 KLGAGFFGGEGFILQRLIGDGMAFVHSGGTIIQKNLLPGETLRVDTGCLVAFSPSVDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +  G  + A+FGG+ L  A +TGPGIVF+QSLP  RLS R++ A
Sbjct: 184 QSVGGFKNALFGGEGLFLAKLTGPGIVFLQSLPLARLSDRLSSA 227


>gi|359462008|ref|ZP_09250571.1| hypothetical protein ACCM5_24999 [Acaryochloris sp. CCMEE 5410]
          Length = 337

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 136/237 (57%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY-----------IPENEV 71
           I +QI G + Q  ++ L P E  VA  G+M +    ++ME I+             +  V
Sbjct: 74  IDYQIFGEDMQFVEVELDPGESAVAEAGAMMYKDAVVQMETIFGDGSAQASQGGFMDKLV 133

Query: 72  GMWQWLF-GKTITSVVLRNPGP-SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           G  + L  G+++ + V  + GP     V  AAP    I+   L+ +GG L+CQ D+FLC+
Sbjct: 134 GAGKRLVTGESLFTTVFTHGGPQGKAHVAFAAPYPGTIIAAKLSDYGGRLICQKDSFLCA 193

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V++     +++   + G EGF+ QKL G GLAF+ AGG+VVQ++L+ GE + VD  C
Sbjct: 194 AKGVQIGIHFQKKILTGLFGGEGFIMQKLDGTGLAFLHAGGTVVQRDLKAGETLHVDTGC 253

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA+T SV+  I+  G ++  +FGG+ L  A +TGPG V++QSLPF R++ R+  A 
Sbjct: 254 LVALTDSVHYDIQQAGNVKSIIFGGEGLFFATLTGPGHVWLQSLPFSRMAGRMLAAA 310


>gi|365158290|ref|ZP_09354519.1| TIGR00266 family protein [Bacillus smithii 7_3_47FAA]
 gi|363620953|gb|EHL72197.1| TIGR00266 family protein [Bacillus smithii 7_3_47FAA]
          Length = 264

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 135/230 (58%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMW------ 74
           I +++ G + Q  +I L PQE VVA  G M  M   +EME ++    E + G++      
Sbjct: 6   IDYELHGDDMQCVEIELDPQESVVAEAGGMMMMEDGIEMETVFGDGTEQKSGIFGKLVGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++   V  N       V  AAP   +I+P+DL  F G+++CQ DAFLC+   
Sbjct: 66  GRRLLTGESLFMTVFTNQSSDKRRVSFAAPYPGKIIPLDLTQFNGKVICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++     G EGF+ QKL G GLAF+ AGG++++K+L  GE++ VD  C+VA
Sbjct: 126 VSIGIEFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIIKKDLSPGELLRVDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +T  V+  I+Y G ++ A FGG+ L  A + GPG V++Q+LPF RL+ R+
Sbjct: 186 LTQDVDYNIEYVGKVKTAFFGGEGLFFATLRGPGTVWVQTLPFSRLADRV 235


>gi|414872763|tpg|DAA51320.1| TPA: hypothetical protein ZEAMMB73_320289 [Zea mays]
          Length = 149

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 70/114 (61%), Positives = 91/114 (79%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           MAAPFFSTPFQPYVYQS Q +V  FQI GG+ QV Q+MLK QEK+ A+PG+MC+MSG+M+
Sbjct: 1   MAAPFFSTPFQPYVYQSQQGSVTAFQISGGDVQVLQVMLKSQEKLTAKPGAMCYMSGNMQ 60

Query: 61  MENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAM 114
           M+N Y+PEN+ G+WQW+FGK ++S +  N G  DG+VGIAAP   RILP+ + +
Sbjct: 61  MDNNYLPENDGGVWQWIFGKRVSSTIFFNSGSDDGYVGIAAPFPGRILPVSILL 114


>gi|158334252|ref|YP_001515424.1| hypothetical protein AM1_1071 [Acaryochloris marina MBIC11017]
 gi|158304493|gb|ABW26110.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
          Length = 337

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 87/237 (36%), Positives = 136/237 (57%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY-----------IPENEV 71
           I +QI G + Q  ++ L P E  VA  G+M +    ++ME I+             +  V
Sbjct: 74  IDYQIYGEDMQFVEVELDPGESAVAEAGAMMYKDAVVQMETIFGDGSAQASQGGFMDKLV 133

Query: 72  GMWQWLF-GKTITSVVLRNPGP-SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           G  + L  G+++ + V  + GP     V  AAP    I+   L+ +GG L+CQ D+FLC+
Sbjct: 134 GAGKRLVTGESLFTTVFTHGGPQGKAHVAFAAPYPGTIIAAKLSDYGGRLICQKDSFLCA 193

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V++     +++   + G EGF+ QKL G GLAF+ AGG+VVQ++L+ GE + VD  C
Sbjct: 194 AKGVQIGIHFQKKILTGLFGGEGFIMQKLDGTGLAFLHAGGTVVQRDLKAGETLHVDTGC 253

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA+T SV+  I+  G ++  +FGG+ L  A +TGPG V++QSLPF R++ R+  A 
Sbjct: 254 LVALTDSVHYDIQQAGNVKSIIFGGEGLFFATLTGPGHVWLQSLPFSRMAGRMLAAA 310


>gi|150016382|ref|YP_001308636.1| hypothetical protein Cbei_1504 [Clostridium beijerinckii NCIMB
           8052]
 gi|149902847|gb|ABR33680.1| protein of unknown function DUF124 [Clostridium beijerinckii NCIMB
           8052]
          Length = 265

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 92/240 (38%), Positives = 142/240 (59%), Gaps = 16/240 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW------ 76
           I ++I G + Q  +I L P E V+A  G+M  M  +++ME I+   ++     +      
Sbjct: 6   IDYKIKGDDIQYVEIELDPTETVIAEAGAMMMMDPNIQMETIFGDGSKQSGSSFMNKIFS 65

Query: 77  -----LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                L G+++      N G     V  AAP   +I+P+DL++ GG+L+CQ DAFLC+  
Sbjct: 66  AGKRVLTGESLFMTAFTNSGTLKEKVYFAAPYPGKIIPMDLSLLGGKLICQKDAFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
            V V   +D R +  +G  G EGF+ +KL G GLAFI A G++V++NL  GE + +D  C
Sbjct: 126 GVSVG--IDFRKKLSVGFFGGEGFILEKLEGDGLAFIHACGAIVERNLLPGETLKIDTGC 183

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTS 248
           +VA+T  V+  I++ G I+  +FGG+ +  A VTGPG +++QSLPF RL++R+ A A TS
Sbjct: 184 LVAMTKDVHYDIQFIGGIKNTLFGGEGVFFATVTGPGKIWVQSLPFSRLAERVFAAAPTS 243


>gi|359412227|ref|ZP_09204692.1| protein of unknown function DUF124 [Clostridium sp. DL-VIII]
 gi|357171111|gb|EHI99285.1| protein of unknown function DUF124 [Clostridium sp. DL-VIII]
          Length = 265

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 92/237 (38%), Positives = 139/237 (58%), Gaps = 15/237 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW------ 76
           I ++I G + Q  +I L P E V+A  G+M  M  +++ME I+   ++ G   +      
Sbjct: 6   IDYKIYGDDMQYVEIELDPMETVIAEAGAMMMMDPNIQMETIFGDGSKQGNNGFMDKIFS 65

Query: 77  -----LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                L G+++      N G     V  AAP   +ILP+DL+   G+L+CQ DAFLC+  
Sbjct: 66  AGKRVLTGESLFMTAFTNSGALKEKVFFAAPYPGKILPMDLSRLEGKLICQKDAFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
            V V   +D R +  +G  G EGF+ +KL G GLAFI A G++V++NL  GE + +D  C
Sbjct: 126 GVSVG--IDFRKKLSVGFFGGEGFILEKLEGDGLAFIHACGAIVERNLMPGETLKIDTGC 183

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA+T +V+  I++ G I+  +FGG+ +  A VTGPG V+IQSLPF RL++R+  A 
Sbjct: 184 LVAMTRNVHYDIQFIGGIKNTLFGGEGVFFATVTGPGKVWIQSLPFSRLAERVFAAA 240


>gi|336173044|ref|YP_004580182.1| hypothetical protein [Lacinutrix sp. 5H-3-7-4]
 gi|334727616|gb|AEH01754.1| protein of unknown function DUF124 [Lacinutrix sp. 5H-3-7-4]
          Length = 265

 Score =  160 bits (405), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 130/230 (56%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW------ 74
           I ++I G E Q  +I L PQE V+A  GS   M   ++ME I+    + + G        
Sbjct: 6   IDYRIYGEEMQYVEIELDPQEAVIAESGSFMMMDDGIKMETIFGDGSQKDKGFLNKVLGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++      N       V  A+P   +I+PI+LA  GG+ +CQ DAFLC+   
Sbjct: 66  GRRILTGESLFITAFYNTLSGKRNVSFASPYPGKIIPINLAECGGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ +KL G G+AF+ AGG+  +K L+ GE + VD  CI+ 
Sbjct: 126 VSIGIEFSKKLGRGLFGGEGFIMEKLEGDGMAFVHAGGTTAKKELKAGETLRVDTGCIIG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T +VN  I++ G I+ ++FGG+ L  A + GPG V+IQSLPF RL+ R+
Sbjct: 186 FTHNVNYDIEFIGGIKNSIFGGEGLFFAKLEGPGTVYIQSLPFSRLANRV 235


>gi|312109828|ref|YP_003988144.1| hypothetical protein GY4MC1_0716 [Geobacillus sp. Y4.1MC1]
 gi|336234179|ref|YP_004586795.1| hypothetical protein Geoth_0715 [Geobacillus thermoglucosidasius
           C56-YS93]
 gi|423718877|ref|ZP_17693059.1| hypothetical protein GT20_0625 [Geobacillus thermoglucosidans
           TNO-09.020]
 gi|311214929|gb|ADP73533.1| protein of unknown function DUF124 [Geobacillus sp. Y4.1MC1]
 gi|335361034|gb|AEH46714.1| protein of unknown function DUF124 [Geobacillus thermoglucosidasius
           C56-YS93]
 gi|383367780|gb|EID45055.1| hypothetical protein GT20_0625 [Geobacillus thermoglucosidans
           TNO-09.020]
          Length = 260

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 92/242 (38%), Positives = 138/242 (57%), Gaps = 12/242 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VGM 73
           I +++ G + Q  +I L PQE V+A  G M  M   + ME I+   ++         VG 
Sbjct: 6   IDYKLYGDDMQFVEIELDPQESVIAEAGGMMMMEDGITMETIFGDGSDSRKGFFNKLVGA 65

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++   V  N       V  AAP   +I+P+DL+  GG+++CQ D+FLC+   
Sbjct: 66  GKRLLTGESLFMTVFTNNSAEKRRVSFAAPYPGKIIPVDLSELGGKVICQKDSFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAF+ AGG++ Q+ L+ GE + +D  C+VA
Sbjct: 126 VSVGIDFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIYQRELQPGEKLRIDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  V+  I+Y G I+ A F G+ L  A +TGPG V++QSLPF RL+ RI  A  +P+ R
Sbjct: 186 MTKEVDYDIEYVGKIKTAFFSGEGLFFATLTGPGTVWVQSLPFSRLADRIIAA--APSSR 243

Query: 253 EN 254
           E 
Sbjct: 244 EE 245


>gi|326200864|ref|ZP_08190736.1| protein of unknown function DUF124 [Clostridium papyrosolvens DSM
           2782]
 gi|325988432|gb|EGD49256.1| protein of unknown function DUF124 [Clostridium papyrosolvens DSM
           2782]
          Length = 256

 Score =  159 bits (403), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 86/224 (38%), Positives = 130/224 (58%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI ++I G + Q+ +I L P E V A  G+M +M   + M+         G  + L G++
Sbjct: 4   VIDYKIYGDDMQIVEIELDPSEGVRAEAGAMMYMEDGIMMQTSTDGGAFKGFKRMLTGES 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G S   V  +AP   +I+P+DL   GG  LCQ D+FLC+ + V +     +
Sbjct: 64  FFITTFYNAGTSKQHVAFSAPYPGKIVPLDLTKLGGNFLCQKDSFLCAASGVDIEVAFTR 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++     G EGF+ Q+L G G+AF+ +GG+++QKNL  GE + VD  C+VA + SV+  I
Sbjct: 124 KLGAGFFGGEGFILQRLVGDGMAFVHSGGTIIQKNLLPGETLRVDTGCLVAFSPSVDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +  G  + A+FGG+ L  A +TGPGIVF+QSLP  RLS R++ +
Sbjct: 184 QSVGGFKNALFGGEGLFLAKMTGPGIVFLQSLPLARLSDRLSSS 227


>gi|332292924|ref|YP_004431533.1| hypothetical protein Krodi_2287 [Krokinobacter sp. 4H-3-7-5]
 gi|332171010|gb|AEE20265.1| protein of unknown function DUF124 [Krokinobacter sp. 4H-3-7-5]
          Length = 266

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           I ++I G E Q  ++ L PQE V+A  GS   M   + ME I+      + G+   +FG 
Sbjct: 6   IDYEIFGEEMQYVEVELDPQEGVIAEAGSFMMMEDGIRMETIFGDGSAKDTGLMGKIFGA 65

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++      N       V  A+P   +I+PIDL   GG+ +CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTAFYNDVIGKKKVSFASPYPGKIIPIDLDKIGGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++   + G EGF+ Q+L G G+AF+ AGG+ ++K L  GE + VD  CI+ 
Sbjct: 126 VSVGIEFSRKLGRGLFGGEGFIMQRLEGDGIAFVHAGGTTLKKELAPGERLKVDTGCIIG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             S+V   I++ G I+  +FGG+ L  A ++GPG V+IQSLPF RL+ R+
Sbjct: 186 FDSTVTYDIEFVGGIKNTIFGGEGLFFATLSGPGTVYIQSLPFSRLANRV 235


>gi|138896414|ref|YP_001126867.1| hypothetical protein GTNG_2777 [Geobacillus thermodenitrificans
           NG80-2]
 gi|134267927|gb|ABO68122.1| Conserved hypothetical protein [Geobacillus thermodenitrificans
           NG80-2]
          Length = 265

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 90/240 (37%), Positives = 134/240 (55%), Gaps = 10/240 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +Q+ G + Q  +I L P+E V+A  G M  M   + ME ++   +  G          
Sbjct: 7   IDYQLYGDDMQFVEIELDPRESVIAEAGGMMMMEDGIVMETVFGDGSSSGKGLLGRLVGA 66

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++   V  N G     V  AAP   +I+P++L   GG+L+CQ D+FLC+   
Sbjct: 67  GKRLLTGESLFMTVFTNQGTGKRRVAFAAPYPGKIIPVNLQELGGKLICQKDSFLCAAKG 126

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAF+ AGG + ++ L+ GE + +D  C+VA
Sbjct: 127 VSVGIEFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGMIHRRELQPGEKLRIDTGCLVA 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  VN  I+Y G I+ A FGG+ L  A +TGPG+V++QSLPF RL+ R+  A  S   R
Sbjct: 187 MTKEVNYDIEYVGNIKTAFFGGEGLFFATLTGPGVVWVQSLPFSRLADRVIAAAPSSGGR 246


>gi|220930001|ref|YP_002506910.1| hypothetical protein Ccel_2602 [Clostridium cellulolyticum H10]
 gi|220000329|gb|ACL76930.1| protein of unknown function DUF124 [Clostridium cellulolyticum H10]
          Length = 256

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/224 (38%), Positives = 129/224 (57%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI ++I G + Q+ +I L P E V A  G+M +M   + M+         G  + L G++
Sbjct: 4   VIDYKIYGDDMQIVEIELDPSEGVRAEAGAMMYMEDDIMMQTSTDGGAFKGFKRMLTGES 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G S   V  +AP   +I+P+DL+  GG  LCQ D+FLC+ + V +     +
Sbjct: 64  FFITTFYNAGASKQHVAFSAPYPGKIIPLDLSRLGGNFLCQKDSFLCAASGVDIEIAFTR 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++     G EGF+ Q+L G GLAF+ +GG+++QKNL  GE + VD  C+VA + SV+  I
Sbjct: 124 KLGAGFFGGEGFILQRLVGDGLAFVHSGGTIIQKNLLPGETLRVDTGCLVAFSPSVDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +     + A+FGG+ L  A +TGPGIVF+QSLP  RLS R+  A
Sbjct: 184 ESVRGFKNALFGGEGLFLAKLTGPGIVFLQSLPLARLSDRLTSA 227


>gi|255535414|ref|YP_003095785.1| hypothetical protein FIC_01275 [Flavobacteriaceae bacterium
           3519-10]
 gi|255341610|gb|ACU07723.1| hypothetical protein FIC_01275 [Flavobacteriaceae bacterium
           3519-10]
          Length = 260

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/239 (35%), Positives = 132/239 (55%), Gaps = 8/239 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
           I ++I G E Q  +I L P E V++ PGS   M+  ++M+ ++   NE G    L     
Sbjct: 6   IDYRIHGEEMQCVEIELDPNESVISEPGSFMMMTDGIQMQTMFGDGNEKGFMGKLLSAGK 65

Query: 79  ----GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
               G+ +      N       V  AAP   +I+P+DL+   G+++CQ D+FLC+   V 
Sbjct: 66  RVLTGENLFMTAFTNISNQKRQVSFAAPYSGKIIPLDLSELDGKVICQKDSFLCAAKGVA 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           V     +++   + G EGF+ QKL G G+ F+ +GG V++K L+ GE++ +D  CIVA T
Sbjct: 126 VDVEFQRKLGTGLFGGEGFIMQKLEGDGMTFVHSGGHVIEKQLQPGEILKIDTGCIVAFT 185

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
             V+  I++ G I+  +FGG+ L  A + GPG V+IQ+LP  RL+ RI +  T+    E
Sbjct: 186 KDVDYDIQFVGGIKNTIFGGEGLFFAQLRGPGKVWIQTLPIARLAARILQFGTTKKGEE 244


>gi|403071089|ref|ZP_10912421.1| hypothetical protein ONdio_16036 [Oceanobacillus sp. Ndiop]
          Length = 264

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 93/230 (40%), Positives = 136/230 (59%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG- 79
           I F++ G + Q  ++ L PQE V+A  GS+  M   + M+ I+    +N+ G+   L G 
Sbjct: 6   IDFKLYGDDMQFVEVELDPQETVIAEAGSLMMMDDQIRMQTIFGDGTDNQSGLMGKLIGA 65

Query: 80  --KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + IT   L      N G     V  A+P   +I+P+DL+   G+L+CQ DAFL +   
Sbjct: 66  GKRVITGESLFMTTFTNEGSGKKHVSFASPYPGKIIPMDLSEINGKLICQKDAFLAAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G G+AFI AGG++ +K LE GEV+ VD  C+VA
Sbjct: 126 VAVGIEFQKKIGTGFFGGEGFIMQKLEGDGMAFIHAGGTMQKKVLEPGEVLKVDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +T+SV+  I+Y G ++ A+FGG+ L  A + GPG V+IQSLPF RL+ RI
Sbjct: 186 MTASVDYNIEYIGGVKTALFGGEGLFFATLRGPGTVWIQSLPFSRLASRI 235


>gi|310779547|ref|YP_003967880.1| hypothetical protein [Ilyobacter polytropus DSM 2926]
 gi|309748870|gb|ADO83532.1| protein of unknown function DUF124 [Ilyobacter polytropus DSM 2926]
          Length = 258

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 82/231 (35%), Positives = 132/231 (57%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI ++I G + Q+ +I L P E V A  G+M +M   +EM+         G  + + G++
Sbjct: 4   VIDYKIYGHDMQLVEIELDPGEGVRAEAGAMTYMESQIEMQTGAGGGLFGGFKRMITGES 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                 +N G +   VG +AP   +I+P+DL  F GE +CQ D+FLC+ N + +     +
Sbjct: 64  FFITTFKNNGKAKKSVGFSAPYPGQIIPLDLNKFNGEFICQKDSFLCAANGIDIDVAFTK 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++     G EGF+ Q+L G G+AF+ AGGS+++K L+ GE + VD  CIV    SV+  I
Sbjct: 124 KLGAGFFGGEGFILQRLKGDGMAFVHAGGSIIEKELKAGETLRVDTGCIVGFERSVDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           ++ G  + A+FG + L  A + GPG V+IQS+P  +L++R+   + S   R
Sbjct: 184 RFVGGFKNALFGKEGLFLATLEGPGRVYIQSMPISKLAERLYANMGSSRER 234


>gi|163756287|ref|ZP_02163401.1| hypothetical protein KAOT1_01524 [Kordia algicida OT-1]
 gi|161323639|gb|EDP94974.1| hypothetical protein KAOT1_01524 [Kordia algicida OT-1]
          Length = 266

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 131/235 (55%), Gaps = 11/235 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I ++I G E Q  +I L PQE VVA  GS   M   + M  I+      E G+   LF  
Sbjct: 6   IDYEIFGAEMQYVEIELDPQEAVVAEAGSFMMMDDGITMNTIFGDGSNQEKGILGKLFSA 65

Query: 79  GKTI---TSVVLRNPGPSDGF----VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
           GK I    S+ + +      F    V  AAP   +I+PIDL  + G+ +CQ DAFLC+  
Sbjct: 66  GKRILTGESLFMTSFLNDSSFGVRKVSFAAPYPGKIVPIDLTDYRGKFICQKDAFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++   + G EGF+ QKL G G+AF+ AGG++ +K L  GE + VD  CIV
Sbjct: 126 GVTVGIEFSKKLGRGLFGGEGFIMQKLEGDGMAFVNAGGTLARKQLAPGETLRVDTGCIV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             T  V+  I++ G I+  +FGG+ +  A + GPGIV+IQSLPF RL+ RI  ++
Sbjct: 186 GFTQDVDYDIEFVGGIKNTLFGGEGMFFATLKGPGIVYIQSLPFSRLAGRIVASI 240


>gi|421074999|ref|ZP_15536017.1| protein of unknown function DUF124 [Pelosinus fermentans JBW45]
 gi|392526891|gb|EIW49999.1| protein of unknown function DUF124 [Pelosinus fermentans JBW45]
          Length = 347

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 85/247 (34%), Positives = 139/247 (56%), Gaps = 12/247 (4%)

Query: 12  PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY----IP 67
           P   ++     + ++I G E Q  ++ L P E V+A  G+M +   ++EM+ I+      
Sbjct: 66  PTTIRTRTSDEVDYKIFGTEMQFVEVELDPGESVIAEAGAMMYKDSAVEMDTIFGDGSAK 125

Query: 68  ENEVGMW--------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGEL 119
            ++ G++        + L G+++   V  + G +   V   AP    I+P+ LA  GG +
Sbjct: 126 SSQGGLFDKLVGAGKRLLTGESLFMTVFTHRGYNKAKVAFGAPYPGNIIPMHLAQLGGSI 185

Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
           +CQ D+FLC+   V V     +++   + G EGF+ QKL G GL+F+ AGG++V++ L+ 
Sbjct: 186 ICQKDSFLCAAKGVSVGIYFQRKIMTGLFGGEGFIMQKLEGDGLSFMHAGGTIVERQLQS 245

Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           GEV+ VD  CIVA   SV   I+  G I+ ++FGG+ L  AV+ GPG +++QSLPF RL+
Sbjct: 246 GEVLHVDTGCIVAFEPSVQFDIQQAGNIKSSLFGGEGLFFAVLQGPGKIWLQSLPFSRLA 305

Query: 240 QRIARAV 246
            R+  A 
Sbjct: 306 GRMLAAA 312


>gi|149180844|ref|ZP_01859346.1| hypothetical protein BSG1_00395 [Bacillus sp. SG-1]
 gi|148851363|gb|EDL65511.1| hypothetical protein BSG1_00395 [Bacillus sp. SG-1]
          Length = 265

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/231 (38%), Positives = 131/231 (56%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW------ 76
           I +Q+ G + Q  ++ L P E V+A  G+M  M   +EME I+   +  G   +      
Sbjct: 6   IEYQLYGDDMQFVEVELDPSESVIAEAGAMMMMEDGIEMETIFGDGSASGKSGFFGKLVG 65

Query: 77  -----LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                L G+++   V  N G     V  AAP   +I+ +DL+  GG+++CQ D+FLC+  
Sbjct: 66  AGKRVLTGESLFMTVFTNQGMGKKHVSFAAPYPGKIISVDLSEMGGKVICQKDSFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ QKL G GLAF+ AGG++ +K L+ GE + VD  C+V
Sbjct: 126 GVSVGIDFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIHRKELQPGEKLRVDTGCLV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A+T  V+  I+Y G I+ A FGG+ +  A V GPG V+IQSLPF RL+ R+
Sbjct: 186 AMTREVDYNIEYVGKIKSAFFGGEGMFFATVQGPGTVWIQSLPFSRLADRV 236


>gi|410461812|ref|ZP_11315452.1| hypothetical protein BAZO_21083 [Bacillus azotoformans LMG 9581]
 gi|409925200|gb|EKN62423.1| hypothetical protein BAZO_21083 [Bacillus azotoformans LMG 9581]
          Length = 265

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/230 (38%), Positives = 133/230 (57%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG- 79
           I +++ G + Q  +I L P E  +A  G+M      +EME I+     ++ G++  L G 
Sbjct: 6   IEYKLYGDDMQFVEIELDPGESTIAEAGAMMMQEDGIEMETIFGDGASSDKGLFGKLLGA 65

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++   V  N  PS   V  AAP   +I+P+DL+  GG+++CQ DAFLC+   
Sbjct: 66  GKRILTGESLFMTVFTNQSPSIKHVSFAAPYPGKIIPMDLSELGGKVICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +R+     G EGF+ QKL G GLAF+ AGG++ ++ L+ GE + +D  C+VA
Sbjct: 126 VAVGIEFQRRLGAGFFGGEGFIMQKLEGDGLAFLHAGGTIYRRELQPGERLKIDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +T  ++  I+Y G I+ A FGG+ L  A V GPG V++QSLPF RL+ RI
Sbjct: 186 MTRDIDYNIEYVGKIKTAFFGGEGLFFATVQGPGTVWVQSLPFSRLADRI 235


>gi|410657403|ref|YP_006909774.1| DUF124 domain-containing protein [Dehalobacter sp. DCA]
 gi|410660439|ref|YP_006912810.1| DUF124 domain-containing protein [Dehalobacter sp. CF]
 gi|409019758|gb|AFV01789.1| DUF124 domain-containing protein [Dehalobacter sp. DCA]
 gi|409022795|gb|AFV04825.1| DUF124 domain-containing protein [Dehalobacter sp. CF]
          Length = 267

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 132/232 (56%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK-- 80
           I +++ G + Q  ++ L P E V+A  G++ FM   ++ME I+   +  G  + L GK  
Sbjct: 7   IEYKVYGDDMQFVEVELDPAESVIAEAGALMFMEPGIQMETIFGDGSGQGENRGLTGKLF 66

Query: 81  ----------TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
                     ++   +  N G        AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 67  GAGKRLLTGESLFMTLFTNSGAGKAHASFAAPYPGKIIPVDLTAYNGKVICQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +    ++R+     G EGF+ QKL G GLAF+ AGG++V+K L  GE + VD  C+
Sbjct: 127 KGVAIGIEFNKRLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIVEKELRPGETLRVDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           V +T SV   I++ G I+ A FGG+ L  A V+GPG V++QSLPF RL+ RI
Sbjct: 187 VGMTGSVCYDIQFVGGIKTAFFGGEGLFFATVSGPGKVWLQSLPFSRLADRI 238


>gi|337747242|ref|YP_004641404.1| hypothetical protein KNP414_02976 [Paenibacillus mucilaginosus
           KNP414]
 gi|386723772|ref|YP_006190098.1| hypothetical protein B2K_16665 [Paenibacillus mucilaginosus K02]
 gi|336298431|gb|AEI41534.1| protein of unknown function DUF124 [Paenibacillus mucilaginosus
           KNP414]
 gi|384090897|gb|AFH62333.1| hypothetical protein B2K_16665 [Paenibacillus mucilaginosus K02]
          Length = 262

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 89/242 (36%), Positives = 135/242 (55%), Gaps = 10/242 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I + I G E Q  +I L P E V+A  GS+  M  S+ ME I+   ++ G          
Sbjct: 6   IDYAIHGSEMQYVEIELDPGESVIAEAGSLMMMDQSIHMETIFGDGSDQGKGLMGKLFGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++   +  N G        A+P   RILP+DL+M GG+++CQ D+FLC+   
Sbjct: 66  GKRLLTGESLFMTIFTNRGHGKERASFASPYPGRILPMDLSMLGGKIICQKDSFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QK+ G GLAF+ AGG++ ++ L  GE+I VD  C+VA
Sbjct: 126 VSVGIDFQRKLGAGFFGGEGFIMQKIEGDGLAFVHAGGAICERELRPGEMIRVDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  V+  I++   ++ A+FGG+ L  A + GPG V+IQSLPF RL+ R+  A      +
Sbjct: 186 MTQGVDYDIEFVKGVKTALFGGEGLFFATLRGPGRVWIQSLPFSRLADRVLAAAGPGGRK 245

Query: 253 EN 254
           E 
Sbjct: 246 EE 247


>gi|256420192|ref|YP_003120845.1| hypothetical protein Cpin_1146 [Chitinophaga pinensis DSM 2588]
 gi|256035100|gb|ACU58644.1| protein of unknown function DUF124 [Chitinophaga pinensis DSM 2588]
          Length = 261

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 85/244 (34%), Positives = 134/244 (54%), Gaps = 14/244 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I ++I G E Q+ +I L PQE  +A  GS   M   ++M+ ++   ++    Q +FGK +
Sbjct: 7   IDYRIYGEEIQIVEIELDPQETAIAESGSFMMMDQDIQMQTMFGDGSQAN--QGIFGKLV 64

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           ++                N G     V  AAP   +I+P+DL   GG+++CQ DAFLC+ 
Sbjct: 65  SAGKRMLTGESLFMTAFTNMGQGKKKVSFAAPYPGKIIPMDLQRMGGKVICQKDAFLCAA 124

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++   I G EGF+ +KL G G+AF+ AGG V++++L  G+V+ +D  C+
Sbjct: 125 KGVSIGIEWQRKLGTGIFGGEGFIMEKLEGDGMAFVHAGGMVIERDLLPGQVLKIDTGCV 184

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           VA TS V+  +++   IR  VFGG+ L  A + GPG V+IQSLP  R++ R+    T   
Sbjct: 185 VAYTSGVHFDVEFVKGIRNMVFGGEGLFFATLRGPGKVWIQSLPISRMAARLVSYGTGKR 244

Query: 251 MREN 254
             E 
Sbjct: 245 KEEG 248


>gi|182413474|ref|YP_001818540.1| hypothetical protein Oter_1656 [Opitutus terrae PB90-1]
 gi|177840688|gb|ACB74940.1| protein of unknown function DUF124 [Opitutus terrae PB90-1]
          Length = 266

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 92/244 (37%), Positives = 134/244 (54%), Gaps = 12/244 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VGM 73
           + +QI G + Q  +I L P E VVA  G M FM   + ME ++   +          +G 
Sbjct: 7   VDYQIHGDDMQFVEIELDPAEAVVAEAGGMMFMEDGIAMETVFGDGSTRNSGVMGALLGA 66

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++   V +N       V   AP   +ILPI LA  GGEL+ Q D+FLC+   
Sbjct: 67  GKRLLTGESLFMTVFQNQAAGKKRVSFGAPYPGKILPIKLADVGGELIAQKDSFLCAAKG 126

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +R+   + G EGF+ Q+LTG G AF+ AGG++ ++ L  GE++ VD  CIVA
Sbjct: 127 VSIGIAFQRRLGAGLFGGEGFIMQRLTGDGWAFVHAGGTLCERTLAPGELLRVDTGCIVA 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV--TSPN 250
              SV+  I++ G I+ A+FGG+ L  A + GPG +++QSLPF RL+ RI  A   T   
Sbjct: 187 FQPSVDYDIQWVGGIKSALFGGEGLFFATLRGPGKLWLQSLPFSRLAGRIIAAAPQTGRG 246

Query: 251 MREN 254
            RE 
Sbjct: 247 GREE 250


>gi|302875052|ref|YP_003843685.1| hypothetical protein Clocel_2178 [Clostridium cellulovorans 743B]
 gi|307690329|ref|ZP_07632775.1| hypothetical protein Ccel74_19401 [Clostridium cellulovorans 743B]
 gi|302577909|gb|ADL51921.1| protein of unknown function DUF124 [Clostridium cellulovorans 743B]
          Length = 266

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 91/248 (36%), Positives = 146/248 (58%), Gaps = 18/248 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQ---- 75
           I ++I G + Q  +I L P+E VVA  G+M  M  S++ME I+     +N    ++    
Sbjct: 6   IDYKITGDDIQYVEIELDPKETVVAEAGAMMMMDPSIQMETIFGDGSNQNPSSTFKDKLF 65

Query: 76  -----WLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
                 L G+++      N G     V  AAP   +I+P+DL++ GG ++CQ DAFLC+ 
Sbjct: 66  YAGKRVLTGESLFMTAFTNLGNIKEKVYFAAPYPGKIIPMDLSLLGGRIICQKDAFLCAA 125

Query: 131 NDVKVSNTLDQRVRNVIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
             V +   +D R +  +G  G EGF+ +KL G G+AFI A G++++K+L  GE + VD  
Sbjct: 126 KGVSIG--IDFRKKLSVGFFGGEGFILEKLEGDGVAFIHACGAIIKKDLMPGETLRVDTG 183

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           C+VA+T  V+  I++ G ++  +FGG+ +  A +TGPG V+IQSLPF RL++R+  A  +
Sbjct: 184 CLVAMTQEVHYDIQFVGGVKNTLFGGEGVFFATLTGPGTVWIQSLPFSRLAERVFAA--A 241

Query: 249 PNMRENPK 256
           P++  + K
Sbjct: 242 PHLGGHRK 249


>gi|392953242|ref|ZP_10318796.1| hypothetical protein WQQ_28680 [Hydrocarboniphaga effusa AP103]
 gi|391858757|gb|EIT69286.1| hypothetical protein WQQ_28680 [Hydrocarboniphaga effusa AP103]
          Length = 271

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 86/245 (35%), Positives = 134/245 (54%), Gaps = 10/245 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWL-- 77
           +I ++I G + Q  ++ L P E  V   G+M FM   + M+ ++      E G W  L  
Sbjct: 5   IIDYKIYGHDMQFVEVELDPGEAAVGEAGAMMFMEDGIVMDTVFGDGSAQEKGFWGKLAG 64

Query: 78  ------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                  G+++ + V  N G     V  AAP    I+P+ L   GG L+CQ DAFLC+  
Sbjct: 65  AGKRLVTGESLFTTVFSNQGQGKRKVSFAAPYPGTIVPVLLPDVGGTLICQKDAFLCAAK 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ Q+L G GLAF+ AGG++ +++L  GE + VD  CIV
Sbjct: 125 GVSLGIAFQRKLGVGLFGGEGFIMQRLDGDGLAFVHAGGTLYKRDLRPGETLRVDTGCIV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A+T +V+  I+Y G ++ A+F G+ L  AV+ GPG V++QSLP  R++ RI  A  +   
Sbjct: 185 ALTQTVSYDIQYVGKLKSALFSGEGLFYAVLQGPGTVWLQSLPLSRMANRIVMAAPAMGG 244

Query: 252 RENPK 256
           R   +
Sbjct: 245 RSRDE 249


>gi|73668506|ref|YP_304521.1| hypothetical protein Mbar_A0969 [Methanosarcina barkeri str.
           Fusaro]
 gi|72395668|gb|AAZ69941.1| conserved hypothetical protein [Methanosarcina barkeri str. Fusaro]
          Length = 262

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 91/241 (37%), Positives = 137/241 (56%), Gaps = 11/241 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------Q 75
           I ++I+G + Q+ +I L P+E V A  G+M +M   ++M+     +             +
Sbjct: 5   IDYEIIGNDMQIVEIELDPEEAVQAEAGAMAYMGPGIQMQTSMSNDGGGLFGGLKKGLKR 64

Query: 76  WLFGKT--ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            L G++  ITS V +  G   G V  AAP   +I+P+DL  FGG LLCQ D+FLC+   V
Sbjct: 65  ALTGESFFITSFVQKGSGK--GHVAFAAPYPGKIIPLDLTKFGGGLLCQKDSFLCAARGV 122

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           ++     +++     G EGF+ Q+L G GLAF+  GG+V++K+L VGE   VD  C+ A 
Sbjct: 123 EIEVAFTRKLGVGFFGGEGFILQRLKGDGLAFLHIGGTVIRKDLAVGETYRVDTGCVAAF 182

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
           T +V   I ++   + A+FGG+ +V A +TGPG V++QSLPF RL+ RI  A    N  E
Sbjct: 183 TETVTYDITWSRNFKNALFGGEGVVLATLTGPGTVYMQSLPFSRLADRIFAASAFGNREE 242

Query: 254 N 254
            
Sbjct: 243 Q 243


>gi|21227408|ref|NP_633330.1| HTH DNA-binding protein [Methanosarcina mazei Go1]
 gi|452209891|ref|YP_007490005.1| Hypothetical protein MmTuc01_1354 [Methanosarcina mazei Tuc01]
 gi|20905772|gb|AAM31002.1| HTH DNA-binding protein [Methanosarcina mazei Go1]
 gi|452099793|gb|AGF96733.1| Hypothetical protein MmTuc01_1354 [Methanosarcina mazei Tuc01]
          Length = 250

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 86/229 (37%), Positives = 125/229 (54%), Gaps = 7/229 (3%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------QWLFGKTITSV 85
           Q+ +I L P E V A  G+M +M   + M+     E             + L G++    
Sbjct: 2   QIVEIELDPGEAVQAEAGAMAYMGPGILMQTGMGNEGGGLFGGLKKGLKRALTGESFFIT 61

Query: 86  VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
              + G   G V  AAP   +ILP+DL+ FGG +LCQ DAFLC+   ++V     +++  
Sbjct: 62  SFIHKGSGKGHVAFAAPYPGKILPLDLSKFGGSILCQKDAFLCAAKGIEVELAFTRKLGA 121

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
            + G EGF+ Q+L G GLAFI  GG+V++K+L  GE   VD  C+ A T +V   I ++ 
Sbjct: 122 GLFGGEGFILQRLRGDGLAFIHIGGTVIRKDLAPGETYKVDTGCVAAFTENVTYDITWSR 181

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
             + A+FGG+ +V A +TGPG V++QSLPF RL+ RI  A      RE 
Sbjct: 182 DFKNALFGGEGVVLATLTGPGTVYMQSLPFSRLADRIFAASAYGQNREE 230


>gi|452995794|emb|CCQ92476.1| conserved hypothetical protein [Clostridium ultunense Esp]
          Length = 261

 Score =  156 bits (395), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 132/229 (57%), Gaps = 9/229 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I G E Q  ++ L P E V+A  G++  M   ++ME I+      G +        
Sbjct: 6   IDYRIYGEEMQFVEVELDPGESVIAEAGALMMMDDEIQMETIFGDGTHQGGFMSKLFAAG 65

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++   V  N G     V  A+P   RI+P+DL+   G+++CQ DAFLC+   V
Sbjct: 66  KRVLTGESLFMTVFTNEGRGKKRVSFASPYPGRIIPLDLSELDGKVICQKDAFLCAAKGV 125

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
            V     +++     G EGF+ QKL G GLAF+ AGG++V++ L  GE + VD  C+VA+
Sbjct: 126 SVGIDFQRKLGTGFFGGEGFIMQKLEGDGLAFVHAGGTIVERTLAPGERLKVDTGCLVAL 185

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           T  V+  I++ G I+  +FGG+ L  A +TGPG V+IQSLPF RL+ RI
Sbjct: 186 TQDVDYDIQFVGGIKNTLFGGEGLFFATLTGPGKVWIQSLPFARLADRI 234


>gi|118581514|ref|YP_902764.1| hypothetical protein Ppro_3113 [Pelobacter propionicus DSM 2379]
 gi|118504224|gb|ABL00707.1| protein of unknown function DUF124 [Pelobacter propionicus DSM
           2379]
          Length = 327

 Score =  156 bits (395), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 91/247 (36%), Positives = 135/247 (54%), Gaps = 12/247 (4%)

Query: 9   PFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPE 68
           P  P   ++  D  I ++I+G E Q  +I L P E V+A  G+M +   S+ ME I+   
Sbjct: 58  PAPPSGRRASDD--IDYRIVGSEMQFVEIELDPGESVIAEAGAMMYKDASITMEAIFGDG 115

Query: 69  NEVGMW----------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
           +  G            + L G+++   +    G        AAP    I+P  LA  GG 
Sbjct: 116 SSQGGGFTDKLFSAGKRILTGESLFMTLFTQSGQGKARAAFAAPYPGNIIPATLASLGGT 175

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           L+CQ D FLC+   V V     +R+   + G EGF+ QKL G GLAF+ AGG++V++ L 
Sbjct: 176 LICQKDCFLCAARGVSVGIFFQRRIMTGLFGGEGFIMQKLEGDGLAFLHAGGTIVERVLS 235

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
            GE + VD  C+VA+  +V+  I+  G I+ A+FGG+ L  A +TGPG V++QSLPF RL
Sbjct: 236 AGETLHVDTGCLVAMEPTVSFDIQQAGNIKTALFGGEGLFFARLTGPGRVWLQSLPFSRL 295

Query: 239 SQRIARA 245
           + R+ +A
Sbjct: 296 AGRMLQA 302


>gi|408370142|ref|ZP_11167921.1| hypothetical protein I215_04540 [Galbibacter sp. ck-I2-15]
 gi|407744617|gb|EKF56185.1| hypothetical protein I215_04540 [Galbibacter sp. ck-I2-15]
          Length = 266

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 129/234 (55%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I +QI G E Q  +I L PQE VVA  GS   M   + M+ I+      + G+   LF  
Sbjct: 6   IDYQIYGEEMQFVEIELDPQEAVVAEAGSFMMMEDGLNMDTIFGDGSNQDRGVLGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N       V  AAP   +I+PIDL  + G+ +CQ DAFLC+   
Sbjct: 66  GKRMLTGESLFMTAYLNTTGLKKKVSFAAPYPGKIVPIDLRKYQGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +R+     G EGF+ QKL G G+AF+ AGG++ +K L  GE++ VD  CIV 
Sbjct: 126 VSVGIEFSKRLGRGFFGGEGFIMQKLQGDGMAFVHAGGTLARKELAEGELLKVDTGCIVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            T  ++  I++ G I+  VFGG+ L  A + GPG+V+IQSLPF RL+ R+  A+
Sbjct: 186 FTKDIDYDIQFVGGIKNTVFGGEGLFYATLRGPGVVYIQSLPFSRLASRVFAAI 239


>gi|260221740|emb|CBA30605.1| hypothetical protein Csp_C24390 [Curvibacter putative symbiont of
           Hydra magnipapillata]
          Length = 293

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 131/230 (56%), Gaps = 9/230 (3%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------- 74
           V+ ++I G E Q  ++ L P E  +   G M +M   + M+ I+   ++ G +       
Sbjct: 30  VVDYEIFGSEMQYVEVELDPGEAAIGEAGVMMYMQEGITMDTIFGDGSQQGGFFGKLMGA 89

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+ + + V  N G     V  AAP   +I+P+DL+  GG ++CQ D+FLC+   
Sbjct: 90  GKRLLTGEGLFTTVFENSGSGKQRVAFAAPYPGKIVPMDLSKLGGTIICQKDSFLCAAKG 149

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G+AF+ AGG+++++ L  GE + VD  C+VA
Sbjct: 150 VALGIAFQKKLGVGLFGGEGFIMQKLDGDGMAFVHAGGTLMERTLAPGETLRVDTGCVVA 209

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
              +V+  I++ G ++ A+FGG+ L  A + GPG V++QSLP  RL+ RI
Sbjct: 210 FQPTVDFDIQFVGKVKSAIFGGEGLFFATLRGPGKVWLQSLPLSRLANRI 259


>gi|95931485|ref|ZP_01314184.1| protein of unknown function DUF124 [Desulfuromonas acetoxidans DSM
           684]
 gi|95132462|gb|EAT14162.1| protein of unknown function DUF124 [Desulfuromonas acetoxidans DSM
           684]
          Length = 333

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 86/234 (36%), Positives = 131/234 (55%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI---PENEVGMWQWL-- 77
           I F I G E Q  +I L P E  VA  G+M + + ++ ME ++    P+    M + L  
Sbjct: 75  IDFTIHGTEMQFVEIELDPGESAVAEAGAMMYKASTISMETVFGDGGPQTGGFMGKLLGA 134

Query: 78  -----FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++ + V  + G     V   AP    I+P+ L   GG L+CQ DAFLC+   
Sbjct: 135 GKRLVTGESLFTTVFTHQGHGKAHVAFGAPYPGNIIPVALDAMGGSLICQKDAFLCAARG 194

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +   L +R+   + G EGF+ QKL G G+AF+ AGGS+V++ L+ GE + VD  C+VA
Sbjct: 195 VSIGLHLQKRILTGLFGGEGFIMQKLEGDGMAFLHAGGSIVERELKPGEELHVDTGCVVA 254

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
               V+  I+  G I+ ++FGG+ L  A + GPG +++QSLPF RL+ R+ +A 
Sbjct: 255 YEPKVSFDIQQAGGIKTSLFGGEGLFFAKLQGPGKIWLQSLPFSRLAGRMLQAA 308


>gi|169826442|ref|YP_001696600.1| hypothetical protein Bsph_0854 [Lysinibacillus sphaericus C3-41]
 gi|168990930|gb|ACA38470.1| conserved hypothetical protein [Lysinibacillus sphaericus C3-41]
          Length = 266

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 86/235 (36%), Positives = 134/235 (57%), Gaps = 12/235 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  Q+ L PQE VVA  G++  M  S+ ME I+   ++ G    L GK +
Sbjct: 6   IDYKLFGDDMQYVQVELDPQETVVAEAGALMMMDDSIHMETIFGDGSKGGSQSGLMGKLL 65

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +                N G     V  A+P   +I+P+DL+   G+++CQ DAFL + 
Sbjct: 66  GAGKRLVTGESLFMTTFTNVGSGKKHVYFASPYPGKIIPMDLSQLNGKIICQKDAFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V     +++     G EGF+ QKL G G+AF+ AGG++ +K L+ GEV+ VD  C+
Sbjct: 126 KGVSVGVEFQKKIGVGFFGGEGFIMQKLEGDGMAFVHAGGAIHEKTLQPGEVLRVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           VA+TS+V+  I+  G ++ A+FGG+ +  A + GPG V++QSLPF RL+ R+  A
Sbjct: 186 VAMTSNVDYDIELVGGVKTALFGGEGIFFATLRGPGTVWVQSLPFSRLASRVFAA 240


>gi|189423881|ref|YP_001951058.1| hypothetical protein Glov_0812 [Geobacter lovleyi SZ]
 gi|189420140|gb|ACD94538.1| protein of unknown function DUF124 [Geobacter lovleyi SZ]
          Length = 267

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 82/241 (34%), Positives = 133/241 (55%), Gaps = 10/241 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VG 72
           VI ++I G E Q  ++ L P E  +   G M +M   ++M+ ++   ++         +G
Sbjct: 5   VIDYEIFGSEMQFVEVELDPGEAAIGEAGVMMYMQDGIQMDTVFGDGSQQTSGFMGKLLG 64

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+++ + V  N       V  AAP   +I+P+ L   GG L+CQ D+FLC+  
Sbjct: 65  AGKRLLTGESLFTTVFHNESSGKRRVAFAAPYPGKIVPVQLTDIGGTLICQKDSFLCAAK 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ QKL G G+AFI AGG++ ++ L  GE + VD  C+V
Sbjct: 125 GVSLGIAFQKKIGTGLFGGEGFIMQKLDGDGMAFIHAGGTLHERTLAPGETLRVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A   SV+  I++ G I+ AVFGG+ L  A ++GPG +++QSLP  RL+ RI  +  +   
Sbjct: 185 AFQPSVDFDIQFVGKIKSAVFGGEGLFFATLSGPGKIWLQSLPLSRLANRIVMSAPATGG 244

Query: 252 R 252
           R
Sbjct: 245 R 245


>gi|126651021|ref|ZP_01723232.1| hypothetical protein BB14905_20240 [Bacillus sp. B14905]
 gi|126592222|gb|EAZ86271.1| hypothetical protein BB14905_20240 [Bacillus sp. B14905]
          Length = 266

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 86/235 (36%), Positives = 134/235 (57%), Gaps = 12/235 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  Q+ L PQE VVA  G++  M  S+ ME I+   ++ G    L GK +
Sbjct: 6   IDYKLFGDDMQYVQVELDPQETVVAEAGALMMMDDSIHMETIFGDGSKGGSQSGLMGKLL 65

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +                N G     V  A+P   +I+P+DL+   G+++CQ DAFL + 
Sbjct: 66  GAGKRLVTGESLFMTTFTNVGSGKKHVYFASPYPGKIIPMDLSQLNGKIICQKDAFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V     +++     G EGF+ QKL G G+AF+ AGG++ +K L+ GEV+ VD  C+
Sbjct: 126 KGVSVGVEFQKKIGVGFFGGEGFIMQKLEGDGMAFVHAGGAIHEKTLQPGEVLRVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           VA+TS+V+  I+  G ++ A+FGG+ +  A + GPG V++QSLPF RL+ R+  A
Sbjct: 186 VAMTSNVDYDIELVGGVKTALFGGEGIFFATLRGPGTVWVQSLPFSRLASRVFAA 240


>gi|383451724|ref|YP_005358445.1| hypothetical protein KQS_12415 [Flavobacterium indicum GPTSA100-9]
 gi|380503346|emb|CCG54388.1| Protein of unknown function [Flavobacterium indicum GPTSA100-9]
          Length = 264

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 126/232 (54%), Gaps = 14/232 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I + I G E Q  +I L PQE V+A  G    M   ++ME I+   ++    Q   GK +
Sbjct: 6   IDYHIYGEEMQYVEIELDPQEVVIAEAGGFMMMEDGIKMETIFGDGSQ--QQQGFMGKLL 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           ++               +N G     V  A+P   +I+ IDL  + G+ +CQ DAFLC+ 
Sbjct: 64  SAGKRVLTGESLFMTAFQNQGLGKKKVSFASPYPGKIIAIDLREYNGKFICQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V     +++   + G EGF+ QK+ G G+AF+ +GG++ +K L  GEV+ +D  C+
Sbjct: 124 KGVSVGIEFSKKIGRALFGGEGFIMQKIEGDGMAFVHSGGTLAKKVLAPGEVMRIDTGCV 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           V  T  V+  I++ G I+  +FGG+ +  A + GPG V+IQSLPF RL+ RI
Sbjct: 184 VGYTQDVDFDIEFVGGIKNTIFGGEGMFYATLRGPGTVYIQSLPFSRLAGRI 235


>gi|389701311|ref|ZP_10185260.1| TIGR00266 family protein [Leptothrix ochracea L12]
 gi|388591131|gb|EIM31397.1| TIGR00266 family protein [Leptothrix ochracea L12]
          Length = 271

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 96/252 (38%), Positives = 139/252 (55%), Gaps = 11/252 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW----- 74
           VI + I G E Q  +I L P E  V   GSM FM   + M+ ++      + G +     
Sbjct: 5   VIDYDIRGSEMQFVEIELDPGEAAVGEAGSMMFMDTGITMDAVFGDGSAQQGGFFGKLLG 64

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++ + V  N   +   V  AAP   +I+P+DL   GG L+CQ D+FLC+  
Sbjct: 65  AGKRLLTGESLFTTVYTNQAGAKQRVAFAAPFPGKIMPMDLRQLGGMLICQKDSFLCAAR 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     Q++     G EGF+ QKL G GLAFI AGG+VV++ L+  + + VD  C+V
Sbjct: 125 GVSIGIAFQQKMGVGFFGGEGFIMQKLEGDGLAFIHAGGTVVRRELQAEQSLLVDTGCLV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSPN 250
           A T SVN +I+Y G I+ A+FGG+ L  A ++GPG V++QSLPF RL+ R+ A A     
Sbjct: 185 AHTPSVNFEIQYVGKIKTALFGGEGLFLARLSGPGQVWLQSLPFSRLASRVFAAAPQRGG 244

Query: 251 MRENPKFFVQIA 262
            RE       +A
Sbjct: 245 SREEGSLLGGLA 256


>gi|403237899|ref|ZP_10916485.1| hypothetical protein B1040_19211 [Bacillus sp. 10403023]
          Length = 264

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 86/231 (37%), Positives = 129/231 (55%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I + + G + Q  +I L P E V+A  G+M  M   + ME I+   +             
Sbjct: 6   IEYNLYGDDMQFVEIELDPNESVIAEAGAMMMMEDGISMETIFGDGSGTSGGGLLGKLFG 65

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++   V  N       V  AAP   +I+PIDL++ GG+++CQ D+FLC+  
Sbjct: 66  AGKRVLTGESLFMTVFTNSQMGKKHVSFAAPYPGKIIPIDLSLLGGKVVCQKDSFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ QKL G GLAF+ AGG++ +K L+ GE++ +D  C+V
Sbjct: 126 GVSVGIDFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGTIHKKELQPGEILRIDTGCLV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            +T  V+  I+Y G I+ A FGG+ L  A V GPG V++QSLPF RL+ RI
Sbjct: 186 GMTKEVDYNIEYVGKIKTAFFGGEGLFYATVRGPGTVWVQSLPFSRLADRI 236


>gi|325289821|ref|YP_004266002.1| hypothetical protein Sgly_1704 [Syntrophobotulus glycolicus DSM
           8271]
 gi|324965222|gb|ADY56001.1| protein of unknown function DUF124 [Syntrophobotulus glycolicus DSM
           8271]
          Length = 267

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 131/232 (56%), Gaps = 13/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + +QI G + Q  ++ L P E VVA  G++ +M   + ME I+  +        L GK +
Sbjct: 7   VDYQIFGDDMQFVEVELDPSESVVAEAGALMYMEPGIRMETIF-GDGSSNQGSGLMGKLL 65

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G   G V  AAP   +I+P+DL  + G+L+CQ DAFLC+ 
Sbjct: 66  GAGKRVLTGESLFMTVFTNSGSGKGHVSFAAPFPGKIIPVDLTNYNGKLICQKDAFLCAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ Q+L G G++F+ AGG++V+K L+ GE + VD  C+
Sbjct: 126 KGVSIGIEFTKKLGAGFFGGEGFIMQRLEGDGMSFLHAGGTIVEKVLQPGETLRVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  V   I++ G I+ A FGG+ L  A V+GPG V++QSLPF RL+ RI
Sbjct: 186 VAMTGQVAYDIQFVGGIKTAFFGGEGLFFATVSGPGRVWLQSLPFSRLADRI 237


>gi|229135710|ref|ZP_04264486.1| Protein containing DUF124 [Bacillus cereus BDRD-ST196]
 gi|228647747|gb|EEL03806.1| Protein containing DUF124 [Bacillus cereus BDRD-ST196]
          Length = 260

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDIQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|404369660|ref|ZP_10974993.1| TIGR00266 family protein [Clostridium sp. 7_2_43FAA]
 gi|226914338|gb|EEH99539.1| TIGR00266 family protein [Clostridium sp. 7_2_43FAA]
          Length = 262

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/235 (38%), Positives = 131/235 (55%), Gaps = 11/235 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL----- 77
           + ++ILG E Q  +I L P+E VVA  G+M  M  S+ ME I    ++     +L     
Sbjct: 5   VDYEILGNEMQYVEIELDPRESVVAEAGAMMMMDNSITMETILGDGSQKSQGGFLGKLGG 64

Query: 78  ------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                  G+++      N G     V  AAP   +I+ +DL   GG L+CQ D+FLC   
Sbjct: 65  AAKRVITGESLFMTEFTNSGVLKQKVAFAAPYPGKIIAMDLRKLGGRLICQKDSFLCCAK 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            + V     +++     G EGF+ QKL G GLAFI AGG++V+K L  G+ I +D  C+V
Sbjct: 125 GISVGIAFTKKLGTGFFGGEGFILQKLEGDGLAFIHAGGTIVKKTLMPGDTIKIDTGCLV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           A+T +VN  I++ G  + AVFGG+ L  A +TGPG V+IQSLPF +L+ R+  A 
Sbjct: 185 AMTQNVNYDIQFVGGFKNAVFGGEGLFFATLTGPGDVWIQSLPFSKLASRVYSAA 239


>gi|228993600|ref|ZP_04153508.1| Protein containing DUF124 [Bacillus pseudomycoides DSM 12442]
 gi|228999636|ref|ZP_04159213.1| Protein containing DUF124 [Bacillus mycoides Rock3-17]
 gi|228760162|gb|EEM09131.1| Protein containing DUF124 [Bacillus mycoides Rock3-17]
 gi|228766191|gb|EEM14837.1| Protein containing DUF124 [Bacillus pseudomycoides DSM 12442]
          Length = 260

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 89/239 (37%), Positives = 135/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGAGKSSGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++ +   G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTRDVNYDIQFVGNVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|299536964|ref|ZP_07050269.1| hypothetical protein BFZC1_13108 [Lysinibacillus fusiformis ZC1]
 gi|424738144|ref|ZP_18166587.1| hypothetical protein C518_2699 [Lysinibacillus fusiformis ZB2]
 gi|298727544|gb|EFI68114.1| hypothetical protein BFZC1_13108 [Lysinibacillus fusiformis ZC1]
 gi|422947960|gb|EKU42348.1| hypothetical protein C518_2699 [Lysinibacillus fusiformis ZB2]
          Length = 266

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 86/235 (36%), Positives = 133/235 (56%), Gaps = 12/235 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  Q+ L PQE VVA  G++  M  S+ ME I+   +  G    L GK +
Sbjct: 6   IDYKLFGDDMQYVQVELDPQETVVAEAGALMMMDDSIHMETIFGDGSRGGSQSGLMGKLL 65

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +                N G     V  A+P   +I+P+DL+   G+++CQ DAFL + 
Sbjct: 66  GAGKRLITGESLFMTTFTNVGSGKKHVYFASPYPGKIIPMDLSQLNGKIICQKDAFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V     +++     G EGF+ QKL G G+AF+ AGG++ +K L+ GEV+ VD  C+
Sbjct: 126 KGVSVGVEFQKKIGVGFFGGEGFIMQKLEGDGMAFVHAGGAIHEKTLQPGEVLRVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           VA+TS+V+  I+  G ++ A+FGG+ +  A + GPG V++QSLPF RL+ R+  A
Sbjct: 186 VAMTSNVDYDIEMVGGVKTALFGGEGIFFATLRGPGTVWVQSLPFSRLASRVFAA 240


>gi|229175580|ref|ZP_04303090.1| Protein containing DUF124 [Bacillus cereus MM3]
 gi|423456874|ref|ZP_17433671.1| TIGR00266 family protein [Bacillus cereus BAG5X2-1]
 gi|228607976|gb|EEK65288.1| Protein containing DUF124 [Bacillus cereus MM3]
 gi|401149314|gb|EJQ56788.1| TIGR00266 family protein [Bacillus cereus BAG5X2-1]
          Length = 260

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTQYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|423400263|ref|ZP_17377436.1| TIGR00266 family protein [Bacillus cereus BAG2X1-2]
 gi|423479034|ref|ZP_17455749.1| TIGR00266 family protein [Bacillus cereus BAG6X1-1]
 gi|401655620|gb|EJS73149.1| TIGR00266 family protein [Bacillus cereus BAG2X1-2]
 gi|402426344|gb|EJV58472.1| TIGR00266 family protein [Bacillus cereus BAG6X1-1]
          Length = 260

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTQYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDIQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|423512976|ref|ZP_17489507.1| TIGR00266 family protein [Bacillus cereus HuA2-1]
 gi|402447269|gb|EJV79126.1| TIGR00266 family protein [Bacillus cereus HuA2-1]
          Length = 260

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 90/248 (36%), Positives = 137/248 (55%), Gaps = 18/248 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSNGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSPNMREN 254
           TSP  + +
Sbjct: 236 TSPTAQNS 243


>gi|196250042|ref|ZP_03148737.1| protein of unknown function DUF124 [Geobacillus sp. G11MC16]
 gi|196210556|gb|EDY05320.1| protein of unknown function DUF124 [Geobacillus sp. G11MC16]
          Length = 264

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 89/240 (37%), Positives = 133/240 (55%), Gaps = 10/240 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +Q+ G + Q  +I L P+E V+A  G M  M   + ME ++   +  G          
Sbjct: 6   IDYQLYGDDMQFVEIELDPRESVIAEAGGMMMMEDGIVMETVFGDCSSSGKGLLGRLVGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++   V  N G     V  AAP   +I+P++L   GG+L+CQ D+FLC+   
Sbjct: 66  GKRLLTGESLFMTVFTNQGTGKRRVAFAAPYPGKIIPVNLQELGGKLICQKDSFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAF+ AGG + ++ L+ GE + +D  C+VA
Sbjct: 126 VSVGIEFQRKLGTGFFGGEGFIMQKLEGDGLAFLHAGGMIHRRELQPGEKLRIDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  VN  I+Y G I+ A FGG+ L  A +T PG+V++QSLPF RL+ R+  A  S   R
Sbjct: 186 MTKEVNYDIEYVGNIKTAFFGGEGLFFATLTEPGVVWVQSLPFSRLADRVIAAAPSSGGR 245


>gi|229007195|ref|ZP_04164801.1| Protein containing DUF124 [Bacillus mycoides Rock1-4]
 gi|228754054|gb|EEM03486.1| Protein containing DUF124 [Bacillus mycoides Rock1-4]
          Length = 260

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 89/239 (37%), Positives = 135/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGAGKSSGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++ +   G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTRDVNYDIQFVGNVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|386821223|ref|ZP_10108439.1| TIGR00266 family protein [Joostella marina DSM 19592]
 gi|386426329|gb|EIJ40159.1| TIGR00266 family protein [Joostella marina DSM 19592]
          Length = 266

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 88/230 (38%), Positives = 128/230 (55%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLF-- 78
           I +QI G E Q  +I L PQE VVA  GS   M   ++M+ I+      + G+   LF  
Sbjct: 6   IDYQIYGEEMQYVEIELDPQEAVVAEAGSFMMMDSGLQMDTIFGDGSNQDRGVLGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N       V  A+P   +I+PIDL  + G+ +CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTAFLNTDLGKRKVSFASPYPGKIIPIDLTKYRGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G G+AF+ AGG++ ++ L  GEV+ VD  CIV 
Sbjct: 126 VSVGIEFSRKLGRGFFGGEGFIMQKLEGDGMAFVHAGGTLARRELAAGEVLKVDTGCIVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I++ G I+  VFGG+ L  A + GPG+V+IQSLPF RL+ R+
Sbjct: 186 FTKDIDYDIEFIGGIKNTVFGGEGLFFASLRGPGVVYIQSLPFSRLASRV 235


>gi|423471068|ref|ZP_17447812.1| TIGR00266 family protein [Bacillus cereus BAG6O-2]
 gi|402433529|gb|EJV65580.1| TIGR00266 family protein [Bacillus cereus BAG6O-2]
          Length = 260

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDASGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|423670445|ref|ZP_17645474.1| TIGR00266 family protein [Bacillus cereus VDM034]
 gi|423673349|ref|ZP_17648288.1| TIGR00266 family protein [Bacillus cereus VDM062]
 gi|401296539|gb|EJS02157.1| TIGR00266 family protein [Bacillus cereus VDM034]
 gi|401310766|gb|EJS16077.1| TIGR00266 family protein [Bacillus cereus VDM062]
          Length = 260

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|163942599|ref|YP_001647483.1| hypothetical protein BcerKBAB4_4702 [Bacillus weihenstephanensis
           KBAB4]
 gi|229169606|ref|ZP_04297309.1| Protein containing DUF124 [Bacillus cereus AH621]
 gi|423368885|ref|ZP_17346317.1| TIGR00266 family protein [Bacillus cereus VD142]
 gi|423490040|ref|ZP_17466722.1| TIGR00266 family protein [Bacillus cereus BtB2-4]
 gi|423495764|ref|ZP_17472408.1| TIGR00266 family protein [Bacillus cereus CER057]
 gi|423497442|ref|ZP_17474059.1| TIGR00266 family protein [Bacillus cereus CER074]
 gi|423519565|ref|ZP_17496046.1| TIGR00266 family protein [Bacillus cereus HuA2-4]
 gi|423591146|ref|ZP_17567177.1| TIGR00266 family protein [Bacillus cereus VD048]
 gi|423597830|ref|ZP_17573830.1| TIGR00266 family protein [Bacillus cereus VD078]
 gi|423660277|ref|ZP_17635446.1| TIGR00266 family protein [Bacillus cereus VDM022]
 gi|163864796|gb|ABY45855.1| protein of unknown function DUF124 [Bacillus weihenstephanensis
           KBAB4]
 gi|228613881|gb|EEK71003.1| Protein containing DUF124 [Bacillus cereus AH621]
 gi|401079142|gb|EJP87444.1| TIGR00266 family protein [Bacillus cereus VD142]
 gi|401150093|gb|EJQ57558.1| TIGR00266 family protein [Bacillus cereus CER057]
 gi|401158584|gb|EJQ65975.1| TIGR00266 family protein [Bacillus cereus HuA2-4]
 gi|401162373|gb|EJQ69729.1| TIGR00266 family protein [Bacillus cereus CER074]
 gi|401233293|gb|EJR39786.1| TIGR00266 family protein [Bacillus cereus VD048]
 gi|401238550|gb|EJR44989.1| TIGR00266 family protein [Bacillus cereus VD078]
 gi|401302947|gb|EJS08514.1| TIGR00266 family protein [Bacillus cereus VDM022]
 gi|402430359|gb|EJV62437.1| TIGR00266 family protein [Bacillus cereus BtB2-4]
          Length = 260

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSNGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|229014071|ref|ZP_04171194.1| Protein containing DUF124 [Bacillus mycoides DSM 2048]
 gi|228747217|gb|EEL97097.1| Protein containing DUF124 [Bacillus mycoides DSM 2048]
          Length = 260

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVVAEAGAMMMMEDHIEMETIF--GDGSGPSNGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTDESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|423394880|ref|ZP_17372081.1| TIGR00266 family protein [Bacillus cereus BAG2X1-1]
 gi|423405740|ref|ZP_17382889.1| TIGR00266 family protein [Bacillus cereus BAG2X1-3]
 gi|401656014|gb|EJS73538.1| TIGR00266 family protein [Bacillus cereus BAG2X1-1]
 gi|401660787|gb|EJS78261.1| TIGR00266 family protein [Bacillus cereus BAG2X1-3]
          Length = 260

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 89/239 (37%), Positives = 135/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSGGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++ +   G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDVNYDIQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|75760312|ref|ZP_00740361.1| Hypothetical protein RBTH_05472 [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|218900032|ref|YP_002448443.1| hypothetical protein BCG9842_B0247 [Bacillus cereus G9842]
 gi|228903379|ref|ZP_04067510.1| Protein containing DUF124 [Bacillus thuringiensis IBL 4222]
 gi|423358034|ref|ZP_17335537.1| TIGR00266 family protein [Bacillus cereus VD022]
 gi|434378029|ref|YP_006612673.1| hypothetical protein BTF1_22930 [Bacillus thuringiensis HD-789]
 gi|74492192|gb|EAO55359.1| Hypothetical protein RBTH_05472 [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|218541395|gb|ACK93789.1| conserved hypothetical protein [Bacillus cereus G9842]
 gi|228856340|gb|EEN00869.1| Protein containing DUF124 [Bacillus thuringiensis IBL 4222]
 gi|401086527|gb|EJP94749.1| TIGR00266 family protein [Bacillus cereus VD022]
 gi|401876586|gb|AFQ28753.1| hypothetical protein BTF1_22930 [Bacillus thuringiensis HD-789]
          Length = 260

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPKESVVAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKIVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +N  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDINYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|344201479|ref|YP_004786622.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
 gi|343953401|gb|AEM69200.1| protein of unknown function DUF124 [Muricauda ruestringensis DSM
           13258]
          Length = 266

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 91/234 (38%), Positives = 129/234 (55%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLF-- 78
           I + I G E Q  +I L PQE VVA  GS   M   ++M+ I+      + G+   LF  
Sbjct: 6   IDYHIYGEEMQYVEIELDPQEAVVAEAGSFMMMDTDIKMDTIFGDGSNQDTGVLGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G     V  A+P   +I+PIDL+  GG+ +CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTAFLNIGQGKKLVSFASPYPGKIVPIDLSEKGGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +R+     G EGF+ QKL G G+AF+ AGG++ +K L  GE + VD  CIV 
Sbjct: 126 VSVGIEFSKRLGRGFFGGEGFIMQKLEGDGMAFVHAGGTMAKKVLGPGETLKVDTGCIVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            + +V+  I++ G I+  VFGG+ L  A + GPG V+IQSLPF RL+ R+  A 
Sbjct: 186 FSQTVDYDIEFVGGIKNTVFGGEGLFFATLRGPGTVYIQSLPFSRLASRVWAAA 239


>gi|423560610|ref|ZP_17536886.1| TIGR00266 family protein [Bacillus cereus MSX-A1]
 gi|401203147|gb|EJR09987.1| TIGR00266 family protein [Bacillus cereus MSX-A1]
          Length = 260

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPKESVVAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGDGKRHVSFAAPYPGKIIPVDLTEYQGKIVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +N  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDINYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|229026345|ref|ZP_04182703.1| Protein containing DUF124 [Bacillus cereus AH1272]
 gi|228734945|gb|EEL85582.1| Protein containing DUF124 [Bacillus cereus AH1272]
          Length = 270

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 16  IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDASGPSGGLFGKLM 73

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 74  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 133

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 134 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 193

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 194 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 248


>gi|423451822|ref|ZP_17428675.1| TIGR00266 family protein [Bacillus cereus BAG5X1-1]
 gi|423557548|ref|ZP_17533850.1| TIGR00266 family protein [Bacillus cereus MC67]
 gi|401142628|gb|EJQ50168.1| TIGR00266 family protein [Bacillus cereus BAG5X1-1]
 gi|401192792|gb|EJQ99801.1| TIGR00266 family protein [Bacillus cereus MC67]
          Length = 260

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|423521243|ref|ZP_17497716.1| TIGR00266 family protein [Bacillus cereus HuA4-10]
 gi|401178602|gb|EJQ85776.1| TIGR00266 family protein [Bacillus cereus HuA4-10]
          Length = 260

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|229062555|ref|ZP_04199866.1| Protein containing DUF124 [Bacillus cereus AH603]
 gi|228716729|gb|EEL68423.1| Protein containing DUF124 [Bacillus cereus AH603]
          Length = 260

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|229099342|ref|ZP_04230273.1| Protein containing DUF124 [Bacillus cereus Rock3-29]
 gi|229105501|ref|ZP_04236142.1| Protein containing DUF124 [Bacillus cereus Rock3-28]
 gi|229118353|ref|ZP_04247709.1| Protein containing DUF124 [Bacillus cereus Rock1-3]
 gi|228665183|gb|EEL20669.1| Protein containing DUF124 [Bacillus cereus Rock1-3]
 gi|228677926|gb|EEL32162.1| Protein containing DUF124 [Bacillus cereus Rock3-28]
 gi|228684160|gb|EEL38107.1| Protein containing DUF124 [Bacillus cereus Rock3-29]
          Length = 263

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/239 (37%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 9   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSNGLFGKLM 66

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 67  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKIVCQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 187 VAMTKDVNYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241


>gi|442317456|ref|YP_007357477.1| hypothetical protein MYSTI_00442 [Myxococcus stipitatus DSM 14675]
 gi|441485098|gb|AGC41793.1| hypothetical protein MYSTI_00442 [Myxococcus stipitatus DSM 14675]
          Length = 264

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/242 (35%), Positives = 134/242 (55%), Gaps = 10/242 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VGM 73
           + F+I G + Q  ++ L PQE  VA  G++ +M   +EME I+   +E         +G 
Sbjct: 7   VDFKIHGDDLQFVEVELDPQEAAVAEAGTLMYMDDGIEMETIFGDGSEKKSGFLGSLLGA 66

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++ + V  N G     V  AAP   +I+ ++L   GGEL+ Q D+FL +   
Sbjct: 67  GKRLLTGESLFTTVFLNRGSGKRKVAFAAPYPGKIIAVNLQELGGELIAQKDSFLAAAKG 126

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ Q+L G GLAFI AGG+++++ L  GE++ VD  CIVA
Sbjct: 127 VSLGIAFQKKLGTGLFGGEGFIMQRLQGDGLAFIHAGGTLLERTLAPGELLRVDTGCIVA 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
              SV+  I+    I+ A FGG+ L  A + GPG V++QSLPF RL+ RI  A      R
Sbjct: 187 FQPSVDYDIQMVSGIKTAFFGGEGLFFATLRGPGKVWLQSLPFSRLAGRILSAAGPGGSR 246

Query: 253 EN 254
           + 
Sbjct: 247 DE 248


>gi|403668071|ref|ZP_10933368.1| hypothetical protein KJC8E_04832 [Kurthia sp. JC8E]
          Length = 267

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 85/232 (36%), Positives = 135/232 (58%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMW---- 74
           I F+I G + Q  ++ L P E  +A  GSM  M   + ME I+      ++  G +    
Sbjct: 6   IDFKIHGDDMQFVEVELDPGETAIAEAGSMMMMDDGIVMETIFGDGSAGQHGSGFFDKLK 65

Query: 75  ----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + L G+++      N G     V  AAP   RI+P+DL+ + G+++CQ DAFL + 
Sbjct: 66  GASKRLLTGESLFVTTFTNNGSGKKHVSFAAPYPGRIIPMDLSRYNGKIICQKDAFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V     +++     G EGF+ QKL G GLAF+ AGG++++K+L+VGE + +D  C+
Sbjct: 126 RGVSVGVEFQRKLGTGFFGGEGFIMQKLEGDGLAFVHAGGTIIRKDLQVGERLRLDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T +V+  I+  G I+ ++FGG+ L  A +TGPG V++QS+PF RL+ R+
Sbjct: 186 VAMTENVDYNIEAAGGIKTSLFGGEGLFLATLTGPGSVWVQSMPFSRLASRV 237


>gi|407707386|ref|YP_006830971.1| major facilitator superfamily MFS_1 [Bacillus thuringiensis MC28]
 gi|423377285|ref|ZP_17354569.1| TIGR00266 family protein [Bacillus cereus BAG1O-2]
 gi|423440397|ref|ZP_17417303.1| TIGR00266 family protein [Bacillus cereus BAG4X2-1]
 gi|423449454|ref|ZP_17426333.1| TIGR00266 family protein [Bacillus cereus BAG5O-1]
 gi|423463460|ref|ZP_17440228.1| TIGR00266 family protein [Bacillus cereus BAG6O-1]
 gi|423532813|ref|ZP_17509231.1| TIGR00266 family protein [Bacillus cereus HuB2-9]
 gi|423541923|ref|ZP_17518314.1| TIGR00266 family protein [Bacillus cereus HuB4-10]
 gi|423548159|ref|ZP_17524517.1| TIGR00266 family protein [Bacillus cereus HuB5-5]
 gi|423614689|ref|ZP_17590523.1| TIGR00266 family protein [Bacillus cereus VD115]
 gi|423622049|ref|ZP_17597827.1| TIGR00266 family protein [Bacillus cereus VD148]
 gi|401128127|gb|EJQ35829.1| TIGR00266 family protein [Bacillus cereus BAG5O-1]
 gi|401169743|gb|EJQ76987.1| TIGR00266 family protein [Bacillus cereus HuB4-10]
 gi|401176826|gb|EJQ84020.1| TIGR00266 family protein [Bacillus cereus HuB5-5]
 gi|401262147|gb|EJR68291.1| TIGR00266 family protein [Bacillus cereus VD148]
 gi|401262345|gb|EJR68487.1| TIGR00266 family protein [Bacillus cereus VD115]
 gi|401639094|gb|EJS56834.1| TIGR00266 family protein [Bacillus cereus BAG1O-2]
 gi|402419540|gb|EJV51819.1| TIGR00266 family protein [Bacillus cereus BAG4X2-1]
 gi|402421661|gb|EJV53909.1| TIGR00266 family protein [Bacillus cereus BAG6O-1]
 gi|402464535|gb|EJV96225.1| TIGR00266 family protein [Bacillus cereus HuB2-9]
 gi|407385071|gb|AFU15572.1| Protein containing DUF124 [Bacillus thuringiensis MC28]
          Length = 260

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 89/239 (37%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSNGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKIVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDVNYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|49479346|ref|YP_038905.1| hypothetical protein BT9727_4593 [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|165869724|ref|ZP_02214382.1| conserved hypothetical protein [Bacillus anthracis str. A0488]
 gi|167633901|ref|ZP_02392224.1| conserved hypothetical protein [Bacillus anthracis str. A0442]
 gi|167638080|ref|ZP_02396358.1| conserved hypothetical protein [Bacillus anthracis str. A0193]
 gi|170685759|ref|ZP_02876982.1| conserved hypothetical protein [Bacillus anthracis str. A0465]
 gi|177651305|ref|ZP_02934136.1| conserved hypothetical protein [Bacillus anthracis str. A0174]
 gi|190567250|ref|ZP_03020165.1| conserved hypothetical protein [Bacillus anthracis str.
           Tsiankovskii-I]
 gi|196033164|ref|ZP_03100577.1| conserved hypothetical protein [Bacillus cereus W]
 gi|196043959|ref|ZP_03111196.1| conserved hypothetical protein [Bacillus cereus 03BB108]
 gi|218906085|ref|YP_002453919.1| hypothetical protein BCAH820_4973 [Bacillus cereus AH820]
 gi|227817656|ref|YP_002817665.1| hypothetical protein BAMEG_5150 [Bacillus anthracis str. CDC 684]
 gi|228917512|ref|ZP_04081061.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|228929910|ref|ZP_04092925.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|228936171|ref|ZP_04098974.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|229124426|ref|ZP_04253614.1| Protein containing DUF124 [Bacillus cereus 95/8201]
 gi|229187124|ref|ZP_04314272.1| Protein containing DUF124 [Bacillus cereus BGSC 6E1]
 gi|229601942|ref|YP_002869130.1| hypothetical protein BAA_5128 [Bacillus anthracis str. A0248]
 gi|254687667|ref|ZP_05151523.1| hypothetical protein BantC_27985 [Bacillus anthracis str.
           CNEVA-9066]
 gi|254725231|ref|ZP_05187014.1| hypothetical protein BantA1_22639 [Bacillus anthracis str. A1055]
 gi|254736973|ref|ZP_05194679.1| hypothetical protein BantWNA_17531 [Bacillus anthracis str. Western
           North America USA6153]
 gi|254742007|ref|ZP_05199694.1| hypothetical protein BantKB_13453 [Bacillus anthracis str. Kruger
           B]
 gi|254754394|ref|ZP_05206429.1| hypothetical protein BantV_18087 [Bacillus anthracis str. Vollum]
 gi|254757226|ref|ZP_05209253.1| hypothetical protein BantA9_02876 [Bacillus anthracis str.
           Australia 94]
 gi|270000527|ref|NP_847302.2| hypothetical protein BA_5116 [Bacillus anthracis str. Ames]
 gi|301056374|ref|YP_003794585.1| hypothetical protein BACI_c48660 [Bacillus cereus biovar anthracis
           str. CI]
 gi|376268794|ref|YP_005121506.1| hypothetical protein bcf_24370 [Bacillus cereus F837/76]
 gi|386738759|ref|YP_006211940.1| hypothetical protein [Bacillus anthracis str. H9401]
 gi|421508720|ref|ZP_15955632.1| hypothetical protein B353_13259 [Bacillus anthracis str. UR-1]
 gi|421639948|ref|ZP_16080537.1| hypothetical protein BABF1_22911 [Bacillus anthracis str. BF1]
 gi|49330902|gb|AAT61548.1| conserved hypothetical protein [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|164714553|gb|EDR20072.1| conserved hypothetical protein [Bacillus anthracis str. A0488]
 gi|167513897|gb|EDR89265.1| conserved hypothetical protein [Bacillus anthracis str. A0193]
 gi|167530702|gb|EDR93404.1| conserved hypothetical protein [Bacillus anthracis str. A0442]
 gi|170670223|gb|EDT20963.1| conserved hypothetical protein [Bacillus anthracis str. A0465]
 gi|172083131|gb|EDT68193.1| conserved hypothetical protein [Bacillus anthracis str. A0174]
 gi|190561754|gb|EDV15724.1| conserved hypothetical protein [Bacillus anthracis str.
           Tsiankovskii-I]
 gi|195994593|gb|EDX58548.1| conserved hypothetical protein [Bacillus cereus W]
 gi|196025295|gb|EDX63965.1| conserved hypothetical protein [Bacillus cereus 03BB108]
 gi|218537815|gb|ACK90213.1| conserved hypothetical protein [Bacillus cereus AH820]
 gi|227005995|gb|ACP15738.1| conserved hypothetical protein [Bacillus anthracis str. CDC 684]
 gi|228596365|gb|EEK54037.1| Protein containing DUF124 [Bacillus cereus BGSC 6E1]
 gi|228659078|gb|EEL14730.1| Protein containing DUF124 [Bacillus cereus 95/8201]
 gi|228823516|gb|EEM69345.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|228829826|gb|EEM75448.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|228842184|gb|EEM87283.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|229266350|gb|ACQ47987.1| conserved hypothetical protein [Bacillus anthracis str. A0248]
 gi|269850231|gb|AAP28788.2| conserved hypothetical protein [Bacillus anthracis str. Ames]
 gi|300378543|gb|ADK07447.1| conserved hypothetical protein [Bacillus cereus biovar anthracis
           str. CI]
 gi|364514594|gb|AEW57993.1| Hypothetical protein bcf_24370 [Bacillus cereus F837/76]
 gi|384388611|gb|AFH86272.1| Hypothetical Protein H9401_4886 [Bacillus anthracis str. H9401]
 gi|401821343|gb|EJT20501.1| hypothetical protein B353_13259 [Bacillus anthracis str. UR-1]
 gi|403393036|gb|EJY90283.1| hypothetical protein BABF1_22911 [Bacillus anthracis str. BF1]
          Length = 260

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RLS R+    
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|52140639|ref|YP_086190.1| hypothetical protein BCZK4615 [Bacillus cereus E33L]
 gi|51974108|gb|AAU15658.1| conserved hypothetical protein [Bacillus cereus E33L]
          Length = 263

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 5   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 62

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 63  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 122

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 123 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 182

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RLS R+    
Sbjct: 183 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 238

Query: 247 TSP 249
           TSP
Sbjct: 239 TSP 241


>gi|228910716|ref|ZP_04074527.1| Protein containing DUF124 [Bacillus thuringiensis IBL 200]
 gi|228848984|gb|EEM93827.1| Protein containing DUF124 [Bacillus thuringiensis IBL 200]
          Length = 260

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPKESVVAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKIVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +N  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDINYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|423388819|ref|ZP_17366045.1| TIGR00266 family protein [Bacillus cereus BAG1X1-3]
 gi|423417197|ref|ZP_17394286.1| TIGR00266 family protein [Bacillus cereus BAG3X2-1]
 gi|401108615|gb|EJQ16546.1| TIGR00266 family protein [Bacillus cereus BAG3X2-1]
 gi|401642894|gb|EJS60600.1| TIGR00266 family protein [Bacillus cereus BAG1X1-3]
          Length = 260

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDASGPSGGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|118479980|ref|YP_897131.1| hypothetical protein BALH_4426 [Bacillus thuringiensis str. Al
           Hakam]
 gi|118419205|gb|ABK87624.1| conserved hypothetical protein [Bacillus thuringiensis str. Al
           Hakam]
          Length = 263

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 90/243 (37%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 5   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 62

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 63  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 122

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 123 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 182

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RLS R+    
Sbjct: 183 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 238

Query: 247 TSP 249
           TSP
Sbjct: 239 TSP 241


>gi|345866906|ref|ZP_08818927.1| hypothetical protein BZARG_802 [Bizionia argentinensis JUB59]
 gi|344048826|gb|EGV44429.1| hypothetical protein BZARG_802 [Bizionia argentinensis JUB59]
          Length = 266

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 125/230 (54%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           I + I G E Q  +I L PQE VVA  GS   M   ++ME I+        G    LFG 
Sbjct: 6   IDYAIYGEEMQYVEIELDPQEGVVAEAGSFMMMDDGIKMETIFGDGSSQNKGFMGSLFGA 65

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++   V  N       +  A+P   +IL IDL    G+ +CQ DAFLC+   
Sbjct: 66  GKRILTGESLFMTVFYNNLVGKRRISFASPYPGKILAIDLTECDGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +R+   + G EGF+ QKL G G+AF+ AGG++ ++ L+ GE + VD  CIV 
Sbjct: 126 VTIGIEFSKRLGRGLFGGEGFIMQKLEGDGMAFVHAGGTLAKRILQAGETLRVDTGCIVG 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  +++ G I+  +FGG+ L  A + GPG V+IQSLPF RL++R+
Sbjct: 186 FTQGIDYDVEFVGGIKNTIFGGEGLFFAKLQGPGTVYIQSLPFSRLARRV 235


>gi|302343437|ref|YP_003807966.1| hypothetical protein Deba_2007 [Desulfarculus baarsii DSM 2075]
 gi|301640050|gb|ADK85372.1| protein of unknown function DUF124 [Desulfarculus baarsii DSM 2075]
          Length = 338

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 84/234 (35%), Positives = 134/234 (57%), Gaps = 14/234 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY------------IPENE 70
           + ++I G E Q  +I L   E VVA  G M +MS  + M+ ++            +  + 
Sbjct: 76  VDYRIEGNEMQYVEIELDQGESVVAEAGGMLYMSADIAMDTVFGDGRRDNAKSSGLMGSL 135

Query: 71  VGMWQWLF-GKTITSVVLRNPGPS-DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
           +G  + L  G+++   +  + G S    V  AAP   +I+P+ L   GGEL+CQ DAFLC
Sbjct: 136 LGAGKRLLTGESLFMTIFTHKGASPKAHVAFAAPYPGKIMPMHLGELGGELICQKDAFLC 195

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
           +   V +     +++   + G EGF+ Q+L G GLAF+ AGG+V +K+L+ GE + VD  
Sbjct: 196 AARGVALEIAFQKKIGAGLFGGEGFIMQRLLGDGLAFVHAGGTVFEKDLKPGETLRVDTG 255

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           C+VA+  SVN  +++ G ++ A+FGG+ L  A + GPG V++QSLP  R++ RI
Sbjct: 256 CLVALMPSVNYDVEFVGGVKTALFGGEGLFLATLGGPGRVWLQSLPLSRVADRI 309


>gi|443320263|ref|ZP_21049376.1| TIGR00266 family protein [Gloeocapsa sp. PCC 73106]
 gi|442790027|gb|ELR99647.1| TIGR00266 family protein [Gloeocapsa sp. PCC 73106]
          Length = 236

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 133/232 (57%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI +QI G + Q+ +I L P E V A  G+M +M   +EM+         G  + L G++
Sbjct: 4   VIDYQIFGSDLQLVEIGLDPGEGVSAEIGTMTYMEQGIEMQTTAKGGLFGGFKRILTGES 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G     VG AAP   R++PIDLA  GG  LC+  +FLC+   + ++    +
Sbjct: 64  FFISNFVNQGRQKSRVGFAAPYPGRVVPIDLAQIGGTFLCEKGSFLCAAKGIDINVAFTK 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           R+     G EGF+ Q+L+G GLAFI AGG+++ K L+ G+ + VD  CIV  +SSV+  I
Sbjct: 124 RLGAGFFGGEGFILQRLSGDGLAFIHAGGTLIPKQLQSGQELRVDTGCIVGFSSSVDYSI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
           ++ G  R A+FG + L  A + GPG+V++QSLP  +L +RI  ++  P + +
Sbjct: 184 EFVGGFRNALFGSEGLFLAKLRGPGLVYLQSLPLSKLVERIQASLVFPRVSD 235


>gi|228942045|ref|ZP_04104588.1| Protein containing DUF124 [Bacillus thuringiensis serovar berliner
           ATCC 10792]
 gi|228974976|ref|ZP_04135537.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228981569|ref|ZP_04141866.1| Protein containing DUF124 [Bacillus thuringiensis Bt407]
 gi|384188938|ref|YP_005574834.1| hypothetical protein CT43_CH4885 [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|410677266|ref|YP_006929637.1| protein of unknown function DUF124 [Bacillus thuringiensis Bt407]
 gi|423386389|ref|ZP_17363645.1| TIGR00266 family protein [Bacillus cereus BAG1X1-2]
 gi|423527281|ref|ZP_17503726.1| TIGR00266 family protein [Bacillus cereus HuB1-1]
 gi|452201345|ref|YP_007481426.1| hypothetical protein H175_ch4968 [Bacillus thuringiensis serovar
           thuringiensis str. IS5056]
 gi|228778248|gb|EEM26518.1| Protein containing DUF124 [Bacillus thuringiensis Bt407]
 gi|228784829|gb|EEM32847.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228817714|gb|EEM63796.1| Protein containing DUF124 [Bacillus thuringiensis serovar berliner
           ATCC 10792]
 gi|326942647|gb|AEA18543.1| hypothetical protein CT43_CH4885 [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|401633344|gb|EJS51125.1| TIGR00266 family protein [Bacillus cereus BAG1X1-2]
 gi|402453334|gb|EJV85135.1| TIGR00266 family protein [Bacillus cereus HuB1-1]
 gi|409176395|gb|AFV20700.1| protein of unknown function DUF124 [Bacillus thuringiensis Bt407]
 gi|452106738|gb|AGG03678.1| hypothetical protein H175_ch4968 [Bacillus thuringiensis serovar
           thuringiensis str. IS5056]
          Length = 260

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPKESVIAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +N  I++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDINYDIEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|229032525|ref|ZP_04188491.1| Protein containing DUF124 [Bacillus cereus AH1271]
 gi|228728783|gb|EEL79793.1| Protein containing DUF124 [Bacillus cereus AH1271]
          Length = 260

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVQFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|423549380|ref|ZP_17525707.1| TIGR00266 family protein [Bacillus cereus ISP3191]
 gi|401191133|gb|EJQ98156.1| TIGR00266 family protein [Bacillus cereus ISP3191]
          Length = 260

 Score =  153 bits (387), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RLS R+    
Sbjct: 180 TGCLVAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|229163863|ref|ZP_04291804.1| Protein containing DUF124 [Bacillus cereus R309803]
 gi|228619605|gb|EEK76490.1| Protein containing DUF124 [Bacillus cereus R309803]
          Length = 260

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 88/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPKESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|225866860|ref|YP_002752238.1| hypothetical protein BCA_4996 [Bacillus cereus 03BB102]
 gi|225787587|gb|ACO27804.1| conserved hypothetical protein [Bacillus cereus 03BB102]
          Length = 260

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSSGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|229199033|ref|ZP_04325718.1| Protein containing DUF124 [Bacillus cereus m1293]
 gi|228584446|gb|EEK42579.1| Protein containing DUF124 [Bacillus cereus m1293]
          Length = 263

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 9   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 67  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 187 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241


>gi|169344060|ref|ZP_02865048.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens C
           str. JGS1495]
 gi|169297795|gb|EDS79892.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens C
           str. JGS1495]
          Length = 266

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 89/232 (38%), Positives = 132/232 (56%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
           I F++ G + Q  +I L P+E VVA  G+M  M  S+EME I+   +             
Sbjct: 7   IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 66

Query: 72  -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + L G+++      N G     V  A+P   +I+P+DL  +GG+L+CQ DAFLC+ 
Sbjct: 67  GAAKRVLTGESLFMTAFTNMGAGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G G+AFI AGG++V+K L  G+ + VD  C+
Sbjct: 127 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  VN  I+Y   I+ AVFGG+ +  A + GPG V+IQSLPF RL+ ++
Sbjct: 187 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 238


>gi|423613040|ref|ZP_17588901.1| TIGR00266 family protein [Bacillus cereus VD107]
 gi|401243511|gb|EJR49881.1| TIGR00266 family protein [Bacillus cereus VD107]
          Length = 260

 Score =  153 bits (386), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 88/243 (36%), Positives = 134/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGQSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G G AF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGFAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|229141601|ref|ZP_04270133.1| Protein containing DUF124 [Bacillus cereus BDRD-ST26]
 gi|228641881|gb|EEK98180.1| Protein containing DUF124 [Bacillus cereus BDRD-ST26]
          Length = 263

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 9   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 67  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 187 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241


>gi|42784066|ref|NP_981313.1| hypothetical protein BCE_5020 [Bacillus cereus ATCC 10987]
 gi|217962351|ref|YP_002340923.1| hypothetical protein BCAH187_A5002 [Bacillus cereus AH187]
 gi|375286872|ref|YP_005107311.1| hypothetical protein BCN_4778 [Bacillus cereus NC7401]
 gi|402555001|ref|YP_006596272.1| hypothetical protein BCK_10835 [Bacillus cereus FRI-35]
 gi|423355348|ref|ZP_17332973.1| TIGR00266 family protein [Bacillus cereus IS075]
 gi|423571089|ref|ZP_17547334.1| TIGR00266 family protein [Bacillus cereus MSX-A12]
 gi|42739997|gb|AAS43921.1| conserved hypothetical protein [Bacillus cereus ATCC 10987]
 gi|217067731|gb|ACJ81981.1| conserved hypothetical protein [Bacillus cereus AH187]
 gi|358355399|dbj|BAL20571.1| conserved hypothetical protein [Bacillus cereus NC7401]
 gi|401083970|gb|EJP92221.1| TIGR00266 family protein [Bacillus cereus IS075]
 gi|401202546|gb|EJR09399.1| TIGR00266 family protein [Bacillus cereus MSX-A12]
 gi|401796211|gb|AFQ10070.1| hypothetical protein BCK_10835 [Bacillus cereus FRI-35]
          Length = 260

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKHGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|423484455|ref|ZP_17461145.1| TIGR00266 family protein [Bacillus cereus BAG6X1-2]
 gi|401138615|gb|EJQ46183.1| TIGR00266 family protein [Bacillus cereus BAG6X1-2]
          Length = 260

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 136/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIEMETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++ +   G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|423573437|ref|ZP_17549556.1| TIGR00266 family protein [Bacillus cereus MSX-D12]
 gi|401214984|gb|EJR21705.1| TIGR00266 family protein [Bacillus cereus MSX-D12]
          Length = 260

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|196041505|ref|ZP_03108798.1| conserved hypothetical protein [Bacillus cereus NVH0597-99]
 gi|228948607|ref|ZP_04110886.1| Protein containing DUF124 [Bacillus thuringiensis serovar monterrey
           BGSC 4AJ1]
 gi|229093979|ref|ZP_04225070.1| Protein containing DUF124 [Bacillus cereus Rock3-42]
 gi|196027753|gb|EDX66367.1| conserved hypothetical protein [Bacillus cereus NVH0597-99]
 gi|228689461|gb|EEL43275.1| Protein containing DUF124 [Bacillus cereus Rock3-42]
 gi|228811106|gb|EEM57448.1| Protein containing DUF124 [Bacillus thuringiensis serovar monterrey
           BGSC 4AJ1]
          Length = 260

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|168204486|ref|ZP_02630491.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens E
           str. JGS1987]
 gi|170663922|gb|EDT16605.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens E
           str. JGS1987]
          Length = 265

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/231 (38%), Positives = 132/231 (57%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
           I F++ G + Q  +I L P+E VVA  G+M  M  S+EME I+   +             
Sbjct: 7   IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSGSGGFFGKLGG 66

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++      N G     V  A+P   +I+P+DL  +GG+L+CQ DAFLC+  
Sbjct: 67  AAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAAK 126

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++     G EGF+ QKL G G+AFI AGG++V+K L  G+ + VD  C+V
Sbjct: 127 GVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCLV 186

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A+T  VN  I+Y   I+ AVFGG+ +  A + GPG V+IQSLPF RL+ ++
Sbjct: 187 AMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 237


>gi|206976957|ref|ZP_03237858.1| conserved hypothetical protein [Bacillus cereus H3081.97]
 gi|384182680|ref|YP_005568442.1| hypothetical protein YBT020_23980 [Bacillus thuringiensis serovar
           finitimus YBT-020]
 gi|423373182|ref|ZP_17350521.1| TIGR00266 family protein [Bacillus cereus AND1407]
 gi|206744762|gb|EDZ56168.1| conserved hypothetical protein [Bacillus cereus H3081.97]
 gi|324328764|gb|ADY24024.1| hypothetical protein YBT020_23980 [Bacillus thuringiensis serovar
           finitimus YBT-020]
 gi|401096886|gb|EJQ04922.1| TIGR00266 family protein [Bacillus cereus AND1407]
          Length = 260

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|373855886|ref|ZP_09598632.1| protein of unknown function DUF124 [Bacillus sp. 1NLA3E]
 gi|372454955|gb|EHP28420.1| protein of unknown function DUF124 [Bacillus sp. 1NLA3E]
          Length = 264

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 85/232 (36%), Positives = 130/232 (56%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I ++I G + Q  ++ L PQE V+A  GS   M G ++ME I+   ++      + GK +
Sbjct: 6   IDYKIYGDDMQFVEVELDPQETVIAEAGSFMMMDGDIQMETIFGDGSQGSETGGIMGKLL 65

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           ++                N   +   V  AAP    ++P+DL+   G+++CQ DAFL + 
Sbjct: 66  SAGKRVITGESLFMTAFTNHANTKKHVSFAAPYPGSVIPMDLSKLNGKIICQKDAFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V     ++V     G EGF+ QKL G GLAF+ AGG++ +K L  G++I VD  C+
Sbjct: 126 KGVSVGIEFQRKVGAGFFGGEGFIMQKLEGDGLAFVHAGGTIHKKELASGDMIRVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+TS VN  I++   ++ A+FGG+ L  A + GPG V+IQSLPF RL+ R+
Sbjct: 186 VAMTSGVNYDIEFVKGVKTALFGGEGLFFATLRGPGTVWIQSLPFSRLASRV 237


>gi|334134590|ref|ZP_08508094.1| TIGR00266 family protein [Paenibacillus sp. HGF7]
 gi|333607745|gb|EGL19055.1| TIGR00266 family protein [Paenibacillus sp. HGF7]
          Length = 260

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 91/231 (39%), Positives = 132/231 (57%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           I + I G E Q  +I L PQE V+A  GSM  M G++ ME I+   +           +G
Sbjct: 6   IDYIIHGNEMQYVEIELDPQESVIAEAGSMMMMEGNIGMETIFGDGSGGGGRGLMGKLMG 65

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+++   V  N G     V  AAP   +I+P+DL   GG+L+CQ D+FLC+  
Sbjct: 66  AGKRLLTGESLFMTVYTNNGRGKEHVTFAAPYPGKIIPMDLHALGGKLICQKDSFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +   L +++     G EGF+ QKL G GL F+ A GS++++ L+ GEV+ VD  C V
Sbjct: 126 GVSIGIELQRKLGAGFFGGEGFIMQKLEGDGLGFVHACGSILERELQPGEVLRVDTGCFV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A+T  V   I++   I+ AVFGG+ L    V GPG V++QSLPF RL++R+
Sbjct: 186 AMTPDVQYDIEFVKGIKTAVFGGEGLFFVTVRGPGKVWLQSLPFSRLAERV 236


>gi|228923617|ref|ZP_04086897.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|423583077|ref|ZP_17559188.1| TIGR00266 family protein [Bacillus cereus VD014]
 gi|423634248|ref|ZP_17609901.1| TIGR00266 family protein [Bacillus cereus VD156]
 gi|228836086|gb|EEM81447.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|401210386|gb|EJR17138.1| TIGR00266 family protein [Bacillus cereus VD014]
 gi|401281034|gb|EJR86948.1| TIGR00266 family protein [Bacillus cereus VD156]
          Length = 260

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|228955143|ref|ZP_04117157.1| Protein containing DUF124 [Bacillus thuringiensis serovar kurstaki
           str. T03a001]
 gi|228804554|gb|EEM51159.1| Protein containing DUF124 [Bacillus thuringiensis serovar kurstaki
           str. T03a001]
          Length = 263

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 5   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 62

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 63  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 122

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 123 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 182

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 183 TGCLVAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 238

Query: 247 TSP 249
           TSP
Sbjct: 239 TSP 241


>gi|168212760|ref|ZP_02638385.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
           CPE str. F4969]
 gi|170715718|gb|EDT27900.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
           CPE str. F4969]
          Length = 265

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/232 (38%), Positives = 133/232 (57%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
           I F++ G + Q  +I L P+E VVA  G+M  M  S+EME+I+   +             
Sbjct: 6   IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMESIFGDGSNSSGSGGFFGKLG 65

Query: 72  -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + L G+++      N G     V  A+P   +I+P+DL  +GG+L+CQ DAFLC+ 
Sbjct: 66  GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G G+AFI AGG++V+K L  G+ + VD  C+
Sbjct: 126 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  VN  I+Y   I+ AVFGG+ +  A + GPG V+IQSLPF RL+ ++
Sbjct: 186 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 237


>gi|404329598|ref|ZP_10970046.1| hypothetical protein SvinD2_05844 [Sporolactobacillus vineae DSM
           21990 = SL153]
          Length = 263

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---------IPENEVGM 73
           I ++I G E Q   + L P E V+A  GS   M   + ME ++         + +   G 
Sbjct: 6   IDYRIEGQEMQFVTVELDPGETVIAEAGSFMMMDEGIGMETLFGDGSSSSGGLMDKLFGA 65

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++      N G     V  A+P   +I+P+DL+  GG+++CQ DAFLC+   
Sbjct: 66  GKRLLTGESLFMTAFTNNGSGKKHVSFASPYPGKIIPLDLSALGGQMICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAF+ AGG + + +L  GE + VD  C+VA
Sbjct: 126 VSVGIAFQKKLSTGFFGGEGFIMQKLEGDGLAFVHAGGMIKEIDLAAGETLRVDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +T   +  I++   ++ A+FGG+ L  A + GPG V +QSLPF RL+ R+  A+
Sbjct: 186 LTQETDYDIEFVRGVKTALFGGEGLFFATLRGPGKVLVQSLPFSRLASRVFSAM 239


>gi|120435350|ref|YP_861036.1| hypothetical protein GFO_0995 [Gramella forsetii KT0803]
 gi|117577500|emb|CAL65969.1| protein containing DUF124 [Gramella forsetii KT0803]
          Length = 277

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 84/234 (35%), Positives = 133/234 (56%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENE------VGM 73
           I +QI G E Q  ++ L P+E V+A  G+   M   ++M+ I+     +NE      +G 
Sbjct: 17  IDYQIYGEEMQYVELELDPREAVIAEAGNFMMMDDGIKMDTIFGDGSKQNEGFLGKVLGA 76

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++   +  N       +  A+P   +++PIDL  F G+ +CQ DAFLC+   
Sbjct: 77  GKRLLTGESLFMTIFSNEIQGKKKISFASPYPGKVIPIDLTRFNGKFICQKDAFLCAAKG 136

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++     G EGF+ QK+ G G+AF+ +GG++ +K L+ GE + VD  CI+ 
Sbjct: 137 VSIGIEFSKKLGRGFFGGEGFIMQKVEGDGMAFVHSGGTMAKKELQSGEKLKVDTGCIIG 196

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            T  V+  I++ G IR  +FGG+ L  A +TGPG V+IQSLPF RL+ RI +A 
Sbjct: 197 FTQGVDYDIEFVGGIRNTLFGGEGLFFATLTGPGTVYIQSLPFSRLASRIHQAA 250


>gi|406878089|gb|EKD27110.1| hypothetical protein ACD_79C00869G0003 [uncultured bacterium]
          Length = 261

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 132/230 (57%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPEN-EVGMWQWLF-- 78
           I ++I G + Q  +I L   EKVVA  G+M +M  S+  E  +    N E G+   LF  
Sbjct: 6   IDYKIFGDDMQFVEIELDSMEKVVAEAGAMMYMDDSISFEAKMGDGSNPEAGLMDKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G S   V  AAP   +I+P++LA +  E++CQ DAFLC+   
Sbjct: 66  GKRAIMGESLFMTHFTNNGKSKKRVAFAAPYPGKIIPLNLASYNEEIICQKDAFLCAALG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +++ T  +R    + G EGF+ +KL+G G+ F+ AGG+V++K L   E I VD  CIVA
Sbjct: 126 TQINITFTKRFGAGLFGGEGFILEKLSGDGMVFVHAGGTVIEKELN-NEKILVDTGCIVA 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             S+++  I+  G ++  VFGG+ L  A ++G G +++QSLPF RLS RI
Sbjct: 185 FESNISYDIERAGNLKSMVFGGEGLFLATLSGKGKIWLQSLPFSRLSDRI 234


>gi|423427001|ref|ZP_17404032.1| TIGR00266 family protein [Bacillus cereus BAG3X2-2]
 gi|423502446|ref|ZP_17479038.1| TIGR00266 family protein [Bacillus cereus HD73]
 gi|449091835|ref|YP_007424276.1| Protein containing DUF124 [Bacillus thuringiensis serovar kurstaki
           str. HD73]
 gi|401109916|gb|EJQ17834.1| TIGR00266 family protein [Bacillus cereus BAG3X2-2]
 gi|402460287|gb|EJV92009.1| TIGR00266 family protein [Bacillus cereus HD73]
 gi|449025592|gb|AGE80755.1| Protein containing DUF124 [Bacillus thuringiensis serovar kurstaki
           str. HD73]
          Length = 260

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 88/243 (36%), Positives = 135/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 180 TGCLVAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|228961134|ref|ZP_04122760.1| Protein containing DUF124 [Bacillus thuringiensis serovar pakistani
           str. T13001]
 gi|229048574|ref|ZP_04194134.1| Protein containing DUF124 [Bacillus cereus AH676]
 gi|229112328|ref|ZP_04241867.1| Protein containing DUF124 [Bacillus cereus Rock1-15]
 gi|229130143|ref|ZP_04259104.1| Protein containing DUF124 [Bacillus cereus BDRD-Cer4]
 gi|229147435|ref|ZP_04275784.1| Protein containing DUF124 [Bacillus cereus BDRD-ST24]
 gi|229153071|ref|ZP_04281252.1| Protein containing DUF124 [Bacillus cereus m1550]
 gi|229158482|ref|ZP_04286543.1| Protein containing DUF124 [Bacillus cereus ATCC 4342]
 gi|229181181|ref|ZP_04308513.1| Protein containing DUF124 [Bacillus cereus 172560W]
 gi|229193147|ref|ZP_04320102.1| Protein containing DUF124 [Bacillus cereus ATCC 10876]
 gi|228590411|gb|EEK48275.1| Protein containing DUF124 [Bacillus cereus ATCC 10876]
 gi|228602372|gb|EEK59861.1| Protein containing DUF124 [Bacillus cereus 172560W]
 gi|228625001|gb|EEK81767.1| Protein containing DUF124 [Bacillus cereus ATCC 4342]
 gi|228630491|gb|EEK87139.1| Protein containing DUF124 [Bacillus cereus m1550]
 gi|228636117|gb|EEK92598.1| Protein containing DUF124 [Bacillus cereus BDRD-ST24]
 gi|228653358|gb|EEL09235.1| Protein containing DUF124 [Bacillus cereus BDRD-Cer4]
 gi|228671168|gb|EEL26473.1| Protein containing DUF124 [Bacillus cereus Rock1-15]
 gi|228722776|gb|EEL74161.1| Protein containing DUF124 [Bacillus cereus AH676]
 gi|228798577|gb|EEM45564.1| Protein containing DUF124 [Bacillus thuringiensis serovar pakistani
           str. T13001]
          Length = 263

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 9   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 67  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 187 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241


>gi|168216590|ref|ZP_02642215.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
           NCTC 8239]
 gi|182624521|ref|ZP_02952304.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens D
           str. JGS1721]
 gi|422347228|ref|ZP_16428141.1| TIGR00266 family protein [Clostridium perfringens WAL-14572]
 gi|422872907|ref|ZP_16919392.1| hypothetical protein HA1_01642 [Clostridium perfringens F262]
 gi|177910329|gb|EDT72710.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens D
           str. JGS1721]
 gi|182381439|gb|EDT78918.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
           NCTC 8239]
 gi|373225140|gb|EHP47475.1| TIGR00266 family protein [Clostridium perfringens WAL-14572]
 gi|380306165|gb|EIA18439.1| hypothetical protein HA1_01642 [Clostridium perfringens F262]
          Length = 266

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/232 (38%), Positives = 132/232 (56%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
           I F++ G + Q  +I L P+E VVA  G+M  M  S+EME I+   +             
Sbjct: 7   IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 66

Query: 72  -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + L G+++      N G     V  A+P   +I+P+DL  +GG+L+CQ DAFLC+ 
Sbjct: 67  GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G G+AFI AGG++V+K L  G+ + VD  C+
Sbjct: 127 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  VN  I+Y   I+ AVFGG+ +  A + GPG V+IQSLPF RL+ ++
Sbjct: 187 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 238


>gi|110803643|ref|YP_697641.1| hypothetical protein CPR_0312 [Clostridium perfringens SM101]
 gi|110684144|gb|ABG87514.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
           SM101]
          Length = 265

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 89/232 (38%), Positives = 132/232 (56%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
           I F++ G + Q  +I L P+E VVA  G+M  M  S+EME I+   +             
Sbjct: 6   IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 65

Query: 72  -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + L G+++      N G     V  A+P   +I+P+DL  +GG+L+CQ DAFLC+ 
Sbjct: 66  GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G G+AFI AGG++V+K L  G+ + VD  C+
Sbjct: 126 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  VN  I+Y   I+ AVFGG+ +  A + GPG V+IQSLPF RL+ ++
Sbjct: 186 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 237


>gi|206969526|ref|ZP_03230480.1| conserved hypothetical protein [Bacillus cereus AH1134]
 gi|218234045|ref|YP_002369675.1| hypothetical protein BCB4264_A4991 [Bacillus cereus B4264]
 gi|229072369|ref|ZP_04205573.1| Protein containing DUF124 [Bacillus cereus F65185]
 gi|296505327|ref|YP_003667027.1| hypothetical protein BMB171_C4499 [Bacillus thuringiensis BMB171]
 gi|365158352|ref|ZP_09354548.1| TIGR00266 family protein [Bacillus sp. 7_6_55CFAA_CT2]
 gi|423411345|ref|ZP_17388465.1| TIGR00266 family protein [Bacillus cereus BAG3O-2]
 gi|423432869|ref|ZP_17409873.1| TIGR00266 family protein [Bacillus cereus BAG4O-1]
 gi|423438300|ref|ZP_17415281.1| TIGR00266 family protein [Bacillus cereus BAG4X12-1]
 gi|423631593|ref|ZP_17607340.1| TIGR00266 family protein [Bacillus cereus VD154]
 gi|423640067|ref|ZP_17615685.1| TIGR00266 family protein [Bacillus cereus VD166]
 gi|423650762|ref|ZP_17626332.1| TIGR00266 family protein [Bacillus cereus VD169]
 gi|423657816|ref|ZP_17633115.1| TIGR00266 family protein [Bacillus cereus VD200]
 gi|206735214|gb|EDZ52382.1| conserved hypothetical protein [Bacillus cereus AH1134]
 gi|218162002|gb|ACK61994.1| conserved hypothetical protein [Bacillus cereus B4264]
 gi|228710794|gb|EEL62765.1| Protein containing DUF124 [Bacillus cereus F65185]
 gi|296326379|gb|ADH09307.1| hypothetical protein BMB171_C4499 [Bacillus thuringiensis BMB171]
 gi|363626887|gb|EHL77850.1| TIGR00266 family protein [Bacillus sp. 7_6_55CFAA_CT2]
 gi|401107400|gb|EJQ15347.1| TIGR00266 family protein [Bacillus cereus BAG3O-2]
 gi|401114325|gb|EJQ22187.1| TIGR00266 family protein [Bacillus cereus BAG4O-1]
 gi|401118680|gb|EJQ26510.1| TIGR00266 family protein [Bacillus cereus BAG4X12-1]
 gi|401263730|gb|EJR69852.1| TIGR00266 family protein [Bacillus cereus VD154]
 gi|401281433|gb|EJR87345.1| TIGR00266 family protein [Bacillus cereus VD169]
 gi|401282091|gb|EJR87995.1| TIGR00266 family protein [Bacillus cereus VD166]
 gi|401288827|gb|EJR94566.1| TIGR00266 family protein [Bacillus cereus VD200]
          Length = 260

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|229082124|ref|ZP_04214596.1| Protein containing DUF124 [Bacillus cereus Rock4-2]
 gi|228701181|gb|EEL53695.1| Protein containing DUF124 [Bacillus cereus Rock4-2]
          Length = 263

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 9   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 66

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 67  GAGKRLVTGESMFMTVFTNTGRGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 127 KGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 187 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 241


>gi|114568603|ref|YP_755283.1| hypothetical protein Mmar10_0049 [Maricaulis maris MCS10]
 gi|114339065|gb|ABI64345.1| protein of unknown function DUF124 [Maricaulis maris MCS10]
          Length = 263

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/234 (36%), Positives = 131/234 (55%), Gaps = 10/234 (4%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW-- 74
           Q   I ++I G + Q  +I L P E VV+  G+M F + S+ ME ++    +   G+W  
Sbjct: 3   QTDEIDYEIGGEDIQYVEIELDPGESVVSEAGAMMFKAPSITMEAVFGDGSQKNQGVWGA 62

Query: 75  ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
                 + L G+++   V  + G     V  AAP    I+P+ L   GGEL+CQ D+FL 
Sbjct: 63  LAGAGKRLLTGESLFMTVFTHGGQGKAKVAFAAPYPGTIIPVHLPDLGGELICQKDSFLA 122

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
           +   V V   L +R+   + G EGF+ Q+L G G  F+ AGG+++++ L  GE + VD  
Sbjct: 123 AAKGVSVGIALQKRIMTGLFGGEGFIMQRLEGDGWVFVHAGGTLIERELAAGEELHVDTG 182

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           C+VA T  V+ +I+  G I+  +FGG+ +  A + GPG V+IQS+PF RL+ RI
Sbjct: 183 CVVAYTPDVDFEIERAGNIKSMIFGGEGVFFARLRGPGKVWIQSMPFSRLAGRI 236


>gi|168210434|ref|ZP_02636059.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens B
           str. ATCC 3626]
 gi|170711528|gb|EDT23710.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens B
           str. ATCC 3626]
          Length = 266

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/232 (37%), Positives = 132/232 (56%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
           I F++ G + Q  +I L P+E VVA  G+M  M  S+EME I+   +             
Sbjct: 7   IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 66

Query: 72  -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + L G+++      N G     V  A+P   +I+P+DL  +GG+L+CQ DAFLC+ 
Sbjct: 67  GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 126

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G G+AFI AGG++++K L  G+ + VD  C+
Sbjct: 127 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIIRKRLLPGQKLKVDTGCL 186

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  VN  I+Y   I+ AVFGG+ +  A + GPG V+IQSLPF RL+ ++
Sbjct: 187 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 238


>gi|110799859|ref|YP_694778.1| hypothetical protein CPF_0318 [Clostridium perfringens ATCC 13124]
 gi|110674506|gb|ABG83493.1| conserved hypothetical protein TIGR00266 [Clostridium perfringens
           ATCC 13124]
          Length = 265

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 88/232 (37%), Positives = 132/232 (56%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV----------- 71
           I F++ G + Q  +I L P+E VVA  G+M  M  S+EME I+   +             
Sbjct: 6   IDFELFGDDMQFVEIELDPRETVVAEAGAMMMMDSSIEMETIFGDGSNSSGSGGFFGKLG 65

Query: 72  -GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + L G+++      N G     V  A+P   +I+P+DL  +GG+L+CQ DAFLC+ 
Sbjct: 66  GAAKRVLTGESLFMTAFTNMGVGREKVAFASPYPGKIIPVDLRNYGGKLICQKDAFLCAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G G+AFI AGG++++K L  G+ + VD  C+
Sbjct: 126 KGVSIGIDFRRKLGTGFFGGEGFILQKLEGDGMAFIHAGGTIIRKRLLPGQKLKVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  VN  I+Y   I+ AVFGG+ +  A + GPG V+IQSLPF RL+ ++
Sbjct: 186 VAMTKDVNYDIEYVKGIKNAVFGGEGIFFASLVGPGEVWIQSLPFSRLASKV 237


>gi|357009687|ref|ZP_09074686.1| hypothetical protein PelgB_09421 [Paenibacillus elgii B69]
          Length = 262

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 90/242 (37%), Positives = 136/242 (56%), Gaps = 10/242 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----VGMW--- 74
           + ++I+G E Q  +I L P E V A  G+M  M  +++ME I+   ++     VG     
Sbjct: 6   VDYRIIGSEMQFVEIELDPGETVFAEAGAMMMMDQNIQMETIFGDGSDQNKGFVGKLFGA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++      N G     V  A+P   RILP+DL    G+L+CQ DAFLC+   
Sbjct: 66  GKRLLSGESLFMTAFTNRGSGKECVSFASPYPGRILPMDLVELNGKLICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QK+ G GLAF+ AGG++ ++ L  GE+I VD  C+VA
Sbjct: 126 VSVGIEFQRKLGAGFFGGEGFIMQKIEGDGLAFVHAGGAIHERVLGPGEMIRVDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           +T  VN  I++   ++ A+FGG+ L  A + GPG V+IQSLPF RL+ R+  A  +   +
Sbjct: 186 MTQEVNYDIEFVKGVKTAIFGGEGLFFATLRGPGKVWIQSLPFSRLADRVLSASRTGGRK 245

Query: 253 EN 254
           E 
Sbjct: 246 EE 247


>gi|30022928|ref|NP_834559.1| hypothetical protein BC4860 [Bacillus cereus ATCC 14579]
 gi|29898487|gb|AAP11760.1| hypothetical protein BC_4860 [Bacillus cereus ATCC 14579]
          Length = 260

 Score =  151 bits (382), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 87/239 (36%), Positives = 134/239 (56%), Gaps = 18/239 (7%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSLGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           VA+T  +N  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 184 VAMTKDINYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 238


>gi|108759112|ref|YP_628683.1| hypothetical protein MXAN_0412 [Myxococcus xanthus DK 1622]
 gi|108462992|gb|ABF88177.1| conserved hypothetical protein TIGR00266 [Myxococcus xanthus DK
           1622]
          Length = 265

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 87/243 (35%), Positives = 129/243 (53%), Gaps = 11/243 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F I G + Q  ++ L P E  VA  G++ +M   +EME I+    E   G +  L G 
Sbjct: 7   VDFHIYGEDLQFVEVELDPGEAAVAEAGTLMYMEDGIEMETIFGDGSEKTSGFFGSLLGA 66

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                      T+V L   G     V  AAP   +I+P++L+  GGEL+ Q D+FL +  
Sbjct: 67  GKRLLTGESLFTTVFLNKSGSGKRKVSFAAPYPGKIIPVNLSELGGELIAQKDSFLAAAK 126

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ Q+L G GLAFI AGG++ ++ L  GE++ VD  CIV
Sbjct: 127 GVSLGIAFQKKLGTGLFGGEGFIMQRLQGDGLAFIHAGGTLHERTLGPGEMLRVDTGCIV 186

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A   +V+  I+    I+ A FGG+ L  A + GPG V++QSLPF RL+ RI  A      
Sbjct: 187 AFQPTVDYDIQMVSGIKTAFFGGEGLFFATLRGPGKVWLQSLPFSRLAGRILSASRPGGA 246

Query: 252 REN 254
           R+ 
Sbjct: 247 RDE 249


>gi|170705443|ref|ZP_02895907.1| conserved hypothetical protein [Bacillus anthracis str. A0389]
 gi|170129568|gb|EDS98431.1| conserved hypothetical protein [Bacillus anthracis str. A0389]
          Length = 260

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 89/243 (36%), Positives = 134/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +E E I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIETETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 120 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 179

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  +++ G ++ A+FGG+ L  A + GPG V+IQSL   RLS R+    
Sbjct: 180 TGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLSARL---- 235

Query: 247 TSP 249
           TSP
Sbjct: 236 TSP 238


>gi|332665477|ref|YP_004448265.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
 gi|332334291|gb|AEE51392.1| protein of unknown function DUF124 [Haliscomenobacter hydrossis DSM
           1100]
          Length = 264

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 84/234 (35%), Positives = 129/234 (55%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLF- 78
           I + I G E Q  +I L P E V+A  GS   M   +EM  ++      +  G+W  L  
Sbjct: 6   IDYHIHGEEMQFVEIELDPMETVIAEAGSFMMMDDGIEMATLFGDGSQSDNGGLWGKLLS 65

Query: 79  -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                  G+++      +       V  A+P   +I+P+DL+   G+++CQ DAFLC+  
Sbjct: 66  AGKRLITGESLFMTAYTHASDGKKRVSFASPYPGKIIPLDLSKLDGKIICQKDAFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     Q++     G EGF+ QKL G G+AF+ AGGS+++K LE+GEV+ +D  C+V
Sbjct: 126 GVSVGIEFSQKLGRGFFGGEGFIMQKLEGDGMAFVHAGGSIIEKKLELGEVLKIDTGCLV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           A T  V+  I+    +R  +FGG+ +  A + GPG V+IQSLPF R++ R+ +A
Sbjct: 186 AFTRDVDYDIEMVRGVRNMLFGGEGIFLARLEGPGTVWIQSLPFSRMADRVLQA 239


>gi|298208358|ref|YP_003716537.1| hypothetical protein CA2559_08951 [Croceibacter atlanticus
           HTCC2559]
 gi|83848279|gb|EAP86149.1| hypothetical protein CA2559_08951 [Croceibacter atlanticus
           HTCC2559]
          Length = 267

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 88/235 (37%), Positives = 127/235 (54%), Gaps = 11/235 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           I + I G E Q  +I L PQE V+A  GS   M   ++M+ I+    E   G  + + G 
Sbjct: 6   IDYHIYGEEMQYVEIELDPQEGVIAEAGSFMMMDSGIKMDTIFGDGSEQNKGFLKSILGA 65

Query: 80  --KTITSVVL------RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + +T   L               +  A+P   +ILPIDL+ FG + +CQ DAFLC+  
Sbjct: 66  GKRLLTGESLFMTAFYHAETAGKKRLSFASPYPGKILPIDLSRFGNKFICQKDAFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ QKL G G+AF+ AGG+  QK L  GE + VD  CI+
Sbjct: 126 GVSVGVEFSKKLGRGFFGGEGFIMQKLEGVGMAFVHAGGTTAQKELAPGETLKVDTGCII 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             T  VN  +++ G I+ ++FGG+ L  A +TGPG V++QSLPF RL+ R+  A 
Sbjct: 186 GFTKDVNYDVEFVGGIKNSLFGGEGLFFATLTGPGTVYVQSLPFSRLASRVWAAA 240


>gi|405356330|ref|ZP_11025350.1| DUF124 domain-containing protein [Chondromyces apiculatus DSM 436]
 gi|397090926|gb|EJJ21767.1| DUF124 domain-containing protein [Myxococcus sp. (contaminant ex
           DSM 436)]
          Length = 265

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 86/243 (35%), Positives = 129/243 (53%), Gaps = 11/243 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F I G + Q  ++ L P E  VA  G++ +M   +EME I+    +   G +  L G 
Sbjct: 7   VDFHIYGEDLQFVEVELDPGEAAVAEAGTLMYMEDGIEMETIFGDGSQQSSGFFGSLLGA 66

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                      T+V     G     V  AAP   +I+P++L   GGEL+ Q D+FL +  
Sbjct: 67  GKRLLTGESLFTTVFFNKSGRGKRKVSFAAPYPGKIIPVNLGELGGELIAQKDSFLAAAK 126

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ Q+L G GLAFI AGG++ ++ L  GE++ VD  CIV
Sbjct: 127 GVSLGIAFQKKLGTGLFGGEGFIMQRLQGDGLAFIHAGGTLHERTLGPGELLRVDTGCIV 186

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A   +V+  I+    I+ A+FGG+ L  A + GPG V++QSLPF RL+ RI  A  S   
Sbjct: 187 AFQPTVDYDIQLVSGIKTALFGGEGLFFATLRGPGKVWLQSLPFSRLAGRILSASRSGGA 246

Query: 252 REN 254
           R+ 
Sbjct: 247 RDE 249


>gi|421616380|ref|ZP_16057394.1| hypothetical protein B597_05767 [Pseudomonas stutzeri KOS6]
 gi|409781763|gb|EKN61340.1| hypothetical protein B597_05767 [Pseudomonas stutzeri KOS6]
          Length = 248

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 88/232 (37%), Positives = 133/232 (57%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
           + ++ILG  AQ  +I+L P E V+A  G+M +M+G +  E      +  G+   L+G   
Sbjct: 6   LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTGGVRFEARMGDGSSSGVLGKLWGMGK 65

Query: 80  KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + IT   L      N G     V  AAP    ++PIDLA FGG+L+CQ DAFLC+    +
Sbjct: 66  RMITGESLFLTHFSNQGDGPARVAFAAPYPGTVVPIDLAAFGGQLICQKDAFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ Q+L G GLAF+ AGG+V++K L   E + +D  C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLEGDGLAFVHAGGTVIRKELN-NETLRLDSGCLVAFT 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  I+  G +R  +FGG+ L+   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGSGSVWIQSLPFSRLAERVYEAT 236


>gi|228988126|ref|ZP_04148225.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
 gi|228771624|gb|EEM20091.1| Protein containing DUF124 [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
          Length = 263

 Score =  149 bits (377), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 87/243 (35%), Positives = 134/243 (55%), Gaps = 18/243 (7%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 5   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDFIEMETIF--GDGSGPSGGLF 62

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 63  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 122

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+   V +     +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D
Sbjct: 123 LCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRID 182

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  +   +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    
Sbjct: 183 TGCLVAMTKDIKYDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL---- 238

Query: 247 TSP 249
           TSP
Sbjct: 239 TSP 241


>gi|338530988|ref|YP_004664322.1| hypothetical protein LILAB_06635 [Myxococcus fulvus HW-1]
 gi|337257084|gb|AEI63244.1| hypothetical protein LILAB_06635 [Myxococcus fulvus HW-1]
          Length = 262

 Score =  149 bits (377), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 86/234 (36%), Positives = 128/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLFG- 79
           + F I G + Q  ++ L P E  VA  G++ +M   +EME I+   +E   G +  L G 
Sbjct: 4   VDFHIYGEDLQFVEVELDPGEAAVAEAGTLMYMEDGIEMETIFGDGSEKTGGFFGSLLGA 63

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                      T+V L   G     V  AAP   +I+P++L+  GGEL+ Q D+FL +  
Sbjct: 64  GKRLLTGESLFTTVFLNKSGRGKRKVSFAAPYPGKIIPVNLSELGGELIAQKDSFLAAAK 123

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ Q+L G GLAFI AGG++ ++ L  GE++ VD  CIV
Sbjct: 124 GVSLGIAFQKKLGTGLFGGEGFIMQRLQGDGLAFIHAGGTLHERTLGPGELLRVDTGCIV 183

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           A   +V+  I+    I+ A FGG+ L  A + GPG V++QSLPF RL+ RI  A
Sbjct: 184 AFQPTVDYDIQMVSGIKTAFFGGEGLFFATLRGPGKVWLQSLPFSRLAGRILSA 237


>gi|381161030|ref|ZP_09870261.1| hypothetical protein Thi970DRAFT_04876 [Thiorhodovibrio sp. 970]
 gi|380877266|gb|EIC19359.1| hypothetical protein Thi970DRAFT_04876 [Thiorhodovibrio sp. 970]
          Length = 334

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 82/236 (34%), Positives = 126/236 (53%), Gaps = 12/236 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MWQWLF 78
           + ++I G E Q  ++ L P E  VA  G+M +   +++M+ ++   +  G        LF
Sbjct: 73  VDYEIFGHEMQFVEVELDPGESAVAEAGAMMYKHPAIQMDTVFGDGSHTGQGGGFMDRLF 132

Query: 79  G--------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           G        +++ + V  + G     V   AP    I+P+ L   GG L+CQ D+FLC  
Sbjct: 133 GAGKRLVTGESLFTTVFTHTGQGKARVAFGAPYPGSIIPVHLTDLGGTLICQKDSFLCGA 192

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++   + G EGF+ Q+L G GL FI AGG V ++ LE GE + VD  CI
Sbjct: 193 KGVAMGIFFQRKILTGLFGGEGFIMQRLDGDGLVFIHAGGCVTERVLEAGETLHVDTGCI 252

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           VA    V   ++  G I+ A+FGG+ +  AV+TGPG V++QSLPF RL+ R+  A 
Sbjct: 253 VAFEQKVGFDVQQVGGIKSALFGGEGVFFAVLTGPGRVWLQSLPFSRLAGRMLAAA 308


>gi|399521686|ref|ZP_10762426.1| Uncharacterized protein M6_Spy0233 [Pseudomonas pseudoalcaligenes
           CECT 5344]
 gi|399110924|emb|CCH38986.1| Uncharacterized protein M6_Spy0233 [Pseudomonas pseudoalcaligenes
           CECT 5344]
          Length = 256

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 131/232 (56%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W---- 74
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G+    W    
Sbjct: 14  LEYEILGASAQSVEIILDPGETVIAEAGAMNYMTDGVRFETRMGDGSSSGLLGKLWSAGK 73

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + L G+++      N G S   V  AAP    ++PIDLA  GG L+CQ DAFLC+ +   
Sbjct: 74  RMLTGESLFMTHFSNAGKSQARVAFAAPYPGTVVPIDLAKIGGRLICQKDAFLCAAHGTS 133

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ QKL G GLAF+ AGG+V++K L   E + +D  C+VA +
Sbjct: 134 IGISFAKRLGAGFFGGEGFILQKLEGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFS 192

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           S ++  I   G ++  +FGG+ ++   + G G V+IQSLPF RL++R+  A 
Sbjct: 193 SGIDYDIALAGGLKSMLFGGEGILLTTLKGTGTVWIQSLPFSRLAERVYAAT 244


>gi|229087392|ref|ZP_04219531.1| Protein containing DUF124 [Bacillus cereus Rock3-44]
 gi|228695921|gb|EEL48767.1| Protein containing DUF124 [Bacillus cereus Rock3-44]
          Length = 260

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 132/232 (56%), Gaps = 14/232 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I +++ G + Q  +I L P+E V+A  G+M  M   ++ME I+   +  G    LFGK +
Sbjct: 6   IEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDHIDMETIF--GDGAGKSGGLFGKLM 63

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
            +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+ 
Sbjct: 64  GAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIVPVDLREYQGKVVCQKDAFLCAA 123

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++ +   G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+
Sbjct: 124 KGVSIGIEFTKKIGSGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGERLRIDTGCL 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  V+  I++ G ++ A+FGG+ L  A + GPG V++QSL   RL+ R+
Sbjct: 184 VAMTRDVDYDIQFVGNVKTALFGGEGLFFATLEGPGTVWLQSLTLSRLAARL 235


>gi|312879522|ref|ZP_07739322.1| protein of unknown function DUF124 [Aminomonas paucivorans DSM
           12260]
 gi|310782813|gb|EFQ23211.1| protein of unknown function DUF124 [Aminomonas paucivorans DSM
           12260]
          Length = 334

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 80/235 (34%), Positives = 132/235 (56%), Gaps = 11/235 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY----------IPENEVG 72
           + F++ G E Q  ++ L P E VVA  G+M +   S+ M++++          + +  +G
Sbjct: 74  VDFRVFGQEMQYVELELDPGESVVAEAGAMMYKDPSVHMDSVFGDGSASSGGTLMDKVLG 133

Query: 73  MWQW-LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             +  L G+++   +    G   G V   AP    ILP+ L   GG L+CQ D+FLC+  
Sbjct: 134 AGKRVLTGESLFMTLFSQSGAGKGRVAFGAPYPGCILPVSLPDLGGTLICQKDSFLCAAR 193

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ ++L G G+AF+ AGG+V++++L  GE + VD  C+V
Sbjct: 194 GVSIGIYFQKKILTGLFGGEGFIMERLEGDGMAFVHAGGTVLERDLAAGETVHVDTGCVV 253

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
               SV+  I+  G I+ A+FGG+ L  A +TGPG +++QSLPF RL+ R+  A 
Sbjct: 254 GFVPSVSFDIQQVGNIKTALFGGEGLFFASLTGPGRIWLQSLPFSRLAGRMLAAA 308


>gi|222150709|ref|YP_002559862.1| hypothetical protein MCCL_0459 [Macrococcus caseolyticus JCSC5402]
 gi|222119831|dbj|BAH17166.1| conserved hypothetical protein [Macrococcus caseolyticus JCSC5402]
          Length = 261

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 83/235 (35%), Positives = 133/235 (56%), Gaps = 11/235 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM--------- 73
           I ++I G + Q  +I L P E V++  GSM  ++ ++ ME I+   +  G          
Sbjct: 6   IDYKIYGDDMQYVEIELDPNETVISEAGSMMMINPNIHMETIFGDGSTSGGNGFMDKLFA 65

Query: 74  --WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++      N G     V  AAP   +I+PIDL+ FGGE +CQ DAFL +  
Sbjct: 66  AGRRTLTGESLFMTTFTNRGTGKDHVYFAAPYPGKIIPIDLSQFGGEFICQKDAFLAAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++   L +R+     G EGF+ Q+L+G G+AFI AGG++++++L+ GE + VD  C+V
Sbjct: 126 GVQIGIALQKRLGVGFFGGEGFIMQRLSGDGMAFIHAGGAIMKRDLQPGETLLVDTGCLV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            +T  V  +I+    ++  +FGG+ +  A + GPG V++QSLPF R + RI  A 
Sbjct: 186 GMTKDVAYEIETIPGLKTKLFGGEGIFLARLRGPGTVYVQSLPFSRFASRIFAAA 240


>gi|146308385|ref|YP_001188850.1| hypothetical protein Pmen_3366 [Pseudomonas mendocina ymp]
 gi|421505032|ref|ZP_15951972.1| hypothetical protein A471_17213 [Pseudomonas mendocina DLHK]
 gi|145576586|gb|ABP86118.1| protein of unknown function DUF124 [Pseudomonas mendocina ymp]
 gi|400344255|gb|EJO92625.1| hypothetical protein A471_17213 [Pseudomonas mendocina DLHK]
          Length = 248

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 131/232 (56%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W---- 74
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G+    W    
Sbjct: 6   LDYEILGASAQSVEIVLDPGETVIAEAGAMNYMTDGVRFETRMGDGSSSGLLGKLWSAGK 65

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + L G+++      N G +   V  AAP    ++PIDLA  GG L+CQ DAFLC+ +   
Sbjct: 66  RMLTGESLFMTHFSNGGKTQARVAFAAPYPGTVVPIDLAQVGGRLICQKDAFLCAAHGTS 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ QKL G GLAF+ AGG+V++K L   E + +D  C+VA +
Sbjct: 126 IGISFAKRIGAGFFGGEGFILQKLEGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFS 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           S ++  I   G ++  +FGG+ L+   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 SGIDYDIALAGGLKSMLFGGEGLLLTTLKGTGTVWIQSLPFSRLAERVYAAT 236


>gi|345864473|ref|ZP_08816674.1| HTH DNA-binding protein [endosymbiont of Tevnia jerichonana (vent
           Tica)]
 gi|345879308|ref|ZP_08830971.1| hypothetical protein Rifp1Sym_eb00090 [endosymbiont of Riftia
           pachyptila (vent Ph05)]
 gi|344223680|gb|EGV50120.1| hypothetical protein Rifp1Sym_eb00090 [endosymbiont of Riftia
           pachyptila (vent Ph05)]
 gi|345124494|gb|EGW54373.1| HTH DNA-binding protein [endosymbiont of Tevnia jerichonana (vent
           Tica)]
          Length = 329

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 83/243 (34%), Positives = 131/243 (53%), Gaps = 11/243 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP----------ENEVG 72
           I +QI G E+Q  ++ L P E  VA  G+M +   S+ M  ++            +  +G
Sbjct: 70  IDYQIHGHESQFVEVELDPGESAVAEAGAMMYKDASIGMTTLFGDGSGSDGGGFMDKLLG 129

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+++ + V  + G     V  AAP    I+PI LA  GG L+CQ D+FLC+  
Sbjct: 130 AGKRLLTGESLFTTVFTHNGSGKAKVAFAAPFPGTIIPISLADMGGRLICQKDSFLCAAK 189

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ QKL G GL F+ AGG+++++ L  GE + VD  C+V
Sbjct: 190 GVSIGIHFQKKILTGLFGGEGFIMQKLEGDGLVFVQAGGTLLERELAPGEQLHVDTGCLV 249

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
            +  SV   ++  G ++ A+FGG+ L  A + GPG V++QSLPF RL+ R+  A      
Sbjct: 250 GMQDSVQFDLQQAGGVKTALFGGEGLFFANLQGPGKVWLQSLPFSRLAGRMLEAAPQGGG 309

Query: 252 REN 254
           R+ 
Sbjct: 310 RDQ 312


>gi|347750822|ref|YP_004858387.1| hypothetical protein Bcoa_0383 [Bacillus coagulans 36D1]
 gi|347583340|gb|AEO99606.1| protein of unknown function DUF124 [Bacillus coagulans 36D1]
          Length = 262

 Score =  148 bits (374), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 85/234 (36%), Positives = 135/234 (57%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I F+++G + Q  ++ L PQE V+A  GS+  M   + ME I+     ++ G+   LF  
Sbjct: 6   IDFKLVGDDMQFVEVELDPQETVIAEAGSLMMMEDGIRMETIFGDGSGSQGGIMGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G     V  AAP   +I+P+DL+  GG+++CQ DAFL +   
Sbjct: 66  GKRLITGESLFMTAFTNEGRDKKHVAFAAPYPGKIVPMDLSEQGGKIICQKDAFLAAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +++     G EGF+ QKL G GLAFI AGG++ ++ L  GE + VD  C+VA
Sbjct: 126 VSVGVEFQRKLGTGFFGGEGFIMQKLEGDGLAFIHAGGTIQKRELAPGETLKVDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           ++ +++  I++   I+ A+FGG+ L  A + GPG V++QSLPF RL+ R+  A+
Sbjct: 186 MSRTIDYDIEFVKGIKTALFGGEGLFFATLRGPGSVWVQSLPFSRLASRVFAAM 239


>gi|397687729|ref|YP_006525048.1| hypothetical protein PSJM300_13140 [Pseudomonas stutzeri DSM 10701]
 gi|395809285|gb|AFN78690.1| hypothetical protein PSJM300_13140 [Pseudomonas stutzeri DSM 10701]
          Length = 248

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 132/232 (56%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W---- 74
           + ++ILGG AQ  +I+L P E V+A  G+M +M+  +  E      +  GM    W    
Sbjct: 6   LEYEILGGSAQSVEIILDPGETVIAEAGAMNYMTEGVRFEARMGDGSSNGMLSKLWGMGK 65

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + L G+++      N G     V  AAP    ++P+DLA  GG L+CQ D+FLC+    +
Sbjct: 66  RMLTGESLFLTHFSNQGKGKARVAFAAPYPGTVVPVDLAAVGGRLICQKDSFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           V  +  +R+     G EGF+ Q+L G GLAF+ AGG+V++K L   E + +D  C+VA T
Sbjct: 126 VGISFSKRLGAGFFGGEGFILQRLEGDGLAFVHAGGTVIRKELNE-ETLRLDTGCLVAFT 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           S ++  I+  G ++  +FGG+ ++   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 SGIDYDIQLAGGLKSMLFGGEGILLTTLKGTGSVWIQSLPFSRLAERVYEAT 236


>gi|84688037|ref|ZP_01015898.1| hypothetical protein 1099457000243_RB2654_01510 [Maritimibacter
           alkaliphilus HTCC2654]
 gi|84663943|gb|EAQ10446.1| hypothetical protein RB2654_01510 [Rhodobacterales bacterium
           HTCC2654]
          Length = 263

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 85/235 (36%), Positives = 129/235 (54%), Gaps = 12/235 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-----------NIYIPENEV 71
           + ++I G + Q+ ++ L P E V+A  G+M +M   +  E            I      V
Sbjct: 6   VDYEIFGDDMQIVEVELDPAETVIAEAGAMNYMDDGIGFETRMGDGARPSGGIMDSLLNV 65

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
           G  + L G++I      N G     V  AAP   +I+P+D+A  GGEL+CQ DAFLC+  
Sbjct: 66  G-KRVLTGESIFMTHFTNNGQGKKRVAFAAPYPGKIIPVDMAEMGGELICQKDAFLCAAF 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
                    +++     G EGF+ Q+L G G+AFI  GG+V+++ L  GEV+ VD  C+ 
Sbjct: 125 GTTTDIAFQRKLGAGFFGGEGFILQRLNGDGMAFIHVGGTVIKRELAPGEVLRVDTGCLA 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           A+T SVN  I+  G ++  +FGG+ +  A + GPG V++QSLPF RL+ R+ RA 
Sbjct: 185 AMTGSVNYDIERAGNLKSMIFGGEGIFLATLRGPGTVYLQSLPFSRLADRVLRAA 239


>gi|253576664|ref|ZP_04853991.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
 gi|251843874|gb|EES71895.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
           str. D14]
          Length = 261

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 132/233 (56%), Gaps = 10/233 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           I ++I+G E Q  ++ L   E VVA  GS   M   + ME I+   +             
Sbjct: 6   IDYRIIGSEMQCVEVHLDQGESVVAEAGSFMMMDPDIRMETIFGDGSNQGGGLMGKLMGA 65

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+ +   V  N G     V  A+P   +I+P+DL M GG+++CQ DAFLC+   
Sbjct: 66  GKRLLTGEGLFMTVFTNAGYQRQAVTFASPYPGKIVPLDLQMLGGKVICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +R+     G EGF+ QK+ G GLAF+ +GG V++++L  GEV+ +D  C+VA
Sbjct: 126 VSVGIEFQRRLGTGFFGGEGFIMQKIEGDGLAFVHSGGLVIERDLAPGEVLRLDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T+SV+  I++   ++ A+FGG+ L  A + GPG V++QSLPF RL+ R+  A
Sbjct: 186 MTASVDYNIEFVKGVKTALFGGEGLFFATLRGPGKVWVQSLPFSRLADRVLSA 238


>gi|392957757|ref|ZP_10323277.1| hypothetical protein A374_13480 [Bacillus macauensis ZFHKF-1]
 gi|391876106|gb|EIT84706.1| hypothetical protein A374_13480 [Bacillus macauensis ZFHKF-1]
          Length = 259

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 134/234 (57%), Gaps = 10/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I ++I G + Q  ++ L P E V+A  G++  +  ++ ME I+      + G    LF  
Sbjct: 6   IDYKIYGSDMQFVEVELDPSETVIAEAGAVMMLEENIVMETIFGDGSAQKSGFMGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G     V  AAP   +I+P+DL+++ G+++CQ DAFL +   
Sbjct: 66  GKRLLTGESLFMTTFTNEGSGKKHVSFAAPYPGKIIPMDLSVYNGKIICQKDAFLAAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +   L +++     G EGF+ QKL G G+AF+ AGG++ +K L+ GEV+ VD  C+VA
Sbjct: 126 VSIGIELQRKIGAGFFGGEGFIMQKLEGDGMAFVHAGGTLHEKKLQPGEVLRVDTGCLVA 185

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +T  V   I++   ++ A+FGG+ L  A + GPG V++QSLPF RL+ R+  A+
Sbjct: 186 MTQDVEYNIEFVKGVKTALFGGEGLFFATLRGPGTVWVQSLPFSRLASRVFAAI 239


>gi|339494813|ref|YP_004715106.1| hypothetical protein PSTAB_2736 [Pseudomonas stutzeri ATCC 17588 =
           LMG 11199]
 gi|386021469|ref|YP_005939493.1| hypothetical protein PSTAA_2872 [Pseudomonas stutzeri DSM 4166]
 gi|327481441|gb|AEA84751.1| conserved hypothetical protein [Pseudomonas stutzeri DSM 4166]
 gi|338802185|gb|AEJ06017.1| hypothetical protein PSTAB_2736 [Pseudomonas stutzeri ATCC 17588 =
           LMG 11199]
          Length = 248

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 131/232 (56%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G+   L+G   
Sbjct: 6   LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLWGMGK 65

Query: 80  KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + IT   L      N G     V  AAP    ++P+DLA  GGEL+CQ DAFLC+    +
Sbjct: 66  RMITGESLFLTHFTNQGKGRARVAFAAPYPGTVVPVDLARLGGELICQKDAFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ Q+L G GLAFI AGG+V++K L   E + +D  C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLQGDGLAFIHAGGTVIRKELN-NETLRLDTGCLVAFT 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  I+  G +R  +FGG+ L+   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEAT 236


>gi|418294343|ref|ZP_12906239.1| hypothetical protein PstZobell_13606 [Pseudomonas stutzeri ATCC
           14405 = CCUG 16156]
 gi|379065722|gb|EHY78465.1| hypothetical protein PstZobell_13606 [Pseudomonas stutzeri ATCC
           14405 = CCUG 16156]
          Length = 248

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 131/232 (56%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G+   L G   
Sbjct: 6   LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLLGMGK 65

Query: 80  KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + IT   L      N G S   V  AAP    ++P+DLA  GG+L+CQ DAFLC+    +
Sbjct: 66  RMITGESLFLTHFTNQGKSKARVAFAAPYPGTVVPVDLAALGGQLICQKDAFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ Q+L G GLAF+ AGG+V++K L   E + +D  C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLDGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFT 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  I+  G +R  +FGG+ L+   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEAT 236


>gi|452748294|ref|ZP_21948075.1| hypothetical protein B381_11078 [Pseudomonas stutzeri NF13]
 gi|452007875|gb|EME00127.1| hypothetical protein B381_11078 [Pseudomonas stutzeri NF13]
          Length = 248

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 131/233 (56%), Gaps = 9/233 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G+   L+G   
Sbjct: 6   LEYEILGQSAQSVEIILDPDETVIAEAGAMNYMTNGVRFEARMGDGSASGVLGKLWGVGK 65

Query: 80  KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + IT   L      N G     V  AAP    ++PIDLA  GG+L CQ DAFLC+    +
Sbjct: 66  RMITGESLFLTHFTNEGKGKACVAFAAPYPGTVVPIDLAAAGGKLFCQKDAFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ Q+L G GLAF+ AGG+V++K L   E + +D  C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLEGNGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFT 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
             ++  I+  G +R  +FGG+ L+   + G G V+IQSLPF RL++R+  A +
Sbjct: 185 QGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEATS 237


>gi|386716499|ref|YP_006182825.1| hypothetical protein SMD_0032 [Stenotrophomonas maltophilia D457]
 gi|384076061|emb|CCH10635.1| DUF124 domain-containing protein [Stenotrophomonas maltophilia
           D457]
          Length = 339

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 125/230 (54%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           I F+I+G E Q  ++ L P E  +A  G++ F   +++M+ ++   N             
Sbjct: 79  IEFRIVGHEMQFVELELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGFMGKMMAA 138

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++ + V    G   G V  AAP    +L + L   GG L+CQ D+FL     
Sbjct: 139 GRRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGARG 198

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V++     +R+   + G EGF+ QKL G G  FI AGG V+++ L  GE + VD  C+VA
Sbjct: 199 VQIGVQFQRRIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVIERELAAGERLDVDTGCVVA 258

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             SSV++ ++    I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 259 YHSSVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 308


>gi|330504607|ref|YP_004381476.1| hypothetical protein [Pseudomonas mendocina NK-01]
 gi|328918893|gb|AEB59724.1| hypothetical protein MDS_3693 [Pseudomonas mendocina NK-01]
          Length = 248

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 130/232 (56%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W---- 74
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G+    W    
Sbjct: 6   LEYEILGASAQSVEIILDPGETVIAEAGAMNYMTDGVRFETRMGDGSSSGLLGKLWSVGK 65

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + L G+++      N G     V  AAP    ++PIDLA  GG L+CQ DAFLC+ +   
Sbjct: 66  RMLTGESLFMTHFSNAGKRQARVAFAAPYPGTVVPIDLAKIGGRLVCQKDAFLCAAHGTS 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ QKL G GLAF+ AGG+V++K L   E + +D  C+VA +
Sbjct: 126 IGISFAKRLGAGFFGGEGFILQKLEGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFS 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           S ++  I   G ++  +FGG+ ++   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 SGIDYDIALAGGLKSMLFGGEGILLTTLKGTGTVWIQSLPFSRLAERVYAAT 236


>gi|146283089|ref|YP_001173242.1| hypothetical protein PST_2750 [Pseudomonas stutzeri A1501]
 gi|145571294|gb|ABP80400.1| conserved hypothetical protein [Pseudomonas stutzeri A1501]
          Length = 248

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 131/232 (56%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G+   L+G   
Sbjct: 6   LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLWGMGK 65

Query: 80  KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + IT   L      N G     V  AAP    ++P+DLA  GGEL+CQ DAFLC+    +
Sbjct: 66  RMITGESLFLTHFTNQGKGRARVAFAAPYPGTVVPVDLARLGGELICQKDAFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ Q+L G GLAFI AGG+V++K L   E + +D  C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLQGDGLAFIHAGGTVIRKELN-NETLRLDTGCLVAFT 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  I+  G +R  +FGG+ L+   + G G ++IQSLPF RL++R+  A 
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSIWIQSLPFSRLAERVYEAT 236


>gi|403383765|ref|ZP_10925822.1| hypothetical protein KJC30_03660 [Kurthia sp. JC30]
          Length = 266

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 129/232 (55%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMW---- 74
           I F+I G + Q  ++ L P E  +A  G+M  M   + ME I+       N  G      
Sbjct: 6   IDFKIHGDDMQFVEVELDPGETAIAEAGAMMMMEDGIVMETIFGDGSQSSNASGFMGKLM 65

Query: 75  ----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + L G+++      N G     V  AAP   RI+P+DL+ + G+++CQ DAFL + 
Sbjct: 66  GAGKRLLTGESLFVTTFTNQGQGKQHVSFAAPYPGRIIPLDLSQYNGKIICQKDAFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V V     +++     G EGF+ QKL G GLAF+ AGG++++K+L  GE + +D  C+
Sbjct: 126 RGVSVGIEFQRKLGTGFFGGEGFIMQKLEGDGLAFVHAGGTIIRKDLRPGERLRIDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  V+  I+  G I+ ++FGG+ L  A + GPG V+IQS+PF RL+ R+
Sbjct: 186 VAMTDRVDYNIEAAGGIKTSLFGGEGLFLATLQGPGSVWIQSMPFSRLASRV 237


>gi|393773438|ref|ZP_10361836.1| hypothetical protein WSK_2833 [Novosphingobium sp. Rr 2-17]
 gi|392721318|gb|EIZ78785.1| hypothetical protein WSK_2833 [Novosphingobium sp. Rr 2-17]
          Length = 276

 Score =  146 bits (369), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 89/253 (35%), Positives = 131/253 (51%), Gaps = 12/253 (4%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           ++P+Q     S  D V  F+I G E Q  +I L P E  VA  G+M +   S++M  ++ 
Sbjct: 3   NSPWQHNRSSSIADDV-DFEIKGQELQFVEIELDPGESAVAEAGAMVWKDASIDMTTVFG 61

Query: 67  PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
             +           +G  + L  G+++ + V  + G     V  AAP    ILPI L   
Sbjct: 62  DGSGGQGGGFMGKLLGAGKRLVTGESLFTTVFTHQGRGKARVAFAAPVPGAILPIKLDHI 121

Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
           GG L+CQ DAFL +   V +     QR+   + G EGF+ Q+L G G  F+  GG+VV++
Sbjct: 122 GGRLICQKDAFLAAARGVSIGVHFQQRIMTGMFGGEGFIMQRLDGDGWVFVQMGGTVVER 181

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
            L VGE I VD  C+ A T  +   +   G ++  +FGG+ +  A + GPG V+IQSLPF
Sbjct: 182 ELAVGEEIHVDTGCVAAFTPGIEFDVIRAGSVKSMIFGGEGVFFARLRGPGKVWIQSLPF 241

Query: 236 HRLSQRIARAVTS 248
            RL+ R+  A  S
Sbjct: 242 SRLAGRMLAAAGS 254


>gi|402826312|ref|ZP_10875524.1| hypothetical protein LH128_24617 [Sphingomonas sp. LH128]
 gi|402260151|gb|EJU10302.1| hypothetical protein LH128_24617 [Sphingomonas sp. LH128]
          Length = 276

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 88/255 (34%), Positives = 130/255 (50%), Gaps = 12/255 (4%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           ++P+      S  D V  F+I G E Q  +I L P E  VA  G+M +   S++M  ++ 
Sbjct: 3   NSPWSHTRSSSLSDDV-DFEIKGQELQFVEIELDPGESAVAEAGAMVWKDASIDMTTVFG 61

Query: 67  PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
             +           +G  + L  G+++ + V  + G     V  AAP    ILPI L   
Sbjct: 62  DGSADQGGGFMGKLLGAGKRLVTGESLFTTVFTHRGSGKARVSFAAPIPGSILPIKLDEV 121

Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
           GG L+CQ DAFLC+   V +     QR+   + G EGF+ QKL G G  F+  GG+V+++
Sbjct: 122 GGRLICQKDAFLCAARGVSIGIHFQQRIMTGLFGGEGFIMQKLEGDGWVFVQMGGTVIER 181

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
            L  GE + VD  C+ A   SV   +   G ++  +FGG+ +  A + GPG V+IQSLPF
Sbjct: 182 ELRAGEELHVDTGCVAAFQPSVEFDVVRAGSVKSMIFGGEGVFFARLRGPGKVWIQSLPF 241

Query: 236 HRLSQRIARAVTSPN 250
            RL+ R+  A  S  
Sbjct: 242 SRLAGRMLAAAGSTG 256


>gi|431926698|ref|YP_007239732.1| hypothetical protein Psest_1552 [Pseudomonas stutzeri RCH2]
 gi|431824985|gb|AGA86102.1| TIGR00266 family protein [Pseudomonas stutzeri RCH2]
          Length = 248

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 85/232 (36%), Positives = 131/232 (56%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G+   L+G   
Sbjct: 6   LEYEILGQSAQSVEIILDPGETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLWGMGK 65

Query: 80  KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + IT   L      N G     V  AAP    ++P+DLA  GG+L+CQ DAFLC+    +
Sbjct: 66  RMITGESLFLTHFTNQGKGQARVAFAAPYPGTVVPVDLAALGGQLICQKDAFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ Q+L G GLAF+ AGG+V++K L   E + +D  C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLDGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFT 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  I+  G +R  +FGG+ L+   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 PGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEAT 236


>gi|344205455|ref|YP_004790596.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
 gi|343776817|gb|AEM49370.1| protein of unknown function DUF124 [Stenotrophomonas maltophilia
           JV3]
          Length = 338

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 125/231 (54%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
           I F+I+G E Q  +I L P E  +A  G++ F   +++M+ ++   N             
Sbjct: 77  IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGGFMGKVMA 136

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++ + V    G   G V  AAP    +L + L   GG L+CQ D+FL    
Sbjct: 137 AGKRVLTGESLFATVYTQTGHGKGKVAFAAPYPGTVLAMRLDQHGGRLICQKDSFLAGAR 196

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++     +++   + G EGF+ QKL G G  FI AGG V+++ L  GE + VD  C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVIERELAAGERLDVDTGCVV 256

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A  SSV++ ++    I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHSSVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307


>gi|28868167|ref|NP_790786.1| hypothetical protein PSPTO_0947 [Pseudomonas syringae pv. tomato
           str. DC3000]
 gi|28851404|gb|AAO54481.1| conserved protein of unknown function [Pseudomonas syringae pv.
           tomato str. DC3000]
          Length = 244

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 1   MDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 60

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 61  GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 120

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 121 VSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 180

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 181 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 234


>gi|257484023|ref|ZP_05638064.1| hypothetical protein PsyrptA_12263, partial [Pseudomonas syringae
           pv. tabaci str. ATCC 11528]
          Length = 282

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 39  VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 98

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 99  GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 158

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 159 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 218

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 219 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 272


>gi|213970164|ref|ZP_03398295.1| hypothetical protein PSPTOT1_5474 [Pseudomonas syringae pv. tomato
           T1]
 gi|213925045|gb|EEB58609.1| hypothetical protein PSPTOT1_5474 [Pseudomonas syringae pv. tomato
           T1]
          Length = 289

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 46  VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 105

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 106 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 165

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 166 VSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 225

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 226 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 279


>gi|71734043|ref|YP_273121.1| hypothetical protein PSPPH_0840 [Pseudomonas syringae pv.
           phaseolicola 1448A]
 gi|71554596|gb|AAZ33807.1| conserved hypothetical protein TIGR00266 [Pseudomonas syringae pv.
           phaseolicola 1448A]
          Length = 244

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 1   MDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 60

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 61  GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 120

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 121 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 180

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 181 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 234


>gi|302185871|ref|ZP_07262544.1| hypothetical protein Psyrps6_05980 [Pseudomonas syringae pv.
           syringae 642]
          Length = 358

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 84/238 (35%), Positives = 129/238 (54%), Gaps = 11/238 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG 79
           V+ F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG
Sbjct: 114 VVDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFG 173

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                   +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL    
Sbjct: 174 AGKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAK 233

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ 
Sbjct: 234 GVSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLA 293

Query: 192 AVTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           A+T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 294 AMTQTVDYDIRTVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351


>gi|389817739|ref|ZP_10208332.1| hypothetical protein A1A1_09841 [Planococcus antarcticus DSM 14505]
 gi|388464507|gb|EIM06838.1| hypothetical protein A1A1_09841 [Planococcus antarcticus DSM 14505]
          Length = 266

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 89/259 (34%), Positives = 141/259 (54%), Gaps = 13/259 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
           I F++ G + Q  ++ L P E VVA  G++  M   + ME I+   ++           +
Sbjct: 6   IDFKLHGDDMQFVEVELDPSETVVAEAGALMMMEDGITMETIFGDGSKSSGGSGLMGKLM 65

Query: 72  GMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           G  + +  G+++      N G     V  AAP   +I+P+DL+M  G+++CQ D+FL + 
Sbjct: 66  GAGKRIITGESLFMTTFTNNGTGKRHVSFAAPYPGKIIPMDLSMMNGKIICQKDSFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG++ +KNL+ GEV+ VD  C+
Sbjct: 126 KGVSIGIEFQRKLGAGFFGGEGFIMQKLEGDGLAFVHAGGTIYRKNLQKGEVLRVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSP 249
           VA+T  V+  I+    I+ A+FGG+ L  A + GPG V+IQS+PF RL+ R+ A A  +P
Sbjct: 186 VAMTGDVDYNIETVPGIKTALFGGEGLFLATLRGPGSVWIQSMPFSRLASRVFAAAPQNP 245

Query: 250 NMRENPKFFVQIAIFFFLA 268
             R   +      +F  L 
Sbjct: 246 AGRSKGEGSAAGGLFDLLG 264


>gi|347541722|ref|YP_004849149.1| hypothetical protein NH8B_3998 [Pseudogulbenkiania sp. NH8B]
 gi|345644902|dbj|BAK78735.1| protein of unknown function [Pseudogulbenkiania sp. NH8B]
          Length = 264

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 79/245 (32%), Positives = 129/245 (52%), Gaps = 10/245 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMW----- 74
           VI ++I G + Q  ++ L P E  V   G+M +M   + M+ ++   +    G+      
Sbjct: 5   VIDYEIFGDDMQYVEVELDPGEAAVGEAGTMYYMEDGISMDTVFGDGSAAQGGLLGKLMG 64

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++ + V  N  P    V  AA    +I+P+ L   GG L  Q D+FL +  
Sbjct: 65  AGKRLLTGESLFTTVFVNQAPGKKRVAFAASYPGKIVPVHLLELGGTLYAQKDSFLAAAK 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +   L +++   + G EGF+ Q+L G G AF+ AGG++ ++ L  GE + VD  C+V
Sbjct: 125 GVSLGIALQKKIGTGLFGGEGFIMQRLDGDGYAFLHAGGTLTERQLAPGETLRVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A   SV+  I+Y G ++ A+F G+ L  A +TGPG V++QSLP  RL+ R+  A      
Sbjct: 185 AYQPSVDFDIQYVGKLKSALFAGEGLFLATLTGPGRVWLQSLPLSRLADRLIAASPKAGG 244

Query: 252 RENPK 256
           R + +
Sbjct: 245 RSSEQ 249


>gi|224826743|ref|ZP_03699843.1| protein of unknown function DUF124 [Pseudogulbenkiania ferrooxidans
           2002]
 gi|224600963|gb|EEG07146.1| protein of unknown function DUF124 [Pseudogulbenkiania ferrooxidans
           2002]
          Length = 264

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 79/245 (32%), Positives = 129/245 (52%), Gaps = 10/245 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMW----- 74
           VI ++I G + Q  ++ L P E  V   G+M +M   + M+ ++   +    G+      
Sbjct: 5   VIDYEIFGDDMQYVEVELDPGEAAVGEAGTMYYMEDGISMDTVFGDGSAAQGGLLGKLMG 64

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++ + V  N  P    V  AA    +I+P+ L   GG L  Q D+FL +  
Sbjct: 65  AGKRLLTGESLFTTVFVNQAPGKKRVAFAASYPGKIVPVHLLELGGTLYAQKDSFLAAAK 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +   L +++   + G EGF+ Q+L G G AF+ AGG++ ++ L  GE + VD  C+V
Sbjct: 125 GVSLGIALQKKIGTGLFGGEGFIMQRLDGDGYAFLHAGGTLTERQLAPGETLRVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A   SV+  I+Y G ++ A+F G+ L  A +TGPG V++QSLP  RL+ R+  A      
Sbjct: 185 AYQPSVDFDIQYVGKLKSALFAGEGLFLATLTGPGRVWLQSLPLSRLADRLIAASPKAGG 244

Query: 252 RENPK 256
           R + +
Sbjct: 245 RSSEQ 249


>gi|87199726|ref|YP_496983.1| hypothetical protein Saro_1709 [Novosphingobium aromaticivorans DSM
           12444]
 gi|87135407|gb|ABD26149.1| protein of unknown function DUF124 [Novosphingobium aromaticivorans
           DSM 12444]
          Length = 277

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 88/255 (34%), Positives = 135/255 (52%), Gaps = 12/255 (4%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           ++P+Q +   S  D  I F+I G E Q  +I L P E  VA  G++ +  G++ M  ++ 
Sbjct: 3   NSPWQHHRSTSIADD-IDFEIKGQELQFLEIELDPGESAVAEAGALVWKDGAVGMTTVFG 61

Query: 67  PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
             +           +G  + L  G+++ + V  + G     V  A+P+   ILPI L+  
Sbjct: 62  DGSGGSEGGFMGKLLGAGKRLVTGESLFTTVFTHNGHGKARVAFASPTPGAILPIKLSDV 121

Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
           GG L+CQ D+FL +   V +     +RV   + G EGF+ QKL G G  F+  GG++V+K
Sbjct: 122 GGTLICQKDSFLAAARGVSIGIAFQRRVMTGLFGGEGFIMQKLEGDGWVFVQMGGTLVEK 181

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
            L  GE + VD  C+ A T S++  +   G +R  +FGG+ L  A + GPG V+IQSLPF
Sbjct: 182 ELRSGEELHVDTGCLAAYTPSIDFDLVTAGGVRSVLFGGEGLFFARLRGPGKVWIQSLPF 241

Query: 236 HRLSQRIARAVTSPN 250
            RL+ R+  A  S  
Sbjct: 242 SRLAGRMLAAAGSSG 256


>gi|66044069|ref|YP_233910.1| hypothetical protein Psyr_0814 [Pseudomonas syringae pv. syringae
           B728a]
 gi|63254776|gb|AAY35872.1| Protein of unknown function DUF124 [Pseudomonas syringae pv.
           syringae B728a]
          Length = 291

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 48  VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 107

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 108 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 167

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 168 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 227

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 228 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 281


>gi|375148997|ref|YP_005011438.1| hypothetical protein [Niastella koreensis GR20-10]
 gi|361063043|gb|AEW02035.1| protein of unknown function DUF124 [Niastella koreensis GR20-10]
          Length = 264

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 78/244 (31%), Positives = 130/244 (53%), Gaps = 14/244 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I ++I G E Q  +I L P E  +A  G+   M   ++M  ++   ++    Q L GK +
Sbjct: 7   IDYKIYGEEMQYVEIELDPNETAIAESGAFMMMDDGIQMATMFGDGSK--QQQGLLGKLM 64

Query: 83  TS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           ++                N G     V  A+P   +I+ +DL   GG+++ Q DAFLC+ 
Sbjct: 65  SAGKRMLTGESLFMTAFTNMGQGKKRVSFASPYPGKIIALDLQELGGKIVAQKDAFLCAA 124

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++   + G EGF+ +KL G G+AF+ AGG V ++ L+ GEV+ +D  C+
Sbjct: 125 KGVSIGIEFQRKLGTGLFGGEGFIMEKLEGDGMAFVHAGGHVFERTLQPGEVLRIDTGCL 184

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           VA T +++  I++ G I+  +FGG+ L  A + GPG V+IQ+LP  RL+ RI    ++  
Sbjct: 185 VAYTQTIDYDIQFVGGIKNTLFGGEGLFFATLRGPGKVWIQTLPISRLASRILTYGSAGG 244

Query: 251 MREN 254
            +E 
Sbjct: 245 RKEE 248


>gi|422654987|ref|ZP_16717708.1| hypothetical protein PSYAC_26095 [Pseudomonas syringae pv.
           actinidiae str. M302091]
 gi|330968051|gb|EGH68311.1| hypothetical protein PSYAC_26095 [Pseudomonas syringae pv.
           actinidiae str. M302091]
          Length = 356

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 113 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 172

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 173 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 232

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 233 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 292

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 293 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 346


>gi|422607037|ref|ZP_16679041.1| hypothetical protein PSYMO_18538 [Pseudomonas syringae pv. mori
           str. 301020]
 gi|330890683|gb|EGH23344.1| hypothetical protein PSYMO_18538 [Pseudomonas syringae pv. mori
           str. 301020]
          Length = 358

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 348


>gi|302062223|ref|ZP_07253764.1| hypothetical protein PsyrptK_19726 [Pseudomonas syringae pv. tomato
           K40]
 gi|302134180|ref|ZP_07260170.1| hypothetical protein PsyrptN_22461 [Pseudomonas syringae pv. tomato
           NCPPB 1108]
 gi|422660507|ref|ZP_16722918.1| hypothetical protein PLA106_23988 [Pseudomonas syringae pv.
           lachrymans str. M302278]
 gi|331019111|gb|EGH99167.1| hypothetical protein PLA106_23988 [Pseudomonas syringae pv.
           lachrymans str. M302278]
          Length = 356

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 113 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 172

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 173 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 232

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 233 VSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 292

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 293 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 346


>gi|237802055|ref|ZP_04590516.1| hypothetical protein POR16_24750 [Pseudomonas syringae pv. oryzae
           str. 1_6]
 gi|331024911|gb|EGI04967.1| hypothetical protein POR16_24750 [Pseudomonas syringae pv. oryzae
           str. 1_6]
          Length = 364

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 121 VDFKLYGSETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 180

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 181 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 240

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 241 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 300

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 301 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 354


>gi|374710374|ref|ZP_09714808.1| hypothetical protein SinuC_09125 [Sporolactobacillus inulinus CASD]
          Length = 263

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 125/233 (53%), Gaps = 9/233 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM--------- 73
           I ++I G E Q  +I L   E V+A  GS   M   + ME I+    E G          
Sbjct: 6   IDYKIEGQEMQFVEIELDSSETVIAEAGSFMMMEDGITMETIFGDGAEQGGVMGKLFSAG 65

Query: 74  WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  A+P   +I+P+DL+++ G ++CQ DAFLC+   V
Sbjct: 66  KRMLTGESLFMTAFTNEGGGKRHVSFASPYPGKIIPLDLSLYSGRMICQKDAFLCAAKGV 125

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
            V     +R+     G EGF+ QKL G G+AF+ AGG V +  L  GE + VD  C+VA+
Sbjct: 126 SVGIAFQKRLGAGFFGGEGFIMQKLEGDGMAFVHAGGMVKELTLAPGETLKVDTGCLVAM 185

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           T  V+  I +   I+ A+FGG+ L  A + GPG V +QSLPF RL+ R+  A+
Sbjct: 186 TRDVDYDITFVRGIKTALFGGEGLFFASLRGPGKVLVQSLPFSRLASRVFSAM 238


>gi|422296928|ref|ZP_16384577.1| Protein of unknown function DUF124 [Pseudomonas avellanae BPIC 631]
 gi|407991798|gb|EKG33569.1| Protein of unknown function DUF124 [Pseudomonas avellanae BPIC 631]
          Length = 356

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 113 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 172

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 173 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 232

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 233 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 292

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 293 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 346


>gi|307718072|ref|YP_003873604.1| HTH DNA-binding protein [Spirochaeta thermophila DSM 6192]
 gi|306531797|gb|ADN01331.1| HTH DNA-binding protein [Spirochaeta thermophila DSM 6192]
          Length = 246

 Score =  144 bits (364), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 123/220 (55%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I + I G E Q+ +++L P E V A  G+M +M   + M+         G  + L G++ 
Sbjct: 5   IDYTIHGDEMQLVEVVLDPGEGVRAEAGAMLYMEEGIAMDTRMEGGLLGGFKRILAGESF 64

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                 N G     V  AAP   +I+P+DL   GG  +CQ DAFLC+   +++     +R
Sbjct: 65  FITNFVNTGSRRAAVAFAAPYPGKIVPLDLGALGGRFVCQKDAFLCAAAGIEIDVVFTRR 124

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +     G EGF+ Q+L+G G AFI AGG++V+K L  G+ + VD  C V  + +V+  ++
Sbjct: 125 LGAGFFGGEGFILQELSGDGWAFIHAGGTIVEKRLAPGQTLRVDTGCFVGCSPTVDYDVR 184

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           + G  R A+FGG+ L    +TGPG+V++QSLP  RL+ RI
Sbjct: 185 FVGGFRNALFGGEGLFLTELTGPGVVYLQSLPLARLADRI 224


>gi|226945977|ref|YP_002801050.1| hypothetical protein Avin_39350 [Azotobacter vinelandii DJ]
 gi|226720904|gb|ACO80075.1| conserved hypothetical protein [Azotobacter vinelandii DJ]
          Length = 248

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 78/228 (34%), Positives = 131/228 (57%), Gaps = 9/228 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MW---- 74
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +E G    +W    
Sbjct: 6   LDYEILGAHAQSVEIILDPGETVIAEAGAMNYMTEGVRFETRMGDGSESGVLGKLWGMGK 65

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + L G+++      N G     V  AAP    ++P+DLA  GG L+CQ D+FLC+    +
Sbjct: 66  RMLTGESLFMTHFSNHGKRQARVAFAAPYPGTVVPVDLAEIGGTLICQKDSFLCAARGTE 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ Q+L+G GLAF+ AGG+V++K L   E + +D  C+V  +
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLSGDGLAFLHAGGAVIRKELR-DETLRLDTGCLVGFS 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             ++  I+  G ++  +FGG+ ++ A + G G V++QSLPF RL++R+
Sbjct: 185 RGIDYDIQLAGGLKSMLFGGEGILLATLKGTGTVWLQSLPFSRLAERV 232


>gi|301385947|ref|ZP_07234365.1| hypothetical protein PsyrptM_25068 [Pseudomonas syringae pv. tomato
           Max13]
          Length = 350

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 107 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 166

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 167 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 226

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 227 VSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 286

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 287 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 340


>gi|289628836|ref|ZP_06461790.1| hypothetical protein PsyrpaN_27475 [Pseudomonas syringae pv.
           aesculi str. NCPPB 3681]
 gi|416024917|ref|ZP_11568827.1| hypothetical protein PsgRace4_09542 [Pseudomonas syringae pv.
           glycinea str. race 4]
 gi|422582537|ref|ZP_16657672.1| hypothetical protein PSYAE_09004 [Pseudomonas syringae pv. aesculi
           str. 0893_23]
 gi|422683710|ref|ZP_16741968.1| hypothetical protein PSYTB_25931 [Pseudomonas syringae pv. tabaci
           str. ATCC 11528]
 gi|320330238|gb|EFW86223.1| hypothetical protein PsgRace4_09542 [Pseudomonas syringae pv.
           glycinea str. race 4]
 gi|330867379|gb|EGH02088.1| hypothetical protein PSYAE_09004 [Pseudomonas syringae pv. aesculi
           str. 0893_23]
 gi|331013042|gb|EGH93098.1| hypothetical protein PSYTB_25931 [Pseudomonas syringae pv. tabaci
           str. ATCC 11528]
          Length = 358

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 348


>gi|336113473|ref|YP_004568240.1| hypothetical protein BCO26_0795 [Bacillus coagulans 2-6]
 gi|335366903|gb|AEH52854.1| protein of unknown function DUF124 [Bacillus coagulans 2-6]
          Length = 263

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 80/235 (34%), Positives = 130/235 (55%), Gaps = 11/235 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I F+++G + Q  ++ L PQE V+A  GS+  M   + ME I+   +             
Sbjct: 6   IDFKLVGDDMQFVEVELDPQETVIAEAGSLMMMEDGIRMETIFGDGSGGQGGGIMGKLFS 65

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + + G+++      N          AAP   +I+P+DL+  GG+++CQ DAFL +  
Sbjct: 66  AGKRLITGESLFMTAFTNEARDKKHAAFAAPYPGKIVPMDLSELGGKIICQKDAFLAAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ QKL G G AFI AGG++ ++ L  GE++ VD  C+V
Sbjct: 126 GVSVGIEFQRKLGTGFFGGEGFIMQKLEGDGFAFIHAGGTIQKRELAPGEILRVDTGCLV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           A++ +V+  I++   I+ A+FGG+ L  A + GPG V++QSLPF RL+ R+  A+
Sbjct: 186 AMSKTVDYDIEFVKGIKTALFGGEGLFFATLRGPGSVWVQSLPFSRLASRVFAAM 240


>gi|326387594|ref|ZP_08209200.1| hypothetical protein Y88_0506 [Novosphingobium nitrogenifigens DSM
           19370]
 gi|326207640|gb|EGD58451.1| hypothetical protein Y88_0506 [Novosphingobium nitrogenifigens DSM
           19370]
          Length = 277

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 80/235 (34%), Positives = 125/235 (53%), Gaps = 11/235 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           + F I G + Q  +I L P E  V   G+M +   S+EM  ++   +E          +G
Sbjct: 19  VDFAIKGHDLQFVEIELDPGESAVGEAGAMVWKDASVEMTTVFGDGSEAADNGFMGKLLG 78

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G T+ + V  + G     V   AP    +LP+ LA  GG L+CQ D+FLC+  
Sbjct: 79  AGKRLLTGATLFTTVFTHRGTGKARVAFGAPIPGTVLPLRLADLGGTLICQKDSFLCAAR 138

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +R+   + G +GF+ Q+L G G  F+  GG+++++ L  GE I VD  C+ 
Sbjct: 139 GVSIGIAFQKRIMTGLFGGDGFVMQRLDGDGWVFVQMGGTLIERELAPGEQIHVDTGCLA 198

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           A T SV+  ++  G ++  +FGG+ +  A +TGPG V+IQSLPF RL+ R+  A 
Sbjct: 199 AFTPSVDFDVQAAGGVKTMLFGGEGVFLARLTGPGKVWIQSLPFSRLAGRMLAAA 253


>gi|298159927|gb|EFI00966.1| HTH DNA-binding protein [Pseudomonas savastanoi pv. savastanoi
           NCPPB 3335]
          Length = 358

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGRNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 348


>gi|429765885|ref|ZP_19298165.1| TIGR00266 family protein [Clostridium celatum DSM 1785]
 gi|429185738|gb|EKY26712.1| TIGR00266 family protein [Clostridium celatum DSM 1785]
          Length = 262

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 91/245 (37%), Positives = 139/245 (56%), Gaps = 13/245 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMW----- 74
           I ++I G E Q  +I L P E V+A  G+M  M  S+ ME +      +N  G++     
Sbjct: 5   IDYKIHGSEMQFVEIELDPNETVMAEAGAMMMMDNSIVMETVLGDGSEKNSRGLFGKIGG 64

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G++I      N G     V  +AP   +ILP+DL + GGEL+CQ DAFLC   
Sbjct: 65  AAKRALTGESIFMTAFTNMGYGRESVAFSAPYPGKILPMDLTVLGGELICQKDAFLCCAK 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            + +     +++     G EGF+ Q+L G GLAFI  GG++V+K L  G+++ +D  C+V
Sbjct: 125 GISLDIAFTKKIGAGFFGGEGFILQRLQGNGLAFIHGGGTIVRKTLMPGQILKIDTGCLV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A+T +V+  I++ G ++ ++FGG+ L  A + GPG V+IQSLPF R + RI  A  +P +
Sbjct: 185 AMTKNVSYDIQFVGGLKNSIFGGEGLFFATLKGPGDVWIQSLPFSRFADRIISA--APQL 242

Query: 252 RENPK 256
             N K
Sbjct: 243 TNNTK 247


>gi|440721263|ref|ZP_20901663.1| hypothetical protein A979_10584 [Pseudomonas syringae BRIP34876]
 gi|440728601|ref|ZP_20908814.1| hypothetical protein A987_21097 [Pseudomonas syringae BRIP34881]
 gi|440361390|gb|ELP98621.1| hypothetical protein A987_21097 [Pseudomonas syringae BRIP34881]
 gi|440363885|gb|ELQ01039.1| hypothetical protein A979_10584 [Pseudomonas syringae BRIP34876]
          Length = 358

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351


>gi|422661660|ref|ZP_16723892.1| hypothetical protein PSYAP_30263, partial [Pseudomonas syringae pv.
           aptata str. DSM 50252]
 gi|330982769|gb|EGH80872.1| hypothetical protein PSYAP_30263 [Pseudomonas syringae pv. aptata
           str. DSM 50252]
          Length = 329

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 86  VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 145

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 146 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 205

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 206 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 265

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 266 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 322


>gi|422620056|ref|ZP_16688740.1| hypothetical protein PSYJA_23843 [Pseudomonas syringae pv. japonica
           str. M301072]
 gi|330900420|gb|EGH31839.1| hypothetical protein PSYJA_23843 [Pseudomonas syringae pv. japonica
           str. M301072]
          Length = 361

 Score =  144 bits (362), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 118 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 177

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 178 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 237

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 238 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 297

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 298 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 354


>gi|289651012|ref|ZP_06482355.1| hypothetical protein Psyrpa2_25244 [Pseudomonas syringae pv.
           aesculi str. 2250]
          Length = 352

 Score =  144 bits (362), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 109 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 168

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 169 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 228

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 229 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 288

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 289 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 342


>gi|456011513|gb|EMF45259.1| DUF124 domain-containing protein [Planococcus halocryophilus Or1]
          Length = 266

 Score =  144 bits (362), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 88/259 (33%), Positives = 140/259 (54%), Gaps = 13/259 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
           I +++ G + Q  ++ L P E V+A  G++  M   + ME I+   ++           +
Sbjct: 6   IDYKLHGDDMQFVEVELDPSETVIAEAGALMMMEDGIAMETIFGDGSQSSGGSGLMGKLM 65

Query: 72  GMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           G  + +  G+++      N G     V  AAP   +I+P+DL+   G+++CQ DAFL + 
Sbjct: 66  GAGKRIITGESLFMTTFTNNGTGKRHVSFAAPYPGKIIPMDLSTLNGKIICQKDAFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG++ +KNL+ GEVI VD  C+
Sbjct: 126 KGVSIGIEFQRKIGAGFFGGEGFIMQKLEGDGLAFVHAGGTIYRKNLQKGEVIRVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI-ARAVTSP 249
           VA+T  V+  I+    I+ A+FGG+ L  A + GPG V+IQS+PF RL+ R+ A A  +P
Sbjct: 186 VAMTGDVDYNIEAVPGIKTALFGGEGLFFATLRGPGSVWIQSMPFSRLASRVFAAAPQNP 245

Query: 250 NMRENPKFFVQIAIFFFLA 268
             R   +      +F  L 
Sbjct: 246 AGRSKGEGSAAGGLFDLLG 264


>gi|422597437|ref|ZP_16671710.1| hypothetical protein PLA107_22068 [Pseudomonas syringae pv.
           lachrymans str. M301315]
 gi|330987727|gb|EGH85830.1| hypothetical protein PLA107_22068 [Pseudomonas syringae pv.
           lachrymans str. M301315]
          Length = 358

 Score =  144 bits (362), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 81/231 (35%), Positives = 126/231 (54%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRM 345


>gi|422586855|ref|ZP_16661526.1| hypothetical protein PSYMP_00190 [Pseudomonas syringae pv.
           morsprunorum str. M302280]
 gi|330872497|gb|EGH06646.1| hypothetical protein PSYMP_00190 [Pseudomonas syringae pv.
           morsprunorum str. M302280]
          Length = 356

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 113 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 172

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 173 GKRMLTGESLFTSVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 232

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 233 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 292

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 293 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 346


>gi|443641520|ref|ZP_21125370.1| AIM24 domain protein [Pseudomonas syringae pv. syringae B64]
 gi|443281537|gb|ELS40542.1| AIM24 domain protein [Pseudomonas syringae pv. syringae B64]
          Length = 358

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351


>gi|422647248|ref|ZP_16710378.1| hypothetical protein PMA4326_19830 [Pseudomonas syringae pv.
           maculicola str. ES4326]
 gi|330960792|gb|EGH61052.1| hypothetical protein PMA4326_19830 [Pseudomonas syringae pv.
           maculicola str. ES4326]
          Length = 359

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 82/234 (35%), Positives = 127/234 (54%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 116 VDFKLYGSETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 175

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 176 GKRMLTGESLFTTVFSQQGAGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 235

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 236 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEGLDVDTGCLAA 295

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 296 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 349


>gi|94498230|ref|ZP_01304791.1| hypothetical protein SKA58_14132 [Sphingomonas sp. SKA58]
 gi|94422360|gb|EAT07400.1| hypothetical protein SKA58_14132 [Sphingomonas sp. SKA58]
          Length = 279

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 83/253 (32%), Positives = 134/253 (52%), Gaps = 12/253 (4%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           ++P+  +   S  D  I F+I G E Q  +I L P E  VA  G+M +  G++ M  ++ 
Sbjct: 5   NSPWSHHRSASLSDD-IDFEIKGQELQFLEIELNPGESAVAEAGAMVWKDGAISMTTVFG 63

Query: 67  PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
             +           +G  + L  G+++ + V  + G     V  A+P+   ILP+ L  +
Sbjct: 64  DGSGGADGGFMGKLLGAGKRLVTGESLFTTVFTHQGHGKARVAFASPTPGSILPLKLDQY 123

Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
           GG L+CQ D+FL +   V +     +R+   + G EGF+ Q+L+G G  F+  GG++V+K
Sbjct: 124 GGALICQKDSFLAAAKGVAIGVHFQKRIMTGLFGGEGFIMQRLSGDGWVFVQMGGTLVEK 183

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
            L  GE + VD  C+ A+T  V+  +   G ++  +FGG+    A + GPG V+IQSLPF
Sbjct: 184 ELAPGEELHVDTGCLAAMTPGVDFDLVAAGGVKSMIFGGEGAFFARLRGPGHVWIQSLPF 243

Query: 236 HRLSQRIARAVTS 248
            RL+ R+  A  S
Sbjct: 244 SRLAGRMLAAAGS 256


>gi|254523742|ref|ZP_05135797.1| hypothetical protein SSKA14_2876 [Stenotrophomonas sp. SKA14]
 gi|219721333|gb|EED39858.1| hypothetical protein SSKA14_2876 [Stenotrophomonas sp. SKA14]
          Length = 337

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 127/230 (55%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN-EVGMW------- 74
           I F+I+G E Q  +I L P E  +A  G++ F   +++M+ ++   N + G +       
Sbjct: 77  IDFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGFMGKVMAA 136

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++ + V    G   G V  AAP    +L + L   GG L+CQ D+FL     
Sbjct: 137 GKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGARG 196

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V++     +R+   + G EGF+ QKL G G  FI AGG V+++ L  GE + VD  C+VA
Sbjct: 197 VQIGVQFQRRIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVIERELAAGERLDVDTGCVVA 256

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
              +V++ ++    ++  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 257 YHPTVDMDVRRVTGVKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 306


>gi|251796470|ref|YP_003011201.1| hypothetical protein Pjdr2_2460 [Paenibacillus sp. JDR-2]
 gi|247544096|gb|ACT01115.1| protein of unknown function DUF124 [Paenibacillus sp. JDR-2]
          Length = 266

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 85/231 (36%), Positives = 127/231 (54%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMW----- 74
           I + I G + Q  +I L P E V+A  GS+  M   + ME I+     ++  G+      
Sbjct: 7   IDYYIHGDDMQFVEIELDPAETVIAEAGSLMMMDDDIHMEAIFGDGSAQSSGGLMGKLMS 66

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++      N G        AAP   +I+PIDL   GG+++CQ DAFL +  
Sbjct: 67  AGKRILTGESLFMTAFTNEGMGKKKASFAAPYPGKIIPIDLHAAGGKMICQKDAFLAAAK 126

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ QKL G GLAF+ AGG++++K+L  GE + VD  C+V
Sbjct: 127 GVSVGIEFQRKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTIIRKDLNPGEKLRVDTGCLV 186

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A T SV+  I+    ++  +FGG+ L  A + GPG V+IQSLPF RL+ R+
Sbjct: 187 AFTPSVHYSIEMVKGVKTFLFGGEGLFFATLEGPGTVYIQSLPFSRLASRV 237


>gi|424066025|ref|ZP_17803498.1| Protein of unknown function DUF124 [Pseudomonas syringae pv.
           avellanae str. ISPaVe013]
 gi|408002727|gb|EKG42963.1| Protein of unknown function DUF124 [Pseudomonas syringae pv.
           avellanae str. ISPaVe013]
          Length = 358

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSPGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351


>gi|424070674|ref|ZP_17808107.1| DUF124 domain-containing protein [Pseudomonas syringae pv.
           avellanae str. ISPaVe037]
 gi|408000091|gb|EKG40456.1| DUF124 domain-containing protein [Pseudomonas syringae pv.
           avellanae str. ISPaVe037]
          Length = 358

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 128/237 (54%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSPGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351


>gi|440730209|ref|ZP_20910303.1| hypothetical protein A989_02770 [Xanthomonas translucens DAR61454]
 gi|440379300|gb|ELQ15897.1| hypothetical protein A989_02770 [Xanthomonas translucens DAR61454]
          Length = 333

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 84/246 (34%), Positives = 130/246 (52%), Gaps = 12/246 (4%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           S P  P    S  D  I F+I+G E Q  +I L P E  VA  G++ F   +++M+ ++ 
Sbjct: 59  SLPPPPGGRGSADD--IDFRIVGHEMQFVEIALDPGESAVAEAGALMFKDAAVQMDTVFG 116

Query: 67  PENEVG--MWQWLF--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFG 116
             +  G  +   LF        G+++ + V  + G     V  AAP    +L + L   G
Sbjct: 117 DGSSQGGGLMGKLFSAGKRLVTGESLFTTVFTHQGQGKATVAFAAPYPGTVLAMKLDQHG 176

Query: 117 GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKN 176
           G L+CQ D+FL     V +     +++   + G EGF+ QKL G G  F+ AGG VV++ 
Sbjct: 177 GRLICQKDSFLAGARGVSLGIHFQRKIMTGLFGGEGFIMQKLEGDGWVFVHAGGCVVERE 236

Query: 177 LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
           L  GE + VD  C+VA  S+V++ ++    I+   FGG+ +  A +TGPG V++QSLPF 
Sbjct: 237 LAAGERLDVDTGCVVAFHSTVDMDVRAVSGIKSMFFGGEGMFLATLTGPGKVWLQSLPFS 296

Query: 237 RLSQRI 242
           RL+ R+
Sbjct: 297 RLAGRM 302


>gi|440741756|ref|ZP_20921089.1| hypothetical protein A988_00190 [Pseudomonas syringae BRIP39023]
 gi|440378614|gb|ELQ15232.1| hypothetical protein A988_00190 [Pseudomonas syringae BRIP39023]
          Length = 358

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 83/237 (35%), Positives = 127/237 (53%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGRLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 295 MTQTVDYDIRTVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351


>gi|190572163|ref|YP_001970008.1| hypothetical protein Smlt0082 [Stenotrophomonas maltophilia K279a]
 gi|424666447|ref|ZP_18103474.1| TIGR00266 family protein [Stenotrophomonas maltophilia Ab55555]
 gi|190010085|emb|CAQ43693.1| conserved hypothetical protein [Stenotrophomonas maltophilia K279a]
 gi|401072302|gb|EJP80809.1| TIGR00266 family protein [Stenotrophomonas maltophilia Ab55555]
          Length = 338

 Score =  143 bits (360), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 124/231 (53%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
           I F+I+G E Q  +I L P E  +A  G++ F   +++M+ ++   N             
Sbjct: 77  IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGGFMGKVMA 136

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++ + V    G   G V  AAP    +L + L   GG L+CQ D+FL    
Sbjct: 137 AGKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGAR 196

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++     +++   + G EGF+ QKL G G  FI AGG VV++ L  GE + VD  C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERLDVDTGCVV 256

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A   +V++ ++    I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHPTVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307


>gi|386346080|ref|YP_006044329.1| hypothetical protein [Spirochaeta thermophila DSM 6578]
 gi|339411047|gb|AEJ60612.1| protein of unknown function DUF124 [Spirochaeta thermophila DSM
           6578]
          Length = 246

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 121/220 (55%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           I + I G + Q+ +++L P E V A  G+M +M   + M+         G  + L G++ 
Sbjct: 5   IDYTIHGDDMQLVEVILDPGEGVRAEAGAMLYMEEGITMDTRMEGGLLGGFKRILAGESF 64

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                 N G     V  AAP   +I+P+DL   GG  +CQ DAFLC+   + +S    +R
Sbjct: 65  FITNFVNTGSRRAAVAFAAPYPGKIVPLDLGTLGGRFICQKDAFLCAAVGIDISIVFTRR 124

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +     G EGF+ ++L G G AFI AGG++V+K L  GE + VD  C V  +  V+  ++
Sbjct: 125 LGAGFFGGEGFILEELRGDGWAFIHAGGTIVEKRLAAGETLRVDTGCFVGCSPPVDYDVR 184

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           + G  R A+FGG+ L    +TGPG+V++QSLP  RL+ RI
Sbjct: 185 FVGGFRNALFGGEGLFLTELTGPGVVYLQSLPLARLADRI 224


>gi|150401531|ref|YP_001325297.1| hypothetical protein Maeo_1108 [Methanococcus aeolicus Nankai-3]
 gi|150014234|gb|ABR56685.1| protein of unknown function DUF124 [Methanococcus aeolicus
           Nankai-3]
          Length = 262

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 81/232 (34%), Positives = 125/232 (53%), Gaps = 11/232 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLFG- 79
           + ++I G + QV ++ L P E V+A  G+M +M   +  E      ++   G+   LFG 
Sbjct: 6   VDYEIFGDDMQVVEVELDPNESVIAEAGAMNWMEDGINFETKMGDGSDANEGLVGKLFGA 65

Query: 80  --KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + IT   L      N G     V  AAP    I+PIDL+   GE  CQ D+FLC+   
Sbjct: 66  AKRVITGESLFLTHFTNVGRGKKRVAFAAPYPGSIIPIDLSKLDGEFFCQKDSFLCAALG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            KV  + ++++   + G EGF+ + L G G+ F+ AGG++V+K L  G+ I VD  CIV 
Sbjct: 126 TKVGISFNKKIGTGLFGGEGFIMEHLEGDGMIFVHAGGTIVKKELN-GDKIKVDTGCIVG 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
            T  ++  I+  G ++  +FG + L  A + G G V++QSLPF RL+ RI +
Sbjct: 185 FTKGIDFSIERAGGLKSMIFGSEGLFLATLQGHGTVYLQSLPFSRLADRILK 236


>gi|78358149|ref|YP_389598.1| hypothetical protein [Desulfovibrio alaskensis G20]
 gi|78220554|gb|ABB39903.1| protein of unknown function DUF124 [Desulfovibrio alaskensis G20]
          Length = 242

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 130/230 (56%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENE-----VGM 73
           + ++I GG+ QV ++ L P E V+A  G+MC+M G +E    M +    +       +G 
Sbjct: 6   VEYRITGGDLQVVEVELDPGETVIAEAGAMCWMDGDIEFAARMGDGSAADGGFFGKLLGA 65

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++      N G +   V  A      ++P+DLA  GGEL+CQ DAFLC+   
Sbjct: 66  GKRLVTGESLFMTHFTNQGQAKASVAFAGQVPGHVVPVDLAEIGGELICQRDAFLCAARG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            ++     +R+     G EGF+ Q+L G GLAF+ AGG+VV+K L  G  + VD  C+VA
Sbjct: 126 TRIDVAFSKRLGAGFFGGEGFVLQRLRGDGLAFVHAGGAVVRKELH-GGTLRVDTGCLVA 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  V+  I  +G ++  +FGG+    A ++G G VF+QS+PF+RL+ RI
Sbjct: 185 FTPGVSYDIGLSGGLKSMMFGGEGAFLAALSGHGTVFLQSMPFNRLADRI 234


>gi|359783147|ref|ZP_09286364.1| hypothetical protein PPL19_18852 [Pseudomonas psychrotolerans L19]
 gi|359369035|gb|EHK69609.1| hypothetical protein PPL19_18852 [Pseudomonas psychrotolerans L19]
          Length = 247

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 82/233 (35%), Positives = 130/233 (55%), Gaps = 10/233 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----W--- 74
           ++  +ILG   Q  +I L P E V+A  G+M +M   ++     + +   G+    W   
Sbjct: 5   ILDHRILGESLQAVEIGLDPGETVIAEAGAMTYMEADIDF-TARMGDGSAGLLGKLWGAG 63

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  AAP   +++P+DLA  GG L CQ D+FLC+    
Sbjct: 64  KRMLGGESLFMTHFTNEGREKRHVVFAAPYPGQVVPLDLAALGGTLFCQRDSFLCAAKGT 123

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           ++     +R+     G EGF+ Q+L G G+AF+ AGG+V++K L+ GE + +D  C+V  
Sbjct: 124 RIGVAFTRRLGAGFFGGEGFILQRLEGDGMAFVHAGGTVIRKELK-GETLRLDTGCLVGF 182

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           TS ++  IK  G +R  +FGG+ LV A ++G G V+IQ+LPF RL+ R+  A 
Sbjct: 183 TSGIDYNIKLAGGLRSMLFGGEGLVLATLSGHGTVWIQTLPFSRLAGRVQAAA 235


>gi|456737361|gb|EMF62056.1| DUF124 domain-containing protein [Stenotrophomonas maltophilia
           EPM1]
          Length = 338

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 124/231 (53%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
           I F+I+G E Q  +I L P E  +A  G++ F   +++M+ ++   N             
Sbjct: 77  IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGGFMGKVMA 136

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++ + V    G   G V  AAP    +L + L   GG L+CQ D+FL    
Sbjct: 137 AGKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLVCQKDSFLAGAR 196

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++     +++   + G EGF+ QKL G G  FI AGG VV++ L  GE + VD  C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERLDVDTGCVV 256

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A   +V++ ++    I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHPTVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307


>gi|408825445|ref|ZP_11210335.1| hypothetical protein PgenN_20122 [Pseudomonas geniculata N1]
          Length = 338

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 124/231 (53%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
           I F+I+G E Q  +I L P E  +A  G++ F   +++M+ ++   N             
Sbjct: 77  IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAANGDQGGGFMGKVMA 136

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++ + V    G   G V  AAP    +L + L   GG L+CQ D+FL    
Sbjct: 137 AGKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGAR 196

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++     +++   + G EGF+ QKL G G  FI AGG VV++ L  GE + VD  C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERLDVDTGCVV 256

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A   +V++ ++    I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHPTVDMDVRRVTGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307


>gi|422631722|ref|ZP_16696904.1| hypothetical protein PSYPI_19011 [Pseudomonas syringae pv. pisi
           str. 1704B]
 gi|330941577|gb|EGH44361.1| hypothetical protein PSYPI_19011 [Pseudomonas syringae pv. pisi
           str. 1704B]
          Length = 358

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 81/237 (34%), Positives = 127/237 (53%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+   +             
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLLGA 174

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351


>gi|194363814|ref|YP_002026424.1| hypothetical protein Smal_0036 [Stenotrophomonas maltophilia
           R551-3]
 gi|194346618|gb|ACF49741.1| protein of unknown function DUF124 [Stenotrophomonas maltophilia
           R551-3]
          Length = 338

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 126/231 (54%), Gaps = 11/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMW----- 74
           I F+I+G E Q  +I L P E  +A  G++ F   +++M+ ++     +   G+      
Sbjct: 77  IEFRIVGHEMQFVEIELDPGESAIAEAGALMFKDATVQMDTVFGAATGDQGGGLMGKVMA 136

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++ + V    G   G V  AAP    +L + L   GG L+CQ D+FL    
Sbjct: 137 AGKRVLTGESLFATVYTQSGHGKGKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGAR 196

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++     +++   + G EGF+ QKL G G  FI AGG VV++ L  GE + VD  C+V
Sbjct: 197 GVQIGVQFQRKIMTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERLDVDTGCVV 256

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A   +V++ ++    I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 257 AYHPTVDMDVRRVAGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRM 307


>gi|344942790|ref|ZP_08782077.1| protein of unknown function DUF124 [Methylobacter tundripaludum
           SV96]
 gi|344260077|gb|EGW20349.1| protein of unknown function DUF124 [Methylobacter tundripaludum
           SV96]
          Length = 368

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 83/245 (33%), Positives = 128/245 (52%), Gaps = 11/245 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
           I F+I+G E Q  +I L P E  VA  G+M + S  ++ME ++       + G+   L G
Sbjct: 110 IDFKIVGHEMQFVEIELDPGESAVAEAGAMMYKSADVKMETVFGDGSASQQAGIMGLLLG 169

Query: 80  --------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                   +++   V        G V  AAP    IL ++L  +  +L+CQ D+FL +  
Sbjct: 170 AGKRLLTNESLFITVFTQTAQCKGKVAFAAPFPGTILALNLKDYDRKLICQKDSFLAAAK 229

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++     +R+   + G EGF+ QKL G G  F+ AGG + +  L  GE + VD  C+ 
Sbjct: 230 GVQIGIHFQRRILTGLFGGEGFIMQKLEGDGWVFVHAGGCLQRIELSAGETLHVDTGCLA 289

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A+TSSV+  I+    I+  +FGG+ L  A +TGPG V++QSLPF RL+ R+  A      
Sbjct: 290 AMTSSVDFDIQQASGIKTMLFGGEGLFFAKLTGPGTVWLQSLPFSRLAGRMFSAAPQGGG 349

Query: 252 RENPK 256
           R+  +
Sbjct: 350 RQQDE 354


>gi|389799295|ref|ZP_10202293.1| hypothetical protein UUC_16100 [Rhodanobacter sp. 116-2]
 gi|388443286|gb|EIL99443.1| hypothetical protein UUC_16100 [Rhodanobacter sp. 116-2]
          Length = 337

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/237 (32%), Positives = 130/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I+G + Q  +I L P E  +A  G++ +   +++M+ ++   +  G          
Sbjct: 74  IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKEAAVQMDTVFGDGSSGGQSGGGLMDKL 133

Query: 75  -----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
                + + G+++ + V  + G     V  AAP    ++ + L+  GG L+CQ DAFL  
Sbjct: 134 LSAGKRVITGESLFTTVFTHAGQGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAG 193

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V++     +++   + G EGF+ QKL G G  F+ AGG+VVQ+ L+ GE + VD  C
Sbjct: 194 ARGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVQRELQAGERLDVDTGC 253

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA   +VN+ +K  G I+  +FGG+ +  A +TGPG V++QSLPF R++ R+  A 
Sbjct: 254 VVAFHDTVNMDVKPVGGIKSMLFGGEGVFLATLTGPGTVWLQSLPFSRMAGRMLAAA 310


>gi|389783910|ref|ZP_10195145.1| hypothetical protein UU7_14525 [Rhodanobacter spathiphylli B39]
 gi|388434114|gb|EIL91067.1| hypothetical protein UU7_14525 [Rhodanobacter spathiphylli B39]
          Length = 336

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/236 (32%), Positives = 128/236 (54%), Gaps = 12/236 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
           I ++I+G + Q  +I L P E  +A  G++ +   ++ M+ ++   +  G    L     
Sbjct: 74  IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKEAAVRMDTVFGDGSSGGQSGGLMDKLL 133

Query: 79  --------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
                   G+++ + V  + G     V  AAP    ++ + L+  GG L+CQ DAFL   
Sbjct: 134 SAGKRVITGESLFTTVFTHTGQGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAGA 193

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V++     +++   + G EGF+ QKL G G  F+ AGG+VVQ+ L+ GE + VD  C+
Sbjct: 194 RGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVQRELKAGERLDVDTGCV 253

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           VA   +VN+ ++  G I+  +FGG+ +  A +TGPG V++QSLPF R++ R+  A 
Sbjct: 254 VAFHDTVNMDVRPVGGIKSMLFGGEGVFLATLTGPGTVWLQSLPFSRMAGRMLAAA 309


>gi|409397854|ref|ZP_11248712.1| hypothetical protein C211_20088 [Pseudomonas sp. Chol1]
 gi|409117593|gb|EKM94020.1| hypothetical protein C211_20088 [Pseudomonas sp. Chol1]
          Length = 248

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 129/232 (55%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MW---- 74
           + + ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G    +W    
Sbjct: 6   LEYDILGQSAQSVEIILDPGETVIAEAGAMNYMTEGVRFETRMGDGSASGVLGKLWGMGK 65

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + L G+++      N G     V  AAP    ++P+DLA  GG L CQ D+FLC+    +
Sbjct: 66  RMLTGESLFLTHFTNNGKGKARVAFAAPYPGTVVPVDLAALGGRLTCQKDSFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           V   L +R+     G EGF+ Q+L G GLAF+ AGG+V++K L+  E + +D  C+VA +
Sbjct: 126 VGIALSKRIGAGFFGGEGFILQRLEGDGLAFLHAGGTVIRKELK-DETLRLDTGCLVAFS 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  I+  G ++  +FGG+ L+   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 DGIDYDIQLAGGLKSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYAAT 236


>gi|357415830|ref|YP_004928850.1| hypothetical protein DSC_00780 [Pseudoxanthomonas spadix BD-a59]
 gi|355333408|gb|AER54809.1| hypothetical protein DSC_00780 [Pseudoxanthomonas spadix BD-a59]
          Length = 335

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 78/236 (33%), Positives = 124/236 (52%), Gaps = 12/236 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
           I F+I+G + Q  ++ L P E  VA  GS+ F   S++M+ ++      G          
Sbjct: 74  IEFRIVGHDMQFVEVELDPGESAVAEAGSLMFKDASVQMDTVFGDGRTSGTGGGFMDKLL 133

Query: 79  --------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
                   G+++ + V  + G     V  AAP    ++ + L+  GG LLCQ D+FL   
Sbjct: 134 AAGKRVVTGESLFTTVYTHVGQGKAKVAFAAPYPGTVIALKLSEHGGRLLCQKDSFLAGA 193

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V++     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C+
Sbjct: 194 RGVQLGIAFQRKIMTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELAAGERIDVDTGCV 253

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           VA    V++ I+    I+   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 254 VAYHGGVDMDIRRVAGIKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 309


>gi|352080291|ref|ZP_08951360.1| protein of unknown function DUF124 [Rhodanobacter sp. 2APBS1]
 gi|351685000|gb|EHA68069.1| protein of unknown function DUF124 [Rhodanobacter sp. 2APBS1]
          Length = 337

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 76/237 (32%), Positives = 130/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I+G + Q  +I L P E  +A  G++ +   +++M+ ++   +  G          
Sbjct: 74  IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKEAAVQMDTVFGDGSSGGQSSGGLMDKL 133

Query: 75  -----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
                + + G+++ + V  + G     V  AAP    ++ + L+  GG L+CQ DAFL  
Sbjct: 134 LSAGKRVITGESLFTTVFTHAGQGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAG 193

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V++     +++   + G EGF+ QKL G G  F+ AGG+VVQ+ L+ GE + VD  C
Sbjct: 194 ARGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVQRELQAGERLDVDTGC 253

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA   +VN+ +K  G I+  +FGG+ +  A +TGPG V++QSLPF R++ R+  A 
Sbjct: 254 VVAFHDTVNMDVKPVGGIKSMLFGGEGVFLATLTGPGTVWLQSLPFSRMAGRMLAAA 310


>gi|334141548|ref|YP_004534754.1| hypothetical protein [Novosphingobium sp. PP1Y]
 gi|333939578|emb|CCA92936.1| conserved hypothetical protein [Novosphingobium sp. PP1Y]
          Length = 276

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 81/237 (34%), Positives = 124/237 (52%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           + F+I G E Q  +I L P E  VA  G+M +    +EM  ++   +           +G
Sbjct: 18  VDFEIKGQELQFVEIELDPGESAVAEAGAMVWKDAPIEMSTVFGDGSGGEGSGFMGKLLG 77

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+++ + V  + G     V  AAP    ILPI L   GG L+CQ DAFL +  
Sbjct: 78  AGKRLVTGESLFTTVFTHHGKGKARVAFAAPVPGAILPIKLDDVGGRLICQKDAFLAAAR 137

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     QR+   + G EGF+ QKL G G  F+  GG+V+++ L  GE + +D  C+ 
Sbjct: 138 GVSIGVHFQQRIMTGLFGGEGFIMQKLEGDGWVFVQMGGTVIERELAAGEELHIDTGCVA 197

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           A TS ++  +   G ++  +FGG+ +  A + GPG V++QSLPF RL+ R+  A  S
Sbjct: 198 AFTSGIDFDVIRAGSVKSMIFGGEGVFFARLRGPGKVWVQSLPFSRLAGRMLAAAGS 254


>gi|433677717|ref|ZP_20509665.1| putative protein M6_Spy0233 [Xanthomonas translucens pv.
           translucens DSM 18974]
 gi|430817166|emb|CCP40086.1| putative protein M6_Spy0233 [Xanthomonas translucens pv.
           translucens DSM 18974]
          Length = 337

 Score =  141 bits (356), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 124/230 (53%), Gaps = 10/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I F+I+G E Q  +I L P E  VA  G++ F   +++M+ ++   +  G          
Sbjct: 77  IDFRIVGHEMQFVEIALDPGESAVAEAGALMFKDAAVQMDTVFGDGSGQGGGLMGKLFSA 136

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + + G+++ + V  + G     V  AAP    +L + L   GG L+CQ D+FL     
Sbjct: 137 GKRLVTGESLFTTVFTHQGQGKATVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGARG 196

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+ AGG VV++ L  GE + VD  C+VA
Sbjct: 197 VSLGIHFQRKIMTGLFGGEGFIMQKLEGDGWVFVHAGGCVVERELAAGERLDVDTGCVVA 256

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             S+V++ ++    I+   FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 257 FHSTVDMDVRAVSGIKSMFFGGEGMFLATLTGPGKVWLQSLPFSRLAGRM 306


>gi|389810514|ref|ZP_10205843.1| hypothetical protein UUA_15723 [Rhodanobacter thiooxydans LCS2]
 gi|388440808|gb|EIL97141.1| hypothetical protein UUA_15723 [Rhodanobacter thiooxydans LCS2]
          Length = 336

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 76/236 (32%), Positives = 128/236 (54%), Gaps = 12/236 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
           I ++I+G + Q  +I L P E  +A  G++ +   +++M+ ++   +  G    L     
Sbjct: 74  IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKESAVQMDTVFGDGSSSGQGGGLMDKLL 133

Query: 79  --------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
                   G+++ + V  + G     V  AAP    ++ + L+  GG L+CQ DAFL   
Sbjct: 134 SAGKRVITGESLFTTVFTHAGSGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAGA 193

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V++     +++   + G EGF+ QKL G G  F+ AGG+VVQ+ L  GE + VD  C+
Sbjct: 194 RGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVQRELRAGERLDVDTGCV 253

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           VA   +V++ +K  G I+  +FGG+ +  A +TGPG V++QSLPF R++ R+  A 
Sbjct: 254 VAFHDTVSMDVKPVGGIKSMLFGGEGVFLATLTGPGTVWLQSLPFSRMAGRMLAAA 309


>gi|323489343|ref|ZP_08094573.1| hypothetical protein GPDM_08325 [Planococcus donghaensis MPA1U2]
 gi|323396984|gb|EGA89800.1| hypothetical protein GPDM_08325 [Planococcus donghaensis MPA1U2]
          Length = 266

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 81/232 (34%), Positives = 131/232 (56%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE-----------V 71
           I +++ G + Q  ++ L P E V+A  G++  M   + ME I+   ++           +
Sbjct: 6   IDYKLHGDDMQFVEVELDPAETVIAEAGALMMMEDGIVMETIFGDGSQSSGGSGLMGKLM 65

Query: 72  GMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
           G  + +  G+++      N G     V  AAP   +I+P+DL+   G+++CQ DAFL + 
Sbjct: 66  GAGKRIITGESLFMTTFTNNGTGKRHVSFAAPYPGKIIPMDLSTLNGKIICQKDAFLAAA 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V +     +++     G EGF+ QKL G GLAF+ AGG++ +KNL+ GEV+ VD  C+
Sbjct: 126 KGVSIGIEFQRKIGAGFFGGEGFIMQKLEGDGLAFVHAGGTIYRKNLQKGEVLRVDTGCL 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           VA+T  V+  I+    I+ A+FGG+ L  A + GPG V+IQS+PF RL+ R+
Sbjct: 186 VAMTGDVDYNIEAVPGIKTALFGGEGLFFATLRGPGSVWIQSMPFSRLASRV 237


>gi|416014288|ref|ZP_11562147.1| hypothetical protein PsgB076_03534 [Pseudomonas syringae pv.
           glycinea str. B076]
 gi|320326072|gb|EFW82129.1| hypothetical protein PsgB076_03534 [Pseudomonas syringae pv.
           glycinea str. B076]
          Length = 352

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 81/234 (34%), Positives = 126/234 (53%), Gaps = 11/234 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 109 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 168

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V        G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 169 GKRMLTGESLFTTVFSQQDSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 228

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 229 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 288

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A
Sbjct: 289 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAA 342


>gi|197106783|ref|YP_002132160.1| hypothetical protein PHZ_c3322 [Phenylobacterium zucineum HLK1]
 gi|196480203|gb|ACG79731.1| conserved hypothetical protein [Phenylobacterium zucineum HLK1]
          Length = 311

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 85/254 (33%), Positives = 133/254 (52%), Gaps = 12/254 (4%)

Query: 4   PFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN 63
           P   +P+  +   +  D V  F+I G E Q  +I L P E  VA  G++ +   S+EM  
Sbjct: 34  PMSQSPWSHHRNPNVADDV-DFEIKGQELQFVEIELDPGESAVAEAGALVWKDASVEMTT 92

Query: 64  IYIPENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDL 112
           ++   +           +G  + L  G+++ + V  + G     V  A+P+   ILP++L
Sbjct: 93  VFGDGSGGQGGGFMGKLLGAGKRLITGESLFTTVFTHQGQGKARVAFASPTPGAILPLNL 152

Query: 113 AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSV 172
           A  GG L+CQ D+FL +   V +     +RV   + G EGF+ Q+L G G  F+  GG+V
Sbjct: 153 AEVGGTLICQKDSFLAAARGVSIGVQFQRRVMTGLFGGEGFIMQRLEGDGWVFVQMGGTV 212

Query: 173 VQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           V++ L  G+ I VD  C+ A T SV+  +   G ++   FGG+ L  A + GPG V++QS
Sbjct: 213 VERELAAGQEIHVDTGCLAAYTPSVDFDLVMAGGVKSVFFGGEGLFFARLRGPGKVWVQS 272

Query: 233 LPFHRLSQRIARAV 246
           LPF RL+ R+  A 
Sbjct: 273 LPFSRLAGRMMAAA 286


>gi|419954779|ref|ZP_14470914.1| hypothetical protein YO5_07918 [Pseudomonas stutzeri TS44]
 gi|387968392|gb|EIK52682.1| hypothetical protein YO5_07918 [Pseudomonas stutzeri TS44]
          Length = 248

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 128/232 (55%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MW---- 74
           + + ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G    +W    
Sbjct: 6   LEYDILGQSAQSVEIILDPDETVIAEAGAMNYMTEGVRFETRMGDGSSSGVLGKLWGMGR 65

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           + L G+++      N G     V  AAP    ++P+DLA  GG L CQ D+FLC+    +
Sbjct: 66  RMLTGESLFLTHFTNSGKGKARVAFAAPYPGTVVPVDLAALGGRLTCQKDSFLCAAYGTR 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           V   L +R+     G EGF+ Q L G GLAF+ AGG+V++K L+  E + +D  C+VA +
Sbjct: 126 VGIALSKRIGAGFFGGEGFILQCLEGDGLAFLHAGGTVIRKELK-DETLRLDTGCLVAFS 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  I+  G ++  +FGG+ L+   + G G V+IQSLPF RL++R+  A 
Sbjct: 185 DGIDYDIQLAGGLKSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYAAT 236


>gi|281209445|gb|EFA83613.1| hypothetical protein PPL_02679 [Polysphondylium pallidum PN500]
          Length = 352

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 77/220 (35%), Positives = 123/220 (55%), Gaps = 1/220 (0%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I G E+QV Q+ L P   + A  G++  MS +MEM+          M + + G+ I  
Sbjct: 100 WEIFGHESQVIQLHLNPGTSITAETGALLEMSATMEMDTTARGGFMSSMKRMITGQGIFL 159

Query: 85  VVLRNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
              +N   ++   +  ++P +++IL I ++  GGEL+C+ +AFLC  ND+++   L  R 
Sbjct: 160 TKFKNSSDTETARITFSSPYISKILAIQMSEIGGELICKKNAFLCGDNDIEIGPRLTNRF 219

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
                G +GF+ QKLTG GLAFI   GS++ + L+  E I V   CIVA   ++   I++
Sbjct: 220 SIGFFGGQGFILQKLTGHGLAFIHGSGSIMYRMLKPMETIKVSTGCIVAFEPTIQYDIEW 279

Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
               +   FGG+ L  A ++GPG+V +QSLPF +L   IA
Sbjct: 280 VKGAKNIFFGGEGLFLATLSGPGLVILQSLPFEKLVSAIA 319


>gi|285019906|ref|YP_003377617.1| hypothetical protein XALc_3144 [Xanthomonas albilineans GPE PC73]
 gi|283475124|emb|CBA17623.1| hypothetical protein XALC_3144 [Xanthomonas albilineans GPE PC73]
          Length = 335

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 125/231 (54%), Gaps = 10/231 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF- 78
            I ++I+G E Q  +I L P E  VA  G++ F   +++M+ ++     N+ G+   L  
Sbjct: 74  AIDYRIVGHEMQFVEIELDPGESAVAEAGALMFKDTAVQMDTVFGDGSANQGGLMGKLLS 133

Query: 79  -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                  G T+ + V  + G     V  AAP    +L + L   GG L+CQ D+FL    
Sbjct: 134 AGKRLVTGATLFTTVFTHQGQGKAKVAFAAPYPGTVLAMKLDQHGGCLICQKDSFLAGAR 193

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +++   + G EGF+ QKL G G  F+ AGG VV++ L  GE + VD  C+V
Sbjct: 194 GVSLGIHFQRKIMTGLFGGEGFVMQKLEGDGWVFVHAGGCVVERELAAGERVEVDTGCVV 253

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A  ++V++ ++    I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 254 AFHATVDMDVRTVSGIKSMLFGGEGMFLATLTGPGKVWLQSLPFSRLAARM 304


>gi|359397666|ref|ZP_09190692.1| hypothetical protein NSU_0378 [Novosphingobium pentaromativorans
           US6-1]
 gi|357600857|gb|EHJ62550.1| hypothetical protein NSU_0378 [Novosphingobium pentaromativorans
           US6-1]
          Length = 348

 Score =  140 bits (354), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 80/237 (33%), Positives = 124/237 (52%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           + F+I G E Q  +I L P E  VA  G+M +    +EM  ++   +           +G
Sbjct: 90  VDFEIKGQELQFVEIELDPGESAVAEAGAMVWKDAPIEMSTVFGDGSGGEGSGFMGKLLG 149

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+++ + V  + G     V  AAP    ILPI L   GG L+CQ DAFL +  
Sbjct: 150 AGKRLVTGESLFTTVFTHHGKGKARVAFAAPVPGAILPIKLDDVGGRLICQKDAFLAAAR 209

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     QR+   + G EGF+ QKL G G  F+  GG+V+++ L  GE + +D  C+ 
Sbjct: 210 GVSIGVHFQQRIMTGLFGGEGFIMQKLEGDGWVFVQMGGTVIERELAAGEELHIDTGCVA 269

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           A T+ ++  +   G ++  +FGG+ +  A + GPG V++QSLPF RL+ R+  A  S
Sbjct: 270 AFTAGIDFDVIRAGSVKSMIFGGEGVFFARLRGPGKVWVQSLPFSRLAGRMLAAAGS 326


>gi|345864493|ref|ZP_08816693.1| hypothetical protein TevJSym_ao00180 [endosymbiont of Tevnia
           jerichonana (vent Tica)]
 gi|345124352|gb|EGW54232.1| hypothetical protein TevJSym_ao00180 [endosymbiont of Tevnia
           jerichonana (vent Tica)]
          Length = 250

 Score =  140 bits (353), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 132/231 (57%), Gaps = 12/231 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF-- 78
           I +QILG + Q+ ++ L P E V+A  G+M ++   +  E      +  + G++  LF  
Sbjct: 6   IDYQILGHDLQLVEVELDPGETVIAEAGAMTYLEQEISFETKMGDGSNPDQGIFGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                 G+++ +    + G +    V  +AP    ++P++LA  GG+++CQ DAFLC+  
Sbjct: 66  GKRMITGESLFTTHFTHSGSAGKRKVAFSAPFPGSVIPLNLAALGGKVICQKDAFLCAAL 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             K+     +++ + + G EGF+ Q L G G AFI AGG+VV+K L  GE + VD  C+V
Sbjct: 126 GTKLGIAFTKKLGSGLFGGEGFILQSLEGDGTAFIQAGGTVVEKQLN-GETLRVDTGCLV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             T  ++  I+ +G ++  +FGG+ +  A ++G G V+IQSLPF R++ RI
Sbjct: 185 GFTEGIDYSIEMSGGLKSMLFGGEGIFLATLSGSGTVWIQSLPFSRMADRI 235


>gi|380512373|ref|ZP_09855780.1| hypothetical protein XsacN4_14202 [Xanthomonas sacchari NCPPB 4393]
          Length = 337

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 79/235 (33%), Positives = 125/235 (53%), Gaps = 11/235 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP---ENEVGMWQWLF- 78
           I ++I+G E Q  +I L P E  VA  G++ F   ++ M+ ++      ++ G+   L  
Sbjct: 76  IDYRIVGHEMQFVEIELDPGESAVAEAGALMFKDAAVRMDTVFGDGSNADQGGLMGKLLS 135

Query: 79  -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                  G+++ + V  + G     V  AAP    +L + L   GG L+CQ D+FL    
Sbjct: 136 AGKRLVTGESLFTTVFTHQGHGKAKVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGAR 195

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     ++V   + G EGF+ QKL G G  F+ AGG VV++ L  GE + VD  C+V
Sbjct: 196 GVSLGIHFQRKVMTGLFGGEGFIMQKLEGDGWVFVHAGGCVVERELAAGERLDVDTGCVV 255

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           A  +SV++ ++    I+   FGG+ +  A +TGPG V++QSLPF RL+ R+  A 
Sbjct: 256 AFHASVDMDVRPVSGIKSMFFGGEGMFLATLTGPGKVWLQSLPFSRLAGRMLAAA 310


>gi|422641956|ref|ZP_16705377.1| hypothetical protein PSYCIT7_23825 [Pseudomonas syringae Cit 7]
 gi|330954341|gb|EGH54601.1| hypothetical protein PSYCIT7_23825 [Pseudomonas syringae Cit 7]
          Length = 358

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 81/237 (34%), Positives = 126/237 (53%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + +    G   G V  A P    ILP++L  FGG L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTMFSQQGSGKGRVAFAPPYPGTILPLNLRDFGGRLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A  S
Sbjct: 295 MTQTVDYDIRTVGGGIKSMLFGGEGVFFARLTGPGTVWLQSLPFSRLAGRMLAAGPS 351


>gi|167647234|ref|YP_001684897.1| hypothetical protein Caul_3272 [Caulobacter sp. K31]
 gi|167349664|gb|ABZ72399.1| protein of unknown function DUF124 [Caulobacter sp. K31]
          Length = 275

 Score =  140 bits (352), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 82/248 (33%), Positives = 128/248 (51%), Gaps = 13/248 (5%)

Query: 12  PYVYQSPQDAV--IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN 69
           P+ +Q        I F+I G E Q  +I L   E  VA  GS  +   S+ M  ++   +
Sbjct: 5   PWSHQRSASVADDIDFEIKGQELQFLEIELDAGESAVAEAGSFVWKDASVGMTTVFGDGS 64

Query: 70  E----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
                      +G  + L  G+++ + V  + G     V  A+P+   ILP++L   GG 
Sbjct: 65  GDNGGGFMGKLLGAGKRLVTGESLFTTVFTHNGAGKARVAFASPTPGAILPLNLGQLGGS 124

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           L+CQ D+FL +   V +     +RV   + G EGF+ Q+L G G  F+  GG++V++ L+
Sbjct: 125 LICQKDSFLAAARGVSIGVHFQKRVMTGLFGGEGFIMQRLDGDGWVFVQMGGTLVERELK 184

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
            GE + VD  C+ A T  V+  +   G +R  +FGG+ L  A + GPG V+IQSLPF RL
Sbjct: 185 AGEELHVDTGCLAAYTPGVDFDLVMAGGVRSVLFGGEGLFFARLRGPGKVWIQSLPFSRL 244

Query: 239 SQRIARAV 246
           + R+ +A 
Sbjct: 245 AGRMLQAA 252


>gi|218438880|ref|YP_002377209.1| hypothetical protein PCC7424_1909 [Cyanothece sp. PCC 7424]
 gi|218171608|gb|ACK70341.1| protein of unknown function DUF124 [Cyanothece sp. PCC 7424]
          Length = 234

 Score =  140 bits (352), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 125/229 (54%), Gaps = 1/229 (0%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           VI +QI G + Q+ +I L   E V A  G+M ++   +EM+         G  + L G+ 
Sbjct: 4   VIDYQIYGDDLQLVEITLDLGEGVRAEAGAMAYLEQGIEMQTSTGGGFLAGFKRALAGEG 63

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G     V  AAP   +I+P+ L   GG  +CQ DAF+C+   +++     +
Sbjct: 64  FFITNFANRGTRRAKVAFAAPYPGKIIPLQLGQLGGTFICQKDAFICAAKGIEIDVEFTK 123

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           R+     G EGF+ Q+L G GLAFI AGG++V+K L  GE I VD  C+VA + +V  +I
Sbjct: 124 RLGTGFFGGEGFILQRLQGDGLAFIHAGGALVEKRLMSGEDIRVDTGCVVAFSPTVRYEI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           +     +  +FGG+ L  A +TGPG +++QSLP  +L+ R+ R   SP+
Sbjct: 184 EMVKGFKNVLFGGEGLFLAKLTGPGTIYLQSLPLSKLAARL-RQYISPS 231


>gi|392421934|ref|YP_006458538.1| hypothetical protein A458_14430 [Pseudomonas stutzeri CCUG 29243]
 gi|390984122|gb|AFM34115.1| hypothetical protein A458_14430 [Pseudomonas stutzeri CCUG 29243]
          Length = 248

 Score =  140 bits (352), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 81/233 (34%), Positives = 128/233 (54%), Gaps = 9/233 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MW---- 74
           + ++ILG  AQ  +I+L P E V+A  G+M +M+  +  E      +  G    +W    
Sbjct: 6   LEYEILGQSAQSVEIILDPDETVIAEAGAMNYMTDGVRFEARMGDGSASGVLGKLWGAGK 65

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           +   G+++      N   +   V  AAP    ++ IDLA  GG+L CQ DAFLC+     
Sbjct: 66  RMFTGESLFLTHFTNESKAKARVAFAAPYPGTVVAIDLAAVGGKLFCQKDAFLCAAYGTC 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  +  +R+     G EGF+ Q+L G GLAF+ AGG+V++K L   E + +D  C+VA T
Sbjct: 126 IGISFSKRIGAGFFGGEGFILQRLEGDGLAFVHAGGTVIRKELN-NETLRLDTGCLVAFT 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
             ++  I+  G +R  +FGG+ L+   + G G V+IQSLPF RL++R+  A +
Sbjct: 185 QGIDYDIQLAGGLRSMLFGGEGLLLTTLKGTGSVWIQSLPFSRLAERVYEATS 237


>gi|116753572|ref|YP_842690.1| hypothetical protein Mthe_0253 [Methanosaeta thermophila PT]
 gi|116665023|gb|ABK14050.1| protein of unknown function DUF124 [Methanosaeta thermophila PT]
          Length = 233

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 133/230 (57%), Gaps = 3/230 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + + ILG + QV ++ + P E + +  G+M FM   ++M +        G+ + + G+ +
Sbjct: 3   LRYTILGDDLQVLEVQMAPGESIRSEAGAMLFMDDEIDM-SASTGGIIKGLKRAVAGEGL 61

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                 N       +  +AP   +ILP++L  FGG+LLCQ  AFLC  + V++     +R
Sbjct: 62  FITSFTNRSDDMKRMAFSAPFPGKILPLNLDEFGGQLLCQRGAFLCGESGVEIELAFTKR 121

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +     G EGF+ Q+L G+G+ FI AGG+++ ++L+  EVI VD  C+VA +  V+  I+
Sbjct: 122 LGAGFFGGEGFVLQRLRGRGMVFIHAGGALITRDLQEEEVINVDTGCLVAFSRDVDYDIR 181

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI--ARAVTSPN 250
           + G IR A+FGG+ +  A + GPG V++QSLP  RL+ RI   R V+S N
Sbjct: 182 FVGGIRNALFGGEGIFLARLKGPGRVYLQSLPLSRLADRILQGRVVSSNN 231


>gi|209965162|ref|YP_002298077.1| hypothetical protein RC1_1868 [Rhodospirillum centenum SW]
 gi|209958628|gb|ACI99264.1| conserved hypothetical protein [Rhodospirillum centenum SW]
          Length = 335

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/236 (35%), Positives = 123/236 (52%), Gaps = 11/236 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI----PENEVGMWQWL- 77
           I F+I G E Q  ++ L P E  VA  G+M +    +EM  I+     PE    M + + 
Sbjct: 78  IDFRIYGEEMQFVEVELDPGESAVAEAGAMVYKDSRVEMTTIFGDGSGPEGGSFMDKLIG 137

Query: 78  ------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                  G+++ + V  + G   G V   AP    IL + L+  GG L+CQ D+FL +  
Sbjct: 138 AGKRVITGESLFTTVFTHQGTGKGKVAFGAPYPGTILALKLSDVGGRLICQKDSFLAAAK 197

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +R+   + G EGF+ Q L G G  F+  GG+VV++ L  GE + VD  C+ 
Sbjct: 198 GVSIGIHFQRRILTGLFGGEGFIMQNLAGDGWVFVHVGGTVVERQLAAGEELHVDTGCLA 257

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
           A+T  V+  +   G I+  VFGG+ +  A + GPG V+IQSLPF RL+ R+  A T
Sbjct: 258 AMTPEVDFDLVQVGGIKSMVFGGEGVFFASLRGPGRVWIQSLPFSRLAGRMLAAAT 313


>gi|295688666|ref|YP_003592359.1| hypothetical protein Cseg_1241 [Caulobacter segnis ATCC 21756]
 gi|295430569|gb|ADG09741.1| protein of unknown function DUF124 [Caulobacter segnis ATCC 21756]
          Length = 275

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/252 (31%), Positives = 132/252 (52%), Gaps = 13/252 (5%)

Query: 9   PFQPYVYQSPQDAV--IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           P  P+ +Q        I F+I G + Q  +I L P E  VA  G+  +   +++M  ++ 
Sbjct: 2   PSSPWSHQRNASVADDIDFEIKGDDLQFVEIELDPGESAVAEAGAFVWKDATVQMTTVFG 61

Query: 67  PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
             +           +G  + L  G+++ + V  + G     V  A+P+   ILP++LA  
Sbjct: 62  DGSADQGGGFMGKLLGAGKRLVTGESLFTTVFTHTGSGKARVAFASPTPGSILPLNLAEL 121

Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
           GG L+CQ D+FL +   V +     +R+   + G EGF+ Q+L G G  F+  GG++V++
Sbjct: 122 GGTLICQKDSFLAAARGVSIGLHFQKRMMTGLFGGEGFIMQRLEGDGWVFVQMGGALVER 181

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
            L  GE + +D  C+ A T SV+  +   G ++  +FGG+ L  A + GPG V+IQSLPF
Sbjct: 182 ELAPGEELHIDTGCVAAYTPSVDFDLVMAGGVKSVLFGGEGLFFARLRGPGKVWIQSLPF 241

Query: 236 HRLSQRIARAVT 247
            RL+ R+ +A  
Sbjct: 242 ARLAGRMLQAAN 253


>gi|399063909|ref|ZP_10747019.1| hypothetical protein PMI02_03433 [Novosphingobium sp. AP12]
 gi|398031371|gb|EJL24758.1| hypothetical protein PMI02_03433 [Novosphingobium sp. AP12]
          Length = 276

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/237 (33%), Positives = 125/237 (52%), Gaps = 11/237 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE----------VG 72
           + F+I G E Q  +I L P E  VA  G+M +   +++M  ++   +           +G
Sbjct: 18  VDFEIKGQELQFVEIELDPGESAVAEAGAMVWKDAAIDMTTVFGDGSGDQGGGFMGKLLG 77

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+++ + V  + G     V  AAP    ILPI L    G+L+CQ DAFL +  
Sbjct: 78  AGKRLVTGESLFTTVFTHRGSGKARVSFAAPVPGAILPIKLDDVDGQLICQKDAFLAAAR 137

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     Q++   + G EGF+ QKL G G  F+  GG+V+++ L  GE + VD  C+ 
Sbjct: 138 GVSIGIHFQQKIMTGLFGGEGFIMQKLEGDGWVFVQMGGTVIERELRAGEELHVDTGCVA 197

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           A+T+ V   +   G ++  +FGG+ +  A + GPG V+IQSLPF RL+ R+  A  S
Sbjct: 198 AMTAGVGFDVIRAGSVKSMIFGGEGVFFARLRGPGKVWIQSLPFSRLAGRMLAAAGS 254


>gi|329851756|ref|ZP_08266437.1| hypothetical protein ABI_45250 [Asticcacaulis biprosthecum C19]
 gi|328839605|gb|EGF89178.1| hypothetical protein ABI_45250 [Asticcacaulis biprosthecum C19]
          Length = 279

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 87/261 (33%), Positives = 131/261 (50%), Gaps = 13/261 (4%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           S P   +   S     I F+I G E Q  +I L P E  +A  G+M +   S+ M  ++ 
Sbjct: 5   SVPTTHHHRSSGVADDIDFEIKGQELQFVEIELDPGESAIAEAGAMVWKDASVGMTTVFG 64

Query: 67  PENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF 115
             +           +G  + L  G+++ + V  + G     V  +AP    I+P+ L+  
Sbjct: 65  DASGQQGGGFMGALLGAGKRLITGESLFTTVFTHNGHGKARVAFSAPVPGAIVPLKLSDV 124

Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
           GG L+CQ D+FL +   V +     +R+   + G EGF+ QKL G G  F+  GG VV++
Sbjct: 125 GGRLICQKDSFLAAAKGVSLGIAFQKRIMTGLFGGEGFIMQKLEGDGWVFVQFGGMVVER 184

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
            L+ GE + VD  CI A T +V+  +   G I+  +F G+ +  A +TGPG V+IQSLPF
Sbjct: 185 ELKAGEQLHVDTGCIAAFTDTVDFDLIQAGGIKSMLFAGEGVFFAQLTGPGKVWIQSLPF 244

Query: 236 HRLSQRIARAV--TSPNMREN 254
            RL+ RIA       PN  E 
Sbjct: 245 SRLAGRIAAVAGGAGPNRGEG 265


>gi|16125367|ref|NP_419931.1| hypothetical protein CC_1115 [Caulobacter crescentus CB15]
 gi|221234109|ref|YP_002516545.1| hypothetical protein CCNA_01172 [Caulobacter crescentus NA1000]
 gi|13422427|gb|AAK23099.1| conserved hypothetical protein [Caulobacter crescentus CB15]
 gi|220963281|gb|ACL94637.1| hypothetical protein CCNA_01172 [Caulobacter crescentus NA1000]
          Length = 280

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/255 (31%), Positives = 131/255 (51%), Gaps = 13/255 (5%)

Query: 7   STPFQPYVYQSPQDAV--IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENI 64
           S P  P+ +Q   +    I F+I G + Q  +I L P E  VA  G+  +   +++M  +
Sbjct: 5   SMPTSPWSHQRNANIADDIDFEIKGEDLQFVEIELDPGESAVAEAGAFVWKDATVQMTTV 64

Query: 65  YIPENE----------VGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA 113
           +               +G  + L  G+++ + V  + G     V  A+P+   ILP++L 
Sbjct: 65  FGDGAADQGGGFMGKLLGAGKRLITGESLFTTVFTHHGTGKARVAFASPTPGTILPLNLG 124

Query: 114 MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVV 173
             GG L+CQ D+FL +   V +     +R+   + G EGF+ Q+L G G  F+  GG++V
Sbjct: 125 QLGGTLICQKDSFLAAARGVSIGLHFQKRMMTGLFGGEGFIMQRLEGDGWVFVQMGGTLV 184

Query: 174 QKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
           ++ L  GE + +D  C+ A T  V+  +   G ++  +FGG+ L  A V GPG V+IQSL
Sbjct: 185 ERTLAPGEELHIDTGCLAAYTPDVDFDLVMAGGVKSVLFGGEGLFFARVRGPGKVWIQSL 244

Query: 234 PFHRLSQRIARAVTS 248
           PF RL+ R+ +A   
Sbjct: 245 PFARLAGRMLQAANH 259


>gi|410093948|ref|ZP_11290411.1| hypothetical protein AAI_24394 [Pseudomonas viridiflava UASWS0038]
 gi|409758661|gb|EKN43938.1| hypothetical protein AAI_24394 [Pseudomonas viridiflava UASWS0038]
          Length = 362

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 84/244 (34%), Positives = 129/244 (52%), Gaps = 12/244 (4%)

Query: 13  YVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENE 70
           Y   S  D V  F++ G E Q  ++ L   E  VA  G++ + +  ++ME I+       
Sbjct: 110 YGSGSATDGV-DFKLYGTETQFVELELDRGESAVAEAGALMYKTCDVQMETIFGDGSNQS 168

Query: 71  VGMWQWLFG--------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQ 122
            G+   LFG        +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ
Sbjct: 169 SGLLGSLFGAGKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLGDFGGKLICQ 228

Query: 123 PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEV 182
            D+FL     V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE 
Sbjct: 229 KDSFLAGAKGVSIGIQFQKKILTGLFGGEGFVLQKLEGDGWVFVHMGGTVRKIELAAGEG 288

Query: 183 ITVDVSCIVAVTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQR 241
           + VD  C+ A+T +V+  I+   G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R
Sbjct: 289 LDVDTGCLAAMTQTVDYDIRMVGGGIKSMLFGGEGMFFARLTGPGTVWLQSLPFSRLAGR 348

Query: 242 IARA 245
           +  A
Sbjct: 349 MLAA 352


>gi|393202307|ref|YP_006464149.1| hypothetical protein SSIL_3580 [Solibacillus silvestris StLB046]
 gi|406666586|ref|ZP_11074352.1| hypothetical protein B857_02159 [Bacillus isronensis B3W22]
 gi|327441638|dbj|BAK18003.1| uncharacterized conserved protein [Solibacillus silvestris StLB046]
 gi|405385604|gb|EKB45037.1| hypothetical protein B857_02159 [Bacillus isronensis B3W22]
          Length = 265

 Score =  139 bits (350), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/240 (33%), Positives = 130/240 (54%), Gaps = 13/240 (5%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           ++  I + + G + Q  ++ L PQE VVA  GS+  M   ++ME ++  +        L 
Sbjct: 2   KNHEIDYVLHGDDMQFVEVELDPQETVVAEAGSLMMMEDQIQMETVF-GDGSANQGSGLM 60

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +                N G     V  A+P   +I+P++L+   G+++CQ DAF
Sbjct: 61  GKLLGAGKRLITGESLFMTTFTNNGMGKRKVYFASPYPGKIIPMNLSELDGKIICQKDAF 120

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           L +   V V     +++     G EGF+ QKL G G+AF+ AGG++ ++ L+ GE + +D
Sbjct: 121 LAAAKGVSVGVEFQKKLGTGFFGGEGFIMQKLEGDGMAFVHAGGTIYERKLQPGETLRID 180

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             C+VA+T  VN  I+    I+ A+FGG+ L  A ++GPG V++QSLPF RL+ R+  A 
Sbjct: 181 TGCLVAMTQDVNYNIEMVSGIKTALFGGEGLFFATLSGPGTVWVQSLPFSRLASRVFAAA 240


>gi|167948526|ref|ZP_02535600.1| hypothetical protein Epers_19156 [Endoriftia persephone
           'Hot96_1+Hot96_2']
          Length = 289

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 77/231 (33%), Positives = 132/231 (57%), Gaps = 12/231 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF-- 78
           I ++ILG + Q+ ++ L P E V+A  G+M ++   +  E      +  + G++  LF  
Sbjct: 6   IDYKILGHDLQLVEVELDPGETVIAEAGAMTYLEQEISFETKMGDGSNPDQGIFGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                 G+++ +    + G +    V  +AP    ++P++LA  GG+++CQ DAFLC+  
Sbjct: 66  GKRMITGESLFTTHFTHSGSAGKRKVAFSAPFPGSVIPLNLAALGGKVICQKDAFLCAAL 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             K+     +++ + + G EGF+ Q L G G AFI AGG+VV+K L  GE + VD  C+V
Sbjct: 126 GTKLGIAFTKKLGSGLFGGEGFILQSLEGDGTAFIQAGGTVVEKQLN-GETLRVDTGCLV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             T  ++  I+ +G ++  +FGG+ +  A ++G G V+IQSLPF R++ RI
Sbjct: 185 GFTEGIDYSIEMSGGLKSMLFGGEGIFLATLSGSGTVWIQSLPFSRMADRI 235


>gi|114047855|ref|YP_738405.1| hypothetical protein Shewmr7_2361 [Shewanella sp. MR-7]
 gi|113889297|gb|ABI43348.1| protein of unknown function DUF124 [Shewanella sp. MR-7]
          Length = 264

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 128/230 (55%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMWQWLFG- 79
           + ++I+G   Q+ ++ L P E V+A  G+M ++    S E +     E E G +  L G 
Sbjct: 8   VDYEIIGHSMQLVEVELDPHETVIAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  ++I      N G     V  AAP    IL +DLA +GGEL+CQ D+FL +   
Sbjct: 68  GKRVLTGESIFMTHFTNLGHQKRKVAFAAPYPGTILALDLAQYGGELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +VS   ++R+     G EGF+ Q L G G+AFI AGG++++K L+ GE + VD  C+V 
Sbjct: 128 TRVSMKFNRRLGTGFFGGEGFILQSLQGDGMAFIHAGGTLIKKELK-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A +TG G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLTGHGTVWLQSLPFSRMADRI 236


>gi|383640052|ref|ZP_09952458.1| hypothetical protein SeloA3_00235 [Sphingomonas elodea ATCC 31461]
 gi|32170807|gb|AAP57683.1| conserved hypothetical protein [Sphingomonas elodea ATCC 31461]
          Length = 276

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 82/252 (32%), Positives = 132/252 (52%), Gaps = 12/252 (4%)

Query: 9   PFQPYVYQSPQDAV--IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           P  P+ +   +     I F+I G E Q  +I L P E  VA  G+M +    + M+ ++ 
Sbjct: 3   PMSPWSHTRREGYADDIDFEIKGQELQFVEIELDPGESAVAEAGAMVWKDAPIAMQTVFG 62

Query: 67  P-ENEVGMW---------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFG 116
              N+ G +         + + G+ + + V  + G     V  A+P    I+P+ LA  G
Sbjct: 63  DGSNQGGGFMDKLLGAGKRLVTGEGLFTTVFTHQGSGKARVAFASPVPGAIVPLRLADIG 122

Query: 117 GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKN 176
           G L+CQ DAFL +   V++     +R+   + G EGF+ Q+L G G  F+  GG++V++ 
Sbjct: 123 GTLICQKDAFLAAAKGVQMGIAFQRRIMTGLFGGEGFIMQRLDGDGWVFVQMGGAIVERE 182

Query: 177 LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
           L  GE + VD  C+ A+T SV+  +   G +R  +FGG+ +  A + GPG V+IQSLPF 
Sbjct: 183 LAPGEELHVDTGCLAALTPSVDFDLVSVGGVRSMLFGGEGVFFARLRGPGKVWIQSLPFA 242

Query: 237 RLSQRIARAVTS 248
           RL+ R+  A + 
Sbjct: 243 RLAGRMMAAASG 254


>gi|372272302|ref|ZP_09508350.1| hypothetical protein MstaS_14538 [Marinobacterium stanieri S30]
          Length = 263

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 129/230 (56%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF-- 78
           + ++++G E Q+ ++ L P E V+A  G+M +M   ++ E      +  + G+   LF  
Sbjct: 6   LDYELIGSEMQLVEVELDPNETVIAEAGAMTYMEEDIQFETRMGDGSNPDEGIMGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G++I +    N G     V  AAP    I+P+++A  G  ++CQ DAFLC+   
Sbjct: 66  GKRVFTGESIFTTHFTNQGIGKRRVAFAAPYPGNIVPLNMAELGERIVCQKDAFLCAALG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +VS T ++R+     G EGF+ Q L G G+AFI AGG+V++K L+ GE + VD  C+V 
Sbjct: 126 TRVSITFNRRIGAGFFGGEGFILQTLEGDGMAFIQAGGTVIKKELK-GETLRVDTGCLVG 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  V   I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 185 FTEGVEYDIQRAGGLKSMVFGGEGLFLATLKGTGTVWLQSLPFSRMADRI 234


>gi|304392201|ref|ZP_07374143.1| conserved hypothetical protein [Ahrensia sp. R2A130]
 gi|303296430|gb|EFL90788.1| conserved hypothetical protein [Ahrensia sp. R2A130]
          Length = 282

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 80/225 (35%), Positives = 116/225 (51%), Gaps = 11/225 (4%)

Query: 29  GGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY------IPENEVGMW-----QWL 77
           G E Q  +I L P E  VA  GSM F + S+EM   +       P   +G       + +
Sbjct: 30  GSEMQFVEIELDPGESAVAEAGSMMFKAPSIEMTTAFGDGSYKAPAGLLGKLAGAGKRLI 89

Query: 78  FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
            G+++ + V  + G     V  A+P    I  + L   GG L+CQ DAFLC+   V +  
Sbjct: 90  TGESLFTTVFTHNGDGKARVAFASPYPGHIAAVKLDEVGGRLICQKDAFLCAARGVTIGI 149

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              ++V   + G EGF+ QKL G G AF+  GG+   ++L  GE I VD  C+ A T  +
Sbjct: 150 EFQRKVMTGLFGGEGFIMQKLEGDGWAFVHVGGTTTVRDLAPGEEIHVDTGCVAAYTGDI 209

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +      G I+ ++FGG+ +  A + GPG V+IQSLPF RL+ RI
Sbjct: 210 DFDFVRAGSIKSSIFGGEGMFFAKLRGPGRVWIQSLPFSRLAGRI 254


>gi|21111117|gb|AAM39479.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
 gi|66571846|gb|AAY47256.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 287

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 129/239 (53%), Gaps = 15/239 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++     + + G    L G
Sbjct: 22  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHDGQGGSSGGLMG 81

Query: 80  KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
           K ++            + V  + G     V  AAP    +L + L+  GG L+CQ D+FL
Sbjct: 82  KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGRLICQKDSFL 141

Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
                V +     +++   + G EGF+ QKL G G  F+ AGG+VV+++L  GE I VD 
Sbjct: 142 AGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERDLAPGERIDVDT 201

Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            C+VA  +SV++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 202 GCVVAYHASVDMDVRRIAGLKSMFFGGEGVFLATLTGPGKVWMQSLPFSRMAGRMLQAA 260


>gi|77747736|ref|NP_635555.2| hypothetical protein XCC0160 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|77761073|ref|YP_241276.2| hypothetical protein XC_0169 [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 338

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 129/239 (53%), Gaps = 15/239 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++     + + G    L G
Sbjct: 73  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHDGQGGSSGGLMG 132

Query: 80  KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
           K ++            + V  + G     V  AAP    +L + L+  GG L+CQ D+FL
Sbjct: 133 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGRLICQKDSFL 192

Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
                V +     +++   + G EGF+ QKL G G  F+ AGG+VV+++L  GE I VD 
Sbjct: 193 AGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERDLAPGERIDVDT 252

Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            C+VA  +SV++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 253 GCVVAYHASVDMDVRRIAGLKSMFFGGEGVFLATLTGPGKVWMQSLPFSRMAGRMLQAA 311


>gi|34495639|ref|NP_899854.1| hypothetical protein CV_0184 [Chromobacterium violaceum ATCC 12472]
 gi|34101494|gb|AAQ57863.1| conserved hypothetical protein [Chromobacterium violaceum ATCC
           12472]
          Length = 261

 Score =  137 bits (346), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 77/235 (32%), Positives = 124/235 (52%), Gaps = 10/235 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---------IPENEVG 72
           VI +++ G + Q  ++ L P E  V   G++ +M   + M+ ++         +  + +G
Sbjct: 5   VIDYRVFGDDMQYVEVELDPGEAAVGEAGALYYMQDGIAMDTVFGDGSGRDGGVLGSLLG 64

Query: 73  MWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
             + L  G+++ + V  N       V  AA +  +I+P+ L   GG L  Q D+FL    
Sbjct: 65  AGKRLLTGESVFTTVFANQSAERRKVAFAAATPGKIVPVHLLELGGTLYAQKDSFLAGAK 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +     +R+   + G EGF+ QKL G G  F+ AGG++ +  L  GE + VD  C+V
Sbjct: 125 GVSLGLAWQKRIGTGLFGGEGFIMQKLEGDGYVFLHAGGALTELQLRPGETVRVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           A   SV+  I+Y G ++ A+FGG+ L  A +TGPG V++QSLP  RL+ RI  A 
Sbjct: 185 AYQPSVDFDIEYVGKLKSALFGGEGLFFARLTGPGRVWLQSLPLSRLADRIVAAA 239


>gi|21106235|gb|AAM35070.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 287

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G      M + 
Sbjct: 24  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 83

Query: 77  L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           L        G+++ + V  + G     V  AAP    +L + L+  GG+L+CQ D+FL  
Sbjct: 84  LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 143

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C
Sbjct: 144 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 203

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 204 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 260


>gi|319785664|ref|YP_004145139.1| hypothetical protein Psesu_0045 [Pseudoxanthomonas suwonensis 11-1]
 gi|317464176|gb|ADV25908.1| protein of unknown function DUF124 [Pseudoxanthomonas suwonensis
           11-1]
          Length = 341

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 76/236 (32%), Positives = 125/236 (52%), Gaps = 12/236 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I F+I G E Q  +I L P E  +A  G++ F   +++M+ ++   +  G          
Sbjct: 79  IEFRIHGQEMQFVEIELDPGESAIAEAGALMFKDSAVQMDTVFGDGSHQGQGGGFMDKLL 138

Query: 75  ----QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
               + + G+++ + +  + G     V  AAP    +L + L   GG ++CQ D+FL   
Sbjct: 139 SAGKRVITGESLFTTMFTHGGSGKARVAFAAPYPGTVLAMKLDEHGGRIICQKDSFLAGA 198

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V++     +++   + G EGF+ QKL G G  FI AGG VV++ L  GE I VD  C+
Sbjct: 199 RGVQIGVHFQRKILTGLFGGEGFIMQKLEGDGWVFIHAGGCVVERELAAGERIDVDTGCV 258

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           V   + V++ ++  G I+  +FGG+ +  A +TGPG V++QSLPF RL+ R+  A 
Sbjct: 259 VGYHAGVSMDVRPVGGIKSMLFGGEGVFLATLTGPGKVWLQSLPFSRLAGRMLAAA 314


>gi|390992655|ref|ZP_10262879.1| putative uncharacterized domain protein [Xanthomonas axonopodis pv.
           punicae str. LMG 859]
 gi|372552611|emb|CCF69854.1| putative uncharacterized domain protein [Xanthomonas axonopodis pv.
           punicae str. LMG 859]
          Length = 335

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G      M + 
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131

Query: 77  L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           L        G+++ + V  + G     V  AAP    +L + L+  GG+L+CQ D+FL  
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308


>gi|77748515|ref|NP_640534.2| hypothetical protein XAC0178 [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 335

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G      M + 
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131

Query: 77  L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           L        G+++ + V  + G     V  AAP    +L + L+  GG+L+CQ D+FL  
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308


>gi|384429987|ref|YP_005639348.1| hypothetical protein XCR_4374 [Xanthomonas campestris pv. raphani
           756C]
 gi|341939091|gb|AEL09230.1| conserved hypothetical protein [Xanthomonas campestris pv. raphani
           756C]
          Length = 338

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 129/239 (53%), Gaps = 15/239 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++     + + G    L G
Sbjct: 73  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDSAVQMDTVFGDGSHDGQGGSSGGLMG 132

Query: 80  KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
           K ++            + V  + G     V  AAP    +L + L+  GG L+CQ D+FL
Sbjct: 133 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGRLICQKDSFL 192

Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
                V +     +++   + G EGF+ QKL G G  F+ AGG+VV+++L  GE I VD 
Sbjct: 193 AGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERDLAPGERIDVDT 252

Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            C+VA  +SV++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 253 GCVVAYHASVDMDVRRIAGLKSMFFGGEGVFLATLTGPGKVWMQSLPFSRMAGRMLQAA 311


>gi|418521253|ref|ZP_13087298.1| hypothetical protein WS7_09543 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410702802|gb|EKQ61302.1| hypothetical protein WS7_09543 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 335

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G      M + 
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131

Query: 77  L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           L        G+++ + V  + G     V  AAP    +L + L+  GG+L+CQ D+FL  
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308


>gi|188989575|ref|YP_001901585.1| hypothetical protein xccb100_0179 [Xanthomonas campestris pv.
           campestris str. B100]
 gi|167731335|emb|CAP49510.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris]
          Length = 338

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 74/239 (30%), Positives = 129/239 (53%), Gaps = 15/239 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++     + + G    L G
Sbjct: 73  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHDGQGGSSGGLMG 132

Query: 80  KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
           K ++            + V  + G     V  AAP    +L + L+  GG L+CQ D+FL
Sbjct: 133 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGRLICQKDSFL 192

Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
                V +     +++   + G EGF+ QKL G G  F+ AGG+VV+++L  GE I VD 
Sbjct: 193 AGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERDLAPGERIDVDT 252

Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            C+VA  +SV++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 253 GCVVAYHASVDMDVRRIAGLKSMFFGGEGVFLATLTGPGKVWMQSLPFSRMAGRMLQAA 311


>gi|78045718|ref|YP_361893.1| hypothetical protein XCV0162 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|325927462|ref|ZP_08188706.1| hypothetical protein TIGR00266 [Xanthomonas perforans 91-118]
 gi|78034148|emb|CAJ21793.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|325542163|gb|EGD13661.1| hypothetical protein TIGR00266 [Xanthomonas perforans 91-118]
          Length = 335

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/248 (30%), Positives = 132/248 (53%), Gaps = 13/248 (5%)

Query: 12  PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV 71
           P    S +   I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  
Sbjct: 61  PLTGMSGRADEIDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHA 120

Query: 72  G------MWQWL-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
           G      M + L        G+++ + V  + G     V  AAP    +L + L+  GG+
Sbjct: 121 GQSGGGLMGKLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQ 180

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           L+CQ D+FL     V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L 
Sbjct: 181 LICQKDSFLAGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELG 240

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
            GE I VD  C+VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R+
Sbjct: 241 PGERIDVDTGCVVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRM 300

Query: 239 SQRIARAV 246
           + R+ +A 
Sbjct: 301 AGRMLQAA 308


>gi|327398767|ref|YP_004339636.1| hypothetical protein Hipma_0607 [Hippea maritima DSM 10411]
 gi|327181396|gb|AEA33577.1| protein of unknown function DUF124 [Hippea maritima DSM 10411]
          Length = 264

 Score =  137 bits (345), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 123/230 (53%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIP--ENEVGMWQWLF-- 78
           + ++ILG + Q+ ++ L P E V+A  G+M +M   +  E       + + G+   LF  
Sbjct: 6   VDYEILGDDMQIVEVELDPNETVIAEAGAMNYMEEGITFEAKLGDGSDADSGLLGKLFKA 65

Query: 79  GKTITS------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
           GK + +          N       V  +AP   +I+P+D+    G+LLCQ DAFLC+   
Sbjct: 66  GKRMLAQESLFLTHFTNRDSKKRKVAFSAPYPGKIIPVDMGKLQGDLLCQKDAFLCAAYG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
             +     +R    + G EGF+ ++L+G G+ FI AGG+V++K L  GE + VD  CIVA
Sbjct: 126 TSIDIAFTKRFGTGLFGGEGFILERLSGDGMVFIHAGGTVIKKQLN-GEKLRVDTGCIVA 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
              S++  I+    ++   FGG+ L  A + G G V++QSLPF RL+ RI
Sbjct: 185 FDESIDYSIEMVKGLKSMFFGGEGLFLATLQGYGTVYLQSLPFSRLADRI 234


>gi|383316796|ref|YP_005377638.1| hypothetical protein [Frateuria aurantia DSM 6220]
 gi|379043900|gb|AFC85956.1| TIGR00266 family protein [Frateuria aurantia DSM 6220]
          Length = 332

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 77/236 (32%), Positives = 126/236 (53%), Gaps = 12/236 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY-----IPENEVGMWQWL 77
           I F+I+G + Q  ++ L P E  +A  G++ +   ++ ME I+       +N   M + L
Sbjct: 70  IDFRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVAMETIFGDGSTASQNSGFMGKLL 129

Query: 78  F-------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
                   G+++ + V  + G     V  A+P    ++   L+  GG L+CQ D+FL   
Sbjct: 130 SAGKRVISGESMFTTVFTHTGSGKAKVAFASPFPGTVIAARLSEHGGRLICQKDSFLAGA 189

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V++     +++   + G EGF+ Q+L G G  F  AGG++V++ L  GE I VD  C+
Sbjct: 190 KGVQLGIFFQRKILTGLFGGEGFVMQQLDGDGWVFFHAGGTLVERELAAGESIDVDTGCV 249

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           VA  + VN+ I+  G I+  +F G+ L  A +TGPG V++QSLPF RL+ R+  A 
Sbjct: 250 VAFHAGVNMDIRPVGGIKSMLFAGEGLFLATLTGPGKVWLQSLPFSRLAGRMLAAA 305


>gi|346723102|ref|YP_004849771.1| hypothetical protein XACM_0163 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346647849|gb|AEO40473.1| hypothetical protein XACM_0163 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 335

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 75/248 (30%), Positives = 132/248 (53%), Gaps = 13/248 (5%)

Query: 12  PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV 71
           P    S +   I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  
Sbjct: 61  PLTGMSGRADEIDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHA 120

Query: 72  G------MWQWL-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
           G      M + L        G+++ + V  + G     V  AAP    +L + L+  GG+
Sbjct: 121 GQSGGGLMGKLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQ 180

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           L+CQ D+FL     V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L 
Sbjct: 181 LICQKDSFLAGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELG 240

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
            GE I VD  C+VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R+
Sbjct: 241 PGERIDVDTGCVVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRM 300

Query: 239 SQRIARAV 246
           + R+ +A 
Sbjct: 301 AGRMLQAA 308


>gi|294627382|ref|ZP_06705967.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598337|gb|EFF42489.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 335

 Score =  137 bits (344), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G      M + 
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131

Query: 77  L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           L        G+++ + V  + G     V  AAP    +L + L+  GG+L+CQ D+FL  
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308


>gi|255038010|ref|YP_003088631.1| hypothetical protein Dfer_4264 [Dyadobacter fermentans DSM 18053]
 gi|254950766|gb|ACT95466.1| protein of unknown function DUF124 [Dyadobacter fermentans DSM
           18053]
          Length = 259

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 81/244 (33%), Positives = 130/244 (53%), Gaps = 14/244 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM--------- 73
           I ++I+G + QV +I L P E V+A  G+M FM   ++ E      +E            
Sbjct: 6   IDYKIIGDDIQVVEIELDPNETVIAEAGAMLFMEDGIQFETKMGDGSEANQSIMGKIFQA 65

Query: 74  -WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
             + L G+++      N G     V  +AP    ++PIDL+ ++G EL+ Q D FLC+  
Sbjct: 66  GTRLLTGESLFMTHFTNRGVGKKKVAFSAPYPGTVMPIDLSKIYGNELIVQKDGFLCAAM 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
              +    +QR  + + G EGF+ QKL G G+AF+ AGG V++K L   E + +D  C+V
Sbjct: 126 GTSMKIHFNQRFGSGLFGGEGFILQKLKGDGMAFVHAGGVVMEKQLN-NETLRIDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
           A   S++  I+ +G ++  VFGG+ +  A + G G  +IQS+P  +L QR+  +V  PN 
Sbjct: 185 AFEQSLSFDIQRSGGLKSMVFGGEGMFLATLRGTGRCWIQSMPISKLIQRL--SVGGPNS 242

Query: 252 RENP 255
           R+  
Sbjct: 243 RKES 246


>gi|83644424|ref|YP_432859.1| hypothetical protein HCH_01577 [Hahella chejuensis KCTC 2396]
 gi|83632467|gb|ABC28434.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
          Length = 250

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 131/230 (56%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           + ++I+G + Q+ ++ L P E V+A  G+M +M+G +E E       + E G +  LF  
Sbjct: 6   VDYKIIGHDMQMVEVELDPGETVIAEAGAMNYMTGGIEFETKMGDGSDPEQGFFSKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G     V  AAP    I+P+DLA  GG ++CQ D+FL +   
Sbjct: 66  GKRMMTGESLFMTHFTNSGHGKQQVSFAAPYPGSIIPVDLANQGGTIICQKDSFLAAAMG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            ++S   ++R+ +   G EGF+ QKL G G+AFI A G+VV+K L+  E + +D  C+VA
Sbjct: 126 TRISVEFNRRLGSGFFGGEGFVLQKLEGDGMAFIHACGTVVEKRLD-NETLRLDTGCLVA 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T S++ +I+    IR   FGG+ +  A + G G V+IQSLPF RL+ RI
Sbjct: 185 FTPSISYEIEMVKGIRSMFFGGEGMFLATLRGTGTVWIQSLPFSRLADRI 234


>gi|117920922|ref|YP_870114.1| hypothetical protein Shewana3_2479 [Shewanella sp. ANA-3]
 gi|117613254|gb|ABK48708.1| Protein of unknown function DUF124 [Shewanella sp. ANA-3]
          Length = 264

 Score =  137 bits (344), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMWQWLFG- 79
           + ++I+G   Q+ ++ L P E V+A  G+M ++    S E +     E E G +  L G 
Sbjct: 8   VDYEIIGHSMQLVEVELDPHETVIAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  ++I      N G     V  AAP    IL IDLA +GGEL+CQ D+FL +   
Sbjct: 68  GKRVLTGESIFMTHFTNLGHQKRKVAFAAPYPGTILAIDLAQYGGELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +VS   ++R+     G EGF+ Q L G G+AFI AGG++++K L+ GE + VD  C+V 
Sbjct: 128 TRVSMKFNRRLGTGFFGGEGFILQSLQGDGMAFIHAGGTLIKKELK-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|315498037|ref|YP_004086841.1| hypothetical protein Astex_1009 [Asticcacaulis excentricus CB 48]
 gi|315416049|gb|ADU12690.1| protein of unknown function DUF124 [Asticcacaulis excentricus CB
           48]
          Length = 278

 Score =  136 bits (343), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 80/244 (32%), Positives = 128/244 (52%), Gaps = 12/244 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---------VGM 73
           I F+I G E Q  +I L P E  +A  G M +   S+ M  ++   +          +G 
Sbjct: 21  IDFEIKGEELQFVEIELDPGESAIAEAGGMVWKDSSVGMTTVFGDGSGQQKGFMGALLGA 80

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + L  G+++ + V  + G     V  +AP    I+P+ L+  GG L+CQ D+FL +   
Sbjct: 81  GKRLISGESLFTTVFTHNGQGKARVAFSAPVPGSIIPLKLSDVGGRLICQKDSFLAAAKG 140

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     ++V   + G EGF+ QKL G G  F+  GG+V+++ L  GE + +D  C+ A
Sbjct: 141 VSLGIAFQRKVMTGLFGGEGFIMQKLEGDGWVFVQFGGAVIERTLAPGEQLHIDTGCVAA 200

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT--SPN 250
            T +V+  +   G ++  +FGG+ +  A +TGPG V+IQSLPF RL+ R+  A +   PN
Sbjct: 201 FTDTVDFDLVQAGGVKSMLFGGEGVFFAQLTGPGKVWIQSLPFSRLAGRVLAAASGAGPN 260

Query: 251 MREN 254
             E 
Sbjct: 261 RGEG 264


>gi|381173682|ref|ZP_09882759.1| putative uncharacterized domain protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380685874|emb|CCG39246.1| putative uncharacterized domain protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 335

 Score =  136 bits (343), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 72/237 (30%), Positives = 129/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G      M + 
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131

Query: 77  L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           L        G+++ + V  + G     +  AAP    +L + L+  GG+L+CQ D+FL  
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKIAFAAPYPGTVLALRLSEHGGQLICQKDSFLAG 191

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308


>gi|113970624|ref|YP_734417.1| hypothetical protein Shewmr4_2289 [Shewanella sp. MR-4]
 gi|113885308|gb|ABI39360.1| protein of unknown function DUF124 [Shewanella sp. MR-4]
          Length = 264

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMWQWLFG- 79
           + ++I+G   Q+ ++ L P E V+A  G+M ++    S E +     E E G +  L G 
Sbjct: 8   VDYEIIGHSMQLVEVELDPHETVIAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  ++I      N G     V  AAP    IL +DLA +GGEL+CQ D+FL +   
Sbjct: 68  GKRVLTGESIFMTHFTNLGHQKRKVAFAAPYPGTILALDLAQYGGELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +VS   ++R+     G EGF+ Q L G G+AFI AGG++++K L+ GE + VD  C+V 
Sbjct: 128 TRVSMKFNRRLGTGFFGGEGFILQSLQGDGMAFIHAGGTLIKKELK-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|418515938|ref|ZP_13082115.1| hypothetical protein MOU_03879 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|410707272|gb|EKQ65725.1| hypothetical protein MOU_03879 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 335

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 128/237 (54%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQW 76
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G      M + 
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGLMGKL 131

Query: 77  L-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           L        G+++ + V  + G     V  AAP     L + L+  GG+L+CQ D+FL  
Sbjct: 132 LSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTALALRLSEHGGQLICQKDSFLAG 191

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C
Sbjct: 192 ARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTGC 251

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 VVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 308


>gi|384044528|ref|YP_005492545.1| hypothetical protein BMWSH_0352 [Bacillus megaterium WSH-002]
 gi|345442219|gb|AEN87236.1| hypothetical protein BMWSH_0352 [Bacillus megaterium WSH-002]
          Length = 266

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 11/238 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +++ G + Q  +I L P E V+A  G M  M   +EME I+   +             
Sbjct: 7   IDYKLYGDDMQFVEIELDPSESVIAEAGGMMMMEDDIEMETIFGDGSSSNSSGLFNKLKG 66

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++   V  N G     V  A+P   +I+P+DL     +++CQ  +FLC+  
Sbjct: 67  AGKRVLTGESLFMTVYTNNGYHKRKVSFASPFPGKIIPVDLEQLDHKIICQKSSFLCAAK 126

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ QKL G GLAF+ AGG++ ++ L+ GE + VD  C+V
Sbjct: 127 GVSVGIDFQKKLGTGFFGGEGFIMQKLEGNGLAFLHAGGTIHKRQLQPGERLRVDTGCLV 186

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           A+T  VN  I+Y G I+ A+FGG+ +  A ++GPG V+IQSLPF RL+ R+  +  SP
Sbjct: 187 AMTRDVNYNIEYVGKIKTALFGGEGMFFATLSGPGTVWIQSLPFSRLADRVLSSAVSP 244


>gi|295706978|ref|YP_003600053.1| hypothetical protein BMD_4880 [Bacillus megaterium DSM 319]
 gi|294804637|gb|ADF41703.1| protein of unknown function (DUF124) [Bacillus megaterium DSM 319]
          Length = 265

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 11/238 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +++ G + Q  +I L P E V+A  G M  M   +EME I+   +             
Sbjct: 6   IDYKLYGDDMQFVEIELDPSESVIAEAGGMMMMEDDIEMETIFGDGSSSTSSGLFNKLKG 65

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++   V  N G     V  A+P   +I+P+DL     +++CQ  +FLC+  
Sbjct: 66  AGKRVLTGESLFMTVYTNNGYHKRKVSFASPFPGKIIPVDLEQLDHKIICQKSSFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ QKL G GLAF+ AGG++ ++ L+ GE + VD  C+V
Sbjct: 126 GVSVGIDFQKKLGTGFFGGEGFIMQKLEGNGLAFLHAGGTIHKRQLQPGERLRVDTGCLV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           A+T  VN  I+Y G I+ A+FGG+ +  A ++GPG V+IQSLPF RL+ R+  +  SP
Sbjct: 186 AMTRDVNYNIEYVGKIKTALFGGEGMFFATLSGPGTVWIQSLPFSRLADRVLSSAVSP 243


>gi|294501630|ref|YP_003565330.1| hypothetical protein BMQ_4894 [Bacillus megaterium QM B1551]
 gi|294351567|gb|ADE71896.1| protein of unknown function (DUF124) [Bacillus megaterium QM B1551]
          Length = 265

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/238 (35%), Positives = 131/238 (55%), Gaps = 11/238 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +++ G + Q  +I L P E V+A  G M  M   +EME I+   +             
Sbjct: 6   IDYKLYGDDMQFVEIELDPSESVIAEAGGMMMMEDDIEMETIFGDGSSSNSSGLFNKLKG 65

Query: 75  ---QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
              + L G+++   V  N G     V  A+P   +I+P+DL     +++CQ  +FLC+  
Sbjct: 66  AGKRVLTGESLFMTVYTNNGYHKRKVSFASPFPGKIIPVDLEQLDHKIICQKSSFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V     +++     G EGF+ QKL G GLAF+ AGG++ ++ L+ GE + VD  C+V
Sbjct: 126 GVSVGIDFQKKLGTGFFGGEGFIMQKLEGNGLAFLHAGGTIHKRQLQPGERLRVDTGCLV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           A+T  VN  I+Y G I+ A+FGG+ +  A ++GPG V+IQSLPF RL+ R+  +  SP
Sbjct: 186 AMTRDVNYNIEYVGKIKTALFGGEGMFFATLSGPGTVWIQSLPFSRLADRVLSSAVSP 243


>gi|149177722|ref|ZP_01856322.1| hypothetical protein PM8797T_08654 [Planctomyces maris DSM 8797]
 gi|148843372|gb|EDL57735.1| hypothetical protein PM8797T_08654 [Planctomyces maris DSM 8797]
          Length = 262

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 128/230 (55%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLF-- 78
           + ++I G + Q+ +I+L P E VVA  GSM +M   +  E      +++  G +  +F  
Sbjct: 6   VDYEIFGSDLQIVEIILDPGESVVAEAGSMNYMEEGIRFEARMGDGSQLNEGFFGKIFKA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G  +  V  AAP   +I+ ID+   GG + CQ D+FLC+   
Sbjct: 66  GKRMLSGESLFMTHFTNEGSREQRVAFAAPYPGKIIAIDMDKVGGAITCQKDSFLCAALG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +V+   ++R+ +   G EGF+ Q+L G G+AF+ AGG+V++K L  GE + VD  CIVA
Sbjct: 126 TEVTMAFNKRLGSGFFGGEGFILQQLRGDGMAFVHAGGTVIKKKLN-GETLRVDTGCIVA 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  V GG+ L  A + G G V +QSLPF RL+ R+
Sbjct: 185 FTGDIDYSIEKAGNLKSMVLGGEGLFLATLRGHGTVLLQSLPFSRLADRV 234


>gi|325916159|ref|ZP_08178443.1| hypothetical protein TIGR00266 [Xanthomonas vesicatoria ATCC 35937]
 gi|325537579|gb|EGD09291.1| hypothetical protein TIGR00266 [Xanthomonas vesicatoria ATCC 35937]
          Length = 339

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 73/239 (30%), Positives = 127/239 (53%), Gaps = 15/239 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLFG 79
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++     + + G    L G
Sbjct: 74  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDSAVQMDTVFGDGSHDGQGGSSGGLMG 133

Query: 80  KTIT------------SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFL 127
           K ++            + V  + G     V  AAP    +L + L+  GG L+CQ D+FL
Sbjct: 134 KLLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSAHGGRLICQKDSFL 193

Query: 128 CSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDV 187
                V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD 
Sbjct: 194 AGARGVSLGIAFQKKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELAPGERIDVDT 253

Query: 188 SCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            C+VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 254 GCVVAYHAGVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 312


>gi|389792837|ref|ZP_10196019.1| hypothetical protein UU9_01604 [Rhodanobacter fulvus Jip2]
 gi|388435701|gb|EIL92598.1| hypothetical protein UU9_01604 [Rhodanobacter fulvus Jip2]
          Length = 338

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 73/237 (30%), Positives = 124/237 (52%), Gaps = 13/237 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF---- 78
           I ++I+G + Q  +I L P E  +A  G++ +   +++M+ ++      G          
Sbjct: 75  IDYRIVGTDMQFVEIELDPGESAIAEAGALMYKESAVQMDTVFGDGASSGQSGGGLMDKL 134

Query: 79  ---------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
                    G+++ + V  + G     V  AAP    ++ + L+  GG L+CQ DAFL  
Sbjct: 135 LSAGKRVVTGESLFTTVFTHRGQGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDAFLAG 194

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V++     +++   + G EGF+ QKL G G  F+ AGG+V+Q+ L  GE + VD  C
Sbjct: 195 ARGVQLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVMQRELAAGERLDVDTGC 254

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +VA    V++ +   G I+  +FGG+ +  A +TGPG V++QSLPF R++ R+  A 
Sbjct: 255 VVAFHDGVSMDVHPVGGIKSMLFGGEGMFLATLTGPGTVWLQSLPFSRMAGRMLAAA 311


>gi|58584140|ref|YP_203156.1| hypothetical protein XOO4517 [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|188574531|ref|YP_001911460.1| hypothetical protein PXO_03595 [Xanthomonas oryzae pv. oryzae
           PXO99A]
 gi|58428734|gb|AAW77771.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|188518983|gb|ACD56928.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae
           PXO99A]
          Length = 336

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 71/238 (29%), Positives = 129/238 (54%), Gaps = 14/238 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G          
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGGLMGK 131

Query: 75  ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
                 + + G+++ + +  + G     V  AAP    +L + L+  GG+L+CQ D+FL 
Sbjct: 132 LLSAGKRVVTGESMFTTIFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLA 191

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
               V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  
Sbjct: 192 GARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGECIDVDTG 251

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           C+VA  +SV++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 CVVAYHASVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 309


>gi|406981828|gb|EKE03225.1| hypothetical protein ACD_20C00234G0047 [uncultured bacterium]
          Length = 239

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 76/237 (32%), Positives = 137/237 (57%), Gaps = 12/237 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I+G + Q  +I L  ++ V+A PG++ +++ +++M+   +   E   ++ +      +
Sbjct: 7   YEIIGDDLQAVKINLSKEKAVLADPGNLVYITNNIKMD---VTLGEFNYFEKVLSAFKRA 63

Query: 85  VVLRN------PGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
           +V  N         SD G +  ++P   +I+P+++   G  L+C+  ++LCS +++K+  
Sbjct: 64  IVGENFFLTRFSSDSDIGEIVFSSPVPGKIIPVEIPPEG--LICEKQSYLCSDSNIKIEI 121

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +R+   + G EGF+ QKLTG+G AFI A G++++K L+  E I +D  C+VA   SV
Sbjct: 122 AFARRIGAGLFGREGFILQKLTGEGTAFIQASGTIIKKTLQPEESIRLDPGCVVAFEPSV 181

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           +  IKY G IR A+FGG  +    + GPG V+IQ+LP ++LSQ+I      P  + N
Sbjct: 182 DYDIKYVGDIRTALFGGVGIFFVTLIGPGDVYIQTLPLNKLSQKILSKSKFPKAKSN 238


>gi|126174826|ref|YP_001050975.1| hypothetical protein Sbal_2613 [Shewanella baltica OS155]
 gi|386341579|ref|YP_006037945.1| hypothetical protein [Shewanella baltica OS117]
 gi|125998031|gb|ABN62106.1| protein of unknown function DUF124 [Shewanella baltica OS155]
 gi|334863980|gb|AEH14451.1| protein of unknown function DUF124 [Shewanella baltica OS117]
          Length = 264

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 125/230 (54%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCF----------MSGSMEMENIYIPENEVG 72
           + ++I+G   Q+ ++ L P E VVA  G+M +          M    E E+ +  +  V 
Sbjct: 8   VDYEIIGHSMQLVELELDPNETVVAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMVA 67

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G++I      N G     V  AAP    IL IDLA +GGEL+CQ D+FL +   
Sbjct: 68  GKRALTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +V+   ++R+     G EGF+ Q L G G+ FI AGG++V+K L  GE + VD  C+V 
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELR-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|384417311|ref|YP_005626671.1| hypothetical protein XOC_0265 [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353460225|gb|AEQ94504.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 336

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 71/238 (29%), Positives = 129/238 (54%), Gaps = 14/238 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G          
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGGLMGK 131

Query: 75  ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
                 + + G+++ + +  + G     V  AAP    +L + L+  GG+L+CQ D+FL 
Sbjct: 132 LLSAGKRVVSGESMFTTIFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLA 191

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
               V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  
Sbjct: 192 GARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTG 251

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           C+VA  +SV++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 CVVAYHASVDMDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 309


>gi|374702753|ref|ZP_09709623.1| hypothetical protein PseS9_05019 [Pseudomonas sp. S9]
          Length = 248

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 78/232 (33%), Positives = 124/232 (53%), Gaps = 9/232 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ--WLFGK 80
           + ++ILG  AQ  +++L P E V+A  G + +M+  +  E      +  G+    W  GK
Sbjct: 6   LEYEILGNSAQSVEVILDPGETVIAEAGVLNYMTEDVRFETRMGDGSSSGLLGKLWSAGK 65

Query: 81  TITS------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
            + +          N G     V  AAP    ++P+ LA  GG L+CQ D+FLC+     
Sbjct: 66  RMVTGESMFMTHFSNTGDKQARVAFAAPYPGSVVPVSLAQVGGSLVCQRDSFLCAARGTS 125

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +  + ++R+     G EGF+ QKL G G  F+ AGG+V++K L   E + +D  C+V  +
Sbjct: 126 IGISFNKRLGAGFFGGEGFILQKLEGDGPVFLHAGGTVIRKELN-NETLRLDTGCLVGFS 184

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           S ++  IK  G ++  +FGG+ +  A + G G V+IQSLPF RL++RI  A 
Sbjct: 185 SGIDYDIKLAGGLKSMLFGGEGIFLATLKGTGTVWIQSLPFSRLAERIYEAT 236


>gi|153001168|ref|YP_001366849.1| hypothetical protein Shew185_2651 [Shewanella baltica OS185]
 gi|373949985|ref|ZP_09609946.1| protein of unknown function DUF124 [Shewanella baltica OS183]
 gi|386324182|ref|YP_006020299.1| hypothetical protein [Shewanella baltica BA175]
 gi|151365786|gb|ABS08786.1| protein of unknown function DUF124 [Shewanella baltica OS185]
 gi|333818327|gb|AEG10993.1| protein of unknown function DUF124 [Shewanella baltica BA175]
 gi|373886585|gb|EHQ15477.1| protein of unknown function DUF124 [Shewanella baltica OS183]
          Length = 264

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 126/230 (54%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMW------ 74
           + ++I+G   Q+ ++ L P E VVA  G+M ++    S E +     E E G +      
Sbjct: 8   VDYEIIGHSMQLVEVELDPNETVVAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLVGA 67

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G++I      N G     V  AAP    IL IDLA +GGEL+CQ D+FL +   
Sbjct: 68  GKRALTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +V+   ++R+     G EGF+ Q L G G+ FI AGG++V+K L  GE + VD  C+V 
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELR-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|152988055|ref|YP_001346826.1| hypothetical protein PSPA7_1442 [Pseudomonas aeruginosa PA7]
 gi|452878803|ref|ZP_21955976.1| hypothetical protein G039_19623 [Pseudomonas aeruginosa VRFPA01]
 gi|150963213|gb|ABR85238.1| conserved hypothetical protein [Pseudomonas aeruginosa PA7]
 gi|452184571|gb|EME11589.1| hypothetical protein G039_19623 [Pseudomonas aeruginosa VRFPA01]
          Length = 248

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 79/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
           + ++ILG   Q  +I L P E V+A  G+M +M+G +      + +   G     +W   
Sbjct: 6   LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  AAP    +LP+DL   GG L CQ D+FLC+    
Sbjct: 65  KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVLPVDLDDVGGRLFCQKDSFLCAAYGT 124

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           +V     +R+     G EGF+ QKL G GL F+ AGG+++++ L   E + VD  C+VA 
Sbjct: 125 RVGIAFAKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLH-DETLRVDTGCLVAF 183

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           T  ++  ++  G ++  +FGG+ L+   + G G V++QSLPF RL+ RI  A 
Sbjct: 184 TDGIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236


>gi|160875838|ref|YP_001555154.1| hypothetical protein Sbal195_2726 [Shewanella baltica OS195]
 gi|378709040|ref|YP_005273934.1| hypothetical protein [Shewanella baltica OS678]
 gi|418024588|ref|ZP_12663570.1| protein of unknown function DUF124 [Shewanella baltica OS625]
 gi|160861360|gb|ABX49894.1| protein of unknown function DUF124 [Shewanella baltica OS195]
 gi|315268029|gb|ADT94882.1| protein of unknown function DUF124 [Shewanella baltica OS678]
 gi|353535874|gb|EHC05434.1| protein of unknown function DUF124 [Shewanella baltica OS625]
          Length = 264

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 126/230 (54%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMW------ 74
           + ++I+G   Q+ ++ L P E VVA  G+M ++    S E +     E E G +      
Sbjct: 8   VDYEIIGHSMQLVEVELDPNETVVAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G++I      N G     V  AAP    IL IDLA +GGEL+CQ D+FL +   
Sbjct: 68  GKRALTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +V+   ++R+     G EGF+ Q L G G+ FI AGG++V+K L  GE + VD  C+V 
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELR-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|289665045|ref|ZP_06486626.1| hypothetical protein XcampvN_18745 [Xanthomonas campestris pv.
           vasculorum NCPPB 702]
 gi|289669829|ref|ZP_06490904.1| hypothetical protein XcampmN_15372 [Xanthomonas campestris pv.
           musacearum NCPPB 4381]
          Length = 336

 Score =  134 bits (336), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 71/238 (29%), Positives = 128/238 (53%), Gaps = 14/238 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I+G + Q  ++ L P E  +A  G++ +   +++M+ ++   +  G          
Sbjct: 72  IDYRIVGNDMQFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGGLMGK 131

Query: 75  ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
                 + + G+++ + V  + G     V  AAP    +L + L+  GG+L+CQ D+FL 
Sbjct: 132 LLSAGKRVVTGESMFTTVFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLA 191

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
               V +     +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  
Sbjct: 192 GARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGERIDVDTG 251

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           C+VA  + V++ ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 252 CVVAYHAGVDMDVRRVAGLKSMFFGGEGMFLATLTGPGKVWLQSLPFSRMAGRMLQAA 309


>gi|355639904|ref|ZP_09051449.1| hypothetical protein HMPREF1030_00535 [Pseudomonas sp. 2_1_26]
 gi|354831610|gb|EHF15620.1| hypothetical protein HMPREF1030_00535 [Pseudomonas sp. 2_1_26]
          Length = 248

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 79/233 (33%), Positives = 125/233 (53%), Gaps = 11/233 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
           + ++ILG   Q  +I L P E V+A  G+M +M+G +      + +   G     +W   
Sbjct: 6   LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  AAP    ++ +DL   GG L CQ D+FLC+    
Sbjct: 65  KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           +V     +R+     G EGF+ QKL G GL F+ AGG+++++ L  GE + VD  C+VA 
Sbjct: 125 RVGIAFAKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           T S++  ++  G ++  +FGG+ L+   + G G V++QSLPF RL+ RI  A 
Sbjct: 184 TDSIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236


>gi|24373530|ref|NP_717573.1| protein of unknown function DUF124 [Shewanella oneidensis MR-1]
 gi|24347839|gb|AAN55017.1| protein of unknown function DUF124 [Shewanella oneidensis MR-1]
          Length = 264

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMW------ 74
           + ++I+G   Q+ ++ L P E V+A  G+M ++    S E +     E + G +      
Sbjct: 8   VDYEIIGHSMQLVEVELDPNETVIAEAGAMNYLEQDISFEAKMGDGSEPDSGFFGKLMGA 67

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G++I      N G     V  AAP    IL IDLA +GGEL+CQ D+FL +   
Sbjct: 68  GKRALTGESIFMTHFTNLGHQKCKVAFAAPYPGTILAIDLAQYGGELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +VS   ++R+     G EGF+ Q L G G+AFI AGG++++K L+ GE + VD  C+V 
Sbjct: 128 TRVSMKFNRRLGTGFFGGEGFILQSLQGDGMAFIHAGGTLIKKELK-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  V+  I+  G ++   FGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGVDYDIERAGSLKSMFFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|120599156|ref|YP_963730.1| hypothetical protein Sputw3181_2352 [Shewanella sp. W3-18-1]
 gi|146292773|ref|YP_001183197.1| hypothetical protein Sputcn32_1673 [Shewanella putrefaciens CN-32]
 gi|386313452|ref|YP_006009617.1| hypothetical protein [Shewanella putrefaciens 200]
 gi|120559249|gb|ABM25176.1| protein of unknown function DUF124 [Shewanella sp. W3-18-1]
 gi|145564463|gb|ABP75398.1| protein of unknown function DUF124 [Shewanella putrefaciens CN-32]
 gi|319426077|gb|ADV54151.1| protein of unknown function DUF124 [Shewanella putrefaciens 200]
          Length = 264

 Score =  134 bits (336), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 125/230 (54%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + ++I+G   Q+ ++ L P E V+A  G+M ++   +  E       E E G +  L G 
Sbjct: 8   VDYEIIGHSMQLVEVELDPNETVIAEAGAMNYLEQDIRFEAKMGDGSEPESGFFGKLMGA 67

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  ++I      N G     V  AAP    IL IDLA +GGEL+CQ D+FL +   
Sbjct: 68  GKRVLTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +V+   ++R+     G EGF+ Q L G G+ FI AGG++V+K L+ GE + VD  C+V 
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELK-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|152993590|ref|YP_001359311.1| hypothetical protein SUN_2012 [Sulfurovum sp. NBC37-1]
 gi|151425451|dbj|BAF72954.1| conserved hypothetical protein [Sulfurovum sp. NBC37-1]
          Length = 344

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 88/276 (31%), Positives = 140/276 (50%), Gaps = 13/276 (4%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           M A  F+ P +    +      I + I G + Q  +I L P E VV+  G+M + +  + 
Sbjct: 58  MVASSFTPPSRARGSRLNSADEIDYTIYGEDIQYVEIELDPGESVVSEAGAMMYKNSHIV 117

Query: 61  MENIY-----IPENEVGMWQWLFG--------KTITSVVLRNPGPSDGFVGIAAPSLARI 107
           M+ ++       E+  G++  L G        +++ + V  + G     V   AP   RI
Sbjct: 118 MDTVFGDGSGSKEDTGGLFDKLVGAGKRLVTGESLFTTVFTHRGNGKATVAFGAPYPGRI 177

Query: 108 LPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFIL 167
           +P+DL   GG L+CQ D FLC+   V +     ++V   + G EGF+ QKL G GLAFI 
Sbjct: 178 IPVDLDEMGGTLICQKDCFLCAAKGVSIGIYFQKKVLTGLFGGEGFIMQKLEGDGLAFIH 237

Query: 168 AGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGI 227
           AGG + +  L+ GE + +D  C+VA+   V+  ++  G I+  +FGG+    A + GPG 
Sbjct: 238 AGGMLKEIELKDGEELHLDTGCLVAMQPHVHFDLEQAGGIKTGLFGGEGFFFAKLRGPGK 297

Query: 228 VFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAI 263
           ++IQSLPF RL+ R+  A      ++  +  V  AI
Sbjct: 298 IWIQSLPFSRLAGRMLSAAPQGGGKDTGEGSVLGAI 333


>gi|296447765|ref|ZP_06889680.1| protein of unknown function DUF124 [Methylosinus trichosporium
           OB3b]
 gi|296254742|gb|EFH01854.1| protein of unknown function DUF124 [Methylosinus trichosporium
           OB3b]
          Length = 304

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 88/247 (35%), Positives = 132/247 (53%), Gaps = 11/247 (4%)

Query: 12  PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PEN 69
           P   +SP +A + F+I G E Q  +I L P    VA  G+M F   ++ ME I+    + 
Sbjct: 26  PGANRSPAEA-LRFEIKGHEMQFVEIELDPGRACVAEAGAMMFKEAAVGMETIFGDGAQE 84

Query: 70  EVGM----WQW----LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLC 121
           + G+    W+     L G T+      N G S      AAP   +ILP+ L + GGELLC
Sbjct: 85  DSGLLGKLWRGAKRALSGSTLFMTRFTNNGYSPARAAFAAPYPGKILPLRLDLLGGELLC 144

Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
           Q D FL     V +     +++   + G EGF+ Q+L+G G+ F+ AGG+VV++ L  GE
Sbjct: 145 QRDTFLAGAPGVAIDIAFQKKIMTGLFGGEGFIMQRLSGNGIVFVHAGGAVVERELAPGE 204

Query: 182 VITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQR 241
            + VD  C+VA T++V+  +   G ++   FGG+    A + GPG V++QS+PF RL   
Sbjct: 205 ELHVDSGCLVAQTATVSFDVVPVGGVKSMFFGGEGFFFARLVGPGHVWLQSMPFSRLVGH 264

Query: 242 IARAVTS 248
           IA  + S
Sbjct: 265 IASRMPS 271


>gi|217972911|ref|YP_002357662.1| hypothetical protein Sbal223_1734 [Shewanella baltica OS223]
 gi|217498046|gb|ACK46239.1| protein of unknown function DUF124 [Shewanella baltica OS223]
          Length = 264

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 125/230 (54%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG--SMEMENIYIPENEVGMW------ 74
           + ++I+G   Q+ ++ L P E VVA  G+M ++    S E +     E E G +      
Sbjct: 8   VDYEIIGHSMQLVEVELDPNETVVAEAGAMNYLEQDISFEAKMGDGSEPESGFFGKLMGA 67

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G++I      N G     V  AAP    IL IDLA +GGEL+CQ D FL +   
Sbjct: 68  GKRALTGESIFMTHFTNQGNQKRKVAFAAPYPGTILAIDLAEYGGELICQKDCFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +V+   ++R+     G EGF+ Q L G G+ FI AGG++V+K L  GE + VD  C+V 
Sbjct: 128 TRVTMKFNRRLGTGFFGGEGFILQSLQGDGMTFIHAGGTLVKKELR-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIERAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|325109149|ref|YP_004270217.1| hypothetical protein Plabr_2595 [Planctomyces brasiliensis DSM
           5305]
 gi|324969417|gb|ADY60195.1| protein of unknown function DUF124 [Planctomyces brasiliensis DSM
           5305]
          Length = 261

 Score =  133 bits (334), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 81/231 (35%), Positives = 120/231 (51%), Gaps = 13/231 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFM-----------SGSMEMENIYIPENEV 71
           I ++I G + Q+ +I L P E VVA  G M +M            GS   +  +      
Sbjct: 6   IDYEIFGNDMQIVEIELDPYETVVAEAGMMNYMEDGIRYEAKMGDGSQASQGFFSKMMSA 65

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
           G  + L G++I      N  P    +  AAP   RI+ +DL   GG ++CQ DAFLC+  
Sbjct: 66  G-ARALTGESIFMTHFTNESPQKRRIAFAAPYPGRIVALDLEKIGGTVICQKDAFLCAAK 124

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             ++S    +R+ +   G EGF+ QKL G G AF+ AGG+V+++ L   E + VD  C+V
Sbjct: 125 GTRLSIAFQKRLGSGFFGGEGFILQKLEGDGKAFVHAGGTVIKRKLN-DETLRVDTGCLV 183

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A T  ++  I   G ++   FGG+ L  A + G G V +QSLPF RL+ R+
Sbjct: 184 AFTPGIDYDIARAGSLKSMFFGGEGLFLATLRGTGTVLLQSLPFSRLADRV 234


>gi|338210593|ref|YP_004654642.1| hypothetical protein [Runella slithyformis DSM 19594]
 gi|336304408|gb|AEI47510.1| protein of unknown function DUF124 [Runella slithyformis DSM 19594]
          Length = 259

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 77/232 (33%), Positives = 125/232 (53%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I+G + Q+ +I L P E V+A  GSM FM   ++ E      +             
Sbjct: 6   IDYKIIGEDIQIVEIELDPNETVIAEAGSMLFMEDGIQFETKMGDGSNATQSLMGKIFQA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
             + L G+++      N G     V  AAP    ++PIDL+ ++G  L+ Q D FLC+  
Sbjct: 66  GSRLLTGESLFMTHFTNRGYGKRKVAFAAPYPGTVMPIDLSRIYGNSLIVQKDGFLCAAM 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             K++   +QR+ + + G EGF+ QKL G GLAF+ AGG V+++ L   E + VD  C+V
Sbjct: 126 GTKLNIHFNQRLGSGLFGGEGFILQKLQGDGLAFVHAGGVVIERQLN-NETLRVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           A   SVN  ++ +G ++  +FGG+ +  A + G G  +IQS+P  +L  R++
Sbjct: 185 AFEPSVNFSVQRSGSLKSMIFGGEGIFLATLQGTGRCWIQSMPISKLVDRLS 236


>gi|49082264|gb|AAT50532.1| PA3696, partial [synthetic construct]
          Length = 249

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
           + ++ILGG  Q  +I L P E V+A  G+M +M+G +      + +   G     +W   
Sbjct: 6   LDYRILGGSMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  AAP    ++ +DL   GG L CQ D+FLC+    
Sbjct: 65  KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           +V     +R+     G EGF+ QKL G GL F+ AGG+++++ L  GE + VD  C+VA 
Sbjct: 125 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           T  ++  ++  G ++  +FG + L+   + G G V++QSLPF RL+ RI  A 
Sbjct: 184 TDGIDYDVQLAGGLKSMLFGSEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236


>gi|171914117|ref|ZP_02929587.1| hypothetical protein VspiD_23100 [Verrucomicrobium spinosum DSM
           4136]
          Length = 340

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 85/250 (34%), Positives = 137/250 (54%), Gaps = 14/250 (5%)

Query: 5   FFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-- 62
           F   P  P  +   +   I ++I+G + Q+ +I L P E V+A  G M +M   ++ E  
Sbjct: 65  FGCPPPIPQGFNQQRSHEIEYEIIGDDLQIVEIELDPGETVIAEAGGMNYMEDDIQFETR 124

Query: 63  --NIYIPE-------NEVGMWQWLFGKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDL 112
             +   P+         +G  + L G+++      + G S    V  AAP   +I+P+ L
Sbjct: 125 MGDGSQPDGGFMGALKNIGK-RMLTGESLFMTHFTHRGSSGKRRVAFAAPYPGKIIPMRL 183

Query: 113 AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSV 172
           +  GGE++C+ DAFLC+    +V    ++++     G EGF+ Q+L G GL F+ AGG++
Sbjct: 184 SELGGEIICEKDAFLCAALGTEVGIAFNRKLGTGFFGGEGFILQRLRGDGLVFVHAGGTI 243

Query: 173 VQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           V+K L  G  + VD  C+VA+TS +N  I+  G ++  VFGG+ L  A ++G G V++QS
Sbjct: 244 VKKELR-GGTLRVDTGCLVAMTSGINYDIQRAGNLKSMVFGGEGLFLATLSGHGTVWLQS 302

Query: 233 LPFHRLSQRI 242
           LPF RL+ RI
Sbjct: 303 LPFSRLADRI 312


>gi|330814446|ref|XP_003291402.1| hypothetical protein DICPUDRAFT_155999 [Dictyostelium purpureum]
 gi|325078427|gb|EGC32079.1| hypothetical protein DICPUDRAFT_155999 [Dictyostelium purpureum]
          Length = 346

 Score =  132 bits (333), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 84/243 (34%), Positives = 130/243 (53%), Gaps = 11/243 (4%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN--- 63
           ST FQ       +     ++I G E+QV Q+ L P   + A  G++  M   +EM+    
Sbjct: 80  STLFQNSQSLKGKATSFDWEIFGLESQVIQMNLNPGSSITAETGALLEMESEIEMDTSAR 139

Query: 64  --IYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELL 120
             I      +   Q LF   +T  V  N   ++   +  ++P +++IL I L+  GGEL+
Sbjct: 140 GGILSSFKRMFTGQGLF---LTKFV--NKSETEKMRITFSSPYISKILAIKLSDVGGELI 194

Query: 121 CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVG 180
           CQ +AFLC  + V++   L +       G +GF+ QKL G G+ F+ AGGSV+ + L  G
Sbjct: 195 CQKNAFLCGEDTVEIEPRLTKNFSTGFFGGQGFILQKLRGNGMVFVHAGGSVIYRVLSPG 254

Query: 181 EVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           E I V    IVA  S+V+ QI++   ++  +F G+ L  A +TGPG+V +QS+PF RL +
Sbjct: 255 EKIRVTTGSIVAFESTVDFQIEFVKGVKNILFSGEGLTFAAITGPGMVVLQSMPFERLVR 314

Query: 241 RIA 243
            IA
Sbjct: 315 AIA 317


>gi|416860159|ref|ZP_11914168.1| hypothetical protein PA13_19985 [Pseudomonas aeruginosa 138244]
 gi|334837751|gb|EGM16500.1| hypothetical protein PA13_19985 [Pseudomonas aeruginosa 138244]
 gi|453047297|gb|EME95011.1| hypothetical protein H123_04106 [Pseudomonas aeruginosa PA21_ST175]
          Length = 248

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
           + ++ILG   Q  +I L P E V+A  G+M +M+G +      + +   G     +W   
Sbjct: 6   LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  AAP    ++ +DL   GG L CQ D+FLC+    
Sbjct: 65  KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           +V     +R+     G EGF+ QKL G GL F+ AGG+++++ L  GE + VD  C+VA 
Sbjct: 125 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           T  ++  ++  G ++  +FGG+ L+   + G G V++QSLPF RL+ RI  A 
Sbjct: 184 TDDIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236


>gi|424792016|ref|ZP_18218294.1| hypothetical protein XTG29_01349 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
 gi|422797287|gb|EKU25645.1| hypothetical protein XTG29_01349 [Xanthomonas translucens pv.
           graminis ART-Xtg29]
          Length = 252

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 72/220 (32%), Positives = 117/220 (53%), Gaps = 10/220 (4%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW----------QWLFGKTI 82
           Q  +I L P E  VA  G++ F   +++M+ ++   +  G            + + G+++
Sbjct: 2   QFVEIELDPGESAVAEAGALMFKDAAVQMDTVFGDGSGQGGGLMGKLFSAGKRLVTGESL 61

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
            + V  + G     V  AAP    +L + L   GG L+CQ D+FL     V +     ++
Sbjct: 62  FTTVFTHQGQGKATVAFAAPYPGTVLAMKLDQHGGRLICQKDSFLAGARGVSLGIHFQRK 121

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   + G EGF+ QKL G G  F+ AGG VV++ L  GE + VD  C+VA  S+V++ ++
Sbjct: 122 IMTGLFGGEGFIMQKLEGDGWVFVHAGGCVVERELAAGERLDVDTGCVVAFHSTVDMDVR 181

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
               I+   FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 182 AVSGIKSMFFGGEGMFLATLTGPGKVWLQSLPFSRLAGRM 221


>gi|297738535|emb|CBI27780.3| unnamed protein product [Vitis vinifera]
          Length = 142

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 59/85 (69%), Positives = 70/85 (82%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           V P++ILGGEAQ+ QIMLKPQEK++A+PGS C+MSGS +MEN+YI +NEVGMWQW FGK 
Sbjct: 39  VTPYRILGGEAQMVQIMLKPQEKIIAKPGSTCYMSGSTQMENVYISKNEVGMWQWAFGKA 98

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLAR 106
             SVVL N G  DGFVG  APSLA+
Sbjct: 99  AASVVLLNQGSGDGFVGTVAPSLAK 123


>gi|15598892|ref|NP_252386.1| hypothetical protein PA3696 [Pseudomonas aeruginosa PAO1]
 gi|107103210|ref|ZP_01367128.1| hypothetical protein PaerPA_01004279 [Pseudomonas aeruginosa PACS2]
 gi|218890027|ref|YP_002438891.1| hypothetical protein PLES_12881 [Pseudomonas aeruginosa LESB58]
 gi|254236606|ref|ZP_04929929.1| conserved hypothetical protein [Pseudomonas aeruginosa C3719]
 gi|254242391|ref|ZP_04935713.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
 gi|386057317|ref|YP_005973839.1| hypothetical protein PAM18_1250 [Pseudomonas aeruginosa M18]
 gi|392982580|ref|YP_006481167.1| hypothetical protein PADK2_05860 [Pseudomonas aeruginosa DK2]
 gi|418586069|ref|ZP_13150115.1| hypothetical protein O1O_15358 [Pseudomonas aeruginosa MPAO1/P1]
 gi|418589503|ref|ZP_13153425.1| hypothetical protein O1Q_02893 [Pseudomonas aeruginosa MPAO1/P2]
 gi|419754713|ref|ZP_14281071.1| hypothetical protein CF510_16988 [Pseudomonas aeruginosa
           PADK2_CF510]
 gi|420138105|ref|ZP_14646046.1| hypothetical protein PACIG1_1544 [Pseudomonas aeruginosa CIG1]
 gi|421152495|ref|ZP_15612075.1| hypothetical protein PABE171_1421 [Pseudomonas aeruginosa ATCC
           14886]
 gi|421158510|ref|ZP_15617759.1| hypothetical protein PABE173_1370 [Pseudomonas aeruginosa ATCC
           25324]
 gi|421166068|ref|ZP_15624338.1| hypothetical protein PABE177_1160 [Pseudomonas aeruginosa ATCC
           700888]
 gi|421179140|ref|ZP_15636736.1| hypothetical protein PAE2_1185 [Pseudomonas aeruginosa E2]
 gi|421518239|ref|ZP_15964913.1| hypothetical protein A161_18210 [Pseudomonas aeruginosa PAO579]
 gi|424939445|ref|ZP_18355208.1| conserved hypothetical protein [Pseudomonas aeruginosa NCMG1179]
 gi|451987723|ref|ZP_21935875.1| DUF124 domain-containing protein [Pseudomonas aeruginosa 18A]
 gi|9949861|gb|AAG07084.1|AE004789_4 conserved hypothetical protein [Pseudomonas aeruginosa PAO1]
 gi|126168537|gb|EAZ54048.1| conserved hypothetical protein [Pseudomonas aeruginosa C3719]
 gi|126195769|gb|EAZ59832.1| conserved hypothetical protein [Pseudomonas aeruginosa 2192]
 gi|218770250|emb|CAW26015.1| conserved hypothetical protein [Pseudomonas aeruginosa LESB58]
 gi|346055891|dbj|GAA15774.1| conserved hypothetical protein [Pseudomonas aeruginosa NCMG1179]
 gi|347303623|gb|AEO73737.1| hypothetical protein PAM18_1250 [Pseudomonas aeruginosa M18]
 gi|375043743|gb|EHS36359.1| hypothetical protein O1O_15358 [Pseudomonas aeruginosa MPAO1/P1]
 gi|375051737|gb|EHS44203.1| hypothetical protein O1Q_02893 [Pseudomonas aeruginosa MPAO1/P2]
 gi|384398531|gb|EIE44936.1| hypothetical protein CF510_16988 [Pseudomonas aeruginosa
           PADK2_CF510]
 gi|392318085|gb|AFM63465.1| hypothetical protein PADK2_05860 [Pseudomonas aeruginosa DK2]
 gi|403249088|gb|EJY62603.1| hypothetical protein PACIG1_1544 [Pseudomonas aeruginosa CIG1]
 gi|404347721|gb|EJZ74070.1| hypothetical protein A161_18210 [Pseudomonas aeruginosa PAO579]
 gi|404525255|gb|EKA35531.1| hypothetical protein PABE171_1421 [Pseudomonas aeruginosa ATCC
           14886]
 gi|404539215|gb|EKA48712.1| hypothetical protein PABE177_1160 [Pseudomonas aeruginosa ATCC
           700888]
 gi|404547383|gb|EKA56381.1| hypothetical protein PAE2_1185 [Pseudomonas aeruginosa E2]
 gi|404549518|gb|EKA58375.1| hypothetical protein PABE173_1370 [Pseudomonas aeruginosa ATCC
           25324]
 gi|451754482|emb|CCQ88398.1| DUF124 domain-containing protein [Pseudomonas aeruginosa 18A]
          Length = 248

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
           + ++ILG   Q  +I L P E V+A  G+M +M+G +      + +   G     +W   
Sbjct: 6   LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  AAP    ++ +DL   GG L CQ D+FLC+    
Sbjct: 65  KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           +V     +R+     G EGF+ QKL G GL F+ AGG+++++ L  GE + VD  C+VA 
Sbjct: 125 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           T  ++  ++  G ++  +FGG+ L+   + G G V++QSLPF RL+ RI  A 
Sbjct: 184 TDGIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236


>gi|389736882|ref|ZP_10190389.1| hypothetical protein UU5_11043 [Rhodanobacter sp. 115]
 gi|388438772|gb|EIL95500.1| hypothetical protein UU5_11043 [Rhodanobacter sp. 115]
          Length = 343

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 70/234 (29%), Positives = 127/234 (54%), Gaps = 14/234 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I+G + Q  +I L P E  VA  G++ +   +++M+ ++   +             
Sbjct: 79  IDYRIVGHDMQFVEIELDPGESAVAEAGALMYKESAIQMDTVFGDGSSSSGQSGSGLMDK 138

Query: 75  ------QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
                 + + G+++ + V  + G     V  AAP    ++ + L+  GG L+CQ D+FL 
Sbjct: 139 LLSAGKRVITGESLFTTVFTHTGSGKAKVAFAAPYPGTVMAMKLSDHGGRLICQKDSFLA 198

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
               V++     +++   + G EGF+ QKL G G  F+ AGG++V++ L  GE + VD  
Sbjct: 199 GARGVRLGIFFQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTMVERELAPGERLDVDTG 258

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           C+VA   +V++ ++  G I+  +FGG+ +  A ++GPG V++QSLPF RL+ R+
Sbjct: 259 CVVAFHDTVSMDVRPVGGIKSMLFGGEGVFLATLSGPGKVWLQSLPFSRLAGRM 312


>gi|116051693|ref|YP_789468.1| hypothetical protein PA14_16590 [Pseudomonas aeruginosa UCBPP-PA14]
 gi|296387797|ref|ZP_06877272.1| hypothetical protein PaerPAb_06574 [Pseudomonas aeruginosa PAb1]
 gi|313109102|ref|ZP_07795074.1| hypothetical protein PA39016_001670058 [Pseudomonas aeruginosa
           39016]
 gi|386067731|ref|YP_005983035.1| hypothetical protein NCGM2_4827 [Pseudomonas aeruginosa NCGM2.S1]
 gi|416878067|ref|ZP_11920194.1| hypothetical protein PA15_18983 [Pseudomonas aeruginosa 152504]
 gi|421173065|ref|ZP_15630820.1| hypothetical protein PACI27_1306 [Pseudomonas aeruginosa CI27]
 gi|115586914|gb|ABJ12929.1| conserved hypothetical protein [Pseudomonas aeruginosa UCBPP-PA14]
 gi|310881576|gb|EFQ40170.1| hypothetical protein PA39016_001670058 [Pseudomonas aeruginosa
           39016]
 gi|334838752|gb|EGM17460.1| hypothetical protein PA15_18983 [Pseudomonas aeruginosa 152504]
 gi|348036290|dbj|BAK91650.1| hypothetical protein NCGM2_4827 [Pseudomonas aeruginosa NCGM2.S1]
 gi|404536736|gb|EKA46372.1| hypothetical protein PACI27_1306 [Pseudomonas aeruginosa CI27]
          Length = 248

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/233 (33%), Positives = 124/233 (53%), Gaps = 11/233 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
           + ++ILG   Q  +I L P E V+A  G+M +M+G +      + +   G     +W   
Sbjct: 6   LDYRILGESMQTVEIELDPGETVIAEAGAMNYMTGDIRF-TARMGDGSDGSLLGKLWSAG 64

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  AAP    ++ +DL   GG L CQ D+FLC+    
Sbjct: 65  KRKLGGESVFMTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 124

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           +V     +R+     G EGF+ QKL G GL F+ AGG+++++ L  GE + VD  C+VA 
Sbjct: 125 RVGIAFAKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 183

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           T  ++  ++  G ++  +FGG+ L+   + G G V++QSLPF RL+ RI  A 
Sbjct: 184 TDGIDYDVQLAGGLKSMLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 236


>gi|158520989|ref|YP_001528859.1| hypothetical protein Dole_0972 [Desulfococcus oleovorans Hxd3]
 gi|158509815|gb|ABW66782.1| protein of unknown function DUF124 [Desulfococcus oleovorans Hxd3]
          Length = 251

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 125/236 (52%), Gaps = 11/236 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY-----IPENEVG----M 73
           + + ++G   Q  +I L P E V+A  G+M +M+ ++  E         PE  +G     
Sbjct: 6   VDYTLIGDVMQAVEIGLDPGETVIADAGNMNYMNDAIRFETRMGDGAPAPEGLLGSLVAA 65

Query: 74  WQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
            + LF G+ +      N   S   V  + P   R++ +D+A  G E+ CQ +AFLC+   
Sbjct: 66  GRRLFAGENLFLTHFTNTAGSKRHVTFSGPCPGRVIVLDMAALGQEVFCQKEAFLCAARG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +VS    +R+     G +GF+  +L G G+AF+ AGG VV+K L+  E I VD  CIVA
Sbjct: 126 TQVSIAFTRRMGPGFFGRDGFVLMRLVGDGMAFVHAGGMVVEKTLK-NETICVDTGCIVA 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
            TS ++   +    ++   FGG+NL  A ++G G VF+Q+LP  R + RI R ++S
Sbjct: 185 FTSGIHYSARQADNLKSMFFGGENLFLATLSGTGTVFLQTLPLSRPADRILRQLSS 240


>gi|47530420|ref|YP_021769.1| hypothetical protein GBAA_5116 [Bacillus anthracis str. 'Ames
           Ancestor']
 gi|49187746|ref|YP_030999.1| hypothetical protein BAS4755 [Bacillus anthracis str. Sterne]
 gi|47505568|gb|AAT34244.1| conserved hypothetical protein [Bacillus anthracis str. 'Ames
           Ancestor']
 gi|49181673|gb|AAT57049.1| conserved hypothetical protein [Bacillus anthracis str. Sterne]
          Length = 226

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 79/210 (37%), Positives = 116/210 (55%), Gaps = 18/210 (8%)

Query: 52  MCFMSGSMEMENIYIPENEVGMWQWLFGKTITS------------VVLRNPGPSDGFVGI 99
           M  M   +EME I+   +  G    LFGK + +             V  N G     V  
Sbjct: 1   MMMMEDYIEMETIF--GDGSGPSGGLFGKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSF 58

Query: 100 AAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
           AAP   +I+P+DL  + G+++CQ DAFLC+   V +     +++     G EGF+ QKL 
Sbjct: 59  AAPYPGKIIPVDLTEYQGKVVCQKDAFLCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLE 118

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
           G GLAF+ AGG+V ++ L+ GE + +D  C+VA+T  VN  +++ G ++ A+FGG+ L  
Sbjct: 119 GDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFF 178

Query: 220 AVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           A + GPG V+IQSL   RLS R+    TSP
Sbjct: 179 ATLEGPGTVWIQSLTLSRLSARL----TSP 204


>gi|436834358|ref|YP_007319574.1| protein of unknown function DUF124 [Fibrella aestuarina BUZ 2]
 gi|384065771|emb|CCG98981.1| protein of unknown function DUF124 [Fibrella aestuarina BUZ 2]
          Length = 259

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 83/254 (32%), Positives = 132/254 (51%), Gaps = 13/254 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I ++I+G + QV +I L P E V+A  GSM FM   +  E      ++            
Sbjct: 6   IDYKIIGEDIQVVEIELDPNETVIAEAGSMLFMEEGISFETRMGDGSQTSGGFLGKLMQA 65

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
             + + G+++      N G     V  AAP    ++PI+LA ++   L+ Q DAFLC+  
Sbjct: 66  GSRMIMGESLFMTHFTNRGVGKRKVAFAAPYPGTVMPINLANIYQNTLIVQKDAFLCAAM 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             ++S   +QR+   + G EGF+ +K+ G GLAFI AGG V+++ L   EV+ VD  C+V
Sbjct: 126 GTRLSIHFNQRLGAGLFGGEGFILEKVQGDGLAFIHAGGVVMERTLN-NEVLRVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
                +N  I+  G +R  +FGG+ L  A + G G V+IQS+P  +L QR+A   ++ + 
Sbjct: 185 GFEPQINFSIEQAGGLRSMIFGGEGLFLATLRGTGKVWIQSMPISKLVQRLA-PYSAQSH 243

Query: 252 RENPKFFVQIAIFF 265
           +E      Q+   F
Sbjct: 244 KEGGSVLGQLGNLF 257


>gi|313676285|ref|YP_004054281.1| hypothetical protein Ftrac_2191 [Marivirga tractuosa DSM 4126]
 gi|312942983|gb|ADR22173.1| protein of unknown function DUF124 [Marivirga tractuosa DSM 4126]
          Length = 258

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 79/237 (33%), Positives = 131/237 (55%), Gaps = 14/237 (5%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSM----EMENIYIPENEVGMW 74
           Q   + ++ILG   Q+ +I L PQE V+A  G+M +M  S+    +M +   P NE  + 
Sbjct: 4   QSHEVDYKILGDGVQLVEIELDPQETVIAEAGAMVYMDDSILFETKMGDGSNP-NEGFLG 62

Query: 75  QWL-------FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAF 126
           + +        G+++      + G     V  + P    ++ +DL   +  E++ Q D F
Sbjct: 63  KLMSAGSRIFTGESLFMTHFTHQGQGKAHVAFSGPYPGTVIALDLKDHYNNEVIVQKDGF 122

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           LC+    KVS TL++++ + + G EGF+ QKL G G AF+ AGG+V++K L   E + VD
Sbjct: 123 LCAAMGTKVSITLNKKIGSGLLGGEGFIMQKLQGNGRAFVHAGGTVIEKQLN-NETLKVD 181

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             C+VA  SS++  I+ +G I+  +FGG+ +  A + G G V++QS+P  +L Q IA
Sbjct: 182 TGCVVAYESSLDFDIQTSGGIKSMLFGGEGIFLATLRGTGKVWLQSMPIRKLIQAIA 238


>gi|91793514|ref|YP_563165.1| hypothetical protein Sden_2160 [Shewanella denitrificans OS217]
 gi|91715516|gb|ABE55442.1| protein of unknown function DUF124 [Shewanella denitrificans OS217]
          Length = 264

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 78/230 (33%), Positives = 127/230 (55%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW------ 74
           + ++I+G   Q+ ++ L P E VVA  G+M ++   +  E       E + G +      
Sbjct: 8   VDYEIIGESMQLVEVELDPNETVVAEAGAMNYLEQDINFEAKMGDGSEPDSGFFGKMLGA 67

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G++I      N G     V  AAP    IL +DL+++G EL+CQ D+FL +   
Sbjct: 68  GKRALSGESIFMTHFTNQGSQKRKVAFAAPYPGTILALDLSLYGEELICQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +V+   ++R+     G EGF+ Q L G G+ F+ AGG++++K L+ GE + VD  CIV 
Sbjct: 128 TQVTMKFNRRLGTGFFGGEGFILQNLKGDGMVFVHAGGTLIKKELK-GETLRVDTGCIVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T+ ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 187 FTAGIDYDIQRAGSLKSMVFGGEGLFLATLQGHGTVWLQSLPFSRMADRI 236


>gi|66803082|ref|XP_635384.1| hypothetical protein DDB_G0291181 [Dictyostelium discoideum AX4]
 gi|60463692|gb|EAL61874.1| hypothetical protein DDB_G0291181 [Dictyostelium discoideum AX4]
          Length = 359

 Score =  130 bits (327), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 77/238 (32%), Positives = 126/238 (52%), Gaps = 2/238 (0%)

Query: 7   STPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI 66
           ST FQ       +     ++I G E+QV Q+ L P   + A  GS+  M   +EM+    
Sbjct: 91  STLFQNNQSLRGKSTSFEWEIYGLESQVVQMNLNPGTSITAETGSLLEMGSDIEMDTTAR 150

Query: 67  PENEVGMWQWLFGKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDA 125
                   +   G+ +      N   ++   +  ++P +++I+PI L+ FGGEL+CQ +A
Sbjct: 151 GGFLSSFKRMFTGQGLFLTRFINKSETEKMRITFSSPYISKIVPIKLSEFGGELICQKNA 210

Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
           FLC  + V++     +  +    G +GF+ Q+L G G+ F+ AGGS++ + L  GE I V
Sbjct: 211 FLCGEDSVEIEPRFTKNFKTGFFGGQGFILQRLVGNGMVFVHAGGSIIYRVLLPGEKIKV 270

Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
               IVA  +SV   +++   I+  +F G+ L  A +TGPG++ +QSLPF RL + IA
Sbjct: 271 TTGSIVAFETSVEYNVEFVKGIKNILF-GEGLSLATITGPGLIILQSLPFERLVRAIA 327


>gi|222098336|ref|YP_002532393.1| hypothetical protein BCQ_4678 [Bacillus cereus Q1]
 gi|221242394|gb|ACM15104.1| conserved hypothetical protein [Bacillus cereus Q1]
          Length = 226

 Score =  130 bits (326), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 78/210 (37%), Positives = 116/210 (55%), Gaps = 18/210 (8%)

Query: 52  MCFMSGSMEMENIYIPENEVGMWQWLFGKTITS------------VVLRNPGPSDGFVGI 99
           M  M   +EME I+   +  G    LFGK + +             V  N G     V  
Sbjct: 1   MMMMEDYIEMETIF--GDGSGPSGGLFGKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSF 58

Query: 100 AAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
           AAP   +I+P+DL  + G+++CQ DAFLC+   V +     +++     G EGF+ QKL 
Sbjct: 59  AAPYPGKIIPVDLTEYQGKVVCQKDAFLCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLE 118

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
           G GLAF+ AGG+V ++ L+ GE + +D  C+VA+T  VN  +++ G ++ A+FGG+ L  
Sbjct: 119 GDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDVNYDVEFVGKVKTALFGGEGLFF 178

Query: 220 AVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 179 ATLEGPGTVWIQSLTLSRLAARL----TSP 204


>gi|114563495|ref|YP_751008.1| hypothetical protein Sfri_2324 [Shewanella frigidimarina NCIMB 400]
 gi|114334788|gb|ABI72170.1| protein of unknown function DUF124 [Shewanella frigidimarina NCIMB
           400]
          Length = 262

 Score =  130 bits (326), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 82/236 (34%), Positives = 124/236 (52%), Gaps = 11/236 (4%)

Query: 17  SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMW 74
           S +   + ++I+G   Q+ ++ L P E V+A  G+M ++   +  E       E E G +
Sbjct: 2   SMKSHEVDYEIIGSSMQLVEVELDPNESVIAEAGAMTYLEEDIMFEARMGDGSEPESGFF 61

Query: 75  QWLFG--------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
             L G        + I      N G     V  AAP    IL IDLA   GEL+CQ D+F
Sbjct: 62  GKLLGAGKRAISGEGIFMTHFTNQGQQKRKVAFAAPFPGTILAIDLAQHQGELICQKDSF 121

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVD 186
           L +    +VS   ++R+     G EGF+ Q L G G+AF+ AGG++++K L  GE + VD
Sbjct: 122 LTAALGTQVSMKFNKRLGTGFFGGEGFILQSLKGDGMAFVHAGGTLIKKELH-GETLRVD 180

Query: 187 VSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             C+V  T  ++  I+  G ++  VFGG+ L  A + G G V++QSLPF R++ RI
Sbjct: 181 TGCLVGFTPGIDYDIQRAGSLKSMVFGGEGLFLATLKGHGTVWLQSLPFSRMADRI 236


>gi|441499671|ref|ZP_20981848.1| DUF124 domain-containing protein [Fulvivirga imtechensis AK7]
 gi|441436595|gb|ELR69962.1| DUF124 domain-containing protein [Fulvivirga imtechensis AK7]
          Length = 258

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 82/240 (34%), Positives = 126/240 (52%), Gaps = 20/240 (8%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I ++I G   Q+ ++ L P E V+A  G+M +M   +  E      +E    Q LF
Sbjct: 4   QSHEIDYEIKGESIQIVEVELDPNETVIAEAGAMLYMEDGIVFETKMGDGSEPN--QGLF 61

Query: 79  GKTI--------------TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF-GGELLCQP 123
           GK +              T    R  G +   V  +AP    ++PIDLA   G E++ Q 
Sbjct: 62  GKLLSAGTRMLTGESMFMTHFTYRGVGKAK--VAFSAPYPGTVIPIDLAQIPGREVIVQK 119

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           D FLC+    KV+ T ++R+ + + G EGF+ QKL G G AF+ AGG+V++K L   EV+
Sbjct: 120 DGFLCAALGTKVAITFNKRLGSGLVGGEGFILQKLQGDGKAFVHAGGTVIEKQLN-NEVL 178

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
            VD  C+VA    ++  ++  G +R  +FGG+ L  A + G G V++QS+P  +L Q +A
Sbjct: 179 RVDTGCVVAFEPGIDFNVESAGGLRSMIFGGEGLFLATLRGTGRVWLQSMPIRKLVQALA 238


>gi|226227458|ref|YP_002761564.1| hypothetical protein GAU_2052 [Gemmatimonas aurantiaca T-27]
 gi|226090649|dbj|BAH39094.1| hypothetical protein GAU_2052 [Gemmatimonas aurantiaca T-27]
          Length = 263

 Score =  130 bits (326), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 119/229 (51%), Gaps = 9/229 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-------- 74
           I +QI+G + Q   + L P E V A  G+M FM   + M     P    G          
Sbjct: 6   IDYQIIGDDLQAVIVTLDPGESVFAEAGAMMFMREGITMATTLDPNARSGSLFDKLVGAG 65

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G +    +  N G     V  AAP   +I+P++L  +GG +L Q D+FLC+   +
Sbjct: 66  KRVLAGDSFFVTLFGNAGARRSDVAFAAPYPGKIVPLNLRDWGGTVLAQKDSFLCAARGI 125

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
            +S    +R+     G EGF+ QKL G GLAF+ A G++   +L  GE + VD  C+VA 
Sbjct: 126 DISVAFTRRIGAGFFGGEGFILQKLQGDGLAFLHASGTLHAIDLAPGEQLRVDTGCLVAF 185

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             +VN  I+    I+ A+FGG+ L    +TGPG V +Q+LPF RL+ RI
Sbjct: 186 QPTVNYDIQRVPGIKTALFGGEGLFFVSLTGPGRVILQTLPFSRLADRI 234


>gi|328869835|gb|EGG18210.1| hypothetical protein DFA_03697 [Dictyostelium fasciculatum]
          Length = 354

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/227 (31%), Positives = 126/227 (55%), Gaps = 1/227 (0%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           +++   ++I G E+QV Q+ L P   + A  G++  MS S+EM+            +   
Sbjct: 96  KESTFEWEISGHESQVVQLKLDPGSSITAETGALLEMSSSVEMDTSARGGFLTSFKRMFT 155

Query: 79  GKTITSVVLRNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
           G+ +     +N   ++ G V  ++P +++IL + ++  GGEL+C+ ++FLC  N +++  
Sbjct: 156 GQGLFLTKFKNVSDNEIGRVTFSSPYISKILAVKMSELGGELICKKNSFLCGDNAIEIEP 215

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
            L +R      G EGF+ Q+L G G+AF+   GS++ ++L+ GE + V V  IVA  +SV
Sbjct: 216 QLTKRFSIGFFGGEGFILQRLKGSGMAFLHGCGSIIFRSLQPGEKVRVSVGSIVAFQTSV 275

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
              I++        F G+ L    ++GPG+V +QSLPF +L  +I+R
Sbjct: 276 EYDIEWVKGATNIFFSGEGLFLTTLSGPGLVILQSLPFEKLVNQISR 322


>gi|284039740|ref|YP_003389670.1| hypothetical protein Slin_4893 [Spirosoma linguale DSM 74]
 gi|283819033|gb|ADB40871.1| protein of unknown function DUF124 [Spirosoma linguale DSM 74]
          Length = 259

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 79/254 (31%), Positives = 134/254 (52%), Gaps = 13/254 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENI----------YIPENEVG 72
           I ++I+G + Q+ +I L P E V+A  GSM +M   +  E            ++ +    
Sbjct: 6   IDYKIIGEDIQIVEIELDPNETVIAEAGSMLYMEDGITFETKMGDGSQPDQGFLGKLLQA 65

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
             + + G+++      N G     V  +AP    I+PI++A ++G  L+ Q DAFLC+  
Sbjct: 66  GSRMVMGESLFMTHFTNRGVGKRKVAFSAPYPGTIMPINMANVYGNTLIVQKDAFLCAAL 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             K+S   +Q++   + G EGF+ +K+   G+AFI AGG V+++ L   E + VD  C+V
Sbjct: 126 GTKLSIHFNQKIGAGLFGGEGFILEKVQSDGMAFIHAGGVVIERTLN-NETLRVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
               S+N  I+  G +R  VFGG+ L  A + G G V+IQS+P  +L QR++   ++ + 
Sbjct: 185 GFEPSINFDIQRAGGLRSMVFGGEGLFLATLRGSGKVWIQSMPISKLVQRLSTG-SAQSH 243

Query: 252 RENPKFFVQIAIFF 265
           +E+     Q+   F
Sbjct: 244 KESGSVLGQLGNLF 257


>gi|47568039|ref|ZP_00238745.1| uncharacterized conserved protein CAC1537 [Bacillus cereus G9241]
 gi|47555342|gb|EAL13687.1| uncharacterized conserved protein CAC1537 [Bacillus cereus G9241]
          Length = 226

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 77/210 (36%), Positives = 116/210 (55%), Gaps = 18/210 (8%)

Query: 52  MCFMSGSMEMENIYIPENEVGMWQWLFGKTITS------------VVLRNPGPSDGFVGI 99
           M  M   +EME I+   +  G    LFGK + +             V  N G     V  
Sbjct: 1   MMMMEDYIEMETIF--GDGSGPSGGLFGKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSF 58

Query: 100 AAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
           AAP   +I+P+DL  + G+++CQ DAFLC+   V +     +++     G EGF+ QKL 
Sbjct: 59  AAPYPGKIIPVDLTEYQGKVVCQKDAFLCAAKGVSIGIEFTKKIGTGFFGGEGFIMQKLE 118

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
           G GLAF+ AGG+V ++ L+ GE + +D  C+VA+T  +N  +++ G ++ A+FGG+ L  
Sbjct: 119 GDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDINYDVEFVGKVKTALFGGEGLFF 178

Query: 220 AVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 179 ATLEGPGTVWIQSLTLSRLAARL----TSP 204


>gi|84625912|ref|YP_453284.1| hypothetical protein XOO_4255 [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|84369852|dbj|BAE71010.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 256

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 68/228 (29%), Positives = 122/228 (53%), Gaps = 14/228 (6%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW--------------QWLF 78
           Q  ++ L P E  +A  G++ +   +++M+ ++   +  G                + + 
Sbjct: 2   QFVEVELDPGESAIAEAGALMYKDAAVQMDTVFGDGSHAGQSGGGGLMGKLLSAGKRVVT 61

Query: 79  GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           G+++ + +  + G     V  AAP    +L + L+  GG+L+CQ D+FL     V +   
Sbjct: 62  GESMFTTIFTHAGSGKAKVAFAAPYPGTVLALRLSEHGGQLICQKDSFLAGARGVSLGIA 121

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
             +++   + G EGF+ QKL G G  F+ AGG+VV++ L  GE I VD  C+VA  +SV+
Sbjct: 122 FQRKILTGLFGGEGFIMQKLEGDGWVFVHAGGTVVERELGPGECIDVDTGCVVAYHASVD 181

Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           + ++    ++   FGG+ +  A +TGPG V++QSLPF R++ R+ +A 
Sbjct: 182 MDVRRVAGLKSMFFGGEGVFLATLTGPGKVWLQSLPFSRMAGRMLQAA 229


>gi|345876582|ref|ZP_08828349.1| protein of unknown function DUF124 [endosymbiont of Riftia
           pachyptila (vent Ph05)]
 gi|344226418|gb|EGV52754.1| protein of unknown function DUF124 [endosymbiont of Riftia
           pachyptila (vent Ph05)]
          Length = 233

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/218 (33%), Positives = 123/218 (56%), Gaps = 12/218 (5%)

Query: 36  QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMWQWLF--------GKTITSV 85
           ++ L P E V+A  G+M ++   +  E      +  + G++  LF        G+++ + 
Sbjct: 2   EVELDPGETVIAEAGAMTYLEQEISFETKMGDGSNPDQGIFGKLFSAGKRMITGESLFTT 61

Query: 86  VLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
              + G +    V  +AP    ++P++LA  GG+++CQ DAFLC+    K+     +++ 
Sbjct: 62  HFTHSGSAGKRKVAFSAPFPGSVIPLNLAALGGKVICQKDAFLCAALGTKLGIAFTKKLG 121

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
           + + G EGF+ Q L G G AFI AGG+VV+K L  GE + VD  C+V  T  ++  I+ +
Sbjct: 122 SGLFGGEGFILQSLEGDGTAFIQAGGTVVEKQLN-GETLRVDTGCLVGFTEGIDYSIEMS 180

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           G ++  +FGG+ +  A ++G G V+IQSLPF R++ RI
Sbjct: 181 GGLKSMLFGGEGIFLATLSGSGTVWIQSLPFSRMADRI 218


>gi|119774708|ref|YP_927448.1| hypothetical protein Sama_1571 [Shewanella amazonensis SB2B]
 gi|119767208|gb|ABL99778.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
          Length = 264

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/230 (33%), Positives = 128/230 (55%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGMW------ 74
           + ++I G   Q+ ++ L P E V+A  G+M +M   ++ E      +  E G +      
Sbjct: 8   VDYEIRGESMQLVEVELDPGETVIAEAGAMTYMEEGIQFEARMGDGSQPESGFFGALMGA 67

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
             + L G+++      N G     V  +AP    +L +D+A  GGEL+ Q D+FL +   
Sbjct: 68  GKRMLTGESLFMTHFTNQGHGKRRVAFSAPYPGTLLALDMAELGGELILQKDSFLAAALG 127

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            +++ T ++R+     G EGF+ QKL G G+AF+ AGG++++K L+ GE + VD  C+V 
Sbjct: 128 TQITMTFNKRLGTGFFGGEGFIMQKLKGDGMAFVHAGGTLIKKELK-GETLRVDTGCLVG 186

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  ++  I+  G ++  VFGG+ L  A ++G G V++QSLPF R++ RI
Sbjct: 187 FTPGIDYDIQRAGSLKSMVFGGEGLFLATLSGHGTVWLQSLPFSRMADRI 236


>gi|385800164|ref|YP_005836568.1| hypothetical protein [Halanaerobium praevalens DSM 2228]
 gi|309389528|gb|ADO77408.1| protein of unknown function DUF124 [Halanaerobium praevalens DSM
           2228]
          Length = 265

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 79/235 (33%), Positives = 128/235 (54%), Gaps = 12/235 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSM----------EMENIYIPENEVG 72
           I +QI+G + QV ++ L P+E V+A  G+M +M   +          E+ N  + +    
Sbjct: 6   IDYQIIGNDMQVVEVDLDPEETVIAEAGAMNWMDNGISFEAKMGDGTELNNSILAKVFGA 65

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
             + + G+++      N       +  AAP    I+ +DLA +   +  CQ DAFLC+  
Sbjct: 66  GKRAITGESVFMTHFTNTDYEKKKIAFAAPYPGNIIALDLAELEANQFTCQKDAFLCAAL 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             KV  T  +R  + + G EGF+ QK++G G AFI AGG+VV+K L+  E + VD  C+V
Sbjct: 126 GTKVDITFSKRFGSGLFGGEGFILQKISGDGKAFIHAGGTVVKKELKN-EKLKVDTGCLV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             T+ ++  I+  G ++  +FGG+ L  A + G G V++QSLPF RL+ RI ++ 
Sbjct: 185 GFTAGIDYNIEKAGKLKSMLFGGEGLFLATLAGTGTVYLQSLPFSRLADRIIKSA 239


>gi|89094062|ref|ZP_01167006.1| hypothetical protein MED92_02156 [Neptuniibacter caesariensis]
 gi|89081736|gb|EAR60964.1| hypothetical protein MED92_02156 [Oceanospirillum sp. MED92]
          Length = 262

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 123/230 (53%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           + ++I+G   QV ++ L P EKV+A  G M +M   +  E       E + G+   LF  
Sbjct: 6   VDYEIIGESLQVVEVELDPGEKVIAEAGMMNYMEEGISFETKMGDGSEPDQGLMGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G     V  AAP    I+P++LA FG +++CQ DAFLC+   
Sbjct: 66  GKRMVMGESVFMTHFFNEGQGKQRVAFAAPYPGSIVPLNLADFGEKVICQKDAFLCAALG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            KV     +++     G EGF+ QKL G G+AF+ AGG+VV+K L   E + VD  C+V 
Sbjct: 126 TKVEIAFSKKIGAGFFGGEGFILQKLEGDGMAFMQAGGTVVRKELN-NETLRVDTGCLVG 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  +   I+    ++  +FGG+ L  A + G G V++QSLPF RL+ R+
Sbjct: 185 FTEGIEYDIEMVKGLKSMIFGGEGLWLATLKGTGSVWVQSLPFSRLADRL 234


>gi|423584601|ref|ZP_17560688.1| TIGR00266 family protein [Bacillus cereus VD045]
 gi|401235827|gb|EJR42294.1| TIGR00266 family protein [Bacillus cereus VD045]
          Length = 217

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 67/171 (39%), Positives = 104/171 (60%), Gaps = 4/171 (2%)

Query: 79  GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           G+++   V  N G     V  AAP   +I+P+DL  + G+++CQ DAFLC+   V +   
Sbjct: 29  GESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAFLCAAKGVSIGIE 88

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
             +++     G EGF+ QKL G GLAF+ AGG+V ++ L+ GE + +D  C+VA+T  +N
Sbjct: 89  FTKKIGTGFFGGEGFIMQKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDIN 148

Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
             +++ G ++ A+FGG+ L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 149 YDVEFVGKVKTALFGGEGLFFATLEGPGTVWIQSLTLSRLAARL----TSP 195


>gi|452077091|gb|AGF93061.1| protein containing DUF124 [uncultured organism]
          Length = 225

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 70/227 (30%), Positives = 123/227 (54%), Gaps = 2/227 (0%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G   Q+  I L PQ++V    GSM +MSG++EME         GM + L G++ 
Sbjct: 1   MKYKIDGNNLQMVTIELGPQDEVYGEAGSMKYMSGNVEMETKGKGGFMKGMKRKLSGESY 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                +     +G +G A  +  +I  +DL+  G   +CQ DAFLC+   V +     ++
Sbjct: 61  FLTHFKTK-SGNGLIGFAGNTPGKIKTLDLSS-GNNYVCQKDAFLCAEEGVDLDIAFQKK 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   + G EGF+ QKL+G G+AF+   G +++K+L+ G+V+ V    + A  SSVN  I+
Sbjct: 119 LGAGLFGGEGFILQKLSGNGMAFVHVMGDLMEKDLDQGQVLRVSTGNVAAFESSVNYDIQ 178

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
             G ++ ++  G+ +    + GPG V++QS+    L+  ++R V+  
Sbjct: 179 RTGGVKSSLLSGEGIFMTTLHGPGKVWLQSMTLRDLASELSRYVSGD 225


>gi|422675984|ref|ZP_16735321.1| hypothetical protein PSYAR_24722, partial [Pseudomonas syringae pv.
           aceris str. M302273]
 gi|330973695|gb|EGH73761.1| hypothetical protein PSYAR_24722, partial [Pseudomonas syringae pv.
           aceris str. M302273]
          Length = 332

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 73/218 (33%), Positives = 115/218 (52%), Gaps = 11/218 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 115 VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 174

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 175 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 234

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 235 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 294

Query: 193 VTSSVNVQIKY-NGPIRRAVFGGDNLVTAVVTGPGIVF 229
           +T +V+  I+   G I+  +FGG+ +  A +TGPG V+
Sbjct: 295 MTQTVDYDIRMVGGGIKSMLFGGEGVFFARLTGPGTVW 332


>gi|392969555|ref|ZP_10334970.1| protein of unknown function DUF124 [Fibrisoma limi BUZ 3]
 gi|387841749|emb|CCH57028.1| protein of unknown function DUF124 [Fibrisoma limi BUZ 3]
          Length = 259

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 81/231 (35%), Positives = 128/231 (55%), Gaps = 12/231 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWL-- 77
           I ++I+G + Q+ +I L P E V+A  G+M FM   ++ E         N+  M + L  
Sbjct: 6   IDYKIIGEDIQIVEIELDPNETVIAEAGAMLFMEDGIQFETKMGDGSQPNQGFMGKLLQA 65

Query: 78  -----FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSVN 131
                 G+++      N G     V  AAP    +LP++LA ++G  L+ Q DAFLC+  
Sbjct: 66  GSRLITGESLFMTHFTNRGVGKRKVAFAAPYPGTVLPVNLATIYGNSLIVQKDAFLCAAL 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             ++S   +QR+     G EGF+ QKL G GL+FI AGG V+++ L   E++ VD  C+V
Sbjct: 126 GTRLSIHFNQRLGAGFFGGEGFILQKLQGDGLSFIHAGGVVIERQLN-NELLRVDTGCVV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           A    +N  I+ +G ++  +FGG+ L  A ++G G V+IQS+P  +L QR+
Sbjct: 185 AFEPQINFDIQRSGGLKSMIFGGEGLFVATLSGTGKVWIQSMPISKLVQRL 235


>gi|18309316|ref|NP_561250.1| hypothetical protein CPE0334 [Clostridium perfringens str. 13]
 gi|18143992|dbj|BAB80040.1| conserved hypothetical protein [Clostridium perfringens str. 13]
          Length = 187

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 68/158 (43%), Positives = 101/158 (63%), Gaps = 1/158 (0%)

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
           V  A+P   +I+P+DL  +GG+L+CQ DAF+C+   V +     +++     G EGF+ Q
Sbjct: 14  VAFASPYPGKIIPVDLRNYGGKLICQKDAFICAAKGVSIGIDFRRKLGTGFFGGEGFILQ 73

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
           KL G G+AFI AGG++V+K L  G+ + VD  C+VA+T  VN  I+Y   I+ AVFGG+ 
Sbjct: 74  KLEGDGMAFIHAGGTIVRKRLLPGQKLKVDTGCLVAMTKDVNYDIEYVKGIKNAVFGGEG 133

Query: 217 LVTAVVTGPGIVFIQSLPFHRLSQR-IARAVTSPNMRE 253
           +  A + GPG V+IQSLPF RL+ + ++ A   P  R+
Sbjct: 134 IFFASLVGPGEVWIQSLPFSRLASKVVSSAPQVPGSRD 171


>gi|255599941|ref|XP_002537348.1| conserved hypothetical protein [Ricinus communis]
 gi|223516663|gb|EEF25035.1| conserved hypothetical protein [Ricinus communis]
          Length = 203

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 69/172 (40%), Positives = 95/172 (55%)

Query: 79  GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           G+++ + V  + G     V  AAP    I+P+ L   GG L+CQ DAFLC+   V +   
Sbjct: 14  GESLFTTVFTHNGSGKARVAFAAPVPGAIVPLKLDSVGGTLICQKDAFLCAAKGVSMGIA 73

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
             +RV   + G EGF+ QKL G G  F+  GG VV++ L  GE + VD  C+ A T SV+
Sbjct: 74  FQRRVMTGLFGGEGFIMQKLDGDGWVFVQFGGMVVERELAPGEELHVDTGCVAAFTPSVD 133

Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
             +   G +R   FGG+ L  A + GPG V+IQSLPF RL+ R+  A  S  
Sbjct: 134 FDLVTAGGVRSVFFGGEGLFFARLRGPGKVWIQSLPFSRLAGRMMAAAGSHG 185


>gi|395218495|ref|ZP_10402141.1| hypothetical protein O71_17321 [Pontibacter sp. BAB1700]
 gi|394454374|gb|EJF09046.1| hypothetical protein O71_17321 [Pontibacter sp. BAB1700]
          Length = 255

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/227 (31%), Positives = 122/227 (53%), Gaps = 12/227 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENI----------YIPENEVG 72
           I F+I G + QV +I L P E V+A  G+M +M   ++ +            ++ +    
Sbjct: 7   IDFRIFGNDIQVLEIELDPNETVIAEAGAMVYMEEGIDFQTKMGDGSNPSQGFLGKLVSA 66

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVN 131
             + + G+++      + G     V  +AP    ILP+DL  F G  L+ Q DAFL +  
Sbjct: 67  GSRLITGESLFMTHFTHRGYGKSRVAFSAPYPGTILPLDLQQFNGNTLIAQKDAFLAAAL 126

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             K++   +Q++ + + G EGF+ QK++G GLAF+ AGG++V+K L   E + VD  C+V
Sbjct: 127 GTKLAMHFNQKLGSGLFGGEGFILQKISGDGLAFLHAGGTIVEKELN-NETLRVDTGCVV 185

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
           A    ++  I+  G ++  +FGG+ +  A + G G V++QS+P  +L
Sbjct: 186 AFEQGIDFSIQRAGGLKSMIFGGEGIFLATLRGTGRVWLQSMPVKKL 232


>gi|395646536|ref|ZP_10434396.1| protein of unknown function DUF124 [Methanofollis liminatans DSM
           4140]
 gi|395443276|gb|EJG08033.1| protein of unknown function DUF124 [Methanofollis liminatans DSM
           4140]
          Length = 270

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/225 (31%), Positives = 121/225 (53%), Gaps = 3/225 (1%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
           P++  + ++I+G   Q+ ++ L   E+V A  G+M  MSG+M+M++        G+ + L
Sbjct: 38  PRELTMKYEIIGDNLQMVKLTLAQGERVNAEAGAMVNMSGNMQMDSHLKGGLLGGLKRAL 97

Query: 78  FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
             +++        G S GFV  A     RI P+D+   G E L Q DA+LCS   V +  
Sbjct: 98  TNESLFLTEFTPQGES-GFVSFAGNVPGRIFPVDVT--GREFLAQKDAYLCSEQGVHLDI 154

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++R+   G EGF+ Q+L+G G AF+   G  ++ +L  GEV+ V+   +V   ++V
Sbjct: 155 AFTKKIRSGFFGGEGFILQRLSGSGTAFLHCCGDTIEMDLAAGEVVKVETGLVVGFDATV 214

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +  I+  G ++   FGG+ L    +TGPG V +QS+   +L+  +
Sbjct: 215 DYSIQMAGGVKTVFFGGEGLFLTTLTGPGKVVLQSMDIAKLASSL 259


>gi|312129436|ref|YP_003996776.1| hypothetical protein Lbys_0661 [Leadbetterella byssophila DSM
           17132]
 gi|311905982|gb|ADQ16423.1| protein of unknown function DUF124 [Leadbetterella byssophila DSM
           17132]
          Length = 259

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 81/244 (33%), Positives = 131/244 (53%), Gaps = 16/244 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME----NIYIPEN-------EV 71
           I FQI G + Q+ +I L PQE V+A  G+M FM   ++ E    +   PE        + 
Sbjct: 6   IDFQIHGEDIQIVEIELDPQETVIAEAGAMLFMEDGIQFETKLGDGTAPEQGFLGKVFQA 65

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSV 130
           G  + L G+++      N G     V  AAP    I P+DLA ++   L+ Q DAFLC+ 
Sbjct: 66  GS-RLLTGESLFMTHFTNRGYGKRKVAFAAPYPGTIKPVDLAQIYQNTLIVQKDAFLCAA 124

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
               ++   ++R+     G EGF+ Q++ G G+AFI +GG V+++ L   E + VD  CI
Sbjct: 125 LGTSINIHFNKRLGAGFFGGEGFILQRIQGDGMAFIHSGGVVIERQLS-NETLRVDTGCI 183

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           VA    ++  I+  G ++  +FGG+ +  A ++G G V++QSLP  +L QR++   + PN
Sbjct: 184 VAFEPQIHFDIQAAGNLKSMIFGGEGMFLATLSGTGRVWLQSLPISKLIQRLSS--SGPN 241

Query: 251 MREN 254
            ++ 
Sbjct: 242 TKKE 245


>gi|298712641|emb|CBJ48666.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 403

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 72/223 (32%), Positives = 119/223 (53%), Gaps = 11/223 (4%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV 85
           ++ G E+Q+    L+P + + A  G++ FM   +EM+      N  G    +F + IT  
Sbjct: 4   KVEGQESQMVTFELEPGQVIRAEAGNLVFMEDGIEMDT-----NTGGGASSMFRRVITGQ 58

Query: 86  VLR-----NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
            +      N G + G VG+     ++I+ + LA +GGE++CQ  AFL   + + +     
Sbjct: 59  NMFVTDYVNRGDASGKVGLGTDVPSKIVRLPLAAYGGEVICQRGAFLAGSHTIDIQTEFT 118

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           + +    GG EGF+ Q+LTG+G AF+ A G ++Q+ L  GE + V    +VA ++SV   
Sbjct: 119 KMMAGFFGG-EGFVLQRLTGKGDAFVKASGPLIQRELSEGETLRVSSGSVVAFSTSVQYD 177

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           ++     +  VFGG+ L    + GPG VF+Q LPF R+  +IA
Sbjct: 178 VQTIPGFKNVVFGGEGLFITKLAGPGTVFLQGLPFDRMVDQIA 220


>gi|117925123|ref|YP_865740.1| hypothetical protein Mmc1_1826 [Magnetococcus marinus MC-1]
 gi|117608879|gb|ABK44334.1| protein of unknown function DUF124 [Magnetococcus marinus MC-1]
          Length = 253

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 77/231 (33%), Positives = 123/231 (53%), Gaps = 12/231 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV------GMW-- 74
           + F+I G + Q+ ++ L P E V+A  G+MC+M  ++  E      ++       G+   
Sbjct: 16  VDFEIHGSDLQLVEVELDPGETVIAEAGTMCYMEENISFETKMGDGSQADEGFLGGLLSI 75

Query: 75  --QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVN 131
             + L G++I      N G     V  AAP   +I+P+D+     + L CQ DAFLC+  
Sbjct: 76  GKRALTGESIFMTHFSNRGSVKRRVAFAAPFPGKIIPLDMRQLPEQTLYCQRDAFLCAAL 135

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             +VS    +R+     G EGF+ QKL G G+AFI AGG +V+K LE  + + VD  C+V
Sbjct: 136 GTQVSIAFSRRLGAGFFGGEGFILQKLVGDGMAFIHAGGVIVEKKLE-NQRLRVDTGCLV 194

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             T  ++  I+  G ++  +FGG+ +    + G G V++QSLPF  L+ R+
Sbjct: 195 GFTPGIDYNIEAAGGMKSMIFGGEGIFLTTLAGTGTVWLQSLPFANLADRV 245


>gi|372223454|ref|ZP_09501875.1| hypothetical protein MzeaS_14126, partial [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 190

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 69/185 (37%), Positives = 99/185 (53%), Gaps = 10/185 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I ++I G E Q  +I L PQE VVA  GS   M   ++M+ I+      E G+   +F  
Sbjct: 6   IDYKIYGEEMQYVEIELDPQEAVVAEAGSFMMMDSGIKMDTIFGDGSNQESGVLGKIFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N       V  A+P   +ILPIDL+  GG+ +CQ DAFLC+   
Sbjct: 66  GKRILTGESLFMTAFLNVEHGKKQVSFASPYPGKILPIDLSEKGGKFICQKDAFLCAAKG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V V     +R+   + G EGF+ QKL G G+AF+ AGG++ +K L  GEV+ VD  CIV 
Sbjct: 126 VTVGIEFSKRLGRGLFGGEGFIMQKLEGDGMAFVHAGGTMAKKELAPGEVLKVDTGCIVG 185

Query: 193 VTSSV 197
            + +V
Sbjct: 186 FSQTV 190


>gi|87307766|ref|ZP_01089909.1| hypothetical protein DSM3645_22806 [Blastopirellula marina DSM
           3645]
 gi|87289380|gb|EAQ81271.1| hypothetical protein DSM3645_22806 [Blastopirellula marina DSM
           3645]
          Length = 249

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 80/247 (32%), Positives = 128/247 (51%), Gaps = 18/247 (7%)

Query: 12  PYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIP 67
           P + Q   D  I + I+G + Q+ ++ L P E V+A  G+M +M      S +M +   P
Sbjct: 4   PVLQQHSHD--IDYHIIGDDMQLVEVELDPGETVIAEAGAMNYMQEGITFSTKMGDGSDP 61

Query: 68  ENEVGMWQWLF--------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGEL 119
           +   G+   LF        G ++      N   +   V   AP   +ILPI+L+  GG L
Sbjct: 62  DQ--GLVGKLFSAGKRMVSGNSLFMGHFTNESSTKRHVAFGAPYPGKILPIELSEIGGTL 119

Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
            CQ D+FLC+    ++     +R+     G EGF+ Q LTG G AF+ A G+++ + L+ 
Sbjct: 120 TCQKDSFLCAAMGTRLDIAFQKRLGAGFFGGEGFILQHLTGDGNAFLHACGAIITRELK- 178

Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           GE + V+   +VA T  ++  I+  G ++  + GG+ L  A ++G G V +QSLPF R++
Sbjct: 179 GETLLVEAGSLVAFTQGIDYDIERAGNLKSMLLGGEGLYLAKLSGHGTVMMQSLPFSRMA 238

Query: 240 QRI-ARA 245
             I ARA
Sbjct: 239 NHILARA 245


>gi|442610563|ref|ZP_21025274.1| DUF124 domain-containing protein [Pseudoalteromonas luteoviolacea B
           = ATCC 29581]
 gi|441747780|emb|CCQ11336.1| DUF124 domain-containing protein [Pseudoalteromonas luteoviolacea B
           = ATCC 29581]
          Length = 262

 Score =  117 bits (293), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 74/230 (32%), Positives = 118/230 (51%), Gaps = 11/230 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLF-- 78
           I ++I+G   Q+ ++ L P E V+A  G+M ++   ++ E      +EV  G    LF  
Sbjct: 6   IDYKIIGESMQMVEVELDPGETVIAEAGAMNYLEDGIQFEAKMGDGSEVDQGFMGKLFSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                 G+++      N G     V  AAP    I+P+D++  G  +  Q D+FLC+   
Sbjct: 66  GKRMISGESLFMTYFTNQGLGKKHVAFAAPFPGTIIPMDMSTLGQSVYLQKDSFLCAALG 125

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            KV     +++     G EGF+ + L G G+AF  AGG+V++K L  GE + VD  C+V 
Sbjct: 126 TKVDIAFQRKLGAGFFGGEGFILEHLQGDGMAFAHAGGTVIKKELR-GETLKVDTGCVVG 184

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            T  +   I+    ++   FGG+ L  A + G G V+IQSLPF R++ R+
Sbjct: 185 FTGGIEFDIERIKGLKSMFFGGEGLFLATLRGHGTVWIQSLPFSRMADRV 234


>gi|126179804|ref|YP_001047769.1| hypothetical protein Memar_1861 [Methanoculleus marisnigri JR1]
 gi|125862598|gb|ABN57787.1| protein of unknown function DUF124 [Methanoculleus marisnigri JR1]
          Length = 228

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 115/220 (52%), Gaps = 3/220 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G   Q+ ++ L P EK  A  G+M  MSG+M+M          G+ + + G+ +
Sbjct: 1   MQYEITGDNLQMVKLRLAPGEKACAEAGAMVNMSGNMQMTTNMKGGLFKGLKRMMTGEGL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                  P   +GFV  A     +I  +DL   G E + Q DAFLCS   + +     ++
Sbjct: 61  FMTEF-TPQGGEGFVSFAGNVPGKIFTLDLN--GNEFIAQKDAFLCSEQGIDLDIAFTKK 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +R+ + G EGF+ Q+L+G G AF+   G +++  L  GEV+ V+   +V   S+V+  I+
Sbjct: 118 LRSGVFGGEGFILQRLSGSGKAFLHCCGDIMEMTLAPGEVVKVETGLVVGFESTVDYSIQ 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             G ++   FGG+ L    +TGPG V +QS+   +L+  +
Sbjct: 178 VAGGVKTVFFGGEGLFLTTLTGPGKVVMQSMDIAKLASSL 217


>gi|392396335|ref|YP_006432936.1| hypothetical protein Fleli_0680 [Flexibacter litoralis DSM 6794]
 gi|390527413|gb|AFM03143.1| TIGR00266 family protein [Flexibacter litoralis DSM 6794]
          Length = 255

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 76/232 (32%), Positives = 122/232 (52%), Gaps = 12/232 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLF-- 78
           I ++I+G + Q+ ++ L P E V+A  G+M +M  ++  E       E E G++  L   
Sbjct: 6   IDYKIIGNDIQIVEVELDPNETVIAEAGAMLYMEENINFETKMGDGSEPEKGIFGKLMSA 65

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVN 131
                 G+++        G   G V  AAP    + P+ LA      L+ Q DAFLC+  
Sbjct: 66  GARMITGESLFMTHFTQRGGEKGKVAFAAPYPGTVTPVQLASCNNNTLIVQKDAFLCAAL 125

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
             K+S  L++R  +   G EGF+ QKL G G+AF  AGG++++K L  GE + VD  CIV
Sbjct: 126 GTKISIHLNRRFGSGFFGGEGFIMQKLQGDGMAFFHAGGTIIEKELN-GESLKVDTGCIV 184

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           A    + + I+  G ++    GG+ L  A ++G G V++QS+P  +L Q ++
Sbjct: 185 AFEEGITMDIQKAGNLKSMFLGGEGLFLATLSGTGKVWLQSMPLSKLVQALS 236


>gi|224006938|ref|XP_002292429.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220972071|gb|EED90404.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 340

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 68/233 (29%), Positives = 115/233 (49%), Gaps = 2/233 (0%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
           P D  I   I G E+Q+  I L+P + + A  G+M +M+  + M          G  ++L
Sbjct: 74  PIDFDIASSIEGKESQIVTIRLEPNQVLRAESGAMMYMTSGVSMNTTTGGGMSAGFQRFL 133

Query: 78  FGKT--ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKV 135
            G+   I+           GFV + +   ++I+ + L  +GG+++CQ  A LC+ + V +
Sbjct: 134 TGQNVFISDYTYDGSNGPFGFVALGSDFPSKIVRLSLDEYGGKVICQKGALLCASHTVDI 193

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                ++      G EGF+ Q LTG G   + AGG++++K+L+ GE + V   C+V    
Sbjct: 194 QVEFTKKFSTGFFGGEGFVLQGLTGTGDVLVKAGGTLIRKDLKEGESLRVSSGCLVGFQQ 253

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
            V   +      +  +FGG+ L    + GPG V++Q +P  R+   IAR V S
Sbjct: 254 GVEYDVAMVPGFKNVLFGGEGLFLTTLKGPGTVWLQGMPPQRMISEIARRVPS 306


>gi|408673168|ref|YP_006872916.1| protein of unknown function DUF124 [Emticicia oligotrophica DSM
           17448]
 gi|387854792|gb|AFK02889.1| protein of unknown function DUF124 [Emticicia oligotrophica DSM
           17448]
          Length = 304

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 77/233 (33%), Positives = 122/233 (52%), Gaps = 14/233 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFM-----------SGSMEMENIYIPENEV 71
           I ++I G + Q+ +I L P E V+A  GSM FM            GS   ++++    + 
Sbjct: 51  IDYKIHGEDIQIVEIELDPNETVIAEAGSMLFMEDGIMFETKMGDGSQPQQSLFGKLLQA 110

Query: 72  GMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLA-MFGGELLCQPDAFLCSV 130
           G  + L G++I      N       V  AAP    I P++LA + G  L+ Q DAFLC+ 
Sbjct: 111 G-SRALMGESIFMTHFTNRSGIKRKVAFAAPYPGTIKPVNLANIMGNTLIVQKDAFLCAA 169

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
               +S   +QR+     G EGF+ +K+ G G+AFI +GG V+++ L   E + +D  C+
Sbjct: 170 MGTSISIHFNQRLGAGFFGGEGFILEKIQGDGMAFIHSGGVVIERQLN-NETLRIDTGCV 228

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           V     +   I+ +G ++  VFGG+ +  A + G G V+IQSLP  +L QR++
Sbjct: 229 VGFEPQLQFDIERSGSLKSMVFGGEGIFLATLRGTGKVWIQSLPISKLIQRLS 281


>gi|66361112|pdb|1YOX|A Chain A, Structure Of The Conserved Protein Of Unknown Function
           Pa3696 From Pseudomonas Aeruginosa
 gi|66361113|pdb|1YOX|B Chain B, Structure Of The Conserved Protein Of Unknown Function
           Pa3696 From Pseudomonas Aeruginosa
 gi|66361114|pdb|1YOX|C Chain C, Structure Of The Conserved Protein Of Unknown Function
           Pa3696 From Pseudomonas Aeruginosa
 gi|66361115|pdb|1YOX|D Chain D, Structure Of The Conserved Protein Of Unknown Function
           Pa3696 From Pseudomonas Aeruginosa
 gi|66361116|pdb|1YOX|E Chain E, Structure Of The Conserved Protein Of Unknown Function
           Pa3696 From Pseudomonas Aeruginosa
 gi|66361117|pdb|1YOX|F Chain F, Structure Of The Conserved Protein Of Unknown Function
           Pa3696 From Pseudomonas Aeruginosa
          Length = 250

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 76/233 (32%), Positives = 121/233 (51%), Gaps = 11/233 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-----MW--- 74
           + ++ILG   Q  +I L P E V+A  G+  + +G +        +   G     +W   
Sbjct: 8   LDYRILGESXQTVEIELDPGETVIAEAGAXNYXTGDIRF-TARXGDGSDGSLLGKLWSAG 66

Query: 75  -QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + L G+++      N G     V  AAP    ++ +DL   GG L CQ D+FLC+    
Sbjct: 67  KRKLGGESVFXTHFTNEGQGKQHVAFAAPYPGSVVAVDLDDVGGRLFCQKDSFLCAAYGT 126

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           +V     +R+     G EGF+ QKL G GL F+ AGG+++++ L  GE + VD  C+VA 
Sbjct: 127 RVGIAFTKRLGAGFFGGEGFILQKLEGDGLVFVHAGGTLIRRQLN-GETLRVDTGCLVAF 185

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           T  ++  ++  G ++  +FGG+ L+   + G G V++QSLPF RL+ RI  A 
Sbjct: 186 TDGIDYDVQLAGGLKSXLFGGEGLLLTTLKGSGTVWLQSLPFSRLAGRIYDAT 238


>gi|167628246|ref|YP_001678745.1| hypothetical protein HM1_0109 [Heliobacterium modesticaldum Ice1]
 gi|167590986|gb|ABZ82734.1| conserved hypothetical protein [Heliobacterium modesticaldum Ice1]
          Length = 240

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 74/244 (30%), Positives = 133/244 (54%), Gaps = 7/244 (2%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + +Q+LG   Q   I L+P E++    G+M +MS ++ M+  +       + + L G+++
Sbjct: 1   MRYQVLGTTMQALNIDLEPGERIYTESGAMTWMSSNVRMDTNFKGGLLKSLGRALSGESL 60

Query: 83  TSVVLRN-PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           T        GP++  +G    +  ++LP+++   G EL+CQ D+FLC+   V++S    +
Sbjct: 61  TYTFFEAVGGPAN--IGFTPSAPGKVLPVEVER-GYELICQRDSFLCAQESVEMSIFFQR 117

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++     G EGF+ QKL+G+G+AF+   G V++K L  GE + VD   +  + S+V++ I
Sbjct: 118 KLGAGFFGGEGFIMQKLSGRGMAFVEIDGEVIEKKLAPGERLMVDTGHVAMIESTVSMDI 177

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
           +    ++ A+FGG+ L    +TGPG V+IQ+L    L+ RI   +     RE+      +
Sbjct: 178 QMVKGMKNAIFGGEGLFLTSLTGPGRVWIQTLTLANLAGRI---LAHAGKRESSGGIGSL 234

Query: 262 AIFF 265
             FF
Sbjct: 235 TDFF 238


>gi|374855197|dbj|BAL58060.1| hypothetical conserved protein [uncultured Chloroflexi bacterium]
          Length = 250

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 84/248 (33%), Positives = 130/248 (52%), Gaps = 18/248 (7%)

Query: 7   STPFQP---YVYQSPQDAV-IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME 62
           +TP  P    + QS +    + ++I+G   Q   + L P E V +  G M +MSG++ M+
Sbjct: 3   ATPLNPPDLEIAQSGRGVTGMEYKIIGSILQAVILELDPGETVYSESGGMAWMSGNIRMQ 62

Query: 63  NIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQ 122
                       + L G++I  V   + G S GFV  A+    +I+P+ L   G E++CQ
Sbjct: 63  TATRGGLGGVFKRALSGESIFLVEFTSQG-SKGFVAFASDFPGKIIPLHLTE-GQEIICQ 120

Query: 123 PDAFLCSVNDVKVSNTLDQRVRNVIG----GIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
             AFLC+   V    +LD   R  +G    G EGF+ QKL+G G+AF+   G +++ NL 
Sbjct: 121 KQAFLCAEKTV----SLDIHFRRKLGVGLFGGEGFILQKLSGPGVAFVCLDGEIIEYNLG 176

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
            GEV+ VD   I     SV+  I+    +   +FGG+ L  AV+ GPG V++QS+P    
Sbjct: 177 PGEVLKVDTGHIGMYEPSVSFDIEMVKGVTNLLFGGEGLFLAVLRGPGRVWLQSMP---- 232

Query: 239 SQRIARAV 246
           +Q +ARA+
Sbjct: 233 TQNLARAI 240


>gi|355677720|ref|ZP_09060487.1| hypothetical protein HMPREF9469_03524 [Clostridium citroniae
           WAL-17108]
 gi|354812806|gb|EHE97420.1| hypothetical protein HMPREF9469_03524 [Clostridium citroniae
           WAL-17108]
          Length = 225

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 66/228 (28%), Positives = 120/228 (52%), Gaps = 3/228 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++ILG      ++ L+  E +  + G M +MS  + +++        G+ +   G+++
Sbjct: 1   MDYRILGETLPAVEVRLQTGEAMYTQSGGMAWMSDGLALDSNIKGGLMKGIGRMFTGESL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                    P D F+  A+    +ILP+D+      L+CQ  AFLC+   V+VS  L ++
Sbjct: 61  FMATYTASRP-DCFIAFASTVPGKILPVDMGKT--SLICQKGAFLCAQPTVEVSTVLTKK 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +     G EGF+ Q+L G G+AF+   G V+++ L  GEVI VD   + A   +V+ +I+
Sbjct: 118 LSAGFFGGEGFILQQLKGSGMAFLEVDGDVIERTLAPGEVIKVDTGNVFAFEPTVSYEIE 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
               ++  +FGG+ L    +TGPG +++Q++     + RIA+ + S N
Sbjct: 178 TIKGVKNILFGGEGLFLTKLTGPGKIYMQTMNIAEFTGRIAKGLPSSN 225


>gi|432330883|ref|YP_007249026.1| TIGR00266 family protein [Methanoregula formicicum SMSP]
 gi|432137592|gb|AGB02519.1| TIGR00266 family protein [Methanoregula formicicum SMSP]
          Length = 232

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 66/230 (28%), Positives = 121/230 (52%), Gaps = 2/230 (0%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV 85
           +I+G   Q+ +I L P E + A  G+M  M+GS  ME+        G+ + + G+++  +
Sbjct: 4   EIIGSNLQMVKIDLLPGEGIFAEAGAMVNMTGSFTMESQLKGGILSGIKRAVVGESVF-L 62

Query: 86  VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
              +PG   GFV  A     +I P+ ++    E + + D+FLC    V++     ++++ 
Sbjct: 63  TRFSPGAGAGFVSFAGTMPGKIFPVSVSP-DREFISRKDSFLCCHEGVELDIAFTRKIQA 121

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
            + G EGF+ Q++TGQG  F+   G +++ +L+ GEV+ V    +V    +V+  I   G
Sbjct: 122 GLFGGEGFVLQRMTGQGTVFLHCCGDIIELDLKPGEVVKVQTGLVVGFDHTVSYDIALAG 181

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENP 255
            I  A+FGG+ L    +TGPG V +QS+   +++  +   +  P ++  P
Sbjct: 182 GITTALFGGEGLFVTTLTGPGKVILQSMNLEKIAASLIPYLPKPEVKVKP 231


>gi|219129648|ref|XP_002184995.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217403490|gb|EEC43442.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 225

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 62/225 (27%), Positives = 118/225 (52%), Gaps = 2/225 (0%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--IT 83
           ++ G E+Q+  + L+P + + A  G+M +M+  ++M          G  + L G+   I+
Sbjct: 1   KVEGNESQILTVALEPGQVLRAESGAMMYMTDGVQMNTTSGGGLSSGFQRMLTGQNFFIS 60

Query: 84  SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
                    + G V +     ++I+ ++L  +GG+++CQ  A LC+ + V +     + +
Sbjct: 61  DYSFEGEPGTVGHVALGTDFPSKIVRLNLEEYGGKIVCQKGALLCASHTVDIQMEYSKNM 120

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
                G EGF+ Q L+G+G  F+ AGG++++++L+ GE + +   C+VA T+ V+  ++ 
Sbjct: 121 STGFFGGEGFVLQGLSGEGDVFVKAGGTLIRRDLKEGEQLRISSGCLVAFTNGVDYDVQM 180

Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
                    GG+ L    +TGPG+V++Q  P  R+   IAR V S
Sbjct: 181 VKGFANVFAGGEGLFMTTLTGPGVVWLQGQPAQRMISEIARRVPS 225


>gi|289674960|ref|ZP_06495850.1| hypothetical protein PsyrpsF_16953, partial [Pseudomonas syringae
           pv. syringae FF5]
          Length = 267

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 99/193 (51%), Gaps = 10/193 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 70  VDFKLYGTETQFIELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 129

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 130 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 189

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 190 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGEALDVDTGCLAA 249

Query: 193 VTSSVNVQIKYNG 205
           +T +V+  I+  G
Sbjct: 250 MTQTVDYDIRMVG 262


>gi|397626028|gb|EJK68002.1| hypothetical protein THAOC_10872 [Thalassiosira oceanica]
          Length = 392

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 69/233 (29%), Positives = 117/233 (50%), Gaps = 2/233 (0%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
           P D      + G E+Q+  I L+P + + A  G+M +MS  + M          G+ + L
Sbjct: 129 PIDFDTSTTVEGRESQIATIRLEPGQVLRAESGAMMYMSQGVNMNTTTGGGLGAGLKRAL 188

Query: 78  FGKT--ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKV 135
            G+   I+       G   G V +     ++I+ ++L ++GG+++CQ  A LC+ + V +
Sbjct: 189 TGQNLFISDYTYDGSGGPFGIVALGTDFPSKIVRLNLQVYGGKIVCQQGALLCASHTVDI 248

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                +++     G EGF+ Q LTG G   + AGG++++K+L  GE + +   C+V    
Sbjct: 249 DIEFTKKLSTGFFGGEGFVLQGLTGTGDVMVKAGGTLIRKDLREGEELRISSGCLVGYQD 308

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
            V   IK     +  +FGG+ L  +V+ GPG V++Q  P  R+   IAR V S
Sbjct: 309 GVEFDIKMLDGYKNVLFGGEGLFVSVLRGPGTVWLQGQPPQRMVSEIARRVPS 361


>gi|348676356|gb|EGZ16174.1| hypothetical protein PHYSODRAFT_509459 [Phytophthora sojae]
          Length = 322

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 68/221 (30%), Positives = 116/221 (52%), Gaps = 1/221 (0%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           I G ++QV  I L+P++ + A  G+M +M+  +EME         GM + + G+      
Sbjct: 3   IEGHDSQVVHIELEPEQCLRAETGAMIYMTEGVEMETTTAGGFGEGMKRMMTGENFFVSR 62

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRN 145
               G   G V +     ++I+ + L+ F GE ++CQ  AFLC  + V +     ++   
Sbjct: 63  FTYHGSEKGKVALGTSFPSKIVHLRLSDFLGESIICQKGAFLCGSDTVNIEMEFAKKFGV 122

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
              G EGF+ Q+LTG G   + A G+++++ L+ GEV+ +   C+VA   SV+  I    
Sbjct: 123 GFFGGEGFILQRLTGSGDVLVRASGTLIERELQPGEVLRISSGCLVAFEPSVHYDITMLK 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             +  +FGG+ L    +TGPG +++QSLPF R+   +A  +
Sbjct: 183 GAKNVLFGGEGLFVTTLTGPGKIYLQSLPFDRVVGEMASRI 223


>gi|422405471|ref|ZP_16482514.1| hypothetical protein Pgy4_16829, partial [Pseudomonas syringae pv.
           glycinea str. race 4]
 gi|330880405|gb|EGH14554.1| hypothetical protein Pgy4_16829 [Pseudomonas syringae pv. glycinea
           str. race 4]
          Length = 307

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/193 (32%), Positives = 99/193 (51%), Gaps = 10/193 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYI--PENEVGMWQWLFG- 79
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LFG 
Sbjct: 109 VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 168

Query: 80  -------KTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVND 132
                  +++ + V    G   G V  AAP    ILP++L  FGG+L+CQ D+FL     
Sbjct: 169 GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGKLICQKDSFLAGAKG 228

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V +     +++   + G EGF+ QKL G G  F+  GG+V +  L  GE + VD  C+ A
Sbjct: 229 VSIGIQFQKKILTGLFGGEGFILQKLEGDGWVFVHMGGTVRKIELAAGESLDVDTGCLAA 288

Query: 193 VTSSVNVQIKYNG 205
           +T +V+  I+  G
Sbjct: 289 MTQTVDYDIRMVG 301


>gi|163792500|ref|ZP_02186477.1| hypothetical protein BAL199_16673 [alpha proteobacterium BAL199]
 gi|159182205|gb|EDP66714.1| hypothetical protein BAL199_16673 [alpha proteobacterium BAL199]
          Length = 236

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 72/213 (33%), Positives = 111/213 (52%), Gaps = 2/213 (0%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-MWQWLFGKT 81
           I + I G  + +  I L P E V A  G+M  M   +EM +  +P    G + + + G+T
Sbjct: 6   IDYTIEGDFSPMLTITLDPGEAVQAEAGTMVMMEPDIEM-STEMPGGFFGSILRKVSGET 64

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                  N G +   V  A+P   +I P+DLA  GG   CQ  A+L S   + +S  L +
Sbjct: 65  FFMTFFTNTGRTRRSVSFASPMPGQIRPLDLAAEGGAFYCQRRAYLASAKGIGISIALTK 124

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           R+ + + G EG + QKL G G AF+ AGG++V++ L  GE I+V    +V  +++ +  I
Sbjct: 125 RLSSGLFGGEGLILQKLEGDGWAFLAAGGTLVERQLAKGEKISVSTGSLVGFSATCDYDI 184

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
                I+  +FGG+ L    VTGPG V +Q+ P
Sbjct: 185 SIQRGIKNMLFGGEGLFVTHVTGPGKVVVQTQP 217


>gi|303271207|ref|XP_003054965.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462939|gb|EEH60217.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 247

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 70/224 (31%), Positives = 122/224 (54%), Gaps = 16/224 (7%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENEVGMWQWLFGKT 81
           Q++G +AQ+ Q+ L P E + A PG+MC+MS ++     +E  ++     G  + L G+ 
Sbjct: 15  QVMGTDAQLVQVELSPGESLSAEPGAMCYMSANVRSITRLEGGFL----AGCSRLLAGEP 70

Query: 82  ITSVVLRNPGP--SDGFVGIAAPSLA-RILPIDLAMFGGELLCQPDAFLCSVNDVKV--S 136
                 +N G   SDG++ +A      +I+ +D+   G E LC  D++LCS   V++  +
Sbjct: 71  FFLNNFQNVGKDGSDGYIALAGRREGDKIIVLDMDQLGNEFLCARDSYLCSKGTVEIGAA 130

Query: 137 NTLDQR---VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           +TL Q    +R  +        QKL+G G+A I   G+V+++ L+ G+ + VD   +V  
Sbjct: 131 STLMQGQMGLRLFLSNQNTVFMQKLSGSGVACISGNGTVIRQELKPGQEMVVDARAVVGF 190

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
            +S+  Q++       A+FGG+ L  A ++GPG+ ++QSLP  R
Sbjct: 191 QNSIGYQLRMMSSPLAALFGGEGLFFARLSGPGVFYLQSLPAAR 234


>gi|294667625|ref|ZP_06732839.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292602620|gb|EFF46057.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 167

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 52/140 (37%), Positives = 85/140 (60%)

Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
           +L + L+  GG+L+CQ D+FL     V +     +++   + G EGF+ QKL G G  F+
Sbjct: 1   MLALRLSEHGGQLICQKDSFLAGARGVSLGIAFQRKILTGLFGGEGFIMQKLEGDGWVFV 60

Query: 167 LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPG 226
            AGG+VV++ L  GE I VD  C+VA  + V++ ++    ++   FG + +  A +TGPG
Sbjct: 61  HAGGTVVERELGPGERIDVDTGCVVAYHAGVDMDVRRVAGLKSMFFGSEGVFLATLTGPG 120

Query: 227 IVFIQSLPFHRLSQRIARAV 246
            V++QSLPF R++ R+ +A 
Sbjct: 121 KVWLQSLPFSRMAGRMLQAA 140


>gi|239623794|ref|ZP_04666825.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47_FAA]
 gi|239521825|gb|EEQ61691.1| conserved hypothetical protein [Clostridiales bacterium 1_7_47FAA]
          Length = 225

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 66/228 (28%), Positives = 119/228 (52%), Gaps = 3/228 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++ILG    V ++ L   E +  + G M +MS  + +++        G+ +   G+++
Sbjct: 1   MDYRILGETLPVVEVRLNAGEAMYTQSGGMAWMSEGLTLDSNIKGGLMKGIGRMFTGESL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                    P D  +  A+    RILP+D++     L+CQ  AFLC+   V+VS    ++
Sbjct: 61  FMATYTAVRP-DCIIAFASTVPGRILPVDMSRT--SLICQKGAFLCAQPTVEVSTVFTKK 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +     G EGF+ Q+L G G+AF+   G VV+K L  GEV+ VD   + A   +V+ +I+
Sbjct: 118 LSAGFFGGEGFILQQLKGSGMAFLEVDGDVVEKTLAPGEVMKVDTGNVFAFEPTVSYEIE 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
               ++  +FGG+ L    +TGPG +++Q++     + RIA+ + + N
Sbjct: 178 TIKGVKNMLFGGEGLFLTRLTGPGKIYMQTMNIAEFTGRIAQGLPTSN 225


>gi|116624027|ref|YP_826183.1| hypothetical protein Acid_4939 [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116227189|gb|ABJ85898.1| protein of unknown function DUF124 [Candidatus Solibacter usitatus
           Ellin6076]
          Length = 259

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 83/241 (34%), Positives = 128/241 (53%), Gaps = 10/241 (4%)

Query: 17  SPQDAVIP--FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENE 70
           +P  AV P  + I G   QV ++ LK  ++V A  G M + + +++    M    I E  
Sbjct: 6   APAPAVTPLDYTIQGDNLQVARVRLKAGQEVYAEAGKMVYKTANVQWDTRMSGATIGEKL 65

Query: 71  VG-MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           +G + + + G+++     R  G  DG VG A     RI   +LA  G  L+ Q DAFL +
Sbjct: 66  MGALRRTVTGESLFLTYFRANG--DGEVGFAGNYPGRIQVFELAA-GRSLIAQRDAFLFA 122

Query: 130 VNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSC 189
              V+ +  L +++     G EGF+ +K TG G  FI AGG  V+ +L  G+VI VD  C
Sbjct: 123 QPTVQFNIALVKKLGAGFFGGEGFILEKFTGPGAVFIHAGGDFVEFDLAPGQVIQVDTGC 182

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           IVA   +V+  I+  G I+ A+FGG+ L  A +TGPG V IQS+  +++ + +A   T  
Sbjct: 183 IVAFDETVDYDIQLAGGIKTALFGGEGLFLATLTGPGKVVIQSMTLNKMRRELAPGRTGG 242

Query: 250 N 250
           +
Sbjct: 243 D 243


>gi|255080806|ref|XP_002503976.1| predicted protein [Micromonas sp. RCC299]
 gi|226519243|gb|ACO65234.1| predicted protein [Micromonas sp. RCC299]
          Length = 540

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 68/220 (30%), Positives = 115/220 (52%), Gaps = 8/220 (3%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV 85
           +ILG +AQ+ Q+ LK  E++ A PG+MC+MS ++            G+ + L G+     
Sbjct: 308 RILGTDAQMVQVDLKAGEEMSAEPGAMCYMSSNVRSITSLQGGVFAGISRVLAGEPFFLN 367

Query: 86  VLRNPGPS--DGFVGIAAPSLA-RILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL--- 139
             +N      DG++ +A      +I+ +DL  FGGE+LC  D++LCS  +V +       
Sbjct: 368 SFKNIANDGRDGYIALAGRREGDKIIVLDLDAFGGEILCARDSYLCSKGEVSIDAATTFT 427

Query: 140 --DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
                +R  +        QKL+G GLA I   G+V++++L  G+ + VD   + A + ++
Sbjct: 428 RGQMGLRLFLSNQNTIFMQKLSGSGLACISGNGTVIRQDLLEGQEMVVDARAVCAFSKTI 487

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
             Q++       A+FGG+ L  A ++GPG  ++QSLP  R
Sbjct: 488 GYQLRLMSSPLAAIFGGEGLFFARLSGPGTFYLQSLPAAR 527


>gi|301122111|ref|XP_002908782.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099544|gb|EEY57596.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 322

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 70/224 (31%), Positives = 116/224 (51%), Gaps = 5/224 (2%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           I G ++QV  I L+P++ + A  G+M +M+  +EME         GM + + G+      
Sbjct: 3   IEGHDSQVVHIELEPEQCLRAETGAMIYMTDGVEMETTTAGGLGEGMKRMMTGENFFVSR 62

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRN 145
               G   G V +     ++I+ + L  F GE ++CQ  AFLC  + V +     ++   
Sbjct: 63  FTYHGSEKGKVALGTSFPSKIVHLRLNDFLGESIICQKGAFLCGSDTVNIEMEFAKKFGV 122

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
              G EGF+ Q+LTG G   + A G+++++ L+ GEV+ +   C+VA   SV+  I    
Sbjct: 123 GFFGGEGFILQRLTGSGDVLVRATGTLIERELQPGEVLRISSGCLVAFEPSVHYDITMMK 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR----LSQRIARA 245
             +  +FGG+ L    +TGPG +++QSLPF R    ++ RI R 
Sbjct: 183 GAKNVLFGGEGLFVTTLTGPGKIYLQSLPFERVVGEMAARIPRG 226


>gi|219850466|ref|YP_002464899.1| hypothetical protein Cagg_3626 [Chloroflexus aggregans DSM 9485]
 gi|219544725|gb|ACL26463.1| protein of unknown function DUF124 [Chloroflexus aggregans DSM
           9485]
          Length = 316

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 123/236 (52%), Gaps = 2/236 (0%)

Query: 9   PFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPE 68
           P  P V      + + F+I+G   Q   + + P + V +  G M +MS +++M+   +  
Sbjct: 60  PEPPVVGSGVGASGLRFKIIGTTMQAVVLEVPPGQTVFSERGGMSWMSANVQMQT-NMEG 118

Query: 69  NEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
              G ++ +F      +V   P    G +G +A    +I+P++LA  G  ++CQ DAF+C
Sbjct: 119 GLGGAFKRMFSGESIFMVNFTPQGGPGIIGFSAEFPGKIVPLNLAP-GQVMICQKDAFMC 177

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
           +   V +     +R+   + G EGF+ QKLTG GLAF+   G +++  LE  +++ VD  
Sbjct: 178 AERSVSLDIHFRRRLGAGLFGGEGFIMQKLTGPGLAFVELDGEIIEYTLEANQMLKVDTG 237

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
            +     +V + I+     +  +FGG+ L    + GPG V++Q++P   L+++IA+
Sbjct: 238 HVAMYEPTVQLDIEMVRGFKNILFGGEGLFLTTLRGPGRVWLQTMPAMNLAKKIAQ 293


>gi|374628526|ref|ZP_09700911.1| protein of unknown function DUF124 [Methanoplanus limicola DSM
           2279]
 gi|373906639|gb|EHQ34743.1| protein of unknown function DUF124 [Methanoplanus limicola DSM
           2279]
          Length = 238

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 65/221 (29%), Positives = 115/221 (52%), Gaps = 5/221 (2%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT- 81
           + ++I+G   Q   + + P E + A  G+M +MSG++ ME+        G+ + L G++ 
Sbjct: 1   MDYKIIGDNLQFVNLAINPGELIYAEAGAMVYMSGNINMESKLQGGLFTGIKRKLAGESF 60

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           + S      GP +   G   P   +I+  D+    G+L+ Q DAFLC+   VK      +
Sbjct: 61  MVSHFTSTGGPGNVAFGGNCP--GKIITRDMG--DGDLIVQKDAFLCAEESVKWEIAFQK 116

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++ +   G EG + + L+G GL F  A G +V+ +L+ G+V  V  + +V    +V   I
Sbjct: 117 KLGSTFFGGEGLILEHLSGSGLVFCHAAGDIVEIDLKAGQVYKVSTAHVVGWEPTVQYDI 176

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +  G ++ A+FGG+      +TGPG V+IQS+   +L+  +
Sbjct: 177 QAAGGVKTALFGGEGFFVTTLTGPGRVYIQSMTLPQLANAL 217


>gi|452077090|gb|AGF93060.1| protein containing DUF124 [uncultured organism]
          Length = 253

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/224 (29%), Positives = 116/224 (51%), Gaps = 2/224 (0%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G   Q+  + L PQ++V    GSM +MSG++EME         G+ + L G++ 
Sbjct: 10  MKYKIDGDNLQMVTLQLGPQDEVYGEAGSMKYMSGNVEMETKGKGGFMKGVKRKLSGESY 69

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                +     DG VG A  +  +I  +DL+  G   + Q DAFLC+   V +     ++
Sbjct: 70  FLTHFKTK-SGDGLVGFAGNTPGKIKTLDLSS-GKNYIAQQDAFLCAEESVDLDIAFQKK 127

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +     G EGF+ QKL+G G AFI   G ++ KNL+ G+V+ V    + A   +V+  I+
Sbjct: 128 LGAGFFGGEGFILQKLSGNGKAFIHITGDMMTKNLKEGQVLKVSTGNVAAFEDTVDYDIQ 187

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             G ++ ++  G+ +    + GPG V++QS+    L+  + + V
Sbjct: 188 RTGGVKSSLLSGEGIFMTTLRGPGKVWLQSMTLRDLAAELGKYV 231


>gi|397779696|ref|YP_006544169.1| hypothetical protein BN140_0530 [Methanoculleus bourgensis MS2]
 gi|396938198|emb|CCJ35453.1| putative protein M6_Spy0233 [Methanoculleus bourgensis MS2]
          Length = 229

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 67/217 (30%), Positives = 113/217 (52%), Gaps = 3/217 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G   Q+  + L P E   A  G+M  MSG+M+M          G+ + + G+++
Sbjct: 1   MQYEITGDNLQMVTLRLAPGEAACAEAGAMVNMSGNMQMSTNMKGGLFGGLKRMVTGESL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                  P   +GFV  A     +I  +DL    GE + Q DAFLCS   + +     +R
Sbjct: 61  FMTEF-TPKGGEGFVSFAGNVPGKIFTLDLTE--GEFIAQKDAFLCSEQGIDLDVAFAKR 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +R+ + G EGF+ Q+L+G+G AF+   G +++  L  GE++ V+   +V   S+V+  I 
Sbjct: 118 LRSGVFGGEGFILQRLSGRGTAFLHCCGDIMEMTLAPGEMVRVETGLVVGFESTVDYNIA 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
             G ++   FGG+ L    +TGPG V +QS+   +L+
Sbjct: 178 LAGGVKTVFFGGEGLFLTTLTGPGRVVLQSMNIAKLA 214


>gi|219117842|ref|XP_002179708.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408761|gb|EEC48694.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 298

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 74/233 (31%), Positives = 116/233 (49%), Gaps = 13/233 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME---NIYIPEN---EVGMWQWLF 78
           + I G + QV    + P E+VV   GS  FM GS++M+    + +        G+ + L 
Sbjct: 55  WMISGHDMQVLTTTVPPGEQVVTEIGS--FMYGSVDMKTSVELTLCSRVGFAGGLSRILG 112

Query: 79  GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF---GGELLCQPDAFLCSVNDVKV 135
           G++   V+L N    +G+VGI     A+I+PI           L+ Q  A++  + DV V
Sbjct: 113 GESCVKVMLTNASNEEGYVGITPNFPAKIVPIKFGHHISPDHALISQGGAYMSHLGDVNV 172

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTG--QGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
              LD  +R       G  RQK+TG  + +AF+ AGG+++ + L   E ITVD   +VA+
Sbjct: 173 GCDLDASLRTCCCAGFGCFRQKITGSDESIAFLAAGGTLIYRYLSENETITVDSRSVVAM 232

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             +V + I  NG I     GG+   +  +TGPG VF+QS+ F +    +   V
Sbjct: 233 EETVTLGIAPNGRICMCCLGGEGCFSTTLTGPGKVFMQSMNFQKFRDAVQTTV 285


>gi|326434560|gb|EGD80130.1| hypothetical protein PTSG_10404 [Salpingoeca sp. ATCC 50818]
          Length = 381

 Score =  104 bits (259), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 63/216 (29%), Positives = 116/216 (53%), Gaps = 1/216 (0%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTIT 83
           +QI+G  +Q+ +++L P + V A PG+M  MS  ++ + ++     +        G++  
Sbjct: 150 WQIVGDGSQILEVVLNPNDAVTAEPGTMVHMSSGLKPDVHLGGGCGQACTRSCCAGESFF 209

Query: 84  SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
            V  +N   +   +G+     A+I+P++LA FGG +  +  AF+C+ N     +     +
Sbjct: 210 RVKYQNNTETTQHIGLTPNFPAKIVPVNLAEFGGRITIKSRAFMCAHNCEPTFSYRFAGI 269

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
                G +GF+   L+G G+ F+ A G+V+ + LE GE +  D S +VA  S+VN  ++ 
Sbjct: 270 GAGCCGGQGFILNDLSGDGMVFLNASGTVLMRKLEAGEELICDQSSVVAFQSTVNYAVRR 329

Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
            G +     GG+ L  +V+TGPG V +QS+P  +++
Sbjct: 330 AGGLLMCCCGGEGLFNSVLTGPGTVVLQSMPIEKIA 365


>gi|429761339|ref|ZP_19293767.1| TIGR00266 family protein [Anaerostipes hadrus DSM 3319]
 gi|429184164|gb|EKY25195.1| TIGR00266 family protein [Anaerostipes hadrus DSM 3319]
          Length = 243

 Score =  102 bits (255), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 68/228 (29%), Positives = 119/228 (52%), Gaps = 12/228 (5%)

Query: 17  SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
           S ++ ++ ++ILG    V    L+  EK+V   G M +MS +M+ME         G    
Sbjct: 11  SKEENIMRYEILGETLPVVICHLESGEKMVTEGGGMSWMSPNMKMET-----TSNGGIGK 65

Query: 77  LFGKTIT-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVN 131
           +FG+  +   + +N   + G  G+ A   A   P  +  F    G E++ Q  A+L   N
Sbjct: 66  IFGRAFSGEKMFQNVYTAQGGEGMIA--FASCFPGSIKAFEIAPGQEMILQKSAYLAGEN 123

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++S   ++++   + G EGF+ QK +G G+ F    G VV+  LE GE I VD   + 
Sbjct: 124 GVELSAFFNKKIGAGLFGGEGFIMQKASGHGIMFAEFDGHVVEYELEAGEQIVVDTGHLA 183

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           A+T+S  ++IK    ++  +FGG+ +   ++TGPG V++Q++P   ++
Sbjct: 184 AMTASCTMEIKSVPGVKNMLFGGEGVFNTILTGPGKVWLQTMPISNVA 231


>gi|307352226|ref|YP_003893277.1| hypothetical protein Mpet_0058 [Methanoplanus petrolearius DSM
           11571]
 gi|307155459|gb|ADN34839.1| protein of unknown function DUF124 [Methanoplanus petrolearius DSM
           11571]
          Length = 230

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 65/225 (28%), Positives = 117/225 (52%), Gaps = 13/225 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW----LF 78
           + + I G   Q   + ++  E V A  G+M +MS +MEM+     E E G+++     L 
Sbjct: 1   MKYSITGDNLQFVTMEIENAETVWAEAGAMVYMSANMEMK----AELEGGLFKGIKRKLA 56

Query: 79  GKT-ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
           G++ + +    + GP  G V     +  +I  +D+A  G E + Q DAFLC+   V    
Sbjct: 57  GESFMVTNFTSHGGP--GVVAFGGNAPGKIYEMDIA--GKEFIAQKDAFLCAEKSVNWEI 112

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++ +   G EG + +K++G G+ F  A G +V+ +L+ G+   +  + +V   SSV
Sbjct: 113 AFQKKLGSTFFGGEGLILEKISGSGMVFFHACGDLVEIDLKPGQTYKIATAHVVGWESSV 172

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           N  I+  G I+ ++FGG+      +TGPG + IQS+   +L++ +
Sbjct: 173 NYDIQAAGGIKTSMFGGEGFFLTTLTGPGKIVIQSMTLEQLAESL 217


>gi|167766978|ref|ZP_02439031.1| hypothetical protein CLOSS21_01495 [Clostridium sp. SS2/1]
 gi|167710953|gb|EDS21532.1| TIGR00266 family protein [Clostridium sp. SS2/1]
          Length = 243

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 68/228 (29%), Positives = 119/228 (52%), Gaps = 12/228 (5%)

Query: 17  SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
           S ++ ++ ++ILG    V    L+  EK+V   G M +MS +M+ME         G    
Sbjct: 11  SKEENIMRYEILGETLPVVICYLESGEKMVTEGGGMSWMSPNMKMET-----TSNGGIGK 65

Query: 77  LFGKTIT-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVN 131
           +FG+  +   + +N   + G  G+ A   A   P  +  F    G E++ Q  A+L   N
Sbjct: 66  IFGRVFSGEKMFQNVYTAQGGKGMIA--FASCFPGSIKAFEIAPGQEMILQKSAYLAGEN 123

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V++S   ++++   + G EGF+ QK +G G+ F    G VV+  LE GE I VD   + 
Sbjct: 124 GVELSAFFNKKIGAGLFGGEGFIMQKASGHGIMFAEFDGHVVEYELEAGEQIVVDTGHLA 183

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           A+T+S  ++IK    ++  +FGG+ +   ++TGPG V++Q++P   ++
Sbjct: 184 AMTASCTMEIKSVPGVKNMLFGGEGVFNTILTGPGKVWLQTMPISNVA 231


>gi|339009165|ref|ZP_08641737.1| hypothetical protein BRLA_c29840 [Brevibacillus laterosporus LMG
           15441]
 gi|421873566|ref|ZP_16305178.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
 gi|338773643|gb|EGP33174.1| hypothetical protein BRLA_c29840 [Brevibacillus laterosporus LMG
           15441]
 gi|372457353|emb|CCF14727.1| conserved hypothetical protein [Brevibacillus laterosporus GI-9]
          Length = 240

 Score =  101 bits (252), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 65/224 (29%), Positives = 119/224 (53%), Gaps = 8/224 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + + I+G   Q  Q  L+P E++    GSM +MS +++M+  +       + +   G+++
Sbjct: 1   MEYHIMGSTMQALQFNLQPGERLFTESGSMIWMSENIKMDTSFKGGMLKSLGRAFSGESL 60

Query: 83  TSV---VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
           T      L  PG     +  A  +  +I+P+ +   G E++ Q  AFL    ++ VS   
Sbjct: 61  TFTFFEALHTPG----MIAFAPAAPGKIIPVTIHP-GNEIIAQKHAFLVGTENIDVSIHF 115

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            +++     G EGF+ Q+L+G+G+ F+   G V + NL+ GEVI VD + + A  SSV++
Sbjct: 116 QKKLGTGFFGGEGFIMQRLSGRGMVFLEIDGEVAEMNLQHGEVIKVDTAHVAAYESSVDM 175

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
            I+    I+  +FGG+ L    + GPG +++Q++    L+ +IA
Sbjct: 176 SIERVKGIKNIIFGGEGLFLTTLRGPGKIWLQTMTIAGLAGKIA 219


>gi|403389619|ref|ZP_10931676.1| hypothetical protein CJC12_17781 [Clostridium sp. JC122]
          Length = 228

 Score =  101 bits (251), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 72/221 (32%), Positives = 113/221 (51%), Gaps = 8/221 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT- 83
           ++I GG   V    L   E VV+  G+M +MS     ENI +  N  G      G+ ++ 
Sbjct: 3   YEIKGGNMPVAICTLDQGETVVSESGAMGWMS-----ENIVMDTNMKGGLFGGIGRALSG 57

Query: 84  -SVVLRNPGPSDGFVGIAAP-SLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
            S+ L     ++G   IA P S+   L +     G  L+CQ  AFL +   V+    L +
Sbjct: 58  DSLFLNTFTCTNGQGTIAFPCSVPGKLLVKNLYQGQTLICQKGAFLAAETSVEFKIHLKK 117

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           R+ + + G EGF+ Q++TG G+ F+   G V + +L  GE I VD   + A  S+VN  I
Sbjct: 118 RISSGLFGGEGFILQRMTGPGIVFLEFDGHVEEMDLSPGETIKVDTGHVAAFESTVNFDI 177

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +    ++  +FGG+ L    +TGPG V++Q++P   L+ RI
Sbjct: 178 EMVKGVKNMLFGGEGLFLTTLTGPGRVYLQTMPISNLASRI 218


>gi|153813867|ref|ZP_01966535.1| hypothetical protein RUMTOR_00073 [Ruminococcus torques ATCC 27756]
 gi|317502002|ref|ZP_07960186.1| hypothetical protein HMPREF1026_02130 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|331088180|ref|ZP_08337100.1| hypothetical protein HMPREF1025_00683 [Lachnospiraceae bacterium
           3_1_46FAA]
 gi|336440015|ref|ZP_08619614.1| hypothetical protein HMPREF0990_02008 [Lachnospiraceae bacterium
           1_1_57FAA]
 gi|145848263|gb|EDK25181.1| TIGR00266 family protein [Ruminococcus torques ATCC 27756]
 gi|316896682|gb|EFV18769.1| hypothetical protein HMPREF1026_02130 [Lachnospiraceae bacterium
           8_1_57FAA]
 gi|330408836|gb|EGG88299.1| hypothetical protein HMPREF1025_00683 [Lachnospiraceae bacterium
           3_1_46FAA]
 gi|336014364|gb|EGN44213.1| hypothetical protein HMPREF0990_02008 [Lachnospiraceae bacterium
           1_1_57FAA]
          Length = 225

 Score =  100 bits (249), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 120/223 (53%), Gaps = 4/223 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I GGE  V    +   E ++   GSM +MS +M+ME          + +   G+ I
Sbjct: 1   MKYEIKGGELPVVVCFMDAGETMITEKGSMSWMSPNMKMETTSNGGVGKALGRMFAGEAI 60

Query: 83  -TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
             +      GP  G +  A+    +I P ++A  G E++ Q   FL +   V++S    +
Sbjct: 61  FQNRYTAEGGP--GMIAFASSFPGQIRPWEIAP-GKEIVVQKAGFLAAQESVELSIFFQK 117

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++ + + G EGF+ QK++G GLAF+   GSV++  L+ G+ I VD   + A+ ++ N++I
Sbjct: 118 KLGSGLFGGEGFIMQKISGHGLAFLEFDGSVIEYVLQPGQQIVVDTGYLAAMEATCNMEI 177

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           +    I+  +FGG+ L   V+TGPG V++Q++P   ++  I +
Sbjct: 178 RTVPGIKNMMFGGEGLFNTVITGPGRVWLQTMPISNVAGAIMK 220


>gi|424513771|emb|CCO66393.1| Protein containing DUF124 [Bathycoccus prasinos]
          Length = 558

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 68/239 (28%), Positives = 117/239 (48%), Gaps = 23/239 (9%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGS-MEMENIYIPENEVGMWQWLFGKTITSV 85
           + G +AQ+ Q+ LKP E V A PG+MC+MS + + + ++        + + L G+     
Sbjct: 313 LFGTDAQIVQMKLKPGESVNAEPGAMCYMSSNVLSITSLGPGGLSKAIARVLAGEPFFIN 372

Query: 86  VLRNPGPSDGFVGIAAPSLA-RILPIDLAMFGGELLCQPDAFLCSVNDVKVS-------- 136
             RN G SDG++ + +     +I  ++L     +LLC  D++LCS+ D+ VS        
Sbjct: 373 TFRNLGSSDGYIALGSLRQGDKIAVVNLKEREADLLCARDSYLCSMGDISVSAAVSFSQE 432

Query: 137 --------NTLDQRVRNVI-----GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
                     L QR R  +      G   F  + L G G A I   G+V+++ L+ GE +
Sbjct: 433 ARGGRTLSQVLKQRFRTGVRLILSKGNNIFNGEWLVGDGTACITGKGTVMRRTLKEGEEM 492

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            VD   ++A+  ++   ++       AVFGG+ L    + GPG  ++QSLP  +  +++
Sbjct: 493 VVDARAVLALERTIEYDLELVSSPVAAVFGGEGLFHVRLKGPGGFYLQSLPIDKAKKKL 551


>gi|357053067|ref|ZP_09114170.1| hypothetical protein HMPREF9467_01142 [Clostridium clostridioforme
           2_1_49FAA]
 gi|355386046|gb|EHG33087.1| hypothetical protein HMPREF9467_01142 [Clostridium clostridioforme
           2_1_49FAA]
          Length = 225

 Score =  100 bits (249), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 114/224 (50%), Gaps = 3/224 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + +++LG      ++ L+  E +  + G M +MS    +++        G+ +   G+++
Sbjct: 1   MDYRMLGDILPAVEVRLQAGEAMYTQSGGMAWMSDGFTLDSNVKGGLMKGLGRMFSGESL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                    P D  +  A+    +IL ID A     L+CQ  AFLC+   V+++  L ++
Sbjct: 61  FMATYTASRP-DSVIAFASTVPGKILAIDTAKT--SLICQKGAFLCAQTTVEINTVLTKK 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
                 G EGF+ Q++ G G+AF+   G VV++ L  GEVI VD   + A   SV+ +I+
Sbjct: 118 FTAGFFGGEGFILQQIQGSGMAFLEVDGDVVERVLAPGEVIKVDTGNVFAFEPSVSYEIE 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
               ++  +FGG+ L    +TGPG V++Q++     + RIA+ +
Sbjct: 178 TMKGVKNILFGGEGLFLTKLTGPGKVYMQTMNIAEFTGRIAQGL 221


>gi|78044630|ref|YP_360947.1| hypothetical protein CHY_2133 [Carboxydothermus hydrogenoformans
           Z-2901]
 gi|77996745|gb|ABB15644.1| conserved hypothetical protein TIGR00266 [Carboxydothermus
           hydrogenoformans Z-2901]
          Length = 228

 Score =  100 bits (248), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 67/221 (30%), Positives = 113/221 (51%), Gaps = 8/221 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI-- 82
           F+I+G    V  + L   EKV  + G M +MS   EM+         G+ + L G+T+  
Sbjct: 3   FKIVGETLPVVILTLGKGEKVFTQAGGMAWMSEGFEMQTNMEGGLLKGIARKLAGETLFM 62

Query: 83  -TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
            T    R+ G     +  A+    +I+P++L   G  ++CQ  AFLC  + VK+     +
Sbjct: 63  TTYTCTRDSGE----IAFASSFPGKIIPLELKA-GQSIICQKKAFLCGESSVKLEIFFRK 117

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++ + + G EGF+ +K+TG GL F+   G+VV+  L   +++ VD   +     +V  +I
Sbjct: 118 KIGSGLFGGEGFILEKVTGPGLCFVEIDGAVVEYELGSNDILKVDQGHVAMFEPTVTFEI 177

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +        +FGG+ L  A + GPG V++QSLP   L+ R+
Sbjct: 178 EMVKGFTNVLFGGEGLFLATLKGPGKVWLQSLPIENLAARL 218


>gi|291562839|emb|CBL41655.1| conserved hypothetical protein TIGR00266 [butyrate-producing
           bacterium SS3/4]
          Length = 228

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 66/223 (29%), Positives = 111/223 (49%), Gaps = 3/223 (1%)

Query: 23  IPFQILGGEAQVCQIMLK-PQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + ++I+G      +I    P E +  + G M +MS  + M          G+ +   G++
Sbjct: 1   MKYRIIGTTIPAVEIKFDAPGETMYTQSGGMSWMSEGISMSTNTRGGFMKGLGRMFAGES 60

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           +     R   P    +  A+    ++LPID A  GG ++CQ  AFLC+ + V +  T  +
Sbjct: 61  MFMATYRADAPG-ATIAFASTVPGQVLPIDAAENGG-MICQKGAFLCAQDSVNLDVTFTK 118

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           +  + + G EGF+ + ++G G+AF+   G VV+K L  GEVI VD   IVA   +V  ++
Sbjct: 119 KFTSGLFGGEGFILESISGSGMAFLEIDGDVVEKQLAPGEVIKVDTGNIVAFEKTVKYEV 178

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           +    +    FGG+ L    +TGPG V +Q+      + RIA+
Sbjct: 179 ETIKGLGNIFFGGEGLFLTKLTGPGRVILQTQNIADFAGRIAQ 221


>gi|219852667|ref|YP_002467099.1| hypothetical protein Mpal_2078 [Methanosphaerula palustris E1-9c]
 gi|219546926|gb|ACL17376.1| protein of unknown function DUF124 [Methanosphaerula palustris
           E1-9c]
          Length = 229

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 69/222 (31%), Positives = 110/222 (49%), Gaps = 7/222 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + + I G   Q+  + L   E++ A  G+M  MSG+M+M +        G+ + + G+  
Sbjct: 1   MQYTITGDNLQMVTLSLVEGEEIAAEAGAMVNMSGNMKMTSAVTGGLFKGLKRMVTGE-- 58

Query: 83  TSVVLRNPGPSDGF--VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
            S  L +  PS G   V  A     +I+P+     G E + Q DA+LC+   V +     
Sbjct: 59  -SFFLSHFTPSGGGGQVAFAGNVPGKIVPVSFN--GNEFIAQRDAYLCAEMGVDLDIAFT 115

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +R+R  I G EGF+ QKL+G G AF+   G VV+  L   +V+ V    +V   S+V   
Sbjct: 116 KRIRAGIFGGEGFVMQKLSGHGTAFLHCCGDVVEMTLAPDQVMKVQTGLVVGFDSTVRYD 175

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           I   G +   +FGG+ L    + GPG V +QS+   +L+Q +
Sbjct: 176 IARAGGVTTILFGGEGLFLTTLQGPGKVVLQSMDVAKLAQAL 217


>gi|291559678|emb|CBL38478.1| conserved hypothetical protein TIGR00266 [butyrate-producing
           bacterium SSC/2]
          Length = 227

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/220 (30%), Positives = 114/220 (51%), Gaps = 12/220 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++ILG    V    L+  EK+V   G M +MS +M+ME         G    +FG+  + 
Sbjct: 3   YEILGETLPVVICYLESGEKMVTEGGGMSWMSPNMKMET-----TSNGGIGKIFGRVFSG 57

Query: 85  -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
             + +N   + G  G+ A   A   P  +  F    G E++ Q  A+L   N V++S   
Sbjct: 58  EKMFQNVYTAQGGKGMIA--FASCFPGSIKAFEIAPGQEMILQKSAYLAGENGVELSAFF 115

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
           ++++   + G EGF+ QK +G G+ F    G VV+  LE GE I VD   + A+T+S  +
Sbjct: 116 NKKIGAGLFGGEGFIMQKASGHGIMFAEFDGHVVEYELEAGEQIVVDTGHLAAMTASCTM 175

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           +IK    ++  +FGG+ +   ++TGPG V++Q++P   ++
Sbjct: 176 EIKSVPGVKNMLFGGEGVFNTILTGPGKVWLQTMPISNVA 215


>gi|219109733|ref|XP_002176620.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411155|gb|EEC51083.1| predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 218

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/219 (30%), Positives = 108/219 (49%), Gaps = 2/219 (0%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV 85
           +I G E+ +  + L P E + A  G+M +M+  + ME   +        + L G+ +   
Sbjct: 1   KIEGEESHIATVQLSPGEVLRAESGAMLYMTHGVVMET-SLQGASSAFQRMLTGQNVFLT 59

Query: 86  VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVR 144
             +  G S G V +     ++IL + L  +    L+CQ  A+L S   V +     + + 
Sbjct: 60  DFKYEGQSSGTVCLGTDFPSKILRLRLDDYSNSTLICQRGAYLASNPTVDIQMEATKSLT 119

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G +GF+ QKLTG G   +  GG+VV K L  GE + V    IVA  SS++  ++  
Sbjct: 120 AGFFGGQGFILQKLTGHGDVLVKGGGTVVNKTLSEGETLRVTSGSIVAFESSISYDVQMM 179

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             I+ A+FGG+ L  A + GPG +++Q +P  R+   IA
Sbjct: 180 PGIKNAMFGGEGLFVATLQGPGQIWLQGMPPDRMIAEIA 218


>gi|67925051|ref|ZP_00518431.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 8501]
 gi|67853098|gb|EAM48477.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 8501]
          Length = 226

 Score = 99.0 bits (245), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 66/228 (28%), Positives = 114/228 (50%), Gaps = 2/228 (0%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G   Q+ +  L   E +    G M +M G +EM+         G+ + L G+++
Sbjct: 1   MQYKIHGTVMQILETHLMMGESIYTESGGMAWMKGDIEMKTNTRGGLMAGIGRALAGESL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             +   N     G V         IL   L M G  L+CQ DAF+C+   V++     ++
Sbjct: 61  F-MTSYNCNSGQGMVAFTMEVPGTILDFQL-MPGQSLICQRDAFMCAEETVEIELHFRKK 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   + G EGF+ QK++G G AF+   G +   +L+  E + VD   I     +VN +I+
Sbjct: 119 LGAGLFGGEGFILQKISGPGKAFLEIPGEMQSYDLKPQETLLVDPGHIALFEPTVNFEIE 178

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
               I+  +F G+ L  A +TGPG V++QS+PF  L+ ++A+ + + N
Sbjct: 179 SVKGIKNVLFSGEGLFLAKLTGPGKVWLQSMPFANLAAKLAKYIPTKN 226


>gi|416407190|ref|ZP_11688251.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 0003]
 gi|357260911|gb|EHJ10240.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 0003]
          Length = 226

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 66/226 (29%), Positives = 113/226 (50%), Gaps = 2/226 (0%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I G   Q+ +  L   E +    G M +M G +EM+         G+ + L G+++  
Sbjct: 3   YKIHGTVMQILETHLMMGESIYTESGGMAWMKGDIEMKTNTRGGLMAGIGRALAGESLF- 61

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
           +   N     G V         IL   L M G  L+CQ DAF+C+   V++     +++ 
Sbjct: 62  MTSYNCNSGQGMVAFTMEVPGTILDFQL-MPGQSLICQRDAFICAEETVEIELHFRKKLG 120

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
             + G EGF+ QK++G G AF+   G +   +L+  E + VD   I     +VN +I+  
Sbjct: 121 AGLFGGEGFILQKISGPGKAFLEIPGEMQSYDLKPQETLLVDPGHIALFEPTVNFEIESV 180

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
             I+  +F G+ L  A +TGPG V++QS+PF  L+ ++A+ + + N
Sbjct: 181 KGIKNVLFSGEGLFLAKLTGPGKVWLQSMPFANLAAKLAKYIPTKN 226


>gi|160939087|ref|ZP_02086438.1| hypothetical protein CLOBOL_03981 [Clostridium bolteae ATCC
           BAA-613]
 gi|158438050|gb|EDP15810.1| hypothetical protein CLOBOL_03981 [Clostridium bolteae ATCC
           BAA-613]
          Length = 225

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/224 (27%), Positives = 113/224 (50%), Gaps = 3/224 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + +++LG      +I L+  E +  + G M +MS    +++        G+ +   G+++
Sbjct: 1   MDYRMLGDILPAVEIRLQAGEAMYTQSGGMAWMSDGFTLDSNVKGGLMKGLGRMFSGESL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                    P D  +  A+    +IL ID A     L+CQ  AFLC+   V+++  L ++
Sbjct: 61  FMATYTASRP-DSTIAFASTVPGKILAIDTAKT--SLICQKGAFLCAQTTVEINTVLTKK 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
                 G EGF+ Q++ G G+AF+   G VV++ L  GEVI VD   + A    ++ +I+
Sbjct: 118 FTAGFFGGEGFILQQIQGSGMAFLEVDGDVVERVLAPGEVIKVDTGNVFAFEPGISYEIE 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
               ++  +FGG+ L    +TGPG V++Q++     + RIA+ +
Sbjct: 178 TVKGVKNILFGGEGLFLTKLTGPGKVYMQTMNIAEFTGRIAQGL 221


>gi|428225765|ref|YP_007109862.1| hypothetical protein GEI7407_2335 [Geitlerinema sp. PCC 7407]
 gi|427985666|gb|AFY66810.1| protein of unknown function DUF124 [Geitlerinema sp. PCC 7407]
          Length = 229

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 72/224 (32%), Positives = 115/224 (51%), Gaps = 6/224 (2%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI-- 82
           ++I G   Q   I L+  E V    G M +M G +EM+         G+ + L G+++  
Sbjct: 3   YEIHGTVMQTVDIYLQAGESVYTESGGMAWMRGDLEMKTNTKGGLMAGLGRALAGESLFM 62

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
           T+ V R+   S G V  A  +  +I+ IDLA  G  L+CQ DAF+C+ + VK+     ++
Sbjct: 63  TTYVCRS---SAGMVVFAPEAPGKIMAIDLAA-GQSLICQKDAFMCAESSVKLEMHFRRK 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +     G EGF+ QK+TG G  F+   G V + +L  GE + VD   +     +V+  I 
Sbjct: 119 LGAGFFGGEGFILQKVTGPGTVFLEIPGEVREYDLRPGESMQVDPGHLAVFEPTVSYDIT 178

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
               +R     G+ L  A +TGPG +++QSLP   L+ +IA+ +
Sbjct: 179 MVKGLRNIFLSGEGLFLAKMTGPGKIWLQSLPLTNLAAKIAQHL 222


>gi|322434398|ref|YP_004216610.1| hypothetical protein AciX9_0761 [Granulicella tundricola MP5ACTX9]
 gi|321162125|gb|ADW67830.1| protein of unknown function DUF124 [Granulicella tundricola
           MP5ACTX9]
          Length = 254

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 66/224 (29%), Positives = 112/224 (50%), Gaps = 4/224 (1%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMEN---IYIPENEVGMWQWLFGKTITSVVLRNP 90
           V +++L+P E V++  G + +MS S++M     +       G  + + G     ++    
Sbjct: 12  VLEVLLQPGEAVISEAGELSWMSQSIQMTTHTQMGGGGGFFGAIKRVAGGGTLFMIEYTA 71

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
               G +  A      ILPI++   G E++     FLC+   V++     Q +   I G 
Sbjct: 72  AGIPGEIAFATKIPGHILPIEVGQ-GNEIMVHRHGFLCATQQVQLGVGFQQSLGAGIFGG 130

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
           +GFL QK++G G A++   G V+ K+L  GE++ V    + A T  ++ QI     I+  
Sbjct: 131 DGFLLQKISGYGTAWLELSGEVIVKDLRPGEMLRVHPGHVGAFTGGMSFQITRVPGIKNM 190

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           +FGGD L  A +TGPG V++Q+LP  RL+ +I     S   +E+
Sbjct: 191 IFGGDGLFLAALTGPGRVWLQTLPLARLAHQILEFAPSERRQES 234


>gi|295115123|emb|CBL35970.1| conserved hypothetical protein TIGR00266 [butyrate-producing
           bacterium SM4/1]
          Length = 251

 Score = 97.8 bits (242), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 66/229 (28%), Positives = 120/229 (52%), Gaps = 17/229 (7%)

Query: 13  YVYQSPQDAVIPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE 70
           Y+Y   ++  + ++I G    A VC+  L+  E ++   G+M +MS +M+ME        
Sbjct: 16  YIYMRREEQDMKYEIHGETLPAVVCE--LEKGEAMITEKGAMAWMSPNMQMET------N 67

Query: 71  VGMWQWLFGKTIT-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDA 125
           VG     FG+  +   + RN   + G  G+ A   A   P  +  F    G E + Q  A
Sbjct: 68  VGGIGKAFGRLFSGESMFRNTYTAKGGPGMIA--FASSFPGSIRAFEVGPGQEYIFQKSA 125

Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
           FL S   V +S    +++ + + G EGF+ QKL+G+G+ F    G +++ +LE G+ + +
Sbjct: 126 FLASEAGVDLSVYFHKKLGSGLFGGEGFIMQKLSGRGIVFAEFDGHIMEYDLEAGQQMVI 185

Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           D   + A+T+   ++I+    ++  +FGG+ L   +VTGPG +++Q++P
Sbjct: 186 DTGYLAAMTAGCQMEIQSVPGVKNMLFGGEGLFNTIVTGPGHIWLQTMP 234


>gi|317499708|ref|ZP_07957966.1| hypothetical protein HMPREF0996_02950 [Lachnospiraceae bacterium
           5_1_63FAA]
 gi|316893014|gb|EFV15238.1| hypothetical protein HMPREF0996_02950 [Lachnospiraceae bacterium
           5_1_63FAA]
          Length = 227

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 66/220 (30%), Positives = 114/220 (51%), Gaps = 12/220 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++ILG    V    L+  EK+V   G M +MS +M+ME         G    +FG+  + 
Sbjct: 3   YEILGETLPVVICYLESGEKMVTEGGGMSWMSPNMKMET-----TSNGGIGKIFGRAFSG 57

Query: 85  -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
             + +N   + G  G+ A   A   P  +  F    G E++ Q  A+L   N V++S   
Sbjct: 58  EKMFQNVYTAQGGEGMIA--FASCFPGSIKAFEIAPGQEMILQKSAYLAGENGVELSAFF 115

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
           ++++   + G EGF+ QK +G G+ F    G VV+  LE G+ I VD   + A+T+S  +
Sbjct: 116 NKKIGAGLFGGEGFIMQKASGHGIMFAEFDGHVVEYELEAGKQIVVDTGHLAAMTASCTM 175

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           +IK    ++  +FGG+ +   ++TGPG V++Q++P   ++
Sbjct: 176 EIKSVPGVKNMLFGGEGVFNTILTGPGKVWLQTMPISNVA 215


>gi|283797697|ref|ZP_06346850.1| HTH DNA-binding protein [Clostridium sp. M62/1]
 gi|291074591|gb|EFE11955.1| TIGR00266 family protein [Clostridium sp. M62/1]
 gi|295092075|emb|CBK78182.1| conserved hypothetical protein TIGR00266 [Clostridium cf.
           saccharolyticum K10]
          Length = 251

 Score = 97.8 bits (242), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 66/229 (28%), Positives = 122/229 (53%), Gaps = 17/229 (7%)

Query: 13  YVYQSPQDAVIPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE 70
           Y+Y   ++  + ++I G    A VC+  L+  E ++   G+M +MS +M+ME      N 
Sbjct: 16  YIYMRREEQDMKYEIHGETLPAVVCE--LEKGEAMITEKGAMAWMSPNMQMET-----NA 68

Query: 71  VGMWQWLFGKTIT-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDA 125
            G+ +  FG+  +   + RN   + G  G+ A   A   P  +  F    G E + Q  A
Sbjct: 69  GGIGK-AFGRLFSGESMFRNTYTAKGGPGMIA--FASSFPGSIRAFEVGPGQEYIFQKSA 125

Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
           FL S   V +S    +++ + + G EGF+ QKL+G+G+ F    G +++ +LE G+ + +
Sbjct: 126 FLASEAGVDLSVYFHKKLGSGLFGGEGFIMQKLSGRGIVFAEFDGHIMEYDLEAGQQMVI 185

Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           D   + A+T+   ++I+    ++  +FGG+ L   +VTGPG +++Q++P
Sbjct: 186 DTGYLAAMTAGCQMEIQSVPGVKNMLFGGEGLFNTIVTGPGHIWLQTMP 234


>gi|383761740|ref|YP_005440722.1| hypothetical protein CLDAP_07850 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
 gi|381382008|dbj|BAL98824.1| hypothetical protein CLDAP_07850 [Caldilinea aerophila DSM 14535 =
           NBRC 104270]
          Length = 251

 Score = 97.4 bits (241), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 76/236 (32%), Positives = 121/236 (51%), Gaps = 11/236 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG--MWQWLFGK 80
           + ++I+G   Q   + L P E + +  G M +MSG+++M+        +G  + + L G+
Sbjct: 23  MEYKIIGSTLQTVILELDPGETIYSESGGMAWMSGNIQMKTT-TRGGGIGDVLKRALSGE 81

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
           +   V         G V  AA    +IL + L   G EL+CQ  AFLC+   VK+     
Sbjct: 82  SFFLVEF-TAESQKGVVAFAADFPGKILAVSLEQ-GQELICQRQAFLCAEKTVKLDIHFQ 139

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +R      G EGF+ QKLTG G+AF+   G +V+  L  GEV+ VD   +     SV+  
Sbjct: 140 RRPGAGFFGGEGFILQKLTGPGVAFVSLDGEIVEYTLSPGEVLKVDTGHVGMYEPSVSFD 199

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS--PNMREN 254
           I+     R  +FGG+ L  A + GPG V++Q++P    +Q +A+A+ S  P +R +
Sbjct: 200 IEMVPGFRNILFGGEGLFLATLRGPGRVWLQTMP----TQNLAKAILSYFPRLRAD 251


>gi|326433927|gb|EGD79497.1| hypothetical protein PTSG_12989 [Salpingoeca sp. ATCC 50818]
          Length = 282

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 69/230 (30%), Positives = 122/230 (53%), Gaps = 4/230 (1%)

Query: 17  SPQ-DAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN--EVGM 73
           +PQ +  + +QI+G E+Q+ +++L   + V   PG+M  MS  +   +I++     +   
Sbjct: 42  TPQFNGQVQWQIVGDESQILEVVLNQGDAVTTEPGTMVHMSSGLS-PDIHLSGGCGQACT 100

Query: 74  WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
                G++   V   N   +   +G+     A+I+PI+L  FGG +  +  AF+C++N  
Sbjct: 101 RSCCAGESFFRVKYENSTNTPQHIGLTPNFPAKIVPINLPEFGGRITIKSRAFMCALNCQ 160

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
              +     +     G +GF+   L+G G+AF+ A G+V+ +NL  GE +  D S +VA 
Sbjct: 161 PTFSYRFAGLGAGCCGGQGFVLNDLSGDGMAFLNASGTVLMRNLGAGEELICDQSSVVAF 220

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
            S+VN  I+  G       GG+ L  +V+TGPGIV +QS+P  +++  +A
Sbjct: 221 QSTVNFTIRRAGGCLMCCCGGEGLFNSVLTGPGIVVLQSMPIEKIAAALA 270


>gi|309789698|ref|ZP_07684278.1| protein of unknown function DUF124 [Oscillochloris trichoides DG-6]
 gi|308228184|gb|EFO81832.1| protein of unknown function DUF124 [Oscillochloris trichoides DG6]
          Length = 314

 Score = 97.1 bits (240), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 126/236 (53%), Gaps = 5/236 (2%)

Query: 11  QPYVYQSPQDAV-IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPEN 69
            P +  S Q A  + ++I+G   Q   I L P + + +  G M +MS ++ M+   +   
Sbjct: 58  DPAIVGSGQGASGLKYKIIGTTMQAVVIELPPNQTIFSERGGMSWMSANINMQT-NMEGG 116

Query: 70  EVGMWQWLF-GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
             G ++ +F G++I  V   + G   G +G +A    +I+P++L   G +++CQ D+F+C
Sbjct: 117 LGGAFKRMFSGESIFMVSFTSQG-GMGIIGFSAEFPGKIVPLNLGA-GQQMICQKDSFMC 174

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
           +   V +     +++     G EGF+ QK+TG GLAF+   G +V+  LE  +++ VD  
Sbjct: 175 AERSVTLDLHFRRKLGAGFFGGEGFIMQKITGPGLAFVELDGEIVEYTLEANQMLKVDTG 234

Query: 189 CIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
            +     +V   ++     +  +FGG+ L  A + GPG V++Q++P   L+++IA+
Sbjct: 235 HVAMYEPTVQFDVEMVRGFKNILFGGEGLFLATLRGPGRVWLQTMPAMNLAKKIAQ 290


>gi|159900866|ref|YP_001547113.1| hypothetical protein Haur_4353 [Herpetosiphon aurantiacus DSM 785]
 gi|159893905|gb|ABX06985.1| protein of unknown function DUF124 [Herpetosiphon aurantiacus DSM
           785]
          Length = 340

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/224 (28%), Positives = 114/224 (50%), Gaps = 10/224 (4%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MWQWLFGK 80
           ++I+G   Q   I L P + + +  G M +MSG+++M       N  G    M++  F  
Sbjct: 100 YKIIGTVLQAAIIELDPGQMIFSESGGMSWMSGNVQMNT-----NSGGGIGKMFKRAFSG 154

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
               +V        G +G A+    +I+P+ LA  G E++ Q D+F+C+   V++     
Sbjct: 155 ESLFIVDFTVAGGTGLIGFASEMPGKIVPLHLAP-GQEIIMQKDSFMCAEKSVQLDIHFR 213

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++   + G EGF+ Q+ TG G+ F    G VV+  L+ G+V+ VD   +     SV   
Sbjct: 214 RKLGAGLFGGEGFIMQRATGPGVVFAELDGEVVEYYLQPGQVMKVDTGHVAMYEPSVQFD 273

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           I+     +  +FGG+ L  A + GPG V++Q++P   +++++A 
Sbjct: 274 IEMVRGFKNILFGGEGLFLATLRGPGKVWLQTMPVMNVAKKLAE 317


>gi|390956112|ref|YP_006419869.1| hypothetical protein Terro_0183 [Terriglobus roseus DSM 18391]
 gi|390411030|gb|AFL86534.1| TIGR00266 family protein [Terriglobus roseus DSM 18391]
          Length = 254

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 65/232 (28%), Positives = 108/232 (46%), Gaps = 4/232 (1%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE---VGMWQWLFGKTI 82
           +I G      + +L+P E V++  G + +MS S+ M            +G  + + G   
Sbjct: 4   RIHGTTMPAIEFLLQPGETVISEAGELSWMSASIGMHTHTQAGGGGGFLGAIKRMAGGGT 63

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             +         G V  A      I+P++L   G E +     FLC+  +V +     Q 
Sbjct: 64  LFMTEYTAQGYPGEVAFATKVPGHIVPVELHGNGDEYMVHRHGFLCATPNVGIGLGFQQS 123

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   I G EGFL QK+ G G A++   G V+ ++L+ GE + V    + A  + V+ QI 
Sbjct: 124 LGAGIFGGEGFLLQKIYGHGTAWLELSGEVIVRDLQPGETLRVHPGHVGAFHAGVSFQIT 183

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
               IR   FGGD +  A +TGPG +++Q+LP   L+ ++   +  P+ RE 
Sbjct: 184 RVPGIRNMFFGGDGIFLAALTGPGRIWLQTLPISNLAHKLQEYLPGPD-RET 234


>gi|219118253|ref|XP_002179905.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408958|gb|EEC48891.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 305

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 68/239 (28%), Positives = 125/239 (52%), Gaps = 14/239 (5%)

Query: 9   PFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPE 68
           P +P VY    D +  ++ILG +AQ+ Q+ L+ + +V+   G+M +MS  M+ME      
Sbjct: 48  PPKPNVYA---DGLPVYEILGTDAQIVQLPLRARRQVMCFSGAMAYMSDGMKMEAKLAGL 104

Query: 69  NEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMF-GGELLCQPDAFL 127
            +    +   G ++  +   N    DG++ +       I+PI++A    G+++   D+FL
Sbjct: 105 GKT-FGRLAGGGSLFQLTYTNETDQDGYIAMTPDYPGVIVPINMASCPAGKIVTMRDSFL 163

Query: 128 CS---VNDVKVSNTLDQRVRNVIG-----GIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
           C+   V+++           + IG     GI+ F+ Q ++    AF++A G+V+QKNL+ 
Sbjct: 164 CATVGVDNLTTEVGAGFNPASSIGSFLCSGID-FIVQTVSNGEWAFLMAMGTVIQKNLQP 222

Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
           GE + VD   I+   SSV + I++ G +     GG+ L    +TGPG +++QS+   ++
Sbjct: 223 GEKLLVDTESILCFESSVTIDIQWVGNVAAICCGGEGLFNTTMTGPGKIWMQSMSIDKM 281


>gi|332798969|ref|YP_004460468.1| hypothetical protein TepRe1_1013 [Tepidanaerobacter acetatoxydans
           Re1]
 gi|438002061|ref|YP_007271804.1| DUF124 domain-containing protein [Tepidanaerobacter acetatoxydans
           Re1]
 gi|332696704|gb|AEE91161.1| protein of unknown function DUF124 [Tepidanaerobacter acetatoxydans
           Re1]
 gi|432178855|emb|CCP25828.1| DUF124 domain-containing protein [Tepidanaerobacter acetatoxydans
           Re1]
          Length = 227

 Score = 96.3 bits (238), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 66/222 (29%), Positives = 108/222 (48%), Gaps = 6/222 (2%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + +++ G    V  + L   E +    G M +MS   EM+         G+ + L G+++
Sbjct: 1   MKYELRGDTLPVVLLTLDEGESIFTESGGMSWMSDGFEMKTNMEGGLFGGIARKLAGESL 60

Query: 83  --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
             T+  L     S G +  ++    +I+PI L   G  L+CQ  +FLC+   VK+   L 
Sbjct: 61  FMTTYTLNK---SSGTIAFSSSFPGKIVPIYLDT-GESLICQKTSFLCAERSVKLEIHLK 116

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++   I G EGF+ Q++TG G  F+   G  V+  L+ GE + VD   +     SVN  
Sbjct: 117 KKLGAGIFGGEGFILQRVTGPGWVFLEIDGEAVEYQLDPGEKMKVDTGHVAIFEPSVNFD 176

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           I+        VFGG+ L    + GPG +++QS+P   L+ RI
Sbjct: 177 IETVKGFTNVVFGGEGLFLTTLKGPGKIWLQSMPIGNLASRI 218


>gi|442804464|ref|YP_007372613.1| hypothetical protein Cst_c10050 [Clostridium stercorarium subsp.
           stercorarium DSM 8532]
 gi|442740314|gb|AGC68003.1| hypothetical protein Cst_c10050 [Clostridium stercorarium subsp.
           stercorarium DSM 8532]
          Length = 270

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 65/226 (28%), Positives = 109/226 (48%), Gaps = 3/226 (1%)

Query: 17  SPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
           S     + ++I GG      I L   E +  + G M +M   + ME         G  + 
Sbjct: 40  SANPGKLNYKIFGGSFPAVSIRLNTNESIYTQAGGMTWMDAGITMETNMQGGLMKGFGRM 99

Query: 77  LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           L G+++  V   +  P+   V IA+     IL ID+      ++ Q  AFLC+   V +S
Sbjct: 100 LSGESLFMVTYTSHAPNQEIV-IASSFPGCILDIDVGRC--PIIAQKSAFLCAQTTVNLS 156

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
             + + ++  + G EGF+ Q+L+G G+AFI   GS+V++ L  GE I VD   + A   +
Sbjct: 157 IYVARGLKAGLFGGEGFVLQRLSGSGIAFIEIDGSLVERTLAPGETIKVDTGNVAAFEET 216

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           V  Q +     +  +FGG+ L    +TGPG V++Q++     ++ I
Sbjct: 217 VQYQAEMVKGFKNILFGGEGLFLTTLTGPGKVWLQTMSMPEFAKSI 262


>gi|393719536|ref|ZP_10339463.1| hypothetical protein SechA1_07285 [Sphingomonas echinoides ATCC
           14820]
          Length = 259

 Score = 94.4 bits (233), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 67/229 (29%), Positives = 110/229 (48%), Gaps = 3/229 (1%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I G   Q   I L P E++V++  +M +MS  + M+         G+ + L G +I  
Sbjct: 3   YEISGTVMQTVSIDLSPGERIVSQTHAMAWMSDGITMDTHTGGGLFAGLKRALSGGSI-- 60

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
            +        G V  A      I+  +L   G  L+C+ + FLC+   V +     QR  
Sbjct: 61  FITEYGAERPGHVAFAPRFPGTIIARELRA-GESLICRKETFLCAEYSVSLEIAFQQRFG 119

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
             +   EGF+ Q++TG G  F+   G V++K L  GE + V    I     +V+  I+  
Sbjct: 120 AGVFAGEGFILQRVTGPGTVFLDLSGEVIEKTLSPGERLRVHAGHIGMQEPTVSTDIQMV 179

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
              R  +FGG+ L  A +TGPG +++QS+P   L++ IAR + S   R+
Sbjct: 180 RGFRNILFGGEGLFLATLTGPGKIWLQSMPILNLAEEIARHLPSNEARD 228


>gi|52550378|gb|AAU84227.1| HTH DNA-binding protein [uncultured archaeon GZfos3D4]
          Length = 230

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 62/200 (31%), Positives = 99/200 (49%), Gaps = 4/200 (2%)

Query: 43  EKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAP 102
           E V    G+M +MS ++EM          G+ + + G++       + G  DGFV  A  
Sbjct: 10  EMVYGEAGAMIYMSENVEMNAKAKGGFLKGIKRKVSGESFFLTEFTSNG--DGFVAFAGN 67

Query: 103 SLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
           +  RI  I+LA  G E + Q DAFLC+   V +     +++   + G EGF+ ++L G G
Sbjct: 68  APGRIKAIELA--GNEFIAQKDAFLCAQEGVDLDIAFTKKLGAGLFGGEGFIMERLGGNG 125

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
            AFI   G   + +L  GEV+ VD   +V   SSV+  I   G ++  +F G+ +    +
Sbjct: 126 TAFIHGCGDFKEMDLAPGEVVKVDTGSVVGFDSSVSYDIMKVGNVKSMLFSGEGIFLTTL 185

Query: 223 TGPGIVFIQSLPFHRLSQRI 242
            GPG V +QS+    L+  +
Sbjct: 186 KGPGHVILQSMTIENLAAAL 205


>gi|347531876|ref|YP_004838639.1| hypothetical protein RHOM_07970 [Roseburia hominis A2-183]
 gi|345502024|gb|AEN96707.1| hypothetical protein RHOM_07970 [Roseburia hominis A2-183]
          Length = 226

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 110/208 (52%), Gaps = 11/208 (5%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT--SVVLRNPGP--SD 94
           L+  E ++   G+M +MS +M+ME         G    +FG+ ++  S+ L        +
Sbjct: 17  LEQGEVMITESGAMSWMSPNMKMETT------GGGIGKMFGRALSGESIFLNRYTAMNGN 70

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +  A+     I   D+   G +L+ Q  AFL +V+ V +S    QR+ + + G EGF+
Sbjct: 71  GMIAFASSFPGSIRAFDIVP-GQDLIVQKSAFLAAVSSVDLSVFFQQRIGSGLFGGEGFI 129

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
            QKL+G G AF+   G + +  L  G+ I VD   + A+T S  +++K    ++  +FGG
Sbjct: 130 MQKLSGSGTAFLEFDGYIKEYELASGQQIVVDTGYLAAMTGSCKIEVKTVPGVKNMLFGG 189

Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           + +   V+TGPG V++QS+P  +++  +
Sbjct: 190 EGIFNTVITGPGRVWLQSMPIAQMANTL 217


>gi|317473273|ref|ZP_07932569.1| hypothetical protein HMPREF1011_02919 [Anaerostipes sp. 3_2_56FAA]
 gi|316899267|gb|EFV21285.1| hypothetical protein HMPREF1011_02919 [Anaerostipes sp. 3_2_56FAA]
          Length = 226

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 64/231 (27%), Positives = 112/231 (48%), Gaps = 12/231 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMS----GSMEMENIYIPENEVGMWQWLF 78
           +  +I+G      ++ L   E +  + G+M +M     G  +ME  ++     G+ +   
Sbjct: 1   MKHEIIGQTVPAVEVELDKGEAMFTQSGAMAWMDPSIKGESKMEGGFLK----GIGRKFA 56

Query: 79  GKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
           G+++  V   +    DG  +  A+     ILP+     GG ++CQ  AFLC+   V +  
Sbjct: 57  GESLFMVTYSSD--KDGAKIAFASTVPGTILPLQFQGTGG-MICQKKAFLCAQRSVSLET 113

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++     G EGF+ Q+L+G G+AF+   G  V K L  GE I VD   +VA    +
Sbjct: 114 IFTKKLSAGAFGGEGFILQRLSGDGMAFLEVDGDAVTKELAPGETILVDTGNVVAFEDGI 173

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           + +I+    ++   FGG+ L    +TGPG + +Q+  F+  + RIA  + S
Sbjct: 174 SYEIERIKGVKNIFFGGEGLFLTKLTGPGKIILQTQNFNDFAGRIASLIPS 224


>gi|167746172|ref|ZP_02418299.1| hypothetical protein ANACAC_00868 [Anaerostipes caccae DSM 14662]
 gi|167654687|gb|EDR98816.1| TIGR00266 family protein [Anaerostipes caccae DSM 14662]
          Length = 226

 Score = 94.0 bits (232), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 64/231 (27%), Positives = 112/231 (48%), Gaps = 12/231 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMS----GSMEMENIYIPENEVGMWQWLF 78
           +  +I+G      ++ L   E +  + G+M +M     G  +ME  ++     G+ +   
Sbjct: 1   MKHEIIGQTVPAVEVELDKGEAMFTQSGAMAWMDPSIKGESKMEGGFLK----GIGRKFA 56

Query: 79  GKTITSVVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
           G+++  V   +    DG  +  A+     ILP+     GG ++CQ  AFLC+   V +  
Sbjct: 57  GESLFMVTYSSE--KDGAKIAFASTVPGTILPLQFQGTGG-MICQKKAFLCAQRSVSLET 113

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++     G EGF+ Q+L+G G+AF+   G  V K L  GE I VD   +VA    +
Sbjct: 114 IFTKKLSAGALGGEGFILQRLSGDGMAFLEVDGDAVTKELAPGETILVDTGNVVAFEDGI 173

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           + +I+    ++   FGG+ L    +TGPG + +Q+  F+  + RIA  + S
Sbjct: 174 SYEIERIKGVKNIFFGGEGLFLTKLTGPGKIILQTQNFNDFAGRIASLIPS 224


>gi|323449198|gb|EGB05088.1| hypothetical protein AURANDRAFT_14032 [Aureococcus anophagefferens]
          Length = 201

 Score = 93.6 bits (231), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 102/208 (49%), Gaps = 7/208 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++++G + Q+ +I ++P +KVVA PG+M +M   ++       ++   M +   G     
Sbjct: 1   YEVVGHDMQLLRIQMEPGDKVVAEPGAMVYMHSDVQAGC----DSSDCMGRCCSGSPCIM 56

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                P  + G++G+   + A+++P++L   G   L +  A+  S+ DV V    D    
Sbjct: 57  GTFETP-DTGGYLGLTPVNPAKVIPLELG--GRSFLGKDGAYFASIGDVAVGYDCDCNPA 113

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G +G +RQK++G G+AF+ A G +  K L  GE   VD + +VA   +V   IK  
Sbjct: 114 TCCCGGQGLVRQKISGNGMAFLGAMGVLTTKTLAEGETFVVDSNSLVAWEETVKFSIKRT 173

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           G        G+ +    + GPG +F QS
Sbjct: 174 GGCLTCCCSGEGMFNTTLEGPGTIFTQS 201


>gi|413933084|gb|AFW67635.1| hypothetical protein ZEAMMB73_642216 [Zea mays]
          Length = 90

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/68 (64%), Positives = 56/68 (82%), Gaps = 1/68 (1%)

Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRIA-RAVTSPNMRENPKFFVQIAIFFFLAYVVIV 273
           DN +T  +TGPG V IQSLPFHRLSQ IA R+V  P++R+NPKFF+Q+ +FFF AYV+IV
Sbjct: 6   DNQLTTSLTGPGFVLIQSLPFHRLSQWIASRSVVGPSLRDNPKFFIQVVMFFFSAYVMIV 65

Query: 274 SSLILTDV 281
           SS+IL D+
Sbjct: 66  SSIILADI 73


>gi|331092539|ref|ZP_08341361.1| hypothetical protein HMPREF9477_02004 [Lachnospiraceae bacterium
           2_1_46FAA]
 gi|330400760|gb|EGG80363.1| hypothetical protein HMPREF9477_02004 [Lachnospiraceae bacterium
           2_1_46FAA]
          Length = 226

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 67/217 (30%), Positives = 109/217 (50%), Gaps = 12/217 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + +QI G    V    L+  EK++   GSM +MS +M ME         G     FG+  
Sbjct: 1   MQYQIKGETLPVVICHLEAGEKMITEKGSMSWMSPNMLMET-----GTNGGLGKAFGRMF 55

Query: 83  TS-VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSN 137
           +   + +N   S G  G  A   A   P  +  F    G E++ Q  AFL +   V++S 
Sbjct: 56  SGESMFQNTYTSQGGNGTIA--FASSFPGSIKAFEISTGNEMIFQKSAFLAAEAGVQLSV 113

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++ + + G EGF+ Q+++GQG  F    G V++  L+ G+ I VD   + A+T S 
Sbjct: 114 HFQKKLGSGLFGGEGFILQRVSGQGTMFAEFDGHVIEYELQPGQQIVVDTGHLAAMTPSC 173

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
            + IK    ++  VFGG+ L   ++TGPG V++Q++P
Sbjct: 174 QMDIKTIKGVKNIVFGGEGLFNTIITGPGRVWLQTMP 210


>gi|219117839|ref|XP_002179707.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408760|gb|EEC48693.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 259

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 69/221 (31%), Positives = 109/221 (49%), Gaps = 9/221 (4%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL 77
           P+     F + G E+QV  + +   E      G+M ++S  M     Y    E    +  
Sbjct: 26  PKPDAAKFAVTGHESQVVTLQIGVGEACQGEAGTMMYLSPGMRQSVTY----EGCCQRCC 81

Query: 78  FGKTITSVVLRNPGPSDGFVGIA-APSL--ARILPIDLAM--FGGELLCQPDAFLCSVND 132
            G++   V   N G +     +A  P+   A+++P+DL+    GG+L+ Q  A++ S  D
Sbjct: 82  SGESCFVVNFTNSGSTGNHEFVALTPNFPTAKVVPVDLSSPDVGGKLVAQQGAYMASYGD 141

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           V+V  +LD           G +RQKL G G  F+   G++VQK L  GE I VD +CI+A
Sbjct: 142 VRVGISLDTNCMRCCCAGLGLVRQKLEGSGTVFLGGTGTIVQKVLADGETILVDTNCIMA 201

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
             S+  + +K  G I   V GG+ +    +TGPG+V +QS+
Sbjct: 202 FASTCKLDLKRAGGILGMVGGGEGIFNTTLTGPGLVVVQSM 242


>gi|163845873|ref|YP_001633917.1| hypothetical protein Caur_0275 [Chloroflexus aurantiacus J-10-fl]
 gi|222523587|ref|YP_002568057.1| hypothetical protein Chy400_0293 [Chloroflexus sp. Y-400-fl]
 gi|163667162|gb|ABY33528.1| protein of unknown function DUF124 [Chloroflexus aurantiacus
           J-10-fl]
 gi|222447466|gb|ACM51732.1| protein of unknown function DUF124 [Chloroflexus sp. Y-400-fl]
          Length = 315

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 116/220 (52%), Gaps = 2/220 (0%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I+G   Q   + +   + + +  G M +MS +++M+   +     G ++ +F      
Sbjct: 75  YKIIGTTMQAVVLEVPAGQTIFSERGGMSWMSANVQMQT-NMEGGLGGAFKRMFSGESIF 133

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
           +V   P    G +G +A    +I+P++LA  G  ++CQ D+F+C+   V +     +++ 
Sbjct: 134 MVNFTPQGGPGIIGFSAELPGKIVPLNLAP-GQTMICQKDSFMCAERSVSLDIHFRRKLG 192

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G EGF+ QKLTG GLAF+   G VV+  LE  +++ VD   +     ++N  ++  
Sbjct: 193 AGFFGGEGFIMQKLTGPGLAFVELDGEVVEYTLEANQMLKVDTGHVAMYEPTINFDVEMV 252

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
              +  +FGG+ L    + GPG V++Q++P   L++++A+
Sbjct: 253 RGFKNILFGGEGLFLTTLRGPGRVWLQTMPAMNLAKKLAQ 292


>gi|379721089|ref|YP_005313220.1| hypothetical protein PM3016_3216, partial [Paenibacillus
           mucilaginosus 3016]
 gi|378569761|gb|AFC30071.1| hypothetical protein PM3016_3216 [Paenibacillus mucilaginosus 3016]
          Length = 115

 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/99 (44%), Positives = 65/99 (65%), Gaps = 1/99 (1%)

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
           QK+ G GLAF+ AGG++ ++ L  GE+I VD  C+VA+T  V+  I++   ++ A+FGG+
Sbjct: 2   QKIEGDGLAFVHAGGAICERELRPGEMIRVDTGCLVAMTQGVDYDIEFVKGVKTALFGGE 61

Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
            L  A + GPG V+IQSLPF RL+ R+  A   P  R+ 
Sbjct: 62  GLFFATLRGPGRVWIQSLPFSRLADRVL-AAAGPGGRKE 99


>gi|65322230|ref|ZP_00395189.1| COG2013: Uncharacterized conserved protein [Bacillus anthracis str.
           A2012]
          Length = 113

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 44/94 (46%), Positives = 63/94 (67%), Gaps = 4/94 (4%)

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
           QKL G GLAF+ AGG+V ++ L+ GE + +D  C+VA+T  VN  +++ G ++ A+FGG+
Sbjct: 2   QKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDVNYDVEFVGKVKTALFGGE 61

Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
            L  A + GPG V+IQSL   RLS R+    TSP
Sbjct: 62  GLFFATLEGPGTVWIQSLTLSRLSARL----TSP 91


>gi|148658157|ref|YP_001278362.1| hypothetical protein RoseRS_4068 [Roseiflexus sp. RS-1]
 gi|148570267|gb|ABQ92412.1| protein of unknown function DUF124 [Roseiflexus sp. RS-1]
          Length = 253

 Score = 91.3 bits (225), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 69/227 (30%), Positives = 112/227 (49%), Gaps = 13/227 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------QW 76
           + +QI+G   Q   + L P E V +  G+M +MSG+++M          GM+      + 
Sbjct: 26  MEYQIIGTTMQAVILELDPGETVYSESGAMSWMSGNIQMATNTRGGGLGGMFKRAISGES 85

Query: 77  LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           LF    TSV         G V  A+    +I+P+ LA  G  ++ Q  AFLC+   V + 
Sbjct: 86  LFLNEFTSV------GGKGIVAFASDFPGKIVPVALAE-GQMMIVQKQAFLCAEKTVGLD 138

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
               +R+     G EGF+ QKLTG G+AF+   G +V+  L+ G+V+ VD   +     +
Sbjct: 139 IHFRKRLGAGFFGGEGFIMQKLTGPGIAFVCLDGEIVEYTLDAGQVLKVDTGHVAMYEPT 198

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           V   I+     +  + GG+ L    + GPG V++Q++P   L++ IA
Sbjct: 199 VEFDIEMVKGFKNILLGGEGLFLTTLRGPGRVWLQTMPTANLAKAIA 245


>gi|346308928|ref|ZP_08851032.1| hypothetical protein HMPREF9457_02741 [Dorea formicigenerans
           4_6_53AFAA]
 gi|345901477|gb|EGX71276.1| hypothetical protein HMPREF9457_02741 [Dorea formicigenerans
           4_6_53AFAA]
          Length = 226

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 112/217 (51%), Gaps = 6/217 (2%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI G    V    L+  E+++   G M +MS +M+ME          + +   G+ +  
Sbjct: 3   YQIQGDTLPVVICQLEAGERMITEGGGMSWMSPNMKMETTTNGGIGKAVGRMFSGEKMFQ 62

Query: 85  VVLRNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
            +    G  +G +  A+  P   R   I     G E++ Q  AFL S   V++S    ++
Sbjct: 63  NIYTAQG-GNGMIAFASCFPGSIRAFNIRP---GQEMIFQKSAFLASEAGVQLSVHFHKK 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           V + + G EGF+ QK++GQG+AF    G VV+  L+ G+ I +D   + A+T+S  + I+
Sbjct: 119 VASGLFGGEGFVLQKVSGQGVAFAEFDGHVVEYELQPGQQIVIDTGHLAAMTASCQMDIQ 178

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
               ++  +FGG+ L   V+TGPG V++Q++P   ++
Sbjct: 179 TVPGVKNMLFGGEGLFNTVITGPGRVWLQTMPISNVA 215


>gi|166030603|ref|ZP_02233432.1| hypothetical protein DORFOR_00269 [Dorea formicigenerans ATCC
           27755]
 gi|166029605|gb|EDR48362.1| TIGR00266 family protein [Dorea formicigenerans ATCC 27755]
          Length = 229

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 112/217 (51%), Gaps = 6/217 (2%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI G    V    L+  E+++   G M +MS +M+ME          + +   G+ +  
Sbjct: 6   YQIQGDTLPVVICQLEAGERMITEGGGMSWMSPNMKMETTTNGGIGKAVGRMFSGEKMFQ 65

Query: 85  VVLRNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
            +    G  +G +  A+  P   R   I     G E++ Q  AFL S   V++S    ++
Sbjct: 66  NIYTAQG-GNGMIAFASCFPGSIRAFNIRP---GQEMIFQKSAFLASEAGVQLSVHFHKK 121

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           V + + G EGF+ QK++GQG+AF    G VV+  L+ G+ I +D   + A+T+S  + I+
Sbjct: 122 VASGLFGGEGFVLQKVSGQGVAFAEFDGHVVEYELQPGQQIVIDTGHLAAMTASCQMDIQ 181

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
               ++  +FGG+ L   V+TGPG V++Q++P   ++
Sbjct: 182 TVPGVKNMLFGGEGLFNTVITGPGRVWLQTMPISNVA 218


>gi|228967985|ref|ZP_04128994.1| Protein containing DUF124 [Bacillus thuringiensis serovar sotto
           str. T04001]
 gi|228791709|gb|EEM39302.1| Protein containing DUF124 [Bacillus thuringiensis serovar sotto
           str. T04001]
          Length = 113

 Score = 90.5 bits (223), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 63/94 (67%), Gaps = 4/94 (4%)

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
           QKL G GLAF+ AGG+V ++ L+ GE + +D  C+VA+T  +N  I++ G ++ A+FGG+
Sbjct: 2   QKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDINYDIEFVGKVKTALFGGE 61

Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
            L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 62  GLFFATLEGPGTVWIQSLTLSRLAARL----TSP 91


>gi|153852988|ref|ZP_01994425.1| hypothetical protein DORLON_00410 [Dorea longicatena DSM 13814]
 gi|149754630|gb|EDM64561.1| TIGR00266 family protein [Dorea longicatena DSM 13814]
          Length = 227

 Score = 90.5 bits (223), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 66/229 (28%), Positives = 115/229 (50%), Gaps = 12/229 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I G    V    L+  EK++   G M +MS +M+ME        +G     FG+  + 
Sbjct: 3   YEIKGDTLPVVICYLEGGEKMITEGGGMSWMSPNMKMETT--TNGGIGK---AFGRMFSG 57

Query: 85  -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
             + +N   + G  G+ A   A   P  +  F    G E++ Q  AFL S   V++S   
Sbjct: 58  EHIFQNIYTAQGGKGMIA--FASCFPGSIKAFEIGPGREMVFQKSAFLASEAGVELSVFF 115

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
           +++  + + G EGF+ QK++GQG  F    G VV+  L  G+ I VD   + A+ +S  +
Sbjct: 116 NKKFSSGLFGGEGFIMQKVSGQGTVFAEFDGHVVEYELNAGQEIVVDTGHLAAMDASCQI 175

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +IK    ++  +FGG+ +   VV GPG V++Q++P   ++  +   +T+
Sbjct: 176 EIKSVPGVKNMLFGGEGIFNTVVHGPGRVWLQTMPISNVAGALGPYITT 224


>gi|423603430|ref|ZP_17579323.1| hypothetical protein IIK_00011 [Bacillus cereus VD102]
 gi|401247409|gb|EJR53745.1| hypothetical protein IIK_00011 [Bacillus cereus VD102]
          Length = 113

 Score = 90.1 bits (222), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 43/94 (45%), Positives = 63/94 (67%), Gaps = 4/94 (4%)

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
           QKL G GLAF+ AGG+V ++ L+ GE + +D  C+VA+T  VN  +++ G ++ A+FGG+
Sbjct: 2   QKLEGDGLAFMHAGGTVYKRELKPGEKLRIDTGCLVAMTKDVNYDVEFVGKVKTALFGGE 61

Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
            L  A + GPG V+IQSL   RL+ R+    TSP
Sbjct: 62  GLFFATLEGPGTVWIQSLTLSRLAARL----TSP 91


>gi|210617031|ref|ZP_03291366.1| hypothetical protein CLONEX_03588 [Clostridium nexile DSM 1787]
 gi|210149554|gb|EEA80563.1| hypothetical protein CLONEX_03588 [Clostridium nexile DSM 1787]
          Length = 242

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 109/215 (50%), Gaps = 12/215 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I G    V    L+  EK++   GSM +MS +MEM+         G    +FG+  + 
Sbjct: 19  YEIKGETLPVVICYLESGEKMITERGSMSWMSPNMEMQT-----TSNGGIGKVFGRAFSG 73

Query: 85  VVL-RNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
             L +N   + G  G+ A   A   P  +  F    G E++ Q  AFL     V++S   
Sbjct: 74  EALFQNTYTAQGGNGMIA--FASSFPGSIKAFQVSPGNEMIFQKSAFLAGEAGVELSIHF 131

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            ++V + + G EGF+ Q+++G G  F    G VV+  L+ G+ I +D   + A+T S  +
Sbjct: 132 HKKVASGLFGGEGFILQRVSGIGTVFAEFDGHVVEYELQPGQQIVIDSGHLAAMTPSCQM 191

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
            I+    ++  VFGG+ L   V+TGPG V++Q++P
Sbjct: 192 DIQTIRGLKNVVFGGEGLFNTVITGPGRVWLQTMP 226


>gi|309776314|ref|ZP_07671302.1| HTH DNA-binding protein [Erysipelotrichaceae bacterium 3_1_53]
 gi|308915910|gb|EFP61662.1| HTH DNA-binding protein [Erysipelotrichaceae bacterium 3_1_53]
          Length = 225

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 66/222 (29%), Positives = 110/222 (49%), Gaps = 6/222 (2%)

Query: 23  IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           + ++I+GG+  A VC+  L   EKV    G MC+M     M++        G+ + + G+
Sbjct: 1   MKYEIIGGQLPAVVCK--LDRGEKVFTESGGMCWMEEGFSMDSNTRGGLLKGLGRAMSGE 58

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
           +I  +        D  +   +    +ILP+ L   G  L+ Q  AFL + + V +     
Sbjct: 59  SIF-LTTYTSSQDDAEIAFGSSFPGKILPVALEP-GKTLIVQKHAFLAAEDGVNLQAHFR 116

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++     G EGF+ QKLTG G+AF+   G +++K L  GE + VD   I     SV   
Sbjct: 117 KKLGTGFFGGEGFVLQKLTGNGMAFLEIDGDIIEKTLAPGETLQVDQGYIAGFEESVQFD 176

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           I     ++   F G+ L  A ++GPG +++Q++PF  L+ RI
Sbjct: 177 ITTVKGLKNKFFSGEGLFLATLSGPGKIWLQTMPFSVLADRI 218


>gi|226325602|ref|ZP_03801120.1| hypothetical protein COPCOM_03415 [Coprococcus comes ATCC 27758]
 gi|225205726|gb|EEG88080.1| TIGR00266 family protein [Coprococcus comes ATCC 27758]
          Length = 227

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/223 (27%), Positives = 110/223 (49%), Gaps = 14/223 (6%)

Query: 24  PFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT 83
            ++ILGG   V    L   E ++   GSM +MS +M+ME I       G  + +FG+ ++
Sbjct: 3   KYEILGGNLPVVVCELSAGESMITESGSMSWMSPNMKMETI-----SGGGMKKMFGRLMS 57

Query: 84  -SVVLRNPGPSDGFVGIAA-----PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
                +N   ++G  G+ A     P   + L I     G  ++ Q  AFL S   V++S 
Sbjct: 58  GDSAFQNRYTAEGADGMIAFASSFPGAIKALEIS---NGHSMIVQKSAFLASEEGVELSM 114

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++   I G EGF+ Q+L+G G AF+   G  V+ +L  G+ I +    + A+  + 
Sbjct: 115 HFQKKLGKGIFGGEGFIMQRLSGNGTAFVEIDGHAVEYDLSAGQEILISTGYLAAMEETC 174

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
            + +     ++  + GG+ +   VV GPG V +Q++P  ++++
Sbjct: 175 TMDVVAVKGVKNMLIGGEGIFNTVVKGPGKVILQTMPISKVAE 217


>gi|260437558|ref|ZP_05791374.1| HTH DNA-binding protein [Butyrivibrio crossotus DSM 2876]
 gi|292810190|gb|EFF69395.1| HTH DNA-binding protein [Butyrivibrio crossotus DSM 2876]
          Length = 223

 Score = 89.7 bits (221), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 67/219 (30%), Positives = 111/219 (50%), Gaps = 12/219 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           + I GG      + L+  E ++   G+M +MS +M ME         G +  + G+  + 
Sbjct: 3   YTIEGGTLPTLILNLEAGETIMTEGGAMSWMSPNMLMET-----TTNGGFGKMMGRMFSG 57

Query: 85  VVL-RNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
             L  N   + G  G+ A  +A  LP  +  F    G EL+CQ  +FL     +++S   
Sbjct: 58  EKLFMNRYTAQGGPGMIA--MASSLPGSIKPFEIVPGRELICQKSSFLSCTPGIELSVAF 115

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            +++     G EGF+ QKL+GQGLAFI   G   +  L  G+ I +D   + A+ ++  +
Sbjct: 116 QKKLGAGFFGGEGFIMQKLSGQGLAFIEIDGYCKEYELAPGQQIVLDTGYLAAMEATCKM 175

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
            IK  G ++ A+FGG+     +VTGPG V++QS+P  ++
Sbjct: 176 DIKSTGGVKNALFGGEGFFNTIVTGPGKVYVQSMPMSKM 214


>gi|293376638|ref|ZP_06622866.1| conserved hypothetical protein TIGR00266 [Turicibacter sanguinis
           PC909]
 gi|325839400|ref|ZP_08166839.1| TIGR00266 family protein [Turicibacter sp. HGF1]
 gi|292644864|gb|EFF62946.1| conserved hypothetical protein TIGR00266 [Turicibacter sanguinis
           PC909]
 gi|325490520|gb|EGC92836.1| TIGR00266 family protein [Turicibacter sp. HGF1]
          Length = 227

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 62/216 (28%), Positives = 103/216 (47%), Gaps = 2/216 (0%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           I G +     I+L   E V  + G M +M   + M          G+ +   G+++    
Sbjct: 5   IFGNDLPGVSIVLDKGESVYTQSGGMAWMDNGITMNTNIKGGLMKGIGRMFSGESLFMAT 64

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
             +  P+   V  A+    +I+ +DL   G + +CQ  +FL     V +S    +RV   
Sbjct: 65  YTSSLPNQEIV-FASTFPGKIIELDLVP-GQDWICQKSSFLVGEPGVNLSVEWTRRVSAG 122

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
           + G EGF+ Q+L+G G  F+   GSVV++ L+ GEVI VD   +V     V  +I+    
Sbjct: 123 LFGGEGFILQRLSGHGKVFLEVAGSVVKRELKPGEVIKVDTGNVVGFDRQVKYEIETVKG 182

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            +   FGG+ L    +TGPG V++Q+L    ++ R+
Sbjct: 183 FKNIFFGGEGLFLTKLTGPGTVYLQTLTMQNVASRL 218


>gi|154483410|ref|ZP_02025858.1| hypothetical protein EUBVEN_01113 [Eubacterium ventriosum ATCC
           27560]
 gi|149735662|gb|EDM51548.1| TIGR00266 family protein [Eubacterium ventriosum ATCC 27560]
          Length = 226

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 108/210 (51%), Gaps = 13/210 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL----RN 89
           +C+  L   EK++   G M +MS +M+ME         G    +FG+ ++   L      
Sbjct: 14  ICE--LDAGEKIITEGGGMAWMSPNMKMETTG------GGIGKMFGRALSGDTLFQNIYT 65

Query: 90  PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
              ++G + IA+     I   ++   G E++ Q  AFL S   V++S    +++ + + G
Sbjct: 66  AERTNGMISIASSFPGSIKAFEITP-GNEIILQKSAFLASEAGVELSMHFRKKLGSGLFG 124

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            EGF+ QKL+G G+AF    G V++  L+ G+ I +D   + A++ +  + I+    I+ 
Sbjct: 125 GEGFILQKLSGNGIAFAEFDGHVIEYELQPGQQIVIDTGHLAAMSVTCQMDIQTVPGIKN 184

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
              GG+ L   VVTGPG V++Q++P   ++
Sbjct: 185 MFLGGEGLFNTVVTGPGHVWLQTMPISNMA 214


>gi|145352533|ref|XP_001420596.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580831|gb|ABO98889.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 243

 Score = 88.2 bits (217), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 63/220 (28%), Positives = 110/220 (50%), Gaps = 10/220 (4%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGS-MEMENIYIPENEVGMWQWLFGKTITSV 85
           ++G + Q+ QI LKP E+V A PG+MC+MS + M + ++        + + L G+     
Sbjct: 7   VVGTDTQIAQIKLKPGEEVAASPGAMCYMSSNVMSLTSLGPGGLPRAIMRALAGEPFFIN 66

Query: 86  VLRNPGPSDGFVGIAA--PSLARILPIDLAM-FGGELLCQPDAFLCSVNDVKVS-----N 137
             RN G  +G++ +    PS  +I  I+L      ++LC  D+F+CS  +++++      
Sbjct: 67  TFRNLGSVEGYIAVGGLEPS-DKIAVIELKDDTKHDILCARDSFMCSKGNIRITAAAALR 125

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
           T +    +    +  F  + L G G   I   G+VV++ L+  E + VD   ++A+  SV
Sbjct: 126 TTENDFASFADMLNIFNGEWLIGNGTVCINGKGTVVRRTLKKDEEMVVDARAVMALERSV 185

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
              ++       AVFGG  L    + GPG  ++QSLP  +
Sbjct: 186 GYDLEIQSSPLAAVFGGAGLFFVRLHGPGTFYLQSLPVQK 225


>gi|336433991|ref|ZP_08613797.1| hypothetical protein HMPREF0991_02916 [Lachnospiraceae bacterium
           2_1_58FAA]
 gi|336014730|gb|EGN44564.1| hypothetical protein HMPREF0991_02916 [Lachnospiraceae bacterium
           2_1_58FAA]
          Length = 226

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 58/219 (26%), Positives = 115/219 (52%), Gaps = 2/219 (0%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I+G         L+  E ++   G+M +MS +M+ME            +   G++I  
Sbjct: 3   YEIVGETLPAVICTLEEGETMITEKGAMSWMSPNMKMETSTNGGIGKAFGRMFSGESIFQ 62

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                 G   G +  A+    +I+  ++A  G E++ Q   FL +   V +S    +R+ 
Sbjct: 63  NTYTAKG-GHGMIAFASSFPGKIVAHEVAP-GREIVVQKSGFLAAEAGVDLSIFFQKRIG 120

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
           + I G EGF+ QKL+G+G+AF+   G VV+  L+ G+ I VD   + A+ ++ +++I+  
Sbjct: 121 SGIFGGEGFIMQKLSGRGMAFLEFDGHVVEYELQPGQQIVVDTGYLAAMDATCSMEIQSV 180

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             ++  +FGG+ +   V++GPG +++Q++P   ++  +A
Sbjct: 181 PGVKNMLFGGEGVFNTVISGPGHIWLQTMPISNVAGVLA 219


>gi|226325805|ref|ZP_03801323.1| hypothetical protein COPCOM_03618 [Coprococcus comes ATCC 27758]
 gi|225205929|gb|EEG88283.1| TIGR00266 family protein [Coprococcus comes ATCC 27758]
          Length = 227

 Score = 88.2 bits (217), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 63/225 (28%), Positives = 113/225 (50%), Gaps = 12/225 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G    V    L+  E+++   GSM +MS +M+ME      +  G +   FG+  
Sbjct: 1   MKYEIKGDTLPVVICHLEAGERMITERGSMAWMSPNMKMET-----STNGGFGKAFGRMF 55

Query: 83  T-SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSN 137
           +   + +N   + G  G+ A   A   P  +  F    G E + Q  AFL     V +S 
Sbjct: 56  SGEAMFQNIYTAQGGNGLIA--FASSFPGSIKAFEIGPGQEYIFQKKAFLAGEPGVNISV 113

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              ++V + + G EGF+ QK++G G+ F    G VV+  L+ G+ I +D   + A++ + 
Sbjct: 114 HFHKKVASGLFGGEGFILQKVSGNGIVFAEFDGHVVEYELQPGQQIVIDSGYLAAMSVTC 173

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            + I+    ++  VFGG+ L   V+TGPG V++Q++P   ++  +
Sbjct: 174 QMDIQTVPGLKNIVFGGEGLFNTVITGPGHVWLQTMPISSVADSL 218


>gi|154505420|ref|ZP_02042158.1| hypothetical protein RUMGNA_02942 [Ruminococcus gnavus ATCC 29149]
 gi|153794263|gb|EDN76683.1| TIGR00266 family protein [Ruminococcus gnavus ATCC 29149]
          Length = 228

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 58/219 (26%), Positives = 115/219 (52%), Gaps = 2/219 (0%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I+G         L+  E ++   G+M +MS +M+ME            +   G++I  
Sbjct: 5   YEIVGETLPAVICTLEEGETMITEKGAMSWMSPNMKMETSTNGGIGKAFGRMFSGESIFQ 64

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                 G   G +  A+    +I+  ++A  G E++ Q   FL +   V +S    +R+ 
Sbjct: 65  NTYTAKG-GHGMIAFASSFPGKIVAHEVAP-GREIVVQKSGFLAAEAGVDLSIFFQKRIG 122

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
           + I G EGF+ QKL+G+G+AF+   G VV+  L+ G+ I VD   + A+ ++ +++I+  
Sbjct: 123 SGIFGGEGFIMQKLSGRGMAFLEFDGHVVEYELQPGQQIVVDTGYLAAMDATCSMEIQSV 182

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             ++  +FGG+ +   V++GPG +++Q++P   ++  +A
Sbjct: 183 PGVKNMLFGGEGVFNTVISGPGHIWLQTMPISNVAGVLA 221


>gi|156741188|ref|YP_001431317.1| hypothetical protein Rcas_1202 [Roseiflexus castenholzii DSM 13941]
 gi|156232516|gb|ABU57299.1| protein of unknown function DUF124 [Roseiflexus castenholzii DSM
           13941]
          Length = 253

 Score = 87.8 bits (216), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 68/227 (29%), Positives = 110/227 (48%), Gaps = 13/227 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------QW 76
           + +QI+G   Q   + L P E V +  G+M +MSG+++M          GM+      + 
Sbjct: 26  MEYQIIGTTMQAVILELDPGETVYSESGAMSWMSGNIQMATNTRGGGLGGMFKRAISGES 85

Query: 77  LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           LF    TSV         G V  A+    +I+P  L   G  ++ Q  AFLC+   V + 
Sbjct: 86  LFLNEFTSV------GGKGIVAFASDFPGKIVPAPLRE-GQTMIVQKQAFLCAEKTVGLD 138

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
               +R+     G EGF+ QKLTG G+AF+   G +V+  L+ G+V+ VD   +     +
Sbjct: 139 IHFRKRMGAGFFGGEGFIMQKLTGPGVAFVCLDGEIVEYTLDAGQVLKVDTGHVAMYEPT 198

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           V   I+     +  + GG+ L    + GPG V++Q++P   L++ IA
Sbjct: 199 VEFDIEMVKGFKNILLGGEGLFLTTLRGPGRVWLQTMPTTNLAKAIA 245


>gi|323454064|gb|EGB09935.1| hypothetical protein AURANDRAFT_14036, partial [Aureococcus
           anophagefferens]
          Length = 235

 Score = 87.4 bits (215), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 64/233 (27%), Positives = 108/233 (46%), Gaps = 8/233 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           + +LG    +    L P E   + PG+M FMS  + M   +       + Q + G+ +  
Sbjct: 1   YDVLGDTYGIVAKTLHPGETFHSEPGAMVFMSNDVRMSAKFGRGFRGAVSQAVSGEALAK 60

Query: 85  VVLRNPGPSDGFVGIAAPS-LARILPIDLAMF-GGELLCQPDAFLCSVNDVKVSNTLDQR 142
           V   N G   G+VG+ A    + ++P+ LA   GG +  +  A++    DV  S      
Sbjct: 61  VEYTNAGAGPGYVGMTANQPFSTVIPVQLAALPGGSINVKRGAYMAGTPDVTASMKF-LP 119

Query: 143 VRNVIGGIEGFLR---QKLTG--QGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
            R++     G L    Q ++G   G AF+ A G++V K+L+ GE + VD + IV   + V
Sbjct: 120 ARDLAACCCGGLPPVIQSVSGAPHGTAFLSAAGTIVAKSLQPGEAVVVDSTSIVGFENDV 179

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
              +K  G       GG+     V+TGPG V++QS+   +L  ++    +S +
Sbjct: 180 QFDVKRVGDCSTCCLGGEGCYNTVLTGPGNVYLQSISIDKLMAQLVTVQSSDD 232


>gi|308809826|ref|XP_003082222.1| unnamed protein product [Ostreococcus tauri]
 gi|116060690|emb|CAL57168.1| unnamed protein product, partial [Ostreococcus tauri]
          Length = 475

 Score = 87.4 bits (215), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 68/245 (27%), Positives = 118/245 (48%), Gaps = 30/245 (12%)

Query: 18  PQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGS-MEMENIYIPENEVGMWQW 76
           PQD      I+G + Q+ QI L P E+V A PG+MC+MS + M + ++        + + 
Sbjct: 218 PQDV----NIVGTDTQIAQIKLSPGEEVAASPGAMCYMSSNVMSVTSLGPGGLPRAIMRA 273

Query: 77  LFGKTITSVVLRNPGPSDGFVGIAA--PSLARILPIDLA-MFGGELLCQPDAFLCSVNDV 133
           L G+       +N G ++G++ +    PS  +I  I+L      ++LC  D+++CS  +V
Sbjct: 274 LAGEPFFLNTFKNLGVTEGYIAVGGLEPS-DKIAVIELKDEIKHDILCARDSYMCSKGNV 332

Query: 134 KVS------------NTLDQRVRNVIG---------GIEGFLRQKLTGQGLAFILAGGSV 172
           +++            ++    ++  IG         G   F  + LTG G   I   G+V
Sbjct: 333 RITAAAALRSGVDDFDSFSDMLKYRIGLAARLVLSKGNSIFNGEWLTGSGTVCITGKGTV 392

Query: 173 VQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           V++ L+  E + VD   ++A+  SV  +++       AVFGG  L    + GPG  ++QS
Sbjct: 393 VRRTLKKDEELVVDARSVMALERSVGYELEVQNSPLAAVFGGAGLFFVRLQGPGTFYLQS 452

Query: 233 LPFHR 237
           LP  +
Sbjct: 453 LPVQK 457


>gi|148657327|ref|YP_001277532.1| hypothetical protein RoseRS_3221 [Roseiflexus sp. RS-1]
 gi|148569437|gb|ABQ91582.1| protein of unknown function DUF124 [Roseiflexus sp. RS-1]
          Length = 269

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 64/225 (28%), Positives = 110/225 (48%), Gaps = 9/225 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + +QI+G   Q   + L P E V +  G+M +MSG+++M          G    +F + I
Sbjct: 22  MEYQIVGTTLQAVILELDPGETVYSESGAMSWMSGNIQMAT----NTRGGGLGGMFKRAI 77

Query: 83  TSVVLR----NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           +   L           G V  A+    +I+P+ L + G  ++ Q  AFLC+   V +   
Sbjct: 78  SGESLFLNEFTSAGGKGIVAFASDFPGKIVPVPL-LEGQSMIVQKQAFLCAEKTVGLDIH 136

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
             +R+     G EGF+ QKLTG G+AF+   G +V+  L+ G+V+ VD   +     +V 
Sbjct: 137 FRKRLGAGFFGGEGFIMQKLTGPGVAFVCLDGEIVEYTLDAGQVLKVDTGHVAMYEPTVE 196

Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             I+     +  + GG+ L    + GPG V++Q++P   L++ +A
Sbjct: 197 FDIEMVKGFKNILLGGEGLFLTTLRGPGRVWLQTMPTANLAKAVA 241


>gi|313897924|ref|ZP_07831465.1| TIGR00266 family protein [Clostridium sp. HGF2]
 gi|346317754|ref|ZP_08859229.1| hypothetical protein HMPREF9022_04886 [Erysipelotrichaceae
           bacterium 2_2_44A]
 gi|422329865|ref|ZP_16410889.1| TIGR00266 family protein [Erysipelotrichaceae bacterium 6_1_45]
 gi|312957459|gb|EFR39086.1| TIGR00266 family protein [Clostridium sp. HGF2]
 gi|345898897|gb|EGX68756.1| hypothetical protein HMPREF9022_04886 [Erysipelotrichaceae
           bacterium 2_2_44A]
 gi|371655570|gb|EHO20916.1| TIGR00266 family protein [Erysipelotrichaceae bacterium 6_1_45]
          Length = 225

 Score = 87.0 bits (214), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 68/226 (30%), Positives = 112/226 (49%), Gaps = 14/226 (6%)

Query: 23  IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           + ++I+GG+  A VC+  L   EK+    G MC+M     M++     N  G      G+
Sbjct: 1   MKYEIIGGQLPAVVCK--LDRGEKMFTESGGMCWMEEGFSMDS-----NTRGGLLKGLGR 53

Query: 81  TIT--SVVLRNPGPSDGFVGIAAPSL--ARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
            ++  S+ L     S     IA  S    +ILP+ L   G  L+ Q +AFL + + V + 
Sbjct: 54  AMSGESIFLTTYTSSHDHAEIAFGSSFPGKILPVTLHD-GKTLIVQKNAFLAAEDGVNLQ 112

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
               +++     G EGF+ QKLTG G AF+   G +++K L  GE + VD   I     S
Sbjct: 113 AHFRKKLGTGFFGGEGFVLQKLTGNGTAFLEIDGDIIEKTLAPGETLQVDQGYIAGFEES 172

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           V+  I     ++   F G+ +  A ++GPG +++Q++PF  L+ R+
Sbjct: 173 VHFDITTVKGLKNKFFSGEGMFLATLSGPGKIWLQTMPFSVLADRV 218


>gi|373125276|ref|ZP_09539110.1| TIGR00266 family protein [Erysipelotrichaceae bacterium 21_3]
 gi|371657477|gb|EHO22775.1| TIGR00266 family protein [Erysipelotrichaceae bacterium 21_3]
          Length = 225

 Score = 87.0 bits (214), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 68/226 (30%), Positives = 112/226 (49%), Gaps = 14/226 (6%)

Query: 23  IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           + ++I+GG+  A VC+  L   EK+    G MC+M     M++     N  G      G+
Sbjct: 1   MKYEIIGGQLPAVVCK--LDRGEKMFTESGGMCWMEEGFSMDS-----NTRGGLLKGLGR 53

Query: 81  TIT--SVVLRNPGPSDGFVGIAAPSL--ARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
            ++  S+ L     S     IA  S    +ILP+ L   G  L+ Q +AFL + + V + 
Sbjct: 54  AMSGESIFLTTYTSSHDHAEIAFGSSFPGKILPVTLHD-GKTLIVQKNAFLAAEDGVDLQ 112

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
               +++     G EGF+ QKLTG G AF+   G +++K L  GE + VD   I     S
Sbjct: 113 AHFRKKLGTGFFGGEGFVLQKLTGNGTAFLEIDGDIIEKTLAPGETLQVDQGYIAGFEES 172

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           V+  I     ++   F G+ +  A ++GPG +++Q++PF  L+ R+
Sbjct: 173 VHFDITTVKGLKNKFFSGEGMFLATLSGPGKIWLQTMPFSVLADRV 218


>gi|167770183|ref|ZP_02442236.1| hypothetical protein ANACOL_01526 [Anaerotruncus colihominis DSM
           17241]
 gi|167667505|gb|EDS11635.1| TIGR00266 family protein [Anaerotruncus colihominis DSM 17241]
          Length = 226

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 111/206 (53%), Gaps = 4/206 (1%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           VC+  L+P EK++   GSM +M+ +M+ME          + +   G+ +   +    G S
Sbjct: 14  VCE--LQPGEKMITERGSMSWMTSNMQMETTSNGGVGKALGRMFSGEALFQNIYTAVG-S 70

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
            G +  A+     I  ++++    EL+ Q   FL S + V++S    +++ +   G EGF
Sbjct: 71  SGIIAFASSFPGSIRAVEISP-NNELIVQKSGFLASESGVELSIFFQKKLGSGFFGGEGF 129

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
           + Q+L+G+G+AF+   G  V+ +L+ GE + VD   + A++ S ++ I+    I+  +FG
Sbjct: 130 IMQRLSGKGMAFVEIDGYAVEYDLDPGEKLVVDTGYLAAMSGSCSIDIQTVPGIKNKLFG 189

Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLS 239
           G+     VVTGPG V +Q++P   ++
Sbjct: 190 GEGFFNTVVTGPGKVILQTMPISTVA 215


>gi|224012403|ref|XP_002294854.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220969293|gb|EED87634.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 556

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 69/274 (25%), Positives = 123/274 (44%), Gaps = 38/274 (13%)

Query: 13  YVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMS---------------- 56
           + +  P D     QI G E+Q+ ++ L P + + A  G+M +MS                
Sbjct: 97  HAHLQPIDFTTASQISGEESQILEVQLAPNQMLRAESGAMLYMSYLVRILCDVVFLCVAF 156

Query: 57  ---GSMEME-NIYIPEN----EVGMWQWLFGKTITSVVLRNPGPSD-------GFVGIAA 101
                ++ME ++ +  N    + G+ + + G+ +     +   PS        G V +  
Sbjct: 157 NRTEGVQMETSMGVGSNRTNFQTGLTRMITGQNLMVSDFQYVPPSGSPASKQYGTVALGT 216

Query: 102 PSLARILPIDLAMF-GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTG 160
              A+IL   L+ +    L+CQ  AFL   + + +     +   +   G EGF+ Q L G
Sbjct: 217 DFPAKILKFQLSDYPDDRLICQKGAFLAGSHSINMEMAFTKNFTSGFFGGEGFVLQSLQG 276

Query: 161 QG------LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
                     F+ A G+VV+K L+  E + V    +VA+TS+++  +      +  +FGG
Sbjct: 277 SDSTNRGETVFLKAYGTVVKKELKENETLRVSSGSLVAMTSTIDFDVTTLPGFKNVLFGG 336

Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           + L    +TGPG+V++Q +P  R+   IAR + S
Sbjct: 337 EGLFVTTLTGPGVVWLQGMPIDRMISEIARKIPS 370


>gi|298705724|emb|CBJ49032.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 273

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 76/249 (30%), Positives = 116/249 (46%), Gaps = 18/249 (7%)

Query: 1   MAAPFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSME 60
           M+ PF +    P       DA   F  LG  ++V Q+ L P + ++A   +  +    ++
Sbjct: 3   MSEPFANPDASP-------DASPRFLSLGEGSEVLQVALCPGDAILANEKAFVYRGLGLQ 55

Query: 61  MENIYIPENEVG-MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGEL 119
            E    P   +G    W+  K    V   N G S  ++G+ A    +++P+ LA   G L
Sbjct: 56  QE--LQPTGPMGRALGWIGVKNDRVVKFTNEGNSPTYLGLTASLPGKVIPVSLAK-SGTL 112

Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
           L +P  FLCSV +  +S    +         +    +K+ G G+ +I  GG+ VQK L  
Sbjct: 113 LAKPSVFLCSVVENDISTAKIKLQSEGQSSQQSVEMRKIAGAGMCYIHGGGAAVQKALGR 172

Query: 180 GEVITVDVSCIVAVTSSVN-VQIKYNGPIRRA--VFGGDNLVTAVVTGPGIVFIQSLPFH 236
           GE + V+  C+ AV+ +   V  + NGP+ R    F G       +TGPG VFIQS    
Sbjct: 173 GESMVVEFGCVAAVSDACKIVPEERNGPLIREPPYFQG---FQVKITGPGSVFIQSTHTG 229

Query: 237 RLSQRIARA 245
           RLS R +RA
Sbjct: 230 RLS-RASRA 237


>gi|342215534|ref|ZP_08708181.1| putative TIGR00266 family protein [Peptoniphilus sp. oral taxon 375
           str. F0436]
 gi|341586424|gb|EGS29824.1| putative TIGR00266 family protein [Peptoniphilus sp. oral taxon 375
           str. F0436]
          Length = 225

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 60/221 (27%), Positives = 108/221 (48%), Gaps = 3/221 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + + I G    V +  L+  E ++++ G   +M G ++ E I        + +   G+++
Sbjct: 1   MKYYIEGDNLPVVRFELEEGEGLISQAGGRAWMKGDIKTETISGGGVGKMLGRMFSGESL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                +   PS+  +  A+     I   DLA  G  ++ Q  AFLC+   V +S    ++
Sbjct: 61  FLSSYKAKSPSE--IVFASDFPGSIYAFDLAE-GQSIIAQKKAFLCADQGVDLSIYFRKK 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   + G EGF+ QK+TG G  F+   G   +  L VGE + +D   +  + S+  + I+
Sbjct: 118 LGTGLFGGEGFIMQKITGPGKVFLEIDGHCQEYILNVGEKLVMDTGSLAMMDSTCEMTIE 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
               ++  +FGG+ L   VV GPG V +QS+P  +L++ IA
Sbjct: 178 QVEGMKNILFGGEGLFNTVVEGPGKVLVQSMPIQKLARNIA 218


>gi|358067878|ref|ZP_09154350.1| hypothetical protein HMPREF9333_01231 [Johnsonella ignava ATCC
           51276]
 gi|356693847|gb|EHI55516.1| hypothetical protein HMPREF9333_01231 [Johnsonella ignava ATCC
           51276]
          Length = 229

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 64/214 (29%), Positives = 106/214 (49%), Gaps = 3/214 (1%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI GG      I L P EK+V+  G   +  G +  E          + +   G+++  
Sbjct: 3   YQIEGGNLPAAVIQLDPGEKIVSELGGRTWAKGPILTETKAPGGVGKSLGRMFSGESLFM 62

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                 GP++  +G A+     I+  +LA  G  L+CQ  AFLC+  +V++S    +++ 
Sbjct: 63  SFYTAQGPAE--IGFASSFAGSIVVRELAQ-GESLICQKSAFLCASAEVELSIYFQKKLG 119

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G EGF+ QK+TG G+ F+   G V + NL  GE I  D   +  + +S  + I+  
Sbjct: 120 AGFFGGEGFIMQKVTGPGIVFLELDGYVKEYNLMPGEEIVCDTGLVALMDASCTMDIRVV 179

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
             ++  +FGG+ L+  VV GPG  +IQ++   RL
Sbjct: 180 KGLKNKLFGGEGLIDTVVIGPGRAYIQTMSLPRL 213


>gi|225873709|ref|YP_002755168.1| hypothetical protein ACP_2115 [Acidobacterium capsulatum ATCC
           51196]
 gi|225791728|gb|ACO31818.1| conserved hypothetical protein [Acidobacterium capsulatum ATCC
           51196]
          Length = 252

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/236 (27%), Positives = 110/236 (46%), Gaps = 14/236 (5%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK----- 80
           +ILG      + +L+P E +++  G + +M+  ++M        ++G    +FG      
Sbjct: 4   RILGTTMPAIEFLLEPNEVIISESGELSWMTAPIQMRT----HTQMGGGGGIFGAIKRMA 59

Query: 81  ---TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
              ++     R  G + G V  A      I+P+++   G E       FLC    V +  
Sbjct: 60  GGGSLFMTEYRAIG-APGEVAFATKVPGHIVPVEVGQ-GREFYIHRHGFLCGTEQVTLGV 117

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              Q +   I G +GFL QK+ G G+A++   G VV K+L  GE + V    + A  +SV
Sbjct: 118 GFQQSLGAGIFGGDGFLLQKVGGNGVAWLELSGEVVVKDLRPGETLRVHPGHVGAFENSV 177

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRE 253
           + QI     I+  +FGGD +  A +TGPG V++Q+LP   L+  +   +     R+
Sbjct: 178 SFQITRVPGIKNMIFGGDGIFLAALTGPGRVWLQTLPLANLAHALMEYLPQGENRQ 233


>gi|225571037|ref|ZP_03780054.1| hypothetical protein CLOHYLEM_07135 [Clostridium hylemonae DSM
           15053]
 gi|225160118|gb|EEG72737.1| hypothetical protein CLOHYLEM_07135 [Clostridium hylemonae DSM
           15053]
          Length = 238

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 114/226 (50%), Gaps = 12/226 (5%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           +DA + ++I G    V    L+  EK++   G M +MS +M ME        +G     F
Sbjct: 8   EDANMKYEIQGETLPVVICQLEAGEKMITEGGGMAWMSPNMLMETT--TNGGIGK---AF 62

Query: 79  GKTITSV-VLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDV 133
           G+  +   + +N   + G  G+ A   A   P  +  F    G E++ Q  AFL     V
Sbjct: 63  GRMFSGERMFQNIYTAQGGEGMIA--FASSFPGSVEAFQVGPGQEMILQKSAFLAGEASV 120

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           ++S   +++  + + G EGF+ QK  G G  F    G V++  L+ G+ I VD   + A+
Sbjct: 121 ELSVFFNKKFSSGLFGGEGFIMQKAGGYGTVFAEFDGHVIKYELQPGQQIVVDTGHLAAM 180

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           +++ +++IK    ++  +FGG+ +   ++TGPG V++Q++P   ++
Sbjct: 181 SATCSMEIKTVPGVKNMLFGGEGIFNTIITGPGKVWLQTMPISNVA 226


>gi|291524805|emb|CBK90392.1| conserved hypothetical protein TIGR00266 [Eubacterium rectale DSM
           17629]
          Length = 226

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/209 (31%), Positives = 109/209 (52%), Gaps = 12/209 (5%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT-SVVLRNPGPSDGFV 97
           L+  E ++   G+M +M+ +M+ME         G    +FG+  +   + +N   S G  
Sbjct: 17  LEANETMITEKGAMSWMTPNMKMET-----TSNGGIGKMFGRAFSGESMFQNRYTSMGGP 71

Query: 98  GIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           G+ A   A   P  +  F    G E++ Q  AFL S + V++S    +R+     G EGF
Sbjct: 72  GMIA--FASCFPGCIRPFQIAPGQEIIAQKSAFLASTSGVELSVFFQKRIGAGFFGGEGF 129

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
           + Q+L+G GLAF+   G + +  L  G+ + +D   + A+T S  ++I+    I+  VFG
Sbjct: 130 IMQRLSGNGLAFLEFDGHIKEYELAPGQQLVIDTGYLAAMTGSCQLEIQTVPGIKNVVFG 189

Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           G+ L   VVTGPG V++QS+P  +L+  I
Sbjct: 190 GEGLFNTVVTGPGRVWLQSMPVSQLAGAI 218


>gi|238924070|ref|YP_002937586.1| hypothetical protein EUBREC_1708 [Eubacterium rectale ATCC 33656]
 gi|238875745|gb|ACR75452.1| Hypothetical protein EUBREC_1708 [Eubacterium rectale ATCC 33656]
 gi|291528874|emb|CBK94460.1| conserved hypothetical protein TIGR00266 [Eubacterium rectale
           M104/1]
          Length = 226

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 66/209 (31%), Positives = 110/209 (52%), Gaps = 12/209 (5%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT-SVVLRNPGPSDGFV 97
           L+  E ++   G+M +M+ +M+ME         G    +FG+  +   + +N   S G  
Sbjct: 17  LEANETMITEKGAMSWMTPNMKMET-----TSNGGIGKMFGRAFSGESMFQNRYTSMGGP 71

Query: 98  GIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           G+ A   A   P  +  F    G E++ Q  AFL S + V++S    +R+ +   G EGF
Sbjct: 72  GMIA--FASCFPGCIRPFQIAPGQEIIAQKSAFLASTSGVELSVFFQKRIGSGFFGGEGF 129

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
           + Q+L+G GLAF+   G + +  L  G+ + +D   + A+T S  ++I+    I+  VFG
Sbjct: 130 IMQRLSGNGLAFLEFDGHIKEYELAPGQQLVIDTGYLAAMTGSCQLEIQTVPGIKNVVFG 189

Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           G+ L   VVTGPG V++QS+P  +L+  I
Sbjct: 190 GEGLFNTVVTGPGRVWLQSMPVSQLAGAI 218


>gi|393722397|ref|ZP_10342324.1| hypothetical protein SPAM2_02025 [Sphingomonas sp. PAMC 26605]
          Length = 259

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 63/230 (27%), Positives = 109/230 (47%), Gaps = 7/230 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI-- 82
           + I G   Q   I L   E++V++  +M +MS  + M+         G+ + + G +I  
Sbjct: 3   YDIAGTVMQTVSIDLDRGERIVSQTHAMAWMSDGIRMDTHTGGGLFAGLKRAMSGGSIFI 62

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
           T  +   P      V  A     +I+   L   G  L+C+ + FLC+   V +     QR
Sbjct: 63  TEYIADAPA----HVAFAPRFPGKIIARTLRA-GESLICRKETFLCAEYSVSLDIAFQQR 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   +   EGF+ Q++TG G  ++   G V++K+L  GE + V    I     +V+  ++
Sbjct: 118 LGAGVFAGEGFILQRVTGPGTVWLDLSGEVIEKDLAPGERLRVHAGHIGIQEPTVSTDVQ 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMR 252
                R  +FGG+ L  A +TGPG V++QS+P   L++ +AR +     R
Sbjct: 178 MVRGFRNILFGGEGLFLATLTGPGRVWLQSMPIMNLAEEVARHLPGNEER 227


>gi|156743638|ref|YP_001433767.1| hypothetical protein Rcas_3709 [Roseiflexus castenholzii DSM 13941]
 gi|156234966|gb|ABU59749.1| protein of unknown function DUF124 [Roseiflexus castenholzii DSM
           13941]
          Length = 319

 Score = 85.1 bits (209), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 67/227 (29%), Positives = 110/227 (48%), Gaps = 13/227 (5%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW------QW 76
           + +QI+G   Q   + L P E V +  G+M +MSG+++M          GM+      + 
Sbjct: 71  MEYQIVGTTLQAVILELDPGETVYSESGAMSWMSGNIQMATNTRGGGLGGMFKRAISGES 130

Query: 77  LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           LF    TSV         G V  A+    +I+P  L   G  ++ Q  AFLC+   V + 
Sbjct: 131 LFLNEFTSV------GGKGIVAFASDFPGKIVPAPLRE-GQTMIVQKQAFLCAEKTVGLD 183

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
               +R+     G EGF+ QKLTG G+AF+   G +V+  L+ G+V+ VD   +     +
Sbjct: 184 IHFRKRMGAGFFGGEGFIMQKLTGPGVAFVCLDGEIVEYTLDAGQVLKVDTGHVAMYEPT 243

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           V   I+     +  + GG+ L    + GPG V++Q++P   L++ +A
Sbjct: 244 VEFDIEMVKGFKNILLGGEGLFLTTLRGPGRVWLQTMPTTNLAKAVA 290


>gi|325663733|ref|ZP_08152137.1| hypothetical protein HMPREF0490_02878 [Lachnospiraceae bacterium
           4_1_37FAA]
 gi|325470226|gb|EGC73459.1| hypothetical protein HMPREF0490_02878 [Lachnospiraceae bacterium
           4_1_37FAA]
          Length = 226

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 58/215 (26%), Positives = 106/215 (49%), Gaps = 12/215 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QILG    V    L   E ++   GSM +MS +M M+      +  G     FG+  + 
Sbjct: 3   YQILGDTLPVVICRLDGGESMITEKGSMAWMSPNMRMDT-----STNGGLGKAFGRMFSG 57

Query: 85  -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
             + +N   + G  G    + A   P  +  F    G E++ Q   FL +   +++S   
Sbjct: 58  EAIFQNIYTAQG--GAGEIAFASSFPGSIRAFHIRPGEEIVVQKGGFLAAEAGIQLSVHF 115

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            +++ +   G EGF+ Q+++G+G+ F    G V++  L+ G+ I +D   + A+T+S  +
Sbjct: 116 HKKLASGFFGGEGFILQRISGEGVMFAEFDGHVIEYELQPGQQIVIDTGHLAAMTASCQM 175

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
            I+    ++   FGG+     V+TGPG V++Q++P
Sbjct: 176 DIQTVPGLKNKFFGGEGFFNTVITGPGHVWLQTMP 210


>gi|289450222|ref|YP_003475428.1| hypothetical protein HMPREF0868_1144 [Clostridiales genomosp. BVAB3
           str. UPII9-5]
 gi|289184769|gb|ADC91194.1| TIGR00266 family protein [Clostridiales genomosp. BVAB3 str.
           UPII9-5]
          Length = 224

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 60/228 (26%), Positives = 108/228 (47%), Gaps = 9/228 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           + I GG   V  I L P E++++  G   +M G +  E     E+  G+   +FG+ ++ 
Sbjct: 3   YSIEGGSLPVVIINLNPGERLISEAGGRRWMRGDIRTET----ESHGGIG-GMFGRMLSG 57

Query: 85  VVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
             L        S+  +  A+    +I+  +L   G  ++CQ  AFLC+ + V +     +
Sbjct: 58  ESLFLSSYTAFSESEIAFASSFPGKIIARELRN-GESIVCQKSAFLCATDQVSLRVFFQK 116

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
            + +   G EGF+ QK+TG G+ F+   G + + +L   E I  D   +  +  +  + +
Sbjct: 117 NIGSGYFGGEGFIMQKITGPGMVFLELDGYIKEYDLAPNESIVCDTGAVAMIDETCQIDV 176

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSP 249
           +    ++  VFGG+ L   VVTGPG V +QS      S+ I+  + + 
Sbjct: 177 RVVSGLKNIVFGGEGLFDTVVTGPGKVALQSASITDFSKLISSCIKTK 224


>gi|423075594|ref|ZP_17064311.1| hypothetical protein HMPREF0322_03744 [Desulfitobacterium hafniense
           DP7]
 gi|361853368|gb|EHL05524.1| hypothetical protein HMPREF0322_03744 [Desulfitobacterium hafniense
           DP7]
          Length = 111

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 37/75 (49%), Positives = 54/75 (72%)

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
           AF+ AGG+++QK L+ GE I +D  C+VA+T +VN  I++ G I+ A+FGG+ L    +T
Sbjct: 17  AFMHAGGTIIQKELQPGETIQLDTGCLVAMTKTVNHDIRFAGDIKNALFGGEGLALGTLT 76

Query: 224 GPGIVFIQSLPFHRL 238
           GPG V++QSLPF  L
Sbjct: 77  GPGTVWLQSLPFCSL 91


>gi|167761381|ref|ZP_02433508.1| hypothetical protein CLOSCI_03789 [Clostridium scindens ATCC 35704]
 gi|167661047|gb|EDS05177.1| TIGR00266 family protein [Clostridium scindens ATCC 35704]
          Length = 226

 Score = 84.7 bits (208), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 62/215 (28%), Positives = 110/215 (51%), Gaps = 12/215 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI G    V    L+  EK++   G M +MS +M ME        +G     FG+  + 
Sbjct: 3   YQIQGETLPVVICELEGGEKMITEGGGMAWMSPNMLMETT--TNGGIGK---AFGRMFSG 57

Query: 85  -VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
             + +N   + G  G+ A   A   P  +  F    G E++ Q  AFL     +++S   
Sbjct: 58  EHMFQNIYTAQGGPGMIA--FASSFPGSVRAFQIAPGQEMILQKSAFLAGEMGIELSVFF 115

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
           +++  + + G EGF+ QK++G G+ F    G V++  L  G+ I VD   + A+T++ ++
Sbjct: 116 NKKFSSGLFGGEGFIMQKVSGHGIVFTEFDGHVIEYELHPGQQIVVDTGHLAAMTATCSM 175

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           +IK    ++  +FGG+ +   V+TGPG V++Q++P
Sbjct: 176 EIKGVPGVKNMLFGGEGIFNTVITGPGRVWLQTMP 210


>gi|197302962|ref|ZP_03168012.1| hypothetical protein RUMLAC_01689 [Ruminococcus lactaris ATCC
           29176]
 gi|197297957|gb|EDY32507.1| hypothetical protein RUMLAC_01689 [Ruminococcus lactaris ATCC
           29176]
          Length = 154

 Score = 84.0 bits (206), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 46/133 (34%), Positives = 78/133 (58%)

Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQK 175
           G EL+ Q   FL S   V +S    +++   I G EGF+ Q+L+G G+AF+   G VVQ 
Sbjct: 20  GQELIVQKTGFLASEAGVSLSVYFQKKLGAGIFGGEGFIMQRLSGSGMAFLEFDGHVVQY 79

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
           +L+ G+ I VD   + A+  S +++++    ++  VFGG+ +   VVTGPG V++Q++P 
Sbjct: 80  DLQPGQQIVVDTGYLAAMEGSCSMEVQTVPGLKNMVFGGEGVFNTVVTGPGRVWLQTMPI 139

Query: 236 HRLSQRIARAVTS 248
             ++  I + + S
Sbjct: 140 SNVAGEILKFMPS 152


>gi|310829668|ref|YP_003962025.1| hypothetical protein [Eubacterium limosum KIST612]
 gi|308741402|gb|ADO39062.1| hypothetical protein ELI_4120 [Eubacterium limosum KIST612]
          Length = 228

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 99/220 (45%), Gaps = 2/220 (0%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I+G    V ++ L   E++    G M +M    +ME          + +   G ++
Sbjct: 1   MKYEIIGDTLPVVELELDRGEQIYTESGGMSWMDPCFDMETSTRGGALKAIKRSFSGNSL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
                   G   G +  A+     I  + L   G  ++C   AFL +   V  S    + 
Sbjct: 61  FLTTYTCTG-DKGKIAFASSFPGNIRAVHLEA-GQSIICAKTAFLAAEESVDFSIFFKKS 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           V+  + G  GF+ QKLTG GL F+   GS ++ NLE G+ I VD   I      V+  + 
Sbjct: 119 VKTGVFGGAGFILQKLTGPGLVFLELNGSTIEYNLEEGQTINVDQGHIAVFQEKVHFDVT 178

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
                +  +F G+ L  A +TGPG V +QS+P  +L++ +
Sbjct: 179 QVKGAKNILFSGEGLFFATLTGPGRVVLQSMPVDKLAKAL 218


>gi|169335942|ref|ZP_02863135.1| hypothetical protein ANASTE_02377 [Anaerofustis stercorihominis DSM
           17244]
 gi|169258680|gb|EDS72646.1| TIGR00266 family protein [Anaerofustis stercorihominis DSM 17244]
          Length = 225

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 62/225 (27%), Positives = 111/225 (49%), Gaps = 4/225 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++++G       I L   EK+V + G M +MSG++EM          G+ + L G++I
Sbjct: 1   MDYKVIGDTFPAVVITLNQGEKIVTQAGGMSWMSGNIEMTTSTRGGLGKGVKRALAGESI 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQ 141
                     ++  +  +A    +I  IDL + G  E + Q  AFL     V +     +
Sbjct: 61  FQATYTAKQDNE-HITCSATMPGKI--IDLYLDGEREYIAQKGAFLAGEESVNMDMFTVK 117

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++ + + G EG    KL+G+G AF+  GG + + NL V EVI V+   +    SSVN ++
Sbjct: 118 KISSALFGGEGIFLLKLSGKGTAFLEIGGHLEEYNLGVDEVIKVNTGNVAFFESSVNYEV 177

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           +     +  +FGG+ L    +TGPG V++Q++    L+  ++  +
Sbjct: 178 ETVKGAKNILFGGEGLFLTKLTGPGKVYLQTMNMGELAGALSSYI 222


>gi|374313431|ref|YP_005059861.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
 gi|358755441|gb|AEU38831.1| protein of unknown function DUF124 [Granulicella mallensis
           MP5ACTX8]
          Length = 253

 Score = 84.0 bits (206), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 62/226 (27%), Positives = 108/226 (47%), Gaps = 4/226 (1%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN---IYIPENEVGMWQWLFGKTI 82
           +I+G    V + +L+P E +++  G + +M+ S+ M     +       G  + + G   
Sbjct: 4   RIIGTTMPVLEALLQPGESLISEAGELSWMTQSIAMSTHTQMAGGGGFFGAIKRMAGGGT 63

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             +         G V  A      I+PI++   G E +     FLCS   +++     Q 
Sbjct: 64  LFMSEYTAQGYPGEVAFATKIPGHIVPIEVGP-GHEFMIHRHGFLCSTPQIQLGVGFQQS 122

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   I G +GFL QK+ G G A++   G VV ++L+ GE + V    + A  +SV+ QI 
Sbjct: 123 LGAGIFGGDGFLLQKVAGYGTAWLELSGEVVIRDLQPGETLRVHPGHVGAFQASVSFQIT 182

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
               IR   FGGD +  A +TGPG +++Q+LP   ++  +   ++S
Sbjct: 183 RIQGIRNMFFGGDGIFLAALTGPGRIWLQTLPLSNMAHALQPYLSS 228


>gi|166030398|ref|ZP_02233227.1| hypothetical protein DORFOR_00059 [Dorea formicigenerans ATCC
           27755]
 gi|166029756|gb|EDR48513.1| TIGR00266 family protein [Dorea formicigenerans ATCC 27755]
          Length = 226

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 60/219 (27%), Positives = 107/219 (48%), Gaps = 11/219 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I GG   V    L+  E ++   GSM +MS +M+ME +       G  + + G+  + 
Sbjct: 3   YEITGGNLPVVICYLQSGEGMITESGSMSWMSPNMKMETV------GGGMKKMLGRAFSG 56

Query: 85  V-VLRN---PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
               +N       DG +  A+     +  +++   G  ++ Q  AFL +   V++S  + 
Sbjct: 57  ENAFQNRYSAQGGDGMIAFASSFPGSVNALEIGP-GKSMIVQKKAFLAAEEGVELSVYMQ 115

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++   + G EGF+ QKLTGQG AFI   G  V+ NL  G+ + VD   + A+  +  + 
Sbjct: 116 KKLGKGLFGGEGFIMQKLTGQGTAFIEIDGYGVEYNLAPGQEMIVDTGYLAAMEETCTMD 175

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           I     ++    GG+ L   VV GPG + +Q++P   ++
Sbjct: 176 IVKVEGMKNMFLGGEGLFNTVVRGPGKIILQTMPIANVA 214


>gi|154496883|ref|ZP_02035579.1| hypothetical protein BACCAP_01176 [Bacteroides capillosus ATCC
           29799]
 gi|150273841|gb|EDN00954.1| TIGR00266 family protein [Pseudoflavonifractor capillosus ATCC
           29799]
          Length = 238

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 112/213 (52%), Gaps = 10/213 (4%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I GG   V    L+  EK++   GSM +MS +M+ME         G    +F +  + 
Sbjct: 16  YEIAGGTFPVVMCYLENGEKMITESGSMVWMSPNMQMET------SGGGLGKMFSRAFSG 69

Query: 85  V-VLRN--PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
             + +N      DG +   +    +I+PI++   G E++ Q  AFL +   V++S   ++
Sbjct: 70  ENIFQNIYTARGDGMITFGSSFPGKIIPIEIGP-GQEMILQKSAFLAAEAGVELSIHFNR 128

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++     G EGF+ Q+L+G+G+AF    G +V+ +L  G+ + VD   +     +V++ I
Sbjct: 129 KLGAGFFGGEGFIMQRLSGRGMAFAEIDGDLVEYDLAPGQQLVVDTGNVAGFAPTVSMDI 188

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           +    ++   FGG+ +   ++TGPG V++Q++P
Sbjct: 189 QQVPGLKNMFFGGEGIFNTLLTGPGKVWLQTMP 221


>gi|42526357|ref|NP_971455.1| hypothetical protein TDE0845 [Treponema denticola ATCC 35405]
 gi|422342684|ref|ZP_16423623.1| hypothetical protein HMPREF9353_02290 [Treponema denticola F0402]
 gi|449112516|ref|ZP_21749070.1| TIGR00266 family protein [Treponema denticola ATCC 33521]
 gi|449115262|ref|ZP_21751727.1| TIGR00266 family protein [Treponema denticola ATCC 35404]
 gi|41816469|gb|AAS11336.1| conserved hypothetical protein TIGR00266 [Treponema denticola ATCC
           35405]
 gi|325473300|gb|EGC76495.1| hypothetical protein HMPREF9353_02290 [Treponema denticola F0402]
 gi|448953614|gb|EMB34404.1| TIGR00266 family protein [Treponema denticola ATCC 35404]
 gi|448955978|gb|EMB36742.1| TIGR00266 family protein [Treponema denticola ATCC 33521]
          Length = 224

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 62/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)

Query: 23  IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           + ++I+G    A VC+  L+  EK++ + G M +MS  + ME         G      G+
Sbjct: 1   MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52

Query: 81  TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
             +   L   +    +DG +  A+     I   ++   G  L+ Q  AFL S   V +S 
Sbjct: 53  MFSGESLFLNKYTANTDGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++     G EGF+ QKL+G G  F+   GS V+  L  GE + +D   + A+  + 
Sbjct: 112 YFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAVEYTLGAGETMLLDTGYLAAMEETC 171

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            ++++    ++  +FGG+ L    VTGPG + IQ++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIIQTMPIHAVAQSL 216


>gi|449110017|ref|ZP_21746649.1| TIGR00266 family protein [Treponema denticola ATCC 33520]
 gi|449120778|ref|ZP_21757161.1| TIGR00266 family protein [Treponema denticola H1-T]
 gi|449123183|ref|ZP_21759513.1| TIGR00266 family protein [Treponema denticola MYR-T]
 gi|448946703|gb|EMB27557.1| TIGR00266 family protein [Treponema denticola MYR-T]
 gi|448947349|gb|EMB28196.1| TIGR00266 family protein [Treponema denticola H1-T]
 gi|448957725|gb|EMB38465.1| TIGR00266 family protein [Treponema denticola ATCC 33520]
          Length = 224

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)

Query: 23  IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           + ++I+G    A VC+  L+  EK++ + G M +MS  + ME         G      G+
Sbjct: 1   MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52

Query: 81  TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
             +   L   +    +DG +  A+     I   ++   G  L+ Q  AFL S   V +S 
Sbjct: 53  MFSGESLFLNKYTASTDGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++     G EGF+ QKL+G G  F+   GS ++  L  GE + +D   + A+  + 
Sbjct: 112 YFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAIEYTLGAGETMLLDTGYLAAMEETC 171

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            ++++    ++  +FGG+ L    VTGPG + IQ++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIIQTMPIHAVAQSL 216


>gi|449103210|ref|ZP_21739956.1| TIGR00266 family protein [Treponema denticola AL-2]
 gi|449125604|ref|ZP_21761906.1| TIGR00266 family protein [Treponema denticola OTK]
 gi|449130631|ref|ZP_21766851.1| TIGR00266 family protein [Treponema denticola SP37]
 gi|448939573|gb|EMB20490.1| TIGR00266 family protein [Treponema denticola OTK]
 gi|448942352|gb|EMB23247.1| TIGR00266 family protein [Treponema denticola SP37]
 gi|448965062|gb|EMB45727.1| TIGR00266 family protein [Treponema denticola AL-2]
          Length = 224

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)

Query: 23  IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           + ++I+G    A VC+  L+  EK++ + G M +MS  + ME         G      G+
Sbjct: 1   MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52

Query: 81  TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
             +   L   +    +DG +  A+     I   ++   G  L+ Q  AFL S   V +S 
Sbjct: 53  MFSGESLFLNKYTANTDGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++     G EGF+ QKL+G G  F+   GS ++  L  GE + +D   + A+  + 
Sbjct: 112 YFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAIEYTLGAGETMLLDTGYLAAMEETC 171

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            ++++    ++  +FGG+ L    VTGPG + IQ++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIIQTMPIHAVAQSL 216


>gi|449117826|ref|ZP_21754241.1| TIGR00266 family protein [Treponema denticola H-22]
 gi|448949717|gb|EMB30541.1| TIGR00266 family protein [Treponema denticola H-22]
          Length = 224

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)

Query: 23  IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           + ++I+G    A VC+  L+  EK++ + G M +MS  + ME         G      G+
Sbjct: 1   MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52

Query: 81  TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
             +   L   +    +DG +  A+     I   ++   G  L+ Q  AFL S   V +S 
Sbjct: 53  MFSGESLFLNKYTANADGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++     G EGF+ QKL+G G  F+   GS ++  L  GE + +D   + A+  + 
Sbjct: 112 YFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAIEYTLGAGETMLLDTGYLAAMEETC 171

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            ++++    ++  +FGG+ L    VTGPG + IQ++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIIQTMPIHAVAQSL 216


>gi|288560063|ref|YP_003423549.1| hypothetical protein mru_0806 [Methanobrevibacter ruminantium M1]
 gi|288542773|gb|ADC46657.1| hypothetical protein mru_0806 [Methanobrevibacter ruminantium M1]
          Length = 226

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 62/222 (27%), Positives = 110/222 (49%), Gaps = 7/222 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I GG   +    L+  E +    G+M FM+  M+M+         G+ + L G TI
Sbjct: 1   MEYEIQGGTFPIVVCTLQEGETMKNETGAMAFMTSGMKMDTNTGGGLLKGLGRALSGDTI 60

Query: 83  --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
                  +  G     VG +A S  RI+PI L      ++ Q +AFL + + V +     
Sbjct: 61  FLNYFTAQRDGEQ---VGFSACSPGRIMPIRLDGTN-TIIGQKNAFLAAEDSVDIDIHFR 116

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           + +   + G EGF+ QK +G G+AF+   G +++K L  GE + +D   + A+  SV   
Sbjct: 117 KSLGAGLFGGEGFVLQKFSGTGVAFLEIDGEIIKKELAPGEKLLIDPGHLAAMEESVGFD 176

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           I+     +  +F G+ L  + +TGPG V++Q++P   L++ +
Sbjct: 177 IERVKGAKNILF-GEGLFFSRLTGPGTVWLQTMPISSLARAL 217


>gi|449106766|ref|ZP_21743428.1| TIGR00266 family protein [Treponema denticola ASLM]
 gi|449127490|ref|ZP_21763763.1| TIGR00266 family protein [Treponema denticola SP33]
 gi|451968466|ref|ZP_21921695.1| TIGR00266 family protein [Treponema denticola US-Trep]
 gi|448944223|gb|EMB25104.1| TIGR00266 family protein [Treponema denticola SP33]
 gi|448964116|gb|EMB44789.1| TIGR00266 family protein [Treponema denticola ASLM]
 gi|451702479|gb|EMD56881.1| TIGR00266 family protein [Treponema denticola US-Trep]
          Length = 224

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 61/225 (27%), Positives = 109/225 (48%), Gaps = 14/225 (6%)

Query: 23  IPFQILGGE--AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           + ++I+G    A VC+  L+  EK++ + G M +MS  + ME         G      G+
Sbjct: 1   MEYKIIGDAFPAVVCK--LRAGEKMITQSGGMAWMSSGISMET------SAGGVGKALGR 52

Query: 81  TITSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
             +   L   +    +DG +  A+     I   ++   G  L+ Q  AFL S   V +S 
Sbjct: 53  MFSGESLFLNKYTANTDGEISFASRFPGAIKAFEITS-GSSLIAQKTAFLASTEGVDLSI 111

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              +++     G EGF+ QKL+G G  F+   GS V+  L  GE + +D   + A+  + 
Sbjct: 112 HFQKKLGAGFFGGEGFIMQKLSGNGTVFLEIDGSAVEYTLGAGETMLLDTGYLAAMEETC 171

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            ++++    ++  +FGG+ L    VTGPG + +Q++P H ++Q +
Sbjct: 172 KMEVEMIKGVKNILFGGEGLFNTKVTGPGKIIVQTMPIHAVAQSL 216


>gi|410726976|ref|ZP_11365206.1| hypothetical protein A370_03314 [Clostridium sp. Maddingley
           MBC34-26]
 gi|410599725|gb|EKQ54266.1| hypothetical protein A370_03314 [Clostridium sp. Maddingley
           MBC34-26]
          Length = 109

 Score = 82.4 bits (202), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 38/94 (40%), Positives = 62/94 (65%)

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
           +AFI A G++V+++L  GE++ +D  C+VA+T +V   I++ G I+  +FGG+ +  A V
Sbjct: 1   MAFIHACGAIVKRDLMPGEMLKIDTGCLVAMTRNVQYDIQFVGGIKNTLFGGEGVFFATV 60

Query: 223 TGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPK 256
           +GPG V+IQSLPF RL++R+  A      R   +
Sbjct: 61  SGPGTVWIQSLPFSRLAERVFAAAPGAGGRNKEE 94


>gi|169334218|ref|ZP_02861411.1| hypothetical protein ANASTE_00616 [Anaerofustis stercorihominis DSM
           17244]
 gi|169258935|gb|EDS72901.1| TIGR00266 family protein [Anaerofustis stercorihominis DSM 17244]
          Length = 252

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 59/217 (27%), Positives = 105/217 (48%), Gaps = 8/217 (3%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           ILGG        L P E ++   GSM +MS ++ ME         GM + L        +
Sbjct: 31  ILGGSLPTVVCNLAPGETMITESGSMSWMSPNIRMETT----TGGGMKKMLGRLASGESM 86

Query: 87  LRNP---GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
            +N      ++G +  ++     I+P ++A  G  ++ Q  AFL +   V++S    +++
Sbjct: 87  FQNKYTAQNTEGKIAFSSSFPGAIIPFEIAP-GKNMIVQKSAFLAAEESVELSMYFQKKL 145

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
              + G EG + QKL+GQG AFI   G  V+ +L  GE + V    + A+  + ++ ++ 
Sbjct: 146 GKGLFGGEGLIMQKLSGQGTAFIEIDGGSVEYDLAPGEELIVSTGYLAAMEETCSMDVQM 205

Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
               +  + GG+ L   VV GPG V +Q++P   +++
Sbjct: 206 VKGAKNILLGGEGLFNTVVKGPGKVILQTMPISNVAE 242


>gi|293400841|ref|ZP_06644986.1| HTH DNA-binding protein [Erysipelotrichaceae bacterium 5_2_54FAA]
 gi|291305867|gb|EFE47111.1| HTH DNA-binding protein [Erysipelotrichaceae bacterium 5_2_54FAA]
          Length = 233

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 63/221 (28%), Positives = 104/221 (47%), Gaps = 4/221 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I GG+  V    L+  EKV    G MC+M    EM          G+ + + G++I
Sbjct: 9   MEYEIKGGQLPVVVCHLQKDEKVFTESGGMCWMDDGFEMYANTRGGLLKGLGRAMAGESI 68

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQ 141
             +     G  +  +   +    +ILPI L    GE ++ Q  AFL + + V +     +
Sbjct: 69  F-LTTYTAGNDNSEIAFGSSFPGKILPITLG--DGECIIIQKTAFLAAEDSVTLQPFFRK 125

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           R+     G EGF+ QKL G G AF+   G +V+K L+ G+++ +D   I      V+  I
Sbjct: 126 RLGAGFFGGEGFILQKLIGPGTAFLEIDGDIVEKELQPGQLLKIDQGYIAMFEEHVDFDI 185

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
                ++     G+    A   GPG V++Q++PF  L+ RI
Sbjct: 186 TTVKGLKNKFLSGEGFFLATCKGPGKVWLQTMPFSVLADRI 226


>gi|146303673|ref|YP_001190989.1| hypothetical protein Msed_0893 [Metallosphaera sedula DSM 5348]
 gi|145701923|gb|ABP95065.1| protein of unknown function DUF124 [Metallosphaera sedula DSM 5348]
          Length = 230

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 103/218 (47%), Gaps = 7/218 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI+G + Q  ++ L PQE   A  G M   + S+ M+         G+ + L G +   
Sbjct: 4   YQIMGDDLQYLKVQLAPQEGFYADAGHMIMKTASVNMQTRMRGGFLSGLKRVLTGGSF-- 61

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
            V    GP D  +    P   +I+PI L   GG +L +  +FL +   ++   T+ +   
Sbjct: 62  FVTEFYGPGDVILSGIFP--GKIVPIQLQ--GGSILVEAHSFLGAETSIEYDATMARLTA 117

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
             +GG EG    +  G G  F+ + G ++ ++L  GE I V+ S ++A    +N  ++  
Sbjct: 118 GWLGG-EGLFLARFRGVGNVFLHSYGGLIVRDLAPGETIQVEASHLMAFQEGMNYSVQLV 176

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           G +R   F  + L    +TGPG V++ +L   +L+  +
Sbjct: 177 GGLRSVFFAHEGLFFVTITGPGRVWLHTLTAEQLASAL 214


>gi|163815175|ref|ZP_02206556.1| hypothetical protein COPEUT_01333 [Coprococcus eutactus ATCC 27759]
 gi|158449584|gb|EDP26579.1| TIGR00266 family protein [Coprococcus eutactus ATCC 27759]
          Length = 234

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 66/232 (28%), Positives = 116/232 (50%), Gaps = 18/232 (7%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT- 83
           ++I GG   V +I+L+  E +    G+M +MS +M+M+         G    +F + I+ 
Sbjct: 12  YEIKGGMMPVVEIILEAGESINCEKGAMVWMSPNMQMQT------SGGGVGKMFTRAISG 65

Query: 84  SVVLRNP-----GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
             + RN      GP  G +   +  +  I+ +++   G E++CQ  A+L S   + +   
Sbjct: 66  ESMFRNTYTAQGGP--GLIAFGSSFVGNIMAVEIGP-GKEIICQKSAYLASTPGINLEIA 122

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS--CIVAVTSS 196
             +++     G EGF+ Q++TGQG+ FI   GSVV   L  G+ + +D     ++  T S
Sbjct: 123 FQKKIAGGFFGGEGFIMQRITGQGIVFIEIDGSVVDYMLAPGQQMVIDTGYLAMMDATCS 182

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +N +    G +    FGG+     VVTGPG + +Q++P  +L+  IA  + S
Sbjct: 183 LNAETVKGGAMNM-FFGGEGFFNTVVTGPGRITLQTMPKSQLAAEIASMIPS 233


>gi|373451763|ref|ZP_09543682.1| TIGR00266 family protein [Eubacterium sp. 3_1_31]
 gi|371967984|gb|EHO85451.1| TIGR00266 family protein [Eubacterium sp. 3_1_31]
          Length = 225

 Score = 81.3 bits (199), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 103/220 (46%), Gaps = 2/220 (0%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I GG+  V    L+  EKV    G MC+M    EM          G+ + + G++I
Sbjct: 1   MEYEIKGGQLPVVVCHLQKDEKVFTESGGMCWMDDGFEMYANTRGGLLKGLGRAMAGESI 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             +     G  +  +   +    +ILPI L   G  ++ Q  AFL + + V +     +R
Sbjct: 61  F-LTTYTAGNDNSEIAFGSSFPGKILPITLCD-GECIIIQKTAFLAAEDSVTLQPFFRKR 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +     G EGF+ QKL G G AF+   G +V+K L+ G+++ +D   I      V+  I 
Sbjct: 119 LGAGFFGGEGFILQKLIGPGTAFLEIDGDIVEKELQPGQLLKIDQGYIAMFEEHVDFDIT 178

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
               ++     G+    A   GPG V++Q++PF  L+ RI
Sbjct: 179 TVKGLKNKFLSGEGFFLATCKGPGKVWLQTMPFSVLADRI 218


>gi|325264387|ref|ZP_08131118.1| HTH DNA-binding protein [Clostridium sp. D5]
 gi|324030458|gb|EGB91742.1| HTH DNA-binding protein [Clostridium sp. D5]
          Length = 229

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 64/231 (27%), Positives = 114/231 (49%), Gaps = 10/231 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I GG   V    L+ +E+++   GSM +MS +M+M+         G    +F K  
Sbjct: 5   MKYEIKGGAFPVVVCELEDREQMITEKGSMVWMSPNMQMDTAG------GGLGKMFSKAF 58

Query: 83  T-SVVLRN--PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
           +   + +N       G +   +    RILP+ +   G E++ Q  AFL S   +++S   
Sbjct: 59  SGESMFQNIYTAQGAGMIAFGSSFPGRILPLTIEP-GKEMVVQKSAFLASEAGIELSIHF 117

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
           ++++     G EGF+ Q+L G G+AFI   G +V+  L  G+ I VD   +      V +
Sbjct: 118 NKKMGAGFFGGEGFIMQRLAGTGMAFIEIDGELVEYQLGPGQQIVVDTGNVAGFEVGVQI 177

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
            I+    I+  + GG+ L   V+TGPG +++Q++P   ++  I   + S N
Sbjct: 178 DIQQVPGIKNKLLGGEGLFNTVLTGPGKIWLQTMPISSVAASIRPFIPSGN 228


>gi|291549051|emb|CBL25313.1| conserved hypothetical protein TIGR00266 [Ruminococcus torques
           L2-14]
          Length = 226

 Score = 80.9 bits (198), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 64/224 (28%), Positives = 111/224 (49%), Gaps = 10/224 (4%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I G    V    L+  E+++   GSM +MS +M+ME         G     FG+  + 
Sbjct: 3   YEIKGDTLPVVVCYLENGERMITERGSMSWMSPNMKMET-----TSNGGIGKAFGRMFSG 57

Query: 85  VVL----RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
             L          +G +  A+    +I+   +A  G EL+ Q   FL S   V +S    
Sbjct: 58  EALFQNRYTATGGNGMIAFASSFPGQIVARQIAP-GQELIVQKSGFLASEEGVDLSIFFQ 116

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           ++      G EGF+ Q+L+G G+AF+   G VV+  L  G+ I VD   + A+ ++ +++
Sbjct: 117 KKFGAGFFGGEGFIMQRLSGNGIAFLEFDGHVVEYELRPGQQIVVDTGYLAAIDATCSME 176

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           I+    ++  VFGG+ +   VVTGPG V++Q++P   ++  + R
Sbjct: 177 IQTVPGLKNMVFGGEGVFNTVVTGPGHVWLQTMPISSVAGELLR 220


>gi|331694270|ref|YP_004330509.1| hypothetical protein Psed_0384 [Pseudonocardia dioxanivorans
           CB1190]
 gi|326948959|gb|AEA22656.1| protein of unknown function DUF124 [Pseudonocardia dioxanivorans
           CB1190]
          Length = 237

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 105/217 (48%), Gaps = 14/217 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMEN------IYIPENEVGMWQWLFGKTITSVVL 87
           V +IML P E+V A  G + + +  M++E               G+ + L G  +   + 
Sbjct: 12  VLEIMLGPGERVTAEGGEVSWYTPGMQLETSTRFGSGGRGGFMSGLKRALGGGQL--FLT 69

Query: 88  RNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
               P+ G V  AA  P + R L ID A    E + Q  +++ S  DV+VS  L +R+  
Sbjct: 70  EYTAPTGGMVAFAARMPGVIRELTIDPA---DEFMVQSGSYIASTTDVEVSVALQKRLGA 126

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
            I G  G + QKL G G A++   G +V+ +L  G+ + V    +   T+ +++Q     
Sbjct: 127 GIFGGAGLVFQKLAGHGRAWVELSGEIVEYDLAQGQSMIVHPGHLALFTAGMDLQFATVK 186

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            I+   F GD L+ A + GPG V++QS+   +L+  I
Sbjct: 187 GIKNKFF-GDALMMAEIHGPGHVWLQSMTAAKLAAAI 222


>gi|150391921|ref|YP_001321970.1| hypothetical protein Amet_4231 [Alkaliphilus metalliredigens QYMF]
 gi|149951783|gb|ABR50311.1| protein of unknown function DUF124 [Alkaliphilus metalliredigens
           QYMF]
          Length = 227

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 44/148 (29%), Positives = 82/148 (55%), Gaps = 2/148 (1%)

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
             I+PID+      ++CQ  +FLCS  +V++     +R+ +   G EGF+ Q+L G+G+A
Sbjct: 82  GHIIPIDVT--NRSIICQKRSFLCSTEEVELEIAFQKRLGSGFFGGEGFIMQELRGKGMA 139

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           F+   G  V+  L+ GE I V+   +    S++ + ++        +FGG+ L    +TG
Sbjct: 140 FVEIDGESVEIQLKAGESIKVETGSVGMYESTIQMSVEMVKGFSNMLFGGEGLFLTTLTG 199

Query: 225 PGIVFIQSLPFHRLSQRIARAVTSPNMR 252
           PG+V+IQ++P   ++  +   + +P+ R
Sbjct: 200 PGLVWIQTMPIQSMAGEMYPYLPTPSNR 227


>gi|389575957|ref|ZP_10165985.1| TIGR00266 family protein [Eubacterium cellulosolvens 6]
 gi|389311442|gb|EIM56375.1| TIGR00266 family protein [Eubacterium cellulosolvens 6]
          Length = 225

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 57/227 (25%), Positives = 108/227 (47%), Gaps = 14/227 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I GG   V + +L+  E +    GSM +M   +EM        E G    +FG+  
Sbjct: 1   MQYKIDGGNLPVLKCLLEAGESMQCEGGSMAWMDDGIEMST------EGGSIGKVFGRMF 54

Query: 83  TSVVL---RNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
           T+  L   R      G +  ++  P   R + I        ++ Q  +FL +   + VS 
Sbjct: 55  TNEKLFMNRYTARQTGEIAFSSSFPGSIRCIEITPDR---PVIAQKGSFLAATEGIDVSV 111

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              ++V N+  G EG + ++ +G+GL F+   GS  + +L  G+   +D   +VA+ ++ 
Sbjct: 112 FFQKKVGNMFFGGEGLIMEQFSGKGLVFLEVDGSAHEYDLAPGQKKILDTGYLVAMDNTC 171

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           ++ +     ++  +FGG+ L   V+TGPG + +QS+P    +  + R
Sbjct: 172 SMDVVTIKGVKNVLFGGEGLFNTVITGPGHITVQSMPIATTAAVLYR 218


>gi|323454065|gb|EGB09936.1| hypothetical protein AURANDRAFT_14012, partial [Aureococcus
           anophagefferens]
          Length = 209

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 52/213 (24%), Positives = 102/213 (47%), Gaps = 5/213 (2%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           + ++G    +  + L P E +   PGSM FM  S++    ++  ++  + + L G+    
Sbjct: 2   WDLIGDSMHILNVGLAPGETMATEPGSMMFMGSSID---AHVDCSDC-LSRCLGGEACAM 57

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
           +   N G +  +V + +   A I+ +D+  +GG++  +  A++ SV D      +D    
Sbjct: 58  LNYTNRGAAPSYVALTSAKPADIVALDMDRYGGKISARSGAYMSSVGDAVPGCEVDCNPL 117

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
                  G +RQ +TG G AF+ A G+V +K L  GE+  +D + +VA   +  + ++  
Sbjct: 118 TCCFAGFGCVRQTITGSGTAFVEASGTVERKELAPGELFIIDSNSLVAWHGAT-LGVRTA 176

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
           G        G+      + GPG  ++QS+P+ +
Sbjct: 177 GSCLACCCNGEGCCNTTLLGPGTAWVQSMPWEQ 209


>gi|323482977|ref|ZP_08088373.1| hypothetical protein HMPREF9474_00122 [Clostridium symbiosum
           WAL-14163]
 gi|323691588|ref|ZP_08105852.1| hypothetical protein HMPREF9475_00714 [Clostridium symbiosum
           WAL-14673]
 gi|355625943|ref|ZP_09048474.1| hypothetical protein HMPREF1020_02553 [Clostridium sp. 7_3_54FAA]
 gi|323403683|gb|EGA95985.1| hypothetical protein HMPREF9474_00122 [Clostridium symbiosum
           WAL-14163]
 gi|323504381|gb|EGB20179.1| hypothetical protein HMPREF9475_00714 [Clostridium symbiosum
           WAL-14673]
 gi|354821076|gb|EHF05473.1| hypothetical protein HMPREF1020_02553 [Clostridium sp. 7_3_54FAA]
          Length = 226

 Score = 77.4 bits (189), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 61/233 (26%), Positives = 111/233 (47%), Gaps = 17/233 (7%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I GG   V    L   E V    G+MC+MS +MEM          G    +FG+  + 
Sbjct: 3   YEIKGGNLPVVICSLDNGESVFNESGAMCWMSPNMEMTT------SGGGIGKMFGRMFSG 56

Query: 85  VVL-------RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
             L       RN     G +  A+     I P+++   G  L+ Q   FL +   V++S 
Sbjct: 57  ENLFQNIYTARN---DSGMIAFASSFPGSIRPVEIRP-GDGLVVQKSGFLAAEAGVELSA 112

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
              ++      G EGF+ QKL+G+G AF+   G+ V+ +L  G+ + VD   + A++ + 
Sbjct: 113 FFQKKAGAGFFGGEGFIMQKLSGRGTAFLEIDGAAVEYDLAPGQKMIVDTGYLAAMSETC 172

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           ++ I+     +  +FGG+     ++TGPG + +Q++P  + +  + + +  P+
Sbjct: 173 SIDIQAVKGAKNMLFGGEGFFNTIITGPGHIILQTMPVSKTAMCMYQYMPHPS 225


>gi|320159924|ref|YP_004173148.1| hypothetical protein ANT_05140 [Anaerolinea thermophila UNI-1]
 gi|319993777|dbj|BAJ62548.1| hypothetical protein ANT_05140 [Anaerolinea thermophila UNI-1]
          Length = 269

 Score = 77.0 bits (188), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 56/222 (25%), Positives = 106/222 (47%), Gaps = 5/222 (2%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTIT 83
           +QI+G   Q   + +KP        GS+ +MS  + M+        +G    +F G+++ 
Sbjct: 31  YQIIGTVMQALTLRMKPGMGFYTEVGSLSWMSQGVRMDTNLGKGGLMGALGRMFTGESL- 89

Query: 84  SVVLRNPGPSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             V+      DG +   +     +I+P++LA  G  ++ Q D  L +   V +S    +R
Sbjct: 90  -FVVNYTAEIDGAMATFSCDFPGKIVPVNLAQ-GQAMIAQKDTLLVAEQSVSLSIAFQKR 147

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   + G EGF+ QK  G G  F    G +V+  L+ G+ + VD   +     +V   ++
Sbjct: 148 LGAGLFGGEGFILQKFEGPGTFFASFDGEIVEYTLQPGQKMLVDTGHLAMFEPTVTYDVQ 207

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
               ++  +FGG+ L    +TGPG +++Q++P  +L+  I +
Sbjct: 208 MVKGVKNLLFGGEGLFLVSLTGPGRIWLQTMPMSKLAGAIHK 249


>gi|218288578|ref|ZP_03492855.1| protein of unknown function DUF124 [Alicyclobacillus acidocaldarius
           LAA1]
 gi|218241235|gb|EED08410.1| protein of unknown function DUF124 [Alicyclobacillus acidocaldarius
           LAA1]
          Length = 225

 Score = 77.0 bits (188), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 56/228 (24%), Positives = 97/228 (42%), Gaps = 3/228 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G   Q  ++ L+P E V +  G +  M+    ME    P    GM++       
Sbjct: 1   MEYRIHGTTMQTLEVRLEPGESVFSEAGCLLAMTPGTYMET-RAPGGLAGMFRRALSGNS 59

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             +        +  V         IL + L  +G  ++ +  +FLC+ + V     +   
Sbjct: 60  VFLTYYRAARGESVVQFTTRMPGHILALPLRQYG-RVIAERHSFLCAESTVDYGVEVTLN 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   +GG  G     L G GLAF+   G +V+++L  GE + V    I A T  ++ +++
Sbjct: 119 IGRFLGG-NGLALTHLAGDGLAFLSIDGEIVERDLARGESLLVHPGHIAAFTDGIDYRVQ 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
               +   +F GD L    +TGPG V++ S+  H L   I     SP 
Sbjct: 178 LMQGVANMLFSGDGLFLVELTGPGRVWLHSVTVHNLVHIIQEYWGSPR 225


>gi|323341905|ref|ZP_08082138.1| protein of hypothetical function DUF124 [Erysipelothrix
           rhusiopathiae ATCC 19414]
 gi|336066016|ref|YP_004560874.1| hypothetical protein ERH_0778 [Erysipelothrix rhusiopathiae str.
           Fujisawa]
 gi|322464330|gb|EFY09523.1| protein of hypothetical function DUF124 [Erysipelothrix
           rhusiopathiae ATCC 19414]
 gi|334295962|dbj|BAK31833.1| conserved hypothetical protein [Erysipelothrix rhusiopathiae str.
           Fujisawa]
          Length = 225

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 57/223 (25%), Positives = 103/223 (46%), Gaps = 14/223 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL----RN 89
           V +  L   E ++   G+M +M+ +M  +      +  G ++  FG+  +          
Sbjct: 11  VLRCFLDEGESLITSAGNMSWMTANMSYK-----VHSGGGFKKAFGRAFSGEGFFQNTYT 65

Query: 90  PGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
            G S+  +  A   P + + + +D +      + Q +AFL S   V+ SN   +++    
Sbjct: 66  AGDSNQEIAFAMSMPGVIKHIEMDGSK---TFIAQKNAFLASEPGVEFSNEFTKKLSAGF 122

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
            G EGF+ QK +G G  F+ A GS++  +L+ GE + VD   +    S V+  I+    +
Sbjct: 123 FGGEGFILQKFSGVGNLFLEADGSLITYDLKEGESLLVDQGNLFMFESGVSYAIETVKGL 182

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
               FGG+      +TGPG V +Q+LP   L+  + R + + N
Sbjct: 183 SNKFFGGEGFFLVKLTGPGQVVLQTLPISNLAGEVNRVLPTSN 225


>gi|383790957|ref|YP_005475531.1| hypothetical protein [Spirochaeta africana DSM 8902]
 gi|383107491|gb|AFG37824.1| TIGR00266 family protein [Spirochaeta africana DSM 8902]
          Length = 228

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 65/232 (28%), Positives = 108/232 (46%), Gaps = 27/232 (11%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + + ILG    + +  L   EKV+++PG+M +M   + M+         G     F +T+
Sbjct: 1   MQYTILGSSMPLLECTLNAGEKVLSQPGAMKYMDYGVVMKT-----GIQGGISGFFKRTL 55

Query: 83  TSVVLRNPGPSDGFVGI--AAPSLAR----------ILPIDLAMFGGELLCQPDAFLCSV 130
           +           GF+    AA S  R          I+PID+A    +++CQ  AFL   
Sbjct: 56  SG--------ESGFMSTFEAARSGQRIAFGHTYPGSIIPIDVAH--SDIICQKRAFLACT 105

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
             V     + +R+ +   G EGF+ Q+L G G AF    G  V+  LE G+ I ++   +
Sbjct: 106 EGVTFDIAIQRRLGSGFFGGEGFIMQRLGGTGTAFAEIDGETVEMQLEPGQKIQIETGAV 165

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
               ++V++ I+    I    FGG+ L    +TGPG V++Q++   RL+  I
Sbjct: 166 AMFEATVDMNIEMVKGISNVFFGGEGLFLTTLTGPGKVWLQTMSVQRLASEI 217


>gi|167756077|ref|ZP_02428204.1| hypothetical protein CLORAM_01597 [Clostridium ramosum DSM 1402]
 gi|237734062|ref|ZP_04564543.1| conserved hypothetical protein [Mollicutes bacterium D7]
 gi|374624270|ref|ZP_09696687.1| TIGR00266 family protein [Coprobacillus sp. 8_2_54BFAA]
 gi|167704069|gb|EDS18648.1| TIGR00266 family protein [Clostridium ramosum DSM 1402]
 gi|229382888|gb|EEO32979.1| conserved hypothetical protein [Coprobacillus sp. D7]
 gi|373915553|gb|EHQ47301.1| TIGR00266 family protein [Coprobacillus sp. 8_2_54BFAA]
          Length = 224

 Score = 77.0 bits (188), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 52/215 (24%), Positives = 101/215 (46%), Gaps = 10/215 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G    V    L+  E ++   G+M +M   MEME         G +   FG+  
Sbjct: 1   MKYEIKGDNLPVVICHLEKGESMITDSGAMSWMDPVMEMETTS------GGFGKAFGRMF 54

Query: 83  TSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
           +   +   R     DG +  A+     I  +++     E++ Q  +FL   + + +S   
Sbjct: 55  SGETMFQNRYHAKEDGMIAFASSFPGSIEAVEINP-NHEIIVQKSSFLAGTSGIDMSVFF 113

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
           ++++   I G EGF+  K++G G  F+   GS +  +L  G+ + +D   +  + ++  +
Sbjct: 114 NKKMATGIFGGEGFIMNKISGTGTVFLEIDGSAISYDLLPGQQMVIDTGYLAMMDATCKM 173

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           +I+    ++  + GG+     VVTGPG V +QS+P
Sbjct: 174 EIQSVKGVKNKLLGGEGFFNTVVTGPGKVVLQSMP 208


>gi|224542111|ref|ZP_03682650.1| hypothetical protein CATMIT_01286 [Catenibacterium mitsuokai DSM
           15897]
 gi|224524948|gb|EEF94053.1| TIGR00266 family protein [Catenibacterium mitsuokai DSM 15897]
          Length = 223

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/224 (23%), Positives = 105/224 (46%), Gaps = 11/224 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G    V  + L   E ++   G+M +M  ++ M+ +       G  + LFG+  
Sbjct: 1   MKYEIKGKPLPVAILHLDQGESILTDSGAMVWMKPALTMDTV------SGGMKSLFGRAF 54

Query: 83  TSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
           +   L   +        + + +     IL  D++   G+++ Q  ++L S   VK   + 
Sbjct: 55  SGETLFLNKYTATQPSMIALGSSFPGDILVYDVSQ--GDIIVQKSSYLASEETVKREISF 112

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            ++      G EGF+  +  G GL FI   G+ ++ +L  GE + V+   +VA++SS ++
Sbjct: 113 SKKGTIGFFGGEGFIMNRYFGSGLLFIEIDGTAIEYDLNPGEELIVNTGNLVAMSSSCHI 172

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
            +     I+    GG+     VV GPG V +Q++P   L++ ++
Sbjct: 173 DVVSVKGIKNKFLGGEGFFNTVVKGPGHVIVQTMPISTLAESLS 216


>gi|291460386|ref|ZP_06599776.1| HTH DNA-binding protein [Oribacterium sp. oral taxon 078 str.
           F0262]
 gi|291416953|gb|EFE90672.1| HTH DNA-binding protein [Oribacterium sp. oral taxon 078 str.
           F0262]
          Length = 241

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 106/228 (46%), Gaps = 3/228 (1%)

Query: 11  QPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENE 70
           + +  Q    + + +QI GG      I L   E +++      +  G +E E        
Sbjct: 5   EKHEKQKIGGSSMRYQIEGGSLPAVVIQLDAGETIMSELAGRTWAKGPIETETRGTGGIG 64

Query: 71  VGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV 130
             + +   G+++        GP++  +G A+     I+  +L   G  L+CQ  AFLCS 
Sbjct: 65  KSIGRMFSGESLFMSHYTAKGPAE--IGFASSFAGSIIARELGP-GESLICQKSAFLCST 121

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
           + V +S    +++     G EGF+ QK+TG G+ F+   G V + +L  GE +  D   +
Sbjct: 122 SGVDLSIFFQKKLGAGFFGGEGFIMQKVTGPGVVFLELDGYVKEYDLAAGEELVCDTGVL 181

Query: 191 VAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
             + +S  + I+    ++  +FGG+ LV  V+ GPG  +IQ++    L
Sbjct: 182 ALMDASCKMDIRMVKGLKNKLFGGEGLVDTVIIGPGKAYIQTMSLAHL 229


>gi|384135763|ref|YP_005518477.1| hypothetical protein TC41_2047 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
 gi|339289848|gb|AEJ43958.1| protein of unknown function DUF124 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius Tc-4-1]
          Length = 267

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 55/228 (24%), Positives = 96/228 (42%), Gaps = 3/228 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + + I G   Q  ++ L+P E V +  G +  M+    ME    P    GM++       
Sbjct: 43  MDYHIHGSTMQTLEVRLEPGESVFSEAGCLLSMTPGTYMET-RAPGGLAGMFRRALSGNS 101

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             +        +  V         IL + L  +G  ++ +  +FLC+ + V     +   
Sbjct: 102 VFLTYYRAARGESVVQFTTRMPGHILALPLHQYG-RVIAERHSFLCAESTVDYGVEVTLN 160

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   +GG  G     L G G+AF+   G +V+++L  GE + V    I A T  ++ +++
Sbjct: 161 IGRFLGG-NGLALTHLAGDGMAFLSIDGEIVERDLARGESLLVHPGHIAAFTDGIDYRVQ 219

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
               +   +F GD L    +TGPG V++ S+  H L   I     SP 
Sbjct: 220 LMQGVANMLFSGDGLFLVELTGPGRVWLHSVTVHNLVHVIQAYWGSPR 267


>gi|256372358|ref|YP_003110182.1| hypothetical protein Afer_1587 [Acidimicrobium ferrooxidans DSM
           10331]
 gi|256008942|gb|ACU54509.1| protein of unknown function DUF124 [Acidimicrobium ferrooxidans DSM
           10331]
          Length = 254

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 60/219 (27%), Positives = 95/219 (43%), Gaps = 17/219 (7%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENI---------YIPENEVGMWQWLFGKTITS 84
           V +I+L P + +V+  G + +MS ++ M               +   G      G  + +
Sbjct: 12  VLEIVLDPGDHLVSDRGELSWMSANVRMTQTTGAAGGGLMGALKRVAG-----GGSLLLT 66

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                 GP  GFV   A     I+P+++A  G   L   D ++C   DV  S    Q + 
Sbjct: 67  HFEATGGP--GFVAFGAKLPGHIVPLEIAQ-GQAYLVHRDGWVCGTGDVVASMAFQQSLG 123

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
             + G EGF+ Q+L G G A+I  GG +V   L  G+ + V    + A   +V  Q+   
Sbjct: 124 GALFGKEGFILQRLEGHGTAWIELGGEMVNYQLAEGQQLLVHPGHVGAFAETVQFQVTRV 183

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             I     GGD      + GPG V++QS+P   L+Q + 
Sbjct: 184 PGIVNRWAGGDGHWLVQLVGPGQVWLQSMPLAVLAQALG 222


>gi|70607532|ref|YP_256402.1| hypothetical protein Saci_1803 [Sulfolobus acidocaldarius DSM 639]
 gi|449067783|ref|YP_007434865.1| hypothetical protein SacN8_08785 [Sulfolobus acidocaldarius N8]
 gi|449070057|ref|YP_007437138.1| hypothetical protein SacRon12I_08795 [Sulfolobus acidocaldarius
           Ron12/I]
 gi|68568180|gb|AAY81109.1| conserved Prokaryal protein [Sulfolobus acidocaldarius DSM 639]
 gi|449036291|gb|AGE71717.1| hypothetical protein SacN8_08785 [Sulfolobus acidocaldarius N8]
 gi|449038565|gb|AGE73990.1| hypothetical protein SacRon12I_08795 [Sulfolobus acidocaldarius
           Ron12/I]
          Length = 236

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/218 (25%), Positives = 98/218 (44%), Gaps = 8/218 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QILG + Q  ++ L   E   A  G M   S ++ ++         GM + L G T   
Sbjct: 4   YQILGDDLQYLKVELAQGEGFYADAGHMVMKSSTVNLQTKMRGGILSGMKRALTGGTFFV 63

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                PG     V ++     +I+ + +    G +L +   FL + N V+   T+ +   
Sbjct: 64  TEFYGPGE----VMLSGIFPGKIVQVQVQ---GSILAEAHTFLGAENTVEYDATMARLTT 116

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
            ++GG EG    +  G G  F+ A G +  K+L  GE I V+ S ++A    +N  ++  
Sbjct: 117 GILGG-EGLFLARFKGYGNLFLHAYGGLYVKDLAPGETIQVEASHLMAFQEGMNYSVQLV 175

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           G +R   F  + L    +TGPG V++ +L   +L+  +
Sbjct: 176 GGLRSIFFAHEGLFFVTITGPGRVWLHTLTVQQLAHAL 213


>gi|149918517|ref|ZP_01907006.1| hypothetical protein PPSIR1_20299 [Plesiocystis pacifica SIR-1]
 gi|149820593|gb|EDM80005.1| hypothetical protein PPSIR1_20299 [Plesiocystis pacifica SIR-1]
          Length = 217

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 101/209 (48%), Gaps = 6/209 (2%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +Q+ G  AQ  ++ L   + + A  G++     S+E  ++ IP N +       G T+T 
Sbjct: 10  YQLHGNIAQTIELSLDEGQSMWASKGALICYDDSIEW-SLEIPGNSLSRMLSGEGLTMTR 68

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
           V  + PG     V + A    +I   DL+   G + C P AFL +V  V++  T  +R  
Sbjct: 69  VTAKRPGAK---VVLGANDTGKIAAWDLSW--GPVTCTPGAFLAAVGGVQIDVTAARRAG 123

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G  G L QK++G+GL FI   G  +Q++L  GE + V    + A  S V+  +   
Sbjct: 124 AAFFGGAGLLLQKISGRGLVFIHGKGDFIQRDLAAGERLLVSTGNLAAYGSEVDYGVVGV 183

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
           G + + +F  + +    +TGPG V +Q+L
Sbjct: 184 GGLFKMLFAREGVFMTQLTGPGRVLLQTL 212


>gi|358067922|ref|ZP_09154394.1| hypothetical protein HMPREF9333_01275 [Johnsonella ignava ATCC
           51276]
 gi|356693891|gb|EHI55560.1| hypothetical protein HMPREF9333_01275 [Johnsonella ignava ATCC
           51276]
          Length = 225

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 63/226 (27%), Positives = 110/226 (48%), Gaps = 14/226 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I GG   V     +P EK++   G+M +MS +M M          G     FG+  
Sbjct: 1   MKYEIQGGNFPVAICSCEPGEKMICEGGAMSWMSPNMVMNT-----TTNGGLGKAFGRMF 55

Query: 83  T-SVVLRNPGPSDGFVGIAA-----PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           +   +  N   + G  G+ A     P   R + I+    G +++ Q  AFL     V++S
Sbjct: 56  SGESMFMNTYEAQGGPGMIAFASSFPGEIRAINIEP---GRDVVAQKRAFLACEPGVELS 112

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
               +++     G EGF+ QKL+GQG+AFI   GS V+  L  G+ I VD   +  + ++
Sbjct: 113 IFFQKKIGAGFFGGEGFIMQKLSGQGIAFIEIDGSAVEYELAAGQTIVVDTGYVALMDAT 172

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            ++ +K    ++  + GG+ L    VTGPG V +Q++P  +++  +
Sbjct: 173 CSMDVKTVPGVKNILLGGEGLFNTHVTGPGRVIVQTMPLSKMAASL 218


>gi|48477824|ref|YP_023530.1| HTH DNA-binding protein [Picrophilus torridus DSM 9790]
 gi|48430472|gb|AAT43337.1| HTH DNA-binding protein [Picrophilus torridus DSM 9790]
          Length = 219

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 108/229 (47%), Gaps = 17/229 (7%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           ++ ++I+GG+ Q  + +L   E +   PG + + S S  ++         G  +  F   
Sbjct: 1   MVNYEIVGGDVQYLKSILNEGESMYVEPGHLIYKSPSARLDV------RAGGLRGAFSHM 54

Query: 82  ITS---VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           +      +L+  GP +  V  A     ++  I+L      ++ + +AFLC  + V  S  
Sbjct: 55  LAGSDVFLLKVDGPGE--VASAGFLPGKVFKIELN--NNSIIAEFNAFLCMDSTVNYSTK 110

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
                + V GG EG   +K +G G   +   G V+++ L+ GE I V++S ++A   +V 
Sbjct: 111 FAGLWQGVFGG-EGLFLEKFSGTGSVMMHGHGQVIERELKPGEEIQVELSHVLAFEETVG 169

Query: 199 VQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             +   G ++  V G   G+ L  A +TGPG V++ S+    LS ++AR
Sbjct: 170 YNVVRIGGLKTMVLGGLEGEGLFFANMTGPGKVWLHSISLFELSAKLAR 218


>gi|258511908|ref|YP_003185342.1| hypothetical protein Aaci_1941 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius DSM 446]
 gi|257478634|gb|ACV58953.1| protein of unknown function DUF124 [Alicyclobacillus acidocaldarius
           subsp. acidocaldarius DSM 446]
          Length = 225

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 55/228 (24%), Positives = 97/228 (42%), Gaps = 3/228 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G   Q  ++ L+P E V +  G +  M+    ME    P    GM++       
Sbjct: 1   MDYRIHGTTMQTLEVRLEPGESVFSEAGCLLAMTPGTYMET-RAPGGLAGMFRRALSGNS 59

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             +        +  V         IL + L  +G  ++ +  +FLC+ + V     +   
Sbjct: 60  VFLTYYRAARGESVVQFTTRMPGHILALPLRQYG-RVIAERHSFLCAESTVDYGVEVTLN 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           +   +GG  G     L G G+AF+   G +V+++L  GE + V    I A T  ++ +++
Sbjct: 119 IGRFLGG-NGLALTHLAGDGMAFLSIDGEIVERDLARGESLLVHPGHIAAFTDGIDYRVQ 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
               +   +F GD L    +TGPG V++ S+  H L   I     SP 
Sbjct: 178 LMQGVANMLFSGDGLFLVELTGPGRVWLHSVTVHNLIHIIQEYWGSPR 225


>gi|410723384|ref|ZP_11362623.1| hypothetical protein A370_00664 [Clostridium sp. Maddingley
           MBC34-26]
 gi|410603236|gb|EKQ57676.1| hypothetical protein A370_00664 [Clostridium sp. Maddingley
           MBC34-26]
          Length = 146

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 47/137 (34%), Positives = 70/137 (51%), Gaps = 13/137 (9%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY---IPENEVGMWQWLF- 78
           I + I G + Q  +I L P E VVA  G++  M+ S+EME I+     +N  G    +F 
Sbjct: 6   IDYSIYGDDMQYVEIELDPMETVVAEAGALMMMAPSIEMETIFGDGSKQNTGGFMDKIFS 65

Query: 79  -------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN 131
                  G+++      N G     V  AAP   +ILP+DL  +GG+L+CQ DAFLC+  
Sbjct: 66  AGKRVLTGESLFMTAFTNAGQLREKVYFAAPYPGKILPMDLRAYGGQLICQKDAFLCAAK 125

Query: 132 DVKVSNTLDQRVRNVIG 148
            V +   +D R +  +G
Sbjct: 126 GVSIG--IDFRKKLGVG 140


>gi|229582748|ref|YP_002841147.1| hypothetical protein YN1551_2234 [Sulfolobus islandicus Y.N.15.51]
 gi|228013464|gb|ACP49225.1| protein of unknown function DUF124 [Sulfolobus islandicus
           Y.N.15.51]
          Length = 232

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++ILG + Q  ++ML   E + A  G M     S+ M+          M    FG     
Sbjct: 4   YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55

Query: 85  VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           +   +   ++ F+G     L+ + P   + + + G  +L +  +FLC+ N+V    TL++
Sbjct: 56  ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILAESHSFLCAENNVNYDATLNK 114

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
                +GG EG    +  G G  F+ A G +++K L  GE + V+ S ++A    +   I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLEVEASHLMAFEEGMRYGI 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
           +  G IR  +F G+ L    + GPG +++ +L   +L   + R       +  P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPSFNIRI 232


>gi|227829995|ref|YP_002831774.1| hypothetical protein LS215_1106 [Sulfolobus islandicus L.S.2.15]
 gi|284996932|ref|YP_003418699.1| hypothetical protein [Sulfolobus islandicus L.D.8.5]
 gi|227456442|gb|ACP35129.1| protein of unknown function DUF124 [Sulfolobus islandicus L.S.2.15]
 gi|284444827|gb|ADB86329.1| protein of unknown function DUF124 [Sulfolobus islandicus L.D.8.5]
          Length = 232

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++ILG + Q  ++ML   E + A  G M     S+ M+          M    FG     
Sbjct: 4   YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55

Query: 85  VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           +   +   ++ F+G     L+ + P   + + + G  +L +  +FLC+ N+V    TL++
Sbjct: 56  ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILAESHSFLCAENNVNYDATLNK 114

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
                +GG EG    +  G G  F+ A G +++K L  GE + V+ S ++A    +   I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLQVEASHLMAFEEGMRYGI 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
           +  G IR  +F G+ L    + GPG +++ +L   +L   + R       +  P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPSFNIRI 232


>gi|317056444|ref|YP_004104911.1| hypothetical protein Rumal_1779 [Ruminococcus albus 7]
 gi|315448713|gb|ADU22277.1| protein of unknown function DUF124 [Ruminococcus albus 7]
          Length = 225

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 45/149 (30%), Positives = 79/149 (53%), Gaps = 2/149 (1%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           DG + +A+     I P+D++     ++ Q  AFL S   V +     +R      G EGF
Sbjct: 71  DGMIAMASNFPGSIKPMDVSQM--PIVAQKSAFLASEMGVNMEIFFQKRFGAGFFGGEGF 128

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
           + QK +GQG  F+   GS+++  L  G+ + VD  C+ A+ +S ++ I+    ++  +FG
Sbjct: 129 IMQKFSGQGTVFLELDGSIIEYQLAQGQSMLVDTGCLAAMEASCSIDIEQVSGLKNKLFG 188

Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           G+      +TGPG V++Q++P   L+  I
Sbjct: 189 GEGFFNTRITGPGHVWLQTMPVSNLAGAI 217


>gi|406872158|gb|EKD22793.1| hypothetical protein ACD_83C00267G0005 [uncultured bacterium]
          Length = 228

 Score = 74.7 bits (182), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 96/210 (45%), Gaps = 4/210 (1%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           + +LG   Q   I LK  E + A PGS+ F + +++           G  + + G+++  
Sbjct: 3   YDVLGEVMQRLIITLKKGELIYAEPGSVYFANEAIQPATKLKGGILEGAKRVMTGESLF- 61

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
           + L       G + I    + +IL  DL +    L+CQ  AFLC    V +   +     
Sbjct: 62  IPLYASSSDGGEIHIKTKFIGKILTFDL-LPKQVLICQKGAFLCGQESVDIGLHIRHLTA 120

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
              GG EGF+ Q++ G G  F+   G V + NL   E I  +   +     SV+  I++ 
Sbjct: 121 GAFGG-EGFIMQRIVGPGWVFLAIDGEVEEHNLSASETIRAEPGHVAFHEPSVHCGIEFA 179

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           G  +   FGG  L  A + GPG V++Q++P
Sbjct: 180 G-FKNVFFGGFPLFWATLVGPGKVWLQTMP 208


>gi|383787060|ref|YP_005471629.1| hypothetical protein [Fervidobacterium pennivorans DSM 9078]
 gi|383109907|gb|AFG35510.1| TIGR00266 family protein [Fervidobacterium pennivorans DSM 9078]
          Length = 224

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 64/211 (30%), Positives = 101/211 (47%), Gaps = 18/211 (8%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL-----FGKTITS 84
           G   + ++ L P E+V   PG+M +M G + +E      +  G+W+ L      G++   
Sbjct: 9   GSYALLKVYLGPGERVKVEPGAMVYMKGPVNVET-----STGGVWKALKRAVLGGESFFM 63

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
               + G  DG +GIA P L     I+     G L  Q  +FL     V +  +     +
Sbjct: 64  NTYYSSG--DGEIGIA-PELPG--DIETVHVNGTLFVQSTSFLACDAQVDLDVSFGG-FK 117

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
           +   G EG    KL G G A I + G++    L+VGE IT+D   +VA   SVN  ++  
Sbjct: 118 SFFAG-EGIFLLKLQGYGDAAISSFGAIKILELQVGEKITIDTGHVVAFDGSVNYSVRTF 176

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
           G I+  +FGG+ LV    TGPG +++Q+  +
Sbjct: 177 GGIKSTLFGGEGLVCD-FTGPGRIYVQTRNY 206


>gi|331003350|ref|ZP_08326853.1| hypothetical protein HMPREF0491_01715 [Lachnospiraceae oral taxon
           107 str. F0167]
 gi|330412620|gb|EGG92005.1| hypothetical protein HMPREF0491_01715 [Lachnospiraceae oral taxon
           107 str. F0167]
          Length = 226

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 57/219 (26%), Positives = 102/219 (46%), Gaps = 3/219 (1%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI GG      I L+  E +++  G   +  G +E +          + +   G++   
Sbjct: 3   YQIEGGNLPAVVIELETGEVIISEVGGRTWAKGPIETDTKATGGIGKAIGRMFSGESFFM 62

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                 G ++  +G ++     I+  +L   G  ++CQ  AFLC+ ++V +S    +++ 
Sbjct: 63  SHYTAKGHAE--IGFSSSFAGSIIAKELKA-GESIICQKSAFLCATSEVSLSIHFQKKIG 119

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G EGF+ QK+TG G  F+   G V + +L  GE +  D   I  +  S  + +   
Sbjct: 120 AGFFGGEGFIMQKVTGPGTVFLELDGYVKEYDLNPGEQVVCDTGLIALMDESCTMDVVVV 179

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             I+   FG + L+  VVTGPG V+IQ++   RL + + 
Sbjct: 180 KGIKNKFFGSEGLIDTVVTGPGKVYIQTMSLPRLGEMLT 218


>gi|306821076|ref|ZP_07454694.1| protein of hypothetical function DUF124 [Eubacterium yurii subsp.
           margaretiae ATCC 43715]
 gi|304550912|gb|EFM38885.1| protein of hypothetical function DUF124 [Eubacterium yurii subsp.
           margaretiae ATCC 43715]
          Length = 226

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 57/214 (26%), Positives = 103/214 (48%), Gaps = 3/214 (1%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI GG      I L+  E +V+  G   +  G +  E          + +   G+ +  
Sbjct: 3   YQIEGGSLPAAVIQLEKGETLVSELGGRTWAKGPITTETKAPGGIGKSIGRMFSGENLFM 62

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                 GP +  +G A+     I+  +LA  G  ++CQ  AFLC+ + V++S    +++ 
Sbjct: 63  SYYTAEGPVE--IGFASSFAGSIVVKELAA-GQSIICQKSAFLCATSQVELSIFFQKKLG 119

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G EGF+ QK+TG G  F+   G V + +L  GE +  D   +  +  + ++ ++  
Sbjct: 120 AGFFGGEGFIMQKVTGPGTVFLELDGYVKEYDLVQGEELVCDTGVVALMDETCSMDVRMV 179

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
             ++  +FGG+ L+  V+ GPG V+IQ++   RL
Sbjct: 180 KGLKNKLFGGEGLIDTVIIGPGKVYIQTMSLPRL 213


>gi|229578344|ref|YP_002836742.1| hypothetical protein YG5714_0526 [Sulfolobus islandicus Y.G.57.14]
 gi|228009058|gb|ACP44820.1| protein of unknown function DUF124 [Sulfolobus islandicus
           Y.G.57.14]
          Length = 232

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++ILG + Q  ++ML   E + A  G M     S+ M+          M    FG     
Sbjct: 4   YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55

Query: 85  VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           +   +   ++ F+G     L+ + P   + + + G  +L +  +FLC+ N+V    TL++
Sbjct: 56  ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILTESHSFLCAENNVNYDATLNK 114

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
                +GG EG    +  G G  F+ A G +++K L  GE + V+ S ++A    +   I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLQVEASHLMAFEEGMRYGI 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
           +  G IR  +F G+ L    + GPG +++ +L   +L   + R       +  P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPSFNIRI 232


>gi|229584086|ref|YP_002842587.1| hypothetical protein M1627_0570 [Sulfolobus islandicus M.16.27]
 gi|385772809|ref|YP_005645375.1| hypothetical protein [Sulfolobus islandicus HVE10/4]
 gi|385775278|ref|YP_005647846.1| hypothetical protein [Sulfolobus islandicus REY15A]
 gi|228019135|gb|ACP54542.1| protein of unknown function DUF124 [Sulfolobus islandicus M.16.27]
 gi|323474026|gb|ADX84632.1| conserved hypothetical protein [Sulfolobus islandicus REY15A]
 gi|323476923|gb|ADX82161.1| conserved hypothetical protein [Sulfolobus islandicus HVE10/4]
          Length = 232

 Score = 74.3 bits (181), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++ILG + Q  ++ML   E + A  G M     S+ M+          M    FG     
Sbjct: 4   YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55

Query: 85  VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           +   +   ++ F+G     L+ + P   + + + G  +L +  +FLC+ N+V    TL++
Sbjct: 56  ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILAESHSFLCAENNVNYDATLNK 114

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
                +GG EG    +  G G  F+ A G +++K L  GE + V+ S ++A    +   I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLQVEASHLMAFEEGMRYGI 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
           +  G IR  +F G+ L    + GPG +++ +L   +L   + R       +  P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPGFNIRI 232


>gi|373118928|ref|ZP_09533043.1| TIGR00266 family protein [Lachnospiraceae bacterium 7_1_58FAA]
 gi|371665851|gb|EHO31009.1| TIGR00266 family protein [Lachnospiraceae bacterium 7_1_58FAA]
          Length = 226

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 61/227 (26%), Positives = 110/227 (48%), Gaps = 15/227 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I+G    V    L+  E ++   GSM +MS +M+ME         G     FG+  
Sbjct: 1   MKYEIIGEPMPVVVCDLEANEAMITERGSMVWMSPNMKMET------GAGGIGKAFGRMF 54

Query: 83  T-SVVLRNPGPSDGFVGIAA-----PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           +   + +N   + G  G+ A     P   + + ID A     ++ Q   FL S   V++S
Sbjct: 55  SGESMFQNTYTAQGGAGMIAFASSFPGSIKAVEIDPAH---PVVVQKSGFLASEQGVELS 111

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
               ++      G EGF+ QKL+G+G AF+   G  V+  L+ G+ + VD   +  + ++
Sbjct: 112 IFFQKKAGAGFFGGEGFIMQKLSGRGTAFLEVDGYAVEYELQAGQSMVVDSGNLAMMDAT 171

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             + I+    ++  +FGG+ L   VVTGPG + +Q++P   ++  +A
Sbjct: 172 CTIDIQMVKGVKNVLFGGEGLFNTVVTGPGKILLQTMPVSGVAAALA 218


>gi|227826910|ref|YP_002828689.1| hypothetical protein M1425_0548 [Sulfolobus islandicus M.14.25]
 gi|238619045|ref|YP_002913870.1| hypothetical protein M164_0580 [Sulfolobus islandicus M.16.4]
 gi|227458705|gb|ACP37391.1| protein of unknown function DUF124 [Sulfolobus islandicus M.14.25]
 gi|238380114|gb|ACR41202.1| protein of unknown function DUF124 [Sulfolobus islandicus M.16.4]
          Length = 232

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 60/240 (25%), Positives = 109/240 (45%), Gaps = 14/240 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++ILG + Q  ++ML   E + A  G M     S+ M+          M    FG     
Sbjct: 4   YEILGKDLQYLKVMLGEGEAIYADAGHMVAKQSSVIMKTT--------MRGGFFGALKRE 55

Query: 85  VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           +   +   ++ F+G     L+ + P   + + + G  +L +  +FLC+ N+V    TL++
Sbjct: 56  ITGGSFFVTE-FLGPGEVYLSGVFPGPIVQIQLQGNGILAESHSFLCAENNVNYDATLNR 114

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
                +GG EG    +  G G  F+ A G +++K L  GE + V+ S ++A    +   I
Sbjct: 115 LGVGWLGG-EGIFLARFRGDGNVFLHAYGGLIEKYLNPGESLQVEASHLMAFEEGMRYGI 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
           +  G IR  +F G+ L    + GPG +++ +L   +L   + R       +  P F ++I
Sbjct: 174 QPVGGIRSLLFSGEGLFFVNIEGPGKIWLHTLTVEQLVASL-RPYLPQGQQGGPGFNIRI 232


>gi|428183260|gb|EKX52118.1| hypothetical protein GUITHDRAFT_150692 [Guillardia theta CCMP2712]
          Length = 271

 Score = 74.3 bits (181), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 54/225 (24%), Positives = 104/225 (46%), Gaps = 17/225 (7%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGK 80
           + ++G   QV    L  +++V+  PG M   S     SM M  +++         +  G+
Sbjct: 44  WSVVGESTQVLMAKLLSEQEVLTEPGGMMIHSSGIKPSMNMGGLWLGIRRA----FCAGE 99

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVN-----DVKV 135
           +   V  +N       + I     ++++P++L  F G +  +  A++ S+N     D+K+
Sbjct: 100 SCCRVQWKNHESQPKIMTIHPSFPSKVVPVNLDEFQGSMYMKRAAWMASLNTQLEFDIKM 159

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
           + +            +G     + G+GL F+  GG+V++K L  GE + +D S +VA  +
Sbjct: 160 APSCTA----AWCAGQGCCLTLVKGRGLTFLNVGGTVLKKTLRDGEKVFIDTSALVAWAT 215

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           +  +  +  G       GG  L    +TGPG+V+IQS+   + +Q
Sbjct: 216 TATISAERAGNCCTICCGGMGLANTTITGPGVVYIQSMSREKAAQ 260


>gi|375084053|ref|ZP_09731063.1| hypothetical protein OCC_09481 [Thermococcus litoralis DSM 5473]
 gi|374741219|gb|EHR77647.1| hypothetical protein OCC_09481 [Thermococcus litoralis DSM 5473]
          Length = 222

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 56/199 (28%), Positives = 96/199 (48%), Gaps = 8/199 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           + ++ L+  E + A PG+M  MS ++E+E       +      L G++I     R  G  
Sbjct: 13  LVEVELEDGEAIKAEPGAMVHMSPNVEIETKTGGVFKALKRSMLGGESIFINTFRARG-G 71

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
            G +G+A P +  I  ++L    G L  Q  AFL S  ++++               EG 
Sbjct: 72  KGNIGLAPPYMGDIEALELR---GTLYAQSGAFLASSENIEIDTKFGGA--KTFFSREGL 126

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
              KLTG+G  F+ + G++ +K L   E   +D   +VA T  ++ +++  G I+  +FG
Sbjct: 127 FLLKLTGEGTVFLSSFGAIYKKELR-DERFIIDTGHLVAFTEGLDFKVRRVGGIKSTIFG 185

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV A   G G ++IQ+
Sbjct: 186 GEGLV-AEFYGTGTLYIQT 203


>gi|365843251|ref|ZP_09384197.1| TIGR00266 family protein [Flavonifractor plautii ATCC 29863]
 gi|364572827|gb|EHM50357.1| TIGR00266 family protein [Flavonifractor plautii ATCC 29863]
          Length = 226

 Score = 73.9 bits (180), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 61/227 (26%), Positives = 110/227 (48%), Gaps = 15/227 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I+G    V    L+  E ++   GSM +MS +M+ME         G     FG+  
Sbjct: 1   MKYEIIGEPMPVVVCDLEANEAMITERGSMVWMSPNMKMET------GAGGIGKAFGRMF 54

Query: 83  T-SVVLRNPGPSDGFVGIAA-----PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           +   + +N   + G  G+ A     P   + + ID A     ++ Q   FL S   V++S
Sbjct: 55  SGESMFQNTYTAQGGAGMIAFASSFPGSIKAVEIDPAH---PVVVQKSGFLASEQGVELS 111

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
               ++      G EGF+ QKL+G+G AF+   G  V+  L+ G+ + VD   +  + ++
Sbjct: 112 IFFQKKAGAGFFGGEGFIMQKLSGRGTAFLEVDGYAVEYELQAGQSMVVDSGNLAMMDAT 171

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
             + I+    ++  +FGG+ L   VVTGPG + +Q++P   ++  +A
Sbjct: 172 CTIDIQIVKGVKNVLFGGEGLFNTVVTGPGKILLQTMPVSGVAAALA 218


>gi|332652858|ref|ZP_08418603.1| conserved hypothetical protein [Ruminococcaceae bacterium D16]
 gi|332518004|gb|EGJ47607.1| conserved hypothetical protein [Ruminococcaceae bacterium D16]
          Length = 241

 Score = 73.9 bits (180), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 59/221 (26%), Positives = 111/221 (50%), Gaps = 10/221 (4%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT- 83
           +++ GG   V    L   E+++   GSM +MS +M+M+         G    +F K  + 
Sbjct: 19  YEVKGGSFPVVVCNLDSGEQMITEKGSMVWMSPNMKMDT------RGGGLGKMFSKAFSG 72

Query: 84  SVVLRNP--GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
             + +N      DG +   +    +I  + +A  G E++ Q  AFL + + V++S    +
Sbjct: 73  ESMFQNTYTAQGDGMIAFGSSFPGQIKVVTIAP-GQEMIVQKSAFLAAESGVELSIHFSK 131

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           ++     G EGF+ Q+L+G+G+AF+   G +V+  L+ G+ I VD   +      V + I
Sbjct: 132 KLGAGFFGGEGFIMQRLSGRGIAFVEIDGELVEYELKPGQQIIVDTGNVAGFEPGVQMDI 191

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +    ++  + GG+ L   V+TGPG V++Q++P   ++  I
Sbjct: 192 QQVPGLKNKLLGGEGLFNTVLTGPGRVWLQTMPICNVAASI 232


>gi|228967984|ref|ZP_04128993.1| Protein containing DUF124 [Bacillus thuringiensis serovar sotto
           str. T04001]
 gi|228791708|gb|EEM39301.1| Protein containing DUF124 [Bacillus thuringiensis serovar sotto
           str. T04001]
          Length = 135

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 64/129 (49%), Gaps = 14/129 (10%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E VVA  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPKESVVAEAGAMMMMEDYIEMETIF--GDGSGPSNGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKIVCQKDAF 119

Query: 127 LCSVNDVKV 135
           LC+   V +
Sbjct: 120 LCAAKGVSI 128


>gi|339442957|ref|YP_004708962.1| hypothetical protein CXIVA_18930 [Clostridium sp. SY8519]
 gi|338902358|dbj|BAK47860.1| hypothetical protein CXIVA_18930 [Clostridium sp. SY8519]
          Length = 224

 Score = 73.6 bits (179), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 56/204 (27%), Positives = 102/204 (50%), Gaps = 11/204 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT--SVVLRNP- 90
           V +  L   E +  + G+M +M+  + M+       + G    +F K I   S    N  
Sbjct: 12  VLECSLSAGESMKCQKGAMAWMTRGITMQT------KAGGIGKMFKKAIAGESAFWNNYM 65

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
             +DG +  +     +I+P+D+      ++ Q  AFL S   V+++  L +++     G 
Sbjct: 66  AEADGQIAFSTTFPGKIIPVDVGTM--PIVAQKSAFLASEAGVEMNIYLQEKIGAGFFGG 123

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
           EGF+ ++ +G+G  F+   G  V+K LE GE I VD   + A+  S +++I+    I   
Sbjct: 124 EGFIMEQFSGRGYVFLEIDGGFVEKQLEAGEEIVVDSGYLAAMDQSCSMRIERIKGIGNV 183

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLP 234
           + GG+ L   ++TGPG V++Q++P
Sbjct: 184 MLGGEGLFNTIITGPGKVWLQTMP 207


>gi|332796149|ref|YP_004457649.1| hypothetical protein Ahos_0463 [Acidianus hospitalis W1]
 gi|332693884|gb|AEE93351.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 227

 Score = 73.6 bits (179), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 60/222 (27%), Positives = 103/222 (46%), Gaps = 11/222 (4%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI+G + Q  ++ L P EK+ A  G +   S S+ ++          + + L G T   
Sbjct: 3   YQIIGDDIQYVKVFLNPGEKIYADAGHIMSKSSSVTIQAKMRGGILSAIKRELTGATFFV 62

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
             L  PG  D    IA     +I+PI L   G  +L +  +FL + + +     L     
Sbjct: 63  TELMGPGEVD----IAGVFPGKIVPIQLN--GKGILAESHSFLFAEDSINYDARLAPLAP 116

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
            ++GG EG      +G G  F+ A G + + NL  G+ I ++ S ++A+   +   +   
Sbjct: 117 AILGG-EGLFLATFSGVGQVFLHAYGGLYEINLMPGQSIDIEASHLLALEEGMQFTVTRV 175

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           G  +  +FGG+ L    V GPG V+I S+     +Q++A A+
Sbjct: 176 GGFKTMLFGGEGLYFVRVVGPGKVWIHSIT----AQQMASAL 213


>gi|423603451|ref|ZP_17579344.1| hypothetical protein IIK_00032 [Bacillus cereus VD102]
 gi|401247430|gb|EJR53766.1| hypothetical protein IIK_00032 [Bacillus cereus VD102]
          Length = 160

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 42/137 (30%), Positives = 67/137 (48%), Gaps = 14/137 (10%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAF 126
           GK + +             V  N G     V  AAP   +I+P+DL  + G+++CQ DAF
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPGKIIPVDLTEYQGKVVCQKDAF 119

Query: 127 LCSVNDVKVSNTLDQRV 143
           LC+   V +     +++
Sbjct: 120 LCAAKGVSIGIEFTKKI 136


>gi|325680701|ref|ZP_08160239.1| TIGR00266 family protein [Ruminococcus albus 8]
 gi|324107481|gb|EGC01759.1| TIGR00266 family protein [Ruminococcus albus 8]
          Length = 225

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/218 (26%), Positives = 102/218 (46%), Gaps = 3/218 (1%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I G    V    L   E +  + G+M +MS +M M           + +   G++I  
Sbjct: 3   YEIQGEPLPVVICYLANGEAIKCQQGAMSWMSPNMNMSTNAGGGIGKALSRAFSGESIFQ 62

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
            V       DG + +A+     I P+D++     ++ Q  AFL S   V +     +R  
Sbjct: 63  NVYTAMN-GDGMIAMASNFPGSIKPMDVSQM--PIVAQKSAFLASEMGVNMEIFFQKRFG 119

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G EGF+ QK +G G  F+   GS+++  L  G+ + VD  C+ A+ +S ++ I+  
Sbjct: 120 AGFFGGEGFIMQKFSGNGTVFLELDGSIIEYQLAAGQSMLVDTGCLAAMEASCSIDIEQV 179

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             ++   FGG+      +TGPG V++Q++P   L+  I
Sbjct: 180 SGLKNMFFGGEGFFNTRITGPGHVWLQTMPVSNLAGAI 217


>gi|196233638|ref|ZP_03132479.1| protein of unknown function DUF124 [Chthoniobacter flavus Ellin428]
 gi|196222308|gb|EDY16837.1| protein of unknown function DUF124 [Chthoniobacter flavus Ellin428]
          Length = 230

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 61/224 (27%), Positives = 99/224 (44%), Gaps = 10/224 (4%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENI----YIPENEVGMWQWLFGK 80
            +I+G    V  I L   E VVA  G + +M+ S++M               + + + G 
Sbjct: 3   HEIIGTTMPVLSISLAKGESVVAVSGELSWMTSSIQMNTTTNAAGGGGFFGAIKRAVGGG 62

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNT 138
           ++     R  G   G V  A      I+PI+L+   G+   +     +LC    V++   
Sbjct: 63  SLFMTEYRAEG-DHGVVAFATKLPGHIVPIELS---GDKTYMIHRHGYLCGTPGVELGVG 118

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
             Q +   + G  GF+ QK+ G G A+I  GG +V  +L  GE + V    +     +V 
Sbjct: 119 FQQSLGAGVFGGSGFVMQKMAGTGSAWIELGGEIVTYDLAAGETLRVHPGHVGMFEGNVK 178

Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            +I     I   +FGGD +  A +TGPG V++Q+L    L+  I
Sbjct: 179 FEITRIKGIANMLFGGDGVFLAALTGPGRVWLQTLTVSNLAHAI 222


>gi|325968543|ref|YP_004244735.1| HTH DNA-binding protein [Vulcanisaeta moutnovskia 768-28]
 gi|323707746|gb|ADY01233.1| HTH DNA-binding protein [Vulcanisaeta moutnovskia 768-28]
          Length = 225

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 58/212 (27%), Positives = 99/212 (46%), Gaps = 7/212 (3%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           +I F+I+G + Q   + L P +++ A  G + + S S+ ++         G+ + + G +
Sbjct: 1   MINFKIMGNDIQHLYVELGPSDRIYAEGGHLIWKSSSVNIKATTGSGFMAGLKRAITGAS 60

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
              + L  PG  D    IA  +  +I+ IDL   G  ++ +  AFL     V     L  
Sbjct: 61  FFVLELEGPGAID----IAGFAPGKIVEIDLG--GNGVMVEHRAFLAMEPTVNYDVKLVG 114

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
                +GG EG L  +L G G  F+ A G  +   L  GE I V+   ++A    + V I
Sbjct: 115 LGFGWLGG-EGLLMARLQGNGRVFLHAIGDALMLELRPGESIDVEAGHVLAFDEGMRVSI 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
           +  G +R  +FG + L  A + GPG  ++++L
Sbjct: 174 RRVGGLRTMLFGEEGLWLAHIEGPGKAWLRTL 205


>gi|402309879|ref|ZP_10828850.1| TIGR00266 family protein [Eubacterium sp. AS15]
 gi|400370260|gb|EJP23250.1| TIGR00266 family protein [Eubacterium sp. AS15]
          Length = 226

 Score = 72.4 bits (176), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/214 (26%), Positives = 102/214 (47%), Gaps = 3/214 (1%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI GG      I L+  E +V+  G   +  G +  E          + +   G+ +  
Sbjct: 3   YQIEGGSLPAAVIQLEKGETLVSELGGRTWAKGPITTETKAPGGIGKSIGRMFSGENLFM 62

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
                 GP +  +G A+     I+  +LA  G  ++CQ  AFLC+ + V++S    +++ 
Sbjct: 63  SYYTAEGPVE--IGFASSFAGSIVVKELAA-GQSIICQKSAFLCATSQVELSIFFQKKLG 119

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
               G EGF+ QK+TG    F+   G V + +L  GE +  D   +  +  + ++ ++  
Sbjct: 120 AGFFGGEGFIMQKVTGPATVFLELDGYVKEYDLVQGEELVCDTGVVALMDETCSMDVRMV 179

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
             ++  +FGG+ L+  V+ GPG V+IQ++   RL
Sbjct: 180 KGLKNKLFGGEGLIDTVIIGPGKVYIQTMSLPRL 213


>gi|217077739|ref|YP_002335457.1| hypothetical protein THA_1679 [Thermosipho africanus TCF52B]
 gi|419760339|ref|ZP_14286618.1| hypothetical protein H17ap60334_05724 [Thermosipho africanus
           H17ap60334]
 gi|217037594|gb|ACJ76116.1| conserved hypothetical protein [Thermosipho africanus TCF52B]
 gi|407514442|gb|EKF49257.1| hypothetical protein H17ap60334_05724 [Thermosipho africanus
           H17ap60334]
          Length = 224

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 61/211 (28%), Positives = 93/211 (44%), Gaps = 24/211 (11%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN----IY--IPENEVGMWQWLFGKTIT 83
           G   +  + L+  E +VA PG+M  M G +E+      I+  +    +G   +   K I+
Sbjct: 9   GSYAILFVELEIGETIVAEPGAMVSMEGDIEVNTSTGGIFKALKRAFLGGEHFFLNKYIS 68

Query: 84  SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFG--GELLCQPDAFLCSVNDVKVSNTLDQ 141
                           +  S A +LP D+ +         Q  ++L S   + +      
Sbjct: 69  HSK-------------SKISFAPLLPGDIEVIDIKNGFFLQSTSYLASQETISIDTKFGG 115

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
             +    G EGF   KL+G G A I A G +  K LE GE ITVD   IV    +V   +
Sbjct: 116 -FKTFFSG-EGFFLLKLSGYGKAAISAFGGIYHKELEPGEAITVDTGHIVGFDETVKYSV 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  G I+  +FGG+ LV  +V GPG V+IQ+
Sbjct: 174 RTFGGIKSTLFGGEGLVVDLV-GPGRVYIQT 203


>gi|347757146|ref|YP_004864708.1| hypothetical protein MICA_361 [Micavibrio aeruginosavorus ARL-13]
 gi|347589664|gb|AEP08706.1| conserved hypothetical protein [Micavibrio aeruginosavorus ARL-13]
          Length = 251

 Score = 72.0 bits (175), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 60/247 (24%), Positives = 107/247 (43%), Gaps = 25/247 (10%)

Query: 4   PFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMEN 63
           P  ST F      + Q     F+I G      ++ ++P +  V   G++   S S++M++
Sbjct: 7   PKASTAFNG----TAQTHGFEFRINGAHTPYLELHMQPGQAFVGEAGTLMGQSDSVKMKS 62

Query: 64  IYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVG---------IAAPSLARILP----- 109
                +  G    LF       V+R  G  + F+          +   SLA  LP     
Sbjct: 63  TLGDGSNSGFLSSLF-----RSVMRKMGGENFFINQYVNMASDKVGKLSLAAALPGEIVA 117

Query: 110 IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAG 169
           ID+A  G  ++ Q  +FL +    ++ + +   +  ++      + Q++ G+   F+ A 
Sbjct: 118 IDIAEHGKRIVAQRGSFLAAEQGAQLGSKMKLSLFGMLS--NSVVMQEIKGEAWTFLNAP 175

Query: 170 GSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVF 229
           G+V    L+ GEV   D  C+VA + SV + +  +G +   V GG+    A   GPG ++
Sbjct: 176 GNVQCIQLDAGEVYKTDARCLVAASDSVKLGMALSGGLLTMVTGGEGAFVAKAEGPGTIW 235

Query: 230 IQSLPFH 236
           + S PF 
Sbjct: 236 VGSKPFE 242


>gi|319935941|ref|ZP_08010365.1| hypothetical protein HMPREF9488_01196 [Coprobacillus sp. 29_1]
 gi|319809055|gb|EFW05551.1| hypothetical protein HMPREF9488_01196 [Coprobacillus sp. 29_1]
          Length = 225

 Score = 71.6 bits (174), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/225 (24%), Positives = 105/225 (46%), Gaps = 9/225 (4%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G    V    LK  + +++  G+M +M   MEM          G     FG+  
Sbjct: 1   MEYEIKGVPLPVVICYLKKGDVMLSDSGAMAWMDPCMEMGT-----TSNGGIGKAFGRMF 55

Query: 83  TSVVL---RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
           +   +        +DG +  ++     I   D+   G E++ Q  ++L S   V+ +   
Sbjct: 56  SGETMFLNSYTANADGKIAFSSSFPGEIRKYDIEP-GKEIVIQKSSYLASEPTVERTIFF 114

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
           +++    I G EGF+  KL+G+G  F+   G+ V+ +L  G+ I VD   +  + ++  +
Sbjct: 115 NKKAMTGIFGGEGFIMNKLSGKGTVFVEIDGATVEYDLAPGQKIVVDTGYVAVMDATCTM 174

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             +    ++  +FGG+     VVTGPG V +Q++P   ++  +A+
Sbjct: 175 TTQSVPGLKNKMFGGEGFFNTVVTGPGHVVLQTMPISAIAGSLAQ 219


>gi|451343926|ref|ZP_21912992.1| TIGR00266 family protein [Eggerthia catenaformis OT 569 = DSM
           20559]
 gi|449337501|gb|EMD16663.1| TIGR00266 family protein [Eggerthia catenaformis OT 569 = DSM
           20559]
          Length = 225

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 52/224 (23%), Positives = 103/224 (45%), Gaps = 3/224 (1%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G       + L   E +++  G+M +M+  ++ME          + +   G+T+
Sbjct: 1   MKYEIKGSPFPAVLLNLNQGESMLSDSGAMAWMTPGIQMETTSNGGVGKALGRAFSGETL 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
              + +    +DG +  ++     I  + L   G  ++ Q  AFL S   V       ++
Sbjct: 61  --FINKYTAMADGQIAFSSSFPGEIRAVQLQP-GQGIVLQKSAFLASTTGVSREVFFSRK 117

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
               + G EGF+  +  G GL F+   G+ V+ +L  G+ + +D   + A++ +  + I+
Sbjct: 118 FSTGLFGGEGFIMSRYIGPGLLFLEIDGTAVEYDLAAGQQMIIDSGYLAAMSDTCKMDIQ 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
               ++    GG+ L   VVTGPG + IQ++P   L+  I R +
Sbjct: 178 RIKGMKNIFLGGEGLFNTVVTGPGHIIIQTMPISSLAGTIGRYI 221


>gi|15898554|ref|NP_343159.1| hypothetical protein SSO1755 [Sulfolobus solfataricus P2]
 gi|284174124|ref|ZP_06388093.1| hypothetical protein Ssol98_05607 [Sulfolobus solfataricus 98/2]
 gi|384434925|ref|YP_005644283.1| hypothetical protein [Sulfolobus solfataricus 98/2]
 gi|13814997|gb|AAK41949.1| Conserved hypothetical protein [Sulfolobus solfataricus P2]
 gi|261603079|gb|ACX92682.1| protein of unknown function DUF124 [Sulfolobus solfataricus 98/2]
          Length = 232

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 57/240 (23%), Positives = 106/240 (44%), Gaps = 14/240 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++ILG + Q  ++ML   E +    G M     S+ M+          + + + G +   
Sbjct: 4   YEILGKDLQYLKVMLGEGEAIYVDAGHMIAKQSSVTMKTTMRGGFFGALKREITGGSFFV 63

Query: 85  VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
             L  PG            L+ + P   + + + G  +L +  +FLC+ N V    TL++
Sbjct: 64  TELYGPGEV---------YLSGVFPGPIVQIQLQGNGILAESHSFLCAENGVNYDATLNK 114

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
                +GG EG    K  G G  F+ A G ++++ L+ GE + V+ + ++A    +   I
Sbjct: 115 LSVGWLGG-EGIFLAKFRGTGNVFLHAYGGLIERYLDPGERLQVEAAHLMAFEEGMRYGI 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
           +  G I+  +F G+ L    + GPG V++ +L   +L+  + R       +  P F + I
Sbjct: 174 QRVGGIKSILFSGEGLFFVDIEGPGKVWLHTLTVEQLAASL-RPYLPQGQQGGPSFNIGI 232


>gi|15920866|ref|NP_376535.1| hypothetical protein ST0645 [Sulfolobus tokodaii str. 7]
 gi|15621650|dbj|BAB65644.1| hypothetical protein STK_06450 [Sulfolobus tokodaii str. 7]
          Length = 230

 Score = 71.2 bits (173), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 96/221 (43%), Gaps = 13/221 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +QI G + Q  +++L P E + A  G M     S+ ++         G+ + L G +   
Sbjct: 4   YQIFGNDLQYLKVILAPGEGIYADAGHMIAKQASVTLQTRLRGGILSGLKRVLTGGSFFV 63

Query: 85  VVLRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
                PG            L+ I P   I + + G  +L +   FL + N V     L +
Sbjct: 64  TEFYGPGEL---------YLSGIFPGKIIQIPLEGRGILAEAHTFLAAENSVNYDTQLAK 114

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
                +GG EG    K  G G  F+ + G V+ ++L  GE++ V+ S ++A    +N  +
Sbjct: 115 LTAGWLGG-EGLFLAKFRGFGNVFLHSYGDVIVRDLAPGEILQVEASHLMAFQEGMNYDV 173

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +  G +R   F  + L    + GPG V++ +L   +L+  +
Sbjct: 174 QLVGGLRSIFFAHEGLFFVTIQGPGRVWLHTLTAEQLASAL 214


>gi|150021217|ref|YP_001306571.1| hypothetical protein Tmel_1337 [Thermosipho melanesiensis BI429]
 gi|149793738|gb|ABR31186.1| protein of unknown function DUF124 [Thermosipho melanesiensis
           BI429]
          Length = 224

 Score = 71.2 bits (173), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 59/205 (28%), Positives = 95/205 (46%), Gaps = 12/205 (5%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
           G   +  + L+P E ++  PG+M  MSG +E+E       +     +L G+ I     + 
Sbjct: 9   GSYSILFVDLEPGEIIIGEPGAMVTMSGDLEIETSTGGVFKALKRAFLGGEHIFLNKYKA 68

Query: 90  PGPSDGFVGIAAPSLARILP--IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
              S         S A  LP  ID+    G +  Q  +FL S   + + +T     ++  
Sbjct: 69  KNKS-------RISFAPKLPGDIDIIDLNGTVYLQSTSFLASEEGINL-DTKFTGFKSFF 120

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
            G EGF   KL+G G   I + G++    L+  E IT+D   +VA   SVN  ++  G  
Sbjct: 121 SG-EGFFLLKLSGYGKLAISSFGAIFSLELKEDEKITIDTGHVVAFDESVNYSVRTFGGF 179

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  +FGG+ LV   + GPG ++IQ+
Sbjct: 180 KSTIFGGEGLVVDFI-GPGKIYIQT 203


>gi|428174779|gb|EKX43673.1| hypothetical protein GUITHDRAFT_140416 [Guillardia theta CCMP2712]
          Length = 234

 Score = 70.5 bits (171), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 54/218 (24%), Positives = 96/218 (44%), Gaps = 42/218 (19%)

Query: 24  PFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT 83
           P+QI+G ++Q+    L P ++V+  PG  C +     ++ +   +N +G   +L  + I 
Sbjct: 29  PWQIIGTDSQLIVFRLAPGQEVMMEPG--CMVHADDGVKPVTSLQNPLG--AFLADEDIF 84

Query: 84  SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
            +  +N   +D  +  AA   ++I+P+DL+++ G +  Q  +FL +              
Sbjct: 85  RLKWKNESSTDKTLSAAAMVPSKIIPLDLSVYSG-ITIQKGSFLAT-------------- 129

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
                                  L    ++ KNL  GE    D  C++AV+S+V+V I+ 
Sbjct: 130 -----------------------LGRWQIMTKNLAPGEKYIFDTDCMLAVSSTVDVSIRR 166

Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQR 241
            G      FGG  L     TGPG++  Q+LP  +   R
Sbjct: 167 AGGCAMMCFGGKGLFNTEYTGPGLIIFQTLPAAKSGAR 204


>gi|297526400|ref|YP_003668424.1| hypothetical protein Shell_0393 [Staphylothermus hellenicus DSM
           12710]
 gi|297255316|gb|ADI31525.1| protein of unknown function DUF124 [Staphylothermus hellenicus DSM
           12710]
          Length = 224

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 66/227 (29%), Positives = 110/227 (48%), Gaps = 31/227 (13%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP-- 90
            + Q++L+P E V A PG+M +M+G     +I +  +  G+     GK I   +L     
Sbjct: 12  SLLQVVLEPNENVTAEPGAMMYMAG-----DINVKTHTGGL-----GKAIARKLLGGESI 61

Query: 91  -------GPSDGFVGIAAPSLARILPIDL---AMFGGE-LLCQPDAFLCSVNDVKVSNTL 139
                  GP  G V  A PSL    P D+   A+ G   L+ Q  ++L    D+K+S   
Sbjct: 62  FMNTFIAGPRGGEVWFA-PSL----PGDIKYIALNGSRNLIIQDTSYLAHHGDIKLSVAW 116

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            + +R ++   E F   + +G G  +I + G +++K L VGE I +D    VA+   +  
Sbjct: 117 -RGLRGLLAEGEMFW-LRASGVGGVWINSYGGIIEKELGVGEKIIIDNFHFVAMDDGMKW 174

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
            I+  G ++  +FGG+ LV   V GPG +++Q+       Q IA+ +
Sbjct: 175 NIRRFGGLKSFLFGGEGLVIE-VEGPGRIYVQTRSLPPFMQLIAKYI 220


>gi|257076770|ref|ZP_05571131.1| HTH DNA-binding protein [Ferroplasma acidarmanus fer1]
          Length = 219

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/229 (24%), Positives = 101/229 (44%), Gaps = 17/229 (7%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           ++ ++I GG+ Q     L+  E     PG + + +    ++         G  +  F   
Sbjct: 1   MVKYEIRGGDVQYLHATLEAGESAYIEPGHLIYKTPGARLDT------GTGGLRGAFSHM 54

Query: 82  ITS---VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           +      +L+  GP +   G A     ++  I L      +  + +AFLC  + +K S  
Sbjct: 55  LAGSAVFLLKVDGPGN--FGSAGFLPGKVFEIKLN--NNSIFAEFNAFLCMDSTIKYSTK 110

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
                + + GG EG   ++ +G G   +   G V++  L+ GE I  ++S ++A  SSV+
Sbjct: 111 FAGFWQAIFGG-EGIFLEEFSGTGSVMLHGHGQVIEYELKDGEEIQAELSHVLAFESSVS 169

Query: 199 VQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             +   G I+  V G   G+    A + GPG V++ S+    LS ++AR
Sbjct: 170 YNVSRIGGIKTMVLGGMEGEGFFFANMRGPGKVWLHSISLLELSAKLAR 218


>gi|393796012|ref|ZP_10379376.1| hypothetical protein CNitlB_06707 [Candidatus Nitrosoarchaeum
           limnia BG20]
          Length = 236

 Score = 69.7 bits (169), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 55/225 (24%), Positives = 104/225 (46%), Gaps = 22/225 (9%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + +QIL     + ++ LK  E + A  G++ FM G ++++     +   G+      KT+
Sbjct: 1   MEYQILKNPMSLLEVNLKKDESITAEAGALVFMQGDIQID----TKTRSGIL-----KTL 51

Query: 83  TSVVLRNP--------GPSDG-FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
              +L N            DG  +G+  P +  I+ I +    G  + Q  +++ S  DV
Sbjct: 52  KVSILGNESFFVNKYTAKHDGCILGLTGPPVGDIIDIPINSNAG-FIVQSGSYIASTPDV 110

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
            +        + + G  E F+  K TG G  F  A G +++K +  GE   +D   +VA+
Sbjct: 111 DIDTKWQGFTKGIFGS-ELFML-KATGDGTVFCNAYGGIIKKQILSGETFVLDNYHLVAL 168

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
           + + + ++   G ++  + GG+ LVT  +TGP  V++Q+     L
Sbjct: 169 SINSDYEVTKFGGLKNTILGGEGLVTK-ITGPATVYLQTKNLKEL 212


>gi|315923941|ref|ZP_07920169.1| protein of hypothetical function DUF124 [Pseudoramibacter
           alactolyticus ATCC 23263]
 gi|315622781|gb|EFV02734.1| protein of hypothetical function DUF124 [Pseudoramibacter
           alactolyticus ATCC 23263]
          Length = 224

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 107/224 (47%), Gaps = 8/224 (3%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I GG   V    L   E +    G M +M  ++EM+       ++ M + + G+++
Sbjct: 1   MQYKISGGNLPVAICKLAAGESIKCESGGMSWMDNTIEMQTKGGGFGKM-MGRMVSGESM 59

Query: 83  --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
                + R    S+G +  A+     I  I ++  G  L+ Q  AFL     V++     
Sbjct: 60  FFNHYIAR----SEGEIAFASSFPGEIKAIHISP-GNSLIAQKGAFLACQESVEMKIFFQ 114

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           +++ +   G EGF+ Q+  G+GL FI   GS +  +L  GE   VD   +VA+  SV + 
Sbjct: 115 RKLGSGFFGGEGFIMQRYIGEGLVFIEIDGSAITYDLAAGETKVVDTGYLVAMEESVKMD 174

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           I     ++  + GG+ L    +TGPG + +Q++P ++ +  I R
Sbjct: 175 IVRVKGVKNVLLGGEGLFNTTLTGPGKIVLQTMPINKTAAVIHR 218


>gi|352681258|ref|YP_004891782.1| hypothetical protein TTX_0024 [Thermoproteus tenax Kra 1]
 gi|350274057|emb|CCC80702.1| conserved hypothetical protein [Thermoproteus tenax Kra 1]
          Length = 225

 Score = 68.9 bits (167), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 53/209 (25%), Positives = 98/209 (46%), Gaps = 7/209 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           F+ILG + Q   + L P E++ A  G + + S +  +          G+ + L G +   
Sbjct: 4   FRILGSDIQHLLVTLGPGERIYAEGGHLIWKSSTARLNATVRGGLLAGLKRALTGASFFV 63

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
           + +  PG  D    +A  +  +I+ I+L   G  +L +  +FL + + V+   +L     
Sbjct: 64  LEVEGPGQVD----LAGFAPGKIVEIEL--HGSSILAEHRSFLAAEDIVEYDASLVGLGF 117

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
             +GG EG L  +  G G  F+ A G  ++  L  GE I  +   ++A    V   ++  
Sbjct: 118 GWLGG-EGLLMARFRGNGRVFLHAVGDALELRLGSGEAIDAEAGHVLAFEEGVRATVRRV 176

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
           G ++  +FG + L  A + GPG V++++L
Sbjct: 177 GGLKTMLFGEEGLWFAHLEGPGRVWLRTL 205


>gi|429219791|ref|YP_007181435.1| hypothetical protein Deipe_2183 [Deinococcus peraridilitoris DSM
           19664]
 gi|429130654|gb|AFZ67669.1| hypothetical protein Deipe_2183 [Deinococcus peraridilitoris DSM
           19664]
          Length = 228

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 46/136 (33%), Positives = 70/136 (51%), Gaps = 3/136 (2%)

Query: 100 AAP-SLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKL 158
           AAP +L  I   DL+   G LL    AFL +  +V +S T+ +R    + G  G   Q L
Sbjct: 87  AAPGTLGHIRAWDLSQ--GPLLTTRGAFLAAWGEVDISVTVARRAGAALFGGAGLFLQHL 144

Query: 159 TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLV 218
           +GQG   I   G +++  L  GE   V    + A ++SV+  I+  G +R+ +FGG+ L 
Sbjct: 145 SGQGTVLIQGSGDLLEHKLHGGESQLVSTGHLAAFSASVDYGIEAVGGVRKLLFGGEGLF 204

Query: 219 TAVVTGPGIVFIQSLP 234
              + GPG  ++QSL 
Sbjct: 205 MTRLCGPGTAYLQSLK 220


>gi|453328726|dbj|GAC89022.1| hypothetical protein NBRC3255_2683 [Gluconobacter thailandicus NBRC
           3255]
          Length = 221

 Score = 68.6 bits (166), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 93/217 (42%), Gaps = 13/217 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV------VL 87
           V +I+L+P+E ++A  G   + S  + M+ +     + G     FG    ++      + 
Sbjct: 3   VLRILLEPRESILAETGEFAWKSSHVVMQTVTGGAGQSG----FFGVVTRALAGGGLFMT 58

Query: 88  RNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           R   PS  G V   A     I    L   G   L     F+    DV++     + +   
Sbjct: 59  RFTAPSQPGLVAFTAKIPGHITEHTLQ--GQAYLVHRHGFIAGTEDVRMELAFQRNLGAG 116

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
           + G +GF  Q+L+G G  +   GG V+  +L  GE I V    +     SV   I     
Sbjct: 117 VFGGDGFRLQRLSGSGRFWCSLGGEVITHDLAPGEQIDVHPGHVGMFEESVQFDITMIPG 176

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           +R  +FGGD    A +TGPG +++Q++    L+  +A
Sbjct: 177 LRNKLFGGDGFFMARLTGPGRIWLQTMTPAGLAHSLA 213


>gi|154249582|ref|YP_001410407.1| hypothetical protein Fnod_0898 [Fervidobacterium nodosum Rt17-B1]
 gi|154153518|gb|ABS60750.1| protein of unknown function DUF124 [Fervidobacterium nodosum
           Rt17-B1]
          Length = 224

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 99/210 (47%), Gaps = 22/210 (10%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL-----FGKTITS 84
           G   + +I + P E     PG+M +M G +E++      +  G+W+ L      G++   
Sbjct: 9   GSYALLKIFIGPGESAKVEPGAMVYMKGPVEVQT-----STGGVWKALKRAILGGESFFM 63

Query: 85  VVLRNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
               + G  +G VG+A   P    ++P++  +F      Q  +FL S   V +  +    
Sbjct: 64  NTYVSHG--NGEVGVAPELPGDIDVVPLNQTLF-----VQSTSFLASDTSVDIDVSFGG- 115

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
            ++   G EG    KL G G   I + G +    L+ GE +T+D   +VA   SVN  ++
Sbjct: 116 FKSFFSG-EGIFLLKLQGYGDVAISSFGGIKLIELQPGEQMTIDTGHVVAFDGSVNYSVR 174

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             G ++  +FGG+ LV    TGPG V+IQ+
Sbjct: 175 TFGGLKSTIFGGEGLV-CTFTGPGRVYIQT 203


>gi|158422039|ref|YP_001523331.1| hypothetical protein AZC_0415 [Azorhizobium caulinodans ORS 571]
 gi|158328928|dbj|BAF86413.1| protein of unknown function [Azorhizobium caulinodans ORS 571]
          Length = 232

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 63/123 (51%)

Query: 120 LCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEV 179
           +     FL    ++ ++    Q + + + G  GF+ Q+LTGQG AFI  GG +V   L+ 
Sbjct: 101 MIHRHGFLAGSAELTLAIGFQQNLGSGVFGGNGFVLQRLTGQGTAFIELGGEIVSYELQP 160

Query: 180 GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
           G+ + V    +     SV   I     IR  +FGGD L  A +TGPG V++Q+L   +L+
Sbjct: 161 GQQLLVHPGHVGMFEESVGFDITMMRGIRNVLFGGDGLFLAQLTGPGRVWLQTLTPSKLA 220

Query: 240 QRI 242
             +
Sbjct: 221 HAL 223


>gi|414343168|ref|YP_006984689.1| hypothetical protein B932_2196 [Gluconobacter oxydans H24]
 gi|411028503|gb|AFW01758.1| hypothetical protein B932_2196 [Gluconobacter oxydans H24]
          Length = 221

 Score = 68.2 bits (165), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 93/217 (42%), Gaps = 13/217 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV------VL 87
           V +I+L+P+E ++A  G   + S  + M+ +     + G     FG    ++      + 
Sbjct: 3   VLRILLEPRESILAETGEFAWKSSHVVMQTVTGGAGQSG----FFGVVTRALAGGGLFMT 58

Query: 88  RNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           R   PS  G V   A     I    L   G   L     F+    DV++     + +   
Sbjct: 59  RFTAPSQPGLVAFTAKIPGHITEHTLQ--GQAYLVHRHGFIAGTEDVRMELAFQRNLGAG 116

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
           + G +GF  Q+L+G G  +   GG V+  +L  GE I V    +     SV   I     
Sbjct: 117 VFGGDGFRLQRLSGSGRFWCSLGGEVITHDLAPGEQIDVHPGHVGMFEESVQFDITMILG 176

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
           +R  +FGGD    A +TGPG +++Q++    L+  +A
Sbjct: 177 LRNKLFGGDGFFMARLTGPGRIWLQTMTPAGLAHSLA 213


>gi|89894010|ref|YP_517497.1| hypothetical protein DSY1264 [Desulfitobacterium hafniense Y51]
 gi|89333458|dbj|BAE83053.1| hypothetical protein [Desulfitobacterium hafniense Y51]
          Length = 82

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 29/62 (46%), Positives = 43/62 (69%)

Query: 177 LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFH 236
           ++ GE I +D  C+VA+T +VN  I++ G I+ A+FGG+ L    +TGPG V++QSLPF 
Sbjct: 1   MQPGETIQLDTGCLVAMTKTVNHDIRFAGDIKNALFGGEGLALGTLTGPGTVWLQSLPFC 60

Query: 237 RL 238
            L
Sbjct: 61  SL 62


>gi|407464224|ref|YP_006775106.1| hypothetical protein NSED_01760 [Candidatus Nitrosopumilus sp. AR2]
 gi|407047412|gb|AFS82164.1| hypothetical protein NSED_01760 [Candidatus Nitrosopumilus sp. AR2]
          Length = 223

 Score = 67.0 bits (162), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 57/221 (25%), Positives = 99/221 (44%), Gaps = 27/221 (12%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I+     + +  L   EK+ A   +M +M G++          E  M +  F K++
Sbjct: 1   MEYEIVKNPMGLIEFSLNKNEKITAEAAAMVYMKGNI--------VTETRMRKGGFFKSL 52

Query: 83  TSVVLRNPG---------PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            +  L               D  +G+    L  I  ID+     E + Q  +F+ S  DV
Sbjct: 53  KAAALGGESFFVNEFIAEEDDCKLGLTGNMLGDIEVIDVDE---EFIVQSGSFVGSTTDV 109

Query: 134 KVSNTLDQRVRNVIGGIEG--FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
               TLD   +    GI G      K  G G  F+   G +++K L  GE + +D   +V
Sbjct: 110 ----TLDTAWQGFTKGIFGSNLFMLKTVGAGKMFVNGWGGILKKQLGSGEKMILDNYQLV 165

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           A++S+   ++  +G ++  +FGGD LV  +V GPG V++Q+
Sbjct: 166 ALSSTAGYRVTKHGSMKTTLFGGDALVIEIV-GPGTVYLQT 205


>gi|379010233|ref|YP_005268045.1| hypothetical protein Awo_c03550 [Acetobacterium woodii DSM 1030]
 gi|375301022|gb|AFA47156.1| hypothetical protein Awo_c03550 [Acetobacterium woodii DSM 1030]
          Length = 226

 Score = 66.6 bits (161), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 53/226 (23%), Positives = 102/226 (45%), Gaps = 6/226 (2%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++I G    V  + L+  E +    G M +MS  ++ME         G+ + L G ++
Sbjct: 1   MKYEIKGDNLPVVIVTLENGEAIFTESGGMSWMSEHIKMETNTKGGLIKGLRRSLSGDSL 60

Query: 83  --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
             T+           F   + P   + + +     G  ++CQ  +FL     + V     
Sbjct: 61  FMTTYTAERENSQIAFCS-SFPGEIKAITLQA---GQSIICQKRSFLAGAATLDVDMYFK 116

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
           + +   I G EGF+ Q+++G G  F+   G++++ +L  G+V+ VD   I     SV+  
Sbjct: 117 KNIGVGIFGGEGFILQRISGPGTVFVEMDGALMEYDLAPGQVMKVDQGHIGMFEESVSFD 176

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           I+    ++     G+ L  A + GPG  ++Q++P   L Q I+ ++
Sbjct: 177 IQAVKGMKNIFLSGEGLFLATLKGPGKAWLQTMPLSNLVQLISASL 222


>gi|340344480|ref|ZP_08667612.1| DUF124 domain containing protein [Candidatus Nitrosoarchaeum
           koreensis MY1]
 gi|339519621|gb|EGP93344.1| DUF124 domain containing protein [Candidatus Nitrosoarchaeum
           koreensis MY1]
          Length = 230

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 50/167 (29%), Positives = 85/167 (50%), Gaps = 5/167 (2%)

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
           +G++ P +  I+ I +    G  + Q   ++ S + V + +T  Q     I G E F+  
Sbjct: 59  LGLSGPPVGDIIEIPITAEQG-FIIQSGTYVASTSSVDL-DTKWQGFSKGIFGTELFML- 115

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
           K TG+G  F+ A G +++K LE GE +T+D   +VA++++ +  +   G I+  +F G+ 
Sbjct: 116 KATGKGSVFVNAYGGIIKKELESGETMTLDNYHLVALSNNSHYVVTKFGGIKSTLFSGEG 175

Query: 217 LVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAI 263
           LVT  +TGP IV+ Q+     L   I+   T  N   N     Q A+
Sbjct: 176 LVTE-ITGPAIVYFQTKNLKELMDLISGEHTKDN-SSNTHTMAQAAM 220


>gi|255513447|gb|EET89713.1| protein of unknown function DUF124 [Candidatus Micrarchaeum
           acidiphilum ARMAN-2]
          Length = 230

 Score = 65.5 bits (158), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 58/222 (26%), Positives = 99/222 (44%), Gaps = 11/222 (4%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           ++I+GG  Q  +I L+P EK+ +  G +   S +++M     P    G+   +  K   +
Sbjct: 3   YEIIGGSMQALRITLEPGEKIYSDSGKLISKSKTIKM----TPRLAGGLVGAIERKMTGA 58

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLD 140
             +     +DG  G A  S++ ++P  + M     G E + +  AF+ +   VK S    
Sbjct: 59  TAMVTEFEADGAEGSA--SMSGVIPGKIYMINLNEGEEFVAEHYAFVAAQESVKFSIETV 116

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
                  GG  G + QK  G G  FI   G +++ +L+    + +D   I    SS+  +
Sbjct: 117 GIGAAFFGGA-GLVLQKFVGPGAVFIHVTGDIIEYDLDGNNPLEIDPGHIAGFDSSLKYK 175

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           I +   IR A+FGG  L  A   G G V   S+   +L+  +
Sbjct: 176 ITFVDNIRTAMFGGVGLFLAKFEGRGKVVAHSVSRFKLASEL 217


>gi|124027471|ref|YP_001012791.1| hypothetical protein Hbut_0586 [Hyperthermus butylicus DSM 5456]
 gi|123978165|gb|ABM80446.1| universally conserved protein [Hyperthermus butylicus DSM 5456]
          Length = 220

 Score = 65.1 bits (157), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 102/220 (46%), Gaps = 29/220 (13%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V ++ L+P E+V + PG++  + G +E+                    ITS +LR    S
Sbjct: 13  VLKVELEPGEEVWSEPGALMLLRGDVEVRTQ--------------SGGITSGLLRKLAGS 58

Query: 94  DGFV--GIAAPSLARI-----LP--IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
           + F      A S A +     LP  ID     GE + Q  ++L    D+KV+ +  + +R
Sbjct: 59  ESFFLNRYRARSRAEVWFAPSLPGDIDTIEVNGEWVIQDASYLAHYGDLKVTASF-RGLR 117

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
            +I   E F   K+ G G+ ++ + G + +  +  GE + +D    VA+ +     ++  
Sbjct: 118 GLIAEGELFW-LKVEGHGIVWVNSYGGIERVEVGPGERLVIDNYHFVAMPADTRYTVRKF 176

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSL---PFHRLSQR 241
           G ++  +FGG+  V   VTGP  V++Q+    PF RL  R
Sbjct: 177 GGLKTFLFGGEGFVIE-VTGPATVYVQTRILPPFARLLAR 215


>gi|14325670|dbj|BAB60573.1| hypothetical protein [Thermoplasma volcanium GSS1]
          Length = 222

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 105/229 (45%), Gaps = 15/229 (6%)

Query: 21  AVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK 80
           +++ ++I G E Q  +  L P+E V   PG + +   S+ +               L G 
Sbjct: 3   SMVDYEIAGQEVQYLKAKLSPEEYVYIEPGHLIYKDASVRL-GASAGGLRGAFSHLLSGS 61

Query: 81  TITSVVLRNPGP--SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
            +  + +  PG   S GF+    P   +++ IDL   G  ++ + +AFL   + V  S  
Sbjct: 62  AVFLLKVEGPGEIVSAGFL----P--GKVVRIDLN--GNSIIAEFNAFLAMDSTVSYSTK 113

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
                + + GG EG   ++ +G G  F+   G  + K+L+ GE I  ++S ++A    + 
Sbjct: 114 FAGITQAIFGG-EGVFLERFSGNGSVFLHGHGLPIVKDLKPGEEIQAEMSHLLAFEDGME 172

Query: 199 VQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             I+  G ++  +     G+ L  A + GPG V++ ++   +L+ ++ R
Sbjct: 173 YTIERIGGLKTMLLAGAEGEGLFFARIRGPGRVWLHTISLFQLAAKLMR 221


>gi|242398201|ref|YP_002993625.1| hypothetical protein TSIB_0209 [Thermococcus sibiricus MM 739]
 gi|242264594|gb|ACS89276.1| hypothetical protein TSIB_0209 [Thermococcus sibiricus MM 739]
          Length = 210

 Score = 64.7 bits (156), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 53/199 (26%), Positives = 95/199 (47%), Gaps = 8/199 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           + ++ L+  E + A PG+M  MS ++++E       +      L G++I     R     
Sbjct: 1   MVEVELEEGEAIKAEPGAMVHMSPTIKIETKTGGVFKALKRSMLGGESIFINTFRAE-EG 59

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
            G +G+A   +  I  +++    G L  Q  AFL S   +++ NT     +      EG 
Sbjct: 60  KGNIGLAPAYMGDIEALEIR---GTLYAQSGAFLASSETIEI-NTKFGGAKTFFS-REGL 114

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
              KL+G+G  F+ + G + +K L   E   +D   +VA T  ++ ++K  G ++  +FG
Sbjct: 115 FLLKLSGEGTVFLSSFGGIYKKELR-NERFIIDTGHLVAFTEGLDFRVKRVGGLKSTIFG 173

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV A   G G ++IQ+
Sbjct: 174 GEGLV-AEFYGSGTLYIQT 191


>gi|403743391|ref|ZP_10953005.1| hypothetical protein URH17368_0291 [Alicyclobacillus hesperidum
           URH17-3-68]
 gi|403122914|gb|EJY57106.1| hypothetical protein URH17368_0291 [Alicyclobacillus hesperidum
           URH17-3-68]
          Length = 132

 Score = 64.3 bits (155), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 37/120 (30%), Positives = 61/120 (50%), Gaps = 1/120 (0%)

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           ++ Q  AFLC+   +         +  ++GG  G +   L G G  F+   G VV+++L 
Sbjct: 1   MIAQRHAFLCAEEHIDFGIEATLNIGRLLGG-NGLVFTYLDGDGTGFLSIDGEVVERDLA 59

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
            GE + V    I A TSS++ +++    +   +FGGD +    +TGPG V++ SL  H L
Sbjct: 60  RGESLLVHPGHIAAFTSSIDYRVQVMQGVSNMLFGGDGVFLVQLTGPGRVWMHSLTVHNL 119


>gi|18976524|ref|NP_577881.1| hypothetical protein PF0152 [Pyrococcus furiosus DSM 3638]
 gi|397652135|ref|YP_006492716.1| hypothetical protein PFC_07440 [Pyrococcus furiosus COM1]
 gi|18892075|gb|AAL80276.1| hypothetical protein PF0152 [Pyrococcus furiosus DSM 3638]
 gi|393189726|gb|AFN04424.1| hypothetical protein PFC_07440 [Pyrococcus furiosus COM1]
          Length = 219

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 102/205 (49%), Gaps = 20/205 (9%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVL 87
            + ++ L   E + A  G+M +M  ++E+    + + + G+        L G++    + 
Sbjct: 12  SLLEVELDAGESIQAEAGAMVYMDPTVEI----VTKAKGGLLGALKRSILGGESFFVNIF 67

Query: 88  RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
           + PG     VG+A      I+ +++    G+L  Q  AF+ +  D+ +   L   ++ VI
Sbjct: 68  KGPGK----VGLAPGYPGDIIGLEVN---GKLYAQSGAFIAASEDIDIDVKLGG-IKTVI 119

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
           G  EG    +L G+G+AF+ + G + + +L  GE + VD S +VA T  ++  IK  G +
Sbjct: 120 G-REGLFLLELRGRGIAFLSSYGGIQKISLR-GESLIVDTSHLVAFTEGIDFTIKTIGGL 177

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  +FGG+  V     G G V+IQ+
Sbjct: 178 KSTLFGGEGFVFE-FRGTGDVYIQT 201


>gi|374985687|ref|YP_004961182.1| hypothetical protein SBI_02930 [Streptomyces bingchenggensis BCW-1]
 gi|297156339|gb|ADI06051.1| hypothetical protein SBI_02930 [Streptomyces bingchenggensis BCW-1]
          Length = 276

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 75/162 (46%), Gaps = 14/162 (8%)

Query: 93  SDGFVGIAA--PSLARILPIDLAMFGGEL-LCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
            +G VG A   P   R L +D    GG     + DAF+ + + V+      Q  R    G
Sbjct: 102 GEGTVGFAGVLPGEMRALELD----GGRAWFAEKDAFVAAESTVEFGIAF-QGGRTGRSG 156

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
            EGF+ +K TG+G   I   G+ +  N  + G  I VD  CIVA    +   ++  G + 
Sbjct: 157 GEGFILEKFTGRGTVIICGAGNFIDLNPADFGGRIEVDTGCIVAFEEGIQYGVQRIGGLN 216

Query: 209 R-----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           R     AVFGG+ L  A + G G V +QSL    L+  + +A
Sbjct: 217 RQGLMNAVFGGEGLSLATLEGNGRVILQSLTMEGLANALKKA 258


>gi|13542236|ref|NP_111924.1| hypothetical protein TVN1405 [Thermoplasma volcanium GSS1]
          Length = 219

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 54/228 (23%), Positives = 104/228 (45%), Gaps = 15/228 (6%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           ++ ++I G E Q  +  L P+E V   PG + +   S+ +               L G  
Sbjct: 1   MVDYEIAGQEVQYLKAKLSPEEYVYIEPGHLIYKDASVRL-GASAGGLRGAFSHLLSGSA 59

Query: 82  ITSVVLRNPGP--SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
           +  + +  PG   S GF+    P   +++ IDL   G  ++ + +AFL   + V  S   
Sbjct: 60  VFLLKVEGPGEIVSAGFL----P--GKVVRIDLN--GNSIIAEFNAFLAMDSTVSYSTKF 111

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
               + + GG EG   ++ +G G  F+   G  + K+L+ GE I  ++S ++A    +  
Sbjct: 112 AGITQAIFGG-EGVFLERFSGNGSVFLHGHGLPIVKDLKPGEEIQAEMSHLLAFEDGMEY 170

Query: 200 QIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
            I+  G ++  +     G+ L  A + GPG V++ ++   +L+ ++ R
Sbjct: 171 TIERIGGLKTMLLAGAEGEGLFFARIRGPGRVWLHTISLFQLAAKLMR 218


>gi|88602530|ref|YP_502708.1| hypothetical protein Mhun_1244 [Methanospirillum hungatei JF-1]
 gi|88187992|gb|ABD40989.1| protein of unknown function DUF124 [Methanospirillum hungatei JF-1]
          Length = 231

 Score = 63.9 bits (154), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 57/210 (27%), Positives = 96/210 (45%), Gaps = 5/210 (2%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-MWQWLFGKTITSVVLRNPGP 92
           + ++ L P E V A  GSM +M  ++E+      +  +G +   L G     V       
Sbjct: 13  MLEVTLAPGESVTAEAGSMSWMQPTIEVHTRMRKKGLLGSIGMALIGGQSFFVNDFTATN 72

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
           + G  G +A  L  +  + +A  G   + Q  A+L S  DV +        + + G  +G
Sbjct: 73  ASGKAGFSAAPLGDVDTMQVAP-GKGYIIQKAAYLASEKDVNLDVEWQGFTKGLFG--QG 129

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
               K++GQGL FI   G++ +  L  GE + VD   +VA + +   Q+   G  +  + 
Sbjct: 130 LFMIKVSGQGLLFINTFGAIDKHVLAPGERLIVDNFHLVAFSDTCQYQVTKFGGWKETLL 189

Query: 213 GGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            G+ LVT + TGPG V+IQ+     ++  I
Sbjct: 190 SGEFLVTDI-TGPGEVYIQTKNIREMADWI 218


>gi|345010079|ref|YP_004812433.1| hypothetical protein [Streptomyces violaceusniger Tu 4113]
 gi|344036428|gb|AEM82153.1| protein of unknown function DUF124 [Streptomyces violaceusniger Tu
           4113]
          Length = 279

 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 72/161 (44%), Gaps = 12/161 (7%)

Query: 93  SDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
            +G VG A   P   R L +D A        + DAF+ + + V+             GG 
Sbjct: 105 GEGTVGFAGVLPGEMRALELDGAR---AWFAEKDAFVAAESTVEFGIAFQGGKTGRSGG- 160

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
           EGF+ +K TG G   I   G+ +  N  + G  I VD  CIVA    +   ++  G + R
Sbjct: 161 EGFILEKFTGHGTVIICGAGNFIDLNPADFGGRIEVDTGCIVAFEEGIRYGVERIGGLNR 220

Query: 210 -----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
                AVFGG+ L  A + G G V +QSL    L+  + +A
Sbjct: 221 QGLMNAVFGGEGLSLATLEGNGRVILQSLTIEGLANALKKA 261


>gi|158319748|ref|YP_001512255.1| hypothetical protein Clos_0700 [Alkaliphilus oremlandii OhILAs]
 gi|158139947|gb|ABW18259.1| protein of unknown function DUF124 [Alkaliphilus oremlandii OhILAs]
          Length = 226

 Score = 62.4 bits (150), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 61/215 (28%), Positives = 106/215 (49%), Gaps = 12/215 (5%)

Query: 23  IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-K 80
           + ++IL GEA  + +I L+  +++ A   +M  MS ++++E         G+ + L G K
Sbjct: 1   MKYEILYGEAFPIVKINLQKGDRIKAESDAMVSMSPTIDLEGKLEGGLLKGLGRMLAGEK 60

Query: 81  TITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTL 139
                +  N GP +  +  + P       +D+ + G   LC Q D FL + + + V   +
Sbjct: 61  FFFQTLTANRGPGEVLLASSVPGGT----VDVELDGSYGLCVQKDGFLAASDSISVDTQM 116

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV-- 197
               R +  G EGF   K+ G+G  F+ + G++   NLE GE + +D S +VA    +  
Sbjct: 117 QNLTRGLFSG-EGFFILKIGGRGTVFLNSYGAIHPINLEAGEEVVIDNSHLVAWPDYMQY 175

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           N++   +G I  +V  G++LV     GPG V IQ+
Sbjct: 176 NIEKASSGWI-SSVTSGESLVCR-FRGPGTVLIQT 208


>gi|428174781|gb|EKX43675.1| hypothetical protein GUITHDRAFT_110474 [Guillardia theta CCMP2712]
          Length = 199

 Score = 62.0 bits (149), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 46/79 (58%)

Query: 170 GSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVF 229
           G ++ KNL  GE   +D  C++AV+S+V+V I+  G      FGG  L     TGPG++ 
Sbjct: 78  GQIMTKNLAPGEKYILDTDCMLAVSSTVDVSIRRAGGCAMMCFGGKGLFNTEYTGPGLII 137

Query: 230 IQSLPFHRLSQRIARAVTS 248
            Q+LP  +++ R+A+   S
Sbjct: 138 FQTLPAAKIAARLAQVTAS 156


>gi|57640117|ref|YP_182595.1| hypothetical protein TK0182 [Thermococcus kodakarensis KOD1]
 gi|57158441|dbj|BAD84371.1| hypothetical protein, conserved, DUF124 family [Thermococcus
           kodakarensis KOD1]
          Length = 222

 Score = 61.6 bits (148), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 96/206 (46%), Gaps = 19/206 (9%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVL 87
            + +I L+  E V A  G+M  MS ++++E     + + G++       L G++      
Sbjct: 12  SLLEIELERGEAVQAEAGAMVHMSPTIKLET----KAKGGVFGALKRSMLGGESFFVNTF 67

Query: 88  R-NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           R   GP  G +G+A   +  I   +L    G L  Q  AFL S  D+ +           
Sbjct: 68  RAEDGP--GSIGLAPAYMGDIEAFELD---GTLYAQSGAFLASYGDIDIDTKWGGA--KT 120

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
             G EG    K++G+G+ F+ + G++ +K L   E   +D   +VA +  ++  +K  G 
Sbjct: 121 FFGREGLFLLKMSGRGIVFLSSFGAIYKKELH-NERFIIDTGHMVAFSEGLDFNVKRVGG 179

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
           ++  +F G+ LV A   G G ++IQ+
Sbjct: 180 LKSTLFSGEGLV-AEFYGTGTLYIQT 204


>gi|392529832|ref|ZP_10276969.1| hypothetical protein CmalA3_03933 [Carnobacterium maltaromaticum
           ATCC 35586]
 gi|414082972|ref|YP_006991678.1| hypothetical protein BN424_899 [Carnobacterium maltaromaticum
           LMA28]
 gi|412996554|emb|CCO10363.1| conserved hypothetical protein [Carnobacterium maltaromaticum
           LMA28]
          Length = 224

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 90/212 (42%), Gaps = 11/212 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL---RNP 90
           +C+  L+  E +V+  G   +  G +  E         G  + + G+  +   L   R  
Sbjct: 14  ICE--LEKGEAMVSENGGRSWALGEITTET-----TSGGGMKKMLGRAFSGESLFLSRYE 66

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
             S+  +  A+     I+  +L   G  ++ Q  AFL +   V++S    ++      G 
Sbjct: 67  AQSNATIAFASSFPGCIIAKELQA-GESVIAQKTAFLAATEGVELSTFFQKKGTAGFFGG 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
           EGF+ QKL G G  F    GS    +L  GE +  D   +  +  +  ++I     ++  
Sbjct: 126 EGFIMQKLQGPGTVFFEIDGSTKTYDLAAGERLVCDTGLVALMDETCKLEIVAVKGLKNK 185

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           + GG+     VVTGPG V +Q++     +Q I
Sbjct: 186 LLGGEGFFDTVVTGPGKVTVQTMTLGGFAQSI 217


>gi|429216548|ref|YP_007174538.1| hypothetical protein Calag_0310 [Caldisphaera lagunensis DSM 15908]
 gi|429133077|gb|AFZ70089.1| TIGR00266 family protein [Caldisphaera lagunensis DSM 15908]
          Length = 220

 Score = 61.6 bits (148), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 99/213 (46%), Gaps = 14/213 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + ++ +G E Q   + L+P E++      + +   S     + +    VG +     + I
Sbjct: 1   MKYETIGSEFQYLLVDLQPGEEIYVDGSHVIYKDKS-----VTLNAKMVGGFLSALKREI 55

Query: 83  TSV---VLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
           T     +L   GP  G + +++P   +++ IDL   G  +L +  +FL S N VK  + L
Sbjct: 56  TGTTFFLLEAIGP--GKIALSSPLAGKVVKIDLGNRG--ILVEHTSFLASENTVKYESKL 111

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
                 + GG EG    KL G+G  F+ A G V +  L   E + V+ + ++A    +N 
Sbjct: 112 GSLTAGLFGG-EGIFLAKLEGKGSVFLHAVGQVNELYLNNQE-LQVEAAHLLAFEEGLNY 169

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            I   G ++  +FG + L    ++G G V+++S
Sbjct: 170 GITRVGNLKTMLFGDEGLFFVNLSGTGKVWVKS 202


>gi|347523482|ref|YP_004781052.1| hypothetical protein Pyrfu_0931 [Pyrolobus fumarii 1A]
 gi|343460364|gb|AEM38800.1| protein of unknown function DUF124 [Pyrolobus fumarii 1A]
          Length = 236

 Score = 61.2 bits (147), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 58/214 (27%), Positives = 105/214 (49%), Gaps = 14/214 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN---P 90
           V ++ L+  E + A PG+M  + G + +E          + + LFG    S  L      
Sbjct: 25  VLKVRLEAGESIWAEPGAMMLLRGPVRVETRAYGGVLTALKRALFGG--ESFFLNRYVAE 82

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
           GP++ ++    P    I  ID+   GG  + Q  ++L    D+ V +   + +R +I   
Sbjct: 83  GPAEVWLVPGTP--GDIATIDM-QGGGSWIVQDTSYLAHSGDINV-DVAWRGLRGLIAEG 138

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
           E  +  KL G+GLA+I + G++ + ++  GE + +D    VA+++     ++  G ++  
Sbjct: 139 E-IVWLKLEGRGLAWISSYGAIDKIDVPPGEKVVIDNFHFVAMSADAKWSVRKFGDLKSF 197

Query: 211 VFGGDNLVTAVVTGPGIVFIQS--LP-FHRLSQR 241
           +FGG+  V   + GP  V++QS  LP F RL Q+
Sbjct: 198 IFGGEGFVIE-IEGPATVYVQSRMLPLFARLLQK 230


>gi|386843380|ref|YP_006248438.1| hypothetical protein SHJG_7298 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|374103681|gb|AEY92565.1| hypothetical protein SHJG_7298 [Streptomyces hygroscopicus subsp.
           jinggangensis 5008]
 gi|451796671|gb|AGF66720.1| hypothetical protein SHJGH_7058 [Streptomyces hygroscopicus subsp.
           jinggangensis TL01]
          Length = 260

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 48/161 (29%), Positives = 71/161 (44%), Gaps = 12/161 (7%)

Query: 93  SDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
            +G VG A   P   R L +D          + DAF+ + + V            + GG 
Sbjct: 86  GEGTVGFAGVLPGEMRALELDGTR---AWFAEKDAFVAAESTVDFGIAFAGGRTGMSGG- 141

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
           EGF+ +K TG G   I   G+ +  N  + G  I VD  C+VA    +   ++  G + R
Sbjct: 142 EGFVLEKFTGHGTVIIAGAGNFIDLNPADFGGRIEVDTGCVVAFEEGIEYGVQRVGGLNR 201

Query: 210 -----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
                AVFGG+ L  A + G G V +QSL    L+  + +A
Sbjct: 202 QGLMNAVFGGEGLSLATLEGNGRVILQSLTIESLANALKKA 242


>gi|331087255|ref|ZP_08336325.1| hypothetical protein HMPREF0987_02628 [Lachnospiraceae bacterium
           9_1_43BFAA]
 gi|330408941|gb|EGG88402.1| hypothetical protein HMPREF0987_02628 [Lachnospiraceae bacterium
           9_1_43BFAA]
          Length = 194

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 50/198 (25%), Positives = 92/198 (46%), Gaps = 12/198 (6%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT- 83
           +QILG    V    L   E ++   GSM +MS      NI +  +  G     FG+  + 
Sbjct: 3   YQILGDTLPVVICRLDVGESMITEKGSMAWMS-----PNIRMDTSTNGGLGKAFGRMFSG 57

Query: 84  SVVLRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTL 139
             + +N   + G  G    + A   P  +  F    G E++ Q   FL +   +++S   
Sbjct: 58  EAIFQNIYTAQG--GAGEIAFASSFPGSIRAFHIRPGEEIVVQKGGFLAAEAGIQLSVHF 115

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
            +++ + + G EGF+ Q+++G+G+ F    G V++  L+ G+ I +D   + A+T+S  +
Sbjct: 116 HKKLASGLFGGEGFILQRISGEGVMFAEFDGHVIEYELQPGQQIVIDTGHLAAMTASCQM 175

Query: 200 QIKYNGPIRRAVFGGDNL 217
            I+    ++   F G  L
Sbjct: 176 DIQTVPGLKNKFFRGRRL 193


>gi|218281715|ref|ZP_03488097.1| hypothetical protein EUBIFOR_00664 [Eubacterium biforme DSM 3989]
 gi|218217222|gb|EEC90760.1| hypothetical protein EUBIFOR_00664 [Eubacterium biforme DSM 3989]
          Length = 250

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 65/241 (26%), Positives = 109/241 (45%), Gaps = 19/241 (7%)

Query: 14  VYQSPQDAVIPFQILG-GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG 72
           + +   D  + ++I G  +  + +I L+  E +    GSM +MS ++ +E   +  N+ G
Sbjct: 10  IEERKGDHTMDYKIKGDSDCPIVEINLQNNETIKIERGSMAYMS-NVTIEG-KMNSNKKG 67

Query: 73  MWQWL--FGKTITS----VVLRNPGPS-DGFVGIAAPSLARILPIDLAMFGGELLCQPDA 125
           +   L   G+++TS     +    G S DG +GIA     +I+ +++       L    A
Sbjct: 68  LGGVLGAIGRSLTSGESMFITEATGTSHDGLLGIAPSIPGKIVCLEVNSQNHYYLNNG-A 126

Query: 126 FLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITV 185
           FL   N V          + + GG  GF      GQG   I A G +++ N++    IT+
Sbjct: 127 FLACDNSVSYEMKTQSVGKALFGGTGGFFVMHTFGQGDLLINAYGDIMEINVDDAHPITI 186

Query: 186 DVSCIVAVTSSVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           D   ++A   S++ +IK    I    FG   G+ LV  V  G G V+IQS   H L+  +
Sbjct: 187 DNEHVIAWDDSLDYEIK----IASGTFGFTTGEGLVN-VFHGNGKVYIQSRNLHSLADAL 241

Query: 243 A 243
            
Sbjct: 242 T 242


>gi|329940620|ref|ZP_08289901.1| hypothetical protein SGM_5393 [Streptomyces griseoaurantiacus M045]
 gi|329300681|gb|EGG44578.1| hypothetical protein SGM_5393 [Streptomyces griseoaurantiacus M045]
          Length = 272

 Score = 61.2 bits (147), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 74/161 (45%), Gaps = 12/161 (7%)

Query: 93  SDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
           ++G VG A   P   R L +D        L + DAF+ + + V       Q  R  + G 
Sbjct: 98  AEGTVGFAGVLPGEMRALELDGNR---AWLAEKDAFVAAESTVDFGIAF-QGGRTGMSGG 153

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
           EGF+ +K TG+G   I   G+ +  N  + G  I VD  C+VA    +   ++  G + R
Sbjct: 154 EGFVLEKFTGRGTVIIAGAGNFIDLNPADFGGRIEVDTGCVVAFEEGIQYGVQRIGGLDR 213

Query: 210 -----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
                AV GG+ L  A + G G V +QSL    L+  + +A
Sbjct: 214 QGLMNAVLGGEGLSLATLEGNGRVILQSLTIEGLANALKKA 254


>gi|330835083|ref|YP_004409811.1| hypothetical protein Mcup_1222 [Metallosphaera cuprina Ar-4]
 gi|329567222|gb|AEB95327.1| conserved hypothetical protein [Metallosphaera cuprina Ar-4]
          Length = 230

 Score = 60.8 bits (146), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 57/214 (26%), Positives = 101/214 (47%), Gaps = 7/214 (3%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           + ILG + Q  ++ L   E+  A  G     + ++ M+         G+ + L G T   
Sbjct: 4   YNILGDDLQYLRVNLAQGEQFYADAGHTVMKNVTVSMQTRMRGGLLGGLKRALTGGTF-- 61

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR 144
            V    GP +  +    P   +++PI L   G  +L +  +FL + + V+  +TL  R+ 
Sbjct: 62  FVTEFYGPGELILSGIFP--GKVVPISLE--GRPILAEAHSFLGAESTVQYDSTL-ARLT 116

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
             + G EG    K +G G  F+ A G +++K L  GE I V+ S ++A    +N  ++  
Sbjct: 117 AGLLGGEGLFLAKFSGVGNVFLHAYGGLIEKYLSPGETIQVEASHLMAFEQGMNYSVQLV 176

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
           G +R  +F  + L    +TGPG V++ +L   +L
Sbjct: 177 GGLRSILFAHEGLFFVTITGPGKVWLHTLTAEQL 210


>gi|15643135|ref|NP_228178.1| hypothetical protein TM0367 [Thermotoga maritima MSB8]
 gi|418046301|ref|ZP_12684395.1| protein of unknown function DUF124 [Thermotoga maritima MSB8]
 gi|4980870|gb|AAD35454.1|AE001716_17 conserved hypothetical protein [Thermotoga maritima MSB8]
 gi|351675854|gb|EHA59014.1| protein of unknown function DUF124 [Thermotoga maritima MSB8]
          Length = 225

 Score = 60.5 bits (145), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 61/228 (26%), Positives = 100/228 (43%), Gaps = 23/228 (10%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           +L G   + ++ L   E VV  PG+M +M G +E++         G+W     K +   +
Sbjct: 6   VLKGSYALLKVFLSTGESVVVEPGAMVYMKGPIEVKT----SATGGVW-----KALKRAI 56

Query: 87  LRNPGPSDGFV------GIAAPSLARILP--IDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           L   G  + F+      G +   LA  LP  ID+      L  Q  +FL     V++  +
Sbjct: 57  L---GGENFFMNTYISRGESEIGLAPQLPGDIDVIPLKDILYVQSTSFLACDPSVEMDVS 113

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
               +++   G EG    K  G G   + + G +    L+ GE  TVD   +VA   +V 
Sbjct: 114 FGG-LKSFFSG-EGIFLLKFIGHGDVAVSSFGGIKSVELQPGEEFTVDTGHVVAFDGTVK 171

Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  G ++  +FGG+ LV    TGPG V+IQ+  +    + I   V
Sbjct: 172 WNVRTFGGLKSTLFGGEGLV-CTFTGPGRVYIQTRNYPAFVEWIKSLV 218


>gi|255514015|gb|EET90279.1| protein of unknown function DUF124 [Candidatus Micrarchaeum
           acidiphilum ARMAN-2]
          Length = 257

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 50/229 (21%), Positives = 100/229 (43%), Gaps = 15/229 (6%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--- 79
           + + I+GG  Q+  I LK  +K+    G +   S   +M   +      G   +L G   
Sbjct: 1   MDYSIIGGSMQILNINLKTGDKIYVDGGKVASRSEDAQMNAKW-----AGGAGFLKGIEM 55

Query: 80  --KTITSVVLRNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
                T+ +L   G SD   V ++     ++  I+L   G  +  +  AFL + +  K+ 
Sbjct: 56  AFAGTTTFLLEVTGQSDTSTVSVSGDIPGKVKAIELKE-GESIYVEQMAFLATNDPSKID 114

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
             +   +R  + G  G   +K  G  + F+   G +++ +L+ G  + +D   +   + +
Sbjct: 115 --VKASLRGAMAGA-GLFLEKFNGPCMVFVHVAGDIIEYDLQDGYTMLIDPGHLATFSGN 171

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARA 245
           + +     G  +  +FG +NL+    TGPG + + S+   RL+  + +A
Sbjct: 172 MQIHFAPVGDFKAEIFGHENLIMGKFTGPGKIIMHSVSRQRLAASLGKA 220


>gi|240102519|ref|YP_002958828.1| hypothetical protein TGAM_0462 [Thermococcus gammatolerans EJ3]
 gi|239910073|gb|ACS32964.1| Conserved hypothetical protein [Thermococcus gammatolerans EJ3]
          Length = 222

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 53/205 (25%), Positives = 94/205 (45%), Gaps = 17/205 (8%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVL 87
            V ++ L   E + A  G+M +MS ++++E     + + G++       L G++      
Sbjct: 12  SVLEVELNTGESIQAESGAMVYMSPTIKLET----KAKGGIFGALKRSMLGGESFFINTF 67

Query: 88  RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
           R  G   G VG+A P    I   +L    G L  Q  AFL S  ++ +            
Sbjct: 68  RAEG-GPGTVGLAPPYPGDIEAFELD---GTLYAQSGAFLASSGEIDIDTKWGGA--KTF 121

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
            G EG    K++G+G  F+ + G++V+K L   E   +D   +VA +  ++  +K  G +
Sbjct: 122 FGREGLFLLKMSGRGTVFLSSFGAIVRKELH-NERFIIDTGHLVAFSEGLDFTVKRVGGL 180

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  +  G+ LV A   G G ++IQ+
Sbjct: 181 KSTLLSGEGLV-AEFYGTGTLYIQT 204


>gi|16081813|ref|NP_394204.1| hypothetical protein Ta0740 [Thermoplasma acidophilum DSM 1728]
 gi|10640022|emb|CAC11874.1| conserved hypothetical protein [Thermoplasma acidophilum]
          Length = 220

 Score = 60.1 bits (144), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 55/224 (24%), Positives = 107/224 (47%), Gaps = 11/224 (4%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           ++ ++I G + Q  + +L   E V   PG + +      ++         GM   + G +
Sbjct: 1   MVDYEIQGTDVQYLRAVLGSGESVYVEPGHLIYKDRQSRLD--VRAGGLKGMISHMLGGS 58

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
               +L+  GP +  V  A     +I  IDL   GG ++ + +AFLC  + ++ S     
Sbjct: 59  AV-FLLKVDGPGE--VASAGFLPGKIHRIDLR--GGGIIAEFNAFLCMDSGIQYSTKF-A 112

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
            +   + G EG   +  +G G  F+   G V++++L  G+ I+V++S ++A   +V   +
Sbjct: 113 GIWQGLLGGEGLFLEHFSGTGSIFLHGHGQVIERDLGAGDEISVELSHVLAFDDTVQYNV 172

Query: 202 KYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +  G +R  + G   G+ L  A + GPG V+I S+   +L+ ++
Sbjct: 173 ERIGGLRTMILGGLEGEGLFFAHMRGPGRVWIHSISLFQLAAKL 216


>gi|345562047|gb|EGX45119.1| hypothetical protein AOL_s00173g220 [Arthrobotrys oligospora ATCC
           24927]
          Length = 359

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 90/188 (47%), Gaps = 11/188 (5%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  M+ ++ ++       +V + + + G  +       PG     V +A  SL
Sbjct: 156 ITAKPGAMVAMAATVTLKGSV----KVSLKKLISGGELAHSTFTGPGE----VILAPASL 207

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
             I+PI L     +     DA+L +   V V +   Q + + +   EG    K+TGQG+ 
Sbjct: 208 GDIVPIRLDG-NQQWTMGKDAWLANTQGV-VRDYKSQGIGSAMFSGEGLFVYKITGQGVL 265

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           F+ + G+++QKNL   E   VD   +VA      ++   +G I  A+   + LV    TG
Sbjct: 266 FVTSLGAIIQKNLAPNEQYIVDNGHLVAWNCKYIIERIASGGIISALSSNEGLVCR-FTG 324

Query: 225 PGIVFIQS 232
           PG VFIQ+
Sbjct: 325 PGTVFIQT 332


>gi|406864015|gb|EKD17061.1| DUF124 domain-containing protein [Marssonina brunnea f. sp.
           'multigermtubi' MB_m1]
          Length = 378

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 57/190 (30%), Positives = 91/190 (47%), Gaps = 15/190 (7%)

Query: 45  VVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPS 103
           ++ARPG+M  M+ ++ ++ N+     +  M + + G  +T      PG     + +A PS
Sbjct: 185 LIARPGAMISMTPTITLKGNV-----KFSMKKLITGGDMTHSTFTGPGE----LLLAPPS 235

Query: 104 LARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
           L  I  I LA  G E      DAFL     V V     Q +   +   EG    K++G G
Sbjct: 236 LGDITNIHLA--GSEQWSVGRDAFLACTQGV-VKEYKRQGIGKAMFSGEGLFVYKISGNG 292

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
           L +I + G++++K+L  GE   VD   +VA  +   ++   +G I      G+ LV    
Sbjct: 293 LLWITSLGAIIRKDLADGEKYIVDNGHLVAWNTKYILERVASGGIISGFSSGEGLVCK-F 351

Query: 223 TGPGIVFIQS 232
           TGPG VF+Q+
Sbjct: 352 TGPGTVFMQT 361


>gi|170724824|ref|YP_001758850.1| hypothetical protein Swoo_0457 [Shewanella woodyi ATCC 51908]
 gi|169810171|gb|ACA84755.1| protein of unknown function DUF124 [Shewanella woodyi ATCC 51908]
          Length = 220

 Score = 59.7 bits (143), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 67/241 (27%), Positives = 110/241 (45%), Gaps = 34/241 (14%)

Query: 26  QILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +I+G +A     I L+P E V+A   +M  M   +++   +      G+ +  FG     
Sbjct: 4   KIMGSQAFSYIDIDLEPGETVIAESDAMSSMDADLDLTATFNGGFFKGLLKKFFGGETLF 63

Query: 85  VVLRNPGPSDGFVGIAAP---SLARILPIDL---AMFGGELLCQPDAFLCSVNDVKVSNT 138
           +   +   S       AP   +L + +P +L    + G E   QP A+L S   VK    
Sbjct: 64  ISRYSNNTS-------APKRVTLVQPVPGELRCHELNGDEFNLQPGAYLASTEGVK---- 112

Query: 139 LDQRVRNVIGGI--EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
           L  R    +  I  EG  +  +TG+G  +  A G++++K ++ GE I VD S +VA    
Sbjct: 113 LGLRWAGFVSFIAREGLFKLTVTGEGKVWYGAYGALLEKEID-GEYI-VDTSHLVAYEPG 170

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPK 256
           + ++++  G I  + FGG+ LVT  V G G + +QS           R+V+      NPK
Sbjct: 171 IKLKLQLAGGIFSSFFGGEGLVTR-VEGKGKIIVQS-----------RSVSGLAGWLNPK 218

Query: 257 F 257
           F
Sbjct: 219 F 219


>gi|296415383|ref|XP_002837369.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295633234|emb|CAZ81560.1| unnamed protein product [Tuber melanosporum]
          Length = 358

 Score = 59.3 bits (142), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 54/198 (27%), Positives = 91/198 (45%), Gaps = 14/198 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           +ML     V A+PG+M  M+ ++ ++       +  + + L G  +       PG     
Sbjct: 155 LMLAHGCPVTAKPGAMVAMTPTVTLKGSV----KFSLKKILAGGEMAQSTYTGPGE---- 206

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQ--PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           + +A  +L  I+PI L    G+       DAFL     V V +   Q +   +   EG  
Sbjct: 207 LLLAPAALGDIVPIRLD---GQQTWSVGKDAFLACTQGV-VKDYKSQGLGKAMFSGEGLF 262

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
             K++GQG+ F+ + G+++QKN+  GE   VD   +VA      ++   +G I  ++   
Sbjct: 263 VYKISGQGVLFVTSLGAIIQKNMTQGEQFIVDNDHLVAWNCKYTIERVASGGIISSMSAA 322

Query: 215 DNLVTAVVTGPGIVFIQS 232
           + LV     GPG VFIQ+
Sbjct: 323 EGLVCRFTVGPGTVFIQT 340


>gi|385803598|ref|YP_005839998.1| hypothetical protein Hqrw_2456 [Haloquadratum walsbyi C23]
 gi|339729090|emb|CCC40311.1| conserved hypothetical protein [Haloquadratum walsbyi C23]
          Length = 240

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 94/206 (45%), Gaps = 10/206 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL-RNPGP 92
           V ++ L+  E V +  G+M   S  + +E     E+E GM+  L    +    L RN   
Sbjct: 13  VVEVTLERGETVTSEAGAMVSHSEEIRLETGIGDEDE-GMFSSLKDSVLGDESLFRNRFT 71

Query: 93  SDGFVGIAAPSLARILPID---LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           S+G  G     LA   P D   L +   EL  Q  A++    DV + + L      + G 
Sbjct: 72  SEG--GQGTVELAPSTPGDVTALTLDHDELYLQGGAYIGCGTDVSIDSELGGLDTLLGGE 129

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
               +  K  G G+ FI + G + +K ++  E +TVD   IVA  + +N + +  G ++ 
Sbjct: 130 GISLI--KAHGSGVLFIGSFGGIDRKEIQRDETLTVDSGHIVAWNADMNYETRRVGGLKE 187

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPF 235
            V  G+ LV    TGPG+VF Q+  +
Sbjct: 188 TVLSGEGLVMD-FTGPGVVFTQTRNY 212


>gi|110668161|ref|YP_657972.1| hypothetical protein HQ2223A [Haloquadratum walsbyi DSM 16790]
 gi|109625908|emb|CAJ52349.1| conserved hypothetical protein [Haloquadratum walsbyi DSM 16790]
          Length = 240

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 94/206 (45%), Gaps = 10/206 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL-RNPGP 92
           V ++ L+  E V +  G+M   S  + +E     E+E GM+  L    +    L RN   
Sbjct: 13  VVEVTLERGETVTSEAGAMVSHSEEIRLETGIGDEDE-GMFSSLKDSVLGDESLFRNRFT 71

Query: 93  SDGFVGIAAPSLARILPID---LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           S+G  G     LA   P D   L +   EL  Q  A++    DV + + L      + G 
Sbjct: 72  SEG--GQGTVELAPSTPGDVTALTLDHDELYLQGGAYIGCGTDVSIDSELGGLDTLLGGE 129

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
               +  K  G G+ FI + G + +K ++  E +TVD   IVA  + +N + +  G ++ 
Sbjct: 130 GISLI--KAHGSGVLFIGSFGGIDRKEIQRDETLTVDSGHIVAWNADMNYETRRVGGLKE 187

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPF 235
            V  G+ LV    TGPG+VF Q+  +
Sbjct: 188 TVLSGEGLVMD-FTGPGVVFTQTRNY 212


>gi|448410693|ref|ZP_21575398.1| hypothetical protein C475_13742 [Halosimplex carlsbadense 2-9-1]
 gi|445671729|gb|ELZ24316.1| hypothetical protein C475_13742 [Halosimplex carlsbadense 2-9-1]
          Length = 240

 Score = 58.9 bits (141), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 56/221 (25%), Positives = 101/221 (45%), Gaps = 10/221 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL-RNPGP 92
           V  + L   E++ A  G+M   + ++ ME   I +++ G  + +    +    L RN   
Sbjct: 13  VASVELSSGEELTAEAGAMVSHTETIRME-TGIGDSDEGFLESVKDSVLGDESLFRNRFT 71

Query: 93  SDGFVGIAAPSLARILPIDLA---MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           ++G  GI    +A   P D+    + G E+  Q  A+  + + V++    D    + + G
Sbjct: 72  AEGGDGIV--QIAATTPGDMTARELDGDEVYVQSGAYEAAGSGVELDT--DVGGLDTLFG 127

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            EG    K  G G  F+ A G + ++ ++ GEV TVD   +VA  +S+    +  G ++ 
Sbjct: 128 GEGLFLLKAAGTGPLFLSAFGGIQEREVDAGEVFTVDSGHVVAWDASMEYNTERVGGMKE 187

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
            +F  + LV    TGPG V +Q+  +      +A  + S N
Sbjct: 188 TLFSDEGLVMR-FTGPGTVLLQTRNYGEFVSDLASRLPSGN 227


>gi|170288364|ref|YP_001738602.1| hypothetical protein TRQ2_0565 [Thermotoga sp. RQ2]
 gi|403252814|ref|ZP_10919119.1| hypothetical protein EMP_03520 [Thermotoga sp. EMP]
 gi|170175867|gb|ACB08919.1| protein of unknown function DUF124 [Thermotoga sp. RQ2]
 gi|402811576|gb|EJX26060.1| hypothetical protein EMP_03520 [Thermotoga sp. EMP]
          Length = 225

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 99/228 (43%), Gaps = 23/228 (10%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           +L G   + ++ L   E VV  PG+M +M G +E+          G+W     K +   +
Sbjct: 6   VLKGSYALLKVFLSIGESVVVEPGAMVYMKGPIEVNT----SATGGVW-----KALKRAI 56

Query: 87  LRNPGPSDGFV------GIAAPSLARILP--IDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           L   G  + F+      G +   LA  LP  ID+      L  Q  +FL     V++  +
Sbjct: 57  L---GGENFFMNTYISRGESEVGLAPQLPGDIDVIPLKDILYVQSTSFLACDPSVEMDVS 113

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
               +++   G EG    K  G G   + + G +    L+ GE  TVD   +VA   +V 
Sbjct: 114 FGG-LKSFFSG-EGIFLLKFVGHGDVAVSSFGGIKSIELQSGEEFTVDTGHVVAFDGTVK 171

Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  G ++  +FGG+ LV    TGPG +++Q+  +    + I   V
Sbjct: 172 WNVRTFGGLKSTLFGGEGLV-CTFTGPGRIYVQTRNYPAFVEWIKSLV 218


>gi|428183259|gb|EKX52117.1| hypothetical protein GUITHDRAFT_161312 [Guillardia theta CCMP2712]
          Length = 256

 Score = 58.5 bits (140), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 53/227 (23%), Positives = 97/227 (42%), Gaps = 13/227 (5%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG------MWQWLF 78
           +Q++G + Q    +L+  ++V+ +PG+M   +G++E      PE   G         W  
Sbjct: 35  WQVVGDDIQALLAVLQQGQEVMCQPGAMVSTTGTVE------PELTTGGCFNGCKRCWCA 88

Query: 79  GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSV-NDVKVSN 137
           G+       +N       V ++ P  ++I+P+ +      L     ++L SV  D   + 
Sbjct: 89  GELCCRTHYKNVNGRLEHVTVSPPYPSKIIPVSMNEHTSGLYITRQSWLASVGRDADFNV 148

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
           +             G     L G G +F+  GG+V++K L  GE + V  + ++A    V
Sbjct: 149 SCMPSFSQCCCLGLGCCIPSLVGSGYSFLAGGGTVMKKTLREGESVVVQTTALLAWDKKV 208

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
            ++    G       GG    TA  TGPG +++QS+ F + +    R
Sbjct: 209 KLEGACAGGPGVVCCGGMGFCTAKFTGPGTIYVQSMSFSKWAMSYMR 255


>gi|340923625|gb|EGS18528.1| hypothetical protein CTHT_0051310 [Chaetomium thermophilum var.
           thermophilum DSM 1495]
          Length = 475

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 51/188 (27%), Positives = 88/188 (46%), Gaps = 10/188 (5%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS S++++  Y    ++ + ++L G  + +     PG     V  A   L
Sbjct: 281 INAKPGAMLAMSPSVQLKGHY----KITVKKFLAGGDMGTSTFTGPGE----VLFAPHML 332

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
             I  + L           DA+L S   + V +   Q +   I   EG    K+TG GL 
Sbjct: 333 GDITSLRLNGGNEAWSVSSDAYLASTQYI-VKDYKRQGLSKAIFSGEGLWVYKITGTGLV 391

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           ++ + G++++K+L  GE   VD   +VA  +   ++   +G +   +  G+ LV     G
Sbjct: 392 WLTSFGAIIRKDLAEGEQYIVDNGHLVAWNTKYTIERVASGGLISGMAAGEGLVCK-FHG 450

Query: 225 PGIVFIQS 232
           PG VFIQ+
Sbjct: 451 PGTVFIQT 458


>gi|306820186|ref|ZP_07453830.1| conserved hypothetical protein [Eubacterium yurii subsp.
           margaretiae ATCC 43715]
 gi|304551787|gb|EFM39734.1| conserved hypothetical protein [Eubacterium yurii subsp.
           margaretiae ATCC 43715]
          Length = 226

 Score = 58.2 bits (139), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 62/218 (28%), Positives = 103/218 (47%), Gaps = 16/218 (7%)

Query: 36  QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDG 95
           +  L   E + A  G+M  M  +++++         G+ + L G+ I     +N   S G
Sbjct: 15  ECYLSSGESIKAESGAMVGMDNTLDIDGKLEKGLMGGLTRMLAGEKI---FFQNIKASRG 71

Query: 96  F--VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
              V +A  +L  +L   L     +L+ Q D +L S + V VS       + +  G EGF
Sbjct: 72  AGKVLLAPTTLGSVLDYQLDGMN-DLIVQKDGYLASEHSVDVSTKTQNLAQGLFSG-EGF 129

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY---NGPIRRA 210
              K++G+G  F+ + GS+ + +L+ G+   VD   +VA +SS + +I+    +G I  +
Sbjct: 130 FILKVSGRGTVFLNSYGSIHEIDLQAGQEYIVDNQHLVAWSSSTDYKIEKASSSGWI-SS 188

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
           +  G+ LV     GPG VFIQ+    R     AR V S
Sbjct: 189 LTSGEGLVCR-FRGPGKVFIQT----RNPSSFARWVNS 221


>gi|258514331|ref|YP_003190553.1| hypothetical protein Dtox_1044 [Desulfotomaculum acetoxidans DSM
           771]
 gi|257778036|gb|ACV61930.1| protein of unknown function DUF124 [Desulfotomaculum acetoxidans
           DSM 771]
          Length = 228

 Score = 57.8 bits (138), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 52/203 (25%), Positives = 93/203 (45%), Gaps = 11/203 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNP 90
           + ++ L+  E + A   +M  MS ++++E         G+ + + G++    T    R P
Sbjct: 13  LLEVELQHGESIKAESDAMVSMSDTLDVEGKMDGGIMGGLARMVAGESFFFQTLTAKRGP 72

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           G +     + APS    + ID+ + G   L+ Q D F     D++VS  +    + +  G
Sbjct: 73  GKA-----LLAPSAPGDI-IDVELDGSYSLVVQKDGFFAGSRDLQVSAKMQNLAQGIFSG 126

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            EGF   K++GQG+ F+ + G +   NLE GE + VD   +VA    ++  ++       
Sbjct: 127 -EGFFVLKVSGQGIVFLSSYGGIHVINLEKGEEVVVDNGHLVAWPEYMSYTVEKASKGWI 185

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
           + F    +      GPG V IQ+
Sbjct: 186 STFTSGEVAVCRFRGPGPVIIQT 208


>gi|326789156|ref|YP_004306977.1| hypothetical protein Clole_0017 [Clostridium lentocellum DSM 5427]
 gi|326539920|gb|ADZ81779.1| protein of unknown function DUF124 [Clostridium lentocellum DSM
           5427]
          Length = 230

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 48/141 (34%), Positives = 68/141 (48%), Gaps = 5/141 (3%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           DG V    P    I+ ++L    G L  Q D FL S   V+VS      VR +  G EGF
Sbjct: 72  DGEVLFGHPLPGDIIDVELDGSYG-LRVQKDGFLASTEGVEVSTKAQNLVRGMFSG-EGF 129

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV--NVQIKYNGPIRRAV 211
               ++G+G  F+ + GS+   NLE GE   VD   +VA    +  N++   N    R++
Sbjct: 130 FILNVSGRGTVFLSSYGSIHAINLEPGEEYIVDNGHLVAWADYMQYNIEKASNNGWLRSI 189

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+ LV     GPG+V IQ+
Sbjct: 190 ASGECLVCR-FRGPGVVLIQT 209


>gi|449546595|gb|EMD37564.1| hypothetical protein CERSUDRAFT_94566 [Ceriporiopsis subvermispora
           B]
          Length = 315

 Score = 57.4 bits (137), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 91/220 (41%), Gaps = 28/220 (12%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V  I L+P  +  ARPG+M  M  S++++          M +   G  +       PG  
Sbjct: 110 VLYIRLQPGYEFKARPGAMVAMDPSVQIQGNLT----FSMKKLFTGGQLAQSRFTGPGE- 164

Query: 94  DGFVGIAAPSLARILPIDLA-----MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
              V +A      I+PI L       FG        A+L S  D+    TLD + +N +G
Sbjct: 165 ---VLLAPQVWGDIIPIPLDGRTRWHFGKH------AWLASTRDI----TLDTKAQNSVG 211

Query: 149 GI----EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
                  G      +G G+ F+ A GS++ + L  GE   V+   +VA T+  N++    
Sbjct: 212 KALFSGRGLFVMTASGAGMIFVQAMGSIITRTLADGEQWIVENDHLVAWTAKYNIEFIQT 271

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           G    +V   D       TGPG V+IQ+      S+ +A 
Sbjct: 272 GSFLSSV-KTDQGFVCRFTGPGTVYIQTRNPEAFSKWVAE 310


>gi|428183258|gb|EKX52116.1| hypothetical protein GUITHDRAFT_102018 [Guillardia theta CCMP2712]
          Length = 247

 Score = 57.4 bits (137), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 49/221 (22%), Positives = 92/221 (41%), Gaps = 33/221 (14%)

Query: 24  PFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTIT 83
           P+QI+G ++Q+    L+P ++V+ +PGSM      ++        N  G   ++  + I 
Sbjct: 41  PWQIVGSDSQLIVFKLEPGQEVMMQPGSMVHADEGVKPSTSL--RNPFG--AFMASEDIF 96

Query: 84  SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
            +  +N   +  F+   +   ++++PIDL+++ G +  Q  +FL +        T   R 
Sbjct: 97  RLKWKNESSTVKFLSATSMFPSKVIPIDLSVYSG-ITIQKGSFLATTGSELRFETFFPRS 155

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
                G    L+ KL G G+ F+   G+++ K L  GE   +D                 
Sbjct: 156 LTECLGQGNLLQTKLFGNGICFLCGSGTIITKTLAAGEKYILDT---------------- 199

Query: 204 NGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
                        L     TGPG++  Q+LP  ++    A+
Sbjct: 200 ------------GLFNTEYTGPGLIVFQTLPACKIGALFAQ 228


>gi|448605699|ref|ZP_21658325.1| hypothetical protein C441_10061 [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|448622567|ref|ZP_21669261.1| hypothetical protein C438_09512 [Haloferax denitrificans ATCC
           35960]
 gi|445741725|gb|ELZ93224.1| hypothetical protein C441_10061 [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|445754649|gb|EMA06054.1| hypothetical protein C438_09512 [Haloferax denitrificans ATCC
           35960]
          Length = 225

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 64/133 (48%), Gaps = 6/133 (4%)

Query: 103 SLARILPIDLAMFG---GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
           SLA  LP D+   G     L  Q  ++L S   + +  +          G EG    KLT
Sbjct: 76  SLAPPLPGDIVHHGLADETLYVQSGSYLASDPALDLDTSFGGA--KTFFGSEGLFLLKLT 133

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
           G G +F+ + G++ +  L+ GE  TVD   IVA   + +  ++  G ++  +F G+ LV 
Sbjct: 134 GTGDSFLSSYGAIHEVELDDGERYTVDTGHIVAFDETTDFSVERVGGLKSTLFSGEGLV- 192

Query: 220 AVVTGPGIVFIQS 232
              TGPG V+ QS
Sbjct: 193 CTFTGPGTVWTQS 205


>gi|153004312|ref|YP_001378637.1| hypothetical protein Anae109_1446 [Anaeromyxobacter sp. Fw109-5]
 gi|152027885|gb|ABS25653.1| protein of unknown function DUF124 [Anaeromyxobacter sp. Fw109-5]
          Length = 246

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 60/233 (25%), Positives = 111/233 (47%), Gaps = 31/233 (13%)

Query: 21  AVIPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMEN-----IYIPENEVGMW 74
           + + +Q++GG A    ++ L P + + A  G+M  MSG++++++     +      V   
Sbjct: 9   SALRYQLVGGSAFAAARVELAPDQAIRAEAGAMVSMSGNVDLQSQLQGGLLGALKRVVSR 68

Query: 75  QWLFGKTITS------VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLC 128
           + LF  T T+      V+L  P P D            ++ ++LA  G  LL Q  ++L 
Sbjct: 69  ESLFVSTFTAQGGAGEVILAPPVPGD------------LVGVELA--GRALLVQSSSWLA 114

Query: 129 SVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVS 188
           S  +V++ +T     R +  G EG    + +GQG   + + G++V++ L  G    VD  
Sbjct: 115 SEPEVQL-DTEFAGFRGLFAG-EGLFFIRASGQGTVLLSSFGAIVRRPLPAGARYVVDTG 172

Query: 189 CIVAVTSSVNVQIKYNG--PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
            +VA  + +  Q++        R++  G+ LV A  TGPG V++Q+     L+
Sbjct: 173 HVVAFDARMPFQVRKASRRGWMRSLLSGEALV-AEFTGPGEVYLQTRNLRSLA 224


>gi|148269688|ref|YP_001244148.1| hypothetical protein Tpet_0551 [Thermotoga petrophila RKU-1]
 gi|147735232|gb|ABQ46572.1| protein of unknown function DUF124 [Thermotoga petrophila RKU-1]
          Length = 225

 Score = 57.0 bits (136), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 99/228 (43%), Gaps = 23/228 (10%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           +L G   + ++ L   E VV  PG+M +M G +E+          G+W     K +   +
Sbjct: 6   VLKGSYALLKVFLSIGESVVVEPGAMVYMKGPIEVNT----SATGGVW-----KALKRAI 56

Query: 87  LRNPGPSDGFV------GIAAPSLARILP--IDLAMFGGELLCQPDAFLCSVNDVKVSNT 138
           L   G  + F+      G +   LA  LP  ID+      L  Q  +FL     V++  +
Sbjct: 57  L---GGENFFMNTYISRGESEVGLAPQLPGDIDVIPLKDVLYVQSTSFLACDPSVEMDVS 113

Query: 139 LDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
               +++   G EG    K  G G   + + G +    L+ GE  TVD   +VA   +V 
Sbjct: 114 FGG-LKSFFSG-EGIFLLKFFGYGDVAVSSFGGIKSIELKPGEEFTVDTGHVVAFDGTVK 171

Query: 199 VQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             ++  G ++  +FGG+ LV    TGPG +++Q+  +    + I   V
Sbjct: 172 WNVRTFGGLKSTLFGGEGLV-CTFTGPGRIYVQTRNYPAFVEWIKSLV 218


>gi|389748369|gb|EIM89546.1| DUF124-domain-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 258

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 52/197 (26%), Positives = 92/197 (46%), Gaps = 8/197 (4%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           + L+P+ ++  +PG+M  M  +M+++            ++  G+  +  +   PG     
Sbjct: 49  VTLQPEAQIKVKPGAMLAMDPTMQIQGKVPHMQHHSFSKFFSGEEASESIFTGPGE---- 104

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
           V IA      I+PI L       + +  AFL S + + V     Q     +   EGF   
Sbjct: 105 VLIAPEVWGDIVPITLDGQAPWSVSK-YAFLASTSGL-VRGVRSQGFAKGLFSGEGFFVS 162

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ-IKYNGPIRRAVFGGD 215
           +++GQG  ++ A G++VQ++L  GE   VD   +VA ++   V+ IK  G +   +   +
Sbjct: 163 EISGQGTLWVQAIGAIVQRHLRPGEEWIVDNGHLVAWSAQYKVETIKAKGGLLSKLHTDE 222

Query: 216 NLVTAVVTGPGIVFIQS 232
             V    TGPG V+IQ+
Sbjct: 223 GKV-CRFTGPGTVYIQT 238


>gi|332797638|ref|YP_004459138.1| hypothetical protein Ahos_1967 [Acidianus hospitalis W1]
 gi|332695373|gb|AEE94840.1| conserved hypothetical protein [Acidianus hospitalis W1]
          Length = 242

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 110/244 (45%), Gaps = 17/244 (6%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV--GMWQWLFGKTI 82
           ++ILG + Q  ++ L   E +    G +   S S+++   Y  +  +     + L G   
Sbjct: 2   YRILGYDMQHVRVYLNEGEMIYGDGGHLVAKSPSVKIN--YAAQGGIIKSFERELTGSRF 59

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             + +  PG ++     ++    RI+ IDL   G  +  + ++FL +   V+ S TLD+ 
Sbjct: 60  FVMQVYGPGVAE----FSSFLPGRIVKIDLN--GNGIRVEHNSFLFAEQGVQYSATLDKI 113

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
               +GG EG L    +GQG  F+ A G V    L+ GE I V+   ++A   S+   I 
Sbjct: 114 SVGWLGG-EGLLMAHFSGQGSVFVHALGGVSSFVLQQGEEIEVEEGHLLAFDDSMQYSIT 172

Query: 203 YNGPIRRAVFGGDNLVTAV---VTGPGIVFIQSLPFHRLSQRIARAV---TSPNMRENPK 256
             G I+  +FG      A    +TGPG V++ ++   +L  ++   V     P   ++P 
Sbjct: 173 RAGGIKTMLFGHIEKEGAFFIKLTGPGRVWLHNVSLEQLIAKLNLMVGGNGGPAQPQDPS 232

Query: 257 FFVQ 260
             +Q
Sbjct: 233 IDIQ 236


>gi|355571049|ref|ZP_09042319.1| protein of unknown function DUF124 [Methanolinea tarda NOBI-1]
 gi|354826331|gb|EHF10547.1| protein of unknown function DUF124 [Methanolinea tarda NOBI-1]
          Length = 231

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 53/213 (24%), Positives = 92/213 (43%), Gaps = 17/213 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSM-------EMENIYIPENEVGMWQWLFGKTITSVVLRN 89
           + L P E + A  GSM +M  ++       E   +      +   Q  F     +     
Sbjct: 16  VTLSPGESIAAESGSMTYMQPTLTVRTQKREKSLLGSLGLALAGGQSFFVNEFCATT--- 72

Query: 90  PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
              + G  G +A  L  I  + ++   G +L Q  A++ S   V +    +   + + G 
Sbjct: 73  ---AAGEAGFSAAPLGDIGTLRVSPGKGCIL-QKSAYIASQGTVDLDVKWEGFSKGLFG- 127

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            +G    + TG+G  FI   G++ Q  L+ GE + VD   +VA + +   Q++  G I+ 
Sbjct: 128 -QGLFMIRATGEGTLFINTFGAIDQHTLKEGESLIVDNFHLVAFSDTCTYQVRKFGGIKE 186

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            +  G+  VT  + GPG VFIQ+     L++ I
Sbjct: 187 TLLSGEGFVTE-IRGPGDVFIQTKNIRELAEWI 218


>gi|164425315|ref|XP_959188.2| hypothetical protein NCU04645 [Neurospora crassa OR74A]
 gi|157070879|gb|EAA29952.2| predicted protein [Neurospora crassa OR74A]
          Length = 445

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 51/187 (27%), Positives = 88/187 (47%), Gaps = 13/187 (6%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           A+PGSM  MS SM ++  Y    + G+ + + G  + S     PG       + AP++  
Sbjct: 254 AKPGSMIAMSPSMSLKGAY----KFGLKKLVAGGEMGSSTYTGPGEL-----LLAPAMLG 304

Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
            +   L + G +      D +L S  +V   +   Q +   +   EG    K++G GL +
Sbjct: 305 DI-TSLRLDGSQTWSVSHDGYLASTQNV-TKDYKRQGLGKAMFSGEGLWVYKISGTGLLW 362

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
           + + G++V+K+L  GE   VD   +VA  +   ++   +G +      G+ LV    TGP
Sbjct: 363 LTSFGAIVRKDLADGEKYIVDNGHLVAWNTKYVLERVASGGLLSGFASGEGLV-CKFTGP 421

Query: 226 GIVFIQS 232
           G +FIQ+
Sbjct: 422 GTIFIQT 428


>gi|448678514|ref|ZP_21689521.1| hypothetical protein C443_07778 [Haloarcula argentinensis DSM
           12282]
 gi|445772501|gb|EMA23546.1| hypothetical protein C443_07778 [Haloarcula argentinensis DSM
           12282]
          Length = 237

 Score = 56.6 bits (135), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/227 (26%), Positives = 98/227 (43%), Gaps = 23/227 (10%)

Query: 36  QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSD 94
           ++ L   E ++A PG+M   S ++E+E     +  +   + + G +++ +      G S 
Sbjct: 15  RVALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTAEGGS- 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           G V +A P+   +   +L    GE L   D AFL +  D+ + +          GGI+  
Sbjct: 74  GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIKSM 122

Query: 154 LRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
           L         L G G   I A G +    L+ GE  T+D   +VA   SVN      G +
Sbjct: 123 LAGASITPLALKGSGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEDSVNFDAHRVGGL 182

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           +  +  G+ LV    TGPG V+ Q+      +  IA A+  P   +N
Sbjct: 183 KSTLLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDN 226


>gi|448330451|ref|ZP_21519733.1| hypothetical protein C489_14920 [Natrinema versiforme JCM 10478]
 gi|445611832|gb|ELY65578.1| hypothetical protein C489_14920 [Natrinema versiforme JCM 10478]
          Length = 224

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/221 (25%), Positives = 91/221 (41%), Gaps = 19/221 (8%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           + L+P E V+A PG+M   S ++ M+     +  +   + L G               G 
Sbjct: 16  VTLEPGESVIAEPGAMVGHSANVSMDTGTSRDGLLSSAKSLLGGESAFANEFTAEDEPGT 75

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
           V  A PS   ++P +L     E L   D AFL +   + + +         IGG++  L 
Sbjct: 76  VTFAPPSPGDVMPHELR---DETLYSTDGAFLAATAGIDIDSE--------IGGLKSMLG 124

Query: 156 QK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
           +       L G G  FI A G + +  L  GE   +D   ++A    V+  I+  G ++ 
Sbjct: 125 EASVTPLALKGTGTVFIDAYGGLEKLELAAGESYVLDNEHLIAWDDQVDYSIRRVGNLKS 184

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
            +  G+ LV    TGPG  + Q+     L   +A  + S N
Sbjct: 185 TLLSGEGLVFE-FTGPGTAWYQTRDMDALVSVLAPRLPSDN 224


>gi|449546594|gb|EMD37563.1| hypothetical protein CERSUDRAFT_114200 [Ceriporiopsis subvermispora
           B]
          Length = 300

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/222 (26%), Positives = 98/222 (44%), Gaps = 21/222 (9%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V  + L+P  ++ A+PG+M  M  S++++        +G  ++  G  +       PG  
Sbjct: 95  VLYMRLQPGYEIKAKPGAMVAMDASVQIQGNLT----LGFKKFFTGGQMAESRFTGPGE- 149

Query: 94  DGFVGIAAPSLARILPIDLA-----MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
              V +A      I+PI L       FG  +      +L S  +V + NT  Q +   + 
Sbjct: 150 ---VLLAPEVWGDIVPIHLDGRTQWHFGKHV------WLASTREVTL-NTKAQSLGKTLF 199

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
              G      +G G+ F+ A GS++ + L  GE   VD   +VA T++ +V+    G + 
Sbjct: 200 SGRGLFVMTASGAGMLFVQALGSIITRTLANGEQWVVDNDHLVAWTANYSVERIQAGSLL 259

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
            +    + LV    TGPG V+IQS     L + IA  + + N
Sbjct: 260 SSAKTDEGLV-CRFTGPGTVYIQSRNPEALCKWIAERIPTDN 300


>gi|392570466|gb|EIW63639.1| DUF124-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 213

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 58/216 (26%), Positives = 99/216 (45%), Gaps = 16/216 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           + L+P  ++ A+PGSM  M  +++++     + +  M +   G  +T  +   PG     
Sbjct: 4   LRLQPGYEIKAKPGSMVAMDATVQIKG----KLKFSMKKMFTGSDMTESLFTGPGE---- 55

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQ--PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           V +A      I+PI L    G+        AFL S   V  +       + +  G EG  
Sbjct: 56  VLLAPEIWGDIVPIHLD---GQTPWNVGKHAFLASTPGVSRTAKSQGFGKALFSG-EGLF 111

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
            Q++TGQG  ++ + G++ Q+ L+ GE   VD   +VA T+   V+    G +       
Sbjct: 112 VQQVTGQGTMWVTSLGAIFQRTLQPGEQWIVDNGHLVAWTAGYKVERINAGGLFSMSHTD 171

Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           + LV    TGPG+++IQ+     L + I RA   P+
Sbjct: 172 EGLVCR-FTGPGVIYIQTRNPETLGEWI-RAQVPPS 205


>gi|220916631|ref|YP_002491935.1| hypothetical protein A2cp1_1525 [Anaeromyxobacter dehalogenans
           2CP-1]
 gi|219954485|gb|ACL64869.1| protein of unknown function DUF124 [Anaeromyxobacter dehalogenans
           2CP-1]
          Length = 247

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 57/229 (24%), Positives = 118/229 (51%), Gaps = 21/229 (9%)

Query: 20  DAVIPFQILGGEAQVC-QIMLKPQEKVVARPGSMCFMSGSMEMEN-----IYIPENEVGM 73
            + + +Q++GG A  C ++ L   + ++A  G+M  MSG++++++     +      +  
Sbjct: 8   SSTLRYQLVGGNAFACARVELAAGQSILAEAGAMVSMSGNVDLQSRMQGGVMGALRRMVT 67

Query: 74  WQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDV 133
            + +F  T T++     GP++  V +A P    ++ ++LA  G  LL Q  ++L S  + 
Sbjct: 68  RESVFVSTFTAM----GGPAE--VLLAPPVPGDVIGLELA--GRTLLVQSSSWLASDPET 119

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           ++ +T     R +  G EG    +L+G+G A + + G++V++ +  G    VD   +VA 
Sbjct: 120 RI-DTEFAGFRGLFAG-EGLFFIRLSGRGTALLSSYGAIVRRPIPAGGRYVVDTGHVVAF 177

Query: 194 TSSVNVQIK---YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
            +++  Q++     G + R++  G+ LV A   GPG V++Q+     L+
Sbjct: 178 DAAMPYQVRKASRRGWL-RSIVSGEALV-AEFAGPGEVWLQTRNLQALA 224


>gi|448573289|ref|ZP_21640873.1| hypothetical protein C456_16242 [Haloferax lucentense DSM 14919]
 gi|448597542|ref|ZP_21654467.1| hypothetical protein C452_08881 [Haloferax alexandrinus JCM 10717]
 gi|445719054|gb|ELZ70737.1| hypothetical protein C456_16242 [Haloferax lucentense DSM 14919]
 gi|445739003|gb|ELZ90512.1| hypothetical protein C452_08881 [Haloferax alexandrinus JCM 10717]
          Length = 225

 Score = 56.2 bits (134), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 64/133 (48%), Gaps = 6/133 (4%)

Query: 103 SLARILPIDLAMFGGE---LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
           S A  LP D+   G E   L  Q  +++ S   + +  +          G EG    KLT
Sbjct: 76  SFAPPLPGDIVHHGLEDETLFVQSGSYIASDPALDLDTSFGGA--KTFFGSEGLFLLKLT 133

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
           G G +F+ + G++ +  L  GE  TVD   IVA   + +  ++  G ++  +F G+ LV 
Sbjct: 134 GTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLKSTLFSGEGLV- 192

Query: 220 AVVTGPGIVFIQS 232
              TGPG V+IQS
Sbjct: 193 CTFTGPGTVWIQS 205


>gi|333984102|ref|YP_004513312.1| hypothetical protein [Methylomonas methanica MC09]
 gi|333808143|gb|AEG00813.1| protein of unknown function DUF124 [Methylomonas methanica MC09]
          Length = 226

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 95/206 (46%), Gaps = 22/206 (10%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW----------QWLFGKTITSVV 86
           + L+  E + A  G+M  MS ++++++    + E G+           + LF +T+T+V 
Sbjct: 16  VHLERGEAIKAESGAMITMSETLDVDS----KMERGLLGGLSRKLLTGESLFFQTLTAV- 70

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
               GP +  +  A P    +L +D +    E + Q D FL +   V+VS       + +
Sbjct: 71  ---RGPGEVLMASAYPGEIAVLELDGSE---EYILQKDGFLAAEQSVQVSTKAQNLTKGL 124

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
             G EGF   +++G+G   I   G + +  L  GE   +D S +VA +++ N +I+    
Sbjct: 125 FSG-EGFFVMRISGKGKLAISTYGGIQKLTLAPGETRIIDNSHMVAWSANANYKIEKAAK 183

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
              + F     +     GPG V+IQ+
Sbjct: 184 GWISSFTSGEGLVCRFHGPGDVYIQT 209


>gi|448579214|ref|ZP_21644491.1| hypothetical protein C455_16088 [Haloferax larsenii JCM 13917]
 gi|445723893|gb|ELZ75529.1| hypothetical protein C455_16088 [Haloferax larsenii JCM 13917]
          Length = 226

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 57/114 (50%), Gaps = 3/114 (2%)

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           L  Q  ++L S   + +  +     R   G  EG    KL G G AF+ + G++ +  L+
Sbjct: 95  LYVQSGSYLASETTLDLDTSFGG-ARTFFGS-EGLFLLKLEGTGDAFLSSYGAIHEVELD 152

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            G+  TVD   IVA   + N  ++  G ++  +F G+ LV    TGPG V++QS
Sbjct: 153 EGQKYTVDTGHIVAFDGTTNFDVERVGGLKSTLFSGEGLV-CTFTGPGSVWLQS 205


>gi|448610962|ref|ZP_21661596.1| hypothetical protein C440_07362 [Haloferax mucosum ATCC BAA-1512]
 gi|445743394|gb|ELZ94875.1| hypothetical protein C440_07362 [Haloferax mucosum ATCC BAA-1512]
          Length = 226

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/114 (31%), Positives = 57/114 (50%), Gaps = 3/114 (2%)

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           L  Q  ++L S  DV++             G EG    KL G G +F+ + G++ +  L+
Sbjct: 95  LFVQSGSYLAS--DVELDLDTSFGGAKTFFGSEGLFLLKLEGTGDSFLSSYGAIHEVELD 152

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            GE  TVD   IVA   + +  ++  G ++  +F G+ LV    TGPG V++QS
Sbjct: 153 AGERYTVDTGHIVAFDGTTDFSVERVGGLKSTLFSGEGLV-CTFTGPGTVWLQS 205


>gi|448582557|ref|ZP_21646061.1| hypothetical protein C454_05742 [Haloferax gibbonsii ATCC 33959]
 gi|445732205|gb|ELZ83788.1| hypothetical protein C454_05742 [Haloferax gibbonsii ATCC 33959]
          Length = 225

 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
           G EG    KLTG G +F+ + G++ +  L  GE  TVD   IVA   + +  ++  G ++
Sbjct: 123 GSEGLFLLKLTGTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLK 182

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
             +F G+ LV    TGPG V+IQS
Sbjct: 183 STLFSGEGLV-CTFTGPGTVWIQS 205


>gi|292654579|ref|YP_003534476.1| hypothetical protein HVO_0402 [Haloferax volcanii DS2]
 gi|448293417|ref|ZP_21483523.1| hypothetical protein C498_16688 [Haloferax volcanii DS2]
 gi|291371176|gb|ADE03403.1| conserved hypothetical protein TIGR00266 [Haloferax volcanii DS2]
 gi|445570471|gb|ELY25031.1| hypothetical protein C498_16688 [Haloferax volcanii DS2]
          Length = 225

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
           G EG    KLTG G +F+ + G++ +  L  GE  TVD   IVA   + +  ++  G ++
Sbjct: 123 GSEGLFLLKLTGTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLK 182

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
             +F G+ LV    TGPG V+IQS
Sbjct: 183 STLFSGEGLV-CTFTGPGTVWIQS 205


>gi|448543631|ref|ZP_21625185.1| hypothetical protein C460_10643 [Haloferax sp. ATCC BAA-646]
 gi|448550723|ref|ZP_21629026.1| hypothetical protein C459_12020 [Haloferax sp. ATCC BAA-645]
 gi|448558976|ref|ZP_21633297.1| hypothetical protein C458_15651 [Haloferax sp. ATCC BAA-644]
 gi|445706354|gb|ELZ58237.1| hypothetical protein C460_10643 [Haloferax sp. ATCC BAA-646]
 gi|445711228|gb|ELZ63022.1| hypothetical protein C459_12020 [Haloferax sp. ATCC BAA-645]
 gi|445711817|gb|ELZ63605.1| hypothetical protein C458_15651 [Haloferax sp. ATCC BAA-644]
          Length = 225

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
           G EG    KLTG G +F+ + G++ +  L  GE  TVD   IVA   + +  ++  G ++
Sbjct: 123 GSEGLFLLKLTGTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLK 182

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
             +F G+ LV    TGPG V+IQS
Sbjct: 183 STLFSGEGLV-CTFTGPGTVWIQS 205


>gi|323453474|gb|EGB09345.1| hypothetical protein AURANDRAFT_13803, partial [Aureococcus
           anophagefferens]
          Length = 107

 Score = 55.8 bits (133), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 47/96 (48%), Gaps = 4/96 (4%)

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
           G +G +RQ + G G+AF+ A G +  K L  GE + VD + +VA   SV   ++  G   
Sbjct: 1   GGQGLIRQVIKGDGMAFMGAMGVITTKTLAAGETLQVDTTSVVAWEDSVKFAVRRTGSFF 60

Query: 209 RAVFGGDN----LVTAVVTGPGIVFIQSLPFHRLSQ 240
                G+     +    + GPG+VF+QS    +  Q
Sbjct: 61  TCCCAGEGTFQRMFNTTLEGPGLVFLQSYSHGKFKQ 96


>gi|197121839|ref|YP_002133790.1| hypothetical protein AnaeK_1430 [Anaeromyxobacter sp. K]
 gi|196171688|gb|ACG72661.1| protein of unknown function DUF124 [Anaeromyxobacter sp. K]
          Length = 247

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 56/230 (24%), Positives = 111/230 (48%), Gaps = 23/230 (10%)

Query: 20  DAVIPFQILGGEAQVC-QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
            + + +Q++GG A  C ++ L   + ++A  G+M  MSG++++++         M   + 
Sbjct: 8   SSTLRYQLVGGNAFACARVELAAGQSILAEAGAMVSMSGNVDLQSR--------MQGGVM 59

Query: 79  GKTITSVVLRNPGPSDGFVGIAAPSLARILP------IDLAMFGGELLCQPDAFLCSVND 132
           G  +  +V R       F  +  P+   + P      I L + G  LL Q  ++L S  +
Sbjct: 60  G-ALRRMVTRESVFVSTFTAMGGPAEVLLAPPVPGDVIGLELEGRTLLVQSSSWLASDPE 118

Query: 133 VKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
            ++ +T     R +  G EG    +L+G+G A + + G++V++ +  G    VD   +VA
Sbjct: 119 TRI-DTEFAGFRGLFAG-EGLFFIRLSGRGTALLSSYGAIVRRPIPAGGRYVVDTGHVVA 176

Query: 193 VTSSVNVQIK---YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
             +++  Q++     G + R++  G+ LV A   GPG V++Q+     L+
Sbjct: 177 FDAAMPYQVRKASRRGWL-RSIVSGEALV-AEFAGPGEVWLQTRNLQALA 224


>gi|448589550|ref|ZP_21649709.1| hypothetical protein C453_04209 [Haloferax elongans ATCC BAA-1513]
 gi|445735978|gb|ELZ87526.1| hypothetical protein C453_04209 [Haloferax elongans ATCC BAA-1513]
          Length = 226

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
           G EG    KL G G AF+ + G++ +  L+ G+  TVD   IVA   + N  ++  G ++
Sbjct: 123 GSEGLFLLKLEGTGDAFLSSYGAIHEVELDEGQKYTVDTGHIVAFDGTTNFDVERVGGLK 182

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
             +F G+ LV    TGPG V++QS
Sbjct: 183 STLFSGEGLV-CTFTGPGSVWLQS 205


>gi|159041055|ref|YP_001540307.1| hypothetical protein Cmaq_0472 [Caldivirga maquilingensis IC-167]
 gi|157919890|gb|ABW01317.1| protein of unknown function DUF124 [Caldivirga maquilingensis
           IC-167]
          Length = 238

 Score = 55.5 bits (132), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 45/211 (21%), Positives = 93/211 (44%), Gaps = 6/211 (2%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           + + ILG + Q+ ++++   E +      + + + ++ +          G+ + L G + 
Sbjct: 1   MEYSILGNDLQLVRVVMGEGESIYGEGSHLLYKTPAVGLTTKASGGILAGLKRTLTGASF 60

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
             + L  PG +      A  +  +I  + L   G  ++ +  +FL + N VK   +L + 
Sbjct: 61  FVLELTGPGEAA----FAGSTPGKITQVTLNE-GESIMAEHGSFLFAENSVKYDASLTRL 115

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
              + GG EG    K TG G  F+   G V    L   E + ++   ++A  + +   + 
Sbjct: 116 SVGLFGG-EGLFLAKFTGLGNVFLHGTGHVHMIQLNESEELNIEAGHLLAFDAGMQYTVG 174

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
             G +R  + GG+ L    + GPG V+++S+
Sbjct: 175 RVGGLRTMLLGGEGLFFVNIKGPGRVWVRSI 205


>gi|116179870|ref|XP_001219784.1| hypothetical protein CHGG_00563 [Chaetomium globosum CBS 148.51]
 gi|88184860|gb|EAQ92328.1| hypothetical protein CHGG_00563 [Chaetomium globosum CBS 148.51]
          Length = 425

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 54/189 (28%), Positives = 89/189 (47%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           V ARPGSM  MS S+ ++       +  M + + G  + S     PG     + +A P L
Sbjct: 232 VEARPGSMIAMSPSIVLKG----HIKFSMKKLVAGGDMNSSTFTGPGE----LLLAPPML 283

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I  + L+  G E      D FL    +V + +   Q +   +   EG    K++G GL
Sbjct: 284 GDITSLRLS--GQEAWSVSHDGFLACTQNV-IKDYKRQGLGKAMFSGEGLWVYKISGTGL 340

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            ++ + G++++K+L  GE   VD   +VA  +   ++   +G +   +  G+ LV    T
Sbjct: 341 LWLTSFGAIIRKDLMDGEKYIVDNGHLVAWNTKYVLERVASGGLMSGMASGEGLVCK-FT 399

Query: 224 GPGIVFIQS 232
           GPG VFIQ+
Sbjct: 400 GPGTVFIQT 408


>gi|86158855|ref|YP_465640.1| hypothetical protein Adeh_2433 [Anaeromyxobacter dehalogenans
           2CP-C]
 gi|85775366|gb|ABC82203.1| protein of unknown function DUF124 [Anaeromyxobacter dehalogenans
           2CP-C]
          Length = 247

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 111/229 (48%), Gaps = 23/229 (10%)

Query: 21  AVIPFQILGGEAQVC-QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG 79
           + + +Q++GG A  C ++ L   + ++A  G+M  MSG++++++         M   + G
Sbjct: 9   STLRYQLVGGNAFACARVELAAGQSILAEAGAMVSMSGNVDLQSR--------MQGGVMG 60

Query: 80  KTITSVVLRNPGPSDGFVGIAAPSLARILP------IDLAMFGGELLCQPDAFLCSVNDV 133
             +  +V R       F  +  P+   + P      I L + G  LL Q  ++L S  + 
Sbjct: 61  -ALRRMVTRESVFVSTFTAMGGPAEVLLAPPVPGDVIGLELEGRTLLVQSSSWLASDPET 119

Query: 134 KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           ++ +T     R +  G EG    +L+G+G A + + G++V++ +  G    VD   +VA 
Sbjct: 120 RI-DTEFAGFRGLFAG-EGLFFIRLSGRGTALLSSYGAIVRRPIPAGGRYVVDTGHVVAF 177

Query: 194 TSSVNVQIK---YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
            +++  Q++     G + R++  G+ LV A   GPG V++Q+     L+
Sbjct: 178 DAAMPYQVRKASRRGWL-RSIVSGEALV-AEFAGPGEVWLQTRNLQALA 224


>gi|171691685|ref|XP_001910767.1| hypothetical protein [Podospora anserina S mat+]
 gi|170945791|emb|CAP72591.1| unnamed protein product [Podospora anserina S mat+]
          Length = 399

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 87/187 (46%), Gaps = 13/187 (6%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           ARPG+M  MS S++++  Y    +  M + + G  ++      PG     + +A P L  
Sbjct: 207 ARPGAMIAMSPSIQLKGSY----KFSMKKLVAGGEMSQSHYTGPGE----LLLAPPMLGD 258

Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
           I    L + G E      D +L S   V + +   Q +   +   EG    K++G GL +
Sbjct: 259 I--TSLRLTGNESWSVSHDGYLASTQHV-IKDYKRQGLGKAMFSGEGLWVYKISGTGLLW 315

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
           + + G++++K+L  GE   VD   +VA      ++   +G I      G+ LV    TGP
Sbjct: 316 LTSFGAIIRKDLAEGEKYIVDNGHLVAWNVKYIMERVTSGGIISGFASGEGLVCK-FTGP 374

Query: 226 GIVFIQS 232
           G VFIQ+
Sbjct: 375 GTVFIQT 381


>gi|395333196|gb|EJF65574.1| hypothetical protein DICSQDRAFT_79838 [Dichomitus squalens LYAD-421
           SS1]
          Length = 208

 Score = 55.5 bits (132), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/214 (27%), Positives = 97/214 (45%), Gaps = 11/214 (5%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           + L+P  ++ A+PGSM  M  S++++     + +V + +   G  ++  +   PG     
Sbjct: 4   LRLQPGYQIKAKPGSMVAMDASVQIKG----KLKVSLKKIFTGGEMSESIFTGPGE---- 55

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
           V +A      ++PI L       + + DAFL     V  S T  Q +   +   EG    
Sbjct: 56  VLLAPEIWGDVVPIRLDGSQNWRVGR-DAFLACTMGVTRS-TKSQGLGKALFSGEGLFVY 113

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
            + GQG+ ++ + G++  + L+ GE   VD   +VA T+   V+    G    A    + 
Sbjct: 114 NIAGQGVMWVQSLGAITSRTLQPGEQWIVDNGHLVAWTAQYKVERIDAGGFFSASHTDEG 173

Query: 217 LVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           LV    TGPG+V+IQS     L + I   V S N
Sbjct: 174 LVCR-FTGPGVVYIQSRNPETLGEWIREQVPSVN 206


>gi|402311312|ref|ZP_10830260.1| TIGR00266 family protein [Eubacterium sp. AS15]
 gi|400365458|gb|EJP18510.1| TIGR00266 family protein [Eubacterium sp. AS15]
          Length = 226

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 58/200 (29%), Positives = 98/200 (49%), Gaps = 14/200 (7%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF-- 96
           L   E + A  G+M  M  ++++E         G+ + L G+ I     +N   S G   
Sbjct: 18  LAAGESIKAESGAMVGMDNTLDIEGKLEKGLMGGLTRMLAGEKI---FFQNITASRGAGK 74

Query: 97  VGIAAPSLARILPIDLAMFG-GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
           V +A  +L  +  ID  + G  +L+ Q D +L S + V VS       + +  G EGF  
Sbjct: 75  VLLAPTTLGSV--IDYKLDGMNDLIVQKDGYLASEHGVDVSTKTQNLAQGLFSG-EGFFI 131

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY---NGPIRRAVF 212
            K++G+G  F+ + GS+ + +L+ G+   VD   +VA TS+ + +I+    +G I  +  
Sbjct: 132 LKVSGRGTVFLNSYGSIHEIDLQAGQEYIVDNQHLVAWTSNTDYKIEKASSSGWI-SSFT 190

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
            G+ LV     GPG VF+Q+
Sbjct: 191 SGEGLVCR-FRGPGKVFMQT 209


>gi|448560743|ref|ZP_21634191.1| hypothetical protein C457_02205 [Haloferax prahovense DSM 18310]
 gi|445722393|gb|ELZ74056.1| hypothetical protein C457_02205 [Haloferax prahovense DSM 18310]
          Length = 225

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 47/84 (55%), Gaps = 1/84 (1%)

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
           G EG    KLTG G +F+ + G++ +  L  GE  TVD   IVA   + +  ++  G ++
Sbjct: 123 GSEGLFLLKLTGTGDSFLSSYGAIHEVELGEGERYTVDTGHIVAFDETTSFSVERVGGLK 182

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
             +F G+ LV    TGPG V++QS
Sbjct: 183 STLFSGEGLV-CTFTGPGTVWMQS 205


>gi|365832830|ref|ZP_09374357.1| hypothetical protein HMPREF1021_03121 [Coprobacillus sp. 3_3_56FAA]
 gi|365259960|gb|EHM89935.1| hypothetical protein HMPREF1021_03121 [Coprobacillus sp. 3_3_56FAA]
          Length = 118

 Score = 55.1 bits (131), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/102 (26%), Positives = 55/102 (53%), Gaps = 1/102 (0%)

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
           ++ +    + GG EGF+  K++G G  F+   GS +  +L  G+ + +D   +  + ++ 
Sbjct: 7   SIKKMATGIFGG-EGFIMNKISGTGTVFLEIDGSAISYDLLPGQQMVIDTGYLAMMDATC 65

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLS 239
            ++I+    ++  + GG+     VVTGPG V +QS+P   ++
Sbjct: 66  KMEIQSVKGVKNKLLGGEGFFNTVVTGPGKVVLQSMPISAVA 107


>gi|403722528|ref|ZP_10945084.1| hypothetical protein GORHZ_060_00470 [Gordonia rhizosphera NBRC
           16068]
 gi|403206628|dbj|GAB89415.1| hypothetical protein GORHZ_060_00470 [Gordonia rhizosphera NBRC
           16068]
          Length = 225

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 92/208 (44%), Gaps = 10/208 (4%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           I + P   V    G+M    G +++E         G+ + L G++    V      S G 
Sbjct: 16  ITVPPGGSVRVEAGAMAMTRGDIQIETSTRGGFFKGLRRSLGGESF--FVNDFHSGSGGQ 73

Query: 97  VGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           VG+AA  P     L +D  M    LL Q  +++ S   V V +          G  EG +
Sbjct: 74  VGVAATLPGDMTSLSLDARM---PLLVQSGSWIASDPSVDVDSKWGGHKSFFSG--EGLI 128

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
             + TG G   + + G++V   L  GE +T+D   +VA   +V  +++  G  +  + GG
Sbjct: 129 LLRCTGAGGLLLSSYGAIVATELAAGETLTLDTGHVVAFDEAVQYKVRKAGSWKSTLLGG 188

Query: 215 DNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           + LVT    GPG V++Q+   + L + I
Sbjct: 189 EGLVTD-FHGPGRVWLQTRSSNDLIEWI 215


>gi|83644432|ref|YP_432867.1| hypothetical protein HCH_01585 [Hahella chejuensis KCTC 2396]
 gi|83632475|gb|ABC28442.1| uncharacterized conserved protein [Hahella chejuensis KCTC 2396]
          Length = 220

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 64/237 (27%), Positives = 108/237 (45%), Gaps = 24/237 (10%)

Query: 26  QILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITS 84
           +I GG A     + L+P E++ A   +M  MS  ++ME        + + +   G   T 
Sbjct: 4   EIKGGAAFGYVDVELEPGEQITAESDAMSSMSADLDMETRTNGSFFIAILRKFLGGE-TF 62

Query: 85  VVLRNPGPSDGFVGIAAPSLARILP--IDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQ 141
            + R    ++G   +   +L +  P  +  A    E LC QP AFL +   V++      
Sbjct: 63  FINRFSNKTNG---VRKLTLVQPTPGEVRCAELNNETLCFQPGAFLAATEGVRLGVRW-A 118

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
             ++ I   EG  +  ++G G  +  A G++++K ++ GE I VD S +VA    +++ I
Sbjct: 119 GFKSFIA-KEGLFKLVVSGTGKVWYGAYGALLEKEID-GEYI-VDTSHLVAYDPGLSLHI 175

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFF 258
           +  G +  + FGG+ LVT  V G G + IQ+     L+  I           NPK F
Sbjct: 176 QLAGGLFSSFFGGEGLVTR-VEGKGKIIIQTRSISGLTDWI-----------NPKLF 220


>gi|315231067|ref|YP_004071503.1| hypothetical protein TERMP_01305 [Thermococcus barophilus MP]
 gi|315184095|gb|ADT84280.1| hypothetical protein TERMP_01305 [Thermococcus barophilus MP]
          Length = 222

 Score = 55.1 bits (131), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 88/201 (43%), Gaps = 11/201 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           + ++ L+  E V A  G+M FMS S+++E          + + L       + +      
Sbjct: 13  LVEVELERGEAVQAEAGAMVFMSPSIKIETKAKGGIFGALKRSLLAGESFFINIFKAERD 72

Query: 94  DGFVGIAAPSLARILPIDLAMF--GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
            G VG A P      P D+  F   G L  Q  AFL S  D+K+               E
Sbjct: 73  GGKVGFAPP-----YPGDIEAFEIDGTLYAQSGAFLASSADIKIDTKWGG--AKTFFARE 125

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G    K++G+G+ F+ + G++ +  L   E   +D   +VA +  ++ ++K  G ++   
Sbjct: 126 GLFLLKMSGRGVVFLSSYGAIQKVELN-NEHFIIDTGHLVAFSEGLDFRVKRVGGLKSTF 184

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+ LV A   G G ++IQ+
Sbjct: 185 LSGEGLV-AEFYGTGTLYIQT 204


>gi|344211296|ref|YP_004795616.1| hypothetical protein HAH_1010 [Haloarcula hispanica ATCC 33960]
 gi|343782651|gb|AEM56628.1| conserved hypothetical protein [Haloarcula hispanica ATCC 33960]
          Length = 237

 Score = 54.7 bits (130), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 59/227 (25%), Positives = 98/227 (43%), Gaps = 23/227 (10%)

Query: 36  QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSD 94
           ++ L   E ++A PG+M   S ++E+E     +  +   + + G +++ +      G S 
Sbjct: 15  RVALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTAQGGS- 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           G + +A P+   +   +L    GE L   D AFL +  D+ + +          GGI+  
Sbjct: 74  GTLTLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIKSM 122

Query: 154 LRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
           L         L G G   I A G +    L+ GE  T+D   +VA   SVN      G +
Sbjct: 123 LAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVGGL 182

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           +  +  G+ LV    TGPG V+ Q+      +  IA A+  P   +N
Sbjct: 183 KSTLLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDN 226


>gi|448667307|ref|ZP_21685849.1| hypothetical protein C442_10226 [Haloarcula amylolytica JCM 13557]
 gi|445770342|gb|EMA21406.1| hypothetical protein C442_10226 [Haloarcula amylolytica JCM 13557]
          Length = 237

 Score = 54.7 bits (130), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 59/229 (25%), Positives = 96/229 (41%), Gaps = 25/229 (10%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGP 92
            ++ L   E ++A PG+M   S ++E+E     +  +   + + G    + +      GP
Sbjct: 14  VRVALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTAKGGP 73

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIE 151
             G V +A P+   +   +L    GE L   D AFL +  D+ + +          GGI+
Sbjct: 74  --GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIK 120

Query: 152 GFLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
             L         L G G   I A G +    L+ GE  T+D   +VA   SVN      G
Sbjct: 121 SMLAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVG 180

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
            ++  +  G+ +V    TGPG V+ Q+      +  IA A+  P   +N
Sbjct: 181 GLKSTLLSGEGIVMD-FTGPGTVWYQTRGLGSFTSAIADAL--PGTGDN 226


>gi|350635083|gb|EHA23445.1| hypothetical protein ASPNIDRAFT_173126 [Aspergillus niger ATCC
           1015]
          Length = 268

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 78/188 (41%), Gaps = 11/188 (5%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + ARPG M  MS SM +          G  + L G  I       PG       + APS+
Sbjct: 75  IQARPGVMIAMSPSMSLRGSL----SFGWMKALAGGNIARSTYTGPGEL-----LIAPSV 125

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
              + I       E +   DAFL S   VK        ++ V  G EGF   +  G GL 
Sbjct: 126 MGDITIIRLDGSNEWIVGRDAFLASTTGVKTEYKTQGLMKGVFSG-EGFFVWRFYGAGLV 184

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           ++ + G++V+K L   E   V+   +VA      ++   +G I      G+ L     TG
Sbjct: 185 WMQSFGAIVKKELADDETYYVNNGHLVAWNCRYKIERVASGGIISNWSAGEGLA-CRFTG 243

Query: 225 PGIVFIQS 232
           PG V++Q+
Sbjct: 244 PGTVYMQT 251


>gi|353243514|emb|CCA75046.1| hypothetical protein PIIN_09031 [Piriformospora indica DSM 11827]
          Length = 292

 Score = 54.3 bits (129), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 56/218 (25%), Positives = 102/218 (46%), Gaps = 19/218 (8%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           + QI L+   ++ +RPG+M  MS S++++       ++ + + + G  ++      PG  
Sbjct: 85  ILQITLQQGYQIKSRPGAMVAMSASVQIKG----SMKISLSKMITGGEMSESKYTGPGEV 140

Query: 94  DGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
                I AP +   I+PI +           DA+L    +V  S      ++ ++ G EG
Sbjct: 141 -----ILAPEVWGDIVPIQI-QPNTSWSVGKDAYLACTANVTRSTKSQGFMKGLMSG-EG 193

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ-IKYNGPIRRAV 211
           F   ++ GQG+ F+ + G++V+++L  GE   VD   +VA ++S  ++ I+  G      
Sbjct: 194 FFVARVEGQGVLFVQSLGAIVRRDLRPGEEWIVDNGHLVAWSASYTMERIQTTGG---GF 250

Query: 212 FGGDNLVTAVV---TGPGIVFIQSLPFHRLSQRIARAV 246
             G +     V    GPG V++Q+     L Q IA  V
Sbjct: 251 LSGSHTGEGAVCRFRGPGTVYLQTRNPQSLGQWIAAQV 288


>gi|254169050|ref|ZP_04875888.1| conserved hypothetical protein TIGR00266 [Aciduliprofundum boonei
           T469]
 gi|197622020|gb|EDY34597.1| conserved hypothetical protein TIGR00266 [Aciduliprofundum boonei
           T469]
          Length = 222

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 97/206 (47%), Gaps = 23/206 (11%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL---RNPG 91
            ++ L P E++VA  G+M +MS ++ +E       + G  + L  K ++   L       
Sbjct: 14  LKVYLSPGERIVAEAGAMVYMSPTIGVETT----TKGGFMKGLLRKMLSGQTLFMNTYYA 69

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV-IGG 149
            S+G   + APSL   I+ I++         QP     S  +   S  L+  V+     G
Sbjct: 70  QSEGGYVVFAPSLPGDIIEIEV--------TQP--IYISDTNYLASTELEFGVKFTGFKG 119

Query: 150 I---EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
           I    G    K++G G A+I A G +    L  GE + +D   + A  +S+N  ++  G 
Sbjct: 120 IFTPGGMFWFKVSGIGKAWIAAYGGIDMLQLGPGERLLIDNIHLAAFDASMNFTLRKFGK 179

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
           ++  +FGG+ ++T  V GPG V+IQS
Sbjct: 180 LKSFLFGGEYILTEFV-GPGRVWIQS 204


>gi|152967256|ref|YP_001363040.1| hypothetical protein Krad_3312 [Kineococcus radiotolerans SRS30216]
 gi|151361773|gb|ABS04776.1| protein of unknown function DUF124 [Kineococcus radiotolerans
           SRS30216]
          Length = 263

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 56/252 (22%), Positives = 97/252 (38%), Gaps = 25/252 (9%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGK 80
            +  Q++G   Q+  + L+P + +    G   F SG + ME  +  P    G       +
Sbjct: 2   TLDVQLVGSTTQMAVVTLRPGQTLYCEAGKFLFSSGDVVMETKLTAPGGGEGGGLGGLLR 61

Query: 81  TITSVVLRN-PGPSDGFVGIAAPSLA-------------RILPIDLAMFGGELLCQPDAF 126
                  R   G S  F     P                R L +D +        + DAF
Sbjct: 62  GAAQAGKRVLAGESFAFQHFHTPGGDGLLGLAGVLPGEMRHLELDGST---TWFAEKDAF 118

Query: 127 LCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKN-LEVGEVITV 185
           + +   V          + ++GG EGF+ ++ TG+G   I   G  +  N  + G  + V
Sbjct: 119 VAAEAGVHFDIAFSGVGQGLMGG-EGFILERFTGRGSLLIAGAGDFIDINPADYGGTLRV 177

Query: 186 DVSCIVAVTSSVNVQIKYNGPIRR-----AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           D  C+VA    +   ++  G + R     AV GG+ L  A + G G V +QS+     ++
Sbjct: 178 DTGCVVAWDDRITYGVETVGRLNRQGIVSAVLGGEGLTLATLRGDGRVILQSVTIEAFAK 237

Query: 241 RIARAVTSPNMR 252
            + +    P+ +
Sbjct: 238 ALVKNSAKPDQQ 249


>gi|358367770|dbj|GAA84388.1| DUF124 domain protein [Aspergillus kawachii IFO 4308]
          Length = 359

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 78/188 (41%), Gaps = 11/188 (5%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + ARPG M  MS SM +          G  + L G  I       PG       + APS+
Sbjct: 165 IQARPGVMIAMSPSMSLRGSL----SFGWMKALAGGNIARSTYTGPGEL-----LIAPSI 215

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
              + I       E +   DAFL S   VK        ++ V  G EGF   +  G GL 
Sbjct: 216 MGDITIIRLDGSNEWIVGRDAFLASTTGVKTEYKTQGLMKGVFSG-EGFFVWRFYGAGLV 274

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           ++ + G++V+K L   E   V+   +VA      ++   +G I      G+ L     TG
Sbjct: 275 WMQSFGAIVKKELADDETYYVNNGHLVAWNCRYKIERVASGGIISNWSAGEGLA-CRFTG 333

Query: 225 PGIVFIQS 232
           PG V++Q+
Sbjct: 334 PGTVYMQT 341


>gi|218244961|ref|YP_002370332.1| hypothetical protein PCC8801_0070 [Cyanothece sp. PCC 8801]
 gi|257057986|ref|YP_003135874.1| hypothetical protein Cyan8802_0068 [Cyanothece sp. PCC 8802]
 gi|218165439|gb|ACK64176.1| protein of unknown function DUF124 [Cyanothece sp. PCC 8801]
 gi|256588152|gb|ACU99038.1| protein of unknown function DUF124 [Cyanothece sp. PCC 8802]
          Length = 225

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 97/207 (46%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
           ++ + ++ L   E++VA+ GSM  MSG    S  +          G  + L G+++   V
Sbjct: 10  DSAIARVTLNAGEELVAQAGSMVAMSGFINASTTLRQGKGGGIMGGFKRMLGGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P P +G V +A   +  IL  +  M G EL+ Q  ++L S ++V +     Q  ++V
Sbjct: 70  FRSPIP-NGQVFLAPKLMGDILVYE--MEGKELIVQSGSYLASASNVDIDLGF-QGFKSV 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
             G E      +TG G   + + G + +  +EV     VD   IVA   S++  I K N 
Sbjct: 126 FSG-EAIFWLSITGYGSVILTSFGGIYE--VEVNGDYIVDTGHIVAFERSLDFSITKVNS 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
               A FGG+ LV     G G V+ Q+
Sbjct: 183 SWWGAFFGGEGLVCH-FHGYGKVYCQT 208


>gi|339500593|ref|YP_004698628.1| hypothetical protein Spica_1992 [Spirochaeta caldaria DSM 7334]
 gi|338834942|gb|AEJ20120.1| protein of unknown function DUF124 [Spirochaeta caldaria DSM 7334]
          Length = 226

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 52/213 (24%), Positives = 100/213 (46%), Gaps = 13/213 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V +++ +  E+V A  G+M  MSG+ME++     +   G    + G           G S
Sbjct: 13  VLRLIFEAGEQVRAEAGAMISMSGTMELKAKTSGKGVFGTLGAMMGGESLFASEYTAGTS 72

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
              + I APS    + + L +    +  Q  A+L     +++S+     ++ ++ G EG 
Sbjct: 73  GDEL-ILAPSYPGDI-LHLPLRDETIFAQSGAYLAGSTHLQLSS--QGSLKALVSG-EGL 127

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF- 212
             QK++G G  ++ + G++++K L  G+   VD   +VA  +S++  I+      R +F 
Sbjct: 128 FLQKISGSGDLWLASYGAIIEKQLNAGDEYVVDTGNMVAFDASLSYTIQ---TASRGLFS 184

Query: 213 ---GGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
               G+ LV     GPG +++Q+     L+Q +
Sbjct: 185 SMASGEGLV-CRFRGPGRLWLQTRSISALAQTL 216


>gi|345519252|ref|ZP_08798679.1| hypothetical protein BSFG_01154 [Bacteroides sp. 4_3_47FAA]
 gi|254834698|gb|EET15007.1| hypothetical protein BSFG_01154 [Bacteroides sp. 4_3_47FAA]
          Length = 218

 Score = 54.3 bits (129), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 104/217 (47%), Gaps = 16/217 (7%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG--------MWQWL 77
           +I+G E +  ++ L   E   A  GS+ +   +++ +  +  +   G        +   L
Sbjct: 4   KIIGYEFKTLEVTLASGESFYAERGSIVYTDEALQRDGEFNNDGGSGLGSILGGVVKSAL 63

Query: 78  FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
            G++I  +   NP  +D  + + + S   ++PI L   G  L+C+   ++ S +  KV  
Sbjct: 64  SGESILIIRFSNPTNTDKKL-VLSGSCCALVPIKLQ--GENLICRRGHYVASSD--KVHL 118

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
            L+  ++ + GG+  F  QK+ G    F+ + GS ++K L  GE I VD + IVA+    
Sbjct: 119 NLNLNLQGLFGGVGLF--QKVEGNATIFLDSLGSPIEKILNHGESIEVDENHIVALHGFS 176

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           + QI+    +   V  G+ L    +TGPG +++  LP
Sbjct: 177 SSQIQAGWSVGN-VLRGEGLSLMKITGPGKLYLSPLP 212


>gi|392963280|ref|ZP_10328706.1| protein of unknown function DUF124 [Pelosinus fermentans DSM 17108]
 gi|421056838|ref|ZP_15519755.1| protein of unknown function DUF124 [Pelosinus fermentans B4]
 gi|421059742|ref|ZP_15522304.1| protein of unknown function DUF124 [Pelosinus fermentans B3]
 gi|421065078|ref|ZP_15526881.1| protein of unknown function DUF124 [Pelosinus fermentans A12]
 gi|421069341|ref|ZP_15530513.1| protein of unknown function DUF124 [Pelosinus fermentans A11]
 gi|392438018|gb|EIW15880.1| protein of unknown function DUF124 [Pelosinus fermentans B4]
 gi|392450361|gb|EIW27414.1| protein of unknown function DUF124 [Pelosinus fermentans A11]
 gi|392451104|gb|EIW28098.1| protein of unknown function DUF124 [Pelosinus fermentans DSM 17108]
 gi|392458387|gb|EIW34924.1| protein of unknown function DUF124 [Pelosinus fermentans B3]
 gi|392459765|gb|EIW36144.1| protein of unknown function DUF124 [Pelosinus fermentans A12]
          Length = 225

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 68/133 (51%), Gaps = 3/133 (2%)

Query: 101 APSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
           APS+   I+ +DL      L+ Q D FL     ++VS+ +   ++ +  G EGF   K++
Sbjct: 78  APSIPGDIIDVDLDG-SYSLVVQKDGFLAGSTGIEVSSKMQNLMQGMFSG-EGFFTLKVS 135

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
           G+G  F+ + G++   NLE GE   +D + +VA    ++ +I+       + F    ++ 
Sbjct: 136 GKGTVFLNSYGAIHAINLEAGEERIIDNNHLVAWPDYMDYKIEKATSGWISSFTSGEMLV 195

Query: 220 AVVTGPGIVFIQS 232
               GPG+V IQ+
Sbjct: 196 CRFKGPGVVLIQT 208


>gi|325092436|gb|EGC45746.1| DUF124 domain-containing protein [Ajellomyces capsulatus H88]
          Length = 450

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 91/189 (48%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  M + + G  ++S     PG       + AP++
Sbjct: 254 LTAKPGAMIAMSPTMTLKGSI----KFSMKKLIAGGEMSSSTYTGPGEL-----LLAPTM 304

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + + L + G E      DAFL +   V V +  +Q +       EG    K++G GL
Sbjct: 305 LGDISV-LRLNGHETWSVGKDAFLAATQGV-VKDYKNQGLSKAFFSGEGLFVYKISGMGL 362

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +VA +    ++   +G +   +  G+ LV    T
Sbjct: 363 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWSCKYVMERVASGGLISGMTSGEGLVCK-FT 421

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 422 GPGSVYLQT 430


>gi|448651425|ref|ZP_21680494.1| hypothetical protein C435_05268 [Haloarcula californiae ATCC 33799]
 gi|445770952|gb|EMA22010.1| hypothetical protein C435_05268 [Haloarcula californiae ATCC 33799]
          Length = 237

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 62/239 (25%), Positives = 97/239 (40%), Gaps = 25/239 (10%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGP 92
            ++ L   E ++A PG+M   S ++E+E     +  +   + + G    + +      GP
Sbjct: 14  VRVDLDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTARGGP 73

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIE 151
             G V +A P+   +   +L    GE L   D AFL S   + + +          GGI+
Sbjct: 74  --GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLASDPGIDIDSEF--------GGIK 120

Query: 152 GFLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
             L         L G G   I A G +    L+ GE  T+D   +VA   SVN      G
Sbjct: 121 SLLAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVG 180

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAIF 264
            ++  +  G+ LV    TGPG V+ Q+      +  IA A+  P   +N      I  F
Sbjct: 181 GLKSTLLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDNDGNSTGIDDF 236


>gi|113474372|ref|YP_720433.1| hypothetical protein Tery_0505 [Trichodesmium erythraeum IMS101]
 gi|110165420|gb|ABG49960.1| protein of unknown function DUF124 [Trichodesmium erythraeum
           IMS101]
          Length = 233

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 51/215 (23%), Positives = 97/215 (45%), Gaps = 28/215 (13%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           + L P + ++A PG+M  M G + M+              L G   ++++ +  G    F
Sbjct: 29  VTLYPGDSIIAEPGAMTSMDGRLSMKTK------------LSGGFFSALLRKFLGGESLF 76

Query: 97  VGIAAPSLARILPIDLA-----------MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
           V +     ++ L + L            + G E+  QP A++ S   +K+          
Sbjct: 77  VNVFENKTSQPLNVTLTQSIIGDIDCLELTGNEICFQPGAYIASSPKIKLGVRWAGFASW 136

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
            +G  EG  + K++GQG  F  A G + ++++  GE I VD S +VA + ++ + I  +G
Sbjct: 137 FMG--EGLFKLKVSGQGQVFFGAYGGLSKQDI-AGEFI-VDNSHLVAYSPNIKMSIGLSG 192

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
            +  ++  G+  V   + G G++++QS     L Q
Sbjct: 193 GLISSLTSGEGFVNR-LKGNGVIYLQSRSIEGLVQ 226


>gi|302039464|ref|YP_003799786.1| hypothetical protein NIDE4194 [Candidatus Nitrospira defluvii]
 gi|300607528|emb|CBK43861.1| conserved protein of unknown function DUF124 [Candidatus Nitrospira
           defluvii]
          Length = 227

 Score = 53.9 bits (128), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 62/117 (52%), Gaps = 5/117 (4%)

Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
           E + Q D FL     VK+ + +    R ++GG EGF   K+ G G   + + G++ +  L
Sbjct: 96  EYMVQKDGFLAGAEGVKIESKMQSLTRGLLGG-EGFFILKIGGTGQLILNSFGAIHKIEL 154

Query: 178 EVGEVITVDVSCIVAVTSSV--NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  E   VD S +VA TS+   N++   +G I  ++  G+ LV     GPG+V+IQS
Sbjct: 155 KPNEEYIVDNSHLVAWTSTTTYNIEKASSGWI-ASLTSGEGLVCR-FRGPGVVYIQS 209


>gi|381157051|ref|ZP_09866285.1| TIGR00266 family protein [Thiorhodovibrio sp. 970]
 gi|380880914|gb|EIC23004.1| TIGR00266 family protein [Thiorhodovibrio sp. 970]
          Length = 241

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 54/221 (24%), Positives = 104/221 (47%), Gaps = 12/221 (5%)

Query: 16  QSPQDAVIPFQIL-GGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW 74
           Q+ QD  + F++L  G   + ++ L+  + V A+  +M  M  +++++         G+ 
Sbjct: 9   QTLQD--LTFRVLYRGAFAMLEVHLRASQSVKAQSDAMVAMDATVDVDGKMEGGLLGGLG 66

Query: 75  QWLFGKTITSVVLR-NPGPSDGFVGIAAPSLARILPIDLAMFGGEL--LCQPDAFLCSVN 131
           + L G++     LR + GP    +  A P    +LP++L    G +  + Q D FL +  
Sbjct: 67  RMLSGESFFFQTLRASRGPGVAHLAPAQP--GDLLPVEL---DGSVPYVIQKDGFLAASE 121

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V +S T     + +  G EGF   K  G+G+ F+ + G+V + ++  GE + +D   +V
Sbjct: 122 SVDISTTAQNLTKGLFSG-EGFFVLKARGRGMLFVESYGAVHEIDIPSGEEMIIDNGHLV 180

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           A   ++  +++       + F     +     GPG V+IQS
Sbjct: 181 AWPETMQYKLEKASSGWISSFTSGEGLVCRFKGPGKVYIQS 221


>gi|421077167|ref|ZP_15538138.1| protein of unknown function DUF124 [Pelosinus fermentans JBW45]
 gi|392524555|gb|EIW47710.1| protein of unknown function DUF124 [Pelosinus fermentans JBW45]
          Length = 225

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 68/133 (51%), Gaps = 3/133 (2%)

Query: 101 APSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
           APS+   I+ +DL      L+ Q D FL     ++VS+ +   ++ +  G EGF   K++
Sbjct: 78  APSIPGDIIDVDLDG-SYSLVVQKDGFLAGSTGIEVSSKMQNLMQGMFSG-EGFFTLKVS 135

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVT 219
           G+G  F+ + G++   NLE GE   +D + +VA    ++ +I+       + F    ++ 
Sbjct: 136 GKGTVFLNSYGAIHAINLEAGEERIIDNNHLVAWPDYMDYKIEKATSGWISSFTSGEMLV 195

Query: 220 AVVTGPGIVFIQS 232
               GPG+V IQ+
Sbjct: 196 CRFKGPGVVLIQT 208


>gi|113474373|ref|YP_720434.1| hypothetical protein Tery_0506 [Trichodesmium erythraeum IMS101]
 gi|110165421|gb|ABG49961.1| protein of unknown function DUF124 [Trichodesmium erythraeum
           IMS101]
          Length = 226

 Score = 53.9 bits (128), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 101/207 (48%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
           +  + ++ L   E++VA+ G M  MSG    S  +          G+ + L G+++   V
Sbjct: 10  DNTIAKVSLNRGEEIVAQAGCMVAMSGFINTSTTLRQGKGGGIMGGIKRMLGGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P P DG V +A   +  IL  +  M G EL+ Q  ++L S N+V +     Q ++++
Sbjct: 70  FRSPSP-DGQVFLAPKLMGDILVYE--MKGQELVVQASSYLASDNNVDLDLGF-QGIKSL 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
             G E      +TG G   + + G + + +++ G+ I VD   IVA  SS++ +I K N 
Sbjct: 126 FSG-ESIFWLSITGYGPVILTSFGGIYEIDVK-GDYI-VDTGHIVAFESSLDFRIDKANS 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
               A FGG+  V     G G V+ Q+
Sbjct: 183 SWLGAFFGGEGFVCR-FQGRGKVYCQT 208


>gi|320591779|gb|EFX04218.1| duf124 domain containing protein [Grosmannia clavigera kw1407]
          Length = 411

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 86/188 (45%), Gaps = 11/188 (5%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS S+ ++  Y    +  M + + G  + S     P    G + +A P L
Sbjct: 217 LSAKPGAMIAMSPSVTLKGAY----KFSMKKLVAGAEMGSSTYTGP----GELLLAPPML 268

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
             I  I L    G  +   D FL     V   +   Q +   +   EG    K+ G GL 
Sbjct: 269 GDITSIRLNGNNGWSVGH-DGFLACTQGV-TKDYKRQGLGKAMFSGEGLWVYKIGGIGLL 326

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           +I + G++++K+L  GE   VD   +VA  +  N++   +G I   +   + L+    TG
Sbjct: 327 WISSFGAIIKKDLADGEKYIVDNGHLVAWNTKYNLERVASGGIISGLASNEGLICK-FTG 385

Query: 225 PGIVFIQS 232
           PG V+IQ+
Sbjct: 386 PGTVYIQT 393


>gi|134298221|ref|YP_001111717.1| hypothetical protein Dred_0344 [Desulfotomaculum reducens MI-1]
 gi|134050921|gb|ABO48892.1| protein of unknown function DUF124 [Desulfotomaculum reducens MI-1]
          Length = 227

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 55/203 (27%), Positives = 91/203 (44%), Gaps = 11/203 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNP 90
           + Q+ L+  E V A   +M  MS ++++E         G+ + + G++    T    R  
Sbjct: 13  LVQLNLEQGESVKAESDAMVAMSSTIDVEGKMDGGLLGGLARMVAGESFFFQTLTARRGA 72

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           G       + AP++   + ID+ + G   L+ Q D FL     ++VS  +    + +  G
Sbjct: 73  GEV-----LLAPAVPGDI-IDVELDGSYSLMVQKDGFLAGSAGIQVSTKMQNLSQGLFSG 126

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            EGF   K++GQG+ FI   G +   NLE G+   VD S +VA    +N  I+       
Sbjct: 127 -EGFFVVKVSGQGIVFINTYGGIHVINLEPGQEFVVDNSHLVAWPEYMNFSIEKASSGWI 185

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
           + F    +      GPG V IQ+
Sbjct: 186 STFTSGEVAVCKFRGPGPVIIQT 208


>gi|154279368|ref|XP_001540497.1| predicted protein [Ajellomyces capsulatus NAm1]
 gi|150412440|gb|EDN07827.1| predicted protein [Ajellomyces capsulatus NAm1]
          Length = 312

 Score = 53.5 bits (127), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 92/189 (48%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  M + + G  ++S     PG       + AP++
Sbjct: 116 LTAKPGAMIAMSPTMTLKGSI----KFSMKKLIAGGEMSSSTYTGPGEL-----LLAPTM 166

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + + L + G E      DAFL +   V V +  +Q +   +   EG    K++G GL
Sbjct: 167 LGDISV-LRLNGHETWSVGKDAFLAATQGV-VKDYKNQGLSKALFSGEGLFVYKISGMGL 224

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +VA +    ++   +G +   +  G+ LV    T
Sbjct: 225 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWSCKYVMERVASGGLISGMTSGEGLVCK-FT 283

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 284 GPGSVYLQT 292


>gi|126656109|ref|ZP_01727493.1| hypothetical protein CY0110_03464 [Cyanothece sp. CCY0110]
 gi|126622389|gb|EAZ93095.1| hypothetical protein CY0110_03464 [Cyanothece sp. CCY0110]
          Length = 225

 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 99/207 (47%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
           ++ + +++L   E++VA+ G+M  MSG    S  +          G  + L G+++   V
Sbjct: 10  DSAIAKVILNAGEEIVAQAGAMVAMSGFINTSTTLRQGKGGGIMGGFKRMLGGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P  ++G V +A   +  IL  +  M G EL+ Q  ++L S NDV +     Q ++++
Sbjct: 70  FRSP-TAEGEVFLAPRLMGDILVYE--MQGQELVVQAASYLASANDVDLDLGF-QGMKSL 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
             G E      +TG G   + + G + +  +EV     VD   IVA   S++ +I K N 
Sbjct: 126 FSG-ESIFWLSVTGYGPVILTSFGGIYE--IEVDGDYVVDTGHIVAFERSLDFRIDKANS 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
               A FGG+  +     G G V+ Q+
Sbjct: 183 SWLGAFFGGEGFICR-FHGKGKVYCQT 208


>gi|409044766|gb|EKM54247.1| hypothetical protein PHACADRAFT_257964 [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 264

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 93/201 (46%), Gaps = 21/201 (10%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP---GPS 93
           + L P  ++ A+PG M  M  +++++         G  +  F + ITS  L      GP 
Sbjct: 62  VRLPPGMELKAKPGCMVAMDATVKIK---------GKMKVSFKRLITSSDLSESTFTGP- 111

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQ--PDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
            G V +A  +   I+PI L    G  +      AFLC    V  +       +    G E
Sbjct: 112 -GEVLMAPETWGDIVPIHLD---GSTVWNFSKHAFLCCTQGVTRTYKTQSLGKTFFSG-E 166

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G   ++ +G G+ F+ + G++VQ++L+  E   VD   +VA T+  +V+    G +  + 
Sbjct: 167 GLFVEQCSGAGVIFVSSIGAIVQRHLQPNEQWIVDNGHLVAWTAKYSVERIQAGSLLSSA 226

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
              + LV    TGPGIV++Q+
Sbjct: 227 KTDEGLVCR-FTGPGIVYLQT 246


>gi|407916932|gb|EKG10260.1| hypothetical protein MPH_12642 [Macrophomina phaseolina MS6]
          Length = 387

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 92/200 (46%), Gaps = 13/200 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + ARPG+M  MS ++ ++       +  + +++ G  + +     PG     + +A PS+
Sbjct: 194 LTARPGAMFAMSPTITLKGTL----KFSLKKFVAGGELATSTYTGPGE----LLLAPPSM 245

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I  I L+  G E+     D FL     + V     Q +   +   EG    K++G G+
Sbjct: 246 GDITNIRLS--GQEMWSVSKDGFLACTQGI-VKEYKSQSLSKAMFSGEGLFVYKISGSGI 302

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            ++ + G++++K+L+  E   VD   +VA      ++   +G I   +  G+ LV    T
Sbjct: 303 LWVTSLGAILRKDLQENEKYIVDNGHLVAWNCKYALERVASGGIISNMSSGEGLVCK-FT 361

Query: 224 GPGIVFIQSLPFHRLSQRIA 243
           GPG +FIQ+      SQ I 
Sbjct: 362 GPGTIFIQTRNPTAFSQWIG 381



 Score = 38.1 bits (87), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 37/155 (23%), Positives = 66/155 (42%), Gaps = 15/155 (9%)

Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
           IL + LAM G  L  +P A       + +  TL   ++  + G E       TG G   +
Sbjct: 183 ILTVQLAM-GCPLTARPGAMFAMSPTITLKGTLKFSLKKFVAGGE-LATSTYTGPGELLL 240

Query: 167 L--AGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
              + G +    L   E+ +V     +A T  +  + K    + +A+F G+ L    ++G
Sbjct: 241 APPSMGDITNIRLSGQEMWSVSKDGFLACTQGIVKEYKSQS-LSKAMFSGEGLFVYKISG 299

Query: 225 PGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFV 259
            GI+++ SL           A+   +++EN K+ V
Sbjct: 300 SGILWVTSL----------GAILRKDLQENEKYIV 324


>gi|83766193|dbj|BAE56336.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 330

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 88/189 (46%), Gaps = 14/189 (7%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MSG + +        + G+ + + G  +TS +   PG     + +A P L
Sbjct: 142 IEAKPGAMVAMSGDISLRGTV----KFGLAKMVAGG-MTSSIYTGPGE----ILLAPPFL 192

Query: 105 ARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I  I L M G E      DAFL   + V+         + V  G EGF+   ++G GL
Sbjct: 193 GDI--IVLRMDGSERWKVGKDAFLAKTSGVEKDYKSQGMTKAVFSG-EGFIIYHMSGVGL 249

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            ++ + G++++K++  G+   VD   IVA      ++   +G +  A F     +     
Sbjct: 250 VWLQSFGAIIRKDIPEGKTYLVDNGYIVAWNCKYKIERAASGGLLSA-FSSSEGLACKFE 308

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 309 GPGTVYLQT 317


>gi|297619153|ref|YP_003707258.1| hypothetical protein Mvol_0626 [Methanococcus voltae A3]
 gi|297378130|gb|ADI36285.1| protein of unknown function DUF124 [Methanococcus voltae A3]
          Length = 220

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 49/191 (25%), Positives = 91/191 (47%), Gaps = 11/191 (5%)

Query: 42  QEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAA 101
            E ++   GSM +M  S+E++     + + G++  L    +   +  N     G + +A 
Sbjct: 22  NEGIITETGSMVYMEPSVEIKT----QAKGGLFGALKRAVVGESIFLNTFSGSGKLALA- 76

Query: 102 PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
           PSL+    I      G L  Q  A+L S  ++++ +T     ++  GG +G    KLTGQ
Sbjct: 77  PSLSG--DIQYHPLNGTLYVQHGAYLASSPNIEI-DTKFGGAKSFFGG-KGLFLMKLTGQ 132

Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
           G  F+ + G++    L   E + VD   ++  T  +  ++   G ++  + GG+ LV   
Sbjct: 133 GDLFLSSYGAIETIELN-NESLVVDNGNLIGFTDGLQFELTKVGGLKSTLLGGEGLVYR- 190

Query: 222 VTGPGIVFIQS 232
           ++G G V+IQ+
Sbjct: 191 LSGSGTVYIQT 201


>gi|358395742|gb|EHK45129.1| hypothetical protein TRIATDRAFT_299843 [Trichoderma atroviride IMI
           206040]
          Length = 340

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 93/200 (46%), Gaps = 13/200 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           +  I L     + A+PG+M  MS S+ ++       +  + + + G  ++S     PG  
Sbjct: 135 ILTIQLANGAPIHAKPGAMVAMSPSITLKG----HIKFSVKKIIAGGELSSSTFTGPGE- 189

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
              + +A  SL  +  I L+  G E      DA+L     +   +   Q +   I   EG
Sbjct: 190 ---LILAPSSLGDVTSIRLS--GNETWSVGKDAYLAYTQGIN-KDYKRQGLGKAIFSGEG 243

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
               K+TG GL ++ + G++++K+L  GE   VD   +VA  +S  ++   +G +   + 
Sbjct: 244 LFVYKITGTGLLWVSSFGAIIRKDLAQGEKYIVDNGHLVAWNTSYVLERVASGGLISGLS 303

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
            G+ LV    TGPG VF+Q+
Sbjct: 304 SGEGLVCK-FTGPGTVFMQT 322


>gi|337283582|ref|YP_004623056.1| hypothetical protein PYCH_00710 [Pyrococcus yayanosii CH1]
 gi|334899516|gb|AEH23784.1| hypothetical protein PYCH_00710 [Pyrococcus yayanosii CH1]
          Length = 221

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 97/207 (46%), Gaps = 25/207 (12%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW--LFGKTITSVVLR-NPG 91
            ++ L+P E V A  G+M +MS ++ +E        +G  +   L G++    V R   G
Sbjct: 14  LEVELQPGEAVQAEAGAMVYMSPTVRVET-KARGGLLGALKRSVLGGESFFINVFRAEDG 72

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
           P  G VG A      I  +++    G +  Q  AFL    D+++            GG +
Sbjct: 73  P--GVVGFAPGYPGDIEALEI---NGTIYAQSGAFLAGSEDIQIDIKF--------GGAK 119

Query: 152 GFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
            FL +      +L+G+G+ F+ + G++   +L+ GE   VD   +VA T  ++  +K  G
Sbjct: 120 TFLGRESLFLLELSGRGIVFLASYGAIRPVHLK-GERFVVDTGHMVAFTEGLDFTVKSIG 178

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            ++  +F G+ LV     G G V+IQ+
Sbjct: 179 GLKSVLFSGEGLVME-FRGTGTVYIQT 204


>gi|317140662|ref|XP_001818338.2| hypothetical protein AOR_1_2448174 [Aspergillus oryzae RIB40]
          Length = 326

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 88/189 (46%), Gaps = 14/189 (7%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MSG + +        + G+ + + G  +TS +   PG     + +A P L
Sbjct: 138 IEAKPGAMVAMSGDISLRGTV----KFGLAKMVAGG-MTSSIYTGPGE----ILLAPPFL 188

Query: 105 ARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I  I L M G E      DAFL   + V+         + V  G EGF+   ++G GL
Sbjct: 189 GDI--IVLRMDGSERWKVGKDAFLAKTSGVEKDYKSQGMTKAVFSG-EGFIIYHMSGVGL 245

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            ++ + G++++K++  G+   VD   IVA      ++   +G +  A F     +     
Sbjct: 246 VWLQSFGAIIRKDIPEGKTYLVDNGYIVAWNCKYKIERAASGGLLSA-FSSSEGLACKFE 304

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 305 GPGTVYLQT 313


>gi|225562500|gb|EEH10779.1| DUF124 domain-containing protein [Ajellomyces capsulatus G186AR]
          Length = 327

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 91/189 (48%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  M + + G  ++S     PG       + AP++
Sbjct: 131 LTAKPGAMIAMSPTMTLKGSI----KFSMKKLIAGGEMSSSTYTGPGEL-----LLAPTM 181

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + + L + G E      DAFL +   V V +  +Q +       EG    K++G GL
Sbjct: 182 LGDISV-LRLNGHETWSVGKDAFLAATQGV-VKDYKNQGLSKAFFSGEGLFVYKISGMGL 239

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +VA +    ++   +G +   +  G+ LV    T
Sbjct: 240 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWSCKYVMERVASGGLISGMTSGEGLVCK-FT 298

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 299 GPGSVYLQT 307


>gi|11498234|ref|NP_069460.1| hypothetical protein AF0626 [Archaeoglobus fulgidus DSM 4304]
 gi|2649993|gb|AAB90613.1| conserved hypothetical protein [Archaeoglobus fulgidus DSM 4304]
          Length = 218

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/200 (28%), Positives = 96/200 (48%), Gaps = 10/200 (5%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
            + ++ L+P E+V+A  G+M +M   +E++         G+ + L G   +  V R    
Sbjct: 12  SMLELKLRPGEEVLAETGAMVYMK-DVELKTEMKGGLLGGLKRSLLGGE-SFFVNRFVSR 69

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
            +G +G+A      I+ I L    G +L Q  AFL S + ++V +T     R    G EG
Sbjct: 70  GEGVLGLAPTYQGDIIHIPLN---GTILAQSGAFLASSDGIEV-DTKWGGGRTFFAG-EG 124

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
               K+ G G  F+ + GS+  + L+V   + VD   IVA   +++ ++   G ++  V 
Sbjct: 125 LFLLKIEGTGDLFLSSFGSI--EELQVDGHLRVDTGHIVAFEPTLDYKVSKVGGLKATVL 182

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
            G+ LV     G G V+IQ+
Sbjct: 183 SGEGLVVD-FKGKGKVWIQT 201


>gi|170077981|ref|YP_001734619.1| hypothetical protein SYNPCC7002_A1367 [Synechococcus sp. PCC 7002]
 gi|169885650|gb|ACA99363.1| conserved hypothetical protein TIGR00266 [Synechococcus sp. PCC
           7002]
          Length = 220

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 52/197 (26%), Positives = 92/197 (46%), Gaps = 8/197 (4%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTITSVVLRNPGPSDG 95
           + L+P E+V A  G+M  M G + M+  +       + + LF G+++   V RN      
Sbjct: 16  VTLEPGEQVTAESGAMVSMDGGILMKTEFSGGFFPAILRRLFGGESLFVNVYRNTSQRPQ 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
            V +    +  I  IDL+   G +  QP A++      K+            G  EG  +
Sbjct: 76  TVILTQSMVGDIHRIDLSQ--GPICFQPGAYIAHTPGAKMGIRWAGFASWFAG--EGLFK 131

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
            + TG G  F    G +++K+++ GE I VD S +VA    + + I+ +G +  ++  G+
Sbjct: 132 LQFTGSGRVFYGCYGGIIEKHIK-GEFI-VDNSHLVAYDPGITMNIRMSGGLFGSLTSGE 189

Query: 216 NLVTAVVTGPGIVFIQS 232
            LV   + G G +++QS
Sbjct: 190 GLVNK-LKGRGRIYLQS 205


>gi|448630295|ref|ZP_21672950.1| hypothetical protein C437_09113 [Haloarcula vallismortis ATCC
           29715]
 gi|445756218|gb|EMA07593.1| hypothetical protein C437_09113 [Haloarcula vallismortis ATCC
           29715]
          Length = 237

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 96/224 (42%), Gaps = 23/224 (10%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSDGFV 97
           L   E ++A PG+M   S ++E+E     +  +   + + G +++ +      G S G +
Sbjct: 18  LDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTARGGS-GTL 76

Query: 98  GIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
            +A P+   +   +L    GE L   D AFL +  D+ + +          GGI+  L  
Sbjct: 77  TLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIKSLLAG 125

Query: 157 K------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
                  L G G   I A G +    L+ GE  T+D   +VA   SVN      G ++  
Sbjct: 126 ASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVGGLKST 185

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           +  G+ LV    TGPG V+ Q+      +  IA A+  P   +N
Sbjct: 186 LLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDN 226


>gi|319641092|ref|ZP_07995796.1| hypothetical protein HMPREF9011_01393 [Bacteroides sp. 3_1_40A]
 gi|317387336|gb|EFV68211.1| hypothetical protein HMPREF9011_01393 [Bacteroides sp. 3_1_40A]
          Length = 218

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 56/217 (25%), Positives = 104/217 (47%), Gaps = 16/217 (7%)

Query: 26  QILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG--------MWQWL 77
           +I+G E +  ++ L   E   A  GS+ +   +++ +  +  +   G        +   L
Sbjct: 4   KIIGYEFKTLEVTLASGESFYAERGSIVYTDEALQRDVEFNNDGGSGLGSILGGVVKSAL 63

Query: 78  FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN 137
            G++I  +   NP  +D  + + + S   ++PI L   G  L+C+   ++ S +  KV  
Sbjct: 64  SGESILIIRFSNPTNTDKKL-VLSGSCCALVPIKLQ--GENLICRRGHYVASSD--KVHL 118

Query: 138 TLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
            L+  ++ + GG+  F  QK+ G    F+ + GS ++K L  GE I VD + IVA+    
Sbjct: 119 NLNLNLQGLFGGVGLF--QKVEGNATIFLDSLGSPIEKILNHGESIEVDENHIVALHGFS 176

Query: 198 NVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLP 234
           + QI+    +   V  G+ L    +TGPG +++  LP
Sbjct: 177 SSQIQAGWSVGN-VLRGEGLSLMKITGPGKLYLSPLP 212


>gi|389851747|ref|YP_006353981.1| hypothetical protein Py04_0304 [Pyrococcus sp. ST04]
 gi|388249053|gb|AFK21906.1| hypothetical protein Py04_0304 [Pyrococcus sp. ST04]
          Length = 218

 Score = 53.1 bits (126), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/202 (25%), Positives = 95/202 (47%), Gaps = 14/202 (6%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW--LFGKTITSVVLRNP 90
            + +I L+  E + A  G+M +M  ++++E        +G  +   L G++    + R P
Sbjct: 12  SLLEINLRDGEAIQAEAGAMVYMDPTIKLET-KARGGILGALKRSVLGGESFFVNIFRGP 70

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
           G     VG A      I+ +++    G+L  Q  AF+ +  ++ +        +  IG  
Sbjct: 71  GR----VGFAPGYPGDIVAMEVN---GKLYAQSGAFIAASENIDIDVKFGG-AKTFIG-R 121

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
           EG    ++ G G  F+ + G++ + +L   E + VD   +VA T  +N  IK  G ++  
Sbjct: 122 EGIFLLEMKGHGTVFLSSYGAIQKISLN-NESLIVDTGHMVAFTEGINFTIKRVGGLKST 180

Query: 211 VFGGDNLVTAVVTGPGIVFIQS 232
           +F G+ LV    +G G V+IQ+
Sbjct: 181 LFSGEGLVFE-FSGTGDVYIQT 201


>gi|448688943|ref|ZP_21694680.1| hypothetical protein C444_13187 [Haloarcula japonica DSM 6131]
 gi|445778813|gb|EMA29755.1| hypothetical protein C444_13187 [Haloarcula japonica DSM 6131]
          Length = 237

 Score = 52.8 bits (125), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 59/228 (25%), Positives = 95/228 (41%), Gaps = 25/228 (10%)

Query: 36  QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGPS 93
           +  L   E ++A PG+M   S ++E+E     +  +   + + G    + +      GP 
Sbjct: 15  RAALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTARGGP- 73

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
            G V +A P+   +   +L    GE L   D AFL +  D+ + +          GGI+ 
Sbjct: 74  -GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLAADPDIDIDSEF--------GGIKS 121

Query: 153 FLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
            L         L G G   I A G +    L+ GE  T+D   +VA   SV+      G 
Sbjct: 122 MLAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWDESVSFDAHRVGG 181

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMREN 254
           ++  +  G+ LV    TGPG V+ Q+      +  IA A+  P   +N
Sbjct: 182 LKSTLLSGEGLVMD-FTGPGTVWYQTRGLDSFTSAIADAL--PGTGDN 226


>gi|289596749|ref|YP_003483445.1| protein of unknown function DUF124 [Aciduliprofundum boonei T469]
 gi|289534536|gb|ADD08883.1| protein of unknown function DUF124 [Aciduliprofundum boonei T469]
          Length = 223

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/206 (27%), Positives = 97/206 (47%), Gaps = 23/206 (11%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL---RNPG 91
            ++ L P E++VA  G+M +MS ++ +E       + G  + +  K ++   L       
Sbjct: 15  LKVYLSPGERIVAEAGAMVYMSPTIGVETT----TKGGFMKGVLRKMLSGQTLFMNTYYA 70

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR-NVIGG 149
            S+G   + APSL   I+ I++         QP     S  +   S  L+  V+     G
Sbjct: 71  QSEGGYVVFAPSLPGDIIKIEV--------TQP--IYISDTNYLASTELEFGVKFTGFKG 120

Query: 150 I---EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
           I    G    K++G G A+I A G +    L  GE + +D   + A  +S+N  ++  G 
Sbjct: 121 IFTPGGMFWFKVSGIGKAWIAAYGGIDMLQLGPGERLLIDNIHLAAFDASMNFTLRKFGK 180

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
           ++  +FGG+ ++T  V GPG V+IQS
Sbjct: 181 LKSFLFGGEYILTEFV-GPGRVWIQS 205


>gi|428214331|ref|YP_007087475.1| hypothetical protein Oscil6304_4017 [Oscillatoria acuminata PCC
           6304]
 gi|428002712|gb|AFY83555.1| TIGR00266 family protein [Oscillatoria acuminata PCC 6304]
          Length = 220

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 90/205 (43%), Gaps = 8/205 (3%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGPSDG 95
           + L P E + A PG+   M   + M+  +      G+ + W  G+++   V RNP     
Sbjct: 16  VTLSPGESITAEPGAAISMDSRIAMKTQFSGGFFSGLLKKWFGGESLLVNVFRNPTTHPL 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
            V ++  ++  +  I+L      +  QP A++     +K+  TL           EG  +
Sbjct: 76  EVVLSQATIGDMACIELTQ--SSICLQPGAYIAHTPGIKI--TLGWAGFASWFAREGLFK 131

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
            KLTG G  F  A G +  K +  GE I VD   +VA   ++ + I  +G +  ++   +
Sbjct: 132 LKLTGTGRVFFGAYGGLTPKRIN-GEFI-VDNGHLVAYEPTIKMGISMSGGLFSSISSRE 189

Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQ 240
             V   + G GI+++QS     L Q
Sbjct: 190 GFVNR-LKGQGIIYLQSRSVEGLVQ 213


>gi|254169226|ref|ZP_04876060.1| conserved hypothetical protein TIGR00266 [Aciduliprofundum boonei
           T469]
 gi|197621822|gb|EDY34403.1| conserved hypothetical protein TIGR00266 [Aciduliprofundum boonei
           T469]
          Length = 222

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/206 (27%), Positives = 97/206 (47%), Gaps = 23/206 (11%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL---RNPG 91
            ++ L P E++VA  G+M +MS ++ +E       + G  + +  K ++   L       
Sbjct: 14  LKVYLSPGERIVAEAGAMVYMSPTIGVETT----TKGGFMKGVLRKMLSGQTLFMNTYYA 69

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVR-NVIGG 149
            S+G   + APSL   I+ I++         QP     S  +   S  L+  V+     G
Sbjct: 70  QSEGGYVVFAPSLPGDIIKIEV--------TQP--IYISDTNYLASTELEFGVKFTGFKG 119

Query: 150 I---EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
           I    G    K++G G A+I A G +    L  GE + +D   + A  +S+N  ++  G 
Sbjct: 120 IFTPGGMFWFKVSGIGKAWIAAYGGIDMLQLGPGERLLIDNIHLAAFDASMNFTLRKFGK 179

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
           ++  +FGG+ ++T  V GPG V+IQS
Sbjct: 180 LKSFLFGGEYILTEFV-GPGRVWIQS 204


>gi|358388687|gb|EHK26280.1| hypothetical protein TRIVIDRAFT_166887 [Trichoderma virens Gv29-8]
          Length = 436

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 94/201 (46%), Gaps = 15/201 (7%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
           +  I L     + A+PG+M  MS ++ ++ N+     ++     + G  + S     PG 
Sbjct: 231 ILTIQLAMGAPLHAKPGAMVAMSPTITLKGNVKFSVKKI-----IAGGELASSTYTGPGE 285

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
               + +A  SL  I  I L   G E      DA+L S   V V +   Q +   I   E
Sbjct: 286 ----LILAPASLGDITSIRLT--GKEQWSVGHDAYLASTQAV-VKDYKRQGLGKAIFSGE 338

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G    K++G GL ++ + G++++K+L  GE   VD   +VA  +S  ++   +G I   +
Sbjct: 339 GLFVYKISGTGLLWVSSFGAIIRKDLAQGEKYIVDNGHLVAWNTSYVLERVASGGIISGL 398

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+ LV    TGPG VF+Q+
Sbjct: 399 SSGEGLVCK-FTGPGTVFLQT 418


>gi|340517568|gb|EGR47812.1| predicted protein [Trichoderma reesei QM6a]
          Length = 254

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 56/189 (29%), Positives = 91/189 (48%), Gaps = 17/189 (8%)

Query: 47  ARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPS-L 104
           A+PG+M  MS ++ ++ N+     ++     + G  ++S     PG       I APS L
Sbjct: 62  AKPGAMVAMSPTITLKGNVKFSVKKI-----IAGGELSSSTYTGPGEL-----ILAPSAL 111

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I  I L   G E      DA+L S   V V +   Q +   I   EG    K++G GL
Sbjct: 112 GDITSIRLT--GQEQWSVGHDAYLASTQAV-VKDYKRQGLGKAIFSGEGLFVYKISGTGL 168

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            ++ + G++++K+L  GE   VD   +VA  +S  ++   +G +   +  G+ LV    T
Sbjct: 169 LWVSSFGAIIRKDLAAGEKYIVDNGHLVAWNTSYVLERVASGGLISGLSSGEGLVCK-FT 227

Query: 224 GPGIVFIQS 232
           GPG VF+Q+
Sbjct: 228 GPGTVFLQT 236


>gi|443324895|ref|ZP_21053617.1| TIGR00266 family protein [Xenococcus sp. PCC 7305]
 gi|442795497|gb|ELS04862.1| TIGR00266 family protein [Xenococcus sp. PCC 7305]
          Length = 245

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 58/233 (24%), Positives = 105/233 (45%), Gaps = 12/233 (5%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTITSVVLRNPGPSDG 95
           + L+P E + A  G+M  M   + M+  +       + + LF G+++   V  N      
Sbjct: 16  VTLEPGESITAEAGAMVSMDSMLTMKTEFSGGFLPALGKKLFGGESLFVNVFANKTSRPL 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
            V ++   +  I  I+L   G EL  QP A++     VK++         ++G  EG  +
Sbjct: 76  QVVLSQSVIGDIAEIELQ--GRELYFQPGAYIAHTAGVKMTVGWAGFTSWLMG--EGLFK 131

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
            K+ G G  F  A G + +K +  G+ I VD S +VA    + + IK +G +  ++  G+
Sbjct: 132 LKIGGIGKVFFGAYGGITKKTI-AGQFI-VDNSHLVAYEPGIKMGIKLSGGLLGSLTSGE 189

Query: 216 NLVTAVVTGPGIVFIQSLPFHRLSQ----RIARAVTSPNMRENPKFFVQIAIF 264
            LV   ++G G +++QS  F  L +    +I +       R   +  +Q  +F
Sbjct: 190 GLVNR-LSGHGDIYLQSRSFDGLLRFLRPQIGKIANQARGRRKVRGMIQDRVF 241


>gi|239611489|gb|EEQ88476.1| DUF124 domain-containing protein [Ajellomyces dermatitidis ER-3]
          Length = 334

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 91/189 (48%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  M + + G  +++     PG       + AP+L
Sbjct: 139 LSAKPGAMIAMSPTMTLKG----SVKFSMKKLIAGGEMSTSTYTGPGEL-----LLAPTL 189

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + + L + G +      DAFL +   V V +  +Q +   +   EG    K++G GL
Sbjct: 190 LGDITV-LRLNGHDTWSVGKDAFLAATQGV-VKDYKNQGLSKALFSGEGLFVYKMSGNGL 247

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +VA      ++   +G I   +  G+ LV    T
Sbjct: 248 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWNCKYVMERIASGGIISGMSAGEGLVCK-FT 306

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 307 GPGTVYLQT 315


>gi|390938192|ref|YP_006401930.1| hypothetical protein Desfe_0450 [Desulfurococcus fermentans DSM
           16532]
 gi|390191299|gb|AFL66355.1| TIGR00266 family protein [Desulfurococcus fermentans DSM 16532]
          Length = 220

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 95/205 (46%), Gaps = 18/205 (8%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMEN-----IYIPENEVGMWQWLFGKTITSVVL 87
            + ++ L P E +    GS     G++++E      +      V   + LF   IT++  
Sbjct: 12  SILKVSLDPGESITIEAGSYMLHKGNVKIETTTGGVLSAIARAVAGGESLFLDKITALT- 70

Query: 88  RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
               PS+ ++    P    I+ ID+    GEL+ Q   +L       +     + +R ++
Sbjct: 71  ----PSEVWIAPNVP--GDIVAIDVD---GELIVQDTGYLAHAGGFDIGIAW-RGLRGLL 120

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
              E F   +LTG+G  F+ + G++ +  L+ GE  T+D   +VA+  SV   IK    +
Sbjct: 121 AEGELFWL-RLTGKGTVFVNSYGAIEELVLKPGEKATIDNFHLVAMEPSVKWSIKTFSGL 179

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  + GG+ LV   V GPG +++Q+
Sbjct: 180 KSTILGGEGLVVE-VEGPGKLWVQT 203


>gi|315641638|ref|ZP_07896702.1| conserved hypothetical protein [Enterococcus italicus DSM 15952]
 gi|315482598|gb|EFU73129.1| conserved hypothetical protein [Enterococcus italicus DSM 15952]
          Length = 269

 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/229 (23%), Positives = 97/229 (42%), Gaps = 15/229 (6%)

Query: 23  IPFQILGG-EAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + +++LGG ++ + QI LK  E +    G+M + SG      +   ++ +G      G++
Sbjct: 37  MNYKLLGGTDSPILQIQLKTGESIQLERGAMAYHSGVELTGKLNASKSGLGGLMHAIGRS 96

Query: 82  ITS----VVLRNPG-PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS 136
           +TS     +    G   D F+GIA     +I P+ ++    +      AFL   + V   
Sbjct: 97  MTSGESLFISSATGLTEDSFIGIAPGKPGKIHPLQVSN-AKQYCLNTGAFLACDSQVTYK 155

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
                      GG  G    + TG G  F+ A G +++  +   + +T+D   +VA    
Sbjct: 156 MARQNISGAFFGGTGGLFVMETTGDGTIFVSAFGDLLELTVSPNQALTIDNEHVVA---- 211

Query: 197 VNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            +  + Y+  +    FG   G+ LV  V  G G V IQ+     L + +
Sbjct: 212 WDTNLTYDIQVASGTFGFKTGEGLVN-VFQGSGKVIIQTRNLRNLGEAL 259


>gi|55377179|ref|YP_135029.1| hypothetical protein rrnAC0266 [Haloarcula marismortui ATCC 43049]
 gi|448640547|ref|ZP_21677450.1| hypothetical protein C436_11803 [Haloarcula sinaiiensis ATCC 33800]
 gi|55229904|gb|AAV45323.1| unknown [Haloarcula marismortui ATCC 43049]
 gi|445761857|gb|EMA13096.1| hypothetical protein C436_11803 [Haloarcula sinaiiensis ATCC 33800]
          Length = 237

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 62/238 (26%), Positives = 96/238 (40%), Gaps = 25/238 (10%)

Query: 36  QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGPS 93
           +  L   E ++A PG+M   S ++E+E     +  +   + + G    + +      GP 
Sbjct: 15  RAALDSGESILAEPGAMVSHSPTIEIETTTSRDGLLSSAKSMLGGESLLANEFTARGGP- 73

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
            G V +A P+   +   +L    GE L   D AFL S   + + +          GGI+ 
Sbjct: 74  -GTVTLAPPTPGDVHHHELT---GETLYAVDGAFLASDPGIDIDSEF--------GGIKS 121

Query: 153 FLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
            L         L G G   I A G +    L+ GE  T+D   +VA   SVN      G 
Sbjct: 122 LLAGASITPLALKGTGNVLIEAFGGLETVELDAGESYTIDNDHVVAWEESVNFDAHRVGG 181

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAIF 264
           ++  +  G+ LV    TGPG V+ Q+      +  IA A+  P   +N      I  F
Sbjct: 182 LKSTLLSGEGLVMD-FTGPGTVWYQTRGLASFTSAIADAL--PGTGDNDGNSTGIDDF 236


>gi|212546527|ref|XP_002153417.1| DUF124 domain protein [Talaromyces marneffei ATCC 18224]
 gi|210064937|gb|EEA19032.1| DUF124 domain protein [Talaromyces marneffei ATCC 18224]
          Length = 379

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 87/202 (43%), Gaps = 10/202 (4%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           ARPG M  MS ++ ++  +       M + L G  +       PG     V +A   L  
Sbjct: 187 ARPGVMIAMSPTVTLKGSF----HFSMKKLLIGGEMAHSTYTGPGE----VLLAPSVLGD 238

Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
           I  + L   G       DA+L +   VK         + +  G EG+    +TG GL ++
Sbjct: 239 ITILRLTDDGETWKVGRDAYLANTTSVKKKYKAQNLSKTLFSG-EGWFVYNITGPGLVWM 297

Query: 167 LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPG 226
            + G+V++K+L  GE   VD   +VA      ++   +G +  ++  G+ L      GPG
Sbjct: 298 QSFGAVIRKDLVEGESYFVDNGHLVAWNCEYGIERVASGGVISSLSSGEGLA-CRFKGPG 356

Query: 227 IVFIQSLPFHRLSQRIARAVTS 248
            V++Q+   +  + +I  +  S
Sbjct: 357 TVYLQTRNLNAFAMQIGASTAS 378


>gi|357038024|ref|ZP_09099823.1| protein of unknown function DUF124 [Desulfotomaculum gibsoniae DSM
           7213]
 gi|355360580|gb|EHG08338.1| protein of unknown function DUF124 [Desulfotomaculum gibsoniae DSM
           7213]
          Length = 227

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 91/203 (44%), Gaps = 11/203 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNP 90
           + Q+ L+  E + A   +M  MS ++++E         G+ + L G++    T    R  
Sbjct: 13  LLQVNLQKGEVIKAESDAMVAMSTNIDVEGKMDGGLLGGLSRMLTGESFFFQTLSARRGA 72

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           G       + APS+   + +D+ + G   LL Q D FL    +++VS  +    + +  G
Sbjct: 73  GEV-----LLAPSIPGDI-VDVELDGSYSLLVQKDGFLAGAAELQVSTKMQNLAQGIFSG 126

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            EGF   K++G+G+ FI   G +   NLE G+   VD S +VA    +   I+       
Sbjct: 127 -EGFFVVKISGKGMVFINTYGGIHIINLEPGQEYIVDNSHLVAWPDYMQYTIEKASSGWI 185

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
           + F    +      GPG V IQ+
Sbjct: 186 STFTSGEVAVCRFRGPGPVIIQT 208


>gi|295657406|ref|XP_002789272.1| DUF124 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
 gi|226283988|gb|EEH39554.1| DUF124 domain-containing protein [Paracoccidioides sp. 'lutzii'
           Pb01]
          Length = 338

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 91/189 (48%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  M + + G  +++     PG       + AP++
Sbjct: 142 LTAKPGAMIAMSPTMTLKGSV----KFSMKKLIAGGEMSTSTYTGPGEL-----LLAPTM 192

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + + L + G E      DA+L S   V + +  +Q +       EG    K++G GL
Sbjct: 193 LGDISV-LRLSGHETWSVGKDAYLSSTQGV-IKDYKNQGLSKAFFSGEGLFVYKISGNGL 250

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   +D   +VA +    ++   +G I   +  G+ LV    T
Sbjct: 251 LFLQSFGAIIKKDLADGEKYNIDNGHLVAWSCKYVMERVASGGIISGMTSGEGLVCK-FT 309

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 310 GPGSVYMQT 318


>gi|427420415|ref|ZP_18910598.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
 gi|425763128|gb|EKV03981.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
          Length = 220

 Score = 52.4 bits (124), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 88/199 (44%), Gaps = 12/199 (6%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT---ITSVVLRNPGPS 93
           + L+P E++ A  G+M  M G + M   +      G+ +  FG     +     R   P 
Sbjct: 16  VTLQPGERIFAEAGAMASMDGQISMGTGFFGGLIPGLLKAFFGGESLFLNHFANRTNHPQ 75

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
              + +  P++  I  ++L   G EL  QP A++   + VKV+           G  EG 
Sbjct: 76  T--LVLTQPTIGDIARMELN--GNELCLQPGAYIAHSSGVKVALQWAGLASWFAG--EGL 129

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
            + KL G+GL F  A G + +  + V +   VD   +VA    + + I  +G +  ++  
Sbjct: 130 FKLKLGGRGLVFFGAYGGLSE--VTVKDEFIVDNGHLVAYEPGIKMNIGLSGGMVGSITS 187

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+  +   + G G +++QS
Sbjct: 188 GEGFINR-LQGRGKIYLQS 205


>gi|172035646|ref|YP_001802147.1| hypothetical protein cce_0730 [Cyanothece sp. ATCC 51142]
 gi|354554887|ref|ZP_08974190.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
 gi|171697100|gb|ACB50081.1| unknown [Cyanothece sp. ATCC 51142]
 gi|353553041|gb|EHC22434.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
          Length = 225

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 102/207 (49%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
           ++ + +++L   E++VA+ GSM  MSG    S  +          G  + L G+++   V
Sbjct: 10  DSAIAKVILNAGEEIVAQAGSMVAMSGFINTSTTLRQGKGGGIMGGFKRILGGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P  ++G V +A   +  IL  +  M G EL+ Q  ++L S NDV +     Q ++++
Sbjct: 70  FRSP-TAEGEVFLAPRLMGDILVYE--MQGQELVVQAASYLASANDVDLDLGF-QGMKSL 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
             G E      +TG G   + + G + + +++ G+ + VD   IVA   +++ +I K N 
Sbjct: 126 FSG-ESIFWLSVTGYGPVILTSFGGIYEIDVD-GDYV-VDTGHIVAFEKTLDFRIDKANS 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
               A FGG+  +     G G V+ Q+
Sbjct: 183 SWLGAFFGGEGFICR-FHGKGKVYCQT 208


>gi|261205112|ref|XP_002627293.1| DUF124 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
 gi|239592352|gb|EEQ74933.1| DUF124 domain-containing protein [Ajellomyces dermatitidis
           SLH14081]
          Length = 312

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 91/189 (48%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  M + + G  +++     PG       + AP+L
Sbjct: 117 LSAKPGAMIAMSPTMTLKGSV----KFSMKKLIAGGEMSTSTYTGPGEL-----LLAPTL 167

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + + L + G +      DAFL +   V V +  +Q +   +   EG    K++G GL
Sbjct: 168 LGDITV-LRLNGHDTWSVGKDAFLAATQGV-VKDYKNQGLSKALFSGEGLFVYKMSGNGL 225

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +VA      ++   +G I   +  G+ LV    T
Sbjct: 226 LFMQSFGAIIKKDLADGEKYNVDNGHLVAWNCKYVMERIASGGIISGMSAGEGLVCK-FT 284

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 285 GPGTVYLQT 293


>gi|417765794|ref|ZP_12413750.1| TIGR00266 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|400351733|gb|EJP03946.1| TIGR00266 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
          Length = 221

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 89/199 (44%), Gaps = 8/199 (4%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
            ++ L P + + A  G+M +M+  +++E          +++  FG       +     + 
Sbjct: 14  LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAIYRRFFGGESFFFNVFKASVTG 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +G+A      ++ IDL   G  L+ +  A+L S   + +       +R+  GG EG  
Sbjct: 74  GEIGLAPELPGDVVGIDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFFGG-EGIF 129

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
             +  G G  F+ A G ++   ++V    T+D   IVA   S+  +I K  G  +   FG
Sbjct: 130 LLEAFGNGKLFLNAYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFG 187

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV    TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205


>gi|254428814|ref|ZP_05042521.1| conserved hypothetical protein TIGR00266 [Alcanivorax sp. DG881]
 gi|196194983|gb|EDX89942.1| conserved hypothetical protein TIGR00266 [Alcanivorax sp. DG881]
          Length = 220

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 98/211 (46%), Gaps = 13/211 (6%)

Query: 26  QILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-QWLFGKTIT 83
           +I GG A     + L+P E V+    +M  M   ++++ I+      G+  ++L G+T+ 
Sbjct: 4   EIKGGAAFSYIDMDLEPGESVITESDAMSSMDARLDLKAIFNGGLLRGLLRKFLGGETLF 63

Query: 84  SVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
                N   S   + +  P+   I   +L    G L  QP AFL     V    +L  R 
Sbjct: 64  VSRFSNHSDSPARLTVVQPTPGEIRCHELK--DGSLFLQPGAFLACEETV----SLGLRW 117

Query: 144 RNVIGGI--EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
              I  I  EG  R ++ G G  +  A G+++ + ++ GE I VD S +VA   ++ +++
Sbjct: 118 AGFISWIAREGLFRLEVKGTGKVWYGAYGALLDREVD-GEFI-VDTSHLVAYDPTIKLKL 175

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +    +  + F G+ LVT  V G G V IQ+
Sbjct: 176 QLAAGLFSSFFSGEGLVTR-VQGKGRVVIQT 205


>gi|392570467|gb|EIW63640.1| DUF124-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 299

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 95/213 (44%), Gaps = 17/213 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           + L+P  +V A+PGSM  M  +++++     + +  + + L G  ++  +   PG     
Sbjct: 97  LRLQPGAEVKAKPGSMVAMDATVKIKG----KLKFSVKKLLTGGEMSESIFTGPGE---- 148

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
           V +A  +   I+PI L     +     DA+L     V  + T  Q V   +   EG    
Sbjct: 149 VMVAPETWGDIVPIHLDGR-TQWSVGRDAYLACTMGVSRT-TKSQGVGKALFSGEGLFVH 206

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
            + G G+ ++ + G++  + L+ GE   VD   +VA T+   V+    G      F G +
Sbjct: 207 NVVGGGMMWVTSLGAIFSQTLQPGEQWIVDNGHLVAWTAKYKVERIDAG----GFFSGSH 262

Query: 217 LVTAVV---TGPGIVFIQSLPFHRLSQRIARAV 246
               ++   TGPGIV+IQ+     L + I   V
Sbjct: 263 TDEGLICRFTGPGIVYIQTRNPETLGEWIRDQV 295


>gi|325000442|ref|ZP_08121554.1| hypothetical protein PseP1_16822 [Pseudonocardia sp. P1]
          Length = 238

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 58/226 (25%), Positives = 93/226 (41%), Gaps = 39/226 (17%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           +  + L+    V A  G+M  MS  +++      E E G+W         S + R+ G  
Sbjct: 28  LATLWLEAGRSVRAEAGAMVAMSREIDL----TTELEGGLW---------SAMKRSAGGR 74

Query: 94  DGFVGI--AAPSLARIL-----PIDL---AMFGGELLCQPDAFLCSVNDVKVSNTLDQRV 143
              V    A     R+L     P DL   A+ G        A+L +  DV V        
Sbjct: 75  SASVSTFTAQDGRGRLLLAPPAPGDLVAAALSGNSYDIAGHAYLAATEDVAVDTQW---- 130

Query: 144 RNVIGGIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
               GG + F         + +G G+ F+ A G+++ + L  GE + VD   +VA  SS+
Sbjct: 131 ----GGAKAFFASDNALVVRASGDGVVFLSAFGALLPQELAAGETLIVDTGHLVAWDSSM 186

Query: 198 NVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
             Q+ K      +++  G+ LV A  TGPG + +QS     L+  +
Sbjct: 187 QYQVRKAAKSAWKSMTSGEGLV-AEFTGPGNLLLQSRNLEALADAL 231


>gi|406836321|ref|ZP_11095915.1| hypothetical protein SpalD1_31979 [Schlesneria paludicola DSM
           18645]
          Length = 229

 Score = 52.0 bits (123), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 93/211 (44%), Gaps = 23/211 (10%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVLR 88
           V +  L   E+VVA+P SM   +  + +          G W       L G++    V R
Sbjct: 13  VVEFQLNESEEVVAQPNSMISFTPGIHISAAIGQTQGEGSWFAGFKSLLGGESFFRAVFR 72

Query: 89  NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
               +   + +A  +   ILPI+L       L +  ++L  + D +    LD R     G
Sbjct: 73  AKRDAQTLL-LAPDNNGDILPIELNASETWYLSR-GSYLAHIGDCR----LDARY----G 122

Query: 149 GIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ-I 201
           G++G L +       ++G G  F    G+++++ L  GE   +D   +VA + +V  + +
Sbjct: 123 GVKGVLSKTGLFLLHVSGVGTLFCQTYGAIMERELGEGEQFLIDNRYVVAFSGTVEYELV 182

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           K +  +R +   G+ +V    TGPG V+ Q+
Sbjct: 183 KASRSLRHSFMSGEGMVNR-YTGPGRVYYQT 212


>gi|315043859|ref|XP_003171305.1| DUF124 domain-containing protein [Arthroderma gypseum CBS 118893]
 gi|311343648|gb|EFR02851.1| DUF124 domain-containing protein [Arthroderma gypseum CBS 118893]
          Length = 330

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 89/189 (47%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS S+ ++       +  M + L G  ++S     PG       + APS+
Sbjct: 133 LTAKPGAMIAMSPSVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 183

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + I L + G E      DAFL     V V +  +Q +       EG    K++G G+
Sbjct: 184 LGDISI-LKLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 241

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +V+      ++   +G +   +  G+ LV    T
Sbjct: 242 LFVQSFGAIIRKDLADGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 300

Query: 224 GPGIVFIQS 232
           GPG +++Q+
Sbjct: 301 GPGTIYLQT 309


>gi|423232305|ref|ZP_17218706.1| hypothetical protein HMPREF1063_04526 [Bacteroides dorei
           CL02T00C15]
 gi|423242816|ref|ZP_17223892.1| hypothetical protein HMPREF1064_00098 [Bacteroides dorei
           CL02T12C06]
 gi|392624606|gb|EIY18686.1| hypothetical protein HMPREF1063_04526 [Bacteroides dorei
           CL02T00C15]
 gi|392646809|gb|EIY40516.1| hypothetical protein HMPREF1064_00098 [Bacteroides dorei
           CL02T12C06]
          Length = 219

 Score = 51.6 bits (122), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/112 (32%), Positives = 57/112 (50%), Gaps = 4/112 (3%)

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           L+C    ++ S N V+VS  +   +  ++GG+ G L QK++G    F+   G  + K L 
Sbjct: 100 LICNRGVYVASNNLVQVSAKIS--IAGIVGGMGGLL-QKISGTSTVFLDCKGQPITKELC 156

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFI 230
            GE I VD   I+A+      Q+     ++  VFGG+ L    +TGPG V +
Sbjct: 157 AGETIEVDEDHIIAMQGISEHQLSSAWSLKN-VFGGEGLSMLRITGPGKVHL 207


>gi|327348500|gb|EGE77357.1| hypothetical protein BDDG_00294 [Ajellomyces dermatitidis ATCC
           18188]
          Length = 334

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 50/189 (26%), Positives = 90/189 (47%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  M + + G  +++     PG       + AP+L
Sbjct: 139 LSAKPGAMIAMSPTMTLKGSV----KFSMKKLIAGGEMSTSTYTGPGEL-----LLAPTL 189

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + + L + G +      DAFL +   V V +  +Q +   +   EG    K++G GL
Sbjct: 190 LGDITV-LRLNGHDTWSVGKDAFLAATQGV-VKDYKNQGLSKALFSGEGLFVYKMSGNGL 247

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+   G++++K+L  GE   VD   +VA      ++   +G I   +  G+ LV    T
Sbjct: 248 LFMQNFGAIIKKDLADGEKYNVDNGHLVAWNCKYVMERIASGGIISGMSAGEGLVCK-FT 306

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 307 GPGTVYLQT 315


>gi|336473276|gb|EGO61436.1| hypothetical protein NEUTE1DRAFT_120406 [Neurospora tetrasperma
           FGSC 2508]
 gi|350293451|gb|EGZ74536.1| DUF124-domain-containing protein [Neurospora tetrasperma FGSC 2509]
          Length = 446

 Score = 51.6 bits (122), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 49/187 (26%), Positives = 87/187 (46%), Gaps = 13/187 (6%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           A+PGSM  MS S+ ++  Y    +  M + + G  + +     PG       + AP++  
Sbjct: 255 AKPGSMIAMSPSVSLKGAY----KFSMKKLVAGGEMGTSTYTGPGEL-----LLAPAMLG 305

Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
            +   L + G +      D +L S  +V   +   Q +   +   EG    K++G GL +
Sbjct: 306 DI-TSLRLDGSQTWSVSHDGYLASTQNV-TKDYKRQGLGKAMFSGEGLWVYKISGTGLLW 363

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
           + + G++V+K+L  GE   VD   +VA  +   ++   +G +      G+ LV    TGP
Sbjct: 364 LTSFGAIVRKDLADGEKYIVDNGHLVAWNTKYILERVASGGLLSGFASGEGLV-CKFTGP 422

Query: 226 GIVFIQS 232
           G +FIQ+
Sbjct: 423 GTIFIQT 429


>gi|126656110|ref|ZP_01727494.1| hypothetical protein CY0110_03469 [Cyanothece sp. CCY0110]
 gi|126622390|gb|EAZ93096.1| hypothetical protein CY0110_03469 [Cyanothece sp. CCY0110]
          Length = 220

 Score = 51.2 bits (121), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 51/197 (25%), Positives = 91/197 (46%), Gaps = 8/197 (4%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG-MWQWLFGKTITSVVLRNPGPSDG 95
           + L P E ++A  G+M  M G + M+           + ++L G+++   V +N      
Sbjct: 16  VTLAPGESIIAEAGAMASMDGRLSMKTKLSGGFFAAILRKFLGGESLFVNVFKNQTSQPL 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
            + +    +  I  IDL   G E+  QP A++ S   + +            G  EG  +
Sbjct: 76  TLVLTQSIIGDIECIDLR--GNEICFQPGAYIASSPKINLGVQWAGFASWFAG--EGLFK 131

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
            K++G G  F  A G + ++ +  GE I VD S +VA +  + + I  +G +  +V  G+
Sbjct: 132 LKVSGHGTVFFGAYGGISKQTIS-GEFI-VDNSHLVAYSPEIKMSIGLSGGLISSVTSGE 189

Query: 216 NLVTAVVTGPGIVFIQS 232
            LV   + G GI+++QS
Sbjct: 190 GLVNR-LKGNGIIYLQS 205


>gi|451997933|gb|EMD90398.1| hypothetical protein COCHEDRAFT_1139761 [Cochliobolus
           heterostrophus C5]
          Length = 250

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 92/201 (45%), Gaps = 11/201 (5%)

Query: 48  RPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARI 107
           +PGSM  MS +M ++       +  + + L G  ++  +   PG     + +A P++  I
Sbjct: 61  KPGSMIAMSPTMTLKG----NLKFSLKKALIGGEMSKSIYTGPGE----LLLAPPAIGDI 112

Query: 108 LPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFIL 167
             I L       L + DAFL     + + +      + +  G EGF   K++G G+ +  
Sbjct: 113 TIIKLGGNDTWSLGR-DAFLACTQGIVMEHKSQGISKGMFSG-EGFFVYKISGTGILWCT 170

Query: 168 AGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGI 227
           + G++++K+L   E   +D   +VA      ++   +G I   +  G+ LV    TGPG 
Sbjct: 171 SFGAIIRKDLAANEKYIIDNGHLVAWNCKYVLERVASGGIVSTMAAGEGLVCK-FTGPGT 229

Query: 228 VFIQSLPFHRLSQRIARAVTS 248
           VF+Q+      +Q +A   T+
Sbjct: 230 VFMQTRNATAFAQFMASHATA 250


>gi|336055002|ref|YP_004563289.1| hypothetical protein WANG_1492 [Lactobacillus kefiranofaciens ZW3]
 gi|333958379|gb|AEG41187.1| Protein of hypothetical function DUF124 [Lactobacillus
           kefiranofaciens ZW3]
          Length = 250

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 51/215 (23%), Positives = 93/215 (43%), Gaps = 10/215 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITS-----VVL 87
           +  I L+P EKV  + GSM + S S+++   +    + +G      G++I+S     +  
Sbjct: 32  LIDIELEPSEKVYIQRGSMVYHSPSIKLNTKVNASGSGIGKLVRAMGRSISSGESFWITE 91

Query: 88  RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
                  G + IA      +LP++L     +       FL        S       R ++
Sbjct: 92  AQAHKELGHLAIAPGLPGEVLPLNLGE--KQYRINDGKFLAMDGSASYSMKKQTLGRALL 149

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
            G  GF     +G G+    A GS+ + +L   E IT+D + +VA ++ +   I +    
Sbjct: 150 SGTGGFFVMTTSGDGVVLCNAYGSIKKIDLNDDE-ITIDNNHVVAWSTDLKYDIHFESGF 208

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            +++  G+ +V     G G V+IQSL     +++I
Sbjct: 209 VQSIGTGEGVVN-TFKGTGEVYIQSLNLETFAKKI 242


>gi|218244960|ref|YP_002370331.1| hypothetical protein PCC8801_0069 [Cyanothece sp. PCC 8801]
 gi|257057985|ref|YP_003135873.1| hypothetical protein Cyan8802_0067 [Cyanothece sp. PCC 8802]
 gi|218165438|gb|ACK64175.1| protein of unknown function DUF124 [Cyanothece sp. PCC 8801]
 gi|256588151|gb|ACU99037.1| protein of unknown function DUF124 [Cyanothece sp. PCC 8802]
          Length = 220

 Score = 51.2 bits (121), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 48/200 (24%), Positives = 94/200 (47%), Gaps = 14/200 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMWQWLFGKTITSVVLRNPGP 92
           + L+P E ++A  G+M  M G++ M+  +    IP     + ++  G+++   V +N   
Sbjct: 16  LTLEPGETMIAEAGAMASMDGALSMKTEFSGGLIPAL---LKKFFGGESLFVNVFKNTTG 72

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
               V +    +  I  ++L   G E+  QP A++ S + VK+            G  EG
Sbjct: 73  EPLTVVLTQSVIGDIDRLELN--GNEMYLQPGAYIASTSKVKLGVGWAGFASWFAG--EG 128

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
             + K++G G     A G +++  +  GE I VD S +VA +  + + I  +G    ++ 
Sbjct: 129 LFKLKVSGTGRVLFGAYGGIIKHRIS-GEFI-VDNSHLVAYSPKIKMSIGLSGNFISSIT 186

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
            G+  +   + G G++++QS
Sbjct: 187 SGEGFINR-LKGDGVIYLQS 205


>gi|448377165|ref|ZP_21560008.1| hypothetical protein C479_12204 [Halovivax asiaticus JCM 14624]
 gi|445656046|gb|ELZ08887.1| hypothetical protein C479_12204 [Halovivax asiaticus JCM 14624]
          Length = 225

 Score = 51.2 bits (121), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 92/199 (46%), Gaps = 11/199 (5%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--KTITSVVLRNPGPSD 94
           + L   E ++A PG+M   S ++E+      +  +   + + G    + +      GP  
Sbjct: 16  VELAAGETILAEPGAMVGHSPNVEISTTSSRDGLLSSAKSMLGGESLVVNEFTAEGGP-- 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           G V +A P+   +   +L     E L   D AFL S  D+ + +     +++++GG    
Sbjct: 74  GEVRLAPPTPGDVKEHELQ---DETLYTTDGAFLASSPDIDIDSEF-GGLKSMLGGAS-L 128

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
               L G G A+I A G + + +L+ GE  T+D   ++A   SV  + +  G ++  +  
Sbjct: 129 TPLALKGTGTAYIDAYGGLERLDLDPGESYTLDNEHLIAWDDSVEFETRRVGGLKSTLLS 188

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV    TGPG V+ Q+
Sbjct: 189 GEGLVFD-FTGPGSVWYQT 206


>gi|346318745|gb|EGX88347.1| DUF124 domain-containing protein [Cordyceps militaris CM01]
          Length = 475

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 49/191 (25%), Positives = 86/191 (45%), Gaps = 21/191 (10%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVG-----IAA 101
           A+PGSM  MS +M +          G +++   K I    L +      ++G     +A 
Sbjct: 283 AKPGSMIAMSPTMTLR---------GEFKFSVKKMIAGAELGH----SNYIGPGELLLAP 329

Query: 102 PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
           P L  I  I L     +  C  DA+L S   V+  +      + +  G EG    K++G 
Sbjct: 330 PMLGDITTIRLDG-AAQWTCGHDAYLASTQGVQKDHKRQGLGKAMFSG-EGLFVYKMSGT 387

Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
           G+ ++ + G++++K+L  GE   +D   +VA  +   ++   +G I       + LV   
Sbjct: 388 GILWVSSFGAIIRKDLADGEKYIIDNGHLVAWNTKYILERVASGGIISNFASAEGLVCK- 446

Query: 222 VTGPGIVFIQS 232
            TGPG V++Q+
Sbjct: 447 FTGPGTVYMQT 457


>gi|380482182|emb|CCF41393.1| hypothetical protein CH063_11691 [Colletotrichum higginsianum]
          Length = 199

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 53/187 (28%), Positives = 86/187 (45%), Gaps = 13/187 (6%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           A+PG M  MS S+ +      + +  M + + G  + S     PG     + +A P L  
Sbjct: 7   AKPGVMVAMSHSITLRG----QLKFSMKKLVAGAEMASSTFVGPGE----LLLAPPMLGD 58

Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
           I  + L   G E      DA+L S   V + +   Q +   I   EG    K++G GL +
Sbjct: 59  ITSLRLT--GQETWSVGQDAYLASTQGV-IKDYKRQGLSKAIFSGEGLYVYKISGTGLLW 115

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
           + + G++++K+L  GE   VD   +VA      ++   +G I   +  G+ LV    TGP
Sbjct: 116 LTSFGAIIRKDLMDGEKYVVDNGHLVAWNVKYIMERVASGGIISGLSSGEGLVCK-FTGP 174

Query: 226 GIVFIQS 232
           G VF+Q+
Sbjct: 175 GTVFMQT 181


>gi|402834605|ref|ZP_10883204.1| TIGR00266 family protein [Selenomonas sp. CM52]
 gi|402277553|gb|EJU26627.1| TIGR00266 family protein [Selenomonas sp. CM52]
          Length = 229

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 48/144 (33%), Positives = 73/144 (50%), Gaps = 15/144 (10%)

Query: 110 IDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI---EGFLRQKLTGQGLAF 165
           +D+A+ G   ++ Q   FL +   V +    D +++ ++ G    EGF   KLTG+G+AF
Sbjct: 88  MDVALDGSYGMIVQKGGFLAAEETVNI----DSKMQGLMQGFFSQEGFFLLKLTGKGIAF 143

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY--NGPIRRAVFGGDNLVTAVVT 223
           + + G +   NLE GE + VD   +VA    +N +I+   NG I  +V  G+ LV     
Sbjct: 144 LSSYGVIHVLNLEAGEEVIVDNGHLVAWPDYMNYKIEKASNGWI-SSVMSGECLVCR-FR 201

Query: 224 GPGIVFIQS---LPFHRLSQRIAR 244
           GPG V IQ+     F    Q I R
Sbjct: 202 GPGPVLIQTRNPAGFETWIQEIVR 225


>gi|422600397|ref|ZP_16674232.1| hypothetical protein PSYMO_31452, partial [Pseudomonas syringae pv.
           mori str. 301020]
 gi|330893030|gb|EGH25691.1| hypothetical protein PSYMO_31452 [Pseudomonas syringae pv. mori
           str. 301020]
          Length = 220

 Score = 50.8 bits (120), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 41/145 (28%), Positives = 66/145 (45%), Gaps = 26/145 (17%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY--IPENEVGMWQWLF-- 78
           + F++ G E Q  ++ L P E  VA  G+M + +  ++ME I+        G+   LF  
Sbjct: 34  VDFKLYGAETQFVELELDPGESAVAEAGAMMYKTCDVQMETIFGDGSNQSSGLLGSLFGA 93

Query: 79  ------GKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGG---------ELLCQP 123
                 G+++ + V    G   G V  AAP    ILP++L  FGG          +  QP
Sbjct: 94  GKRMLTGESLFTTVFSQQGSGKGRVAFAAPYPGTILPLNLRDFGGYRNWSWKEPAVQYQP 153

Query: 124 DAFLCSVNDVKVSNTL-DQRVRNVI 147
           D      +D ++ + L +QR+R  I
Sbjct: 154 D------DDPRLKSDLTEQRIRQSI 172


>gi|410584105|ref|ZP_11321210.1| TIGR00266 family protein [Thermaerobacter subterraneus DSM 13965]
 gi|410504967|gb|EKP94477.1| TIGR00266 family protein [Thermaerobacter subterraneus DSM 13965]
          Length = 226

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 54/207 (26%), Positives = 93/207 (44%), Gaps = 26/207 (12%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVLRNPG 91
           + L+P E++ A PG++  MSG++ ++     E   G+        L G++  +   R  G
Sbjct: 16  VELEPGEELQAEPGALVSMSGNVSLQ----AEARGGLLGALSRSLLGGESFFTSRYRADG 71

Query: 92  PSDGFVGIAAPSLARILPID---LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
                +      LA  LP D   L + G  +  +  A+L     +    T+D R     G
Sbjct: 72  ARGEVM------LAPALPGDITVLELAGEVVYLKSGAYLAGATTL----TVDARWGGARG 121

Query: 149 --GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
             G  G    ++ G G  FI + G++ +K+L  G+   VD   +VA T  +  Q++  G 
Sbjct: 122 FFGSGGLFLLRVAGHGALFINSYGALHRKDLAPGQRYVVDTGHVVAFTDGMAFQVRTAGS 181

Query: 207 -IRRAVFGGDNLVTAVVTGPGIVFIQS 232
            +  ++  G+ L     TGPG V+IQ+
Sbjct: 182 GLFSSIASGEGLACE-FTGPGTVYIQT 207


>gi|260887588|ref|ZP_05898851.1| conserved hypothetical protein [Selenomonas sputigena ATCC 35185]
 gi|330837957|ref|YP_004412537.1| hypothetical protein Selsp_0098 [Selenomonas sputigena ATCC 35185]
 gi|260862604|gb|EEX77104.1| conserved hypothetical protein [Selenomonas sputigena ATCC 35185]
 gi|329745721|gb|AEB99077.1| protein of unknown function DUF124 [Selenomonas sputigena ATCC
           35185]
          Length = 229

 Score = 50.8 bits (120), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 69/129 (53%), Gaps = 12/129 (9%)

Query: 110 IDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI---EGFLRQKLTGQGLAF 165
           +D+A+ G   ++ Q   FL +   V +    D +++ ++ G    EGF   KLTG+G+AF
Sbjct: 88  MDVALDGSYGMIVQKGGFLAAEETVNI----DSKMQGLMQGFFSQEGFFLLKLTGKGIAF 143

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY--NGPIRRAVFGGDNLVTAVVT 223
           + + G +   NLE GE + VD   +VA    +N +I+   NG I  +V  G+ LV     
Sbjct: 144 LSSYGVIHVLNLEAGEEVIVDNGHLVAWPDYMNYKIEKASNGWI-SSVMSGECLVCR-FR 201

Query: 224 GPGIVFIQS 232
           GPG V IQ+
Sbjct: 202 GPGPVLIQT 210


>gi|374725017|gb|EHR77097.1| putative protein of unknown function DUF124 [uncultured marine
           group II euryarchaeote]
          Length = 216

 Score = 50.4 bits (119), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 98/213 (46%), Gaps = 19/213 (8%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEME------NIYIPENEVGMW-QWLFGKTITSVVLRNPG 91
           L P E V   PG+M  M G +EM+       I+    ++ +  +  F  T T+      G
Sbjct: 5   LSPGESVNVEPGAMVAMGG-VEMQTRKSGGGIFKSLKKMALGGESFFLNTFTA------G 57

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
            + G+V +A  S   I   +L   G  L  Q  ++L S  +++ ++T  Q  +  + G  
Sbjct: 58  AAGGWVSVAPGSPGDIQGFNLVP-GQALFIQGGSYLASTLNIQ-TDTKFQGAKGFLSGES 115

Query: 152 GFLRQKL--TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            F  Q     GQG  +  + G++ Q  +E G+ ITVD   +VA  + V   I   G ++ 
Sbjct: 116 MFFVQATCAEGQGRVWYNSFGAIKQIPVEPGQTITVDTGHVVAFDAGVQYTISKVGGMKS 175

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
            + GG+ LV    TG G V+IQ+     L+ ++
Sbjct: 176 FLVGGEGLVMN-FTGQGNVWIQTRNVGSLAAKL 207


>gi|427724444|ref|YP_007071721.1| hypothetical protein Lepto7376_2616 [Leptolyngbya sp. PCC 7376]
 gi|427356164|gb|AFY38887.1| protein of unknown function DUF124 [Leptolyngbya sp. PCC 7376]
          Length = 221

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 50/197 (25%), Positives = 88/197 (44%), Gaps = 8/197 (4%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTITSVVLRNPGPSDG 95
           + L P E+V A  G+M  M G + M+  +       + +  F G+++   + RN      
Sbjct: 16  VTLDPGEQVTAEAGAMVSMDGGITMKTEFSGGFVQAILRSFFGGESLFVNIYRNTSNKKQ 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
            V +    +  I  IDL+   G +  QP A++       +            G  EG  +
Sbjct: 76  TVILTQSMIGDIHRIDLSQ--GPICFQPGAYIAHTPGANMGVKWAGFASWFAG--EGLFK 131

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
            + TG+G  F    G +++K +  GE I VD S +VA    + + I+ +G +  ++  G+
Sbjct: 132 LEFTGKGRVFYGCYGGIMEKQI-AGEYI-VDNSHLVAYEPGITMNIRMSGGLLGSLTSGE 189

Query: 216 NLVTAVVTGPGIVFIQS 232
            LV   + G G V++QS
Sbjct: 190 GLVNK-LKGRGKVYLQS 205


>gi|182417418|ref|ZP_02948746.1| conserved hypothetical protein [Clostridium butyricum 5521]
 gi|237667008|ref|ZP_04526992.1| conserved hypothetical protein [Clostridium butyricum E4 str. BoNT
           E BL5262]
 gi|182378732|gb|EDT76258.1| conserved hypothetical protein [Clostridium butyricum 5521]
 gi|237655356|gb|EEP52912.1| conserved hypothetical protein [Clostridium butyricum E4 str. BoNT
           E BL5262]
          Length = 226

 Score = 50.4 bits (119), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 49/152 (32%), Positives = 72/152 (47%), Gaps = 13/152 (8%)

Query: 99  IAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI---EGFL 154
           I A SL   I+PI L   G  L  Q D FL S  DV     L+ +V+N+  G+   EGF 
Sbjct: 76  IFAHSLPGDIVPITLNGTG--LKIQKDGFLASTIDVD----LETKVQNLTKGLFSGEGFF 129

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV--NVQIKYNGPIRRAVF 212
                GQG  F+ + G++ + NL   E I +D S +VA   ++  N++         +  
Sbjct: 130 VLGARGQGTVFVSSYGAIHEINLNSNEEIIIDNSHLVAWDETLEYNIEKASKKGWISSFT 189

Query: 213 GGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
            G+ LV     GPG +FIQ+     +   IA+
Sbjct: 190 SGEGLVCR-FKGPGKIFIQTRNPDGMKNWIAK 220


>gi|119715736|ref|YP_922701.1| hypothetical protein Noca_1500 [Nocardioides sp. JS614]
 gi|119536397|gb|ABL81014.1| protein of unknown function DUF124 [Nocardioides sp. JS614]
          Length = 231

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 89/207 (42%), Gaps = 22/207 (10%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG------KTITSVVL 87
             ++ L   E V A  G+M  MS  + ME         G+ + + G       T T+ V 
Sbjct: 13  TAKLTLAAGEAVRAESGAMLAMSPGIAMETSTQGGVLKGLKRSVLGGESFFMNTFTAQV- 71

Query: 88  RNPGPSDGFVGIAAPSLARILPIDLAMF--GGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
                 DG     AP+L    P D+  +   G L  Q  +FL + ++++V          
Sbjct: 72  ------DGAELYVAPAL----PGDVVAWPLNGVLYVQSGSFLAASSNIQVDTKWGGG--K 119

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
                EG    ++ GQG   + + G++   +L+ G+  TVD   +VA T  V   ++  G
Sbjct: 120 TFFSSEGLFMLRIEGQGELVLSSYGAIHAIDLQAGQTYTVDTGHMVAWTDGVQYNVRKVG 179

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             +   F G+ LV   +TGPG +++Q+
Sbjct: 180 NWKSTFFSGEGLVCD-LTGPGRIYLQT 205


>gi|417783112|ref|ZP_12430835.1| TIGR00266 family protein [Leptospira interrogans str. C10069]
 gi|418726164|ref|ZP_13284775.1| TIGR00266 family protein [Leptospira interrogans str. UI 12621]
 gi|409953813|gb|EKO08309.1| TIGR00266 family protein [Leptospira interrogans str. C10069]
 gi|409960074|gb|EKO23828.1| TIGR00266 family protein [Leptospira interrogans str. UI 12621]
          Length = 221

 Score = 50.4 bits (119), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 52/199 (26%), Positives = 88/199 (44%), Gaps = 8/199 (4%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
            ++ L P + + A  G+M +M+  +++E          + +  FG       +     + 
Sbjct: 14  LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKASVTG 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +G+A      I+ IDL   G  L+ +  A+L S   + +       +R+  GG EG  
Sbjct: 74  GEIGLAPELPGDIVGIDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFFGG-EGIF 129

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
             +  G G  F+ A G ++   ++V    T+D   IVA   S+  +I K  G  +   FG
Sbjct: 130 LLEAFGNGKLFLNAYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFG 187

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV    TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205


>gi|363891399|ref|ZP_09318578.1| TIGR00266 family protein [Eubacteriaceae bacterium CM2]
 gi|402837607|ref|ZP_10886124.1| TIGR00266 family protein [Eubacteriaceae bacterium OBRC8]
 gi|361965456|gb|EHL18438.1| TIGR00266 family protein [Eubacteriaceae bacterium CM2]
 gi|402274627|gb|EJU23804.1| TIGR00266 family protein [Eubacteriaceae bacterium OBRC8]
          Length = 225

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 43/143 (30%), Positives = 71/143 (49%), Gaps = 10/143 (6%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI-- 150
           DG V + APSL   + ID+ + G   L  Q D FL     ++V    D +V+N+  G+  
Sbjct: 72  DGEV-LLAPSLLGDV-IDVELDGTYSLRVQKDGFLAGTQGIEV----DTKVQNLSKGLFS 125

Query: 151 -EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            EGF    ++G+G  F+ + GS+   NLE G+ I +D + +VA    +  +I+       
Sbjct: 126 GEGFFILNVSGKGTVFLSSYGSIHSINLEAGQEIVIDNNHLVAWPDHMEYKIEKAASGWI 185

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
           + F    ++     GPG + IQ+
Sbjct: 186 SSFTSGEMLVCRFRGPGNILIQT 208


>gi|398412410|ref|XP_003857529.1| hypothetical protein MYCGRDRAFT_65617 [Zymoseptoria tritici IPO323]
 gi|339477414|gb|EGP92505.1| hypothetical protein MYCGRDRAFT_65617 [Zymoseptoria tritici IPO323]
          Length = 362

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/120 (30%), Positives = 59/120 (49%), Gaps = 2/120 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DAFL S   V     + Q     +   EGF   K+ G G+ +I + G++++K+L  GE  
Sbjct: 240 DAFLASTQGV-TKTYVSQNFSKAMFSGEGFFIYKIAGVGICWIQSMGAIIRKDLRDGEKY 298

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIA 243
            +D   +VA      ++   +G I   +  G+ LV    TGPG VF+Q+    + +Q +A
Sbjct: 299 IIDNGHLVAWNCDYVLERVASGGIISNLSAGEGLVCKF-TGPGTVFMQTRNPQQFAQYMA 357


>gi|429191128|ref|YP_007176806.1| hypothetical protein Natgr_1130 [Natronobacterium gregoryi SP2]
 gi|448326682|ref|ZP_21516029.1| hypothetical protein C490_14755 [Natronobacterium gregoryi SP2]
 gi|429135346|gb|AFZ72357.1| TIGR00266 family protein [Natronobacterium gregoryi SP2]
 gi|445610487|gb|ELY64258.1| hypothetical protein C490_14755 [Natronobacterium gregoryi SP2]
          Length = 229

 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 90/210 (42%), Gaps = 24/210 (11%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIYIP-------ENEVGMWQWLFGKTITSVVLRNPG 91
           L+  E +    GSM   S  +EME            ++ +   + LF  T T+   R   
Sbjct: 23  LENGEAITVESGSMVSYSEHVEMETHSSSGGLLSSVKDSMLSGEQLFRNTFTATANRQT- 81

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS---VNDVKVSNTLDQRVRNVIG 148
                V  A      +  ++LA  G  +  Q  +++ +   +    VS   D    +++G
Sbjct: 82  -----VQFAHTQPGDMRALELA--GESVYVQSGSYVANGPGIETDSVSGGFD----SLLG 130

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
           G +G    + +G G  F+ + G +V++ LE GE +T+D    VA   SV       G ++
Sbjct: 131 G-KGLFVLEASGTGDLFVGSYGGIVERELEQGEQVTIDAGHSVAWDESVEFSTHRVGGLK 189

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
           + +  G+  V    +GPG VF+Q+  +  L
Sbjct: 190 KTMLSGEGFV-MTFSGPGRVFLQTRDYDSL 218


>gi|290559433|gb|EFD92765.1| protein of unknown function DUF124 [Candidatus Parvarchaeum
           acidophilus ARMAN-5]
          Length = 220

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 57/231 (24%), Positives = 103/231 (44%), Gaps = 21/231 (9%)

Query: 22  VIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           ++ F  +G E Q  Q  L   E ++   G M F S  ++ME         G++  L    
Sbjct: 1   MVNFTEVGKEIQYIQATLNEGESIMCEGGHMIFKSPDIKMET-----QSGGIFSALSRAM 55

Query: 82  ITSVV--LRNPGPSDGFVGIAAPSLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVS 136
             S V  L+  GP     G+ A   +  LP   I++ + G  ++ + ++FLC  + V  S
Sbjct: 56  AGSSVFLLKLDGP-----GVVA--FSGFLPGEIINIPLQGKGIIAEFNSFLCMDSTVTYS 108

Query: 137 NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSS 196
                     + G EG   ++ +G G  F+   G  ++ +L+ GE I V+ S ++A   +
Sbjct: 109 AKF-AGTWAGLLGGEGLFLERFSGNGNVFLHCHGQPIEFSLKEGESIDVEASHVLAFEDT 167

Query: 197 VNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
               IK  G ++  +     G+    A +TGPG V++ S+   +L+ ++  
Sbjct: 168 AQYSIKRVGGVKTMLLAGMEGEGFFFATITGPGKVWLHSISLAQLAMKMGN 218


>gi|431930111|ref|YP_007243157.1| hypothetical protein Thimo_0688 [Thioflavicoccus mobilis 8321]
 gi|431828414|gb|AGA89527.1| TIGR00266 family protein [Thioflavicoccus mobilis 8321]
          Length = 240

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 92/205 (44%), Gaps = 7/205 (3%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
           G   + ++ L+PQ+ + A+  +M  M  ++++E         G+ +   G++     LR 
Sbjct: 22  GAFALLEVHLQPQQTIKAQSDAMVAMDPTIDVEGKLEGGLLGGLGRMFSGESFFFQTLRA 81

Query: 90  PGPSDGFVGIAAPSL-ARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
              S   V + AP+    +LP+ L   G +  + Q D FL +  +V++S       R + 
Sbjct: 82  ARGSG--VALLAPAQPGDLLPVHLD--GTQPYIIQKDGFLAASEEVQISTAAQNLARGIF 137

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
            G EGF   +  G+G+ F  + G++    +  GE + VD   +VA    ++  ++     
Sbjct: 138 SG-EGFFVLRAQGRGMLFAESYGAIHALEVPAGEEMVVDNGHLVAWPEGMHYSLEKASSG 196

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
             + F     +     GPG V+IQS
Sbjct: 197 WISSFTSGEGLVCRFRGPGTVYIQS 221


>gi|375308935|ref|ZP_09774217.1| hypothetical protein WG8_2742 [Paenibacillus sp. Aloe-11]
 gi|390453566|ref|ZP_10239094.1| hypothetical protein PpeoK3_05980 [Paenibacillus peoriae KCTC 3763]
 gi|375079147|gb|EHS57373.1| hypothetical protein WG8_2742 [Paenibacillus sp. Aloe-11]
          Length = 226

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 96/205 (46%), Gaps = 7/205 (3%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
           G   + ++ L P E + A  G+M  MS S++++         G+ + L G+      LR 
Sbjct: 9   GAFAMLKVQLSPGETIKAEMGAMVSMSSSVDIKGTVDGGLLRGLGRMLSGEKFFFQELRA 68

Query: 90  P-GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIG 148
             GP++  +  A+    + + +D +     LL Q D FL     ++VS  +   ++ +  
Sbjct: 69  ARGPAEVLLAPASIGDVQAVELDGSY---RLLVQKDGFLACTEGIEVSTKMQNLMKGLFS 125

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPI 207
           G EGF   +++G+G  F+ + G++    +  GE   +D + +VA    ++ +I K +   
Sbjct: 126 G-EGFFIVEISGRGTVFLSSYGAIYPLYVAPGEERIIDNAHLVAWPDYMDYKIEKASKGW 184

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
             +V  G+ LV     G G V IQS
Sbjct: 185 LSSVTSGEALVCR-FRGEGTVLIQS 208


>gi|327400089|ref|YP_004340928.1| hypothetical protein Arcve_0174 [Archaeoglobus veneficus SNP6]
 gi|327315597|gb|AEA46213.1| protein of unknown function DUF124 [Archaeoglobus veneficus SNP6]
          Length = 223

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 93/200 (46%), Gaps = 10/200 (5%)

Query: 33  QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
            + ++ L+  E++VA  G+M +M  ++E++         G+ + L G   +  V R    
Sbjct: 12  SLLEVELEDGEEIVAETGAMVYMR-NVELKTEVKGGLLAGLKRSLLGGE-SFFVNRFISK 69

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
             G +G+A P    ++ I L    G +  Q  AFL S  D+ + +T     R    G EG
Sbjct: 70  GKGLLGLAPPYQGDVIHIPLN---GRIYAQSGAFLASSPDISI-DTKWGGARTFFAG-EG 124

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
               KL G G   + + G +  + ++V   + VD   IVA   ++  +++  G ++  + 
Sbjct: 125 LFLLKLDGLGDVLLSSFGGI--ERIDVNGSLIVDTGHIVAFEDTLAFKVRKAGGLKATLL 182

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
            G+ L+ A   G G V+IQ+
Sbjct: 183 SGEGLI-AEFEGEGSVWIQT 201


>gi|304405138|ref|ZP_07386798.1| protein of unknown function DUF124 [Paenibacillus curdlanolyticus
           YK9]
 gi|304346017|gb|EFM11851.1| protein of unknown function DUF124 [Paenibacillus curdlanolyticus
           YK9]
          Length = 225

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 56/203 (27%), Positives = 94/203 (46%), Gaps = 11/203 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNP 90
           + ++ L+  E V A  G+M  MS S+++          G+ + L G+       V  R  
Sbjct: 13  MLKVKLERGESVKAEMGAMVSMSPSIDLTGTVDGGIMRGLGRMLSGEKFFFQELVAARGG 72

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
           G     V +A  S+  I  ++L     +LL Q D FL   + ++V+  +    R ++ G 
Sbjct: 73  GE----VLLAPTSMGDIQAVELDG-SYKLLVQKDGFLAGTSGIEVNTKMQNLSRGLLSG- 126

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EGF   +++G+G  FI + GS+   NL  GE + VD   +VA    V   I K +     
Sbjct: 127 EGFFIVEISGRGTVFISSYGSIHAINLAPGEEVIVDNGHLVAWPDYVQYNIEKASKGWLN 186

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
           ++  G+ +V     G G+V IQ+
Sbjct: 187 SIKSGEGVVCR-FRGEGVVLIQT 208


>gi|336265228|ref|XP_003347387.1| hypothetical protein SMAC_08357 [Sordaria macrospora k-hell]
 gi|380093212|emb|CCC08870.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 453

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/186 (25%), Positives = 85/186 (45%), Gaps = 11/186 (5%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           A+PG+M  MS S+ ++  Y    +  M + + G  + +     PG  +  +G A      
Sbjct: 262 AKPGTMIAMSPSVSLKGAY----KFSMKKLVAGGEMGTSTFTGPG--ELLLGPAMLGDIT 315

Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
            L +D +          D +L S   V + +   Q +   +   EG    K++G GL ++
Sbjct: 316 SLRLDGSQ---TWSVSHDGYLASTQHV-IKDYKRQGLGKAMFSGEGLWVYKISGTGLLWL 371

Query: 167 LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPG 226
            + G++++K+L  GE   VD   +VA  +   ++   +G +      G+ LV    TGPG
Sbjct: 372 TSFGAIIRKDLTDGEKYIVDNGHLVAWNTKYILERVASGGMLSGFASGEGLVCK-FTGPG 430

Query: 227 IVFIQS 232
            VFIQ+
Sbjct: 431 TVFIQT 436


>gi|326483972|gb|EGE07982.1| DUF124 domain-containing protein [Trichophyton equinum CBS 127.97]
          Length = 335

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/189 (25%), Positives = 90/189 (47%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           ++A+PG+M  MS ++ ++       +  M + L G  ++S     PG       + APS+
Sbjct: 138 LIAKPGAMIAMSPTVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 188

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + I L + G E      DAFL     V V +  +Q +       EG    K++G G+
Sbjct: 189 LGDISI-LRLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 246

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +V+      ++   +G +   +  G+ LV    T
Sbjct: 247 LFVQSFGAIIRKDLVDGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 305

Query: 224 GPGIVFIQS 232
           GPG +++Q+
Sbjct: 306 GPGTIYLQT 314


>gi|226292711|gb|EEH48131.1| DUF124 domain-containing protein [Paracoccidioides brasiliensis
           Pb18]
          Length = 328

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 52/203 (25%), Positives = 95/203 (46%), Gaps = 18/203 (8%)

Query: 32  AQVCQIMLKPQEKVVARP-GSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP 90
           A  C +  KP+    A P G+M  MS +M ++       +  M + + G  +++     P
Sbjct: 122 AMGCPLTAKPE----AHPTGAMIAMSPTMTLKGSV----KFSMKKLIAGGEMSTSTYTGP 173

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           G       + AP++   + + L + G E      DA+L S   V + +  +Q +      
Sbjct: 174 GEL-----LLAPTMLGDISV-LRLSGHETWSVGKDAYLSSTQGV-IKDYKNQGLSKAFFS 226

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            EG    K++G GL F+ + G++++K+L  GE   +D   +VA +    ++   +G I  
Sbjct: 227 GEGLFVYKISGNGLLFLQSFGAIIKKDLADGEKYNIDNGHLVAWSCKYVMERVASGGIIS 286

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +  G+ LV    TGPG V++Q+
Sbjct: 287 GMTSGEGLVCK-FTGPGSVYMQT 308


>gi|327296604|ref|XP_003232996.1| hypothetical protein TERG_05993 [Trichophyton rubrum CBS 118892]
 gi|326464302|gb|EGD89755.1| hypothetical protein TERG_05993 [Trichophyton rubrum CBS 118892]
          Length = 331

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 89/190 (46%), Gaps = 15/190 (7%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           ++A+PG+M  MS ++ ++       +  M + L G  ++S     PG       + APS+
Sbjct: 134 LIAKPGAMIAMSPTVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 184

Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
             +  I L    G+       DAFL     V V +  +Q +       EG    K++G G
Sbjct: 185 --LGDISLLRLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTG 241

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
           + F+ + G++++K+L  GE   VD   +V+      ++   +G +   +  G+ LV    
Sbjct: 242 ILFVQSFGAIIRKDLVDGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-F 300

Query: 223 TGPGIVFIQS 232
           TGPG +++Q+
Sbjct: 301 TGPGTIYLQT 310


>gi|363890242|ref|ZP_09317581.1| hypothetical protein HMPREF9628_02023 [Eubacteriaceae bacterium
           CM5]
 gi|361965849|gb|EHL18817.1| hypothetical protein HMPREF9628_02023 [Eubacteriaceae bacterium
           CM5]
          Length = 225

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 67/140 (47%), Gaps = 4/140 (2%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
           DG V + APSL   + ID+ + G   L  Q D FL     ++V   +    + +  G EG
Sbjct: 72  DGEV-LLAPSLLGDV-IDVELDGTYSLRVQKDGFLAGTQGIEVDTKMQNLSKGLFSG-EG 128

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
           F    ++G+G  F+ + GS+   NLE G+ I +D + +VA    +  +I+       + F
Sbjct: 129 FFILNVSGKGTVFLSSYGSIHSINLEAGQEIVIDNNHLVAWPDHMEYKIEKAASGWISSF 188

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
               ++     GPG + IQ+
Sbjct: 189 TSGEMLVCRFRGPGNILIQT 208


>gi|297617354|ref|YP_003702513.1| hypothetical protein Slip_1175 [Syntrophothermus lipocalidus DSM
           12680]
 gi|297145191|gb|ADI01948.1| protein of unknown function DUF124 [Syntrophothermus lipocalidus
           DSM 12680]
          Length = 226

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 55/202 (27%), Positives = 90/202 (44%), Gaps = 16/202 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI--TSVVLRNPGPSD 94
           + L P E + A PG+M  MS ++ ++     E + G    L  K +   S  + N   ++
Sbjct: 16  VKLNPGEGIQAEPGAMVSMSPTISLKT----EMKGGFMGALKRKVLGGESFFVSNY-VAE 70

Query: 95  GFVGIAAPSLARILPID---LAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
           G  G    +LA  +P D   L +       Q  ++L     +++            G  E
Sbjct: 71  G--GSGEVTLAPAVPGDVMGLVLNNETYFVQSGSYLAGDTSLQIDTKFGGAKTFFAG--E 126

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP-IRRA 210
           G    K++G G+  + + G++++K L  GE   VD S IVA  S+VN  IK     I   
Sbjct: 127 GLFLLKISGSGMLILTSYGAIMEKKLVPGERYVVDTSHIVAFQSTVNYSIKKAAAGIMST 186

Query: 211 VFGGDNLVTAVVTGPGIVFIQS 232
           +  G+ LV     GPG +F+Q+
Sbjct: 187 ITSGEGLVCE-YEGPGTIFLQT 207


>gi|363894754|ref|ZP_09321822.1| hypothetical protein HMPREF9629_00289 [Eubacteriaceae bacterium
           ACC19a]
 gi|361961549|gb|EHL14740.1| hypothetical protein HMPREF9629_00289 [Eubacteriaceae bacterium
           ACC19a]
          Length = 225

 Score = 50.1 bits (118), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 40/140 (28%), Positives = 67/140 (47%), Gaps = 4/140 (2%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGG-ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
           DG V + APSL   + ID+ + G   L  Q D FL     ++V   +    + +  G EG
Sbjct: 72  DGEV-LLAPSLLGDV-IDVELDGTYSLRVQKDGFLAGTQGIEVDTKMQNLSKGLFSG-EG 128

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
           F    ++G+G  F+ + GS+   NLE G+ I +D + +VA    +  +I+       + F
Sbjct: 129 FFILNVSGKGTVFLSSYGSIHSINLEAGQEIVIDNNHLVAWPDHMEYKIEKAASGWISSF 188

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
               ++     GPG + IQ+
Sbjct: 189 TSGEMLVCRFRGPGNILIQT 208


>gi|392867521|gb|EAS29247.2| TIGR00266 family protein [Coccidioides immitis RS]
          Length = 345

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  + + L G  ++S     PG       + APS 
Sbjct: 150 LTAKPGAMIAMSPTMTLKG----SVKFSVKKMLAGGEMSSSTYTGPGEL-----LLAPS- 199

Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
             +  I L    G+       DAFL     + V +   Q +       EG     ++G G
Sbjct: 200 -ALGDISLLRLNGQEQWFVGKDAFLACTQGI-VKDYKRQGLGKAFFSGEGLYIYTVSGTG 257

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
           + FI + G++++K+L  GE   VD   +VA T +  ++   +G I   +  G+ LV    
Sbjct: 258 ILFIQSFGAIIKKDLVAGEKYIVDNGHLVAWTCNYVMERVASGGIMSGLASGEGLVCK-F 316

Query: 223 TGPGIVFIQSLPFHRLSQRIARAV 246
           TGPG +++Q+    R +Q  A  V
Sbjct: 317 TGPGTIYMQT----RNAQAFAHWV 336


>gi|119177498|ref|XP_001240515.1| hypothetical protein CIMG_07678 [Coccidioides immitis RS]
          Length = 415

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  + + L G  ++S     PG       + APS 
Sbjct: 150 LTAKPGAMIAMSPTMTLKG----SVKFSVKKMLAGGEMSSSTYTGPGEL-----LLAPSA 200

Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
             +  I L    G+       DAFL     + V +   Q +       EG     ++G G
Sbjct: 201 --LGDISLLRLNGQEQWFVGKDAFLACTQGI-VKDYKRQGLGKAFFSGEGLYIYTVSGTG 257

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
           + FI + G++++K+L  GE   VD   +VA T +  ++   +G I   +  G+ LV    
Sbjct: 258 ILFIQSFGAIIKKDLVAGEKYIVDNGHLVAWTCNYVMERVASGGIMSGLASGEGLVCK-F 316

Query: 223 TGPGIVFIQSLPFHRLSQRIARAV 246
           TGPG +++Q+    R +Q  A  V
Sbjct: 317 TGPGTIYMQT----RNAQAFAHWV 336


>gi|398339048|ref|ZP_10523751.1| hypothetical protein LkirsB1_05502 [Leptospira kirschneri serovar
           Bim str. 1051]
 gi|418676821|ref|ZP_13238099.1| TIGR00266 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|418688414|ref|ZP_13249570.1| TIGR00266 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|418739685|ref|ZP_13296066.1| TIGR00266 family protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
 gi|400322721|gb|EJO70577.1| TIGR00266 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|410737271|gb|EKQ82013.1| TIGR00266 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|410752807|gb|EKR09779.1| TIGR00266 family protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
          Length = 221

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 53/200 (26%), Positives = 95/200 (47%), Gaps = 10/200 (5%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPS 93
            ++ L P + + A  G+M +M+  +++E          + +  FG ++    V + P  +
Sbjct: 14  LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKAP-VA 72

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
            G +G+A      ++ +DL   G  L+ +  A+L S   + +       +R+ +GG E F
Sbjct: 73  GGEIGLAPELPGDVVGVDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFLGGEEVF 129

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVF 212
           L + + G G  F+ A G +V   ++V    T+D   IVA   S+  +I K  G  +   F
Sbjct: 130 LLEAI-GVGKLFLNAYGGIVP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFF 186

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
           GG+ LV    +G G V IQ+
Sbjct: 187 GGEGLVME-FSGHGRVLIQT 205


>gi|58338207|ref|YP_194792.1| hypothetical protein LBA1968 [Lactobacillus acidophilus NCFM]
 gi|227902606|ref|ZP_04020411.1| protein of hypothetical function DUF124 [Lactobacillus acidophilus
           ATCC 4796]
 gi|58255524|gb|AAV43761.1| hypothetical protein LBA1968 [Lactobacillus acidophilus NCFM]
 gi|227869695|gb|EEJ77116.1| protein of hypothetical function DUF124 [Lactobacillus acidophilus
           ATCC 4796]
          Length = 237

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/208 (25%), Positives = 96/208 (46%), Gaps = 14/208 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS-----VV 86
           +  I L+P EKV  + GSM + S S+++ N  +     G+ + +   G++ITS     + 
Sbjct: 18  LVDIELEPGEKVFIQRGSMVYHSTSIKL-NTKVNGTGSGIGKLIRAVGRSITSGESFWIT 76

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV-RN 145
                 + G + IA      +LP+ L   G E     D    +++     +   Q V R 
Sbjct: 77  EAQASRTTGKLAIAPALPGEVLPLKL---GKEQYRINDGKFLAMDGSASYSMKKQSVGRA 133

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG 205
           +  G  GF     +G+G     A GS+ +  L   E IT+D + +VA ++ ++  I ++ 
Sbjct: 134 IFSGTGGFFVMTTSGEGTVLCNAYGSIKKITLNNDE-ITIDNNHVVAWSTGLDYDIHFDN 192

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
              +++  G+ +V     G G V+IQSL
Sbjct: 193 GFIQSIGTGEGIVN-TFKGSGEVYIQSL 219


>gi|320032100|gb|EFW14056.1| hypothetical protein CPSG_09423 [Coccidioides posadasii str.
           Silveira]
          Length = 345

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  + + L G  ++S     PG       + APS 
Sbjct: 150 LTAKPGAMIAMSPTMTLKG----SVKFSVKKMLAGGEMSSSTYTGPGEL-----LLAPS- 199

Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
             +  I L    G+       DAFL     + V +   Q +       EG     ++G G
Sbjct: 200 -ALGDISLLRLNGQEQWFVGKDAFLACTQGI-VKDYKRQGLGKAFFSGEGLFIYTVSGTG 257

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
           + FI + G++++K+L  GE   VD   +VA T +  ++   +G I   +  G+ LV    
Sbjct: 258 ILFIQSFGAIIKKDLVAGEKYIVDNGHLVAWTCNYVMERVASGGIMSGLASGEGLVCK-F 316

Query: 223 TGPGIVFIQSLPFHRLSQRIARAV 246
           TGPG +++Q+    R +Q  A  V
Sbjct: 317 TGPGTIYMQT----RNAQAFAHWV 336


>gi|24215002|ref|NP_712483.1| hypothetical protein LA_2302 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45657507|ref|YP_001593.1| hypothetical protein LIC11635 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386074341|ref|YP_005988658.1| hypothetical protein LIF_A1880 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417762665|ref|ZP_12410653.1| TIGR00266 family protein [Leptospira interrogans str. 2002000624]
 gi|417772368|ref|ZP_12420257.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|417777699|ref|ZP_12425516.1| TIGR00266 family protein [Leptospira interrogans str. 2002000621]
 gi|418667879|ref|ZP_13229284.1| TIGR00266 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|418671690|ref|ZP_13233039.1| TIGR00266 family protein [Leptospira interrogans str. 2002000623]
 gi|418680947|ref|ZP_13242184.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|418698166|ref|ZP_13259145.1| TIGR00266 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|418704174|ref|ZP_13265053.1| TIGR00266 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|418711824|ref|ZP_13272576.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|418717138|ref|ZP_13276975.1| TIGR00266 family protein [Leptospira interrogans str. UI 08452]
 gi|418730607|ref|ZP_13289101.1| TIGR00266 family protein [Leptospira interrogans str. UI 12758]
 gi|421085922|ref|ZP_15546773.1| TIGR00266 family protein [Leptospira santarosai str. HAI1594]
 gi|421102770|ref|ZP_15563374.1| TIGR00266 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421117563|ref|ZP_15577923.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|421120876|ref|ZP_15581181.1| TIGR00266 family protein [Leptospira interrogans str. Brem 329]
 gi|421123904|ref|ZP_15584174.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|421134974|ref|ZP_15595104.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|24196047|gb|AAN49501.1|AE011400_6 conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45600746|gb|AAS70230.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353458130|gb|AER02675.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400327385|gb|EJO79637.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|409941410|gb|EKN87039.1| TIGR00266 family protein [Leptospira interrogans str. 2002000624]
 gi|409945739|gb|EKN95754.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|410010897|gb|EKO69028.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|410020857|gb|EKO87652.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410346214|gb|EKO97224.1| TIGR00266 family protein [Leptospira interrogans str. Brem 329]
 gi|410367884|gb|EKP23268.1| TIGR00266 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410431487|gb|EKP75847.1| TIGR00266 family protein [Leptospira santarosai str. HAI1594]
 gi|410438391|gb|EKP87477.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|410572686|gb|EKQ35751.1| TIGR00266 family protein [Leptospira interrogans str. 2002000621]
 gi|410581388|gb|EKQ49200.1| TIGR00266 family protein [Leptospira interrogans str. 2002000623]
 gi|410756324|gb|EKR17949.1| TIGR00266 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|410762870|gb|EKR29029.1| TIGR00266 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|410766154|gb|EKR36842.1| TIGR00266 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|410767790|gb|EKR43051.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|410774816|gb|EKR54820.1| TIGR00266 family protein [Leptospira interrogans str. UI 12758]
 gi|410787205|gb|EKR80939.1| TIGR00266 family protein [Leptospira interrogans str. UI 08452]
 gi|455667356|gb|EMF32677.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Fox 32256]
 gi|455792658|gb|EMF44398.1| TIGR00266 family protein [Leptospira interrogans serovar Lora str.
           TE 1992]
 gi|456821914|gb|EMF70420.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
           str. LT1962]
 gi|456969815|gb|EMG10731.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. LT2186]
          Length = 221

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 88/199 (44%), Gaps = 8/199 (4%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
            ++ L P + + A  G+M +M+  +++E          + +  FG       +     + 
Sbjct: 14  LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKASVTG 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +G+A      ++ IDL   G  L+ +  A+L S   + +       +R+  GG EG  
Sbjct: 74  GEIGLAPELPGDVVGIDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFFGG-EGIF 129

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
             +  G G  F+ A G ++   ++V    T+D   IVA   S+  +I K  G  +   FG
Sbjct: 130 LLEAFGNGKLFLNAYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFG 187

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV    TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205


>gi|448731397|ref|ZP_21713697.1| hypothetical protein C449_16508 [Halococcus saccharolyticus DSM
           5350]
 gi|445792150|gb|EMA42762.1| hypothetical protein C449_16508 [Halococcus saccharolyticus DSM
           5350]
          Length = 233

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 60/232 (25%), Positives = 98/232 (42%), Gaps = 23/232 (9%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD-G 95
           + L   E V+A PG+M   S S+ +E     +  V   + + G   ++   R    S+ G
Sbjct: 16  VELAAGETVMAEPGAMVTHSPSVSIETESSRDGLVSSAKSMLGGE-SAFANRFTAESEPG 74

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
            V ++ P+   +   +L    GE L   D A+L S  D+ + +          GG+E  L
Sbjct: 75  TVTLSPPTPGDVHHHELD---GETLYAVDGAYLASDPDIDIDSEF--------GGLESML 123

Query: 155 RQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
                    L G G  FI A G +    L+ GE   VD   +VA   SV    +  G ++
Sbjct: 124 SGAGLTPLALKGTGDVFIEAFGGLETIALDHGESHVVDNDHVVAWEGSVEFDARRVGGLK 183

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQ 260
             +  G+ +V    TGPG V+ Q+      ++ I  A+  P   ++  F V+
Sbjct: 184 STLLSGEGIVMD-FTGPGTVWYQTRGLDTFTEEIMEAM--PGTGDDGGFEVE 232


>gi|254416762|ref|ZP_05030512.1| conserved hypothetical protein TIGR00266 [Coleofasciculus
           chthonoplastes PCC 7420]
 gi|196176502|gb|EDX71516.1| conserved hypothetical protein TIGR00266 [Coleofasciculus
           chthonoplastes PCC 7420]
          Length = 220

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/201 (23%), Positives = 94/201 (46%), Gaps = 16/201 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSM----EMENIYIPENEVGMWQWLF-GKTITSVVLRNPG 91
           + L P +++ A  G+M  M G +    E    +IP    G+ +  F G+++      N  
Sbjct: 16  LTLNPGDRITAEAGAMTSMDGKLTVKTEFSGGFIP----GLLKKFFGGESLFVNTFHNQT 71

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
                + +   ++  I+ +DL   G  L  QP A++C  + VK+       +++   G E
Sbjct: 72  QQPLTLVLTQSTIGDIVGVDLR--GNSLCFQPGAYICHTSGVKLGVRW-AGLKSWFSG-E 127

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G  + ++ GQG  F  A G + +K +  G+ + VD   +VA   ++ + +   G +  ++
Sbjct: 128 GLFKLQVKGQGKVFFGAYGGLTKKRVN-GDFV-VDTGHLVAYEPTIKMNVGLAGGLLGSM 185

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+  +   +TG G +++QS
Sbjct: 186 TSGEGFINR-LTGTGEIYLQS 205


>gi|302511745|ref|XP_003017824.1| hypothetical protein ARB_04708 [Arthroderma benhamiae CBS 112371]
 gi|291181395|gb|EFE37179.1| hypothetical protein ARB_04708 [Arthroderma benhamiae CBS 112371]
          Length = 333

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 49/189 (25%), Positives = 89/189 (47%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS ++ ++       +  M + L G  ++S     PG       + APS+
Sbjct: 136 LTAKPGAMIAMSPTVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 186

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + I L + G E      DAFL     V V +  +Q +       EG    K++G G+
Sbjct: 187 LGDISI-LRLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 244

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +V+      ++   +G +   +  G+ LV    T
Sbjct: 245 LFVQSFGAIIRKDLVDGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 303

Query: 224 GPGIVFIQS 232
           GPG +++Q+
Sbjct: 304 GPGTIYLQT 312


>gi|159488167|ref|XP_001702091.1| hypothetical protein CHLREDRAFT_181241 [Chlamydomonas reinhardtii]
 gi|158269539|gb|EDO95908.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 240

 Score = 49.7 bits (117), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 57/123 (46%), Gaps = 3/123 (2%)

Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
           QP  +L   + V +   +    +   GG EGF   +  G+G   I + G++ + +L  GE
Sbjct: 115 QPGGYLACDHSVNIGVKMLDIAQGCCGG-EGFFVMEANGRGRLLICSYGAITRYDLAPGE 173

Query: 182 VITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
              +D    VA TS +   I K    +   V  G+ LV A  TGPG VF+Q+     L+ 
Sbjct: 174 KRKIDNGYCVAWTSGMEWNIAKAAKGLWNTVISGEGLV-ANFTGPGTVFVQTRSMQNLAN 232

Query: 241 RIA 243
            +A
Sbjct: 233 ALA 235


>gi|303315955|ref|XP_003067982.1| hypothetical protein CPC735_042810 [Coccidioides posadasii C735
           delta SOWgp]
 gi|240107658|gb|EER25837.1| hypothetical protein CPC735_042810 [Coccidioides posadasii C735
           delta SOWgp]
          Length = 338

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  + + L G  ++S     PG       + APS 
Sbjct: 143 LTAKPGAMIAMSPTMTLKG----SVKFSVKKMLAGGEMSSSTYTGPGEL-----LLAPS- 192

Query: 105 ARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
             +  I L    G+       DAFL     + V +   Q +       EG     ++G G
Sbjct: 193 -ALGDISLLRLNGQEQWFVGKDAFLACTQGI-VKDYKRQGLGKAFFSGEGLFIYTVSGTG 250

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
           + FI + G++++K+L  GE   VD   +VA T +  ++   +G I   +  G+ LV    
Sbjct: 251 ILFIQSFGAIIKKDLVAGEKYIVDNGHLVAWTCNYVMERVASGGIMSGLASGEGLVCK-F 309

Query: 223 TGPGIVFIQSLPFHRLSQRIARAV 246
           TGPG +++Q+    R +Q  A  V
Sbjct: 310 TGPGTIYMQT----RNAQAFAHWV 329


>gi|427420414|ref|ZP_18910597.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
 gi|425763127|gb|EKV03980.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
          Length = 225

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 55/207 (26%), Positives = 92/207 (44%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
           ++ + +I + PQE+++A  G+M  MSG    S  +          G+ + L G+++   V
Sbjct: 10  DSAIAKIFMDPQEELIAEAGAMVAMSGFVNVSTTLRKGKGGGVFGGLKRMLAGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+ G S   + +A   +  ILP  L     EL+ Q   ++ S  DV +        +++
Sbjct: 70  FRS-GNSSAELYLAPKLIGDILPYTLKE--TELVVQSTGYMASTPDVDIDLGF-TGFKSL 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
             G E       +G GL  + A G + +  ++V     VD   IVA   +++  I K N 
Sbjct: 126 FSG-ESLFWLTASGSGLLLLSAFGGIYE--IDVDSEYVVDTGHIVAFERTLDYSISKANK 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
               A FGG+  V     G G VF Q+
Sbjct: 183 SWAGAFFGGEGFVCQ-FRGRGKVFCQT 208


>gi|242823093|ref|XP_002488022.1| DUF124 domain protein [Talaromyces stipitatus ATCC 10500]
 gi|218712943|gb|EED12368.1| DUF124 domain protein [Talaromyces stipitatus ATCC 10500]
          Length = 370

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/187 (26%), Positives = 81/187 (43%), Gaps = 12/187 (6%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPS-LA 105
           ARPG M  MS ++ ++  +       M + L G  +       PG       + APS L 
Sbjct: 178 ARPGVMIAMSPTVTLKGNF----HFSMKKLLIGGEMNHSTYTGPGEI-----LLAPSVLG 228

Query: 106 RILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
            I  + L   G       DAFL +   VK         + +  G EG+    +TG GL +
Sbjct: 229 DITVLRLTDDGQTWKVGRDAFLANTASVKKKYKAQNLSKTLFSG-EGWFVYNITGPGLLW 287

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
           + + G+V++K L  GE   VD   +VA      ++   +G +  ++  G+ L      GP
Sbjct: 288 LQSFGAVIKKELVEGESYYVDNGHLVAWNCEYIIERVASGGMISSLSSGEGLA-CRFKGP 346

Query: 226 GIVFIQS 232
           G V++Q+
Sbjct: 347 GTVYLQT 353


>gi|367036511|ref|XP_003648636.1| hypothetical protein THITE_2106335 [Thielavia terrestris NRRL 8126]
 gi|346995897|gb|AEO62300.1| hypothetical protein THITE_2106335 [Thielavia terrestris NRRL 8126]
          Length = 381

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 55/109 (50%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           D +L S  +V     L    + +  G EG    K+TG GL ++ + G++++K+L  GE  
Sbjct: 258 DGYLASTENVVKDYKLQGLSKAMFSG-EGLWVYKITGTGLVWLTSFGAIIRKDLAEGEKY 316

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   +VA  +   ++   +G I   +   + LV    TGPG VFIQ+
Sbjct: 317 IVDNGHLVAWNTKYVLERVASGGIISGLASAEGLVCK-FTGPGTVFIQT 364


>gi|354617112|ref|ZP_09034609.1| protein of unknown function DUF124 [Saccharomonospora
           paurometabolica YIM 90007]
 gi|353218537|gb|EHB83279.1| protein of unknown function DUF124 [Saccharomonospora
           paurometabolica YIM 90007]
          Length = 221

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/216 (28%), Positives = 92/216 (42%), Gaps = 30/216 (13%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V ++ L PQE V    G M   S  + M++        G+ + + G     V      P+
Sbjct: 9   VARLALAPQEPVQVEAGGMVATSYGVHMQSQSQGGLMKGLGRAVLGGESLFVSTFTAPPN 68

Query: 94  DGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSN----TLDQR---VR 144
            G+V +A   P   RI+ +D             A  C  N   ++N    TLD +     
Sbjct: 69  GGWVDVAPGLPGDVRIVEMD-----------GRAGWCVTNGSWLANSHGITLDTKWGGFG 117

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
           N+ GG  GFL    TGQG   +   G++    L+ GE+ITVD   +VA   +V  Q    
Sbjct: 118 NLFGGEGGFLSHA-TGQGQLVVACYGAMDLVTLQQGEMITVDAGHVVAYADTVQHQ---- 172

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
              RR   G   +V ++ +G G+VF  + P   L+Q
Sbjct: 173 --TRRISSG---MVQSLKSGEGLVFDFAGPGQVLTQ 203


>gi|453085936|gb|EMF13978.1| DUF124-domain-containing protein [Mycosphaerella populorum SO2202]
          Length = 358

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 89/199 (44%), Gaps = 10/199 (5%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           +  +PG+M  MS ++ ++       +  M + L    ++      PG     + +A  +L
Sbjct: 164 ITVKPGAMIAMSPTITVKGSV----KFSMKKLLARGEMSHTTFTGPGE----LLLAPFAL 215

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
             I  I L+  G       DAFL     V   + + Q +   I   EG    KL G G+ 
Sbjct: 216 GDITNIRLSGKGETWNIGKDAFLACTQGV-TKDYVSQALSKAIFSGEGLFVYKLGGAGIC 274

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           +I + G++++K+L  GE   +D   +VA   +  ++   +G I   +  G+ LV    +G
Sbjct: 275 WIQSMGAIIRKDLREGEKYIIDNGHLVAWNCNYVLERVASGGIISNMSAGEGLVCK-FSG 333

Query: 225 PGIVFIQSLPFHRLSQRIA 243
           PG VF+Q+      SQ +A
Sbjct: 334 PGTVFMQTRNPAAFSQYMA 352


>gi|310642467|ref|YP_003947225.1| hypothetical protein [Paenibacillus polymyxa SC2]
 gi|386041537|ref|YP_005960491.1| hypothetical protein PPM_2847 [Paenibacillus polymyxa M1]
 gi|309247417|gb|ADO56984.1| Putative uncharacterized protein [Paenibacillus polymyxa SC2]
 gi|343097575|emb|CCC85784.1| conserved hypothetical protein [Paenibacillus polymyxa M1]
          Length = 226

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 9/206 (4%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
           G   + ++ L P E + A  G+M  MS S++++         G+ + L G+      LR 
Sbjct: 9   GAFAMLKVQLSPGETIKAEMGAMVSMSSSVDIKGTVDGGLLRGLGRMLSGEKFFFQELRA 68

Query: 90  P-GPSDGFVGIAAPSLARILPIDL-AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
             GP++  V +A  S+  +  ++L   +   LL Q D FL     ++VS  +   ++ + 
Sbjct: 69  ARGPAE--VLLAPASIGDVQAVELDGTY--RLLVQKDGFLACTEGIEVSTKMQNLMKGLF 124

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGP 206
            G EGF   +++G+G  F+ + G++    +  GE   +D + +VA    ++ +I K +  
Sbjct: 125 SG-EGFFIVEISGRGTVFLSSYGAIHPIYIAPGEERIIDNAHLVAWPDYMDYRIEKASKG 183

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
              +V  G+ LV     G G V IQS
Sbjct: 184 WLSSVTSGEALVCR-FRGEGTVLIQS 208


>gi|410940911|ref|ZP_11372710.1| TIGR00266 family protein [Leptospira noguchii str. 2006001870]
 gi|410783470|gb|EKR72462.1| TIGR00266 family protein [Leptospira noguchii str. 2006001870]
          Length = 221

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 90/199 (45%), Gaps = 8/199 (4%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
            ++ L P + + A  G+M +M+  +++E          + +  FG       +     + 
Sbjct: 14  LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKATATG 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +G+A      ++ +DL   G  L+ +  A+L S   + +       +R+ +GG EG  
Sbjct: 74  GEIGLAPELPGDLVGVDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFLGG-EGIF 129

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
             +  G G  F+ + G ++   ++V    T+D   IVA   S+  +I K  G  +  +FG
Sbjct: 130 LLEAIGNGKLFLNSYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTLFG 187

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV    TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205


>gi|145233977|ref|XP_001400361.1| hypothetical protein ANI_1_3122024 [Aspergillus niger CBS 513.88]
 gi|134057301|emb|CAK44500.1| unnamed protein product [Aspergillus niger]
          Length = 361

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 52/188 (27%), Positives = 79/188 (42%), Gaps = 11/188 (5%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + ARPG M  MS SM +          G  + L G  +       PG       + APS+
Sbjct: 168 IQARPGIMIAMSPSMSLRGSL----SFGWMKALAGGQLARSTYTGPGEL-----LLAPSI 218

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
              + I       E     DAFL     V ++      ++ V  G EGF+  K TG GL 
Sbjct: 219 LGDVTIIRLDGHTEWTVGKDAFLACTVGVNINFKTQGLMKGVFSG-EGFIVFKFTGSGLV 277

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           ++ + G++V+K L   E   V+   +VA      ++   +G I      G+ L     TG
Sbjct: 278 WMQSFGAIVKKELADDETYYVNNGHLVAWNCRYKIERVASGGIISNWSAGEGLA-CRFTG 336

Query: 225 PGIVFIQS 232
           PG V++Q+
Sbjct: 337 PGTVYMQT 344


>gi|418692288|ref|ZP_13253366.1| TIGR00266 family protein [Leptospira interrogans str. FPW2026]
 gi|400357521|gb|EJP13641.1| TIGR00266 family protein [Leptospira interrogans str. FPW2026]
          Length = 221

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 87/199 (43%), Gaps = 8/199 (4%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
            ++ L P + + A  G+M +M+  +++E          + +  FG       +     + 
Sbjct: 14  LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKASVTG 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +G A      ++ IDL   G  L+ +  A+L S   + +       +R+  GG EG  
Sbjct: 74  GEIGFAPELPGDVVGIDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFFGG-EGIF 129

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
             +  G G  F+ A G ++   ++V    T+D   IVA   S+  +I K  G  +   FG
Sbjct: 130 LLEAFGNGKLFLNAYGGIIP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFG 187

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV    TG G V IQ+
Sbjct: 188 GEGLVMEF-TGHGRVLIQT 205


>gi|322710039|gb|EFZ01614.1| DUF124 domain-containing protein [Metarhizium anisopliae ARSEF 23]
          Length = 413

 Score = 49.3 bits (116), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/186 (26%), Positives = 89/186 (47%), Gaps = 11/186 (5%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           A+PG+M  MS S+ ++     E +  + + + G  +T+     PG     + +A P L  
Sbjct: 222 AKPGAMIAMSPSITLKG----ELKFSVKKMIAGADMTTSRYVGPG----ELLLAPPMLGD 273

Query: 107 ILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFI 166
           I  I L      ++   DAFL +   + + +   Q +   I   EG    +++G G+ ++
Sbjct: 274 ITSIRLTGQESWVVGH-DAFLAATQGI-IKDHKRQGLGKAIFSGEGLFVYRISGTGIMWL 331

Query: 167 LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPG 226
            + G++++K+L  GE   VD   +VA  +   ++   +G I   +   + LV    TGPG
Sbjct: 332 TSFGAIIRKDLVDGEKYIVDNGHLVAWNTKYVLERVASGGIISNMASAEGLVCK-FTGPG 390

Query: 227 IVFIQS 232
            VFIQ+
Sbjct: 391 TVFIQT 396


>gi|433775565|ref|YP_007306032.1| hypothetical protein Mesau_04318 [Mesorhizobium australicum
           WSM2073]
 gi|433667580|gb|AGB46656.1| hypothetical protein Mesau_04318 [Mesorhizobium australicum
           WSM2073]
          Length = 232

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/146 (26%), Positives = 69/146 (47%), Gaps = 6/146 (4%)

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +  +      I+PI L   G E+  +   FL + ++V  +    + +  ++ G  GF 
Sbjct: 84  GIIAFSRDGPGHIVPIHLRR-GEEIQVREHQFLAATSNVDYTFERVRGLATMLFGQSGFF 142

Query: 155 RQKL---TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
             +    TG G+ ++   G+V +K L  GE+I ++    +   +SV ++ K +  +    
Sbjct: 143 IDRFRGDTGDGIVWLHGYGNVFEKTLAPGEIIDIEPGGWLFKDASVRMETKID-RLSSGF 201

Query: 212 FGGD-NLVTAVVTGPGIVFIQSLPFH 236
           FG + N +    TGPG V IQS+  H
Sbjct: 202 FGANLNFIVNRFTGPGRVGIQSMYLH 227


>gi|372222877|ref|ZP_09501298.1| hypothetical protein MzeaS_11221, partial [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 76

 Score = 48.9 bits (115), Expect = 0.002,   Method: Composition-based stats.
 Identities = 22/42 (52%), Positives = 30/42 (71%)

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           I++ G I+  VFGG+ L  A + GPG V+IQSLPF RL+ R+
Sbjct: 4   IEFVGGIKNTVFGGEGLFFATLRGPGTVYIQSLPFSRLAGRV 45


>gi|418721405|ref|ZP_13280582.1| TIGR00266 family protein [Leptospira borgpetersenii str. UI 09149]
 gi|410742097|gb|EKQ90847.1| TIGR00266 family protein [Leptospira borgpetersenii str. UI 09149]
          Length = 221

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  +++E          + +  FG ++    + + P  G
Sbjct: 14  LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRKFFGGESFFFNIFKAPAQG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ +DL   G  L+ +  A+L S   V + +     +R+  GG 
Sbjct: 74  AEIGF----APELPGDVVGVDLTNTG--LIVEAGAYLASDETVTMKSKFGG-LRSFFGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    ++ G G  F+ A G ++   ++V    T+D   +VA   S+  +I K  G  + 
Sbjct: 126 EGIFLLEILGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ LV    TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205


>gi|326476653|gb|EGE00663.1| hypothetical protein TESG_07960 [Trichophyton tonsurans CBS 112818]
          Length = 335

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 49/189 (25%), Positives = 89/189 (47%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS ++ ++       +  M + L G  ++S     PG       + APS+
Sbjct: 138 LTAKPGAMIAMSPTVSLKGSI----KFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 188

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              + I L + G E      DAFL     V V +  +Q +       EG    K++G G+
Sbjct: 189 LGDISI-LRLNGQEQWSVGKDAFLACTQGV-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 246

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +V+      ++   +G +   +  G+ LV    T
Sbjct: 247 LFVQSFGAIIRKDLVDGEKYIVDNGHLVSWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 305

Query: 224 GPGIVFIQS 232
           GPG +++Q+
Sbjct: 306 GPGTIYLQT 314


>gi|159483657|ref|XP_001699877.1| hypothetical protein CHLREDRAFT_97437 [Chlamydomonas reinhardtii]
 gi|158281819|gb|EDP07573.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 238

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 58/127 (45%), Gaps = 3/127 (2%)

Query: 117 GELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKN 176
           G  + QP  +L   + V +   +    +   GG EGF   +  G+G   I + G++ + +
Sbjct: 92  GSWVVQPGGYLACDHSVSIGVKMLDLAQGCCGG-EGFFVMEANGRGRLLICSYGAITRYD 150

Query: 177 LEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPF 235
           L  GE   +D    VA TS +   I K    +   V  G+ LV A  TGPG VF+Q+   
Sbjct: 151 LAPGEKRKIDNGYCVAWTSGMQWNIAKAARGLWNTVISGEGLV-ANFTGPGTVFVQTRSM 209

Query: 236 HRLSQRI 242
             L+  +
Sbjct: 210 QNLANAL 216


>gi|428207316|ref|YP_007091669.1| hypothetical protein Chro_2308 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428009237|gb|AFY87800.1| protein of unknown function DUF124 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 220

 Score = 48.9 bits (115), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 93/200 (46%), Gaps = 14/200 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMWQWLFGKTITSVVLRNPGP 92
           + L P E + A  G+M  M   + ++  +    IP     + ++L G+++      N   
Sbjct: 16  VTLDPGESITAEAGAMTSMDAQLSIQTKFSGGLIPAL---LKKFLGGESLFVNTFTNRTQ 72

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
               + +   ++  I  +DL+   G++  QP AF+     VK+       +++ + G EG
Sbjct: 73  QSLRLVLTQSTIGDIGRLDLSQ--GDICLQPGAFVAHSAGVKMGLQW-AGIKSWLAG-EG 128

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
             + KL+G G AF    G + +K ++ GE I VD   +VA    + + I   G +  +V 
Sbjct: 129 LFKLKLSGSGRAFFGCYGGITKKQIQ-GEFI-VDSGHLVAYEPKIKMSIGLAGGLVGSVT 186

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
            G+ LV  +  G G +++QS
Sbjct: 187 SGEGLVNRLA-GTGDIYLQS 205


>gi|172035645|ref|YP_001802146.1| hypothetical protein cce_0729 [Cyanothece sp. ATCC 51142]
 gi|171697099|gb|ACB50080.1| hypothetical protein cce_0729 [Cyanothece sp. ATCC 51142]
          Length = 227

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 48/201 (23%), Positives = 96/201 (47%), Gaps = 16/201 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVLRNPG 91
           + L P E ++A  G+M  M G + M+     +   G++     ++L G+++   V +N  
Sbjct: 23  VTLAPGESIIAEAGAMASMDGRLSMKT----KLSGGLFSALLRKFLGGESLFVNVFKNQT 78

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
                + +    +  I  ++L+  G E+  QP A++ S   + +            G  E
Sbjct: 79  SQALTLVLTQSIIGDIECLELS--GNEMCFQPGAYIASSPKIDLGVRWAGLASWFAG--E 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G  + K++G G  F  A G + ++ +  GE I VD S +VA +  +++ I  +G +  ++
Sbjct: 135 GLFKLKVSGHGKVFFGAYGGLSKQTI-TGEFI-VDNSHLVAYSPDISMSIGLSGGLISSM 192

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+ LV   + G G++++QS
Sbjct: 193 TSGEGLVNR-LKGHGVIYLQS 212


>gi|396493741|ref|XP_003844129.1| hypothetical protein LEMA_P017800.1 [Leptosphaeria maculans JN3]
 gi|312220709|emb|CBY00650.1| hypothetical protein LEMA_P017800.1 [Leptosphaeria maculans JN3]
          Length = 407

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 90/199 (45%), Gaps = 19/199 (9%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP---GPSDGFVGIAA 101
           + A+PGSM  M+ ++ ++         G  ++   K  T   +      GP +  +  AA
Sbjct: 214 LTAKPGSMIAMTPTITLK---------GTMKFSLKKAFTGGEMSKSQYIGPGELLLAPAA 264

Query: 102 PSLARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTG 160
             +  I  I L+  G E   C  DAFL     + V+    Q +   +   EG    K++G
Sbjct: 265 --IGDITIIKLS--GKETWSCGHDAFLACTQGI-VTEYKSQGLSKAMFSGEGLFVYKISG 319

Query: 161 QGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTA 220
           QG+ +  + G++++K+L+  E   +D   +VA      ++   +G I   +  G+ LV  
Sbjct: 320 QGVLWCTSFGAIIRKDLQKDEKYIIDNGHLVAWNCKYVMERVASGGIMSNMMAGEGLVCK 379

Query: 221 VVTGPGIVFIQSLPFHRLS 239
             TGPG +F+Q+    + S
Sbjct: 380 -FTGPGTIFMQTRNAQQFS 397


>gi|428304804|ref|YP_007141629.1| hypothetical protein Cri9333_1218 [Crinalium epipsammum PCC 9333]
 gi|428246339|gb|AFZ12119.1| protein of unknown function DUF124 [Crinalium epipsammum PCC 9333]
          Length = 221

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 52/198 (26%), Positives = 89/198 (44%), Gaps = 9/198 (4%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSDG 95
           + L P E + A  G+M  M+G + M   +       + +  FG +++   V  N      
Sbjct: 16  LSLSPGESITAEAGAMTSMNGQISMRTEFSGGLLSALLKKFFGGESLFVNVFTNNTQLPL 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
            V +   ++  I  IDL   GG  +C QP A++     + +            G  EG  
Sbjct: 76  EVVLTQSTIGDIACIDLR--GGREICFQPGAYIAHTPGINIGVQWAGFSSFFAG--EGLF 131

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
           + KL+GQG  F  A G + QK +  GE I VD   +VA    + +++   G +  ++  G
Sbjct: 132 KLKLSGQGRVFFGAYGGISQKQIN-GEFI-VDSGHLVAYDPGIKMKLGLAGGLIGSMTSG 189

Query: 215 DNLVTAVVTGPGIVFIQS 232
           + LV   ++G G +++QS
Sbjct: 190 EGLVNR-LSGNGEIYLQS 206


>gi|395244439|ref|ZP_10421406.1| Putative uncharacterized protein [Lactobacillus hominis CRBIP
           24.179]
 gi|394483329|emb|CCI82414.1| Putative uncharacterized protein [Lactobacillus hominis CRBIP
           24.179]
          Length = 236

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 54/218 (24%), Positives = 94/218 (43%), Gaps = 12/218 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITS-----VVL 87
           +  I L+  EKV    GSM + S S+ +   +    + +G      G++I+S     +  
Sbjct: 18  LVDIELEDGEKVYIERGSMVYHSTSINLNTKVNAAGSGIGKLMRAVGRSISSGESFWITE 77

Query: 88  RNPGPSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
                  G + IA PSL   IL ++L     +       FL        S       R +
Sbjct: 78  AQAKGETGHLAIA-PSLPGEILHLELG--DKQYRINDGKFLAMDGSASYSMKKQTLGRAL 134

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
           + G  GF     +GQG     A GS+ +  L+  E IT+D + +VA ++ ++  I ++  
Sbjct: 135 LSGTGGFFVMTTSGQGTVLCNAYGSIKKIVLDDDE-ITIDNNHVVAWSTDLSYDIHFDNG 193

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             +++  G+ +V     G G V+IQSL     + R+ +
Sbjct: 194 FIQSIGTGEGVVN-TFKGSGEVYIQSLNIETFASRLDK 230


>gi|452983776|gb|EME83534.1| hypothetical protein MYCFIDRAFT_188507 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 335

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 55/109 (50%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DAFL     V   + + Q +   I   EG    KL G G+ +I + G++++K+L  GE  
Sbjct: 212 DAFLACTQGV-TRDYVSQNLSKAIFSGEGLFVYKLGGTGICWIQSMGAILRKDLREGEKY 270

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            +D   +VA   +  V+   +G I   +  G+ LV    +GPG VF+Q+
Sbjct: 271 IIDNGHLVAWNCNYVVERVASGGIISNISAGEGLVCK-FSGPGTVFMQT 318


>gi|116328217|ref|YP_797937.1| hypothetical protein LBL_1541 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|418736821|ref|ZP_13293220.1| TIGR00266 family protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|421096067|ref|ZP_15556774.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200801926]
 gi|116120961|gb|ABJ79004.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. L550]
 gi|410360968|gb|EKP12014.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200801926]
 gi|410747707|gb|EKR00612.1| TIGR00266 family protein [Leptospira borgpetersenii serovar
           Castellonis str. 200801910]
 gi|456888184|gb|EMF99177.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200701203]
          Length = 221

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 53/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  +++E          + +  FG ++    + + P  G
Sbjct: 14  LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRKFFGGESFFFNIFKAPAQG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ +DL   G  L+ +  A+L S   + + +     +R+  GG 
Sbjct: 74  AEIGF----APELPGDVVGVDLTNTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    ++ G G  F+ A G ++   ++V    T+D   +VA   S+  +I K  G  + 
Sbjct: 126 EGIFLLEILGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ LV    TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205


>gi|389641153|ref|XP_003718209.1| hypothetical protein MGG_11494 [Magnaporthe oryzae 70-15]
 gi|351640762|gb|EHA48625.1| hypothetical protein MGG_11494 [Magnaporthe oryzae 70-15]
          Length = 478

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 34/109 (31%), Positives = 56/109 (51%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           D +L S   V V     Q +   +   EG    K++G+GL ++ + G++++K+L  GE  
Sbjct: 354 DGYLASTQGV-VKEYKRQGLGKAMFSGEGLWIYKISGKGLLWLTSFGAIIRKDLADGERY 412

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   +VA  +   ++   +G I   V  G+ LV    TGPG VF+Q+
Sbjct: 413 VVDNGHLVAWNTKYILERVASGGIVSGVASGEGLV-CRFTGPGTVFLQT 460


>gi|399575836|ref|ZP_10769593.1| hypothetical protein HSB1_16320 [Halogranum salarium B-1]
 gi|399238547|gb|EJN59474.1| hypothetical protein HSB1_16320 [Halogranum salarium B-1]
          Length = 228

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 1/76 (1%)

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
           KLTG G  F+ + G+V +  L  GE   VD   IVA   S   +++  G ++  +F G+ 
Sbjct: 131 KLTGTGPTFLSSYGAVHEIELTEGEQYVVDTGHIVAFEESTTFEVRRVGGMKSTLFSGEG 190

Query: 217 LVTAVVTGPGIVFIQS 232
           LV    TGPG +++QS
Sbjct: 191 LVCE-FTGPGKLWLQS 205


>gi|354554886|ref|ZP_08974189.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
 gi|353553040|gb|EHC22433.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
          Length = 220

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 48/201 (23%), Positives = 96/201 (47%), Gaps = 16/201 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMW-----QWLFGKTITSVVLRNPG 91
           + L P E ++A  G+M  M G + M+     +   G++     ++L G+++   V +N  
Sbjct: 16  VTLAPGESIIAEAGAMASMDGRLSMKT----KLSGGLFSALLRKFLGGESLFVNVFKNQT 71

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
                + +    +  I  ++L+  G E+  QP A++ S   + +            G  E
Sbjct: 72  SQALTLVLTQSIIGDIECLELS--GNEMCFQPGAYIASSPKIDLGVRWAGLASWFAG--E 127

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G  + K++G G  F  A G + ++ +  GE I VD S +VA +  +++ I  +G +  ++
Sbjct: 128 GLFKLKVSGHGKVFFGAYGGLSKQTI-TGEFI-VDNSHLVAYSPDISMSIGLSGGLISSM 185

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+ LV   + G G++++QS
Sbjct: 186 TSGEGLVNR-LKGHGVIYLQS 205


>gi|296811802|ref|XP_002846239.1| DUF124 domain-containing protein [Arthroderma otae CBS 113480]
 gi|238843627|gb|EEQ33289.1| DUF124 domain-containing protein [Arthroderma otae CBS 113480]
          Length = 319

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 48/189 (25%), Positives = 88/189 (46%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           +  +PG+M  MS S+ ++       +  M + L G  ++S     PG       + APS+
Sbjct: 123 LTVKPGAMIAMSPSVSLKG----SVKFSMKKMLAGGEMSSSTFTGPGEL-----LLAPSM 173

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
              +  +L + G E      DAFL     + V +  +Q +       EG    K++G G+
Sbjct: 174 LGDI-SNLKLNGQEQWSVGKDAFLACTQGI-VKDYKNQGLSKAFFSGEGLFVYKISGTGI 231

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            F+ + G++++K+L  GE   VD   +VA      ++   +G +   +  G+ LV    T
Sbjct: 232 LFVQSFGAIIRKDLVEGEKYIVDNGHLVAWNCKYVMERVASGGVLSNLSSGEGLVCK-FT 290

Query: 224 GPGIVFIQS 232
           GPG +++Q+
Sbjct: 291 GPGTIYLQT 299


>gi|116330941|ref|YP_800659.1| hypothetical protein LBJ_1316 [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
 gi|116124630|gb|ABJ75901.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
           Hardjo-bovis str. JB197]
          Length = 221

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 53/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  +++E          + +  FG ++    + + P  G
Sbjct: 14  LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRKFFGGESFFFNIFKAPAQG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ +DL   G  L+ +  A+L S   + + +     +R+  GG 
Sbjct: 74  AEIGF----APELPGDVVGVDLTNTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    ++ G G  F+ A G ++   ++V    T+D   +VA   S+  +I K  G  + 
Sbjct: 126 EGIFLLEILGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ LV    TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205


>gi|448420452|ref|ZP_21581199.1| hypothetical protein C474_20721 [Halosarcina pallida JCM 14848]
 gi|445673603|gb|ELZ26163.1| hypothetical protein C474_20721 [Halosarcina pallida JCM 14848]
          Length = 226

 Score = 48.5 bits (114), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 81/199 (40%), Gaps = 7/199 (3%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           +  + L   E + A  G+M     ++EME          + +   G++      R   P 
Sbjct: 13  LLTVSLAESESLRAESGAMVSHDTAIEMETNATGGFLKSIRRAFGGESFFQNTFRATAPG 72

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           D  V  A P    I  ++L   G  L  Q  ++L     + V        R   GG EG 
Sbjct: 73  D--VQFAPPLPGDISHVELT--GETLYVQSGSYLAGDAGLDVDTEFGG-ARTFFGG-EGL 126

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
              K+TG G  F+ + G++ + +L+  E   VD   +VA   +    ++  G +R  +  
Sbjct: 127 FLLKVTGSGPLFLSSYGAIEEISLDDRESFVVDTGHVVAFEDTAEFTVRRVGGLRSTLTS 186

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV     G G V++QS
Sbjct: 187 GEGLVCE-FGGSGTVWVQS 204


>gi|335438291|ref|ZP_08561039.1| hypothetical protein HLRTI_14150 [Halorhabdus tiamatea SARL4B]
 gi|334892485|gb|EGM30718.1| hypothetical protein HLRTI_14150 [Halorhabdus tiamatea SARL4B]
          Length = 235

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 44/76 (57%), Gaps = 1/76 (1%)

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
           +L+G G  F+ + G++ + +L+ GE  TVD   IVA   S N  +K  G ++  +F G+ 
Sbjct: 131 ELSGTGRTFLSSYGAIERVDLDPGETYTVDTGHIVAFEGSTNFDVKKVGGLKSTLFSGEG 190

Query: 217 LVTAVVTGPGIVFIQS 232
           LV    +G G V++Q+
Sbjct: 191 LVCE-FSGEGSVWLQT 205


>gi|337269156|ref|YP_004613211.1| hypothetical protein Mesop_4696 [Mesorhizobium opportunistum
           WSM2075]
 gi|336029466|gb|AEH89117.1| protein of unknown function DUF124 [Mesorhizobium opportunistum
           WSM2075]
          Length = 232

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 67/146 (45%), Gaps = 6/146 (4%)

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +  +      I+PI L  +G E+  +   FL +  +V  +    + +  ++ G  GF 
Sbjct: 84  GIIAFSRDGPGHIVPIHLR-YGEEIQVREHQFLAATGNVDYTFERVRGLATMLFGQSGFF 142

Query: 155 RQKL---TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
             +     G G+ ++   G+V +K L  GE I ++    +   +SV ++ K +  +    
Sbjct: 143 IDRFRGDAGDGIVWLHGYGNVFEKTLAPGETIDIEPGGWLFKDTSVKMETKID-RLSSGF 201

Query: 212 FGGD-NLVTAVVTGPGIVFIQSLPFH 236
           FG + N +    TGPG V IQS+  H
Sbjct: 202 FGANMNFIVNRFTGPGRVGIQSMYLH 227


>gi|451847140|gb|EMD60448.1| hypothetical protein COCSADRAFT_99184 [Cochliobolus sativus ND90Pr]
          Length = 250

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 50/198 (25%), Positives = 89/198 (44%), Gaps = 15/198 (7%)

Query: 48  RPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARI 107
           +PG+M  MS +M ++       +  + + L G  +T      PG     + +A P++  I
Sbjct: 61  KPGAMIAMSPTMTLKG----NVKFSLKKALIGGDMTKSTYTGPGE----LLLAPPAIGDI 112

Query: 108 LPIDLAMFGGELLCQ--PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
             I L   GG        DAFL     + V+    Q +   +   EGF   K++G G+ +
Sbjct: 113 TIIKL---GGNDTWSVGRDAFLACTQGI-VTEYKSQGIGKAMFSGEGFFVYKISGTGILW 168

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
             + G++++K+L   E   +D   +VA      ++   +G I   +  G+ +V    TGP
Sbjct: 169 CTSFGAIIRKDLAANEKYIIDNGHLVAWNCKYVLERVASGGIVSNLASGEGMVCK-FTGP 227

Query: 226 GIVFIQSLPFHRLSQRIA 243
           G VF+Q+      +Q +A
Sbjct: 228 GTVFMQTRNATAFAQFMA 245


>gi|367023727|ref|XP_003661148.1| hypothetical protein MYCTH_2300213 [Myceliophthora thermophila ATCC
           42464]
 gi|347008416|gb|AEO55903.1| hypothetical protein MYCTH_2300213 [Myceliophthora thermophila ATCC
           42464]
          Length = 425

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 87/189 (46%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + ARPG+M  MS S+ ++       +  + + + G  + S     PG     + +  P L
Sbjct: 232 IEARPGAMIAMSPSIVLKG----SVKFSVKKLVAGGDMGSSTYTGPGE----LLLGPPML 283

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I    L + G E      D FL    +V V +   Q +   +   EG    K++G GL
Sbjct: 284 GDI--TSLRLTGQESWSVSHDGFLACTQNV-VKDYKRQGLGKAMFSGEGLWVYKISGTGL 340

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            ++ + G++++K+L  GE   VD   +VA  +   ++   +G I   +  G+ LV    T
Sbjct: 341 LWLTSFGAIIRKDLVEGEKYIVDNGHLVAWNTKYVLERVASGGIISGLASGEGLVCK-FT 399

Query: 224 GPGIVFIQS 232
           GPG VFIQ+
Sbjct: 400 GPGTVFIQT 408


>gi|159483201|ref|XP_001699649.1| hypothetical protein CHLREDRAFT_97504 [Chlamydomonas reinhardtii]
 gi|158281591|gb|EDP07345.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 232

 Score = 48.1 bits (113), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 36/122 (29%), Positives = 56/122 (45%), Gaps = 3/122 (2%)

Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
           QP  +L   + V +   +    +   GG EGF   +  G+G   I + G++ + +L  GE
Sbjct: 94  QPGGYLACDHSVNIGVKMLDIAQGCCGG-EGFFVMEANGRGRLLICSYGAITRYDLAPGE 152

Query: 182 VITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
              +D    VA TS +   I K    +   V  G+ LV A  TGPG VF+Q+     L+ 
Sbjct: 153 KRKIDNGYCVAWTSGMEWNIAKAAKGLWNTVISGEGLV-ANFTGPGTVFVQTRSMQNLAN 211

Query: 241 RI 242
            +
Sbjct: 212 AL 213


>gi|15790569|ref|NP_280393.1| hypothetical protein VNG1609C [Halobacterium sp. NRC-1]
 gi|169236306|ref|YP_001689506.1| hypothetical protein OE3284R [Halobacterium salinarum R1]
 gi|10581083|gb|AAG19873.1| conserved hypothetical protein [Halobacterium sp. NRC-1]
 gi|167727372|emb|CAP14160.1| conserved hypothetical protein [Halobacterium salinarum R1]
          Length = 235

 Score = 48.1 bits (113), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 60/237 (25%), Positives = 97/237 (40%), Gaps = 27/237 (11%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD-- 94
           + L P E ++A PG+M   S  + ++     +  +   + + G         N   +D  
Sbjct: 16  VDLDPGESILAEPGAMVSHSAGVSIDTSTTRDGLLSSAKSMLGG---ESAFANEFAADHE 72

Query: 95  -GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
            G V +A P    I   DL    GE L   D AFL S   + V++          GG++ 
Sbjct: 73  AGRVTLAPPKPGDIAQHDLD---GETLYAVDGAFLASDPSLDVASEF--------GGLKS 121

Query: 153 FLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
            L         L+G G  F+ A G + + +L+ GE   VD   +VA   SV+   +  G 
Sbjct: 122 LLAGASITPLALSGTGTVFLEAFGGIERVDLDPGESYVVDNEHVVAWEESVDFDARRVGG 181

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQIAI 263
           ++  +  G+ LV    TG G V+ Q+      +  IA A+  P    +    + I I
Sbjct: 182 LKSTLLSGEGLVME-FTGGGSVWYQTRGLDAFTSIIAGAL--PMTEHDSDGGIDIDI 235


>gi|456862794|gb|EMF81306.1| TIGR00266 family protein [Leptospira weilii serovar Topaz str.
           LT2116]
          Length = 221

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 53/203 (26%), Positives = 96/203 (47%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-GKTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  +++E          + +  F G++    + + P  G
Sbjct: 14  LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRRFFGGESFFFNIFKAPSGG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ +DL   G  L+ +  A+L S   + + +     +R+  GG 
Sbjct: 74  AELGF----APELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    ++ G G  F+ A G ++   ++V    T+D   +VA   S+  +I K +G  + 
Sbjct: 126 EGIFLLEVLGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLRYKITKASGSWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ LV    TG G V IQ+
Sbjct: 184 TLFGGEGLVME-FTGHGKVLIQT 205


>gi|308069406|ref|YP_003871011.1| hypothetical protein PPE_02645 [Paenibacillus polymyxa E681]
 gi|305858685|gb|ADM70473.1| DUF124 domain-containing protein [Paenibacillus polymyxa E681]
          Length = 226

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 53/206 (25%), Positives = 97/206 (47%), Gaps = 9/206 (4%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
           G   + ++ L P E + A  G+M  MS S++++         G+ + L G+      LR 
Sbjct: 9   GAFAMLKVQLNPGEMIKAEMGAMVSMSSSVDIKGTVDGGLLRGLGRMLSGEKFFFQELRA 68

Query: 90  P-GPSDGFVGIAAPSLARILPIDL-AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
             GP++  V +A  S+  +  ++L   +   L  Q D FL     ++VS  +   ++ + 
Sbjct: 69  ARGPAE--VLLAPASIGDVQAVELDGTY--RLFVQKDGFLACTEGIEVSTKMQNLMKGLF 124

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGP 206
            G EGF   +++G+G  F+ + G++    +  GE   +D + +VA    ++ +I K +  
Sbjct: 125 SG-EGFFIVEISGRGTVFLSSYGAIHPIYIAPGEERIIDNAHLVAWPDYMDYKIEKASKG 183

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
              +V  G+ LV     G G V IQS
Sbjct: 184 WLSSVTSGEALVCR-FRGEGTVLIQS 208


>gi|433639418|ref|YP_007285178.1| TIGR00266 family protein [Halovivax ruber XH-70]
 gi|433291222|gb|AGB17045.1| TIGR00266 family protein [Halovivax ruber XH-70]
          Length = 225

 Score = 47.8 bits (112), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 51/199 (25%), Positives = 90/199 (45%), Gaps = 11/199 (5%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--KTITSVVLRNPGPSD 94
           + L   E ++A PG+M   S ++E+      +  +   + + G    + +      GP  
Sbjct: 16  VELAAGETILAEPGAMVGHSPNVEISTASSRDGLLSSAKSMLGGESLVVNEFTAEGGP-- 73

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
           G V +A P+   +   +L     E L   D AFL S   + + +     +++++GG    
Sbjct: 74  GEVRLAPPTPGDVKEHELQ---DETLYTTDGAFLASSPGIDIDSEF-GGLKSMLGGAS-L 128

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
               L G G AFI A G + + +L+ GE  T+D   ++A   SV    +  G ++  +  
Sbjct: 129 TPLALKGTGTAFIDAYGGLERLDLDPGESYTLDNEHLIAWDDSVEFDTRRVGGLKSTLLS 188

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV    TGPG V+ Q+
Sbjct: 189 GEGLVFD-FTGPGSVWYQT 206


>gi|418696575|ref|ZP_13257584.1| TIGR00266 family protein [Leptospira kirschneri str. H1]
 gi|421089051|ref|ZP_15549866.1| TIGR00266 family protein [Leptospira kirschneri str. 200802841]
 gi|421109955|ref|ZP_15570462.1| TIGR00266 family protein [Leptospira kirschneri str. H2]
 gi|421130944|ref|ZP_15591135.1| TIGR00266 family protein [Leptospira kirschneri str. 2008720114]
 gi|409956104|gb|EKO15036.1| TIGR00266 family protein [Leptospira kirschneri str. H1]
 gi|410002172|gb|EKO52694.1| TIGR00266 family protein [Leptospira kirschneri str. 200802841]
 gi|410004982|gb|EKO58786.1| TIGR00266 family protein [Leptospira kirschneri str. H2]
 gi|410357748|gb|EKP04968.1| TIGR00266 family protein [Leptospira kirschneri str. 2008720114]
          Length = 221

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 52/200 (26%), Positives = 93/200 (46%), Gaps = 10/200 (5%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPS 93
            ++ L P + + A  G+M +M+  +++E          + +  FG ++    V + P  +
Sbjct: 14  LKVKLSPGQTIKAESGAMVYMTPGIDVETKMGSGFLSAISRRFFGGESFFFNVFKAP-VA 72

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
            G +G+A      ++ +DL   G  L+ +  A+L S   + +       +R+ +GG EG 
Sbjct: 73  GGEIGLAPELPGDVVGVDLTDTG--LIVESGAYLASDETISIKPMFGG-IRSFLGG-EGV 128

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVF 212
              +  G G  F+ A G +V   ++V    T+D   IVA   S+  +I K  G  +   F
Sbjct: 129 FLLEAIGVGKLFLNAYGGIVP--IDVQGSYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFF 186

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
           GG+ LV    +G G V IQ+
Sbjct: 187 GGEGLVME-FSGHGRVLIQT 205


>gi|223997894|ref|XP_002288620.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220975728|gb|EED94056.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 302

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 44/191 (23%), Positives = 79/191 (41%), Gaps = 8/191 (4%)

Query: 85  VVLRNPGPS---DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
            +++N  PS   D  +  + P    ++PI+      EL+    +FL   N V V+  + +
Sbjct: 64  TIVKNTSPSRMGDVLLAPSEPGGICLVPIN--GMRDELMLTKGSFLACDNSVDVTTIVQR 121

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
            VRN +    GF   K +G+G   + + GS+ +  L   E   +D   +VA T+S+    
Sbjct: 122 GVRNSLSSGTGFFLMKASGRGTIAMASYGSIHKYTLSSDETRMIDNGHLVAWTASMRYST 181

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQI 261
                           +     GPG+V++QS   H+ +Q    A           F    
Sbjct: 182 TLASSSVWGSISSGEGLMCRFEGPGVVYLQS---HKPNQSDEDAKRRGRNGGGSPFAACF 238

Query: 262 AIFFFLAYVVI 272
           A+ F + +++I
Sbjct: 239 ALLFVIVWILI 249


>gi|146323366|ref|XP_754754.2| DUF124 domain protein [Aspergillus fumigatus Af293]
 gi|129558326|gb|EAL92716.2| DUF124 domain protein [Aspergillus fumigatus Af293]
 gi|159127762|gb|EDP52877.1| DUF124 domain protein [Aspergillus fumigatus A1163]
          Length = 333

 Score = 47.8 bits (112), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 54/109 (49%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DAFL S + +          + V  G EG    KL+G GL ++ + G++++K++  GE  
Sbjct: 210 DAFLASTSGISKDYQAQGLSKGVFSG-EGLFVYKLSGVGLVWLQSFGAIIRKDIADGESY 268

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   +VA      ++   +G I   +  G+ L     TGPG V++Q+
Sbjct: 269 YVDNGHLVAWNCKYKMERVASGGIISNISSGEGLA-CRFTGPGTVYLQT 316


>gi|359726625|ref|ZP_09265321.1| hypothetical protein Lwei2_06447 [Leptospira weilii str.
           2006001855]
 gi|417777919|ref|ZP_12425731.1| TIGR00266 family protein [Leptospira weilii str. 2006001853]
 gi|410781889|gb|EKR66456.1| TIGR00266 family protein [Leptospira weilii str. 2006001853]
          Length = 221

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 53/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  +++E          + +  FG ++    + + P  G
Sbjct: 14  LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRRFFGGESFFFNIFKAPSGG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ +DL   G  L+ +  A+L S   + + +     +R+  GG 
Sbjct: 74  AELGF----APELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    ++ G G  F+ A G ++   ++V    T+D   +VA   S+  +I K  G  + 
Sbjct: 126 EGIFLLEVLGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ LV    TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205


>gi|65322229|ref|ZP_00395188.1| COG2013: Uncharacterized conserved protein [Bacillus anthracis str.
           A2012]
          Length = 104

 Score = 47.4 bits (111), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 48/105 (45%), Gaps = 14/105 (13%)

Query: 19  QDAVIPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF 78
           Q   I +++ G + Q  +I L P+E V+A  G+M  M   +EME I+   +  G    LF
Sbjct: 2   QAHEIEYKLYGDDMQFVEIELDPEESVIAEAGAMMMMEDYIEMETIF--GDGSGPSGGLF 59

Query: 79  GKTITS------------VVLRNPGPSDGFVGIAAPSLARILPID 111
           GK + +             V  N G     V  AAP   +I+P+D
Sbjct: 60  GKLMGAGKRLVTGESMFMTVFTNTGHGKRHVSFAAPYPRKIIPVD 104


>gi|159469370|ref|XP_001692840.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158268629|gb|EDO95688.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 152

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 54/122 (44%), Gaps = 2/122 (1%)

Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
           QP  +L   +   ++  L         G EGF   +  G+G   I + G++ + +L  GE
Sbjct: 27  QPGGYLACDHSHCIARILAHAPSQGCCGGEGFFVMEANGRGRLLICSYGAITRYDLAPGE 86

Query: 182 VITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
              +D    VA TS +   I K    +   V  G+ LV A  TGPG VF+Q+     L+ 
Sbjct: 87  KRKIDNGYCVAWTSGMEWNIAKAAKGLWNTVISGEGLV-ANFTGPGTVFVQTRSMQNLAN 145

Query: 241 RI 242
            +
Sbjct: 146 AL 147


>gi|257051409|ref|YP_003129242.1| hypothetical protein Huta_0322 [Halorhabdus utahensis DSM 12940]
 gi|256690172|gb|ACV10509.1| protein of unknown function DUF124 [Halorhabdus utahensis DSM
           12940]
          Length = 235

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 44/76 (57%), Gaps = 1/76 (1%)

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
           +L+G G  F+ + G++ + +L+ GE  TVD   IVA   + N  +K  G ++  +F G+ 
Sbjct: 131 ELSGTGRTFLSSYGAIERVDLDPGETYTVDTGHIVAFEGTTNFDVKKVGGLKSTLFSGEG 190

Query: 217 LVTAVVTGPGIVFIQS 232
           LV    +G G V++Q+
Sbjct: 191 LVCE-FSGEGSVWLQT 205


>gi|440638033|gb|ELR07952.1| hypothetical protein GMDG_02811 [Geomyces destructans 20631-21]
          Length = 365

 Score = 47.4 bits (111), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 57/109 (52%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DA L S   V V +   Q +   +   EG    K++G G+ +I + G++++K+L+ GE  
Sbjct: 241 DALLASTQGV-VKDFKRQGLGKAMFSGEGLFVFKISGTGIMWITSFGAIIRKDLQEGERY 299

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   ++A  +   ++   +G I   +  G+ LV    TGPG +F+Q+
Sbjct: 300 VVDNGHLIAWNTKYVLERIASGGIISGIASGEGLVCK-FTGPGTIFLQT 347


>gi|281411601|ref|YP_003345680.1| hypothetical protein Tnap_0161 [Thermotoga naphthophila RKU-10]
 gi|281372704|gb|ADA66266.1| protein of unknown function DUF124 [Thermotoga naphthophila RKU-10]
          Length = 224

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 48/198 (24%), Positives = 87/198 (43%), Gaps = 20/198 (10%)

Query: 27  ILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLF-----GKT 81
           +L G   + ++ L   E VV  PG+M +M G +E+          G+W+ L      G++
Sbjct: 6   VLKGSYALLKVFLSIGESVVVEPGAMVYMKGPIEVNT----SATGGVWKALKRAILGGES 61

Query: 82  ITSVVLRNPGPSDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTL 139
                  + G S+  VG+A   P    ++P+   ++      Q  +FL     V++  + 
Sbjct: 62  FFMNTYISRGESE--VGLAPQLPGDIDVIPLKDILY-----VQSTSFLACDPSVEMDVSF 114

Query: 140 DQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNV 199
              +++   G EG    K  G G   + + G +    L+ GE  TVD   +VA   +V  
Sbjct: 115 GG-LKSFFSG-EGIFLLKFVGHGDVAVSSFGGIKSIELKPGEEFTVDTGHVVAFDGTVKW 172

Query: 200 QIKYNGPIRRAVFGGDNL 217
            ++  G ++  +FGG+ L
Sbjct: 173 NVRTFGGLKSTLFGGEGL 190


>gi|300790364|ref|YP_003770655.1| hypothetical protein AMED_8558 [Amycolatopsis mediterranei U32]
 gi|384153893|ref|YP_005536709.1| hypothetical protein RAM_43925 [Amycolatopsis mediterranei S699]
 gi|399542242|ref|YP_006554904.1| hypothetical protein AMES_8427 [Amycolatopsis mediterranei S699]
 gi|299799878|gb|ADJ50253.1| conserved hypothetical protein [Amycolatopsis mediterranei U32]
 gi|340532047|gb|AEK47252.1| hypothetical protein RAM_43925 [Amycolatopsis mediterranei S699]
 gi|398323012|gb|AFO81959.1| hypothetical protein AMES_8427 [Amycolatopsis mediterranei S699]
          Length = 225

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 57/201 (28%), Positives = 97/201 (48%), Gaps = 7/201 (3%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGP 92
           V ++ML P E      G+M   S  +++++        G+ + +L G++        P P
Sbjct: 13  VARLMLAPGEPCQVESGAMMATSYGVQVQSQAQGGIMKGLGRAFLSGESFFISTFTAP-P 71

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
           + G+V +AA     I  I L    G  + +   +L S + V+ + T    ++N++GG  G
Sbjct: 72  NGGWVDVAANLPGDIQVITLDGRTGWAVTR-GCWLASSHGVQ-TETKWGGMKNLMGGEGG 129

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAV 211
           FL    TGQG   +   G+V    L+ GE++T+D   +VA   +V  QI K    I +++
Sbjct: 130 FLTHA-TGQGQLLVSCYGAVETITLQQGEMVTIDTGHVVAYADTVQYQIRKVAQGIIQSM 188

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+ LV   V GPG +  Q+
Sbjct: 189 KSGEGLVFDFV-GPGQIMTQT 208


>gi|116514068|ref|YP_812974.1| hypothetical protein LBUL_0969 [Lactobacillus delbrueckii subsp.
           bulgaricus ATCC BAA-365]
 gi|116514121|ref|YP_813027.1| hypothetical protein LBUL_1039 [Lactobacillus delbrueckii subsp.
           bulgaricus ATCC BAA-365]
 gi|116093383|gb|ABJ58536.1| hypothetical protein LBUL_0969 [Lactobacillus delbrueckii subsp.
           bulgaricus ATCC BAA-365]
 gi|116093436|gb|ABJ58589.1| hypothetical protein LBUL_1039 [Lactobacillus delbrueckii subsp.
           bulgaricus ATCC BAA-365]
          Length = 222

 Score = 47.4 bits (111), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 51/230 (22%), Positives = 101/230 (43%), Gaps = 18/230 (7%)

Query: 23  IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + +++ GG    +  + L   E +    G+M +M        +   +  +G      G++
Sbjct: 1   MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGSLLGAIGRS 60

Query: 82  ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
           +TS     + +  G +DG  +G+A     +I  + +   G +  C    AFL S + V  
Sbjct: 61  LTSGESMFITQATGDADGGQIGVAPAIPGKIEKLSV---GKQQYCLNTGAFLASDDSVSY 117

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                +  +  +GG  GF   +  G+G   + A G +V+  +   + I++D   ++A  +
Sbjct: 118 KMRSQKLSKAALGGTGGFFVMETEGEGDMLVNAFGDLVELTVTSDKPISIDNEHVIAWDA 177

Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +++  IK    +   +FG   G+ LV    TG G V IQ+   H L++ +
Sbjct: 178 NLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVLIQTRNLHSLAEAL 222


>gi|378731982|gb|EHY58441.1| hypothetical protein HMPREF1120_06451 [Exophiala dermatitidis
           NIH/UT8656]
          Length = 395

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 51/189 (26%), Positives = 85/189 (44%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS S+ ++       +  M + L G  +T      PG     + +A  SL
Sbjct: 201 LTAKPGAMIAMSPSITLKGAV----KFSMKKLLIGGEMTHSTFTGPGE----LLLAPSSL 252

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I  + L   G E      DAFL     V V     Q +   +   EG    K++G+G+
Sbjct: 253 GDISLLRLD--GNETWSVGRDAFLACTQGV-VKEYKSQGIGKAMFSGEGLFVYKMSGRGI 309

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            ++   G++++K+L  GE   VD   +VA      ++   +G I      G+ LV    +
Sbjct: 310 LWMSTFGAILKKDLVDGERYIVDNGHLVAWNCKYVLERVASGGIISGAASGEGLVCN-FS 368

Query: 224 GPGIVFIQS 232
           GPG V++Q+
Sbjct: 369 GPGTVYLQT 377


>gi|381163294|ref|ZP_09872524.1| TIGR00266 family protein [Saccharomonospora azurea NA-128]
 gi|418461391|ref|ZP_13032466.1| hypothetical protein SZMC14600_10578 [Saccharomonospora azurea SZMC
           14600]
 gi|359738494|gb|EHK87379.1| hypothetical protein SZMC14600_10578 [Saccharomonospora azurea SZMC
           14600]
 gi|379255199|gb|EHY89125.1| TIGR00266 family protein [Saccharomonospora azurea NA-128]
          Length = 225

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 89/203 (43%), Gaps = 11/203 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V +++L P E      G+M   S  M M+         G+ +   G     V      P+
Sbjct: 13  VARLLLAPHEPAQVEAGAMLATSYGMHMQASTQGGVMKGLGRAFLGGESLFVSTYTAPPN 72

Query: 94  DGFVGIAA--PSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
            G+V +AA  P   R++ +D    G +  C    ++L + + +++         N+ GG 
Sbjct: 73  GGWVDVAAGLPGDVRVIEMD----GRQGWCVTRGSWLANSHGIQLETKWG-GFSNLFGGE 127

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
            GFL     GQG   +   G++   NL+ GE +T+D   +VA   +V  Q+ K +  + +
Sbjct: 128 GGFLTHA-QGQGQLVVSCYGAIDVVNLQPGEYVTIDSGHVVAYADTVQSQLRKVSQGVIQ 186

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
           ++  G+  V     GPG V  Q+
Sbjct: 187 SLKSGEGFVFDFA-GPGQVLTQT 208


>gi|257057406|ref|YP_003135238.1| hypothetical protein Svir_34470 [Saccharomonospora viridis DSM
           43017]
 gi|256587278|gb|ACU98411.1| conserved hypothetical protein TIGR00266 [Saccharomonospora viridis
           DSM 43017]
          Length = 225

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 53/210 (25%), Positives = 92/210 (43%), Gaps = 18/210 (8%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V +++L P E      G+M   S  M +E         G+ + L G     V      P+
Sbjct: 13  VARLLLAPHEPAHVEAGAMLATSYGMHVEAQAQGGVMKGLGRALLGGESLFVSTYTAPPN 72

Query: 94  DGFVGIAA--PSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
            G+V IA+  P   R++ +D    G    C    ++L + + +++         N+ GG 
Sbjct: 73  GGWVDIASSLPGDIRLIEMD----GRVGWCVTRGSWLANSHGIQLETRWGG-FSNLFGGE 127

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
            GFL     GQG   +   G++   NL+ GE +T+D   +VA   +V  Q++    + R 
Sbjct: 128 GGFLTHA-QGQGQLIVACYGAIDVINLQPGEYVTIDSGHVVAYADTVQSQLR---KVSRG 183

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           V      + ++ +G G+VF  + P   L+Q
Sbjct: 184 V------IQSLKSGEGLVFDFAGPGQILTQ 207


>gi|448408461|ref|ZP_21574256.1| hypothetical protein C475_07996 [Halosimplex carlsbadense 2-9-1]
 gi|445674316|gb|ELZ26860.1| hypothetical protein C475_07996 [Halosimplex carlsbadense 2-9-1]
          Length = 229

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 34/114 (29%), Positives = 56/114 (49%), Gaps = 3/114 (2%)

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           L  Q  +FL S   V+V +T     +   GG EG    +L G G AF+ + G++ +  + 
Sbjct: 95  LYVQSSSFLASDPTVEV-DTEFGGAKTFFGG-EGLFLLRLEGTGPAFLSSYGAIDEHEVT 152

Query: 179 VGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             E   VD   IVA   S++  ++  G ++  +F G+ LV    +G G V+ Q+
Sbjct: 153 SDESFVVDTGHIVAFEESLSFDVRKVGGLKSTLFSGEGLVCE-FSGEGTVWTQT 205


>gi|432329095|ref|YP_007247239.1| TIGR00266 family protein [Aciduliprofundum sp. MAR08-339]
 gi|432135804|gb|AGB05073.1| TIGR00266 family protein [Aciduliprofundum sp. MAR08-339]
          Length = 223

 Score = 47.0 bits (110), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 54/204 (26%), Positives = 92/204 (45%), Gaps = 19/204 (9%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSD 94
            ++ L P E V A  G+M +MS ++ ++         G+ + +  K    +       S+
Sbjct: 14  LKVHLSPGESVTAEAGAMVYMSSNIGVKTTTGGGFLKGLARKILTKQTLFMNTYYAEGSE 73

Query: 95  GFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVN---DVKVSNTLDQRVRNVI--G 148
           G+V + AP L   I+ ID++     L      +L S      VK +       R +   G
Sbjct: 74  GYV-VFAPGLPGDIMEIDVSK---PLYVSDTNYLASTGLQFGVKFTG-----FRGIFTPG 124

Query: 149 GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR 208
           G+  F   KL G G  ++   G V    L+ GE + +D   + A  + +  +++  G ++
Sbjct: 125 GMFWF---KLEGIGKVWLATFGGVDVIELKPGEHLLIDNIHLAAFDAGMQFRLRKFGKLK 181

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
             +FGG+ L+T    GPG VFIQS
Sbjct: 182 SFLFGGEYLLTE-FEGPGRVFIQS 204


>gi|51891979|ref|YP_074670.1| hypothetical protein STH841 [Symbiobacterium thermophilum IAM
           14863]
 gi|51855668|dbj|BAD39826.1| conserved hypothetical protein [Symbiobacterium thermophilum IAM
           14863]
          Length = 226

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 58/216 (26%), Positives = 91/216 (42%), Gaps = 44/216 (20%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGKTI-------TSVVLR 88
           + L P E+V A PG+M  MS ++E+      E EV G +    G+++       TS    
Sbjct: 16  VYLGPGEEVQAEPGAMVAMSSTVEL------ETEVKGGFLGALGRSVLGGESFFTSRYRA 69

Query: 89  NPGPSDGFVGIAAPSLARILPIDLA---MFGGELLCQPDAFLCSVNDVKVSNTLDQRVRN 145
             GP +        +LA  LP D+    + G     +  A+L +  ++ V    D R   
Sbjct: 70  RGGPGE-------LALAPTLPGDIGYVELRGETFYLKSGAYLAADPELAV----DSRWGG 118

Query: 146 VIG--GIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
             G  G  G    ++ G G     A G++ +K L  GE   VD   +V  +  +N     
Sbjct: 119 ARGFFGSGGLFLLRVQGTGGLLFTAYGALHEKELAPGERYRVDTGHVVGFSEGLNF---- 174

Query: 204 NGPIRRAVFG-------GDNLVTAVVTGPGIVFIQS 232
              +R+A  G       G+ LV    TGPG V++Q+
Sbjct: 175 --AVRKAARGWFSTLASGEGLVCE-FTGPGRVYLQT 207


>gi|411118304|ref|ZP_11390685.1| TIGR00266 family protein [Oscillatoriales cyanobacterium JSC-12]
 gi|410712028|gb|EKQ69534.1| TIGR00266 family protein [Oscillatoriales cyanobacterium JSC-12]
          Length = 231

 Score = 46.6 bits (109), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 95/205 (46%), Gaps = 17/205 (8%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGP 92
           + ++ L   E+V     SM  MS  + +E          + + L G +++   V   P P
Sbjct: 13  ISRVRLAGGEQVRVEAASMVGMSAGITLETTTTGGFMQSLKRSLLGGESLFQNVYTAP-P 71

Query: 93  SDGFVGIAAPSLARILPIDLAMFG--GELLCQPDAFLCSVNDVKVSNTLDQR---VRNVI 147
             G V +A PSL    P DL +      +L Q  A++ S  DV ++  LD +    ++  
Sbjct: 72  QGGEVWVA-PSL----PGDLNVLTITEPMLIQSGAYVAS--DVTIA--LDTKWGGSKSFF 122

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
           G   G    +  GQG   + + G++ +  L  GE  T+D   +VA++ S+  + +  G I
Sbjct: 123 GTGGGLFMLRAEGQGQVVVSSYGAIHEMTLAPGESFTLDTGHLVALSESMQFKTRSIGGI 182

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  +F G+  V A +TGPG   +Q+
Sbjct: 183 KTFMFSGEGFV-ADLTGPGRFLMQT 206


>gi|334127084|ref|ZP_08501014.1| hypothetical protein HMPREF9081_0601 [Centipeda periodontii DSM
           2778]
 gi|333390046|gb|EGK61198.1| hypothetical protein HMPREF9081_0601 [Centipeda periodontii DSM
           2778]
          Length = 227

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 37/119 (31%), Positives = 64/119 (53%), Gaps = 11/119 (9%)

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI---EGFLRQKLTGQGLAFILAGGSVVQK 175
           ++ Q   FL +   + V    D +++ ++ G+   EGF   KLTG+G+AF+ + G +   
Sbjct: 97  MIVQKGGFLAAEESINV----DSKMQGLMQGLFSQEGFFLLKLTGRGVAFLSSYGVIHVL 152

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKY--NGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           NLE GE + VD   +VA    ++ +++   NG +  ++  G+ LV     GPG V IQ+
Sbjct: 153 NLEQGEEVIVDNGHLVAWPDYMDYKVEKASNGWV-SSIMSGECLVCR-FRGPGPVLIQT 209


>gi|119492280|ref|XP_001263579.1| hypothetical protein NFIA_068520 [Neosartorya fischeri NRRL 181]
 gi|119411739|gb|EAW21682.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 334

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 32/109 (29%), Positives = 54/109 (49%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DAFL S + +          + V  G EG    KL+G GL ++ + G++++K++  GE  
Sbjct: 211 DAFLASTSGISKDYQAQGLSKGVFSG-EGLFVYKLSGVGLVWLQSFGAIIKKDIADGESY 269

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   +VA      ++   +G I   +  G+ L     TGPG V++Q+
Sbjct: 270 YVDNGHLVAWNCKYKMERVASGGIISNMSSGEGLA-CRFTGPGTVYLQT 317


>gi|374324403|ref|YP_005077532.1| hypothetical protein HPL003_22935 [Paenibacillus terrae HPL-003]
 gi|357203412|gb|AET61309.1| hypothetical protein HPL003_22935 [Paenibacillus terrae HPL-003]
          Length = 226

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 99/206 (48%), Gaps = 9/206 (4%)

Query: 30  GEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLR- 88
           G   + ++ + P E + A  G+M  MS S++++         G+ + L G+      LR 
Sbjct: 9   GAFAMLKVQMNPGETIKAEMGAMVSMSSSVDIKGTVDGGLLRGLGRMLSGEKFFFQELRA 68

Query: 89  NPGPSDGFVGIAAPSLARILPIDL-AMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
           + GP++  V +A  S+  +  ++L   +   LL Q D FL     ++VS  +   ++ + 
Sbjct: 69  SRGPAE--VLLAPASIGDVQAVELDGTY--RLLVQKDGFLACTEGIEVSTKMQNLMKGLF 124

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGP 206
            G EGF   +++G+G  F+ + G++    +  GE   +D + +VA    ++ +I K +  
Sbjct: 125 SG-EGFFIVEISGRGTVFLSSYGAIHPIYVAPGEERIIDNAHLVAWPDYMDYKIEKASKG 183

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
              +V  G+ LV    +  G V IQS
Sbjct: 184 WLSSVTSGEALVCRFRS-EGTVLIQS 208


>gi|302913033|ref|XP_003050829.1| hypothetical protein NECHADRAFT_61207 [Nectria haematococca mpVI
           77-13-4]
 gi|256731767|gb|EEU45116.1| hypothetical protein NECHADRAFT_61207 [Nectria haematococca mpVI
           77-13-4]
          Length = 455

 Score = 46.6 bits (109), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 48/187 (25%), Positives = 88/187 (47%), Gaps = 13/187 (6%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           A+PG+M  MS S+ ++     + +  + + + G  ++S     PG     V +A  SL  
Sbjct: 264 AKPGAMIAMSPSVTLKG----QIKFSVKKMITGGEMSSSTFVGPG----EVLLAPHSLGD 315

Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
           +    L + G E      DA+L S   V + +   Q +   +   EG    K++G GL +
Sbjct: 316 V--TTLRLTGQEKWSVGHDAYLASTQGV-IKDFKRQSISKAMFSGEGLWVYKISGTGLMW 372

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
           I + G++++K L  GE   VD   ++A  +   ++   +G I   +   + LV    TGP
Sbjct: 373 ISSFGAIIKKTLVDGEKYIVDNGHLIAWNTKYVMERVASGGIISGMASAEGLVCK-FTGP 431

Query: 226 GIVFIQS 232
           G +++Q+
Sbjct: 432 GTIYMQT 438


>gi|410450302|ref|ZP_11304343.1| TIGR00266 family protein [Leptospira sp. Fiocruz LV3954]
 gi|410015815|gb|EKO77906.1| TIGR00266 family protein [Leptospira sp. Fiocruz LV3954]
          Length = 221

 Score = 46.6 bits (109), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 95/203 (46%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  + +E          + +  FG ++    + + P  G
Sbjct: 14  LKVKLLPGQTIKAEAGAMVYMTPGISVETKMGSGFLSAVSRRFFGGESFFFNIFKAPTGG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ IDL   G  L+ +  A+L S   V + +     +R+ +GG 
Sbjct: 74  AEIGF----APELPGDVVGIDLTDTG--LIVEAGAYLASDETVTMKSMFGG-LRSFLGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    +  G G  F+ + G ++   ++V    T+D   +VA   S++ +I K  G  + 
Sbjct: 126 EGIFLLEAIGNGKLFLNSYGGIIP--IDVQGSYTIDTGHVVAFDKSLSYKIAKAGGSWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ LV    TG G V IQ+
Sbjct: 184 TLFGGEGLVMQF-TGHGKVLIQT 205


>gi|170290249|ref|YP_001737065.1| hypothetical protein Kcr_0629 [Candidatus Korarchaeum cryptofilum
           OPF8]
 gi|170174329|gb|ACB07382.1| protein of unknown function DUF124 [Candidatus Korarchaeum
           cryptofilum OPF8]
          Length = 231

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 50/202 (24%), Positives = 93/202 (46%), Gaps = 17/202 (8%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMW--QWLFGKTITSVVLRNP 90
           + L   E + A  G+M +M+ +++M+       +   ++G+   Q  F    T+V     
Sbjct: 16  VKLNQGETITAEAGAMTYMTPNIQMKTRARAGVLDTIKLGVLGGQSFFVNDYTAV----G 71

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
           GP  G V +A+  L  I  + L   G   + Q  +++ S   V++        + + G  
Sbjct: 72  GP--GEVALASAPLGDIEKLTLDGRG--YIIQRSSYVASDPTVELDIKWQGFTKGIFG-- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
           +G    K +G+G  FI   G++ +  L  GE + VD   +VA + + + +++  G ++  
Sbjct: 126 QGLFMIKTSGRGNLFINTFGAIDRHELSAGESLIVDNFHLVAFSDTCSYEVRRMGGLKEL 185

Query: 211 VFGGDNLVTAVVTGPGIVFIQS 232
              G+ LV   V GPG V IQ+
Sbjct: 186 ALSGEGLVVE-VRGPGEVLIQT 206


>gi|398336773|ref|ZP_10521478.1| hypothetical protein LkmesMB_14976 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 221

 Score = 46.2 bits (108), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 48/200 (24%), Positives = 94/200 (47%), Gaps = 10/200 (5%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPS 93
            ++ L P + + A  G+M +M+  +++E          + +  FG ++    + + P  +
Sbjct: 14  LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRRFFGGESFFFNIFKAP-SA 72

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
              +G+A      ++ +DL   G  L+ +  A+L S   + +       +R+  GG EG 
Sbjct: 73  GAEIGLAPELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKPKFGG-IRSFFGG-EGV 128

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVF 212
              ++ G+G  F+ A G ++   ++V    T+D   +VA   ++  +I K  G  +  +F
Sbjct: 129 FLLEILGKGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKTLQYRIAKAGGSWKSTLF 186

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
           GG+ LV    TG G V IQ+
Sbjct: 187 GGEGLVMEF-TGHGKVLIQT 205


>gi|13470475|ref|NP_102044.1| hypothetical protein mlr0194 [Mesorhizobium loti MAFF303099]
 gi|14021217|dbj|BAB47830.1| mlr0194 [Mesorhizobium loti MAFF303099]
          Length = 221

 Score = 46.2 bits (108), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 37/146 (25%), Positives = 66/146 (45%), Gaps = 6/146 (4%)

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +  +      I+PI L   G E+  +   FL +  +V  +    + +  ++ G  GF 
Sbjct: 73  GIIAFSRDGPGHIVPIHLRR-GEEIQVREHQFLAATGNVDYTFERVRGLATMLFGQSGFF 131

Query: 155 RQKLTGQG---LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
             +  G G   + ++   G+V +K L  GE I ++    +   +SV ++ K +  +    
Sbjct: 132 IDRFRGDGGDGIVWLHGYGNVFEKTLAAGETIDIEPGGWLFKDASVRMETKID-RLSSGF 190

Query: 212 FGGD-NLVTAVVTGPGIVFIQSLPFH 236
           FG + N +    TGPG V IQS+  H
Sbjct: 191 FGANMNFIVNRFTGPGRVGIQSMYLH 216


>gi|365960832|ref|YP_004942399.1| hypothetical protein FCOL_08985 [Flavobacterium columnare ATCC
           49512]
 gi|365737513|gb|AEW86606.1| hypothetical protein FCOL_08985 [Flavobacterium columnare ATCC
           49512]
          Length = 220

 Score = 46.2 bits (108), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 58/225 (25%), Positives = 102/225 (45%), Gaps = 35/225 (15%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG--------------KTI 82
           + L+P E +VA   +M  MS  ++++  +     VG+ + + G              KT+
Sbjct: 16  VELEPGETIVAESDAMSSMSAELDLDAKFNGGFFVGLAKKILGGESLFINHFTNNTNKTL 75

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQ 141
            S+ L    P D            I  +D    G +  C Q  A++ S   VK+S     
Sbjct: 76  -SLHLTQSTPGD----------IEIKELD----GTQSYCIQRGAYIASEKGVKLSVKW-A 119

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
            + ++IGG EG  +  ++G G     A G +++K ++ GE I VD   +VA   ++ ++ 
Sbjct: 120 GLGSLIGG-EGLFKLVVSGTGKVIFGAYGGLLEKEID-GEYI-VDTGHLVAYEPNMKLKP 176

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           + +G I  + FGG+  VT  + G G ++IQ+     L   I R +
Sbjct: 177 QLSGGIFSSFFGGEGFVTR-IEGKGKIYIQTRNLSGLVSWINRQI 220


>gi|359685935|ref|ZP_09255936.1| hypothetical protein Lsan2_15273 [Leptospira santarosai str.
           2000030832]
 gi|418746675|ref|ZP_13302995.1| TIGR00266 family protein [Leptospira santarosai str. CBC379]
 gi|418755449|ref|ZP_13311654.1| TIGR00266 family protein [Leptospira santarosai str. MOR084]
 gi|421113424|ref|ZP_15573868.1| TIGR00266 family protein [Leptospira santarosai str. JET]
 gi|422004334|ref|ZP_16351554.1| hypothetical protein LSS_12579 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|409964244|gb|EKO32136.1| TIGR00266 family protein [Leptospira santarosai str. MOR084]
 gi|410792384|gb|EKR90319.1| TIGR00266 family protein [Leptospira santarosai str. CBC379]
 gi|410801198|gb|EKS07372.1| TIGR00266 family protein [Leptospira santarosai str. JET]
 gi|417257003|gb|EKT86411.1| hypothetical protein LSS_12579 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|456875109|gb|EMF90340.1| TIGR00266 family protein [Leptospira santarosai str. ST188]
          Length = 221

 Score = 45.8 bits (107), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 54/203 (26%), Positives = 94/203 (46%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  + +E          + +  FG ++    + + P  G
Sbjct: 14  LKVKLLPGQTIKAEAGAMVYMTPGISVETKMGSGFLSAVSRRFFGGESFFFNIFKAPTGG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ IDL   G  L+ +  A+L S   V + +     +R+  GG 
Sbjct: 74  AEIGF----APELPGDVVGIDLTDTG--LIVEAGAYLASDETVTMKSMFGG-LRSFFGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    +  G G  F+ + G ++   ++V    T+D   +VA   S++ +I K  G  + 
Sbjct: 126 EGIFLLEAIGNGKLFLNSYGGIIP--IDVQGSYTIDTGHVVAFDKSLSYKIAKAGGSWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ LV    TG G V IQ+
Sbjct: 184 TLFGGEGLVMQF-TGHGKVLIQT 205


>gi|381157052|ref|ZP_09866286.1| TIGR00266 family protein [Thiorhodovibrio sp. 970]
 gi|380880915|gb|EIC23005.1| TIGR00266 family protein [Thiorhodovibrio sp. 970]
          Length = 464

 Score = 45.8 bits (107), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 68/133 (51%), Gaps = 3/133 (2%)

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLE 178
           ++ Q D FL    +V+V+  L    + +  G EGF      G+GL F+ + G++ + NL 
Sbjct: 333 MIIQKDGFLACTEEVQVNTQLQNLAQGLFSG-EGFFVVGAEGRGLLFLESYGAIHEFNLG 391

Query: 179 VGEVITVDVSCIVAVTSSVNVQIK-YNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHR 237
            GE   VD   +VA + ++  +++  N  I  A+  G+ +V    TGPG + +Q+    +
Sbjct: 392 PGEQKIVDNGYLVAWSKNMGYRLEAANIGIVAALTSGEAIVCR-FTGPGKILVQTRMARQ 450

Query: 238 LSQRIARAVTSPN 250
               I++ + +P+
Sbjct: 451 FGLWISKFLPTPS 463


>gi|258564182|ref|XP_002582836.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237908343|gb|EEP82744.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 342

 Score = 45.8 bits (107), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DAFL     + V +   Q +   I   EG     ++G G+ FI + G++++K+L  GE  
Sbjct: 217 DAFLAGTQGI-VKDYKRQGLSKAIFSGEGLYTYMVSGTGILFIQSFGAIIRKDLANGEKY 275

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   +VA      ++   +G I   +  G+ LV    TGPG V++Q+
Sbjct: 276 IVDNGHLVAWNCGYVMERIASGGIISGLSSGEGLVCK-FTGPGTVYLQT 323


>gi|319783243|ref|YP_004142719.1| hypothetical protein [Mesorhizobium ciceri biovar biserrulae
           WSM1271]
 gi|317169131|gb|ADV12669.1| protein of unknown function DUF124 [Mesorhizobium ciceri biovar
           biserrulae WSM1271]
          Length = 232

 Score = 45.8 bits (107), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 36/146 (24%), Positives = 66/146 (45%), Gaps = 6/146 (4%)

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G +  +      I+PI L   G E+  +   F  +  +V  +    + +  ++ G  GF 
Sbjct: 84  GIIAFSRDGPGHIVPIHLR-HGEEIQVREHQFRAATGNVDYTFERVRGLATMLFGQSGFF 142

Query: 155 RQKL---TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
             +    +G G+ ++   G+V +K L  GE I ++    +   +SV ++ K +  +    
Sbjct: 143 IDRFRGDSGDGIVWLHGYGNVFEKTLAAGETIDIEPGGWLFKDASVRMETKID-RLSSGF 201

Query: 212 FGGD-NLVTAVVTGPGIVFIQSLPFH 236
           FG + N +    TGPG V IQS+  H
Sbjct: 202 FGANMNFIVNRFTGPGRVGIQSMYLH 227


>gi|443311726|ref|ZP_21041351.1| TIGR00266 family protein [Synechocystis sp. PCC 7509]
 gi|442778299|gb|ELR88567.1| TIGR00266 family protein [Synechocystis sp. PCC 7509]
          Length = 225

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 56/207 (27%), Positives = 96/207 (46%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
           ++ +  + L+  E++VA  G M  MSG    S  +          G+ + + G+++   V
Sbjct: 10  DSAIAHVTLESGEELVAEAGCMIAMSGYINASTTLRQGKGGGILGGLKRLVAGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P  S G V +A   +  IL   +   G  L+ Q  ++L S ++V +     Q  ++V
Sbjct: 70  FRSP-TSGGEVFLAPKFMGDILLYKVVGMG--LVVQSTSYLASESNVDIELGF-QGFKSV 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
             G E      +TG G   + + G++ + N++ GE I VD   IVA   S++ +I   G 
Sbjct: 126 FSG-ESVFWLNITGNGYVILSSFGAIYEINVD-GEYI-VDTGHIVAFEKSLSFEITKAGS 182

Query: 207 IRRAVF-GGDNLVTAVVTGPGIVFIQS 232
                F GG+ LV     G G ++ QS
Sbjct: 183 SWMGAFLGGEGLVCKF-KGQGKIYCQS 208


>gi|255944277|ref|XP_002562906.1| Pc20g03550 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211587641|emb|CAP85684.1| Pc20g03550 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 371

 Score = 45.8 bits (107), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 50/190 (26%), Positives = 83/190 (43%), Gaps = 15/190 (7%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           +  RPG+M  MS ++ +             + + G ++T+     PG     V +A P L
Sbjct: 178 LTVRPGAMIAMSPTITLRG----HVSFSFKKLIIGGSMTTSQYTGPGE----VLLAPPML 229

Query: 105 ARI--LPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQG 162
             I  LPID    G       D+FL   + V          +    G EGF   + TG G
Sbjct: 230 GDIVVLPIDT---GEAWNAGRDSFLAHTSGVHHLYQSQGLSKGFFSG-EGFFVYEFTGHG 285

Query: 163 LAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVV 222
           L ++ + G++V+K+L  GE   VD   +VA   +  ++   +G +  +   G+ L     
Sbjct: 286 LLWLQSFGAIVKKDLREGESYFVDNGHLVAWKCAYEIERPASGGLLSSFSSGEGLA-CRF 344

Query: 223 TGPGIVFIQS 232
            GPG V++Q+
Sbjct: 345 KGPGTVYLQT 354


>gi|452949131|gb|EME54602.1| hypothetical protein H074_28018 [Amycolatopsis decaplanina DSM
           44594]
          Length = 225

 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 57/211 (27%), Positives = 101/211 (47%), Gaps = 20/211 (9%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGP 92
           V ++ML P E      G+M   S  +++++        G+ + +L G++        P  
Sbjct: 13  VARLMLAPGEPAQVESGAMMATSYGVQVQSQAQGGIMKGLGRAFLSGESFFISTYTAP-Q 71

Query: 93  SDGFVGIAA--PSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           + G+V +AA  P   +I+ +D    G    C    ++L S + V+ + T    ++N++GG
Sbjct: 72  NGGWVDVAANLPGDMQIINLD----GRTGWCVTRGSWLASSHGVQ-TETKWGGLKNLVGG 126

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
             GFL    TGQG   +   G++    L+ GEVIT+D   +VA   +V  Q      IR+
Sbjct: 127 EGGFLTHA-TGQGPLVVACYGALETVTLQQGEVITIDTGHVVAFADTVQYQ------IRK 179

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
              G   ++ ++ +G G+VF  + P   L+Q
Sbjct: 180 VATG---VIQSMKSGEGLVFDFAGPGQLLTQ 207


>gi|410941358|ref|ZP_11373157.1| TIGR00266 family protein [Leptospira noguchii str. 2006001870]
 gi|410783917|gb|EKR72909.1| TIGR00266 family protein [Leptospira noguchii str. 2006001870]
          Length = 224

 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 45/135 (33%), Positives = 67/135 (49%), Gaps = 15/135 (11%)

Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG----FLRQK 157
           LA  LP D+      G +  Q  +FL S  +++    +D + + + G I G    FLR  
Sbjct: 80  LAPTLPGDVERIELSGTVYVQSSSFLASSPNIE----MDTKFQGLKGFISGESLFFLR-- 133

Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
           L+G GL  I + G +   N++ GE+I VD   IVA    +N ++   G  +    GG+ L
Sbjct: 134 LSGNGLLLISSYGGIDLLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGL 191

Query: 218 VTAVVTGPGIVFIQS 232
           V A   G G V+IQS
Sbjct: 192 V-ARFKGKGKVWIQS 205


>gi|284044779|ref|YP_003395119.1| hypothetical protein Cwoe_3326 [Conexibacter woesei DSM 14684]
 gi|283949000|gb|ADB51744.1| protein of unknown function DUF124 [Conexibacter woesei DSM 14684]
          Length = 225

 Score = 45.8 bits (107), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 62/220 (28%), Positives = 89/220 (40%), Gaps = 24/220 (10%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V +  L P E + A  G+M   SG + +E         G+ + + G     V        
Sbjct: 13  VARCTLAPGETMRAESGAMMATSGGVGVEAKMQGGLMKGLKRSMLGGESLFVTTFTAPSE 72

Query: 94  DGFVGIAAPSLARILPIDLAMF--GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
            G+V  AA      LP D+A+      L     A+LCS   V++ +T     +N++GG  
Sbjct: 73  GGWVDCAAN-----LPGDVAVLEIADALNISRGAWLCSSAGVEL-DTKWGGFKNLMGGEG 126

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRA 210
           GFL    TG G A +   G++    L  GE + +D   +VA   SV     K    +   
Sbjct: 127 GFLIHA-TGSGHAVVACYGALDTVELSAGEALVIDSGHVVAFDPSVQFTTRKVTKGMMAT 185

Query: 211 VFGGDNLVTAVVTGPGIVFIQS-------------LPFHR 237
           +  G+ LV    TGPG V  QS             LPF R
Sbjct: 186 LKSGEGLVME-FTGPGRVLTQSRNPGALISWLTTVLPFSR 224


>gi|421097931|ref|ZP_15558608.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200901122]
 gi|410799003|gb|EKS01086.1| TIGR00266 family protein [Leptospira borgpetersenii str. 200901122]
          Length = 221

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 53/203 (26%), Positives = 94/203 (46%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  + +E          + +  FG ++    + + P  G
Sbjct: 14  LKVKLLPGQTIKAEAGAMVYMTPGVNVETKMGSGFLSAISRRFFGGESFFFNIFKAPSGG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ +DL   G  L+ +  A+L S   + + +     +R+  GG 
Sbjct: 74  AELGF----APELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKSMFGG-LRSFFGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    ++ G G  F+ A G ++   ++V    T+D   +VA   S+  +I K  G  + 
Sbjct: 126 EGIFLLEILGNGKLFLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGNWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ LV    TG G V IQ+
Sbjct: 184 TLFGGEGLVMEF-TGHGKVLIQT 205


>gi|451338897|ref|ZP_21909424.1| hypothetical protein C791_6527 [Amycolatopsis azurea DSM 43854]
 gi|449418388|gb|EMD23976.1| hypothetical protein C791_6527 [Amycolatopsis azurea DSM 43854]
          Length = 225

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 55/208 (26%), Positives = 100/208 (48%), Gaps = 14/208 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGP 92
           V ++ML P E      G+M   S  +++++        G+ + +L G++        P  
Sbjct: 13  VARLMLAPGEPAQVESGAMMATSYGVQVQSQAQGGIMKGLGRAFLSGESFFISTYTAP-Q 71

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
           + G+V +AA     +  I+L    G  + +  ++L S + V+ + T    ++N++GG  G
Sbjct: 72  NGGWVDVAANLPGDMQVINLDGRTGWCVTR-GSWLASSHSVQ-TETKWGGLKNLVGGEGG 129

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
           FL    TGQG   +   G++    L+ GEVIT+D   +VA   +V  Q      IR+   
Sbjct: 130 FLTHA-TGQGPLVVACYGALETVTLQQGEVITIDTGHVVAFADTVQYQ------IRKVAT 182

Query: 213 GGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
           G   ++ ++ +G G+VF  + P   L+Q
Sbjct: 183 G---VIQSMKSGEGLVFDFAGPGQLLTQ 207


>gi|312865613|ref|ZP_07725838.1| TIGR00266 family protein [Streptococcus downei F0415]
 gi|311098881|gb|EFQ57100.1| TIGR00266 family protein [Streptococcus downei F0415]
          Length = 248

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 51/223 (22%), Positives = 96/223 (43%), Gaps = 10/223 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMEN-IYIPENEVGMWQWLFGKTITS----VVLR 88
           + +I LK  +    + GSM F + S+ +   +    N +G      G+++TS     + +
Sbjct: 18  LVEIFLKAGQSAYIQRGSMVFHTPSVTLNTQLNAKGNGLGKLFRAVGRSMTSGESAFITQ 77

Query: 89  NPGPSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
               +DG     APS+  +I+ ++L     +      AFL      + +       R   
Sbjct: 78  ALSNADGGRLAIAPSMPGQIIALELG--SKQYRLNDGAFLALDGSTQYTMERQSLGRAFF 135

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
           GG  G       GQG   + + GS+ +  L   E +T+D + +VA +  ++ QI      
Sbjct: 136 GGQGGLFVMSTQGQGTLLVNSFGSIQKIELNQQE-MTIDNAHVVAWSKDLDYQIHLENGF 194

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
            +++  G+ +V     G G V++Q+L     +Q I   + SP+
Sbjct: 195 LQSIGNGEGIVN-TFRGTGEVYVQTLNIETFAQVIGGHLASPS 236


>gi|330922098|ref|XP_003299695.1| hypothetical protein PTT_10746 [Pyrenophora teres f. teres 0-1]
 gi|311326520|gb|EFQ92207.1| hypothetical protein PTT_10746 [Pyrenophora teres f. teres 0-1]
          Length = 395

 Score = 45.4 bits (106), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 49/189 (25%), Positives = 86/189 (45%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  + + L G  ++  V   PG     + +A   +
Sbjct: 203 LTAKPGAMIAMSPTMTLKGAV----KFSLKKALTGGDMSKSVYTGPGE----ILLAPACI 254

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I  I LA  G E      DAFL     + V+    Q +   +   EG    K++G G+
Sbjct: 255 GDITIIKLA--GKETWSVGRDAFLACTQGI-VTEYKSQGIGKGMFSGEGLFVYKISGTGV 311

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            +  + G++++K+L   E   +D   +VA      ++   +G I   +  G+ LV    T
Sbjct: 312 LWCTSFGAIIRKDLAANEKYIIDNGHLVAWNCKYVLERVASGGIISNLTAGEGLVCK-FT 370

Query: 224 GPGIVFIQS 232
           GPG VF+Q+
Sbjct: 371 GPGTVFMQT 379


>gi|121705236|ref|XP_001270881.1| DUF124 domain protein [Aspergillus clavatus NRRL 1]
 gi|119399027|gb|EAW09455.1| DUF124 domain protein [Aspergillus clavatus NRRL 1]
          Length = 336

 Score = 45.4 bits (106), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 86/188 (45%), Gaps = 11/188 (5%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           +VA+PG+M  MS S+ +        +  + +   G  ++      PG     + +A  +L
Sbjct: 143 IVAKPGAMIAMSHSVTLRGSI----KFSLKKMFAGGEMSLSTFTGPGE----LLLAPSTL 194

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
             I+ + L+      + + DAFL + + +          + V  G EG    K+TG GL 
Sbjct: 195 GDIIVMRLSGSDSWKVGK-DAFLAATSGISKDLQAQGLTKGVFSG-EGLFIYKMTGVGLL 252

Query: 165 FILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTG 224
           ++ + G++++K +   E   VD   +VA      ++   +G I   +  G+ L     TG
Sbjct: 253 WLQSFGAIIKKEIAEDESYFVDNGHLVAWNCKYRMERVASGGIISNIGSGEGLA-CRFTG 311

Query: 225 PGIVFIQS 232
           PGIV++Q+
Sbjct: 312 PGIVYMQT 319


>gi|257057405|ref|YP_003135237.1| hypothetical protein Svir_34460 [Saccharomonospora viridis DSM
           43017]
 gi|256587277|gb|ACU98410.1| uncharacterized conserved protein [Saccharomonospora viridis DSM
           43017]
          Length = 174

 Score = 45.1 bits (105), Expect = 0.031,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 5/139 (3%)

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G++ +A      + P++L+   G  +   DA L     V+  +      +++ GG  GFL
Sbjct: 36  GWIDLAPKHPGDVYPLELSGRLGWSVAH-DAVLARPATVRHDHPWPAH-QSLFGGDSGFL 93

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR-RAVFG 213
           +   +G G   + + G V    L+ GEV+TV+   ++A   SV V+++   P+  +++  
Sbjct: 94  KH-FSGTGPLVLASRGPVDSLTLKAGEVLTVNPLFVLAYPDSVQVRLRALDPVEPQSIRT 152

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G  L   V  GPG +F+Q+
Sbjct: 153 GAGLALDVA-GPGAIFVQT 170


>gi|402085536|gb|EJT80434.1| hypothetical protein GGTG_00433 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 446

 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           D +L S   V +     Q +   +   EG    K++G GL ++ + G++++K+L  GE  
Sbjct: 322 DGYLASTQGV-IKEYKRQSLGKAMFSGEGLWIYKISGNGLLWLTSFGAIIRKDLADGEKY 380

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   +VA  +   ++   +G +      G+ LV    TGPG VF+Q+
Sbjct: 381 VVDNGHLVAWDTKYILERVSSGGLVSGYASGEGLV-CKFTGPGTVFLQT 428


>gi|398333660|ref|ZP_10518365.1| hypothetical protein LalesM3_21091 [Leptospira alexanderi serovar
           Manhao 3 str. L 60]
          Length = 221

 Score = 45.1 bits (105), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 51/203 (25%), Positives = 94/203 (46%), Gaps = 16/203 (7%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNP--G 91
            ++ L P + + A  G+M +M+  +++E          + +  FG ++    + + P  G
Sbjct: 14  LKVKLSPGQTIKAEAGAMVYMTPGVDVETKMGSGFLSAISRRFFGGESFFFNIFKAPSGG 73

Query: 92  PSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              GF    AP L   ++ +DL   G  L+ +  A+L S   + + +     +R+  GG 
Sbjct: 74  AELGF----APELPGDVVGVDLTDTG--LIVEAGAYLASDETITMKSKFGG-LRSFFGG- 125

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
           EG    ++ G G  ++ A G ++   ++V    T+D   +VA   S+  +I K  G  + 
Sbjct: 126 EGIFLLEVLGNGKLYLNAYGGILP--IDVQGSYTIDTGHVVAFDKSLQYKITKAGGSWKS 183

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +FGG+ L     TG G V IQ+
Sbjct: 184 TLFGGEGLAMEF-TGHGKVLIQT 205


>gi|385815721|ref|YP_005852112.1| hypothetical protein [Lactobacillus delbrueckii subsp. bulgaricus
           2038]
 gi|385815782|ref|YP_005852173.1| hypothetical protein [Lactobacillus delbrueckii subsp. bulgaricus
           2038]
 gi|325125758|gb|ADY85088.1| Hypothetical conserved protein [Lactobacillus delbrueckii subsp.
           bulgaricus 2038]
 gi|325125819|gb|ADY85149.1| Hypothetical conserved protein [Lactobacillus delbrueckii subsp.
           bulgaricus 2038]
          Length = 233

 Score = 45.1 bits (105), Expect = 0.034,   Method: Compositional matrix adjust.
 Identities = 51/230 (22%), Positives = 100/230 (43%), Gaps = 18/230 (7%)

Query: 23  IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + +++ GG    +  + L   E +    G+M +M        +   +  +G      G++
Sbjct: 1   MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60

Query: 82  ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
           +TS     + +  G +DG  +G+A      I+ + +   G +  C    AFL S + V  
Sbjct: 61  LTSGESMFITQATGDADGGQIGVAPAIPGEIVKLSV---GKQQYCLNTGAFLASDDSVSY 117

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                +  +   GG  GF   +  G+G   + A G +V+  +   + I++D   ++A  +
Sbjct: 118 KMRSQKLSKAAFGGTGGFFVMETEGEGDMLVNAFGDLVELTVTSDKPISIDNEHVIAWDA 177

Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +++  IK    +   +FG   G+ LV    TG G V IQ+   H L++ +
Sbjct: 178 NLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVLIQTRNLHSLAEAL 222


>gi|15895995|ref|NP_349344.1| hypothetical protein CA_C2738 [Clostridium acetobutylicum ATCC 824]
 gi|337737948|ref|YP_004637395.1| hypothetical protein SMB_G2773 [Clostridium acetobutylicum DSM
           1731]
 gi|384459459|ref|YP_005671879.1| hypothetical protein CEA_G2747 [Clostridium acetobutylicum EA 2018]
 gi|15025774|gb|AAK80684.1|AE007771_4 Uncharacterized conserved protein [Clostridium acetobutylicum ATCC
           824]
 gi|325510148|gb|ADZ21784.1| Conserved hypothetical protein [Clostridium acetobutylicum EA 2018]
 gi|336291390|gb|AEI32524.1| hypothetical protein SMB_G2773 [Clostridium acetobutylicum DSM
           1731]
          Length = 218

 Score = 45.1 bits (105), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 33/108 (30%), Positives = 58/108 (53%), Gaps = 3/108 (2%)

Query: 125 AFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVIT 184
           +FL S + + + N      +  + G EGF++ +  G G+ F+ A G++ +  L  GE   
Sbjct: 102 SFLASTDKINL-NIKSGGGKGFLSG-EGFIQIEAEGSGILFLSAYGAIHEITLSQGEKYI 159

Query: 185 VDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           VD + +V   SS+N +++    +  ++ GG+ LV  V  GPG + IQS
Sbjct: 160 VDTNHLVLWESSMNYKVELLNGLFGSITGGEGLV-CVFEGPGKMLIQS 206


>gi|189188070|ref|XP_001930374.1| hypothetical protein PTRG_00041 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187971980|gb|EDU39479.1| hypothetical protein PTRG_00041 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 392

 Score = 45.1 bits (105), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 91/206 (44%), Gaps = 13/206 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG+M  MS +M ++       +  + + L G  ++  V   PG     + +A  S+
Sbjct: 199 LTAKPGAMIAMSPTMTLKGAV----KFSLKKVLTGGDMSKSVYTGPG----EILLAPSSI 250

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I  I LA  G E      D FL     + V+    Q +   +   EG    K++G G+
Sbjct: 251 GDITIIKLA--GKETWSVGRDGFLACTQGI-VTEYKSQGIGKGMFSGEGLFVYKISGTGV 307

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            +  + G++++K+L   E   +D   +VA      ++   +G I   +  G+ LV    T
Sbjct: 308 LWCTSFGAIIRKDLAANEKYIIDNGHLVAWNCKYVLERIASGGIISNLTAGEGLVCK-FT 366

Query: 224 GPGIVFIQSLPFHRLSQRIARAVTSP 249
           GPG VF+Q+       Q +A    +P
Sbjct: 367 GPGTVFMQTRNAVAFGQWLAAHSAAP 392


>gi|104774038|ref|YP_619018.1| hypothetical protein Ldb1059 [Lactobacillus delbrueckii subsp.
           bulgaricus ATCC 11842]
 gi|103423119|emb|CAI97861.1| Conserved hypothetical protein [Lactobacillus delbrueckii subsp.
           bulgaricus ATCC 11842]
          Length = 233

 Score = 45.1 bits (105), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 50/230 (21%), Positives = 100/230 (43%), Gaps = 18/230 (7%)

Query: 23  IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + +++ GG    +  + L   E +    G+M +M        +   +  +G      G++
Sbjct: 1   MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60

Query: 82  ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
           +TS     + +  G +DG  +G+A      I+ + +   G +  C    AFL S + V  
Sbjct: 61  LTSGESMFITQATGDADGGQIGVAPAIPGEIVKLSV---GKQQYCLNTGAFLASDDSVSY 117

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                +  +   GG  GF   +  G+G   + A G +V+  +   + +++D   ++A  +
Sbjct: 118 KMRSQKLSKAAFGGTGGFFVMETEGEGDMLVNAFGDLVELTVTSDKPLSIDNEHVIAWDA 177

Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +++  IK    +   +FG   G+ LV    TG G V IQ+   H L++ +
Sbjct: 178 NLDYDIK----VASGIFGFTSGEGLVNH-FTGDGKVLIQTRNLHSLAEAL 222


>gi|418030065|ref|ZP_12668581.1| hypothetical protein LDBUL1632_01375 [Lactobacillus delbrueckii
           subsp. bulgaricus CNCM I-1632]
 gi|354688283|gb|EHE88325.1| hypothetical protein LDBUL1632_01375 [Lactobacillus delbrueckii
           subsp. bulgaricus CNCM I-1632]
          Length = 233

 Score = 45.1 bits (105), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 51/230 (22%), Positives = 100/230 (43%), Gaps = 18/230 (7%)

Query: 23  IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + +++ GG    +  + L   E +    G+M +M        +   +  +G      G++
Sbjct: 1   MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60

Query: 82  ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
           +TS     + +  G +DG  +G+A      I+ + +   G +  C    AFL S + V  
Sbjct: 61  LTSGESMFITQATGDADGGQIGVAPAIPGEIVKLSV---GKQQYCLNTGAFLASDDSVSY 117

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                +  +   GG  GF   +  G+G   + A G +V+  +   + I++D   ++A  +
Sbjct: 118 KMRSQKLSKAAFGGTGGFYVMQTEGEGDMLVNAFGDLVELTVTSDKPISIDNEHVIAWDA 177

Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +++  IK    +   +FG   G+ LV    TG G V IQ+   H L++ +
Sbjct: 178 NLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVLIQTRNLHSLAEAL 222


>gi|383830129|ref|ZP_09985218.1| TIGR00266 family protein [Saccharomonospora xinjiangensis XJ-54]
 gi|383462782|gb|EID54872.1| TIGR00266 family protein [Saccharomonospora xinjiangensis XJ-54]
          Length = 225

 Score = 44.7 bits (104), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 54/215 (25%), Positives = 96/215 (44%), Gaps = 10/215 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V +++L P E      G+M   S  M M+         G+ + + G     V      P+
Sbjct: 13  VARLLLAPHEPAQVESGAMVATSYGMHMQASTQGGVMKGLGRAILGGESLFVSTYTAPPN 72

Query: 94  DGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
            G+V +A   P   RI+ +D  +          ++L + + +++ +T      N+ GG  
Sbjct: 73  GGWVDVAPGLPGDVRIIEMDGRV---GWCVTRGSWLANSHGIQL-DTKWGGFGNLFGGEG 128

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRA 210
           GFL   + GQG   +   G++   NL+ GE +T+D   +VA   +V  Q+ K +  + ++
Sbjct: 129 GFLTHAV-GQGQLVVACYGAIDVVNLQPGEYVTIDSGHVVAYADTVQSQLRKVSQGVIQS 187

Query: 211 VFGGDNLVTAVVTGPGIVFIQSL-PFHRLSQRIAR 244
           +  G+  V     GPG +  Q+  P   L+  IAR
Sbjct: 188 LKSGEGFVFD-FAGPGQILTQTRNPDALLAWLIAR 221


>gi|443311725|ref|ZP_21041350.1| TIGR00266 family protein [Synechocystis sp. PCC 7509]
 gi|442778298|gb|ELR88566.1| TIGR00266 family protein [Synechocystis sp. PCC 7509]
          Length = 230

 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 45/197 (22%), Positives = 83/197 (42%), Gaps = 8/197 (4%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSDG 95
           + L P E + A  G+M  M   + M   +      G+ +  FG +++   V  N      
Sbjct: 26  LTLNPGETITAEAGAMTSMDSQLSMRTEFSGGLIPGLLKKFFGGESLFVNVFTNQTKDPL 85

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
            + +   ++  I  IDL     E+  QP A++     + +            G  EG  +
Sbjct: 86  KLTLTQSTIGDIDRIDLN--NSEICFQPGAYIAHTPGINMGVQWAGFASFFAG--EGLFK 141

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGD 215
            +L+G+G  F  A G + +K   +     VD   +VA    + + IK  G +  +V  G+
Sbjct: 142 LRLSGKGRVFFGAYGGITKK--RISNEFIVDSGHLVAYEPQIKMSIKLAGGLWGSVTSGE 199

Query: 216 NLVTAVVTGPGIVFIQS 232
            +V   ++G G +++QS
Sbjct: 200 GIVNR-LSGQGEIYLQS 215


>gi|431930110|ref|YP_007243156.1| hypothetical protein Thimo_0687 [Thioflavicoccus mobilis 8321]
 gi|431828413|gb|AGA89526.1| TIGR00266 family protein [Thioflavicoccus mobilis 8321]
          Length = 447

 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 70/157 (44%), Gaps = 15/157 (9%)

Query: 98  GIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI--- 150
           G  A  LA   P D+A      G  L+ Q   FL     V+V      +V+N+  G+   
Sbjct: 294 GPGAVYLAPASPGDIAAVEVRPGDGLVIQRGGFLACTEGVEVGT----QVQNIAEGLFSG 349

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
           EGF   K  G+GL  + + G++ + NL VGE   VD   +VA + +++  ++       A
Sbjct: 350 EGFFVLKAHGEGLVLLESFGAIHELNLAVGEQKIVDNGHLVAWSQAMHYDLELGSRGLVA 409

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
            F     +     GPG + +Q+    R  Q+  R ++
Sbjct: 410 AFTSGEKIVCRFHGPGRILVQT----RQPQQFGRWIS 442


>gi|422843752|ref|ZP_16890462.1| hypothetical protein HMPREF5505_0127 [Lactobacillus delbrueckii
           subsp. lactis DSM 20072]
 gi|325686164|gb|EGD28214.1| hypothetical protein HMPREF5505_0127 [Lactobacillus delbrueckii
           subsp. lactis DSM 20072]
          Length = 246

 Score = 44.7 bits (104), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 52/235 (22%), Positives = 101/235 (42%), Gaps = 18/235 (7%)

Query: 18  PQDAVIPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQW 76
            Q  ++ +++ GG    +  + L   E +    G+M +M        +   +  +G    
Sbjct: 9   KQREIMKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLG 68

Query: 77  LFGKTITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSV 130
             G+++TS     + +  G +DG  +G+A     +IL + +   G +  C    AFL S 
Sbjct: 69  AIGRSLTSGESMFITQATGDADGGQIGVAPAIPGKILKLSV---GKQQYCLNTGAFLASD 125

Query: 131 NDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCI 190
           + V          +   GG  GF   +  G+G   + A G +V+  +   + I++D   +
Sbjct: 126 DSVSYKMKSQSFSKAAFGGTGGFYVMQTEGKGDMLVNAFGDLVELTVTSNKPISIDNEHV 185

Query: 191 VAVTSSVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +A  ++++  IK    +   +FG   G+ LV    TG G V IQ+     L++ +
Sbjct: 186 IAWDANLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVIIQTRNLRSLAEAL 235


>gi|332707812|ref|ZP_08427839.1| conserved hypothetical protein TIGR00266 [Moorea producens 3L]
 gi|332353515|gb|EGJ33028.1| conserved hypothetical protein TIGR00266 [Moorea producens 3L]
          Length = 219

 Score = 44.7 bits (104), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 49/198 (24%), Positives = 85/198 (42%), Gaps = 10/198 (5%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFG-KTITSVVLRNPGPSDG 95
           + L P E + A  G+M  M   + M+  +      G+ +  FG +++      N      
Sbjct: 16  VTLDPGESITAEAGAMTSMDAQLTMKTQFSGGFFSGLLKKFFGGESLLVNTFSNQTQQPL 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
            + +    +  I  ++L   G   LC QP A++C    VK+            G  EG  
Sbjct: 76  NLVLTQSIIGDIAGVEL---GERSLCFQPGAYICHTTGVKLGVRWAGFKSWFSG--EGLF 130

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
           + K++G+G  F  A G + +K +  GE I VD   +VA    + + I   G +  +V  G
Sbjct: 131 KLKVSGKGTVFFGAYGGLTRKQIN-GEFI-VDTGHLVAYEPGIMMNIGLAGGLLGSVTSG 188

Query: 215 DNLVTAVVTGPGIVFIQS 232
           +  V   + G G +++QS
Sbjct: 189 EGFVNR-LRGQGEIYLQS 205


>gi|443324897|ref|ZP_21053619.1| TIGR00266 family protein [Xenococcus sp. PCC 7305]
 gi|442795499|gb|ELS04864.1| TIGR00266 family protein [Xenococcus sp. PCC 7305]
          Length = 232

 Score = 44.3 bits (103), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 39/134 (29%), Positives = 65/134 (48%), Gaps = 14/134 (10%)

Query: 108 LPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
           +P D+A +    G  L+ Q   F+   + +++ NT  Q  +    G E     K TGQG 
Sbjct: 85  VPGDIAHYYVEPGHNLMVQSSGFVACASTIEI-NTQFQGFKGFFSG-ESLFLLKATGQGD 142

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR-----RAVFGGDNLV 218
            +  + G +++  +EV E   VD S IVA   S+N  ++  G +        +FGG+ LV
Sbjct: 143 FWFSSYGGILE--IEVSENYIVDTSYIVAFEDSLNYNVELIGGLSFNNLLTGIFGGEGLV 200

Query: 219 TAVVTGPGIVFIQS 232
               +G G ++IQ+
Sbjct: 201 CR-FSGKGRLWIQT 213


>gi|322692627|gb|EFY84525.1| DUF124 domain protein [Metarhizium acridum CQMa 102]
          Length = 325

 Score = 44.3 bits (103), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 49/200 (24%), Positives = 93/200 (46%), Gaps = 14/200 (7%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
           +  + L+P   V ++ G+M  MSG+++++ NI     +  M +   G  ++      PG 
Sbjct: 123 ILNVNLQPGGTVRSKSGAMIHMSGTIQLQGNI-----KFSMKKLFTGGRMSESTYTGPG- 176

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG 152
               + +    +  I+ + +    G  + + D FL    DV V  T  Q V   +   E 
Sbjct: 177 ---RLALGPTLMGDIITLHVDGRTGWTVGK-DTFLACTPDV-VMETKAQGVSKSMFSGED 231

Query: 153 FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVF 212
               +++GQGL ++ + G+V + +L  GE   VD   +VA +    ++ K  G    ++ 
Sbjct: 232 LFVYRVSGQGLLWLTSFGAVDRLDLPPGEQHIVDNGHLVAWSCDYKIE-KAGGGAMSSLK 290

Query: 213 GGDNLVTAVVTGPGIVFIQS 232
            G+ LV    TGPG +++Q+
Sbjct: 291 TGEGLV-CRFTGPGAIYVQT 309


>gi|257388303|ref|YP_003178076.1| hypothetical protein Hmuk_2257 [Halomicrobium mukohataei DSM 12286]
 gi|257170610|gb|ACV48369.1| protein of unknown function DUF124 [Halomicrobium mukohataei DSM
           12286]
          Length = 240

 Score = 44.3 bits (103), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 49/214 (22%), Positives = 87/214 (40%), Gaps = 17/214 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           + L   E ++A PG+M   S ++ M+     E  +   + + G         N   ++G 
Sbjct: 16  VELGSGETILAEPGAMTTHSSNISMDTTTSNEGLLSSAKSMLGG---ESFFANEFTAEG- 71

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
            G    +LA   P D+  +  EL    D  + +V+   +++     +    GG++  L  
Sbjct: 72  -GTGTVTLAPPTPGDVMQY--EL---SDETIYAVDGAWLASEPSISIETEFGGLKSMLAG 125

Query: 157 ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
                  L G G  F+ A G +    L  GE   VD   +VA   SV+   +    ++  
Sbjct: 126 ASITPLTLEGTGTVFVEAFGGLESIELGAGETYHVDNENVVAWDGSVDFDARRPDGVKST 185

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           +  G+ LV    TGPG V+ Q+   +  +  +A 
Sbjct: 186 LLSGEGLVFE-FTGPGTVWYQTRGLNAFASAVAE 218


>gi|46137297|ref|XP_390340.1| hypothetical protein FG10164.1 [Gibberella zeae PH-1]
          Length = 477

 Score = 44.3 bits (103), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 55/109 (50%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DAFL S   V V +   Q +   +   EG    K++G GL +I + G++++K L   E  
Sbjct: 354 DAFLASTQGV-VKDHKRQGLGKAMFSGEGLWVYKVSGTGLLWISSFGAIIKKTLVENEKY 412

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   ++A  +   ++   +G I   +  G+ LV    TGPG V++Q+
Sbjct: 413 IVDNGHLIAWDTKYVMERVASGGIISGLASGEGLV-CKFTGPGTVYMQT 460


>gi|145493977|ref|XP_001432983.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124400099|emb|CAK65586.1| unnamed protein product [Paramecium tetraurelia]
          Length = 231

 Score = 44.3 bits (103), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 58/271 (21%), Positives = 114/271 (42%), Gaps = 57/271 (21%)

Query: 16  QSPQDAV---IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEME---NIYIPEN 69
           + PQ+ +   I F I+G    V Q++L P E++  +   + + S ++ ++   + +I EN
Sbjct: 2   EYPQNLLTQGIKFSIIGDN--VVQVVLLPGEQINTQNSYVQYYSDNISIQTKKSYFITEN 59

Query: 70  EVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCS 129
           +V              VL     +  +VGI+A +  +++ +D A+F   L+ +    + +
Sbjct: 60  QV-------------TVLNTSKDNIAYVGISAAN-GKVMILDAAIFNNYLIKESS--IIA 103

Query: 130 VNDV-------KVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEV 182
            ND           +T  + V            + LT Q L FI   G+V+ K+L  GE 
Sbjct: 104 ANDFTEVGQFFHFQSTYQKIV----------FTKDLTKQQLIFIKTNGTVIDKDLGDGES 153

Query: 183 ITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           I V  + I+ +   V  ++      +  + G           PG      + F   +Q +
Sbjct: 154 IIVPNNSILGIQDFVRYEVNLKDTTKIKLIG-----------PG-----RILFEINNQTL 197

Query: 243 ARAVTSPNMRENPKFFVQIAIFFFLAYVVIV 273
              +     +   KFF+ +++ FFL +V ++
Sbjct: 198 ENNLGLVGRKYLAKFFMIMSMVFFLIFVELI 228


>gi|119485102|ref|ZP_01619487.1| hypothetical protein L8106_06629 [Lyngbya sp. PCC 8106]
 gi|119457330|gb|EAW38455.1| hypothetical protein L8106_06629 [Lyngbya sp. PCC 8106]
          Length = 221

 Score = 43.9 bits (102), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 94/201 (46%), Gaps = 19/201 (9%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIY-------IPENEVGMWQWLFGKTITSVVLRNPG 91
           L+P + ++A  G+M  M G + M+  +         +N +G  + LF  T T+   +   
Sbjct: 18  LQPGDSIIAEAGAMVGMEGKISMKTSFSGGFFSAFIKNLLG-GESLFVNTFTN---KTNQ 73

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
           P +  V ++  ++  I+ ++L   G EL  QP A++ S   VK+            G  E
Sbjct: 74  PLE--VVLSQATIGDIVAVELQR-GQELCFQPGAYIASSKPVKLGVGWAGFASWFAG--E 128

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G  + K+TG GL F  A G + QK  ++ +   VD   +VA    + + +   G +  ++
Sbjct: 129 GLFKLKVTGPGLVFFGAYGGITQK--QITQDFIVDNGHLVAYEPGIKMGVGLAGGLLGSL 186

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+  +   + G G++++QS
Sbjct: 187 TSGEGFINK-LRGNGVIYLQS 206


>gi|167042544|gb|ABZ07268.1| putative protein of unknown function DUF124 [uncultured marine
           microorganism HF4000_ANIW133F6]
 gi|167043072|gb|ABZ07783.1| putative protein of unknown function DUF124 [uncultured marine
           microorganism HF4000_ANIW141C7]
          Length = 229

 Score = 43.9 bits (102), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 43/165 (26%), Positives = 71/165 (43%), Gaps = 4/165 (2%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVG 98
           L P E++   PG+M   S  ++M++        G+ +   G     +     GP+ G+V 
Sbjct: 18  LDPNEQMKVEPGAMVAQSPGVDMKSGMSGGFFKGLKKMALGGESFILNTYTAGPNGGWVS 77

Query: 99  IAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKL 158
           IA  S   I   DLA  G  L  Q  AF+ S  +V+ ++T  Q  + +  G   F  +  
Sbjct: 78  IAPGSPGDIGSFDLAP-GENLFMQGGAFIASTANVE-TDTKFQGAKGLFSGESAFFLKAS 135

Query: 159 T--GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           T  G G  F  A G++ +  +       VD   +VA +  V  ++
Sbjct: 136 TTGGAGTVFYSAYGAIKEIEITPATPAIVDTGHVVAFSDGVGYKL 180


>gi|391873276|gb|EIT82329.1| hypothetical protein Ao3042_00482 [Aspergillus oryzae 3.042]
          Length = 326

 Score = 43.9 bits (102), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 56/229 (24%), Positives = 96/229 (41%), Gaps = 24/229 (10%)

Query: 13  YVYQSPQDAVIPFQILGGEAQVCQ--------IMLKPQEKVVARPGSMCFMSGSMEMENI 64
           YV  S QD V  F   GG  ++          + L     + A+PG M  MSG++ +   
Sbjct: 100 YVPASNQDEVGTFN--GGSFRISHRDTNSVLTLQLAMGCPIEAKPGVMIAMSGNISLR-- 155

Query: 65  YIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQP 123
                  G  ++ F K +   +  +     G + +A P L  I  I L + G E      
Sbjct: 156 -------GGAKFSFMKMLAGSMTFSTYTGPGELLLAPPFLGDI--IVLRLNGNESWKVGK 206

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           D FL + + ++         + V  G EGF+   ++G GL ++ + G+V++K++  GE  
Sbjct: 207 DGFLAATSGIEKDYKSQGLTKAVFSG-EGFIIYHMSGVGLLWLQSFGAVIRKDIAEGETY 265

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   +VA      +    +G +       + L      GPG V++Q+
Sbjct: 266 LVDNGYLVAWNCKYKMTRAASGGMWSTYSSAEGLACK-FEGPGTVYLQT 313


>gi|428207315|ref|YP_007091668.1| hypothetical protein Chro_2307 [Chroococcidiopsis thermalis PCC
           7203]
 gi|428009236|gb|AFY87799.1| protein of unknown function DUF124 [Chroococcidiopsis thermalis PCC
           7203]
          Length = 225

 Score = 43.9 bits (102), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 54/207 (26%), Positives = 92/207 (44%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG----MWQWLFGKTITSVV 86
           ++ +  I +   E++VA  GSM  MS  +            G    + + + G+++   +
Sbjct: 10  DSAIAHIRMSAGEELVAEAGSMIAMSSYINASTTLRQGKSGGILGGLKRMMAGESLFLSI 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P   DG V +A   +  IL   ++ FG  L+ Q  ++L S  +V + N   Q  R+ 
Sbjct: 70  FRSP-THDGEVFLAPKLMGDILIYQMSSFG--LVVQASSYLASEANVDI-NVGFQGFRSF 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
             G E      + G G   + + G++ +  ++ GE I VD   IVA   S+N  I K   
Sbjct: 126 FSG-ESIFWLDIGGSGQVILTSFGAIYEILVD-GEYI-VDTGHIVAFEKSLNFSITKVGS 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            +  A  GG+ L      G G +F Q+
Sbjct: 183 SLIGAFLGGEGLACR-FQGKGRLFCQT 208


>gi|448739945|ref|ZP_21721938.1| hypothetical protein C451_20492 [Halococcus thailandensis JCM
           13552]
 gi|445797968|gb|EMA48404.1| hypothetical protein C451_20492 [Halococcus thailandensis JCM
           13552]
          Length = 205

 Score = 43.5 bits (101), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 34/119 (28%), Positives = 50/119 (42%), Gaps = 7/119 (5%)

Query: 148 GGIEGFLRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           GG+E  L         L G G  FI   G +   +L+ GE   VD   +VA  SSV    
Sbjct: 87  GGLESMLSGASLTPLALNGTGSVFIETFGGIETIDLDHGESYVVDNDHVVAWESSVEFDA 146

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNMRENPKFFVQ 260
           +  G +   V  G+ LV    TGPG V+ Q+      ++ I  A+      +   F ++
Sbjct: 147 RRVGGLASTVLSGEGLVME-FTGPGTVWYQTRGLDTFTESIVEAMPGTGEDDGGGFNIE 204


>gi|302653267|ref|XP_003018462.1| hypothetical protein TRV_07527 [Trichophyton verrucosum HKI 0517]
 gi|291182110|gb|EFE37817.1| hypothetical protein TRV_07527 [Trichophyton verrucosum HKI 0517]
          Length = 191

 Score = 43.5 bits (101), Expect = 0.090,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 75/161 (46%), Gaps = 9/161 (5%)

Query: 73  MWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVN 131
           M + L G  ++S     PG       + APS+   + I L + G E      DAFL    
Sbjct: 18  MKKMLAGGEMSSSTFTGPGEL-----LLAPSMLGDISI-LRLNGQEQWSVGKDAFLACTQ 71

Query: 132 DVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIV 191
            V V +  +Q +       EG    K++G G+ F+ + G++++K+L  GE   VD   +V
Sbjct: 72  GV-VKDYKNQGLSKAFFSGEGLFVYKISGTGILFVQSFGAIIRKDLVDGEKYIVDNGHLV 130

Query: 192 AVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +      ++   +G +   +  G+ LV    TGPG +++Q+
Sbjct: 131 SWNCKYVMERVASGGVLSNLSSGEGLVCK-FTGPGTIYLQT 170


>gi|238484645|ref|XP_002373561.1| DUF124 domain protein [Aspergillus flavus NRRL3357]
 gi|220701611|gb|EED57949.1| DUF124 domain protein [Aspergillus flavus NRRL3357]
          Length = 131

 Score = 43.5 bits (101), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 53/110 (48%), Gaps = 2/110 (1%)

Query: 123 PDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEV 182
            DAFL   + V+         + V  G EGF+   ++G GL ++ + G++++K++  G+ 
Sbjct: 11  KDAFLAKTSGVEKDYKSQGMTKAVFSG-EGFIIYHMSGVGLVWLQSFGAIIRKDIPEGKT 69

Query: 183 ITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             VD   IVA      ++   +G +  A F     +     GPG V++Q+
Sbjct: 70  YLVDNGYIVAWNCKYKIERAASGGLLSA-FSSSEGLACKFEGPGTVYLQT 118


>gi|390629838|ref|ZP_10257830.1| TIGR00266 family protein [Weissella confusa LBAE C39-2]
 gi|390485010|emb|CCF30178.1| TIGR00266 family protein [Weissella confusa LBAE C39-2]
          Length = 237

 Score = 43.5 bits (101), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 47/204 (23%), Positives = 79/204 (38%), Gaps = 12/204 (5%)

Query: 39  LKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS-----VVLRNPG 91
           L P E V  + G+M + +G  E++         G+ + +   G+++ S     +     G
Sbjct: 18  LLPHETVRIQSGAMVYHTGDTELKGRLNANGGSGIGKLIRAAGRSMVSGESVFITEVTAG 77

Query: 92  PSDGFVGIA--APSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
              G + +A   P   + L +              AFL     V+ S       R   GG
Sbjct: 78  SQGGELALAPNVPGTIQALEVTADK---NYYLNDSAFLAMDGTVQYSMERQSLGRAFFGG 134

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
             G    K +G+G   + A GS+ + +L+     T+D + +VA  +S+N  I   G    
Sbjct: 135 QGGLFVMKTSGEGTLLVAAFGSIKEIDLQDAHDFTIDNAHVVAWETSLNYDIHLEGGGVI 194

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSL 233
              G    +     G G + IQSL
Sbjct: 195 GSIGTGEGIVNTFNGTGKILIQSL 218


>gi|359689649|ref|ZP_09259650.1| hypothetical protein LlicsVM_14727 [Leptospira licerasiae serovar
           Varillal str. MMD0835]
 gi|418749868|ref|ZP_13306156.1| TIGR00266 family protein [Leptospira licerasiae str. MMD4847]
 gi|418759263|ref|ZP_13315443.1| TIGR00266 family protein [Leptospira licerasiae serovar Varillal
           str. VAR 010]
 gi|384113754|gb|EIE00019.1| TIGR00266 family protein [Leptospira licerasiae serovar Varillal
           str. VAR 010]
 gi|404274753|gb|EJZ42071.1| TIGR00266 family protein [Leptospira licerasiae str. MMD4847]
          Length = 253

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 53/206 (25%), Positives = 94/206 (45%), Gaps = 20/206 (9%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           + ++ L P + + +  G+M +MS  M +E          + + +FG             S
Sbjct: 45  LLKLRLGPGQSIKSEAGAMVYMSSRMGVETKMGSGFLSALSRKIFGGESFFFNTYTAPDS 104

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
            G +G+A P L   + IDL + G  +  Q  ++L S   ++V +          GGI   
Sbjct: 105 GGEIGLA-PDLPGDI-IDLDLVGKSIFVQSGSYLASDPGIQVVSKF--------GGIRSL 154

Query: 154 LRQK------LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGP 206
           L  +      ++G G  F+ + G++V   ++V    TVD   IVA  +S+  ++ K  G 
Sbjct: 155 LGGEGLFLLEISGTGKVFLSSYGAIVP--IQVQGNYTVDTGHIVAFENSLQFKVGKAGGN 212

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
            +  + GG+ LV A  +G G ++IQS
Sbjct: 213 WKSTLLGGEGLV-ANFSGNGTLWIQS 237


>gi|119485103|ref|ZP_01619488.1| hypothetical protein L8106_06634 [Lyngbya sp. PCC 8106]
 gi|119457331|gb|EAW38456.1| hypothetical protein L8106_06634 [Lyngbya sp. PCC 8106]
          Length = 227

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 95/207 (45%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMS----GSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           +  + ++ L  +E+++A  G+M  MS     S  +          G+ + + G+++   V
Sbjct: 10  DCAIARVTLDTREELIAEAGAMVAMSEYINTSTTLRQGKGGGILGGLKRMVGGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P  + G V +A   L  IL   ++  G  L+ Q  ++L S +DV +     Q  +++
Sbjct: 70  FRSP-IAGGEVFLAPKLLGDILHYQVSEIG--LVVQATSYLASHSDVNIDLGF-QGFKSL 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
             G E      LTGQG   I + G + + ++E  +   VD   IVA   ++N +I   G 
Sbjct: 126 FSG-ENIFWLDLTGQGDVLINSFGGIYEVDVE--DEYIVDTGHIVAFEKTLNFEITKPGS 182

Query: 207 IRRAVF-GGDNLVTAVVTGPGIVFIQS 232
                F GG+ LV     G G ++ Q+
Sbjct: 183 SWLGAFLGGEGLVCR-FRGQGKLYCQT 208


>gi|429858192|gb|ELA33021.1| duf124 domain-containing protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 442

 Score = 43.5 bits (101), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 87/189 (46%), Gaps = 13/189 (6%)

Query: 45  VVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSL 104
           + A+PG M  MS S+ +      + +  M + + G  + S     PG     + +A P L
Sbjct: 248 LTAKPGVMVAMSHSITLRG----QIKFSMKKLVAGAELASSTFVGPG----ELLLAPPML 299

Query: 105 ARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
             I    L + G E      DA++ S   V + +   Q +   I   EG    K++G GL
Sbjct: 300 GDI--TSLRLTGKETWSVGQDAYVASTQGV-IKDYKRQGLSKAIFSGEGLYVYKISGTGL 356

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            ++ + G++++K+L  GE   VD   +VA      ++   +G I   +  G+ LV    T
Sbjct: 357 LWLTSFGAIIRKDLMEGEKYVVDNGHLVAWNVKYIMERVASGGIISGISSGEGLVCK-FT 415

Query: 224 GPGIVFIQS 232
           GPG VF+Q+
Sbjct: 416 GPGTVFMQT 424


>gi|392570465|gb|EIW63638.1| DUF124-domain-containing protein [Trametes versicolor FP-101664
           SS1]
          Length = 238

 Score = 43.1 bits (100), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 52/215 (24%), Positives = 95/215 (44%), Gaps = 11/215 (5%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGF 96
           + L+P  +V A+ GSM  M  +++++     + +  + + L G  ++      PG     
Sbjct: 23  LQLEPGAEVKAKSGSMVAMDATVKIKG----KLKFSVAKLLSGDKMSESTFTGPGE---- 74

Query: 97  VGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQ 156
           V IA  +   I+PI L     +     DA+L     V   N      + ++ G E     
Sbjct: 75  VMIAPSTWGDIVPIYLDGHT-QWSVGKDAYLACTIGVTHKNKSQGVGKALLSG-EFLSAY 132

Query: 157 KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDN 216
            + G G+ +I + G++  + L+ GE   VD   +VA T++   +    G +  ++   ++
Sbjct: 133 NVIGAGIMWITSLGAIYSRTLQPGEQWIVDKGHLVAWTATCKDEDIDAGGLFSSMNTDED 192

Query: 217 LVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPNM 251
            V    TGPGIV+IQ+     L   I   V  P++
Sbjct: 193 SVCRF-TGPGIVYIQTRNPETLGGWINDRVNPPDV 226


>gi|332638675|ref|ZP_08417538.1| hypothetical protein WcibK1_08272 [Weissella cibaria KACC 11862]
          Length = 220

 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 46/209 (22%), Positives = 83/209 (39%), Gaps = 8/209 (3%)

Query: 41  PQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS-----VVLRNPGPS 93
           P E V  + G+M + +G  +++         G+ + L   G+ + S     +   + G  
Sbjct: 3   PNETVRIQSGAMVYHTGDTQLKGRLNANGGSGIGKLLRAAGRAMVSGESVFITEVSAGAK 62

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
            G + +A      I  +++   G        AFL     V+ S       R ++GG  G 
Sbjct: 63  GGDLALAPAVPGTIQALEVTP-GRNYYLNDSAFLAMDGTVQYSMERQSVGRAILGGQGGL 121

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
              K +G+G   + A GS+ + +L+     T+D + +VA  +S+   I   G    +  G
Sbjct: 122 FVMKTSGEGTLLVAAFGSIKEIDLQDAHDFTIDNAHVVAWETSLTYDIHLEGGGLISSIG 181

Query: 214 GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
               V     G G V +QSL     +  +
Sbjct: 182 TGEGVVNTFNGTGKVLVQSLNLENFANML 210


>gi|172036227|ref|YP_001802728.1| hypothetical protein cce_1312 [Cyanothece sp. ATCC 51142]
 gi|354553022|ref|ZP_08972329.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
 gi|171697681|gb|ACB50662.1| hypothetical protein cce_1312 [Cyanothece sp. ATCC 51142]
 gi|353554852|gb|EHC24241.1| protein of unknown function DUF124 [Cyanothece sp. ATCC 51472]
          Length = 219

 Score = 43.1 bits (100), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 37/142 (26%), Positives = 62/142 (43%), Gaps = 5/142 (3%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV-VLRNPGPSDG 95
           + L P EK+  + GS+  M G + +   +       + +  FG T   V  L N      
Sbjct: 16  VTLNPGEKITVKMGSIISMDGEVTVHTGFCGSWLSAIVRKCFGGTDIFVDYLTNETEEPI 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
            VG++  ++  I  IDL+   G +  QP   L     VK+ N        ++G  +G  +
Sbjct: 76  TVGLSQTTIGDIERIDLSQ--GPICLQPRVLLAYTKGVKIENHWGGFGSWLVG--DGLFK 131

Query: 156 QKLTGQGLAFILAGGSVVQKNL 177
            K  G+G  FI A G +++K +
Sbjct: 132 TKCKGKGRVFIGAYGGIIKKTI 153


>gi|385830079|ref|YP_005867892.1| hypothetical protein CVCAS_0496 [Lactococcus lactis subsp. lactis
           CV56]
 gi|418037761|ref|ZP_12676129.1| hypothetical protein LLCRE1631_00936 [Lactococcus lactis subsp.
           cremoris CNCM I-1631]
 gi|326406087|gb|ADZ63158.1| conserved hypothetical protein [Lactococcus lactis subsp. lactis
           CV56]
 gi|354694191|gb|EHE93879.1| hypothetical protein LLCRE1631_00936 [Lactococcus lactis subsp.
           cremoris CNCM I-1631]
          Length = 232

 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 56/222 (25%), Positives = 92/222 (41%), Gaps = 19/222 (8%)

Query: 32  AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS----- 84
           A + +I L   E++    G+M + + ++++E       + G+   L   G++ITS     
Sbjct: 11  APLVEISLDNNEEIQIESGAMVYHNAAVKLEGKMNSNGKGGLGDALKALGRSITSGESFF 70

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRV 143
           +   +   ++G + +A  SL  I  I   + G E       AFL S   V       +  
Sbjct: 71  ITKASAIGANGKISLAPASLGSIKEI---LVGKEQWNLNTGAFLVSEGGVHYIMERQKLD 127

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
             + GG  G    K  G G   I   G +V+  L+  E   VD   ++A T S++  I+ 
Sbjct: 128 GALFGGTGGLFVMKTQGSGKMLISGYGDIVEIALDGTEDFVVDNQHVLAWTESLSYSIE- 186

Query: 204 NGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
              +    FG   G+ LV     G G + IQS     L+Q I
Sbjct: 187 ---VASGTFGFKTGEGLVNK-FRGKGKILIQSRNVEALAQSI 224


>gi|297617355|ref|YP_003702514.1| hypothetical protein Slip_1176 [Syntrophothermus lipocalidus DSM
           12680]
 gi|297145192|gb|ADI01949.1| protein of unknown function DUF124 [Syntrophothermus lipocalidus
           DSM 12680]
          Length = 230

 Score = 43.1 bits (100), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 54/210 (25%), Positives = 86/210 (40%), Gaps = 21/210 (10%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEV-GMWQWLFGK----TITSV 85
           E  V +I L P E V+A PG+M  M+ ++E++  +     +  M +  FGK    T T  
Sbjct: 10  EYAVAKIKLLPGESVIAEPGAMIAMTPNLEVKAEFARGGVLKSMARSFFGKESFFTTTFT 69

Query: 86  VLRNPGPSDGFVGIAAPSLARILPIDL---AMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
            L          G A   L R  P D+   ++    +  Q  AF+     V V N     
Sbjct: 70  AL----------GEAELVLGRGQPGDMWCVSLNNETIYAQSGAFVACTPGVHV-NAGWGG 118

Query: 143 VRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
            R    G   FL + +TG GL ++   G++ +  L +GE   +D   +     +    + 
Sbjct: 119 ARGFFAGNMLFLLE-ITGTGLVWLATFGALRETTLGLGEDYVIDTGHVAGFHGNTMYTVT 177

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             G  +  +  G+  V     GPG V+ QS
Sbjct: 178 SAGGFKSFLTSGEGFVCR-FRGPGKVWTQS 206


>gi|449296420|gb|EMC92440.1| hypothetical protein BAUCODRAFT_27721 [Baudoinia compniacensis UAMH
           10762]
          Length = 373

 Score = 42.7 bits (99), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 53/109 (48%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DAF+     V          + +  G EG    K+ G G+ ++ + G++++K+L  GE  
Sbjct: 252 DAFMACTQGVTKDYKAQSLSKAMFSG-EGLFVYKIGGVGIVWVQSMGAIIRKDLREGEKY 310

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            +D   +VA   +  ++   +G +   +  G+ LV    TGPG VF+Q+
Sbjct: 311 IIDNGHLVAWNCNYVLERIASGGLVSTLSAGEGLVCKF-TGPGTVFLQT 358


>gi|398339049|ref|ZP_10523752.1| hypothetical protein LkirsB1_05507 [Leptospira kirschneri serovar
           Bim str. 1051]
 gi|418676896|ref|ZP_13238174.1| TIGR00266 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|418688238|ref|ZP_13249394.1| TIGR00266 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|418696512|ref|ZP_13257521.1| TIGR00266 family protein [Leptospira kirschneri str. H1]
 gi|418739949|ref|ZP_13296330.1| TIGR00266 family protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
 gi|421089058|ref|ZP_15549873.1| TIGR00266 family protein [Leptospira kirschneri str. 200802841]
 gi|421109871|ref|ZP_15570378.1| TIGR00266 family protein [Leptospira kirschneri str. H2]
 gi|421130951|ref|ZP_15591142.1| TIGR00266 family protein [Leptospira kirschneri str. 2008720114]
 gi|400322796|gb|EJO70652.1| TIGR00266 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. RM52]
 gi|409956041|gb|EKO14973.1| TIGR00266 family protein [Leptospira kirschneri str. H1]
 gi|410002179|gb|EKO52701.1| TIGR00266 family protein [Leptospira kirschneri str. 200802841]
 gi|410004898|gb|EKO58702.1| TIGR00266 family protein [Leptospira kirschneri str. H2]
 gi|410357755|gb|EKP04975.1| TIGR00266 family protein [Leptospira kirschneri str. 2008720114]
 gi|410737095|gb|EKQ81837.1| TIGR00266 family protein [Leptospira kirschneri serovar
           Grippotyphosa str. Moskva]
 gi|410753071|gb|EKR10043.1| TIGR00266 family protein [Leptospira kirschneri serovar Valbuzzi
           str. 200702274]
          Length = 224

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 65/135 (48%), Gaps = 15/135 (11%)

Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEG----FLRQK 157
           LA  LP D+      G +  Q  +FL S  +++    +D + + + G I G    FLR  
Sbjct: 80  LAPTLPGDVERIELSGTVYVQSSSFLASSPNIE----MDTKFQGLKGFISGESLFFLR-- 133

Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
           L+G G   I + G +   N++ GE+I VD   IVA    +N ++   G  +    GG+  
Sbjct: 134 LSGNGFLLISSYGGIDVLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGF 191

Query: 218 VTAVVTGPGIVFIQS 232
           V A   G G V+IQS
Sbjct: 192 V-ARFKGKGKVWIQS 205


>gi|222099271|ref|YP_002533839.1| hypothetical protein CTN_0297 [Thermotoga neapolitana DSM 4359]
 gi|221571661|gb|ACM22473.1| Putative uncharacterized protein [Thermotoga neapolitana DSM 4359]
          Length = 211

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 49/173 (28%), Positives = 76/173 (43%), Gaps = 20/173 (11%)

Query: 28  LGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI----- 82
           L G   + ++ L   E VVA PG+M +M G +E++   +     G+W+ L    +     
Sbjct: 7   LKGSYALLRVFLSGGESVVAEPGAMVYMRGPIEVQTSVMG----GVWKALKRAVLGGENF 62

Query: 83  --TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
              + V R  G   GF    AP L     ID+    G L  Q  ++L S   V++  +  
Sbjct: 63  FMNTYVSRGEG-EIGF----APQLPG--DIDVIPMNGTLYVQSTSYLASDPSVEMDVSFG 115

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
              +    G EG    KL G+G   + + G +    L+ GE ITVD   +VA+
Sbjct: 116 G-FKTFFAG-EGIFLLKLQGEGDVAVSSFGGIRSVTLQPGEEITVDTGHVVAL 166


>gi|375102749|ref|ZP_09749012.1| TIGR00266 family protein [Saccharomonospora cyanea NA-134]
 gi|374663481|gb|EHR63359.1| TIGR00266 family protein [Saccharomonospora cyanea NA-134]
          Length = 226

 Score = 42.7 bits (99), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 53/216 (24%), Positives = 96/216 (44%), Gaps = 11/216 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-WLFGKTITSVVLRNPGP 92
           V +++L P E      G+M   S  M M+         G+ + +L G+++       P P
Sbjct: 13  VARLLLAPNEPAQVESGAMVATSYGMHMQANTQGGVMKGLGRAFLGGESLFVSTYTAPPP 72

Query: 93  SDGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
           + G+V +A   P   R++ +D  +          ++L + + + +         N+ GG 
Sbjct: 73  NGGWVDVAGGLPGDIRVIEMDGRV---GWCVTRGSWLANSHGIHLETKWGG-FGNLFGGE 128

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRR 209
            GFL     GQG   +   G++   NL+ GE +T+D   +VA   +V  Q+ K +  + +
Sbjct: 129 GGFLTHA-QGQGQLVVSCYGAIDVVNLQPGEYVTIDSGHVVAYADTVQSQLRKVSQGVIQ 187

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSL-PFHRLSQRIAR 244
           ++  G+  V     GPG +  Q+  P   L+  IAR
Sbjct: 188 SLKSGEGFVFDFA-GPGQILTQTRNPDALLAWLIAR 222


>gi|126657121|ref|ZP_01728292.1| hypothetical protein CY0110_28484 [Cyanothece sp. CCY0110]
 gi|126621664|gb|EAZ92374.1| hypothetical protein CY0110_28484 [Cyanothece sp. CCY0110]
          Length = 219

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 37/151 (24%), Positives = 68/151 (45%), Gaps = 9/151 (5%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT---ITSVVLRNPGPS 93
           + L P EK+  + GS+  M G + ++  +    +  + +  FG T   + S++     P 
Sbjct: 16  VTLNPGEKITVKTGSLVSMDGEITVKTGFCGAWQSALLRKCFGGTDIFVDSLINETAEPI 75

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
              V ++  +   I  IDL+   G +  +P  FL     VK+ N         +G  +G 
Sbjct: 76  T--VVLSQTTTGDIERIDLSQ--GSICLRPGVFLAYTKGVKIENRWAGFGSWWVG--DGL 129

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVIT 184
            + +L G+G  FI A G +++K +    V+T
Sbjct: 130 FQTQLKGKGRVFIAAYGGIIKKTVYQDLVMT 160


>gi|67922246|ref|ZP_00515760.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 8501]
 gi|416389341|ref|ZP_11685340.1| hypothetical protein CWATWH0003_2162 [Crocosphaera watsonii WH
           0003]
 gi|67855949|gb|EAM51194.1| Protein of unknown function DUF124 [Crocosphaera watsonii WH 8501]
 gi|357264250|gb|EHJ13163.1| hypothetical protein CWATWH0003_2162 [Crocosphaera watsonii WH
           0003]
          Length = 219

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 74/165 (44%), Gaps = 7/165 (4%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSV-VLRNPGPSDG 95
           + LKP+E +  +  S+  M G + ++  +       + +  FG T   V  L N      
Sbjct: 16  VTLKPEETITVKTDSIISMDGKVTVKTKFFGFWLFALLRKCFGGTDVFVDYLINEKDYPI 75

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLR 155
            V ++  ++  I  IDL+   G +  QP  FL     VK+ N        + G  EG  +
Sbjct: 76  TVVLSQTTIGDIERIDLSE--GSICLQPGVFLAYTKGVKIKNHWAGFGSWLAG--EGLFK 131

Query: 156 QKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
            KL G+G  FI A G ++++   V + + +    ++A TS  +++
Sbjct: 132 TKLKGKGRVFIAAYGGIIKQT--VYQDLEMSQGHLLAYTSKTSLK 174


>gi|24215001|ref|NP_712482.1| hypothetical protein LA_2301 [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45657508|ref|YP_001594.1| hypothetical protein LIC11636 [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|386074340|ref|YP_005988657.1| hypothetical protein LIF_A1879 [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|417762747|ref|ZP_12410735.1| TIGR00266 family protein [Leptospira interrogans str. 2002000624]
 gi|417772197|ref|ZP_12420086.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|417777661|ref|ZP_12425478.1| TIGR00266 family protein [Leptospira interrogans str. 2002000621]
 gi|417783259|ref|ZP_12430982.1| TIGR00266 family protein [Leptospira interrogans str. C10069]
 gi|418667806|ref|ZP_13229211.1| TIGR00266 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|418671665|ref|ZP_13233014.1| TIGR00266 family protein [Leptospira interrogans str. 2002000623]
 gi|418680995|ref|ZP_13242232.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|418692429|ref|ZP_13253507.1| TIGR00266 family protein [Leptospira interrogans str. FPW2026]
 gi|418698038|ref|ZP_13259017.1| TIGR00266 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|418704255|ref|ZP_13265134.1| TIGR00266 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|418711851|ref|ZP_13272603.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|418717170|ref|ZP_13277007.1| TIGR00266 family protein [Leptospira interrogans str. UI 08452]
 gi|418726178|ref|ZP_13284789.1| TIGR00266 family protein [Leptospira interrogans str. UI 12621]
 gi|418730486|ref|ZP_13288980.1| TIGR00266 family protein [Leptospira interrogans str. UI 12758]
 gi|421085985|ref|ZP_15546836.1| TIGR00266 family protein [Leptospira santarosai str. HAI1594]
 gi|421102588|ref|ZP_15563192.1| TIGR00266 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|421117573|ref|ZP_15577933.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|421120883|ref|ZP_15581188.1| TIGR00266 family protein [Leptospira interrogans str. Brem 329]
 gi|421124191|ref|ZP_15584461.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|421135007|ref|ZP_15595137.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|24196046|gb|AAN49500.1|AE011400_5 conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. 56601]
 gi|45600747|gb|AAS70231.1| conserved hypothetical protein [Leptospira interrogans serovar
           Copenhageni str. Fiocruz L1-130]
 gi|353458129|gb|AER02674.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
           str. IPAV]
 gi|400327433|gb|EJO79685.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Kennewicki LC82-25]
 gi|400357662|gb|EJP13782.1| TIGR00266 family protein [Leptospira interrogans str. FPW2026]
 gi|409941492|gb|EKN87121.1| TIGR00266 family protein [Leptospira interrogans str. 2002000624]
 gi|409945568|gb|EKN95583.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Pomona]
 gi|409953960|gb|EKO08456.1| TIGR00266 family protein [Leptospira interrogans str. C10069]
 gi|409960088|gb|EKO23842.1| TIGR00266 family protein [Leptospira interrogans str. UI 12621]
 gi|410010907|gb|EKO69038.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
           str. Fiocruz LV133]
 gi|410020890|gb|EKO87685.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. Andaman]
 gi|410346221|gb|EKO97231.1| TIGR00266 family protein [Leptospira interrogans str. Brem 329]
 gi|410367702|gb|EKP23086.1| TIGR00266 family protein [Leptospira interrogans serovar
           Icterohaemorrhagiae str. Verdun LP]
 gi|410431550|gb|EKP75910.1| TIGR00266 family protein [Leptospira santarosai str. HAI1594]
 gi|410438678|gb|EKP87764.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. 2006006986]
 gi|410572648|gb|EKQ35713.1| TIGR00266 family protein [Leptospira interrogans str. 2002000621]
 gi|410581363|gb|EKQ49175.1| TIGR00266 family protein [Leptospira interrogans str. 2002000623]
 gi|410756251|gb|EKR17876.1| TIGR00266 family protein [Leptospira interrogans serovar Pyrogenes
           str. 2006006960]
 gi|410762742|gb|EKR28901.1| TIGR00266 family protein [Leptospira interrogans serovar Bataviae
           str. L1111]
 gi|410766235|gb|EKR36923.1| TIGR00266 family protein [Leptospira interrogans serovar Hebdomadis
           str. R499]
 gi|410767817|gb|EKR43078.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. UI 08368]
 gi|410774695|gb|EKR54699.1| TIGR00266 family protein [Leptospira interrogans str. UI 12758]
 gi|410787237|gb|EKR80971.1| TIGR00266 family protein [Leptospira interrogans str. UI 08452]
 gi|455667415|gb|EMF32736.1| TIGR00266 family protein [Leptospira interrogans serovar Pomona
           str. Fox 32256]
 gi|455792745|gb|EMF44485.1| TIGR00266 family protein [Leptospira interrogans serovar Lora str.
           TE 1992]
 gi|456821475|gb|EMF69981.1| TIGR00266 family protein [Leptospira interrogans serovar Canicola
           str. LT1962]
 gi|456986315|gb|EMG21911.1| TIGR00266 family protein [Leptospira interrogans serovar
           Copenhageni str. LT2050]
          Length = 224

 Score = 42.7 bits (99), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 7/131 (5%)

Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
           LA  LP D+      G +  Q  +FL S  ++++ +T  Q ++  I G E     +L+G 
Sbjct: 80  LAPTLPGDVERIELSGTVYVQSSSFLASSPNIEM-DTKFQGLKGFISG-ESLFFLRLSGN 137

Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
           G   I + G +   N++ GE+I VD   IVA    +N ++   G  +    GG+  V A 
Sbjct: 138 GFLLISSYGGIDVLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGFV-AR 194

Query: 222 VTGPGIVFIQS 232
             G G V+IQS
Sbjct: 195 FKGKGKVWIQS 205


>gi|310793723|gb|EFQ29184.1| hypothetical protein GLRG_04328 [Glomerella graminicola M1.001]
          Length = 456

 Score = 42.7 bits (99), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 52/187 (27%), Positives = 86/187 (45%), Gaps = 13/187 (6%)

Query: 47  ARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLAR 106
           A+PG M  MS S+ +      + +  M + + G  + S     PG     + +A P L  
Sbjct: 264 AKPGVMVAMSHSVTLRG----QIKFSMKKLVAGAEMASSTFVGPG----ELLLAPPMLGD 315

Query: 107 ILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAF 165
           +    L + G E      DA+L S   V + +   Q +   I   EG    K++G GL +
Sbjct: 316 V--TSLRLTGKETWSVGHDAYLASTQGV-IKDHKRQGLSKAIFSGEGLFVYKISGTGLLW 372

Query: 166 ILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGP 225
           + + G++++K+L  GE   VD   +VA      ++   +G I   +  G+ LV    TGP
Sbjct: 373 LTSFGAIIRKDLMDGEKYIVDNGHLVAWNVKYIMERVASGGIISGISSGEGLV-CKFTGP 431

Query: 226 GIVFIQS 232
           G VF+Q+
Sbjct: 432 GTVFMQT 438


>gi|325848476|ref|ZP_08170136.1| hypothetical protein HMPREF9246_1944 [Anaerococcus hydrogenalis
           ACS-025-V-Sch4]
 gi|325480704|gb|EGC83761.1| hypothetical protein HMPREF9246_1944 [Anaerococcus hydrogenalis
           ACS-025-V-Sch4]
          Length = 256

 Score = 42.4 bits (98), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 45/202 (22%), Positives = 88/202 (43%), Gaps = 26/202 (12%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
            C + + P   V ++ G+M +M G +E+ +       +G +   F K+I   V +     
Sbjct: 55  ACDLSISP---VTSQTGAMQWMVGDVELTSGI---KGIGDF---FSKSIKGSVTKESAVK 105

Query: 94  DGFVG----IAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
             + G    I  P+   I+ IDLA +   ++ +   FL   +++K        + + + G
Sbjct: 106 PEYRGSGRVILEPTYKHIILIDLAEWNNSIVLEDGYFLACTSNLKQKIKTRTNISSSLMG 165

Query: 150 IEGFLRQKLTGQGLAFI---LAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
            EG    KL G G+  +   +A   +++  LE  + I +D S  +A + S++  +K    
Sbjct: 166 NEGMFNLKLEGDGILALESPIAKDDLIEIELE-NDQIKIDGSYAIAWSDSLDFSVKK--- 221

Query: 207 IRRAVFGGDNLVTAVVTGPGIV 228
                    +LV++ +T  G V
Sbjct: 222 ------ASKSLVSSAMTEEGFV 237


>gi|124003941|ref|ZP_01688788.1| protein of unknown function [Microscilla marina ATCC 23134]
 gi|123990520|gb|EAY30000.1| protein of unknown function [Microscilla marina ATCC 23134]
          Length = 482

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 26/97 (26%), Positives = 57/97 (58%), Gaps = 5/97 (5%)

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRA 210
           EG + +KL G G+ F+ A G VV++ LE  + I V+++ +VA   S+ + +     +  +
Sbjct: 285 EGLVIEKLEGDGMVFLKAKGDVVERILE-DDAIRVNLASVVAFEPSIELDMDSIQSL-ES 342

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVT 247
           +   D ++ A+ +G G++++QS   ++++  I + +T
Sbjct: 343 LQDADPVILALFSGSGMLWLQS---NQMTPHIIQHIT 376


>gi|448562034|ref|ZP_21635167.1| hypothetical protein C457_07151 [Haloferax prahovense DSM 18310]
 gi|445720130|gb|ELZ71807.1| hypothetical protein C457_07151 [Haloferax prahovense DSM 18310]
          Length = 220

 Score = 42.4 bits (98), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 69/157 (43%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   APS       L     +D+A+  G ++ +  + +    D+          R  
Sbjct: 4   DEFVSTHAPSEGGEAFELENSKLLDVAL-DGSVMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG++G L++K TG+G   + A G+           V    L+ GE ++V+ + ++A  S
Sbjct: 56  AGGLKGMLKKKATGEGDVMMQATGTGHLYLADQAKEVQILELDAGEELSVNGNDVLAFES 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           SVN  IK    I  A  GG  L    + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGASSGG--LFNVFLEGPGHVAITT 150


>gi|421113342|ref|ZP_15573786.1| TIGR00266 family protein [Leptospira santarosai str. JET]
 gi|410801116|gb|EKS07290.1| TIGR00266 family protein [Leptospira santarosai str. JET]
 gi|456875150|gb|EMF90381.1| TIGR00266 family protein [Leptospira santarosai str. ST188]
          Length = 227

 Score = 42.4 bits (98), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 52/211 (24%), Positives = 91/211 (43%), Gaps = 31/211 (14%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-----WLFGKTITSVVLR 88
           + Q+ L+  E + A  G+M  MS S++M    + + E G+W       L G++      +
Sbjct: 13  IVQVQLEDGESIRAESGAMVAMSPSVKM----VTKAEGGIWASAKRALLSGESF----FQ 64

Query: 89  NPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
           N   ++   G+   + A    I+     GE L+    A++     + + +          
Sbjct: 65  NTFKAENGRGMIFLTSATQGDIEYRKMNGEDLILSKGAYVAGSESLLIDSKW-------- 116

Query: 148 GGIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           GG +GF         K++G G  F  + G++    +EV     VD   IV    S+N  I
Sbjct: 117 GGFKGFFSGEGLFFLKVSGVGDLFFSSFGAI--HTIEVNGQYVVDTGHIVGFEGSLNYTI 174

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  G ++     G+ LV AV +G G ++IQS
Sbjct: 175 QKVGGLKSLFLSGEGLV-AVFSGSGKLYIQS 204


>gi|456969821|gb|EMG10737.1| TIGR00266 family protein [Leptospira interrogans serovar
           Grippotyphosa str. LT2186]
          Length = 159

 Score = 42.4 bits (98), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 7/131 (5%)

Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
           LA  LP D+      G +  Q  +FL S  ++++ +T  Q ++  I G E     +L+G 
Sbjct: 15  LAPTLPGDVERIELSGTVYVQSSSFLASSPNIEM-DTKFQGLKGFISG-ESLFFLRLSGN 72

Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
           G   I + G +   N++ GE+I VD   IVA    +N ++   G  +    GG+  V A 
Sbjct: 73  GFLLISSYGGIDVLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGFV-AR 129

Query: 222 VTGPGIVFIQS 232
             G G V+IQS
Sbjct: 130 FKGKGKVWIQS 140


>gi|428769066|ref|YP_007160856.1| hypothetical protein Cyan10605_0674 [Cyanobacterium aponinum PCC
           10605]
 gi|428683345|gb|AFZ52812.1| protein of unknown function DUF124 [Cyanobacterium aponinum PCC
           10605]
          Length = 235

 Score = 42.4 bits (98), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 50/213 (23%), Positives = 92/213 (43%), Gaps = 24/213 (11%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVL------RNP 90
           + L+P +  +   G M  M  S++ME+        G +   FG+ +    L         
Sbjct: 22  LELEPNQTAIVEAGGMAAMDSSIKMES-----KMKGGFGQSFGRMLGGESLFLNEFTAQA 76

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
              + ++    P   +   +D +     L+ Q   F+ S  +V++++   Q  +    G 
Sbjct: 77  SQGELYISPGVPGDIQYYHLDGSK---GLMIQSSGFVASSKNVEINSNF-QGFKGFFSGE 132

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG----- 205
             FL  K TG+G  +  + G++V+ N+E   V  VD   IVA   ++N  ++  G     
Sbjct: 133 SIFLL-KATGKGDIWFSSYGAIVEINVENNYV--VDTGYIVAFEDTLNYNVEMIGGLSFR 189

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
            +R  + GG+ LV    +G G ++IQS   + L
Sbjct: 190 NLRTGILGGEGLVCR-FSGSGKLWIQSRGLYPL 221


>gi|55376854|ref|YP_134705.1| hypothetical protein pNG7303 [Haloarcula marismortui ATCC 43049]
 gi|55229579|gb|AAV44999.1| unknown [Haloarcula marismortui ATCC 43049]
          Length = 223

 Score = 42.0 bits (97), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 41/157 (26%), Positives = 76/157 (48%), Gaps = 22/157 (14%)

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           +R+  P+DG  G    +  R+L I L    G ++ +  + +    +V  +        +V
Sbjct: 7   VRSNAPADGGDGFQKEN-NRLLDIPLD---GTVMVKAGSMVAYTGEVTFTGK-----SSV 57

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS---VVQKN--------LEVGEVITVDVSCIVAVTS 195
            GGI GF+++ ++G+G   + A GS    V +N        L+ GE I+V+ + ++A  S
Sbjct: 58  EGGITGFVKEAVSGEGTPVMEAEGSGHLYVAENGKKVQVLALDDGESISVNGTDVLAFES 117

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +++ +I   G +     GG  L    +TGPG V + +
Sbjct: 118 TIDYEINTVGSLSGMAAGG--LTNVYLTGPGEVALTT 152


>gi|313125220|ref|YP_004035484.1| hypothetical protein Hbor_04410 [Halogeometricum borinquense DSM
           11551]
 gi|448287182|ref|ZP_21478398.1| hypothetical protein C499_10354 [Halogeometricum borinquense DSM
           11551]
 gi|312291585|gb|ADQ66045.1| conserved hypothetical protein TIGR00266 [Halogeometricum
           borinquense DSM 11551]
 gi|445572928|gb|ELY27458.1| hypothetical protein C499_10354 [Halogeometricum borinquense DSM
           11551]
          Length = 220

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 51/212 (24%), Positives = 89/212 (41%), Gaps = 6/212 (2%)

Query: 23  IPFQILGGEAQVC-QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + F+I  G +    +I     E+V   PGSM   S ++ +E     +   GM +      
Sbjct: 1   MQFEIADGHSYATLEIDFDEGERVGIEPGSMVTRSENVRVETTSGDDGIGGMLKRAVSDE 60

Query: 82  ITSVVLRNPGPSDGFVGIAAPS-LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLD 140
           I  +       SDG   + AP  L  I  ++L    G +  Q    L     V+   T  
Sbjct: 61  IDVMTTYFIAESDGAHALLAPDYLGDIARVELDGTEG-VKVQSGGLLAWSEGVE-RGTAR 118

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
               N     E     KL G+G AFI A G+V ++ +   + + VD   ++A T  ++  
Sbjct: 119 NEASNFFSSGE-LTVLKLDGKGSAFISAFGAVRKETVTAEDPLIVDEDHLLAWTDGLSAS 177

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            + +  +  ++ GG+  VT  ++G G  ++Q+
Sbjct: 178 RQKDSNLTSSLLGGEGFVTK-LSGEGSAWVQT 208


>gi|15672568|ref|NP_266742.1| hypothetical protein L178908 [Lactococcus lactis subsp. lactis
           Il1403]
 gi|12723478|gb|AAK04684.1|AE006292_4 conserved hypothetical protein [Lactococcus lactis subsp. lactis
           Il1403]
          Length = 232

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 56/222 (25%), Positives = 92/222 (41%), Gaps = 19/222 (8%)

Query: 32  AQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS----- 84
           A + +I L   E++    G+M + + ++++E       + G+   L   G++ITS     
Sbjct: 11  APLVEISLDNNEEIQIESGAMVYHNAAVKLEGKMNSNGKGGLRGALKALGRSITSGESFF 70

Query: 85  VVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELL-CQPDAFLCSVNDVKVSNTLDQRV 143
           +   +   ++G + +A  SL  I  I   + G E       AFL S   V       +  
Sbjct: 71  ITKASAIGANGKISLAPASLGSIKEI---LVGKEQWNLNTGAFLVSEGGVHYIMERQKLD 127

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
             + GG  G    K  G G   I   G +V+  L+  E   VD   ++A T S++  I+ 
Sbjct: 128 GALFGGTGGLFVMKTQGSGKMLISGYGDIVEIALDGTEDFVVDNQHVLAWTESLSYSIE- 186

Query: 204 NGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
              +    FG   G+ LV     G G + IQS     L+Q I
Sbjct: 187 ---VASGTFGFKTGEGLVNK-FRGKGKILIQSRNVEALAQSI 224


>gi|448390578|ref|ZP_21566201.1| hypothetical protein C477_08238 [Haloterrigena salina JCM 13891]
 gi|445666992|gb|ELZ19644.1| hypothetical protein C477_08238 [Haloterrigena salina JCM 13891]
          Length = 221

 Score = 42.0 bits (97), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 36/139 (25%), Positives = 63/139 (45%), Gaps = 21/139 (15%)

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
            ++L +DL    G ++ +  + +    D+       Q   +  GGI GFL++K TG+G  
Sbjct: 24  GKLLDVDLD---GTVIAKAGSMIAYDGDISF-----QGKSSAEGGITGFLKEKTTGEGTP 75

Query: 165 FILAGG-----------SVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
            + A G            +    L+  E I+V+ + ++A  SS++ +I+    I  A F 
Sbjct: 76  VMEASGRGHLYLADQEKKIQLLELDADEEISVNGNDVLAFESSIDYEIRTMNSI--AGFS 133

Query: 214 GDNLVTAVVTGPGIVFIQS 232
              L    +TGPG V I +
Sbjct: 134 AGGLTNVSLTGPGSVAITT 152


>gi|389846904|ref|YP_006349143.1| hypothetical protein HFX_1448 [Haloferax mediterranei ATCC 33500]
 gi|448615255|ref|ZP_21664180.1| hypothetical protein C439_03253 [Haloferax mediterranei ATCC 33500]
 gi|388244210|gb|AFK19156.1| hypothetical protein HFX_1448 [Haloferax mediterranei ATCC 33500]
 gi|445752519|gb|EMA03942.1| hypothetical protein C439_03253 [Haloferax mediterranei ATCC 33500]
          Length = 220

 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 63/139 (45%), Gaps = 23/139 (16%)

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
           +++L I L    G ++ +  + +    D+          R   GG++G L++K TG+G  
Sbjct: 24  SKLLDITLD---GSIMAKAGSMVSYTGDISFE-------RKSTGGLKGMLKKKATGEGEV 73

Query: 165 FILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
            + A GS           V    L+ GE I+V+ + ++A  +S++  IK    I  A  G
Sbjct: 74  MMEASGSGHLYLADQGKEVQILELDTGEEISVNGNDVLAFENSIDWDIKMMKSIAGASSG 133

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G  L    + GPG V I +
Sbjct: 134 G--LFNVFLEGPGHVAITT 150


>gi|448541097|ref|ZP_21623928.1| hypothetical protein C460_04280 [Haloferax sp. ATCC BAA-646]
 gi|448549482|ref|ZP_21628087.1| hypothetical protein C459_07270 [Haloferax sp. ATCC BAA-645]
 gi|448555405|ref|ZP_21631445.1| hypothetical protein C458_06224 [Haloferax sp. ATCC BAA-644]
 gi|445708259|gb|ELZ60099.1| hypothetical protein C460_04280 [Haloferax sp. ATCC BAA-646]
 gi|445712530|gb|ELZ64311.1| hypothetical protein C459_07270 [Haloferax sp. ATCC BAA-645]
 gi|445718150|gb|ELZ69853.1| hypothetical protein C458_06224 [Haloferax sp. ATCC BAA-644]
          Length = 220

 Score = 42.0 bits (97), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 68/157 (43%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   APS       L     +D+A+  G ++ +  + +    D+          R  
Sbjct: 4   DEFVSAHAPSEGGDSFELESSKLLDVAL-DGHVMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG++G L+QK TG+G   + A G+           V    L+ GE ++V+ + ++A  S
Sbjct: 56  AGGLKGMLKQKATGEGDVMMQATGTGHLYLADQGKEVQILELDAGEELSVNGNDVLAFES 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           SVN  IK    I     GG  L    + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150


>gi|427420413|ref|ZP_18910596.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
 gi|425763126|gb|EKV03979.1| TIGR00266 family protein [Leptolyngbya sp. PCC 7375]
          Length = 233

 Score = 42.0 bits (97), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 51/210 (24%), Positives = 97/210 (46%), Gaps = 16/210 (7%)

Query: 36  QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS-D 94
           ++ L+P + V    G+M  M   +EM++      + G+ + L G+++        G S D
Sbjct: 19  KLSLQPNQTVSVEAGAMAAMDSCIEMKSKAKGGWKKGLGRMLGGESLFISEFTAQGSSGD 78

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
            ++    P   +   +D +     L+ Q   F+ +   V + +   Q  +    G   FL
Sbjct: 79  LYISPGVPGDIQHYILDGSQ---SLMVQSSGFVAAGPGVSIDSKF-QGFKGFFSGESLFL 134

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK------YNGPIR 208
             ++TGQG  +  + G++++  +E   V  VD   IVA   S++  ++      + G +R
Sbjct: 135 L-RVTGQGDFWFSSFGAILEIPVEGNYV--VDTGYIVAFEDSLSYNVEMISGLSFRG-LR 190

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQSLPFHRL 238
             +FGG+ LV A  +G G ++IQS   + L
Sbjct: 191 TGIFGGEGLV-ARFSGSGRLWIQSRNVYSL 219


>gi|313123760|ref|YP_004034019.1| hypothetical protein LDBND_1002 [Lactobacillus delbrueckii subsp.
           bulgaricus ND02]
 gi|312280323|gb|ADQ61042.1| Hypothetical protein LDBND_1002 [Lactobacillus delbrueckii subsp.
           bulgaricus ND02]
          Length = 233

 Score = 41.6 bits (96), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 52/238 (21%), Positives = 100/238 (42%), Gaps = 18/238 (7%)

Query: 23  IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + +++ GG    +  + L   E +    G+M +M        +   +  +G      G++
Sbjct: 1   MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60

Query: 82  ITS----VVLRNPGPSDGF-VGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
           +TS     + +  G +DG  +G+A     +IL + +   G +  C    AFL S + V  
Sbjct: 61  LTSGESMFITQATGDADGGQIGVAPAIPGKILKLSV---GKQQYCLNTGAFLASDDSVSY 117

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                   +   GG  GF   +  G+G   + A G +V+  +   + I++D   ++A  +
Sbjct: 118 KMKSQSFSKAAFGGTGGFYVMQTEGEGDMLVNAFGDLVELTVTSNKPISIDNEHVIAWDA 177

Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
           ++   IK    +   +FG   G+ LV    TG G V IQ+     L++ +   +   N
Sbjct: 178 NLGYDIK----VASGMFGFTSGEGLVNH-FTGDGKVIIQTRNLRSLAEALQPLIVEKN 230


>gi|448603235|ref|ZP_21657056.1| hypothetical protein C441_03642 [Haloferax sulfurifontis ATCC
           BAA-897]
 gi|445746431|gb|ELZ97893.1| hypothetical protein C441_03642 [Haloferax sulfurifontis ATCC
           BAA-897]
          Length = 220

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 68/157 (43%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   APS       L     +D+A+  GE++ +  + +    D+          R  
Sbjct: 4   DEFVSAHAPSEGGDSFELENSKLLDVAL-DGEVMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG+ G L++K TG+G   + A G+           V    L+ GE ++V+ + ++A  S
Sbjct: 56  AGGLTGMLKKKATGEGDVMMQATGTGHLYLADQGKEVQILELDAGEELSVNGNDVLAFES 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           SVN  IK    I     GG  L    + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150


>gi|427724445|ref|YP_007071722.1| hypothetical protein Lepto7376_2617 [Leptolyngbya sp. PCC 7376]
 gi|427356165|gb|AFY38888.1| protein of unknown function DUF124 [Leptolyngbya sp. PCC 7376]
          Length = 225

 Score = 41.6 bits (96), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 52/204 (25%), Positives = 95/204 (46%), Gaps = 13/204 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENEVGMWQWLFGKTITSVVLRN 89
           +  I +   E++VA+ G+M  MSGS+     +          G+ + L G+++   V R 
Sbjct: 13  IAHIKMDAGEEIVAQAGAMIAMSGSINTSTTLRKGKGGGIMGGLKRMLAGESLFLSVFRA 72

Query: 90  PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           P P D  +  A   +  +L     M G E + Q  ++L S ++V +     Q  ++   G
Sbjct: 73  PVP-DSEIWFAPKLMGDLLLYQ--MQGDEFVVQASSYLASGSNVDIDLGW-QGFKSFFSG 128

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIR 208
            E      ++GQG   + + G++ + +++ GE I VD   IVA   +++ ++ K N    
Sbjct: 129 -ESIFWLSISGQGPLLVTSFGAIYEIDVD-GEYI-VDTGHIVAFEKTLSFKVGKANPSWM 185

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
            A  GG+ LV     G G ++ Q+
Sbjct: 186 GAFLGGEGLVCR-FQGKGKLYCQT 208


>gi|422004336|ref|ZP_16351556.1| hypothetical protein LSS_12589 [Leptospira santarosai serovar
           Shermani str. LT 821]
 gi|417257005|gb|EKT86413.1| hypothetical protein LSS_12589 [Leptospira santarosai serovar
           Shermani str. LT 821]
          Length = 227

 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 51/211 (24%), Positives = 91/211 (43%), Gaps = 31/211 (14%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-----WLFGKTITSVVLR 88
           + Q+ L+  E + A  G+M  MS +++M    + + E G+W       L G++      +
Sbjct: 13  IVQVQLEDGESIRAESGAMVAMSPTIKM----VTKAEGGIWASAKRALLSGESF----FQ 64

Query: 89  NPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
           N   ++   G+   + A    I+     GE L+    A++     + + +          
Sbjct: 65  NTFKAENGRGMIFLTSATQGDIEYRKMNGEDLILSKGAYVAGSESLLIDSKW-------- 116

Query: 148 GGIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           GG +GF         K++G G  F  + G++    +EV     VD   IV    S+N  I
Sbjct: 117 GGFKGFFSGEGLFFLKVSGVGDLFFSSFGAI--HTIEVNGQYVVDTGHIVGFEGSLNYTI 174

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  G ++     G+ LV AV +G G ++IQS
Sbjct: 175 QKVGGLKSLFLSGEGLV-AVFSGSGKLYIQS 204


>gi|359685937|ref|ZP_09255938.1| hypothetical protein Lsan2_15283 [Leptospira santarosai str.
           2000030832]
 gi|410450315|ref|ZP_11304356.1| TIGR00266 family protein [Leptospira sp. Fiocruz LV3954]
 gi|418746767|ref|ZP_13303087.1| TIGR00266 family protein [Leptospira santarosai str. CBC379]
 gi|410015828|gb|EKO77919.1| TIGR00266 family protein [Leptospira sp. Fiocruz LV3954]
 gi|410792476|gb|EKR90411.1| TIGR00266 family protein [Leptospira santarosai str. CBC379]
          Length = 227

 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 51/211 (24%), Positives = 91/211 (43%), Gaps = 31/211 (14%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-----WLFGKTITSVVLR 88
           + Q+ L+  E + A  G+M  MS +++M    + + E G+W       L G++      +
Sbjct: 13  IVQVQLEDGESIRAESGAMVAMSPTIKM----VTKAEGGIWASAKRALLSGESF----FQ 64

Query: 89  NPGPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
           N   ++   G+   + A    I+     GE L+    A++     + + +          
Sbjct: 65  NTFKAENGRGMIFLTSATQGDIEYRKMNGEDLILSKGAYVAGSESLLIDSKW-------- 116

Query: 148 GGIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
           GG +GF         K++G G  F  + G++    +EV     VD   IV    S+N  I
Sbjct: 117 GGFKGFFSGEGLFFLKVSGVGDLFFSSFGAI--HTIEVNGQYVVDTGHIVGFEGSLNYTI 174

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +  G ++     G+ LV AV +G G ++IQS
Sbjct: 175 QKVGGLKSLFLSGEGLV-AVFSGSGKLYIQS 204


>gi|228478056|ref|ZP_04062667.1| conserved hypothetical protein [Streptococcus salivarius SK126]
 gi|418018712|ref|ZP_12658267.1| hypothetical protein SSALIVM18_09357 [Streptococcus salivarius M18]
 gi|228250236|gb|EEK09489.1| conserved hypothetical protein [Streptococcus salivarius SK126]
 gi|345526154|gb|EGX29466.1| hypothetical protein SSALIVM18_09357 [Streptococcus salivarius M18]
          Length = 231

 Score = 41.6 bits (96), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 72/163 (44%), Gaps = 7/163 (4%)

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           IT VV +    ++G++ +A  S  +++P+ L     +      AFL        +  L  
Sbjct: 70  ITQVVSQ---SNNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQS 124

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
             + + GG  GF      GQG     A GS+ +  L   EV T+D + +VA + +++  I
Sbjct: 125 VGKAIFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
                  +++  G+ +V     G G +++QSL     +  + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225


>gi|448585889|ref|ZP_21648061.1| hypothetical protein C454_15881 [Haloferax gibbonsii ATCC 33959]
 gi|445725507|gb|ELZ77130.1| hypothetical protein C454_15881 [Haloferax gibbonsii ATCC 33959]
          Length = 220

 Score = 41.6 bits (96), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 68/157 (43%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   APS       L     +D+A+  G ++ +  + +    D+          R  
Sbjct: 4   DEFVSTHAPSEGGESFELENSKLLDVAL-DGSVMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG+ G L++K TG+G   + A G+           V    L+ GE ++V+ + ++A  S
Sbjct: 56  AGGLTGMLKKKATGEGDVMMQATGTGHLYLADQAKEVQILELDAGEELSVNGNDVLAFES 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           SVN  IK    I  A  GG  L    + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGASSGG--LFNVFLEGPGHVAITT 150


>gi|422824507|ref|ZP_16872694.1| hypothetical protein HMPREF9390_1542 [Streptococcus sanguinis
           SK405]
 gi|422825413|ref|ZP_16873592.1| hypothetical protein HMPREF9392_0347 [Streptococcus sanguinis
           SK678]
 gi|422856609|ref|ZP_16903265.1| hypothetical protein HMPREF9378_1536 [Streptococcus sanguinis SK1]
 gi|422863528|ref|ZP_16910159.1| hypothetical protein HMPREF9391_2053 [Streptococcus sanguinis
           SK408]
 gi|422866440|ref|ZP_16913065.1| hypothetical protein HMPREF9395_2151 [Streptococcus sanguinis
           SK1058]
 gi|422877404|ref|ZP_16923874.1| hypothetical protein HMPREF9393_1638 [Streptococcus sanguinis
           SK1056]
 gi|324992556|gb|EGC24477.1| hypothetical protein HMPREF9390_1542 [Streptococcus sanguinis
           SK405]
 gi|324995915|gb|EGC27826.1| hypothetical protein HMPREF9392_0347 [Streptococcus sanguinis
           SK678]
 gi|327459968|gb|EGF06307.1| hypothetical protein HMPREF9378_1536 [Streptococcus sanguinis SK1]
 gi|327472502|gb|EGF17933.1| hypothetical protein HMPREF9391_2053 [Streptococcus sanguinis
           SK408]
 gi|327488549|gb|EGF20349.1| hypothetical protein HMPREF9395_2151 [Streptococcus sanguinis
           SK1058]
 gi|332360043|gb|EGJ37857.1| hypothetical protein HMPREF9393_1638 [Streptococcus sanguinis
           SK1056]
          Length = 231

 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 48/220 (21%), Positives = 102/220 (46%), Gaps = 16/220 (7%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS----VVL 87
           + ++ L   E V  + GSM + + ++ + N  +  +  G+ +++   G+++ S     + 
Sbjct: 13  LVELSLNQGETVFIQRGSMVYHTPNVTL-NTQLNASGSGLGRFVKAVGRSMVSGESTFIT 71

Query: 88  RNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--R 144
           +    SD G++ +A  +  +++P+ L     +      AFL    D     T++++   +
Sbjct: 72  QAVAQSDNGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGK 127

Query: 145 NVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYN 204
            + GG  GF      GQG     A GS+ +  L   EV T+D + +VA + S+N  +   
Sbjct: 128 ALFGGQGGFFVMTTQGQGTLLANAFGSIKKLELHNQEV-TIDNAHVVAWSQSLNYHVHLE 186

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
               +++  G+ +V     G G V++QSL     +  +++
Sbjct: 187 NGFWQSIGTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225


>gi|307152758|ref|YP_003888142.1| hypothetical protein Cyan7822_2910 [Cyanothece sp. PCC 7822]
 gi|306982986|gb|ADN14867.1| protein of unknown function DUF124 [Cyanothece sp. PCC 7822]
          Length = 227

 Score = 41.2 bits (95), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 51/206 (24%), Positives = 92/206 (44%), Gaps = 19/206 (9%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRN--PG 91
           + ++ L   E+V    GSM  MS  + +E          + + L G        +N    
Sbjct: 13  IGRLRLVGGEQVRVESGSMVGMSAGVTLETTTTGGFMQSLRRSLLGG---ESFFQNVFTA 69

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGE--LLCQPDAFLCSVNDVKVSNTLDQR---VRNV 146
           P+ G   + APSL    P DL +      +L Q   ++ S  D+ ++  LD +    ++ 
Sbjct: 70  PAQGGEVLVAPSL----PGDLKVLDMSEPMLLQSGCYVAS--DMGIA--LDSKWGGSKSF 121

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
            G   G    +  GQG   + + G++ +  L+ GE  TVD   +VA++ S+  + +  G 
Sbjct: 122 FGTGGGLFMLRAEGQGQIVVSSYGAIHEITLDRGESYTVDTGHLVALSESMPFKTRSIGG 181

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQS 232
           ++  +F G+  V   +TGPG   +Q+
Sbjct: 182 MKTFMFSGEGFVID-LTGPGKFLVQT 206


>gi|387760488|ref|YP_006067465.1| hypothetical protein Ssal_00185 [Streptococcus salivarius 57.I]
 gi|339291255|gb|AEJ52602.1| conserved hypothetical protein [Streptococcus salivarius 57.I]
          Length = 231

 Score = 41.2 bits (95), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 7/163 (4%)

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           IT VV ++   ++G++ +A  S  +++P+ L     +      AFL        +  L  
Sbjct: 70  ITQVVSQS---NNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQS 124

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
             + + GG  GF      GQG     A GS+ +  L   EV T+D + +VA + +++  I
Sbjct: 125 VGKAIFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
                  +++  G+ +V     G G +++QSL     +  + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225


>gi|384567619|ref|ZP_10014723.1| TIGR00266 family protein [Saccharomonospora glauca K62]
 gi|384523473|gb|EIF00669.1| TIGR00266 family protein [Saccharomonospora glauca K62]
          Length = 225

 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 49/209 (23%), Positives = 87/209 (41%), Gaps = 16/209 (7%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           V +++L P E      G+M   S  M M+         G+ + +FG     V      P+
Sbjct: 13  VARLLLAPNEPAQVEAGAMVATSYGMHMQASTQGGVMKGLGRAIFGGESLFVSTYTAPPN 72

Query: 94  DGFVGIAA--PSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
            G+V +A   P   R++ +D  +          ++L +   + +         N+ GG  
Sbjct: 73  GGWVDVAGGLPGDIRVIEMDGRV---GWCVTRGSWLANSYGIHLETKWGG-FGNLFGGEG 128

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           GFL     GQG   +   G++    L+ GE +T+D   +VA   +V  Q      +R+  
Sbjct: 129 GFLTHA-QGQGQLLVSCYGAIDVVTLQPGEYVTIDSGHVVAYADTVQSQ------LRKVA 181

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQ 240
            G   ++ ++ +G G VF  + P   L+Q
Sbjct: 182 QG---VIQSLKSGEGFVFDFAGPGQILTQ 207


>gi|300812994|ref|ZP_07093380.1| conserved hypothetical protein TIGR00266 [Lactobacillus delbrueckii
           subsp. bulgaricus PB2003/044-T3-4]
 gi|300496028|gb|EFK31164.1| conserved hypothetical protein TIGR00266 [Lactobacillus delbrueckii
           subsp. bulgaricus PB2003/044-T3-4]
          Length = 233

 Score = 41.2 bits (95), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 50/230 (21%), Positives = 99/230 (43%), Gaps = 18/230 (7%)

Query: 23  IPFQILGGEA-QVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT 81
           + +++ GG    +  + L   E +    G+M +M        +   +  +G      G++
Sbjct: 1   MKYELSGGNTCPLATVTLDKGESIKVENGAMVYMKDVTIAGKMNSSKKGLGGLLGAIGRS 60

Query: 82  ITS----VVLRNPGPSD-GFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKV 135
           +TS     + +  G +D G +G+A     +I+ + +   G +  C    AFL S + V  
Sbjct: 61  LTSGESMFITQATGDADSGQIGVAPAIPGKIVKLSV---GKQQYCLNTGAFLASDDSVSY 117

Query: 136 SNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTS 195
                   +   GG  GF   +  G+G   + A G +V+  +   + I++D   ++A  +
Sbjct: 118 KMKSQSFSKAAFGGTGGFYVMQTEGEGDMLVNAFGDLVELTVTSNKPISIDNEHVIAWDA 177

Query: 196 SVNVQIKYNGPIRRAVFG---GDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +++  IK    +   +FG   G+ LV    TG G V IQ+     L++ +
Sbjct: 178 NLDYDIK----VASGMFGFTSGEGLVNH-FTGDGKVIIQTRNLRSLAEAL 222


>gi|332672253|ref|YP_004455261.1| hypothetical protein Celf_3768 [Cellulomonas fimi ATCC 484]
 gi|332341291|gb|AEE47874.1| protein of unknown function DUF124 [Cellulomonas fimi ATCC 484]
          Length = 213

 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 54/220 (24%), Positives = 94/220 (42%), Gaps = 22/220 (10%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI 82
           +PF+ +   ++V +I + PQ  V+AR G+M   SG +    I    + VG          
Sbjct: 1   MPFEKV--NSKVVRIPVTPQNAVLARRGAMLGYSGQVAFRPISGQGHGVGGMVGAAMSGE 58

Query: 83  TSVVLRNPGPSD---GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS--- 136
           ++ ++   G      GF G+       +  IDL   G  L  + D  L     ++ S   
Sbjct: 59  SNPMMATEGNGSVLYGFRGL------HVTVIDLT--GDSLTVEADRVLAHDAHLQTSVEF 110

Query: 137 ---NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
                +   VR  + G +G    +++GQG   +L+ G      L  G+ + VD    V  
Sbjct: 111 IGQGGVRSAVRGAMTG-QGLFTTRISGQGSVAVLSHGGTFPLQLN-GDTVGVDPQAYVGH 168

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAV-VTGPGIVFIQS 232
           T ++NV +K +  +R  V  G      + V+G G V++Q+
Sbjct: 169 TGALNVDLKASVGLRDLVGRGSGEAFQLHVSGHGTVYVQA 208


>gi|322710681|gb|EFZ02255.1| DUF124 domain protein [Metarhizium anisopliae ARSEF 23]
          Length = 270

 Score = 41.2 bits (95), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 91/201 (45%), Gaps = 16/201 (7%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITSVVLRNPGP 92
           +  + L+P   V ++ G+M  MSG+++++ NI     +  M +   G  ++      PG 
Sbjct: 68  ILNVNLQPGGTVRSKSGAMIHMSGTIQLQGNI-----KFSMKKLFTGGRMSESTYTGPGR 122

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
                    P+L   + I L + G        D FL    DV +  T  Q +   +   E
Sbjct: 123 L-----ALGPTLMGDI-ITLHVDGRTPWTVGKDTFLACTPDVAM-ETKSQGISKSMFSGE 175

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
                ++ GQGL ++ + G+V + +L+ GE   VD   +VA +    ++ K  G    ++
Sbjct: 176 DLFVYRVGGQGLLWLTSFGAVDRLDLQPGEQHIVDNGHLVAWSCDYRIE-KAGGGAMSSL 234

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+ LV    TGPG V++Q+
Sbjct: 235 KTGEGLV-CRFTGPGAVYVQT 254


>gi|398336774|ref|ZP_10521479.1| hypothetical protein LkmesMB_14981 [Leptospira kmetyi serovar
           Malaysia str. Bejo-Iso9]
          Length = 224

 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 43/135 (31%), Positives = 68/135 (50%), Gaps = 9/135 (6%)

Query: 99  IAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQK 157
           + AP+L   +  IDL+   G +  Q  +FL S  ++++ +T  Q ++  I G E     K
Sbjct: 79  LLAPTLPGDVDRIDLS---GTVYVQSSSFLASSPNIEM-DTKFQGLKGFISG-ESLFFLK 133

Query: 158 LTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNL 217
           L+G G   I + G +   N++ GE+I VD   IVA    +N ++   G  +    GG+  
Sbjct: 134 LSGNGPLLISSYGGIDVLNVD-GEMI-VDTGHIVAFDEGLNYEMTKFGGWKSFFLGGEGF 191

Query: 218 VTAVVTGPGIVFIQS 232
           V A   G G V+IQS
Sbjct: 192 V-ARFKGRGRVWIQS 205


>gi|186681073|ref|YP_001864269.1| hypothetical protein Npun_R0563 [Nostoc punctiforme PCC 73102]
 gi|186463525|gb|ACC79326.1| protein of unknown function DUF124 [Nostoc punctiforme PCC 73102]
          Length = 222

 Score = 41.2 bits (95), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 70/140 (50%), Gaps = 10/140 (7%)

Query: 102 PSLA--RILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLT 159
           PSLA   IL +D    G E +    A+  S   ++V    ++ V  +IGG EG  + K+ 
Sbjct: 75  PSLAGYHILELD----GSEWILDTGAYWASDGSIEVGIERNKFVSGLIGG-EGLFQTKVK 129

Query: 160 GQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLV 218
           G+G   + A G V   NL+  + + VD +  +A T+++N ++ K    +  ++  G+ LV
Sbjct: 130 GRGKVVMAAQGPVEVINLQ-NDRLVVDGNFAIARTNTLNYRVEKATKSLLGSMTSGEFLV 188

Query: 219 TAVVTGPGIVFIQSLPFHRL 238
                G G V +  +P+ ++
Sbjct: 189 N-TFEGTGTVLLAPIPYWKV 207


>gi|292655523|ref|YP_003535420.1| hypothetical protein HVO_1369 [Haloferax volcanii DS2]
 gi|448291905|ref|ZP_21482579.1| hypothetical protein C498_11898 [Haloferax volcanii DS2]
 gi|291371101|gb|ADE03328.1| hypothetical protein HVO_1369 [Haloferax volcanii DS2]
 gi|445573424|gb|ELY27945.1| hypothetical protein C498_11898 [Haloferax volcanii DS2]
          Length = 220

 Score = 41.2 bits (95), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 68/157 (43%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   APS       L     +D+A+  G ++ +  + +    D+          R  
Sbjct: 4   DEFVSAHAPSEGGDSFELENSKLLDVAL-DGNVMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG++G L+QK TG+G   + A G+           V    L+ GE ++V+ + ++A  S
Sbjct: 56  AGGLKGMLKQKATGEGDVMMQATGTGHLYLADQGKEVQILELDAGEELSVNGNDVLAFES 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           SVN  IK    +     GG  L    + GPG V I +
Sbjct: 116 SVNWDIKMLTSVAGTSSGG--LFNVFLEGPGHVAITT 150


>gi|417765826|ref|ZP_12413782.1| TIGR00266 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
 gi|400351765|gb|EJP03978.1| TIGR00266 family protein [Leptospira interrogans serovar Bulgarica
           str. Mallika]
          Length = 224

 Score = 41.2 bits (95), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 40/131 (30%), Positives = 63/131 (48%), Gaps = 7/131 (5%)

Query: 104 LARILPIDLAM--FGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQ 161
           LA  LP D+      G +  Q  +FL S  ++++ +T  Q ++  I G E     +L+G 
Sbjct: 80  LAPTLPGDVERIELSGTVYVQSSSFLASSPNIEM-DTKFQGLKGFISG-ESLFFLRLSGN 137

Query: 162 GLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAV 221
           G   I + G +   N++ GE+I VD   IVA    +N ++   G  +    GG+  V   
Sbjct: 138 GFLLISSYGGIDVLNVD-GEMI-VDTGHIVAFEEGLNYEMTKFGGWKSFFLGGEGFVVR- 194

Query: 222 VTGPGIVFIQS 232
             G G V+IQS
Sbjct: 195 FKGKGKVWIQS 205


>gi|421453270|ref|ZP_15902626.1| hypothetical protein RSSL_01892 [Streptococcus salivarius K12]
 gi|400181579|gb|EJO15846.1| hypothetical protein RSSL_01892 [Streptococcus salivarius K12]
          Length = 253

 Score = 41.2 bits (95), Expect = 0.56,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 7/163 (4%)

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           IT VV ++   ++G++ +A  S  +++P+ L     +      AFL        +  L  
Sbjct: 92  ITQVVSQS---NNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQS 146

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
             + + GG  GF      GQG     A GS+ +  L   EV T+D + +VA + +++  I
Sbjct: 147 VGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNI 205

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
                  +++  G+ +V     G G +++QSL     +  + R
Sbjct: 206 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 247


>gi|344210191|ref|YP_004786367.1| hypothetical protein HAH_5250 [Haloarcula hispanica ATCC 33960]
 gi|343785408|gb|AEM59383.1| conserved hypothetical protein [Haloarcula hispanica ATCC 33960]
          Length = 223

 Score = 40.8 bits (94), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 75/157 (47%), Gaps = 22/157 (14%)

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           +R+  P+DG  G    +  R+L I L    G ++ +  + +    +V  +        + 
Sbjct: 7   VRSNAPADGGDGFQKEN-NRLLDIPLD---GTVMVKAGSMVAYTGEVTFTGK-----SSA 57

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS---VVQKN--------LEVGEVITVDVSCIVAVTS 195
            GGI GF+++ ++G+G   + A GS    V +N        L+ GE I+V+ + ++A  S
Sbjct: 58  EGGITGFVKEAVSGEGTPVMEAEGSGHLYVAENGKKVQVLALDDGESISVNGTDVLAFES 117

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +++ +I   G +     GG  L    +TGPG V + +
Sbjct: 118 TIDYEINTVGSLSGMAAGG--LTNVYLTGPGEVALTT 152


>gi|398345152|ref|ZP_10529855.1| hypothetical protein LinasL1_19337 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 191

 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 46/139 (33%), Positives = 68/139 (48%), Gaps = 21/139 (15%)

Query: 104 LARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKL-- 158
           LA  LP   ++L M G ++L Q  A+L S + V++ +          GG+   L  +   
Sbjct: 48  LAPELPGDIVELDMNGKKILVQAGAYLASDDAVQIVSKF--------GGLRSLLGGEGLF 99

Query: 159 ----TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFG 213
               +G G AF+ A GS+    +EV    TVD   IVA  SS+   + K  G  +  +F 
Sbjct: 100 LLELSGIGKAFLSAYGSIFP--IEVQGGYTVDTGHIVAFDSSLQFTVGKAGGSWKSTLFS 157

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ LV A  TG G ++IQS
Sbjct: 158 GEGLV-ANFTGNGTLWIQS 175


>gi|422405472|ref|ZP_16482515.1| hypothetical protein Pgy4_16834, partial [Pseudomonas syringae pv.
           glycinea str. race 4]
 gi|330880406|gb|EGH14555.1| hypothetical protein Pgy4_16834 [Pseudomonas syringae pv. glycinea
           str. race 4]
          Length = 48

 Score = 40.8 bits (94), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 25/32 (78%)

Query: 211 VFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRI 242
           +FGG+ +  A +TGPG V++QSLPF RL+ R+
Sbjct: 4   LFGGEGVFFARLTGPGTVWLQSLPFSRLAGRM 35


>gi|425781278|gb|EKV19254.1| hypothetical protein PDIG_04140 [Penicillium digitatum PHI26]
 gi|425783360|gb|EKV21214.1| hypothetical protein PDIP_08810 [Penicillium digitatum Pd1]
          Length = 184

 Score = 40.8 bits (94), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 40/158 (25%), Positives = 74/158 (46%), Gaps = 7/158 (4%)

Query: 75  QWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
           ++L G ++T+     PG     + +A P L  I+  ++   G +     D+FL   + V 
Sbjct: 17  KFLIGGSMTASHYTGPGE----LLLAPPILGDIIVHEINP-GDKWNVGRDSFLAHTSGVC 71

Query: 135 VSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
                    +    G EGF   K TGQGL ++ + G++++K+L  GE   +D   +VA  
Sbjct: 72  HEYKRQDLSKGFFSG-EGFFVYKFTGQGLFWMQSFGAIIKKDLVEGEAYFIDNGHLVAWN 130

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
              +++   +G +  +   G+ L      GPG V++Q+
Sbjct: 131 CKYDIERVASGGLLSSFSSGEGLACK-FKGPGTVYLQT 167


>gi|448642708|ref|ZP_21678667.1| hypothetical protein C436_17970 [Haloarcula sinaiiensis ATCC 33800]
 gi|448660409|ref|ZP_21683469.1| hypothetical protein C435_20388 [Haloarcula californiae ATCC 33799]
 gi|445759198|gb|EMA10484.1| hypothetical protein C435_20388 [Haloarcula californiae ATCC 33799]
 gi|445759508|gb|EMA10786.1| hypothetical protein C436_17970 [Haloarcula sinaiiensis ATCC 33800]
          Length = 223

 Score = 40.8 bits (94), Expect = 0.69,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 75/157 (47%), Gaps = 22/157 (14%)

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           +R+  P+DG  G    +  R+L I L    G ++ +  + +    +V  +        + 
Sbjct: 7   VRSNAPADGGDGFQKEN-NRLLDIPLD---GTVMVKAGSMVAYTGEVTFTGK-----SSA 57

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS---VVQKN--------LEVGEVITVDVSCIVAVTS 195
            GGI GF+++ ++G+G   + A GS    V +N        L+ GE I+V+ + ++A  S
Sbjct: 58  EGGITGFVKEAVSGEGTPVMEAEGSGHLYVAENGKKVQVLALDDGESISVNGTDVLAFES 117

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +++ +I   G +     GG  L    +TGPG V + +
Sbjct: 118 TIDYEINTVGSLSGMAAGG--LTNVYLTGPGEVALTT 152


>gi|254416733|ref|ZP_05030483.1| conserved hypothetical protein TIGR00266 [Coleofasciculus
           chthonoplastes PCC 7420]
 gi|196176473|gb|EDX71487.1| conserved hypothetical protein TIGR00266 [Coleofasciculus
           chthonoplastes PCC 7420]
          Length = 225

 Score = 40.8 bits (94), Expect = 0.73,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 99/207 (47%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSGSME----MENIYIPENEVGMWQWLFGKTITSVV 86
           ++ + +++L+  E++VA+ G M  M+GS+     +          G+ + + G+++   V
Sbjct: 10  DSAIARVVLEANEELVAQAGCMIAMNGSINASTTLRKGKGGGILGGLKRMVAGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P  + G V +A   +  +L  +  M G EL+ Q  ++L S + V +     Q  +++
Sbjct: 70  FRSP-IAGGEVFLAPKLMGDLLVYE--MKGKELVVQATSYLASESGVDIDLGF-QGFKSL 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
             G E      L+G G+  + + G + +  +  GE + VD   IVA   S+N  I   G 
Sbjct: 126 FSG-ESIFWLSLSGYGVVLLTSFGGIYKVPVN-GEYV-VDTGHIVAFEKSLNFDITKAGS 182

Query: 207 IRRAVF-GGDNLVTAVVTGPGIVFIQS 232
              + F GG+ LV     G G V+ Q+
Sbjct: 183 SWLSAFLGGEGLVCR-FKGQGNVYCQT 208


>gi|340399707|ref|YP_004728732.1| hypothetical protein SALIVB_1961 [Streptococcus salivarius CCHSS3]
 gi|338743700|emb|CCB94210.1| uncharacterized protein M6_Spy0233 [Streptococcus salivarius
           CCHSS3]
          Length = 231

 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 73/163 (44%), Gaps = 7/163 (4%)

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           IT VV ++   ++G++ +A  S  +++P+ L     +      AFL        +  L  
Sbjct: 70  ITQVVSQS---NNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQS 124

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
             + + GG  GF      GQG     A GS+ +  L   EV T+D + +VA + +++  I
Sbjct: 125 VGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYDI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
                  +++  G+ +V     G G +++QSL     +  + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225


>gi|322515959|ref|ZP_08068900.1| hypothetical protein HMPREF9425_0177 [Streptococcus vestibularis
           ATCC 49124]
 gi|322125633|gb|EFX96963.1| hypothetical protein HMPREF9425_0177 [Streptococcus vestibularis
           ATCC 49124]
          Length = 231

 Score = 40.8 bits (94), Expect = 0.74,   Method: Compositional matrix adjust.
 Identities = 35/153 (22%), Positives = 67/153 (43%), Gaps = 4/153 (2%)

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
            ++G++ +A  S  +++P+ L     +      AFL        +  L    + + GG  
Sbjct: 77  SNNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQSVGKALFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           GF      GQG     A GS+ +  L   EV T+D + +VA + +++  I       +++
Sbjct: 135 GFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             G+ +V     G G +++QSL     +  + R
Sbjct: 194 GTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225


>gi|125718940|ref|YP_001036073.1| hypothetical protein SSA_2155 [Streptococcus sanguinis SK36]
 gi|422850676|ref|ZP_16897346.1| hypothetical protein HMPREF9383_0355 [Streptococcus sanguinis
           SK150]
 gi|422853341|ref|ZP_16900005.1| hypothetical protein HMPREF9384_0724 [Streptococcus sanguinis
           SK160]
 gi|125498857|gb|ABN45523.1| Conserved hypothetical protein [Streptococcus sanguinis SK36]
 gi|325695424|gb|EGD37324.1| hypothetical protein HMPREF9383_0355 [Streptococcus sanguinis
           SK150]
 gi|325697353|gb|EGD39239.1| hypothetical protein HMPREF9384_0724 [Streptococcus sanguinis
           SK160]
          Length = 231

 Score = 40.4 bits (93), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 37/153 (24%), Positives = 70/153 (45%), Gaps = 8/153 (5%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
           +G++ +A  S  +++P+ L     +      AFL    D     T++++   + + GG  
Sbjct: 79  NGYLALAPDSPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKAIFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G       GQG     A GS+ +  L   EV T+D + +VA + S+N  I       +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             G+ +V     G G V++QSL     +  +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225


>gi|312862475|ref|ZP_07722718.1| TIGR00266 family protein [Streptococcus vestibularis F0396]
 gi|311102118|gb|EFQ60318.1| TIGR00266 family protein [Streptococcus vestibularis F0396]
          Length = 231

 Score = 40.4 bits (93), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 35/153 (22%), Positives = 67/153 (43%), Gaps = 4/153 (2%)

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
            ++G++ +A  S  +++P+ L     +      AFL        +  L    + + GG  
Sbjct: 77  SNNGYIALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMELQSVGKALFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           GF      GQG     A GS+ +  L   EV T+D + +VA + +++  I       +++
Sbjct: 135 GFFVMTTRGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             G+ +V     G G +++QSL     +  + R
Sbjct: 194 GTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225


>gi|284166382|ref|YP_003404661.1| hypothetical protein Htur_3123 [Haloterrigena turkmenica DSM 5511]
 gi|284016037|gb|ADB61988.1| protein of unknown function DUF124 [Haloterrigena turkmenica DSM
           5511]
          Length = 221

 Score = 40.4 bits (93), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 65/139 (46%), Gaps = 21/139 (15%)

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
           +++L +DL    G ++ +  + +    D+           +  GG++GFL++K T +G +
Sbjct: 24  SKLLDVDLD---GAVIAKAGSMIAYSGDISFKGK-----SSAEGGLKGFLKEKATSEGTS 75

Query: 165 FILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
            + A G+           +    L+ GE I+V+ + ++A  SSVN +I+    I  A F 
Sbjct: 76  VMEATGNGHLYLADDEKKIQILELDAGEEISVNGNDVLAFESSVNYEIRTIKSI--AGFS 133

Query: 214 GDNLVTAVVTGPGIVFIQS 232
              L    + GPG V I +
Sbjct: 134 AGGLTNVSLVGPGSVAITT 152


>gi|119485104|ref|ZP_01619489.1| hypothetical protein L8106_06639 [Lyngbya sp. PCC 8106]
 gi|119457332|gb|EAW38457.1| hypothetical protein L8106_06639 [Lyngbya sp. PCC 8106]
          Length = 261

 Score = 40.4 bits (93), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 50/207 (24%), Positives = 98/207 (47%), Gaps = 23/207 (11%)

Query: 36  QIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKT--ITSVVLRNPGPS 93
           ++ L+P + V+   G+M  M   ++M++        G+ + L G++  ++    +N  P 
Sbjct: 19  RLDLQPNQTVIVESGAMAAMDSCIQMQSKLKGGLMKGVGRMLSGESLFLSEFTAKNK-PG 77

Query: 94  DGFVGIAAPSLARILPIDLAMF---GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
           + F       L+  +P D+  +   G  L+ Q   F+ S N+V +     Q  +    G 
Sbjct: 78  ELF-------LSPGIPGDIQHYYLNGKGLMIQSSGFVASGNEVNLDTNF-QGFKGFFSGE 129

Query: 151 EGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG----- 205
             FL  + TGQG  ++ + G +++  +E G+ + VD   IVA   +++  ++  G     
Sbjct: 130 SLFLI-RATGQGDIWLSSYGGILEIPVE-GDYV-VDTGYIVAFEETLDYNVEVIGGLSFK 186

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            ++  +FGG+ LV     G G ++IQS
Sbjct: 187 NLKTGIFGGEGLVCR-FRGQGKLWIQS 212


>gi|149922842|ref|ZP_01911265.1| hypothetical protein PPSIR1_34772 [Plesiocystis pacifica SIR-1]
 gi|149816309|gb|EDM75813.1| hypothetical protein PPSIR1_34772 [Plesiocystis pacifica SIR-1]
          Length = 235

 Score = 40.4 bits (93), Expect = 0.90,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 62/124 (50%), Gaps = 20/124 (16%)

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVR-----NVIGGIEGFLRQKLTGQGLAFILAGGSVV 173
           L  Q  A+L S      + T+D ++R     ++IGG EG + Q+ TGQG  +I + G + 
Sbjct: 107 LFIQSSAYLAS------TGTVDTKLRFGGFKSMIGG-EGMVLQECTGQGDVWINSYGGIT 159

Query: 174 QKNLEVGEVITVDVSCIVAVTSSVNVQIKYNG-----PIRRAVFGGDNLVTAVVTGPGIV 228
           +  + V     VD   IVA   ++  ++K  G      ++  +F G+ LV    +G G V
Sbjct: 160 E--VPVNGTFIVDTGHIVAFDGNMTFKVKAAGSSGGFSLKSFLFSGEGLVCE-FSGTGRV 216

Query: 229 FIQS 232
           +IQS
Sbjct: 217 WIQS 220


>gi|448571304|ref|ZP_21639649.1| hypothetical protein C456_10074 [Haloferax lucentense DSM 14919]
 gi|448596140|ref|ZP_21653480.1| hypothetical protein C452_03878 [Haloferax alexandrinus JCM 10717]
 gi|445722516|gb|ELZ74174.1| hypothetical protein C456_10074 [Haloferax lucentense DSM 14919]
 gi|445741828|gb|ELZ93326.1| hypothetical protein C452_03878 [Haloferax alexandrinus JCM 10717]
          Length = 220

 Score = 40.4 bits (93), Expect = 0.93,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 68/157 (43%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   APS       L     +D+A+  G ++ +  + +    D+          R  
Sbjct: 4   DEFVSAHAPSEGGDSFELENSKLLDVAL-DGNVMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG++G L++K TG+G   + A G+           V    L+ GE ++V+ + ++A  S
Sbjct: 56  AGGLKGMLKKKATGEGDVMMQATGTGHLYLADQAKEVQILELDAGEELSVNGNDVLAFES 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           SVN  IK    I     GG  L    + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150


>gi|433417405|ref|ZP_20404716.1| hypothetical protein D320_00733 [Haloferax sp. BAB2207]
 gi|432200061|gb|ELK56178.1| hypothetical protein D320_00733 [Haloferax sp. BAB2207]
          Length = 220

 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 68/157 (43%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   APS       L     +D+A+  G ++ +  + +    D+          R  
Sbjct: 4   DEFVSAHAPSEGGDSFELENSKLLDVAL-DGNVMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG++G L++K TG+G   + A G+           V    L+ GE ++V+ + ++A  S
Sbjct: 56  AGGLKGMLKKKATGEGDVMMQATGTGHLYLADQAKEVQILELDAGEELSVNGNDVLAFES 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           SVN  IK    I     GG  L    + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150


>gi|422847658|ref|ZP_16894341.1| hypothetical protein HMPREF9381_2000 [Streptococcus sanguinis SK72]
 gi|325686656|gb|EGD28682.1| hypothetical protein HMPREF9381_2000 [Streptococcus sanguinis SK72]
          Length = 231

 Score = 40.0 bits (92), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 37/155 (23%), Positives = 71/155 (45%), Gaps = 8/155 (5%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
           +G++ +A  S  +++P+ L     +      AFL    D     T++++   + + GG  
Sbjct: 79  NGYLALAPDSPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G       GQG     A GS+ +  L   EV T+D + +VA + S+N  I       +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             G+ +V     G G V++QSL     +  +++ +
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSKYI 227


>gi|422822558|ref|ZP_16870751.1| hypothetical protein HMPREF9388_2019 [Streptococcus sanguinis
           SK353]
 gi|422857502|ref|ZP_16904152.1| hypothetical protein HMPREF9394_0339 [Streptococcus sanguinis
           SK1057]
 gi|324989828|gb|EGC21771.1| hypothetical protein HMPREF9388_2019 [Streptococcus sanguinis
           SK353]
 gi|327463553|gb|EGF09872.1| hypothetical protein HMPREF9394_0339 [Streptococcus sanguinis
           SK1057]
          Length = 231

 Score = 40.0 bits (92), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 37/155 (23%), Positives = 71/155 (45%), Gaps = 8/155 (5%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
           +G++ +A  S  +++P+ L     +      AFL    D     T++++   + + GG  
Sbjct: 79  NGYLALAPDSPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G       GQG     A GS+ +  L   EV T+D + +VA + S+N  I       +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             G+ +V     G G V++QSL     +  +++ +
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSKYI 227


>gi|401682713|ref|ZP_10814603.1| TIGR00266 family protein [Streptococcus sp. AS14]
 gi|422872397|ref|ZP_16918890.1| hypothetical protein HMPREF9397_2136 [Streptococcus sanguinis
           SK1087]
 gi|328944647|gb|EGG38808.1| hypothetical protein HMPREF9397_2136 [Streptococcus sanguinis
           SK1087]
 gi|400183953|gb|EJO18200.1| TIGR00266 family protein [Streptococcus sp. AS14]
          Length = 231

 Score = 40.0 bits (92), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 37/153 (24%), Positives = 70/153 (45%), Gaps = 8/153 (5%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
           +G++ +A  S  +++P+ L     +      AFL    D     T++++   + + GG  
Sbjct: 79  NGYLALAPDSPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G       GQG     A GS+ +  L   EV T+D + +VA + S+N  I       +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             G+ +V     G G V++QSL     +  +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225


>gi|322374139|ref|ZP_08048673.1| hypothetical protein HMPREF0848_01845 [Streptococcus sp. C150]
 gi|321277105|gb|EFX54176.1| hypothetical protein HMPREF0848_01845 [Streptococcus sp. C150]
          Length = 231

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 39/163 (23%), Positives = 71/163 (43%), Gaps = 7/163 (4%)

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           IT VV +    ++G++ +A  S  +++P+ L     +      AFL        +     
Sbjct: 70  ITQVVAQ---SNNGYLALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMEYQS 124

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
             + + GG  GF      GQG     A GS+ +  L   EV T+D + +VA + S++  I
Sbjct: 125 VGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQSLDYNI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
                  +++  G+ +V     G G +++QSL     +  + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225


>gi|448477879|ref|ZP_21603763.1| hypothetical protein C462_00162 [Halorubrum arcis JCM 13916]
 gi|445822992|gb|EMA72735.1| hypothetical protein C462_00162 [Halorubrum arcis JCM 13916]
          Length = 219

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 39/153 (25%), Positives = 70/153 (45%), Gaps = 25/153 (16%)

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGI 150
           G SDGF  +      R+L I    F G ++ +  + +    DV  +        +  GGI
Sbjct: 14  GSSDGFRKVNK----RLLEIP---FEGTVMVKAGSMIAYTGDVTFTGK-----SSAEGGI 61

Query: 151 EGFLRQKLTGQGLAFILAGGS----VVQK-------NLEVGEVITVDVSCIVAVTSSVNV 199
            GF++  ++G+G   + A GS    V ++        L+  E I+V+ + ++A    ++ 
Sbjct: 62  AGFVKDAVSGEGTPIMEAEGSGTLYVAEQGKKVQILTLDADESISVNGTDVLAFEKKIDY 121

Query: 200 QIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           +I   G + +A  GG  L    +TGPG V + +
Sbjct: 122 EISTVGGLSQAAAGG--LTNVFLTGPGEVAVTT 152


>gi|428773943|ref|YP_007165731.1| hypothetical protein Cyast_2132 [Cyanobacterium stanieri PCC 7202]
 gi|428688222|gb|AFZ48082.1| protein of unknown function DUF124 [Cyanobacterium stanieri PCC
           7202]
          Length = 225

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 100/207 (48%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMC----FMSGSMEMENIYIPENEVGMWQWLFGKTITSVV 86
           ++ + +I L  QE+++A  GSM     F++ S  +          G+ + + G+++   V
Sbjct: 10  DSAIAKITLDAQEELLAEAGSMIAMSDFINASTTLRQGKGGGILGGLKRMMAGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R   P +G + +A   +  I+  +  M G EL+ Q  ++L   + V++     Q +++ 
Sbjct: 70  FRCYQP-NGEIYLAPHLMGDIVVYE--MTGNELVVQSGSYLACASGVEIDLGF-QGLKSF 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNG 205
             G E     +++G G   + + G + + +++ GE + VD   IVA   +++ +I K + 
Sbjct: 126 FSG-EAIFWLQISGYGAVILTSFGGIYEIDVD-GEYV-VDTGHIVAFEKTLDFKITKASS 182

Query: 206 PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            +  +  GG+ LV     G G ++ Q+
Sbjct: 183 SLLGSFLGGEGLVCRFY-GKGKLYCQT 208


>gi|156937119|ref|YP_001434915.1| hypothetical protein Igni_0325 [Ignicoccus hospitalis KIN4/I]
 gi|156566103|gb|ABU81508.1| protein of unknown function DUF124 [Ignicoccus hospitalis KIN4/I]
          Length = 234

 Score = 39.7 bits (91), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 58/234 (24%), Positives = 104/234 (44%), Gaps = 27/234 (11%)

Query: 28  LGGEAQ-VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGM----------WQW 76
           L GEA  V ++ L+  EK+ A  G+M  M G ++++ + I   E G            + 
Sbjct: 5   LEGEAYPVIRVELEEGEKLYAEAGAMMMMKGDVKVDTVLIDLEEEGNVLRALAEALGRKL 64

Query: 77  LFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPD-AFLCSVNDVKV 135
           L G+T+   V   PG     V   +PSL     ++     GE     D +++    D+K 
Sbjct: 65  LAGETVFHNVFEGPG-----VVWLSPSLPG--GVEYIEVKGECWTVQDYSYVAHFGDLK- 116

Query: 136 SNTLDQRVRNVIGGIEGFL-RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
              LD   +   G   G L   +  G+G  ++ + G +++  +EV + + +D    VA+ 
Sbjct: 117 ---LDLAWKGPKGLFLGDLVWLRACGEGGLWVSSYGDIMK--VEVEDEMVIDNMHFVALP 171

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTS 248
                +++  G ++  + GG+  V   V GP  V++Q+     L+  IAR + S
Sbjct: 172 DDAEWRVEKFGGLKSFITGGEGYVIR-VKGPTTVYVQTRILPPLAAAIARFMPS 224


>gi|322368155|ref|ZP_08042724.1| hypothetical protein ZOD2009_01695 [Haladaptatus paucihalophilus
           DX253]
 gi|320552171|gb|EFW93816.1| hypothetical protein ZOD2009_01695 [Haladaptatus paucihalophilus
           DX253]
          Length = 215

 Score = 39.7 bits (91), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 47/199 (23%), Positives = 84/199 (42%), Gaps = 20/199 (10%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTI---TSVVLRNPGPS 93
           + L   E++    GS+   S  +E+     P++E  + + + G+     T+    +PG  
Sbjct: 16  LTLDADEEIRVEAGSLVSHSAGVEVP----PDSEALLERSVLGEETPFSTTFAADHPGT- 70

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGF 153
              + +A P    I   +L      L  Q  AFL +       + +D   +       G 
Sbjct: 71  ---IRLAPPFPGDIFHYELR--DETLYVQSCAFLAA------ESGIDFEEKGSFVECAGD 119

Query: 154 LRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
              +L+G GLAF+ + G+V   +L+ GE   +D   +VA   SV+ ++     +     G
Sbjct: 120 FLLELSGSGLAFLSSYGAVSIVSLDPGERYILDTGHVVAFEGSVDFEVTRVDTVHSHAGG 179

Query: 214 GDNLVTAVVTGPGIVFIQS 232
              LV   + GPG ++ QS
Sbjct: 180 RGGLVCE-LEGPGTIWTQS 197


>gi|448621380|ref|ZP_21668355.1| hypothetical protein C438_04932 [Haloferax denitrificans ATCC
           35960]
 gi|445755873|gb|EMA07255.1| hypothetical protein C438_04932 [Haloferax denitrificans ATCC
           35960]
          Length = 220

 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 43/157 (27%), Positives = 67/157 (42%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   APS       L     +D+A+  G ++ +  + +    D+          R  
Sbjct: 4   DEFVSAHAPSEGGDSFELENSKLLDVAL-DGNVMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG+ G L++K TG+G   + A G+           V    L+ GE ++V+ + ++A  S
Sbjct: 56  AGGLTGMLKKKATGEGDVMMQATGTGHLYLADQGKEVQILELDAGEELSVNGNDVLAFES 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           SVN  IK    I     GG  L    + GPG V I +
Sbjct: 116 SVNWDIKMLTSIAGTSSGG--LFNVFLEGPGHVAITT 150


>gi|422885126|ref|ZP_16931574.1| hypothetical protein HMPREF9380_2225 [Streptococcus sanguinis SK49]
 gi|332358097|gb|EGJ35929.1| hypothetical protein HMPREF9380_2225 [Streptococcus sanguinis SK49]
          Length = 231

 Score = 39.7 bits (91), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 36/157 (22%), Positives = 72/157 (45%), Gaps = 8/157 (5%)

Query: 92  PSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGG 149
            ++G++ +A  +  +++P+ L     +      AFL    D     T++++   + + GG
Sbjct: 77  SNNGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKAIFGG 132

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
             G       GQG     A GS+ +  L   EV T+D + +VA + S+N  I       +
Sbjct: 133 QGGLFVMTTQGQGTLLANAFGSIKRIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQ 191

Query: 210 AVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
           ++  G+ +V     G G V++QSL     +  +++ +
Sbjct: 192 SIGTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSKYI 227


>gi|448398830|ref|ZP_21570193.1| hypothetical protein C476_05362 [Haloterrigena limicola JCM 13563]
 gi|445670377|gb|ELZ22978.1| hypothetical protein C476_05362 [Haloterrigena limicola JCM 13563]
          Length = 221

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 64/139 (46%), Gaps = 21/139 (15%)

Query: 105 ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLA 164
           + +L IDL    G ++ +  + +    D+  +        +  GGI GF+++K TG+G  
Sbjct: 24  STLLDIDLD---GTVMAKAGSMIAYDGDISFTGK-----SSAEGGITGFIKEKATGEGTP 75

Query: 165 FILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFG 213
            + A G+           +    L+ G+ I+V+ + ++A  SSVN +I+    I  A F 
Sbjct: 76  VMEATGTGHLYLADQEKKIQLLELDTGQSISVNGNDVLAFESSVNYEIRTIKSI--AGFS 133

Query: 214 GDNLVTAVVTGPGIVFIQS 232
              +    + GPG V I +
Sbjct: 134 AGGMTNVFLEGPGSVAITT 152


>gi|428769068|ref|YP_007160858.1| hypothetical protein Cyan10605_0676 [Cyanobacterium aponinum PCC
           10605]
 gi|428683347|gb|AFZ52814.1| protein of unknown function DUF124 [Cyanobacterium aponinum PCC
           10605]
          Length = 220

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 44/201 (21%), Positives = 87/201 (43%), Gaps = 16/201 (7%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIY----IPENEVGMWQWLFGKTITSVVLRNPGP 92
           + L+P+E++ A  G+M  M G + M+  +    IP     + ++L G+++   V  NP  
Sbjct: 16  VTLEPRERITAEAGAMVSMDGDIVMKTEFSGGLIPAL---IRKFLGGESMFVNVFYNPTN 72

Query: 93  SDGFVGIAAPSLARILPIDLAMFGGELLC-QPDAFLCSVNDVKVSNTLDQRVRNVIGGIE 151
               + +    +  I  I L     + +C QP A++      K+            G  E
Sbjct: 73  RPLNLVLTQSMIGDIEAIKLE---NKSICFQPGAYIAHTPGAKMGVKWAGFASWFAG--E 127

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G  +   +G+G       G + ++  E+     VD + +VA    + + I  +G +  ++
Sbjct: 128 GLFKLHFSGKGRVLFGCYGGITRR--EINGDFIVDNNHLVAYDDGIKMNITLSGNLLSSL 185

Query: 212 FGGDNLVTAVVTGPGIVFIQS 232
             G+  V   +TG G +++QS
Sbjct: 186 TSGEGFVNK-LTGRGAIYLQS 205


>gi|389580754|ref|ZP_10170781.1| hypothetical protein DespoDRAFT_02788 [Desulfobacter postgatei
           2ac9]
 gi|389402389|gb|EIM64611.1| hypothetical protein DespoDRAFT_02788 [Desulfobacter postgatei
           2ac9]
          Length = 139

 Score = 39.3 bits (90), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 47/103 (45%), Gaps = 10/103 (9%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY--IPENEVGMWQWLFG- 79
           + +QI G + Q  ++ L P E V+A  G+M +M   +  E       E + G+   LFG 
Sbjct: 6   VDYQIFGDDMQAVEVGLDPGETVIAEAGAMNWMEQGIAFEAKMGDGSEADKGLMGKLFGA 65

Query: 80  --KTITSVVL-----RNPGPSDGFVGIAAPSLARILPIDLAMF 115
             + +T   L      N       V  +AP   +I+P+D+A F
Sbjct: 66  DKRALTGESLFLTHFTNQAHDKKRVAFSAPYPGKIIPVDMAAF 108


>gi|381163293|ref|ZP_09872523.1| hypothetical protein SacazDRAFT_02213 [Saccharomonospora azurea
           NA-128]
 gi|418461390|ref|ZP_13032465.1| hypothetical protein SZMC14600_10573 [Saccharomonospora azurea SZMC
           14600]
 gi|359738493|gb|EHK87378.1| hypothetical protein SZMC14600_10573 [Saccharomonospora azurea SZMC
           14600]
 gi|379255198|gb|EHY89124.1| hypothetical protein SacazDRAFT_02213 [Saccharomonospora azurea
           NA-128]
          Length = 197

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 3/92 (3%)

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY 203
           + + GG  GFL+   +G G     + G V    L+ GEV+TV+   ++A   +V V+++ 
Sbjct: 106 QTLFGGDSGFLKH-YSGTGPLVFASRGPVDALTLKAGEVVTVNPLFVLAYPDTVQVRLRA 164

Query: 204 NGPIR-RAVFGGDNLVTAVVTGPGIVFIQSLP 234
             P   +++  G+ L   V  GPG V +Q+ P
Sbjct: 165 LDPAEPQSIKTGEGLALDVA-GPGTVLVQTRP 195


>gi|427724446|ref|YP_007071723.1| hypothetical protein Lepto7376_2618 [Leptolyngbya sp. PCC 7376]
 gi|427356166|gb|AFY38889.1| protein of unknown function DUF124 [Leptolyngbya sp. PCC 7376]
          Length = 231

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 95/219 (43%), Gaps = 48/219 (21%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGK----------TITSVV 86
           + L P + ++   G+M  M  ++EM++        G+ + L G+          T T+ +
Sbjct: 19  VKLPPNQTLLVEAGAMAAMDANIEMKSKMRGGLMKGLGRMLSGESLFISEFTAQTTTAEI 78

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQR 142
             +PG                 P DLA +       L+ Q   F+ S   V+    LD +
Sbjct: 79  YLSPGA----------------PGDLAHYELDGSKSLMVQSSGFVASSPSVE----LDTK 118

Query: 143 VRNVIGGIEG----FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVN 198
            + + G   G    FLR   +GQG  +  + G++++  +E G+ + VD   IVA   ++N
Sbjct: 119 FQGLKGFFSGESIFFLRA--SGQGDFWFSSYGAIIEIPVE-GDYV-VDTGYIVAFEDTLN 174

Query: 199 VQIKYNG-----PIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             ++  G      +R  + GG+ LV    +G G +++QS
Sbjct: 175 YNVEMIGGLSFRSLRTGILGGEGLVCR-FSGKGKLWVQS 212


>gi|448575666|ref|ZP_21641946.1| hypothetical protein C455_03259 [Haloferax larsenii JCM 13917]
 gi|445730607|gb|ELZ82195.1| hypothetical protein C455_03259 [Haloferax larsenii JCM 13917]
          Length = 220

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 32/100 (32%), Positives = 47/100 (47%), Gaps = 13/100 (13%)

Query: 144 RNVIGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVA 192
           R   GG+ G L++K TG+G   + A GS           V    L+ GE ++V+ + ++A
Sbjct: 53  RKSAGGLTGMLKKKATGEGSVMMEASGSGQLYLADQGKEVQILELDAGEELSVNGNDVLA 112

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
              SV+  IK    I  A  GG  L    + GPG V I +
Sbjct: 113 FEDSVSWDIKMLSSIAGAKTGG--LFNVFLEGPGHVAITT 150


>gi|390630390|ref|ZP_10258374.1| Putative uncharacterized protein [Weissella confusa LBAE C39-2]
 gi|390484398|emb|CCF30722.1| Putative uncharacterized protein [Weissella confusa LBAE C39-2]
          Length = 265

 Score = 39.3 bits (90), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 51/235 (21%), Positives = 98/235 (41%), Gaps = 26/235 (11%)

Query: 3   APFFSTPFQPYVYQSPQDAVIPFQILGGEAQVCQIMLKPQE-KVVARPGSMCFMSGSMEM 61
             F    FQ  +  SP DA + + +     +  Q+++  Q   V  + G+M +M G ++ 
Sbjct: 20  GSFQVVEFQRDLSVSPFDAQVQYFMTQMNVKKKQLLINLQNADVTIQAGAMQWMLGDVKA 79

Query: 62  ENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIA----APSLARILPIDLAMFGG 117
                   +        GK I   V         + G       P+   IL +D++ +GG
Sbjct: 80  TTGIKGAGD------FLGKMIKGSVTGEAAIKPEYKGTGMLALEPTYKHILLLDVSEWGG 133

Query: 118 ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNL 177
           +++     +L S + +K          + + G EG     L G G+A + +   V Q+ L
Sbjct: 134 QVVIDDGFYLASEDGLKQKLVARSNFSSAVAGGEGLFNLSLVGNGVAALES--PVPQEEL 191

Query: 178 -EV---GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIV 228
            EV    +V+ +D +  +A +SS+N  ++ +G          +L+ +  +G G+V
Sbjct: 192 IEVELKDDVLKIDGNLAIAWSSSLNFTVERSGK---------SLIGSAASGEGLV 237


>gi|408388668|gb|EKJ68347.1| hypothetical protein FPSE_11355 [Fusarium pseudograminearum CS3096]
          Length = 242

 Score = 39.3 bits (90), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 52/203 (25%), Positives = 92/203 (45%), Gaps = 20/203 (9%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNP--- 90
           V  I L+    V ++ G+M  MS ++E+          G  ++ FGK  T   +      
Sbjct: 39  VLNIDLQQGTTVRSKSGAMIHMSSTVEI---------TGKSKFSFGKLFTGGNMYESLYA 89

Query: 91  GPSDGFVGIAAPSLARILPIDLAMFGGE-LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           GP  G V +       I  I L + G +      D+FL S ++V          + +  G
Sbjct: 90  GP--GRVALGPTLFGDI--ISLHVDGNQSWTIGKDSFLASTSEVTKKRETQGLGKALFSG 145

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRR 209
            + F+  ++ GQG+ ++ + G+V + +L+ GE   VD   +VA + +  ++ K  G    
Sbjct: 146 EDLFVF-RVEGQGIMWLTSFGAVDKLDLQAGEQHIVDNGHLVAWSCNYAIE-KAGGGAMS 203

Query: 210 AVFGGDNLVTAVVTGPGIVFIQS 232
            +  G+ LV    TGPG V++Q+
Sbjct: 204 GMKTGEGLV-CRFTGPGSVYVQT 225


>gi|422861607|ref|ZP_16908247.1| hypothetical protein HMPREF9386_1803 [Streptococcus sanguinis
           SK330]
 gi|327467840|gb|EGF13330.1| hypothetical protein HMPREF9386_1803 [Streptococcus sanguinis
           SK330]
          Length = 231

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 36/153 (23%), Positives = 70/153 (45%), Gaps = 8/153 (5%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
           +G++ +A  +  +++P+ L     +      AFL    D     T++++   + + GG  
Sbjct: 79  NGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKAIFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G       GQG     A GS+ +  L   EV T+D + +VA + S+N  I       +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             G+ +V     G G V++QSL     +  +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225


>gi|291302980|ref|YP_003514258.1| hypothetical protein Snas_5534 [Stackebrandtia nassauensis DSM
           44728]
 gi|290572200|gb|ADD45165.1| protein of unknown function DUF124 [Stackebrandtia nassauensis DSM
           44728]
          Length = 228

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 38/136 (27%), Positives = 63/136 (46%), Gaps = 9/136 (6%)

Query: 103 SLARILPIDLAMF----GGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKL 158
            LA  LP D+A+        L+    +++ S  +V++       ++N+IGG EG    + 
Sbjct: 77  DLAHSLPGDVAVVPVRPEAALMIAKGSWIASNGEVEIQTKWGG-MKNLIGG-EGAFLVRA 134

Query: 159 TGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKY--NGPIRRAVFGGDN 216
            GQG   +   G++    L+ GE I +D   +VA   +V   ++    G   ++   G+ 
Sbjct: 135 QGQGQLVLGCYGAIDSYALDPGETIVIDNGHVVAYEETVGHSLRRAVEGKTMQSRKSGEG 194

Query: 217 LVTAVVTGPGIVFIQS 232
           LV    TGPG V  QS
Sbjct: 195 LVYE-FTGPGRVLTQS 209


>gi|383830130|ref|ZP_09985219.1| hypothetical protein SacxiDRAFT_2653 [Saccharomonospora
           xinjiangensis XJ-54]
 gi|383462783|gb|EID54873.1| hypothetical protein SacxiDRAFT_2653 [Saccharomonospora
           xinjiangensis XJ-54]
          Length = 197

 Score = 39.3 bits (90), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 37/139 (26%), Positives = 67/139 (48%), Gaps = 5/139 (3%)

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G++ +A      + P++L+   G  +   DA L     V+  +      +++ GG  GFL
Sbjct: 59  GWIDLAPRQPGDVYPLELSGQLGWSVAS-DAVLARPATVRHDHPWPAH-QSLFGGDSGFL 116

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR-RAVFG 213
           +   +G G     + G V    L+ GEV+TV+   ++A   +V V+++   P   ++V  
Sbjct: 117 KH-FSGVGSLVFASRGPVDALTLKAGEVVTVNPLFVLAFPDTVQVRLRALDPAEPQSVRT 175

Query: 214 GDNLVTAVVTGPGIVFIQS 232
           G+ L   V  GPG V +Q+
Sbjct: 176 GEGLALDVA-GPGTVLVQT 193


>gi|403415219|emb|CCM01919.1| predicted protein [Fibroporia radiculosa]
          Length = 268

 Score = 39.3 bits (90), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 52/208 (25%), Positives = 90/208 (43%), Gaps = 29/208 (13%)

Query: 37  IMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLR-NPGPSDG 95
           + ++P  +++A+PGSM  M  S++++         G   + F K +T      + G S  
Sbjct: 60  MRIQPGYEILAKPGSMVTMDPSVQIK---------GKMNFSFKKLLTGGEASIDLGLS-- 108

Query: 96  FVGIAAPSLARILPIDLAMFGGELLCQPD----------AFLCSVNDVKVSNTLDQRVRN 145
           F+    P    + P    M+G     Q D          AFL     V+++ T  Q    
Sbjct: 109 FMHFWGPGEVVLTP---EMWGDITRIQLDGNSPWHFGKHAFLACTPGVQMT-TKSQSFGK 164

Query: 146 VIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ-IKYN 204
           ++    G    + TG G+ F+   G+++ ++L  GE   V+ + +VA     N++ I+  
Sbjct: 165 MLFSGHGLFVSQATGVGMLFLHGLGTIISRDLRPGEQWIVNNNHLVAWNCQYNMEKIQAG 224

Query: 205 GPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           G +  +    D       TGPG VFIQS
Sbjct: 225 GLL--STLQTDEGAVCRFTGPGTVFIQS 250


>gi|205373536|ref|ZP_03226339.1| hypothetical protein Bcoam_09740 [Bacillus coahuilensis m4-4]
          Length = 61

 Score = 39.3 bits (90), Expect = 2.2,   Method: Composition-based stats.
 Identities = 17/43 (39%), Positives = 26/43 (60%)

Query: 23 IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIY 65
          I + I G + Q  ++ L P+E V+A  GS+  M  S+EME I+
Sbjct: 6  IDYVIHGNDMQFVEVELDPKETVIAEAGSLMMMEDSIEMETIF 48


>gi|357636516|ref|ZP_09134391.1| TIGR00266 family protein [Streptococcus macacae NCTC 11558]
 gi|357584970|gb|EHJ52173.1| TIGR00266 family protein [Streptococcus macacae NCTC 11558]
          Length = 237

 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 47/223 (21%), Positives = 94/223 (42%), Gaps = 10/223 (4%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITS-----VVL 87
           + +I+L+  E    + GSM + + S+ +   +    + VG      G+++TS     +  
Sbjct: 18  LVEIVLETGESAYIQRGSMVYHTPSVTLNTKVNGRGSGVGKLVGAIGRSVTSGESFFITQ 77

Query: 88  RNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVI 147
                 DG + +A     +++ + L     +      AFL      +         R + 
Sbjct: 78  AVSNADDGKLALAPSMPGQVIALKLGE--KQYRLNDGAFLALDGSARYQMKSQSIGRALF 135

Query: 148 GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPI 207
           GG  G       GQG   + + GS+ +  L+  E IT+D + +VA +  +N  I     +
Sbjct: 136 GGQGGLFVMTTEGQGTLLVNSFGSIKKIELQNQE-ITIDNAHVVAWSRDLNYDIHLENGL 194

Query: 208 RRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
            +++  G+ +V     G G V++QSL   + +  +   +T+ N
Sbjct: 195 MQSIGTGEGVVN-TFRGTGEVYVQSLNLRQFAGVLQNFLTNAN 236


>gi|322372055|ref|ZP_08046597.1| hypothetical protein ZOD2009_21187 [Haladaptatus paucihalophilus
           DX253]
 gi|320548477|gb|EFW90149.1| hypothetical protein ZOD2009_21187 [Haladaptatus paucihalophilus
           DX253]
          Length = 220

 Score = 38.9 bits (89), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 46/94 (48%), Gaps = 13/94 (13%)

Query: 148 GGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTSS 196
           GG++G L+QK TG+G   + A G+           V    L+  + I+V+ + ++A   S
Sbjct: 57  GGLKGMLKQKATGEGAVMMKATGTGHLYLADQGKEVQLLELDADDEISVNGNDVLAFEDS 116

Query: 197 VNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFI 230
           V+  IK    I  A  GG  L    ++GPG + I
Sbjct: 117 VSWDIKMMSSIAGATSGG--LFNVYLSGPGYIAI 148


>gi|419707153|ref|ZP_14234650.1| Hypothetical protein PS4_83945 [Streptococcus salivarius PS4]
 gi|383283113|gb|EIC81080.1| Hypothetical protein PS4_83945 [Streptococcus salivarius PS4]
          Length = 231

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 38/163 (23%), Positives = 71/163 (43%), Gaps = 7/163 (4%)

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           IT VV +    ++G++ +A  S  +++P+ L     +      AFL        +     
Sbjct: 70  ITQVVAQ---SNNGYLALAPDSPGQVIPLYLGE--KQYRLNDGAFLALDGTAYYTMEFQS 124

Query: 142 RVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI 201
             + + GG  GF      GQG     A GS+ +  L   EV T+D + +VA + +++  I
Sbjct: 125 VGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYNI 183

Query: 202 KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
                  +++  G+ +V     G G +++QSL     +  + R
Sbjct: 184 HLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225


>gi|323350601|ref|ZP_08086263.1| hypothetical protein HMPREF9398_0311 [Streptococcus sanguinis
           VMC66]
 gi|322123283|gb|EFX94968.1| hypothetical protein HMPREF9398_0311 [Streptococcus sanguinis
           VMC66]
          Length = 231

 Score = 38.9 bits (89), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 36/155 (23%), Positives = 71/155 (45%), Gaps = 8/155 (5%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
           +G++ +A  +  +++P+ L     +      AFL    D     T++++   + + GG  
Sbjct: 79  NGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G       GQG     A GS+ +  L   EV T+D + +VA + S+N  I       +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAV 246
             G+ +V     G G V++QSL     +  +++ +
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSKYI 227


>gi|375102748|ref|ZP_09749011.1| hypothetical protein SaccyDRAFT_4550 [Saccharomonospora cyanea
           NA-134]
 gi|374663480|gb|EHR63358.1| hypothetical protein SaccyDRAFT_4550 [Saccharomonospora cyanea
           NA-134]
          Length = 197

 Score = 38.9 bits (89), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 65/141 (46%), Gaps = 5/141 (3%)

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G++ +A        P++L+   G  +   DA L     V+  +      +++ GG  GFL
Sbjct: 59  GWIDLAPRHPGDTYPLELSGQLGWSVAS-DAVLARPATVRHDHPWPAH-QSLFGGDSGFL 116

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIR-RAVFG 213
           +   +G G       G V    L+ GEV+TV+   ++A   +V V+++   P   +++  
Sbjct: 117 KH-YSGTGPLVFACRGPVDALTLKAGEVVTVNPLFVLAFPDTVQVRLRAIDPAEPQSIRT 175

Query: 214 GDNLVTAVVTGPGIVFIQSLP 234
           G  L   V  GPG V +Q+ P
Sbjct: 176 GKGLALDVA-GPGTVLVQTRP 195


>gi|422850001|ref|ZP_16896677.1| hypothetical protein HMPREF9382_1981 [Streptococcus sanguinis
           SK115]
 gi|422880913|ref|ZP_16927369.1| hypothetical protein HMPREF9389_0338 [Streptococcus sanguinis
           SK355]
 gi|325688889|gb|EGD30897.1| hypothetical protein HMPREF9382_1981 [Streptococcus sanguinis
           SK115]
 gi|332365613|gb|EGJ43372.1| hypothetical protein HMPREF9389_0338 [Streptococcus sanguinis
           SK355]
          Length = 231

 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 36/153 (23%), Positives = 70/153 (45%), Gaps = 8/153 (5%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
           +G++ +A  +  +++P+ L     +      AFL    D     T++++   + + GG  
Sbjct: 79  NGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G       GQG     A GS+ +  L   EV T+D + +VA + S+N  I       +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYNIHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             G+ +V     G G V++QSL     +  +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225


>gi|50085091|ref|YP_046601.1| hypothetical protein ACIAD1953 [Acinetobacter sp. ADP1]
 gi|49531067|emb|CAG68779.1| conserved hypothetical protein [Acinetobacter sp. ADP1]
          Length = 239

 Score = 38.5 bits (88), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 45/216 (20%), Positives = 87/216 (40%), Gaps = 25/216 (11%)

Query: 25  FQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL------- 77
           F ++G       + ++  +K+     +M  M   +E++       + G+ Q L       
Sbjct: 4   FSLVGSPEPFLHVSMRRGDKIYCESNAMVMMESPLELKGRL----QGGLMQSLLRRFAND 59

Query: 78  ---FGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVK 134
              F + I +V     G  D  +  A     +IL +    +    L    AF+ + + V+
Sbjct: 60  ESLFQQHIEAV----RGDGDCLLSAALDGDMQILDVGATQY----LLSDGAFVAAHDSVQ 111

Query: 135 VSNTLDQRVRNVI-GGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
           V   L+  +     G   GFL  + +GQG   +   G++ +  +E  + +T+D   +V  
Sbjct: 112 VKARLNSSLGGAFFGNTGGFLVMETSGQGQVVVSGCGTLFELEVEPNKDVTIDNGHVVCW 171

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVF 229
            S +N  +  +        G  NLV +V +G G+V 
Sbjct: 172 DSRLNYNLSISTSQNSGFLG--NLVNSVTSGEGMVL 205


>gi|449887791|ref|ZP_21787043.1| hypothetical protein SMU104_06916 [Streptococcus mutans SA41]
 gi|449252179|gb|EMC50166.1| hypothetical protein SMU104_06916 [Streptococcus mutans SA41]
          Length = 237

 Score = 38.5 bits (88), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 47/224 (20%), Positives = 95/224 (42%), Gaps = 12/224 (5%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWL--FGKTITS-----VV 86
           + +I L+  E    + GSM + + S+ + N  +     G+ + +   G+++TS     + 
Sbjct: 18  LVEITLETGEFAYIQRGSMVYHTPSVTL-NTKVNGRGSGLGKLVGAIGRSVTSGESFFIT 76

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
                 SDG + +A     +++ ++L     +      AFL      +         R +
Sbjct: 77  QAVSNASDGKLALAPSMPGQVIALELGE--KQYRLNDGAFLALDGSAQYQMKAQSVGRAL 134

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
            GG  G       GQG     + GS+ +  L+  E IT+D + +VA +  +N  I     
Sbjct: 135 FGGQGGLFVMTTEGQGTLLANSFGSIKKIELQNQE-ITIDNAHVVAWSKDLNYDIHLENG 193

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIARAVTSPN 250
             +++  G+ +V     G G +++QSL   + +  +   +T+ N
Sbjct: 194 FMQSIGTGEGVVN-TFRGTGEIYVQSLNLQQFAGVLQGFITNTN 236


>gi|169617449|ref|XP_001802139.1| hypothetical protein SNOG_11904 [Phaeosphaeria nodorum SN15]
 gi|160703411|gb|EAT80948.2| hypothetical protein SNOG_11904 [Phaeosphaeria nodorum SN15]
          Length = 271

 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 43/181 (23%), Positives = 78/181 (43%), Gaps = 14/181 (7%)

Query: 46  VARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLA 105
            A+PG+M  MS ++ ++     + +  + + L G  +T      PG       + AP  A
Sbjct: 87  TAKPGAMIAMSPTITLKG----QIKFSLKKALMGGDMTKSSYTGPGEL-----LLAP--A 135

Query: 106 RILPIDLAMFGG--ELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGL 163
            I  I +   GG  +     DAFL     + VS    Q +   +   EG    K++GQG+
Sbjct: 136 AIGDITIIKLGGHEQWSVGKDAFLACTQGI-VSEYKSQGIGKAMFSGEGLFVYKISGQGI 194

Query: 164 AFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVT 223
            +  + G++++K+L+  E   +D   +VA      ++   +G I   +   + LV     
Sbjct: 195 LWCTSFGAIIRKDLQKDEKYIIDNGHLVAWNCKYVLERVASGGILSNIHAAEGLVYEKCA 254

Query: 224 G 224
           G
Sbjct: 255 G 255


>gi|398347134|ref|ZP_10531837.1| hypothetical protein Lbro5_07894 [Leptospira broomii str. 5399]
          Length = 223

 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 32/98 (32%), Positives = 49/98 (50%), Gaps = 9/98 (9%)

Query: 139 LDQRVRNVIGGIEG----FLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVT 194
           +D + + + G   G    FLR  L+G+G   I + G +  + LEV     VD   IVA  
Sbjct: 112 MDTKFQGMKGFFSGESLFFLR--LSGKGTLLIASYGGI--ELLEVDGDFIVDTGHIVAFE 167

Query: 195 SSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             +N +I   G  +   FGG+ LV A  +G G ++IQ+
Sbjct: 168 EGLNYKITKFGGWKSFFFGGEGLV-ARFSGKGKLWIQT 204


>gi|448733514|ref|ZP_21715758.1| hypothetical protein C450_09578 [Halococcus salifodinae DSM 8989]
 gi|445802751|gb|EMA53054.1| hypothetical protein C450_09578 [Halococcus salifodinae DSM 8989]
          Length = 220

 Score = 38.5 bits (88), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 30/100 (30%), Positives = 47/100 (47%), Gaps = 13/100 (13%)

Query: 144 RNVIGGIEGFLRQKLTGQG-----------LAFILAGGSVVQKNLEVGEVITVDVSCIVA 192
           R   GG++G L++K++G+G           L    AG  V    L+  + I+V+ + ++A
Sbjct: 53  RKSSGGLKGMLKKKVSGEGGVMMRAEGNGHLYLADAGKEVQILELDADDEISVNGNDVLA 112

Query: 193 VTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
              SVN  IK    I  A  GG  L    + GPG + I +
Sbjct: 113 FEESVNWDIKMMNSIAGASSGG--LFNVYLKGPGHIAITT 150


>gi|422879786|ref|ZP_16926251.1| hypothetical protein HMPREF9396_1638 [Streptococcus sanguinis
           SK1059]
 gi|422929630|ref|ZP_16962571.1| hypothetical protein HMPREF8573_1626 [Streptococcus sanguinis ATCC
           29667]
 gi|422932598|ref|ZP_16965529.1| hypothetical protein HMPREF9387_2177 [Streptococcus sanguinis
           SK340]
 gi|332365197|gb|EGJ42960.1| hypothetical protein HMPREF9396_1638 [Streptococcus sanguinis
           SK1059]
 gi|339614532|gb|EGQ19227.1| hypothetical protein HMPREF8573_1626 [Streptococcus sanguinis ATCC
           29667]
 gi|339618349|gb|EGQ22947.1| hypothetical protein HMPREF9387_2177 [Streptococcus sanguinis
           SK340]
          Length = 231

 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 35/153 (22%), Positives = 70/153 (45%), Gaps = 8/153 (5%)

Query: 94  DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNVIGGIE 151
           +G++ +A  +  +++P+ L     +      AFL    D     T++++   + + GG  
Sbjct: 79  NGYLALAPDAPGQVIPLQLGE--KQYRLNDGAFLAL--DGTAYYTMERQSVGKALFGGQG 134

Query: 152 GFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAV 211
           G       GQG     A GS+ +  L   EV T+D + +VA + S+N  +       +++
Sbjct: 135 GLFVMTTQGQGTLLANAFGSIKKIELHNQEV-TIDNAHVVAWSQSLNYHVHLENGFWQSI 193

Query: 212 FGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
             G+ +V     G G V++QSL     +  +++
Sbjct: 194 GTGEGVVN-TFQGTGEVYVQSLNLQTFAGSLSK 225


>gi|448612787|ref|ZP_21662667.1| hypothetical protein C440_12789 [Haloferax mucosum ATCC BAA-1512]
 gi|445739684|gb|ELZ91190.1| hypothetical protein C440_12789 [Haloferax mucosum ATCC BAA-1512]
          Length = 220

 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 40/157 (25%), Positives = 68/157 (43%), Gaps = 28/157 (17%)

Query: 94  DGFVGIAAPS-------LARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
           D FV   AP+       L     +D+A+  G ++ +  + +    D+          R  
Sbjct: 4   DEFVDTHAPNEGGDAFELEHSKLLDVAL-DGTIMAKAGSMVGYTGDISFE-------RKS 55

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSCIVAVTS 195
            GG++G L++K TG+G   + A GS           V    L+ GE I+V+ + ++A  +
Sbjct: 56  AGGLKGMLKKKATGEGTVMMEATGSGHLYLADQGKEVQILELDAGEEISVNGNDVLAFEN 115

Query: 196 SVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           ++   I+    I  A  GG  L    + GPG V I +
Sbjct: 116 TIEWDIRMMKSIAGASTGG--LFNVFLEGPGHVAITT 150


>gi|302421758|ref|XP_003008709.1| DUF124 domain-containing protein [Verticillium albo-atrum VaMs.102]
 gi|261351855|gb|EEY14283.1| DUF124 domain-containing protein [Verticillium albo-atrum VaMs.102]
          Length = 469

 Score = 38.5 bits (88), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 33/109 (30%), Positives = 57/109 (52%), Gaps = 2/109 (1%)

Query: 124 DAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVI 183
           DA++ S   V + +   Q +   I   EG    K++G GL +I + G++++K+L  GE  
Sbjct: 346 DAYVASTQGV-ICDYKRQGLGKAIFSGEGLFVYKISGTGLLWITSFGAIIRKDLIPGEKY 404

Query: 184 TVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
            VD   +VA  +   ++   +G I   +  G+ LV    TGPG V++Q+
Sbjct: 405 IVDNGHLVAWNTKYIIERVASGGIISGISSGEGLVCK-FTGPGTVYMQT 452


>gi|398345153|ref|ZP_10529856.1| hypothetical protein LinasL1_19342 [Leptospira inadai serovar Lyme
           str. 10]
          Length = 223

 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 30/90 (33%), Positives = 45/90 (50%), Gaps = 9/90 (10%)

Query: 149 GIEGFLRQ------KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIK 202
           G++GF         KL+G+G   I + G +  + LEV     VD   IVA    +N +I 
Sbjct: 118 GMKGFFSGESLFFLKLSGKGTLLIASYGGI--ELLEVDGDFIVDTGHIVAFEEGLNYKIA 175

Query: 203 YNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             G  +   FGG+ LV A  +G G ++IQ+
Sbjct: 176 KFGGWKSFFFGGEGLV-ARFSGKGKLWIQT 204


>gi|384567618|ref|ZP_10014722.1| hypothetical protein SacglDRAFT_03821 [Saccharomonospora glauca
           K62]
 gi|384523472|gb|EIF00668.1| hypothetical protein SacglDRAFT_03821 [Saccharomonospora glauca
           K62]
          Length = 197

 Score = 38.5 bits (88), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 35/140 (25%), Positives = 61/140 (43%), Gaps = 3/140 (2%)

Query: 95  GFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFL 154
           G++ +A        P++L+   G  +   DA L     V+  +      + + GG  GFL
Sbjct: 59  GWIDLAPRHPGDAYPLELSGQLGWSVAS-DAVLARPATVRHDHPWPAH-QTLFGGDSGFL 116

Query: 155 RQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGG 214
           +   +G G       G V   +L+ GEV+TV+   ++A   +V V+++   P        
Sbjct: 117 KH-YSGTGPLVFACRGPVDALSLKAGEVVTVNPLFVLAFPDTVQVRLRALDPAEPQSIKT 175

Query: 215 DNLVTAVVTGPGIVFIQSLP 234
              +   V GPG V +Q+ P
Sbjct: 176 GTGLALDVAGPGTVLVQTRP 195


>gi|422847659|ref|ZP_16894342.1| hypothetical protein HMPREF9381_2001 [Streptococcus sanguinis SK72]
 gi|325686657|gb|EGD28683.1| hypothetical protein HMPREF9381_2001 [Streptococcus sanguinis SK72]
          Length = 238

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 93/213 (43%), Gaps = 23/213 (10%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPS 93
           + +I L   E V  + GSM + +      N+ +        Q +FGK I +V  R+    
Sbjct: 18  LAEIALDASESVRIQTGSMIYHT-----PNVVLNAKVNAESQSIFGKAIQAVG-RSMASG 71

Query: 94  DG-FVGIAAP-------SLARILP---IDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
           +G F+  A         +LA  +P   + L +   +      AFL    D   S TL+++
Sbjct: 72  EGVFITEATAQSNNGRLALAPNVPGQIVALELGEKQYRLNDGAFLAM--DGSASYTLERQ 129

Query: 143 V--RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
              R + GG  GF      GQG   + + GS+ +  L   + IT+D + +VA +  +   
Sbjct: 130 SVGRALFGGQGGFYVMSTQGQGTLLVNSFGSIQKVELH-NDRITIDNAHVVAWSRELEYD 188

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSL 233
           I     + +++  G+ +V     G G ++IQSL
Sbjct: 189 IHLENGLWQSMGTGEGVVNTFY-GTGEIYIQSL 220


>gi|387784941|ref|YP_006071024.1| hypothetical protein SALIVA_1894 [Streptococcus salivarius JIM8777]
 gi|338745823|emb|CCB96189.1| uncharacterized conserved protein [Streptococcus salivarius
           JIM8777]
          Length = 231

 Score = 38.5 bits (88), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 38/164 (23%), Positives = 74/164 (45%), Gaps = 9/164 (5%)

Query: 82  ITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQ 141
           IT VV +    ++G++ +A  S  +++P+ L   G +     D    +++         Q
Sbjct: 70  ITQVVSQ---SNNGYIALAPDSPGQVIPLYL---GEKQYRLNDGAFLALDGTAYYTMESQ 123

Query: 142 RV-RNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
            + + + GG  GF      GQG     A GS+ +  L   EV T+D + +VA + +++  
Sbjct: 124 SIGKALFGGQGGFFVMTTQGQGTLLANAYGSIKKIELNNQEV-TIDNAHVVAWSQTLDYD 182

Query: 201 IKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQSLPFHRLSQRIAR 244
           I       +++  G+ +V     G G +++QSL     +  + R
Sbjct: 183 IHLENGFWQSIGTGEGVVN-TFRGTGEIYVQSLNLQTFAGLLNR 225


>gi|336320650|ref|YP_004600618.1| hypothetical protein Celgi_1531 [[Cellvibrio] gilvus ATCC 13127]
 gi|336104231|gb|AEI12050.1| protein of unknown function DUF124 [[Cellvibrio] gilvus ATCC 13127]
          Length = 213

 Score = 38.5 bits (88), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 52/220 (23%), Positives = 96/220 (43%), Gaps = 22/220 (10%)

Query: 23  IPFQILGGEAQVCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVG--MWQWLFGK 80
           +PF+ +   ++V ++ +  Q  V+AR G+M   +G +    I+     VG  +   + G+
Sbjct: 1   MPFEKV--NSKVVRVPVTAQSPVLARRGAMLGYTGQVAFRPIHGQGQGVGGMIGTAMAGE 58

Query: 81  TITSVVLRNPGPS-DGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVS--- 136
           +   +     G    GF G+       +  I+L   G  L  + D  L    +++ S   
Sbjct: 59  SNPMMATEGTGSVLYGFRGL------HVTVIELT--GDSLTVEADRLLAHDANLQTSVEF 110

Query: 137 ---NTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAV 193
                +   VR  + G +G    +++GQG   +L+ G      L  GE + VD    V  
Sbjct: 111 IGQGGIRSAVRGAMTG-QGLFTTRISGQGSVALLSHGGTFPLQL-TGETVGVDPQAYVGH 168

Query: 194 TSSVNVQIKYNGPIRRAVFGGDNLVTAV-VTGPGIVFIQS 232
           T ++NV +K +  +R  V  G      + V+G G V++Q+
Sbjct: 169 TGALNVDLKASVGLRDIVGRGSGEAFQLHVSGHGTVYVQA 208


>gi|397645845|gb|EJK77010.1| hypothetical protein THAOC_01190 [Thalassiosira oceanica]
          Length = 439

 Score = 38.5 bits (88), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 70/159 (44%), Gaps = 18/159 (11%)

Query: 83  TSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQR 142
           T+ V     P D  V   APS    + +   + G EL     A+L     V    T++  
Sbjct: 219 TTEVRSTAAPGDVLV---APSDPGGVALHRMVRGEELTLTSGAYLAGDGGV----TVNSD 271

Query: 143 VRNVIGGIEGFLRQ-----KLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSV 197
           ++N   G   F        + +GQG   + A GS+++  L+ GE   VD   +VA ++++
Sbjct: 272 MQNPFSGFTNFSGTGVFLLRASGQGNLALSAYGSIIKYTLQPGEKRAVDNGHLVAWSATM 331

Query: 198 NVQIKYNGPIRRAVFG----GDNLVTAVVTGPGIVFIQS 232
             Q+      +  VFG    G+ L      GPGIV+IQS
Sbjct: 332 RTQMVL-ASRKAGVFGSLTSGEGLHVE-FQGPGIVYIQS 368


>gi|365960834|ref|YP_004942401.1| hypothetical protein FCOL_08995 [Flavobacterium columnare ATCC
           49512]
 gi|365737515|gb|AEW86608.1| hypothetical protein FCOL_08995 [Flavobacterium columnare ATCC
           49512]
          Length = 227

 Score = 38.5 bits (88), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 49/111 (44%), Gaps = 4/111 (3%)

Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
           Q  AFL S  ++ + +      +    G E     K +G+G  +  + G ++  N+E G 
Sbjct: 101 QNSAFLASSENITIDSKFQGLTKGFFSG-ENLFLIKCSGEGDLWFNSYGGIIPINVEDGY 159

Query: 182 VITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           VI  D   IVA T  +   +   G  +   F G+ LV     G G ++IQ+
Sbjct: 160 VI--DTGHIVAFTEGLTYDVTRVGGYKSLFFSGEGLV-CRFQGKGRIWIQT 207


>gi|262281812|ref|ZP_06059581.1| conserved hypothetical protein [Streptococcus sp. 2_1_36FAA]
 gi|262262266|gb|EEY80963.1| conserved hypothetical protein [Streptococcus sp. 2_1_36FAA]
          Length = 236

 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 47/207 (22%), Positives = 95/207 (45%), Gaps = 14/207 (6%)

Query: 35  CQIMLKPQEKVVARPGSMCFMSGSMEME-NIYIPENEVGMWQWLFGKTITS----VVLRN 89
            +I+L   EK   + GSM + S S+++   +    + +G      G+++ S     +   
Sbjct: 19  VEIVLDKGEKAYIQRGSMVYHSSSVKLRTKLNAKGSGLGKLVKAVGRSMVSGESTFITEA 78

Query: 90  PGPSDGFVGIAAPSL-ARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRV--RNV 146
              +DG +   AP++  +++P+ L     +      AFL    D     T++++   R +
Sbjct: 79  LSDADGGLLALAPNVPGQVMPLYLGEH--QYRLNDGAFLAL--DGSAFYTMERQSIGRAL 134

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
            GG  GF      G+G   + A G++ +  L+  E +T+D + +VA +  ++  I     
Sbjct: 135 FGGQGGFFVMTTQGEGTLLVNAFGAIKKIELKDQE-MTIDNAHVVAWSKDLHYDIHLENG 193

Query: 207 IRRAVFGGDNLVTAVVTGPGIVFIQSL 233
             ++V  G+ +V     G G +++QSL
Sbjct: 194 FLQSVGTGEGVVN-TFRGTGEIYVQSL 219


>gi|170077980|ref|YP_001734618.1| hypothetical protein SYNPCC7002_A1366 [Synechococcus sp. PCC 7002]
 gi|169885649|gb|ACA99362.1| conserved hypothetical protein TIGR00266 [Synechococcus sp. PCC
           7002]
          Length = 225

 Score = 38.1 bits (87), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 52/204 (25%), Positives = 92/204 (45%), Gaps = 13/204 (6%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVVLRN 89
           +  + L   E+++A+ G+M  MSG    S  +          G+ + L G+++   V R 
Sbjct: 13  IAHVTLDAGEEIIAQAGAMVAMSGNINASTTLRKGKGGGIMGGLKRMLAGESLFLSVFRA 72

Query: 90  PGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNVIGG 149
           P P+       AP L   L +   M G E + Q  ++L S  +V +     Q  ++   G
Sbjct: 73  PLPNSEI--WFAPKLMGDLLL-YEMRGDEFVVQASSYLASGPNVDIDLGW-QGFKSFFSG 128

Query: 150 IEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQI-KYNGPIR 208
            E      ++GQG   + + G++ + +++ GE + VD   IVA   S++  + K N    
Sbjct: 129 -ESIFWLSISGQGPLLVTSFGAIYEVDVD-GEYV-VDTGHIVAFEKSLSFTVGKANPSWI 185

Query: 209 RAVFGGDNLVTAVVTGPGIVFIQS 232
            A  GG+ LV     G G ++ Q+
Sbjct: 186 GAFLGGEGLVCR-FRGKGKLYCQT 208


>gi|456986310|gb|EMG21906.1| TIGR00266-like family protein [Leptospira interrogans serovar
           Copenhageni str. LT2050]
          Length = 124

 Score = 38.1 bits (87), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 34/112 (30%), Positives = 52/112 (46%), Gaps = 6/112 (5%)

Query: 122 QPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGE 181
           +  A+L S   + +       +R+  GG EG    +  G G  F+ A G ++   ++V  
Sbjct: 2   ESGAYLASDETISIKPMFGG-IRSFFGG-EGIFLLEAFGNGKLFLNAYGGIIP--IDVQG 57

Query: 182 VITVDVSCIVAVTSSVNVQI-KYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
             T+D   IVA   S+  +I K  G  +   FGG+ LV    TG G V IQ+
Sbjct: 58  SYTIDTGHIVAFDKSLQYKIAKAGGSWKSTFFGGEGLVMEF-TGHGRVLIQT 108


>gi|332707811|ref|ZP_08427838.1| conserved hypothetical protein TIGR00266 [Moorea producens 3L]
 gi|332353514|gb|EGJ33027.1| conserved hypothetical protein TIGR00266 [Moorea producens 3L]
          Length = 225

 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 54/207 (26%), Positives = 94/207 (45%), Gaps = 13/207 (6%)

Query: 31  EAQVCQIMLKPQEKVVARPGSMCFMSG----SMEMENIYIPENEVGMWQWLFGKTITSVV 86
           ++ + +++L   ++ VA  G M  MSG    S  +          G+ + + G+++   V
Sbjct: 10  DSAIARVVLDENQEFVAEAGCMIAMSGYINASTTLRQGKGGGIFGGLKRLVGGESLFLSV 69

Query: 87  LRNPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPDAFLCSVNDVKVSNTLDQRVRNV 146
            R+P    G + +A   +  IL     M G EL+ Q  ++L S + V +     Q  +++
Sbjct: 70  FRSP-VGGGEIFLAPKFIGDILVYQ--MNGQELVVQATSYLASESGVDIDLGF-QGFKSL 125

Query: 147 IGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQIKYNGP 206
             G E      ++G G   + + G++ +  +E GE I VD   IVA   S+N +I   G 
Sbjct: 126 FSG-ESIFWLTISGYGSVLLNSFGAIYEVPVE-GEYI-VDTGHIVAFEKSLNFEITKPGS 182

Query: 207 IRRAVF-GGDNLVTAVVTGPGIVFIQS 232
                F GG+ LV     G G V+ Q+
Sbjct: 183 SWIGAFLGGEGLVCR-FKGQGKVYCQT 208


>gi|296141808|ref|YP_003649051.1| hypothetical protein [Tsukamurella paurometabola DSM 20162]
 gi|296029942|gb|ADG80712.1| protein of unknown function DUF124 [Tsukamurella paurometabola DSM
           20162]
          Length = 226

 Score = 38.1 bits (87), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 50/214 (23%), Positives = 91/214 (42%), Gaps = 32/214 (14%)

Query: 34  VCQIMLKPQEKVVARPGSMCFMSGSMEMENIYIPENEVGMWQ-----WLFGKTITSVVLR 88
           + +  L P E +    G+M   S  M +++    + + G+ Q      L G++       
Sbjct: 13  MARCHLAPNEPMTVESGAMVAHSAGMNLQS----QAQGGIMQGLKRSLLSGESFFVSTFT 68

Query: 89  NPGPSDGFVGIAAPSLARILPIDLAMFGGELLCQPD--------AFLCSVNDVKVSNTLD 140
            P  S G+V +A       LP D+A+    +  +PD         +L S + V+ S    
Sbjct: 69  AP-ASGGWVDVAG-----TLPGDIAL----VQIRPDRPFYITRGGWLASAHGVQTSAQFG 118

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGSVVQKNLEVGEVITVDVSCIVAVTSSVNVQ 200
              +   GG EG    + +GQG   +   G++   +L+ GE + +D   +VA   S+   
Sbjct: 119 G-AKTFFGG-EGAFGLQASGQGEVLVACYGAIDVIDLQPGEQVVIDTGHVVAYDLSIRFS 176

Query: 201 IKY--NGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           ++    G   ++   G+ +V    TGPG V +Q+
Sbjct: 177 MRRAVQGKWMQSAKSGEGMVFE-FTGPGRVLLQT 209


>gi|433589647|ref|YP_007279143.1| hypothetical protein Natpe_0305 [Natrinema pellirubrum DSM 15624]
 gi|448332759|ref|ZP_21521986.1| hypothetical protein C488_05282 [Natrinema pellirubrum DSM 15624]
 gi|448379084|ref|ZP_21561048.1| hypothetical protein C478_01650 [Haloterrigena thermotolerans DSM
           11522]
 gi|433304427|gb|AGB30239.1| hypothetical protein Natpe_0305 [Natrinema pellirubrum DSM 15624]
 gi|445625372|gb|ELY78733.1| hypothetical protein C488_05282 [Natrinema pellirubrum DSM 15624]
 gi|445665646|gb|ELZ18322.1| hypothetical protein C478_01650 [Haloterrigena thermotolerans DSM
           11522]
          Length = 221

 Score = 38.1 bits (87), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 29/103 (28%), Positives = 50/103 (48%), Gaps = 13/103 (12%)

Query: 141 QRVRNVIGGIEGFLRQKLTGQGLAFILAGGS-----------VVQKNLEVGEVITVDVSC 189
           + + +  GGI GFL++K TG+G   + A G+           +    L+ G+ I+V+ + 
Sbjct: 52  EGISSAEGGITGFLKEKATGEGTPVMEATGTGHLYLADQEKKIQLLELDAGQSISVNGND 111

Query: 190 IVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIVFIQS 232
           ++A   SV+ +I+    I  A F    L    + GPG V I +
Sbjct: 112 VLAFEPSVDYEIRTVESI--AGFSAGGLTNVFLEGPGTVAITT 152


>gi|325681067|ref|ZP_08160599.1| hypothetical protein CUS_5721 [Ruminococcus albus 8]
 gi|324107296|gb|EGC01580.1| hypothetical protein CUS_5721 [Ruminococcus albus 8]
          Length = 251

 Score = 37.7 bits (86), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 58/125 (46%), Gaps = 8/125 (6%)

Query: 116 GGELLCQPDAFLCSVNDVKVSNTLDQR---VRNVIGGI---EGFLRQKLTGQGLAFIL-A 168
           G  ++ Q  A      DV+V+  +       + +IGG    E  ++ + TGQG  ++   
Sbjct: 60  GNGIILQAGAMQVMTGDVQVATNVKGAGDFFKKMIGGSVTGESAIKPRYTGQGFVYLEPT 119

Query: 169 GGSVVQKNLEV-GEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGI 227
              ++ ++L+  G  +T+D    +A   SVNV +     +  A  GG+ L    + G GI
Sbjct: 120 YKYIILEDLDTWGGAVTIDDGMFLACDDSVNVGVVARSNLSSAALGGEGLFNTSLNGKGI 179

Query: 228 VFIQS 232
           V ++S
Sbjct: 180 VALES 184


>gi|317498198|ref|ZP_07956499.1| hypothetical protein HMPREF0996_01480 [Lachnospiraceae bacterium
           5_1_63FAA]
 gi|316894553|gb|EFV16734.1| hypothetical protein HMPREF0996_01480 [Lachnospiraceae bacterium
           5_1_63FAA]
          Length = 696

 Score = 37.7 bits (86), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 52/233 (22%), Positives = 100/233 (42%), Gaps = 25/233 (10%)

Query: 7   STPFQPYVYQS-----PQDAVIPFQILGGEAQVCQIMLK---PQEKVVARPGSMCFMSGS 58
           S PF+   YQ      P+DA + F     +  V +  LK    +  V  + G+M +M+G 
Sbjct: 441 SGPFEVIEYQRDLSVMPEDAQLAF--FCSQMNVRKRQLKCELSRGNVTIQSGAMQWMAGD 498

Query: 59  MEMENIYIPENEVGMWQWLFGKTITSVVLRNPGPSDGFVGIAAPSLARILPIDLAMFGGE 118
           + +        +  + + + GK     V++     DG + +  P+   I+ +DL  +G  
Sbjct: 499 VSVTTGIKGAGDF-LSKTIRGKVTGESVIKPEYTGDGTL-VLEPTYKHIILLDLDEWGNS 556

Query: 119 LLCQPDAFLCSVNDVKVSNTLDQRVRNVIGGIEGFLRQKLTGQGLAFILAG---GSVVQK 175
           ++     FL   + +K      +   +   G EGF    L G G+  I +      +V+ 
Sbjct: 557 IVLDDGLFLACESTLKQKAVRRKTFSSAFAGGEGFFNLGLKGSGVVCIESDCPREELVEI 616

Query: 176 NLEVGEVITVDVSCIVAVTSSVNVQIKYNGPIRRAVFGGDNLVTAVVTGPGIV 228
            LE  +V+ +D +  +A + S++        + RA   G +L+ +  +G G+V
Sbjct: 617 TLE-DDVVKIDGNLAIAWSGSLDF------TVERA---GKSLLGSAASGEGLV 659


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.325    0.140    0.417 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,317,950,768
Number of Sequences: 23463169
Number of extensions: 175345379
Number of successful extensions: 490460
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 682
Number of HSP's successfully gapped in prelim test: 476
Number of HSP's that attempted gapping in prelim test: 488090
Number of HSP's gapped (non-prelim): 1256
length of query: 281
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 140
effective length of database: 9,050,888,538
effective search space: 1267124395320
effective search space used: 1267124395320
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 76 (33.9 bits)